Basic Information

Gene Symbol
ewg
Assembly
GCA_963971205.1
Location
OZ020274.1:1476837-1497471[-]

Transcription Factor Domain

TF Family
Nrf1
Domain
Nrf1_DNA-bind domain
PFAM
PF10491
TF Group
Basic Domians group
Description
In Drosophila, the erect wing (ewg) protein is required for proper development of the central nervous system and the indirect flight muscles. The fly ewg gene encodes a novel DNA-binding domain that is also found in four genes previously identified in sea urchin, chicken, zebrafish, and human [1]. Nuclear respiratory factor-1 is a transcriptional activator that has been implicated in the nuclear control of respiratory chain expression in vertebrates. The first 26 amino acids of nuclear respiratory factor-1 are required for the binding of dynein light chain. The interaction with dynein light chain is observed for both ewg and Nrf-1, transcription factors that are structurally and functionally similar between humans and Drosophila [2]. The highest level of expression of both ewg and Nrf-1 was found in the central nervous system, somites, first branchial arch, optic vesicle, and otic vesicle. In the mouse Nrf-1 protein, Swiss-Prot:Q8C4C0 there is also an NLS domain at 88-116, and a DNA binding and dimerisation domain at 127-282. Ewg is a site-specific transcriptional activator, and evolutionarily conserved regions of ewg contribute both positively and negatively to transcriptional activity [3].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 29 1.4e-50 5.4e-47 160.9 0.3 1 102 79 180 79 184 0.98
2 29 1.2e-12 4.7e-09 36.9 0.0 61 102 187 228 185 232 0.94
3 29 1.2e-12 4.7e-09 36.9 0.0 61 102 235 276 233 280 0.94
4 29 1.2e-12 4.7e-09 36.9 0.0 61 102 283 324 281 328 0.94
5 29 1.2e-12 4.7e-09 36.9 0.0 61 102 331 372 329 376 0.94
6 29 1.2e-12 4.7e-09 36.9 0.0 61 102 379 420 377 424 0.94
7 29 1.2e-12 4.7e-09 36.9 0.0 61 102 427 468 425 472 0.94
8 29 1.2e-12 4.7e-09 36.9 0.0 61 102 475 516 473 520 0.94
9 29 1.2e-12 4.7e-09 36.9 0.0 61 102 523 564 521 568 0.94
10 29 1.2e-12 4.7e-09 36.9 0.0 61 102 571 612 569 616 0.94
11 29 1.2e-12 4.7e-09 36.9 0.0 61 102 619 660 617 664 0.94
12 29 1.2e-12 4.7e-09 36.9 0.0 61 102 667 708 665 712 0.94
13 29 1.2e-12 4.7e-09 36.9 0.0 61 102 715 756 713 760 0.94
14 29 1.2e-12 4.7e-09 36.9 0.0 61 102 763 804 761 808 0.94
15 29 1.2e-12 4.7e-09 36.9 0.0 61 102 811 852 809 856 0.94
16 29 1.2e-12 4.7e-09 36.9 0.0 61 102 859 900 857 904 0.94
17 29 1.2e-12 4.7e-09 36.9 0.0 61 102 907 948 905 952 0.94
18 29 1.2e-12 4.7e-09 36.9 0.0 61 102 955 996 953 1000 0.94
19 29 1.2e-12 4.7e-09 36.9 0.0 61 102 1003 1044 1001 1048 0.94
20 29 1.2e-12 4.7e-09 36.9 0.0 61 102 1051 1092 1049 1096 0.94
21 29 1.2e-12 4.7e-09 36.9 0.0 61 102 1099 1140 1097 1144 0.94
22 29 1.2e-12 4.7e-09 36.9 0.0 61 102 1147 1188 1145 1192 0.94
23 29 1.2e-12 4.7e-09 36.9 0.0 61 102 1195 1236 1193 1240 0.94
24 29 1.2e-12 4.7e-09 36.9 0.0 61 102 1243 1284 1241 1288 0.94
25 29 1.2e-12 4.7e-09 36.9 0.0 61 102 1291 1332 1289 1336 0.94
26 29 1.2e-12 4.7e-09 36.9 0.0 61 102 1339 1380 1337 1384 0.94
27 29 1.2e-12 4.7e-09 36.9 0.0 61 102 1387 1428 1385 1432 0.94
28 29 1.2e-12 4.7e-09 36.9 0.0 61 102 1435 1476 1433 1480 0.94
29 29 5.7e-69 2.3e-65 220.9 0.0 61 212 1483 1640 1481 1641 0.93

Sequence Information

Coding Sequence
ATGGTCCTTGAGAAGGACCGGGACGAAGACTACATCATGAACTCCGCAGTCAGCACTGAATCTATGGATATGGCGGAAGAGGACATGTCCCAGGTGGACGGGTGTGGTCTCAGCGGCGGGTCGGAGGACGAGGATGAGTGTGCGTCCTCGCCGGCCGGCTCAGCCTACGACGACGGCGCAGACCTAATCAAGAACGCGCTCAGCGACGAGGTCACCAAGCAGCTGGCTGCCGCAGGTCCGGTCGGCATGGCCGCGGCGGCCGCAATAGCTTCCTCGAAGAAACGCAAGAGGCCGCATTCTTTTGAAACCAACCCATCAGTCAGAAAACGTCACCAGAACAGACTGCTCAGGAAACTCAGGCAAACCATCGAAGAGTTCGCGACGCGCGTCGGCCAGCAGGCGGTGGTGCTGGTGGCCACTCCGGGGAAGCCGAACACGGCCTATCGCGTGTTCGGAGCCAAGCCGCTGGAGGACGTCGTGCGCAACCTGCGCTGCATGATCATGGAGGAGCTGGAGAACGCGCTGGCGCAGCAGGTGGGTGCCCACGGCTGGCCAGTGGCCACTCCGGGGAAGCCGAACACGGCCTATCGCGTGTTCGGAGCCAAGCCGCTGGAGGACGTCGTGCGCAACCTGCGCTGCATGATCATGGAGGAGCTGGAGAACGCGCTGGCGCAGCAGGTGGGTGCCCACGGCTGGCCAGTGGCCACTCCGGGGAAGCCGAACACGGCCTACCGCGTGTTCGGAGCCAAGCCGCTGGAGGACGTCGTGCGCAACCTGCGCTGCATGATCATGGAGGAGCTGGAGAACGCGCTGGCGCAGCAGGTGGGTGCCCACGGCTGGCCAGTGGCCACTCCGGGGAAGCCGAACACGGCCTATCGCGTGTTCGGAGCCAAGCCGCTGGAGGACGTCGTGCGCAACCTGCGCTGCATGATCATGGAGGAGCTGGAGAACGCGCTGGCGCAGCAGGTGGGTGCCCACGGCTGGCCAGTGGCCACTCCGGGGAAGCCGAACACGGCCTACCGCGTGTTCGGAGCCAAGCCGCTGGAGGACGTCGTGCGCAACCTGCGCTGCATGATCATGGAGGAGCTGGAGAACGCGCTGGCGCAGCAGGTGGGTGCCCACGGCTGGCCAGTGGCCACTCCGGGGAAGCCGAACACGGCCTACCGCGTGTTCGGAGCCAAGCCGCTGGAGGACGTCGTGCGCAACCTGCGCTGCATGATCATGGAGGAGCTGGAGAACGCGCTGGCGCAGCAGGTGGGTGCCCACGGCTGGCCAGTGGCCACTCCGGGGAAGCCGAACACGGCCTACCGCGTGTTCGGAGCCAAGCCGCTGGAGGACGTCGTGCGCAACCTGCGCTGCATGATCATGGAGGAGCTGGAGAACGCGCTGGCGCAGCAGGTGGGTGCCCACGGCTGGCCAGTGGCCACTCCGGGGAAGCCGAACACGGCCTACCGCGTGTTCGGAGCCAAGCCGCTGGAGGACGTCGTGCGCAACCTGCGCTGCATGATCATGGAGGAGCTGGAGAACGCGCTGGCGCAGCAGGTGGGTGCCCACGGCTGGCCAGTGGCCACTCCGGGGAAGCCGAACACGGCCTACCGCGTGTTCGGAGCCAAGCCGCTGGAGGACGTCGTGCGCAACCTGCGCTGCATGATCATGGAGGAGCTGGAGAACGCGCTGGCGCAGCAGGTGGGTGCCCACGGCTGGCCAGTGGCCACTCCGGGGAAGCCGAACACGGCCTACCGCGTGTTCGGAGCCAAGCCGCTGGAGGACGTCGTGCGCAACCTGCGCTGCATGATCATGGAGGAGCTGGAGAACGCGCTGGCGCAGCAGGTGGGTGCCCACGGCTGGCCAGTGGCCACTCCGGGGAAGCCGAACACGGCCTACCGCGTGTTCGGAGCCAAGCCGCTGGAGGACGTCGTGCGCAACCTGCGCTGCATGATCATGGAGGAGCTGGAGAACGCGCTGGCGCAGCAGGTGGGTGCCCACGGCTGGCCAGTGGCCACTCCGGGGAAGCCGAACACGGCCTACCGCGTGTTCGGAGCCAAGCCGCTGGAGGACGTCGTGCGCAACCTGCGCTGCATGATCATGGAGGAGCTGGAGAACGCGCTGGCGCAGCAGGTGGGTGCCCACGGCTGGCCAGTGGCCACTCCGGGGAAGCCGAACACGGCCTACCGCGTGTTCGGAGCCAAGCCGCTGGAGGACGTCGTGCGCAACCTGCGCTGCATGATCATGGAGGAGCTGGAGAACGCGCTGGCGCAGCAGGTGGGTGCCCACGGCTGGCCAGTGGCCACTCCGGGGAAGCCGAACACGGCCTACCGCGTGTTCGGAGCCAAGCCGCTGGAGGACGTCGTGCGCAACCTGCGCTGCATGATCATGGAGGAGCTGGAGAACGCGCTGGCGCAGCAGGTGGGTGCCCACGGCTGGCCAGTGGCCACTCCGGGGAAGCCGAACACGGCCTACCGCGTGTTCGGAGCCAAGCCGCTGGAGGACGTCGTGCGCAACCTGCGCTGCATGATCATGGAGGAGCTGGAGAACGCGCTGGCGCAGCAGGTGGGTGCCCACGGCTGGCCAGTGGCCACTCCGGGGAAGCCGAACACGGCCTACCGCGTGTTCGGAGCCAAGCCGCTGGAGGACGTCGTGCGCAACCTGCGCTGCATGATCATGGAGGAGCTGGAGAACGCGCTGGCGCAGCAGGTGGGTGCCCACGGCTGGCCAGTGGCCACTCCGGGGAAGCCGAACACGGCCTACCGCGTGTTCGGAGCCAAGCCGCTGGAGGACGTCGTGCGCAACCTGCGCTGCATGATCATGGAGGAGCTGGAGAACGCGCTGGCGCAGCAGGTGGGTGCCCACGGCTGGCCAGTGGCCACTCCGGGGAAGCCGAACACGGCCTACCGCGTGTTCGGAGCCAAGCCGCTGGAGGACGTCGTGCGCAACCTGCGCTGCATGATCATGGAGGAGCTGGAGAACGCGCTGGCGCAGCAGGTGGGTGCCCACGGCTGGCCAGTGGCCACTCCGGGGAAGCCGAACACGGCCTACCGCGTGTTCGGAGCCAAGCCGCTGGAGGACGTCGTGCGCAACCTGCGCTGCATGATCATGGAGGAGCTGGAGAACGCGCTGGCGCAGCAGGTGGGTGCCCACGGCTGGCCAGTGGCCACTCCGGGGAAGCCGAACACGGCCTACCGCGTGTTCGGAGCCAAGCCGCTGGAGGACGTCGTGCGCAACCTGCGCTGCATGATCATGGAGGAGCTGGAGAACGCGCTGGCGCAGCAGGTGGGTGCCCACGGCTGGCCAGTGGCCACTCCGGGGAAGCCGAACACGGCCTACCGCGTGTTCGGAGCCAAGCCGCTGGAGGACGTCGTGCGCAACCTGCGCTGCATGATCATGGAGGAGCTGGAGAACGCGCTGGCGCAGCAGGTGGGTGCCCACGGCTGGCCAGTGGCCACTCCGGGGAAGCCGAACACGGCCTACCGCGTGTTCGGAGCCAAGCCGCTGGAGGACGTCGTGCGCAACCTGCGCTGCATGATCATGGAGGAGCTGGAGAACGCGCTGGCGCAGCAGGTGGGTGCCCACGGCTGGCCAGTGGCCACTCCGGGGAAGCCGAACACGGCCTATCGCGTGTTCGGAGCCAAGCCGCTGGAGGACGTCGTGCGCAACCTGCGCTGCATGATCATGGAGGAGCTGGAGAACGCGCTGGCGCAGCAGGTGGGTGCCCACGGCTGGCCAGTGGCCACTCCGGGGAAGCCGAACACGGCCTATCGCGTGTTCGGAGCCAAGCCGCTGGAGGACGTCGTGCGCAACCTGCGCTGCATGATCATGGAGGAGCTGGAGAACGCGCTGGCGCAGCAGGTGGGTGCCCACGGCTGGCCAGTGGCCACTCCGGGGAAGCCGAACACGGCCTATCGCGTGTTCGGAGCCAAGCCGCTGGAGGACGTCGTGCGCAACCTGCGCTGCATGATCATGGAGGAGCTGGAGAACGCGCTGGCGCAGCAGGTGGGTGCCCACGGCTGGCCAGTGGCCACTCCGGGGAAGCCGAACACGGCCTATCGCGTGTTCGGAGCCAAGCCGCTGGAGGACGTCGTGCGCAACCTGCGCTGCATGATCATGGAGGAGCTGGAGAACGCGCTGGCGCAGCAGGTGGGTGCCCACGGCTGGCCAGTGGCCACTCCGGGGAAGCCGAACACGGCCTATCGCGTGTTCGGAGCCAAGCCGCTGGAGGACGTCGTGCGCAACCTGCGCTGCATGATCATGGAGGAGCTGGAGAACGCGCTGGCGCAGCAGGTGGGTGCCCACGGCTGGCCAGTGGCCACTCCGGGGAAGCCGAACACGGCCTATCGCGTGTTCGGAGCCAAGCCGCTGGAGGACGTCGTGCGCAACCTGCGCTGCATGATCATGGAGGAGCTGGAGAACGCGCTGGCGCAGCAGGTGGGTGCCCACGGCTGGCCAGTGGCCACTCCGGGGAAGCCGAACACGGCCTACCGCGTGTTCGGAGCCAAGCCGCTGGAGGACGTCGTGCGCAACCTGCGCTGCATGATCATGGAGGAGCTGGAGAACGCGCTGGCGCAGCAGTTCGGCGTGGGCGCGGGccaggcgccggcgccgccgcaggACGACCCGTCGCTGTTCGAGCTGCCGCCGCTCATCATCGACGGCATCCCCACGCCCGTGGAGAAGATGACGCAGGCCCAGCTGCGCGCCTTCATCCCGCTCATGCTCAAGTACTCAATGGTGCGCGGCAAGCCGGGCTGGGGGCGCGAGTCGACGCGGCCGCCGTGGTGGCCCAAGGACCTGCCGTGGGCCAACGTGCGCATGGACGCGCGATCAGAAGATGAGAAACAGAAGATGTCGTGGACGCACGCGCTCCGACAGATCGTCATCAACTGTTACAAGTACCACGGCCGCGAGGACCTGCTGCCCGCGTTCACTGAGGACGACGAAAAACCCACGCAGCCCGTCTCGGCGTGCATGCCGTCGGCCAGCTCCAACTCCTCCCGCTCGCCCGGCGCGCGCGCGCCCACCGTCATATCCTCGCAGCAAGTCTGCATCGACCAGATGACGCTCGCCGATGTCGATATGTCCCAGTACGCGCCGGCCGTCCTGCAGACCATCACGAACCCCGATGGCTCCGTGTCCCTGGTGCAGCTGGACCCGTCGCACCCCATCATCACGCTGCCTGACGGCACCACCGCGCAAGTGATCCATACCGGGGAAGGCGGCGCGAGCGGCATGGTGCAGACTCTGGACGGAGAGTCCGTCGCCGTCGACCTCAACGCCGTCGCCGAGGCCACGCTCAACCACGACGGGCAGATCATCCTCACGGGCGAGGACGGACACGGCTACCCAGTGTCGGTGTCGGGCGTGATCACGGTGCCGGTGTCGGCCTCGGTGTACCAGAGCATGGTGGCCTCCATGCAGCAGCAGGACGGCGTCTGCGTCGCTCCCCTGGTACAGGTAGAGCAAGGGGGCGAGACCCTGGAGGCGCTGGGCATGGGCGGCGGCGTGGCGCAGGTCATGCTGCAGGGCGGCGAGCAGGTGCTGCAGGTGCTGAGCCTCAAGGACGCCACCGTGCTCACCAAGGCCATGGTGAAAGCCGAACGGGATGCCGTGGCAGCTGACTCCTAG
Protein Sequence
MVLEKDRDEDYIMNSAVSTESMDMAEEDMSQVDGCGLSGGSEDEDECASSPAGSAYDDGADLIKNALSDEVTKQLAAAGPVGMAAAAAIASSKKRKRPHSFETNPSVRKRHQNRLLRKLRQTIEEFATRVGQQAVVLVATPGKPNTAYRVFGAKPLEDVVRNLRCMIMEELENALAQQVGAHGWPVATPGKPNTAYRVFGAKPLEDVVRNLRCMIMEELENALAQQVGAHGWPVATPGKPNTAYRVFGAKPLEDVVRNLRCMIMEELENALAQQVGAHGWPVATPGKPNTAYRVFGAKPLEDVVRNLRCMIMEELENALAQQVGAHGWPVATPGKPNTAYRVFGAKPLEDVVRNLRCMIMEELENALAQQVGAHGWPVATPGKPNTAYRVFGAKPLEDVVRNLRCMIMEELENALAQQVGAHGWPVATPGKPNTAYRVFGAKPLEDVVRNLRCMIMEELENALAQQVGAHGWPVATPGKPNTAYRVFGAKPLEDVVRNLRCMIMEELENALAQQVGAHGWPVATPGKPNTAYRVFGAKPLEDVVRNLRCMIMEELENALAQQVGAHGWPVATPGKPNTAYRVFGAKPLEDVVRNLRCMIMEELENALAQQVGAHGWPVATPGKPNTAYRVFGAKPLEDVVRNLRCMIMEELENALAQQVGAHGWPVATPGKPNTAYRVFGAKPLEDVVRNLRCMIMEELENALAQQVGAHGWPVATPGKPNTAYRVFGAKPLEDVVRNLRCMIMEELENALAQQVGAHGWPVATPGKPNTAYRVFGAKPLEDVVRNLRCMIMEELENALAQQVGAHGWPVATPGKPNTAYRVFGAKPLEDVVRNLRCMIMEELENALAQQVGAHGWPVATPGKPNTAYRVFGAKPLEDVVRNLRCMIMEELENALAQQVGAHGWPVATPGKPNTAYRVFGAKPLEDVVRNLRCMIMEELENALAQQVGAHGWPVATPGKPNTAYRVFGAKPLEDVVRNLRCMIMEELENALAQQVGAHGWPVATPGKPNTAYRVFGAKPLEDVVRNLRCMIMEELENALAQQVGAHGWPVATPGKPNTAYRVFGAKPLEDVVRNLRCMIMEELENALAQQVGAHGWPVATPGKPNTAYRVFGAKPLEDVVRNLRCMIMEELENALAQQVGAHGWPVATPGKPNTAYRVFGAKPLEDVVRNLRCMIMEELENALAQQVGAHGWPVATPGKPNTAYRVFGAKPLEDVVRNLRCMIMEELENALAQQVGAHGWPVATPGKPNTAYRVFGAKPLEDVVRNLRCMIMEELENALAQQVGAHGWPVATPGKPNTAYRVFGAKPLEDVVRNLRCMIMEELENALAQQVGAHGWPVATPGKPNTAYRVFGAKPLEDVVRNLRCMIMEELENALAQQVGAHGWPVATPGKPNTAYRVFGAKPLEDVVRNLRCMIMEELENALAQQVGAHGWPVATPGKPNTAYRVFGAKPLEDVVRNLRCMIMEELENALAQQVGAHGWPVATPGKPNTAYRVFGAKPLEDVVRNLRCMIMEELENALAQQFGVGAGQAPAPPQDDPSLFELPPLIIDGIPTPVEKMTQAQLRAFIPLMLKYSMVRGKPGWGRESTRPPWWPKDLPWANVRMDARSEDEKQKMSWTHALRQIVINCYKYHGREDLLPAFTEDDEKPTQPVSACMPSASSNSSRSPGARAPTVISSQQVCIDQMTLADVDMSQYAPAVLQTITNPDGSVSLVQLDPSHPIITLPDGTTAQVIHTGEGGASGMVQTLDGESVAVDLNAVAEATLNHDGQIILTGEDGHGYPVSVSGVITVPVSASVYQSMVASMQQQDGVCVAPLVQVEQGGETLEALGMGGGVAQVMLQGGEQVLQVLSLKDATVLTKAMVKAERDAVAADS

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-