Basic Information

Gene Symbol
ewg
Assembly
GCA_943789515.1
Location
CALSUL010000249.1:1623880-1642467[+]
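The Location value appears to follow a `contig:start-end[strand]` layout. The sketch below splits it into fields and computes the genomic span. The `Location` type and `parse_location` helper are hypothetical names, and the span calculation assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for the fields of a Location string."""
    contig: str
    start: int
    end: int
    strand: str


# Layout inferred from the record: contig:start-end[strand]
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a 'contig:start-end[strand]' string into typed fields."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        contig=match["contig"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("CALSUL010000249.1:1623880-1642467[+]")
# Assumption: coordinates are 1-based and inclusive at both ends.
span = loc.end - loc.start + 1
print(loc.contig, loc.strand, span)
```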

Transcription Factor Domain

TF Family
Nrf1
Domain
Nrf1_DNA-bind domain
PFAM
PF10491
TF Group
Basic Domains group
Description
In Drosophila, the erect wing (ewg) protein is required for proper development of the central nervous system and the indirect flight muscles. The fly ewg gene encodes a novel DNA-binding domain that is also found in four genes previously identified in sea urchin, chicken, zebrafish, and human [1]. Nuclear respiratory factor-1 (Nrf-1) is a transcriptional activator that has been implicated in the nuclear control of respiratory chain expression in vertebrates. The first 26 amino acids of Nrf-1 are required for binding dynein light chain, and this interaction is observed for both ewg and Nrf-1, transcription factors that are structurally and functionally similar between Drosophila and humans [2]. The highest levels of expression of both ewg and Nrf-1 were found in the central nervous system, somites, first branchial arch, optic vesicle, and otic vesicle. The mouse Nrf-1 protein (Swiss-Prot:Q8C4C0) also has an NLS domain at residues 88-116 and a DNA-binding and dimerisation domain at residues 127-282. Ewg is a site-specific transcriptional activator, and evolutionarily conserved regions of ewg contribute both positively and negatively to transcriptional activity [3].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 20 3.9e-42 5e-38 132.8 0.7 1 78 85 162 85 167 0.98
2 20 1.2e-14 1.5e-10 42.9 0.0 42 78 170 206 166 212 0.95
3 20 1.1e-14 1.5e-10 43.0 0.0 42 78 214 250 209 256 0.94
4 20 1.2e-14 1.5e-10 42.9 0.0 42 78 258 294 253 299 0.95
5 20 1.1e-14 1.5e-10 43.0 0.0 42 78 302 338 297 344 0.94
6 20 1.1e-14 1.4e-10 43.0 0.0 42 78 346 382 340 388 0.94
7 20 1.1e-14 1.4e-10 43.0 0.0 42 78 390 426 384 432 0.94
8 20 1.1e-14 1.4e-10 43.0 0.0 42 78 434 470 428 476 0.94
9 20 1.1e-14 1.4e-10 43.0 0.0 42 78 478 514 472 520 0.94
10 20 1.2e-14 1.5e-10 42.9 0.0 42 78 522 558 518 564 0.95
11 20 1.2e-14 1.5e-10 42.9 0.0 42 78 566 602 562 608 0.95
12 20 1.2e-14 1.5e-10 42.9 0.0 42 78 610 646 605 651 0.95
13 20 1.1e-14 1.5e-10 43.0 0.0 42 78 654 690 648 695 0.95
14 20 1.1e-14 1.5e-10 43.0 0.0 42 78 698 734 692 739 0.95
15 20 1.1e-14 1.4e-10 43.0 0.0 42 78 742 778 736 784 0.94
16 20 1.1e-14 1.4e-10 43.0 0.0 42 78 786 822 780 828 0.94
17 20 1.2e-14 1.5e-10 42.9 0.0 42 78 830 866 826 872 0.95
18 20 1.1e-14 1.5e-10 43.0 0.0 42 78 874 910 869 916 0.94
19 20 1.1e-14 1.4e-10 43.0 0.0 42 78 918 954 911 960 0.94
20 20 3.1e-82 4e-78 263.8 0.0 42 212 962 1131 957 1132 0.97
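Each row of the table above carries 13 whitespace-separated values: the hit index and total, the conditional and independent E-values, score, bias, the from/to coordinates on the HMM, alignment, and envelope, and the accuracy. A minimal sketch of reading such rows into typed records is shown here. The `DomainHit` class and `parse_hit` function are hypothetical names, and the three sample rows are copied from the table.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    """One row of the per-domain table (field names are illustrative)."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hit(line: str) -> DomainHit:
    """Parse one whitespace-separated row into a DomainHit."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 columns, got {len(fields)}: {line!r}")
    return DomainHit(
        int(fields[0]),
        int(fields[1]),
        float(fields[2]),
        float(fields[3]),
        float(fields[4]),
        float(fields[5]),
        *(int(value) for value in fields[6:12]),  # six coordinate columns
        float(fields[12]),
    )


rows = [
    "1 20 3.9e-42 5e-38 132.8 0.7 1 78 85 162 85 167 0.98",
    "2 20 1.2e-14 1.5e-10 42.9 0.0 42 78 170 206 166 212 0.95",
    "20 20 3.1e-82 4e-78 263.8 0.0 42 212 962 1131 957 1132 0.97",
]
hits = [parse_hit(row) for row in rows]

# Aligned length on the protein, treating coordinates as inclusive.
ali_lengths = [hit.ali_to - hit.ali_from + 1 for hit in hits]
print(ali_lengths)  # [78, 37, 170]
```

With the rows parsed this way, the 44-residue spacing between successive alignment starts for hits 2 to 19 (170, 214, 258, ...) can be checked directly from `ali_from`.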

Sequence Information

Coding Sequence
ATGGTCCTAGAGGTGAAGCGCACTCACGACGAAGACAGCGATCCCGACTTCGACATCATGAATTCGGCAGTCAGCACAGAGTCTATTGACATGGCGGAGGAGGACATGTCCCAGGTGGAGGGCTGCGGGCTGAGCGGGTCGGAGGACGACGACGAGTGCGCGTCGTCGCCCGCCGGCTCCGCCTACGACGACCAGCAGGACATGATCAAGAACGCCCTGAGCGACGAGGTCACCAAGCAGCTGGCTGCTGCAGGTCCCGTTGGCATGGCTGCAGCCGCCGCCATCGCCTCCTCAAAGAAACGCAAGAGGCCACATTCCTTCGAAACCAACCCCTCAGTCAGAAAAAGACACCAGAATAGACTGCTCAGGAAACTAAGACAAACGATCGAGGAGTTCGCGACGCGCGTGGGGCAGCAGGCCGTGGTGCTGGTGGCGACGCCGGGCAAGCCCAACACCTCGTACCGCGTGTTCGGCGCCGAGCCATTAGTATATCAGTCGATGTTGTGGCAGCAAACGATCGAGGAGTTCGCGACGCGCGTGGGGCAGCAGGCCGTGGTGCTGGTGGCGACGCCGGGCAAGCCCAACACCTCGTACCGCGTGTTCGGCGCCGAGCCATTAGTATATCAGTCGATGTTGTGGCAGCAAACGATCGAGGAGTTCGCGACGCGCGTGGGGCAGCAGGCCGTGGTGCTGGTGGCGACGCCGGGCAAGCCCAACACCTCGTACCGCGTGTTCGGCGCCGAGCCATTAGTATATCAGTCGATGTTGTGGCAGCAAACGATCGAGGAGTTCGCGACGCGCGTGGGGCAGCAGGCCGTGGTGCTGGTGGCGACGCCGGGCAAGCCCAACACCTCGTACCGCGTGTTCGGCGCCGAGCCATTAGTATATCAGTCGATGTTGTGGCAGCAAACGATCGAGGAGTTCGCGACGCGCGTGGGGCAGCAGGCCGTGGTGCTGGTGGCGACGCCGGGCAAGCCCAACACCTCGTACCGCGTGTTCGGCGCCGAGCCATTAGTATATCAGTCGATGTTGTGGCAGCAAACGATCGAGGAGTTCGCGACGCGCGTGGGGCAGCAGGCCGTGGTGCTGGTGGCGACGCCGGGCAAGCCCAACACCTCGTACCGCGTGTTCGGCGCCGAGCCATTAGTATATCAGTCGATGTTGTGGCAGCAAACGATCGAGGAGTTCGCGACGCGCGTGGGGCAGCAGGCCGTGGTGCTGGTGGCGACGCCGGGCAAGCCCAACACCTCGTACCGCGTGTTCGGCGCCGAGCCATTAGTATATCAGTCGATGTTGTGGCAGCAAACGATCGAGGAGTTCGCGACGCGCGTGGGGCAGCAGGCCGTGGTGCTGGTGGCGACGCCGGGCAAGCCCAACACCTCGTACCGCGTGTTCGGCGCCGAGCCATTAGTATATCAGTCGATGTTGTGGCAGCAAACGATCGAGGAGTTCGCGACGCGCGTGGGGCAGCAGGCCGTGGTGCTGGTGGCGACGCCGGGCAAGCCCAACACCTCGTACCGCGTGTTCGGCGCCGAGCCATTAGTATATCAGTCGATGTTGTGGCAGCAAACGATCGAGGAGTTCGCGACGCGCGTGGGGCAGCAGGCCGTGGTGCTGGTGGCGACGCCGGGCAAGCCCAACACCTCGTACCGCGTGTTCGGCGCCGAGCCATTAGTATATCAGTCGATGTTGTGGCAGCAAACGATCGAGGAGTTCGCGACGCGCGTGGGGCAGCAGGCCGTGGTGCTGGTGGCGACGCCGGGCAAGCCCAACACCTCGTACCGCGTGTTCGGCGCCGAGCCATTAGTATATCAGTCGATGTTGTGGCAGCAAACGATCGAGGAGTTCGCGACGCGCGTGGGGCAGCAGGCCGTGGTGCTGGTGGCGACGCCGGGCAAGCCCAACACCTCGTACCGCGTGTTCGGCGCCGAGCCATTAGTATATCAGTCGATGTTGTGGCAGCAAACGATCGAGGAGTTCGCGACGCGCGTGGGGCAGCA
GGCCGTGGTGCTGGTGGCGACGCCGGGCAAGCCCAACACCTCGTACCGCGTGTTCGGCGCCGAGCCATTAGTATATCAGTCGATGTTGTGGCAGCAAACGATCGAGGAGTTCGCGACGCGCGTGGGGCAGCAGGCCGTGGTGCTGGTGGCGACGCCGGGCAAGCCCAACACCTCGTACCGCGTGTTCGGCGCCGAGCCATTAGTATATCAGTCGATGTTGTGGCAGCAAACGATCGAGGAGTTCGCGACGCGCGTGGGGCAGCAGGCCGTGGTGCTGGTGGCGACGCCGGGCAAGCCCAACACCTCGTACCGCGTGTTCGGCGCCGAGCCATTAGTATATCAGTCGATGTTGTGGCAGCAAACGATCGAGGAGTTCGCGACGCGCGTGGGGCAGCAGGCCGTGGTGCTGGTGGCGACGCCGGGCAAGCCCAACACCTCGTACCGCGTGTTCGGCGCCGAGCCATTAGTATATCAGTCGATGTTGTGGCAGCAAACGATCGAGGAGTTCGCGACGCGCGTGGGGCAGCAGGCCGTGGTGCTGGTGGCGACGCCGGGCAAGCCCAACACCTCGTACCGCGTGTTCGGCGCCGAGCCATTAGTATATCAGTCGATGTTGTGGCAGCAAACGATCGAGGAGTTCGCGACGCGCGTGGGGCAGCAGGCCGTGGTGCTGGTGGCGACGCCGGGCAAGCCCAACACCTCGTACCGCGTGTTCGGCGCCGAGCCATTAGTATATCAGTCGATGTTGTGGCAGCAAACGATCGAGGAGTTCGCGACGCGCGTGGGGCAGCAGGCCGTGGTGCTGGTGGCGACGCCGGGCAAGCCCAACACCTCGTACCGCGTGTTCGGCGCCGAGCCATTAGTATATCAGTCGATGTTGTGGCAGCAAACGATCGAGGAGTTCGCGACGCGCGTGGGGCAGCAGGCCGTGGTGCTGGTGGCGACGCCGGGCAAGCCCAACACCTCGTACCGCGTGTTCGGCGCCAAGCCGCTGGAGGACGTCGTGCGCAACCTGCGCTGCATGATCATGGAGGAGCTCGAGAACGCGCTCGCGCAGCAGgcgccgccgccgcccgccgACGACCCCTCGCTGTTCGAGCTGCCGCCGCTCATCATCGACGGCATCCCCACGCCCGTCGAGAAGATGACGCAGGCGCAGCTCAGGGCCTTCATACCCCTCATGCTCAAGTACTCCATGGTCCGCGGCAAGCCGGGGTGGGGCCGCGAGTCCACGCGGCCGCCGTGGTGGCCCAAGGACCTGCCCTGGGCCAACGTGCGCATGGACGCGCGCTCCGAGGACGAAAAACAAAAGATGTCGTGGACGCACGCGCTGCGGCAGATCGTGATCAACTGCTACAAGTACCACGGGCGCGAGGACCTGCTGCCCGCCTTCACCGAGGAGGACGACAAGCCGCCCACGCAGCCCATGTCGCAGTACGCTCCAGcggtgctgcagaccatcacGAACCCGGACGGCACGGTGTCGCTCATCCAGGTGGACCCCAACAACCCCATCATCACGCTGCCCGACGGGACCACCGCGCAAGTGATCCACAGCAGCGAGGGCGGCGGCAGCGTGGTGCAGGCGCTGGACGGCGAGGGCGGCTCCGTGACCGTGGACCTCAACGCCGTGGCCGAGGCCACGCTCAACCACGACGGACAACTCATACTCACCGGCGAGGACGGACACGGTACACGATGCAAAATAGCATACATACACTAG
Protein Sequence
MVLEVKRTHDEDSDPDFDIMNSAVSTESIDMAEEDMSQVEGCGLSGSEDDDECASSPAGSAYDDQQDMIKNALSDEVTKQLAAAGPVGMAAAAAIASSKKRKRPHSFETNPSVRKRHQNRLLRKLRQTIEEFATRVGQQAVVLVATPGKPNTSYRVFGAEPLVYQSMLWQQTIEEFATRVGQQAVVLVATPGKPNTSYRVFGAEPLVYQSMLWQQTIEEFATRVGQQAVVLVATPGKPNTSYRVFGAEPLVYQSMLWQQTIEEFATRVGQQAVVLVATPGKPNTSYRVFGAEPLVYQSMLWQQTIEEFATRVGQQAVVLVATPGKPNTSYRVFGAEPLVYQSMLWQQTIEEFATRVGQQAVVLVATPGKPNTSYRVFGAEPLVYQSMLWQQTIEEFATRVGQQAVVLVATPGKPNTSYRVFGAEPLVYQSMLWQQTIEEFATRVGQQAVVLVATPGKPNTSYRVFGAEPLVYQSMLWQQTIEEFATRVGQQAVVLVATPGKPNTSYRVFGAEPLVYQSMLWQQTIEEFATRVGQQAVVLVATPGKPNTSYRVFGAEPLVYQSMLWQQTIEEFATRVGQQAVVLVATPGKPNTSYRVFGAEPLVYQSMLWQQTIEEFATRVGQQAVVLVATPGKPNTSYRVFGAEPLVYQSMLWQQTIEEFATRVGQQAVVLVATPGKPNTSYRVFGAEPLVYQSMLWQQTIEEFATRVGQQAVVLVATPGKPNTSYRVFGAEPLVYQSMLWQQTIEEFATRVGQQAVVLVATPGKPNTSYRVFGAEPLVYQSMLWQQTIEEFATRVGQQAVVLVATPGKPNTSYRVFGAEPLVYQSMLWQQTIEEFATRVGQQAVVLVATPGKPNTSYRVFGAEPLVYQSMLWQQTIEEFATRVGQQAVVLVATPGKPNTSYRVFGAEPLVYQSMLWQQTIEEFATRVGQQAVVLVATPGKPNTSYRVFGAEPLVYQSMLWQQTIEEFATRVGQQAVVLVATPGKPNTSYRVFGAKPLEDVVRNLRCMIMEELENALAQQAPPPPADDPSLFELPPLIIDGIPTPVEKMTQAQLRAFIPLMLKYSMVRGKPGWGRESTRPPWWPKDLPWANVRMDARSEDEKQKMSWTHALRQIVINCYKYHGREDLLPAFTEEDDKPPTQPMSQYAPAVLQTITNPDGTVSLIQVDPNNPIITLPDGTTAQVIHSSEGGGSVVQALDGEGGSVTVDLNAVAEATLNHDGQLILTGEDGHGTRCKIAYIH

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-