Dneo013273.1
Basic Information
- Insect
- Drosophila neotestacea
- Gene Symbol
- -
- Assembly
- GCA_035044365.1
- Location
- JAWNMD010000412.1:682435-697380[-]
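The Location value above packs a contig accession, a coordinate range and a strand into one string. A minimal sketch of how it could be split into fields follows. The names `Location`, `parse_location` and `span_length` are hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Fields of a location string such as 'contig:start-end[strand]'."""
    contig: str
    start: int
    end: int
    strand: str


# contig accession, ':', start, '-', end, then '[+]' or '[-]'
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:\s]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Parse a location string; raise ValueError if the layout differs."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        contig=match["contig"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


def span_length(loc: Location) -> int:
    """Genomic span in bp, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("JAWNMD010000412.1:682435-697380[-]")
print(loc.contig, loc.start, loc.end, loc.strand, span_length(loc))
```

The span covers the whole locus on the contig, so it is expected to be longer than the coding sequence if the gene contains introns.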
Transcription Factor Domain
- TF Family
- THAP
- Domain
- THAP domain
- PFAM
- PF05485
- TF Group
- Zinc-Coordinating Group
- Description
- The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
- Hmmscan Out
-
#   of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
1   28   5.5e-15   1.1e-11   45.5   4.3         1      86       569     641       569     642  0.85
2   28   3.1e-15   6.4e-12   46.2   4.6         1      87       669     738       669     738  0.83
3   28   6.8e-16   1.4e-12   48.4   0.4         1      87       760     832       760     832  0.84
4   28   6.7e-16   1.4e-12   48.4   5.7         1      87       935    1005       935    1005  0.83
5   28   8.2e-15   1.7e-11   44.9   3.4         1      86      1029    1100      1029    1101  0.82
6   28   4.9e-13     1e-09   39.2   1.1         1      87      1136    1204      1136    1204  0.80
7   28   3.8e-10   7.7e-07   30.0   1.7         1      86      1251    1320      1251    1321  0.76
8   28   6.7e-16   1.4e-12   48.4   0.1         1      86      1348    1417      1348    1418  0.83
9   28   1.5e-12     3e-09   37.7   1.2         1      86      1439    1508      1439    1509  0.81
10  28   7.5e-15   1.5e-11   45.0   1.6         1      86      1536    1607      1536    1608  0.85
11  28   6.9e-14   1.4e-10   42.0   1.6         1      85      1684    1752      1684    1754  0.82
12  28   3.6e-12   7.3e-09   36.4   0.1         1      86      1777    1845      1777    1846  0.81
13  28   8.5e-14   1.7e-10   41.7   1.3         1      87      1996    2065      1996    2065  0.81
14  28   4.4e-12   8.9e-09   36.2   0.0         1      62      2122    2180      2122    2200  0.78
15  28      0.14   2.9e+02    2.5   0.0         1      58      2214    2264      2214    2280  0.77
16  28   2.2e-12   4.5e-09   37.1   1.0         1      86      2303    2372      2303    2373  0.84
17  28   8.6e-15   1.7e-11   44.9   2.3         1      86      2454    2523      2454    2524  0.83
18  28   5.1e-12     1e-08   36.0   0.8         1      86      2559    2630      2559    2631  0.81
19  28   2.8e-12   5.8e-09   36.8   2.4         1      87      2641    2713      2641    2713  0.82
20  28   1.3e-14   2.6e-11   44.3   0.1         1      86      2743    2813      2743    2814  0.76
21  28   0.00048      0.97   10.4   0.0         1      58      2847    2897      2847    2924  0.72
22  28   1.6e-13   3.2e-10   40.8   0.1         1      86      2935    3007      2935    3008  0.79
23  28   1.4e-15   2.8e-12   47.4   0.2         1      86      3160    3232      3160    3233  0.82
24  28     1e-13   2.1e-10   41.4   2.2         1      87      3300    3371      3300    3371  0.82
25  28   3.9e-15     8e-12   45.9   3.8         1      86      3480    3550      3480    3551  0.85
26  28   5.5e-13   1.1e-09   39.1   0.0         1      87      3634    3704      3634    3704  0.85
27  28   6.4e-09   1.3e-05   26.0   1.1         1      58      3723    3769      3723    3785  0.86
28  28     7e-10   1.4e-06   29.1   0.7        18      87      3786    3844      3772    3844  0.75
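The C2CH consensus quoted in the Description (Cys, 2-4 residues, Cys, 35-50 residues, Cys, 2 residues, His) can be written as a regular expression. The sketch below is only a rough candidate scan, not a substitute for the profile-HMM hits listed above. The function name `find_c2ch_candidates` is hypothetical, and the test fragment is copied from this record's protein sequence.

```python
import re

# Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His.
# The lookahead lets overlapping candidates be reported separately.
C2CH_PATTERN = re.compile(r"(?=(C.{2,4}C.{35,50}C.{2}H))")


def find_c2ch_candidates(protein: str) -> list[tuple[int, str]]:
    """Return (0-based start, matched text) for every C2CH-like stretch.

    Whitespace and line breaks are removed first, since the record
    wraps long sequences across several lines.
    """
    sequence = "".join(protein.split()).upper()
    return [(m.start(), m.group(1)) for m in C2CH_PATTERN.finditer(sequence)]


# Fragment taken from the protein sequence of this record.
fragment = "CCVPLCGVRKSTSPTLQFFTFPKDEKYLHQWLHNLKMFHIPASSYATFRICSMH"
for start, text in find_c2ch_candidates(fragment):
    print(start, len(text), text)
```

A regex of this kind ignores the other conserved residues and the AVPTIF box, so it is likely to report both false positives and false negatives. The hmmscan scores and E-values remain the evidence for each domain.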
Sequence Information
- Coding Sequence
- ATGTCACAACACAACAACCCcccgcatcatcatcatcatcactactaccagcagcagcagcaacaacagcaacaacaacaacatcatcaccaccagcagcagcatcagcagcagcaactacaacataaacaaatacagcAGCAAAGTTGGTACTCACATGTTGCTTCCTACCCTCCCCCCCACCATCCGCACACCGCAGCCTTTGCGGCGCCCTGcaaaagcaataataacaacaacaacaacattatgaATGCATACGGTGCGGGAGCTGGAAGCACGCATGCAGCATATTATGGCTCTGGTGGGGTGGGCTATAACCTTGAGGGCAATACTGTGGCCTATGCGCACAACCAGCTGctacaataccaacaacaacaacaacaacagcagcaacaacaacaacaacaacatcatcagctcAGTCAACGCTCGTATATGCCGCACAGTTTAATGCATAGCTCGTATCCCTATATTAAGAGCGAGCCATTGGAGCTGCCTGATGATagacaacgccaacaacaccaacatcagcagcagcaaccgcagcaacaacatttccaGAATCCTATGGCACCGCCGCCAGCTCCCGCCAATCGTCACAGTCTCGATGCCAGCGGtgaaatgataataaaatCGGAACCCATTGACGAACATGCCTACAAGTCAAACTATATCGATGACAACACGCCCTTTGCTGATTTTAGTAAATATCCGGAGTTCGGTGACGACATGTTAAGTCCCAAGGTGGAGCTAACGGTCAAGGACGAGGGCTATGGGAGTCAAAAAGTTCCCAACCCGCTCAGCTATCCGAGACGAAAGCTACAATCGGAGCGCTCATCGGAAAGTCTTCCCATTTGTCAGCGTTGCAAGGAGGTGTTCTTTAAGAAACAAATCTACTTGCGTCATGTGGCCGAGAGCAGTTGCGGTATACAGGAGTATGACTTCAAGTGCAACATATGTCCCATGTCCTTTATGAGCACTGAAGAGTTGCAGAAGCACAAGCATCTACACAGGGCAGACAAATTCTTCTGCCACAAATATTGTGGCAAGTACTTTGACACCATTGCCGAATGTGAGTCGCATGAGTACATGCAACATGAGTATGATAGTTTTGTTTGCAACATGTGTTCCGTTACGTTTGCCACGCGGGAACAGCTTTACGCTCATTTGCCGCAACACAAATTCCAGCAGCGTTACGATTGTCCCATTTGCCGCTTGTGGTACCAAACGGCTCTGGAGCTGCACGAGCATCGTCTGGCTGCTCCCTATTTCTGTGGCAAGTATTATACAGGCGCACAATCGGCatcacaccaacaacagcagcaacagcatccacagcatcagcaacaggcCAACTACAAACTGCAGGACTGTCACATGGCCACCATGGAAATGCCAACGCCACATCACAAGGCAAATACAACTGCAAATGCATTGCCGGCAACGGCAGCTTTGAGCTcattgttgcaacaacgtCAGGCGAATGCCGATGGAGCCGCTATGTTTGCCTCAACGATGAAGAACGAGGCAAATGTGAAGCTGGAGCGAAGCTACAGCAATTCTACAAGCGAGTCTGGTTACAGTTTGCACGACAGCAGCTATAACAATGCCTATGGGAGCGATACATCGTTACATGCCGGTGGTGGTGCAGTTGGTGGTCCACAGGCGCACTCCTCGACGCTGGACGATTCGGAGGATGCTCTCTGTTGTGTGCCACTGTGCGGTGTCCGAAAGAGCACCAGCCCAACGCTTCAGTTCTTTACTTTCCCCAAAGATGAGAAGTACTTGCATCAGTGGCTCCATAATCTCAAGATGTTCCATATTCCAGCGTCGAGCTATGCAACTTTTCGGATCTGCAGCATGCACTTCCCGAAGCGTTGCATCAATCGTTATTCCCTGTGCTATTGGGCGGTGCCTACGTTCAATCTGGGCCATGACGATGTTGCCAATTTGTATCAGAATCGTGAGCTGACCAACACTTTTACCACTGGAGAGGTG
GCACGTTGCAGCATGCCCAACTGCACCAGCCAGCGGGGAGAGAGTAATCTCaagttttacaattttcccAAGGATATCAAGAGTCTGATCAAGTGGTGCCAGAACGCCCGTTTGCCCGTCCAAGCCAAGGAGCCGCGTCACTTTTGCAGTCGACACTTTGAGGAGCGTTGCATTGGCAAATTTCGCCTGAAGCCCTGGGCAGTGCCCACTTTACATTTGGGGGCTCAATACGGCAAGATTCATGACAATCCGAAGAACTTGTATGTGGAGGAGAAACGTTGCTGTCTTAATTTCTGTCGTCGCAGTCGCTCTTCAGATTTTAACATGTCACTGTATCGCTTCCCCCGGGATGAGGTCCTCCTCCGACGCTGGTGCTACAATCTACGACTTGATCCTTCTGTCTATCGTGgcaaaaatcacaaaatatgCAGCGCTCACTTTATCAAAGAGGCATTGGGACTTCGCAAATTATCACCAGGAGCTGTTCCCACGTTGCATTTGGGCCACAATGATACGTTCAACATATACGAAAATGAACTGTGGCCACCACCAACGTCGACCACGCCCACCAATAAccaacagcaattgcagcagcaacagttgcaacagcaacatcaacaacatcagtcGCATCATGGTCATCATGGCAACAGCAAGTATCTACGTCATTCGGCTGCATCGACATCCTCGTCGGCCAGCTCGGCATCGCATTATGTGGATCCGGAAATGAGTGGAACATATATGGGAATGGGTAACTCGGGAGGATCTTCGTCTGGCCTGAATGTAAGCGACAGCATGGACGTGTGCTGTGTGCCCAGCTGTGAGAGTAAACGTcacaacaatgaaaatatcACATTCCATACGATACCAAGGAGACCGGAGCAGATGCGCAAATGGTGTCACAATCTAAAGATACCCGAGGATAAGATGCACAAGGGAATGCGCATCTGTAGTCTGCATTTTGAGCCATACTGCATCGGTGGATGCATGCGTCCATTTGCGGTGCCCACATTGCATCTTGGACATGACGACGAGGACATTCATCGTAATCCGGACGTGATCAAGAAGCTGAACATACGCGAAACGTGCTGTGTTGCTGTCTGCAAACGCAATCGGGATCGAGATCATGCCAATCTTCATCGCTTCCCCAGCAACGTGTCCCTGCTGACCAAGTGGTGTGCCAACTTGCAACGTCCAGTTCCAGATGGCACCAAGCTCTTTAACGATGCCATTTGTGAGGTGCACTTTGAGGATCGATGTCTGCGCAACAAGCGACTAGAGAAGTGGGCGGTGCCCACGTTGATTCTCGGTCATGAAAATATTGCTTATCCTCTGCCTACGGCAGAGCAAGTGGCCGAGTTCTATTCGCGACCCAGTGCACCCAACAATGGCGAGGAGCAAGGCGAGTGCTGTGTGGAGACTTGTAAGCGTAATCCAAGCGTGGATGACATTAAGCTCTATCGTCCGCCAGAGGAGTCACAAGTGCTGGCCAAATGGGCTCATAATCTGCAGCTAGATGTCGCTCAGTTGCCCAACATGAGGATCTGTAATCTGCACTTTGAATCCCACTGCATTGGCAAACGGATGCGACCCTGGGCCATACCTACTCTCAATCTGTCCACCAATGTTGAGAATCTCTATGAGAATCCTGAACATCAGATGCTCTACAAGCGTCGCAAGCATCTCAATCCCGACCGAGGAGCTGCCTCCCATGGCGGTGCTGGCATCGTGAAGCCCACTTGGGTGCCACGCTGTTGCTTGTCACATTGTCGCAAGGTGCGCGCTTTGCATAATGTCCAACTGTATCGATTCCCTAAGCTCAATCGTTCCACGCTCGCCAAGTGGGCGCACAATCTTCAAGTGCCAATGGTGGGCAGTGCCCAGAGACGTCTCTGCTCGGCTCACTTTGAGCCTCATGTGCTGAGTAAGAAGTGTCCAGTGCCGCTGGCGGTGCCCACACTGGAACTTAATTCACCGCCTGGCTACAAGATCTATCAGAA
TCCCGCCAAGCTGAAGGCTAACAAGCTCTGCCTGCAGCGTGTCTGCATTGTCGAGAGTTGTCGTCGGCAACGTGGTCAGGGAGTTCAGCTCTTCCGTCTGCCCCACAATCCCACCCAGCTTCGCAAATGGATGCACAACATACGGATGCGACCGAGAGGAGCCATGCGCCAACAATATCGCATGTGTTCCATTCACTTTGAGACGCACTCGTTCAATGGGAAGCGGTTAAGTGCCGGGGCTATACCAACTCTGGAGCTGGGACACCAGGATGATGATATCTATCCGAATGAAGCACAATCTTTTGTCGAGGAGCACTGCACTGTCGAGGGCTGTGATGCGTCCAAGGAGCAGCCGGATGTGCGTCTCTTCCGATTCCCCACCGAAGATGAGGATCTGCTCTGGAAATGGTGCAACAATCTCAAGATGAATCCCGTTGATTGCGTCGGCGTTCGCATATGCAACAAACATTTCGAGACGGACTGCATCGGACCCAAGCATCTATACAAGTGGGCGTTACCCACACTGGAACTGGGACATGATGATGCTCAGATTGAGCTCATACACAATCCCAAGCCGGAGGAACGCTACGTTGATCCCGTGTTCAAGTGCTGTGTTCCCACTTGTGGCAAAACGCGCAAGTTCGATGAGGTGCAAATGAATAGCTTCCCCAAGGATCCAACACTCTTTCAGCGCTGGCGTCACAATCTCCGTCTCGACCATCTCAATTTCAAGGAGCGCGAACGCTACAAGATCTGCAACGTTCACTTTGAGGACATTTGCATTGGAAAGACGCGGCTCAACATTGGCTCCATTCCAACTCTGGAGCTTGGGCATGAGGAGACTGAAGATCTCTACCAGGTGAATCCGGCGGAATTGCAAAGCAATCTTTTCGGACGTCAACGACGTGTCCACGAATCCATGGGCATTACCATCAAGCAGGAGGAGAACTCAGAGGTGGATGAGGACATTAAACCAGATATTGACATGTCCGAGGCATCAGACGTGAATACAAGACAGgttaaaataaagaaatcgATGTACGATTTGAAGTGCTGTGTGCCAAGCTGTGGACGTAGCCGTTTGGAGCATGGAGCACGCCTGTTTCCCTTTCCCAGTGGCAAGCAACAGCAGAGCAAGTGGCGCCATAATCTCCGTTTGGAACCCGACGACGTGGACAGAACAACACGTGTGTGCAGTGCTCATTTTAATCGACGGTGCATCGATGGGAAGCAGCTAAGGGGATGGGCAATGCCCACCCAGCAATTGGGACACCAGGAGCAACCCATCTATGAGAATCCGAAGAACATACCAGGCTTCTTTACGCCCACCTGTGCTTTGGCACATTGCCGAAAGCGTCGGAGCATTGATAATGATCTACGCACCTATCGATATCCCCGCAGCGAGGATCTGCTCGAGAAATGGCGTGTCAATTTGAGATTGGCGCCTGATCAGTGTCGCGGACGCATTTGTGCCGATCACTTTGAGCCCATGGTGCGTGGAAAGTTAAAGCTGAAGACGGGAGCGGTGCCCACGTTAAAATTGGGACATAATGAAGGCGTGGTCTTTGATAATGAAGTTATTAAGGCGGGTCTGCAGCAGGAGGCGGAGGCAGAAGAGGGTGAGGCAAGCATGGAGTCGCTGGTCAAGATTAAGCAAGAGAAAATCGATCCGGACGATGAGCTGGAAGATAACGTTGGTCGTGAAGAGcgtgatgataatgatgatgatgaggagtCGGAACAAAAGCACGAGCAGGACGCCGATCCAGAGGATCATGGCTATTTTGATCCCTTGGAACTGGTAGAAACCTTTGCGGAGCATCACAGCgaacatgatgatgatgacgaggacgacgatgatgatgatgatgaggatgaacCTGGCTATGATgatgagctgctgctgccggaTACTCCGCCAGTTCAAGTGTCGCTACCAGTGCCTCCACTACGGCGTGAGAAGCCTGTGAATAATGTGACGCCCATTTGTTGCCTAAAGC
ATTGTCGCAAGGAACGCACAGCGATTCATCCGCTGAGCACCTTTGGCTTTCCCAAGGATCATCAGCAATTGCTGAAGTGGAGCGCCAATCTAAAGCTGTCACCAGCCGACTGCGTTGGACGTGTTTGCATTGAGCATTTTGAGCCGGAGATGCTGGGCACGCGCAAGCTGAAGCAGAATGCGGTGCCCACTATTAATCTGGGACACACAACTCCTCTTAGCTACAGTTGCAATGGTCAATCTCAGGGCATTTATGATACCCAGCCGCAGCACTCGGTTTTTCGGCTTTGGAGCCTGAAACACTGTCGCAAAAGGAAGCTTCCAATGGAACCGGATCCGGATCAGGCAGCGACTAAGCGACGACGCTGCTGCCTGCCAAGCTGTGGCAAGCAGCCGGATCTCCATGGTGTCCAGCTACACCGATTGCCAAGCGATCGCATCCTGCTCCGCAAATGGCTATACAACCTGAAGCTATCACCAATGGTGGACATCGGCCAGGCACGTCTCTGTAGCGAACACTTTGAGCCGCAGATGGAGACGATGGAGGGATGTGTTCCAACATTGCGGTTGGGTCATGACGATACTCGTATTTATCGCAATCGTGGAAGTATTAGTGGCAGCATCTCGGCATCATCCAGTGGCTGCATGGTGGCCAGCTGTCCCTGTGCCCGCCTCAATCTCTATCGCTGCTATGATCTGCCCGCGAATCGTTTGGTGCAGAAGGTCTGGCTGGAATGGCTTCAACTGCCCATGCCTCAGCTGGCTAGCGATGGCAAGCTCTGCGTGATGCACTATATGCAGCTCTATGAGCAGGTGCCGTTGCCACAGGAGCTTCCAGAGCCTGTGCTCCGTCAGCTGCAAGAGGCTTATGATTCAATCTCCGGTTCGTCCATGGCCATGAAGCTACGCTGTGCCGTTCCAGGTTGTTACTCCAAGTACACGGACAACATTAGGCTGACTAAGCTACCAATGTGCCCGGATACCTGTGCCAAGTGGGTGCACAACACCAAGATCACCTATGATCCTACTCGACATTATATTTATCGCATTTGTATGCTTCACTTTGAGGCACGTTGCTTGGGTCCAGTGCGTCCAAAGCAGTGGGCGGTGCCAACACTGCAATTAAATCACAACGATCCGGATATCTATCTAAATCCTAAGGCTACTGAAACTCTGCCGACTCCTATGTCCATTTCCACTCCCGTTCCCGTGTCTATCTCCACGCCTGTTCCCGTATCTCTGTCAACGTCTGTTCCCATGTCTGTTCCCGTGGAGCTGCCTTTGCGTATTAAGACAGAGCTGGCATTTAGCGGCAGTCCCAGCGCCAGTGCCAGTCCAAGTCCACGTGGCAAACTGCGCATCTGCTGTATTCCCAGCTGTGCCCAGCAGGCAACATCCCAGTCGCGTCTCTTTCGCTTTCCCACCGCCGAGACGGCATTGCTCAAGTGGCTGGTGAATACACAGCAACAGCCGAGATTGGCTGATCCACAGCATCTGTTTGTCTGCCAGGATCATTTCGAGGCGGAGGCCATTTGTAAGAATCAACTACGAAGTTGGGCTGTGCCCACATTGAAGCTTGGACACGATGGTCATGTCATTCCAAATGCCAGGCACAATGGCAACATTGCAGACAGCCAGGAGAATAAGCAGGCGCTGCAGTTCATCTGggaaaactattgctcggtCTTGAGCTGCTTCCAACCACGCAGCGAGGATCTACGTCTCTATGCATATCCCACGGATAGACCCACCATACGGAAATGGGCGGCCAACTGCAAGCATCGATCCATGCAGGCCAGCAGCGATGGATTTCAGGTCTGCCAATCGCACTTTGCGCCTCATTGCTTTGACCCGGATACGGGTGAATTGCGTGAGAATGCTGTGCCGACGTTGGAGCTCAGTCGATGCATCAATGAGGTGCGCTGTGTGGTGTCCGGTTGTGTCAAGGATGAGGATGGACCGCGTCAACGCTATTACAAGATGCCCAAACGTTCCTCACAG
CTTAATATTTGGTGTCACAATCTTTGCCTGGACACCGTTGCCATGAGCTCTAGTGAGCATCATGTGTGCGATCGTCACTTTGAGATGCAGTGCTTCAATCAGCAAAAACTCCTGCGTCCTGGAGCACGACCCACGCTGCATTTGGGGCATGATGAGCAAATAGATCTGATGCCCAACCCCGCTGATTGGGCAACGGATGGCTCAGATGCGATGGCTATGACCACGGTCTGCTGTGTGCCCAACTGTGGACACTCCAGGGATGAGGATGATGTGCAGCTCTTTGCCTTCCCCAAAATGAGAGTTTTGGCGGAAAAGTGGCTACAGAATATACGCCTGGAGGTTGGCAGGGAGCAGTTGGCTAAGATGAGGATATGTGGGGCACACTTTGAGCACAGTTGCCTGGAGAATGGACGACCTCAGTTGGGGGCTATGCCCACGCTCGAACTAGGACACGAGGAGCGCCACAAAATACATCGAAGCGCAGATCCGACAGTAGGCAAGGTCAAGAAATATTGTAACAGAAGTGGCTCCAGCTATGACTGCTGTTATCCTCAGTGTGTAGAGCTACAAAAGACTTATCTGAGAATTAGCTACGATCTACCCCAGGGGATTGCACTGCGTCAGCAGTGGCTAGATTATATGGCCGTTGAGGAAACGGAGGAGAAACCCCTCAAGCTCTGTCCAATGCATTTGATCCTACTCTATGATCACAGTGAGGAGCATTTTGCAGAGCACACAAAAGAGCAGCTGCTGGACTCCAACTACGAGGATGCACGGAGTAGTGTCCGCATACGTGTCATCAGTTGTGCGGTCCGTGGTTGCCGGACTCTGAAACCCCGAGACGGCGGACGACTTCATGGATTGCCCACACGTCGGGATGTGCTTGAGATGTGGCTGTATAACATGCAGCTGGTGTTTTATGAGCACCAAAGATACATGTACAAAATATGTAGCAAACACTTTGAGGCCAGTTGCTTCATGGATACGACACGACGTCTAAAGCCCTGGACTATGCCCACATTGGAGTTGCCGGATCGTGAGCCTGGCGAGGCGCCTGTCTTTCAGAATCCCACAGAAGAGGAGTGGCAGCGAATGAATGAGCTGTTCGCagaggagcagcaacaaatacagcagcaacaagtgaATCAGGAGAATAATGAGGGAGAAAACGACTTGCTTGAGCCAATTGTAAAGATTGAGCATATGGGAAATGAAGATCAACTCTACGAGGAGGAGTTggagcagcatcagcagccgGATGGGGAAGAAGACTTTGAGAACTCACAGCAGCCGCTGGAAGTGCTACTCGAGGTGGGTCATGTGGAAAAGTGTCCCACCTATGAGCAAATGGATTCAGAGGCAAATCTCAGCTATGCCGTCGAGCAGCAGGCCCAGATCAGCAGCTTTGCTCCGTCGTCATCGACGTCGCAATATGGTGGTGCTATTGTCAGCAATGGATTCAAGTACAATGCTCGCCACTGCAGCGTAAGGGGGTGTGATGTGACGGCCAATGACGTGAATGGCAATATCAAGTTGCACAAGTTCCCGACATCGCTGGATGCGATGAAGAAATGGATGCACAACACCCAGGTTGATGTGGACACGAACGTTGCTTGGCGATATCGTATTTGTAGCTATCACTTTACCGATGAATGCTTTAATGGATCACGCATAAGACGTGGTGCTATGCCCACTCTTAGTCTAGGACCACGTCGTCCTCCAAAAATCTACGACAATGAGTTCAATACAACGCTGCAGCCGGAACAGGAGCAAACCAATGAGGTGGCCAGCGAGGAGCAGCTAAACAATGAAGTGGAGTCAAGAGAGACGCACATGAAGGGCGGTGACATCAGTCTGAATCTGCCACAGCCAGCACCGCCCCGCAAGTCCAGCAAATACTGCCAGATCGAAGGTTGTCCCAATCATTTGACAAGCGAGAATCTGACACTACATAAATTCCCGCATTCGGTGGACATGTGCGCCAAGTGGCAGCATAATAC
TCAAGTTCCGTTTGATCCCGACTTCCGTTGGCGCTATCGCATTTGCAGTGCCCACTTTGAGCCCATCTGCCTGATGAATATGCGTCTGATGCATGGCAGCGTGCCCACACTTAATCTGGGGCCACGTGCGCCTCGCCAGCTTTTTGACAGTGACTTTGAGGCGATTAGCATGCGATTGGATAAACAGAAGAGCAGCTCGGAGCAGCATTTGGTGGACAAACACGAGCAATTACAGGTCCACGAACAGGATGAGGAGGAGTTAAGCTTCCTTGTGCCAGAGATGCAACTACATGAAGATGCAGATGCGGAGCAGTCGGACAATCCGTTGACTTACAGTagtcacaacaacagctggaAGGATCTGCGTTTGCCCAGCATTAAGCAGGAAAAGACTATGACTGCGACAAGCTATAATCCAGTCAAGTCTGGCTATGACAAGTGCTCCCTGGTGCATTGCCAGCGGCAGCGTTCTCAGCACGGCGTCCACATTTACAAGTTCCCACGATCGAAGCAACTCCAGCAGCGCTGGATGCACAATTTGAGGATACAATACGATGAGAGGCGTCCTTGGAAAACAATGATATGCAGCGTACACTTTGAACCCAACTGCATTCGACTCCGCAAGCTGCGTCCCTGGGCAGTGCCCACATTGGAACTGGGCGACAATGTGCCGGAAGAGATCTACACAAATGAACAGAGTCGACAGCAGGAGGAGACGGGCAGTGACAATGATGAATTGGAACTGGGCATGAACATGTCCATGGAGGAAGCATTTGAAGACGACgattatgatgatgaagatgatgactTCCTGGCTACAGAGCCATTAGTGAAAAGGGAGCGTCGCTCACGCTTTGATCCATTACCGCCAGGTCAGTTGCCGCCTTGGAAACTCAAATTCTGCTCCTTGCCATACTGTCGTAGTCCACGTGGTGATGGCATCAAGCTTTTCCGGCTGCCCAATAACATCAGTTCCATTCGGAAATGGGAACAGGCAACTGGAATGCGCTTTACGGAATCCCAACGCAATACGAAGCTCATCTGTAGTCGTCACTTTGATCCTCAGCTAATCGGAGTGCGCCGTCTCATGTACAATGCTGTGCCAACACTTCATCTGGGCCCAATGAGTGTAGACAATCAACCAGTGCAACGTCCTGTTGGTCCACGATGCTGTATGCCTGATTGTCAGGAGAGCGCAAAGCTGCATAAGTTTCCCAGTGATCCTATGCTGCTGAATCAATGGTGTCACGCGCTGAATCTATCGGATATTCAGCGTTATCGTGGCAAACACATCTGTGCTGCACATTTGCCTGCCAAAGCGCCGAATTGCATCATATGTGGCGTGGATGATATACAATTGCCGTTACTAAACTTTCCGGAGAATCGCAATCAGCGCGCCAAATGGTGTTACAATCTCAAAATCGAATCCATACCCAAGTGGGATAACTTAAAGCAGATATGCAGCAAACACTTTGAGAGCTACTGTTTCGTTCAGCCGGGTCAACTGCTGCCCGAGGCAGCTCCCACGTTGCATTTAAGGCACGGCGATAGCAACATATTCCTAAACGATGCCATAGATCACAGCAAGATGCTGCGTATTAAGGATGAGCCCTTGGACAGCGAGGACCTGATGctgtaa
- Protein Sequence
- MSQHNNPPHHHHHHYYQQQQQQQQQQQHHHHQQQHQQQQLQHKQIQQQSWYSHVASYPPPHHPHTAAFAAPCKSNNNNNNNIMNAYGAGAGSTHAAYYGSGGVGYNLEGNTVAYAHNQLLQYQQQQQQQQQQQQQQHHQLSQRSYMPHSLMHSSYPYIKSEPLELPDDRQRQQHQHQQQQPQQQHFQNPMAPPPAPANRHSLDASGEMIIKSEPIDEHAYKSNYIDDNTPFADFSKYPEFGDDMLSPKVELTVKDEGYGSQKVPNPLSYPRRKLQSERSSESLPICQRCKEVFFKKQIYLRHVAESSCGIQEYDFKCNICPMSFMSTEELQKHKHLHRADKFFCHKYCGKYFDTIAECESHEYMQHEYDSFVCNMCSVTFATREQLYAHLPQHKFQQRYDCPICRLWYQTALELHEHRLAAPYFCGKYYTGAQSASHQQQQQQHPQHQQQANYKLQDCHMATMEMPTPHHKANTTANALPATAALSSLLQQRQANADGAAMFASTMKNEANVKLERSYSNSTSESGYSLHDSSYNNAYGSDTSLHAGGGAVGGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLHQWLHNLKMFHIPASSYATFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPNCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPSVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPTSTTPTNNQQQLQQQQLQQQHQQHQSHHGHHGNSKYLRHSAASTSSSASSASHYVDPEMSGTYMGMGNSGGSSSGLNVSDSMDVCCVPSCESKRHNNENITFHTIPRRPEQMRKWCHNLKIPEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLHLGHDDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVSLLTKWCANLQRPVPDGTKLFNDAICEVHFEDRCLRNKRLEKWAVPTLILGHENIAYPLPTAEQVAEFYSRPSAPNNGEEQGECCVETCKRNPSVDDIKLYRPPEESQVLAKWAHNLQLDVAQLPNMRICNLHFESHCIGKRMRPWAIPTLNLSTNVENLYENPEHQMLYKRRKHLNPDRGAASHGGAGIVKPTWVPRCCLSHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPMVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTLELNSPPGYKIYQNPAKLKANKLCLQRVCIVESCRRQRGQGVQLFRLPHNPTQLRKWMHNIRMRPRGAMRQQYRMCSIHFETHSFNGKRLSAGAIPTLELGHQDDDIYPNEAQSFVEEHCTVEGCDASKEQPDVRLFRFPTEDEDLLWKWCNNLKMNPVDCVGVRICNKHFETDCIGPKHLYKWALPTLELGHDDAQIELIHNPKPEERYVDPVFKCCVPTCGKTRKFDEVQMNSFPKDPTLFQRWRHNLRLDHLNFKERERYKICNVHFEDICIGKTRLNIGSIPTLELGHEETEDLYQVNPAELQSNLFGRQRRVHESMGITIKQEENSEVDEDIKPDIDMSEASDVNTRQVKIKKSMYDLKCCVPSCGRSRLEHGARLFPFPSGKQQQSKWRHNLRLEPDDVDRTTRVCSAHFNRRCIDGKQLRGWAMPTQQLGHQEQPIYENPKNIPGFFTPTCALAHCRKRRSIDNDLRTYRYPRSEDLLEKWRVNLRLAPDQCRGRICADHFEPMVRGKLKLKTGAVPTLKLGHNEGVVFDNEVIKAGLQQEAEAEEGEASMESLVKIKQEKIDPDDELEDNVGREERDDNDDDEESEQKHEQDADPEDHGYFDPLELVETFAEHHSEHDDDDEDDDDDDDEDEPGYDDELLLPDTPPVQVSLPVPPLRREKPVNNVTPICCL
KHCRKERTAIHPLSTFGFPKDHQQLLKWSANLKLSPADCVGRVCIEHFEPEMLGTRKLKQNAVPTINLGHTTPLSYSCNGQSQGIYDTQPQHSVFRLWSLKHCRKRKLPMEPDPDQAATKRRRCCLPSCGKQPDLHGVQLHRLPSDRILLRKWLYNLKLSPMVDIGQARLCSEHFEPQMETMEGCVPTLRLGHDDTRIYRNRGSISGSISASSSGCMVASCPCARLNLYRCYDLPANRLVQKVWLEWLQLPMPQLASDGKLCVMHYMQLYEQVPLPQELPEPVLRQLQEAYDSISGSSMAMKLRCAVPGCYSKYTDNIRLTKLPMCPDTCAKWVHNTKITYDPTRHYIYRICMLHFEARCLGPVRPKQWAVPTLQLNHNDPDIYLNPKATETLPTPMSISTPVPVSISTPVPVSLSTSVPMSVPVELPLRIKTELAFSGSPSASASPSPRGKLRICCIPSCAQQATSQSRLFRFPTAETALLKWLVNTQQQPRLADPQHLFVCQDHFEAEAICKNQLRSWAVPTLKLGHDGHVIPNARHNGNIADSQENKQALQFIWENYCSVLSCFQPRSEDLRLYAYPTDRPTIRKWAANCKHRSMQASSDGFQVCQSHFAPHCFDPDTGELRENAVPTLELSRCINEVRCVVSGCVKDEDGPRQRYYKMPKRSSQLNIWCHNLCLDTVAMSSSEHHVCDRHFEMQCFNQQKLLRPGARPTLHLGHDEQIDLMPNPADWATDGSDAMAMTTVCCVPNCGHSRDEDDVQLFAFPKMRVLAEKWLQNIRLEVGREQLAKMRICGAHFEHSCLENGRPQLGAMPTLELGHEERHKIHRSADPTVGKVKKYCNRSGSSYDCCYPQCVELQKTYLRISYDLPQGIALRQQWLDYMAVEETEEKPLKLCPMHLILLYDHSEEHFAEHTKEQLLDSNYEDARSSVRIRVISCAVRGCRTLKPRDGGRLHGLPTRRDVLEMWLYNMQLVFYEHQRYMYKICSKHFEASCFMDTTRRLKPWTMPTLELPDREPGEAPVFQNPTEEEWQRMNELFAEEQQQIQQQQVNQENNEGENDLLEPIVKIEHMGNEDQLYEEELEQHQQPDGEEDFENSQQPLEVLLEVGHVEKCPTYEQMDSEANLSYAVEQQAQISSFAPSSSTSQYGGAIVSNGFKYNARHCSVRGCDVTANDVNGNIKLHKFPTSLDAMKKWMHNTQVDVDTNVAWRYRICSYHFTDECFNGSRIRRGAMPTLSLGPRRPPKIYDNEFNTTLQPEQEQTNEVASEEQLNNEVESRETHMKGGDISLNLPQPAPPRKSSKYCQIEGCPNHLTSENLTLHKFPHSVDMCAKWQHNTQVPFDPDFRWRYRICSAHFEPICLMNMRLMHGSVPTLNLGPRAPRQLFDSDFEAISMRLDKQKSSSEQHLVDKHEQLQVHEQDEEELSFLVPEMQLHEDADAEQSDNPLTYSSHNNSWKDLRLPSIKQEKTMTATSYNPVKSGYDKCSLVHCQRQRSQHGVHIYKFPRSKQLQQRWMHNLRIQYDERRPWKTMICSVHFEPNCIRLRKLRPWAVPTLELGDNVPEEIYTNEQSRQQEETGSDNDELELGMNMSMEEAFEDDDYDDEDDDFLATEPLVKRERRSRFDPLPPGQLPPWKLKFCSLPYCRSPRGDGIKLFRLPNNISSIRKWEQATGMRFTESQRNTKLICSRHFDPQLIGVRRLMYNAVPTLHLGPMSVDNQPVQRPVGPRCCMPDCQESAKLHKFPSDPMLLNQWCHALNLSDIQRYRGKHICAAHLPAKAPNCIICGVDDIQLPLLNFPENRNQRAKWCYNLKIESIPKWDNLKQICSKHFESYCFVQPGQLLPEAAPTLHLRHGDSNIFLNDAIDHSKMLRIKDEPLDSEDLML
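The coding and protein sequences above can be cross-checked by translating the former. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1), which the record does not state. It also treats the lowercase stretches of the coding sequence as ordinary bases. The name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEE"
    "GGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; '*' marks a stop, 'X' an unknown codon."""
    sequence = "".join(cds.split()).upper()  # drop line breaks, ignore case
    if len(sequence) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(sequence[i:i + 3], "X")
        for i in range(0, len(sequence), 3)
    )


# First and last codons of the coding sequence in this record.
print(translate("ATGTCACAACACAACAACCCC"))
print(translate("GACCTGATGctgtaa"))
```

Joining all wrapped lines of the coding sequence before calling `translate` and stripping the trailing `*` would give a string to compare against the protein sequence.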
Similar Transcription Factors
Sequences clustered by similarity using MMseqs2
- 100% Identity
- iTF_00601834;
- 90% Identity
- -
- 80% Identity
- -