Dmul002324.1
Basic Information
- Insect
- Drosophila mulleri
- Gene Symbol
- -
- Assembly
- GCA_035047565.1
- Location
- JAWNPK010000012.1:21914442-21928963[+]
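The Location field packs contig, coordinates, and strand into one string. A minimal sketch of splitting it apart is given below; the names `Location`, `parse_location`, and `span_length` are hypothetical helpers, not part of any database API, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "contig:start-end[strand]".
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


def span_length(loc: Location) -> int:
    """Genomic span in bp, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("JAWNPK010000012.1:21914442-21928963[+]")
print(loc, span_length(loc))
```

If the coordinates turn out to be 0-based or half-open, only `span_length` needs to change.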
Transcription Factor Domain
- TF Family
- THAP
- Domain
- THAP domain
- PFAM
- PF05485
- TF Group
- Zinc-Coordinating Group
- Description
- The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
- Hmmscan Out
-
 #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
 1  28   5.1e-15     1e-11   45.6   4.3         1      86       574     646       574     647  0.85
 2  28   3.2e-15   6.2e-12   46.2   4.6         1      87       674     743       674     743  0.83
 3  28   7.7e-16   1.5e-12   48.2   0.4         1      87       765     837       765     837  0.85
 4  28   6.9e-16   1.3e-12   48.4   5.6         1      87       943    1013       943    1013  0.83
 5  28   7.5e-15   1.5e-11   45.0   3.0         1      86      1037    1108      1037    1109  0.82
 6  28   2.3e-12   4.4e-09   37.1   0.6         1      87      1144    1212      1144    1212  0.80
 7  28   9.2e-11   1.8e-07   31.9   1.4         1      86      1257    1326      1257    1327  0.76
 8  28   3.8e-17   7.4e-14   52.4   0.2         1      86      1354    1423      1354    1424  0.83
 9  28   8.3e-14   1.6e-10   41.7   0.6         1      86      1445    1514      1445    1515  0.81
10  28   7.2e-15   1.4e-11   45.1   1.2         1      86      1542    1613      1542    1614  0.85
11  28     1e-13     2e-10   41.4   2.4         1      86      1665    1735      1665    1736  0.83
12  28   9.9e-13   1.9e-09   38.2   0.1         1      86      1759    1827      1759    1828  0.81
13  28   3.2e-14   6.2e-11   43.0   0.7         1      87      1983    2052      1983    2052  0.80
14  28   4.5e-11   8.7e-08   32.9   0.2         1      62      2199    2260      2199    2274  0.78
15  28   1.9e-05     0.037   14.9   0.0         1      58      2280    2330      2280    2346  0.81
16  28     3e-12   5.8e-09   36.7   0.2         1      87      2369    2439      2369    2439  0.83
17  28   1.1e-05     0.021   15.7   0.2         1      42      2500    2537      2500    2547  0.81
18  28   5.5e-13   1.1e-09   39.1   0.4         1      86      2586    2657      2586    2658  0.80
19  28   1.3e-11   2.6e-08   34.6   1.1         1      87      2668    2740      2668    2740  0.81
20  28   1.5e-14   2.9e-11   44.1   0.0         1      87      2763    2834      2763    2834  0.83
21  28    0.0035       6.8    7.6   0.0         1      58      2865    2916      2865    2936  0.78
22  28   6.3e-14   1.2e-10   42.1   0.8         1      86      2951    3023      2951    3024  0.80
23  28   3.3e-13   6.4e-10   39.8   0.2         1      86      3192    3264      3192    3265  0.81
24  28   2.8e-14   5.5e-11   43.2   1.2         1      86      3323    3393      3323    3394  0.82
25  28   1.6e-14   3.1e-11   44.0   4.0         1      86      3511    3581      3511    3582  0.83
26  28   2.8e-12   5.4e-09   36.8   0.2         1      86      3666    3735      3666    3736  0.85
27  28   9.8e-09   1.9e-05   25.4   0.7         1      58      3756    3804      3756    3814  0.86
28  28   2.9e-10   5.5e-07   30.4   0.6        19      87      3822    3879      3810    3879  0.75
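Each hit in the Hmmscan output has 13 whitespace-separated fields: hit number, total hits, c-Evalue, i-Evalue, score, bias, then the from/to pairs for hmm, ali, and env coordinates, and finally acc. A minimal sketch of reading a few of these rows and filtering on i-Evalue follows. The `DomainHit` type, the `parse_hit` helper, and the 0.01 cutoff are illustrative assumptions, not values used by this database. The three sample rows are hits 1, 15, and 21 copied from the table.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    index: int
    c_evalue: float
    i_evalue: float
    score: float
    ali_from: int
    ali_to: int


def parse_hit(row: str) -> DomainHit:
    """Parse one per-domain row: # of c-Evalue i-Evalue score bias
    hmm_from hmm_to ali_from ali_to env_from env_to acc."""
    f = row.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}")
    return DomainHit(
        int(f[0]), float(f[2]), float(f[3]), float(f[4]), int(f[8]), int(f[9])
    )


ROWS = [
    "1 28 5.1e-15 1e-11 45.6 4.3 1 86 574 646 574 647 0.85",
    "15 28 1.9e-05 0.037 14.9 0.0 1 58 2280 2330 2280 2346 0.81",
    "21 28 0.0035 6.8 7.6 0.0 1 58 2865 2916 2865 2936 0.78",
]

I_EVALUE_CUTOFF = 0.01  # hypothetical threshold, chosen only for illustration

hits = [parse_hit(r) for r in ROWS]
strong = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]

for h in strong:
    # Aligned length in residues, treating ali coordinates as inclusive.
    print(h.index, h.i_evalue, h.ali_to - h.ali_from + 1)
```

With this cutoff only hit 1 of the three sample rows is kept; hits 15 and 21 have i-Evalues of 0.037 and 6.8.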
Sequence Information
- Coding Sequence
- ATGTCACAACACAATAATCAACCGCAttcgcatcagcatcaacaacatctCCACtactaccagcagcagcagcagcaacatcagctacagcaacaacagcaacagcaccagcagcatcagcatttacaacataaacaaatacagcagcagcacagttGGTACTCACATGTTGCTTCCTACCCGCCCCAACAGCCGCACCCGTATGCGACACCCTGCAagagcgccaacaacaacaatcacaacaacaacattatgAATGCATACGGAGCGGGTGGGACGCACTATTACGGGGCTGCtacagctactgctgctgcggggGTGGGATATAACCTTGAAGCTAATACTGTGGCCTATGCGCACAACCAGCTGCTGCAATaccagcaagaacaacaacaacaacaacaacagcaacagcagcagcaactgcaccaCAGTCAACGCTCGTATATGCCGCACAGTATAATGCACAACTCCTATCCATATATTAAGAGCGAGCCATTGGAGCTGCCCGATGAAAGACAACGccatcaacaacatcaaacgcagcaacagcaacagcagcagcaccaccacctacaacattttcaaaatcCAATGGCACCGCCGCCAGCACCCGCCTCCAATCGCCACACGCTTGATGCCAGCGGcgaaatgataataaaatCGGAACCCATTGACGAACATGCGTTCAAGTCCAACTATATTGATGATAATACGCCCTTTGCCGATTTTAGCAAATTTCCCGAATTCGACGACGAAATGTTGAGTCCAAAGGTGGAGCTAACTGTAAAGGATGATGCTTATGGCAACCAAAAGAACCCGCTCAGCTTTCCTCGCCGTAAGCTGCAAACGGAACGTTCCTCGGAGAGCCTGCCCATCTGCCAACGTTGCAAGGAGGTGTTCTTTAAGAAACAAATCTATTTGCGACATGTGGCCGAGAGCAGTTGTACGATCCAAGAGTACGACTACAAGTGCAACATCTGTCCGATGTCCTTCATGAGTGCCGAGGAGCTACAGAAGCACAAGCAACTGCACCGCGCTGACAAGTTCTTCTGCCACAAGTACTGTGGCAAGTACTTTGACAACATTGCCGATTGCGAGTCGCACGAGTACATGCAACACGACTACGATAGCTTTGTGTGCAACATGTGCTCTCTGGCATTTCCTACGCGTGAACAGCTGTACGCCCATTTGCCGCAACACAAGTTTCAGCAGCGCTTTGATTGCCCGATTTGTCGCTTATGGTATCAAACAGCGGTGGAGTTGCACGAGCATCGCCTAGCGGCCCCTTACTTTTGCGGCAAGTACTATGCTGCAgcacatcagcaacaacagcaacagcaacagcagcaccagcagatGTCGGAGCAGAGCAACTATAAACTGCAGGATTGCCACATGGGCACGATAGAGATGACTGCGTCTCAGCATAAAGCAAATGCGTTGCCTGCCACGGCGGCGCTTAGTTCCTTGCTACAGCAGCGGCAAGCGAATGCTGATGGTTGCGCCTCCCTCTTTTCCTCTTCCCTAAAGAACGAGCCGAGTGTTAAGCTGGAGCGAAGCTACAGCAACTCGACCAGCGAATCCGGATATAGCATGCACGATAACAGCGGCTACAACAATGCATATGGAAGCGACAACTCGCTGCATGCGAGCGGCGTTGCAATCGGTGGACCGCAGGCACATTCCTCCACGCTGGATGAATCGGAAGATGCGCTGTGCTGCGTGCCATTGTGTGGAGTGCGCAAAAGCACCAGCCCCACTCTGCAGTTCTTCACGTTTCCCAAGGACGAGAAGTATCTGCATCAATGGCTGCACAATCTCAAAATGTTTCACATACCTGCGTCAAGCTATGCCAGCTTTCGCATTTGTAGCATGCATTTCCCGAAGCGTTGCATAAATCGCTACTCGCTGTGCTATTGGGCAGTGCCCACGTTCAACCTGGGCCACGACGATGTCGCCAATTTGTATCAGAATAGAGAGTTGACTAACACTTTC
ACCACCGGTGAGGTGGCTCGCTGCAGCATGCCCAATTGCACGAGCCAGCGCGGCGAAAGTAATCTCAAGTTTTACAATTTTCCGAAGGACATCAAGAGTCTGATCAAATGGTGCCAGAACGCACGTCTGCCCGTCCAGGCCAAAGAGCCGCGTCACTTCTGCAGTCGGCACTTCGAGGAGCGTTGCATTGGCAAATTCCGACTGAAACCCTGGGCAGTACCTACGCTCCATTTGGGTGCGCAGTATGGCAAGATTCATGACAATCCGAAGAATCTGTATGTTGAGGAGAAGCGTTGCTGCCTGAACTTTTGTCGCCGCAGTCGCTCCTCGGACTTTAACATGTCGCTTTACCGGTTCCCCCGAGATGAAGTATTGCTCCGTCGCTGGTGCTATAACCTAAGACTCGATCCGGCAGTTTATCGTggaaaaaatcacaaaatatGCAGCGCTCACTTCATTAAGGAAGCCCTCGGCTTACGTAAATTGTCGCCGGGCGCTGTGCCGACTCTGCACCTGGGCCACAATGACACCTTCAACATATACGAGAATGAACTGTGGCCTCCTCCTACGGCTTCGACGCCCACCCATCacaatcatcatcagcaacagcagctgcagcagcaccacatgcagcagcatcaacagcagctgccgcagcagccgaTTCATCACAAATATCAGAGGCACTCTGCAGCGTCCACTTCCTCGTCGGCCAGCTCTACATCCCACTACGTAGATCCAGAGCTAAGTGCCTCCtacatgggcatgggcatgggatCTGGCAACTCATCCTCTGGGCTGCTGAATGTGAGCGACAGCATGGACGTCTGCTGTGTGCCCAGCTGCGAAAGCAAGCGACACAACAATGAGAACATCACATTCCATACGATACCTAGGCGCCCGGAGCAGATGCGCAAGTGGTGTCACAATCTGCGTATACCCGAGGACAAGATGCACAAGGGCATGCGCATATGCAGCCTGCATTTCGAGCCCTATTGCATTGGCGGCTGCATGCGTCCATTTGCGGTGCCAACGCTGCATTTGGGCCACGACGAAGAGGACATTCATCGCAATCCGGATGTAATCAAGAAGCTGAACATTCGCGAGACCTGTTGCGTGGCAGTGTGCAAACGCAATCGGGATCGCGATCACGCCAATCTGCATCGTTTTCCCAGCAATGTGGCGCTGCTAACCAAATGGTGCGCCAATTTGCAGCGCACCGTTCCGGATGGGATCAAGCTTTTCAACGATGCTATCTGCGAGGTGCATTTCGAGGAGCGCTGCTTACGTAACAAGCGGCTGGAGAAGTGGGCAGTGCCCACTCTTATGCTCGGCCACGATAACATTCCCTATCCCCTTCCCACTCCGGAGCAGGTGGCCGAGTTCTATGCACGACCAAGTGCTCCAAACAATGGCGAAGAGCAGGGTGAATGCTGTGTGGAGACCTGTAAGCGCAATCCCAGCGTGGACGACATCAAGCTCTATCGACCGCCAGAGGAGTCCCAAGTGCTGACCAAATGGGCGCATAATCTGCAGATCGACATCGCTCAGCTGTCTGGCATGCGCATCTGCAATCTACACTTCGAGGCCCACTGCATAGGCAAGCGAATGCGACCGTGGGCCATACCCACGCTCAATCTTGCCACCAACGTAGAGAACCTCTTTGAGAATCCCGAGCAGCAGATGCTGTACAAGCGCCGCACCTCCCATCTCAGCGTGCAAAGAGCAGGAAGTGACGGAAGCATCACCAAACCCACCTGGGTGCCACGCTGCTGTCTACCCCACTGCCGCAAGGTGCGCGCTCTGCACAACGTTCAGTTATATCGGTTTCCCAAGCTCAATCGCACCACGTTGGCAAAGTGGGCGCACAATCTGCAAGTTCCGATGGTTGGAAGTGCTCAGAGGCGTCTCTGCTCCGCGCACTTTGAGCCGCACGTCCTTAGCAAGAAGTGCCCAGTGCCGCTTGCGGTACCGACGCTCGATCTCAATACTCCACCCGG
CTACAAGATTTACCAGAACCCAGCCAAGATCAAAGCAAACAAGCTGTGCATGCAGCGCGTCTGCATTGTGGAGAGCTGTCGTCGGCAGCGTGGCCAGGGAGTCCAGCTCTTCCGGCTGCCCCACAACCCCACCCAGCTGCGCAAGTGGATGCACAACATACGCATGCGACCGCGCGGTGCCATGCGGCAACAGTACCGCATCTGCTCGCGGCACTTTGAGACGCACTCGTTCAATGGCAAGCGCTTGAGCACTGGCGCCATTCCGACTCTCGAGCTGGGccatgacgacgacgacatcTATCCGAATGAGGCGCAGTCGTTTGTCGAGGAGCACTGCGCTATCGAGGGCTGCGACGCCTCCAAGGATCAGCCGGATGTGCGGCTCTTCCGGTTCCCCAATGACGACGAGGATCTGCTTTGGAAGTGGTGCAACAATCTCAAGATGAATCCAATTGATTGCATTGGCATGCGCATCTGCAACAGGCACTTCGAGCCCGATTGCATCGGCTCCAAGCACCTCTACAAGTGGGCCATTCCCACGCTGATGCTTGGGCACGACGATGTCCAGATTGAGCTGATATCGAATCCGAAGCCGGAGGCTCGCTACGTGGACCCAGTGTTTAAGTGCTGCGTTCCCACCTGTGGCAAGACTCGCAAATTCGATGAGGTGCAGATGAACAGCTTCCCCAAAGATCCGGCCCTCTTCGAGCGCTGGCGCCACAATCTTCGACTCGACCATCTGAATTTCAAGGAGCGCGAGCGGTATAAGATTTGTAATGCTCATTTCGAAGACATATGCATTGGCAAGACGCGCCTCAACCTAGGCTCCATACCCACACTGGAGTTGGGCCACGACGAGACGGAAGACCTGTACCGCGTGAATCCCGATGAGCTGCAGAGCAATATGTTTGGACGACACCAGCGCGTGGGGACTAAGCAGGTGGAGCATTTTACTAGACAGGTCAAGTTCAAGCGATCCACGCCAGAGCTCACGTGCTGTGTTCGCAGCTGTCCACGCAACAAATGGGTAGAAAGTGGAGCCCGCTTGTTTCCCTTTCCCACGggcaagcaacagcagaagaagTGGCGTCATAATCTCCGACTCAAAGTTGGTGAAGTGGATAGGACGACGCGTGTTTGCAGCGCCCATTTCAATCGCCGCTGCATTGATGGCAAACAGTTGAGGGGATGGGCcatgcccacactgcagctgGGCCACCAGGAGCCGAACATCTATGAGAATCCAAAGAACATCCCGGGCTTCTTTACTCCCACCTGTGCGTTGGCCCATTGCCGCAAAAGGCGGAGCATTGACAACGATTTGCGCACCTATCGCTATCCGCGCAACGAGGAGTTGCTTGAGAAGTGGCGTGTGAATCTCCGGCTCGCACCAGATCAATGTCGCGGTCGCATATGTGCGGATCACTTTGAGCCAATGGTGCGTggaaagctaaagctaaagacGGGGGCAGTGCCTACGCTGAAGCTGGGCCACGATGAGGGCGTAGTCTTTGACAACGAGGCTATTAAGGCAGGTCTGCAgcacgaggaggaggaggagcatgATGGTGGAGAGGCGACGAACAGCTTTGACTCGCTTGTTAAGGTGAAGAATGAGCTGAATGAAGAGGATCAGTCAATGGCTAACGACGAGCAGACAGAAGAACcggatgatgatgacgaggGGCAGACGAATCATCAGTCAGATGAAGAGGAGCAGCAAGAGCACGGCTACTTTGATCCCTTGGAGCTTGTGGAAACTTTTGCAGAGCAGCACAGCGAAGACGAACAGCTCTACAATAACGAAGATGACGACGTCGATGACAATATGGAAGAGGAGGATATACCTGGCAACGATGATGAGCTGCTGCTCCCCGATACTCTGCCTATTCAACTGGCTCTTCCTCCGAGACGTGAGAAGAGAACCGTGAATAATGTGACGCCCATATGCTGCCTCAAGCACTGCCGTAAGGAGCGCACTCCTACCCATCAGCTAAGCA
CCTTCGGCTTTCCCAAGGATCCGCAGCAACTGCTCAAATGGAGCGCAAACCTGCAAATAGATCTAGCCGATTGTGTTGGGCGTGTTTGCATTGAGCACTTCGAGGCGGAGATGCTAGGCACGCGCAAGCTGAAGCAAAACGCGGTGCCCACCTTGAATTTGGGCCATGACACACCCCTTAGCTATCGTTGCAATGGACGAGCTGCGGACATCCATGATGCGCAATCACAGCAGCATTCGGTTTTTCGGCTTTGGAGCCTGAAACACTGTCGCAAAAGGAAAGACTCGCTGACAACGACAATGACGCCAGCGACGCCAACGACGCCAACGACGCCAGCGACGCCAACGACGTCAGCGACGACAACGATGGCAACGAcgccagcgacaacaacgacgCTAACACcgccagcagcgacaacgaTGGCAACGAcgccagcgacaacaacgacaacaacgacactAACACcgccagcagcgacaacgacaacaacgctaatgacgacgacgacggcaatGACGGAAACATCGTCaatgacgacaacgacgacgacgacgacgacgacagcgaaTCTAGCAGTGATGACGCCTAAGCGAGCGTGCTGCGTGCCCAGTTGTGGAAAGGAATGGCAGCTGCATGCCGCTAACCTGCGTCGCATGCCAAGAGATCGTTTGTTGATGCGCAAGTGGCTGCACAATTTAAGGCTTCCTGCCAGCTTGAACCTTAGCCGGACGTATGTGTGCAAGGATCATTTCGAGGATCAGGCAGTGTTGCCCACCTTGCGTCTAGGCCATACGGATAGTAACATTTATCGTAGCCACAACTCAATCGGTGGCATTGGCTGCCTGGTGCCCGGCTGCCCATTTGCCCGTCTCCATCGTGGCTACGCCCTGCCCGAGCATCCCCAAGTGCGGGAGGCGTGGTTGCAGCGGCTACAACTACCGCCTCGTTCTCCCCAATCGTCGAGCCACGACCAGCTGTGCGTCATGCACTATATGCAGCTGTTCGAGCAGGTGTCGTTGCCGGAGCAGTTGCCTGAGATAGTGCGGTGCGAGCTGAGGGAGACGTATGATCTCATCGCCACTTCCTCCGTTGCCTTGGGGCTGCGCTGCTCAGTGCCCGGCTGCCAAAGCAAATTCGTGGACAACGTGCATCTCACCAGGCTTCCAACATGCCAACGTACGCGGGCCAAATGGCTGCACAATACGAAAATTGACTACGATCCGGACCTCGAGCGCGTCTATCGAATCTGCTTGCGGCACTTCGAGCCGCATACCCTGGGTGCAGTGCGCCCTAGGCTGGGAGCGGTGCCTACACTGCACCTCAACCACAACGACCCGGATATTTATCAGAATCCCAAGCTAGAAGACAACGCTGAGGGCGCACTAACCCAGCCTCTTCAAATGCCGCTACGAATCAAAACGGAGCTACCGCTTTACCTCTCTGTCAGCCCGACGGCCAGCGCGAGCGCAAGTCCAAGTCCTCGGGGTAAACTGCGCACCTGTTGCATTGCCAGCTGTGGACAGCAGGAGAATGCGCTCACCCATCTGTTTCGCTTTCCCACCATCGAGACTGCGCTTCGCAAGTGGCTGGTgaacacgcagcagcagccacgtcTCGTGGATACACAGCATCTCTGGGCGGTGCCCACGCTGAAGCTGGGTCACGATGGTTATATCATACCCAATGCTCGACACAATGGCAACCTTGCTGACAGCAATGAGGATAGCCAGGCCCTGCAGTACATCTATGCGAACTTCTGCTCCGTTCTCACCTGCTTCCAGCAGCGCAGCGACCAGCTGCGCCTCTTTGCCTACCCGACGGATCGACCCACCCTGCGCCGGTGGGCTGCCAACTGCAAGCATCGCTCCATGCAGGCCAGCAGCGACGGATTTCAGGTGTGCCAGTCTCATTTCGCGGACGACTGCTTCGACCCTGAGACTGGGCAACTGAAGGGGGATGCTGTGCCCACTCTTGAACTGAGCCGACCCGTCTACGAGATG
CGTTGCTTAGTCAATGGCTGCGTTAATGAGGAGGGTGCAAGGCGCCCTCGCTTATTTAAAATGCCAAAACGGACATCGCAGATGGAGGATTGGTGTCACAACTTGCGCATCGATGCGGCGTCTTTAGTGGGCCAGGAGCCACACGTCTGTGAGCGGCACTTCGAGGCGCACTGCTTCAATGCGGACAAGCTGCTACGCCAGGCAGCGCGGCCCACACTTCATTTGGGCCACGATGACCTGCTAGATGTGCTGCCCAATCCGGCTAGCTGGGAGGAGGAAGCCCACGTATGCGTCGTGCCCAGCTGTGGACGCTCCAAGGATGCGGATAACGTGCAGTTATTCGGGCTGCCAAAGGCTAAGTTTCTGCTAGAGAAATGGCTGCAGAATTTCCGCCTCGATCCTAGCAAGGTGCAAGTGAAGGGCCGTCGCATATGCAGCGTGCACTTCGAGCCCAGCTGCATAGACGGTGACCGTTTACATATGGGCGCCATGCCTACACTCCACCTGGGTCACGATGAGACAGACAATATACACGAAAGCATCAAAGAGCTGCCACTGACGGCTCGGCACGGGCGTGTGCGTATCCACTGCGACTGTTGCTATCCAAATTGCGTGGAACTGCAAAACGGCTTTCAAAAAGTTAACTACGACTTGCCACAGGGGCTGCCACTGCGTGATATGTGGCTTTCGTATCTGGGATTGGAGGAATtcgatcagcagcagccgctcaaGCTCTGCCCGCTGCACTTGATCATGATGTACGACGAGTTCCCTGAGCAATCggcggaggaggagctgcTTACATCCAACTACGAAGCTGCTCACAATAGCGTGCGCATTCGCAACGCCAGCTGTGCTGTGCGTGGCTGCAGAACTCTAAAGCCTAGAGACGAGTATTGCCTGCACTTGCCACCCACGCGCCGTGATGTGCTCAAGCTGTGGCTGGACAACATGCAGATGGCTTACAATGAGAATGAGCGCTACAAGTTTCGGGTGTGCGGCAGGCACTTCGAGTCCAATTGCATAGTGGAGAAGGCGCGACGACTAAAGCCCTGGAGCGTGCCTACGCTGGAGCTGCCAGTGCCACCAGTGCATCAGAATCCCACGGAGGAGCAGTGGCAACGCATGAATGAGCAACTGTCCGCACTGCAGGCATTGGAAGCCAGTGTTGAGCTGAACGATGTgcagaaacagaagcaaatacgcaaaataaaatacgcgATGCTGAAGCAGGTCATACAGGAGCAGGAGAACGACCAGGGACAGGAAgacgagcaggagcaggagcaggaggacgagcaggagcaggaggagtcGCAAATCAAACTGGAACCCCAGGTCAAGCAGGAGCCTGTTTACGAAGATTACTATtatgagcagcagccgcaagaAATGCAAGAGCTGGAGGTGCTACTCGAGGTGGGGCATGTAGAGAAGTGCACCACCTATGAGCAGATGGACACAGTGCCCTCCATAAGCTATGCCGAGCAGCCGTCCCATAATTCTTCATTTCCGTCACATTCGCTGGGTAGCGGATCAATAAACGGCAGTAACTCTCGCTACAGCGCGAGGAATTGCAGCGTGCGAGGTTGCGGTGTGACGGTGCTGGATGTGGACGGCAATCTGAAGTTGCACAAATTTCCCACATCGATAGATGCTATGAAGAAGTGGATGCACAACACCCAGGTGGATGTAGACATCAACTTCGCGTGGCGCTTTCGCATTTGCAGCCATCATTTTGTCGCCGAGTGCTTCAATGGATCACGCATTAGACGTGGAGCCATGCCCACGTTGCGTTTGGGCTCCCGACGTCCCGCACGTATCTACGATAACGAATTCAATGTTCAGCAGGAACAGACGACAGCGAATACAGTGGAAGAGCAGGCTCTACTGCCTGCTGGGGACTCCAAGTTGGACATAAAAATGCGCTTACCCTGTCCCGCGCCGCCTCGCAAATCCAGCAAATACTGTCAGATCGACGGCTGCTCGAATCATCTGAC
CAGTGAAAATTTGACTCTACACAAGTTCCCACACTCTGCGGACATGTGCGCCAAGTGGCAGCACAACACGCAGGTGCCCTTCGATCCAGAGTATCGTTGGCGCTATCGCATCTGCAGCGCTCACTTTGAACCCATCTGCCTGGGCAACATGCGTCTGATACAAGGCAGTGTGCCCACCCTTAAGCTGGGCCCGCGCGCTCCGAAGCAGCTCTTTGACAATGACTTTTCGGTCGCTGGCTTGCGCTCGGATAAGGATATGCATTGCGAACAAAATTTCACTGTAAAGCATGAGCAAGCGCAGCCATTCGAGGAGGATGATTCGGTGCAGgcacaggagcaggagcaggaggcggaacAGGAACAGGACTTCAGTATGCTAGAGCCAGAACTCCAATTGCATGAAGGTCAGGAAGAGGAGGAACATGAAGACAATTATTTGAACTGCAATAACAATTGGAATGAGCGGAGCATTAAGCAGGAGAAGAGCAGCGATAGCAGCTACTATAATCCGGTCAAGTCGGGCTATGACAAGTGCTCCCTAATCCATTGTCAGCGACAGCGCTCCCAGCATGGCGTGCACATCTATAAATTTCCACGTTCCCGCCAGCTTCAGCAACACTGGATGCACAATTTGCGTATTCGGTACGACGAACGGCGTCCATGGAAGACCATGATATGCAGCGCGCACTTTGAGACTCGCTGCATACGACTAAGGAAGCTCCGTCCCTGGGCAGTGCCTACTTTGGAGCTGGGCGACAATGTGCCTGAGCAGCTGTACACGAACGagcaaagccagcagcagatGGAGGTGGGCAGCGAGGGTGAGGCTGACGGCTATGATTACGATGTGGAGGATACTATGTTGGAGGACTACAACGATGATTACGAAGACAATGGGTCGGAGCAGCTGTCTGCGGAGCCACAGATCAAGCGAGAACGTCATTCACGCTGCGATCCCTGCCCTCCTGGCCAGTTGCAGCCATGGAAAATAAAGCAATGCTGTTTGCCCCACTGCCGCAAGCCACGAGGCGACGGTATCAAACTCTTCCGACTGCCCAACAACATTGGCGCCATACGCAAATGGGAGCAGGCGACAGGCATGCGCTTCTTTCCATCGCAGCGCAACACTAAGCTCATCTGTAGCCGTCATTTTGATCCGCAACTAATTGGCGTTCGTCGCCTTTTTACCAATGCCGTGCCGACAAGGAATCTGGGTCCAAATATTAAAGAGCCAGAGCTGCCTGAGTTTCCGGAAAGTACTCCACGCTGCTGCATCACGGACTGCGAGCACAATGGAACAGTCAAGTTGCACAAGTTTCCAAGTGATCCTTTGCTGCTTCATCAATGGTGCCAAGCCCTGGATTTGCCGGAGGTACAACGCTACTCTGGTAAACACATTTGTGCAGTTCACCTACCTGCGAAGGCTTTCAGTTGCATCATCTGTGGCGTCGAGGATGTGCAGCTGCCCATGCAAGACTTCCCCGAGAACCGCAATCAGCGTGTCAAATGGTGCTATAATCTTCAAATCGAACCCATTGCCAAGTGGGACAACACGAAACACATCTGCTCTAAACACTTTGAGAGTTATTGCTTTATAGAGCCGGGTCGTTTGCTCCCGGACGCAATGCCTACATTGCATTTGACGCACAATGATAACAACATATTTCTCAACGAGTATGCGATGAACCAGCCTAAAATGCTCTGCATCAAGGATGAGCCCCTGGAGAGTGACGATCTGATGCTATAA
- Protein Sequence
- MSQHNNQPHSHQHQQHLHYYQQQQQQHQLQQQQQQHQQHQHLQHKQIQQQHSWYSHVASYPPQQPHPYATPCKSANNNNHNNNIMNAYGAGGTHYYGAATATAAAGVGYNLEANTVAYAHNQLLQYQQEQQQQQQQQQQQQLHHSQRSYMPHSIMHNSYPYIKSEPLELPDERQRHQQHQTQQQQQQQHHHLQHFQNPMAPPPAPASNRHTLDASGEMIIKSEPIDEHAFKSNYIDDNTPFADFSKFPEFDDEMLSPKVELTVKDDAYGNQKNPLSFPRRKLQTERSSESLPICQRCKEVFFKKQIYLRHVAESSCTIQEYDYKCNICPMSFMSAEELQKHKQLHRADKFFCHKYCGKYFDNIADCESHEYMQHDYDSFVCNMCSLAFPTREQLYAHLPQHKFQQRFDCPICRLWYQTAVELHEHRLAAPYFCGKYYAAAHQQQQQQQQQHQQMSEQSNYKLQDCHMGTIEMTASQHKANALPATAALSSLLQQRQANADGCASLFSSSLKNEPSVKLERSYSNSTSESGYSMHDNSGYNNAYGSDNSLHASGVAIGGPQAHSSTLDESEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLHQWLHNLKMFHIPASSYASFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPNCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPAVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPTASTPTHHNHHQQQQLQQHHMQQHQQQLPQQPIHHKYQRHSAASTSSSASSTSHYVDPELSASYMGMGMGSGNSSSGLLNVSDSMDVCCVPSCESKRHNNENITFHTIPRRPEQMRKWCHNLRIPEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLHLGHDEEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVALLTKWCANLQRTVPDGIKLFNDAICEVHFEERCLRNKRLEKWAVPTLMLGHDNIPYPLPTPEQVAEFYARPSAPNNGEEQGECCVETCKRNPSVDDIKLYRPPEESQVLTKWAHNLQIDIAQLSGMRICNLHFEAHCIGKRMRPWAIPTLNLATNVENLFENPEQQMLYKRRTSHLSVQRAGSDGSITKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRTTLAKWAHNLQVPMVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTLDLNTPPGYKIYQNPAKIKANKLCMQRVCIVESCRRQRGQGVQLFRLPHNPTQLRKWMHNIRMRPRGAMRQQYRICSRHFETHSFNGKRLSTGAIPTLELGHDDDDIYPNEAQSFVEEHCAIEGCDASKDQPDVRLFRFPNDDEDLLWKWCNNLKMNPIDCIGMRICNRHFEPDCIGSKHLYKWAIPTLMLGHDDVQIELISNPKPEARYVDPVFKCCVPTCGKTRKFDEVQMNSFPKDPALFERWRHNLRLDHLNFKERERYKICNAHFEDICIGKTRLNLGSIPTLELGHDETEDLYRVNPDELQSNMFGRHQRVGTKQVEHFTRQVKFKRSTPELTCCVRSCPRNKWVESGARLFPFPTGKQQQKKWRHNLRLKVGEVDRTTRVCSAHFNRRCIDGKQLRGWAMPTLQLGHQEPNIYENPKNIPGFFTPTCALAHCRKRRSIDNDLRTYRYPRNEELLEKWRVNLRLAPDQCRGRICADHFEPMVRGKLKLKTGAVPTLKLGHDEGVVFDNEAIKAGLQHEEEEEHDGGEATNSFDSLVKVKNELNEEDQSMANDEQTEEPDDDDEGQTNHQSDEEEQQEHGYFDPLELVETFAEQHSEDEQLYNNEDDDVDDNMEEEDIPGNDDELLLPDTLPIQLALPPRREKRTVNNVTPICCLKHCRKERTPTHQL
STFGFPKDPQQLLKWSANLQIDLADCVGRVCIEHFEAEMLGTRKLKQNAVPTLNLGHDTPLSYRCNGRAADIHDAQSQQHSVFRLWSLKHCRKRKDSLTTTMTPATPTTPTTPATPTTSATTTMATTPATTTTLTPPAATTMATTPATTTTTTTLTPPAATTTTTLMTTTTAMTETSSMTTTTTTTTTTANLAVMTPKRACCVPSCGKEWQLHAANLRRMPRDRLLMRKWLHNLRLPASLNLSRTYVCKDHFEDQAVLPTLRLGHTDSNIYRSHNSIGGIGCLVPGCPFARLHRGYALPEHPQVREAWLQRLQLPPRSPQSSSHDQLCVMHYMQLFEQVSLPEQLPEIVRCELRETYDLIATSSVALGLRCSVPGCQSKFVDNVHLTRLPTCQRTRAKWLHNTKIDYDPDLERVYRICLRHFEPHTLGAVRPRLGAVPTLHLNHNDPDIYQNPKLEDNAEGALTQPLQMPLRIKTELPLYLSVSPTASASASPSPRGKLRTCCIASCGQQENALTHLFRFPTIETALRKWLVNTQQQPRLVDTQHLWAVPTLKLGHDGYIIPNARHNGNLADSNEDSQALQYIYANFCSVLTCFQQRSDQLRLFAYPTDRPTLRRWAANCKHRSMQASSDGFQVCQSHFADDCFDPETGQLKGDAVPTLELSRPVYEMRCLVNGCVNEEGARRPRLFKMPKRTSQMEDWCHNLRIDAASLVGQEPHVCERHFEAHCFNADKLLRQAARPTLHLGHDDLLDVLPNPASWEEEAHVCVVPSCGRSKDADNVQLFGLPKAKFLLEKWLQNFRLDPSKVQVKGRRICSVHFEPSCIDGDRLHMGAMPTLHLGHDETDNIHESIKELPLTARHGRVRIHCDCCYPNCVELQNGFQKVNYDLPQGLPLRDMWLSYLGLEEFDQQQPLKLCPLHLIMMYDEFPEQSAEEELLTSNYEAAHNSVRIRNASCAVRGCRTLKPRDEYCLHLPPTRRDVLKLWLDNMQMAYNENERYKFRVCGRHFESNCIVEKARRLKPWSVPTLELPVPPVHQNPTEEQWQRMNEQLSALQALEASVELNDVQKQKQIRKIKYAMLKQVIQEQENDQGQEDEQEQEQEDEQEQEESQIKLEPQVKQEPVYEDYYYEQQPQEMQELEVLLEVGHVEKCTTYEQMDTVPSISYAEQPSHNSSFPSHSLGSGSINGSNSRYSARNCSVRGCGVTVLDVDGNLKLHKFPTSIDAMKKWMHNTQVDVDINFAWRFRICSHHFVAECFNGSRIRRGAMPTLRLGSRRPARIYDNEFNVQQEQTTANTVEEQALLPAGDSKLDIKMRLPCPAPPRKSSKYCQIDGCSNHLTSENLTLHKFPHSADMCAKWQHNTQVPFDPEYRWRYRICSAHFEPICLGNMRLIQGSVPTLKLGPRAPKQLFDNDFSVAGLRSDKDMHCEQNFTVKHEQAQPFEEDDSVQAQEQEQEAEQEQDFSMLEPELQLHEGQEEEEHEDNYLNCNNNWNERSIKQEKSSDSSYYNPVKSGYDKCSLIHCQRQRSQHGVHIYKFPRSRQLQQHWMHNLRIRYDERRPWKTMICSAHFETRCIRLRKLRPWAVPTLELGDNVPEQLYTNEQSQQQMEVGSEGEADGYDYDVEDTMLEDYNDDYEDNGSEQLSAEPQIKRERHSRCDPCPPGQLQPWKIKQCCLPHCRKPRGDGIKLFRLPNNIGAIRKWEQATGMRFFPSQRNTKLICSRHFDPQLIGVRRLFTNAVPTRNLGPNIKEPELPEFPESTPRCCITDCEHNGTVKLHKFPSDPLLLHQWCQALDLPEVQRYSGKHICAVHLPAKAFSCIICGVEDVQLPMQDFPENRNQRVKWCYNLQIEPIAKWDNTKHICSKHFESYCFIEPGRLLPDAMPTLHLTHNDNNIFLNEYAMNQPKMLCIKDEPLESDDLML
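The C2CH consensus quoted in the Description field (Cys, 2-4 residues, Cys, 35-50 residues, Cys, 2 residues, His) can be written as a regular expression and scanned against a protein sequence. The sketch below does only that. It is a loose illustration, not a substitute for the PF05485 profile HMM, and the name `find_c2ch` is hypothetical. The test fragment is a short stretch copied from the protein sequence above.

```python
import re
from typing import List, Tuple

# Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His
C2CH_RE = re.compile(r"C.{2,4}C.{35,50}C.{2}H")


def find_c2ch(sequence: str) -> List[Tuple[int, int]]:
    """Return 0-based (start, end) spans of non-overlapping C2CH matches.

    `end` is exclusive, so sequence[start:end] is the matched text.
    """
    return [m.span() for m in C2CH_RE.finditer(sequence.upper())]


# Fragment copied from the protein sequence of this record.
fragment = "CVPLCGVRKSTSPTLQFFTFPKDEKYLHQWLHNLKMFHIPASSYASFRICSMHF"

for start, end in find_c2ch(fragment):
    print(start, end, fragment[start:end])
```

Because `finditer` returns non-overlapping matches, closely spaced or overlapping candidates would need a lookahead-based pattern instead. A bare consensus like this can both miss divergent domains and match unrelated cysteine-rich stretches.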
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00519952;
- 90% Identity
- iTF_00554413;
- 80% Identity
- -