Dlow000744.1
Basic Information
- Insect
- Drosophila lowei
- Gene Symbol
- -
- Assembly
- GCA_008121275.1
- Location
- CM017779.1:9404089-9418504[+]
Transcription Factor Domain
- TF Family
- THAP
- Domain
- THAP domain
- PFAM
- PF05485
- TF Group
- Zinc-Coordinating Group
- Description
- The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 29 4.7 7e+03 -2.3 2.7 42 63 307 331 294 346 0.56 2 29 2.3e-07 0.00034 21.2 0.2 25 86 541 591 529 592 0.84 3 29 1.6e-14 2.3e-11 44.2 4.8 1 87 619 688 619 688 0.82 4 29 1.3e-15 1.9e-12 47.6 0.2 1 87 710 782 710 782 0.85 5 29 7.8e-16 1.1e-12 48.3 5.5 1 87 891 961 891 961 0.82 6 29 5.3e-15 7.9e-12 45.6 3.2 1 86 985 1056 985 1057 0.82 7 29 5e-13 7.3e-10 39.3 0.8 1 87 1092 1160 1092 1160 0.81 8 29 2.5e-11 3.7e-08 33.9 2.0 1 86 1199 1268 1199 1269 0.77 9 29 5.5e-17 8.2e-14 52.0 0.4 1 86 1296 1365 1296 1366 0.82 10 29 5.1e-13 7.5e-10 39.3 0.9 1 86 1387 1456 1387 1457 0.79 11 29 6.6e-14 9.8e-11 42.1 1.1 1 86 1484 1555 1484 1556 0.85 12 29 2.5e-12 3.7e-09 37.1 2.7 1 85 1627 1695 1627 1697 0.82 13 29 4.4e-12 6.5e-09 36.3 0.0 1 86 1720 1788 1720 1789 0.82 14 29 6.2e-14 9.1e-11 42.2 0.5 1 87 1964 2033 1964 2033 0.78 15 29 3.2e-10 4.7e-07 30.3 0.0 1 86 2114 2187 2114 2188 0.79 16 29 0.0013 1.9 9.1 0.0 1 58 2207 2251 2207 2265 0.80 17 29 2.3e-12 3.4e-09 37.2 0.1 1 86 2287 2356 2287 2357 0.81 18 29 1.4e-13 2.1e-10 41.1 0.1 1 86 2445 2513 2445 2514 0.81 19 29 2.5e-10 3.6e-07 30.7 0.0 1 85 2549 2619 2549 2621 0.79 20 29 1.7e-10 2.5e-07 31.2 0.5 1 87 2635 2705 2635 2705 0.80 21 29 1.9e-16 2.9e-13 50.3 0.6 1 86 2730 2802 2730 2803 0.80 22 29 0.00019 0.28 11.8 0.1 1 58 2830 2885 2830 2905 0.78 23 29 1.6e-11 2.4e-08 34.5 0.5 1 87 2923 2995 2923 2995 0.79 24 29 5.1e-12 7.5e-09 36.1 0.0 1 86 3129 3199 3129 3200 0.78 25 29 3.8e-12 5.7e-09 36.5 4.2 1 86 3255 3325 3255 3326 0.80 26 29 3.8e-14 5.6e-11 42.9 4.6 1 86 3449 3519 3449 3520 0.84 27 29 3.4e-12 5e-09 36.6 0.2 1 86 3608 3677 3608 3678 0.84 28 29 5.6e-09 8.2e-06 26.4 0.6 1 58 3698 3747 3698 3755 0.86 29 29 1.5e-09 2.2e-06 28.2 0.9 18 87 3766 3824 3752 3824 0.75
Sequence Information
- Coding Sequence
- ATGTCACAGCAAAATCCACATGCTCATCCGCACTACCATCAGACACAACATCACCACCATCacccgcagcagcagcagttgcagttgcagttccCAGTTGCATCTCagttgcagcaacaacaacagcaacaacaggcgCAAATGCCACACAGCAATTGGTACTCACATGTTGCTTCCTATCCGCcatgcagcaacaacaacatgaaTGCCTATGGAGCAGGTAGCACGCATGGATATTATGCTGCTTCCGCCGCCGCTGGCGGTGGGCTCAATGTCAATGctgtgggtgggggtggggggtcaGTTTCAGCGTATAACCTTGAGGCGAACACAGTGGCATACGCCCACAACCAGCTACTACAGtatcaacagcaacaccatcagcagcagcagcatctcaGCCATCGTTCCTATATGGGACATGATATAATGTCCGGAACATATCCGTACATAAAAAGCGAACCCATGGAGTCAGCGTATCAGCAGCCACAGAATCCAATGGCCCCACCCCCAGCGCCAGATATGATAATAAAATCGGAGCCCATGGATGAACATCCCTACAAGTCCAACTATATTGATGACAATACGCCCTTTGCTGATTTTAACAAGTTCAACGAATTCAGCGGCGATATGCTAAGCCCCAAAGTTGAGCTAACCGTCAAGGATGAGACCTACGGAAAGActtccagcagcaacagcagcagcagctttgcACGCCGCAAagcccagcaacagcaaacgaCAGATCGTTCGGCGGAGAGTCTGCCCATCTGCCAGCGCTGCAAGGAGGTCTTCTTCAAGAAGCAATCCTATCTGAGGCATGTGGCCGAGAGCAGTTGTGGCATTCAGGAGTACGATTTCAAGTGCAACATATGCCCCATGTCCTTCATGACCACTGAAGAGCTGCAGCGGCACAAGCAACTGCATCGTGCGGACAAGTTCTTCTGCCACAAATACTGCGGCAAGCATTTCGATACGATAGCCGAGTGCGAATCGCACGAGTACATGCAGCACGAGTATGAGAGTTTTGTTTGTAATATGTGCTCTGGAACCTTTGCCACGCGGGAACAGCTGTATGCCCACTTGCCACAGCACAAGTTTCAGCAGCGTTACGACTGTCCCATCTGCCGTTTGTGGTATCAAACAGCCGTCGAATTGCATGAGCATCGACTGTCGGCTCCATACTTTTGCGGCAAGTACTATACcaatcaacagcagcagcagcttgcgACGAACCAGGGGAATTACAAGCTGCAGGACTGCCATATGGCCACCATGGAAATACCCACAGCGCCACTGCATAAGGCAACGCCTTCCAATGCCTCAGCCTTGCCAGCCACAGCCGCTTTGAGCTCTCTGTTGCAACAGCGCCAGGCAAATGCCGATGGGGCAGCGGCCATGTTTGCTGCGGCCTCCTCTTCCTCCGCCTCGCTGAAGAGTGAGGTGAGCGTGAAGCTAGAGCGTAGCTACAGCAACTCCACCAGCGAGTCCTCGTACAGCCATCAAGACAACAGCAGCTACAACAATGCCTATGGCAGCGACAGCTCCGTCCATGGCGGAGCACTGGCCGGACCACAGGCGCACTCCTCAACGCTGGACGACTCCGAGGATGCCCTGTGttgtGATGAAAAATATCTCAATCAGTGGCTGCACAACCTCAAGATGTTCCACATACCAGCGGCGAGCTATGCGACATTTCGCATCTGTAGCATGCACTTCCCGAAGCGTTGTATCAATCGGTATTCGCTGTGCTATTGGGCGGTGCCCACCTTCAATCTGGGGCACGACGATGTGGCCAATCTGTACCAGAACCGAGAGCTAACCAACACTTTTACCACTGGAGAGGTGGCACGCTGCAGCATGCCGCACTGCACCAGCCAGCGGGGGGAGAGCAATCTGAAGTTCTACAATTTCCCCAAGGACATCAAGAGCCTGATCAAGTGGTGCCAGAACGCCCGCCTGCCAGTGCAGGCCAAGGAGCCGCGTCACTTTTGCAGCCGCCACTTTGAGGATCGCTGCATTGGCAAGTTCCGACTGAAGCCGTGGGCCGTGCCCACCCTCCATCTGGGAGCGCAGTACGGCAAGATCCATGACAATCCCAAGAATTTGTATGTGGAGGAGAAGCGCTGCTGCTTGAACTTTTGTCGCCGCAGCCGCTCCTCGGACTTTAACATGTCGCTGTATCGTTTCCCCAGAGATGAGGTGCTCCTGCGACGTTGGTGCTATAATTTAAGGCTGGATCCGGGCGTGTATCGTGGCAAGAATCACAAAATATGCAGTGCGCATTTCATCAAGGAAGCATTGGGTCTAAGAAAGCTGTCGCCAGGTGCCGTTCCCACATTGCATTTGGGTCACAATGACACCTTTAATATCTATGAGAACGAACTGTGGCCACCGCCATCTCCCACTGGACAACATGGCGGCAGTCTCCAgcttctccagcagcagcagacgtcGCAGCAGCTGTCGCATCACCAATCGtccctgcagcagcagcagcagcatcagccaATGCATAGCAAATCCTATCAACGCCATTCGGCGGCCTCCACTTCCTCCTCCGCCAGTTCGGCCTCTCATTATGTTGACCCCGAGATGAGTGCCTCGTATTTGAACCTGTCTGCGGGTGGCTCCTCTGGCGGGATGAATGCCAGCGACTGCATGGATGTGTGCTGCGTGCCAAGCTGCGAGAGCAAGCGGCACAACAGCGAAAACATCACATTCCACACGATACCGCGCAGGCCAGAGCAGATGCGCAAGTGGTGCCACAATCTGAAGATACCCGAGGACAAGATGCACAAGGGCATGAGGATTTGTAGCCTGCACTTTGAACCCTACTGCATTGGCGGCTGCATGCGTCCGTTCGCCGTGCCCACACTCCATTTGGGGCACGAAGATGAAGACATACATCGCAATCCGGATGTGATCAAGAAGCTGAACATCCGAGAGACCTGCTGTGTAGCCGTGTGCAAGCGGAATCGCGACAGAGACCATGCCAACCTCCATCGTTTCCCCAGCAATGTGGCGCTGCTCACGAAGTGGTGTGCGAATCTGCAACGGACTGTACCCGATGGCAGCAAACTCTTCAACGATGCCATCTGCGAGGTTCACTTTGAAGATCGTTGTCTGCGCAACAAGAGGTTGGAGAAGTGGGCTGTGCCCACTCTGATCCTCGGCCACGAGGACATTGCCTATCAGCTGCCGACGCCGGAGCAGGTGGCCGAGTTCTATGCCCGTCCCACGGCCCCCAACAATGGCGAGGAGCAGGGAGAGTGCTGTGTTGAAACGTGCAAACGGAACCCGAGTGTGGATGACATCAAATTGTATCGTCCGCCGGAGGATACTTCGGTGCTGGCCAAATGGGCGCACAATCTGCAAACGGAGGCCGCTCTCCTCACGAACATGCGGATATGCAATCTGCACTTTGAGGCTCACTGCATTGGCAAGCGCATGCGTCCGTGGGCCATACCCACGCTCAATCTGGCTGGAAACATTGAGAATCTGTACGAGAATCCCGAGCATTCGATGCTGTACAAGCGAAGGACGCACCTCAAACAGAAGGTGCCGGTGACAAAGCCCACGTGGGTGCCTCGCTGCTGTCTGCCGCACTGCCGCAAGGTGCGTGCCCTTCACAATGTCCAGCTGTATCGCTTCCCTAAGCTGAATCGTTCGACGCTGGCCAAGTGGGCACACAATCTGCAGGTGCCGCAGGTGGGAAGTGCCCAGCGGCGGGTCTGTTCCGCCCACTTTGAGCCGCATGTTTTGAGCAAAAAATGCCCGGTGCCGCTGGCGGTGCCCACACTGGACTTGAATTCACCCGCTGGCCACAAGATCTACCAGAATCCGGCCAAGCTGAAGGccaacaagctgtgcctgcaGCGCGTATGCATTGTGGAGAGCTGCAGGAAGACCAGAGCCCAGGGCGTGCAGCTCTTCCGTctgccccacagccccacGCAGCTGAGGAAATGGATGCACAATATACGGACACGCCCAAGGGCGGCCATGAGGAGCCAGTATCGCGTCTGTTCGCGACACTTTGAGACTCACTCCTTTAACGGTCGAAGGCTGAGCGCCGGGGCCATTCCCACTTTGGAGTTGGGCCATGACGACGATGATATCTTCCCGAATGAAGCGCAGGCCTTTGCGGATGAGCACTGCGCTGTGGAGGGCTGTGAATCGTCGAAGGAACAGCCCGAAGTGCGGCTCTTCCGCTTCCCCacggacgacgacgacatgCTGTGGAAGTGGTGCAACAATCTTAAGATGAATCCCGTCGACTGCATCGGTGTGCGGATCTGCAACAAGCATTTCGATGCCGATTGCATTGGACCCAAGCATCTGTATAAGTGGGCCATACCCACGATGCTGCTCGGCCACGATGATTCCCAGATCGAGCTCATACTCAACCCCAAGCCGGAGGAACGCTACGTGGATCCCGTGTTCAAGTGCATTGTCCCAACCTGCGGGAAGACGCGCCGCTTCGATGAGGTGCAAATGAACAGCTTCCCCAAGGATGCGGATCTGTTTCAGCGCTGGCGCCACAACCTCCGCCTGGAGCATCTGTGCTTCAAGGAGCGCGAGAAATACAAGATCTGCAATGCCCATTTCGAGGACATGTGCATTGGCAAGACGCGTCTGAACATTGGTTCCATACCCACTCTGGAGCTGGGCCACGAGGAAACGGAGGATCTGTTCAAGGTGAATCCGGAAGATCTGCAGAGCAATCTGTTTGGGCGTCCCCGTCGGCTGCTAAGAGGATTGAACAATGTGACCATCAAACAGGAGGTGCCAGAGATGGATGAGCAGGACATAAAGCCCGACATAAGGACCAATTTTACACAGGTAAAGATTAAGAAATCTCTGGGGGATATCAAGTGCTGTGTGCACACGTGTGGACGCAGTCGTTTGGAGCATGGGGCACGTCTCTTTCCCTTCCCCACGGGCAAGCAACAGCACCTCAAGTGGCGCCACAATCTGCGCCTGGAGCCCGACGAAGTGGACAAAAGCACGCGCGTCTGCAGCGCACACTTCAACAGGCGCTGCATCGATGGCAAGCATCTTAGGGGATGGGCCATGCCCACACAGCAGTTGGGCCACCAAGAGCAGCCTATATACGAGAATCCCAAGAATATACCTGGCTTCTTTACGCCCACCTGTGCGCTGGGGCACTGCCGCAAGCGGCGGAGCATTGACAATGATTTGCGCACATACCGGTATCCGCGGAGCGAGGATCTCCTCGAGAAGTGGCGCGCAAATCTCGGACTATCGCTGGATCAGTGCCGTGGCAGGATCTGTGCGGATCACTTTGAGCCGCAGGTGAGGGGgaaactgaagctgaagaCGGGAGCAGTACCCACGCTAAAACTGGGCCACGAGGAGGCTTTGATGTACGACAATGAGGCTATAAAGGCTGGAGTGGCCGAAGAGGAGGTTGGCAGTCCTGCGGCATCGCCTCTGGTGACACCCAAAACGGAAGTTCTGGACGAAGAGGAGCGCGAGGaagatgaggaggaggaggagaaccCCGAAGAAGAGCAGCAGGAAACGCATGATGAGGAGAAGGATGAACACGAAGATGACGCGCCCGAGGGAGCAGAGCAGCTGGGCGATGAGGATGACGACGAGGATCCAGGCAACTATTTTGATCCGTTGGAGCTGGTGGAGACGTATGCAGAGCATCCCAGCGACGATGACAACAGCCACGAGGAAGCAGACGATGCCAgagaggaggatgaggaggaggaggaggaggaggcagaaaCTCTCTTGCCTGATACACCACCCaaaataacagcagcagcagtccttCGCGTGCCGAAACCATGGGAAAGAGCTGTCGCAGTAGTGCCTCGCCGAGAGAAGCGTCCGAATAACGTGGATCCTATCTGCTGCCTCAAGCACTGCCGCAAGGAACGCTCCGCCATGTATCTGCTGAGCACATTTGGCTTTCCCAAGgaccagcagctgctgctcaagTGGTGCGCCAACCTCCAAATGGATCCCTCGGGTTGCATTGGCCGCGTCTGCATCGAACACTTCCAGTCGGAGGTTCTGGGCACGCGAAAGCTCAAACAGAATGCGGTTCCCACCCTCAATGTGGGTCACGATGTGCCACTGCGCTACACCTGCAACGGCCAGGAGATGCCtcaggcagcagcggcggccgcCACGAGCAGCTTCCCCGACGAAATGCCACAGCATTCGGTTTTTCGGCTTTGGAGCCTGAAACACTGCCGCAAGAGGAAAGTGTTGGAGagtccagctccagctccagcagcgaTCAAGGATGAGGAGCAGATGcagctggagatggaggtgGAGACTAAGCCAAAGATATGCTGCCTCTCCAGTTGTGGCAATGTGGAGGGCTACGGCCCGGGCGGGCACTTTCAGCCGCTGCCCCAGGACCAAAGGATGCTGAAAAAATGGCAGCACAATCTGAGGCTGCCATCTGTCAATCCCGATTCGGATCTTCGTGACTTTCGCCTGTGCATGGAGCACTTTGAGCCGCATCAAATCGAGAACGGAGCACCAGTGAGAATGGCAGTTCCGACCCTCAAGCTTGGCCACTCCAGTCCGAATATCTTTAAGAACAGCGAGAGCACGCTGCCGGGATGCCTGTGGCCCTCGTGTCCGCCCAATCGCAAGATCTGCTACGATCTGCCTGACAATGAGGCCGTTCGAGCGGCCTGGCTGTCGTATGTGCGGCTGCCGCTGGACAGCCAGGGGCGTCTGTGTGGCCTGCACTTTCTGCAGCTGTACGAGGAGGTGGATCTGCCAGGAGATGTACCCGAAACGGTGCTCGAGCGACTGCAGGGTACCTACGATCAGGCCTCCATCTCGCTGAAGTTTCAGTGCTCGGTGCAGGGATGTGGCTCCAAGTACAAGCAGGACACGCATTTGGCAAAGCTACCACGGGACGCGGAACTGCTCGCCAAGTGGCTGCACAACACCAGGATCTCCTACGATCGCTCCTTGCATTTCAGCTACCGCATTTGTCTGCTGCACTTTGAGGCGTTCTGCTTGAATGGCGTGCGCCCGCAGACCTGGGCCATACCTACGCTGCAGCTAAATCACGACGGAGAGATCTACCAGAATACCGTCAAGCAGGAGATCCCAGAGAATCCCCTGAAGCAGGAGATCCTCGAGAATCCCGTGAAGCAGGAGAAATCGCACTTTGGCAGCATCTCCAGCCTGAGTCTCTCCATCCCCCTGCACATCAAGACCGAACAGGGTCCTGTCCAGCGACCTCGAGGCACTTGGGGCACATCTTCTCAGAGCAGTCCCTGCCTGAGCGCCAGCTCCAGTCCCCGCATGAAAAACAGAATCTGCTGCGTTTCCAATTGCGGGGAGTACGCCAGATCCCAGCGGCTGTACCGCTTTCCCACCGCCGAACCGGCACTGCTCAAGTGGCTGGTGAATACCCAGCAAAAGCCGGGACTCGTGGACATCCAGAACCTGTTTGTGTGCCAGCTGCACTTCGAGGCGGACGCCATTAACCAAACGCAGCTCAGGAGCTGGGCCGTGCCGACACTGAGGCTGGGCCACGATGGGCATGTCATACCAAATGCCAGACACAATGGAAACATAGCGAACAGCCAGGAGACGGAGCAGGCCATGGAGTTCATTCGGGCCAACTACTGCGCGGTGTTGAGCTGCTTCCAGCCGAAAGGAGATGGTGTGCGCTTCTACAAGTATCCCAGCGACATTGCCATGGCGCGCAGGTGGGCCACGAATCTCAAGCATCGCTCCATGCAGGCCAGCAGTCATGGCTTCCTGGTTTGCCAGTCCCACTTTGCAGCCGACTGTTTTGATCCGGAGACGGGAGACCTGCGCGAGGACGCCGTACCCGTGGCCACACTCGCAGGGAACGTAAAAACAGAGGGCCTGCTGCTCCGTTGTCTGGTAAGGAGTTGCTCTACGGATAACTCTGGAAAAGGACTGCTGTTCAAGGTGCCAAAAAAGAATCGTGTACGGGATGTGTGGGCCCACAATCTGTGGATGCATCCGATAGAACTGATGGGCGAGCACTACATCTGCGATCGACATTTCGAGGCGCATTGCGTGAATGAACACAAATTGCTGCACGCGGGTTCAGTGCCAACCCTCCACCTGGGACACAACGAACCGCTGGAACTACTGCCCAATCCCCAGACCTTCCAGGACTGCCCTGAGGAGTGCGAGTGCTGTGTGCCCGGCTGTGGACGCACCAATCGGAAGGAGGAGGATCTGCAGTTTATCAAATTTCCCAAGTGGCGAGTCCTGTATGAGAAGTGGCTGCACAACTTCCGCCTCGAAGTGCCCAAGGAGCAGCGCATCGGAACGCTGAGAGTGTGTCACATGCACTTTGAGGAGAGCTGCTACGATGGCCAGAATGTGCGCAGGGGAGCTATGCCCACCCTAGAGCTGGGACACTCGCATCCAGACATTTATCGCACCGACAAGGGATCGCTGTGGAAGAAGGTTCACAAGAGATTCACCGATTGTTGCTATCCGGATTGCTACGAGGAATGTCACAAGGCCAACACGAATCGCATGGCCTACGACCTGCCCAGCGATGGGCCACTGCGCGAGTCCTGGCAGCAGCACATGGCTATCCCTGCCAGCGGCGAGGATAGCTCCTCAGTGCTAAAGCTCTGTGCCCTACACTACATCATGCTGTACGAGCACAGCGAACAGAGCTTCCCAGAACACGGACCGAATCTACTGCTGGACAAGAACTACGAGCACGCCCGTCAGTTGGCGTATCTGCGGCGCTTCTTGTGTGCCGTACAGGGGTGTCGCCATCTGCAGCCGCGGGATGGGGGTCCGATGCACGGCATACCCCGGCGGAGGGAGATCCTTCGGATGTGGGTGGAGAATGCACAGCTGCGGCTGAACGAGCACGAAATTTACATGACGAAGCTGTGCAGCAAACACTTTGAGGCCCACTGCCTGTTCGAAGGCAAAAGATGCTATCCCTGGAGCGTGCCAACGCTCCATCTGCCAGAGCTGCAGCCCGGGCAGGTGCTCCACCAGAATCCCACCAAGGAGGAGTGGCAGGAAATGAAACAGAGAATGAAAGTGGAAGAGCAGACGCCGAAGACGGAAGAGCAGGCAGATGGACTACTAATGGAACCCTATGTGAAGATGGAACCCCACGACGATGAGTCACAAACGGAGTCGGAATTGCTGATAAATGAGAGCACGCTGGACTCTCAAGAACTCTCTCAAGACTTTCCACCACAAGAGCCAAATGAAATGCCCGCCCTGGAGGTGCTCCTAGAGGTGGGACATGTCGAGAGGCTGGATAGCTACGAGAAGAAGGAATACTCTGCGGATACCTCTGCCAACACGTATGCTCCGAACAAACGTTTCCGCCATCAGTACAGTGCCCACAAGTGTAGTGTCGAAGGATGTCGCGTGTCGCTCGAGGACCTTGGCGGGAATCTGAAGCTGCACAAGCTACCCAGCTCCACGGAGGCGGCCGGGAAGTGGCTGTACAACATTCAGGTGGAGATAGAGGATAAATGGCGGATTCGCGTCTGCAGCCATCACTTCGACAGGCAGTGCCTAAATGGTTCCAGGCTCAGGAGGGGATCGATGCCCACTCTGCTGCTGGGGCCGCGTGTTCCAGACACTATCCATCACAATGAGTTTGCGCAGCTGCAATTGGACGATGCGCCAGCCCAGAATGGCCATCCATTGGAGCGATCCATTGGAAAGGTTGTGCAGCTATGCGTTCCACGTCCGTCGCCGCCGCGTAAGTCCAGCAAATTCTGCCAGATCGAGGGATGTCCGAACCATTTGACCAGCGAGAATATGACGCTACACAAGTTCCCGCACTCGTCATGGATCTGCACCAAGTGGCAGCACAACACACAGGTGCCATTCGATCCGGAGTACCGCTGGCGATATCGCATCTGCAGCGCTCACTTCCATCCCGTGTGTATGGTCAATATGCGGCTGCTGCATGGCAGTGTGCCCACCCTGAAGCTGGGTCCACGGGCACCCGGTGAACTCTTTGACAGCGACTTTGAGGCCATAAACATAAAGATTGAAAAAATGGAGAAGATGGAGAGGAAATCTGAGGCTCAGAGAAGCACCACTGGAGATAGATATCCCACCATGCAGGTCATGGGGGAGAAGAAGTTCAAGACTGAGGAGCTGGAAGAtggaatggaggaggaggatgacaTGCTCTGCCTGGAGCCAGAGATGCAGCTATACGAAGATCAGGaagaacagcaacagaagccaAAGATAAATCTTGGAGTCCCCAATGGCGGCTGGAAAACAGAACTCCGTTTGCCATCGAAGGGTAGGGTGGCGTTCAATCCGGTGAGATCTGGCTACGACAAGTGCTCGCTGATGCATTGCCAGCGCCAGAGATCGAAGCACGGCGTTCACATCTACAAGTTCCCCCGATCGcaggagcaccagcagcgATGGATGCACAATCTGCGCATACGCTACGATGGTAAGCGCCCCTGGAAGTTTATGGTCTGCAGCGTGCACTTTGAACCGCATTGCATACGGCTGCGGAAGTTGCGGCCCTGGGCAGTTCCCACGCTAGAGCTGGGAGACAATGTGCCCGAGGACATCTATACGAACGAGCAGTGCCAGATGTTTGCCAGTGGACAGGTAGGAGAGATCAATGGCATCGATAGCGatgaggcggaggcagaggggGAGAGCGATGGGAATGATGAGGATGGCCTgcaggaggacgaggaggaggagacagaCGACCAGGAGCCCATCGCCAAGAGACGTCGTCGCTCGCGGCTGGATGCCGTCTGGCCTCCCGGCCAGGTGCCACCATGGAAGGTGAAACAATGCTGTCTCCCCTACTGCCGCAGTCCTCGCGGCGAGGGCATCAAGCTGTTTCGACTGCCCAACAAAGTCAATTCCATCCGCAACTGGGAGCTGGCCACGGGCATGAAGTTCAAGGAGTCGCAGCGCAACACGAGACTCATCTGCAGCCGCCACTTTGAGCCGGAGCTGATCGGAGTGCGTCGTCTGATGCGCAATGCCATTCCCACCAGGCATTTGGGACCCACGGGCGATATAAAGCCAGTGGTGGCTCCACCGACAGCTGGTCCTAAATGCTGTATGGCAGATTGTGTCTATGATGTGGCGGATGTGAAGCTGCACAAGTTTCCCAGCAATCCCAAACTCCTGAGGGAGTGGTGCCAGGCATTAAGGGTCACGGATATGCAAAGGTATCGCGGCAAGCACATTTGCTCCGCCCATCTACCCGTCCACGAGGCCGTACAGTGCATTGTTTGTGGCGCGGACAAAGCACCCCTGCTGCCGATGCTTAATTTTCCCGCTAACCGGAATCAGCGCGCCAAATGGTGCTACAATCTGAAGATCGAAACGATACCCAAGTGGGACATATCCAAGCACATTTGCTGCAAACACTTTGAGCCATATTGCTTTGCAGAGGCGGGTCTCCTAAAGCCAGAGGCGGCGCCCACACTGCATTTGAATCACAATGATACAAACATATTCCTTAACGATTGTGCCATAAATCCTGCCTACAGTGGAGGAGGAGTACGGGTGAAGGATGAGCCCATGGACAATCAGGTCCTGTCGTTGGTgtag
- Protein Sequence
- MSQQNPHAHPHYHQTQHHHHHPQQQQLQLQFPVASQLQQQQQQQQAQMPHSNWYSHVASYPPCSNNNMNAYGAGSTHGYYAASAAAGGGLNVNAVGGGGGSVSAYNLEANTVAYAHNQLLQYQQQHHQQQQHLSHRSYMGHDIMSGTYPYIKSEPMESAYQQPQNPMAPPPAPDMIIKSEPMDEHPYKSNYIDDNTPFADFNKFNEFSGDMLSPKVELTVKDETYGKTSSSNSSSSFARRKAQQQQTTDRSAESLPICQRCKEVFFKKQSYLRHVAESSCGIQEYDFKCNICPMSFMTTEELQRHKQLHRADKFFCHKYCGKHFDTIAECESHEYMQHEYESFVCNMCSGTFATREQLYAHLPQHKFQQRYDCPICRLWYQTAVELHEHRLSAPYFCGKYYTNQQQQQLATNQGNYKLQDCHMATMEIPTAPLHKATPSNASALPATAALSSLLQQRQANADGAAAMFAAASSSSASLKSEVSVKLERSYSNSTSESSYSHQDNSSYNNAYGSDSSVHGGALAGPQAHSSTLDDSEDALCCDEKYLNQWLHNLKMFHIPAASYATFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPHCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEDRCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPGVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPSPTGQHGGSLQLLQQQQTSQQLSHHQSSLQQQQQHQPMHSKSYQRHSAASTSSSASSASHYVDPEMSASYLNLSAGGSSGGMNASDCMDVCCVPSCESKRHNSENITFHTIPRRPEQMRKWCHNLKIPEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLHLGHEDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVALLTKWCANLQRTVPDGSKLFNDAICEVHFEDRCLRNKRLEKWAVPTLILGHEDIAYQLPTPEQVAEFYARPTAPNNGEEQGECCVETCKRNPSVDDIKLYRPPEDTSVLAKWAHNLQTEAALLTNMRICNLHFEAHCIGKRMRPWAIPTLNLAGNIENLYENPEHSMLYKRRTHLKQKVPVTKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPQVGSAQRRVCSAHFEPHVLSKKCPVPLAVPTLDLNSPAGHKIYQNPAKLKANKLCLQRVCIVESCRKTRAQGVQLFRLPHSPTQLRKWMHNIRTRPRAAMRSQYRVCSRHFETHSFNGRRLSAGAIPTLELGHDDDDIFPNEAQAFADEHCAVEGCESSKEQPEVRLFRFPTDDDDMLWKWCNNLKMNPVDCIGVRICNKHFDADCIGPKHLYKWAIPTMLLGHDDSQIELILNPKPEERYVDPVFKCIVPTCGKTRRFDEVQMNSFPKDADLFQRWRHNLRLEHLCFKEREKYKICNAHFEDMCIGKTRLNIGSIPTLELGHEETEDLFKVNPEDLQSNLFGRPRRLLRGLNNVTIKQEVPEMDEQDIKPDIRTNFTQVKIKKSLGDIKCCVHTCGRSRLEHGARLFPFPTGKQQHLKWRHNLRLEPDEVDKSTRVCSAHFNRRCIDGKHLRGWAMPTQQLGHQEQPIYENPKNIPGFFTPTCALGHCRKRRSIDNDLRTYRYPRSEDLLEKWRANLGLSLDQCRGRICADHFEPQVRGKLKLKTGAVPTLKLGHEEALMYDNEAIKAGVAEEEVGSPAASPLVTPKTEVLDEEEREEDEEEEENPEEEQQETHDEEKDEHEDDAPEGAEQLGDEDDDEDPGNYFDPLELVETYAEHPSDDDNSHEEADDAREEDEEEEEEEAETLLPDTPPKITAAAVLRVPKPWERAVAVVPRREKRPNNVDPICCLKHCRKERSAMYLLSTFGFPKDQQLLLKWCANLQMDPSGCIGRVCIEHFQSEVLGTRKLKQNAVPTLNVGHDVPLRYTCNGQEMPQAAAAAATSSFPDEMPQHSVFRLWSLKHCRKRKVLESPAPAPAAIKDEEQMQLEMEVETKPKICCLSSCGNVEGYGPGGHFQPLPQDQRMLKKWQHNLRLPSVNPDSDLRDFRLCMEHFEPHQIENGAPVRMAVPTLKLGHSSPNIFKNSESTLPGCLWPSCPPNRKICYDLPDNEAVRAAWLSYVRLPLDSQGRLCGLHFLQLYEEVDLPGDVPETVLERLQGTYDQASISLKFQCSVQGCGSKYKQDTHLAKLPRDAELLAKWLHNTRISYDRSLHFSYRICLLHFEAFCLNGVRPQTWAIPTLQLNHDGEIYQNTVKQEIPENPLKQEILENPVKQEKSHFGSISSLSLSIPLHIKTEQGPVQRPRGTWGTSSQSSPCLSASSSPRMKNRICCVSNCGEYARSQRLYRFPTAEPALLKWLVNTQQKPGLVDIQNLFVCQLHFEADAINQTQLRSWAVPTLRLGHDGHVIPNARHNGNIANSQETEQAMEFIRANYCAVLSCFQPKGDGVRFYKYPSDIAMARRWATNLKHRSMQASSHGFLVCQSHFAADCFDPETGDLREDAVPVATLAGNVKTEGLLLRCLVRSCSTDNSGKGLLFKVPKKNRVRDVWAHNLWMHPIELMGEHYICDRHFEAHCVNEHKLLHAGSVPTLHLGHNEPLELLPNPQTFQDCPEECECCVPGCGRTNRKEEDLQFIKFPKWRVLYEKWLHNFRLEVPKEQRIGTLRVCHMHFEESCYDGQNVRRGAMPTLELGHSHPDIYRTDKGSLWKKVHKRFTDCCYPDCYEECHKANTNRMAYDLPSDGPLRESWQQHMAIPASGEDSSSVLKLCALHYIMLYEHSEQSFPEHGPNLLLDKNYEHARQLAYLRRFLCAVQGCRHLQPRDGGPMHGIPRRREILRMWVENAQLRLNEHEIYMTKLCSKHFEAHCLFEGKRCYPWSVPTLHLPELQPGQVLHQNPTKEEWQEMKQRMKVEEQTPKTEEQADGLLMEPYVKMEPHDDESQTESELLINESTLDSQELSQDFPPQEPNEMPALEVLLEVGHVERLDSYEKKEYSADTSANTYAPNKRFRHQYSAHKCSVEGCRVSLEDLGGNLKLHKLPSSTEAAGKWLYNIQVEIEDKWRIRVCSHHFDRQCLNGSRLRRGSMPTLLLGPRVPDTIHHNEFAQLQLDDAPAQNGHPLERSIGKVVQLCVPRPSPPRKSSKFCQIEGCPNHLTSENMTLHKFPHSSWICTKWQHNTQVPFDPEYRWRYRICSAHFHPVCMVNMRLLHGSVPTLKLGPRAPGELFDSDFEAINIKIEKMEKMERKSEAQRSTTGDRYPTMQVMGEKKFKTEELEDGMEEEDDMLCLEPEMQLYEDQEEQQQKPKINLGVPNGGWKTELRLPSKGRVAFNPVRSGYDKCSLMHCQRQRSKHGVHIYKFPRSQEHQQRWMHNLRIRYDGKRPWKFMVCSVHFEPHCIRLRKLRPWAVPTLELGDNVPEDIYTNEQCQMFASGQVGEINGIDSDEAEAEGESDGNDEDGLQEDEEEETDDQEPIAKRRRRSRLDAVWPPGQVPPWKVKQCCLPYCRSPRGEGIKLFRLPNKVNSIRNWELATGMKFKESQRNTRLICSRHFEPELIGVRRLMRNAIPTRHLGPTGDIKPVVAPPTAGPKCCMADCVYDVADVKLHKFPSNPKLLREWCQALRVTDMQRYRGKHICSAHLPVHEAVQCIVCGADKAPLLPMLNFPANRNQRAKWCYNLKIETIPKWDISKHICCKHFEPYCFAEAGLLKPEAAPTLHLNHNDTNIFLNDCAINPAYSGGGVRVKDEPMDNQVLSLV
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00474529; iTF_00514916; iTF_00601081; iTF_00517737; iTF_00603372; iTF_00480887; iTF_00473809; iTF_00484466; iTF_00471644; iTF_00574842; iTF_00580764; iTF_00487324; iTF_00563936; iTF_00611379;
- 90% Identity
- iTF_00484466;
- 80% Identity
- -