Zafr015171.1
Basic Information
- Insect
- Zaprionus africanus
- Gene Symbol
- -
- Assembly
- GCA_018151435.1
- Location
- JAECXB010000221.1:438161-451566[+]
Transcription Factor Domain
- TF Family
- THAP
- Domain
- THAP domain
- PFAM
- PF05485
- TF Group
- Zinc-Coordinating Group
- Description
- The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 28 1.3e-15 2.5e-12 47.2 3.8 1 86 363 435 363 436 0.85 2 28 2.3e-15 4.6e-12 46.4 4.6 1 87 463 532 463 532 0.83 3 28 5.6e-16 1.1e-12 48.3 0.4 1 87 554 626 554 626 0.85 4 28 3.7e-16 7.4e-13 48.9 5.3 1 87 692 762 692 762 0.83 5 28 6.7e-15 1.4e-11 44.9 3.5 1 86 786 857 786 858 0.82 6 28 9.5e-13 1.9e-09 38.0 1.0 1 87 893 961 893 961 0.81 7 28 7.9e-11 1.6e-07 31.8 1.4 1 86 1009 1078 1009 1079 0.76 8 28 5.3e-16 1.1e-12 48.4 0.1 1 86 1106 1175 1106 1176 0.82 9 28 6.2e-13 1.2e-09 38.6 1.2 1 85 1197 1265 1197 1267 0.80 10 28 2.5e-14 4.9e-11 43.1 1.0 1 86 1294 1365 1294 1366 0.84 11 28 6.6e-14 1.3e-10 41.7 1.6 1 85 1440 1508 1440 1510 0.82 12 28 1.6e-12 3.3e-09 37.2 0.1 1 86 1533 1601 1533 1602 0.81 13 28 2.7e-13 5.3e-10 39.8 1.4 1 87 1688 1757 1688 1757 0.80 14 28 1.3e-12 2.6e-09 37.6 0.5 1 61 1831 1884 1831 1908 0.78 15 28 0.058 1.2e+02 3.4 0.0 1 58 1917 1967 1917 1989 0.74 16 28 3.1e-12 6.3e-09 36.3 0.4 1 86 2006 2075 2006 2076 0.86 17 28 3e-15 6.1e-12 46.0 2.2 1 86 2140 2209 2140 2210 0.83 18 28 1.2e-12 2.3e-09 37.7 0.9 1 86 2245 2316 2245 2317 0.81 19 28 1.3e-12 2.6e-09 37.5 1.9 1 87 2327 2399 2327 2399 0.80 20 28 3.9e-16 7.8e-13 48.8 0.1 1 86 2429 2503 2429 2504 0.79 21 28 1e-05 0.021 15.4 0.2 1 58 2540 2594 2540 2614 0.84 22 28 6.4e-15 1.3e-11 45.0 0.3 1 86 2632 2704 2632 2705 0.82 23 28 8.9e-14 1.8e-10 41.3 4.4 1 86 2852 2924 2852 2925 0.83 24 28 5.7e-14 1.1e-10 41.9 2.2 1 86 2985 3055 2985 3056 0.82 25 28 7.3e-14 1.5e-10 41.6 3.6 1 86 3169 3239 3169 3240 0.84 26 28 1.9e-12 3.8e-09 37.0 0.0 1 87 3322 3392 3322 3392 0.84 27 28 1.7e-08 3.4e-05 24.3 0.6 1 58 3414 3462 3414 3476 0.82 28 28 1.1e-10 2.2e-07 31.4 1.9 17 87 3478 3537 3465 3537 0.74
Sequence Information
- Coding Sequence
- ATGATAATAAAATCGGAACCCATTGACGAACATGCGTATAAGACCAACTATATTGATGATAATACGCCGTTCGCTGATTTTAGTAAATTTCCGGAATTTGGCGACGATATGCTGAGTCCAAAGGTAGAGCTGACCGTCAAGGATGAGGCGTATCAAAAGAACCCGCTTAGTTATCCGCGTCGCAAGCTGCAATCGGAACGCTCTTCGGAGAGCTTGCCGATTTGTCAGCGCTGCAAGGAGGTGTTCTTCAAGAAGCAAATCTACTTGCGGCATGTGGCAGAGAGTAGTTGCACCATACAGGAGTATGACTTCAAGTGCAACATATGCCCCATGTCCTTTATGAACACCGAAGAACTGCAAAAGCACAAGCATCTGCACAGGGCCGATAAGTTCTTCTGCCACAAATATTGTGGCAAGTACTTTGACACCATTGCAGAGTGCGAGTCACATGAGTACATGCAGCACGAGTATGAGAGCTTTGTTTGCAACATGTGCTCGGTTACATTTGCCACACGGGAGCAGCTCTATGCGCACTTGCCACAACACAAGTTCCAGCCACGTTTCGATTGTCCCATTTGCCGGCTATGGTATCAAACGGCATTAGAGCTGCACGAGCATCGTCTGGCGGCGCCCTATTTCTGTGGCAAGTATTACGCAGGCGCACAGGCCACCCAGGCACTTCAACAGCAGCAGCAGCACGCACAGCCACAGCATCAGCAGAGCGCCAACTACAAGTTGCAGGACTGTCACATGGCCACCATGGAAATGCCGACGCCACATCAAAAGGCAACTGCTGGCAGCGCATTACCGGCAACGGCGGCGTTGAGTTCGCTGTTGCAACAACGTCAGGCAAATGCCGATGGTGCCGCATTGTTTGCATCTTCTCTGAAGAACGAAGTGAACGTCAAACTGGAGCGAAGCTATAGCAATTCTACAACAAGCGAATCTGGCTACAGTCTGCATGACAGCAGCAACTATAACAATGCATACGGAAGCGATGCCTCGTTGCATGGCGGTGCAGGAGCCGGGGCTATTGGCGGTCCACAGGCGCATTCCTCAACGCTGGACGATTCAGAGGATGCGCTTTGCTGCGTGCCCCTATGTGGTGTGCGCAAGAGCACCAGCCCCACACTGCAGTTCTTTACGTTCCCCAAAGACGAGAAGTACTTGCATCAATGGCTGCACAACCTCAAGATGTTTCATATTCCGGCATCGAGTTATGCCACATATCGCATCTGCAGCATGCATTTTCCGAAGCGCTGCATCAATCGATATTCATTGTGCTATTGGGCTGTGCCCACGTTCAATTTGGGACACGATGATGTTGCCAATCTATATCAGAATCGCGAGCTAACGAACACCTTTACCACCGGCGAGGTGGCACGTTGCAGCATGCCCAACTGTACCAGTCAACGAGGGGAAAGCAATCTCAAGTTTTACAACTTCCCGAAGGACATAAAGAGTCTTATCAAGTGGTGCCAAAACGCACGTTTGCCCGTCCAGGCCAAGGAGCCGCGTCACTTCTGCAGTCGACACTTTGAGGAGCGATGCATTGGCAAGTTCCGGCTAAAGCCATGGGCGGTGCCAACACTGCATTTGGGCGCCCAATACGGCAAGATTCATGACAATCCCAAGAACTTGTATGTGGAAGAGAAGCGCTGTTGCCTTAACTTCTGTCGACGCAGTCGCTCCTCGGACTTCAACATGTCGCTGTATCGCTTCCCCAGAGATGAAGTGCTCTTGCGACGTTGGTGCTACAATTTACGCCTCGATCCTGCGGTCTATCGTGGCAAGAATCACAAGATATGTAGTGCACACTTCATCAAAGAAGCACTGGGATTGCGAAAACTATCGCCGGGTGCCGTTCCTACGCTTCATTTGGGTCACAACGATACGTTCAACATCTATGAGAACGAACTTAAATATCAGCGTCATTCGGCTGCCTCAACATCTTCGTCGGCCAGTTCGGCATCGCACTATGTCGATCCAGAGTTGAGTGCATCGTATATGGGCATGGGAAGTGGATCGGGTGGATCCGCGTCGGGCCTGAATGTGAGCGACAGCATGGATGTGTGCTGTGTGCCAAGTTGTGAGAGCAAGCGACACAACAACGAGAACATCACATTCCATACGATACCGAGGCGACCCGAACAGATGCGCAAATGGTGTCACAATCTTAAGATACCCGAGGATAAGATGCACAAGGGAATGCGGATTTGTAGTCTACACTTTGAGCCCTATTGCATTGGCGGCTGTATGCGTCCGTTCGCGGTACCCACACTGAATCTCGGGCACGACGACGACGATATACATCGCAATCCGGATGTAATTAAGAAGCTGAATATACGTGAAACCTGCTGTGTGGCTGTGTGCAAGCGTAATCGGGATCGCGATCATGCCAATCTGCACCGTTTCCCCAGCAATGTCTCGCTGCTGACCAAATGGTGCGCCAATCTGCAGCGACCTGTGCCCGATGGCACTAAACTCTTCAACGATGCCATCTGTGAGGTGCACTTTGAAGATCGTTGCCTTCGCAACAAGAGATTGGAGAAATGGGCAGTGCCTACGCTGGTGCTTGGACATGACAACATTGCATATCCACTGCCAACGCCGGAGCAGGTGGCCGAGTTCTATGCCCGTCCCAGTGCACCCAACAATGGCGAGGAGCAGGGCGAGTGCTGTGTTGAGACCTGTAAGCGTAATCCCAGTGTCGATGATATCAAACTCTATCGCCCGCCCGAGGAGTCGTCCGTGCTGGCGAAATGGGCACATAATCTCGAAGTGGATGCCGCCAAGTTGTCCACCATGCGCATATGTAATCTGCACTTTGAATCCCATTGCATTGGCAAGCGTATGCGTCCTTGGGCCATACCCACACTCAATCTGTCCAACAATGTTGAGAATATGTACGAGAATCCCGAACATCATATGATCTACAAGCGTCGCACACAAATTAAGACCGAGAGAGGATCAGGTGCATCGGGAGCGGGAGCAAGCATGTCGAAGCCCACTTGGGTGCCACGTTGCTGCTTGCCACATTGCCGCAAGGTGCGTGCTCTGCACAATGTGCAGCTCTATCGCTTCCCCAAACTCAATCGTTCCACGTTGGCAAAGTGGGCGCATAATCTGCAAGTGCCACTGGTGGGCAGTGCCCAGAGGCGACTCTGCTCTGCCCACTTTGAGCCGCACGTGTTGAGCAAAAAGTGTCCGGTACCGCTGGCGGTGCCCACGCTGGACCTCAATGCGCCACCCGGCTATAAAATCTATCAGAATCCCGCTAAGCTCAAGGCCAACAAGCTGTGTCTGCAGCGGGTCTGCATTGTGGAGAGCTGCCGTCGTCAACGTGGTCAGGGCGTGCAGCTCTTCCGACTGCCGCACAGTCCCACGCAGCTGCGCAAATGGATGTACAACATACGGATGCGACCCAGAGGTGCCATGCGGCAACAGTATCGCATCTGCTCGACACACTTTGAGACGCATTCGTTCAATGGAAAGCGACTGAGTGCGGGAGCGATACCAACACTGGAGTTGGGCCACGACGACGAGGACATCTATCCAAATGAGGCGCAATCATTTGTGGAGGAACACTGTGCCATCGAAGGGTGCGAGTCATCCAAAGAGCAGCCAGATGTGCGTCTCTTCCGCTTCCCCACCAATGATGAGGATTTGCTCTGGAAATGGTGTAACAATCTGAAAATGAATCCTGTCGATTGTATTGGCGTGCGCATCTGTAACAAACACTTCGAGGGAGATTGCATTGGACCTAAGCATCTTTACAAGTGGGCGATACCCACGCAGGAGCTGGGCCACGATGACTCTCAGATTGAGCTTATTACGAATCCCAAGCCAGAGGAGCGATATGTTGATCCCGTGTTCAAGTGTTGTGTTCCCACCTGCGGCAAGACGCGTAAATTCGATGAAGTGCAAATGAACAGCTTCCCCAAAGATCCTACGCTCTTCGAGCGCTGGAAACACAATCTGCGACTGGAGCATCTCCAGTTTAAGGAGCGCGAACGCTATAAGATCTGTGATGCACACTTTGAGGACATTTGCATTGGCAAGACACGTCTTTATATTGGTTCCATACCCACACTAGAGTTGGGACACGACGATACGGAGGATATATTCCGGGTAAATCCTGCTGAGTTGCAAAGCAACTTGTTCGGTCGCCAGCGACGTGTACAGGAATCAGATATGAGGGTCAGAATAAAGCAGGAGGAGCACTCCGAGCAAGAGGAGACGGATGTTACCATGTCGGAGGCTACGGATTTGAATACCAAGCAGGTAAAAATAAAGAAACGCCTTGCGGACTTGAAGTGCTGCGTGCCCAGTTGTGGTCGCAGTCGTCTGGAGCACGGCGCTCGTTTGTTTCCCTTTCCAACTGGCAAACAACAGCAGAGCAAGTGGCGCCACAACTTGCGTCTGGATGCCAGCGAAGTGGACAAGACGTCGCAAGTTTGCAGCGCACACTTCAATCGACGTTGCATCGATGGCAAGCAGCTGCGAGGCTGGGCAATCCCCACACAGCAGCTGGGTCACCAAGAGCAACCTATCTATGAGAATCCAAAGAATATACCGGGCTTCTTTACGCCCACCTGTGCGCTGGCGCACTGCCGCAAGCGGCGAAGTATTGACAATGACTTGAGGACTTATCGTTATCCCCGCAGCGAGGAGTTGTTGGAGAAATGGCGAGTGAATCTACGCTTGGCTCCAGATCAGTGTCGCGGACGCATCTGTGCCGATCATTTTGAGCCAATGGTGCGTGGCAAGCTGAAGCTGAAGACGGGTGCAGTGCCAACACTTAAGCTGGGACATGATGAGGGCGTGGTCTTTGACAATGAGGCAATTAAGGCGGGACTGCAGCTGGATGGAGTCGTAGAGCCAGAGGTGGAGGAGGGTGAGACAAGCAGAGAATCGCTGATTAAGATTAAAAAGGAAAAAATAGAACCAGATGAGGAGGAGGCGGAGGAAGATGTGGATGCAGATGAGGATGACGATCATGAACATCCGCTCCCGCCGCTGCTGCGTCGCGAGAAGCCCGTGAATAATGTGACGCCAATTTGTTGTCTAAGGCATTGCCGCAAGGAACGCACAGCAACACATCAGCTGAGCACCTTTGGCTTTCCCAAGGATCGCCTGCAGCTGCGCAAGTGGAGTATCAATCTACAAATTCCCGAGAGCGATTGCGTGGGACGTGTTTGCATTGAGCATTTCGAGGCGGAGGTGCTGGGCACGCGTAAGCTCAAACAGCATGCAGTGCCTACATTGAATTTGGGCCATGACACGCCCCTGATCTACCGATGCAATGGACAAGCGCAGCCCATGGGAGGCATCTTCGATGAGCAGCCGCAGCATTCGGTTTTTCGGCTTTGGAGCCTGAAACACTGTCGCAAAAGGAAGCTGGAAGCAATGGAACCGCCCGATCATCATCATCATCAACGAGCAGTGATTAAGCAGGAGATGGAGGAGGAGCAGGAGGATGGGCTCAGCTGTTGCCTGCCCCATTGTGGCAAGCGGGCAAACGACGTAAAACTGCATCGTCTGCCGAGCAATCGCTTTCGACTGCGCAAATGGTTGCACAATCTCAATCTACCCCAGAGCATTCCCATCGGCAGTCAGACACGTGTCTGCAGCGAACACTTTGATGTGCAACCGGGTGAGACACTTGAGGATTGTATGCCAACCTTGAACCTGGGTCACAATGATACCAATCTATATAGAAATCAAAACTCCACCGCCTCATCTTCCTGCTTGGTGCCAAGCTGCCCGTGTGCACGACTCAATCTCTATCGTGGCTATGATTTGCCTGAGCATCATCTGGTACAGCAGGCTTGGCTGGAATTCCTTCGACAACCCATGCCGCAACTGCCAGGCGATGGCCAGCTCTGTGTGATGCACTATATGCAGCTGTACGAGCAGGTGTCGCTGCCGCAGGAGGTGCCAGCGACAGTGTTGCGTCAACTGCAGGAAACCTACGAACAGATTGCCAACTCCACCATGGCCATGAAGTTGCGTTGCGCCGTGCCCGGCTGCTACTCCAAGTATACCGATAACATAAGGTTGACCAAGCTGCCCACCTGCTCGGACATGTTTGCCAAGTGGCTGCACAATACCAAGATCAAATACGATGCCAACAGGCATTATATCTATCGCATTTGTATGCTCCACTTTGAGCCACGTTGTCTGGGTCCAGTACGTCCCAAATTGTGGGCGGTGCCGACGCTGCAACTACACCACAACGATCCAAATATCTATAGGAATCCCAAGTTGGATGGGAGTGGCCAAGAGGTAGAGCAAGTCCTTCCACAACCAGTGCCCGTGCCCTTAGAGCTGCCATTGCGTATCAAGACGGAGCTGGCTTATACTGGCAGTCCGAGTGCCAGTGCCAGTCCAAGTCCACGTGGAAAGTTGCGCATGTGTTGCATTCCCAGCTGTGCCCAGCAGGCCAACTCGTTAGTGCGTCTCTTTCGCTTTCCCACCTCTGAAACATTGCTGCTCAAGTGGCTGGTGAACACACAGCAGAAGCCACGTCTGGCAGATCCACAGCATCTGTTTGTCTGCCAAGATCATTTCGAACCGGAAGCCATTTGCAAGAAGCAGCTAAGCAGTTGGGCTGTGCCCACATTGAAGCTAGGACACGATGGTCATATCATTCCCAATGCCAAGCACAATGGAAACATTGCGGACACACATGAGAATCGACACACGCTGCAATTTATTTGGGCCAACTACTGCTCTGTGCTGGACTGTTTTGAGCCCCGCAGTGAGCAACTGCGTCTCTTTGCATATCCCACGGATCGACCCACCATACGCAAGTGGGCTGCCAATTGCAAGCATCGTTCCATGCAGGCCAGCAGCGATGGCTTTCAGGTTTGTCAATCTCACTTTGAGCCACATTGTTTTGATCCGGAGACGGGAGAGTTGCGAGCGGATGCAGTGCCCACGCTGGAGCTAAGTCGTGGCATCAATAAGACGGGTTGCGTGGTGCCCGGTTGTGTGAAGGATGAGAATGGACCGCGTCGCCGCCTGTTCAAGATGCCCAAACGTTTTACGCAGCTGTTAGATTGGTGTCACAATTTGCAGCTGGATCGATCGACCATCTCAATTGAGAAGAGTGTTGTGTGTGATCGTCACTTTGAGTCGCATTGCTTGAATGTGCATAAGCAGCTACGTCGGGGAGCACGTCCCACACTTCATTTGGGTCACACTGCGGACGTTGAGATATTGCAGAACCCTTCGAATTGGGCCCGTTGCACTGAGGTGCCTGAGACTCCCGCTGATGTCTGCTGTGTGCCCAATTGTGGGCGGGACAAGGATGCAGAGGAAGAGGATACTGATGTGCAGCTCTTTAGTTTTCCCAAGGTGCGAACACTGGCAGAAAAATGGCTGAGGAACATTCGTCTCGAGGCAAACAAGGAACAGTTGGCAGAGTTGAAAGTCTGCAATGCGCACTTCGAGGTCTACTGTCTGGAGAATGGACGCCTGCAGTTGGGTGCCATGCCCACACTGCAGTTGGGACACGATGATCATCATGATATACATCGCTCCAGTTGTCTCACTTCATTGTCGGCTTCCAAAATGAAACGATATTGCAATAGAAATGGATACAGCTATGATTGCTGCTTTCCTCAGTGTGTGGAGTTGCAGAAGAGTTATCTACGGATAAGCTACAATTTGCCACAGTCGCAAGCGATGCGTAATGCCTGGATAGCCTACATGGAGTTGGAGGAACCGAAGGAGCAAGAAAAGAAGCAGCAACTAAAATTGTGCCCGCTGCATTTGATCATTCTTTACGAGCACAGCGTAGACAATTTCCCCGAGCACACAACAGAGGAACTTCTGGAGGATAACTATGCAGCTGCGAGAAATAGTGTGCGTATACGTATCATTAGTTGTGCCGTGCGTGGCTGCAGGACTCTGAAGACCCGCGATGGCGGCTGCCTTCATGGTCTGCCCACGCGTCGGGATATACTGCAGATGTGGTTGCACAACATGCAGTTGGTGTTCTACGAGCAACAGCGTTACATGTACAAGATCTGTAGCAAACACTTTGAGCCCAATTGCTTTACGGACACAACACGCCGCCTCAAGCCATGGAGTATGCCAACGTTGGAGTTGCCTGAGCCCGAACCGGGTGAACCAGAAATCTATCAGAATCCCACAGAGGCTGAATGGCAACAAATGAATGAGCAATGGGCAGCAGAGCAGGCGCAAATAGCTCAGCCAATGATGGAGGATGAAGAGCATGATGAAGCTGTGCATGTGAAGCTTGAGCCGGAGACGGAGATGGAACAGGAGAATCTGCTGGCATATGAGGAAGAGGATTACTCGCAGCCTCCAGTGGATGGAGAAGAGGATGCGAACTCGCAGCAGCCGCTGGAAATGCAACCCTTGGAGGTGCTGCTTGAGGTGGGTCATGTGGAGAAGTGCAGAACATACGAGCAAATGGACACGAAGGCAAATCTTAATTATGCCGAACAACAGGCGGCGGCTGGACCGTTGCCTAGCAATGGAATCTTTGCCAGTAATGGCAACAAGTACACTGCACGCAACTGCAGTGTACAAGGCTGCCACGTGACCTTGAACGATATCTGTGGCAATCTGAAGTTGCACAAATTTCCCACGTCGTGGGAGGCAATGCAAAAGTGGATGCACAACACCCAAGTGAAGGTGTGCAGAAGTGTCTCGTGGCGCTTTCGCATTTGCAGTTATCATTTTAATGAAGATTGTTTTCAAAGTTCTCGCCTAAGGCGTGGTGCCATGCCCACCTTGCAGTTGGGCCCCAAGCGACCTTCGCATATCTATGAGAGTGAATTCAATGTTGGCGATGGCGATGAGGAGCAGCCCATTGAAGAGCCATCAGAACAAACACCAAAGTCAAGAGTGGTAGGAGGCGATAATATAAGTATGTGCCTGCCTAGCCCAGCACCGCCTCGCAAGTCGAGTAAGTACTGCCAAGTCGATGGTTGTCCTAATCATTTGACCAGCGAGAACATGACGCTGCACAAGTTTCCCCACTCGCCGGACATGTGTGCCAAGTGGCAGCATAATACACAGGTGCCATTTGATCCCGTCTTCCGTTGGCGTTATCGCATATGCAGCGCCCACTTTGAGCCGATCTGTCTGTTGAATATGCGTCTCCTCCACGGCAGTGTGCCCACGCTGCGCTTGGGTCCACGTGCTCCCTTGCAGCTCTTCGACAGCGACTTTGAGGCCATCAGCATGCGCTTGGACAAGCAGAAGAGCAGCTCAGATCAACAGTTGTATATCAAAGAGGAGCAGGAAGAGGATCACGATGATGCGGAGCAAGATGAGGATGAGTTTAGTTTCCTGGTGCCAGAGATGCAGCTGCATGAGGATGCAGGGGTAGATGAAGCTAACATCAAGACGGAGCACAGTCAAACCTATAACGGACGCTGGAAGGATCTGCGTTTGCCCAGTATTAAGCAGGAGGATACAACGACCATGACGTGCTATAATCCCGTCAAGTCTGGCTACGACAAGTGCTCGCTGGTGCATTGTCAGCGTCAACGATCTCTGCACGGTGTGCATATCTATAAGTTCCCCCGTTCCCATCAACTACAGCAGCACTGGATGCACAATCTGCGAATACGCTATGACGAACGGAGGCCCTGGAAAACAATGATATGCAGTGTACACTTTGAGGCGCAATGTATTCGTCTGCGTAAGCTGCGTCCCTGGGCGGTGCCCACACTGGAGCTGGGTGACAATGTGCCCCAGGAGATCTTTACCAATGAACAGAGTCGGCAACAGTTCGAAAATGATGAAATGAATGTGGATTTTGACTTGGATATAAAACACCCAATGCTGGAGGAGGATTATGGCGATGATGATGATGATAACGATGCTGATGGTGACACGGAAGATGGAAATGCGTCAGTGCGTCAGCCACTTTGGAAGAAGAAAAAGCAAAACCACAATCAGCTTGTAAAGATAAAGACTTGCTCCTTACCCTACTGTCGCAGTCCACGTGGCGATGGCATTAAGCTGTTCCGACTGCCGAACAGGCTCAGCGACATACGCAAGTGGGAGGTGGCAACGGGAATGCATTTCACCGAATCCCAGCGCAACACGAAGCTCATCTGCAGTCGTCACTTTGATCCGCAGTTGATTGGCGTGCGTCGTCTCATGTACAATGCGGTGCCGACTCTACATCTGCGACCAGAGACTGTGCGAGAAGAAAGGATGCCTCCGCGTTCCCGTCCAGCGGCGCCACGTTGTTTCATGCCCAGTTGCCCACAGGATTTGCAGCAAAAGTTGCACAAGTTTCCCAGTGATCCTCGACAGCTTCAGGAATGGTGTGCAGTGCTCAGAGTCTCCGTAAGGCAGGACTTTAGTAACTTGTACATTTGTGACGATCATTTGCCAACGCTAGCAAGAAGCTGCCTCATCTGCAGCGAAGAGGAACTGCAGCTGCCACTGCTCAACTTCCCAGAGCATCGCAATCAACGCGCCAAGTGGTGCCATAATCTTAAAATCGATCCCATTCCCAAGTGGGATAACTCAAAGCACATTTGCAGTCGACATTTTGAGAGCTACTGCTTCATCCAGCCAGGGCAACTGGATCCCGAGGCTGCTCCCACGTTGCATTTAAAGCACACGGACAGCAACATATTCCTTAACGATTATGCCATGGATTCCAGCAAGATGATGTTGCGAATTAAGGATGAGCCCTTGGAGAGCGACGATCTGATGCTGTAA
- Protein Sequence
- MIIKSEPIDEHAYKTNYIDDNTPFADFSKFPEFGDDMLSPKVELTVKDEAYQKNPLSYPRRKLQSERSSESLPICQRCKEVFFKKQIYLRHVAESSCTIQEYDFKCNICPMSFMNTEELQKHKHLHRADKFFCHKYCGKYFDTIAECESHEYMQHEYESFVCNMCSVTFATREQLYAHLPQHKFQPRFDCPICRLWYQTALELHEHRLAAPYFCGKYYAGAQATQALQQQQQHAQPQHQQSANYKLQDCHMATMEMPTPHQKATAGSALPATAALSSLLQQRQANADGAALFASSLKNEVNVKLERSYSNSTTSESGYSLHDSSNYNNAYGSDASLHGGAGAGAIGGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLHQWLHNLKMFHIPASSYATYRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPNCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPAVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELKYQRHSAASTSSSASSASHYVDPELSASYMGMGSGSGGSASGLNVSDSMDVCCVPSCESKRHNNENITFHTIPRRPEQMRKWCHNLKIPEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLNLGHDDDDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVSLLTKWCANLQRPVPDGTKLFNDAICEVHFEDRCLRNKRLEKWAVPTLVLGHDNIAYPLPTPEQVAEFYARPSAPNNGEEQGECCVETCKRNPSVDDIKLYRPPEESSVLAKWAHNLEVDAAKLSTMRICNLHFESHCIGKRMRPWAIPTLNLSNNVENMYENPEHHMIYKRRTQIKTERGSGASGAGASMSKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPLVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTLDLNAPPGYKIYQNPAKLKANKLCLQRVCIVESCRRQRGQGVQLFRLPHSPTQLRKWMYNIRMRPRGAMRQQYRICSTHFETHSFNGKRLSAGAIPTLELGHDDEDIYPNEAQSFVEEHCAIEGCESSKEQPDVRLFRFPTNDEDLLWKWCNNLKMNPVDCIGVRICNKHFEGDCIGPKHLYKWAIPTQELGHDDSQIELITNPKPEERYVDPVFKCCVPTCGKTRKFDEVQMNSFPKDPTLFERWKHNLRLEHLQFKERERYKICDAHFEDICIGKTRLYIGSIPTLELGHDDTEDIFRVNPAELQSNLFGRQRRVQESDMRVRIKQEEHSEQEETDVTMSEATDLNTKQVKIKKRLADLKCCVPSCGRSRLEHGARLFPFPTGKQQQSKWRHNLRLDASEVDKTSQVCSAHFNRRCIDGKQLRGWAIPTQQLGHQEQPIYENPKNIPGFFTPTCALAHCRKRRSIDNDLRTYRYPRSEELLEKWRVNLRLAPDQCRGRICADHFEPMVRGKLKLKTGAVPTLKLGHDEGVVFDNEAIKAGLQLDGVVEPEVEEGETSRESLIKIKKEKIEPDEEEAEEDVDADEDDDHEHPLPPLLRREKPVNNVTPICCLRHCRKERTATHQLSTFGFPKDRLQLRKWSINLQIPESDCVGRVCIEHFEAEVLGTRKLKQHAVPTLNLGHDTPLIYRCNGQAQPMGGIFDEQPQHSVFRLWSLKHCRKRKLEAMEPPDHHHHQRAVIKQEMEEEQEDGLSCCLPHCGKRANDVKLHRLPSNRFRLRKWLHNLNLPQSIPIGSQTRVCSEHFDVQPGETLEDCMPTLNLGHNDTNLYRNQNSTASSSCLVPSCPCARLNLYRGYDLPEHHLVQQAWLEFLRQPMPQLPGDGQLCVMHYMQLYEQVSLPQEVPATVLRQLQETYEQIANSTMAMKLRCAVPGCYSKYTDNIRLTKLPTCSDMFAKWLHNTKIKYDANRHYIYRICMLHFEPRCLGPVRPKLWAVPTLQLHHNDPNIYRNPKLDGSGQEVEQVLPQPVPVPLELPLRIKTELAYTGSPSASASPSPRGKLRMCCIPSCAQQANSLVRLFRFPTSETLLLKWLVNTQQKPRLADPQHLFVCQDHFEPEAICKKQLSSWAVPTLKLGHDGHIIPNAKHNGNIADTHENRHTLQFIWANYCSVLDCFEPRSEQLRLFAYPTDRPTIRKWAANCKHRSMQASSDGFQVCQSHFEPHCFDPETGELRADAVPTLELSRGINKTGCVVPGCVKDENGPRRRLFKMPKRFTQLLDWCHNLQLDRSTISIEKSVVCDRHFESHCLNVHKQLRRGARPTLHLGHTADVEILQNPSNWARCTEVPETPADVCCVPNCGRDKDAEEEDTDVQLFSFPKVRTLAEKWLRNIRLEANKEQLAELKVCNAHFEVYCLENGRLQLGAMPTLQLGHDDHHDIHRSSCLTSLSASKMKRYCNRNGYSYDCCFPQCVELQKSYLRISYNLPQSQAMRNAWIAYMELEEPKEQEKKQQLKLCPLHLIILYEHSVDNFPEHTTEELLEDNYAAARNSVRIRIISCAVRGCRTLKTRDGGCLHGLPTRRDILQMWLHNMQLVFYEQQRYMYKICSKHFEPNCFTDTTRRLKPWSMPTLELPEPEPGEPEIYQNPTEAEWQQMNEQWAAEQAQIAQPMMEDEEHDEAVHVKLEPETEMEQENLLAYEEEDYSQPPVDGEEDANSQQPLEMQPLEVLLEVGHVEKCRTYEQMDTKANLNYAEQQAAAGPLPSNGIFASNGNKYTARNCSVQGCHVTLNDICGNLKLHKFPTSWEAMQKWMHNTQVKVCRSVSWRFRICSYHFNEDCFQSSRLRRGAMPTLQLGPKRPSHIYESEFNVGDGDEEQPIEEPSEQTPKSRVVGGDNISMCLPSPAPPRKSSKYCQVDGCPNHLTSENMTLHKFPHSPDMCAKWQHNTQVPFDPVFRWRYRICSAHFEPICLLNMRLLHGSVPTLRLGPRAPLQLFDSDFEAISMRLDKQKSSSDQQLYIKEEQEEDHDDAEQDEDEFSFLVPEMQLHEDAGVDEANIKTEHSQTYNGRWKDLRLPSIKQEDTTTMTCYNPVKSGYDKCSLVHCQRQRSLHGVHIYKFPRSHQLQQHWMHNLRIRYDERRPWKTMICSVHFEAQCIRLRKLRPWAVPTLELGDNVPQEIFTNEQSRQQFENDEMNVDFDLDIKHPMLEEDYGDDDDDNDADGDTEDGNASVRQPLWKKKKQNHNQLVKIKTCSLPYCRSPRGDGIKLFRLPNRLSDIRKWEVATGMHFTESQRNTKLICSRHFDPQLIGVRRLMYNAVPTLHLRPETVREERMPPRSRPAAPRCFMPSCPQDLQQKLHKFPSDPRQLQEWCAVLRVSVRQDFSNLYICDDHLPTLARSCLICSEEELQLPLLNFPEHRNQRAKWCHNLKIDPIPKWDNSKHICSRHFESYCFIQPGQLDPEAAPTLHLKHTDSNIFLNDYAMDSSKMMLRIKDEPLESDDLML
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01556869; iTF_01553997; iTF_01552487; iTF_01557594;
- 90% Identity
- iTF_01558329;
- 80% Identity
- -