Pdom022989.1
Basic Information
- Insect
- Polietes domitor
- Gene Symbol
- -
- Assembly
- GCA_947397865.1
- Location
- OX377619.1:181440889-181453389[+]
Transcription Factor Domain
- TF Family
- THAP
- Domain
- THAP domain
- PFAM
- PF05485
- TF Group
- Zinc-Coordinating Group
- Description
- The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 34 6.1e-15 7.3e-12 47.6 1.0 1 86 145 217 145 218 0.85 2 34 2.7e-14 3.3e-11 45.5 4.1 1 86 245 313 245 314 0.80 3 34 3.3e-14 4.1e-11 45.2 0.2 1 87 335 407 335 407 0.82 4 34 1.4e-13 1.7e-10 43.2 2.9 1 86 489 557 489 558 0.78 5 34 2e-14 2.4e-11 46.0 6.0 1 87 582 654 582 654 0.81 6 34 5.9e-12 7.2e-09 38.0 0.4 1 87 689 757 689 757 0.81 7 34 2.6e-11 3.2e-08 36.0 2.1 1 85 798 866 798 868 0.74 8 34 1.1e-14 1.4e-11 46.8 0.2 1 87 896 966 896 966 0.81 9 34 3.1e-14 3.7e-11 45.4 0.4 1 87 988 1058 988 1058 0.79 10 34 3.6e-13 4.4e-10 41.9 3.2 1 87 1086 1158 1086 1158 0.86 11 34 6.6e-06 0.0079 18.7 0.0 1 75 1232 1293 1232 1305 0.70 12 34 2e-10 2.5e-07 33.1 0.4 1 87 1328 1400 1328 1400 0.79 13 34 2.7e-13 3.3e-10 42.3 1.4 1 86 1430 1500 1430 1501 0.79 14 34 2.2e-11 2.7e-08 36.2 0.6 1 87 1548 1620 1548 1620 0.81 15 34 4.2e-13 5.1e-10 41.7 0.4 1 87 1643 1712 1643 1712 0.79 16 34 1.4e-14 1.7e-11 46.5 0.2 1 87 1917 1986 1917 1986 0.80 17 34 5.8e-14 7e-11 44.5 0.8 1 86 2040 2122 2040 2123 0.83 18 34 4.5e-11 5.5e-08 35.2 1.5 1 87 2151 2223 2151 2223 0.79 19 34 1e-13 1.2e-10 43.7 0.8 1 87 2249 2319 2249 2319 0.81 20 34 9.7e-13 1.2e-09 40.6 0.3 1 87 2342 2412 2342 2412 0.83 21 34 3.6e-14 4.4e-11 45.1 0.8 1 86 2433 2502 2433 2503 0.81 22 34 1.6e-06 0.0019 20.7 1.6 1 60 2520 2569 2520 2592 0.77 23 34 5.8e-14 7e-11 44.5 2.5 1 85 2606 2674 2606 2676 0.83 24 34 1.5e-13 1.8e-10 43.2 3.1 1 87 2701 2773 2701 2773 0.78 25 34 2.6e-11 3.1e-08 36.0 1.1 1 86 2793 2863 2793 2864 0.78 26 34 9.8e-09 1.2e-05 27.7 3.5 1 87 2884 2953 2884 2953 0.75 27 34 4.4e-11 5.3e-08 35.3 2.6 1 87 3196 3269 3196 3269 0.80 28 34 1.6e-09 2e-06 30.2 1.2 1 86 3291 3361 3291 3362 0.76 29 34 7e-13 8.5e-10 41.0 0.4 1 86 3393 3464 3393 3465 0.82 30 34 0.006 7.2 9.2 0.0 1 60 3500 3550 3500 3577 0.78 31 34 1.5e-11 1.8e-08 36.7 2.6 1 87 3592 3669 3592 3669 0.84 32 34 6.5e-14 7.9e-11 44.3 0.2 1 86 3694 3765 3694 3766 0.83 33 34 1.7e-14 2.1e-11 46.2 4.3 1 87 3920 3994 3920 3994 0.82 34 34 9.4e-10 1.1e-06 31.0 0.5 1 86 4013 4080 4013 4081 0.78
Sequence Information
- Coding Sequence
- ATGAATCGAATAAATCTCTTTTTTCGCCCAGATATAACCTCCATTAAAATGCCTGAGACGGTACCGTCACCTACTAAACAACAAACAAGTCATCTTCAGCATCAGCATCATCATCAAACATCACCTCCTTCGGCAAGTTTCGATGATATTCCCGACTTTGCTGCCATGCCCCATGTAGAAGTTAAGACCGAAATTAAAGTAGAGCCCGATTTCTATCCACCCATGGATCAGACTGATTTTGTCGGTTTCGACAATGACTATTCCAATTCACAAGACTTTTCAACGCCGAATTCAAATCAGAACTTGACATTCTTGCAGGATTTTCACGATAACGCCTCTAGTTCGACCAACTCTTCGTACTCCTTCAAAGCCAGCACCTCATCGACGAGCTCTAAGAAGAATAGTGAGGCCATACAGGACGAGGATGCCATTTGTTGTGTGCCCAAGTGTGGCGTGCGCAAGTTCACATCGCCCACATTGCAGTTCTTTCCATTTCCGCGGGATGAAAAGTATCTGCTGCAATGGCTGCATAACTTAAAAATGACTTATGAGCCGAGTGCCAATTACGGCGTCTATCGTGTGTGCAGTCTGCATTTTCCCAAACGTTGTGTGGCCAGATATTCGTTGAGCTATTGGGCTGTGCCCACCTTCAATCTGGGCCATGATGATGTGGGCAATTTATATCAGAATCGCGAGAGTTCGGGGGGGTTTCCTTCGGGTGAAATGGCCCGTTGCTATATGCCTGGATGTTTGTCACAGCGAGGCGAGACCAATGTAAAATTTCACAGTTTCCCGCGGGATTTGAAGACTTTGATTAAGTGGTGTCAAAACTCGCGCCTGCCTGTGCATAGCAAGGAGAATCGCTTCTTTTGCTCGCGCCACTTTGATGAGAAGTGCTTTGGGAAATTCCGTCTGAAGCCTTGGGCCATACCGACACTCCGCTTGGGAACGATATATGGCAAGATACATGACAATCCCAATATTTATCAGGAAGAAAAGAAGTGCTTTCTGCCGTTTTGCCGCAGAAGCAGATCGTACGATTGCAACTTGTCCCTGTACAGATTTCCCCGCGATGAGACGCTGCTCAGGAGATGGTGTTACAATTTAAGGCTAGACCCGGAAATGTATAGGGGGAAGAACCACAAAATTTGCTCATCTCATTTCGTCAAGGAGGCCTTGGGATTAAGGAAGCTGATTCCCGGGGCAGTGCCTACGATGAATTTGGGCCATAATGATCGCTTCAATATATATGAAAATGAGCTGTATGTACCGCCGCCACCTCCGCCTCCACCACAGCCCTCAACTTCGTCCGCTTCGGCGAAAGCCCAAAAGTTCGCCGAAATGTTTAAGCAGGAAATGGGCTCGACCTCGGCTATATACGATGAGGTCTTCATGAATTCTATGATACAGAAATTCTCCGGTTCGTCGTCTTCCAATGCCTCCAATCTGGACCTGGGAGATGTGTGTTTGGTGCCTTCGTGTAAGCGAACGCGCCACTCGGAGGAGATTACCCTGCACACGGTGCCGAAGAGAGCGGAGCAGTTGAAAAAGTGGTGTCACAATTTGAAAATGGATTTGGACAAAATGCACAAAAGCGTACGTATTTGCAGCGCCCATTTTGAGAGCTATTGCATCGGAGGCTGTATGAGACCATTTGCCGTGCCCACTTTGGAGCTGGGCCACGACGACCCGGATATATTCCGCAATCCCGATGTCATCAAGAAGCTGAATATACGAGAGACCTGCTGTGTACCTTCGTGTAAGAGAAATCGTGATCGCGATCATGCCAATCTACATCGTTTCCCCACTCATCCGGAGCTGTTGCAAAAATGGTGCGAGAACCTGGAGAAGCCTGTGCCGGATGGCACGAAGCTTTTCAACGATGCCGTCTGTGAAGTCCACTTCGAGGAGAGGTGTTTGCGCAACAAACGTTTGGAGAAGTGGTCCATACCCACCATGAATTTGGGCTACGATGATGTACCGCACAAGTTACCGTCCGAAGAGGAGATATCGGAGTATTGGACAAAACCATTTGCCCCCAACAATGGCGACGAGCAGGGCGAATGTTGTGTATCATCATGTAGACGTAATCCCCAGATTGACGATGTCAAGCTGTATCGACCGCCAGAAGATGCCGAACAATTGTTGAAGTGGGCCCACAATCTGCAGCTCGATGCTGCGAATTTGCCACTTTTGAAGATTTGCAGTTTACATTTCGAGTCGCACTGCATAGGCAAGCGTTTACTGAACTGGGCTATGCCCACTCTCAATTTGGGCTCCAAGGTGGAACATCTGTTCGAAAATCCTCCACCCACCCAAGTCGTCTATAAGAAAAAGAAAAAGGACGGCCGCTTGTCAGCCAAACACGAGATCATGAAATGGTCGCCGAGATGTTGTCTACCCCATTGTCGTAAAACGCGCGCGCTGGACAACGTGCAGCTCTTCCGATTCCCGTACGCTAATAGACAAACTCTGGCCAAATGGTGCCACAATATACAGTTGCCTTTAGTGGGCAGTTCCCACAGACGCATTTGTTCTACGCATTTCGATCCGGCGGTGCTGACCAAGAGATGTCCCATGAATTTGGCTGTGCCAACGTTGGACCTCAATACCCACCCCGGCTACAAACTCTATCAGAATCCGGCTCGCCTGAAACATGTCAAAGTGGGAGTACCACAGAGACAGTGCATCATCGAATCGTGTCTCAAAACCAAAGCGGATGGCGTCGTGCTCTTCCGTTTCCCCAACAATCGCACAGTTCTTCAGAAGTGGCGCCACAATATCAAAAATTGGCCTAAAGGCAAACTGAGCTCTCAGCTAAGAGTGTGTTCCGAACACTTTGAATCGCATTCAGTGGGCGGCAGACGCATATCGCCGGGTGCCATACCAACGTTAAATTTGGGCTATGACTCCGACGATCTGTACCCCAACGAGACGCGTTCTTTCTTTGATCTGGAAAAGTGCGTGGTCAATGGCTGCGACTCACGCAAGGACATGGATGATGTCCGTCTCTTCCGCTTCCCCCGTGACGATGAGGAGCTGCTGCAGAAATGGTGTAACAATTTGAAAATGAACACCTTGGACTGTGTGGGTATACGTATATGTGCCAAACATTTCGAGATTGAGTGTCAGGGTCCCAAACTCCTGTACAAATGGGCCATACCCACGCTAAATCTGGGCCACAAGGAGGAGGACGCCGTGGAAATAATACAGAATCCTCCGCCGGATCAAAGAACCGGAGAGTATATTTTCAAATGTTGCGTACCCAGCTGTGGCAAGACGCGCAAATACGATGATGCCCAAATGAACAGCTTCCCCAAACATCTGAAAGCGTTCCGCAAATGGAAGCACAACCTGAAGTTGGACTTCCTCAATTTCAAAGAAAGGGAAAAGTATAAGATCTGCAATGACCACTTTGAGCCGGTGTGTGTGGGGAAGACCAGACTGAATTTCGGCGCCATACCAACCATAAACTTGGGCCACAATGACGACACTGAAGATCTGTACAAAGTCAATCCCGACCGAATACGTCCGAATTTGTTTATCAAACAAAAGGATATAGAACGTATGGAAAGGAAACAGTTGAGATTGGAGGAGATCAAGTTGAATATGGATATGGACGAAGAGCAACAACAGGAAATGGATCAGGATCAAGACGACGATGCCTTGGACCCCTTGAGTACACCGGCCGAGTGTTGTGTTGAAGACTGCAAATCGCCCAAATCCATAATGAGAGAACCCTACGACTTGCCGGAGACGATGGCGTTGAGGCTACTTTGGAGCAAAGAAATAAAGAAGGATATTGGCGATTTGTCGGCGGAGAGCAAAGTGTGCGGTTTGCATTTTAAGCAGTTCTTCGACGCTTTGAAAGACGAAATGGAAGCCCTAAAGGAGGAAAGCCCTGAAGTTAAACTAGACTATGGCAAACTTTTGTTTGCCTATCAAAAAACTGAAGTCTCGCTGGTGCTCAAAGGTTTCCAGTGCCGCGTAGAAGGATGTCCCACAAATTTGCTCAATTCCGAGCATCGTTTGTATTTCTTCCCCTATGGCAAGGAAATCGTCAACAAATGGTCGCACAACACGGGCATAATACCCGATGAACATCGGCGCTATGTCAACAAAGTGTGCGCTTTGCACTTTGAGCCGTACTGTGTGACCGAAACGCAGCGCCTTAGATCGTGGGCCATACCTACGCTGAACTTAAAGCACTGCGGTCCGAAAACCGTCTACAAAAATCCAGATCTGACGAGAATAGATAGACGAATGATAGGGCCACAGATATTGAAGTGTGCCGTTGCAAATTGCGCCAGTGCCAAGGAAACGGAAACGGAGGCCCTGAAGCTGTTCAACTTCCCCACGGACGATGACCTGTTGCGGAAATGGTGTGCCAATTTGAAAATGTCACGGCATCTGACGCCACTATTCAAGATCTGTTCGGCGCACTTTGAGAAAATATGTTTTGGCAGTGCGCGCATCCGCTCCTGGGCTATTCCCACCAAGAATTTGGGGCACGACGAGAACCCGGAATATTTCAACAAGACCACCATCAAGCAAGAAGTGTATGAGCGCCGCGCAAACAACAGCGAGCAGCAACTGCAATTGAAACAGGTGAAAATTAAGAAATCCTTGGACACTGTCAAGTGTTACATAGCTACATGTCGCCGATCACGACTGCAACATGGTGTCCGCTTTTACGGCTTGCCCGTCCACGGTAAAATGAAACGCAAATGGCTGCACAATCTACAAATACCTTCCAGCAAAGCTGGCAAAGTTTTGAATCTGAAAATCTGTAATTTGCATTTCCACAAGCGCTGTTTAGAGGGCAAAACCTTGAAGGCGTGGGCAGTGCCCACCATGCATTTGGGTCATACAGAGCCCATATTCGACAATCCGCGCAGATTACAAAACCCCTTGGTCGTGCAACGTTGTGCTTTGCCACATTGCAAGAACCACGCTGTCGGAAATGGATCTCTGCGAACGTTTGTATTTCCCAAATCTCCGGAATTCCTAGAGAAGTGGTCGAAAAACCTCAAGCTGGATGTGGTGAAATGTAAGGGTCGCCTCTGTCACGAACACTTCGAAGCGGGCGTTAAGGGCGATAAGAAACTGAAGAATGGGGCAGTACCCACCGTAAATCTGGGCCACGATGACCAAATTCCCTACGACAATTGCGAGTTGATAAAGAAACTGCAAATGAAACCAAGCGACACAGAAACGGGCAACAAAGCCGAAAAAATCCTGCCGGATCAAGGTGATGAAATGGATGAGGAAGAAGAGGTAGGCGCAGACGAAGACGACGAAGAAGAATTTGAAGAGGATGAATTCGAAGAAGACGATGTTGATGGAATTGATGATGATGATGATGAAATGATCGAGCAAGACCACAATGATGAGGAGGATGACGATGATGAGCATGATGAGAATGGCTGTGAGCAAGATGAAGACGACGATGAAGAAGTGGACATGGACAAGGTTCGCATCCGTGGCACTTTGCAGCACTGGAGTTCGATAAAAATGAAAGAATTACGTGTCACCCTTGTGCCCATACGCCACGAGGATCTCTTGGAAATATCCTCGGTATCTTCGTATGATCGTGACCGCTGTTCTATAACACCCGCCAGCAGCCTCAAAGATCTGCGTTCCGAAACTCCGGCCAGTGTGGGCCTTACTTCGTCGGAGTACAACGATGACAACTCCTCCAGTACACCGCTGAGAACCGACAAACCTCTGAATAGCATTGCTCCCATGTGCTGCCTCAAGCACTGTGGCAAAGAGAAAACTCCCGAACAGCATTTAACCACATATGGTTTCCCGAAAGATGCCCAGCTACTGCAGAAATGGTGTGATAACTTGGGCCTGCAGCCTGAGGAATGTATAGGGCGTGTGTGCATAGACCATTTTGAGTTGAGGGTTATAGGCACACGCCGACTGAAGCCGGGAGCAGTGCCAACCATAAATCTGGGTAACTCTCGCATAGCGAAACACACCAACGATGAGCCAAAGAAGACTGTAAGCAGCGAGGTTGAAGCGAAGGCTGGCGGCGACAACGAGCAGATACTGACACCACCGCCACCCTATTGTAATCCTAAATCGGGAAAACAATCGGTTTTTCGGCTATGCTGCCTCAAGCACTGTCGGCGCAAGAAACAACCAGAGGCAGCGCAGCCTCAGATCCCTGAATCTGCCGATCGCACACTACTACTATTTAAGTTCCCCAAAGACTATGAGACTCTGAAGAAGTGGTCGGCGAATTTACGTTTGCCGGAGAAAACCTGCGGCCGCAAAGAACTGCGCGTGTGTTCCAAACACTTTGAAGCTGACCTTATTGTGGGCAACTCATTGAAGGCCAACGCTATACCCACTTTAGACTTAAGTTACTCACAACGTCCGGCAGTTTTCAAAAACAACAGCAGCAAACAACAGAAAGCGGCAGATGTCGAGGCTCCAAAATGTTTCCTCGCACACTGCGCCAGGAAAGAGGATGCCGAAACCTTTCTGCTCAGTTTTCCCCAGCATGATTTAACCATGCAGCGTAAATGGTGTAAAAATCTCAAATTGGACAGCAAATTGCCGACGATCAAAGATTTAAAGATCTGCAAGCACCATTTCGAAAGCTATGCTTTCTACAAGCAGCGAAATTTGAAGACCGGGGCTGTGCCCACTCTAAATCTGGGTCATACGGACCGTATTATCAAAAATGTGCCGAAAATTCGCCGCAAAGTGAGAACTGAGCCCAAGGAGAAGTGCTGTATTAAGACTTGTGACAATCACGACACCAAAAAACTCTACGCCTTCCCCAAAAACTCGGAGTTGAGACGTATATGGTCCAATAACTTGCAGATCGAGCTGAGAGAAGCTTTACGTTCCCACTATAAACTATGCGAAAAACATTTTTCGCCCGAGAGCTTTGTGGCCGGTAGCGATGTTCTCAAGATAAACGCTGTGCCGTCACTGAATCTGGGCTTTGAAGTGGACAATCTTAAGGTTTTAAGCAAAATAGAGACTGAGGACGACAAATTCAAATGTGTGGTCGAAAGCTGCCAAAAGTCGTCCAGTGTTGATAAGGTGAAACTTTATGGGCTGCCAAAATCCAAGGAGCTGCTCAAGAAATGGCTGTTCAATCTGAATCTGTCGGCCGACATAGAGCTGGACAAGACGCGCATTTGCAATAGGCATTTCGAGAAGCTGTGCATTAAGCATGGCATTCTGCACGAGAAGGCCGTGCCCACCATGTTCCTGAAGGCAAAGGCCTGGTTCTATCAGAACGACGACGATGTGTTTGAGGAGAACTACAGGTGTTGCGTGTTAAACTGCAACTACCAGACGAGCGAAGAGGACTACCGCACAATGTACAAATTTCCCAAGCTAAAGGAGGACACAGACAAGTGGCTGCATAATCTCCGATTGCAAATTGAAGACGTCAAGGACTTGCGTATATGCTCCCTACACTTTGAAGACAACTGCAAAATTAAGGATCATTTACAGGCCGGCACAGTGCCCACTTTGCAGCTGGGCCACGAACAAACCGAAGACATTCACCGCAACCACATCCAGAAATGCTGCATAGACAAATGCTGTTGGCTAGGTTTCAATTGCCACAAATTTCCCGAAGATGAGGCGCTCAGAAGCTGCTGGTTAAAGGCCTTTGCAAGTGTACAACAGCCAGTCAGTAATTTGGAATACATCTGCTCAATTCATTTTGTGTCGTGGTATGACACTATTGAGGAGGGCGCGGCGGGGGAGATGCCAGAAAATGAGATGCTAAAGCAACTGTACGACCAGCTTAAGGATTTGCCGGAACTGCAAAGTTTCAAGTGTTCCGTGCCCACGTGCGAAACGGGCTTTAAACTCTCCGTGAAGCTTTTTAAATTCCCCAAAGATCCGACGATTTTACAGAAATGGTTGCACAACTCCTCGCTGACATTCGATTATGCCGAGCGACCGCATTATCGAATTTGTGCCCAGCATTTCGAAGAGCGCTGTCTCAGCGAGAAGAAGCTACATCGCTGGGCCTTGCCCACGCAAAAGCTGCCGTACAATTTAAGCCTGTATGTGAATCCGCCCGAGGCTTTGCCCTCGCACCATGAAAACCTCAAACATTGCTGTGTGTCGTCCTGCAACACCGAAAAGGGGCCGTTCTTCAAGTTTCCCGCCAAATCGTATGATGTCAAGAAATGGATACACAATCTCGATTTGGGAGCTCAGCAGTGCACTTTGAATCTGAGAGTGTGTCACAAACATTTCGAAAGTTATTGTGTGTCCACCGATGAGGTGGGAAGTATTAAAAAACTGAAAAACTGGGCAGTGCCAACATTGAATCTGTCGCGCACCACCGAACTGCATGACAATCCGCCGGAGAAGGTGGACTATTTCGCCTGCTGTGTGTGCCGAGAATTGCAGAACAAGGCTGAGGGTTTATATCTCTTTCGATTCCCCACCAGATTGGCCAGTTTTCTAAAATGGCTGCACAATCTGAGATTACAGCGCAGCGATTATCGCGACAGTATGCGAATTTGCCTGCGACATTTCGAAAACGATTGTTTCGACAAGACACTGAAATTGTTGCGCAAACATTCTGTGCCCACGATAGATGTGGCCTGTCCGGCCAAGGATATGTTTAGGAATCCTCTAAGAAGACCCCAATCGAAGTGTTGCGTGGCAACTTGTGAGGGTCCTTGGACACATCTCAACTTATTTCCCAAGGAGAAAATCGTCTTAAAGAAATGGTGCTTCAATTTGAACATAAAAGAAACGGACTTGGAGACGCTAAAGAATTGGAAAATTTGCCAGAAACATTTCGAGGCCAAGTGTCTGAATGCATTCGGCCTGATAAGACCAACAGCTGTGCCTACCCTAAATCTGGGCCACACTAGTAAAATCTTTAAAAACTCCAGGCTAACGCAGAAAGCAAAAAGTGGAGGCGTAGGCGGCATACTGAAAGCTTCGAAATTTCTATTAAAGAAGAACATTAAAATTGAAGGTCAAAAGAGGCCAGAGGGGCCAGAGCAACTTGTGCCAAATGCAGGCAGCTCAAAATCCTCAAAACCTTTTCTAAAGAAGCATATAAAGATTGAAGGCCAAAAGAGTGCTGCTTTAAAAAAATTAAAAGCCCCTCCCCAGGAGGATGAAACAGCGAGTGTTGCTTGTGTTGTTGACGATTTGGATAAATATGAGACGTTGGGTGATTATATAAGAGAAAAGAAAAAAATAAAAGCTTCACAGATGAAACAATCCCCGCCGCCCGCCTCCTTGCAAAACACCAGAAAGAGAAAAATCGCCAAGAAATTCTTACCCTTAAAACGTGAACCTGTGAACAGACCTCTGCTGGAAATCATTTGTGAGAACACGAGCGCAACATCACCACCACCTGTTGCATTAAATCCCATCGACGAAATTACCCACTCCATGAACACATTGCTAGATTTCCATCACCACATCAAACAAGAATTCTTAGAAGATACCGAAAATTCTCTGGAGGTGCCTGAGCTCGATCATGGTTTGGTAGCAGTCAAGATGGAGACTATTGAGGAAACTAACCTAGACGAACGGCCAGCTTCTGCCCAACGACTGGGACTCCAACAGCGGTGCTCCATAACAACCTGCCCGAATGTATCCAACACTCCCGGTTGCACATTCTTCAAATACCCTCCAAATCCTAGACTTCGCGCGATATGGGTCAAACATTGTCAGGAAACGTGTAAATTTATTGTGACTTTGGCCAAGAAACGTAAAATCTGTGCCGATCATTTCGAGGATCAGTGTTTAATCGATAACCGCCTTTTGCTGGGAGCAATACCAACACTGAATCTGGGGGATGGCAGCTCATTAGAAGATTGCGCTGAGGTTTTGAAAACGTTTCGCCATCAACGATGTCGCCTGGATGATTGCCAGAGATCTGTGGAACTGGATCAAATAAACAAAATACACTTTCCTGAAGCTGAAGAACTCCGAAAGAAATGGTGTTTCAATTTGAATCTAAATGAGGCGGATATAACGCAAATGGACTGGATTTGTCACAAGCATTTCGAGAGACGTGTGCTCATCAAAAGCAAACGCATCAAGGACGAGGCAGTGCCTACACTGTTGCTGGGCTCTCAAGGCAAACCGGAGCAGGAACTTTACAAAAATCCCGAATACATCAACCAACAGCAATTCCTCAACCAACTAAGGCAGGTGTGCTGCGTAGCCAGCTGCGTGAATACCAAACAAACGCCCGGTGTGCGTTTGGCCACCTTCCCCAAGAGGAGAGACATCTACGAGAAATGGTTGCATAATTTGGATCTAGAGGACACCGTGGAAGTGCGCAATGGCTATCACATCTGCTGGCAGCATTTCGAGGAGGTCTGCTATACGAAATATAATTACTTAAAGATTGGTTCCATACCCACCTTAAAGCTGGACAAGAACGATTTGATACCCTTGGATGGCAGTCAGCTGGTGGAGGTGGACAACTTTGGCAAACGAGTCAAACATTATATGCTGCTGAAAAAGTATAAATGTGCCCATCCCCAGTGTCCAGAGGGCAAGACACATGTGCACAAATTACCCGAAATAGCCCCGCTTAAGACGTTATGGCTGGATAGCCTGGGGATACAGCAACCCCCCACCATCACGGAGGACATAAACTTTTGCGATGAACATTACTATGAGCTCTACAAGAAATTTGAAGACGATTTGCCACAAATCAACGACGAGATCTACCAAAATGAGTTGGCCACCCTGCGCAAAACATTCAAGGAACTGGCGGCGAAAAGTAAATTCTTCACCAAACAGTGTCTGGTGCCGCAATGTCCAACGGATCACTATTTTGCCCGCTACAAGGATATCAAGCTGTACAACTTTCCTTTAAATACGGATATTTCAAGGAAATGGTGCCACAACTGTGACATCGATTACAAACAACTCGATACGGGTAAAATCAACTATTACAAGATCTGTGACAGACATTTCGAGAGCTATTGCTTCAACAAGCGTTCGATATTCTCGTGGGCTTTGCCAACATTAAACTTGCCGGACTCCAGGCGGCCGCCAGACATACTGGAAAATGATGCGGACGACAAAAGCGCCTACACCGGCGAGTGTTGCATACGCTCCTGCATAAATGCCAACGGCTATAAATTGGAATCGAAAACCCGCCTATATAAATTCCCCGACAACACGGAAACCCTCACGAAATGGCTGCACAATATTGGCGGTGAAAACTATCACGAGAATGAAACGAAAATATGTGGCCTACATTTCCGTTCCAGGTACATCAAAAAGAGAAAACTAACGGCCGAAGCCATACCAACGTTGAGATTAGGTCACAGGGAGGAAGCGGAAATTTTCACTCAAGCCACGAGCAGTCCGGCGGCAATTGTAATTAAAGAAGAGCTGGAGGAGCCAGAAGAAGACATAGAAATGCTGGTGGAACAAAAAGTCCAAGGCTCTGTGACCGACGAATGGAATGAGCATGACTATTGCTATGAACTTAAGACGGAAAAATCTCCCAGCGATGATCAAACATTTGAGGTGATTGCCATAAAACAGGAAATCATTGAAAATCAATATGGGCTCGATGAACAGCCGCCACAGTACGAAACACCACCCGCCATTAAACAGGAAATCATAGAGATTGAAGAGACCCACACCTATGATCAGCAAGAAGACTATGAATACTATGACGAATTGCTTGTGGAAGGCACGCCACAAGCGCCTCCGCCCGCCCACGATTTTCTCAGTCTTGTGATATCGGAAGTGAAGTCGCACATTTTCCTTTGTTGTGTGCAAAAGTGTCAAAACTCCTCCGAATCGCAGGACATTAAACTCTACACCGAATTTCCCCGGGATTCGGAGATATTCATCAAATGGTGTTTCAATGTTAAAATCGATCCACGCAACTACAAAGAGAATCAATATGCCATATGCAGCCAACATTTCGACTCGGTATGCTTTAGTGACAGCGAGCTGACACTACACCCGTGGGCTGTGCCCACCTTAAACTTGAATCTGCCGGAGAATGCGTTTATCCATCACAACGATACGCCCAGCGAGCAGTGCATAGTCTATGGCTGTATACAACCGATGCCACCGCTCTACAAGTTTCCATTGCGTTTGGATTTGTGTCAAAAATGGTTTGCAAATCTCAAATTAGACCTGACCGACTATAGGGCACTAAATTATAGGATATGTCGTAGACATTTCATACCCGAATGTTTTGATGTAAATCATGTACTGAAGACAGAATCGATACCCACCCTCTGTTTGGGTCATGCCGATAGCATAGCACATTTAAATGCCTTCGAGGCTGGGCGACAAGAAGCGGACAGAGTTGGTGGTGGTGTGGGTGTGGTTGGAGTCGACCACCACTCTATGGCCTTAGTTGTGGGCGGCGGCCTGGACAATAGTCGTGGCAGTAGCCAGGGTTCGCAGGGGCGGCATATGATATCGCCCAATGATCTGGAAGATCATGATAGTAGTTATTACGAAGATTTCGAAGAATGCTATGGTGCGGACGATTAA
- Protein Sequence
- MNRINLFFRPDITSIKMPETVPSPTKQQTSHLQHQHHHQTSPPSASFDDIPDFAAMPHVEVKTEIKVEPDFYPPMDQTDFVGFDNDYSNSQDFSTPNSNQNLTFLQDFHDNASSSTNSSYSFKASTSSTSSKKNSEAIQDEDAICCVPKCGVRKFTSPTLQFFPFPRDEKYLLQWLHNLKMTYEPSANYGVYRVCSLHFPKRCVARYSLSYWAVPTFNLGHDDVGNLYQNRESSGGFPSGEMARCYMPGCLSQRGETNVKFHSFPRDLKTLIKWCQNSRLPVHSKENRFFCSRHFDEKCFGKFRLKPWAIPTLRLGTIYGKIHDNPNIYQEEKKCFLPFCRRSRSYDCNLSLYRFPRDETLLRRWCYNLRLDPEMYRGKNHKICSSHFVKEALGLRKLIPGAVPTMNLGHNDRFNIYENELYVPPPPPPPPQPSTSSASAKAQKFAEMFKQEMGSTSAIYDEVFMNSMIQKFSGSSSSNASNLDLGDVCLVPSCKRTRHSEEITLHTVPKRAEQLKKWCHNLKMDLDKMHKSVRICSAHFESYCIGGCMRPFAVPTLELGHDDPDIFRNPDVIKKLNIRETCCVPSCKRNRDRDHANLHRFPTHPELLQKWCENLEKPVPDGTKLFNDAVCEVHFEERCLRNKRLEKWSIPTMNLGYDDVPHKLPSEEEISEYWTKPFAPNNGDEQGECCVSSCRRNPQIDDVKLYRPPEDAEQLLKWAHNLQLDAANLPLLKICSLHFESHCIGKRLLNWAMPTLNLGSKVEHLFENPPPTQVVYKKKKKDGRLSAKHEIMKWSPRCCLPHCRKTRALDNVQLFRFPYANRQTLAKWCHNIQLPLVGSSHRRICSTHFDPAVLTKRCPMNLAVPTLDLNTHPGYKLYQNPARLKHVKVGVPQRQCIIESCLKTKADGVVLFRFPNNRTVLQKWRHNIKNWPKGKLSSQLRVCSEHFESHSVGGRRISPGAIPTLNLGYDSDDLYPNETRSFFDLEKCVVNGCDSRKDMDDVRLFRFPRDDEELLQKWCNNLKMNTLDCVGIRICAKHFEIECQGPKLLYKWAIPTLNLGHKEEDAVEIIQNPPPDQRTGEYIFKCCVPSCGKTRKYDDAQMNSFPKHLKAFRKWKHNLKLDFLNFKEREKYKICNDHFEPVCVGKTRLNFGAIPTINLGHNDDTEDLYKVNPDRIRPNLFIKQKDIERMERKQLRLEEIKLNMDMDEEQQQEMDQDQDDDALDPLSTPAECCVEDCKSPKSIMREPYDLPETMALRLLWSKEIKKDIGDLSAESKVCGLHFKQFFDALKDEMEALKEESPEVKLDYGKLLFAYQKTEVSLVLKGFQCRVEGCPTNLLNSEHRLYFFPYGKEIVNKWSHNTGIIPDEHRRYVNKVCALHFEPYCVTETQRLRSWAIPTLNLKHCGPKTVYKNPDLTRIDRRMIGPQILKCAVANCASAKETETEALKLFNFPTDDDLLRKWCANLKMSRHLTPLFKICSAHFEKICFGSARIRSWAIPTKNLGHDENPEYFNKTTIKQEVYERRANNSEQQLQLKQVKIKKSLDTVKCYIATCRRSRLQHGVRFYGLPVHGKMKRKWLHNLQIPSSKAGKVLNLKICNLHFHKRCLEGKTLKAWAVPTMHLGHTEPIFDNPRRLQNPLVVQRCALPHCKNHAVGNGSLRTFVFPKSPEFLEKWSKNLKLDVVKCKGRLCHEHFEAGVKGDKKLKNGAVPTVNLGHDDQIPYDNCELIKKLQMKPSDTETGNKAEKILPDQGDEMDEEEEVGADEDDEEEFEEDEFEEDDVDGIDDDDDEMIEQDHNDEEDDDDEHDENGCEQDEDDDEEVDMDKVRIRGTLQHWSSIKMKELRVTLVPIRHEDLLEISSVSSYDRDRCSITPASSLKDLRSETPASVGLTSSEYNDDNSSSTPLRTDKPLNSIAPMCCLKHCGKEKTPEQHLTTYGFPKDAQLLQKWCDNLGLQPEECIGRVCIDHFELRVIGTRRLKPGAVPTINLGNSRIAKHTNDEPKKTVSSEVEAKAGGDNEQILTPPPPYCNPKSGKQSVFRLCCLKHCRRKKQPEAAQPQIPESADRTLLLFKFPKDYETLKKWSANLRLPEKTCGRKELRVCSKHFEADLIVGNSLKANAIPTLDLSYSQRPAVFKNNSSKQQKAADVEAPKCFLAHCARKEDAETFLLSFPQHDLTMQRKWCKNLKLDSKLPTIKDLKICKHHFESYAFYKQRNLKTGAVPTLNLGHTDRIIKNVPKIRRKVRTEPKEKCCIKTCDNHDTKKLYAFPKNSELRRIWSNNLQIELREALRSHYKLCEKHFSPESFVAGSDVLKINAVPSLNLGFEVDNLKVLSKIETEDDKFKCVVESCQKSSSVDKVKLYGLPKSKELLKKWLFNLNLSADIELDKTRICNRHFEKLCIKHGILHEKAVPTMFLKAKAWFYQNDDDVFEENYRCCVLNCNYQTSEEDYRTMYKFPKLKEDTDKWLHNLRLQIEDVKDLRICSLHFEDNCKIKDHLQAGTVPTLQLGHEQTEDIHRNHIQKCCIDKCCWLGFNCHKFPEDEALRSCWLKAFASVQQPVSNLEYICSIHFVSWYDTIEEGAAGEMPENEMLKQLYDQLKDLPELQSFKCSVPTCETGFKLSVKLFKFPKDPTILQKWLHNSSLTFDYAERPHYRICAQHFEERCLSEKKLHRWALPTQKLPYNLSLYVNPPEALPSHHENLKHCCVSSCNTEKGPFFKFPAKSYDVKKWIHNLDLGAQQCTLNLRVCHKHFESYCVSTDEVGSIKKLKNWAVPTLNLSRTTELHDNPPEKVDYFACCVCRELQNKAEGLYLFRFPTRLASFLKWLHNLRLQRSDYRDSMRICLRHFENDCFDKTLKLLRKHSVPTIDVACPAKDMFRNPLRRPQSKCCVATCEGPWTHLNLFPKEKIVLKKWCFNLNIKETDLETLKNWKICQKHFEAKCLNAFGLIRPTAVPTLNLGHTSKIFKNSRLTQKAKSGGVGGILKASKFLLKKNIKIEGQKRPEGPEQLVPNAGSSKSSKPFLKKHIKIEGQKSAALKKLKAPPQEDETASVACVVDDLDKYETLGDYIREKKKIKASQMKQSPPPASLQNTRKRKIAKKFLPLKREPVNRPLLEIICENTSATSPPPVALNPIDEITHSMNTLLDFHHHIKQEFLEDTENSLEVPELDHGLVAVKMETIEETNLDERPASAQRLGLQQRCSITTCPNVSNTPGCTFFKYPPNPRLRAIWVKHCQETCKFIVTLAKKRKICADHFEDQCLIDNRLLLGAIPTLNLGDGSSLEDCAEVLKTFRHQRCRLDDCQRSVELDQINKIHFPEAEELRKKWCFNLNLNEADITQMDWICHKHFERRVLIKSKRIKDEAVPTLLLGSQGKPEQELYKNPEYINQQQFLNQLRQVCCVASCVNTKQTPGVRLATFPKRRDIYEKWLHNLDLEDTVEVRNGYHICWQHFEEVCYTKYNYLKIGSIPTLKLDKNDLIPLDGSQLVEVDNFGKRVKHYMLLKKYKCAHPQCPEGKTHVHKLPEIAPLKTLWLDSLGIQQPPTITEDINFCDEHYYELYKKFEDDLPQINDEIYQNELATLRKTFKELAAKSKFFTKQCLVPQCPTDHYFARYKDIKLYNFPLNTDISRKWCHNCDIDYKQLDTGKINYYKICDRHFESYCFNKRSIFSWALPTLNLPDSRRPPDILENDADDKSAYTGECCIRSCINANGYKLESKTRLYKFPDNTETLTKWLHNIGGENYHENETKICGLHFRSRYIKKRKLTAEAIPTLRLGHREEAEIFTQATSSPAAIVIKEELEEPEEDIEMLVEQKVQGSVTDEWNEHDYCYELKTEKSPSDDQTFEVIAIKQEIIENQYGLDEQPPQYETPPAIKQEIIEIEETHTYDQQEDYEYYDELLVEGTPQAPPPAHDFLSLVISEVKSHIFLCCVQKCQNSSESQDIKLYTEFPRDSEIFIKWCFNVKIDPRNYKENQYAICSQHFDSVCFSDSELTLHPWAVPTLNLNLPENAFIHHNDTPSEQCIVYGCIQPMPPLYKFPLRLDLCQKWFANLKLDLTDYRALNYRICRRHFIPECFDVNHVLKTESIPTLCLGHADSIAHLNAFEAGRQEADRVGGGVGVVGVDHHSMALVVGGGLDNSRGSSQGSQGRHMISPNDLEDHDSSYYEDFEECYGADD
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -