Dspr012222.1
Basic Information
- Insect
- Drosophila sproati
- Gene Symbol
- -
- Assembly
- GCA_018904355.1
- Location
- JAEIFY010000122.1:1480189-1491621[+]
Transcription Factor Domain
- TF Family
- THAP
- Domain
- THAP domain
- PFAM
- PF05485
- TF Group
- Zinc-Coordinating Group
- Description
- The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 28 6.3e-15 1.1e-11 45.1 4.1 1 86 107 179 107 180 0.85 2 28 2.4e-15 4.1e-12 46.5 4.6 1 87 207 276 207 276 0.83 3 28 5.9e-16 9.9e-13 48.4 0.4 1 87 298 370 298 370 0.85 4 28 5.2e-16 8.7e-13 48.6 5.7 1 87 465 535 465 535 0.83 5 28 7.2e-15 1.2e-11 44.9 3.4 1 86 559 630 559 631 0.81 6 28 1.4e-12 2.4e-09 37.6 1.6 1 87 666 734 666 734 0.80 7 28 2.5e-11 4.2e-08 33.6 1.9 1 86 782 851 782 852 0.77 8 28 5.3e-17 8.8e-14 51.8 0.3 1 86 879 948 879 949 0.82 9 28 2.9e-12 4.8e-09 36.6 1.3 1 86 970 1039 970 1040 0.80 10 28 1.1e-15 1.9e-12 47.5 1.7 1 86 1067 1138 1067 1139 0.85 11 28 5.7e-14 9.6e-11 42.1 1.6 1 85 1216 1284 1216 1286 0.82 12 28 2.6e-12 4.4e-09 36.7 0.1 1 86 1309 1377 1309 1378 0.82 13 28 4.2e-14 7.1e-11 42.5 0.9 1 86 1527 1595 1527 1596 0.82 14 28 7.9e-12 1.3e-08 35.2 0.7 1 61 1649 1703 1649 1724 0.80 15 28 6.5e-05 0.11 13.0 0.1 1 58 1730 1781 1730 1805 0.78 16 28 2.7e-11 4.6e-08 33.5 0.1 1 86 1820 1889 1820 1890 0.83 17 28 3e-14 5e-11 43.0 1.3 1 87 1948 2018 1948 2018 0.81 18 28 9.9e-14 1.7e-10 41.3 0.8 1 86 2053 2124 2053 2125 0.83 19 28 2.2e-13 3.7e-10 40.2 1.6 1 87 2135 2207 2135 2207 0.82 20 28 6.2e-14 1e-10 41.9 0.1 1 87 2230 2301 2230 2301 0.77 21 28 4.7e-06 0.0078 16.7 0.1 1 58 2334 2387 2334 2406 0.84 22 28 6.3e-15 1.1e-11 45.1 0.1 1 86 2425 2497 2425 2498 0.80 23 28 3.9e-14 6.5e-11 42.6 1.4 1 86 2632 2704 2632 2705 0.81 24 28 1.3e-14 2.1e-11 44.1 2.4 1 87 2768 2839 2768 2839 0.83 25 28 7.3e-15 1.2e-11 44.9 4.0 1 86 2952 3022 2952 3023 0.85 26 28 1.8e-13 3e-10 40.5 0.1 1 87 3115 3185 3115 3185 0.85 27 28 3.8e-10 6.4e-07 29.8 0.4 1 58 3202 3250 3202 3262 0.87 28 28 7.4e-09 1.2e-05 25.7 2.2 18 87 3267 3325 3256 3325 0.75
Sequence Information
- Coding Sequence
- ATGGGCACCATTGAAATGACTCCACCGCAGCACAAGGCGAATGCGGCATTACCGGCAACGGCGGCGCTTAATTCGCTGTTGCAGCAACGCCAGGCGAACGCTGATGGCGCCACTTTATATGCCTCGTCGCTGAAGAACGAGACGAACGTGAAACTGGAGCGCAGCTATAGCAACTCCACCAGCGAGTCTGGTTACAGTATGCACGAGAGCAGCTATAACAATGCCTACGCCAGCGACAATTCTCTGCATGGCGGGGGCGGGGCAATTGGTGGTCCGCAGGCGCATTCCTCGACGCTGGACGATTCGGAGGATGCGCTGTGCTGTGTGCCACTTTGCGGAGTACGCAAGAGCACAAGCCCGACGCTGCAATTCTTTACGTTTCCCAAAGATGACAAGTACTTGCATCAGTGGCTACACAACCTCAAGATGTTTCACATTCCGGCGTCGAGCTATGCTACCTTTCGCATCTGCAGCATGCACTTCCCTAAGCGTTGCATCAATCGTTACTCTCTGTGCTATTGGGCGGTGCCCACATTTAATCTGGGCCACGACGATGTGGCCAATCTCTATCAGAATCGTGAGCTGACCAACACATTCACCACAGGCGAGGTGGCCCGCTGCAGTATGCCAAACTGTACTAGTCAGCGTGGTGAAAGTAATCTGAAGTTCTACAACTTTCCCAAGGACATCAAGAGTTTGATTAAGTGGTGCCAAAACGCTCGCCTGCCCGTCCAGGCCAAGGAGCCGCGTCACTTCTGCAGTCGCCACTTCGAGGAGCGTTGCATCGGCAAGTTCCGGCTGAAGCCTTGGGCAGTGCCCACCTTACATCTTGGCGCCCAGTACGGCAAGATTCATGACAATCCCAAAAATCTGTATGTGGAGGAAAAGCGCTGCTGCCTCAACTTTTGTCGTCGCAGTCGCTCCTCCGACTTCAACATGTCGCTGTATCGCTTCCCCAGGGATGAGGTGCTACTGCGTCGTTGGTGCTACAATCTACGCCTTGATCCGGCTGTCTATCGTGGGAAGAATCACAAAATTTGTAGCGCTCACTTTATCAAAGAAGCATTGGGATTGCGCAAGCTATCTCCGGGCGCTGTGCCCACGTTGCATCTGGGTCATAATGACACCTTTAACATCTACGAGAACGAACTGTGGCCACCGCCAACGCCCTCCACGCCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCTCGGCTGCGTCCACATCCTCGTCGGCCTCGTCGACATCGCATTATGTGGATCCGGAGCTAAGTGCATCCTACATGAGCATGGGCGCTGGAGGGTCATCCTCTGGCCTGAATGTCAGCGACAGCATGGATGTCTGCTGTGTGCCCAGCTGCGAGAGCAAGCGCCACAACAATGAGAACATCACATTCCACACAATACCCAGGCGGCCAGAGCAGATGCGGAAGTGGTGTCACAATCTTAAGATACCCGAGGACAAGATGCACAAAGGCATGCGGATATGTAGCTTGCACTTTGAGCCCTACTGCATTGGCGGCTGCATGCGTCCGTTTGCGGTGCCGACATTGCATCTGGGCCACGACGACGAGGACATTCACCGCAATCCGGATGTGATCAAGAAGCTCAACATACGCGAAACTTGCTGCGTGGCTGTTTGCAAACGCAATCGTGACCGGGACCATGCCAATCTGCATCGCTTTCCCAGCAATGTGCCGCTGTTGACCAAATGGTGCGCAAATCTGCAGCGTCCTGTGCCGGATGGCAGTAAACTGTTCAACGATGCCATCTGTGAGGTGCACTTTGAGGATCGATGCCTGCGCAATAAACGGCTAGAGAAGTGGGCAGTGCCCACACTCATCCTTGGCCATGAGAATATACCCTATCCGCTTCCCACGCCGGAGCAAGTTGCCGAGTTCTATGCGCGTCCCACTGCGCCTAACAATGGCGAGGAGCAGGGCGAGTGCTGTGTGGAGACGTGCAAGCGTAATCCCAGTGTTGATGACATCAAGCTATATCGCCCGCCCGAGGAGTCGCAGGTGCTGGTAAAGTGGGCGCACAATCTCCAACTGGAGATTGCCCAGCTACACAATATGAGAATATGCAATCTGCATTTCGAAACCCACTGCATTGGCAAGCGGATGCGTCCCTGGGCAATACCCACGCTCAATCTGGCAACTAACATAGAGAATCTCTACGAGAATCCCGAACACCAGATGCTCTACAAGCGGCGCACGCATCTCAAGCCGGGCAGAGCAGCGCGAAGCTCTGAAGCAAGCGCTGGTGGTGTGAAGCCCACCTGGGTGCCACGCTGCTGCTTGCCACACTGCCGCAAGGTGCGTGCCACACACAATGTCCAGCTGTATCGCTTCCCCAAACTCAATCGTTCCACGCTGGCCAAGTGGGCGCATAATCTGCAGGTGCCGCTCGTGGGCAGCGCTCAGCGTCGCCTCTGCTCCGCACACTTTGAGCCGCATGTGCTTAGCAAGAAATGCCCGGTGCCCATGGCGGTGCCCACACTGGACCTCAATACACCATCCGGTTACAAGATCTATCAGAATCCGGCCAAGCTCAAGGCGAATAAGCTGTGCTTGCAGCGTGTCTGCATTGTGGAGAGCTGCCGGCGTCAGCGGGCGCAGGGGGTGCAGCTCTTCCGTCTGCCTCACAGCCCCACCCAGCTGCGTAAGTGGATGCACAACATCCGGATGCGGCCCCGAGGAGCTATGCGACAACAGTATCGCATCTGCTCGAAGCACTTCGAGACGCACTCGTTCAATGGGAAGAGACTCAGTGCGGGTGCAATTCCAACGCTTGAGTTGGGCCATGAGGACGAAGACATATTTCCGAATGAGGCGCAGTCTTTCGTGGAGGAGCACTGCACCGTCGAGGGCTGCGATGCCGTCAAGGAGCAACCGGATGTGCGTCTCTTCCGCTTCCCCAACGACGATGAGGATCTGCTCTGGAAGTGGTGCAACAATCTGAAAATGAGTCCGGTCGACTGCATCGGCGTTCGCATCTGCAACAGACACTTCGAGACTGATTGCATTGGACCAAAGCACCTGTTCAAGTGGGCTATTCCCACGCTCTCCCTCGGCCACGATGATGATGACATCGAGTTGATGCTAAATCCCAAGCCGGAGGAGCGCTATATTGATCCGGTATTCAAGTGCTGTGTGCCCTCGTGCGGCAAGACGCGTAAATTCGATGAAGTGCAGATGAACAGTTTTCCCAAAGATCCGGAGCTCTTCCAGCGCTGGCGCCACAATCTCCGCCTCGAGCATCTCAACTTCAAGGAGCGCGAACGCTATAAGATCTGCAACGCCCACTTCGAGGACATTTGCATTGGTAAGACGCGCTTGAACATTGGCTCCATACCGACACTGGAGCTTGGCCATGACGAGACTGATGACTTGTTCCAAGTCAACCCCGAGGAGCTACAGAGCAATCTCTTTGGACGCCAGAGACGCGTGCAGGATTCCATGAGGATCAACATTAAGCAGGAGGCGCACTCCGACCTCGATGAAGACACTAAACCGGACATTAACATGTCGGAGGCCTCAGATTCAAATACAACACAGGTGGCTAAAATCAAAAAATCTATAACCGATTTGAAGTGCTGTGTGCCGAACTGTGGTCGCAGTCGGCTGGAGCATGGTGCCCGCCTCTTTCCGTTTCCGAACGGGAAACAGCAGCAGAGTAAGTGGCGCCACAATCTCCGGCTGCCCGCTGCCGACGTGGACAAGACGACGCGCATCTGCAGCGCCCACTTCAATCGCCGTTGCATCGATGGCAATCAGCTGAGGGGCTGGGCAATGCCCACACAGCAACTGGGACATCAGGAGCTGCCGATCTATGAAAATCCAAAGAATATACCGGGCTTCTTTACGCCCACCTGTGCGCTGGCGCACTGCCGCAAACGGCGCAGCATTGACAACGATCTGCGTACCTATCGCTATCCACGCAGTGAGGAGCTGCTCGAGAAGTGGCGTGTCAATCTGCGCTTGTCGCCGGACCAATGCCGCGGACGCATTTGTGCGGATCACTTCGAGCCACTGGTGCGTGGCAAGCTGAAGCTGAAGACTGGAGCAGTGCCTACGCTCAAATTGGGACACGACGAGGGCGTAGTCTTCGATAATGAGGGCATTAAGGCGGGTCTGCAGCTGGAAGAGGAGGCGGAGGAAGAAGAGGGCAATGCCAGCTTGAAGTCGTTGGTCAAAGTAAAGACTGAGCAGGAGGATGAGCAGGAGCTAGAGAATGAAGATGAAGAGCAGCTGGAGCAGGAGAAGTATCAAGATATGGACGAAGATGGGGAAGAGCACCGAGACTCTGAGGAACATGGCTATTTTGATCCCTTGGAACTTGTGGAAACCTANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTCCGCGGCGCGAGAAGGCTGTGAATAATGTGACGCCTATTTGCTGTCTGAAGCACTGTCGCAAGGAGCGCACCGCCATCCATCATCTGAGCACCTTTGGCTTTCCCAAGGATCCGCAGCTGCTGCTCAAGTGGAGCGCCAATCTGCAGCTACCATTGGAGTCGTGCATGGGTCGTGTATGCGTCGAGCACTTTGAGCCCTCGATGCTGGGCACGCGCAAGCTGAAGCAGAATGCGGTGCCCACCTTGAAACTGGGCCATACCACACCGCTCACCTACAGCTGCAATGGCCGGATGCTATCGGGCATTTACGATGAACAGCCACAGCACTCGGTTTTTCGGCTTTGGAGCCTGAAACACTGCCGCAAACGGAAACCGGATCTGGCGGAGATTAAGCCCGGTCGTCGCTGTTGCCTGCCAAGTTGCGGCAAGCAGTCGGAGTCGCACGGCGTCCAGCTGCAGCGTCTGCCGAAGGATCGTCTGATGCTGCGCAAATGGTTGCACAACCTCAAGCTGCCTCCAACGATGGACTGCACCCAAATGTTCCTCTGCAGCGATCACTTTGAGCTGAATGCGCCGTGTCCCACTTTGAAACTAGGACACTCGGATACCAATATTTATCGCCACAGTGTGGCTAGCACCAGTGGCAGCTGCCTGGTGCCCAAATGTACTTGTGCTCGTCTCAATCTCTATCGCGGCTATGATCTGCCTGCGCATCCGCAGGTGCAACATGCCTGGCTACACTGGCTGCAGCTGCCCCATCCGCAGCCGTCGCCCAGGCACGCCCAGCTGTGTGTGATGCACTTCATGCAGCTCTACGAACTGGTGCCGCTGCCCGAATCGGTGCCAGATGTTGTGCGCAGGCAGCTGCGGGAGACTTACGAACTGATATCCAGTTCCAGCATGGCCATGAAGCTGCGTTGCGCTGTGCCCGGCTGCTACTCGAAGTATACGGACAATGTGCGTCTGACCAAGCTGCCCGTTTACCCCGACACCTGCGCCAAGTGGGTGCACAACACCAAGATTCAATATGATCCGGCCCGACATTATGTCTATCGCATCTGCATGTTGCACTTCGAGCCAGGTTGCCTGGGCCCAGTGCGTCCTAAAGTGTGGGCAATGCCAACGCTGCAGCTGCACCACAAGGATGCCAACATCTATTTAAATCCCAAGCTGGATGGCAGCCAAACACAGCCGGCCGTGCCGCTGGACCTGCCACTGCGTATTAAAACTGAGCTGCCTATGTGCAACAGTCCCAGCTTTAGTGCGAGTGCTAGTCCCAGTCCGCGTGGCAAGCTGCGCACTTGCTGCATTCCCAGCTGCGGTCAGCAGGCTTCGGCCCTGACGCGTCTCTTTCGCTTTCCCAGCGCAGAGACATCGATACTGAAGTGGCTGGTGAATACCCAGCAGCAGCCACGCTTTGTCGATGCACAACGGCTGTTCGTCTGCCAGGATCACTTTGAGGCGGAGGCCATTTGCAAGAATCAGCTGCGCAGCTGGGCGGTACCAACACTGAATCTAGGACACGATGGACACATCATACCGAATGCCAAGCACAATGGCAACATTGCCGACAGCCAGGAGAACAAGCAGACGCTGCAGTTCATCTGGGCCAACTACTGTTCAGTGCTGACCTGCTTCCAGAAAAGTAGCGAGCAGCTGCGTCTCTACCAATACCCCACGGATCGGCCAACCATCCGCAAGTGGGCCGCCAATTGTAAGCATCGCTCCATGCAGGCCAGCAGTGATGGATTCCAGGTGTGTCAGTCGCATTTTACGCCGGATTGCTTTGATTCTGATACCGGGGAACTGAAAGAGGACGCTGTGCCCACACTGGCGCTGAGCCGGTCTGTCACTGAGGTGCGCTGTGTGGTCAATGGTTGCGTTAAGGACGAAGATGCATCGCGTCGCCGTCTGTTCAAGATGCCCAAGCGTAACCCACAGATATTGGATTGGTGCCACAATTTGCGGCTGGATCAGGCGGCCATGAACGGCTCGGAACAGCACGTTTGTGAACGTCACTTCGAGGCGAACTGCTTCCATGCGTCTAGAGTGCTGCGTCCAGGAGCACGACCCACACTTCATTTAGGTCACGAGGACCTAGACGATGTGATACCCAATCCACTGAACTGGGAAGAGGATGTGGTCATGTGCTGTGTCCCCCACTGCGAAAGCTCCAAGGATGCGGATGAAGTCCAACTGTTTGGGCTGCCTAAGGTGCGCCAGTTGGCGGACAAGTGGCTGCAAAATGTGCACCTCGATCCGAGCAAAGAACAACTGGCTGGCCTGAAGATCTGCAGTGTGCACTTTGAGACGAGCTGCATGGAGAATGGACGACCCACCTATGGTGCAATGCCCACACTCCATCTCGGTCACGATGAGCTCGACAATATACACCCAAGCGTAGGGTCGGTGCCGACGCAGCAGAAGCGCTACTGCAATAGAGATGGCGCCAGTCACGATTGCTGCTATCCGCAGTGCGTGGAGCTGCAGAAGAGCTATCTGCGGGTCACCTACGAACTGCCCCAGGAGCAGGAGCTCCGTCAGCAGTGGCTCTCCTATATGGGCCTGGAAGCGCAGCAGCTCGATAAACAGCAGCTGCCCAAGCTCTGTCCACTCCACCTAATCTTGCTCTACGATCACAGTGCGGATCACTTTTCGGCACACGCCGCTGAGGAGCTGTTGGACTCTAATTATGAAGCAGCGCGCAGCAGCGTTCGCATACGCGTTGTCAGCTGTGCTGTGCGCGGCTGCAGAACGCTCAAACCACGCGACGGTGGTCGGCTGCATGGTTTGCCCACGCGGCGAGATCTGCTGGAGATGTGGCTGCACAACATGCAGCTGGTGTTTTACGAGCAACAGCGTTATATGTACAAGATTTGCAGCAAGCACTTTGAGTCCACATGCTTCACGGAGACAACCAAGCGGCTGAAGCCGTGGAGCATGCCTACCCTCGAGTTGCCGGAGCGCCAACGGGGCGAGCTGCCTGCCTATCAGAATCCCACAGAGTTGGAGTGGCAACACATGAATAAGCTGCAGGTCAGCGAGAAAGTTGTTGAGGCTCAGCCGGAGCCATTACTTAATCTGGAGCCGTTGCCCAAGAAGGAGCCACCACCACCGCAGGTTGTGGAATATGAAGAGGATTGCGACAATAACTCACAGCAGCCACTGGAAATGCAGGCGCTGGAGGTGCTGCTCGAGGTGGGCCATGTCGAGAAGTGCACCACCTACGAGCAAATGGATACCGAGGCAAATCTCAACTATGCCGAGCAGTTCTCGCACAATCCCCTCAGTCCAGGTCCACCTCAATGCCGTATCCCCGTTGTCCAGAATGGACTCCACTACAGTGCACGCCACTGCAGCGTGCATGGCTGCAATGTCACCTCAAATAATCTGAGCAGCAGCATCAAGCTACACAAGTTCCCCGTCTCGCTGGATGCCATGCAAAAGTGGATGCACAACACCCAGGTGCTCGTGGACGTCAAATTCGCTTGGCGTTTTCGAATCTGCAGTCATCATTTCATCGAGGATTGCTTTCACGGCTCGCGCATCAGACGTGGGGCGATGCCCACGTTGCGACTGGGCTCACGTCGACCGAAGCATATCTATGATAATGAGTTCAACGCCCAACTGCAAGTGGAACAGTGTAAAGAAGAGGCCAGGGAGGCTCTCGCTGCCCCGCTGGAGTCTCAGCAACAGTTGCTCTCTGCGAATGTAGGTCTGCGTCTGCCGCGTCCAGCCCCGCCCTGCAAATCCAGCAAATATTGTCAGATCGAAGGCTGCTCCAATCATTTGACCAGCGAGAATGTGACGCTGCACAAGTTCCCCCATTCGTCGGATATGTGCGCCAAGTGGCAGCACAACACTCAGGTGCCCTTCGATCCCGAGTTCCGCTGGCGCTATCGCATCTGCAGCGCACACTTTGAGCCCATTTGTCTAGGCAATGTGCGACTGATGCACGGCAGTGTGCCCACCCTGAATCTGGGGCCGCATGCGCCCAAGAAACTGTTTGACAATGAATTCTTGCGTCTGGACAAGCCAATGAGCAGTTCGGAGCTGGGTATGACCGTCAAACAAGAACAAATGGAGCAATTTGATCAAATGGAGCTGGAAGATGGCAACCAGGAGCAGGATGATTTCAGTCTGCTGGAGCCCGAGCTGCAGTTACACGAGGATAGCGAGGAAGAGCAAGAATATGACAATCATTTTAGCCAAAACGATTCCTATAACTGGTCCGATCAGCAGCTGCGTCTGCCCAGCATTAATCAGGAGAAGTGCACCACCATCTACAATCCAGTCAAGTCCGGCTATGATAAGTGCTCACTGGTCCACTGCCAACGACAGCGTTCGCAGCACGGCGTGCACATCTACAAGTTTCCACGCTCGCGTCAGCTACAGCAACGATGGATGCATAATTTGCGCATCCAATACGATGAGCGACGGCCGTGGAAGACAATGATATGCAGTGTCCATTTCGAGCCGCACTGTATCCGTCTGCGAAAGTTGCGTCCCTGGGCGGTGCCCACGCTGGAACTTGGGGACAATGTGCCGCTGGAGATCTTTACGAATGAGCAGAGCCAGCAGCTGTTTGCTCAGTCCGAAGCAGGCAGCGAATGTGATGACGTTGAAGTGGATGTTGAGGACACCATACTGGAGGACATGGATGATGACTATGATGACAATGACTCTGATATGAATGTGAATGCTGATGATCAAATGCGAACAGCTCCATATGTCAAAAGAGAGCGTCGCTCTCGATTTGATCCTCTGCCACCGGGTCAGCTGCCACCGTGGAAGATCAAATGCTGCTGTTTACCCTATTGCCGCAGTCCCCGCGGTGATGGCATCAAGCTCTTTCGACTGCCCAACAACATCAGCTCCATACGTAAATGGGAGCAGGCCACAGGCATGCGCTTCTATGAGTCCCAGCGAAACACAAAGCTCATCTGCAGTCGACACTTTGATCCGCAGCTTATAGGCGTGCGTCGCCTCATGTCCAATGCGGTACCCAGCCTCCATTTGGGCCCAGACAGCGCAGAGCCCGAGCTGCCTCCTGTGGGACCACGTTGCTGCATGCCCGATTGCTCTGAGGATGTCAATGTCCAGCTGCACAAGTTTCCCAAAGATCCCATGCTGCTGCATCAATGGTGTCAGGCGCTCAATCTACCGGATGTTCAAAGCTACTCCGGCAAATTCATTTGTGCGGCACATCTGCCCTCCAACGCGATGAGCTGTCTAATTTGTGGCGTGGACGATGTACAGCTGCCAATGCTGGACTTTCCCCAGAATCGCAATCAGCGCACCAAGTGGTGCCACAATCTGAAAATCGAGCCTCTGCCCAAGTGGGACAACTCAAAGCAAATTTGCTGCAAACACTTTGAGAGCTTTTGCTTTATCCAGCCGGGTCAACTTCTGGCGGAAGCATTGCCCACTCTACACTTGGAGCACGGGGATAGCAATATATTCCTAAACGATGAGACCATGGATAACAGCAAGTTGTTGCGCATCAAGGACGAGCCCATGGAGAGCGAGGATCTGATGCTGTAA
- Protein Sequence
- MGTIEMTPPQHKANAALPATAALNSLLQQRQANADGATLYASSLKNETNVKLERSYSNSTSESGYSMHESSYNNAYASDNSLHGGGGAIGGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDDKYLHQWLHNLKMFHIPASSYATFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPNCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPAVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPTPSTPXXXXXXXXXXXXXXXXXXXXXXXXXXXSAASTSSSASSTSHYVDPELSASYMSMGAGGSSSGLNVSDSMDVCCVPSCESKRHNNENITFHTIPRRPEQMRKWCHNLKIPEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLHLGHDDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVPLLTKWCANLQRPVPDGSKLFNDAICEVHFEDRCLRNKRLEKWAVPTLILGHENIPYPLPTPEQVAEFYARPTAPNNGEEQGECCVETCKRNPSVDDIKLYRPPEESQVLVKWAHNLQLEIAQLHNMRICNLHFETHCIGKRMRPWAIPTLNLATNIENLYENPEHQMLYKRRTHLKPGRAARSSEASAGGVKPTWVPRCCLPHCRKVRATHNVQLYRFPKLNRSTLAKWAHNLQVPLVGSAQRRLCSAHFEPHVLSKKCPVPMAVPTLDLNTPSGYKIYQNPAKLKANKLCLQRVCIVESCRRQRAQGVQLFRLPHSPTQLRKWMHNIRMRPRGAMRQQYRICSKHFETHSFNGKRLSAGAIPTLELGHEDEDIFPNEAQSFVEEHCTVEGCDAVKEQPDVRLFRFPNDDEDLLWKWCNNLKMSPVDCIGVRICNRHFETDCIGPKHLFKWAIPTLSLGHDDDDIELMLNPKPEERYIDPVFKCCVPSCGKTRKFDEVQMNSFPKDPELFQRWRHNLRLEHLNFKERERYKICNAHFEDICIGKTRLNIGSIPTLELGHDETDDLFQVNPEELQSNLFGRQRRVQDSMRINIKQEAHSDLDEDTKPDINMSEASDSNTTQVAKIKKSITDLKCCVPNCGRSRLEHGARLFPFPNGKQQQSKWRHNLRLPAADVDKTTRICSAHFNRRCIDGNQLRGWAMPTQQLGHQELPIYENPKNIPGFFTPTCALAHCRKRRSIDNDLRTYRYPRSEELLEKWRVNLRLSPDQCRGRICADHFEPLVRGKLKLKTGAVPTLKLGHDEGVVFDNEGIKAGLQLEEEAEEEEGNASLKSLVKVKTEQEDEQELENEDEEQLEQEKYQDMDEDGEEHRDSEEHGYFDPLELVETXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPRREKAVNNVTPICCLKHCRKERTAIHHLSTFGFPKDPQLLLKWSANLQLPLESCMGRVCVEHFEPSMLGTRKLKQNAVPTLKLGHTTPLTYSCNGRMLSGIYDEQPQHSVFRLWSLKHCRKRKPDLAEIKPGRRCCLPSCGKQSESHGVQLQRLPKDRLMLRKWLHNLKLPPTMDCTQMFLCSDHFELNAPCPTLKLGHSDTNIYRHSVASTSGSCLVPKCTCARLNLYRGYDLPAHPQVQHAWLHWLQLPHPQPSPRHAQLCVMHFMQLYELVPLPESVPDVVRRQLRETYELISSSSMAMKLRCAVPGCYSKYTDNVRLTKLPVYPDTCAKWVHNTKIQYDPARHYVYRICMLHFEPGCLGPVRPKVWAMPTLQLHHKDANIYLNPKLDGSQTQPAVPLDLPLRIKTELPMCNSPSFSASASPSPRGKLRTCCIPSCGQQASALTRLFRFPSAETSILKWLVNTQQQPRFVDAQRLFVCQDHFEAEAICKNQLRSWAVPTLNLGHDGHIIPNAKHNGNIADSQENKQTLQFIWANYCSVLTCFQKSSEQLRLYQYPTDRPTIRKWAANCKHRSMQASSDGFQVCQSHFTPDCFDSDTGELKEDAVPTLALSRSVTEVRCVVNGCVKDEDASRRRLFKMPKRNPQILDWCHNLRLDQAAMNGSEQHVCERHFEANCFHASRVLRPGARPTLHLGHEDLDDVIPNPLNWEEDVVMCCVPHCESSKDADEVQLFGLPKVRQLADKWLQNVHLDPSKEQLAGLKICSVHFETSCMENGRPTYGAMPTLHLGHDELDNIHPSVGSVPTQQKRYCNRDGASHDCCYPQCVELQKSYLRVTYELPQEQELRQQWLSYMGLEAQQLDKQQLPKLCPLHLILLYDHSADHFSAHAAEELLDSNYEAARSSVRIRVVSCAVRGCRTLKPRDGGRLHGLPTRRDLLEMWLHNMQLVFYEQQRYMYKICSKHFESTCFTETTKRLKPWSMPTLELPERQRGELPAYQNPTELEWQHMNKLQVSEKVVEAQPEPLLNLEPLPKKEPPPPQVVEYEEDCDNNSQQPLEMQALEVLLEVGHVEKCTTYEQMDTEANLNYAEQFSHNPLSPGPPQCRIPVVQNGLHYSARHCSVHGCNVTSNNLSSSIKLHKFPVSLDAMQKWMHNTQVLVDVKFAWRFRICSHHFIEDCFHGSRIRRGAMPTLRLGSRRPKHIYDNEFNAQLQVEQCKEEAREALAAPLESQQQLLSANVGLRLPRPAPPCKSSKYCQIEGCSNHLTSENVTLHKFPHSSDMCAKWQHNTQVPFDPEFRWRYRICSAHFEPICLGNVRLMHGSVPTLNLGPHAPKKLFDNEFLRLDKPMSSSELGMTVKQEQMEQFDQMELEDGNQEQDDFSLLEPELQLHEDSEEEQEYDNHFSQNDSYNWSDQQLRLPSINQEKCTTIYNPVKSGYDKCSLVHCQRQRSQHGVHIYKFPRSRQLQQRWMHNLRIQYDERRPWKTMICSVHFEPHCIRLRKLRPWAVPTLELGDNVPLEIFTNEQSQQLFAQSEAGSECDDVEVDVEDTILEDMDDDYDDNDSDMNVNADDQMRTAPYVKRERRSRFDPLPPGQLPPWKIKCCCLPYCRSPRGDGIKLFRLPNNISSIRKWEQATGMRFYESQRNTKLICSRHFDPQLIGVRRLMSNAVPSLHLGPDSAEPELPPVGPRCCMPDCSEDVNVQLHKFPKDPMLLHQWCQALNLPDVQSYSGKFICAAHLPSNAMSCLICGVDDVQLPMLDFPQNRNQRTKWCHNLKIEPLPKWDNSKQICCKHFESFCFIQPGQLLAEALPTLHLEHGDSNIFLNDETMDNSKLLRIKDEPMESEDLML
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00595971;
- 90% Identity
- iTF_00553068;
- 80% Identity
- -