Basic Information

Gene Symbol
-
Assembly
GCA_018904355.1
Location
JAEIFY010000122.1:1480189-1491621[+]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 28 6.3e-15 1.1e-11 45.1 4.1 1 86 107 179 107 180 0.85
2 28 2.4e-15 4.1e-12 46.5 4.6 1 87 207 276 207 276 0.83
3 28 5.9e-16 9.9e-13 48.4 0.4 1 87 298 370 298 370 0.85
4 28 5.2e-16 8.7e-13 48.6 5.7 1 87 465 535 465 535 0.83
5 28 7.2e-15 1.2e-11 44.9 3.4 1 86 559 630 559 631 0.81
6 28 1.4e-12 2.4e-09 37.6 1.6 1 87 666 734 666 734 0.80
7 28 2.5e-11 4.2e-08 33.6 1.9 1 86 782 851 782 852 0.77
8 28 5.3e-17 8.8e-14 51.8 0.3 1 86 879 948 879 949 0.82
9 28 2.9e-12 4.8e-09 36.6 1.3 1 86 970 1039 970 1040 0.80
10 28 1.1e-15 1.9e-12 47.5 1.7 1 86 1067 1138 1067 1139 0.85
11 28 5.7e-14 9.6e-11 42.1 1.6 1 85 1216 1284 1216 1286 0.82
12 28 2.6e-12 4.4e-09 36.7 0.1 1 86 1309 1377 1309 1378 0.82
13 28 4.2e-14 7.1e-11 42.5 0.9 1 86 1527 1595 1527 1596 0.82
14 28 7.9e-12 1.3e-08 35.2 0.7 1 61 1649 1703 1649 1724 0.80
15 28 6.5e-05 0.11 13.0 0.1 1 58 1730 1781 1730 1805 0.78
16 28 2.7e-11 4.6e-08 33.5 0.1 1 86 1820 1889 1820 1890 0.83
17 28 3e-14 5e-11 43.0 1.3 1 87 1948 2018 1948 2018 0.81
18 28 9.9e-14 1.7e-10 41.3 0.8 1 86 2053 2124 2053 2125 0.83
19 28 2.2e-13 3.7e-10 40.2 1.6 1 87 2135 2207 2135 2207 0.82
20 28 6.2e-14 1e-10 41.9 0.1 1 87 2230 2301 2230 2301 0.77
21 28 4.7e-06 0.0078 16.7 0.1 1 58 2334 2387 2334 2406 0.84
22 28 6.3e-15 1.1e-11 45.1 0.1 1 86 2425 2497 2425 2498 0.80
23 28 3.9e-14 6.5e-11 42.6 1.4 1 86 2632 2704 2632 2705 0.81
24 28 1.3e-14 2.1e-11 44.1 2.4 1 87 2768 2839 2768 2839 0.83
25 28 7.3e-15 1.2e-11 44.9 4.0 1 86 2952 3022 2952 3023 0.85
26 28 1.8e-13 3e-10 40.5 0.1 1 87 3115 3185 3115 3185 0.85
27 28 3.8e-10 6.4e-07 29.8 0.4 1 58 3202 3250 3202 3262 0.87
28 28 7.4e-09 1.2e-05 25.7 2.2 18 87 3267 3325 3256 3325 0.75

Sequence Information

Coding Sequence
ATGGGCACCATTGAAATGACTCCACCGCAGCACAAGGCGAATGCGGCATTACCGGCAACGGCGGCGCTTAATTCGCTGTTGCAGCAACGCCAGGCGAACGCTGATGGCGCCACTTTATATGCCTCGTCGCTGAAGAACGAGACGAACGTGAAACTGGAGCGCAGCTATAGCAACTCCACCAGCGAGTCTGGTTACAGTATGCACGAGAGCAGCTATAACAATGCCTACGCCAGCGACAATTCTCTGCATGGCGGGGGCGGGGCAATTGGTGGTCCGCAGGCGCATTCCTCGACGCTGGACGATTCGGAGGATGCGCTGTGCTGTGTGCCACTTTGCGGAGTACGCAAGAGCACAAGCCCGACGCTGCAATTCTTTACGTTTCCCAAAGATGACAAGTACTTGCATCAGTGGCTACACAACCTCAAGATGTTTCACATTCCGGCGTCGAGCTATGCTACCTTTCGCATCTGCAGCATGCACTTCCCTAAGCGTTGCATCAATCGTTACTCTCTGTGCTATTGGGCGGTGCCCACATTTAATCTGGGCCACGACGATGTGGCCAATCTCTATCAGAATCGTGAGCTGACCAACACATTCACCACAGGCGAGGTGGCCCGCTGCAGTATGCCAAACTGTACTAGTCAGCGTGGTGAAAGTAATCTGAAGTTCTACAACTTTCCCAAGGACATCAAGAGTTTGATTAAGTGGTGCCAAAACGCTCGCCTGCCCGTCCAGGCCAAGGAGCCGCGTCACTTCTGCAGTCGCCACTTCGAGGAGCGTTGCATCGGCAAGTTCCGGCTGAAGCCTTGGGCAGTGCCCACCTTACATCTTGGCGCCCAGTACGGCAAGATTCATGACAATCCCAAAAATCTGTATGTGGAGGAAAAGCGCTGCTGCCTCAACTTTTGTCGTCGCAGTCGCTCCTCCGACTTCAACATGTCGCTGTATCGCTTCCCCAGGGATGAGGTGCTACTGCGTCGTTGGTGCTACAATCTACGCCTTGATCCGGCTGTCTATCGTGGGAAGAATCACAAAATTTGTAGCGCTCACTTTATCAAAGAAGCATTGGGATTGCGCAAGCTATCTCCGGGCGCTGTGCCCACGTTGCATCTGGGTCATAATGACACCTTTAACATCTACGAGAACGAACTGTGGCCACCGCCAACGCCCTCCACGCCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCTCGGCTGCGTCCACATCCTCGTCGGCCTCGTCGACATCGCATTATGTGGATCCGGAGCTAAGTGCATCCTACATGAGCATGGGCGCTGGAGGGTCATCCTCTGGCCTGAATGTCAGCGACAGCATGGATGTCTGCTGTGTGCCCAGCTGCGAGAGCAAGCGCCACAACAATGAGAACATCACATTCCACACAATACCCAGGCGGCCAGAGCAGATGCGGAAGTGGTGTCACAATCTTAAGATACCCGAGGACAAGATGCACAAAGGCATGCGGATATGTAGCTTGCACTTTGAGCCCTACTGCATTGGCGGCTGCATGCGTCCGTTTGCGGTGCCGACATTGCATCTGGGCCACGACGACGAGGACATTCACCGCAATCCGGATGTGATCAAGAAGCTCAACATACGCGAAACTTGCTGCGTGGCTGTTTGCAAACGCAATCGTGACCGGGACCATGCCAATCTGCATCGCTTTCCCAGCAATGTGCCGCTGTTGACCAAATGGTGCGCAAATCTGCAGCGTCCTGTGCCGGATGGCAGTAAACTGTTCAACGATGCCATCTGTGAGGTGCACTTTGAGGATCGATGCCTGCGCAATAAACGGCTAGAGAAGTGGGCAGTGCCCACACTCATCCTTGGCCATGAGAATATACCCTATCCGCTTCCCACGCCGGAGCAAGTTGCCGAGTTCTATGCGCGTCCCACTGCGCCTAACAATGGCGAGGAGCAGGGCGAGTGCTGTGTGGAGACGTGCAAGCGTAATCCCAGTGTTGATGACATCAAGCTATATCGCCCGCCCGAGGAGTCGCAGGTGCTGGTAAAGTGGGCGCACAATCTCCAACTGGAGATTGCCCAGCTACACAATATGAGAATATGCAATCTGCATTTCGAAACCCACTGCATTGGCAAGCGGATGCGTCCCTGGGCAATACCCACGCTCAATCTGGCAACTAACATAGAGAATCTCTACGAGAATCCCGAACACCAGATGCTCTACAAGCGGCGCACGCATCTCAAGCCGGGCAGAGCAGCGCGAAGCTCTGAAGCAAGCGCTGGTGGTGTGAAGCCCACCTGGGTGCCACGCTGCTGCTTGCCACACTGCCGCAAGGTGCGTGCCACACACAATGTCCAGCTGTATCGCTTCCCCAAACTCAATCGTTCCACGCTGGCCAAGTGGGCGCATAATCTGCAGGTGCCGCTCGTGGGCAGCGCTCAGCGTCGCCTCTGCTCCGCACACTTTGAGCCGCATGTGCTTAGCAAGAAATGCCCGGTGCCCATGGCGGTGCCCACACTGGACCTCAATACACCATCCGGTTACAAGATCTATCAGAATCCGGCCAAGCTCAAGGCGAATAAGCTGTGCTTGCAGCGTGTCTGCATTGTGGAGAGCTGCCGGCGTCAGCGGGCGCAGGGGGTGCAGCTCTTCCGTCTGCCTCACAGCCCCACCCAGCTGCGTAAGTGGATGCACAACATCCGGATGCGGCCCCGAGGAGCTATGCGACAACAGTATCGCATCTGCTCGAAGCACTTCGAGACGCACTCGTTCAATGGGAAGAGACTCAGTGCGGGTGCAATTCCAACGCTTGAGTTGGGCCATGAGGACGAAGACATATTTCCGAATGAGGCGCAGTCTTTCGTGGAGGAGCACTGCACCGTCGAGGGCTGCGATGCCGTCAAGGAGCAACCGGATGTGCGTCTCTTCCGCTTCCCCAACGACGATGAGGATCTGCTCTGGAAGTGGTGCAACAATCTGAAAATGAGTCCGGTCGACTGCATCGGCGTTCGCATCTGCAACAGACACTTCGAGACTGATTGCATTGGACCAAAGCACCTGTTCAAGTGGGCTATTCCCACGCTCTCCCTCGGCCACGATGATGATGACATCGAGTTGATGCTAAATCCCAAGCCGGAGGAGCGCTATATTGATCCGGTATTCAAGTGCTGTGTGCCCTCGTGCGGCAAGACGCGTAAATTCGATGAAGTGCAGATGAACAGTTTTCCCAAAGATCCGGAGCTCTTCCAGCGCTGGCGCCACAATCTCCGCCTCGAGCATCTCAACTTCAAGGAGCGCGAACGCTATAAGATCTGCAACGCCCACTTCGAGGACATTTGCATTGGTAAGACGCGCTTGAACATTGGCTCCATACCGACACTGGAGCTTGGCCATGACGAGACTGATGACTTGTTCCAAGTCAACCCCGAGGAGCTACAGAGCAATCTCTTTGGACGCCAGAGACGCGTGCAGGATTCCATGAGGATCAACATTAAGCAGGAGGCGCACTCCGACCTCGATGAAGACACTAAACCGGACATTAACATGTCGGAGGCCTCAGATTCAAATACAACACAGGTGGCTAAAATCAAAAAATCTATAACCGATTTGAAGTGCTGTGTGCCGAACTGTGGTCGCAGTCGGCTGGAGCATGGTGCCCGCCTCTTTCCGTTTCCGAACGGGAAACAGCAGCAGAGTAAGTGGCGCCACAATCTCCGGCTGCCCGCTGCCGACGTGGACAAGACGACGCGCATCTGCAGCGCCCACTTCAATCGCCGTTGCATCGATGGCAATCAGCTGAGGGGCTGGGCAATGCCCACACAGCAACTGGGACATCAGGAGCTGCCGATCTATGAAAATCCAAAGAATATACCGGGCTTCTTTACGCCCACCTGTGCGCTGGCGCACTGCCGCAAACGGCGCAGCATTGACAACGATCTGCGTACCTATCGCTATCCACGCAGTGAGGAGCTGCTCGAGAAGTGGCGTGTCAATCTGCGCTTGTCGCCGGACCAATGCCGCGGACGCATTTGTGCGGATCACTTCGAGCCACTGGTGCGTGGCAAGCTGAAGCTGAAGACTGGAGCAGTGCCTACGCTCAAATTGGGACACGACGAGGGCGTAGTCTTCGATAATGAGGGCATTAAGGCGGGTCTGCAGCTGGAAGAGGAGGCGGAGGAAGAAGAGGGCAATGCCAGCTTGAAGTCGTTGGTCAAAGTAAAGACTGAGCAGGAGGATGAGCAGGAGCTAGAGAATGAAGATGAAGAGCAGCTGGAGCAGGAGAAGTATCAAGATATGGACGAAGATGGGGAAGAGCACCGAGACTCTGAGGAACATGGCTATTTTGATCCCTTGGAACTTGTGGAAACCTANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTCCGCGGCGCGAGAAGGCTGTGAATAATGTGACGCCTATTTGCTGTCTGAAGCACTGTCGCAAGGAGCGCACCGCCATCCATCATCTGAGCACCTTTGGCTTTCCCAAGGATCCGCAGCTGCTGCTCAAGTGGAGCGCCAATCTGCAGCTACCATTGGAGTCGTGCATGGGTCGTGTATGCGTCGAGCACTTTGAGCCCTCGATGCTGGGCACGCGCAAGCTGAAGCAGAATGCGGTGCCCACCTTGAAACTGGGCCATACCACACCGCTCACCTACAGCTGCAATGGCCGGATGCTATCGGGCATTTACGATGAACAGCCACAGCACTCGGTTTTTCGGCTTTGGAGCCTGAAACACTGCCGCAAACGGAAACCGGATCTGGCGGAGATTAAGCCCGGTCGTCGCTGTTGCCTGCCAAGTTGCGGCAAGCAGTCGGAGTCGCACGGCGTCCAGCTGCAGCGTCTGCCGAAGGATCGTCTGATGCTGCGCAAATGGTTGCACAACCTCAAGCTGCCTCCAACGATGGACTGCACCCAAATGTTCCTCTGCAGCGATCACTTTGAGCTGAATGCGCCGTGTCCCACTTTGAAACTAGGACACTCGGATACCAATATTTATCGCCACAGTGTGGCTAGCACCAGTGGCAGCTGCCTGGTGCCCAAATGTACTTGTGCTCGTCTCAATCTCTATCGCGGCTATGATCTGCCTGCGCATCCGCAGGTGCAACATGCCTGGCTACACTGGCTGCAGCTGCCCCATCCGCAGCCGTCGCCCAGGCACGCCCAGCTGTGTGTGATGCACTTCATGCAGCTCTACGAACTGGTGCCGCTGCCCGAATCGGTGCCAGATGTTGTGCGCAGGCAGCTGCGGGAGACTTACGAACTGATATCCAGTTCCAGCATGGCCATGAAGCTGCGTTGCGCTGTGCCCGGCTGCTACTCGAAGTATACGGACAATGTGCGTCTGACCAAGCTGCCCGTTTACCCCGACACCTGCGCCAAGTGGGTGCACAACACCAAGATTCAATATGATCCGGCCCGACATTATGTCTATCGCATCTGCATGTTGCACTTCGAGCCAGGTTGCCTGGGCCCAGTGCGTCCTAAAGTGTGGGCAATGCCAACGCTGCAGCTGCACCACAAGGATGCCAACATCTATTTAAATCCCAAGCTGGATGGCAGCCAAACACAGCCGGCCGTGCCGCTGGACCTGCCACTGCGTATTAAAACTGAGCTGCCTATGTGCAACAGTCCCAGCTTTAGTGCGAGTGCTAGTCCCAGTCCGCGTGGCAAGCTGCGCACTTGCTGCATTCCCAGCTGCGGTCAGCAGGCTTCGGCCCTGACGCGTCTCTTTCGCTTTCCCAGCGCAGAGACATCGATACTGAAGTGGCTGGTGAATACCCAGCAGCAGCCACGCTTTGTCGATGCACAACGGCTGTTCGTCTGCCAGGATCACTTTGAGGCGGAGGCCATTTGCAAGAATCAGCTGCGCAGCTGGGCGGTACCAACACTGAATCTAGGACACGATGGACACATCATACCGAATGCCAAGCACAATGGCAACATTGCCGACAGCCAGGAGAACAAGCAGACGCTGCAGTTCATCTGGGCCAACTACTGTTCAGTGCTGACCTGCTTCCAGAAAAGTAGCGAGCAGCTGCGTCTCTACCAATACCCCACGGATCGGCCAACCATCCGCAAGTGGGCCGCCAATTGTAAGCATCGCTCCATGCAGGCCAGCAGTGATGGATTCCAGGTGTGTCAGTCGCATTTTACGCCGGATTGCTTTGATTCTGATACCGGGGAACTGAAAGAGGACGCTGTGCCCACACTGGCGCTGAGCCGGTCTGTCACTGAGGTGCGCTGTGTGGTCAATGGTTGCGTTAAGGACGAAGATGCATCGCGTCGCCGTCTGTTCAAGATGCCCAAGCGTAACCCACAGATATTGGATTGGTGCCACAATTTGCGGCTGGATCAGGCGGCCATGAACGGCTCGGAACAGCACGTTTGTGAACGTCACTTCGAGGCGAACTGCTTCCATGCGTCTAGAGTGCTGCGTCCAGGAGCACGACCCACACTTCATTTAGGTCACGAGGACCTAGACGATGTGATACCCAATCCACTGAACTGGGAAGAGGATGTGGTCATGTGCTGTGTCCCCCACTGCGAAAGCTCCAAGGATGCGGATGAAGTCCAACTGTTTGGGCTGCCTAAGGTGCGCCAGTTGGCGGACAAGTGGCTGCAAAATGTGCACCTCGATCCGAGCAAAGAACAACTGGCTGGCCTGAAGATCTGCAGTGTGCACTTTGAGACGAGCTGCATGGAGAATGGACGACCCACCTATGGTGCAATGCCCACACTCCATCTCGGTCACGATGAGCTCGACAATATACACCCAAGCGTAGGGTCGGTGCCGACGCAGCAGAAGCGCTACTGCAATAGAGATGGCGCCAGTCACGATTGCTGCTATCCGCAGTGCGTGGAGCTGCAGAAGAGCTATCTGCGGGTCACCTACGAACTGCCCCAGGAGCAGGAGCTCCGTCAGCAGTGGCTCTCCTATATGGGCCTGGAAGCGCAGCAGCTCGATAAACAGCAGCTGCCCAAGCTCTGTCCACTCCACCTAATCTTGCTCTACGATCACAGTGCGGATCACTTTTCGGCACACGCCGCTGAGGAGCTGTTGGACTCTAATTATGAAGCAGCGCGCAGCAGCGTTCGCATACGCGTTGTCAGCTGTGCTGTGCGCGGCTGCAGAACGCTCAAACCACGCGACGGTGGTCGGCTGCATGGTTTGCCCACGCGGCGAGATCTGCTGGAGATGTGGCTGCACAACATGCAGCTGGTGTTTTACGAGCAACAGCGTTATATGTACAAGATTTGCAGCAAGCACTTTGAGTCCACATGCTTCACGGAGACAACCAAGCGGCTGAAGCCGTGGAGCATGCCTACCCTCGAGTTGCCGGAGCGCCAACGGGGCGAGCTGCCTGCCTATCAGAATCCCACAGAGTTGGAGTGGCAACACATGAATAAGCTGCAGGTCAGCGAGAAAGTTGTTGAGGCTCAGCCGGAGCCATTACTTAATCTGGAGCCGTTGCCCAAGAAGGAGCCACCACCACCGCAGGTTGTGGAATATGAAGAGGATTGCGACAATAACTCACAGCAGCCACTGGAAATGCAGGCGCTGGAGGTGCTGCTCGAGGTGGGCCATGTCGAGAAGTGCACCACCTACGAGCAAATGGATACCGAGGCAAATCTCAACTATGCCGAGCAGTTCTCGCACAATCCCCTCAGTCCAGGTCCACCTCAATGCCGTATCCCCGTTGTCCAGAATGGACTCCACTACAGTGCACGCCACTGCAGCGTGCATGGCTGCAATGTCACCTCAAATAATCTGAGCAGCAGCATCAAGCTACACAAGTTCCCCGTCTCGCTGGATGCCATGCAAAAGTGGATGCACAACACCCAGGTGCTCGTGGACGTCAAATTCGCTTGGCGTTTTCGAATCTGCAGTCATCATTTCATCGAGGATTGCTTTCACGGCTCGCGCATCAGACGTGGGGCGATGCCCACGTTGCGACTGGGCTCACGTCGACCGAAGCATATCTATGATAATGAGTTCAACGCCCAACTGCAAGTGGAACAGTGTAAAGAAGAGGCCAGGGAGGCTCTCGCTGCCCCGCTGGAGTCTCAGCAACAGTTGCTCTCTGCGAATGTAGGTCTGCGTCTGCCGCGTCCAGCCCCGCCCTGCAAATCCAGCAAATATTGTCAGATCGAAGGCTGCTCCAATCATTTGACCAGCGAGAATGTGACGCTGCACAAGTTCCCCCATTCGTCGGATATGTGCGCCAAGTGGCAGCACAACACTCAGGTGCCCTTCGATCCCGAGTTCCGCTGGCGCTATCGCATCTGCAGCGCACACTTTGAGCCCATTTGTCTAGGCAATGTGCGACTGATGCACGGCAGTGTGCCCACCCTGAATCTGGGGCCGCATGCGCCCAAGAAACTGTTTGACAATGAATTCTTGCGTCTGGACAAGCCAATGAGCAGTTCGGAGCTGGGTATGACCGTCAAACAAGAACAAATGGAGCAATTTGATCAAATGGAGCTGGAAGATGGCAACCAGGAGCAGGATGATTTCAGTCTGCTGGAGCCCGAGCTGCAGTTACACGAGGATAGCGAGGAAGAGCAAGAATATGACAATCATTTTAGCCAAAACGATTCCTATAACTGGTCCGATCAGCAGCTGCGTCTGCCCAGCATTAATCAGGAGAAGTGCACCACCATCTACAATCCAGTCAAGTCCGGCTATGATAAGTGCTCACTGGTCCACTGCCAACGACAGCGTTCGCAGCACGGCGTGCACATCTACAAGTTTCCACGCTCGCGTCAGCTACAGCAACGATGGATGCATAATTTGCGCATCCAATACGATGAGCGACGGCCGTGGAAGACAATGATATGCAGTGTCCATTTCGAGCCGCACTGTATCCGTCTGCGAAAGTTGCGTCCCTGGGCGGTGCCCACGCTGGAACTTGGGGACAATGTGCCGCTGGAGATCTTTACGAATGAGCAGAGCCAGCAGCTGTTTGCTCAGTCCGAAGCAGGCAGCGAATGTGATGACGTTGAAGTGGATGTTGAGGACACCATACTGGAGGACATGGATGATGACTATGATGACAATGACTCTGATATGAATGTGAATGCTGATGATCAAATGCGAACAGCTCCATATGTCAAAAGAGAGCGTCGCTCTCGATTTGATCCTCTGCCACCGGGTCAGCTGCCACCGTGGAAGATCAAATGCTGCTGTTTACCCTATTGCCGCAGTCCCCGCGGTGATGGCATCAAGCTCTTTCGACTGCCCAACAACATCAGCTCCATACGTAAATGGGAGCAGGCCACAGGCATGCGCTTCTATGAGTCCCAGCGAAACACAAAGCTCATCTGCAGTCGACACTTTGATCCGCAGCTTATAGGCGTGCGTCGCCTCATGTCCAATGCGGTACCCAGCCTCCATTTGGGCCCAGACAGCGCAGAGCCCGAGCTGCCTCCTGTGGGACCACGTTGCTGCATGCCCGATTGCTCTGAGGATGTCAATGTCCAGCTGCACAAGTTTCCCAAAGATCCCATGCTGCTGCATCAATGGTGTCAGGCGCTCAATCTACCGGATGTTCAAAGCTACTCCGGCAAATTCATTTGTGCGGCACATCTGCCCTCCAACGCGATGAGCTGTCTAATTTGTGGCGTGGACGATGTACAGCTGCCAATGCTGGACTTTCCCCAGAATCGCAATCAGCGCACCAAGTGGTGCCACAATCTGAAAATCGAGCCTCTGCCCAAGTGGGACAACTCAAAGCAAATTTGCTGCAAACACTTTGAGAGCTTTTGCTTTATCCAGCCGGGTCAACTTCTGGCGGAAGCATTGCCCACTCTACACTTGGAGCACGGGGATAGCAATATATTCCTAAACGATGAGACCATGGATAACAGCAAGTTGTTGCGCATCAAGGACGAGCCCATGGAGAGCGAGGATCTGATGCTGTAA
Protein Sequence
MGTIEMTPPQHKANAALPATAALNSLLQQRQANADGATLYASSLKNETNVKLERSYSNSTSESGYSMHESSYNNAYASDNSLHGGGGAIGGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDDKYLHQWLHNLKMFHIPASSYATFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPNCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPAVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPTPSTPXXXXXXXXXXXXXXXXXXXXXXXXXXXSAASTSSSASSTSHYVDPELSASYMSMGAGGSSSGLNVSDSMDVCCVPSCESKRHNNENITFHTIPRRPEQMRKWCHNLKIPEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLHLGHDDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVPLLTKWCANLQRPVPDGSKLFNDAICEVHFEDRCLRNKRLEKWAVPTLILGHENIPYPLPTPEQVAEFYARPTAPNNGEEQGECCVETCKRNPSVDDIKLYRPPEESQVLVKWAHNLQLEIAQLHNMRICNLHFETHCIGKRMRPWAIPTLNLATNIENLYENPEHQMLYKRRTHLKPGRAARSSEASAGGVKPTWVPRCCLPHCRKVRATHNVQLYRFPKLNRSTLAKWAHNLQVPLVGSAQRRLCSAHFEPHVLSKKCPVPMAVPTLDLNTPSGYKIYQNPAKLKANKLCLQRVCIVESCRRQRAQGVQLFRLPHSPTQLRKWMHNIRMRPRGAMRQQYRICSKHFETHSFNGKRLSAGAIPTLELGHEDEDIFPNEAQSFVEEHCTVEGCDAVKEQPDVRLFRFPNDDEDLLWKWCNNLKMSPVDCIGVRICNRHFETDCIGPKHLFKWAIPTLSLGHDDDDIELMLNPKPEERYIDPVFKCCVPSCGKTRKFDEVQMNSFPKDPELFQRWRHNLRLEHLNFKERERYKICNAHFEDICIGKTRLNIGSIPTLELGHDETDDLFQVNPEELQSNLFGRQRRVQDSMRINIKQEAHSDLDEDTKPDINMSEASDSNTTQVAKIKKSITDLKCCVPNCGRSRLEHGARLFPFPNGKQQQSKWRHNLRLPAADVDKTTRICSAHFNRRCIDGNQLRGWAMPTQQLGHQELPIYENPKNIPGFFTPTCALAHCRKRRSIDNDLRTYRYPRSEELLEKWRVNLRLSPDQCRGRICADHFEPLVRGKLKLKTGAVPTLKLGHDEGVVFDNEGIKAGLQLEEEAEEEEGNASLKSLVKVKTEQEDEQELENEDEEQLEQEKYQDMDEDGEEHRDSEEHGYFDPLELVETXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPRREKAVNNVTPICCLKHCRKERTAIHHLSTFGFPKDPQLLLKWSANLQLPLESCMGRVCVEHFEPSMLGTRKLKQNAVPTLKLGHTTPLTYSCNGRMLSGIYDEQPQHSVFRLWSLKHCRKRKPDLAEIKPGRRCCLPSCGKQSESHGVQLQRLPKDRLMLRKWLHNLKLPPTMDCTQMFLCSDHFELNAPCPTLKLGHSDTNIYRHSVASTSGSCLVPKCTCARLNLYRGYDLPAHPQVQHAWLHWLQLPHPQPSPRHAQLCVMHFMQLYELVPLPESVPDVVRRQLRETYELISSSSMAMKLRCAVPGCYSKYTDNVRLTKLPVYPDTCAKWVHNTKIQYDPARHYVYRICMLHFEPGCLGPVRPKVWAMPTLQLHHKDANIYLNPKLDGSQTQPAVPLDLPLRIKTELPMCNSPSFSASASPSPRGKLRTCCIPSCGQQASALTRLFRFPSAETSILKWLVNTQQQPRFVDAQRLFVCQDHFEAEAICKNQLRSWAVPTLNLGHDGHIIPNAKHNGNIADSQENKQTLQFIWANYCSVLTCFQKSSEQLRLYQYPTDRPTIRKWAANCKHRSMQASSDGFQVCQSHFTPDCFDSDTGELKEDAVPTLALSRSVTEVRCVVNGCVKDEDASRRRLFKMPKRNPQILDWCHNLRLDQAAMNGSEQHVCERHFEANCFHASRVLRPGARPTLHLGHEDLDDVIPNPLNWEEDVVMCCVPHCESSKDADEVQLFGLPKVRQLADKWLQNVHLDPSKEQLAGLKICSVHFETSCMENGRPTYGAMPTLHLGHDELDNIHPSVGSVPTQQKRYCNRDGASHDCCYPQCVELQKSYLRVTYELPQEQELRQQWLSYMGLEAQQLDKQQLPKLCPLHLILLYDHSADHFSAHAAEELLDSNYEAARSSVRIRVVSCAVRGCRTLKPRDGGRLHGLPTRRDLLEMWLHNMQLVFYEQQRYMYKICSKHFESTCFTETTKRLKPWSMPTLELPERQRGELPAYQNPTELEWQHMNKLQVSEKVVEAQPEPLLNLEPLPKKEPPPPQVVEYEEDCDNNSQQPLEMQALEVLLEVGHVEKCTTYEQMDTEANLNYAEQFSHNPLSPGPPQCRIPVVQNGLHYSARHCSVHGCNVTSNNLSSSIKLHKFPVSLDAMQKWMHNTQVLVDVKFAWRFRICSHHFIEDCFHGSRIRRGAMPTLRLGSRRPKHIYDNEFNAQLQVEQCKEEAREALAAPLESQQQLLSANVGLRLPRPAPPCKSSKYCQIEGCSNHLTSENVTLHKFPHSSDMCAKWQHNTQVPFDPEFRWRYRICSAHFEPICLGNVRLMHGSVPTLNLGPHAPKKLFDNEFLRLDKPMSSSELGMTVKQEQMEQFDQMELEDGNQEQDDFSLLEPELQLHEDSEEEQEYDNHFSQNDSYNWSDQQLRLPSINQEKCTTIYNPVKSGYDKCSLVHCQRQRSQHGVHIYKFPRSRQLQQRWMHNLRIQYDERRPWKTMICSVHFEPHCIRLRKLRPWAVPTLELGDNVPLEIFTNEQSQQLFAQSEAGSECDDVEVDVEDTILEDMDDDYDDNDSDMNVNADDQMRTAPYVKRERRSRFDPLPPGQLPPWKIKCCCLPYCRSPRGDGIKLFRLPNNISSIRKWEQATGMRFYESQRNTKLICSRHFDPQLIGVRRLMSNAVPSLHLGPDSAEPELPPVGPRCCMPDCSEDVNVQLHKFPKDPMLLHQWCQALNLPDVQSYSGKFICAAHLPSNAMSCLICGVDDVQLPMLDFPQNRNQRTKWCHNLKIEPLPKWDNSKQICCKHFESFCFIQPGQLLAEALPTLHLEHGDSNIFLNDETMDNSKLLRIKDEPMESEDLML

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00595971;
90% Identity
iTF_00553068;
80% Identity
-