Basic Information

Gene Symbol
-
Assembly
GCA_018904035.1
Location
JAEIFZ010000002.1:19126639-19140154[-]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 28 1.2e-15 2.6e-12 47.3 3.8 1 86 321 393 321 394 0.85
2 28 2.2e-15 4.8e-12 46.4 4.6 1 87 421 490 421 490 0.83
3 28 5.3e-16 1.2e-12 48.4 0.4 1 87 512 584 512 584 0.85
4 28 3.5e-16 7.6e-13 49.0 5.3 1 87 608 678 608 678 0.83
5 28 6.4e-15 1.4e-11 44.9 3.5 1 86 702 773 702 774 0.82
6 28 8.6e-13 1.9e-09 38.1 1.0 1 87 809 877 809 877 0.81
7 28 7.5e-11 1.6e-07 31.9 1.4 1 86 926 995 926 996 0.76
8 28 1.4e-15 3e-12 47.1 0.1 1 86 1023 1092 1023 1093 0.82
9 28 4.4e-12 9.5e-09 35.9 1.0 1 85 1114 1182 1114 1184 0.79
10 28 2.8e-15 6e-12 46.1 1.7 1 86 1211 1282 1211 1283 0.85
11 28 7.6e-14 1.7e-10 41.5 1.5 1 85 1357 1425 1357 1427 0.81
12 28 1.6e-12 3.4e-09 37.3 0.1 1 86 1450 1518 1450 1519 0.81
13 28 7.3e-12 1.6e-08 35.1 0.3 6 87 1613 1677 1611 1677 0.77
14 28 6.1e-14 1.3e-10 41.8 0.8 1 61 1751 1804 1751 1829 0.79
15 28 0.055 1.2e+02 3.5 0.0 1 58 1837 1887 1837 1909 0.74
16 28 3e-12 6.5e-09 36.4 0.4 1 86 1926 1995 1926 1996 0.85
17 28 4.1e-15 8.9e-12 45.6 2.2 1 86 2058 2127 2058 2128 0.82
18 28 2.4e-12 5.2e-09 36.7 0.9 1 86 2163 2234 2163 2235 0.81
19 28 3.2e-13 7.1e-10 39.5 2.6 1 87 2245 2317 2245 2317 0.79
20 28 2.7e-16 5.9e-13 49.3 0.1 1 86 2347 2421 2347 2422 0.80
21 28 7.7e-05 0.17 12.6 0.1 1 58 2457 2511 2457 2531 0.84
22 28 4e-15 8.7e-12 45.6 0.3 1 86 2549 2621 2549 2622 0.81
23 28 1.7e-14 3.6e-11 43.6 1.5 1 86 2771 2843 2771 2844 0.83
24 28 4.5e-14 9.8e-11 42.2 3.0 1 87 2904 2975 2904 2975 0.82
25 28 6.9e-14 1.5e-10 41.6 3.6 1 86 3089 3159 3089 3160 0.84
26 28 7.4e-11 1.6e-07 31.9 0.0 11 87 3176 3241 3170 3241 0.79
27 28 3.4e-08 7.5e-05 23.4 0.7 1 58 3263 3311 3263 3322 0.82
28 28 2.7e-11 6e-08 33.3 2.9 6 87 3321 3386 3314 3386 0.75

Sequence Information

Coding Sequence
ATGATAATAAAATCGGAACCTATTGACGAACATGCGTACAAGTCCAACTATATTGATGATAATACGCCGTTTGCTGATTTTAGTAAATTTCCGGAATTTGGCGACGATATGCTGAGTCCCAAGGTAGAGCTGACCGTCAAGGATGAGGCGTATCAAAAGAACCCGCTTAGTTATCCGCGGCGCAAGCTGCAATCGGAACGCTCTTCGGAGAGCTTGCCGATTTGTCAGCGCTGCAAGGAGGTGTTCTTCAAGAAGCAAATCTATTTGCGTCATGTTGCAGAGAGCAGTTGCACCATACAGGAGTATGACTTCAAGTGCAACATATGCCCCATGTCCTTTATGAGCACAGAAGAGCTGCAAAAGCACAAGCATATGCACAGAGCCGATAAGTTCTTCTGTCACAAATATTGTGGCAAGTACTTTGACACCATTGCAGAGTGCGAGTCACATGAGTACATGCAGCACGAGTATGAGAGCTTTATTTGCAACATGTGCTCGGTTACATTTGCCACACGGGAACAGCTCTATGCGCACTTGCCACAACACAAGTTCCAGCCACGTTTCGATTGTCCCATTTGCCGGCTATGGTATCAAACGGCAGTAGAGCTGCACGAGCATCGTATGGCGGCACCCTATTTCTTGCCGACGCCACATCATAAGGCAACTACTGGCAGCGCTTTGCCGGCAACGGCAGCGTTGAACTCCCTGTTGCAACAGCGTCAGGCAAATGCCGATGGTGCCGCATTATTTGCATCATCTCTGAAGAACGAAGTGAATGTGAAATTGGAGCGGAGCTATAGCAATTCTACAACAAGCGAATCGGGCTACAGTCTACACGACAGCAGCAACTATAACAATGCATATGGCAGCGATGCCTCGCTACATGGCGGTGCAAGTGCTGGTGCCATTGGTGGCCCACAGGCGCATTCCTCAACGCTGGACGATTCAGAGGATGCGCTTTGCTGTGTGCCATTATGTGGTGTGCGCAAGAGCACCAGCCCCACGCTGCAGTTCTTTACGTTCCCCAAAGACGAGAAGTACTTGCATCAATGGCTGCACAATCTCAAGATGTTTCATATTCCGGCCTCGAGCTATGCCACTTATCGCATCTGCAGCATGCACTTTCCGAAGCGCTGCATCAATCGTTATTCATTGTGCTATTGGGCTGTGCCCACGTTCAATCTGGGACACGATGATGTTGCCAATCTATATCAGAATCGCGAGCTAACGAACACGTTCACCACCGGCGAGGTGGCACGTTGCAGCATGCCCAACTGTACCAGTCAACGGGGGGAAAGCAATCTCAAGTTTTACAACTTCCCGAAGGACATTAAGAGTCTTATCAAGTGGTGCCAGAACGCACGTTTGCCCGTCCAGGCCAAGGAGCCGCGTCACTTCTGCAGTCGACACTTCGAGGAGCGTTGCATTGGCAAGTTCCGGCTAAAGCCATGGGCGGTGCCAACGTTGCATTTGGGCGCCCAATACGGCAAGATTCATGACAATCCCAAGAACTTGTATGTGGAAGAGAAGCGCTGTTGCCTTAATTTCTGTCGTCGCAGTCGCTCCTCGGACTTTAACATGTCGCTCTATCGCTTCCCCAGAGACGAAGTGCTCTTGCGACGTTGGTGCTACAATTTACGCCTTGATCCTGCCGTCTATCGTGGCAAGAATCACAAGATATGTAGTGCACACTTCATCAAGGAAGCGTTGGGATTACGAAAACTGTCTCCAGGTGCTGTTCCTACTCTCCATTTGGGTCACAACGATACTGGCTCGGCTGGATCCGCGTCGGGCATGAATGTGAGCGACAGCATGGACGTGTGCTGTGTGCCAAGTTGTGAGAGTAAGCGACACAACAACGAGAATATCACATTCCACACAATACCAAGGCGACCCGAGCAGATGCGCAAATGGTGTCACAATCTTAAAATACCCGAGGATAAGATGCACAAGGGAATGCGGATTTGTAGTCTACACTTTGAACCTTATTGCATTGGCGGCTGTATGCGTCCGTTTGCGGTGCCCACACTGAATCTCGGGCACGACGACGATGATATACATCGCAATCCGGATGTGATTAAGAAGCTGAATATACGTGAAACCTGCTGTGTGGCCGTGTGCAAACGTAATCGGGATCGTGATCATGCCAATCTGCATCGTTTCCCCAGCAATGTCTCGCTGCTGACCAAGTGGTGTGCCAATCTGCAGCGACCTGTGCCCGATGGCACCAAACTCTTCAACGATGCGATTTGTGAGGTGCACTTCGAGGATCGCTGCTTGCGCAACAAGAGACTGGAGAAATGGGCAGTGCCTACGCTGGTGTTGGGACATGACAATATTGCCTATCCGCTGCCAACGGCAGAGCAGGTGGCTGAGTTCTATGCCCGTCCCAGTGCACCCAACAATGGTGAGGAGCAGGGCGAGTGCTGTGTGGAGACGTGTAAGCGTAATCCCAGTGTCGATGATATCAAACTCTATCGACCGCCCGAGGAGTCGTCTGTGCTGGCGAAATGGGCACATAATCTTGAAGTGGATGCCGCCAAGTTGTCCAGCATGCGTATATGTAATCTGCACTTTGAGTCCCATTGCATTGGCAAGCGTATGCGACCCTGGGCCATACCCACACTCAATCTGTCCAACAATGTTGAGAATATGTACGAGAATCCCGAGCATCACATGATCTACAAGCGTCGCACACAAATTAAAACCGAGAGAGCATCGGCAGGGGCATCGGGAGCGGGGGCAAGCATGTCGAAACCCACTTGGGTGCCACGTTGTTGCTTGCCACATTGCCGCAAGGTGCGTGCTCTGCACAATGTGCAGCTCTATCGCTTCCCCAAACTCAATCGTTCCACGTTGGCAAAGTGGGCGCATAATCTGCAAGTGCCACTGGTGGGCAGTGCCCAGAGGCGACTCTGCTCTGCCCACTTTGAGCCGCATGTGCTGAGCAAAAAGTGTCCGGTACCGCTGGCGGTGCCCACGCTGGACCTCAATGCGCCACCCGGTTATAAAATCTATCAGAATCCCGCCAAGCTCAAGGCCAACAAGCTGTGCCTGCAGCGCGTCTGCATTGTGGAGAGTTGCCGTCGTCAGCGTGGCCAGGGTGTGCAGCTCTTCCGTCTGCCGCACAGTCCCACGCAGCTGCGCAAGTGGATGTACAACATACGGATGCGACCCAGAGGTGCCATGCGGCAACAGTATCGCATCTGCTCGATTCATTTTGAGACACATTCGTTCAATGGAAAGCGACTGAGTGCGGGTGCGATTCCAACACTAGAGCTGGGCCACGACAACGAGGACATTTATCCGAATGAGGCGCAATCATTTGTAGAGGAACACTGTGCCATCGACGGGTGCGATGCATCCAAAGAGCAGCCAGATGTGCGTCTCTTCCGTTTTCCCACCGATGATGAGGATTTGCTCTGGAAGTGGTGTAATAATCTGAAAATGAATCCAGTCGATTGTATCGGAGTGCGCATCTGCAACAAACACTTCGAGGTCGACTGCATAGGGCCGAAGCATCTCTATAAGTGGGCCATACCCACGCAGGAGTTGGGCCACGATGACTCTCAGATCGAATTGATCAACAATCCCAAGCCCGAGGAACGTTATGTTGATCCCGTGTTCAAGTGTTGTGTACCCACCTGCGGCAAGACGCGCAAATTCGATGAAGTGCAAATGAATAGCTTCCCCAAAGATCCAACACTCTTCCAGCGATGGCGGCATAATCTGCGACTCGAGCATCTCCAGTTCAAGGAGCGTGAACGCTATAAAATCTGCAATGCACACTTCGAGGATATTTGCATTGGTAAAACACGTCTGAATATCGGTTCCATACCCACACTGGAGTTGGGACACGACGATACGGAAGATCTCTTCCAGGTGAATCCTGCAGAGCTGCAAAGCAATCTGTTTGGTCGCCAGCGCCGTGTACAGGAATCAGGCATGAGGGTCCGAATAAAGGAAGAAGAGCACTCCGAGCAGGAGGAGACGGACATAACCATGTCGGAAGCCACGGACATGAATACCAAGCAGGTTAAAATAAAGAAACGCCTTGCGGACTTGAAGTGCTGCGTTCCCAGCTGTGGTCGCAGTCGTCTGGAGCACGGTGCACGTTTGTTTCCCTTTCCAACTGGCAAACAACAGCAGAGCAAGTGGCGCCACAATTTGCGTCTGGATGCCAGCGATGTGGACAAGACATCGCAAGTTTGTAGCGCACACTTCAATCGACGTTGCATCGATGGCAAGCAGCTGCGTGGCTGGGCAATACCCACACAGCAGCTGGGGCATCAGGAACAGCCCATCTATGAGAATCCAAAGAATATACCGGGCTTCTTTACGCCCACCTGTGCGCTGGCGCACTGCCGCAAGCGGCGAAGTATTGACAATGATTTGCGGACGTACCGTTATCCTCGCAGCGAGGAGTTGTTGGAGAAATGGCGAGTGAATCTACGTTTGGCTCCGGATCAGTGTCGCGGACGCATCTGTGCGGATCATTTTGAGCCAATGGTGCGTGGCAAGCTGAAGTTGAAGACGGGTGCAGTACCGACGCTTAAGCTGGGACATGATGAGGGCGTGGTCTTTGACAATGAGGCAATTAAGGCGGTACTGCAGCTCGATGGTCCCGGAGAGCCAGAGGTGGAGGAGGGGGGTGAGGTAAGCAGAGAATCGCTGATAAAGATTAAAAAGGAAAAACTAGAACCAGATGAGGAGGTGGAGGCGGAGGCAGAGGCAGAGCACAGGCAGCTCGACAATGGAGATGATGATGAACATGAAGAAGATGTGGATGCAGATGAGGATGATGATCATGGCTACTTCGATCCATTGGAACTGCATTGCCGCAAAGAGCGCACGGCGACACATCAGCTGAGCACATTCGGCTTCCCAAAAGATCGTCTGCAGCTGCGCAAGTGGAGCACCAATCTACAAATCCCCGAAAGCGATTGCGTGGGGCGTGTCTGCATTGAGCATTTCGAGGCGGAGGTGCTGGGCACGCGTAAGCTGAAACAGCATGCAGTGCCCACTTTGAATTTGGGCCACGATACGCCTCTGATCTACAGATGCAATGGACAAGCGCAGCCCATGGGAGGCATCTTCGATGAGCAGCCGCAGCATTCGGTTTTTCGGCTTTGGAGCCTGAAACACTGTCGCAAAAGGAAGCTGGAAGCAATGGAACCGCCCGATCATCATCACCATCAACGAGCAGTGATTAAGCAGGAGTTGGAGAAGGAGCAGGAGTTTGGCCTTAGCTGTTGCCTGCCCCACTGTGGCAAGCAGGCGGATGAGGTGCAACTGCATCGTCTGCCCAGCGATCGCAATCGACTGCGCAAATGGTTGCACAATCTCAACCTGCCCCAGAGCATTCCCATCGACAGTCAGACACGCGTCTGCAGCGAACACTTCGATGTGCAACCCAGTGAGCCACCAGAGGATTGTATGCCCACCTTGAAACTGGGTCACAACGATACCAATCTATATCGCAATCAAAAAGCCTCCGCCTCCTCATCCTGCCTGGTGCCAAGCTGTCCGTGTGCCCGACTTAATCTCTATCGTGGCTATGATTTGCCTGAGCATCATCTGGTGCAGCAGGCTTGGCTAGAGTTCCTGCGACAACCCATGCCTCAGTTGCCCGGCGATGGTCAGCTTTGTGTGATGCACTATATGCAGCTGTACGAGCAGGTCTCGCTGCCGCAAGAGGTGCCAGCGACAGTGTTGCGTCAACTGCAGGAAACCTACGAACAGATTGCCAACTCCACCATGGCCATGAAACTGCGTTGCGCTGTGCCCGGCTGCTACTCCAAGTATACGGATAACATCAGGTTAACCAAGCTGCCCACCTGCTCGGACATGTTTGCCAAGTGGCTGCACAATACCAAGATCAAATACGAAGCCAACAGGCATTATATCTATCGCATTTGTATGCTTCACTTTGAGCCACGTTGTTTGGGTCCAGTGCGTCCCAAATTGTGGGCGGTGCCAACACTGCAGCTGCATCACAACGATGCGAATATCTATAGGAATCCCAAGCTGGATGGGGGTGAAGAAGAAGTACAGCAAGTCTTTTCACAACCCATTCCCGTAGAGTTGCCATTGCGTATCAAGACGGAGCTGGCCTACACTGGCAGTCCCAGTGCCAGCGCCAGTCCCAGTCCACGTGGCAAGTTGCGCATTTGTTGCATTCCCAGCTGTGCCCAGCAGGCCAACTCATTGGTGCGTCTCTTTCGCTTTCCCACCTCGGAAACATTATTGCTGAAGTGGCTGGTGAACACGCAGCAAAAGCCACGTCTGGCAGATCCACAGCATCTGTTCGTCTGTCAGGATCATTTCGAAACGGAAGCCATTTGCAAGAAGCAGCTAAGCAGTTGGGCTGTGCCCACTTTGAAGCTAGGACACGATGGTCACATCATTCCAAATGCCAAGCACAATGGCAACATTGCGGATACACATGAGAATCGCAACACGCTACAATTCATTTGGGCCAACTACTGCTCTGTGCTGGACTGCTTTGAGCCGCGCAGTGAGCAACTGCGTCTCTTCGCATATCCAACAGATCGACCCACCATACGCAAGTGGGCTGCCAATTGCAAGCATCGTTCCATGCAGGCCAGCAGCGATGGCTTTCAGGTTTGTCAATCGCACTTTGAGCCACATTGTTTTGATCCGGAAACGGGAGAGTTGCGAGTGGATGCAGTGCCCACGCTGGAGCTAAGTCGAGGCATCAATAAGATGGGCTGCATGGTGCCTGGTTGTGTTAAGGAGGAGAATGGACCGCGTCGTCGCTTTTTCAAGGTGCCAAAACGTTCGTCGCAGCTCTTGGATTGGTGTCACAATTTCCAAATGGATTTATCGACTATATCCATTGAGAAGAGTGTTGTGTGTGATCGTCACTTTGAGTCGCATTGCTTGAATGTTCACAAGCAGCTGCGTCGTGGTGCACGTCCCACCCTCCACTTGGGCCACACTGAGGATGTGGAGATATTGCAGAACCCATCGAATTGGGCCCGTTGCATTGAGGTGACTGAGACTCCAGCCGATGTCTGCTGTGTGCCCAATTGTGGACGGGAAAAGGATGCCGAGGAAGAAGATGCTGATATGCAACTCTTTAGTTTTCCCAAGGTGCGAACACTGGCGGAGAAATGGCTGAGGAACATTCGCCTCGAGGCAAACAAGGAACAGTTGGCGGAGTTGAAAGTCTGCAATGCACATTTTGAAAACTACTGTCTGGAGAATGGACGCCTACAGTTGGGTGCCATGCCCACACTGCAATTGGGACACGATGATCAACATGATATACATCGCTCCACTGGTCTCACATCCTCGGCTTCCAAAATGAAACGTTATTGTAATAGAAATGGATGCAGCTATGATTGCTGCTTCCCTCAATGTGTGGAGCTGCAGAAGAGTTATCTGCGTATAAGCTACGATATGCCACAGTCGCAAGCAATGCGTGATGCCTGGATAGCCTACATGGAGTTGGAGGAGCCGAAAGGGCAAGAAAACGAGCAGAAACTTAAGCTATGCCCGCTGCATTTGATCATCCTATACGAGCACAGCGTGGACAATTTCCCCGAGCACACAACGGAGGAACTGTTGGAGGATAACTATGCAGCAGCGAGGAATAGTGTGCGTATACGTATCATCAGTTGTGCAGTGCGTGGCTGCAAGACTCTGAAGACACGCGATGGCGGTTGCCTACATGGTCTGCCCCAACGTAGGGATATACTGCAGATGTGGTTGCACAACATGCAGTTGGTGTTCTACGAGCAACAGCGTTACATGTACAAGATCTGTAGCAAACACTTTGAGCCCACTTGCTTTATGGACACAACACGCCGCCTCAAGCCGTGGAGTATGCCAACATTGGAGTTGCCTGAGCCCGAACCGGGTGAGCCAGAAGTCTATCAGAATCCCACAGAAGCTGAATGGCAGCAAATGAATGAACAGTGGGCCGCAGAGCAGCTGCAAATAGATGAACCAATGCTGGAGGATGAGGAGCGTGAGGGAGGAGTAGAGCAAGTGAAGCTTGAGCCGCAGACGGAGATGGATCAGGAGAATCTGCTGGCATATGACGAGGGGGATTACTCACAGCCTCCAGTGCTTGATGGGGAGGAGGATGCCAGTTCGCAGCAGCCGCTGGAGATGCAACCCTTGGAGGTGCTGCTCGAGGTGGGTCATGTAGAGAAGTGCAGAACGTACGAGCAAATGGACACAAAGGCAAATCTATGCTATGCCGAGCAACAGGCAGCGGCTGGACCGCTGCTTAGTAATGGATCCGTTGGCAGCAATGGCAACAAGTACAGTGCACGCAATTGCAGTGTACAAGGCTGCCACGTGACCTTGAACGATATCTCTGGCAATGTGAAGTTGCACAAATTTCCCACCTCGTGGGAGGGAATGCAAAAGTGGATGCACAACACCCAGGTGAAGGTGTGCCGTAGTGTTTCGTGGCGTTTTCGCATTTGCAGTTATCATTTTATGGAGGATTGTTTTCAAGGCTCTCGCCTGAGGCGTGGTGCCATGCCCACATTGCAATTGGGCCCCAAGCGACCTTCGCATATCTATGAAAGTGAATTCAATGTTGGCGATGCCGATGAGGAGCATCTCATTGAAGAGTCATCAGCACAAAAACCAATGCCAATTGTGGTGGGTGGAGATAATATTAGTCTGTGCCTGCCTAACCCAGCACCGCCTCGCAAGTCGAGCAAGTTCTGCCAAGTCGATGGTTGTCCTAATCATCTCACCACCGAGAATATGACGCTGCACAAGTTCCCCCACTCGCAGGAAATGTGTGCCAAGTGGCAGCATAACACACAGGTGACATTCGATCCCGTCTTCCGTTGGCGCTATCGCATATGCAGCGCCCACTTTGAACCGATCTGTCTGTTGAATATGCGCCTGCTGCACGGCAGCGTGCCCACACTGCACTTGGGTCCGCGTGCTCCGCAAGAGCTCTTCGACAGCGACTTTGAGGCCATCAGTATGCGCTTGGATAAACAGAAGTGCAGCTCCGAGCAGCAGTTGTATATCAAACAGGAGCAACAAGAGGATCACTATGAGGATGAGGGGCAGGATGAGGATGAATTAAGTTTTCTGGTGCCAGAGATGCAGCTGCATGAAGATGCAGCGGAACATGAAGCTGACATAAAGACGGAGCACAGTCCAGCCTATAACGGACGCTGGAAGGATTTGCGTTTGCCCAGCATCAAGCAGGAGGATACAACGACCATGACGTGCTATAATCCCGTCAAGTCTGGCTATGACAAGTGCTCGCTGGTGCATTGTCAGCGTCAGCGATCTCTGCACGGCGTGCATATTTACAAGTTTCCACGTTCCCATCAACTGCAGCAGCACTGGATGCACAATCTGCGGATACGCTACGATGAACGAAGACCCTGGAAAACAATGATATGCAGTGTGCACTTTGAGGCACAATGTATTCGTCTGCGTAAGCTGCGTCCTTGGGCGGTGCCCACACTGGAGCTGGGTGACAATGTGCCCGAGGAGATCTTCACCAATGAACAGAGTCGGCAAGATCCACGTGGCGATGGCATCAAGCTGTTCCGACTGCCGAACAGAGTGAGCGATATACGAAAATGGGAGGTGGCAACGGGAATGCACTTCACCGAATCGCAGCGTAACACGAAGCTCATCTGCAGTCGTCACTTTGATCCGCAGTTGATTGGCGTGCGTCGTCTTATGTACAATGCGGTGCCGACTCTTCATCTACGACCAGAGACTGTACGAGAAGAGAGAATACCTCCACATCCTCGTCCAGCGGGGCCACGCTGTTTCATGGCCAGTTGTCCACAGGATTTGCAGCAAAAGCTACACAAGTTTCCCAGCGATCCTCAGCAGCTTCAGGAATGGTGTGCTGTGCTCAGAGTCTCCGTAACGCAGAACTTTAGCAACTTGTACATTTGTGACGATCACCTGCCAAAGCTGGCAAGAAGCTGCCTCATTTGCAATGTGGAGGAACAGCAGCTGCCACTGCTCAATTTCCCAGAGAATCGCAATCAACGCGCCAAATGGTGCCATAATCTTAAAATTGATCCCATTCCTAAGTGGGAGAACTCAAAGCACATTTGCAGTCGACACTTTGAGAGCTACTGCTTCATTCATCCGGGTCAACTTAACCCAGAAGCGGCTCCTACATTGCATTTAAAGCACACGGACAGCAATATATTCCTTAACGATTATGCCATAGATTCCAGCAAGATGATGTTGCGCATTAAGGATGAGCCCTTGGAGAGTGACGATCTGATGCTGTAA
Protein Sequence
MIIKSEPIDEHAYKSNYIDDNTPFADFSKFPEFGDDMLSPKVELTVKDEAYQKNPLSYPRRKLQSERSSESLPICQRCKEVFFKKQIYLRHVAESSCTIQEYDFKCNICPMSFMSTEELQKHKHMHRADKFFCHKYCGKYFDTIAECESHEYMQHEYESFICNMCSVTFATREQLYAHLPQHKFQPRFDCPICRLWYQTAVELHEHRMAAPYFLPTPHHKATTGSALPATAALNSLLQQRQANADGAALFASSLKNEVNVKLERSYSNSTTSESGYSLHDSSNYNNAYGSDASLHGGASAGAIGGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLHQWLHNLKMFHIPASSYATYRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPNCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPAVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTGSAGSASGMNVSDSMDVCCVPSCESKRHNNENITFHTIPRRPEQMRKWCHNLKIPEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLNLGHDDDDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVSLLTKWCANLQRPVPDGTKLFNDAICEVHFEDRCLRNKRLEKWAVPTLVLGHDNIAYPLPTAEQVAEFYARPSAPNNGEEQGECCVETCKRNPSVDDIKLYRPPEESSVLAKWAHNLEVDAAKLSSMRICNLHFESHCIGKRMRPWAIPTLNLSNNVENMYENPEHHMIYKRRTQIKTERASAGASGAGASMSKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPLVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTLDLNAPPGYKIYQNPAKLKANKLCLQRVCIVESCRRQRGQGVQLFRLPHSPTQLRKWMYNIRMRPRGAMRQQYRICSIHFETHSFNGKRLSAGAIPTLELGHDNEDIYPNEAQSFVEEHCAIDGCDASKEQPDVRLFRFPTDDEDLLWKWCNNLKMNPVDCIGVRICNKHFEVDCIGPKHLYKWAIPTQELGHDDSQIELINNPKPEERYVDPVFKCCVPTCGKTRKFDEVQMNSFPKDPTLFQRWRHNLRLEHLQFKERERYKICNAHFEDICIGKTRLNIGSIPTLELGHDDTEDLFQVNPAELQSNLFGRQRRVQESGMRVRIKEEEHSEQEETDITMSEATDMNTKQVKIKKRLADLKCCVPSCGRSRLEHGARLFPFPTGKQQQSKWRHNLRLDASDVDKTSQVCSAHFNRRCIDGKQLRGWAIPTQQLGHQEQPIYENPKNIPGFFTPTCALAHCRKRRSIDNDLRTYRYPRSEELLEKWRVNLRLAPDQCRGRICADHFEPMVRGKLKLKTGAVPTLKLGHDEGVVFDNEAIKAVLQLDGPGEPEVEEGGEVSRESLIKIKKEKLEPDEEVEAEAEAEHRQLDNGDDDEHEEDVDADEDDDHGYFDPLELHCRKERTATHQLSTFGFPKDRLQLRKWSTNLQIPESDCVGRVCIEHFEAEVLGTRKLKQHAVPTLNLGHDTPLIYRCNGQAQPMGGIFDEQPQHSVFRLWSLKHCRKRKLEAMEPPDHHHHQRAVIKQELEKEQEFGLSCCLPHCGKQADEVQLHRLPSDRNRLRKWLHNLNLPQSIPIDSQTRVCSEHFDVQPSEPPEDCMPTLKLGHNDTNLYRNQKASASSSCLVPSCPCARLNLYRGYDLPEHHLVQQAWLEFLRQPMPQLPGDGQLCVMHYMQLYEQVSLPQEVPATVLRQLQETYEQIANSTMAMKLRCAVPGCYSKYTDNIRLTKLPTCSDMFAKWLHNTKIKYEANRHYIYRICMLHFEPRCLGPVRPKLWAVPTLQLHHNDANIYRNPKLDGGEEEVQQVFSQPIPVELPLRIKTELAYTGSPSASASPSPRGKLRICCIPSCAQQANSLVRLFRFPTSETLLLKWLVNTQQKPRLADPQHLFVCQDHFETEAICKKQLSSWAVPTLKLGHDGHIIPNAKHNGNIADTHENRNTLQFIWANYCSVLDCFEPRSEQLRLFAYPTDRPTIRKWAANCKHRSMQASSDGFQVCQSHFEPHCFDPETGELRVDAVPTLELSRGINKMGCMVPGCVKEENGPRRRFFKVPKRSSQLLDWCHNFQMDLSTISIEKSVVCDRHFESHCLNVHKQLRRGARPTLHLGHTEDVEILQNPSNWARCIEVTETPADVCCVPNCGREKDAEEEDADMQLFSFPKVRTLAEKWLRNIRLEANKEQLAELKVCNAHFENYCLENGRLQLGAMPTLQLGHDDQHDIHRSTGLTSSASKMKRYCNRNGCSYDCCFPQCVELQKSYLRISYDMPQSQAMRDAWIAYMELEEPKGQENEQKLKLCPLHLIILYEHSVDNFPEHTTEELLEDNYAAARNSVRIRIISCAVRGCKTLKTRDGGCLHGLPQRRDILQMWLHNMQLVFYEQQRYMYKICSKHFEPTCFMDTTRRLKPWSMPTLELPEPEPGEPEVYQNPTEAEWQQMNEQWAAEQLQIDEPMLEDEEREGGVEQVKLEPQTEMDQENLLAYDEGDYSQPPVLDGEEDASSQQPLEMQPLEVLLEVGHVEKCRTYEQMDTKANLCYAEQQAAAGPLLSNGSVGSNGNKYSARNCSVQGCHVTLNDISGNVKLHKFPTSWEGMQKWMHNTQVKVCRSVSWRFRICSYHFMEDCFQGSRLRRGAMPTLQLGPKRPSHIYESEFNVGDADEEHLIEESSAQKPMPIVVGGDNISLCLPNPAPPRKSSKFCQVDGCPNHLTTENMTLHKFPHSQEMCAKWQHNTQVTFDPVFRWRYRICSAHFEPICLLNMRLLHGSVPTLHLGPRAPQELFDSDFEAISMRLDKQKCSSEQQLYIKQEQQEDHYEDEGQDEDELSFLVPEMQLHEDAAEHEADIKTEHSPAYNGRWKDLRLPSIKQEDTTTMTCYNPVKSGYDKCSLVHCQRQRSLHGVHIYKFPRSHQLQQHWMHNLRIRYDERRPWKTMICSVHFEAQCIRLRKLRPWAVPTLELGDNVPEEIFTNEQSRQDPRGDGIKLFRLPNRVSDIRKWEVATGMHFTESQRNTKLICSRHFDPQLIGVRRLMYNAVPTLHLRPETVREERIPPHPRPAGPRCFMASCPQDLQQKLHKFPSDPQQLQEWCAVLRVSVTQNFSNLYICDDHLPKLARSCLICNVEEQQLPLLNFPENRNQRAKWCHNLKIDPIPKWENSKHICSRHFESYCFIHPGQLNPEAAPTLHLKHTDSNIFLNDYAIDSSKMMLRIKDEPLESDDLML

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01548887;
90% Identity
iTF_01556869;
80% Identity
-