Basic Information

Gene Symbol
-
Assembly
GCA_018904235.1
Location
JAEIFU010000182.1:103239-132810[-]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 29 4.1 9.2e+03 -2.7 2.6 49 63 182 200 163 217 0.59
2 29 3e-15 6.7e-12 45.8 4.6 1 86 414 486 414 487 0.86
3 29 9.1e-15 2e-11 44.3 5.0 1 87 514 583 514 583 0.83
4 29 7.8e-16 1.7e-12 47.7 0.2 1 87 605 677 605 677 0.85
5 29 5.6e-16 1.2e-12 48.2 5.9 1 87 775 845 775 845 0.81
6 29 5.8e-15 1.3e-11 44.9 3.3 1 86 869 940 869 941 0.82
7 29 1.8e-13 4.1e-10 40.1 1.0 1 87 976 1044 976 1044 0.80
8 29 1.9e-11 4.2e-08 33.7 2.0 1 86 1084 1153 1084 1154 0.78
9 29 2.2e-16 4.8e-13 49.5 0.4 1 86 1181 1250 1181 1251 0.82
10 29 1.4e-13 3.2e-10 40.4 1.3 1 86 1272 1341 1272 1342 0.80
11 29 2.6e-15 5.9e-12 46.0 0.8 1 86 1369 1440 1369 1441 0.85
12 29 7.3e-13 1.6e-09 38.2 2.8 1 85 1517 1585 1517 1587 0.82
13 29 7.3e-13 1.6e-09 38.2 0.1 1 86 1611 1679 1611 1680 0.83
14 29 4.1e-14 9.2e-11 42.2 0.3 1 87 1796 1865 1796 1865 0.79
15 29 1.1e-09 2.5e-06 27.9 0.1 1 87 1930 1997 1930 1997 0.82
16 29 2.5e-05 0.056 14.0 0.2 1 73 2017 2075 2017 2086 0.65
17 29 5.1e-14 1.1e-10 41.9 0.1 1 86 2098 2167 2098 2168 0.85
18 29 1e-14 2.3e-11 44.1 0.3 1 87 2227 2297 2227 2297 0.83
19 29 2e-11 4.4e-08 33.6 0.2 1 86 2332 2403 2332 2404 0.79
20 29 3.1e-13 7e-10 39.3 2.5 1 87 2414 2486 2414 2486 0.83
21 29 2.9e-12 6.6e-09 36.2 0.5 1 86 2512 2582 2512 2583 0.79
22 29 1.1e-06 0.0025 18.3 0.1 1 58 2616 2667 2616 2686 0.83
23 29 4.7e-13 1e-09 38.8 0.6 1 87 2705 2777 2705 2777 0.83
24 29 1e-15 2.3e-12 47.3 0.8 1 87 2923 2996 2923 2996 0.83
25 29 4.1e-13 9.2e-10 39.0 2.7 1 86 3057 3127 3057 3128 0.80
26 29 2.4e-14 5.3e-11 42.9 4.9 1 86 3236 3306 3236 3307 0.84
27 29 2.9e-12 6.4e-09 36.3 0.5 1 86 3381 3450 3381 3451 0.84
28 29 2.6e-11 5.7e-08 33.2 1.0 1 58 3475 3523 3475 3539 0.86
29 29 1.2e-08 2.6e-05 24.7 1.2 18 87 3540 3598 3528 3598 0.75

Sequence Information

Coding Sequence
ATGGATATTATGGAGCCGCTGGCGGCGGGCTCAATGTCAATGCTGTGGGCGGTTCAAAGATCTTTTATGGGAGGACATCATGGCATATATCCCTATATAAAAAGCGAACCCGTGGAGTATGCCAATCCCATGGCTCCGCCACCAGCTCCTTCAGAAATGAGAATCAAATCGGAACCCACTGACGAACATGCCTATAAGTCCAACTATATTGATGATAATACTCCCTTTGCTGATTTTAGTAAGTTTAATGAGTTCAGCGAGGATATGTTGAGTCCCAAAGTGGAACTAACCGTCAAAGATGATTCCTATGGAAAGAATCCTAATAATTATCCGCGGCGCAAAATGCAAACCGAACGGGCTGAGGAGAGTTTGCCGATTTGTCAACGCTGCAAAGAGGTTTTCTTTAAAAAGCAATCCTACCTGCGTCATGTGGCGGAAAGCAGCTGCGGCATCCAGGAGTATGATTTCAAATGTAATATTTGTCCCATGTCCTTTATGAGCACGGAAGAGTTACAAGCGCACAAACAACTACATCGAGCGGATAAATTTTTTTGTCATAAATATTGTGGCAAACATTTTGACACAATTGCCGAATGTGAATCGCATGAATATATGCAACATGAGTACGATAGTTTTGTGTGTAATATGTGCTCGGGAACATTTCCTACACGTGAGCAGCTTTATAACCACTTGCCTCAGCATAAGTTTCAACAGCGTTACGATTGTCCCATATGTCGTTTGTGGTATCAAACGGCTGGAGAACTCCACGAACATCGTTTGGCGGCACCTTATTTCTGTGGCAAATATTATACGGGAGCACAGTCGGCAGCTGCAACTCAATCGCATCAGTCGCAGTCACAGCAGCAATCACATCAGACGAATTATAAACTGCAGGATTGCCATATGGCCACCATGGAGATGCCTACGCCTCCAGCAGCTGCAACAACATCCCATCACAAATCTAGCCCCTCAACTTCAGCATTACCGGCCACAGCAGCTTTAAGTTCCCTGCTGCAGCAGCGCCAGGCTAATGCCGATGGAACCTCAATGTTTTCCCATAAAAATGATGTGAATGTCAAGCTGGAGCGCAGCTATAGCAACTCCACCAGCGAGTCCTCATTTAGTATGCACGAATCGAACTACAACAATACCTATGCCAGCGATAATTCCCTACATGGTGGAGCTCTGGGTGGTCCCCAGGCCCATTCCTCAACACTGGACGACTCGGAGGATGCTTTATGCTGTGTTCCCCTCTGCGGAGTAAGGAAGAGCACTAGTCCCACACTACAATTCTTTACCTTCCCCAAAGATGAGAAATATTTGCATCAGTGGCTACACAATCTCAAGATGTTCCACATCCCAGCCTCAAGCTATACGACTTTCCGAATCTGCAGCATGCATTTCCCCAAACGATGCATCAATCGTTATTCGTTGTGCTATTGGGCAGTACCAACATTCAATTTGGGACACGATGATGTGGCTAATCTTTATCAGAATCGCGAACTGACCAACACTTTTACGACTGGTGAAATAGCACGCTGTAGCATGCCCCATTGCAGCAGCCAAAGGGGTGAAAGTAATCTTAAATTTTATAATTTCCCTAAGGATATTAAAAGTTTGATCAAATGGTGTCAGAATGCTCGCCTGCCTGTCCAGGCCAAAGAGCCGCGACACTTTTGTAGCCGCCATTTCGAAGAGCGTTGCATTGGGAAATTTCGTTTAAAACCCTGGGCAGTTCCCACCTTGCATCTGGGTGCACAATATGGTAAAATTCATGACAATCCGAAGAATTTGTATGTGGAGGAGAAGCGTTGTTGTCTCAACTTTTGTCGACGCAGCCGCTCATCGGACTTTAATATGTCGCTTTATCGTTTCCCGAGAGATGAAGTGTTGCTGCGACGCTGGTGTTACAATCTACGCCTAGATCCCGGTGTATACCGTGGCAAAAATCATAAAATATGCAGTGCTCACTTCATTAAGGAGGCATTGGGACTAAGGAAACTATCGCCAGGTGCCGTGCCCACCCTTCATTTAGGTCACAACGATACCTTTAATATCTATGAGAACGAATTATGGCCACCACCTACGCCCAGTTCCACACCACACAGCAATCATCATCAGCAGCAGCAGCTCCATCATCAGCAACAGCATTCTCATCATTCAGCTCAGCAACACCACAGCAAAGCATATCAACGTCATTCGGCAGCATCCACTTCATCATCGGCCAGTTCGGCTTCGCACTATATGGACGCCGATATGAGTGGATCATATCTTGGTATGGGCTCTTCAAGTGGCCTGAACGTCACCGATAGCATGGACATTTGCTGTGTGCCCAGCTGTGAGAGCAAGCGTCATAACAATGAAAATATTACATTCCATACGATACCCCGAAGGCCAGAGCAGATGAGGAAATGGTGTCACAATCTTAAGATACCCGAGGACAAAATGCACAAAGGCATGCGAATATGCAGTTTGCATTTTGAATCTTATTGCATTGGCGGTTGTATGCGTCCCTTTGCGGTGCCCACCCTGCATTTGGGTCACGACGATGAGGATATACATCGTAATCCGGATGTGATCAAGAAGTTAAATATACGCGAAACCTGCTGTGTGGCGGTGTGCAAGCGGAATCGTGATCGTGATCATGCTAATCTCCATAGATTCCCCAGTAATGTACCCCTACTGACCAAATGGTGTGCGAATCTACAGAGACCAGTGCCGGATGGCACCAAATTGTTTAACGACGCCATATGTGAGGTGCATTTTGAGGATCGTTGTTTGAGAAATAAACGACTGGAAAAATGGGCAGTACCTACTCTAATATTGGGTCATGACGATATTGCCTATCAGTTGCCCACGCCCGAGGAAGTGGCCGAGTTCTTTGCCCGGCCCAGTGCACCAAATAACGGGGAAGAGCAGGGCGAGTGCTGCGTGGAGAGTTGTAAACGTAATCCCAGTGTGGATGACATCAAACTTTATCGTCCACCCGAGGAAGCGGAGGTCTTAAGCAAATGGGCGCACAATCTACAACTGGAGGTGTCCGAGTTGTCCAATCTACGGATATGTAACTTACATTTCGAGTCCCATTGCATAGGCAAACGCATGCGACCCTGGGCTATACCCACTTTGAATCTATCGGGTAATATAGAAAACCTCTATGAGAATCCCGAGCACACCATGTTGTATAAGCGACGAACCAGAGATCCCAATAGAGATTTAAGTGGTCACAAGCCCACTTGGGTGCCCAGATGCTGTCTGCCCCATTGCCGAAAGGTGAGAGCCTTGCATAATGTGCAACTCTATCGATTCCCCAAGCTAAATCGCTCCACTTTGGCTAAATGGGCCCATAATTTACAAGTGCCGATGGTGGGCAGTGCACAGCGACGTGTCTGTTCGGCTCACTTTGAACCGCATGTGTTGAGTAAAAAGTGTCCAGTTCCCATGGCGGTGCCCACTCTGGAGTTGAACTCTCCTCCGGGCTACAAAATCTACCAGAATCCAGCCAAATTGAAAGCCAGCAAACTGTGCCTGCAAAGGGTATGCATAGTGGAGAGCTGTCGAAAGACTCGTGCTCAGGGGGTACATCTATTCCGTCTGCCCCACAGTCCTACCCAGCTAAGGAAATGGATGCATAATATACGCACACGTCCCCGAACAGCAATGAGAGCTCAATACCGCATCTGCTCCGTACACTTCGAAACGCATTCATTCAATGGCAAAAGATTAAGTGCTGGAGCGATACCCACTTTGGAATTGGGCCATCAGGATGACGATATCTATCCCAACGAAGCTCAATCCTTTGTGGAGGAACATTGCGCCGTTGAGGGCTGTGACTCGACCAAGGAACAAGCGGAAGTGCGTCTCTTCCGTTTTCCTACTGACGATGATGATTTGCTATGGAAATGGTGTAATAACCTCAAAATGAACCCCGTGGATTGTGTGGGCGTTCGTATATGTAATAAACATTTCGAGACCGATTGTGTGGGCCCCAAACATCTGTTTAAATGGGCCATACCCACATTGGCCCTGGGACATGATGATTCACAAATCGAGTTGATCCCAAATCCAAAGCCCGAAGATCGTTATGCGGAACCTTTGTTCAAATGTTGTGTACCCACTTGTGGCAAGACCCGTAAATTCGATGAGGTACAAATGAATAGTTTTCCCAAGGATCCCACACTGTTTCAACGTTGGCGTCATAATCTCCGACTGGACTATTTGGACTTCAAGGAACGGGAACGTTATAAAATTTGCAACGACCATTTTGAGGATATATGCGTTGGCAAGACACGCTTAAATATTGGATCCATTCCTACCTTGGAGTTGGGACATGAGGAAACCGAGGATTTGTATCAAGTAAATCCTGCTGACTTGCAAAGTAATCTGTTCGGACGCCAGCGTAGATTGCTAGGAGATAGCAAACAGGAGGAGGAGGCTGAGGAAGTTGAACAGGAGGCTGAACTCAAACCCAGGGCTAAGGTCATGGAAGAAGCTACCGATTTGAACACAAAGCAAGTGAAGATCAAGAAATCTCTGGCCGAGCTGAAGTGCTGTGTGCGTAGCTGTGGTCGTAGCCGCTTGGAGCACGGTGCTCGCTTGTTTCCCTTTCCCACGGGCAAGCAACAGCAACTCAAATGGCGTCATAATCTTCAGCTGGAACCGCACGAGGTGAACAAAATGACTCGAGTCTGCAGTGCCCATTTCAATAGACGTTGCATAGATGGTAAACATTTGAGAGGATGGGCCATGCCTACCCAGCAATTGGGGCATAAAAAACATCAGAATATCTACGAGAATCCCAAGAATATACCCGGCTTCTTTACGCCCACCTGTGCTTTGAGTCACTGTAGGAAACGGCGCAGCATAGACAACGATTTGCGAACCTATCGCTATCCAAGGACAGACGATTTGTTAGAGAAATGGCGTGCCAATCTACGTTTAGCACCAGATCAATGTAGGGGACGAATCTGTGCAGATCATTTTGAGCCTCAGGTGAGAGGAAAACTAAAATTAAAGACAGGAGCTGTGCCCACACTAAAATTGGGACATGATGAGGGCTTGGTATATGATAATGAGGCCATCAAGGCGGGTCTAAATGAGGATGAGGAGGCTAGTTTGGATTCGCCCCCAGTACAGGCCAAAAAGAAGGATCGGGAAGAAGATGATAAAGAAGAAGATGGGGAAGATGCGTCCGCAGAAATGCAGGATCCTGCCGAAGACGGGCAAGACGAAGAGCAAAGCGGCTACTTCGATCCATTGGAACTGGTGGAAACTTTTGCACATGACCAGTCAAACGCGCCCGATGAGGACTCACCTGCAACACCTCCCCCATCGGCTGCTCCCCAATTGAAGCGTGATAAGCCCGCCAATAATGTCACCCCCATTTGTTGTCTCAAACATTGCCGTAAGGAACGCACTCCCAAATATCTTCTGAGCACTTTCGGCTTCCCCAAAGATCAACAGCTGCTTCTGAAGTGGAGTGCCAATCTACAACTGGAACTCGACGACTGTGTAGGTCGCGTGTGCATGGAACATTTTGAGGCGGAGGTCTTAGGGTCACGTAAACTCAAGCAGAATGCTGTGCCCACTCTGAATGTGGGTCATGATTTTGATTTACCCTACACCTGTAATGGTCAGGAGCGTTCCACAGAAGATCAGTCGGTTTTTCGGCTTTGGAGCCTGAAACATTGCCGTAAAAGGAAGCAAGAGGAGCCGCCGGATGTCCCCGTAGGGAAGCAGGAGAAGACGGAGGTAAAGGAGGAGCCGAAAGAGAGTAAACCAAAGATGTGTTGCCTCAGTCATTGTGGGAAGATGAAGCGCTTACAACGCTTACCGCAGGATCAATTGCTGCTCAACAAATGGTTACGCAATCTTAAATTGGTCATGGCCAAACCCACCGATCCTCAGGAACTGCGTTTGTGCCTAGACCATTTCGAACCGCATTTGGTAAGAAATGGTGAGCTCTTGCCCATTGCTCTGCCCACTCTCAATCTGGGACACCAAGATCCCAAGATATACCGAAACACTGTGAGCGCCAACGAAAGTGGTTGTCTAGTTGCCAGTTGCCCTTGTGCACGCCTTAACCTCTATCGCAGTTGTGATCTACCTAGGAATCCGGAAGTAAGACAAGCCTGGTTAAGTCATCTTAAGCTCAGTCCAAACACCCACGGACAACTCTGTGTCATGCACTTTATGCAGCTCTACGAAGATATTAAAACGGAGGCAAATTCTGCTGAACTTAAGTCAGCCTATGAATCAATAGCCAACTCCGCACAGGCCCTGAAACTACGCTGTGCCGTGCCCGGTTGTCGTTCCAAATACACCGATAACATACACTTGACCCGGTTGCCCTATGATCAGCACATCTTCAGCAAATGGTTACACAACACGAAAATAGAATATGATCCTTCCAGACATTATATCTATCGTATTTGCCTGCTGCACTTTGAGCCCGACGCCTTGGGCGCCGTACGTCCCAAATCGTGGGCCGTTCCCACCCTAAAACTCAATCACCAGGAGGAGATCTACTTGAATTCCACAAAAGAGGAACCTCAGCTGCATACGCCACTTCACATTAAAACCGAACCTTCCGTTCCGCTGCGGGTTAAAACAGAACTCACGGCTTTGACTAGTCCCAGCATAAGTTCCACGCCCAGTCCTCGTGGCAAAATCCGTATCTGCTGTATACCCACATGCCTGCAGCAGTCCAGCTCGCAAATGCGTCTCTATCGGTTTCCTACCGCTGAGTCCGCTCTCCTCAAATGGCTGGTCAATACTCAGCAGCAACCACGTCTGGTGGATGCCCTTCAGCTGTACGTGTGTCAAACTCATTTCGAACCCGAAGCCATTGTCAAAAAGCAATTGAGAAGTTGGGCTGTGCCCACCTTGAACTTGGGACACGATGGTCACATCATTCCCAATGCCAAGCATAATGGTAATATAGCAGACAGCCAGGAAACGGAACAGGCCATGGAGTTTATCCGAGAGAACTATTGTTCGGTACTCAGCTGCTTTCAGCCACGTAGTGAGGATTTGCGGCTTTACGAATATCCCAAGGATATGCCCACAATTAGAAAATGGGCGGCTAACTGTAAACATCGCTCTATGCAGGCAAGCAGTCATGGATTTCAGGTATGTCAATTGCACTTTGGCCCCGAATGCTTCGACCCCGATACTGGAGATTTGCGTGAGGGTTCAGTGCCCTCTCTGGATATAAACAAAAGCCATACCGAATTGCATTGCCTAGTATCAGGTTGTGTTAAAGATGAATCTACACCTCGTCGTCGTTACTACAAGCTACCCAAGCGCTCCACACAATTAGCGGAATGGTGTCACAATCTCTGCATAGAACCGTCTGAGCTTTCGCCAACTGATCAGCACATTTGTGAGCGTCATTTTGAGCCAAGTTGTCTTAATAACTACAAACAGCTGCGGCCAGGAGCACGACCCACCTTGCATTTGGGTCACGATCAGCAAGTAGCATTGCTAAACAATCCGGAGAACTTTAGTGAGCCGCCGGAAGAGGAAGTCGTCTGTTGTGTTCCTCAATGTGAACGTATCCAACAGCCGGATGAGGGGCCTTTTAGTTACTTTCCCAAAATTAAACTGTTGGCCGACAAATGGCTCCACAATTTCCGCCTGGAATCAAGCAAAGAGCAACTGGGCAAGCTGCGAGTATGCCTGAGACATTTTGAGACATCGTGCTTTGAGAATGAGCGTGTGCGTTTGGGTGCCATGCCAACTTTGGAGTTAGGTCATGACGATCGTGATGTTTACCAAACCGATGAACAGGCTCTGGGTAAGGGTCGAAAACATCATAAGAAGAACAGTGGCCTGGGTATGGATTGTTGTTATGCTCAGTGTGCGGAATTGCAAAAGAATTACCAAAGAGTGGTCTATGATCTGCCCATGGATGAGCAACTACGTAGTGCTTGGCTGCAGCATATCGAAGTGCTGGAGGAGGAGGATCAGCCACCACTGAAGCTGTGTCCACTGCATTTTATTATACTCTACGAACATAGCCAGGAACACTTTCCAGACCATGCACCGGAAGAGTTACTCGATCATAACTACGAAGATGCTAGGAATAGTGTCAGAATACGTTTGATCAGTTGTGCTGTACGAGATTGCAAAACCATTCAGCCTCGAGATGGCGGGCGTTTACATGGTCTACCCCCGCGAAGAGATGTATCGCAAATGTGGCTGGAGAATACCCAGATGGAGTACTTTGAGCCACAACGATACATGCTTAAGGTGTGCAGCAAACACTTTGAGCCGCATTGCTTCACGGACAGTCGTAAATTGAAGTACTGGAGTATGCCGACGCTTCATCTCCCAGTTGAGGCGGTACATAAAAATCTCACCGAAGAGGAATGGCAAAAAATGAATGAGAAACCCGCGGAGGCAAGCAAAGAGAAGAAGTTGGAGGAGCAGGATTATGAGGAGAACTCTTTGCTGGAGCCTATTGTCATAATGGAGCATGTGGAATCCGAGAAAGAGGAGGAGGAGCAGGAGAAGGCTGCGGAAGTTGAACATGAGCAACCTTCACCCGAGCTGGTAATCGATGAGCGCACTCAGGACACTGAAGAACGCCAGGAAGAAGAAATGGAAGCACTGGAAGTGCTGCTAGAGGTGGGTCATGTGGAGAAATGTGCCAGCTATGAGAAAATGGATCAGAAATCCCATTTGCCCTACTCCCAGGAGGCCTCTGCCACACCCTCCCACTCCGCCGGTTCTGCGCTACGAAATCAATACAATGCCAGGCACTGCAGTGTCCAGGGTTGTCAGACCAGTTCCCATGACATAGATGGACAAATAAAACTACACAAATTTCCCACCTCAACGGATGCTACCAGAAAATGGATGCACAACACCCAAGTGGATGTGGACGAGAGTGTTTCGTGGCGTTATCGCATTTGCAGCTATCATTTCGAGCAGGATTGCTTTAACGGTGCACGCATTCGCAAAGGCTCCATGCCCACATTACATTTGGGTCGGTCCAGACCAGAAGAAATCTATGAAAATGAATTTGGCCAAATGGAAATGGAAGAATCTTTGGTGATGGAAACACCACCCGCTGCCACAACAAGCCGGGAAACCGCATCCAAGGGAGAGGTAACGCAATTGTTTTTGCCCCGCCCAGCCCCACCACGCAAATCGAGCAAATTCTGTCAGATTGAAGGCTGTCCCAACCATCTGACCAGTGAGAATATGACCCTGCATAAGTTTCCGCACTCGGTGGATATATGTGAACGCTGGCAACATAATACTCAAGTGCCATTTGATCCTGAGTATCGTTGGCGTTACCGCATTTGCAGTGCCCACTTCAAGTCTGTGTGTTTGGTCAATATGAGACTGCTACATGGTAGTGTACCTACCCTAAAACTGGGTCCACGAGCCCCTGCCCAACTGTTTGATAATGATTTCGATGCCATGCACAGCAGAGTGGATAAGAAGCAGACTCAGTCAGCGGTAAAGCACGAGAAATCACAGCAATATCACCAGTATCAGCCGGAGAATGAGATGTACTTCGAGCCTGAATGGGACGAGGAAGAGGATGCAGATGATCTGGAACAGGATGAAGACGGAGAGGAACCACATACCTCTTTTTCCACCCCCTTTGACAGCTGGAGGCATCAATTGCGACTACCTACCGTTAAGCAGGAAAAGTCCCCCTACAATTCCATCAAATCTGGCTACGATAAATGCTCTCTGGCGCACTGCCAACGACAGAGATCTCTACATGGTGTACATATCTTTAAATTCCCACGCTCTAGACGCCATCAGCAGCGTTGGATGCACAATCTACGCATACGCTATGACGAGCGGCGTCCCTGGAAATATATGATTTGCAGTGTACACTTTGAACCGCATTGTGTACGCCTACGGAAACTGCGACCATGGGCAGTGCCTACCCTGGAGTTGGGTGACAATGTGCCTGATCAAATTTATACCAATGAACAGTGTCAGGAAATGGCCACCGACCTGAGCGATGCTGAGACTGATGAGGAGGACGGCCTTCAGGAGGATGATGACGATGAGGAGGAGGAGGTGGACTTGGGACCTTATATCAAAAAAGAAAAACGCAGTCGCTTAGATCCTTATTCAACTGGTCAATTAACAGCCTGGAAAGCTAAACAATGTTGCCTACCATATTGCCGCCGTCCACGGGGTGACGGCATCCGGTTGTTCCGGCTGCCCAGCAATTGCAACTCCATCAAACATTGGGAACAGGCCACAGGCATGGTATTCAAGGAGTCGCAGAGGAATACTCGCCTGATTTGCAGTCGTCACTTTGAGCCGGAACTGATGGGAGTGCGGCGTCTCATGAGAAATGCCATCCCCACCAGGAATTTGGGCCCTCAGAGTGAAAGTTTTAGCAATGAGATTGAGATGGTGGTACCGACGCCCACAGGACCCATGTGCTGCATGGCTGATTGCACAAACGAGGGAAACACGAAGCTGCATCAGTTTCCCAGAGATCCCACATTGCTGCAACAATGGTGTCAATCCCTAAGACTGTCGGACTTCCAAAGCTATCGGGGCAAACATATTTGCTCTGCTCATTTGCCCGCTCAGGCCCTAACTTGCCTCATCTGTGGTGTCGACGACGTTCATCTACCTTTGCTCGATTTTCCCGATAATCGCAACCATCGCGCCAAATGGTGTTATAATCTCAAAATCGAAACCATACCAAAATGGGATACATCCAAGCACATTTGCTGCAAGCATTTCGAATCATACTGCTTTGTCCAACCGGGCGAACTATTACCCGATGCCGCACCAACATTGCATTTGAATCACAATGATACAAATATATTCCACAATGATTATACCATAGATGATCAGCAGCACTCCATGATCCGCATTAAAGATGAACCTCTGGACAATGACGACTTCTTGTTAGTGTAG
Protein Sequence
MDIMEPLAAGSMSMLWAVQRSFMGGHHGIYPYIKSEPVEYANPMAPPPAPSEMRIKSEPTDEHAYKSNYIDDNTPFADFSKFNEFSEDMLSPKVELTVKDDSYGKNPNNYPRRKMQTERAEESLPICQRCKEVFFKKQSYLRHVAESSCGIQEYDFKCNICPMSFMSTEELQAHKQLHRADKFFCHKYCGKHFDTIAECESHEYMQHEYDSFVCNMCSGTFPTREQLYNHLPQHKFQQRYDCPICRLWYQTAGELHEHRLAAPYFCGKYYTGAQSAAATQSHQSQSQQQSHQTNYKLQDCHMATMEMPTPPAAATTSHHKSSPSTSALPATAALSSLLQQRQANADGTSMFSHKNDVNVKLERSYSNSTSESSFSMHESNYNNTYASDNSLHGGALGGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLHQWLHNLKMFHIPASSYTTFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEIARCSMPHCSSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPGVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPTPSSTPHSNHHQQQQLHHQQQHSHHSAQQHHSKAYQRHSAASTSSSASSASHYMDADMSGSYLGMGSSSGLNVTDSMDICCVPSCESKRHNNENITFHTIPRRPEQMRKWCHNLKIPEDKMHKGMRICSLHFESYCIGGCMRPFAVPTLHLGHDDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVPLLTKWCANLQRPVPDGTKLFNDAICEVHFEDRCLRNKRLEKWAVPTLILGHDDIAYQLPTPEEVAEFFARPSAPNNGEEQGECCVESCKRNPSVDDIKLYRPPEEAEVLSKWAHNLQLEVSELSNLRICNLHFESHCIGKRMRPWAIPTLNLSGNIENLYENPEHTMLYKRRTRDPNRDLSGHKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPMVGSAQRRVCSAHFEPHVLSKKCPVPMAVPTLELNSPPGYKIYQNPAKLKASKLCLQRVCIVESCRKTRAQGVHLFRLPHSPTQLRKWMHNIRTRPRTAMRAQYRICSVHFETHSFNGKRLSAGAIPTLELGHQDDDIYPNEAQSFVEEHCAVEGCDSTKEQAEVRLFRFPTDDDDLLWKWCNNLKMNPVDCVGVRICNKHFETDCVGPKHLFKWAIPTLALGHDDSQIELIPNPKPEDRYAEPLFKCCVPTCGKTRKFDEVQMNSFPKDPTLFQRWRHNLRLDYLDFKERERYKICNDHFEDICVGKTRLNIGSIPTLELGHEETEDLYQVNPADLQSNLFGRQRRLLGDSKQEEEAEEVEQEAELKPRAKVMEEATDLNTKQVKIKKSLAELKCCVRSCGRSRLEHGARLFPFPTGKQQQLKWRHNLQLEPHEVNKMTRVCSAHFNRRCIDGKHLRGWAMPTQQLGHKKHQNIYENPKNIPGFFTPTCALSHCRKRRSIDNDLRTYRYPRTDDLLEKWRANLRLAPDQCRGRICADHFEPQVRGKLKLKTGAVPTLKLGHDEGLVYDNEAIKAGLNEDEEASLDSPPVQAKKKDREEDDKEEDGEDASAEMQDPAEDGQDEEQSGYFDPLELVETFAHDQSNAPDEDSPATPPPSAAPQLKRDKPANNVTPICCLKHCRKERTPKYLLSTFGFPKDQQLLLKWSANLQLELDDCVGRVCMEHFEAEVLGSRKLKQNAVPTLNVGHDFDLPYTCNGQERSTEDQSVFRLWSLKHCRKRKQEEPPDVPVGKQEKTEVKEEPKESKPKMCCLSHCGKMKRLQRLPQDQLLLNKWLRNLKLVMAKPTDPQELRLCLDHFEPHLVRNGELLPIALPTLNLGHQDPKIYRNTVSANESGCLVASCPCARLNLYRSCDLPRNPEVRQAWLSHLKLSPNTHGQLCVMHFMQLYEDIKTEANSAELKSAYESIANSAQALKLRCAVPGCRSKYTDNIHLTRLPYDQHIFSKWLHNTKIEYDPSRHYIYRICLLHFEPDALGAVRPKSWAVPTLKLNHQEEIYLNSTKEEPQLHTPLHIKTEPSVPLRVKTELTALTSPSISSTPSPRGKIRICCIPTCLQQSSSQMRLYRFPTAESALLKWLVNTQQQPRLVDALQLYVCQTHFEPEAIVKKQLRSWAVPTLNLGHDGHIIPNAKHNGNIADSQETEQAMEFIRENYCSVLSCFQPRSEDLRLYEYPKDMPTIRKWAANCKHRSMQASSHGFQVCQLHFGPECFDPDTGDLREGSVPSLDINKSHTELHCLVSGCVKDESTPRRRYYKLPKRSTQLAEWCHNLCIEPSELSPTDQHICERHFEPSCLNNYKQLRPGARPTLHLGHDQQVALLNNPENFSEPPEEEVVCCVPQCERIQQPDEGPFSYFPKIKLLADKWLHNFRLESSKEQLGKLRVCLRHFETSCFENERVRLGAMPTLELGHDDRDVYQTDEQALGKGRKHHKKNSGLGMDCCYAQCAELQKNYQRVVYDLPMDEQLRSAWLQHIEVLEEEDQPPLKLCPLHFIILYEHSQEHFPDHAPEELLDHNYEDARNSVRIRLISCAVRDCKTIQPRDGGRLHGLPPRRDVSQMWLENTQMEYFEPQRYMLKVCSKHFEPHCFTDSRKLKYWSMPTLHLPVEAVHKNLTEEEWQKMNEKPAEASKEKKLEEQDYEENSLLEPIVIMEHVESEKEEEEQEKAAEVEHEQPSPELVIDERTQDTEERQEEEMEALEVLLEVGHVEKCASYEKMDQKSHLPYSQEASATPSHSAGSALRNQYNARHCSVQGCQTSSHDIDGQIKLHKFPTSTDATRKWMHNTQVDVDESVSWRYRICSYHFEQDCFNGARIRKGSMPTLHLGRSRPEEIYENEFGQMEMEESLVMETPPAATTSRETASKGEVTQLFLPRPAPPRKSSKFCQIEGCPNHLTSENMTLHKFPHSVDICERWQHNTQVPFDPEYRWRYRICSAHFKSVCLVNMRLLHGSVPTLKLGPRAPAQLFDNDFDAMHSRVDKKQTQSAVKHEKSQQYHQYQPENEMYFEPEWDEEEDADDLEQDEDGEEPHTSFSTPFDSWRHQLRLPTVKQEKSPYNSIKSGYDKCSLAHCQRQRSLHGVHIFKFPRSRRHQQRWMHNLRIRYDERRPWKYMICSVHFEPHCVRLRKLRPWAVPTLELGDNVPDQIYTNEQCQEMATDLSDAETDEEDGLQEDDDDEEEEVDLGPYIKKEKRSRLDPYSTGQLTAWKAKQCCLPYCRRPRGDGIRLFRLPSNCNSIKHWEQATGMVFKESQRNTRLICSRHFEPELMGVRRLMRNAIPTRNLGPQSESFSNEIEMVVPTPTGPMCCMADCTNEGNTKLHQFPRDPTLLQQWCQSLRLSDFQSYRGKHICSAHLPAQALTCLICGVDDVHLPLLDFPDNRNHRAKWCYNLKIETIPKWDTSKHICCKHFESYCFVQPGELLPDAAPTLHLNHNDTNIFHNDYTIDDQQHSMIRIKDEPLDNDDFLLV

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00914540;
90% Identity
iTF_00918172;
80% Identity
-