Basic Information

Gene Symbol
-
Assembly
GCA_018904275.1
Location
JAEIGQ010000229.1:247979-274761[+]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) that probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include its location at the N-terminus of proteins, its size of about 90 residues, a C-terminal AVPTIF box, and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably in worms and flies, but not in other eukaryotes or in any prokaryotes [1].
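The C2CH consensus quoted in the description can be read literally as a regular expression. The sketch below is only an illustration of that spacing rule, not how the domains in this record were detected (those come from the Pfam PF05485 profile via hmmscan). The pattern, the function name find_c2ch, and the choice of test fragment are assumptions made for this example; a plain regex ignores the AVPTIF box and the other conserved residues, so it can both miss real domains and report spurious ones.

```python
import re

# Literal reading of the consensus in the description:
# Cys, 2-4 residues, Cys, 35-50 residues, Cys, 2 residues, His.
# This is an illustrative pattern, not the Pfam model.
C2CH_PATTERN = re.compile(r"C.{2,4}C.{35,50}C.{2}H")


def find_c2ch(protein):
    """Return 1-based inclusive (start, end) pairs for non-overlapping matches."""
    return [(m.start() + 1, m.end()) for m in C2CH_PATTERN.finditer(protein)]


# Fragment copied from the protein sequence of this record.
fragment = (
    "CCVPLCGVRKSTSPTLQFFTFPKDEKYLHQWLHNLKMFHIPASSYTTFRICSMH"
    "FPKRCINRYSLCYWAVPTF"
)

hits = find_c2ch(fragment)
print(hits)
```

Because finditer returns non-overlapping matches and the quantifiers are greedy, closely spaced repeats may be merged or skipped; the hmmscan coordinates remain the reference for domain boundaries.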
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 29 4.8 9.2e+03 -2.4 2.3 49 62 160 177 141 193 0.58
2 29 4.3e-15 8.2e-12 45.8 4.6 1 86 394 466 394 467 0.86
3 29 1.3e-14 2.5e-11 44.3 5.0 1 87 494 563 494 563 0.83
4 29 1.1e-15 2.1e-12 47.7 0.2 1 87 585 657 585 657 0.85
5 29 7.9e-16 1.5e-12 48.2 5.9 1 87 755 825 755 825 0.81
6 29 8.2e-15 1.6e-11 44.9 3.3 1 86 849 920 849 921 0.82
7 29 2.6e-13 4.9e-10 40.1 1.0 1 87 956 1024 956 1024 0.81
8 29 2.6e-11 5.1e-08 33.7 2.0 1 86 1064 1133 1064 1134 0.78
9 29 3.1e-16 5.9e-13 49.5 0.4 1 86 1161 1230 1161 1231 0.82
10 29 2e-13 3.9e-10 40.5 1.3 1 86 1252 1321 1252 1322 0.80
11 29 3.7e-15 7.2e-12 46.0 0.8 1 86 1349 1420 1349 1421 0.85
12 29 1e-12 2e-09 38.2 2.8 1 85 1497 1565 1497 1567 0.82
13 29 1e-12 2e-09 38.2 0.1 1 86 1591 1659 1591 1660 0.83
14 29 2.6e-14 5e-11 43.3 0.7 1 87 1776 1845 1776 1845 0.79
15 29 1.1e-09 2.1e-06 28.5 0.1 1 87 1910 1977 1910 1977 0.82
16 29 3.8e-05 0.073 13.9 0.2 1 71 1997 2053 1997 2066 0.65
17 29 7.2e-14 1.4e-10 41.9 0.1 1 86 2078 2147 2078 2148 0.85
18 29 1.5e-14 2.8e-11 44.1 0.3 1 87 2207 2277 2207 2277 0.83
19 29 2.3e-11 4.5e-08 33.8 0.3 1 86 2312 2383 2312 2384 0.79
20 29 5.5e-13 1.1e-09 39.1 1.6 1 87 2394 2466 2394 2466 0.83
21 29 5.5e-12 1e-08 35.9 0.9 1 86 2492 2562 2492 2563 0.80
22 29 8.1e-07 0.0016 19.3 0.2 1 58 2596 2647 2596 2666 0.83
23 29 6.6e-13 1.3e-09 38.8 0.6 1 87 2685 2757 2685 2757 0.83
24 29 1.5e-15 2.8e-12 47.3 0.8 1 87 2904 2977 2904 2977 0.83
25 29 5.9e-13 1.1e-09 39.0 2.7 1 86 3039 3109 3039 3110 0.80
26 29 3.4e-14 6.5e-11 42.9 4.9 1 86 3216 3286 3216 3287 0.84
27 29 4.1e-12 7.8e-09 36.3 0.5 1 86 3361 3430 3361 3431 0.84
28 29 4.6e-11 8.8e-08 32.9 1.0 1 58 3455 3503 3455 3519 0.86
29 29 1.2e-08 2.4e-05 25.1 1.3 18 87 3520 3578 3508 3578 0.75
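The domain table above has 13 whitespace-separated values per row, so it can be loaded with a few lines of code. The following is a minimal sketch, assuming the column order shown in the header (domain number, total, c-Evalue, i-Evalue, score, bias, then the hmm, ali and env coordinate pairs, then acc). The class name DomainHit, its field names, and the 0.01 i-Evalue cutoff are hypothetical choices for illustration, not values taken from this record or from HMMER defaults.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    index: int        # domain number ("#")
    total: int        # number of domains reported ("of")
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_row(line):
    """Parse one row of the domain table into a DomainHit."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 columns, got {len(fields)}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]),
        float(fields[4]), float(fields[5]),
        *(int(x) for x in fields[6:12]),
        float(fields[12]),
    )


# Three rows copied from the table above (domains 1, 2 and 16).
rows = [
    "1 29 4.8 9.2e+03 -2.4 2.3 49 62 160 177 141 193 0.58",
    "2 29 4.3e-15 8.2e-12 45.8 4.6 1 86 394 466 394 467 0.86",
    "16 29 3.8e-05 0.073 13.9 0.2 1 71 1997 2053 1997 2066 0.65",
]
hits = [parse_row(r) for r in rows]

# Hypothetical cutoff: keep domains with an independent E-value <= 0.01.
significant = [h for h in hits if h.i_evalue <= 0.01]
for h in significant:
    print(h.index, h.ali_from, h.ali_to, h.ali_to - h.ali_from + 1)
```

With these three rows only domain 2 passes the assumed cutoff; a different threshold would change which of the weaker hits (for example domains 1 and 16) are retained.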

Sequence Information

Coding Sequence
ATGGGAAGCCACCATGGCATATATCCCTATATAAAAAGCGAACCCGTGGAGTACGCCAATCCCATGGCTCCGCCACCAGCTCCTTCAGAAATGAGAATTAAATCGGAACCCACGGACGAACATGCCTATAAGTCCAACTTCATTGATGATAATACTCCCTTTGCTGATTTTAGTAAGTTTAATGAGTTCAGCGAGGATATGTTGAGTCCCAAAGTGGAACTAACCGTCAAAGATGATTCCTATGGAAAGAATCCCAATAATTATCCGCGGCGAAAAATGCAAACCGAACGGGCTGAGGAGAGTTTGCCGATTTGTCAACGCTGCAAAGAGGTTTTCTTCAAAAAGCAATCCTACCTGCGTCATGTGGCGGAAAGCAGCTGCGGCATCCAGGAGTACGATTTCAAATGTAATATTTGTCCCATGTCCTTTATGAGCACGGAAGAGTTACAAGCGCACAAACAACTACATCGAGCGGATAAATTTTTTTGTCATAAGTATTGTGGCAAACATTTTGACACAATTGCCGAATGTGAATCGCATGAATATATGCAACATGAGTACGATAGTTTTGTGTGTAATATGTGCTCGGGAACTTTTCCTTCACGTGAGCAGCTTTATAACCACTTGCCTCAGCATAAGTTTCAACAGCGTTACGATTGTCCCATTTGTCGTTTGTGGTATCAAACGGCTGGAGAACTCCATGAACATCGCTTGGCGGCACCTTATTTCTGTGGCAAATATTACACGGGAGCACAGCCAGCAGCTGCAACTCAATCGCATCAGTCGCAGTCACAGCAGCAATCACATCAGACGAATTATAAACTGCAGGATTGCCATATGGCCACCATGGAGATGCCTACGCCTCCAGCTGCTGCAACAACATCCCATCACAAATCTAGCCCCTCAACTTCAGCACTGCCGGCCACAGCAGCGTTAAGTTCCCTGCTGCAGCAGCGCCAGGCTAATGCCGATGGAGCCTCAATGTTTTCCTCTACCCATAAAAATGATGTGAATATCAAGCTGGAGCGCAGTTATAGCAACTCCACCAGCGAGTCCTCATTTAGTATGCACGAATCGAACTACAACAATACCTATGCCAGCGATAATTCCCTACATGGTGGAGCTCTGGGTGGTCCCCAGGCTCATTCCTCAACACTGGATGACTCGGAGGATGCTTTATGTTGTGTTCCCCTCTGCGGAGTAAGGAAGAGCACTAGTCCCACACTGCAATTTTTTACCTTCCCCAAAGATGAGAAGTATTTGCATCAATGGCTACACAATCTCAAGATGTTCCACATCCCAGCCTCAAGCTATACGACTTTCCGAATCTGCAGCATGCATTTCCCCAAACGATGCATCAATCGTTATTCGTTGTGCTATTGGGCAGTACCAACTTTCAATTTGGGACACGATGATGTGGCTAATCTTTATCAGAATCGCGAACTGACCAACACTTTTACGACTGGTGAAATAGCACGCTGTAGCATGCCCCATTGCAGCAGCCAAAGGGGTGAAAGTAATCTTAAATTTTACAATTTCCCCAAGGATATTAAAAGTTTGATCAAATGGTGTCAGAATGCTCGCCTGCCTGTCCAGGCCAAAGAGCCGCGTCACTTTTGTAGTCGCCATTTTGAAGAGCGTTGCATTGGTAAATTTCGCTTGAAACCCTGGGCAGTTCCCACCTTGCATCTGGGTGCACAATATGGTAAAATTCATGACAATCCGAAGAATTTGTATGTGGAGGAGAAGCGATGTTGTCTCAACTTTTGTCGACGCAGCCGCTCATCGGACTTTAATATGTCGCTTTATCGTTTTCCGAGAGATGAAGTGTTGCTCCGACGCTGGTGTTACAATCTACGCCTAGATCCCGGTGTATATCGAGGCAAAAATCATAAAATATGCAGTGCTCACTTCATTAAGGAGGCATTGGGACTAAGGAAACTATCGCCAGGTGCCGTGCCCACCCTTCACTTAGGTCACAACGATACCTTCAATATCTA
TGAGAACGAATTATGGCCACCACCAACGCCCAGTTCCACACCACACAGCAATCATCATCAGCAGCAGCAGCTCCATCATCAGCAACAACATTCTCACCATTCAGCTCAGCAGCACCATAGCAAGGCATATCAGCGTCATTCGGCAGCATCCACTTCATCATCGGCCAGTTCGGCTTCGCACTATATGGACGCCGATATGAGTGGATCATATCTCGGTATGGGCACTTCAAGTGGCCTGAACGTCACCGATAGCATGGACATTTGCTGTGTGCCCAGCTGTGAGAGCAAGCGTCATAACAATGAGAATATTACATTCCATACGATACCCCGAAGGCCAGAGCAGATGAGAAAATGGTGTCACAATCTCAAGATACCCGAGGACAAAATGCACAAGGGCATGCGAATATGCAGTTTGCATTTTGAGTCCTACTGCATTGGCGGTTGTATGCGTCCCTTTGCGGTGCCCACTTTGCATTTGGGTCACGACGATGAGGATATACATCGTAATCCGGATGTGATCAAGAAGTTGAACATCCGCGAAACCTGCTGTGTGGCGGTGTGTAAGAGGAATCGTGATCGAGATCACGCTAATCTTCATAGATTCCCCAGTAATGTGCCCCTACTGACCAAATGGTGTGCGAATCTGCAGAGACCAGTGCCGGATGGCACTAAATTGTTTAACGACGCCATCTGTGAGGTGCATTTTGAGGATCGTTGTTTGAGAAATAAACGACTGGAGAAATGGGCAGTACCTACTCTAATATTGGGCCATGACGATATAGCTTATCAGTTGCCCACTCCCGAGGAAGTGGCCGAGTTCTTTGCCCGTCCCAGTGCACCCAATAACGGGGAAGAGCAGGGCGAATGCTGTGTGGAGAGTTGTAAACGTAATCCCAGTGTGGATGACATCAAACTTTACCGTCCGCCCGAGGAAGCGGAGGTCTTAAGCAAATGGGCGCACAATCTGCAACTGGAAGTATCCGAGTTGACTAATCTACGGATATGTAACTTACACTTCGAGTCCCATTGCATAGGCAAACGCATGCGACCCTGGGCTATACCCACTTTGAATCTATCGGGTAACATTGAAAACCTCTATGAGAATCCCGAGCATACCATGTTGTATAAGCGACGAACCAGAGATCCCAACAGAGATTTAATTGGTCACAAGCCCACTTGGGTGCCTAGATGCTGTCTGCCCCATTGCCGTAAGGTGAGAGCCTTGCATAATGTGCAACTCTATCGATTCCCCAAGCTAAATCGTTCCACTTTGGCCAAATGGGCGCATAATTTACAAGTGCCGATGGTGGGCAGTGCACAGCGACGTGTCTGTTCGGCACACTTTGAACCGCATGTGTTGAGTAAAAAGTGTCCGGTTCCCATGGCGGTGCCCACTCTGGAGTTGAACTCTCCTCCGGGCTATAAAATCTACCAGAATCCAGCTAAACTGAAAGCCAGCAAACTGTGTCTGCAAAGGGTATGCATAGTGGAGAGCTGTCGAAAGACTCGTGCTCAGGGGGTACATCTATTCCGTCTGCCCCACAGTCCTACCCAGCTAAGGAAATGGATGCATAACATACGCACACGTCCACGTACCGCAATGAGAGCTCAATACCGCATCTGCTCCGTGCACTTCGAAACGCATTCGTTCAATGGCAAAAGATTAAGCGCCGGAGCGATTCCCACTTTGGAATTGGGCCATCAGGATGATGATATCTATCCCAATGAAGCTCAATCCTTTGTGGAGGAACATTGCGCCGTTGAGGGCTGTGACTCGACCAAGGAACAGGCGGAAGTGCGACTGTTCCGTTTTCCTACCGATGATGATGATTTGCTCTGGAAATGGTGTAATAACCTTAAAATGAATCCCGTGGATTGTGTGGGCGTTCGTATATGTAATAAACATTTCGAGACCGATTGTGTGGGCCCCAAACATCTGTTCAAATGGGCCATACCCACCTTGGCCCTGGGACATGATGATTCGCAAATCGAGTTGATCC
CCAATCCAAAGCCCGAAGATCGTTATGCGGAACCTTTGTTCAAATGTTGTGTACCCACTTGCGGCAAGACCCGCAAATTCGATGAGGTACAAATGAATAGTTTTCCCAAGGATCCCACACTGTTTCAACGTTGGCGTCACAATCTCCGACTGGACTATTTGGATTTCAAGGAACGGGAACGTTATAAAATATGCAACGACCATTTTGAGGATATATGCGTGGGCAAGACACGTTTGAATATTGGATCCATTCCTACCTTGGAGTTGGGACATGAGGAAACCGAGGATTTGTATCAAGTAAATCCAGCTGACTTGCAAAGTAATTTATTTGGACGCCAGCGTAAATTGCTAGGAGATAGCAAACAGGAGGAGGAGGCTGAGGAAGTTGAACAGGAGGCTGAAGTCAAACCTAGGGTTAAGGTCATGGAAGAAGCTACCGATTTGAACACCAAGCAAGTGAAGATCAAGAAATCTCTGGCCGAGCTGAAGTGCTGTGTGCGCAGCTGTGGTCGTAGCCGCTTGGAGCACGGTGCTCGCCTGTTTCCCTTTCCCACGGGCAAGCAACAGCAACTGAAATGGCGTCATAATCTTCAGCTGGAACCGCACGAGGTGAATAAAATGACTCGAGTCTGCAGTGCCCATTTCAATAGACGTTGCATAGATGGGAAACATTTGAGAGGATGGGCCATGCCTACCCAGCAATTGGGGCACAAAAAACATCAGAATATCTACGAGAATCCCAAGAATATACCCGGTTTCTTTACGCCCACCTGTGCATTGAGTCACTGTAGGAAACGGCGCAGCATAGACAACGATTTGCGAACCTATCGTTATCCCAGGACAGACGATCTGCTAGAGAAATGGCGTGCCAATCTACGTTTAGCACCAGATCAGTGTAGGGGACGAATCTGTGCAGATCATTTTGAGCCTCAGGTGAGAGGAAAACTTAAATTAAAGACGGGAGCTGTGCCCACATTAAAATTGGGACATGATGAGGGCTTGGTGTATGATAATGAGGCCATCAAGGCGGGTCTAAATGAGGATGAGGAGGTTAGTTTGGATTCGCCCCCAGCACAATCCAAAAAGAAGGGGCAGGAGGAAGAAGATAAAGAAGAAGATGGGGAAGATGCGTCCGCAGAAATGCAGGATCCTGCCGAAGACGCGCAAGACGAAGAGCAGAGCGGCTACTTTGATCCTCTGGAACTGGTAGAAACTTTTGCACATGACCAGTCAAACGCTCCCGATGAGGACTCACCTGCCACACCTCCCCCATCCGCTGCTCCTCAATTGAGGCGTGATAAGCCCGCCAATAATGTCACCCCCATTTGTTGTCTCAAACATTGCCGTAAGGAACGCACTCCAAAATATCTTCTAAGCACTTTTGGCTTCCCCAAAGATCAACAGCTGCTTCTGAAGTGGTGTGCCAATCTACAACTGGAGCTCGACGACTGTATAGGTCGCGTGTGTATGGAACATTTTGAGGCGGAGGTCTTGGGGTCACGTAAACTGAAGCAGAATGCGGTGCCCACTCTGAATGTGGGTCATGATTTTGATTTACCCTACACCTGTAATGGTCAGGAGCGTTCCACAGAAGATCAGTCGGTTTTTCGGCTTTGGAGCCTGAAACATTGCCGTAAAAGGAAGCAAGAGGAACCGCCAGATTCGCCAGTAGGTAAACAGGAGAAGACGGAGGTCAAGGAGGAGCCGAAAGAGAGTAAACCAAAGATGTGTTGCCTCAGTCACTGTGGGAAAATGAAGCGCTTGCAACGCTTACCGCAGGATCAAATGCTGCTCAACAAATGGTTACGCAATCTTAAATTAGTTATGGCCAAACCCACCGATCCTCAGGAACTGCGTTTGTGCTTGGACCATTTCGAACCGCATTTGGTAAGAAATGGCGAGCTCTTACCCATTGCTCTGCCCACTCTCAATCTGGGACACGAAGATCCCAAGATATACCGAAACAATGTGAGCGCCAACGAAAGTGGTTGTCTAGTTGCC
AGTTGTCCTTGTGCACGCCTTAACCTCTATCGCAGCTGTGATCTACCAAGGAATCCGGAAATAAGACAAGCCTGGTTAAGTCATCTGAAGCTCAGTCCAAACACCCATGGACAACTATGTGTCATGCACTTTATGGAGCTCTACGAAGATATCAAAATGGAGCCACATTCTGCTGAACTTAAGTCAGCCTATGAATCCATAGCCAACTCCGCTCAGGCCTTGAAACTACGCTGTGCGGTGCCCGGATGTCGTTCCAAATACACCGATAATATACACTTGACCCGGTTGCCCTATGATCAGCACATCTTCAGCAAATGGTTACACAACACGAAAATAGAGTATGATCCCTCCAGACATTATATCTATCGTATTTGCCTGCTGCACTTTGAGCCCGATGCCTTGGGTGCCGTACGTCCCAAATCGTGGGCAGTTCCCACCCTTAAACTCAATCACCAGGAGGAGATCTACTTAAATTCTACAAAAGAGGAACCTCAGCTGCATACGCCACTTCACATTAAAACCGAACCTTCTGTTCCGCTGCGAGTTAAAACCGAACTCACGGCCTTGACTAGTCCCAGCATAAGTTCCACGCCCAGTCCTCGTGGCAAAATCCGCATCTGCTGTATACCCACATGCCTGCAGCAGTCCAGCTCGCAAATGCGTCTCTATCGGTTTCCTACCGCTGAATCCGCTCTCCTCAAGTGGCTGGTCAATACTCAACAGCAACCACGTCTGGTGGATGCCCTTCAGCTGTATGTGTGTCAAACTCATTTCGAACCCGAAGCCATTGTCAAAAAGCAATTGCGAAGTTGGGCTGTGCCCACCTTGAACTTGGGACACGATGGTCACATCATTCCCAATGCCAAGCATAATGGTAATATAGCAGACAGCCAGGAAACGGAACAGGCCATGGAGTTTATCCGAGAGAACTATTGTTCGGTACTCAGCTGCTTTCAGCCACGTAGTGAGGAGTTGCGGCTTTACGAGTATCCCAAGGATATGCCCACAATAAGAAAATGGGCGGCTAACTGTAAACATCGCTCTATGCAGGCCAGCAGTCATGGATTTCAGGTATGTCAATTGCATTTTGGCCCCGAATGCTTCGACCCAGAAACTGGCGATTTGCGTGAGGGTTCGGTGCCCTCTCTGGATATAAACAAAAGCCATACCGAATTGCATTGCCTAGTATCAGGTTGTGTTAAAGATGACTCTACACCACGTCGTCGTTACTACAAGCTACCCAAGCGCGCCGCACAATTGGCGGAATGGTGTCACAATCTCTGCATGGAACCGTCTGAGCTGTTGCCAACTGATCAGCACATTTGTGAGCGTCACTTTGAGCCAAGTTGTCTTAATAACTACAAACAGCTGCGGCCTGGAGCACGACCAACCCTGCATTTGGGTCACGATCAGCAAGTAGCATTGCTGAACAATCCGGAGACATTTAGTGAACCGCCGGAAGAGGAAGTCGTCTGTTGTGTTCCTCAATGTGAACGCATCCAACAGCCGGATGAGGGGCCTTTTAGTTACTTTCCCAAAATTAAACTATTGGCCGACAAATGGCTCCACAATTTCCGCCTGGAATCGAGCAAAGAGCATCTGGCCAAGCTACGAGTATGCCTGAGACATTTTGAGGCATCGTGCTTTGAGAATGAGCGTGTGCGTTTGGGCGCCATGCCAACCTTGGAGTTGGGTCACGACGATCGCGATGTTTACCACACAGATGAACAGGCTCTGGGTAAGGGTCGAAAACATCATAAGAAGAACAGTGGCCTGGGTATGGATTGTTGTTATGCTCAGTGTGCGGAATTGCAAAAGAATTACCAAAGAGTGGTCTATGATTTGCCCATGGATGAGCAGCTACGTAGTGCTTGGCTGCAGCATCTCGACGTACAGGAGGAGGAGGAGCAGCCACCACTGAAGCTGTGTCCACTGCATTTTATTATACTCTACGAACATAGCCAGGAACACTTTCCAGACCATGCACCGGAAGAGTTACT
TGATCATAACTATGAAGATGCTAGGAATAGTGTCAGAATACGTTTGATTAGTTGTGCTGTACGAGATTGCAAAACCATTCAGCCTCGGGATGGCGGGCGTTTACATGGTCTACCTCCGCGAAGAGACGTATCGCAAATGTGGCTGGAGAATACCCAGATGGAGTACTTTGAGCCACAACGATACATGCTTAAGGTGTGCAGCAAACACTTTGAACCGCATTGCTTCACGGACAGTCGAAAATTGAAGTACTGGAGTATGCCGACGCTTCATCTCCCAGGAGAGGCGGTACATAAAAATCTCACCGAAGATGAATGGCAAAAAATGAATGACAAACCCGCGGAGACAAGCAAAGAGAAGAAGGTAGAGGAAGAAGATTATGAGGAGAACTCTTTGCTGGAGCCTATTGTCATAATGGAGCATGTGGAATCCGAGAAAGAGGAGGAGGAGCAGCAGGAGAAGGCTGCGGAAGTAGAAGATGAGCAACCTTCACCCGAGCTGGTGATCGATGAGCGCTCTCAGGACACTGAAGAACGCCAGGAAGAAGAAATGGAAGCACTGGAAGTGCTGCTAGAGGTGGGTCATGTGGAGAAATGTGCCAGCTATGAGAAAATGGATCAGAAATCCCATTTGCCCTACTCCCAGGAGGCCTCTGCCACACCCTCACACTCCGCCGGTTCTGCGCTACGAAATCAATACAATGCCAGGCACTGCAGTGTCCAGGGTTGTCAGACCAGCTCCCATGACATAGATGGACAAATCAAACTACACAAGTTTCCCACCTCAACGGATGCTACCAGAAAATGGATGCACAACACCCAAGTGGATGTGGATGAGAGTGTTTCGTGGCGTTATCGCATTTGCAGCTACCACTTCGAGCAGGACTGCTTTAACGGGGCACGGATTCGCAAAGGTTCAATGCCCACATTACATTTGGGTCGATCCAGACCAGAAGAGATCTATGAAAATGAATTCGCCCAAATGGAAATGGAAGAATCTTTGGTGGTGGAACCACCACCCGCTGCCACAACAAGCCGCGAAAACACCTCATCCAAGGGTGAGGTGACGCAATTGTTTTTGCCCCGCCCAGCTCCACCACGCAAATCGAGCAAATTCTGTCAGATTGAAGGCTGTCCCAACCATCTGACCAGTGAGAACATGACACTGCATAAGTTTCCGCACTCGGTGGATATATGTGAACGCTGGCAACATAATACCCAAGTGCCATTTGATCCTGAGTATCGTTGGCGTTATCGCATATGCAGTGCCCACTTCAAGTCTGTGTGTTTGGTCAATATGAGACTGCTTCATGGTAGTGTGCCTACCCTAAAACTGGGTCCACGAGCCCCTGCCCAACTGTTTGATAATGATTTCGATGCCATGCACAGCAGAATTGATAAGAAGCCGACACAATCAGTTCCAAAGCACGAAAAATCCCTGCAATATCACCAATATCAGCCGGAGAATGAGATGTACTTTGAGCCTGAATGGGACGAGGAAGAGGAGGCAGATGATCTGGAACAGGATGAAGATGAACCGCATACCTCTTTTTCCACCCCCTTTGACAGCTGGAGGCATCAACTGAGACTACCTACCGTTAAGCAGGAAAAGGCTCCCTACAATTCCATCAAATCGGGTTATGACAAATGCTCTTTGGCGCACTGCCAACGACAAAGATCTCTGCATGGTGTGCATATCTTCAAATTCCCACGCTCTAGACGCCATCAGCAGCGCTGGATGCACAATCTACGCATACGCTATGATGAGCGGCGTCCCTGGAAATATATGATCTGCAGTGTGCACTTTGAACCGCATTGTGTACGCCTAAGGAAACTGCGACCATGGGCAGTGCCTACCCTGGAGTTGGGTGACAATGTGCCCGATCAAATTTATACCAATGAGCAGTGTCAGGAAATGGCCACCGATCTGAGCGATGCCGAAACTGATGAGGAGGATGGCCTTCAGGAGGATGATGACGACGAGGAGGAGGAGGTGGACTTGG
GACCTTATATCAAAAAAGAAAAACGCAGTCGCTTAGATCCTTATTCAACTGGTCAATTAACAGCCTGGAAAGCTAAACAATGTTGTTTACCTTATTGCCGCCGTCCACGGGGTGACGGCATCCGGCTGTTCCGGCTGCCCAGCAATTGCAACTCTATTAAACATTGGGAACAGGCCACAGGCATGGTGTTCAAGGAGTCACAGAGAAATACCCGCCTGATTTGCAGTCGTCACTTTGAGCCGGAACTGATGGGAGTGCGGCGTCTCATGAGAAATGCCATCCCCACCAGGAATTTGGGCCCGCAGAGTGAAAGTTTTAGCAATGAGATTGAGATGGTGGTGCCGACGCCCACGGGACCCATGTGCTGCATGGCTGATTGCACAAACGAGGGAAACACGAAACTGCATCAGTTTCCCAGAGATCCCACTCTGCTGCAACAATGGTGTCAATCTCTAAGACTGTCGGACTTCCGGAGCTATCGGGGCAAACATATTTGCTCCGCTCATTTGCCCGCTCAGGCCCTAACTTGCCTCATCTGTGGAGTCGACGACGTTCATCTACCTTTGCTCGATTTTCCCGAGAATCGCAACCATCGCGCCAAATGGTGTTATAATCTCAAAATCGAAACCATACCAAAATGGGATACATCCAAGCACATTTGCTGCAAGCATTTCGAATCATACTGCTTTGTCCAACCGGGCGAACTATTACCCGATGCCGCACCAACATTGCATTTGAATCACAATGATACGAATATATTCCACAATGATTATACCATAGATGATCAGCAGCACTCCATGATCCGCATTAAGGATGAACCCCTGGACAATGACGACTTCTTGTTAGTGTAG
Protein Sequence
MGSHHGIYPYIKSEPVEYANPMAPPPAPSEMRIKSEPTDEHAYKSNFIDDNTPFADFSKFNEFSEDMLSPKVELTVKDDSYGKNPNNYPRRKMQTERAEESLPICQRCKEVFFKKQSYLRHVAESSCGIQEYDFKCNICPMSFMSTEELQAHKQLHRADKFFCHKYCGKHFDTIAECESHEYMQHEYDSFVCNMCSGTFPSREQLYNHLPQHKFQQRYDCPICRLWYQTAGELHEHRLAAPYFCGKYYTGAQPAAATQSHQSQSQQQSHQTNYKLQDCHMATMEMPTPPAAATTSHHKSSPSTSALPATAALSSLLQQRQANADGASMFSSTHKNDVNIKLERSYSNSTSESSFSMHESNYNNTYASDNSLHGGALGGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLHQWLHNLKMFHIPASSYTTFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEIARCSMPHCSSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPGVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPTPSSTPHSNHHQQQQLHHQQQHSHHSAQQHHSKAYQRHSAASTSSSASSASHYMDADMSGSYLGMGTSSGLNVTDSMDICCVPSCESKRHNNENITFHTIPRRPEQMRKWCHNLKIPEDKMHKGMRICSLHFESYCIGGCMRPFAVPTLHLGHDDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVPLLTKWCANLQRPVPDGTKLFNDAICEVHFEDRCLRNKRLEKWAVPTLILGHDDIAYQLPTPEEVAEFFARPSAPNNGEEQGECCVESCKRNPSVDDIKLYRPPEEAEVLSKWAHNLQLEVSELTNLRICNLHFESHCIGKRMRPWAIPTLNLSGNIENLYENPEHTMLYKRRTRDPNRDLIGHKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPMVGSAQRRVCSAHFEPHVLSKKCPVPMAVPTLELNSPPGYKIYQNPAKLKASKLCLQRVCIVESCRKTRAQGVHLFRLPHSPTQLRKWMHNIRTRPRTAMRAQYRICSVHFETHSFNGKRLSAGAIPTLELGHQDDDIYPNEAQSFVEEHCAVEGCDSTKEQAEVRLFRFPTDDDDLLWKWCNNLKMNPVDCVGVRICNKHFETDCVGPKHLFKWAIPTLALGHDDSQIELIPNPKPEDRYAEPLFKCCVPTCGKTRKFDEVQMNSFPKDPTLFQRWRHNLRLDYLDFKERERYKICNDHFEDICVGKTRLNIGSIPTLELGHEETEDLYQVNPADLQSNLFGRQRKLLGDSKQEEEAEEVEQEAEVKPRVKVMEEATDLNTKQVKIKKSLAELKCCVRSCGRSRLEHGARLFPFPTGKQQQLKWRHNLQLEPHEVNKMTRVCSAHFNRRCIDGKHLRGWAMPTQQLGHKKHQNIYENPKNIPGFFTPTCALSHCRKRRSIDNDLRTYRYPRTDDLLEKWRANLRLAPDQCRGRICADHFEPQVRGKLKLKTGAVPTLKLGHDEGLVYDNEAIKAGLNEDEEVSLDSPPAQSKKKGQEEEDKEEDGEDASAEMQDPAEDAQDEEQSGYFDPLELVETFAHDQSNAPDEDSPATPPPSAAPQLRRDKPANNVTPICCLKHCRKERTPKYLLSTFGFPKDQQLLLKWCANLQLELDDCIGRVCMEHFEAEVLGSRKLKQNAVPTLNVGHDFDLPYTCNGQERSTEDQSVFRLWSLKHCRKRKQEEPPDSPVGKQEKTEVKEEPKESKPKMCCLSHCGKMKRLQRLPQDQMLLNKWLRNLKLVMAKPTDPQELRLCLDHFEPHLVRNGELLPIALPTLNLGHEDPKIYRNNVSANESGCLVA
SCPCARLNLYRSCDLPRNPEIRQAWLSHLKLSPNTHGQLCVMHFMELYEDIKMEPHSAELKSAYESIANSAQALKLRCAVPGCRSKYTDNIHLTRLPYDQHIFSKWLHNTKIEYDPSRHYIYRICLLHFEPDALGAVRPKSWAVPTLKLNHQEEIYLNSTKEEPQLHTPLHIKTEPSVPLRVKTELTALTSPSISSTPSPRGKIRICCIPTCLQQSSSQMRLYRFPTAESALLKWLVNTQQQPRLVDALQLYVCQTHFEPEAIVKKQLRSWAVPTLNLGHDGHIIPNAKHNGNIADSQETEQAMEFIRENYCSVLSCFQPRSEELRLYEYPKDMPTIRKWAANCKHRSMQASSHGFQVCQLHFGPECFDPETGDLREGSVPSLDINKSHTELHCLVSGCVKDDSTPRRRYYKLPKRAAQLAEWCHNLCMEPSELLPTDQHICERHFEPSCLNNYKQLRPGARPTLHLGHDQQVALLNNPETFSEPPEEEVVCCVPQCERIQQPDEGPFSYFPKIKLLADKWLHNFRLESSKEHLAKLRVCLRHFEASCFENERVRLGAMPTLELGHDDRDVYHTDEQALGKGRKHHKKNSGLGMDCCYAQCAELQKNYQRVVYDLPMDEQLRSAWLQHLDVQEEEEQPPLKLCPLHFIILYEHSQEHFPDHAPEELLDHNYEDARNSVRIRLISCAVRDCKTIQPRDGGRLHGLPPRRDVSQMWLENTQMEYFEPQRYMLKVCSKHFEPHCFTDSRKLKYWSMPTLHLPGEAVHKNLTEDEWQKMNDKPAETSKEKKVEEEDYEENSLLEPIVIMEHVESEKEEEEQQEKAAEVEDEQPSPELVIDERSQDTEERQEEEMEALEVLLEVGHVEKCASYEKMDQKSHLPYSQEASATPSHSAGSALRNQYNARHCSVQGCQTSSHDIDGQIKLHKFPTSTDATRKWMHNTQVDVDESVSWRYRICSYHFEQDCFNGARIRKGSMPTLHLGRSRPEEIYENEFAQMEMEESLVVEPPPAATTSRENTSSKGEVTQLFLPRPAPPRKSSKFCQIEGCPNHLTSENMTLHKFPHSVDICERWQHNTQVPFDPEYRWRYRICSAHFKSVCLVNMRLLHGSVPTLKLGPRAPAQLFDNDFDAMHSRIDKKPTQSVPKHEKSLQYHQYQPENEMYFEPEWDEEEEADDLEQDEDEPHTSFSTPFDSWRHQLRLPTVKQEKAPYNSIKSGYDKCSLAHCQRQRSLHGVHIFKFPRSRRHQQRWMHNLRIRYDERRPWKYMICSVHFEPHCVRLRKLRPWAVPTLELGDNVPDQIYTNEQCQEMATDLSDAETDEEDGLQEDDDDEEEEVDLGPYIKKEKRSRLDPYSTGQLTAWKAKQCCLPYCRRPRGDGIRLFRLPSNCNSIKHWEQATGMVFKESQRNTRLICSRHFEPELMGVRRLMRNAIPTRNLGPQSESFSNEIEMVVPTPTGPMCCMADCTNEGNTKLHQFPRDPTLLQQWCQSLRLSDFRSYRGKHICSAHLPAQALTCLICGVDDVHLPLLDFPENRNHRAKWCYNLKIETIPKWDTSKHICCKHFESYCFVQPGELLPDAAPTLHLNHNDTNIFHNDYTIDDQQHSMIRIKDEPLDNDDFLLV
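A quick consistency check between the coding sequence and the protein sequence is to translate the former and compare. The sketch below assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly; the function name translate is a choice made for this example. Whitespace is removed first, so a sequence wrapped over several lines, as in this record, can be passed in directly.

```python
from itertools import product

# Standard genetic code (assumed), in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons and last four codons of the coding sequence in this record.
print(translate("ATGGGAAGCCACCATGGC"))
print(translate("TTGTTAGTGTAG"))
```

Codons containing ambiguity codes such as N are not handled and would raise a KeyError; that is left as is to keep the sketch short.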

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00914540;
90% Identity
iTF_00918172;
80% Identity
-