Basic Information

Gene Symbol
-
Assembly
GCA_017309665.1
Location
CM029565.1:37088929-37104335[-]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 42 4.9e-16 1e-12 48.8 1.1 1 86 351 423 351 424 0.82
2 42 2.9e-15 6.1e-12 46.4 2.2 1 87 447 517 447 517 0.81
3 42 4.9e-15 1e-11 45.6 0.5 1 87 538 610 538 610 0.85
4 42 1.9e-12 4e-09 37.4 1.2 1 87 657 726 657 726 0.83
5 42 1.3e-16 2.8e-13 50.7 3.5 1 86 750 822 750 823 0.83
6 42 5.7e-15 1.2e-11 45.4 0.8 1 87 858 926 858 926 0.81
7 42 1.7e-11 3.6e-08 34.3 3.5 1 87 956 1028 956 1028 0.78
8 42 1.1e-13 2.3e-10 41.3 1.7 1 86 1055 1124 1055 1125 0.81
9 42 8.8e-12 1.9e-08 35.2 2.9 1 86 1148 1217 1148 1218 0.79
10 42 5.4e-15 1.1e-11 45.5 0.6 1 86 1246 1316 1246 1317 0.83
11 42 5.9e-08 0.00013 22.9 0.1 1 59 1368 1422 1368 1441 0.80
12 42 4.6e-14 9.7e-11 42.5 2.7 1 87 1467 1538 1467 1538 0.82
13 42 2.7e-15 5.7e-12 46.5 1.3 1 85 1573 1642 1573 1644 0.78
14 42 3.9e-14 8.4e-11 42.7 2.2 1 86 1677 1746 1677 1747 0.83
15 42 8.1e-17 1.7e-13 51.3 0.2 1 86 1784 1853 1784 1854 0.81
16 42 5e-13 1.1e-09 39.2 5.3 1 87 1960 2029 1960 2029 0.73
17 42 2.3e-14 4.9e-11 43.5 0.2 1 87 2047 2115 2047 2115 0.86
18 42 1.9e-14 3.9e-11 43.8 0.2 1 86 2138 2206 2138 2207 0.79
19 42 1.3e-13 2.7e-10 41.1 1.9 1 87 2232 2303 2232 2303 0.80
20 42 2.4e-05 0.051 14.6 0.1 1 60 2330 2382 2330 2413 0.78
21 42 1.6e-14 3.5e-11 44.0 3.4 1 86 2428 2498 2428 2499 0.83
22 42 3.8e-15 8e-12 46.0 1.5 1 87 2517 2592 2517 2592 0.79
23 42 1.6e-15 3.4e-12 47.2 3.1 1 86 2622 2692 2622 2693 0.81
24 42 3.9e-15 8.3e-12 45.9 4.4 1 86 2717 2785 2717 2786 0.81
25 42 8.2e-16 1.8e-12 48.1 1.1 1 86 2812 2885 2812 2886 0.86
26 42 5.9e-14 1.3e-10 42.2 1.4 1 87 2907 2979 2907 2979 0.82
27 42 5e-12 1.1e-08 36.0 4.7 1 86 3003 3066 3003 3067 0.78
28 42 1.5e-13 3.2e-10 40.9 3.5 1 85 3093 3161 3093 3163 0.82
29 42 2.6e-12 5.5e-09 36.9 0.2 1 87 3197 3268 3197 3268 0.77
30 42 1e-15 2.2e-12 47.8 0.1 1 86 3298 3368 3298 3369 0.79
31 42 2.1e-16 4.4e-13 50.0 0.3 1 86 3393 3466 3393 3467 0.80
32 42 1.6e-08 3.5e-05 24.7 0.2 1 80 3504 3571 3504 3577 0.77
33 42 3.8e-16 8.2e-13 49.2 0.6 1 87 3605 3678 3605 3678 0.79
34 42 2.2e-15 4.8e-12 46.7 1.4 1 87 3710 3781 3710 3781 0.83
35 42 1.9e-15 4e-12 47.0 1.9 1 87 4001 4073 4001 4073 0.82
36 42 2.1e-15 4.4e-12 46.8 0.4 1 86 4094 4165 4094 4166 0.82
37 42 4.7e-12 1e-08 36.1 0.5 1 86 4237 4309 4237 4310 0.79
38 42 6.5e-14 1.4e-10 42.0 0.8 1 87 4391 4461 4391 4461 0.85
39 42 7.8e-12 1.7e-08 35.4 0.8 1 86 4524 4594 4524 4595 0.83
40 42 1.7e-13 3.5e-10 40.7 0.2 1 87 4615 4687 4615 4687 0.83
41 42 3.8e-12 8.1e-09 36.4 0.4 1 86 4708 4777 4708 4781 0.76
42 42 7e-14 1.5e-10 41.9 4.7 1 86 4802 4873 4802 4874 0.84

Sequence Information

Coding Sequence
ATGGATGAGTTAATCATAAAAACTGAACCTCCTGACGACTCGGGTTTTGTTAATTTAGTCGATAATGAAAACTCCTTTAAGATCGATCCTTTCTTTCAATCTAATGGACCCTTTCCACATATTAAAGAGGAACCTTCAACATCACAACAAAAATCCCTTAATTTCCCACGACGCAAAGTTCAGACAGAACGAGCGGAAACGTTGCCAATTTGTCAACGTTGCAAACAAGTCTTCCTGAAGAAATCCAGTTACATTAAACATGTGCGATATAGTTCCTGTTCCATCTTGGAATATAACTTCAAATGCCTTATATGTCCCATGTCTTTTATGTCGAATGAAGAGCTTGAGGCACACGAAAACGTACATCGTGCAAATCAGTTCTTTTGTCAAAAGTATTGCGGAAAATATTTTGATACATTGGATTTGTGTGAAGTTCATGAGTACACTTTTCATGATTTTAATACGTTTGTTTGTAATGTTTGTAATTCGACGTTTAAAGCTCGAGATATGTTCTTTGCCCATAAGACCCAACATCGCCATTTGATACGTTTTGATTGTCCCATTTGTCGACGTTGGTTCACATCGCCAAATGAGCTGCGTAGACACCGTTTAGAATCTCCATTCTATTGTGGAAAATTCTATGGAGACACAACGGTTCAAAATAACGATTCTTCTTTCAATAATGGAAACCAAGCTAATTATAGTCTGCAAGATTGCAATATCGGTGTTATGGAAGCTATACCGAATAGTTCGAATTCGCAGACGATTCATAACCAAATCGGAATGAATGATAGAAGCAGCTTTCTACCACCATGTCAAATCAAATGTGAACCTGTTTTTGATAATCGCTATTGTGCTGCACAAACCGCAGCACCCGAGCCATTTTTCAACAATTCTTTTCCAACTCCCGACTTCAGCACATTTAATAATGCATCCTTCAATGATTTCAATAATCAGTATCCGAATTTCGAACGACGACATTCCAACAATTTCAACTTTTCAACGCCTTTCACTCCAGCCACACCGGCTGTCAATGAAGGCGACGCCATTTGCTGTGTGCCACAATGTGGACTCCAGAAGAGCACTAGTCCCAATCTGCAATTCTTTAAATTTCCGAAAGACGAGAAATATTTGAAACAATGGTTACACAATATGAAATTGATATACAATGCAGGAGAATCGTACGAGGACTATCGCATTTGTAGTTTGCATTTCCCGAAACGTGTCATCAATTTACATTCTCTTTGCTACTGGGCAGTACCAACCTATCAACTGGGCCATGATGATACGGCCAACATTTTTCATAATCGCGAGCAAAACTACACAGTGAATCCACAATGTAGCGTACGTGGTTGTGAGAGTCGACGTGGACAAACGAATTTGAAATTCTACAACTTTCCACCCAGCGATTCGGAAACCTATGTGAAATGGTGTCAAAACACACGCCTCCCACTGAATTCCACGGAATCGCGTCAATTGTGCAGTCGTCACTTTGAAGATCAGTGCTTTGGTAAAATTCGCCTTAAGGTCTGGGCAGTGCCGACGCTTCACTTGGGCGGATTTCCACGGATCCACGACAACCCTAATGTGCCAATGTTGGAAGAGAAGAAATGTTGTTTACCCCATTGTCGCCGACGACGTTCCACCGATCCCAGTCTCTCATTATACAGCTTTCCCAGAGATGAACAGATGCTGCAGCGTTGGTGTTTCAATTTCAAACTCGATCCCATGGAATATCGTGGAAAGTTTCATAAAGTTTGCAGTTCACACTTTGTTAGAGATGTTATCGGATCACTAAAGTTATTACCAGGTGCCGTTCCAACTTTAAACTTGGGTCATAACGATACGAATATCTACAACAACGATATACTGACAATGCCCACAACACCGCCGAAATACACAAAACCATCACAATCCCCCAGCTTATCTTCACCGTCGACAGCGGTCAGTGGCGAAACACCTGATACTTGCAGTTTGCCCACATGCAAACGCAATCGTTATGCAGACAATGTTACGATGCACACGATGCCAGGACGACCGGAAATGTTGGAGAAATGGTGTCACAACCTCAAGCTGAATCCATTGAAGATTCCAAAGTATTTTCGCATTTGTAGTATACATTTTGAACCGTATTGCATTGGTGGCTGCATGCGTCCATTCGCTGTGCCCACCTTGCATTTGGGTCACGACGATCCGAACATCCATGAGAATCCCAAAGTGATTAAGAAATTGAATATACGCGAAACTTGCTGTGTGCCCACATGTCGACGTAATCGTGAACGAGACAAGGCCAACCTGCATCGATTCCCTCTTCATAATCCGGAGATTTTACAGAAATGGTTGAAAAATTTACGCCGTCCAATGCCAGATGGTACGCGTTTGTTCAACGATGCCATTTGTGAGAAACATTTCGAGGATCGCTGCATCCGAAGTAAACGATTGGAGAAATGGGCTTTGCCTACACTGCAATTAGGACACGAGGAGGAGATCATTCCACTGCCCAGTCCGACAGAGATCGATGAACTCTATGGCAAGCCTTCGACTTCAAATCCAGCCGATGCTGATGTCGAGTGTTGTGTGGAGTCGTGTAAACGTGGCCCGAAATCTGATGATGTCAAACTCTATTCGATACCGAAAGACGAAGTTCTTTTGGAAAAATGGTCACACAATTTACAACAAGATCTAAAGGAGACACCAAATGCACGCATATGCAATCTTCACTTCGAAGCGTATTGCATTGGAAAGCGTTTGCATTCGTGGGCAATTCCCACATTGAATTTACCAGAAAACAGCGAGGATCTTCATGAAAATCCCAAGGAAATGAAATCACTGAAATCTGTGGAACTCAAGGGTTACATTCAGCGTTGTTGTCTCTCGCATTGTCGCAAGACGCGCATCGACGGCGAGGTTCAACTCTTCCGCTTCCCCAGCTACAGCAAGAGTATGTTGAAGAAGTGGTGCCACAATCTACAAATATCAATACCCTTCGGCACTGCTGCCCATCGTCGACTCTGCTCTACACACTTCGAACCCCATGTGCTGACCAAACGTTGTCCCATGCCCAATGCGTTGCCTACATTAAATCTCAATTGTCCCGAGGGCTACAAAGTCTATCAGAATCCAGAGCGTCTCAAATATGCAAAACTACGAACCGAGAAACGTTGTGTTATCCCGAGTTGTCGCAAGCGTAAAGAAGATGGGGTCCAACTATTCCTCTATCCACACAATCGTGCCCTTCTGCGAAAGTGGGCCCACAATACGAAACAGAAAACAGGCGATTGCACTCGTGCCCAGCTGCGCGTTTGCAGCGAACACTTTGAGGAGAATTGTTTCGGCCCGGTGCGATTGTGTCCTGGTGCCATACCGACCCTCAAACTCGGCCACGACGATGAAGATATTTTCCCCAATATTCTACCGCCGCTGCAAGAGGAAGAGCTGCGTTGTTCGATCAATGGTTGCGGTCGTTCCAGAGCTGTCGATGGTGTTAAGCTCTTCAAGTTCCCATGTGACGATGAAGATCTACTGTGGAAGTGGTGTAATAATTTGAAACTGAATCCAATCGATTGTCGTAAGACACGCATTTGTAATTTGCACTTCGAATCGGAGTGTGTGGGTCCGAAGTGTCTCTTCCGATGGGCCATACCGACCCTGATGTTGGGCCACAATGATACGGAACTTCAGCTGCACGCGAATCCCAAACCCGAGGATCGACCCACAACGGAGATAATCTACAAGTGTTGTGTGCAAAACTGCGGTAAAACACGTAAATACGACGAGGCCCAATTGAATAGTTTCCCCAAGACACCGGAACTCTTCGAGCGTTGGGCGCATAATTTGCAAATGGATTTGGATTTCGACGATCGGGAAAAATACAAAATCTGCAACGATCACTTCGAGGAGATTTGCATGGGTAAAGTGCGATTGCATTATGGAGCGATACCAACGTTGAAGCTGGGCCACGACAGCACCGACATCTACGAAGTCGATCTCACCGAGATGAACACCTCGAATTCCAATAAACGTCAACGTCTGGCCACGCTGAATGAGAAAAACTACTATGAGGAGTGTGATTTTTTAGATGAACCTCAGGCGGAAGCCAAATGTTGTTTTCCCGATTGTACAGCATCCAAGTCGATGCAGCGTGACTCCTTCGATATGCCCAAAACGGAGGAGCTCCTCGAGTTGTGGTGTGAAGTGCTGAGGGTGGATAAAGACAACCTACTAGCCGAGACTGAGAATCCCAAACTATGTGGCATTCACTTCATTGACGTATACGAGGCGACGACGGATGAGGCCACAACTTTGTTGGACTCACAGCCGGAATTGGCCGATGAACTGATGACACTTGCCGAATCCTACGAACACTGCACAACATCCTCGTTCATAAGACGCGTTCAATGCAGCATTCCCGAGTGCAAGACGAATTCGTTGAGCGGCGTGCGACTCTTTGCGTTCCCCTCGAATCGAGAGCTGCTCGAGAAATGGTGCTTCAACACGGGCGTCCAGTTAAACGAGAATCCGCGCCAGTCGTTCAAAGTTTGTGCCCTTCACTTCGAGCCCCATTGTCTGACTGAAGCACAGCGTCTGAAACATTGGGCGATTCCAACTCTCCACTTGAACGACGACGACGACGAGGAGGAGGAGAAGAAACTCTTTCAGAACCCAGCCGGAGAGGCGGCCGAGAGAAGTTTGATGACCACGGTGCAGGGCAGCAAATGTTGTGTGCGAAGCTGTGAACGTGATCGAGCTTTGGATGATACCGTCAGATTATTCAATTTCCCCAACGATGAAGAGTTGCTCGATAAATGGGTTCACAATCTGGAGGTGGATCCAGAAAATACTCCAGTGTTGAAAATTTGCAATGCGCACTTTGAAAGACGATGCATAACGAAGTCGGGACGCCTTCAACCGTGGGCCATACCGACACAACAACTGGACATCAGTGGCGATATCGATCGTTTTCCCAATCCAGGAGAATCTCTACCCGATGGTCCGACAAATGTGAAGAAGAACCTCGAGTCGATGAAGTGCATCGTTCCATCGTGTCTCAACAATCGTCTTCAGCATGGACTGCGTTTCTTTGGAATTCCCTCCGACATGAACATTCGCAAGAAATGGTGTATTCATTTGAAGATTCCACCCGCGTCCATTCAAAAGGATCAACACATATGTAGTGATCATTTCGAAAAGCGTTGCATCGACAATGCGAGTCTTCATTCATGGGCGGTGCCCACGTTGAAATTATCACCATCACCATCAGCAAAACCAACACCATCAACACAATCAACTGTAGCGAGTTCAAAAAACTCCCTTAGTAAGGGGCCAGGCTTTGGTCCCTATTTACCGATCTGCTGTGTGCCGAACTGTGGCCGTCGTCGCACCTCGAAAAATGGTCTTCGAACATTTGGTTTTCCCAAGGACACCCAACTGTTGGAAATGTGGTGTGCAAATATCAAGCTACATCCTGAAGCTGTACAGGGCTATCGTGTGTGTGTTAATCATTTTCCCGAGAAGGTGCGTGGTAAAACGAAATTGAAGCTGGGCGCGGTGCCAACATTGGAATTGGGCCACAATGAACCAGTCATGTATGGCCTAAAAGAAGTAGCACAGAAGCCAGCCATCGTTCAGGCGAACAACGGACCAGTTAACAACAAGAATCCAGTCAACAACAAGATTCCAGTCAACAACAAGGGTTCTGTGAACAGAAAGAGTCCTGTGAATAACAAGATTCCAATCAACAACAAGATTCCAATCAACAACAAGACTCCAGTCAAGAAAAACACCTTAGTAGCCGTTTCCAACTCGGCAGAAGACTCAAAACCCATGCCCAAACAACGAACTTCCACCGAAGCCACGGTAATTAAACCCATCATTCGTTGCTGCATCACAGGTTGTCGTAAAACCACTCGAAAGCCGGACGTTCGTTTCTTCACGTTCCCGAAACGCGGTCATTTATTGTTACGGAAATGGTGCCACAATACGAAACAACCACTGAAAAACACCCGACGTTTCATCTGCGCCGATCACTTTGAAAAGCATACAATAGTCAATGGTAAGCCACGCCCATGGGCGGTGCCAACCCTGAACTTAGGCCACACCGATCAAATCTACGAGAATCCCCCGAATATGGTGGATAAATGTTGTGTTCCCGGCTGTACAGCGGGTGAAAATGTTCAACTCTTCAGTTTTCCCAAACGTAGTTCGATGCTACGCAAATGGGCGCATAACATTAAACTTGATGCTTATCTGGTCATTCGACATAACTACAAAATTTGCGCTGTTCACTTTGAGCCCAACTGCACCGGGCCGGACGGTTTGAATTTCGGTGCCATACCAACTCTCCATTTGGGTCATCAGGATGACGAACACAATTATCCCAATGAAGCTGATGCAGATGGTCACAAGTTCCACTGTGATATCCCAGGTTGTGGACGCACTCGTGCTAGCGATGGAATCACCCTCTTCACATTCCCCAACGATGAGGAGCTCTTGAAGAAATGGTGTCACAATCTGAAACGCCAACCAGAGGAATGCAAAGACTTTCGTATTTGCAATTTACACTTTGAGCCCGGTTGGGTGCGAAAGTATAAATTGCTCGATGCAGCCATCCCAACCATGCTTCTCGGACACAATGACAATGATCTGCAATTGATTACCGCCGAACTGCAACCTCATGTTGGCGTTCTCATTCGTTGCTGTGTGCCACATTGTGGTAAATACAAGCCCAAAGACGATGTTCACATGATCAGCTTCCCCAAAGGTCGTGCTCTCTTTATCAAATGGACGCATAATCTAAAACTCAAGGGTTCACAAAGACTGATCAACAGCTTTAAAGTGTGCAGCGATCATTTCGAGGACTATTGCAAGGTGGCCCGATGTGTGAAGTACGGAGCGATTCCGACTCTCAACTTGGGACACGACGATCCGAATATCTATCAAAATAACAAGGAACTCCTCACGGGAAAGAAGACAAACAATGGAAGGAATTGTGTCATATCAGAATGCCAAACCCACAAAATGTTGCATATGCAGCCCTTCGACATGCCAACAAATGAACAAATGCTGAGGTCATGGTGTCAACACTTCGGAATTGATCCATCGCAAATTTCAAGCTCACCACGATTGTGTGCCCTTCACTATATGGACGTCTACGAGAAGGTGCAACAGCCTTCATCCATCATCTCTGATGATGGTCCATCAAACAACGACAATGACGAGATTGAAGACTTGAAGAATACATATACGACTATAAAGAATAGTCAGAGGATAAGAGTTTTTCTGTGCTGTGTTTCCGGCTGTAATACAAAGTATTTGAATGGATCTCCACAACTTATAAAATTCCCTCGCGATACTGAACTTTGCGAGAAATGGTGTGACAATACCAAAGTTCAAGTGGATCAGAAACGTCGTTATCTCTATAAAGTTTGTGCTCTGCACTTCGAGGAACGTTGCTTTATAAATGCTCGTTTGCGAAATTGGGCTTTGCCCACATTACGATTGAACCACAACGATGAAATTCATTCCAATCCCGAGGAAGAACAAACGGTTTGTTGTGTGAGGAATTGTTCCAGTTCAAAGGAGCCGCAGGCGAACGATGGTACTGAAATGGTTAACAAATTGTGGACATTTCCTGACGATGAAACCCTTTTGCAGAAATGGTGTCATAATTTAGAAGTCAAGATCGCCGAGAGTAAAGGTCTCAAAGTCTGCAGTGAACATTTTGAGAGCTATTGCTTTGGCAAAGGATTGCGTCCGTGGGCGATACCAACATTGAATTTGAAACGAACAACCAAAACGAAACTTTTTAAAAATCCCGAAATAAAAGATATTACCGTTAGCTACAGTTCGGTCCACTCGATTATGTGTTGTGTTCCCGATTGTCGCCAACAACGCGATCCCGAAGCTGGCATCCAGCTCTTTGGATTCCCCCTCAAGAAGGAGCCTCTCGAAAAGTGGATGCGCAATTTAAAACTCGAAAACCCCGAGGAGTGTAAACGGTATCGCATTTGTAATTTGCACTTTGAGAAGCAATGTTTTGCCAAGAAACGGATCACATATTGGGCTGTACCCACTCTACGTCTTGGCCATGACGATGTGGAGAATATATTTCGGCCACCGAATGTCGGGAAGAAAAAGAAGAAACATCGTTGTTGTCTACCGCATTGTCGGAAAATTCGCCACATTCACAATGTACGCTTGCATGGATTTCCCAAGGATCCGGAGGTGCTCAAGAAATGGTTGAGTAATCTCAAGATGTCAGCGGAGAGTGCGACCAAAGCAAGGATTTGTGGTACACATTTCGAAAAGTGCTGTGAGTTTCGACGTGAGCTTCGGCCCTGGGCTCTACCGACTCTCGATCTGGGGCATGATGATGTGATATTTCAGAATCCCGAACCTAGTGGTAAAGCGAAAATTATCGATGACCTCAAGAGATGCTGTGTTCCGGAATGTGGCAGAGAAAATGTTGCTTGTGATGAGAATCCGATTCGAATGTTTGGATTCCCAAATAATGAAGAACTATTGCAGAAGTGGTGTGAAAATCTGAAATTTCAAATGGCTGAATATCGCTCTACCAAACGCAGAGTTTGCGATGCACATTTTGAGACCATGGCAATATATCGAAATAAATTACTCCCAGATGTTGTGCCCACAATGAAGTTGGGCCATGTAGATAAAGTGATGCATCAAAATCCCGACGACTTATCAAAGTATATAACCTGCTATTTACCTCAATGCAATACAGAGAAATCAATCGATGACAACCTCTTCTTCGATTTCCCAAAGAACAACAAGAAACAACTGCAACAATGGTGTGAAAATTTAAGAATCGATATTGAGATTCCCGAGAATAGTTTCCACAGACTTTGCAAATCACATTTTGTTCCCGAAGCAACGGCAGTGAATAAGTTAAAGCCTGGAGCGGTGCCAACGCTTAACCTGGGTCATGATCTTCCCGTCGTTCATGAAAATTCAGAAAAGACTCTCAAACTAACAGCAGGTGATTTGTGTGCCGTACCAGACTGTGAAATAACACAAGCGGACACAAAACTCTTCAACTTCCCCAAGAAACCGAGGAATTTGGAAAAATGGTGTGAGAATCTCAAGTTGAACGATCCCAAACAATGTATGGACTATAAGATTTGTCATCGGCATTTTGAGAAATTGGAGAAATTCACCAAATGGTCGTTGCCAACCTTGAACTTGGGACATTCAGAACAAGAGCCCTACGATCCGGCCACAAATGTTCTTCTCCTGGAGGTGAATACCGATCCGAGTTGTTGCATCTCCAGTTGTCCCAATTCGGACGAGTGCAGCTCTAATGTCCAGCTCTTTGGATTCCCCAAGATCACATGGCTATTGAAGAAATGGTGCTATAACTTGCGAATACCAGAGAAGGGAGCGGAAAAGTTGCAGATTTGCGGGGAACACTTTGAGAAGAATTGTCGTGGAGCGACGAAACTCAAACCGTGGGCGGTGCCTACCAAAGAACTCGGCCACGACGGACACATTCACCAGAGTCTCAAGTTGCACAAGTACCATGAGGAGGAGAAGGCGAGCGGTGAGATGAAATTCGTGCGAGCCGACTACTGTGCGATTGTATCGTGTCTCAAGTCGAAGATTGATGGAGTCCAGTTGTTCAAGTACCCGAGCAAGAATCTCATGGTCCGCAAGTGGGCAGAGAATTGCAAACACAATGCGTACCAAGCTGCTCGCTATGCCTTCCGCATTTGTAGCGAACACTTTGAGGGATTGTGCATTGGTCCCGATCATCGTTTGCGACCCGGGTCGATTCCGACCCTCAATCTCGGTCACGATGAGATCGACATCCATCCGCACGAAGTGATTCCCAGTTGTGTGACGCCATCGACTCTGTCCGAAATCGACGTGGAATGCAGTGTGACGAATTGTGGTCGATCGAAGAAAGCCGACAACGTTCGTCTCTACAAGTTTCCCACCGATGCGGAAATACTCCAGAAATGGTCTCAAAATCTCCGGCTCGATGTGGAAGATGCGAAAAATATGCGCATTTGTAATATGCACTTCGAGGAACGTTGCTTTGGCATGAAGAAAATTCGTCTCTTGATGCGAGCGGTACCGACTTTGCTGCTCGGTCACAACGATCCCGATATAATTCAGAATCCCGACTCATTCGAACGACCCGAGCAAATCATTCGCTGTTGTGTTCCACATTGCAGCAAGTCGAAGCAGGAGGATGACGTGATGCTCTTTGGCTTTCCACGCAATCGTAAACTCTTCGAACAATGGACGGACAATCTGAAGTTGGAACTGAAACCGTCAGCCGATGTTTGGCAATATTATCGCGTGTGTCGCGATCACTTTGAGAGCTTCTGCTATGGCATGGGTCGACGTCTGCTCTTCGGAGCGATACCAACACTGAAACTGGGCCACAACGACACCAACATTCATGAGGTTAAACAAGATCTACTCTCGTATTCATCGCAGGAGCCACGGAAGCGAAAGAAACCCGAGGAGAAACAGGATGTGAGATGTTGCCACCCGGATTGCAGTAGAACATCAAAGGAACAACATATGTATCGAATGCCCAAACACCAGCAACTTCTGGTTATGTGGGCCCAGAATGTGGGCGTGTCTTTGGACGATCCCTTAGGTGCAGATGAGTACAAACTCTGCATAATCCACTTCAAGATCGCCTACGATGAGAATCACCAGAAGATGCTTGAGTTGCGTAAAACGATCACCGAAACAGTGGCCGATCTTGATGCCGACATGATGGAATTGGATGCCATCTACACGGAGATCTCGAACAGCACTCGTATTCGTGTCATGCAATGCAGTGTGCCCGGCTGCAATGGCAACACCATTAATGGCGATGTGAAACTCCATACGTTCCCCTATAATCGTGATCTCTACGAGAAATGGTGTCACAACACCAAGATCGAAGTGGACGAGAGTCAACGTTACCTGTTCAAGGTGTGCTCGCTGCATTTTGAACCCTACTGCATTGTGGAGCCGTCAAAACGTTTGCGTCCATGGGCTTTACCAACCTTGAATTTGCCGCCACATGACGATACCTCGCCGATTGTTCCCAATCCCGACTTGGAAACGTTGGGCTCTCGTAGTTTGGGTTTCGTTGCGGATAAATTATGCTGTATTCCCACATGTGAAAACTCCAAGGAGGAGGCAAATGTTCCCTCATCCATGCTGCACAAGTTTCCACGCGAGGTTCATCTCTTCCGCAAATGGTTGTACAACACACAATTGGAGGCGAACGATGGCATCTACGCCCGCATCTGCAGCAATCACTTCGAGCAGGCCTGTCTCGGTAAACGTTTACGCTCGTGGTCGATTCCCACGTTGAACCTGGGCCATGACAAACCGAACATTCACCTAAATCCAACCGATAAACTCGATGGATCAGCCGGTAAAGCCGAAGACAACAATGAGAAGAAATACAAAATTAAGTTACAGGACAACGATGACTTTGAACAGGACTACAAAATGGAAGAAGAACTGCCGATAAAACGTTTCCTGAAGAAGGAGAAACCGAAATCAGAGACTGTGTATCCCAAAGAAGAAAGCAATAAGAACAATATGAAATGTCTACGAAGTAGCGATCGAATTATTCCAAAGAAGGAAGTGAAAATAAAACGTGATTTGAATATTTCTTCGCCTGACTCGATGGACCATAGCGATAAATTTGAAATCACTCTAATGGATGAGAGTAATCAGGATAATGACTCGGATCTCGACATGTCCCATGATGATTCCGACGAATTGGATGAACAGCCGTTGGCGAGTCGAAAGCGACGACTGGGAGAAGAACCCGAACAGTTTCGTCAGTCTGCGGATCGTCCGTTTTTCAACTTTAAAATCACAGCAATCGAGAGCATCAACACGGACAATCTCACCGATGGCGGTGGCAGCGGATCGTTGAGCTCCACCGGTGAGGTGACCAACAAACGCACCCGAACCCTCACATGCTACGGCGCCGAGTCGAAGATCAACACCAATCTTCACTGCAGTATACCGTCGTGTATGCGAAAGACCAAGGACGGCGTCCAACTGCACATGTTGCCGAAGGATCCGGGCCTGCATCAGAAATGGTGTCACAATTGCAAATTGTCACCAACACCGAATGAGTTGTACATCACTCGTGTCTGCAGCTATCACTTTGAGGATCGCTGCTTTACGCGAACACGTCGTCAACTCACACTTGGCTCGATTCCCACACTGAATCTGGGACACAACGATCCCGATATCTATGAGAATGATCGAAAACGCACGGGTTCCCCCACCTGCTGCGCTCCCGGATGTGGACGACGACGTGTGGAGGATGGTGCTCGGTTTTTCCGTTTCCCAACCGATAAACGAATGCTGGCCAATTGGTTGTTCAATCTTAAAATGGAATACAACATCGCAAGACCGTGGAAATACAACGTGTGCGAGTTGCATTTCGAGGATTATTGCTTTGGGAAGGCGGGCTGCTTGATGCCTTGGGCTGTGCCCACCTTGAAGCTGGGCAAGAATCGTGTGAACTTGCATATGAACGAGCCACCAGATGGTAATGACAGCATCGACGATGATGAATTGTTTCTGTTGCCTGAAAGTAATATTCCTGAAGACGAAATCATTACCGATGACATTTCATATCGCGATCCGATGGAGTATCAAGTTCAGTGTTCGTATAACAATAATTTAGCTGAAAACACAGAGGATTCGTTCAAGTGTCATGTGCCCAATTGTGGTCGTGTCCAAACAAGTGATTCCGATGTTTGTCTATTTCCATTCCCCGAGGATCGGGAACTGCGAGAAAAATGGTTCTTCAATACGAAAATCTACAACTTCGATGACGATCACCTGAAGTCGATGAAAATCTGTGATGAACACTTTGAGAGCAGTGTCTTGAGTTCGAATGGATTGGATAGATTTGCTCTGCCAACTCTTAATCTCGGACACAACGATAGCAATATATTTCTCAATGTCGAATGTAATTCCAACGCCTATGGTGGCACTGGAAATTATTACCAGGGCATGCATGAAATCAAAGAGGAACCCATCGACGAATACCCCGAAGCCGATGATGAACTATTGCAAGTTCACTACGAGACTGAATTCACAATCTATTCCGAAGACACGAACAGTTCGATGAAATCTCTAACGCCCACTTTGCCAGCCGATTCGGTACGATGTTTCCTCAAGCATTGTTTGCGCAAACGCAAGGACGGCGTGAAACTATTTCGTTTTCCCTTCGACACGGAAATGAAAAAGCAATGGGAACATAATCTGAGACTTTCCTATGATAAGAAACTACGAAATCGTATGTATTTGTGTAGTATGCACTTTGAGCCGCGATTCATTGGAAAGAAGTTCTTGCTAAGGTCCGCTGTGCCCACGTTGAATTTGGGCTACAGTAGTCCACTTCAGCATAAAGGTCCATGGGAAAGATCACAGACGAATGCGCCGCCAGCTGTGAAAGAAATCGAGCCAAAAGAATCGCCATCATCATCGTCAAACACACGAATCACTTCAAATACACATGCGAATGAAAAGAAATCGTTGAGTGGAGCGGCGGCGACTCCGCTCCGTTGTCATGTTCCGGGTTGTGAGAATCTCGCCGATGTCGATCGAAAACTCTTTAACTTTCCGAAGAACGTGGTGTTCAATAAACTCTGGCAGGAAAAGACCAAAATCCGTTATGACCCCAAGAAACATACGGAAATCGTCGTTTGCAGTGCACATTTCGAGCCGAGTTGTATCGGCCGGAGAGGAACGCTTTCATCTCAATCGATTCCCACCCTGAAACTTCCAAATGCCAAAACTCCATCATCACAAAATCCACTCGCCTCTGCCAACACATCATGCAGTGTTCCCGATTGTCTGAACAATTCCACACAAGATAATATGAAAATGTTTTCGAAATTTCCCGATGATTTCGAACTCTTCACCAAGTGGTGTTACAATCTTAAAATCGATCCACGTGCATATGTCGACGGTGCCTATAACATATGCAGCTCACATTTCGAACCCTATTGCATTGGTGGACACAGCTTGCGTGTGTGGGCCGTGCCCACGTTGAATTTGGGACACAACGATACGAATATCCATCAAGTCACGCGTCCAGCAGAAACGGAAATCAAATGTTGTCTGCCTCATTGCGGACGGCGGAGAAGCAAAGACGGCGTGAGCTTCTACAATTTCCCAAAAGGTGATCTGTACAAAACCTGGTGTAAAATTCTGCAAATCGATGAGTTGCTCTATCGCAATACGGACAAGAAAATCTGCAGTGCTCACTTCACGGCCGATTGCTTCAATGGTTTGGCCCTCAAGACGGGCGTCAAACCGTCGTTGCATTTAACAGCAGCGAAATCGAATCCCACAACAGGATCTAGAGTTGTGAACAAACAACAGCCATCAGGTGGATGCATTGTATCCAATTGCCGTAATACAAACGATTCTGGACAAATGCTATTTCCATTTCCTGATAATCAAAATCTTTCCAAGAAATGGCATCACAATCTGAAACTGGCCTTTGACGAACAACATCACAACAGTAGCAATTATCAAATATGCTCCAAACACTTTGAGCCCAGCTGTATTTATAATTGCAAATTACATACAGAAGCTATACCCACGCTGGCGCTGGGTCATGCAGACGCAAATATTTTTCAAAATACTGGCTCATTTACAACAACCATCAATGGCAGCATTTCCTCCTCCACCTCCTCCACAAGTTCATTGACATTGAATCAAACGGAATTCTCATTGGAGCCGGAAGTGAATACAACAAAAGAGCCAGCTTTTTAA
Protein Sequence
MDELIIKTEPPDDSGFVNLVDNENSFKIDPFFQSNGPFPHIKEEPSTSQQKSLNFPRRKVQTERAETLPICQRCKQVFLKKSSYIKHVRYSSCSILEYNFKCLICPMSFMSNEELEAHENVHRANQFFCQKYCGKYFDTLDLCEVHEYTFHDFNTFVCNVCNSTFKARDMFFAHKTQHRHLIRFDCPICRRWFTSPNELRRHRLESPFYCGKFYGDTTVQNNDSSFNNGNQANYSLQDCNIGVMEAIPNSSNSQTIHNQIGMNDRSSFLPPCQIKCEPVFDNRYCAAQTAAPEPFFNNSFPTPDFSTFNNASFNDFNNQYPNFERRHSNNFNFSTPFTPATPAVNEGDAICCVPQCGLQKSTSPNLQFFKFPKDEKYLKQWLHNMKLIYNAGESYEDYRICSLHFPKRVINLHSLCYWAVPTYQLGHDDTANIFHNREQNYTVNPQCSVRGCESRRGQTNLKFYNFPPSDSETYVKWCQNTRLPLNSTESRQLCSRHFEDQCFGKIRLKVWAVPTLHLGGFPRIHDNPNVPMLEEKKCCLPHCRRRRSTDPSLSLYSFPRDEQMLQRWCFNFKLDPMEYRGKFHKVCSSHFVRDVIGSLKLLPGAVPTLNLGHNDTNIYNNDILTMPTTPPKYTKPSQSPSLSSPSTAVSGETPDTCSLPTCKRNRYADNVTMHTMPGRPEMLEKWCHNLKLNPLKIPKYFRICSIHFEPYCIGGCMRPFAVPTLHLGHDDPNIHENPKVIKKLNIRETCCVPTCRRNRERDKANLHRFPLHNPEILQKWLKNLRRPMPDGTRLFNDAICEKHFEDRCIRSKRLEKWALPTLQLGHEEEIIPLPSPTEIDELYGKPSTSNPADADVECCVESCKRGPKSDDVKLYSIPKDEVLLEKWSHNLQQDLKETPNARICNLHFEAYCIGKRLHSWAIPTLNLPENSEDLHENPKEMKSLKSVELKGYIQRCCLSHCRKTRIDGEVQLFRFPSYSKSMLKKWCHNLQISIPFGTAAHRRLCSTHFEPHVLTKRCPMPNALPTLNLNCPEGYKVYQNPERLKYAKLRTEKRCVIPSCRKRKEDGVQLFLYPHNRALLRKWAHNTKQKTGDCTRAQLRVCSEHFEENCFGPVRLCPGAIPTLKLGHDDEDIFPNILPPLQEEELRCSINGCGRSRAVDGVKLFKFPCDDEDLLWKWCNNLKLNPIDCRKTRICNLHFESECVGPKCLFRWAIPTLMLGHNDTELQLHANPKPEDRPTTEIIYKCCVQNCGKTRKYDEAQLNSFPKTPELFERWAHNLQMDLDFDDREKYKICNDHFEEICMGKVRLHYGAIPTLKLGHDSTDIYEVDLTEMNTSNSNKRQRLATLNEKNYYEECDFLDEPQAEAKCCFPDCTASKSMQRDSFDMPKTEELLELWCEVLRVDKDNLLAETENPKLCGIHFIDVYEATTDEATTLLDSQPELADELMTLAESYEHCTTSSFIRRVQCSIPECKTNSLSGVRLFAFPSNRELLEKWCFNTGVQLNENPRQSFKVCALHFEPHCLTEAQRLKHWAIPTLHLNDDDDEEEEKKLFQNPAGEAAERSLMTTVQGSKCCVRSCERDRALDDTVRLFNFPNDEELLDKWVHNLEVDPENTPVLKICNAHFERRCITKSGRLQPWAIPTQQLDISGDIDRFPNPGESLPDGPTNVKKNLESMKCIVPSCLNNRLQHGLRFFGIPSDMNIRKKWCIHLKIPPASIQKDQHICSDHFEKRCIDNASLHSWAVPTLKLSPSPSAKPTPSTQSTVASSKNSLSKGPGFGPYLPICCVPNCGRRRTSKNGLRTFGFPKDTQLLEMWCANIKLHPEAVQGYRVCVNHFPEKVRGKTKLKLGAVPTLELGHNEPVMYGLKEVAQKPAIVQANNGPVNNKNPVNNKIPVNNKGSVNRKSPVNNKIPINNKIPINNKTPVKKNTLVAVSNSAEDSKPMPKQRTSTEATVIKPIIRCCITGCRKTTRKPDVRFFTFPKRGHLLLRKWCHNTKQPLKNTRRFICADHFEKHTIVNGKPRPWAVPTLNLGHTDQIYENPPNMVDKCCVPGCTAGENVQLFSFPKRSSMLRKWAHNIKLDAYLVIRHNYKICAVHFEPNCTGPDGLNFGAIPTLHLGHQDDEHNYPNEADADGHKFHCDIPGCGRTRASDGITLFTFPNDEELLKKWCHNLKRQPEECKDFRICNLHFEPGWVRKYKLLDAAIPTMLLGHNDNDLQLITAELQPHVGVLIRCCVPHCGKYKPKDDVHMISFPKGRALFIKWTHNLKLKGSQRLINSFKVCSDHFEDYCKVARCVKYGAIPTLNLGHDDPNIYQNNKELLTGKKTNNGRNCVISECQTHKMLHMQPFDMPTNEQMLRSWCQHFGIDPSQISSSPRLCALHYMDVYEKVQQPSSIISDDGPSNNDNDEIEDLKNTYTTIKNSQRIRVFLCCVSGCNTKYLNGSPQLIKFPRDTELCEKWCDNTKVQVDQKRRYLYKVCALHFEERCFINARLRNWALPTLRLNHNDEIHSNPEEEQTVCCVRNCSSSKEPQANDGTEMVNKLWTFPDDETLLQKWCHNLEVKIAESKGLKVCSEHFESYCFGKGLRPWAIPTLNLKRTTKTKLFKNPEIKDITVSYSSVHSIMCCVPDCRQQRDPEAGIQLFGFPLKKEPLEKWMRNLKLENPEECKRYRICNLHFEKQCFAKKRITYWAVPTLRLGHDDVENIFRPPNVGKKKKKHRCCLPHCRKIRHIHNVRLHGFPKDPEVLKKWLSNLKMSAESATKARICGTHFEKCCEFRRELRPWALPTLDLGHDDVIFQNPEPSGKAKIIDDLKRCCVPECGRENVACDENPIRMFGFPNNEELLQKWCENLKFQMAEYRSTKRRVCDAHFETMAIYRNKLLPDVVPTMKLGHVDKVMHQNPDDLSKYITCYLPQCNTEKSIDDNLFFDFPKNNKKQLQQWCENLRIDIEIPENSFHRLCKSHFVPEATAVNKLKPGAVPTLNLGHDLPVVHENSEKTLKLTAGDLCAVPDCEITQADTKLFNFPKKPRNLEKWCENLKLNDPKQCMDYKICHRHFEKLEKFTKWSLPTLNLGHSEQEPYDPATNVLLLEVNTDPSCCISSCPNSDECSSNVQLFGFPKITWLLKKWCYNLRIPEKGAEKLQICGEHFEKNCRGATKLKPWAVPTKELGHDGHIHQSLKLHKYHEEEKASGEMKFVRADYCAIVSCLKSKIDGVQLFKYPSKNLMVRKWAENCKHNAYQAARYAFRICSEHFEGLCIGPDHRLRPGSIPTLNLGHDEIDIHPHEVIPSCVTPSTLSEIDVECSVTNCGRSKKADNVRLYKFPTDAEILQKWSQNLRLDVEDAKNMRICNMHFEERCFGMKKIRLLMRAVPTLLLGHNDPDIIQNPDSFERPEQIIRCCVPHCSKSKQEDDVMLFGFPRNRKLFEQWTDNLKLELKPSADVWQYYRVCRDHFESFCYGMGRRLLFGAIPTLKLGHNDTNIHEVKQDLLSYSSQEPRKRKKPEEKQDVRCCHPDCSRTSKEQHMYRMPKHQQLLVMWAQNVGVSLDDPLGADEYKLCIIHFKIAYDENHQKMLELRKTITETVADLDADMMELDAIYTEISNSTRIRVMQCSVPGCNGNTINGDVKLHTFPYNRDLYEKWCHNTKIEVDESQRYLFKVCSLHFEPYCIVEPSKRLRPWALPTLNLPPHDDTSPIVPNPDLETLGSRSLGFVADKLCCIPTCENSKEEANVPSSMLHKFPREVHLFRKWLYNTQLEANDGIYARICSNHFEQACLGKRLRSWSIPTLNLGHDKPNIHLNPTDKLDGSAGKAEDNNEKKYKIKLQDNDDFEQDYKMEEELPIKRFLKKEKPKSETVYPKEESNKNNMKCLRSSDRIIPKKEVKIKRDLNISSPDSMDHSDKFEITLMDESNQDNDSDLDMSHDDSDELDEQPLASRKRRLGEEPEQFRQSADRPFFNFKITAIESINTDNLTDGGGSGSLSSTGEVTNKRTRTLTCYGAESKINTNLHCSIPSCMRKTKDGVQLHMLPKDPGLHQKWCHNCKLSPTPNELYITRVCSYHFEDRCFTRTRRQLTLGSIPTLNLGHNDPDIYENDRKRTGSPTCCAPGCGRRRVEDGARFFRFPTDKRMLANWLFNLKMEYNIARPWKYNVCELHFEDYCFGKAGCLMPWAVPTLKLGKNRVNLHMNEPPDGNDSIDDDELFLLPESNIPEDEIITDDISYRDPMEYQVQCSYNNNLAENTEDSFKCHVPNCGRVQTSDSDVCLFPFPEDRELREKWFFNTKIYNFDDDHLKSMKICDEHFESSVLSSNGLDRFALPTLNLGHNDSNIFLNVECNSNAYGGTGNYYQGMHEIKEEPIDEYPEADDELLQVHYETEFTIYSEDTNSSMKSLTPTLPADSVRCFLKHCLRKRKDGVKLFRFPFDTEMKKQWEHNLRLSYDKKLRNRMYLCSMHFEPRFIGKKFLLRSAVPTLNLGYSSPLQHKGPWERSQTNAPPAVKEIEPKESPSSSSNTRITSNTHANEKKSLSGAAATPLRCHVPGCENLADVDRKLFNFPKNVVFNKLWQEKTKIRYDPKKHTEIVVCSAHFEPSCIGRRGTLSSQSIPTLKLPNAKTPSSQNPLASANTSCSVPDCLNNSTQDNMKMFSKFPDDFELFTKWCYNLKIDPRAYVDGAYNICSSHFEPYCIGGHSLRVWAVPTLNLGHNDTNIHQVTRPAETEIKCCLPHCGRRRSKDGVSFYNFPKGDLYKTWCKILQIDELLYRNTDKKICSAHFTADCFNGLALKTGVKPSLHLTAAKSNPTTGSRVVNKQQPSGGCIVSNCRNTNDSGQMLFPFPDNQNLSKKWHHNLKLAFDEQHHNSSNYQICSKHFEPSCIYNCKLHTEAIPTLALGHADANIFQNTGSFTTTINGSISSSTSSTSSLTLNQTEFSLEPEVNTTKEPAF

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-