Basic Information

Gene Symbol
GA10450
Assembly
GCA_018901815.1
Location
JAEIFI010000763.1:2825718-2841743[+]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 28 1.4e-15 2.7e-12 47.1 3.8 1 86 740 812 740 813 0.85
2 28 2.5e-15 5e-12 46.2 4.6 1 87 840 909 840 909 0.83
3 28 6.2e-16 1.2e-12 48.2 0.4 1 87 931 1003 931 1003 0.85
4 28 4e-16 8e-13 48.8 5.3 1 87 1054 1124 1054 1124 0.83
5 28 7.4e-15 1.5e-11 44.7 3.5 1 86 1148 1219 1148 1220 0.82
6 28 5.5e-05 0.11 13.1 0.1 50 87 1267 1293 1247 1293 0.77
7 28 8.7e-11 1.7e-07 31.7 1.4 1 86 1339 1408 1339 1409 0.76
8 28 5.9e-16 1.2e-12 48.3 0.1 1 86 1436 1505 1436 1506 0.82
9 28 3.3e-12 6.7e-09 36.2 0.7 1 85 1527 1595 1527 1597 0.79
10 28 3.2e-15 6.3e-12 45.9 1.7 1 86 1624 1695 1624 1696 0.85
11 28 1.7e-13 3.4e-10 40.4 1.8 1 85 1770 1838 1770 1840 0.81
12 28 3e-12 6e-09 36.4 0.1 1 86 1863 1931 1863 1932 0.81
13 28 9.6e-14 1.9e-10 41.2 1.8 1 87 2077 2146 2077 2146 0.80
14 28 1.3e-12 2.5e-09 37.6 0.5 1 61 2222 2275 2222 2293 0.78
15 28 0.042 84 3.9 0.0 1 58 2308 2358 2308 2380 0.74
16 28 5.5e-12 1.1e-08 35.5 0.3 1 86 2397 2466 2397 2467 0.84
17 28 1.6e-15 3.2e-12 46.9 1.6 1 86 2531 2600 2531 2601 0.82
18 28 2.7e-12 5.5e-09 36.5 0.9 1 86 2636 2707 2636 2708 0.81
19 28 6.8e-13 1.4e-09 38.5 2.0 1 87 2718 2790 2718 2790 0.80
20 28 9.8e-15 2e-11 44.3 0.1 1 86 2820 2894 2820 2895 0.75
21 28 5.2e-05 0.1 13.2 0.1 1 58 2931 2985 2931 3005 0.83
22 28 3.2e-15 6.3e-12 45.9 0.4 1 86 3023 3095 3023 3096 0.82
23 28 1.5e-13 2.9e-10 40.6 3.7 1 86 3241 3313 3241 3314 0.82
24 28 4.4e-14 8.7e-11 42.3 2.5 1 87 3374 3445 3374 3445 0.82
25 28 8e-14 1.6e-10 41.4 3.6 1 86 3558 3628 3558 3629 0.84
26 28 7.6e-11 1.5e-07 31.9 0.1 10 87 3644 3710 3639 3710 0.80
27 28 2.3e-08 4.6e-05 23.9 0.3 1 58 3732 3780 3732 3786 0.84
28 28 3.4e-10 6.7e-07 29.8 1.9 18 87 3797 3855 3785 3855 0.74

Sequence Information

Coding Sequence
ATGGTGCTACTATTTAAGTTTCTTTTAAAATCGAGCAAATTGCCCGCGCGCGCACATTTTCGTCCTACTTTCCTGGACACACTGCGTCTGACAGTGCGTGGCGGACATGGCGGAAACGGATTGCCAAAGTACGGCGGTGTCGGCGGCCAAGGAGGCTGTGTTTACTTTGTTGCCAAGGAGGGACTGACGCTGCGCAGGGTGGCCCAGAACCTGAAAGACAAACGAGTGCACGCCTCAAGTGGCGAGGATAGCAGCAAAGTAAGCATCTTCGGAAAACGTGGAGTGGATCAGTGCATCGAAGTGCCGGTGGGCGTGCAGGTCTATGATGAGAACAACAAGTTGCTGGCGGATTTGAATGAGAACGATGCCAAGTGTATTGTAGCAGGCGGTGGCACTGGCGGCTGCACTGGAAATAATTTTCTTGGACGTCCTGGCGACCATCGAATAGTGAATTTGGATCTCAAACTGATTGCTGATGTGGGCATGGTGGGTTTTCCCAATGCTGGGAAGAGCACGTTGCTGAAAGCCATTTCCAATGCCAAACCAAAGATAGCCGCATATCCTTTCACCACAATTCGACCACAGATCGGCACAATAGATTACAGCGATCTAAGATCCATCGCCGTTGCTGATTTGCCTGGATTGATTGAAGGGGCTCATGCCAATTTTGGAATGGGTCACAAGTTTCTGAAGCATATTGAACGCACACGTTTGCTGCTCTTCATGGTGGACATATTTGGATTCCAGTTAAGTCCCAAACATCCACATCGCGATTGTTTGAGCAATATCTATGCACTAAACAAGGAGCTGGAGCTATACGATCCCAGTCTACTGGAGAAACCCTGCGTCTTGTTGCTAAACAAAATGGACAAGGAAGGGGCGCAGGATCTGCTGAAAAAGTTCAAACCAAGCATTCGGGATTTGAACAGTGGTCTGTCTGAATGTCCCGAAGAAGTTCGTCCCAGCAGAGTACTTAAATTCGAGCATATACTACCCATATCCGCCAAGAACTCAACACGAATAACGCAAGTAAAGCAACAACTGCGCGATACACTGGACACCATAGCAGCAGAACACATGGTGGTCGATAATGAGCTCCTAAAGGAACAACTGCAGCGACGTGTGGGTGTCAGAGCGGAACCCATTGACGAACATGCGTATAAGTCCAACTATATTGATGATAATACGCCGTTTGCCGATTTTAGTAAATTTCCAGAATTTGGCGACGACATGCTGAGTCCCAAGGTCGAGCTGACCGTCAAGGATGAGGCGTATCAAAAGAACCCGCTTAGTTATCCGCGGCGCAAGCTGCAATCGGAACGTTCAGCGGAGAGCTTGCCGATTTGTCAGCGCTGCAAGGAGGTGTTCTTCAAGAAGCCAATCTACTTGCGGCATGTGGCAGAGAGTAGTTGCACCATACAGGAGTATGACTTCAAGTGCAACATATGTCCCATGTCCTTTATGAACACCGAAGAACTGCAAAAGCACAAGCATCTGCACAGGGCCGATAAGTTCTTCTGCCACAAATATTGTGGCAAGTACTTTGACACCATTGCAGAGTGCGAGTCACATGAGTACATGCACCACGAGTATGAGAGCTTTGTTTGCAACATGTGTTCGGTTACATTTGCCACACGGGAGCAGCTCTATGCGCACTTGCCACAGCACAAATTCCAGCAACGTTTCGATTGTCCCATTTGCCGGCTATGGTATCAAACGGCATTAGAGCTGCACGAGCATCGTCTGGCGGCGCCCTATTTCTGTGGCAAGTATTACGCAGGCGCACAGACCACACAGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTGGCCACCATGGAAATGCCGACGACACATCAAAAGGCAACTGCTGGCAGCGCATTGCCGGCAACGGCAGCGTTGAGCTCTCTGTTGCAACAACGTCAGGCAAATGCCGATGGTGCCGCATTGTTTGCATCCTCTCTGAAGAACGAAGTGAACGTTAAACTGGAGCGAAGCTATAGCAACTCTACAACAAGCGAATCTGGCTACAGTCTGCATGACAGCAGCAACTATAACAATGCATATGGAAGCGATGCCTCGTTACATGGCGGTGCAGGTGCCGGAGCTATTGGTGGTCCACAGGCGCATTCCTCAACGCTGGACGATTCAGAGGATGCGCTTTGCTGTGTGCCGTTATGTGGTGTGCGCAAGAGCACCAGCCCCACGCTGCAGTTCTTTACATTCCCCAAAGACGAGAAGTACTTGCATCAATGGCTACATAATCTCAAGATGTTTCATATCCCGGCCTCGAGCTATGCCACTTATCGCATTTGTAGCATGCACTTTCCGAAGCGGTGCATCAATCGTTATTCATTGTGCTATTGGGCTGTGCCCACGTTCAATTTGGGACACGATGATGTTGCCAATCTATATCAGAATCGCGAGCTAACGAACACCTTTACCACCGGCGAGGTGGCACGTTGCAGCATGCCCAACTGTACCAGTCAACGTGGGGAAAGCAATCTCAAGTTTTACAACTTCCCGAAGGACATTAAGAGTCTTATCAAGTGGTGCCAGAACGCACGTTTGCCCGTCCAGGCCAAGGAGCCGCGTCACTTCTGCAGTCGACACTTTGAGGAGCGATGCATTGGCAAGTTCCGGCTAAAGCCATGGGCGGTGCCAACACTGCATTTGGGCGCCCAATACGGCAAGATTCATGACAATCCCAAGAATTTGTATGTGGAAGAGAAGCGCTGTTGCCTTAATTTCTGCCGTCGCAGTCGCTCCTCGGACTTTAACATGTCGCTGTATCGCTTCCCTAGAGATGAAGTGCTCTTGCGACGTTGGTGCTACAATTTACGCCTTGATCCAGCCGTATATCGTGGCAAGAATCACAAGATATGTAGTGCTCACTTCATCAAGGAAGCACTGGGATTGCGAAAACTATCGCCGGGTGCCGTTCCTACGCTTCATTTGGGTCACAACGATACGTTCAACATCTATGAGAACGAACTTTCGGCATCGCACTATGTCGATCCAGATTTGAGTGCATCGTATATGGGCATGGGAAGCGGATCGGGTGGATCCGCATCGGGCCTGAATGTGAGCGACAGCATGGATGTGTGCTGTGTGCCAAGTTGTGAGAGCAAGCGACACAACAACGAGAACATCACGTTCCATACGATACCGAGGCGACCCGAACAAATGCGTAAATGGTGTCACAATCTCAAGATACCCGAGGATAAGATGCATAAGGGAATGCGAATTTGTAGTCTACACTTTGAGCCCTATTGCATTGGCGGCTGTATGCGTCCGTTTGCGGTGCCCACACTGAATCTCGGGCACGACGACGATGATATACATCGCAATCCGGATGTAATTAAGAAACTGAATATACGGGAAACTTGCTGTGTGGCTGTATGCAAGCGTAATCGGGATCGCGATCACGCCAATCTGCATCGTTTCCCCAGCAATGTCTCGCTGCTGACGAAATGGTGCGCCAATCTGCAGCGACCTGTGCCCGATGGCACTAAACTCTTTAACGATGCCATTTGTGAGGTGCACTTTGAGGATCGTTGCCTGCGCAACAAGAGATTGGAGAAATGGGCAGTGCCTACGCTGGTGCTGGGACATGACAACATTGCATATCCACTGCCAACGGCAGAGCAGGTGGCCGAGTTCTATGCCCGTCCCAGTGCACCCAACAATGGGGAGGAGCAGGGCGAGTGCTGTGTGGAGACCTTGGATGCTGCCAAGTTGTGCAGCATGCGTATATGTAATCTGCACTTTGAATCCCATTGCATTGGCAAGCGTATGCGGCCCTGGGCCATACCCACACTCAATCTGTCCAACAATGTTGAGAATATGTACGAGAATCCCGAGCATCATATGATATACAAGCGTCGCACACAAATTAAAACCGAGAGAGGTGCATCGGGAGCGGGGGCAAGCATGTCGAAGCCCACTTGGGTGCCACGTTGCTGCTTGCCACATTGTCGCAAGGTGCGTGCTCTGCACAATGTGCAGCTCTATCGCTTCCCCAAACTCAATCGTTCCACGTTGGCCAAGTGGGCCCATAATCTGCAAGTGCCACTGGTGGGCAGTGCCCAGAGGCGACTCTGCTCTGCCCACTTTGAGCCGCATGTGCTGAGCAAAAAGTGTCCGGTACCGCTGGCGGTGCCCACACTGGATCTCAATGCGCCACCCGGTTATAAAATCTATCAGAATCCCGCCAAGCTAAAGGCCAACAAGCTGTGTCTGCAGCGTGTCTGCATTGTGGAGAGCTGCCGTCGTCAACGTGGTCAGGGCGTGCAGCTCTTCCGTCTGCCGCACAGTCCCACACAGCTGCGCAAATGGATGTACAACATACGGATGCGACCCAGAGGTGCCATGCGGCAACAGTATCGCATCTGCTCGACACACTTTGAGACGCATTCGTTCAATGGAAAGCGACTGAGTGCGGGAGCGATACCAACACTGGAGTTGGGCCATGACGACGAAGACATCTATCCCAATGAGGCGCAATCATTTGTGGAGGAACACTGTGCCATCGATGGGTGCGATGCATCCAAAGAGCAGCCAGATGTGCGTCTCTTCCGCTTCCCCACCGACGATGAGGATTTGCTCTGGAAATGGTGTAACAATCTGAAAATGAATCCTGTCGATTGTATTGGCGTGCGCATCTGCAACAAACACTTCGAGGGAGACTGCATAGGACCGAAGCATCTTTACAAGTGGGCCATACCCACCCAGGAGTTGGGCCACGATGATTCTCAGATTGAGCTCATTACCAATCCCAAGCCAGAGGAGCGTTATGTTGATCCCGTGTTCAAGTGTTGTGTTCCCACCTGCGGCAAGACGCGTAAATTCGATGAAGTACAAATGAATAGCTTCCCCAAAGATCCTACACTCTTCCAGCGCTGGCGACACAATCTGCGACTGGAGCATCTCCAGTTCAAGGAGCGCGAACGCTATAAGATCTGCAATGCACACTTTGAGGATATTTGCATTGGCAAGACACGTCTGAATATTGGTTCCATACCCACACTGGAGTTGGGGCACGACGATACGGAAGATCTCTTCCAGGTGAATCCTGCGGAGTTGCAAAGCAATTTGTTTGGTCGCCAGCGACGTGTACAGGAATCAGGTCTGAGGGTCAGAATAAAGCAGGAGGAGCACTCCGAGCAAGAGGAGACGGATGTTACCATGTCGGAGGCTACGGATTTGAATACCAAGCAGGTAAAAATAAAGAAACGCCTTGCGGATTTGAAGTGCTGCGTGCCCAGCTGTGGTCGCAGTCGTCTGGAGCACGGCGCACGTTTGTTCCCCTTTCCAACTGGCAAACAACAGCACAGCAAGTGGCGCCACAATTTGCGTCTAGATGCCAGCGATGTGGACAAGACGTCGCAAGTTTGCAGCGCACACTTCAATCGACGTTGCATCGATGGCAAGCAGCTGCGTGGCTGGGCAATCCCCACACAGCAGCTGGGACATCAAGAGCAACCGATCTATGAGAATCCAAAGAATATACCGGGCTTCTTTACACCCACCTGTGCGCTGGCGCACTGCCGCAAGCGGCGAAGTATTGACAATGATTTGAGAACTTATCGTTATCCCCGCAGCGAGGAGCTGTTGGAGAAATGGCGAGTGAATCTACGACTGGCTCCAGATCAGTGTCGCGGACGCATCTGTGCGGATCATTTTGAGCCAATAGTACGTGGCAAGCTGAAGCTGAAGACGGGTGCAGTGCCAACACTTAAACTGGGACATGATGAGGGCGTGGTCTTTGACAATGAGGCCATTAAGGCGGGACTGCAGCTGGATGGAGCTGGAGAGCCAGAGGTGGAGGAGGGTGAGACGAGCAGAGAATCGCTGATTAAAATTAAAAAGGAAAAACTAGAACCAGATGAGGAGGAGGCGGAGGTAGAGCAGACACAGCTTGACAATGGAGACGATGATGATGAGCAGGAAGAAGATGTGGATGCGGATGAGGATGATGATCATGGCTACTTCGATCCATTGGAACTGGTTGAAACCTTTGCTGACGACAATGCAGCCGATGATGATGAAGGTGATATTCCTGGTAATGACGATGAGCTACTGCTGCTCCCCGATACCCCGCCAGTGAAGCTAGAGCATCCGCTCCCGCCGCTGCTACGTCGCGAGAAGCCCGTGAATAATGTGACGCCAATTTGTTGTCTAAAGCATTGCCGCAAAGAACGCACTGCAACACATCAGCTGAGCACCTTTGGTTTTCCAAAGGATCGCCAGCAGCTACGCAAGTGGAGTATCAATCTACAAATTGCCGAGAGCGATTGCGTGGGACGTGTTTGCATTGAGCATTTCGAGGCGGAGGTGCTGGGCACGCGTAAGCTCAAACAGCATGCGGTGCCCACATTGAATTTGGGCCATGACACGCCCCTAATCTACAGATGCAATGGGCAGGCGCAGCCCATGGAAGGCATCTTCGATGAGCAGCCGCAGCATTCGGTTTTTCGGCTTTGGAGCCTGAAACACTGTCGCAAAAGGAAGCTGGAAGCAATGGAACCGCCCGATCATCATCATCATCAACGAGCAGTGATTAAGCAGGAGATGGAGGAGGAGGAGGAGCAGGAGGATGGGCTTAGCTGTTGCCTGCCCCACTGTGGCATGCGGGCGAATGACGTGCAACTGCATCGTCTGCCCAGCAATCGCATTCGACTGCGCAAATGGTTGCACAATCTCAATCTGCCCCAGAGCATTCCCATCGACAGTCAGACACGTGTCTGCAGCGAACACTTTGATGTGCAACCGAGTGAGGCACTAGAGGATTGTATGCCCACCTTGAAACTGGGTCACAATGATATCAATATATATAGAAATCAAAACTCTTTTGCCTCCTCATCCTGCTTAGTGCCAAGCTGCCCGTGTGCACGACTCAATCTCTATCGAGGGTATGATTTGCCTGAGCATCATCTGGTACAGCAGGCTTGGCTGCAATTCCTGCGACAACCCATGCCTCAGCTGCCCGCCGATGGTCAGCTCTGCGTGATGCACTATATGCAGCTGTACGAGCAGGTCTCGCTGCCCCAGGAGGTGCCAGCGACAGTGTTGCGTCAACTGCAGGAAACCTACGAACAGATTGCCAACTCCACCATGGCCATGAAGTTGCGCTGCGCCGTGCCCGGCTGCTACTCCAAGTATACGGATAACATAAGGTTAACCAAGCTGCCCACCTGCTCGGACATGTTTGCCAAGTGGCTGCACAACACCAAGATCAAATACGATGCCAACAGGCATTATATCTATCGCATTTGTATGCTCCACTTTGAGGCCCGTTGTCTGGGTCCAGTGCGTCCCAAATTGTGGGCGGTGCCAACGCTGCAATTACATCACAACGATGCAAATATCTATAGGAATCCCAAACTAGATGGGAGTGGACAAGAGGTACAGCAAGTCTTTCCACAACCTGTACCCATGCCCGTGGAGCTGCCATTGCGCATCAAGACGGAGCTGGCTTACACTGGCAGTCCGAGTGCCAGCGCTAGTCCTAGTCCACGTGGCAAGTTGCGCATTTGCTGCATTCCCGGCTGTGCCCAACAGGCCAACTCATTGGTGCGTCTCTTTCGCTTCCCCACCTCTGAAACAATGCTGCTCAAGTGGCTGGTGAACACACAGCAGAAGCCCCGTCTGGCGGATCCCCAGCATCTGTTTGTCTGCCAGGATCATTTCGAAACGGAAGCCATTTGCAAGAAGCAGCTAAGCAGTTGGGCTGTGCCCACATTGAAGCTGGGACACGATGGTCATATCATTCCCAATGCCAAGCACAATGGAAACATTGCGGACACACATGATAACCGACACACGCTGCAATTTATTTGGGCCAATTACTGCTCTGTGCTGGACTGTTTTGAGCCCCGCAGTGAGCAACTGCGTCTCTTCGCATATCCCACAGATCGACCTACCATACGCAAGTGGGCTGCCAATTGCAAGCATCGTTCCATGCAGGCCAGCAGCGATGGCTTTCAGGTTTGTCAATCGCACTTTGAGCCACATTGTTTTGATCCGGAAACGGGAGAGTTGCGAGTGGATGCTGTGCCCACGCTGGAGCTAAGTCGTGGTATCAACAAGATGGGTTGCATGGTGCCCGGTTGCGTCAAGGATGAGAATGGACCGCGTCGTCGCCTGTTTAAGATGCCCAAGCGTTCATCGCAATTGTTAGATTGGTGTCACAATTTCCAACTGGATTCATCGACCATCTCAATTGAGAAGAGTGTTGTGTGTGATCGTCACTTTGAGTCGCATTGCTTGAATGTGCATAAGCAGCTGCGTCGTGGAGCACGTCCCACACTTCACTTGGGTCACACTGCGGATGTTGAGATATTGCAGAACCCTTCGAATTGGGCCCGCTGCACTGAGGTGCCTGAGACTCCTGCTGATGTCTGCTGTGTGCCCAATTGTGGGCGGTACAAGGATGCCGAGGAAGAGGATACTGATATGCAGCTCTTTAGCTTTCCCAAGGTGCGAACGCTGGCAGAAAAATGGCTGAGGAACATTCGACTCGAGGCAAACAAGGAACAGTTGGCTGAGTTGAAAGTCTGCAATGCGCACTTTGAAATCTACTGTCTGGAGAATGGACGCCCACAGTTGGGTGCCATGCCCACACTGCAGTTGGGACACGATGATCATCATGATATACATCGCTCCAGTTGTCCCACGTCCGTGTCGGCTTCCAAAATGAAACGTTATTGCAATAGAAATGGATACAGCTATGATTGCTGCTTTCCTCAGTGTGTGGAGCTGCAGAAGAGTTATCTACGGATAAGCTACAATTTGCCACAGTCGCAAGCAATGCGTAATGCCTGGATAGCCTACATGGAGTTGGAGGAACCGAAGGAGCAAGAAAATGAGCAGCTACTGAAATTGTGCCCCCTGCACTTGATCATTCTTTACGAGCACAGCGTAGACAATTTCCCAGAGCACACAACAGAGGAACTCTTGGAGGATAACTATGCAGCTGCGAGAAATAGTGTACGGATACGTATCATCAGTTGTGCAGTGCGTGGCTGCAAGACTCTGAAGACCCGCGATGGCGGTTGCCTTCATGGTCTGCCCCAGCGTCGGGATATACTGCAAATGTGGTTGCACAACATGCAGTTGGTGTTCTACGAGCAACAGCGTTACATGTACAAGATCTGTAGCAAACACTTTGAGCCCAATTGCTTTACGGACACAACACGCCGCCTCAAGCCATGGAGTATGCCGACGTTGGAGTTGCCTGTGCCCGAACCGGGTGAACCAGAAGTCTATCAGAATCCCACAGAAGCTGAATGGCAGCAAATGAATGAACAATGGGCAGCAGAGCAGCTGCAAATAGCTCAGCCAATGATGGAGGATGAAGAGCATGAGGAAGTTGTGCAAGTGAAGCTTGAGCCGGAGACGGAGATGGAACCGGATAATCTGCTGGCATATGAGGAAGAGGATTACTCGCAGCCTCCAGTGGATGGAGAGGAGGATGCCAGTTCGCAGCAGCCGCTGGAGATGCAACCCTTGGAGGTGCTGCTTGAGGTGGGGCATGTGGAGAAGTGCAGAACATACGAGCAAATGGACACCAAGGCAAATCTAAGTTATGCCGAGCAACAGGCTGGACCGCTGCCTAGTAATGGAATCTTTGCCAGTAATGGCAATAAGTACACCGCACGCAATTGCAGTGTACAAGGCTGCCACGTGACCTTGAACGATATCTGTGGCAATGTGAAGTTGCACAAATTTCCCACGTCGTGGAAGGCAATGCAAAAGTGGATGCACAACACCCAAGTGAAGGTGTGCAGAAGTGTCTCGTGGCGTTTTCGCATTTGCAGTTATCATTTTATGGAAGATTGTTTTCAAGGCTCTCGCCTAAGGCGTGGTGCCATGCCCACATTGCAGTTGGGCCCCAAGCGACCTTCGCATATCTATGAGAGTGAATTCAATGTTGGCGATGCCGATGAGGAGCAGCCCATTGAAGAGTCATCCGAACAAACACCAAAGTCAAGGGTGGTAGGTGGCGATAATATTAGCCTGTGCCTGCCTAGCCCAGCACCACCTCGCAAGTCGAGCAAGTTCTGCCAAGTCGATGGTTGTCCTAATCATTTGACCAGCGAGAATCTGACGCTGCACAAGTTCCCCCACTCGCCGGACATGTGTGCCAAGTGGCAGCATAATACACAGGTGCCATTTGATCCCGTCTTCCGTTGGCGATATCGCATATGCAGCGCCCACTTTGAACCGATCTGTCTGTTGAATATGCGCCTGCTGCACGGCAGTGTGCCCACGCTTCACTTGGGTCCACGTGCTCCACAGCAGCTCTTCGACAGCGACTGTGAGGCCATCAGCATGCGCTTGGATAAGCAGAAGAGCAGCTCAGAGCAGCAGTTGTACATCAAACAAGAGCATGAAGAGGATCACGATGATGCGGAGCAGGATGAGGATGAGTTTAGTTTTCTGGTGCCAGAGATGCAGCTGCATGAGGATGCAGGGGAACATGAAGCTAACATAAAGACGGAGCCCAGTCAAACCAATAACGGACGCTGGAAGGATCTGCGTTTGCCCAGCATTAAGCAGGAGGATACACCGACCATGACGTGCTATAATCCCGTCAAGTCTGGCTATGACAAGTGCTCGCTGGTGCATTGTCAGCGCCAACGATCTCTGCACGGCGTGCATATCTATAAATTTCCCCGATCCCATCAACTACAGCAGCACTGGATGCACAATCTGCGAATACGCTATGATGAACGGAGACCCTGGAAAACAATGATATGCAGTGTACACTTTGAGGCGCAATGTATTCGTCTGCGTAAGCTGCGTCCTTGGGCGGTGCCCACACTGGAACTGGGTGACAACGTGCCCCAGGAGATCTTTAGCAATGAACAGAGTCGGCAACATCCACGTGGCGATGGCATCAAGCTGTTTCGACTGCCGAACCGGCTGAGCGACATACACAAATGGGAGGTGGCAACGGGAATGCATTTCACCGAATCCCAGCGCAACACGAAGCTCATCTGCAGTCGTCACTTTGATCCGCAGTTGATTGGCGTGCGTCGTCTGATGTACAATGCGGTGCCGACTCTTCATTTGCGACCAGAGACTGCGCGAGAAGAAAGGATGCCTCCGCGCCCTCGTCCAGCGGGGCCACGTTGTTTCATGCCCAGTTGCCCACAGGATTTGCATCAAAAGTTGCACAAGTTTCCCAGTGATCCTCAGCAGCTTCAAGAATGGTGTGCAGTGCTCACAGTCTCCGAAACGCAGGACTTTAGTAGCTTGTACATTTGTGACGATCATTTGCCAACGCTGCCAAGAAGCTGCCTCATCTGCAGTGTGGAGGAACTGCAGCTGCCACTGCTGAACTTCCCAGAACATCGCAATCAACGCGCCAAGTGGTGCCATAATCTCAAAATCGATACCATTCCCAAGTGGGATAACTCAAAGCACATTTGCAGTCGACATTTTGAGAGCTATTGCTTCATCCAGCCGGGTCAACTGGTTCCCGAGGCTGCTCCCACGTTGCACTTAAAGCACACGGACAGCAACATATTCCTTAACGATTATGCAATGGAGTCCAGCAAGTTGATGTTGCGCATTAAGGATGAGCCATTGGAGAGTGACGATCTGATGCTGTAA
Protein Sequence
MVLLFKFLLKSSKLPARAHFRPTFLDTLRLTVRGGHGGNGLPKYGGVGGQGGCVYFVAKEGLTLRRVAQNLKDKRVHASSGEDSSKVSIFGKRGVDQCIEVPVGVQVYDENNKLLADLNENDAKCIVAGGGTGGCTGNNFLGRPGDHRIVNLDLKLIADVGMVGFPNAGKSTLLKAISNAKPKIAAYPFTTIRPQIGTIDYSDLRSIAVADLPGLIEGAHANFGMGHKFLKHIERTRLLLFMVDIFGFQLSPKHPHRDCLSNIYALNKELELYDPSLLEKPCVLLLNKMDKEGAQDLLKKFKPSIRDLNSGLSECPEEVRPSRVLKFEHILPISAKNSTRITQVKQQLRDTLDTIAAEHMVVDNELLKEQLQRRVGVRAEPIDEHAYKSNYIDDNTPFADFSKFPEFGDDMLSPKVELTVKDEAYQKNPLSYPRRKLQSERSAESLPICQRCKEVFFKKPIYLRHVAESSCTIQEYDFKCNICPMSFMNTEELQKHKHLHRADKFFCHKYCGKYFDTIAECESHEYMHHEYESFVCNMCSVTFATREQLYAHLPQHKFQQRFDCPICRLWYQTALELHEHRLAAPYFCGKYYAGAQTTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXATMEMPTTHQKATAGSALPATAALSSLLQQRQANADGAALFASSLKNEVNVKLERSYSNSTTSESGYSLHDSSNYNNAYGSDASLHGGAGAGAIGGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLHQWLHNLKMFHIPASSYATYRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPNCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPAVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELSASHYVDPDLSASYMGMGSGSGGSASGLNVSDSMDVCCVPSCESKRHNNENITFHTIPRRPEQMRKWCHNLKIPEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLNLGHDDDDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVSLLTKWCANLQRPVPDGTKLFNDAICEVHFEDRCLRNKRLEKWAVPTLVLGHDNIAYPLPTAEQVAEFYARPSAPNNGEEQGECCVETLDAAKLCSMRICNLHFESHCIGKRMRPWAIPTLNLSNNVENMYENPEHHMIYKRRTQIKTERGASGAGASMSKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPLVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTLDLNAPPGYKIYQNPAKLKANKLCLQRVCIVESCRRQRGQGVQLFRLPHSPTQLRKWMYNIRMRPRGAMRQQYRICSTHFETHSFNGKRLSAGAIPTLELGHDDEDIYPNEAQSFVEEHCAIDGCDASKEQPDVRLFRFPTDDEDLLWKWCNNLKMNPVDCIGVRICNKHFEGDCIGPKHLYKWAIPTQELGHDDSQIELITNPKPEERYVDPVFKCCVPTCGKTRKFDEVQMNSFPKDPTLFQRWRHNLRLEHLQFKERERYKICNAHFEDICIGKTRLNIGSIPTLELGHDDTEDLFQVNPAELQSNLFGRQRRVQESGLRVRIKQEEHSEQEETDVTMSEATDLNTKQVKIKKRLADLKCCVPSCGRSRLEHGARLFPFPTGKQQHSKWRHNLRLDASDVDKTSQVCSAHFNRRCIDGKQLRGWAIPTQQLGHQEQPIYENPKNIPGFFTPTCALAHCRKRRSIDNDLRTYRYPRSEELLEKWRVNLRLAPDQCRGRICADHFEPIVRGKLKLKTGAVPTLKLGHDEGVVFDNEAIKAGLQLDGAGEPEVEEGETSRESLIKIKKEKLEPDEEEAEVEQTQLDNGDDDDEQEEDVDADEDDDHGYFDPLELVETFADDNAADDDEGDIPGNDDELLLLPDTPPVKLEHPLPPLLRREKPVNNVTPICCLKHCRKERTATHQLSTFGFPKDRQQLRKWSINLQIAESDCVGRVCIEHFEAEVLGTRKLKQHAVPTLNLGHDTPLIYRCNGQAQPMEGIFDEQPQHSVFRLWSLKHCRKRKLEAMEPPDHHHHQRAVIKQEMEEEEEQEDGLSCCLPHCGMRANDVQLHRLPSNRIRLRKWLHNLNLPQSIPIDSQTRVCSEHFDVQPSEALEDCMPTLKLGHNDINIYRNQNSFASSSCLVPSCPCARLNLYRGYDLPEHHLVQQAWLQFLRQPMPQLPADGQLCVMHYMQLYEQVSLPQEVPATVLRQLQETYEQIANSTMAMKLRCAVPGCYSKYTDNIRLTKLPTCSDMFAKWLHNTKIKYDANRHYIYRICMLHFEARCLGPVRPKLWAVPTLQLHHNDANIYRNPKLDGSGQEVQQVFPQPVPMPVELPLRIKTELAYTGSPSASASPSPRGKLRICCIPGCAQQANSLVRLFRFPTSETMLLKWLVNTQQKPRLADPQHLFVCQDHFETEAICKKQLSSWAVPTLKLGHDGHIIPNAKHNGNIADTHDNRHTLQFIWANYCSVLDCFEPRSEQLRLFAYPTDRPTIRKWAANCKHRSMQASSDGFQVCQSHFEPHCFDPETGELRVDAVPTLELSRGINKMGCMVPGCVKDENGPRRRLFKMPKRSSQLLDWCHNFQLDSSTISIEKSVVCDRHFESHCLNVHKQLRRGARPTLHLGHTADVEILQNPSNWARCTEVPETPADVCCVPNCGRYKDAEEEDTDMQLFSFPKVRTLAEKWLRNIRLEANKEQLAELKVCNAHFEIYCLENGRPQLGAMPTLQLGHDDHHDIHRSSCPTSVSASKMKRYCNRNGYSYDCCFPQCVELQKSYLRISYNLPQSQAMRNAWIAYMELEEPKEQENEQLLKLCPLHLIILYEHSVDNFPEHTTEELLEDNYAAARNSVRIRIISCAVRGCKTLKTRDGGCLHGLPQRRDILQMWLHNMQLVFYEQQRYMYKICSKHFEPNCFTDTTRRLKPWSMPTLELPVPEPGEPEVYQNPTEAEWQQMNEQWAAEQLQIAQPMMEDEEHEEVVQVKLEPETEMEPDNLLAYEEEDYSQPPVDGEEDASSQQPLEMQPLEVLLEVGHVEKCRTYEQMDTKANLSYAEQQAGPLPSNGIFASNGNKYTARNCSVQGCHVTLNDICGNVKLHKFPTSWKAMQKWMHNTQVKVCRSVSWRFRICSYHFMEDCFQGSRLRRGAMPTLQLGPKRPSHIYESEFNVGDADEEQPIEESSEQTPKSRVVGGDNISLCLPSPAPPRKSSKFCQVDGCPNHLTSENLTLHKFPHSPDMCAKWQHNTQVPFDPVFRWRYRICSAHFEPICLLNMRLLHGSVPTLHLGPRAPQQLFDSDCEAISMRLDKQKSSSEQQLYIKQEHEEDHDDAEQDEDEFSFLVPEMQLHEDAGEHEANIKTEPSQTNNGRWKDLRLPSIKQEDTPTMTCYNPVKSGYDKCSLVHCQRQRSLHGVHIYKFPRSHQLQQHWMHNLRIRYDERRPWKTMICSVHFEAQCIRLRKLRPWAVPTLELGDNVPQEIFSNEQSRQHPRGDGIKLFRLPNRLSDIHKWEVATGMHFTESQRNTKLICSRHFDPQLIGVRRLMYNAVPTLHLRPETAREERMPPRPRPAGPRCFMPSCPQDLHQKLHKFPSDPQQLQEWCAVLTVSETQDFSSLYICDDHLPTLPRSCLICSVEELQLPLLNFPEHRNQRAKWCHNLKIDTIPKWDNSKHICSRHFESYCFIQPGQLVPEAAPTLHLKHTDSNIFLNDYAMESSKLMLRIKDEPLESDDLML

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01558329;
90% Identity
iTF_01558329;
80% Identity
-