Basic Information

Gene Symbol
-
Assembly
GCA_018150735.1
Location
JAECWO010000027.1:515983-530931[-]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 32 4.8 1.2e+04 -2.9 3.3 40 62 342 367 332 383 0.58
2 32 3.5e-15 9.2e-12 45.6 4.3 1 86 593 665 593 666 0.85
3 32 9e-15 2.3e-11 44.3 5.2 1 87 693 762 693 762 0.83
4 32 2.6e-15 6.8e-12 46.0 0.3 1 87 785 857 785 857 0.84
5 32 5.4e-16 1.4e-12 48.2 5.5 1 87 961 1031 961 1031 0.82
6 32 4.6e-15 1.2e-11 45.2 3.2 1 86 1055 1126 1055 1127 0.82
7 32 6.3e-13 1.6e-09 38.4 0.6 1 87 1162 1230 1162 1230 0.80
8 32 7e-11 1.8e-07 31.8 1.6 1 86 1274 1343 1274 1344 0.77
9 32 1.7e-16 4.4e-13 49.8 0.4 1 86 1371 1440 1371 1441 0.83
10 32 7.6e-13 2e-09 38.1 1.7 1 86 1462 1531 1462 1532 0.80
11 32 2.8e-15 7.2e-12 45.9 2.0 1 86 1559 1630 1559 1631 0.85
12 32 2.6e-13 6.8e-10 39.6 1.4 1 85 1707 1775 1707 1777 0.84
13 32 2.8e-12 7.2e-09 36.3 0.1 1 86 1801 1869 1801 1870 0.82
14 32 1.1e-13 2.7e-10 40.8 2.4 1 87 1995 2064 1995 2064 0.80
15 32 2.7e-10 6.9e-07 30.0 0.1 1 86 2150 2216 2150 2217 0.82
16 32 0.0011 2.8 8.8 0.1 1 59 2236 2284 2236 2306 0.73
17 32 4.6e-13 1.2e-09 38.8 0.6 1 86 2313 2382 2313 2383 0.84
18 32 2.3e-13 5.9e-10 39.8 1.7 1 87 2444 2514 2444 2514 0.83
19 32 5.1e-13 1.3e-09 38.7 0.9 1 86 2549 2620 2549 2621 0.81
20 32 3.5e-12 9.1e-09 36.0 0.4 1 87 2633 2706 2633 2706 0.80
21 32 1.3e-14 3.4e-11 43.8 0.6 1 86 2732 2804 2732 2805 0.81
22 32 1.8e-07 0.00046 20.9 0.4 1 58 2831 2881 2831 2903 0.84
23 32 1.4e-13 3.6e-10 40.5 0.2 1 87 2919 2991 2919 2991 0.83
24 32 1.6e-14 4.1e-11 43.5 0.3 1 86 3043 3114 3043 3115 0.81
25 32 0.00022 0.57 11.0 0.1 1 58 3146 3195 3146 3210 0.79
26 32 4.5e-13 1.2e-09 38.8 0.3 1 87 3233 3305 3233 3305 0.82
27 32 7.4e-15 1.9e-11 44.6 0.4 1 87 3450 3523 3450 3523 0.82
28 32 1e-11 2.6e-08 34.5 3.2 1 85 3590 3659 3590 3661 0.79
29 32 7.3e-15 1.9e-11 44.6 5.4 1 86 3765 3835 3765 3836 0.85
30 32 4e-13 1e-09 39.0 0.1 1 86 3915 3984 3915 3985 0.85
31 32 1.9e-11 4.9e-08 33.6 0.6 1 58 4011 4060 4011 4065 0.86
32 32 4.7e-11 1.2e-07 32.4 0.4 18 87 4077 4136 4066 4136 0.76

Sequence Information

Coding Sequence
ATGTCACAACATAATCCACATTATCATCACCCCCATCCGCACCCTCTGCACTATCAACAACAACAACAACAACAGCAGCAGCAGCTGCACCACCATCTTTCCCCTCTTCAGCAGCAACAACATAAACAAATACAACACAGCAATTGGTATTCACATGTTGCTTCCACCTCTTCCACCTCCTACCCGCATCACCCCTCATCAGCAGCCACCTCATCCTCCTCTTCGGTGGTGGCGGCGGCAGCGGCATCAACTTCAGGCTCTAACAACAATCACATAATGAATGCCTATGGAACACATGGATATTATGGTGCCGCTGGCGGTGGCCTCAATGTCAATGCTGTGGGTGTTGGTGTTGGGGGTGGTGGGGGGAATTCAAACAGTTATANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTGGGAGGTCATCATCAACATCATCATCATGGTATATATCCCTATATCAAAAGTGAACCCATGGAATATAGCCATAATACAATGGCTCCACCGCCAGCACCTACTGGACCCAACACAGAGATGAGAATTAAATCGGAACCCATTGACGAACTTGCCTACAAATCGTCCAACTATATTGATGATAATACTCCATTTGCTGATTTTACTAAATATAATGAATTTAGCGAAAATATGTTGAATCCCAAAGTGGAATTGACTGTAAAAAATGAGTCGTCCTATGGCAAGAATATTAACAATTATCCAAGACGTAAATTACAAACGGAACGTTCTTCAGAGAATTTACCCATTTGTCAACGATGTAAAGAAGTCTTCTTCAAGAAACAATCCTATCTACGCCATGTGGCCGAGAGCAGTTGTAGCATCCATGAATATGAATTCAAATGCAACATTTGTCCCATGTCCTTTATGAGTGGCGAAGAGCTGCAAAGACATAAACATCTCCATCGAACAGATAAATTCTTTTGTCATAAATATTGTGGAAAACATTTCGATACCATAGCCGAGTGTGAATCCCATGAGTATATGCAGCATGAATATGATAGTTTTGTTTGTAATATGTGTTCGATGACATTTGCCTCGCGGGAGCAGCTTTATACCCATTTACCGCAGCATAAATTCCAGCAGCGTTATGATTGTCCCATTTGTCGCTTGTGGTATCAGACGGCTCTGGAGCTGCATGAGCATAGATTGGCGGCACCGTATTTCTGTGGTAAATACTACAATACGGCACATCAGTCACAGCAGCACCAGCAGCACCATCACCCACAACAACAACAACAGTCGAATCAGGCAAATTATAAATTACAGGATTGTCATATGGCTACAATGGAAATGCCCACAGCGACGCCACCATCGGTGTCCAACGCAAGCAGTTCATCCTCAGCCTTACCAGCAACGGCGGCGTTAAGTTCGCTGCTTCAACAGCGTCAGGCTAATGCCGATGGAGCGGCTATGTTTGCTGCCGCTGCCTCCACCTCTACTACGTCACATAAGAGCGACGTGAATGTGAAGCTTGAACGCAGCTATAGCAATTCGACAAGTGATTCGTCATTTGGAATGCACGAGTCCTCCAACTATAATAATAATAATGCTTATGGCAGTGATAATTCCATTCATGGGGCAGGAGCCATTGGTGGTCCCCAAGCTCATTCCTCAACGCTGGATGACTCCGAGGATGCTCTATGTTGTGTACCCATGTGCGGGGTAAGCAAGAGCACTAGTCCCACTCTCCAGTTCTTCACATTCCCCAAAGATGATAAATACCTCCATCAATGGTTGCACAATTTGAAGATGTTTCACATACCCGCCTCAAGCTATACGACTTTTCGCATCTGTAGCATGCATTTCCCGAAACGTTGCATCAATCGGTATTCGTTGTGCTATTGGGCAGTGCCTACATTCAATCTGGGTCATGATGATGTTGCCAATCTCTATCAGAATCGCGAGCTAACCAATACCTTTACCACTGGCGAAGTGGCACGCTGCAGCATGCCACACTGTAACAGTCAGAGGGGCGAGAGTAATCTCAAGTTCTACAACTTTCCCAAGGACATTAAAAGTCTGATCAAATGGTGTCAGAATGCTCGTCTGCCGGTTCAGGCCAAGGAGCCCCGGCACTTTTGTAGCCGTCACTTTGAAGAGCGATGTATTGGCAAGTTTCGGTTGAAACCGTGGGCAGTGCCCACACTACATCTCGGTGGTGCGCAATATGGCAAGATCCATGACAATCCAAAGAATTTGTATGTGGAGGAGAAGCGTTGTTGCCTTAATTTCTGTCGCCGTAGCCGCTCAACGGATTTCAATATGTCCCTATATCGTTTCCCAAGGAATGAGGTCTTATTACGACGCTGGTGCTATAATCTACGTCTCGATCCGGGTGTCTATCGCGGCAAGAATCACAAAATATGCAGTGCTCATTTTATTAAAGAGGCATTGGGTTTACGCAAATTGTCGCCAGGCGCTGTACCTACACTTCATTTGGGTCATAATGATACATTCAATATCTATGAAAATGAATTGTGGCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNATGGGAAGCGGTGGATATTTAAGCATGGGTGGTGCGAACTCCCTATCCGGTGGAATGAATGTTAGCGATAGCATGGACATTTGTTGTGTGCCAAGTTGTGAGAGCAAACGACATAACAGTGAGAATATCACATTCCATACGATACCCAGAAGGCCCGAACAGATGAGGAAGTGGTGTCACAATTTAAAAATACCCGAAGACAAAATGCACAAGGGAATGCGGATTTGCAGTCTACATTTCGAACCGTATTGCATTGGCGGATGCATGCGTCCATTTGCAGTGCCCACTCTACACTTGGGTCACGACGATACGGACATACATCGGAATCCGGATGTGATTAAGAAGCTAAACATCAGGGAAACATGCTGTGTGGCAGTTTGCAAGCGGAATCGTGATCGTGATCATGCAAATCTCCATCGATTCCCCAGCAATGTGGCCCTCTTGTCGAAATGGTGTGCGAATCTGCAACGACCTGTACCCGATGGCAGTAAACTCTTTAACGATGCCATTTGCGAAGTCCATTTCGAAGATCGTTGTTTGCGTAACAAGAGATTGGAGAAATGGGCGGTGCCCACCTTAATATTGGGTCATGAGGACATTGCATATGAATTGCCCACATCCGAGCAAGTGGCTGAATTCTATGCGCGTCCAAATGCCCCAAATAACGGCGAAGAGCAGGGCGAATGCTGTGTGGAAAGCTGTAAACGTAATCCCAGTGTGGATGATATCAAACTATATCGACCACCTGAAGAATCAGATGTTCTTGCAAAATGGGCTCACAATCTGGAATTGGATGTCGCTGATTTACCAAACATGAGAATATGCAATTTACATTTCGAATCCCATTGCATTGGCAAACGGATGAGACCGTGGGCCATACCCACATTGAATCTATCTTCGAACATAGAGAATCTCTATGAGAATCCAGAGCACTCCATGTTGTACAAGAGGAGAACGAAGAGGGATCCAAATCGAGATTTATCGCTGGGCACAGCGACGAAACCCACTTGGGTGCCCAGATGCTGTTTGCCTCATTGTCGCAAGGTCCGAGCTCTCCACAATGTTCAGCTCTATCGTTTCCCCAAACTCAATCGGTCTACATTGGCCAAATGGGCTCACAATTTGCAAGTGCCAATGGTGGGCAGTGCTCAGAGAAGACTATGTTCAGCTCATTTCGAACCGCATGTCTTGAGTAAAAAGTGTCCAGTGCCCTTGGCCGTGCCCACCATTGATTTAAATGCACCGCCTGGCTATAAAATCTATCAGAATCCAGCCAAACTCAAGGCGACAAAATTGTGCCTGCAGAGAGTTTGCATTGTGGAGAGCTGCCGACGCACAAGGGCTCAGGGAGTGCAACTCTTCCGTTTGCCTCACAGTCCCACGCAGCTAAGAAAATGGATGCACAACATAAAGACTCGTCCACGGGCGGCTACAAGGACCCAGTATCGCATCTGCTCAATACATTTTGAGTCGCATTCGTTTAATGGCAAGAGATTGAGTGCTGGAGCCATTCCCACACTGCAATTGGGTCATGACGATGATGATATCTATCCGAATGAGGCTCAAGCTTTCGTGGATGAGCATTGTGCGGTCGAGAGTTGTGAATCATCGAAAGATCAACCCGAAGTACGTCTCTTCCGTTTCCCAACCGAAGATGATGATTTACTCTGGAAATGGTGCAATAATCTTAAAATGAATCCAGTCGATTGTGTGGGTGTCCGGATATGTAATAAGCACTTTGAAGGCGATTGCATTGGTCCCAAGCACTTATTCAAGTGGGCCATACCTACGCTGGAACTGGGACATGATGATGCCCAAATCGAACTCATTCCGAATCCCAAGCCAGAAGAGCGATATGTAGATCCAGTGTTTAAATGCTGTGTGCCCACTTGCGGCAAGACCAGAAAATTCGATGAGGTGCAAATGAATAGTTTCCCCAAGGATCCGGTGCTCTTCCAGCGTTGGCGACACAATCTTCGGCTGGAGCACCTGAACTTCAAGGAACGAGAACGTTATAAGATTTGCAATGATCATTTTGAGGATGTTTGCATTGGTAAGACCCGGCTGAATATAGGCTCCATACCCACCATTCAATTGGGTCACGACGAGACGGAGGACCTCTACCAGGTGGATCCCGCAGACCTGCAAAGCAATCTTTTCGGTCGACCTCGTAGATTACATGGATCAGTTGACATTAAGGTCGAACTACTCGAAGAGAACGAACAGGAGGATGTGAAACCAAATATCTATGCTATGGCTGAAGCCACCGATATGAACACCAGGCAGGTGAAGATTAAGAAATCTCTCGCTGATCTTAAGTGTTGTGTGCGAAGCTGTGGACGCAGTCGCCTGGAGCATGGAGCCCGTCTCTTTCCCTTTCCGAGCGGCAAGCAGCAGCAACTGAAATGGCGGCACAATCTCCAACTTGAACCGGATGAGGTGGATAAATTGACACGCGTCTGCAGTGCTCATTTCAATCGTCGTTGCGTTGATGGAAAACAACTGAGGGGATGGGCCATGCCCACTCAACAATTGGGCCATCATCAAGAACAGCCCATCTATGAGAATCCCAAGAATATACCGGGTTTCTTTACACCAACATGTGCCCTCAGTCATTGTAGGAAAAGGCGAAGTATTGATAATGACTTGCGCACCTATCGCTATCCGAGGAGTGAGGATCTATTGGAGAAATGGCGAGCCAATTTGCGATTGGCTCCCGATCAATGTCGCGGACGGATTTGTGCCGATCACTTCGAGCCTCTGGTTAGGGGCAAGTTGAAATTGAAGACGGGGGCAGTGCCCACTCTTAAATTGGGTCACGATGAAGATTTAGTCTATGACAATGAAGCCATTAAAGCTAGCATGGCGGATGAGGAAGATGGCAGCATAGAATCAACACCACCGCAAATAATACCGAAAAAAGAAATTTTGGAAGATGAAGACGATGAGGATGGTGCCCCCCAAGAGGGGGAGAATGAGGAGGATGATGATGATCCACCCCAAGAAGCAGATCAGGACGATTCACATTCTGATTATTTTGATCCCTTGGAACTGGTAGAGACATACGCCGATGATGCAGTACAAGAAGATGAATATACATCTCCTCTTCTCCCGCCACCCCCGTCATTAGCCGCTCCTCCTACTGCACGGCGTGAGAAACCGGCGAATAATGTAACTCCGATTTGTTGTCTAAGGCATTGCCGTAAGGAACGCACACCCACTCATCAGTTGAGTACTTTCGGCTTTCCCAAAGATCATCAACTGCTGCTGAAATGGTGTGCCAATCTTCACCTGGAACCAGTGGACTGTGTGGGACGCGTTTGCATTGAGCATTTCGAAGCGGAAATGTTGGGAACGCGCAAGCTTAAGCAGAATGCAGTGCCCACCGTAAATGTGGGTCATCAGATGCCCTTACCCTACACGTGCAACGGACAGGAGCGAAGCGATGAGGACGAGGATCATTCGGATTTTCGGCTTTGGAGCCTGAAACATTGTCGCAAGAGGAAGCTAACGGAACCACCAGACATTCGCCCAAAACTGGAGAAGAACGAGGTGATTCCAATGATGATGATGAGTATGGGAGTGAGAGTGAAGAAGGAGAAAATGGAGGATGGGGAAGAACTGGAGATGATGACTAAACCAAAGAAGTGTTGCCTTATCCAATGCGGAAAAGAGATGAACTTGCAAAAATTCCCAAGAGATTTCCATTTGCTTCGCAAATGGTTACACAATCTGAAGTTAAACCTAGACGAGTATTTGGATCCTACCGCACTTCGTGTGTGCTTGGACCACTTTGAGCCGCATTTAGTGCCTAATGGTCAACTATTGAGAGAGGCATTGCCCACTCTCAAATTGGGTCATCAGGATACGAATATTTACCAGACCACTGTGGCAACTTCGGGAGGTTGTTTGGTGGCCAGTTGTCCGTGTGCTCGTCTCAATTTGTATCGAAGTTATGCTTTACCCAAGAATCCCCACATTAAAGAGGTCTGGTTAACATATCTGAAGCTTCCATTCACTACCCAAGGACAGTTGTGTGTAATGCACTTTATGCAGCTCTATGAGGAGATGCCCTTTAAGGAGCTGCGACATATCTACGAGACCATTGCCAACTCCACACAAGCCCTGAAACTGCGTTGTGCCGTCCCTGGATGCCATTCCAAATACACGGATAATATACATTTGACAAAGCTACCGCTGAATAAGAACCTACTCCATAAATGGTTGCACAACACCACGTTAAACTATGATCCCACTAAGCATTCGATATATCGTGTTTGCCTGCTGCACTTTGAGCCGCACGCCTTAGGCCCGGCATGCCCGAAGCCCTGGGCAGTGCCCACCTTGGAATTGAATCATCAAGATGACATTTACTTTAATCCCACAAAAGAGGAAATGGTTAATCTAACCAATGTTCCGTTGCAAATTAAAACGGAATTAACTCTGCCGTTGCGAATAAAAACCGAACTTGCCGCCTTGAGCAGTCCCAGCATTGGTTCCACTCCGAGTCCAAGGGGCAAGGTGCGAATTTGCTGCATACAGTCGTGTCAGCAACAGGCCAATTCCCAGTTGCGTCTCTATTGGTTTCCCAATGCGGAGACCGCTCTGCTCAAGTGGCTGGTCAATACGCAACAGCAACCACGTCTGGTGGATCCCCTGCAGTTGTATGTCTGTCAATCTCATTTCGAACCCGAAGCCATTTGTAAGAAGCAGCTGAGAAGTTGGGCAGTACCCACCTTGAATTTGGGTCACGATGGTTATGTTATACCCAATGCCAGGCACAATGGAAATATTGCCGATAGCCAGGAAACGGAACATGCAATGGAATTCATCAGGGAGAACTATTGTTCCGTACTCACGTGCTTTCAGCGGAAGAGTGAAGCTGTGCGTCTGCATGCCTATCCCAAGGATATGCCAACTATACGAAGATGGGCAGCCAATTGCAAGCATCGATCCATGCAGGCCAGCAGTCATGGATTCAAGGTCTGTCAATTGCATTTTGAATCAGAATGCTTTGATCCGGATACTGGAGACTTACGTGAAGGATCTGTGCCCACTCTGGATCTAACAGTTAGTCGGCTGAGCAACGAATCGCGTTGCCTGGTCGCTGGTTGTGTGAAGGATGAGTCCCAGCCGCGACGACGTTACTACAAATTGCCCAAGCGGCCAGCTCTGCTCAATGATTGGTGCGTGAATCTCGCTCTGGATCCTTCTGGACTGCCCCAAAATGCTGATCATAATATATGTGAACGACATTTCGAATCTCGCTGCTTCAATAGCTACAAACAATTGCGTACTGGAGCACGACCGACATTGCATTTGGGTCACACTCAAGATATCAAGTTGCTACCCAATCCGGAGAGTTTCAGTGACGAGGCGGAAGATATCGGGCTCTGCTGTGTGCCGCAATGTGGTGGCTCCAAGCAATTGGATGATTTAATTCAACTAAGCCATTTTCCCCGAATGCGTAAGCTGGCTGAGAAATGGATACATAATTTACATCTTCCTTCCTTTAACCGGGATCAGTTGGCCAAGCTTCGCGTGTGTCATAGGCATTTTGATGCAACTTGTTTTGAAAATGGCCAATTGCGACAGGGAGCCATGCCCACCATGGAGTTGGGTCACACGGATGCGGACATTTATCAAACAGATGAACCAAATTTGGGCAAGCTTCGAAAGCCCGGACTGGATTGCTGTTATCCTCAGTGTGTCCAATTGCAGAAGAACTACCAGCGGGTGGTCCACGATCTGCCGAAAGAGGAGAAGCTACGTCAGCGATGGCTTCAGCATTTAGAAATCGAAAATACAGAGGAGCGACCGTTGAAATTGTGCCCGCTCCATTATATTATCCTGTACGATCATAGTGTGAAAAACTTTGAAGAACACGGTCCGGATGATCTGCTCGAAAAGAACTATGACGATGCACGAAACGGTTCGAGAATCAGGCTTATCAGTTGTGCGGTTCGCGGATGTGGAACTCTTCAGCCGCGTGATGGCGGCAGGCTGCATGGTCTCCCCACAAATCCTGAGGTCTTCCAGATGTGGTTAGAGAACACCGAGCTGGTCGTCTATGAGCCGCAGCGTTATATGATTAAAGTGTGCAGTAAACACTTTGAACCTCAGTGTTTTACCGATATTCGCAAATTGAAATGCTGGAGTGTGCCGACACTTCATCTACCCGGTGAGATAGTGCATCAAAATCCCACCGAAGAGGAATGGCAAAAGATGACCGAGCGATTGGCCGTTGTACCCGTCACTCAGGGAGTGGATGCAGGCGACGACACTTTGTTGCTGGAACCGGTCGTTATTATGGAAGAGAACTCTGTCTGCTGTGTCCCCAACTGTGGACGCTCCAAGCAGACAGATGAGTCAACTCAATTCACTAGTTTCCCCAAGATAAACATTCTGGCCGAGAAATGGATGCATAACTTCCATCTGAAGGTGGGCAAAGATCAATTGGGCAATCTTCGGGTGTGTTATCGACATTTTGAGGCATCTTTGATAGAAAATGGACGCTTACGTCGCTTTGCCATGCCCACCCTGGAGTTGGGTCATGAGGATAGCGAAATCTATCACACAGAGGAGCCAGATCTCAACAGGGTGCGAAAGCAGCCCAAGAGATCCAGTGGCCAGGGCTGTTCTTATCCCCAGTGTGTGGAACTGTTGAAGAATTTCCAGCGAATGGTCTATGATCTGCCGAAGGAACCGCAACTGCGAGAATGTTGGCTTCAGTATATGGAATTGACGGAAGAGGAGCAACCACTGAAGTTGTGCCCACTCCACTACATAATTCTCTATGATCATAGTGTGAAAAACTTTGACGCACATGCTCCGGAACAGCTGCTCGACTATAACTATGAGAATGCTAGGAATTGTGTACGTATCCGAATTATCAGCTGTTCGGTTCAAGGATGTAATACACTCCAGCCACGCGATGGCGGACGAATGCATGGTCTGCCGCCGAGATCGGATATCCTCCAGATGTGGCTGGACAATACTAGATTGCTGTTCCATGAGCATCAGCGTTACATGCTTAAAGTGTGTAGCAAACACTTCGAGCCCAAATGTTTCACTGACATTCGCAAATTGAAGAGCTGGAGTATTCCGACCCTTCATCTACCCGAAGAGCCAGTGCATCAGAATCTCACAGAAAAGGAATGGCATCAGATGAATGAGAAATTTGCCGAGCCCATTAATCGGGAAGCGGAAAGTTTCGATGACAATTCAATGCTGGAGCCCATTGTTATGATGGAACATGCCGAATCTGATGGCGAAATGTTGGAGGGGGAGGAGACGGGACGAATCCCTCACACAGAATTTGTGACCAATGATCATTTGCAGGAAGATTCCCAAGATGTGGGTGATGAAGAGATGCAGGCATTGGAAGTCCTTCTCGAAGTCGGTCATGTGGAGAAATGTTCCAGCTATGAAAAAATGGACAACAAATCCCATTTGCCTTACTCCGAGACAAGGCCATTGAGTCCTTCGATTGCTTCTGTGCCTCCTGGACAACGCGGCGGTGGTGGTGGTCATTACAATGCCCGCCACTGCAGTGTCCAGGGTTGTCAGATAACTGCTCATGATGTGGACGGCAATATCAAGCTTCACAAATTCCCCTCCTCCACGGAGGCCACCCAAAAATGGATGCATAACACCCAAGTGGATGTGGATGGGAACTATTCGTGGCGCTATCGCATTTGTAGTTACCATTTCGAGCAGGAATGCTTTAATGGGGCCCGCATACGGCGGGGATCTATGCCCACTTTGCATTTGGGTCCCCTCCGACCTAAGGATATCTATGCTAATGAGTTCACACAAACGGATATGGATGAAACTGTTGGAGAAGCGAACCCTAATTTGCCACCTGAGCAGGATGAACATGAACCTGTCGTGGCTCCGCATATACGGGGTCAAGTGACGCAATTGTGTCTGCCACGTCCTGCTCCGCCACGTAAATCAAGTAAATTCTGTCAAATCGATGGATGTTCGAATCACCTGACCAGTGAAAATATGACTCTGCACAAGTTTCCTCACTCGCTGGAAATGTGTGCCCGCTGGCAACACAATACCCAGGTGCCATTCGATCCAGAGTATCGATGGCGCTACCGCATCTGCAGTATTCATTTTCATCCAGTTTGTTTGGTCAATATGCGCTTAGTGCATGGCAGTGTGCCTACCCAGAAACTTGGTCCTCGGGCGCCTGCCCAATTGTTTGACAATGATTTTGAGGCCATTAACATGAGACTGGATAAGCGATCGCACTTGGAGCAGGGAGCTAGGGTGAAGCAAGAGAAGCCGTATTCCCAGCAGCCTGATGAGGGATTCTATCTAGAGCCAGAAATGGAAATGGATGTGGATGAGATGGAAGAGGAGCAAGACCAAGATCAATCACAATCTATGACATCCTTTGAAAATTGGAGGCATCAGCTTCGACTACCGGCCGTTAAGCAAGATAAGACGCCTTATAATCCCATTAAGTCTGGCTACGACAAATGCTCCCTCACGCACTGTCAGCGTCAGAGATCTCAGCATGGTGTCCACATATACAAATTCCCGAGATCGAAGCGCCATCAGCAACGCTGGATGCACAATTTACGCATCCGGTATGATGAGCGAAAACCGTGGAAATACATGATCTGTAGTGTTCACTTTGAGCCGCATTGTATTCGCTTGAGGAAACTACGTCCATGGGCAGTGCCCACTTTGGAGTTGGGTAAGAATGTGGCAGACCAAATCTATACCAATGAACAGTGCAAAGAAATGGCCTCAGATGTCAGTGAGGAAGAGGAGAGTGGACCCGACGAAAGTCTTCTGGAAGATGACGAGGATGAAGCAGATCTAGATGGAGAAACTGGTGTGGAGTCCCACATAAAGCGGGAAAGGCGCTCTTGGGGATCAGGTGGTGCTGCTGGTGGTCAAGCGGCTCCTTGGAAAGTCAAACAATGCTGCTTACCCTATTGCCGCCGACCACGTGGAGATGGCATCAAACTCTTTCGCCTGCCCGGCAATCCAAATTCCATACGAAATTGGGAAAAGGCCACTGGCATGACATTTAAGGCGTCGCAGCGCAACACTCGCTTGATTTGCAGTCGTCACTTTGAGCCTGAGTTGATGGGAGTGCGTCGGTTGATGCGGAATGCGATACCCACGAGACATTTATATCACCAAAGGGAGAGTTATAGCCCAGAGCTGGTGATACCCACAGATACTCCAGCTCCCATTGGTCCCACTTGCTGCATTCCTGATTGCTCTCCACAAGATGGATCGTCTCAACTTCATCGGTTTCCCAGTGATCCACATCAGTTGCAGCAATGGTGCGAGTCTCTAAATCTTACGGATCCTCAACGCTATAGCGGACAATATGTTTGCTCTAATCATCTTCCAGCCCTCGACTTGGGATGTATTATCTGTGGCGTCGAGGATGTGCAATTGCCGCTACTTGATTTTCCCGAGAATCGTAATCATCGAGCAAAATGGACTTATAATCTGAAAATTGATACCATACCCAAATGGGACAACTCCAAGCATATTTGCTCGAAACATTTCGAATCCTATTGCTTTAGCCAGCAAACCGGGGAACTGCATCCGGAAGCAGCGCCCACACTGCATTTAAAACACAATGATTCGAATATATTCCTCAATGATTATGCCATAGATCAGCCCTGTATGATGCGAATTAAAGATGAGCCCTTGGACAACGATGAAATGTTGTTGGCTTAA
Protein Sequence
MSQHNPHYHHPHPHPLHYQQQQQQQQQQLHHHLSPLQQQQHKQIQHSNWYSHVASTSSTSYPHHPSSAATSSSSSVVAAAAASTSGSNNNHIMNAYGTHGYYGAAGGGLNVNAVGVGVGGGGGNSNSYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGGHHQHHHHGIYPYIKSEPMEYSHNTMAPPPAPTGPNTEMRIKSEPIDELAYKSSNYIDDNTPFADFTKYNEFSENMLNPKVELTVKNESSYGKNINNYPRRKLQTERSSENLPICQRCKEVFFKKQSYLRHVAESSCSIHEYEFKCNICPMSFMSGEELQRHKHLHRTDKFFCHKYCGKHFDTIAECESHEYMQHEYDSFVCNMCSMTFASREQLYTHLPQHKFQQRYDCPICRLWYQTALELHEHRLAAPYFCGKYYNTAHQSQQHQQHHHPQQQQQSNQANYKLQDCHMATMEMPTATPPSVSNASSSSSALPATAALSSLLQQRQANADGAAMFAAAASTSTTSHKSDVNVKLERSYSNSTSDSSFGMHESSNYNNNNAYGSDNSIHGAGAIGGPQAHSSTLDDSEDALCCVPMCGVSKSTSPTLQFFTFPKDDKYLHQWLHNLKMFHIPASSYTTFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPHCNSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSTDFNMSLYRFPRNEVLLRRWCYNLRLDPGVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMGSGGYLSMGGANSLSGGMNVSDSMDICCVPSCESKRHNSENITFHTIPRRPEQMRKWCHNLKIPEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLHLGHDDTDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVALLSKWCANLQRPVPDGSKLFNDAICEVHFEDRCLRNKRLEKWAVPTLILGHEDIAYELPTSEQVAEFYARPNAPNNGEEQGECCVESCKRNPSVDDIKLYRPPEESDVLAKWAHNLELDVADLPNMRICNLHFESHCIGKRMRPWAIPTLNLSSNIENLYENPEHSMLYKRRTKRDPNRDLSLGTATKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPMVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTIDLNAPPGYKIYQNPAKLKATKLCLQRVCIVESCRRTRAQGVQLFRLPHSPTQLRKWMHNIKTRPRAATRTQYRICSIHFESHSFNGKRLSAGAIPTLQLGHDDDDIYPNEAQAFVDEHCAVESCESSKDQPEVRLFRFPTEDDDLLWKWCNNLKMNPVDCVGVRICNKHFEGDCIGPKHLFKWAIPTLELGHDDAQIELIPNPKPEERYVDPVFKCCVPTCGKTRKFDEVQMNSFPKDPVLFQRWRHNLRLEHLNFKERERYKICNDHFEDVCIGKTRLNIGSIPTIQLGHDETEDLYQVDPADLQSNLFGRPRRLHGSVDIKVELLEENEQEDVKPNIYAMAEATDMNTRQVKIKKSLADLKCCVRSCGRSRLEHGARLFPFPSGKQQQLKWRHNLQLEPDEVDKLTRVCSAHFNRRCVDGKQLRGWAMPTQQLGHHQEQPIYENPKNIPGFFTPTCALSHCRKRRSIDNDLRTYRYPRSEDLLEKWRANLRLAPDQCRGRICADHFEPLVRGKLKLKTGAVPTLKLGHDEDLVYDNEAIKASMADEEDGSIESTPPQIIPKKEILEDEDDEDGAPQEGENEEDDDDPPQEADQDDSHSDYFDPLELVETYADDAVQEDEYTSPLLPPPPSLAAPPTARREKPANNVTPICCLRHCRKERTPTHQLSTFGFPKDHQLLLKWCANLHLEPVDCVGRVCIEHFEAEMLGTRKLKQNAVPTVNVGHQMPLPYTCNGQERSDEDEDHSDFRLWSLKHCRKRKLTEPPDIRPKLEKNEVIPMMMMSMGVRVKKEKMEDGEELEMMTKPKKCCLIQCGKEMNLQKFPRDFHLLRKWLHNLKLNLDEYLDPTALRVCLDHFEPHLVPNGQLLREALPTLKLGHQDTNIYQTTVATSGGCLVASCPCARLNLYRSYALPKNPHIKEVWLTYLKLPFTTQGQLCVMHFMQLYEEMPFKELRHIYETIANSTQALKLRCAVPGCHSKYTDNIHLTKLPLNKNLLHKWLHNTTLNYDPTKHSIYRVCLLHFEPHALGPACPKPWAVPTLELNHQDDIYFNPTKEEMVNLTNVPLQIKTELTLPLRIKTELAALSSPSIGSTPSPRGKVRICCIQSCQQQANSQLRLYWFPNAETALLKWLVNTQQQPRLVDPLQLYVCQSHFEPEAICKKQLRSWAVPTLNLGHDGYVIPNARHNGNIADSQETEHAMEFIRENYCSVLTCFQRKSEAVRLHAYPKDMPTIRRWAANCKHRSMQASSHGFKVCQLHFESECFDPDTGDLREGSVPTLDLTVSRLSNESRCLVAGCVKDESQPRRRYYKLPKRPALLNDWCVNLALDPSGLPQNADHNICERHFESRCFNSYKQLRTGARPTLHLGHTQDIKLLPNPESFSDEAEDIGLCCVPQCGGSKQLDDLIQLSHFPRMRKLAEKWIHNLHLPSFNRDQLAKLRVCHRHFDATCFENGQLRQGAMPTMELGHTDADIYQTDEPNLGKLRKPGLDCCYPQCVQLQKNYQRVVHDLPKEEKLRQRWLQHLEIENTEERPLKLCPLHYIILYDHSVKNFEEHGPDDLLEKNYDDARNGSRIRLISCAVRGCGTLQPRDGGRLHGLPTNPEVFQMWLENTELVVYEPQRYMIKVCSKHFEPQCFTDIRKLKCWSVPTLHLPGEIVHQNPTEEEWQKMTERLAVVPVTQGVDAGDDTLLLEPVVIMEENSVCCVPNCGRSKQTDESTQFTSFPKINILAEKWMHNFHLKVGKDQLGNLRVCYRHFEASLIENGRLRRFAMPTLELGHEDSEIYHTEEPDLNRVRKQPKRSSGQGCSYPQCVELLKNFQRMVYDLPKEPQLRECWLQYMELTEEEQPLKLCPLHYIILYDHSVKNFDAHAPEQLLDYNYENARNCVRIRIISCSVQGCNTLQPRDGGRMHGLPPRSDILQMWLDNTRLLFHEHQRYMLKVCSKHFEPKCFTDIRKLKSWSIPTLHLPEEPVHQNLTEKEWHQMNEKFAEPINREAESFDDNSMLEPIVMMEHAESDGEMLEGEETGRIPHTEFVTNDHLQEDSQDVGDEEMQALEVLLEVGHVEKCSSYEKMDNKSHLPYSETRPLSPSIASVPPGQRGGGGGHYNARHCSVQGCQITAHDVDGNIKLHKFPSSTEATQKWMHNTQVDVDGNYSWRYRICSYHFEQECFNGARIRRGSMPTLHLGPLRPKDIYANEFTQTDMDETVGEANPNLPPEQDEHEPVVAPHIRGQVTQLCLPRPAPPRKSSKFCQIDGCSNHLTSENMTLHKFPHSLEMCARWQHNTQVPFDPEYRWRYRICSIHFHPVCLVNMRLVHGSVPTQKLGPRAPAQLFDNDFEAINMRLDKRSHLEQGARVKQEKPYSQQPDEGFYLEPEMEMDVDEMEEEQDQDQSQSMTSFENWRHQLRLPAVKQDKTPYNPIKSGYDKCSLTHCQRQRSQHGVHIYKFPRSKRHQQRWMHNLRIRYDERKPWKYMICSVHFEPHCIRLRKLRPWAVPTLELGKNVADQIYTNEQCKEMASDVSEEEESGPDESLLEDDEDEADLDGETGVESHIKRERRSWGSGGAAGGQAAPWKVKQCCLPYCRRPRGDGIKLFRLPGNPNSIRNWEKATGMTFKASQRNTRLICSRHFEPELMGVRRLMRNAIPTRHLYHQRESYSPELVIPTDTPAPIGPTCCIPDCSPQDGSSQLHRFPSDPHQLQQWCESLNLTDPQRYSGQYVCSNHLPALDLGCIICGVEDVQLPLLDFPENRNHRAKWTYNLKIDTIPKWDNSKHICSKHFESYCFSQQTGELHPEAAPTLHLKHNDSNIFLNDYAIDQPCMMRIKDEPLDNDEMLLA

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00604131;
90% Identity
iTF_00604131;
80% Identity
-