Basic Information

Gene Symbol
-
Assembly
GCA_018152915.1
Location
JAECXQ010000438.1:2990717-3004600[-]
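
As an illustration of how the Location field can be read programmatically, the following minimal Python sketch splits it into contig accession, start, end and strand. The field layout (`contig:start-end[strand]`) is inferred from the single value shown in this record, the names `Location` and `parse_location` are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a parsed Location field."""
    seqid: str
    start: int
    end: int
    strand: str


# Assumed layout: <contig accession>:<start>-<end>[<strand>]
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a Location string into its four components."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        seqid=match["seqid"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("JAECXQ010000438.1:2990717-3004600[-]")
# Genomic span covered by the locus, ASSUMING 1-based inclusive coordinates.
span = loc.end - loc.start + 1
print(loc, span)
```

The span is the genomic extent of the locus, not the length of the coding sequence, since the locus may include introns.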

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) that probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-terminus of the protein, its size of about 90 residues, a C-terminal AVPTIF box, and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably in worms and flies, but not in other eukaryotes or in any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 29 2.6 3.8e+03 -1.6 2.4 49 63 180 198 161 213 0.55
2 29 3.7e-15 5.4e-12 46.0 4.6 1 86 337 409 337 410 0.86
3 29 2.8e-15 4.1e-12 46.4 4.6 1 87 437 506 437 506 0.83
4 29 2.9e-12 4.1e-09 36.8 0.6 1 61 528 584 528 595 0.83
5 29 6e-16 8.7e-13 48.5 5.7 1 87 597 667 597 667 0.83
6 29 3.7e-15 5.4e-12 46.0 3.1 1 86 691 762 691 763 0.82
7 29 3.8e-13 5.5e-10 39.6 1.0 1 87 798 866 798 866 0.80
8 29 8.5e-11 1.2e-07 32.0 1.2 1 86 915 984 915 985 0.76
9 29 3.1e-15 4.4e-12 46.3 0.1 1 86 1012 1081 1012 1082 0.82
10 29 5.5e-14 8e-11 42.3 0.8 1 86 1103 1172 1103 1173 0.81
11 29 7.3e-15 1.1e-11 45.1 1.8 1 86 1200 1271 1200 1272 0.86
12 29 8.6e-14 1.3e-10 41.6 1.4 1 85 1344 1412 1344 1414 0.82
13 29 8.7e-13 1.3e-09 38.4 0.1 1 86 1437 1505 1437 1506 0.81
14 29 6.6e-14 9.6e-11 42.0 1.0 1 87 1663 1732 1663 1732 0.80
15 29 5.6e-11 8.2e-08 32.6 0.1 1 62 1787 1846 1787 1865 0.78
16 29 0.002 2.9 8.4 0.0 1 58 1868 1918 1868 1943 0.79
17 29 4.4e-13 6.3e-10 39.4 1.8 1 87 1957 2027 1957 2027 0.86
18 29 5.7e-14 8.2e-11 42.2 1.2 1 87 2086 2156 2086 2156 0.82
19 29 3.8e-13 5.6e-10 39.6 0.7 1 86 2191 2262 2191 2263 0.82
20 29 6.6e-13 9.5e-10 38.8 1.3 1 87 2273 2344 2273 2344 0.81
21 29 2.6e-13 3.7e-10 40.1 0.0 1 86 2367 2437 2367 2438 0.82
22 29 6.3e-05 0.092 13.2 0.1 1 58 2469 2519 2469 2541 0.83
23 29 5.1e-15 7.4e-12 45.6 0.2 1 86 2558 2630 2558 2631 0.82
24 29 6.6e-14 9.6e-11 42.0 0.2 1 86 2770 2842 2770 2843 0.84
25 29 1.8e-14 2.6e-11 43.8 1.6 1 86 2908 2978 2908 2979 0.82
26 29 3.5e-14 5e-11 42.9 4.5 1 86 3091 3161 3091 3162 0.85
27 29 5.1e-13 7.4e-10 39.2 0.1 1 86 3253 3322 3253 3323 0.84
28 29 1.4e-09 2e-06 28.2 1.0 1 58 3340 3388 3340 3403 0.86
29 29 1.9e-09 2.7e-06 27.7 1.3 19 87 3406 3463 3394 3463 0.75
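
The domain table above can be loaded into typed records for filtering. The sketch below is a minimal Python example, assuming each row holds 13 whitespace-separated fields in this order: domain index, domain count, c-Evalue, i-Evalue, score, bias, HMM from/to, alignment from/to, envelope from/to, and accuracy. The class name `DomainHit`, the function `parse_row`, and the i-Evalue cutoff of 1e-3 are illustrative choices and are not part of this record. Only four rows, copied from the table, are included to keep the example short.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    """Hypothetical record for one row of the per-domain table."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_row(line: str) -> DomainHit:
    """Parse one whitespace-separated row (13 fields assumed)."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


# Rows 1, 2, 16 and 22 copied from the table above.
ROWS = """\
1 29 2.6 3.8e+03 -1.6 2.4 49 63 180 198 161 213 0.55
2 29 3.7e-15 5.4e-12 46.0 4.6 1 86 337 409 337 410 0.86
16 29 0.002 2.9 8.4 0.0 1 58 1868 1918 1868 1943 0.79
22 29 6.3e-05 0.092 13.2 0.1 1 58 2469 2519 2469 2541 0.83
"""

hits = [parse_row(line) for line in ROWS.splitlines() if line.strip()]

# Illustrative cutoff only; the record does not state which threshold was used.
I_EVALUE_CUTOFF = 1e-3
confident = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]

for h in confident:
    print(h.index, h.ali_from, h.ali_to, h.i_evalue)
```

With this illustrative cutoff, the same filter applied to all 29 rows would drop rows 1, 16 and 22 and keep the other 26.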

Sequence Information

Coding Sequence
ATGAATGCATACGGCACGGGCGCTGCTAGCGCGCACTATTATGGCGCTGCTCCTACTGCTGGGGCTGCGGCTGGGGCTGGGAATCCAATGGCCCCACCACCAGCGCCCGCCCTTAATCGTCACACGCTCGATGCTAGCGGTGAAATGATAATAAAATCGGAACCCATTGACGAACATGCGTTCAAGTCCAACTATATCGATGACAACACGCCCTTTGCCGATTTTAGTAAATTTACCGAGTTTGGCGACGACATGCTAAGTCCCAAGGTTGAGCTAACAATTAAGGACGAGGCTTATGGCAGCCAAAAGAACCCGCTGAGCTATCCGCGCCGAAAGCTGCAAACTGAGCGATCTTCGGAGAGTCTGCCTATTTGCCAACGCTGCAAGGAGGTCTTCTTCAAGAAGCAGGTCTATCTGCGTCATGTAGCCGAGAGCAGCTGTATCATACACGAGTATGAATTCAAGTGCAACATATGTCCTATGTCCTTCATGGGCGCTGAGGAGCTGCAGAAGCACAAGCAACTGCATCGCGCGGACAAGTTCTTTTGCCACAAATACTGTGGCAAGCACTTCGACAACATCGCCGAATGCGAGTCGCATGAGTATATGCAGCATGAATACGATAGCTTTGTGTGCAATATGTGCTCTGTAACGTTTTCAACGCGGGAACAGCTTTATGCTCATCTGCCGCAGCACAAGTTTCAGCAGCTGACAGCGCCGCACCACAAGACGAATGCATTGCCTGCAACGGCGGCGCTTAGTTCCTTGCTGCAGCAGCGTCAGGCGAATGCGGATGGCGCAGCGCTGTATGCTTCGACGCTGAAAAGCGATGCTAATGTGAAGTTGGAGCGCAGCTATAGCAACTCCACCAGCGAGTCTGGTTACAGTCTGCACGAGAGTAGTTATAATAATGCCTACGGCAGCGATAATTCGTTGCATGGTGGCAGCGCAGCAATTGGTGGTCCGCAGGCCCACTCCTCCACGCTGGACGAATCGGAGGATGCGCTGTGCTGTGTGCCGCTGTGCGGTGTGCGCAAAAGCACCAGCCCCACGCTGCAGTTCTTTACATTTCCCAAGGATGAAAAGTATCTGCATCAGTGGCTGCACAATCTCAAAATGTTCCATATTCCGGCCTCAAGCTATACCAGCTTTCGTATCTGCAGTATGCATTTTCCTAAGCGATGCATCAATCGTTATTCTTTGTGTTATTGGGCGGTGCCCACATTCAACCTGGGCCACGATGATGTAGCCAATCTGTACCAGAATCGCGAGCTGACTAACACCTTCACCACCGGCGAGGTGGCGCGCTGCAGCATGCCCAACTGCACCAGCCAACGCGGCGAGAGCAATCTCAAGTTTTATAATTTTCCCAAGGACATCAAGAGCCTGATCAAGTGGTGCCAGAATGCACGTTTACCCGTCCAGGCCAAGGAGCCGCGCCATTTTTGCAGTCGCCATTTCGAGGAGCGCTGCATTGGCAAGTTCCGGCTGAAGCCTTGGGCTGTGCCCACTCTACATCTGGGCGCCCAGTACGGCAAGATTCACGACAATCCCAAGAACCTGTATGTGGAAGAGAAACGCTGCTGCCTTAACTTTTGCCGTCGCAGTCGCTCCTCGGACTTTAACATGTCATTGTATCGCTTTCCCAGAGATGAAGTACTGCTGCGACGCTGGTGCTATAATCTGCGCCTCGATCCGGCTGTGTATCGCGGCAAGAACCACAAAATTTGCAGCGCTCACTTCATCAAGGAAGCTCTCGGATTGCGCAAACTCGACAGCATGGACGTGTGCTGCGTACCCAGCTGTGAGAGCAAACGGCACAACAATGAGAACATCACATTCCATACAATACCCAGGCGGCCAGAGCAGATGCGTAAATGGTGCCACAATCTGAAGATACCCGAGGACAAGATGCACAAGGGCATGCGGATATGCAGTCTACACTTTGAACCCTACTGCATTGGCGGTTGTATGCGACCGTTTGCGGTGCCCACACTACA
TCTGGGCCATGATGACGAGGACATTCATCGTAATCCGGATGTGATTAAGAAGCTGAACATACGTGAAACCTGTTGCGTTGCAGTTTGCAAGCGCAATCGAGATCGGGATCATGCCAATCTGCACCGTTTCCCCAGCAATCTCGCCCTGCTGACCAAGTGGTGCGCCAATCTGCAGCGACCCGTACCGGATGGCACCAAACTTTTCAATGATGCCATCTGCGAGGTGCACTTCGAGGATCGCTGTCTGCGCAACAAGCGCCTGGAGAAGTGGGCTGTGCCCACGCTTGTGCTAGGCCACGAGAATATTGCCTATCCGCTGCCCACGCCCGAGCAGGTGGCCGAGTCCTATGCGCGACCCAGTGCGCCCAACAATGGCGAGGAGCAGGGTGAATGCTGCGTGGAGACCTGTAAGCGTAATCCCAGCGTAGATGACATCAAGCTTTATCGGCCGCCCGAAGAATCACAGGTGCTCGCCAAATGGGCGCACAATCTGCAGCTGGACATTGCCCAGCTTTCTAACATGCGAATCTGTAATCTGCACTTTGAATCCCACTGCATTGGCAAACGCATGCGACCCTGGGCCATACCCACCCTCAATTTGGCCAGCAACATTGAGAATCTCTTCGAGAATCCCGAACACCAGATGCTCTACAAGCGTCGCACGCATCTCAACGCGGACAGAGCTGCCGCTCGCAGCGCTGGCGCTGACGGAGCTGCAATGAAGGCCTCTTGGGTGCCACGCTGTTGCCTGCCGCACTGCCGCAAGGTGCGCGCTCTGCACAATGTCCAGTTGTATCGCTTCCCCAAGGTGAATCGCACAACGTTGGCCAAATGGGCGCATAATCTACAAGTGCCGCTGGTCGGCAGCGCCCAAAGGCGTTTATGTTCCGCTCACTTTGAGCCGAATGTGCTGAGCAAGAAATGCCCGGTGCCGTTGGCGGTGCCCACGCTGGATCTCAATACGCCACCGGGCTACAAGATTTACCAAAACCCAGCCAAGGTGAGGGCTAACAAGCTGTGTTGGCAGCGCGTCTGCATTGTGGAGAGCTGCCGTCGACAGCGGGCACAGGGCGTACAGCTCTTCCGGCTGCCGCACAGTCGCACCCAGTTGCGCAAGTGGATGCACAATCTTCGCATGCTGCCGAGAGGCGCCATGCGGCAACAGTATCGCATCTGCTCGCTGCACTTTGAGCCGCACTCGTTTAACGGCAAGCGTCTGAGCACGGGCGCAATTCCAACGCTGGAGCTGGGCCATCAGGATGACGATATTTATCCCAATGAGGCGCAGTCGTTTGTCGAGGAACACTGCGCCGTAGAAGGCTGCGATGCGTCCAAGGAGCAGCCGGATGTGCGTCTCTTCCGCTTTCCCAACGACGATGAGGATCTGCTCTGGAAATGGTGCAACAATCTCAAAATGAATCCCGTCGATTGCTATGGCATGCGTATCTGCAACAGGCACTTCGAGCCGGACTGCATTGGGCCCAAACACCTGTACAAGTGGGCCATACCCACATTGGTTCTGGGGCACGATGATGGCCAGATAGAGCTGATACCCAATCCCAAGCCGGAGGAACGCTATGGTGATCCTGTGTTCAAGTGCTGTGTACCCACCTGCGGCAAAACGCGCAAATTTGATGAGGCGCAAATGAATAGCTTTCCCAAGGACCCATCGCTCTTCCAGCGCTGGCGCCACAATCTTCGCCTGGAACATCTCAACTTCAAGGAGCGCGAGCGCTACAAGATTTGTAATGCGCATTTTGAGGACATTTGCATTGGCAAGACGCGTCTCAATATTGGCTCCATACCCACGCTGGAGCTGGGCCATGAAGAGACCGAAGATCTGTATCAGGTTAATCCCGAAGAGCTGCAAAGCAACTTGTTTGGACGCCCGCGACGTGTGCATGAGAATCAGCGACTGAGCATCAAGCAGGAGCTGGATGAGGACATCAAGCCGGACATAACCATGTCAGAGGCCACGGATACAAACACAACACAGGTGA
AGATCAAGAAATCTGTGTTGGACTTGAAGTGCTGTGTGCCCAGCTGTGGTCGCAGCCGGCTGGAGCATGGTGCTCGCCTGTTTCCCTTTCCCACTGGCAAGCAGCAGCAGACCAAGTGGCGACACAATCTCCGCCTTAGCGCCGCCGATGTGGACAGGACAACGCGCGTTTGCAGCGCTCACTTCAATCGACGCTGCATCGATGGCAAACAGCTGCGTGGCTGGGCCATACCCACACAGCAGCTGGGCCACCAGGAACAAAACATATATGAGAATCCAAAGAACATACCGGGCTTCTTTACGCCCACCTGTGCGCTGGCGCACTGTCGTAAACGACGAAGCATTGACAATGATTTGCGCACCTACCGCTATCCGCGCAACGAGGAGCTGCTTGAGAAATGGCGCGTGAATTTGCGTCTGGCGCCGGATCAATGTCGTGGACGCATTTGTGCGGATCACTTTGAGCCCATGGTGCGCGGAAAGCTGAAGCTTAAGACGGGAGCGGTGCCCACGCTAAAGCTAGGCCATGATGAGGGCGTAGTCTTTGACAATGAGGCCATTAAAGTAGGAATGCAGCAGGAGGAGGAAGAGGAGGAGGATGCGGGCAGCTTGGAATCGCTGGGAAAGATAAAAATTGAGAAGCAGGAGAAGGAAACCCCTGAGCAGGAGTTAGAAAACGATGATGAGAATGAAGAGGACGAGCAGCAGCAGAAAGTGGAAGATCCTGAGGATGAAATGGAGCAAGAGCAGGATGAGGAGGAAGAGGAGCTGCAGGATCATGGCTATTTTGATCCCCTAGAGCTAGTGGAAACCTTTGCCGAACAGCACAGCGATGATAATTCCGCTGACAATTATCATCTCGAAGCTGATGAAGATGATGATGATGAAGATATACCTGGCAATGATGATGAGCTGCTGCTGCCAGACACTGTTCCAATGCAGTTGCCGCCACGCCGCGAAAAGGCGGTGAATAATGTGACGCCTATTTGTTGCTTGAAACATTGCCGCAAGGAGCGCACCGCAAGTCATCAGCTGAGTACTTTTGGATTTCCCAAGGATCAGCAGCAGCTGTTTAAATGGAGCGCCAATCTGCAGCTGGATCTCGTCGATTGTGTGGGACGCGTGTGCATCGAACATTTCGAGGCGGAGATGCTAGGCACTCGTAAGCTAAAGCAGAATGCGGTGCCCACATTGAATCTGGGTCATGCCACGCCGTTGAGCTATAGCTGCAATGGCCAATCCTTGAGCATATACGATGCACAGCCGCAGCATTCGGTTTTTCGGCTTTGGAGCCTGAAACACTGCCGCAAAAGGAAGCTGCTGACGATGCCTCCGGATCCGGCGACAACTAAACGACGCTGTTGTTTGCCCAGCTGTGGCAAGGAGCCGGAGCTGCATGGTGTTCAATTGAAACGACTGCCCAAGAAACGTCTGCTGCTGCGCAAGTGGCTGCACAATCTGAAGCTACCGGCACACATGAGCACTAGAAACGCGTTTCTTTGCGAGGAGCACTTTGAGCCACAGGCGACGCTGCCTACCCTGAAGCTGGGCCACGCGGCTTATAACATTTATCGCAATGGCAACTCGGCTTTATCCAGTGGCTGCCTGGTGCCCAGCTGTCCGTGTGCACGGCTTAATCTATATCGCTGCTATGCTTTGCCCGAGCATCCGCAGGTGCAGCAGGCCTGGCTGCAGTGGTTGCAATTGCCGCCGCCGCAGCTGGCCAGCCTTGCCCAGCTATGCGTCATGCATTATATGCAGCTGTTTGAGCAGGTGTCACTACCCGCGGATCTGCCTGAGTCTGTGATGCGCCAACTGCAGGAAACCTACGAACAAATATCCAGCTCTAGTATGGCCATGAAACTGCGCTGTGCTGTGCCCGGCTGCTACTCCAAATACACGGACAATGTGCGTCTCACCAAGCTGCCCGTGTGCCCGCAAACCTGCGCCCAGTGGGTGCACAATACCAAAATCAAGTACGATCCGGAGCGCCATTACATG
TATCGCATCTGCATGCGGCACTTTGAGCCGCAATGCCTGGGTGCAGTACGCCCTAAGCTGTGGGCTGTGCCTACGCTGCATCTAAACCATAACGATGCGGATATATATCAGAATACCATGATGGATAGCTCGGATGCCATGCCGATAGCCGAGTCTGTACCGCTGGAGTTGCCGTTGCGCATCAAGACAGAGCTGCCGCTAACTCTATCAGTCAGTCCTAGTGCCAGTCCAAGTCCACGCGGCAAAGTGCGCACCTGTTGCATTTCCACCTGCGGCCAGCAGGCCAATGCCCTAACGCGTCTCTTTCGCTTTCCCAGTGCCGAGACGGCCCTGCTTAAATGGCTGGTGAACACACAACAGCAGCCACGCCTCGTTGATACGCAGAATCTCTTTGTATGCCAGCGACACTTCGCGGCTGAGGCGATTTGCAAAAAGCAGCTGCAAAGTTGGGCAGTGCCTACCCTAAATCTGGGCCATCAGAGCCACATCATACCGAATGCCAAGCACAATGGCAATATTGCCGACAGCCAGGAGAACAAGCAGGCGCTGCAATACATCTGGGCCAATTACTGCTCGGTGCTTACCTGCTTCCAACAGCGCAGCGAACAGGTTCGTCTCTATGCCTATCCCACAGATCGCCCCACCATACGTAGGTGGGCGGCCAACTGCAAGCATCGCTCCATGCAGGCCAGCAGCGATGGATTTCAGGTCTGCCAGTCACATTTTACACCAGACTGCTTTGACCCTGATACCGGGGAGCTGAAGGAAGACGCGGTGCCCACACTTGAGCTGAGCCGACCTGTCAATGAGTTGCGCTGCGTGGTCAATGGCTGCGTTAGGGAGAAGGATGCAGCGCGTTGTCGTTTTTTCAGAGTGCCCAAACGTGCCTCCCTGCTGGAGGACTGGTGTTACAATCTGCGCATCGATGTTGCGTCAATTAGCGGGCAGGAGGTGCACGTGTGTGAACGCCACTTTGAGGCGCACTGTTTCAGTGCGTACAAACTGCGTCCGGGTGCACGACCTACACTTCATTTGGGCCACGATGATGAGTTGGATTTGTTGCCCAATCCGGCAAAGTGGGAGGAGGATGTGAATGTATGCTTTGTGCCCAGCTGTGGTCGCTCCAAGGATGTGGATAATGTGGAGCTATTCGGACTGCCCAGGATTAGAGGGGTCTTGGAGAAATGGCTTCAAAATTTCCGCCTCGAGCCGAGCAGGGAGCAGCTGCAAGGCATGCGGATATGCAGCGCACATTTTGAGCCCAGCTGCATGGAGAATGGCCGTCTACACTTAGGCTCGGTGCCCACACTGCAGCTGGGCCACGATGACTTGGACAATATACATCAAAGCGCGGAGCTGCCGTCATCGCAGCTTAAAGGCAAACGAATAGCCATGGGCTACGACTGCTGCTATCCACAGTGTATGGAGCTGCAGAAGAGCTATCAAAGAATCGCATATGAGCTGCCCCAGCAGGAGGCATTGCGTAACTTGTGGTTGTCGTATCTGGGTCTGGAGCAGCATAATCAGGAACCGCTCAAGCTCTGCCCACTGCACTTGATCATGCTTTATGAACACAGTGTCAACCATTTTCCAGAGCATTCATCGGAGGAGCAGTTGCTGGACGCCAATTACGAGGCTGCGCGAAATAGCGTGCGCATACGGATTATCAGCTGTGCGGTGCGTGGCTGCAGGACACTCAAACCACGCGACGACTACCGCCTGCACGCAATGCCTACGCGTCGGGATGTACTCCAGATGTGGCTAGACAACATGCAGCTTGTGTTCTACGAGCAGCAGCGTTATATGTATAAGGTATGCAGCAGACACTTTGAGGCCATCTGCGTAACAGAGACTACTCGCCGTCTAAAACCCTGGAGTATGCCGACGTTAGAGTTGCCGGAACGTGACCCAGAAGCGCCGCCGTTGCATCAGAATCCCTCAGAGGAGGAATGGCAGCGCATGAATGAGCAGATAGGCAGTTGCGAGGCAGTGCAGTCGTT
AGAGCCCGCTGTCAAACTGGAGCCGGAGCCAATTGTCAAACAAGAGCTGCACTCTATTGTTAAGCTGGAACCGAAGCCACAGTCAGAACACTTGTATGAGGAGGAGGAGGAGCCCAATGATCAGCAGCAAGCGCTAGAGGTGCTGCTCGAAGTGGGTCACGTTGAGAAGTGCACCACATACGAGCAAATGGACACAAAACCAATTATAAGCTATGCCGAAACCCTATCACATAACTCACTGGGCCCAACGACAACAGTGGGCAGCGCCTGTATTGTCGGCAACGGATTCACCTACAGCGCGCGCCACTGCAGCGTGCGGGGTTGCGATGTGACTTCTCTGGATGTGAATGACAGTCTCAAGCTACACAAGTTTCCTACATCGCTGGATGCCATGGAGAAATGGATGCACAACACCCAGGTAAATGTGGACATCAACTTTGCGTGGCGCTTTCGCATCTGCAGCTTGCATTTTCTACCCGAGTGCTTTAATGGGTCGCGTATTAGGCGTGGGGCCATGCCCACGCTGCGTCTGGGATCGCGCCGCCTAGGGGATATCTATGACAATGAGTTCAATGTGCAGCCAGAGCAGACGAGTGTGGATCAGTCGGCTGAGGCGTCGGCAGACGCTGTAGTGCCCACTGAGCCGCACGATGGTGCAACGGAGTTTAATATAAATCTCCATTTGCCCTGCCCCGCACCACCGCGCAAGTCCAGCAAGTTCTGTCAGATCGATGGCTGTTCGAATCATTTGACTAGCGAAAATCTTACTCTGCACAAGTTTCCACACTCGGCGGAAATGTGCGCCAAGTGGCAGCACAATACACAGGTGCCGTTCGATCCGGAGTACCGCTGGCGATATCGTATCTGCAGCGCACACTTCGAGCCCATCTGCCTGGGAAACATGCGGCTGATGCATGGCAGCGTGCCCACACTGAAACTGGGCGCTCGGGCGCCCAAGCAGCTCTTTGGCAATGACTTTGCAGCGATTAGCTTGCGCCTGGATAAGGAAAAACGCAGCGCCGACCAGAGCTTGCCAGTGAAGCAAGAACAAGTGGAAGATGATCAAGAGCAGTATGATCAGGAGCAAGAGGATCTGAGCATGCTGGTACCAGAGTTGCAGCTGCACGAGGGCGACGACGAGCAGGAAGACAATCAGTTTACCTACAGCAACAGTTGGAGCGACTCGCAGCAGCAGCTGCAGCTACGTCTGCCCAGCATTAAGCAGGAGAAGGGCACCATCTATAATCCCGTCAAGTCGGGCTATGACAAGTGCTCGCTCGTGCACTGTCAGCGCCAGCGTTCACAGCATGGCGTCCACATCTATAAATTCCCACGCTCGCGGCAGCTACAGCATCGCTGGATGCACAATTTACGAATCAGATATGACGAGCGGCGACCTTGGAAGACAATGATATGTAGTGTACACTTTGAGCCGCACTGCATACGCCTGCGTAAGTTGCGTCCTTGGGCGGTACCCACACTGGAGCTGGGCGACAATGTTCCGCAGGATCTGTACAGGAACGAGCAAAGCCAACTGCAGTTTGAGCAGCAGCGCAGCAGCGACGCGGAAGCGGGCAGTGAGGGCGAGGACTATGATGCGGAGCTAGAAGACACCATACTGGAGGAGTACGATGATGAGTATGATGATAATGACAATACTGGTCAAATTCCGGCTGAGCCGCACATCAAACGAGAGTATCGCTCACGCTGCGATCCACAACCGGCGGGTCAGCTGCCACCCTGGAAAATTAAGCAATGCTGTTTGCCCTATTGCCGTCGTCCACGCGGCGATGGCATCAAGCTTTTCCGGCTGCCTAACAATATTAGCGCCATACGCAAATGGGAGCAGGCGACGGGCATGCGCTTCTATGAGTCGCAGCGCAACACAAAACTTATCTGCAGTCGTCACTTTGATCCGCAACTTATTGGTGTGCGTCGTCTTATGTCCAATGCGGTACCCACGCGCAATCTAGGTCCAAACAACGAGGAATCCGAGCCGC
CAGCGAGCAGTCCACGCTGCTGCATTAAGGATTGCCAACCAGATGGACATGTCAAGCTGCACAAGTTTCCCAGCGATCCCCAGCTGCTGCATCAGTGGTGTCAGGCGCTAAGTTTAAGGGATGAGCAGCGCCACGCCGGCAAGTACATTTGTGCCGTGCACCTGCCCACCAAAGCGATGAGCTGTCTCATTTGCGGTGTGGAGGATGTACAGCTACCCCTGCAGGACTTTCCCGAGCATCGCAATCAGCGAGTCAAATGGTGCTACAATTTGAAAATCGAACCCATCGCCAAATGGGACAACTCGAAGCACATTTGCTCCAAGCACTTTGAGAGCTATTGCTTCATTAGGCCGGGTCATCTGTTGCCGGACGCCATGCCCACGCTGCATTTAAAGCACAACGACAGCAATATATTCCTCAACGAATCCGCCATAGAGAGCAGCCAGCTGCTGCGCGTCAAGGATGAGCCTATGGAGTGTGAGGATCTGATGCTGTAA
Protein Sequence
MNAYGTGAASAHYYGAAPTAGAAAGAGNPMAPPPAPALNRHTLDASGEMIIKSEPIDEHAFKSNYIDDNTPFADFSKFTEFGDDMLSPKVELTIKDEAYGSQKNPLSYPRRKLQTERSSESLPICQRCKEVFFKKQVYLRHVAESSCIIHEYEFKCNICPMSFMGAEELQKHKQLHRADKFFCHKYCGKHFDNIAECESHEYMQHEYDSFVCNMCSVTFSTREQLYAHLPQHKFQQLTAPHHKTNALPATAALSSLLQQRQANADGAALYASTLKSDANVKLERSYSNSTSESGYSLHESSYNNAYGSDNSLHGGSAAIGGPQAHSSTLDESEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLHQWLHNLKMFHIPASSYTSFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPNCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPAVYRGKNHKICSAHFIKEALGLRKLDSMDVCCVPSCESKRHNNENITFHTIPRRPEQMRKWCHNLKIPEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLHLGHDDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNLALLTKWCANLQRPVPDGTKLFNDAICEVHFEDRCLRNKRLEKWAVPTLVLGHENIAYPLPTPEQVAESYARPSAPNNGEEQGECCVETCKRNPSVDDIKLYRPPEESQVLAKWAHNLQLDIAQLSNMRICNLHFESHCIGKRMRPWAIPTLNLASNIENLFENPEHQMLYKRRTHLNADRAAARSAGADGAAMKASWVPRCCLPHCRKVRALHNVQLYRFPKVNRTTLAKWAHNLQVPLVGSAQRRLCSAHFEPNVLSKKCPVPLAVPTLDLNTPPGYKIYQNPAKVRANKLCWQRVCIVESCRRQRAQGVQLFRLPHSRTQLRKWMHNLRMLPRGAMRQQYRICSLHFEPHSFNGKRLSTGAIPTLELGHQDDDIYPNEAQSFVEEHCAVEGCDASKEQPDVRLFRFPNDDEDLLWKWCNNLKMNPVDCYGMRICNRHFEPDCIGPKHLYKWAIPTLVLGHDDGQIELIPNPKPEERYGDPVFKCCVPTCGKTRKFDEAQMNSFPKDPSLFQRWRHNLRLEHLNFKERERYKICNAHFEDICIGKTRLNIGSIPTLELGHEETEDLYQVNPEELQSNLFGRPRRVHENQRLSIKQELDEDIKPDITMSEATDTNTTQVKIKKSVLDLKCCVPSCGRSRLEHGARLFPFPTGKQQQTKWRHNLRLSAADVDRTTRVCSAHFNRRCIDGKQLRGWAIPTQQLGHQEQNIYENPKNIPGFFTPTCALAHCRKRRSIDNDLRTYRYPRNEELLEKWRVNLRLAPDQCRGRICADHFEPMVRGKLKLKTGAVPTLKLGHDEGVVFDNEAIKVGMQQEEEEEEDAGSLESLGKIKIEKQEKETPEQELENDDENEEDEQQQKVEDPEDEMEQEQDEEEEELQDHGYFDPLELVETFAEQHSDDNSADNYHLEADEDDDDEDIPGNDDELLLPDTVPMQLPPRREKAVNNVTPICCLKHCRKERTASHQLSTFGFPKDQQQLFKWSANLQLDLVDCVGRVCIEHFEAEMLGTRKLKQNAVPTLNLGHATPLSYSCNGQSLSIYDAQPQHSVFRLWSLKHCRKRKLLTMPPDPATTKRRCCLPSCGKEPELHGVQLKRLPKKRLLLRKWLHNLKLPAHMSTRNAFLCEEHFEPQATLPTLKLGHAAYNIYRNGNSALSSGCLVPSCPCARLNLYRCYALPEHPQVQQAWLQWLQLPPPQLASLAQLCVMHYMQLFEQVSLPADLPESVMRQLQETYEQISSSSMAMKLRCAVPGCYSKYTDNVRLTKLPVCPQTCAQWVHNTKIKYDPERHYM
YRICMRHFEPQCLGAVRPKLWAVPTLHLNHNDADIYQNTMMDSSDAMPIAESVPLELPLRIKTELPLTLSVSPSASPSPRGKVRTCCISTCGQQANALTRLFRFPSAETALLKWLVNTQQQPRLVDTQNLFVCQRHFAAEAICKKQLQSWAVPTLNLGHQSHIIPNAKHNGNIADSQENKQALQYIWANYCSVLTCFQQRSEQVRLYAYPTDRPTIRRWAANCKHRSMQASSDGFQVCQSHFTPDCFDPDTGELKEDAVPTLELSRPVNELRCVVNGCVREKDAARCRFFRVPKRASLLEDWCYNLRIDVASISGQEVHVCERHFEAHCFSAYKLRPGARPTLHLGHDDELDLLPNPAKWEEDVNVCFVPSCGRSKDVDNVELFGLPRIRGVLEKWLQNFRLEPSREQLQGMRICSAHFEPSCMENGRLHLGSVPTLQLGHDDLDNIHQSAELPSSQLKGKRIAMGYDCCYPQCMELQKSYQRIAYELPQQEALRNLWLSYLGLEQHNQEPLKLCPLHLIMLYEHSVNHFPEHSSEEQLLDANYEAARNSVRIRIISCAVRGCRTLKPRDDYRLHAMPTRRDVLQMWLDNMQLVFYEQQRYMYKVCSRHFEAICVTETTRRLKPWSMPTLELPERDPEAPPLHQNPSEEEWQRMNEQIGSCEAVQSLEPAVKLEPEPIVKQELHSIVKLEPKPQSEHLYEEEEEPNDQQQALEVLLEVGHVEKCTTYEQMDTKPIISYAETLSHNSLGPTTTVGSACIVGNGFTYSARHCSVRGCDVTSLDVNDSLKLHKFPTSLDAMEKWMHNTQVNVDINFAWRFRICSLHFLPECFNGSRIRRGAMPTLRLGSRRLGDIYDNEFNVQPEQTSVDQSAEASADAVVPTEPHDGATEFNINLHLPCPAPPRKSSKFCQIDGCSNHLTSENLTLHKFPHSAEMCAKWQHNTQVPFDPEYRWRYRICSAHFEPICLGNMRLMHGSVPTLKLGARAPKQLFGNDFAAISLRLDKEKRSADQSLPVKQEQVEDDQEQYDQEQEDLSMLVPELQLHEGDDEQEDNQFTYSNSWSDSQQQLQLRLPSIKQEKGTIYNPVKSGYDKCSLVHCQRQRSQHGVHIYKFPRSRQLQHRWMHNLRIRYDERRPWKTMICSVHFEPHCIRLRKLRPWAVPTLELGDNVPQDLYRNEQSQLQFEQQRSSDAEAGSEGEDYDAELEDTILEEYDDEYDDNDNTGQIPAEPHIKREYRSRCDPQPAGQLPPWKIKQCCLPYCRRPRGDGIKLFRLPNNISAIRKWEQATGMRFYESQRNTKLICSRHFDPQLIGVRRLMSNAVPTRNLGPNNEESEPPASSPRCCIKDCQPDGHVKLHKFPSDPQLLHQWCQALSLRDEQRHAGKYICAVHLPTKAMSCLICGVEDVQLPLQDFPEHRNQRVKWCYNLKIEPIAKWDNSKHICSKHFESYCFIRPGHLLPDAMPTLHLKHNDSNIFLNESAIESSQLLRVKDEPMECEDLML
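
The C2CH consensus quoted in the Description (Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His) can be written directly as a regular expression and scanned against a protein sequence. The Python sketch below does this for a short excerpt copied from the Protein Sequence field. It is a literal transcription of the consensus only, not the Pfam PF05485 profile HMM, so its matches are not equivalent to the hmmscan domain calls listed above. The function name `find_c2ch` is hypothetical.

```python
import re

# Literal transcription of the consensus from the Description:
# Cys, 2-4 residues, Cys, 35-50 residues, Cys, 2 residues, His.
# The lookahead lets overlapping candidates be reported separately.
C2CH_RE = re.compile(r"(?=(C.{2,4}C.{35,50}C.{2}H))")


def find_c2ch(protein: str):
    """Return (0-based start, matched substring) for each C2CH candidate."""
    return [(m.start(), m.group(1)) for m in C2CH_RE.finditer(protein)]


# Excerpt copied from the Protein Sequence field of this record.
EXCERPT = (
    "ALCCVPLCGVRKSTSPTLQFFTFPKDEKYLHQWLHNLKMFHIPASSYTSFRI"
    "CSMHFPKRCINRYSLCYWAVPTFNL"
)

hits = find_c2ch(EXCERPT)
for start, motif in hits:
    print(start, len(motif), motif)
```

Because the first spacer allows 2 to 4 residues, adjacent cysteines such as the `CC` at the start of this excerpt can each anchor a candidate, so overlapping hits should be merged before counting domains.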

Similar Transcription Factors

Sequences clustered by similarity using MMseqs2

100% Identity
iTF_00534610;
90% Identity
iTF_00490809;
80% Identity
-