Basic Information

Gene Symbol
-
Assembly
GCA_018903575.1
Location
JAEIGH010000020.1:1881071-1892551[+]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 31 4.1e-15 1.2e-11 45.7 4.4 1 86 125 197 125 198 0.85
2 31 1e-14 3e-11 44.5 5.2 1 87 225 294 225 294 0.83
3 31 2.9e-15 8.7e-12 46.2 0.3 1 87 317 389 317 389 0.84
4 31 6e-16 1.8e-12 48.4 5.5 1 87 433 503 433 503 0.82
5 31 4.7e-15 1.4e-11 45.5 3.2 1 86 527 598 527 599 0.81
6 31 5.4e-13 1.6e-09 38.9 0.6 1 87 634 702 634 702 0.81
7 31 7.8e-11 2.3e-07 32.0 1.6 1 86 745 814 745 815 0.77
8 31 2e-16 5.9e-13 50.0 0.4 1 86 842 911 842 912 0.83
9 31 1.4e-12 4.2e-09 37.6 2.3 1 86 933 1002 933 1003 0.81
10 31 6.3e-15 1.9e-11 45.1 1.7 1 86 1030 1101 1030 1102 0.85
11 31 5.1e-13 1.5e-09 39.0 2.0 1 85 1182 1250 1182 1252 0.82
12 31 1.3e-12 3.9e-09 37.7 0.1 1 86 1276 1344 1276 1345 0.82
13 31 6.9e-14 2.1e-10 41.8 2.8 1 87 1468 1537 1468 1537 0.80
14 31 3.7e-11 1.1e-07 33.0 0.3 1 86 1622 1688 1622 1689 0.82
15 31 0.027 81 4.6 0.0 1 58 1708 1755 1708 1778 0.73
16 31 1.1e-12 3.2e-09 38.0 0.2 1 86 1785 1854 1785 1855 0.83
17 31 8e-14 2.4e-10 41.6 1.2 1 87 1921 1991 1921 1991 0.82
18 31 2.4e-12 7.1e-09 36.9 0.4 1 86 2026 2097 2026 2098 0.81
19 31 2.6e-11 7.7e-08 33.5 0.4 1 87 2110 2183 2110 2183 0.78
20 31 4.5e-13 1.3e-09 39.2 0.2 1 86 2209 2281 2209 2282 0.81
21 31 2.5e-07 0.00074 20.8 0.5 1 58 2318 2369 2318 2386 0.86
22 31 6.4e-13 1.9e-09 38.7 0.1 1 87 2407 2479 2407 2479 0.81
23 31 2.3e-16 7e-13 49.7 1.6 1 86 2531 2602 2531 2603 0.82
24 31 4e-05 0.12 13.7 0.2 1 58 2634 2683 2634 2702 0.79
25 31 3.3e-13 9.8e-10 39.6 0.3 1 87 2721 2793 2721 2793 0.82
26 31 8.4e-15 2.5e-11 44.7 0.4 1 87 2936 3009 2936 3009 0.83
27 31 1.7e-12 5e-09 37.4 2.4 1 86 3074 3144 3074 3145 0.81
28 31 8e-15 2.4e-11 44.8 4.5 1 86 3248 3318 3248 3319 0.85
29 31 5.7e-13 1.7e-09 38.9 0.1 1 86 3399 3468 3399 3469 0.85
30 31 1e-11 3.1e-08 34.8 0.5 1 58 3495 3544 3495 3560 0.86
31 31 8.4e-11 2.5e-07 31.9 1.1 18 87 3561 3620 3550 3620 0.77

Sequence Information

Coding Sequence
ATGGAAATGCCCACGGCACCACCACCATCATCAGCGGTAACACATCACAAGTCAAATGCATCCGGAACATCTTCTACATTACCAGCAACGGCAGCTTTGAGTTCTCTGCTCCAACAACGTCAGGCTAATGCAGATGGTGCGGCCATGTTTGCTGCTGCTGCCTCCTCAACATCCCTCAAAGGCGAAGTCAACGTGAAGTTGGAACGAAGTTATAGCAACTCCACAAGTGACTCTTCTTTTGGTGGAATGCATGAATCCAACTATAATAATAATAATAATGCCTATGGCAGTGATAACTCCATTCATGGATCTGGTGCCGTTGGTGGGCCACAAGCTCATTCCTCAACGCTGGATGACTCTGAGGATGCTCTATGCTGTGTGCCCATGTGCGGTGTAAGCAAAAGCACTAGTCCCACTCTCCAGTTCTTCACATTCCCCAAAGATGACAAATATCTCCATCAATGGCTACACAATTTAAAGATGTTCCATATATCCGCCTCAAGCTATTCGACATTTCGTATCTGTAGCATGCATTTCCCGAAACGTTGCATCAATCGGTATTCGTTATGCTATTGGGCAGTGCCTACCTTCAATTTGGGACACGATGATGTCGCTAATCTCTATCAGAATCGCGAGCTAACAAATACCTTTACCACCGGCGAGGTCGCACGCTGCAGCATGCCGCACTGTAATAGCCAGCGGGGTGAGAGTAATCTCAAGTTCTATAACTTTCCCAAGGATATTAAAAGTTTAATCAAATGGTGTCAGAATGCTCGGCTGCCTGTTCAGGCCAAGGAGCCGCGACACTTTTGTAGCCGTCACTTTGAGGAGCGTTGCATTGGCAAATTTCGTTTAAAACCCTGGGCAGTGCCCACACTACATCTGGGTGGTGCCCAATATGGGAAAATCCATGATAATCCCAAAAATTTGTATGTAGAGGAGAAGCGCTGTTGTCTTAACTTTTGTCGTCGCAGCCGTTCAACGGATTTCAATATGTCGCTTTATCGTTTCCCAAGGAATGAGGTATTATTACGACGCTGGTGCTATAATCTGCGACTCGATCCGGGTGTATATCGGGGCAAGAATCATAAAATATGCAGTGCACACTTTATTAAAGAGGCATTGGGTTTAAGAAAACTGTCGCCGGGTGCTGTTCCTACACTTCATTTGGGTCACAATGATACCTTTAATATCTATGAAAATGAATTATGGCCACCGCCGACGCCAACATATCTTGGCATGGGTGGTGCTAACTCCCTTTCCGGTGGAATGAATGTCAGCGATAGCATGGACATTTGCTGTGTACCAAGTTGTGAGAGTAAGCGACATAATAGCGAGAACATCACATTCCATACGATACCCAGAAGGCCCGAGCAGATGAGGAAATGGTGTCACAATTTAAAAATACCCGAGGATAAAATGCACAAGGGCATGCGGATATGTAGTCTACATTTCGAGCCGTATTGCATTGGCGGCTGCATGCGTCCATTTGCAGTGCCAACTCTTCATCTGGGACATGACGATAAGGATATTCATCGTAATCCGGATGTGATTAAGAAACTTAATATAAGGGAAACTTGTTGTGTGGCAGTCTGTAAAAGGAATCGTGATCGTGATCATGCCAATCTCCATCGGTTCCCTAGCAATGTGGCCCTATTAACCAAATGGTGTGCCAATCTGCAAAGGCCTGTCCCAGATGGCAGTAAACTCTTTAACGATGCCATATGCGAAGTGCATTTCGAAGATCGTTGTTTGCGCAACAAGAGATTGGAGAAATGGGCAGTGCCCACGTTAATGTTGGGTCATGAGGATATTGCGTATCAGTTGCCCACATCCGAGCAAGTGGCAGAGTTCTATGCACGTCCAAATGCACCGAATAATGGCGAGGAGCAGGGCGAATGTTGTGTGGAAAGCTGTAAGCGTAATCCCAGTGTGGATGATATTAAACTATATCGTCCACCCGAAGAGTCAGATATACTGGCCAAATGGGCGCATAATCTTGAACTGGATGTGGCCGAGTTGCCAAATATGAGGATATGCAATCTACATTTCGAATCCCATTGCATTGGTAAACGGATGAGACCGTGGGCCATACCAACATTAAACCTATCTTCTAATATTGAGAATCTCTACGAGAATCCAGAGCACTCGATGTTATACAAGAGGAGAACGAAGCGAGATCCAAATCGAGACGTATCCCTAGCGGCAACGAAACCAACTTGGGTTCCTAGATGCTGTTTGCCGCATTGTCGCAAGGTCCGAGCTCTGCATAATGTTCAACTCTATCGATTCCCCAAACTGAATCGTTCCACATTGGCCAAATGGGCACACAATCTACAAGTGCCAATGGTGGGCAGTGCCCAACGGAGACTCTGTTCGGCACATTTCGAACCTCATGTATTGAGTAAAAAGTGCCCTGTACCATTGGCTGTACCCACGATTGATTTAAATGCCCCGCCAGGTTACAAAATCTATCAAAATCCAGCCAAACTTAAAGCCAGCAAATTGTGCCTGCAAAGAGTTTGCATTGTGGAGAGTTGCCGTCGCACCAGGGCTCAAGGAGTCCAGCTCTTCCGTTTGCCTCACAGTCCGACGCAGTTAAGGAAATGGATGCACAACATCAAGACACGTCCACGGGCAGCTACAAGATCGCAGTATCGCATCTGTTCCATACACTTTGAATCGCATTCGTTTAATGGCAAAAGATTAAGTGCTGGAGCCATTCCCACCTTGGAATTGGGTCATGACGATGACGACATCTATCCAAATGAGGCGCAAGCATTTGTGGATGAGCATTGTGTGGTCGAGAGTTGTGAATCGTCAAAGGATCAACCCGAAGTGCGTCTATTCCGTTTCCCCACCGAAGATGATGATCTTCTGTGGAAATGGTGCAACAATCTCAAAATGAATCCAGTTGATTGTGTAGGAGTGCGTATTTGTAATAAACATTTTGAAGCTGATTGCATTGGTCCCAAACACCTATTCAAATGGGCCATACCCACTATGGAGCTGGGACACGATGATAGTGAAATCGAACTGATACCAAATCCCAAGCCTGAAGAGCGATATGTTGATCCAGTTTTTAAGTGTTGTGTACCAACTTGTGGCAAGACCAGGAAATTTGATGAGGTGCAAATGAATAGTTTTCCGAAAGATCCTTTGCTCTTCCAGCGCTGGCGTCACAATCTGCGCTTGGAGCACCTGAATTTTAAGGAGCGGGAACGCTACAAGATTTGCAATGATCACTTTGAGGATGTTTGCATTGGCAAAACTCGGCTTAATATAGGCTCCATACCCACCCTTCAGTTGGGTCACAATGAGACGGAGGATCTGTATCAAGTCAATCCTGCGGAATTGCAAAGTAATCTCTTTGGCAGACCACGTAGATTGCATGGTGGGGTTGACATAAAGCTAGAATATGCGGAGGATTCCGAGGCGGAATCAGGACTGCAGGATGTTAAACCAAATATCTATGAGATGGCCGAAGCCACCGATATAAATATCAGGCAGGTGAAGATTAAGAAATCTCTCGCTGATCTAAAGTGTTGTGTACGCAGCTGTGGTCGTAGTCGCCTGGAGCATGGTGCTCGCCTCTTCCCCTTCCCCAATGGCAAGCAACAGAATCTGAAATGGCGGCACAATCTCCAACTTGAACCGGAAGAAGTGGACAAAATGACACGCGTCTGCAGTGCGCATTTCAATCGGCGTTGCATAGATGGCAAACATCTGCGGGGATGGGCCATACCCACACAACAATTGGGACATCATCATGAACAGCCAATTTATGAAAATCCCAAGAATATTCCAGGCTTCTTTACCCCAACATGTGCCCTAAGCCACTGTAGACAGAGGCGAAGCATTGATAATGATTTGCGCACCTATCGCTATCCGAGAAGTGAGGATCTATTAGAGAAATGGCGTGCCAATTTACGTTTGGCGCCAGATCAATGCCGTGGACGGATTTGTGCTGATCACTTTGAGCCAATGGTTAGGGGCAAACTGAAATTAAAGACAGGAGCAGTGCCCACTCTGAAATTAGGACATGATGAGGAATTAGTTTACGATAATGAAGCTATTAAAGCTAATCTAGTGGATGAAGAGGATGTCAGTTTGGAATCACCACCGCAAGTAATAACTAAAAAGGAGATTTTGGAAGAGGAAGATGATGAAGAAGATCTGCAAGAGCATGAGGATGATGAGGAGGAGGAAGAAAACGATCCACCAGAAGAGGATTCACATTCCGATTATTTCGATCCCCTAGAATTGGTAGAGACATATGCCGATGATCAAGTACCAGAAGATGAATATAGTGCACCCGCCCATCAACTCCCGGCACCACCATCAATAGCTGCTCCACCTTTTGGCAGGCGGGAAAAGGTGGCGAATAATGTAACACCCATTTGTTGTTTGAAGCATTGTCGTAAGGAACGCACTCCCACCCATCACTTGAGTACTTTTGGCTTTCCCAAAGATCATCAGCTTTTGCTGAAATGGTGTGCCAATCTTCACCTGGAACCCATGGATTGTGTGGGACGTGTTTGCATTGAGCATTTTGAAGCAGAAATGTTGGGAACACGGAAGCTAAAGCAAAATGCTGTTCCCACCATTAATGTGGGACATCAGATGCCTTTACCGTATACCTGCAATGGCCAGGAGCGTAGCGATGAGAAGGAGGATAATTCGGTTTTTCGGCTTTGGAGCCTGAAACATTGTCGCAAGAGGAAACTAATGGAACCACCAGATATTCGCCTAAAAGTGGAGAAGATGGATCCGATGGGTCTAGTGAAAGTGAAGAAGGAGAAAATGGAAATGGAGGAGATGGAGGAGAAAGAGACGATGATGATGATGACTAAACCTAAGAGATGTTGCCTTAACCAATGTGAGCAAACTGCAGAATTGCAGAAATTTCCAAGAGATTTCAATTTGCTAAGAAAATGGTTGCACAACCTCAAGTTGACCCTTAACGAGGATTTGGATCCCTCACAGCTGCGTTTGTGTCTAAGGCACTTTGAAGGTCATTTGGTACGAAATGGACATCTTTCAAAAGAGGCATTACCCACTCTGGAACTGGGTCATCAGGATAAGAATATTTATAGAACAACTGTAGCAACATCTGGTGGTTGCTTGGTGGCGAGTTGCCCATGTGCTCGTCTCAATCTCTATCGAAGTTATGCTCTACCCAAGGAGCCCTATATTAAAGAGGCGTGGCTAAACTATCTAAAGCTGCCAGCAATCACCCATGGACAACTCTGTGTAATGCACTATATGCAACTGTACGAGGAGATGCCCTTCAAGGAATTGCGTCATATCTATGAATCCATTGCCAATTCCACACAAGCTCTGAAATTGCGCTGTGCCGTACCCGGCTGTCGATCAAAGTACACGGATAATATACACTTGACCAAGTTGCCGCAAAATCAAAGCTTACTTACCAAATGGTTGCATAACACCATGTTGACCTATGATCCCAGCAAACATTCAATTTATCGCATTTGTTTGCTGCACTTTGAGCCATTCGCATTGGGTCCAGCATGTCCCAAGCCATGGGCAGTACCCACCTTGGAATTAAATTATCAGAATGACATTTATTTGAATCCTTCGAAAGAGGAATTGGCTAACATAACAGACTATCCCCGAATTAGTACTCCGCTGCAAATTAAAACAGAATTTACTTTACCATTGAGAATAAAAACGGAATTAGCCGCCTTAAGCAGTCCCAGTGTTGGTTCCACACCTAGTCCACGGGGCAAGGTTAGAATTTGTTGCATACAATCATGTCTGCAGCAGGCGAACTCCCAGTTGCGTCTCTATCGTTTTCCCAATACAGAACCCGCTCTACTCAAGTGGCTGGTCAATACGCAGCAGCAACCACGTCTTGTGGATCCCACACAGTTGTATGTGTGTCAATCCCACTTCGAACTTGAAGCTATCTGTAAGAAACAATTGAGAAGTTGGGCTGTGCCCACATTAAATTTAGGACATGATGGTCATGTCATACCCAATGCCAGGCATAATGGAAATATTGCCGATAGCCAGGAAACGGAACAGGCAATGGAATTTATTAGGGAAAACTATTGTTCCGTGCTAAGTTGCTTTCAGCCAAAGAGTGAGGCTCTGCGTTTGCATCCCTATCCCAAGGATATGCCTACCATACGGAAATGGGCTGCCAATTGTAAGCATCGTTCCATGCAGGCCAGCAGTCATGGATTCCAGGTCTGTCAATTGCATTTTGAAGCAGATTGCTTGGATCCGGATACTGGTGACTTACGTGAGGGATCTGTACCCACTCTGGATCTAACAGTGACTCGGCTAAACAGCGAGTTGCGTTGCCTGGTCACTGGCTGTGTCAAAGATGAAACTCAGCCGCGACGTCGTTACTACAAACTACCTAAGCGACCTGCTCTGCTCAGTGAATGGTGCAGAAATCTCGGTTTAGTTCCTTCTGGACTCCTACATGGTGCTGATCATCACGTTTGCGAACGTCACTTTGAATCTCGTTGCTTCAACATCCACAAACAGTTGCGTTCAGGATCACGTCCGACCCTAAATTTGGGTCACAATGAAAATATTACGTTGCTGCCAAATCCGGAGATATTCTGTGATGAGATTGACGACGTCAGTACTTGCTCTGTGCCAAATTGTGGTCAATCCAAGCTAACGGATGAAACACTTCAACTAAATAGTTTGCCCAGAATGCGAAAGTTGGCGGAGAAATGGTTGCATAATCTGCATCTACCATACACTGGAAAGGAGCAACTGGCCAAGTTTCGTGTCTGCCAGAAACACTTTGATCCATCTTGCTTTGAAAACGGGTTTTTGCGTCAGGGAGCCCTGCCCACCTTGGAGTTGGGTCATGAGTCTGTGGACATTTACCAAACAGATGACCAGAGTGTGGGCAAATACAGAAAGCACCAAAAAGTATTGCCTGGCGTACGTGTATCGGGGCACGACTGTTGTTATCCCCAATGTGTGCAACAGCAAAAGAATTACCAACGAATGGTGTACGACTTGCCCAAAGAGGAGAAGCTGCGTCAGAGATGGCTACAGCATTTGGAAATTGATGAAAGAGAAAGGGAAAGACCTTTGATATTATGTCCACTCCATTATATATTCCTATACGATTATAGTGTGAAAAACTTTGAAGAGCATGTTCCAAATGATCTGCTGGAAAGCAACTATGAAGATGCAAGAAATGGCTCTAGAATCCGGCTTATCAGTTGTGCTGTGCGAGGATGTGGAACACTTCAGCCTCGTGATGGTGGCAGATTACATGGTCTGCCCACGAATCCAGAGATCTTCCAGATGTGGTTGGATAACACTGAATTGGTTGTATATGAGCCACAGCGTTACATGATTAAAGTCTGTAGCAAACACTTTGAGTCTATATGTTTTACGGATATTCGCAAATTGAAATGCTGGAGTGTGCCCACTCTTCATCTACCCGGTGAGGCAGTGCATCAAAATCCAACCGAAGAGGAATGGTTAAAGATAAACGAAAGAATAGCTGTATCAGCCGCTCAGCCAGGGGAACCCTGTGAGGACAATTCAATGCTGGAACCAGTTGTTATAATGCAAGAAGAGGACTGTGTCTGTTGTGTACCCAATTGTGGACGGTCCAAGCAAATGGATAATTCCATTCAGTTTACAAGCTTCCCCAAGAACAACATGCTGGCCGAGAAATGGATTCTTAATTTTCATCTGAAAGTGACCAAAGATCAGTTGTCCGATCTTCGTGTATGCAATCGGCATTTTGAGACAACTTGTTGGGAAAATGGTCGATTGCGAAGAGGAGCCATGCCAACCCTAGAATTGGGTCATGAGTGCAGTGATATTTATCGAACCGAGGAGCTAGATCTCTTCAAGAGTCGCAAGCAAACCAAGAGGACATATGGCCAGGGATGTTGTTTTCCTCAGTGCGTGGAACTTTTAAAGAATTTCCAACGTATGGTCTATGATTTGCCAAGAGAAGCTCAACTGCGACAACGCTGGCTACAATATATGGAATTGACGGAATCAGAGCAGCCATTAAAAATGTGCCCACTCCATTATATTATTCTATATGATCACAGTGTGAAAAACTTTGAGGAACATGCTCCAGAAAAGCTGCTTGATTTTAATTATGAAAACGCTAGAAATTGTGTAAGAATTCGGATTATTAGCTGTGCGGTGGAAGGATGTAATACACTGCAGCCACGTGATGGAGGTCGCATGCATGGTCTGCCACCAAGATCAGATATACTCCAGATGTGGCTGGACAACACAAGATTAGTCTTCCATGAGCATCAACGTTACATGCTAAAAGTGTGCAGTAAACATTTTGAGCCAAAATGTTTTACGGATATTCGTAAATTGAAGAGCTGGAGTATTCCGACGCTCCATCTGCCCGATGAGGTTGTGCATCAAAATCTCACCGAAAGAGAATGGCAGCAAATGAATGAGAGACTTGCCGTGCAAAACAATCGGGAAGAGGAAAGTTTTGATGAAAATTCAATGCTAGAACCGATTGTTATGATGGAGCACGCCGAATCCGAAGCGGAGATGGAGGAGCAGGGCGAAACCATGCCTCAGCAAAAACTTGTGACCCATGATAATTTAAAGCACGAGTCCCAAGATGATAATGGCAATAATGATGATGAAATGCAAGCATTGGAAGTACTCCTCGAAGTGGGTCATGTTGAAAAATGTTCCAGTTATGAGAAAATGGACAATAAATCACATTTACCATACTCCGAGACGAGTCCATTGAGTCCTTCGATGGGATCTATGCCACCGGGTCAACGCGGTGGTCATTATAATGCTCGTCACTGCAGTGTCCAGGGTTGTCAGATAACTGCCAATGATGTAGACGGTAATATCAAGCTGCACAAGTTCCCCACCTCTGTGGAGGCCACTGAAAAGTGGATGCATAACACCCAGGTAGATGTGGATGAGAACTATTCCTGGCGGTATCGCATTTGCAGTTACCATTTCGAACAGGAATGCTTCAATGGGGCCCGTATACGGCGTGGATCTATGCCCACATTGCATTTGGGTCCACTTCGACCCAAGGATATCTTTAGGAATGAGTTCCCGCAATTGGAAATGGATGAAACTATGGAAGAATCAATTCCTAAAGTTACTCCCACTGTTGAACAGGAACCTGGGGCTCAGCCTATAAAGAGTAAGGTGACACAACTATGCCTGCCACGTCCTGCTCCGCCTCGAAAATCGAGCAAATTCTGTCAGATTGAAGGCTGTTCGAATCATTTGACTAGCGAGAATATGACTTTGCACAAGTTTCCCCACTCCCTGGATATGTGTGCCCGCTGGCAGCACAATACTCAGGTGCCATTTGATCCAGAGTATCGTTGGCGCTACCGCATCTGTAGTATCCATTTTCATCCAGTCTGTTTGGTCAATATGAGATTATTGCATGGCAGTGTGCCTACTTTAAAACTGGGCCCTAGAGCTCCCGCTCAACTGTTTGACAATGATTTCGATGCCATTAACATGAGATTGGATAAGAGATCACATTTGGAGCAGGGATCTAGCAAGGTCAAGCAAGAGAGACCCCACCATCAACAGCAATCCGATGAATTCTATTTAGAGCCAGAAATGGAAATGGAAGTAGATGATGAGGAGCAAGACGCAGATCAATCCCAATCCATGACATCATTTGAAAGCTGGAGACATCAACTTCGCCTACCAACAGTTAAGCAAGACAAGGTCGCCTACAATCCCATCAAATCTGGCTACGATAAATGCTCCCTAACACACTGCCAGCGTCAGAGATCCCTGCACGGCGTCCACATATACAAATTCCCACGATCGAAACGCCATCAGCAGCGATGGATGCACAATTTGCGCATACGTTATGATGAGAAGAAACCATGGAAATACATGATCTGCAGTGTTCACTTTGAACCAAATTGTATACGCCTGAGAAAACTTCGTCCATGGGCTGTGCCCACTTTGGAATTGGGTTCGAATGTGGCAGATCAGATTTACACCAATGAACAGTGCCAGGAAATGGCTTCAGATGTGAGTGAAGAAGAGGAAACCGGACCAGAAGAAAGTGGACAAGAAGAAGATGATGACGATGAAGTAGATGACGATGGAGATACTGGTGCAGAGGCCCACATAAAGCGTGAAAGACGCCATTGGGGAACGTCCGGAGCCGCCGGTGGTCAAATGGCTCCTTGGAAAGTAAAACAATGTTGTCTGCCCTATTGTCGTCGACCACGAGGAGATGGCATCAAACTATTCCGACTGCCCGGCAATCCTACTTCCATACGTAATTGGGAAAAGGCCACGGGGATGACATTTAAAGCATCGCAACGGAACACACGACTCATTTGTAGTCGTCACTTTGAGCCGGAATTGATGGGGGTACGCCGTTTGATGCGAAATGCCATACCCACCAGACATCTATATCACCAAAGGGAGAGCTATAGCCCAGAATTGGTGATACCCACAAACACTCCAACTCCTATTGGTCCCCGTTGCTGCATTCCTGATTGCCCCCCACACGATGGGTCGTCTCAACTTCATCGATTTCCCAGTGATCCACAACTGTTGAAGCAATGGTGTGAATCTCTTAAACTCACGGATTTCCAACGCTATAGTGGACAATACGTTTGCTCTAATCATCTTCCCGCCCAGGATTTAGCATGCATTATCTGTGGCGTGGAGGATATACAATTACCGCTTCTTGACTTTCCCGAGAATCGCAATTATCGGGCTAAATGGTGTTATAATCTCAAAATTGAAACAATACCCAAATGGGACAACTCCAAGCATATTTGCTCGAAACACTTTGAATCCTATTGCTTCAGTCAGCAAACCGGTGAACTGCATCCAGAGGCAGCACCTACATTGCATTTAAATCACAATGATACGAATATATTCCTCAACGAGTATGCCATAGAACAGCATTCTTTGATGAGGATTAAAGACGAGCCCTTGGACAACGATGAGATGTTGTTGGCTTAA
Protein Sequence
MEMPTAPPPSSAVTHHKSNASGTSSTLPATAALSSLLQQRQANADGAAMFAAAASSTSLKGEVNVKLERSYSNSTSDSSFGGMHESNYNNNNNAYGSDNSIHGSGAVGGPQAHSSTLDDSEDALCCVPMCGVSKSTSPTLQFFTFPKDDKYLHQWLHNLKMFHISASSYSTFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPHCNSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSTDFNMSLYRFPRNEVLLRRWCYNLRLDPGVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPTPTYLGMGGANSLSGGMNVSDSMDICCVPSCESKRHNSENITFHTIPRRPEQMRKWCHNLKIPEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLHLGHDDKDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVALLTKWCANLQRPVPDGSKLFNDAICEVHFEDRCLRNKRLEKWAVPTLMLGHEDIAYQLPTSEQVAEFYARPNAPNNGEEQGECCVESCKRNPSVDDIKLYRPPEESDILAKWAHNLELDVAELPNMRICNLHFESHCIGKRMRPWAIPTLNLSSNIENLYENPEHSMLYKRRTKRDPNRDVSLAATKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPMVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTIDLNAPPGYKIYQNPAKLKASKLCLQRVCIVESCRRTRAQGVQLFRLPHSPTQLRKWMHNIKTRPRAATRSQYRICSIHFESHSFNGKRLSAGAIPTLELGHDDDDIYPNEAQAFVDEHCVVESCESSKDQPEVRLFRFPTEDDDLLWKWCNNLKMNPVDCVGVRICNKHFEADCIGPKHLFKWAIPTMELGHDDSEIELIPNPKPEERYVDPVFKCCVPTCGKTRKFDEVQMNSFPKDPLLFQRWRHNLRLEHLNFKERERYKICNDHFEDVCIGKTRLNIGSIPTLQLGHNETEDLYQVNPAELQSNLFGRPRRLHGGVDIKLEYAEDSEAESGLQDVKPNIYEMAEATDINIRQVKIKKSLADLKCCVRSCGRSRLEHGARLFPFPNGKQQNLKWRHNLQLEPEEVDKMTRVCSAHFNRRCIDGKHLRGWAIPTQQLGHHHEQPIYENPKNIPGFFTPTCALSHCRQRRSIDNDLRTYRYPRSEDLLEKWRANLRLAPDQCRGRICADHFEPMVRGKLKLKTGAVPTLKLGHDEELVYDNEAIKANLVDEEDVSLESPPQVITKKEILEEEDDEEDLQEHEDDEEEEENDPPEEDSHSDYFDPLELVETYADDQVPEDEYSAPAHQLPAPPSIAAPPFGRREKVANNVTPICCLKHCRKERTPTHHLSTFGFPKDHQLLLKWCANLHLEPMDCVGRVCIEHFEAEMLGTRKLKQNAVPTINVGHQMPLPYTCNGQERSDEKEDNSVFRLWSLKHCRKRKLMEPPDIRLKVEKMDPMGLVKVKKEKMEMEEMEEKETMMMMTKPKRCCLNQCEQTAELQKFPRDFNLLRKWLHNLKLTLNEDLDPSQLRLCLRHFEGHLVRNGHLSKEALPTLELGHQDKNIYRTTVATSGGCLVASCPCARLNLYRSYALPKEPYIKEAWLNYLKLPAITHGQLCVMHYMQLYEEMPFKELRHIYESIANSTQALKLRCAVPGCRSKYTDNIHLTKLPQNQSLLTKWLHNTMLTYDPSKHSIYRICLLHFEPFALGPACPKPWAVPTLELNYQNDIYLNPSKEELANITDYPRISTPLQIKTEFTLPLRIKTELAALSSPSVGSTPSPRGKVRICCIQSCLQQANSQLRLYRFPNTEPALLKWLVNTQQQPRLVDPTQLYVCQSHFELEAICKKQLRSWAVPTLNLGHDGHVIPNARHNGNIADSQETEQAMEFIRENYCSVLSCFQPKSEALRLHPYPKDMPTIRKWAANCKHRSMQASSHGFQVCQLHFEADCLDPDTGDLREGSVPTLDLTVTRLNSELRCLVTGCVKDETQPRRRYYKLPKRPALLSEWCRNLGLVPSGLLHGADHHVCERHFESRCFNIHKQLRSGSRPTLNLGHNENITLLPNPEIFCDEIDDVSTCSVPNCGQSKLTDETLQLNSLPRMRKLAEKWLHNLHLPYTGKEQLAKFRVCQKHFDPSCFENGFLRQGALPTLELGHESVDIYQTDDQSVGKYRKHQKVLPGVRVSGHDCCYPQCVQQQKNYQRMVYDLPKEEKLRQRWLQHLEIDERERERPLILCPLHYIFLYDYSVKNFEEHVPNDLLESNYEDARNGSRIRLISCAVRGCGTLQPRDGGRLHGLPTNPEIFQMWLDNTELVVYEPQRYMIKVCSKHFESICFTDIRKLKCWSVPTLHLPGEAVHQNPTEEEWLKINERIAVSAAQPGEPCEDNSMLEPVVIMQEEDCVCCVPNCGRSKQMDNSIQFTSFPKNNMLAEKWILNFHLKVTKDQLSDLRVCNRHFETTCWENGRLRRGAMPTLELGHECSDIYRTEELDLFKSRKQTKRTYGQGCCFPQCVELLKNFQRMVYDLPREAQLRQRWLQYMELTESEQPLKMCPLHYIILYDHSVKNFEEHAPEKLLDFNYENARNCVRIRIISCAVEGCNTLQPRDGGRMHGLPPRSDILQMWLDNTRLVFHEHQRYMLKVCSKHFEPKCFTDIRKLKSWSIPTLHLPDEVVHQNLTEREWQQMNERLAVQNNREEESFDENSMLEPIVMMEHAESEAEMEEQGETMPQQKLVTHDNLKHESQDDNGNNDDEMQALEVLLEVGHVEKCSSYEKMDNKSHLPYSETSPLSPSMGSMPPGQRGGHYNARHCSVQGCQITANDVDGNIKLHKFPTSVEATEKWMHNTQVDVDENYSWRYRICSYHFEQECFNGARIRRGSMPTLHLGPLRPKDIFRNEFPQLEMDETMEESIPKVTPTVEQEPGAQPIKSKVTQLCLPRPAPPRKSSKFCQIEGCSNHLTSENMTLHKFPHSLDMCARWQHNTQVPFDPEYRWRYRICSIHFHPVCLVNMRLLHGSVPTLKLGPRAPAQLFDNDFDAINMRLDKRSHLEQGSSKVKQERPHHQQQSDEFYLEPEMEMEVDDEEQDADQSQSMTSFESWRHQLRLPTVKQDKVAYNPIKSGYDKCSLTHCQRQRSLHGVHIYKFPRSKRHQQRWMHNLRIRYDEKKPWKYMICSVHFEPNCIRLRKLRPWAVPTLELGSNVADQIYTNEQCQEMASDVSEEEETGPEESGQEEDDDDEVDDDGDTGAEAHIKRERRHWGTSGAAGGQMAPWKVKQCCLPYCRRPRGDGIKLFRLPGNPTSIRNWEKATGMTFKASQRNTRLICSRHFEPELMGVRRLMRNAIPTRHLYHQRESYSPELVIPTNTPTPIGPRCCIPDCPPHDGSSQLHRFPSDPQLLKQWCESLKLTDFQRYSGQYVCSNHLPAQDLACIICGVEDIQLPLLDFPENRNYRAKWCYNLKIETIPKWDNSKHICSKHFESYCFSQQTGELHPEAAPTLHLNHNDTNIFLNEYAIEQHSLMRIKDEPLDNDEMLLA

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00604131;
90% Identity
iTF_00577817;
80% Identity
-