Basic Information

Gene Symbol
-
Assembly
GCA_933228635.1
Location
CAKOFY010003624.1:1269006-1299243[+]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 34 6e-15 8.1e-12 46.2 0.5 1 86 852 924 852 925 0.84
2 34 9.7e-15 1.3e-11 45.6 5.9 1 87 952 1021 952 1021 0.80
3 34 2.3e-15 3.2e-12 47.6 0.3 1 87 1042 1114 1042 1114 0.83
4 34 1e-13 1.4e-10 42.3 4.0 1 86 1195 1263 1195 1264 0.78
5 34 1.1e-15 1.5e-12 48.6 5.5 1 87 1288 1360 1288 1360 0.81
6 34 2.2e-12 3.1e-09 38.0 1.1 1 87 1395 1463 1395 1463 0.80
7 34 7.5e-11 1e-07 33.1 3.7 1 85 1504 1572 1504 1574 0.72
8 34 1.1e-14 1.5e-11 45.4 0.2 1 86 1601 1670 1601 1671 0.81
9 34 4.6e-14 6.2e-11 43.4 2.8 1 86 1693 1762 1693 1763 0.80
10 34 2.4e-13 3.2e-10 41.1 3.0 1 87 1791 1863 1791 1863 0.86
11 34 6.3e-06 0.0085 17.3 0.0 1 73 1931 1991 1931 2007 0.75
12 34 6e-12 8.2e-09 36.6 0.3 1 87 2028 2100 2028 2100 0.81
13 34 2.1e-14 2.9e-11 44.5 1.4 1 87 2130 2201 2130 2201 0.79
14 34 3.8e-13 5.1e-10 40.5 5.4 1 87 2246 2319 2246 2319 0.83
15 34 1.2e-14 1.7e-11 45.2 0.4 1 87 2341 2409 2341 2409 0.81
16 34 1e-14 1.4e-11 45.5 0.5 1 87 2707 2776 2707 2776 0.80
17 34 6.1e-12 8.3e-09 36.6 1.1 17 86 2871 2933 2835 2934 0.72
18 34 2.3e-11 3.1e-08 34.8 1.0 1 87 2966 3037 2966 3037 0.78
19 34 4.4e-12 6e-09 37.1 0.3 1 87 3065 3134 3065 3134 0.82
20 34 2.1e-13 2.9e-10 41.3 1.5 1 87 3155 3225 3155 3225 0.80
21 34 2.6e-14 3.5e-11 44.2 1.2 1 87 3248 3319 3248 3319 0.81
22 34 2.1e-06 0.0029 18.8 0.2 1 60 3335 3383 3335 3413 0.77
23 34 1.6e-11 2.2e-08 35.3 2.8 1 86 3423 3492 3423 3493 0.82
24 34 1.5e-12 2.1e-09 38.5 2.4 1 86 3518 3588 3518 3589 0.81
25 34 1.8e-13 2.5e-10 41.5 2.9 1 87 3609 3682 3609 3682 0.78
26 34 1.9e-13 2.6e-10 41.4 4.3 1 87 3697 3767 3697 3767 0.81
27 34 2e-12 2.7e-09 38.2 7.4 1 85 4031 4100 4031 4102 0.80
28 34 2.7e-09 3.7e-06 28.1 0.6 1 86 4121 4190 4121 4191 0.77
29 34 9e-10 1.2e-06 29.7 0.5 1 86 4200 4267 4200 4300 0.80
30 34 0.15 2e+02 3.3 0.2 31 58 4323 4356 4291 4378 0.70
31 34 1.8e-14 2.5e-11 44.7 3.0 1 87 4401 4477 4401 4477 0.84
32 34 6.7e-14 9.2e-11 42.9 0.2 1 85 4501 4573 4501 4575 0.85
33 34 9.2e-12 1.2e-08 36.0 7.2 1 87 4746 4820 4746 4820 0.79
34 34 1e-13 1.4e-10 42.3 0.1 1 86 4845 4914 4845 4915 0.84

Sequence Information

Coding Sequence
ATGTCACAAAATAATCAACGCAAACattatcatatacatgcaccctatcaacacccccaccaacagcagcagcaacaacaacaacagcagcaggctcaacttcatcatcaccaccatttaacagcaccatcgcattcacagcagcatcaacataatcttcaacagcaacaacaacatcaatggtactctcaacaacactatcaacaacaTGGTTTACATGTAAGAGACTCGCGGCATCTTCCACATCATCCTCCTTCTCCACAACAGCCACCACCAGCACACCACAATCAAACAATGTCACCACATATGTTTACAAGTGGTTATGTTGGTATGTCACAATCAGGCACTGGTGCTGGTGGTGGATCAAGTAGTGCCACACATAATGCTTCAGCACTGGGCTCCACACATAATATGCCGGCTGCTACAGCCGCCTCATCGTCTGCCCATCATTATTCTTCTTCGTCTTCATCTGCTACAAGTGCCAGTAATGTTAATAGCAATAATGCTGCTAGTAACGCAGCCAGTAGTAGTAGTGCTGCCGGGGCCTTTGCCACCAGTCGTAACAGAATGTTTGACCTTGAAATGTTACCAACACAACAACATTCACAACATAACAGTACATCAACTGCTGCACACTCACATTCTATGCTGGCTGCCACAAATAATCGTACAGCATTTGATGCCTATTCTCACAATTCTTTATATTCACAACAAAATCAAAGACATCACTCTGCcacagcatcctcgcattatcctttggctagtggtggccatcatccagctcaccatgctcatcatcatataccgccgagccatcatcagccgcattcatcttcgcttcatcatcagcaacagcagcatcatcaacttcaccaacagccgcagcatcattattatcatcatcGTGCTCCAACGACTTCTCTCCATCGGCCACATTCCCAAGTTTTAGCGCCAATGTTACAACATATTAAATCTGAGCCAGTAGAGCAAATAACCGTTACACCATCAATACAAACCGAGGAAGTCATCATCAAATCTGAACCTATCGATGATACGGGCTATCATCAGCACAAAAGTGCGCCACAAATGGAAAACAATTCTTTTCATATGGAAGAAAAACGTAAACAATATGagctgcaacaacatcagcaacagcagcgacaacaacagcaacaacaacaacaccaacaacgtgaaagagaaatccagcaacaacaacgtttacagcaacaattacaagagcaacaaagacatcaacgggaacaacaacagcagcagcgtcaacaacaacaccaacagcaacaagaactacatcaacaccagcaattacaaataaaacaagaacctcatgattatcctacacatCTTCATCATCATCATGACCGACAAAGTAAACATCATCAGCATAATGAAGATGTTACCCAACAAACACAACAACAATCACATATAAATTCTGAGAATTCCACGGCAATACAACCGGCTGAGAAAAACCAACAAGAGCAGAACCAGCAAAAACAAGAAGAAAAACCAGAACAACAACAAAAGCAGGAGCAGCAGCAGCTAATATCCTTGACTAATATAAAAACAGAAGCAAAGCCCCTTAACTTTCCTCGCCGCAAATTACAAACAGAACGTTCCTCAACTCTGCCGATATGCCAACGATGTAAACAAGTTTTTTTAAAACGCCAAAACTACACACAACATGTTGCTCTATCCAGTTGCAATATTGTCGAATATGACTTCAAGTGCTCCGTCTGTCCTATGTCCTTTATGTCCAATGAGGAGCTGCAGACGCACGAACAATTACATCGTTCCCATAGATATTTTTGTCAAAAGTATTGTGGCAAATTCTATGAAACAATTGACGAGTGTGAACAACATGAATACGGGCAGCATGAATATGAAATGTATAAATGTAATATTTGTTGTATAAGTGTAACCCAACGTGAACAATTATTTACGCATTTAAATGAGCACAAATACCAGCCACGTTTTGATTGTTGTATATGTCGTTTGTGCTTCCAAACATCCCTGGAGTTGCACGATCATTATATGACCAATGAAGATTTTTGCGGCAAATTTTATGATAAAGAAGCTTTCAAAAAAACAATAACATCGCTTTCAAAAACGTATCTAGGAAAACCGGAAAGTTCTAAACTGGAAATAGCTCATACATTTTCTTTGAAAGATATACCTTCCGCAAATAGTCAACAATTGGAACCTTTGTACACAAAACCTTCCACTTCAAAATCTTACATGGAACCGCCAACTACCCCCACAACACCTTCCTTTAAAAACTTTAGTTCTAATGAGTTTCCTTTAGAGCCACATGTTGAGGTAAAAACTGAAATAAAAGTTGAACCAGACTTTTATCCACCCATGGATCAATCGGATTTTCCATCCTATGACAATGATTACTCCAGCTCCGATTATACCTCCGGATCAAACCAAAGTCTTGCATTTTTACATGATTTTCATGACAATGCTTCCAGCTCTACAAATTCTTCGTATTCCCATAATGCTAACGATGCCATACAAGATGATGAAGCCATTTGTTGTGTACCCAAGTGTGGAGTTCGTAAATTCTCCTCACCCTCTTTGCAGTTTTTTGGCTTTCCCAGAGATGAGAAATATCTCGCTCAATGGTTGCACAATTTGAAAATGGTCTATGATCCTAATATTAATTATGGTATTTATCGTATTTGTAGTTTACATTTTCCCAAACGTTGTATAGCTAAGTATTCCTTAAGTTATTGGGCTGTGCCCACCTTTAATCTGGGACACGATGATGTCGGCAATTTATATCAAAACCGGGAAAGTTCCGGGGGGTTTCCAGCTGGTGATTTAGCCAAATGCAGCATGCCCAATTGTCCCTCGCAAAGAGGTGAGACTAACGTAAAATTTCATGTATTTCCTCGGGATTTAAAAACATTGATAAAGTGGTGCCAGAATTCCAGGCTACCAGTCCATAGTAAAGATAATAGATTTTTTTGTTCTCGTCACTTTGAGGAGAAATGTTTTGGTAAGTTCCGTTTAAAACCCTGGGCCATACCTACACTCAATTTGGGCACAGTATATGGCAAGATCCATGATAATCCCAATATTTATCAAGAAGAGAAAAAATGTTTCCTGCCTTTTTGTCGTCGCAGTAGATCTTATGATTGTAATTTGTCCTTGTATAGATTCCCAAGAGATGAGACTTTATTAAGGCGTTGGTGTTATAATTTGAGATTAGATCCCAATATGTATCGGGGTAAAAATCACAAAATTTGCTCCTCACATTTTATTAAAGAAGCCTTGGGCTTAAGGAAACTTAATCCTGGAGCAGTTCCTACTTTAAATTTGGGGCACAATGATAGATTTAATATTTACGAAAATGAATTGTACACGCCACCACCACCTCCTCCACCGCCGCAACCTTCTACTTCCTCTAAAGCCCAAAAATATGCAGAGAGGTTTAAACAGGAAATGGGAGGTTCTCATATATACGATGGTGTCTTTATGAATTCCATGGTTCACAAATTCTCCTCTTCCTCCTCATCGGCTTCGAATAACTCTAACAATTTAGATTTGGGTGATGTTTGCTTAGTGCCGTCATGTAAGCGCACTCGTCATTCAAGTGATATCACTTTACATACAGTTCCAAAACGCGCCGAACAGCTTAAGAAATGGTGCCATAACTTAAAAATGGATTTGCATAAAATGCATAAAAGTGTTCGGATCTGTAGTGCACATTTTGAGAAATATTGCATTGGAGGCTGTATGAGACCTTTTGCCGTACCCACTTTGGAGTTGGGCCATGATGACAGTAACATTTATCGCAATCCAGATGTTATAAAGAAACTGAACATCAGGGAAACCTGCTGTGTACAAACGTGCAAGCGTAACCGAGATCGGGATCATGCCAACTTACATAGGTTCCCTACTCATCCGGAGTTGTTACAAAAATGGTGTGAGAACTTGCAAAAACCCATTCCTGATGGTACTAAACTTTTTAATGATGCTGTGTGTGAAATGCACTTCGAAGACAGATGTTTGCGCAATAAACGTTTGGAGAAATGGGCCATACCTACTTTGAATTTGGGTTGGGATGAGGCTCCTCACATTTTGCCCTCCGAGGAAGAAGTCAATGAAAACTGGGTTAAACCCTTTGCACCCAATAATGGTGACGAACAAGGTGAATGTTGTGTTAGCTCCTGCAAACGCAATCCTCAAATCGATGATATTAAATTGTATAGACCACCCGAAGATGCTGAGCAATTAGTCAAATGGGCCCATAACCTGCAGGTGGATGTCACGGAACTGCCTAATATGAAAATCTGTAATTTACACTTTGAGCAGCACTGTATAGGTAAGAGATTGCTGAATTGGGCGATGCCCACTCTTAATCTGGGCGGAAAAGTTGAACATCTCTTTGAAAACCCTCCACCCATGCCCACTATATACAGGAAAAAAATTAAACCTGAAAGAATTCTAAGCAATCATGAAGCCATAAAATGGTCTCCCCGATGTTGTCTGCCCCACTGTCGCAAAATGCGCACAGTAGACAAGGTTCATCTATTCCGTTTCCCCTACAATCACCGCCAGACTTTGGCAAAATGGTGCCATAATTTACAGCTTCCTTTGGTGGGTAGTTCTCATAGACGTATTTGCTCCAGCCATTTTGAGCCCTCAGTCCTAACGAAACGTTGCCCCATGTCATTGGCGGTTCCCACACTTGATCTAAATCCTCCTACGGGCTACAAAATCTATCAAAATCCGGCGCGCTTGAAACAAATAAAACCTGGTACCCAGCGTCAATGTATCATAGAATCCTGTCGTAAGACCAAAATGGATGGTGTTTCCCTTTATCGTTTTCCCAACAATAGATCTATATTACTCAAGTGGCGCCATAATATAAAAAATTGGCCAAAGGGAAAACTGAGTAACCAACTAAGAATTTGTGCAGAGCATTTCGAAACCCACTCGGTGGGAGAGAAAAAGTTATCTCCTGGAGCTATACCCACCTTAAAACTGGGACACGACTCTAAGGACTTGTATGCTAATGAAACCAGATCTTTCTTTGATTTGGAAAAATGTGTTGTAAACGGTTGCGATTCCCGCAAAGAAATGGAGGACATTCGCTTATTCCGTTTTCCTCGAGATGACGATGAATTGCTAAAGAAATGGTGCCATAACCTCAAAATGAATCCCAATGATTGTGTGGGCATCAAGATATGCAGCAAACATTTTGAAAACGACTGTTTAGGACCACGGCAACTCTATAAATGGTGTATTCCGACATTAAAGTTGGGTTATGCGGAAGACGATTTAGTGGAAATAATACCTAATCCACCACCAGAACAAAGAACCGGAGAATATCTGTTTAAGTGCTGCGTACCCAACTGTGGCAAAACGCGTAAATATGATGATGCGCAAATGAATAGTTTTCCGAAGCATTTGAAAATGTTCCGCAAATGGAAGCACAATCTAAAGTTAGATTTTCTTAACTTCAAAGAAAGAGAAAAATATAAAATTTGCAATGACCACTTCGAGGCAGTATGTGTAGGTAAGACTCGACTTAACTTTGGGGCTTTGCCCACATTAAATTTAGGACACGATGACACAGATGACTTGTATCAGATTAATCCCGAACGCATCAGACCTAACTTGTTTATAAGACAAAAAGATGTGGAAATATTGGAGAGGAGAAGAATTCTTAGACATGAGAAACAAGAGCAATATGAGTGTGAGGAACCAGAAGAGGATCCAGTCAACGACCCTTTAGGGTTAGAGCCTGGAGACATGAAATGTGTTGTAGATGACTGTCCCGCACCCAAGTCGATAATGAGAGAACCTTATGATCTACCAGAGACTTCCGAATTAAACAAATTATGGCTGAAGGAGTTAGGCAAAAATGATGAAGATGGTATACCTTCAGAAGCTAAAGTATGTGGTCTGCATTTCCAAATGACCTACATTAAACTCAAAAACCAAATGCTTGAGTTAAGCGAAGATAACTCTTTAATCCAGGCTGATGTCAATAAACTGCAACTAAATTATCAAAAGTCTAACATATCCTTGGTGGTCAATAGCTATCAGTGCCGCGTAAATGATTGTCCTACTAACTTACTCAACTCTTCCATAAGACTGTATTACTTTCCATATGGCAAACATTTGATAAGCAAATGGTCTCAGAACACCGGCATAACTCCCGATGAACATCGCAGATATATGAATAAAGTATGCGCTTTGCACTTTGAAACTTATTGCATTACCGAAAACCAAAGACTAAGATCTTGGGCCATACCAACCCTTAATTTACCGCCCAGCGAAGGAAAACATTTGAACAAAAACCCTGATCTCACTAAACTTGATAGAAGAATGTTGGGTCCTCCGGTATGGAAATGTGCCGTGACCAACTGCAATTCTCTTAAATCGGGAGATGATGATTCCATTAAACTTTTCAATTTCCCTAGTGAAGATAAATTGCTTAAGAAATGGTGTGATAACTTAAAAATCTCCCATCACTTTACGCCTTTAATGAAAATATGTTCCTTGCATTTTGAGAAATTATGTTTTGGTAGTTCTCGCATAAGATCCTGGGCCATACCGACCCTTAATTTGGGTCATGATAACACACCCGAACATTTTAACAAAACTACCATAAGACAAGAGGTTTATGACCAAAACGAGGAAGTTGAGGCAACACAATTAAAACAAGTCAAAATCAAAAAGTCCCTGGATACTGCCAAATGTTTTGTAGCTTGCTGCCGCAAGAGCCGCCTAAAGCATGGTGTCCGTTTCTATAGTCTACCCTCGAATTCGAGAGTTAAACGCAAATGGCTGCATAATTTACAAATCAGCCAATTGAAATCCAAACACAAACTGCAAAATATAAAAATTTGTAATCTGCATTTCCACAAACGTTGCCTGGATGGCAAAATCCTAAAGCCTTGGGCAGTGCCCACTATGCATTTAGATTATACGGAGGGTATTTTTGATAATCCCCGTCGTATGCAATCGCTACCCATTTTACGTTGTGTGCTGGCACACTGCAACAATCATACTGGCCTAAAAGGTGTACGTCTCTTTGTGTTTCCCAAATCTCCAGAGTTCCTAAAGAAATGGTCGAAAAACTTGAAATTGGATTTGGAAAAATGTAAGGGCCGGATATGTCAGGAACATTTCGAAAACGAAGTTATAGGAGAGAAAAAGTTAAAAAATGGGGCAGTGCCCACATTGAATTTAGGTCATGAGGATGATGATATTTACGATAATTCAGAATTAATGGAAAAGCTAAAAATAAAGAAAATAGAAAAGGAATTAAAGCAAGATCCTTTAGAAACAAAAAATGAAGAAGACTGTGATGAGGAATATGAACCTCTAGACGGGGAAGAGGTCGAAGAAGAGGAAGAAATGTGGGAAACGGATATTGAAGAGGAAGAGGAGGGGGAAGAGGATGAAGATCAGACATATTTTGATGACGAAGAGGAAAGGGAAAACGCAGCTAAGCAGGAAGAGCCTCAAGAAGATGATGAATCCAGTGTTACCAATTCAGTCAAAGATTGGAGTTCTGTTAAGTTTAAGGAGCTAAGAGTTTCCATTACACCTCTAACCCCCGAAGACTTAATGGATTTATGTTCACGATCTTCCTATGAACGTGAATTTGGTTCGTTGACGCCAGCTAGTAGCTTAAGAGGTCGCAGATCGGTAACACCAGCATCAAGTTGGAAAGATTTACGTTCCGAGACTCCAGAACAAAAACCATTTCCAATTTTTGGCTTCAAATCGCGATCGGATACAAATGATGAAAAACCATTCAATTGTTTTAGACAACCAAGTTCAGTTACACCCGATCAGAAAACAGATAATGTAAGAGAAACGCAATCTCCCGAAGAGAAATCTAATAATTTAAACAGAAATGTAGTCAGTTCCAATTCTTCAGATTTAATAAAAGAAGTTAAGCCCAACATTCTAAAAAGAGAGTGTACTGAGACCAATAACGAAGGTATAAAACGGGAACGTGTAGACATTTCAGAAGACGAAACGAGCAGCAACTCTTTATCCAATGAAAAGTACTCTAATACCTCTACAAATTTGAGAACGGACAAAGCCCTTAATTCCGTAGCTCCTATGTGCTGTTTAAAACATTGCGGCAAGGAAAAAACTCCGGAACAGCATTTAACCACTTATGGTTTTCCCAAAGACCCTCATTTGTTACAAAAATGGTGTGAAAACTTGGGCTTACAAACTGATCAATGCATAGGACGTGTATGCATAGACCATTTTGAACTTAGAGTTATAGGCACGAGAAGACTTAAACAAGGAGCCGTGCCCACCTTGAACTTAGGACCCAATCGCATGGCAAAACATAATAATGTAGATGAAACGCCACAACAAAGAAAAAATGTTACCAAGGAGCTAGGCGAAACAGCGCACATGCAAGAGGCAGACTCTAATTTAAAAGCACCTCCTCCTTATAAGACGCCTAAACCCGCTAAGCAATCGGTTTTTCGGCTATGCTGCCTCAAACATTGCCGGCGCAAGAAGTTTGTAAAACCGGAGAAGATAGAGGAACTGACGCATCAGAAAATGAAGATGGAGAACATGGAATTGTTGAAGGAGGAGAAAATGGAACAGAGGAGGAGTATATTATTTAAATTCCCCAAGGATGAATTGACATTAAAAAAATGGTATAGAAATTTAAGATTACCCGAAAAATTGCAAATAACACCTGATTTACAAATTTGTGCGAGACATTTTGAACCCAAAGTCATCAAGGATGGTAAATTAAAACCAATGGCTGTACCCACACTAGAATTAAGTTATGCGTGTCGAGCGCCAATTTATTTGAATGAAGAAAATGAAATCTTAAATGATAATATTATTAGCAAGAATGAAGTCATGGAAAAATGTTTCCTCAAGCACTGTGGAAATATTGCCACCGACGAGATTTTTCTTTTATCCTTTCCGGAAAATCAGCCTTTAACCTTAAAGAGATGGTGCAAAAACTTGCAGTTGTCTTTTGGGAAAAATGAATTTAAAGATTTAAAAATATGCAGCGAACACTTTGAGTCTTATGCCTTCTGCAGAAAACGATTGAAAACAGGCGCTTTGCCCACCTTAAATTTGGGTCATAACGAAACGATAATAAGAAATTCTCGTAAATTACGAAGGCAAAGGGTCAATAATAACAATGCCAAAGAGAAATGTTGCCTAAAACAATGTGGGGAATCGACATTGAAACTGTATGCTTTTCCACGTAGCTCCGAATTACGCAAAATCTGGTGTAACAATTTGCAAATAGAATTACGAGAGGCCATGAGTAATCATTATAAATTGTGTGCTCGACACTTTTCTTTGGAAAGTTTTATCGTAGGCTCGGATAATTTAAAATTAAATGCTGTACCTATTTTGAATTTGGGTAAAGAAAGTGAGAAACATTTATTGCTTAATCATGAGGCAGCAAGTGAAAGTAAATGTTTGGTAGAAAACTGTCAGAAGACTCCCAGTGTGGATAGGGTTAAATTGTTTAACTTTCCCGAAAAACCTGACATACTCAAGAAATGGCTTTTTAATTTAAATCTAACTCCAAAGACTTTGAACTCCAATGATGTTATTTGCAGTAAACATTTTGAAAATACCTGCATTCGAAATGGTATAATGCATGAGAATGCTATACCCACCAAATTTCTAACACTCTCCAATAAAGATTGGTTCTATCAGAACAATGAGGAGCTATTTGAGATCTCTAGAAAGTGTTGTGTCTTAGAATGTGGCCAAAACTCGGAAGAAGCTAAACATTTGTATAGATTTCCGAAGCATAAGGAAGATTTGGAAAAATGGCTTTACAATTTAAAATTGCAAGTCGACGAAGCGGAAGTCAAAGATTTAAGGGTCTGTGAAAGACATTTTGAGCAAAGTTGTAAGATTTCCAACAAGGACCTAATAACACAAGCTTTACCCACCCTAAACTTGGGTCACAACGATACAGATATATATGGCAATTACTTTATCAAATGCTGCTTGGATGCTTGTGATACCGAGGGCTTTTACTTTCACAAACTGCCCGAGGATTTAATGTTAAGAAGCTTTTGGTTTCAGGAATTGGAAATGGAGGGTACCTTTAACTCATCCCTCTACATATGCTCTGTGCATTTTGTGGCTTTTTTCGAAAGAATCTTAGAAAAATATAGTGTTTTTCTTAAAGAATCCAAGGAATATGTTAAGCTTTCTGTAACTTATAATGAGCTTAAAGCTTTAAATAATTTACAAAGCTATAAATGCTTTATACCCAAATGTAATTCAGGCTTTAAACTCATATGGAAACTATTTAAATTTCCCAAGGATTTGGGCATGTTTAATAAATGGCAACACAACACGGGACTACAATTTGAATATGAACAGAGAAATTCGTATCGCATATGTGCACAACATTTTGAGGAAAGATGTTTAAGTAAATTTGAACTACATAGATGGTCTTTGCCCACCCTAAAATTGCCTTTCAACAACAGTTTATATGTCAATCCTCCCGAAGCATTACCCTCTAATCATGAAAACTTACAACACTGCTGCGTTGCAGAATGTTCTAACAAGAAGGGTCCCTTTTACAAGTTCCCAATAAGACCTTTAGATATCAAGAAATGGATCCACAATTTGGATTTAGGTTCGCAACAAAGCACCTTGAACTTACGAGTTTGCTATAAACATTTCGAAAACTATTGCTTCTCCAAGGCTGTGGATAAGGTAAAGCCTTTGAAATTCTGGTCTGTACCTACCTTAAAATTGAAAAGGCGTTCGCAACTTTATCTCAATCCCGCCGATAAAATAGCATTTTATGTTTGTAGTTTACCCAACTGTAGGCAGATTCTAAATAAATCAAAAAATATATATCTATACAAATTTCCCCTTAGCAATACATGGCGTCAAAAGTGGTTACACAACTTGTCTCTAAAGCCTCATGAATACCAGGAAACCATGAGAATCTGCTCTACACACTTTGAGAAAAGTTGCTTCTATAAGGACTTAATAAAAATAAAAAAGAAAATTGTGCCTACATTAAATTTGAATAATCCTCCCAAAGATATCTACAAAAATCTTCCTCAATGTTGTGCTAAATTATGTCACAATAATCGCAGTCAACTGTTTAGCTTTCCCAAAAATAAGACGCTCCTCAAAAAATGGTGTAATAACTTACAATTAGAAGGTGATTTAGACAAGGAGACCTTAAGGGATTGGAAGTTGTGCACCAAACATTTTGAAAAGAGATGTATAAATAAATTTGGTGCTTTGAGAAGTTTAGCAGTGCCTACTTTAAATTTGGGACATCAAGGCAGACGAATTTTTAAGAATCCAAATTTTGGAAATATCAAAAAAGTAGTAGTTAAGAAGGAAAATGGTAGAAAGGATGATGAAGTTTCAGAAGAGGAAATAACAAATAGTAAAGGAACTGAAAATGATCTTGTAAAGAAGTCCAAACATTTCTTAAACAAAGACCAAGCAGTAACTGTTATAAGATCCCAACGCCTTAAAGCTAAAAAACTAAATCAAAGCTCAAAGAATGATAACAACGAGATTCAGGAAAGTTTGGAGACTAATAAGGAAAGATATGACACTTCAAATATTGAAGATCACCCTTATCTAGGAAATCTTTTCGAAATTCTTACTCAAAATCATTCTGAAAAGGAGACAACACATGTTATGGAAGCCTTAAAACAAGAACATGCAGAGGAGAAATTTAATATACCCAAAAACCCAACGACAGATATAAAACTGAAGCATGAAAACTCTTCGAGCCTCCTTAAAGATGCAGCCAACATTAAACCAGAAGACTTTTCGGAATTAAATGAAATTTCCCAGGCAGAAGAGAAAAATGTTTTCACAGTTTACGATGTCAGCCAAGAAACACCAAATTCCAATGACTCTTTAGACTACAAGGAATATCCACATCAAGAACACTATCAACATTCTACACATTCCCAGGAATTCTATCTTGAAGAACGTACTGGTTATGAACCCAGTGAACAAAGTGAACAAGAATTCGAAGATGTTGGTCAAGATCAAAAGTGTGAAGAATCGCTAACTAACACAAGCCAAAGTCTTCCTAGATGTTGTATTAAATCATGTTCAAATTATAACAATTACCGGGATCATATACCGCTCTATAAATTTCCTCGGACTAGCTTTTTGCGGCAACATTGGTTGCAGAATTGTAATTTTACACGATGTTCGGCTAAAAACTACAGAATTTGCATTGAACATTTTAATAAAGAATGTTTCCGGGATAGAACACGTCTCTTGTTTGGAGCTGTGCCCACTGAAAAGCTTAAAGGAACATTTGATTTCAAGGAATTTCTCGAAAGTCATAGACAATCAAGATGTTTGGTAGAGAGTTGTCAACGTTCTTCACAACATGATAGAGTTCGTAGAATACCATTTCCCTCTGGGCCTTTACTAGATAAATGGCGTTTAAAATTGAAGCTTAACCAAAGGCATATAACCGAAGAGGATTGGATTTGCCATCGACATTTTGAAAGAAAAACTCTAACAGATGGTTTTAAGCTCAAAAAAGACTCTGTGCCAACTTTATTGTTGCCCAAAGAGGCGCTGAAAAAATGTTGGGTACAATATTGCTCGAATTCAGAGGCTAAACTTTTTAAAATTCCTCTTAAAGATGAGCTACTTTTTAGGAAATGGTTAAGAATTTTGAATTTACAAGACACTCGACTGGTAAGAAATCAGTGCCATGTATGTATTAGACATTTTGAACGAAAATGGTTAGATAAAGGATATTTAAAACCCCAAGGTTTTCCCTCGTTATATTTGAAAAGGCGAGAATTAGGAAAAGTTCAAAAGTTAAAGACAAAAACCGAAGAACTTAAAAATATTTCTAAAACCAAAAAAAGAGTAAAACCTAAAAACTGCATCTATGGTTTATGTAAGTTAGTAGGTGCCTATAACTGGTCCAATGAAGGTATTTTTGCACAGATATGGCTTGAAGAAAAACAATCGTTAGAGGTCAGCAAAACCCAAAATCTAATATCAAATGAAAATGAGGAGTATCTTAAACTATGTGATGAACACTTTTATTATCTGTACAAAAGTAATGAAAAAGTAATTTGTGACCCGGAATCTCATAAAGGACTGGAGGATATAAGACACGAAATGAAACAATTATTTGAGTATCTAAACTCTTTGGAGAAATTCTATACGAAAAAATGTGTAGTACCTCAATGTTTTGTCGATCAACATCTGGCACAAAACTATAAATCTTTAAAACTCTTTGGATTCCCTAAATCGCCTGAAATATGTAAAAAGTGGTGTCATAATGTGGAAATAGAATACAAGTCCTTGAAATCGAAACCTCTTCAAAAAGTGTGTGAACTACACTTTGAGGACTATTGTTTAAGCAAGCGAATACTTTTAAATTGGGCTGTGCCCACCTTAAATTTACCTCAGAAAAATTCTCATTTCATAATACCCAATGATCCAGATGAGGAATTTGTCTTGAAAGGCCGCTGCTGCATTAAGGCCTGTATCAATGCTTACGGTTTGGATGATAAAACCAATTCAAGATTTTATAGATTTCCTCAGGAATCGGCAAGGCTGGAAAAATGGTTAGAGCTTACGCATATTGAGGACTTTGAAGAAAACGTTACACAAATTTGTGGTTTGCATTTTGATCCAAAAGACTTTCTCAATAAAAATAGAGAATTAAAGGAAGACGCGTTGCCCCGTTATAATCTAGATCCTGAAACCCCAGATGCTTCAGTATTAGATGATTTGATAGAAGTTAAACAGGAATTGGATAACTCGGAAGAATGGTGTGATCAAGATAATATCATAAATGAGAATTTAGAAAATTTTGATATTCCCAAATGTAAAGAAATATCCAAAGAATCTGATATTTATACACAAATTGAAATAAAGCAGGAAGTTATTGAAATACAGGAAGAAGAAGCTGTAGAGCTAACAAATTGCTCTAGGAAAAGTTTGGAAACTTCAGAACTTCACAAAACAAACTTCTACGAAGAAAACTTAGATTTTAGGAAAAAAATCTCTAAAGAACTTCATTTATCTAGTGAAATGGATATCAAACAAGAAATTTTAGAAGAACCTTTTGAGGTCCCCAGTTCAAGCCAATCTACACTTTTTACCATAGAAAAATTAGAAATCTCGACTGAATTTGGAACTAGCATTAATAACCCAGAGTCCTCATTTGTTATAACCGATGTCAAATCTCAGATCTATCTATGTTGTGTACAAAAATGCTGCAACAATTCCGAAACCCCAGGAATACAGTTCTTTACACAATTCCCTCAAGATTCGGAAATTTTCATTAAATGGTGTTTTAATTTAAAAATTGACCCGCGTCATTATCAAGAGAATCAATATGCCATTTGTCAGCAACATTTTGAACCCATATGCTTTGACTCAACGACCAACGAATTACACACTTGGTCGGTACCCACTTTAAATTTGAATTTAAATGAAAATTCTTTTATACACCAAAATGATATACCCGAACATTTAAAGTCCACCTCAGAACAATGTATAGTCTATGGTTGTATACATCCCGTATTGCCACTCTTTAGATTTCCCCATAGTCCTGAAATGGCACAAAAATGGTTTTCAAATTTACGGCTTGATTATACCGATTTTCGGGCTCAAAATTATCGTATTTGTAAAAGACATTTCCCCGCGGTTAGTTTTGATATGAACGATTTGAACAAACTTAAATCCGAGGCAGTTCCTACGCTTTATTTGGGACATACGGATAAGATTGTATATTTTAATAGTTTGGATGAGAGGCCATTGGATGTGGCAGAGGCGGGCGGGAATCATGATAATAGTCGCGGCAGCAGTCAGGGATCTTTCAATAGAATAATATCGCCCCACAATCTTGAAGATCATGATAGTAGTTATTTTGAGGACTTTGAAGAATATTATGGACAGGATGATTAA
Protein Sequence
MSQNNQRKHYHIHAPYQHPHQQQQQQQQQQQAQLHHHHHLTAPSHSQQHQHNLQQQQQHQWYSQQHYQQHGLHVRDSRHLPHHPPSPQQPPPAHHNQTMSPHMFTSGYVGMSQSGTGAGGGSSSATHNASALGSTHNMPAATAASSSAHHYSSSSSSATSASNVNSNNAASNAASSSSAAGAFATSRNRMFDLEMLPTQQHSQHNSTSTAAHSHSMLAATNNRTAFDAYSHNSLYSQQNQRHHSATASSHYPLASGGHHPAHHAHHHIPPSHHQPHSSSLHHQQQQHHQLHQQPQHHYYHHRAPTTSLHRPHSQVLAPMLQHIKSEPVEQITVTPSIQTEEVIIKSEPIDDTGYHQHKSAPQMENNSFHMEEKRKQYELQQHQQQQRQQQQQQQHQQREREIQQQQRLQQQLQEQQRHQREQQQQQRQQQHQQQQELHQHQQLQIKQEPHDYPTHLHHHHDRQSKHHQHNEDVTQQTQQQSHINSENSTAIQPAEKNQQEQNQQKQEEKPEQQQKQEQQQLISLTNIKTEAKPLNFPRRKLQTERSSTLPICQRCKQVFLKRQNYTQHVALSSCNIVEYDFKCSVCPMSFMSNEELQTHEQLHRSHRYFCQKYCGKFYETIDECEQHEYGQHEYEMYKCNICCISVTQREQLFTHLNEHKYQPRFDCCICRLCFQTSLELHDHYMTNEDFCGKFYDKEAFKKTITSLSKTYLGKPESSKLEIAHTFSLKDIPSANSQQLEPLYTKPSTSKSYMEPPTTPTTPSFKNFSSNEFPLEPHVEVKTEIKVEPDFYPPMDQSDFPSYDNDYSSSDYTSGSNQSLAFLHDFHDNASSSTNSSYSHNANDAIQDDEAICCVPKCGVRKFSSPSLQFFGFPRDEKYLAQWLHNLKMVYDPNINYGIYRICSLHFPKRCIAKYSLSYWAVPTFNLGHDDVGNLYQNRESSGGFPAGDLAKCSMPNCPSQRGETNVKFHVFPRDLKTLIKWCQNSRLPVHSKDNRFFCSRHFEEKCFGKFRLKPWAIPTLNLGTVYGKIHDNPNIYQEEKKCFLPFCRRSRSYDCNLSLYRFPRDETLLRRWCYNLRLDPNMYRGKNHKICSSHFIKEALGLRKLNPGAVPTLNLGHNDRFNIYENELYTPPPPPPPPQPSTSSKAQKYAERFKQEMGGSHIYDGVFMNSMVHKFSSSSSSASNNSNNLDLGDVCLVPSCKRTRHSSDITLHTVPKRAEQLKKWCHNLKMDLHKMHKSVRICSAHFEKYCIGGCMRPFAVPTLELGHDDSNIYRNPDVIKKLNIRETCCVQTCKRNRDRDHANLHRFPTHPELLQKWCENLQKPIPDGTKLFNDAVCEMHFEDRCLRNKRLEKWAIPTLNLGWDEAPHILPSEEEVNENWVKPFAPNNGDEQGECCVSSCKRNPQIDDIKLYRPPEDAEQLVKWAHNLQVDVTELPNMKICNLHFEQHCIGKRLLNWAMPTLNLGGKVEHLFENPPPMPTIYRKKIKPERILSNHEAIKWSPRCCLPHCRKMRTVDKVHLFRFPYNHRQTLAKWCHNLQLPLVGSSHRRICSSHFEPSVLTKRCPMSLAVPTLDLNPPTGYKIYQNPARLKQIKPGTQRQCIIESCRKTKMDGVSLYRFPNNRSILLKWRHNIKNWPKGKLSNQLRICAEHFETHSVGEKKLSPGAIPTLKLGHDSKDLYANETRSFFDLEKCVVNGCDSRKEMEDIRLFRFPRDDDELLKKWCHNLKMNPNDCVGIKICSKHFENDCLGPRQLYKWCIPTLKLGYAEDDLVEIIPNPPPEQRTGEYLFKCCVPNCGKTRKYDDAQMNSFPKHLKMFRKWKHNLKLDFLNFKEREKYKICNDHFEAVCVGKTRLNFGALPTLNLGHDDTDDLYQINPERIRPNLFIRQKDVEILERRRILRHEKQEQYECEEPEEDPVNDPLGLEPGDMKCVVDDCPAPKSIMREPYDLPETSELNKLWLKELGKNDEDGIPSEAKVCGLHFQMTYIKLKNQMLELSEDNSLIQADVNKLQLNYQKSNISLVVNSYQCRVNDCPTNLLNSSIRLYYFPYGKHLISKWSQNTGITPDEHRRYMNKVCALHFETYCITENQRLRSWAIPTLNLPPSEGKHLNKNPDLTKLDRRMLGPPVWKCAVTNCNSLKSGDDDSIKLFNFPSEDKLLKKWCDNLKISHHFTPLMKICSLHFEKLCFGSSRIRSWAIPTLNLGHDNTPEHFNKTTIRQEVYDQNEEVEATQLKQVKIKKSLDTAKCFVACCRKSRLKHGVRFYSLPSNSRVKRKWLHNLQISQLKSKHKLQNIKICNLHFHKRCLDGKILKPWAVPTMHLDYTEGIFDNPRRMQSLPILRCVLAHCNNHTGLKGVRLFVFPKSPEFLKKWSKNLKLDLEKCKGRICQEHFENEVIGEKKLKNGAVPTLNLGHEDDDIYDNSELMEKLKIKKIEKELKQDPLETKNEEDCDEEYEPLDGEEVEEEEEMWETDIEEEEEGEEDEDQTYFDDEEERENAAKQEEPQEDDESSVTNSVKDWSSVKFKELRVSITPLTPEDLMDLCSRSSYEREFGSLTPASSLRGRRSVTPASSWKDLRSETPEQKPFPIFGFKSRSDTNDEKPFNCFRQPSSVTPDQKTDNVRETQSPEEKSNNLNRNVVSSNSSDLIKEVKPNILKRECTETNNEGIKRERVDISEDETSSNSLSNEKYSNTSTNLRTDKALNSVAPMCCLKHCGKEKTPEQHLTTYGFPKDPHLLQKWCENLGLQTDQCIGRVCIDHFELRVIGTRRLKQGAVPTLNLGPNRMAKHNNVDETPQQRKNVTKELGETAHMQEADSNLKAPPPYKTPKPAKQSVFRLCCLKHCRRKKFVKPEKIEELTHQKMKMENMELLKEEKMEQRRSILFKFPKDELTLKKWYRNLRLPEKLQITPDLQICARHFEPKVIKDGKLKPMAVPTLELSYACRAPIYLNEENEILNDNIISKNEVMEKCFLKHCGNIATDEIFLLSFPENQPLTLKRWCKNLQLSFGKNEFKDLKICSEHFESYAFCRKRLKTGALPTLNLGHNETIIRNSRKLRRQRVNNNNAKEKCCLKQCGESTLKLYAFPRSSELRKIWCNNLQIELREAMSNHYKLCARHFSLESFIVGSDNLKLNAVPILNLGKESEKHLLLNHEAASESKCLVENCQKTPSVDRVKLFNFPEKPDILKKWLFNLNLTPKTLNSNDVICSKHFENTCIRNGIMHENAIPTKFLTLSNKDWFYQNNEELFEISRKCCVLECGQNSEEAKHLYRFPKHKEDLEKWLYNLKLQVDEAEVKDLRVCERHFEQSCKISNKDLITQALPTLNLGHNDTDIYGNYFIKCCLDACDTEGFYFHKLPEDLMLRSFWFQELEMEGTFNSSLYICSVHFVAFFERILEKYSVFLKESKEYVKLSVTYNELKALNNLQSYKCFIPKCNSGFKLIWKLFKFPKDLGMFNKWQHNTGLQFEYEQRNSYRICAQHFEERCLSKFELHRWSLPTLKLPFNNSLYVNPPEALPSNHENLQHCCVAECSNKKGPFYKFPIRPLDIKKWIHNLDLGSQQSTLNLRVCYKHFENYCFSKAVDKVKPLKFWSVPTLKLKRRSQLYLNPADKIAFYVCSLPNCRQILNKSKNIYLYKFPLSNTWRQKWLHNLSLKPHEYQETMRICSTHFEKSCFYKDLIKIKKKIVPTLNLNNPPKDIYKNLPQCCAKLCHNNRSQLFSFPKNKTLLKKWCNNLQLEGDLDKETLRDWKLCTKHFEKRCINKFGALRSLAVPTLNLGHQGRRIFKNPNFGNIKKVVVKKENGRKDDEVSEEEITNSKGTENDLVKKSKHFLNKDQAVTVIRSQRLKAKKLNQSSKNDNNEIQESLETNKERYDTSNIEDHPYLGNLFEILTQNHSEKETTHVMEALKQEHAEEKFNIPKNPTTDIKLKHENSSSLLKDAANIKPEDFSELNEISQAEEKNVFTVYDVSQETPNSNDSLDYKEYPHQEHYQHSTHSQEFYLEERTGYEPSEQSEQEFEDVGQDQKCEESLTNTSQSLPRCCIKSCSNYNNYRDHIPLYKFPRTSFLRQHWLQNCNFTRCSAKNYRICIEHFNKECFRDRTRLLFGAVPTEKLKGTFDFKEFLESHRQSRCLVESCQRSSQHDRVRRIPFPSGPLLDKWRLKLKLNQRHITEEDWICHRHFERKTLTDGFKLKKDSVPTLLLPKEALKKCWVQYCSNSEAKLFKIPLKDELLFRKWLRILNLQDTRLVRNQCHVCIRHFERKWLDKGYLKPQGFPSLYLKRRELGKVQKLKTKTEELKNISKTKKRVKPKNCIYGLCKLVGAYNWSNEGIFAQIWLEEKQSLEVSKTQNLISNENEEYLKLCDEHFYYLYKSNEKVICDPESHKGLEDIRHEMKQLFEYLNSLEKFYTKKCVVPQCFVDQHLAQNYKSLKLFGFPKSPEICKKWCHNVEIEYKSLKSKPLQKVCELHFEDYCLSKRILLNWAVPTLNLPQKNSHFIIPNDPDEEFVLKGRCCIKACINAYGLDDKTNSRFYRFPQESARLEKWLELTHIEDFEENVTQICGLHFDPKDFLNKNRELKEDALPRYNLDPETPDASVLDDLIEVKQELDNSEEWCDQDNIINENLENFDIPKCKEISKESDIYTQIEIKQEVIEIQEEEAVELTNCSRKSLETSELHKTNFYEENLDFRKKISKELHLSSEMDIKQEILEEPFEVPSSSQSTLFTIEKLEISTEFGTSINNPESSFVITDVKSQIYLCCVQKCCNNSETPGIQFFTQFPQDSEIFIKWCFNLKIDPRHYQENQYAICQQHFEPICFDSTTNELHTWSVPTLNLNLNENSFIHQNDIPEHLKSTSEQCIVYGCIHPVLPLFRFPHSPEMAQKWFSNLRLDYTDFRAQNYRICKRHFPAVSFDMNDLNKLKSEAVPTLYLGHTDKIVYFNSLDERPLDVAEAGGNHDNSRGSSQGSFNRIISPHNLEDHDSSYFEDFEEYYGQDD

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-