Basic Information

Gene Symbol
-
Assembly
GCA_932527255.1
Location
CAKOBP010000025.1:364161-386879[+]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 33 2.8e-15 4.1e-12 46.8 1.2 1 86 839 911 839 912 0.86
2 33 2.8e-15 4.2e-12 46.8 4.7 1 87 939 1008 939 1008 0.80
3 33 1.6e-15 2.4e-12 47.5 0.3 1 87 1029 1101 1029 1101 0.83
4 33 7.4e-14 1.1e-10 42.2 4.2 1 86 1184 1252 1184 1253 0.78
5 33 2.1e-15 3.2e-12 47.2 5.7 1 87 1277 1349 1277 1349 0.81
6 33 1.6e-12 2.4e-09 37.9 1.1 1 87 1384 1452 1384 1452 0.81
7 33 1.2e-11 1.7e-08 35.2 2.7 1 85 1493 1561 1493 1564 0.74
8 33 1.1e-14 1.6e-11 44.9 0.2 1 86 1590 1659 1590 1660 0.81
9 33 3.5e-14 5.2e-11 43.3 1.3 1 86 1682 1751 1682 1752 0.80
10 33 7.1e-13 1.1e-09 39.1 3.2 1 87 1780 1852 1780 1852 0.85
11 33 1.1e-06 0.0017 19.2 0.3 1 79 1919 1992 1919 1996 0.75
12 33 2.9e-11 4.3e-08 33.9 0.3 1 87 2016 2088 2016 2088 0.82
13 33 2.1e-13 3.2e-10 40.7 1.6 1 87 2119 2189 2119 2189 0.81
14 33 3e-13 4.5e-10 40.2 6.5 1 87 2234 2307 2234 2307 0.85
15 33 2e-13 3e-10 40.8 0.5 1 86 2329 2396 2329 2397 0.79
16 33 1.1e-14 1.6e-11 44.9 0.3 1 87 2718 2787 2718 2787 0.80
17 33 9.2e-12 1.4e-08 35.5 3.7 1 86 2845 2934 2845 2935 0.75
18 33 4.3e-12 6.5e-09 36.5 2.7 1 87 2968 3040 2968 3040 0.79
19 33 1.2e-11 1.8e-08 35.1 2.8 1 87 3067 3136 3067 3136 0.80
20 33 3.2e-13 4.8e-10 40.2 0.4 1 87 3156 3226 3156 3226 0.80
21 33 3.6e-14 5.4e-11 43.2 0.7 1 87 3249 3321 3249 3321 0.83
22 33 3.5e-05 0.052 14.4 1.0 1 59 3337 3384 3337 3409 0.80
23 33 6.7e-13 1e-09 39.1 4.3 1 86 3425 3494 3425 3495 0.81
24 33 4.6e-13 6.8e-10 39.7 4.9 1 86 3520 3590 3520 3591 0.82
25 33 5e-13 7.4e-10 39.6 3.1 1 86 3611 3683 3611 3684 0.78
26 33 1.1e-11 1.7e-08 35.2 3.4 1 86 3704 3773 3704 3774 0.78
27 33 9.6e-11 1.4e-07 32.2 1.1 1 87 4140 4215 4140 4215 0.81
28 33 1.6e-07 0.00023 21.9 0.2 1 86 4234 4303 4234 4304 0.75
29 33 0.24 3.5e+02 2.1 0.2 32 79 4373 4430 4332 4436 0.64
30 33 7.1e-16 1.1e-12 48.7 3.7 1 87 4451 4526 4451 4526 0.87
31 33 1.4e-10 2.1e-07 31.7 3.3 1 85 4551 4620 4551 4622 0.74
32 33 1e-13 1.5e-10 41.8 3.8 1 87 4764 4837 4764 4837 0.79
33 33 6.7e-12 1e-08 35.9 0.4 1 87 4862 4932 4862 4932 0.80

Sequence Information

Coding Sequence
ATGTCACAGAataaccaacgtaaacattatcacatccatgctccctatcaacacccacaacaacagcagcagcagcagcaacagcagcagcaaactcatcttcatggccaccatttaactgcatctcagcagcagcagcagcaacaccagcaatggtatacacagcaacattatcaacatggacttcatttaagagattcgcgccatattcagcatccacaacatcatccccatcatcatacacaacaacagcagcagcaaccacatcaCAATCATACAATGTCACCACACATGTTTACAAGTGGTTATGCGGGTATGTCAGCAACTGGGGGAGTTGTAAGTGTAAGTGCCGGTGCCGGTGGTGGAGGTGGTAGTGGTAGTGGTGTTGGTGGTGGCAACAGCACTGGTAGTGGGGTTGGGGTAACTAGTTCAGCACATAATACGTCTACAGTAGGTGCTACAACGCTTAACATGCCGGTAACATCGTCTTCTTCACATCATTATTCTTCTCCTTTATCTGCTACGGGTGGGAGTGTTGCCGGCAATGCGAATAATGCTACCGGTGGTGGTACTTATGTTGCTCGTAACAGAATGTTTGACCTTGAAATGTTAACACAACCTCAATCACAACAACATTCACAACATAGTGCAGCACCTTCTGCACATTCACACTCTTTGCTATCGAGTGGAAGCTCAAGCGGACGAACAGGATTTGACGCATATTCACATAACTCTTTATATGCACAACAAAATCAGCGACATATTTTAGGAGCAGGATCTTCACACCATCATCTTTCTGCTGCACATCATTCAGCAGCTCATACTTTGCATCCTCATCATCATGCCCAACAACCACATCATACACACCAACAGCCGCCAGCGGCTCTGCATCATCATCAACAACAACAACATTACTATCACCATGCTCAGCAGACTCCTCTGCATCGGCCACATACACAATTAATGCCACCAATGTTGCAGCGCATTAAATCTGAGCCAGTAGAACAAATTACCGTAACACCGTCAATACAAACCGAGGAAGTCATCATTAAATCTGAGCCGCCCGATGATATTGGTAGTTATCATCTCAAAAATTTGCCGCCAATTGAAAGCAAATCTTTtggtagagaagaaaaacataaacaacgtgaacagcgaacacaagagcaacaacatcatcaacatcaacaacaacaacaactgcatctgcagcagcaacaacgtgcacaacatcaacatcaacaactattacatgagcaacaactacatcagccgcaactaataaaaataaaacaagaacattatcaTCATTCTCAAAGTGGTGGACACACATCTCATAATGAAGATGTTTCCCAACAAACACAACAGCAACGTAAAAATTCGGAAAATtctacaacaatgcaaccaccggcagttccagtaccagtttcagtaacagcatcatcatcatcatcagcagcaacaacggcagctgatgataagcaaaaacaacaagaacaccaagaacagccacaaATATCCTTGACTAACATAAAAACAGAAGCAAAGCCCCTCAACTTTCCTCGCCGTAAATTACAAACAGAACGTTCCTCAACTCTGCCGATATGCCAGCGATGTAAACAAGTTTTTTTGAAACGTCAAAACTATACACAACATGTTGCTCTATCAACTTGCAATATTGTGGAATATGACTTCAAATGTTCCGTTTGTCCCATGTCCTTTATGTCTAATGAGGAGCTGCAGACACACGAACAATTACATCGCTTGAATAGATACTTTTGTCAAAAATATTGTGGCAAATTTTATGAAACAATTGAGGAGTGCGAACAACATGAATATGGCCAGCATGAATATGAAATGTTTAAATGTAATATATGTTGTATAAGTCTTACAGAACGTCATCAACTGTTGATTCACCTGAATGAACACAAGTATCAGCCGCGTTTCGATTGCTGTATATGTCGTTTATGTTTTCAAACGCCCCTGGAATTGCATGATCATTACATGGCTAATGAAGATTTCTGTGGTAAATTTTATGATAAAGAAGCCTTCAAAAAACCTAATACCTCGTTAAAGTCACCTTATCTTAGAAAGCCGGAAAGCTCGGAATTGGAAATAGCTAATACATTTTCTTTAAAAGATATACCTCCTGCCAGCAGTCATCGCTTGGAAGCTTTATATGCAAAACCCTCCACATCAAAATCTTCAATGGAACCACCTAAAACACCCACAACACCTTCATCGTTTTCGTTCGGCACTACAGCAAATGAATTGGCCGTTTTGGAACCCCAAATAGAGGTTAAGACGGAAATTAAAGTTGAACCGGATTTTTATCCGCCCATGGATCAGTCCGATTTTTCAAACTACGACAATGATTATAGTACTCCGGATTATGCTACATCGAGTAATCAAAGTTTTTCATTTTTGCAAGATTATCAGGACAATGCTTCCAGTTCCACGAACTCTTCGTTTTCCTTTAGCAATAACGATGCTGTACAAGATGAAGATGCCGTTTGCTGTGTACCCAAATGTGGCGTTAACAAATTTTCATCGCCATCTTTGCAGTTCTTTGGCTTTCCCAAAGAAGAAAAGTATTTGTCACAATGGTTGCACAATTTAAAAATGGTATACGACCCCAATATCAACTATTCATTATATCGTATTTGTAGTTTACATTTTCCCAAACGTTGCGTAGCTAAATACTCTTTAAGTTATTGGGCAGTGCCAACTTTTAATTTGGGTCATGATGATGTGGGAAACTTATATCAAAATAGAGAAAGCTCTGGGGGGTTTCCAGGTGGGGAAATGGCAAAGTGTAGCATGCCTGGTTGCCCTTCGCAGCGCGGTGAAACGAATGTTAAATTTCATGTCTTTCCTAGGGACTTAAAAACCCTGATAAAATGGTGTCAGAACTCTCGTCTGCCCGTACACAGTAAAGACAATAGATTCTTTTGTTCGAGACATTTTGAAGAGAAATGTTTTGGCAAATTTCGTTTAAAGCCCTGGGCCATACCTACACTTAATTTGGGCACAGTTTATGGCAAAATCCATGATAATCCCAATATTTACCAAGAAGAGAAAAAATGTTTTCTGCCCTTCTGCCGACGTAGTAGATCTTATGATTGCAATTTGTCTTTATATAGATTTCCGAGAGATGAAACGCTATTGCGTCGCTGGTGTTATAATTTGAGATTGGATCCGAATATGTATAGAGGTAAAAATCATAAAATTTGTTCTTCTCATTTTATTAAAGAAGCTTTAGGTTTAAGAAAGTTGAATCCGGGGGCAGTGCCCACTTTAAATTTGGGTCATAATGATAGATTTAATATATATGAAAATGAATTGTATACACCACCGCCGCCGCCACCACCTCCCCAACCGTCTACATCGTCAAAGGCGCACAAATTTGCACAACTATTTAAGCAGGAAAGAGATGCTTCCTCTGCATCACATATTTATGATGGCGTATTTATGTCTTCGATGCATCAAAAATTGTCCTCTTATTCTACTCCAAACCCGAATGCAAATAATTTGGATTTGGGTGATGTCTGTCTAGTGCCATCTTGCAAAAGAACGCGTCATTCAGATAACATAACTCTGCACACGGTACCCAAGCGGGCGGAACAGCTCAAGAAATGGTGTCACAATTTAAAAATGGACTTGCATAAAATGCACAAAAGCGTGCGTATCTGTAGCGCCCATTTTGAAAAGTATTGCATAGGCGGCTGTATGAGACCATTTGCCGTACCCACTTTAGAATTAGGTCATGACGATGCGAACATTTACCGCAATCCCGATGTTATAAAGAAATTGAATATCCGAGAAACCTGCTGTGTAAAAGTATGCAAAAGAAATCGTGATCGCGATCATGCCAATTTACATAGATTTCCGACCAATCCGGATTTGTTGCAGAAATGGTGCGAAAACTTACATAAGCCTGTACCCGACGGCACTAAACTATTCAATGATGCTGTTTGTGAAGTACATTTTGAGGATAGGTGTTTGCGCAATAAACGTCTGGAGAAGTGGGCCATACCAACAATTAATTTGGGTTGGGAAGATGTGCCGCATCGTTTGCCTTCGGAGGAGGAAATCAATGAGAACTGGGTTAAACCTTTTGCTCCTAACAACGGTGATGAGCAGGGCGAATGCTGTGTAAGCAGTTGCAAACGTAATCCCCAAATTGATGATGTCAAACTGTACCGCCCGCCCGAAGATGCGGAGCAGTTGGTTAAATGGGCGCACAACTTACAAGTGGATGTGACGGAATTGCCGAATATGAAGATCTGCAATTTGCATTTTGAACAACATTGCATAGGCAAACGTTTGTTGAACTGGGCCATGCCCACTTTGAACTTGGGTGGCAAAATAGAGCATCTATTTGAAAATCCTCCTCCCATGCCTGCCATTTATAAGAAAAAAATGAAACCTGAAAGAATGCAAAGCAGCCAGGAGAGCATCAAATGGTCTCCCAGATGCTGTTTGCCGCACTGTCGTAAAATGCGTTCAGTTGATAAAATACACCTTTTCCGCTTTCCGTACAGCAATCGCCAAACTTTGGCCAAATGGTGTCACAATTTGCAATTACCTTTGGTGGGCAGTTCGCATCGTCGCATCTGCTCCACCCATTTTGAGCCATCCGTGTTAACCAAACGCTGCCCCATGAATTTGGCAGTACCCACTCTAGACCTTAATGCTCCGGCAGGCTACAAAATCTATCAAAATCCTGCTCGCCTTAAACAAATAAAGATAGGCGCCCAAAGACATTGTCTTATAGAATCTTGTCGCAAAACTAAACTGGACGGTGTTACGCTTTTCCGTTTCCCTAATAACAGGTCTATGTTATACAAGTGGCGTCATAATATTAAGAATTGGCCTAAAGGCAAACTCACATCCATCGTGCGAATATGCTCTGACCATTTTGAGCCCCATTCAGTGGGCGTTAAAAGACTTTCGCCCGGTGCTATACCCACTTTGAAATTGGGCCATGAAGGCAAAGATTTGTATGCCAACGAAACAAGATCATACTTTGATTTGGAAAAATGTGTGGTAAGCGGCTGTGATTCACGCAAAGATATGGAAGACATACGGCTCTTCCGGTTTCCGCGAGACGATGATGAATTGCTTAAAAAGTGGTGCAACAATCTAGCCATGAACCCCAATGACTGTGTGGGCATTAAAATCTGTAGCAAACATTTCGAGCCAGATTGTTTCGGACCTAGACAGCTATTCAAATGGTCTATACCGACCCTAAAATTAGGACATAAGGAAGATGATTTGGTTGAAATAATACACAATCCCCCACCCGAACAAAGATCAACTGAGACCTTGTTTAAATGTTGTGTACCTTCCTGCGGCAAGACGCGCAAATACGATGACGCACAAATGAATAGTTTTCCGAAGAATTTCAAACTATTCCGCAAATGGAAACATAATCTCAAATTGGACTTTCTCAATTTCAAAGAAAGAGAAAAATATAAAATTTGCAATGATCATTTCGAGCCGGTTTGTGTTGGCAAGACACGTCTTAATTTTGGCGCTCTGCCCACCATGAATTTGGGCCACGAAGAAGTGGATGATTTATATCAGATTAATCCGGACCGAATAAGACCGAATTTGTTTATTAAACAAAAAGACGTTGAAAGATTGGAGAGGAGAAGATTATTAAAAGAAGATAATCGGGAACAAATTGAGGGAAATGATTTAGATGAGGACAATCTAGATCTCTTGAATTTGGAACCCAGTGATGTAAAGTGTTGTGTAAGTGAATGTTCAGCACCTAAATCTATAATGAGAGAACCTTACGACTTGCCGGAAAGCGACGAGTTTAGACAATTATGGTCAAAGGAATTTGGCGACAACGACGAGGAAAATTCATTAAATGAAGCTAAAATTTGTGGCTTACATTTTCAGTTAATATTTAATAAACTTAAAGTGGACATGCTGGACATGACCAATAAAAATGAGGACCTAAAAACCGATTTTAATAAATTACAATACAATTACCAAAAAGCCGATATATCTTTGGTTATTAATAGTTATCAGTGCAGGGTTGAAGGTTGCGCCACCAATTTGCTTAACTCTAATATAAGGCTTTACTTCTTTCCCTATGGCAAGAACTTGGTTAACAAATGGTCGCACAATACCGGCATAATACCGGATGAACATCGCAGATATATGAACAAAGTATGTGCTTTACATTTTGAATACTTTTGTATAACAGAGAATCAAAGATTAAGGTCTTGGGCCATTCCTACGCTTAATTTGCCCGCCAGCACAGAAGAAAAACATTTATATAAAAACCCTGATTTGACGAAACTCGATAAGAGAATGTTGGGCCCGCAGATTTTAAAATGTGTCGTCAAAGATTGCAATTATCTTGAATTAGAAGACGAGTCTCTCAAACTGCTCAATTTTCCCAGTGACGACAAATTGTTAAAGAAATGGTGTGATAATTTAAAAATGTCACACCATTTGACGCCTTTGCTTAAAATATGTTCGTTGCATTTTGAGAAAAATTGTTTTGGCAGCTGCCGCTTACGCTCCTGGGCCATACCCACCTTAAACTTGGGGCATGAGGAGAGTCCCGAACATGAAAATAAAAATACGATCAGAAAAGAAGTTTATGAGGCTGAAGAGGAGATTTCTGAGGTCCAACTAAAACAAGTTAAAATCAAAAAATCATTGGATACCACCAAGTGTTGTGTGCCTACTTGTCGCAAGAGTCGATTGAAACATGGCGTTCGTTTTTACAGCCTGCCCTCAAACGTCAACATGAAGCGCAAATGGCTGCATAATTTGCAAATCAAACATTTAAAACTCAACCAAAAACATCATAATATTAAAATTTGTAACTTGCATTTTCATAAGAGATGTTTAGATGGTAAATTAATCAAATCCTGGGCAGTGCCCACCATGCATTTGGGTCACCACGAAGCAATCTATGACAATCCTAGAAGACTACGAGCTATAACGAATCTACGCTGTATGCTGCCACATTGTAAGAATCACACCAGTTCCAAGGCTGTACGTTTGTTTGTGTTTCCCAAGTCATTGGAATTTTTAGAGAAATGGTCAAAGAACTTGAAACTCGATACAGAGAAGTGCAAGGGTCGCATATGTTATGAACACTTTGAGAAAGGTGTTGTGGGGCACAAAAAGCTTATAAGCGGGGCTGTGCCGACGCTTGATTTGGGCCATGAGGATACCGACATTTTTAATAATAAGGAGCTATTGAACAAGTTTAGAAGCAAACAAAATGAATTGCAGAGAATCAAGGAAGTTAGCAAATTGAAAACAGTTGAAGACGAATTTGAAGAAGAAGAGTATGAACGTGAGTCGGAAGATGAAGAAGAGGAATTATGGGAATCGGAAGAAGAAGAAGAAGGGGAGGAAGAAGAGGAGGAGGGAGAAGATGAAGAACAAATATTTTACGATGATTACGAGGAGGACGAAGAGGAGAATGATGAGACGCATGACGAAGCAGACAAAACAGACCCTCAACAAGATGATGATGCCTCCAGTGTGACAAACTCCATATCAGACTGGAGCTCTATTAAATTCAAAGAATTAAGAGTGTCTATTACGCCTTTAACACCAGAAGACCTAATGGATTTGTGTTCACGTTCTTCCTATGAGAAAGAATTTGGCAGCTTAAAAGGTCGGCGCTCCGTAACACCAGCTACAATTTCCAAAGATTTGCGCTCAGAGACCCCAGAGCAAAAGTCTTCTTATTTCAGTATTAGCAGCGCCCAAGAAACGGCGGATAAAAATCCCTATAAATACTTAACAGAATCTCGCTCGCTCACACCGGAACAGGTAATGGATCATTTTCTAGAACCGAAATCGAGTGAAAGAAAATCTATTAGTCCACAAGATCCCTTGGCGGAAAATTGCGAAGAATATACTCAAAAAACCAACAAACAAATAGAATCCTTAGTTTTCTGCAAGGAAATTAAATCCGATATTGATTTGAGTAACAACAAGTTAAAAAGGGAAAACTCTAACTTTGATAAAGAAACACCAAAAAGAGAACGATTAGATTTGTCAGAGGATGTAATAACGACCAATACTGGCCTACTACACGAAAATGTAGCTGTAAATGAAACAAATTTGAGGACCGACAAGGCTTTGAATGCAGTTGCTCCTATTTGTTGCCTGAAACACTGCGGCAAAGAGAAGACACCGGAACAACATTTAACCACTTATGGTTTTCCCAAAGACCCCCAACTTTTACAAAAATGGTGCGATAATTTGGGCCTGCAGCCTGAGGAATGCATAGGACGGGTTTGTATAGATCATTTTGAACTCAGAGTAATAGGCAGCCGCCGGCTGAGACAAGGTGCTGTACCCACCTTAAATTTGGGGCCTAATCGCCAACCTAAGCATACAAATCTGGAGGAAACTCCTCAAAAGAAGAGCGTATCAAAAGATTTCAACGAAACAGGAAATATGCAAGAAGCCGACGCAACATTGAAGCCACCACCACCTTATAAGTTGCCCAAGCCCAGTAAGCATTCGGTTTTTCGGCTATGTTGCCTCAAACATTGCCGACGCAAGAAATTCTTAAAGCTAGAGAAGAAAGAGCAACAACTGCTGAACCACCGGCATCCGCAGCAGCAGGAAACAATGGAAATCTTATTTAAATTTCCCACAGATGAGAATCTGCGAAAGAAATGGTTTAAAAATTTAAGATTACCCGAAACTTTGAGCGTTAAAACTGACCTATTTATCTGTTCCCAGCACTTTGAAAACGCTGTAATACAAAACTGCAAACTACTGCCTTTGGCTGTGCCCACTCTTCAGCTAAGCTATAGCAATCGGGCCGCCATTTACTCCAACAACCCAGAAGATTTGAAACGCAGCTGCTTGAAAATTAAGCCAAAATCAAAAATAACAAAATGTTTTCTACCCCACTGTGCAAACAAAGAGACGGGGCTGGTGTTTTTAATATCTTTTCCCGAACATGAACCTTTAGCCTTGAGGAAATGGTGTAAAAATCTAAAACTCAAGTACCATCCTTCTAAATATAAAACTGCCAAAATCTGTAGCGTGCATTTCGAACCATATTGTTTTTTTAAAAAACGTCATCTGCGTGCAGGAGCTCTTCCCACGTTGAATTTGGGCCACACAGATACTATTATGCGGAATTGTCGCAAATTACGCCTAAAAAGAGAACACATTAAAGCTGAAGAGAAATGCTGTCTCAAGGATTGTCAGTCAACCAATTGCAGACTTTATGGGTTTCCCCGAAGTTGCGAATTGCGCAAAATCTGGTGTAATAATGTGCAAATCGAATTAAGTCTTGTATTAAACAATCACTACAAAATGTGCGCCAAACACTTCTCTGCAGACAGTTTTATACAGGGCTCGGAAAACCTAAAACTCAATGCGGTGCCCAGTTTGAATCTAAGCTCGAATGTAGAAGATCGTGTGCTGCAGGCCACCAACTCTGACGAAAGCAAATGTATTGTAGGAAATTGTCGAAAAACTCCCAGTGTGGATAAGGTGAAACTGTTTAAGTTTCCTGACAGTCCGGACATTCTCAAGAAATGGCTGTTCAACTTGAGTTTATCGGTAGACACCTTAAAGCCGTACGATGTTATATGCAGCAAACATTTCGATAAATCTTGCATCAAAAATGGCGTTTTACACGAAAAGGCTATACCCACCCAATTTCTGGAAGTCTCCGCTAAAGGCTGGTTTTATAAAAACAATGAAGATTTGTATGAAGTAAACCACAAATGCTGCGTTCCTGATTGCCAGTTGAGCTCGCAAGAAGCAAAGCATTTGTATAGATTTCCCAAGCATAAAGAAGACATGGAAAAATGGCTTTACAATCTTAAGCTGCCACAAATGGAGGACGCAGATGTGAAGGAGTTGCGAGTATGTGATAGACATTTTGAGCTGGGTTGCAAAGTCTCGAACAAAGACTTAATAACCCAGGCTTTGCCCACCCTCAACTTGGGCCACAATGACGCCGATATTTATGGCAATAATTTTATAAAATGTTGTTTAAACAATTGCTCCATAGAGGGTTTTTACTATCACAAGTTGCCCGAGGATTTAATGTTGCAAGGTTTTTGGTTCCAGGAACTCGAAATGGAATGCTCGCATAATTCTTCGGTTTATATTTGCTCGGTACATTTTGTTGCTTTTTTCGAGAGAATTTTAGAAAAGTACAGTGCTTTCCTTAAAGAGTTCAAGGAGTATGCCAAACTTTCGGTTACCTATAATGAGCTTAAAGCTCTGCCTGCTTTGCAGTGTTTTAAATGTCACGTGAGCAAGTGCAGTTCTGGTTTTAAGTTAATATGGAAACTGTTCAAATTTCCCAGAGACAAAACTCTATTTAATAAATGGCTGAATAATTGTGGCCTGCAGTTTGACTATGAACAACGTCTGCAATATCGCATTTGTGCTCAACATTTTGAGGAAAGATGTTTAAGTGAGAAAAAGTTGCATAGATGGTCCTTGCCCACCCTAAAATTGCCTTTAAATAACAGCTTGTATGTCAACCCGCCCGAAGCGCTGCCCTCCAATCACGAACATTTAAAACATTGTTGCGTTGCCAATTGTCCAACGGACAAGGGACCGTTTTATAAATTTCCTCAGAAAACGGTGGAGCAAAAGAAATGGATTCATAACCTCGAGTTGGGTAATCAACAGTGTACTCTTAATTTAAGGGTGTGCTACCGACATTTTGAAAACTATTGTTTCGCCAAGGCGGTTAATAAAATGAAACCTTTAAAATCTTGGTCTGTGCCCACTCTAAGATTGAAGCGCAAGTCGGAACTATATCTGAATCCTGCCGACAAAATCGCCTTCTACGTCTGCTGCATAGAAACTTGCAAACAAATCCTTAACAAATCCAAGGAAATATATCTTTTCAAATTTCCTCTCAGCAACACTTTGAAAGAGAAATGGTTGCATAATTTAAATCTCTGCAACCAGGATTATAAAGCAACCATGAGAATTTGCTCTCTTCACTTTGAGATGCACTGCTTTTACAAGGGCTTTAAGGCACTGCGTAAACATTCGGTACCCACTTTAGGTTTGACCATACAACCCGCAAAACTCTACGCAAACCCCATTAGGAGGCCGTATTTTAAATGTTGTGTCAAATTGTGTAAAGCCCCCCGGCAGCAATTGCTGTCGTTTCCCAAAGAAAAAACTCTCTTAAGGAAATGGTGTCACAACTTGCAATTAAATAAGGAAATTAAGTTAGAAGCTCTGAGAGATTGGAAGATTTGCGAACGACATTTTGAACAAATCTGCTTCAATGTCAATGGTTCTATAAGAAGTCTAGCAGTACCTACTCTAAGATTGGGCCACCATAAAAAGCTATTCCAAAACCCGGTGTTTGCCATAAAAAGGAAAGTCAAAGGAGAATTAAATACTTCGCTAAAACAAGAGAGCGCTTTGCAAATGGCGCCAGCCAAAGTTAATAATTCAGCAAGAATATTTAATGTTTGTACTGAAGAAGATAATAAGCAGAATGTAAATATAGACTTAAGCAGCTCAAAAGCTCTAAGGAAATCTAAAGTTGTAAATATTAAAAAGCCTCGTAAGCAGAATATTGCAAACAATAATAAACAACAAAAATTGAAGCAATACTTAAAGAAAGAAGTGAAGGATTGTGCAGAGAAAGGCGAAGCAGCGGGAACCGTAGAGGGAGAAGAAAAACGATATGAAAGTGTTAGAGAGGGTCAATGTGGTATAAATCAAGAGTACATGGAAGAGAGAATTGAAGAAGGATTAACCGATGACGTTCTCCAAAACCTTAATTGCTGTTTAAGGGCAAGTCAAAGCGACTTCAAATGTAAAACAGAGGGTGGTAATGGAACTGCCACAACAAAACCCCTAAAACAAGAAACACCGGCAGAATCTAAAGCTTTTCCATTTACCGTCGATAAAAAAAAGGTAGATCCCAGCTTGCAGGAAGATGTTTATCTGGAAAATTTATTGGAAATCTTAACCGAAAGTCTACCGGAAAATCACCGTCTCAGAGAAACTCCAGAGCAAGAACTATTTAAGCAAGAAATGTGTTCTCCCACAGCACTTTTACAAGCAACAGCCTACAATGCAAAGGAAGATGCAGGAGATGACGAGAGAATTGAAGCAAATGGAAGGAGGCGAGTCGACGTAAGAATGAGCTTGAACCCGAGAAGGAAAGCAGACAAGGAAAAGAGAAGTGATACGGAAAGAATGAAAAGCGAAAGAAAATCAGACAGAACTAACGTGATAAAGAGAGATAACGAAACAGAGAGGAATAACGAAGAAAACTCCTTATTTGCAATATACGAAATAAAGCAGGAAGTACACGATAATGACCTGCCAGCCAGCGAGCATAGTGATATGGAACTGGTCGAAGATGAAACAGAAAAATGCCTCAACTCAAAGAGTAAACACTTGATTTCTTGTTGTATAAAAACCTGCCCAAATTATGGTTATCTTAAACCGGACTTAGTTCTTTTCAAACTGCCAGTAATTCATGAACTTTCCACTCATTGGCTGGCCAACTGCAAACTTGTACAGTATTCAGCTAAACGTATTTTAAAAAGATTAAGAATTTGTATAGAACATTTTGATAAATGCTGTATAAGAGATAATACTCGTCTAATATTTGGAGCGGTGCCCACATTACACCTCGGTAGCAAGCTAAGCTGTAAAAAATCGTTAAACAAATTCAGTCATCTACGATGTCGCATACAGAGTTGTCAACGTTCAGTTCAATATGACCATATAAATCGTATACCATTTCCTTCAGGCACGATGAGGAGGAAATGGTGTTTGAATATGAATCTTGACGAGGCCACTGTTTCTGCGGACGATTGGATATGTCATAGACATTTTGTGCGAAAATCTTTGATAGATGGACGTAAACTAAAAGCTGGTGTATTGCCCACTTTGTTGCCGGAAACTATAGAGACTTTAAAACTACCTGAAGCCGACAAGCGGTCTAAATACAAAAACCCAAAGCCTATAAAACAACTTTGCTTATTTCCTTGCTGTAGGGCAAAGTCTCAAGTCCTTTATGAATGGCCCAGTCAAGGAGCTTTCGGCATAATCTGGTTAGTAATCCAGAAATTAGGTTCCCAAGTCGAAGACCCCAACAATACTTGGTCGAAATATTTCCAGCAAACTTCATTGGAAAGTAAATTATATGACCATTTACAAGCCATTTCCAAAAAGAACATTAAATTTTGTGGTGACCATTTTTATCAATTATATAAAATCAATCAGCAGGTCATCCAAGATTATGAAACTAATGAAGACTATCAGTCTTTGAAAGAAAAAGTTCAAAATACTTTTGAATTTTTAAATTCGTTGGAAAACGTCTATACTAGACAATGTTGTGTGCCACAATGTAAAACCGATCAAAATATCAAATTCAGTAGAACAGTGAAGCTGTTCGACTTCCCACATGACTCAGACATGGCTAGGAAATGGTGCCATAATATAGGCATTCAAATGACAACGCTCGACTCCAGACCCTTTTCGAAAGTATGTGAAAAACATTTTGAAGATTATTGTTTACAAAGAAGAAACCTCTTAGATTGGGCTTTGCCCACTCTAAATTTACCTGCTTCGAGAGATCCTCAAGATATACAACAAAATGAAAGCGACAAAGTGCTGGCTGCAAAAGAAAAGTGCTTTATTCGATCATGTCCAAATCACGAATCCTTGAGCAGAAACCTAAAATTATACAAGTTTCCTAAACATCCTTTTCTGCTTAAAAGATGGCGCGAGATAACAAATTGTAAGCAAAGCGCAAGAGAACCGCGTTTGTGCTCATTACACTTTCCTGCAACCGATTTTGTTCACAACAGCTCCCAACTTAAAGAACACGCTCTGCCAATGTTTTACCTTGAGCCACAAAACAATTTTTCCAACGCCTCCTCCCTGATCAATTCAAACATCGATGAACTTATACAGGTTAAACAAGAACTGGATATTTCGGAGGAGTGGTGCGTGCCTCCAGAAGAAACTGAGATTTTTCCCACAGAAGAGGAATGCCATAAAATTGCAATCAACGAAGATTTATTGGACTTTAAGTTCAAGGATTACAGCAGAGATACTGAGAACATGCCGTTTATCGAAATAAAGCAAGAAATTTTAGAAATCCAAGAAGAAGAACCACAAACTCAGAACTCCTCCTTGCTGACAATAAACAAATTTGAAGCTCCACAGCCGGACGAATTTCAATATCATTTTCAAAACTCAACTTTTACAAACAACAATCAGGCTTTAGGTTTTGTTATATCAGACATTAAATCACAAATCCATTTGTGTTGTGTACAAAAATGTACCAGCAGCTCCGAAACAGCCGGCATAAGTTTGTTTAATCAGTTCCCTCAAGACTCGGAACTCTTCATAAAATGGTGTTTTAATTTAAAAATTGATCCACGTAATTATAAAGAAAATCAATATAGCATATGTGAACTACATTTTGAAGCCATTTGCTTTGCAGAAAATAAACAACTCCACCCCTGGTCAGTACCCACTTTGAACCTGAATTTAATGGAAAACTCTTTTATACATGAAAATGACATACCTGAATATCTGAAACCCAGCCAAGAACAATGTATTGTTTATGGTTGTATCAGCCCCTTAAAACCTCTTTTTAAATTTCCCTACCAACCCGACGTTTCACTCAAATGGTTTGCCAATTTAAAACTAGACTATACAGACTTTCGGGCACAAAATTATCGCATCTGTAGAAGACATTTCACGAACATATGTTTTGAACTGAACGACAGCAATAAACTTACCAGCGAAGCTGTGCCCACACAGTTTCTAGGGCACACAGATAAAATTTCTTACTTTAACACGTTGTCAGAAGAACAGCAATTACAACCGGCCGAGGGCTTCAACGGCCTTCTAAGAAACCAGGACAACAGTCGTGGCAGTAGTCAGGGATCTATAGCAAGAAATATATCACCACATGATCTTGAAGACCATGATAGTAGTTATTTTGAAGATTTTGAGGAATATTATGGACAAGATGAATAA
Protein Sequence
MSQNNQRKHYHIHAPYQHPQQQQQQQQQQQQTHLHGHHLTASQQQQQQHQQWYTQQHYQHGLHLRDSRHIQHPQHHPHHHTQQQQQQPHHNHTMSPHMFTSGYAGMSATGGVVSVSAGAGGGGGSGSGVGGGNSTGSGVGVTSSAHNTSTVGATTLNMPVTSSSSHHYSSPLSATGGSVAGNANNATGGGTYVARNRMFDLEMLTQPQSQQHSQHSAAPSAHSHSLLSSGSSSGRTGFDAYSHNSLYAQQNQRHILGAGSSHHHLSAAHHSAAHTLHPHHHAQQPHHTHQQPPAALHHHQQQQHYYHHAQQTPLHRPHTQLMPPMLQRIKSEPVEQITVTPSIQTEEVIIKSEPPDDIGSYHLKNLPPIESKSFGREEKHKQREQRTQEQQHHQHQQQQQLHLQQQQRAQHQHQQLLHEQQLHQPQLIKIKQEHYHHSQSGGHTSHNEDVSQQTQQQRKNSENSTTMQPPAVPVPVSVTASSSSSAATTAADDKQKQQEHQEQPQISLTNIKTEAKPLNFPRRKLQTERSSTLPICQRCKQVFLKRQNYTQHVALSTCNIVEYDFKCSVCPMSFMSNEELQTHEQLHRLNRYFCQKYCGKFYETIEECEQHEYGQHEYEMFKCNICCISLTERHQLLIHLNEHKYQPRFDCCICRLCFQTPLELHDHYMANEDFCGKFYDKEAFKKPNTSLKSPYLRKPESSELEIANTFSLKDIPPASSHRLEALYAKPSTSKSSMEPPKTPTTPSSFSFGTTANELAVLEPQIEVKTEIKVEPDFYPPMDQSDFSNYDNDYSTPDYATSSNQSFSFLQDYQDNASSSTNSSFSFSNNDAVQDEDAVCCVPKCGVNKFSSPSLQFFGFPKEEKYLSQWLHNLKMVYDPNINYSLYRICSLHFPKRCVAKYSLSYWAVPTFNLGHDDVGNLYQNRESSGGFPGGEMAKCSMPGCPSQRGETNVKFHVFPRDLKTLIKWCQNSRLPVHSKDNRFFCSRHFEEKCFGKFRLKPWAIPTLNLGTVYGKIHDNPNIYQEEKKCFLPFCRRSRSYDCNLSLYRFPRDETLLRRWCYNLRLDPNMYRGKNHKICSSHFIKEALGLRKLNPGAVPTLNLGHNDRFNIYENELYTPPPPPPPPQPSTSSKAHKFAQLFKQERDASSASHIYDGVFMSSMHQKLSSYSTPNPNANNLDLGDVCLVPSCKRTRHSDNITLHTVPKRAEQLKKWCHNLKMDLHKMHKSVRICSAHFEKYCIGGCMRPFAVPTLELGHDDANIYRNPDVIKKLNIRETCCVKVCKRNRDRDHANLHRFPTNPDLLQKWCENLHKPVPDGTKLFNDAVCEVHFEDRCLRNKRLEKWAIPTINLGWEDVPHRLPSEEEINENWVKPFAPNNGDEQGECCVSSCKRNPQIDDVKLYRPPEDAEQLVKWAHNLQVDVTELPNMKICNLHFEQHCIGKRLLNWAMPTLNLGGKIEHLFENPPPMPAIYKKKMKPERMQSSQESIKWSPRCCLPHCRKMRSVDKIHLFRFPYSNRQTLAKWCHNLQLPLVGSSHRRICSTHFEPSVLTKRCPMNLAVPTLDLNAPAGYKIYQNPARLKQIKIGAQRHCLIESCRKTKLDGVTLFRFPNNRSMLYKWRHNIKNWPKGKLTSIVRICSDHFEPHSVGVKRLSPGAIPTLKLGHEGKDLYANETRSYFDLEKCVVSGCDSRKDMEDIRLFRFPRDDDELLKKWCNNLAMNPNDCVGIKICSKHFEPDCFGPRQLFKWSIPTLKLGHKEDDLVEIIHNPPPEQRSTETLFKCCVPSCGKTRKYDDAQMNSFPKNFKLFRKWKHNLKLDFLNFKEREKYKICNDHFEPVCVGKTRLNFGALPTMNLGHEEVDDLYQINPDRIRPNLFIKQKDVERLERRRLLKEDNREQIEGNDLDEDNLDLLNLEPSDVKCCVSECSAPKSIMREPYDLPESDEFRQLWSKEFGDNDEENSLNEAKICGLHFQLIFNKLKVDMLDMTNKNEDLKTDFNKLQYNYQKADISLVINSYQCRVEGCATNLLNSNIRLYFFPYGKNLVNKWSHNTGIIPDEHRRYMNKVCALHFEYFCITENQRLRSWAIPTLNLPASTEEKHLYKNPDLTKLDKRMLGPQILKCVVKDCNYLELEDESLKLLNFPSDDKLLKKWCDNLKMSHHLTPLLKICSLHFEKNCFGSCRLRSWAIPTLNLGHEESPEHENKNTIRKEVYEAEEEISEVQLKQVKIKKSLDTTKCCVPTCRKSRLKHGVRFYSLPSNVNMKRKWLHNLQIKHLKLNQKHHNIKICNLHFHKRCLDGKLIKSWAVPTMHLGHHEAIYDNPRRLRAITNLRCMLPHCKNHTSSKAVRLFVFPKSLEFLEKWSKNLKLDTEKCKGRICYEHFEKGVVGHKKLISGAVPTLDLGHEDTDIFNNKELLNKFRSKQNELQRIKEVSKLKTVEDEFEEEEYERESEDEEEELWESEEEEEGEEEEEEGEDEEQIFYDDYEEDEEENDETHDEADKTDPQQDDDASSVTNSISDWSSIKFKELRVSITPLTPEDLMDLCSRSSYEKEFGSLKGRRSVTPATISKDLRSETPEQKSSYFSISSAQETADKNPYKYLTESRSLTPEQVMDHFLEPKSSERKSISPQDPLAENCEEYTQKTNKQIESLVFCKEIKSDIDLSNNKLKRENSNFDKETPKRERLDLSEDVITTNTGLLHENVAVNETNLRTDKALNAVAPICCLKHCGKEKTPEQHLTTYGFPKDPQLLQKWCDNLGLQPEECIGRVCIDHFELRVIGSRRLRQGAVPTLNLGPNRQPKHTNLEETPQKKSVSKDFNETGNMQEADATLKPPPPYKLPKPSKHSVFRLCCLKHCRRKKFLKLEKKEQQLLNHRHPQQQETMEILFKFPTDENLRKKWFKNLRLPETLSVKTDLFICSQHFENAVIQNCKLLPLAVPTLQLSYSNRAAIYSNNPEDLKRSCLKIKPKSKITKCFLPHCANKETGLVFLISFPEHEPLALRKWCKNLKLKYHPSKYKTAKICSVHFEPYCFFKKRHLRAGALPTLNLGHTDTIMRNCRKLRLKREHIKAEEKCCLKDCQSTNCRLYGFPRSCELRKIWCNNVQIELSLVLNNHYKMCAKHFSADSFIQGSENLKLNAVPSLNLSSNVEDRVLQATNSDESKCIVGNCRKTPSVDKVKLFKFPDSPDILKKWLFNLSLSVDTLKPYDVICSKHFDKSCIKNGVLHEKAIPTQFLEVSAKGWFYKNNEDLYEVNHKCCVPDCQLSSQEAKHLYRFPKHKEDMEKWLYNLKLPQMEDADVKELRVCDRHFELGCKVSNKDLITQALPTLNLGHNDADIYGNNFIKCCLNNCSIEGFYYHKLPEDLMLQGFWFQELEMECSHNSSVYICSVHFVAFFERILEKYSAFLKEFKEYAKLSVTYNELKALPALQCFKCHVSKCSSGFKLIWKLFKFPRDKTLFNKWLNNCGLQFDYEQRLQYRICAQHFEERCLSEKKLHRWSLPTLKLPLNNSLYVNPPEALPSNHEHLKHCCVANCPTDKGPFYKFPQKTVEQKKWIHNLELGNQQCTLNLRVCYRHFENYCFAKAVNKMKPLKSWSVPTLRLKRKSELYLNPADKIAFYVCCIETCKQILNKSKEIYLFKFPLSNTLKEKWLHNLNLCNQDYKATMRICSLHFEMHCFYKGFKALRKHSVPTLGLTIQPAKLYANPIRRPYFKCCVKLCKAPRQQLLSFPKEKTLLRKWCHNLQLNKEIKLEALRDWKICERHFEQICFNVNGSIRSLAVPTLRLGHHKKLFQNPVFAIKRKVKGELNTSLKQESALQMAPAKVNNSARIFNVCTEEDNKQNVNIDLSSSKALRKSKVVNIKKPRKQNIANNNKQQKLKQYLKKEVKDCAEKGEAAGTVEGEEKRYESVREGQCGINQEYMEERIEEGLTDDVLQNLNCCLRASQSDFKCKTEGGNGTATTKPLKQETPAESKAFPFTVDKKKVDPSLQEDVYLENLLEILTESLPENHRLRETPEQELFKQEMCSPTALLQATAYNAKEDAGDDERIEANGRRRVDVRMSLNPRRKADKEKRSDTERMKSERKSDRTNVIKRDNETERNNEENSLFAIYEIKQEVHDNDLPASEHSDMELVEDETEKCLNSKSKHLISCCIKTCPNYGYLKPDLVLFKLPVIHELSTHWLANCKLVQYSAKRILKRLRICIEHFDKCCIRDNTRLIFGAVPTLHLGSKLSCKKSLNKFSHLRCRIQSCQRSVQYDHINRIPFPSGTMRRKWCLNMNLDEATVSADDWICHRHFVRKSLIDGRKLKAGVLPTLLPETIETLKLPEADKRSKYKNPKPIKQLCLFPCCRAKSQVLYEWPSQGAFGIIWLVIQKLGSQVEDPNNTWSKYFQQTSLESKLYDHLQAISKKNIKFCGDHFYQLYKINQQVIQDYETNEDYQSLKEKVQNTFEFLNSLENVYTRQCCVPQCKTDQNIKFSRTVKLFDFPHDSDMARKWCHNIGIQMTTLDSRPFSKVCEKHFEDYCLQRRNLLDWALPTLNLPASRDPQDIQQNESDKVLAAKEKCFIRSCPNHESLSRNLKLYKFPKHPFLLKRWREITNCKQSAREPRLCSLHFPATDFVHNSSQLKEHALPMFYLEPQNNFSNASSLINSNIDELIQVKQELDISEEWCVPPEETEIFPTEEECHKIAINEDLLDFKFKDYSRDTENMPFIEIKQEILEIQEEEPQTQNSSLLTINKFEAPQPDEFQYHFQNSTFTNNNQALGFVISDIKSQIHLCCVQKCTSSSETAGISLFNQFPQDSELFIKWCFNLKIDPRNYKENQYSICELHFEAICFAENKQLHPWSVPTLNLNLMENSFIHENDIPEYLKPSQEQCIVYGCISPLKPLFKFPYQPDVSLKWFANLKLDYTDFRAQNYRICRRHFTNICFELNDSNKLTSEAVPTQFLGHTDKISYFNTLSEEQQLQPAEGFNGLLRNQDNSRGSSQGSIARNISPHDLEDHDSSYFEDFEEYYGQDE

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01376677;
90% Identity
-
80% Identity
-