Basic Information

Gene Symbol
-
Assembly
GCA_015586225.1
Location
NW:27648-55219[+]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) that probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-terminus of the protein, its size of about 90 residues, a C-terminal AVPTIF box, and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably in worms and flies, but not in other eukaryotes or in any prokaryotes [1].
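The C2CH consensus quoted in the description can be written as a regular expression. The following is a minimal sketch, not the method used to build this record: the function name `find_c2ch` is hypothetical, the pattern is a literal reading of the spacing in the description, and a plain regex is far less sensitive than the profile HMM search reported under Hmmscan Out. The test fragment is copied from the protein sequence on this page.

```python
import re

# Literal reading of the consensus in the description:
# Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His
C2CH_PATTERN = re.compile(r"C.{2,4}C.{35,50}C.{2}H")


def find_c2ch(protein):
    """Return 1-based inclusive (start, end) spans of non-overlapping C2CH-like matches."""
    return [(m.start() + 1, m.end()) for m in C2CH_PATTERN.finditer(protein.upper())]


# Fragment copied from the protein sequence of this entry (illustrative only).
fragment = (
    "CCVPKCGVRKFSSPSLQFFGFPRDEKYLAQWLHNLKMIYDPNVNYGLYRICSLHFPKRCIAKYSLSYWAVPTFNLGHDD"
)

for start, end in find_c2ch(fragment):
    print(f"C2CH-like span at {start}-{end}: {fragment[start - 1:end]}")
```

A regex hit only shows that the cysteine and histidine spacing is compatible with the consensus; it does not confirm a functional THAP domain.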
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 34 6.2e-15 5.9e-12 45.9 0.5 1 86 841 913 841 914 0.83
2 34 3.3e-15 3.1e-12 46.8 4.7 1 87 941 1010 941 1010 0.80
3 34 1.9e-15 1.8e-12 47.5 0.3 1 87 1031 1103 1031 1103 0.83
4 34 4.7e-14 4.5e-11 43.0 3.1 1 86 1187 1255 1187 1256 0.79
5 34 3.4e-15 3.2e-12 46.7 5.4 1 87 1280 1352 1280 1352 0.81
6 34 2.9e-12 2.8e-09 37.3 0.9 1 87 1387 1455 1387 1455 0.81
7 34 1.6e-11 1.6e-08 34.9 2.8 1 86 1496 1565 1496 1571 0.73
8 34 7.3e-15 7e-12 45.6 0.1 1 86 1593 1662 1593 1663 0.80
9 34 5.1e-14 4.9e-11 42.9 0.6 1 86 1685 1754 1685 1755 0.78
10 34 7.7e-14 7.4e-11 42.4 2.0 1 87 1783 1855 1783 1855 0.86
11 34 1.2e-07 0.00012 22.5 0.1 1 73 1922 1989 1922 2000 0.73
12 34 2.4e-11 2.3e-08 34.3 0.5 1 87 2019 2091 2019 2091 0.79
13 34 2.2e-14 2.1e-11 44.1 5.8 1 87 2121 2192 2121 2192 0.81
14 34 1.4e-13 1.3e-10 41.6 5.0 1 87 2237 2310 2237 2310 0.84
15 34 2.9e-14 2.8e-11 43.7 1.1 1 87 2332 2400 2332 2400 0.80
16 34 1.3e-14 1.2e-11 44.9 0.3 1 87 2713 2782 2713 2782 0.80
17 34 3.1e-13 3e-10 40.4 3.4 1 86 2840 2922 2840 2923 0.82
18 34 2.8e-11 2.6e-08 34.2 1.0 1 87 2953 3025 2953 3025 0.73
19 34 5.6e-13 5.3e-10 39.6 0.9 1 87 3053 3122 3053 3122 0.83
20 34 6e-14 5.8e-11 42.7 1.3 1 87 3142 3212 3142 3212 0.82
21 34 1.5e-14 1.5e-11 44.6 2.3 1 87 3235 3306 3235 3306 0.81
22 34 1.1e-05 0.01 16.3 0.2 1 60 3322 3370 3322 3400 0.76
23 34 1.8e-12 1.7e-09 38.0 3.0 1 86 3410 3479 3410 3480 0.83
24 34 2.6e-12 2.5e-09 37.4 3.2 1 86 3505 3575 3505 3576 0.80
25 34 5.8e-15 5.6e-12 45.9 1.9 1 86 3596 3668 3596 3669 0.79
26 34 2.8e-11 2.7e-08 34.2 2.5 1 86 3689 3758 3689 3770 0.79
27 34 4.8e-13 4.6e-10 39.8 5.6 1 86 4116 4192 4116 4193 0.82
28 34 2.8e-08 2.6e-05 24.6 3.9 1 86 4214 4284 4214 4285 0.75
29 34 1.4e-10 1.3e-07 31.9 1.4 1 86 4309 4378 4309 4379 0.83
30 34 0.48 4.6e+02 1.4 5.5 1 62 4394 4459 4394 4478 0.65
31 34 4.4e-12 4.2e-09 36.7 0.9 1 87 4498 4572 4498 4572 0.80
32 34 1.4e-13 1.4e-10 41.5 2.4 1 86 4592 4664 4592 4665 0.79
33 34 2.5e-13 2.4e-10 40.7 4.2 1 87 4797 4870 4797 4870 0.77
34 34 3e-14 2.9e-11 43.7 1.2 1 86 4895 4964 4895 4965 0.81
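Each row of the domain table above has 13 whitespace-separated fields: domain index, total domain count, conditional E-value, independent E-value, bit score, bias, HMM start and end, alignment start and end, envelope start and end, and accuracy. A minimal parsing sketch under that assumption follows; the names `DomainHit` and `parse_row` and the `1e-3` cutoff are hypothetical choices for illustration, not part of the record or of HMMER itself. The sample rows are copied from the table.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_row(line):
    """Parse one 13-field domain row into a DomainHit."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


# Three rows copied from the table above (domains 1, 22 and 30).
rows = [
    "1 34 6.2e-15 5.9e-12 45.9 0.5 1 86 841 913 841 914 0.83",
    "22 34 1.1e-05 0.01 16.3 0.2 1 60 3322 3370 3322 3400 0.76",
    "30 34 0.48 4.6e+02 1.4 5.5 1 62 4394 4459 4394 4478 0.65",
]

hits = [parse_row(r) for r in rows]

# Hypothetical cutoff: keep domains with an independent E-value below 1e-3.
kept = [h for h in hits if h.i_evalue < 1e-3]
for h in kept:
    print(h.index, h.ali_from, h.ali_to, h.score)
```

With a stricter or looser cutoff the set of retained domains changes; weak rows such as domain 30 (score 1.4) would normally be treated with caution.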

Sequence Information

Coding Sequence
ATGTCACAAAATAATCAACGTAAACATTATCATATACATGCTCCCTATCAACACCcccagcagcaacaacaggctcaacaacatcatcatccgCATCACTTGACTGCGgcacagcagcagcagcagcaacatcatcatcagcaacaacagcagcaatggTACGCGCCACAACATTATCAACATGGTCTTCATTTAAGAGATTCGCGCCATATTCAGCATCCATCTGCTCAACATCCTCATCATCATTCCCATCATGTTCCACaccagcaacagcagcagccgcACCATCAAGCTCAACATCACAGCCATTCGATGGCTCCTCATATGTTTACAAGTGGTTATGTTGCAGGAGGAGTTGGTGGAGGGGGTGGGGTAGGTGGTGTTAGTAGTTCAGGTGGTGGTGGGGTTACAGTGCCGAGTTCACCACACAATACAGCAACAATGGGCAGTACACACAACATGCCGGTTTCTTCTTCCCCTGCACATCATTATTCTTCTGCTGCTGCTTTGACTGGTTCGAGTGCGAGTGTTGTTGGTGCTGGCATTAGTGGGGGTGCTGCGCCAAGCAATGCACCAACAGCTAATAGTGGTGCTGGTGGTGCCTATGCAGCTCGTAACAGAATCTTTGACCTTGAATTGTTAGCAACACAATCAACAGCACAACAACATGCTCACAGTAGTAGTGCAGCAGCACATTCCCACTCTATGTTATCAACAGCCGCCGGCAGTTCAAGTGTTCGATCAGCAGCAGGTTTTGATGCCTATTCTCACAATTCTTTATATTCTCAACCCAATCAACGCCATCATTCTTCGCCTGCCTCCTCTTCACATCATCATTTAGCTGCTGCCTCGCATTCTCTGCATCCTCATCATTCCCATCATCATCACCATGCACCAACGTCGACTCATCATCCACATCAAACGTCTCTTCATCAGCATCATCAGGCTCATCACCATCACCAGCAgcagcatcaacaacaacaacattattatcatcatgCTCCACCAACTTCGCTCCATCGGTCACATTCACAAGTTATACCCCCCATGTTACAACATATTAAATCTGAGCCAGTAGAGCAAATAACCGTAACACCGTCTATACAAACCGAGGAAATCATCATAAAATCTGAACCTGTGGATGATATGAGTAGTAGTTATCACAAAAGTGCGCCACAAATTGAAAACAATTCTTCTTTTCACATGGAAGAAAAACGTAAACAATATGAACtgcaaaaacagcaacaacaacagctacatcaacaacagcagcaacaacgtACTCAACAACAGATTTTACATGAGCAACAACTACATCATCAAcaccagcagcaacaacaacagcaaataaaGCAAGAACCTCATGATTATCCACAACATCAGAGTGAAAATACTCATAATGAAGATGTTtcccaacaaacacaacaatGTTTAAATTCCGAGAATTCTACTACTACGTTACTACCAGTAGAGCAAAAACCACagaaacaagaacaacaacaacagcaacaaccagAACAACAGCATCAGCAAATATCCTTGGCTAATATAAAAACAGAAGCAAAgCCCCTTAACTTTCCTCGTCGCAAATTACAAACAGAACGTTCCTCAACTCTGCCCATATGCCAAcgatgtaaacaagtttttttaaaacGTCAAAACTATACACAACATGTTGCTCAATCCAGTTGCAATATTGTTGAATACGATTTTAAATGCTCCGTATGTCCCATGTCCTTTATGTCTAATGAGGAGCTGCAGACACATGAGCAATTACATCGTTCACATAGATATTTTTGTCAGAAATATTGCGGGAAATTCTATGAAACCATCGAAGAGTGTGAACAACATGAATACGGACAGCATGAAtatgaaatgtttaaatgtaATATATGCTGTATAAGTGTAACACAACGCGATCAATTATTTGTCCATCTAAATGAGCATAAATACCAGCCACGCTTCGATTGCTGTATCTGTCGTTTATGTTTTCAAACTTC
ATTGGAACTGCATGATCACTACATAAACAATGAAGATTTCTGTGGAAAATTTTACGATAAAGAAGCCTTTAAAAAACCAATTACCTCGTCGTCAACGTCCTCCACGTCAACAACACCTTATCTGGGAAAACCGGAAAGTTCGAATTTGGAAATAGCTCATACGTTCTCCTTAAAAGATATACCTCCGGCTAATAGTCAGCATTTGGAAGCTTTATATAATAAACCTTCCTCTACAAAAACTTCCATGGAAGCAGCTAATTCGTTTAATGCCTCTAATGATTTTCCTTTGGAGCCACAGGTAGAGGTAAAAACAGAAATTAAAGTAGAACCTGACTTTTATCCACCCATGGATCAATCAGATTTTACGGCTTATGACAATGATTATCCTCCTGCCGATTATGCAGCAGGCTCCAATCCAAATTTAGCATTTCTACAGGATTTTCAAGATAATGCTTCCAGCTCTACTAATTCATCGTATTCCTTTAATAATAACGATGCCATACAAGATGAAGATGCCATTTGTTGTGTACCCAAGTGTGGGGTACGTAAATTCTCTTCGCCTTCCTTACAATTCTTTGGTTTTCCAAGAGATGAAAAGTATTTAGCACAATGGTTGCATAATCTGAAAATGATCTATGATCCTAATGTTAATTATGGTTTATATCGTATTTGTAGTTTACATTTTCCCAAACGTTGTATAGCTAAGTATTCCTTAAGTTATTGGGCAGTGCCCACTTTCAATTTGGGTCACGATGATGTGGGTAATTTATATCAGAATAGAGAAAGTTCGGGGGGGTTTCCGGCCGGTGAAATGGCCAAGTGTAGTATGCCGGGTTGTCCTTCGCAGCGTGGCGAAACTAATGTAAAATTTCATGTATTTCCACGCGATTTGAAAACCTTGATTAAATGGTGTCAAAATTCCCGTTTGCCGGTTCATAGTAAAGATAATAGATTTTTCTGTTCGCGCCATTTTGAGGAGAAATGTTTTGGCAAGTTTCGTTTAAAACCCTGGGCCATACCCACCCTCAATCTGGGCACGGTTTATGGTAAAATACACGATAATCCGAATATCTATCAAGAggaaaagaaatgttttttgcCCTTCTGTCGCCGCAGCAGATCGTATGATTGTAATTTATCTTTGTACAGATTTCCACGCGATGAAACTTTGCTGAGACGTTGGTGTTATAATTTGCGCTTAGATCCCAATATGTATAGAggtaaaaatcataaaatatgttCTTCTCATTTCATTAAAGAGGCTTTAGGCTTGAGGAAACTTAATCCTGGCGCAGTGCCTACTTTAAATTTGGGACATAATGATAGAtttaatatttacgaaaatgaaCTTTACACACCACCACCACCGCCTCCGCCACCACAACCCTCCACCTCGTCTAAGGCTCATAAATATGCTGAAATGTTTAAACAGGAAATGGCTGGATCATCACATATCTATGATGGTGTTTTTATGAATGCCTCATCCATGGTGCAGAAATATTCGGCCTCTGCTGCTGCAAACTCTAATAATTCCAATAATTTAGATTTGGGCGACGTTTGTCTAGTGCCCTCCTGTAAACGTACCCGCCATTCAGCTGATATAACTCTGCATACGGTTCCCAAGCGGCCGGAACAGCTGAAGAAATGGTGCCACAATTTGAAAATGGATTTGGTTAAAATGCATAAAAGTGTTAGAATCTGCAGTGCTCATTTTGAAAAGTATTGCATAGGCGGCTGTATGAGACCCTTTGCTGTACCTACTCTAGAGTTGGGTCACGATGACACCAATATCTATCGCAATCCGGATGTTATTAAGAAACTTAATATACGCGAAACTTGCTGCATACAATCGTGCAAAAGAAATCGTGATCGTGATCATGCTAATCTGCATAGATTCCCCACACACCCTGAATTGCTGCAGAAGTGGTGTGAGAATCTGCAGAAACCCATTCCGGATGGTACGAAACTCTTTAATGATGCTGTCTGCGAGATACATTTCG
AAGATCGCTGTTTGCGCAATAAACGTTTGGAAAAATGGGCCATACCCACTTTAAATTTAGGCTGGGATGAGGCACCTCATGCTCTACCCTCTGAAGAGGAGATTAATGAGAATTGGGTTAAGCCTTTTGCCCCCAATAATGGCGATGAGCAAGGTGAATGCTGTGTAGTCAGCTGTAAACGTAATCCCCAAATTGATGATGTCAAATTATATAGACCACCCGAGGATGCTGAGCAATTGGTTAAATGGGCTCATAACTTACAGGTGGATGTGGCAGAATTGCCCAATATGAAGATTTGTAATTTACATTTCGAACAGCATTGTATAGGCAAACGTTTGCTTAACTGGGCCATGCCTACTCTCAATTTGGGCGCTAAAGTAGAGCATCTTTTCGAAAATCCACCGCCCATGCCCAccatctataaaaagaaaatcaaaccGGAGAGGTTAATCAGCAGTCAAGAAGCCATTAAATGGTCACCACGCTGTTGCCTGCCGCATTGTCGTAAAATGCGCTCGGTAGATAAGGTTCACCTGTTTCGTTTTCCCTACAATAACCGCCAGACTTTGGCGAAATGGTGTCACAATTTACAGTTACCCCTGGTGGGTAGTTCTCATCGACGCATCTGCTCCAGCCACTTTGAATCCTCGGTATTAACCAAACGTTGTCCCATGTCGTTGGCTGTACCCACTCTCGATTTAAATGCTCCTCCCGGCTATAAAATCTATCAAAATCCTGCCcgtttaaaacaaatcaaaatggGTACTCAAAGAAAATGCATCATAGAGTCATGTGGTAAAACAAAACTAGATGGTGTTTCACTCTTCCGTTTCCCTAACAATCGCTCCATATTATATAAATGGCGTCATAACATCAAAAATTGGCCTAAGGGCAAATTAAGCTCTCAGCTAAGAATCTGTGCCGAACATTTTGAGCCTCATTCGGTGGGCGAAAGAAAACTATCCCCTGGAGCTATACCCACTTTGAAACTGGGACACGAAGCTAAAGATTTATATCCCAATGAGACCAGATCTTTCTTTGATCTAGAGAAATGTGTGGTTAATGGCTGTGATTCGCGCAAAGATATGGAAGACATAAGACTTTTCCGTTTTCCCCGTGATGACGATGTTTTACTTAAGAAATGGTGTAATAATCTTAAAATGGAACCCAATGATTGTGTGGGCATTAAGATATGCAGCAAGCATTTCGAACCGGAATGTATAGGACCACGCCAGCTATATAAATGGggtatacccaccttaaagctGGGACACCGAGAAGATGATTTAGTGGATATAATACCCAATCCACCGCCCGAACAAAGAGCCGGAGAATTCCTCTTCAAATGTTGTGTACCCACCTGCGGTAAAACGCGCAAATATGATGAAGCTCAAATGAATAGTTTCCCCAAGAATTTGAAATTATTCCGCAAATGGAAACATAACCTAAAATTAGATTATCTCAATTTCAAGgaaagagaaaaatataaaatctgcaATGATCATTTCGAGGCGGTGTGTGTGGGTAAAACTAGATTAAACTTTGGAGCTTTACCCACTTTGAATTTGGGCCACGATGATGTAGATGACTTGTATCAAATCAATCCCGATAGAATAAGaccaaatttgtttataaaacaaaaggaTGCCGAAAGACTAGAGAGACGACGCATTTTAAGGGAGGAGAATAGAGAGCAATATGAGTGTGAGGAACCAGAGGAAGATAACACAGATCCTTTAAGTCTAGAGCCAGGAGATATCAAATGTTGTATCTCAGACTGTTCTGCACCCAAAAGCATAATGCGTGAACCCTATGAATTGCCGCAAACCAaagaatttaaacaattatggATAAAGGAATTGGGAGAAACGGATGAAGAGGACTTACCCAGTGAAGCTAGAATCTGTGGCCTACATTTCCAAACCCTATTCAGCCAACTTAAAGCCGAAATGATGGAATTAGCAGAGGAAAATACCGATTTGAAATCCGATTTTCAGAAA
CTACAATATAATTATCAAAAGTCTAACATATCTTTGGTGGTCAATAGCTATCAATGTCGTGTGGCCGACTGTCCCACCAATCTCTTAAACTCCTCTATAAGACTGTTTTTCTTTCCGTATGGCAAAAATTTGGTTAATAAATGGTCTCACAATACCGGCATAATACCCGATGAACATCGCCGCTATATGAATAAAGTATGCGCCTTGCATTTTGAATCCTATTGTATAACCGAAAACCAAAGATTAAGATCTTGGGCCATACCCACCTTAAATTTACCTTTAGCGGAAGATAAACATTTGTATAAAAACCCTGATCTTACTAAACTAGATCGACGAATGTTGGGACCTCAAATTGTTAAATGTGCGGTGAAAAATTGCAGCTATCAAAAATCTTCGGAAGATGATACCATTAAACTCTTTAATTTTCCCTGTGATGATAAGTTACTAAAGAAATGGTgtgataatttaaaaatgtctcATCATTTTACACctctattgaaaatttgctCTTTGCATTTTGAAAAACAATGCTTTGGCAGCTGTCGCATACGTTCTTGGGCCATACCCACTTTGAATCTGGGTCATGACGAGGCTCCTGAGCATCTCAATAAAACCACTATACGTCAAGAGTTGTTTGAGGCTCCCGAACAGATAGCCGACATACAATTGAAACAGGTTAAGATCAAAAAGTCTTTGGACAGCGCTAAATGCTATATAGCCAGCTGCCGCAAGTCTAGATTGAAACATGGTGTACGTTTCTACAGTTTGCCCAGCAATCCCAAGATGAAACGTAAATGGCTgcataatttacaaatttcCCAAATGAAATCCTCGCATAAAttgcaaaatgttaaaatttgcaATCATCATTTTCACAAAAGATGTCTGGAGGGCAAACAACTGAAAGCCTGGGCTGTGCCCACCATGCATTTAGGTCACACGGATGCTATTTTCGACAATCCACGCAAAGTACAATCTCTGCAAATACAACGTTGTGTGCTGGTGCACTGTAAAAATCATACGGTGGCGCAAGGAGTGCGTTTATTTGTGTTTCCCAAATCACCGGAATTTCTGGAGAAATGGTCGAAAAATTTGAAACTAGAACTGGAGAAATGTAAAGGCAAAATATGTCATGAACATTTTGAAAAGGAGGTTATAGgagaaaagaaattgaaaaatggAGCAGTGCCTACTTTAAATTTGGGCCATGAGGATGTGGATATATACGATAATAAggaattaaaagagaaaattaaattgaaatccaGGGAACAGGAAGTAAAAGTAGATCCtttagaaacaaaagaaaacacagAAGAATTGTACGAAGAGGAATATGAGCCACATTCCGGGGAGGAAGAGGAAGAGGAGGAAGAAGATGAAGAATTATGCGAAACCGAAATAGAAGAGGAAGAACGGGAGGAAGAGGTGGAAGAGGAGGAAGAAGAGGAGTTGGAGGAGGAAGAAGAAGTGGTCTACTATGATGAGCAAGAGGAGGAAGAAGATGTACTAGAAGAACAGGAAAGTAATAGCAAACGACCACAAGAAGATGATGAATTGAGTGTTACCACTAGCATAACCGATTGGagttctattaaatttaaagagGTAAGAGTCTCCATAACACCTTTAACACCCGAAGACTTAATGGATTTATGTTCTCGTTCCTCATATGAAAGAGAATTTGGCTGCTTAACACCAGCCAGTAGTTTAAGAGGACGTAGATCTATAACCCCCGCTTCTAGTTTAAAGGAATTGCGTTCCGAAACTCCTGAACAAAAGCCCAATAGCGGACAACTTAAGTTAAGATCCGAAACACCGGATAATAAATCATTCTTTGGTTTTAGAGAACCACGTTCGGTCACACCCGATCAAAAAGCTGAACAACTAAGAGCAACTCAAACTCCCGAACCTAAACTAGAAAAACTAAACGATAACCATAATGAAACTAAAACTAACTCTACTGTAAACCCCTTAAAGAGAGACAATACAGAATCGAA
CAGTTTTAGTGTTAAACGTGAAAGATTGGAATTGTCAGAAGATGAAAACACCAATACTTCTTTGCCCTTAGAAATGGATTATGCTAATAGCACTAATCTTAGAACCGATAAAGCCCTTAATGCAGTAGCTCCTATTTGTTGTCTAAAACACTGTGGTAAGGAGAAAACCCCGGAACAACACTTAACCACTTATGGATTTCCTAAAGATCCTCAGTTATTGCAGAAATGGTGTGATAATCTAGGCTTACAGCCCGAAGAATGTATCGGACGTGTTTGTATAGATCATTTTGAATTGCGTGTCATAGGCACCCGAAGACTAAGACAAGGAGCAGTGCCTACTTTGAATCTAGGACCTAATCGCCAGGCAAAACACTCTAATTCAGAGGAAATGCCGCCAAAGAAAACGGTAACCAAGGATTGTACGGAAACAGTTAACCTGCCAGAAGCTGACAGCAATCTAAAGCCTCCACCGCCCTATAAGCATAGTAAAACCAGCAAGCAATCGGTTTTTCGGCTATGTTGCCTCAAACATTGTCGACGCAAGAAATTTGTAAAACAGGAGAAGAAAGAGAAGGACTTGGCGAAGGAGCCAATGGAGCAGCTATTTAAATTTCCCACTGATGAGACTATGCTAAAGAAATGGTATAAAAACCTAAGATTACCCGAAAAGCTAAGCATACCCAACGATTTACAGATTTGCGCAAAACATTTCGAAAGCAATTGTATCATAAAAGGCAAATTACATCCCAAGGCTGTACCTACCCTAGAGCTAAGCTATGCTAATCGTGAGcctatttacaaaaataatccCAAGGATTTTGAGACTGTTAAGCACAAAACGCCagcaaaagaaaaatgttttcttaagcACTGTGGTAATGTTAATACGGAAAGGATTTTCCTAATACCCTTTTTGGATAACGAAAGTATGAGTGTTAAGAAATGGTGTAAAAACTTAAGACTCCCTTACGATAGAAGTAAactaaagaaattgaaaatctGCAGTGAACATTTTGAACCGTATGTGTTCTACAAAAGACGGCATTTAAGAAACGGAGCTTTACCCACTTTGAATCTAGGGCATACGGGAGCCATAACACGCAATTGTCGCAAGTTGCGCTTGAAAAGGGTTAATAGCAATAGTGTTAAGGAACAATGTTGCATTGAGCAGTGCAAGGAAACGAACTTGAAGTTGTATGCTTTTCCGCGAAGTTTCGAATTGCGTAAAATCTGGTGCAACAATTTGCAAATAGAATTGAGACAGGCTTTAAATAACCACTATAAAGTTTGTAGCAAACATTTTGGCATCGAAAGTTTCATGGTGGGTTCggataatttgaaaattaatgcTGTACCTGTTTTGAATTTGGGCATTAAAACCGAGAGTCATTTAGTGTTGAGCTCTAATACTCAAGAAAGTAAATGTCTAGTGGAGAATTGTCAAAAGACGCCGAGTGTGGATAAGGTGAAGCTATTCAAAATGCCACAAAAACCTGAGATACTTAAGAAATGGCtgtttaatttgaatttatcgGCCGAGACTTACAATAGCCAGGATGTTATCTGCAGTAAGCATTTCGATAAGAGTTGTATCAAACAAGGGATACTACATGAGAATGCCATACCCACCAAATTCCTGGAGATAGCCTCCAAAGACTGGTTCTACAAAAACAATGAGGAATTGTATGAAATGCCACGAAAATGCTGTGTTTTGAATTGCCAGCAAACATCGGAAGAAGCTAAACATTTGTATAGATTTCCCAAGCACAAAGAGGATTTGGAGAAATGGTTGTATAATTTGAAATTACAGGTAGAAGAGTCCGAGGTTAAAGATCTACGTGTATGCGATAGACATTTCGAGCAGAGCTGTAAGATATCCAATAAGGATTTAATAACTCAAGCTTTACCCACCCTCAACTTGGGTCACACGGACTCGGATATTTATggcaataattttataaaatgttgtctGGATAATTGCACCATCGAGGGTTTCTACT
ATCACAAATTACCCGAAGATTTAATGTTGCAAAGTTTTTGGTTTCAGGAACTGGAAATGGAAACTACTTTTAATACCTCGCTATACATCTGTTCCGTACATTTTGTGGCCTTTTTCGAAAGAATCTTGGAAAAATATAGCGCCTTTCTTAAAGAATCCAAGGAATATGTTAAGTTGTCGGTAACCTATAATGAGCTTAAAGCTTTACCTGGTCTACAAAGCTACAAATGTAATATACCCAAATGTAATTCCGGTTTCAAACTAATATGGAAGTTGTTGAAATTTCCCAAAGATCAAACTTTATTCAATAAATGGCTGCATAATACCGGTTTAAAATTTGACTATGATCAGCGCAACAATTATCGCATATGTGCCCAACACTTTGAGGAAAGATGTTTAAGTGAGAAAAAGTTACATCGTTGGTCTTTGCCCACCCTAAAGTTGCCTTTCAACAATAGTTTATATGTCAATCCACCCGAAGCTTTACCCTCTCATCATGAAAATCTTAAACACTGCTGTGTATCCAATTGTCCAACCGTTAAGGGTCCATTTTATAAGTTTCCCCTTAAGCTAGCAGAAGCCAAGAAATGGATACATAATTTGGATTTGGGCACACAACAATGTACTTTAAATTTAAGAGTCTGTTATAAACATTTCGAGAACTATTGCTTTTCCAAGGCAGTGGATAAAATTAAACCTTTAAAATCCTGGTCAGTGCCTACTTTAAAGCTGAAACGCAAAACCGAACTTTATCTCAATCCAGCCGATAAGATAGCCTATTATGTTTGCTGTATAAACAGCTGTAAgcagattttaaataaatccaaagaaatctatttatataaattccCCTCCAGCAATACTTTGACACAAAAATGGTTACACAATTTAGGCCTTAACCGAACGCAATACCAGGAAACTATGAGAATTTGTTCGGATCATTTTGAAAGGGATTGTTTTTATAAAGGTTATAAGTTATTGCGTAAATACTCCGTGCCCACTCTGTGTCTAAATAAACCGCCCAAGGATCTTCATACCAATCCCGTAAGACGTGCCTATTTAAAGTGTTGTGTTAAATTGTGTAAAGGTCCTTGGGATCAATTGTTTAACTTTCCCAAAGATAAAACTTTATTAAGGAAATGGTGCCATAATTTGCAATTGGAAAAGGAAATTCCCATGGAGTCTTTAAGAGAGGCCAAGTTATGTGGTCAACACTTTGAAAAGGAATGTTTCAATAGATTTGGTTTAATTAGAGCCATGGCTTTGCCTACTCTCAAATTGGGACATcgtaaaaagcttttcaaaaatcctaattttaatggaaaatcaAAGATTAAAGAGGAACTTAAAGAAGAGGGAATGACAGAGGAGAAAGTAACACCTAAGGCTAAAGAGGAGACAGAAGAAAAGCAACAGCAGGATTTGGAGGTCAACAAGGAGTTAGATATTAAGTTAGACACTGAGAAAGTGGAGTCAAAATCTTTCAGAGCGAAAGCTTTAAGGCCTTTGCAACATTTAAGAAAACCCGATAAACTTATCAATCGTTACGATCCCAAGGCCAAAACTCTAAGAAAGAAAGCTTTAAGTATGAAAGCTAAAATAACAAAAGTTGTAAAAACAAAACCTTTAACCAAGAAAGCTAAAGAGAAGGAAACTTTAAGAGAAAAGAAAGAGGCAAAAGAACAAGAAATTAACAATTTGGTAGAAGAAGTAGGTAAAGATAAGGAAGGTAATTTAGATGAAGTTAAGCAGCTGGAAAATTGTGGAGTTGTGCCGGACATTCAACAGCAAGATGATGcttatttggaaaatttattggaaattttaacaGAGACCGATAATGCTACAGATTCAAGGGAAATTAAGGAACAAATAAAGCAGGAACAGGGAAATGAAACTCCACAGAAATTGCTAAATATAATAACAGAAACTGAGGTTGAACTAACTTCTAGGAAAGCTAAGGAAGAGAACCAAGAGCAGCTTACTGCACAAGAAGTTTCA
AAGCAATCTGAAGACTATTTGCAGAGTAATCAAGAACAGGAAAATACTTTTACTATATATGAAATTAAACAGGAATTGGAGGAGTTTAAGGAACAAGATAGTTATCAGACACAAGAACAATATCTAGAGCAGAACAAAAATCCCGAGCAAAATGATAATTTAGACTACAGTCAAGATAGTTTTACCAACGAACAACTCTATCCCTATGAAGACTCACAAGATGATTATCTACAAGCACGACCGGATTATGAACCAAGTGATCATAGTGAAAATGAATTAATGGATCCAGAAATAACAGAAGAACCAGACAAACTACCAAACAGCAAAGGTAAAAACCCTATAGGTTGCTGCATAAAAACCTGTCGCAACTATCATAAATATCAGGATCATATACCGCTCTTCAAACTACCTAATATACGCAAATTACGAGAACAATGGCTGGTCAATTGTAAACTCAATCAAAGACAGTGTAGCGCAAAAGGGGCTTTAAGACGTTTTCGAATTTGCATAGAACACTTTCACAAACAATGTCTGAAAAACCATTATCGTTTACTGATAGGAGCAGTGCCTACCCTAAAGCTGGGTTCACCCGCTATACATCAAACTAATGAATCCCTAAAATACTATAGCTATTTTCGATGTAGAGTGAAGTGTTGTCAACGTTCTACACAATATGATAAAATCAATAGAATCAAATTTCCCGAGCAAGCAGAATTGAAAAGTAAATGGTGTTACAATTTAAATATCAAAGAGAATACTTTAAGTGCTAATGATTGGATTTGTCATAGACATTTTGAGAGAAAAGCTTTAATAGATTGTCGCAAACCCAAACCGGGAATGCTGCCTACTCTACTTTCAGACAGTTTAGCAATAAAAGGAAATGCTGAAGAAGATCTGGAGGATATAGAGAATACCTCAAAAACTTGCTGTGTTAAAAACTGTGATGGTTTGGTGggagaattttgtaaaataccCACCAAATGTGAGACGGTATATAAAAAATGGTTAGAGAACTTAAAGCTAGAAGATAAACCAGAGATTAGGGAATGCACATATGTTTGCCTAAAACATTTTGAGACCACttctttacaaaacaaaaaaagaattccACTACTAGGCTCAGTGCCCACTTTATATTTAGAGGAAAGTCATGAGCCAGCAGAAATTATAAACACCAAATGTTCACATCCTTTGTGTAGGGAAGAGACTTTACAACTATATGATTGGCCTAATGAGGGTATCTGTCACAATATTTGGTTACAGGTCAACAAATCCAGTTGGGAATTAAAACAACATGAATTAAACTGGCAACGTCATACACAAGCCATTAAATTTTGtgctaaacattttataaatctcTATGAAATCaatcataaaaacataaatagttgcaaaattttgcaaacaaataataaagaaaatttaaaacaaatttttgaaaatctcaAACCAAAGTCCACTAAACTATATAAAGTGAACAGATATGATAAATGTGTGGTATCAGGTTGTAAAACAGATCATCAATTTATGttgtataaatctttaaaactaTTTCCCTTTCCCAAAACAGATATAAGTAAAACCTGGTGCCATAATATTGATATGGACTTTAATACTTTAAAATCTCTAGCAACAGCAAAAATCTGTGAATTACACTTTGACAGCGAATGTTTTGCACAAAAGAAACTTTTGGATACGGCAATACCCACTTTAAATTTACCTAAAAAAACTAAGCGTAATGTTTTGCCCTATAATCAAGATGCAACTGGCAAATGTTGCCTTAAATCTTGTCCTAATAATCAAGGTTTAAAGGAAAAATCTCTTAATAGATTATATAAGTTTCCCTTAAACCCTCAAATGCTTAAAAAATGGTTACGTCTAACGAATTGTGCGAATTATGAGGTGAAAACTTTGCGTATTTGTGATTTACATTTTGAGAAatgtgattttaataaaaataaaactttaaaggaAACAGCTATACCCATATTATATTTAGA
TGATCTAGAAACTTCACTCAACTCTCCTCAACCCTCGGCCATAGATGATTTTATACAGGTCAAACAGGAACTAGATAATTCCGAGGAATGGTGTGAAAACTTAGATTCCTTAAATCATGATTGCCTGAATGCATATGAAGAAAGTCAAATAACGGAAACACAATTAGAGCtcaagaaattttccgaaacaGATAACTATCCTTTGTTGGATATAAAACAGGAAATTCTTGAAATCCAAGAAGAAGAACCACTTGAAACTAATCCCTGCTCGCTCTTTAGCATACAAAAACTGGAGGGAACGGAAAACTATGAATACAACCAGACAAATCGTTCTCCTCCTCTTAAAACCGATTTTGTAATAAGCGACATTAAATCTCAAATCTATTTATGTTGTGTACAAAAATGTACCAATAATTCAGAAACCCCAGGTATACGTCTATTTACCGAATTCCCTAACGATtcggaaatatttataaaatggtgttttaatctaaaaatagaTCCTCGCAATTATCAGGAAAATCAATATAATATATGTCAACAACATTTTGAAACGATATGCTTTAATGAACAAGGTCAACTACAACCCTGGTCCGTGCCCAccttgaatttaaatttaaatgaaaattcttTTATACATCAAAACGATATACCCGAACATTTGCAAACACCACCCGAACAATGTATAGTTTATGGTTGTATTAATCCGCAAAAGCCTCTTTACAAGTTCCCCTATAATCCTGATATCTCACACAAATGGTTTGCAAAtttaaaactagactataccgATTTTCGGGCACAAAATTATCGTATATGTAAAAGACATTTCCCGGCACAATGTTTTGAACACAACGACACTAATAAACTTAAAACGGATGCAGTACCCTCTATTTATTTGGGGCATACAGataaaatctattattttaaTACGACCGAGGAAATACAATTAGAACAAGAAGGTATTGCCGCTGCTGCTGCTGGGGCTGGTGGTGCTCTTAGTAATCATGATAATAGTAGAGGCAGTAGTCAGGATTCTTTAGCTCGTCTAATATCACCGCATGATTTAGAAGATCATGATAGTAGTTATTTTGAAGATTTTGAAGAGTACTACGGACAAgaggaataa
Protein Sequence
MSQNNQRKHYHIHAPYQHPQQQQQAQQHHHPHHLTAAQQQQQQHHHQQQQQQWYAPQHYQHGLHLRDSRHIQHPSAQHPHHHSHHVPHQQQQQPHHQAQHHSHSMAPHMFTSGYVAGGVGGGGGVGGVSSSGGGGVTVPSSPHNTATMGSTHNMPVSSSPAHHYSSAAALTGSSASVVGAGISGGAAPSNAPTANSGAGGAYAARNRIFDLELLATQSTAQQHAHSSSAAAHSHSMLSTAAGSSSVRSAAGFDAYSHNSLYSQPNQRHHSSPASSSHHHLAAASHSLHPHHSHHHHHAPTSTHHPHQTSLHQHHQAHHHHQQQHQQQQHYYHHAPPTSLHRSHSQVIPPMLQHIKSEPVEQITVTPSIQTEEIIIKSEPVDDMSSSYHKSAPQIENNSSFHMEEKRKQYELQKQQQQQLHQQQQQQRTQQQILHEQQLHHQHQQQQQQQIKQEPHDYPQHQSENTHNEDVSQQTQQCLNSENSTTTLLPVEQKPQKQEQQQQQQPEQQHQQISLANIKTEAKPLNFPRRKLQTERSSTLPICQRCKQVFLKRQNYTQHVAQSSCNIVEYDFKCSVCPMSFMSNEELQTHEQLHRSHRYFCQKYCGKFYETIEECEQHEYGQHEYEMFKCNICCISVTQRDQLFVHLNEHKYQPRFDCCICRLCFQTSLELHDHYINNEDFCGKFYDKEAFKKPITSSSTSSTSTTPYLGKPESSNLEIAHTFSLKDIPPANSQHLEALYNKPSSTKTSMEAANSFNASNDFPLEPQVEVKTEIKVEPDFYPPMDQSDFTAYDNDYPPADYAAGSNPNLAFLQDFQDNASSSTNSSYSFNNNDAIQDEDAICCVPKCGVRKFSSPSLQFFGFPRDEKYLAQWLHNLKMIYDPNVNYGLYRICSLHFPKRCIAKYSLSYWAVPTFNLGHDDVGNLYQNRESSGGFPAGEMAKCSMPGCPSQRGETNVKFHVFPRDLKTLIKWCQNSRLPVHSKDNRFFCSRHFEEKCFGKFRLKPWAIPTLNLGTVYGKIHDNPNIYQEEKKCFLPFCRRSRSYDCNLSLYRFPRDETLLRRWCYNLRLDPNMYRGKNHKICSSHFIKEALGLRKLNPGAVPTLNLGHNDRFNIYENELYTPPPPPPPPQPSTSSKAHKYAEMFKQEMAGSSHIYDGVFMNASSMVQKYSASAAANSNNSNNLDLGDVCLVPSCKRTRHSADITLHTVPKRPEQLKKWCHNLKMDLVKMHKSVRICSAHFEKYCIGGCMRPFAVPTLELGHDDTNIYRNPDVIKKLNIRETCCIQSCKRNRDRDHANLHRFPTHPELLQKWCENLQKPIPDGTKLFNDAVCEIHFEDRCLRNKRLEKWAIPTLNLGWDEAPHALPSEEEINENWVKPFAPNNGDEQGECCVVSCKRNPQIDDVKLYRPPEDAEQLVKWAHNLQVDVAELPNMKICNLHFEQHCIGKRLLNWAMPTLNLGAKVEHLFENPPPMPTIYKKKIKPERLISSQEAIKWSPRCCLPHCRKMRSVDKVHLFRFPYNNRQTLAKWCHNLQLPLVGSSHRRICSSHFESSVLTKRCPMSLAVPTLDLNAPPGYKIYQNPARLKQIKMGTQRKCIIESCGKTKLDGVSLFRFPNNRSILYKWRHNIKNWPKGKLSSQLRICAEHFEPHSVGERKLSPGAIPTLKLGHEAKDLYPNETRSFFDLEKCVVNGCDSRKDMEDIRLFRFPRDDDVLLKKWCNNLKMEPNDCVGIKICSKHFEPECIGPRQLYKWGIPTLKLGHREDDLVDIIPNPPPEQRAGEFLFKCCVPTCGKTRKYDEAQMNSFPKNLKLFRKWKHNLKLDYLNFKEREKYKICNDHFEAVCVGKTRLNFGALPTLNLGHDDVDDLYQINPDRIRPNLFIKQKDAERLERRRILREENREQYECEEPEEDNTDPLSLEPGDIKCCISDCSAPKSIMREPYELPQTKEFKQLWIKELGETDEEDLPSEARICGLHFQTLFSQLKAEMMELAEENTDLKSDFQK
LQYNYQKSNISLVVNSYQCRVADCPTNLLNSSIRLFFFPYGKNLVNKWSHNTGIIPDEHRRYMNKVCALHFESYCITENQRLRSWAIPTLNLPLAEDKHLYKNPDLTKLDRRMLGPQIVKCAVKNCSYQKSSEDDTIKLFNFPCDDKLLKKWCDNLKMSHHFTPLLKICSLHFEKQCFGSCRIRSWAIPTLNLGHDEAPEHLNKTTIRQELFEAPEQIADIQLKQVKIKKSLDSAKCYIASCRKSRLKHGVRFYSLPSNPKMKRKWLHNLQISQMKSSHKLQNVKICNHHFHKRCLEGKQLKAWAVPTMHLGHTDAIFDNPRKVQSLQIQRCVLVHCKNHTVAQGVRLFVFPKSPEFLEKWSKNLKLELEKCKGKICHEHFEKEVIGEKKLKNGAVPTLNLGHEDVDIYDNKELKEKIKLKSREQEVKVDPLETKENTEELYEEEYEPHSGEEEEEEEEDEELCETEIEEEEREEEVEEEEEEELEEEEEVVYYDEQEEEEDVLEEQESNSKRPQEDDELSVTTSITDWSSIKFKEVRVSITPLTPEDLMDLCSRSSYEREFGCLTPASSLRGRRSITPASSLKELRSETPEQKPNSGQLKLRSETPDNKSFFGFREPRSVTPDQKAEQLRATQTPEPKLEKLNDNHNETKTNSTVNPLKRDNTESNSFSVKRERLELSEDENTNTSLPLEMDYANSTNLRTDKALNAVAPICCLKHCGKEKTPEQHLTTYGFPKDPQLLQKWCDNLGLQPEECIGRVCIDHFELRVIGTRRLRQGAVPTLNLGPNRQAKHSNSEEMPPKKTVTKDCTETVNLPEADSNLKPPPPYKHSKTSKQSVFRLCCLKHCRRKKFVKQEKKEKDLAKEPMEQLFKFPTDETMLKKWYKNLRLPEKLSIPNDLQICAKHFESNCIIKGKLHPKAVPTLELSYANREPIYKNNPKDFETVKHKTPAKEKCFLKHCGNVNTERIFLIPFLDNESMSVKKWCKNLRLPYDRSKLKKLKICSEHFEPYVFYKRRHLRNGALPTLNLGHTGAITRNCRKLRLKRVNSNSVKEQCCIEQCKETNLKLYAFPRSFELRKIWCNNLQIELRQALNNHYKVCSKHFGIESFMVGSDNLKINAVPVLNLGIKTESHLVLSSNTQESKCLVENCQKTPSVDKVKLFKMPQKPEILKKWLFNLNLSAETYNSQDVICSKHFDKSCIKQGILHENAIPTKFLEIASKDWFYKNNEELYEMPRKCCVLNCQQTSEEAKHLYRFPKHKEDLEKWLYNLKLQVEESEVKDLRVCDRHFEQSCKISNKDLITQALPTLNLGHTDSDIYGNNFIKCCLDNCTIEGFYYHKLPEDLMLQSFWFQELEMETTFNTSLYICSVHFVAFFERILEKYSAFLKESKEYVKLSVTYNELKALPGLQSYKCNIPKCNSGFKLIWKLLKFPKDQTLFNKWLHNTGLKFDYDQRNNYRICAQHFEERCLSEKKLHRWSLPTLKLPFNNSLYVNPPEALPSHHENLKHCCVSNCPTVKGPFYKFPLKLAEAKKWIHNLDLGTQQCTLNLRVCYKHFENYCFSKAVDKIKPLKSWSVPTLKLKRKTELYLNPADKIAYYVCCINSCKQILNKSKEIYLYKFPSSNTLTQKWLHNLGLNRTQYQETMRICSDHFERDCFYKGYKLLRKYSVPTLCLNKPPKDLHTNPVRRAYLKCCVKLCKGPWDQLFNFPKDKTLLRKWCHNLQLEKEIPMESLREAKLCGQHFEKECFNRFGLIRAMALPTLKLGHRKKLFKNPNFNGKSKIKEELKEEGMTEEKVTPKAKEETEEKQQQDLEVNKELDIKLDTEKVESKSFRAKALRPLQHLRKPDKLINRYDPKAKTLRKKALSMKAKITKVVKTKPLTKKAKEKETLREKKEAKEQEINNLVEEVGKDKEGNLDEVKQLENCGVVPDIQQQDDAYLENLLEILTETDNATDSREIKEQIKQEQGNETPQKLLNIITETEVELTSRKAKEENQEQLTAQEVS
KQSEDYLQSNQEQENTFTIYEIKQELEEFKEQDSYQTQEQYLEQNKNPEQNDNLDYSQDSFTNEQLYPYEDSQDDYLQARPDYEPSDHSENELMDPEITEEPDKLPNSKGKNPIGCCIKTCRNYHKYQDHIPLFKLPNIRKLREQWLVNCKLNQRQCSAKGALRRFRICIEHFHKQCLKNHYRLLIGAVPTLKLGSPAIHQTNESLKYYSYFRCRVKCCQRSTQYDKINRIKFPEQAELKSKWCYNLNIKENTLSANDWICHRHFERKALIDCRKPKPGMLPTLLSDSLAIKGNAEEDLEDIENTSKTCCVKNCDGLVGEFCKIPTKCETVYKKWLENLKLEDKPEIRECTYVCLKHFETTSLQNKKRIPLLGSVPTLYLEESHEPAEIINTKCSHPLCREETLQLYDWPNEGICHNIWLQVNKSSWELKQHELNWQRHTQAIKFCAKHFINLYEINHKNINSCKILQTNNKENLKQIFENLKPKSTKLYKVNRYDKCVVSGCKTDHQFMLYKSLKLFPFPKTDISKTWCHNIDMDFNTLKSLATAKICELHFDSECFAQKKLLDTAIPTLNLPKKTKRNVLPYNQDATGKCCLKSCPNNQGLKEKSLNRLYKFPLNPQMLKKWLRLTNCANYEVKTLRICDLHFEKCDFNKNKTLKETAIPILYLDDLETSLNSPQPSAIDDFIQVKQELDNSEEWCENLDSLNHDCLNAYEESQITETQLELKKFSETDNYPLLDIKQEILEIQEEEPLETNPCSLFSIQKLEGTENYEYNQTNRSPPLKTDFVISDIKSQIYLCCVQKCTNNSETPGIRLFTEFPNDSEIFIKWCFNLKIDPRNYQENQYNICQQHFETICFNEQGQLQPWSVPTLNLNLNENSFIHQNDIPEHLQTPPEQCIVYGCINPQKPLYKFPYNPDISHKWFANLKLDYTDFRAQNYRICKRHFPAQCFEHNDTNKLKTDAVPSIYLGHTDKIYYFNTTEEIQLEQEGIAAAAAGAGGALSNHDNSRGSSQDSLARLISPHDLEDHDSSYFEDFEEYYGQEE

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-