Basic Information

Gene Symbol
-
Assembly
GCA_963682025.1
Location
OY821518.1:291641115-291656257[-]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 34 9.3e-15 1.6e-11 47.5 1.5 1 86 231 303 231 304 0.85
2 34 1.7e-14 2.9e-11 46.7 5.1 1 87 331 400 331 400 0.81
3 34 6.3e-15 1.1e-11 48.1 0.2 1 87 421 493 421 493 0.84
4 34 1.5e-13 2.5e-10 43.7 3.4 1 86 575 643 575 644 0.78
5 34 2.2e-14 3.8e-11 46.3 4.2 1 87 668 740 668 740 0.80
6 34 1.4e-11 2.4e-08 37.4 0.7 1 87 775 843 775 843 0.81
7 34 3e-10 5.2e-07 33.1 1.9 1 85 884 952 884 959 0.72
8 34 1.6e-15 2.8e-12 50.0 0.4 1 86 981 1050 981 1051 0.81
9 34 4e-14 6.9e-11 45.5 1.0 1 86 1073 1142 1073 1143 0.80
10 34 2.9e-12 4.9e-09 39.6 2.9 1 86 1171 1242 1171 1243 0.85
11 34 2e-06 0.0036 20.8 0.6 1 63 1311 1367 1311 1387 0.72
12 34 4.1e-10 7.1e-07 32.6 0.2 1 87 1407 1479 1404 1479 0.80
13 34 4.2e-13 7.4e-10 42.2 2.1 1 87 1510 1581 1510 1581 0.79
14 34 5e-12 8.7e-09 38.8 5.3 1 87 1626 1699 1626 1699 0.83
15 34 3.3e-12 5.7e-09 39.4 1.3 1 87 1721 1789 1721 1789 0.80
16 34 4e-13 7e-10 42.3 0.1 1 87 2076 2145 2076 2145 0.80
17 34 1.5e-11 2.7e-08 37.2 5.4 1 86 2203 2287 2203 2288 0.77
18 34 5e-12 8.7e-09 38.8 0.8 1 86 2354 2426 2354 2427 0.79
19 34 4.2e-12 7.3e-09 39.0 0.6 1 87 2454 2523 2454 2523 0.81
20 34 5.8e-11 1e-07 35.4 1.4 1 87 2546 2616 2546 2616 0.78
21 34 9.9e-09 1.7e-05 28.2 0.0 19 87 2650 2708 2639 2708 0.77
22 34 0.00047 0.81 13.2 0.6 1 58 2725 2771 2725 2796 0.79
23 34 2.8e-12 4.9e-09 39.6 3.2 1 86 2813 2882 2813 2883 0.82
24 34 5.1e-13 8.8e-10 42.0 6.7 1 86 2908 2978 2908 2979 0.83
25 34 3e-12 5.2e-09 39.5 3.9 1 86 2999 3071 2999 3072 0.80
26 34 4.1e-10 7.1e-07 32.6 0.8 1 86 3092 3159 3092 3160 0.79
27 34 1.8e-10 3.1e-07 33.8 3.7 1 87 3653 3736 3653 3736 0.71
28 34 8.7e-07 0.0015 22.0 2.4 1 86 3759 3827 3759 3828 0.75
29 34 5.3e-10 9.1e-07 32.3 1.1 1 87 3858 3932 3858 3932 0.77
30 34 0.47 8.1e+02 3.6 0.2 21 58 3987 4028 3974 4050 0.66
31 34 3.8e-09 6.6e-06 29.5 1.9 1 87 4075 4151 4075 4151 0.85
32 34 4.8e-12 8.3e-09 38.8 1.3 1 85 4175 4246 4175 4248 0.85
33 34 7.7e-11 1.3e-07 35.0 6.0 1 87 4464 4535 4464 4535 0.80
34 34 2.8e-11 4.8e-08 36.4 0.5 1 86 4560 4627 4560 4628 0.81

Sequence Information

Coding Sequence
ATGCCAAGTGCGTGGAAATTGGGTCGTGTAGTACcgatcgccaaaaataaaaaccaCAATGTGATATGTTGTATATCTTTATCACAACGTGATCAACTATTTACCCATCTAGCTGAACATGGACATCAACCACGTTTTGATTGCTGTATATGTCGTTTATGTTTTCAGACTTCTTTGGAATTACATGAACACTATATGACAAATGAAGaattttgtggtaaattttaTGATAAACAAGCCTTCAAAACGCAACCAATTACCTCTTCACCTTATTTGGGAAAACCGGAAAGCTCACATTTTGAAATTgcaaatacattttcattaaaagatatTCCTCCTTCAAAATGTCGTGACTTGGAACCTTTATATGTTAAACAACAACAGAAATCAAAAACTTCCCAGGATTCACGTGCACCTTCATCGCCTCCACCATACCATAATATACCGGATTTTCCATTTGAACCACAGGTTGAGGTGAAAACTGAAATAAAAGTAGAACCTGATTTTTATCCGCCCATGGATCAATCCGACTATTCATCCTACGACAATGATTATGGTGCAACAGATTATCCTCAAAGTTCTAatgaaaatttagcttttctaCAAGATTATCAGGATAATGCTTCTAATTCTACCAACTCATCGTTTTCCTTAAATACTAATGATCCCATACAAGATGAAAATGCTATATGTTGTGTACCGAAATGTGGTGTACGCAAAATGTCTTCGCCGTCTTTGCAATTCTATACGTTTCCCCGAGATGAAAAGTATCTTTCCCAATGGTTACATAATTTGAAAATGTCTTATGATCCCAATTGTAATTACGGTATTTATCGTATATGTAGCTTGCATTTTCCAAAACGATGTATAGCTCGATATTCATTAAGTTATTGGGCAGTGCCAACATTTAATTTGGGCCACGATGATGTGGGAAATTTATATCAAAATAGAGAAAGTTCGGGCGGTTTTCCGGCTGGGGAAATGGCTAAATGCAGCATGCCCGGTTGTCCCTCGCAAAGAGgggaaacaaatgttaaatttcaTGTTTTTCCTCGCGACTTGAAGACTTTAATAAAATGGTGTCAGAATTCCAGATTGCCGGTGCATAGCAAGGAGAATAGATTCTTTTGTTCtaaacattttgaagaaaaatgttttgGGAAGTTTAGGTTAAAACCTTGGGCTATACCCACTTTGAATTTAGGCACGGTGTATGGAAAAATCCATGATAATCCCAATATTTATCAGGaagagaaaaaatgttttttgcccTATTGCCGGCGTAGTAGATCATATGATTGTAATTTATCCTTATATAGATTTCCAAGAGATGAAACTTTATTGAGAAGATGGTGCTATAACTTAAGATTAGATCCCGAGATATATCGTGGTAAAAATCATAAGATTTGCTCTTCTCATTTTATAAAGGAGGCTCTTGGTCTAAGAAAACTCAATCCTGGAGCGGTGCCTACTCTAAATTTGGGACACAATGATCGTTTTAATATCtatgaaaatgaattatatacCCCACCACCGCCACCCTCGCCCCAACCTTCGACATCGTCGGCTTCGGCAAAGGCTCAGAAATTTGCAGAAATGTTTAAGCAGGAAGCAGGAGGGTCGCACATATATGATGAGGTTTTTATGAATTCTATGATACAAAAATTCTCTTCATCCTCAAATTCAAATGCCTCTTCAGCTCAATTGGACTTAGGTGATGTCTGCTTGGTGCCCTCATGTAAGAGAACGCGCCATTCGAGTGACATAACTCTTCATACTGTGCCTAAACGTGCCGAGCAATTAAAGAAATGGTGCCATAATTTAAAAATGGATTTAGAAAAGATGCACAAAAGTGTGCGAATATGTAGTGCCCATTTTGAGAAGTATTGCATTGGCGGCTGTATGAGACCCTTTGCTGTTCCAACTATTGAATTAGGTCACGACGATGCCAATATATATCGAAATCcggatgttattaaaaaattaaatattcgtgAAACTTGTTGTATAGGCTCATGCAAAAGAAATCGCGATCGTGATCATGCCAATTTACATAGATTTCCCACACATCCGGAGCTTTTGCAAAAATGGTGTGAGAATTTACAAAAACCCATACCAGATGGTACAAAACTTTTTAATGATGCTGTTTGTGAAATTCACTTCGAGGATCGTTGTTTGCGCAATAAACGTTTAGAAAAATGGGCTATTCCTACTTTGAATTTGGGCTGGGATGAAGCTCCTCATATTTTGCCTTCGGAGGAGGAAATAACAGAATACTGGACTAAACCGTTTGCTCCTAACAATGGTGACGAACAAGGAGAATGTTGTGTCTCCACCTGTATGCGTAATCCTCAAATAGATGATGTTAAACTATATAGGCCTCCAGAAGACCCAGAACAAATGGTTAAATGGGCCCACAATCTTCAAGTGGACGTAGTAGAGTTGTCAAATCTCAAAATCTGTAGTTTACATTTCGAACAACATTGTATAGGCAAGAGATTGCTCAACTGGGCTATGCCAACTTTAAATTTGGCCTCAAAAGTGGAGCATCTTTTCGAAAATCCACCTCCTGTCCCTAATTTCTATAAGAAAAAAGAGAAACCCGAGAGAATTCACTCAACAGCAGATGTTATTAAATGGTCTCCCAGATGCAGTTTGCCACATTGTCGTAAAGCGCGTACCTTGGACAGAGTACACCTCTTTCGTTTTCCTTATAATAATCGGCAAACCTTGCATAAATGGTGTCATAATTTACAATTGCCTTTGGTGGGTAGTTCACATCGTCGCATATGCTCTAGTCACTTTGAATCTTCGGTATTAACCAAACGATGTCCTATGACTTTGGCGGTACCCACATTGGACTTGAATGCTCCGCCCGGCTACAAAATATATCCAAATCCTTCACGTTTGAAACAGCTAAAGGTAGGCCCACAAAGACAGTGTGTTATAGAATCATGTCATAAAACTAAATTAGATGGTGTCACACTGTTTAGATTCCCCAATAATAGAGGTATCTTACAGAAATGGCGCCATAACATTAAAAACTGGCCTAAAGGAAAATTAAGTAATCAAGTAAGAATTTGTTCTGAACATTTTGAAGCGCACTCTGTAGGTGAAAAGAAATTATCTCCCGGAGCCATACCCACCATACGTTTGGGTCATGAGGATAAGGACCTATATCCAAATGAGACCAGATCATTTTTCGATCTGGCAAAATGTGCAATAACAAATTGTGACTCTAGAAAAGAAATGGAAGATGTAAGACTTTTTAGATTCCCACGGGATGATGATGATTTACTTAAGAAATGGTGTCATAATCTTAAAATGGAACCCAATGATTGTGTTGGCATACGAATTTGCAGTAAACACTTTGAAATGGAATGTTTAGGTCCTAGATTGCTCTATAAATGGGCTATTCCCACACTTAAGTTGGGACATAAAGAGGATGATGCGGTTGATATAATACCCAATCCTCCACCCGAACAACGCACTGGAGAATTTCTTTATAAGTGTTGTATACCCAGTTGTGGTAAGACTCGCAAATATGATGAGGCCCAAATGAATAGTTTTCCTAAGCATTTGAAGTTGTTTAAGAAATGGAAACATAATTTGAAATTAGACTTCTTGAATTTCAAGGAAAGGGAAAAATATAAGATCTGCAATGATCATTTTGAGCCAGTGTGTGTGGGCAAAACGAGGCTTAACTTTGGTGCTTTGCCCACCTTAAGATTGGGTCATGGCGATGTAGaggatttatataaaataaatccaGAAAGAATAAGACCCAATCTATTTGTTAAACAAAAGGACATAGAAAGGATTGAAAGAAAAatgattttaagaaaagataatcaAGAATATGAGGAGGCCATTGCAGAACCAGAAGATGATGTATGTGATCCTTTGGGTTTAGAGGTCACTGATTTAAAATGCTGTGTGGTTGATTGTAAAGCACCCAAGTCAATAATGAGGGAACCTTATGATTTACCCGAatcaaaagaaataagaaatctATGGCAAAAGCTATGCCACGAAGAAAACGTGGATTTGCCAACGGAAGCCAAAATATGTGGCTTACATTTCCAGCAGTTGTTTAAACAATTAAAACCACAAATGCTGAATTTATTGGAGAAAAATCCAGAACTTAAATCGGATTTCACTAAATTACAGTACATGTATCAAAAATCCAATATTTCCCTAGTTATTTGTAGTTATCAATGTCGAGTGTCTAGCTGCGCCACGAATTTACTTAATTCGGATATAAGATTGTTTTTCTTTCCATATGGTAAAAACCTTATAAGCAAGTGGTCTTACAACACTGGCATTGTACCCGATGAACATAGGAGATATATGAATAAGGTGTGTGCTTTACATTTCGAGCCATACTGTATAAGCGAAAATCAAAGATTACGAAACTGGGCTATTCCTACATTGAATTTACCACCTCTTAAGGAGGAAGAAGAAATATATCAAAATCCCGATCTGACGAAGATAGATAAGAGAATGTTGGGTCctcaaattttaaaatgtgccgttaaaaattgtttaaccgATAAAATGACTGAAGATGAGTCGCTCAAGCTTTTCAATTTCCCCTCGGAAGAAGCTTTACTCAAAAAATGGTGTGACAATTTGAAAATGTCCCATGAACTTACACCGTTGATGAAGATATGCTCTTTACACTTTGAAAAATTCTGTTTCGGTAGTTGTCGCATAAGATCTTGGGCTATACCAACACTAAATTTGGGTCATAATGATAGACCTCAACATTTGAATAAAACTACCATAAGACAAGAAGTATATGAGGCTCCCGACATTACAACCAAGGCCCAATTGAAACaagttaaaatcaaaaaatccttGGATACTACAAAATGTTTTATATCTTCCTGTCGTAAAAGCCGCCTACAACATGGTGTACGATTCTATAATCTACCAACCAAGATCAAAATTAAGCGTAAGTGGTTGCATAATTTACAAGTAACCAAACTGAGATCAGCACAGAAACTTCATAATATTAAAGTTTGTAATCTACATTTCCAAAAAAAGTGTTTCGAGGGTAAACACTTGAAATCATGGGCCGTACCCACCATGCATTTGGCTCATTCCGAACATATTATCGATAATCCACGTAGATTGCAAGGATTGCCAATTATAAGATGTGCTCTCTCGCATTGTAAGAATAAATTGTCGATCAAAGAGATTAGATGTTTCGTGTTCCCTAAGTCACCTGAATTTTTGGAGAAATGGTCTAAAAATCTCAAATTGGACTTAGAAAAATGTAAGGGACGCATATGCATAGATCATTTCGAAAAGGCAGTGATaggagcaaaaaaattaaaaaatggagcGGTTCCTACTTTAAATTTGGGACATGATGAGCAAATATACGACAATCTagaattaatacaaaaattaaaattgaagaacATACAAAAGGAACAAAAATTGCAAATGTGTGATAAAAACAAGGATAAAGAACAGGAGCTACAAGATAATACGGCCGTAGCAGATTATGAAGGGGAGGAAGATGAAGAATATGAAGATGAATATGAAATGGAAGATGAAAACGAGGATGAAAACGAAGAGGAATACGAGTATGAGTACGGGGACGAGAATGAGGAAGAAGAAGATGATGATGACGACAAAGTTAGCATCTCCACTACGCTATCCCATTGGAGTTCTgtaataaataaagaattaagaGTAACCCTAACACCCATGACTCCGGATGATCTTTACGATTTATGCTCTCGTTCCTCATATGAAAGAGAATTTGGCGCTATGACCCCCAGTGGTCGACGGTCTGTCACACCATCCACAAGTATCAAATCAGAAACTGCAGACCAAAAGCCTTGTTTCAGAGACATAAGTTCCGATAATCTGAATCAAAAGCCAGATAATTACTTCAAAGAACCTCATACGGAATGTTTGGAGCAAAAACCAGATAAACTATTCAGAGAACCAAGATCACAAACACCCGAACAAAGTCGCTTAAGAGAACCACGAGCCCAAACACCTGATCAAAACCTCTTTAGAGTTCCTCGTTGTCAAACTCCTGACCAGAAACTTCAAATATTAAAAGAAACGCTCGACACGGAacgttttgaaataaatttaaaacaggaAGAAGGAATTTTATCTGCCGAACTGAACGAAGAGTTAGATAATAATTCTAGCACTGCTCTGAGAACCGATAAAACTCTCAACAGCGTTGCCCCCATATGCTGCTTAAAATACTGTGGGAAAGAGAAGACACCTGAACAACATTTGACAACCTATGGTTTCCCTAAGGATGCCCAATTGCTACAGAAATGGTGTGAAAATTTGGGATTGCAACCCGAAGAATGTATCGGTCGTGTTTGTATAGATCACTTTGAATTGAGAGTAATAGGAACACGGCGACTTAGGCTGGGAGCGGTGCCTACCTTAAATTTAGGGCCAAATCGTTCAGCAAAACATAGCAACACCGAAGAACcgcaacaaaaaaaatctttagcAAAAGATTCAAGCGAATCGGGAAATGTACAAGATACAGATCTGAAACTATTGCCACCACCACCATATGCAACACCCAAACCAGCTAAGCATTCGGTTTTTCGGCTATGTTGCCTCAAACATTGTCGCCGCAAGAAGGTGCTGACGAATATAAAGATGGACGAGCAGAAGATGGAGCAGGTTAAGACATGCAATAAATTATTTAAGTTCCCAAAAGATTTGAATATACTAAAAAAATGGTGTAAAAATTTACGACTACCGGAAAAATCTTGCCTAAGATACGATTTGGAAATATGTGAAAAACATTTTGATGCACAGGTGATACAGGGCGAAAAATTACATCCCAAAGCAGTGCCTACTCTGCAGCTTAGCTATGCAAATCGTGAACCAGTTTATACAAATAATCCTAAAGATTTTTTAACCGCATCCTGGAACACAAAAAATGAAGACCCCAATAATGATCGCAATAAATCGAAGTTTAAAGAAATTAGTACGAATATTCCCCAAAATTGGTCTAATAAATCCAAATCATCACTGCAAAAGATTAAAATGGAAGAAAAATGTTTCCTTAAACATTGTGGCAAATCGAGAGACTCTGACGAATTATTCCTTATACCCTTTCCCCAACACGGCATGTCCTTACAACGTAGATGGTGTAAAAACTTAAAACTCACCTCCAAATTAAGCCAGCATAAAGATTTAAAGATATGCAGTGACCATTTCGAGCCCTATGTCTTCAATAAACGCCGACTTTTAAAGACCGGAGCAGTGCCCACCCTTCAGTTGGGTCATTCCGATGAAATATGTAGGAATTTTCGTAGATTACGTTTAAAAAAGGTGGCCTCAAGTAAAAAAGAACAATGTTGCATTGCTACATGCCAGGAATCGAACCTCAAACTCTATGCCTTTCCCAAAAGTAGTGAACTGCGTAAAATATGGTGCAATAATTTACAAATTGAAGTACGAAAAGCTTTAAACAATCACTATAAAGTATGTGGAAAACATTTTTCACTGGAAAGTTTTATAGTGGGAACGGATAATTTGAAGTTAAATGCTGTACCCATTTTAAATTTAGGTCTGCAAAGTGAAAATCACATACAGGTTAAGAATAAATCAAATGATAATGACACTTTAAAGTGTTTGGTAGAAAATTGTCAAAAGACTCCTAGTGTAGATCGAGTAAAACTTTATGGTTTTCCCAcgagaaaagatattttaaaaaaatggttattcAATTTAAATATCAGCTTGGAATCTCTTAATGAAAATTCTTTAATATGCAATAggcattttcataaattttgttttagaaatggAACTCTTCATGAAAAAGCCATACCTACTCAATTTCTAGATGTTTCGCCTAAAGGATGGTTCTATCAAAATAATAAGGAATTTTTTGAAGATCCAAAACCCAAATGCCTTTTGCAATGTTTGGATTTGGGACAGTATTTATATAAGTTTCCCAAACAGAAAGATGAATTGGAAAAATGGATTttcaatatgaaaattaaaatcgaagAAAGTGAATTACAGAAGTTGAGATTGTGTGCTTTACATTTTGAGGAGTCTTGTAAAATTCCACAAAAAGACATGTTATTAGCTGGTTCTTTGCCTACTTTGAATTTGGGTCACGATTATTCGCAGGgtatttatcaaaataattttgtaaaatgttgtttAGAGACGTGTTGCCTTGAGGGATTTAAGTTTCATAAATTACCCGAAGATTTAATGCTACAAAGCTTTTGGTTTCAGGAGCTTGAAATGGAGACGTCCTATAATAATTCCCTGTATATATGTTCGGTACATTATGTCTCCTTATACGAAagagttttggaaaaattttctgcCTTTTTAAAGGAATCTAAGGAATATGTTAAGCTGACCTTAATATATCAAGAGCTTAAAGTTTTACCAGAACTAAAAAGCTTCAAGTGTCATATACCCAACTGTCCCTCGGGATTTAAGCTTATATGGAAGCTGTTTAAATTTCCAAAAGATGAAAGTTTATTCAATAAATGGCTTCATAACACAGGTTTGGAATTTGAATACTCTAATCGTCATAATTATCGTATATGTGCCCAACACTTTGAAGAACGATGTTTAAGTGAAATAAAGCTGCACCGTTGGTCGCTACCCACATTAAAGTTGCCATTCAATAATAGTTTATATGTTAATCCTCCCGAAGCTTTACCCTCCAATCATGAAAATCTAAAACACTGTTGTGTTTCGAATTGTGTAACGGAAAAGgaaccatttttcaaatttcctaAACAGCACATAGAAGTTAAAAAATGGATTCATAACTTGAAATTGGGCTCCCAGCAATGTACCTTAAATTTAAGAGTGTGCTATAGACATTTTGAAAGTTACTGTTTCATCAAAGAAGACAATAGAATTAAACAATTGAAATCATGGTCAGTGCCTACCTTAAGATTAAATCGTAGAACAGACTTGTATCTAAATCCTCCCGATAAAATGGCTTTTTCCGTATGTTGTCTACCTTCTTGTCGGCAGATTTTAAATAAATCGAATCACCTGTATTTATTTACATTTCCCAAAAGTAATACTTTAAAACATAAATGGtttcataatttgaatcttaACCCCCAAGACTATAAAGAAAAAATGAGATTGTGTGCCAGGCATTTTGAAATTGATTGTTTTGATAGAAGCTCTAAGCTTTTGCGTAAACACTCTGTACCCACTTTAGGCCTTTCCAGGCCACAAACGGAATTATTTAAGAATCCTGTGCGAAGACCCCATTTAAAATGTTCTGTTAAACAGTGTAAAGGGCCCTGGCCGGAATTAATAAATTTCCCCAAAGATAAAAGTTTACTCAGAAAATGGTGTCATAATTTACAAATCGATATTAAATTAGAATTGCTGCGAAACTGGAAAATTTGTGGTAAACATTTTGAAATGAAATCTAGAAATAAAAATGGCTTAATTCGAGATTTAGCCGTGCCCACATTGAAATTgggccacaaaaataaaaaaatatttaaaaatcccaTCTCTCAATCTGCACAACACACTAAACAGTGCAATTCTTCCTTACAGATAGTGGAAACAGTGGTTAAGGAGAAACACGTTAAAGCCACAATTGCTACTGAAGTCAGGAAAAAAGCTGTAAAAGCAAAGACACAGCACTTGAAAAGTCTAagaataaaaaatcaacaaagaaTTCTAAAATTAATCGCCGTTAAAGAGGCCAAAGAGCGAAAAAGACGAATGAGCACTTCAAAAATAAGAGATTTCAAAatgcaacaaaaacaaacaacttCAAAAGCAACTCAAGAGATACgggtaattgaaaaaaaattagagcACTCTGATGTATTTAGTTCCTCCAAACTGGTAGAAGAAGAAGAACAAGAAGGAAATAACTTTACAAAGATAAAAGCTCCCGAATGTAAAATAGATCATTTGGACAAGACAAAGGCCTTAGGTAAAACATTAACTGTAGCAACAGAAGAAGAAGAGAAGAACTTTACAAACATAAAGGCTTCTGTAGAGAATCAAGGCTACAACAAAATATCTAAAGTAGATCATTTAGAAAAGCAACAAGTTTTAGGTAAGACTGTCACAGtagaaacaaaagaaaactgTTTCGAAATGGAAACATTTCCCGCAGGTTCGATACTTACAGAAAACGAAAATATCTTAAACGAAGAAACCTATTTACAAGATTATATGAATTACATAGAGAATGAGGAATCAACTGAAGATATATTATTAGAGAGTTATAAAACTTTTCAAAAGCTAACGACACATGGAACAATCGAAACTCACAATCAAACAAactctattgaaaataataaaggcACATATATGGACAAACATCAAGAAGAATGTATTCAAACCTCTCAACAACTTTTACAAAGTTCTACAGAAATCCTAAAACAAGAGGTTCGAGAAGACTGTGCTCCCTCTAAAGAAAATCAAGAAGAgagtattcaatctatgaacgaaaCCTCTCAACAGCTTTTGCTAACTCCTACAGAAATCACGAAACAAGAGTCTCGGCAAGAATACGAAGCAGAGTGGGCCCAAAAagataataaagaagaaagtagTAAATATTTGCAAGAAACCGACAAACAGCTTTTGCAAACTCCTACAGAAATCCCAAAACAAGAGCTTCGACAAGACTGCGAAGCAGAGTGGGCCCAAAAAGATAATCAAGAAGAGAGTATGAACTCTTTGCAGGAAACCGACCAACAGCTTTTGCAAACTCCTACAGCAATTGAAGAGAGCCAAACTTTAAAAGTAAAACAAGAGTCTCCGGAAGACGACGAAGAATATCACCATATGAAACAAGATTACTTCGAAACGGAAACATACTTCCTGCAAGATGTATTCCAAGAGAGATATGCCAATGAAAACACTCTCACAAAAAATGAGGAATTGTTGGAGGAGGACCCTAAAAGAAACAAACGCATACCAATTTATTGCTACTTTAAACACTGTCCCAATCATTTAACTTGCAAGGAAGAtgtacaattatttaaatttcctaCCAATCAAAATCTTAGACAAGTTTGGATTGAAAATTTTCCATTATCGCTTGATTTATTGCCAAAAACAAAGAAAGCTAAAAATAAGAGTAAAAGAGTCTGGCTAAATAGAATATGTGCCGAACACTTTGAAAAGCAATGTTTTAGCGAAACAAGACTCCTTTATGGTTCGATACCTACTTTAAATCTGGGAGTTGAGTCGAAAATTAAGCGCAACTGTGAGGAAAGTTTAAAATGCTTTGCGAAAAACAGATGTAAAGCTCCACACTGCCAAAGGTCTCAGCAATACGATGACATTTTCACGATAAAATTTCCCCAAggagaattaaaaactaaatggtGTTTTAATTTAAACCTAAAAGAAGAAGATATTTTGCCAACTGACTGGATTtgtaataaacattttgaaaaacgAGTTCTATTGGGAAAAACACCTAGATCTCATGCTGTGCCCACATTATGTTTAGGTGATGCAGTGAAATCTTCCGATTTATATCAGAGTCCGGAATATATTACGGTTACAGAGTATGCCAAACAATGTTATGAAAAATGCTGTGTGAAGTCCTGTCCGAATACCATACATAATTCAGATATTACCTTTGCAACCTTTGTAAATTTGAAAAGTGCTGATATTTATAAGAAATGGTTGTTTAATTTAGATTTAGAGCATTCCACAGACGTAAGACTCTATTATCGTATTTGCTTAGCACATTTTGAAAGCGTTTGTATTACAAAGTATTGCAGATTAAAAGTCGGTTCAATACCCACCTTAAATTTAGATAAAAAGGACAACTTATTTGAAATTGATCGAAATGCTTTGGAGAATGCAGGGGATAATTTAAGtaggaaaaaaactaaaaagcccCTAATACAAAATACTTTAATTTCAAGATGGATATGTCAACATCCCCAATGCCTCAAATTAAGAAATACTATACAAAAATGGCCTAACAGAGGTATATTCAAAACTATATGGCTACAAACATTAGCAAAATATCCCGAAACAAAGTATAATAATATTAAGGACAAGGTTGCAGAACTAGATGTATGTGATgaccatttttatatattatataaaacgaATAAAACATATATGGAATCATTCTTTCAAGACGAGCAAGGTGATTTAAGAGAACTATTTGATGCACTCAAAACCAATTATGAGGATTTATCTGAACATCGAAAATTCATATCAAGACGCTGCGTCGTGCCACAGTGTTGCACAGATCAATTGCTCGACCAATATAAGAATATAACACTTTACAGTTTCTCTACACTTCCAGACGTAGCAGCAAAATGGTGTTACAATTGCAATATCGATATTGATgtcttgcaacaaaaaaatatcatttacaaaattTGTAAACTTCACTTTGAACCCTATTGTTTCACCAAGAGATACATGTACCATTGGTCGGTGCCCACTTTAAACTTACCTGATTCAAAGCCGGAAATTATACTAGAAAATGATCCCGACAATAAGTTTGCCTACTCGGGTCAGTGTTGCATTAAATCGTGTCCGAATGCAAAGGGCTTGGATCGAacaacaaaatctaaattttttagaTTCCCTGATAATCCAGAAATGCTCGTGAAATGGCTTCGAATTTTAAATTGTGGAAATGTCGATTTAAATCAGATACGAATATGCGGTTTACATTTTCACCATTCCCACACTTATAAAGGCACCGCATTACGACAAAGGGCTTTTCCCACCGAACGATTGGAAAGAGTTATAAATGAAGATTATGTCGACGAAGAGCCAATTGTCGAAATGGACGAAAATATAGAGGTGAAACAAGAGCTAGATAATTCCGAAGACTGGTTTGAAGATCAAGAAGATCAAGAGCTTTACAATGAAAGTCGTCGTCCATTGAAAAGAAAATCGCCGGAATATTTTCAGGAAAATCAACAACCAAAATACGAAAAGGTTCGAATAGAAACAAAGGAATTTGAATCGATTGAAATCAAAGAAGAATTACTGGATATACAAACACAAGATAACGACTTAACAACAGAACTTATTCAATACTCTCATAATAATAAAGCACAAACTTTGAAACAACATACACCAGAATCAACATACAAAGCCGGACCAATTGAATGTTCATCATACAATTTGGAAATTACAAAACCTCAGTCATTTACTAAAAATaccttacaaaattttaataattccttaAATACCTTAACAACAAAcataaaagaagaaataatagaaatattCGAACAACAAGTAACAGATAATTTTGAATATAACATTAAGTCCATAGCTCTTCGAAGGCCGGAACAACAAATGAAGGAACCCAACAACTCTGAAGAATCGTCTAATTTAATTATTACAGATGTTAAATCTCAAATCTTACTGTGTTGTGTACAAAAATGTTTGAATTCCTCGGACGACGCACAGATGTTTACCGATTTTCCTAATGATtcggaaattttcataaaatggtgtttcaatttaaaaatcGATCCTCGTAATTATCAAGACAAACAATATGCTGTGTGTCAGCAACATTTTGAATCGTTCTGCTTTACGGAAAATTCCTCCTTGCATACTTGGTCCGTACCGACGTTGCATTTAAATTTACCTGAAAATTCATTTATACATTTGAATGATCCACCCCAGCACCTGAAACCAGCCACCGAACAATGCATAGTATATGGTTGTATAAATCCCATTCAGCCTCTCTATAAATTTCCTATAAATCAAGATGTGTCCCATAAATGGTTCACAAACTTAAAACTAGACTATACGGATTTTCGAGCTCAAAACTATAGAATTTGCAAGAGACATTTTAGCACACAATGTCTTGATAATTCCAATAAACTTAAAATACAAGCAATACCCACTTTATATCTGGGCCATACAGACAAAGTAATCTATTTGAATCCTTGGGAAGAATATCACCATCAATATAGTTTACCGCAACCAGTGCAGCCAGACAATAGTCGGGGTAGTAGTAGGCAGGGCTTTCTAGTACGACCATTAATATCACCTCACGATTTAGAAGATCATGATAGTAGTTATTTTGAAGATTTCGAAGAACATTATGGTCAGGacgaataa
Protein Sequence
MPSAWKLGRVVPIAKNKNHNVICCISLSQRDQLFTHLAEHGHQPRFDCCICRLCFQTSLELHEHYMTNEEFCGKFYDKQAFKTQPITSSPYLGKPESSHFEIANTFSLKDIPPSKCRDLEPLYVKQQQKSKTSQDSRAPSSPPPYHNIPDFPFEPQVEVKTEIKVEPDFYPPMDQSDYSSYDNDYGATDYPQSSNENLAFLQDYQDNASNSTNSSFSLNTNDPIQDENAICCVPKCGVRKMSSPSLQFYTFPRDEKYLSQWLHNLKMSYDPNCNYGIYRICSLHFPKRCIARYSLSYWAVPTFNLGHDDVGNLYQNRESSGGFPAGEMAKCSMPGCPSQRGETNVKFHVFPRDLKTLIKWCQNSRLPVHSKENRFFCSKHFEEKCFGKFRLKPWAIPTLNLGTVYGKIHDNPNIYQEEKKCFLPYCRRSRSYDCNLSLYRFPRDETLLRRWCYNLRLDPEIYRGKNHKICSSHFIKEALGLRKLNPGAVPTLNLGHNDRFNIYENELYTPPPPPSPQPSTSSASAKAQKFAEMFKQEAGGSHIYDEVFMNSMIQKFSSSSNSNASSAQLDLGDVCLVPSCKRTRHSSDITLHTVPKRAEQLKKWCHNLKMDLEKMHKSVRICSAHFEKYCIGGCMRPFAVPTIELGHDDANIYRNPDVIKKLNIRETCCIGSCKRNRDRDHANLHRFPTHPELLQKWCENLQKPIPDGTKLFNDAVCEIHFEDRCLRNKRLEKWAIPTLNLGWDEAPHILPSEEEITEYWTKPFAPNNGDEQGECCVSTCMRNPQIDDVKLYRPPEDPEQMVKWAHNLQVDVVELSNLKICSLHFEQHCIGKRLLNWAMPTLNLASKVEHLFENPPPVPNFYKKKEKPERIHSTADVIKWSPRCSLPHCRKARTLDRVHLFRFPYNNRQTLHKWCHNLQLPLVGSSHRRICSSHFESSVLTKRCPMTLAVPTLDLNAPPGYKIYPNPSRLKQLKVGPQRQCVIESCHKTKLDGVTLFRFPNNRGILQKWRHNIKNWPKGKLSNQVRICSEHFEAHSVGEKKLSPGAIPTIRLGHEDKDLYPNETRSFFDLAKCAITNCDSRKEMEDVRLFRFPRDDDDLLKKWCHNLKMEPNDCVGIRICSKHFEMECLGPRLLYKWAIPTLKLGHKEDDAVDIIPNPPPEQRTGEFLYKCCIPSCGKTRKYDEAQMNSFPKHLKLFKKWKHNLKLDFLNFKEREKYKICNDHFEPVCVGKTRLNFGALPTLRLGHGDVEDLYKINPERIRPNLFVKQKDIERIERKMILRKDNQEYEEAIAEPEDDVCDPLGLEVTDLKCCVVDCKAPKSIMREPYDLPESKEIRNLWQKLCHEENVDLPTEAKICGLHFQQLFKQLKPQMLNLLEKNPELKSDFTKLQYMYQKSNISLVICSYQCRVSSCATNLLNSDIRLFFFPYGKNLISKWSYNTGIVPDEHRRYMNKVCALHFEPYCISENQRLRNWAIPTLNLPPLKEEEEIYQNPDLTKIDKRMLGPQILKCAVKNCLTDKMTEDESLKLFNFPSEEALLKKWCDNLKMSHELTPLMKICSLHFEKFCFGSCRIRSWAIPTLNLGHNDRPQHLNKTTIRQEVYEAPDITTKAQLKQVKIKKSLDTTKCFISSCRKSRLQHGVRFYNLPTKIKIKRKWLHNLQVTKLRSAQKLHNIKVCNLHFQKKCFEGKHLKSWAVPTMHLAHSEHIIDNPRRLQGLPIIRCALSHCKNKLSIKEIRCFVFPKSPEFLEKWSKNLKLDLEKCKGRICIDHFEKAVIGAKKLKNGAVPTLNLGHDEQIYDNLELIQKLKLKNIQKEQKLQMCDKNKDKEQELQDNTAVADYEGEEDEEYEDEYEMEDENEDENEEEYEYEYGDENEEEEDDDDDKVSISTTLSHWSSVINKELRVTLTPMTPDDLYDLCSRSSYEREFGAMTPSGRRSVTPSTSIKSETADQKPCFRDISSDNLNQKPDNYFKEPHTECLEQKPDKLFREPRSQTPEQSRLREPRAQTPDQNLFRVPRCQTPDQKLQILKETLDTERFEINLKQEEGILSAELNEELDNNSSTALRTDKTLNSVAPICCLKYCGKEKTPEQHLTTYGFPKDAQLLQKWCENLGLQPEECIGRVCIDHFELRVIGTRRLRLGAVPTLNLGPNRSAKHSNTEEPQQKKSLAKDSSESGNVQDTDLKLLPPPPYATPKPAKHSVFRLCCLKHCRRKKVLTNIKMDEQKMEQVKTCNKLFKFPKDLNILKKWCKNLRLPEKSCLRYDLEICEKHFDAQVIQGEKLHPKAVPTLQLSYANREPVYTNNPKDFLTASWNTKNEDPNNDRNKSKFKEISTNIPQNWSNKSKSSLQKIKMEEKCFLKHCGKSRDSDELFLIPFPQHGMSLQRRWCKNLKLTSKLSQHKDLKICSDHFEPYVFNKRRLLKTGAVPTLQLGHSDEICRNFRRLRLKKVASSKKEQCCIATCQESNLKLYAFPKSSELRKIWCNNLQIEVRKALNNHYKVCGKHFSLESFIVGTDNLKLNAVPILNLGLQSENHIQVKNKSNDNDTLKCLVENCQKTPSVDRVKLYGFPTRKDILKKWLFNLNISLESLNENSLICNRHFHKFCFRNGTLHEKAIPTQFLDVSPKGWFYQNNKEFFEDPKPKCLLQCLDLGQYLYKFPKQKDELEKWIFNMKIKIEESELQKLRLCALHFEESCKIPQKDMLLAGSLPTLNLGHDYSQGIYQNNFVKCCLETCCLEGFKFHKLPEDLMLQSFWFQELEMETSYNNSLYICSVHYVSLYERVLEKFSAFLKESKEYVKLTLIYQELKVLPELKSFKCHIPNCPSGFKLIWKLFKFPKDESLFNKWLHNTGLEFEYSNRHNYRICAQHFEERCLSEIKLHRWSLPTLKLPFNNSLYVNPPEALPSNHENLKHCCVSNCVTEKEPFFKFPKQHIEVKKWIHNLKLGSQQCTLNLRVCYRHFESYCFIKEDNRIKQLKSWSVPTLRLNRRTDLYLNPPDKMAFSVCCLPSCRQILNKSNHLYLFTFPKSNTLKHKWFHNLNLNPQDYKEKMRLCARHFEIDCFDRSSKLLRKHSVPTLGLSRPQTELFKNPVRRPHLKCSVKQCKGPWPELINFPKDKSLLRKWCHNLQIDIKLELLRNWKICGKHFEMKSRNKNGLIRDLAVPTLKLGHKNKKIFKNPISQSAQHTKQCNSSLQIVETVVKEKHVKATIATEVRKKAVKAKTQHLKSLRIKNQQRILKLIAVKEAKERKRRMSTSKIRDFKMQQKQTTSKATQEIRVIEKKLEHSDVFSSSKLVEEEEQEGNNFTKIKAPECKIDHLDKTKALGKTLTVATEEEEKNFTNIKASVENQGYNKISKVDHLEKQQVLGKTVTVETKENCFEMETFPAGSILTENENILNEETYLQDYMNYIENEESTEDILLESYKTFQKLTTHGTIETHNQTNSIENNKGTYMDKHQEECIQTSQQLLQSSTEILKQEVREDCAPSKENQEESIQSMNETSQQLLLTPTEITKQESRQEYEAEWAQKDNKEESSKYLQETDKQLLQTPTEIPKQELRQDCEAEWAQKDNQEESMNSLQETDQQLLQTPTAIEESQTLKVKQESPEDDEEYHHMKQDYFETETYFLQDVFQERYANENTLTKNEELLEEDPKRNKRIPIYCYFKHCPNHLTCKEDVQLFKFPTNQNLRQVWIENFPLSLDLLPKTKKAKNKSKRVWLNRICAEHFEKQCFSETRLLYGSIPTLNLGVESKIKRNCEESLKCFAKNRCKAPHCQRSQQYDDIFTIKFPQGELKTKWCFNLNLKEEDILPTDWICNKHFEKRVLLGKTPRSHAVPTLCLGDAVKSSDLYQSPEYITVTEYAKQCYEKCCVKSCPNTIHNSDITFATFVNLKSADIYKKWLFNLDLEHSTDVRLYYRICLAHFESVCITKYCRLKVGSIPTLNLDKKDNLFEIDRNALENAGDNLSRKKTKKPLIQNTLISRWICQHPQCLKLRNTIQKWPNRGIFKTIWLQTLAKYPETKYNNIKDKVAELDVCDDHFYILYKTNKTYMESFFQDEQGDLRELFDALKTNYEDLSEHRKFISRRCVVPQCCTDQLLDQYKNITLYSFSTLPDVAAKWCYNCNIDIDVLQQKNIIYKICKLHFEPYCFTKRYMYHWSVPTLNLPDSKPEIILENDPDNKFAYSGQCCIKSCPNAKGLDRTTKSKFFRFPDNPEMLVKWLRILNCGNVDLNQIRICGLHFHHSHTYKGTALRQRAFPTERLERVINEDYVDEEPIVEMDENIEVKQELDNSEDWFEDQEDQELYNESRRPLKRKSPEYFQENQQPKYEKVRIETKEFESIEIKEELLDIQTQDNDLTTELIQYSHNNKAQTLKQHTPESTYKAGPIECSSYNLEITKPQSFTKNTLQNFNNSLNTLTTNIKEEIIEIFEQQVTDNFEYNIKSIALRRPEQQMKEPNNSEESSNLIITDVKSQILLCCVQKCLNSSDDAQMFTDFPNDSEIFIKWCFNLKIDPRNYQDKQYAVCQQHFESFCFTENSSLHTWSVPTLHLNLPENSFIHLNDPPQHLKPATEQCIVYGCINPIQPLYKFPINQDVSHKWFTNLKLDYTDFRAQNYRICKRHFSTQCLDNSNKLKIQAIPTLYLGHTDKVIYLNPWEEYHHQYSLPQPVQPDNSRGSSRQGFLVRPLISPHDLEDHDSSYFEDFEEHYGQDE

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-