Basic Information

Gene Symbol
-
Assembly
GCA_002093755.1
Location
NW:152092-165506[-]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 29 4.3 7.8e+03 -2.6 1.4 49 62 298 314 279 327 0.52
2 29 7.9e-14 1.4e-10 41.5 2.7 1 61 521 578 521 596 0.80
3 29 1.2e-13 2.1e-10 40.9 4.7 1 87 619 687 619 687 0.81
4 29 4e-13 7.2e-10 39.2 0.1 1 87 708 779 708 779 0.82
5 29 6e-16 1.1e-12 48.2 5.2 1 86 876 945 876 946 0.81
6 29 2.2e-15 3.9e-12 46.5 3.6 1 86 970 1041 970 1042 0.82
7 29 3.5e-11 6.2e-08 33.0 1.4 1 87 1076 1143 1076 1143 0.77
8 29 1.3e-10 2.3e-07 31.2 1.4 1 86 1185 1253 1185 1254 0.75
9 29 5e-17 8.9e-14 51.7 0.4 1 86 1281 1350 1281 1351 0.82
10 29 1.3e-12 2.3e-09 37.6 1.4 1 85 1372 1440 1372 1442 0.79
11 29 2e-14 3.5e-11 43.4 0.4 1 86 1469 1540 1469 1541 0.85
12 29 2.8e-10 5.1e-07 30.0 4.2 1 85 1618 1685 1618 1687 0.77
13 29 1.3e-07 0.00023 21.6 0.1 1 86 1710 1775 1710 1776 0.72
14 29 2.5e-13 4.5e-10 39.8 2.2 1 87 1906 1975 1906 1975 0.79
15 29 2.3e-07 0.00042 20.7 0.3 1 87 2073 2143 2073 2143 0.70
16 29 0.023 41 4.7 0.7 5 61 2161 2209 2157 2228 0.68
17 29 1.1e-12 2e-09 37.8 0.0 1 87 2235 2305 2235 2305 0.82
18 29 2.9e-13 5.1e-10 39.7 0.6 1 87 2357 2426 2357 2426 0.82
19 29 2.8e-10 4.9e-07 30.1 0.0 1 86 2461 2533 2461 2534 0.76
20 29 1.5e-12 2.6e-09 37.4 0.0 1 86 2544 2616 2544 2617 0.79
21 29 6.4e-12 1.2e-08 35.3 0.0 1 86 2639 2711 2639 2712 0.79
22 29 0.088 1.6e+02 2.8 2.1 1 57 2741 2789 2741 2825 0.74
23 29 9.8e-12 1.8e-08 34.7 0.6 1 86 2828 2898 2828 2899 0.79
24 29 5.4e-16 9.7e-13 48.4 0.3 1 86 3009 3080 3009 3081 0.80
25 29 2.8e-12 5e-09 36.5 4.6 1 86 3142 3211 3142 3212 0.80
26 29 1.6e-13 2.8e-10 40.5 4.6 1 86 3303 3372 3303 3373 0.83
27 29 1.5e-11 2.6e-08 34.2 0.3 1 86 3452 3520 3452 3521 0.85
28 29 3e-10 5.4e-07 30.0 0.5 1 58 3546 3594 3546 3602 0.83
29 29 9.7e-11 1.7e-07 31.6 1.6 17 87 3611 3669 3601 3669 0.77

Sequence Information

Coding Sequence
ATGACAcagcagcagcagcatcaacagcagcagcagcacggcAGTAGTTGGTACTCACATGTTGCTTCCTACCCAGCAGCCTCGGCCTCCCACTCCCACCCCCTCTCTCACTCGGCCTTTGGCCCTGCGCCCCCTTGCAAGGccaacatcagcaacaacaacgcaAACAACATTATGGGCGCCTATGGAGGAGGAGTGGGAGGGAGTGGAGGCTCGCATGGATATTTCGGCGCCGCTGGCGGTGGCCTCAATGTCAGCGGGCGGGGTGGTGCTGGGTCGTCCTACGGCCTTGGGGCCAACACGGTGGCATATGCTCACAACCAACTGCTACAGTaccagcatcatcatcagcaaaaccagcagcagcagcagcacataGGCCTGGGCCAGCGATCCTATATGGGCCACGATGTCATGGCCGGGAGCTATCCCTATATCAAGAGCGAACCCTTAGAGGTTCAGCAGCCGCCGAATCCAATGGCCCCACCTCCGGCACCAGAAATGATGAAAAATGTGAGTATTAGTACGGAGCCCATTGACGACCTGGCCTACAAGTCAAACTACATAGACGACAATACGCCATTTGCGGACTTCAGCATTAGCGAATTCAGCGAGGACATGCGTCCCAAAGTCGAGCTGACTGTCAAGGATGAGTCCTTCGTCAGGAACCCCAATAGCTTTTTACGCCGTAAGCAACAATCGATCACGACAGCAGAGAGCCTGCCCGTCTGCCAGCGATGCAAGGAGGTGTTCTTCAAAAAGCAGACTTACCTGCGGCACGTCGCGGAGAGCAACTGCGGCATCCAGGAGTATGTTAAATGCACCATATGCCCTATGTCCTTCATGACCGCCGAGGAGTTACAACAGCACAAACAACAGCATCGAGCGGACAGGTTCTTCTGTCACAAGTGCGGAAAGCATTTTGCCACGATCGCTGAGTGCGAGTCGCACGAGTACATGCAGCGATACGAAAACATTGTGTGCAACATGTGCTCGGGATCTTTCGCCACGCGGGAACAACTGTATGCCCACTTGCCACAGCACAAGTTCCAGCAGCGCTTTGACTGCCCCGTATGTCGGCTGTGGTACCAAACGGCTATGGAACTGCACGAGCATCGCCTAGCTGCACCTTACTTCTGCGGTAAATACTACACAAGTGGACAATCCCGGTCCGTCCTCCAACAGCAACCGCACGAGAGCCAGACGAACTACAAGCTGCAGGACTGTCATATGGCACCATGGAAAGTAATGCCAAGCGCACCGCTTCTTAGACGAACTCATCCAACTCGCCTGCCTTGCCTGCGACAGCAGCGCTTATTCACTGCTGCAGCAGCGCAGGCCAATGCCGATGGAGCGGCTATTTTTGCGGCATCTTCGCTGAAGAACGAGGTTGCTGTAAAACTGGAGCGTAGCTACAGTAACTCGACCAACGAATCTTCTTATAGCGTCCAGGAGAGTGGCTACAATAATGTGTATGGCAGCAGCGACAGCTCAGTGCACGGTGCGATCGCCGGACCACAGGCACACTCTTCAACGCTGGACGACTCCGAGGATGCGCTGTGCTGTGTGCCGCTGTGCGGTGTGCGGAAGAGCACGAGTCCCACCTTGCAGTTTTTCACTTTCCCGAAGGACGAAAAGTATCTCAACCAGTGGTTGCATAACCTCAAGATGTTCCACATACCCGCTTCCAGCTACGTTAGCTTCCGGATCTGCAGCATGCACTTCCCCAAGCGATGCATCAACCGCTACTCGCTGTGCTACTGGGCGGTCACGTTTAACCTCGGCCACGATGACGTAGCCAATCTCTACCAGAACCGGGAGCTGACCAACATTACCACTGGCGAAGTGGCGCGCTGCAGCATGCCACACTGTACCAGCCAGCGGGGTGAGAGCAACCTCAAGTTTTACAACTTCCCAAAGGACATCAGCCTGATTAAGTGGTGCCAAAACGCTCGACTCCCGGTGCAGGCAAAGGAACCGCGACATTTCTGCAGCCGTCACTTCGAGGAGCGGTGCATTGGCAAGTTCCGTCTAAAGCCTTGGGCGGTGCCCACCTTGCACTTGGGCGCCCAGTACGGCAAGATCCACGACAATCCGAAGAATCTATACGTGGGAAAACGCTGTTGCCTCAACTTTTGCCGTCGGAGCCGTTCTTCTGATTTCAATATGTCGCTATATCGATTTCCTAGAGACGAAGTCCTGCTACGTCGCTGGTGCTACAATCTTCGCCTCGATCCAGGAGTGTATCGTGGAAGATCACAAATATGCAGCGCCCACTTTATAAAGGAGGCGTTGGGTCTCGTAACTATCTCtcAAGGTGCCGTGCCCACACTTCATCTGGGCCACAATGATACCTTCAATATCTACGAGAACGAACTGTGGCCCGCCAACACCGACACCTCCTCTTGTCATctccaacagcaacagcagtcaTCTATGCATTCGCTACAACAGCAGATGCATAGCAAGTCCTATCAGCGCCGTTCGGTGGCATCCACTTCGTCATCGGCTAGCTCCGTAGCTTCGCATTATGTGGACCCGGAGATGAGCGCCTCTTAccatctagccatgtccgccTCCGCCGGAGGCTCTTCGGCCATAAACGCTAGTGACAGCATGGACGTCTGTTGTGTGCCCAGTTGCGAGAGCAAGCGACACAATAGCGAGAACATTACATTCCACACGATTCCACGACGGCCCGAGCAGATGCGTAAATGGTGTCACAATCTTAAAATTCCCGAGGAGAAGATGCATAAGGGCATGCGAATCTGTAGCCTTCACTTCGAGCCCTACTGCATCGGTGGCTGTATGCGACCGTTTGCTGTGCCAACCCTTCAGTTGGGTCACGACGACGAGGACATTCACCGCAATCCGGACGTGATCAAGAAGCTGAACATCCGGGAGACATGCTGTGTGGCTGTATGCAAACGGAATAGGGACAGGGACCATGCCAACCTGCATCGTTTCCCTAGCAACGTGGCGTTACTGAAGAAGTGGTGCGCCAATTTGCAACGCAGCGTGCCCGATGGCAGTAAGCTTTTCAATGATGCCATCTGTGAGGTGCACTTTGAGGATCGCTGCCTGCGCAACAAGAGGCTTGAGAAGTGGGCAGTACCTACTCTGATCCTGGGGCATGAGGACATCGCCTATCCGCTGCCCACCCCAGAGCAGTCACTGAGTTCTATGCCCGGCCACGGCTCCTAACAATGGCGAGGAACAGGCGAGTGCTGTGTGGAGACCTGCAAGAGAAATCCTAGCGTGGACGATATCAAACTATACCGCCCGCCGGAGGAGGCCACCGTGCTAGCCAAGTGGGCGCACAACCCAACGGACGCCAACCAACTGACAAGCATGAGGATCTGCAACCTTCACTTCGAGGCGCATTGCATCGGCAAGAGGATGCGGCAGTGGGCCATACCGACTTTGAACCTAGCCGGCAACATTGAGAATCTTTACGAGAACCCAGAGCAATCGTTGCTGTACAGGCGACGCACGACTCACTTGAAGACGAAGCTGCCGCCAGCCTCCGTGAAGCCCACTTGGGTGCCCAGATGCTGTCTTCCACACTGTCGCAAAGTCAGAGCCCTGCACAATGTCCAGCTCTATCGCTTCCCCAAGCTCAATCGCTCCACATTGGCCAAGTGGGCGCATAATCTCCAGGTCCCAATGGTGGGCAGTGCCCAGCGCAGGCTATGTTCGGCCCATTTCGAGCCGCATGTGCTTAAAAAGTGTCCGGTGCCGCTGGCGGTGCCTACGCTCGACCTAAATTCACCACCCGGCTTAAAAATCTATCAGAATCCGGCCAAGCTAAAGGCCAGCAAACTGTGCCTGCAACGAGTTTGCATCGTCGAGAGCTGCCGCAAGACGCGGGCGCAGGGCGTGCAGCTTTTCCGGCTGCCGCATAGTCCTACACAGCTGCGAAAATGGATGCACAACATAAGGACGCGGCCACGAGCAGCTATGCGGGCTCAGTACCGAGTTTGTTCTCGCCACTTTGAGACGCACTCATTCAATGGCCGAAGATTAAGTGCAGGTGCCATACCGACTCTGGAACTGGGCCACGATGGCGACGATATCTATCCCAATGAAGCGCAGGCATTTGTGGACGAGCATTGTGCTGTGGAGGGCTGCGAGGCATCTAAGGAGCAACCGGAGGTGCGTCTTTTCCGCTTCCCCACCGACGACGACGATATGTTGTGGAAGTGGTGCAACAACCTCAAAATGAATCCGGTGGATTGCATTGGGGTACGCATCTGCAACAAGCACTTCGAGGCCGATTGCATCGGTCCTAAGCACCTGTACAAGTGGGCCATTCCTACGCAGGAGTTGGGCCACGATGACGCGCAGATCGAGCTGATCCCGAATCCCAAACCAGAGGAGAGGTATGTGGATCCCGTCTTCAAGTGCATCGTACCCACCTGCGGCAAGACTCGACGGTTTGACGAGGTGCAAATGAACAGCTTTCCCAAGGACCCGGATCTATTCGAGCGATGGCGGCACAACCTGCGCATGGATCATCTCAGTTTCCAGGAGCGTGACCGCTACAAGATCTGCAACGCTCACTTTGAGGAGATCTGTATTGGAAAAACACGGCTCAACATCGGTTCCGTTCCAACCTTGGAACTTGGTCatgacgatgaggatgatATTTTCCAAGTGAACCCAGCGGAGCTTCAAAGCAATTTATTCGGGCGACAGCGTCGACTGCAGCTCGAGGGATCCGGCGAACTGAGGGTCAAGCAAGAGCTATCCGAGACAGAGGACAACAACAAAGCGGATGTGATGGTCACAGGCTCCAATTCCAAGCAGGTGAGACTTTTTGGGCTTAAAAAGATCTACTTCGGATCTAAATGTTGTGTGCACAGTTGTGGAAGAAGTCGCTTGGAGCACGGTGCACGTCTGTTTCCTTTTCACGGCAAGCAGCAGCACCTAAAGTGGCGTCATAATCTGCACTTGGAACCAGAGGAGGTGGACCGATCGACGCGCGTTTGCAGCGCTCACTTTAATCGACGTTGCATCGATGGCAAACAACTGAGGAGCTGGGCGATGCCCACCCAACAACTGGGACACAACGACCAACCGATCTACGAGAACCCGAAGAACATACCCGGTTTTTTCACACCTACCTGTGCCCTGGGCCACTGTCGCAAGCGAAGAAGTATTGACAACGATCTGCGTACCTATCGTATCCTGAGCGAAGATCTTCTGGAAAAATGGCGGCTAACTACGTTGGCTCCGGATCAGTGTCGTGGTCGGATCTGTGCGAATCACTTCGAACCGCAGGTGCGGGAACTTAAGTTGAAGACGGGAGCGGTTCCCACACTACAACTGGGACACGACGAGGGATTGATCTACGACAATGAAGCTATAAGCGGCATGGCCGAAGAAGAGGAGAGCATAACCACAGAGTTCCCGAGACTGAAACCAAAAAAAGAGGTGCTCGgagaggaggaggagggtgaAGGAATTGATGGCGAGCAGGAGCACCCAGATGACCTGGACGACAATGCCGATGAAGAAGACAGAGATGAACATTATTTTGATCCTCTTGAACTAGTAGAGACTTTTGCTGAACATCGTAGCGATGACGAAGCCCAGGACTATGAGGATGAAGAAGACGAGGATAGAACAGCTTCCAAGCTCCCACCTTCCCCTCTTCCCCGGCGTTACCAAGCTCCGCGTCGAGACAAGCCGGCTAACAACGTAACGCCCATTTGCTGCCTGAAGCACTGCAGGAAGGAACGTACTGCCTTCCACCTCCTGAGCACTTTCGGATTCCCAAAGGATCGTCAGTTGCTGCTAAAGTGGTGTGCCAATTTGCATCTAAACCCGGACGACTGTATCGGTAGGGTTTGCATTGAGCACTTCCAGTCGGAGGTACTCGGCACCCGTAAGCTTAAGCAGAATGCGGTGCCCACCCTTAATGTGGGACATGAGGAACCGCTTAGATACTCGTGTCATGGCGTGGATCAGAATCTCGAGGAGCGGGACCCCCAGCCACAGCATTCGGTTTTTCGGCTTTGGAGCCTGAAACACTGCCGAAAAGGAAGCTAACGGAGCCGCCGGATATTCCCTAGCCAAGAGGAGAGCGCGGAGATGCCTATGTTGAAACGAGAGTGGGAGATGGAGATGCCGATGCCAATGCCGATTCAGATGCAGATGGAGCGGAAGAATGAGGCAAAGAATATAACTCAAACTAAAAATGAATCATATATATGCTGTATCAGCAGTTGCGCAAACCAGGAAATTAGCCAACTGCTGGCTTTTCCCGAAGAGATCCTGCTAAGAAAGTGGAGACATAATTAAGGCTGTCTGAGACTGACTACTTCTCTAAGCCTGAAAAAAGTTTGTGTGGCGCACTTCGATTCGCAGCTGTTGGAGAATGGAAAACTCAGAAGGGCAGAGGCAAAAGCTGTGCCCACTGTGAACGTGGGCCACAGCAGCTGGAACCTTTACAGAAGCAATGGATCTGTCTGTGCCAAATGCAGCCACACAACCTTCGGACGCGTAAGCTTCATCGACCTGCCGGATAACAGCATAATTAGGGACGCTTGCTTCTCCTGCCTCAATCTACCTGAATCCTGCGAGGAGCAGGCGAGACTATGTAGTATCCACTTTATGGAGGCATACAAGAAGTTAATCTTCCTAATGTTCTGCACCCCCAAGTATAGAGGCTCTACAAAGTGCTTGGACGAAATGCAATGTGCGGTACCCGGCTGTTGTTATAAGGATGCTGATCCGAATATTTACCTAATCCAGCTTCCCGGTAACAAGGAGATGCTGTCGAAGTGGCTGCACAACACCAAGGTCCCCTACGATCCTTCTAGGCGCCAAAGCTATCGTATCTGCACGAACCACTTTGAATTAGATTACTTGGATACGAGGCCCGCTAAAGGGGCTATACCGACGCTCCACCTAAACCATGAAGATGAGATTCACTTAAATACTAGGCCCTTGCCAGAGGAGCAGGGCTCTATATTAACTCCACTGCGGATAAAGACGGATGTGGCCTTCCTGGGCAGTCCCTGTGCAAGTGCAAGCCCCAGTCCACGGGGCAGAATCCGGATATGCTGCATTCCCACATGTGGACAGATTGGAAGCAGTCAAGTGAGGCTGTTTCGTTTTCCCACCGAGGAGCAGGCGTTGCTCCGGTGGCTGGTGAACACACAACAGCAGCCAAGACTGGTAGATCAGGACTTGTACGTGTGCCAGTCGCATTTTGAGCCGGAGGCCATTTGCAAGAAGCAACTTCGAAGCTGGGCTGAGCCCACACTAAACCTGGGACACGACGGTTATGTAATTCCAAATGCCAAACATAATGGAAATATTTCGGACAGCCAAGATACTGAGGAAGCAATGAGATTTATTCGGGAACGCTTCTGCTCAGTCACTTGCTTTCAGGCAGGAGGACAGGATGAGGAGGGAGTGAGGCTATTTGACTATCCCGAGGATATGGCGGCTACTCGAAAATGGGCAGCCGCATGCAGACATCGCTCTATGCAGGCCAGGAGCCATGGCTTCAAGGTGTGCCAGTTGCACTTCGCTAAGGTCTTTGACCACAATACTGGAGACTTGATTGAGGGTGCTGTGCCCACTTTGGAGTTGAGCAGAGATGAAATGGAGAGGCAATGTCTTGTGGCTGGATGTGTAAAAAATGATGCCAATGGAACTCGCCTCCGCTACTTTAAGATACCAAAAGTTGCTGCCCAATTGGAAGCGTGGAGCAATAACCTTAAAATCCCTCCATCGGATCTCATGCAAGGGCAGCAGTACATCTGCGAGAAACACTTTGAGGCGTTCTGCTTCGGGGCCAATAAAGGACTGCGTTCTGGTGCTCTTCCAACCCTCCTGCTAGGCCATGATGAGGAGGTGGATATGCTTCCAAATCCGGAAAGCCTTTTATGCCAGGATACGTGCTGCGTACCGGGATGCGGGCGTATCTGGCAGACTGGGGATCGTAAGTTTCGCGGATTTCCCAAATTATTGGCCATGGCCAAAAAATGGAGGCATAACCTTCGTCTGGTAGAGCCCGTGGAGCAACTAGCCAAGCTGAAGGTCTGCAGTACTCACTTTGAAGCAACCTCACCCAACCTGGGCTCAAATGGATTAGTTAGTACCTCGATACCAACCCTGGAATTGGGTCACTCTTCTCCGGATATTTACCCAGCGGATATCAGCTTTAAGTTCCAAAAGCGGTCCGGAATGCCGgcgaaaatttattgttgttatcCCAAGTGCGAGGAAACCTGTTTGCCCAAGAATCTCTCTTACGGGCTTCCCCAGGAGAGCTCTAAGGAAAGCTGGCTAAACCATATGGACATAGAAGATCCGAAAGACGAAGGAGTCGCTCTGCTTTGCCCGCTGCACTGTATTCTATATCAGCACAGTGCCACACACCATCCGGAGCATCATTTCAAGCAAATGTTTCTTGAAGACAACTACAATGAAGCGCGGAACAACAGGCGTTTGAAGATTGTGAGCTGTACGGTCAAGGGCTGTGACATGGTTAAGCCCCGGGATGGTGTAGCACTGCACGGGATGCCGCAAAACAAAGACATTCTGCAAATGTGGATAGACAATGGCCAGATTGAGTTCTTAGAGCAACAGCTAATGTTCAAGGTGTGTCACAATCACTTTGAGCCATGCTGCTTCTTCGACGACAGACGTTTGCACTCATGGAGTGTGCCCACTTTACGCTTACCTGGCGTAATTCATCAAAATCCTACCGCCGAGCAGTGGCAGAGCATGATTAAGAAGCAAGAAGGCAAAACATACGCAGAAGAGAGCGATGAGCCAGAGCCATTTGAGGACGTGGTTAAAACCGAGCCCATTTTGAAGATGGAGCATATCGAATCGGAATATGAAGATGAAAACCCTGAGATGCAAGCCCTAGAGGTCCTCCTAGAAGTTGGCCATGTCGAGCGAATGGAGAGTTatgaaaaaattgataaatcaCCTTTGATCTACAACGATAATGCACCTTTTCGATCTTCACCCATACGTTGCCAATACAATGCTAATCACTGTGCCGTAGAAGGATGCGAGGTGACTGTCGAGGGTGGACGGACGATTAAACTGCACAAATTCCCCGCGTCGCAGGAAGCCGCACAGAAGTGGATGCACAACACCCAAGTTGACATGGACGAAAAGTTCTGGTGGCGCTACCGCATATGCAGTTACCACTTTGATCAAGAGTGCTTTCAGAGTGCTAGAATTCGAAAAGGCGCGATGCCCACGCTTCTGCTAGGACCTCGGCGACCGGAAACGGTGTACGATAATGAATTTGCACTACCAGATGGGGAAGAACCTTTTCTAGAGCCACCGGAGACTCAGCTTGAGGATACTATGACTGTGGCGTCCAGACTTCGAAAGGAGGTCAATTTATGCCTGCCCCCACGGGCGCCGCCTCGAAAGTCAAGCATTTGCCAGATTGATTCTTGCACGAACCATCACACTGAGAACATGACTCTTCACAAGTTTCCCCATTCGGAGGACATGTGCCTCAAATGGCAGCACAACACGCAAGTCCCATTCGACCCATACTACCGTTGGCGTTACCGCATTTGCAGTGCGCATTTTCATCCGGTCTGTTTGGTCAATATGCGTCTGGTCCATGGAAGCGTTCCCACTTTAAAGCTGGGACCCAAGGCTCCTTCCGAGCTGTTTGACAACGATTTCGAAGCCATTAACCTAAGGTTGGATAAAAGGTTGACAGAGTCCAATGCTAATGTGTACATTAAGCATGAAAGGGAAGAGGATGAGGATTCTATGGTGTTCCTGGAGCCGGAACTCCAGTTACACGATGATCAAGACGATAAGATATCAAGCTGGAACAGCAAACTACAATTACAACCTGTGAAGCAAGAGAAAATTGTTTACAGCCAGATCAAGTCTGGCTATGATAAGTGTTCCCTGGCTCACTGCCAGCGCCAGTCCCAGCATGGCGTCCACATTTATAAGTTTCCCAGATCGAGGCGGCAACAGGAACGTTGGATGCACAATCTACATATTCGCTATGATGAGCGGACACCGTGGAAATTCATGATCTGCAGCGTTCATTTTGAGCCGCACTGTGTCAGCCTCAGGAAGCTGCGACCTTGGGCTGTGCCCACACTGGAACTTGGTGACAATGCACCAGAAACAATCTTTTCGAACGAACAGTGCGAGAAGGAGCTGCTGACCGATCGCAGTGATCTGGATAGCGATGCGGAGGACGAAGACGGCTTACAGgaggacgacgatgatgaAGACGAAGACGATGTGAAGCCCGATGTAATTGGCATAAAAAGGAGGAGACGTTTTAAGATAGATTCCCTGCCCTTAAGCCAGATTGCACCCTGGAAAGTTAAACAATGCTGCCTCCCCTATTGTCGTGCCTTTCGAGGGGATGGCAAACTGTTTCGGCTTCCGAACAACCGAAACTCCATTAGCAACTGGGAACGGGCCACCGGCATGGTATTCAAAGAGTCGCAAAGGAACACTCGACTAATCTGCAGCCGTCACTTCGAGCCAGAGTTGATTGGAGTCAGGCGTCTAATGCGTAACGCCATTCCCACGAAGCACTTGAGCCCTCAAGCAGTTAACCAGATCCGTACTAAAAAGGAGAAGAATCCTCCTGCCACTATTATACCCATCTGCTGCATGGCGGACTGTCACTACAATGGAAATGTGAAGCTGCACAAGTTTCCAAGTGATCCTACTCTTCTCAGACAGTGGTGCCAGGCTCTCCGGCTCACTGATACGCAACGGTATTTGGGCAAGCACATTTGCTCCATGCACCTGCCAATGAACAAGACGTTGAGTTGTGTCATCTGCGGTGGGGACAATGTAGAATTGCCGATGTTAAGGTTTCCGGAAAACCGCAACCAGCGCGCCAAATGGTGTTATAATCTCAAAATTGATACAATACCAAAGTGGGACCACTCCAAGCACATTTGCTGCCGGCACTTTGAGTCCCATTGCTACAAGCCGGGTGAGCTTCGTCCAGGAGCGGCTCCCACGCTCCATCTCAATCACGATGACACAAACATATTCTTCAGCGACTATGCCACTGGTCTTCCGTCCTCGCCACTAGGCAATCGAATCAAAGACGAGCCCCTGGAATCTGAGTCCGACGAGATGCTGCTGGTGTAG
Protein Sequence
MTQQQQHQQQQQHGSSWYSHVASYPAASASHSHPLSHSAFGPAPPCKANISNNNANNIMGAYGGGVGGSGGSHGYFGAAGGGLNVSGRGGAGSSYGLGANTVAYAHNQLLQYQHHHQQNQQQQQHIGLGQRSYMGHDVMAGSYPYIKSEPLEVQQPPNPMAPPPAPEMMKNVSISTEPIDDLAYKSNYIDDNTPFADFSISEFSEDMRPKVELTVKDESFVRNPNSFLRRKQQSITTAESLPVCQRCKEVFFKKQTYLRHVAESNCGIQEYVKCTICPMSFMTAEELQQHKQQHRADRFFCHKCGKHFATIAECESHEYMQRYENIVCNMCSGSFATREQLYAHLPQHKFQQRFDCPVCRLWYQTAMELHEHRLAAPYFCGKYYTSGQSRSVLQQQPHESQTNYKLQDCHMAPWKVMPSAPLLRRTHPTRLPCLRQQRLFTAAAAQANADGAAIFAASSLKNEVAVKLERSYSNSTNESSYSVQESGYNNVYGSSDSSVHGAIAGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLNQWLHNLKMFHIPASSYVSFRICSMHFPKRCINRYSLCYWAVTFNLGHDDVANLYQNRELTNITTGEVARCSMPHCTSQRGESNLKFYNFPKDISLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVGKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPGVYRGRSQICSAHFIKEALGLVTISQGAVPTLHLGHNDTFNIYENELWPANTDTSSCHLQQQQQSSMHSLQQQMHSKSYQRRSVASTSSSASSVASHYVDPEMSASYHLAMSASAGGSSAINASDSMDVCCVPSCESKRHNSENITFHTIPRRPEQMRKWCHNLKIPEEKMHKGMRICSLHFEPYCIGGCMRPFAVPTLQLGHDDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVALLKKWCANLQRSVPDGSKLFNDAICEVHFEDRCLRNKRLEKWAVPTLILGHEDIAYPLPTPEQSLSSMPGHGS*QWRGTGECCVETCKRNPSVDDIKLYRPPEEATVLAKWAHNPTDANQLTSMRICNLHFEAHCIGKRMRQWAIPTLNLAGNIENLYENPEQSLLYRRRTTHLKTKLPPASVKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPMVGSAQRRLCSAHFEPHVLKKCPVPLAVPTLDLNSPPGLKIYQNPAKLKASKLCLQRVCIVESCRKTRAQGVQLFRLPHSPTQLRKWMHNIRTRPRAAMRAQYRVCSRHFETHSFNGRRLSAGAIPTLELGHDGDDIYPNEAQAFVDEHCAVEGCEASKEQPEVRLFRFPTDDDDMLWKWCNNLKMNPVDCIGVRICNKHFEADCIGPKHLYKWAIPTQELGHDDAQIELIPNPKPEERYVDPVFKCIVPTCGKTRRFDEVQMNSFPKDPDLFERWRHNLRMDHLSFQERDRYKICNAHFEEICIGKTRLNIGSVPTLELGHDDEDDIFQVNPAELQSNLFGRQRRLQLEGSGELRVKQELSETEDNNKADVMVTGSNSKQVRLFGLKKIYFGSKCCVHSCGRSRLEHGARLFPFHGKQQHLKWRHNLHLEPEEVDRSTRVCSAHFNRRCIDGKQLRSWAMPTQQLGHNDQPIYENPKNIPGFFTPTCALGHCRKRRSIDNDLRTYRILSEDLLEKWRLTTLAPDQCRGRICANHFEPQVRELKLKTGAVPTLQLGHDEGLIYDNEAISGMAEEEESITTEFPRLKPKKEVLGEEEEGEGIDGEQEHPDDLDDNADEEDRDEHYFDPLELVETFAEHRSDDEAQDYEDEEDEDRTASKLPPSPLPRRYQAPRRDKPANNVTPICCLKHCRKERTAFHLLSTFGFPKDRQLLLKWCANLHLNPDDCIGRVCIEHFQSEVLGTRKLKQNAVPTLNVGHEEPLRYSCHGVDQNLEERDPQPQHSVFRLWSLKHCRKGS*RSRRIFPSQEESAEMPMLKREWEMEMPMPMPIQMQMERKNEAKNITQTKNESYICCISSCANQEISQLLAFPEEILLRKWRHN*GCLRLTTSLSLKKVCVAHFDSQLLENGKLRRAEAKAVPTVNVGHSSWNLYRSNGSVCAKCSHTTFGRVSFIDLPDNSIIRDACFSCLNLPESCEEQARLCSIHFMEAYKKLIFLMFCTPKYRGSTKCLDEMQCAVPGCCYKDADPNIYLIQLPGNKEMLSKWLHNTKVPYDPSRRQSYRICTNHFELDYLDTRPAKGAIPTLHLNHEDEIHLNTRPLPEEQGSILTPLRIKTDVAFLGSPCASASPSPRGRIRICCIPTCGQIGSSQVRLFRFPTEEQALLRWLVNTQQQPRLVDQDLYVCQSHFEPEAICKKQLRSWAEPTLNLGHDGYVIPNAKHNGNISDSQDTEEAMRFIRERFCSVTCFQAGGQDEEGVRLFDYPEDMAATRKWAAACRHRSMQARSHGFKVCQLHFAKVFDHNTGDLIEGAVPTLELSRDEMERQCLVAGCVKNDANGTRLRYFKIPKVAAQLEAWSNNLKIPPSDLMQGQQYICEKHFEAFCFGANKGLRSGALPTLLLGHDEEVDMLPNPESLLCQDTCCVPGCGRIWQTGDRKFRGFPKLLAMAKKWRHNLRLVEPVEQLAKLKVCSTHFEATSPNLGSNGLVSTSIPTLELGHSSPDIYPADISFKFQKRSGMPAKIYCCYPKCEETCLPKNLSYGLPQESSKESWLNHMDIEDPKDEGVALLCPLHCILYQHSATHHPEHHFKQMFLEDNYNEARNNRRLKIVSCTVKGCDMVKPRDGVALHGMPQNKDILQMWIDNGQIEFLEQQLMFKVCHNHFEPCCFFDDRRLHSWSVPTLRLPGVIHQNPTAEQWQSMIKKQEGKTYAEESDEPEPFEDVVKTEPILKMEHIESEYEDENPEMQALEVLLEVGHVERMESYEKIDKSPLIYNDNAPFRSSPIRCQYNANHCAVEGCEVTVEGGRTIKLHKFPASQEAAQKWMHNTQVDMDEKFWWRYRICSYHFDQECFQSARIRKGAMPTLLLGPRRPETVYDNEFALPDGEEPFLEPPETQLEDTMTVASRLRKEVNLCLPPRAPPRKSSICQIDSCTNHHTENMTLHKFPHSEDMCLKWQHNTQVPFDPYYRWRYRICSAHFHPVCLVNMRLVHGSVPTLKLGPKAPSELFDNDFEAINLRLDKRLTESNANVYIKHEREEDEDSMVFLEPELQLHDDQDDKISSWNSKLQLQPVKQEKIVYSQIKSGYDKCSLAHCQRQSQHGVHIYKFPRSRRQQERWMHNLHIRYDERTPWKFMICSVHFEPHCVSLRKLRPWAVPTLELGDNAPETIFSNEQCEKELLTDRSDLDSDAEDEDGLQEDDDDEDEDDVKPDVIGIKRRRRFKIDSLPLSQIAPWKVKQCCLPYCRAFRGDGKLFRLPNNRNSISNWERATGMVFKESQRNTRLICSRHFEPELIGVRRLMRNAIPTKHLSPQAVNQIRTKKEKNPPATIIPICCMADCHYNGNVKLHKFPSDPTLLRQWCQALRLTDTQRYLGKHICSMHLPMNKTLSCVICGGDNVELPMLRFPENRNQRAKWCYNLKIDTIPKWDHSKHICCRHFESHCYKPGELRPGAAPTLHLNHDDTNIFFSDYATGLPSSPLGNRIKDEPLESESDEMLLV

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00525910;
90% Identity
iTF_00594581;
80% Identity
-