Basic Information

Gene Symbol
-
Assembly
GCA_000818775.1
Location
Scaffold656:50199-75883[-]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
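The C2CH consensus quoted above can be written as a regular expression for a quick first-pass screen of a protein sequence. This is a minimal sketch, not the method used to produce this record (the hits below come from an HMM search). The pattern is a literal reading of the consensus spacing, and the names `C2CH` and `find_c2ch` are hypothetical.

```python
import re

# Literal reading of the consensus in the description:
# Cys, 2-4 residues, Cys, 35-50 residues, Cys, 2 residues, His.
C2CH = re.compile(r"C.{2,4}C.{35,50}C.{2}H")


def find_c2ch(protein):
    """Return (start, end, text) for each non-overlapping match.

    Coordinates are 1-based and inclusive, as in the hmmscan table.
    """
    return [(m.start() + 1, m.end(), m.group()) for m in C2CH.finditer(protein)]


# Synthetic example (not taken from this record): 2-residue and 40-residue spacers.
toy = "MA" + "C" + "AA" + "C" + "A" * 40 + "C" + "AA" + "H" + "GG"
print(find_c2ch(toy)[0][:2])
```

A regular expression only checks the spacing of the four zinc-coordinating residues. It ignores the AVPTIF box and the other conserved positions, so it will both miss divergent domains and report spurious matches that a profile HMM would reject.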
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 34 4.4e-16 7.8e-13 49.1 2.4 1 86 612 684 612 685 0.85
2 34 3.1e-15 5.6e-12 46.4 4.5 1 87 712 781 712 781 0.82
3 34 4.9e-17 8.9e-14 52.2 0.6 1 87 803 875 803 875 0.85
4 34 8.4e-15 1.5e-11 45.0 4.5 1 86 951 1019 951 1020 0.78
5 34 6.7e-16 1.2e-12 48.5 7.0 1 86 1044 1115 1044 1116 0.80
6 34 5e-12 9.1e-09 36.1 1.0 1 87 1151 1219 1151 1219 0.81
7 34 5.1e-11 9.2e-08 32.9 3.8 1 85 1259 1326 1259 1328 0.74
8 34 1.1e-15 1.9e-12 47.9 1.2 1 87 1355 1425 1354 1425 0.81
9 34 2.1e-13 3.7e-10 40.6 0.7 1 87 1448 1518 1448 1518 0.80
10 34 9.4e-13 1.7e-09 38.4 2.7 1 87 1546 1618 1546 1618 0.86
11 34 0.03 53 4.8 0.2 1 72 1683 1748 1683 1758 0.62
12 34 2.1e-11 3.9e-08 34.1 1.4 1 86 1779 1850 1773 1851 0.76
13 34 7.7e-15 1.4e-11 45.1 0.1 1 86 1881 1951 1881 1952 0.80
14 34 3.5e-15 6.2e-12 46.2 2.5 1 87 1995 2066 1995 2066 0.81
15 34 2.3e-12 4.1e-09 37.2 4.6 1 87 2088 2157 2088 2157 0.81
16 34 9.4e-16 1.7e-12 48.1 0.4 1 87 2284 2353 2284 2353 0.79
17 34 1.7e-13 3.1e-10 40.8 1.6 1 87 2412 2489 2412 2489 0.81
18 34 3.9e-09 7e-06 26.9 1.2 3 87 2511 2582 2509 2582 0.73
19 34 1.6e-12 2.8e-09 37.7 1.1 1 86 2608 2676 2608 2677 0.81
20 34 6e-12 1.1e-08 35.9 0.4 1 86 2700 2769 2700 2770 0.80
21 34 1.2e-16 2.2e-13 50.9 0.6 1 87 2791 2869 2791 2869 0.84
22 34 3.4e-09 6.1e-06 27.1 0.3 1 73 2887 2950 2887 2962 0.73
23 34 6.9e-11 1.2e-07 32.5 3.9 1 86 2978 3047 2978 3048 0.83
24 34 2.4e-13 4.4e-10 40.3 5.8 1 86 3073 3144 3073 3145 0.83
25 34 3.3e-14 5.9e-11 43.1 2.5 1 87 3165 3238 3165 3238 0.78
26 34 2.4e-11 4.3e-08 33.9 0.9 1 86 3263 3336 3263 3337 0.82
27 34 9.6e-13 1.7e-09 38.4 1.1 1 87 3476 3549 3476 3549 0.79
28 34 2.7e-11 4.8e-08 33.8 1.4 1 86 3572 3641 3572 3642 0.75
29 34 4.5e-14 8.1e-11 42.7 1.5 1 86 3674 3745 3674 3746 0.82
30 34 1.9e-06 0.0033 18.3 0.7 1 58 3777 3824 3777 3846 0.79
31 34 4e-14 7.2e-11 42.8 3.8 1 86 3867 3944 3867 3945 0.83
32 34 1.2e-11 2.1e-08 35.0 0.6 1 87 3968 4039 3968 4039 0.80
33 34 2.5e-13 4.4e-10 40.3 3.8 1 87 4367 4440 4367 4440 0.81
34 34 1.3e-11 2.4e-08 34.8 1.0 1 86 4464 4531 4464 4532 0.82
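
The domain rows above are whitespace-separated with 13 columns each, so they can be loaded into typed records for filtering. The following is a minimal sketch: the `DomainHit` and `parse_hit` names are hypothetical, the three sample rows are copied from the table, and the i-Evalue cutoff of 0.01 is an assumed threshold for illustration, not one stated by this record.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    index: int       # "#": position of this domain among the hits
    total: int       # "of": total number of domain hits
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hit(line: str) -> DomainHit:
    """Parse one whitespace-separated domain row into a DomainHit."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 columns, got {len(fields)}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        *(float(x) for x in fields[2:6]),
        *(int(x) for x in fields[6:12]),
        float(fields[12]),
    )


# Three rows copied from the table above.
ROWS = [
    "1 34 4.4e-16 7.8e-13 49.1 2.4 1 86 612 684 612 685 0.85",
    "11 34 0.03 53 4.8 0.2 1 72 1683 1748 1683 1758 0.62",
    "30 34 1.9e-06 0.0033 18.3 0.7 1 58 3777 3824 3777 3846 0.79",
]

I_EVALUE_CUTOFF = 0.01  # assumed threshold, chosen only for this example

hits = [parse_hit(row) for row in ROWS]
confident = [h.index for h in hits if h.i_evalue < I_EVALUE_CUTOFF]
print(confident)  # [1, 30]
```

With this assumed cutoff, domain 11 (i-Evalue 53) is dropped while the truncated domain 30 is retained. A stricter cutoff would also remove domain 30.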

Sequence Information

Coding Sequence
atgtcacaacaacaacaccaacaacaacgtaaacagcaacattatcatatgtatcatcagcagcaacaacagcatcaccaacaacaacatTGGTATGCCACCTCCATGCATCAAACACAGCACCAAGATCAACATATTCCTGGTCATATACAAGATTCCCGCCATTTGCATACGTTTGCTAGTGGTGCTGGTGGTAGTGGTGCGGGTATGTATATAGGAAATAGCATAACAAATATAAACCGACATGCATACAATATGCCGGCTTCCACGTCTACACATTATCCTTTTGTAAATGCAATGGGAGGTGGTGGACGAGCCTACGACCTTGAAATGGTAAACAGTGTAGCAAATCGAATCGGCCCGACAGCCCCAGCCTCACATTCAATGGTTGGCAATCGTTCTTACGATGCTTTTTCACACAATACACTTTATGTACAACAAGACCAACAGAGGCAGCACCATCTACACcatcatattacacagcatcatcagcatcagcaacaactgcatcatcaacaacagcaacagcacctctatcatcaaccacaacatcaacatcatcgtcaacaacatACTCAACAAGTAATACCGCCTTTGATTCAACAAAATGTTAAAAGCGAACCCATGGAAGAGATAACCGTAACGCCGACCATACAAATGGAAGAAGTTATAATTAAAACCGAACCTCACGAGGATTACAACAATTATCACAAAAACATTCGAGAAAATAATCAAATGCCATACAGCAGCTACAAGGGTATAAAGCAGGAACCGCAACACCTTcaacaacaacaacaacaacggcagcagcaccaaaatcaacatcaacatctccagttgcagcaacaacaTCAGCTGCTGCTACCAAGATCGACTCCACCCAATGATTATAATTTAGGAAATGAAGACAGTGACTTAAACAGTAAACTAGATATGAAGCCTTTAAATTTTCCACGTCGCAAAGTGCAAACAGAACGTTCCTTGACTCTGCCCATATGTCAACGGTGCAAACAAGTTTTCTTAAAGCGACAGAACTACACGCATCATGTTGCTTTATCAGTGTGCGACATTGTTGAATATGATTTAAAATGTTCCATCTGCCCGATGTCGTTTATGTCCAATGAGGAGTTGAACGCGCACGAACATTTGCATCGTTTAAATCATTATTTTTGCCAAAAATACTGTGGAAAACATTATGAGACAATATTGGAATGCGAACAACACGAGTACACGCAACACGACTACGAATCATACAAATGCAATATTTGTTTGTTAGAGTTTTCGTTACGTGAAGAATTGCTACAACATATACCGTTGCATAAATATCAAATTCGCTTTGTTTGTTCGGTGTGTCGTGAATGGTTTCAAATTTTGCCAGAATTACATGATCATTGCGTGGCGGCGCCCAACCTTTGTGGGAAGTTTTACAATAAAGATGCTGTTAACAAAAACCAAAATAACGATTCTTTGCATAGCGAATCGTCTAAACCAAGAAAAAGTTCAATTTGCAGCAAAAATCCCATCGAGACAAGCACGCAAAAAGATTCTCAATCAAATAGCGCTAATCGAGGTTTTATCAGCTTGCCTGAAGAACAAGAGGTAAAAACTGAAATTAAGGTTGAACCAGATTTCTATCCACCTCTAGAGCAAGCAGATTTCGAACGGTTCGATGGTGATTACAACTCCGAGAATTTTTCTTCAACCTCGTCAATGAGCAATCAAAATCTGAACTTTTTGCATGATTTTCAAGATAACGCGTCGAGTAGCACCAATTCTTCGTACACTATGCCGCCTGCGAATAATGAAGCAGTTGCTGGTGATGAAGATGCTGTATGTTGCGTGCGTTTGTGCGGGGTAAACAAATTCAGAAGTCCGACGCTACAGTTTTTTGGTTTTCCCCGCGACAACAAGTATCTGCAACAATGGTTACATAATTTAAAAATGCCTTACGATCATCAGGCCAATTACACGCAATATCGCATTTGTAGTTTACATTTTCC
AAAAAGATGTATGAATCGTTACTCTCTAAGTTATTGGGCAGTGCCAACATTTAATTTGGGCCATGATGACGTAGCGAATATATACCAAAATCGTGAAATTAGTAATAGCATTGCCATCGGCGAGATGTCTCAGTGCTATATGCCGGGTTGTCGATCGCAGCGTGGCGAAAGTAATGTGAAATTTTATAATTTTCCGAAGGATTTAAAAACTTTGATAAAATGGTGTCAAAATGCCCGATTACCTGTGCACGCTAAAGAACCGCGGCATTTTTGTTCACGACACTTTGAAGAGAAATGTTTTGGTAAATTTCGTTTAAAACCGTGGGCTATACCTACTTTACATTTAGGCACTGTATACGGAAAAATTCATGACAATCCTAATGTTTCGTATTTGGAGGaaaaaaaaTGTTGTTTGCCATTTTGCAGGAAAAGTCGTTCAGATGATTTTAACTTATCTCTTTATCGTTTTCCTCGCGATGAAACCTTGCTGCGTAAATGGTGTTATAATTTACGTTTGCATCCAGATGTTTATCGTGGTAAAAATCAGAAAATCTGTTCTCACCATTTTATTAAGGAAGCGTTGGGGTTGCGTAAACTATCTCCGGGCGCAGTTCCAACGTTGAATTTGGGACACAATGACCGTGTGAATATCTATGAAAACGAATTATTATCCGCTCCTGTTAACCCTGCTCCAAATTCCTTCATTAAAATGTCTAAATATCATCATTCTTCCCATTCCAGTTCTTCCTCTTCCATTTACGATGAAGTTTTCACTAACAATTGCTCTTCATCGGCAAAGTTTACTTCTTCGTCCACACCAAATTCAAATGCTCTAGACCTGGGTGATATGTGTTTAGTACCATCATGTAAGAATACCCGGCACACTGAAAATATTACTTTGCATACTATACCTCGACGACCGGAACAATTGAAAAAATGGTGTCATAATCTCAAAATGAACTTGGAAAAGCATCACAAAAGTATACGCATTTGTAGCGCTCATTTTGAAAGCTACTGCATTGGCGGATGCATGAGGCCTTTTGCTGTGCCTACTTTAGAGTTAGGTCATGATGATTCTAACATATATCGCAATCCAGACGTCATCAAAAAGTTGAATATACGGGAGACTTGTTGTGTACCCTGTTGTAAACGCAATCGTGATCGCGACCATGCCAACTTGCACCGTTTCCCTACTAACCCTGAGTTATTACAAAAATGGTGTGAAAATTTGCAGAAACCTATACCTGATGGTACAAAGCTGTTTAATGATGCCGTTTGCGAGGTTCACTTCGAAGACAAATGCTTACGTAATAAACGTTTAGAAAAATGGGCTGTACCTACTCTGAAATTGGGATACGAACCTATAATTCATCAATTACCTTCAGAACAGGAGATAATGGAATTCTGGTCTAAGCCACCCGCTCCTAACAACGGCGATGAGCTAGGCGAATGTTGCGTATCTACTTGCAAAAGAAATCCGCAAGTGGATGACGTTAGATTATACCGACCGCCCGAGGACGCTGAGCAGCTAGTTAAATGGTCTCACAACTTGCAAATAGAAGTGACAGAATTATCTTCTTTGAAAATATGTAACCTTCACTTTGAGTCGCATTGCATAGGTAAACGATTACTTAACTGGGCCATGCCTACACTGAATTTGGCTTCTAACGTCGAGCATTTGTTTGAAAATCCGCCTCCCACCTCGTCTAGTTATAAAAGAAAAGTGAAAATCGAATGTCAGAAGCCTCAGGAATTCACTAAATGGTCACCGCGCTGCTGTTTATCGCACTGCCGTAAAACTCGCAATCACGATCAGATACAACTTTATAGATTTCCTGTTAATATTCATACCTTAACGAAATGGTGCCATAATCTTCAACTGCCTACAGTGGGTAGTTCTCATCGTCGTATTTGTTCAGCTCATTTTGAAGCAGCAGTGCTTACAAAACGTTGCCCTATAGCTTTTGCAGTGCCCACGCTAGATCTTAACACACCCTTAG
GTTACAAAATCTATCAAAATTCTCCAAAACTTAAGCAACACAGGATTATTAATCAGCGATGCTGTGTAGTGAATACATGTCGTAAAACCCGTTCTGATGGTGTCCAACTGTTTCGTTTTCCCAATAATCGCGTCATGCTGAACAAATGGCGTCATAATCTTAAGCATCTTCCAAAAGGTAAATTAAGTTCGCAATTCCGCATTTGTTCATTGCATTTCGAAAAACATTCAGTAGGCTTGAAACGCCTATCACCTGGTGCCATTCCCACACTAAATTTAGGGCACGATAATTCTGAAGATTTATATCCGAATGAAACTAGATCGTTCTTTGAACTGGATAAGTGTGTTGTAACGGGCTGTTCGTCTAGCAAAGACATGGAAAATGTTCGTCTTTTTAAATTTCCTCGAGAGGACGAAGAATTGCTGGGGAAATGGTGTCATAACCTAAAAATGAATATCTCGGAATGCCTAGGCATAAAAATATGCAATAAGCATTTTGAAGCAGAATGTATGGGTCCCAAATTATTATACAAATGGTCTATTCCAACCTTAAACTTGGGTTACGAGGAAAATGAAACATTGGAAATTATACCTAATCCTCCGCCAGAAAAACGCTCGGGAGATGTTTTGTTTAAATGTTGTGTTTTTAGTTGCGGTAAAACACGAAAGTACGACGACGCCCAAATGAATAGTTTTCCTAAAAACATAAAAATGTTTCGGCGATGGAAACATAACCTCAAACTGGATTATTTAAATTTCAAGGAACGAGAAAAATTTAAGATTTGCAACGATCACTTCGAGCCCATATGTGTAGGAAAGACTCGATTGAATTTTGGGGCCATACCTACGTTGAACTTAGGTCATGATGATGTAGACGATTTGTACAAAATAAATCCAGATAAAGTGAAACCTAATTTATTTATAAAGCAATCAACGTACGAGGAAGATCCGCTCGAAATTGGTTACCCGAATGACAACTTGATAGGAGAAGACATAGAGGAGCACATGGAGTCTTCCAGCTTACCATTAGATATATCCAATCTAAAATGTGCGTATATCGAATGCAAAGGCCCGAAGTGTTTGTTGCGTGAACCGTATAGTTTACCACAAACTGACGTGTTTAGAAACATGTGGTTTTCTCTAATGGATTTACAAGAAAGCTTTGCTTACGGAGAAAGCAAACTGTGCGGCCTACATTACCAAAAAGTCTTCGAAAATTGCAAAGAAAAAATGTTCGCTTTAACGTTAGAAAATGACGAATTAAAAGAGGATTTCGAAAAACTTCAGAATGCTTATCACAAGTCGGAGATTTCCTTAATTATCAGGAGCTGTAAATGCAGTAGTGAAGAGTGCACTAACTCTTTAATTTTACCTAATATCAGACTCTACCAATTTCCATACGGCAAAGAGCTTAAGGAAAAGTGGTCCTATAATACCGGCATAGAGCCAGATGAACATCGTCGCTATCTAAATAAAGTATGTTGTATGCATTTCGAATCATACTGTTTTACGTCAAATCAACGTTTGCGGTCGTGGGCTGTACCTACTTTAGAACTAAATCATTCACAAGCAGATACGTTGCACAAGAATCCCGACCTAACGAAAATCGATCGTCGTTTGTTGGGTCCTTCAATTATGAAGTGTTGCGTTCGTGCCTGCAGTGGAGCAAATACCATAGAAGGTGAATCTTTGAAGCTGTTTAGTTTTCCATTGGATGAAGATTTACTAAAGAAATGGTGTGAGAACCTAAATATGTCTTTAGAACAAACCCCAATATTCAAAGTCTGTTCTTTACATTTTGAGAGGCAATGTTACGGCTTAACACGCTTACGTGTGGGGGCCATACCCACCTTACAGCTTGGCCATACTAGTGAACCTCGGCACTGCATTCCAAACAACACTAAAAAGGAAATGTATGATTTGGAATCCACTCATAATACTTTGAAACAGGTGAAAATaaaaaaaTCTTTAGGATCAGTAAAATGTTTTATACCTTCGTGT
CGTCGTACACGTTTGCAGCACGGCGTACGTTTTTATACCTTACCTTCTAATTCCAAATTGCGACGTAAATGGTGTCACAATCTGCGTTTGTCTCTAAATCTAGGCAAACTGCAGAGTTTGCGCATTTGTAGCTTACATTTTCACAAGAAATGTTTAGATGGTCGCAATTTAAAGCCTTGGGCTGTGCCTACCTTGCATCTTGGACATCAAGAAGCTATTTTTGATAATCCTCGTACTTTGTATGGCCAATATGTACCACAGTGTGCTCTGTGGCACTGCCGAAATCAAAGGGTTTTGAACAAAAGCATGCGTTTCTTTACTTTTCCGAAGAGCGCTGATCTTTTGGAAAAGTGGTGTAAAAATTTAAAATTTTCCTTTGATCAATGTAATGGATGTTTGTGTGAAAGACATTTTGAAACTGAAATCATGAGTTTAAAAAATCTGAAGAAAGGTAGTGTACCCACTCTGAATCTAGGTCACTCGGAAGCGTTGGAATTTAATAATTTAAATTTAATTGAAGAGATGAAAAATGCAAATATTTTTGATATGGAAGAAGAAGACGGAGAAGACTGTTTTTACAGAGAATTTAAAGAAAGTGAAGAGTTCGATTTGGAATCGGAAAGCTTGAGAACTCCGACTAATTGGAGCAATTTAGAAGTGAAAGAATTACGCATCACACTAACTCCTCTAAAACGCGAAGATTTGCCTGAGATTATGTCGGTGGCATCTCCTTTAGAACTGGAAGAAGAGGAAAATTATATTACTTGCAATGAATCAAGAGAAGAGGATACTTCGGAACAAAGACAAATATTGCGCAAAGACAAGGCGGTGAACAATTTCAATCCCATATGTTGCTTGAAACATTGCGGGAAAGAGAAGACACCTGAACAACATTTGACGACTTTCGGTTTTCCTAAGGATCGTGATCTTTTACAAAAATGGTGCGATAATTTAGGATTAGAACCTTCTGAATGTATAGGCCGTGTGTGTGTCGACCATTTTGAGCTACGTGTCATGGGCAATAGACGTTTAAAACCGGGGGCTGTCCCAACTTTAAATTTGGGGCACAATAAACCTTTAATACACACCAATGAACCTATCAAGGCAAAAGTTATGCACGACGAACTTAAGTTCATAGAAGGCAATGAAGAAGAAAAGCAAGTTAGCCAAATTGTAAAGCCGGCTCCACCACCTTACAAAACTAAGCCAGCTAAGCAATCGGTTTTTCGGCTATGTTGCCTCAAACATTGCCGACGCAAGAGAATGCAAGGAAAGGTGGGAGTGGCAACATTAACATTTAAAATACCTCGAAATTTAAAACGATTGAAGCAGTGGTCAGCAGCGCTTAAATTACCAGAAGAAGTTTGTAGGCGACCGAGAATGCTTTTATGTGCTCAGCATTTCGAACCACATATGATTAACGAGGAAAAAGCCCAACTGAAGTCCAACGCAGTACCCACCCTAAACTTAGGATACGAGCCAAATAAATCTGAAGCACGCAATGGGACATTGGACCTAGAAAAATGCGATCTGAGTCATTGCGGTCGGGTCGCCGACAATGATGGCGTGTTTTTGTTATCTTTTCCATTCAAACCTTTGGTTTTATTGCGCAAATGGTGTTACAATACAAGAATATCTTACAAGACTAAAAACTTGAAATTCCTGAAAATTTGTAATGACCATTTTGAAAAACAAGtttttttGaaaaaaaaaTGTCTTCGCTTCAATGCAGTGCCCACGTTAAATTTGGGTCATCCAGGAAAAATTTATAAGAATCCTAAATCCTATAGGTTAAAAACTTTACTTAAGCCAAGGGAAAAATGCTGCGTAATCAATTGTCAAGAAGAACAGAAGAAGATGTATGGGTTCCCAAAAAGCTCTGAGCTTCGTCGCATATGGTCCAACAATTTGCGTATTGAAACCCGCGTGGCCTTGAAGCAACAATTCAAAGTTTGCCAACGTCATTTTGCAAGCGAAAGCTTTGTAAATGGAACGGATTG
CTTAAAGATAGAAGCGATACCTATCTTGGAGCTAGGTGATGATAAAGACCACCACTTGGTTTTAGAGATGGACGCAACGGCACAGAGCAGCCGCCAATGTATGGTTAGAAATTGTGGTTGCATACCCAGTGTAGATAAAGTGAAACTGTTTAGCTTTCCTCAGCCCGGAGAAATTTTGGAAAAATGGCTTTTCAATCTACAATTATCTGCGAAATATGCCGTCGAAAACACCTATATATGTAGCCGACATTTTGAAAAGAGCTGTATACGTCGAGGAATTTTGCATGAAATGGCGATACCTACACTGTGTTTAGGGCATGCCGATTGCTTTTATGGCAACGAGGAAGAAATGTTTACAACTCCTATAAAATGTTGCGTAACGAGTTGTAATTACAATCCGGCAGAAGATGAGTCGGAAATTTTGCGTAGCATGTATCGCTTTCCCAAAGATGCAGAAAACTTaaaaaaaTGGTTAGATAATTTAAATATGAGCGATACTGTATACCAGAAACAAAAAAGTGCAAGGATTTGCGGACATCATTTTGAGGATGTTTGCAAGCTGAAGGGTCGAGAGACGCTATTACCTAATTCGATGCCTACTATGAATTTGGGTTATGAAAGCAAGGTGAAAAATATGCATCGAAATCACTTTGAAAAGTGCTGTTTGCAGTCATGCAAGATGAGAGAGAAATTCTCTGTGAGCTTACATGATCTACCGCAAGACCTAAATATACGCTCTCTTTGGTTTGAAGAACTTGCTTTGGAGGATAGAATAAATACGGAAAATTTTATTTGCTCCCCACATTTTATGGCTATATTTGAAAGACTGAAAGAGAAGCATAAGACGTACTTGAAGCAGTATTTAGAATATGGTGTGCTATCAACTTCGTACAAAGAACTGAAACAGTTAGACTTAATGCAAGGCTTTAAATGTTCTATTCCCAAGTGCTCGACAGGGTTTAAGATGACTGCCAAGCTTTTCAAATTTCCCAACGATGTTAATCTTTTCAATAAATGGCAACACAATACCGGATTACAATTTGAAATGAATAAGCGCTGCCTACATCAAATATGTGCCCTTCACTTTGAGCCCAGGTGTCTGAGTGAAGTGAAATTACACCGTTGGGCGCTCCCTACGCTAAAGTTACCCAACATCAACAGTTTGTATGTCAATCCTCCCGAAGCTTTGCCTTCCGATCACGAAAACCTTAAACATTGTTGTGTTTCCGATTGCATTTCAGAGGAAATGCCATTCTTTCAGTTTCCTTCCAAGCAGACTAATCTCAGAAAATGGATACACAATTTGGACTTGGGGCCACAGCAGTGTACCACAAATTTGCGTGTCTGTTTTAAGCATTTCGAGAAATATTGTTTTAAAAAGGAAATGAGCAACAAACAACTGACGTTAAAATCATGGTCTATACCTACTTTAAGACTAAAACGAAAATTGGACTTATATCAAAATCCGGTCGAGAAAATAAGTTTCTTCGTATGCTGCATACCGACTTGCCGCAAAGTACGGAATCCCTCTGAGGGCATATATTTCTACAAATTTCCACGAAGCAAAACCTTATCGAAGAAATGGTTATATAACGCGGGAATAGATGCGGAAAGTTTTCATGAACGTATGCGAATTTGCAGTTTGCACTTTACTTCCGATTGCTTTGTAAAGGATTGCATGATCCTTCGTAAGCACACAGTACCCACTCTAAATTTGTCAACGCCTATTGAGTTCTTGCATAAAAATCCTCCGaaaaaaaaaTTTGGTAACTTCGGGCTAACGCACTGCTTAGTAAAATCTTGCAATGTAATGGAATTAAAGGACAAAGAGTTACACGACTTGCCTAAAAATAAAACTGTGTTAAGTAAATGGTGGCATAATTTGGATTTGGATATGCATAATGACGCGACATTGGCAGATAAAAGTATGAAAATATGTAAAATGCATTTCACTGACGAATGTTTTGATAAAAAGGGTGAATTAAAGTTGAAATCGATTC
CCACCTTAAAATTGGGTCATGATaaaaaaaTTTTTCAAAATTTTGACGAAGCGATCGGCGTTGATAGACTACTGAAAAGTGGGATTATTGAGGTCGACGATAGTTCAAAATCGAAGAAAGAGACAAAATGTTTCGATTTGCATGCGGAGAAATATGTTAAAGACTATCCTTTAAAAGGGTCAAAGCTTTTCAAAACTGCTATTAGAAACAGCAAATCACATTTAGCCCCAATAGGACAAACTAAAATCAATCGGATACAAAGAGTTTATAATAAGAAAAAGTCCAAACAGTTGATCTACAAATACGAACGATATCGAGACCAGTTTGAAAACGCAGCACACGTTGTTAGGAAAAATGTTAAGTCAGAAAATAGCCGCGCTCTTGGCAAACTTATTAAGAATACTCGAATAGCAAATCGCGCGAAGTGTTCTATACAAAATTGTTTAGAGCTGGaaaaaaaCCCCAAGCTACAAGTTTTTAGCTGTCCACATAAAACTGATCTGCTAGaaaaatggttagagagtattggagaaaatgtaaaaaataaagtacagttattaaaaaaatataaaatTTGCTCCTCACACTTTGAAAACCAATGCTTTCAAGATCAGCGTTTATTGTACGGGGCTATTCCTACTGTAAACTTAGTGAATAAAAATATTaaaaaaaaTTTGTGCTCGGAGTATTTGAAAGCTTATGAGTGCGAACGCTGTTTTGTGAAGAAATGTGGACGTTCAGACGAGTATGATCAAATAATCAAATGCTATTTTCCTAAAGAGAAGGATTTATTAGAGAAATGGCTCTTTAACTTGAATATAAGACGAGAAGAGATAGCCGGACATAAGTGGCTCTGTCATATGCATTTTGAGCAAAGATGTTTGAAGCAAAGAAAGCTTCTACCAGACACAGTACCCACCTTATTGTTGGATTATGAATCCAGTAGCAAGAGTGGTTTCTTCAGAAATCCTGAAGTTTGCTCTGCCACCAGGAAATGCCGACAAATGCTTTTAAACGCATGTTGCGTAAATGGCTGTCAGAATGCTAAAGGAGGGAATCCCTTTGTACAGTTAAGTCAGTTTCCAAAACAATCTCAACTTTTTCGAAAGTGGCTGCATAATTTAAAGATCATTGATTCTAACGAAGTACGGCAATACTATCGTATTTGTACCCTACACTTTGAGTTGAAGTGTTTCAATAAGTTTTCTTTAAAGTTAGGCTCGATACCTACAAGGAACTTGGGCCATCGCGACTCTGATATATACGAAATGTTGGATGAGGAATTGCACGATAATCAAATaaaaaaaTCCAAACTTTACAATAAGAAATGCTCATACCCCTGTTGTAAAGGTAATAAGACTAAACTCTATGACTTACCGCAATTTAAGGTCATACTTGAAAAATGGTGGCAAACTATGCAATTGTCTTCATTTAGCGAACGAAATCAGGCGAAAGTGTGTGATATACATTTTTACATGTTGTATAATGAGCACTATGAGGTTATAAAACAGATGGAACATGAAGATCCCGGTAGCGTTAAAGACCTGATAAACCTTTATCATAACATAGCCGCAAGAGGCAAAGTCATACGTCATAGATGTGCTGTGCCGGACTGTAATACTGATCATTTAAATAGATGCAATTCTGGTGTAAAATTATATAATTTTCCTACTGATTCAGGACGGGCCAAAAAATGGTGTTTCAACTGCCATTTGAGTTATGAAGGGAAAATTAAAGATGACCATGACCACAATTATAAAGTGTGTGCTCTGCATTTTGAGGAGTATTGCATTAACGACACCAAATTGCAATCTTGGGCAATACCCACTTTGCAACTAAGGTCATCAAAGCTCTACATCAATAATACATCAATTGAGAATACTTTTTACGATATTGACAGATGTTGCATTGGGTCATGCATTAATTCCCAAGGTCTAAGAACCAATACGTGTTTTTATGATTTCCCACAAGAACGAGTCATACGCGACAAGTGGTTACAA
CGCACAAATTTGAAGAATTTTAATTTTCAAAAGATGCGCATATGTGGTTTACATTTTGCaaaaaaaTTTCTGCGCGACGATAACCAACTATTACCCTTAGCCTTACCCACCTTAAATTTGGAATTAACAGATTCAACTATTAAAACACATGAAAATACTTACGATACTAGTTCTAAATTACAGCACATTGAAACCGCGGTAACTGTTAAACAGGAAGTTGAAAGCTGGGAGGATTGGTTGCCATCCGATCCTTTCAACACCCCATTCCACGAGTCTTCCAATAAAATTGAAAATAGTGCCGATAGTTATTTTCTTGAGCCTCCTCCGCAATGTGAAATTATTACTGTTAAAGAAGAAATCATCGATGTTGACTACGAGATGAAATCGGAATGTTCTAGTGGCATCGAATACTATGCGGAGAAACACTTTATGAAACCTCGAATTGTGGCCTGTTATTCTCAAAATTTGCGTGTTCATGATTTCGCTAAGAAAGAATTGATTTTAGAAGAGTTGCAAAGCGAAAATTATGCGGATGCAAAATTAAGAATATTGAACTCATCTGAAGACTCTCAAAATCAGATAAATGAGAACAGTGGTGCAATAAGTTTTCAGACTTTAAAGTTGCCTTTGAAACTTTCTCTACCCATATCTACAGTAAACGAGAAAAGCCATATATTAACAGATAGAGAGAAAATTGTTGCACAAAGTTCTGTGAATAGCTGTCACCTTTCGCAGAAATTCAGCTTTAAAATGATGCACGGAGACTTACAAAACAAAGCCTACGACCAAATCAATTCAGATTTCTCAATAATGGGAAAAGTGGAAGAAAGAGAATTTCCTCAAAATAAGAACGGAGGTACGGACGCGAAAGATAATCAAGTCGTAGAGAAGTCTTTAGAAAGAAAGACCAGTGCTCAAAATGCGACTAATAACATGGAACGAAACAAAAGTTGTAAAAATGAAGACTCTTCTGAAAATTCACCAATCGCAAGCTATTATTTATCTCCAACAGAGATAAGTGAGAGCATTGATAGACAAAGTTTTTGTACATCTCCGATTATTGCGGTTGATGTTCCTTCGAAAGGTTTATCTCAAACTTGCTGTGTAGCCAAGTGTTTAAATACCGCTGACGCACCATTGGTAAAAATTTTCACTAAATTCCCTTCCGATTCAGAGCTTTTTATCAAATGGTGTTTTAATTTAAAAATTGACCCACGCCATTTTCGGGAACATTCTTACGCGGTTTGTAGCACCCATTTTGATCCGTTATGTCTTCAGGACAATCGTTCGCTGCACCCTTGGGCGGTTCCCACTTTAAATCTAGGTCTTCCTCGCAACTCTTTTATACATCAGTACGAACTACCCATTAGTTATAAAACCTCAGAGGAATGCATAGTTTGGGGCTGCAATCAAGCAAAGCCGCCGCTTTACAAATTCCCCGCCAGTCCAGAGCAATCGAAACGATGGTTTTCAAATTTAAAATTAGAATATACAGAATTTCGAGCACAGACTTATCGCATCTGCAGGAAACATTTCGAAAATCATTTCATCGATGCGTCCGAGCAGCTAAAAAGCGAATCTTTGCCTACTTTGCAATTAAATATAAACGATAGTGAGGATCTTCCTAATAATAACAATGCAGACATAATGCATCCTTTGGTCTCCTCGCCTGAAGACTTAGAGGATCATGATAGTAGTTACTATGAAGATTTTGAAGAGTGTATCAACCATGAGGAGGAGaaaaattga
Protein Sequence
MSQQQHQQQRKQQHYHMYHQQQQQHHQQQHWYATSMHQTQHQDQHIPGHIQDSRHLHTFASGAGGSGAGMYIGNSITNINRHAYNMPASTSTHYPFVNAMGGGGRAYDLEMVNSVANRIGPTAPASHSMVGNRSYDAFSHNTLYVQQDQQRQHHLHHHITQHHQHQQQLHHQQQQQHLYHQPQHQHHRQQHTQQVIPPLIQQNVKSEPMEEITVTPTIQMEEVIIKTEPHEDYNNYHKNIRENNQMPYSSYKGIKQEPQHLQQQQQQRQQHQNQHQHLQLQQQHQLLLPRSTPPNDYNLGNEDSDLNSKLDMKPLNFPRRKVQTERSLTLPICQRCKQVFLKRQNYTHHVALSVCDIVEYDLKCSICPMSFMSNEELNAHEHLHRLNHYFCQKYCGKHYETILECEQHEYTQHDYESYKCNICLLEFSLREELLQHIPLHKYQIRFVCSVCREWFQILPELHDHCVAAPNLCGKFYNKDAVNKNQNNDSLHSESSKPRKSSICSKNPIETSTQKDSQSNSANRGFISLPEEQEVKTEIKVEPDFYPPLEQADFERFDGDYNSENFSSTSSMSNQNLNFLHDFQDNASSSTNSSYTMPPANNEAVAGDEDAVCCVRLCGVNKFRSPTLQFFGFPRDNKYLQQWLHNLKMPYDHQANYTQYRICSLHFPKRCMNRYSLSYWAVPTFNLGHDDVANIYQNREISNSIAIGEMSQCYMPGCRSQRGESNVKFYNFPKDLKTLIKWCQNARLPVHAKEPRHFCSRHFEEKCFGKFRLKPWAIPTLHLGTVYGKIHDNPNVSYLEEKKCCLPFCRKSRSDDFNLSLYRFPRDETLLRKWCYNLRLHPDVYRGKNQKICSHHFIKEALGLRKLSPGAVPTLNLGHNDRVNIYENELLSAPVNPAPNSFIKMSKYHHSSHSSSSSSIYDEVFTNNCSSSAKFTSSSTPNSNALDLGDMCLVPSCKNTRHTENITLHTIPRRPEQLKKWCHNLKMNLEKHHKSIRICSAHFESYCIGGCMRPFAVPTLELGHDDSNIYRNPDVIKKLNIRETCCVPCCKRNRDRDHANLHRFPTNPELLQKWCENLQKPIPDGTKLFNDAVCEVHFEDKCLRNKRLEKWAVPTLKLGYEPIIHQLPSEQEIMEFWSKPPAPNNGDELGECCVSTCKRNPQVDDVRLYRPPEDAEQLVKWSHNLQIEVTELSSLKICNLHFESHCIGKRLLNWAMPTLNLASNVEHLFENPPPTSSSYKRKVKIECQKPQEFTKWSPRCCLSHCRKTRNHDQIQLYRFPVNIHTLTKWCHNLQLPTVGSSHRRICSAHFEAAVLTKRCPIAFAVPTLDLNTPLGYKIYQNSPKLKQHRIINQRCCVVNTCRKTRSDGVQLFRFPNNRVMLNKWRHNLKHLPKGKLSSQFRICSLHFEKHSVGLKRLSPGAIPTLNLGHDNSEDLYPNETRSFFELDKCVVTGCSSSKDMENVRLFKFPREDEELLGKWCHNLKMNISECLGIKICNKHFEAECMGPKLLYKWSIPTLNLGYEENETLEIIPNPPPEKRSGDVLFKCCVFSCGKTRKYDDAQMNSFPKNIKMFRRWKHNLKLDYLNFKEREKFKICNDHFEPICVGKTRLNFGAIPTLNLGHDDVDDLYKINPDKVKPNLFIKQSTYEEDPLEIGYPNDNLIGEDIEEHMESSSLPLDISNLKCAYIECKGPKCLLREPYSLPQTDVFRNMWFSLMDLQESFAYGESKLCGLHYQKVFENCKEKMFALTLENDELKEDFEKLQNAYHKSEISLIIRSCKCSSEECTNSLILPNIRLYQFPYGKELKEKWSYNTGIEPDEHRRYLNKVCCMHFESYCFTSNQRLRSWAVPTLELNHSQADTLHKNPDLTKIDRRLLGPSIMKCCVRACSGANTIEGESLKLFSFPLDEDLLKKWCENLNMSLEQTPIFKVCSLHFERQCYGLTRLRVGAIPTLQLGHTSEPRHCIPNNTKKEMYDLESTHNTLKQVKIKKSLGSVKCFIPSC
RRTRLQHGVRFYTLPSNSKLRRKWCHNLRLSLNLGKLQSLRICSLHFHKKCLDGRNLKPWAVPTLHLGHQEAIFDNPRTLYGQYVPQCALWHCRNQRVLNKSMRFFTFPKSADLLEKWCKNLKFSFDQCNGCLCERHFETEIMSLKNLKKGSVPTLNLGHSEALEFNNLNLIEEMKNANIFDMEEEDGEDCFYREFKESEEFDLESESLRTPTNWSNLEVKELRITLTPLKREDLPEIMSVASPLELEEEENYITCNESREEDTSEQRQILRKDKAVNNFNPICCLKHCGKEKTPEQHLTTFGFPKDRDLLQKWCDNLGLEPSECIGRVCVDHFELRVMGNRRLKPGAVPTLNLGHNKPLIHTNEPIKAKVMHDELKFIEGNEEEKQVSQIVKPAPPPYKTKPAKQSVFRLCCLKHCRRKRMQGKVGVATLTFKIPRNLKRLKQWSAALKLPEEVCRRPRMLLCAQHFEPHMINEEKAQLKSNAVPTLNLGYEPNKSEARNGTLDLEKCDLSHCGRVADNDGVFLLSFPFKPLVLLRKWCYNTRISYKTKNLKFLKICNDHFEKQVFLKKKCLRFNAVPTLNLGHPGKIYKNPKSYRLKTLLKPREKCCVINCQEEQKKMYGFPKSSELRRIWSNNLRIETRVALKQQFKVCQRHFASESFVNGTDCLKIEAIPILELGDDKDHHLVLEMDATAQSSRQCMVRNCGCIPSVDKVKLFSFPQPGEILEKWLFNLQLSAKYAVENTYICSRHFEKSCIRRGILHEMAIPTLCLGHADCFYGNEEEMFTTPIKCCVTSCNYNPAEDESEILRSMYRFPKDAENLKKWLDNLNMSDTVYQKQKSARICGHHFEDVCKLKGRETLLPNSMPTMNLGYESKVKNMHRNHFEKCCLQSCKMREKFSVSLHDLPQDLNIRSLWFEELALEDRINTENFICSPHFMAIFERLKEKHKTYLKQYLEYGVLSTSYKELKQLDLMQGFKCSIPKCSTGFKMTAKLFKFPNDVNLFNKWQHNTGLQFEMNKRCLHQICALHFEPRCLSEVKLHRWALPTLKLPNINSLYVNPPEALPSDHENLKHCCVSDCISEEMPFFQFPSKQTNLRKWIHNLDLGPQQCTTNLRVCFKHFEKYCFKKEMSNKQLTLKSWSIPTLRLKRKLDLYQNPVEKISFFVCCIPTCRKVRNPSEGIYFYKFPRSKTLSKKWLYNAGIDAESFHERMRICSLHFTSDCFVKDCMILRKHTVPTLNLSTPIEFLHKNPPKKKFGNFGLTHCLVKSCNVMELKDKELHDLPKNKTVLSKWWHNLDLDMHNDATLADKSMKICKMHFTDECFDKKGELKLKSIPTLKLGHDKKIFQNFDEAIGVDRLLKSGIIEVDDSSKSKKETKCFDLHAEKYVKDYPLKGSKLFKTAIRNSKSHLAPIGQTKINRIQRVYNKKKSKQLIYKYERYRDQFENAAHVVRKNVKSENSRALGKLIKNTRIANRAKCSIQNCLELEKNPKLQVFSCPHKTDLLEKWLESIGENVKNKVQLLKKYKICSSHFENQCFQDQRLLYGAIPTVNLVNKNIKKNLCSEYLKAYECERCFVKKCGRSDEYDQIIKCYFPKEKDLLEKWLFNLNIRREEIAGHKWLCHMHFEQRCLKQRKLLPDTVPTLLLDYESSSKSGFFRNPEVCSATRKCRQMLLNACCVNGCQNAKGGNPFVQLSQFPKQSQLFRKWLHNLKIIDSNEVRQYYRICTLHFELKCFNKFSLKLGSIPTRNLGHRDSDIYEMLDEELHDNQIKKSKLYNKKCSYPCCKGNKTKLYDLPQFKVILEKWWQTMQLSSFSERNQAKVCDIHFYMLYNEHYEVIKQMEHEDPGSVKDLINLYHNIAARGKVIRHRCAVPDCNTDHLNRCNSGVKLYNFPTDSGRAKKWCFNCHLSYEGKIKDDHDHNYKVCALHFEEYCINDTKLQSWAIPTLQLRSSKLYINNTSIENTFYDIDRCCIGSCINSQGLRTNTCFYDFPQERVIRDKWLQ
RTNLKNFNFQKMRICGLHFAKKFLRDDNQLLPLALPTLNLELTDSTIKTHENTYDTSSKLQHIETAVTVKQEVESWEDWLPSDPFNTPFHESSNKIENSADSYFLEPPPQCEIITVKEEIIDVDYEMKSECSSGIEYYAEKHFMKPRIVACYSQNLRVHDFAKKELILEELQSENYADAKLRILNSSEDSQNQINENSGAISFQTLKLPLKLSLPISTVNEKSHILTDREKIVAQSSVNSCHLSQKFSFKMMHGDLQNKAYDQINSDFSIMGKVEEREFPQNKNGGTDAKDNQVVEKSLERKTSAQNATNNMERNKSCKNEDSSENSPIASYYLSPTEISESIDRQSFCTSPIIAVDVPSKGLSQTCCVAKCLNTADAPLVKIFTKFPSDSELFIKWCFNLKIDPRHFREHSYAVCSTHFDPLCLQDNRSLHPWAVPTLNLGLPRNSFIHQYELPISYKTSEECIVWGCNQAKPPLYKFPASPEQSKRWFSNLKLEYTEFRAQTYRICRKHFENHFIDASEQLKSESLPTLQLNINDSEDLPNNNNADIMHPLVSSPEDLEDHDSSYYEDFEECINHEEEKN

Similar Transcription Factors

Clusters of similar sequences computed with MMseqs2 at each sequence identity threshold

100% Identity
iTF_00746752;
90% Identity
iTF_00746752;
80% Identity
-