Basic Information

Gene Symbol
-
Assembly
GCA_008042715.1
Location
VNJY01004718.1:55920-67123[-]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
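The C2CH consensus quoted in the description can be written as a regular expression. The following is a minimal sketch, not the method used to build this record (the record relies on the Pfam HMM reported under Hmmscan Out). The names `C2CH`, `find_c2ch` and `FIRST_HIT` are hypothetical, and the fragment appears to correspond to residues 105-177 of the protein sequence in this record, that is, the aligned region of the first hmmscan hit.

```python
import re

# Consensus from the description:
# Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His
C2CH = re.compile(r"C.{2,4}C.{35,50}C.{2}H")


def find_c2ch(protein):
    """Return (start, end, matched_text) for each non-overlapping match.

    Positions are 1-based and inclusive, relative to the string passed in.
    """
    return [(m.start() + 1, m.end(), m.group()) for m in C2CH.finditer(protein)]


# Fragment copied from the protein sequence of this record (assumed to be
# residues 105-177, the aligned region of the first hmmscan hit).
FIRST_HIT = (
    "CCVPLCGVRKSTSPTLQFFTFPKDEKYLNQWLHNLKMFHIPASSYVSFRICSMHFPKRCINRYSLCYWAVPTF"
)

hits = find_c2ch(FIRST_HIT)
for start, end, text in hits:
    print(start, end, text)
```

A plain pattern like this is much less sensitive than a profile HMM, so it is best treated as a quick sanity check on a predicted domain rather than as a detector.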
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 28 2.1e-15 4.8e-12 46.3 4.0 1 86 105 177 105 178 0.85
2 28 7.8e-15 1.8e-11 44.5 5.0 1 87 205 274 205 274 0.83
3 28 7.1e-16 1.7e-12 47.8 0.2 1 87 296 368 296 368 0.85
4 28 4.9e-16 1.1e-12 48.3 5.3 1 87 466 536 466 536 0.82
5 28 1.7e-15 3.9e-12 46.6 3.6 1 86 560 631 560 632 0.82
6 28 8.9e-13 2.1e-09 37.9 1.1 1 87 667 735 667 735 0.81
7 28 6.3e-11 1.5e-07 32.0 1.4 1 86 777 846 777 847 0.76
8 28 3e-17 7e-14 52.2 0.4 1 86 874 943 874 944 0.82
9 28 1e-12 2.3e-09 37.7 1.4 1 85 965 1033 965 1035 0.79
10 28 6.6e-15 1.5e-11 44.7 0.5 1 86 1062 1133 1062 1134 0.85
11 28 1.4e-12 3.2e-09 37.3 3.7 1 85 1205 1273 1205 1275 0.82
12 28 9.4e-13 2.2e-09 37.8 0.1 1 86 1298 1366 1298 1367 0.83
13 28 2.3e-13 5.3e-10 39.8 2.4 1 87 1514 1583 1514 1583 0.80
14 28 1.1e-12 2.6e-09 37.6 0.1 1 86 1682 1751 1682 1752 0.78
15 28 0.00027 0.62 10.7 0.4 1 60 1767 1818 1767 1834 0.75
16 28 1.3e-13 3e-10 40.6 0.1 1 87 1846 1917 1846 1917 0.81
17 28 5.3e-13 1.2e-09 38.6 0.4 1 87 1969 2039 1969 2039 0.82
18 28 2.5e-12 5.7e-09 36.5 0.1 1 86 2074 2150 2074 2151 0.82
19 28 9.1e-13 2.1e-09 37.9 0.0 1 86 2161 2234 2161 2235 0.80
20 28 5.5e-12 1.3e-08 35.4 0.0 1 86 2260 2332 2260 2333 0.71
21 28 0.00028 0.65 10.7 0.0 1 58 2362 2412 2362 2428 0.82
22 28 6.6e-12 1.5e-08 35.1 0.9 1 86 2452 2523 2452 2524 0.81
23 28 4.9e-16 1.1e-12 48.4 0.3 1 86 2649 2721 2649 2722 0.81
24 28 3.5e-12 8.2e-09 36.0 3.4 1 86 2785 2855 2785 2856 0.80
25 28 1.7e-14 3.9e-11 43.4 4.6 1 86 2947 3017 2947 3018 0.84
26 28 1.9e-12 4.5e-09 36.8 0.2 1 86 3102 3171 3102 3172 0.85
27 28 2.3e-10 5.4e-07 30.2 0.5 1 58 3197 3245 3197 3254 0.83
28 28 2.8e-10 6.5e-07 29.9 1.5 18 87 3263 3321 3252 3321 0.77
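The rows above are whitespace-separated with 13 values each, so they can be loaded into typed records for filtering. This is a minimal sketch under that assumption. The names `DomainHit` and `parse_hit` and the 0.01 i-Evalue cutoff are hypothetical choices for illustration, not thresholds stated by this record.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    number: int      # "#" column: rank of the domain hit
    total: int       # "of" column: total number of domain hits
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hit(line):
    """Parse one whitespace-separated row of the domain table."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *map(int, f[6:12]),
        float(f[12]),
    )


# Three rows copied from the table above (hits 1, 8 and 15).
rows = [
    "1 28 2.1e-15 4.8e-12 46.3 4.0 1 86 105 177 105 178 0.85",
    "8 28 3e-17 7e-14 52.2 0.4 1 86 874 943 874 944 0.82",
    "15 28 0.00027 0.62 10.7 0.4 1 60 1767 1818 1767 1834 0.75",
]
hits = [parse_hit(r) for r in rows]

# Hypothetical cutoff: keep hits with an independent E-value of at most 0.01.
confident = [h for h in hits if h.i_evalue <= 0.01]
for h in confident:
    print(h.number, h.ali_from, h.ali_to, h.ali_to - h.ali_from + 1)
```

With a cutoff of this kind, hits 15 and 21 in the full table (i-Evalue 0.62 and 0.65) would be set aside as weak matches.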

Sequence Information

Coding Sequence
CCCAGTGCTCCACTCCATAAGGCGAACCCAAGCAACTCGCCCTCCTTGCCCGCCACAGCAGCGCTCAATTCATTGCTGCAACAGCGTCAGGCCAATGCCGATGGAGCGGCTATTTTTGCCGCATCCTCGCTGAAGAACGAAGTCCCTGTGAAGCTGGAGCGCAGCTACAGTAACTCGACCAACGAATCGTCTTATAGCGTTCAGGAGAGCGGCTACAATAATGTATATGGCAGCAGCGACAGCTCAGTCCACGGCGCCATCGCCGGACCACAGGCACACTCTTCCACGCTGGACGACTCCGAGGATGCCTTGTGCTGTGTGCCGCTGTGCGGGGTTCGGAAGAGCACCAGTCCCACCCTGCAGTTTTTCACGTTTCCGAAGGACGAAAAGTATCTCAACCAGTGGCTGCACAATCTCAAGATGTTCCATATACCCGCTTCCAGCTACGTTAGCTTCCGCATCTGCAGCATGCACTTTCCGAAGCGCTGCATCAACCGCTACTCTCTGTGCTATTGGGCTGTGCCGACATTCAACCTCGGCCACGATGACGTAGCCAATCTGTACCAGAACCGGGAGCTGACCAATACCTTCACCACTGGCGAAGTGGCGCGCTGCAGCATGCCCCACTGCACCAGCCAGCGGGGTGAGAGCAATCTCAAGTTCTACAACTTTCCAAAGGACATCAAGAGCCTGATTAAGTGGTGCCAAAATGCCCGACTTCCGGTGCAGGCGAAGGAGCCGCGACATTTCTGCAGCCGCCACTTTGAGGAGCGCTGCATTGGCAAGTTCCGGCTTAAGCCGTGGGCAGTGCCCACCCTTCACCTGGGCGCCCAGTATGGCAAGATCCATGACAATCCGAAGAATCTGTATGTGGAAGAGAAGCGCTGTTGCCTCAACTTCTGCCGCCGGAGTCGCTCCTCAGACTTCAATATGTCGCTATATCGATTTCCCAGAGACGAAGTTCTGTTACGTCGCTGGTGCTACAATCTTCGCCTGGATCCGGGAGTCTACCGCGGCAAGAATCACAAAATATGCAGCGCCCACTTTATAAAGGAAGCCCTGGGTCTGCGTAAACTGTCGCCTGGAGCCGTGCCCACTCTTCACCTGGGCCACAATGACACCTTTAACATCTACGAGAACGAATTGTGGCCACCGCCAACACCAACACCCTCCTCTTGCCATctccaacagcaacagcagtcatCCCTGCATTCGTTGCAACAGCAGATGCACAGCAAGTCCTACCAGCGTCGTTCAGTGGCATCCACCTCGTCGTCGGCGAGCTCGGCAGCCTCGCATTACGTGGACCCGGAGATAAGCGCCTCTTACCATCTGGCCATGTCCGCCTCTGCCGGTGGCTCTGCGGCCATAAACGCCAGCGACAGCATGGATGTCTGTTGCGTGCCCAGTTGCGAGAGCAAGCGGCACAATAGCGAGAACATTACATTCCACACGATTCCGCGACGGCCTGAGCAGATGCGCAAGTGGTGCCACAATCTGAAGATTGCCGAGGACAAGATGCACAAGGGAATGCGAATCTGTAGTCTTCACTTCGAGCCCTACTGCATCGGCGGCTGCATGCGACCGTTTGCTGTGCCCACTCTTCACTTGGgtcacgacgacgacgacatccACCGCAATCCCGACGTGATCAAGAAGCTGAACATTCGGGAGACATGCTGCGTGGCTGTGTGCAAGCGGAACAGGGACAGGGACCATGCCAACCTGCATCGTTTCCCCAGCAACGTGGCGTTACTGAAGAAATGGTGCGCCAATCTGCAGCGCAGCGTGCCCGATGGCAGTAAGCTCTTCAATGATGCCATCTGTGAGGTGCACTTCGAGGATCGCTGCCTGCGCAACAAGAGGCTGGAGAAGTGGGCAGTGCCCACTCTGATCCTGGGACACGAGGATATCGCTTATCCGCTTCCTACGCCGGAGCAAGTGACCGAGTTCTATGCACGGCCCACGGCTCCCAACAATGGCGAGGAGCAGGGCGAGTG
CTGTGTGGAGACCTGCAAGAGGAATCCCAGCGTGGATGACATCAAGCTATACCGCCCGCCGGAGGAGGCCGCCGTGCTGGCCAAATGGGCGCACAACCTGCAAACGGAGGCCAACCAACTGACAAGCCTGAGGATCTGCAATCTTCACTTCGAGGCGCATTGCATCGGCAAGAGGATGCGGCAGTGGGCCATTCCGACATTGAATCTAGCCGGCAACATCGAGAATCTGTACGAGAATCCAGAGCAATCGATGCTGTACAGGCGACGGACGACTCACTTGAAGGCGAAGCTGTCGCCAGCTTCTGCAAAGCCCACCTGGGTGCCCAGATGCTGTCTGCCACACTGTCGCAAAGTCCGAGCCCTGCACAATGTCCAGCTGTATCGCTTCCCCAAGCTCAATCGCTCCACATTGGCCAAGTGGGCGCATAATCTCCAGGTTCCGATGGTAGGCAGTGCCCAGCGCCGGCTGTGCTCGGCCCACTTCGAGCCGCATGTGCTGAGCAAAAAGTGCCCGGTGCCGCTGGCGGTGCCAACGCTCGACCTGAACTCACCGCCCGGCTTGAAGATCTACCAGAACCCGGCGAAGCTAAAGGCCAGCAAACTGTGCCTGCAGCGGGTTTGCATCGTCGAGAGCTGCCGCAAGACGCGGGCGCAGGGCGTGCAGCTTTTCCGGCTGCCGCACAGTCCCACCCAGCTGCGGAAGTGGATGCACAACATAAGGACGCGGCCACGAGCAGCTATGCGGTCTCAGTACCGGGTTTGTTCCCGCCACTTTGAGACGCACTCGTTCAATGGTCGAAGACTGAGTGCGGGTGCCATTCCGACTCTGGAACTGGGCCACGATGACGACGATATCTTTCCCAATGAAGCGCAGGCCTTTGCGGACGAGCATTGTGCTGTCGAGGGCTGCGAGGCATCCAAGGAGCAGCCGGAGGTACGGCTGTTCCGCTTCCccaccgacgacgacgatatGTTGTGGAAGTGGTGCAACAACCTGAAAATGAATCCCGTGGACTGCATTGGGGTGCGCATCTGCAACAAGCACTTCGAGGCCGATTGCATTGGACCCAAGCACCTGTACAAGTGGGCCATTCCCACACAGGAGCTGGGCCACGACGATGCGCAGATCGAGCTGATCCCGAATCCCAAGCCAGAGGATAGGTATGTGGATCCAGTGTTCAAGTGCATCGTGCCTACCTGCGGCAAGACACGACGCTTTGACGAGGTGCAAATGAACAGCTTCCCCAAGGACCCGGATCTATTTCAGCGTTGGCGGCACAACCTCCGCCTGGAACACCTCAGTTTCCAGGAGCGTGAGCGCTACAAGATCTGCAACGCTCACTTCGAGGATATCTGTATTGGGAAGACGCGGCTAAACATCGGATCGGTGCCCACCTTGGAGCTGGGTCATGACGATGAGGATGATATTTTCCAGGTGAATCCAGCGGAGCTGCAGAGCAATTTATTCGGGCGACAGCGTCGACTGCTGCACGAGGGATCCGGTGTCAAGCAGGAGCTGTCCGAGACAGAGGACAACCCCAAAGCGGATGTGACTGCCACAAGCTCCAATTCCAAGCAGACCAAGAACAAGAGAAATGCTTCCGATCTTAAGTGCTGCGTGCACAGCTGTGGAAGAAGTCGCTTGGAGCACGGGGCCCGGCTGTTTCCCTTTCCCACCggcaagcagcagcatctaAAGTGGCGTCACAACCTGCACCTGGAACCCGAGGAGGTGGACCGGTCGACGCGCGTCTGCAGCGCCCACTTCAATCGACGTTGCATCGAGGGCAAACAACTGAGAAGCTGGGCGATGCCCACCCAACAGTTGGGACACAACGACCAACCGATCTACGAGAACCCAAAGAATATACCGGGATTCTTCACCCCCACCTGTGCCCTGGGACATTGTCGCAAGCGGCGGAGTATTGACAACGATCTGCGTACTTACCGGTATCCTAGGAGCGAGGATCTTCTGGAGAAATGGCGAGCCAATCTACGGC
TGTCTCCTGATCAGTGTCGCGGTCGGATCTGTGCCAATCACTTCGAACCGCAGGTGCGGGGTAAGCTAAAGTTGAAGACGGGAGCCGTGCCCACACTGCAACTGGGACACGATGAGGGATTAATCTATGACAATGAAGCTATTAAGGCGGGCATGACCGAAGAAGAGGAGGGCATAACGCCAGAGTTCACGCGACTGAAACCAAAGAAAGAGGTGTTcgaagaggaggagggggaggaggaggaggaggacgaaggGAATGATGGCGAGCAGCAGAACCCAGATGACCTAGAGGATCATCCAGGCGAGGACAAAGATGATCACTATTTTGATCCTCTTGAACTGGTAGAGACTTTTGCTGAACATCGCAGCGATGACGAGGCCCAAGACTATGAGGATGCAGAAGAAGAGGACTCCCCCTCCAGGTACGATGTCAAGGAGGAAATAGAACAGCTTCCAAGCCACCTACCACCTTCCCCTTTACCGCGACGGCACCAAGCTCCGCGACGAGACAAGCCGGCTAATAATGTGACACCCATTTGCTGCCTGAAGCACTGCAGGAAGGAACGCACTGCCTTCCACCTGCTGAGCACTTTTGGCTTTCCAAAGGATCGCCAATTGCTGCTGAAGTGGTGTGTCAATCTGCATCTAAACCCGGACGACTGTGTGGGTAGGGTTTGCATCGAGCACTTCCAGCCGGAGGTACTCGGCACGCGTAAGCTCAAGCAGAATGCGGTGCCCACTATCAATGTGGGACATGAGGAACCGCTTAGGTATTCCTGCCATGGCGTGGAGCAGAATCTCGTTGAGCAGGATCCCCAGCCGCAGCATTCGGTTTTTCGGCTTTGGAGCCTGAAACACTGCCGCAAAAGGAAGCTAACGGAGCCGCCGGATATTCCCCTGGCCAAGAGGAAAGCGCTGGAGATGCCGATGATGAAGCGGGAATGGGAGATGGAGTTGAaggtggagatggagatggagatgcaGATGGAGCAACAGAAGGAAGCAAAGAAGATGACTCAAACTGAAAGTGAATCACCAATATGCTGCATCAGCAGTTGCGGAAACCAAGAAGTTAGCCAATTGCTGCATTTTCCCGAGGAAAAGACCCTGCTGAGAAAGTGGAGCCACAATTTAAGGCTGCCCACCACATCTCTAAGCCTGAAAAGAGTTTGTTTGGCGCACTTTGAATCGCAGCTTTTGGAGAATGGTAAGCTCTCTAAAGAAGCAGAGGCAGGGGCTGTGCCCACCTTGAAACTGGGCCACAGCAGCTGGAATCTATACAGAAGCAATGGGATCTGTCTAGTGCCTGACTGCACCCACAATACCTTCGGACGCTTAACCTTCATCGATCTGCCGGATAACAGTATTATCAGGGAAGCCTGCTTCTCCTGCTTCAACCTACCTGAATCCAGCGGGGAGCAGGCCAGACTATGTGCTAACCACTTTATAGAGGCTTACAAAAAGTTCCATCTTCCTTATGTTCTACATCCCCAAGTCTTAGTGGAGCTACAAAAGGCTAAGGACGAACTGCAATGCGCAGTGCCCAATTGCTGTTTCGAAGAAGCTGCTCCGGACGTTTACTTAATCCAGTTTCCCGATAACAAGGAGATACTGTTGAAGTGGTTACACAACACCAAGATCTCCTACGATTCTGAGAGGCACCAAAGCTATCGCATTTGCACACTCCACTTTGAATCCGAGTACTTGGATACGAATGGCCTGGTAAAAGGAGCTATACCCACGCTCCATCTGAACCATAAAGATGAGATTCACTTAAATGATAGGCCTCTGCCAGAGGATCAGATCTCCATGTTGACTCCCCTGCGTATTAAGACGGACCTCGCCTTCTTGGGCAGTCCCTGTGCAAGTGCAAGCCCCAGTCCCCGGGGCAGGATCCGGATATGCTGCATTCCCTCTTGTGGACAGTTTGGAAGCAGTCAAGTGAGGCTCTTTCGCTTTCCCACCGAGGAGCAGGCGCTGCTTCGGTGGCTGGTGAAC
ACCCAACAGCAGCCACGACTGGTTGATCCCATGGACTTGTACGTGTGCCAGTCGCATTTTGAGCCCGAGGCCATTTGCATGAAGCAACTTCGAAGCTGGGCTGAGCCCACCTTGAACCTGGGACACGAGGGTCATGTAATTCCGAATGCCAAGCACAATGGCAATATTTCCGACGGCCAGGAAACAGAGCAAGCAATGCGATTTATCCGCGAACGCTTCTGCTCTGTCATTTCTTGCTTTCAGGAAGGTGgacaggaggaggatgaggggGGAGTGAGGCTATTTGATTATCCCGAGGACATGGCGACTACTCGAAAGTGGGCAGCCGCATGCAGACATCGTTCCATGCAGGCCAGGAGCCATGGGTTCAAGGTGTGCCAGTTCCACTTTGCCAGGGAGTGCTTTGACCCCAAAACTGGAGAATTGATGGAGGGCGCGGTGCCTACCCTCGAATTGAGCAGAGATGAAATGGAGAGGCAGTGTCTTGTGACGGGATGTGTAAAGAATGAAGCCAACGGAAGCCGTCTCCGCTACTTTAAGATACCAAAAATTGCTGCTCAGCTGGAGGCGTGGAGCAACAACCTTAAAGTCCATCCAACGGATCTCATGCAGGGGGAGCAGCAGTACATCTGCGAGAAGCACTTTGAGGCGTTTTGCTTTGGGGCCAACAAGGGACTGCGTTCTGGTGCACTTCCCACGCTCCTGCTGGGCCATGACGACGAGGTGGATATGCTTCCAAATCCGGAAAGTCTCTTGTGCCACAATAAGACGGATAAGTGCTGCGTGCCAGGCTGCGGGAATGTCTGGCAAACTGGGGACCGTAAGTTTCGTGGATTTCCTAAATTGTTGGCCATGGCCAACAAATGGAGGCATAACCTTCGTCTGGAAACTCCCGTGGAGCAGCTCGGCAAGCTGAGGGTCTGCAATGCTCACTTTGAGACCGCCTCCAACCTGGGTACAAATGGAATGAGCTCAAGTACCTCGATACCAACCTTGGAATTGGGCCACTCTTCTCCGGATATTTACCCAGCGGACTCGAGCTTAAAGTTACTAAAGCGTTCCGGAATGACGGcgaaaatttattgttgttatccCAAGTGCGAGGAAACCTGTTTGCCCAAGAATCTTTCTTACGTGCTGCCCATGGAGGAGCATCTGCGGAATGCCTGGCTAAGCCATATGGACATAGAAGACCCGAAAGAGGAAGGAGTCGCACTGCTGTGCCCGCTGCACTATGTCATCCTCTACCAGTATAGTGCCACACTTTATCCGGAGCTTCATGCTTCAAGCCGAATGTTTCTTGAGGACAATTACAAGGATGCGCGGAGCAACAGACGCGTGAAGATTGTGAGTTGTGCGATCAAGGGGTGCGACACGGTTAAGCCCCGGGACGGGGTGCCGTTGCACGGGATGCCGCAAAGCAAAGACATTCTGCAGATGTGGATAGCCAATGGCCAGTTTGAGTTCTTGGAGCAGCAGCGGTACATGTTCAAGGTGTGTCACAATCACTTTGAGCCATGCTGCTTCTTCGACGACAGGCGCCTGCACTCATGGAGCGTGCCTACCCTGCGCCTGCCTGGCGAGGCAATTCACCAAAATCCAACCGCCGAGCAGTGGCAGGACATGATCAAcaagcaagcagcagcagcaaaagcagacgCAGAGGAGAACGAGGAGTCGGACTCATATGGGGAGGTGGTTAAGTCCGAGGAGTCGGACTCATATGGGGATGTGGTTAAGACGGAACCCATTTTAAAGATGGAGCGGATGGAATCGGACTATGAAGATGAAAACCCGGAGATGCAGGCCCTAGAAGTACTCCTAGAAGTAGGCCATGTCGAGCGAATGGAGAGCTATGAGAAAGTGGATAAATCACCGGTGATCTATGCCGACAATGCACCCTTCCGATCTTCACCCATACGTTGCCCATACAATGCCAATCACTGTGCTGTGGAGGGATGCCAGGTGACTGTCGAGGATGTGGATGGCACGATTAAGCT
GCACAAATTCCCAGCATCGCAGGACGCCGCCCAAAAGTGGATGCACAACACCCAAGTTGACATGGACGAAAAGTTCTGGTGGCGCTACCGCATATGCAGTTATCACTTCGATCAAGAGTGTTTTCAGAGTGCTAGGATCCGAAAAGGTGCAATGCCCACGCTTTTGTTGGGACCTCGGCGACCGGACAAGGTGTACGACAATGAGTTTGCCCTACCAGATGGGGATGATCCATTTCCAGAGCCACCGGAGGTTCAGTTGGAGGAAAATACGACTGCGGCATCCAGACTTCAAAAGGAGGTGTCCCATTTATGTTTACCCCCACGGGCGCCGCCTCGAAAGTCAAGCAAGTTTTGCCAGATTGATTCTTGCCCGAACCACCTGACCACCGAGAACATGACCCTGCACAAGTTTCCCCACTCGGAGGACATGTGCCTCAAGTGGCAGCACAACACGCAAGTGCCGTTCGACCCCTACTACCGCTGGCGTTACCGCATCTGCAGTGCGCATTTCCATCCGGTGTGCTTGGTCAATATGCGTCTAGTCCACGGAAGCGTGCCCACCTTAAAGTTGGGCCCCAAGGCTCCTTCTGAGCTGTTTGACAACGACTTCGAAGCCATTAACCTAAGGCTGGATAAAAGGTTGACCGAGTCAAATGCCAACGTGTATATCAAGCATGAGAGGagggaggaggatgaggattCTGTGTTCCTGGAGCCCGAGCTCCAGTTACACGAGGATCAAGACGATAAGATATCAAGCTGGAACAGCAAACTGCAATTAGCACCTGTGAAGCAGGAGAAAGTTGTCTACAGCCAGATCAAGTCTGGCTACGATAAGTGCTCGCTGGCTCACTGCCAGCGCCAAAGGTCCCAGCATGGCGTCCACATTTACAAGTTTCCCAAATCGAGGCGGCAACAGGAGCGTTGGATGCACAACCTCCGCATCCGCTATGATGAACGGACACCGTGGAAATTCATGATATGCAGCGTTCATTTCGAGCCGCATTGCGTCAGTCTAAGGAAGCTGCGACCCTGGGCGGTGCCCACACTGGAACTTGGCGACAATGTGCCAGAAACCATCTTTACGAACGAACAGtgcgaggaggagcaggtgacCGATCGCAGTGAGCTGGATAGCGACGCGGAGGAGGAAGAAGATGGCTTAcaggaggacgacgacgatgatgatgaggaagCAGAAGATGATATAAAGCCCGATGTAGTTGGCATCAAAAGGAGGCGACGTTTTAAGACAGATTCCGCCTGCCCTTCTAGCCAGATCCCGCCCTGGAAAGTCAAGCAATGCTGTCTCCCCTATTGTCGTGCCTTTCGAGGCGATGGCATCAAGCTGTTTCGACTTCCGAACAACAAAAACTCCATTAGCAACTGGGAGCGGGCCACCGGAATGGTATTTAAAGAGTCGCAGCGAAACACTCGTCTGATCTGCAGCCGTCACTTCGAGCCAGAGCTGATTGGAGTCAGGCGTCTAATGCGCAATGCCATTCCCACAAAGCATTTGAACCCTCAAGCAGCTGGCCAGATTCGTactaaaaaggaaaagaatcCTCCTGTCAAAATTACACCTATCTGCTGCATGGCAGACTGCCACTACAACGGAAATGTGAAGCTGCACAAGTTTCCAAGTGATCCCACTCTTCTTAGACAATGGTGCCAGGCTCTCCGTCTCACTGACACACAGCGGTATTTGGGCAAGCACATTTGCTCCATGCACCTGCCCATGAACAAGACGCTGAGCTGTGTCATCTGCGGTGGCGACAACGTAGAGTTGCCGATGCTTGGGTTCCCGGAGAATCGCAATCAGCGCGCCAAATGGTGTTACAATCTCAAAATAGAGACAATACCGAAGTGGGATCACTCAAAGCACATTTGCTGTCGTCACTTTGAGCCCCACTGCTTTGCCAAGCCGGGTGAGCTACGTCCAGGAGCGGCTCCCACGCTCCATCTCAATCACGATGATGCAAACATATTCTTCAGCGACT
ATGCCACTGGTCTTCCGTCCTCGCCACTAGGCAATCGAATCAAAGACGAGCCCCTGGAATCGGAGTCCGACGAAATGCTGCTGGTGTAG
Protein Sequence
PSAPLHKANPSNSPSLPATAALNSLLQQRQANADGAAIFAASSLKNEVPVKLERSYSNSTNESSYSVQESGYNNVYGSSDSSVHGAIAGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLNQWLHNLKMFHIPASSYVSFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPHCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPGVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPTPTPSSCHLQQQQQSSLHSLQQQMHSKSYQRRSVASTSSSASSAASHYVDPEISASYHLAMSASAGGSAAINASDSMDVCCVPSCESKRHNSENITFHTIPRRPEQMRKWCHNLKIAEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLHLGHDDDDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVALLKKWCANLQRSVPDGSKLFNDAICEVHFEDRCLRNKRLEKWAVPTLILGHEDIAYPLPTPEQVTEFYARPTAPNNGEEQGECCVETCKRNPSVDDIKLYRPPEEAAVLAKWAHNLQTEANQLTSLRICNLHFEAHCIGKRMRQWAIPTLNLAGNIENLYENPEQSMLYRRRTTHLKAKLSPASAKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPMVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTLDLNSPPGLKIYQNPAKLKASKLCLQRVCIVESCRKTRAQGVQLFRLPHSPTQLRKWMHNIRTRPRAAMRSQYRVCSRHFETHSFNGRRLSAGAIPTLELGHDDDDIFPNEAQAFADEHCAVEGCEASKEQPEVRLFRFPTDDDDMLWKWCNNLKMNPVDCIGVRICNKHFEADCIGPKHLYKWAIPTQELGHDDAQIELIPNPKPEDRYVDPVFKCIVPTCGKTRRFDEVQMNSFPKDPDLFQRWRHNLRLEHLSFQERERYKICNAHFEDICIGKTRLNIGSVPTLELGHDDEDDIFQVNPAELQSNLFGRQRRLLHEGSGVKQELSETEDNPKADVTATSSNSKQTKNKRNASDLKCCVHSCGRSRLEHGARLFPFPTGKQQHLKWRHNLHLEPEEVDRSTRVCSAHFNRRCIEGKQLRSWAMPTQQLGHNDQPIYENPKNIPGFFTPTCALGHCRKRRSIDNDLRTYRYPRSEDLLEKWRANLRLSPDQCRGRICANHFEPQVRGKLKLKTGAVPTLQLGHDEGLIYDNEAIKAGMTEEEEGITPEFTRLKPKKEVFEEEEGEEEEEDEGNDGEQQNPDDLEDHPGEDKDDHYFDPLELVETFAEHRSDDEAQDYEDAEEEDSPSRYDVKEEIEQLPSHLPPSPLPRRHQAPRRDKPANNVTPICCLKHCRKERTAFHLLSTFGFPKDRQLLLKWCVNLHLNPDDCVGRVCIEHFQPEVLGTRKLKQNAVPTINVGHEEPLRYSCHGVEQNLVEQDPQPQHSVFRLWSLKHCRKRKLTEPPDIPLAKRKALEMPMMKREWEMELKVEMEMEMQMEQQKEAKKMTQTESESPICCISSCGNQEVSQLLHFPEEKTLLRKWSHNLRLPTTSLSLKRVCLAHFESQLLENGKLSKEAEAGAVPTLKLGHSSWNLYRSNGICLVPDCTHNTFGRLTFIDLPDNSIIREACFSCFNLPESSGEQARLCANHFIEAYKKFHLPYVLHPQVLVELQKAKDELQCAVPNCCFEEAAPDVYLIQFPDNKEILLKWLHNTKISYDSERHQSYRICTLHFESEYLDTNGLVKGAIPTLHLNHKDEIHLNDRPLPEDQISMLTPLRIKTDLAFLGSPCASASPSPRGRIRICCIPSCGQFGSSQVRLFRFPTEEQALLRWLVN
TQQQPRLVDPMDLYVCQSHFEPEAICMKQLRSWAEPTLNLGHEGHVIPNAKHNGNISDGQETEQAMRFIRERFCSVISCFQEGGQEEDEGGVRLFDYPEDMATTRKWAAACRHRSMQARSHGFKVCQFHFARECFDPKTGELMEGAVPTLELSRDEMERQCLVTGCVKNEANGSRLRYFKIPKIAAQLEAWSNNLKVHPTDLMQGEQQYICEKHFEAFCFGANKGLRSGALPTLLLGHDDEVDMLPNPESLLCHNKTDKCCVPGCGNVWQTGDRKFRGFPKLLAMANKWRHNLRLETPVEQLGKLRVCNAHFETASNLGTNGMSSSTSIPTLELGHSSPDIYPADSSLKLLKRSGMTAKIYCCYPKCEETCLPKNLSYVLPMEEHLRNAWLSHMDIEDPKEEGVALLCPLHYVILYQYSATLYPELHASSRMFLEDNYKDARSNRRVKIVSCAIKGCDTVKPRDGVPLHGMPQSKDILQMWIANGQFEFLEQQRYMFKVCHNHFEPCCFFDDRRLHSWSVPTLRLPGEAIHQNPTAEQWQDMINKQAAAAKADAEENEESDSYGEVVKSEESDSYGDVVKTEPILKMERMESDYEDENPEMQALEVLLEVGHVERMESYEKVDKSPVIYADNAPFRSSPIRCPYNANHCAVEGCQVTVEDVDGTIKLHKFPASQDAAQKWMHNTQVDMDEKFWWRYRICSYHFDQECFQSARIRKGAMPTLLLGPRRPDKVYDNEFALPDGDDPFPEPPEVQLEENTTAASRLQKEVSHLCLPPRAPPRKSSKFCQIDSCPNHLTTENMTLHKFPHSEDMCLKWQHNTQVPFDPYYRWRYRICSAHFHPVCLVNMRLVHGSVPTLKLGPKAPSELFDNDFEAINLRLDKRLTESNANVYIKHERREEDEDSVFLEPELQLHEDQDDKISSWNSKLQLAPVKQEKVVYSQIKSGYDKCSLAHCQRQRSQHGVHIYKFPKSRRQQERWMHNLRIRYDERTPWKFMICSVHFEPHCVSLRKLRPWAVPTLELGDNVPETIFTNEQCEEEQVTDRSELDSDAEEEEDGLQEDDDDDDEEAEDDIKPDVVGIKRRRRFKTDSACPSSQIPPWKVKQCCLPYCRAFRGDGIKLFRLPNNKNSISNWERATGMVFKESQRNTRLICSRHFEPELIGVRRLMRNAIPTKHLNPQAAGQIRTKKEKNPPVKITPICCMADCHYNGNVKLHKFPSDPTLLRQWCQALRLTDTQRYLGKHICSMHLPMNKTLSCVICGGDNVELPMLGFPENRNQRAKWCYNLKIETIPKWDHSKHICCRHFEPHCFAKPGELRPGAAPTLHLNHDDANIFFSDYATGLPSSPLGNRIKDEPLESESDEMLLV
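The protein sequence can be checked against the coding sequence by translating it in frame. The sketch below assumes the standard genetic code and that the coding sequence starts in frame at its first base. The names `CODON_TABLE` and `translate` are hypothetical. Lowercase (soft-masked) bases and line breaks in the coding sequence are normalised before translation.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Whitespace is removed and lowercase bases are upper-cased first.
    Codons containing anything other than A, C, G, T become 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the coding sequence in this record.
print(translate("CCCAGTGCTCCACTCCATAAGGCGAACCCA"))
print(translate("CTGCTGGTGTAG"))
```

The two fragments translate to the first ten residues (PSAPLHKANP) and the last three residues (LLV) of the protein sequence shown above.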

Similar Transcription Factors

Clustered by sequence similarity using MMseqs2 at the identity thresholds listed below.

100% Identity
iTF_00525910;
90% Identity
iTF_00594581;
80% Identity
-