Basic Information

Gene Symbol
CG10628_1
Assembly
GCA_008042615.1
Location
VNJW01006692.1:15925-44868[+]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 23 5.8 1.5e+04 -3.0 1.9 43 60 643 661 633 678 0.57
2 23 2.4e-15 6.3e-12 46.3 4.0 1 86 838 910 838 911 0.85
3 23 5.7e-16 1.5e-12 48.3 5.3 1 87 1052 1122 1052 1122 0.82
4 23 2.7e-15 7.1e-12 46.2 3.7 1 86 1146 1217 1146 1218 0.82
5 23 4.2e-12 1.1e-08 35.9 0.5 1 87 1253 1321 1253 1321 0.80
6 23 7.3e-11 1.9e-07 31.9 1.4 1 86 1362 1431 1362 1432 0.76
7 23 3.5e-17 9.2e-14 52.2 0.4 1 86 1459 1528 1459 1529 0.82
8 23 1.6e-12 4.3e-09 37.2 1.9 1 85 1550 1618 1550 1620 0.80
9 23 7.5e-15 2e-11 44.7 0.6 1 86 1647 1718 1647 1719 0.85
10 23 1.9e-12 4.9e-09 37.1 3.5 1 85 1791 1859 1791 1861 0.81
11 23 1.5e-12 3.9e-09 37.4 0.1 1 86 1884 1952 1884 1953 0.83
12 23 8.4e-13 2.2e-09 38.2 2.5 1 87 2098 2167 2098 2167 0.81
13 23 1.8e-07 0.00048 21.1 0.0 23 86 2225 2278 2201 2279 0.70
14 23 1.8e-10 4.9e-07 30.7 0.3 1 61 2304 2359 2304 2385 0.78
15 23 2.7e-05 0.071 14.1 0.0 1 58 2392 2442 2392 2465 0.85
16 23 3.7e-11 9.8e-08 32.9 0.1 1 86 2482 2553 2482 2554 0.81
17 23 8 2.1e+04 -4.2 0.1 20 45 2560 2581 2559 2588 0.63
18 23 1.8e-15 4.9e-12 46.7 0.6 1 86 2666 2738 2666 2739 0.81
19 23 1.4e-12 3.7e-09 37.4 3.4 1 86 2803 2873 2803 2874 0.80
20 23 2.5e-13 6.7e-10 39.8 3.8 1 86 2966 3036 2966 3037 0.83
21 23 2.1e-11 5.5e-08 33.7 0.2 1 86 3115 3184 3115 3185 0.83
22 23 9.4e-10 2.5e-06 28.4 0.5 1 58 3210 3258 3205 3267 0.83
23 23 6.8e-11 1.8e-07 32.1 1.0 19 87 3277 3334 3266 3334 0.77

Sequence Information

Coding Sequence
ATGGTTCAGCTGTTTAAATTCCTGTTAAAGTCCACAAAGGAGCCCGCCCGAGCCCACTTTCGGCCCATCTTCTTGGACACTTTGCGTCTGGCAGTCCGAGGAGGACATGGGGGAAACGGGTTGCCCAAGTACGGCGGAGTCGGTGGTCAGGGGGGTTGCGTCTATTTAGTAGCCAAGGAGGGACTTACGCTGCGCAAAGTGTCCCAGGGATTGAGAGAAAAGCGTGTGGCCGCCTCCAGCGGAGAGGACAGCAGCAAGGCGAGCATCTTCGGGCGACGTGGAGTGGACCAGCGCATCGAGGTGCCTGTGGGCGTTCAAGTGTATGACGAACGAAACAAACTGATCGCTGATCTCAATGAAAACGATTCAACCTGCATAGTGGCGGGCGGAGGGACTGGCGGATGCACAGGAACTAACTATTTGGGACGTCCTGGAGAGAGTCGCACTGTGAATCTGGACTTGAAACTGATAGCGGACGTGGGACTCGTCGGGTTTCCCAATGCGGGCAAGAGCACGCTGCTCAAAGCAGTTTCAAATGCAAAGCCGAAGATTGCTGCTTATCCCtTTACCACTATCCGGCCGCAGATTGGAACCGTGGAGTATAGCGACTTGCGCTCTATCACCGTGGCAGATCTTCCTGGGCTCATCGAGGGTGCCCACGCCAACTTTGGCATGGGCCACAAGTTCCTCAAGCATATTGAGCGCACCCGTCTACTCCTCTTCATGGTGGACATTTTCGGCTTCCAACTAAGTCCTAAACACCCACATCGCGATTGCCTAGCTAACATATATGCCCTGAACAAGGAGTTGGAACTCTACGATCCATCTCTGCTGGAAAAGCCCAGCGTTCTGTTGCTAAATAAAATGGACAAAGACGGCGCCAGCGAGATATTCACAACATTGAAGCCGGTCATCACTGACTTGGCTAGCGGGCTGGTACAATGTCCCGAGGAACTGCGACCGAAGCAGGTGTTGAAGTTCGAGAGCGTCGTACCCATATCGGCAATTAATTCTACGAAAGTGACACAAGTAAAGTCGCAGCTGCGGCGGACGCTTTTACGGCTAGCGGAGAAGCAGTTCGTGGCGGATGAACAGCAAATTAaagagcagctgcagcagcgcgTGGGCCTGATCGCATCCTCCCAATCCCACAACCTCCCTCAGTCGGCCTTTGGCTCTGTGCCCCCTTGCAAGGCcagcatcagcaacaacaacgccaTTATAGGTGCCtacggaggaggagaaggagggaGTGTTGGGTCGCACGGATATTTTGGCGCCGCTGGCGGGGGCCTCAATGTCAGCGGGGCGGGGGGTGGTGCTGAGTCGACCTACGGCCTTGGGACCAACACTGTGGCATATGCTCACAACCAGCTGCTGCAGTaccagcatcatcatcagcaacaccagcagcagcagcacctggGTCTGAGCCAGCGATCCTATATGGGCCACGATGTCATGGCCGGGAGCTATCCCTATATCAAGAGTGAACCTTTGGAGGGCTTCCAGCAGCCGTCCAATCCAATGGCCCCACCCCCGGCCCCAGAAACGATAATAAAATCGGAACCCGTTGACGACCTTGCCTACAAGTCAAACTACATTGACGACAATACGCCATTTGCAGACTTTAGCAAGTTCAGCGAATTCAGCGAGGACATGCTGAGTCCCAAAGTTGAGCTGACAGTTAAGAATGAGTCCTTCGACAGGAATCCCAATAGCTTTTTACGCCGTAAGCAACAATCTGATCGGTCAACAACAGAGAGCCTGCCCGTCTGCCAGCGATGCAAGGAGGTGTTCTTCAAGAAGCAGACTTACCTTCGCCACGTCGCCGAGAGCAACTGCGGCATACAGGAGTACGACTTCAAGTGCACCTTATGTCCTATGTCCTTCATGACCGCCGAGGAGCTACAGCTACATAATCAACACCATCAAGAGGACAGGTTCTTTTGTCACAAGTACTGTGGTAAGCACTTTGGCACAATCGCAGAGTGCGAGGCGCACGAATACATGCAACACGAGTACGATAACATTGTTTGCAACATGTGTTCAGGAATGTTCGCCACGCTGGAACAACTGTATGCACACTTGCCACAGCACAAATTCCAGCAGCGCTTCGACTGCCCCGTATGCCGCTTGTGGTACCAAACGGCTCTTGAGCTGCACGAGCACCGCTTGGCTGCGCCTTACTTTTGCGTGCCAAACGCACCACCCCCTAGGACGAACTCATCCAACTCGCCTGCCTTGCCCGCGACTGCGGCACTTAATTCACTGTTGCAGCAGCGTCATGCAAATGCTAATAAAGCTGCCATTTTCGCCGCATCTACGCCCAAGAACGAGGTCACTGTGAAACTGGAGCGTAGCTACAGTAACTCGACTAACGATTCGTCTTATAGCGCTCAGGAGAGCGGCTACAACAATATGTATAGCGGCAGCGACACCTCAATTCACGGCGCCCTCGCCGGACCGCAGGCACATTCTTCGACACTGGACGACTCCGAGGATGCGCTGTGCTGTGTGCCGCTGTGTGGGGTGCGGAAGAGCACAAGTCCCACCTTGCAATTTTTCACGTTCCCAAAGGACGAGAAGTATCTCAACCAGTGGCTGCATAACCTCAAGATGTTTCACATACCCGCTTCTAGCTACGTTAGCTTCCGCATCTGTAGCATGCATTTCCCGAAGCGATGTATTAACCGCTACTCCCTGTGCTACTGGGCGGTGCCTACGTTTAACCTCGGCCACGATGACGTAGCCAATCTCTACCAGAATCGGGAACTGACCAACACCTTTACCACTGGCGAAGTGGCGCGCTGCAGCATGCCACATTGCACTAGCCAGCGAGGTGCCGTGCCCACGCTTCATCTAGGCCACAACGACACCTTCAACATCTACGAAAACGAGCTATGGCCCCCGCCGACACCAACACCCTCTACTAGTCatttgcaacagcaacagcagtcaTCACTTCActcacagcaacagcagatgCACAGCAAATCCTACCAGCGTCGTTCGGTGGCATCCACTTCGTCGTCAGCGAGTTCGGTAGCCTCTCATTACGTGGACCCGGAGATGAATGCCTCCTAtcatctagccatgtccgcctcCGCCAGTGGCTCTGCGACGACAAACGCCAGCGACAGCATGGATGTCTGCTGCGTGCCTAGTTGCGAAAGCAAGCGCCATAATAGCGAGAACATTACATTCCACACTATACCGCGACGGCCTGAGCAGATGCGCAAATGGTGTCACAACCTTAAGATTGCTGAGGACAAGATGCACAAGGGCATGCGAATCTGTAGCCTACACTTCGAGCCCTACTGCATCGGCGGCTGTATGCGGCCATTTGCGGTTCCCACTCTTCACCTGGGTCACGACGACGAAAACATCCACCGCAATCCGGATGTGATCAAGAAGCTAAACATTCGGGAGACATGCTGTGTTTCTGTGTGCAAGCGGAATAGGGATAGGGACCATGCTAATCTGCATCGTTTCCCCAGCAACGTGGCTCTACTTAGAAAATGGTGTGCCAATTTGCGGCGCACTGTCCCCGATGGCAGTAAACTTTTCAATGACGCTATCTGTGAGGTGCACTTTGAGGATCGCTGCCTACGAAACAAGAGGCTGGAAAAGTGGGCAGTGCCCACTCTGATCCTTGGACACGAAGACGTCGCCTACCCACTGCCCACACCGGAACAGGTGACCGAGTTCTATGCCCGGCCTACAGCTCCCAACAATGGCGAAGAACAGGGCGAGTGTTGTGTGGAAACCTGCAAGAGGAATCCTAGCGTGGACGATATTAAGCTATACCGCCCGCCGGAGGATGCCACCGTGCTGGCAAAATGGGCGCACAATCTGCAGACGGAGGCCAACCAGCTGACAGGCATGAGGATCTGTAACCTTCACTTCGAGGCGCATTGCATTGGCAAGAGGATGCGAATGTGGGCATTACCAACTTTGAATCTAGCTGGCAACATCGAGAATCTTTATGAAAATCCAGAGCAATCGCTGCTGTACAGGCGGCGGACTCATTTAAAGGCGAAGCTGCCGTCAATCTCCGCAAAGCCCACCTGGGTGCCCAGGTGCTGTCTTCCGCACTGTCGCAAGGTCAGAGCCCTGCACAACGTTCAGCTTTATCGCTTCCCCAAGCTCAATCGCTCCACATTGGCCAAATGGGCGCATAATCTCCAGGTTCCAATGGTGGGCAGTGCCCAGCGCAGGCTATGCTCAGCTCATTTCGAGCCGCATGTGCTTAGCAAAAAGTGTCCAGTGCCGCTGGCCGTGCCTACGCTTGACCTAAACTCACCACCCGGCTTGAAAATCTACCAGAATCCGGCCAAGCTAAAGGCCAGTAAACTGTGCCTGCAGCGGGTTTGCATCGTCGAGAGCTGCCGAAAGACGCGGGCGCAGGGCGTGCAACTATTTCGGCTGCCTCACAGTCCCACACAGCTGAGGAAATGGATGCACAACATAAGAACGCGACCACGGGCAGCTATGAGGTCTCAATACCGAGTCTGTTCCCGTCACTTTGAGACACACTCATTTAATGGCCGAAGACTAAGTGCAGGTGCTATTCCGACTTTGGAACTGGGTCACGATGGTGACGATATATATCCCAATGAAGCCCAGGCATTTGTGGAAGAACATTGTGCTGTCGAGGGCTGCGAAGCATCCAAGGAGCACCCGGAGGTCCGACTGTTCCGCTTCCCCACCGAAGACGACGATATGTTGTGGAAGTGGTGCAACAATCTAAAAATGAATCCTGTGGACTGCATTGGGGTACGTATCTGCAACAAGCACTTCGAGGCCGATTGTATAGGCCCCAAGCACCTGTATAAGTGGGCCATTCCCACAAATGAGCTAGGCCACGACGACGCACAGATCGAGCTGATCCCGAATCCAAAACTAGAGGAGAGGTATGTGGATCCAGTCTTTAAATGCATCGTACCCACCTGCGGCAAGACGCGACGCTTTGATGAGGTACAGATGAACAGCTTTCCCAAGGATCCGAATCTATTTCAGCGTTGGCGGCACAACCTGCGCTTGGAACATCTCAGTTTTCAAGAACGTGACCGCTACAAGATTTGTAACGCTCACTTTGAAGAGATTTGTATTGGGAAGACTCGGCTGAACATTGGATCCGTACCAACCCTGGAGCTTGGTCATGACGATGAGGATGATATTTTCCAAGTGAATCCTGCGGAACTGCAGAGCAACTTATTCGGCCGACAACGTCGACTGCTGCTTGAGGAGCCCGGCGATGTGAGTGCGAAACAAGGGCTGTGCGAAGATAACAGAAAAGCGGATCTAACGGCCACAGACTTCAATTCTAAACAGACTAATGCCAAGAGATCTATTTCGGATCTTAagtgttgtgtacacagctgtgGAAGAAGTCGTTTGGAGCACGGGGCACGGCTGTTTCCCTTTCCAACGGGCAAGCAGCAACACCTAAAGTGGCGACATAATCTGCACCTGGAACCAGAGGATGTGGACCGATCAACGCGAGTGTGCAGTGCTCACTTCAATCGGCGTTGCATCGAGGGCAAACAGCTGAGGAGCTGGGCGATGCCCACTCAACAATTGGGTCACCACGACCAGCCGATATACGAGAACCCAAAGAATATACCTGGGTTCTTCACACCTACCTGTGCCCTAGGACACTGTCGCAAGAGAAGGAGTATTGACAACGATCTGCGTACCTATCGATATCCAAGAAGCGAAGATCTGCTGGAAAAGTGGCGAGCTAATTTACAGCTGGCTCCGGATCAGTGCCGTGGTCGGATTTGTGCAAATCACTTCGAGCCGCAGGTGCGGGGCAAGCTAAAGTTAAAGACGGGAGCCGTGCCCACATTAGAACTGGGACACGATGAGGGAATAATCTATAATAATGAGGGTATTAAGGCGGGTATGATAGAAGAAGAGGAAAGCATCACCCCTGAGTTCCAGCGACTAATACCAAAAAAAGAGATGTTTGAAGAGGGGGAGGAGGGTGAAGAGAATGTTGGCGAGAAGCAGCATCCAGATTACCAAGACGAAAACGCAGATGAAGATGACAAAGATGACCACTACTTTGATCCTCTTGAACTGGTAGAGACTTTTGCTGAACATCGCAGCGATGATGACGCccaagatgatgatgatgaagaaaAGGGCCGAATCGTAGACTCCCCATCTGGTTACGAGGTTAAGAAGGAGATAGAACAGCTTCCAAGCAGCCCACCTTCACCTTTACCCCGACGGCACCCAGCTCTGCGTCGAGACAAGCCAGCTAACAATGTGACGCCCATTTGTTGTCTGAAACACTGCAGGAAGGAACGTACTGCCTTCCACCTGCTGAGCACTTTCGGCTTTCCAAAGGATCGCCAATTGATGCTGAAGTGGTGTGCTAATCTGCATCTAAACCCGGTTGACTGCATCGGTCGGGTTTGCATTGAACACTTTCAGCCGGAGGTACTCGGTACCCGTAAGCTTAAGCAGAATGCGGTGCCCACTGTTAATGTGGGACATGAGGAGCCGCTTAGGTACTCGTGTCATGGCGTGGACCAGAATctcgaggagcaggagccacagccacagcatTCGGTTTTTCGGCTTTGGAGCCTGAAACACTGCCGCAAAAGGAAGCTAACGGAGCCGCCGGATATTGCCCTGGCCAAGAGGAAACCGCTGATTCCAAAAACTGCTGCTCAATTGGAAGCGTGGAGCAATAACCTTAAGATCCATCCAACGGTTCTAATGCAAGGGGAAATGCAGTACATCTGCGAGAAACACTTTGAGACGTTCTGCATTGGACCCAACAAAGGACTGCGTTCTGGTGCTCTTCCAACTCTCTTGCTGGGTCATGATGAGGACGAAGAGTTGCTTCTAAATCCGGAAAGTCTCTTCTGCCACAGCAAGACGGATAAGTGCTGCGTTCCAGGATGCGGACGTTTCTGGCAGGCTGGCGATCAAAAGTTCCGTGAATTTCCTAAATTGTTGACCATGGCTAAAAAATGGAGGCATAACCTACGTTTGGTAGCGACCATAGAGGAACTGGGAAAGCTTAAGGTTTGCAGTACTCACTTTGAGAGTTCTTCGATACCCACCGTGGAATTGGGTCATTCTTCTCCGGGTATTTATCAAGCGGCCATGTGCTTGAAGTCTCAAAAGCGGTCCGTGATGGTCTTTTATTGCTGTTATCCCAAATGCGAGGAAATCAGTCTGCCCAAGAATCTGACTTATGGGCTTCCTCAAGAGGAGCATCTGCGAAATGCCTGGTTAAggcacatgaacatagaagATCCGAAAAATGGAGCAGTCGCACAGCTTTGCCCGCTGCACTATGTCATTCTCTATCAGCACAGTGCCGGAAACTATCCGGATTATCACGCTCTAAGCCGATTGCATCTTGATGATAATTATAAGGAAGCGCGGAACAACCGGCGCGTAAAGATTGTGAGCTGTGTGATCAAGGGTTGTGACATGGTTAAGCCCCGGGATGGGATACCACTGCACGGTATGCCGCAAAGCCAGGACATCCTGCAGATGTGGATAGATAATGGCCAGTTTGAGTTTTTAGAGCAGCAGCGGTACATGCTTAAGGTGTGTCACAATCACTTTGAGTCATGCTGTTTCCTTGACGATAGACGCTTGCTCTCATGGAGCGTGCCGACTCTGCGCCTTCCTGGCGAAGCATTTCATCAAAATCCTACCGCCGAACAGTGGCAGAACATGATGAACAATCAATCAGCAGCCAAAACAAATGCAATGGAGAAAGAGGAGTCAGATCTATATAGCGATGGGGATAAGATGGAGCCCACTTTAAAAATGGATCATATTAAATCCGAATATCAATATAAAAACTCGGAGATGCAGGCTCTAGAGGTCCTCCTAGAAGTTGGGCATGTGGAACGAATGGACAGCTATGAGAAAATGGATGAATCACCGGTAATCTATACCGAAAATGCACCCTTCCGATCGTCACCCATACGTTGCCAATACAATGCTTATCAGTGTGCCGTAGAAGGATGCCAGGTGACCGTCGAGGATGTGGACGGCACAATTAAGCTGCACAAATTCCCCGCGTCGCAGGAAGCCGCACAGAAGTGGAAACATAACACCCAAGTTGACATGGACGAAAAGTTCTGGTGGCGATACCGCATATGCAGTTACCACTTTGATCAAGAGTGCTTTCAGAGCGCTAGAATTCGAAAAGGCGCGATGCCCACGCTTTTGCTAGGACCTCGGCGACCGGATAAGGTGTACGATAATGAATTTGCTCAACCAGGAGCGGAAGACGCTTTTCTGGAGCCACCACCGGGAATTCAGCTGGAGGAAAGTTCGACTATGGCGTCTAGAGTTCGAAAGGAGGTCTCTAGTTTATGCCTGCCGCCGCGGGCACCGCCTCGAAAGTCGAGCAAGTTTTGCCAGATTTATTCTTGTACGAATCACCTGACAACTGAGAACATGACACTTCACAAGTTTCCCCACTCAGAGGACATGTGCCTTAAGTGGCAGCACAACACGCAAGTGCCATTCGATCCCTACTACCGCTGGCGTTACCGCATCTGCAGTGCTCACTTCCATCCGGTGTGTTTGGTCAACATGCGTCTAGTCCACGGAAGCGTTCCCACTTTAAAGCTAGGTCCCAAGGCTCCATCCGAACTGTTTGACAACGACTTTAATGCCATCAACCTAAGGTTGGACAAAAGGTTGACGGAGTCCAATGCCAATGTGTATATCAAGCATGAAAAGAGGGAAGAGGATGAAGGCTCGCTAATGTTGCTGGAGCCCGAGCTCCAGTTTCAAGAGGATCAAGACGATAGGATATCAGCAAGGAACAGCAAACTGCAATTGCGACCTATAAAGCTGGAGAAAAAAATCTATAGCCAGGTTAAGTCTGGCTCTGATAAGTGTTCGCTGGCTCACTGCCAACGCCAAAGGTTCCGACATGGCGTCCACATTTATAAGTTTCCCAGATCGAGGCGCCAACAGAAGCGTTGGATGCACAATCTCCGCATCCGCTATGATGAGCGCACACTGTGGAAATTAATGATCTGTAGCGTTCATTTTGAACCTCACTGCATCAGCTTACGGAAGCTGCAGCCTTGGGCGGTGCCCACACTGGAGCTGGGCGACAATGTGCCAGAAAAGATCTTTTCGAACGAACAGTGCGAGGAGGATTTGGTGACCGATCGCAGCGAGCTGGAGAGCGACGCCGAGAAGGAAGATGGCTTACAGgaggacgacgatgatgaaGACGAGGACTATCCGAAGCCAGATGTTGGGATGAAAAGGCAAAGACGTTTCAATATAGATTCCTCCTGCCCTACCCAGGCTCCACCCTGGAAAGTCAAACAATGCTGCCTCCCCTATTGCCGTGCCTTCCGAGGCGATGGCATCAAGCTTTTTCGGCTTCCAAGCAACCGAAACTCCATTAGCAACTGGGAGCTGGCCACAGGAATGGTATTCAAAGAGTCACAACGGAATACTCGTTTGATTTGCAGCCGTCACTTTGAGAAAGAGCTGATCGGAGTCAGGCGTCTAATGCGTAACGCCATTCCCACAAAGCACTTAAATCCCCTAGGAGCTGCCCTGATCCATtctaaaatggaaaaaaatccTCAAGCCTGTGTTTTTCCCATCTGCTGCATGGCGGACTGCCACTACGATGGAAATGTGAAGCTGCACAAGTTCCCAAGTGATCCCACACTGCTTAGGCAGTGGTGCCAGACTCTTCGGCTCACCGATACGCAGCGGTATTTGGGCAAACACATTTGTTCGATGCACCTACCGATGAACAAAACACTGAGCTGTATTATCTGCGGCGGAGACAACATAGAGTTGCCGATACTTGGGTTTCCGGAGAACCGCAACCAGCGCGCCAAATGGTGTTATAATCTCAAAATTGATGCAATACCAAAGTGGGACCACTCAAAGCACATATGCAGCCGGCACTTTGAGTCCCATTGCTTCGATAAGCCGGGTGAGCTACGTCCAGGAGCAGCTCCCACGCTCCATCTGAATCACGATGACACAAACATATTCTTCAGCGACTATTCCACTGGCATTCCATCATCGCCAATACGCAATCGCATTAAAGACGAGACGCTGGAATCGGAGTCTGATGAGATGCTGCTGGTGTAG
Protein Sequence
MVQLFKFLLKSTKEPARAHFRPIFLDTLRLAVRGGHGGNGLPKYGGVGGQGGCVYLVAKEGLTLRKVSQGLREKRVAASSGEDSSKASIFGRRGVDQRIEVPVGVQVYDERNKLIADLNENDSTCIVAGGGTGGCTGTNYLGRPGESRTVNLDLKLIADVGLVGFPNAGKSTLLKAVSNAKPKIAAYPFTTIRPQIGTVEYSDLRSITVADLPGLIEGAHANFGMGHKFLKHIERTRLLLFMVDIFGFQLSPKHPHRDCLANIYALNKELELYDPSLLEKPSVLLLNKMDKDGASEIFTTLKPVITDLASGLVQCPEELRPKQVLKFESVVPISAINSTKVTQVKSQLRRTLLRLAEKQFVADEQQIKEQLQQRVGLIASSQSHNLPQSAFGSVPPCKASISNNNAIIGAYGGGEGGSVGSHGYFGAAGGGLNVSGAGGGAESTYGLGTNTVAYAHNQLLQYQHHHQQHQQQQHLGLSQRSYMGHDVMAGSYPYIKSEPLEGFQQPSNPMAPPPAPETIIKSEPVDDLAYKSNYIDDNTPFADFSKFSEFSEDMLSPKVELTVKNESFDRNPNSFLRRKQQSDRSTTESLPVCQRCKEVFFKKQTYLRHVAESNCGIQEYDFKCTLCPMSFMTAEELQLHNQHHQEDRFFCHKYCGKHFGTIAECEAHEYMQHEYDNIVCNMCSGMFATLEQLYAHLPQHKFQQRFDCPVCRLWYQTALELHEHRLAAPYFCVPNAPPPRTNSSNSPALPATAALNSLLQQRHANANKAAIFAASTPKNEVTVKLERSYSNSTNDSSYSAQESGYNNMYSGSDTSIHGALAGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLNQWLHNLKMFHIPASSYVSFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPHCTSQRGAVPTLHLGHNDTFNIYENELWPPPTPTPSTSHLQQQQQSSLHSQQQQMHSKSYQRRSVASTSSSASSVASHYVDPEMNASYHLAMSASASGSATTNASDSMDVCCVPSCESKRHNSENITFHTIPRRPEQMRKWCHNLKIAEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLHLGHDDENIHRNPDVIKKLNIRETCCVSVCKRNRDRDHANLHRFPSNVALLRKWCANLRRTVPDGSKLFNDAICEVHFEDRCLRNKRLEKWAVPTLILGHEDVAYPLPTPEQVTEFYARPTAPNNGEEQGECCVETCKRNPSVDDIKLYRPPEDATVLAKWAHNLQTEANQLTGMRICNLHFEAHCIGKRMRMWALPTLNLAGNIENLYENPEQSLLYRRRTHLKAKLPSISAKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPMVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTLDLNSPPGLKIYQNPAKLKASKLCLQRVCIVESCRKTRAQGVQLFRLPHSPTQLRKWMHNIRTRPRAAMRSQYRVCSRHFETHSFNGRRLSAGAIPTLELGHDGDDIYPNEAQAFVEEHCAVEGCEASKEHPEVRLFRFPTEDDDMLWKWCNNLKMNPVDCIGVRICNKHFEADCIGPKHLYKWAIPTNELGHDDAQIELIPNPKLEERYVDPVFKCIVPTCGKTRRFDEVQMNSFPKDPNLFQRWRHNLRLEHLSFQERDRYKICNAHFEEICIGKTRLNIGSVPTLELGHDDEDDIFQVNPAELQSNLFGRQRRLLLEEPGDVSAKQGLCEDNRKADLTATDFNSKQTNAKRSISDLKCCVHSCGRSRLEHGARLFPFPTGKQQHLKWRHNLHLEPEDVDRSTRVCSAHFNRRCIEGKQLRSWAMPTQQLGHHDQPIYENPKNIPGFFTPTCALGHCRKRRSIDNDLRTYRYPRSEDLLEKWRANLQLAPDQCRGRICANHFEPQVRGKLKLKTGAVPTLELGHDEGIIYNNEGIKAGMIEEEESITPEFQRLIPKKEMFEEGEEGEENVGEKQHPDYQDENADEDDKDDHYFDPLELVETFAEHRSDDDAQDDDDEEKGRIVDSPSGYEVKKEIEQLPSSPPSPLPRRHPALRRDKPANNVTPICCLKHCRKERTAFHLLSTFGFPKDRQLMLKWCANLHLNPVDCIGRVCIEHFQPEVLGTRKLKQNAVPTVNVGHEEPLRYSCHGVDQNLEEQEPQPQHSVFRLWSLKHCRKRKLTEPPDIALAKRKPLIPKTAAQLEAWSNNLKIHPTVLMQGEMQYICEKHFETFCIGPNKGLRSGALPTLLLGHDEDEELLLNPESLFCHSKTDKCCVPGCGRFWQAGDQKFREFPKLLTMAKKWRHNLRLVATIEELGKLKVCSTHFESSSIPTVELGHSSPGIYQAAMCLKSQKRSVMVFYCCYPKCEEISLPKNLTYGLPQEEHLRNAWLRHMNIEDPKNGAVAQLCPLHYVILYQHSAGNYPDYHALSRLHLDDNYKEARNNRRVKIVSCVIKGCDMVKPRDGIPLHGMPQSQDILQMWIDNGQFEFLEQQRYMLKVCHNHFESCCFLDDRRLLSWSVPTLRLPGEAFHQNPTAEQWQNMMNNQSAAKTNAMEKEESDLYSDGDKMEPTLKMDHIKSEYQYKNSEMQALEVLLEVGHVERMDSYEKMDESPVIYTENAPFRSSPIRCQYNAYQCAVEGCQVTVEDVDGTIKLHKFPASQEAAQKWKHNTQVDMDEKFWWRYRICSYHFDQECFQSARIRKGAMPTLLLGPRRPDKVYDNEFAQPGAEDAFLEPPPGIQLEESSTMASRVRKEVSSLCLPPRAPPRKSSKFCQIYSCTNHLTTENMTLHKFPHSEDMCLKWQHNTQVPFDPYYRWRYRICSAHFHPVCLVNMRLVHGSVPTLKLGPKAPSELFDNDFNAINLRLDKRLTESNANVYIKHEKREEDEGSLMLLEPELQFQEDQDDRISARNSKLQLRPIKLEKKIYSQVKSGSDKCSLAHCQRQRFRHGVHIYKFPRSRRQQKRWMHNLRIRYDERTLWKLMICSVHFEPHCISLRKLQPWAVPTLELGDNVPEKIFSNEQCEEDLVTDRSELESDAEKEDGLQEDDDDEDEDYPKPDVGMKRQRRFNIDSSCPTQAPPWKVKQCCLPYCRAFRGDGIKLFRLPSNRNSISNWELATGMVFKESQRNTRLICSRHFEKELIGVRRLMRNAIPTKHLNPLGAALIHSKMEKNPQACVFPICCMADCHYDGNVKLHKFPSDPTLLRQWCQTLRLTDTQRYLGKHICSMHLPMNKTLSCIICGGDNIELPILGFPENRNQRAKWCYNLKIDAIPKWDHSKHICSRHFESHCFDKPGELRPGAAPTLHLNHDDTNIFFSDYSTGIPSSPIRNRIKDETLESESDEMLLV

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-