Basic Information

Gene Symbol
-
Assembly
GCA_037043425.1
Location
JBAMBM010000490.1:2704939-2722299[-]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) that probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include its location at the N-terminus of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably in worms and flies, but not in other eukaryotes or in any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 30 5.2 1.1e+04 -2.7 1.9 50 64 331 347 312 362 0.61
2 30 7.6 1.7e+04 -3.2 0.1 6 58 363 416 358 434 0.52
3 30 3.5e-15 7.8e-12 46.0 4.4 1 86 561 633 561 634 0.86
4 30 2.6e-15 5.7e-12 46.4 4.8 1 87 661 730 661 730 0.83
5 30 7.8e-16 1.7e-12 48.0 0.4 1 87 752 824 752 824 0.84
6 30 2.1e-15 4.8e-12 46.6 5.1 1 87 916 986 916 986 0.82
7 30 3.3e-15 7.3e-12 46.0 3.5 1 86 1010 1081 1010 1082 0.82
8 30 1.2e-12 2.8e-09 37.8 0.9 1 87 1117 1185 1117 1185 0.80
9 30 4.1e-11 9.1e-08 32.9 1.5 1 86 1228 1297 1228 1298 0.76
10 30 2.6e-16 5.9e-13 49.5 0.1 1 86 1325 1394 1325 1395 0.82
11 30 2.5e-12 5.6e-09 36.8 2.7 1 87 1416 1486 1416 1486 0.81
12 30 1.9e-14 4.3e-11 43.6 1.0 1 86 1513 1584 1513 1585 0.84
13 30 2.2e-14 4.8e-11 43.4 1.5 1 86 1661 1730 1661 1731 0.84
14 30 5.5e-13 1.2e-09 38.9 0.4 1 86 1754 1822 1754 1823 0.83
15 30 2.4e-14 5.4e-11 43.3 1.4 1 87 1937 2006 1937 2006 0.80
16 30 3.9e-13 8.7e-10 39.4 0.4 1 62 2063 2126 2063 2150 0.77
17 30 0.00095 2.1 9.3 0.1 1 58 2159 2210 2159 2233 0.78
18 30 8.3e-13 1.8e-09 38.3 0.8 1 86 2249 2318 2249 2319 0.85
19 30 1.5e-14 3.3e-11 43.9 0.3 1 87 2392 2462 2392 2462 0.82
20 30 1.5e-12 3.3e-09 37.5 1.2 1 86 2497 2568 2497 2569 0.80
21 30 3e-13 6.6e-10 39.8 0.8 1 87 2579 2655 2579 2655 0.78
22 30 1.9e-15 4.3e-12 46.8 0.2 1 86 2681 2754 2681 2755 0.81
23 30 3.3e-06 0.0072 17.2 0.1 1 58 2788 2842 2788 2863 0.82
24 30 1.5e-14 3.3e-11 44.0 0.5 1 87 2880 2952 2880 2952 0.81
25 30 1.4e-15 3.2e-12 47.2 0.7 1 87 3108 3182 3108 3182 0.85
26 30 3.6e-14 7.9e-11 42.7 3.5 1 87 3254 3325 3254 3325 0.83
27 30 3e-14 6.7e-11 42.9 5.1 1 86 3464 3534 3464 3535 0.85
28 30 9.4e-13 2.1e-09 38.2 0.2 1 86 3612 3681 3612 3682 0.84
29 30 7.7e-11 1.7e-07 32.0 4.0 1 58 3709 3759 3709 3764 0.87
30 30 6.9e-10 1.5e-06 29.0 0.4 19 87 3778 3835 3767 3835 0.72
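
Each row of the table above holds 13 whitespace-separated values: the domain number, the total number of domains, the conditional and independent E-values, the bit score and bias, the HMM, alignment and envelope coordinates, and the alignment accuracy. The following is a minimal sketch of how such rows could be parsed and filtered. The field names in `DomainHit` and the `I_EVALUE_CUTOFF` value are illustrative assumptions, not part of the record or of any HMMER API.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    """One row of the per-domain table (field names are our own labels)."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_row(line: str) -> DomainHit:
    """Split one table row into typed fields; expects exactly 13 columns."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]),
        float(fields[4]), float(fields[5]),
        *(int(x) for x in fields[6:12]),
        float(fields[12]),
    )


# Four rows copied from the table above (domains 1, 3, 17 and 23).
rows = """\
1 30 5.2 1.1e+04 -2.7 1.9 50 64 331 347 312 362 0.61
3 30 3.5e-15 7.8e-12 46.0 4.4 1 86 561 633 561 634 0.86
17 30 0.00095 2.1 9.3 0.1 1 58 2159 2210 2159 2233 0.78
23 30 3.3e-06 0.0072 17.2 0.1 1 58 2788 2842 2788 2863 0.82
"""

hits: List[DomainHit] = [parse_row(line) for line in rows.splitlines()]

# Assumed cutoff, chosen only for illustration; pick one suited to your analysis.
I_EVALUE_CUTOFF = 0.01
confident = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]

for h in confident:
    print(h.index, h.ali_from, h.ali_to, h.i_evalue)
```

Under this assumed cutoff, domains 1, 2 and 17 of the full table would be dropped because their independent E-values (1.1e+04, 1.7e+04 and 2.1) exceed 0.01, while the remaining 27 rows would be kept.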

Sequence Information

Coding Sequence
ATGTCACAACACAATCCGCATCACCACCCCCACCCCCATGCGCATTtacaatggcagcagcagcagcaacaacaacaccatcaacataaacaaatacaacagcagcagcacagttGGTACTCACATGTTGCTTCCTACCCACCGCATCACCATGCGTCCTTTACATCACCCTGtaagagcaacaacatcatGAATGCATATGGAGTGGGAGCAGGAGGTCCTACAGGCACGCATGCATATTACGGCAATGTCAATGCTGCGGGGGGTGGTAGTGGAGCTGGGAGTGGTAACATGTCAGGTGGAGGTAGTAACATGTCAGGTGGAGGCTATAACCTTGAGGCAAATAACGTTGCCTATGCCCACAACCAACTGCTGCAgtatcaacaacagcaacaacatcaacaacagctcAGTCAAGGTTCCTCTTATATGGCGCAAAGTCTAATGCACGGCCCATATCCCTATATCAAAAGCGAGCCCATGGAGCAAATGCagtcccagcagcagcagcaacattatCAACCGCCATCGAATCCTATGGCACCGCCGCCAGCGCCAACATCAACTCATCAGTCCGTGGATGATACGATAATAAAATCGGAACCCATTGACGAACATGTCTACAAATCCAACTATATCGATGATAATACACCTTTTGCTGATTTTAGTAAATTTCCTGAATTCGGAGACGACATGCTAAGTCCAAAAGTCGAGCTAACCGTCAAGGATGAGTCCTACGGCAATAATGTAAAGAACCCACTCAATTATCCACGGCGCAAGCTGCAAAATGATCGCTCTTCGGATAGCCTACCCATATGTCAGCGCTGTAAGGAAGTTTTCTTCAAgaagcaaatttatttgcgCCATGTGGCCGATAGCAGCTGCATTATCCAAGAATACGACTTCAAGTGCAGCATTTGTCCCATGTCGTTCATGGGCGCCGAGGAACTGCAGGATCATAAGCTTCTCCACCGAGCTGACAAGTTCTTCTGCCATAAGTACTGTGGAAAGCACTTTGATACCATCGCCGAGTGTGAGTCCCATGAGTACATGCAGCATGAATACGAGAATTTCGTCTGCAATATGTGCTCGAGCACGTTTACCACACGAGAACAGCTTTATGCCCATCTGCCACAGCATAAGTTTCAGCAGCGGTACGATTGTCCAATTTGCCGATTGTGGTATCAAACGGCTTTGGAACTGCACGAGCATCGGTCGTCAGCACCTTACTTCTGCGGCAAGTACTACAGGGGAACTGAATCCCAGTCAACATCGCAGCAGCGTCAACAGCACCAGCCGCAGGAGGAACAGGCCAACTACGAACTGCAGGACTGCCATATGGGTACCATTGAAAtgccATCAACGCACCACAAGTCGAATTCATCCGCCTCAGCCTTGCCCGCAACGGCAGCTCTCAATTCGTTGCTGCAACAACGTCAGGCGAATGCCGATGGTGCCGCGGCTTTATTTGGAGCCTCGTCTTCGTCCTCATCGTTGAAGGGCGAAGTTAATATAAAGCTGGAGAGAAGTTATAGCAACTCCACTAATGAGTCCACCTACAGCTTGCACGataacagctacaacaatgcTTACGGCAGCGACACCTCAATTCATGGAGGTGGCGCTATCGGTGGGCCACAAGCGCATTCCTCGACATTGGACGATTCTGAGGACGCATTATGCTGTGTGCCACTGTGTGGGGTGAGGAAGAGCAGTAGCCCCACTCTGCAGTTTTTTACGTTTCCTAAGGACGAAAAGTATTTGCATCAATGGCTGCACAACCTTAAGATGTTCCACATACCGGCCTCCAGTTATGCGAGTTTTCGCATTTGTAGTATGCACTTCCCGAAGCGCTGCATCAATCGCTACTCTCTCTGCTATTGGGCGGTGCCCACGTTTAATCTTGGCCACGACGATGTAGCCAATCTCTATCAGAACCGCGAGCTGACTAACACCTTTACCACTGGTGAGGTGGCTCGCTGCAGCATGCCCAACTGCAA
TAGTCAGCGGGGAGAAAGCAATTTGAAGTTTTACAACTTTCCCAAGGACATCAAAAGCTTGATTAAGTGGTGCCAAAATGCTCGTCTTCCGGTCCAAGCCAAGGAGCCACGTCACTTTTGCAGTCGCCACTTTGAAGAACGCTGCATTGGCAAGTTCAGGTTGAAGCCCTGGGCAGTACCCACCTTGCACCTGGGTGCTCAGTATGGCAAGATTCATGACAATCCCAAGAACCTCTACGTAGAGGAGAAGCGATGTTGCTTGAATTTCTGTCGTCGGAGCCGCTCAACGGACTTTAATATGTCACTATACCGATTTCCGAGGGACGAAGTGCTTCTGCGACGTTGGTGCTATAACCTTCGGCTGGATCCGTCCGTCTATCGGGGAAAGAATCATAAGATATGCAGTGCCCACTTCATTAAAGAGGCATTGGGACTTCGGAAGCTGTCACCTGGCGCTGTGCCCACGCTTCATCTTGGTCACAACGACACCTTCAACATATATGAGAACGAACTCtggccaccaccgccgcccagcaatcatcaccagcatcagcagatgCAACACCATTCGCTTCAGCATATTTCACATCACGGCAAATCCTATCAACGACACTCGGCTGCCTCTACTTCATCATCGGCCAGTTCCGCCTCGCATTATGTGGATGCAGAGCTGAGCGCCGCATATACCAACATGTCtgctggcggcggcggttCTTCCTTGAATGTGAGCGACAGTATGGATGTGTGCTGTGTGCCCAGTTGCGACAGCAAACGTCACAACAATGAGAGCATCACATTCCATACAATTCCCAGGAGGGCAGAGCAGATGAGCAAGTGGTGCCACAATCTGAAGATACCGGAGGAGAAGATGCATAAGGGAATGAGGATTTGCAGCCTGCACTTTGAGCCCTACTGTATTGGGGGCTGCATGCGGCCATTTGCAGTTCCCACGCTACATTTGGGTCATGACGACGAAGACATTCACCGAAACCCGGATGTGATCAAGAAGCTCAACATAAGGGAAACATGCTGTGTCGCTGTGTGCAAGCGGAACAGGGATCGGGATCATGCGAATCTGCACCGCTTCCCCAGCAATGCGACCCTGCTAACAAAATGGTGTGCCAATCTCCAGCGACCCGTCCCAGATGGCAGCAAACTCTTCAACGACGCCATATGCGAGGTGCACTTCGAGGATCGTTGTTTGCGTAACAAGCGACTGGAGAAGTGGGCAGTGCCTACCTTGGTTTTGGGTCACGACGATATCGCTTATCCTCTGCCCACGCCGGAGCAGGTGGCCGAGTTTTATGCCCGGCCCAGTGCCCCCAATAACGGCGAGGAGCAGGGCGAGTGTTGTGTGGATACCTGCAAGCGTAATCCTAGTGTGGATGACATTAAGCTGTATCGACCACCGGAGGAGGCTGAGGTGCTCGCCAAGTGGGCCCATAACCTGCAGCTGGAGGTAAGCCAGTTGCCGAGCCTCAAAATTTGCAATCTTCACTTTGAGTCCCATTGCATTGGCAAGCGGATGAGGCCTTGGGCCATACCCACTCTCAATCTCGCTGGCAACATTGAGAATCTTTACGAGAATCCTGAGCACACCATGCTGTACAAGCGTCGCACCCACGTCAAATCAGATGCGACCTCCGTTGGTTCCAGTTCAAAGCCCACTTGGGTGCCTCGGTGCTGCCTTCCTCACTGCCGCAAGGTGCGTGCCCTGCACAATGTTCAACTGTATCGCTTCCCCAAACTCAACCGCTCCACATTGGCCAAATGGGCGCATAACCTGCAGGTGCCTATGATGGGCAGTGCGCAGCGTCGTGTTTGCTCCGCTCACTTTGAACCGCACGTCCTCAGCAAGAAGTGCCCCGTGCCGCTTGCGGTGCCCACTCTGGATCTGAATAGCCAACCGGGCTATAAAATCTATCAGAATCCTGCCAAGCTGAAAGCCAATAAGCTGTGTTTGCAGCGCGTTTGCATCGTGGAGAGCTGCCGTCGACAGC
GGGCCCAAGGGGTGCAGCTGTTCCGACTTCCCCACAGTCCGACGCAGCTGCGTAAGTGGATGCACAACATCCGGATGCGCCCGAGAGGAGCGATGCGTGCCCAGTATCGAATCTGTTCCATGCACTTTGAGTCGCATTCGTTCAATGGCAAGCGGCTGAGCGCCGGAGCCATACCCACACTGGAGCTGGGCCATCAGGATGACGACATCTATCCAAACGAAGCTCAGGCCTTTGCCGATGAGCATTGTGCTGTGGAGCGATGTGACGCCTCCAAGGAGCAGGCCGAAGTGCGGCTTTTCCGCTTCCCCACGGATGACGACGATCTCCTATGGAAATGGTGCAACAATCTCAAAATGAATCCAGTAGACTGTGTCGGTGTTCGCATCTGCAACAAACACTTCGAACCAGACTGCATCGGGCCCAAGCACTTGTTTAAATGGGCAATTCCCACAATGCATTTGGGCCATGATGATGCTCAGATCGAGCTCATTAGTAACCCTAAGCCCGAGGAGCGCTATGTGGATCCCGTGTTTAAGTGTTGCGTTCCCACCTGCGGCAAGACCCGCAAGTTTGACGAGGTTCAAATGAACAGCTTTCCCAAGGACCCCATACTCTTTCAGCGGTGGCGCCACAATCTCCGACTCGAACATCTCGACTTTAAGAGGCGAGAGCTCTACAAGATTTGCAATGCTCACTTTGAGGACATTTGCATTGGTAAAACTCGCTTAAATATTGGCTCCATACCGACTCTAGAATTGGGCCATGATGAGACGAACGACTTGTTCCAAGTTAATCCGGCCGAATTGCAAAGCAACCTGTTTGGACGCCAACGACGACTGCTCGATTCACCGACGGATGGCAGCTTTAAAGAGGAGCTGGACGAGGATAGGGACACCAAGTCGATTGTAACGACAATGGCAGAGGCTACCGATTTGAACACCTCGCAGgtaaaaattaagaaatccCTTGCTGATTTGAAGTGTTGCGTGTCCAGTTGTGGACGCAGCCGATTGGAGCATGGAGCCCGTTTATTTGCTTTCCCCAatggcaagcagcagcaaaataaGTGGCGCCATAATTTGCGAATGGAGCCCGGTGAAGTCGATAAAATGACACGGATATGCAGTGCCCACTTCAATAGGCGCTGTGTTGATGGCAAGCAGCTACGGAGCTGGGCCATGCCCACCCTGCAGTTGGGTCATCAGGAGCAGCCCATCTACGAGAATCCGAAAAATATTCCTGGCTTCTTTACACCAACTTGCGCCCTGAGCCATTGCCGAAAAAGACGCAGCATTGACAATGATTTACGCACCTATCGATATCCTAGGTGCGAGGAGCTGTTGGAAAAGTGGCGAGCCAATTTAAGATTGGCCCCAGATCAGTGTCGTGGCCGCATTTGTGCCGATCACTTTGAGCCGCAGGTTAGGGGGAAGCTGAAATTGAAGACTGGAGCTGTGCCTACTCTGAAATTAGGTCACGATGAGGGAGTGGTTTACGATAACGAGGCTATTAAGGTGGATGAGGATGCCAGCATAGACTCGCCTTTAATTAAACCCAAGCCTGAGCCGAAAGAAGAGGATGAAGGGCGTAGGAGTccgcaggaggaggaggaagatgACCTGGATCATAGTTACTTCGATCCCCTCGAGTTGGTGGAAACATATGCCGAGCACAACAGCGATAATGACGATGATCCCGATGAATTAGAACCGGAGCAATTACTGCCAGACACGCCCCCACCTATTCTGCGTCCGGCTCCAATTCCGACTCGCCCTCGAGAAAAGCCACCCAACAATGTGACTCCTATTTGCTGCCTGAAACACTGCCGCAAGGAACGCACCCCTAGCTATCAGCTGAGCACCTTTGGCTTTCCAAAGGATCAACAGTTGCTACTTAAATGGTGTGCGAATCTTCACTTGGAGCCTTCTGATTGCATTGGTCGCGTCTGCATTGAGCATTTTGAGGCAGAGGTGCTGGGAAGTCGCAAGCTGAAGCAGAAT
GCAGTGCCCACTTTGAACGTGGGTCATGCCAGCCCTCTGCGCTACACCTACAATGGCCAGGAACGAGCCATCCATGATGAGAAGCCTCAGCACTCGGTTTTTCGGCTTTTCAGCCTGAAACACTGCCGCAAAAGGAAGCTAACGGAGCCGCCGGACATGAGCGAGGCAGTGAGTAAGCCTAAGCGCTGTTGCCTCCCCTCATGCGGCAAGCAAGAGGATGTAGATGGAGTGCAGCTCCATCAGCTACCCAACGATCATTCGCTGCGTCGCAAGTGGCTGCAAAATCTTAAGCTATCTCCGCACTATGAGGTGGAATTGACCAGCAATGAATATGTTTGCGTGGAACATTTTGAGACGCACCTGTTTCAGGATGACGGAGAGCCAATGGGACATGCCAATTCCACTGTAAAACAAGAGCATCCCGATAGGAATATTTTCCGTAATATTATTAGCTCCCCAACCATCTCCAACTCCTGCCTGGTGCCCAGCTGCCCATGTGCGCGGCTCAATCTCTATCGGTGCTTTGACCTGCCCGCCAATGCCGAAGTGCAAAAGGCTTGGCTGCATTGGCTCCAGCTTGATGAAGATGAGGCGACCATCTGTCAAGGGCAGCTTTGTGTGATGCACTATATGCAGCTCTATGAGGAAGTGCCCATGCCAGACACAGTCTCATCCGAGGAGCTAGAACGAATGCAAGCAACCTATGGGCAGATTGCTAACTCGTCAATGGCCCTAAAGCTTCGCTGTGCAGTTCCCGGCTGCCGTTCCAAGTACACGGACAATATTCGACTGACCAAGCTGCCCCAATGTCAAGATACATACTCTAAGTGGTTACACAACACTAAGATACCTTATGATCCGTCCAGGCATTACATTTACCGCATATGTATGCTCCATTTCGAGCCACACTGTCTGGGCCCTGTGCGTCCCAAGGCGTGGGCTGTTCCAACTTTGCGATTGAATCATAAGGACGCCATATATTCGAATCCCAAGCTGGAGCAGGATGAAACGTCAActgcagcatcatcatctacATTCAGACCAATCCTCAGTTCCACAGATGCTCTAAAGATCAAGACAGAGCTGGGACCACTGCGAATAAAGACGGAACTGCCCCTGTTGAGTAGTCCGAGCATCAGTGCCAGCCCTAGTCCACGTGGCAAACTGAAGTTCTGCTGCATCCCCAGCTGTGGCCAGCAGGCCAACGCCCTGACACGCCTTTATCGCTTTCCCATCGCAGAGCCGGCACTCCTCAAGTGGCTGGTCAACACCCAGCAGCAACCGCGCTACATGGATGCGGTGAATCTGTACGTTTGCCAGGCACACTTCGAGCCCGACGCCATCTGCAAAAAGCAGCTCCGCAGTTGGGCTGTTCCAACTCTAAATCTCGGCCACGATGGCCATGTTATACCCAATGCCAGGCACAATGGGAACATCGCCGACAGCCAGGAGAACGAGCAGGCCATGGAGTTCATCAGGGAAAACTACTGCTCGGTGCTGACCTGCTTCCAGCCACGCAGTGAAGAGCTCCGTCTGTATGCGTATCCAAAAGATATGCCCACCATTCGGAAATGGGCAGCCAACTGCAAACATCGATCCATGCAGGCCAGCAGCCATGGCTTCCAGGTATGCCAATCGCACTTCACTACTGATTGCTTTAATCCGGACACCGGAGAATTGCACGAAGACGCTGTACCCACTTTGGAGCTAGCTAGGAGCGGCACCGAACTGCGCTGTGTGGTGCCGGGTTGTATGAAGGACGAATCCTCGGCTACGCCTTCAGGGCGTCGACGTTTTCAGAAGATACCCAAGCGGACTGCGCAGCTCAGCGAGTGGTGCCATAATCTTCGTCTGGATGCGGCCACACTGGCCGTGGGTGATCATCACATATGCGAGCGTCATTTTGAGGCAAACTGCTTCAATGCGTATAAACAACTGCGTCCTGGAGCTCGACCGACCCTCCATTTGGGTCACGACGATATTGATGATTTGCTGCCTAA
TCCAGCGGTTTTCAGCGTGGATGAAACCGCCGAGGACGTGTGCTGTGTTCCTGGATGCGGTCGCTCGAATCAGCAGGCAGATGAAGACCACATCCAATTCAGTAGCTTTCCCAGGATGCGTGTACTGGCCGAGAAGTGGATGTATAATATCCGATTGGAGCCGGACAAAGAGCAGTTCGGCAAGCTGAAGGTTTGCAGCGTACACTTCGAATCTAGTTGCTATGAAAATGGGAAGATGCGAATGGGATCGATGCCCACGCTTATGCTGGGTCACGATAATCCCGATATCTATCAAACGGATGAGCAATCCTTTGCTAAGAATCGGAAGCACCCCAAAAAGTCCAGCGGCTCGGGTATGGATTGCTGCTTCCCGCACTGCGTGGAACTTCAGAAAAACTACCAGCGCATGGTTTACGACTTGCCGCTGACGGAGGAACTGCGTCAAGCTTGGCTAAACCACCTGGATGTGGCAGAGGAGAATTTTGGTTTAGAGGAGCAGTCGCTGAAACTCTGCCCACTGCACTTTATCATGCTTTACGATCACAGCGTTTCCCACTTCCCAGAGCATGCCGCGGAGGAGCTCATGGATAGAAACTACGACGATGCCAGAAAAAGTGTGCGAATACGCCTCATCAGTTGTGCCGTTAGGAGCTGTAGTACCCTGCAGCCAAGGGATGGCGGCCGCTTGCATGGTCTACCGCAGCGCAGGGACCTGCTCCAAATGTGGTTGCACAATACCCAGTTGGAGTTCTCTGAGCAACAGCGATATATGCTGAAGATTTGCAGCAAGCACTTCGAGAGCAGATGTTTCACAGATATCCGGAAGCTAAAACCCTGGAGCATTCCGACCCTCCACCTGCCAGATCTGGAAGAAACCGATGTTAATGCATTCGTAGCTTATCAAAATCCCACAGAGGTGGAATGGcaagaaatgaatgaaaagatGGCCAAGGAATCGAATGCAGTGAATGTAAAACCAGAGCAGGAAGATAATTCTCTGCTGAAGCCAACTGTGCTCATAGAGCATATGGATGATAACGAACAGGAGGAGCTTCCGTTTCCGTCGGTGAAGCAGGAACTGATTACCAACGacgattatgatgatgatgcggaGAGTACTAATTTTCAGCAGCCACTTGAAATGCAGGCCTTGGAAGTCCTTCTGGAAGTGGGACATGTGGAGAATTGCTCAAGCTACGAACTAATGGATACTGAAGGACTTTTAGACGACAGTAGTCCTGCGCCTTTCACTCCCCGAAAGGCAACTAATGCTTCGGTATCCACATCAACTCCAAATCGTGGTCAGTATAATGCACGCCATTGCAGCGTTCAGGGCTGCAAGGTGACCGCCAGCGACACGGATGGGATCTCTGTTAAGCTTCACAAATTCCCCACGTCCGTGGAGGCCACACAGAAGTGGATGCACAACACTCgggtggatgtggatgcaaATTTTTCATGGCGCTATCGCATCTGCAGCTATCATTTCGAGCAGGAGTGTTTCAATGGCTCGAGGATCAGACGTGGCTCGATGCCCACCCTGCATTTGGGACCGCAGATGCCTCAGGAGATCTACTACAATGAGTTTACCCAGCCCGGAACGGACCTGTCAGTGGAAGATACCGTTGAGGCAGCCCCCACTAACGCCATGGACACCGCCAATCGTGTTGGTGAGTTGAATGGCAATGGGGATGGCACTGGAAACGTTATAAAGATGCTGCTGCCACGCCCAGCTCCGCCACGTAAGTCCAGCAAGTTCTGTCAAATTGAGGGATGTCAGAATCACCTAACCAGTGAGAATATGACTCTCCATAAGTTCCCCCATGCACCAGAGATGTGCGCCAGATGGCAGCATAACACCCAGGTGCCCTTTGACCCAGAATTCCGCTGGCGCTATCGCATTTGCAGTGCCCACTTTCAACCGTCGTGCCTGGTCAACATGCGTCTGCTTCATGGCAGTGTGCCCACTCTGCACCTTGGACCACGTGCGCCGTCACAGC
TCTTTGACAATGATCTGGAGATTATGATGATGAGAATGGATAAGAAAACAGCAATAGCACACCATGGCAAGCTGGCAGTGGTGAAGCAAGAGCAGAGGGTTAAGTGGGACCGTGAGCAAAAAGTGGAGGAATTTCAGCATGATTGGGTAGAGGACGATGAACACAGTTTCCTGGTGCCTGAGCTGCAACTGCAGGAACAGGAGGAGGAGTGGGAGGGGGAGGAGCTACAGCAGCAAGAGCCATCCCCTAGTTTGAGTCTTTCAGGTTGGAGGCAGCAATTACGTTTGCCCGTCATAAAGCGGGAGGGGGCCGAACAGCAACTAAAGCTGCCTCACAATCGGCATCAAAAGGTGATGTCATACAATCCCATTAAGTCTGGTTACGACAAGTGCTCGCTGGTGCATTGCCAGCGCCAGAGATCCCAGCACGGAGTCCACATCTACAAGTTTCCGCGGTCCAGGCAGCATCAACAGAGATGGATGCACAATCTGCGCATCCGCTATGATGAAAGGCGACCCTGGAAGACCATGATATGCAGTGTACACTTTGAACCGCACTGCATCCGTCTGCGCAAGCTCCGTCCCTGGGCAGTTCCCACTCTGGAACTGGGTGAGAATGTGCCGGAGCACATCTACACCAATGAGCAGTGCCAGGAAATGGCCTACAACGATAACAGTGGGATTGAAACTGAGTGTGAGGACGATGATGGTCTccaagaggaggaggaggatgacGATAACGAGGAAGAAGACGCAGTGCCCCATATCAAAAAGGAGCGGCGATCTCGACTGGACTGGCCCCCAGGCCAACTGCCACCCTGGAAAATCAAACAATGCTGCCTGCCCTACTGCCGCAGACCGCGCGGAGATGGCATCAAGCTCTTCCGGCTGCCCAACAACATCAGTTCGATTAGAAAGTGGGAAAAGGCTACGGGCATGGTGTTCAAGGAGTCTCAACGCAATACCAAACTAATCTGCAGTCGTCATTTTGATCCGGAGCTGATTGGTGTGCGTCGGCTGATGCGTAATGCCATTCCCACACGGCATCTGGGCCCGGTCAGCGATAAGAATGACCAGATGGCAGAGCCCCCGCCCCCGAGATATTCTGCACCACTTACGCCTAAATGCTGTATTCCGAACTGTCATCAGCAATCCGAAGCAGATGCCACAATGCACAAGTTTCCCAGTGATCCGTTGTTGCTCCATCAATGGTGTCAGGCCTTAAAAATCTCCAATGTCCAAGGTCATCGTCACAAGCACGTATGCTCTGCTCACCTGCCCCCCCTAAAGGTGCTCAGCTGTGTGGTTTGTGGTGTCGACAGTGTCCAGCCGCCCCTGCTTGGCTTCCCCGAGAATCGCAACCAGAGAGCCAAATGGTGCTACAATCTGAAAATAGAGCCCATACCCAAATGGGATAACTCCAAAAACATATGCAGCAAACATTTCGAGTCCTTTTGCTTCATCCAGCCGGGAATCCTTCTTCCAGATGCAGCACCCACGCTGCATTTGAATCACGATGACAGTAACATATTCCTCAATGATTATGCCATAGACTTGGACTCTACCCGGACGATGCTGATGCGCATCAAGGATGAGCCCTTGGACAACGACGAGCTGTTGGTGTAG
Protein Sequence
MSQHNPHHHPHPHAHLQWQQQQQQQHHQHKQIQQQQHSWYSHVASYPPHHHASFTSPCKSNNIMNAYGVGAGGPTGTHAYYGNVNAAGGGSGAGSGNMSGGGSNMSGGGYNLEANNVAYAHNQLLQYQQQQQHQQQLSQGSSYMAQSLMHGPYPYIKSEPMEQMQSQQQQQHYQPPSNPMAPPPAPTSTHQSVDDTIIKSEPIDEHVYKSNYIDDNTPFADFSKFPEFGDDMLSPKVELTVKDESYGNNVKNPLNYPRRKLQNDRSSDSLPICQRCKEVFFKKQIYLRHVADSSCIIQEYDFKCSICPMSFMGAEELQDHKLLHRADKFFCHKYCGKHFDTIAECESHEYMQHEYENFVCNMCSSTFTTREQLYAHLPQHKFQQRYDCPICRLWYQTALELHEHRSSAPYFCGKYYRGTESQSTSQQRQQHQPQEEQANYELQDCHMGTIEMPSTHHKSNSSASALPATAALNSLLQQRQANADGAAALFGASSSSSSLKGEVNIKLERSYSNSTNESTYSLHDNSYNNAYGSDTSIHGGGAIGGPQAHSSTLDDSEDALCCVPLCGVRKSSSPTLQFFTFPKDEKYLHQWLHNLKMFHIPASSYASFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPNCNSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSTDFNMSLYRFPRDEVLLRRWCYNLRLDPSVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPPPSNHHQHQQMQHHSLQHISHHGKSYQRHSAASTSSSASSASHYVDAELSAAYTNMSAGGGGSSLNVSDSMDVCCVPSCDSKRHNNESITFHTIPRRAEQMSKWCHNLKIPEEKMHKGMRICSLHFEPYCIGGCMRPFAVPTLHLGHDDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNATLLTKWCANLQRPVPDGSKLFNDAICEVHFEDRCLRNKRLEKWAVPTLVLGHDDIAYPLPTPEQVAEFYARPSAPNNGEEQGECCVDTCKRNPSVDDIKLYRPPEEAEVLAKWAHNLQLEVSQLPSLKICNLHFESHCIGKRMRPWAIPTLNLAGNIENLYENPEHTMLYKRRTHVKSDATSVGSSSKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPMMGSAQRRVCSAHFEPHVLSKKCPVPLAVPTLDLNSQPGYKIYQNPAKLKANKLCLQRVCIVESCRRQRAQGVQLFRLPHSPTQLRKWMHNIRMRPRGAMRAQYRICSMHFESHSFNGKRLSAGAIPTLELGHQDDDIYPNEAQAFADEHCAVERCDASKEQAEVRLFRFPTDDDDLLWKWCNNLKMNPVDCVGVRICNKHFEPDCIGPKHLFKWAIPTMHLGHDDAQIELISNPKPEERYVDPVFKCCVPTCGKTRKFDEVQMNSFPKDPILFQRWRHNLRLEHLDFKRRELYKICNAHFEDICIGKTRLNIGSIPTLELGHDETNDLFQVNPAELQSNLFGRQRRLLDSPTDGSFKEELDEDRDTKSIVTTMAEATDLNTSQVKIKKSLADLKCCVSSCGRSRLEHGARLFAFPNGKQQQNKWRHNLRMEPGEVDKMTRICSAHFNRRCVDGKQLRSWAMPTLQLGHQEQPIYENPKNIPGFFTPTCALSHCRKRRSIDNDLRTYRYPRCEELLEKWRANLRLAPDQCRGRICADHFEPQVRGKLKLKTGAVPTLKLGHDEGVVYDNEAIKVDEDASIDSPLIKPKPEPKEEDEGRRSPQEEEEDDLDHSYFDPLELVETYAEHNSDNDDDPDELEPEQLLPDTPPPILRPAPIPTRPREKPPNNVTPICCLKHCRKERTPSYQLSTFGFPKDQQLLLKWCANLHLEPSDCIGRVCIEHFEAEVLGSRKLKQN
AVPTLNVGHASPLRYTYNGQERAIHDEKPQHSVFRLFSLKHCRKRKLTEPPDMSEAVSKPKRCCLPSCGKQEDVDGVQLHQLPNDHSLRRKWLQNLKLSPHYEVELTSNEYVCVEHFETHLFQDDGEPMGHANSTVKQEHPDRNIFRNIISSPTISNSCLVPSCPCARLNLYRCFDLPANAEVQKAWLHWLQLDEDEATICQGQLCVMHYMQLYEEVPMPDTVSSEELERMQATYGQIANSSMALKLRCAVPGCRSKYTDNIRLTKLPQCQDTYSKWLHNTKIPYDPSRHYIYRICMLHFEPHCLGPVRPKAWAVPTLRLNHKDAIYSNPKLEQDETSTAASSSTFRPILSSTDALKIKTELGPLRIKTELPLLSSPSISASPSPRGKLKFCCIPSCGQQANALTRLYRFPIAEPALLKWLVNTQQQPRYMDAVNLYVCQAHFEPDAICKKQLRSWAVPTLNLGHDGHVIPNARHNGNIADSQENEQAMEFIRENYCSVLTCFQPRSEELRLYAYPKDMPTIRKWAANCKHRSMQASSHGFQVCQSHFTTDCFNPDTGELHEDAVPTLELARSGTELRCVVPGCMKDESSATPSGRRRFQKIPKRTAQLSEWCHNLRLDAATLAVGDHHICERHFEANCFNAYKQLRPGARPTLHLGHDDIDDLLPNPAVFSVDETAEDVCCVPGCGRSNQQADEDHIQFSSFPRMRVLAEKWMYNIRLEPDKEQFGKLKVCSVHFESSCYENGKMRMGSMPTLMLGHDNPDIYQTDEQSFAKNRKHPKKSSGSGMDCCFPHCVELQKNYQRMVYDLPLTEELRQAWLNHLDVAEENFGLEEQSLKLCPLHFIMLYDHSVSHFPEHAAEELMDRNYDDARKSVRIRLISCAVRSCSTLQPRDGGRLHGLPQRRDLLQMWLHNTQLEFSEQQRYMLKICSKHFESRCFTDIRKLKPWSIPTLHLPDLEETDVNAFVAYQNPTEVEWQEMNEKMAKESNAVNVKPEQEDNSLLKPTVLIEHMDDNEQEELPFPSVKQELITNDDYDDDAESTNFQQPLEMQALEVLLEVGHVENCSSYELMDTEGLLDDSSPAPFTPRKATNASVSTSTPNRGQYNARHCSVQGCKVTASDTDGISVKLHKFPTSVEATQKWMHNTRVDVDANFSWRYRICSYHFEQECFNGSRIRRGSMPTLHLGPQMPQEIYYNEFTQPGTDLSVEDTVEAAPTNAMDTANRVGELNGNGDGTGNVIKMLLPRPAPPRKSSKFCQIEGCQNHLTSENMTLHKFPHAPEMCARWQHNTQVPFDPEFRWRYRICSAHFQPSCLVNMRLLHGSVPTLHLGPRAPSQLFDNDLEIMMMRMDKKTAIAHHGKLAVVKQEQRVKWDREQKVEEFQHDWVEDDEHSFLVPELQLQEQEEEWEGEELQQQEPSPSLSLSGWRQQLRLPVIKREGAEQQLKLPHNRHQKVMSYNPIKSGYDKCSLVHCQRQRSQHGVHIYKFPRSRQHQQRWMHNLRIRYDERRPWKTMICSVHFEPHCIRLRKLRPWAVPTLELGENVPEHIYTNEQCQEMAYNDNSGIETECEDDDGLQEEEEDDDNEEEDAVPHIKKERRSRLDWPPGQLPPWKIKQCCLPYCRRPRGDGIKLFRLPNNISSIRKWEKATGMVFKESQRNTKLICSRHFDPELIGVRRLMRNAIPTRHLGPVSDKNDQMAEPPPPRYSAPLTPKCCIPNCHQQSEADATMHKFPSDPLLLHQWCQALKISNVQGHRHKHVCSAHLPPLKVLSCVVCGVDSVQPPLLGFPENRNQRAKWCYNLKIEPIPKWDNSKNICSKHFESFCFIQPGILLPDAAPTLHLNHDDSNIFLNDYAIDLDSTRTMLMRIKDEPLDNDELLV
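
The C2CH consensus quoted in the domain description (Cys, 2-4 residues, Cys, 35-50 residues, Cys, 2 residues, His) can be written loosely as a regular expression and scanned over a protein sequence. This is only a rough sketch: the pattern and the helper name `find_c2ch` are assumptions made for illustration, the pattern is not the Pfam PF05485 profile, and it may both miss divergent THAP domains and match unrelated cysteine-rich stretches. The test fragment is copied from the protein sequence above.

```python
import re
from typing import List, Tuple

# Loose reading of the consensus: Cys, 2-4 residues, Cys, 35-50 residues,
# Cys, 2 residues, His. "." stands for any residue.
C2CH_PATTERN = re.compile(r"C.{2,4}C.{35,50}C.{2}H")


def find_c2ch(protein: str) -> List[Tuple[int, int]]:
    """Return non-overlapping matches as 1-based, inclusive (start, end) pairs."""
    seq = protein.upper()
    return [(m.start() + 1, m.end()) for m in C2CH_PATTERN.finditer(seq)]


# Fragment taken from the protein sequence in this record.
fragment = "CCVPLCGVRKSSSPTLQFFTFPKDEKYLHQWLHNLKMFHIPASSYASFRICSMHFPKR"

for start, end in find_c2ch(fragment):
    print(start, end, fragment[start - 1:end])
```

Because `finditer` reports non-overlapping matches only, a motif that starts inside an earlier match is not listed separately; for domain calls, the hmmscan results remain the more reliable source.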

Similar Transcription Factors

Clustering by sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-