Basic Information

Gene Symbol
-
Assembly
GCA_963924055.1
Location
OZ001353.1:6898681-6916043[+]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 29 9 2.8e+04 -5.1 1.2 13 42 20 50 14 55 0.56
2 29 5e-15 1.6e-11 45.4 4.3 1 86 577 649 577 650 0.85
3 29 2.9e-15 8.8e-12 46.2 4.6 1 87 677 746 677 746 0.83
4 29 6.2e-16 1.9e-12 48.4 0.4 1 87 768 840 768 840 0.84
5 29 6.1e-16 1.9e-12 48.4 5.7 1 87 946 1016 946 1016 0.83
6 29 5.8e-15 1.8e-11 45.3 3.1 1 86 1040 1111 1040 1112 0.82
7 29 8.8e-13 2.7e-09 38.2 1.0 1 87 1147 1215 1147 1215 0.80
8 29 9.8e-11 3e-07 31.7 1.4 1 86 1260 1329 1260 1330 0.76
9 29 4.8e-16 1.5e-12 48.7 0.1 1 86 1357 1426 1357 1427 0.83
10 29 3.1e-13 9.6e-10 39.7 1.3 1 86 1448 1517 1448 1518 0.82
11 29 4.7e-15 1.4e-11 45.6 1.7 1 86 1545 1616 1545 1617 0.85
12 29 1.3e-13 3.9e-10 41.0 1.5 1 85 1699 1767 1699 1768 0.82
13 29 3.2e-12 1e-08 36.4 0.1 1 86 1792 1860 1792 1861 0.81
14 29 1.8e-14 5.5e-11 43.7 0.6 1 87 2009 2078 2009 2078 0.79
15 29 2.7e-12 8.3e-09 36.7 0.1 1 60 2133 2186 2133 2209 0.80
16 29 0.0027 8.4 7.8 0.0 1 58 2221 2271 2221 2287 0.80
17 29 7.3e-12 2.3e-08 35.3 0.3 1 86 2310 2379 2310 2380 0.84
18 29 8.6e-15 2.6e-11 44.7 2.3 1 86 2460 2529 2460 2530 0.83
19 29 2.9e-13 8.9e-10 39.8 1.2 1 86 2565 2636 2565 2637 0.81
20 29 4.6e-13 1.4e-09 39.2 1.9 1 87 2647 2719 2647 2719 0.80
21 29 1e-15 3.2e-12 47.6 0.1 1 86 2747 2820 2747 2821 0.80
22 29 0.00042 1.3 10.4 0.0 1 58 2856 2906 2856 2934 0.73
23 29 3.2e-14 9.9e-11 42.9 0.1 1 86 2944 3016 2944 3017 0.79
24 29 4.5e-15 1.4e-11 45.6 0.2 1 86 3174 3246 3174 3247 0.83
25 29 2.4e-13 7.4e-10 40.1 2.6 1 87 3314 3385 3314 3385 0.82
26 29 1.3e-14 4e-11 44.1 3.7 1 86 3495 3565 3495 3566 0.84
27 29 6.2e-13 1.9e-09 38.8 0.0 1 87 3660 3730 3660 3730 0.85
28 29 4.6e-09 1.4e-05 26.3 2.1 1 58 3749 3795 3749 3802 0.88
29 29 5.4e-10 1.7e-06 29.3 0.9 17 87 3811 3870 3798 3870 0.73

Sequence Information

Coding Sequence
ATGTCACAACACAACAACCCCccgcatcatcatcaacatcactactaccagcagcagcagcaacaacaacagctacaactacaTCACCAGCATCCacaccatcagcagcagcaacaactacaacataaacaaatacaacagCACAGTTGGTACTCACATGTTGCTTCCTACCCTCCCCACCATCCGCACGCCGCAGCCTTTGCGGCGCCCTGCAAAACCAataataacagcagcagcaacaacaacaacataaatgCATACGGTGCGGGAACTGGAGGCACGCATGCAGGATATTATGgctctgctgttgctgcagcagGTGGGGGGGTGGGCTATAACCTTGAGGGCAACACTGTGGCCTATGCGCACAACCAGCTGCtgcaataccaacaacaacaacagcagcaccagcagcagcaacaacaacatcagctcAGTCAACGCTCGTATATGGCGCACAGTTTAATGCACAGCTCTTATCCCTATATTAAGAGCGAGCCATTGGAGCTGCCTGATGATAGACAACGCCAacagcatcatcaacaacaacaacaccagcaacaacaacatttccaAAATCCTATGGCACCGCCGCCAGCTCCCGCCAATCGACACAGTCTCGATGCTAGCGGCGAAATGGTAATAAAATcgGAACCCATTGACGAACATGCCTACAAGTCCAACTATATCGATGATAACACGCCCTTTGCGGATTTTAGTAAATATCCGGAATTCGGCGACGACATGTTGAGTCCCAAGGTGGAGCATTCGGTCAAGGATGAAGGATATGGGAGTCAAAAAGTTCCGAACCCGCTCAGCTATCCGAGACGCAAGCTGCAATCGGAGCGCTCTTCGGAAAGTCTTCCCATTTGTCAGCGTTGCAAGGAGGTGTTCTTCAAGAAGCAAATCTACTTGCGTCATGTGGCCGAGAGCAGTTGCAGCATACAGGAGTATGATTTCAAGTGCAACATTTGCCCCATGTCCTTTATGAGCACCGAGGAGTTGCAGAAGCACAAGCACCAGCACAGAGCAGACAAATTCTTTTGCCACAAATACTGTGGCAAGTACTTTGACACGATCGCCGAGTGCGAGTCGCATGAGTACATGCAGCACGAGTATGAGAGCTTTGTTTGCAACATGTGTTCCGTTACGTTCGCCACGCGGGAACAGCTGTATGCCCATTTGCCGCAACACAAATTCCAGCAGCGTTACGATTGCCCCATCTGTCGCTTGTGGTACCAAACTGCTTTGGAGCTGCACGAACATCGTCTAGCTGCTCCTTATTTTTGTGGCAAGTATTACGCAGCCGCACAATCGGCATcacaccaacagcagcagcaacagcatccacagcagcatcagcaacaggcCAACTACAAACTGCAGGACTGTCACATGGCCACCATGGAAATGCCAACGCCGCATCACAAGGCAAATGCCGCTGCCACCGCATTGCCGGCAACGGCGGCGTTGAGTTCGTTGTTGCAACAACGTCAGGCGAATGCCGATGGAGCCGCTTTGTTTGCCTCAACGCTGAAGAACGAGCCGAATGTGAAGCTGGAGCGAAGCTATAGTAATTCCACAAGCGAGTCGGGCTACAGTTTGCACGACAGCAGCTATAACAATGCCTATGGCAGCGACACCTCGCTGCATGCTAGTGGTGGGGCTGGTGCGATTGGTGGTCCACAGGCGCACTCCTCGACGCTGGACGATTCGGAGGATGCACTGTGCTGTGTGCCGCTGTGCGGTGTCCGTAAGAGCACCAGTCCGACGCTGCAGTTCTTCACTTTCCCCAAGGATGAGAAGTACTTGCATCAGTGGCTCCATAACCTTAAGATGTTCCATATTCCGGCATCGAGTTATGCGACTTTTCGTATTTGCAGCATGCACTTTCCCAAGCGCTGCATCAATCGATATTCCCTGTGCTATTGGGCTGTACCCACGTTCAATTTGGGTCACGATGATGTTGCCAATTTGTATCAGAATCGGGAACTGACCAACACCTTCACAACCGGCGAGGTGGCCCGTTGCAGCATGCCCAACTGCACCAGCCAGCGGGGAGAGAGCAATCTCAAGTTCTACAACTTTCCCAAGGATATTAAGAGTCTGATCAAGTGGTGCCAGAACGCCCGTCTGCCGGTCCAGGCCAAGGAGCCGCGTCACTTTTGCAGTCGCCACTTCGAGGAGCGCTGCATTGGCAAGTTCCGCCTGAAGCCCTGGGCAGTGCCCACATTGCATTTGGGCGCCCAATACGGCAAGATCCATGACAATCCGAAGAACTTGTATGTGGAGGAGAAACGCTGCTGCCTTAATTTCTGTCGTCGCAGTCGCTCCTCTGACTTTAACATGTCACTGTATCGCTTTCCCCGGGACGAGGTACTCCTTCGACGCTGGTGCTACAATCTTCGTCTGGACCCTTCCGTCTATCGCGGCAAGAATCACAAAATATGCAGCGCTCACTTTATCAAAGAGGCTTTGGGACTTCGCAAATTGTCACCAGGAGCTGTCCCCACGCTGCATTTGGGCCACAACGACACCTTCAACATCTACGAGAATGAACTGTGGCCCCCACCAACGCCCGCCACGCCCACAgctcaccagcagcagcagcaacaacaacagttgcatcagcagcaccagcaacagttgcagcaacaacattcgCATAATCATGGCCACAAATATCAACGTCATTCGGCAGcatcgacgtcgtcgtcggccAGCTCGGCGTCGCACTATGTGGATGCGGACATGGGTGGCTCGTATATGGGAATGGGCAACTCGGGCGGATCTGGAGGATCGTCTGGGCTGAATGTGAGCGACAGCATGGACGTGTGCTGTGTGCCCAGCTGTGAGAGTAAACGgcacaacaacgagaacatcACATTCCATACGATACCGAGGAGGCCGGAGCAGATGAGGAAGTGGTGTCACAACCTGAAGATACCCGAGGATAAGATGCACAAGGGAATGCGCATCTGTAGTCTGCACTTTGAGCCCTACTGCATTGGTGGTTGCATGCGTCCATTTGCGGTGCCCACGTTGCATCTGGGGCACGATGACGAGGACATCCATCGTAATCCGGATGTGATCAAGAAGTTGAATATACGTGAAACCTGTTGTGTAGCTGTCTGCAAACGCAACCGGGATCGGGATCATGCCAATCTGCATCGCTTCCCCAGCAATGTCGCTCTTCTGACCAAGTGGTGCGCCAATCTGCAGCGTCCTGTTCCAGATGGCACCAAGCTCTTCAACGATGCCATCTGTGAGGTGCACTTTGAGGATCGCTGTCTGCGCAACAAGCGACTGGAGAAATGGGCGGTGCCAACGTTAATCCTGGGCCATGACAATATTGCCTATCCGCTGCCCACGGCGGAGCAGGTGGCCGAGTTCTATTCCCGTCCCAGTGCACCCAACAATGGCGAGGAGCAGGGTGAGTGCTGTGTGGAGACCTGTAAGCGCAATCCCAGCGTGGACGACATTAAACTCTATCGTCCGCCAGAGGAGTCGCAGGTGCTGGCGAAATGGGCACATAATCTGGAGCTGGACGTCGCCCAGTTGCCCAACATGAGGATCTGTAATCTGCACTTTGAGTCCCATTGCATTGGCAAGCGGATGCGACCCTGGGCCATACCCACGCTCAATCTCTCCACCAATATTGAGAATCTCTTCGAGAATCCCGAGCGACAGATGCTGTACAAGCGTCGCACGCATCTCAATTCCAATTCGAATCCCGATCGAGGAGCTGGCATCGTGAAGCCGACGTGGGTGCCACGTTGCTGCTTGCCACATTGTCGCAAGGTGCGCGCCCTGCACAATGTGCAGCTGTATCGCTTCCCCAAGCTCAATCGCTCCACGCTCGCCAAGTGGGCGCACAATCTGCAGGTGCCGCTGGTGGGCAGCGCCCAGAGACGCCTCTGCTCCGCCCACTTTGAGCCGCATGTGCTGAGCAAGAAGTGCCCGGTGCCGCTGGCGGTGCCCACGCTGGACCTTAACTCACCGCCTGGCTACAAGATCTATCAGAATCCCGCCAAGCTGAAGGCCAACAAGCTCTGCCTGCAGCGGGTCTGCATTGTCGAGAGCTGTCGGCGGCAACGTGGTCAGGGGGTTCAGCTCTTCCGGCTGCCCCACAATCCAACCCAGCTGCGCAAGTGGATGCACAACATAAAGATGCGGCCGAGAGGCGCCATGCGGCAACAGTATCGCATGTGCTCCATTCACTTTGAGTCGCACTCGTTCAATGGGAAGCGGCTGAGTGCCGGGGCGATTCCAACTCTGGAACTGGGACATCAGGATGACGATATCTATCCGAACGAGGCGCAATCCTTTGTCGAGGAGCACTGCACTGTCGAGGGCTGTGATGCGAACAAGGAGCAGCCGGATGTGCGTCTTTTCCGATTCCCCACCGAAGACGAGGATCTGCTCTGGAAATGGTGCAACAATCTCAAGATGAATCCCGTCGACTGTGTCGGCGTTCGCATCTGCAACAAGCATTTCGAGGCGGACTGCATCGGGCCCAAGCATCTGTACAAGTGGGCGATACCCACTCTGGAGCTGGGGCACGATGATGACCAGATTGAGCTCATACACAATCCGAAGCCGGAGGATCGCTACGTTGATCCGGTGTTCAAGTGCTGTGTTCCCACGTGCGGCAAGACGCGCAAGTTCGACGAGGTGCAAATGAACAGCTTCCCCAAGGATCCAACACTCTTCCAGCGCTGGCGACACAATCTCCGTCTCGATCATCTCAATTTCAAGGAGCGCGAACGCTACAAGATCTGCAACGTCCACTTTGAGGACATTTGCATTGGGAAGACTCGGCTCAACATTGGCTCCGTTCCCACACTGGAGCTGGGGCACGACGAGACCGAAGATCTCTTCCAGGTGAATCCCGCGGAACTGCAAAGCAATCTGTTCGGGCGTCAACGACGTGTCCATGGATACATCAAGCAGGAGGAACCAGACGACGATCAGGAGGAACAGGAGGAGCACTCCGAGCTGGATGAGGACATCAAGCCGGACATTAGCATGTCGGAGGCGACGGACTTGAATACATCGCAGGTTAAAATAAAGAGATCTCTGGTAGATTTCAAGTGCTGTGTGCCAAGCTGTGGACGCAGTCGCTTGGAGCATGGAGCACGCCTCTTCCCCTTCCCCAGtggcaagcagcagcagagcaaGTGGCGTCACAATCTCCAACTGTCGTCCGCCGACGTGGACAGATCGACTCGTGTGTGCAGCGCCCATTTCAATCGACGGTGCATCGACGGGAAGCAGCTGCGGGGCTGGGCAATGCCCACTCAGCTGCTGGGACACCAGGAGCAGCCCATCTATGAGAATCCAAAGAATATACCGGGCTTCTTCACGCCCACCTGTGCCCTGGCCCATTGCCGCAAGCGTCGGAGCATTGACAATGATCTGCGCACCTATCGGTATCCGCGCAGCGAGGATCTCCTCGAGAAATGGCGCGTCAATCTGAGACTGGCGCCGGATCAGTGTCGCGGACGCATTTGTGCCGATCACTTTGAGCCCATGGTGCGTGgcaagctgaagctgaagacGGGAGCGGTGCCCACCCTGAAGCTGGGACATGATGAGGGCGTGGTCTTTGACAATGAGGCTATTAAGGCGGGTCTGCAGCAGGAGGCGGAGGTGGAGGCGGACGAGGGCGAGGCATTCATGGAGTCGCTTGTCAAGGTCAAGCAGGAGAAGGTGGATCCGGAGGACGAGCAGCCAGATAACGATGATGAGGATATAGATGAGCAGCCGGACAAAGATCAGGAAGATCATGGCTACTTTGATCCCTTAGAGTTGGTGgaaacctttgcggatcatcaCAGCGACGACAACGCCTCCGaagacaatgatgatgatgatgaggatggGGAAGGGGATGACgctgacgatgacgatgaagCTGTCGATGATGACGAGCTGTTGCTGCCGGATACGTCGCCAGTTCCAGTGGCGCCACTTCCATTGCCGCCACGCCGAGAGAAGCCTGTGAACAATGTGACGCCCATTTGTTGCCTGAGACACTGTCGCAAGGAGCGCACGGCAAGTCATCTGCTGAGCACCTTTGGCTTCCCCAAAGATCCGCAGCAACTGCTTAAGTGGAGCGCCAATCTGCAGCTGGCACCCGACGATTGCGTCGGTCGCGTCTGCATCGAACACTTTGAGTCCGAGATGCTGGGCACGAGGAAGCTGAAGCAGAATGCGGTGCCCACACTGAACCTGGGACACAGCACGCCCCTGCGCTACAGTTGCAATGGCCAGTCGGTGGGCATTCATGATGCCCAGCCGCAGCATTCGGTTTTTCGGCTTTGGAGCCTAAAGCATTGCCGCAAACGGAAGCTGCCAATGGAGCCGCCGGATCAGACAGCGATTAAGCGACGCTGCTGCCTGCCCAGCTGTGGCAAGCAGCCGGATCTCCATGGCGTCCAGCTGCATCGTCTGCCCAGCAACCGCATCCTGCTGCGCAAGTGGCTCCACAACCTGAAGCTGCCACCAATGATGGACACCAGCCAGGCGCGTCTCTGTAGCGAACACTTTGAGACGCCGCTGGAGGATTGTGTGCCCACACTGCGGCTGGGTCATGACGATACCCACATTTATCGCAATCGTGGCAGCATCTCGGCAGCATCGTCGTCATCCAGTGGCTGCATGGTGCCCAGCTGTCCCTGTGCCCGCCTCAATCTCTATCGCTGCCATGATCTGCCCGAGAATCGTCTGGTGCAGCAGGCCTGGCTGCAGTGGCTCCAACTGCCGATGCCCCAGCTGGCCAGCGATGGCAAGCTCTGCGTGATGCACTACATGCAGCTGTACGAGCAGGTGCCACTGCCCCAGGAGCTGCCAGAGCCAGTGCTCCGTCAGCTGCAGGAGACCTATGACCTAATCGCCAGTTCCTCCATGGCCATGAAGCTGCGCTGTGCCATTCCCGGCTGTTACTCCAAGTACACGGACAACATCCGGCTGACCAAGCTGCCAATGTGCCCGGATATGTCTGCCAAGTGGCTGCACAACACCAAGATCACGTACGATCCTGCCCGACATTATATTTACCGCATCTGCATGCTTCACTTCGAGGCACACTGCCTGGGTCCAGTGCGCCCGAAGCCGTGGGCGGTGCCAACGCTACAGCTGAATCACAGCGATCCTCATATCTACCATAATCCCAAGCTGGATGACAACAGGCAACCAGTGTCCGCGCCAGTTCCCGTTCCTGTGGCCATCTCCACGCCCGTTCCCGATCCCGTTCCCGTGTCTTTGTCCACGTCTGTGCCCGTGGAGCTGCCGTTGCGTATTAAAACGGAGCTGGCCTTCTGCGGCAGTCCCAGCGCCAGTGCCAGTCCGAGTCCACGTGGCAAGCTGCGCATCTGCTGCATTCCCAGCTGCGCCCAACAGGCCACATCGCAGACGCGTCTCTTTCGCTTTCCCAGCGCCGAGACGTCGCTGCTCAAGTGGCTGGTGAAtacgcagcagcagccgcgATTGGTGGACGCACAGCATCTGTTCGTCTGTCAGGACCACTTCGAGGAGGACGCCATTTGCAAGAAGCAACTGCAAAGTTGGGCCGTGCCCACGCTGAGGCTTGGACACGATGGCCATGTCATACCAAATGCGAGGCACAATGGCAACATCGCCGACAGCCAGGAGAACAAGCAGGCGCTGCAGTTCATCTGGGAGAACTACTGCTCCGTCTTGAGCTGCTTCCAGCCGCGCAGCGAGCAACTGCGTCTCTTTGCCTACCCCAAGGATCGACCCACCATACGCAAGTGGGCGGCCAACTGCAAGCATCGTTCCATGCAGGCCAGCAGCGATGGCTACCAGGTCTGTCAATCGCACTTTGCGCCGCATTGCTTTGACCCGGACACGGGTGAGTTGCGGGAGAATGCGGTGCCCACGTTGGAGCTCAGTCGTTGCATGAATGAGGTGCGCTGCGTGGTGCCCGGTTGTGTCAAGGATGAGGATGCGCCGCGTCAACGCTTTTACAGGATGCCCAAGCGCTTCTCTCAGCTCATCACTTGGTGTCACAATCTTCGACTGGATTCCACAGCCATGGCAAGCGGGGATCATCATGTGTGCGATCGTCACTTTGAGACGCACTGCTTCAACCAGCAGAAACTTCTGCGTCCTGGAGCGCGTCCCACGCTGCACTTGGGCCATGACGAGCAAAAAGACGTGTTGCCCAATCCGGCTGAGTGGGATGCCCCAGATGCGGTGGCTGCCACGAACGCCTGCTGCGTGCCCAACTGTGGACACTCCAAGGAAGAGGAGGATCCCGTCCAGGTGCAGCTCTTTGGCTTCCCAAAACTGCGGGTCCTGGCGGAGAAATGGCTACAGAATATTCGCCTGGACCTTGGCAAGGAGCAGCTGGCCAAGCTGAAGATCTGTTCGGCACACTTTGAGCCCAGTTGCCTGGAAAATGGACGACCCCAGTTGGGTGCCATGCCCACGCTGCAGCTGGGCCACGAGGAGCTCCACAATATACATCGCAGCCGGAGCACAGAGCCAACGGCCGGCAAGGCCAAGAAGTATTGCAACAGGAGTGGCTCCAGCCATGACTGCTGCTATCCGCAGTGCGTGGAGCTGCAGAAGACTTACCTGAGGAATAGCTACGATCTGCCGCAGGGCGAGGCACTGCGTCGCAGGTGGCTGGACTACATGGCCATAGAGGAAGTGGAGGATAAACCGCTCAAGCTCTGTCCGCTGCACTTGATCCTTCTCTACGATCACAGTGACGAGCATTTTGAGGAGCACACGAAGGAGCAGCTGCTGGACACCAACTACGAGGACGCACGCAGCAGTGTCCGCATCCGTGTCATCAGCTGTGCGGTGCGGGGTTGTCGGACCCTGAAGCCACGAGACGGTGGGCGACTACACGGATTGCCCCAGCGGCGGGATGTGCTCGAGATGTGGCTGCACAACATGCAGCTGGTGTTCTACGAGCACCAGCGATACCTATACAAGATATGCAGCAAACACTTTGAGGCCAGTTGCTTCATGGACACGACACGTCGTCTCAAGCCCTGGACAATGCCCACGCTGGAGCTGCCGGAGCGGGAGCCGGGCGAGGCGCCTGTCTATCAGAATCCCACTGAAGACGAATGGCGGCGCATGAATGAGCTGCTGGCAgcggagcagcagcagcaagaacaGCAGCTCATCCCGGAGGATCATGAAGGCGAAGATATTGATATGCTGGAGCCAATAGTGAAGATCGAGCATCAGCTAGACGATGAAGAGGAGGaagagcagctgcagcagcagccagaGGGGGAAGAAGATTATGACAACTCACAGCAGCCGCTGGAAGTGCTCCTCGAGGTGGGACACGTTGAGAAGTGCACCACCTATGAGCAAATGGATTCAGAGGCCAATCTTGGCTATGCCGCCGAGCAGCAGACGCAGATGAGCAGCAGCTTTAGTCCGTCAGTGTCCATGTCGATGTCCGCGTCTGCATCGCAATATGGGAGTGGCGCCATGGTCAGCAATGGATTCAAGTACAATGCCCGCCATTGCAGCGTCCGAGGATGCGATGTGACGGCCAATGATGTGAGTGGCAGCATCAAATTGCACAAGTTCCCCACCTCGCTGGATGCCATGCAGAAATGGATGCACAACACCCAGGTGGATGTGGACACGAACTTCGCATGGCGTTTTCGTATTTGCAGCTATCACTTCACCGGCGAATGCTTCAATGGATCGCGCATAAGGCGTGGTGCCATGCCAACGCTTAGTCTAGGACCATGTCGTCCTCCGACCATCTACGACAACGAGTTCAACACATCGCTGCAGCCGGAACTGGAGCAGGCCAATGAGGTGACTGGCGAGGAGCAGCTGATCCCTGATGCGGAACCAAGTGAGACGCGCACAAAGGGCAGTGAGATTAGCCTGCGTCTGCCACATCCGGCACCGCCACGTAAATCCAGCAAGTTCTGCCAGATCGAGGGTTGTCCCAATCATTTAACCAGCGAGAACCTGACACTCCACAAGTTTCCGCATTCGGTGGACATGTGCGCCAAGTGGCAGCACAATACCCAGGTCCCGTTTGATCCGGACTTCCGTTGGCGCTATCGCATCTGCAGCGCCCACTTCGAGCCCATCTgtctgattaacatgcgtctGGTGCACGGCAGCGTGCCCACACTTCATCTGGGGCCACGTGCTCCGCGCCAGCTCTTCGAGAGTGACTTTGAGGCGATAAACATGCGATTGGATAAACAGAAGAGCAGCTCGGATCAGCAGCTGGAGCAGCGGGAGCAACTGCAGCTTCAGGAAGaagaggatgaggatgagaatGCACTGAGTTATCTGGTGCCAGAGATGCAATTGCATGAGGATGCCGATGTCGATGGCGATGCGGATCATTCGGACACTCCCGTGACTTACAGCAACAAATGGAAGGATCTGCGCTTGCCCAGCATCAAGCAGGAGAAGACAATGACACTAACGAATCCAGTCAAATCTGGCTATGACAAGTGTTCCCTGGTGCATTGCCAGCGCCAGCGTTCACAGCATGGCGTCCATCTCTACAAGTTTCCACGCTCGAAGCAACTGCAGAAGCACTGGATGCACAACTTGAGGATCATATACGACGAGCGACGTCCCTGGAAGACCATGATATGCAGCGCACACTTTGAGCCCAACTGCATCCGACTCCGCAAGCTGCGTCCGTGGGCAGTGCCCACGCTGGAGCTGGGCGACAATGTGCCGCAACAGATCTACACAAATGAACAGAGCCGGATGCAGCTGGAGATGGGCAGCGATAACGAGGAATTGGAATTGAGCACGGAcgtgtccatgtccatgtccgtGTCCGTGGAGGATACTCAGTTAGAGGGAGCCTACGAAGACGACGATTATGATGATGAAGACGATGACTATTTGGCTACGGAGCCATTGGTAAAAAGGGAGCGTCGCTCACGCTTCGATCCCTTGCGGCCAGGTCAGTTGCCGCCTTGGAAAATGAAATTCTGCTCCTTGCCCTACTGTCGCAGCCCACGCGGTGATGGCATCAAGCTCTTCCGGCTGCCCAATAACATCAGTTCCATTCGCAAATGGGAAAAGGCGACCGGCATTCACTTCACTGAATCCCAACGCAATACGAAGCTCATCTGCAGTCGCCACTTTGATCCACAGCTGATCGGGGTACGTCGGCTCATGTACAATGCCGTGCCCACACTTCATCTGGGTCCAATAAGTGCAGTTAGTCAACCCGTGGCAGCTCCTGCAGCTCCACGCTGCTGTATGCCTGATTGTCAGGAGGAAGTCAAGTTGCACAAGTTTCCCAGTGATCCCCTGCTGTTGCATCAATGGTGTCACGCGCTAGATCTCTCGGATACTCAGCGATATAGTAGCAAACACATCTGCGCCAGACATCTGCCCGCCAAAGCTCCGAGTTGCATCGTCTGTGGCGTGGAACATATCAAGATGCCATTGCTAGACTTCCCGGAGAATCGCAATCAGCGCGCCAAATGGTGTTACAATCTCAAAATCGAGCCCATCCCCAAGTGGGATAGCTCAAAGCAGATATGCAGCAAGCACTTCGAGAGCTACTGCTTCGTTCAGCCGGGTCAACTGCTGCCCGAGGCAGCTCCCACGTTGCACTTACAGCACAACGATAGCAACATATTTCTGAACGATTATACCATAGATCACAGCAAGATGCTGCGCATTAAGGATGAGCCCTTGGACAGCGAGGACCTGATGCtgtaa
Protein Sequence
MSQHNNPPHHHQHHYYQQQQQQQQLQLHHQHPHHQQQQQLQHKQIQQHSWYSHVASYPPHHPHAAAFAAPCKTNNNSSSNNNNINAYGAGTGGTHAGYYGSAVAAAGGGVGYNLEGNTVAYAHNQLLQYQQQQQQHQQQQQQHQLSQRSYMAHSLMHSSYPYIKSEPLELPDDRQRQQHHQQQQHQQQQHFQNPMAPPPAPANRHSLDASGEMVIKSEPIDEHAYKSNYIDDNTPFADFSKYPEFGDDMLSPKVEHSVKDEGYGSQKVPNPLSYPRRKLQSERSSESLPICQRCKEVFFKKQIYLRHVAESSCSIQEYDFKCNICPMSFMSTEELQKHKHQHRADKFFCHKYCGKYFDTIAECESHEYMQHEYESFVCNMCSVTFATREQLYAHLPQHKFQQRYDCPICRLWYQTALELHEHRLAAPYFCGKYYAAAQSASHQQQQQQHPQQHQQQANYKLQDCHMATMEMPTPHHKANAAATALPATAALSSLLQQRQANADGAALFASTLKNEPNVKLERSYSNSTSESGYSLHDSSYNNAYGSDTSLHASGGAGAIGGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLHQWLHNLKMFHIPASSYATFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPNCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPSVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPTPATPTAHQQQQQQQQLHQQHQQQLQQQHSHNHGHKYQRHSAASTSSSASSASHYVDADMGGSYMGMGNSGGSGGSSGLNVSDSMDVCCVPSCESKRHNNENITFHTIPRRPEQMRKWCHNLKIPEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLHLGHDDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVALLTKWCANLQRPVPDGTKLFNDAICEVHFEDRCLRNKRLEKWAVPTLILGHDNIAYPLPTAEQVAEFYSRPSAPNNGEEQGECCVETCKRNPSVDDIKLYRPPEESQVLAKWAHNLELDVAQLPNMRICNLHFESHCIGKRMRPWAIPTLNLSTNIENLFENPERQMLYKRRTHLNSNSNPDRGAGIVKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPLVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTLDLNSPPGYKIYQNPAKLKANKLCLQRVCIVESCRRQRGQGVQLFRLPHNPTQLRKWMHNIKMRPRGAMRQQYRMCSIHFESHSFNGKRLSAGAIPTLELGHQDDDIYPNEAQSFVEEHCTVEGCDANKEQPDVRLFRFPTEDEDLLWKWCNNLKMNPVDCVGVRICNKHFEADCIGPKHLYKWAIPTLELGHDDDQIELIHNPKPEDRYVDPVFKCCVPTCGKTRKFDEVQMNSFPKDPTLFQRWRHNLRLDHLNFKERERYKICNVHFEDICIGKTRLNIGSVPTLELGHDETEDLFQVNPAELQSNLFGRQRRVHGYIKQEEPDDDQEEQEEHSELDEDIKPDISMSEATDLNTSQVKIKRSLVDFKCCVPSCGRSRLEHGARLFPFPSGKQQQSKWRHNLQLSSADVDRSTRVCSAHFNRRCIDGKQLRGWAMPTQLLGHQEQPIYENPKNIPGFFTPTCALAHCRKRRSIDNDLRTYRYPRSEDLLEKWRVNLRLAPDQCRGRICADHFEPMVRGKLKLKTGAVPTLKLGHDEGVVFDNEAIKAGLQQEAEVEADEGEAFMESLVKVKQEKVDPEDEQPDNDDEDIDEQPDKDQEDHGYFDPLELVETFADHHSDDNASEDNDDDDEDGEGDDADDDDEAVDDDELLLPDTSPVPVAPLPLPPRREKPVNNVTPICCLRHCRKERTASHLLSTFGFPKDPQQLLKWSANLQLAPDDCVGRVCIEHFESEMLGTRKLKQNAVPTLNLGHSTPLRYSCNGQSVGIHDAQPQHSVFRLWSLKHCRKRKLPMEPPDQTAIKRRCCLPSCGKQPDLHGVQLHRLPSNRILLRKWLHNLKLPPMMDTSQARLCSEHFETPLEDCVPTLRLGHDDTHIYRNRGSISAASSSSSGCMVPSCPCARLNLYRCHDLPENRLVQQAWLQWLQLPMPQLASDGKLCVMHYMQLYEQVPLPQELPEPVLRQLQETYDLIASSSMAMKLRCAIPGCYSKYTDNIRLTKLPMCPDMSAKWLHNTKITYDPARHYIYRICMLHFEAHCLGPVRPKPWAVPTLQLNHSDPHIYHNPKLDDNRQPVSAPVPVPVAISTPVPDPVPVSLSTSVPVELPLRIKTELAFCGSPSASASPSPRGKLRICCIPSCAQQATSQTRLFRFPSAETSLLKWLVNTQQQPRLVDAQHLFVCQDHFEEDAICKKQLQSWAVPTLRLGHDGHVIPNARHNGNIADSQENKQALQFIWENYCSVLSCFQPRSEQLRLFAYPKDRPTIRKWAANCKHRSMQASSDGYQVCQSHFAPHCFDPDTGELRENAVPTLELSRCMNEVRCVVPGCVKDEDAPRQRFYRMPKRFSQLITWCHNLRLDSTAMASGDHHVCDRHFETHCFNQQKLLRPGARPTLHLGHDEQKDVLPNPAEWDAPDAVAATNACCVPNCGHSKEEEDPVQVQLFGFPKLRVLAEKWLQNIRLDLGKEQLAKLKICSAHFEPSCLENGRPQLGAMPTLQLGHEELHNIHRSRSTEPTAGKAKKYCNRSGSSHDCCYPQCVELQKTYLRNSYDLPQGEALRRRWLDYMAIEEVEDKPLKLCPLHLILLYDHSDEHFEEHTKEQLLDTNYEDARSSVRIRVISCAVRGCRTLKPRDGGRLHGLPQRRDVLEMWLHNMQLVFYEHQRYLYKICSKHFEASCFMDTTRRLKPWTMPTLELPEREPGEAPVYQNPTEDEWRRMNELLAAEQQQQEQQLIPEDHEGEDIDMLEPIVKIEHQLDDEEEEEQLQQQPEGEEDYDNSQQPLEVLLEVGHVEKCTTYEQMDSEANLGYAAEQQTQMSSSFSPSVSMSMSASASQYGSGAMVSNGFKYNARHCSVRGCDVTANDVSGSIKLHKFPTSLDAMQKWMHNTQVDVDTNFAWRFRICSYHFTGECFNGSRIRRGAMPTLSLGPCRPPTIYDNEFNTSLQPELEQANEVTGEEQLIPDAEPSETRTKGSEISLRLPHPAPPRKSSKFCQIEGCPNHLTSENLTLHKFPHSVDMCAKWQHNTQVPFDPDFRWRYRICSAHFEPICLINMRLVHGSVPTLHLGPRAPRQLFESDFEAINMRLDKQKSSSDQQLEQREQLQLQEEEDEDENALSYLVPEMQLHEDADVDGDADHSDTPVTYSNKWKDLRLPSIKQEKTMTLTNPVKSGYDKCSLVHCQRQRSQHGVHLYKFPRSKQLQKHWMHNLRIIYDERRPWKTMICSAHFEPNCIRLRKLRPWAVPTLELGDNVPQQIYTNEQSRMQLEMGSDNEELELSTDVSMSMSVSVEDTQLEGAYEDDDYDDEDDDYLATEPLVKRERRSRFDPLRPGQLPPWKMKFCSLPYCRSPRGDGIKLFRLPNNISSIRKWEKATGIHFTESQRNTKLICSRHFDPQLIGVRRLMYNAVPTLHLGPISAVSQPVAAPAAPRCCMPDCQEEVKLHKFPSDPLLLHQWCHALDLSDTQRYSSKHICARHLPAKAPSCIVCGVEHIKMPLLDFPENRNQRAKWCYNLKIEPIPKWDSSKQICSKHFESYCFVQPGQLLPEAAPTLHLQHNDSNIFLNDYTIDHSKMLRIKDEPLDSEDLML

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00601834;
80% Identity
-