Basic Information

Gene Symbol
-
Assembly
GCA_035042485.1
Location
JAWNLH010000400.1:6590738-6601978[-]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
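The consensus quoted above can be written as a regular expression for a quick, rough scan of a protein sequence. This is only a minimal sketch: the pattern is a literal transcription of the consensus string, the names `C2CH_PATTERN` and `find_c2ch` and the toy sequence are illustrative assumptions, and a match is not a substitute for the PF05485 profile HMM search reported below.

```python
import re

# Literal transcription of the consensus quoted in the description:
# Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His
C2CH_PATTERN = re.compile(r"C.{2,4}C.{35,50}C.{2}H")


def find_c2ch(protein: str) -> list[tuple[int, int]]:
    """Return 1-based inclusive (start, end) spans of non-overlapping matches."""
    return [(m.start() + 1, m.end()) for m in C2CH_PATTERN.finditer(protein.upper())]


# Hypothetical toy sequence (not from this record): spacers of 2, 40 and 2 alanines.
toy = "MK" + "C" + "AA" + "C" + "A" * 40 + "C" + "AA" + "H" + "GG"
print(find_c2ch(toy))  # [(3, 50)]
```

Because `finditer` reports non-overlapping matches, closely spaced cysteines can hide alternative alignments of the same motif; the scan is meant as a sanity check only.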
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 29 10 1.7e+04 -5.2 2.6 21 71 111 160 90 175 0.51
2 29 6.8e-15 1.2e-11 45.2 4.2 1 86 296 368 296 369 0.85
3 29 2.9e-15 4.8e-12 46.4 4.6 1 87 396 465 396 465 0.83
4 29 7e-16 1.2e-12 48.3 0.4 1 87 487 559 487 559 0.85
5 29 6.1e-16 1e-12 48.5 5.7 1 87 654 724 654 724 0.83
6 29 8.6e-15 1.5e-11 44.9 3.4 1 86 748 819 748 820 0.81
7 29 1.2e-12 2.1e-09 37.9 1.2 1 87 855 923 855 923 0.80
8 29 3e-11 5.1e-08 33.5 1.9 1 86 971 1040 971 1041 0.77
9 29 6.2e-17 1.1e-13 51.7 0.3 1 86 1068 1137 1068 1138 0.82
10 29 3.4e-12 5.8e-09 36.5 1.3 1 86 1159 1228 1159 1229 0.80
11 29 4.8e-15 8.2e-12 45.7 1.3 1 86 1256 1327 1256 1328 0.85
12 29 2.1e-13 3.5e-10 40.4 2.1 1 85 1404 1472 1404 1474 0.82
13 29 3.1e-12 5.2e-09 36.7 0.1 1 86 1497 1565 1497 1566 0.82
14 29 5e-14 8.5e-11 42.4 0.9 1 86 1720 1788 1720 1789 0.82
15 29 2.7e-11 4.5e-08 33.6 0.6 1 61 1842 1896 1842 1917 0.80
16 29 2.6e-05 0.044 14.5 0.1 1 58 1923 1974 1923 1998 0.79
17 29 3.2e-11 5.5e-08 33.4 0.1 1 86 2013 2082 2013 2083 0.83
18 29 4.6e-14 7.8e-11 42.5 1.5 1 87 2141 2211 2141 2211 0.81
19 29 7.4e-13 1.3e-09 38.6 0.5 1 86 2246 2317 2246 2318 0.81
20 29 1.6e-13 2.7e-10 40.8 1.2 1 87 2328 2400 2328 2400 0.81
21 29 1.8e-14 3.1e-11 43.8 0.2 1 87 2423 2494 2423 2494 0.77
22 29 6.3e-06 0.011 16.4 0.1 1 58 2527 2580 2527 2599 0.84
23 29 7.4e-15 1.3e-11 45.1 0.1 1 86 2618 2690 2618 2691 0.80
24 29 4.6e-14 7.8e-11 42.5 1.4 1 86 2825 2897 2825 2898 0.81
25 29 5.4e-15 9.2e-12 45.5 2.7 1 87 2961 3032 2961 3032 0.85
26 29 8.6e-15 1.5e-11 44.8 4.0 1 86 3145 3215 3145 3216 0.85
27 29 2.1e-13 3.6e-10 40.4 0.1 1 87 3308 3378 3308 3378 0.85
28 29 9.7e-10 1.6e-06 28.7 0.3 1 58 3395 3443 3395 3459 0.86
29 29 2.1e-08 3.6e-05 24.3 1.5 18 87 3460 3518 3450 3518 0.75
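The rows above are whitespace-separated with 13 fields each: domain number, total domains, c-Evalue, i-Evalue, score, bias, HMM from/to, alignment from/to, envelope from/to, and acc. The sketch below shows one way to read such rows and keep only the confident hits. The names `DomainHit` and `parse_rows` and the cutoff `MAX_I_EVALUE` are illustrative assumptions, not values used by this database; the three sample rows are copied from the table.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    number: int      # domain number within the protein
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float     # bit score
    ali_from: int    # alignment start on the protein (1-based)
    ali_to: int      # alignment end on the protein (inclusive)

    @property
    def ali_length(self) -> int:
        return self.ali_to - self.ali_from + 1


def parse_rows(text: str) -> list[DomainHit]:
    """Parse 13-column domain rows; lines that do not start with a number are skipped."""
    hits = []
    for line in text.strip().splitlines():
        f = line.split()
        if len(f) != 13 or not f[0].isdigit():
            continue
        hits.append(DomainHit(int(f[0]), float(f[2]), float(f[3]),
                              float(f[4]), int(f[8]), int(f[9])))
    return hits


# Three rows copied from the table above (domains 1, 2 and 16).
SAMPLE_ROWS = """
1 29 10 1.7e+04 -5.2 2.6 21 71 111 160 90 175 0.51
2 29 6.8e-15 1.2e-11 45.2 4.2 1 86 296 368 296 369 0.85
16 29 2.6e-05 0.044 14.5 0.1 1 58 1923 1974 1923 1998 0.79
"""

MAX_I_EVALUE = 1e-3  # assumed cutoff, chosen only for illustration

kept = [h for h in parse_rows(SAMPLE_ROWS) if h.i_evalue <= MAX_I_EVALUE]
for h in kept:
    print(h.number, h.ali_from, h.ali_to, h.ali_length)  # 2 296 368 73
```

With this assumed cutoff, domain 1 (i-Evalue 1.7e+04) and domain 16 (i-Evalue 0.044) are dropped, while domain 2 is kept.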

Sequence Information

Coding Sequence
ATGCCCATTTGTCAGCGCTGCAAAGAGGTGTTCTTCAAGAAGCACATTTACCTGCGGCATGTGGCCGAGAGCAGTTGCACGATACACGAGTATGAGTTCAAGTGCAACATCTGTCCCATGTCCTTTATGGGCGTCGAGGAGCTGCAGAAGCACAAGCATCTGCATCGAACCGACAAGTTCTTCTGCCACAAATACTGTGGCAAGCATTACGACTCGATTGCAGAGTGCGAATCGCACGAGTACATGGAGCACGAGTACGATAGCTTTGTGTGCAATATGTGCTCTGTTACGTTCCCCACACGTGAACAGCTGTATGCCCATTTGCCGCAACACAAGTTCCAACAGCGTTACGATTGCCCCATTTGCCGGTTGTGGTACCAAACGGCATTAGAGCTGCACGAGCATCGACTGGCGGCGCCCTACTTTTGCGGCAAGTATTACGCTGCAgcacaccagcagcagcagcaacagcagcaacaacaacagcaacaacaacaacaacagcagcagcagcagcatacacaacaacatcagcaacagggCAACTACAAACTGCAGGATTGTCACATGGGCACCATTGAAAtgaCTCCACCGCAGCACAAGGCGAATGCGGCATTACCGGCAACGGCGGCGCTCAATTCGCTGTTGCAGCAACGCCAGGCGAACGCTGATGGTGCCGCTTTATATGCCTCGGCGCTGAAGAGCGAGACGAACGTGAAACTGGAGCGCAGCTATAGCAACTCCACCAGCGAGTCTGGTTACAGTATGCACGAGAGCAGCTATAACAATGCCTACACCAGCGACACTTCTCTGCATGGTGGGGGCGGTGCAATTGGTGGTCCGCAGGCGCATTCCTCGACGCTGGACGATTCGGAGGATGCGCTGTGCTGTGTGCCACTTTGCGGGGTGCGCAAGAGCACAAGCCCCACGCTGCAATTCTTTACGTTTCCCAAAGATGACAAGTACTTGCATCAGTGGCTGCATAACCTTAAGATGTTCCACATTCCGGCGTCGAGCTATGCCAGCTTTCGCATCTGCAGCATGCACTTCCCTAAGCGTTGCATCAATCGTTACTCTCTATGCTATTGGGCGGTGCCCACATTTAATCTGGGCCACGACGATGTGGCCAATCTCTATCAGAATCGTGAGCTGACCAACACATTCACCACCGGCGAGGTGGCCCGCTGCAGTATGCCAAACTGCACTAGCCAGCGTGGCGAGAGTAATCTGAAGTTCTACAACTTTCCCAAGGACATCAAGAGTTTGATTAAGTGGTGCCAAAACGCTCGCTTGCCCGTCCAGGCCAAGGAGCCGCGCCACTTCTGCAGTCGCCACTTCGAGGAGCGTTGCATTGGCAAGTTCCGGCTGAAGCCATGGGCAGTGCCCACGTTACACCTTGGCGCCCAGTACGGCAAGATTCatgacaatcccaaaaatctGTACGTGGAGGAGAAGCGCTGCTGCCTCAACTTTTGTCGTCGCAGTCGCTCCTCCGACTTCAACATGTCGCTGTATCGCTTCCCCAGGGATGAGGTGCTACTGCGTCGTTGGTGCTACAATCTACGCCTTGATCCGGCTGTCTATCGTGGGAAGAATCACAAAATTTGTAGCGCTCACTTTATCAAGGAAGCTTTGGGATTACGCAAACTATCTCCGGgCGCTGTGCCCACGCTGCATCTGGGTCATAATGACACCTTTAACATCTACGAGAACGAACTGTGGCCACCGCCAACGCCCTCCACGCCCACCCAcaatcatcagcagcagctgcagcaacaccagttgcagcagcagccgcaacaagtGCATCATAAATATCAGCGGCACTCGGCTGCATCCACATCCTCGTCGGCCTCGTCGACATCGCATTATGTGGATCCGGAGCTAAGTGCATCCTACATGAGCATGGGCGCTGGAGGCTCATCGTCTGGCCTTAATGTCAGCGACAGCATGGATGTCTGCTGTGTGCCCAGCTGCGAGAGCAAGCGTCACAACAATGA
GAACATCACATTCCACACAATACCCAGGCGGCCAGAGCAGATGCGCAAGTGGTGTCACAATCTTAAGATTCCCGAGGACAAGATGCACAAGGGCATGCGGATATGTAGCTTGCACTTTGAGCCCTACTGCATTGGCGGCTGCATGCGTCCGTTTGCGGTGCCGACATTGCATCTGGGCCACGACGACGAGGACATTCACCGCAATCCGGATGTGATCAAGAAGCTCAACATACGCGAAACTTGCTGCGTGGCTGTTTGCAAACGCAATCGTGACCGGGACCATGCCAATCTGCATCGCTTTCCCAGCAATGTGCCGCTGTTGACCAAATGGTGCGCAAATCTGCAGCGTCCTGTGCCGGATGGCAGTAAACTCTTCAACGATGCCATCTGTGAGGTGCACTTTGAGGATCGATGCCTGCGCAACAAACGACTGGAAAAGTGGGCAGTGCCCACACTCATCCTTGGGCATGAGAATATACCCTATCCGCTGCCCACACCGGAGCAAGTTGCCGAGTTCTATGCGCGTCCCACTGCGCCTAACAATGGCGAGGAGCAGGGCGAGTGCTGTGTGGAGACGTGTAAGCGTAATCCCAGTGTTGATGACATCAAGCTATATCGCCCGCCCGAGGAGTCGCAGGTGCTGGTAAAGTGGGCGCACAATCTCCAACTGGAGATTGCCCAGCTGCCCAATATGAGAATATGCAATCTGCATTTCGAATCCCACTGCATTGGCAAGCGGATGCGTCCCTGGGCAATACCCACGCTCAATCTGGCAACCAACATAGAGAATCTCTACGAGAATCCCGAACACCAGATGCTCTACAAGCGGCGCACACATCTCAAGCCGGGCAGAGTAGCGCGAGGCTCTGAAGCAAGCGCTGGTGGTGTGAAGCCCACCTGGGTGCCacgctgctgcttgccacactgCCGCAAGGTGCGTGCCACGCACAATGTCCAGCTGTATCGCTTCCCCAAACTCAATCGCTCCACGCTGGCCAAGTGGGCGCATAATCTGCAGGTACCGCTCGTGGGCAGCGCTCAGCGTCGCCTCTGCTCCGCCCACTTTGAGCCGCATGTGCTCAGCAAGAAATGCCCGGTGCCCATGGCTGTGCCCACACTGGACCTCAATACACCATCCGGCTACAAGATCTATCAGAATCCGGCCAAGCTCAAGGCGAATAAGCTGTGCTTGCAGCGTGTCTGTATTGTGGAGAGCTGCCGGCGTCAGCGGGCGCAGGGGGTACAGCTCTTCCGTCTGCCTCACAGCCCCACCCAGCTGCGTAAGTGGATGCACAACATCCGCATGCGTCCCCGAGGAGCTATGCGACAACAGTATCGCATCTGCTCGAAGCACTTCGAGACGCACTCGTTCAATGGGAAGAGACTCAGTGCGGGTGCAATTCCAACGCTTGAGTTGGGCCATGAGGACGAAGACATATTTCCGAATGAGGCGCAGTCTTTCGTGGAGGAGCACTGCACCGTCGAGGGCTGCGATGCCGTCAAGGAGCAACCGGATGTGCGTCTATTCCGCTTCCCCAACGACGATGAGGATCTGCTCTGGAAGTGGTGCAACAATCTGAAAATGAGTCCGGTCGACTGCATCGGCGTTCGCATCTGCAACAGACACTTCGAGACTGATTGCATTGGACCAAAGCACCTGTTCAAGTGGGCCATTCCCACGCTCTCCCTCGGccacgatgatgatgacatcGAGCTGATGCTGAATCCCAAGCCGGAGGAGCGCTATATTGATCCGGTCTTCAAGTGCTGTGTGCCCTCGTGCGGCAAGACGCGTAAATTCGATGAAGTGCAGATGAACAGTTTTCCCAAAGATCTGGAGCTCTTCCAGCGCTGGCGCCACAATCTCCGCCTCGATCATCTCAACTTCAAGGAGCGCGAACGCTATAAGATCTGCAACGCCCACTTCGAGGACATTTGCATTGGTAAGACGCGCTTGAACATTGGCTCCATACCGACACTGGAGCTTGGCCATGACGAGA
CTGATGACTTGTTCCAAGTCAACCCCGAGGAGCTGCAGAGCAATCTCTTTGGACGCCAGAGACGCGTGCAAGATTCCACGAGGATCAACATTAAGCAGGAGGCGCACTCCGACCTCGATGAAGACACTAAACCGGACATTAACATGTCGGTGGCCACAGATTCAAATACAACACAGgttaaaatcaagaaatcTATGACCGATTTCAAGTGCTGTGTGCCGAGCTGTAGTCGCAGTCGCCTGGAGCATGGTGCCCGCCTTTTTCCGTTTCCGAACgggaaacagcagcagagcaagTGGCGCCACAATCTCCGGCTGCCTGCTGCCGACGTGGACAAGACGACGCGCATCTGCAGCGCTCATTTCAATCGCCGTTGCATCGATGGCAATCAGCTGAGGGGCTGGGCAATGCCCACACAGCAGCTGGGCCATCAGGAGCTGCCGATCTATGAGAATCCAAAGAATATACCGGGCTTCTTTACGCCCACCTGTGCGCTGGCGCACTGCCGCAAGCGGCGGAGCATTGACAACGATCTGCGTACCTATCGCTATCCACGCAGCGAGGAGCTGCTCGAGAAGTGGCGTGTCAATCTGCGCTTGTCGCCGGATCAATGCCGCGGACGCATCTGTGCGGATCATTTCGAGCCACTGGTGCGTGgcaagctgaagctgaagacTGGAGCAGTGCCTACGCTCAAATTGGGACACGACGAGGGCGTAGTCTTTGATAATGAAGGCATTAAAGCGGGTctgcagcaggaggaggagggggcgGAGGAAGAAGAGGGCGATACTAGCTTGAAGTCGTTGATCAAAGTAAAGACTGAGCAGGAGGATGATCAGCAGCTAGAGAATGAAGACGaagagcagcaggagcaggagcaggatcTGGAGCAGGATCAAGATATGGACGAGGATGGGGAAGAGCATCGGGACTCTGAGGAACATGGTTATTTTGATCCCTTGGAACTTGTGGAAACCTACGCTGAGCACCACAGCGATGATAACTCTGCCGGACATGATAATctcgacgatgatgatgacgaagaTGAGGACATTCCCGGCAATGACGATGAGCTGCTTCTGCCTGATACACGGTCACTTCGAATGACAACGGCTCCGCGGCGCGAGAAGGCTGTGAATAATGTGACGCCCATTTGCTGTCTGAAGCACTGTCGAAAGGAGCGCACTGCCATCCATCATCTGAGCACCTTTGGTTTTCCCAAGGatccgcagctgctgctcaagtgGAGCGCCAATCTGCAGCTGCCATTGGAGTCGTGCATGGGTCGTGTATGCGTCGAGCACTTTGAGCCCTCGATGCTGGGCACGCGCAAGCTGAAGCAGAATGCGGTGCCCACCTTGAAACTGGGCCATGCCACACCGCTCACCTACAGCTGCAATGGCCGGATGCTGTCGGGCATTTACGACGAACAGCCGCAGCATTCGGTTTTTCGGCTTTGGAGCCTGAAACACTGCCGCAAACGGAAACCGGATCTGGCGGAGATTAAGCCCGGTCGTCGCTGTTGCCTGCCAAGTTGCGGCAAGCAGTCGGAGTTGCATGGCGTCCAGCTGCAGCGTCTGCCCAAGGATCGTCTGATGCTGCGCAAATGGTTGCACAACCTGAAGCTGCCTCCAACGATGGACTGCACCCAAATGTTCCTCTGCAGCGATCACTTTGAGCTGAATGCGCCGTGTCCCACTTTGAAACTGGGCCACTCGGATACCAATATTTATCGCCACAATGTGGCTAGCACCAGTGGCAGCTGCCTGGTGCCCAAATGTACTTGTGCTCGTCTCAATCTCTATCGCGGCTATGATCTGCCTGCGCATCCGCAGGTGCAACAGGCCTGGCTACACTGGCTGCAGCTGCCCCATCCGCAGCCGTCGCCCAGGCACGCCCAGCTGTGTGTGATGCATTTTATGCAGCTCTACGAACTGGTGCCGCTGCCCGAATCGGTGCCAGATGTTGTGCGCAGGCAGCTGCGTGAGACTTACGAA
CTGATATCCAGCTCCAGCATGGCCATGAAGCTGCGTTGCGCTGTGCCCGGCTGCTACTCGAAGTATACGGACAATGTGCGTCTGACCAAGCTGCCCGTTTACCCCGACACCTGCGCCAAGTGGGTGCACAACACCAAGATTCAATATGACCCGGCCCGACATTATGTCTATCGCATCTGCATGTTGCACTTTGAGCCAGGCTGCCTGGGCCCAGTGCGTCCCAAAGTGTGGGCAATGCCAACGCTGCAGCTGCACCACAAGGATGCCAACATCTATTTAAATCCAAAGCTGGATGGCAGCCAAACACAGCCGGCCGTGCCGCTGGACCTGCCGCTGCGCATTAAAACTGAGCTGCCGATGAGCAACAGTCCCAGCTTTAGTGCGAGTGCCAGTCCCAGTCCGCGTGGCAAGCTGCGCACTTGCTGCATTCCCAGCTGCGGTCAGCAGGCTTCGGCCCTGACGCGTCTCTTTCGCTTTCCCAGCGCAGAGACGTCGATGCTGAAGTGGCTGGTGAAtacccagcagcagccgcgCTTTGTCGATGCACAACGGCTGTTCGTCTGCCAGGATCACTTCGAGGCGGAGGCCATTTGCAAGAATCAGCTGCGCAGCTGGGCGGTGCCAACAGTGAATCTAGGACACGATGGACACGTCATACCGAATGCCAAGCACAATGGCAACATTGCCGACAGCCAGGAGAACAAGCAGACGCTGCAGTTTATCTGGGCCAACTACTGTTCGGTGCTGACCTGCTTCCAGCTAAGTAGCGAGCAGCTGCGTCTCTACCAATACCCCACGGATCGGCCAACCATCCGCAAGTGGGCCGCCAATTGTAAGCATCGCTCCATGCAGGCCAGCAGTGATGGATTCCAGGTGTGTCAGTCGCATTTTACGCAGGATTGCTTTGATCCTGATACCGGGGAGCTGAAGGAGGACGCAGTGCCCACACTGGCGCTGAGCCGGTCTGTCACTGAGGTGCGCTGTGTGGTCAATGGTTGCGTTAAGGACGAAGATGCATCGCGTCGCCGTCTGTTCAAGATGCCCAAGCGTAACCCACAGATATTGGATTGGTGCCACAATTTACGACTGGATCAGGCGGCCATGAGCGGCTCGGAACAGCACGTTTGTGAACGCCACTTCGAGGCAAACTGCTTCAATGCATCTAGAGTGCTGCGTCCAGGAGCACGACCCACACTTCATTTAGGCCATGAGGACCTAGACGATGTGATACCCAATCCGGCGAACTGGGAAGAGGATGTGATCGTGTGCTGTGTGCCCCACTGCGAAAGCTCCAAGGATGCGGACGAAGTCCAACTGTTTGGGCTGCCAAAGGTGCGCCAGTTGGCGGACAAGTGGCTGCAAAATGTGCGCCTCGATCCGAGCAAAGAACAACTGGCCAGCCTGAAGATCTGCAGTGTGCACTTTGAGGCGAGCTGCATGGAGAATGGACGACCCACCTATGGTGCAATGCCCACACTCCATCTCGGTCACGATGAGCTCGACAATATACACCCAAGCGTAGAGTCGGTGCCGACGCAGCAGAAGCGCTACTGCAATAGAGATGGCGCCAGTCACGACTGCTGCTATCCGCAGTGCGTGGAGCTGCAGAAGAGCTATCTGCGGGTCACCTACGAGCTGCCCCAGGAGCAGGAGCTCCGTCAGCAGTGGCTCTCCTATATGGGCCTGGAAGCGCAGCAGCTCGATAAGCAGCATCTGCCCAAGCTCTGTCCACTCCACCTAATCTTGCTCTACGATCACAGTGCGGATCACTTTTCGGCACACGCCGCTGAGGAGCTGTTGGACTCCAATTATGAGGCTGCGCGCAGCAGCGTTCGCATACGCGTTGTCAGCTGTGCTGTGCGCGGCTGCAGAACGCTCAAACCACGCGACGGTGGTCGGCTGCATGGCTTGCCCACGCGGCGAGATCTGCTGGAGATGTGGCTGCACAACATGCAGCTGGTTTTTTACGAGCAGCAGCGTTATATGTACAAGATTTGCAG
CAAGCACTTTGAGTCCACATGCTTCACGGAGACAACCAAGCGCCTGAAGCCGTGGAGCATGCCTACGCTGGAGTTGCCGGAGCGCGAACCGGGCGAGATGCCTGCCTATCAGAATCCCACTGAGTTGGAGTGGCAACACATGAATGAGCTGCAGGTCAGCGAGAAAGTTGTTGAGGCTCAGCCGGAGCCATTACTCAAGCTGGAGCCGTTGCCCAAGAAggagccaccaccaccgcagGTTATGGAATATGAAGAGGATTGCGACAATAACTCACAGCAGCCACTTGAAATGCAGGCGCTGGAGGTGCTGCTCGAGGTGGGCCATGTCGAGAAGTGCACCACCTACGAGCAAATGGATACCGAGGCAAATCTCAGCTATGCTGAACAGTTCTTGCACAATCCCCTCAGTCCAGGTCCACCCCAATGCCGTATCCCCGTTGTCCAGAATGGACTCCACTACAGTGCACGCCACTGCAGCGTGCATGGCTGCAATGTCACCTCCAATAAtctgagcagcagcatcaagcTACACAAGTTCCCCGTCTCGCTGGATGCCATGCAAAAGTGGATGCACAACACCCAGGTGCTCGTGGACGTCAAATTCGCTTGGCGTTTTCGCATCTGCAGTCATCATTTCATCGAGGATTGCTTTCACGGCTCGCGCATCAGACGTGGGGCGATGCCCACGTTGCGACTGGGCTCACGTCGACCGAAGCATATCTATGATAATGAGTTCAACGCCCAATTGCAACTGGAACAGTCCAAAGAAGAGGCCAAGGAGGCTCTCGCTGCCCCGCTGGAGTctcagcaacagttgctctCTGCGAATGTAGGTCTTCGCCTGCCGCGTCCAGCCCCGCCCTGCAAATCCAGCAAATACTGTCAGATCGAGGGCTGCTCCAATCATTTGACCAGCGAAAATGTGACGTTGCACAAGTTCCCCCATTCGTCGGATATGTGCGCCAAGTGGCAGCACAACACTCAGGTGCCCTTCGATCCCGAGTTCCGTTGGCGCTATCGCATCTGCAGCGCACACTTTGAGCCCAGCTGTCTAGGCAATGTGCGACTGATGCACGGCAGTGTGCCCACCTTGAATCTGGGACCGCTTGCGCCCAAGAAACTGTTTGACAATGAATTCTTGCGTCTGGACAAGCCAATGAGCGGTTCGGAGCTGGGTATGACCGTCAAACAGGAAGAAATGGAGCAATTTGATCAACTGGAGCTGGAAGATGGCAACCAGGAGCAGGATGATTTCAGTCTCCTGGAGCCCGAGCTACAGTTGCACGAGGGTAGCGAGGATGAGCAACAATATGACAATCATTTCAGCCAAACCGATTCCTATAACTGGTCCGATCAGCAGCTGCGTCTGCCCAGCATTAATCAGGAGAAGTGCACCACCAGCTACAATCCAGTCAAGTCCGGCTATGATAAGTGCTCACTGGTGCACTGCCAACGACAGCGTTCCCAGCACGGCGTGCACATCTACAAGTTTCCGCGCTCGCGTCAGCTACAGCAACGATGGATGCATAATTTGCGCATCCAATACGATGAGCGACGGCCGTGGAAGACAATGATATGCAGTGTCCATTTCGAGCCGCACTGCATCCGTCTGCGCAAGTTGCGTCCCTGGGCGGTGCCCACGCTAGAGCTGGGGGACAATGTGCCGCTGGAGATCTTTACGAATGAGCAGAGCCAGCAGCTGTTTGCTCAGTCCGAAGCAGGCAGCGAGTGTGATGAGGTTGAAATGGATGTTGAGGACACCATGCTGGAGGACTTGGATGATGACTATGATGACAATGATGCtgatgtgaatgtgaatgctgATGATCAAATGCGAACAGCTCCATATGTCAAAAGAGAGCGTCGCTCTCGATTTGATCCTCTGCCACCGGGTCAGCTGCCACCGTGGAAGATCAAATGCTGCTGTTTGCCCTATTGCCGCAGTCCTCGCGGTGATGGCATCAAGCTCTTTCGACTGCCCAACAACATCAGCTCCATAC
GTAAATGGGAGCAGGCCACAGGCATGCGCTTCTATGAGTCCCAGCGCAACACAAAGCTCATCTGCAGTCGACACTTTGATCCGCAGCTTATAGGCGTGCGTCGCCTCATGTCTAATGCGGTACCCAGCCTCCATTTGGGCCCAGACAGCGCAGAGCCCGAGCTGCCTCCTGTGGGACCACGTTGCTGCATGTCCGATTGCTCTGAGGATGACAATGTCCAGCTGCACAAGTTTCCCAAAgATCCCATGCTGCTGCATCAATGGTGTCAGGCGCTCAATCTACCGGATGTTCAAAGCTACTCCGGCAAACTCATTTGTGCGGCACATCTGCCCTCCAACGCGATGAGCTGTCTAATTTGTGGCGTGGACGATGTACAGCTGCCAATGCTGGACTTTCCCCAGAATCGCAATCAGCGCACTAAATGGTGctacaatttgaaaatcgaGCCTCTGCCCAAGTGGGACAACTCAAAGCAAATTTGCTGCAAACACTTTGAGAGCTTTTGCTTTATCCAGCCTGGTCAACTTCTTGCGGAGGCATTGCCCACGCTACACTTGGAGCACGGCGATAGCAACATATTCCTAAACGATGAGGCCATGGATAACAGCAAGTTGTTGCGCATCAAGGACGAGCCCATGGAGAGTGAAGATCTGATGCTGTAA
Protein Sequence
MPICQRCKEVFFKKHIYLRHVAESSCTIHEYEFKCNICPMSFMGVEELQKHKHLHRTDKFFCHKYCGKHYDSIAECESHEYMEHEYDSFVCNMCSVTFPTREQLYAHLPQHKFQQRYDCPICRLWYQTALELHEHRLAAPYFCGKYYAAAHQQQQQQQQQQQQQQQQQQQQQHTQQHQQQGNYKLQDCHMGTIEMTPPQHKANAALPATAALNSLLQQRQANADGAALYASALKSETNVKLERSYSNSTSESGYSMHESSYNNAYTSDTSLHGGGGAIGGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDDKYLHQWLHNLKMFHIPASSYASFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPNCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPAVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPTPSTPTHNHQQQLQQHQLQQQPQQVHHKYQRHSAASTSSSASSTSHYVDPELSASYMSMGAGGSSSGLNVSDSMDVCCVPSCESKRHNNENITFHTIPRRPEQMRKWCHNLKIPEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLHLGHDDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVPLLTKWCANLQRPVPDGSKLFNDAICEVHFEDRCLRNKRLEKWAVPTLILGHENIPYPLPTPEQVAEFYARPTAPNNGEEQGECCVETCKRNPSVDDIKLYRPPEESQVLVKWAHNLQLEIAQLPNMRICNLHFESHCIGKRMRPWAIPTLNLATNIENLYENPEHQMLYKRRTHLKPGRVARGSEASAGGVKPTWVPRCCLPHCRKVRATHNVQLYRFPKLNRSTLAKWAHNLQVPLVGSAQRRLCSAHFEPHVLSKKCPVPMAVPTLDLNTPSGYKIYQNPAKLKANKLCLQRVCIVESCRRQRAQGVQLFRLPHSPTQLRKWMHNIRMRPRGAMRQQYRICSKHFETHSFNGKRLSAGAIPTLELGHEDEDIFPNEAQSFVEEHCTVEGCDAVKEQPDVRLFRFPNDDEDLLWKWCNNLKMSPVDCIGVRICNRHFETDCIGPKHLFKWAIPTLSLGHDDDDIELMLNPKPEERYIDPVFKCCVPSCGKTRKFDEVQMNSFPKDLELFQRWRHNLRLDHLNFKERERYKICNAHFEDICIGKTRLNIGSIPTLELGHDETDDLFQVNPEELQSNLFGRQRRVQDSTRINIKQEAHSDLDEDTKPDINMSVATDSNTTQVKIKKSMTDFKCCVPSCSRSRLEHGARLFPFPNGKQQQSKWRHNLRLPAADVDKTTRICSAHFNRRCIDGNQLRGWAMPTQQLGHQELPIYENPKNIPGFFTPTCALAHCRKRRSIDNDLRTYRYPRSEELLEKWRVNLRLSPDQCRGRICADHFEPLVRGKLKLKTGAVPTLKLGHDEGVVFDNEGIKAGLQQEEEGAEEEEGDTSLKSLIKVKTEQEDDQQLENEDEEQQEQEQDLEQDQDMDEDGEEHRDSEEHGYFDPLELVETYAEHHSDDNSAGHDNLDDDDDEDEDIPGNDDELLLPDTRSLRMTTAPRREKAVNNVTPICCLKHCRKERTAIHHLSTFGFPKDPQLLLKWSANLQLPLESCMGRVCVEHFEPSMLGTRKLKQNAVPTLKLGHATPLTYSCNGRMLSGIYDEQPQHSVFRLWSLKHCRKRKPDLAEIKPGRRCCLPSCGKQSELHGVQLQRLPKDRLMLRKWLHNLKLPPTMDCTQMFLCSDHFELNAPCPTLKLGHSDTNIYRHNVASTSGSCLVPKCTCARLNLYRGYDLPAHPQVQQAWLHWLQLPHPQPSPRHAQLCVMHFMQLYELVPLPESVPDVVRRQLRETYE
LISSSSMAMKLRCAVPGCYSKYTDNVRLTKLPVYPDTCAKWVHNTKIQYDPARHYVYRICMLHFEPGCLGPVRPKVWAMPTLQLHHKDANIYLNPKLDGSQTQPAVPLDLPLRIKTELPMSNSPSFSASASPSPRGKLRTCCIPSCGQQASALTRLFRFPSAETSMLKWLVNTQQQPRFVDAQRLFVCQDHFEAEAICKNQLRSWAVPTVNLGHDGHVIPNAKHNGNIADSQENKQTLQFIWANYCSVLTCFQLSSEQLRLYQYPTDRPTIRKWAANCKHRSMQASSDGFQVCQSHFTQDCFDPDTGELKEDAVPTLALSRSVTEVRCVVNGCVKDEDASRRRLFKMPKRNPQILDWCHNLRLDQAAMSGSEQHVCERHFEANCFNASRVLRPGARPTLHLGHEDLDDVIPNPANWEEDVIVCCVPHCESSKDADEVQLFGLPKVRQLADKWLQNVRLDPSKEQLASLKICSVHFEASCMENGRPTYGAMPTLHLGHDELDNIHPSVESVPTQQKRYCNRDGASHDCCYPQCVELQKSYLRVTYELPQEQELRQQWLSYMGLEAQQLDKQHLPKLCPLHLILLYDHSADHFSAHAAEELLDSNYEAARSSVRIRVVSCAVRGCRTLKPRDGGRLHGLPTRRDLLEMWLHNMQLVFYEQQRYMYKICSKHFESTCFTETTKRLKPWSMPTLELPEREPGEMPAYQNPTELEWQHMNELQVSEKVVEAQPEPLLKLEPLPKKEPPPPQVMEYEEDCDNNSQQPLEMQALEVLLEVGHVEKCTTYEQMDTEANLSYAEQFLHNPLSPGPPQCRIPVVQNGLHYSARHCSVHGCNVTSNNLSSSIKLHKFPVSLDAMQKWMHNTQVLVDVKFAWRFRICSHHFIEDCFHGSRIRRGAMPTLRLGSRRPKHIYDNEFNAQLQLEQSKEEAKEALAAPLESQQQLLSANVGLRLPRPAPPCKSSKYCQIEGCSNHLTSENVTLHKFPHSSDMCAKWQHNTQVPFDPEFRWRYRICSAHFEPSCLGNVRLMHGSVPTLNLGPLAPKKLFDNEFLRLDKPMSGSELGMTVKQEEMEQFDQLELEDGNQEQDDFSLLEPELQLHEGSEDEQQYDNHFSQTDSYNWSDQQLRLPSINQEKCTTSYNPVKSGYDKCSLVHCQRQRSQHGVHIYKFPRSRQLQQRWMHNLRIQYDERRPWKTMICSVHFEPHCIRLRKLRPWAVPTLELGDNVPLEIFTNEQSQQLFAQSEAGSECDEVEMDVEDTMLEDLDDDYDDNDADVNVNADDQMRTAPYVKRERRSRFDPLPPGQLPPWKIKCCCLPYCRSPRGDGIKLFRLPNNISSIRKWEQATGMRFYESQRNTKLICSRHFDPQLIGVRRLMSNAVPSLHLGPDSAEPELPPVGPRCCMSDCSEDDNVQLHKFPKDPMLLHQWCQALNLPDVQSYSGKLICAAHLPSNAMSCLICGVDDVQLPMLDFPQNRNQRTKWCYNLKIEPLPKWDNSKQICCKHFESFCFIQPGQLLAEALPTLHLEHGDSNIFLNDEAMDNSKLLRIKDEPMESEDLML

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00595971;
90% Identity
iTF_00553068;
80% Identity
-