Basic Information

Gene Symbol
-
Assembly
GCA_900245975.1
Location
NW:2792211-2806257[-]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 27 5.8e-15 5.7e-12 45.8 3.6 1 86 597 669 597 670 0.85
2 27 1.9e-14 1.9e-11 44.1 4.8 1 87 697 766 697 766 0.82
3 27 1.6e-15 1.6e-12 47.6 0.2 1 87 788 860 788 860 0.85
4 27 9.6e-16 9.6e-13 48.3 5.0 1 86 963 1032 963 1033 0.82
5 27 6.5e-15 6.4e-12 45.6 3.2 1 86 1057 1128 1057 1129 0.82
6 27 1e-12 1e-09 38.6 0.5 1 87 1164 1233 1164 1233 0.82
7 27 3e-11 3e-08 33.9 2.0 1 86 1273 1342 1273 1343 0.77
8 27 6.7e-17 6.6e-14 52.0 0.4 1 86 1370 1439 1370 1440 0.82
9 27 5.3e-13 5.3e-10 39.5 0.8 1 86 1461 1530 1461 1531 0.79
10 27 4.1e-14 4.1e-11 43.0 0.8 1 86 1558 1629 1558 1630 0.85
11 27 7.6e-13 7.6e-10 39.0 2.0 1 85 1705 1773 1705 1775 0.82
12 27 5.8e-12 5.7e-09 36.2 0.0 1 86 1798 1866 1798 1867 0.82
13 27 7.4e-14 7.4e-11 42.2 0.5 1 87 2042 2111 2042 2111 0.78
14 27 4.4e-09 4.4e-06 26.9 0.0 1 86 2184 2254 2184 2255 0.76
15 27 0.0047 4.7 7.6 0.0 1 58 2269 2313 2269 2327 0.79
16 27 9e-12 9e-09 35.5 0.0 1 86 2349 2418 2349 2419 0.82
17 27 4.4e-13 4.4e-10 39.7 0.1 1 86 2549 2617 2549 2618 0.83
18 27 2.4e-10 2.4e-07 31.0 0.2 1 84 2653 2722 2653 2724 0.80
19 27 1.2e-11 1.2e-08 35.2 0.2 1 87 2736 2808 2736 2808 0.81
20 27 1.6e-14 1.6e-11 44.3 1.7 1 86 2830 2903 2830 2904 0.82
21 27 0.00042 0.42 11.0 0.0 1 58 2932 2987 2932 3004 0.77
22 27 2e-12 2e-09 37.7 0.3 1 87 3022 3094 3022 3094 0.80
23 27 1e-12 9.9e-10 38.6 0.5 1 85 3227 3296 3227 3298 0.76
24 27 8.2e-12 8.1e-09 35.7 4.0 1 86 3355 3425 3355 3426 0.80
25 27 1.9e-14 1.9e-11 44.1 5.2 1 86 3558 3628 3558 3629 0.84
26 27 4.5e-12 4.5e-09 36.5 0.2 1 86 3726 3795 3726 3796 0.84
27 27 4.4e-10 4.4e-07 30.1 1.3 1 87 3817 3899 3817 3899 0.79

Sequence Information

Coding Sequence
ATGTCACAGCATAATCCACATGCCCATCCGCACTACCATCACCACCCACTGCACCAGCATCAGAcacaacaccaccaccaccatcatcagctgcagttgcagcagcagcaacaacaacaacagcaacaacaacaacaacaacaacaacagcagcattcGCAAATGCCACACAGTACACACAATTGGTACTCACATGTTGCTTCCTacccgccaccgccaccccaCCACCATGCGACGGCTACCTCGGCATTTGCCGCCACCTCCACACCTTGtaagggcagtggcagtggcagtgccaacaacaacaacatgagTGCCTATGGAGCAGGCAGCACGCATGGatattatgctgctgctgctgccgccgccgctggcgGTGGGCTCAATGTTAATGCTGTGGGTGGCGTTGGGGGTGGGTCAGCAGCCTATAACCTTGAGGCCAACACAGTGGCTTATGCCCACAACCAGCTGCTGCAGTACCAACAgcaccatcagcatcagcaacagcagcaacaccatcatcaacagcagcagcatctcagTCAGCGTTCCTATATGGGGCATGATATAATGTCTGGGACATATCCGTACATCAAAAGCGAACCAATGGATGTATCAGCGGCGTATCAGCAGCCACCCAATCCGATGGCACCACCCCCAGCGCCAGATATGATAATAAAATCGGAACCCATGGATGAACATCCGTACAAGTCCAACTACATAGACGACAATACGCCCTTTGctgattttaataaattcaacGAATTCAGCGGCGATATGCTCAGCCCCAAAGTTGAGCTAACCATCAAAGATGAGACCTATGGCAAGACctccagtggcagcagctttgCACGCCGCAAAgttctgcagcagccacaacagcaacagccaacgACAGATCGTTCGGCGGAGAGTCTGCCCATTTGTCAGCGCTGCAAGGAAGTGTTCTTCAAGAAGCAAGCCTACCTTAGGCATGTGGCCGAGAGCAGTTGTGGCATACAGGAGTACGATTTCAAGTGCAACATTTGCCCCATGTCCTTCATGAGCAGCGAagagctgcagcggcacaaGCATTTGCATCGCGCGGACAAGTTCTTCTGCCACAAATACTGCGGCAAGCATTTCGATACGATAGCCGAGTGCGAAACGCATGAGTACATGCAGCACGAGTATGAGAGTTTTGTGTGTAATATGTGTTCGGGCACGTTTGCCACACGCGAGCAGCTGTACGCCCACTTGCCGCAGCACAAGTTTCAGCAGCGCTACGACTGTCCCATCTGTCGTTTGTGGTATCAAACGGCCGTCGAATTGCACGAGCATCGGCTGGCGGCTCCATACTTTTGCGGCAAGTACTACAccaatcaacagcaacagcagcagcagcagggcaactATAAGCTGCAGGACTGTCATATGGCCACCATGGAAATGCCCACAGCACCACTGCTTAAAGTACCAACGCCTGCGAATGCCTCGGCCCTGCCAGCCACAGCGGCGTTAAGctctctgctgcagcagcgacaagcGAATGCCGATGGAGCAGCGGCCatgtttgcttctgctgcctcctccaccGCTGCCGTCGCCTCCACCGCCGTGTCGCTGAAGAGGGAGGTAAGCGTCAAGCTGGAGCGCAGCTACAGCAACTCCACCAGCGAGTCCTCGTACAGCCATCAGGAcaacagcagctacaacaatgCCTATGGCAGTGATAGCTCCATTCATGGCGGCGCTCTAGCCGGACCCCAAGCGCACTCCTCAACGCTGGACGACTCCGAGGATGCTTTGTGCTGTGTGCCGCTGTGCGGCGTGCGCAAGAGTACCAGTCCCACGCTGCAGTTCTTTACGTTCCCCAAGGACGAAAAGTATCTGAACCAGTGGCTGCACAACCTCAAGATGTTCCACATACCAGCGGCCAGCTATGCGACATTTCGCATCTGCAGCATGCACTTCCCAAAGCGCTGCATTAATCGCTATTCGCTGTGCTACTGGGCGGTGCCCACGTTCAATCTGGGCCACGACGATGTCGCCAATTTGTACCAGAATCGCGAGCTTACCAACACATTTACCACCGGCGAGGTGGCGCGCTGCAGCATGCCACACTGCACTAGCCAGCGGGGCGAGAGCAATCTGAAGTTTTACAACTTCCCCAAGGACATCAAGAGCCTGATAAAGTGGTGCCAGAATGCCCGCCTGCCGGTGCAAGCCAAGGAGCCGAGACACTTTTGCAGCCGCCACTTTGAGGATCGATGCATTGGCAAGTTCCGACTGAAGCCCTGGGCCGTACCCACTTTGCATCTGGGTGCGCAGTACGGCAAGATCCATGACAATCCCAAGAACCTGTATGTGGAGGAGAAGCGCTGCTGCCTGAACTTTTGCCGTCGGAGCCGCTCCTCGGACTTTAACATGTCGCTGTATCGTTTTCCACGCGATGAGGTGTTGCTGCGACGCTGGTGCTACAACTTGAGACTGGATCCGGGCGTCTATCGTGGCAAGAATCATAAAATATGCAGTGCACATTTCATCAAGGAAGCGCTGGGCTTGAGGAAACTGTCGCCAGGTGCCGTGCCCACGCTGCACTTGGGACACAATGACACGTTTAATATCTACGAGAATGAGTTGTGGCCACCGCCATCGCCCACTGGACAGCATGGGCAGCAGCTTATGCCGCAGCAATCGACGTCGCAGCATCATCATtcctcgctgcagcagcagcagcatccggcAATGCTCAGCAAATCCTATCAGCGCCATTCGGCAGCCTCCACATCATCCTCCGCCAGTTCGGCCTCGCATTATGTGGATCCCGAGATGAGTGGCTCGTACTTGAGCCTCTCTGCGGGTAGCTCCTCGGGCGGCATGAATGCCAGCGACTGCATGGACGTGTGCTGCGTGCCCAGCTGCGAGAGCAAGCGGCACAACAGCGAGAACATCACATTCCACACGATACCGCGGAGGCCCGAGCAGATGCGCAAGTGGTGCCACAATCTAAAGATACCCGAGGACAAGATGCACAAGGGCATGCGGATATGCAGCCTGCACTTTGAGCCATACTGCATTGGCGGCTGCATGCGTCCGTTCGCCGTGCCCACACTGCAGCTGGGgcacgaggacgaggacatACACCGCAATCCGGATGTGATCAAGAAGCTGAACATACGCGAGACCTGCTGCGTGGCCGTGTGCAAGCGAAATCGGGACAGAGACCACGCCAACCTCCATCGCTTCCCCAGCAATGTGGCGCTGCTGACCAAGTGGTGTGCGAACCTCCAACGCACAGTGCCCGATGGCAGCAAACTCTTCAACGATGCCATCTGTGAGGTGCACTTTGAGGATCGCTGTCTGCGCAACAAGCGGCTGGAGAAGTGGGCTGTGCCCACGCTAATACTCGGCCACGAGGACATTGCCTATCAGCTGCCCACGCCCGAGCAGGTGGCCGAATTCTATGCACGGCCTACCGCTCCCAACAATGGCGAGGAGCAGGGCGAGTGCTGTGTGGAGACATGCAAGCGGAATCCCAGTGTGGATGACATCAAGCTGTATCGCCCGCCAGAGGACACCTCGGTGCTGGCCAAATGGGCGCACAATCTACAAACGGAGGCCGCGGTGCTCACCAATGGATTGCGGATATGCAATCTGCACTTTGAGGCTCACTGCATTGGAAAGCGCATGCGTCCGTGGGCCATACCCACGCTCAATCTGGCCGGCAACATTGAGAATCTGTACGAGAACCCCGAGCACTCGATGCTGTACAAGCGGCGGACGCATCTCAAGCAGAAGTCCCTGCCCGTGGCGAAGCCCACGTGGGTGCCGCGTTGCTGTCTGCCGCATTGCCGCAAGGTGCGTGCCCTGCACAATGTCCAGCTGTATCGCTTCCCCAAACTGAATCGCTCCACGCTGGCCAAGTGGGCGCACAATCTACAGGTGCCGCAGGTGGGCAGTGCCCAGCGGCGTGTGTGCTCCGCACACTTTGAGCCGCATGTTCTCAGCAAAAAGTGCCCGGTGCCGCTGGCGGTGCCCACACTGGACCTGAACTCGCCACCCGGCCACAAGATCTACCAGAATCCGGCCAAGCTGAAGGCCAACAAGCTGTGCCTGCAGCGCGTGTGCATCGTGGAGAGCTGCCGCAAGACCAGGGCCCAGGGCGTGCAGCTCTTCCGGCTGCCGCACAGCCCCACGCAGCTGCGGAAGTGGATGCACAACATCAGGACACGCCCGAGGGCGGCCATGAGGAGTCAGTATCGCGTGTGCTCGCGCCACTTTGAGACGCACTCCTTCAACGGGCGGAGACTGAGCGCCGGTGCCATACCCACCCTGGAGCTGGGCCACGACGACGAGGATATCTTCCCGAACGAAGCGCAGGCCTTTGCCGACGAGCACTGCGCCGTCGAGGGCTGTGAATCGtccaaggagcagctggaggtgcGTCTCTTCCGCTTCCCCacggacgacgacgacatgcTGTGGAAGTGGTGCAACAATCTCAAGATGAATCCCGTCGATTGCATCGGTGTGCGGATCTGCAACAAGCATTTCGAAGCGGATTGCATCGGACCCAAGCATCTGTACAAGTGGGCCATACCCACGATGCTGCTCGGCCACGACGACTCACAGATCGAGCTGATACTAAATCCCAAGCCGGAGGAGCGCTACGTGGATCCGGTGTTCAAATGCATTGTGCCCACGTGCGGCAAGACGCGTCGCTTCGACGAGGTCCAAATGAATAGCTTCCCGAAGGATGCGGAACACTTTCAGCGCTGGCGGCACAACCTCCGCCTGGAGCATCTCAGCTTCAGTGAGCGCGAAAAGTACAAGATTTGCAATGCCCACTTCGAGGACATGTGCATTGGCAAGACGCGCCTGAACATTGGTTCGATACCCACGCTGGAGCTGGGCCACGACGAAACGGAGGATCTGTTCAAGGTGAATCCCGCAGAGTTGCAGAGCAACCTGTTTGGGCGTCCGCGACGCCTGTTGCGTGGATTCAACAATGTGACCATCAAACAGGAGCTGCCAGACACGGAGACGGAGCCAGACTTCCGGGACGTAAAGCCCGATGTGAGCACGGATTTTACACAGGTTAAAATCAAGAAATCTCTGGGCGATGTCAAGTGCTGTGTGCGCACCTGTGGCCGCAGTCGTCTGGAGCATGGAGCGCgtctctttccctttcccacCGGCAAGCAGCAACACCTAAAGTGGCGCCACAATTTGCGCCTCGAGCCGGACGAAGTGGACAAGTCTACTCGCGTTTGCAGCGCACACTTCAACAGGCGCTGCATCGATGGCAAGCAGCTCAGGGGCTGGGCCATGCCCACGCAGCAGTTGGGGCACCAGGAGCAGCCAATCTACGAGAACCCCAAGAACATTCCTGGCTTCTTTACGCCCACCTGTGCGCTGGGGCACTGCCGCAAGCggcggagcattgacaatgatTTGCGCACGTATCGGTATCCGCGCGCCGAGGATCTACTCGAGAAGTGGCGTGCAAATCTCGGCCTATCGCTGGATCAGTGCCGTGGGAGGATCTGTGCTGATCACTTCGAGCCGCAGGTGCGGGGCAAACTGAAGCTCAAGACAGGCGCTGTGCCCACGCTAAAGCTGGGTCATGAGGAGCCGCTGGTCTACGACAATGAGGCTATCAAGGCGGGCgtggccgaggaggaggcagccTCGCCGCTGGTCACACCCAAAACGGAAGTGCTGGACGACGAGGAGCgcgaggaggacgaggaggaggaggagaatccCGAAGAGGAGCCGCAGGAGACGCACGAAGAGGAGAAGGACGAGCATGAGGATGACACGCCCGagggagcagagcagctgAGAGATGCTGATGACGCGGAAGAAGACGATGAGGATCCAGGCAACTATTTTGATCCGTTGGAGCTGGTGGAGACGTACGCAGAGCATCTCagcgacgatgacgacaacCACAGCCATGAGGCACGCGAAGATGATGCCCGAGGggtggatgaggaggagccAGAAACTCTTCTGCCTGACACACCACATAAACCGGATCCAGTGGCAGGCCTTGGCATGCCCAAACCCTGGGACAGAGCTGCCGCCGTTGTGCCTGTGCGCCGTGAGAAGCGTCCGAATAACGTGGATCCCATCTGCTGCCTCAAGCACTGCCGCAAGGAACGCTCAGCCATGTATCTGCTGAGCACGTTTGGCTTCCCCAAggaccagcagctgctgctcaagtgGTGCGCCAATCTGCAAATGGATCCCTCGGGCTGCATTGGTCGCGTGTGCATCGAGCATTTCCAGTCGGAGGTGCTGGGCACTCGCAAACTCAAACAGAATGCGGTGCCCACGCTCAATGTGGGTCACGATGTGCCACTGCGTTACAGCTGCAATGGCCAGGATTTTCCCGATGAAATGCCACAGCATTCGGTTTTTCGGCTTTGGAGCCTAAAACACTGCCGCAAGAGGAAGCTGTCGGAgagtccagctccagctccagcagcgatcaaggaggaggagcagcagcagcagcagcatcagatgccgatgccgatggaGACTAAACCAAAGATTTGCTGCCTACCCAGCTGCGGCAATGTGGAGGGTTACGGACCAGGCGGACAATTCCAGCCCCTGCCCATGGACCAAAGAGTGCTCAAAAAGTGGCAGCACAATCTGCGGCTGCCAGTCATTAATCCTGCAGCGGATATGCATCTGTGCATGGAGCACTTTGAGGCGTGTCAGATCGAGAATGGAACAGCAATGCGCATGGCAGTGCCGACCCTCAAGCTGGGCAACACCAGTCCGAATGTGTTCCGGAACAGGAAGTGCCTGTGGCCCACGTGTCCGCCCAATCGCAAGCTGTGCTACGATCTGCCGGAGCACGAGTCGGTACGTGTGGCCTGGCTGTCGTTTATGCGACTGCCGCTGGACAGCCAAGGTCGACTGTGTGGCGTGCACTTTCTGCAGCTGTACGAGGAGCTGGATCTCCCAGGCGATGTGCCCGAAGAGGTGCTCGAGCGGCTGCAGGTGACCTACGATCAAGCCTCCATCTCGCTGAAGTTTCAGTGCTCCGTTGTGGGCTGTGGCTCCAAGTACAAGCAGGACACGCACCTGGCGAAGCTGCCAGGCGACCCGCAGCTGCTCGCCAGGTGGCTGCACAACACGAGGATATCGTATGATCGCTCGCTGCATTTTAGTTATCGCATTTGTCTGCTGCACTTTGAGGAATATTGCCTGAACGGCGTGCGTCCACAGCAATGGGCCATACCCACACTGCAGCTAAATCACGATGGAGAGATCCACCAGAATACCATGAAGCAGGATGCGCTCGGGACTTCCCCCGTGAAGCGGGAGTTGGCCGAGACTTCACCCGTCAAGCAGGAAGCGCTCGAGAGTCCCCCCGTGAAGCAGGAGTTGCTCGAGAGTCCCCCCATGAAGCAGGAGATGCCCGAGACTTCCCCCGTCAAGCAGGATGCGCTCGAGAGTCCACCCGTGAAGCAGGAGTCGCAGGAGAAAGCTCCCTTAAGCAGCAGCTTGAGTCTCTCCATTCCGCTGCACATCAAGACGGAGCAGCAGGGTCCTGTGCTGCGCTCTCGAGGCACATCTTCGCTGAGCAGTCCCTGcctcagtggcagcagtgccAGTCCTCGCCTCAAGAACAGAATTTGCTGCATTGCCAACTGCGGCGAGGGTGCCAGATCGCAGCGGCTCCATCCCTTTCCAAGCGACGAACCAACGCTGCTCAAGTGGCTGGTAAACACGCAGCAAAAGCCCGGACTGGTGGACATCCAGAGTCTGTTTGTGTGCCAGCTGCACTTCGAGGCGGATGCCATCAACAAGACGCAGCTAAGGAGCTGGGCCTTGCCCACACTGCGGCTGGGCCACGACGGGCATGTCATACCGAATGCCCGGCACAATGGGAACATTGCCAACAGCCAGGAGACGGAGCAGGCCATGGAGTTTATTCGGGCCAACTACTGCTCCGTGCTGAGCTGCTTCCAGCCAAAGACTGAGGGTGTGCGCTTCCACAAGTATCCCAGCGACATTGCCATGGTGCGCAGGTGGTCCACCAATCTCAAGCATCGCTCAATGCAGGCCAGCAGTCATGGTTTTCTCGTCTGCCAGTCGCACTTCCAGCCCGAATGCTACGATCCGCAGACGGGCGACCTACTCGAGGAGGCCGTACCGCTCGCGGTGGGTGTGAAAACCGAAGGCCAGCTGCTCCATTGCCTCGTCAGGGGATGCCCTGTGACGAGTAACTCCAGCAGGGGATTGCTTCTCTTCAGGGTGCCAAAGAGGACGCGTGTGAGGGAAGTGTGGGGCCACAATCTATGGATGTATCCGATGCAGCTGGTGGGCGAGCATTACATCTGCGAGCGACACTTTGAGCCGCACTGCGTGAatgagcagaagcagctgcacgCGGGCTCGGTGCCAACGCTCCACCTGGGCCACAACGAGCAGCCGGAGCTGCTGCCCAATCCAGCGTCCTTCCAAGAGGAGTGCGAATGCTGTGTGCCCGGCTGTTTGCGCAGCAATCGCAAGGACCCCGATCTGCAGTTTAGCAAATTTCCCAAGTGGCGCGTGCTGTCCGACAAGTGGCTGCACAACTTCCGCTTGGAAAAGCCCCGCTTGGATCAGCGCATTAAAGCGCTGCGAGTGTGCCACACGCACTTCGAGGAGAGCTGCTTCGATGATCAGGGCGTGCGGCGGGGTGCCATGCCCACGCTGGAGCTGGGACACTCGCATTCGGACATTTATCACACCGACATGGGTTCAGCTATGTTGAAGCCGTTGCACAAGAGTTTCAGCGACTGCTGCTATCCCGACTGCTATGAGGACTGCCAAAAGGCCAACATCCATCGCATGGCCTACGACTTGCCCAGCGATGGGCCGGTGCGTGAGGCCTGGCTGCAGCACATGGGCATACCCAGCAGCGGCGACGATTGCCCCTCAGGGCTCAGGCTGTGTCCGGTTCACTACATCATGCTGTATGAGCGTAGCTTCCCCGATCGAGCCGACAACCTCGTGCTGGACCGCAGCTATGAGCATGCTCGTCAGTTGCCACAGCTGCGGCGCTTCATGTGCGTCGTGCAGGGATGCCGCCACCTGCACCAGCGGGATGGCGGAACGATGCATGGAATGCCCAGAAGTGAGCAGATCCTGCGCATGTGGGTGGAGAACGCACAGCTGCGGCTGAACGAGCATGAGATTTACATGACGAAGATGTGCAGCAAACACTTTGAGGCCAGCTGCCTGTACGAGGGCAGGAGATGTTACCCGTGGAGCGTGCCAACGCTGCATCTGCCAGCGCTGCTGCCGGGACAGGTGCTGCACCAGAATCCCACCCCTGAGGAGTGGCAGCAAATGAAGCAGCAAATGAAACTGGAGGAGCAGGCACAGAAAGCCGAGCAGCAGATAGCAGAGGGACTGCCAATGGAGCCGTACGTGAAGATGGAACCCCACGACAATGACGACGAAGAAGACGCTTCGCAAACGGAATCGGAGCTGCTGATAAATGAAAGCACCATGGACTCGCAGGAGACGGACTTTCAGCCGAATGAGCCGCTGGAGGTGCTGCTCGAGGTGGGCCATGTCGAGCGGCTGGAGAGCTACGAGAAGCGGGATTACTCGACGGATATCCGCCACAGCCGCTATGGGCCTGGCAAGCGTTTTCGCCATCCGTACAGCGCCCACAAGTGCTGTGTCGAGGGCTGTGGCGTGTCGGTGGAGGATTTGGGTTGGAATCTGAAGCTGCACAAGCTGCCCAGCTCCACGGAGGTGGCCAGAAAGTGGCTGTACAACATACAGGTGGACACAGAGGACACGTGGCGCATACGCGTCTGCAGCCATCACTTTGACCGTCAGTGTCTCAATGGCTCGCGACTCAGGCGAGGATCGATGCCTACGCAGCATCTGGGGCCACGTGTGCCCGAAACTATCCACCAAAATGAGTTcttacagctgcagctggacgaTGCGCCAGCGCAGAATGACCAACACGTGGAGCCGGTGCGAACAATTGGCAAGGTGGTGAATCCGTGCGTGCCGCGACCCTCACCGCCGCGCAAGTCCAGCAAATTTTGCCAGATCGAGGGATGTTCGAATCATCTTACCAGCGAGAACATGACACTCCACAAGTTCCCGCACTCGTCGTGGATCTGCGTCAAGTGGCAGCACAACACACAGGTTCCCTTCGATCCAGAGTATCGCTGGCGCTATCGCATCTGCAGCGCCCACTTCCATCCCGTGTGCATGGTGAacatgcggctgctgcacgGCAGCGTGCCCACCCTCAAGCTGGGCCCCCGCGCACCCGCCGAGCTCTTTGACAGCGACTTTGAGGCCATCAACATCAAGATGGATAAGTTGGAGAAGTTGGAGAGAAAATCAGAGGCGCGTAGAGCTACCACGGGAGATGGCGTTGGTTATCCCGGCATGCTGGACATGGGGGAGAAGAAGGTTAAGactgagctggagctggacgaTGGAGGAGCACTCgacgaggaggaagaggagcaggaggatgcCATGCTCTACCTGGAGCCCGAGATGCATATATACGAGGATCAagaggaacaacagcaaaagcggGCACCAAAGGCAAATCCAAATGGTGGCTGGAAAACGGAGCTCCGTTTGCCTTCCAAGAGCCGCGTGACCCTCAATCCGGTGAGATCTGGCTACGACAAGTGCTCGCTGATGCATTGCCAGCGCCAGAGATCAAAGCACGGCGTCCACATCTACAAGTTCCCACGCTcggaggagcaccagcagcgctGGATGCACAATCTGCGCATCCGCTACGACGAGAAGCGCCCGTGGAAGTTTATGGTGTGCAGCGTGCACTTTGAGCCGCACTGCATACGGCTAAGGAAGCTGCGACCCTGGGCAGTGCCCACTCTGGAGCTGGGCGACAATGTACCCGAGGATATCTACACGAATGAGCAGTGCCAAATGTTTGCCAGTGGACAGGCGGCAGGAGAGTCAAATTGCAACTATAGCGATGCAGAGgcgcaggcagaggcagagcctgAGGCGGATAGCGATGGGAATGATGACGAGGATGGCCTGCAAGAGGACGAGGATGAAGAGACAGATGAGCAGGAAGCGAGCACCGCCAAGAAGCGTCGTCGTTCGCGTCTGGCGGATGCCTCCTGGCCACTGGGCCAAGTGCCACCGTGGAAAGTGAAGCAATGCTGTCTGCCCTACTGCCGCAGTCCACGCGGCGATGGCATCAAGCTGTTCCGACTGCCCAACAAAGTCAACTCCATTCGCAATTGGGAGCTGGCCACCGGCATGAAGTTCAAGGAGTCGCAGCGCAACACTCGACTCATCTGCAGCCGCCACTTTGAACCAGAACTGATTGGAGTGCGGCGTCTCATGCGCAATGCCATTCCCACGCGGCACCTGGAACCCACGGGCGATGTGAAGCCAGTGCTGCCACCACCGACAACAGCTGGCCCTAGATGCTGCATGGCCGACTGTGCCTACGATGTGGCGGATGTGAAGCTGCACAAATTTCCCAGCAATCCCAAGTTGCTGAGGGAGTGGTGCCAGGCATTGAGGGTTACCGACCTGCAAAGGTATCGCGGCAAGCACATTTGCTCCGCCCATCTGCCTGTCCACAAGGCCGTGCACTGCATTGTTTGTGGCGCGGATAGGGCGCCGCAGTTGCCGATGCTTAGCTTCAAGCCCGAGGCGGCGCCCACACTGCATTTGAATCATAATGATACAAACATATTCCTTAACGATTGTGCCATAAATCCCGCCTACACGGGAGTACGGGTAAAGGATGAGCCCATGGACAATCAGGTCCTGTCGTTGTAG
Protein Sequence
MSQHNPHAHPHYHHHPLHQHQTQHHHHHHQLQLQQQQQQQQQQQQQQQQQQHSQMPHSTHNWYSHVASYPPPPPHHHATATSAFAATSTPCKGSGSGSANNNNMSAYGAGSTHGYYAAAAAAAAGGGLNVNAVGGVGGGSAAYNLEANTVAYAHNQLLQYQQHHQHQQQQQHHHQQQQHLSQRSYMGHDIMSGTYPYIKSEPMDVSAAYQQPPNPMAPPPAPDMIIKSEPMDEHPYKSNYIDDNTPFADFNKFNEFSGDMLSPKVELTIKDETYGKTSSGSSFARRKVLQQPQQQQPTTDRSAESLPICQRCKEVFFKKQAYLRHVAESSCGIQEYDFKCNICPMSFMSSEELQRHKHLHRADKFFCHKYCGKHFDTIAECETHEYMQHEYESFVCNMCSGTFATREQLYAHLPQHKFQQRYDCPICRLWYQTAVELHEHRLAAPYFCGKYYTNQQQQQQQQGNYKLQDCHMATMEMPTAPLLKVPTPANASALPATAALSSLLQQRQANADGAAAMFASAASSTAAVASTAVSLKREVSVKLERSYSNSTSESSYSHQDNSSYNNAYGSDSSIHGGALAGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLNQWLHNLKMFHIPAASYATFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPHCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEDRCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPGVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPSPTGQHGQQLMPQQSTSQHHHSSLQQQQHPAMLSKSYQRHSAASTSSSASSASHYVDPEMSGSYLSLSAGSSSGGMNASDCMDVCCVPSCESKRHNSENITFHTIPRRPEQMRKWCHNLKIPEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLQLGHEDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVALLTKWCANLQRTVPDGSKLFNDAICEVHFEDRCLRNKRLEKWAVPTLILGHEDIAYQLPTPEQVAEFYARPTAPNNGEEQGECCVETCKRNPSVDDIKLYRPPEDTSVLAKWAHNLQTEAAVLTNGLRICNLHFEAHCIGKRMRPWAIPTLNLAGNIENLYENPEHSMLYKRRTHLKQKSLPVAKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPQVGSAQRRVCSAHFEPHVLSKKCPVPLAVPTLDLNSPPGHKIYQNPAKLKANKLCLQRVCIVESCRKTRAQGVQLFRLPHSPTQLRKWMHNIRTRPRAAMRSQYRVCSRHFETHSFNGRRLSAGAIPTLELGHDDEDIFPNEAQAFADEHCAVEGCESSKEQLEVRLFRFPTDDDDMLWKWCNNLKMNPVDCIGVRICNKHFEADCIGPKHLYKWAIPTMLLGHDDSQIELILNPKPEERYVDPVFKCIVPTCGKTRRFDEVQMNSFPKDAEHFQRWRHNLRLEHLSFSEREKYKICNAHFEDMCIGKTRLNIGSIPTLELGHDETEDLFKVNPAELQSNLFGRPRRLLRGFNNVTIKQELPDTETEPDFRDVKPDVSTDFTQVKIKKSLGDVKCCVRTCGRSRLEHGARLFPFPTGKQQHLKWRHNLRLEPDEVDKSTRVCSAHFNRRCIDGKQLRGWAMPTQQLGHQEQPIYENPKNIPGFFTPTCALGHCRKRRSIDNDLRTYRYPRAEDLLEKWRANLGLSLDQCRGRICADHFEPQVRGKLKLKTGAVPTLKLGHEEPLVYDNEAIKAGVAEEEAASPLVTPKTEVLDDEEREEDEEEEENPEEEPQETHEEEKDEHEDDTPEGAEQLRDADDAEEDDEDPGNYFDPLELVETYAEHLSDDDDNHSHEAREDDARGVDEEEPETLLPDTPHKPDPVAGLGMPKPWDRAAAVVPVRREKRPNNVDPICCLKHCRKERSAMYLLSTFGFPKDQQLLLKWCANLQMDPSGCIGRVCIEHFQSEVLGTRKLKQNAVPTLNVGHDVPLRYSCNGQDFPDEMPQHSVFRLWSLKHCRKRKLSESPAPAPAAIKEEEQQQQQHQMPMPMETKPKICCLPSCGNVEGYGPGGQFQPLPMDQRVLKKWQHNLRLPVINPAADMHLCMEHFEACQIENGTAMRMAVPTLKLGNTSPNVFRNRKCLWPTCPPNRKLCYDLPEHESVRVAWLSFMRLPLDSQGRLCGVHFLQLYEELDLPGDVPEEVLERLQVTYDQASISLKFQCSVVGCGSKYKQDTHLAKLPGDPQLLARWLHNTRISYDRSLHFSYRICLLHFEEYCLNGVRPQQWAIPTLQLNHDGEIHQNTMKQDALGTSPVKRELAETSPVKQEALESPPVKQELLESPPMKQEMPETSPVKQDALESPPVKQESQEKAPLSSSLSLSIPLHIKTEQQGPVLRSRGTSSLSSPCLSGSSASPRLKNRICCIANCGEGARSQRLHPFPSDEPTLLKWLVNTQQKPGLVDIQSLFVCQLHFEADAINKTQLRSWALPTLRLGHDGHVIPNARHNGNIANSQETEQAMEFIRANYCSVLSCFQPKTEGVRFHKYPSDIAMVRRWSTNLKHRSMQASSHGFLVCQSHFQPECYDPQTGDLLEEAVPLAVGVKTEGQLLHCLVRGCPVTSNSSRGLLLFRVPKRTRVREVWGHNLWMYPMQLVGEHYICERHFEPHCVNEQKQLHAGSVPTLHLGHNEQPELLPNPASFQEECECCVPGCLRSNRKDPDLQFSKFPKWRVLSDKWLHNFRLEKPRLDQRIKALRVCHTHFEESCFDDQGVRRGAMPTLELGHSHSDIYHTDMGSAMLKPLHKSFSDCCYPDCYEDCQKANIHRMAYDLPSDGPVREAWLQHMGIPSSGDDCPSGLRLCPVHYIMLYERSFPDRADNLVLDRSYEHARQLPQLRRFMCVVQGCRHLHQRDGGTMHGMPRSEQILRMWVENAQLRLNEHEIYMTKMCSKHFEASCLYEGRRCYPWSVPTLHLPALLPGQVLHQNPTPEEWQQMKQQMKLEEQAQKAEQQIAEGLPMEPYVKMEPHDNDDEEDASQTESELLINESTMDSQETDFQPNEPLEVLLEVGHVERLESYEKRDYSTDIRHSRYGPGKRFRHPYSAHKCCVEGCGVSVEDLGWNLKLHKLPSSTEVARKWLYNIQVDTEDTWRIRVCSHHFDRQCLNGSRLRRGSMPTQHLGPRVPETIHQNEFLQLQLDDAPAQNDQHVEPVRTIGKVVNPCVPRPSPPRKSSKFCQIEGCSNHLTSENMTLHKFPHSSWICVKWQHNTQVPFDPEYRWRYRICSAHFHPVCMVNMRLLHGSVPTLKLGPRAPAELFDSDFEAINIKMDKLEKLERKSEARRATTGDGVGYPGMLDMGEKKVKTELELDDGGALDEEEEEQEDAMLYLEPEMHIYEDQEEQQQKRAPKANPNGGWKTELRLPSKSRVTLNPVRSGYDKCSLMHCQRQRSKHGVHIYKFPRSEEHQQRWMHNLRIRYDEKRPWKFMVCSVHFEPHCIRLRKLRPWAVPTLELGDNVPEDIYTNEQCQMFASGQAAGESNCNYSDAEAQAEAEPEADSDGNDDEDGLQEDEDEETDEQEASTAKKRRRSRLADASWPLGQVPPWKVKQCCLPYCRSPRGDGIKLFRLPNKVNSIRNWELATGMKFKESQRNTRLICSRHFEPELIGVRRLMRNAIPTRHLEPTGDVKPVLPPPTTAGPRCCMADCAYDVADVKLHKFPSNPKLLREWCQALRVTDLQRYRGKHICSAHLPVHKAVHCIVCGADRAPQLPMLSFKPEAAPTLHLNHNDTNIFLNDCAINPAYTGVRVKDEPMDNQVLSL

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00536072;
90% Identity
iTF_00601081;
80% Identity
-