Basic Information

Gene Symbol
-
Assembly
GCA_963978525.1
Location
OZ021690.1:8399461-8417989[-]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 32 8.3e-16 1.1e-12 48.6 3.2 1 86 692 764 692 765 0.85
2 32 1.4e-15 1.9e-12 47.8 6.2 1 87 792 861 792 861 0.82
3 32 3.2e-16 4.4e-13 49.9 0.7 1 87 883 955 883 955 0.85
4 32 6e-14 8.2e-11 42.6 2.8 1 86 1045 1113 1045 1114 0.78
5 32 2e-16 2.7e-13 50.5 5.2 1 86 1138 1209 1138 1210 0.80
6 32 4.9e-12 6.7e-09 36.5 1.1 1 87 1245 1313 1245 1313 0.81
7 32 1.3e-11 1.8e-08 35.1 3.7 1 87 1353 1422 1353 1422 0.77
8 32 4.4e-15 6e-12 46.2 0.2 1 87 1449 1519 1449 1519 0.80
9 32 5.1e-14 7e-11 42.8 0.3 1 86 1541 1610 1541 1611 0.81
10 32 4.1e-13 5.5e-10 39.9 1.6 1 87 1640 1712 1640 1712 0.85
11 32 0.00018 0.25 12.2 0.1 1 64 1782 1838 1782 1858 0.70
12 32 3.1e-11 4.3e-08 33.9 2.1 1 87 1878 1950 1878 1950 0.79
13 32 3.6e-15 4.9e-12 46.5 2.0 1 85 1980 2049 1980 2051 0.78
14 32 1.2e-14 1.7e-11 44.8 4.0 1 87 2097 2168 2097 2168 0.80
15 32 8.9e-14 1.2e-10 42.1 4.1 1 87 2190 2260 2190 2260 0.80
16 32 4.6e-15 6.3e-12 46.2 0.4 1 87 2388 2457 2388 2457 0.80
17 32 7.9e-13 1.1e-09 39.0 4.6 1 87 2514 2593 2514 2593 0.80
18 32 3.3e-10 4.6e-07 30.6 4.6 1 87 2605 2679 2605 2679 0.79
19 32 2.2e-11 3e-08 34.4 0.0 1 86 2705 2773 2705 2774 0.80
20 32 2.1e-13 2.9e-10 40.9 0.4 1 87 2797 2867 2797 2867 0.80
21 32 1.6e-17 2.1e-14 54.1 0.4 1 87 2886 2966 2886 2966 0.81
22 32 2.8e-09 3.9e-06 27.6 0.8 1 61 2983 3036 2983 3057 0.76
23 32 1.1e-11 1.4e-08 35.4 2.3 1 86 3075 3144 3075 3145 0.83
24 32 3.1e-13 4.3e-10 40.3 3.2 1 86 3170 3240 3170 3241 0.84
25 32 1.2e-14 1.7e-11 44.8 0.7 1 86 3261 3333 3261 3334 0.81
26 32 0.00032 0.44 11.4 0.0 1 69 3360 3421 3360 3444 0.81
27 32 1.9e-14 2.5e-11 44.2 3.9 1 86 3590 3660 3590 3661 0.83
28 32 1.6e-07 0.00021 22.0 1.2 1 58 3711 3758 3711 3778 0.81
29 32 6e-14 8.2e-11 42.6 0.7 1 86 3801 3879 3801 3880 0.81
30 32 8.9e-07 0.0012 19.6 0.1 1 86 3904 3968 3904 3969 0.70
31 32 1.2e-13 1.7e-10 41.6 2.7 1 87 4279 4353 4279 4353 0.80
32 32 2.4e-12 3.2e-09 37.5 0.4 1 86 4379 4446 4379 4447 0.82

Sequence Information

Coding Sequence
atgtcacatcaacatcatcatcaccaccatcaccaccaacaacaccaacaacaatatcaacgcaaacaacagcagcaacaacaacagcaacatccttttcatatatatgctCAACCTCAGacacaacatcaacaacagccACCGCATCAACAATGGTATGCACATATAAATGATatacaaccacaacaacagcagcaacagcagcaacaacatcaacaatatatTGCTGGTCATGTAAGGGATTCCCGCCATTTACATGCGCCACATATATACGGTAGTATGTTGGGGCATAGTAGTGGAGCTTATATGGCTGGTGCTAGTATGCAGAGTATGGGACGGCATGCATATAATATGCCGGGTACTACTGCTACATCTGTGCCATACACATTTCCACACAATCCAATTAGGGGTGACAATAACATTGTGCATACAAGGAACTATGACCTTGAAATGGTAAACAGAGCACACAGCATACCATCAACTACTCATACAATGATTGGCGGTGATAATCATAGAAGCTATGATGCCTATTCGCATAATGCTGCTATATTTacgcaacaacagcaacagcaacaacaacaccaacagcagcaacagcaacagcaagtACGTTTACACCATCATCATGCCGTCCATCCTATACATCATCAATCGCATCAACAGCAACATTCTATACCTGGACATCAGGCaacacatcatcatcatcatcatcagcagcagcagcagcagcagcagcagcaacgtCACCAACAACAtattcaacaacaacttgTACCCAATCCAATGCATAATATGAAAACGAAACCAGTGGAGGAGATTACCATAACACCAACCATACAAATGgatgaaataattatcaaagCTGAACCTCCTGACGAATATATTTACCATAGAAATTTGCagcagcaacatcaacaacaacagcagcagccaTCCTATAGTGTGcttaaatgtataaaacaGGAACCACAGGCACAACCACaactattacaacaacaacaacagcaacaacaacaccatcatcagcagcagcaacagcgaCAGCAGCAACATCAGTTAGCGCAACAACGACAGTCTAAACAATTACAATCTCCACCGCCTCCGCAGCCGCCACCTCCTCCACCACCACAAGCACCACCAGCAGCAACAGTAACATCAATAGCAACTACACCTCCTCCTGAAGCCTCATCACCAATTGACGAGAAAGATATAAAACCTATGAACTTTCCACGGCGTAAAGTGCAAACGGAACGTTCCTCAACGTTGCCGATTTGTCACCGGTGTAAACAAGTATTTCTGAAGCGTCAAAATTACACACAACATGTGGCCCTGTCCACGTGCGATATGGTTGAATATGACTTCAGATGCTCGATCTGTCCAATGTCTTATATGTCCAATGAAGAATTGCGTGCCCATGAACGTTTGCATCGCTTGTATCGGTACTTTTGCATGCTGAATTGTGGTAAACATTTCGAGACAATACAAGAATGTGAACACCACGAATACATGGAACATGAccaatatatgtacaagtgCGGTATGTGTGGTTTGGATTATCCAACACGTGAAGAACTTTTGATTCATCGAAAATGTCATAAATATGCAACACGTTTTGTTTGTACGATATGTCGAGGCTGGTTTAGAAACATGTCAAGATTACACAACCACTATTTGAACAACCCATATAGATGTGGGAAATTTTACAACAAGGACGATTTCAATGCTTGTGCCACTGAAACCAGCATCAAGTTCAGAAAAGTTTCAACGGATACCAATGGCGAACATGATTCAAAACAGAGTACAgatgacCTAAATGCTTCAGATGATATTGTAGAGGAAATGGACACTGCTCCGCCAGATGAGTCATCTTCCGACGAAGAGGTTGAAACTGAAATAAAAGTTGAACCTGATTTTTATCCACCGATGGATCAAACCGATTATCAATCTGAAGAGTTTTCAAccaatcaaaatcttaattttttacatgactTCCAGGAAAATGCTTCAAATAGCACAAATTCGTCGTATACCATGGGTGGTAGTGAGGCGATTGCGGGAGATCAAGATGTAGTttgttgtgtaaaattttgtggCGTTACAAAGCATGCCAGTCCTTCtttgcaatttttcaatttccctCGTGAAGACAAATACCTGCAACAGTGGTTGCATAACTTAAAAATGCCTTACGACCCGCAGGTTAAGTACACGCAATATCGCATTTGTAGTTTACACTTTCCCAAACGTTGCATAAATCGTTACTCGTTAAGTTATTGGGCAGTGCCCACCTTTAATTTAGGCCACGATGAAGTAGCCAATTTGTATCAAAATCGTGAAATTAGTAACAGTTTATTGAGCAGTGATGCTGCCCGGTGCTGTATGCCGGGCTGTCGTGCTCAACGTGGTCAAACGAATGttaagttttataattttcctaAAGATCTAAAGACTTTGATTAAGTGGTGTCAAAACGCTCGTCTGCCCGTGCACACGAAAGAGTCGAGACATTTCTGTTCGCGTCACTTTGAGGAGAAATGTTTCGGGAAATTCCGTTTGAAGCCTTGGGCTATACCCACGCTAAATTTGGGCACGGTGTATGGTAAGATTCATGACAATCCCAACGTTTCGTATTTGGAAGAGAAGAAATGTTGTTTGACGTTCTGTCGTAAAAGTCGTTCAGACGATTTCAGTTTATCTTTATATCGGTTTCCGCGTGACGAGTCCATGCTGCGTAAATGGTGCTACAATTTACGCTTGCATCCGGACGTTTACAGGGGTAAGAATCACAAAATCTGTTCACATCATTTCATTAAAGAAGCGTTAGGGTTGCGTAAACTGTCGCCGGGTGCGGTGCCCACTTTAAATTTGGGCCATAATGATCGCATAAATATCTACGACAATGAATTGCATCCACCATCCAATGTGGCGTCAACCTCATATAGCGCCAAAGCGGCAGCCGCCTCGGCTCTTACCGCATCTATACGTAAAGCTCAGTTATTAAAATACCGTAATGCTTCGCAATCAGCCAACGCATCAGCATCTTCCATATATGATGAGGTTCTCAGTAACAGTCAAAAGttctcctcctcctcctcttcGGTGGCATCCAATGCCATAGAATTGGGTGACGTTTGTTTAGTGCCCTCCTGCAAGCGTTCCCGGCACACGGAAAATGTCACGCTACACACTGTGCCGCGAAGACCCGAACAATTGGAGAAATGGTGTCACAATctgaaaatgaatttgaatgGTTTGCATAAGAATGCTCGCATTTGTAGTGCACACTTTGAGAATTACTGCATTGGCGGGTGCATGCGGCCATTTGCTGTGCCCACGCTGGAGTTGGGTCACGAGGACTCCGACATTTATCGCAATCCGGatgttattaaaaagttaaacatTCGCGAGACATGTTGCGTACCAAGTTGTAAGCGAAATCGGGACCGGGATCACGCCAATTTGCACCGTTTCCCCACCAATCCAGATTTGTTGCAGAAGTGGTGTGCAAATCTACAAAAGAGCATCCCAGATGGCACCAAGCTGTTCAATGATGCGGTGTGTGAGGTACACTTTGAAGATAAGTGTTTGCGCAACAAACGATTAGAGAAGTGGGCGGTGCCCACTTTGAAATTAGGCTACGAACCAATACCGCATTATTTGCCATCGCAAGAAGAAATTGATGAATATTGGTCCAAACCAATGGCGCCAAATAACGGTGATGAAACGGGTGAATGTTGTGTGGTCACCTGTAAAAGAAATCCACAAATGGATGATGTGAAATTGTATCGGCCACCCGAAGACGCCGAACAGTTGGTCAAGTGGTCGCACAATCTGCAAATTGATGTTACAAAATTGTCCACAATGAAGATATGCAATTTACATTTTGAAACGCATTGTATTGGCAAGCGGTTGTTGAACTGGGCGATGCCCACTCTCAATTTAGCCGCCAAGGTTgaacatttatttgaaaatccTCCACCCACGATTTCGCATTATCGTCGCCGAGTGAAATTGGGTCTCAAAGCGGAGCATGACTTAATTAAGTGGTCGCCGCGCTGTTGTTTAGCCCATTGTCGCAAAATGCGTAGTCGCGATAATGTACAATTGTATCGGTTCCCTGTTAATTTGAATACCATGAGTAAATGGTGTCACAATGTACAATTGCCGGTGGTTGGCAGTTCACATCGACGCATCTGTTCGGCACACTTTGAATCGAGCGTACTTACCAAACGTTGTCCCATAGCTACAGCTGTACCCACAGTAAATCTGAATACACCGCCTGGCtataaaatttaccaaaactCACCGAAACTGAAACAGCACAAAATATTCAGCCAACGTTGGTGTGTGGTGAGTACTTGTCGCAAGACGCGCAGTGAGGGTGTACAATTGTTTCGTTTCCCTCATAATCgtataattttaacaaaatggcGTTATAATCTGAAGAATCTACCGAAAGGGAAGCTTAGCTCCCAATTTCGCATTTGTTCGTtgcattttgaaaatcattcgATTGGTATGAAGCGACTGTCACCGGGTGCCATACCCACATTAAATCTGGGACATGAGGCAGAGGATATATATCCTAATGAGACGCGTTCCTTTTTCGATCTAGATAAATGTGTGGTGAATGGTTGCTCGTCGAGTAAGGAGACAGAAAATATACGGCTTTTTAAATTCCCCGGAGATGATGAAGAGCTATTGGGCAAGTGGTGTCATAATCTGAAAATGAATCCCAACGATTGCATTGGCATCAAAATATGCAGTTTACACTTCGAAACAGATTGCATGGGTCCCCGACTATTGTACAAATGGTCTATACCTACTCTACAGCTGGGTTATGAAAATGCTGAGGATGCCCCAGAAATTATACCGAATCCACCAATTGAGAAACGCTGCGGTGAGGTGTTATTCAAATGTTGTTTGCCCAGCTGCGGCAAAACGCGCAAATACGATGAGGCTCAAATGAATAGTTTTcctaaaaatgttaaaatgtttCGTCGCTGGCAACATAATCTCAAGTTGGATTTTTTAGATTTCAAAGATcgagaaaaatataaaatctgCAATGATCACTTTGAGGCGATTTGTTTAGGCAAGACGCGACTTAATTTTGGAGCAATACCCACAATAAATTTGGGTCACAATCTCACCAACGACCTGTACAAGGTTAATCCACAAAAAATCTGGCCCAATTTGTTTACTAAACAATCGGAATATGAAAAAGAATTGTACGAGGGAGAAAATAAGCGAACAGATGGGGAATATTCACGCCTAGAAGGCGAAGTaggtgaagaagaagaagtcgTTGTGGAAACCTCTAATATACCAATGAATTTTACATGTTTATTCAACGAGTGCAATGGTCCAAAATCGTTGATGCGAGAACCCTACGATATACCACAAACAGAACAATTAAAAACCCTATGGTGTACCCTGATGAAAGTGGAGCCTAACAGTATTTCAGGGGATAACAAACTGTGTGGATTGCATTTCCAGCAGTTATTCAATGAAACCAGAGAACAAATGTTAGCTTTAAGTACCGAGGACACTGACATGAAACGCGATTTTGAAAAGTTGGATTATGCTTATAAAAAATCAGAAATATCTTTAGTTATCAAAGGATCAAAATGTAGCATACAGGATTGCTACAAGACGTTAGTGGAACCACATGTAAAACTCTATCAATTTCCCTACGGAAAAGAATTGATAGAGAAATGGTCCCATAACACGAACATCAAGCCCGATGAACATCGTcgttatttaacaaaaatatgctGCCTTCATTTCGAAGCGTATTGTTTTACACCCAATCAACGATTACGAACATGGGCTATACCCACACTGAACTTAACCAATGCTCCAACGAAAACTATACATAAAAATCCCGATTTAACCACATTGGATCGTCGCCTTGTAGGACCGCCTATTTTAAAATGCTGTGTTCCAAATTGCACTCAAGAGAATAATGAGACGATTGAGGGTAATAAACTTTTTAGCTTTCCCATGGATGATAACATACTACAGAAATGgtgtgaaaatttaaaattgtcgcGTGAGCAAACGccaatttttaagatttgtgCTCAACACTTTGAGAAACAGTGCTTTGGACTTAGCCGTTTACGATCAGGTGCTATACCAACTGCAAAATTGGGCCACAATGAGGAACCAATCCATCATAATCAATCAAGCCTAAAGGAGGAGATGTACGAACCAAAAGTTGATCAAAATGTTGGATTGGGATTAAAGCAGGCAAAAATAAAGAAGTCTTTAGATAGCATGAAATGCTTTATACCCTCTTGCCGTCGCAGTCGCCTACAACATGGTGTACGTTTCTTTACCTTTCCCTCCAACCCCGTACTGAGGCATAAATGGTGTCACAACTTGCAAATGCCAGCAAATTTTGGCAAACTGCTAAGTATACGCATTTGTAGCATacattttcacaaaaagtgCTTGGagggtaaaaatttaaaagactGGGCGGTACCCACATTGCATTTGGGACATAATGAATCCATTTATGACAATCCACGGGCAATGCGACGTCACAACATACCAAAATGTATTCTACCCCACTGTGGCGAACAGAGATCTCATGGTAAAGAACTGCGTTTCTTTACATTTCCCAAGGATAatcaaatattgaaaaaatggtGTAAAAACCTGAAGCTCTCGGGGGAACAATGCAAGGGACGACATTTATGCGAGAAACATTTTGAAGCTAAAGTTTTGAGTTATAAAAGATTGAAAACGAGTTGCGTACCAACTTTAAATCTAGGCCATTCTGAGCCTTTGGTATATAACAATGTAGCTTTGTTGGAAGATAGACAGCATTCACTTACCGATGCTAATGCTAATGAAGCTGAAGATATAGATTCCTTGGAGCTAGATTTGGAAGAAGCCGAAGGTAATGACTTTGAAACAGAATCTTTAAGGACTCCAATAAGTTGGAGTAACTTGGAGTCTAGAGAGTTGCGAGTAAAAATCACACCTTTGAAACACGAAGACCTAACAGACATAGCTTCCATATGTTCTTCCATGAGtaaggaaaaagaagaaaatgattCCATTTATAGTGGTTGCGAGTCTCGAGAAGATACAGCCACCTTGTCTGCAAATCGCAAATCGAAGACCGTGAACAGCTTCAATGCCATCTGCTGTTTGAAACATTGTCGTAAAGAAAAAACACCAGAACAACATTTGACCACTTATGGCTTTCCCAAAGATCTGGAGCTTTTACGAAAATGGTGTGACAATTTGGGTTTGGAACTAAACCAGTGCATTGGGCGCGTGTGTGTGGATCATTTTGAGTTGAGAGTAATGGGGAGACGACGTTTAAAACCTGGAGCAGTTCCCACTTTAAATTTAGGTCATGACCGACCCTTAAAACATACCAACGatgcaattaaattaaaaatgaacgaGAAAAGTAAGACTTCCGAACAAACCGAAGATAAATTTTCTAGCCCAGAACCGAAACTGACTCCACCACCCTATAAAACTAGACCCACCAAACAATCGGTTTTTCGGCTATGTTGCCTCAAGCATTGTCGCCGCAAGAAAGACCTTGACACTAGCAATAAAGAAGTGCCCATAGTATTTAAATTCCCTCAAGAACccaaattgttaaaaaagtgGTCTGAGGCTTTACACATCCCTCTACAGCAGTGCACGCGCCCCAATTTGGGTTTATGTGCTGATCACTTTGAAACACattgttttgaaaatgaagcaaaatatcaattaaaagCGAATAGTGTACCAACCATAAATCTGAAGGAAGATTTTcagaaaaaggaagaaataTGTTGCTTGAAACATTGCGCCAGTTCAACAAAATGCCATGATAACGTTTTTTTGCTGTCGTTCCCCTTAAAACCGCAAGTTTTAATACGTAAATGGTGCTACAATACCAGAATCTCTCATAAAGTTAAGGGACTGAAATCCCTAAAGATCTGTAGCCTGCATTTCGAAAAGCAGGTCTTCTTTAGAGGTTGTTTACTGCGGATTAATGCTGTACCCACAATTAATCTGGGACATACGGGAAAGATTTATAAGAATCCCAAATCGTATCGtataataaatgtacaaaaaccGTTAGAAAAGTGTTGTATAGTTAGTTGCCAACAGGAGAGTGACAAGTTGTATAGTTTCCCGAAAAATTCCGAATTGCGTCGTATTTGGTCGAATAATTCGGGTATTGAGACTCGCTTAGCATTAAAGCAACAATTGAAATTATGCAAGCGACACTTCACCGCGGACAGTTTCATAAGTGGTGGTGATTCTTTGAAATTGGAAGCAGTACCCTTGTTGTATTTGGACGTGGACAAAAGTCAGCATTTGGTATTGGACATGTCTACGATGATACAAAACAATCCTCATTGTTTGATACATAATTGCGGCTGCATACCCAGCGTTGATAAGGTCAAATTGTATCCATTTCCCCAGGAGAAAGAGGTCTTGGAAAAATGGCTTTTCAATCTGCAACTGCCAGAGAACTACGCCCCCGAAAATGCTTACATATGTAGCCGCCACTTTGACAAGGCATGCATACAGCGCGGattattacataaaagtgCTGTACCAACAATATTTTTGGGACATTCTGGCGGATTTTATAGAAACGGTGATGATATATTTAACACACCCTGTGCTGTGCCTCATTGTAAGTATGATCTGAACGAAGAAGACGATGGTGCTCATGATGTGCGTTTAATGTATAAATTTCCGAAAGATTCGCAACGTTTGAAAAAgtggttggataacatacGTATTACGGATGATGTTTATCAAAAGCAAAAGAATCGCCGCATATGTTCAGAACATTTTGAAGAGATCTGTAAGGTGGCCGGTAAGGATACTTTGTTACCGCATTCAGTGCCCACTCTAAATTTAGGCTACAATCCAGCTGAAGTGCCGCATAgaaatcatcatcaaaattGTTGCTTTGATTCATGTAAACTTAAGGACAAATATTCCTCGATGACAATGCATAAATTGCCACAAAATGAGAAGATGCGTGCGTTGTGGTTGGAGGAACTTGATTCGAAAGATGATGCTATATCCAAGCAATTCTTATGTGCGGCACACTTTCTGGCTATTTATGAGAGAGTTAAAGAGAAACACAAAGTTTTTGTTAAGCAATTGAACGAGTACGAAGCTTTATCTAATGTGTACAGGGACCTTAAACAAAGCGATTTACTGCAAAGTTTCAAGTGTTCTATACCGCAATGCTCCACAGGCTTTAAACAGACCATTAAGCTATTCAAATTTCCCATGGATGGTAACCTTTTCAATAAATGGCAACATAATACCGGCTTACAATTTGACATGAGCCAACGCAGCTGCCATTTAATGTGCGCTCTACATTTCGAGCCGCGCTGCCTATGCGAAGTGCAATTGCATCGCTGGGCTGTGCCCACCTTAGGATTACCCACATCTAACAGCTTGTACGTAAATCCACCCGAAGCCTTACCATCGGATCATGAAAATTTGCAACATTGTTGTGTGTCATCGTGCAGCTCAACACGGGGACCCTTCTTTCAATTCCccacaaaacaaattaatctAAAGAGATGGATACATAATCTGGGCTTGGGTACACAACAATGTACCACAAACCTACGTGTCTGTTATAagcattttgaaaaatactgCTTCATGAAACGTGAAGATCAAGCACTGACGTTAAAAATATGGTCCATTCCAACACTAAAACTACCCGCCAACCAGGACTTGTATAAAAATCCTATAGATAAAGTTTGCTATTTTTCGTGCAGCGTACCCGGCTGCAAACAGATACGCAATAGTACCGAAggtatttacttttattgtttccctaaaaataaaactttagaaAGGAACTGGCTTTTAAATACGGGCATAAAGCCGGGGAATTTTCGCGAAGATATGCGCATTTGTAGCTTGCACTTTGAACCAGAGTGCTTCCTAAAAGATTCTATGCAGCTAAGAAAGCATACAGTGCCCACTTTGAAACTACGTACAGCCaacaatttattacataaaaatccCATACGCAAAAAACTTCTTATCAAGAATACAACCGAAAAGTGTTTGGTAAAGTCTTGTACATATGCGGGGGACACATTATATGATCTGCCAAAAAATATCAGTGAATTGAAAAGCTTAGGCCGTAATTTAGAATTGGAAGACGTGCCATTGGATCAGCTGGAGAGGAAtttgaaaatatgtaaaaaacattacaacgAAGAATTGTTAAAAAGGTCAGACGAAATGAAAGCCAAGCCTTTAACTGCAGCAACAATGGACCCGGAAAGTTTTGCAACTGAACACAAAGAGGAAGAGGAGGAAAACACAAACAGAGAAGCTAATATGGAAAACTTTACAATTGAACAAGTGGATTTGACAAGCTATGAAGAACAAAACTTAGGTAAAGCTGCGAAAACTATAAGAAACTTGGAGGATGGCGTCACTGTAACTACCTTTATTGATCCTTTGAGTGTAAAAGCTGAAAAGTTAGCGGAAAGAAAAGTCATACAAACAAAGAATTTAACTCACAAATACAAGCAATTTAATAACGATGGGGCTGCCACCACATCTCCCATCAAAACGGAAttgataaaagaaaataacaccCTTTGTAGGGTGGTATTGAAAAGGAAACTCTCCCTCTCCACGGAAACATCATTAAAAGTGTGCAAAATCGAAACCAGAATGAACCCCTTCAAAGAGTCCCCAAATGCACTAACAAATGATTCACCCTCCAACCTAACGCAAATATGTTGTGTGAAAAATTGTGGCAGCAAACAGAAAGACTCACCGGTTCAATTTACCGAATTTCCCAAAACAATtgccatttataaaaaatggttGCAAAACTTGCGCATACCGCACTCGCCCACTGTGCGACATTATTATCGCGTCTGTTGGCAACACTTTGAAACCGTTTGTTGTGGAAAGAATGGTTTAAAAATTGACTCAGTGCCAACATTAAAATTAGGACATAACAACACAGACATACATCCCTGTTTAGATGAAAATCTACCAAACACATCTAACCTACCACATCAGGGAGCTTCAATGCTAATGCAAACCCCAGCACTGGCACTGGCACAACCAAAAccgaaatataatataaagaagtGTTCATATCCAGAGTGCaaaagtaataaaacaaaattgtacgATTTGCCGCCATTTCAGAAATTGTGCGAAAAATGGTGGCAGTCCATGCAGCTGTATTCCGAGCATGACAAAAGCCAAGCAAAAGTTTGCGACATACACTTTTACATGTTATACCATCAACATGAAGATATTGTGTATGCCATAAAGCGAGAACAGCCGGGCAAGTATGGGGAATTGAAAGAATTGTACGCAAGTATAGCGGCCAGGGGGAAAGTGATACGTCATAGATGCATTGTACCGAACTGTACAACCGATTATCTAATCAACAGTAGTCTGGATATTAAACTGTACAACTTCCCCTCCGAATACCAGCTGGCGCAGAAATGGTGCAGCAATTGCCAAATCGATTATGAGACCTGCATCAATGGGGATCGTGATCATAACTACAAGGTGTGTGCTTTACACTTTGAAACCTATTGTGTGGGTAATGACTTGAAATTATACAACTGGGCAGTACCGACCCTACGGTTGTTACATGTTGATTATAACCAACTATGTACCAACAATGCGGATGATGTATTTTCCACATCAGGCCGTTGTTGCATCAGGGACTGCATTAATGAAAATGGCTTAAAAACCAACACAAGATTCTATCGACTGCTTGACAAGTGGATACATGGCATTGATGCGAAACTGCTTAATTTGAATGAGTTGCGTATATGTGGTGTACACTTCTCGAAGCGATTCTTTAGAAAAGATAAAAGTTTAACAGCCCAGGCCACACCGTATCTACAAATAAATCCAAACAATGCCAATTTGCATCATCATACGCAAGACACAAGAACAACAGCTGCAAAGGAGGAAGCTGTTGAGCAAGTGGACGATTCTACAAATGCTGTTGTCACTGTTAAACAAGAAATTGAATACTGGGATGATTGGTACAACAGTCAACCAATAGACAGAGACGAGAGCATTGTCACACCTCAATacgaaattattaccataaagGAGGAAATCATTGATGATACGTATACCAATGACTTTCAAATGAATACGTCGTATGTAAATAccaatgatgataataataataaggatAATAATGATAAGGAGACTAGGCCACAAATTGTCGCCTGTTACTCACAACATCTACCCATCGATGAGACATTTATTAAGCAAGAAACTGATATAGATGAGAGAGATTACAAGATGGAGGAATCTATAATACCTGAACCAATCACTTTTCCATATCATAATAATGATGAGTCCGAAGCGATGCAAGAAAACGATCATCACACAAATAGGAAGAGTATTGTAATGCCTTTAGAAATAATACCAACCATTAGCACAAATGAACTGAATGAGAATAATgcgaatgtaaaaaaaagtaaaacgaTAAATGAGACAAATGCGCAAAGCACAGAATTATGTGAAATCAATTCAAATCAACAATTAGCAGACACAACAATAGCTACAGTAAGCAGCCCTTTAATGTCGATCAACAGCACACTTAACCAGGATGTTAACttcaataaaacaacaacaacaacaacatcatatgAAAATCGTTTCCAtcatattattacaacaatatcattaaatacaaaaaatgttgcTGATGAAAATCATAAAACATCTACAGCCACCTCTTCAATCCGTGTAGTGGCACCGGCCGCCTCCAACCAATCAATGATTAATCCCGTTTTTAAATTACACATTCTAACATGTTGTGTGGCCAGTTGTTTAAATTCCACCCAAACACCGCTAATTAAACTTTACACGGAATTCCCATCCGACTCGGatctatttataaaatggtgtttcaatttgaaaattgATCCGCGTCACTATAGAGAGCACTTGTATGCGGTGTGTAGTGCTCATTTTGATAGTGTTTGCTTTAAAGAGAGCAATCGCTCCTTACAGCCCTGGGCTGTACCAACATTAAATTTAGGTTTGCCCCACAATTCCTTCATACACCAATACGATATGCCGCATAGTTTGAAAGCAACAAATGAACAACAATGCATTGTATGGGGTTGTCATCAATCGCAAACACCTTTCTATCCATTTCCGGCTGATCCCCAGCAGTCCCGTAAATGGTTTACCAATTTACAGTTAGAATATACCGAATTTCGTGCACAAACGTATCGTGTTTGTCGCAAGCATTTTAACAATTCATTAATCGATGAGCATGGCCAGTTAGATAATGAGGCTTTGCCCACCTTAGATTTGAACCATAATAATAGTGATAATAATAGTGTCGGGTGTGCTCAGTCGCATAATGTTGATAGTGAAAATTTTAGGGCAGTACGTTTGGCAGCCGCTTTGGCACCACAAGATTTAGAAGATCACGACAGTAGTTATTATGAGGATTTTGAAGAGTGCCTGCAACACAATGAAcaggaaaattga
Protein Sequence
MSHQHHHHHHHHQQHQQQYQRKQQQQQQQQHPFHIYAQPQTQHQQQPPHQQWYAHINDIQPQQQQQQQQQHQQYIAGHVRDSRHLHAPHIYGSMLGHSSGAYMAGASMQSMGRHAYNMPGTTATSVPYTFPHNPIRGDNNIVHTRNYDLEMVNRAHSIPSTTHTMIGGDNHRSYDAYSHNAAIFTQQQQQQQQHQQQQQQQQVRLHHHHAVHPIHHQSHQQQHSIPGHQATHHHHHHQQQQQQQQQQRHQQHIQQQLVPNPMHNMKTKPVEEITITPTIQMDEIIIKAEPPDEYIYHRNLQQQHQQQQQQPSYSVLKCIKQEPQAQPQLLQQQQQQQQHHHQQQQQRQQQHQLAQQRQSKQLQSPPPPQPPPPPPPQAPPAATVTSIATTPPPEASSPIDEKDIKPMNFPRRKVQTERSSTLPICHRCKQVFLKRQNYTQHVALSTCDMVEYDFRCSICPMSYMSNEELRAHERLHRLYRYFCMLNCGKHFETIQECEHHEYMEHDQYMYKCGMCGLDYPTREELLIHRKCHKYATRFVCTICRGWFRNMSRLHNHYLNNPYRCGKFYNKDDFNACATETSIKFRKVSTDTNGEHDSKQSTDDLNASDDIVEEMDTAPPDESSSDEEVETEIKVEPDFYPPMDQTDYQSEEFSTNQNLNFLHDFQENASNSTNSSYTMGGSEAIAGDQDVVCCVKFCGVTKHASPSLQFFNFPREDKYLQQWLHNLKMPYDPQVKYTQYRICSLHFPKRCINRYSLSYWAVPTFNLGHDEVANLYQNREISNSLLSSDAARCCMPGCRAQRGQTNVKFYNFPKDLKTLIKWCQNARLPVHTKESRHFCSRHFEEKCFGKFRLKPWAIPTLNLGTVYGKIHDNPNVSYLEEKKCCLTFCRKSRSDDFSLSLYRFPRDESMLRKWCYNLRLHPDVYRGKNHKICSHHFIKEALGLRKLSPGAVPTLNLGHNDRINIYDNELHPPSNVASTSYSAKAAAASALTASIRKAQLLKYRNASQSANASASSIYDEVLSNSQKFSSSSSSVASNAIELGDVCLVPSCKRSRHTENVTLHTVPRRPEQLEKWCHNLKMNLNGLHKNARICSAHFENYCIGGCMRPFAVPTLELGHEDSDIYRNPDVIKKLNIRETCCVPSCKRNRDRDHANLHRFPTNPDLLQKWCANLQKSIPDGTKLFNDAVCEVHFEDKCLRNKRLEKWAVPTLKLGYEPIPHYLPSQEEIDEYWSKPMAPNNGDETGECCVVTCKRNPQMDDVKLYRPPEDAEQLVKWSHNLQIDVTKLSTMKICNLHFETHCIGKRLLNWAMPTLNLAAKVEHLFENPPPTISHYRRRVKLGLKAEHDLIKWSPRCCLAHCRKMRSRDNVQLYRFPVNLNTMSKWCHNVQLPVVGSSHRRICSAHFESSVLTKRCPIATAVPTVNLNTPPGYKIYQNSPKLKQHKIFSQRWCVVSTCRKTRSEGVQLFRFPHNRIILTKWRYNLKNLPKGKLSSQFRICSLHFENHSIGMKRLSPGAIPTLNLGHEAEDIYPNETRSFFDLDKCVVNGCSSSKETENIRLFKFPGDDEELLGKWCHNLKMNPNDCIGIKICSLHFETDCMGPRLLYKWSIPTLQLGYENAEDAPEIIPNPPIEKRCGEVLFKCCLPSCGKTRKYDEAQMNSFPKNVKMFRRWQHNLKLDFLDFKDREKYKICNDHFEAICLGKTRLNFGAIPTINLGHNLTNDLYKVNPQKIWPNLFTKQSEYEKELYEGENKRTDGEYSRLEGEVGEEEEVVVETSNIPMNFTCLFNECNGPKSLMREPYDIPQTEQLKTLWCTLMKVEPNSISGDNKLCGLHFQQLFNETREQMLALSTEDTDMKRDFEKLDYAYKKSEISLVIKGSKCSIQDCYKTLVEPHVKLYQFPYGKELIEKWSHNTNIKPDEHRRYLTKICCLHFEAYCFTPNQRLRTWAIPTLNLTNAPTKTIHKNPDLTTLDRRLVGPPILKCCVPNCTQENNETIEGNKLFSFPMDDNILQKWCENLKLSREQTPIFKICAQHFEKQCFGLSRLRSGAIPTAKLGHNEEPIHHNQSSLKEEMYEPKVDQNVGLGLKQAKIKKSLDSMKCFIPSCRRSRLQHGVRFFTFPSNPVLRHKWCHNLQMPANFGKLLSIRICSIHFHKKCLEGKNLKDWAVPTLHLGHNESIYDNPRAMRRHNIPKCILPHCGEQRSHGKELRFFTFPKDNQILKKWCKNLKLSGEQCKGRHLCEKHFEAKVLSYKRLKTSCVPTLNLGHSEPLVYNNVALLEDRQHSLTDANANEAEDIDSLELDLEEAEGNDFETESLRTPISWSNLESRELRVKITPLKHEDLTDIASICSSMSKEKEENDSIYSGCESREDTATLSANRKSKTVNSFNAICCLKHCRKEKTPEQHLTTYGFPKDLELLRKWCDNLGLELNQCIGRVCVDHFELRVMGRRRLKPGAVPTLNLGHDRPLKHTNDAIKLKMNEKSKTSEQTEDKFSSPEPKLTPPPYKTRPTKQSVFRLCCLKHCRRKKDLDTSNKEVPIVFKFPQEPKLLKKWSEALHIPLQQCTRPNLGLCADHFETHCFENEAKYQLKANSVPTINLKEDFQKKEEICCLKHCASSTKCHDNVFLLSFPLKPQVLIRKWCYNTRISHKVKGLKSLKICSLHFEKQVFFRGCLLRINAVPTINLGHTGKIYKNPKSYRIINVQKPLEKCCIVSCQQESDKLYSFPKNSELRRIWSNNSGIETRLALKQQLKLCKRHFTADSFISGGDSLKLEAVPLLYLDVDKSQHLVLDMSTMIQNNPHCLIHNCGCIPSVDKVKLYPFPQEKEVLEKWLFNLQLPENYAPENAYICSRHFDKACIQRGLLHKSAVPTIFLGHSGGFYRNGDDIFNTPCAVPHCKYDLNEEDDGAHDVRLMYKFPKDSQRLKKWLDNIRITDDVYQKQKNRRICSEHFEEICKVAGKDTLLPHSVPTLNLGYNPAEVPHRNHHQNCCFDSCKLKDKYSSMTMHKLPQNEKMRALWLEELDSKDDAISKQFLCAAHFLAIYERVKEKHKVFVKQLNEYEALSNVYRDLKQSDLLQSFKCSIPQCSTGFKQTIKLFKFPMDGNLFNKWQHNTGLQFDMSQRSCHLMCALHFEPRCLCEVQLHRWAVPTLGLPTSNSLYVNPPEALPSDHENLQHCCVSSCSSTRGPFFQFPTKQINLKRWIHNLGLGTQQCTTNLRVCYKHFEKYCFMKREDQALTLKIWSIPTLKLPANQDLYKNPIDKVCYFSCSVPGCKQIRNSTEGIYFYCFPKNKTLERNWLLNTGIKPGNFREDMRICSLHFEPECFLKDSMQLRKHTVPTLKLRTANNLLHKNPIRKKLLIKNTTEKCLVKSCTYAGDTLYDLPKNISELKSLGRNLELEDVPLDQLERNLKICKKHYNEELLKRSDEMKAKPLTAATMDPESFATEHKEEEEENTNREANMENFTIEQVDLTSYEEQNLGKAAKTIRNLEDGVTVTTFIDPLSVKAEKLAERKVIQTKNLTHKYKQFNNDGAATTSPIKTELIKENNTLCRVVLKRKLSLSTETSLKVCKIETRMNPFKESPNALTNDSPSNLTQICCVKNCGSKQKDSPVQFTEFPKTIAIYKKWLQNLRIPHSPTVRHYYRVCWQHFETVCCGKNGLKIDSVPTLKLGHNNTDIHPCLDENLPNTSNLPHQGASMLMQTPALALAQPKPKYNIKKCSYPECKSNKTKLYDLPPFQKLCEKWWQSMQLYSEHDKSQAKVCDIHFYMLYHQHEDIVYAIKREQPGKYGELKELYASIAARGKVIRHRCIVPNCTTDYLINSSLDIKLYNFPSEYQLAQKWCSNCQIDYETCINGDRDHNYKVCALHFETYCVGNDLKLYNWAVPTLRLLHVDYNQLCTNNADDVFSTSGRCCIRDCINENGLKTNTRFYRLLDKWIHGIDAKLLNLNELRICGVHFSKRFFRKDKSLTAQATPYLQINPNNANLHHHTQDTRTTAAKEEAVEQVDDSTNAVVTVKQEIEYWDDWYNSQPIDRDESIVTPQYEIITIKEEIIDDTYTNDFQMNTSYVNTNDDNNNKDNNDKETRPQIVACYSQHLPIDETFIKQETDIDERDYKMEESIIPEPITFPYHNNDESEAMQENDHHTNRKSIVMPLEIIPTISTNELNENNANVKKSKTINETNAQSTELCEINSNQQLADTTIATVSSPLMSINSTLNQDVNFNKTTTTTTSYENRFHHIITTISLNTKNVADENHKTSTATSSIRVVAPAASNQSMINPVFKLHILTCCVASCLNSTQTPLIKLYTEFPSDSDLFIKWCFNLKIDPRHYREHLYAVCSAHFDSVCFKESNRSLQPWAVPTLNLGLPHNSFIHQYDMPHSLKATNEQQCIVWGCHQSQTPFYPFPADPQQSRKWFTNLQLEYTEFRAQTYRVCRKHFNNSLIDEHGQLDNEALPTLDLNHNNSDNNSVGCAQSHNVDSENFRAVRLAAALAPQDLEDHDSSYYEDFEECLQHNEQEN

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00384614;
90% Identity
iTF_00384614;
80% Identity
-