Basic Information

Gene Symbol
-
Assembly
GCA_951800035.1
Location
OX637533.1:11202253-11226947[+]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 33 4.2e-15 4.1e-12 48.5 2.0 1 86 811 883 811 884 0.84
2 33 1.2e-14 1.2e-11 47.0 5.0 1 87 911 980 911 980 0.80
3 33 8e-15 7.8e-12 47.6 0.3 1 87 1001 1073 1001 1073 0.83
4 33 2.9e-12 2.8e-09 39.4 4.8 1 86 1155 1223 1155 1224 0.79
5 33 1.7e-14 1.6e-11 46.6 6.2 1 86 1248 1319 1248 1320 0.80
6 33 2.4e-11 2.3e-08 36.4 0.8 1 87 1355 1423 1355 1423 0.81
7 33 8.6e-11 8.4e-08 34.6 2.4 1 85 1464 1532 1464 1539 0.73
8 33 8e-15 7.9e-12 47.6 0.6 1 86 1561 1630 1561 1631 0.81
9 33 9.4e-14 9.1e-11 44.1 1.1 1 86 1653 1722 1653 1723 0.78
10 33 5.7e-13 5.6e-10 41.6 2.0 1 87 1751 1823 1751 1823 0.85
11 33 3.5e-06 0.0034 19.9 0.1 1 63 1887 1944 1887 1964 0.70
12 33 7.7e-11 7.5e-08 34.8 0.3 1 87 1984 2056 1984 2056 0.80
13 33 1.5e-12 1.5e-09 40.3 2.4 1 87 2087 2158 2087 2158 0.80
14 33 1.9e-13 1.9e-10 43.2 5.1 1 86 2203 2275 2203 2276 0.84
15 33 1.2e-12 1.2e-09 40.6 0.9 1 87 2298 2366 2298 2366 0.80
16 33 2.3e-13 2.3e-10 42.9 0.3 1 87 2683 2752 2683 2752 0.80
17 33 3.5e-12 3.4e-09 39.1 4.2 1 86 2810 2899 2810 2900 0.75
18 33 3e-11 2.9e-08 36.1 0.2 1 87 2932 3004 2932 3004 0.76
19 33 6.8e-12 6.6e-09 38.2 1.0 1 87 3032 3101 3032 3101 0.82
20 33 6.3e-12 6.2e-09 38.3 0.8 1 85 3121 3189 3121 3191 0.78
21 33 1.1e-12 1.1e-09 40.7 1.5 1 87 3214 3285 3214 3285 0.80
22 33 2.3e-05 0.022 17.3 0.2 1 60 3301 3349 3301 3380 0.76
23 33 1.3e-11 1.3e-08 37.2 3.2 1 86 3389 3458 3389 3459 0.80
24 33 4.4e-11 4.3e-08 35.6 5.6 1 86 3484 3554 3484 3555 0.82
25 33 1.5e-12 1.5e-09 40.3 3.5 1 86 3575 3647 3575 3648 0.78
26 33 8.8e-12 8.6e-09 37.8 4.9 1 86 3668 3737 3668 3738 0.82
27 33 4.2e-13 4.1e-10 42.1 0.9 1 87 4032 4109 4032 4109 0.81
28 33 1.2e-06 0.0012 21.4 2.2 1 86 4128 4197 4128 4198 0.72
29 33 0.0046 4.5 9.9 1.8 1 62 4242 4323 4242 4343 0.62
30 33 4.9e-13 4.8e-10 41.8 3.8 1 87 4361 4435 4361 4435 0.85
31 33 2.6e-13 2.5e-10 42.7 2.1 1 85 4460 4535 4460 4537 0.78
32 33 1.9e-11 1.9e-08 36.8 3.2 1 87 4665 4738 4665 4738 0.79
33 33 8.9e-11 8.7e-08 34.6 0.2 1 87 4763 4833 4763 4833 0.81

Sequence Information

Coding Sequence
ATGTCACAAAATAATCAACGCAAACATTATCACATTCATGCTCCCTATCAACACccccagcaacaacaacagcaacaagctCAACAGCATCATCATGGTCACCATATAACGCCGtcgcagcagcaacaacaacatcaacaatgGTATTCGCAGCAACATTACCAACATGGTCTTCATTTGAGAGATTCGCGCCATATTCAACATCCGCAACATCATCCCCATCATCATCATactcagcagcagcagcaacagcaaccgCATCACAATCATACAATGTCAACACACATGTTTACAAGTGGTTATGTTGGTATGACACCAAGTGGGGGAGTTATAACTAGTGGTGCGGGTGGTAGCAGTTCTGGCAGTGGGGTTGGTGTAACTGGTACAGTACATAATACGGCAACAATGGGTTCTGCTGCTTCACATAATATGCCGGCTTCTTCCTCCTCTGCACATCATTATTCTTCCGTTTTGCCGGCATCTGGTGCGAGTGTTGGTGGCAATGCGGCTAATGCTAATAGTGGTGGTGCTTATGTTAGTCGAAGTAGAATCTTTGACCTTGAGATGTTGGCAACACAACCACAACAACAGCAATCTCAACATAGTACAGCTTCTGCTACACATTCACATTCTATGTTAACGAGTGGACGAGCAGGATTTGATGCATATTCGCATAGCTCATTGTATGCACAGCAGCAAAATCAACGGCATATTTTAGCTTCTGGCTCATCGCATCACCACCATGCCACTACACACCATTCTACATCTAATGCCTTGCATTCTCATCATCATACTCAGCAACTGCATCATCCACACCAGCAACCACAGTCTGCTCTGCATCATCATCAACACCATCAGCAtccacagcagcagcaacaacaacaacaacattattacCACCATTCTTCGCTGCACCGGCCGCATACACAACTTATGCCGCCAGCGCTGCAGCATATTAAATCTGAGCCAGTAGAGCAAATAGCCCCAACACCGTCGATACAAACCGAGGAAGTCATCATTAAATCTGAGCCCGTCGATGACATTGGttatcattataaaagtgcgccacaatttgaaaacaaatctTTTCTCATCGCTGAAAAACGTAAACAAGATGAGCTGCAACAACTGCGGCTGCAACAACGGCAGCATCAAAAACAGCAGGAACTACAACAGCTGCggcaccaacaacaacaacaacagcagcaccaGGAGCATCAACGTGTAAAACACCAACAACAATTACAAGAACAACAACTTCATCAACATCAActgcagaaaataaaacaagagcATTATCATCAGCCTCAGAGTGAACATACTGATAATGAAGAAGTTTCCCAACAAACGCAACAACATACAAATTCCGAGAATTCTTCAATACTGCAACGTGCAGCAGCAGATGAAAAACAAGAAGAACAACATCAGCCGCAAATATCCTTgacaaatataaaaacagaagcaaagCCCCTTAACTTTCCTCGCCGCAAATTACAAACAGAACGTTCCTCAACTCTGCCGATATGCCAACGATgtaaacaagtttttttaaagCGTCAAAACTATTCACAACATGTTGCTCTATCCACTTGCAATATTGTGGAGTACGACTTTAAATGCTCCGTATGCCCCATGTCCTTTATGTCTAATGAGGAACTGCAGACACACGAACAACTACATCGctcaaatagatatttttgtcaaaaatattgTGGCAAATTCTATGAAACAATTGATGAATGCGAAGAACATGAATATGGCCAGCATGAATATGAAATGTTTAAATGTAATATTTGTTGTATAAGTGTAACACATCGTGATCAATTATTAGAACATCTTACTGATCACAAATATCAGCCACGTTTCGATTGCTGTATATGTCGGTTATGTTTTCAAACTTCAATTGAACTGCATGAACATTATCTGGCCAATGAAGATTTTTGTGgtaaattttatgacaaagaagCCTTTAAAAAACCTAATACCTCGTCATCTTCGGCTTATCTGGGAAAGCCGGAAAGTTCGAATCTGGAAATAGCTAATACATTTTCGTTAAAAGATATACCTCCTGGCAATAGTCATCACCTGGCGGGCTTGTACCCAAAACCTGCCAGCTCAAAAACTTCCATGGAACCACCTAACACACCAACTACAACATCGTCTTCACCTTTTATAACGGTAAATGAGTTTGCCGCCTTAGAGCCCCATATTGAGgtaaaaactgaaattaaagTAGAGCCTGATTTTTATCCACCAATGGATCAATCTGACTTCGCTGCCTATGACAATGATTACGGGACACCCGACTATACATCTAGTtctaatcaaaatttttcatttctacaTGACTACCAAGATAATGCTTCCAGTTCCACTAATTCATCGTATTCCTTTAACAATAACGATGCCATACAAGATGAAGCTGCCATTTGCTGCGTACCCAAGTGCGGCACACGCAAACAATCTTCACCATCCTTGCAGTTCTTTAGTTTTCCCAGAGAAGAAAAGTATTTGTCACAATGGTTGCACAATTTGAAAATGGTATACGATCCCAATGTAAATTATTCCATATATCGTATCTGTAGTCTACATTTTCCTAAACGTTgtatagcgaaatattccttaaGTTATTGGGCTGTGCCCACGTTTAATTTAGGCCATGACGACGTGggaaatttatatcaaaatagaGAAAGTTCTGGAGGGTTTCCAGGTGGTGAAATGGCCAAATGTAGTATGCCGGGTTGCCCGTCCCAACGAGGAGAAACCAATGTAAAATTTCATGTCTTCCCTCGGGACTTGAAGACCCTAATTAAATGGTGTCAAAATTCACGACTGCCAGTACACAGTAAAGATAATAGATTTTTCTGCTCTAAACATTTTGAGGAAAAATGTTTTGGCAAGTTCCGCTTAAAACCTTGGGCCATACCTACTATAAATTTGGGTACGATTTATGGTAAGATACACGACAATCCCAATATTTATCAAGAAGAGAAGAAATGTTTTCTGCCATTCTGCCGTCGCAGTAGATCCTATGATTGCAATTTGTCTTTGTATAGATTTCCAAGAGATGAAACTTTGTTGCGCCGCTGGTGTTATAATTTAAGGTTAGATCCCAACATGTACAGAGggaaaaatcacaaaatttgCTCGTCTCATTTTATTAAGGAAGCTTTAGGTTTAAGGAAACTTAATCCAGGAGCAGTGCCCACATTAAACTTAGGTCATAATGATAGATTTAACATATATGAAAACGAACTATATACACCACCGCCACCACCTCCGCCACCACCTCAACCTTCTACGTCATCAAAGGCCCACAAATATGCCCAATTATTTAAGCAAGAAAAGGAAGGTTCTTCCGGATCACATATCTACGATGGTGTATTCATGAATTCCATGGTGCAAAAATTCTCTTCTGCTTCTTCGAACAGCTCCAATAACCTTGACTTGGGAGATGTTTGCCTAGTACCATCTTGCAAAAGGACCCGTCATTGCGACGACATTACTCTGCACACCGTACCCAAACGGGCAGAACAGCTTAAGAAATGGtgtcataatttaaaaatgaatttggttAAAATGCATAAAAGTGCCAGAATTTGTAGCGCTCACTTCGAAAAGTACTGCATAGGAGGCTGCATGAGACCGTTTGCTGTTCCCACTCTGGAGTTGGGACATGATGACACAAATATATTCCGTAATCCCGAtgttataaagaaattaaatattagaGAAACCTGTTGCGTGCAATCTTGCAAAAGAAATCGGGACCGTGATCATGCAAATTTGCATAGATTTCCCACTCATCCAGAATTGCTGCAGAAATGGtgtgaaaatttacaaaaacccATTCCGGATGGCACTAAACTTTTCAATGATGCAGTTTGTGAGGTACATTTCGAAGATCGATGCTTGCGTAACAAGCGTTTGGAAAAATGGGCCATACCTACAACGAATTTGGGTTGGGAAGGAGCTCCCCACTGTTTGCCTTCAGAAGAAGACATCAACGAGAACTGGGTAAAACCTTTTGCACCTAACAATGGCGATGAGCAAGGTGAATGCTGTGTTAGCAGCTGCAAACGCAATCCACAAATCGATGATGTCAAATTGTACAGGCCTCCCGAAGACGCCGAGCAGTTGGTTAAGTGGGCCCATAACCTGCAAGTAGATGTTATGGACTTGCCGAATTTAAAAATCTGTAACTTACACTTTGAACAACACTGTATAGGGAAGCGTTTGTTAAATTGGGCTATGCCTACCTTAAATTTGGGTGGCAAAGTGGAACATCTGTTTGAAAATCCTCCACCTATGCCTGCCATTTATAAGAAGAAAATCAAACCTGATAGACTTTTAAGCAGTCAGGAAGGCATTAAATGGTCTCCGAGATGCTGCCTGCCACATTGTCGTAAAATGCGTTCTGTAGACAAGATTCATCTCTTCCGTTTTCCCTACAGTAATCGCCAGACTTTGGCAAAATGGTGCCACAACTTGCAATTGCCTTTGGTGGGCAGCTCACATCGCCGTATCTGTTCCAGTCATTTTGAATCATCTGTTTTAACGAAACGTTGTCCCATGACATTGGCTGTACCTACGCTAGATCTGAACGCCCCACCGGGTTATAAGATCTATCAAAATCCTGCTagactcaaacaaataaaaataggaCCTCAAAGGCATTGCGTAATAGAATCTTGCCGTAAAACTAAATTGGATGATGTTACACTATTTCGTTTCCCCAACAATAGATCGATTTTGTATAAATGGCgtcataatattaaaaattggcCTAAGGGCAAATTGAGTTCGCAAATGAGAATTTGCTCCCAACATTTTGAGCCTCATTCAGTGGGAGTGAAGAAACTGTCACCCGGTGCTATTCCCACTTTGAAGTTGGGCCACGATGCCAAGGATTTGTATCCCAATGAAACAAGATCGTTCTTTGATTTGGAGAAATGTGTAGTTAGTGGCTGTGACTCCCGCAAAGATATGGAAGATGTAAGACTGTTCCGTTTCCCGCGAGATGACGAAGAATTGCTTAAGAAATGGTGCAATAATCTGAAAATGAATACCAACGATTGTGTGGGCATCAAAATATGCAGCAAACACTTTGAATTAGAATGTGTAGGTCCAAGACAGTTATACAAATGGTCCATACCCACTTTAAAACTGGGTCACAAAGAAGACGATTTGGTGGATATAATACCAAATCCACCGCCCGAGCAAAGAACCGGGGAATTCCTATTCAAATGTTGTGTACCCTCATGTGGAAAGACACGCAAATATGATGACGCCCAAATGAATAGTtttcccaaaaatttaaaattattccgCAAATGGAAACATAATCTGAAGCTAGACTTTCTCAATTTCaaagaaagagaaaaatataaaatttgcaatGATCATTTCGAGATGGTTTGTTTGGGAAAAACTCGGCTAAATTTTGGTGCTCTGCCCACCTTGAATTTAGGACATGATGAGTTGGatgatttatatcaaattaatcCTGATCGAATAAGACCGAATTTGTTTATTAGACAAAAAGACGTGGAAAAACTAGAAAGAAGGAGGATATTGAAGGAAGAAAACTCAGAACAATATGATGGTGAAGAGCAAGACGATGATCCCTTGGGCTTAGAACCCAGCGACGTAAAATGTTGTGTTACTGAATGCACTGCCCCCAAATCAATAATGAGAGAGCCTTATGATTTGCCGGAAACTAAAGAAATTAGGCAGATGTGGTTGAAAGAATTTGAGAAGAATGAAGATGAAGTTCTCCTACCAGAAGCAAAAATATGTGGCTTGCATTtccaattaatatttaaaaaattggaaagCCAAATGTTAGAAATGCTAGAAGAAAGTGAGAACTTGAAATCAGATTTTAATAAACTGCAATACAATTACCAAAAGTCGACTATATCTCTGGTTATTAATAGTTATCAGTGTAGGGTGCAAGATTGTCCCACCAATTTGCTTAATTCCTCTATACGACTGTATTTCTTTCCCTATGGCAAACAGCTGGTGAACAAATGGTCTCAGAATACAGGCATAATACCCGATGAGAATCGCAGATATATGAACAAGGTATGTGCCTTACATTTCGAGTCCTATTGCATTACGGAAAATCAAAGATTGAGATCATGGGCCATACCCACTATCAACTTGCCTTCCTTTGAAGAGGAACGACGTTTGTATAAGAATCCTGATTTAACCAAAATCGACAGAAGAATGCTGGGCCCTCAAATCCTGAAGTGTGCAGTCAACAATTGTAATTATCTGAAACTGGTAGATGATGAATCACTCAAACTTTTCAACTTTCCCACGGACGACAAACTGTTAAAAAAATGGTgtgataatttaaaaatgtctcACCATTTTACACCCTTGCTTAAAATATGCTCATTGCATTTTGAGAAAATATGCTTCGGCAGCTGCCGCATACGTTCTTGGGCCATACCCACCTTAAATTTGGGCCATAGCACTGCTCCTGAACATCTAAATAAAACCACGATAAGGCAAGAAGTCTATGACGCCCCTGAAGATATTTCCGAAATACAACtgaaacaagttaaaataaagaaatcacTGGACAGCACCAAATGCTACATACCCAGCTGTCGCAAGAGTCGACTAAAGCATGGAGTACGTTTCCACAGTCTGCCCACAAACTTAAAGATGAAACGCAAATGGCTGCATAATTTACAAATCAGGCATTTGAAGGCCAGCCAAAAAGtgcataatattaaaatttgcaaTCTACACTTCCACAAACGTTGCTTAGAAGGCAAAATGCTAAAGCCTTGGGCAGTGCCCACAAGGCATTTGGGTCATAGTGAGGCAATTTTTGATAATCCCCGCAAAGTAGGGGCATTTTCGTCATTACGCTGTGTCCTCACACACTGCAAAAACCATGCGGCACCACGACCAGTACGTGCATTTATGTTTCCAAAATTGCCAGAATTTCTGGAGAAATGGTCGAAAAATTTGAAGCTGGAACTGGAGAAGTGCAAGGGCAAAATATGTCATGAACACTTCGAAGAAGAAGTTTTGGGTTTGAAAAAGTTGAAAAGCGGAGCGGTACCAACTCTTAATTTGGGGCATGACGATAAAGATATTTACGATAATACGGATTTAAtcgagaaatttaaattaaaagaagtgGAAAAAGAGCTAAGCAAGGACTCCTGCCAAGTAAAAATAACGGAGGATGATGATTTCGAAGAGGAATTTGAGCCCCGGTCCGAGGGTGAAGAGGAGGAGGAAGACGAGGAGGAAATCTGGGAGTCAGAAGAAGGCGAGGAAGAGGAGGAAGACGATGAACAAATACATTATGATGATGATGCGGAAGAAGAGGATGAAGACGAGGAAGAGCTTGACGCAGATAAGTTGGAGAAACAGCAAGAAGACGATGAAATCAGTGTATCAAATTCGATATCTGACTGGAGTTCTGTTAAATTTAAAGAACTTAGAGTGTCTATAACACCCTTAACATCTGAAGATTTAATGGATTTATGCTCACGTTCTTCATACGAACGAGAATTCGGTTCTTTAACACCGGCAAACAATTTAAGAGGTCGCAGATCGGCTACACCGGCTTCTTCAAACTGGAAAGATTTGCGCTCGGAGACTCCTGATCAAAAATCCAATAGATCAGACACACCAGACAAGAAAGCATTTCATTATTTTAGAGAACCTCGCTCTGTTTCACCTGAACAAAAACTTGATAATTTTCTAGAGCCTAAATCTTCCGAACGAGTCTCTGACAGTCCGAAGGATCCTTTAGGTGAAAACTTAGAAGAGGACTTAAGTTCGAAAACTCCTAACCAGATAGAAGCACTTGTTTTTGCAAACGTAACAAAGTCCGAACTTGACTTAACATCCAAGTGTCTGAAAAGACAAAAACCACACAAACTCCTCGAGAGTTTCAAAAAGGAACGTTTGGAGATGTCAGAAGATGAAGCCACAAATACTTGTTCGCCAAATGAAATGGACACTACGAACTTGAGGACAGATAAAGCCCTCAATGCGGTGGCTCCCATGTGTTGTTTGAAACACTGTAGCAGGGAAAAAACACCAGAACAACATTTAACCACCTATGGTTTCCCCAAAGATCCGCAACTTTTGCAAAAATGGTGCGACAACCTGGGCTTGCAACCCGAGCAGTGCATTGGACGTGTCTGTATAGATCATTTTGAATTAAGAGTTATCGGTGCACGCCGTCTGAGAGTAGGAGCTGTGCCAACTTTGAATCTAGGACCACATCGAATTGCCAAGCACACTAACATGGAGGATACCGCTCAAAAGAAAAGTGTAACTAAAGAGTTTTCCGAAACAGGAAATATGCAGGAGGCGGACTCAAGTCTAAAGCCCCCGCCACCATATAAAACCCCCAAGCCCAGTAAGCAATCGGTTTTTCGGCTATGCTGCCTCAAACATTGTCGGCGCAAGAAAGTTTCGAACCTCGAGAAGAAAGACAAGCAACTGACGAAGGAAAGAATGGATTGTCAGGAGAAATCACAGGAAGCCTTCTTTAAATTTCCTACTGAgccaaatattttaaagaaatggtACAAAAACTTAAGATTGCCTGAGAAACTAAGCATAACAACGGAGCTAGAAATATGTTCCAAACATTTTGAATCTAGtgttataaaaaatggaaaactgCATCCCAAAGCCGTGCCTACGTTACAGCTAAGTTATGCGAATAGGTCTCCAATTTATACGAATAATCAACAAGACTTTAATGGTTTCGACCTGCAAGTCAAACAAAAGTCCAAGCGAAAATGTTTTCTTTCTCATTGCGGTAATAAGATATCGGATCATATATTCTTGCTATCATTTCCGGAAAACGAACCTCTGACTTTACGGAAATGGttcaaaaatctaaaaataaatctgaAACGTGAAGAATATAAAAACCTGAAAATATGCAGTGCCCATTTTGAGCCGTATGTGTTCTtcaaaagtatatatttacgtGCCGGTGCTGTGCCAACAATCAATTTGGGACACAATGAACGCTTTATAAGGAATTGCCAGAAATTGCGTTTAAAAAGGGAAAATGTTCATACGATTCAGGAGAAATGTTGCATAACTGAGTGCACGGCCACAAATCTCAAGCTTTATTCATTTCCTCGTAGCTCGGAATTACGGAAAATTTGGTGCAACAATTTGCAAATTGAACTGCGTCAGGCTCTTAATAGTCATAGCAAATTGTGCGCCCATCACTTTACACCCGATAGTTTTATTGTGGGCACAGAAAATCTCAAGCTAAATGCTGTACCTGTACTAAACTTGGGCCTAAAGCACGACAACCATGTGTTAATAACAACAAATCCAGCTGAAAGCAAGTGTATAGTGGAAAACTGTCAAAGGACTCCGAGTGTAGACAAAGTGAAATTGTACAAGTTTCCGCTAAAGCAAGACATACTTAAGAAGTGGCTTTTCAACTTGAATTTATCAGCCGACACTCTTAATCCGCATGATGTGGTTTGCAGCAAACACTTCGATAAGAGCTGCATTAAGAATGGCATTATGCATGAAAAGGCCATACCCACGAAATATTTGGAAATGTCGCCCAAAGGCtggttttacaaaaacaatGAGGATCTGTATGAAATATCCAGGAAATGCTGTGTCCTGAGTTGCCAGCAGGCTGCTGAAGAAGCAAAACATTTATATAGATTTCCCAAGCACAAAGAGGATTTGGATAAATGGATATATAATTTAAAGCTGCAAGTAGACGAGGCGGATGTTAAGGATTTGCGAGTATGTGATCGACATTTTGAGCAAAGTTGCAAAATTTCCCACAAGGATTTGATAACTCAGGCCTTGCCTACCCTCAATCTGGGCCATAACGACAACGACATCTATGgcaataactttattaaatGCTGCCTGGATAACTGTAATATAGAGGGattttattatcacaaattgCCCGAAGATTTAATGCTGCAAAGCTTTTGGTTCCAGGAACTAGAAATGGAAACCACTTACAACAGTTCTTTGTATATATGCTCCGTGCATTTTGTTACTTTCTTCGAAAGAATATTGGAAAAATACAGCGCTTTTCTCAAAGAGTCGAAGGAATATGTAAAACTTTCTGTAACCTATAACGAGCTTAAAGCTCTACCTGCCTTACAATCTTACAAATGCCACATAAGCAAATGCACTTCTGGATTTAAACTAATCTGGAAgctatttaaatttcccaaagaTGTTACTTTGTTCAATAAATGGCTGCATAATACCAGTTTACAATTTGACTATGACCAACGTCTTTCTTATCGCATCTGCTCGCAACATTTCGAGGAAAGATGTGTAAGTAACAAAGAGCTACGTCGCTGGTCCCTGCCCACCCTAAAGTTGCCTTTCAACAATAGTCTATACGTCAATCCACCCGAAGCTTTGCCCTCCAATCATGAAAATCTAAGGCATTGCTGTGTGTCCAACTGTCCTACCAATAAGGGCCCCTTTTACAAATTCCCTATTAACAAATTGGAGGCCAGAAAATGGATACATAATTTAGATTTGGGCAACCAACAGTGTACCCTGAACTTACGTGTTTGCTTTAAACATTTCGAAAACTATTGCTTCTCCAAGGCTGCGAATAAAGTCAAGCCTTTGAAATGGTGGTCCATACCAACTCTTAGGCTAAAGAGAAAAACCGATCTTTATCTCAATCCGGCAGACAAGATAGCCTTCTACGTTTGCTGCATCCCAAGTTGTCAACAAATTCTCAATAAAGCCAAGGATATTTATCTGTTCAAGTTTCCCGCCAGTAATaccttaaaacaaaaatggttgcaCAATTTGAATATTGCCAAACAAGATTACAAGGAAACCATGAGAATTTGCACGGCCCACtttgaaatgaattgttttcACAAGGACTGCGGACTGCTACGCAAACATTCTGTTCCCACTTTGGCACTATTCGCCCCACCTAATGATCTCTATAGGAATCCAGTCAGAAGGGCTTATTTCAAATGTTGTGTCAAATTGTGCAAAGCGCCCTGGGAACAATTGCTAAATTTCCCCAAGAATAAAACACTATTGCGAAAATGGTGTCATAACTTGCAGCTGGACAAGGACATAAAATTGGAAACCTTGAGAGATTGGAAAATATGCAAGCGACATTTTGAGCAacaatgtataaataaaatcggTACGCTGAGAAGTACGGCAGTGCCTACGCTTAAATTAGGACATcgtaagaaattgtttttaaattctgatCTTGTTTTGAAAACGAATATTAAAATGGAACAGGAAAAGCTAAGTAATGgggaaaaagaaaagcaaattgaAGACCAGGAAAACGTTACGGTGGTGGCATGTGATAGAGAACAAACAGCAGAGGACACTATTTCAAAAGAAGATGGTCCAAACATTATAAATAAGGAAGCGGAAATTGCAACGGCATTACCTACTAAAGCTAAAGGCTTGAAAAAGCCTGtgaaaattttaaggaaaactCCAAAACTCTTTATAAGAAAACCGCAAAAGTCAAAGATGGCTGTTAAAAAGAAtccaaaaattgttaaagataAAGAACGATCATTAAAGCCAAGCACAAGTGAAGAGAAAAAAGAGAGTTTAACACCAGCTACTCCACGCGCTAAAGAGAAAGATATGCAGgaagaaaacaaaacacaatCGATAAAACATGATGCGCTCCCAACGGCCACTAACTCTCTGACCAGTGAAGTTAGTAAATCTCAAAATTCCGAGACTGAAGGCAGCGGACAAAAGATTTTAGAAACTAATCTGCCAGAAGATGTTTATCTAGAGAATTTGTTGGAAATATTAACTGAGAGCATGCCGGAGAATGATGACATGAAGGAAACtccttcaatatttaaaaaagaacctACTGAGCCGCCAACATCTCTGAAGCAAGAAAACATGAATGATTCCCCTAGTGATACGGAAGAGGTCTGCACGTATAAAATTTACGAAATAAAGCAGGAAGTAGAGGAGCAACCAATGGAAGAATACCTAGAAGAACGAGCGGTCGAAGAAGTGGCCGACaaggaaaatgaaaaaattataaattttaaaaacaaaaatcttatgTTTTGCCGTGTAAAAAGCTGTCCTAACAGTGGCAGCTACAAGCCGGACGTGACGATTTTCAAACTGCCCGGTATACGCAAATTGCGCGATCAATGGATGGCAAATTGTAAACTAAATCAACGGCAATATTCGGCAAATGGCAGATTAAAGAAACTTAGAGTTTGTATAGAACACTTTGACAAACAGTGTTTTAAAGACAATAATCGTCTACTGTTCGGGGCGGTGCCGACACTGCATCTCGGCAGTAGTCTAGACTGTGAAGAATCTCTAGCGGAATACACCTATTTCAGATGCCGTATAAACAGTTGCCAACGATCTACTCAGCATGATAAAATTAATCGTATACCATTTCCTGAAGgagaattgaaaacaaaatggTGTTTGGCTTTGAATATGAAAGAGGAAACTATAACGAAAGATTCTTGGATATGTCATAAACATTTCGAAAGAAAATCTTTAATAGATTGTAGAAAACCCAAACCGGGTATCCTGCCCACTCTACTACTGGATATGGCAGATGACAGTGTAACCACGGCAGTAAGATCTGAAGAATGTAACACGTCTCCAGTACAAGGTGAGAGTAATACTCGAACTGCTGGTAGGCAGAAAAATCCCAAAGAAATCAAGCCTAAATGTTCATTTccattttgtaaagaaaagaaCCAGACTTTACACGATTGGCCCGATAAagttatttttgctaaaatatggCAGGTGGTCAAGAAACTAGGCAGACAAGCCGAAGATATCAAGGCTTGGAAGAAAACGTTTAATAAAGAAACCGGAATAAATGAAGAGCCAAATCAAACTGTGGAACAAAGTTCcgaaaagaatattaaattgtgcgatgaacatttttattatctatacaaaacaaataatgaaGCCATCAATGAATATGAGACCTCTGAAGAAGACCATGACTTAAAGCGAAATGTTCAAAATATATACGACTGCTTGTATTCCTTAGACAAGTTTTCGGCAAAACAATGTGCTGTGCCGCAATGTAAAACCGATCAAAATATCAAATCCTGTAAATCGGTAAAGTTATTTGCTTTCCCCCACAAAGAAATGGCCCAAAAATGGTGCCACAATATCGGCATAGAATATAGCCACCTTAAAGCAAAGCCCTTTCTTAAGGTATGTGAAATTCATTTCGAAGATTATTGCCTGCAAAGAAGAAACCTGTTGGACTGGGCGCTGCCAACTTTAAATTTACCCTCTACTAAAGATCCTCAggatattaaacaaaatgaCGCTCATAAGGTATTTAGTGTGAGAAGAAAATGTTGCATCAAAACTTGTCCTTCTGCGCAGAACCTGCAGGATCTGCACACAAACACcaatgtaaaattatataaattcccCAAAGATCCAGTGCTGCTTAAGATATGGCTACAAAATACCAACTGTGACAAGACATTTGATGAAAATGTAACACGTGTATGTGCGTTACATTTCCATGATTCAGATCTAAAGGAAAATAACGAGTTAAAAGAACAGGCTATTCCTAAATATTATTTAGATCCCAGCAATCCAAACCTTTTGTATCCCTCACTGAACAGTTCCAATATAGATGAGCATATACAAGTTAAGCAGGAATTGGACAATAGCGAAGAGTGGGGTGTTCCTTCGCAGCTTGAAGATGTTACATTCAAAACTCCAACAATCGAAAATGTGTTTGAGCATAAACTGAAAGAAAGCAGCAAAGAACAAGATTACAATCAATTGATAGAAATAAAACAGGAAATTATAGAAATTCAAGAAGAACAGCCACAACCCTCCTTGTATAGCCCACCCGATTTTAAATACTCTAATTACACTAATtcgaataacaataataataataataatcaactAGCATCTTTTGTTATAAGCGATGTCAAGTCGCAAATATATTTGTGTTGTGTGCAAAAATGTACCAACAATTCCCAAACACCTGGCATTCGTATTTATACTGAGTTTCCCCACGATtcggaaatttttattaaatggtgTTTTAACCTAAAAATTGATCCCCGTAACTATAAAGAAAATCAATATGCCATATGTGATCAACATTTTGAGCCAGTATGCTTTAGCGAAAATGGTCTGCTACAAAATTGGTCCGTACCGACGTTAAATCTTAATCTAAGTGAACTTTCTTTTATACACCAAAACGATATACCTGAACATTTGAAACCCTCCAACGACCAGTGTATTGTATATGGCTGTATAAATCCCTTAAGGCCGCTTTTTAAATTTCCCCATAATCCGGATATTTCACTGAAATGGTTTGCAAATCTAAAACTAGACTATACCGACTTTCGTGCTCAGAATTATCGTATTTGTAGAAGGCATTTCCCACCCATATGTTTCGACATAAatgatattaataaattaacagGCGAGGCTATACCTACGCAGTTCCTGGGTCACACCGATAAAATTAGCCATTTTAATAGTGTGGAAGAACAGCAATTACAACAGGATGGCGGCCTTAGGTATCAGGATAATAGTCGTGGCAGCAGTCAAGGATCCTTAGTAAGATTAATATCGCCACATGATCTAGAAGATCATGATAGTAGTTATTTTGAAGATTTTGAAGAATATTACGGACAAGATGAATAA
Protein Sequence
MSQNNQRKHYHIHAPYQHPQQQQQQQAQQHHHGHHITPSQQQQQHQQWYSQQHYQHGLHLRDSRHIQHPQHHPHHHHTQQQQQQQPHHNHTMSTHMFTSGYVGMTPSGGVITSGAGGSSSGSGVGVTGTVHNTATMGSAASHNMPASSSSAHHYSSVLPASGASVGGNAANANSGGAYVSRSRIFDLEMLATQPQQQQSQHSTASATHSHSMLTSGRAGFDAYSHSSLYAQQQNQRHILASGSSHHHHATTHHSTSNALHSHHHTQQLHHPHQQPQSALHHHQHHQHPQQQQQQQQHYYHHSSLHRPHTQLMPPALQHIKSEPVEQIAPTPSIQTEEVIIKSEPVDDIGYHYKSAPQFENKSFLIAEKRKQDELQQLRLQQRQHQKQQELQQLRHQQQQQQQHQEHQRVKHQQQLQEQQLHQHQLQKIKQEHYHQPQSEHTDNEEVSQQTQQHTNSENSSILQRAAADEKQEEQHQPQISLTNIKTEAKPLNFPRRKLQTERSSTLPICQRCKQVFLKRQNYSQHVALSTCNIVEYDFKCSVCPMSFMSNEELQTHEQLHRSNRYFCQKYCGKFYETIDECEEHEYGQHEYEMFKCNICCISVTHRDQLLEHLTDHKYQPRFDCCICRLCFQTSIELHEHYLANEDFCGKFYDKEAFKKPNTSSSSAYLGKPESSNLEIANTFSLKDIPPGNSHHLAGLYPKPASSKTSMEPPNTPTTTSSSPFITVNEFAALEPHIEVKTEIKVEPDFYPPMDQSDFAAYDNDYGTPDYTSSSNQNFSFLHDYQDNASSSTNSSYSFNNNDAIQDEAAICCVPKCGTRKQSSPSLQFFSFPREEKYLSQWLHNLKMVYDPNVNYSIYRICSLHFPKRCIAKYSLSYWAVPTFNLGHDDVGNLYQNRESSGGFPGGEMAKCSMPGCPSQRGETNVKFHVFPRDLKTLIKWCQNSRLPVHSKDNRFFCSKHFEEKCFGKFRLKPWAIPTINLGTIYGKIHDNPNIYQEEKKCFLPFCRRSRSYDCNLSLYRFPRDETLLRRWCYNLRLDPNMYRGKNHKICSSHFIKEALGLRKLNPGAVPTLNLGHNDRFNIYENELYTPPPPPPPPPQPSTSSKAHKYAQLFKQEKEGSSGSHIYDGVFMNSMVQKFSSASSNSSNNLDLGDVCLVPSCKRTRHCDDITLHTVPKRAEQLKKWCHNLKMNLVKMHKSARICSAHFEKYCIGGCMRPFAVPTLELGHDDTNIFRNPDVIKKLNIRETCCVQSCKRNRDRDHANLHRFPTHPELLQKWCENLQKPIPDGTKLFNDAVCEVHFEDRCLRNKRLEKWAIPTTNLGWEGAPHCLPSEEDINENWVKPFAPNNGDEQGECCVSSCKRNPQIDDVKLYRPPEDAEQLVKWAHNLQVDVMDLPNLKICNLHFEQHCIGKRLLNWAMPTLNLGGKVEHLFENPPPMPAIYKKKIKPDRLLSSQEGIKWSPRCCLPHCRKMRSVDKIHLFRFPYSNRQTLAKWCHNLQLPLVGSSHRRICSSHFESSVLTKRCPMTLAVPTLDLNAPPGYKIYQNPARLKQIKIGPQRHCVIESCRKTKLDDVTLFRFPNNRSILYKWRHNIKNWPKGKLSSQMRICSQHFEPHSVGVKKLSPGAIPTLKLGHDAKDLYPNETRSFFDLEKCVVSGCDSRKDMEDVRLFRFPRDDEELLKKWCNNLKMNTNDCVGIKICSKHFELECVGPRQLYKWSIPTLKLGHKEDDLVDIIPNPPPEQRTGEFLFKCCVPSCGKTRKYDDAQMNSFPKNLKLFRKWKHNLKLDFLNFKEREKYKICNDHFEMVCLGKTRLNFGALPTLNLGHDELDDLYQINPDRIRPNLFIRQKDVEKLERRRILKEENSEQYDGEEQDDDPLGLEPSDVKCCVTECTAPKSIMREPYDLPETKEIRQMWLKEFEKNEDEVLLPEAKICGLHFQLIFKKLESQMLEMLEESENLKSDFNKLQYNYQKSTISLVINSYQCRVQDCPTNLLNSSIRLYFFPYGKQLVNKWSQNTGIIPDENRRYMNKVCALHFESYCITENQRLRSWAIPTINLPSFEEERRLYKNPDLTKIDRRMLGPQILKCAVNNCNYLKLVDDESLKLFNFPTDDKLLKKWCDNLKMSHHFTPLLKICSLHFEKICFGSCRIRSWAIPTLNLGHSTAPEHLNKTTIRQEVYDAPEDISEIQLKQVKIKKSLDSTKCYIPSCRKSRLKHGVRFHSLPTNLKMKRKWLHNLQIRHLKASQKVHNIKICNLHFHKRCLEGKMLKPWAVPTRHLGHSEAIFDNPRKVGAFSSLRCVLTHCKNHAAPRPVRAFMFPKLPEFLEKWSKNLKLELEKCKGKICHEHFEEEVLGLKKLKSGAVPTLNLGHDDKDIYDNTDLIEKFKLKEVEKELSKDSCQVKITEDDDFEEEFEPRSEGEEEEEDEEEIWESEEGEEEEEDDEQIHYDDDAEEEDEDEEELDADKLEKQQEDDEISVSNSISDWSSVKFKELRVSITPLTSEDLMDLCSRSSYEREFGSLTPANNLRGRRSATPASSNWKDLRSETPDQKSNRSDTPDKKAFHYFREPRSVSPEQKLDNFLEPKSSERVSDSPKDPLGENLEEDLSSKTPNQIEALVFANVTKSELDLTSKCLKRQKPHKLLESFKKERLEMSEDEATNTCSPNEMDTTNLRTDKALNAVAPMCCLKHCSREKTPEQHLTTYGFPKDPQLLQKWCDNLGLQPEQCIGRVCIDHFELRVIGARRLRVGAVPTLNLGPHRIAKHTNMEDTAQKKSVTKEFSETGNMQEADSSLKPPPPYKTPKPSKQSVFRLCCLKHCRRKKVSNLEKKDKQLTKERMDCQEKSQEAFFKFPTEPNILKKWYKNLRLPEKLSITTELEICSKHFESSVIKNGKLHPKAVPTLQLSYANRSPIYTNNQQDFNGFDLQVKQKSKRKCFLSHCGNKISDHIFLLSFPENEPLTLRKWFKNLKINLKREEYKNLKICSAHFEPYVFFKSIYLRAGAVPTINLGHNERFIRNCQKLRLKRENVHTIQEKCCITECTATNLKLYSFPRSSELRKIWCNNLQIELRQALNSHSKLCAHHFTPDSFIVGTENLKLNAVPVLNLGLKHDNHVLITTNPAESKCIVENCQRTPSVDKVKLYKFPLKQDILKKWLFNLNLSADTLNPHDVVCSKHFDKSCIKNGIMHEKAIPTKYLEMSPKGWFYKNNEDLYEISRKCCVLSCQQAAEEAKHLYRFPKHKEDLDKWIYNLKLQVDEADVKDLRVCDRHFEQSCKISHKDLITQALPTLNLGHNDNDIYGNNFIKCCLDNCNIEGFYYHKLPEDLMLQSFWFQELEMETTYNSSLYICSVHFVTFFERILEKYSAFLKESKEYVKLSVTYNELKALPALQSYKCHISKCTSGFKLIWKLFKFPKDVTLFNKWLHNTSLQFDYDQRLSYRICSQHFEERCVSNKELRRWSLPTLKLPFNNSLYVNPPEALPSNHENLRHCCVSNCPTNKGPFYKFPINKLEARKWIHNLDLGNQQCTLNLRVCFKHFENYCFSKAANKVKPLKWWSIPTLRLKRKTDLYLNPADKIAFYVCCIPSCQQILNKAKDIYLFKFPASNTLKQKWLHNLNIAKQDYKETMRICTAHFEMNCFHKDCGLLRKHSVPTLALFAPPNDLYRNPVRRAYFKCCVKLCKAPWEQLLNFPKNKTLLRKWCHNLQLDKDIKLETLRDWKICKRHFEQQCINKIGTLRSTAVPTLKLGHRKKLFLNSDLVLKTNIKMEQEKLSNGEKEKQIEDQENVTVVACDREQTAEDTISKEDGPNIINKEAEIATALPTKAKGLKKPVKILRKTPKLFIRKPQKSKMAVKKNPKIVKDKERSLKPSTSEEKKESLTPATPRAKEKDMQEENKTQSIKHDALPTATNSLTSEVSKSQNSETEGSGQKILETNLPEDVYLENLLEILTESMPENDDMKETPSIFKKEPTEPPTSLKQENMNDSPSDTEEVCTYKIYEIKQEVEEQPMEEYLEERAVEEVADKENEKIINFKNKNLMFCRVKSCPNSGSYKPDVTIFKLPGIRKLRDQWMANCKLNQRQYSANGRLKKLRVCIEHFDKQCFKDNNRLLFGAVPTLHLGSSLDCEESLAEYTYFRCRINSCQRSTQHDKINRIPFPEGELKTKWCLALNMKEETITKDSWICHKHFERKSLIDCRKPKPGILPTLLLDMADDSVTTAVRSEECNTSPVQGESNTRTAGRQKNPKEIKPKCSFPFCKEKNQTLHDWPDKVIFAKIWQVVKKLGRQAEDIKAWKKTFNKETGINEEPNQTVEQSSEKNIKLCDEHFYYLYKTNNEAINEYETSEEDHDLKRNVQNIYDCLYSLDKFSAKQCAVPQCKTDQNIKSCKSVKLFAFPHKEMAQKWCHNIGIEYSHLKAKPFLKVCEIHFEDYCLQRRNLLDWALPTLNLPSTKDPQDIKQNDAHKVFSVRRKCCIKTCPSAQNLQDLHTNTNVKLYKFPKDPVLLKIWLQNTNCDKTFDENVTRVCALHFHDSDLKENNELKEQAIPKYYLDPSNPNLLYPSLNSSNIDEHIQVKQELDNSEEWGVPSQLEDVTFKTPTIENVFEHKLKESSKEQDYNQLIEIKQEIIEIQEEQPQPSLYSPPDFKYSNYTNSNNNNNNNNQLASFVISDVKSQIYLCCVQKCTNNSQTPGIRIYTEFPHDSEIFIKWCFNLKIDPRNYKENQYAICDQHFEPVCFSENGLLQNWSVPTLNLNLSELSFIHQNDIPEHLKPSNDQCIVYGCINPLRPLFKFPHNPDISLKWFANLKLDYTDFRAQNYRICRRHFPPICFDINDINKLTGEAIPTQFLGHTDKISHFNSVEEQQLQQDGGLRYQDNSRGSSQGSLVRLISPHDLEDHDSSYFEDFEEYYGQDE

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01138050;
90% Identity
-
80% Identity
-