Basic Information

Gene Symbol
-
Assembly
GCA_956483585.1
Location
OY101444.1:20753987-20780592[+]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 33 9.4e-15 1.1e-11 46.5 1.6 1 86 806 878 806 879 0.84
2 33 8.2e-15 9.8e-12 46.7 4.9 1 87 906 975 906 975 0.80
3 33 4.3e-15 5.1e-12 47.6 0.3 1 87 996 1068 996 1068 0.83
4 33 4.6e-14 5.5e-11 44.3 3.2 1 86 1150 1218 1150 1219 0.79
5 33 2.5e-15 3e-12 48.3 6.1 1 87 1243 1315 1243 1315 0.81
6 33 9.2e-12 1.1e-08 36.9 0.9 1 87 1350 1418 1350 1418 0.80
7 33 2.6e-11 3e-08 35.5 2.3 1 85 1459 1527 1459 1534 0.72
8 33 8.1e-15 9.6e-12 46.7 0.5 1 86 1556 1625 1556 1626 0.81
9 33 7.1e-14 8.5e-11 43.6 1.0 1 86 1648 1717 1648 1718 0.79
10 33 3.6e-13 4.3e-10 41.4 2.6 1 86 1746 1817 1746 1818 0.86
11 33 7.3e-08 8.7e-05 24.4 0.3 1 62 1885 1942 1885 1966 0.77
12 33 1.9e-10 2.2e-07 32.7 0.5 1 87 1982 2054 1982 2054 0.80
13 33 1.3e-12 1.6e-09 39.6 2.5 1 87 2086 2157 2086 2157 0.80
14 33 2.2e-13 2.7e-10 42.0 4.6 1 86 2202 2274 2202 2275 0.84
15 33 2.3e-13 2.7e-10 42.0 0.6 1 86 2297 2364 2297 2365 0.80
16 33 8.7e-14 1e-10 43.4 0.2 1 87 2685 2754 2685 2754 0.80
17 33 2.5e-11 3e-08 35.5 1.2 1 86 2812 2901 2812 2902 0.70
18 33 5.5e-13 6.6e-10 40.8 0.8 1 86 2932 3002 2932 3003 0.81
19 33 1.5e-12 1.8e-09 39.4 1.0 1 87 3031 3100 3031 3100 0.82
20 33 9.1e-13 1.1e-09 40.1 2.8 1 87 3120 3190 3120 3190 0.81
21 33 3.1e-13 3.7e-10 41.6 1.5 1 87 3213 3284 3213 3284 0.83
22 33 1.3e-05 0.016 17.1 0.3 1 60 3300 3348 3300 3373 0.76
23 33 1e-10 1.2e-07 33.5 7.0 1 86 3388 3457 3388 3458 0.81
24 33 4.4e-11 5.2e-08 34.7 3.6 1 86 3483 3553 3483 3554 0.81
25 33 1.6e-12 1.9e-09 39.3 1.4 1 86 3574 3646 3574 3647 0.77
26 33 1e-11 1.2e-08 36.8 2.2 1 86 3667 3736 3667 3737 0.80
27 33 1.2e-13 1.4e-10 42.9 0.9 1 87 4041 4118 4041 4118 0.82
28 33 8.4e-07 0.001 21.0 0.9 1 86 4137 4206 4137 4207 0.71
29 33 0.59 7e+02 2.2 0.1 24 58 4285 4335 4259 4361 0.55
30 33 1.1e-12 1.3e-09 39.8 0.4 1 87 4380 4456 4380 4456 0.84
31 33 1.1e-13 1.4e-10 43.0 1.7 1 85 4483 4555 4483 4557 0.81
32 33 3.8e-12 4.6e-09 38.1 3.2 1 87 4689 4762 4689 4762 0.79
33 33 9.2e-12 1.1e-08 36.9 0.7 1 87 4787 4857 4787 4857 0.79

Sequence Information

Coding Sequence
atgtCACAAAATAATCAACGAAAACATTATCACATTCATGCTCCCTATCAACACCCCCAACAGCAGCAGCCACagcaacaacaaccacaacaaatTCAACAACATCATCATAGCCACCATTTAACGCCGtcgcagcagcagcaacatcaaCAATGGTATTCGCAGCAACATTATCAACATGGTCTTCATATGAGAGATTCGCGCCATATTCAACATCCCCAACATCATCCCCATCATCATATTGCTCAACAGCAGCAACCACATCACAATCATACAATGTCACCACACATGTTTACAAGTGGTTATGTCGGTATCACAGCGAGTGGGGGTGTTGCAAGTGGTGGTGCCGGTGTTAGCAGTGCCGGAAGTGGGGTTGGTGTAACTAGTTCAGCACATAATTCGGCAGCAATGGCTGCTACATCGCACAACATGCCGGCTTCTTCGTCCTCTGCACATCATTATTCTTCCGCTATGCCGGCTTCTGGTGCGAGTGTTGGTGGTAATGCGGCTAATGCTAATAGTGGTGGTGCTTATGCTGGCCGTAATAGAATCTTTGACCTTGAAATGTTAACAACACAACCACAACAACATTCACAGCATAGTACAGCATCAGCTACACCTTCACATTCTATGTTATCGAGTGGCAGTTCGAGTGGAAGAGCAGGCTTTGATGCATATTCACATAGCTCCTTATATGCACAACAAAATCAACGGCATCTTTTAGCTCCTGCTTCCTCCCACCATCATCTGGCTACTACACATCATTCTACAGCTAACGCCTTGCATCCACATCATCATACCCAGCAACTACATCATCCACACCAGCAGCCACAGTCTGCTCTGCATCATCATCAGCTTCAACATCACCAACAGACACAACATTATTATCACCATACTCAACAAACTTCTCTGCAACGGTCACATACACAAGTCATGCCACCAATGCTGCAGCATGTTAAATCTGAGCCAGTAGAACAAATGCCCACAACACCGTCAATACAAACCGAGGAAGTCATCATTAAATCTGAGCCAGTCGATGAAATTAGTTATCATCACAAAAGTGCGCCACAATTTGAAAACAAACCTTTTCACATCGAAGAAAAACGTAAACAACATGaacagcatcaacaacaacagcagcagcagcaacaacaggaGCAGCATCAGCGTGACCAACATCAACGTgcacaacaccaacaacaattaCATGAGCAACAATTTCATCAACAtcaactacaacaaataaaacaagaacaTTATTATCATCCTCAGAATGAAAATACTAATAATGAAGATGTCTCCCAACAAACGCAACAACGTACAAATTCCGAGAATTCTTCTATAATGCaaccagcagcagcagcagtgaCGGCAGCAGAtgaaaagcaacaacaacaagaacaacaacagcaaccgCAAATATCCttgacaaatataaaaacagaagcaaagcctCTTAACTTTCCTCGTCGCAAATTACAAACAGAACGTTCCTCAACTCTGCCTATATGCCAACGatgtaaacaagtttttttaaaacgtCAAAACTATTCACAACATGTTGCTCTATCCACTTGCAATATTGTGGAGTACGACTTTAAGTGCTCCGTATGTCCCATGTCCTTTATGTCTAATGAGGAACTGCAGATACACGAACAACTCCATCGTTCAAATAggtatttttgccaaaaatattgTGGCAAATTCTACGAAACAATTGATGAATGCGAACAACATGAATATGGCCAGCATGAATatgaaatgtttaaatgcaatATTTGTTGTATAAGTGTTACGCAACGTGATCAATTATTGGAACATCTTAATGACCATAAATATCAGCCACGTTTCGATTGCTGCATCTGCCGTTTATGTTTTCAAACTTCAAGCGAACTACATGATCATTATATGGCTAATGAAGATTTCTGTggtaaattttatgacaaagaagCTTTTAAAAAACCTAATACTTCGTCGTCTGCTGCTTATTTGGGAAAGCCGGAAAGTTCCAATCTGGAAATAGCCAACTCATTTTCGTTAAAAGatataccTCCTGGCAATAGTCATCAGTTAGAAGGTTTGTACCCCAAGCCCTCCAGCTCAAAAACTTCTATGGAACCACCTAGCACACCAACTACAGCGTCGTCTTCATCATTTAGCACGGTAAATGAGTTTGCCGTCTTAGAGCCCCAAATAGAGGTAAAAACGGAAATTAAAGTAGAGCCTGATTTTTATCCCCCGATGGATCAATCTGATTTTGCTAGCTTTGACAGTGATTACGGCACACCCGACTATACATCTAGTTCTAATCAGAGTTTTTCATTTCTACATGATTATCAAGATAATGCTTCCAGTTCTACCAATTCATCGTTTTCCTTGAACAATAACGATGCCATACAAGATGATGCTGCCATTTGTTGTGTTCCTAAATGTGGTGTACGCAAATTTTCATCGCCATCCTTACAATTCTTTGGCTTTCCCAAAGAAGAAAAGTATTTATCGCAATGGCTGCACAATTTGAAAATGGTATACAATCCTAATGTAAATTATTCTATGTATCGTATTTGTAGTTTACACTTTCCTAAACGTTGTATAGCCAAATATTCATTAAGTTACTGGGCTGTACCCACCTTTAATTTAGGTCACGATGATGTTGGGAACTTATATCAAAATAGGGAAAGTTCTGGGGGGTTTCCAGGTGGTGAAATGGCTAAATGCAGTATGCCTGGTTGCCCTTCACAGCGTGGAGAAACCAATGTAAAATTTCATGTATTTCCACGGGACTTAAAGACCTTAATTAAATGGTGTCAGAACTCACGACTACCGGTACATAGTAAAGATAATCGCTTTTTCTGCTCCAAACACTTTGAAGAGAAATGCTTTGGCAAGTTTCGCTTGAAACCTTGGGCCATACCCACACTTAATTTAGGCACGGTCTATGGCAAGATACATGATAATCCCAATATCTATCAAGAggagaaaaaatgttttttgcccTTCTGCCGACGTAGTAGATCGTACGATTGCAATTTGTCTTTATATAGATTTCCAAGAGACGAAACTTTGTTGCGCCGTTGGTGTTATAACTTAAGATTAGATCCCAATATGTACAGAggcaaaaatcataaaatttgctCTTCTCATTTTATTAAAGAGGCATTAGGTTTAAGAAAGCTCAACCCAGGAGCAGTGCCCactttgaatttgggtcataatgATAGAtttaatatatatgaaaatgaaCTATATACACCACCGCCACCACCTCCACCACCACCACAACCTTCTACGTCGTCAAAGGCCCAAAAATATGCCCAACTATTTAAACAAGAAAGGGACAGTTCTTCCGGTTCACGTATCTATGATGGTGTATTCATAAACTCCATGGTGAAAAAATTCTCTTCCGGTTCTTCGAACAGCTCTAATAATCAGGATTTGGGAGATGTCTGCCTTGTGCCATCTTGCAAAAGAACTCGTAATTCCGATGACATTACTCTGCACACCGTGCCCAAACGCCCGGAACAGCTTAAGAAATGGTGtcacaatttaaaaatgaatttagttAAAATGCACAAAAGTGCCAGAGTTTGTAGTGCTCATTTCGAAAAGTACTGCATAGGCGGCTGTATGAGACCTTTTGCCGTACCCACCTTAGAGTTGGGGCATGATGATCCAAATATATTCCGCAATCCCGATGTCATAAAGAAATTGAATATTAGAGAAACTTGTTGCGTACAATCCTGTAAAAGAAACCGTGATCGCGATCATGCCAATTTGCATAGATTTCCCACTCATCCGGAATTGATGCAGAAGTGGTGTGAGAATTTACAAAAACGCATACCGGATGGCACTAAACTTTTCAATGACGCAGTTTGTGAGGTACATTTCGAAGATAGATGTTTGCGCAATAAGCGTTTGGAAAAATGGGCCATACCCACGTTGAATTTGGGCTGGAATGGGGCTCCCCACAGTTTGCCCTCAGAAGAAGAGATCAACGAGAACTGGGTAAAACCTTTTGCACCTAACAATGGAGATGAACAGGGCGAATGCTGTGTGGTCAGCTGCAAACGCAATCCTCAAATCGATGATGTCAAATTATACAGGCCTCCGGAAGATGCAGAACAGTTAGTTAAATGGGCGCATAACTTGCAAGTAGATGTTACGGACTTGcccaatttgaaaatttgtaatttacacTTTGAACAACATTGCATAGGCAAGCGTTTGCTGAATTGGGCTATGCCTACTTTAAATTTGGGCGGAAAAGTAGAGCATCTATTTGAAAATCCTCCACCAATGCCCGCCATTTACAAGAAGAAAATCAAACCTGATAGAATTTTAAACAGTCAAGATGGCATCAAGTGGTCACCAAGGTGCTGTCTGCCGCATTGTCGTAAAATGCGTTCCGCAGACAAGATTCATCTTTTCCGTTTTCCCTACAACAACCGCCAGACTTTGGCGAAATGGTGTCACAATTTGCAATTACCTTTGGTGGGCAGCTCACATCGTCGTATCTGTTCCAGTCATTTCGAGGCATCTGTTTTAACTAAACGTTGTCCCATGACATTGGCAGTGCCCACACTAGACCTGAATGCTCCTCCCGGCTATAAAATCTATCAAAACCCAGCAAggctaaaacaaataaaaataggcGCTCAAAGACACTGTGTAATCGAGTCTTGTCGCAAAACTAAATTAGATGAAGTTATACTATTTCGTTTCCCCAACAATAGATCCATTTTGTATAAATGGCgtcataatattaaaaattggcCCAAGGGCAAATTAAGTTCTCAGATGAGAATTTGTTCCGAACATTTTGAGCCTCATTCAGTTGGCGTAAAGAAATTATCACCCGGCGCCATACCCACTTTGAAGTTGGGACACGAAGCCAAGGATTTGTATCCCAATGAAATAAGATCCTTCTTTGATTTGGAAAAATGTGTAGTTAGTGGCTGTGACTCCCGCAAGGAAATGGAGGACATTAAACTTTTCCGTTTTCCGCGAGATGATGATGAATTGCTTAAGAAATGGTGCAACAATCTCAAAATGAATGCCAATGACTGTGTGGGCATTAAAATAtgcagcaagcattttgaattAGAATGTATAGGTCCCAGGCAGCTATACAAATGGTCGATACCCACTTTAAAATTGGGTCACAAAGAAGACGATATGGTGGAAATAATAGCCAACCCTCCGCCCGAGCAAAGAACCGGAGAATTTCTTTTCAAGTGTTGTGTACCTTCATGTGGCAAAACACGCAAATACGATGATGCACAAATGAACAGTTTTCCCAAACACTTGAAATTGTTCCGCAAATGGACACATAATCTAAAGTTAGATTTTCTTAACTTCaaagaaagagaaaaatataaaatttgcaacGATCATTTCGAGCCAGTTTGTGTGGGAAAGACCCGACTCAATTTCGGTGCTCTGCCCACTTTGAAGTTGGGGCATGACGAGCTAGATGATTTATATCAAATTAATCCGGATAGAATAAGACCAAATTTGTTTATCAAACAAAAAGACGTGGAAAGATTAGAAAGGAGAAGGATATTGAGAGAAGAAAATGCCGAACAATATACTGGCGAAGAGCAGGACGATGATGTGGGTGATCCCTTGGGATTAGAGCCAAGTGACATAAAATGCTGCGTTACAGAATGCACTGCCCCTAAATCAATAATGAGGGAGCCCTATGATTTACCAGAAACTAAACAAATCCGACAGCTGTGGTTAAAAGAATTCGAAAAAACTGATGAAGAAGATTTGCCAACAGAATCTAAAATTTGTGGCTTACACTTccaatcaatatttaaaaaattaaaacaccaaATGCTAGAGATAATAGACGAAAATGATGACTTAAAATCAGATTTCAATAAACTACAATACAACCTTCAAAAGTCCAACATATCTCTGGTTATAAGTAGTTATCAGTGCAGGGTTGAAGATTGCCCTACCAACCTACTTAATTCTTCTATAAgactatatttttttccatatggCAAACAACTGGTAAGCAAATGGTCTCACAACACAGGCATAATACCCGATGAACATCGCAGATACATGAACAAGGTATGTGCCTTGCATTTCGAGTCGTTTTGTATAACAGAAAATCAAAGACTGCGATCATGGGCCATACCTACACTCAACTTACCAGCTGGCGAGGAGAAAGAAAAGCATTTATATAAGAATCCTGATCTAACTAAAATTGACCGAAGAATATTGGGACCTCAAATTTTGAAATGTGCCGTCAACAATTGCAGCTATCCCAAACTGGTAGATGACGAATCCCTCAAACTATTTAACTTTCCCACGGATGACAAGTTGTTGAGAAAATGGTGTGATAACTTGAAAATGTCTCACCATTTTACACCCTTGCTTAAAATCTGTTCTttgcattttgaaaaaatatgctttGGCAGCTGTCGCATACGTTCTTGGGCCATACCCACCTTAAATTTGGGTCATAGCGATGCTCCCGAACATCTAAATAAAACAACTATAAGACAAGAGGTTTATGATGTACCCGAAGATGTGTCTGAAATACAATTGAAACAAGTTAAAATCAAAAAGTCACTAGATAGTACGAAATGTTATATACCCAGCTGTCGCAAAAGCCGATTAAAACATGGAGTGCGTTTTTACAATCTGCCCTCAAATTTAAAGATGAAACGCAAATGGCTGCACAATTTGCAAATCAGGCATTTGAAGTCCAATCAAAAAAtgcataatattaaaatttgcaacttGCACTTTCACAAAAGATGTTTGGAGGGTAAACTTTTGAAACCTTGGGCAGTGCCCACAAGGCATTTGGGTCACAGCGAATCCGTTTATGATAATCCCCGAAAAGTAAGGGCATTGCCGCCATTACGCTGTGCTCTCTCACACTGTAAAAATCATGCAGGATCGAGGGCAGTACGTACTTTTGTATTTCCCAAATCGCCAGAGTTCTTAGAGAAATGGGCGAAAAACTTGAAATTGGAATTGGAAAagtgcaaaggaaaaatatgtCATGAACACTTCGATAAGGAAATTGTGGGTATGAAAAAGTTGCAAAGTGGTGCGGTACCTACTCTCGATTTAGGCCATAGCGATAAGGTTATGTATGATAATACAGAATTAATGgagaaacttaaattaaaacaaattgaaaaagagTTAAACAGAGATTCGTGCAAAATGAATATAATAGAACAAGACGATTTGGATGAGGAATATGAGCCGCACTCAGAGGAGGAAGAGGAGGAGATATGGGAGTATGAAGAATGCGAGGATGAGGAAGACGAGGAGGAGGAAGATGATGAACAAATATGTTATGATGATGAAGATGAAGAAGAGGAGGAGGAGGAGAAAAGGCATGATGAAGATAAGGAGGAGACTCCACAAGATGACGATGAAATCAGCATAACGAATTCAACATCCGACTGGAGTTCTGTTAAGTTTAAGGAACTTAGAGTCTCCATAACTCCCTTGACACCGGAAGATTTAATGGATTTATGTTCACGTTCTTCCTATGAAAGAGAATTTGGGGCTTTAACACCGGCCAACAATTTAAGGGGCCGCAGATCTGTTACACCAGCTTCAAGCTGGAAAGATTCTCGCTCAGAAACTTCTGATCAAAAGTCTAACAGTTTCAACTTTAACTCTAACAGATCAGAAACACCCGATAAAAAAGCATCTAATTATTTTAGAGAACCTCGCTCCGTCTCACCTGaacaaaaaccaaatattaGAACTGCTGATGAAAAATGTAACAGTCCGAAAGATCCGCTTGGTGAAAACCTGGAGGATTTTTGTACCAAAACCCCAAACCAGATAGAAGCACTTGTTTTCAAAGAGGAAACAACGTCTGAATGTGATTTAACTGTTAACAAATTGAAAAGGAGAACTTCTCAAATACCCAACGAAAGTTTCAAAAGGGAATGTTTGGAATTCTCGGAATATGAAATTGTTAATACCATGTTGCCAAATGAAATAGAGCTTACTGGCACTACCAACCTAAGAACAGATAAAGCTCTCAATGCGGTGGCACCCATTTGCTGTTTGAAACACTGTGGCAAGGAAAAGACGCCGGAACAGCATCTAACCACTTACGGGTTTCCCAAAGATCCTCAACTTTTACAAAAATGGTGCGATAACTTGGGCTTACAACCCGAAGAGTGTATTGGACGTGTCTGCATAGACCATTTTGAACTAAGAGTTATCGGCACGCGACGACTCAGATTAGGAGCTGTGCCAACTCTGAACTTAGCTCCAAATCAAGTTGCCAAGCACACTAACATGGAGGATACTCCACAAAAGAAAAGTGTAACCAAGGAGTTCTCCGAAACAGCGAATATGCAAGAGGCAGACTCAAGCTTAGAGCCACCGCCACCTTATAAAACACCCAAACCCAGTAAGCAATCGGTTTTTCGGCTATGTTGCCTCAAACATTGTCGACGCAAGAAACTCTTGAACCTGGACAAGGTAGACAACCAACCGCTGATGGAAAGAATGGTTTGCCAGGAAGAACCCCAGGAAATCTTGTTTAAATTTCCCACTGagcaaaatatgttaaataaatggTATAAAAACTTAAGATTGCCGGAAAATCTAACCGTAACACAGGACTTGCAAATATGCTCCCAACACTTTCAATCGAATGTTATTGAAAATGGCAAATTGCATCCCGAAGCCGTACCCACTTTACAACTAAGTTATGCTAATCTGCCACCTATTTATACAAACTATCAACTTCTAGGCTACAAATCGGAGATGAAGGAAAAGCCCATCCAAAAGTGTTGCCTTCCTCATTGCGGCAATAAAATGTCGGAACATATACACCTGTTCGCGTTCCCTGAAAATCAACCCTGGCTTCTGAGGAAATGGtgtcaaaacttaaaactaaatctCTTACCGGGTCAATATAAAAGTTTGTATATATGCAATGTGCACTTTGAGCCGTATGTGTTCTTTAGAAAAAGATTACGTTCGGGTGCTTTGCCAACACTTGATTTGGGACATACGGATGCAATTATTCGAAATTGTCGCAAATTGCGTTTGCAAACTGAAAATATTAGTACCATTAAGGAGAAATGTTGTATAGCCGATTGCGAGACAACTAAccttaaactttattcatttccCCGTAGCTCCGAGTTAAGGAAAATTTGGTGCAACAATTTGCAAATTGAACCACGCCAGGCTCTCAACAATCATAGTAAACTATGTGCGCACCATTTTACGGTAGATAGTTTCATAGTGGGCACCGACAATCTCAAACTAAATGCTGTACCTGTATTAAACTTGGGAATAAAAAATGAAAGCCATTTATTGATGACAACAAATCCAGCTGAAAGCAAATGTATAGTGGAGAACTGTCAAAAAACACCCAGTGTCGATAAAGTGAAGCTGTTCAATTTTCCCCAAAAGCAGGAGATACTTAAGAAGTGGCTTTTTAACTTGAATTTAGCAGCCGATAACCTTCGAAAGGATGATGTGGTCTGCAGTAAACATTTCGATAAATGTTGCATTAAGAATGgtattttacatgaaaaagccATACCCACCCAGTTTCTAGAATTTTCGCCGAAAGGATGGTTTTACAAAAACAACGAGGATTTATatgaaataccaaaaaaatgctGTGCCCTCAGTTGTCAACAAACTTCGGAAGATGCCAAACATCTGTATAGATTTCCTAAGCACAAAGAGGATTTGGACAAATGGgtgtacaatttaaaattacaagtgGACGAGTCAGATGTTAAGGATTTAAGGGTATGTGATAGACATTTCGAGCCGAGTTGTAAAATTTCCAACAAGGACTTGCTAACCCAGGCCTTGCCCACCCTTAATCTGGGTCATGACGATGCCGACATCTATggcaataactttattaaatgcTGTTTAGATAACTGTTCCATAGAGGGCTTTTACTATCATAAATTGCCCGAGGATTTAATGCTGCagagtttttggtttcaggaACTGGAAATGGAGACAACCTACAACAATTCTTTGTATATATGTTCCGTTCATTTTGTAACATTCTTCGAAAGAACATTGGAAAAGTACAGTGCTTTTCTGAAAGAGTCCAAGGAGTATGTAAAACTATCTGTAACTTATAATGAGATTAAAGCTCTACCTGCCTTGCAATCTTACAAATGTCATATAAGCAAATGTACTTCTGGTTTTAAACTGAtctggaaattatttaaatttccaaaagaTGTTAAATTGTTCAATAAGTGGATGCATAATACGAGTTTACAATTTGAATATGAGCAACGCCATTGTTATCGCATTTGCTCGCAACATTTTGAGGAAAGAtgtttaagtgaaaaaaaattacaccgCTGGTCTCTGCCCACTCTCAAGTTGCCTTTCAACAACAGTTTATATGTCAATCCCCCCGAAGCTTTGCCCTCCAATCACGAAAACCTGAGGCACTGTTGTGTCTCTAATTGCACTACCCTAAAAGGACCATTTTACAAGTTCCCCGTCAGGCAGGTGGAGGTAAAGAAATGGATACATAATTTAGATTTGGGCAACCAACAATGTACGCTTAACTTGCGCGTGTGCTATAAGCATTTCGAGAACTATTGCTTTTCCAAGGCTGTTAACAAAGTTAAACCGTTGATATCGTGGTCAGTGCCAACTCTTAGATTGAAACGAAAAGTTGCTCTTTTCCTCAATCCAGCAGACAAGATTGCCTTCCATGTTTGCTGCATCGAAAGCTGTAGAAAAATTCTCAATAAATCCAAAGGGATCTATCTGTTTAAATTTCCCTTCAGTAACACCTTCAAACAAAGATGGCTGCACAATTTAAACATTGGCCAACAGGATTATAAGGAAACAATGAGAGTTTGTTCGGCTCACTTTGAAATGGAGTGCTTTTACAAGGGCTTTAAATTAATGCGCAAAGATTCGGTACCCACCTTGGCACTATTCAAACCGCCTCCTGATCTCTATACAAATCCTGTGCGTAGGGCTTATTTTAAATGTTGTGTTAAATTGTGTAAAGCACCCTGGGAACAACTTTTAAGTTTCCCTAAGGATAAGATACTTTTGAGAAAGTGGTCTCATAATTTACAGTTggacaaagaaataaaattagaaactcTGAGGGATTGGAAAATATGTAGCCGGCATTTTGAACAACAATGCATAAATTCAAATGGGACAATAAGAAGTGTGGCGGTACCTACTCTTAAACTGGGACACcgcaagaaattgtttctaaatcCTGATTTCGCTTTGAAATCGAAccttaaaagtaaacaaaaaaagttacatGATGAGTCCAGTGCAAAGATTGACGAACATGAAGTAACAAATACTTTGGAAACTAATATAGAACCAGAGATACTGGATGATATTTCGCTAGAAGAtcagaatataaatataaagacGCAAACGGAAATTAAAACCAAATTATCTTCTAAAGTTAAAAGCTTAAAGCCTCGGAAACGTTTAAGGAAACGTAAATGTCTGTTTGGAAAAAAACGGAAGGCTAAAATTGTGGCTAAGAAACTTCTTAACGAAAACGAACAAAACGTTATTAAAGAGAAAGAAAACTCTGCAACGTCACAGCAATTTACAATAGAAGAGAAAAAAGAAAGAGCAAACGAACAAAAATCTCTAACAGAGTTGCACACACATGAGAACTTGGACGAAACCACAAACTCCTTATTTACTGAAGTTGTTAATCCAAACATTTCGGACACTGTAGTGTACGAACAAAAGGTCGAGGAAACTATTAACCTGCAAGAAGATGCTTATCTGGAGAACTTGTTGGAAATCTTAACAGAGAGTTTGCCGGAAAATGACGAACTAAAAGAAACTCCACAACTGTTCAAACAAGAACCCACTGATTCCGATACACTTTTGCAGGTTGCTGGAGAACCGCAGCATGTAGACTTGAAGTATTTTCCAAATGATAGTGACGAAATTGCCACATTTCAAATTACCGAAATAAAACAGGAAGTAGACCAGATGCCAATCGAAGAAGAGTTCCTAGAAGAACGAGCCGACTATGAACCCAGCCAACATAGTGAAGCCGAAGTATCAAAGGAGGAAAAAGCTTTAAGTTTTAAAAGCAAACATCTCAAAAATCTTATCTCTTGCTGTATAAAAACCTGTCGCAATTATCTCAATTACAAACCGGACCTACTCCTTTTCAAGCTACCCGTTGTACGCAAACTGCGTACTCATTGGCTAGAAAATTGTAAACTCAATCAGCGCCAATATTCGGCAAATGGCGTGTTGAAAAAACTTAGAATTTGTGCTGAACACTTTGACAAAAACTGCATTAAAGATGACAGCCGTCTACTGCTGGGCGCAGTGCCGACATTACACCTCGGAAGCAATCTAGACTATAAAGAAAGTTTAATCAAATTTACCTATTTGAGATGCAAGATACAAAGTTGCCAGCGATCTGTCCAACATGATAAGATCCATCGTATACCATTTCCCGAAGGAGAAGAGAAAAGAAACTGGTGTTTGAAAATGAACATTAAGGAAGGAACTGTTACTCCAGATGATTGGATATGTCATAGACATTTCGAAAGAAAATCTATAATAGATGGCCGAAAACCCAAGCCGGGTATGTTGCCCACTTTACTATTGAATAGTCTGgatgaaaaatctttaattcGAAAATCCCAGCCGCATACGCTGGCTACTTTAGTACGCGATAGTGTGACTGTGGCTGAGAATATTAACGATTTAGCTGTGCCAGATTGTAGACCGAAAAATCACAAAGTaatcaaaactaaatgtttATTTCCCTTTTGTAAGGACAACAAGGGACAAGTTTTATACGATTGGcctgataaatttattttcggcAAAATATGGCTGATGGCAAACAAGTTAGGACGACATGCCGAAGATGCTAGTTCCTGGAAGAGAATGTTTGAACAAACTTTAATAAATGAGCAGCCGCCAATTGAAAGTTCGGCGGGCGGAGATTCggaaaaacacattaaattgtGTGATGaacacttttattatttatacaaaacaaacaatGAAGCCATAAACGGCTACGAAGCCTTCGAAGAATACCAGGACTTAAAACACAATGTTCAAGTTACCTTTGACTTCTTaaattctttagaaaaaatctatacaaaaaaatgtgctGTGCCACAATGTAAAACAGATCAAAATATCAAAGACGCCGCagtaaaatcaataaaactttttgacttCCCGAGAAAAGAAATAGCTAAGAAATGGTGCGATAATATTGGCCTAGAATACAGCATCCTCGAAAGAAAGCCATTTATCAAGGTTTGTGAGATTCATTTTGAGGATTATTGTTTACTAAGAAGAAACCTTCTCGATTGGGCTTTACCAACTTTACATTTACCGCTTATGAAAGATGCTCAGGATATTAAGCAAAATGATGCTGTCCAGGTAATTGCCATAAGGGACAAGAGCAAGTGTTGCATTGAAACATGTCCTTCTGTTCGGGACATGGACTCGAATAGCAATTTAAGCTTATACAAATTTCCCAAAGACCCTGTACTGCTTCAGAAATGGCTACAGAATACCAACTGTGAAAAGACATTTGATGCAAATTTAACACGTATATGTGCGTTACATTTTCATGCATCCGATATACTCGATGAGAGCAAATTACACGAACAAGCTATTCCAAAATGTTATTTAGATTTAAGCAATTCAAACTTTTCATCATACCCATCCTGTCTAAATAGTTCGTTTATAGATGAACATATACAAGTTAAACAGGAATTGGATAACAGTGAAGAATGGTGTGTAACCTCCCAGCAGGAAAGTGTTGCACGTACAACTCCAACAAACGAAATGGTATTTGAActtaaattgaaagaaaacaaTAGAGACGCTGATTACAATCAATTGacagaaataaaacaagaaatcaTAGAAGTTCAAGAAGAAACCTCCTTATTTACCATACACAAATTTGAAACCTCCAGTCCACCCGAATTTAAATACCCCTACACCAATTcgaataatagtaataataatcaGCCAGCAGCTTTTGTTATAAGCgatgtcaaatcgcagctctaTTTTTGTTGTGTGCAAAAATGTACCAACAATTCGGCAACACCCGGCATACGTATATTCAATAAGTTTCCCCACGATTcggaaattttcattaaatggtgttttaatttaaaaattgacccTCGCAACTATAAGGAAAACCAATATGCCATTTGTGAACAGCATTTTGAACCTATTTGCTTTACGGGAAATGGCCTACTGCAAAACTGGTCGGTACCAACATTGAATcttaatttaaatgaacaatCTTTTATACATCAAAACGATATACCTGAACATTTGAAACCCTCCAGCGAACAGTGTATTGTATATGGTTGTATAAATCCGTTGAAAccactttttaaatttccccATAATCCTGATATTTCACTCAAATGGTTTTCAAATCTAAAACTAGACTTTACTGACTTTCGAGCCCAGAATTATCGCATTTGTAGGCGACATTTTTCCCCCATATGCTTCGGAATAAACGATTCTAATAAATTGACTAGCGAAGCTGTGCCGACGCAATTTCTTGGTCACACCGATAAAATATGCCATTTTAATAGTGTCGAAGAGCAGCAACTGCAAGCGGATGGTGGGGTTAATAATCAGGATAATAGTCGGGGCAGCAGTCAGGGATCCTTAGTAAGAATAATATCTCCACATAATATAGAAGATCATGATAGtagttattttgaagattttgaagAATATTACGGACAAGATGAATAA
Protein Sequence
MSQNNQRKHYHIHAPYQHPQQQQPQQQQPQQIQQHHHSHHLTPSQQQQHQQWYSQQHYQHGLHMRDSRHIQHPQHHPHHHIAQQQQPHHNHTMSPHMFTSGYVGITASGGVASGGAGVSSAGSGVGVTSSAHNSAAMAATSHNMPASSSSAHHYSSAMPASGASVGGNAANANSGGAYAGRNRIFDLEMLTTQPQQHSQHSTASATPSHSMLSSGSSSGRAGFDAYSHSSLYAQQNQRHLLAPASSHHHLATTHHSTANALHPHHHTQQLHHPHQQPQSALHHHQLQHHQQTQHYYHHTQQTSLQRSHTQVMPPMLQHVKSEPVEQMPTTPSIQTEEVIIKSEPVDEISYHHKSAPQFENKPFHIEEKRKQHEQHQQQQQQQQQQEQHQRDQHQRAQHQQQLHEQQFHQHQLQQIKQEHYYHPQNENTNNEDVSQQTQQRTNSENSSIMQPAAAAVTAADEKQQQQEQQQQPQISLTNIKTEAKPLNFPRRKLQTERSSTLPICQRCKQVFLKRQNYSQHVALSTCNIVEYDFKCSVCPMSFMSNEELQIHEQLHRSNRYFCQKYCGKFYETIDECEQHEYGQHEYEMFKCNICCISVTQRDQLLEHLNDHKYQPRFDCCICRLCFQTSSELHDHYMANEDFCGKFYDKEAFKKPNTSSSAAYLGKPESSNLEIANSFSLKDIPPGNSHQLEGLYPKPSSSKTSMEPPSTPTTASSSSFSTVNEFAVLEPQIEVKTEIKVEPDFYPPMDQSDFASFDSDYGTPDYTSSSNQSFSFLHDYQDNASSSTNSSFSLNNNDAIQDDAAICCVPKCGVRKFSSPSLQFFGFPKEEKYLSQWLHNLKMVYNPNVNYSMYRICSLHFPKRCIAKYSLSYWAVPTFNLGHDDVGNLYQNRESSGGFPGGEMAKCSMPGCPSQRGETNVKFHVFPRDLKTLIKWCQNSRLPVHSKDNRFFCSKHFEEKCFGKFRLKPWAIPTLNLGTVYGKIHDNPNIYQEEKKCFLPFCRRSRSYDCNLSLYRFPRDETLLRRWCYNLRLDPNMYRGKNHKICSSHFIKEALGLRKLNPGAVPTLNLGHNDRFNIYENELYTPPPPPPPPPQPSTSSKAQKYAQLFKQERDSSSGSRIYDGVFINSMVKKFSSGSSNSSNNQDLGDVCLVPSCKRTRNSDDITLHTVPKRPEQLKKWCHNLKMNLVKMHKSARVCSAHFEKYCIGGCMRPFAVPTLELGHDDPNIFRNPDVIKKLNIRETCCVQSCKRNRDRDHANLHRFPTHPELMQKWCENLQKRIPDGTKLFNDAVCEVHFEDRCLRNKRLEKWAIPTLNLGWNGAPHSLPSEEEINENWVKPFAPNNGDEQGECCVVSCKRNPQIDDVKLYRPPEDAEQLVKWAHNLQVDVTDLPNLKICNLHFEQHCIGKRLLNWAMPTLNLGGKVEHLFENPPPMPAIYKKKIKPDRILNSQDGIKWSPRCCLPHCRKMRSADKIHLFRFPYNNRQTLAKWCHNLQLPLVGSSHRRICSSHFEASVLTKRCPMTLAVPTLDLNAPPGYKIYQNPARLKQIKIGAQRHCVIESCRKTKLDEVILFRFPNNRSILYKWRHNIKNWPKGKLSSQMRICSEHFEPHSVGVKKLSPGAIPTLKLGHEAKDLYPNEIRSFFDLEKCVVSGCDSRKEMEDIKLFRFPRDDDELLKKWCNNLKMNANDCVGIKICSKHFELECIGPRQLYKWSIPTLKLGHKEDDMVEIIANPPPEQRTGEFLFKCCVPSCGKTRKYDDAQMNSFPKHLKLFRKWTHNLKLDFLNFKEREKYKICNDHFEPVCVGKTRLNFGALPTLKLGHDELDDLYQINPDRIRPNLFIKQKDVERLERRRILREENAEQYTGEEQDDDVGDPLGLEPSDIKCCVTECTAPKSIMREPYDLPETKQIRQLWLKEFEKTDEEDLPTESKICGLHFQSIFKKLKHQMLEIIDENDDLKSDFNKLQYNLQKSNISLVISSYQCRVEDCPTNLLNSSIRLYFFPYGKQLVSKWSHNTGIIPDEHRRYMNKVCALHFESFCITENQRLRSWAIPTLNLPAGEEKEKHLYKNPDLTKIDRRILGPQILKCAVNNCSYPKLVDDESLKLFNFPTDDKLLRKWCDNLKMSHHFTPLLKICSLHFEKICFGSCRIRSWAIPTLNLGHSDAPEHLNKTTIRQEVYDVPEDVSEIQLKQVKIKKSLDSTKCYIPSCRKSRLKHGVRFYNLPSNLKMKRKWLHNLQIRHLKSNQKMHNIKICNLHFHKRCLEGKLLKPWAVPTRHLGHSESVYDNPRKVRALPPLRCALSHCKNHAGSRAVRTFVFPKSPEFLEKWAKNLKLELEKCKGKICHEHFDKEIVGMKKLQSGAVPTLDLGHSDKVMYDNTELMEKLKLKQIEKELNRDSCKMNIIEQDDLDEEYEPHSEEEEEEIWEYEECEDEEDEEEEDDEQICYDDEDEEEEEEEKRHDEDKEETPQDDDEISITNSTSDWSSVKFKELRVSITPLTPEDLMDLCSRSSYEREFGALTPANNLRGRRSVTPASSWKDSRSETSDQKSNSFNFNSNRSETPDKKASNYFREPRSVSPEQKPNIRTADEKCNSPKDPLGENLEDFCTKTPNQIEALVFKEETTSECDLTVNKLKRRTSQIPNESFKRECLEFSEYEIVNTMLPNEIELTGTTNLRTDKALNAVAPICCLKHCGKEKTPEQHLTTYGFPKDPQLLQKWCDNLGLQPEECIGRVCIDHFELRVIGTRRLRLGAVPTLNLAPNQVAKHTNMEDTPQKKSVTKEFSETANMQEADSSLEPPPPYKTPKPSKQSVFRLCCLKHCRRKKLLNLDKVDNQPLMERMVCQEEPQEILFKFPTEQNMLNKWYKNLRLPENLTVTQDLQICSQHFQSNVIENGKLHPEAVPTLQLSYANLPPIYTNYQLLGYKSEMKEKPIQKCCLPHCGNKMSEHIHLFAFPENQPWLLRKWCQNLKLNLLPGQYKSLYICNVHFEPYVFFRKRLRSGALPTLDLGHTDAIIRNCRKLRLQTENISTIKEKCCIADCETTNLKLYSFPRSSELRKIWCNNLQIEPRQALNNHSKLCAHHFTVDSFIVGTDNLKLNAVPVLNLGIKNESHLLMTTNPAESKCIVENCQKTPSVDKVKLFNFPQKQEILKKWLFNLNLAADNLRKDDVVCSKHFDKCCIKNGILHEKAIPTQFLEFSPKGWFYKNNEDLYEIPKKCCALSCQQTSEDAKHLYRFPKHKEDLDKWVYNLKLQVDESDVKDLRVCDRHFEPSCKISNKDLLTQALPTLNLGHDDADIYGNNFIKCCLDNCSIEGFYYHKLPEDLMLQSFWFQELEMETTYNNSLYICSVHFVTFFERTLEKYSAFLKESKEYVKLSVTYNEIKALPALQSYKCHISKCTSGFKLIWKLFKFPKDVKLFNKWMHNTSLQFEYEQRHCYRICSQHFEERCLSEKKLHRWSLPTLKLPFNNSLYVNPPEALPSNHENLRHCCVSNCTTLKGPFYKFPVRQVEVKKWIHNLDLGNQQCTLNLRVCYKHFENYCFSKAVNKVKPLISWSVPTLRLKRKVALFLNPADKIAFHVCCIESCRKILNKSKGIYLFKFPFSNTFKQRWLHNLNIGQQDYKETMRVCSAHFEMECFYKGFKLMRKDSVPTLALFKPPPDLYTNPVRRAYFKCCVKLCKAPWEQLLSFPKDKILLRKWSHNLQLDKEIKLETLRDWKICSRHFEQQCINSNGTIRSVAVPTLKLGHRKKLFLNPDFALKSNLKSKQKKLHDESSAKIDEHEVTNTLETNIEPEILDDISLEDQNINIKTQTEIKTKLSSKVKSLKPRKRLRKRKCLFGKKRKAKIVAKKLLNENEQNVIKEKENSATSQQFTIEEKKERANEQKSLTELHTHENLDETTNSLFTEVVNPNISDTVVYEQKVEETINLQEDAYLENLLEILTESLPENDELKETPQLFKQEPTDSDTLLQVAGEPQHVDLKYFPNDSDEIATFQITEIKQEVDQMPIEEEFLEERADYEPSQHSEAEVSKEEKALSFKSKHLKNLISCCIKTCRNYLNYKPDLLLFKLPVVRKLRTHWLENCKLNQRQYSANGVLKKLRICAEHFDKNCIKDDSRLLLGAVPTLHLGSNLDYKESLIKFTYLRCKIQSCQRSVQHDKIHRIPFPEGEEKRNWCLKMNIKEGTVTPDDWICHRHFERKSIIDGRKPKPGMLPTLLLNSLDEKSLIRKSQPHTLATLVRDSVTVAENINDLAVPDCRPKNHKVIKTKCLFPFCKDNKGQVLYDWPDKFIFGKIWLMANKLGRHAEDASSWKRMFEQTLINEQPPIESSAGGDSEKHIKLCDEHFYYLYKTNNEAINGYEAFEEYQDLKHNVQVTFDFLNSLEKIYTKKCAVPQCKTDQNIKDAAVKSIKLFDFPRKEIAKKWCDNIGLEYSILERKPFIKVCEIHFEDYCLLRRNLLDWALPTLHLPLMKDAQDIKQNDAVQVIAIRDKSKCCIETCPSVRDMDSNSNLSLYKFPKDPVLLQKWLQNTNCEKTFDANLTRICALHFHASDILDESKLHEQAIPKCYLDLSNSNFSSYPSCLNSSFIDEHIQVKQELDNSEEWCVTSQQESVARTTPTNEMVFELKLKENNRDADYNQLTEIKQEIIEVQEETSLFTIHKFETSSPPEFKYPYTNSNNSNNNQPAAFVISDVKSQLYFCCVQKCTNNSATPGIRIFNKFPHDSEIFIKWCFNLKIDPRNYKENQYAICEQHFEPICFTGNGLLQNWSVPTLNLNLNEQSFIHQNDIPEHLKPSSEQCIVYGCINPLKPLFKFPHNPDISLKWFSNLKLDFTDFRAQNYRICRRHFSPICFGINDSNKLTSEAVPTQFLGHTDKICHFNSVEEQQLQADGGVNNQDNSRGSSQGSLVRIISPHNIEDHDSSYFEDFEEYYGQDE

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00741912;
90% Identity
-
80% Identity
-