Basic Information

Gene Symbol
-
Assembly
GCA_035046325.1
Location
JAWNOC010000075.1:5852189-5867069[-]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 28 4.6e-15 8.8e-12 45.6 4.3 1 86 615 687 615 688 0.85
2 28 2.9e-15 5.4e-12 46.2 4.6 1 87 715 784 715 784 0.83
3 28 6.2e-16 1.2e-12 48.4 0.4 1 87 806 878 806 878 0.84
4 28 6.2e-16 1.2e-12 48.4 5.2 1 86 979 1048 979 1049 0.82
5 28 6.1e-15 1.2e-11 45.2 3.2 1 86 1073 1144 1073 1145 0.82
6 28 1.6e-12 3e-09 37.4 1.0 1 87 1180 1248 1180 1248 0.81
7 28 9.8e-11 1.9e-07 31.7 1.4 1 86 1293 1362 1293 1363 0.76
8 28 6.1e-16 1.2e-12 48.4 0.1 1 86 1390 1459 1390 1460 0.83
9 28 6.4e-13 1.2e-09 38.7 0.9 1 87 1481 1551 1481 1551 0.80
10 28 8.5e-15 1.6e-11 44.7 2.1 1 86 1578 1649 1578 1650 0.85
11 28 3.3e-14 6.3e-11 42.8 2.6 1 85 1726 1794 1726 1796 0.82
12 28 3.3e-12 6.2e-09 36.4 0.1 1 86 1819 1887 1819 1888 0.81
13 28 7.6e-14 1.4e-10 41.7 1.2 1 87 2030 2099 2030 2099 0.80
14 28 5.2e-12 9.8e-09 35.8 0.1 1 86 2167 2233 2167 2248 0.80
15 28 0.0065 12 6.6 0.1 1 58 2258 2308 2258 2322 0.74
16 28 1.6e-11 3.1e-08 34.2 0.6 1 87 2347 2417 2347 2417 0.83
17 28 9.7e-15 1.8e-11 44.5 1.8 1 86 2501 2570 2501 2571 0.83
18 28 2.3e-12 4.4e-09 36.9 0.7 1 86 2606 2677 2606 2678 0.81
19 28 2.9e-13 5.5e-10 39.8 0.4 1 87 2688 2760 2688 2760 0.81
20 28 1e-14 2e-11 44.4 0.0 1 86 2789 2862 2789 2863 0.78
21 28 0.00018 0.34 11.6 0.0 1 58 2896 2946 2896 2970 0.80
22 28 2e-14 3.7e-11 43.5 0.2 1 86 2985 3057 2985 3058 0.80
23 28 7.4e-14 1.4e-10 41.7 0.4 1 86 3210 3282 3210 3283 0.83
24 28 1.8e-14 3.5e-11 43.6 1.5 1 86 3343 3413 3343 3414 0.81
25 28 2.8e-13 5.3e-10 39.9 5.5 1 86 3519 3589 3519 3590 0.85
26 28 3.4e-13 6.4e-10 39.6 0.0 1 87 3667 3737 3667 3737 0.84
27 28 1.2e-09 2.3e-06 28.2 1.6 1 58 3757 3806 3757 3814 0.83
28 28 6.8e-10 1.3e-06 29.0 1.0 18 87 3823 3881 3809 3881 0.75

Sequence Information

Coding Sequence
AtgtcacaacacaacaatcCCCCCCCGCATCATCTTCACTActaccagcaacagcaacaacaattacaacaccaacatcaccatcatcagcagcagcaacaacatcaccaccaacaacaacaacagctacagcataaacaaatacagcagcaacacaattGGTACTCACATGTTGCTTCCTACCCTCCCCACCATTCGCAGGCCGCCGCAGCCTTTGCGTCGCCCTGcaaagccaacagcaacaacaataacaacaacaacagcattatGAATGCATACGGCTCTGGAGTTGTTGCAAGTGGCACGCAGACACCATATTatggggcagcagcagcagctggtggTGGGGTGGGATATAACCTTGAGGCCAATACTGTTGCCTATGCGCACAACCAGCTGCTGCagtaccaacaacaacaacaacaacagcagcaacaacaacaacaccagcagcatcaacagcaacaacaacaacatcatctgCTCAATCAACGCTCTTCTTATATGTCGCATGGTTCAATGCATAGCTCTTATCCTTATATCAAGAGCGAGCCATTGGAGATGCCCGATGATAGACAAcgacagccacaacaacagcagcagcatcatcaacaacaacaacatcaccagcaacaacaacagcatttcCAAAATCCTATGGCACCGCCGCCAGCTCCCGCCAATCATCGTCATAGTCTCGATGCCAGCGGtgaaatgataataaaatCGGAACCCATTGACGAAAATGCCTACAAATCAAACTATATCGATGATAATACGccatttgttgattttagtAAATATCCCGAATTCGGCGACGATATGCTGAGTCCAAAGGTGGAATTAACCGTCAAGGACGAGGGCTATGGCAGTCAAAAGGTTCCCaACCCGCTTAGCTATCCGCGGCGCAAGCTGCAAACGGATCGCTCAACAGAAAGTCTTCCCATATGCCAGCGTTGCAAGGAGGTGTTCTTTAAAAGGCCGATCTACTTGCGACATGTGGCCGAGAGCAGTTGCAACATACAGGAGTATGACTTCAAGTGCAACCTCTGCACCATGTCTTTCATGACCATCGATGAGCTGCAGAAACACAAGCATCTGCACAGAGCGGACAAGTTCTTTTGCCACAAATACTGTGGCAAGTACTTTGATACAATTGCCGAATGCGAATCGCATGAGTACATGCAGCACGAATATGAAAACTTTGTATGCAACatATGCTCCATGACGTTTGCCTCGCGGGAACAACTCTATGGTCATTTGCCGCAGCACAAATTCCAGCAGCGTTACGATTGTCCCATTTGTCGTCTGTGGTATCAAACCGCTTTGGAGTTGCACGAGCATCGATTAGCGGCGCCGTACTTCTGTGGCAAGTACTACGTACCCGCTCAATCGGCAGTACATCAGCAAcatccacagcagcaacattctcatcaacagcatcagcaacaggcCAACTACAAACTTCAGGATTGTCACATGGGCACCATTGAAATGCCTTCGCCGCAACACAAAGCTAATACATCATCAGCAAACGCATTgccggcaacagcagcgctcAATTCGCTGTTGCAACAGCGTCAAGCCAATGCCGATAGTGCCGCAATGTTTGCTTCCACACTGAAGAATGAGGCGAATGTGAAGTTGGAGCGAAGCTACAGCAATTCAACGAGCGAATCTGGATACAGTCTACACGATAACAGTAGCTTCAACAATGCCTATGGCAGCGACAACTCGACAATTCATGCGGCAGCCGGaggcggtggtggcggcggctcTGGTGGTGCCATTGGAGGTCCGCAGGCGCACTCCTCCACGCTGGACGACTCGGAGGACGCCCTCTGCTGTGTGCCTTTGTGTGGAGTGCGCAAGAGTACCAGTCCAACGCTGCAGTTCTTTACGTTTCCCAAAGATGAGAAGTATCTGCATCAGTGGCTGCATAATCTCAAGATGTTTCACATTCCGGCATCAAGCTATGCGAGCTTCCGCATCTGCAGCATGCACTTTCCGAAGCGTTGCATCAATCGCTATTCGTTATGCTATTGGGCGGTGCCCACATTCAATCTGGGCCACGATGATGTTGCCAATCTGTATCAGAATCGCGAGCTGACCAACACGTTTACCACCGGCGAGGTGGCACGCTGTAGCATGCCCAACTGCACCAGCCAGCGGGGCGAGAGCAATCTCAAGTTCTACAATTTTCCCAAGGATATCAAGAGTCTGATCAAATGGTGCCAGAATGCCCGTCTGCCCGTCCAAGCCAAGGAGCCGCGTCACTTTTGCAGTCGCCACTTTGAGGAGCGTTGTATTGGCAAGTTCCGGCTGAAACCGTGGGCAGTGCCCACCTTACATTTAGGTGCTCAGTACGGCAAGATCCATGACAATCCCAAGAATCTGTATGTGGAGGAGAAGCGTTGCTGCCTTAACTTTTGTCGTCGCAGTCGCTCGTCCGACTTTAATATGTCGCTTTATCGCTTTCCGCGCGATGAGGTCCTGCTTCGACGCTGGTGCTATAATCTACGACTCGATCCTTCGGTCTATCGCGGCAAGAATCACAAAATATGCAGCGCTCATTTCATCAAAGAGGCTTTGGGACTACGCAAACTCTCGCCGGGTGCTGTTCCCACGCTGCATTTGGGGCACAACGACACGTTCAACATCTACGAGAATGAGTTGTGGCCACCACCGACGCCATCCACGCCCACCAaccaccatcagcagcaactgcagcagcatcagttgcagcaacaccagcagcaacagcaacaacaacaacatcatcacaAATATCAACGTCACTCGGCAGCCTCAACATCTTCTTCAGCCAGCTCATCGCACTATGTAGATGCTGGAGACATGGGTGGATCGTACATGGGCATGGGCAACTCAGGAGGCTCTTCGTCCGGTCTGAATGTGAGCGACAGCATGGACGTGTGCTGTGTGCCCAGCTGCGAGAGCAAGCGGCACAATAATGAGAACATCACATTCCACACGATACCGAGGAGACCTGAGCAGATGCGCAAATGGTGTCACAATCTAAAGATACCCGAGGATAAGATGCACAAAGGAATGCGCATCTGTAGCCTGCACTTTGAACCCTATTGTATTGGCGGTTGCATGCGTCCGTTTGCCGTGCCCACGTTGCAATTGGGCCACGACGATGAGGACATTCATCGCAATCCGGATGTGATCAAGAAGCTGAACATTAGGGAAACCTGCTGTGTGGCTGTCTGCAAGCGTAATCGCGATCGTGATCATGCTAATCTGCATCGCTTCCCCAGCAATGTGGCGTTGCTGACCAAGTGGTGTGCGAATCTGCAGCGTCCGGTGCCGGATGGCAGCAAGCTCTTCAACGATGCCATCTGCGAGGTGCACTTCGAGGATCGCTGTCTGCGCAACAAGCGGCTGGAGAAGTGGGCCGTGCCAACGTTGATACTTGGCCACGAGAACATCGCTTATCCGCTGCCCACGGCAGAACAGGTGGCCGAGTTCTATGCTCGACCCAGTGCACCCAATAATGGCGAGGAGCAGGGCGAGTGCTGTGTGGACACGTGCAAGCGTAATCCCAGTGTCGATGACATCAAACTCTATCGTCCGCCTGAAGAGTCGCAAGTGCTGGCCAAATGGTCACATAATCTGCAGATAGACGCGGCGAAGTTATCCAGCTTGAGGATCTGCAATCTGCACTTTGAAGCGCACTGCATTGGGAAGCGCATGCGTCCATGGGCGATACCCACGCTCAATCTGGCGACAACCATTGACAATCTCTACGAGAATCCCGAGCACCAAATGCTCTATAAGCGACGCACACATCTCAAGACGAAACGTGCCGCTAGTCACGAGGCGGGTGGCGTGAAACCGACATGGGTGCCACGCTGTTGTCTGCCACATTGCCGCAAGGTGCGTGCACTGCATAATGTGCAGCTGTATCGCTTTCCCAAGCTCAATCGCTCCACGTTGGCCAAGTGGGCGCACAATCTGCAGGTGCCGCTGGTGGGCAGTGCCCAACGACGTCTCTGCTCGGCGCACTTTGAGCCTCACGTGCTCAGCAAGAAGTGTCCGGTGCCGCTGGCTGTGCCCACGCTTGATCTGAACTCACCACCTGGCTACAAGATCTATCAGAATCCCGCCAAGCTCAAGGCCAACAAGTTGTGCCTGCAACGCGTCTGCATTGTCGAGAGTTGTCGCCGGCAACGCGGCCAGGGCGTGCAGCTCTTCCGGCTGCCACATAATCCTACACAGCTGCGCAAATGGATGCACAACATACGCATGCGTCCCAGAGGCGCCATGCGGCAACAGTATCGTATGTGCTCAATTCACTTTGAGACGCACTCCTTCAATGGCAAGCGATTGAGTGCAGGCGCCATTCCAACGCTCGAGTTGGGTCACGATGACGACGATATCTATCCGAATGAAGCGCAATCGTTTGTTGAGGAACACTGCACTGTCGAGGGCTGTGGGGCGTCCAAGGAGCAGCCCGATGTGCGTCTCTTCCGTTTCCCCACCGACGACGAAGATTTGCTGTGGAAGTGGTGCAACAATCTCAAGATGAACCCCGTGGATTGCATTGGTGTGCGCATCTGCAACAAGCACTTTGAGCTGGACTGTATCGGGCCCAAGCATCTGTACAAATGGGCGATACCCACACTGCATCTAGGTCATGACGATGAGCAGATCGAGCTGATCGACAATCCCAAGCCCGAAGAACGCTATGTGGATCCCGTGTTCAAGTGCTGCGTACCGACGTGCGGCAAGACGCGCAAGTTCGATGAGGTGCAGATGAACAGCTTTCCCAAGGATCCAAATATGTTTCAGCGCTGGCGACACAATTTGCGACTCGAGCATCTCAATTTCAAGGAGCGCGAACGCTACAAGATTTGCAATGTGCACTTTGAAGACATTTGCATAGGGAAGACGCGGCTCAACATTGGCTCTATACCCACGCTCGAGTTGGGGCACGACGAGACGGTGGATCTGTTCCAAGTGAATCCCGACGAGCTGCAGAGCAATCTCTTTGGACGTCAGCGTCGCGTGAACTCTTCGCTGGGCATGAGCATCAAGCAGGAGGACAACTCGGAGGTGGATGAGGACATTAAGCCCGACTTGAACATGGTGGgggcaaaagacaaaaataccGCACAGGTTAAGATTAAGCGTTCTCTGGCGGATTACAAGTGCTGTGTGCCAGACTGTGGACGCAGTCGCTTGGAGCATGGCGCTCGCCTGTTTCCCTTCCCCaatggcaagcagcagcagagcaagtGGCGCCATAATCTGCGCCTGCAACCTGATGAAGTGGATCGTAGCACACGCGTCTGCAGCGCGCATTTCAATCGTCGCTGCATCGATGGCAAGCAGCTAAGGAGTTGGGCCATGCCTACGCAGCAGCTGGGCCATCAGGAACTGCCCATCTATGAGAATCCAAAGAATATACCGGGCTTCTTTACGCCCACCTGTGCGCTGGCCCATTGTCGCAAGCGTCGCAGCATTGACAATGATCTGCGTACTTATCGCTATCCGCGTAGCGAAGATCTGCTCGAGAAGTGGCGTGTTAATCTGCGATTGGCGCCGGATCAGTGTCGTGGACGCATTTGCGCGGATCACTTTGAGCCCATGGTGCGGggcaagctgaagctgaagacgGGCGCAGTGCCCACGTTGAAGTTGGGTCACAACGAAGGCGTTGTCTTTGACAATGAGGCTATCAAGGCGGGACTGCAGCAAGAGGCGGATGAGGGTGGCGATCAGGAGACCAGCATGGAATCGATGGTCAAAGTGAAGCAGGAGCGACTCGATCCGGAAGAGGAGCAAACTGATGATGTGGACCACGAGCAGCagcacgacgacgacgaagagcAGGCAGATCATGGTTACTTTGATCCTCTAGAGCTGGTCGAGACGTTTGCTGAGCAGCACAGCGCCGAGGATGAGCACGAACtcaatgatgacgatgacgaagatgaagatgaggaCGAACCGGGCGACGATGATGAGCTGCTATTGCCAGACACGCCACCAGTGAAGCGACTTCCGCCTTTGGTGCTGCCGCCGAGACGCGAGAAACCCGTGAACAATGTGACCCCCATCTGCTGCTTAAAACACTGTCGCAAGGAGCGCACGGCCAGCCATCAGCTGAGCACCTTTGGCTTTCCAAAGGATCGGCAACAGTTGCTTAAATGGAGCGCCAATCTACAGCTATCGCTCGACGATTGTGTGGGACGCGTGTGCATCGAACACTTTGAGTCAGAGATGCTGGGCACACGAAAGCTGAAGCAGCATGCAGTACCCACCTTGAATCTGGGTCACGCAACGCCCCTCAGCTACAGTTGCAATGGTCAGGCATTGAGCATCTACGATGCACAGCCGCAACATTCGGTTTTTCGGCTTTGGAGCCTGAAACATTGCCGCAAACGGAAGCATCCAATGGAACCGCCGgatcagcagcagaagcaacggCAGCTGGATCAGAACCCCGCAACGATGATGACTAAGCGACGCTGTTGCCTGCCCAGCTGCGGCAAGCAGCCGGAGGTGCATGGCGTGCAATTGCAGCGGCTGCCCAGCAATCGCATTCAGCTGCGCAAGTGGCTGCACAATCTCAGGCTATCCCCCATGCTGGACAGCAGTCAGGCGCGTCTCTGCAGCGAACACTTTGAGCCGGAGCTGCTGGACCATGTGGAGGATGCGGTGCCCACGCTGCGACTGGGACACGATGACACGCACATCTATCGCAATCGTAACAACATCACGGCAGCCTCTACGTCGAGTGCTTGTTTGGTGGCAAGTTGTCCGTGTGCTCGCCTCAATCTCTATCGCTGTTACGATCTGCCCGAGCATCGTCTGGTGCAACACGCCTGGCTGCAGTGGCtccagctgccgctgccccAACAAGCCAGCGATGGCAAGCTCTGTGTCATGCACTTCATGCAACTCTTCGAGCAGGTGCCGCTGCCGGCGGAGTTGCCAGGTTCGGTGCTCCGTCAATTGCAGGAGACTTATGATCTCATTGCAGGCTCCACGATGGCCATGAAGTTGCGCTGTGCTGTGCCCGGCTGCTACTCGAAATACACGGACAACATCAGGCTCACCAAGCTTCCCATGTGTGCCGGCATGTGCTCCAAGTGGGTGCACAACACCAAGATCAACTATGATGCAACGCGTCACTATGTCTATCGCATCTGCATGCTGCACTTTGAGTCTCGCTGCTTGGGCCCTGTGCGTCCCAAGCTGTGGGCGGTGCCAACGCTGCACTTAAACCACAACGATGCGAATATCTATCAGAACCCAAAGTTGGATGGGCAATTGCCGTCAGCTCCAGCGCCGACTCCAGTGCCCATTGCCATGACAGCCTCGGTGCCCATTGCCATGACATCCTCGGTGCCCATTGCCATAACAGCCTCGGTGCCCGTTGAGTTGCCGTTGCGCATCAAGACGGAGCTGGCCTTCAGTGGCAGTCCCAGCGCCAGTGCAAGTCCCAGTCCGCGTGGCAAGCTACGCTTCTGTTGCATCCCCAGCTGCTTGCAACAGGCTACGTCGCAGACGCGACTCTTTCGCTTTCCCACCGCTGAAACGGCGCTGCTCAAGTGGCTGGTGAATACGCAGCAACAGCCGCGTTTGGTTGATACCCAGCAGCTGTTCATTTGCCAGGATCACTTCGAGGCGGAGGCCATCTGCAAGAAGCAGCTGCGCAGCTGGGCGGTGCCAACATTGAAGCTGGGTCACGACGGCCATGTCATACCGAATGCCAGGCACAATGGCAACATTGCCGACAGCCAGGAGAACAAGCAGACGTTGCAGTACATCTGGGAGAACTATTGTTCCGTCTTGAGCTGCTTTCAGCCACGTAGTGCGGAATTGCGTCTCTACGCTTATCCAACGGATCGTCCCACCATTCGCAAGTGGGCGACCAACTGCAAGCATCGCTCCATGCAGGCCAGCAGCGATGGTTTCCAGGTCTGCCAACTGCACTTTGCACCGCATTGCTTTGACCAGGAGACGGGCGAGTTGAGGGAGGATGCGGTGCCCACGCTGGAACTGAGTCGATGCCTTAACGATGTGCACTGCGTCGTTGCTGGCTGTGTGAAGGACGAGGATGGACCGCGTCAACGCTTCTACAAGATGCCCAAGCGCAGTGCTCAACTGCTTAGCTGGTGTCACAATCTGCGTTTGGATGCGGCAACCATGGGCAGTGGAGAGCATCATGTCTGCGATCGCCACTTCGAAACGCAGTGCATCAATCAGCAAAAACTGCTACGACCCGGCGCACGTCCTACTCTTCACCTGGGCCACGATGAGTCCATTGACTTGATGCCCAATCCAGCGGAGTGGGATGCAACGGATGCTGCGCCTGCTACAGACAATGTCTGCTGTGTGCCCAACTGCGGTCTGGCCaaggatgaggaggaggatgtGCAGCTGTTTGCCTTCCCCAAGCTGCGAACGCTCGCTGAGAAGTGGCTGCAGAATATACGCCTCGAGAACATAAGTCGTGAGCAGCTGATGCGCCTAAGGATTTGCGGCGCACACTTCGATGCTGGCTGCCTGGAGAGCAACGGACGTCCGCAGCTGGGCGCCATGCCCACGCTGCAGCTGGGCCACgaggacaacagcaacattcaTCGCAGCACCGATGCTGCTGCCGTCAAGGCTAAGAAGTTTTGCAATCGAAGTGGCTCCAGTTATGACTGCTGCTATCCACAATGTGTGGAGCTGCAGAAGAGTTACCTGAGGATTAGCTACGATTTGCCACAATCGGAGGCACTGCGTCTCAAGTGGCTGGAGTACATGGGTCTGGAAAAGACGGAAGAGAAGCTCTTAAAGCTATGCCCGCTGCACTTGGTGCTGCTCTACGATCACAGCGTCGAGCACTTTGCAGCAGAACACTCGCCCGAGCCGCAACTGGAGGCCAACTACGAGGACAGTCGCAACAGTGTGCGATTGCGTGTCATCAGCTGCGCGGTGCCTGGCTGCCGTACGCTAAAGCCGCGTGATGGTGGCATACTTCATGGATTGCCGCAGCGCCGCGATGTGCTTGAGATGTGGCTGCACAACATGCAGCTGGTGTTCTATGAGCAGCAGCGTTACATGTACAAGATATGCAGCAAGCACTTTGAGCCCTGCTGTTTTATGGACACCACGCGACGCTTGAAACCGTGGACTATGCCGACGCTGGAGCTGCCGTCTCGTGATGCGGAAGAAGCACCCATTTATCCCAATCCCAGCGAGCAGGAGTGGCAGCGCATGAACGAGCTGCTGGCGGTCGAGCAACTGCAACCGCAGAAAGAGGAGCCGCAGCAGCCAGAAGAGCTATGCAACTTACTTGAGCCAATTGTGAAGATGGAGCACATTGACAGGGACGAGGATGAAGACGAGTTTCACGAGTATCAAGAGCAGCAAGATGAAGAGCTGCAGCCGACGGATAATGTCGATGACAACTCACAGCAACCGCTGGCGCTTGAGGTGCTGCTCGAGGTGGGTCACGTGGAGAAGTGCACCACATATGAGCAAATGGACAACGAGGCGAATCTGAGCTAttccgagcagcagcaacagctgcacgCATATGGAGCAGGAGCTGCTTCAAGTGGCCACTTGGGCACCAATGGCTTCAAGTACACGGCGCGGCATTGCAGCGTACATGGCTGCGATGTGACTGTGAACGATGTGAATGGTAGCATTAAGCTGCACAAGTTTCCCACCTCGCTGGACGCCATGGAGAAGTGGAAGCACAACACCCAGGTGGAAGTGGACCTGAATTACTCGTGGCGTTTTCGCATCTGTAGCTATCACTTTACCGACGAATGTTTCCATGGCGCACGCATCAAACGCGGTGCTATGCCAACTCTTAGTTTGGGGCCGCGACGACCGCCCAAGATCTATGACAATGAGTTTGGCAGCACGCTGCCGCTCTCAGAGCCGGCACAACTGCAGCACAGCGAGGAGAAACAACTATCGAGGCACACAAAGGATAACGAGGTTAATCTTCGGCTGCCGGAGCCGGCGCCGCCGCGGAAGTCAAGCAAATTCTGTCAGGTCGATGGTTGCCCGAATCATTTGACCAGCGAGAACTTGACGCTACACAAGTTTCCACATGATGTGGACATGTGCGCCAAGTGGCAACACAATACACAGGTGCCCTTTGATCCCGACTATCGTTGGCGTTATCGCATCTGCAGCGCCCACTTTGAACCCATTTGCCTGTTGAATATGCGATTGTTACACGGCAGCGTGCCCACATTGAAGCTGGGACCACGTGCGCCGCGGCAGCTTTTCGATAGCGACTTTGAGGCCATTAACATGCGGCTGGACAAGCAGAAGAATAGCAGCGGCGAACAACAATTCCCCATCAAACTAGAGCAGGTCTTTCAGGAAGAGGAGGAAGGTGAGGAAGCGGAGCTGAGCTATTTGGTGCCCGAAATGCAGTTGCACGAGGAGATGGAGCACGCGCAAGGCAGATCGAGGAACTGGGAGGAGCTGCGCTTGCCCAGCATCAAGCAGGAGTCTGAGGAGCAATCACAGACCAGCTACAATCCGGTTAAGTCGGGCTACGACAAGTGTTCTTTGGTGCATTGCCAGCGTCAGCGTTCGCAGCACGGTGTACACATCTACAAGTTTCCACGTtcccggcagcagcagcagcgctggaTGCATAACCTGCGCATCAAGTACGACGAGCGGCGGCCCTGGAAGACGATGATCTGCAGCGTACACTTCGAGCCGAGCTGCATCCGGCTGCGCAAGCTGTGTTCGTGGGCAGTGCCCACTTTGGAGCTGGGCGATAATGTGCCGCTAGAGATCTATACGAATGAGCAGAGTCGCCAGCAACAGGAAGTAGGCAGCGATTGTGAGGATATGCCGCTGGAGGATAGCTACGAGGATGACGATTACGATGATGATTTGGCCCAGCAGATGGCCAATGAACCGTTGGTAAAGCGCGAGCGTCGCTCACGTCTCGATCCCTTACCGCCGGGTCAACTGCCGCCTTGGAAGATCAAGGTGTGCTCCCTGCCCTATTGTCGCAGCCCGCGAGGCGATGGCATCAAGCTGTTTCGGCTGCCCAACAACACCAGTTCCATACGCAAATGGGAGCAGGCGACTGGCATGCGTTTCACCGAAGCCCAGCGCAATACGAAGCTCATCTGCAGTCGGCACTTTGATCCACAACTGATTGGAGTGCGGCGTCTCATGTACAATGCAGTGCCGACGCTCAATCTGGGTCCAAGGAGTGAGGACAGTTCAGCTGTGCTGCTGCCTACTCTTGGACCACGCTGCTGTATGCCCGATTGTCAAGCGGAAGGCAAGGATACCAAGCTGCACAAGTTTCCTAGTGATCCCATGCTGCTGCATCAGTGGTGTCATGCACTGAATCTCTCGGACACTCAACATTATCGTGGCAAGCACATTTGTGCGCAGCATCTGCCCGCCAAGACACCCAATTGTATTGTTTGCGGCATTGAGCAATTACAGTTGCCACTGCTCGACTTCCCAGAGAATCGCAATATGCGTGCCAAGTGGTGCTATAATCTCAAAATCGAACCCATTGCCAAATGGGACCACTCAAGACAGATATGCAGCAAGCACTTTGAAAGCTATTGCTTCACTCAGCCAGGACAACTGCAACCGGAGGCTGCGCCAACGTTGCATTTGCGGCACAACGATAGCAATATATTCCTAAACGATTATGCCATAACAGATGACAGTAAGATGCTGCGCATCAAGGATGAGCCGCTGGACAGCGATGATCTGATGCTgtaa
Protein Sequence
MSQHNNPPPHHLHYYQQQQQQLQHQHHHHQQQQQHHHQQQQQLQHKQIQQQHNWYSHVASYPPHHSQAAAAFASPCKANSNNNNNNNSIMNAYGSGVVASGTQTPYYGAAAAAGGGVGYNLEANTVAYAHNQLLQYQQQQQQQQQQQQHQQHQQQQQQHHLLNQRSSYMSHGSMHSSYPYIKSEPLEMPDDRQRQPQQQQQHHQQQQHHQQQQQHFQNPMAPPPAPANHRHSLDASGEMIIKSEPIDENAYKSNYIDDNTPFVDFSKYPEFGDDMLSPKVELTVKDEGYGSQKVPNPLSYPRRKLQTDRSTESLPICQRCKEVFFKRPIYLRHVAESSCNIQEYDFKCNLCTMSFMTIDELQKHKHLHRADKFFCHKYCGKYFDTIAECESHEYMQHEYENFVCNICSMTFASREQLYGHLPQHKFQQRYDCPICRLWYQTALELHEHRLAAPYFCGKYYVPAQSAVHQQHPQQQHSHQQHQQQANYKLQDCHMGTIEMPSPQHKANTSSANALPATAALNSLLQQRQANADSAAMFASTLKNEANVKLERSYSNSTSESGYSLHDNSSFNNAYGSDNSTIHAAAGGGGGGGSGGAIGGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLHQWLHNLKMFHIPASSYASFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPNCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPSVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPTPSTPTNHHQQQLQQHQLQQHQQQQQQQQHHHKYQRHSAASTSSSASSSHYVDAGDMGGSYMGMGNSGGSSSGLNVSDSMDVCCVPSCESKRHNNENITFHTIPRRPEQMRKWCHNLKIPEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLQLGHDDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVALLTKWCANLQRPVPDGSKLFNDAICEVHFEDRCLRNKRLEKWAVPTLILGHENIAYPLPTAEQVAEFYARPSAPNNGEEQGECCVDTCKRNPSVDDIKLYRPPEESQVLAKWSHNLQIDAAKLSSLRICNLHFEAHCIGKRMRPWAIPTLNLATTIDNLYENPEHQMLYKRRTHLKTKRAASHEAGGVKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPLVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTLDLNSPPGYKIYQNPAKLKANKLCLQRVCIVESCRRQRGQGVQLFRLPHNPTQLRKWMHNIRMRPRGAMRQQYRMCSIHFETHSFNGKRLSAGAIPTLELGHDDDDIYPNEAQSFVEEHCTVEGCGASKEQPDVRLFRFPTDDEDLLWKWCNNLKMNPVDCIGVRICNKHFELDCIGPKHLYKWAIPTLHLGHDDEQIELIDNPKPEERYVDPVFKCCVPTCGKTRKFDEVQMNSFPKDPNMFQRWRHNLRLEHLNFKERERYKICNVHFEDICIGKTRLNIGSIPTLELGHDETVDLFQVNPDELQSNLFGRQRRVNSSLGMSIKQEDNSEVDEDIKPDLNMVGAKDKNTAQVKIKRSLADYKCCVPDCGRSRLEHGARLFPFPNGKQQQSKWRHNLRLQPDEVDRSTRVCSAHFNRRCIDGKQLRSWAMPTQQLGHQELPIYENPKNIPGFFTPTCALAHCRKRRSIDNDLRTYRYPRSEDLLEKWRVNLRLAPDQCRGRICADHFEPMVRGKLKLKTGAVPTLKLGHNEGVVFDNEAIKAGLQQEADEGGDQETSMESMVKVKQERLDPEEEQTDDVDHEQQHDDDEEQADHGYFDPLELVETFAEQHSAEDEHELNDDDDEDEDEDEPGDDDELLLPDTPPVKRLPPLVLPPRREKPVNNVTPICCLKHCRKERTASHQLSTFGFPKDRQQLLKWSANLQLSLDDCVGRVCIEHFESEMLGTRKLKQHAVPTLNLGHATPLSYSCNGQALSIYDAQPQHSVFRLWSLKHCRKRKHPMEPPDQQQKQRQLDQNPATMMTKRRCCLPSCGKQPEVHGVQLQRLPSNRIQLRKWLHNLRLSPMLDSSQARLCSEHFEPELLDHVEDAVPTLRLGHDDTHIYRNRNNITAASTSSACLVASCPCARLNLYRCYDLPEHRLVQHAWLQWLQLPLPQQASDGKLCVMHFMQLFEQVPLPAELPGSVLRQLQETYDLIAGSTMAMKLRCAVPGCYSKYTDNIRLTKLPMCAGMCSKWVHNTKINYDATRHYVYRICMLHFESRCLGPVRPKLWAVPTLHLNHNDANIYQNPKLDGQLPSAPAPTPVPIAMTASVPIAMTSSVPIAITASVPVELPLRIKTELAFSGSPSASASPSPRGKLRFCCIPSCLQQATSQTRLFRFPTAETALLKWLVNTQQQPRLVDTQQLFICQDHFEAEAICKKQLRSWAVPTLKLGHDGHVIPNARHNGNIADSQENKQTLQYIWENYCSVLSCFQPRSAELRLYAYPTDRPTIRKWATNCKHRSMQASSDGFQVCQLHFAPHCFDQETGELREDAVPTLELSRCLNDVHCVVAGCVKDEDGPRQRFYKMPKRSAQLLSWCHNLRLDAATMGSGEHHVCDRHFETQCINQQKLLRPGARPTLHLGHDESIDLMPNPAEWDATDAAPATDNVCCVPNCGLAKDEEEDVQLFAFPKLRTLAEKWLQNIRLENISREQLMRLRICGAHFDAGCLESNGRPQLGAMPTLQLGHEDNSNIHRSTDAAAVKAKKFCNRSGSSYDCCYPQCVELQKSYLRISYDLPQSEALRLKWLEYMGLEKTEEKLLKLCPLHLVLLYDHSVEHFAAEHSPEPQLEANYEDSRNSVRLRVISCAVPGCRTLKPRDGGILHGLPQRRDVLEMWLHNMQLVFYEQQRYMYKICSKHFEPCCFMDTTRRLKPWTMPTLELPSRDAEEAPIYPNPSEQEWQRMNELLAVEQLQPQKEEPQQPEELCNLLEPIVKMEHIDRDEDEDEFHEYQEQQDEELQPTDNVDDNSQQPLALEVLLEVGHVEKCTTYEQMDNEANLSYSEQQQQLHAYGAGAASSGHLGTNGFKYTARHCSVHGCDVTVNDVNGSIKLHKFPTSLDAMEKWKHNTQVEVDLNYSWRFRICSYHFTDECFHGARIKRGAMPTLSLGPRRPPKIYDNEFGSTLPLSEPAQLQHSEEKQLSRHTKDNEVNLRLPEPAPPRKSSKFCQVDGCPNHLTSENLTLHKFPHDVDMCAKWQHNTQVPFDPDYRWRYRICSAHFEPICLLNMRLLHGSVPTLKLGPRAPRQLFDSDFEAINMRLDKQKNSSGEQQFPIKLEQVFQEEEEGEEAELSYLVPEMQLHEEMEHAQGRSRNWEELRLPSIKQESEEQSQTSYNPVKSGYDKCSLVHCQRQRSQHGVHIYKFPRSRQQQQRWMHNLRIKYDERRPWKTMICSVHFEPSCIRLRKLCSWAVPTLELGDNVPLEIYTNEQSRQQQEVGSDCEDMPLEDSYEDDDYDDDLAQQMANEPLVKRERRSRLDPLPPGQLPPWKIKVCSLPYCRSPRGDGIKLFRLPNNTSSIRKWEQATGMRFTEAQRNTKLICSRHFDPQLIGVRRLMYNAVPTLNLGPRSEDSSAVLLPTLGPRCCMPDCQAEGKDTKLHKFPSDPMLLHQWCHALNLSDTQHYRGKHICAQHLPAKTPNCIVCGIEQLQLPLLDFPENRNMRAKWCYNLKIEPIAKWDHSRQICSKHFESYCFTQPGQLQPEAAPTLHLRHNDSNIFLNDYAITDDSKMLRIKDEPLDSDDLML

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00601834;
90% Identity
-
80% Identity
-