Basic Information

Gene Symbol
-
Assembly
GCA_037075165.1
Location
JBAMCL010000055.1:432547-451866[-]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 29 7.8e-15 1.3e-11 45.3 4.4 1 86 585 657 585 658 0.85
2 29 3.3e-15 5.4e-12 46.5 4.8 1 87 685 754 685 754 0.82
3 29 1e-15 1.6e-12 48.2 0.4 1 87 776 848 776 848 0.85
4 29 8.8e-16 1.4e-12 48.4 5.7 1 87 948 1018 948 1018 0.83
5 29 9.5e-15 1.5e-11 45.1 3.3 1 86 1042 1113 1042 1114 0.82
6 29 7.7e-13 1.3e-09 38.9 1.1 1 87 1149 1217 1149 1217 0.80
7 29 6.3e-11 1e-07 32.8 1.8 1 86 1264 1333 1264 1334 0.77
8 29 4.9e-16 8e-13 49.2 0.3 1 86 1361 1430 1361 1431 0.82
9 29 5.3e-12 8.6e-09 36.3 2.6 1 86 1452 1521 1452 1522 0.78
10 29 1e-14 1.6e-11 45.0 1.6 1 86 1549 1620 1549 1621 0.86
11 29 9.6e-14 1.6e-10 41.9 0.7 1 86 1698 1767 1698 1768 0.82
12 29 2.9e-12 4.7e-09 37.1 0.1 1 86 1791 1859 1791 1860 0.81
13 29 5.3e-14 8.7e-11 42.7 0.9 1 87 2007 2076 2007 2076 0.79
14 29 6.2e-11 1e-07 32.8 0.1 1 86 2134 2205 2134 2206 0.78
15 29 0.029 47 5.0 0.1 1 58 2245 2295 2245 2314 0.79
16 29 6.3e-11 1e-07 32.8 1.4 1 86 2334 2403 2334 2404 0.85
17 29 1.3e-14 2.1e-11 44.7 1.3 1 87 2462 2532 2462 2532 0.83
18 29 2.8e-12 4.5e-09 37.2 0.3 1 86 2567 2638 2567 2639 0.82
19 29 6.4e-12 1e-08 36.0 2.3 1 87 2649 2720 2649 2720 0.82
20 29 2e-12 3.2e-09 37.6 0.0 1 86 2746 2816 2746 2817 0.78
21 29 3.1e-05 0.05 14.6 0.1 1 58 2850 2900 2850 2924 0.81
22 29 1e-13 1.7e-10 41.7 0.0 1 86 2938 3010 2938 3011 0.81
23 29 7.2e-15 1.2e-11 45.5 0.5 1 86 3173 3245 3173 3246 0.84
24 29 8.3e-15 1.4e-11 45.3 1.5 1 86 3310 3380 3310 3381 0.82
25 29 9.1 1.5e+04 -2.9 0.0 55 70 3409 3424 3401 3435 0.64
26 29 2.3e-13 3.7e-10 40.6 5.9 1 86 3486 3556 3486 3557 0.85
27 29 2.9e-14 4.7e-11 43.5 0.1 1 87 3641 3711 3641 3711 0.84
28 29 5.3e-10 8.6e-07 29.9 0.6 1 58 3727 3775 3727 3791 0.85
29 29 1.6e-08 2.6e-05 25.1 1.8 18 87 3792 3850 3779 3850 0.74

Sequence Information

Coding Sequence
ATGTCACAACACAACAATCCCCCGCATCATCATCACTATtaccagcatcagcagcaacaacaacaacagcagcaccagcagcagcatcagcagcagctacaacataaacaaatacagCAGCACAGTTGGTACTCACATGTTGCTTCCTACCCTCCCCACCATTCGCACGTCGCTGCCTTTGCGGCGCCCTGCAAAaccaataataacaacaacaacattatgAATGCATACGGCGCGGGAGCTGGCAGCACGCATGCAGCATATTACggctctgctgctgcggcagctggTGGGGTGGGCTATAACCTTGAGGCCAACACTGTGGCCTATGCGCACAACCAGCTGCTGCAgtaccaacaacagcagcaacaacatcagcagcagcaacagcagcagcagctcagtcAACGTTCGTATATGCCGCACAGTTTAATGCATGGCTCATATCCCTACATCAAGAGCGAGCCATTAGAGCTGCCCGATGATAGACaacgccaacagcagcagcagcaacaacaacaacaacatcatcatcaacagcagcagcagcagcaccaccaccagcaacaacaacaacactttcagAATCCAATGGCACCGCCGCCGGCGCCCGCCAATCGCCACCCGCTCGATGTCAGCGGcgaaatgataataaaatCGGAACCCATAGACGAACATGCGTACAAGTCGAACTATATCGATGATAATACTCCCTTTGCCGATTTTAGTAAATATCCCGAGTTTGGCGACGACATGTTAAGTCCCAAGGTGGAACTTACCGTCAAAGATGAGGCCTATGGAAACCAAAAAaaCCTGCTCAACTATCCGCGCCGCAAGCTGCAAACGGAGCGGTCATCGGAAAGGTTGCCCATTTGTCAGCGCTGCAAGGAGGTATTCTTCAAGAAGCAAGTCTACATGCGGCATGTGGCagtcagcagctgcagcataCAAGAGTATGACTTCAAGTGCAACATATGCCCCATGTCCTTTATGAGCACCGAGGAGCTGCAGAAACACAAGCACTTGCACAGGGCGGACAAGTTCTTCTGCCACAAATACTGTGGCAAACACTTTGACACGATTGCTGAATGCGAGTCGCATGAGTATATGCAGCATGAGTACGATAGCTTTGTGTGCAATATGTGCTCCGTAACGTTTTCCACACGGGAACAGCTGTATGCGCATCTGCCGCAACACAAGTTCCAACAGCGTTATGATTGCCCCATTTGCCGCTTATGGTATCAAACAGCGCTGGAGCTGCACGAGCATCGCCGCGCGGCACCTTACTTCTGTGGCAAGTATTACCCGGCTGCTCAGTCCACggcacatcagcagcaacagcagcttcCGCAGCATCAACATcctcagcagcaacatccgcaacagcatcagcaacaggcCAGTTACAAACTGCAGGACTGTCGCATGGGTTCTATTGAAATGCCAACTTCGCACCACAAGCCGAATACAACTGCCTCCACACTGCCGGCAACTGCCGCGCTTAGCTCGCTGTTGCAGCAGCGTCAGGCAAATGCCGACGGCGCCGCAATGTTTGCATCAACGCTGAAGAGCGAAGCTAATATCAAGCTGGAGCGTAGCTATAGCAACTCCACAAGCGAGTCCGGCTACAGTCTCCATGACAGCAGCTATAACAATGCATACGGCAGCGACACCTCGTTACATGGAGGCGGCGCCGCAATTGGTGGTCCACAGGCACACTCCTCGACGCTGGACGACTCGGAGGATGCGCTCTGCTGTGTGCCGCTGTGCGGTGTGCGAAAGAGCACAAGCCCGACGCTGCAGTTCTTTACGTTTCCCAAAGATGATAAGTACTTGCATCAGTGGCTGCACAATTTAAAGATGTTTCACATTCCGGCATCGAGCTATGCAAACTTTCGCATCTGCAGCATGCACTTTCCGAAGCGCTGCATCAATCGTTACTCTCTGTGCTATTGGGCGGTGCCCACTTTCAATCTGGGCCACGACGATGTCGCCAATTTGTATCAGAATCGGGAACTGACCAACACATTCACCACTGGCGAGATAGCGCGCTGCAGCATGCCCAATTGCACTAGTCAGCGCGGCGAGAGCAATCTTAAGTTCTACAACTTTCCCAAGGACATTAAGAGCCTAATCAAGTGGTGCCAAAACGCCCGTCTGCCCGTCCAGTCCAAGGAGCCGCGTCACTTTTGCAGTCGTCATTTCGAGGAGCGCTGTATTGGCAAGTTCAGGTTGAAACCGTGGGCAGTGCCCACGCTTCATTTGGGTGCCCAGTACGGCAAAATTCACGACAATCCGAAGAACTTGTATGTGGAGGAGAAACGCTGCTGCCTCAATTTCTGTCGTCGCAGTCGCTCCTCAGACTTCAACATGTCACTATATCGATTCCCGAGAGATGAGGTTCTCCTGCGACGCTGGTGCTACAATCTGCGTCTCGATCCTGCCGTCTATCGTGGCAAGAATCACAAAATATGCAGCGCTCACTTCATCAAGGAGGCATTGGGGCTCCGCAAATTATCACCGGGCGCTGTTCCCACGCTGCATCTTGGTCACAACGACACATTTAACATATACGAGAATGAGCTGTGGCCGCCACCGTCGCCCTCAACGTCCACCAATCACCAGCAGCAtctgcagcagcaccagctgcagcaacatcagcagcaacacgGACAACACACTCACCATAGCAAGTATCAGCGTCATTCGGCTGCATCCACGTCCTCGTCGGCCAGCTCGACATCGCATTATGTGGACGCAGAGCTGAGTGCATCATATCTAGGCATGGGCGCCTCTGGAGGCTCGTCCTCTGGGCTGAATGTCAGCGACAGCATGGATGTGTGCTGTGTGCCCAGTTGCGAGAGCAAGCGTCACAATAATGAGAACATCACATTCCATACAATACCCAGGCGGCCGGAACAGATGCGCAAGTGGTGTCACAATCTGAAGATACCCGAGGATAAGATGCATAAGGGCATGCGCATATGCAGCCTGCACTTTGAACCCTACTGCATTGGTGGCTGCATGCGTCCGTTTGCGGTGCCCACGCTGCATCTGGGCCACGACGACGAGGACATCCATCGCAATCCGGACGTGATCAAGAAACTCAACATTCGCGAAACGTGCTGCGTTGCTGTGTGCAAGCGCAATCGTGATCGCGATCACGCCAATCTGCATCGTTTCCCAAGCAATGTCGCTCTCTTGACCAAATGGTGTGCAAACCTGCAGCGGTCCGTGCCGGATGGAAGCAAGCTATTCAACGATGCCATTTGTGAGGTACACTTTGAGGATCGTTGTCTGCGCAACAAGCGGCTGGAAAAGTGGGCGGTGCCCACGCTGGTGCTGGGGCACGAAAACATTGCCTACCCGCTGCCCACGCCCGAGCAGGTCGCCGAGTTCTATGCGCGTCCCAGCGCACCGAATAATGGCGAGGAGCAGGGCGAATGCTGTGTGGATACATGCAAGCGTAATCCCAGCGTCGATGATATAAAGCTCTATCGGCCACCTGAGGAGTCTCAGGTGCTGGCCAAATGGGCTCATAATCTGCAGTTGGATGTCGCCAAGCTGCCTAATATGAGAATTTGCAACCTGCACTTTGAATCACACTGCATTGGCAAGCGCATGCGGCCCTGGGCTATACCCACGCTCAACTTGGCTAACAATATTGAGAATATGTTCGAGAATCCTGAGCATCAGCTGCTATACAAGCGTCGCACACATCTGAACTCGGACAGAGCCGCTGCACACAGCTCCGGCGCGGGTGTTGCTAAGCCGACGTGGGTACCACGTTGTTGCTTGCCGCATTGCCGCAAAGTGCGCGCGCTTCATAATGTGCAGCTGTATCGTTTCCCGAAGCTCAATCGCTCTACGCTCGCCAAGTGGGCCCACAATTTGCAAGTACCGCAAGTGGGCAGTGCCCAAAGACGTCTGTGCTCTGCGCACTTTGAACCGCATGTGCTAAGCAAAAAGTGTCCGGTTCCGTTGGCGGTGCCTACGCTGGACCTCAATACGCCACCCGGCTACAAAATCTATCAGAATCCCGCCAAGCTGAAGGCCAACAAGTTGTGTCTGCAGCGCGTCTGCATTGTGGAGAGCTGCCGGCGGCAGCGTGCGCAAGGCGTACAGCTCTTCCGGCTGCCCCACAGTCCCACACAACTACGGAAATGGATGCACAATATTCACATGCGACCAAGAGGCGCCATGCGACAACAATATCGCATCTGTTCCATGCACTTTGAGACGCACTCGTTTAATGGAAAACGACTGAGCACTGGTGCTATACCCACTCTGGAGCTGGGCCATCAGGACGACGATATATATCCGAATGAGGCGCAGTCCTTTGTGGAGGAGCACTGTACCGTCGAGAACTGCGAGTCGGCCAAGGAACACCCGGATGTGCGTTTGTTCCGCTTCCCCACAGACGACGAGGACCTGCTGTGGAAGTGGTGCAATAATCTCAAAATGAATCCCGTCGATTGCATAGGCGTGCGGATCTGCAATAAGCATTTTGATTCGGACTGCATTGGGCCGAAGCATCTGTTCAAGTGGGCTATTCCAACAATGGCTCTGGGGCACGACGATGCACAGATTGAGCTCATACTCAATCCCAAGCCGGAGGAGCGATATGTCGATCCGGTGTTCAAGTGTTGTGTACCAACTTGCGGCAAGACGCGCAAGTTCGATGAGGCTCAAATGAATAGCTTTCCCAAGGATCCGACGCTTTTCCAGCGATGGCGTCATAATCTGCGACTCGATCATCTCAACTTTAAGGAGCGCGAACGCTACAAGATCTGCAATGCACACTTTGAAGACATTTGCATTGGCAAGACGCGGCTCAATATTGGCTCCATACCAACGCTAGAGTTGGGCCACGACCAGACTGAGGATCTGTATCAAGTGAACCCTGCAGATCTGCAAAGTAATTTGTTTGGACGCCAACGACGCGTGCAAGACTCAATGGGCGTCACAATCAAGCAGGAGGACCAATCAGAGCAAGAAGATGACTCGATTAAGCCGGAGATCACTATGTCTGAGGCCACTGGCTTAAATACAAGACAGgtaaaaataaagaaattgttaTGCGATCTTAAGTGCTGTGTGCCGAGCTGCGGACGCAGTCGATTGGAACATGGAGCACGCTTGTTCCCCTTCCCGAACGGCAAGCAACAACAGATCAGGTGGCGACACAATCTTCGCATGGATGCCGCCGACCTGGACAAGACGACACGCGTGTGTAGCGCTCACTTCAATCGTCGATGCATCGATGGCAAGCAGCTGCGGGGCTGGGCTATGCCCACACTTCAGCTGGGCCACGACGAGCAGCCGATCTATGAGAATCCAAAGAATATACCGGGCTTCTTTACGCCCACCTGTGCGCTGGCGCATTGCCGCAAGCGGCGCAGCATTGACAACGATCTGCGAACCTATCGCTATCCGCGCAGCGAGGAGCTGCTGGAGAAGTGGCGCGTCAATTTGCGACTGGCGCCAGATCAATGTCGCGGCCGCATCTGTGCAGATCACTTCGAGCCGATGGTGCGCGGCAAGCTAAAGCTGAAAACGGGAGCGGTGCCCACGCTGAAACTGGGCCATGATGAGGGCGTCATATTTGATAACGAGGCCATTAAGATGGGCATGCTGCAGGAAGATGCTGAAGAGGGCGATGTCAGCACAGAGTCAATAGCGCTGCGAACCAGGATAAAAATTGAAGACGATGACCAGCAACACCATAAGGTAGAGAGCtttgatgatgacgacgatgatgctgatgctgatgctaaTGCTGATCTTGATgcggatgatgatgaggaggaTAACGACCAAGACGAACATAGCTACTTTGATCCCATGGAACTGGTGGAAACCTACGCCGAGCAGCATAGCGATGCGGCTGAAGAAGATGATGATGTTGGTGacgatgatggtgatgatgatgagctgctgttgctgccggaCACGCTGCCAGTTCAAATGGCGTTGCCACCACGTCGTGAGAAGCCTGTCAACAATGTGACTCCTATTTGCTGCCTGAAGCACTGCCGCAAGGAGCGCACAGCCAGTCATCAGCTAAGCACATTTGGCTTTCCGAAAGatcagcaacagctgcttaaATGGAGCGCCAATCTGCAGATCTCGATTGCCGACTGTCTGGGACGCGTGTGCATTGAGCATTTTGAGTCCGAGATGCTGGGCACGCGAAAACTAAAACAGAATGCGGTGCCCACATTGAATCTGGGACACACAGCGCCGCTAGAATACACCTGCAATGGACAACCGACCATTTACGACGAACAGCCACAGCATTCGGTTTTTCGGCTTTGGAGCCTGAAACACTGCCGCAAAAGGAAACTGCAAGTGGACCTGCCGGACCTGCCAGCGAGTAATCATCCACCTAAGCGACGCTGCTGTCTGCCCAGCTGTGGCAAGCAGCCGGAATTGCATGGCGTTCAGTTGCAGCGATTGCCCAGGAATCGCATTATGCTTCGAAAATGGCTGCACAATCTCAAACTCTCGCCAGCAGCTGTGGACACCAGCCAGGCATCTATTTGCACCGAACACTTTGAGCCACAGCTGCTGAAAATGGGCCGCGCGCCGCTGGAAGATTGCGTGCCCACCCTGAGATTGGGCCATGTCGATACTAACATTTATCGCAATCGGGTCAGTGGCAATGCCATTGCCAGTGACGGTGCCATTGCCAGTGGTAATGGTAGAAACGTCCCAGTCTCATCCAGTAGTTGTATGGTGCCCAGCTGTCCGTGTGCGCGCCTCAATATATATCGCTGCTTTGATCTGCCGGATAATCGTTTGGTGCAGCAAGCCTGGCTACAATGGCTGCAGCTGCCAATGCCGCAGCTGGCCTGCGACGGTCAGCTATGTGTCATGCACTACATGCAAGTGTACGAGCAGGTGCCGCTGCCCGAGGGGCTGCCAGAAACGGtggtgcagcagctgcaggaaACCTATGAAGCAATCGCGAGCTCTTCCATGGCCATGAAGCTACGTTGTGCCGTGCCCAAATGTTACTCTAAGTACACAGACAACATACGGCTTACCAAGCTTCCGTTGTGCCCGGATATGTGCGCCAAGTGGGTGCACAATACCAAAATTACTTATGACATTACACGTCATTACATTTATCGCATTTGTTTTCTGCACTTTGAACCGCGCTGCTTGGGCCCAGTGCGTCCCAAAATTTGGGCAGTGCCCACTTTGCAACTTCATCATAAGGATCCGAATATCTATTTGAATCCAAAGTCGGAAGCCACCACAATCCCTGCGCCAGTGCCACTAGAGCTGCCGTTGCGTATTAAAACGGAGTTACCGCTCGCATTGAGCGTCAGTCCCAGCAACAGTGCCAGTCCCAGTCCGCGTGGCAAGCTGCGCTTTTGCTGCATCCCCACCTGTGCTCAGCAGGCCACATCGCAATTGCGCCTATTCCGCTTTCCCACTGCTGAGACGGCGCTGCTCAAGTGGCTGGTGAACACGCAACAAAGTCCGCGCTTGGTCGATCCGCAGCAGCTGTTTGTGTGCCAGGAACACTTTGAGTCGGAGGCTATTTGCATGAAGCAGCTGCGCAGCTGGGCGGTGCCCACGCTTAATCTGGGCCACGATGGCTACGTCATTCCAAATGCAAAGCACAATGGCAACATTGCCGACAGTCAGGAGAACAAACATGCCCTGCAGTACATTTGGGAGAACTACTGTTCGATTTTGAGCTGCCTTCAGCAGCGCAGTGAGGAGTTGCGTCTCTACTCATACCCCCAGGATCGGCCTACAATACGCAAATGGGCGGCCAACTGCAAGCACCGATCCATGCAGGCTAGCAGTGATGGATTTCAGGTATGTCAGTCGCACTTTGCACCGGATTGCTTCGATCCAGAGACGGCGGAGTTACTGGAGGGCGCTGTGCCGACACTGGAGTTAAGCCGAAACCTCAATGAGTTGCGCTGTATAGTGACTGGCTGCGTAAAGGATGATTCACAGCGTCGTCGCTTATTTCGAATGCCAAAGCGCTGCTCCCAGCTGGTCGATTGGTGTCACAATCTGCACATAGATCCGGCGTCTTTTGCCGGCACGGAACAGCACGTTTGTGAACGCCACTTTGAGGCCGACTGCTTTAATGCGTATAAAGTTTTGCGTCCTGGAGCACGACCAACGCTGCACTTGGGTCATGACGAGCAGGTAGAGCTGTTGCGCAATCCAGCAAATTGGGCCCGTTGCCCAGAGGAACCTGTAATTTGCTGCGTGCCGAACTGTGGAAGTTCCGCAGATACTGATGATGTGAAGCTGTTTGGCCTGCCCAAAATGCGCATCTTGGCCGATAAATGGCTACAAAATGTTCGGCTTGAGCCAGGCAATGGGTCGCTTCCCAAGCTGAAATTCTGCAGCCTACACTTTGAGAGCAGCTGCATAGAGAATGGACGCCCCCAGATGGGAGCCATGCCCACGCTTCAGCTGGGGCACGAAGAACGACACAACATACATAAGACCGCTGACCAAAGTCAAAGCAAGAACAAACGGTACTGCAACAGGAATGGCTCTAGTCATGGCTGTTGCTATCCGCAATGCGTGGAGCTGCAGAAAAGTTACCTACGCATTAGCTACGAGCTTCCACGGCAAGAGGCATTACGTCGCCAGTGGCTGGAATATATGGAAGTGGAGGAATTCGAGGAGCAACCGCTTAAGCTCTGCCCGCTGCATTTGGTCATATTATATGATCATAGTGTCGAGCACTTTGCAGAACACGCGTCCGAGCAGCTTCTGGACGCCAGCTATGAAGATGCGCGCAACAGTGTGCGCTTACGCGTCATAAGCTGTGCGGTGCGTGGCTGCAAGACACTTAAACCACGTGACGGCGGCCGTCTGCATGGCTTGCCGTATCGTCGCGATATCCTTGATATGTGGCTGTACAATATGAAGCTGGTGTTTTATGAGCAGCAGCGTTATATGTACAAAATATGCAGCAAACACTTTGAGCCAAACTGCCTAATGGAGGCGACACGCCGGCTAAAGCCTTGGAGCATGCCGACATTGGAGCTACCAGAGCGTGAGCCTGGCGAGGATCCTCCATATCAGATTCCCACAGAAGCTGAATGGCAACAAATGAATGAGTCCATGGCCATCAGCAAAACACCGCAGCAACAGGACACTGAGGCAATCGAGGACAGCTATTTGCTGGAGCCTATTGTAAAGTTGGAGCCGCAGGAAAGTGAGGAACCAGAGCAGCTGCTAGAGGATGACAACTCACAGCAGTTTGCAGATGATGACTACAACTCTCAGCAGTTTTTGGAGCAGCCTCCGCACGAGGACGACACCAGCTCCCAGCAGCCGTTGGAAATGCAAGCTTTGGAGGTGCTGCTTGAAGTGGGTCACGTAGAGAACTGCACCACATACGAGCAAATGGACACCGAGGCAGATCTTTGCTATGCCGAGAAGCAAGCGCACAACAGCTTTGGCGCAGCCCCGCCAATTGGCAGTGGCACCATTGTAAGCAATGGTCTTCAATACAGTGCGCGGCACTGCAGCGTGCGGGGCTGTGATGTGACCGCCAACGATGTGGATGACAACTTAAAGTTACACAAATTTCCCACCTCTTCGGATGCCATGCAGAAATGGATGCACAACACTCAGGTGGATGTGGATACAAACTTTGCGTGGCGCTTCCGAATCTGCAGCTATCACTTTGTACCCGAATGCTTTAATGGCTCGCGCATAAGACGCGGCGCAATTCCCACGATGCGACTAGGACCTCGCAGACCAGCGATTATATACGACAATGAGTTTAACACATTGCTTCAGCTGGATCAGCAAAATAATGAAGCCAGCAGCGAACACCCACAAACACTGGAGCCAGGTGAGATCGTTCCGAAGACTGCACCAATAAAACTGCGCCTGCCGCGTCCGGCGCCGCCACGCAAGTCCAGCAAATTCTGTCAGATTGAAGGCTGTCCAAATCATTTGACCAGCGAGAATTTGACGCTTCATAAATTTCCTCACGCACCGGACATGTGCGCCAAATGGCAACACAATACCCAGGTGCCCTTCGATCCGGAGTATCGCTGGCGCTATCGCATCTGTAGCGCACATTTTGAGGCCAGCTGCTTGGGCAACATGCGTCTAATGCATGGTAGCGTGCCCACGTTGCAACTTGGCCCGCGTGCACCTTCGCAGTTATTCGCTAATGACGTTGCATCGTTTAATATGCGTCTTGATAAGCCAAAGAGCAGCTCCGATCATTTTGAGCCGTCGGATCAGCTAGAAGACTCGGATTTGCAAGAACAGGAAGATCTCAGTTTACTGGTGCCAGATATGCAGTTGTACGAGGATGGCGAGCAGTCGGACAACCAGCTGTACTATAATAGCCACAACAGACCGAAGGAGTACCAGCATCTGCGACTGCCCAGCATTAAGCAGGAGAAAGCCAGCAGCTATAATCCGGTCAAGTCTGGCTATGACAAGTGCTCACTGTTTCATTGCCAGCGTCAGCGAACGCAGCACGGCGTGCACCTCTACAAATTTCCACGCGCGCGACATCTGCAGCAACGTTGGATGCACAATTTGCGCATTAAGTACGACGAGCGACGGCCATGGAAGACTATGATATGCAGCGTACACTTTGAGCCGCACTGCATACGGCTGCGCAAGCTCTGTCCCTGGGCGGTGCCCACTCTTGAGCTGGGCGACAATGTTCCGCAGCAAATATATACGAATGAGGAGAGCCAGCAGCAGTTTGACTTAGAGAACAACGATATGGCAGCGGCCAGCGACGACGAGGATATGGACGTGGATGTGGATGGCACAATGCTAGAGGCGGACTACGACGAATATGATGATGACGAGCCATTTTCCCAAGAGCCGAATGTCAAAAAAGAGCGTCGCTCACGTCAGGACCCATGGCAGACAGCGCcttggaaaataaaaatgtgctgTTTGCCCTACTGTCGCAGTCCACGTGGTGATGGCATCAAGCTCTTTCGGCTGCCCAACAACCTTAGTTCCATACgcaaatgggaacaggccacGGGCATGCGCTTCGATGAATCACAACGCAATACAAAGCTCATCTGCAGCCGTCATTTTGAATCGCAGCTTATTGGCGTACGTCGCCTCATGTCAAATGCCGTGCCCACCCTCCATCTGGGCCCAATGAGTCAGCCCGTGCTGCCTCCTGCGGGTCCACGCTGCTGTATAACCGATTGCCATCAAGAACTGAATGTTAAGCTGCACAAGTTTCCAACTGATCCCATGCTGCTTGAACAGTGGTGCCAGGCGCTAAAGCTGTCGGATGCTGAACGCTATCGAGGCAAATATGTTTGCGCGACTCATTTGCCATCCAAAGCGAAGAGCTGCCTTATCTGCGGTGTGGAGGATAtacagctgccgctgcttgaCTTCCCCGAGAATCGCAATCAGCGTGCCAAATGGTGCTACAATCTGAAAGTCGAAACCATACCTAAGTGGGACAACTCGAAGCACATTTGCTGCAAGCATTTTGAGAGTTACTGCTTTACCCAGCCGGGTCAGCTGGTAGAGGAGGCAGCGCCTACTTTGCATTTAAAGCATAACGATAAGAACATATTCGTAAACGATTATGCCATAGATAACAAGACGTTGCGCATTAAGGACGAGCCCTTGGACAGCGATGAATATATGCTGTAA
Protein Sequence
MSQHNNPPHHHHYYQHQQQQQQQQHQQQHQQQLQHKQIQQHSWYSHVASYPPHHSHVAAFAAPCKTNNNNNNIMNAYGAGAGSTHAAYYGSAAAAAGGVGYNLEANTVAYAHNQLLQYQQQQQQHQQQQQQQQLSQRSYMPHSLMHGSYPYIKSEPLELPDDRQRQQQQQQQQQQHHHQQQQQQHHHQQQQQHFQNPMAPPPAPANRHPLDVSGEMIIKSEPIDEHAYKSNYIDDNTPFADFSKYPEFGDDMLSPKVELTVKDEAYGNQKNLLNYPRRKLQTERSSERLPICQRCKEVFFKKQVYMRHVAVSSCSIQEYDFKCNICPMSFMSTEELQKHKHLHRADKFFCHKYCGKHFDTIAECESHEYMQHEYDSFVCNMCSVTFSTREQLYAHLPQHKFQQRYDCPICRLWYQTALELHEHRRAAPYFCGKYYPAAQSTAHQQQQQLPQHQHPQQQHPQQHQQQASYKLQDCRMGSIEMPTSHHKPNTTASTLPATAALSSLLQQRQANADGAAMFASTLKSEANIKLERSYSNSTSESGYSLHDSSYNNAYGSDTSLHGGGAAIGGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDDKYLHQWLHNLKMFHIPASSYANFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEIARCSMPNCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQSKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPAVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPSPSTSTNHQQHLQQHQLQQHQQQHGQHTHHSKYQRHSAASTSSSASSTSHYVDAELSASYLGMGASGGSSSGLNVSDSMDVCCVPSCESKRHNNENITFHTIPRRPEQMRKWCHNLKIPEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLHLGHDDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVALLTKWCANLQRSVPDGSKLFNDAICEVHFEDRCLRNKRLEKWAVPTLVLGHENIAYPLPTPEQVAEFYARPSAPNNGEEQGECCVDTCKRNPSVDDIKLYRPPEESQVLAKWAHNLQLDVAKLPNMRICNLHFESHCIGKRMRPWAIPTLNLANNIENMFENPEHQLLYKRRTHLNSDRAAAHSSGAGVAKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPQVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTLDLNTPPGYKIYQNPAKLKANKLCLQRVCIVESCRRQRAQGVQLFRLPHSPTQLRKWMHNIHMRPRGAMRQQYRICSMHFETHSFNGKRLSTGAIPTLELGHQDDDIYPNEAQSFVEEHCTVENCESAKEHPDVRLFRFPTDDEDLLWKWCNNLKMNPVDCIGVRICNKHFDSDCIGPKHLFKWAIPTMALGHDDAQIELILNPKPEERYVDPVFKCCVPTCGKTRKFDEAQMNSFPKDPTLFQRWRHNLRLDHLNFKERERYKICNAHFEDICIGKTRLNIGSIPTLELGHDQTEDLYQVNPADLQSNLFGRQRRVQDSMGVTIKQEDQSEQEDDSIKPEITMSEATGLNTRQVKIKKLLCDLKCCVPSCGRSRLEHGARLFPFPNGKQQQIRWRHNLRMDAADLDKTTRVCSAHFNRRCIDGKQLRGWAMPTLQLGHDEQPIYENPKNIPGFFTPTCALAHCRKRRSIDNDLRTYRYPRSEELLEKWRVNLRLAPDQCRGRICADHFEPMVRGKLKLKTGAVPTLKLGHDEGVIFDNEAIKMGMLQEDAEEGDVSTESIALRTRIKIEDDDQQHHKVESFDDDDDDADADANADLDADDDEEDNDQDEHSYFDPMELVETYAEQHSDAAEEDDDVGDDDGDDDELLLLPDTLPVQMALPPRREKPVNNVTPICCLKHCRKERTASHQLSTFGFPKDQQQLLKWSANLQISIADCLGRVCIEHFESEMLGTRKLKQNAVPTLNLGHTAPLEYTCNGQPTIYDEQPQHSVFRLWSLKHCRKRKLQVDLPDLPASNHPPKRRCCLPSCGKQPELHGVQLQRLPRNRIMLRKWLHNLKLSPAAVDTSQASICTEHFEPQLLKMGRAPLEDCVPTLRLGHVDTNIYRNRVSGNAIASDGAIASGNGRNVPVSSSSCMVPSCPCARLNIYRCFDLPDNRLVQQAWLQWLQLPMPQLACDGQLCVMHYMQVYEQVPLPEGLPETVVQQLQETYEAIASSSMAMKLRCAVPKCYSKYTDNIRLTKLPLCPDMCAKWVHNTKITYDITRHYIYRICFLHFEPRCLGPVRPKIWAVPTLQLHHKDPNIYLNPKSEATTIPAPVPLELPLRIKTELPLALSVSPSNSASPSPRGKLRFCCIPTCAQQATSQLRLFRFPTAETALLKWLVNTQQSPRLVDPQQLFVCQEHFESEAICMKQLRSWAVPTLNLGHDGYVIPNAKHNGNIADSQENKHALQYIWENYCSILSCLQQRSEELRLYSYPQDRPTIRKWAANCKHRSMQASSDGFQVCQSHFAPDCFDPETAELLEGAVPTLELSRNLNELRCIVTGCVKDDSQRRRLFRMPKRCSQLVDWCHNLHIDPASFAGTEQHVCERHFEADCFNAYKVLRPGARPTLHLGHDEQVELLRNPANWARCPEEPVICCVPNCGSSADTDDVKLFGLPKMRILADKWLQNVRLEPGNGSLPKLKFCSLHFESSCIENGRPQMGAMPTLQLGHEERHNIHKTADQSQSKNKRYCNRNGSSHGCCYPQCVELQKSYLRISYELPRQEALRRQWLEYMEVEEFEEQPLKLCPLHLVILYDHSVEHFAEHASEQLLDASYEDARNSVRLRVISCAVRGCKTLKPRDGGRLHGLPYRRDILDMWLYNMKLVFYEQQRYMYKICSKHFEPNCLMEATRRLKPWSMPTLELPEREPGEDPPYQIPTEAEWQQMNESMAISKTPQQQDTEAIEDSYLLEPIVKLEPQESEEPEQLLEDDNSQQFADDDYNSQQFLEQPPHEDDTSSQQPLEMQALEVLLEVGHVENCTTYEQMDTEADLCYAEKQAHNSFGAAPPIGSGTIVSNGLQYSARHCSVRGCDVTANDVDDNLKLHKFPTSSDAMQKWMHNTQVDVDTNFAWRFRICSYHFVPECFNGSRIRRGAIPTMRLGPRRPAIIYDNEFNTLLQLDQQNNEASSEHPQTLEPGEIVPKTAPIKLRLPRPAPPRKSSKFCQIEGCPNHLTSENLTLHKFPHAPDMCAKWQHNTQVPFDPEYRWRYRICSAHFEASCLGNMRLMHGSVPTLQLGPRAPSQLFANDVASFNMRLDKPKSSSDHFEPSDQLEDSDLQEQEDLSLLVPDMQLYEDGEQSDNQLYYNSHNRPKEYQHLRLPSIKQEKASSYNPVKSGYDKCSLFHCQRQRTQHGVHLYKFPRARHLQQRWMHNLRIKYDERRPWKTMICSVHFEPHCIRLRKLCPWAVPTLELGDNVPQQIYTNEESQQQFDLENNDMAAASDDEDMDVDVDGTMLEADYDEYDDDEPFSQEPNVKKERRSRQDPWQTAPWKIKMCCLPYCRSPRGDGIKLFRLPNNLSSIRKWEQATGMRFDESQRNTKLICSRHFESQLIGVRRLMSNAVPTLHLGPMSQPVLPPAGPRCCITDCHQELNVKLHKFPTDPMLLEQWCQALKLSDAERYRGKYVCATHLPSKAKSCLICGVEDIQLPLLDFPENRNQRAKWCYNLKVETIPKWDNSKHICCKHFESYCFTQPGQLVEEAAPTLHLKHNDKNIFVNDYAIDNKTLRIKDEPLDSDEYML

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00802880;
90% Identity
iTF_00803632;
80% Identity
-