Basic Information

Gene Symbol
-
Assembly
GCA_008042775.1
Location
VNKC01000420.1:227378-244829[-]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 30 2.9 5.8e+03 -2.0 1.3 44 60 315 332 300 350 0.55
2 30 2.7e-15 5.3e-12 46.2 4.0 1 86 544 616 544 617 0.85
3 30 1e-14 2e-11 44.3 5.0 1 87 644 713 644 713 0.83
4 30 9.3e-16 1.8e-12 47.6 0.2 1 87 735 807 735 807 0.85
5 30 6.3e-16 1.2e-12 48.2 5.1 1 87 903 973 903 973 0.82
6 30 2.2e-15 4.3e-12 46.4 3.6 1 86 997 1068 997 1069 0.82
7 30 7.9e-12 1.6e-08 35.0 0.5 1 87 1104 1172 1104 1172 0.80
8 30 8.2e-11 1.6e-07 31.8 1.4 1 86 1214 1283 1214 1284 0.76
9 30 5e-17 1e-13 51.7 0.4 1 86 1311 1380 1311 1381 0.82
10 30 4.6e-12 9.1e-09 35.8 1.9 1 85 1402 1470 1402 1472 0.78
11 30 7.7e-15 1.5e-11 44.7 0.5 1 86 1499 1570 1499 1571 0.85
12 30 8.5e-13 1.7e-09 38.1 3.3 1 85 1644 1712 1644 1714 0.82
13 30 2.4e-12 4.8e-09 36.7 0.1 1 86 1737 1805 1737 1806 0.83
14 30 6.8e-13 1.3e-09 38.5 2.5 1 87 1949 2018 1949 2018 0.80
15 30 2e-11 3.9e-08 33.8 0.1 1 86 2107 2178 2107 2179 0.75
16 30 1.1e-05 0.022 15.4 0.0 1 60 2194 2245 2194 2265 0.78
17 30 2.9e-12 5.8e-09 36.4 0.1 1 87 2273 2343 2273 2343 0.79
18 30 6.8e-13 1.3e-09 38.5 0.4 1 87 2395 2465 2395 2465 0.82
19 30 2.7e-11 5.3e-08 33.3 0.1 1 86 2500 2574 2500 2575 0.81
20 30 2.9e-12 5.8e-09 36.4 0.0 1 87 2585 2659 2585 2659 0.79
21 30 1.1e-10 2.2e-07 31.4 0.0 1 61 2684 2739 2684 2758 0.70
22 30 0.00033 0.65 10.6 0.1 1 58 2785 2835 2785 2858 0.80
23 30 3.1e-11 6.1e-08 33.1 0.2 1 86 2875 2946 2875 2947 0.81
24 30 1.8 3.5e+03 -1.4 0.0 19 61 2952 3003 2949 3016 0.51
25 30 4.1e-16 8.1e-13 48.8 0.4 1 86 3059 3131 3059 3132 0.81
26 30 1e-12 2e-09 37.9 3.3 1 86 3195 3265 3195 3266 0.80
27 30 3.1e-12 6.1e-09 36.3 5.2 1 86 3358 3428 3358 3429 0.84
28 30 2.1e-11 4.1e-08 33.7 0.1 1 86 3509 3578 3509 3579 0.84
29 30 2.3e-10 4.5e-07 30.4 0.5 1 58 3604 3652 3604 3666 0.83
30 30 7e-10 1.4e-06 28.8 1.4 18 86 3670 3727 3660 3728 0.75

Sequence Information

Coding Sequence
ATGTCACAACACAACCCCAATCACGCCCACCCACACTACCACTACCCGTCCCATCCAACGCCGCTggctctgcagcagcagcagcaccagcagcagcatcagcaggagcagcacggCAGTAGTTGGTACTCACATGTTGCTTCCTACCCAGCATCCTCCCACTCCCACAACCTCTCTCACTCGGCCTTTGGCCCTGCGCCCCCTTGCAAGGccagcatcagcaacaacaacaccattATGGGTGCctacggaggaggaggagggggtGCTGGCTCGCATGGATATTTCGGCGCCGCTGGCGGTGGCCTCAATGTCAGCGGGGCGGGGGGTGGTGCTGGGTCGACCTACGGCCTTGGGGCCAACACGGTGGCATATGCTCACAACCAGCTGCTGCAGTaccagcatcatcatcagcaacaccagcagcagcagcacctgggTCTGAGCCAGCGATCCTATATGGGCCACGATGTCATGGCCGGGAGCTATCCCTATATCAAGAGCGAACCTTTGGAGGGCTTCCAGCAACCGGCCAATCCAATGGCCCCACCCCCGGCCCCagaaatgataataaaatCGGAACCCATTGACGAACTTGCCTACAAGTCAAACTACATTGACGATAATACGCCATTTGCAGACTTTAGCAAGTTTAGCGAATTCGGCGAGGACATGCTGAGTCCCAAAGTCGAGCTGACAGTTAAGAATGAGTCCTACGACAGGAATCCCAATAGCTTTTTACGCCGTAAGCAACAATCTGATCGGTCGACAACAGAGAGCCTGCCCGTCTGCCAGCGATGCAAGGAGGTGTTCTTCAAGAAGCAGACTTATCTTCGCCACGTCGCCGAGAGCAACTGCGGCATCCAGGAGTACGACTTCAAGTGCAGCATATGTCCTATGTCCTTCATGACCGCCGAGGAGTTACAACAGCATAAGCAACAGCATCGAGCGGACAGGTTCTTTTGTCACAAGTACTGTGGAAAGCACTTTGGTACCATCGCAGAGTGCGAGACTCACGAATACATGCAACACGAATACGATAACATAGTTTGCAACATGTGTTCGGGAACGTTCACCACGAGGGAACAACTGTATGCCCACTTGCCGCAGCACAAGTTCCAGCAGCGCTTCGACTGCCCCATATGCCGCTTGTGGTACCAAACGGCTCTTGAGCTGCACGAGCACCGCTTGGCAGCACCTTACTTTTGCGGGAAATACTATACGGGCGGACAGTCTCCGTCGTCCtctcaacagcaacagcaccagaaCCAGACGAACTACAAGCTGCAAGATTGTCATATGGCAACAATGGAAATGCCAAGCGCGCCACTCCTTAAGACGAACCCATCCGACTCGCCTGCTTTGCCCGCGACTGCGGCGCTTAATTCACTATTGCAGCAGCGTCAGGCAAATGCCGATGGAGCGGCCATTTTTGCCGCATCTACACTTAAGAACGAGGTCACTGTGAAACTGGAGCGCAGCTACAGTAACTCGACAAACGAATCGTCTTATAGCGCTCAGGAGAGCGGCTAcaacaatatttatagcaGCAGCGACACCTCAATCCACGGCTCCCTCGCCGGACCGCAGGCACACTCTTCGACGCTGGACGACTCTGAGGATGCGCTCTGCTGTGTGCCGCTGTGTGGGGTGCGGAAGAGCACAAGTCCCACCTTGCAGTTTTTCACGTTCCCAAAGGACGAGAAGTATCTCAACCAGTGGCTACATAACCTCAAGATGTTTCATATACCCGCCTCCAGCTACGTTAGCTTCCGCATCTGCAGCATGCACTTCCCGAAGCGATGTATAAACCGCTACTCCCTGTGCTACTGGGCGGTGCCAACGTTCAACCTAGGCCACGATGACGTAGCCAATCTCTACCAGAATCGTGAACTGACCAACACCTTTACCACTGGCGAAGTGGCGCGCTGCAGCATGCCTCATTGCACTAGCCAGCGAGGCGAGAGCAACCTTAAGTTTTACAACTTTCCAAAGGACATCAAAAGCTTGATTAAGTGGTGCCAGAATGCCCGGCTTCCGGTGCAGGCGAAGGAACCGCGACATTTCTGCAGCCGCCACTTCGAGGAGCGATGCATTGGCAAGTTTCGACTAAAACCGTGGGCGGTGCCCACACTGCACCTCGGCGCCCAGTATGGCAAGATACACGACAATCCGAAAAATCTTTACGTTGAAGAGAAACGCTGTTGCCTTAACTTTTGCCGTCGAAGTCGATCGTCTGATTTCAATATGTCGCTATATCGATTTCCTAGGGACGAAGTTCTTTTACGTCGCTGGTGCTACAATCTTCGCCTCGATCCGGGAGTATACCGCGGCAAGAATCACAAAATATGCAGCGCACACTTTATCAAGGAGGCATTGGGTCTCCGTAAACTATCTCCTGGTGCCGTGCCCACGCTTCATCTGGGCCACAATGACACATTCAACATCTACGAGAACGAACTATGGCCGCCGCCGACACCAACACCCTCTACTTGCCACTTGCAACAGCAGTCATCACTTCACTCACTACAACAGCAGATGCACAACAAATCCTACCAGCGCCGTTCGGTAGCATCCACTTCGTCGTCAGCGAGTTCAGCAGCCTCGCATTACGTGGACCCGGAGATGAGCGCCTCTTACCACCTAGCCATGTCCACCTCCGCCAGTGGCTCTGCGGCAATAAACGCCAGCGACAGCATGGATGTCTGCTGCGTGCCTAGTTGCGAAAGCAAGCGGCATAACAGCGATAACATCACATTCCACACGATACCGCGACGGCCCGAGCAGATGCGTAAATGGTGTCACAACCTTAAGATAGCTGAGGACAAGATGCACAAGGGCATGCGGATCTGTAGCCTCCACTTCGAGCCCTACTGCATCGGCGGATGTATGCGACCATTTGCGGTGCCCACGCTTCACTTGGGTCACGAAGACGAGGACATTCACCGCAATCCGGACGTGATCAAGAAGCTTAACATTCGGGAAACATGCTGCGTGGCTGTGTGCAAGCGGAATAGGGATAGGGACCATGCTAATCTGCATCGTTTCCCCAGCAACGTGGCCTTACTGAAAAAATGGTGCGCCAATTTGCAGCGCAGTGTTCCCGATGGCAGTAAACTTTTCAATGATGCCATCTGTGAGGTGCACTTTGAGGATCGCTGCCTACGCAACAAGAGGCTGGAGAAGTGGGCAGTGCCCACTCTGATCCTTGGACACGAGGACATCGCCTATCCGCTGCCCACACCAGAACAAGTGACCGAGTTCTATGCCCGGCCTACAGCTCCTAACAATGGCGAGGAACAGGGCGAGTGTTGCGTGGAAACCTGCAAGAGGAATCCCAGCGTGGACGATATTAAGCTATACCGCCCGCCGGAGGAGTCCACCGTGCTGGCAAAGTGGGCGCACAATCTGCAGACGGAGGCCAGTCAGCTGATAGGCATGAGGATCTGCAACCTTCACTTCGAGGCGCATTGCATCGGCAagaggatgcggatgtgggCAATACCAACTCTAAATCTAGCTGGCAACATCGAGAATCTCTACGAGAATCCAGAGCAATCGTTGTTGTACAGGCGGCGGACGACTCACTTAAAGACGAAGCTGCCATCGATCTCCACAAAGCCCACCTGGGTTCCCAGGTGCTGTCTTCCACACTGTCGCAAAGTCAGAGCCCTGCACAACGTCCAGCTTTATCGCTTCCCCAAGCTCAACCGCTCCACATTGGCTAAGTGGGCGCACAATCTCCAGGTTCCAATGGTGGGCAGTGCCCAGCGCAGGCTATGCTCAGCTCATTTCGAGCCGCATGTGCTTAGTAAAAAGTGCCCGGTGCCGCTGGCCGTGCCAACGCTTGACCTGAATTCACCACCAGGCTTGAAAATCTACCAGAATCCGGTGAAGCTAAAGGCCAGCAAACTGTGTCTGCAGCGGGTCTGCATCGTCGAGAGCTGCCGCAAGACGCGGGCGCAGGGCGTGCAGCTTTTCCGGCTACCGCACAGTCCCACACAGCTGAGGAAATGGATGCACAACATAAGGACGCGGCCACGTGCAGCTATGAGGGCTCAATACCGGGTCTGTTCCCGCCACTTTGAGACACACTCCTTTAATGGCCGAAGACTAAGTGCAGGTGCTATTCCGACTTTGGAACTGGGTCACGATGGTGACGATATCTATCCTAATGAAGCGCAGGCATTTGTGGATGAACATTGTGCTGTCGAAGGCTGCGAGGCGTCCAAGGAGCACCCGGAGGTGCGACTTTTCCGTTTCCccaccgacgacgacgacatgTTGTGGAAATGGTGCAACAACCTAAAAATGAATCCTGTGGACTGCATTGGGGTGCGTATCTGCAACAAGCACTTCGAGGTCGATTGTATCGGTCCCAAGCACCTGTACAAGTGGGCCATTCCCACCAAGGAGCTGGGCCACGACGACGCACAGATAGAGCTGATCCCGAATCCAAAATTAGAGGAGAGGTATGTGGATCCAGTATTCAAATGCATCGTTCCCACCTGCGGCAAGACGCGACGCTTTGATGAGGTGCAGATGAACAGCTTCCCGAAGGATCCGGATCTATTTCAGCGTTGGCGGCACAACCTGCGCTTAGAACACCTCAGTTTCCAAGAACGTGAGCGCTACAAGATTTGTAACGCTCACTTTGAAGAGATCTGTATTGGGAAAACACGGCTGAACATTGGATCCGTACCAACCTTGGAGCTGGGCCATGACGATGAGGATGATATTTTCCAAGTGAATCCGGCGGAACTGCAGAGCAACTTATTCGGGCGACAACGTCGACTGCTTGAGAGATTCGGCGAAGTGAGAGTCAAACAAGAACTGTCCGAGACGGAAGACAACGGAAATGCGGACTTGATGGCCACAGGCTCAAATCCCAAAAAGGTTAAGATCAAGAGACCTATTTCGGATCTAAAGTGTTGTGTGCGCAGCTGTGGAAGAAGTCGATTGGAACACGGGGCACGGCTGTTTCCCTTTCCAACGggcaagcagcagcacctgaAGTGGCGTCATAATCTGCACCTGGAACCAGAGGAGGTGGACCGATCAACGCGAGTTTGCAGCGCCCACTTCAATCGGCGTTGCATCGAGGGCAAACAGCTGAGGAGCTGGGCGATGCCCACTCAGCAGTTGGGACACCACGACCAGCCGATATATGAGAACCCGAAGAATATACCTGGATTCTTCACACCTACCTGTGCCCTAGGACACTGTCGCAAGAGGAGGAGTATTGACAACGATCTGCGTACCTATCGATATCCAAGGAGCGAAGATCTGCTGGAAAAATGGCGAGCTAATTTACGTCTGGCTCCGGATCAGTGTCGTGGCCGGATTTGTGCGAATCACTTCGAGCCGCAGGTGCGGGGCAAGCTAAAGTTAAAGACGGGAGCCGTGCCCACATTAATACTGGGACACGATGAGGGATTAGTCTATGACAATGAAGCTATAAAGGCGGGTATGGTTGACGAAGAGGAAGGCATCACCACAGAATTCCAGCGactgaaacaaaaaaatgagatgttcgatgaggaggaggagggtgaAGAGAATGATGGCGAAAAGCAGCACCCAGATGAACAGGACGAGGCAGACGAAGATGAAAAAGACGACCACTACTTTGATCCTCTTGAACTGGTAGAGACTTTTGCTGAACATCGCAGCGATGATGATGCccaagatgatgatgaagaagaAGAGGGTCGAGTTGACTCCCCCTCCGCTTACGAGGTCAAGGAGGAGATAGAACAGCTTCCAAGCACCCCGCCTTCACCTTTACCCCGACGCCACTACGCTCCGCGTCGAGACAAGCCGGCTAATAATGTGACTCCCATATGTTGTCTGAAACACTGCAGGAAGGAACGCACTGCCTTCCACCTTCTGAGCACTTTCGGCTTCCCAAAGGATCGCCAATTGCTGCTAAAGTGGTGTGTCAACCTGCATTTAAACCCGGATGACTGCGTTGGGCGGGTTTGCATCGAGCACTTCCAGCCGGAGGTGCTCGGAACCCGTAAGCTTAAGCAGAATGCGGTGCCCACTGTTAATGTGGGACATGAGGAGCCGCTTAGGTACTCGTGTCATGGGGTGGACCAGAATCTCGAGGAGCAAGACCCACAGCCCCAGCATTCGGTTTTTCGGCTTTGGAGCCTGAAACACTGCCGCAAAAGGAAGCTAACGGAGCCGCCAGATATTCCCCTAGCCAAGAGGAAAGCGCTGGGGATGCCGATGATGAAGCGGGAATgggagatggagatgcagAAAGAGCGGGAGTTGAGGAAAATGACTCAAACGGATAGTGAGTCAAAGAGATGCTGTGTTAGCAGTTGCGGGAACGAAGAAGCAAGCCAATTGCTGCCTTTGCCCGTGGAGAAATCCTTGCTAAGAAAGTGGAGTCACAACTTAAAGCTGTCCACTGAGACTGACACTCTTtctttaagccaaaaaagagTTTGCTTGGCCCATTTTGAGTCGCAGCTGTTGGAGAATGGAAAACTCTCGAAGGAATCAGAGGCAGTTCCCACTTTAAAACTTGGCCACCGCAGTTGGAACCTATACAGGAGCAATGGGATCTGCCTGGTGCCTAACTGCACACACAACACCATGGGTCGCTTAAGCTTCATCGATCTGCCggataatacaataattagaGAAGCTTTCTTTTCCTACCTCAACCTACCTAATCCTCCCAAGGAACAGGCAAGATTATGCGGTATCCATTTTATGGAGGTATACAAGAACTTAAGTCTTCCCAAGGTTTTGCACTCCCAAGATATAATGCAGCTGCAAAGTGTTGAAGACGAATTGCAATGCGCAGTGCCTGGCTGCTTCGAAAATACTGGTCAGAGTTTTCAGCTACTCCAGATTCCAGATAACAAAGAGGTGCTGTCCAAGTGGCTGCACAACACCAAGATCCCCTACGATCCTTCTAGGCACCGAAGCTATCGCATCTGCAGACTACACTTTGAAGCAGAGTACTTAGAAGACGCTTCGTCGCTAAACTGGGCTATACCAACACTCCACCTAAACCAAGACGATGAGATCTACTTAAATACTAAGCCCTTGCAAGAGGAAGAGGTCTCTATGTTGACTCCATTGCGGATAAAGACGGATCTGGCCTTGTTGGGCAGtccaagtgcaagtgcaagcCCCAGTCCTCGGGGCAGGATCCGTATATGTTGCATTCCCACATGTGGACAGATTGGAAGCAATCAAGTAAGGCTCTATCGATTTCCCACCGAGGAGCAGGCGTTACTTCGGTGGCTGGTAAATACGCAACAGCAGCCAAGACTTGTAGATCCCATGGACTTGTATGTCTGCCAGTCGCATTTTGAGCCTGAGGCCATTTGTAAAAAACAGCTCCGCAGCTGGGCCGAGCCCACCTTGAACCTGGGACACGACGGATACGTAATTCCGAATGCCAAACACAATGGAAACATTTCTGACAGCCAGGATACCGAGCAAGCAATGAAGTTCATTCGCGAACGCTTCTGCTCCGTCATTTCATGTTTTCAATCAAAAGGACAGGAAGAGGGAGGAGTGAGGTTGTACGACTATCCCGAAGATATGGCTACTACTCGAAAGTGGGCAGCCGCATGCAGACATCGCTCCATGCAGGCCAGGAGCCATGGGTTCAAGGTGTGTCAGTTGCACTTCTCTATGGAATGCTTTGACCCAGTTACTGGAAATTTGATTGAAGGCTCAGTGCCCACCCTGGAGTTGAGCAGGGATGATATGGAGAGGCAGTGTCTTGTAACTGTATGCGTAAGGAATGATCCTAATGGAGCCCGCCTCCGATACTACAAGATACCAAAAACTACTGCTCAATTGGAAGCGTGGAGCAACAACCTTAAGATCCACCCAACGGATCTAATGCAAGGGGAACAGCAGTACATCTGCGAGAAGCACTTTGAGGCGTTCTGCTTTGGAGCCAACAAGGGACTGCGTTCTGGTGCTCTCCCAACTCTCTTTCTGGGCCATGATGAGGAGGTCGAGATGCTTCCAAATCCGGAAAGTCTCTTCTCCCAGATCAAAACGGACAAGTGCTGCGTACCAGGTTGCGGACGTATCTGGCAGACTGGTGACCGTAAGTTCCGTGGCTTTCCGAAATTGTTGACCATGGCTAAAAAATGGAGGCATAACCTTCGTTTAGTTGCGACTATGGAGCAACTGGGCAAGCTCAAGGTTTGCAGTGCTCACTTTGAGGCCACCTCCCCCCACGTCATTACAAATGGATTAAGTCCTAGTACTTCGATACCCACCTTGGAATTGGGTCATTCTTCTCCGGATATTTACCAAGCGGACACGAGCTTAAAGTTCCAAAAGCGGTCCGTAATGGTGCGTTATTGCTGTTATCCCAAGTGCGAGGAAATCTGTCTGCCCAAGAATCTGTCTTATGGGCTTCCTGAAGAGGAGCATCTGCGAAATGCCTGGCTAAGCCACATGAACATAGAAGATCCGAAAGATGGAGCAGACGCACAACTATGTCCGCTGCACTATGTCATCCTCTACCAGCACAGTGCCACAAACTATCCCGAGTATCACGCTTCAAGCCGATTGCTTCTTGATGATAATTACAAGGATGCGCGGAACAACAGACGCGTGAAGATTGTGAGCTGTGCAATCAAGGGCTGTGACATGGTTAAGCCCCGGGATGGGGTACTACTGCACGGGATGCCGCAAAGCCAGGACATCCTGCAGATGTGGATAGATAATGGTCAGTTTGAGTTTTTAGAGCAGCAGCGGTACATGCTCAAGGTGTGCCACAATCATTTTGAGTCATGCTGCTTCTTCGACGATAGACGCCTGCTCTCATGGAGCGTGCCGACCTTGCGCCTACCTGGCAAAACATTTCACCAAAATCCTACGGCCGAACAGTGGCAGAACATGATCAACAAGCCTGCAGCAGAAAAAATCAATGCAGATGAGAAAGAGGAGCCAGATCTTGATACGGATGTGGATAAGAGTGAGCCCATTGTAAAGACGGAGCATTTTGAATCCGAAGATGAAAATATAAACTCGGAGATGCAGGCCCTAGAGGTCCTCCTAGAAGTTGGCCACGTGGAACGAATGGAGAGCTATGAGAACGTGGATAAATCACCGGTAATCTATACCGAAAATTCACCCTTCCGATCGTCACCCATACGTTGCCAATACAATGCTAACCACTGTGCCGTAGAGGGATGCCAGGTGACCGTCGAGGATGTGGACGGCACAATAAAGCTGCACAAATTCCCCGCATCGCAGGAAGCCGCACAGAAGTGGATGCACAACACCCAAGTTGACATGGACGAAAAGTTCTGGTGGCGCTACCGCATATGCAGTTACCACTTCGATCAAGAGTGCTTTCAGAGCGCAAGAATTCGAAAAGGCGCGATGCCCACGCTCTTGTTAGGACCTCGGCGACCGGATAAGGTGTACGATAATGAATTCGCACAACCAGAGACGGAGGAGCCTTTTCTAGAGCCCCCTGGAATTCAGCTGGAGGAAAGTATGACTGCGGCGTCTAAAGTTCGTAAGGAAGTTTCCAGTTTATGCCTTCCGCCAAGGGCGCCGCCTCGAAAGTCGAGCAAGTTTTGCCAGATTTATTCTTGTACGAACCACCTGACAACTGAGAACATGACACTTCACAAGTTTCCTCACTCGGAGGACATGTGCCTCAAGTGGCAGCACAACACTCAAGTACCATTCGATCCCTACTACCGCTGGCGTTACCGCATCTGCAGTGCGCATTTTCATCCGATGTGTTTGGTCAACATGCGTCTAGTTCACGGAAGCGTTCCCACTTTGAAGCTAGGTCCCAAGGCTCCATCCGAACTGTTTGACAACGATTTTGATGCCATTAACCAAAGGTTGGACAAAAGGTTGACGGAGTCGAATGCCAATGTGTATATCAAACATGAAAGGAGAGAGGAGGATGAAGACTCGATGATGTTGCCGGAGCCCGAGCTCCAGTTTCACGAGGATCAAGATGATAAGATATCAGCATGGAACAGCAAACTGCAATTGACACCTGTAAAGCTGGAGAAAAGTATCTATAGCCAGATGAAGTCCGGCTATGATAAGTGTTCGCTGGCTCACTGCCAACGCCAAAGGTTCCAGCATGGCGTCCACATTTATAAGTTTCCCAGATCGAGGCGCCAGCAGGAGCGTTGGATGCACAACCTCTGTATCCGCTATGATGAGCGTACTCCGTGGAAATTTATGATATGTAGCGTTCATTTCGAACCGCACTGCATCATCTTAAGGAAGTTGCAACCTTGGGCGGTGCCCACACTGGAGCTGGGCGACAATGTGCCAAAGAAGATCTATTCCAACGAACAGTgtgaggaggaggtggtgacTGATCGCAGTGACCTGGAGAGCGACGCCGAGGAAGAAGACGGCTTAcaggaggatgatgatgatgaagacgAGGACGATCTGAAGCCGGATAATGTTGGCATAAAAAGGCGAAGACGTTTAAAGATAGATTCCGCTTGCCCTCCTACCCCGACTGCACCCTGGAAAGTCAAGCAATGCTGCCTTCCCTATTGCCGTGCCTTCCGAGGCGATGGCATCAAGCTATTTCGGCTTCCGAGCAACCGAAATTCCATTAGCAACTGGGAACTGGCCACGGGAATGGTATTCAAGGAGTCGCAACGGAATACTCGTTTGATCTGTAGCCGTCACTTTGAGCCAGAGCTGATTGGAGTCAGGCGTCTAATGCGTAATGCCATTCCCACTAGGCACTTAAACCCCCAAGGAGCTAACCAGATCCGTTctaaaaaagagaagaaaccTCAAGCTGCTGTTATTCCCATCTGCTGCATGGCGGACTGCCACTACAATGGAAATGTGAAGCTGCACAAGTTTCCAAATGATCCCACACTGCTTAGACAGTGGTGCCAGGCTCTTCGGCTCACCGATACGCAGCGGTATTTGGGCAAGCACATTTGTTCGATGCACCTGCCGATGAACAAAACATTGACCTGTGTTATCTGCGGTGGAGACAACGTAGAGTTGCCGATGCTTGAGTTTCCGGAGAACCGCAACCAGCGCGCCAAATGGTGTTACAATCTCAAGATTGAGACAATACCAAAGTGGGACCACTCAAAGCAAATTTGCTGCCGGCATTTCGAGTCCCATTGCTTTGATAAGCCGGGTGAACTACGTCCAGGAGCGGCTCCCACGCTCTATCTGAATCACGATGACTCAAACATATTCTTCAGCGACTATGCCACTGGCCTTCCGTCCTCGCCAATACGAAGTCGAATTAAAGACGAGCCGCTGGAATCGGAGTCTGACGAGATGTTGCCGGAGTAG
Protein Sequence
MSQHNPNHAHPHYHYPSHPTPLALQQQQHQQQHQQEQHGSSWYSHVASYPASSHSHNLSHSAFGPAPPCKASISNNNTIMGAYGGGGGGAGSHGYFGAAGGGLNVSGAGGGAGSTYGLGANTVAYAHNQLLQYQHHHQQHQQQQHLGLSQRSYMGHDVMAGSYPYIKSEPLEGFQQPANPMAPPPAPEMIIKSEPIDELAYKSNYIDDNTPFADFSKFSEFGEDMLSPKVELTVKNESYDRNPNSFLRRKQQSDRSTTESLPVCQRCKEVFFKKQTYLRHVAESNCGIQEYDFKCSICPMSFMTAEELQQHKQQHRADRFFCHKYCGKHFGTIAECETHEYMQHEYDNIVCNMCSGTFTTREQLYAHLPQHKFQQRFDCPICRLWYQTALELHEHRLAAPYFCGKYYTGGQSPSSSQQQQHQNQTNYKLQDCHMATMEMPSAPLLKTNPSDSPALPATAALNSLLQQRQANADGAAIFAASTLKNEVTVKLERSYSNSTNESSYSAQESGYNNIYSSSDTSIHGSLAGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLNQWLHNLKMFHIPASSYVSFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPHCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPGVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPTPTPSTCHLQQQSSLHSLQQQMHNKSYQRRSVASTSSSASSAASHYVDPEMSASYHLAMSTSASGSAAINASDSMDVCCVPSCESKRHNSDNITFHTIPRRPEQMRKWCHNLKIAEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLHLGHEDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVALLKKWCANLQRSVPDGSKLFNDAICEVHFEDRCLRNKRLEKWAVPTLILGHEDIAYPLPTPEQVTEFYARPTAPNNGEEQGECCVETCKRNPSVDDIKLYRPPEESTVLAKWAHNLQTEASQLIGMRICNLHFEAHCIGKRMRMWAIPTLNLAGNIENLYENPEQSLLYRRRTTHLKTKLPSISTKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPMVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTLDLNSPPGLKIYQNPVKLKASKLCLQRVCIVESCRKTRAQGVQLFRLPHSPTQLRKWMHNIRTRPRAAMRAQYRVCSRHFETHSFNGRRLSAGAIPTLELGHDGDDIYPNEAQAFVDEHCAVEGCEASKEHPEVRLFRFPTDDDDMLWKWCNNLKMNPVDCIGVRICNKHFEVDCIGPKHLYKWAIPTKELGHDDAQIELIPNPKLEERYVDPVFKCIVPTCGKTRRFDEVQMNSFPKDPDLFQRWRHNLRLEHLSFQERERYKICNAHFEEICIGKTRLNIGSVPTLELGHDDEDDIFQVNPAELQSNLFGRQRRLLERFGEVRVKQELSETEDNGNADLMATGSNPKKVKIKRPISDLKCCVRSCGRSRLEHGARLFPFPTGKQQHLKWRHNLHLEPEEVDRSTRVCSAHFNRRCIEGKQLRSWAMPTQQLGHHDQPIYENPKNIPGFFTPTCALGHCRKRRSIDNDLRTYRYPRSEDLLEKWRANLRLAPDQCRGRICANHFEPQVRGKLKLKTGAVPTLILGHDEGLVYDNEAIKAGMVDEEEGITTEFQRLKQKNEMFDEEEEGEENDGEKQHPDEQDEADEDEKDDHYFDPLELVETFAEHRSDDDAQDDDEEEEGRVDSPSAYEVKEEIEQLPSTPPSPLPRRHYAPRRDKPANNVTPICCLKHCRKERTAFHLLSTFGFPKDRQLLLKWCVNLHLNPDDCVGRVCIEHFQPEVLGTRKLKQNAVPTVNVGHEEPLRYSCHGVDQNLEEQDPQPQHSVFRLWSLKHCRKRKLTEPPDIPLAKRKALGMPMMKREWEMEMQKERELRKMTQTDSESKRCCVSSCGNEEASQLLPLPVEKSLLRKWSHNLKLSTETDTLSLSQKRVCLAHFESQLLENGKLSKESEAVPTLKLGHRSWNLYRSNGICLVPNCTHNTMGRLSFIDLPDNTIIREAFFSYLNLPNPPKEQARLCGIHFMEVYKNLSLPKVLHSQDIMQLQSVEDELQCAVPGCFENTGQSFQLLQIPDNKEVLSKWLHNTKIPYDPSRHRSYRICRLHFEAEYLEDASSLNWAIPTLHLNQDDEIYLNTKPLQEEEVSMLTPLRIKTDLALLGSPSASASPSPRGRIRICCIPTCGQIGSNQVRLYRFPTEEQALLRWLVNTQQQPRLVDPMDLYVCQSHFEPEAICKKQLRSWAEPTLNLGHDGYVIPNAKHNGNISDSQDTEQAMKFIRERFCSVISCFQSKGQEEGGVRLYDYPEDMATTRKWAAACRHRSMQARSHGFKVCQLHFSMECFDPVTGNLIEGSVPTLELSRDDMERQCLVTVCVRNDPNGARLRYYKIPKTTAQLEAWSNNLKIHPTDLMQGEQQYICEKHFEAFCFGANKGLRSGALPTLFLGHDEEVEMLPNPESLFSQIKTDKCCVPGCGRIWQTGDRKFRGFPKLLTMAKKWRHNLRLVATMEQLGKLKVCSAHFEATSPHVITNGLSPSTSIPTLELGHSSPDIYQADTSLKFQKRSVMVRYCCYPKCEEICLPKNLSYGLPEEEHLRNAWLSHMNIEDPKDGADAQLCPLHYVILYQHSATNYPEYHASSRLLLDDNYKDARNNRRVKIVSCAIKGCDMVKPRDGVLLHGMPQSQDILQMWIDNGQFEFLEQQRYMLKVCHNHFESCCFFDDRRLLSWSVPTLRLPGKTFHQNPTAEQWQNMINKPAAEKINADEKEEPDLDTDVDKSEPIVKTEHFESEDENINSEMQALEVLLEVGHVERMESYENVDKSPVIYTENSPFRSSPIRCQYNANHCAVEGCQVTVEDVDGTIKLHKFPASQEAAQKWMHNTQVDMDEKFWWRYRICSYHFDQECFQSARIRKGAMPTLLLGPRRPDKVYDNEFAQPETEEPFLEPPGIQLEESMTAASKVRKEVSSLCLPPRAPPRKSSKFCQIYSCTNHLTTENMTLHKFPHSEDMCLKWQHNTQVPFDPYYRWRYRICSAHFHPMCLVNMRLVHGSVPTLKLGPKAPSELFDNDFDAINQRLDKRLTESNANVYIKHERREEDEDSMMLPEPELQFHEDQDDKISAWNSKLQLTPVKLEKSIYSQMKSGYDKCSLAHCQRQRFQHGVHIYKFPRSRRQQERWMHNLCIRYDERTPWKFMICSVHFEPHCIILRKLQPWAVPTLELGDNVPKKIYSNEQCEEEVVTDRSDLESDAEEEDGLQEDDDDEDEDDLKPDNVGIKRRRRLKIDSACPPTPTAPWKVKQCCLPYCRAFRGDGIKLFRLPSNRNSISNWELATGMVFKESQRNTRLICSRHFEPELIGVRRLMRNAIPTRHLNPQGANQIRSKKEKKPQAAVIPICCMADCHYNGNVKLHKFPNDPTLLRQWCQALRLTDTQRYLGKHICSMHLPMNKTLTCVICGGDNVELPMLEFPENRNQRAKWCYNLKIETIPKWDHSKQICCRHFESHCFDKPGELRPGAAPTLYLNHDDSNIFFSDYATGLPSSPIRSRIKDEPLESESDEMLPE

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00525910;
90% Identity
-
80% Identity
-