Basic Information

Gene Symbol
-
Assembly
GCA_008042755.1
Location
VNKA01002195.1:113121-126616[-]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 29 4.9 1.3e+04 -3.0 2.0 49 60 168 181 150 199 0.56
2 29 2.3e-15 6.1e-12 46.2 4.0 1 86 396 468 396 469 0.85
3 29 8.6e-15 2.3e-11 44.4 5.0 1 87 496 565 496 565 0.83
4 29 7.8e-16 2.1e-12 47.7 0.2 1 87 587 659 587 659 0.85
5 29 5.5e-16 1.5e-12 48.2 4.8 1 86 757 826 757 827 0.82
6 29 2.3e-15 6.1e-12 46.2 3.8 1 86 851 922 851 923 0.82
7 29 9.8e-13 2.6e-09 37.8 1.5 1 87 958 1026 958 1026 0.81
8 29 6.9e-11 1.9e-07 31.8 1.4 1 86 1068 1137 1068 1138 0.76
9 29 4.2e-17 1.1e-13 51.7 0.4 1 86 1165 1234 1165 1235 0.82
10 29 1.1e-12 3e-09 37.6 1.4 1 85 1256 1324 1256 1326 0.79
11 29 1.1e-14 3e-11 44.0 0.5 1 86 1353 1424 1353 1425 0.85
12 29 1.5e-12 4.1e-09 37.2 3.7 1 85 1500 1568 1500 1570 0.82
13 29 1.5e-12 4e-09 37.2 0.1 1 86 1593 1661 1593 1662 0.83
14 29 3.8e-13 1e-09 39.1 2.2 1 87 1809 1878 1809 1878 0.80
15 29 2.7e-13 7.4e-10 39.5 0.1 1 87 1973 2047 1973 2047 0.82
16 29 0.00014 0.38 11.6 1.1 1 61 2062 2114 2062 2129 0.73
17 29 2.8e-14 7.5e-11 42.7 0.0 1 86 2141 2211 2141 2212 0.76
18 29 2.6e-13 7e-10 39.6 0.1 1 87 2264 2334 2264 2334 0.81
19 29 1.7e-12 4.7e-09 37.0 0.1 1 86 2369 2443 2369 2444 0.80
20 29 5.3e-13 1.4e-09 38.6 0.0 1 86 2454 2527 2454 2528 0.80
21 29 2.3e-10 6.2e-07 30.2 0.0 1 61 2553 2608 2553 2625 0.77
22 29 2.8e-05 0.075 13.9 0.1 1 58 2650 2700 2650 2717 0.84
23 29 2.1e-11 5.6e-08 33.5 0.8 1 87 2740 2812 2740 2812 0.82
24 29 2.7e-16 7.3e-13 49.2 0.2 1 86 2923 2995 2923 2996 0.81
25 29 2.7e-12 7.4e-09 36.3 3.5 1 86 3059 3129 3059 3130 0.80
26 29 3.2e-14 8.7e-11 42.5 4.2 1 86 3222 3292 3222 3293 0.84
27 29 3.6e-12 9.8e-09 35.9 0.2 1 86 3374 3443 3374 3444 0.85
28 29 4e-10 1.1e-06 29.4 0.4 1 58 3470 3518 3470 3532 0.81
29 29 4e-10 1.1e-06 29.4 1.4 18 87 3536 3594 3524 3594 0.77

Sequence Information

Coding Sequence
ATGTCACAACACAACCCCAATCACGCCCACCACCCACACTACCACTACCCTGAACCCTTGGAAGGGTTCCAGCAGCCGCCAAATCCAATGGCCCCACCCCCGGCCCCagaaatgataataaaatCGGAACCCATTGACGACCTGGCCTACAAGTCAAACTACATAGACGACAATACGCCATTTGCGGACTTTAGCAAGTTTAGCGAATTCAGCGAGGACATGCTGAGTCCCAAAGTCGAGCTGACAGTCAAGGATGAGTCCTTCGTTAGGAACCCCAATAGCTTTTTACGCCGTAAGCAACAATCGGATCTGACGACAGCAGAGAGCCTGCCCGTCTGCCAGCGATGCAAAGAGGTGTTCTTCAAGAAGCAGACTTACCTGCGGCACGTCGCCGAGAGCAACTGCGGCATCCAGGAGTACGACTTCAAGTGCACCATATGCCCCATGTCCTTTATGACCGCCGAGGAGCTACACCAGCATAAGCAACAGCATCGAGCGGACAGATTCTTCTGCCACAAGTACTGCGGAAAGCACTTTGGCACGATCACAGAGTGCGAGGCGCATGAGTACATGCAACATGAATACGAAAACATTGTGTGCAACATGTGCTCGGGATCTTTCGCCACGCGGGAACAACTTTATGCTCATTTGCCGCAGCACAAGTTCCAGCAGCGCTTTGACTGCCCCGTATGCCGCCTATGGTACCAAACCGCTGTGGAGCTGCATGAGCACCGCCTGGCTGCACCCTACTTCTGCGGTAAATACTACACGGGCGGACAGTCCCCGTCCCCGTCCTCGTCCtcccaacagcaacagcaccagAGTCAGACGAACTACAAGCTGCAGGACTGTCATATGGCCACCATGGAAATGCCAAACGCACCGCTCCTTAAGGCAAACTCATCCAACTCGCCGGCCTTGCCAGCGACAGCAGCGCTTAACTCCCTGTTGCAACAGCGCCAGGCCAATGCCGATGGAGCAGCTATTTTTGCCGCATCTTCGCTGAAGAACGAGGTCGCTGTGAAGCTGGAGCGCAGCTACAGTAACTCGACCAACGAATCGTCTTATAGCGTCCAGGAGAGCGGCTACAATAATGTGTATGGCAGCAGTGACAGCTCAGTTCACGGTGCCATTGCCGGGCCACAGGCACACTCTTCGACGCTGGACGATTCCGAGGATGCGCTTTGCTGTGTGCCGCTGTGCGGTGTGCGGAAGAGTACGAGTCCCACCTTGCAGTTTTTCACGTTCCCAAAGGACGAAAAATATCTCAACCAGTGGCTGCATAACCTCAAGATGTTCCACATACCCGCTTCCAGCTACGTTAGCTTCCGGATCTGCAGTATGCACTTCCCCAAGCGATGCATCAACCGCTATTCGCTGTGCTACTGGGCGGTGCCGACATTTAACCTCGGCCACGATGACGTAGCCAATCTCTACCAGAATCGGGAGCTGACCAACACGTTTACCACTGGCGAAGTGGCGCGCTGCAGCATGCCACATTGTACCAGCCAGCGGGGTGAGAGCAACCTCAAGTTTTACAATTTCCCAAAGGATATCAAAAGCCTGATTAAGTGGTGCCAAAACGCCCGACTTCCGGTGCAGGCAAAGGAGCCGCGACATTTCTGTAGCCGCCACTTTGAGGAGCGGTGCATTGGCAAGTTTCGACTGAAACCTTGGGCAGTGCCCACCTTACACCTGGGCGCCCAGTACGGCAAGATCCACGACAATCCAAAGAATCTATATGTGGAAGAGAAACGCTGTTGCCTCAACTTTTGTCGCCGGAGCCGCTCTTCCGACTTCAATATGTCGCTATATCGATTTCCTAGAGACGAAGTCCTGCTACGGCGCTGGTGCTACAATCTTCGCCTCGATCCGGGAGTGTATCGCGGCAAGAATCACAAAATATGCAGCGCTCACTTTATAAAAGAGGCGTTGGGTCTTCGGAAACTGTCGCCTGGTGCCGTGCCCACACTTCATCTGGGCCACAATGATACCTTCAACATCTACGAGAACGAACTGTGGCCACCGCCAACTCCGACACCCTCCTCTTGTCATctccaacagcaacagcagtcaTCCCTTCATTCGCTTCAACAGCAGATGCACAGCAAATCCTACCAGCGCCGCTCAGCGGCATCTACATCGTCATCGGCAAGCTCGGCAGCCTCGCATTATGTGGATCCTGAGATGAGCGCCTCTTAccatctagccatgtccgccTCCGCCGGTGGCTCTGCGACGATAAACGCCAGCGACAGCATGGATGTCTGTTGCGTGCCCAGTTGCGAGAGCAAGCGACACAATAGCGAGAACATTACATTCCACACGATTCCGCGACGGCCCGAGCAAATGCGTAAATGGTGTCACAATCTTAAGATTGCCGAGGACAAGATGCACAAGGGCATGCGAATCTGCAGCCTTCACTTCGAGCCCTACTGCATCGGCGGCTGTATGCGTCCGTTTGCTGTGCCCACTCTTCAGTTGGGCCACGACGATGAGGATATCCACCGCAATCCGGACGTGATCAAGAAGCTGAACATCCGGGAGACATGCTGTGTGGCTGTGTGCAAGCGGAATAGGGACAGGGATCATGCGAATCTGCATCGTTTCCCCAGCAATGTGGCTTTGCTGAAAAAGTGGTGCACCAATTTGCAGCGCAGCGTTCCCGATGGCAGTAAACTCTTCAATGATGCCATCTGTGAGGTGCACTTTGAGGATCGTTGCCTGCGCAACAAGAGGCTCGAGAAGTGGGCAGTGCCTACTCTGATCCTGGGACACGATGACATTGCCTATCCGCTGCCCACGCCAGAGCAAGTAACCGAGTTCTATGCCCGGCCCACGGCTCCCAACAATGGTGAGGAACAGGGCGAGTGCTGTGTGGAGACGTGCAAGAGGAATCCGAGCGTGGACGATATAAAGCTATACCGGCCACCGGAGGAGGCCGCCGTGCTGGCCAAGTGGGCGCACAACCTGCAAACGGAGGCCAACCAACTGACAAGCATGAGGATCTGCAATCTACACTTTGAGGCGCATTGCATCGGCAAGAGGATGCGACATTGGGCCATACCGACTTTGAATCTAGCCGGCAACATTGAGAATCTTTATGAGAATCCAGAGCAATCGCTGCTGTACAGGCGTCGCACTACTCACATGAAGGCGAAGCTGACGCAAGCCTCCGTCAAACCCACCTGGGTGCCCAGGTGCTGTCTTCCACACTGTCGCAAAGTCAGAGCCCTGCACAATGTCCAGCTGTATCGCTTCCCCAAGCTCAATCGCTCCACATTGGCCAAGTGGGCGCATAATCTCCAGGTTCCAATGGTGGGCAGTGCCCAGCGCAGGCTATGCTCGGCCCACTTCGAGCCACATGTGCTCAGCAAAAAGTGCCCGGTGCCGCTGGCGGTGCCTACGCTCGACCTAAATTCACCACCCGGCTTGAAAATCTACCAGAATCCGGCCAAGCTAAAGGCCAGCAAACTGTGCCTGCAGCGGGTGTGCATTGTCGAAAGCTGCCGCAAGACGCGGGCGCAGGGCGTTCAGCTTTTCCGGCTGCCGCACAGCCCCACACAGCTGCGAAAGTGGATGCACAACATCAGGACGCGGCCACGAGCAGCTATGCGGGCTCAGTACCGGGTGTGTTCCCGTCACTTCGAGACGCACTCCTTCAATGGCCGAAGACTGAGCGCAGGTGCCATTCCGACTCTAGAGCTGGGCCACGATGGCGACGATATCTATCCGAATGAAGCGCAGGCATTTGTGGACGAGCATTGTGCTGTCGAAGGCTGTGAGGCATCCAAGGAGCAGCCGGAGGTGCGATTGTTCCGCTTCCccaccgacgacgacgataTGTTGTGGAAGTGGTGCAACAACCTCAAAATGAATCCTGTGGACTGCATTGGGGTACGCATCTGCAACAAGCACTTCGAGGCCGATTGCATCGGTCCCAAGCATCTGTACAAGTGGGCTATTCCCACACAGGAGCTGGGCCACGACGATGCGCAGATCGAGCTGATACCGAATCCCAAGCCAGAGGATAGGTATGTGGATCCCGTCTTCAAGTGCATCGTTCCCACCTGCGGCAAGACACGACGGTTTGACGAAGTGCAAATGAACAGCTTCCCCAAGGATCCGGATCTATTCCAGCGATGGCGGCACAATCTGCGCATTGATCATCTCAGTTTCCAGGAGCGTGAGCGCTACAAGATCTGCAACGCACACTTCGAGGAGATTTGTATTGGAAAGACACGGCTAAACATTGGATCCGTTCCAACCTTGGAGCTTGGTCATGACGATGAGGAGGATATTTTCAAAGTTAATCCAGCGGAGCTGCAGAGCAATTTATTCGGGCGGCAGCGTCGACTGCTGCTCGAGGGATCCGGCGAACAGAGTGTCGTCAAGCAAGAGCTATCCGAGACGGAGGACAACAACAAGGCGGATGTGACGGCCACTGGCTCCAATTCCAAGCAGATCAAGATCAAGAGATCTTCTTTGGATCTTAAGTGTTGTGTGCACAGTTGTGGAAGAAGTCGCTTGGAACACGGAGCCCGGCTGTTTCCCTTTCCCACGggcaagcagcagcacctaAAGTGGCGTCACAATCTGCACCTGGAACCGGAGGAGGTGGACCGTTCGACGCGCGTTTGCAGCGCTCACTTTAATCGACGCTGCATCGAGGGCAAACAACTGAGGAGCTGGGCCATGCCCACGCAACAGTTGGGACACAACGACCAGCCGATCTACGAGAACCCAAAGAACATACCGGGATTCTTCACACCTACCTGTGCCCTGGGACACTGTCGCAAGCGGAGGAGTATTGACAACGATCTGCGCACCTATCGGTATCCCAGGAGCGAAGATCTTCTAGAGAAATGGCGAGCTAATCTGCGACTGGCTCCAGATCAGTGTCGTGGTCGAATCTGTGCAAATCATTTCGAACCGCAGGTTCGGGGCAAGCTAAAGCTGAAGACGGGAGCCGTTCCTACACTACAACTGGGACACGATGAGGAATTAATCTATGACAATGAAGCTATTAAGGCAGGCATGACCGAAGAAGAGGAGGCCATAACCACAGACTTTCCGCGattgaaaccaaaaaaagagttgttcgaagaggaggaggaggagtgcgAAGGGAACGATGGCGAGCAGCAGCACGCAGATGACCTGGACGAGAATGCAGATGAAGAAGACAAAGATGATCAGTACTTTGATCCTCTTGAGCTGGTTGAGACTTTTGCCGAACATCGCAGTGATGACGAAGCCCAGGACTATGAGGATGAAGAAGACGAGGGTCGAGTTGAGGACTCCCCCTCCGGTTATGATGTCAAGGAGGAGATAGAACCGCCGCCAAGCTCACCACCCTCTCCGCTTCGACGACGGCACCATGTTCCGCGCCGAGACAAGCCTGCTAACAATGTGACGCCCATTTGCTGCCTGAAGCACTGCAGAAAGGAACGCACTGCCTTCCATCTGTTGAGCACTTTCGGCTTCCCAAAGGAtcgccagctgctgctgaaaTGGTGTGTCAATCTGCATTTAAACCCGGACGACTGCATCGGTAGGGTTTGCATCGAGCACTTTCAGCCGGAGGTACTCGGCACCCGCAAGCTCAAGCAGAACGCAGTGCCCACTCTTAATGTGGGACATGATGAACCGCTTAGGTATTCGTGCCATGGCGTGGACCAGAATCTCGAGGAGCGGGAGCCCCAGCCACAGCATTCGGTTTTTCGGCTTTGGAGCCTGAAACACTGTCGCAAAAGGAAGCCAACGGAGCCGCCGGATATTCCCCCAACCAAGAGGAGAGTGCTGGAGATGCCAATGATGAAGCGGGAGTGGGAGATGGAGATGCcaatgcagatgcagatggagCAGAAGAAGGAGGCAAAGAAGATGACTCAAAATGAAAGTAATTCACTCACATGCTGTATTAGCAGTTGCGGAAACCAGGAAGTTAGCCAATTGCTGGCATTTCCTGAAGAGACATCCTTGTTGAGAAAGTGGATCCATAATTTAAGGCTTTCCAATGAGATTGAGCCCTCTTCTCTAAGCCTGAAAAGAGTTTGCTTGTCGCACTTCGAATCGCAGCTCTTGGAGAATGGAAAGCTCACAAAGgaagcagaggcagaggctgTGCCTACCTTAAACCTGGGCCATAGCAGCTGGAATCTATACAGAAGCAATGGGATTTGCCTAGTGCCTGACTGCACCCATAATACCTTCGGACGCATTAGCTTTATCGACCTGCCGGATAACAGTATTATTAGGAAGGCTTGTTTCTCCTGCTTAAACCTACCTGAATCTTCCGAGGAGCAGGCGAGACTATGTTGTGTCCACTTCATGCAGGCTTACAAAAAGTTTGATCTGCCTAATGTTCTGCACCCTAAAGTCATGATGGCGCTACAAAGTGTTGTGGCCGAGCTGAAATGCGCGGTGCCTGACTGTAATTCCGAAGAAGCTGGTTCTGACTTTCAACTTATCCAGTTTCCCGATGACAAGGAGATGCTGTCACAGTGGCTGCACAACACCAAGGTCCCTTATGATCCTTCTAATCACCAAAGTTATCGCATCTGCACACGTCACTTTGAATCAGAGTATTTGGAGTTGAATGGCCCGCTAAAAGGAGCTCTACCAACGCTACATCTAAACCATGAAGATGAGATTCACTTGAATACCAGCCCTTTGCCAGAGGATCAGAACTCTATATTGACACCACTGCGTATAAAGACGGATCCGGCCTTCTTGGGCAGTCCCTGTGCAAGTGCAAGCCCCAGTCCCCGGGGCAAGATCCGTATGTGCTGCATTCCCACATGTGGACAGTATGGCAGCAGTCAAGTGCGGCTGTTTCGTTTTCCCACCGAGGAGCAGGCGTTGCTTCGGTGGCTGGTGAACACCCAACAGCAGCCGCGACTGGTTGATCCCATGGACTTGTATGTGTGCCAGTCGCATTTTGAGCCCGAGGCCATTTATATGAAGCAACTACGAAACTGGGCTGAGCCCACCTTAAACTTGGGACACGACGGCCATATAATACCGAATGCCAAACataatggaaatatttccGACAGCCAAGATACCGAGCAAGCCATGAGGTTTATTCGCGAACGATTCTGCTCGGTCATTTCTTGCTTTCAGGCTGGCggacaggaggaggagggagtgAGGCTATTTGATTATCCCGAGGATATGGCGACTACTCGAAAATGGGCCGCCGCATGCCGACATCGCTCCATGCAGGCCAGAAGCCATGGGTTCAAGGTGTGCCAATTGCATTTCGCCAAGGAATGCTTTGACCCAAATACCGGGGAATTAATTGAGGGCGCTGTGCCTACACTGGAGTTGAGCAGAGATGAAATGGAGAGGCAATGTCTGGTGGCTGGATGTGTGAAAAATGATGCCAATGGAACCCGCCTTCGATACTTTAAGATACCAAAAGTTGCTGCCCAATTAGAAGCGTGGAGCAACAACCTTAAAGTCCATCCAACGGATCTCATGCAAGGCGAGCAGCAGTACATCTGCGAGAAACACTTTGAGTCGTTCTGTTTCGCGGCCAACAAGGGACTGCGTTCTGGTGCTCTGCCAACCCTCCTCCTAGGCCATGATGAAGAGGTGGATATGCTTCCAAATCCGGAAAGCTTCAACTGCCAGAATAAGGCGGATAAATGCTGCGTACCCGGCTGCGGGCGTGTCTGGCAGGCTGGTGATCGTAAATTTCGTGGATTTCCCAAATTGCTGGCCATGGCCAAGAAATGGAGACATAATCTTCGTTTGGAGGAGCCCGTGGAGCAACTCGGCAAGCTGAAGGTCTGCGGTGCTCACTTTGAGGCCACCTCACCCAGCCTGGGTACAAATGGACTAAGTGTCTCGATACCAACCCTGGAATTGGGCCACTCTTCTCGGGATATTTTCCCAGCGGAGATAAGCTTAAAGTTCCAAAAGCCAGCGAAAACGATTTGCTGTTATCCCAAATGCGAAGAAGCCTGTTTATCCAAGAACTTTTCTTACGGTCTTCCCCAGGAGGAGCATCTGAGAAATGCCTGGCTAAGCTATATGGACATCGAAGACCCGAAAGATGAAGAAATCGCACAGGTGTGCCCGCTGCACTATGTCATCCTCTACCAGCACAGTGCCGCACTCTATCCGGAGCTTCATGCTTCAAGCCGTCGGCTTCTTGACTACAATTACAAGGAGGCGTGGAACAACAGGCGCGTTAAGATTGTGAGTTGCACGATCAAGGGCTGCGACATGATTAAGCCACGAGATGGGATACCACTGCACGGGATGCCGCAAAGCAAGGACATCCTGCAGATGTGGATAGAAAATGGTCAGTTTGAGTTCTTAGAGCAGCAGCGGTATATGTTCAAGGTGTGTCACAATCATTTTGAGCCATGCTGCTACTTCGACGACAGACGTTTGCACTCATGGAGCGTGCCCACTTTGCATCTACCTGGAGATGTAATTCACCAAAATCCCACCGCCGAGCAGTGGCAGAACATGATCAACAAGCAAGCAGCAGCGAAAACTGACCGAGAAGAGAGCGAGGAGCCAGATCCATATGAGGATGTGGTTAAAACCGAACCCATTGTAAAGATGGAGCATATCGAATCGGAATATGAAGATGAAAACAGTGAGATGCAGGCCCTCGAGGTCCTCCTAGAAGTTGGCCATGTCGAGCGAATGGAGAGCTATGAGAAAATGGATAAATCACCAGCGACATACACCGATACACCGTTTCGATCTTCACCCATACGTTACCCATACAATGCTAATCATTGTGCCGTAGAAGGATGCCAGGTGACTGTCGAGGATGTGGACGGCACAATTAAGCTGCATAAATTTCCCGCCTCGCAGGAAGCAGCACAGAAGTGGATGCACAACACCCAAGTTGACATGGACGAAAAGTATTGGTGGCGTTATCGCATTTGCAGCTATCACTTCGAACAAGAATGCTTTCAGGGTGCTAGAATTCGTAAGGGCGCGATGCCCACGCTTTTGCTAGGACCGCGGCGACCGGACGAGGTATACGATAATGAGTTTTCACTACCAGAGGCGGAGGAGCCCTTTCCAGAGCCACCCGAGACTCAACTTGAGGAAAGAACGTCCTTGGCGTCCAGAGTTCAAAAGGAGGTAACCAATTTATGCCTGCCGCCACGGGCGCCGCCTCGAAAGTCCAGCAAGTTTTGCCAAATTGATTCCTGCACAAATCATTTGACCACTGAGAACATGACACTTCACAAGTTTCCACACTCGGAGGACATGTGCCTCAAGTGGCAGCACAACACACAAGTGCCATTTGATCCCTACTACCGCTGGCGCTATCGCATTTGCAGTGCCCATTTTCATCCGGTGTGTTTGGTCAACATGCGTCTAGTCCATGGAAGCGTTCCCACTTTAAAGCTGGGCCCTAAGGCTCCTTCCGAGCTGTTTGACAACGATTTCGAAGCCATTAACCTAAGATTGGACAAAAGGTTGACAGAGTCCAATGCTAATGTGTATATCAAGCATGAAAAAagggaggaggatgaggattCGATGATGTTCCTGGAGCCCGAACTTCAGTTACACGAGGACCAAGACGATAAGGTATCAAGCTGGAGCAGCAAAATGCCATTACCACCTGTGAAGCAAGAGAAGATTATATACAGCCAGATCAAGTCTGGCTACGATAAGTGTTCGCTGGCGCACTGCCAGCGCCAAAGGTCCCAGCATGGCGTCCACATTTATAAGTTTCCCAGATCGAGGCGTCAGCAGGAGCGTTGGATGCACAACCTACGCATCCGCTATGATGATCGGACGCCGTGGAAATTCATGATCTGCAGCGTTCATTTCGAGCCGCATTGCGTCAGCCTAAGGAAGCTGCGACCATGGGCGGTGCCCACACTGGAACTGGGTGACAATGTACCAGAGACAATCTTCACGAACGAACAGTGCGAGAAGGAGCTGGTGACCGATCGCAGTGATCCGGATAGCGACGCCGAGGAAGAAGACGGCTTGcaggaggacgacgacgatgatgaagaCGAAGACGATGTGAAGCCCGATGTTATTGGCATAAAAAGGAGGAAACGTTCCAAAATAGATGCCAACTGCCCTCCCAGCCAGATTCCACCCTGGAAAGTCAAGCAATGCTGCCTCCCCTATTGTCGTGCCTTTCGAGGCGATGGCATCAAGCTGTTTCGGCTTCCGAATAACCGAAACTCCATTAGCAACTGGGAACGGGCCACCGGAATGGTATTCAAGGAGTCGCAACGGAACACTCGCCTGATCTGCAGTCGTCACTTTGAGCCAGAGCTGATTGGCGTCAGGCGTCTAATGCGTAACGCCATTCCCACGAAACACTTGAGCCCCCAAGCTGTGGACCAGACGCGTactaaaaaggaaaagaatcCTCCTCCGGCCACTATTGTACCCATCTGCTGTATGGCGGATTGTCATTACAACGGAAATGTGAAGCTGTACAAGTTTCCAAGTGATCCCACTCTTCTCAAACAGTGGTGCCAGGCTCTCCGTCTCACGGATACGCAGCGGTATTTGGGCAAGCACATTTGCTCCATGCACATGCCAATGAACAAGACGCAGAGCTGTGTCATCTGCGGTGGAGATGACGTAGAGCTGCCAATGCTTGGGTTTCCGGAAAACCGCAATCAGCGCGCCAAATGGTGTTACAATCTTAAAATTGAGGCAATACCAAAGTGGGACCACTCAAAGCATATTTGCTGCCGGCACTTTGAGTCCCATTGCTTTGACAAGCCGGGTGAGCTACGTCCAGGAGCGGCTCCCACGCTCCATCTCAATCACGATGACACAAACATATTCTTCAGCGACTATGCCACTGGTCTTCCGTCCTCGCCACTAGGCAATCGAATTAAAGACGAGCCCCTGGAATCGGAATCCGACGAGACACTGCTGGTGTAG
Protein Sequence
MSQHNPNHAHHPHYHYPEPLEGFQQPPNPMAPPPAPEMIIKSEPIDDLAYKSNYIDDNTPFADFSKFSEFSEDMLSPKVELTVKDESFVRNPNSFLRRKQQSDLTTAESLPVCQRCKEVFFKKQTYLRHVAESNCGIQEYDFKCTICPMSFMTAEELHQHKQQHRADRFFCHKYCGKHFGTITECEAHEYMQHEYENIVCNMCSGSFATREQLYAHLPQHKFQQRFDCPVCRLWYQTAVELHEHRLAAPYFCGKYYTGGQSPSPSSSSQQQQHQSQTNYKLQDCHMATMEMPNAPLLKANSSNSPALPATAALNSLLQQRQANADGAAIFAASSLKNEVAVKLERSYSNSTNESSYSVQESGYNNVYGSSDSSVHGAIAGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLNQWLHNLKMFHIPASSYVSFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPHCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPGVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPTPTPSSCHLQQQQQSSLHSLQQQMHSKSYQRRSAASTSSSASSAASHYVDPEMSASYHLAMSASAGGSATINASDSMDVCCVPSCESKRHNSENITFHTIPRRPEQMRKWCHNLKIAEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLQLGHDDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVALLKKWCTNLQRSVPDGSKLFNDAICEVHFEDRCLRNKRLEKWAVPTLILGHDDIAYPLPTPEQVTEFYARPTAPNNGEEQGECCVETCKRNPSVDDIKLYRPPEEAAVLAKWAHNLQTEANQLTSMRICNLHFEAHCIGKRMRHWAIPTLNLAGNIENLYENPEQSLLYRRRTTHMKAKLTQASVKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPMVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTLDLNSPPGLKIYQNPAKLKASKLCLQRVCIVESCRKTRAQGVQLFRLPHSPTQLRKWMHNIRTRPRAAMRAQYRVCSRHFETHSFNGRRLSAGAIPTLELGHDGDDIYPNEAQAFVDEHCAVEGCEASKEQPEVRLFRFPTDDDDMLWKWCNNLKMNPVDCIGVRICNKHFEADCIGPKHLYKWAIPTQELGHDDAQIELIPNPKPEDRYVDPVFKCIVPTCGKTRRFDEVQMNSFPKDPDLFQRWRHNLRIDHLSFQERERYKICNAHFEEICIGKTRLNIGSVPTLELGHDDEEDIFKVNPAELQSNLFGRQRRLLLEGSGEQSVVKQELSETEDNNKADVTATGSNSKQIKIKRSSLDLKCCVHSCGRSRLEHGARLFPFPTGKQQHLKWRHNLHLEPEEVDRSTRVCSAHFNRRCIEGKQLRSWAMPTQQLGHNDQPIYENPKNIPGFFTPTCALGHCRKRRSIDNDLRTYRYPRSEDLLEKWRANLRLAPDQCRGRICANHFEPQVRGKLKLKTGAVPTLQLGHDEELIYDNEAIKAGMTEEEEAITTDFPRLKPKKELFEEEEEECEGNDGEQQHADDLDENADEEDKDDQYFDPLELVETFAEHRSDDEAQDYEDEEDEGRVEDSPSGYDVKEEIEPPPSSPPSPLRRRHHVPRRDKPANNVTPICCLKHCRKERTAFHLLSTFGFPKDRQLLLKWCVNLHLNPDDCIGRVCIEHFQPEVLGTRKLKQNAVPTLNVGHDEPLRYSCHGVDQNLEEREPQPQHSVFRLWSLKHCRKRKPTEPPDIPPTKRRVLEMPMMKREWEMEMPMQMQMEQKKEAKKMTQNESNSLTCCISSCGNQEVSQLLAFPEETSLLRKWIHNLRLSNEIEPSSLSLKRVCLSHFESQLLENGKLTKEAEAEAVPTLNLGHSSWNLYRSNGICLVPDCTHNTFGRISFIDLPDNSIIRKACFSCLNLPESSEEQARLCCVHFMQAYKKFDLPNVLHPKVMMALQSVVAELKCAVPDCNSEEAGSDFQLIQFPDDKEMLSQWLHNTKVPYDPSNHQSYRICTRHFESEYLELNGPLKGALPTLHLNHEDEIHLNTSPLPEDQNSILTPLRIKTDPAFLGSPCASASPSPRGKIRMCCIPTCGQYGSSQVRLFRFPTEEQALLRWLVNTQQQPRLVDPMDLYVCQSHFEPEAIYMKQLRNWAEPTLNLGHDGHIIPNAKHNGNISDSQDTEQAMRFIRERFCSVISCFQAGGQEEEGVRLFDYPEDMATTRKWAAACRHRSMQARSHGFKVCQLHFAKECFDPNTGELIEGAVPTLELSRDEMERQCLVAGCVKNDANGTRLRYFKIPKVAAQLEAWSNNLKVHPTDLMQGEQQYICEKHFESFCFAANKGLRSGALPTLLLGHDEEVDMLPNPESFNCQNKADKCCVPGCGRVWQAGDRKFRGFPKLLAMAKKWRHNLRLEEPVEQLGKLKVCGAHFEATSPSLGTNGLSVSIPTLELGHSSRDIFPAEISLKFQKPAKTICCYPKCEEACLSKNFSYGLPQEEHLRNAWLSYMDIEDPKDEEIAQVCPLHYVILYQHSAALYPELHASSRRLLDYNYKEAWNNRRVKIVSCTIKGCDMIKPRDGIPLHGMPQSKDILQMWIENGQFEFLEQQRYMFKVCHNHFEPCCYFDDRRLHSWSVPTLHLPGDVIHQNPTAEQWQNMINKQAAAKTDREESEEPDPYEDVVKTEPIVKMEHIESEYEDENSEMQALEVLLEVGHVERMESYEKMDKSPATYTDTPFRSSPIRYPYNANHCAVEGCQVTVEDVDGTIKLHKFPASQEAAQKWMHNTQVDMDEKYWWRYRICSYHFEQECFQGARIRKGAMPTLLLGPRRPDEVYDNEFSLPEAEEPFPEPPETQLEERTSLASRVQKEVTNLCLPPRAPPRKSSKFCQIDSCTNHLTTENMTLHKFPHSEDMCLKWQHNTQVPFDPYYRWRYRICSAHFHPVCLVNMRLVHGSVPTLKLGPKAPSELFDNDFEAINLRLDKRLTESNANVYIKHEKREEDEDSMMFLEPELQLHEDQDDKVSSWSSKMPLPPVKQEKIIYSQIKSGYDKCSLAHCQRQRSQHGVHIYKFPRSRRQQERWMHNLRIRYDDRTPWKFMICSVHFEPHCVSLRKLRPWAVPTLELGDNVPETIFTNEQCEKELVTDRSDPDSDAEEEDGLQEDDDDDEDEDDVKPDVIGIKRRKRSKIDANCPPSQIPPWKVKQCCLPYCRAFRGDGIKLFRLPNNRNSISNWERATGMVFKESQRNTRLICSRHFEPELIGVRRLMRNAIPTKHLSPQAVDQTRTKKEKNPPPATIVPICCMADCHYNGNVKLYKFPSDPTLLKQWCQALRLTDTQRYLGKHICSMHMPMNKTQSCVICGGDDVELPMLGFPENRNQRAKWCYNLKIEAIPKWDHSKHICCRHFESHCFDKPGELRPGAAPTLHLNHDDTNIFFSDYATGLPSSPLGNRIKDEPLESESDETLLV

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00525910;
90% Identity
iTF_00594581;
80% Identity
-