Basic Information

Gene Symbol
-
Assembly
GCA_008042535.1
Location
VNJO01001704.1:1-22051[-]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) that probably also binds a zinc ion. It features a conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box, and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably in worms and flies, but not in other eukaryotes or in any prokaryotes [1].
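The C2CH consensus quoted in the description can be written as a regular expression. The sketch below is only a literal translation of that spacing rule, not a replacement for the Pfam PF05485 profile; real THAP domains may deviate from it, and the profile HMM remains the authoritative detector. The names `C2CH_PATTERN` and `find_c2ch` and the toy sequence are illustrative assumptions, not part of this record.

```python
import re

# Literal translation of the consensus in the description:
# Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His
# (assumption: any residue may fill the spacer positions)
C2CH_PATTERN = re.compile(r"C.{2,4}C.{35,50}C.{2}H")


def find_c2ch(protein: str) -> list[tuple[int, int]]:
    """Return 0-based, end-exclusive (start, end) spans of non-overlapping matches."""
    return [m.span() for m in C2CH_PATTERN.finditer(protein.upper())]


# Synthetic toy sequence (not taken from this record):
# spacers of 3 and 40 residues, both inside the allowed ranges.
toy = "MK" + "C" + "AAA" + "C" + "A" * 40 + "C" + "AA" + "H" + "GG"
print(find_c2ch(toy))
```

Because the pattern is so permissive, it is best used to sanity-check regions that hmmscan already reports rather than to discover domains on its own.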
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 25 4.8 8.7e+03 -2.2 1.2 40 60 313 334 304 352 0.58
2 25 2.8e-15 5.1e-12 46.7 3.7 1 86 546 618 546 619 0.85
3 25 1.2e-14 2.3e-11 44.6 5.0 1 87 646 715 646 715 0.83
4 25 1.1e-15 2.1e-12 47.9 0.2 1 87 737 809 737 809 0.85
5 25 7.8e-16 1.4e-12 48.4 5.3 1 87 909 979 909 979 0.82
6 25 4.7e-15 8.5e-12 45.9 3.6 1 86 1003 1074 1003 1075 0.81
7 25 1.6e-12 3e-09 37.8 0.4 1 87 1110 1178 1110 1178 0.80
8 25 1e-10 1.8e-07 32.1 1.4 1 86 1220 1289 1220 1290 0.76
9 25 6.1e-17 1.1e-13 52.0 0.4 1 86 1317 1386 1317 1387 0.82
10 25 2.4e-12 4.5e-09 37.2 1.9 1 85 1408 1476 1408 1478 0.79
11 25 9.3e-15 1.7e-11 45.0 0.5 1 86 1505 1576 1505 1577 0.85
12 25 3.8e-12 6.9e-09 36.6 5.3 1 85 1651 1719 1651 1721 0.82
13 25 2.5e-12 4.6e-09 37.2 0.1 1 86 1744 1812 1744 1813 0.83
14 25 1.7e-13 3.1e-10 40.9 2.6 1 87 1959 2028 1959 2028 0.80
15 25 2.7 4.9e+03 -1.4 0.0 65 86 2098 2119 2056 2120 0.79
16 25 4.2e-12 7.7e-09 36.5 0.0 1 86 2130 2203 2130 2204 0.80
17 25 3.5 6.5e+03 -1.7 0.2 1 8 2229 2236 2229 2241 0.92
18 25 0.027 49 5.1 0.1 50 62 2250 2265 2234 2287 0.70
19 25 6.5e-05 0.12 13.4 0.1 1 58 2295 2345 2295 2366 0.82
20 25 3.4e-11 6.2e-08 33.6 0.3 1 87 2385 2457 2385 2457 0.81
21 25 2.5e-15 4.6e-12 46.8 0.6 1 86 2569 2641 2569 2642 0.81
22 25 9.5e-07 0.0017 19.3 0.0 25 86 2677 2726 2658 2727 0.74
23 25 3.3e-13 6.1e-10 40.0 3.6 1 86 2819 2889 2819 2890 0.84
24 25 1e-11 1.9e-08 35.2 0.2 1 86 2969 3038 2969 3039 0.83
25 25 1.4e-08 2.5e-05 25.2 1.3 1 57 3064 3111 3064 3111 0.85
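The hit table above is whitespace-delimited with 13 fields per row, so it can be read programmatically. The following is a minimal sketch, assuming the column order shown in the header row; the `DomainHit` type, the helper names, and the independent E-value cutoff of 1e-3 are illustrative choices, not values defined by this record. The three sample rows are copied from the table.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    index: int       # "#": hit number
    total: int       # "of": total hits reported
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hit(line: str) -> DomainHit:
    """Parse one whitespace-delimited row of the hit table."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


def significant(hits, max_i_evalue=1e-3):
    """Keep hits at or below an assumed independent E-value cutoff."""
    return [h for h in hits if h.i_evalue <= max_i_evalue]


# Three rows copied from the table above (hits 1, 2 and 19).
rows = """\
1 25 4.8 8.7e+03 -2.2 1.2 40 60 313 334 304 352 0.58
2 25 2.8e-15 5.1e-12 46.7 3.7 1 86 546 618 546 619 0.85
19 25 6.5e-05 0.12 13.4 0.1 1 58 2295 2345 2295 2366 0.82
"""
hits = [parse_hit(r) for r in rows.splitlines()]
for h in significant(hits):
    print(h.index, h.ali_from, h.ali_to, h.i_evalue)
```

A stricter or looser cutoff changes which of the 25 hits count as confident THAP domains, so the threshold should be chosen to suit the analysis.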

Sequence Information

Coding Sequence
ATGTCACAACACAACCCAAATCACGCCCACCCACACTACCACTACCCGTCCCATCCGACGCCGCTGGctctgcagcagcaccagcagcaggagcacgGCAGTAGTTGGTACTCACATGTTGCTTCCTACCCAGCATCCTCCCACTCCCACAACCTCTCTCAGTCGGCCTTTGGCCCTGCGCCCCCTTGCAAGgccagcatcagcaacaacaacaccattATGGGTGCCTacggaggaggagaaggaggggGTGGTGGCTCGCATGGATATTTCGGCGCCGCTGGCGGGGGCCTCAATGTCAGCGGGGTGGCAGGGGGTGGTGCTGGGCCGACCTACGGCCTTGGGGCCAACACTGTGGCATATGCTCACAACCAGCTGCTGCAGTaccagcatcatcatcagcaacaccagcagcagcagcagcagcagcagcagcagcacctgggTCTGAGCCAGCGATCCTATATGGGCCACGATGTCATGGCCGGGAGCTATCCCTATATCAAAAGTGAACCTTTGGAGGGCTTCCAGCAACCGCCCAATCCAATGGCCCCACCCCCAGCCCCagaaatgataataaaatCGGAACCCGTTGACGAACTTGCCTACAAGTCAAACTTCATTGACGACAATACGCCATTTGCAGACTTTAGTAAGTTCAGCGAATTCAGCGAGAACATGCTGAGTCCCAAAATCGAGCTGACAGTTAAGAATGAGTCCTTCGACAGGAATCCCAATAGCTTTTTACGCCGTAAGCAGCAATCTGATCGGTCAACATCTGAGAGCCTGCCCGTCTGCCAGCGATGCAAGGAGGTGTTCTTTAAGAAGCAGACTTACCTCCGCCACGTCGCCGAGAGCAACTGCGGCATCCAGGAGTACGACTTCAAGTGCACCTTATGCCCCATGTCCTTCATGAACGCCGAGGAGATACAGCTACATAAGCAACAGCATCGAGCGGACAGGTTCTTTTGTCACAAGTACTGTGGCAAGCACTTTGGCACGATCGCAGAGTGCGAAGCGCACGAATACATGCAACACGAATACGACAACATTGTTTGCAACATGTGTTCGGCAACGTTCGCCACGCGGGAGCAACTGTATGCACACTTGCCACAGCACAAGTTCCAGCAGCGCTTCGACTGTCCCGTATGCCGCTTGTGGTACCAAACGGCTCTTGAGCTGCACGAGCACCGCCTGGCTGCCCCTTACTTTTGCGGTAAATACTATGCAGACGGACGGTCCCCTTCGTCTTCCCACCATCAACACCACCAAAGCCAAACGAACTACAAGCTGCAGGACTGTCATATGGCAACAATGGAAATGCCAAACGCACCACTCCCTAAGACGAACCCATCCAGCTCGCCTGCCTTGCCCGCGACCGCGGCACTTAATTCACTGTTACAGCAGCGTCAGGCAAATGCTAATGAAGCGGGCATTTTTGCCGCATCTACGCTCAAGAACGAGGTCACTGTGAAACTAGAGCGTAGCTACAGTAATTCGACTAACGAATCGTCTTATAGCGTTCAGGAGAGCGgctacaataatatttatagtagCAGCGACACCTCAATCCACGGTGCCCTCGCTGGACCGCAGGCACATTCTTCGACACTGGACGACTCCGAGGATGCGCTGTGCTGTGTGCCGCTGTGTGGGGTGCGGAAGAGCACAAGTCCCACATTGCAATTTTTCACGTTCCCAAAGGACGAGAAGTATCTCAACCAGTGGCTGCATAACCTTAAGATGTTCCACATACCCGCTTCGAGCTACCTTAGCTTCCGCATCTGCAGCATGCACTTTCCGAAGCGATGTATTAACCGCTACTCCCTGTGCTACTGGGCGGTGCCAACTTTCAACCTCGGCCACGATGACGTAGCCAATCTCTACCAGAATCGGGAACTTACCAACACCTTTACCACTGGCGAAGTGGCGCGCTGCAGCATGCCACATTGCACTAGCCAGCGAGGTGAGAGCAACCTTAAGTTCTACAACTTCCCGAA
GGATATCAAAAGCTTGATAAAGTGGTGCCAGAACGCCCGACTTCCGGTGCAGGCGAAGGAGCCGCGACATTTCTGCAGCCGCCACTTCGAGGAGCGATGCATTGGCAAGTTTCGATTAAAGCCGTGGGCAGTGCCGACATTGCACCTTGGCGCCCAGTACGGCAAGATACACGACAATCCAAAAAATCTTTACGTAGAAGAAAAACGGTGTTGCCTAAACTTTTGCCGTCGAAGCCGATCGTCGGATTTCAATATGTCGCTATACCGATTTCCTAGGGACGAAGTCCTTTTGCGTCGCTGGTGCTATAATCTTCGCCTCGATCCGGGAGTGTACCGCGGCAAGAATCACAAAATTTGCAGCGCCCACTTTATCAAGGAGGCACTGGGTCTCCGTAAACTATCTCCTGGTGCCGTGCCCACGCTTCATCTAGGCCACAACGATACATTCAACATTTACGAAAACGAGCTATGGCCGCCGCCGACACCAACACCCTCAACTTGCCacttgcaacagcaacagcagtcaTCACTTCACTCACTGCAACAGCAAATGCACAGCAAGTCCTACCAGCGTCGTTCGGTGGCATCCACTTCGTCGTCGGCGAGTTCGGCAGCCTCGCATTACGTGGACCCGGAGATGAACGCCTCTTAccatctagccatgtccgcctccgcctccgccagtGGCTCTGCGACGATAAACGCCAGCGATAGCATGGATGTCTGCTGCGTGCCCAGTTGCGAAAGCAAGCGCCACAATAGTGAGAACATCACATTCCACACGATACCGCGACGGCCCGAGCAGATGCGCAAGTGGTGTCACAACCTTAAGATTGCCGAGGACAAGATGCACAAGGGCATGCGAATCTGTAGCCTCCACTTCGAGCCCTACTGCATCGGCGGCTGTATGCGACCCTTTGCGGTTCCCACTCTGCATCTGGGTCACGATGACGAGGACATCCATCGCAATCCGGACGTGATCAAAAAGCTAAACATTCGGGAGACATGCTGCGTGGCTGTGTGCAAGCGGAATAGGGACAGGGACCATGCCAATCTGCATCGCTTCCCCAGCAACGTGGCCTTACTAAGAAAATGGTGCGCCAATTTGCGGCGCTCTGTCCCCGATGGCAGTAAACTCTTCAATGATGCCATCTGTGAGGTGCACTTTGAGGATCGCTGCCTGCGCAACAAGAGGCTGGAGAAGTGGGCGGTGCCTACTCTGGTTCTCGGACACGAAGACGTCGCCTATCCACTGCCCACACCGGAGCAGGTGACCGAGTTCTATGCACGGCCTACAGCTCCCAACAATGGCGAAGAGCAGGGCGAGTGTTGCGTGGAGACCTGCAAGAGGAATCCCAGCGTGGACGACATTAAGCTTTACCGCCCGCCGGAGGATGCCGCCGTGCTGGCAAAGTGGGCGCACAATTTGCAGACGGAGGCCAACCAGCTGACGGGCATGAGGATCTGCAACCTGCACTTCGAGGCGCATTGCATTGGCAAGAGGATGCGACTGTGGGCAATACCAACTTTGAATCTAGCTGGCAACATCGAGAACCTCTACGAGAATCCAGAGCAATCGATGCTGTACAGAAGGCGGACGACTCATTTGAAGACAAAGCTGCCGTCAATCTCCGCAAAGCCCACCTGGGTGCCCAGGTGCTGTCTTCCGCACTGTCGCAAGGTCAGAGCCCTGCACAACGTTCAGCTTTACCGCTTTCCCAAGCTCAATCGCTCCACACTGGCCAAGTGGGCGCATAATCTGCAGGTTCCCATGGTGGGCAGTGCCCAGCGCAGGCTATGCTCGGCTCATTTCGAGCCGCATGTGCTTAGCAAAAAGTGCCCGGTCCCGCTGGCCGTACCTACGCTCGACCTAAACTCACCATCCGGCCTGAAGATCTATCAGAATCCGGCCAAGCTAAAGGCCAGCAAACTGTGCCTGCAGCGGGTTTGCATCGTCGAAAGCTGCCGCAAGACGCGGGCGCAGGGCGTGCAACTATTCC
GTCTTCCGCACAGTCCCACGCAGTTGAGGAAATGGATGCACAACATAAGGACGCGGCCACGGGCAGCTATGAGGGCTCAATACCGTGTCTGTTCCCGCCACTTTGAGACGCACTCATTCAATGGCCGCAGACTCAGCGCAGGTGCTATTCCGACTTTGGAACTGGGTCACGATGGCGACGACATCTATCCCAATGAAGCCCAGGCATTCGTGGAAGAACACTGTGCTGTCGAGGGCTGCGAGGCCTCCAAGGAGCACCCGGAGGTCCGACTCTTCCGCTTCCCCACCGATGACGACGACATGCTGTGGAAGTGGTGCAACAACCTCAAAATGAATCCTGTGGACTGCATTGGGGTACGCATCTGCAACAAGCACTTCGAGGCCGACTGTATTGGTCCCAAGCACCTGTACAAGTGGGCCATTCCCACAAAGGAGCTGGGCCACGATGACGCACAGATCGAGCTGATCCCGAATCCAAAACTAGAGGAGAGGTATGTGGATCCAGTCTTTAAGTGCATCGTACCCACCTGCGGAAAGACGCGACGCTTTGATGAGGTGCAGATGAACAGCTTTCCCAAGGATCCGGATCTGTTTCAGCGTTGGCGGCACAACCTGCGCTTGGAACACCTCAGTTTCCAAGAACGTGAGCGCTACAAGATTTGTAACGCTCACTTTGAAGAGATTTGTATTGGGAAGACTCGGCTGAACATTGGATCCGTACCTACCCTGGAGCTGGGTCATGACGATGAAGATGATATTTTCCAAGTGAATCCTGCCGAACTGCAGAGTAACTTATTCGGCCGACAACGTCGACTGCTACTTGAGGAGTCCGGCGATGGGAGTGCGAAACAAGGGCTGTCCGAGACGAAAGATAACAGAAATTCGGATCTGACGGCCACAGGCTCCAATTCCAAACAGGCTAAGATCAAGAGATCTATTTCGGATCTAAAGtgttgtgtacacagctgTAGAAGAAGCCGTTTGGAGCACGGAGCACGGCTGTTTCCCTTTCCAACGggcaagcagcagcacctaAAGTGGCGTCATAATCTGCACCTGGAACCAGAGGAGGTGAACCGATCAACGCGAGTATGCAGTGCTCACTTCAATCGGCGTTGCATCGAGGGCAAACAGCTGAGGAGCTGGGCGATGCCCACTCAACAATTGGGTCACCACGACCAGCCGATATACGAAAACCCAAAGAATATACCTGGGTTTTTCACACCTACCTGTGCCCTGGGACACTGTCGCAAGAGGAGGAGTATTGACAACGATCTGCGTACCTATCGATATCCAAAAAGCGAAGATCTGCTGGAAAAATGGCGAGTTAATTTACAGCTGGCTCCGGATCAGTGCCGTGGTCGGATTTGTGCAAACCATTTCGAGCCGCAGGTGCGGGGCAAGCTAAAGTTAAAGACGGGAGCCGTGCCCACCCTAGAACTTGGACACGATGATGGATTTATCTATGACAATGAGGCTATAAAGACGGGTATGATGGAGGAAGAGGAAGGCCTCACCACAGAATTCCAGCGcctgaaaccaaaaaaagagaTGTTTGAAGCGGTTGAAGAGGACGTTGAAGAGAATGTTGGTGAAAAGCAGCACCCAGATTACCAGGACGAAAATGCAGATGAGGATGACAAGGATGACCAGTATTTTGATCCTCTTGAGCTGGTAGAGACCTTTGAACATTGCAGCGATGATGACGCCcaagatgatgatgaaaaaGAGGGCCGAGTTCTGGACTCCCCCTCTGGTTACGCGGTCAAGAAGGAGATAGAACAGCTTCCAAGCACCCCACCTTCACCTTCACCCTTACCCCGACGGCACCAAACTCTGCGTCGAGGCAAGCCCGCCAACAATGTGACGCCCATTTGTTGTCTAAAACACTGCAGGAAGGAGCGTACTGCCTTCCACCTGCTGAGCACTTTCGGCTTCCCAAAGGATCGCCAATTGCTGCTGAAGTGGTGTGCCAATCTGCATCTAAACCCGGATGACTGC
ACCGGCCGGGTTTGCATCGAACACTTCCAGCCGGAGGTGCTCGGTACCCGGAAGCTTAAGCAGAACGCGGTGCCCACTGTTAACGTTGGACATGAGGAGCCGCTTAGGTACTCGTGCCATGGCGTGGATCAGAGCCTCGAGGACCAAGACCCACAGCCACAGCATTCGGTTTTTCGGCTTTGGAGCCTGAAACACTGCCGCAAAAGGAAGCTAACGGAGCCACCGGATATTGCCCTGGCCAAGAGGAAACCGCTGGGAATGCCGATAATGAAGCGGGAATGGGAGATGGAGAAGTCAAAGAAGATGACTCAAGCGGACAATACTGGAAAATTGACTGAGGGTTCGGTCCCCACACTGGAGTTGAGCAGAGATGATATGGAACGGCAGTGTCTTGTAGCTGGATGTGTAAAGAGTGATCCTAATGGGGCCCGTCTTCGCTACTACAAGATACCAAAGACTGCTGCTCAATTGGAAGCGTGGAGCAACAACCTTAAGATCCATCCCACGGTTCTAATGCAAGGGGAACAGCACTATATCTGCGAGAAACACTTTGAAGCGTTCTGCTTTGGAGCTAACAAAGGACTGCGTTCTGGCGCTCTTCCAACTCTCTTGCTGGGTCACGATGAGGACGTAGATTTGCTTCCAAATCCGGAAAGTCTCTTCTGCCAGAGCAAGACGGACAAGTGCTGCGTTCCAGGTTGCGGACGTATCTGGCAGGTTGGCGATCCGACCATGGAGGAACTGGGAAAGCTTAAGGTCTGCAGTGCTCACTTTGAGAGTTCTTCAATACCCACCGTCGAATTGGGTCATTCTTCTCCGGATATTTACCAAGCGGATTTACCAAGCTTAAAGTCCCAAAAGCGGTCCGTAATGGTCTCCTATTGCTGCTATCCCAAGTGCGAAGAAATCAGTCTTTCCAAGAATCTGTCTTATGGGCTTCCCCAAGAGGAGCATCTGCGAAGTGCCTGGTTAAGGCACATGAACATAGAAGATCCGAAAGATGGTGCAGTCGCAGAGCTTTGCCCGCTGCACTATGTCATTCTCTACCAGCACAGTGCCAGAAACTATCCTGAGTATCACGATTCAAGCCGATTGTTTCTTGATGATAACTACAAGGATGCGCGGAACAACCGGCGCGTAAGGATTGTGAGCTGTGTGATCAAGGGCTGCGACATGGTTAAGCCACGGGATGGGATACCATTGCACGGGATGCCGCAGAGCCAGGACATCCTGCAGATGTGGATAGATAATGGTCAGTTTGAGTTCTTAGAGCAGCAGCGGTACATGCTTAAGGTGTGTCACAACCACTTTGAGCCATGCTGTTTCTTCGACGATAGACGCTTGCTCTCATGGAGCGTGCCGACTCTGCACCTGCCTGGCGAAGCATTTCACCAAAATCCTACCGCCGAACAGTGGCAAAACATAGTCAAAAAACAATCAGCAGCCAAAACAAATGCAGTGGCGAAAGAGGAGTCAGAGCTATATAGGGATGAGGATAGGACGGAGcccattttaaaaatggagCACATTGAATCCGAATATGATGATAAAAACTCGGAGATGCAGGCCCTAGAGGTCCTCCTGGAAGTTGGGCATGTGGAACGAATGGAGAGCTATGAGAAAGTAGATAAATCACCGGTAATCTATACCGAAAATTCACCCTTCCGATCGTCACCAATACGTGGCCAATACAATGCTAATCAGTGTGCCGTAGAGGGATGCCAAGTGACCGTCGAGGATGTGGACGGCACTATTAAACTGCACAAGTTCCCCGCGTCACAGGAAGCCGCACAAAAGTGGAAGCACAACACCCAAGTTGACATGGACGAAAAGTTCTGGTGGCGCTACCGCATATGCAGTTACCACTTCGATCAGGAGTGCTTTCAGAGCGCTAGGATTCGAAAGGGCGCGATGCCCACGCTTTTGTTGGGACCCCGGCGACCGGATAAGGTCTACGATAATGAATTTGCACAaccagaggcagaggcggaaGAGTCTTT
TTTAGAGCCACCGGGAATTCAGCTGGAGGAAGGAGAGGACATGTGCCTTAAGTGGCAGCACAATACGCAAGTGCCATTCGATCCCTACTACCGCTGGCGTTACCGCATCTGCAGTGCCCATTTCAATCCGGTGTGTTTGGTCAATATGCGTCTAGTCCACGGAAGCGTTCCCACTTTAAAGCTAGGTCCCAAGGCTCCATCCGAACTGTTTGACAACGACTTTAATGCCATCAACCTAAGGCTGGACAAAAGGTTGACGGAGTCCAATGCCAATGTGTATATCAAGCATGAAAAGAGGGAAGAGGATGAAGGCTCGCCAATTCAGCTCGAGCCCGAGCTCCAGTTTCAAGAGGATCAAGACGATAGGATATCAGCATGGAACAGCAAACTGCAATTGCGACCTATAAAGCTGGAGAAAATAAGTTATAGCCAGAAGAAGTCTGGCTCTGATAAGTGTCTGCTGGCTCACTGCCAACGCCAAAGGTTCCAACATGGCGTCCACATTTATAAGTTTCCAAGAGCGAGGCGCCAACAGGAGCGTTGGATGCACAACCTCCGCATCCGCTATGATGAGCGTACACCGTGGAAATTCATGATCTGCAGCGTTCATTTCGAACCTCACTGCATCAGCTTAAGAAAGCTGCAACCTTGGGCGGTGCCCACACTGGAGCTGGGCGACAATGTGCCAGAGAATATCTTTTTGAACGAACAGTGCGAGGAGGACTTGGTGATCAATCGCAGCGAGCTGGACAGCGACGCCGAGGATGAAGATGGCTTACAGgaggacgacgatgatgaagaCGAGGACGATCTGAAGCCGGATGTTGGAATAAAAAGGCAAAGACGTTTCAATAGAGATTCCTCATGCCCTCCTACCCAGACACCACCCTGGAAAGTCAAACAATGCTGCCTCCCCTATTGCCGTGCCTTCCGAGGCGATGGCATCAAGCTTTTTCGGCTTCCGAGCAACCGAAACTCCATTAGCAACTGGGAACGGGCCACAGGAATGGTTTTCAAAGAGTCACAACGGAATACGCGTTTGATTTGCAGCCGTCACTTTGAGACAGAGCTGATTGGAGTGAGGCGTCTAATGCGTAACGCCATTCCCACAAAGCACTTAAATCCGCAATGCGCTGACCAGGTCCGTACTAAAAGGGAGAAGAATCCGCAAGCCTCTATTATTCCCACCTGCTGCATGGCGGACTGCCACTACAAAGGAAATGTGAAGCTGCTCAAGTTTCCCAGTgATCCCACACTGCTTAAAAAGTGGTGCCAGGCTCTTCGGCTCACCGATACACAGCGGTATATTGGCAAGCACATTTGTTCGATGCACC
Protein Sequence
MSQHNPNHAHPHYHYPSHPTPLALQQHQQQEHGSSWYSHVASYPASSHSHNLSQSAFGPAPPCKASISNNNTIMGAYGGGEGGGGGSHGYFGAAGGGLNVSGVAGGGAGPTYGLGANTVAYAHNQLLQYQHHHQQHQQQQQQQQQQHLGLSQRSYMGHDVMAGSYPYIKSEPLEGFQQPPNPMAPPPAPEMIIKSEPVDELAYKSNFIDDNTPFADFSKFSEFSENMLSPKIELTVKNESFDRNPNSFLRRKQQSDRSTSESLPVCQRCKEVFFKKQTYLRHVAESNCGIQEYDFKCTLCPMSFMNAEEIQLHKQQHRADRFFCHKYCGKHFGTIAECEAHEYMQHEYDNIVCNMCSATFATREQLYAHLPQHKFQQRFDCPVCRLWYQTALELHEHRLAAPYFCGKYYADGRSPSSSHHQHHQSQTNYKLQDCHMATMEMPNAPLPKTNPSSSPALPATAALNSLLQQRQANANEAGIFAASTLKNEVTVKLERSYSNSTNESSYSVQESGYNNIYSSSDTSIHGALAGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLNQWLHNLKMFHIPASSYLSFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPHCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPGVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPTPTPSTCHLQQQQQSSLHSLQQQMHSKSYQRRSVASTSSSASSAASHYVDPEMNASYHLAMSASASASGSATINASDSMDVCCVPSCESKRHNSENITFHTIPRRPEQMRKWCHNLKIAEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLHLGHDDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVALLRKWCANLRRSVPDGSKLFNDAICEVHFEDRCLRNKRLEKWAVPTLVLGHEDVAYPLPTPEQVTEFYARPTAPNNGEEQGECCVETCKRNPSVDDIKLYRPPEDAAVLAKWAHNLQTEANQLTGMRICNLHFEAHCIGKRMRLWAIPTLNLAGNIENLYENPEQSMLYRRRTTHLKTKLPSISAKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPMVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTLDLNSPSGLKIYQNPAKLKASKLCLQRVCIVESCRKTRAQGVQLFRLPHSPTQLRKWMHNIRTRPRAAMRAQYRVCSRHFETHSFNGRRLSAGAIPTLELGHDGDDIYPNEAQAFVEEHCAVEGCEASKEHPEVRLFRFPTDDDDMLWKWCNNLKMNPVDCIGVRICNKHFEADCIGPKHLYKWAIPTKELGHDDAQIELIPNPKLEERYVDPVFKCIVPTCGKTRRFDEVQMNSFPKDPDLFQRWRHNLRLEHLSFQERERYKICNAHFEEICIGKTRLNIGSVPTLELGHDDEDDIFQVNPAELQSNLFGRQRRLLLEESGDGSAKQGLSETKDNRNSDLTATGSNSKQAKIKRSISDLKCCVHSCRRSRLEHGARLFPFPTGKQQHLKWRHNLHLEPEEVNRSTRVCSAHFNRRCIEGKQLRSWAMPTQQLGHHDQPIYENPKNIPGFFTPTCALGHCRKRRSIDNDLRTYRYPKSEDLLEKWRVNLQLAPDQCRGRICANHFEPQVRGKLKLKTGAVPTLELGHDDGFIYDNEAIKTGMMEEEEGLTTEFQRLKPKKEMFEAVEEDVEENVGEKQHPDYQDENADEDDKDDQYFDPLELVETFEHCSDDDAQDDDEKEGRVLDSPSGYAVKKEIEQLPSTPPSPSPLPRRHQTLRRGKPANNVTPICCLKHCRKERTAFHLLSTFGFPKDRQLLLKWCANLHLNPDDC
TGRVCIEHFQPEVLGTRKLKQNAVPTVNVGHEEPLRYSCHGVDQSLEDQDPQPQHSVFRLWSLKHCRKRKLTEPPDIALAKRKPLGMPIMKREWEMEKSKKMTQADNTGKLTEGSVPTLELSRDDMERQCLVAGCVKSDPNGARLRYYKIPKTAAQLEAWSNNLKIHPTVLMQGEQHYICEKHFEAFCFGANKGLRSGALPTLLLGHDEDVDLLPNPESLFCQSKTDKCCVPGCGRIWQVGDPTMEELGKLKVCSAHFESSSIPTVELGHSSPDIYQADLPSLKSQKRSVMVSYCCYPKCEEISLSKNLSYGLPQEEHLRSAWLRHMNIEDPKDGAVAELCPLHYVILYQHSARNYPEYHDSSRLFLDDNYKDARNNRRVRIVSCVIKGCDMVKPRDGIPLHGMPQSQDILQMWIDNGQFEFLEQQRYMLKVCHNHFEPCCFFDDRRLLSWSVPTLHLPGEAFHQNPTAEQWQNIVKKQSAAKTNAVAKEESELYRDEDRTEPILKMEHIESEYDDKNSEMQALEVLLEVGHVERMESYEKVDKSPVIYTENSPFRSSPIRGQYNANQCAVEGCQVTVEDVDGTIKLHKFPASQEAAQKWKHNTQVDMDEKFWWRYRICSYHFDQECFQSARIRKGAMPTLLLGPRRPDKVYDNEFAQPEAEAEESFLEPPGIQLEEGEDMCLKWQHNTQVPFDPYYRWRYRICSAHFNPVCLVNMRLVHGSVPTLKLGPKAPSELFDNDFNAINLRLDKRLTESNANVYIKHEKREEDEGSPIQLEPELQFQEDQDDRISAWNSKLQLRPIKLEKISYSQKKSGSDKCLLAHCQRQRFQHGVHIYKFPRARRQQERWMHNLRIRYDERTPWKFMICSVHFEPHCISLRKLQPWAVPTLELGDNVPENIFLNEQCEEDLVINRSELDSDAEDEDGLQEDDDDEDEDDLKPDVGIKRQRRFNRDSSCPPTQTPPWKVKQCCLPYCRAFRGDGIKLFRLPSNRNSISNWERATGMVFKESQRNTRLICSRHFETELIGVRRLMRNAIPTKHLNPQCADQVRTKREKNPQASIIPTCCMADCHYKGNVKLLKFPSDPTLLKKWCQALRLTDTQRYIGKHICSMH

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-