Basic Information

Gene Symbol
-
Assembly
GCA_963924685.1
Location
OZ004727.1:2800927-2823046[-]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 33 1.3e-14 1.2e-11 45.9 2.5 1 86 804 876 804 877 0.84
2 33 6.9e-15 6.3e-12 46.8 4.7 1 87 904 973 904 973 0.80
3 33 4e-15 3.6e-12 47.6 0.3 1 87 994 1066 994 1066 0.83
4 33 1.7e-13 1.5e-10 42.4 3.8 1 86 1146 1214 1146 1215 0.79
5 33 1.5e-15 1.4e-12 49.0 5.8 1 87 1239 1311 1239 1311 0.81
6 33 7e-12 6.4e-09 37.2 0.9 1 87 1346 1414 1346 1414 0.80
7 33 5.2e-11 4.7e-08 34.4 2.1 1 84 1455 1522 1455 1530 0.72
8 33 6.2e-15 5.7e-12 47.0 0.2 1 86 1552 1621 1552 1622 0.81
9 33 2.4e-13 2.2e-10 41.9 2.3 1 86 1644 1713 1644 1714 0.79
10 33 4.5e-13 4.1e-10 41.0 2.6 1 87 1741 1813 1741 1813 0.86
11 33 4.7e-07 0.00043 21.7 0.1 1 59 1880 1932 1880 1956 0.76
12 33 5.9e-11 5.4e-08 34.2 0.3 1 87 1977 2049 1977 2049 0.81
13 33 5.7e-13 5.2e-10 40.7 3.3 1 87 2080 2151 2080 2151 0.81
14 33 2.1e-13 1.9e-10 42.1 4.8 1 85 2196 2267 2196 2268 0.84
15 33 6e-12 5.5e-09 37.4 0.5 1 87 2291 2359 2291 2359 0.78
16 33 3.5e-14 3.2e-11 44.6 0.2 1 87 2660 2729 2660 2729 0.80
17 33 8.4e-11 7.6e-08 33.7 2.4 1 86 2787 2877 2787 2878 0.77
18 33 5.3e-13 4.8e-10 40.8 1.8 1 86 2910 2981 2910 2982 0.79
19 33 6.5e-12 5.9e-09 37.3 1.0 1 86 3010 3078 3010 3079 0.80
20 33 4.4e-13 4.1e-10 41.0 1.2 1 87 3099 3169 3099 3169 0.80
21 33 1.4e-13 1.3e-10 42.6 1.1 1 87 3192 3263 3192 3263 0.80
22 33 4.9e-05 0.044 15.2 0.1 1 59 3279 3326 3279 3353 0.78
23 33 8e-12 7.3e-09 37.0 6.1 1 86 3367 3436 3367 3437 0.82
24 33 5.2e-12 4.7e-09 37.6 5.0 1 86 3462 3532 3462 3533 0.80
25 33 1.6e-13 1.4e-10 42.4 2.6 1 86 3553 3625 3553 3626 0.79
26 33 7e-11 6.4e-08 34.0 3.1 1 86 3646 3715 3646 3716 0.80
27 33 2.8e-12 2.5e-09 38.5 1.1 1 87 4014 4089 4014 4089 0.81
28 33 5e-07 0.00045 21.6 0.9 1 85 4108 4176 4108 4178 0.69
29 33 0.0017 1.5 10.3 0.5 1 81 4217 4312 4217 4317 0.52
30 33 2.5e-12 2.3e-09 38.6 0.6 1 87 4331 4405 4331 4405 0.82
31 33 6.9e-15 6.3e-12 46.8 1.2 1 85 4430 4503 4430 4505 0.86
32 33 3.8e-12 3.4e-09 38.0 3.7 1 87 4643 4716 4643 4716 0.78
33 33 6.3e-11 5.7e-08 34.1 0.8 1 87 4741 4811 4741 4811 0.81

Sequence Information

Coding Sequence
ATGTCGCAAAGTAACcaacgaaaacattttcatattcatgCTCCCTATCAACACCcccaacaacagcagcaacaacagcaagcaCAACTTCATCATCATGGTCATCATTTAACACCGACGCAGCAACAGCAGTCGCAATGGTTCTCACAGCAACATTATCAACACGGTCTGCATTTGAGAGATTCGCGCCATATTCAACATGGGCAGCATCATCCCCATCACCATCATactcaacagcagcagcagcaatcaCATCACAATCATACAATGACACCGCATATGTTTACAAGTGGTTATGTTGGGGGTGTTGTAGGTGGGGGTGGGGGTGGCAGCAATACTGGAAATGCTGTTGGAAGTGGGGTTGCTATAAGTAATTCTTCACATAATGCGGCAACTATAGGTGCTACCGCACATAACATGCCGGCTTCATCTTCGTCTGCGCATCCTTATTCTTCTGCTATGCCGGTACCGGCTTCTGGAGGGAGTGTTGGTGTTAATTCTAGTGGTGGTAGTTATGCTGGTCGTAATAGAATTTTTGACCTTGAAATGTTAACAACACAGACGCAACAACACGCTGCAACATCTGCTGCACATTCCCATTCTATATTATCAAGTGGCAGTTCGAGCGGACGACAAGGATTTGATGCATATTCACATAATTCAATGTATGCACAACAAAATCAACGACATCTTTTACCTCCTGCTTCTTCCCACCACCATCTTGCTCCAACACATCATTCGGCAGCTAATTCGTTGCATCCTCATCATCATACTCAACAATTACATCATCCACATCAACAGCCACCATCAAGtctgcatcatcatcagcagcatCAACACCAccaacatcaacagcaacaacatcaacagcaacaacattatTATCACCACCCTCAGCAGAGTTCTCTGCACCGGCCACATACTCAAGTTATACCATCTATGCTGCAGCATATAAAATCTGAGCCAGTGGAACAAATAACCACAACATCGTCAATACAAACCGAGGAAGTTATTATTAAATctGAACCCGTCGATGATATTGGTTATCATCACAAAAGTGCGCcacaatttgaaaacaaatatttgcacATTGAGGAAAAACGTAAACAACttgaacagcaacagcagcgaCAACAAGAACTACAACAACAGcgtcaacaacagcaacaacgccTGCAGCAGGAACAACGTGCAcaccaacaacagcaattaCATGAGCAGCAACTTCATCAACATGGATTACAACAAATAAAGCAGGAACATTATCATCATCCTGAGAGTGAACAGAGCCATAATGAACATGCTTCCCAACAAACGCAACAACGTACAAATTCCGAGAATTCTTCTATAATGCAACCAACAGTAGATGAAAAGCAACAACTGCTACAACAACATCAGCAACCGCAAATATCCTTgagtaacataaaaacagaagcaaagCCTCTTAACTTTCCTCGTCGTAAATTACAAACAGAACGTTCTTCAACTCTGCCCATATGCCAGCGatgtaaacaagtttttttaaaacgtCAAAACTACTTACAACATGTTGCTCTATCCAGTTGCAATATTGTTGAGTACGACTTTAAATGCTCAGTATGTCCCATGTCCTTTATGTCTAATGAGGAATTGCAGACACACGAACAACTACATCGTTCAAACAgatatttttgtcaaaaatattGTGGCAAATTCTATGAAACAATTGCTGAATGCGAACAACATGAATACGGTCAGCACGAATATGAAATGTTTAAATGCAATatttgttGCATAAGCGTGACGCAGCGTGAGCAATTATTAGAACATCTTAATGAACATAAATATCAGCCACGCTTTGATTGCTGTATATGTCGTTTATGTTTTCAAACTGCCATTGAATTACATGATCATTATATGTCTAATGAAGATTTCTGCGGGAAATTTTATGACAAAGAAGCTTTTAAAAAAACCAATACCTCGTCATCGTCACCTTATCTGGGAAAGTTGGAAAGTTCGAAATTGGAAATAGCTAATACATTTTCATTGAAAGATATACCCCCTGGCAATAGTCATCATTTGGAACGTTTATACGCAAAGCCTACCAGCTCCAAAACTTCCATGCAACCACCTAACACTATACCAACTATACCTTCATCTTTATCGTTTAGCACTGTAAATGAGTTTGCTGCTCTCGAACCCCATGTTGAGGTAAAAACGGAAATTAAAGTAGAACCGGATTTTTATCCCCCCATGGACCAATCTGATTTTGCTAGCTATGATAATGATTACAGTACACCCGACTATACATCTACAAGTTCTAatcaaaacttttcatttctaCAAGATTACCAAGACAATGCTTCCAGTTCTACCACTTCATCTTATTCCTTTAATAATAACGATGCCATACAAGATGAAGAAGCAATTTGCTGTGTGCCCAAATGTGGGGTACGCAAATTTTCATCACCCTCTTTACAATTCTTCGGATTCCCCAAGGAAGACAAGTACTTATCGCAGTggttacataatttaaaaatgacatACGATCCTAATGTAAATTACTGTGCATATCGTATTTGTAGCTTACATTTCCCCAAACGATGTATTGCAAAATATTCGTTGAGTTATTGGGCAGTGCCTACCTTTAACTTGGGCCATGATGATGTGggaaatttatatcaaaatagAGAAAGTTCTGGGGGGTTTCCAGGAGGTGAAATGGCCAAATGTAGTATGCCTGGTTGCCCTTCCCAACGTGGAGAAACCAATGTAAAATTTCATGTATTTCCAAGGGATTTAAAAACCTTGATTAAATGGTGTCAGAATTCACGACTGCCGGTACACAGCAAAGATAATCGTTTTTTTTGTTCTCgacattttgaagaaaaatgttttggCAAATTTCGCTTAAAACCTTGGGCTATACCTACTCTTAATTTAGGCACGGTTTATGGCAAGATACACGATAATCCCAATATATAtcaggaagaaaaaaaatgctttttgccGTTCTGCCGCCGTAGTAGATCCTACGATTGTAATTTATCATTATATAGATTTCCCAGAGATGAAACTTTGTTGCGGCGTTGGTGTTACAATTTGAGGTTAGATCCCAACATGTACAGAggcaaaaatcataaaatttgttCTTCTCACTTCATTAAAGAAGCTTTAGGTCTAAGAAAACTTAACCCAGGGGCAGTGCCCACATTAAATTTGGGTCATAATGATAGATtcaatatatatgaaaatgaactATATACACCACCGCCACCTCCGCCACCTCAACCTTCTACGTCATCCAAGGCACATAAATATACCCAATTATTTAAACAAGAAAGGGAAAGTTCTTCCAGTTCACATATTTATGATGGCGTATTCATGAACTCTATGGTACAAAAATTCTCTTCCGCTTCTTCAAACAGTTCTAATAATCTAGATTTGGGAGATGTTTGTCTTGTGCCGTCCTGTAAGAGAACGCGTCATTCCAATGACATCACTCTGCATACTGTTCCCAAAAGGGCAGAACAGCTTAAGAAGTGGtgtcataatttaaaaatgaatttggtCAAAATGCACAAAAGTGCTAGAATTTGTAGTGCTCATTTCGAAAAGTATTGCATAGGAGGCTGCATGAGACCATTTGCCGTACCTACTTTGGAGTTGGGACATGAGGATACTAATATATTCCGTAATCCTGATGtcataaagaaattaaacattAGAGAAACGTGTTGCGTACAATCTTGTAAGAGAAATCGTGACCGCGATCATGCTAATTTGCACAGGTTTCCTACTCATCCAGAATTGTTACAGAAGTGGTGCGAGAATCTACAGAAACCTATACCAGATGGCACCAAACTTTTCAATGATGCGGTATGTGAAGTGCATTTCGAAGACAGATGTTTGCGTAATAAGCGTTTAGAAAAATGGGCTATACCTACCATTAATTTAGGATGGGATGACGCTCCCCATAATTTACCGTCGGAAGAAGAGATCAACGAAAGCTGGGTAAAACCTTTTGCACCGAATAATGGGGATGAGCAGGGTGAATGTTGCGTTATTAGCTGCAAGCGCAATCCTCAAATTGATGATGTAAAATTATACAGACCTCCCGAAGATGCAGAGCAGTTGGTTAAATGGGCTCATAACCTCCAAGTAGATGTAACGGAGTTACCAAATCTGAAAATCTGCAATTTGCACTTCGAACAACATTGCATAGGAAAGCGTTTGTTGAATTGGGCCATGCCTACTTTAAATTTAGGAGGCAAAGTTGAACACCTATTTGAAAATCCTCCACCTATGCCTGCCATTTATAAGAAGAAAATCAAAcccgaaaaaattttaaataatcaagAGAGCGTTAAGTGGTCTCCAAGATGCTGTTTACCCCACTGTCGCAAGATGCGTTCTGTAGACAAGGTTCATCTTTTCCGTTTTCCTTACATTAACCGTCAAACTTTATCGAAATGGTGTCACAATCTACAGTTACCATTGGTAGGTAGCTCACATCGACGTATCTGCTCTACTCATTTTGAACCGTCTGTTTTAACCAAACGTTGTCCCATGACATTAGCCGTACCTACGTTAGACCTAAATGCTCCTCCAGGCTATAAAATCTACCAAAATCCTGCAAGActgaagcaaataaaaataggcGCCCAAAGACAGTGTGTAATAGAATCTTGTCGTAAAACTAAATTGGATGGAATTATACTATTTCGTTTCCCCAATAACAggtcaattttatataaatggcgtcataatataaaaaattggccTAAGGGCAAATTAAGTTCTCAAATGAGAATTTGCTCCGAACATTTCGAGAGCCATTCAGTAGGAGTGAAGAGGTTATCCCCGGGTGCTATACCCACTTTGAAGTTAGGGCACGATGCGAAGGATTTATATGCCAACGAAACAAGATCATTCTTTGACTTGGAGAAATGCGTAGTTAGTGGATGCGATTCGCGCAAAGAAATGGAAGATATAAGACTTTTCCGTTTCCCACGAGATGATGAAGAATTGCTTAGGAAATGGTGCCACAATCTCAAAATGAATTCCAATGACTGCGTTGGTATTAAGATATGCAGCAAACACTTTGAATTAGAATGTTTAGGTCCGAGACAACTATATAAATGGTGTATTCCGACTTTAAAATTGGGCCATAGAGATGATTCGGTAGAAATAATACCTAATCCTCCACCGGAGCAAAGAAGCGGAGAATTCCTTTTCAAATGTTGTGTACCCTCATGTGGTAAGACACGTAAATATGATGAGGCACAAATGAATAGTTTtcccaaaaatataaaactctTCCGCAAATGGAAACATAATCTAAAGTtggattttcttaattttaaagaaagagAAAGGTATAAAATTTGCAATGATCACTTCGAGCCAGTTTGTGTGGGTAAAACCCGACTTAATTTTGGAGCCTTGCCCACCTTGAATTTAGGACATGATGACTTAGATGATTTATATCAAATTAATCCCGATAGAATAAGacctaatttatttattaaacaaaaagatGTAGAAAGATTAGAAAGGAAAAGAATCGTAAGAGAAGAAAATGCGGAACAATATGATTGCGATGACCAAGATGATGAGGCGGCCGATCCCTTAAGTATAGAGCCAAGCGATATGAAATGTTGTGTTACGGAATGCACTGCCCCTAAATCAATAATGAGAGAACCTTACGATTTACCAGATACTAAGGAGATTAGGCAGATGTGGTTAAAAGAACTTGAGAAAACTGAAATGCATGATTTACCGCCGGAATCTAAAGTATGTGGATTGCATttccaattaatatttaaaaggcTTAAAAACCAAATGCTGGAAGTGTTAGAGACCAATGAAGACTTAAAATCGGATTTTAATAAACTGCAATACGATTATCAAAAGTCTGACATATCCCTGGTTATAAACAGTTATCAGTGTAGGGTGGAAGGATGCCCCACTAATTTACTTAATTCTTCAATCAggctatatttttttccatacgGCAAACATTTAGTAACAAAATGGTCTAACAATACAGGCATAATACCCGATGAAAATCGCAGATACATGAATAAAGTATGTGCTTTACACTTCGAGTCTTTTTGCATTACTGAAAATCAAAGATTGAGATCTTGGGCCATTCCCACAATTAATTTACCTTTCTCTGcagaagaaaaacatttatataagaATCCTGATTTAACTAAAATAGACAGAAGAATGTTGGGACCTCAAATTTTAAAGTGTGTTGTCAACAATTGCAATTATCTCAAAATGGTCGAAAACGAATCCCTTAAGCTTTTTAACTTTCCCACTGATGAAGTGTTGTTAAAGAAATGGTGTAATAACCTTAAAATGCCTCATCATTTCACACCCTTGCTTAAAATATGTTCTTTGCATTTTGAAAAGATGTGCTTCGGTAGTTGTCGTATACGTTCGTGGGCCATACCCACTTTAAATTTGGGTCATAATGATGTTCCCGAACATTTGAATAAAACAACTATAAGACAAGAAGTTTTCGATGCTTCTGAAGATATTTCTGAAATACAATTGAAACAAGTTAAAATCAAAAAGTCACTAGATGGTACGAAATGTTATGTACCCAGCTGTCGCAAAAGTAGATTGAAGCATGGTGTGCGTTTTTACAATTTACCTtcaaatttgaaaatgaaacgCAAATGGTTGCACAATTTACAAATCAGGAAAATAAAGTCTTCCCAAAAAGtgcataatattaaaatttgcaatCTACATTTCCACAAAAGATGTTTAGATGGCAAACATTTAAAGCCTTGGGCTGTGCCCACAAGGCTTTTAGGACATAATGAATCCGTTTTCGATAATCCGCGAAAAGTACGAGCATTACCGCCATTACGTTGTGGCCTTGCACACTGTGGAAATCATACATGTCCAAAGGCAGTACGTACGTTTGCATTCCCAAAATCACCAGATATTTTGGAAAAGtggtcgaaaaattttaaattggaattaGAGAAATGCAAGGGAAGAATATGTTatgaacattttgaaaaagacGTCTTAGGTATAAGAAAGTTGAATAGTGGAGCCGTACCTACTCTTAATCTGGGTCATAGCGATAAAGATATTTATGATAATACAGAAATAATAGGATTTAAGGATTTAAATCGACATtcgtgcaaaataaaaattacggaGCAAGACGACATGATTGACGAATTTGAACATGTGACAGATTTTGAAGAGGAGGACGAAGTATGGGAATCTGAAGAAGGTGAAGAAGAAGAGGATGATGAGCAAATatattatgatgatgatgataatgacgaGGAGGAGGTGGATGATGTAGAAAAGGAGAATACACAAGATGATGATGAAATCAGTGTAGCAAATTCTATGTCCGACTGGAGCTCTATTAAGCTTAGGGAACTTAGGGTTTCTATAACTCCCTTAACACCTGAAGATTTAATGGATTTGTGTTCTCGTTCTTCATATGAAAAGGAATTTGGACCTTTGACACCAGCAAGTCATTTAAGAGCACGCAGGTCTGTAACACCTGCTTCAAACTGGAAGGATTTGCGGTCAGAGACTGCCGATCAAAAGTCCAACAGGTCTGAAACACCCGAcaagaaaacatttaattactTTAGAGAACCTCGCTCAGTCACCCCTGAACAGAAAGCAGATAATTTTCTCGAGCCTGATAGAAAATCTGTTAGCCCGAAGGAAGATCCTCTGGGTGAAACTTTGGATGGTCTATCTTCCAAAACTCCAAACCAAATAGAATCACTAGTTTTTTCAGGAGAAGCGATTTCGGAGCTTGATGTGTCTGCTAACTGTATGAAAAGGGCAAATACTCAAACAAATAATGAAGTTTCCAAACGAGAACATTTGGAAATCTCGGAAGATGAAGTCACTATTAATTCTTTAACAAATCAAGTGGAGCTTACCAGTTTCACTACCAATCTAAGAACAGATAAAGCCCTTAACGCGGTTGCTCCCATTTGTTGTATGAAACACTGTGGCAAAGAAAAAACACCCGAACAACATTTAACTACTTATGGCTTCCCTAAAGATCCTCAACTTTTACAAAAATGGTGTGACAACTTAGGCTTACAACCTGAAGAGTGTATTGGACGTGTCTGCATAGATCATTTTGAACTGCGAGTAATCGGCACCCGCAGGCTCAAATTAGGAGCTGTCCCCACTTTGAATTTAGGCCCGAGACGTATTGCCAAGCACACTAATATGGAGGACACAGCACAAAAGAAAACTGTAACAAAGGAGTGTCCTGAAACAACAAATATGCAGGAGGCGGGTTCAAGTCCAAAGGCACCGCCACCATATAAAACTCCCAAAGCTGGTAAGCAATCGGTTTTTCGGCTATGTTGCCTCAAACATTGTCGACGCAAGAAATTCTTGAAGCAGGAGAAGAAAGAGAAGCAACAGTTACTGATGGAAAGAATGGATTGCCAGGAGAAAACACAGGAAGTCTTGTTTAAATTTCCTactgatgaaaatattttaatgaaatggtACAAAAACTTAAGACTACCTGAACAATTAAGTATAACAACAGATTTACAAATATGCTCTAAACATTTTGAATCTAGTGTTATAAAAAATGGCAAATTGCATTCTGAAGCATTACCCACTTTACAATTGAGTTATGCTAATCGACCAcctatttatagaaataaaccACAGGATTGTAATAGCCTCGGCTTTAAAATCAAGCATAAGTCGACGTTAAAGTGTTGTCTACCCCAATGTGGCAATAAAATATCGGATGACATCTTCTTATTATCATTCCCCGAAAATGAACCTTTGACTTTCAAGAAATGGTGTAAAAATTTGAAACTAAGTTTTGAGAAAGGAAAACATAAGCATTTGATGATATGCAATCAACACTTTGAACCTTAtgttttctataaaaagaaatatttacgaCCTGGTGCTTTGCCCACGGTTAAATTAGGACATACGGATGCTATCATCAGGAATTGCCGCAAACTTCGCTTGAAAAGAGAAAATGTCAGTGTCATTAGTGAGAAATGTTGTATAGCCGAATGCAAAGAAATGAACCTTAAACTTTATTCATTTCCACGAAGTTCAGAATTACGTAAAATTTGGTGTAACAATGTGCAAATCGAATTACGCCAGGCTCTCCACAATCATTATAAATTATGTGCACGCCATTTTACAGTGGATAGTTTCATAGTAGGCACAGACAATCTAAAGCTAAATGCTGTACCGGTATTAAAGTTGGGCCTAAAAACTGACAATCATTTGTTAATAACAACAAATGCTGCTGAAAGCAGATGTATCGTAGAAAACTGTCAAAAAACGCCAATTGTTGATAAAGTGAAATTGTTCAAGTTCCCACAAAAACAGGAAATACTTAAGAAGTggctttttaatttgaatatatcaGCAGATACTCTTAATCCGTATGATGTTGTATGCAGTAAACATTTCGATAAGAGTtgcattaaaaatggaattttgcATGAGAAAGCTATACCTACACAGTTTTTAGAAGTTTCACCTAAAGGCtggttttataaaaacaatgacGATTTGTATGAAGTACCCAGAAAATGCTGTGTTCTTGGTTGCCAGCATTCTTCCGAAGAAGCAAGACATTTGTACAGATTTCCTAAGCACAAAGAGGATTTAGATAAATggctttacaatttaaaattgcaagTGGAGGAGGCGGATGTTAAGGATTTAAGAGTATGTGACAGCCATTTtgagcagagttgtaaaatctCTAATAAGGATTTAATAACCCAGGCCTTGCCTACTCTCAATCTCGGTCACAGTGATAGCGAAATTTATGGCAACAACTTTATTAAATGCTGTTTGGATAGCTGTAGTATAGAGGGATTTTACTATCATAAATTACCCGAAGATTTGATGATACAAAGTTTTTGGTTTCAAGAACTTGAAATGGAAACTACTTATAACACTTCTATGTATATATGCTCGGTGCACTTTGTATCATTCTTTGAACGAATATTGGAGAAATACAGCTCTTTTCTTAATGAATCTGGAGAATATGTAAAACTTTCTGTTACCTATAATGAGCTTAAAGCTCTACCTGCCTTACAATGTTATAAATGTCATATAACCAAATGTAATTCTGGTTTTAAACTAATctggaaattatttaaatttccgaAAGATGAGACGTTGTTCAACAAGTGGCTGCATAATACTAGTTTGAACTTTGATTATGATCAACGCAATTGTTATCGTATTTGTTCTCAACATTTTGAGGAAAGatgtttaagtgaaaaaaaattacaccgcTGGTCTTTGCCCACTCTAAAACTACCCTTCAATAATAGTTTATATGTCAATCCCCCTGAAGCTTTGCCATCCAATCATGAGAACTTAAGGCACTGCTGTGTGTCTAATTGCCCTACCAATAGGGgaccattttataaatttcccgTTAAACAGTTGGAAGTAAGGAAATGgatacataatttaaatttgggTAATCAACAATGCACTTTAAACTTACGAGTTTGTTATAAACATTTTGAGAACTATTGCTTCTCCAAggctgtaaataaaattaaaccctTGAAATCGTGGTCGGTACCTACTCTcagattgaaaagaaaaactgatCTTTTTCTCAATCCAGCTGACAAAATTGCCTTCTACGTTtgttgtatagaaagttgtagacaaattcttaataaatcCAAAGAGATTTATCTGTTTAAATTTCCTTTCAGTAACACCTTGAAACAGAAATGGTTACACAATTTAAATATGGGCAAACAGGATTATAAGGAAACAATGAGAATTTGCTCTGCACACTTTGAAATGCATTGTTTCTACAAGGGTTTTAGGCTATTACGAAAACATTCTGTTCCCACTTTGGCTCTATTAAAACCGCCTACGGACCTCTATAGAAATCCTGCGAGAAGGgcttattttaaatgttgtgtCAAATTGTGTAAAGCACCCTGGGAACAACTGTTAAATTTTCCCAAGGATAAGACACTTTTAAGAAAATGGTGTCATAATTTACAATTGGATAAAGAAATACAATTAGAGTCCTTGAGGGATTGGAAAATATGTGGACGACATTTTGAACAgcaatgcataaataaatttggTATAATAAGAAGTGTGGCAGTGCCTACCCTTAAATTAGGACatcgaaaaaaattgtttcaaaatccggatttttctttgaaatccaACTCTAAAAATGAAATGGAACATTTAAATGatgaggaaaatttaaaaaacattgaaaatattgtgataaagaaaatattggatGATATTCCACAAGAAAatacaaacattaaaattaaaagtattaaaactaCACTATCGGCTAAAGTTAATAGTTCAAGTCCTCTGCAAACTTTAAGGAAGCCTAAAAATTTGCTTATCAAAAAGCCTCTCAACGTTAAAATTATATctaagaaaagttttaacaaaaaaattcgaaaagttattaaggaaaatgaaaattataaaccaCCAAAGCAATTTAAAATCGAGAAGAAAAAAGAATGCGCTACGGTAGAGTCTACTAACGTTACGCAAGTCAAGGAACAGGATAAATTGTTAACGGATTTAAAGCCGGAAAACGAAGAACAGAAATCTTCAATAGAGAACATTAAACAAGAATCTACAAATATTTCATCTACTGAAATTAGTAATAAACCGAATATTTCAAACACTATAGTGCGTGAACACAACATGTTGGAAAGCAACATACAGGAAGATGCTTATCTAGAGAACTTACTAGAAATTTTAACTGAAAGTTTGCCTGAAAATgaggatataaaaaaaatgccacaGCTGCTAAAACAAGACAGTATTAGTTCCGACACACAAATGCAGGTTGCAGCAGATATGCAGCAGGATAATTTTACGAATTGTGCACAAAATGTAAagcaagaaaataatatatttaagaattacGAAACAAAACAGGAAGTACAAGTAGAGAACCATAAAGAATTCCAACAGTCAGAGGAGGAAAATGTAAAAGCCATAggcttcaaaaataaaaatttcattttttgctGCATAAGAACCTGTGCTAATTATGGGAATTTCAAACCGGACTTAGTGCTTTTTAAGCTACCCATTGTACGTAAACTGCGTAATCAATGGATAGAAAATTGTAAACTAAAACAATATTCAGGAAATTACACATTAAAAGGATTTCGCGtttgtataaaacattttgcCAAACATTGTATTAAAGATAATAATCGTCTAGTGTTGGGTTCAGTGCCGACATTGAACCTCGGCAATAGTCTAGACTGTATAAAGCCATTGAACAAAGTTAACTATTTAAGATGTCGAGTAAAGGGTTGCCAAAGATCGAGTCAGCGTGATAAGATCAATCGTATACCATTTCCTCAAggagaaatgaaaagaaaatggtGTTTGAGTCTGAATATTGAAGAGGATAGCATTACTGCCGAAGATTGGATATGTCATAGACATTTTGAACGAAAAGCTTTAATAGGTTGTCGTAAACCCAAGTCCGGAACACTGCCTACTTTGCTATTGGATAGCTGggagaaaaattcaaaaaattgcaGTATGCCTTTAAAAACGTGTAATAGTGGTATGGGGAAACATTGTAGGCACCCGAAAATCCACAAAGAAATCAAacttaaatgtttatttcccCTATGCAAAGAAAAGTCTCAAGGTTTACATGATTGGCCTGATAAagctatttttgttaaaatctggctgttagttaaaaaattaggaaGACATGCCGAAGATGCTAATGCCTGGAAGAAAATGTGGCAACAATCTTTCCCGAATGAAGACGAAACTTCAAATTCGGAACTAAACATCAAATTGTGTgatgaacatttttattatctatataaaacaaatgaaaaagcaaTCCATGACTATGAAATAGCTGAAGAATACAAGATTCTAAAACAAAATGTTCAACTTACTTTTgactttttaaattctttagaaaaaatgtatacaaagCAATGTGCTGTGCCACAATGTAAAACAGATCTAAATATTAAAGGCTGTAGGCCTTTAAAACTCTTCGATTTCCCATCCGAAGAAATGGCTAAAAAATGGTGCCACAATATTGGTATAGAATACAATTCGCTCAAAGCGAAACCTTCTcttaaaatatgtgaaattcatTTTGAAGACTATTGTTTGCTAAGAAGAAATCTGCTTGATTGGGCATTGCCCACTCTAAATTTACCGCTTACAAAAGACCCTCAagatattaaacaaaatgatgctgataaagtatttaatataaaagctAAATGCTGTATTAGCACTTGTCCCAATTCTCAGTTCTTAGAAACAGAAAGCAATTTaagattatataaatttccTAAAGACCCCATGCTACTTAAAAGATGGCTGGAGAACACAAACTGTGAACAGACTTTTGATGAAAATATTACACGCATATGTGCGCTGCATTTCCAAGCATCTGATATGCTCAATAAGAAAACAGTTCTAAAAGAACACGCTGTTCCGCAATACTATTTGGAGCACCATACAAAGTTTTCGTACACCTCTCCTAATAGCTCGATTATAGATGAACACATACAAGTTAAACAAGAATTGGATAATAGTGAAGAATGGTGTGTTCCCTTAGAGCATGAAGCCAATACAAATGTGAATTCAGATTTTTCTCTGAAAAGTTCAACAAATGAAAATCTATTTGAatctaaatttaaagaaaacaatggAGAACAAGAGGATTGCAATCAATTCACAGAAATAAAGCAGGAAATTATAGAACTGCAAGAAGAAGAGCCACAAACGTCCCTATTTAccatacataaatatgaaaCGTCCAGCCCACCCGATCTGAAATACTCGTATTCTAATGGAAATACATCGATTCCACCAGCAACTTTTGTTATAAGCGATGTCAAATCGCAAATATATATGTGTTGTGTACAAAAGTGTAGCAACAATTCGGAAAGCCCCGGTATACGTGTATTCAACGAGTTTCCCCACGATTCGGAAATATTCATTAAAtggtgttttaatttaaaaattgatccTCGCAACTATAAAGAAAACCAATATGCCATTTGTGAACAACATTTTGAACCTATATGCTTTACTGAAAATGGTCTACTACAAAGTTGGTCGGTACCTACTTTGAATCTTAATTTAAATGAACATTCTTTCATACACCAAAACGATATACCTGAACATTTAAAGCCCTCCAATGAACAGTGTATTGTATATGGTTGTATTAATCCCTTACAACCTCTATTCAAATTTCCATATAATCCTGATATTTCACTTAAATGGTTTTCCAACTTAAAACTAGACTATACTGACTTTCGAGCCCAAAATTATCGCATATGCCGAAGACATTTTCCTCCCATATGTTTTGAAATATGCGATGTTAATAAATTGACTAGTGAAGCTGTACCAACTCAATTTCTGGGTCACATGGATAAAATTTGGCATTTCAATAATGCTGAAGAACAGCAATTACGTCAAGATGGTATGGCTGGTAGTCTTAGTAATCAAGATAACAGTCGTGGCAGCAGTCAGGGATCCTTAGCAAGAATATTATCTCCTCATGATCTGGAAGATCACGATAGCAGTTATTATGAAGATTTTGAAGAATATTACGGACAAGATGATTAA
Protein Sequence
MSQSNQRKHFHIHAPYQHPQQQQQQQQAQLHHHGHHLTPTQQQQSQWFSQQHYQHGLHLRDSRHIQHGQHHPHHHHTQQQQQQSHHNHTMTPHMFTSGYVGGVVGGGGGGSNTGNAVGSGVAISNSSHNAATIGATAHNMPASSSSAHPYSSAMPVPASGGSVGVNSSGGSYAGRNRIFDLEMLTTQTQQHAATSAAHSHSILSSGSSSGRQGFDAYSHNSMYAQQNQRHLLPPASSHHHLAPTHHSAANSLHPHHHTQQLHHPHQQPPSSLHHHQQHQHHQHQQQQHQQQQHYYHHPQQSSLHRPHTQVIPSMLQHIKSEPVEQITTTSSIQTEEVIIKSEPVDDIGYHHKSAPQFENKYLHIEEKRKQLEQQQQRQQELQQQRQQQQQRLQQEQRAHQQQQLHEQQLHQHGLQQIKQEHYHHPESEQSHNEHASQQTQQRTNSENSSIMQPTVDEKQQLLQQHQQPQISLSNIKTEAKPLNFPRRKLQTERSSTLPICQRCKQVFLKRQNYLQHVALSSCNIVEYDFKCSVCPMSFMSNEELQTHEQLHRSNRYFCQKYCGKFYETIAECEQHEYGQHEYEMFKCNICCISVTQREQLLEHLNEHKYQPRFDCCICRLCFQTAIELHDHYMSNEDFCGKFYDKEAFKKTNTSSSSPYLGKLESSKLEIANTFSLKDIPPGNSHHLERLYAKPTSSKTSMQPPNTIPTIPSSLSFSTVNEFAALEPHVEVKTEIKVEPDFYPPMDQSDFASYDNDYSTPDYTSTSSNQNFSFLQDYQDNASSSTTSSYSFNNNDAIQDEEAICCVPKCGVRKFSSPSLQFFGFPKEDKYLSQWLHNLKMTYDPNVNYCAYRICSLHFPKRCIAKYSLSYWAVPTFNLGHDDVGNLYQNRESSGGFPGGEMAKCSMPGCPSQRGETNVKFHVFPRDLKTLIKWCQNSRLPVHSKDNRFFCSRHFEEKCFGKFRLKPWAIPTLNLGTVYGKIHDNPNIYQEEKKCFLPFCRRSRSYDCNLSLYRFPRDETLLRRWCYNLRLDPNMYRGKNHKICSSHFIKEALGLRKLNPGAVPTLNLGHNDRFNIYENELYTPPPPPPPQPSTSSKAHKYTQLFKQERESSSSSHIYDGVFMNSMVQKFSSASSNSSNNLDLGDVCLVPSCKRTRHSNDITLHTVPKRAEQLKKWCHNLKMNLVKMHKSARICSAHFEKYCIGGCMRPFAVPTLELGHEDTNIFRNPDVIKKLNIRETCCVQSCKRNRDRDHANLHRFPTHPELLQKWCENLQKPIPDGTKLFNDAVCEVHFEDRCLRNKRLEKWAIPTINLGWDDAPHNLPSEEEINESWVKPFAPNNGDEQGECCVISCKRNPQIDDVKLYRPPEDAEQLVKWAHNLQVDVTELPNLKICNLHFEQHCIGKRLLNWAMPTLNLGGKVEHLFENPPPMPAIYKKKIKPEKILNNQESVKWSPRCCLPHCRKMRSVDKVHLFRFPYINRQTLSKWCHNLQLPLVGSSHRRICSTHFEPSVLTKRCPMTLAVPTLDLNAPPGYKIYQNPARLKQIKIGAQRQCVIESCRKTKLDGIILFRFPNNRSILYKWRHNIKNWPKGKLSSQMRICSEHFESHSVGVKRLSPGAIPTLKLGHDAKDLYANETRSFFDLEKCVVSGCDSRKEMEDIRLFRFPRDDEELLRKWCHNLKMNSNDCVGIKICSKHFELECLGPRQLYKWCIPTLKLGHRDDSVEIIPNPPPEQRSGEFLFKCCVPSCGKTRKYDEAQMNSFPKNIKLFRKWKHNLKLDFLNFKERERYKICNDHFEPVCVGKTRLNFGALPTLNLGHDDLDDLYQINPDRIRPNLFIKQKDVERLERKRIVREENAEQYDCDDQDDEAADPLSIEPSDMKCCVTECTAPKSIMREPYDLPDTKEIRQMWLKELEKTEMHDLPPESKVCGLHFQLIFKRLKNQMLEVLETNEDLKSDFNKLQYDYQKSDISLVINSYQCRVEGCPTNLLNSSIRLYFFPYGKHLVTKWSNNTGIIPDENRRYMNKVCALHFESFCITENQRLRSWAIPTINLPFSAEEKHLYKNPDLTKIDRRMLGPQILKCVVNNCNYLKMVENESLKLFNFPTDEVLLKKWCNNLKMPHHFTPLLKICSLHFEKMCFGSCRIRSWAIPTLNLGHNDVPEHLNKTTIRQEVFDASEDISEIQLKQVKIKKSLDGTKCYVPSCRKSRLKHGVRFYNLPSNLKMKRKWLHNLQIRKIKSSQKVHNIKICNLHFHKRCLDGKHLKPWAVPTRLLGHNESVFDNPRKVRALPPLRCGLAHCGNHTCPKAVRTFAFPKSPDILEKWSKNFKLELEKCKGRICYEHFEKDVLGIRKLNSGAVPTLNLGHSDKDIYDNTEIIGFKDLNRHSCKIKITEQDDMIDEFEHVTDFEEEDEVWESEEGEEEEDDEQIYYDDDDNDEEEVDDVEKENTQDDDEISVANSMSDWSSIKLRELRVSITPLTPEDLMDLCSRSSYEKEFGPLTPASHLRARRSVTPASNWKDLRSETADQKSNRSETPDKKTFNYFREPRSVTPEQKADNFLEPDRKSVSPKEDPLGETLDGLSSKTPNQIESLVFSGEAISELDVSANCMKRANTQTNNEVSKREHLEISEDEVTINSLTNQVELTSFTTNLRTDKALNAVAPICCMKHCGKEKTPEQHLTTYGFPKDPQLLQKWCDNLGLQPEECIGRVCIDHFELRVIGTRRLKLGAVPTLNLGPRRIAKHTNMEDTAQKKTVTKECPETTNMQEAGSSPKAPPPYKTPKAGKQSVFRLCCLKHCRRKKFLKQEKKEKQQLLMERMDCQEKTQEVLFKFPTDENILMKWYKNLRLPEQLSITTDLQICSKHFESSVIKNGKLHSEALPTLQLSYANRPPIYRNKPQDCNSLGFKIKHKSTLKCCLPQCGNKISDDIFLLSFPENEPLTFKKWCKNLKLSFEKGKHKHLMICNQHFEPYVFYKKKYLRPGALPTVKLGHTDAIIRNCRKLRLKRENVSVISEKCCIAECKEMNLKLYSFPRSSELRKIWCNNVQIELRQALHNHYKLCARHFTVDSFIVGTDNLKLNAVPVLKLGLKTDNHLLITTNAAESRCIVENCQKTPIVDKVKLFKFPQKQEILKKWLFNLNISADTLNPYDVVCSKHFDKSCIKNGILHEKAIPTQFLEVSPKGWFYKNNDDLYEVPRKCCVLGCQHSSEEARHLYRFPKHKEDLDKWLYNLKLQVEEADVKDLRVCDSHFEQSCKISNKDLITQALPTLNLGHSDSEIYGNNFIKCCLDSCSIEGFYYHKLPEDLMIQSFWFQELEMETTYNTSMYICSVHFVSFFERILEKYSSFLNESGEYVKLSVTYNELKALPALQCYKCHITKCNSGFKLIWKLFKFPKDETLFNKWLHNTSLNFDYDQRNCYRICSQHFEERCLSEKKLHRWSLPTLKLPFNNSLYVNPPEALPSNHENLRHCCVSNCPTNRGPFYKFPVKQLEVRKWIHNLNLGNQQCTLNLRVCYKHFENYCFSKAVNKIKPLKSWSVPTLRLKRKTDLFLNPADKIAFYVCCIESCRQILNKSKEIYLFKFPFSNTLKQKWLHNLNMGKQDYKETMRICSAHFEMHCFYKGFRLLRKHSVPTLALLKPPTDLYRNPARRAYFKCCVKLCKAPWEQLLNFPKDKTLLRKWCHNLQLDKEIQLESLRDWKICGRHFEQQCINKFGIIRSVAVPTLKLGHRKKLFQNPDFSLKSNSKNEMEHLNDEENLKNIENIVIKKILDDIPQENTNIKIKSIKTTLSAKVNSSSPLQTLRKPKNLLIKKPLNVKIISKKSFNKKIRKVIKENENYKPPKQFKIEKKKECATVESTNVTQVKEQDKLLTDLKPENEEQKSSIENIKQESTNISSTEISNKPNISNTIVREHNMLESNIQEDAYLENLLEILTESLPENEDIKKMPQLLKQDSISSDTQMQVAADMQQDNFTNCAQNVKQENNIFKNYETKQEVQVENHKEFQQSEEENVKAIGFKNKNFIFCCIRTCANYGNFKPDLVLFKLPIVRKLRNQWIENCKLKQYSGNYTLKGFRVCIKHFAKHCIKDNNRLVLGSVPTLNLGNSLDCIKPLNKVNYLRCRVKGCQRSSQRDKINRIPFPQGEMKRKWCLSLNIEEDSITAEDWICHRHFERKALIGCRKPKSGTLPTLLLDSWEKNSKNCSMPLKTCNSGMGKHCRHPKIHKEIKLKCLFPLCKEKSQGLHDWPDKAIFVKIWLLVKKLGRHAEDANAWKKMWQQSFPNEDETSNSELNIKLCDEHFYYLYKTNEKAIHDYEIAEEYKILKQNVQLTFDFLNSLEKMYTKQCAVPQCKTDLNIKGCRPLKLFDFPSEEMAKKWCHNIGIEYNSLKAKPSLKICEIHFEDYCLLRRNLLDWALPTLNLPLTKDPQDIKQNDADKVFNIKAKCCISTCPNSQFLETESNLRLYKFPKDPMLLKRWLENTNCEQTFDENITRICALHFQASDMLNKKTVLKEHAVPQYYLEHHTKFSYTSPNSSIIDEHIQVKQELDNSEEWCVPLEHEANTNVNSDFSLKSSTNENLFESKFKENNGEQEDCNQFTEIKQEIIELQEEEPQTSLFTIHKYETSSPPDLKYSYSNGNTSIPPATFVISDVKSQIYMCCVQKCSNNSESPGIRVFNEFPHDSEIFIKWCFNLKIDPRNYKENQYAICEQHFEPICFTENGLLQSWSVPTLNLNLNEHSFIHQNDIPEHLKPSNEQCIVYGCINPLQPLFKFPYNPDISLKWFSNLKLDYTDFRAQNYRICRRHFPPICFEICDVNKLTSEAVPTQFLGHMDKIWHFNNAEEQQLRQDGMAGSLSNQDNSRGSSQGSLARILSPHDLEDHDSSYYEDFEEYYGQDD

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00741912;
90% Identity
-
80% Identity
-