Basic Information

Gene Symbol
-
Assembly
GCA_963931995.1
Location
OZ008368.1:71007745-71030900[+]
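
The location string above packs a sequence accession, a coordinate range, and a strand into one field. As a minimal sketch, it could be split as follows; `Location` and `parse_location` are hypothetical names, and treating the coordinates as 1-based and inclusive is an assumption that this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "ACCESSION:start-end[strand]".
_LOC = re.compile(
    r"^(?P<seqid>[^:\s]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into accession, start, end and strand."""
    m = _LOC.match(text.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(m["seqid"], int(m["start"]), int(m["end"]), m["strand"])


loc = parse_location("OZ008368.1:71007745-71030900[+]")
# Genomic span, ASSUMING 1-based inclusive coordinates.
span = loc.end - loc.start + 1
print(loc.seqid, loc.strand, span)
```

The span computed this way covers the whole locus on the assembly, so it is expected to be at least as long as the coding sequence listed further down.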

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
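
The C2CH spacing given above can be turned into a rough pattern scan. This is a minimal sketch, not the method behind this record (the domain calls listed here come from hmmscan against PF05485); the name `find_c2ch` and the regular expression are illustrative assumptions derived only from the stated consensus spacing.

```python
import re

# Consensus from the description:
# Cys, 2-4 residues, Cys, 35-50 residues, Cys, 2 residues, His.
C2CH = re.compile(r"C.{2,4}C.{35,50}C.{2}H")


def find_c2ch(protein: str) -> list[tuple[int, int]]:
    """Return 1-based inclusive (start, end) spans of non-overlapping matches."""
    return [(m.start() + 1, m.end()) for m in C2CH.finditer(protein.upper())]


# Synthetic toy sequence (not taken from this record) with one C2CH-like span.
toy = "MK" + "C" + "AA" + "C" + "G" * 40 + "C" + "SS" + "H" + "LL"
print(find_c2ch(toy))
```

A pattern like this only checks the spacing of the four zinc-coordinating residues, so it can miss divergent domains and report spurious matches; a profile HMM search remains the more reliable way to call the domain.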
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 34 1.4e-13 2.7e-10 43.1 1.1 1 86 473 545 473 546 0.85
2 34 1.9e-14 3.9e-11 45.8 4.1 1 86 573 641 573 642 0.80
3 34 1.8e-14 3.7e-11 45.9 0.5 1 87 663 735 663 735 0.83
4 34 2.7e-13 5.5e-10 42.1 2.7 1 86 814 882 814 883 0.78
5 34 1.2e-14 2.5e-11 46.4 6.1 1 87 907 979 907 979 0.80
6 34 1.9e-11 3.7e-08 36.3 0.8 1 87 1014 1082 1014 1082 0.80
7 34 4.4e-10 8.9e-07 31.8 3.3 1 86 1124 1193 1124 1194 0.75
8 34 1.4e-15 2.9e-12 49.4 0.2 1 86 1221 1291 1221 1292 0.81
9 34 1.8e-13 3.6e-10 42.7 0.9 1 86 1314 1383 1314 1384 0.80
10 34 5.3e-13 1.1e-09 41.2 3.0 1 87 1412 1484 1412 1484 0.86
11 34 0.00028 0.56 13.3 0.1 1 62 1552 1608 1552 1628 0.69
12 34 9.8e-11 2e-07 33.9 0.4 1 87 1649 1721 1649 1721 0.80
13 34 2.3e-14 4.7e-11 45.6 0.7 1 87 1751 1822 1751 1822 0.80
14 34 5.1e-12 1e-08 38.1 2.9 1 87 1868 1940 1868 1940 0.82
15 34 4.6e-14 9.3e-11 44.6 1.0 1 86 1963 2031 1963 2032 0.79
16 34 1e-13 2.1e-10 43.5 0.3 1 87 2287 2356 2287 2356 0.80
17 34 3.5e-13 7e-10 41.8 3.3 1 86 2415 2498 2415 2499 0.83
18 34 4.5e-10 9.1e-07 31.8 5.2 1 87 2536 2608 2536 2608 0.79
19 34 3.4e-13 6.8e-10 41.8 0.2 1 86 2634 2703 2634 2704 0.81
20 34 1.8e-12 3.7e-09 39.5 0.9 1 85 2728 2796 2728 2798 0.80
21 34 8.9e-14 1.8e-10 43.7 3.0 1 86 2819 2895 2819 2896 0.81
22 34 5.7e-05 0.11 15.5 3.8 1 62 2914 2967 2914 2986 0.73
23 34 3.6e-12 7.2e-09 38.5 1.7 1 86 2996 3065 2996 3066 0.82
24 34 6.3e-11 1.3e-07 34.5 4.8 1 87 3091 3163 3091 3163 0.78
25 34 1e-11 2.1e-08 37.1 1.8 1 86 3183 3253 3183 3254 0.79
26 34 1.9e-08 3.9e-05 26.6 0.8 1 87 3274 3346 3274 3346 0.77
27 34 9e-09 1.8e-05 27.6 4.0 1 87 3850 3923 3850 3923 0.77
28 34 5.7e-10 1.2e-06 31.5 0.6 1 86 3947 4017 3947 4018 0.76
29 34 2e-13 4e-10 42.6 2.1 1 87 4049 4121 4049 4121 0.84
30 34 0.00039 0.78 12.8 0.2 1 58 4157 4205 4157 4222 0.85
31 34 2.4e-11 4.8e-08 35.9 3.6 1 87 4250 4328 4250 4328 0.84
32 34 2.6e-12 5.2e-09 39.0 2.2 1 86 4352 4424 4352 4425 0.84
33 34 1.9e-12 3.8e-09 39.4 3.7 1 87 4631 4705 4631 4705 0.81
34 34 2.2e-11 4.5e-08 36.0 0.1 1 86 4724 4791 4724 4792 0.78
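
Each row of the table above holds 13 whitespace-separated fields: domain number, domain count, conditional and independent E-values, score, bias, the HMM, alignment and envelope coordinate pairs, and accuracy. As a minimal sketch, rows of this shape could be parsed and filtered as below; `DomainHit`, `parse_row` and the `1e-3` cutoff are hypothetical choices for illustration, not thresholds used by this database.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_row(line: str) -> DomainHit:
    """Parse one whitespace-separated domain row into a DomainHit."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        int(f[6]), int(f[7]), int(f[8]), int(f[9]), int(f[10]), int(f[11]),
        float(f[12]),
    )


# Arbitrary illustrative cutoff on the independent E-value (an assumption).
I_EVALUE_CUTOFF = 1e-3

# Three rows copied from the table above.
rows = [
    "1 34 1.4e-13 2.7e-10 43.1 1.1 1 86 473 545 473 546 0.85",
    "8 34 1.4e-15 2.9e-12 49.4 0.2 1 86 1221 1291 1221 1292 0.81",
    "11 34 0.00028 0.56 13.3 0.1 1 62 1552 1608 1552 1628 0.69",
]
hits = [parse_row(r) for r in rows]
confident = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]
for h in confident:
    # Print domain number, alignment span and aligned length on the protein.
    print(h.index, h.ali_from, h.ali_to, h.ali_to - h.ali_from + 1)
```

With this cutoff, domain 11 (i-Evalue 0.56, covering only HMM positions 1 to 62) is dropped, while domains 1 and 8 are kept.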

Sequence Information

Coding Sequence
ATGAATGAAGAACGTAAACAgcagcaacaacgacaacaacaacatcaacagcagcagcaacaacaacaacagcaacaccatCAACATCGAGTACAACAGCCACCATTATCGTCACAAGAGCAAAAGCATCATAATCAACATAATccacatcatcatcaacatcctcCGCTGCAGCATCATGAGCAACGAGACCGGCACCAGCAACAGTCTGAGCCTCAGCCACAGCCacaacatcatcgtcatcgttTACCAACTCAACATCATCATCAGAGTGAAATTATatcacagcaacagcaacaaacgCACCAGAAGCCGCAACAacaatcatcatcgtcatcaccaGCAGCAGAAAATTCCAACGCACCAACAACGGCAGAGGAAAAACGGCAACTACAACCATCCTTGGCTAATATTAAAAGCGAACCGaagCCTCTTAACTTTCCACGCCGCAAATTAGTACAAACAGAACGGTCATCCACTCTACCCATATGCCAGAGATGTAAACAGGTCTTTTTAAAACGTCAGAGCTACACACAACATGTTGCTCTATCCTGTTGTAATATCGTAGAGTATGATTTCAAATGTGCTATATGCCCCATGTCCTTTATGTCCAATGAGGAACTGCAGGCCCACGAACAATTGCATCGCTTAAATCGTTACTTCTGCCAGAAATACTGTGGCAAGTACTACGAAACGATTGAGGAGTGTGAACAGCATGAGTTTGGACAACATGAATACGAAATGTTTAAATGTAATATATGTTGTTTATGCTTCCCAAAACGTGAGCAGTTATTTACCCATCTTATGGAGCATCGACAGCAGTCACGTTATGATTGTTGCATTTGTCGTATATGCTTCCAGAACGTATCGGAATTGGACGATCATTATATAGGGAATCCGGATTTTTGTGGCAAATTTTATGATAAGGAAGCTTTcaaatctttaaaaatattttccaaacccAAGCCACAACCACCACAGCTTTCGACAACCTCCCGATCTGATGGCCTTACAAGTTTCATGATTAAAGACATTACCTCAACTAAAATGCCCGAAAAAGGTCTGCCGCCAGCTAAACAACGTAAACCAGCACCTACACCACAACCAGAGCCAGACTCACCACCAAATTCGTCCTTGTATGACGACATACCAAATTTTGCTGCGCCACTGGTAGAAGTTAAGACAGAAATTAAAGTTGAACCGGATTTTTACCCGCCTATGGATCATGGAGATTTTGCAGCATACGACAATGATTATTCCAATTCTCAAGATTATTCACAGGGTTCCAATCAAAATTTAACATTCTTACAAGATTTCCATGATAACGCTTCAAATTCAACCAATTCATCGTTCTCTGTCAATACCACCACGTCTACTTCCAATAACAATGATGCTACCCAGGATGAAGATGCCGTTTGTTGTGTGGCCAGATGTGGTGTGCGAAAATTCAGCTCGCCCACATTACAGTTCTTTCCATTTCCACGAGACGAAAAGTATCTGTCGCAATGGTTACACAATTTAAAAATGACATATGATCCCAATGTAAATTATGGTATATATCGGGTGTGTGGTTTGCATTTTCCAAAGAGATGTGTTGCTAGACATTCACTAAGTTACTGGGCTGTGCCCACTTTTAATCTGGGTCACGACGATGTAGGAAATTTATATCAGAATCGGGAAAGTTCGGGAGGATTTCCGTCGGGTGAAATGGCTCGATGCTACATGCCCGGCTGTCTTTCACAGAGAGGAGAGACTAATGTAAAATTTCATAGCTTCCCTAGGGATTTAAAGACTTTAATAAAATGGTGTCAAAATTCTCGGCTTCCTATTCACAGTAAGGAGAATCGTTTCTTTTGCTCTCGCCATTTCGAAGAGAAATGCTTCGGTAAATTCCGACTTAAGCCTTGGGCTATACCTACGCTACGTTTAGGCACAGTATATGGGAAAATACACGACAACCCTAACATTTATCAAGAGGAAAAGAAGTGTTTTCTGCCATT
TTGCCGTAGGAGCAGATCATTTGACTGTAACCTGTCTCTTTATAGATTTCCGCGAGATGAGACTTTATTGAGAAGATGGTGTTATAACTTAAGATTAGAACCTGAAATGTATAGAGGTAAAAACCATAAGATATGCTCTTCACATTTCGTGAAGGAAGCTCTGGGCTTGAGAAAGCTTAATCCAGGAGCAGTGCCCACAATGAATTTGGGTCATAATGATAGATTTAATATCTACGAGAATGAATTATATACACCGCCTCCACCTCCGGCACCACCACAGCCATCTACATCTTCCAAAGCCCAAAAGTTTGCTGAGATGTTTAAACAGGACATGAACTCTGCCTCTGCCATATACGATGAGGTCTTTATGAGTTCAATGGTACAAAAATTCTCCGGATCGAGCTCTGTCAATGTCCCTAATCTAGATTTAGGGGATGTATGCTTAGTTCCGACGTGCAAGAGAACGCGCCATACCGATGATGTAACCCTACACACAGTGCCCAAGAGAGCCGAACAACTAAAGAAGTGGTGCCACAATTTGAAAATGGATTTAGAGAAGTTGCACAAGAGTGTACGTATTTGCAGCGCCCATTTTGAAAGCTATTGCATTGGAGGCTGTATGAGACCCTTTGCCGTGCCCACCTTGGAATTGGGTCACGATGATACAGATATATTCCGCAATCCTGATGTTATTAAGAAATTGAACATACGTGAAACTTGTTGTGTACCTTCGTGTAAAAGAAATCGTGACCGCGATCATGCAAATCTGCATCGATTCCCCACTCATCCGGAATTACTGCAAAAATGGtgcgagaatttaaaaaaacctatACCGGACGGTACTAAACTTTTCAACGACGCCGTTTGTGAGGTCCATTTCGAGGACCGATGTTTACGCAACAAGCGCTTGGAAaaatggtctattccaacaatgAATTTGGGCTACGAGGATATAGTGCACAAACTACCCTCAGAGGAAGAGATAGCAGAGCATTGGACAAAATCCTTCACCCCCAATAATGGGGACGAGCAAGGTGACTGTTGTGTTGCCTCATGCAGACGTAATCCTCAAATAGATGATGTAAGACTCTATAGACCACCCGAAGATGCTGAACAAATGGTTAAATGGGCTCACAACCTACAAGTTGAGGTTACTGACTTGCCAAATCTTAAAATTTGTAATCTTCACTTCGAGTCTCACTGTATAGGAAAGCGTTTATTGAATTGGGCTATGCCTACTCTTAATTTAGGCTCTAGAATTGAACATTTATTCGAAAATCCTCCACCCACCCAGGTGGTGTACtacaagaaaaaagaaaaagcCTTGAAAATCGCTTCCAACCACGACGGTATGAAGTGGGCACCAAGATGTTGTTTATCACATTGTCGGAAGACAAGGTCATACGACAATATACAACTCTTCCGATTTCCTTATATCAATAGATCAACATTGGCGAAATGGTGCCATAACCTTCAGCTGCCTTTGGTTGGCAGCTCCCATAGGCGTTTGTGTTCCTCACATTTTGAACCTTCGGTACTAACTAAGCGCTGCCCTATGAATTTGGCAGTACCGACCCTACATTTGAATGCACCAATTggatataaaatatatcaaaatccGGCTCGTTTAAAACAGGTAAAGATTGGCGCCCAACGTCAATGTATCATCGAAATGTGCCGAAAGACCAAGGTAGATGGCGTTGTCCTATTCAGATTCCCCAATAATCGTACCATTTTGCAAAAATGGCGCCACAACATTAAGAATAGTTGGCCCAAGGGAAAATTAAGTTCCCAATTAAGGGTGTGCTCGGAACATTTTGAATCCCATTCGGTAGGAGGTAAACGACTATCTCCTGGAGCCATACCTACATTAAAATTAGGCCATGAATCTACAGATTTATATCCCAACGAAACTAGGTCGTTCTTTGATTTGGAGAAATGTGCTGTTAGCGGATGCGATTCCAGAAAGAGTATGGAGGATATACGGCTCTTCCGTTTCC
CTAGAGATGACGAAGAGCTACTGCAAAAATGGTGTaataacttgaaaatgaatccCTTAGATTGTGTTGGTATAAGAGTGTGCACAAAACACTTTGAGGAGGAATGTATTGGACCCAAACTCGTTTTCAAATGGGCTATACCTACTTTGAAGTTGGGGCACAAAGAAGATGATGACGTCGAAATTATACCAAATCCTCCTCCGGAGCAGAGGTCTGGAGAATTTATCTTCAAATGTTGTGTACCCAGCTGTGGCAAAACCCGAAAATATGATGATGCTCAAATGAATAGTTTCCCTAAACATTTGAAACAGTTCCGCAAATGGAAACATAATCTAAAATTGGACATCCTTAACTTCAAGGAAAGGGAAAAATACAAGATATGCAATGACCACTTTGAACCAGTGTGTGTTGGCAAAACAAGGTTGAATTTCGGAGCCTTGCCTACGATCAATCTGGGTCACAATGATACTGAAGATCTTTACAAAATCAATCCGGAACGAATACGTCCGAACTTATTTATAAGCCAAAAGGATATAGATCGCATGGAAAGAAAACAATTAAGATTGGACATGGATCATGATGAACAGCACCTACAAGAAATTGATCAGGAACAGGATGATGATCCCTTAGATCCTCTAAGCACCCCAATGGAATGTTGTGTAGAAGAGTGTAAAGCTCCCAAGACTTTGATGAGGGAACCCTATGATATTCCAGAATGTAAAGAACTACGTGAACTGTGGAGCAGAGTGTTAAAAAAAGACATTACTGATGCAACGGCAGTGGAGGGGAAAATTTGTGGTTTGCATTATCAGCAAATATTTTTAGAGTTGAAACCCGAAATGGAATCAATGGCCGAAGAAAATCCTGAAGTGAAACTGGATTATGGCAAACTTCTATATGCGTACCAGAAATCCGAAATCTCATTAGTAATTAAAACGTATCAATGCAGAGTTGAAACTTGCTCCACAAACCCCTTAAATTCGGATGTGCGTTTATACTATTTTCCTTATggaaaagaaattgttaaaaaatggaCTTACAATACGGGCATTATACCAGATGAACATCGGCGTTACGTCAACAAAGTCTGCGCACTGCATTTTGAACACTTTTGTATAACTGACACCCAGCGTCTCAGATCTTGGGCTATACCGACCATAAATTTGAAGCACGATAATCCGGAAAAGTTATATAAGAATCCAGATCTTACAAGAATCGATAGAAGAATGCTGGGACCCCAAATTTTGAAATGTGCTGTACAAAATTGTGATAATGGCATTGCAACGGATACAGAAAATATACGATTTTTCAACTTCCCCACAGATGATAATTTATTAGAGAAGTGGTGCGCCAATTTGAAAATGTCTCGTCGTCTTACACCTCTTTACAAGATATGCGCCATACACTTTGAAAAGCAATGCCTTGGTAGTTCACGTGTCCGTTCATGGGCCATACCAACATTACATTTGGGCCACAATGAAGATCCCGAATATTTCAATAAGACCACCATTAAAAAGGAAGTATACGAAGAACAATCCAATAAAATGGAACAACTACAACTAAAGCAAGTGAAAATCAAGAAATCTCTGGATTCCTTAAAGTGTTATGTATCCTCTTGTCGCAGATCACGACTCCAACATGGTGTACGATTTTACCCACTACCCGTAAATCCTAAAATGAAACGAAAATGGATGCACAATTTACAAATTGCTTCTCAAAAGGCTGGGAAAGTTACTAATTTAAGGATTTGTAATTTACATTTCCACAAACGATGCTTCGACGGAAAACAACTGAAGAATTGGACAGTGCCTACTATGCATTTGGGTCATTCAGAGCCGATTTTCGATAATCCACGTCGGTTGCAAAATCCCTTGACAGTGCGCCGCTGTGTTTTACAACACTGTACAAATCGTACTGTCACAAATGACAAATTACGAACATTTGTATTTCCAAAAGCCCCTGAATTTTTGGAGAGATGGGCGAAAAATCTTAAATTAGAG
GCATCCCAGTGTAAGGGGCGTTTATGCTCTGAACACTTTGAGGTTGAAGTTAAAGGTGAAAAGAAACTCAAAAATGGAGCAGTACCCACTCTAAAATTGGGGCATGACGacgaattactctatgataacaAGGATTTGATCAAAAGACTTGCAATTAAGGCTTCCCAGAAGGAATCTGACTCGAAAAATGAAGTGCTAGAAGACGATGACTATGAGGAGGAGggggaggaagaagaagaaattgcAGACGACGAAGAATTCGATGATGATGACGAACTAATTGAGTATCATGACATTGACGAAGAAGAGGAGGAAGAAGATGAAgacgaagatgaagatgaagacgaAGAGGAGGAGGAGGAAGTTGATATGGATAAGGTACGAATTCGTGGGACTTTACAACACTGGACTTCCATAAAAATGAAAGAGTTACGGGTAACTCTCGTACCCATAAGCAAAGAAGAACTTTTGGAGATTTCATCGGTATCGTCTTATGAACGAGACCGTCGGTCTGTTACACCTGCCAACAGTTTGAAAGATCTGCGCTCTGAAACCCCCGCAAGTGTGATAGGAGTCAATACATCCAACGAACCCGATCGTCGTTCTGTAACACCTTCTAATAGTTATTCACGCGATGTACACTCCGAAACCCCTATAAGTACGGTAGTAGCGGCCGGTCAAACTTCCAACGACAGGGAACAACGCTCTGTAACTCCAGCCAATAGTAACTCACGTGATATACACTCCGAGACCCCCGCCAGTGTGGGTGGTGGTCAGACATCGTTGGAATGCCATGACGAAAATTCTGCTTGCACTCTGTTAAGAACGGATAAACCAATAAACGGTGTCGCACCTATGTGTTGTCTGAAACATTGTGGCAAGGAGAAAACACCTGAACAACATTTAACCACCTATGGTTTCCCTAAAGACCCTCAAGTATTACAGAAATGGTGTGATAATTTAGGTCTTCAACCCGAAGAATGTATTGGTCGTTTGTGTATAGATCACTTTGAGTTGAGAGTTATAATGACTCGTAGACTCAGACCAGGAGCTGTACCCACTTTGAATCTAGGACACGCAAGAGTACCCAAACACACGAATGATGAGCTGCCCAAAGATAAGAAAGGTGCCATTATAACGGAGACAGAAACTAAGGTGGAAGCGAAAAGTGAACAGATACCAGCTCCACCACCGCCGTATACAAATCCTAAGCCGATAGGCAAACAATCGGTTTTTCGGCTATGTTGCCTCAAGCACTGTCGGCGCAAGAAACAAGCAGAGACGGAAGAGGAGAAACAATGCCCTAAGGTCCAGCGAAAGCCTTATTTACTTTTCAAATTTCCGCAAGACTATGAGACTTTAAAAAAATGGGCAGCAAATCTGAGACTACCGGAGAAAATATGTGGCCGTAAAGAATTgcgagtatgttccaaacattttgAACCTATTGTTATAAAGGGTAAACAACTGAAACCCTACGCAATACCTACTATAGATTTAAGCTATTCACAACGCCCTCCAGTATATGGAAATAGCTCCAAGGAGATCAAACAAAAACCTAAACCCAAACCGAAACGCAAACACAAAGAATTGAATAAAGCCAAATGCTTCCTTCCTCACTGTCAACGCAAACAGGATGATGAGACATTCCTCTTAAATTTCCCCCAACATTGCCTTTCACAGCAAAGAAGGTGGTGTAGAAATCTCAAGTTGGATAGTAAGGTAAAACAGTACAAAGAATTAAAGATCTGTAATCATCATTTCGAAACGTATGTATTCTTTAAACAACGAAATCTTAAGATAGGGGCAGTACCCACTTTGAATTTGGGTCATTCTGATCCAATAAGCAAAAACTTGAAAAGGCTAAGACGTAAAGTTCGAGTGGCACCCAAGGAAACTTGCTGTGTTAAGGCTTgtgaaaatcaaaattccaagAAACTCTATGCATTTCCAAAAAATTCTGAATTGAGACGAATATGGAGTAATAATTTGCAAATTGA
GCTAAGAGAAGCTTTAAGTTACCACTATAAACTTTGTGGTCTTCACTTTTCCCCGGAAAGCTTTGAGGTCGGCTCTGATACATTAAAAATAAGTGCTGTGCCCTCTTTGAAACTTGGTGTGGCGGCAGCGGAGAACCATAAGGTTTTAAGTAAAATCCAGACGTCTGCATATGAATGTCAATGTGTAGTCGAAAATTGCTTAAAATCGTCCAGCGTTGACAAAGTGAAACTATTTGAATTGCCCAAAACTAAGGACTTGCTGAAGAAATGgttattcaatttgaatttgtcGCCCGATATTGATTTAGTCAAGACCCGCATTTGCAGTCGTCACTTCGATAGATTATGCATTAAAAAGGGTAATCTACACCCAATGGCTGTTCCCACTCGGTGCTTGAAAGCCAAAACTTCATTCTATCAAAACGATGATGAGGTGTACGAAGAGAATTACAAGTGTTGTGTCTTACAATGTAACTATGAGACACACGAGGAAGAAGATCAACCTAGGCAAATGTACAAGTTTCCAAAACAGAAGGAAGAACTCTTAAACAAATGGCTTCACAATTTAAAACTCATGTTGAAAGAAGATCAGATGGATTTTAAACATTTAAGGGTATGTGAAATACACTTTGAAAATGCGTGCAAGAGCAAAGATCATTTAGTGCCTGATTCAGTGCCTACGCTACAACTTGGCCACGATCAAATACAAGATATTTATCGGAATCATGAGATACAAAAATGCTGCATAGAAAATTGTCAATGGCGAGGTTACCACTGTCATAAATTACCGGAAAATTTAGAAGCTAAAACGTTATGGTTAAAGCAACTGGAAAAGGAATTCATCGTCTGCAATTTGGACTATATATGTTCAGTGCACTTTGTATCCTGGTATGAAAAGATTTACAACTCCAAACCTCCCAAAGAAGAGTCATTTAAAAGGCTTTATGAAGAACTTTCCATACTACCAGAGCTTAGACCTTTCAAATGCTCTGTTTCCAATTGTAAAACAGGTTTTACGTTGATATGGAAACTGTTTCCATTCCCCACAAATGTTAGTTTGTTCCAAAAATGGTTACACAATACTTCGCTGACGTTAGATTATACAGACCGTTCTAAATATCGCATTTGTGCTCAACATTTTGAAGAACGTTGCTTAAGTGAGAAAAAACTACATAGATGGGCTTTGCCCACCTTGGAATTGCCCTTTAATTTAAGTTTATATGTCAATCCCCCAGAAGCCTTGCCCTCCCATCATGAAAACCTTAAACAttgttgtgtttcattctgtAACATAACTAAGggacctttttttaaatttccgccCAGAGCCTACGATGCCAAGAAATGGAAACACAATCTTGACCTGGGACCTCAACAGTCCACCCTGAATTTGAGAGTATGTCATCGTCACTTTGAAGAATATTGTTTTTCCCGGGATGAGCAGAATCGTAttacaaaacttaaattttggTCTGTACCAACTTTACATTTGAAAAGAGTATCCGGCCTACATGATAACCCTACCGACAAGCAAGACTATTTCTTTTGTTGTGTCTGCAAAGAGCGCCAGAATAAACCTGAGGGCTTGTATCTCTTTCGATTCCCAACTAAACTCTCTAGTTTCCTGAAATGGTTACACAATTTAAGACTGCAACGTCAAGATTATCGGGACAGTATGAGAGTGTGCATAAAACATTTTGATAACGACTGTTTCAACAAGACATTGAAGTTATTACGAAGGCGTTCCGTACCTACCATAGGTGTGCCATGCCCCTTAAATGAATTGTTCACAAATCCTAAAAGAAAACTGACCTCTAAATGCTGTGTGTCATCGTGTGAAGGCCCCTGGCCAGACTTGAATCCATTTCCAAAGGATAGCATTATACTAAAGAAAtggtatttcaatttaaatttaaagcaaaataaagATACAGATGCGGAGGCAGTAagagttttgaaaatttgccaACGCCACTTTGAGAGCAAGTGTTTAAATGCCT
TTGGGGTGATAAGACCCACAGCATTGCCAACTTTGAATTTGGGTCATaataataaaatctttaaaaaccaAAGGGTTATAAACAAAAGCAAACACAATCAACAGTTGACAGAGACCTCCACATCGGAAAAAGTACCAGCATATACTTTGGAAAATCAAGAAATTGGACTAAAAGATCAAGAAGCAAACGCATCAACAAAAGCTCTCAATGAAAGCTGCCAGCTTCCGCCTTTAAAAAGACCAAGAGGTCGGCCTAAGAAAAATGTTTGTGGGGTAATTAGAGCTTCAAATTGTTTATTAAGTACTCAAATTAAAATTGAAGGTCAGAAAAACGAGAATAAGAAGATAATACATCCCGAAGTAGAACCAAAAACGAATGCCACGGACGATGATGATTTGGACAAATACCAAACATTGGGGCACTACATTCAACAAAGACAAATGCGACAAACGTTGAAGCCGCAAAAAACGGAAAGCTTACAAAATTTGACTAAACGAGTAACAAAACCACACCAACACAAACAGAcgatgaaaataattaaaaaggaaTATGAGGATGATAAAATGGGAGCTGGAGGCGTGTCTTTAAATGCTTCCCCATTAAAATCTCCCAAAGACTTGGATTTGGATAAGTGCAAAACCTTAGGCGACTTTGTAAAACAAAGAAAGAGTATGAAAGCTTTACTTAGAAAGAATACAAAATCTTTACTTATGAAAAAGGACAAGGCCACAAAAGTTAAGACCACGACTCAAAGCAATAATACGATAAATACTATTAAAGTAGAACCCAATAAGGATACCTGCACAAATATTATAACCATTTCTGAGACTCCTTCTAAGACTTTAGAAACTTCTGAAAATCTGGATGAATATAAATCTCTAAGCGATAACTTAGAATATAGTAAACAGCAAAATAATTCTTCAATGACTACAAATCGAAACGAAGTAGAACTCAAAAAGGATATATACCAAAATGTTATAAACTCTACTGACACCCCTTCTAGTGATTTAAAATCGCCTAATGAGCTGGCAATtgaatacaaaaaactaaatgatagtgacaaaaaattaaaagagcaAAAGACTTCTCCTGATAATAACGAAGCTTCTGCATCATTAAAATGTAATGTCAACAAAGTTAAAAACCAGCCAGTTGTTCCAGTGAATGTAATAAAGGTTACTGAAACTCCCTCCACATCTTtaacaaatattgtgaaaaaaGAAAATGAAGCAAAGCCACAACTGCTCTTCATAAACAACGACGAATATAACACTGTAAATAACATGAACATTTCTCGATCGTTTGAAATGGACCTTAATCCTCCTTGTGAAGTGGTTTTAAATTCAATGGATGTTCTTCCTGGATTTCAAATCAAACAAGAACTTCTGCTGGAGGAACTGCAGCCACATTCTTATGAACCAGAGGGTATTTATCCAGATAATGTACCAAAACAGGACTATTTTTTCGATGACGATACATCGTTAAGTTACTCTGAAGCTTTCGGTGATAGATCAagttcagcacaaaaacaatacGTACGTCAAAAATGTACCATTTCAACCTGCCACTACaagtttaacaattttaaatgtacatTTTATAAACTGCCACCAAATATGAAGCTCCGATGGATATGGATTAAACATTGCCAGGAATCATCTCAATCTGACATTATATTTGGCAGGAAGTTCAAATTGTGTAAGGATCACTTTGAAGAGCAATGTTTCATGGAGGAACGGTTGTTGCTTGGAGCCATCCCCACTTTAAATTTGGGGCCGAAAAGTATGACTGATTTGCAAGACTGTTCGGAAATGTTAAAAACGTTCCGTGAATTTAGATGTCGCATTGATGACTGTCAGAGATCAGCAGAAATAGATCAAGTACATAAAATGCAATTTCCTTCTTCAAATGACGAACTTCGATCAAAATGGTTACACAACTTAAACTTCCAAGAATCCGATATATCCGAAAGTGATTGGCTCTGCCACAGACATTTCGAG
CGTAAGACTTTAATTAAAGGTAAACTCAAAGCCGAAGCAGTTCCGACATTGTTACTGGGCTCTCAAGCAAAATCAGAAGACGATTTGTACAAAAATCCTGAACACATACCCCAAGCGCAATACTTCGCCCAACTGCGGCAGGTATGTTGTGTGCCCAGCTGCTCAAACACTAAGCAATCAATGGGAAATATACGATTTACCCCATTCCCTAAAAGAAGAGATATCTATGACAAATGGTTACATAACTTGAATTTAGAAGACTCTGTGGATGTGCGCAACGCTTACCAGATATGCTGGGATCATTTCGAACCGATTTGCCATTCGAAATATACCTTAAAGTTTGCTTCCATACCGACTTTGAATTTGGACAAAAAAGAGGATTTATATCCCTTAGACGAAACACAATTACAAGAACTGGATTGTTTTGGAAGACGTCTCAGAAGTAGTTGggtatttaaaaaacataaatgttcaCATCCCCAGTGCGGAGAAGAAAAAATCCAGGTATTCTCGGGCCCTAAGATTcctcaattgaaaaaaatttggctAGATGGCTTGAACGTAGAGGATCCGAAAGACACTTGTGAGGATATGGAATTCTGCGATGAACATTTCTATTTGCTATACAagcaatttgaaaatcatttAGTCGATATAGAAAACTCTGGATCCTTTGAAGATGAGGTGAAATTGTTAAGAAGTACTTTTACAGAACTAGCGTTAAAAACGAAATTCTTCGGAAGACATTGCTTTGTGGCCGAATGTCGAACTGATCTTGTTCTACCCCAATACAAagacataaaattatttaatgtcCCGACGAATCCTGAGGTAGCAAAGAAATGGTGCCATAATTGTAATGTCGACTATAAaacatttgaacaaaattataaattgaacTATTACAAAGTCTGTGATAGACATTTTGAATCGTTTTGTTTTGGCCAGCGTTCCATATTTTCCTGGGCCTTGCCCACCTTAAACTTGCCAGACTCGAGACCTACAGAGATATATGAAAACCACCCGGATAATAAATTGGGTTATACTGGTAACTGTTGTATACTATCGTGTGTAAACAATGAAGGTTTAAAACTGGATTCTAAATGCCGCTTATATACGTTTCCAGAAAATCTGGAGCATCGTTCGAAATGGTTACACAATGtcaattgtgaaaattttcaagAGAATGAAACTAAAATCTGTGGGCTCCATTTTCGTTctatttattttaagagaatgAAGGCTCTGAATGAGCTGGCTATACCCACGTTGAGGCTTTATCCACATCGACGAACAGATGATGAAATTTTCTTAGATATCAGAGATGCAAATCCTACCAAACCACCTAAAAAGGAAGACAAAGATAAATTCATTAAAACCGAAACTATTGAAATGCCAATTAATGTGTCTGATCATGactattgctttgaacaacaaccagaAAATTCGGAAGAAAAAGATTCGTCCCAATTCTTCGAGATGCTCCCCATAAAACAGGAAATAATTGAAGATCAGTATGGTTTAGAGGAAGATCAACAAATGCCGTTTGAAATATCTACTATTAAAGAGGAGATCACTGGCATGGGAGAAACGCACAAAGATCAATATAGTTTAAATAAAGAGCATAAAGTACAGTTCGATATTCCAGTGGTCAAAGAAGAAATCATTGAAATTGATGAATGCAATCAACAGCAATACGAAATTCCTGTCTTTAAAGAAGAAATTATAGAAATTGAAGAATCCAATGCCTATGATATAACGGAAAACTATGAATATTTAGATGATATGACCAACATTAGCGATTATGTTACTTGCAGAGGTTCCACCAACACCTCAACCACCATTTTTCCCTCCCTCGTCATATCGGACGTGAAGTCGCACATCTTTATGTGTTGTGTACAAAAATGTTGCAGCAATTCAGAAACTCTGGGTGTAAGACTATACACTGAATTCCCAAATGATTCCGATATATTCATTAAATGGTGTTTTAATGTCAAGAT
AGATCCCCGTAACTACAAAGAAAATCAATATGCTATATGTAGCCAACACTTTGACTCAATGTGTTTTAAGGATGGCAACTTATCGCTGCACTCTTGGGCGGTGCCCACCTTAAATTTGAATCTTCCAGAGAATTCTTTCATTCATCAAAACGATACGCCCAACGAACAGTGTATAGTCTATGGTTGTATCCAACCCTTATATCCGCTGTACAATTTTCCATTACAATTGGATTTGTGCCAAAaatggttttcaaatttaaaattagatCTTACCGATTATAGGGCCCGAAACTATAGAATATGTCGCAGACATTTTGCGGCCGAATGTTTCGAGGCGAATCTGAAGTTAAAATCAGATGCGATACCCACAGTGTATTTGGGGCATGCGGATAGAATATTACATTTTAATGCATTTGAAGTTAGATTACAGGAATTGGAAGGTTATGAAGTACTAGATGCAGCCGCGGTGGCAGCTCCGCCAGTGGGTCTCGACAATAGTCGTGGTAGTAGCCAGGGTTCTGCTGTATTAGCAAGGCATTTGATATCACCCAATGACTTGGAAGATCATGATAGTAGttattttgaagattttgaagattGCTATAGTATGGAAGATTAA
Protein Sequence
MNEERKQQQQRQQQHQQQQQQQQQQHHQHRVQQPPLSSQEQKHHNQHNPHHHQHPPLQHHEQRDRHQQQSEPQPQPQHHRHRLPTQHHHQSEIISQQQQQTHQKPQQQSSSSSPAAENSNAPTTAEEKRQLQPSLANIKSEPKPLNFPRRKLVQTERSSTLPICQRCKQVFLKRQSYTQHVALSCCNIVEYDFKCAICPMSFMSNEELQAHEQLHRLNRYFCQKYCGKYYETIEECEQHEFGQHEYEMFKCNICCLCFPKREQLFTHLMEHRQQSRYDCCICRICFQNVSELDDHYIGNPDFCGKFYDKEAFKSLKIFSKPKPQPPQLSTTSRSDGLTSFMIKDITSTKMPEKGLPPAKQRKPAPTPQPEPDSPPNSSLYDDIPNFAAPLVEVKTEIKVEPDFYPPMDHGDFAAYDNDYSNSQDYSQGSNQNLTFLQDFHDNASNSTNSSFSVNTTTSTSNNNDATQDEDAVCCVARCGVRKFSSPTLQFFPFPRDEKYLSQWLHNLKMTYDPNVNYGIYRVCGLHFPKRCVARHSLSYWAVPTFNLGHDDVGNLYQNRESSGGFPSGEMARCYMPGCLSQRGETNVKFHSFPRDLKTLIKWCQNSRLPIHSKENRFFCSRHFEEKCFGKFRLKPWAIPTLRLGTVYGKIHDNPNIYQEEKKCFLPFCRRSRSFDCNLSLYRFPRDETLLRRWCYNLRLEPEMYRGKNHKICSSHFVKEALGLRKLNPGAVPTMNLGHNDRFNIYENELYTPPPPPAPPQPSTSSKAQKFAEMFKQDMNSASAIYDEVFMSSMVQKFSGSSSVNVPNLDLGDVCLVPTCKRTRHTDDVTLHTVPKRAEQLKKWCHNLKMDLEKLHKSVRICSAHFESYCIGGCMRPFAVPTLELGHDDTDIFRNPDVIKKLNIRETCCVPSCKRNRDRDHANLHRFPTHPELLQKWCENLKKPIPDGTKLFNDAVCEVHFEDRCLRNKRLEKWSIPTMNLGYEDIVHKLPSEEEIAEHWTKSFTPNNGDEQGDCCVASCRRNPQIDDVRLYRPPEDAEQMVKWAHNLQVEVTDLPNLKICNLHFESHCIGKRLLNWAMPTLNLGSRIEHLFENPPPTQVVYYKKKEKALKIASNHDGMKWAPRCCLSHCRKTRSYDNIQLFRFPYINRSTLAKWCHNLQLPLVGSSHRRLCSSHFEPSVLTKRCPMNLAVPTLHLNAPIGYKIYQNPARLKQVKIGAQRQCIIEMCRKTKVDGVVLFRFPNNRTILQKWRHNIKNSWPKGKLSSQLRVCSEHFESHSVGGKRLSPGAIPTLKLGHESTDLYPNETRSFFDLEKCAVSGCDSRKSMEDIRLFRFPRDDEELLQKWCNNLKMNPLDCVGIRVCTKHFEEECIGPKLVFKWAIPTLKLGHKEDDDVEIIPNPPPEQRSGEFIFKCCVPSCGKTRKYDDAQMNSFPKHLKQFRKWKHNLKLDILNFKEREKYKICNDHFEPVCVGKTRLNFGALPTINLGHNDTEDLYKINPERIRPNLFISQKDIDRMERKQLRLDMDHDEQHLQEIDQEQDDDPLDPLSTPMECCVEECKAPKTLMREPYDIPECKELRELWSRVLKKDITDATAVEGKICGLHYQQIFLELKPEMESMAEENPEVKLDYGKLLYAYQKSEISLVIKTYQCRVETCSTNPLNSDVRLYYFPYGKEIVKKWTYNTGIIPDEHRRYVNKVCALHFEHFCITDTQRLRSWAIPTINLKHDNPEKLYKNPDLTRIDRRMLGPQILKCAVQNCDNGIATDTENIRFFNFPTDDNLLEKWCANLKMSRRLTPLYKICAIHFEKQCLGSSRVRSWAIPTLHLGHNEDPEYFNKTTIKKEVYEEQSNKMEQLQLKQVKIKKSLDSLKCYVSSCRRSRLQHGVRFYPLPVNPKMKRKWMHNLQIASQKAGKVTNLRICNLHFHKRCFDGKQLKNWTVPTMHLGHSEPIFDNPRRLQNPLTVRRCVLQHCTNRTVTNDKLRTFVFPKAPEFLERWAKNLKLE
ASQCKGRLCSEHFEVEVKGEKKLKNGAVPTLKLGHDDELLYDNKDLIKRLAIKASQKESDSKNEVLEDDDYEEEGEEEEEIADDEEFDDDDELIEYHDIDEEEEEEDEDEDEDEDEEEEEEVDMDKVRIRGTLQHWTSIKMKELRVTLVPISKEELLEISSVSSYERDRRSVTPANSLKDLRSETPASVIGVNTSNEPDRRSVTPSNSYSRDVHSETPISTVVAAGQTSNDREQRSVTPANSNSRDIHSETPASVGGGQTSLECHDENSACTLLRTDKPINGVAPMCCLKHCGKEKTPEQHLTTYGFPKDPQVLQKWCDNLGLQPEECIGRLCIDHFELRVIMTRRLRPGAVPTLNLGHARVPKHTNDELPKDKKGAIITETETKVEAKSEQIPAPPPPYTNPKPIGKQSVFRLCCLKHCRRKKQAETEEEKQCPKVQRKPYLLFKFPQDYETLKKWAANLRLPEKICGRKELRVCSKHFEPIVIKGKQLKPYAIPTIDLSYSQRPPVYGNSSKEIKQKPKPKPKRKHKELNKAKCFLPHCQRKQDDETFLLNFPQHCLSQQRRWCRNLKLDSKVKQYKELKICNHHFETYVFFKQRNLKIGAVPTLNLGHSDPISKNLKRLRRKVRVAPKETCCVKACENQNSKKLYAFPKNSELRRIWSNNLQIELREALSYHYKLCGLHFSPESFEVGSDTLKISAVPSLKLGVAAAENHKVLSKIQTSAYECQCVVENCLKSSSVDKVKLFELPKTKDLLKKWLFNLNLSPDIDLVKTRICSRHFDRLCIKKGNLHPMAVPTRCLKAKTSFYQNDDEVYEENYKCCVLQCNYETHEEEDQPRQMYKFPKQKEELLNKWLHNLKLMLKEDQMDFKHLRVCEIHFENACKSKDHLVPDSVPTLQLGHDQIQDIYRNHEIQKCCIENCQWRGYHCHKLPENLEAKTLWLKQLEKEFIVCNLDYICSVHFVSWYEKIYNSKPPKEESFKRLYEELSILPELRPFKCSVSNCKTGFTLIWKLFPFPTNVSLFQKWLHNTSLTLDYTDRSKYRICAQHFEERCLSEKKLHRWALPTLELPFNLSLYVNPPEALPSHHENLKHCCVSFCNITKGPFFKFPPRAYDAKKWKHNLDLGPQQSTLNLRVCHRHFEEYCFSRDEQNRITKLKFWSVPTLHLKRVSGLHDNPTDKQDYFFCCVCKERQNKPEGLYLFRFPTKLSSFLKWLHNLRLQRQDYRDSMRVCIKHFDNDCFNKTLKLLRRRSVPTIGVPCPLNELFTNPKRKLTSKCCVSSCEGPWPDLNPFPKDSIILKKWYFNLNLKQNKDTDAEAVRVLKICQRHFESKCLNAFGVIRPTALPTLNLGHNNKIFKNQRVINKSKHNQQLTETSTSEKVPAYTLENQEIGLKDQEANASTKALNESCQLPPLKRPRGRPKKNVCGVIRASNCLLSTQIKIEGQKNENKKIIHPEVEPKTNATDDDDLDKYQTLGHYIQQRQMRQTLKPQKTESLQNLTKRVTKPHQHKQTMKIIKKEYEDDKMGAGGVSLNASPLKSPKDLDLDKCKTLGDFVKQRKSMKALLRKNTKSLLMKKDKATKVKTTTQSNNTINTIKVEPNKDTCTNIITISETPSKTLETSENLDEYKSLSDNLEYSKQQNNSSMTTNRNEVELKKDIYQNVINSTDTPSSDLKSPNELAIEYKKLNDSDKKLKEQKTSPDNNEASASLKCNVNKVKNQPVVPVNVIKVTETPSTSLTNIVKKENEAKPQLLFINNDEYNTVNNMNISRSFEMDLNPPCEVVLNSMDVLPGFQIKQELLLEELQPHSYEPEGIYPDNVPKQDYFFDDDTSLSYSEAFGDRSSSAQKQYVRQKCTISTCHYKFNNFKCTFYKLPPNMKLRWIWIKHCQESSQSDIIFGRKFKLCKDHFEEQCFMEERLLLGAIPTLNLGPKSMTDLQDCSEMLKTFREFRCRIDDCQRSAEIDQVHKMQFPSSNDELRSKWLHNLNFQESDISESDWLCHRHFE
RKTLIKGKLKAEAVPTLLLGSQAKSEDDLYKNPEHIPQAQYFAQLRQVCCVPSCSNTKQSMGNIRFTPFPKRRDIYDKWLHNLNLEDSVDVRNAYQICWDHFEPICHSKYTLKFASIPTLNLDKKEDLYPLDETQLQELDCFGRRLRSSWVFKKHKCSHPQCGEEKIQVFSGPKIPQLKKIWLDGLNVEDPKDTCEDMEFCDEHFYLLYKQFENHLVDIENSGSFEDEVKLLRSTFTELALKTKFFGRHCFVAECRTDLVLPQYKDIKLFNVPTNPEVAKKWCHNCNVDYKTFEQNYKLNYYKVCDRHFESFCFGQRSIFSWALPTLNLPDSRPTEIYENHPDNKLGYTGNCCILSCVNNEGLKLDSKCRLYTFPENLEHRSKWLHNVNCENFQENETKICGLHFRSIYFKRMKALNELAIPTLRLYPHRRTDDEIFLDIRDANPTKPPKKEDKDKFIKTETIEMPINVSDHDYCFEQQPENSEEKDSSQFFEMLPIKQEIIEDQYGLEEDQQMPFEISTIKEEITGMGETHKDQYSLNKEHKVQFDIPVVKEEIIEIDECNQQQYEIPVFKEEIIEIEESNAYDITENYEYLDDMTNISDYVTCRGSTNTSTTIFPSLVISDVKSHIFMCCVQKCCSNSETLGVRLYTEFPNDSDIFIKWCFNVKIDPRNYKENQYAICSQHFDSMCFKDGNLSLHSWAVPTLNLNLPENSFIHQNDTPNEQCIVYGCIQPLYPLYNFPLQLDLCQKWFSNLKLDLTDYRARNYRICRRHFAAECFEANLKLKSDAIPTVYLGHADRILHFNAFEVRLQELEGYEVLDAAAVAAPPVGLDNSRGSSQGSAVLARHLISPNDLEDHDSSYFEDFEDCYSMED

Similar Transcription Factors

Clustering by sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-