Basic Information

Gene Symbol
-
Assembly
GCA_037044455.1
Location
JBAMBS010002739.1:152556-166715[-]
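
The Location field packs contig, coordinates, and strand into one string. A minimal sketch of splitting it apart is given here; the names `Location` and `parse_location` are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str

    @property
    def span(self) -> int:
        # ASSUMPTION: coordinates are 1-based and inclusive (not stated in the record).
        return self.end - self.start + 1


# Expected shape: <contig>:<start>-<end>[<strand>], e.g. "X.1:10-20[-]"
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


loc = parse_location("JBAMBS010002739.1:152556-166715[-]")
print(loc.contig, loc.start, loc.end, loc.strand, loc.span)
```

Under the inclusive-coordinate assumption, the locus covers 14160 bp on the minus strand.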

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) that probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box, and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably in worms and flies, but not in other eukaryotes or in any prokaryotes [1].
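
The C2CH consensus above can be written as a regular expression. The following is a rough sketch only: the function name `find_c2ch` is hypothetical, and a plain pattern like this is far less sensitive and less specific than the profile HMM search reported below, so it is best treated as a quick sanity check rather than a domain caller.

```python
import re

# Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His
# The lookahead lets candidates that overlap each other be reported;
# at most one candidate is returned per start position.
C2CH = re.compile(r"(?=(C.{2,4}C.{35,50}C.{2}H))")


def find_c2ch(protein: str) -> list[tuple[int, str]]:
    """Return (1-based start, matched segment) for each C2CH-like candidate."""
    seq = "".join(protein.split()).upper()  # drop line wraps, normalise case
    return [(m.start() + 1, m.group(1)) for m in C2CH.finditer(seq)]


# Synthetic toy sequence, not taken from the record.
toy = "MK" + "C" + "AAA" + "C" + "G" * 40 + "C" + "AA" + "H" + "KK"
for start, segment in find_c2ch(toy):
    print(start, len(segment))
```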
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 31 6 2.2e+04 -6.3 3.8 54 86 135 169 115 170 0.51
2 31 6 2.2e+04 -5.0 2.6 23 59 176 207 172 223 0.62
3 31 4.4 1.6e+04 -3.0 3.4 45 62 357 376 340 391 0.56
4 31 1.6e-14 5.9e-11 43.3 3.7 1 86 593 665 593 666 0.85
5 31 1.9e-15 7.2e-12 46.2 4.6 1 87 693 762 693 762 0.83
6 31 4.7e-16 1.7e-12 48.2 0.4 1 87 784 856 784 856 0.85
7 31 5.3e-16 2e-12 48.0 5.4 1 86 954 1023 954 1024 0.81
8 31 3.8e-15 1.4e-11 45.3 3.3 1 86 1048 1119 1048 1120 0.82
9 31 2.8e-13 1e-09 39.3 0.9 1 87 1155 1223 1155 1223 0.81
10 31 2.7e-10 9.9e-07 29.7 1.8 1 86 1271 1340 1271 1341 0.76
11 31 3.4e-17 1.3e-13 51.8 0.1 1 86 1368 1437 1368 1438 0.83
12 31 7.6e-13 2.8e-09 37.9 0.9 1 86 1459 1528 1459 1529 0.81
13 31 4.7e-15 1.7e-11 45.0 1.9 1 86 1556 1627 1556 1628 0.85
14 31 8.6e-14 3.2e-10 40.9 1.6 1 85 1695 1763 1695 1765 0.82
15 31 3.2e-12 1.2e-08 35.9 0.1 1 86 1786 1854 1786 1855 0.81
16 31 1.2e-14 4.3e-11 43.7 0.4 1 87 2022 2091 2022 2091 0.81
17 31 3e-12 1.1e-08 36.0 0.1 1 86 2151 2221 2151 2222 0.77
18 31 0.0026 9.4 7.4 0.1 1 59 2250 2301 2250 2325 0.75
19 31 1.3e-11 4.8e-08 33.9 0.7 1 86 2339 2408 2339 2409 0.84
20 31 2e-13 7.5e-10 39.7 2.6 1 86 2465 2534 2465 2535 0.83
21 31 2.2e-14 8.1e-11 42.8 1.0 1 86 2570 2641 2570 2642 0.82
22 31 6.7e-12 2.5e-08 34.9 2.5 1 87 2652 2724 2652 2724 0.80
23 31 8.2e-12 3e-08 34.6 0.1 1 64 2750 2810 2750 2823 0.76
24 31 0.00026 0.98 10.5 0.0 1 58 2854 2904 2854 2916 0.82
25 31 2.1e-14 7.7e-11 42.9 0.1 1 86 2942 3014 2942 3015 0.81
26 31 3.4e-14 1.2e-10 42.2 0.3 1 86 3185 3257 3185 3258 0.82
27 31 2.3e-12 8.6e-09 36.3 3.6 1 87 3320 3391 3320 3391 0.81
28 31 8.6e-15 3.2e-11 44.1 2.5 1 86 3498 3568 3498 3569 0.85
29 31 5.1e-14 1.9e-10 41.7 0.1 1 87 3722 3792 3722 3792 0.83
30 31 2.1e-09 7.9e-06 26.8 1.8 1 59 3812 3860 3812 3873 0.86
31 31 3e-10 1.1e-06 29.6 0.4 18 87 3876 3934 3865 3934 0.74
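
The rows above are whitespace-delimited with 13 fields each, so they can be loaded and filtered with a few lines of code. This is a sketch under stated assumptions: the class `DomainHit`, the helpers `parse_row` and `significant`, and the `1e-3` i-Evalue cutoff are illustrative choices, not something the record or HMMER prescribes.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    index: int       # "#" column
    total: int       # "of" column
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_row(line: str) -> DomainHit:
    """Parse one whitespace-delimited row of the per-domain table."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


def significant(hits, max_i_evalue=1e-3):
    """Keep hits at or below the cutoff (ASSUMPTION: 1e-3 is an arbitrary example)."""
    return [h for h in hits if h.i_evalue <= max_i_evalue]


# Three rows copied from the table above (domains 1, 4, and 18).
rows = [
    "1 31 6 2.2e+04 -6.3 3.8 54 86 135 169 115 170 0.51",
    "4 31 1.6e-14 5.9e-11 43.3 3.7 1 86 593 665 593 666 0.85",
    "18 31 0.0026 9.4 7.4 0.1 1 59 2250 2301 2250 2325 0.75",
]
hits = [parse_row(r) for r in rows]
for h in significant(hits):
    print(h.index, h.i_evalue, h.env_to - h.env_from + 1)
```

With this cutoff, only domain 4 of the three sample rows is retained; domains 1 and 18 have i-Evalues well above it.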

Sequence Information

Coding Sequence
ATGTCACAACACAACCCCCcgcatcaacatcatcatcactaCTATcaggaggagcagcagcagcagcaacatcatcatcaacaacaacaacagcagcagctacaacataaacaaatacaacagcaGAGTTGGTACTCACATGTTGCTTCCTACCCTTCCCATCATCCACACGTTGCTGGCTTTTCGGCGCcatgcaaaaacaataataataacaacaacaataacatgaATGCATACGAAAGTGGAGCTGGAAGTTCGCATACAGCATATTATGGCTCCTCTGCAGCAGGTACGGCTGGGGTGGGCTATAACCTTGAGGGCAATAATGTGGCCTATGCGCACAACCAGCTGCTGCagtaccaacaacagcaacaacaacaacatcatcatcatcatcaacagcaacaacaacatcaccaacaacaacagcagcagcagcagcaccaccaccttAGTCAACGCTCGTATATGCAGCATAATTTAATGCATGGTTCATATCCTTACATCAAAAGTGAGCCATTGGAGCTGCCTGATGATAGACAACgtcgacagcgacagcagcagcagcaacaacaacagcaacagcataatCAGCAtcatcaccaacaacaacattttcagAATCCAATGGCACCGCCGCCAGCACCCGCCAATCGCCACCCACACGATGCCAGCGGtgaaatgataataaaatCGGAACCTATAGACGAACATGCGTACAAGTCAAACTATATCGATGATAATACACCGTTCGCCGATTTTAGTAAATATCCAGAATTTAACGACGATATGTTGAACCCTAAGATTGAATTAACTGTCAAAGATGAAGGCTATGGAAACCAGAAGAACCCGCTTAGCTATCCCCGCCGCAAACTGCAAAGTGAGCGTTCATCTGAGAGTTTGCCCATATGCCAGCGTTGCAAGGAAGTATTCTTTAAGAAGCAAATCTACTTGCGGCATGTGGCCGAAAGCAGTTGCACTATTCAAGAGTATGACTTTAAATGTAACCTATGTCCTATGTCCTTCATGAGCATTGAAGAACTGCACAAGCATAAGCATCTACATAGAACGGAGAAATACTTTTGTCACAAATATTGTGGCAAACACTTTGATACAATTGCTGAATGCGAATCCCATGAGTACATGGAGCATGAGTATGATAGCTTTGTATGCAATATGTGCTCCCTTACGTTTGCCAACAGAGAGCAATTGTATGCGCATCTGCCACAACATAAGTATCAGCAGCGTTATGATTGCCCCATATGCCGATTGTGGTATCAAACAACGTTGGAGTTGCATGAGCATCGCATGGCAGCGCCGTACTTCTGTGGCAAATATTACGTTGCAGCgcatcagcaacagccgcatacgcagcagcatcaacaacagcatgcacatcaacagcatcagcattcaCAGCATCATCAGCCCAATTACAAACTGCAGGATTGTCAATTGGGTTCCGTTGAAATGCAACACCATAAGGCGATTCCAACAGCCAGCGCATTGCCTGCAACGGCAGCACTCAATTCGCTATTGCAGCAACGACAGGCTAATGCAGATGGTGCACCAATGTTTGGCTCTACCATGAAAACTGAGAGCAATATCAAGCTGGAGCGTAGCTATAGCAATTCTACAAGTGAATCTGGTTACAGCCTGCACGATAGCAGCTATAACAATGCTTATGCTAGCGATACGTCGCTACATGGAGGCAGTGGTGCAATTGGTGGACCACAAGCGCACTCCTCCACGCTAGACGATTCTGAGGATGCACTCTGTTGTGTGCCATTATGTGGTGTACGCAAGAGCACCAGTCCAACGCTGCAATTCTTCACGCTTCCAAAGGATGAAAAGTATTTGCATCAGTGGTTGCATAATCTAAAGATGTTCCACATTCCGGCGTCAAGCTATGCCAGTTTCCGTATTTGCAGCATGCACTTTCCGAAGCGTTGCATCAATCGTTACTCACTCTGCTACTGGGCAGTACCCACATTCAATCT
GGGTCACGATGATGTTGccaatttatatcaaaatcgCGAACTAACCAATACATTTACCACTGGTGAAGTGGCGCGTTGCAGCATGCCCAACTGCACCAGCCAACGCGGAGAGAGTAATCTTAAATTCTACAACTTTCCCAAGGATATTAAGAGCCTAATAAAATGGTGTCAAAACGCACGTCTTCCTGTCCAGGCCAAGGAGCCTCGTCACTTCTGCAGTCGCCACTTTGAGGAGCGTTGCATTGGCAAGTTTCGATTAAAACCATGGGCAGTCCCCACGCTACATTTAGGTGCACAATACGGAAAGATTCATGACAATCCGAAGAACTTGTATGTGGAGGAGAAACGTTGCTGTCTCAATTTCTGTCGACGCAGTCGCTCCTCTGATTTCAACATGTCATTGTACCGTTTTCCCAGAGATGAAGTGCTGCTGCGACGTTGGTGCTACAACCTACGCCTCGATCCTGCTGTCTATCGTGGCAAGAATCATAAAATATGTAGCGCTCACTTCATCAAGGAAGCTTTAGGCCTACGTAAGCTATCACCAGGTGCTGTTCCTACGCTTCATCTGGGACACCACGATACCTTTAATATATACGAGAATGAGTTGTGGCCACCACCGACAGCATCTACGCCCACCAATCATCAACAGCAccaactgcagcagcatcagttacagcaacatcaacaacaacacgcacATCACAGCAAATATCAGCGACAGTCCGCTGCATCTACGTCATCATCGGCTAGCTCAGCATCGCATTATGTGGACCCAGACCTAAGTGCATCGTACATGAGTCTAAACGTCTCTGGTGGCTCCGGCTCAGCATTAAATGTCAGCGATAGCATGGACATCTGTTGTGTGCCCAGCTGCGAGAGCAAGCGACACAATAATGAGAATATAACGTTCCATACAATACCAAGGCGTCCGGAACAAATGCGTAAATGGTgtcacaatttgaaaatatccGAGGATAAGATGCACAAGGGCATGCGTATCTGTAGTCTACATTTTGAGTCCTACTGCATTGGCGGCTGCATGCGTCCATTTGCAGTGCCTACGTTACAGTTGGGACACGATGATGAGGATATACACCGCAATCCGGATGTAATCAAGAAGCTCAATATACGCGAAACATGCTGTGTTGCTGTATGCAAACGTAATCGTGATCGGGATCATGCCAATCTACATCGTTTCCCAAGCAATGTGGCGCTGCTCAGCAAGTGGTGTGCCAACTTGCAACGCTCAGTGCCGGATGGCACGAAACTCTTTAACGATGCAATTTGTGAGGTGCATTTTGAGGATCGCTGTCTGCGTAATAAACGATTGGAGAAATGGGCAGTACCCACGCTGGTATTGGGACATGATAACATTGCATATCCATTACCAACACCCGAACAGGTGGCTGAGTTTTATGCGCGTCCAACTGCACCGAACAATGGTGAAGAGCAGGGTGAATGCTGTGTGGATACATGCAAACGTAATCCAAGTGTCGATGATATTAAACTCTACAGACCGCCAGAGGAATCACAGGTGCTAGCCAAATGGGCACACAATCTGCAGTTGGATGCTGCACAGTTGCCCAATATGAGAATCTGTAACCTGCACTTTGAAGCACATTGCATAGGCAAGCGCATGCGTCCATGGGCCATACCCACGCTCAATTTATCCACAACTGTGGAGAACTTATATGAGAATCCTGAACATAATCAGATCTTTAAGCGTCGCACGTTTCTTAAGGAACGGGTTGCCAAAAGCGTTGCCTCGTCTAGCGTTGGCGGTATTAAACCTACTTGGGTGCCACGTTGCTGCTTGTCACATTGTCGCAAGGTGCGCGCGCTCCACAATGTGCAACTTTATCGCTTCCCCAAACTGAATCGTTCCACATTAGCCAAGTGGGCACACAATTTACAGGTGCCAATGGTTGGCAGTGCACAGCGACGCCTTTGCTCTGCGCACTTTGAACAGCATGTCTTGAGCAAGAAGTGCCCCGTAC
CATTGGCAGTGCCAACTCTGGATCTGAACACACCACTTGGCTACAAAATCTATCAAAATCCGGCCAAGTTGAAGGCTAACAAACTGTGCCTGCAGCGCGTTTGCATTGTGGAGAGCTGCCGTCGTCAACGTGGTCAGGGCGTGCAACTCTTCCGGCTGCCGCATAATCCCACACAACTGCGCAAATGGATGCACAATATACGCATGCGTCCCAGAGGTTCTATGCGTCAACAATATCGCATTTGTTCCATGCACTTTGAAACGCACTCGTTCAATGGTAAACGCTTAAGCGCTGGTGCTATTCCCACTCTGGAATTGGGTCATCAGGATAACGATATTTATCCCAATGAGGCGCAATCATTTGTGGAGGAACACTGCACTGTGGAGGGCTGCGAAGCGACCAAGGAACAACCCGATGTACGCTTATTCCGTTTTCCCACGGAGGACGATGATCTGCTATGGAAATGGTGCAACAATCTCAAAATGAATCCAGCCGATTGTATTGGAGTTCGCATCTGCAACAAACATTTTGAGCTGGACTGCATTGGACCCAAGCATCTATACAAATGGGCTATACCAACGCTAGTCTTAGGTCACAATGATGTTGAGATTGAGCTTGTAATCAATCCGAAACCAGAGGAACGCTACGTTGATCCCGTGTTCAAATGTTGTGTGCCAACTTGCGGCAAAACACGCAGATTTGATGAGGTACAAATGAATAGTTTCCCCAAGAATCCTGAACTGTTTCAGCGTTGGCGTCACAACCTCCGACTGGATCATCTCAACTTTAGGGAACGTGAAAAGTATAAGATCTGCAATGCACACTTTGAGGAGATTTGCATTGGAAAGACACGTCTTAATATTGGCTCTATACCTACGCTGGAGTTGGGACACGATGAGACTGAAGATATTTATCATGTGAATCCTGCAGAATTGCAAAGCAATTTGTTTGGACGGCAGCGACGAGTACGCTCCCGAAGTGCTTCCATCAAGAAGGAGGAAACTTCAGAGGATGATAGTCTAGTCCCAATACTAGACATCGAGAGGCAGGTTAAAACTAGAAAATCCTTGGCTAATCTCAAGTGTTGTGTGCTAACCTGTGGGCGTAGTCGTTTGGAGCATGGAGCACGTTTGTTTTCCTTTCCGAAtggtaagcagcagcagagtaaATGGCGACAGAATCTACGTATGACTGCGGAGGAAGTGGACAGATCAACGCGTGTATGCAGCGCTCATTTTAATCGACGCTGCATTGATGGTAAAGACCTCCGTTCCTGGGCCATACCCACACAACAATTGGGCTACCAGGCCATCTATGAGAATCCAAAGAATATACCGGGTTTCTTTACACCAACCTGTGCGCTGGGTCACTGTCGCAAGCGGCGCAGCATAGATAATGATCTCAGAACATATCGATATCCTCGGAGCGAGGAGCTGCTCGAAAAATGGCGTATCAATTTGCGCCTGTCGCCTGATCAATGTCGCGGGCGCATATGTGCGGATCACTTTGAGCCGATGGTGCGTGGCAAGTTAAAGCTTAAAACAGGCGCGGTACCTACACTGAAACTTGGCCATGATGAGGGTATCATTTTTGACAATGAAGCCATCAAGGCAGGCCTGCAGCAGGAAGAGAATGAAGAGGATGAGGACATGCTGGCAACGCGGGTTAGGATCAAAAGTGATAATAAAATGgacgataatgatgatgaggcTGATTTTGATGCAGACCCTGACGacgatgctgatgatgatgatgaagatgatgatatGTTACATGATCAGGATAACGAACATGATCTGGAACACGAGGATGAAAAGGAACAAGATCAAGATCACGATCAGGATGAAAATAGTTACTTTGATCCTTTGGAACTTGTAGAAACCTATGCGGAGCAGCACAGCGAAAATAGTGCAGGTGAAGATGCGCTCGAGGATGAACTGGAAGATGTTAATGGCGATGATGATGAGCTGCTGCTACCGGATAATCCGCCAATACCAGTT
AAATTAATAGTGCCGTCCGTGACGCCCCGGCGTGAGAAGGCCGTAAATAATGTGACGCCCATTTGTTGCCTAAAGCATTGCCGTAAGGAGCGCACGGCCACCTATCAATTAAGCACTTTTGGATTTCCTAAGGATGAAAAGCTACTGCTTAAATGGAGCGCCAATCTGGAACTTTCACCTAGTGATTGTGTGGGACGTGTGTGCATTGAGCATTTTGAACCCGAAATGCTGGGCACGCGCAAGCTGAAGCAGAATGCGGTACCTACTCTGAATCTGGGACACACGACGCCGCTTCGTTATAGCTGCAATGGGCAGACGTTGACCATACATGATGCACAGCCGCAGCATTCGGTTTTTCGGCTTTGGAGCCTGAAACACTGTCGAAAAAGGAAGCTGCTAATGGTGGATCCACCGGATCAGAAAGCGAGTAAGTTGCAACCAAAACGTTGCTGCTCGTTACCCAGTTGCGGCAAGCAGCCGGAGATGCATGATGTGCAACTACATCGTCTACCAAGTAATCGTGTAATGCTTCACAAGTGGCTACATAACTTGAAACTACCGTCCGACATAGACATTACCACGCAATGGATTTGCTCGGAGCACTTTGATCCACATTTGTTGAATGGTGGTGAGAAGTCATTGGAGGAGTTTGTGCCCACCATAAAACTGGGACATAATGATACCGATTTATATCAAAATCCTGAACAAGGTAACCTTATCAATAGTCACACAAGCACATCCAGTGCCTGCCTGGTACCCAGTTGTCCATGTGCACGACTTAACCTCTATCGCTGCTATGATCTGCCCGAGAATCGAGTGGTGCAACAGACTTGGCTAAAATGGCTGCAGCTACCCATGCCGCAGCTGCCGAGCGATGCTCAACTGTGTGTCATGCACTACATGGAGCTGTATGAGAAAGTGCCGCTTTCCAAGGAACTGCCTGTTGCAGTGCGGAAACAACTGAAGGAAACCTATGAAATCATATCAAAATCTTCGATGGCCATGAAGCTACGTTGTGCCGTGCCCGGCTGTTACTCCAAGTACACAGATAATATACGACTCACCAAACTGCCCATATGTCAGGATACCTGTGCCAAATGGGTGCACAATACCAAAATAGATTACGATCCCATGCGGCATTATGTCTATCGCATTTGCCTGCTGCACTTTGAGCCCCGTTGCTTGGGCCCAGTGCGTCCAAAGCAGTGGGCAATGCCCACACTGGAACTTAAACATAAAGATACCAATATCTATGAAAATCCAAAGATGGAAACCATTTCGTTGCCGCAACCAGAGCCAATCGAATTACCGCTACGCATTAAAACTGAACTCGCTCTAAGCGCCAGtcccagtgccagtgccagtccTAGTCCATTGGGTAGACAGCGCATCTGTTGCATACCCAGCTGCAGCCAGCATGCCACAGTGCTACTGCGTCTGTATCGCTTTCCCAACATTGAAACAACGTTGTTGAAGTGGCTAGTGAATACGCAGCAGAGTCCACGTTTCATCGATCCACATCATTTGTTTGTATGCCAGGATCATTTTGAGCCCGAGGCAATATGCAAGAATCAACTGCGCAGCTGGGCAGTGCCCACACTTAAGCTGGGACATGAAGGTCATATCATACCAAATGCCAAACACAATGGCAATATTGCTGATAGTCAGGAGAATAAGCTCACGCTGCAATTTATTTGGGAGAACTACTGCTCAATATTGTCATGCTTCCAGCAGCGCAGCGACCAGGTGCGTCTCTATGCTTATCCCAAGGATAAGCCCACGATACGTAAATGGGCCGCAAACTGCAAGCATCGCTCTATGCAGGCGAGCAGCGATGGGTTTCAGgtcTGCGAATCGCACTTCACAGAGGACTGCTTTGACCCTGAGACGAGAGAGCTGCAGGAGGGCGCAGTGCCCACGCTTGAGCTGAGCCGTGCAGTAAATAAATTGTGCTGCATTGTGCCTGGCTGTGCGAAGGATGAGGACGCCCCACGTCGGCG
CTTCTTCAAGGTGCCAAAGCGCTGCGGTCAGCTAGTTGATTGGTGCCACAATTTGTGTTTGGATCCAGCTAATTTTAATGGCACTGACGCGCATGTTTGTGATCGTCACTTCGAGCCGTACTGTTTCAATCCTTATAAAATGTTACGTCCCGGTGCACGTCCAACACTGCATTTGGGCCATGATAGGGACAGGGAACTTATACCCAATCCGGCAAATTGGGCACGTTCCGCCGAAGGAACTATGGTATGTTGTGTGCCCAATTGTGGACGTTCCAAAACAATAGATAATGTGGAACTATTTCGCCTACCAAAACAACGTTCATTGTCTGACCGTTGGCTGCACAATGTGCGCCTAGATCCCAGCGTCTTGGAGCTCTCCAAGCTATGGTTTTGCAGCGTGCATTTTGAACCCAGCAGCATGTTGAATGGACGTCCACAGTACAATACGTTGCCCACATTACAACTGGGTCATGACTACTACGACATTGTGCATTATTCCTATAATCAGGCGAACAACAAGATAAAGGGgtattgcaataaaaatggtTCCAGCTACGACTGCTGTTATCCGCAATGCACAGAGCTACAAAAGAGCTATCTACGCATCAGCTACGATTTGCCCATGAAGGAGGAACTGCGACTTGCTTGGCTGGCATACATGGAAGTGGAGGAGTTGGAAGGGCAGCAACTGAAGCTTTGTCCATTACACTTGATTATGCTCTATGATCACAGTTTGGAAAACTTTCCAGAGCACTCGCCAGAGCCACTGTTAGATGGAACCTATAAAGATTCCCGTAATAGCGTGCGCATACGCCTTATAAGCTGCGCTGTAAGCGGCTGTAAGACACTTAAGCCACGAGATGGAGGCTGCCTGCATGGATTGCCACAGCGTCGGGATGTGCTGCAGATGTGGTTGTATAATATCAAATTGGATTACTACGAGCATCAACGTTATATGTACAAAATATGTAGCAAGCACTTTGAAGCTTCTTGCATTATAGAGACAACACGTCGTCTAAAACCCTGGTCCATGCCAACTTTGGATCTGCCAGAACATGAGGCGGATAAGCCACCACCCTACCAAAATCCTACAGAAGATGAATGGCAGCGTATGAATGAAGCATGGGCCTTTACTCAGGAGGCAGAAAAATTGCAAGAAAACGCTATGGCACCTGGTAATAGCTGCATGACAGAATCCATTGTTAAGAATGAGCCAGACACAGAGGACGACTCATTAGCATTCACAGGGGCGGATCAGTTGCTTGAAATTGACAATTCAAAAGAGTGTGCTGGCATGGACAGCCATCCCATGCACTTTTtggagcagcaacaatacaatGAAGACAGCAATTCACAACATCCGCTGGAAATGCAAGCTTTGGAAGTGATGCTGGAAGTGGCGCATGTAGAAAACTGTGCAACCTATGAGCAAATGGACACCGAAGCGGATCTAGGCTATGCCGACCAACAGGTGCATAACTATGGTGCAACGGCTCAACCaggtggcagtggcaacatGCCTAACAGTCTTAACTATTACAGCGCACGTCATTGTAGTGTACAGGGCTGTGATGTCACAGATAAGGATGTGGATGGCAACATAAAGTTGCATAAGTTTCCAGTCTCATTGGAGGCTTTGAAAAAATGGATGCACAATACTCAGGTCCATGTGGATACAAACTATGGTTGGCGTTTTCGCATCTGTAGCTATCACTTTGTACCCGAATGCTTTAATGGTTCGCGCATAAGGCGTGGCTCCATGCCCACCTTGCGTCTGGGTTCAAACAAACCCGATGAGATATATGATAATGAGTTTACGACAATGCAGCAGGTCGAGCAGCTCAATAAGGAACAGCTAAACTGTACTGACCTGGACTCGCATGAGGAATATGTGCAGGATTCAGAGATAACCATGTGCCTGCCAGAACCTGCACCACCACGCAAATCGAGTAAATTTTGTCAAATCGAGAGTTGTCCCAATCATTTGACCAGCGAGAATA
TGACACTTCATAAGTTTCCACATTCATTGGATATGTGCCGCAAGTGGCAGCACAATACGCAGGTTCCTTTTGATCCAGTGTATCGTTGGCGCTATCGCATCTGCACTGCACACTTTGAACCCATCTGCTTGTCCAATATGCGTCTAATGCATGGCAGTGTGCCCACTTTACATCTAGGTCCACATGCACCGCGCCAGCTGTTTGAAAACGATTTTGAGGCAATTAATACACGTTTGGATAAACAGAAAGGCAGCTCCTCCGAGCAGTTATTTATCAAGCAGGAGAAAGAGGAGGAACCGGAATATCAATCAGACCAAGAGGAGGATATGAATTTGCTTGTGCCAGAGCTGCAATTGCATGAGAATGGACAAGATTCCGATAATCCAGTGCACTACAGTAAACCTAGCAGCTGGACggatcagcagcaacagttacgCTTACCCAGCGTTGAGGCAATCACCTATAATAGAGTCAAGTCTGGCTATGATAAGTGCTCACTGGTGCATTGCCAACGACAACGTTCCCAGCATGGAGTGCACATATACAAGTTTCCAAGATCACGTTTACTGCAGCAGCGTTGGATGCACAATTTGCGCATTAAGTACGATGAGCGACGTCCCTGGAAGACTATGATATGCAGTGGACACTTTGAGCCCAATTGCATAAGGTTACGCAAGTTACGTCCATGGGCGGTGCCCACATTGGAACTGGGAAACAATGTACCAAATCAGATATACACAAATGAACAGAGCCAGCAGCATTTTGCTATGGAACGCGGCGATTTCGACACGGGCAGTGATGAAGACAATTTCGATTGTGAAATGGAAGAAACAATGTTGGATGAGGAATTCGTtcaggaggaggaggaggaacaTGCTGATAAACCTGATGATAAACGTGATGATAAACCTGATGATAAACCTGATGATAAACATGATGATCAACCTGATGATCAACCTGATGATCAACCTGATGATCAACCTGATGATCAACCTGATGATAAACATGATGATCAACATGATGATGAACATGATGATAAACCTGATAATAATCATGATGATAAACATGATGAGTTGGTtacaacaaagacaaaaacTAAGCAAGAGCGTCGTTCAAATGATCAATTGCCGCCAGGTCAATCGCAagttttgaaaaataaatattgctgCTTGCCATATTGCCGCAATCCCCGTGGCGATGGCATCAAGCTGTTTCGTCTGCCCAACAACTCAATTTCGCTGCGCAAGTGGGAGCAAGCGACTGGCATGAGTTTTAATGAGTCACAACGCAACACAAAGCTCATTTGCAGCCGTCACTTTGATGCCAATCTAATTGGTGTGCGTCGTCTTATGTTCAATGCAGTTCCAACCCTTAATTTGCAACCAGACAAAGCTAATGCTCAACCTGTGTTACCGCCAGTTCCTCCTCCGCGATGCTTAGTGTCAGAGTGCCAAAGCACAAATGTCAAACTGCACAAATTTCCCAAAGATCCAATGCTTTTTCATCAATGGTGCCAGGCTTTAAATCTTACAAATATTCATAGATTCCGTGACAAATATATTTGTGCAGTTCATTTGCCTCCCAAAGCAATGACTTGCACAATCTGTGGAGTGGAGAATATACAATTGCCAATGCTTAATTTTCCAGAGAATCGTAATCAGCGCGCCAAATGGTGTTATAATCTTAAAATCGAATCCATTGCCAAGTGGGatgacataaaatatatttgctctAAGCATTTTGAAAGCTATTGCTTCATTGAACCGGGTAAACTGCTACCAGAGGCAGCACCTACCTTACATTTAAACCATAATGAcagaaacatatttttaaatgattatgcaacagacaacagcaaGTTGCCACGCATCAAAGACGAGCCCATAGACAGCGATGATTTGATGCTGTAG
Protein Sequence
MSQHNPPHQHHHHYYQEEQQQQQHHHQQQQQQQLQHKQIQQQSWYSHVASYPSHHPHVAGFSAPCKNNNNNNNNNMNAYESGAGSSHTAYYGSSAAGTAGVGYNLEGNNVAYAHNQLLQYQQQQQQQHHHHHQQQQQHHQQQQQQQQHHHLSQRSYMQHNLMHGSYPYIKSEPLELPDDRQRRQRQQQQQQQQQQHNQHHHQQQHFQNPMAPPPAPANRHPHDASGEMIIKSEPIDEHAYKSNYIDDNTPFADFSKYPEFNDDMLNPKIELTVKDEGYGNQKNPLSYPRRKLQSERSSESLPICQRCKEVFFKKQIYLRHVAESSCTIQEYDFKCNLCPMSFMSIEELHKHKHLHRTEKYFCHKYCGKHFDTIAECESHEYMEHEYDSFVCNMCSLTFANREQLYAHLPQHKYQQRYDCPICRLWYQTTLELHEHRMAAPYFCGKYYVAAHQQQPHTQQHQQQHAHQQHQHSQHHQPNYKLQDCQLGSVEMQHHKAIPTASALPATAALNSLLQQRQANADGAPMFGSTMKTESNIKLERSYSNSTSESGYSLHDSSYNNAYASDTSLHGGSGAIGGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTLPKDEKYLHQWLHNLKMFHIPASSYASFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPNCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPAVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHHDTFNIYENELWPPPTASTPTNHQQHQLQQHQLQQHQQQHAHHSKYQRQSAASTSSSASSASHYVDPDLSASYMSLNVSGGSGSALNVSDSMDICCVPSCESKRHNNENITFHTIPRRPEQMRKWCHNLKISEDKMHKGMRICSLHFESYCIGGCMRPFAVPTLQLGHDDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVALLSKWCANLQRSVPDGTKLFNDAICEVHFEDRCLRNKRLEKWAVPTLVLGHDNIAYPLPTPEQVAEFYARPTAPNNGEEQGECCVDTCKRNPSVDDIKLYRPPEESQVLAKWAHNLQLDAAQLPNMRICNLHFEAHCIGKRMRPWAIPTLNLSTTVENLYENPEHNQIFKRRTFLKERVAKSVASSSVGGIKPTWVPRCCLSHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPMVGSAQRRLCSAHFEQHVLSKKCPVPLAVPTLDLNTPLGYKIYQNPAKLKANKLCLQRVCIVESCRRQRGQGVQLFRLPHNPTQLRKWMHNIRMRPRGSMRQQYRICSMHFETHSFNGKRLSAGAIPTLELGHQDNDIYPNEAQSFVEEHCTVEGCEATKEQPDVRLFRFPTEDDDLLWKWCNNLKMNPADCIGVRICNKHFELDCIGPKHLYKWAIPTLVLGHNDVEIELVINPKPEERYVDPVFKCCVPTCGKTRRFDEVQMNSFPKNPELFQRWRHNLRLDHLNFREREKYKICNAHFEEICIGKTRLNIGSIPTLELGHDETEDIYHVNPAELQSNLFGRQRRVRSRSASIKKEETSEDDSLVPILDIERQVKTRKSLANLKCCVLTCGRSRLEHGARLFSFPNGKQQQSKWRQNLRMTAEEVDRSTRVCSAHFNRRCIDGKDLRSWAIPTQQLGYQAIYENPKNIPGFFTPTCALGHCRKRRSIDNDLRTYRYPRSEELLEKWRINLRLSPDQCRGRICADHFEPMVRGKLKLKTGAVPTLKLGHDEGIIFDNEAIKAGLQQEENEEDEDMLATRVRIKSDNKMDDNDDEADFDADPDDDADDDDEDDDMLHDQDNEHDLEHEDEKEQDQDHDQDENSYFDPLELVETYAEQHSENSAGEDALEDELEDVNGDDDELLLPDNPPIPV
KLIVPSVTPRREKAVNNVTPICCLKHCRKERTATYQLSTFGFPKDEKLLLKWSANLELSPSDCVGRVCIEHFEPEMLGTRKLKQNAVPTLNLGHTTPLRYSCNGQTLTIHDAQPQHSVFRLWSLKHCRKRKLLMVDPPDQKASKLQPKRCCSLPSCGKQPEMHDVQLHRLPSNRVMLHKWLHNLKLPSDIDITTQWICSEHFDPHLLNGGEKSLEEFVPTIKLGHNDTDLYQNPEQGNLINSHTSTSSACLVPSCPCARLNLYRCYDLPENRVVQQTWLKWLQLPMPQLPSDAQLCVMHYMELYEKVPLSKELPVAVRKQLKETYEIISKSSMAMKLRCAVPGCYSKYTDNIRLTKLPICQDTCAKWVHNTKIDYDPMRHYVYRICLLHFEPRCLGPVRPKQWAMPTLELKHKDTNIYENPKMETISLPQPEPIELPLRIKTELALSASPSASASPSPLGRQRICCIPSCSQHATVLLRLYRFPNIETTLLKWLVNTQQSPRFIDPHHLFVCQDHFEPEAICKNQLRSWAVPTLKLGHEGHIIPNAKHNGNIADSQENKLTLQFIWENYCSILSCFQQRSDQVRLYAYPKDKPTIRKWAANCKHRSMQASSDGFQVCESHFTEDCFDPETRELQEGAVPTLELSRAVNKLCCIVPGCAKDEDAPRRRFFKVPKRCGQLVDWCHNLCLDPANFNGTDAHVCDRHFEPYCFNPYKMLRPGARPTLHLGHDRDRELIPNPANWARSAEGTMVCCVPNCGRSKTIDNVELFRLPKQRSLSDRWLHNVRLDPSVLELSKLWFCSVHFEPSSMLNGRPQYNTLPTLQLGHDYYDIVHYSYNQANNKIKGYCNKNGSSYDCCYPQCTELQKSYLRISYDLPMKEELRLAWLAYMEVEELEGQQLKLCPLHLIMLYDHSLENFPEHSPEPLLDGTYKDSRNSVRIRLISCAVSGCKTLKPRDGGCLHGLPQRRDVLQMWLYNIKLDYYEHQRYMYKICSKHFEASCIIETTRRLKPWSMPTLDLPEHEADKPPPYQNPTEDEWQRMNEAWAFTQEAEKLQENAMAPGNSCMTESIVKNEPDTEDDSLAFTGADQLLEIDNSKECAGMDSHPMHFLEQQQYNEDSNSQHPLEMQALEVMLEVAHVENCATYEQMDTEADLGYADQQVHNYGATAQPGGSGNMPNSLNYYSARHCSVQGCDVTDKDVDGNIKLHKFPVSLEALKKWMHNTQVHVDTNYGWRFRICSYHFVPECFNGSRIRRGSMPTLRLGSNKPDEIYDNEFTTMQQVEQLNKEQLNCTDLDSHEEYVQDSEITMCLPEPAPPRKSSKFCQIESCPNHLTSENMTLHKFPHSLDMCRKWQHNTQVPFDPVYRWRYRICTAHFEPICLSNMRLMHGSVPTLHLGPHAPRQLFENDFEAINTRLDKQKGSSSEQLFIKQEKEEEPEYQSDQEEDMNLLVPELQLHENGQDSDNPVHYSKPSSWTDQQQQLRLPSVEAITYNRVKSGYDKCSLVHCQRQRSQHGVHIYKFPRSRLLQQRWMHNLRIKYDERRPWKTMICSGHFEPNCIRLRKLRPWAVPTLELGNNVPNQIYTNEQSQQHFAMERGDFDTGSDEDNFDCEMEETMLDEEFVQEEEEEHADKPDDKRDDKPDDKPDDKHDDQPDDQPDDQPDDQPDDQPDDKHDDQHDDEHDDKPDNNHDDKHDELVTTKTKTKQERRSNDQLPPGQSQVLKNKYCCLPYCRNPRGDGIKLFRLPNNSISLRKWEQATGMSFNESQRNTKLICSRHFDANLIGVRRLMFNAVPTLNLQPDKANAQPVLPPVPPPRCLVSECQSTNVKLHKFPKDPMLFHQWCQALNLTNIHRFRDKYICAVHLPPKAMTCTICGVENIQLPMLNFPENRNQRAKWCYNLKIESIAKWDDIKYICSKHFESYCFIEPGKLLPEAAPTLHLNHNDRNIFLNDYATDNSKLPRIKDEPIDSDDLML
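
The coding sequence and protein sequence above can be cross-checked by translating one into the other. The sketch below assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly; the helper names `clean` and `translate` are hypothetical. Line wraps are removed and lowercase letters are upper-cased before translation.

```python
from itertools import product

# ASSUMPTION: standard genetic code (NCBI translation table 1).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove whitespace and line wraps, and upper-case lowercase bases."""
    return "".join(seq.split()).upper()


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = clean(cds)
    if len(seq) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 and last 12 bases of the coding sequence above.
print(translate("ATGTCACAACACAACCCCCcg"))
print(translate("TTGATGCTGTAG"))
```

The two fragments translate to `MSQHNPP` and `LML`, matching the start and end of the protein sequence listed above.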

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-