Zqui020139.1
Basic Information
- Insect
- Zygothrica quintamaculata
- Gene Symbol
- -
- Assembly
- GCA_037044455.1
- Location
- JBAMBS010002739.1:152556-166715[-]
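The Location value above packs contig, coordinates, and strand into a single string. A minimal sketch of splitting it into fields follows. The `Location` type and `parse_location` helper are hypothetical names introduced for illustration, and the span calculation assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'contig:start-end[strand]' string."""
    contig: str
    start: int
    end: int
    strand: str


# Pattern for strings such as 'JBAMBS010002739.1:152556-166715[-]'.
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end, and strand."""
    m = _LOCATION_RE.match(text.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(m["start"]), int(m["end"])
    if start > end:
        raise ValueError(f"start exceeds end in {text!r}")
    return Location(m["contig"], start, end, m["strand"])


loc = parse_location("JBAMBS010002739.1:152556-166715[-]")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = loc.end - loc.start + 1
print(loc, span)
```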
Transcription Factor Domain
- TF Family
- THAP
- Domain
- THAP domain
- PFAM
- PF05485
- TF Group
- Zinc-Coordinating Group
- Description
- The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
- Hmmscan Out
-
 #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
 1  31         6   2.2e+04   -6.3   3.8        54      86       135     169       115     170  0.51
 2  31         6   2.2e+04   -5.0   2.6        23      59       176     207       172     223  0.62
 3  31       4.4   1.6e+04   -3.0   3.4        45      62       357     376       340     391  0.56
 4  31   1.6e-14   5.9e-11   43.3   3.7         1      86       593     665       593     666  0.85
 5  31   1.9e-15   7.2e-12   46.2   4.6         1      87       693     762       693     762  0.83
 6  31   4.7e-16   1.7e-12   48.2   0.4         1      87       784     856       784     856  0.85
 7  31   5.3e-16     2e-12   48.0   5.4         1      86       954    1023       954    1024  0.81
 8  31   3.8e-15   1.4e-11   45.3   3.3         1      86      1048    1119      1048    1120  0.82
 9  31   2.8e-13     1e-09   39.3   0.9         1      87      1155    1223      1155    1223  0.81
10  31   2.7e-10   9.9e-07   29.7   1.8         1      86      1271    1340      1271    1341  0.76
11  31   3.4e-17   1.3e-13   51.8   0.1         1      86      1368    1437      1368    1438  0.83
12  31   7.6e-13   2.8e-09   37.9   0.9         1      86      1459    1528      1459    1529  0.81
13  31   4.7e-15   1.7e-11   45.0   1.9         1      86      1556    1627      1556    1628  0.85
14  31   8.6e-14   3.2e-10   40.9   1.6         1      85      1695    1763      1695    1765  0.82
15  31   3.2e-12   1.2e-08   35.9   0.1         1      86      1786    1854      1786    1855  0.81
16  31   1.2e-14   4.3e-11   43.7   0.4         1      87      2022    2091      2022    2091  0.81
17  31     3e-12   1.1e-08   36.0   0.1         1      86      2151    2221      2151    2222  0.77
18  31    0.0026       9.4    7.4   0.1         1      59      2250    2301      2250    2325  0.75
19  31   1.3e-11   4.8e-08   33.9   0.7         1      86      2339    2408      2339    2409  0.84
20  31     2e-13   7.5e-10   39.7   2.6         1      86      2465    2534      2465    2535  0.83
21  31   2.2e-14   8.1e-11   42.8   1.0         1      86      2570    2641      2570    2642  0.82
22  31   6.7e-12   2.5e-08   34.9   2.5         1      87      2652    2724      2652    2724  0.80
23  31   8.2e-12     3e-08   34.6   0.1         1      64      2750    2810      2750    2823  0.76
24  31   0.00026      0.98   10.5   0.0         1      58      2854    2904      2854    2916  0.82
25  31   2.1e-14   7.7e-11   42.9   0.1         1      86      2942    3014      2942    3015  0.81
26  31   3.4e-14   1.2e-10   42.2   0.3         1      86      3185    3257      3185    3258  0.82
27  31   2.3e-12   8.6e-09   36.3   3.6         1      87      3320    3391      3320    3391  0.81
28  31   8.6e-15   3.2e-11   44.1   2.5         1      86      3498    3568      3498    3569  0.85
29  31   5.1e-14   1.9e-10   41.7   0.1         1      87      3722    3792      3722    3792  0.83
30  31   2.1e-09   7.9e-06   26.8   1.8         1      59      3812    3860      3812    3873  0.86
31  31     3e-10   1.1e-06   29.6   0.4        18      87      3876    3934      3865    3934  0.74
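The C2CH consensus quoted in the Description (Cys, 2-4 residues, Cys, 35-50 residues, Cys, 2 residues, His) can be written as a regular expression for a quick scan of a protein sequence. The following is a minimal sketch only: the `find_c2ch` helper and the `toy` sequence are hypothetical, the pattern encodes just the spacing stated above, and a hit is a candidate rather than evidence of a THAP domain. Profile-HMM searches such as the hmmscan results listed in this record are far more sensitive and specific.

```python
import re

# C2CH consensus from the Description:
#   Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His
# The pattern sits inside a lookahead so that a candidate is tested at
# every start position, including ones that overlap an earlier hit.
C2CH = re.compile(r"(?=(C.{2,4}C.{35,50}C.{2}H))")


def find_c2ch(protein: str) -> list[tuple[int, int]]:
    """Return 1-based inclusive (start, end) pairs of candidate C2CH spans."""
    hits = []
    for m in C2CH.finditer(protein.upper()):
        hits.append((m.start(1) + 1, m.end(1)))
    return hits


# Synthetic toy sequence (not from this record) built to satisfy the spacing:
# C, 2 residues, C, 40 residues, C, 2 residues, H.
toy = "MA" + "C" + "AA" + "C" + "G" * 40 + "C" + "KS" + "H" + "LL"
print(find_c2ch(toy))
```

To scan the protein in this record, the Protein Sequence value could be passed to `find_c2ch` as a single string with line breaks removed; candidate spans can then be compared against the `ali from` / `ali to` coordinates in the hmmscan table.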
Sequence Information
- Coding Sequence
- ATGTCACAACACAACCCCCcgcatcaacatcatcatcactaCTATcaggaggagcagcagcagcagcaacatcatcatcaacaacaacaacagcagcagctacaacataaacaaatacaacagcaGAGTTGGTACTCACATGTTGCTTCCTACCCTTCCCATCATCCACACGTTGCTGGCTTTTCGGCGCcatgcaaaaacaataataataacaacaacaataacatgaATGCATACGAAAGTGGAGCTGGAAGTTCGCATACAGCATATTATGGCTCCTCTGCAGCAGGTACGGCTGGGGTGGGCTATAACCTTGAGGGCAATAATGTGGCCTATGCGCACAACCAGCTGCTGCagtaccaacaacagcaacaacaacaacatcatcatcatcatcaacagcaacaacaacatcaccaacaacaacagcagcagcagcagcaccaccaccttAGTCAACGCTCGTATATGCAGCATAATTTAATGCATGGTTCATATCCTTACATCAAAAGTGAGCCATTGGAGCTGCCTGATGATAGACAACgtcgacagcgacagcagcagcagcaacaacaacagcaacagcataatCAGCAtcatcaccaacaacaacattttcagAATCCAATGGCACCGCCGCCAGCACCCGCCAATCGCCACCCACACGATGCCAGCGGtgaaatgataataaaatCGGAACCTATAGACGAACATGCGTACAAGTCAAACTATATCGATGATAATACACCGTTCGCCGATTTTAGTAAATATCCAGAATTTAACGACGATATGTTGAACCCTAAGATTGAATTAACTGTCAAAGATGAAGGCTATGGAAACCAGAAGAACCCGCTTAGCTATCCCCGCCGCAAACTGCAAAGTGAGCGTTCATCTGAGAGTTTGCCCATATGCCAGCGTTGCAAGGAAGTATTCTTTAAGAAGCAAATCTACTTGCGGCATGTGGCCGAAAGCAGTTGCACTATTCAAGAGTATGACTTTAAATGTAACCTATGTCCTATGTCCTTCATGAGCATTGAAGAACTGCACAAGCATAAGCATCTACATAGAACGGAGAAATACTTTTGTCACAAATATTGTGGCAAACACTTTGATACAATTGCTGAATGCGAATCCCATGAGTACATGGAGCATGAGTATGATAGCTTTGTATGCAATATGTGCTCCCTTACGTTTGCCAACAGAGAGCAATTGTATGCGCATCTGCCACAACATAAGTATCAGCAGCGTTATGATTGCCCCATATGCCGATTGTGGTATCAAACAACGTTGGAGTTGCATGAGCATCGCATGGCAGCGCCGTACTTCTGTGGCAAATATTACGTTGCAGCgcatcagcaacagccgcatacgcagcagcatcaacaacagcatgcacatcaacagcatcagcattcaCAGCATCATCAGCCCAATTACAAACTGCAGGATTGTCAATTGGGTTCCGTTGAAATGCAACACCATAAGGCGATTCCAACAGCCAGCGCATTGCCTGCAACGGCAGCACTCAATTCGCTATTGCAGCAACGACAGGCTAATGCAGATGGTGCACCAATGTTTGGCTCTACCATGAAAACTGAGAGCAATATCAAGCTGGAGCGTAGCTATAGCAATTCTACAAGTGAATCTGGTTACAGCCTGCACGATAGCAGCTATAACAATGCTTATGCTAGCGATACGTCGCTACATGGAGGCAGTGGTGCAATTGGTGGACCACAAGCGCACTCCTCCACGCTAGACGATTCTGAGGATGCACTCTGTTGTGTGCCATTATGTGGTGTACGCAAGAGCACCAGTCCAACGCTGCAATTCTTCACGCTTCCAAAGGATGAAAAGTATTTGCATCAGTGGTTGCATAATCTAAAGATGTTCCACATTCCGGCGTCAAGCTATGCCAGTTTCCGTATTTGCAGCATGCACTTTCCGAAGCGTTGCATCAATCGTTACTCACTCTGCTACTGGGCAGTACCCACATTCAAT
CTGGGTCACGATGATGTTGccaatttatatcaaaatcgCGAACTAACCAATACATTTACCACTGGTGAAGTGGCGCGTTGCAGCATGCCCAACTGCACCAGCCAACGCGGAGAGAGTAATCTTAAATTCTACAACTTTCCCAAGGATATTAAGAGCCTAATAAAATGGTGTCAAAACGCACGTCTTCCTGTCCAGGCCAAGGAGCCTCGTCACTTCTGCAGTCGCCACTTTGAGGAGCGTTGCATTGGCAAGTTTCGATTAAAACCATGGGCAGTCCCCACGCTACATTTAGGTGCACAATACGGAAAGATTCATGACAATCCGAAGAACTTGTATGTGGAGGAGAAACGTTGCTGTCTCAATTTCTGTCGACGCAGTCGCTCCTCTGATTTCAACATGTCATTGTACCGTTTTCCCAGAGATGAAGTGCTGCTGCGACGTTGGTGCTACAACCTACGCCTCGATCCTGCTGTCTATCGTGGCAAGAATCATAAAATATGTAGCGCTCACTTCATCAAGGAAGCTTTAGGCCTACGTAAGCTATCACCAGGTGCTGTTCCTACGCTTCATCTGGGACACCACGATACCTTTAATATATACGAGAATGAGTTGTGGCCACCACCGACAGCATCTACGCCCACCAATCATCAACAGCAccaactgcagcagcatcagttacagcaacatcaacaacaacacgcacATCACAGCAAATATCAGCGACAGTCCGCTGCATCTACGTCATCATCGGCTAGCTCAGCATCGCATTATGTGGACCCAGACCTAAGTGCATCGTACATGAGTCTAAACGTCTCTGGTGGCTCCGGCTCAGCATTAAATGTCAGCGATAGCATGGACATCTGTTGTGTGCCCAGCTGCGAGAGCAAGCGACACAATAATGAGAATATAACGTTCCATACAATACCAAGGCGTCCGGAACAAATGCGTAAATGGTgtcacaatttgaaaatatccGAGGATAAGATGCACAAGGGCATGCGTATCTGTAGTCTACATTTTGAGTCCTACTGCATTGGCGGCTGCATGCGTCCATTTGCAGTGCCTACGTTACAGTTGGGACACGATGATGAGGATATACACCGCAATCCGGATGTAATCAAGAAGCTCAATATACGCGAAACATGCTGTGTTGCTGTATGCAAACGTAATCGTGATCGGGATCATGCCAATCTACATCGTTTCCCAAGCAATGTGGCGCTGCTCAGCAAGTGGTGTGCCAACTTGCAACGCTCAGTGCCGGATGGCACGAAACTCTTTAACGATGCAATTTGTGAGGTGCATTTTGAGGATCGCTGTCTGCGTAATAAACGATTGGAGAAATGGGCAGTACCCACGCTGGTATTGGGACATGATAACATTGCATATCCATTACCAACACCCGAACAGGTGGCTGAGTTTTATGCGCGTCCAACTGCACCGAACAATGGTGAAGAGCAGGGTGAATGCTGTGTGGATACATGCAAACGTAATCCAAGTGTCGATGATATTAAACTCTACAGACCGCCAGAGGAATCACAGGTGCTAGCCAAATGGGCACACAATCTGCAGTTGGATGCTGCACAGTTGCCCAATATGAGAATCTGTAACCTGCACTTTGAAGCACATTGCATAGGCAAGCGCATGCGTCCATGGGCCATACCCACGCTCAATTTATCCACAACTGTGGAGAACTTATATGAGAATCCTGAACATAATCAGATCTTTAAGCGTCGCACGTTTCTTAAGGAACGGGTTGCCAAAAGCGTTGCCTCGTCTAGCGTTGGCGGTATTAAACCTACTTGGGTGCCACGTTGCTGCTTGTCACATTGTCGCAAGGTGCGCGCGCTCCACAATGTGCAACTTTATCGCTTCCCCAAACTGAATCGTTCCACATTAGCCAAGTGGGCACACAATTTACAGGTGCCAATGGTTGGCAGTGCACAGCGACGCCTTTGCTCTGCGCACTTTGAACAGCATGTCTTGAGCAAGAAGTGCCCCGT
ACCATTGGCAGTGCCAACTCTGGATCTGAACACACCACTTGGCTACAAAATCTATCAAAATCCGGCCAAGTTGAAGGCTAACAAACTGTGCCTGCAGCGCGTTTGCATTGTGGAGAGCTGCCGTCGTCAACGTGGTCAGGGCGTGCAACTCTTCCGGCTGCCGCATAATCCCACACAACTGCGCAAATGGATGCACAATATACGCATGCGTCCCAGAGGTTCTATGCGTCAACAATATCGCATTTGTTCCATGCACTTTGAAACGCACTCGTTCAATGGTAAACGCTTAAGCGCTGGTGCTATTCCCACTCTGGAATTGGGTCATCAGGATAACGATATTTATCCCAATGAGGCGCAATCATTTGTGGAGGAACACTGCACTGTGGAGGGCTGCGAAGCGACCAAGGAACAACCCGATGTACGCTTATTCCGTTTTCCCACGGAGGACGATGATCTGCTATGGAAATGGTGCAACAATCTCAAAATGAATCCAGCCGATTGTATTGGAGTTCGCATCTGCAACAAACATTTTGAGCTGGACTGCATTGGACCCAAGCATCTATACAAATGGGCTATACCAACGCTAGTCTTAGGTCACAATGATGTTGAGATTGAGCTTGTAATCAATCCGAAACCAGAGGAACGCTACGTTGATCCCGTGTTCAAATGTTGTGTGCCAACTTGCGGCAAAACACGCAGATTTGATGAGGTACAAATGAATAGTTTCCCCAAGAATCCTGAACTGTTTCAGCGTTGGCGTCACAACCTCCGACTGGATCATCTCAACTTTAGGGAACGTGAAAAGTATAAGATCTGCAATGCACACTTTGAGGAGATTTGCATTGGAAAGACACGTCTTAATATTGGCTCTATACCTACGCTGGAGTTGGGACACGATGAGACTGAAGATATTTATCATGTGAATCCTGCAGAATTGCAAAGCAATTTGTTTGGACGGCAGCGACGAGTACGCTCCCGAAGTGCTTCCATCAAGAAGGAGGAAACTTCAGAGGATGATAGTCTAGTCCCAATACTAGACATCGAGAGGCAGGTTAAAACTAGAAAATCCTTGGCTAATCTCAAGTGTTGTGTGCTAACCTGTGGGCGTAGTCGTTTGGAGCATGGAGCACGTTTGTTTTCCTTTCCGAAtggtaagcagcagcagagtaaATGGCGACAGAATCTACGTATGACTGCGGAGGAAGTGGACAGATCAACGCGTGTATGCAGCGCTCATTTTAATCGACGCTGCATTGATGGTAAAGACCTCCGTTCCTGGGCCATACCCACACAACAATTGGGCTACCAGGCCATCTATGAGAATCCAAAGAATATACCGGGTTTCTTTACACCAACCTGTGCGCTGGGTCACTGTCGCAAGCGGCGCAGCATAGATAATGATCTCAGAACATATCGATATCCTCGGAGCGAGGAGCTGCTCGAAAAATGGCGTATCAATTTGCGCCTGTCGCCTGATCAATGTCGCGGGCGCATATGTGCGGATCACTTTGAGCCGATGGTGCGTGGCAAGTTAAAGCTTAAAACAGGCGCGGTACCTACACTGAAACTTGGCCATGATGAGGGTATCATTTTTGACAATGAAGCCATCAAGGCAGGCCTGCAGCAGGAAGAGAATGAAGAGGATGAGGACATGCTGGCAACGCGGGTTAGGATCAAAAGTGATAATAAAATGgacgataatgatgatgaggcTGATTTTGATGCAGACCCTGACGacgatgctgatgatgatgatgaagatgatgatatGTTACATGATCAGGATAACGAACATGATCTGGAACACGAGGATGAAAAGGAACAAGATCAAGATCACGATCAGGATGAAAATAGTTACTTTGATCCTTTGGAACTTGTAGAAACCTATGCGGAGCAGCACAGCGAAAATAGTGCAGGTGAAGATGCGCTCGAGGATGAACTGGAAGATGTTAATGGCGATGATGATGAGCTGCTGCTACCGGATAATCCGCCAATACCAG
TTAAATTAATAGTGCCGTCCGTGACGCCCCGGCGTGAGAAGGCCGTAAATAATGTGACGCCCATTTGTTGCCTAAAGCATTGCCGTAAGGAGCGCACGGCCACCTATCAATTAAGCACTTTTGGATTTCCTAAGGATGAAAAGCTACTGCTTAAATGGAGCGCCAATCTGGAACTTTCACCTAGTGATTGTGTGGGACGTGTGTGCATTGAGCATTTTGAACCCGAAATGCTGGGCACGCGCAAGCTGAAGCAGAATGCGGTACCTACTCTGAATCTGGGACACACGACGCCGCTTCGTTATAGCTGCAATGGGCAGACGTTGACCATACATGATGCACAGCCGCAGCATTCGGTTTTTCGGCTTTGGAGCCTGAAACACTGTCGAAAAAGGAAGCTGCTAATGGTGGATCCACCGGATCAGAAAGCGAGTAAGTTGCAACCAAAACGTTGCTGCTCGTTACCCAGTTGCGGCAAGCAGCCGGAGATGCATGATGTGCAACTACATCGTCTACCAAGTAATCGTGTAATGCTTCACAAGTGGCTACATAACTTGAAACTACCGTCCGACATAGACATTACCACGCAATGGATTTGCTCGGAGCACTTTGATCCACATTTGTTGAATGGTGGTGAGAAGTCATTGGAGGAGTTTGTGCCCACCATAAAACTGGGACATAATGATACCGATTTATATCAAAATCCTGAACAAGGTAACCTTATCAATAGTCACACAAGCACATCCAGTGCCTGCCTGGTACCCAGTTGTCCATGTGCACGACTTAACCTCTATCGCTGCTATGATCTGCCCGAGAATCGAGTGGTGCAACAGACTTGGCTAAAATGGCTGCAGCTACCCATGCCGCAGCTGCCGAGCGATGCTCAACTGTGTGTCATGCACTACATGGAGCTGTATGAGAAAGTGCCGCTTTCCAAGGAACTGCCTGTTGCAGTGCGGAAACAACTGAAGGAAACCTATGAAATCATATCAAAATCTTCGATGGCCATGAAGCTACGTTGTGCCGTGCCCGGCTGTTACTCCAAGTACACAGATAATATACGACTCACCAAACTGCCCATATGTCAGGATACCTGTGCCAAATGGGTGCACAATACCAAAATAGATTACGATCCCATGCGGCATTATGTCTATCGCATTTGCCTGCTGCACTTTGAGCCCCGTTGCTTGGGCCCAGTGCGTCCAAAGCAGTGGGCAATGCCCACACTGGAACTTAAACATAAAGATACCAATATCTATGAAAATCCAAAGATGGAAACCATTTCGTTGCCGCAACCAGAGCCAATCGAATTACCGCTACGCATTAAAACTGAACTCGCTCTAAGCGCCAGtcccagtgccagtgccagtccTAGTCCATTGGGTAGACAGCGCATCTGTTGCATACCCAGCTGCAGCCAGCATGCCACAGTGCTACTGCGTCTGTATCGCTTTCCCAACATTGAAACAACGTTGTTGAAGTGGCTAGTGAATACGCAGCAGAGTCCACGTTTCATCGATCCACATCATTTGTTTGTATGCCAGGATCATTTTGAGCCCGAGGCAATATGCAAGAATCAACTGCGCAGCTGGGCAGTGCCCACACTTAAGCTGGGACATGAAGGTCATATCATACCAAATGCCAAACACAATGGCAATATTGCTGATAGTCAGGAGAATAAGCTCACGCTGCAATTTATTTGGGAGAACTACTGCTCAATATTGTCATGCTTCCAGCAGCGCAGCGACCAGGTGCGTCTCTATGCTTATCCCAAGGATAAGCCCACGATACGTAAATGGGCCGCAAACTGCAAGCATCGCTCTATGCAGGCGAGCAGCGATGGGTTTCAGgtcTGCGAATCGCACTTCACAGAGGACTGCTTTGACCCTGAGACGAGAGAGCTGCAGGAGGGCGCAGTGCCCACGCTTGAGCTGAGCCGTGCAGTAAATAAATTGTGCTGCATTGTGCCTGGCTGTGCGAAGGATGAGGACGCCCCACGTCGG
CGCTTCTTCAAGGTGCCAAAGCGCTGCGGTCAGCTAGTTGATTGGTGCCACAATTTGTGTTTGGATCCAGCTAATTTTAATGGCACTGACGCGCATGTTTGTGATCGTCACTTCGAGCCGTACTGTTTCAATCCTTATAAAATGTTACGTCCCGGTGCACGTCCAACACTGCATTTGGGCCATGATAGGGACAGGGAACTTATACCCAATCCGGCAAATTGGGCACGTTCCGCCGAAGGAACTATGGTATGTTGTGTGCCCAATTGTGGACGTTCCAAAACAATAGATAATGTGGAACTATTTCGCCTACCAAAACAACGTTCATTGTCTGACCGTTGGCTGCACAATGTGCGCCTAGATCCCAGCGTCTTGGAGCTCTCCAAGCTATGGTTTTGCAGCGTGCATTTTGAACCCAGCAGCATGTTGAATGGACGTCCACAGTACAATACGTTGCCCACATTACAACTGGGTCATGACTACTACGACATTGTGCATTATTCCTATAATCAGGCGAACAACAAGATAAAGGGgtattgcaataaaaatggtTCCAGCTACGACTGCTGTTATCCGCAATGCACAGAGCTACAAAAGAGCTATCTACGCATCAGCTACGATTTGCCCATGAAGGAGGAACTGCGACTTGCTTGGCTGGCATACATGGAAGTGGAGGAGTTGGAAGGGCAGCAACTGAAGCTTTGTCCATTACACTTGATTATGCTCTATGATCACAGTTTGGAAAACTTTCCAGAGCACTCGCCAGAGCCACTGTTAGATGGAACCTATAAAGATTCCCGTAATAGCGTGCGCATACGCCTTATAAGCTGCGCTGTAAGCGGCTGTAAGACACTTAAGCCACGAGATGGAGGCTGCCTGCATGGATTGCCACAGCGTCGGGATGTGCTGCAGATGTGGTTGTATAATATCAAATTGGATTACTACGAGCATCAACGTTATATGTACAAAATATGTAGCAAGCACTTTGAAGCTTCTTGCATTATAGAGACAACACGTCGTCTAAAACCCTGGTCCATGCCAACTTTGGATCTGCCAGAACATGAGGCGGATAAGCCACCACCCTACCAAAATCCTACAGAAGATGAATGGCAGCGTATGAATGAAGCATGGGCCTTTACTCAGGAGGCAGAAAAATTGCAAGAAAACGCTATGGCACCTGGTAATAGCTGCATGACAGAATCCATTGTTAAGAATGAGCCAGACACAGAGGACGACTCATTAGCATTCACAGGGGCGGATCAGTTGCTTGAAATTGACAATTCAAAAGAGTGTGCTGGCATGGACAGCCATCCCATGCACTTTTtggagcagcaacaatacaatGAAGACAGCAATTCACAACATCCGCTGGAAATGCAAGCTTTGGAAGTGATGCTGGAAGTGGCGCATGTAGAAAACTGTGCAACCTATGAGCAAATGGACACCGAAGCGGATCTAGGCTATGCCGACCAACAGGTGCATAACTATGGTGCAACGGCTCAACCaggtggcagtggcaacatGCCTAACAGTCTTAACTATTACAGCGCACGTCATTGTAGTGTACAGGGCTGTGATGTCACAGATAAGGATGTGGATGGCAACATAAAGTTGCATAAGTTTCCAGTCTCATTGGAGGCTTTGAAAAAATGGATGCACAATACTCAGGTCCATGTGGATACAAACTATGGTTGGCGTTTTCGCATCTGTAGCTATCACTTTGTACCCGAATGCTTTAATGGTTCGCGCATAAGGCGTGGCTCCATGCCCACCTTGCGTCTGGGTTCAAACAAACCCGATGAGATATATGATAATGAGTTTACGACAATGCAGCAGGTCGAGCAGCTCAATAAGGAACAGCTAAACTGTACTGACCTGGACTCGCATGAGGAATATGTGCAGGATTCAGAGATAACCATGTGCCTGCCAGAACCTGCACCACCACGCAAATCGAGTAAATTTTGTCAAATCGAGAGTTGTCCCAATCATTTGACCAGCGAGAA
TATGACACTTCATAAGTTTCCACATTCATTGGATATGTGCCGCAAGTGGCAGCACAATACGCAGGTTCCTTTTGATCCAGTGTATCGTTGGCGCTATCGCATCTGCACTGCACACTTTGAACCCATCTGCTTGTCCAATATGCGTCTAATGCATGGCAGTGTGCCCACTTTACATCTAGGTCCACATGCACCGCGCCAGCTGTTTGAAAACGATTTTGAGGCAATTAATACACGTTTGGATAAACAGAAAGGCAGCTCCTCCGAGCAGTTATTTATCAAGCAGGAGAAAGAGGAGGAACCGGAATATCAATCAGACCAAGAGGAGGATATGAATTTGCTTGTGCCAGAGCTGCAATTGCATGAGAATGGACAAGATTCCGATAATCCAGTGCACTACAGTAAACCTAGCAGCTGGACggatcagcagcaacagttacgCTTACCCAGCGTTGAGGCAATCACCTATAATAGAGTCAAGTCTGGCTATGATAAGTGCTCACTGGTGCATTGCCAACGACAACGTTCCCAGCATGGAGTGCACATATACAAGTTTCCAAGATCACGTTTACTGCAGCAGCGTTGGATGCACAATTTGCGCATTAAGTACGATGAGCGACGTCCCTGGAAGACTATGATATGCAGTGGACACTTTGAGCCCAATTGCATAAGGTTACGCAAGTTACGTCCATGGGCGGTGCCCACATTGGAACTGGGAAACAATGTACCAAATCAGATATACACAAATGAACAGAGCCAGCAGCATTTTGCTATGGAACGCGGCGATTTCGACACGGGCAGTGATGAAGACAATTTCGATTGTGAAATGGAAGAAACAATGTTGGATGAGGAATTCGTtcaggaggaggaggaggaacaTGCTGATAAACCTGATGATAAACGTGATGATAAACCTGATGATAAACCTGATGATAAACATGATGATCAACCTGATGATCAACCTGATGATCAACCTGATGATCAACCTGATGATCAACCTGATGATAAACATGATGATCAACATGATGATGAACATGATGATAAACCTGATAATAATCATGATGATAAACATGATGAGTTGGTtacaacaaagacaaaaacTAAGCAAGAGCGTCGTTCAAATGATCAATTGCCGCCAGGTCAATCGCAagttttgaaaaataaatattgctgCTTGCCATATTGCCGCAATCCCCGTGGCGATGGCATCAAGCTGTTTCGTCTGCCCAACAACTCAATTTCGCTGCGCAAGTGGGAGCAAGCGACTGGCATGAGTTTTAATGAGTCACAACGCAACACAAAGCTCATTTGCAGCCGTCACTTTGATGCCAATCTAATTGGTGTGCGTCGTCTTATGTTCAATGCAGTTCCAACCCTTAATTTGCAACCAGACAAAGCTAATGCTCAACCTGTGTTACCGCCAGTTCCTCCTCCGCGATGCTTAGTGTCAGAGTGCCAAAGCACAAATGTCAAACTGCACAAATTTCCCAAAGATCCAATGCTTTTTCATCAATGGTGCCAGGCTTTAAATCTTACAAATATTCATAGATTCCGTGACAAATATATTTGTGCAGTTCATTTGCCTCCCAAAGCAATGACTTGCACAATCTGTGGAGTGGAGAATATACAATTGCCAATGCTTAATTTTCCAGAGAATCGTAATCAGCGCGCCAAATGGTGTTATAATCTTAAAATCGAATCCATTGCCAAGTGGGatgacataaaatatatttgctctAAGCATTTTGAAAGCTATTGCTTCATTGAACCGGGTAAACTGCTACCAGAGGCAGCACCTACCTTACATTTAAACCATAATGAcagaaacatatttttaaatgattatgcaacagacaacagcaaGTTGCCACGCATCAAAGACGAGCCCATAGACAGCGATGATTTGATGCTGTAG
- Protein Sequence
- MSQHNPPHQHHHHYYQEEQQQQQHHHQQQQQQQLQHKQIQQQSWYSHVASYPSHHPHVAGFSAPCKNNNNNNNNNMNAYESGAGSSHTAYYGSSAAGTAGVGYNLEGNNVAYAHNQLLQYQQQQQQQHHHHHQQQQQHHQQQQQQQQHHHLSQRSYMQHNLMHGSYPYIKSEPLELPDDRQRRQRQQQQQQQQQQHNQHHHQQQHFQNPMAPPPAPANRHPHDASGEMIIKSEPIDEHAYKSNYIDDNTPFADFSKYPEFNDDMLNPKIELTVKDEGYGNQKNPLSYPRRKLQSERSSESLPICQRCKEVFFKKQIYLRHVAESSCTIQEYDFKCNLCPMSFMSIEELHKHKHLHRTEKYFCHKYCGKHFDTIAECESHEYMEHEYDSFVCNMCSLTFANREQLYAHLPQHKYQQRYDCPICRLWYQTTLELHEHRMAAPYFCGKYYVAAHQQQPHTQQHQQQHAHQQHQHSQHHQPNYKLQDCQLGSVEMQHHKAIPTASALPATAALNSLLQQRQANADGAPMFGSTMKTESNIKLERSYSNSTSESGYSLHDSSYNNAYASDTSLHGGSGAIGGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTLPKDEKYLHQWLHNLKMFHIPASSYASFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPNCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPAVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHHDTFNIYENELWPPPTASTPTNHQQHQLQQHQLQQHQQQHAHHSKYQRQSAASTSSSASSASHYVDPDLSASYMSLNVSGGSGSALNVSDSMDICCVPSCESKRHNNENITFHTIPRRPEQMRKWCHNLKISEDKMHKGMRICSLHFESYCIGGCMRPFAVPTLQLGHDDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVALLSKWCANLQRSVPDGTKLFNDAICEVHFEDRCLRNKRLEKWAVPTLVLGHDNIAYPLPTPEQVAEFYARPTAPNNGEEQGECCVDTCKRNPSVDDIKLYRPPEESQVLAKWAHNLQLDAAQLPNMRICNLHFEAHCIGKRMRPWAIPTLNLSTTVENLYENPEHNQIFKRRTFLKERVAKSVASSSVGGIKPTWVPRCCLSHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPMVGSAQRRLCSAHFEQHVLSKKCPVPLAVPTLDLNTPLGYKIYQNPAKLKANKLCLQRVCIVESCRRQRGQGVQLFRLPHNPTQLRKWMHNIRMRPRGSMRQQYRICSMHFETHSFNGKRLSAGAIPTLELGHQDNDIYPNEAQSFVEEHCTVEGCEATKEQPDVRLFRFPTEDDDLLWKWCNNLKMNPADCIGVRICNKHFELDCIGPKHLYKWAIPTLVLGHNDVEIELVINPKPEERYVDPVFKCCVPTCGKTRRFDEVQMNSFPKNPELFQRWRHNLRLDHLNFREREKYKICNAHFEEICIGKTRLNIGSIPTLELGHDETEDIYHVNPAELQSNLFGRQRRVRSRSASIKKEETSEDDSLVPILDIERQVKTRKSLANLKCCVLTCGRSRLEHGARLFSFPNGKQQQSKWRQNLRMTAEEVDRSTRVCSAHFNRRCIDGKDLRSWAIPTQQLGYQAIYENPKNIPGFFTPTCALGHCRKRRSIDNDLRTYRYPRSEELLEKWRINLRLSPDQCRGRICADHFEPMVRGKLKLKTGAVPTLKLGHDEGIIFDNEAIKAGLQQEENEEDEDMLATRVRIKSDNKMDDNDDEADFDADPDDDADDDDEDDDMLHDQDNEHDLEHEDEKEQDQDHDQDENSYFDPLELVETYAEQHSENSAGEDALEDELEDVNGDDDELLLPDNPPI
PVKLIVPSVTPRREKAVNNVTPICCLKHCRKERTATYQLSTFGFPKDEKLLLKWSANLELSPSDCVGRVCIEHFEPEMLGTRKLKQNAVPTLNLGHTTPLRYSCNGQTLTIHDAQPQHSVFRLWSLKHCRKRKLLMVDPPDQKASKLQPKRCCSLPSCGKQPEMHDVQLHRLPSNRVMLHKWLHNLKLPSDIDITTQWICSEHFDPHLLNGGEKSLEEFVPTIKLGHNDTDLYQNPEQGNLINSHTSTSSACLVPSCPCARLNLYRCYDLPENRVVQQTWLKWLQLPMPQLPSDAQLCVMHYMELYEKVPLSKELPVAVRKQLKETYEIISKSSMAMKLRCAVPGCYSKYTDNIRLTKLPICQDTCAKWVHNTKIDYDPMRHYVYRICLLHFEPRCLGPVRPKQWAMPTLELKHKDTNIYENPKMETISLPQPEPIELPLRIKTELALSASPSASASPSPLGRQRICCIPSCSQHATVLLRLYRFPNIETTLLKWLVNTQQSPRFIDPHHLFVCQDHFEPEAICKNQLRSWAVPTLKLGHEGHIIPNAKHNGNIADSQENKLTLQFIWENYCSILSCFQQRSDQVRLYAYPKDKPTIRKWAANCKHRSMQASSDGFQVCESHFTEDCFDPETRELQEGAVPTLELSRAVNKLCCIVPGCAKDEDAPRRRFFKVPKRCGQLVDWCHNLCLDPANFNGTDAHVCDRHFEPYCFNPYKMLRPGARPTLHLGHDRDRELIPNPANWARSAEGTMVCCVPNCGRSKTIDNVELFRLPKQRSLSDRWLHNVRLDPSVLELSKLWFCSVHFEPSSMLNGRPQYNTLPTLQLGHDYYDIVHYSYNQANNKIKGYCNKNGSSYDCCYPQCTELQKSYLRISYDLPMKEELRLAWLAYMEVEELEGQQLKLCPLHLIMLYDHSLENFPEHSPEPLLDGTYKDSRNSVRIRLISCAVSGCKTLKPRDGGCLHGLPQRRDVLQMWLYNIKLDYYEHQRYMYKICSKHFEASCIIETTRRLKPWSMPTLDLPEHEADKPPPYQNPTEDEWQRMNEAWAFTQEAEKLQENAMAPGNSCMTESIVKNEPDTEDDSLAFTGADQLLEIDNSKECAGMDSHPMHFLEQQQYNEDSNSQHPLEMQALEVMLEVAHVENCATYEQMDTEADLGYADQQVHNYGATAQPGGSGNMPNSLNYYSARHCSVQGCDVTDKDVDGNIKLHKFPVSLEALKKWMHNTQVHVDTNYGWRFRICSYHFVPECFNGSRIRRGSMPTLRLGSNKPDEIYDNEFTTMQQVEQLNKEQLNCTDLDSHEEYVQDSEITMCLPEPAPPRKSSKFCQIESCPNHLTSENMTLHKFPHSLDMCRKWQHNTQVPFDPVYRWRYRICTAHFEPICLSNMRLMHGSVPTLHLGPHAPRQLFENDFEAINTRLDKQKGSSSEQLFIKQEKEEEPEYQSDQEEDMNLLVPELQLHENGQDSDNPVHYSKPSSWTDQQQQLRLPSVEAITYNRVKSGYDKCSLVHCQRQRSQHGVHIYKFPRSRLLQQRWMHNLRIKYDERRPWKTMICSGHFEPNCIRLRKLRPWAVPTLELGNNVPNQIYTNEQSQQHFAMERGDFDTGSDEDNFDCEMEETMLDEEFVQEEEEEHADKPDDKRDDKPDDKPDDKHDDQPDDQPDDQPDDQPDDQPDDKHDDQHDDEHDDKPDNNHDDKHDELVTTKTKTKQERRSNDQLPPGQSQVLKNKYCCLPYCRNPRGDGIKLFRLPNNSISLRKWEQATGMSFNESQRNTKLICSRHFDANLIGVRRLMFNAVPTLNLQPDKANAQPVLPPVPPPRCLVSECQSTNVKLHKFPKDPMLFHQWCQALNLTNIHRFRDKYICAVHLPPKAMTCTICGVENIQLPMLNFPENRNQRAKWCYNLKIESIAKWDDIKYICSKHFESYCFIEPGKLLPEAAPTLHLNHNDRNIFLNDYATDNSKLPRIKDEPIDSDDLML
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -