Pmne044683.1
Basic Information
- Insect
- Parnassius mnemosyne
- Gene Symbol
- clz9
- Assembly
- GCA_963668995.1
- Location
- CAVLGL010000093.1:9908777-9911545[+]
Transcription Factor Domain
- TF Family
- HTH
- Domain
- HTH_psq domain
- PFAM
- PF05225
- TF Group
- Helix-turn-helix
- Description
- This DNA-binding motif is found in four copies in the pipsqueak protein of Drosophila melanogaster [1]. In pipsqueak this domain binds to GAGA sequence [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 1 3.1e-09 1e-06 30.2 0.0 3 40 17 55 15 59 0.93
Sequence Information
- Coding Sequence
- ATGCCgcgaaaatatataagaaaatctGATAGAAGCAGATATGATTATGATAAATTAACTGAGGCTGTCAATGCAGTAAAAAATGGAACGATATCTGCATACGCCGCATCAAAACAATATAATATCCCAAGAACAACAATTGTCAACCGTGTATACGACCGCAAAGGTTTAAAATCCAAAACTTTAGGTAGGTGTACTGCATTACCCAGGGATGTGGAAGAAACGCTTGCGGAAAATCTCCACGTAATGGAAAAATCAGGGTTTGGGTTAACGCGCAAGGAAGTGATTGAATTGGTTGGCCAATACGTGGTTAAGAATAATATTAAAACCCCGTTTAAAGATGGAATCCCTGGTGAGGACTGGTTTATTGCTTTTAAAAATAGACATGGCCTTTCTGTTAAAAAACCTCAGGCCGTTGAACACGCGCGTAGAACTGCATGCAGACCTGCCGTGATCTACGGTTACTTTGATTTATTAGAGAAAACTATTAATGAACTGGGCTTACGTGATAAACCATCACATATTTGGAATCTCGACGAGACCAGCTTCAGTAAGGACCCGAATAAATCAAATGTTGTTGGTCGTCGAGGCTATACGTCTACAAGAACCATCGCCTCTCCAGGGAAAACGAATACAACTGTGCTTTTAGCATGCAATGCCGCTGGTGACAAAGCTCCGCCGCTAATTATATACACAGGAAAAAATATCTGGAACGAATGGGTTTCTCAAGATGGGTTTCCTGGAACCGTTTATGCTGCGACTGAAAAAGGTTGGATGGAGGCGACAGTATTCGAAAATTTTTTCGAAAAAGTTTTTTTGCCTACCGTTGAAAATAAACGCCCTATTCTCTTGCTTTATGATGGTCATTCCACTCATGTGGgaattaatataatacaaaagGCACGAGAAAATAACGTAACGATTTTAAAAATCCCACCTCATACGAGTCATCTATTGCAGCCTCTGGATCTAGCGGTGAATAAATCGTTCAAAGACAAATGGGACATAGCGTTGGTCAAATGGCAAAGACTGAACGTCGGTAAGGTTTTACCCAAAAAAGAGTTTTCAACGATTCTTGGCCAAGTGTGGACGCAGATAGATTCCGAAGTATGTAAAGCAGGCTTCCGTAAAGCAGGGATATacccaataaataaatatgtcatagCTGAACACAAGTTTGACCCAGTTCAGTTAGCAGAATGGAAGAATGCGAAACAgctagaaaaatcacaaaatcaaCTTGTTAAACAAATATCCCCCAAAAAATTGAGTGAAATCGCACTGACTGttgtaaatgaaatatatataatacagcaGGAAGTTCCTCAGTGTGATTCAATAGAAGCGTCACCAAATTACGTTTTACACAGCGAAGAAAAAGCTGTTCCATTGCAAAATATTGAAAATGTTCAACAGGGTCCTAAAGTCCAAATTCTTGACAATCAGCGTGTATATACCAATATCACATTTGAAGAGTTGCTTCTGAAAATGATTAAGCCTGGAAAACCATCTTTGACAACCAAAAGAACAAAAATAGCAACAGGCGCAGAGGTCATCACTCATGATCATGTGTTCGAAAGATATCAAAAAATGGAagcagaaaaaaaagaaaagttgaaAAAGCAAGaagaaaagaaaacaagtaaagaaaaaaataaatcaacGGAAAAAAGAGCGAGCACAAATAATAAGACTGGAGCAGAAAAAAATAACGAGGCGAAAAGGTGTGAGAACGGGGGTAATTATAAGCAGACTGATATACAGGTTTCTCGAGTGGATATGTCATTACCAGGTCCATCAGGTCTTAACAAAAAGAGGAAAATTAAAAGTGAGTTAAAAAAGAGTAAAACCAtaaattctacaaaaaaaatagtTCGACGAAATAAAAAACTTAGAAAAATATCATCATCCTCATCTACCAGCATATCTGATTGCATGAGCGTGCATGAGGATTCTGATATTATAGACGTGACCGATGAAGAATATGATATAACAAGAGATTTATACACCAACGAATTTATACAAAATTATGAACCCTGTAATGAAGAAGAAGATAAAATGTTTAGCATAGACAAGAAAAGCAATGTGGTGAAGAAATTTGGTGAGAATCCGGACAAGCAAAGCAAGGAGACAGTGAATTTGACTGAGGAATCAGATGAAACTAATATGGTAGCAGGAGAATTTAATGGCGAATTGGATGGGCTGACAGAAGTGGAAAAGGAAGAGCATTGTGCATTTAATGACCAAGCGGTTACTGAGGGGAACAAGGTGATACATGAAATCAGTGGGATATCGGATGAGAGAAACGATATTGTTGAAAAAGTTTGTGATAGTTCGAATGAGAGAAATCAGGCGGCAAAAGAAGTTAGTGAGAGATCGCATCAAAGTAACAACAATACAACAGCAAAAGAAGGAAATGAGAAATGGGATGTGAAGAAAAATATAGAAGAGAAAgagaattataaaataaatgacaatGTCTTGATTCGATACTTTATCAGGAAAAAGTGGGTGTATTATGTAGGTTACATAGAAAATATACTTTCTGGAAGCGATACTAGTTATAATATAAACTTTTACAAAACTTACAAAAAGCCTCTGAAATTTAAACTCGCTAAACAAGTAGATCGTGATACCGTTTTAGAATATTCTATAGTCAAGAAAGTTCACctgcatagactaatatataatactggagataGACTGGCAACTGAAGTTTTCATATTTAGTTACCGTTCAGCGCCCTCTGAACGCGAAGCAATGAACTAA
- Protein Sequence
- MPRKYIRKSDRSRYDYDKLTEAVNAVKNGTISAYAASKQYNIPRTTIVNRVYDRKGLKSKTLGRCTALPRDVEETLAENLHVMEKSGFGLTRKEVIELVGQYVVKNNIKTPFKDGIPGEDWFIAFKNRHGLSVKKPQAVEHARRTACRPAVIYGYFDLLEKTINELGLRDKPSHIWNLDETSFSKDPNKSNVVGRRGYTSTRTIASPGKTNTTVLLACNAAGDKAPPLIIYTGKNIWNEWVSQDGFPGTVYAATEKGWMEATVFENFFEKVFLPTVENKRPILLLYDGHSTHVGINIIQKARENNVTILKIPPHTSHLLQPLDLAVNKSFKDKWDIALVKWQRLNVGKVLPKKEFSTILGQVWTQIDSEVCKAGFRKAGIYPINKYVIAEHKFDPVQLAEWKNAKQLEKSQNQLVKQISPKKLSEIALTVVNEIYIIQQEVPQCDSIEASPNYVLHSEEKAVPLQNIENVQQGPKVQILDNQRVYTNITFEELLLKMIKPGKPSLTTKRTKIATGAEVITHDHVFERYQKMEAEKKEKLKKQEEKKTSKEKNKSTEKRASTNNKTGAEKNNEAKRCENGGNYKQTDIQVSRVDMSLPGPSGLNKKRKIKSELKKSKTINSTKKIVRRNKKLRKISSSSSTSISDCMSVHEDSDIIDVTDEEYDITRDLYTNEFIQNYEPCNEEEDKMFSIDKKSNVVKKFGENPDKQSKETVNLTEESDETNMVAGEFNGELDGLTEVEKEEHCAFNDQAVTEGNKVIHEISGISDERNDIVEKVCDSSNERNQAAKEVSERSHQSNNNTTAKEGNEKWDVKKNIEEKENYKINDNVLIRYFIRKKWVYYVGYIENILSGSDTSYNINFYKTYKKPLKFKLAKQVDRDTVLEYSIVKKVHLHRLIYNTGDRLATEVFIFSYRSAPSEREAMN
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01156260; iTF_00634285; iTF_00634286; iTF_00634287; iTF_00634290; iTF_00634293; iTF_01155029; iTF_01155032; iTF_01155033; iTF_00114874; iTF_00677515; iTF_00677512; iTF_01230479; iTF_01230478; iTF_01155024; iTF_01155028; iTF_00373909; iTF_00036569; iTF_00636250; iTF_01439836; iTF_00357376; iTF_00448971; iTF_00906980; iTF_00147274; iTF_00147276; iTF_00036568; iTF_01439865; iTF_00036570; iTF_00036571; iTF_00036572; iTF_00036573; iTF_01439837; iTF_01439838; iTF_01439839; iTF_00114875;
- 90% Identity
- iTF_01156260;
- 80% Identity
- -