Oiva029347.1
Basic Information
- Insect
- Oeneis ivallda
- Gene Symbol
- -
- Assembly
- GCA_029955525.1
- Location
- JARPMR010000020.1:9230838-9232762[-]
Transcription Factor Domain
- TF Family
- HTH
- Domain
- HTH_psq domain
- PFAM
- PF05225
- TF Group
- Helix-turn-helix
- Description
- This DNA-binding motif is found in four copies in the pipsqueak protein of Drosophila melanogaster [1]. In pipsqueak this domain binds to GAGA sequence [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 1 6.1e-12 1.2e-09 39.1 0.0 1 39 10 48 10 53 0.92
Sequence Information
- Coding Sequence
- atgcCTCCCATCAAAAGGAAGACCTGGGACCACACGGCCATGATACAGGCTGTGAATGCAGTACGGAGAAAGGAAATGGGTTACTTAAGAGCTGCAAAACGATTTGGTGTTCCTAAGGGTACCCTTGAGCGCTATGTGAAGAAGGATGTCCGTGCAGAAGATCTTGTCCAGGTTCGTATGGGGAGACGACCAGCCTTACCTATTGATTTAGAAGCCGAGCTGGAAAGTTATTGCAAAGAAATGGATAGACGATTTTATGGCTTACGTCTGCAGGATATCAAATACATGGCGTTTCAATTGGCTATTAAAAACAATCTAAGGCATCCGTTCAGTGTGACTAAAGCCTCTGCTGGTAAAAAATGGCTACGTGGATTTTTGAGGAGACATCCTACATTATCAATTAGGACACCTGAAGCAGTTTCTGCCGCCAGAGTAAAAGGATTTAATCCAGTTGCGGTCGCCAACTTTTTTGATTTATATGAGCCTGAACTTGTTAAGATAAAATCCGCTCCACACAGGCTCTACAACGTCGACGAGACAGGAATAACAGTAGTTCAACATAAACGTTCCAAAGTTGTCAGTATGAAAGGCAAAAAACAGGTTGGTGCGTTGACATCCCTGGAAAGAGGAAAACTCATGACTATACCTAGTGACCTGTATGAACGCTTGTGGTACCTATACGTGCCTCCATTAATAGTTTTTCCAAGGAAAAATATGGCTCAAGAGCTAATGGATGGTGCACCAGCAGGTTCAATTGGCGATTGTCATCCCTCAGGCTGGATCCAGACACACCTATTCACAAAGTGGTTtcagcattttatccaatttacaaAACCAAGTAAAGATGATCCAATTTTATTGGTTTTAGATGGCCACTACACACATACACGAAACGTCGATGTGATTGATTTAGCTCGAGACAACAACGTGATAATTGTTTGCTTGCCGCCACATTGCACACATAAAATGCAGCCAATGGATGTGGCATTTATGAAACCACTAAAAGCTTATTACTCCCAAGAAACTGAAACTTGGCTGCGTAATAATCCGGGACGCACACTAACAAACAAATATGTGGCAAGATTGTTTGGTACGGCTTATGAAAAAGCTGCTACTATGACCAACTCCGTAAATGGTTTTCGCAAAACTGGGTTGTTTCCCTGCAATCGACATATTTTTACAGACGAAGAGTTTTCAATCTTTGATGAAGGGGATCAAGAGCAAGAGTTATTTAATGTTCAGATAAATGACGAAAACGCAAATCCGGAGAATACAAGTTGTGTGCCTCCTTCGATTGAAAACCAATCAAAGCCAACTGAAGTGTTATCCAATAGCGGAATTTCAAATGAAAATCAGAGAGAGGAAAGCGGTTTAGTAAATGAAAATCCTCCTGATGAGCCTCAAGAATTGGCATCTTCTATGGCATCATCAAGAGATAGTGTTTATGATAATCCAGTTCCTAGTACATCGTCTCATGTCTCACCTTTTGCACTTAACCCAGTGCCTAGGCTACCTAAGAACAATTCTTCTTCTAAGAAAACTCGAACTGGAGCAGCTTCTCagacaatgaaaaaaaaagctttcaaaaagaaaaaagttgAAGAAGTTAGCTCCAGTTCCGAAGACGATGACATGCCTGAGCTCGCAGATAGCAGTGGGGATGAATATGATGCTGAATGTCCCTACTGCAGTGGAAACTTTTCACAAGATACAAGAGGAGAAAAATGGGCTAAGTGTCAAGTTTGTTTTAAATGGGCTCATGAGGATTGTGGAGATGTAGCATCGAATCGTTTTTTGTGTTCTTTATGTTTAGATATGTAA
- Protein Sequence
- MPPIKRKTWDHTAMIQAVNAVRRKEMGYLRAAKRFGVPKGTLERYVKKDVRAEDLVQVRMGRRPALPIDLEAELESYCKEMDRRFYGLRLQDIKYMAFQLAIKNNLRHPFSVTKASAGKKWLRGFLRRHPTLSIRTPEAVSAARVKGFNPVAVANFFDLYEPELVKIKSAPHRLYNVDETGITVVQHKRSKVVSMKGKKQVGALTSLERGKLMTIPSDLYERLWYLYVPPLIVFPRKNMAQELMDGAPAGSIGDCHPSGWIQTHLFTKWFQHFIQFTKPSKDDPILLVLDGHYTHTRNVDVIDLARDNNVIIVCLPPHCTHKMQPMDVAFMKPLKAYYSQETETWLRNNPGRTLTNKYVARLFGTAYEKAATMTNSVNGFRKTGLFPCNRHIFTDEEFSIFDEGDQEQELFNVQINDENANPENTSCVPPSIENQSKPTEVLSNSGISNENQREESGLVNENPPDEPQELASSMASSRDSVYDNPVPSTSSHVSPFALNPVPRLPKNNSSSKKTRTGAASQTMKKKAFKKKKVEEVSSSSEDDDMPELADSSGDEYDAECPYCSGNFSQDTRGEKWAKCQVCFKWAHEDCGDVASNRFLCSLCLDM
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01091643; iTF_01348769; iTF_01348770; iTF_01348767; iTF_01348768; iTF_00998506; iTF_00932385; iTF_00290603; iTF_00290604; iTF_00290605; iTF_01338543; iTF_01338547; iTF_00290612; iTF_01338548; iTF_01338551; iTF_01363031; iTF_01363032; iTF_01338553; iTF_01363033; iTF_01363034; iTF_00288799; iTF_00290594; iTF_00998505; iTF_01338549; iTF_01338552; iTF_00383552; iTF_00383553; iTF_00783376; iTF_00281064; iTF_00121329; iTF_00121330; iTF_00121331; iTF_00041714; iTF_00984998; iTF_01264406; iTF_00924532;
- 90% Identity
- -
- 80% Identity
- -