Emer035152.1
Basic Information
- Insect
- Eudonia mercurella
- Gene Symbol
- -
- Assembly
- GCA_963082485.1
- Location
- OY720048.1:7565857-7571667[-]
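The Location field above packs the sequence accession, coordinates, and strand into one string. A minimal sketch of splitting it apart follows. The helper names `parse_location` and `span_length` are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the record does not state.

```python
import re

# Pattern for "seqid:start-end[strand]" as written in the Location field.
LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)

def parse_location(text):
    """Split a location string into (seqid, start, end, strand)."""
    m = LOCATION_RE.match(text.strip())
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m["seqid"], int(m["start"]), int(m["end"]), m["strand"]

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive (not stated in the record).
    return end - start + 1

seqid, start, end, strand = parse_location("OY720048.1:7565857-7571667[-]")
print(seqid, start, end, strand, span_length(start, end))
```

Under that coordinate assumption the gene spans 5,811 bp on the minus strand.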
Transcription Factor Domain
- TF Family
- MYB
- Domain
- Myb_DNA-binding domain
- PFAM
- PF00249
- TF Group
- Helix-turn-helix
- Description
- This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family [1].
- Hmmscan Out
-
 #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
 1   7       4.1   8.4e+03   -1.8   0.1         2      13        64      75        63      75  0.88
 2   7       9.2   1.9e+04   -2.9   0.0        22      31       222     233       212     235  0.71
 3   7     0.016        33    5.9   1.5         2      42       406     455       405     459  0.74
 4   7    0.0019       3.8    8.9   0.1        23      46       581     603       562     603  0.88
 5   7   2.1e-05     0.043   15.1   0.2         2      44       653     703       652     705  0.85
 6   7    0.0037       7.5    8.0   0.3        23      44       841     862       809     864  0.76
 7   7      0.42   8.4e+02    1.4   0.1         3      13      1016    1026      1014    1040  0.91
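The seven hmmscan domain hits above differ widely in significance, so it can help to rank them by independent E-value. The sketch below parses the rows and filters them. The `DomainHit` type, the helper names, and the cutoffs used in the example are illustrative assumptions, not thresholds applied by the database.

```python
from typing import NamedTuple

class DomainHit(NamedTuple):
    index: int       # domain number ("#" column)
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float     # bit score
    ali_from: int    # alignment start on the protein
    ali_to: int      # alignment end on the protein

# Rows copied from the Hmmscan Out table (13 whitespace-separated columns each).
ROWS = """\
1 7 4.1 8.4e+03 -1.8 0.1 2 13 64 75 63 75 0.88
2 7 9.2 1.9e+04 -2.9 0.0 22 31 222 233 212 235 0.71
3 7 0.016 33 5.9 1.5 2 42 406 455 405 459 0.74
4 7 0.0019 3.8 8.9 0.1 23 46 581 603 562 603 0.88
5 7 2.1e-05 0.043 15.1 0.2 2 44 653 703 652 705 0.85
6 7 0.0037 7.5 8.0 0.3 23 44 841 862 809 864 0.76
7 7 0.42 8.4e+02 1.4 0.1 3 13 1016 1026 1014 1040 0.91
"""

def parse_hits(text):
    """Turn table rows into DomainHit records, skipping malformed lines."""
    hits = []
    for line in text.splitlines():
        f = line.split()
        if len(f) != 13:
            continue
        hits.append(DomainHit(int(f[0]), float(f[2]), float(f[3]),
                              float(f[4]), int(f[8]), int(f[9])))
    return hits

def filter_hits(hits, max_i_evalue):
    """Keep hits whose independent E-value is at or below the cutoff."""
    return [h for h in hits if h.i_evalue <= max_i_evalue]

hits = parse_hits(ROWS)
best = min(hits, key=lambda h: h.i_evalue)
print(best.index, best.ali_from, best.ali_to)
```

The lowest independent E-value belongs to domain 5 (protein residues 653 to 703); whether that is significant enough depends on the cutoff chosen.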
Sequence Information
- Coding Sequence
- ATGGAACAAATCATTGTCAAAACGGAGGTTCAACCCAATGGGGAAATATTACTTTACTATGTTGATGaaaaTGTTGTTGATTCTAATGGAGATCAAGTATTTCAACTTGAAGAGGGTACAGACTATGAGATAATGGAGACTACTGTGCCACCAGATGATCCAGTCAGTGAGCCTCAAGAGGCAGGGAATGAGAAATGGTCAGATGATGACATTCAAAAActccttattttctttttagacAACAAAGCCTCTTTTAGCAGCAGTATGACTAGAAAGGAGCATCTATGGACAGTAGCCTGTAAAACTATGCTTATTGGCAAAGATGTGAAGGCATGTGAATCCAAGTTGCGCAGCctgaaaaaaaagtatgttcAACAGCATTTGGAACTACAGAAAGGTTATAACGTTACTTGGCTTTTCTATGATTTAAGCCATCAGGCTTTTCAAGATGACAGATACGTTGAAACATTACTGAAGGACTATGAGCAACAGCAACAGACGGTGACTAAGATAATGTTACCGCAAACAAATTTGGAAACAGACAGCAAAGTCATTGTTGTTAAAAATGATACCATTAATAAAATACCATCTAATGACAAAGTTGAAACGATGCTTAACTTGTATTTAAGATATATGAAGAATGGACAgagcaatatgtcatcaaaaaGTATATGGGCATCCATTGCTATGGAACTTGGTGAAGAGGGTTCTGAATATTGGAGCAAACGTTTCATGAACTTTAAGCAGCATTACTTGCGAATGCTGGCAAAAAAAGCAGTTGATGGCCCCGCTAGTATCAACTGGCCCTACATGGATCTATTTGACCAAATATACAGCGATGATCCTGTGTTCCAGAGTAAATATGCTGGTGTGACAGTAGATGAATCCAGTATAACCATAGAAGTCCCTTCTTACGGAGATGATTATTGGAATGACACAGAGTTGCTAATTCTAGTCAAGTACTATTTCGATTGCTTCCACGAGTTCCAAGATACCACTATTCCTACCAACTTTCTTTGGACGGAAGTTGGAAGATTGTTGGATAAGAAACCGGAGCTTTGCAAGAAAAAATATGAAGAGCTCAAGGAAGCTCATTATGACCAGTACTTCACTGGTACTTATACATTACAGAGGCGTATTCCGTTGGAGATAAtttttgataatataatatctaaagaGGTGGAGATTGAAATGAGCCAGCATAAGAGAAGTAGTGGGAAATGGAGCACGGAAGAGCTGGACGAACTAGTTAAGTTCTTTCACGAGAACTTGGAATTGTTTAAGGATCCAGTTTGCTACTTTGTTTGCTGGTGTTGCATAAGTAGATCGCTTAACAGAAGTATATGTAGCTGCAAGACGCAGTGGGACGAATTGAAATTGATGTACAAATCTatacttaacaataaaatgGAGAATCCTGACTTGCAAATAGATTGGCCTTACATGGATTTGTTTGATAGGATATTTGACTACGGGATGAACACGAAGTTGCTTGACGACTTTGAGAAATCGAAAGAAATACAAGTTAAAAACTCACAGAAAGTAGGAgttaaaaaagtcAACATCAAATTGGAGAACGAATCGTACGAAAACGACACAGACGACGAGGAAATAGACGAGAAAGGCTTCATGAACAGCACAAAAAGAGGCTTTGGTAACTCCAAAGCATTCAAAATCCTCGTTTATTACCAGAAGCATAAGCACATGTTTTCAACAACACAAAAGAAGAAGCAAGCGTTATGGGACGTACTAGCAAAAAAGTTGGGCGTTACCGGTGAGCAGTGCGCCCATAGATTCAGGAATCTGAAACAAGTTTACAGAGAGTATGTCCTAAGGGAAATAAACAAACCGGACAAACCAATCATCTGGCCGTACTACGCTCTATGCAAAAAGGTGTTCGGCTACAGAGCTTTCAAGTCCAAATTGAAGAGCAACAAAGGGGATTTCGCTGACACAGAAGAATGGACTCCCAAAGAAATCAAACAGCTGATCAGTTAC
TTCGGTCATCATTTCAACGAACTATCTAATAATGACGACGTCAGCAAATGGAGCGAGTTAGCTCAGGATCTTGGTAAGACTGAGCGTGCTGTTAAAGAACGTTTTATTGAATTGAGAAAGTCTTATAGAAGGCTGAAGACTGTGAAAGAGAATAATCCAAAATACAAAGTGAATTGGAAGTATTTTAGTATGTTTAGTGATATTTATGAAGGATCTAAAAATGGGGATAGCGGGGCAATGGATGTTGATGAGCCCACAAACTATGAAGCGCTGTCGGATGAGAGAAATGAAGATGAAGAAGATTATCAATGCATCATAGTTATACCAGAAGGCGATGACATAAATGAGGCACAATTCATCATTCAAAACAAAGGTGATACGCAAATTGAAGAACCATCGGAAGCCATTGAAACAAAACCAGTTCCTACTAAATGGACTAAGAAGAGTAAGAGACGTTTGCTGATTCTCTACCACAACTACCTTAAAGCAAGAAAAGGACAAGAGATCAGCCCCAAAGAGATGTGGACAGAAATCGCTTCGAAATTAACTTTGAAAACCCCATTGCAATGCAGAAAAATGTATGCAAAACTCAAAACTAATCATCTGAAATCGAAAAGTTTAGACGATTCGAACAAGAAAAAGACTCCTTACTACACACTGCTTGAGAAAATACTGAATTTGAAACCGAAATTTCCCAAAAAACCTAAGAAAAAGCTTAGTGAAGGCAAGCCGTACAAAGATGTTCTACTACCCGCAAATAAAGTAGAATTGGCTTTACAATACTACCTTCAACACGTCGACGAGTTTGCGAGCCCCAAGTTTGAGAAAAAGTATTTGTGGACGGAACTAGCCAACTTCGTGTCAGAACCAGTCAATAAACTGTTTAATAAGATCaattatttgaaacaaaacTATAGCGTTGAAACAGACGAAGTTGCCGGGGAGAAAACGCTTTTCAGTGAATTGTTAAGGGAAATTCTGTCTAAAGAGAATGCTGTTAGAGCTGAAATTACAGAAACACCAACGCTAGACGAGATTCAGGAAGCTTCCTGGACGGACGACGAAATTGAGCAACTCCTAGTTTGGTATTTGGCCAACCTGGACAAATTTAAAAACCCGAAATTCGTTCGCAAATACCTTTGGTTGGAGGCATCCTCAATTCTGGAGAAGAGTCCCCTGACTTGCTCCAAGAAAATGACCGAAATTAGGACTCAGTATAAAACTATGATTAAGGAAAACCCGAATGATTTGAACCAGTGGAGGTTTTATGAATTGTGCCAGAAAATATATGGCACTGGAAAGAGGAATGAACTGCCCACGGTTACTCAAGCAATGAATGAGCCGGCCATTTGA
- Protein Sequence
- MEQIIVKTEVQPNGEILLYYVDENVVDSNGDQVFQLEEGTDYEIMETTVPPDDPVSEPQEAGNEKWSDDDIQKLLIFFLDNKASFSSSMTRKEHLWTVACKTMLIGKDVKACESKLRSLKKKYVQQHLELQKGYNVTWLFYDLSHQAFQDDRYVETLLKDYEQQQQTVTKIMLPQTNLETDSKVIVVKNDTINKIPSNDKVETMLNLYLRYMKNGQSNMSSKSIWASIAMELGEEGSEYWSKRFMNFKQHYLRMLAKKAVDGPASINWPYMDLFDQIYSDDPVFQSKYAGVTVDESSITIEVPSYGDDYWNDTELLILVKYYFDCFHEFQDTTIPTNFLWTEVGRLLDKKPELCKKKYEELKEAHYDQYFTGTYTLQRRIPLEIIFDNIISKEVEIEMSQHKRSSGKWSTEELDELVKFFHENLELFKDPVCYFVCWCCISRSLNRSICSCKTQWDELKLMYKSILNNKMENPDLQIDWPYMDLFDRIFDYGMNTKLLDDFEKSKEIQVKNSQKVGVKKVNIKLENESYENDTDDEEIDEKGFMNSTKRGFGNSKAFKILVYYQKHKHMFSTTQKKKQALWDVLAKKLGVTGEQCAHRFRNLKQVYREYVLREINKPDKPIIWPYYALCKKVFGYRAFKSKLKSNKGDFADTEEWTPKEIKQLISYFGHHFNELSNNDDVSKWSELAQDLGKTERAVKERFIELRKSYRRLKTVKENNPKYKVNWKYFSMFSDIYEGSKNGDSGAMDVDEPTNYEALSDERNEDEEDYQCIIVIPEGDDINEAQFIIQNKGDTQIEEPSEAIETKPVPTKWTKKSKRRLLILYHNYLKARKGQEISPKEMWTEIASKLTLKTPLQCRKMYAKLKTNHLKSKSLDDSNKKKTPYYTLLEKILNLKPKFPKKPKKKLSEGKPYKDVLLPANKVELALQYYLQHVDEFASPKFEKKYLWTELANFVSEPVNKLFNKINYLKQNYSVETDEVAGEKTLFSELLREILSKENAVRAEITETPTLDEIQEASWTDDEIEQLLVWYLANLDKFKNPKFVRKYLWLEASSILEKSPLTCSKKMTEIRTQYKTMIKENPNDLNQWRFYELCQKIYGTGKRNELPTVTQAMNEPAI
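The coding and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) and treats lowercase (soft-masked) bases like uppercase ones. `translate` is a hypothetical helper, and only short excerpts from the start and end of the CDS are shown.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = cds.upper()  # soft-masked lowercase bases are treated as normal bases
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an ambiguous codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 24 and last 18 nucleotides of the coding sequence in this record.
print(translate("ATGGAACAAATCATTGTCAAAACG"))
print(translate("AATGAGCCGGCCATTTGA"))
```

Passing the full coding sequence, with its two lines joined, to `translate` should reproduce the protein sequence if the standard-code assumption holds.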
Similar Transcription Factors
Sequences clustered by similarity with MMseqs2 at the identity thresholds listed below.
- 100% Identity
- iTF_00681603; iTF_00682156; iTF_00683278;
- 90% Identity
- iTF_00681603; iTF_00682156; iTF_00683278;
- 80% Identity
- iTF_00682156;