Csas004888.1
Basic Information
- Insect
- Carposina sasakii
- Gene Symbol
- -
- Assembly
- GCA_014607495.2
- Location
- CP053172.1:551914-553788[-]
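The Location value packs the sequence accession, start and end coordinates, and strand into a single string. A minimal sketch of splitting it apart is shown here. It assumes the `accession:start-end[strand]` layout seen in this record and treats the coordinates as 1-based and inclusive; the helper name `parse_location` is illustrative and not part of any database API.

```python
import re

# Assumed layout: accession:start-end[strand], as seen in this record.
LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)

def parse_location(text):
    """Split a location string into accession, coordinates, and strand."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    return {
        "seqid": match["seqid"],
        "start": start,
        "end": end,
        "strand": match["strand"],
        # Assumption: both coordinates are inclusive, so the span is end - start + 1.
        "span": end - start + 1,
    }

loc = parse_location("CP053172.1:551914-553788[-]")
print(loc)
```

Under the inclusive-coordinate assumption, the locus spans 1,875 bp on the minus strand of CP053172.1.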
Transcription Factor Domain
- TF Family
- HTH
- Domain
- HTH_psq domain
- PFAM
- PF05225
- TF Group
- Helix-turn-helix
- Description
- This DNA-binding motif is found in four copies in the pipsqueak protein of Drosophila melanogaster [1]. In pipsqueak, this domain binds to the GAGA sequence [1].
- Hmmscan Out
-
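| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 1 | 1 | 8.8e-09 | 1.5e-06 | 28.5 | 0.0 | 1 | 41 | 14 | 54 | 14 | 57 | 0.89 |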
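The hmmscan row above can be turned into named fields and used to cut the aligned HTH_psq region out of the protein. The sketch below assumes the values follow the column order of the header shown in this record and that the alignment coordinates are 1-based and inclusive. The field names and the helper `parse_domain_row` are illustrative, and only the first 60 residues of the record's protein sequence are included to keep the example short.

```python
# Field names are illustrative; the order follows the header shown in this record.
FIELDS = [
    "domain_index", "domain_count", "c_evalue", "i_evalue", "score", "bias",
    "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to", "acc",
]
INT_FIELDS = {
    "domain_index", "domain_count",
    "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to",
}

def parse_domain_row(row):
    """Parse one whitespace-separated hmmscan domain row into a dict."""
    values = row.split()
    if len(values) != len(FIELDS):
        raise ValueError(f"expected {len(FIELDS)} columns, got {len(values)}")
    return {
        name: int(value) if name in INT_FIELDS else float(value)
        for name, value in zip(FIELDS, values)
    }

hit = parse_domain_row("1 1 8.8e-09 1.5e-06 28.5 0.0 1 41 14 54 14 57 0.89")

# First 60 residues of the protein sequence from this record.
protein_prefix = "MSESKPRKRGSTWDEDSLRRAMEAVQNNQLNTNAAALKFNIPRRTLRNHLVSGNSTKIIG"

# Assumption: ali coordinates are 1-based and inclusive, so shift the start by one.
domain = protein_prefix[hit["ali_from"] - 1 : hit["ali_to"]]
print(len(domain), domain)
```

With these assumptions the aligned region covers residues 14 to 54 of the protein, a stretch of 41 residues that matches the full length of the profile (hmm coordinates 1 to 41).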
Sequence Information
- Coding Sequence
- ATGTCTGAGTCAAAACCCAGGAAAAGAGGTAGTACATGGGATGAAGACAGTTTAAGACGTGCCATGGAAGCAGTACAAAATAATCAACTTAATACAAATGCTGCAGCGTTGAAGTTCAATATTCCAAGACGCACCCTTAGAAACCATTTAGTAAGCGGTAACAGTACTAAAATAATTGGGAAGACAACTATTCTTACAAAACAATTAGAAGAAGATTTATCGCAGCGAATCAAACGTTTTGCAAAACTGGGTGTACCACTTACGCCTAAATTTATCAGGAAACAAGCATTTTTATTTTGTGAGAGGTTTAAAATCAAGCACTCGTTTAATATGTCAACAAGAATGGCTGGACGCAAATGGCTCAAAATGTTTTTGGTTCGAAACCCATCAATATCAAAAAGAAAACCCCAGCTAATGAATCCAGCTAGAGCTCAAAAAATGAATAAGCCCATTGTGAAAAACCATTTTGAAGAAGTGAAGAAACTCTATGAAGAGCTAGACATTATTGCACATCCAGAACGCTTGTATAACATGGATGAAAAAGGGTGTCGCATTACGGTCCACAAACAAACTGTAGTTTTGGCTGAAAAAGGGAATACAAGAGTACATCTTGTTGCTCCGGAACACGCCGAGAACGTTACAATAGCGATGTGCGTAAATGCGGTGGGTATTGCTATACCACCTATGATTATTTTTAAGGGGATAAGGCACATATCAGAATTAGCATCAAATTTGCCACCTGGAACAAAGGTAAGCATGGCACCCAAAGGCAGCATGACCAGTAGCTTATTCGTAGAGTTCATACAACATTTAGCACAACATAAAGTACCCGGTAAATGTCTGCTTATCTTTGATGGCGCAAAGTGTCATTTATCATTTGAGTCTCTGGAAGTAGCAGATAAAAATAATATAGTTCTATACTGCTTACCTTCGAACACCACGCACGAACTCCAACCTCTAGATAAATCAGTGAATCGGTCCTTCGAACATCACTGGGATGAAGAGGTTTTGAACTACCTGTGCAATTCTCAAGAGAGAACTTTAAACAAAGCTGCGTTTAACAAAATTTTCTCCCGAACATGGCCAAAGTGTATGACTCAAACTAATATAACGAATGGATTCAAAGCCACTGGTTTATACCCCCTTGATCCTGATGTAATACCCGAAGATGCTTATGCTCCTTCAATTGTAACTGAAAGACCTTTATCGGAAACGTCACTACACCAGATCTACCAACCACTTATATCTGTTCAAGTCTCGCCCCCTGTGTCTGAATTTAAAAAAGTGTCAGAATTGTCGTTTCAGTCTCTAATAGATGAAAGGAGGTCAGCTCATGTTTCTAACGTTTCTTCAATTTTTCCATCAACTTCCGAAGAGAAGATAGTATCGACATCGCCAAAATTTTCAGCTGCAACAGCAAAAAGAAAACCCGTTCTTGTTTCTTACAGTTCTTCAACTGATGCTTCAGACTTAGAAGTAGATATAACGGTCAATGATCAGTGCCGCCCCATGCTGCTAAGTTCAGTACATCGTTTCAACGTCGCAGATACGAAACATGACCCTCAACCATCTCTGTACGATTATCCTCTTCCGAGTACGTCCGGCCTACAAGAAAATATCGTCTGTTCTAGCTCTGAATCTGATCTTGATTTGAATCTTAGTGAATCATTTATCAAAGATCACTACATGACTAAACTTCAAGTCTCGGATTTATACACTTCTTCGTCATTTGATGATGACCTAGATCAGAAAAAAGAAATCACTCCGAAAAAACGATTAATACAAAACAAAAGCTCAATTAAAATAAAAGCTCAACTAAAGAGTCATGAGACTAGATCTTCGGATGATGACCAGCCAGAATCTAATCGAGGATAA
- Protein Sequence
- MSESKPRKRGSTWDEDSLRRAMEAVQNNQLNTNAAALKFNIPRRTLRNHLVSGNSTKIIGKTTILTKQLEEDLSQRIKRFAKLGVPLTPKFIRKQAFLFCERFKIKHSFNMSTRMAGRKWLKMFLVRNPSISKRKPQLMNPARAQKMNKPIVKNHFEEVKKLYEELDIIAHPERLYNMDEKGCRITVHKQTVVLAEKGNTRVHLVAPEHAENVTIAMCVNAVGIAIPPMIIFKGIRHISELASNLPPGTKVSMAPKGSMTSSLFVEFIQHLAQHKVPGKCLLIFDGAKCHLSFESLEVADKNNIVLYCLPSNTTHELQPLDKSVNRSFEHHWDEEVLNYLCNSQERTLNKAAFNKIFSRTWPKCMTQTNITNGFKATGLYPLDPDVIPEDAYAPSIVTERPLSETSLHQIYQPLISVQVSPPVSEFKKVSELSFQSLIDERRSAHVSNVSSIFPSTSEEKIVSTSPKFSAATAKRKPVLVSYSSSTDASDLEVDITVNDQCRPMLLSSVHRFNVADTKHDPQPSLYDYPLPSTSGLQENIVCSSSESDLDLNLSESFIKDHYMTKLQVSDLYTSSSFDDDLDQKKEITPKKRLIQNKSSIKIKAQLKSHETRSSDDDQPESNRG
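The coding and protein sequences above can be cross-checked by translating the coding sequence. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) applies to this gene; the helper name `translate` is illustrative. To keep the example short it translates only the first 60 and the last 12 nucleotides of the coding sequence from this record.

```python
from itertools import product

# Standard genetic code (NCBI table 1), listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds, stop_at_stop=True):
    """Translate a coding sequence in frame 1; optionally halt at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*" and stop_at_stop:
            break
        residues.append(aa)
    return "".join(residues)

# First 60 and last 12 nucleotides of the coding sequence from this record.
cds_head = "ATGTCTGAGTCAAAACCCAGGAAAAGAGGTAGTACATGGGATGAAGACAGTTTAAGACGT"
cds_tail = "AATCGAGGATAA"

print(translate(cds_head))  # MSESKPRKRGSTWDEDSLRR
print(translate(cds_tail, stop_at_stop=False))  # NRG*
```

The translated head and tail agree with the start (MSESKPRKRG...) and end (...ESNRG) of the protein sequence listed above, with TAA as the terminating stop codon.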
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01538691; iTF_01538692; iTF_00208804; iTF_00801765; iTF_00726283; iTF_00726284; iTF_00704924; iTF_00661217; iTF_00786702; iTF_00662167; iTF_00661215; iTF_00302028; iTF_00850609; iTF_01081524; iTF_01375707; iTF_00162614; iTF_00208803; iTF_00162617; iTF_00276281; iTF_00276284;
- 90% Identity
- iTF_00425315; iTF_00208804; iTF_01424996; iTF_00801765; iTF_00425318; iTF_01338635; iTF_00704911; iTF_00850609; iTF_00281141; iTF_00428951; iTF_00428949; iTF_00428950; iTF_00662167; iTF_00162617; iTF_00276281; iTF_00276284; iTF_00704924; iTF_00661217; iTF_00647933; iTF_00661215;
- 80% Identity
- -