Nio009184.1
Basic Information
- Insect
- Nymphalis io
- Gene Symbol
- Mblk-1
- Assembly
- GCA_905147045.1
- Location
- LR989920.1:2548551-2557240[+]
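The Location string above packs the contig accession, start, end, and strand into one field. A minimal sketch of splitting it apart is shown here. The `Location` type and `parse_location` helper are hypothetical names introduced for illustration, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'contig:start-end[strand]' string."""
    contig: str
    start: int
    end: int
    strand: str

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Parse a location string such as 'LR989920.1:2548551-2557240[+]'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        contig=match["contig"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("LR989920.1:2548551-2557240[+]")
print(loc.contig, loc.start, loc.end, loc.strand, loc.span)
```

Under the inclusive-coordinate assumption, the gene spans 8,690 bp on the plus strand of contig LR989920.1.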
Transcription Factor Domain
- TF Family
- HTH
- Domain
- HTH_psq domain
- PFAM
- PF05225
- TF Group
- Helix-turn-helix
- Description
- This DNA-binding motif is found in four copies in the pipsqueak protein of Drosophila melanogaster [1]. In pipsqueak, this domain binds to the GAGA sequence [1].
- Hmmscan Out
  #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
  1  2   3.7e-15   5.7e-12   45.7   0.0   1         39      73        112     73        117     0.93
  2  2   5.6e-19   8.5e-16   57.9   0.0   1         43      282       325     282       327     0.95
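The two hmmscan domain rows above can be read into typed records to compare the hits. The sketch below is illustrative only: the `DomainHit` class and its field names are hypothetical, the two rows are copied from this record, and the aligned-length calculation assumes the alignment coordinates are 1-based and inclusive.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    """Hypothetical record for one per-domain hmmscan row."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float

    @property
    def ali_length(self) -> int:
        # Assumption: alignment coordinates are 1-based and inclusive.
        return self.ali_to - self.ali_from + 1


def parse_row(row: str) -> DomainHit:
    """Split one whitespace-delimited row into a DomainHit."""
    f = row.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {row!r}")
    return DomainHit(
        int(f[0]), int(f[1]), float(f[2]), float(f[3]), float(f[4]),
        float(f[5]), int(f[6]), int(f[7]), int(f[8]), int(f[9]),
        int(f[10]), int(f[11]), float(f[12]),
    )


# Rows copied from the Hmmscan Out field of this record.
rows = [
    "1 2 3.7e-15 5.7e-12 45.7 0.0 1 39 73 112 73 117 0.93",
    "2 2 5.6e-19 8.5e-16 57.9 0.0 1 43 282 325 282 327 0.95",
]
hits = [parse_row(r) for r in rows]
best = max(hits, key=lambda h: h.score)

for h in hits:
    print(h.index, h.ali_from, h.ali_to, h.ali_length, h.score)
print("best-scoring domain:", best.index)
```

Both HTH_psq hits fall in the protein sequence listed in this record: residues 73 to 112 and 282 to 325. The second copy has the higher bit score.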
Sequence Information
- Coding Sequence
- ATGAAAGTATATAATTACAGCAGGCTATCAGAAAGCGACAAGGTAGCAGCCCCCAGCTCGTCCCCCACCGTCGCGGAACAGCCCCTGGATCTGAGTGCCAAGTCTACGTCCAGCACCAGCGGTACACCGCCTCCCGACGCCAAGAATCTGGATAACGCCAGATTAAAACGTGCAGTCCTCGAAGGAACCAGTAATAGCACTACGAGACGTGCCTACACGGAAGATGAGTTGCAATCAGCTCTGCGGGATATCCAATCTGGCCGACTGGGCACCCGCCGTGCCGCTGTCGTCTACGGCATCCCGCGCTCCACGCTTCGCAACAAGGTCAACAAGTTTGGGCTCATGTCCGACACTGGCCAGGAGTCAGACCCGGACAGCGAGCCCGATAAACCGGAGTCACCCCCGTCCGTTATCCTCAAGATACCGACATTCCCGCCCCCCGACGAGAAGAGCCCCTCACCGGCAACCCCCGTCACAACGCCGGTCACGCCTATCACGCCGATCCTACCGCCACAGCCACCGCTCAATGCAACCTCGCTGCTACTTCCACCATCGGTCTACGCTGATGCACCCTCCAGTCAGCATCTTTTTACGTCACTGAGTGACGTAATAGCGAAAAGTATCAGCCAAAAGTTCCAGCAGCCTCTCGATAGACCGCCGCAGGCCTCGGACCTGCAATTCATGCGAGCGCCCGACAGACACGTATCCGTGATAAAAACTCCACCCGATAACCAGAGGAACTACGCGATGCCAAGCAATTCCAAGGCGACGCCCAATAACAACGGGCAGCCGGCGACTGGTGGTAAGGGCACGAGGCCCAAGCGAGGAAAGTATCGCAACTACGACAGAGACAGTTTGGTGGAGGCCGTGAAAGCGGTGCAGCGAGGCGAAATGTCGGTACACCGCGCCGGCTCCTACTACGGAGTACCTCACTCTACACTGGAGTACAAGGTAAAGGAGCGACATCTCATGCGACCCAGGAAAAGGGAGCCGAAGCCGCAGCAAGACATCAAGCCGCAGCCACCGAAGCCTGCCCCGAAACCCCCGACGAAACCCTTCACGAACGGGCTCAACGGCCCCGAGAGCGGCGGCTACCCGGCCGGTTACCCGTTCTGGCCGGGTGCCGGCTTCGCACCGCCGCCGACGCCCGACCTGTATGCGTCGCACATGATGCGGCGCCTGCGCGAGGAAGCCCCGCCGCCGGCCAACGGCTCGTTCCTCGAGGGCATCATCCGCTCGAGCCTCGAGCGGCCAGGCGCGGCGCTGCTGCAGCGGCTGAGCGGTGCGCCCGCGTCTCCGCCGGGCGCGGGCGCGCTGCGGCGCCGCGGCGAGCCCGGCGACGAGCCGGCCGCGCGCCGCCCGCGCCTCGACTCGGACCACCAGCTGGCGGCCGACATGCGCGAGGCGGTGCAGCGGCTGCGCGCCGACAAGCTGCGGCCGCGCAACGGCACGCCCACGCCGCCGCCCGCGGCGTCGCCGGCGGGGGCGGGCGCGGGGGAGCGCGCCTAG
- Protein Sequence
- MKVYNYSRLSESDKVAAPSSSPTVAEQPLDLSAKSTSSTSGTPPPDAKNLDNARLKRAVLEGTSNSTTRRAYTEDELQSALRDIQSGRLGTRRAAVVYGIPRSTLRNKVNKFGLMSDTGQESDPDSEPDKPESPPSVILKIPTFPPPDEKSPSPATPVTTPVTPITPILPPQPPLNATSLLLPPSVYADAPSSQHLFTSLSDVIAKSISQKFQQPLDRPPQASDLQFMRAPDRHVSVIKTPPDNQRNYAMPSNSKATPNNNGQPATGGKGTRPKRGKYRNYDRDSLVEAVKAVQRGEMSVHRAGSYYGVPHSTLEYKVKERHLMRPRKREPKPQQDIKPQPPKPAPKPPTKPFTNGLNGPESGGYPAGYPFWPGAGFAPPPTPDLYASHMMRRLREEAPPPANGSFLEGIIRSSLERPGAALLQRLSGAPASPPGAGALRRRGEPGDEPAARRPRLDSDHQLAADMREAVQRLRADKLRPRNGTPTPPPAASPAGAGAGERA
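The coding and protein sequences above can be checked against each other by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly. The `translate` function is a hypothetical helper, and only the first seven codons of the coding sequence are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered
# by first, second, then third base over the alphabet T, C, A, G.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the Coding Sequence in this record.
prefix = translate("ATGAAAGTATATAATTACAGC")
print(prefix)
```

The result matches the first seven residues of the protein sequence listed above (MKVYNYS). Passing the full coding sequence to the same function allows the whole record to be checked in the same way.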
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00874370; iTF_00876073; iTF_00875223; iTF_00687813; iTF_01080653; iTF_00353998; iTF_00419694; iTF_00418890; iTF_00420466; iTF_00421274; iTF_00777330; iTF_00933650; iTF_01178328; iTF_00744527; iTF_00642259; iTF_00205044; iTF_01021109; iTF_01507092; iTF_01507988; iTF_01079817; iTF_01506263; iTF_00896694; iTF_00786682; iTF_00925649; iTF_00247100; iTF_00213345; iTF_00159748; iTF_00457168; iTF_00722961; iTF_00247968; iTF_01181820; iTF_00774430; iTF_00954334; iTF_00960094; iTF_00959338; iTF_00958605; iTF_01017663; iTF_01018502; iTF_01091616;
- 90% Identity
- iTF_00824631;
- 80% Identity
- -