Aeph004518.1
Basic Information
- Insect
- Acentria ephemerella
- Gene Symbol
- -
- Assembly
- GCA_943193655.1
- Location
- CALPDL010000021.1:3107832-3112867[+]
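The Location value packs the contig accession, start and end coordinates, and strand into one string. A minimal sketch of splitting it apart follows. It assumes the `contig:start-end[strand]` layout seen in this record and treats the coordinates as 1-based and inclusive, which the record does not state. The names `Location` and `parse_location` are hypothetical helpers, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Assumed layout: <contig>:<start>-<end>[<strand>], e.g. "X.1:10-20[+]"
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        contig=match["contig"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("CALPDL010000021.1:3107832-3112867[+]")
# Genomic span, assuming both coordinates are inclusive.
span = loc.end - loc.start + 1
print(loc.contig, loc.start, loc.end, loc.strand, span)
```

Under the inclusive-coordinate assumption, the locus covers 5,036 bp on the forward strand of the contig.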
Transcription Factor Domain
- TF Family
- MYB
- Domain
- Myb_DNA-binding domain
- PFAM
- PF00249
- TF Group
- Helix-turn-helix
- Description
- This family contains the DNA-binding domains from Myb proteins, as well as the SANT domain family [1].
- Hmmscan Out
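| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|----|----------|----------|-------|------|----------------|--------------|----------------|--------------|----------------|--------------|-----|
| 1 | 7 | 0.1 | 91 | 3.6 | 0.0 | 2 | 13 | 76 | 87 | 75 | 99 | 0.91 |
| 2 | 7 | 9.9 | 8.9e+03 | -2.8 | 0.0 | 14 | 31 | 215 | 239 | 211 | 244 | 0.71 |
| 3 | 7 | 0.0014 | 1.3 | 9.5 | 0.1 | 3 | 42 | 416 | 464 | 415 | 467 | 0.79 |
| 4 | 7 | 0.00024 | 0.22 | 11.9 | 0.2 | 13 | 46 | 568 | 609 | 565 | 609 | 0.76 |
| 5 | 7 | 0.0026 | 2.3 | 8.7 | 0.1 | 3 | 42 | 660 | 707 | 659 | 710 | 0.91 |
| 6 | 7 | 0.024 | 22 | 5.5 | 0.1 | 23 | 42 | 845 | 864 | 815 | 868 | 0.82 |
| 7 | 7 | 0.6 | 5.5e+02 | 1.1 | 0.1 | 2 | 13 | 1019 | 1030 | 1018 | 1046 | 0.90 |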
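The seven per-domain hits above can be loaded into typed records and ranked by independent E-value. The sketch below copies the values from this record and assumes the thirteen whitespace-separated fields appear in the order given by the header. The names `DomainHit` and `parse_hit` are hypothetical, and the cutoff of 1.0 is only an illustrative threshold, not one the record specifies.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    index: int
    of: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# Rows copied from the Hmmscan Out field of this record.
ROWS = """\
1 7 0.1 91 3.6 0.0 2 13 76 87 75 99 0.91
2 7 9.9 8.9e+03 -2.8 0.0 14 31 215 239 211 244 0.71
3 7 0.0014 1.3 9.5 0.1 3 42 416 464 415 467 0.79
4 7 0.00024 0.22 11.9 0.2 13 46 568 609 565 609 0.76
5 7 0.0026 2.3 8.7 0.1 3 42 660 707 659 710 0.91
6 7 0.024 22 5.5 0.1 23 42 845 864 815 868 0.82
7 7 0.6 5.5e+02 1.1 0.1 2 13 1019 1030 1018 1046 0.90
"""


def parse_hit(line: str) -> DomainHit:
    """Convert one whitespace-separated row into a DomainHit."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


hits = [parse_hit(line) for line in ROWS.splitlines() if line.strip()]

I_EVALUE_CUTOFF = 1.0  # illustrative threshold, not taken from the record
kept = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]
best = min(hits, key=lambda h: h.i_evalue)

for h in kept:
    print(f"domain {h.index}: ali {h.ali_from}-{h.ali_to}, i-Evalue {h.i_evalue}")
```

With this cutoff only domain 4 (alignment positions 568 to 609) is retained. A looser threshold would also admit domains 3 and 5.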
Sequence Information
- Coding Sequence
- ATGGAACAAATTGTTGTTAAAACTGAAGTGCAACCCAATGGAGAGATATTACTATTTTATGTAGATGAAAATGCAGTTGATGCAAATGGGGATCAAGTATTGCATTTAGAAGAAGCCTCTGATTATGAAATTCTTCAAGCTAATTTCCAATCTGAAATTGTTGAGAGTAAAAATGAGATAGAACATGTTCCTATTGAGATACACAATGAAGATGAGTCAGAGCATAAGAAGTGGTCCCCAGAAGAAGTAAAAAAGCTATTAATATTCTATATTGATAACAAAGAGACTTTTAACAAAACCGCAAACAAGCATCATCTATGGACTGTGGCATGTAACACAATTCTAGTAGGTAAAACAGTCCAAAGCTGTGAATGTAAACTTCGTAATTTAAAGAGTAAATACTCTCAACAACGATTAGAATTGGAAAAAGGATACAGTGTTACTTGGTCACTGTACAATTTGTGTAATCAAGCATTCCATGATGACAAATATGTTGCAGTTTTGGTAAGAGATTATGAGCAACAACAAAACATAGTAACCAAGGTATCAATTCCAAGAGCAGAGAGTAATCCAGGCCCTATAGTTGTGAGAAATATAAGCAACAAGCCTAGTGTATTAGACGACAAAGTGGAAACAATGTTGAAGTTGTACTTGAGATATAAAAAGAATGACTACACACCAAAATACTTGTGGGAGTCTATTGCTATAGAATTAGGTGATGAAAGTGCTGAATATTGGCACAAACGTTTCTTAAACTTCAAGCAGCATTATTTGCGAATGCTGCCAAAAAGAGAAATATCTGGTTCTGATAGTATTAACTGGCAATACATGGATTTGTTTGATCAAATATATGGTGATGATCCAGTCTTCCAAAATAAATATGTTAAAAATAGTAAGAATAATAATGATCAAACTATAAACAATGATTTATGTGATGCGGAAGCTGATGATTGGAATGATACAGAATATACCATCCTTGCTAAATATTACTTTGATTGCTTTGATGAGTTTCAGGATGTAACAATACCAGACCATTTCCTTTGGACTGAAGTTGGTCGATTGTTGGATAAAAAACCAGAAGTTTGTAAGATTAAGTTTTTAGAAATCAAGAACACACACTATGATAAATATTTTAATGGATCATATACACTACGGTCTCGAGTTCCTTTAGAAATTATTTTAGACAATATCATATCACAAGAGGTAGAAATAGAAATGAGCAAAGATGTGAATAAGAGGCTTGATTCATGGACACCTGATGATACTGACCAATTGGTGCAGTTTCTTTATGAAAACGTAGAAATGTTTAAAGATCCTGTCTGTTATTATGTTTGTTGGGCTTGTATTAGTAAATCTTTGAATCATACTGTAACTAGCTGCAAAAAGCAATGGGATGGTCTGAAATCTCACTACAAGTCTATGTTACATAATAAAATGGAAAATCCTGATTTGCAAATCGATTGGAGATATATAGATATTTTTGACAGAATATTTGACTTTGGCATGGCCACGGATCTGTTGAATGATTTTGTTAAACCAAAAAACAAGACAAAGTCAACTGAAAAATTTGGAGTTCAAAAAGTTAAAATAAACTTAGATAGTGACACTTATGAAAATGATACGGATGAAGAAGGATTTGATGAAAAGGGCTTTACAAAACGCACTAAAAGGGGCATGGGCGACTCCAAAGCATTCAAAATCCTTGAGTATTATCAGAAAAATAAAGAAAAGTTTTCATCTAAACAAAAGAAAGGCCTTTGGGACGTAGTAGCAAAACAGATTGGCATAAGTGCTGAACAGTGCGCACATCGCTTTAGGAACTTAAAACAAGTTTACACTACTTATGTTCAACGAGAGATCAACAAACCTGAAAAGCCCATCTTCTGGCCTTACTATGCTTTATGCAAAAAAGTCTTTGGTTACAGAGCGATAAAGTCTAAATTGAAAAATAATAAACTGGATTCGGATGATGTAGAAGATTGGACACCTAAGGAAATC
AAAACATTAATTAATTACTTTGGTTTGCATATAAATCAACTGTCAAATAGTTCTGATGCCATTAATTGGTCTGAACTGGCTAATGACCTTCGGAAATCTGAAACTGCTATTCGAGATAAATTTGATGAGTTGAATAAATCTTATAAGAAATTGAAGACAGTTAAGGATAACAATCCTGAGTATAAGGTGTCTTGGAAATACTTTAATTTGATTAACGATATCTATGAGAAATCAGCAAATGTTTACAATGAGGTTATGGAGCTTGTGGAAGATAATGAATGTTTAAATGAAGATGATGAAGATTATCAGTGCATCATTGTCATGCCAGAAGAAAGTGATGTAAATGACATGAATAATACTCATATAATAATTGAAAAGAATGACGAAACAGAATTCTTTGAAGAAACTATACAAAGCGTTCAGGTTAAACCTGAAATATTTAAATGGACTAAAGGAAGTAAAAGGAAGCTTCTAATTCTTTATCATAACTATGTCAAATCTAGAAGAGAAAATCAAATTAAACCTCGAGAAATGTGGACGGAAATAGCTTCGAAGTTAGCTGGCAAAACGCCGCTCGGTTGCAGAAAAATGTTTGCAAAACTTAAAATTAACCACTCGAAGTTGAAAGATGCTGACGACCAGAACAAATATAACACACCGTATTACACATTATTTGAAAAAATATTGAAACTGAAACCTAAGTTTGTTAAAACTAAACAGAACAATTTAAAAGATCAGAAAATTTATAAAAATGTTGAACTACCCGCAGACAAAGTAGAATTGGCCTTGCAATATTACCTTCAAAATATTGATGAGTTTGCGAGTCCAAAGTTCGAGAAAAAGTATTTATGGTCTGAACTGGCTAAATATGTATCTGAACCATTAAATAAGCTTTTTAATAAAATTAATTATTTAAAACAAAATTATGATGTTGAGACAGGAGCAGTTCCTGGAGAGATAAGCGCATTTAACGAGCTTTTGAAGGAAATTGTGACTAAAGAAAATGCTATTACAGCTGACATTTCAGAGCAGATTAGCTTAGATGAGCAGGAGGAAGTTAATTGGTCAGACGATGAAATTGAACAGCTTTTAGTGTGGTATTTGGCAAACTTGGACAAATTTAAAAATCCGAAATTTGTTCGTAAATATCTCTGGTTAGAAGTGTCTTCAATTTTGGAGAAGAGTCCTTTGGCTTGTTCAAAGAAAATGGCAGAAGTGAGAACGCAGTATAAAGCTATGATAAAAGAAAGTCCCGCTGAACTGAACGGATGGCGGTTCTACGAGCTGTGCCAGAAAATTTATGGTACTGGGAAAAAGAGTGAAGTAACTATAGTTCCGATGAATCAGACTGAATGA
- Protein Sequence
- MEQIVVKTEVQPNGEILLFYVDENAVDANGDQVLHLEEASDYEILQANFQSEIVESKNEIEHVPIEIHNEDESEHKKWSPEEVKKLLIFYIDNKETFNKTANKHHLWTVACNTILVGKTVQSCECKLRNLKSKYSQQRLELEKGYSVTWSLYNLCNQAFHDDKYVAVLVRDYEQQQNIVTKVSIPRAESNPGPIVVRNISNKPSVLDDKVETMLKLYLRYKKNDYTPKYLWESIAIELGDESAEYWHKRFLNFKQHYLRMLPKREISGSDSINWQYMDLFDQIYGDDPVFQNKYVKNSKNNNDQTINNDLCDAEADDWNDTEYTILAKYYFDCFDEFQDVTIPDHFLWTEVGRLLDKKPEVCKIKFLEIKNTHYDKYFNGSYTLRSRVPLEIILDNIISQEVEIEMSKDVNKRLDSWTPDDTDQLVQFLYENVEMFKDPVCYYVCWACISKSLNHTVTSCKKQWDGLKSHYKSMLHNKMENPDLQIDWRYIDIFDRIFDFGMATDLLNDFVKPKNKTKSTEKFGVQKVKINLDSDTYENDTDEEGFDEKGFTKRTKRGMGDSKAFKILEYYQKNKEKFSSKQKKGLWDVVAKQIGISAEQCAHRFRNLKQVYTTYVQREINKPEKPIFWPYYALCKKVFGYRAIKSKLKNNKLDSDDVEDWTPKEIKTLINYFGLHINQLSNSSDAINWSELANDLRKSETAIRDKFDELNKSYKKLKTVKDNNPEYKVSWKYFNLINDIYEKSANVYNEVMELVEDNECLNEDDEDYQCIIVMPEESDVNDMNNTHIIIEKNDETEFFEETIQSVQVKPEIFKWTKGSKRKLLILYHNYVKSRRENQIKPREMWTEIASKLAGKTPLGCRKMFAKLKINHSKLKDADDQNKYNTPYYTLFEKILKLKPKFVKTKQNNLKDQKIYKNVELPADKVELALQYYLQNIDEFASPKFEKKYLWSELAKYVSEPLNKLFNKINYLKQNYDVETGAVPGEISAFNELLKEIVTKENAITADISEQISLDEQEEVNWSDDEIEQLLVWYLANLDKFKNPKFVRKYLWLEVSSILEKSPLACSKKMAEVRTQYKAMIKESPAELNGWRFYELCQKIYGTGKKSEVTIVPMNQTE
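The coding and protein sequences above can be checked against each other by translating codons. The sketch below assumes the standard genetic code (NCBI translation table 1), which the record does not state. To stay short it checks only the first ten codons and the last six codons of the coding sequence. The function name `translate` is a hypothetical helper.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 and last 18 nucleotides of the coding sequence in this record.
cds_head = "ATGGAACAAATTGTTGTTAAAACTGAAGTG"
cds_tail = "ATGAATCAGACTGAATGA"

head = translate(cds_head)
tail = translate(cds_tail)
print(head, tail)
```

The translated head matches the start of the listed protein (`MEQIVVKTEV`), and the tail gives `MNQTE` followed by a stop codon, matching the protein's C-terminus.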
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -