Cnem000434.1
Basic Information
- Insect
- Choreutis nemorana
- Gene Symbol
- -
- Assembly
- GCA_949316135.1
- Location
- OX438595.1:4646414-4658088[+]
Transcription Factor Domain
- TF Family
- MYB
- Domain
- Myb_DNA-binding domain
- PFAM
- PF00249
- TF Group
- Helix-turn-helix
- Description
- This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 12 0.58 6.5e+02 0.9 0.0 5 20 17 32 17 34 0.95 2 12 4.6 5.2e+03 -2.0 0.0 5 28 714 739 714 740 0.82 3 12 0.82 9.2e+02 0.5 0.0 5 20 983 998 983 1000 0.93 4 12 0.55 6.2e+02 1.0 0.0 5 20 1118 1133 1118 1135 0.94 5 12 0.55 6.2e+02 1.0 0.0 5 20 1253 1268 1253 1270 0.94 6 12 0.55 6.2e+02 1.0 0.0 5 20 1388 1403 1388 1405 0.94 7 12 0.55 6.2e+02 1.0 0.0 5 20 1523 1538 1523 1540 0.94 8 12 0.55 6.2e+02 1.0 0.0 5 20 1658 1673 1658 1675 0.94 9 12 0.55 6.2e+02 1.0 0.0 5 20 1793 1808 1793 1810 0.94 10 12 0.55 6.2e+02 1.0 0.0 5 20 1928 1943 1928 1945 0.94 11 12 0.55 6.2e+02 1.0 0.0 5 20 2063 2078 2063 2080 0.94 12 12 4.4 5e+03 -1.9 0.0 5 20 2198 2213 2198 2215 0.90
Sequence Information
- Coding Sequence
- ATGAAGAAAGCAAAAGCTGAGGGTCTTATTACTCCATTGGAAGGAAAAACAAAAGaacagaaagaaaaaataataaaagggtTAGCAATGCACGGCTTACCACTGCCAGAAGGAAAAACGGCGTCAGAAAAGAAATTGATTGACAAAGTAATTCGCGACCTTGGCCTTCCTCCAGAGCCAAAAACGGATTCTCAAAAAAAGAAGTTCAAAGAAGCACAAGCAGCAGGACTTATTGTTCCTCTTGAAGGAAAATCGGCAGCGgaaaaagaaaagattttGCAAAAGCAAGCAGAGCTAGGACTCGAATTGCCAGAAGGTCGAACTCCTTCTGAGAAATCGTTAATCGCAAAAATAAAGGCCCAATCCAAAGGTAAACCACCTTCAACCATTAAAATTCCCTCGGCAAAGCTGAAAAAAGCTAAAGCTGAGGGTCTTTTAACGCCATTAGAAGgaaaaacaaaagaacaaaaagaaaaaatactccGAGGCTTAGCCATGAACGGAATACCTTTGCCCGAAGGAAAAACGGCATCAGAAAGGAAGTTGATTGACAAAGTCATTCGTGACCTTGGCCTTCCTCCAGAGCCGAAAACAGACTCTCAAAAAAAGAAGTTCAAAGAAGCACAAGCAGCAGGACTTATTGTTCCTCTTGAAGGAAAATCGGCAGCGgaaaaagaaaagattttGCAAAAGCAAGCAGAGTTAGGACTAGAATTGCCAGAAGGTCGAACTCCTTCTGAGAAATCGTTAATCGCAAAAATAAAGGCCCAATCCAAAGGTAAACCACCTTCAACCATTAAAATTCCCTCGGCAAAGCTGAAAAAAGCTAAAGCTGAGGGTCTTTTAACGCCATTGGAAGgaaaaacaaaagaacaaaaagaaaaaatactccGAGGCTTAGCCATGAACGGAATACCTTTGCCCGAAGGAAAAACGGCATCAGAAAAGAAGTTGATTGACAAAGTCATTCGTGACCTTGGCCTTCCTCCAGAGCCGAAAACAGACTCTCAAAAAAAGAAGTTCAAAGAAGCACAAGCAGCAGGTGTTATTGTTCCTCTTGAAGGAAAATCTGCAGCGgaaaaagaaaagattttGCAAAAGCAAGCAGAGTTAGGACTAGAATTGCCAGAAGGTCGAACTCCTTCTGAGAAATCTTTAATCGCCAAAATAAAGGCCCAAACTAAGGGAAAACCACCTTCAACCCTAAAAATTCCCTCGGCGAAGCTGAAAAAAGCTAAAGCTGAGGGTCTTATAACGCCATTGGAAGgaaaaacaaaagaacaaaaagaaaaaatactccGAGGCTTAGCCATGAACGGAATACCTTTGCCCGAAGGAAAAACGGCATCAGAAAAGAAGTTGATTGACAAAGTGATTCGCGACCTTGGCCTTCCTCCAGAGCCAAAAACGGATTCTCAAAAAAAGAAGTTCAAAGAAGCACAAGCAGCAGGACTTATTGTTCCTCTTGAAGGAAAATCTGCAGCGgaaaaagaaaagattttACAAAAGCAAGCAGAGTTAGGATTGGATTTACCAGAAGGTCGAACACCATCTGAGAAAGCACTCATTGctaaaatgagaaaaaaagcTATGGGATTACCTTCGAAAATGGAAGTGCCATCTGAAAAGTTCCGAAAAGCTCGTGCTGCTGGATATTTAACTCCTTTAGAAGGAAAAACAAAGGAACAGAAGGAAAAAATCTTAAGAGGCCTCGCTATGAATGGCATTCCACTTCCTGAAGGCAAAACCCcgtctgaaaaaaaaatgattgataAGATACGTAATGAACTGGGACTTCCTCCATCTCCAGATACCCCATCGGCAAAAGAAAAGTACAAGAAAGCACTTGAAGCTGGGCTTATAACACCTCTCGAAGGTAAAAGCCCCGCACAGAAAGAAAAAATTCTTAGGGGTCAAGCTGCTATGGGTCTTCCACTACCTAAAGGTCGAACAAAGTCAGAAAAAGCATTAATAGAAAAACTAGAAAAGGAACTACCTGCTGCTATTACTTCTGCAGAAAGAATGCGGAAAGCAAAAGCTGCTGGAATTATGACTCCTTTGGAAGGAAAAACTGCTgcacaaaaagaaaaaatctttagAGACATGATTAAAGCTAAAGTTCCTTTACCTGAAGGTAAGACAGAGtcggaaaaaaatatattaaagaagTTAAGATCAGAGATGGGTTTGCCACCTGAACCCGATACTGTAtctcaaaagaaaaaacttgaaaaagcTTATGCTCAAGGTTTAATAACTCCTTTAAAAGGAAAGACTAAAGatcaaaaggaaaaaatattgaaagacATGATAGAAGCAGGCGTTAAGCTACCAGAAGGACGTACTGAGTCAGAAAAACAATTGATCCGACAAATCAAAGCATCAGCTGTACCATTTCCGGAAGGAAGAACACCATCAGAGaaagcttttataaaaaaggtaaaggCAGAGACGGGATATGTTTCACCTGAAGATTTAGAAAAAATGAAACGTGCCAAAGCAGCTGGTCTCTTAACACCACTAACTGGAAAGACACCacaacaaaaagaaaatatattgcGAGGTTTAGCTAAAGCAGGAATCCCGTTACCAGAAGCAAAAACACCCTCAGAAAAAGAGTTGATTAAAAAGGTCAGAAAAGATTTAGGATTACCACCGGAGCCAAAAACAGATTcggaaaaagaaaaatacaaaaaagctTTACAGGCAGGTGTCATAACCCCTTTAGAAGGTAAAACTGAAgctcaaaagaaaaatattttgcagaAAATGCACGACGCAGGTATAGAATTACCAGAAGGGAGAACGCCATCTGAAAAAGCATTAATAAGTAAGATAACAGGCAAACCAGTGCCACCTATATCACCTGAAGAAgctataaaaatagataaggaaaaattaaaaaaagcaaaagcAGAGGGTCTCCTTACTCCCTTAACAGGTAAAACACCacaagaaaaagaaaagattttAAGAGGTCTAGCCATGAACGGAATTCCTCTACCCGAAGGTAAAACACCATCTGAAAAGAAATTGATCGAAAAAGTAAGACACGATCTAGGATTGCCTCAAGAACCGAGAACATCAGCAGCCAAGCGTAAATATCAAGACGCTTTGGCCGCAGGCATCATCACCCCTCTGCAAGGTAAATCCGCTGcacagaaagaaaaaatcctaAATGCATTAGCTGATAAAGGCATTGATTTACCCGAAGGAAGAACTCCATCAGAAAAAGCTTTGATCGCTAAAGTTAAAGCTGCCAAGGGTATACCACCGACTCCAGTGGGCATTCCATCTGAAAAATTACGTAAAGCCAAAGCAGCAGGATTTCTTACGCCACTAACAGGTAAAACACCACaagaaaaagagaaaatattaaaaggtcTAGCCATGAATGGTATCCCTCTTCCAGAAGGAAAAACGCCTTCTGAAAAGAAGTTGATTGAAAAAGTAAGACACGATCTAGGATTGCCTCAAGAACCGAGAACATCAGCAGCCAAGCGTAAATATCAAGATGCTTTGGCCGCAGGCATCATCACCCCTCTGCAAGGTAAATCCGCTGcacagaaagaaaaaatcctaAATGCATTAGCTGATAAAGGCATTGATTTACCCGAAGGAAGAACTCCATCAGAAAAAGCTTTGATCGCTAAAGTTAAAGCCGCCAAGGGCATGCCACCGACTCCAGTGGGCATTCCATCTGAAAAATTACGTAAAGCCAAAGCAGCAGGATTTCTTACGCCTCTAACAGGTAAAACACCACaagaaaaagagaaaatattaaaaggtcTAGCCATGAATGGAATCCCTCTTCCAGAAGGAAAAACGCCTTCTGAAAAGAAGTTGATTGAAAAAGTAAGACACGATCTAGGATTGCCTCAAGAACCGAGAACATCAGCAGCCAAGCGTAAATATCAAGACGCTTTGGCCGCAGGCATCATCACCCCTCTGCAAGGAAAATCCGCTGcacagaaagaaaaaatcctaAATGCATTAGCTGATAAAGGCATTGATTTACCCGAAGGAAGAACTCCATCAGAAAAAGCTTTAATCGCTAAAGTTAAAGCTGCCAAGGGTATGCCACCGACTCCAGTGGGCATTCCATCTGAAAAATTACGTAAAGCCAAAGCAGCAGGATTTCTTACGCCTTTAACAGGTAAAACACCACaagaaaaagagaaaatattaaaaggtcTAGCCATGAATGGAATCCCTCTTCCAGAAGGAAAAACGCCTTCTGAAAAGAAGTTGATTGAAAAAGTAAGACACGATCTAGGATTGCCTCAAGAACCGAGAACATCAGCAGCCAAGCGTAAATATCAAGACGCTTTGGCCGCAGGCATTATCACCCCTCTGCAAGGTAAATCCGCTGcacagaaagaaaaaatcctaAATGCATTAGCTGATAAAGGCATTGATTTACCAGAAGGAAGAACTCCATCAGAAAAGGCTTTGATCGCTAAAGTTAAAGCCGCCAAGGGTATACCACCGACTCCAGTGGGCATTCCATCTGAAAAATTACGTAAAGCCAAAGCAGCAGGATTTCTTACGCCTCTAACAGGTAAAACACCACaagaaaaagagaaaatattaaaaggtcTAGCCATGAATGGAATCCCTCTTCCAGAAGGAAAAACGCCTTCTGAAAAGAAGTTGATTGAAAAAGTAAGACACGATCTAGGATTGCCTCAAGAACCGAGAACATCAGCAGCCAAGCGTAAATATCGAGATGCTTTGGCCGCAGGCATCATCACCCCTCTGCAAGGAAAATCCGCTGcacagaaagaaaaaatcctaAATGCATTAGCTGATAAAGGCATTGATTTACCAGAAGGAAGAACTCCATCAGAAAAAGCTTTGATCGCTAAAGTTAAAGCTGCCAAGGGTGTGCCGCCGACTGCAGCTGGCATTCCGTCTGAAAAATTACGTAAAGCCAAAGCAGCAGGATTTCTTACGCCACTAACAGGTAAAACACCACaagaaaaagagaaaatattaaaaggtcTAGCCATGAATGGAATCCCTCTTCCAGAAGGAAAAACGCCTTCTGAAAAGAAGTTGATTGAAAAAGTAAGACACGATCTAGGATTGCCACAAGAACCGAGGACATCAGCAGCCAAGCGTAAATATCAAGACGCTTTGGCCGCAGGCATCATCACCCCTCTGCAAGGAAAATCCGCTGcacagaaagaaaaaatcctaAATGCATTAGCTGATAAAGGCATTGATTTACCAGAAGGAAGAACTCCATCAGAAAAAGCTATGATCGCTAAAATTAAAGCCGCCAAGGGCATGCCGCCGACTCCAGTGGGGATTCCGTCTGAGAAATTGCGCAAAGCGAAAGCAGAAGGATTTCTTACGCCACTAACAGGTAAAACACCACaagaaaaagagaaaatattaaaaggtcTAGCCATGAATGGAATCCCTCTTCCAGAAGGAAAAACGCCTTCTGAAAAGAAGTTGATTGAAAAAGTAAGACACGATCTAGGATTGCCACAAGAACCGAGGACATCAGCAGCCAAGCGTAAATATCAAGACGCTTTAGCCGCAGGCATCATCACCCCTCTGCAAGGAAAATCCGCTGcacagaaagaaaaaatcctaAATGCATTAGCTGATAAAGGCATTGATTTACCTGAAGGGAGAACTCCATCAGAAAAAGCTATGATCGCTAAAATTAAAGCCGCCAAGGGCATGCCGCCGACTCCAGTGGGGATTCCGTCTGAGAAATTGCGCAAAGCGAAAGCAGAAGGATTTCTTACTCCTTTAACAGGTAAAACACCACaagaaaaagagaaaatattaaaaggtcTAGCCATGAATGGAATCCCTCTTCCAGAAGGAAAAACGCCTTCTGAAAAGAAGTTGATTGAAAAAGTAAGACACGATCTAGGATTGCCTCAAGAACCGAGAACATCAGCAGCCAAGCGTAAATATCAAGACGCTTTGGCCGCAGGCATCATCACCCCTCTGCAAGGAAAATCCGCTGcacagaaagaaaaaatcctaAATGCATTAGCTGATAAAGGCATTGATTTACCCGAAGGGAGAACTCCTTCAGAAAAAGCTTTGATCGCTAAAGTTAAAGCTGCCAAAGGCGCGCCGCCGGCTCCAGTGGGCATTCCGTCTGAAAAATTGCGTAAAGCCAAAGCAGCAGGATTTCTTACGCCACTAACAGGTAAAACACCACaagaaaaagagaaaatattaaaaggtcTAGCCATGAATGGAATCCCTCTTCCAGAAGGAAAAACGCCTTCTGAAAAGAAGTTGATTGAAAAAGTAAGACACGATCTAGGATTGCCACAAGAACCGAGAACATCAGCAGCCAAGCGTAAATATCAAGACGCTTTGGCCGCAGGCATCATCACCCCTCTGCAAGGAAAATCCGCTGcacagaaagaaaaaatcctaAATGCATTAGCTGATAAAGGCATTGATTTACCCGAAGGAAGAACTCCATCAGAAAAGGCTTTGATCGCTAAAATTAAAGCCGCCAAGGGTGTGCCTCCAACTGCAGTGGGCATTCCGTCTGAGAAATTGCGTAAAGCTAAAGCAGAAGGACTCCTTACTCCCTTAGAAGGTAAaacatcaaaagaaaaagagaaaatattaaaaggtcTAGCTAAGAATGGAATTCCTCTTCCTGAGGGAAAAACGCCGTCTGAAAAGAAATTGATAGAAAAAGTCAGACGTGAATTAGGCCTTCCACCAGAACCAAAAACGGCAGCGGATAAAGAAAAGTACGCAAAAGCTCTTGCTGCCGGAATGATTACACCACTGGATGGAAAATCACCAGAACAGAAAGAAAAAATTCTCAAAACTTTAGCCGAACAAGGGATAGATTTACCAGAAGGGAGAACTGCTTCAGAAAAAGCcctgataaaaaaaattgcaacagCAATTGGTAAACCCGTTATATCGGAAGCTGTTGTAACAAAACCTTTAAAAGAAGGAGAAGAAATCGaTGTTATCAATAAAAAGTCACCAACGGAGCTCAGAGGAAGTTGTTCTTTGAACGATGTAGTGGTATCCACGTGCAGTATTCCAGGATCCCGTGATGTCCATTTTGCAGACGAGTATTGTACAAGGACCTGTGGTGTGTCAGCTAACTCACTCCGACTCTGTGAGGAGGAAGACTCGTCAAAGCCAATGTATCTAATAGGCCCAGCGCCAAAACAACGAGGAGCAGTTATAGTCCCATCGTGTGCGTGTGATCCCGATGCAATGGtccaaatattaaaaaaggaacTCCAAAATCAAGAATGCAGTAGTTTTTCTACCCCTAAAATAGGTTAG
- Protein Sequence
- MKKAKAEGLITPLEGKTKEQKEKIIKGLAMHGLPLPEGKTASEKKLIDKVIRDLGLPPEPKTDSQKKKFKEAQAAGLIVPLEGKSAAEKEKILQKQAELGLELPEGRTPSEKSLIAKIKAQSKGKPPSTIKIPSAKLKKAKAEGLLTPLEGKTKEQKEKILRGLAMNGIPLPEGKTASERKLIDKVIRDLGLPPEPKTDSQKKKFKEAQAAGLIVPLEGKSAAEKEKILQKQAELGLELPEGRTPSEKSLIAKIKAQSKGKPPSTIKIPSAKLKKAKAEGLLTPLEGKTKEQKEKILRGLAMNGIPLPEGKTASEKKLIDKVIRDLGLPPEPKTDSQKKKFKEAQAAGVIVPLEGKSAAEKEKILQKQAELGLELPEGRTPSEKSLIAKIKAQTKGKPPSTLKIPSAKLKKAKAEGLITPLEGKTKEQKEKILRGLAMNGIPLPEGKTASEKKLIDKVIRDLGLPPEPKTDSQKKKFKEAQAAGLIVPLEGKSAAEKEKILQKQAELGLDLPEGRTPSEKALIAKMRKKAMGLPSKMEVPSEKFRKARAAGYLTPLEGKTKEQKEKILRGLAMNGIPLPEGKTPSEKKMIDKIRNELGLPPSPDTPSAKEKYKKALEAGLITPLEGKSPAQKEKILRGQAAMGLPLPKGRTKSEKALIEKLEKELPAAITSAERMRKAKAAGIMTPLEGKTAAQKEKIFRDMIKAKVPLPEGKTESEKNILKKLRSEMGLPPEPDTVSQKKKLEKAYAQGLITPLKGKTKDQKEKILKDMIEAGVKLPEGRTESEKQLIRQIKASAVPFPEGRTPSEKAFIKKVKAETGYVSPEDLEKMKRAKAAGLLTPLTGKTPQQKENILRGLAKAGIPLPEAKTPSEKELIKKVRKDLGLPPEPKTDSEKEKYKKALQAGVITPLEGKTEAQKKNILQKMHDAGIELPEGRTPSEKALISKITGKPVPPISPEEAIKIDKEKLKKAKAEGLLTPLTGKTPQEKEKILRGLAMNGIPLPEGKTPSEKKLIEKVRHDLGLPQEPRTSAAKRKYQDALAAGIITPLQGKSAAQKEKILNALADKGIDLPEGRTPSEKALIAKVKAAKGIPPTPVGIPSEKLRKAKAAGFLTPLTGKTPQEKEKILKGLAMNGIPLPEGKTPSEKKLIEKVRHDLGLPQEPRTSAAKRKYQDALAAGIITPLQGKSAAQKEKILNALADKGIDLPEGRTPSEKALIAKVKAAKGMPPTPVGIPSEKLRKAKAAGFLTPLTGKTPQEKEKILKGLAMNGIPLPEGKTPSEKKLIEKVRHDLGLPQEPRTSAAKRKYQDALAAGIITPLQGKSAAQKEKILNALADKGIDLPEGRTPSEKALIAKVKAAKGMPPTPVGIPSEKLRKAKAAGFLTPLTGKTPQEKEKILKGLAMNGIPLPEGKTPSEKKLIEKVRHDLGLPQEPRTSAAKRKYQDALAAGIITPLQGKSAAQKEKILNALADKGIDLPEGRTPSEKALIAKVKAAKGIPPTPVGIPSEKLRKAKAAGFLTPLTGKTPQEKEKILKGLAMNGIPLPEGKTPSEKKLIEKVRHDLGLPQEPRTSAAKRKYRDALAAGIITPLQGKSAAQKEKILNALADKGIDLPEGRTPSEKALIAKVKAAKGVPPTAAGIPSEKLRKAKAAGFLTPLTGKTPQEKEKILKGLAMNGIPLPEGKTPSEKKLIEKVRHDLGLPQEPRTSAAKRKYQDALAAGIITPLQGKSAAQKEKILNALADKGIDLPEGRTPSEKAMIAKIKAAKGMPPTPVGIPSEKLRKAKAEGFLTPLTGKTPQEKEKILKGLAMNGIPLPEGKTPSEKKLIEKVRHDLGLPQEPRTSAAKRKYQDALAAGIITPLQGKSAAQKEKILNALADKGIDLPEGRTPSEKAMIAKIKAAKGMPPTPVGIPSEKLRKAKAEGFLTPLTGKTPQEKEKILKGLAMNGIPLPEGKTPSEKKLIEKVRHDLGLPQEPRTSAAKRKYQDALAAGIITPLQGKSAAQKEKILNALADKGIDLPEGRTPSEKALIAKVKAAKGAPPAPVGIPSEKLRKAKAAGFLTPLTGKTPQEKEKILKGLAMNGIPLPEGKTPSEKKLIEKVRHDLGLPQEPRTSAAKRKYQDALAAGIITPLQGKSAAQKEKILNALADKGIDLPEGRTPSEKALIAKIKAAKGVPPTAVGIPSEKLRKAKAEGLLTPLEGKTSKEKEKILKGLAKNGIPLPEGKTPSEKKLIEKVRRELGLPPEPKTAADKEKYAKALAAGMITPLDGKSPEQKEKILKTLAEQGIDLPEGRTASEKALIKKIATAIGKPVISEAVVTKPLKEGEEIDVINKKSPTELRGSCSLNDVVVSTCSIPGSRDVHFADEYCTRTCGVSANSLRLCEEEDSSKPMYLIGPAPKQRGAVIVPSCACDPDAMVQILKKELQNQECSSFSTPKIG
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -