Sper005420.1
Basic Information
- Insect
- Sarcophaga peregrina
- Gene Symbol
- SMARCC2
- Assembly
- GCA_014635995.1
- Location
- CM025789.1:127543163-127547186[+]
Transcription Factor Domain
- TF Family
- MYB
- Domain
- Myb_DNA-binding domain
- PFAM
- PF00249
- TF Group
- Helix-turn-helix
- Description
- This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 1 1.4e-14 2.3e-11 44.2 0.2 3 44 651 692 649 694 0.95
Sequence Information
- Coding Sequence
- ATGTGCACTTTAGCTCCAAAAAAGGACGGTAGTCCAAATATTGAGTTCTACAATTCCCCCGAGTCATTGCAAGGGTTCGAAACCATTCGGCAGTGGcttcaaaaaaattgtaaaaaacaTTTGGCTAGTAGTAATGAGCCAATAACAAAAGAATCATTGTCTCAACTATTGATACAATTTCTTCAATATGTGGAGGCGAAACTTGGTAAAAACTCTACAGAACCACCAGCGACGCGTATACCGATGCGTTGTTTTTTGGATTTCAGACCAGGTGGTGGCTTATGCATCATATTCTCTACCATGTTTCGCTTTCGTGCTGACcagagaggaaaaaaatttgatttttccgTTGGAAAAAATCCTACTCGTAAAGACCCCAATATTCAATTATTGATTGATATTGAACAAGCCTTAGTTGAGGCTGATCTTTATCGCATTCCCTACATTTTTATACGTCCCGAGGTAGACAAACAACTAGTGACAGTTTTGCGGGAAATTTTAGCTACCCGACGTGTAGAGATTGTATCGGACGAAGAAGACGCGACTCATATTATATATCCAGTGGTTGATCCACACCCTGACGAATATGCTCGTCCGATTTTTAAGCGTGGTAATTACGTTATGATGCACTGGTATTATTTCCCAGAATCGTATGATACGTGGACCGCAAATACATTGGAATTACCGGATAATATTCCAGAAAATCCCGAGTCTCCAGCGGAACGTTGTTGGCGGGTGTCCGCCTCTTGGATTTTAGATCTTGATCAGTATAATGAGTGGATGGCTGAAGAAGACTACGAAGTAGACGaaatgggaaagaaaaagaTGTACAAACAACGTTTATCAATTGACGATATAATGTCTGGAGGTGttgatgaaaagaaaaaggtGGTTGGTGGTACACCAGCTGgtgcaaaacaaaaacgtaGGCGATCACCGTCGCCCAGTTCTTCTAAACCAGGAAAACGTAAACGTTCTCCAGCTGTTCTCCATAAAAAATCTCGTAATGACGAAGAAGATGAAGATCTAACACGTGACATGGAGGATCCGCCTGCTGAAACTAATTTGCAGGAAGTTATTAAGACCAGTTCATTGCAGTCGACAGCTAGTCCTGCGCCTGGTAGCAAATCTCGGGCTGATAATGATATGATGCCGATAAAgggTGGCACCCTAACAGATCTTGACGATGAAATGACTGGTGGCAGTGGCACACAAGCTTTATCTGCGTGTGATGGAGAAAATTCCCAAACAGGAAAAACTAGTGATAACAGTAACACCCAAGAATTTTCTTCCTCAGCAAAAGAAGATATGGAGGATAATGTCACCGAACAAACGCATCACATTATAGTGCCATCCTATTCGGCATGGTTCGATTACAATTCCATTCATGTAATTGAGAAACGTGCAATGCcagaatttttcaattcaaaaaataagTCAAAAACTCCAGAAATTTATATGGCCTACAGAAATTTCATGATTGACACGTATAGGCTTAATCCAACTGAGTATTTAACTAGCACTGCCTGCAGACGTAACTTGGCTGGGGATGTATGTGCTATTATGCGAGTTCATGCTTTTCTTGAGCAATGGGGTCTTATAAATTATCAAATTGACGCCGAGTTGCGTCCCACACCAATGGGACCACCGCCAACttctcattttcatattttgtccgACACGCCTTCGGGTATACAGTCTTTGAATCCTCAGAAAACACAGCAGCCTTCGGCTGCAAAAACATTGCTTGACCTAGAGAAAAAACTTTCAGGTGGTCCTAATAAAGAAGGCAAAGAAGATTCCTTGGATAAAGCTCCAGTGGGTATTAAAACTGAGCCGATAGAAAACGGTAGCGCTCCACCAGCATCAGGACAATTTGGATTAAAACTAGATCAGTATGCTAAAAAACCGGCTGCTATGAAAAATCGAACGGCTGCAAGCATGTCACGTGAATGGACGGATCAAGAAACTTTACTTCTACTGGAAGGTTTAGAATTACATAAGGATGACTGGAATAAAGTATGTGAACACGTTGGTACACGTACTCAAGATGAATGTATTTTACATTTCCTACGGTTACCAATTGAAGATCCCTACTTAGAAGATGACGGTGGATTTTTGGGACCATTTGGTTGCCAACCAATACCGTTTAGTAAATCGGGAAATCCTATAATGTCTACAGTAGCATTTCTAGCCTCAGTAGTTGATCCACGGGTTGCTGCAGCTGCTGCTAAAGCAGCAATGGAAGAATTTGCCGCTATAAAAGACGAAGTGCCAGCAACAATTATGGATAATCACATGAAAAATGTCGAAAAGGCTTCAGCAGCTGGTAAATTTAATCCAACGTATGGATTGGCAAGTAGTGGTATAGCTGGTACTGGTGCCGACAAAGAGGATGAAGAGCCCAATATACCAACAGCAAATGTTGTTCCTCCTTCTGGATCTAACGatgaagaaatgaaagacaTTTCGAAAAAAGACGATAAAGATCTTACCAAATCCCCTTCCAAATCCAAAGACGAATCTAAAGACAAAgataaaaaagatgaaaaagaaaccgataacaaaacaaacaaaaaagatgaAAatGCTACTGTCCTTGACGTCAAGGATATTAAGTCCGATGATTCTACTGGTGATGGTAGCGGAGCTGAATGCACCTCAAAAGATATCGATCCAGCTAAACAAGTATTCAATGAAGCGAACGTTCAAactgcagctgctgctgctctaGCTTCAGCTGCTGTTAAAGCTAAGCACTTAGCTGCtttagaagaaagaaaaatcaaatctttAGTGGCTCTTTTAGTTGAAACCCAGATGAAAAAACTAGAGATTAAATTGCGTCATTTTGAAGAATTGGAGACAACAATGGAGCGTGAACGTGAGGGATTAGAGTATCAAAGACAACAACTTATAACGGAACGTCAACAATTTCATCTAGAGCAATTAAAGGCAGCTGAGTTCAGAGCACGTCAGCAGGCTCATCATCGCCTCCAACAGGAACAGCAATGGCAGGGCGGTGCCTCAGGTACTGGAGCTCCGGCCGGTAATGCTAATGCTTCAAGTGCAAGTGGTACAAGTGTTGGTTCAACACTGCCACAGAGTCATCCTTTAACTGGGACACCACCTGTTGGTACACAATTAGGTCCTTCTGCTACAATGCCTGGAGCTCCTGCTGCAGCCCCATCACCACAGGGTGCAGCTACAACTTCTACACCGGCAAGTAGTTCTATAGCCAGTGctcaaacaccaggtcaaacCACAACACCAATGGATACAACACCGGCTGCTGGCGCTCCACCAGGTACAAATACACCGGCAGAAACTCCTGTCCCTTCGTCTGTATCAGCCCCATCCATACCATCGACGAATCCCTCTGCTGTAATGCCAACTAATGTATCTGCACCTCCTCAACCAACACCTAACATTCCACCCGCCCCAGGAGGTCCAACAGGTGCAACCACATCATAA
- Protein Sequence
- MCTLAPKKDGSPNIEFYNSPESLQGFETIRQWLQKNCKKHLASSNEPITKESLSQLLIQFLQYVEAKLGKNSTEPPATRIPMRCFLDFRPGGGLCIIFSTMFRFRADQRGKKFDFSVGKNPTRKDPNIQLLIDIEQALVEADLYRIPYIFIRPEVDKQLVTVLREILATRRVEIVSDEEDATHIIYPVVDPHPDEYARPIFKRGNYVMMHWYYFPESYDTWTANTLELPDNIPENPESPAERCWRVSASWILDLDQYNEWMAEEDYEVDEMGKKKMYKQRLSIDDIMSGGVDEKKKVVGGTPAGAKQKRRRSPSPSSSKPGKRKRSPAVLHKKSRNDEEDEDLTRDMEDPPAETNLQEVIKTSSLQSTASPAPGSKSRADNDMMPIKGGTLTDLDDEMTGGSGTQALSACDGENSQTGKTSDNSNTQEFSSSAKEDMEDNVTEQTHHIIVPSYSAWFDYNSIHVIEKRAMPEFFNSKNKSKTPEIYMAYRNFMIDTYRLNPTEYLTSTACRRNLAGDVCAIMRVHAFLEQWGLINYQIDAELRPTPMGPPPTSHFHILSDTPSGIQSLNPQKTQQPSAAKTLLDLEKKLSGGPNKEGKEDSLDKAPVGIKTEPIENGSAPPASGQFGLKLDQYAKKPAAMKNRTAASMSREWTDQETLLLLEGLELHKDDWNKVCEHVGTRTQDECILHFLRLPIEDPYLEDDGGFLGPFGCQPIPFSKSGNPIMSTVAFLASVVDPRVAAAAAKAAMEEFAAIKDEVPATIMDNHMKNVEKASAAGKFNPTYGLASSGIAGTGADKEDEEPNIPTANVVPPSGSNDEEMKDISKKDDKDLTKSPSKSKDESKDKDKKDEKETDNKTNKKDENATVLDVKDIKSDDSTGDGSGAECTSKDIDPAKQVFNEANVQTAAAAALASAAVKAKHLAALEERKIKSLVALLVETQMKKLEIKLRHFEELETTMEREREGLEYQRQQLITERQQFHLEQLKAAEFRARQQAHHRLQQEQQWQGGASGTGAPAGNANASSASGTSVGSTLPQSHPLTGTPPVGTQLGPSATMPGAPAAAPSPQGAATTSTPASSSIASAQTPGQTTTPMDTTPAAGAPPGTNTPAETPVPSSVSAPSIPSTNPSAVMPTNVSAPPQPTPNIPPAPGGPTGATTS*
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01312944; iTF_01315440; iTF_01194332; iTF_00350114; iTF_00045292; iTF_01260929; iTF_01259148; iTF_00258803; iTF_00259707; iTF_00817351; iTF_00435461; iTF_00199815; iTF_01074411; iTF_01162191; iTF_01397359; iTF_01399395; iTF_01398425; iTF_01137942; iTF_01374006; iTF_00899483; iTF_00900520; iTF_01231515; iTF_01165469; iTF_00370943; iTF_00816480; iTF_00921376; iTF_00922040; iTF_01201543; iTF_01174157; iTF_01235656; iTF_01427407; iTF_01313740; iTF_00655255; iTF_00975543; iTF_01376569; iTF_00997659; iTF_00760028; iTF_00741810; iTF_01109276; iTF_00679422; iTF_00716574; iTF_00892670; iTF_00331456; iTF_01237590; iTF_01176869; iTF_01236597;
- 90% Identity
- iTF_00997659;
- 80% Identity
- -