Xvio003697.1
Basic Information
- Insect
- Xylocopa violacea
- Gene Symbol
- MBD-R2
- Assembly
- GCA_963969225.1
- Location
- CAXAJV010000005.1:9198515-9206023[-]
Transcription Factor Domain
- TF Family
- MBD
- Domain
- MBD domain
- PFAM
- PF01429
- TF Group
- Unclassified Structure
- Description
- The Methyl-CpG binding domain (MBD) binds to DNA that contains one or more symmetrically methylated CpGs [2]. DNA methylation in animals is associated with alterations in chromatin structure and silencing of gene expression. MBD has negligible non-specific affinity for DNA. In vitro foot-printing with MeCP2 showed the MBD can protect a 12 nucleotide region surrounding a methyl CpG pair [2]. MBDs are found in several Methyl-CpG binding proteins and also DNA demethylase [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 2 2.5 8.5e+03 -3.4 0.1 43 60 399 416 398 431 0.78 2 2 2.2e-17 7.5e-14 51.3 0.2 6 69 542 605 537 613 0.90
Sequence Information
- Coding Sequence
- ATGCGGCGGCGTATAGCGCGGTCGCATGTTAAAAGGATGCCGCCTGTGGGAACGATGAGAATGGCGCGCAAATGTTGCGTGCGTACCTGTGAGGCCGACGTGCAGGATGCCCGGGCTAAAGGTTTGCCGCTTCATAAATTTCCGAAGGACGTTGCCTTGAGGAACAGATGGCTGGCCAGTGGTGGATTCGAACCTAGTTTCAAACCTTCACCGGGTCAAGTTGTTTGTCATAGACATTTTAAACGAGCCGATTATGAAGCCGCTAAGGGACATAAATTACTTCTCCGCAAAGGCAGTATTCCGTCGGTTTTTGCAGATTATGATAATCATCCGGATCCTGTAATTATGTCTGTGAAATCATCAACCTCATATGCACAGGAAGACTTAGATCTCATTAATTCTGAAATTTTAAATCTGGGGCAGTCTATATCACCATTAAATTTGAATGCTAGAACGCCAAAGTCCGATAGTTATGGAGAAACATGTTCTTCCAGACCAAATTCATCAGCTGATTCTCTTAATCAGTTAGATTCAGCAGAATCTGTTGATAATGGATCCAAACCTTTGACTGTTAAAGAAGAAATTGTACCTACTGTTAAACAAGAAATAGTTGAAAATTCAGAAATGGAAGTGAACACTATGGCAGAGCAGTTAGGAATTGTTGAATCTGATGTGGCgataaagcatatacaaaaagatCTGACTATTAAAGATGAATTGAAAACAATGAAAATCGAAGAGGAAAAAGTATTTGATAAACCAGAGGAATTAAAGCCAAGAATATGTAATAGAGACGGACTAAAATTCTTTCCTGGAGCCAAATTGGAGGCTAAAGATTTCAATGAGAAATGGTACTCTGCAAAAGTAGTTGAAACTGATTGGGATGAAAGGGAAGTTTTGATACGCTTCGACAAGTGGAGCGCAAGATTTGACGAATGGATACCTATGGATAGTTCTAGATTACGAGTATTGCAAACACAACCAAACGAACAAACCTGGAATCTGCCATCTCCGGAGGCGAAAATGAAAGACTTTTCAGTGGGAGAAAGGATACTTGCAACATGGGCCGACGGTAGAAAATATCCAGCTAAAGTAAATGCTGTCTTGGGAAACGATAGGTATGATGTACTGTTCGATGATGGATACGCGAAAACAGTTAAGTCGTCGAAAATGACAAAAATCGCGACAACATCGGCAAAGCAATCTAGTCAAACTGAAGAATATATAGGAAGCAAACAAGAAAGGAGAGACAAGAAAAGGAAGCATACAGTAATGGAGTTATTTCATACTCATTCTAGAAAACGTTCAAAAAATGAAACAGATAAGTTACCAAAAAAAGAAGCAGTTACAGTGAATGAAACTGGGGAATCCTTTCCAGAAACTAAAATTGATTTGGATGGTACTTTGTTTGGACCTTGTTATGATCCAGGCACTGATTTATTACGAGGATTTGATACTAACATATCTAAAATGAAAGCTTATTCTAAGAAAAATAAGAAAGAAGTATCTAAAACCGATGCAGAACAGATAGAAGATGTTGGACCTGAATGGATTGACGGAGAACCTCAAGGAACTGAATCTTACATAGTAGATGGAAATGATGGACCACGCCGATCAATAATAGTAGCAGATAAAAGATTGCCGCCAGGATGGCAAAAACATTTTACTCAAAGAAAAGCTGGTACGTCTGCTGGGAAATGGGATGTCTTGTTTCTTCACAAGTCAAGCGGGAAAAAGTTCAGATCAAAAAATGATATTAGAGCATTCATGGAAAATCAGGGACAATTCGACTTTGATCCTGAAAAATTTGATTTTTGTATTCATCGAAAGAAGAGAAACCAAAGCCAGAAAATAAAGCAGGATGTGATAGTTGATGTCCCAAAGAAGATTAAGACTTTATTACCTAAAGCAAAGGCAACACCCGTAACTGATAATTCGTTGCTTGTTCCCACGAATACACCAGTTACAACTGTAGTATCTGCTTCAACTACATCTGTTACAGATGGAGCTGTTTTTATCGGTGGACTTCGAGTAGAAATGGAGGACAGTGCTTACAAATGTCCAAAACAaggatgtagtaaaacatttagAAAGGAAAATCTTTTACAAATGCATATAAAGCACTATCATCCAGAGTATGCGAAATTTTTGGGATCTACGCCAAATGTTGCAGATCTAGCTTATGCAAGAACGATTGGAGAATCTGTCGAAGATATTATTCCGAAAAAATCAAATAATTCGTTAGAGAAGTGCAATAAATTCGGGAAAAGAAAATCTATTCAAGATAGATTGTCTGTATGTGTCTCATCACCACCTGTAACGAGTAATGTTTCACCTGTGTCTCCAACAATAATTGTACCAACTGCACCAGAAACGGAGGATGAAGTGGATCAAACAGAAAAATGTAACGACGTACAAATGGAAGATGTTAAAATAGAAAGAATGTCTCCAACTTCTAGTCATAGTTTAGATATGGATGATGAAACTGAGAAAAAACGGGAAGATACATGTGCAATGTCACCTGGAACGTTATTTGATATGAAAATCAGAGAAGAGAAAACACAAAGCGGTATTAAAACTCTTCTTCCAGTTAGGCCAGCTGTGTCTGCAGAAGTACAGAGAGTTGATAGATCGAAGTCTTTAGACGAAACAATGCATATTGAGAAGATGAAGGGTCAAAGAAAGCGACAATTATCAGAATACAGTTCTGATATATCAAACAGGGGTAAAAAGCGATCTGGCACTCAAGAATTTACAGATGAATACGGTGATTTGGATGATAGTGGTATGGATACGGAAGGACCGACTACACTTATGTATAGATATAGTCGCAGAAAATCAGATTCAAGAAGTGATGAAAATAGTCAAAGTAGTCAACTAAATGATTCTCGTGTTGAAAAAGATGATCCCTTCAGAGGGGATATTGGTAAAAAAGATACTAATGATGGGGAagAGAACGAAGGAGTTATGATGATGATTAATGGTGAAATGGTAAAAGTGGAACAGCTACGTAGAGAAGAAATAATAAATTGTACTTGTGGATTTATGGAGGAAGATGGTTTAATGATACAATGCGATCTTTGTTTGTGTTGGCAACATGGTCACTGCAATGCAATAGAAAGAGAAAAAGATGTACCTGAAAAATATGTTTGTTTTATTTGTCGGAATCCATATCGACAACGACAGTCTAAGAAGTATTGTCATGATCAAGATTGGATAAAGGAAGGGAAATTACCAACATTACCTAATCGAACGAAAAACCAACATCGAATCAATCAAAGAACAGCTATGTTAAAACGTTCTTATGATTTAGTTGCCGCTCTTTTACATATACAACAAGTTTTACATAGTTTACGGGTGAAAATTAATGTAGCACAAAAGAAAGATCATCCAAAGTTGTATCTTTGGGCAAAAAAATGGGAAGAAACCGACATACCAAAACCAAATCTAACACCAGTACCGGTTATGGAAGTTATGAAATCAGGAAAAGATTGTACAGATACTGCTAGTGAGAGTTCTTGTCGAGTTGAAGTTAAAATGGAAACAAAATTCAGTTTAAAAGATGATCACGATGAAAAATCAATAGCATCGGATTCAGAATTGATGAAGATATTAGAAGAAGATAATACAACTTCAGATGAGTCTAAAATTACgtctaaaaaagaagatatttcaCATCAAAGTAAGAGTCATATTCTTCTTGATGCCCTTACAAAAAATGATATCTCTGAaggaaaatataaaaattcatTACCTTCGGAAGTAAAAACAGATACAGATTTGTTGACTAATCAAACGAATGTGTCGGAAAATAATGTACCTGTGTCAAATGCATCAACGAATCACATGCATGAAGAATTAAACGAACATGAAATGTCCACACCATTGCAACCTTTTATACCACAGCCAGAAGCGCCAATTGATCCAGGGGAATGTCGAATGCGATTATTAGAACATATAGAACACTTTCAAAATCACATAGACGCAAAATTAACATTTATCGAGGCACAAGTTTGTGCATTAGAAGCCATGGATCCTGAAGATGTACCTACCTCAGATGTTCAACCCCGAACTAAACAAACAGTTCAGATGCTTCTTCGAGATTTAAACACAGTACGGAAATTAGCTGCTTTGTGTTGA
- Protein Sequence
- MRRRIARSHVKRMPPVGTMRMARKCCVRTCEADVQDARAKGLPLHKFPKDVALRNRWLASGGFEPSFKPSPGQVVCHRHFKRADYEAAKGHKLLLRKGSIPSVFADYDNHPDPVIMSVKSSTSYAQEDLDLINSEILNLGQSISPLNLNARTPKSDSYGETCSSRPNSSADSLNQLDSAESVDNGSKPLTVKEEIVPTVKQEIVENSEMEVNTMAEQLGIVESDVAIKHIQKDLTIKDELKTMKIEEEKVFDKPEELKPRICNRDGLKFFPGAKLEAKDFNEKWYSAKVVETDWDEREVLIRFDKWSARFDEWIPMDSSRLRVLQTQPNEQTWNLPSPEAKMKDFSVGERILATWADGRKYPAKVNAVLGNDRYDVLFDDGYAKTVKSSKMTKIATTSAKQSSQTEEYIGSKQERRDKKRKHTVMELFHTHSRKRSKNETDKLPKKEAVTVNETGESFPETKIDLDGTLFGPCYDPGTDLLRGFDTNISKMKAYSKKNKKEVSKTDAEQIEDVGPEWIDGEPQGTESYIVDGNDGPRRSIIVADKRLPPGWQKHFTQRKAGTSAGKWDVLFLHKSSGKKFRSKNDIRAFMENQGQFDFDPEKFDFCIHRKKRNQSQKIKQDVIVDVPKKIKTLLPKAKATPVTDNSLLVPTNTPVTTVVSASTTSVTDGAVFIGGLRVEMEDSAYKCPKQGCSKTFRKENLLQMHIKHYHPEYAKFLGSTPNVADLAYARTIGESVEDIIPKKSNNSLEKCNKFGKRKSIQDRLSVCVSSPPVTSNVSPVSPTIIVPTAPETEDEVDQTEKCNDVQMEDVKIERMSPTSSHSLDMDDETEKKREDTCAMSPGTLFDMKIREEKTQSGIKTLLPVRPAVSAEVQRVDRSKSLDETMHIEKMKGQRKRQLSEYSSDISNRGKKRSGTQEFTDEYGDLDDSGMDTEGPTTLMYRYSRRKSDSRSDENSQSSQLNDSRVEKDDPFRGDIGKKDTNDGEENEGVMMMINGEMVKVEQLRREEIINCTCGFMEEDGLMIQCDLCLCWQHGHCNAIEREKDVPEKYVCFICRNPYRQRQSKKYCHDQDWIKEGKLPTLPNRTKNQHRINQRTAMLKRSYDLVAALLHIQQVLHSLRVKINVAQKKDHPKLYLWAKKWEETDIPKPNLTPVPVMEVMKSGKDCTDTASESSCRVEVKMETKFSLKDDHDEKSIASDSELMKILEEDNTTSDESKITSKKEDISHQSKSHILLDALTKNDISEGKYKNSLPSEVKTDTDLLTNQTNVSENNVPVSNASTNHMHEELNEHEMSTPLQPFIPQPEAPIDPGECRMRLLEHIEHFQNHIDAKLTFIEAQVCALEAMDPEDVPTSDVQPRTKQTVQMLLRDLNTVRKLAALC
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01539657; iTF_00220579; iTF_00219841; iTF_00231361; iTF_00226595; iTF_00219112; iTF_00215754; iTF_00227274; iTF_00222571; iTF_00215021; iTF_00230029; iTF_00233229; iTF_00217072; iTF_00226640; iTF_00220627; iTF_00219157; iTF_00219798; iTF_00231318; iTF_00227319; iTF_00230072; iTF_00233274; iTF_00215067; iTF_00215707; iTF_00222525; iTF_00217118; iTF_00230729; iTF_00230690; iTF_00216394; iTF_00216442; iTF_00228015; iTF_00227967; iTF_00225235; iTF_00225281; iTF_00221305; iTF_00221259; iTF_00225917; iTF_00225961; iTF_00761023; iTF_00760980; iTF_00217792; iTF_00217748; iTF_00224552; iTF_00223238; iTF_00228717; iTF_00223189; iTF_00228671; iTF_00141226; iTF_00142505; iTF_00141262; iTF_00142464; iTF_00141825; iTF_00141870; iTF_00140587; iTF_00140635; iTF_00684208; iTF_00684166; iTF_00223923; iTF_00223874; iTF_00684787; iTF_00684825; iTF_00983604; iTF_00983646; iTF_00733953; iTF_00733991; iTF_00983011; iTF_00982967; iTF_00982290; iTF_00982333; iTF_00676021; iTF_00676068; iTF_01418997; iTF_01420932; iTF_01418350; iTF_01420276; iTF_01419641; iTF_00218423; iTF_00218471; iTF_00303031; iTF_00303089; iTF_01417752; iTF_01417706; iTF_00118009; iTF_00118057; iTF_00232604; iTF_00232647; iTF_00214436; iTF_00231982; iTF_00221910; iTF_00214392; iTF_00232026; iTF_00221951; iTF_01424382; iTF_01424428;
- 90% Identity
- iTF_01539657;
- 80% Identity
- iTF_01539657;