Emex001302.1
Basic Information
- Insect
- Eufriesea mexicana
- Gene Symbol
- BAZ2B
- Assembly
- GCA_001483705.1
- Location
- NW:274383-380135[-]
Transcription Factor Domain
- TF Family
- MBD
- Domain
- MBD domain
- PFAM
- PF01429
- TF Group
- Unclassified Structure
- Description
- The Methyl-CpG binding domain (MBD) binds to DNA that contains one or more symmetrically methylated CpGs [2]. DNA methylation in animals is associated with alterations in chromatin structure and silencing of gene expression. MBD has negligible non-specific affinity for DNA. In vitro foot-printing with MeCP2 showed the MBD can protect a 12 nucleotide region surrounding a methyl CpG pair [2]. MBDs are found in several Methyl-CpG binding proteins and also DNA demethylase [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 1 2.9e-22 5.8e-19 66.7 0.0 4 76 603 675 600 676 0.94
Sequence Information
- Coding Sequence
- ATGGAGAAGGAGAACTCCGCATCGGGCGGAGGTGGCGGGGGCGGTGGAGGTGGCGGAGGCGAAGCGGCGGCCACGGCGACTCCAGGCGCCACCTCCGCCTCCGAGAAACTCCAGGCGGACCAGGCCAATCCGTTGCTCGATCCGACTGCGCTCTTCAGCGCGTATTGGCCACGGGGCGACAGTGCAGCTTCGTCGCTTTTCAGCGGGATGCCGGGCGGATATGGATTGGGGGCCCATCATTTACCATCGGCTTACGCCATTCTGGGCCGTGCAGGCTCTGCACCCGGATTCGGGGGCCACACACCGGCTTCCGCTCCGCCGCCACCCCCGTACTCCCACAACAGCCTTGGTACCCTGAGCGTAGCTGCCAGTCAGGCTGCGAGTTTAgGTATCAATCCCGCTAGTGCAGCATGGTGGACGATGGCGTCACACTTAGCCGCACAGGACTACCTCGCGAGGTTACAAGGAGCTGCAGGATTGCCCGGATTTCCGCCTGGCGCCGAGAGCCTGCTGCCACCCTATCCTGCCTCTCTACTTAATCCCCCGTCCCTATCGTCTCACAAGTCCAGTAAGTCTAAGTCAAGCAAGAGTCACAAGACGCCAGCGAGCAGTAGCAGCTCGACGACGCCTAGTATGACGAGCAGCAGTTTGCCGGTCTCGACTCAAGCACCGGTCACGTCCTCTCATCACAGCACGTCGGCCAGCAGCACGCCGAATTCGCAAACGAACGTTGTCAGTTCTGCGAAAGAGGGCAGCGACCCTAGCAGTATATTAGGAGGTGTCCGGCTGCCTCCAGACACAGAGATTATCAAGTACACGTCGAGCATAGTCGGTCCAAAGGTTCCTGGCACAACGAACCGCGGCAGGAAGAAGACTATATCCTTAGACACGCCGAGCGTCAGCGTTCATCCACCGCCGATACCAGCTCTGAGCGCTCATCAAACCAACACGACGACGTCGTCGTTGATGATGGAGCCGAGAAAGTACAATCGCACGGGGACCGAGTCGAACGATTACAGGGAGTCGGTGGATCGCGTGGAGGTGATCAAATTGCCGGCACATTCGACGAACGGCTCTGTTCTGCCGGCGCCATCGTCGTACACGACCACCACCAACGCCAGCAATTCCAACGACTCGGACGCACCACTGAACCTCTCGCTGAAGCCCGCGACGACGAGCAGTAGCTCGCCGATTTCCGGCAGCCAGCCGCTCAGTCAGCTGAGTAATTTAAGTCAGTCGTTACTCGCCACCGATCGAACTTCGAGAAGAAAGCCCGGACCGAAGCCTCGAAGAGTGCCACAGAACTCTGTACCCGTGCCAGCATCTTCGAGTCCGTCGTTGGCGCAGCTGTTCGCCGCGGCGGACTCGCCGCAACGACCGAGCAGCGGAAGCGAGGAGAGCGAGAGCGCCAGCACGACTCACCACAAAGACGGTCGGCCAAGGAACCTGGGTCGCGGGGTATCGAAACCGAAGAAGAACACGGTGGCCTCGTTGCTGGCTCAGAGCAGAGCCCTGGGAATGAAACAGACGACCACCTTGGACCCCAGCGTGCCATTGTCTCATCAGGTCTCGTTACTGAGGTCTAACATTCTGGCTGCTCAGCTGCATGCCTCCGGACAGAGCGACGACAAGAACCAGCGGTCTTTGCAGGAGAAGATGAAGAACAAGCTGCTCGAGGTGTCCGGCGAGGAGAGCAACATGGACGTGACGAGCGAAAGCGGCAGCAACACCGACGTTGTGACGGATACCGACGACGACAACGCGGATGGCGTGTCCAGCGCAAAGAGAAGAAAGGTGAAGCCCAGCGAGAGGGATCTCCAGGTGCCGCTTGAGCGTGGCTGGAAGCGGGAGACCGTCATCAAAGGATTAGGGAAGTCGGGGGTGATAAAGGGTGACGTGTCTTATTATAGTCCTTGTGGAAAGACGTTCAGAAGCAGTCCGGATTTAGCGAAGTTTTTGGAGCAACAGAATCCGCCCGAGCTAACGACGGCACACTTTTCGTTCTCGTCTCGTCCGCTTGTGGGCGAGTTTCTGCAGCCAACGATGGGCCTCGCGGAGGCGGAATTCGTTAGGTTGGGCGCCCAGGAAGTCGCGAGAAGATTGGAGGAGCTGAGAGCCGCGGGTGGTTTCAGGGACGTGCGAACGAACAATCAGTACGAGCGAGAGAAACTGGCGTACGCAAAAAAGTTGGCGAAAGAGGAGGCGCAGCGGCACAAGGAGCAGGCTAGGCTCATCAAGGAGCAGGAGAAAACGGAAAGGCAGGAGGCAGTGAGACGGGAGCGGGAGATTCGAAATCAACAGCTGCTCGAGGCTCGGAAAAAACGGCAGGAAGAGGTGGAGAAGATACGACTGGAAGAGCAACAACGGAAGCAACAGGAACGAGAGCTGAAGCGGCAGCAGGCGGTTATGCTGAAAGAACAGATGTACATGCAGGAGCTCACCAAGCAGCGCGAGATGCTCTACACCGTCGAGCTGGAAAGAGAACGAAGGAGACAGCACATGGCGCTGGTTAGAGCACTGGAGAATCGTCGGAAGATGGAGGAAAGGGAGAAGAAACGGCTGGAGGCAAGAGCCGAGCGAATAGCGACGAAAGAGAAGCGCGCCGAGCAGAGGAAGGTAGAGATGGAACTGATCGAACAGATCAGAAAGCCTGTGGAGGACATGGAGCTAACGGATCACAGACCACTGCCAGAATTGAAACGAATACCTGGTCTGAAGCTGTCCGGTCAGGCGTTCGCTGATATCGTGATGGTTTTCGAGTTTTTGCATAATTTTGGCGAGACTTTAGGCTTCGATATGGAGTCGCTGCCGAGCCTAAAGAGCCTTCAGCTGGCACTCCTCAACGACGAGGAAGCGGAAGAAGAGCTCTTGTCCGTGATGACACATCTATTAGTATGCGCAATCGAGGATCCAGGAATCCCTCAACCGGCGAGGCATACGACAGGTCTTGGTCAGAGTCTGCGTCAAGCTGACATAACGCACGCTAACATCAGCGAAGTGTTACGGATTTACTTATACGCGAACGCGACCGGAGAGGTGAAAGCTTTGACAGGGGTGTGCCTGGAACGAGAACGTGACAAAAAGTTCGCTGATCATCATCAGAACGGCGGTGATTATGCTTCGACCTGTTCGGGTAAAAATGCCCAATTTTATGAGCACCTGCACAACAACGAAACGTGGAAGATGTCCGAAAGGCTGAGGGACAAACCATTCTTAGCCTTGAACCCGACGCACAAAGCTCAGATGCTAGCGTTCCTCTGCAACGAGCTGTTGCAGAACAAGGCTGTGATCAGACAGATTGAGGGGAGCTTGGAAACGGTGGCTCAATTGAGGAAAGAGAGATTCGTGTTAGATACAAAGATTAGAAAGTTGAGGCAGTTACATAGTCGAAAGGTGCGAATGGAAGCTGTTGGTGTAATTGTTAATAAGACTGGGGACACTATTACGATTGAAAAGAAAGAGGTTGACGAGGAGGGTAACACGACGTCGACAGCAGTAGGAACGACACCCACTCCGGATGAGATTCATCATGAAGATGAGGTTGAGGATATGTCTGAGAACGAGAGCGAAGGAACACAGCCCGAGGAGGAGGAAGATAAAAATCTGTCTGGCGAAGAACTTGGTAAAAAATTGGATAAATTGTTGAAACAGTCGGAGGAACAATTGCAGAAATTGAATAGCTCCTCGAAACAGTTACGAGCACATAtattcggtcaggataggtatTGGAGAAGATACTGGGAGCTGGCATGCGCGGGTGGCATCTTCGTCGaggcaatggagagcgcagaaccGGAAATTCTGGAACTGCAAGCTGAATTAGACGAAAAGTACAAAAATGTGTCATTGGAGAAGAAACCAGAAACGAAGCAGGAGGACACCAAAGTCGAAAATCGCGAAAATGAAGCTCCCAATGACgtaaagaaggaaaagaagttCAATTCCAGTGAACAAGAGGAAGATGTGAAGCCTGTGGTGGATAAAACCAAGTCTGAGGTTGAAGATATCGGTTGTAAAAAGGAACCTAtgcaaaattatgaaaatttgacAAACGTaaaggaggagaaaaagaaCGATTTGGACAGTCCGATGACCGATGCCAAGACCAACGTTACGTCTGAAGAGATTAAACAAGAAACGGAAGTTGTTAGCATGGATGTGGATATCAAAGAAGAGACAAAGAAAGAGAACGACGAAACGGACGAAGATATGAAATCGGTGAAGATGATGGAAGATAAGATCGTTGAGACAATTCCGAACGGTGACAAGTACAATCACGTTAACAATCTTCATAATGGAAAGGAGCTGAATGGCACATTTATTTCCAATAACAGTAATGAATTCAACTGGTTCTCGATTCTACCACGAGAAACTTGCGACACTCCAGGACCAAGTACCAAACAAATATTTGGAATAGCTGAACCGACCGAGCTGAGAATACCAGTATTCCCTCCTCCAGCTAGTCCAAATTACGACAGGTGCGACAGTCCTGCTCCTTTGATTTTAACTCAAGACGAGGCAGCCCAACTGGAGTATTTGAAAGTGCATGGTTTGCCACCTCCTGGTGAAGCTAAACCAGTACCAAGAGACTTGAGATATGGTTGGTGGAGAATAACAGATGTCGATACATTTCAAGAATTATTGGAGCACCTTCATTCTCGCGGCGTTCGCGAGAAGGAACTAAAACGTACAACGTGGGCAACCATGGAATCCTTTTTAGCTGTTACGGGAAAGATCAACGTGGACCCTGGCAATGTTACTGCAACTGAACTTCAAGCGACGCCCGACGAACCCGATACACCGGTTCCAAAACCAGATAACCCAGCAGACTGGAGCGAACAAGTCGCGTTACGTGTGGATGCGCAGTTATTGGAACAAGTTGAGGCCCTAGAAGACAAAGTCGCAAATGCCAGCATGCAGGTCAAAGGCTGGAAGTTACCTCCACGGGCAGGAACCGAGGAGGctgaagaaattgaaaaactAAACGAGATGGAGAAGATAAGTGCAGTTGAACAAGCGCGACAAAGATTATTGTCTCTGGAAGCTGCTATAGAAAGGAGATACTTAAAACCACCATTAggcgtttgTACGGGAGATCCAAACTTAGCGGCCTTAAAAGCAGAACAAGCGGCTGCTGCAAACGCGAATTCGAATAATTCGGATCAGAACAGTCAGACCCCGGTACCTCAAGAAGAAACGACTCCAAGAGGACTGAACAACTGGCGGGAGGCAACAGCTCGAGCACATACATCAGCTCAGCTCGCTATGGCACTTTATATGTTGGAGGCCAGCATCGCTTGGGACAAGAGCATCATGAAGGCTGTGAGTCTAACACCAGCTAGAAACTCGGTCTGCGTCAAGCTACGAAACCGCTGCGTCTCACTCAAAGCTACCACTCAATACAATCAGCTATTGACTACTTCTCAGGCCTCTAATTGTCAGTTTTGTCACAGCGGAGATAACGAAGACAAACTATTACTGTGTGATGGTTGTGACCGCGGCTATCATACTTATTGTTTCCGTCCAAAAATGGAAAACATTCCTGATGGTGACTGGTATTGTCACGAATGCATGAACAAAGCAACAGGTGAACGAAACTGTTTGGTATGTGGAAAGAGAGTTGGTAAAAACTTAGTGCTATGTGAACTCTGTCCACGAGCTTATCACACTGACTGCCACAATCCTGTTATGCCAAAAATGCCAAGGGGAAAATGGTATTGTTCTAATTGCCACAGTAAACAACCAAAGAAGAGAAATAGTAGTCGAAGGAGTCATACCAAAGGGGGAGGCACCAGAGAAAGTGAAAGTTCTGATCATCCACCAGCTAGTCCAACGCCGTCAACGGCATCGAACACACACGTAGAGGACGTCAGTTCATCGGAACCGGCAACCCCAACTGCTTCGCCACGGAAGGAGGGAAACAATAGGACGCTCACGAAGAAACAGCAACGAGAGTTGGCTCCTTGTAAGGTGCTACTCGAACAGTTGGAGCAACAGGACGAGGCCTGGCCGTTCCTCTTGCCGGTGAACACCAAACAGTTCCCTAcctacaagaaaattattaaaacaccCATGGATCTCAGTACGATCAAGAAGAAATTGCAGGACTCCGTGTACAAGTCTCGCGATGAGTTTTGCGCCGATGTCAGACAGATGTTCATCAACTGCGAGGTATTCAACGAGGACGACAGTCCTGTGGGGAAGGCCGGACACGGGATGCGCAGTTTCTTCGAAATGCGCTGGACCGAGATTACTGGCGCACCACCTCCACATCCGCAAACGCATAGCTGA
- Protein Sequence
- MEKENSASGGGGGGGGGGGGEAAATATPGATSASEKLQADQANPLLDPTALFSAYWPRGDSAASSLFSGMPGGYGLGAHHLPSAYAILGRAGSAPGFGGHTPASAPPPPPYSHNSLGTLSVAASQAASLGINPASAAWWTMASHLAAQDYLARLQGAAGLPGFPPGAESLLPPYPASLLNPPSLSSHKSSKSKSSKSHKTPASSSSSTTPSMTSSSLPVSTQAPVTSSHHSTSASSTPNSQTNVVSSAKEGSDPSSILGGVRLPPDTEIIKYTSSIVGPKVPGTTNRGRKKTISLDTPSVSVHPPPIPALSAHQTNTTTSSLMMEPRKYNRTGTESNDYRESVDRVEVIKLPAHSTNGSVLPAPSSYTTTTNASNSNDSDAPLNLSLKPATTSSSSPISGSQPLSQLSNLSQSLLATDRTSRRKPGPKPRRVPQNSVPVPASSSPSLAQLFAAADSPQRPSSGSEESESASTTHHKDGRPRNLGRGVSKPKKNTVASLLAQSRALGMKQTTTLDPSVPLSHQVSLLRSNILAAQLHASGQSDDKNQRSLQEKMKNKLLEVSGEESNMDVTSESGSNTDVVTDTDDDNADGVSSAKRRKVKPSERDLQVPLERGWKRETVIKGLGKSGVIKGDVSYYSPCGKTFRSSPDLAKFLEQQNPPELTTAHFSFSSRPLVGEFLQPTMGLAEAEFVRLGAQEVARRLEELRAAGGFRDVRTNNQYEREKLAYAKKLAKEEAQRHKEQARLIKEQEKTERQEAVRREREIRNQQLLEARKKRQEEVEKIRLEEQQRKQQERELKRQQAVMLKEQMYMQELTKQREMLYTVELERERRRQHMALVRALENRRKMEEREKKRLEARAERIATKEKRAEQRKVEMELIEQIRKPVEDMELTDHRPLPELKRIPGLKLSGQAFADIVMVFEFLHNFGETLGFDMESLPSLKSLQLALLNDEEAEEELLSVMTHLLVCAIEDPGIPQPARHTTGLGQSLRQADITHANISEVLRIYLYANATGEVKALTGVCLERERDKKFADHHQNGGDYASTCSGKNAQFYEHLHNNETWKMSERLRDKPFLALNPTHKAQMLAFLCNELLQNKAVIRQIEGSLETVAQLRKERFVLDTKIRKLRQLHSRKVRMEAVGVIVNKTGDTITIEKKEVDEEGNTTSTAVGTTPTPDEIHHEDEVEDMSENESEGTQPEEEEDKNLSGEELGKKLDKLLKQSEEQLQKLNSSSKQLRAHIFGQDRYWRRYWELACAGGIFVEAMESAEPEILELQAELDEKYKNVSLEKKPETKQEDTKVENRENEAPNDVKKEKKFNSSEQEEDVKPVVDKTKSEVEDIGCKKEPMQNYENLTNVKEEKKNDLDSPMTDAKTNVTSEEIKQETEVVSMDVDIKEETKKENDETDEDMKSVKMMEDKIVETIPNGDKYNHVNNLHNGKELNGTFISNNSNEFNWFSILPRETCDTPGPSTKQIFGIAEPTELRIPVFPPPASPNYDRCDSPAPLILTQDEAAQLEYLKVHGLPPPGEAKPVPRDLRYGWWRITDVDTFQELLEHLHSRGVREKELKRTTWATMESFLAVTGKINVDPGNVTATELQATPDEPDTPVPKPDNPADWSEQVALRVDAQLLEQVEALEDKVANASMQVKGWKLPPRAGTEEAEEIEKLNEMEKISAVEQARQRLLSLEAAIERRYLKPPLGVCTGDPNLAALKAEQAAAANANSNNSDQNSQTPVPQEETTPRGLNNWREATARAHTSAQLAMALYMLEASIAWDKSIMKAVSLTPARNSVCVKLRNRCVSLKATTQYNQLLTTSQASNCQFCHSGDNEDKLLLCDGCDRGYHTYCFRPKMENIPDGDWYCHECMNKATGERNCLVCGKRVGKNLVLCELCPRAYHTDCHNPVMPKMPRGKWYCSNCHSKQPKKRNSSRRSHTKGGGTRESESSDHPPASPTPSTASNTHVEDVSSSEPATPTASPRKEGNNRTLTKKQQRELAPCKVLLEQLEQQDEAWPFLLPVNTKQFPTYKKIIKTPMDLSTIKKKLQDSVYKSRDEFCADVRQMFINCEVFNEDDSPVGKAGHGMRSFFEMRWTEITGAPPPHPQTHS
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01520225; iTF_00264994; iTF_01245220; iTF_00264270; iTF_01270498; iTF_01268269; iTF_01269056; iTF_00016035; iTF_00016679; iTF_01261830; iTF_00014717; iTF_00385316; iTF_00252069; iTF_00867397; iTF_00868796; iTF_00938021; iTF_00086278; iTF_00360212; iTF_00085375; iTF_00084550; iTF_00088822; iTF_00087159; iTF_00087997; iTF_00298310; iTF_01473925; iTF_00294477; iTF_00297550; iTF_00296015; iTF_00291536; iTF_00292947; iTF_00293720; iTF_01473178; iTF_01475450; iTF_00255080; iTF_00252816; iTF_00253566; iTF_01198664; iTF_00770371; iTF_01199404; iTF_00296779; iTF_00292269; iTF_00295247; iTF_01395123; iTF_01537958; iTF_00224551; iTF_00730055; iTF_01514823; iTF_00229352; iTF_00231981; iTF_00214391; iTF_00232603; iTF_00227273; iTF_01254793; iTF_01355306; iTF_01498219; iTF_01514155; iTF_00215020; iTF_01120204; iTF_00982289; iTF_00873798; iTF_00982966; iTF_01099185; iTF_01228370; iTF_00898739; iTF_00254324; iTF_00769620; iTF_01123060; iTF_00963846; iTF_01122432; iTF_00219797; iTF_00625493; iTF_00866007; iTF_00865318; iTF_00762550; iTF_00862562; iTF_00861874; iTF_00763227; iTF_00864636; iTF_01077463; iTF_01477562; iTF_00633627; iTF_00391290; iTF_01394299; iTF_01130813; iTF_01087198; iTF_00142463; iTF_00230689; iTF_01065569; iTF_01069667; iTF_01068296; iTF_01068973; iTF_01066255; iTF_00760979; iTF_00221909; iTF_01067609; iTF_00222524; iTF_01066940; iTF_01070364; iTF_01424381; iTF_00141824; iTF_00452575; iTF_01474687;
- 90% Identity
- iTF_00227273;
- 80% Identity
- -