Basic Information

Gene Symbol
-
Assembly
GCA_028554685.1
Location
CM052983.1:8034061-8064525[+]

Transcription Factor Domain

TF Family
MBD
Domain
MBD domain
PFAM
PF01429
TF Group
Unclassified Structure
Description
The Methyl-CpG binding domain (MBD) binds to DNA that contains one or more symmetrically methylated CpGs [2]. DNA methylation in animals is associated with alterations in chromatin structure and silencing of gene expression. MBD has negligible non-specific affinity for DNA. In vitro foot-printing with MeCP2 showed the MBD can protect a 12 nucleotide region surrounding a methyl CpG pair [2]. MBDs are found in several Methyl-CpG binding proteins and also DNA demethylase [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 20 0.0037 17 6.1 0.0 36 68 156 187 152 194 0.83
2 20 0.0037 17 6.1 0.0 36 68 195 226 191 233 0.83
3 20 0.0037 17 6.1 0.0 36 68 234 265 232 273 0.83
4 20 0.0037 17 6.1 0.0 36 68 273 304 269 311 0.83
5 20 0.0037 17 6.1 0.0 36 68 312 343 308 350 0.83
6 20 0.0039 17 6.1 0.0 36 68 351 382 349 389 0.83
7 20 0.0037 17 6.1 0.0 36 68 390 421 386 428 0.83
8 20 0.025 1.1e+02 3.4 0.0 36 54 429 447 425 452 0.85
9 20 0.0037 17 6.1 0.0 36 68 455 486 453 494 0.83
10 20 0.025 1.1e+02 3.4 0.0 36 54 494 512 490 517 0.85
11 20 0.0018 7.9 7.1 0.0 36 69 520 553 518 560 0.85
12 20 0.0036 16 6.1 0.0 36 68 560 591 558 600 0.83
13 20 0.15 6.9e+02 0.9 0.0 36 48 599 611 597 614 0.83
14 20 0.0034 15 6.2 0.0 36 68 650 682 624 689 0.87
15 20 0.026 1.2e+02 3.4 0.0 36 54 690 708 688 713 0.84
16 20 0.026 1.2e+02 3.4 0.0 36 54 716 734 714 739 0.84
17 20 0.0039 17 6.1 0.0 36 68 742 773 740 780 0.83
18 20 0.0037 17 6.1 0.0 36 68 781 812 777 819 0.83
19 20 0.0037 17 6.1 0.0 36 68 820 851 816 858 0.83
20 20 0.012 53 4.5 0.0 37 55 860 878 855 913 0.65

Sequence Information

Coding Sequence
ATGACGCTTGACCTTGATGAATGTCACGGGAGCCGGCGAACGACACCGGTCAACGCCCGATCAGCTGACAAGCGCACAGATGATTCAAATAACGgcGAGAAACCATTCCAATGTCGTTACTGCGACTATGCTTGTCGAGACAGCTCCACGGTAAGGAGGCATATGGAGAGGCATCTAGGAATATCTAAGACTTACCCGTGCTCCGTATGTAATAAGAATTTCAAAAAGAAGATGTCACTACAAGTACATATAGACGAAGTACATTTTGATCTGGACACGAGGAAGTACCCCTGTGAGCAGTGTGACAAGATGTTCAAAACTAAGAGGACGCTTAGCTCTCATGTTATCGCAGTCCACGACAAGTCAAATCGCGTCAAATGTGAGATCTGTGGCCTCGTCGTCACAAAAAACAATCTCCAGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGGTGAGTCATGTGTCTGTTATAAGAACAAACAACCTCCAGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGGTGAGTCATGTGTCTGTTATAAGAACAAACAACCTCCAGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGGTGAGTCATGTGTCTGTTATAAGAACAAACAACCTCCAGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGGTGAGTCATGTGTCTGTTATAAGAACAAACAACCTCCAGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGGTGAGTCATGTGTCTGTTATAAGAACAAACAACCTCCAGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGGTGAGTCATGTGTCTGTTATAAGAACAAACAACCTCCAGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGGTGAGTCATGTGTCTGTTATAAGAACAAACAACCTCCAGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGGTGAGTCATGTGTCTGTTATAAGAACAAACAACCTCCAGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGGTGAGTCATGTGTCTGTTATAAGAACAAACAACAACCTCCAGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGGTGAGTCATGTGTCTGTTATAAGAACAAACAACCTCCAGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGGTGAGTCATGTGTCTGTTATAAGAACAAACAAAACCTCCAGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGGTGAGTCATGTGTCTGTTATAAGAACAACAAACAACCTCCAGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGGTGAGTCATGTGTCTGTTATAAGAACAAACAACCTCCAGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGGTGAGTCATGTGTCTGTTATAAGAACAAACAACCTCCAGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGGTGAGTCATGTGTCTGTTATAAGAACAAACAACCTCCAGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGGGTGATTTAAAACGCCATCAATTAATCCACTACCCGGATTACCAACATATTTGTACAGTCTGTAACAGACGGTTCCCTCGCAAGTGGAGGCTCAAGAAACATACCTTGAATGGATGTCAAAGAGTACAGTGTCATGACTGTGGACAGCTatttacAGCGAAGAAATTCCTCGCCCAGCACATAAAGAATGCTCACGGTCCCTATCCAAAATCAAGAGAGTATCTCTGCGATGTTTGTGACATGGTCACTTACAGTAGGCGAGGGATCATACAACACTTGAAGTATGGACATGGTACTGAGAAGgacACGATATGCCAAATATGTAGAAAGGATTATTTCAAATGTCTAACTTTGAAGGAACATTATTTAAGTCATCATAACTTTATTTACACTTTGATGGATCAAAAGAAAGATGATTTGATAGAAATAAAAGAGGAACCTTTCGAAGAAATGGAACACGTAGCGGACCCCACGGAATTTATATATGAAGTTGATATACAGAAAACTGAAATCGATGAAATACCTTCAGATCCACCATTACAACTCCAGCCGCCGTCCCCTCAACACTTCCTAATAGATTCAGATTCCATCACATACTCTAAGAAGGTGTTATCAGAGGATATATCTCGCAGTTTAATAGATAAGGGTGGAGTGACTGATGTCTTCCACGAGTTGTTTATAGGACATGCTTTGGATATTAGTGGGGCTCCAGTAGTGACcaAGTTACCAAAACAAATAGTTGATGAGGAGAAAGAAAAGCACGTTGAGCAACGTCTAGAAAAACTTCTAACTGAAGTGCGTATGCGCAACCAATGGGTAGTTTTGGAGAAATTGCGCAAACAATACAATAGACGGATCAAAATGGCTACCAGTGGTGGAGAGAAGAAAGAACAAACATCAATAAGATATTTCAAGAAAACAGAAAACAATAACGAATCTActaaaacacaagaaaatacCAAAAATGATGAAGAAAGCAATATCGTAGGTGATGAAACAAGAAATAACCAAAAAAATGATGAATTGAACAAAGAACAGATTAACTTGGAGCAAGAAAATGGTGAAACTATTGAAAATGATGAAATTGGTGAAGAAACAATCAATGGTGATGTAATTGTTAAAGAAACAATCAATGGTGATGAGATGGATGATGGAGATGAAACAATAGATGATGAAAAAGATGTAGATTATAGAGAAGATGATTTTGAAAGTAACAGCGATGAAGAAACAAATCCGAATGGAAAACTGAAGTTTAATACTCATCAgtgttatatttgttttaagCTCTTCACAACAAAATCAGATTTGAAGTTCCACTGCAAGGAACACTTCGACATATGCAACGATAAAATGTTAAAGAAATGCCCGTTTTGTGGCTATGTCACTAATTTAACAATTACAAGACATATCCGATTGGTTCATAATGTTACTTTAGAAATACCGTACGGACGTATTAAGGAAAGAGATACGGGAATAGGGTCCAAATAtgtatttcaaattgacaaagacTGTGACCTAGAAGTTATACCAAGTgatgagtgcgcgcgcctagagcacgacacagagctgtcaaacaactga
Protein Sequence
MTLDLDECHGSRRTTPVNARSADKRTDDSNNGEKPFQCRYCDYACRDSSTVRRHMERHLGISKTYPCSVCNKNFKKKMSLQVHIDEVHFDLDTRKYPCEQCDKMFKTKRTLSSHVIAVHDKSNRVKCEICGLVVTKNNLQSHMRRHIDVRPYRCSYKPCGKRFKDMVSHVSVIRTNNLQSHMRRHIDVRPYRCSYKPCGKRFKDMVSHVSVIRTNNLQSHMRRHIDVRPYRCSYKPCGKRFKDMVSHVSVIRTNNLQSHMRRHIDVRPYRCSYKPCGKRFKDMVSHVSVIRTNNLQSHMRRHIDVRPYRCSYKPCGKRFKDMVSHVSVIRTNNLQSHMRRHIDVRPYRCSYKPCGKRFKDMVSHVSVIRTNNLQSHMRRHIDVRPYRCSYKPCGKRFKDMVSHVSVIRTNNLQSHMRRHIDVRPYRCSYKPCGKRFKDMSHMRRHIDVRPYRCSYKPCGKRFKDMVSHVSVIRTNNLQSHMRRHIDVRPYRCSYKPCGKRFKDMSHMRRHIDVRPYRCSYKPCGKRFKDMVSHVSVIRTNNNLQSHMRRHIDVRPYRCSYKPCGKRFKDMVSHVSVIRTNNLQSHMRRHIDVRPYRCSYKPCGKRFKDMLQAMREEIQGYGESCVCYKNKQNLQSHMRRHIDVRPYRCSYKPCGKRFKDMVSHVSVIRTTNNLQSHMRRHIDVRPYRCSYKPCGKRFKDMSHMRRHIDVRPYRCSYKPCGKRFKDMSHMRRHIDVRPYRCSYKPCGKRFKDMVSHVSVIRTNNLQSHMRRHIDVRPYRCSYKPCGKRFKDMVSHVSVIRTNNLQSHMRRHIDVRPYRCSYKPCGKRFKDMVSHVSVIRTNNLQSHMRRHIDVRPYRCSYKPCGKRFKDMSHMRRHIDVRPYRCSYKPCGKRFKDMGDLKRHQLIHYPDYQHICTVCNRRFPRKWRLKKHTLNGCQRVQCHDCGQLFTAKKFLAQHIKNAHGPYPKSREYLCDVCDMVTYSRRGIIQHLKYGHGTEKDTICQICRKDYFKCLTLKEHYLSHHNFIYTLMDQKKDDLIEIKEEPFEEMEHVADPTEFIYEVDIQKTEIDEIPSDPPLQLQPPSPQHFLIDSDSITYSKKVLSEDISRSLIDKGGVTDVFHELFIGHALDISGAPVVTKLPKQIVDEEKEKHVEQRLEKLLTEVRMRNQWVVLEKLRKQYNRRIKMATSGGEKKEQTSIRYFKKTENNNESTKTQENTKNDEESNIVGDETRNNQKNDELNKEQINLEQENGETIENDEIGEETINGDVIVKETINGDEMDDGDETIDDEKDVDYREDDFESNSDEETNPNGKLKFNTHQCYICFKLFTTKSDLKFHCKEHFDICNDKMLKKCPFCGYVTNLTITRHIRLVHNVTLEIPYGRIKERDTGIGSKYVFQIDKDCDLEVIPSDECARLEHDTELSNN

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00042270;
90% Identity
iTF_00042270;
80% Identity
iTF_00042270;