Aips025932.1
Basic Information
- Insect
- Agrotis ipsilon
- Gene Symbol
- -
- Assembly
- GCA_028554685.1
- Location
- CM052983.1:8034061-8064525[+]
Transcription Factor Domain
- TF Family
- MBD
- Domain
- MBD domain
- PFAM
- PF01429
- TF Group
- Unclassified Structure
- Description
- The Methyl-CpG binding domain (MBD) binds to DNA that contains one or more symmetrically methylated CpGs [2]. DNA methylation in animals is associated with alterations in chromatin structure and silencing of gene expression. MBD has negligible non-specific affinity for DNA. In vitro foot-printing with MeCP2 showed the MBD can protect a 12 nucleotide region surrounding a methyl CpG pair [2]. MBDs are found in several Methyl-CpG binding proteins and also DNA demethylase [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 20 0.0037 17 6.1 0.0 36 68 156 187 152 194 0.83 2 20 0.0037 17 6.1 0.0 36 68 195 226 191 233 0.83 3 20 0.0037 17 6.1 0.0 36 68 234 265 232 273 0.83 4 20 0.0037 17 6.1 0.0 36 68 273 304 269 311 0.83 5 20 0.0037 17 6.1 0.0 36 68 312 343 308 350 0.83 6 20 0.0039 17 6.1 0.0 36 68 351 382 349 389 0.83 7 20 0.0037 17 6.1 0.0 36 68 390 421 386 428 0.83 8 20 0.025 1.1e+02 3.4 0.0 36 54 429 447 425 452 0.85 9 20 0.0037 17 6.1 0.0 36 68 455 486 453 494 0.83 10 20 0.025 1.1e+02 3.4 0.0 36 54 494 512 490 517 0.85 11 20 0.0018 7.9 7.1 0.0 36 69 520 553 518 560 0.85 12 20 0.0036 16 6.1 0.0 36 68 560 591 558 600 0.83 13 20 0.15 6.9e+02 0.9 0.0 36 48 599 611 597 614 0.83 14 20 0.0034 15 6.2 0.0 36 68 650 682 624 689 0.87 15 20 0.026 1.2e+02 3.4 0.0 36 54 690 708 688 713 0.84 16 20 0.026 1.2e+02 3.4 0.0 36 54 716 734 714 739 0.84 17 20 0.0039 17 6.1 0.0 36 68 742 773 740 780 0.83 18 20 0.0037 17 6.1 0.0 36 68 781 812 777 819 0.83 19 20 0.0037 17 6.1 0.0 36 68 820 851 816 858 0.83 20 20 0.012 53 4.5 0.0 37 55 860 878 855 913 0.65
Sequence Information
- Coding Sequence
- ATGACGCTTGACCTTGATGAATGTCACGGGAGCCGGCGAACGACACCGGTCAACGCCCGATCAGCTGACAAGCGCACAGATGATTCAAATAACGgcGAGAAACCATTCCAATGTCGTTACTGCGACTATGCTTGTCGAGACAGCTCCACGGTAAGGAGGCATATGGAGAGGCATCTAGGAATATCTAAGACTTACCCGTGCTCCGTATGTAATAAGAATTTCAAAAAGAAGATGTCACTACAAGTACATATAGACGAAGTACATTTTGATCTGGACACGAGGAAGTACCCCTGTGAGCAGTGTGACAAGATGTTCAAAACTAAGAGGACGCTTAGCTCTCATGTTATCGCAGTCCACGACAAGTCAAATCGCGTCAAATGTGAGATCTGTGGCCTCGTCGTCACAAAAAACAATCTCCAGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGGTGAGTCATGTGTCTGTTATAAGAACAAACAACCTCCAGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGGTGAGTCATGTGTCTGTTATAAGAACAAACAACCTCCAGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGGTGAGTCATGTGTCTGTTATAAGAACAAACAACCTCCAGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGGTGAGTCATGTGTCTGTTATAAGAACAAACAACCTCCAGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGGTGAGTCATGTGTCTGTTATAAGAACAAACAACCTCCAGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGGTGAGTCATGTGTCTGTTATAAGAACAAACAACCTCCAGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGGTGAGTCATGTGTCTGTTATAAGAACAAACAACCTCCAGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGGTGAGTCATGTGTCTGTTATAAGAACAAACAACCTCCAGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGGTGAGTCATGTGTCTGTTATAAGAACAAACAACAACCTCCAGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGGTGAGTCATGTGTCTGTTATAAGAACAAACAACCTCCAGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGGTGAGTCATGTGTCTGTTATAAGAACAAACAAAACCTCCAGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGGTGAGTCATGTGTCTGTTATAAGAACAACAAACAACCTCCAGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGGTGAGTCATGTGTCTGTTATAAGAACAAACAACCTCCAGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGGTGAGTCATGTGTCTGTTATAAGAACAAACAACCTCCAGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGGTGAGTCATGTGTCTGTTATAAGAACAAACAACCTCCAGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGTCTCATATGAGACGACACATTGATGTGAGACCTTACCGGTGTAGCTACAAGCCATGCGGGAAGAGATTCAAGGATATGGGTGATTTAAAACGCCATCAATTAATCCACTACCCGGATTACCAACATATTTGTACAGTCTGTAACAGACGGTTCCCTCGCAAGTGGAGGCTCAAGAAACATACCTTGAATGGATGTCAAAGAGTACAGTGTCATGACTGTGGACAGCTatttacAGCGAAGAAATTCCTCGCCCAGCACATAAAGAATGCTCACGGTCCCTATCCAAAATCAAGAGAGTATCTCTGCGATGTTTGTGACATGGTCACTTACAGTAGGCGAGGGATCATACAACACTTGAAGTATGGACATGGTACTGAGAAGgacACGATATGCCAAATATGTAGAAAGGATTATTTCAAATGTCTAACTTTGAAGGAACATTATTTAAGTCATCATAACTTTATTTACACTTTGATGGATCAAAAGAAAGATGATTTGATAGAAATAAAAGAGGAACCTTTCGAAGAAATGGAACACGTAGCGGACCCCACGGAATTTATATATGAAGTTGATATACAGAAAACTGAAATCGATGAAATACCTTCAGATCCACCATTACAACTCCAGCCGCCGTCCCCTCAACACTTCCTAATAGATTCAGATTCCATCACATACTCTAAGAAGGTGTTATCAGAGGATATATCTCGCAGTTTAATAGATAAGGGTGGAGTGACTGATGTCTTCCACGAGTTGTTTATAGGACATGCTTTGGATATTAGTGGGGCTCCAGTAGTGACcaAGTTACCAAAACAAATAGTTGATGAGGAGAAAGAAAAGCACGTTGAGCAACGTCTAGAAAAACTTCTAACTGAAGTGCGTATGCGCAACCAATGGGTAGTTTTGGAGAAATTGCGCAAACAATACAATAGACGGATCAAAATGGCTACCAGTGGTGGAGAGAAGAAAGAACAAACATCAATAAGATATTTCAAGAAAACAGAAAACAATAACGAATCTActaaaacacaagaaaatacCAAAAATGATGAAGAAAGCAATATCGTAGGTGATGAAACAAGAAATAACCAAAAAAATGATGAATTGAACAAAGAACAGATTAACTTGGAGCAAGAAAATGGTGAAACTATTGAAAATGATGAAATTGGTGAAGAAACAATCAATGGTGATGTAATTGTTAAAGAAACAATCAATGGTGATGAGATGGATGATGGAGATGAAACAATAGATGATGAAAAAGATGTAGATTATAGAGAAGATGATTTTGAAAGTAACAGCGATGAAGAAACAAATCCGAATGGAAAACTGAAGTTTAATACTCATCAgtgttatatttgttttaagCTCTTCACAACAAAATCAGATTTGAAGTTCCACTGCAAGGAACACTTCGACATATGCAACGATAAAATGTTAAAGAAATGCCCGTTTTGTGGCTATGTCACTAATTTAACAATTACAAGACATATCCGATTGGTTCATAATGTTACTTTAGAAATACCGTACGGACGTATTAAGGAAAGAGATACGGGAATAGGGTCCAAATAtgtatttcaaattgacaaagacTGTGACCTAGAAGTTATACCAAGTgatgagtgcgcgcgcctagagcacgacacagagctgtcaaacaactga
- Protein Sequence
- MTLDLDECHGSRRTTPVNARSADKRTDDSNNGEKPFQCRYCDYACRDSSTVRRHMERHLGISKTYPCSVCNKNFKKKMSLQVHIDEVHFDLDTRKYPCEQCDKMFKTKRTLSSHVIAVHDKSNRVKCEICGLVVTKNNLQSHMRRHIDVRPYRCSYKPCGKRFKDMVSHVSVIRTNNLQSHMRRHIDVRPYRCSYKPCGKRFKDMVSHVSVIRTNNLQSHMRRHIDVRPYRCSYKPCGKRFKDMVSHVSVIRTNNLQSHMRRHIDVRPYRCSYKPCGKRFKDMVSHVSVIRTNNLQSHMRRHIDVRPYRCSYKPCGKRFKDMVSHVSVIRTNNLQSHMRRHIDVRPYRCSYKPCGKRFKDMVSHVSVIRTNNLQSHMRRHIDVRPYRCSYKPCGKRFKDMVSHVSVIRTNNLQSHMRRHIDVRPYRCSYKPCGKRFKDMSHMRRHIDVRPYRCSYKPCGKRFKDMVSHVSVIRTNNLQSHMRRHIDVRPYRCSYKPCGKRFKDMSHMRRHIDVRPYRCSYKPCGKRFKDMVSHVSVIRTNNNLQSHMRRHIDVRPYRCSYKPCGKRFKDMVSHVSVIRTNNLQSHMRRHIDVRPYRCSYKPCGKRFKDMLQAMREEIQGYGESCVCYKNKQNLQSHMRRHIDVRPYRCSYKPCGKRFKDMVSHVSVIRTTNNLQSHMRRHIDVRPYRCSYKPCGKRFKDMSHMRRHIDVRPYRCSYKPCGKRFKDMSHMRRHIDVRPYRCSYKPCGKRFKDMVSHVSVIRTNNLQSHMRRHIDVRPYRCSYKPCGKRFKDMVSHVSVIRTNNLQSHMRRHIDVRPYRCSYKPCGKRFKDMVSHVSVIRTNNLQSHMRRHIDVRPYRCSYKPCGKRFKDMSHMRRHIDVRPYRCSYKPCGKRFKDMGDLKRHQLIHYPDYQHICTVCNRRFPRKWRLKKHTLNGCQRVQCHDCGQLFTAKKFLAQHIKNAHGPYPKSREYLCDVCDMVTYSRRGIIQHLKYGHGTEKDTICQICRKDYFKCLTLKEHYLSHHNFIYTLMDQKKDDLIEIKEEPFEEMEHVADPTEFIYEVDIQKTEIDEIPSDPPLQLQPPSPQHFLIDSDSITYSKKVLSEDISRSLIDKGGVTDVFHELFIGHALDISGAPVVTKLPKQIVDEEKEKHVEQRLEKLLTEVRMRNQWVVLEKLRKQYNRRIKMATSGGEKKEQTSIRYFKKTENNNESTKTQENTKNDEESNIVGDETRNNQKNDELNKEQINLEQENGETIENDEIGEETINGDVIVKETINGDEMDDGDETIDDEKDVDYREDDFESNSDEETNPNGKLKFNTHQCYICFKLFTTKSDLKFHCKEHFDICNDKMLKKCPFCGYVTNLTITRHIRLVHNVTLEIPYGRIKERDTGIGSKYVFQIDKDCDLEVIPSDECARLEHDTELSNN
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00042270;
- 90% Identity
- iTF_00042270;
- 80% Identity
- iTF_00042270;