Cpra018289.1
Basic Information
- Insect
- Cryptocheilus praepositus
- Gene Symbol
- run
- Assembly
- GCA_033815515.1
- Location
- JAWWQZ010000300.1:383719-388344[+]
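The Location field packs scaffold, start, end and strand into one string. A minimal sketch of how it could be split apart is shown here; the `Location` type and `parse_location` helper are hypothetical names, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for the parts of a Location string."""
    scaffold: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "scaffold:start-end[strand]".
_LOCATION_RE = re.compile(
    r"(?P<scaffold>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]"
)


def parse_location(text: str) -> Location:
    """Split a Location string into scaffold, start, end and strand."""
    match = _LOCATION_RE.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        scaffold=match["scaffold"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("JAWWQZ010000300.1:383719-388344[+]")
# Assumption: coordinates are 1-based and inclusive on both ends.
span = loc.end - loc.start + 1
print(loc.scaffold, loc.strand, span)
```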
Transcription Factor Domain
- TF Family
- Runt
- Domain
- Runt domain
- PFAM
- PF00853
- TF Group
- Beta-Scaffold Factors
- Description
- The AML1 gene is rearranged by the t(8;21) translocation in acute myeloid leukemia [1]. The gene is highly similar to the Drosophila melanogaster segmentation gene runt and to the mouse transcription factor PEBP2 alpha subunit gene [1]. The region of shared similarity, known as the Runt domain, is responsible for DNA binding and protein-protein interaction. In addition to the highly conserved Runt domain, the AML1 gene product carries a putative ATP-binding site (GRSGRGKS) and has a C-terminal region rich in proline and serine residues. The protein (known as acute myeloid leukemia 1 protein, oncogene AML-1, core-binding factor (CBF) alpha-B subunit, etc.) binds to the core site, 5'-pygpyggt-3', of a number of enhancers and promoters.
- Hmmscan Out
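| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|----|----------|----------|-------|------|----------------|--------------|----------------|--------------|----------------|--------------|-----|
| 1 | 1 | 3.5e-73 | 1.8e-69 | 231.8 | 0.0 | 2 | 129 | 17 | 144 | 16 | 144 | 0.98 |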
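The Hmmscan Out row above can be read field by field. The sketch below pairs each value with a column label and derives the length of the matched region on the model and on the protein. The dictionary keys and the `parse_domain_row` helper are illustrative names chosen here, not part of any tool's API, and the column order is assumed to follow the header as printed in this record.

```python
# Illustrative labels for the thirteen columns of the domain row.
FIELDS = [
    "domain", "of", "c_evalue", "i_evalue", "score", "bias",
    "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to", "acc",
]
# Columns holding integer indices or coordinates; the rest are floats.
INT_FIELDS = {
    "domain", "of", "hmm_from", "hmm_to",
    "ali_from", "ali_to", "env_from", "env_to",
}


def parse_domain_row(row: str) -> dict:
    """Turn one whitespace-separated domain row into a labelled dict."""
    values = row.split()
    if len(values) != len(FIELDS):
        raise ValueError(f"expected {len(FIELDS)} columns, got {len(values)}")
    return {
        name: int(value) if name in INT_FIELDS else float(value)
        for name, value in zip(FIELDS, values)
    }


hit = parse_domain_row("1 1 3.5e-73 1.8e-69 231.8 0.0 2 129 17 144 16 144 0.98")

# Inclusive lengths of the match on the profile and on the protein.
hmm_span = hit["hmm_to"] - hit["hmm_from"] + 1
ali_span = hit["ali_to"] - hit["ali_from"] + 1
print(f"score={hit['score']} hmm_span={hmm_span} ali_span={ali_span}")
```

Under these assumptions the alignment covers protein residues 17 to 144, which is the region to extract when only the Runt domain is of interest.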
Sequence Information
- Coding Sequence
- ATGCATTTGCCGGAGGGACCGTTGGGCATGGAGAGCTTCAGCGCGATCCACGAGACGCTGAGGGCATGCCATGGGGATCTCGTGCAGACCGGAAGTCCCGCCATCCTCTGCAGCGCACTGCCATCCCACTGGAGGTCGAACAAGTCGCTGCCGGTCGCGTTCAAGGTCGTCGCGCTCGACGACGTCAGCGACGGCACCCTCGTCACTATCAGGGCCGGCAACGATGAGAACTGCTGCGGCGAGCTCAGGAATTGTACCGCCGTCATGAAGAACCAGGTCGCCAAGTTCAACGACTTGCGATTTGTCGGACGCAGCGGTAGAGGAAAGTCATTCTCGCTGACGATCCAAATCAGCACGGTGCCGTTTCAAGTGGCAACTTACAATAAAGCCATTAAAGTCACCGTCGATGGACCAAGAGAACCCAGATCCAAGTCGAATTATCAGTATGGGCCCGGCTTCCCTGGACTGGGACTGCTCAATCCGTGGCTGGACGCGGCCTACCTCGGTCACGCCTGGCATCTGCCGCATCCGGCTCTCGTCAAAGGAACGATACCAATGCCACCCGCCGACCTCTTCACGCCAACGTTCGCGCCCACCGTTCTGCCATCGTACCCGTTCGAGCACATAAAGTACCCGACCGAGTACACGACCTTACCCCCCAAAGCCGCGAGCCAAGCGACTGCGGCCACCACGATACCTAACAGTCCATCACGGACCCCGCCAAGAAGTCCGAGCGAGTCAGAAAGCGAGTCAACCGCCGAGGAAGTCCGAAGTGCCTTTGTGCCGATCCGACTGAACACACTCCCTCCGAGTTCATCAGTGACgccttcgtcgtcgtcgcccGAGCGGTTGGCATCCAGGAAACCGACCGAAGGCGTGAGGAACGAGCTGAAGGCACCGACGGCCCTCATCTCGCAGAGAATCTCGTCGCCGAAGAGGAGTCCGTCGCCGACGAAGATCTCGTCGCCACCGCCTGCCAAACCGGTTTGGAGGCCGTACTAG
- Protein Sequence
- MHLPEGPLGMESFSAIHETLRACHGDLVQTGSPAILCSALPSHWRSNKSLPVAFKVVALDDVSDGTLVTIRAGNDENCCGELRNCTAVMKNQVAKFNDLRFVGRSGRGKSFSLTIQISTVPFQVATYNKAIKVTVDGPREPRSKSNYQYGPGFPGLGLLNPWLDAAYLGHAWHLPHPALVKGTIPMPPADLFTPTFAPTVLPSYPFEHIKYPTEYTTLPPKAASQATAATTIPNSPSRTPPRSPSESESESTAEEVRSAFVPIRLNTLPPSSSVTPSSSSPERLASRKPTEGVRNELKAPTALISQRISSPKRSPSPTKISSPPPAKPVWRPY
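Two quick consistency checks on the sequences above can be sketched as follows: translating the start of the coding sequence with the standard genetic code, and locating the putative ATP-binding motif GRSGRGKS mentioned in the Description. To keep the example short, only the first 60 nucleotides and the first 150 residues of this record are used; the helper names are hypothetical, and the standard codon table is an assumption, since the record does not say which table was used.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (assumed here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate complete codons of a coding sequence; case is ignored."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def find_motif(protein: str, motif: str) -> tuple[int, int]:
    """Return 1-based inclusive (start, end) of the first motif occurrence."""
    index = protein.find(motif)
    if index < 0:
        raise ValueError(f"motif {motif!r} not found")
    return index + 1, index + len(motif)


# First 60 nt of the Coding Sequence and first 150 aa of the Protein Sequence.
CDS_PREFIX = "ATGCATTTGCCGGAGGGACCGTTGGGCATGGAGAGCTTCAGCGCGATCCACGAGACGCTG"
PROTEIN_PREFIX = (
    "MHLPEGPLGMESFSAIHETLRACHGDLVQTGSPAILCSALPSHWRSNKSL"
    "PVAFKVVALDDVSDGTLVTIRAGNDENCCGELRNCTAVMKNQVAKFNDLR"
    "FVGRSGRGKSFSLTIQISTVPFQVATYNKAIKVTVDGPREPRSKSNYQYG"
)

translated = translate(CDS_PREFIX)
matches_protein = PROTEIN_PREFIX.startswith(translated)

motif_start, motif_end = find_motif(PROTEIN_PREFIX, "GRSGRGKS")
# Alignment coordinates taken from the Hmmscan Out row of this record.
inside_domain = 17 <= motif_start and motif_end <= 144
print(matches_protein, motif_start, motif_end, inside_domain)
```

Running the same functions on the full sequences would additionally show whether the coding sequence ends in a stop codon.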
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01255315; iTF_01453867; iTF_01120001; iTF_00738114; iTF_01513251; iTF_01511979; iTF_01512624; iTF_01514614; iTF_01513945; iTF_00760772; iTF_00089317; iTF_00773573; iTF_00460266; iTF_01001034; iTF_01000323; iTF_01394060; iTF_01389973; iTF_01391733; iTF_00140366; iTF_00141616; iTF_00142264; iTF_00200526; iTF_01089774; iTF_00010863; iTF_00881008; iTF_00879399; iTF_00880231; iTF_00882727; iTF_00881777; iTF_01389201; iTF_01390860; iTF_00684595; iTF_00216169; iTF_00228449; iTF_00226373; iTF_01066023; iTF_01066728; iTF_00223657; iTF_00220362; iTF_00214188; iTF_00218892; iTF_00221037; iTF_00222318; iTF_00229135; iTF_00232399; iTF_00221713; iTF_00224337; iTF_00227057; iTF_00229809; iTF_01069428; iTF_01065333; iTF_01068757; iTF_00214806; iTF_00216854; iTF_00218198; iTF_00233014; iTF_01539382; iTF_01070135; iTF_00675800; iTF_00230489; iTF_00231097; iTF_00225690; iTF_00227739; iTF_00219581; iTF_00222973; iTF_00225021; iTF_00217534; iTF_01067390; iTF_00215488; iTF_00231776; iTF_01068064; iTF_00733769; iTF_01420076; iTF_01417487; iTF_00982068; iTF_00982743; iTF_00873593; iTF_00683950; iTF_01122864; iTF_01122215; iTF_00306035; iTF_00117784; iTF_01497972; iTF_00756406; iTF_01168950; iTF_01393272; iTF_01424154; iTF_01392541; iTF_00633407; iTF_00391073;
- 90% Identity
- iTF_01255315; iTF_01453867;
- 80% Identity
- -