Basic Information

Gene Symbol
lz
Assembly
GCA_963931855.1
Location
OZ007556.1:79087921-79159012[+]

Transcription Factor Domain

TF Family
Runt
Domain
Runt domain
PFAM
PF00853
TF Group
Beta-Scaffold Factors
Description
The AML1 gene is rearranged by the t(8;21) translocation in acute myeloid leukemia [1]. The gene is highly similar to the Drosophila melanogaster segmentation gene runt and to the mouse transcription factor PEBP2 alpha subunit gene [1]. The region of shared similarity, known as the Runt domain, is responsible for DNA-binding and protein-protein interaction.In addition to the highly-conserved Runt domain, the AML-1 gene product carries a putative ATP-binding site (GRSGRGKS), and has a C-terminal region rich in proline and serine residues. The protein (known as acute myeloid leukemia 1 protein, oncogene AML-1, core-binding factor (CBF), alpha-B subunit, etc.) binds to the core site, 5'-pygpyggt-3', of a number of enhancers and promoters.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 3 7.4e-47 8e-43 146.9 0.0 13 94 44 125 38 129 0.97
2 3 3.1 3.3e+04 -3.8 0.0 39 70 503 529 481 536 0.59
3 3 6.3e-18 6.8e-14 53.4 0.2 91 129 735 773 733 773 0.95

Sequence Information

Coding Sequence
ATGACCGGATTGCTGATCACACGACACGAAATTGATATTGATATGATACTGTTACTGATCCCTTGTGTTAGCTTATGTGGcttcattcattcatattttTCCCAAGCGTATCCTCCTGCCTGTCGTGATGttcgAACGGGTAGTCCGTATTTCCTCTGCTCAGCCCTGCCCACCCACTGGAGATCAAATAAAACTCTTCCGCTGGCCTTCAAAGTGGTCGCCCTAGTCGATGTCGGTGACGGCACAATAGTAACGATACGTGCTGGAAACGACGAAAACTGCTGTGCAGAATTAAGAAATTGCACAGCCGTGATGAAGAATCAAGTTGCCAAATTCAATGACCTGCGTTTCGTCGGACGAAGTGGCCGAGAGATATCTGGGAGGACTGAGCATTTCTGGAAGatgattcaatccatttttTACATTGATGGGTCAAAAGATCAGGATGGAAATACCATAGTGGAAACGCAGGCGATGTTAGCCATCTGGAGAGACTACTTTAACCAGTTGCTTAAGGGCGATGACAGTCAAAATCCCGCCCAAAATCAACTTGTACCAGTGAATCTAGATGACGTCCATGAGTTATATCCACCCGACCAAAATGAAATCAAAGTAGCCATCAACAGACTAAAGATGAACAAAGTAGCTGGATGCGATGGTCTACAAGCGGAGCTTTTTAAGCATGCTGGAGATGACCTGGTCAAGAGCATACACcaactcatctgcaaaatatggtcggaggaaagtatgcccgatgaatggaatctCAGTACCATCTTCCCGATCCACAAAAAAGGAGACCGCCTACTCTGCGCCAACTATAGAGGCATAAGTCTTTTAAGCATCGCCTATAAGATCCTGTCCAGCGTTTTATGTGAACGTCTCAAGCCCTTTGCTGATGATCTTATAGGTCCTTACCAATGCAGCTTCAGACCTGGAAAATCTACAActgaccaaatattcacattgcgCCAGATCCTGGAAAAAACCCAAGAATATAACATCGATACCCACCACCTATTTGTCGATTTTAAAGCAGCCTACGACAGTGTTAATAGAGCTGAACTTTATCGTACCATGTCCAACTTTGGAATCCCTGCTAAACTGATACGCCTTATCAGGATGACGTTGGAAAATGCTCGCTGTTCCGTCAAAGTTGGTAGGGACTTCACTGAATCCTTCGAAACCACCAGAGGGCTAAGACAAGGTGACGCACTGTCATGCAGCTGTTTTAACATTGTGCTTGAAAATGTCATACGAAACACCCATGCAAGCACATCAGGCACCATCTTCAATAAATCGACACAATTACTTGGGTTTGCGGACGACATTGACGTGATAGGAAGAACAAAGCGAGCAGTAACCGGTGCATTCACTACAATGGAGTCAGCAGCAAGCAGAATGGGACTGAAAATAAACGAGggcaagacgaagtatatgctttCATCAACAAAAGTCCAATCCCACCACCGACTGGGTCAGAACGTGACCAttgataggtataacttcgaggtAGTCAGTGAATTTGTCTACCTGGGGACGGCTGTAAACTCAGGCAACAACATCAGCGCCGAAATCAAACGCAGACTAACTCTTgctaatcgctgtttctttgggcTGAGTAAGCAATTGAGGAATAAAGCCCTATCGAGAAGAACCAAGGTGACACTCTATAAAACActaatcatcccggtcctgctatATGGTGCAGAAGCATGGACCCTCTCCCAATCGGATGAAAAATCACTTGGAGTCTTCGAGAGAAAAGTTCTGCGTACGATCTATGGTCCTGTCTGCGTAAATCGAGAAAACAACATTTGGCGAAGAAGATACAACGACGAACTATACGAACTGTACAGCGACATAAGCCTAGCCAGAAAAATTCGAATTCAACGACTAAGATGGCTGGgccatgtagagcgaatggacaatGACGCTCCAGCTCGTAAAGTCTTCAACGCCAAACCCGAGGGAGTCAGAAGCCAAGGTAGACCTCATCTTCGATGGAGCGACCAAGTAGAAGGAGACCTCCGACAACTTGGAGTGCGAAATTGGAGACGGCAGGCTAGCGATCGAGTCGGCTGGAGGGGCATACTAAGCAAGGCCCAGGAGCACCACGCTCTAGTGGCGCCatcCGTATTCCAGGCCGAACCTTTGGTGATTAAAGAAGTCCCGTCTTGGCTTAAGGAAAAGTTGATATCAATCAAAGgGAAGAGCTTTACGCTGACGATAACCGTGTCAACGAGCCCACCACAAGTAGCCACATATGGAAAAGCCATCAAAGTGACAGTTGACGGACCCAGGGAGCCAAGGTCTAAGACAAGTCCGCCCGGTGGTCATCAATATCGGGCTGTCGGTTTGGGTCAACGACCCTTTCTGGATCCTCCATTCTATAGTCATGTTCGTGAATTAGAGACATTAAGACGTAAGGCTCCTACTACTGGTTCATCGCTACCACCATTAGCCAGTAATCACTCATCCTCCAATAGTACAATGAATTCCCCCGACGGACAGGGCTACAAGCCTAATGCACCGCACATACAAGAAAGTAATTTGATGGGGGCTGCCGACTGGACGGGCTACACGCCGAGCACAACAACACCGGGTATGAGTTCTTATCATACATACCAGCAACTGCAGACGAATGCTTCAAGTGCAGCGGCTGCAGCGTATGGATATGATGGTCACACAGCAGGTAATCCTCCGCCAGGACCACCACCTCCACCGCCAGCACCGGATCATCACGGAAATTTCCATCTGCCAACTGTGTTGACAGACATGCAGCCCTTTTGTGCCAGCTCTGACTATCATCACACGGGAATCGCTCCAACTGCTGCGACAAGTGGAATCATGTCTCACACCCAGCCACCAATATGTAGTCCGACTTATGGGTCCACAAAGCCAGAAATTGATTCTCTCAATCCCTCGTACCCATCCTACAACAATTGGTCCAATGGATACAATAACTACCAATACGGAAGCTGTGCCGCACAACCTCAGTATCCAGGGCATACGGCACCAACAATGGTCCTGTATCCTCAACTCTACTCAACTGTCAACCAAAACCAAATCCACCTGCATTTACACGGCACCGATAAGCTTGAACAGTATCTTGGCCCTGAGAATTCCCTCACCATAAGTTCCATCGGTGGTAGTCGATCTGGCATCGAGATTGGAATCGGTACATCGGATCATGAGTCAGCGATGTTGAGCAACGAAGGAGCCGATACACAACAACACCCGCAACACGTACAAGTACAACAGGAGACTGAAGTCCCACAGGACCATCGCACCGACGAGGATGTCGTGGATCCGGGAAGTGTGTGGCGACCTTACTGA
Protein Sequence
MTGLLITRHEIDIDMILLLIPCVSLCGFIHSYFSQAYPPACRDVRTGSPYFLCSALPTHWRSNKTLPLAFKVVALVDVGDGTIVTIRAGNDENCCAELRNCTAVMKNQVAKFNDLRFVGRSGREISGRTEHFWKMIQSIFYIDGSKDQDGNTIVETQAMLAIWRDYFNQLLKGDDSQNPAQNQLVPVNLDDVHELYPPDQNEIKVAINRLKMNKVAGCDGLQAELFKHAGDDLVKSIHQLICKIWSEESMPDEWNLSTIFPIHKKGDRLLCANYRGISLLSIAYKILSSVLCERLKPFADDLIGPYQCSFRPGKSTTDQIFTLRQILEKTQEYNIDTHHLFVDFKAAYDSVNRAELYRTMSNFGIPAKLIRLIRMTLENARCSVKVGRDFTESFETTRGLRQGDALSCSCFNIVLENVIRNTHASTSGTIFNKSTQLLGFADDIDVIGRTKRAVTGAFTTMESAASRMGLKINEGKTKYMLSSTKVQSHHRLGQNVTIDRYNFEVVSEFVYLGTAVNSGNNISAEIKRRLTLANRCFFGLSKQLRNKALSRRTKVTLYKTLIIPVLLYGAEAWTLSQSDEKSLGVFERKVLRTIYGPVCVNRENNIWRRRYNDELYELYSDISLARKIRIQRLRWLGHVERMDNDAPARKVFNAKPEGVRSQGRPHLRWSDQVEGDLRQLGVRNWRRQASDRVGWRGILSKAQEHHALVAPSVFQAEPLVIKEVPSWLKEKLISIKGKSFTLTITVSTSPPQVATYGKAIKVTVDGPREPRSKTSPPGGHQYRAVGLGQRPFLDPPFYSHVRELETLRRKAPTTGSSLPPLASNHSSSNSTMNSPDGQGYKPNAPHIQESNLMGAADWTGYTPSTTTPGMSSYHTYQQLQTNASSAAAAAYGYDGHTAGNPPPGPPPPPPAPDHHGNFHLPTVLTDMQPFCASSDYHHTGIAPTAATSGIMSHTQPPICSPTYGSTKPEIDSLNPSYPSYNNWSNGYNNYQYGSCAAQPQYPGHTAPTMVLYPQLYSTVNQNQIHLHLHGTDKLEQYLGPENSLTISSIGGSRSGIEIGIGTSDHESAMLSNEGADTQQHPQHVQVQQETEVPQDHRTDEDVVDPGSVWRPY

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-