Basic Information

Gene Symbol
-
Assembly
GCA_035578135.1
Location
JAQJVK010000018.1:11796946-11799945[-]
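
The location string above packs a contig accession, a coordinate range, and a strand into one field. A minimal sketch of how it might be split apart is shown here; the `Location` type and `parse_location` helper are hypothetical names, and the span length assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for the fields of a location string."""
    contig: str
    start: int
    end: int
    strand: str


# Pattern for "<contig>:<start>-<end>[<strand>]", e.g. "chr1:100-200[+]".
LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end, and strand."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        contig=match["contig"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("JAQJVK010000018.1:11796946-11799945[-]")
# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = loc.end - loc.start + 1
print(loc.contig, loc.strand, span)
```

The `[-]` suffix indicates the minus strand, so a sequence extracted from the contig at these coordinates would need to be reverse-complemented before comparison with the coding sequence.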

Transcription Factor Domain

TF Family
Runt
Domain
Runt domain
PFAM
PF00853
TF Group
Beta-Scaffold Factors
Description
The AML1 gene is rearranged by the t(8;21) translocation in acute myeloid leukemia [1]. The gene is highly similar to the Drosophila melanogaster segmentation gene runt and to the mouse transcription factor PEBP2 alpha subunit gene [1]. The region of shared similarity, known as the Runt domain, is responsible for DNA binding and protein-protein interaction. In addition to the highly conserved Runt domain, the AML1 gene product carries a putative ATP-binding site (GRSGRGKS) and has a C-terminal region rich in proline and serine residues. The protein (known as acute myeloid leukemia 1 protein, oncogene AML-1, and core-binding factor (CBF) alpha-B subunit, among other names) binds to the core site, 5'-pygpyggt-3', of a number of enhancers and promoters.
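
The core site quoted in the description can be turned into a simple sequence scan. The sketch below assumes that "py" in 5'-pygpyggt-3' stands for a pyrimidine (C or T), giving the pattern [CT]G[CT]GGT; the function name `find_core_sites` is hypothetical, and only the strand as given is scanned.

```python
import re

# Assumption: "py" denotes a pyrimidine (C or T), so the core site
# 5'-pygpyggt-3' becomes [CT]G[CT]GGT. The lookahead allows overlapping hits.
CORE_SITE = re.compile(r"(?=([CT]G[CT]GGT))")


def find_core_sites(seq: str) -> list[tuple[int, str]]:
    """Return (0-based start, matched hexamer) for each core-site match.

    Only the strand as given is scanned; reverse-complement the input
    separately if the opposite strand is also of interest.
    """
    upper = seq.upper()
    return [(m.start(1), m.group(1)) for m in CORE_SITE.finditer(upper)]


print(find_core_sites("aaTGTGGTcc"))
```

A match to a six-base pattern is expected by chance fairly often in long sequences, so hits from a scan like this are candidates rather than evidence of binding.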
Hmmscan Out
#   of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
1   12     0.017   1.9e+02    3.9   0.0        34      69        87     122        81     130  0.86
2   12    0.0024        29    6.6   0.0        30      69       129     168       118     176  0.84
3   12     0.018   2.1e+02    3.8   0.0        25      68       166     213       162     222  0.73
4   12    0.0065        76    5.2   0.0        32      68       223     259       217     267  0.87
5   12     0.018   2.1e+02    3.8   0.0        24      68       257     305       253     311  0.73
6   12    0.0057        67    5.4   0.0        31      69       314     352       306     360  0.87
7   12      0.14   1.6e+03    0.9   0.0        32      68       361     399       352     405  0.76
8   12     0.083   9.7e+02    1.7   0.0        32      68       409     445       404     450  0.86
9   12    0.0047        56    5.7   0.0        30      68       453     491       449     500  0.89
10  12    0.0029        34    6.4   0.0        30      69       499     538       492     546  0.88
11  12    0.0014        17    7.4   0.0        30      71       545     586       538     594  0.87
12  12      0.11   1.3e+03    1.3   0.0        32      61       592     621       584     628  0.83
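
The per-domain rows above can be read programmatically to rank hits or apply a cutoff. The following is a minimal sketch that parses the twelve rows as whitespace-separated values; the field names, the `parse_rows` helper, and the 0.01 i-Evalue cutoff are illustrative assumptions, not settings taken from this record.

```python
# Field names are hypothetical labels for the thirteen columns shown above.
FIELDS = [
    "index", "of", "c_evalue", "i_evalue", "score", "bias",
    "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to", "acc",
]
INT_FIELDS = {
    "index", "of", "hmm_from", "hmm_to",
    "ali_from", "ali_to", "env_from", "env_to",
}

ROWS = """
1 12 0.017 1.9e+02 3.9 0.0 34 69 87 122 81 130 0.86
2 12 0.0024 29 6.6 0.0 30 69 129 168 118 176 0.84
3 12 0.018 2.1e+02 3.8 0.0 25 68 166 213 162 222 0.73
4 12 0.0065 76 5.2 0.0 32 68 223 259 217 267 0.87
5 12 0.018 2.1e+02 3.8 0.0 24 68 257 305 253 311 0.73
6 12 0.0057 67 5.4 0.0 31 69 314 352 306 360 0.87
7 12 0.14 1.6e+03 0.9 0.0 32 68 361 399 352 405 0.76
8 12 0.083 9.7e+02 1.7 0.0 32 68 409 445 404 450 0.86
9 12 0.0047 56 5.7 0.0 30 68 453 491 449 500 0.89
10 12 0.0029 34 6.4 0.0 30 69 499 538 492 546 0.88
11 12 0.0014 17 7.4 0.0 30 71 545 586 538 594 0.87
12 12 0.11 1.3e+03 1.3 0.0 32 61 592 621 584 628 0.83
"""


def parse_rows(text: str) -> list[dict]:
    """Parse whitespace-separated domain rows into dictionaries."""
    hits = []
    for line in text.strip().splitlines():
        values = line.split()
        if len(values) != len(FIELDS):
            raise ValueError(f"expected {len(FIELDS)} columns: {line!r}")
        hits.append({
            name: int(value) if name in INT_FIELDS else float(value)
            for name, value in zip(FIELDS, values)
        })
    return hits


hits = parse_rows(ROWS)

# Hit with the smallest independent E-value.
best = min(hits, key=lambda h: h["i_evalue"])

# Assumption: 0.01 is an illustrative cutoff, not one used by this database.
I_EVALUE_CUTOFF = 0.01
passing = [h for h in hits if h["i_evalue"] <= I_EVALUE_CUTOFF]

print(best["index"], best["i_evalue"], len(passing))
```

In this table every independent E-value is 17 or larger and no bit score exceeds 7.4, so none of the twelve hits would pass a cutoff of that kind on its own.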

Sequence Information

Coding Sequence
ATGTGGCGTAATGAGAGATCTCCCGGCAATTTCCAGACATTCATCTTTGCAAGGGATGTGGAGGTGGAAGGCGACCAGTCGTCGTACTCGGAGCTGCAACACTTGCTGTCGGATTCCATCGTCCTTTCCCACTTGATCGCTCGCCAGGTCTGCAGTGTGGCGCGGTTGTCGCGGAAGTCGCTGCAGTTAACAACTGTACGCGGAATTGTTGTCAACGGCGTTCAAGGCCAGGGCTACTGCGTCGCTGCAACTGCGCTCGCGCTGGAGGTGACATTTAGACGAGCGTTACACCGCGAGGTGGAGGACGTAGTCGCAGTCACTGTaatgtttgtaaacagcagCAGGGCTACTGCGTCGCTGCAACTGCACTCGCGTTGCCAACCTCGCCAACCTCGCACGCTGGAGGTGACATTTAGACGAGCGTTACACCGCGAGGTGGAGGACGTAGTCGCAGTCACTGTaatgtttgtaaacagcagCAGGGCTACTGCGTCGCTGCAACTGCACTCGCGTTGCCAACCTCGCCAACCTCGCACGCTGGAGGTGACATCTAGACGAGCGTTACACCGCGAGGTGGAGGACGTAGTCGCAGTCACTGTaatgtttgtaaacagcagCAGGGCTACTGCGTCGCTGCAACTGCGCTCGCGTTGCCAACCTCGCCAACCTCGCAAGCTGGAGGTGACATTTAGACGAGCGTTACACCACGAGGTGGAGGACGTAGTCGCAGTCACTGTaatgtttgtaaacagcagCAGGGCTACTGCGTCGCTGCAACTGCACTCGCGTTGCCAACCTCGCCAACCTCGCACGCTGGAGGTGACATCTAGACGAGCGTTACACCGCGAGGTGGAGGACGTAGTCGCAGTCACTGTaatgtttgtaaacagcagCAGGGCTACTGCGTCGCTGCAACTGCGCTCGCGTTGCCAACCTCGCCAACCTCGCAAGCTGGAGGTGACATTTAGACGAGCGTTACACCACGAGGTGGAGGACGTAGTCGCAGTCACTGTaatgtttgtaaacagcagCAGGGCTACTGCGTCGCTGCAACTGCACTCGCGTTGCCAACCTCGCCAACCTCGCAAGCTGGAGGTGACATTTAGACGAGCGTTACACCACGAGGTGGAGGACGTAGTCGTAGTCGCAGTCACTGTaatgtttgtaaacagcagCAGGGCTACTGCGTCGCTGCAACTGCGCTCGCGTTGCCAACCTCGCCAACCTCGCAAGCTGGAGGTGACATTTAGACGAGCGTTACACCGCGAGGCGGAGGACGTAGTCGCAGTCACTGTaatgtttgtaaacagcagCAGGGCTACTGCGTCGCTGCAACTGCGCTCGCGTTGCCAACCTCGCCAACCTCGCACGCTGGAGGTGACATTTAGACGAGCGTTACACCGCGAGGTGGAGGACGTAGTCGCAGTCACTGTaatgtttgtaaacagcagCAGGGCTACTGCGTCGCTGCAACTGCTCTCGCGTTGCCAGCCTCGCCAACCTCGCACGCTGGAGGTGACATTTAGACGAGCGTTACACCACGAGGTGGAGGACGTAGTCGCAGTCACTGTaatgtttgtaaacagcagCAGGGCTACTGCGTCGCTGCAACTGCACTCGCCTTGCCAACCTCGCCAACCTCGCAAGCTGGAGGTGACATTTAGACGAGCGTTACACCACGAGGTAGAGGACGTAGTCGCAGTCACTGTaatgtttgtaaacagcagCAGGGCTACTGCGTCGCTGCGACTGCACTCGCGTTGCCAACCTCGCCAACACAAGCTGGAGGTGACATTTAGACGAGCGTTACACCGTGAGGTGGAGGACGTAGTCGCAGTCACTGTaatgtttgtaaacagcagCAGGGCTCAAGGCAGCGGTTGCGACGTACACTGTGGAACGGTGTCGCACTGCACTCTAGTGCAGGTGTCGGTGCAGTCGGTCAGGTGCAAGTATTACATGTATGTAGGGGGCTTGTAA
Protein Sequence
MWRNERSPGNFQTFIFARDVEVEGDQSSYSELQHLLSDSIVLSHLIARQVCSVARLSRKSLQLTTVRGIVVNGVQGQGYCVAATALALEVTFRRALHREVEDVVAVTVMFVNSSRATASLQLHSRCQPRQPRTLEVTFRRALHREVEDVVAVTVMFVNSSRATASLQLHSRCQPRQPRTLEVTSRRALHREVEDVVAVTVMFVNSSRATASLQLRSRCQPRQPRKLEVTFRRALHHEVEDVVAVTVMFVNSSRATASLQLHSRCQPRQPRTLEVTSRRALHREVEDVVAVTVMFVNSSRATASLQLRSRCQPRQPRKLEVTFRRALHHEVEDVVAVTVMFVNSSRATASLQLHSRCQPRQPRKLEVTFRRALHHEVEDVVVVAVTVMFVNSSRATASLQLRSRCQPRQPRKLEVTFRRALHREAEDVVAVTVMFVNSSRATASLQLRSRCQPRQPRTLEVTFRRALHREVEDVVAVTVMFVNSSRATASLQLLSRCQPRQPRTLEVTFRRALHHEVEDVVAVTVMFVNSSRATASLQLHSPCQPRQPRKLEVTFRRALHHEVEDVVAVTVMFVNSSRATASLRLHSRCQPRQHKLEVTFRRALHREVEDVVAVTVMFVNSSRAQGSGCDVHCGTVSHCTLVQVSVQSVRCKYYMYVGGL
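
The relationship between the coding sequence and the protein sequence above can be checked by translation. The sketch below assumes the standard genetic code; the `translate` helper is a hypothetical name, and only the first 27 and last 21 nucleotides of the coding sequence are used so the example stays short. Lowercase bases are upper-cased before lookup, since the coding sequence mixes both cases.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = cds.upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 27 and last 21 nucleotides of the coding sequence in this record.
CDS_PREFIX = "ATGTGGCGTAATGAGAGATCTCCCGGC"
CDS_SUFFIX = "ATGTATGTAGGGGGCTTGTAA"

print(translate(CDS_PREFIX))
print(translate(CDS_SUFFIX))
```

Passing the full coding sequence to the same function and comparing the result with the protein sequence field is a quick consistency check for a record like this one.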

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-
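
The identity tiers above correspond to clustering runs at different minimum sequence identities. As a rough sketch, the commands for such runs might be assembled as follows, using the MMseqs2 `easy-cluster` workflow and its `--min-seq-id` option. The file names, the `easy_cluster_command` helper, and the choice of options are assumptions; the parameters actually used to build this record are not stated.

```python
import shlex


def easy_cluster_command(
    fasta: str, out_prefix: str, tmp_dir: str, min_seq_id: float
) -> list[str]:
    """Build an `mmseqs easy-cluster` argument list for one identity threshold."""
    if not 0.0 <= min_seq_id <= 1.0:
        raise ValueError("min_seq_id must be a fraction between 0 and 1")
    return [
        "mmseqs", "easy-cluster",
        fasta, out_prefix, tmp_dir,
        "--min-seq-id", f"{min_seq_id:.2f}",
    ]


# Hypothetical input and output names, one run per identity tier.
for percent in (100, 90, 80):
    command = easy_cluster_command(
        "tf_proteins.fasta", f"clusters_{percent}", "tmp", percent / 100
    )
    print(shlex.join(command))
```

Each argument list can be passed to `subprocess.run(command, check=True)` on a machine where MMseqs2 is installed. A "-" in a tier, as in this record, means no other sequence was grouped with this protein at that threshold.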