Fatl068532.1
Basic Information
- Insect
- Flexamia atlantica
- Gene Symbol
- -
- Assembly
- GCA_035578135.1
- Location
- JAQJVK010000018.1:11796946-11799945[-]
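The Location field packs a scaffold accession, a coordinate range, and a strand into one string. A minimal sketch of splitting it apart is shown here; the `Location` type and the `parse_location` helper are hypothetical names introduced for illustration, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for the fields of a Location string."""
    scaffold: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "SCAFFOLD:START-END[STRAND]".
_LOCATION_RE = re.compile(
    r"^(?P<scaffold>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a Location string into scaffold, start, end, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        scaffold=match["scaffold"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("JAQJVK010000018.1:11796946-11799945[-]")
# Assumption: coordinates are 1-based and inclusive.
span = loc.end - loc.start + 1
```

Under that coordinate assumption, the gene occupies `span` bases on the minus strand of the scaffold.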
Transcription Factor Domain
- TF Family
- Runt
- Domain
- Runt domain
- PFAM
- PF00853
- TF Group
- Beta-Scaffold Factors
- Description
- The AML1 gene is rearranged by the t(8;21) translocation in acute myeloid leukemia [1]. The gene is highly similar to the Drosophila melanogaster segmentation gene runt and to the mouse transcription factor PEBP2 alpha subunit gene [1]. The region of shared similarity, known as the Runt domain, is responsible for DNA binding and protein-protein interaction. In addition to the highly conserved Runt domain, the AML-1 gene product carries a putative ATP-binding site (GRSGRGKS) and has a C-terminal region rich in proline and serine residues. The protein (known as acute myeloid leukemia 1 protein, oncogene AML-1, core-binding factor (CBF), alpha-B subunit, etc.) binds to the core site, 5'-PyGPyGGT-3' (Py = pyrimidine), of a number of enhancers and promoters.
- Hmmscan Out
-
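| # | of | c-Evalue | i-Evalue | score | bias | hmm from | hmm to | ali from | ali to | env from | env to | acc |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 12 | 0.017 | 1.9e+02 | 3.9 | 0.0 | 34 | 69 | 87 | 122 | 81 | 130 | 0.86 |
| 2 | 12 | 0.0024 | 29 | 6.6 | 0.0 | 30 | 69 | 129 | 168 | 118 | 176 | 0.84 |
| 3 | 12 | 0.018 | 2.1e+02 | 3.8 | 0.0 | 25 | 68 | 166 | 213 | 162 | 222 | 0.73 |
| 4 | 12 | 0.0065 | 76 | 5.2 | 0.0 | 32 | 68 | 223 | 259 | 217 | 267 | 0.87 |
| 5 | 12 | 0.018 | 2.1e+02 | 3.8 | 0.0 | 24 | 68 | 257 | 305 | 253 | 311 | 0.73 |
| 6 | 12 | 0.0057 | 67 | 5.4 | 0.0 | 31 | 69 | 314 | 352 | 306 | 360 | 0.87 |
| 7 | 12 | 0.14 | 1.6e+03 | 0.9 | 0.0 | 32 | 68 | 361 | 399 | 352 | 405 | 0.76 |
| 8 | 12 | 0.083 | 9.7e+02 | 1.7 | 0.0 | 32 | 68 | 409 | 445 | 404 | 450 | 0.86 |
| 9 | 12 | 0.0047 | 56 | 5.7 | 0.0 | 30 | 68 | 453 | 491 | 449 | 500 | 0.89 |
| 10 | 12 | 0.0029 | 34 | 6.4 | 0.0 | 30 | 69 | 499 | 538 | 492 | 546 | 0.88 |
| 11 | 12 | 0.0014 | 17 | 7.4 | 0.0 | 30 | 71 | 545 | 586 | 538 | 594 | 0.87 |
| 12 | 12 | 0.11 | 1.3e+03 | 1.3 | 0.0 | 32 | 61 | 592 | 621 | 584 | 628 | 0.83 |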
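Each per-domain row above carries 13 whitespace-separated fields. The following is a rough sketch of loading such rows into typed records so they can be ranked or filtered, for example by independent E-value. `DomainHit` and `parse_domain_rows` are hypothetical names introduced here, the field names follow the column headers of this record, and only three of the twelve rows are embedded as sample input.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    """Hypothetical record for one per-domain row (13 fields)."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


FIELDS_PER_ROW = 13


def parse_domain_rows(text: str) -> List[DomainHit]:
    """Parse header-free domain rows; works on one row per line or a flat run."""
    tokens = text.split()
    if len(tokens) % FIELDS_PER_ROW != 0:
        raise ValueError("token count is not a multiple of 13")
    hits = []
    for i in range(0, len(tokens), FIELDS_PER_ROW):
        f = tokens[i:i + FIELDS_PER_ROW]
        hits.append(DomainHit(
            int(f[0]), int(f[1]),
            float(f[2]), float(f[3]), float(f[4]), float(f[5]),
            int(f[6]), int(f[7]), int(f[8]), int(f[9]),
            int(f[10]), int(f[11]),
            float(f[12]),
        ))
    return hits


# Rows 1, 2, and 11 copied from the record above.
sample = """
1 12 0.017 1.9e+02 3.9 0.0 34 69 87 122 81 130 0.86
2 12 0.0024 29 6.6 0.0 30 69 129 168 118 176 0.84
11 12 0.0014 17 7.4 0.0 30 71 545 586 538 594 0.87
"""
hits = parse_domain_rows(sample)
# Lower E-values indicate stronger matches, so the minimum is the best hit.
best = min(hits, key=lambda h: h.i_evalue)
first_ali_length = hits[0].ali_to - hits[0].ali_from + 1
```

Because the parser works on a flat token stream, it does not depend on how the rows are wrapped. It does expect the header line to be stripped first.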
Sequence Information
- Coding Sequence
- ATGTGGCGTAATGAGAGATCTCCCGGCAATTTCCAGACATTCATCTTTGCAAGGGATGTGGAGGTGGAAGGCGACCAGTCGTCGTACTCGGAGCTGCAACACTTGCTGTCGGATTCCATCGTCCTTTCCCACTTGATCGCTCGCCAGGTCTGCAGTGTGGCGCGGTTGTCGCGGAAGTCGCTGCAGTTAACAACTGTACGCGGAATTGTTGTCAACGGCGTTCAAGGCCAGGGCTACTGCGTCGCTGCAACTGCGCTCGCGCTGGAGGTGACATTTAGACGAGCGTTACACCGCGAGGTGGAGGACGTAGTCGCAGTCACTGTaatgtttgtaaacagcagCAGGGCTACTGCGTCGCTGCAACTGCACTCGCGTTGCCAACCTCGCCAACCTCGCACGCTGGAGGTGACATTTAGACGAGCGTTACACCGCGAGGTGGAGGACGTAGTCGCAGTCACTGTaatgtttgtaaacagcagCAGGGCTACTGCGTCGCTGCAACTGCACTCGCGTTGCCAACCTCGCCAACCTCGCACGCTGGAGGTGACATCTAGACGAGCGTTACACCGCGAGGTGGAGGACGTAGTCGCAGTCACTGTaatgtttgtaaacagcagCAGGGCTACTGCGTCGCTGCAACTGCGCTCGCGTTGCCAACCTCGCCAACCTCGCAAGCTGGAGGTGACATTTAGACGAGCGTTACACCACGAGGTGGAGGACGTAGTCGCAGTCACTGTaatgtttgtaaacagcagCAGGGCTACTGCGTCGCTGCAACTGCACTCGCGTTGCCAACCTCGCCAACCTCGCACGCTGGAGGTGACATCTAGACGAGCGTTACACCGCGAGGTGGAGGACGTAGTCGCAGTCACTGTaatgtttgtaaacagcagCAGGGCTACTGCGTCGCTGCAACTGCGCTCGCGTTGCCAACCTCGCCAACCTCGCAAGCTGGAGGTGACATTTAGACGAGCGTTACACCACGAGGTGGAGGACGTAGTCGCAGTCACTGTaatgtttgtaaacagcagCAGGGCTACTGCGTCGCTGCAACTGCACTCGCGTTGCCAACCTCGCCAACCTCGCAAGCTGGAGGTGACATTTAGACGAGCGTTACACCACGAGGTGGAGGACGTAGTCGTAGTCGCAGTCACTGTaatgtttgtaaacagcagCAGGGCTACTGCGTCGCTGCAACTGCGCTCGCGTTGCCAACCTCGCCAACCTCGCAAGCTGGAGGTGACATTTAGACGAGCGTTACACCGCGAGGCGGAGGACGTAGTCGCAGTCACTGTaatgtttgtaaacagcagCAGGGCTACTGCGTCGCTGCAACTGCGCTCGCGTTGCCAACCTCGCCAACCTCGCACGCTGGAGGTGACATTTAGACGAGCGTTACACCGCGAGGTGGAGGACGTAGTCGCAGTCACTGTaatgtttgtaaacagcagCAGGGCTACTGCGTCGCTGCAACTGCTCTCGCGTTGCCAGCCTCGCCAACCTCGCACGCTGGAGGTGACATTTAGACGAGCGTTACACCACGAGGTGGAGGACGTAGTCGCAGTCACTGTaatgtttgtaaacagcagCAGGGCTACTGCGTCGCTGCAACTGCACTCGCCTTGCCAACCTCGCCAACCTCGCAAGCTGGAGGTGACATTTAGACGAGCGTTACACCACGAGGTAGAGGACGTAGTCGCAGTCACTGTaatgtttgtaaacagcagCAGGGCTACTGCGTCGCTGCGACTGCACTCGCGTTGCCAACCTCGCCAACACAAGCTGGAGGTGACATTTAGACGAGCGTTACACCGTGAGGTGGAGGACGTAGTCGCAGTCACTGTaatgtttgtaaacagcagCAGGGCTCAAGGCAGCGGTTGCGACGTACACTGTGGAACGGTGTCGCACTGCACTCTAGTGCAGGTGTCGGTGCAGTCGGTCAGGTGCAAGTATTACATGTATGTAGGGGGCTTGTAA
- Protein Sequence
- MWRNERSPGNFQTFIFARDVEVEGDQSSYSELQHLLSDSIVLSHLIARQVCSVARLSRKSLQLTTVRGIVVNGVQGQGYCVAATALALEVTFRRALHREVEDVVAVTVMFVNSSRATASLQLHSRCQPRQPRTLEVTFRRALHREVEDVVAVTVMFVNSSRATASLQLHSRCQPRQPRTLEVTSRRALHREVEDVVAVTVMFVNSSRATASLQLRSRCQPRQPRKLEVTFRRALHHEVEDVVAVTVMFVNSSRATASLQLHSRCQPRQPRTLEVTSRRALHREVEDVVAVTVMFVNSSRATASLQLRSRCQPRQPRKLEVTFRRALHHEVEDVVAVTVMFVNSSRATASLQLHSRCQPRQPRKLEVTFRRALHHEVEDVVVVAVTVMFVNSSRATASLQLRSRCQPRQPRKLEVTFRRALHREAEDVVAVTVMFVNSSRATASLQLRSRCQPRQPRTLEVTFRRALHREVEDVVAVTVMFVNSSRATASLQLLSRCQPRQPRTLEVTFRRALHHEVEDVVAVTVMFVNSSRATASLQLHSPCQPRQPRKLEVTFRRALHHEVEDVVAVTVMFVNSSRATASLRLHSRCQPRQHKLEVTFRRALHREVEDVVAVTVMFVNSSRAQGSGCDVHCGTVSHCTLVQVSVQSVRCKYYMYVGGL
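The protein sequence corresponds to a codon-by-codon translation of the coding sequence. A minimal sketch of that translation follows, assuming the standard genetic code (the record does not say which table was used) and treating lower-case bases the same as upper-case ones. `translate` is a hypothetical helper, and only the first 60 nucleotides of the coding sequence are used as sample input.

```python
from itertools import product

# Standard genetic code (assumed), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = cds.upper()  # lower-case bases are translated like upper-case ones
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nucleotides of the coding sequence in this record.
cds_prefix = "ATGTGGCGTAATGAGAGATCTCCCGGCAATTTCCAGACATTCATCTTTGCAAGGGATGTG"
protein_prefix = translate(cds_prefix)
```

For this prefix the result matches the first 20 residues of the listed protein sequence (`MWRNERSPGNFQTFIFARDV`).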
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -