Basic Information

Gene Symbol
-
Assembly
GCA_951640565.1
Location
OX621218.1:2954638-2975609[+]

Transcription Factor Domain

TF Family
P53
Domain
P53 domain
PFAM
PF00870
TF Group
Beta-Scaffold Factors
Description
P53 is a tumor suppressor gene product; mutations in p53 or lack of expression are found associated with a large fraction of all human cancers. P53 is activated by DNA damage and acts as a regulator of gene expression that ultimatively blocks progression through the cell cycle. P53 binds to DNA as a tetrameric transcription factor. In its inactive form, p53 is bound to the ring finger protein Mdm2, which promotes its ubiquitinylation and subsequent proteosomal degradation. Phosphorylation of p53 disrupts the Mdm2-p53 complex, while the stable and active p53 binds to regulatory regions of its target genes, such as the cyclin-kinase inhibitor p21, which complexes and inactivates cdk2 and other cyclin complexes [PMID: 20066118, PMID: 12629332, PMID: 1397838, PMID: 6544917, PMID: 19826090, PMID: 19776744, PMID: 6278740, PMID: 221923, PMID: 6318442, PMID: 20030809].This domain is found in p53 transcription factors, where it is responsible for DNA-binding. The DNA-binding domain acts to clamp, or in the case of TonEBP, encircle the DNA target in order to stabilise the protein-DNA complex [PMID: 11780147]. Protein interactions may also serve to stabilise the protein-DNA complex, for example in the STAT-1 dimer the SH2 (Src homology 2) domain in each monomer is coupled to the DNA-binding domain to increase stability [PMID: 9630226]. The DNA-binding domain consists of a beta-sandwich formed of 9 strands in 2 sheets with a Greek-key topology. This structure is found in many transcription factors, often within the DNA-binding domain.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 8 1.8e-17 2.9e-13 50.9 0.0 5 88 69 149 66 167 0.92
2 8 8e-07 0.013 16.2 1.1 50 91 177 221 171 260 0.81
3 8 9.9e-07 0.016 15.9 1.3 50 91 267 311 260 333 0.81
4 8 3.3e-07 0.0054 17.4 1.6 50 91 359 403 351 438 0.82
5 8 4.4e-07 0.0072 17.0 2.3 50 95 449 495 442 557 0.80
6 8 8.2e-07 0.013 16.2 2.8 46 88 576 618 553 624 0.86
7 8 5.8e-07 0.0095 16.7 1.7 47 88 618 659 615 661 0.87
8 8 8.1e-33 1.3e-28 101.0 0.1 67 192 655 785 653 786 0.85

Sequence Information

Coding Sequence
ATGATGAACCCGCAAGAATATACCATAATGTATATTGAAGATGAAACTGGCCTGGACCTCAATGACGATGGCGGGCACATCAACTTAGACGCGGTACTTCCCCTGCCAGACTCACTTCCGGATTTGAATTTCGATGGAAAGTTTGAATTGTGTGAGCCTTCTGGATCAGTGATGCCGAGGCGCGGCCCGCCGGACCGGGTGCCTTTCCCCGGGAAATACGATTTCCACGTAGAAATCAATAGTACCCATACTAATACGAAGAAATATGTGTTGTCTCACAAGCTGAACCGTATTTACGTGGAACTGGGAATGGACTTCCCGGTGCAGTTCCAGTGGAGCGGGCCCGCGGACATGTTCGTGCGCGCCGCCTGCGTGTTCGCGGACGCGGCGCAGGCGCAGAAGCGCGTCGAGCGCTGCATACAGCACTACCACACGCACCCGCCGAACGCACCCGCagGCGCAGAAGCGCGTCGAGCGCTGCATACAGCACTACCACACGCACCCGCCGAACGCACCCGCaggtatacacacacacacacacccgcgGACATGTTCGTGCGCGCCGCCTGCGTGTTCGCGGACGCGGCGCAGGCGCAGAAGCGCGTCGAGCGCTGCATACTGCACTACCACACGCACCCGCCGAACGCACCCGCaggtatacacacacacacacacacccgcgGACATGTTCGTGCGCGCCGCCTGCGTGTTCGCGGACGCGGCGCAGGCGCAGAAGCGCGTCGAGCGCTGCATACAGCACTACCACACGCACCCGCCGAACGCACCCGCaggtatacacacacacacacacccgcgGACATGTTCGTGCGCGCCGCCTGCGTGTTCGCGGACGCGGCGCAGGCGCAGAAGCGCGTCGAGCGCTGCATACTGCACTACCACACGCACCCGCCGAACGCACCCGCaggtatacacacacacacacacacacacacccgcgGACATGTTCGTGCGCGCCGCCTGCGTGTTCGCGGACGCGGCGCAGGCGCAGAAGCGCGTCGAGCGCTGCATACAGCACTACCACACGCACCCGCCGAACGCACCCGCaggtatacacacacacacacacccgcgGACATGTTCGTGCGCGCCGCCTGCGTGTTCGCGGACGCGGCGCAGGCGCAGAAGCGCGTCGAGCGCTGCATACAGCACTACCACACGCACCCGCCGAACGCACCCGCaggtatacacacacacacacacacccgcgGACATGTTCGTGCGCGCCGCCTGCGTGTTCGCGGACGCGGCGCAGGCGCAGAAGCGCGTCGAGCGCTGCATACAGCACTACCACACGCACCCGCCGAACGCACCCGCaggtatacacacacacacacacccgcgGACATGTTCGTGCGCGCCGCCTGCGTGTTCGCGGACGCGGCGCAGGCGCAGAAGCGCGTCGAGCGCTGCATACAGCACTACCACACGCACCCGCCGAACGCACCCGCaggtatacacacacacacacacacccgcgGACATGTTCGTGCGCGCCGCCTGCGTGTTCGCGGACGCGGCGCAGGCGCAGAAGCGCGTCGAGCGCTGCATACAGCACTACCACACGCACCCGCaggtatacacacacacacacacccgcgGACATGTTCGTGCGCGCCGCCTGCGTGTTCGCGGACGCGGCGCAGGCGCAGAAGCGCGTCGAGCGCTGCATACAGCACTACCACACGCACCCGCCGAACGCACCCGCaggtatacacacacacacacacacccgcgGACATGTTCGTGCGCGCCGCCTGCGTGTTCGCGGACGCGGCGCAGGCGCAGAAGCGCGTCGAGCGCTGCATACAGCACTACCACACGCACCCGCaggtatacacacacacacacacacccgcgGACATGTTCGTGCGCGCCGCCTGCGTGTTCGCGGACGCGGCGCAGGCGCAGAAGCGCGTCGAGCGCTGCATACAGCACTACCACACGCACCCGCagGCGCAGAAGCGCGTCGAGCGCTGCATACTGCACTACCACACGCACCCGCCGAACGCACCCGCagACCCGGCCGTGAAAGCGTTGAACGTGCTCCACTCGTCGCGGCCGCGCGACACGGCCGACGTGCACTACGAGGGCGATGCGTCCGCGCAGGACTCGTGGCTCTCCGTGCGCGTCGCGCTCCACAAGGCGGGCTTGGAACCCGCGGCGCACGCTTACCGCTTCGCTTGCAAGAACTCCTGCACCAGCGGCATCAATCGTCGAGCTATACTGATTATATTCACGCTGGAGGATCAAAaTGGCATAATCCACGGGCGACAACTAGTGGGTGCCCGTGTCTGCGCCTGCCCACGTCGAGATATGCTCAAGGACGAGGAGAGCGAGGCAGCCCCGCGAGCCAAGAAGCGTCGCTTGCCCCCGCCGCCTCCGCTGCCGGCCAGAGTCAAAAACGAGGAGATGTCGCCTGACAATGCTGTCATAGATCTGTCGTTACTTCACCCGCAAGCAGCCGTCACCGGTCTCGAGGTCATGATCAAGATGGAGGAACAGGCGCCGGAACCTCGCCGATCTGACGTCATCGCCAGGCTTAAACAAATACAGGAAGAACTCAAAAGGAAGCACAACATACAGTGA
Protein Sequence
MMNPQEYTIMYIEDETGLDLNDDGGHINLDAVLPLPDSLPDLNFDGKFELCEPSGSVMPRRGPPDRVPFPGKYDFHVEINSTHTNTKKYVLSHKLNRIYVELGMDFPVQFQWSGPADMFVRAACVFADAAQAQKRVERCIQHYHTHPPNAPAGAEARRALHTALPHAPAERTRRYTHTHTPADMFVRAACVFADAAQAQKRVERCILHYHTHPPNAPAGIHTHTHTRGHVRARRLRVRGRGAGAEARRALHTALPHAPAERTRRYTHTHTPADMFVRAACVFADAAQAQKRVERCILHYHTHPPNAPAGIHTHTHTHTRGHVRARRLRVRGRGAGAEARRALHTALPHAPAERTRRYTHTHTPADMFVRAACVFADAAQAQKRVERCIQHYHTHPPNAPAGIHTHTHTRGHVRARRLRVRGRGAGAEARRALHTALPHAPAERTRRYTHTHTPADMFVRAACVFADAAQAQKRVERCIQHYHTHPPNAPAGIHTHTHTRGHVRARRLRVRGRGAGAEARRALHTALPHAPAGIHTHTHPRTCSCAPPACSRTRRRRRSASSAAYSTTTRTRRTHPQVYTHTHTPADMFVRAACVFADAAQAQKRVERCIQHYHTHPQVYTHTHTPADMFVRAACVFADAAQAQKRVERCIQHYHTHPQAQKRVERCILHYHTHPPNAPADPAVKALNVLHSSRPRDTADVHYEGDASAQDSWLSVRVALHKAGLEPAAHAYRFACKNSCTSGINRRAILIIFTLEDQNGIIHGRQLVGARVCACPRRDMLKDEESEAAPRAKKRRLPPPPPLPARVKNEEMSPDNAVIDLSLLHPQAAVTGLEVMIKMEEQAPEPRRSDVIARLKQIQEELKRKHNIQ

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-