Basic Information

Gene Symbol
-
Assembly
GCA_963170105.1
Location
OY720629.1:38221357-38230170[-]
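
The location string packs the sequence accession, start, end, and strand into one field. A minimal Python sketch for splitting it is given below. It assumes the coordinates are 1-based and inclusive, which this record does not state; the function name `parse_location` is hypothetical.

```python
import re

# Pattern for strings shaped like "SEQID:START-END[STRAND]".
LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)

def parse_location(text):
    """Split a location string into its parts (hypothetical helper)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start = int(match.group("start"))
    end = int(match.group("end"))
    return {
        "seqid": match.group("seqid"),
        "start": start,
        "end": end,
        "strand": match.group("strand"),
        # Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
        "span": end - start + 1,
    }

location = parse_location("OY720629.1:38221357-38230170[-]")
print(location)
```

Under that assumption the gene spans 8814 bp on the minus strand of OY720629.1.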

Transcription Factor Domain

TF Family
P53
Domain
P53 domain
PFAM
PF00870
TF Group
Beta-Scaffold Factors
Description
P53 is a tumor suppressor gene product; mutations in p53 or lack of expression are associated with a large fraction of all human cancers. P53 is activated by DNA damage and acts as a regulator of gene expression that ultimately blocks progression through the cell cycle. P53 binds to DNA as a tetrameric transcription factor. In its inactive form, p53 is bound to the RING finger protein Mdm2, which promotes its ubiquitination and subsequent proteasomal degradation. Phosphorylation of p53 disrupts the Mdm2-p53 complex, while the stable and active p53 binds to regulatory regions of its target genes, such as the cyclin-kinase inhibitor p21, which binds and inactivates cdk2 and other cyclin complexes [PMID: 20066118, PMID: 12629332, PMID: 1397838, PMID: 6544917, PMID: 19826090, PMID: 19776744, PMID: 6278740, PMID: 221923, PMID: 6318442, PMID: 20030809]. This domain is found in p53 transcription factors, where it is responsible for DNA-binding. The DNA-binding domain acts to clamp, or in the case of TonEBP, encircle the DNA target in order to stabilise the protein-DNA complex [PMID: 11780147]. Protein interactions may also serve to stabilise the protein-DNA complex, for example in the STAT-1 dimer the SH2 (Src homology 2) domain in each monomer is coupled to the DNA-binding domain to increase stability [PMID: 9630226]. The DNA-binding domain consists of a beta-sandwich formed of 9 strands in 2 sheets with a Greek-key topology. This structure is found in many transcription factors, often within the DNA-binding domain.
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to  acc
1  3   0.6       1.9e+04   -3.9   0.0   17        36      35        54      28        61      0.79
2  3   4.6e-57   1.4e-52   179.1  0.0   2         192     159       348     158       349     0.95
3  3   0.73      2.3e+04   -4.2   6.3   66        124     631       688     596       738     0.77
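
Only one of the three domain hits above has a low E-value. The Python sketch below parses the three rows and keeps the hits under an i-Evalue cutoff. The 13-column order is taken from the header of this record. The `1e-5` cutoff and the names `DomainHit` and `parse_row` are illustrative assumptions, not values or code used by the database.

```python
from dataclasses import dataclass

@dataclass
class DomainHit:
    index: int       # "#": domain number
    total: int       # "of": number of domains reported
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float

def parse_row(line):
    """Parse one whitespace-separated domain row (hypothetical helper)."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]), float(f[2]), float(f[3]), float(f[4]),
        float(f[5]), int(f[6]), int(f[7]), int(f[8]), int(f[9]),
        int(f[10]), int(f[11]), float(f[12]),
    )

# The three rows exactly as listed in this record.
ROWS = [
    "1 3 0.6 1.9e+04 -3.9 0.0 17 36 35 54 28 61 0.79",
    "2 3 4.6e-57 1.4e-52 179.1 0.0 2 192 159 348 158 349 0.95",
    "3 3 0.73 2.3e+04 -4.2 6.3 66 124 631 688 596 738 0.77",
]

I_EVALUE_CUTOFF = 1e-5  # assumed threshold, chosen for illustration only

hits = [parse_row(row) for row in ROWS]
significant = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]

for h in significant:
    aligned_length = h.ali_to - h.ali_from + 1
    print(f"domain {h.index}: residues {h.ali_from}-{h.ali_to} "
          f"({aligned_length} aa), score {h.score}")
```

With this cutoff only domain 2 is retained, covering protein residues 159-348 in alignment coordinates.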

Sequence Information

Coding Sequence
ATGGCGGAAAAGGACTTTTTACTAACCCAGTCCCAAGAATTAACCCTTTTGATGGAACAGGACGTggGAACCTATATGGCGTTGTTTCACAATAATTCCATGGAACACGAACAACCAAACGACAACCAATTGATTTACTCCGAGGAATTTGGCAATTTGTTTAAAGGTGAAACCGATACACAGCTTTTGCAAAACGATCAAAACAATTCCCAGTTGTTACAACAGCAAGTGACACAATTGCTACCACAAGAAGTTACCCAATTGTATCAGACTACAAACGATTTttccaatcaaaataatttcaatagtCAGTTGATGTCCTATCATTCTTCTTCCCTTTTAAATCGCCCTTCTTCTGCTCATGACATTTCTCCTCCTAATGTAAATCATCTTCCCGTTAACATCAATTATCTCCCTCCTATAAATTGTTTGTCTCAACAGATTAACGAAGTAAAGATGCCACCCATTCTTAGTACGGAAGATTATCCAGGTCcttataattttgatgtttatttCAACCCCCTTAGTGTTAGAAATCCATGGTGGtaCTCTCCAACTTTGGATAAGCTTTTTATTACAATGGGTGATTGTTTTCCtgttgattttaaaacagTGGGTGATGCTAAGGATCTTTATATTAGAGCTACCACTTTATATGCTTCACCACAATTAAATCAGGAAATTGTTATGCGCTGTTGGGCTCATAGTTCCGCAGATGATACTTTAAATACAGATGTTGCTGAACACGTAATAAGTCACGTTTTACGATGTCAagattcaaattcaaattatcaagGTGATGTTAAGATAAAGCAACATTTGAGCGTTGTGGTACCTTTGGGATTACCTCAACCTGGTACAGATGTTGTTCGTCTAATATACTACTTTGTATGTAAAAATTCATGTCCTACTGGTATAAACCGTCGACCTACCTGTGTGATTTTTGCTTTGGAAGATAAATTTGGCATGGTGAAAGGACGAAGGCGACTAAATGTACGTGTTTGTTCTTGCCCCAAGCGGgacaaagaaaaagaagaaaaaaatgctaTAAATGGTACAGCACCTCAgggtaaaaaaagaaaaatcgaaaaggtTGAAAAGAAAGAATCCAACGAAGACGCTACTCCCGATAATACAGAAGGAGAGAAACTTAACATCaacttTGACGTCtatggaaaaaaatataagcttTTGGCTTTATATGCTGTTCATGATCGTCTTAGTGGTGAATTAATCAGGCATGGAcctaacgaaaatttaaaaaaatcaattgccACGATCGAGGAACAAATTAGTAAGCTCCTACATTTTTTGTCTTATCCACTGATATCGATGGTCTTACTTGCTCTTACCGGTTGCGTCATTTTTAAAAGCGTGGAAGCTCAGGTGACATTTTCGAAAGATTGGGGAGCCGGAAAGCGATCTACCGAGAATTTTGATTGCCATGCAGCAATTAAGTCGGCCACGGCTTTATGTCACTTACTAATGgtaACATTTGAGGACGATTTGCCCAACACGGTGTGCCTTACGTGTTCACATAGAATTGAAAGTGCTTACTTGCTCAAGTTGCAATGTCGAGAATCGGATTCCACATTGCGTCGATATTTGAATTCGGAATTTCGAtctgaaaattgtaattctgATGAAGTAAAGAATAACGTAAGCCACGGTTTGCTGTTGGAATTAACGAATGAGGAATCCGAAAATTTTACAGATACAAATAAGAGTAAGACGTTTGAATGCGACGAATGTAAACTTGTGTTtgataaacaaaaactattaaGATTACATATTCGTGATcacaagaaaaagaaaaggtATGCCtgcaaaatttgtaacaaGAATTTTGTAGAAGTTCTTCAGCTAAATCAACACATGAATGTAATGCACAACAGGGaagaatatgaaaataatacgCTACATATTTGTACGGTATGtgatgataaatttaatactaacGCAAGCTTAATTGAACACATGGTACGTAAGCATACCCAAAA
AGAAACTAAGGAATTAAGTGAACAAAACGCAACGCACAACGAAAGATTGATGTGTAATGTTTGTAATAAGACTTATTTAAAAGCCAGTAATTTGACCGCCCACATGGGAACTCATACGGGCGAAAAACCTTTCGAATGCCATATTTGTGGCAAGAGGTTTACTCAAGGACGCGCTCACGCTTGCCACATGCGAACACATTCTAAGGAAATCGAAAAACCTTACGGCTGTAAAATGTGCAACAAAGAATTTGCACGCGAAAGTCAATTAGCGGTCCATTCGAAAAAACATTTAGCTACAAAGTCATTTGTTTGCAGCATCTGTAATAAAAGTTGCTGTAATTCGGGTAACCTCAAGTCGCATATGCGTTTGCACACCGGAGATAAGCCATATTCTTGCAATATTTGTGGTCGAACGTTTGCTCAAAGTAATGCACACTCTTATCACATGAAAACTCATTCAGGTGAGAGACCTTTCGAATGCGACGTATGTCCGAAAGCTTTTACTACAAACGGCCAATTGATAAATCATCGCCGCTTACATACAGGCGAAAAACCGTTCGTGTGCGCTGTTTGCAATAAACGATTTACGCAAAAAGTTGCGCACACAATACACATGATGACTCACACGGGTAATAAGCCACACTTGTGCAGTATATGTGgcaaaaaatattcacaaaaCAGTCAATTGGTTGAACACATGCGCACTCACACGGGCGAAACTTTACAATGCAATGTCTGTGGGAAAGGTTACTCTACGGCTGGAAATTTAGGTGCTCACATGAAGACGCACACGGGTATAAAGACGCACGTGTGTCAAATTTGTCACAAGGGTTTTTATGTCGCTAACAAATTGGCAAAACATATGAGGACTCACACCGGAGAGAGGCCTTACGCTTGCACAATGTGTCCAAAACGTTTTACTAGTTCCGAAGTAATGAAAGTTCACAATAGAATTCACACAGGAGAAAGGCCATACCGTTGCAGTATTTGTGATAAAGGTTTCACAAATAATTCTCAACTCACCGTTCATCTCCGTTCTCACACTGGTCAGAGACCTTACGTATGTACAGTTTGTTTGCAAGGCTTTGCCACTTCTGCAAGTTTAAACTCTCACTCCAGTACTTGTACTAGGATGATCGAAACCGccatttattctaattttactCCAAATTAA
Protein Sequence
MAEKDFLLTQSQELTLLMEQDVGTYMALFHNNSMEHEQPNDNQLIYSEEFGNLFKGETDTQLLQNDQNNSQLLQQQVTQLLPQEVTQLYQTTNDFSNQNNFNSQLMSYHSSSLLNRPSSAHDISPPNVNHLPVNINYLPPINCLSQQINEVKMPPILSTEDYPGPYNFDVYFNPLSVRNPWWYSPTLDKLFITMGDCFPVDFKTVGDAKDLYIRATTLYASPQLNQEIVMRCWAHSSADDTLNTDVAEHVISHVLRCQDSNSNYQGDVKIKQHLSVVVPLGLPQPGTDVVRLIYYFVCKNSCPTGINRRPTCVIFALEDKFGMVKGRRRLNVRVCSCPKRDKEKEEKNAINGTAPQGKKRKIEKVEKKESNEDATPDNTEGEKLNINFDVYGKKYKLLALYAVHDRLSGELIRHGPNENLKKSIATIEEQISKLLHFLSYPLISMVLLALTGCVIFKSVEAQVTFSKDWGAGKRSTENFDCHAAIKSATALCHLLMVTFEDDLPNTVCLTCSHRIESAYLLKLQCRESDSTLRRYLNSEFRSENCNSDEVKNNVSHGLLLELTNEESENFTDTNKSKTFECDECKLVFDKQKLLRLHIRDHKKKKRYACKICNKNFVEVLQLNQHMNVMHNREEYENNTLHICTVCDDKFNTNASLIEHMVRKHTQKETKELSEQNATHNERLMCNVCNKTYLKASNLTAHMGTHTGEKPFECHICGKRFTQGRAHACHMRTHSKEIEKPYGCKMCNKEFARESQLAVHSKKHLATKSFVCSICNKSCCNSGNLKSHMRLHTGDKPYSCNICGRTFAQSNAHSYHMKTHSGERPFECDVCPKAFTTNGQLINHRRLHTGEKPFVCAVCNKRFTQKVAHTIHMMTHTGNKPHLCSICGKKYSQNSQLVEHMRTHTGETLQCNVCGKGYSTAGNLGAHMKTHTGIKTHVCQICHKGFYVANKLAKHMRTHTGERPYACTMCPKRFTSSEVMKVHNRIHTGERPYRCSICDKGFTNNSQLTVHLRSHTGQRPYVCTVCLQGFATSASLNSHSSTCTRMIETAIYSNFTPN
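
The coding sequence and the protein sequence can be checked against each other by translation. The Python sketch below assumes the standard genetic code, which this record does not state. It uppercases the input because the coding sequence contains lowercase stretches, and it is demonstrated only on short fragments copied from the sequences above. The function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (an assumption:
# the record does not say which translation table was used).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds, stop_at_stop=True):
    """Translate a coding sequence in frame 0 (hypothetical helper).

    Lowercase bases are treated the same as uppercase ones; codons with
    unexpected characters become 'X'. A trailing partial codon is ignored.
    """
    seq = cds.upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and stop_at_stop:
            break
        protein.append(aa)
    return "".join(protein)

# First 18 bases of the coding sequence in this record.
start_fragment = translate("ATGGCGGAAAAGGACTTT")
# Last 9 bases of the coding sequence, ending in the stop codon.
end_fragment = translate("CCAAATTAA")
print(start_fragment, end_fragment)
```

The two fragments give `MAEKDF` and `PN`, matching the first six and last two residues of the protein sequence above.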

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00270123;
90% Identity
iTF_00270123;
80% Identity
iTF_00270123;