Basic Information

Gene Symbol
-
Assembly
GCA_932294385.1
Location
CAKOAM010000025.1:4336349-4349989[-]
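
The Location field packs contig accession, coordinates, and strand into one string. The sketch below shows one way to split it into parts. The names `Location`, `parse_location`, and `span_length` are hypothetical and not part of this database. The coordinate convention (1-based, inclusive) is an assumption, since the record does not state it.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for the parts of a Location string."""
    contig: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "<contig>:<start>-<end>[<strand>]".
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:\s]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a Location string into contig, start, end, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        contig=match["contig"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


def span_length(loc: Location) -> int:
    """Genomic span in bp, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("CAKOAM010000025.1:4336349-4349989[-]")
print(loc, span_length(loc))
```

Under the inclusive-coordinate assumption, the span counts every base from start to end on the contig, including any introns, so it is expected to exceed the coding sequence length.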

Transcription Factor Domain

TF Family
P53
Domain
P53 domain
PFAM
PF00870
TF Group
Beta-Scaffold Factors
Description
P53 is a tumor suppressor gene product; mutations in p53 or lack of its expression are associated with a large fraction of all human cancers. P53 is activated by DNA damage and acts as a regulator of gene expression that ultimately blocks progression through the cell cycle. P53 binds to DNA as a tetrameric transcription factor. In its inactive form, p53 is bound to the RING finger protein Mdm2, which promotes its ubiquitination and subsequent proteasomal degradation. Phosphorylation of p53 disrupts the Mdm2-p53 complex, and the stable, active p53 then binds to regulatory regions of its target genes, such as the cyclin-dependent kinase inhibitor p21, which complexes with and inactivates cdk2 and other cyclin complexes [PMID: 20066118, PMID: 12629332, PMID: 1397838, PMID: 6544917, PMID: 19826090, PMID: 19776744, PMID: 6278740, PMID: 221923, PMID: 6318442, PMID: 20030809]. This domain is found in p53 transcription factors, where it is responsible for DNA-binding. The DNA-binding domain acts to clamp, or in the case of TonEBP, encircle the DNA target in order to stabilise the protein-DNA complex [PMID: 11780147]. Protein interactions may also serve to stabilise the protein-DNA complex, for example in the STAT-1 dimer the SH2 (Src homology 2) domain in each monomer is coupled to the DNA-binding domain to increase stability [PMID: 9630226]. The DNA-binding domain consists of a beta-sandwich formed of 9 strands in 2 sheets with a Greek-key topology. This structure is found in many transcription factors, often within the DNA-binding domain.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 32 4.2e-49 1.1e-44 153.1 0.0 3 191 82 273 80 275 0.91
2 32 0.12 3e+03 -1.7 0.4 181 191 339 349 338 351 0.90
3 32 0.12 3e+03 -1.7 0.4 181 191 377 387 376 389 0.90
4 32 0.12 3e+03 -1.7 0.4 181 191 415 425 414 427 0.90
5 32 0.12 3e+03 -1.7 0.4 181 191 453 463 452 465 0.90
6 32 0.12 3e+03 -1.7 0.4 181 191 491 501 490 503 0.90
7 32 0.12 3e+03 -1.7 0.4 181 191 529 539 528 541 0.90
8 32 0.12 3e+03 -1.7 0.4 181 191 567 577 566 579 0.90
9 32 0.12 3e+03 -1.7 0.4 181 191 605 615 604 617 0.90
10 32 0.12 3e+03 -1.7 0.4 181 191 643 653 642 655 0.90
11 32 0.12 3e+03 -1.7 0.4 181 191 681 691 680 693 0.90
12 32 0.12 3e+03 -1.7 0.4 181 191 719 729 718 731 0.90
13 32 0.12 3e+03 -1.7 0.4 181 191 757 767 756 769 0.90
14 32 0.12 3e+03 -1.7 0.4 181 191 795 805 794 807 0.90
15 32 0.12 3e+03 -1.7 0.4 181 191 833 843 832 845 0.90
16 32 0.12 3e+03 -1.7 0.4 181 191 871 881 870 883 0.90
17 32 0.12 3e+03 -1.7 0.4 181 191 909 919 908 921 0.90
18 32 0.12 3e+03 -1.7 0.4 181 191 947 957 946 959 0.90
19 32 0.12 3e+03 -1.7 0.4 181 191 985 995 984 997 0.90
20 32 0.12 3e+03 -1.7 0.4 181 191 1023 1033 1022 1035 0.90
21 32 0.12 3e+03 -1.7 0.4 181 191 1061 1071 1060 1073 0.90
22 32 0.12 3e+03 -1.7 0.4 181 191 1099 1109 1098 1111 0.90
23 32 0.12 3e+03 -1.7 0.4 181 191 1137 1147 1136 1149 0.90
24 32 0.12 3e+03 -1.7 0.4 181 191 1175 1185 1174 1187 0.90
25 32 0.12 3e+03 -1.7 0.4 181 191 1213 1223 1212 1225 0.90
26 32 0.12 3e+03 -1.7 0.4 181 191 1251 1261 1250 1263 0.90
27 32 0.12 3e+03 -1.7 0.4 181 191 1289 1299 1288 1301 0.90
28 32 0.12 3e+03 -1.7 0.4 181 191 1327 1337 1326 1339 0.90
29 32 0.12 3e+03 -1.7 0.4 181 191 1365 1375 1364 1377 0.90
30 32 0.12 3e+03 -1.7 0.4 181 191 1403 1413 1402 1415 0.90
31 32 0.12 3e+03 -1.7 0.4 181 191 1441 1451 1440 1453 0.90
32 32 0.12 3e+03 -1.7 0.4 181 191 1479 1489 1478 1491 0.90
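
Only the first row of the table above has a low independent E-value; the remaining rows score below zero. As a rough sketch of how such rows could be read and filtered, the code below parses the 13 whitespace-separated fields in the order they appear in this record. The names `DomainHit`, `parse_hit`, and `significant` are hypothetical, and the `1e-3` cutoff is an illustrative assumption, not a threshold stated by this database.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    """One row of the domain table, in the column order used in this record."""
    index: int
    of: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hit(line: str) -> DomainHit:
    """Parse one whitespace-separated row into a DomainHit."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]), float(fields[4]), float(fields[5]),
        *(int(value) for value in fields[6:12]),
        float(fields[12]),
    )


def significant(hits: List[DomainHit], max_i_evalue: float = 1e-3) -> List[DomainHit]:
    """Keep hits whose independent E-value is at or below the (assumed) cutoff."""
    return [hit for hit in hits if hit.i_evalue <= max_i_evalue]


# The first two rows of the table, copied from this record.
rows = [
    "1 32 4.2e-49 1.1e-44 153.1 0.0 3 191 82 273 80 275 0.91",
    "2 32 0.12 3e+03 -1.7 0.4 181 191 339 349 338 351 0.90",
]
hits = [parse_hit(row) for row in rows]
kept = significant(hits)
for hit in kept:
    print(hit.index, hit.ali_from, hit.ali_to, hit.score)
```

With these two rows, only the first passes the filter. It covers alignment positions 82 to 273 of the protein.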

Sequence Information

Coding Sequence
ATGCTGAAGCAGGAGAGCGTCAACATGGCCGAATTCGAGGAATATTTGGTCGACACCAATATACTCAGCGGCGTCAACCTAGAAGACCTGGAGCGTCTAGGCAGCTTCAACAGCCAGGGCAACTTGGAGGTGCTCTCCCCGCCAGACGTCGCTGAGGAGACGCTCGACATCAGCTATGTCATTCACAATGTACCCAGCATTGAAGAAATGATCCCGGTGTCTCCGCGCGGCCCGCCCGCTCGCTCATCGTTCGCAGGCGGCCTCAACTTCGCCGTGGAAATCAACGCTGCAGACACCAATAGGAAGAGATATCTTTACTCGGACAAGCTGAACCGTATCTATGTGGATATAAAGACGAATTTTGCGGTTCAATTCCGGTGGGATTTCGAGGCGGCTGCGAGCCCGATGTTCGTGCGCGCTACCACTGTGTTCTCCGACGAAGCGCAGTCTGAGAAGAGAGTCGAGAGATGTCTGCAGCATACACATGAAAGTGCTAATGCTGCCATAGACCCGGTGATAGTGAAGAACGTGCTGCACTCGTCGGCCGCGCCGGGCACGCGGGGCGTGTTCTACTGCGGCGCGCCCTCGGTGCCGGACTCGTGGTACTCCGTGCTGCTGCGCTTCGACGGCCGCCCGCAGCAGCCCCACTCGCACGCCTACCAGTTCGTCTGCAAGAACTCCTGCTCCAGCGGGATCAACAGGCGGGCTATTGACATCATCTTCACATTGGAAGATCATACAGGCCACGTGTACGGCCGAGAGACGGTAGGCGCCCGCGTGTGCGCCTGCCCGCGCCGCGACAAGCTCAAGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCCGCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACGTGCTGGTGCTGCCGGCGCTCAAGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCCGCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACTGCCCGCGCCGCGACAAGCTCAAGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCCGCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACTGCCCGCGCCGCGACAAGCTCAAGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCCGCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACTGCCCGCGCCGCGACAAGCTCAAGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCCGCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACTGCCCGCGCCGCGACAAGCTCAAGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCCGCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACTGCCCGCGCCGCGACAAGCTCAAGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCCGCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACTGCCCGCGCCGCGACAAGCTCAAGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCCGCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACTGCCCGCGCCGCGACAAGCTCAAGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCCGCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACTGCCCGCGCCGCGACAAGCTCAAGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCCGCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACTGCCCGCGCCGCGACAAGCTCAAGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCC
GCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACTGCCCGCGCCGCGACAAGCTCAAGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCCGCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACTGCCCGCGCCGCGACAAGCTCAAGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCCGCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACTGCCCGCGCCGCGACAAGCTCAAGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCCGCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACTGCCCGCGCCGCGACAAGCTCAAGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCCGCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACTGCCCGCGCCGCGACAAGCTCAAGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCCGCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACTGCCCGCGCCGCGACAAGCTCAAGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCCGCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACTGCCCGCGCCGCGACAAGCTCAAGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCCGCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACTGCCCGCGCCGCGACAAGCTCAAGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCCGCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACTGCCCGCGCCGCGACAAGCTCAAGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCCGCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACTGCCCGCGCCGCGACAAGCTCAAGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCCGCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACTGCCCGCGCCGCGACAAGCTCAAGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCCGCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACTGCCCGCGCCGCGACAAGCTCAAGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCCGCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACTGCCCGCGCCGCGACAAGCTCAAGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCCGCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACTGCCCGCGCCGCGACAAGCTCAAGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCCGCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACTGCCCGCGCCGCGACAAGCTCAAGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCCGCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACTGCCCGCGCCGCGACAAGCTCAAGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCCGCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACTGCCCGCGCCGCGACAAGCTCAAGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCCGCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACTGCCCGCGCCGCGACAAGCTCA
AGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCCGCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACTGCCCGCGCCGCGACAAGCTCAAGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCCGCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACTGCCCGCGCCGCGACAAGCTCAAGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCCGCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACTGCCCGCGCCGCGACAAGCTCAAGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCCGCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACTGCCCGCGCCGCGACAAGCTCAAGGACGAGGAGGTGGAGCCCAAGCCCAACAAGCGGCGCAGCTCGCAGCCGCCGCGCCCCGGGAAGAGAGTCAAGGTGGAGCCTGATGACGACGTGCTGGTGCTGCCGGCGCTACAGATAGTGGGGCGGCAAACTATGGTCACTGGCCTTGAAGTCATGCTCAGCATGATGGAGGCAGTGGCCACCAAGACGCCTGAACACGAGCAGGACAAATACCAGACCTCCATCCGAGCTCTCCGGGACAAGATAGAGCAACTGAGAGGTTCGCAGACTGAGTAA
Protein Sequence
MLKQESVNMAEFEEYLVDTNILSGVNLEDLERLGSFNSQGNLEVLSPPDVAEETLDISYVIHNVPSIEEMIPVSPRGPPARSSFAGGLNFAVEINAADTNRKRYLYSDKLNRIYVDIKTNFAVQFRWDFEAAASPMFVRATTVFSDEAQSEKRVERCLQHTHESANAAIDPVIVKNVLHSSAAPGTRGVFYCGAPSVPDSWYSVLLRFDGRPQQPHSHAYQFVCKNSCSSGINRRAIDIIFTLEDHTGHVYGRETVGARVCACPRRDKLKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDVLVLPALKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDCPRRDKLKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDCPRRDKLKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDCPRRDKLKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDCPRRDKLKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDCPRRDKLKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDCPRRDKLKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDCPRRDKLKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDCPRRDKLKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDCPRRDKLKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDCPRRDKLKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDCPRRDKLKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDCPRRDKLKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDCPRRDKLKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDCPRRDKLKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDCPRRDKLKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDCPRRDKLKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDCPRRDKLKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDCPRRDKLKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDCPRRDKLKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDCPRRDKLKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDCPRRDKLKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDCPRRDKLKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDCPRRDKLKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDCPRRDKLKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDCPRRDKLKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDCPRRDKLKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDCPRRDKLKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDCPRRDKLKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDCPRRDKLKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDCPRRDKLKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDCPRRDKLKDEEVEPKPNKRRSSQPPRPGKRVKVEPDDDVLVLPALQIVGRQTMVTGLEVMLSMMEAVATKTPEHEQDKYQTSIRALRDKIEQLRGSQTE
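
The Coding Sequence and Protein Sequence fields can be cross-checked by translating one into the other. The sketch below assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly. `translate` is a hypothetical helper that removes whitespace first, so a sequence wrapped over several lines can be passed in as is.

```python
from itertools import product

# Standard genetic code (NCBI table 1) with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (including line breaks) is removed before translation.
    Codons containing characters other than A, C, G, T become 'X'.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 14 codons and last 6 codons of the Coding Sequence in this record.
print(translate("ATGCTGAAGCAGGAGAGCGTCAACATGGCCGAATTCGAGGAA"))
print(translate("GGTTCGCAGACTGAGTAA"))
```

The two fragments translate to the first 14 and last 5 residues of the Protein Sequence above, which is consistent with the standard-code assumption for these regions.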

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-