Cbeh022452.1
Basic Information
- Insect
- Colias behrii
- Gene Symbol
- -
- Assembly
- GCA_029959075.1
- Location
- JARWMB010000020.1:11591672-11602906[+]
Transcription Factor Domain
- TF Family
- P53
- Domain
- P53 domain
- PFAM
- PF00870
- TF Group
- Beta-Scaffold Factors
- Description
- P53 is a tumor suppressor gene product; mutations in p53 or lack of expression are found associated with a large fraction of all human cancers. P53 is activated by DNA damage and acts as a regulator of gene expression that ultimatively blocks progression through the cell cycle. P53 binds to DNA as a tetrameric transcription factor. In its inactive form, p53 is bound to the ring finger protein Mdm2, which promotes its ubiquitinylation and subsequent proteosomal degradation. Phosphorylation of p53 disrupts the Mdm2-p53 complex, while the stable and active p53 binds to regulatory regions of its target genes, such as the cyclin-kinase inhibitor p21, which complexes and inactivates cdk2 and other cyclin complexes [PMID: 20066118, PMID: 12629332, PMID: 1397838, PMID: 6544917, PMID: 19826090, PMID: 19776744, PMID: 6278740, PMID: 221923, PMID: 6318442, PMID: 20030809].This domain is found in p53 transcription factors, where it is responsible for DNA-binding. The DNA-binding domain acts to clamp, or in the case of TonEBP, encircle the DNA target in order to stabilise the protein-DNA complex [PMID: 11780147]. Protein interactions may also serve to stabilise the protein-DNA complex, for example in the STAT-1 dimer the SH2 (Src homology 2) domain in each monomer is coupled to the DNA-binding domain to increase stability [PMID: 9630226]. The DNA-binding domain consists of a beta-sandwich formed of 9 strands in 2 sheets with a Greek-key topology. This structure is found in many transcription factors, often within the DNA-binding domain.
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 18 2.4e-18 8.5e-14 52.8 0.3 4 88 76 161 73 171 0.92 2 18 0.0031 1.1e+02 3.5 1.2 68 88 171 191 162 198 0.79 3 18 0.0046 1.6e+02 3.0 1.8 68 88 201 221 188 227 0.80 4 18 0.0048 1.7e+02 2.9 1.8 69 88 232 251 223 260 0.79 5 18 0.0054 1.9e+02 2.7 1.9 69 88 262 281 253 288 0.79 6 18 0.0047 1.6e+02 2.9 1.7 68 88 291 311 282 317 0.79 7 18 0.0047 1.6e+02 2.9 1.8 69 88 322 341 313 350 0.79 8 18 0.0045 1.6e+02 3.0 1.7 69 88 352 371 343 379 0.79 9 18 0.005 1.8e+02 2.8 1.9 69 88 382 401 373 410 0.79 10 18 0.004 1.4e+02 3.1 1.6 68 88 411 431 402 440 0.79 11 18 0.005 1.7e+02 2.8 1.9 69 88 442 461 433 470 0.79 12 18 0.0043 1.5e+02 3.0 1.8 68 88 471 491 458 500 0.80 13 18 0.0039 1.4e+02 3.2 1.6 68 88 501 521 492 529 0.79 14 18 0.0059 2.1e+02 2.6 2.0 69 88 532 551 523 557 0.79 15 18 0.0051 1.8e+02 2.8 1.7 69 88 562 581 554 587 0.79 16 18 0.0038 1.3e+02 3.2 1.5 68 88 591 611 582 618 0.79 17 18 0.0044 1.6e+02 3.0 1.7 68 88 621 641 612 647 0.79 18 18 2.6e-28 9.2e-24 85.3 6.5 95 192 706 800 641 801 0.77
Sequence Information
- Coding Sequence
- ATGTTCAaaaatgaagaaattcataTATTTCAACAAGCTGAAGGAGAAGATAATATAATTACATTCAACGAGGTGGACTTCCAAATGTTGAACACGTACCTGCCGCAAACTCCGGACCTGCTGCAGGATCCACAGTTCTGCGAGGAGACCATGGATGTACAGAGGAAGTATGAGCAGCCTATACAGCTTGTACCTGGTTCACCTCTCGGCCCTCCTGTGAGAGTCAACTTTGCTGGTGATGTCAACTTCACAGTGGAGATACATGCTGCTCATACAAACAAAAAGAAGTATTTGTACTCGCATCGCTTGAACCGGATCTACGTGGACATGGGCTGCAACTTCTCCGTTGGCTTCCGCTGGGACGCGGACAAGCTGTGCGGCCGCGATGTCTTCGTGCGGGCAACCACCGTGTTCGCGGAGCTGGCGCAGGCCGAGAAGCGAGTCGAGCGCTGCTACCAGCACTCGCACGAGTCGTCCAACGCGGGTCAGTGTGTGTGCGAGCTCGCGGAGCTGGCGCAGGCCGAGAAGCGAGTCGAGCGCTGCTACCAGCACTCGCACGAGTCGTCCAACGCGGGTCAGTGTGTGTGCGAGCTCGCGGAGCTGGCGCAGGCCGAGAAGCGAGTCGAGCGCTGCTACCAGCACTCGCACGAGTCGTCCAACGCGGGTCAGTGTGTGTGCGAGCTCGCGGAGCTGGCGCAGGCCGAGAAGCGAGTCGAGCGCTGCTACCAGCACTCGCACGAGTCGTCCAACGCGGGTCAGTGTGTGTGCGAGCTCGCGGAGCTGGCGCAGGCCGAGAAGCGAGTCGAGCGCTGCTACCAGCACTCGCACGAGTCGTCCAACGCGGGTCAGTGTGTGTGCGAGCTCGCGGAGCTGGCGCAGGCCGAGAAGCGAGTCGAGCGCTGCTACCAGCACTCGCACGAGTCGTCCAACGCGGGTCAGTGTGTGTGCGAGCTCGCGGAGCTGGCGCAGGCCGAGAAGCGAGTCGAGCGCTGCTACCAGCACTCGCACGAGTCGTCCAACGCGGGTCAGTGTGTGTGCGAGCTCGCGGAGCTGGCGCAGGCCGAGAAGCGAGTCGAGCGCTGCTACCAGCACTCGCACGAGTCGTCCAACGCGGGTCAGTGTGTGTGCGAGCTCGCGGAGCTGGCGCAGGCCGAGAAGCGAGTCGAGCGCTGCTACCAGCACTCGCACGAGTCGTCCAACGCGGGTCAGTGTGTGTGCGAGCTCGCGGAGCTGGCGCAGGCCGAGAAGCGAGTCGAGCGCTGCTACCAGCACTCGCACGAGTCGTCCAACGCGGGTCAGTGTGTGTGCGAGCTCGCGGAGCTGGCGCAGGCCGAGAAGCGAGTCGAGCGCTGCTACCAGCACTCGCACGAGTCGTCCAACGCGGGTCAGTGTGTGTGCGAGCTCGCGGAGCTGGCGCAGGCCGAGAAGCGAGTCGAGCGCTGCTACCAGCACTCGCACGAGTCGTCCAACGCGGGTCAGTGTGTGTGCGAGCTCGCGGAGCTGGCGCAGGCCGAGAAGCGAGTCGAGCGCTGCTACCAGCACTCGCACGAGTCGTCCAACGCGGGTCAGTGTGTGTGCGAGCTCGCGGAGCTGGCGCAGGCCGAGAAGCGAGTCGAGCGCTGCTACCAGCACTCGCACGAGTCGTCCAACGCGGGTCAGTGTGTGTGCGAGCTCGCGGAGCTGGCGCAGGCCGAGAAGCGAGTCGAGCGCTGCTACCAGCACTCGCACGAGTCGTCCAACGCGGGTCAGTGTGTGTGCGAGCTCGCGGAGCTGGCGCAGGCCGAGAAGCGAGTCGAGCGCTGCTACCAGCACTCGCACGAGTCGTCCAACGCGGGTCAGTGTGTGTGCGAGCTCGCGGAGCTGGCGCAGGCCGAGAAGCGAGTCGAGCGCTGCTACCAGCACTCGCACGAGTCGTCCAACGCGGGTCAGTGTGTGTGCGAGCTCGCGGAGCTGGCGCAGGCCGAGAAGCGAGTCGAGCGCTGCTACCAGCACTCGCACGAGTCGTCCAACGCGGGTCAGTGTGTGTGCGAGCTCGCGGAGCTGGCGCAGGCCGAGAAGTGGTATACTGTGATAGTGTCTTCAAGAAGAAAATGCACAGAGCGCGCAGTGACAATGAACGTGCTCCGCTCAGCCCGGGCGCTCGGTTCAGCCGGCGTGTTCTACTGCGGACGGGCTGAGCGCAGCGACTCGTGGTACTCGGTGGCGGTGCGGTTGGATGCCCCCGCAGACCTCGCGTATGTGTTCGCCTGCAAGAGCTCCTGCTCCTCCGGGATTAATAGACGGCCGATTAATGTTATCTTCACGTTGGAGGATATCTGCGGCAAAGTGTACGGGCGGCAGAGTGTAGGCGTGCGCGTGTGTTCCTGCCCCCGGCGGGACATGCTGCGCGACGAGGAGGACGCGGAAGTGCCCCGGGGCAAGCGGCCCGCTGCCCCACCTGCCCCGGGCGGGCAGGACAAGGGCAAGAAGGTGCGGCTGGAGGTGGAGAAGGACGACGCTGTGGTCACGTTGCCGTCGATAGAAGTGGTAGGCGTTCGAACAGCGATACTAGGGCTGGAAGTAATGGTCGGCATGATGGAGCAGACCCGGGACGCGCGTGCTAGCTCACAGCAGCCCATAGCAGATATTAACACGCAGATAAATAACCTCAAGAGTGTTATAGATGGGCTCAAGACGCAGAAGAAATGA
- Protein Sequence
- MFKNEEIHIFQQAEGEDNIITFNEVDFQMLNTYLPQTPDLLQDPQFCEETMDVQRKYEQPIQLVPGSPLGPPVRVNFAGDVNFTVEIHAAHTNKKKYLYSHRLNRIYVDMGCNFSVGFRWDADKLCGRDVFVRATTVFAELAQAEKRVERCYQHSHESSNAGQCVCELAELAQAEKRVERCYQHSHESSNAGQCVCELAELAQAEKRVERCYQHSHESSNAGQCVCELAELAQAEKRVERCYQHSHESSNAGQCVCELAELAQAEKRVERCYQHSHESSNAGQCVCELAELAQAEKRVERCYQHSHESSNAGQCVCELAELAQAEKRVERCYQHSHESSNAGQCVCELAELAQAEKRVERCYQHSHESSNAGQCVCELAELAQAEKRVERCYQHSHESSNAGQCVCELAELAQAEKRVERCYQHSHESSNAGQCVCELAELAQAEKRVERCYQHSHESSNAGQCVCELAELAQAEKRVERCYQHSHESSNAGQCVCELAELAQAEKRVERCYQHSHESSNAGQCVCELAELAQAEKRVERCYQHSHESSNAGQCVCELAELAQAEKRVERCYQHSHESSNAGQCVCELAELAQAEKRVERCYQHSHESSNAGQCVCELAELAQAEKRVERCYQHSHESSNAGQCVCELAELAQAEKRVERCYQHSHESSNAGQCVCELAELAQAEKWYTVIVSSRRKCTERAVTMNVLRSARALGSAGVFYCGRAERSDSWYSVAVRLDAPADLAYVFACKSSCSSGINRRPINVIFTLEDICGKVYGRQSVGVRVCSCPRRDMLRDEEDAEVPRGKRPAAPPAPGGQDKGKKVRLEVEKDDAVVTLPSIEVVGVRTAILGLEVMVGMMEQTRDARASSQQPIADINTQINNLKSVIDGLKTQKK
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -