Basic Information

Gene Symbol
-
Assembly
GCA_963662175.1
Location
OY759217.1:45889318-45890418[-]

Transcription Factor Domain

TF Family
SRF
Domain
SRF domain
PFAM
PF00319
TF Group
Helix-turn-helix
Description
Serum response factor (SRF) is a ubiquitous nuclear protein important for cell proliferation and differentiation. SRF function is essential for transcriptional regulation of numerous growth-factor-inducible genes, such as c-fos oncogene and muscle-specific actin genes. A core domain of around 90 amino acids is sufficient for the activities of DNA-binding, dimerisation and interaction with accessory factors. Within the core is a DNA-binding region, designated the MADS box [2], that is highly similar to many eukaryotic regulatory proteins: among these are MCM1, the regulator of cell type-specific genes in fission yeast; DSRF, a Drosophila trachea development factor; the MEF2 family of myocyte-specific enhancer factors; and the Agamous and Deficiens families of plant homeotic proteins. In SRF, the MADS box has been shown to be involved in DNA-binding and dimerisation [1]. Proteins belonging to the MADS family function as dimers, the primary DNA-binding element of which is an anti-parallel coiled coil of two amphipathic α-helices, one from each subunit. The DNA wraps around the coiled coil allowing the basic N-termini of the helices to fit into the DNA major groove. The chain extending from the helix N-termini reaches over the DNA backbone and penetrates into the minor groove. A 4-stranded, anti-parallel β-sheet packs against the coiled-coil face opposite the DNA and is the central element of the dimerisation interface. The MADS-box domain is commonly found associated with K-box region see (IPR002487).
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 16 0.025 2.5e+02 2.6 0.0 12 27 4 19 1 22 0.85
2 16 0.0051 50 4.9 0.0 8 27 23 42 20 45 0.87
3 16 0.01 1e+02 3.9 0.0 12 29 50 67 47 68 0.86
4 16 0.006 59 4.6 0.0 8 27 69 88 66 91 0.88
5 16 0.0049 48 4.9 0.0 8 27 92 111 88 114 0.88
6 16 0.021 2.1e+02 2.9 0.0 12 27 119 134 116 137 0.84
7 16 0.031 3e+02 2.4 0.0 14 27 144 157 139 162 0.84
8 16 0.0049 48 4.9 0.0 8 27 161 180 157 183 0.88
9 16 0.021 2.1e+02 2.9 0.0 12 27 188 203 185 206 0.84
10 16 0.021 2.1e+02 2.9 0.0 12 27 211 226 208 229 0.84
11 16 0.021 2.1e+02 2.9 0.0 12 27 234 249 231 252 0.84
12 16 0.021 2.1e+02 2.9 0.0 12 27 257 272 254 275 0.84
13 16 0.021 2.1e+02 2.9 0.0 12 27 280 295 277 298 0.84
14 16 0.022 2.2e+02 2.8 0.0 12 27 303 318 300 321 0.84
15 16 0.027 2.7e+02 2.5 0.0 12 27 326 341 323 343 0.85
16 16 0.028 2.8e+02 2.5 0.0 12 27 349 364 346 366 0.85

Sequence Information

Coding Sequence
ATGGCTCGCTTCGAAACTAGACACAACATTGTGTTTAAAGCCGCTCATGAACTGACACCCCTCCATCGGACGGCTCGCTTCGAAACTAGACACAACATTGTGTTTAAAGCCGCTCATGAACTGACACCCCTCTATTGGATGGCTCGCTTCGAAACTAGACACAACATTGTGTTTAAAGCCGCTCATGAACTGACATCCCTCCATCGGACGGCTCGCTTCGAAACTAGACACAACATTGTGTTTAAAGCCGCTCATGAACTGACACCCCTCCATCGGACGGCTCGCTTCGAAACTAGACACAACATTGTGTTTAAAGCCGCTCATGAACTGACACCCCTCTATTGGATGGCTCGCTTCGAAACTAGACACAACATTGTGTTTAAAGCCGCTCATGAACTGACACCCCTCTATTGGACGGCTCGCTTCGGAACTAGACACAACATTGTGTTCAAAGCCGCTCATGAACTGACACCCCTCCATCGGACGGCTCGCTTCGAAACTAGACACAACATAGTGTTTAAAGCCGCTCATGAACTGACACCCCTCTATTGGATGGCTCGCTTCGAAACTAGACACAACATTGTGTTTAAAGCCGCTCATGAACTGACACCCCTCTATTGGATGGCTCGCTTCGAAACTAGACACAACATTGTGTTTAAAGCCGCTCATGAACTGACACCCCTCTATTGGATGGCTCGCTTCGAAACTAGACACAACATTGTGTTTAAAGCCGCTCATGAACTGACACCCCTCTATTGGATGGCTCGCTTCGAAACTAGACACAACATTGTGTTTAAAGCCGCTCATGAACTGACACCCCTCTATTGGATGGCTCGCTTCGAAACTAGACACAACATTGTGTTTAAAGCCGCTCATGAACTGACACCCCTCTATTGGACGGCTCGCTTCGAAACTAGACACAACATTGTGTTTAAAGCCGCTCATGAACTGACACCCCTCTATTGGATGGCTCGCTTCGAAACTAGACACAACATTGTGTTTAAAGCCGCTCATGAACTGACACCCCTCCATTGGACGGCTCGCTTCGAAACTAGACACAACATTGTGTTTAAAGCCGCTCATGAACTGACACCCCTCTAA
Protein Sequence
MARFETRHNIVFKAAHELTPLHRTARFETRHNIVFKAAHELTPLYWMARFETRHNIVFKAAHELTSLHRTARFETRHNIVFKAAHELTPLHRTARFETRHNIVFKAAHELTPLYWMARFETRHNIVFKAAHELTPLYWTARFGTRHNIVFKAAHELTPLHRTARFETRHNIVFKAAHELTPLYWMARFETRHNIVFKAAHELTPLYWMARFETRHNIVFKAAHELTPLYWMARFETRHNIVFKAAHELTPLYWMARFETRHNIVFKAAHELTPLYWMARFETRHNIVFKAAHELTPLYWTARFETRHNIVFKAAHELTPLYWMARFETRHNIVFKAAHELTPLHWTARFETRHNIVFKAAHELTPL

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-