Basic Information

Gene Symbol
ERG
Assembly
GCA_029955525.1
Location
JARPMR010000010.1:27029767-27043482[+]
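
The location string above packs contig, coordinates, and strand into one field. A minimal sketch for splitting it apart follows. The names `Locus` and `parse_locus` are hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "contig:start-end[strand]".
_LOCUS_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_locus(text: str) -> Locus:
    """Split a location string into contig, start, end, and strand."""
    match = _LOCUS_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


locus = parse_locus("JARPMR010000010.1:27029767-27043482[+]")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = locus.end - locus.start + 1
print(locus, span)
```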

Transcription Factor Domain

TF Family
ETS
Domain
Ets domain
PFAM
PF00178
TF Group
Helix-turn-helix
Description
Transcription factors are proteins that bind to specific DNA sequences in the genome, resulting in the induction or inhibition of gene transcription [3]. The product of the ets oncogene is one such factor, possessing a region of 85-90 amino acids known as the ETS (erythroblast transformation specific) domain [3, 4, 5]. This domain is rich in positively charged and aromatic residues, and binds to purine-rich segments of DNA. The ETS domain has been identified in other transcription factors such as PU.1, human erg, human elf-1, human elk-1, GA binding protein, and a number of others [2, 3, 5]. It is generally located at the C terminus of the protein, with the exception of ELF-1, ELK-1, ELK-3, ELK-4 and ERF, where it is found at the N terminus. NMR analysis of the Ets domain structure revealed that it contains three α-helices (α1-α3) and a four-stranded β-sheet (β1-β4) arranged in the order α1-β1-β2-α2-α3-β3-β4, forming a winged helix-turn-helix (wHTH) topology [1]. The third α-helix is responsible for contacting the major groove of the DNA. Different members of the Ets protein family display distinct DNA binding specificities. The Ets domain and its flanking amino acid sequences influence the binding affinity, and the alteration of a single amino acid in the Ets domain can change its DNA binding specificity.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 26 0.022 1.1e+02 4.6 0.0 1 25 34 56 34 62 0.89
2 26 9.4e-22 4.8e-18 66.7 0.0 25 72 85 132 82 137 0.94
3 26 9.1e-22 4.6e-18 66.8 0.0 25 72 148 195 145 200 0.94
4 26 9.1e-22 4.6e-18 66.8 0.0 25 72 211 258 208 263 0.94
5 26 9.1e-22 4.6e-18 66.8 0.0 25 72 274 321 271 326 0.94
6 26 9.1e-22 4.6e-18 66.8 0.0 25 72 337 384 334 389 0.94
7 26 9.1e-22 4.6e-18 66.8 0.0 25 72 400 447 397 452 0.94
8 26 9.1e-22 4.6e-18 66.8 0.0 25 72 463 510 460 515 0.94
9 26 8.9e-22 4.5e-18 66.8 0.0 25 72 526 573 523 578 0.94
10 26 9.1e-22 4.6e-18 66.8 0.0 25 72 589 636 586 641 0.94
11 26 9.1e-22 4.6e-18 66.8 0.0 25 72 652 699 649 704 0.94
12 26 9.1e-22 4.6e-18 66.8 0.0 25 72 715 762 712 767 0.94
13 26 9.1e-22 4.6e-18 66.8 0.0 25 72 778 825 775 830 0.94
14 26 9.1e-22 4.6e-18 66.8 0.0 25 72 841 888 838 893 0.94
15 26 9.1e-22 4.6e-18 66.8 0.0 25 72 904 951 901 956 0.94
16 26 9.1e-22 4.6e-18 66.8 0.0 25 72 967 1014 964 1019 0.94
17 26 9.1e-22 4.6e-18 66.8 0.0 25 72 1030 1077 1027 1082 0.94
18 26 9.1e-22 4.6e-18 66.8 0.0 25 72 1093 1140 1090 1145 0.94
19 26 9.1e-22 4.6e-18 66.8 0.0 25 72 1156 1203 1153 1208 0.94
20 26 9.1e-22 4.6e-18 66.8 0.0 25 72 1219 1266 1216 1271 0.94
21 26 9.1e-22 4.6e-18 66.8 0.0 25 72 1282 1329 1279 1334 0.94
22 26 9.1e-22 4.6e-18 66.8 0.0 25 72 1345 1392 1342 1397 0.94
23 26 9.1e-22 4.6e-18 66.8 0.0 25 72 1408 1455 1405 1460 0.94
24 26 9.1e-22 4.6e-18 66.8 0.0 25 72 1471 1518 1468 1523 0.94
25 26 9.1e-22 4.6e-18 66.8 0.0 25 72 1534 1581 1531 1586 0.94
26 26 6.2e-26 3.1e-22 80.1 0.1 25 81 1597 1653 1594 1653 0.96
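
The rows above are whitespace-separated with thirteen fields each, so they can be read into typed records and filtered. The sketch below uses four rows copied from the table. The names `DomainHit` and `parse_hit` and the `I_EVALUE_CUTOFF` value are assumptions chosen for illustration, not settings taken from this record.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hit(line: str) -> DomainHit:
    """Parse one whitespace-separated domain row (13 fields)."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


# Four rows copied from the table above (hits 1, 2, 3 and 26).
ROWS = """\
1 26 0.022 1.1e+02 4.6 0.0 1 25 34 56 34 62 0.89
2 26 9.4e-22 4.8e-18 66.7 0.0 25 72 85 132 82 137 0.94
3 26 9.1e-22 4.6e-18 66.8 0.0 25 72 148 195 145 200 0.94
26 26 6.2e-26 3.1e-22 80.1 0.1 25 81 1597 1653 1594 1653 0.96
"""

hits = [parse_hit(line) for line in ROWS.splitlines() if line.strip()]

# Hypothetical cutoff for illustration; the record does not state one.
I_EVALUE_CUTOFF = 1e-5
significant = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]

# Aligned length on the protein, treating ali_from..ali_to as inclusive.
ali_lengths = [h.ali_to - h.ali_from + 1 for h in significant]
# Offset between the starts of consecutive hits 2 and 3.
spacing = significant[1].ali_from - significant[0].ali_from
print([h.index for h in significant], ali_lengths, spacing)
```

Under this cutoff, hit 1 is dropped and the rest are kept. In the full table, hits 2 through 26 start every 63 residues, which is consistent with the tandem copies visible in the protein sequence.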

Sequence Information

Coding Sequence
ATGCTGTGGCTGAGCGCGAGCGAGCATCGCCGCATCCAGCAGTTGTATCACGAGCCGTACAACGGGCTGAGCGGCTGCGCGCGCAGCGGCGGGCAGGTGCAGCTGTGGCAGTTCCTGCTGGAGGAgctggcggcgcgcgcgccCGGCATCGAGTGGGTGGACAAGGAAATAAAAGGACTCGGTAAAATGATAATCTTACGACCCGTGATATTGCTGTTGAAATATGATGGTGATGGTGCGCAGGGCAACCCGGCGGACGGCGAGTTCCGGCTGTCGGACCCGGACGAGGTGGCGCGGCGCTGGGGGCGGCGCAAGCAGAAGCCCAACATGAACTACGACAAGCTGTCGCGCGCGCTGCGCTACTACTACGACAAGAGCATCATGAGCAAGGTGCGTGCTGTCTACTGTCTACGTGGTGGTGATGGTGCGCAGGGCGACCCGGCGGACGGCGAGTTCCGGCTGTCGGACCCGGACGAGGTGGCGCGGCGCTGGGGGCGGCGCAAGCAGAAGCCCAACATGAACTACGACAAGCTGTCGCGCGCGCTGCGCTACTACTACGACAAGAGCATCATGAGCAAGGTGCGTGCTGTCTACTGTCTACGTGGTGGTGATGGTGCGCAGGGCGACCCGGCGGACGGCGAGTTCCGGCTGTCGGACCCGGACGAGGTGGCGCGGCGCTGGGGGCGGCGCAAGCAGAAGCCCAACATGAACTACGACAAGCTGTCGCGCGCGCTGCGCTACTACTACGACAAGAGCATCATGAGCAAGGTGCGTGCTGTCTACTGTCTACGTGGTGGTGATGGTGCGCAGGGCGACCCGGCGGACGGCGAGTTCCGGCTGTCGGACCCGGACGAGGTGGCGCGGCGCTGGGGGCGGCGCAAGCAGAAGCCCAACATGAACTACGACAAGCTGTCGCGCGCGCTGCGCTACTACTACGACAAGAGCATCATGAGCAAGGTGCGTGCTGTCTACTGTCTACGTGGTGGTGATGGTGCGCAGGGCGACCCGGCGGACGGCGAGTTCCGGCTGTCGGACCCGGACGAGGTGGCGCGGCGCTGGGGGCGGCGCAAGCAGAAGCCCAACATGAACTACGACAAGCTGTCGCGCGCGCTGCGCTACTACTACGACAAGAGCATCATGAGCAAGGTGCGTGCTGTCTACTGTCTACGTGGTGGTGATGGTGCGCAGGGCGACCCGGCGGACGGCGAGTTCCGGCTGTCGGACCCGGACGAGGTGGCGCGGCGCTGGGGGCGGCGCAAGCAGAAGCCCAACATGAACTACGACAAGCTGTCGCGCGCGCTGCGCTACTACTACGACAAGAGCATCATGAGCAAGGTGCGTGCTGTCTACTGTCTACGTGGTGGTGATGGTGCGCAGGGCGACCCGGCGGACGGCGAGTTCCGGCTGTCGGACCCGGACGAGGTGGCGCGGCGCTGGGGGCGGCGCAAGCAGAAGCCCAACATGAACTACGACAAGCTGTCGCGCGCGCTGCGCTACTACTACGACAAGAGCATCATGAGCAAGGTGCGTGCTGTCTACTGTCTACGTGGTGGTGATGGTGCGCAGGGCGACCCGGCAGACGGCGAGTTCCGGCTGTCGGACCCGGACGAGGTGGCGCGGCGCTGGGGGCGGCGCAAGCAGAAGCCCAACATGAACTACGAGAAGCTGTCGCGCGCGCTGCGCTACTACTACGACAAGAGCATCATGAGCAAGGTGCGTGCTGTCTACTGTCTACGTGGTGGTGATGGTGCGCAGGGCGACCCGGCGGACGGCGAGTTCCGGCTGTCGGACCCGGACGAGGTGGCGCGGCGCTGGGGGCGGCGCAAGCAGAAGCCCAACATGAACTACGACAAGCTGTCGCGCGCGCTGCGCTACTACTACGACAAGAGCATCATGAGCAAGGTGCGTGCTGTCTACTGTCTACGTGGTGGTGATGGTGCGCAGGGCGACCCGGCGGACGGCGAGTTCCGGCTGTCGGACCCGGACGAGGTGGCGCGGCG
CTGGGGGCGGCGCAAGCAGAAGCCCAACATGAACTACGACAAGCTGTCGCGCGCGCTGCGCTACTACTACGACAAGAGCATCATGAGCAAGGTGCGTGCTGTCTACTGTCTACGTGGTGGTGATGGTGCGCAGGGCGACCCGGCGGACGGCGAGTTCCGGCTGTCGGACCCGGACGAGGTGGCGCGGCGCTGGGGGCGGCGCAAGCAGAAGCCCAACATGAACTACGACAAGCTGTCGCGCGCGCTGCGCTACTACTACGACAAGAGCATCATGAGCAAGGTGCGTGCTGTCTACTGTCTACGTGGTGGTGATGGTGCGCAGGGCGACCCGGCGGACGGCGAGTTCCGGCTGTCGGACCCGGACGAGGTGGCGCGGCGCTGGGGGCGGCGCAAGCAGAAGCCCAACATGAACTACGACAAGCTGTCGCGCGCGCTGCGCTACTACTACGACAAGAGCATCATGAGCAAGGTGCGTGCTGTCTACTGTCTACGTGGTGGTGATGGTGCGCAGGGCGACCCGGCGGACGGCGAGTTCCGGCTGTCGGACCCGGACGAGGTGGCGCGGCGCTGGGGGCGGCGCAAGCAGAAGCCCAACATGAACTACGACAAGCTGTCGCGCGCGCTGCGCTACTACTACGACAAGAGCATCATGAGCAAGGTGCGTGCTGTCTACTGTCTACGTGGTGGTGATGGTGCGCAGGGCGACCCGGCGGACGGCGAGTTCCGGCTGTCGGACCCGGACGAGGTGGCGCGGCGCTGGGGGCGGCGCAAGCAGAAGCCCAACATGAACTACGACAAGCTGTCGCGCGCGCTGCGCTACTACTACGACAAGAGCATCATGAGCAAGGTGCGTGCTGTCTACTGTCTACGTGGTGGTGATGGTGCGCAGGGCGACCCGGCGGACGGCGAGTTCCGGCTGTCGGACCCGGACGAGGTGGCGCGGCGCTGGGGGCGGCGCAAGCAGAAGCCCAACATGAACTACGACAAGCTGTCGCGCGCGCTGCGCTACTACTACGACAAGAGCATCATGAGCAAGGTGCGTGCTGTCTACTGTCTACGTGGTGGTGATGGTGCGCAGGGCGACCCGGCGGACGGCGAGTTCCGGCTGTCGGACCCGGACGAGGTGGCGCGGCGCTGGGGGCGGCGCAAGCAGAAGCCCAACATGAACTACGACAAGCTGTCGCGCGCGCTGCGCTACTACTACGACAAGAGCATCATGAGCAAGGTGCGTGCTGTCTACTGTCTACGTGGTGGTGATGGTGCGCAGGGCGACCCGGCGGACGGCGAGTTCCGGCTGTCGGACCCGGACGAGGTGGCGCGGCGCTGGGGGCGGCGCAAGCAGAAGCCCAACATGAACTACGACAAGCTGTCGCGCGCGCTGCGCTACTACTACGACAAGAGCATCATGAGCAAGGTGCGTGCTGTCTACTGTCTACGTGGTGGTGATGGTGCGCAGGGCGACCCGGCGGACGGCGAGTTCCGGCTGTCGGACCCGGACGAGGTGGCGCGGCGCTGGGGGCGGCGCAAGCAGAAGCCCAACATGAACTACGACAAGCTGTCGCGCGCGCTGCGCTACTACTACGACAAGAGCATCATGAGCAAGGTGCGTGCTGTCTACTGTCTACGTGGTGGTGATGGTGCGCAGGGCGACCCGGCGGACGGCGAGTTCCGGCTGTCGGACCCGGACGAGGTGGCGCGGCGCTGGGGGCGGCGCAAGCAGAAGCCCAACATGAACTACGACAAGCTGTCGCGCGCGCTGCGCTACTACTACGACAAGAGCATCATGAGCAAGGTGCGTGCTGTCTACTGTCTACGTGGTGGTGATGGTGCGCAGGGCGACCCGGCGGACGGCGAGTTCCGGCTGTCGGACCCGGACGAGGTGGCGCGGCGCTGGGGGCGGCGCAAGCAGAAGCCCAACATGAACTACGACAAGCTGTCGCGCGCGCTGCGCTACTACTACGACAAGAGCATCATGAGCAAGGTGCGTGCTGTCTACTGTC
TACGTGGTGGTGATGGTGCGCAGGGCGACCCGGCGGACGGCGAGTTCCGGCTGTCGGACCCGGACGAGGTGGCGCGGCGCTGGGGGCGGCGCAAGCAGAAGCCCAACATGAACTACGACAAGCTGTCGCGCGCGCTGCGCTACTACTACGACAAGAGCATCATGAGCAAGGTGCGTGCTGTCTACTGTCTACGTGGTGGTGATGGTGCGCAGGGCGACCCGGCGGACGGCGAGTTCCGGCTGTCGGACCCGGACGAGGTGGCGCGGCGCTGGGGGCGGCGCAAGCAGAAGCCCAACATGAACTACGACAAGCTGTCGCGCGCGCTGCGCTACTACTACGACAAGAGCATCATGAGCAAGGTGCGTGCTGTCTACTGTCTACGTGGTGGTGATGGTGCGCAGGGCGACCCGGCGGACGGCGAGTTCCGGCTGTCGGACCCGGACGAGGTGGCGCGGCGCTGGGGGCGGCGCAAGCAGAAGCCCAACATGAACTACGACAAGCTGTCGCGCGCGCTGCGCTACTACTACGACAAGAGCATCATGAGCAAGGTGCGTGCTGTCTACTGTCTACGTGGTGGTGATGGTGCGCAGGGCGACCCGGCGGACGGCGAGTTCCGGCTGTCGGACCCGGACGAGGTGGCGCGGCGCTGGGGGCGGCGCAAGCAGAAGCCCAACATGAACTACGACAAGCTGTCGCGCGCGCTGCGCTACTACTACGACAAGAGCATCATGAGCAAGGTGCGTGCTGTCTACTGTCTACGTGGTGGTGATGGTGCGCAGGGCGACCCGGCGGACGGCGAGTTCCGGCTGTCGGACCCGGACGAGGTGGCGCGGCGCTGGGGCCGGCGCAAGCAGAAGCCCAACATGAACTACGACAAGCTGTCGCGCGCGCTGCGCTACTACTACGACAAGAGCATCATGAGCAAGGTGCACGGCACGCGGTACGCGTACCGGTTCAGCTGGGCGGGGCTCGCGGCCGCCAGCCAGGCGCAGGCGCCCGACGCGCCGCCCTACTGGCACTACGTCCCGCGCCCGCAGCCGCACCCGCCCGCGCCCTCCGCGCCgcagccgccgccgcccgccccaCAGCACCGCCCGCCCTTGCAGTGA
Protein Sequence
MLWLSASEHRRIQQLYHEPYNGLSGCARSGGQVQLWQFLLEELAARAPGIEWVDKEIKGLGKMIILRPVILLLKYDGDGAQGNPADGEFRLSDPDEVARRWGRRKQKPNMNYDKLSRALRYYYDKSIMSKVRAVYCLRGGDGAQGDPADGEFRLSDPDEVARRWGRRKQKPNMNYDKLSRALRYYYDKSIMSKVRAVYCLRGGDGAQGDPADGEFRLSDPDEVARRWGRRKQKPNMNYDKLSRALRYYYDKSIMSKVRAVYCLRGGDGAQGDPADGEFRLSDPDEVARRWGRRKQKPNMNYDKLSRALRYYYDKSIMSKVRAVYCLRGGDGAQGDPADGEFRLSDPDEVARRWGRRKQKPNMNYDKLSRALRYYYDKSIMSKVRAVYCLRGGDGAQGDPADGEFRLSDPDEVARRWGRRKQKPNMNYDKLSRALRYYYDKSIMSKVRAVYCLRGGDGAQGDPADGEFRLSDPDEVARRWGRRKQKPNMNYDKLSRALRYYYDKSIMSKVRAVYCLRGGDGAQGDPADGEFRLSDPDEVARRWGRRKQKPNMNYEKLSRALRYYYDKSIMSKVRAVYCLRGGDGAQGDPADGEFRLSDPDEVARRWGRRKQKPNMNYDKLSRALRYYYDKSIMSKVRAVYCLRGGDGAQGDPADGEFRLSDPDEVARRWGRRKQKPNMNYDKLSRALRYYYDKSIMSKVRAVYCLRGGDGAQGDPADGEFRLSDPDEVARRWGRRKQKPNMNYDKLSRALRYYYDKSIMSKVRAVYCLRGGDGAQGDPADGEFRLSDPDEVARRWGRRKQKPNMNYDKLSRALRYYYDKSIMSKVRAVYCLRGGDGAQGDPADGEFRLSDPDEVARRWGRRKQKPNMNYDKLSRALRYYYDKSIMSKVRAVYCLRGGDGAQGDPADGEFRLSDPDEVARRWGRRKQKPNMNYDKLSRALRYYYDKSIMSKVRAVYCLRGGDGAQGDPADGEFRLSDPDEVARRWGRRKQKPNMNYDKLSRALRYYYDKSIMSKVRAVYCLRGGDGAQGDPADGEFRLSDPDEVARRWGRRKQKPNMNYDKLSRALRYYYDKSIMSKVRAVYCLRGGDGAQGDPADGEFRLSDPDEVARRWGRRKQKPNMNYDKLSRALRYYYDKSIMSKVRAVYCLRGGDGAQGDPADGEFRLSDPDEVARRWGRRKQKPNMNYDKLSRALRYYYDKSIMSKVRAVYCLRGGDGAQGDPADGEFRLSDPDEVARRWGRRKQKPNMNYDKLSRALRYYYDKSIMSKVRAVYCLRGGDGAQGDPADGEFRLSDPDEVARRWGRRKQKPNMNYDKLSRALRYYYDKSIMSKVRAVYCLRGGDGAQGDPADGEFRLSDPDEVARRWGRRKQKPNMNYDKLSRALRYYYDKSIMSKVRAVYCLRGGDGAQGDPADGEFRLSDPDEVARRWGRRKQKPNMNYDKLSRALRYYYDKSIMSKVRAVYCLRGGDGAQGDPADGEFRLSDPDEVARRWGRRKQKPNMNYDKLSRALRYYYDKSIMSKVRAVYCLRGGDGAQGDPADGEFRLSDPDEVARRWGRRKQKPNMNYDKLSRALRYYYDKSIMSKVRAVYCLRGGDGAQGDPADGEFRLSDPDEVARRWGRRKQKPNMNYDKLSRALRYYYDKSIMSKVHGTRYAYRFSWAGLAAASQAQAPDAPPYWHYVPRPQPHPPAPSAPQPPPPAPQHRPPLQ
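
The relationship between the coding sequence and the protein sequence can be checked by translating codons. The sketch below assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly, and treats lowercase bases as equivalent to uppercase. `translate` is a hypothetical helper, and the short inputs are the first 30 and last 12 bases of the coding sequence above.

```python
from itertools import product

# Standard genetic code (NCBI table 1) with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks, normalise case
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the coding sequence above.
head = translate("ATGCTGTGGCTGAGCGCGAGCGAGCATCGC")
tail = translate("CCCTTGCAGTGA")
print(head, tail)
```

Both fragments match the corresponding ends of the protein sequence listed above (`MLWLSASEHR` at the start, `PLQ` at the end).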

Similar Transcription Factors

Clustering by sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-