Basic Information

Gene Symbol
PRDM7
Assembly
GCA_947859175.1
Location
OX401858.1:1708882-1715213[-]

Transcription Factor Domain

TF Family
zf-GAGA
Domain
zf-GAGA domain
PFAM
PF09237
TF Group
Zinc-Coordinating Group
Description
Members of this family bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 12 0.01 1.1e+02 3.3 0.1 22 46 469 493 457 499 0.79
2 12 0.0044 46 4.4 0.0 21 44 496 519 492 524 0.89
3 12 0.00065 6.8 7.1 0.1 21 44 524 547 519 551 0.90
4 12 0.0007 7.2 7.0 0.1 21 44 552 575 548 579 0.90
5 12 0.0007 7.3 7.0 0.1 21 44 580 603 576 607 0.90
6 12 6.8e-05 0.71 10.2 0.1 21 46 608 633 604 639 0.87
7 12 0.0016 17 5.8 0.1 21 45 636 660 632 667 0.87
8 12 0.0021 21 5.5 0.0 22 46 665 689 660 695 0.87
9 12 0.014 1.4e+02 2.9 0.0 21 44 692 715 688 719 0.89
10 12 0.00099 10 6.5 0.0 20 46 719 745 709 751 0.82
11 12 0.0052 54 4.2 0.0 21 45 748 772 744 779 0.85
12 12 0.021 2.2e+02 2.3 0.0 20 48 775 803 764 807 0.79

Sequence Information

Coding Sequence
ATGTCCCAGCGTAATTTGCGGCACAAGCCGCGGATTAGTTATTACGAGCCAGAAGAACCTGATTTGGACGAGTATGTGTTTTGCGAAAAATGCGGAGACTACGTTTACGAGTACTGCGCCATTCATGGACCTTTACTGGTCATACCAGATGATAAGGTTCCCGCCAAACCCAATGTCCCACCGTACGTGCCTCGCGCGGCGCTAACTATTCCTCACGTGTTCCTGCATATCACATATTCCATCATACCAGGTTTGACCTTCCTGAGCCTTCCTTTCCTGCTCGGATCGTGTCCACCAAAGTTGAGCCTATTAGATGCTGGATTGGGAGTATTCACATCAATGGCGCTTCCTTCGGGAGTGCGTTTCGGCCCTTACCAAGGACAGCGAACAGACGTCGTTGACTCTTCGTACTGTTGGCAGATTTACGATAAGGATCGTAAGCCACTTCACTGCATTGACGCAGCCGATGCCAATAAATCTAACTGGATGCGATACGTGAACTGCGCCCGACACTGGAGGGAACAAAACCTTGTGGCATACCAATATCAAGGGGAATTGTATTACAGAACTATCAAAATAATTCCTCGCTACACGGAGCTGATGGTGTTCTACGGCAGCGAATTCGCGTGTGAACTGGACGTCGACCTCGGAAAGTACAACTCTCCGACTGGATATGCGCAGAAATTTGGTGCACCAGTTGCAAAGAAACGCAAACCGAACGATGACCATGCCAAGATAGAAACTCAGGAAGAAATACAAACGAGAAACTTCGAAATGGCTAACACACAAGTGCCGACATCTGCAAAAGGTGCTATTATCAAAGACATAACCAGTGAAATGCAAAATGGACAAGTAAATCTGACTTCGATAGGTTCAGGACAAGTTGGAGAAAATTACACCACAAAATCAGACGACATTAAAGGGGAAAAAAGCAGGAAAAAAGCAGTAACTAAAAACAGTCAGGTAAATAAGAGCAATGGTGTAATTTCAACAAAGAACAATAAAACATCGAAGGTAGCTCAAACCAAACAAGACGGTGCATGTGTTCATCTCAAACCTCATAACATTGAAAGAGATGAAAGTAAAAATAACGACATTAATCTAAACAAGGTAGAGTTCAAAACTAACCTTGCAGCACAAAACAATGTTCTAGACTTGTACATTTATTGTGATAGTTGCAACAAAAAGTGTGCAACAAGGCAAGAGCTTAAAACACATCTTAGATTTCATGATTTAAATCGAAAGTATGTTTGTGAAATTTGCAATTTAAGGTACACTTCAATTTCAAGTTTAAACGATCATAGTAATATTCATAGAATAAATAAAGATTATGAATTTACAAtatgcagtaaaatacatactcgaaatgataatttaaaatctcatatgagaactcacaccgaagaaaagcggtatgcctgtgaagtatgcaatgcaaggtttaatcaaaacagtgatttaaaaacacatatgagaactcacaccggagaaaagccgtatgcctgtgaagtatgcaatgcaagatttaatgaaaacggtaatttaaaaatacatatgagaactcacactggagaaaagccgtatgcttgtgaagtatgcaatgcaaggtttaatcataacagtaatttaaaaaaacatatgagaactcacactggagaaaagccgtatgcttgtgaagtatgcaatgcaaggtttaatcataacagtaatttaaaaaaacatatgagaactcacaccggagaaaagccgtatgcctgtgaagtatgcaatgcaaggtttaatcataacagtaatttaaaaaaacatatgagaactcacaccggagaaaagccgtatgcctgtgaagtatgcaatgcaaggtttaatcaaaacagtgatttaaaaagacatatgagaactcacaccggagaaaagccgtatgcctgtgaagtatgcaatgcaagatttaacgaaaacggttctttaaaaagacatatgagaactcacatcggagaaaagccgtatgcctgtgaagtatgcaatgcaagatttaatgaaaacggtaatttaaaaacacatatgagaactcacaccggagaaaagccgtatggctgtgaagtatgcaatgcaagatttaataaaaacggttctttaaaaaaacatatgagaactcacaccggagaaaagccgtatgcctgtgaagtatgcaatgcaagatttaatgaaaacggtaatttaaaaacacatatgagaactcacaccggagaaaagccgtatgcctgtgaagtatgcaatgcaagatttaataaaaacggttctttaaaaaaacatatgagaactcacaccggagaaaagccgtacgcctgcgaaatatgcgaagaaaaattcacctacacagcaagcttgaaaaatcacctcgtgaAAATACATCTTGGAGACAAAAGTAACCAAAAGCCTACAACATGA
Protein Sequence
MSQRNLRHKPRISYYEPEEPDLDEYVFCEKCGDYVYEYCAIHGPLLVIPDDKVPAKPNVPPYVPRAALTIPHVFLHITYSIIPGLTFLSLPFLLGSCPPKLSLLDAGLGVFTSMALPSGVRFGPYQGQRTDVVDSSYCWQIYDKDRKPLHCIDAADANKSNWMRYVNCARHWREQNLVAYQYQGELYYRTIKIIPRYTELMVFYGSEFACELDVDLGKYNSPTGYAQKFGAPVAKKRKPNDDHAKIETQEEIQTRNFEMANTQVPTSAKGAIIKDITSEMQNGQVNLTSIGSGQVGENYTTKSDDIKGEKSRKKAVTKNSQVNKSNGVISTKNNKTSKVAQTKQDGACVHLKPHNIERDESKNNDINLNKVEFKTNLAAQNNVLDLYIYCDSCNKKCATRQELKTHLRFHDLNRKYVCEICNLRYTSISSLNDHSNIHRINKDYEFTICSKIHTRNDNLKSHMRTHTEEKRYACEVCNARFNQNSDLKTHMRTHTGEKPYACEVCNARFNENGNLKIHMRTHTGEKPYACEVCNARFNHNSNLKKHMRTHTGEKPYACEVCNARFNHNSNLKKHMRTHTGEKPYACEVCNARFNHNSNLKKHMRTHTGEKPYACEVCNARFNQNSDLKRHMRTHTGEKPYACEVCNARFNENGSLKRHMRTHIGEKPYACEVCNARFNENGNLKTHMRTHTGEKPYGCEVCNARFNKNGSLKKHMRTHTGEKPYACEVCNARFNENGNLKTHMRTHTGEKPYACEVCNARFNKNGSLKKHMRTHTGEKPYACEICEEKFTYTASLKNHLVKIHLGDKSNQKPTT

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00737670;
90% Identity
iTF_00737670;
80% Identity
iTF_00737670;