Basic Information

Gene Symbol
PRDM7
Assembly
GCA_905475405.1
Location
FR997855.1:1394121-1403710[-]

Transcription Factor Domain

TF Family
zf-GAGA
Domain
zf-GAGA domain
PFAM
PF09237
TF Group
Zinc-Coordinating Group
Description
Members of this family bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 18 0.17 3.9e+03 -0.6 0.0 21 45 416 440 408 447 0.71
2 18 0.0077 1.7e+02 3.7 0.0 22 48 445 471 440 475 0.84
3 18 0.059 1.3e+03 0.9 0.0 21 46 472 497 468 503 0.88
4 18 0.057 1.3e+03 0.9 0.0 22 48 501 527 494 531 0.83
5 18 0.0027 62 5.1 0.1 21 44 528 551 524 559 0.68
6 18 0.0056 1.3e+02 4.1 0.1 21 52 556 587 552 588 0.87
7 18 0.024 5.4e+02 2.1 0.1 21 44 584 607 580 611 0.82
8 18 0.0052 1.2e+02 4.2 0.0 21 49 640 668 634 671 0.87
9 18 0.0021 49 5.5 0.0 21 51 668 698 665 700 0.86
10 18 0.082 1.9e+03 0.4 0.1 21 34 696 709 692 723 0.74
11 18 5.7e-05 1.3 10.5 0.0 21 49 724 752 720 755 0.87
12 18 4e-05 0.91 11.0 0.0 21 52 752 783 750 785 0.88
13 18 0.11 2.5e+03 -0.0 0.0 21 51 780 810 778 813 0.83
14 18 0.0021 48 5.5 0.0 21 48 808 835 801 839 0.87
15 18 0.0014 33 6.0 0.0 21 52 836 867 834 869 0.88
16 18 0.003 69 5.0 0.2 21 48 864 891 860 895 0.87
17 18 0.0031 71 4.9 0.1 21 48 892 919 889 923 0.87
18 18 0.0033 75 4.9 0.0 21 52 920 951 916 953 0.88

Sequence Information

Coding Sequence
ATGCCAGGACGCACGCTTCGTCAGAAAACACGCGTCTCTTACTACGAGCCTGAAGAGCCGGCGCTAGACGAATATATTTTTTGTGAAGAATGTTCAGACTACGTGTTCGAGTACTGCGCTATACATGGACCACTGCTTGTTATACCAGACGACAAGCTGTCGTCCAAGAACAGCGTCCCTGCGATAGTGCCTCGCGCCGCGCTCACCATACCTCACGTGTTCCTGCACCTGGCGCCCTCCTACATACCTGGCGCGGGTATAGGCGTGTTCAGCACCCTCACACTACCGCGCGGAGTCCGCTTCGGTCCGTATCGCGGACAGCGGACTGACGGCGTCGACTCTAAGTACTGCTGGCAGATATACGATCGCAACAACAAGCGGTCGCACGTCGTAGACGCGGCTGACTCACAACAGTCAAACTGGATGCGCTACGTGAACTGCGCCAGACACTGGAGGGAACAGAACCTTGTCGCCTTCCAGTACAAGGGACAACTGTATTACAGAACTATTAAGATTATTCCTCGTTTCACGGAGCTGCTAGTGTTCTACGGCAGTGAGTTCGCTAACTCATTGCACATCGATCTTGGAGCTTACAACGCGCCGAAGGGATATGCTCGTAAATTCGGTGCTCCTAAAAAACCGAATCAAAAGGAAAACTATGAAGATAATACACAAAACAAACATAAGAAATGCAAAACTGTTGAAGCAAACAGTGAACAAGAcacagttataataaaaaaagcgattacaaaacgtaaaatacttattaaagaTAATGTTGATAATATCAAGAAACGTAAACACACTTTTGTTGCACCTACCATTAATTTACATAAAGAGAAGCCTGATACACCTCTGAATGTTGATACAAGTAATGTCAATGTAAACGTCGCGGATAAAACCATTACGAGACATAGAGTAATCGATGTtaagaataaaaattataataaacaaaatttagtAGCACCGATTAATAAGTTTTGCAGAGATATGCATGAGAAAAGAGCAAAGGAtattattgaaaatataaaTGATATTAGTTCCATAAACAAACTGACCACAAACATAAAACATGATAATGTAGAAATTGCTAATAAATGTAAAGAAAAGAAAGAGGTAACAGAAAATAAAGTAGATAGTAACAATAATGTTAATATTAGTAAAACGAAATTTAATGAATGCAGTGTTTGTCATAAAAGTTTTACTTCAAAAACTTATCTGGATAAACATTTACGTATACATACTGGTGAGAAACCATATAAATGTGATGTTTGTAATAAGAGTTTTAACTTTAAACATCATTTAGTAACTCATTTACGTATACATACTGACCAGAAACCATATACATGTGATGTTTGTAATAAGAGTTTTAAACGAAGTGATAGTTTAGTAACTCATATGCGTATACATACTGGCGAGAAACCATACAAATGTGATGTTTGTAATAAGAGTTTTAACCGCAATGATAATTTAGTAACTCATATGCGTATACATACTGGTGAGAAATCATACACATGTGTTATTTGTAATAAGAGTTTTACCGAAAATGGTACTTTAGTTAAACATTTACGTATACATACTGGTGAAAAACCATATACATGTGATATTTGTAACAAGAGTTTTAACCTTAAACATCATTTAGTACAACATTTACGTATACATACTGGCGAGAAACCATATAAATGTGATGTTTGTAATAAGAGTTTTAGCCAAAATTGTGATTTAGTAAAACATTTACGTATACATACTGGTGAGAAACCATATACATGTGATGTTTGTAATAAGAGTTTTAACCAAAAAGGTCCTTTATTAAGTCATATGCATATACATACTGGCGATAGTAGATATAAATGCgaaatatgtaataaatgttTTGCTACAAAAACTGCAATAAGTAGTCATTTACGTATACATACTGGTGAGAAACCATATAAATGTGATGTTTGTAATAAGAGTTTTAACGTAAGTGGTACTTTAGTAAGACATTTACGTATACATACTGGTGAGAAACCATATACATGTAATGTTTGTAATAAGAGTTTTAACGATAGAGGTAATTTAGTAAAACATATGCGTATACATACTGGTGAGAAACCATATACATGTGATGTTTGTAATAAGAGTTTTAACCGAAAAGGTCCTTTAGTAATACATTTACGTATACATACTGGTGAGAAACCATACAAATGTGATATTTGTAATAAGAACTTTAGCCAAAGTAGTGATTTAGTAAGACATTTACGTATACATACTGGTGAGAAACCATATAAATGTGATATTTGTAATAAGAGTTTTAGCCAAAGTAGTGATTTAGTAAGACATTTACGTATACATACTGGTGAGAAACCTTATAAATGTGATGTttgtaataaaagttttattcacGGTAATCATTTAGTTAAACATTTACGTATACACACTGGTGAGAAACCATATAAATGTGATGTTTGTAATAAGAGTTTTAACCAAAGCAGTACTTTAGTAAAACATATGCGTATACATACTGGTGAGAAACCATATAAATGTGATGTTTGTAATAAGAGTTTTACCGAAAGTAGTTCTTTAGTAAGACATTTACGTATTCACACTGATGAGAAACCATATATATGTGATGTTTGTAATAAGAGTTTTAATCAAAAACCTCATTTAGTAAGACATTTACGTGTACATACTGGTGAAAAACCATATAAATGTGATGTTTGTAATAAGAGTTTTAATCAAAAACCTCATTTAGTAAGACATTTACGTGTACATACTGGTGAAAAACCATATAAATGTGATGTTTGTAATAAGAGTTTTACCCAAAGTGGtgattttgtaaaacatttacgTATACATACTGGTAAGAAACCATATACATGTGATGTTTGTAATAAGAGTTTTAACGATAGAGttactttTGTAGAAGAAATGTACCTGTTGCCTGTTGAATTGTTAGGTACTTTAGTGTAA
Protein Sequence
MPGRTLRQKTRVSYYEPEEPALDEYIFCEECSDYVFEYCAIHGPLLVIPDDKLSSKNSVPAIVPRAALTIPHVFLHLAPSYIPGAGIGVFSTLTLPRGVRFGPYRGQRTDGVDSKYCWQIYDRNNKRSHVVDAADSQQSNWMRYVNCARHWREQNLVAFQYKGQLYYRTIKIIPRFTELLVFYGSEFANSLHIDLGAYNAPKGYARKFGAPKKPNQKENYEDNTQNKHKKCKTVEANSEQDTVIIKKAITKRKILIKDNVDNIKKRKHTFVAPTINLHKEKPDTPLNVDTSNVNVNVADKTITRHRVIDVKNKNYNKQNLVAPINKFCRDMHEKRAKDIIENINDISSINKLTTNIKHDNVEIANKCKEKKEVTENKVDSNNNVNISKTKFNECSVCHKSFTSKTYLDKHLRIHTGEKPYKCDVCNKSFNFKHHLVTHLRIHTDQKPYTCDVCNKSFKRSDSLVTHMRIHTGEKPYKCDVCNKSFNRNDNLVTHMRIHTGEKSYTCVICNKSFTENGTLVKHLRIHTGEKPYTCDICNKSFNLKHHLVQHLRIHTGEKPYKCDVCNKSFSQNCDLVKHLRIHTGEKPYTCDVCNKSFNQKGPLLSHMHIHTGDSRYKCEICNKCFATKTAISSHLRIHTGEKPYKCDVCNKSFNVSGTLVRHLRIHTGEKPYTCNVCNKSFNDRGNLVKHMRIHTGEKPYTCDVCNKSFNRKGPLVIHLRIHTGEKPYKCDICNKNFSQSSDLVRHLRIHTGEKPYKCDICNKSFSQSSDLVRHLRIHTGEKPYKCDVCNKSFIHGNHLVKHLRIHTGEKPYKCDVCNKSFNQSSTLVKHMRIHTGEKPYKCDVCNKSFTESSSLVRHLRIHTDEKPYICDVCNKSFNQKPHLVRHLRVHTGEKPYKCDVCNKSFNQKPHLVRHLRVHTGEKPYKCDVCNKSFTQSGDFVKHLRIHTGKKPYTCDVCNKSFNDRVTFVEEMYLLPVELLGTLV*

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01333055;
90% Identity
iTF_01333055;
80% Identity
iTF_01333055;