Basic Information

Gene Symbol
-
Assembly
GCA_030762175.1
Location
CM060830.1:29042487-29070215[+]
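
The Location field packs contig, start, end and strand into one string. A minimal sketch of splitting it apart is given here; the `Locus` type and `parse_location` helper are hypothetical names, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """Hypothetical container for one parsed Location field."""
    contig: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Locus:
    """Parse a string shaped like 'CONTIG:START-END[STRAND]'."""
    m = re.fullmatch(r"([^:\s]+):(\d+)-(\d+)\[([+-])\]", text.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = m.groups()
    return Locus(contig, int(start), int(end), strand)


locus = parse_location("CM060830.1:29042487-29070215[+]")
# Assumption: coordinates are 1-based and inclusive.
span = locus.end - locus.start + 1
print(locus.contig, locus.strand, span)
```

Under that assumption the locus spans 27,729 bp on the plus strand of CM060830.1.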

Transcription Factor Domain

TF Family
zf-GAGA
Domain
zf-GAGA domain
PFAM
PF09237
TF Group
Zinc-Coordinating Group
Description
Members of this family bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [1].
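
To make the consensus in the description concrete, the sketch below scans a DNA string for 5'-GAGAG-3' on both strands. The `find_sites` helper and the test sequence are made up for illustration; exact string matching is a simplification and does not model binding affinity.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase/lowercase DNA string."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def find_sites(seq: str, motif: str = "GAGAG") -> list[tuple[int, str]]:
    """Return (0-based start, strand) for every exact motif occurrence.

    Overlapping occurrences are reported; '-' means the motif is found
    on the reverse strand (i.e. its reverse complement is in `seq`).
    """
    seq = seq.upper()
    rc = reverse_complement(motif)
    hits = []
    for i in range(len(seq) - len(motif) + 1):
        window = seq[i:i + len(motif)]
        if window == motif:
            hits.append((i, "+"))
        if window == rc and rc != motif:
            hits.append((i, "-"))
    return hits


# Invented example sequence, not taken from this record.
sites = find_sites("TTGAGAGAGCCTCTCAA")
print(sites)  # [(2, '+'), (4, '+'), (10, '-')]
```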
Hmmscan Out
#   of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to  acc
1   15  5         3.8e+03   -0.1   0.0   18        44      335       361     333       364     0.87
2   15  0.022     17        7.4    0.3   21        52      366       397     363       399     0.91
3   15  3.2       2.4e+03   0.5    0.1   22        47      395       420     392       425     0.81
4   15  0.15      1.1e+02   4.8    0.0   22        46      423       447     420       455     0.83
5   15  0.086     66        5.5    0.1   23        49      480       506     476       509     0.84
6   15  0.064     49        5.9    0.1   20        51      505       536     503       538     0.86
7   15  0.36      2.7e+02   3.6    0.1   22        44      535       557     532       568     0.87
8   15  6.4       4.9e+03   -0.5   0.1   22        52      616       646     612       647     0.81
9   15  0.12      90        5.1    0.1   23        45      645       667     641       675     0.88
10  15  0.048     36        6.4    0.1   26        52      676       702     670       703     0.84
11  15  0.04      31        6.6    0.1   20        52      698       730     696       732     0.87
12  15  2.6       2e+03     0.8    0.0   23        45      729       751     726       759     0.85
13  15  0.38      2.9e+02   3.5    0.2   23        52      785       814     780       816     0.87
14  15  0.18      1.4e+02   4.5    0.0   22        51      840       869     837       872     0.85
15  15  4.3       3.3e+03   0.1    0.0   23        44      869       890     865       893     0.89
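
The per-domain rows above can be read programmatically. The following sketch parses the 15 rows of this record into typed tuples, then picks the best-scoring domain and the domains under an i-Evalue cutoff. The field names, the `DomainHit` type, and the cutoff of 50 are assumptions chosen for illustration only, not a recommended significance threshold.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """Hypothetical record for one per-domain hmmscan row."""
    index: int
    of: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


ROWS = """\
1 15 5 3.8e+03 -0.1 0.0 18 44 335 361 333 364 0.87
2 15 0.022 17 7.4 0.3 21 52 366 397 363 399 0.91
3 15 3.2 2.4e+03 0.5 0.1 22 47 395 420 392 425 0.81
4 15 0.15 1.1e+02 4.8 0.0 22 46 423 447 420 455 0.83
5 15 0.086 66 5.5 0.1 23 49 480 506 476 509 0.84
6 15 0.064 49 5.9 0.1 20 51 505 536 503 538 0.86
7 15 0.36 2.7e+02 3.6 0.1 22 44 535 557 532 568 0.87
8 15 6.4 4.9e+03 -0.5 0.1 22 52 616 646 612 647 0.81
9 15 0.12 90 5.1 0.1 23 45 645 667 641 675 0.88
10 15 0.048 36 6.4 0.1 26 52 676 702 670 703 0.84
11 15 0.04 31 6.6 0.1 20 52 698 730 696 732 0.87
12 15 2.6 2e+03 0.8 0.0 23 45 729 751 726 759 0.85
13 15 0.38 2.9e+02 3.5 0.2 23 52 785 814 780 816 0.87
14 15 0.18 1.4e+02 4.5 0.0 22 51 840 869 837 872 0.85
15 15 4.3 3.3e+03 0.1 0.0 23 44 869 890 865 893 0.89
"""


def parse_row(line: str) -> DomainHit:
    """Convert one whitespace-delimited row into a DomainHit."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(int(f[0]), int(f[1]), *map(float, f[2:6]),
                     *map(int, f[6:12]), float(f[12]))


hits = [parse_row(line) for line in ROWS.splitlines()]

I_EVALUE_CUTOFF = 50.0  # arbitrary illustrative value, not a standard
kept = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]
best = max(hits, key=lambda h: h.score)

print([h.index for h in kept])  # [2, 6, 10, 11]
print(best.index, best.score, best.ali_from, best.ali_to)  # 2 7.4 366 397
```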

Sequence Information

Coding Sequence
ATGGATATTAAATTAGAGATAAAGGAAGAAGTACTTGACAATGATCAGATGATACCAGCAAGTAAAGGAGAAAACTCTTGTAAAACTGCTCATGAAGATGTGGTGGGGAATGAGATAAAGGAGGAGAAAGAAGTTGATATTTCTTTAATGAATGAGGTTATCACAGCTGAGGAATTGCATTTTCCATACAATGGTCCAAGATTGAAACAGAGCTCAGTTTTAGCTCAGGATTTATCAAGTTCCTCTAGAAATGTACCATCACTAGGAGGCAGCCTGTGCAATAGAAACTTAAAGGATTTTGAGCCGCAGAGTGCCGGTCAACAATATAATAATGCTGAAGAGCTTATACAACCAAAGGAAAAACAAGCAATGCACAACACTGACAACAAACTGGAACTATATTATCAGAACTTAAGAACGACTCAAGAgTCTCCAATAGTCATGAATGAAAGTACTGACCATGAGAAAATTAACATCAAACTAGAGATAAAGGAAGAACTGCTGGACCCTAATATGTATTCACTTGAAATGAAGCCAGAAGCTATGGAAGAAAATGCATGTATTGGTACTCATAGTGCTGTTTTGAAGAATGAGGAGTGGCATTGCAATGGAATCCAATGCTTACCTCTTACAGAATTTTGCTCTAGTGCCATCACTATAGTGAAAATCGACTGCAATAGGAAAAAAGGCTTATTATTGAGTTCTATCTTTTTCCGCGGCAGCAATCCACCCACCAGGAAACTAGAACTGTTGGTTGCACATTACACTCAAAGAGGGGAACACACCATCTTTGGGTGTGATGCCAACTCTCACCATGAAACTTGGGGCAGCAAAGATATCAGCAAAAGGGAAATTCCAGGAGAACCTGGTCGGAGCCCTAATTGGAGGATCTCTCAGATAGGATTGAGTAGATTGATAGGGTTAGATCTTTGgccaaatatAGAAGACATTTCCAGCTGTTTTGACAGCTCATCCACCACTTCTGTTCATGTTACACTGGTAAGAAAAGAAAGCAACAAGCACCACAAGTGCAAAGTATGTGAGAAGAGCTTTATCCAAACATCTCATTTGAAAGAACATCTATTGATTCATGAGGGTGAAAAgccccataaatgtgaaatttgtaagAAGAGCTTTCCCCTCGTATCTAATTTGAGAAaccatcttttgattcataagcACAAGAAGCCCCATAGGTGTGcaatttgtggaaagagctttatccaGTTATCCTATTTcaggacacatcttttgattcatgagggtaagaagccccATAAGTGTGAAGTATGTGGGAAAAACTTTACCCAGGCTTCTACTctgaggaaacatcttttgactcatgagggcaagaagccacattaTTGTGAAgcttgtgggaaaagctttacccaGGTATCTCATTTGAGGGaacattttttgattcatgagggtaggAAGCCACATCAGTGTGAAACCTGcaggaagagctttacccaggcatctaATTTGAAGAagcatcttttgattcatcagggaaagaagccacataagtgtgaagtttgtgggaaaagttttACCCAGGCATCTCATTTGAGagaacatcttttaattcatgatggcaagaagccacataagtgtgaagtttgtgggaagagctttacacaTTCCTCTAATTTCAGGAAGCATGTTTTAATTCATAATGGCAAGCAatcacataagtgtgaagtttgtgggaagagctttgctCACTCTTCAAGTTTGAGTGGACATCTTATTATTCATAAGGTCaacaaatgtgaaattaataatGAGAACTTTACCAACACATCTAATTTGGGAAAACATCTTTTTAGTCAAGAAGACAAGAACCCGTATAAGTGTGAAGTTTGCGggaaaagtttcaccagagcaccTTATTTAAGAGTACATCTTTTAATCCATAATAACAGGAAACCCCATAAGTGTAAAATTTGTAGAAAGAATTTTACCTACTCATTtaatttgaagaaacatctttt
ggTTCATGAGGGCAGAAAGTCACACAAATGTGAAATTTGCAGGAAGAGCTTTAGTCAGACATCTAATTTGAAGAagcatcttttgattcatcagggcaagaagccacataagtgtgaagtttgtggaaagagctttacccagacatctCATTTGaggcaacatcttttgattcatgaaggcaagaagccacataaatgtgaaatttgtgggaagagctttacccatgcATCTAATTTCAAAAAGCATATTTTGATTCATAATGGCAAGAAATCTCACgaatgtgaattttgtgggaaAACCTTTAATCGGGCATTTAATTTgaaggaacatcttttgattcatgagggcaggaagccacataaatgtgaaatttgcagaaaagactttACCCAGGCATCTCATTTGAAAAATCACCTTTTGATTCATCAGGGCatgaagccacataaatgtgaagtttgtgggaaaagctttagcCGGTTATTCTATTTcaggacacatcttttgattcacaagGGCAAAAAGCCCCATGAGTGTGAGGTATGTGGAAAAAGCTTTACCCAGACATTCAATTTGAaggaacatcttttaattcatgagggcagGAAACCACAtaattgtgaagtttgtgggaagaactttacccaGATATCTcatttgaagaaacatcttttgattcattaG
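
The coding sequence above is listed in mixed case and is wrapped across a line break, so it needs light cleaning before translation. This sketch strips whitespace, uppercases the sequence, and translates it; the standard genetic code is an assumption (the record does not name a translation table), and `clean` and `translate` are hypothetical helpers. The three fragments are copied from the start of the listed sequence, from the bases on either side of the line break, and from its end.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove all whitespace (including line breaks) and uppercase."""
    return "".join(seq.split()).upper()


def translate(cds: str) -> str:
    """Translate in frame 0, stopping at the first stop codon.

    Codons with unexpected characters become 'X'; a trailing partial
    codon is ignored.
    """
    cds = clean(cds)
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


start = translate("ATGGATATTAAATTAGAGATAAAGGAAGAA")
junction = translate("aatttgaagaaacatctttt\nggTTCATGAGGGC")
tail = translate("catcttttgattcattaG")
print(start, junction, tail)  # MDIKLEIKEE NLKKHLLVHEG HLLIH
```

Each fragment translates to a stretch found in the protein sequence of this record, which suggests the line break falls inside one continuous reading frame.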
Protein Sequence
MDIKLEIKEEVLDNDQMIPASKGENSCKTAHEDVVGNEIKEEKEVDISLMNEVITAEELHFPYNGPRLKQSSVLAQDLSSSSRNVPSLGGSLCNRNLKDFEPQSAGQQYNNAEELIQPKEKQAMHNTDNKLELYYQNLRTTQESPIVMNESTDHEKINIKLEIKEELLDPNMYSLEMKPEAMEENACIGTHSAVLKNEEWHCNGIQCLPLTEFCSSAITIVKIDCNRKKGLLLSSIFFRGSNPPTRKLELLVAHYTQRGEHTIFGCDANSHHETWGSKDISKREIPGEPGRSPNWRISQIGLSRLIGLDLWPNIEDISSCFDSSSTTSVHVTLVRKESNKHHKCKVCEKSFIQTSHLKEHLLIHEGEKPHKCEICKKSFPLVSNLRNHLLIHKHKKPHRCAICGKSFIQLSYFRTHLLIHEGKKPHKCEVCGKNFTQASTLRKHLLTHEGKKPHYCEACGKSFTQVSHLREHFLIHEGRKPHQCETCRKSFTQASNLKKHLLIHQGKKPHKCEVCGKSFTQASHLREHLLIHDGKKPHKCEVCGKSFTHSSNFRKHVLIHNGKQSHKCEVCGKSFAHSSSLSGHLIIHKVNKCEINNENFTNTSNLGKHLFSQEDKNPYKCEVCGKSFTRAPYLRVHLLIHNNRKPHKCKICRKNFTYSFNLKKHLLVHEGRKSHKCEICRKSFSQTSNLKKHLLIHQGKKPHKCEVCGKSFTQTSHLRQHLLIHEGKKPHKCEICGKSFTHASNFKKHILIHNGKKSHECEFCGKTFNRAFNLKEHLLIHEGRKPHKCEICRKDFTQASHLKNHLLIHQGMKPHKCEVCGKSFSRLFYFRTHLLIHKGKKPHECEVCGKSFTQTFNLKEHLLIHEGRKPHNCEVCGKNFTQISHLKKHLLIH
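
The family description mentions a Cys2-His2 zinc finger core. As a rough illustration, the sketch below searches a protein string for the simplified spacing pattern C-x(2,4)-C-x(12)-H-x(3,5)-H. This pattern is an assumption made for the example and is not the Pfam PF09237 profile, so its matches are not equivalent to the hmmscan hits. The excerpt is a 51-residue stretch copied from the protein sequence above.

```python
import re

# Simplified C2H2 spacing pattern (an assumption, not the Pfam HMM).
C2H2 = re.compile(r"C.{2,4}C.{12}H.{3,5}H")


def find_c2h2(protein: str) -> list[tuple[int, str]]:
    """Return (0-based start, matched text) for non-overlapping matches."""
    return [(m.start(), m.group()) for m in C2H2.finditer(protein.upper())]


excerpt = "HKCKVCEKSFIQTSHLKEHLLIHEGEKPHKCEICKKSFPLVSNLRNHLLIH"
fingers = find_c2h2(excerpt)
for start, text in fingers:
    print(start, text)
# 2 CKVCEKSFIQTSHLKEHLLIH
# 30 CEICKKSFPLVSNLRNHLLIH
```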

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00995418;
90% Identity
iTF_00995418;
80% Identity
iTF_00995418;
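
To illustrate what the identity levels above mean, this sketch computes percent identity between two pre-aligned sequences and reports which of the 80/90/100% levels the pair would reach. It does not reproduce MMseqs2, which derives identity from its own alignments and applies further clustering criteria; the `percent_identity` helper and the two short peptides are invented for the example.

```python
def percent_identity(a: str, b: str) -> float:
    """Percent identical columns between two equal-length aligned strings.

    Columns where both sequences have a gap ('-') are skipped; a gap
    against a residue counts as a mismatch.
    """
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(x, y) for x, y in zip(a.upper(), b.upper())
               if not (x == "-" and y == "-")]
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y and x != "-")
    return 100 * matches / len(columns)


# Invented pair differing at one of ten positions.
identity = percent_identity("HKCEVCGKSF", "HKCEICGKSF")
levels = [t for t in (80, 90, 100) if identity >= t]
print(identity, levels)  # 90.0 [80, 90]
```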