Basic Information

Transcription Factor Domain

TF Family: zf-GAGA
Domain: zf-GAGA domain
PFAM: PF09237
TF Group: Zinc-Coordinating Group
Description: Members of this family bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [1].
Hmmscan Out: # of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc

1 5 0.12 7.8e+02 0.8 0.2 14 45 101 133 97 136 0.69

2 5 0.012 75 4.0 0.0 18 45 135 162 130 169 0.81

3 5 0.0002 1.2 9.7 0.2 22 46 167 191 162 194 0.90

4 5 3.5e-05 0.22 12.1 0.1 21 52 194 225 191 226 0.89

5 5 0.00095 5.9 7.6 0.1 21 48 222 249 218 255 0.88

#	of	c-Evalue	i-Evalue	score	bias	hmm coord from	hmm coord to	ali coord from	ali coord to	env coord from	env coord to	acc
1	5	0.12	7.8e+02	0.8	0.2	14	45	101	133	97	136	0.69
2	5	0.012	75	4.0	0.0	18	45	135	162	130	169	0.81
3	5	0.0002	1.2	9.7	0.2	22	46	167	191	162	194	0.90
4	5	3.5e-05	0.22	12.1	0.1	21	52	194	225	191	226	0.89
5	5	0.00095	5.9	7.6	0.1	21	48	222	249	218	255	0.88

Coding Sequence: ATGGATGAAAGGAGACACACACATTTTAGCACATTGGAGCCTGATGTTATTATTACTGATGGGAGTCTTCTGGACGATAAATTATCCCTGTTAAAGGACGATATCTTAGAAGATCACGAGGATTCTTTATCGTTAGGTCCGGAAATCTCTATTATCCCTGTTATGGGAAAATCTCCTGGTAAGATTCGCGTTCGTAACTTCCTCGAAGAGGAGAGACGCATGATATTACACAGAGAACGGTCACCAAGAGGTAATATTCGAATAGTAACCCCGGACGAAGTTCTAAAAGCGGAAAATGAGAACTCGTACTCTAGAAGGAAATCTCTGCCACTGGAGAAATGTCACATATGCAAAAAGTTCTTCAGACGAATGAAAACGCATCTTCTAAAACACGAAGAGAAACGAAGGGATCCTAACGATCCCCTAACCTGCAAGTTGTGCATGAAAGCTTTTAACACGTACAGTAATTTAAGTATTCATATGAGGACCCATACAGGGGATAAACCTTATATTTGCGATATTTGTAATAAGTCATTTTCTCAGAGTTGCAACCTAGTTAACCACTTAAGGATACATACAGGGGAAAGACCTTATAAATGCCCGCATTGCGACAGGGCTTTTACACAATCGGGTAATCTTACCAATCATATACGTTTACATACGGACGAGAAACCGTTTAAATGTCATTTCTGCGATCGAGCTTTCACGCAATCGGGGAATCTAAACTCGCACATAAGAAACAATCACAAATTCGATGATCCGAGAAGCATCGGTGGTTTCCattaa
Protein Sequence: MDERRHTHFSTLEPDVIITDGSLLDDKLSLLKDDILEDHEDSLSLGPEISIIPVMGKSPGKIRVRNFLEEERRMILHRERSPRGNIRIVTPDEVLKAENENSYSRRKSLPLEKCHICKKFFRRMKTHLLKHEEKRRDPNDPLTCKLCMKAFNTYSNLSIHMRTHTGDKPYICDICNKSFSQSCNLVNHLRIHTGERPYKCPHCDRAFTQSGNLTNHIRLHTDEKPFKCHFCDRAFTQSGNLNSHIRNNHKFDDPRSIGGFH*

Sequence clustering based on sequence similarity using MMseqs2