Basic Information

Gene Symbol
-
Assembly
None
Location
GWHAMMQ00001438:249826-251369[-]

Transcription Factor Domain

TF Family
zf-GAGA
Domain
zf-GAGA domain
PFAM
PF09237
TF Group
Zinc-Coordinating Group
Description
Members of this family bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [1].
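The consensus described above can be located in a DNA sequence with a simple exact-match scan. The following is a minimal sketch, assuming exact matches to 5'-GAGAG-3' on either strand; the helper names `reverse_complement` and `find_gaga_sites` are hypothetical and not part of this record, and a real binding-site search would more likely use a position weight matrix than exact matching.

```python
# Hypothetical helpers: exact-match scan for the GAGAG consensus on both strands.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement; unknown bases become 'N'."""
    return "".join(COMPLEMENT.get(base, "N") for base in reversed(seq.upper()))


def find_gaga_sites(seq: str, motif: str = "GAGAG") -> list[tuple[int, str]]:
    """Return (0-based start, strand) for every exact motif match.

    A '+' hit means the motif itself occurs at that position; a '-' hit
    means its reverse complement occurs there (the motif is on the other strand).
    """
    seq = seq.upper()
    motif = motif.upper()
    rc_motif = reverse_complement(motif)
    width = len(motif)
    hits = []
    for start in range(len(seq) - width + 1):
        window = seq[start:start + width]
        if window == motif:
            hits.append((start, "+"))
        elif window == rc_motif:
            hits.append((start, "-"))
    return hits


if __name__ == "__main__":
    # Toy sequence made up for illustration, not taken from this record.
    print(find_gaga_sites("TTGAGAGAGTTCTCTCAA"))
```

Overlapping matches are reported separately, so a GAGAGAG run yields two forward-strand hits.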
Hmmscan Out
#   of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to   acc
1   12       7.8   6.2e+02    0.6   0.3        20      34       100     114        82     132  0.74
2   12     0.091       7.2    6.8   0.0        21      44       157     180       148     184  0.87
3   12     0.039       3.1    8.0   0.1        21      44       185     208       181     212  0.90
4   12       1.1        85    3.4   0.1        22      44       214     236       209     240  0.86
5   12     0.032       2.6    8.2   0.0        21      44       241     264       237     268  0.90
6   12     0.003      0.24   11.6   0.1        21      44       269     292       265     296  0.90
7   12      0.08       6.3    7.0   0.0        21      45       297     321       292     328  0.86
8   12     0.037       2.9    8.1   0.1        21      44       325     348       321     354  0.90
9   12       1.1        85    3.4   0.1        22      44       354     376       349     380  0.86
10  12     0.032       2.6    8.2   0.0        21      44       381     404       377     408  0.90
11  12    0.0036      0.29   11.3   0.0        21      46       409     434       405     440  0.86
12  12      0.79        63    3.8   0.0        21      44       437     460       433     464  0.91
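The per-domain rows above can be parsed and ranked programmatically. This is a minimal sketch, assuming whitespace-separated rows with 13 fields in the column order shown (domain number, total, c-Evalue, i-Evalue, score, bias, hmm from/to, ali from/to, env from/to, acc). The `DomainHit` type, the `parse_domain_rows` helper, and the i-Evalue cutoff of 1.0 are illustrative assumptions, not settings used by the database.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """One per-domain row (hypothetical container for illustration)."""
    index: int
    c_evalue: float
    i_evalue: float
    score: float
    ali_from: int
    ali_to: int


# Rows copied from the table in this record.
ROWS = """\
1 12 7.8 6.2e+02 0.6 0.3 20 34 100 114 82 132 0.74
2 12 0.091 7.2 6.8 0.0 21 44 157 180 148 184 0.87
3 12 0.039 3.1 8.0 0.1 21 44 185 208 181 212 0.90
4 12 1.1 85 3.4 0.1 22 44 214 236 209 240 0.86
5 12 0.032 2.6 8.2 0.0 21 44 241 264 237 268 0.90
6 12 0.003 0.24 11.6 0.1 21 44 269 292 265 296 0.90
7 12 0.08 6.3 7.0 0.0 21 45 297 321 292 328 0.86
8 12 0.037 2.9 8.1 0.1 21 44 325 348 321 354 0.90
9 12 1.1 85 3.4 0.1 22 44 354 376 349 380 0.86
10 12 0.032 2.6 8.2 0.0 21 44 381 404 377 408 0.90
11 12 0.0036 0.29 11.3 0.0 21 46 409 434 405 440 0.86
12 12 0.79 63 3.8 0.0 21 44 437 460 433 464 0.91
"""


def parse_domain_rows(text: str) -> list[DomainHit]:
    """Parse whitespace-separated domain rows; skip lines without 13 fields."""
    hits = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 13:
            continue
        hits.append(DomainHit(
            index=int(fields[0]),
            c_evalue=float(fields[2]),
            i_evalue=float(fields[3]),
            score=float(fields[4]),
            ali_from=int(fields[8]),
            ali_to=int(fields[9]),
        ))
    return hits


if __name__ == "__main__":
    hits = parse_domain_rows(ROWS)
    best = max(hits, key=lambda h: h.score)
    # Assumed cutoff for illustration only.
    confident = [h.index for h in hits if h.i_evalue < 1.0]
    print(f"best domain: #{best.index} (score {best.score})")
    print(f"domains with i-Evalue < 1.0: {confident}")
```

With the assumed cutoff, only two of the twelve domain hits pass, which is a reminder that most individual matches in this table are weak.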

Sequence Information

Coding Sequence
ATGTGTGCTTATATGTTTTCAGGTTTCCTATCAGCGGCAAAAAAATGGCAACCTTACTTTGGAAAATTCTATGGAACATGTTCATCTTGTGGAAAAATTTTCTATGATCAATTGCTATTTTTAACCCATTTTTATATTTATCACTTTTTCGTTAATTCGAAAACATCTGTGAACTTAAAAAGGACTAATTCTGCCACAAAATATATGAACCGACAATATCTTAGTAATTTAAACAAAAAGATAAAGGCGGACAGGTTTACACAACAAAGTAATTTAAAAAAATATACTAAAACACAAACTGGCGAGAAACCTTTTAAATGTAAAATTTGTTCGAAATGTTTCACACATCCCAGTACTTTAAAACTTCATAATAGAACTCACACTGGCGAAAAACCATTTAGATGTAAGTTTTGTTCAAAATTTTTCACATACTCCAGTAGTTTAAAAGTTCATAATAGAACTCACACTGACGAGAAACCTTTTAAATGTAAAATATGTTCGAAATGTTTCACAGATTCCAGTAATTTAAATAGCCATATTAGAACTCACACTGGTGAAAAACCATTTAGTTGTCAAATTTGTTCGAAATGTTTCACAGATTCCAGTAATTTAAAAGCTCATATTAGAACTCACACTGGCGAGAAAAATTTTAAATGTAAAATTTGTTCGAAATGTTTTACACAATCCAGTCACTTAAAAACTCATGTTAGAACTCACACTGATGAGAAACCTTTTAAATGTAAGATTTGTTCGAAAGATTTCACACAATCCAGTATTTTAAAAAGACATATGAGAACTCACACTAACGAAAAACCTTTTAAATGTAAAATTTGTTCAAAATGTTTAACATCTTCCAGGAATTTAAAAGATCATATTAGAACTCACACTGGTGAGAAACCTTTTAAATGTAAAATTTGTTCAAAATGTTTCACAGATTCCAGTAATTTAAATAGCCATATTAGAACTCACACTGGTGAAAAACCGTTTAGTTGTCAAATTTGTTCGAAATGTTTCACCGATTCCAGTAATTTAAAAGCTCATATTAGAACTCACACTGGCGAGAAAAATTTTAAATGTAAAATTTGTTCGAAATGTTTTACACAATCCAGTCACTTAAAAACTCATGTTAGAACTCACACTGATGAGAAACCTTTTAAATGTAAGATTTGTTCGAAAGATTTCACACAATCCAGTATTTTAAAAAGACATATGAGAACTCACACTAACGAAAAACCTTTTAAATGTAAAATTTGTTCGAAATGTTTCACACAATCCGGTAATTTAACAAAGCACATTAAAACTCACACTGATGAAAAACCTTTTAAATGTAAGTTTTGTTCAAAATGTTTTACACAATCCGGTGAATTAAAATCTCATATTACCACTCACACTGACGAAAAGCTGCTTAAATATAAAATTTGA
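A coding sequence such as the one above can be translated to check it against the listed protein. The sketch below assumes the standard genetic code (NCBI translation table 1) and a sequence that starts in frame; `translate` is a hypothetical helper, and whether this organism uses the standard code is not stated in the record.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate an in-frame coding sequence.

    Codons containing non-ACGT characters become 'X'. When to_stop is True,
    translation ends at the first stop codon, which is not included.
    """
    cds = cds.upper()
    protein = []
    for start in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE.get(cds[start:start + 3], "X")
        if residue == "*" and to_stop:
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First six and last seven codons of the coding sequence in this record.
    print(translate("ATGTGTGCTTATATGTTT"))
    print(translate("CTGCTTAAATATAAAATTTGA"))
```

The two fragments translate to the first six and last six residues of the protein sequence listed in this record, with the final TGA read as the stop codon.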
Protein Sequence
MCAYMFSGFLSAAKKWQPYFGKFYGTCSSCGKIFYDQLLFLTHFYIYHFFVNSKTSVNLKRTNSATKYMNRQYLSNLNKKIKADRFTQQSNLKKYTKTQTGEKPFKCKICSKCFTHPSTLKLHNRTHTGEKPFRCKFCSKFFTYSSSLKVHNRTHTDEKPFKCKICSKCFTDSSNLNSHIRTHTGEKPFSCQICSKCFTDSSNLKAHIRTHTGEKNFKCKICSKCFTQSSHLKTHVRTHTDEKPFKCKICSKDFTQSSILKRHMRTHTNEKPFKCKICSKCLTSSRNLKDHIRTHTGEKPFKCKICSKCFTDSSNLNSHIRTHTGEKPFSCQICSKCFTDSSNLKAHIRTHTGEKNFKCKICSKCFTQSSHLKTHVRTHTDEKPFKCKICSKDFTQSSILKRHMRTHTNEKPFKCKICSKCFTQSGNLTKHIKTHTDEKPFKCKFCSKCFTQSGELKSHITTHTDEKLLKYKI
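Repeated Cys2-His2 units in a protein like the one above can be spotted with a regular expression. This is a rough sketch, assuming the commonly cited classical spacing C-X(2-4)-C-X(12)-H-X(3-5)-H; the pattern and the `find_c2h2_fingers` helper are illustrative assumptions, not the Pfam profile used for the hmmscan results, so hits are only a coarse guide.

```python
import re

# Assumed classical C2H2 spacing: C-X(2-4)-C-X(12)-H-X(3-5)-H.
C2H2_PATTERN = re.compile(r"C.{2,4}C.{12}H.{3,5}H")


def find_c2h2_fingers(protein: str) -> list[tuple[int, int, str]]:
    """Return (1-based start, 1-based end, matched text) for each
    non-overlapping match of the assumed C2H2 pattern."""
    return [
        (match.start() + 1, match.end(), match.group())
        for match in C2H2_PATTERN.finditer(protein.upper())
    ]


if __name__ == "__main__":
    # Fragment copied from the protein sequence in this record.
    fragment = "KPFKCKICSKCFTHPSTLKLHNRTHTGEKPFRCKFCSKFFTYSSSLKVHNRTHTDEKP"
    for start, end, text in find_c2h2_fingers(fragment):
        print(start, end, text)
```

Coordinates are relative to the string passed in, so running the helper on the full protein gives positions comparable to the alignment coordinates in the hmmscan table.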

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00056816;
90% Identity
iTF_00056816;
80% Identity
iTF_00056816;