Basic Information

Gene Symbol
-
Assembly
None
Location
GWHAMMQ00001009:153168-156686[+]
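The location string above can be split into its parts programmatically. The following is a minimal sketch, not the database's own parser: the `Location` type and `parse_location` function are hypothetical names, and the span calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str


# Matches strings of the form "<sequence id>:<start>-<end>[<strand>]".
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into sequence id, start, end and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["seqid"], int(match["start"]), int(match["end"]), match["strand"]
    )


loc = parse_location("GWHAMMQ00001009:153168-156686[+]")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = loc.end - loc.start + 1
print(loc, span)
```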

Transcription Factor Domain

TF Family
zf-GAGA
Domain
zf-GAGA domain
PFAM
PF09237
TF Group
Zinc-Coordinating Group
Description
Members of this family bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [1].
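To illustrate the 5'-GAGAG-3' consensus described above, the sketch below scans a DNA string for exact matches on both strands. It is a simple string search only: the function names and the example sequence are made up for illustration, and real binding-site prediction would normally allow mismatches or use a position weight matrix.

```python
from typing import List, Tuple

CONSENSUS = "GAGAG"  # 5'-GAGAG-3' consensus taken from the description

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_sites(dna: str, motif: str = CONSENSUS) -> List[Tuple[int, str]]:
    """Return (0-based start, strand) for every exact motif occurrence.

    Overlapping occurrences are reported. For "-" strand hits the start is
    the leftmost position of the site on the given (forward) sequence.
    """
    dna = dna.upper()
    hits = []
    for strand, pattern in (("+", motif), ("-", reverse_complement(motif))):
        start = dna.find(pattern)
        while start != -1:
            hits.append((start, strand))
            start = dna.find(pattern, start + 1)
    return sorted(hits)


# Hypothetical example sequence, not taken from the record.
sites = find_sites("TTGAGAGAGTTCTCTCAA")
print(sites)
```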
Hmmscan Out
#   of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to   acc
1   12    0.0028      0.22   11.6   0.2        26      45       233     252       230     255  0.91
2   12      0.27        21    5.3   0.1        18      46       253     281       251     286  0.88
3   12      0.11         9    6.5   0.2        25      44       288     307       282     311  0.86
4   12      0.18        14    5.9   0.1        21      52       312     343       308     343  0.86
5   12   0.00029     0.023   14.8   0.8        21      48       340     367       336     371  0.86
6   12      0.23        19    5.5   0.2        22      44       369     391       367     398  0.90
7   12       3.7   2.9e+02    1.7   0.1        22      45       425     448       415     454  0.84
8   12      0.22        17    5.6   0.3        25      52       456     483       447     483  0.84
9   12     0.058       4.6    7.4   0.1        21      51       480     510       476     512  0.86
10  12       1.5   1.2e+02    2.9   0.1        22      44       509     531       504     534  0.90
11  12     0.005      0.39   10.9   0.1        20      48       535     563       528     566  0.84
12  12      0.28        22    5.2   0.3        22      45       566     589       561     596  0.86
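The domain hits above can be read into records and filtered by independent E-value. This is a minimal sketch: the `DomainHit` class and `parse_hits` function are hypothetical helpers, and the cutoff of 1.0 is an illustrative assumption rather than a threshold used by the database.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class DomainHit:
    index: int
    c_evalue: float
    i_evalue: float
    score: float
    ali_from: int
    ali_to: int


# Data rows copied from the hmmscan table above (header omitted).
ROWS = """\
1 12 0.0028 0.22 11.6 0.2 26 45 233 252 230 255 0.91
2 12 0.27 21 5.3 0.1 18 46 253 281 251 286 0.88
3 12 0.11 9 6.5 0.2 25 44 288 307 282 311 0.86
4 12 0.18 14 5.9 0.1 21 52 312 343 308 343 0.86
5 12 0.00029 0.023 14.8 0.8 21 48 340 367 336 371 0.86
6 12 0.23 19 5.5 0.2 22 44 369 391 367 398 0.90
7 12 3.7 2.9e+02 1.7 0.1 22 45 425 448 415 454 0.84
8 12 0.22 17 5.6 0.3 25 52 456 483 447 483 0.84
9 12 0.058 4.6 7.4 0.1 21 51 480 510 476 512 0.86
10 12 1.5 1.2e+02 2.9 0.1 22 44 509 531 504 534 0.90
11 12 0.005 0.39 10.9 0.1 20 48 535 563 528 566 0.84
12 12 0.28 22 5.2 0.3 22 45 566 589 561 596 0.86
"""


def parse_hits(text: str) -> List[DomainHit]:
    """Parse whitespace-separated rows: #, of, c-Evalue, i-Evalue, score, bias,
    hmm from/to, ali from/to, env from/to, acc."""
    hits = []
    for line in text.strip().splitlines():
        f = line.split()
        hits.append(
            DomainHit(int(f[0]), float(f[2]), float(f[3]), float(f[4]),
                      int(f[8]), int(f[9]))
        )
    return hits


I_EVALUE_CUTOFF = 1.0  # illustrative assumption, not a database setting

hits = parse_hits(ROWS)
strong = [h for h in hits if h.i_evalue < I_EVALUE_CUTOFF]
for h in strong:
    print(h.index, h.i_evalue, h.score, f"{h.ali_from}-{h.ali_to}")
```

With this cutoff only domains 1, 5 and 11 remain; the other hits have independent E-values above 1.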

Sequence Information

Coding Sequence
ATGGATGTTTTTAAAGATGCATCAAACCCAGTAATGAGTAATAAAATCGGAACGGATTCTCAAGATTTATTGTCTAATATTAAAACAGAAGAAATGGATATTTTAGAAGAATCTAAACTTGAATTCGAGAACCTGCATCGATTCGACCTTTGTAAGATCAAGACGGAAAGTTGTGAAGAAAAAGTTTCTCCAACTAAAAGTAATGTTGTGCTAAATGTTGATTCAAAAGAGGATATGCAAGAAAGAATAAGTGATATCAAAATTAAGACAGAAAATTTGCAGAGTGAATTTAATTTCCCTGTTTGTAGAAATGATTTTGGTGAAAATCGACAAACAGTAGAAACAATCAAAATGTATTCTAAGGAAAAATGTTTCCTATCTGCACCAAAAAAAGGGCAACCTCACATTGGAAAATTCTTTGAAACATGTACCTCTTGTGGACAATTTTTCTCTGATAAATTGCTATTTTTAAACCATTTCTATATTTATCACTTTTTAGTTAATTCTAAAAAATGTGTTAACTTGAGAAGAAATAATTCTGCTACAAAACAAATTAACCTAGAAAATCTTAGTTATTTAAATGAACAGATTAGGACTAATAATGGACAAAAATCCTTTAAAAGTGAACATTGGTCAAAATGTTTGAAGCGAAATAGTGTCTTAAATCAACTTATTAGAATTCACACTGGTGATGTAAAATGTAAAATGTGTTATAAATTTTATACAAGTTCTAGAAATTTAAAAAGGCATATTAATAGAAACATTGGTGAAAAACCTTTCACTTGTAAATTTTGTTCAAAATGTTTCATAGCATCCTGTGCTTTAAAAAAACATATGAGAATTCACTCTAACGAGAAACTTTTTAAATGTGAAATTTGTACAAAATGTTTTATAGAATCCAGAGATTTAAAAAGGCATATTAGAACTCACACTGGTGAAAAACCTTTCAGATGTGAAATTTGTTCAAAGTGTTTTATAGAGTCCAGTAAGTTAAAAAAACATATTAGAATTCACACTAACGAGAAACCTTTTAAATGCGAAATTTGTTCAAAATGTTTCAGACAGAACAGTAATTTAAGAAGTCATATTAGAGTTCACTTTGACGAGAAACCTTTTAAATGTGAAATTTGTTCCAAATGTTTCAGAAGATCTAGTTATTTAAAACTGCATATTAGAACTCACACGGGCGAAAAATTATTCAAATGTGAAATTTGTTCAAAATATTTTACACGAAAAAGTAAGTTAAAAGAACATAATAGAATTCACACTAACGAGAAACTTTTTAAATGTGAAATTTGTTCCAAATGTTTCACACAATCTAGTTATTTAAAAACTCATGCTAGAATTCACACTAACGAGAAACTTTTTAAATGTGAAATTTGTTCAAAATGCTTTATAGGCTCCAATAATTTAAGAACACATATTAGAATTCACAATAACGAGAAACCTTTTAAATGTAAAATTTGTTCAAAATGTTTCACAGATCAAAGTAATTTAAAAAAGCATATTAGAATTCACACTGATGACAAACCTTTTGAATGTGAAATTTGTTCAAAATGTTTCACAGACCGAAGTAATTTAAAAAAACATATTAGAAGTCACACTGGTGAAAAACCTTTCAGTTGTGAAATTTGTTCAAAATCTTTCTCAGAAAAAAGAAATTTAAAACAGCATATAATTAGAAAACACACTGGTGAAAAGTCATTCAACTGTGAAATTTGTTTAAAATGTTTCACACAACAAAGTGATTTAAAACGACATATTAGAACTCACACTCGGGAAAAAACTTTTAAATGCAAAATTTGTACAAAATGTTTCACAGAATCTGATGATTTAAAAATACATGTAAGAATTCACACTGACGAGAAAGCATTTTAA
Protein Sequence
MDVFKDASNPVMSNKIGTDSQDLLSNIKTEEMDILEESKLEFENLHRFDLCKIKTESCEEKVSPTKSNVVLNVDSKEDMQERISDIKIKTENLQSEFNFPVCRNDFGENRQTVETIKMYSKEKCFLSAPKKGQPHIGKFFETCTSCGQFFSDKLLFLNHFYIYHFLVNSKKCVNLRRNNSATKQINLENLSYLNEQIRTNNGQKSFKSEHWSKCLKRNSVLNQLIRIHTGDVKCKMCYKFYTSSRNLKRHINRNIGEKPFTCKFCSKCFIASCALKKHMRIHSNEKLFKCEICTKCFIESRDLKRHIRTHTGEKPFRCEICSKCFIESSKLKKHIRIHTNEKPFKCEICSKCFRQNSNLRSHIRVHFDEKPFKCEICSKCFRRSSYLKLHIRTHTGEKLFKCEICSKYFTRKSKLKEHNRIHTNEKLFKCEICSKCFTQSSYLKTHARIHTNEKLFKCEICSKCFIGSNNLRTHIRIHNNEKPFKCKICSKCFTDQSNLKKHIRIHTDDKPFECEICSKCFTDRSNLKKHIRSHTGEKPFSCEICSKSFSEKRNLKQHIIRKHTGEKSFNCEICLKCFTQQSDLKRHIRTHTREKTFKCKICTKCFTESDDLKIHVRIHTDEKAF
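The relationship between the coding sequence and the protein sequence can be checked by translating codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly. It is shown on the first 30 nucleotides and the final 9 nucleotides of the coding sequence above, and the `translate` function is a hypothetical helper.

```python
from itertools import product

# Standard genetic code (assumed), in the conventional TCAG ordering:
# first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become "X";
    a trailing incomplete codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGATGTTTTTAAAGATGCATCAAACCCA"  # first 30 nt of the coding sequence
cds_end = "GCATTTTAA"                        # last 9 nt, including the stop codon
print(translate(cds_start), translate(cds_end))
```

The two fragments translate to the first ten and last two residues of the protein sequence above.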

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00056812;
90% Identity
iTF_00056812;
80% Identity
iTF_00056812;