Basic Information

Gene Symbol
-
Assembly
GCA_947086415.1
Location
OX352315.1:3028645-3047056[-]

Transcription Factor Domain

TF Family
zf-GAGA
Domain
zf-GAGA domain
PFAM
PF09237
TF Group
Zinc-Coordinating Group
Description
Members of this family bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 24 0.29 7.2e+03 -1.4 0.0 42 52 10 20 7 21 0.90
2 24 0.011 2.8e+02 3.1 0.1 42 52 33 43 29 45 0.92
3 24 0.011 2.8e+02 3.1 0.1 42 52 56 66 52 68 0.92
4 24 0.011 2.8e+02 3.1 0.1 42 52 79 89 75 91 0.92
5 24 0.011 2.8e+02 3.1 0.1 42 52 102 112 98 114 0.92
6 24 0.011 2.8e+02 3.1 0.1 42 52 125 135 121 137 0.92
7 24 0.011 2.8e+02 3.1 0.1 42 52 148 158 144 160 0.92
8 24 0.011 2.8e+02 3.1 0.1 42 52 171 181 167 183 0.92
9 24 0.011 2.8e+02 3.1 0.1 42 52 194 204 190 206 0.92
10 24 0.011 2.8e+02 3.1 0.1 42 52 217 227 213 229 0.92
11 24 0.011 2.8e+02 3.1 0.1 42 52 240 250 236 252 0.92
12 24 0.011 2.8e+02 3.1 0.1 42 52 263 273 259 275 0.92
13 24 0.011 2.8e+02 3.1 0.1 42 52 286 296 282 298 0.92
14 24 0.011 2.8e+02 3.1 0.1 42 52 309 319 305 321 0.92
15 24 0.011 2.8e+02 3.1 0.1 42 52 332 342 328 344 0.92
16 24 0.011 2.8e+02 3.1 0.1 42 52 355 365 351 367 0.92
17 24 0.011 2.8e+02 3.1 0.1 42 52 378 388 374 390 0.92
18 24 0.011 2.8e+02 3.1 0.1 42 52 401 411 397 413 0.92
19 24 0.011 2.8e+02 3.1 0.1 42 52 424 434 420 436 0.92
20 24 0.011 2.8e+02 3.1 0.1 42 52 447 457 443 459 0.92
21 24 0.011 2.8e+02 3.1 0.1 42 52 470 480 466 482 0.92
22 24 0.011 2.8e+02 3.1 0.1 42 52 493 503 489 505 0.92
23 24 0.011 2.8e+02 3.1 0.1 42 52 516 526 512 528 0.92
24 24 0.67 1.7e+04 -2.5 0.0 16 26 618 628 617 629 0.83

Sequence Information

Coding Sequence
ATGGAGCCTCTAGCGGCGCGCGGCTGGAGAGACCTGGCGCTGAGGGTGTTCAGCAAGCCGGAGCCCGAACAGGGCGTGCTGGCGGCGCGCGGCTGGAGACACCTGGCGCTGCGGGTGTTCAGCAAGCCGGAGCCCGAACAGGGCGTGCTGGCGGCGCGCGGCTGGAGACACCTGGCGCTGAGGGTGTTCAGCAAGCCGGAGCCCGAACAGGGCGTGCTGGCGGCGCGCGGCTGGAGACACCTGGCGCTGAGGGTGTTCAGCAAGCCGGAGCCCGAACAGGGCGTGCTGGCGGCGCGCGGCTGGAGACACCTGGCGCTGAGGGTGTTCAGCAAGCCGGAGCCCGAACAGGGCGTGCTGGCGGCGCGCGGCTGGAGACACCTGGCGCTGAGGGTGTTCAGCAAGCCGGAGCCCGAACAGGGCGTGCTGGCGGCGCGCGGCTGGAGACACCTGGCGCTGAGGGTGTTCAGCAAGCCGGAGCCCGAACAGGGCGTGCTGGCGGCGCGCGGCTGGAGACACCTGGCGCTGAGGGTGTTCAGCAAGCCGGAGCCCGAACAGGGCGTGCTGGCGGCGCGCGGCTGGAGACACCTGGCGCTGAGGGTGTTCAGCAAGCCGGAGCCCGAACAGGGCGTGCTGGCGGCGCGCGGCTGGAGACACCTGGCGCTGCGGGTGTTCAGCAAGCCGGAGCCCGAACAGGGCGTGCTGGCGGCGCGCGGCTGGAGACACCTGGCGCTCAGGGTGTTCAGCAAGCCGGAGCCCGAACAGGGCGTGCTGGCGGCGCGCGGCTGGAGACACCTGGCGCTGAGGGTTTTCAGCAAGCCGGAGCCCGAACAGGGCGTGCTGGCGGCGCGCGGCTGGAGACACCTGGCGCTGCGGGTGTTCAGCAAGCCGGAGCCCGAACAGGGCGTGCTGGCGGCGCGCGGCTGGAGACACCTGGCGCTGAGGGTGTTCAGCAAGCCGGAGCCCGAACAGGGCGTGCTGGCGGCGCGCGGCTGGAGACACCTGGCGCTGAGGGTGTTCAGCAAGCCGGAGCCCGAACAGGGCGTGCTGGCGGCGCGCGGCTGGAGACACCTGGCGCTGCGGGTGTTCAGCAAGCCGGAGCCCGAACAGGGCGTGCTGGCGGCGCGCGGCTGGAGACACCTGGCGCTGAGGGTGTTCAGCAAGCCGGAGCCCGAACAGGGCGTGCTGGCGGCGCGCGGCTGGAGACACCTGGCGCTGAGGGTGTTCAGCAAGCCGGAGCCCGAACAGGGCGTGCTGGCGGCGCGCGGCTGGAGACACCTGGCGCTGCGGGTGTTCAGCAAGCCGGAGCCCGAACAGGGCGTGCTGGCGGCGCGCGGCTGGAGACACCTGGCGCTGAGGGTGTTCAGCAAGCCGGAGCCCGAACAGGGCGTGCTGGCGGCGCGCGGCTGGAGACACCTGGCGCTGAGGGTGTTCAGCAAGCCGGAGCCCGAACAGGGCGTGCTGGCGGCGCGCGGCTGGAGACACCTGGCGCTGCGGGTGTTCAGCAAGCCGGAGCCCGAACAGGGCGTGCTGGCGGCGCGCGGCTGGAGACACCTGGCGCTGAGGGTGTTCAGCAAGCCGGAGCCCGAACAGGGCGTGCTGGAAGCCGGTAGAACGTTGCCAGTGAACGAAGACGCGTCTTTGGATCCCAGCATAGCGAACCTAGCGCCGGGGAGACCGGTGCAGGAACGATTAGAACACTTGCTGAGGCTGTTGAGAATCAATGCCACGGCACATGTAGTTTCAGAGTGGCCAGCGCCGGACGAGACGAGCCGGTGGATGTCAGACAACTCGGACAACTCCATATACCAAAGAGTGCCTATATCTTACCTGCAGAAGGTGAACAGTATAATCCGAAGCCGCAGCGGTAGTGAAACAGCCGTGACATTCCTACAACTACCTGAGCCGCCTCGATTGACGGCTCAACTTAGTGATACTGACAAAACTATATGTGGACAATATCTACAGGTACTAGACGAATTGACGAAAGAGCTGTCTCCGACGATACTTGTACGAGGATTGAAGCCCGTCACCTCCACTGCTCTATAA
Protein Sequence
MEPLAARGWRDLALRVFSKPEPEQGVLAARGWRHLALRVFSKPEPEQGVLAARGWRHLALRVFSKPEPEQGVLAARGWRHLALRVFSKPEPEQGVLAARGWRHLALRVFSKPEPEQGVLAARGWRHLALRVFSKPEPEQGVLAARGWRHLALRVFSKPEPEQGVLAARGWRHLALRVFSKPEPEQGVLAARGWRHLALRVFSKPEPEQGVLAARGWRHLALRVFSKPEPEQGVLAARGWRHLALRVFSKPEPEQGVLAARGWRHLALRVFSKPEPEQGVLAARGWRHLALRVFSKPEPEQGVLAARGWRHLALRVFSKPEPEQGVLAARGWRHLALRVFSKPEPEQGVLAARGWRHLALRVFSKPEPEQGVLAARGWRHLALRVFSKPEPEQGVLAARGWRHLALRVFSKPEPEQGVLAARGWRHLALRVFSKPEPEQGVLAARGWRHLALRVFSKPEPEQGVLAARGWRHLALRVFSKPEPEQGVLAARGWRHLALRVFSKPEPEQGVLAARGWRHLALRVFSKPEPEQGVLEAGRTLPVNEDASLDPSIANLAPGRPVQERLEHLLRLLRINATAHVVSEWPAPDETSRWMSDNSDNSIYQRVPISYLQKVNSIIRSRSGSETAVTFLQLPEPPRLTAQLSDTDKTICGQYLQVLDELTKELSPTILVRGLKPVTSTAL

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-