Avir000399.1
Basic Information
- Insect
- Altica viridicyanea
- Gene Symbol
- -
- Assembly
- None
- Location
- GWHAMMQ00001009:23654-28571[-]
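The location string above packs the scaffold, coordinates, and strand into a single field. A minimal sketch of splitting it apart follows. The names `Location`, `parse_location`, and `span_length` are hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    scaffold: str
    start: int
    end: int
    strand: str


# Matches strings shaped like "SCAFFOLD:START-END[STRAND]".
LOCATION_RE = re.compile(
    r"^(?P<scaffold>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into scaffold, start, end, and strand."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        scaffold=match["scaffold"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


def span_length(loc: Location) -> int:
    """Genomic span in bases, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("GWHAMMQ00001009:23654-28571[-]")
print(loc, span_length(loc))
```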
Transcription Factor Domain
- TF Family
- zf-GAGA
- Domain
- zf-GAGA domain
- PFAM
- PF09237
- TF Group
- Zinc-Coordinating Group
- Description
- Members of this family bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [1].
- Hmmscan Out
-
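| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|----|----------|----------|-------|------|----------------|--------------|----------------|--------------|----------------|--------------|-----|
| 1 | 17 | 3.9 | 3.1e+02 | 1.6 | 0.0 | 23 | 46 | 234 | 257 | 230 | 263 | 0.84 |
| 2 | 17 | 4.2 | 3.3e+02 | 1.5 | 0.0 | 21 | 44 | 260 | 283 | 257 | 287 | 0.87 |
| 3 | 17 | 0.037 | 2.9 | 8.1 | 0.2 | 20 | 44 | 287 | 311 | 283 | 315 | 0.89 |
| 4 | 17 | 0.22 | 17 | 5.6 | 0.2 | 22 | 44 | 317 | 339 | 311 | 343 | 0.86 |
| 5 | 17 | 0.47 | 37 | 4.5 | 0.2 | 22 | 45 | 345 | 368 | 340 | 373 | 0.87 |
| 6 | 17 | 5.5 | 4.4e+02 | 1.1 | 0.1 | 21 | 45 | 372 | 396 | 368 | 402 | 0.87 |
| 7 | 17 | 0.0035 | 0.28 | 11.3 | 0.1 | 21 | 46 | 400 | 425 | 396 | 429 | 0.90 |
| 8 | 17 | 1.5 | 1.2e+02 | 2.9 | 0.0 | 22 | 46 | 471 | 495 | 466 | 501 | 0.83 |
| 9 | 17 | 0.009 | 0.71 | 10.0 | 0.1 | 21 | 46 | 498 | 523 | 495 | 529 | 0.87 |
| 10 | 17 | 1.2 | 94 | 3.2 | 0.0 | 21 | 46 | 526 | 551 | 522 | 557 | 0.85 |
| 11 | 17 | 0.22 | 17 | 5.6 | 0.1 | 21 | 44 | 554 | 577 | 550 | 581 | 0.91 |
| 12 | 17 | 0.17 | 13 | 5.9 | 0.1 | 21 | 45 | 582 | 606 | 578 | 612 | 0.89 |
| 13 | 17 | 4.1 | 3.3e+02 | 1.5 | 0.1 | 22 | 44 | 611 | 633 | 606 | 637 | 0.86 |
| 14 | 17 | 0.016 | 1.3 | 9.2 | 0.1 | 21 | 45 | 638 | 662 | 634 | 665 | 0.90 |
| 15 | 17 | 0.33 | 26 | 5.0 | 0.0 | 21 | 52 | 666 | 697 | 662 | 698 | 0.88 |
| 16 | 17 | 0.56 | 45 | 4.3 | 0.0 | 21 | 44 | 694 | 717 | 692 | 724 | 0.89 |
| 17 | 17 | 0.61 | 49 | 4.2 | 0.1 | 25 | 45 | 726 | 746 | 719 | 748 | 0.87 |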
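The per-domain hmmscan rows above can be read programmatically. The sketch below parses a few of those rows, copied verbatim, and keeps the hits whose independent E-value falls under a cutoff. The names `DomainHit`, `parse_row`, and `filter_hits` are hypothetical, and the cutoff of 1.0 is an arbitrary illustration, not a threshold used by this database.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_row(line: str) -> DomainHit:
    """Parse one whitespace-separated domain row with 13 fields."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
    return DomainHit(
        int(fields[0]),
        int(fields[1]),
        float(fields[2]),
        float(fields[3]),
        float(fields[4]),
        float(fields[5]),
        *(int(x) for x in fields[6:12]),
        float(fields[12]),
    )


def filter_hits(hits: List[DomainHit], max_i_evalue: float) -> List[DomainHit]:
    """Keep hits whose independent E-value is at or below the cutoff."""
    return [h for h in hits if h.i_evalue <= max_i_evalue]


# A subset of the rows listed in this record (domains 1, 3, 7, 9, 14).
SAMPLE_ROWS = """\
1 17 3.9 3.1e+02 1.6 0.0 23 46 234 257 230 263 0.84
3 17 0.037 2.9 8.1 0.2 20 44 287 311 283 315 0.89
7 17 0.0035 0.28 11.3 0.1 21 46 400 425 396 429 0.90
9 17 0.009 0.71 10.0 0.1 21 46 498 523 495 529 0.87
14 17 0.016 1.3 9.2 0.1 21 45 638 662 634 665 0.90
"""

hits = [parse_row(row) for row in SAMPLE_ROWS.splitlines()]
kept = filter_hits(hits, max_i_evalue=1.0)  # illustrative cutoff only
for hit in kept:
    print(hit.index, hit.i_evalue, hit.ali_from, hit.ali_to)
```

Filtering on the independent E-value rather than the conditional one is a design choice made for this sketch; either column can be passed through the same function with a small change.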
Sequence Information
- Coding Sequence
- ATGGATGTTTTCAAAGATGCATCAAATCCATTAATGAATAATAAAGATGAAAGTAATTCTAAAGATTTGTTATCCAATATTAAATCCGAAGAAATTGATATTGAAGAATGTAAACTCGAATTCGATAACTTGTATGAATTCGATCTTTCTACGATCAAAACGGAAAGTTGTGTAGAAAAAGAAGTTTCCTCAACTAAAACTGATGTGTTAATAAGCAGAGAAACTATAAAAACCGAAAATGTTGATTCAAATGACGGTATTCAAGAAAGAAAAAGTGATATAGAAATTAAGACAGACCTTACAAACCGTCCATTTGATTTCCCTATTAGTAGAAATGATATCCATAATTTTGATGAAAATCGACAGACAGTGCAAACAATTAAAATGTATTCTTGGAAAAAACGTTTCCTATCTGCGGCAAAAAAATGGCATCCTCAATTTGGAAAATACTATGGAAGCTGTTCCTTTTGTGGACAATTTTTCTGTGATAAATTGCTATTTTTAAACCATTTTTATATTTACCACATTCTACTTAATTGTAAAAAATCTTTTAATTTGAGAACGAGTAATTATGCTACAAAACGTATTAACCTAAAAGATTTTGGTAATTTAAAAGAACAGATGAGGACTTACAGTGGACAAAACAGTTTTAAACGACAAAAAGACTTAAAACAGCATATTAAAATTCACACTGGTAAAAAACCTTTCAGATGTAAAATTTGTTCAAAATGTTTCACACGAAAAGGTAACTTAAAAGATCATATTAAAACTCACACAGGTGAAAAACCTTTTAAATGTAAAATTTGTTCAAAATGTTTCACATCTTCTAGTAAATTCAAAGTTCATCTTAGGTCTCACATTAGTGAAAAACCTTACAAATGTGAAATTTGTTCAAAATGTTTTCTACACTCAGATAGTTTAAAAAGGCATATAAGAACTCACACTGGTGAAAAAACTTTCAGATGCGAAATTTGTTCGAAATGTTTCAGAGATTCCAGTAATTTAAATAGCCATATGAGAACTCACACTGGTGAAAAATCGTTTAGATGTCAAATTTGTTCAAAATGTTTCACACAATCCACTAGTTTAAAAAAACATATGAGAACTCACACTGGTGAACAAACTTTCAGATGTGAAATTTGTTCAAAATGTTTTACACAAAACGGACATTTACAAGATCATATTAAAACTCACAAAGGTGAAAAACCTTTCAGATGTGAAATTTGTACAAAATGTTTTAGAAGTTCCAGTAATTTAAAAACTCATATTAAAATTCACAATATGGAATCTGATAAGTTAAACAACCATATTAAAATTGACATTGGTGAAAACCGTTTCAGATGTGGAATTTGTTCGAAATATTTCACTAATTCCAGTAGTTTAAAAGCTCATATTAGAACTCACACTGGCGAGAAATCTTTTAAATGTAAAATTTGTTCAAAATGTTTCACAGATTCCAGTAATTTAAATAGCCATATTAAAACTCACACTGGTGAAAAACCGTTTAGATGCCAAATTTGTTTAAAAAGTTTTATACAATCCAGTGATTTAAAAAGGCATATTAGAACTCACACTGGTGAGAAACCTTTCAGATGTGAAATTTGTTCAAAATGTTTCACAGTAAACAGTAACTTAAAAGATCACATTAAAACTCACACAGGTGAAAAACCTTTTAAATGTAAAATTTGTTCAAAATGTTTCACATCTTCTAGTAATTTAAAAGTTCATATTAGATCTCACATTAGCGAAAAACCTTACAAATGTGAAATTTGTTCAAAATGTTTTCTACACTCTGATAGTTTAAAAAAGCATATAAGAACTCACACTGGTGAAAAAACTTTCAGATGTGAAATTTGTTCGAAATGTTTCAGAGATTTCAGTAATTTAAATAGCCACATAAGAACTCACACTGGTGAAAAACCGTTTAGATGTCAAATTTGTTCAAAAAGTTTTATACAATCCAGTGATTTAAAAAGGCATATTAAAACTCACACTGGT
GAGAAACCTTTCAGATGTGAAATTTGTTCAAAATGTTTCACAGTTAAAGGTAACTTAAAAAATCATATTAGACTTCACAGTGGTGAAAAGCCATTTAAATGTAAAATTTGTTCGAAATGTTTTACACAATCCGGTCACTTAAAAATACATACTAAAACTCATACTAACGAAAAACTTTTTAGATGTAAAGTTTGTTCAAAATGTTTCACACAAGAGGGTAACTTAAAACGACATATTAATAGTTGTGACGGTAAAACGTCCTAA
- Protein Sequence
- MDVFKDASNPLMNNKDESNSKDLLSNIKSEEIDIEECKLEFDNLYEFDLSTIKTESCVEKEVSSTKTDVLISRETIKTENVDSNDGIQERKSDIEIKTDLTNRPFDFPISRNDIHNFDENRQTVQTIKMYSWKKRFLSAAKKWHPQFGKYYGSCSFCGQFFCDKLLFLNHFYIYHILLNCKKSFNLRTSNYATKRINLKDFGNLKEQMRTYSGQNSFKRQKDLKQHIKIHTGKKPFRCKICSKCFTRKGNLKDHIKTHTGEKPFKCKICSKCFTSSSKFKVHLRSHISEKPYKCEICSKCFLHSDSLKRHIRTHTGEKTFRCEICSKCFRDSSNLNSHMRTHTGEKSFRCQICSKCFTQSTSLKKHMRTHTGEQTFRCEICSKCFTQNGHLQDHIKTHKGEKPFRCEICTKCFRSSSNLKTHIKIHNMESDKLNNHIKIDIGENRFRCGICSKYFTNSSSLKAHIRTHTGEKSFKCKICSKCFTDSSNLNSHIKTHTGEKPFRCQICLKSFIQSSDLKRHIRTHTGEKPFRCEICSKCFTVNSNLKDHIKTHTGEKPFKCKICSKCFTSSSNLKVHIRSHISEKPYKCEICSKCFLHSDSLKKHIRTHTGEKTFRCEICSKCFRDFSNLNSHIRTHTGEKPFRCQICSKSFIQSSDLKRHIKTHTGEKPFRCEICSKCFTVKGNLKNHIRLHSGEKPFKCKICSKCFTQSGHLKIHTKTHTNEKLFRCKVCSKCFTQEGNLKRHINSCDGKTS
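The protein sequence above shows a recurring Cys/His spacing of the kind expected for the Cys2-His2 zinc finger core mentioned in the family description. The sketch below scans a fragment of that sequence with a simplified regular expression. The pattern `C.{2,4}C.{12}H.{3,5}H` is an assumed approximation of the classical C2H2 spacing, not the Pfam zf-GAGA model, and `find_c2h2` is a hypothetical helper name.

```python
import re
from typing import List, Tuple

# Simplified C2H2 spacing: C-x(2,4)-C-x(12)-H-x(3,5)-H.
# This is an ASSUMED approximation, not the profile HMM used by hmmscan.
C2H2_RE = re.compile(r"C.{2,4}C.{12}H.{3,5}H")


def find_c2h2(protein: str) -> List[Tuple[int, str]]:
    """Return (1-based start, matched text) for non-overlapping matches."""
    return [(m.start() + 1, m.group()) for m in C2H2_RE.finditer(protein)]


# Fragment copied from the protein sequence in this record.
FRAGMENT = (
    "FRCKICSKCFTRKGNLKDHIKTH"
    "TGEKPFK"
    "CKICSKCFTSSSKFKVHLRSH"
    "ISEKPYK"
    "CEICSKCFLHSDSLKRHIRTH"
)

c2h2_hits = find_c2h2(FRAGMENT)
for start, text in c2h2_hits:
    print(start, text)
```

Running the same function on the full protein sequence gives a rough count of finger-like repeats. That count need not agree with the 17 hmmscan domains, because the two methods score different things.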
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00056793;
- 90% Identity
- iTF_00056793;
- 80% Identity
- iTF_00056793;