Avir000407.1
Basic Information
- Insect
- Altica viridicyanea
- Gene Symbol
- -
- Assembly
- None
- Location
- GWHAMMQ00001009:153168-156686[+]
Transcription Factor Domain
- TF Family
- zf-GAGA
- Domain
- zf-GAGA domain
- PFAM
- PF09237
- TF Group
- Zinc-Coordinating Group
- Description
- Members of this family bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 12 0.0028 0.22 11.6 0.2 26 45 233 252 230 255 0.91 2 12 0.27 21 5.3 0.1 18 46 253 281 251 286 0.88 3 12 0.11 9 6.5 0.2 25 44 288 307 282 311 0.86 4 12 0.18 14 5.9 0.1 21 52 312 343 308 343 0.86 5 12 0.00029 0.023 14.8 0.8 21 48 340 367 336 371 0.86 6 12 0.23 19 5.5 0.2 22 44 369 391 367 398 0.90 7 12 3.7 2.9e+02 1.7 0.1 22 45 425 448 415 454 0.84 8 12 0.22 17 5.6 0.3 25 52 456 483 447 483 0.84 9 12 0.058 4.6 7.4 0.1 21 51 480 510 476 512 0.86 10 12 1.5 1.2e+02 2.9 0.1 22 44 509 531 504 534 0.90 11 12 0.005 0.39 10.9 0.1 20 48 535 563 528 566 0.84 12 12 0.28 22 5.2 0.3 22 45 566 589 561 596 0.86
Sequence Information
- Coding Sequence
- ATGGATGTTTTTAAAGATGCATCAAACCCAGTAATGAGTAATAAAATCGGAACGGATTCTCAAGATTTATTGTCTAATATTAAAACAGAAGAAATGGATATTTTAGAAGAATCTAAACTTGAATTCGAGAACCTGCATCGATTCGACCTTTGTAAGATCAAGACGGAAAGTTGTGAAGAAAAAGTTTCTCCAACTAAAAGTAATGTTGTGCTAAATGTTGATTCAAAAGAGGATATGCAAGAAAGAATAAGTGATATCAAAATTAAGACAGAAAATTTGCAGAGTGAATTTAATTTCCCTGTTTGTAGAAATGATTTTGGTGAAAATCGACAAACAGTAGAAACAATCAAAATGTATTCTAAGGAAAAATGTTTCCTATCTGCACCAAAAAAAGGGCAACCTCACATTGGAAAATTCTTTGAAACATGTACCTCTTGTGGACAATTTTTCTCTGATAAATTGCTATTTTTAAACCATTTCTATATTTATCACTTTTTAGTTAATTCTAAAAAATGTGTTAACTTGAGAAGAAATAATTCTGCTACAAAACAAATTAACCTAGAAAATCTTAGTTATTTAAATGAACAGATTAGGACTAATAATGGACAAAAATCCTTTAAAAGTGAACATTGGTCAAAATGTTTGAAGCGAAATAGTGTCTTAAATCAACTTATTAGAATTCACACTGGTGATGTAAAATGTAAAATGTGTTATAAATTTTATACAAGTTCTAGAAATTTAAAAAGGCATATTAATAGAAACATTGGTGAAAAACCTTTCACTTGTAAATTTTGTTCAAAATGTTTCATAGCATCCTGTGCTTTAAAAAAACATATGAGAATTCACTCTAACGAGAAACTTTTTAAATGTGAAATTTGTACAAAATGTTTTATAGAATCCAGAGATTTAAAAAGGCATATTAGAACTCACACTGGTGAAAAACCTTTCAGATGTGAAATTTGTTCAAAGTGTTTTATAGAGTCCAGTAAGTTAAAAAAACATATTAGAATTCACACTAACGAGAAACCTTTTAAATGCGAAATTTGTTCAAAATGTTTCAGACAGAACAGTAATTTAAGAAGTCATATTAGAGTTCACTTTGACGAGAAACCTTTTAAATGTGAAATTTGTTCCAAATGTTTCAGAAGATCTAGTTATTTAAAACTGCATATTAGAACTCACACGGGCGAAAAATTATTCAAATGTGAAATTTGTTCAAAATATTTTACACGAAAAAGTAAGTTAAAAGAACATAATAGAATTCACACTAACGAGAAACTTTTTAAATGTGAAATTTGTTCCAAATGTTTCACACAATCTAGTTATTTAAAAACTCATGCTAGAATTCACACTAACGAGAAACTTTTTAAATGTGAAATTTGTTCAAAATGCTTTATAGGCTCCAATAATTTAAGAACACATATTAGAATTCACAATAACGAGAAACCTTTTAAATGTAAAATTTGTTCAAAATGTTTCACAGATCAAAGTAATTTAAAAAAGCATATTAGAATTCACACTGATGACAAACCTTTTGAATGTGAAATTTGTTCAAAATGTTTCACAGACCGAAGTAATTTAAAAAAACATATTAGAAGTCACACTGGTGAAAAACCTTTCAGTTGTGAAATTTGTTCAAAATCTTTCTCAGAAAAAAGAAATTTAAAACAGCATATAATTAGAAAACACACTGGTGAAAAGTCATTCAACTGTGAAATTTGTTTAAAATGTTTCACACAACAAAGTGATTTAAAACGACATATTAGAACTCACACTCGGGAAAAAACTTTTAAATGCAAAATTTGTACAAAATGTTTCACAGAATCTGATGATTTAAAAATACATGTAAGAATTCACACTGACGAGAAAGCATTTTAA
- Protein Sequence
- MDVFKDASNPVMSNKIGTDSQDLLSNIKTEEMDILEESKLEFENLHRFDLCKIKTESCEEKVSPTKSNVVLNVDSKEDMQERISDIKIKTENLQSEFNFPVCRNDFGENRQTVETIKMYSKEKCFLSAPKKGQPHIGKFFETCTSCGQFFSDKLLFLNHFYIYHFLVNSKKCVNLRRNNSATKQINLENLSYLNEQIRTNNGQKSFKSEHWSKCLKRNSVLNQLIRIHTGDVKCKMCYKFYTSSRNLKRHINRNIGEKPFTCKFCSKCFIASCALKKHMRIHSNEKLFKCEICTKCFIESRDLKRHIRTHTGEKPFRCEICSKCFIESSKLKKHIRIHTNEKPFKCEICSKCFRQNSNLRSHIRVHFDEKPFKCEICSKCFRRSSYLKLHIRTHTGEKLFKCEICSKYFTRKSKLKEHNRIHTNEKLFKCEICSKCFTQSSYLKTHARIHTNEKLFKCEICSKCFIGSNNLRTHIRIHNNEKPFKCKICSKCFTDQSNLKKHIRIHTDDKPFECEICSKCFTDRSNLKKHIRSHTGEKPFSCEICSKSFSEKRNLKQHIIRKHTGEKSFNCEICLKCFTQQSDLKRHIRTHTREKTFKCKICTKCFTESDDLKIHVRIHTDEKAF
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00056812;
- 90% Identity
- iTF_00056812;
- 80% Identity
- iTF_00056812;