Avir017181.1
Basic Information
- Insect
- Altica viridicyanea
- Gene Symbol
- -
- Assembly
- None
- Location
- GWHAMMQ00000933:231117-248763[-]
Transcription Factor Domain
- TF Family
- zf-GAGA
- Domain
- zf-GAGA domain
- PFAM
- PF09237
- TF Group
- Zinc-Coordinating Group
- Description
- Members of this family bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 13 0.1 8.1 6.6 0.1 25 49 199 222 188 227 0.79 2 13 4.4 3.5e+02 1.4 0.0 22 48 224 250 220 255 0.71 3 13 0.45 35 4.6 0.0 21 45 251 275 247 282 0.86 4 13 0.046 3.7 7.7 0.1 22 44 280 302 274 306 0.89 5 13 0.21 17 5.6 0.1 21 44 307 330 303 334 0.90 6 13 0.095 7.5 6.7 0.1 21 46 335 360 330 367 0.86 7 13 0.00037 0.029 14.5 0.1 21 52 391 422 383 422 0.86 8 13 0.057 4.5 7.5 0.1 21 45 419 443 416 450 0.87 9 13 0.55 44 4.3 0.1 21 45 447 471 443 474 0.90 10 13 0.069 5.5 7.2 0.0 21 46 475 500 471 506 0.87 11 13 0.019 1.5 9.0 0.1 21 47 503 529 499 534 0.85 12 13 2.7 2.2e+02 2.1 0.1 21 44 531 554 527 558 0.89 13 13 0.21 16 5.7 0.2 21 45 559 583 551 588 0.88
Sequence Information
- Coding Sequence
- ATGAATGTTTCCCAACAAACAATAAAATCTGACATTAAACTGGAACAAATTGAGGTTTTCGAAGAACATGTTCAAGAAGAGCCCTACAAAAGTAGTATGTTTAAGAGAGAACCGAAATTCGAATACGAAAACTTGCAACAATTCGTATCAGTGGATAGTTCTTTCACTACTAAAAGTGAAATGCTAATTAGCACTGAAAATGAGATTTCAAGTCAAACTAATGATGAGATGAATCTTGAAATTGAAATTAAAAAAGAAGAATTTCACTCCATTTGTAATTTACCGGATTTTAAAACGGAATTTCAAGATTTTGATGAAGTATTACGTTCTGAGCGACGAGTCAAAACATATTCCAGGAAACAGGATCCGAGTTGTAAAAAGCGTTTCCTATCTGCGGTAAAAAAATGGCAATCTCACTTTGGAAAATTCTATGGAACATGTTCAACTTGTGGAAAAATGTTTTTTGATAAATTGCTATTTTTAAACCATTTTTATATTTACCACTTTTTCGTTAATTCGAAAAAATCTGTTAACTTAAAAAGGAATAATTCTAAATTTAACGAACAGATGAGAATTCACAATGAACAAAAATACTTTAAATGCGAAATTTGTTTTCAATGTTACACCACATCTTATAATCTAAAATTACATCTTCAAAGTCATTTTGGTGAAAATTCTGTTAGTTGTGAAATTTGTTCAAAACGTTTCACACGACAAAGTGATTTTAAAAGACATATTAAAACCCACACTGCCGAAAAACCGTTTAAATGTAAGATTTGTTCAAAGAGTTTCACACTTTCCTTTAACTTAAAAATTCATATTAGAACTCATACTGACGATAAACCTTTTACATGTAAAATTTGTTCGAAATATTTCACACATTCCAGTTACTTAAAAAGGCACATTAGAACTCACACTGACGAGAAACCTTTTAAATGTAAATTTTGTTTGAAATGTTTCACTCAATCCATTAACTTAAGTACTCATATTAGAACTCATACTGACGAGAAACCTTTTAAATGTAAAAATTGTTCAAAATGTTTCACACAATCCAGTAACTTAAAAACTCATATTAGGACTCACACTGACGAGAAACCTTTTAAATGTAAGTTTTGTTCAAAATGTTTCACACATTCCAGTAGTTTAAAAGTTCATATTAGCACTCACACTGACAATAAACCTTTTACATGTAAAATATGTTCGAAATGTTTTACACAATCCAATAACTTAAAAACTCATATTAGGATTCACACTAACGAGAAACCTTTTAAATGTAAAATTTGTTCGAAATGTTTCACACAATCTAGTAGTTTAAAAACTCATATTAGAACTCACACTGGCGAGAAACCTTTTAAATGTCAAATTTGTTCGAAATGTTTCAATAGTTCCAGTGACTTAAAAACTCATATTAAAACTCACACTGGCGAAAAACCTTTTAAATGTAAAATTTGTTCGAAATGTTTCACTCAATCCAGTCATTTAAAAACCCATTTTAAAACTCACACTGACGAAAAACCTTTTATATGTAAAATTTGTTCAAAATGTTTTAAAGAATCCTGTAATTTAAAAATGCACATTAGAACTCACACTGCTGAAAAACCTTTTAAATGTAAATTTTGTTCGAAATGTTTCAATCGTTCCAGTGACTTAAAAACTCATATTAGAACTCACACTGGCGAAAAACCTTTTAAATGTTATATTTGTTCGAAATGTTTTACACAATCCAGTCATTTAAATACTCATATTAAAACTCACACTGACAAAAAGCTTTTTAAATAG
- Protein Sequence
- MNVSQQTIKSDIKLEQIEVFEEHVQEEPYKSSMFKREPKFEYENLQQFVSVDSSFTTKSEMLISTENEISSQTNDEMNLEIEIKKEEFHSICNLPDFKTEFQDFDEVLRSERRVKTYSRKQDPSCKKRFLSAVKKWQSHFGKFYGTCSTCGKMFFDKLLFLNHFYIYHFFVNSKKSVNLKRNNSKFNEQMRIHNEQKYFKCEICFQCYTTSYNLKLHLQSHFGENSVSCEICSKRFTRQSDFKRHIKTHTAEKPFKCKICSKSFTLSFNLKIHIRTHTDDKPFTCKICSKYFTHSSYLKRHIRTHTDEKPFKCKFCLKCFTQSINLSTHIRTHTDEKPFKCKNCSKCFTQSSNLKTHIRTHTDEKPFKCKFCSKCFTHSSSLKVHISTHTDNKPFTCKICSKCFTQSNNLKTHIRIHTNEKPFKCKICSKCFTQSSSLKTHIRTHTGEKPFKCQICSKCFNSSSDLKTHIKTHTGEKPFKCKICSKCFTQSSHLKTHFKTHTDEKPFICKICSKCFKESCNLKMHIRTHTAEKPFKCKFCSKCFNRSSDLKTHIRTHTGEKPFKCYICSKCFTQSSHLNTHIKTHTDKKLFK
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00056824;
- 90% Identity
- iTF_00056824;
- 80% Identity
- iTF_00056824;