Acus000781.1
Basic Information
- Insect
- Arma custos
- Gene Symbol
- -
- Assembly
- GCA_037127475.1
- Location
- CM073759.1:12028209-12037066[-]
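The Location field packs the sequence accession, coordinates, and strand into one string. The following is a minimal sketch of splitting it into parts. The `Location` type and `parse_location` helper are hypothetical names introduced here, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for one Location field."""
    seqid: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a string shaped like 'SEQID:START-END[STRAND]'."""
    match = re.fullmatch(r"(\S+):(\d+)-(\d+)\[([+-])\]", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("CM073759.1:12028209-12037066[-]")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = loc.end - loc.start + 1
print(loc.seqid, loc.strand, span)
```

Under that coordinate assumption the locus spans 8858 bp on the minus strand of CM073759.1.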
Transcription Factor Domain
- TF Family
- zf-GAGA
- Domain
- zf-GAGA domain
- PFAM
- PF09237
- TF Group
- Zinc-Coordinating Group
- Description
- Members of this family bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [1].
- Hmmscan Out
 #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
 1  20      0.07   1.7e+02    4.5   0.3        22      49        12      39         9      43  0.87
 2  20    0.0023       5.6    9.2   0.5        18      49        37      68        31      73  0.87
 3  20     0.094   2.3e+02    4.1   0.4        18      48        66      96        64     100  0.86
 4  20   6.6e-05      0.16   14.2   0.1        16      49        93     126        85     132  0.87
 5  20    0.0018       4.4    9.5   0.2        21      49       156     184       150     189  0.88
 6  20    0.0063        15    7.8   0.3        21      48       214     241       205     244  0.91
 7  20    0.0013       3.2   10.0   0.3        20      49       242     271       240     277  0.87
 8  20       3.2   7.8e+03   -0.9   0.1        20      51       271     302       269     304  0.81
 9  20    0.0031       7.5    8.8   0.4        26      49       306     329       300     333  0.86
10  20      0.18   4.3e+02    3.2   0.4        18      31       327     340       325     357  0.82
11  20      0.12     3e+02    3.7   0.1        24      48       362     386       355     389  0.84
12  20     0.038        91    5.3   0.1        16      48       383     415       380     421  0.83
13  20       0.2   4.9e+02    3.0   0.1        21      48       417     444       413     447  0.81
14  20     0.031        74    5.6   0.6        17      48       442     473       437     475  0.88
15  20    0.0014       3.4    9.9   0.3        18      49       472     503       470     507  0.89
16  20     0.014        33    6.7   0.6        18      51       501     534       499     538  0.87
17  20      0.43     1e+03    1.9   0.1        26      48       542     564       539     568  0.89
18  20       2.6   6.2e+03   -0.6   0.0        26      48       579     601       577     606  0.78
19  20      0.43     1e+03    1.9   0.2        16      48       598     630       591     633  0.81
20  20      0.42     1e+03    2.0   0.4        21      48       632     659       627     662  0.86
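The Hmmscan Out rows above can be read programmatically once split on whitespace. The sketch below copies the first five rows from this record and keeps only a few of the thirteen columns. The `DomainHit` type, the `parse_hits` helper, and the independent E-value cutoff of 10 are illustrative assumptions, not thresholds used by the database.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    """Hypothetical subset of one per-domain hmmscan row."""
    index: int
    i_evalue: float
    score: float
    ali_from: int
    ali_to: int


# First five rows of the Hmmscan Out field, columns in the order shown there:
# # of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
ROWS = """
1 20 0.07 1.7e+02 4.5 0.3 22 49 12 39 9 43 0.87
2 20 0.0023 5.6 9.2 0.5 18 49 37 68 31 73 0.87
3 20 0.094 2.3e+02 4.1 0.4 18 48 66 96 64 100 0.86
4 20 6.6e-05 0.16 14.2 0.1 16 49 93 126 85 132 0.87
5 20 0.0018 4.4 9.5 0.2 21 49 156 184 150 189 0.88
"""


def parse_hits(text: str) -> List[DomainHit]:
    """Split each non-empty line on whitespace and pick the columns of interest."""
    hits = []
    for line in text.strip().splitlines():
        f = line.split()
        hits.append(DomainHit(int(f[0]), float(f[3]), float(f[4]), int(f[8]), int(f[9])))
    return hits


hits = parse_hits(ROWS)

# Assumption: an arbitrary illustrative cutoff on the independent E-value.
MAX_I_EVALUE = 10.0
kept = [h for h in hits if h.i_evalue <= MAX_I_EVALUE]
best = max(hits, key=lambda h: h.score)

print([h.index for h in kept], best.index, best.score)
```

Among these five rows, domain 4 has the highest score (14.2) and the lowest independent E-value (0.16).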
Sequence Information
- Coding Sequence
- ATGAAAAACCATGTAATGGCCCTTCATGCTGGTGAGAAGCCTtttcaatgccctcattgtgattataaatctgtacattCTGGAcatatgaaaaaacatatacaacatcgtcataccggtgagaagccttttcaatgtcctcattgtgattataaatctgtaacatctggaaatatgaaaaaacatatacaacgtcgtcataccggtgagaagccttttcaatgtcctcattgtgattataaatctgtaacatctggacaaatgaaaaatcatgtcCAAGCTCGTCATTCCGGTGAGAAGCCTtttcaatgtcctcattgtgattataaatctgtaacatctggaaatatgaaaaaacatatacaacttcgtcataccggtgagaagccTTTTCTATGtactcattgtgattataaatctgtaagatctggacaaatgaaaaaccatGTACAAGCttgtcataccggtgagaagccctatcaatgccctcattgtgattataaatccgTAGCATCtggaaatttgaaaaaacatatacaacatcgtcataccggtgagaagccTTTTCAATGtgctcattgtgattataaatctgtaacatctggagcattgaaaaaccatatacaagctcaTCATACGGGTGAGAAGCCTtttcaatgccctcattgtgatcatcaatctgtaacatctggaaatatgaaaaaacatatacaacgtcgtcataccggtgagaagccttttcaatgtcctcattgtgattataaatctgtagcatctggaaatttgaaaaaacatatacaacatcgtcataccggtgagaagccTTTTCAATGtactcattgtgattataaatctgtaacatctgtaCAATTGAAATACCATATACAAGCTCATCATACGGGTAAAaagtcctatcaatgccctcattgcgATTATAAATCTACAACATTTAGAAATTTGAAAAGACAtttaatggcccgtcatacaagtgagaagcctcatcaatgtcctcattgtgattataaatctgtggAATCAGGAGCTATGAAAATACACTACATGGCCTGGCACAAAGGTGAGAAGCtatatcaatgccctcattgtgattataaatctgtacaatctggaaatatgaaaaaacatgtaagggcccatcatactggtgagaagccgtttcaatgccctcattgtgattataaatctgtaacatctggaaatatgaaaaaacatgtaatggcccatcatactggtgagaagccttttcaatgccctcattgtgattataaatctataaaatctGGAGATGTGAAAAACcatgtaatggcccgtcatactggtgagaagcctttccaatgccctcattgtgattataaatctgtacattCTGGAcatatgaaaaaacatatacaacatcgtcataccggtgagaagccttttcaatgtcctcattgtgattataaatttgtaacatctggaaatatgaaaaaacatatacaacgtcgtcataccggtgagaagccttttcaatgtcctcattgtgattataaatctgtaacatctggagaaatgaaaaatcatgtacaagctcgtcatactgctcgtcataccggtgtgaagtcctatcaatgccctcattgtgattataaatctgtacaaTCTGGAcatatgaaaaaccatatacaagctcgtcataccgGTTCCAAAAATAGTTGCCATAAACTTCTGAAGTCTCATCAATGTCcttattgtgattataaatctgtaagatCTGGACAAATTAAAGATCATGTACAAtctcgtcataccggtgagaagtccaaccaatgccctcattgtgattataagtcaGTACAATTAGGGAATATGAATAGTCATATAATGGCCCATCATACCAGCGTGAGGCCTtttcaatgccctcattgtgattataaatctgcaAGATCTGGAAGCATAAAAAACCATGTGCAAGCTCGTCATACCAAGTTATTAAAGAAGGTT
GAAGAAAACACAAGAGACCAGAACGGCTGCAAACTTTGGCACGAATTAAGATTTGGTAGAATAATGGCCTCCAAAGCTTATGAAGTTAAAACATGTAAAACAAGTGATGGTGCTTTggtatag
- Protein Sequence
- MKNHVMALHAGEKPFQCPHCDYKSVHSGHMKKHIQHRHTGEKPFQCPHCDYKSVTSGNMKKHIQRRHTGEKPFQCPHCDYKSVTSGQMKNHVQARHSGEKPFQCPHCDYKSVTSGNMKKHIQLRHTGEKPFLCTHCDYKSVRSGQMKNHVQACHTGEKPYQCPHCDYKSVASGNLKKHIQHRHTGEKPFQCAHCDYKSVTSGALKNHIQAHHTGEKPFQCPHCDHQSVTSGNMKKHIQRRHTGEKPFQCPHCDYKSVASGNLKKHIQHRHTGEKPFQCTHCDYKSVTSVQLKYHIQAHHTGKKSYQCPHCDYKSTTFRNLKRHLMARHTSEKPHQCPHCDYKSVESGAMKIHYMAWHKGEKLYQCPHCDYKSVQSGNMKKHVRAHHTGEKPFQCPHCDYKSVTSGNMKKHVMAHHTGEKPFQCPHCDYKSIKSGDVKNHVMARHTGEKPFQCPHCDYKSVHSGHMKKHIQHRHTGEKPFQCPHCDYKFVTSGNMKKHIQRRHTGEKPFQCPHCDYKSVTSGEMKNHVQARHTARHTGVKSYQCPHCDYKSVQSGHMKNHIQARHTGSKNSCHKLLKSHQCPYCDYKSVRSGQIKDHVQSRHTGEKSNQCPHCDYKSVQLGNMNSHIMAHHTSVRPFQCPHCDYKSARSGSIKNHVQARHTKLLKKVEENTRDQNGCKLWHELRFGRIMASKAYEVKTCKTSDGALV
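The protein sequence above shows a repeating Cys/His spacing consistent with the Cys2-His2 core mentioned in the domain description. As a rough sketch, a regular expression can locate such repeats. The pattern `C-x2-C-x12-H-x3..5-H` is a simplified assumption chosen for illustration, not the PF09237 profile, and `FRAGMENT` is just the first 73 residues of this record's protein.

```python
import re

# First 73 residues of the Protein Sequence field of this record.
FRAGMENT = (
    "MKNHVMALHAGEKPFQCPHCDYKSVHSGHMKKHIQHRH"
    "TGEKPFQCPHCDYKSVTSGNMKKHIQRRHTGEKPF"
)

# Assumption: simplified C2H2-like spacing, C-x2-C-x12-H-x3..5-H.
C2H2_LIKE = re.compile(r"C.{2}C.{12}H.{3,5}H")

# Report 1-based start positions and the matched residues.
fingers = [(m.start() + 1, m.group()) for m in C2H2_LIKE.finditer(FRAGMENT)]
for start, residues in fingers:
    print(start, residues)
```

On this fragment the pattern matches twice, starting at residues 17 and 46. A regular expression like this only flags candidate repeats; profile-based tools such as hmmscan remain the appropriate way to call domains.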
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00165379;
- 90% Identity
- iTF_00165379;
- 80% Identity
- iTF_00165379;