Cglo007512.1
Basic Information
- Insect
- Cotesia glomerata
- Gene Symbol
- -
- Assembly
- None
- Location
- Contig25106:3630-6462[+]
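The Location field packs contig, coordinates and strand into one string. A minimal sketch of how it could be split apart follows. It assumes the coordinates are 1-based and inclusive, which this record does not state, and the names `Locus`, `parse_locus` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "Contig25106:3630-6462[+]".
_LOCUS_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_locus(text: str) -> Locus:
    """Split a 'contig:start-end[strand]' string into its parts."""
    match = _LOCUS_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


def span_length(locus: Locus) -> int:
    """Genomic span in bases, ASSUMING 1-based inclusive coordinates."""
    return locus.end - locus.start + 1


locus = parse_locus("Contig25106:3630-6462[+]")
print(locus, span_length(locus))
```

Under the inclusive-coordinate assumption the span is 2833 bases; if the source uses half-open coordinates, the `+ 1` should be dropped.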
Transcription Factor Domain
- TF Family
- zf-GAGA
- Domain
- zf-GAGA domain
- PFAM
- PF09237
- TF Group
- Zinc-Coordinating Group
- Description
- Members of this family bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [1].
- Hmmscan Out
-
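| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 17 | 0.8 | 4.7e+03 | -1.8 | 2.3 | 20 | 43 | 158 | 188 | 135 | 193 | 0.52 |
| 2 | 17 | 0.00044 | 2.6 | 8.6 | 0.0 | 16 | 46 | 189 | 219 | 182 | 225 | 0.81 |
| 3 | 17 | 0.13 | 7.6e+02 | 0.7 | 0.0 | 21 | 47 | 222 | 248 | 220 | 254 | 0.83 |
| 4 | 17 | 0.014 | 81 | 3.8 | 0.0 | 23 | 44 | 252 | 273 | 249 | 276 | 0.91 |
| 5 | 17 | 2.1e-05 | 0.12 | 12.8 | 0.0 | 21 | 48 | 278 | 305 | 273 | 310 | 0.83 |
| 6 | 17 | 0.22 | 1.2e+03 | 0.0 | 0.1 | 22 | 48 | 307 | 332 | 303 | 338 | 0.80 |
| 7 | 17 | 0.085 | 4.9e+02 | 1.3 | 0.0 | 26 | 46 | 339 | 359 | 331 | 367 | 0.87 |
| 8 | 17 | 0.71 | 4.1e+03 | -1.6 | 0.0 | 26 | 44 | 367 | 385 | 361 | 390 | 0.86 |
| 9 | 17 | 0.053 | 3.1e+02 | 2.0 | 0.0 | 21 | 43 | 390 | 412 | 387 | 417 | 0.87 |
| 10 | 17 | 0.76 | 4.4e+03 | -1.7 | 0.0 | 18 | 44 | 442 | 468 | 438 | 475 | 0.76 |
| 11 | 17 | 0.021 | 1.2e+02 | 3.3 | 0.1 | 26 | 44 | 478 | 496 | 472 | 499 | 0.91 |
| 12 | 17 | 0.0001 | 0.61 | 10.6 | 0.2 | 21 | 47 | 501 | 527 | 496 | 534 | 0.83 |
| 13 | 17 | 1.4 | 8e+03 | -2.6 | 0.0 | 24 | 38 | 588 | 602 | 585 | 609 | 0.74 |
| 14 | 17 | 0.00064 | 3.7 | 8.1 | 0.0 | 21 | 43 | 613 | 635 | 607 | 641 | 0.88 |
| 15 | 17 | 2 | 1.2e+04 | -3.1 | 0.0 | 27 | 46 | 647 | 666 | 646 | 673 | 0.81 |
| 16 | 17 | 0.0016 | 9.3 | 6.8 | 0.1 | 26 | 46 | 675 | 695 | 669 | 697 | 0.89 |
| 17 | 17 | 0.0097 | 56 | 4.3 | 0.0 | 19 | 43 | 697 | 721 | 695 | 725 | 0.85 |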
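The Hmmscan Out rows above list 17 candidate domain envelopes, most with weak independent E-values. As a rough sketch of how such rows might be screened, the snippet below parses a handful of them (copied from this record) and keeps those at or below an i-Evalue cutoff. The 13-column layout is assumed from this record's header, the cutoff of 1.0 is an arbitrary illustration rather than a threshold used by the database, and `DomainHit` and `parse_row` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    index: int        # "#" column: domain number
    c_evalue: float   # conditional E-value
    i_evalue: float   # independent E-value
    score: float      # bit score
    ali_from: int     # alignment start on the protein
    ali_to: int       # alignment end on the protein


def parse_row(line: str) -> DomainHit:
    """Parse one whitespace-separated row in the 13-column layout shown above."""
    f = line.split()
    return DomainHit(
        index=int(f[0]),
        c_evalue=float(f[2]),
        i_evalue=float(f[3]),
        score=float(f[4]),
        ali_from=int(f[8]),
        ali_to=int(f[9]),
    )


# A subset of rows from this record (domains 2, 5, 12, 13 and 14).
ROWS = """\
2 17 0.00044 2.6 8.6 0.0 16 46 189 219 182 225 0.81
5 17 2.1e-05 0.12 12.8 0.0 21 48 278 305 273 310 0.83
12 17 0.0001 0.61 10.6 0.2 21 47 501 527 496 534 0.83
13 17 1.4 8e+03 -2.6 0.0 24 38 588 602 585 609 0.74
14 17 0.00064 3.7 8.1 0.0 21 43 613 635 607 641 0.88
"""

MAX_I_EVALUE = 1.0  # illustrative cutoff only (an assumption)

hits = [parse_row(line) for line in ROWS.splitlines() if line.strip()]
kept = [h for h in hits if h.i_evalue <= MAX_I_EVALUE]

for h in kept:
    print(h.index, h.i_evalue, h.score, f"{h.ali_from}-{h.ali_to}")
```

With this cutoff only domains 5 and 12 from the subset pass. A looser or stricter value would change that set, so the threshold should be chosen to match whatever the downstream analysis requires.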
Sequence Information
- Coding Sequence
- ATGGATGTTGTTAATGACATTTCGTTTGAAAAAGTATTTAAAATTGAAAAACATGATCCGGAGGACTCAGAAATTAACAATGTTACGGAGGAACATAATTATCCTGATGATACGAATAATATGAGTGCTCCAGAAAGTGTTTTTATAAATATTGATCTTCCTGTCAGCATAAAAGCAGAGGAGCCTGATAATGAAGAAGCTGCGGATCCATTGGCAGAAGTTACCGAAACACAGGCTGAGGAATCATCGACGGGTTCAGACTTGAAAGGATTATTAGACAGCAATTCGACTAACTGTTCTCTAGATAGCAATCAGTATACATTGATTGCTAATGAAAATTTAATTTTAATAAAAAATGAATACGACAGCTCACCTGAGGTAAACAGTGATGTTAAAGAAACGAATGTAAAATCATCAAAGAGGCCAAAAATTACTAACAAAAAGGAAAGACAGTCTCGTAGTAAAAAGAAACGAAGTTCAAAACCTGCGAAAAAGAAAGCTAAACTTTTGAAATGCGACAGGTGTCCTGCTTTATTCGACTACAACAGTAACCTTGAAAGACATAGGCGAACTCACTCACCGAAAAAACCATATACTTGCGACATTTGTGGAACAAAATTTGCTTGGAAACGTAATTTAAAAGGTCATATATTTTTACATTCTGGAGAGCGACCTTTCAAATGTAAGACATGCTTGAAAACATTCAACAATAAAAATACACTGGAAAGACATATGATTATTCACTTAGGTGTTAAACCACACAAATGCAAAATATGTCCAGCAGGATTTGATTCTAAAAGCAATTTAACGAGACACATTCGCTCTCATACAGGTGAAAAACCATTCCAATGTGAAATCTGCTTAGCAAAATTTACAGAGAAGCGCAATTTGCAGAACCATTTACTAATTCATTCAGGAGAGAAACCGTGGCAGTGTAATGCATGTCCGTTAAGATTCAGTCAGAAGGCGTACTTGAGAAAACACCAAATGTACCACATAAGTAAAAACGTTCTAGAGTGCGACGTCTGTTTGATGACATTCGAAACGAAACGCAGTCTATCTAGACATATGACGATTCACGCAACCGACAGGAAGTTTGAATGTGATATGTGTTTAGCAAGATTCATTAAAAAACGTGAACTGACCAATCATATAATGGCTCACACAGGGGAAAAACCATGGCACTGTGAAATTTGCCCAGCAAAATTTAGTCAGAAGTTGTATTTAAAAAATCATAAAATTGTTCATGCAGAACAATTACCGTTTGAGTGTAAGACTTGTTCCGAAAGATTTGATACGGAAGACAATCTTTCAGAGCACAAAAAGTCTCATTCGAAAAATTTATGGCAATGTACAATTTGTAATAAAAAATTTACGTGGAAACATCATCTCGAGAAACACTTACTAACACATACCAGGGCGCGACGTTACAGGTGTCCAACGTGTCCAGCAGCGTTTGGTTCTAATAGTAATCTCACTAGACATATTCGCTCTCATACAGGCGAGAAACCATTCCAATGTGAAATTTGCGAAGCAAGATTTAGTGAGAAACGTAATCTCCAAAATCATATTAATACCCATACAGGAGAAAAGCCGTGGCAGTGTGACAATTGTCCAGCGAAATTTGGATCCAGGTCCACATTGACCAAGCACCAGCAGCTGCATAAGCAACCACTTTTATTTTCGTGTGACCACTGCTCACAGAAATTTGTTTCAAAACGAAAGCTCATCAAACATATACTAACCCATGCGGACAATTGGCCATTTCAATGTGAAATTTGTTCAGCAAGATTCACCGGGAAACGGAGTATGTATTTTCACTCTCTGACTCACACAGGTGAAAAACCATGGGAGTGTGAAATTTGTTTTGCTAAATTTGCTCAAAAAGCTTATTTAACAAGACATAAACTTACTCACATAGATATGCTATCATACAAGTGTCCTCACTGTTCAGAGCGATTTCATTCGGTAGAAAATCTTAATGAACATATACCAATT
CACATGGCTCCTAAAACGGTTATACAGTGTGAATTATGTTCAGCAACGTTTCGCGAAATACGGAATTTAAATAATCATATGGTACTTCATTCTCAAGGAGAAGAACCTTGGAGATGTGAAATTTGTTCCACAGGTTTTAGATGGAAGTTTGATTTAGATAAACATAAACTTGACCATTGTCAGTGA
- Protein Sequence
- MDVVNDISFEKVFKIEKHDPEDSEINNVTEEHNYPDDTNNMSAPESVFINIDLPVSIKAEEPDNEEAADPLAEVTETQAEESSTGSDLKGLLDSNSTNCSLDSNQYTLIANENLILIKNEYDSSPEVNSDVKETNVKSSKRPKITNKKERQSRSKKKRSSKPAKKKAKLLKCDRCPALFDYNSNLERHRRTHSPKKPYTCDICGTKFAWKRNLKGHIFLHSGERPFKCKTCLKTFNNKNTLERHMIIHLGVKPHKCKICPAGFDSKSNLTRHIRSHTGEKPFQCEICLAKFTEKRNLQNHLLIHSGEKPWQCNACPLRFSQKAYLRKHQMYHISKNVLECDVCLMTFETKRSLSRHMTIHATDRKFECDMCLARFIKKRELTNHIMAHTGEKPWHCEICPAKFSQKLYLKNHKIVHAEQLPFECKTCSERFDTEDNLSEHKKSHSKNLWQCTICNKKFTWKHHLEKHLLTHTRARRYRCPTCPAAFGSNSNLTRHIRSHTGEKPFQCEICEARFSEKRNLQNHINTHTGEKPWQCDNCPAKFGSRSTLTKHQQLHKQPLLFSCDHCSQKFVSKRKLIKHILTHADNWPFQCEICSARFTGKRSMYFHSLTHTGEKPWECEICFAKFAQKAYLTRHKLTHIDMLSYKCPHCSERFHSVENLNEHIPIHMAPKTVIQCELCSATFREIRNLNNHMVLHSQGEEPWRCEICSTGFRWKFDLDKHKLDHCQ
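The protein sequence should be the translation of the coding sequence. The sketch below shows one way to spot-check that, assuming the standard genetic code (the record does not name a translation table). It translates only the first 30 and last 15 nucleotides of the coding sequence above; `translate` is a hypothetical helper, not a library function.

```python
from itertools import product

# Standard genetic code, built in TCAG order (an assumption for this record).
_BASES = "TCAG"
_AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Excerpts copied from the coding sequence in this record.
cds_start = "ATGGATGTTGTTAATGACATTTCGTTTGAA"
cds_end = "GACCATTGTCAGTGA"

print(translate(cds_start))  # compare with the first 10 residues of the protein
print(translate(cds_end))    # compare with the last 4 residues of the protein
```

Running the same function over the full coding sequence and comparing the result with the full protein string would extend the check to the whole entry.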
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00382232;
- 90% Identity
- iTF_00379310;
- 80% Identity
- iTF_00380536;