Tpre001775.1
Basic Information
- Insect
- Trichogramma pretiosum
- Gene Symbol
- -
- Assembly
- GCA_000599845.3
- Location
- NW:2903217-2908811[+]
Transcription Factor Domain
- TF Family
- zf-GAGA
- Domain
- zf-GAGA domain
- PFAM
- PF09237
- TF Group
- Zinc-Coordinating Group
- Description
- Members of this family bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [1].
- Hmmscan Out
-
   #  of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to   acc
   1  16      0.18        74    3.1   0.3        27      45       214     232       212     239  0.89
   2  16       1.3   5.6e+02    0.3   0.1        26      46       241     262       234     269  0.77
   3  16     0.062        26    4.6   0.0        23      48       267     292       262     298  0.88
   4  16     0.016       6.6    6.4   0.0        23      48       296     321       292     327  0.88
   5  16         4   1.6e+03   -1.2   0.0        22      46       324     348       321     353  0.77
   6  16       1.3   5.3e+02    0.3   0.1        16      44       355     383       339     393  0.76
   7  16       4.3   1.8e+03   -1.4   0.0        23      44       391     412       380     421  0.78
   8  16         2   8.2e+02   -0.3   0.0        23      46       420     443       409     447  0.84
   9  16      0.13        56    3.5   0.0        22      48       448     474       444     479  0.90
  10  16       0.1        43    3.8   0.0        22      48       477     503       474     509  0.88
  11  16      0.29   1.2e+02    2.4   0.1        25      47       509     532       501     538  0.75
  12  16      0.76   3.1e+02    1.1   0.0        23      47       536     560       533     563  0.87
  13  16      0.05        21    4.9   0.1        22      49       564     591       550     596  0.87
  14  16       0.9   3.7e+02    0.8   0.1        23      48       594     619       591     625  0.84
  15  16     0.086        36    4.1   0.0        23      48       652     677       644     682  0.87
  16  16     0.025        10    5.8   0.1        23      49       681     707       676     713  0.85
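The per-domain hmmscan rows above can be summarised with a few lines of Python. This is a minimal sketch, assuming the rows are whitespace-separated in the column order shown (#, of, c-Evalue, i-Evalue, score, bias, hmm from/to, ali from/to, env from/to, acc). The names `DomainHit` and `parse_domain_rows` and the i-Evalue cutoff of 10.0 are illustrative choices, not part of this database or of HMMER.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    index: int       # domain number within the hit ("#" column)
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float     # bit score
    ali_from: int    # alignment start on the protein
    ali_to: int      # alignment end on the protein


# Rows copied from the Hmmscan Out field of this record.
ROWS = """\
1 16 0.18 74 3.1 0.3 27 45 214 232 212 239 0.89
2 16 1.3 5.6e+02 0.3 0.1 26 46 241 262 234 269 0.77
3 16 0.062 26 4.6 0.0 23 48 267 292 262 298 0.88
4 16 0.016 6.6 6.4 0.0 23 48 296 321 292 327 0.88
5 16 4 1.6e+03 -1.2 0.0 22 46 324 348 321 353 0.77
6 16 1.3 5.3e+02 0.3 0.1 16 44 355 383 339 393 0.76
7 16 4.3 1.8e+03 -1.4 0.0 23 44 391 412 380 421 0.78
8 16 2 8.2e+02 -0.3 0.0 23 46 420 443 409 447 0.84
9 16 0.13 56 3.5 0.0 22 48 448 474 444 479 0.90
10 16 0.1 43 3.8 0.0 22 48 477 503 474 509 0.88
11 16 0.29 1.2e+02 2.4 0.1 25 47 509 532 501 538 0.75
12 16 0.76 3.1e+02 1.1 0.0 23 47 536 560 533 563 0.87
13 16 0.05 21 4.9 0.1 22 49 564 591 550 596 0.87
14 16 0.9 3.7e+02 0.8 0.1 23 48 594 619 591 625 0.84
15 16 0.086 36 4.1 0.0 23 48 652 677 644 682 0.87
16 16 0.025 10 5.8 0.1 23 49 681 707 676 713 0.85
"""


def parse_domain_rows(text: str) -> list[DomainHit]:
    """Parse whitespace-separated domain rows into DomainHit records."""
    hits = []
    for line in text.strip().splitlines():
        f = line.split()
        hits.append(DomainHit(int(f[0]), float(f[2]), float(f[3]),
                              float(f[4]), int(f[8]), int(f[9])))
    return hits


hits = parse_domain_rows(ROWS)

# Domain with the smallest independent E-value.
best = min(hits, key=lambda h: h.i_evalue)

# Illustrative cutoff only; choose a threshold suited to your analysis.
below_cutoff = [h.index for h in hits if h.i_evalue < 10.0]

print("domains:", len(hits))
print("best domain:", best.index, "i-Evalue:", best.i_evalue)
print("domains with i-Evalue < 10.0:", below_cutoff)
```

Sorting or filtering on `i_evalue` rather than `c_evalue` is a design choice here: the independent E-value treats each domain as if it were the only one found in the sequence.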
Sequence Information
- Coding Sequence
- ATGGAAATTCAAGATAACATAGTttgGATTAAGGAAGAACCGAACGATACCTGGGCAGATGCAGGCGTCGATTATAATTTCGATTCGGCGGGCTCTTCCAAAATCGAAAACTGTGAAACATTTCCGTATGATAAATCACCGgcACATCACACTAATGAGGCGAGTgagtcaaatgaaaaattaaatgaaaaaatatttaatgattttaagaGCAATGATGTTAAACTTGAACTACCGTCTGTATCGACAACTAACTTCATGTGCGAGGGTCAACATTTTGAAGCTATGGTAGAAGAAGAATATCAAACCAATgacaaaaaggaaaatatattcattgattttgaggCCCACTGTGTGAAACCTGAACTGCAGTATTTGGCAACAAATATCTGTGAAACGGAATATCAAAGTTATCCACCAATTGAAAAACTAGAAAACCAAATTCAAATCAATTGCTTGAACAATAAAGATCCATTAATTTTAACGAATAAAGAAttcgattttgaaaatgatggTGACCTTTCAGAAAGTTCTCGTTTGGAAATTGACGCGTccaaaaaagtaaacattttgAATACAGGAGAAGAAAGCCTGAAAACACGTATCAATACAGTACATAAGGGCATTCTATTATATGAATGTGTGATTTGTCACAATTCatatcgacaaaaaaaagctctgaaAATTCACATTAAGACACACAATAGTATCAAATCCttcgaatgtaatatttgtcacaaattaatTGGAAACCGAAGTCAACTCAAAACGCATACTATAGAAAtgcataataatttcaaaccctttgaatgtgatatttgtcacaaatcatttggagagaaaaataaacttacaaGGCACATGGAGACGGTACATGAAggaagcaaaccctttgaatgtgatatttgccataaatcatttggacaaagtAGTCACCTTAAATCTCATATAAGCGTAGTACATAATGGTgacaaaccttttgaatgtgaaatttgtcacaaatcatttgggtaCAAGAgtaaactcaaaaatcacaaGTCATTACATGAACTCAAGTTCGTTCGTTCATTACATGAACGCCATCAATCTTTCGGATGtaacatttgtcaaaaatcatttggaaaacaGAGTCAACTCAAAAAACACACTAAagaaatacatgaaaatttcaaaccatttgaatgtgatatttgtaaaaaatcatttggaaaacaGAGTCAACTCAAAAAACACTCTAAagaaatacatgaaaatttcaaaccatttgaatgtgatatttgtaaaaaatcatttggagagaAGACTAAACTTTCAAGGCACATAACGACGGTACATGATGGTAGtcaacccttcgaatgtgataaatgtaagaaatcatttggacaaagtTGTCACCTTAAATCTCATATAAACGTAGTGCATAATGGCaacaaaccttttgaatgtgaaatttgtcacacaTCATTTGGACAACAAAGTAAACTCAAAagtcacataaattcagtacataatagTATCAAATCGTTGGAATGtaacatttgtcacaaattaatTGGAAACCAAAGTCAACTTAAAAAACACACTATGGAAATACATAATCAattcaaaccctttgaatgtgatatttgtcacaaatcatttggagagaaaaataaacttacgAGGCACATGACGAcggtacatgatcgtagccaacccttcgaatgtgaattctgtcacaaatcatttggtcaGAATAGTCACCTAAAATCTCACATAAACGTAGTACACCATGGAAGCAAACcgtttgaatgtaaaatttgtcacaaatcatttgggtaCCAGAGTAAACTCAGAagtcacataaattcagtacacgAACGCCGTCAATCATTCGAATGtaacatttgtcaaaaatcatttggaaaccAGAGTCAACTCAAAAGTCACATTttttcagtacataatcgcagcaaaccctttgaatgtgatatttgccaaaaatcatttggagagaaa
aataaacttacaaGGCACATggagacagtacatgaaggaaGCAAACCTtttaaatgtgatatttgtcataaatcatttggtcAGAATAGTcacctcaaatctcacataaacGTAGTACACCACGGAAGCAAACcgtttgaatgtgaaatttgtcacaaatcatttggataccagagtAAACTTAAAATTCACATAACGACGGTACATAACTTTAGCAATCTCCTTGAGAgttag
- Protein Sequence
- MEIQDNIVWIKEEPNDTWADAGVDYNFDSAGSSKIENCETFPYDKSPAHHTNEASESNEKLNEKIFNDFKSNDVKLELPSVSTTNFMCEGQHFEAMVEEEYQTNDKKENIFIDFEAHCVKPELQYLATNICETEYQSYPPIEKLENQIQINCLNNKDPLILTNKEFDFENDGDLSESSRLEIDASKKVNILNTGEESLKTRINTVHKGILLYECVICHNSYRQKKALKIHIKTHNSIKSFECNICHKLIGNRSQLKTHTIEMHNNFKPFECDICHKSFGEKNKLTRHMETVHEGSKPFECDICHKSFGQSSHLKSHISVVHNGDKPFECEICHKSFGYKSKLKNHKSLHELKFVRSLHERHQSFGCNICQKSFGKQSQLKKHTKEIHENFKPFECDICKKSFGKQSQLKKHSKEIHENFKPFECDICKKSFGEKTKLSRHITTVHDGSQPFECDKCKKSFGQSCHLKSHINVVHNGNKPFECEICHTSFGQQSKLKSHINSVHNSIKSLECNICHKLIGNQSQLKKHTMEIHNQFKPFECDICHKSFGEKNKLTRHMTTVHDRSQPFECEFCHKSFGQNSHLKSHINVVHHGSKPFECKICHKSFGYQSKLRSHINSVHERRQSFECNICQKSFGNQSQLKSHIFSVHNRSKPFECDICQKSFGEKNKLTRHMETVHEGSKPFKCDICHKSFGQNSHLKSHINVVHHGSKPFECEICHKSFGYQSKLKIHITTVHNFSNLLES
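The two sequence fields above can be cross-checked and scanned for zinc finger motifs with a short Python sketch. It makes two assumptions: the coding sequence uses the standard genetic code, and Cys2-His2 fingers are approximated by the common heuristic pattern C-x(2,4)-C-x(12)-H-x(3,5)-H, which is not the zf-GAGA Pfam model. `CDS_PREFIX` is the first 39 nucleotides of the Coding Sequence and `PROTEIN_EXCERPT` is a stretch copied from the Protein Sequence; the variable and function names are illustrative.

```python
import re
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO)}

# First 13 codons of the Coding Sequence in this record (case is ignored).
CDS_PREFIX = "ATGGAAATTCAAGATAACATAGTttgGATTAAGGAAGAA"

# Excerpt of the Protein Sequence in this record.
PROTEIN_EXCERPT = ("YECVICHNSYRQKKALKIHIKTHNSIKSFECNICHKLIGNRSQLKTHTIEMH"
                   "NNFKPFECDICHKSFGEKNKLTRHMETVHEGSKPF")

# Heuristic C2H2 pattern (an assumption, not the Pfam HMM).
C2H2 = re.compile(r"C.{2,4}C.{12}H.{3,5}H")


def translate(cds: str) -> str:
    """Translate complete codons of a coding sequence, ignoring case."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# The translated prefix should equal the start of the Protein Sequence.
print(translate(CDS_PREFIX))

# Report each non-overlapping candidate finger and its 1-based span
# within the excerpt (not within the full protein).
for m in C2H2.finditer(PROTEIN_EXCERPT):
    print(m.start() + 1, m.end(), m.group())
```

Running the same two functions on the full sequences from this record would check the whole translation and list candidate fingers across the protein; `re.finditer` reports non-overlapping matches only.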
Similar Transcription Factors
Clusters of similar sequences, computed with MMseqs2 at the identity thresholds below
- 100% Identity
- iTF_01482800;
- 90% Identity
- iTF_01485061;
- 80% Identity
- iTF_01485061;