Bcop008132.1
Basic Information
- Insect
- Bradysia coprophila
- Gene Symbol
- -
- Assembly
- GCA_014529535.1
- Location
- NW:8177512-8181899[+]
Transcription Factor Domain
- TF Family
- zf-GAGA
- Domain
- zf-GAGA domain
- PFAM
- PF09237
- TF Group
- Zinc-Coordinating Group
- Description
- Members of this family bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 17 0.021 37 4.4 0.2 21 45 62 86 58 94 0.87 2 17 0.14 2.4e+02 1.8 0.1 25 45 98 118 87 121 0.84 3 17 0.28 5e+02 0.8 0.0 20 44 177 201 170 206 0.83 4 17 0.029 52 3.9 0.2 20 48 206 234 198 237 0.84 5 17 0.0094 17 5.5 0.2 21 46 264 289 258 295 0.86 6 17 0.056 1e+02 3.0 0.2 21 44 320 343 312 348 0.89 7 17 0.0047 8.4 6.5 0.2 14 45 340 372 336 378 0.81 8 17 0.1 1.9e+02 2.1 0.1 25 44 468 487 462 491 0.87 9 17 0.016 29 4.7 0.0 21 47 492 518 487 523 0.87 10 17 0.0033 6 6.9 0.0 17 45 543 572 540 579 0.80 11 17 0.18 3.1e+02 1.4 0.1 21 34 576 589 572 602 0.73 12 17 0.0046 8.3 6.5 0.0 22 44 605 627 598 634 0.89 13 17 0.012 22 5.1 0.2 25 46 636 657 629 665 0.85 14 17 0.77 1.4e+03 -0.6 0.0 21 45 660 684 656 688 0.86 15 17 0.0091 16 5.5 0.3 21 45 688 712 684 720 0.90 16 17 2.9 5.3e+03 -2.5 0.1 21 43 745 767 721 772 0.69 17 17 1.7 3e+03 -1.7 0.0 27 44 780 797 778 802 0.86
Sequence Information
- Coding Sequence
- ATGAATGGTAAATCCAACAAAATGCCGAACAAAACGGAGACAATTCTCAGGGGTCCTGCTGATGTAAACGTGAAACTGGAGAATAGCAGCGAGTTACTCAGCGATCCATCGAAGTATTTACCGCCGAACTGTGTCACTGGTCTGAAACAATTCGACAACGAAAATCATTTGCAAGATCATACAACCGAAAAGCCGTTTCAATGTCATATTTGCAGCACTGCCTTCAGACTCAAACACACTTTGAAGATGCACATGAGACGGCACACAGGACACCAGACGGaattcaaaacatttacaTGTGACATTTGTGCGAAGTCATTCAGGAAATTTCAAAGATTGAAGCTGCACATGGCGATGCATGGCGGCAGCGAGACACATCAGTGCAAAATTTGTTCCGACTCCTTCAACAATTTGAGCTCGCTAAATAATCACAAGAAGCAACATCATCCCGATGCATTCAAGTGCACCGATTGTCTGAAAACTTTTGCGGATGACCGGGCGCTAAGTAACCATGTCAAACGTATCCATCGCTCGGAACGACCGTATAAATGCGAGCTGTGTGATAAAACGTGGAAGACGATGGTTGATCTGAAAGGTCATCTACGAACACACCTATCCGACGAGAGACCGTACATATGTGACATATGCAACTATGCTTTCAAACAACTTTGCGCGTTGAAATTGCATATCAAGCAGATACACGTCGGTGGACGGCCTCACAAGTGCGATGAGTGTGGTGCTGGTTTCAATCGTCAGGATTACTTGCTGATCCATAAACGGGTCCATAGTAAGGAAAAACCTTACACGTGTGAGGTTTGCAGCAGTGCGTTTAGTCAGCTATGCTCGTTGAAGGCTCACATGGCTATTCACTCCAACGAACGGAAGTACACGTGCGAGTATTGTGACGCCAAATTTTTGCGTATGAGCTCTCTCAAAATTCACTTCAGAACGCACACGGGTGAGAAACCGTACTCGTGTACCGTCTGTTCATCTAGCTTCGTCCAGAAATGTACACTCAAACAACACATGCGAACGCACTCTGACGATCGACCGTTTCAATGTAGCATTTGCGAGGCTTCCTTCAAGTTTAAGACCATTTTGAACCGGCATGTGGAAAAGCATACGACGACACAGACGACATTCAAGTGTGTTGATTGCGGTGAAGAGTTTGCCCAATCGAActtgctgaaatttcacaaacGAAAGCAGCACAAAGACATTTACTACACGGTCCACTGTAAAGAGTGCCAACGATGCTTCAGCAATGACAAAATGTTGGCCGATCACGTTGCCAAAGTACATCGGGTCGACAAACTGTTCAAATGCGAACTATGCACGAAGCGGTTCAAGGCTGAGTCGGATTTAAACGTTCATTTGGCGGCACACTTAGACGAACGCCGTTACAGGTGCACTGTTTGTGATGGTTCCTTTAAGCAGAGCCACCGATTGAAGGAACATATGCGTACCCATACGGGTGAGAGACCATTTCAATGTGAGATATGCGACGCTGATTTTAAGTATCGCAACAACTGGAAGCAACACatgaaaatccataaaaacgaaaaatcgttCAAATGCAACGAATGTGATGCTTCGTTTATACAGAAGAATTCATTGATCACGCATTCGAGGATTCACTCCAACGATAAGCCATTCAACTGTGAAGTCTGTTATATGAGCTTCAAGCATGTGCACAGTCTAAAAACGCATCTAAGGAAGCACACAGGTGAAAAACCGTTCAAGTGCAACATTTGCAATGCTTCGTTCAACCATCGGGCTAGCTTCAACATTCACAAGAAGACGCATGATGAAGATAGGCCGTACAAGTGTGGCATCTGCGATGCTACGTTCAAACACAGCAGCTATTTGAAGAATCACCTACCCACGCACACGGAAGAGAAACTGTTTAAATGCACCATATGTCCGTCGTCGTTCAAACAACATAGATATTTGAATCAACATCTGAAAACGCACACAACGGAACGTCCCTTTAAATGTGACTCATGCCCTGCCGCATTTAAAATGCAGAAAAGTTTGGTCGAACATATTCGCATTCACACCAACGAAAGGCCGTTCAAATGCAACGTCTGCACTTCTGCCTTCAAACAACAGCACACTTTGACGCAACATTTGCGAACTCATACGAAGGAACGCCTGTTTAAATGCGAGGTGTGCCCGAAATCATTTATGCAACACATTTCGCTGAAGGCACACATACAGCGCGTCCACAACAAGgaaaatccgttcaaatgtaGCCAATGCGACAAGAGTTTTGGTGATGAAAAGGGTCTGAGGAATCATAAGCGGAAAATCCACGATTTATCGTCGATTTTTGTGTGCGATCTGTGCTGTGCAACGGTCAAAGGTAAAACTGCGCTCAAAGCCCACATGATGAAACATCGGAAGGTAATAAAGCTGCTAAACTTGAAAGtataa
- Protein Sequence
- MNGKSNKMPNKTETILRGPADVNVKLENSSELLSDPSKYLPPNCVTGLKQFDNENHLQDHTTEKPFQCHICSTAFRLKHTLKMHMRRHTGHQTEFKTFTCDICAKSFRKFQRLKLHMAMHGGSETHQCKICSDSFNNLSSLNNHKKQHHPDAFKCTDCLKTFADDRALSNHVKRIHRSERPYKCELCDKTWKTMVDLKGHLRTHLSDERPYICDICNYAFKQLCALKLHIKQIHVGGRPHKCDECGAGFNRQDYLLIHKRVHSKEKPYTCEVCSSAFSQLCSLKAHMAIHSNERKYTCEYCDAKFLRMSSLKIHFRTHTGEKPYSCTVCSSSFVQKCTLKQHMRTHSDDRPFQCSICEASFKFKTILNRHVEKHTTTQTTFKCVDCGEEFAQSNLLKFHKRKQHKDIYYTVHCKECQRCFSNDKMLADHVAKVHRVDKLFKCELCTKRFKAESDLNVHLAAHLDERRYRCTVCDGSFKQSHRLKEHMRTHTGERPFQCEICDADFKYRNNWKQHMKIHKNEKSFKCNECDASFIQKNSLITHSRIHSNDKPFNCEVCYMSFKHVHSLKTHLRKHTGEKPFKCNICNASFNHRASFNIHKKTHDEDRPYKCGICDATFKHSSYLKNHLPTHTEEKLFKCTICPSSFKQHRYLNQHLKTHTTERPFKCDSCPAAFKMQKSLVEHIRIHTNERPFKCNVCTSAFKQQHTLTQHLRTHTKERLFKCEVCPKSFMQHISLKAHIQRVHNKENPFKCSQCDKSFGDEKGLRNHKRKIHDLSSIFVCDLCCATVKGKTALKAHMMKHRKVIKLLNLKV
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00245615;
- 90% Identity
- iTF_00245615;
- 80% Identity
- iTF_00245615;