Bodo003840.1
Basic Information
- Insect
- Bradysia odoriphaga
- Gene Symbol
- -
- Assembly
- GCA_016920775.1
- Location
- JAFDOW010001522.1:66261-70747[+]
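The location string above packs the contig accession, start, end, and strand into one field. A minimal sketch of splitting it apart follows. The `Location` type and `parse_location` helper are hypothetical names introduced here, and the span length assumes 1-based inclusive coordinates, which the record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for one 'contig:start-end[strand]' field."""
    contig: str
    start: int
    end: int
    strand: str


# Pattern for strings such as 'JAFDOW010001522.1:66261-70747[+]'.
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        contig=match["contig"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("JAFDOW010001522.1:66261-70747[+]")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = loc.end - loc.start + 1
print(loc, span)
```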
Transcription Factor Domain
- TF Family
- zf-GAGA
- Domain
- zf-GAGA domain
- PFAM
- PF09237
- TF Group
- Zinc-Coordinating Group
- Description
- Members of this family bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [1].
- Hmmscan Out
   #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
   1  12      0.34   1.1e+03   -0.3   0.0        26      46       156     176       146     182  0.87
   2  12      0.37   1.2e+03   -0.4   0.0        23      44       181     202       177     210  0.82
   3  12     0.025        79    3.3   0.5         6      45       227     269       222     277  0.73
   4  12    0.0055        18    5.4   0.1        20      36       280     296       271     309  0.77
   5  12     0.007        22    5.1   0.0        27      45       314     332       305     341  0.90
   6  12       1.8   5.7e+03   -2.6   0.0        26      46       341     361       336     369  0.81
   7  12      0.49   1.6e+03   -0.8   0.3        22      46       365     389       359     395  0.83
   8  12      0.14   4.6e+02    0.9   0.1        21      53       392     424       383     425  0.82
   9  12    0.0026       8.2    6.5   0.0        21      48       420     447       415     451  0.88
  10  12    0.0016         5    7.2   0.1        22      46       449     473       446     479  0.93
  11  12    0.0024       7.7    6.6   0.1        21      52       509     540       497     541  0.92
  12  12    0.0003      0.96    9.5   0.1        26      48       572     594       557     598  0.86
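Each hmmscan domain row carries thirteen whitespace-separated fields, so the rows can be read into typed records and filtered. The sketch below uses three rows copied from the output above. The `DomainHit` type, the `parse_row` helper, and the i-Evalue cutoff of 10 are illustrative assumptions, not thresholds used by the database.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """Hypothetical record for one per-domain hmmscan row."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_row(line: str) -> DomainHit:
    """Convert one whitespace-separated domain row into a DomainHit."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]), float(fields[4]), float(fields[5]),
        *(int(value) for value in fields[6:12]),
        float(fields[12]),
    )


# Rows 1, 10, and 12 from the hmmscan output of this record.
rows = [
    "1 12 0.34 1.1e+03 -0.3 0.0 26 46 156 176 146 182 0.87",
    "10 12 0.0016 5 7.2 0.1 22 46 449 473 446 479 0.93",
    "12 12 0.0003 0.96 9.5 0.1 26 48 572 594 557 598 0.86",
]
hits = [parse_row(row) for row in rows]

# Illustrative cutoff only: keep domains whose independent E-value is at most 10.
I_EVALUE_CUTOFF = 10.0
kept = [hit for hit in hits if hit.i_evalue <= I_EVALUE_CUTOFF]
for hit in kept:
    print(hit.index, hit.ali_from, hit.ali_to, hit.i_evalue)
```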
Sequence Information
- Coding Sequence
- ATGATGGACGAAGAGGATTCAAATCAATCTGATGACCAATCGAAAAATGGAGATGATGACGAAGATTATGTGGAGGAACAGGATGATGAAGAGGAGGAGGAGGAAGACGACAATAACGAAGAAGAcggtgatgatgatgacgagGATGACAACCAATCCGATAATAAAACTGATGTGCTTAAAAGTGAAGACAAAAATGAGTCAAACGATGATACGAGCCTGATAACACCAAAAGACGAACCGGATCCTGATTCTGAATCAGATACGAAGCCCGAAGTCAAAGTCAAACCTGAACCAGCACCAGAACCAGAAGAGTCATCCGGATATGACATTTATAAGAATATGACCGATGAACAGCGACAAAAAGCTTTAAAGAACAATGAATGTTTTCTCTGTGAGAAACGGTTCGCTTCATTCACATCATTCAAGAAACATATGATTCGTCATACTGGTGTCAAGAACTATAAGTGTCACATATGTGATAAGGCTTTTGCGGAAGGGAAATACCTACGTGCTCATATGAACATTCACAGTGGACGAACTCCCTATACGTGTAAAGTCTGCGAAAAGAAATTTGCAAGTTCATCAAGTTTGCATGGCCATATGTTGATCCACACCGGTGAAAAACGGTACAAATGTGATATTTGCAATAAAGCCTTCACGGCGTCATCCACTCGGGGCAAGCACATGAAAATGGTTCACAATCCCGAAGAACGGGCTAACCGAAAGCCATCGGAACGGAGGTTCAAATGTGAAATTTGCGAGCAGGCATTCAGCcgaaaaatgaacttaaatGTGCACATGAAGAGTCACAGTACCTCTAGGGGTATACTGGGCCAAGACGATGAAACGGAAGAGTGCCCAATTTGCTTCAAAATTATCGCTAAGAATTCCAAATATCACATGAAGACCCATGAAAAAGGAGTGAAAGTCTACGAGTGTAAAGTGTGCGATGCCAAATTCAATCAATCGGAAAGTCTTCGGGCCCACATGTCTCATCACACGGGTATTAAGGATTTTGTGTGCAGTGTTTGCTCAAAGGCATTTAGTGTTTCATCGCGTCTGACAAAACATATGAGGATTCATACGGGCGAAAAAAGATTTGTCTGCGAAATTTGTCAGCGCGCTTTTACCGATTGTTCGGCATTACATCGCCATCGAAAGATTCACACCGCtgaaaagaatttcttatgCGAGATCTGCTCTAAAGCGTTTTCACAACCATCCAGTCTTCAACTTCATATGCGTGTTCACACAGGTGAAAAACCGCACGTTTGTAAAGTTTGCACCAAAGCGTTCCACGATAGTTCATCACTGTCCAAACACATGAACCTGCATCTTCCGGAGAAACCTTTCCAATGTCAAATTTGTCTGaagaaattcaatcaaaaatactGTCTGAAGAAGCACCTTGAAACGCATGAGAATGATCATCTCCTGAAAGATTTCGACGGAGTTTGtgaaatttgttccaaaaagtTCGGAACACCCAGTGCACTGAAAATTCATCAAGCTGTACATTCCACCGATAAGCCCCATCAATGTAGCTATTGCTTTAAAAAGTTccgtttgatacaaaatatgaaaactcACATCGGAAAAGTGCACATTGATAAGCCATATGAATGCAACATTTGCCAAAAGAAAGCCTTTGCAACCATGGAAAAATTGGAGGAACATCGAGAGAAACATCTGGCTGAAAAGAAGGTACTGCCGTGTGGCATATGCCATCAGACCTTCAAAGCACCGGGCAATCTGCGCAAACATTTAATGAAGCGTCACAATATCGAAAATGAATCGAAAGAACCAATGAACATCGAGCAGATGCCGAACCCAACCGTACCAATGCCACAGATGGTGATGCCACCGATGCCAATGAAAGTTGTGTCCAATAATAAGGCAATGAGTGACGTTCTGAATTTATCGCATTCGATATTCCAACGGCAGGGCCATCCGACCTGGCTTTAA
- Protein Sequence
- MMDEEDSNQSDDQSKNGDDDEDYVEEQDDEEEEEEDDNNEEDGDDDDEDDNQSDNKTDVLKSEDKNESNDDTSLITPKDEPDPDSESDTKPEVKVKPEPAPEPEESSGYDIYKNMTDEQRQKALKNNECFLCEKRFASFTSFKKHMIRHTGVKNYKCHICDKAFAEGKYLRAHMNIHSGRTPYTCKVCEKKFASSSSLHGHMLIHTGEKRYKCDICNKAFTASSTRGKHMKMVHNPEERANRKPSERRFKCEICEQAFSRKMNLNVHMKSHSTSRGILGQDDETEECPICFKIIAKNSKYHMKTHEKGVKVYECKVCDAKFNQSESLRAHMSHHTGIKDFVCSVCSKAFSVSSRLTKHMRIHTGEKRFVCEICQRAFTDCSALHRHRKIHTAEKNFLCEICSKAFSQPSSLQLHMRVHTGEKPHVCKVCTKAFHDSSSLSKHMNLHLPEKPFQCQICLKKFNQKYCLKKHLETHENDHLLKDFDGVCEICSKKFGTPSALKIHQAVHSTDKPHQCSYCFKKFRLIQNMKTHIGKVHIDKPYECNICQKKAFATMEKLEEHREKHLAEKKVLPCGICHQTFKAPGNLRKHLMKRHNIENESKEPMNIEQMPNPTVPMPQMVMPPMPMKVVSNNKAMSDVLNLSHSIFQRQGHPTWL
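The protein sequence should be the translation of the coding sequence. A minimal sketch of that check follows, assuming the standard genetic code (the record does not name a translation table) and using only the first 21 and last 15 nucleotides of the coding sequence above. The `translate` helper is a hypothetical name. Lower-case (soft-masked) bases are upper-cased before lookup.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    sequence = cds.upper()  # soft-masked (lower-case) bases are still translated
    if len(sequence) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of three")
    residues = []
    for position in range(0, len(sequence), 3):
        amino_acid = CODON_TABLE[sequence[position:position + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 21 nt and last 15 nt of the coding sequence in this record.
cds_start = "ATGATGGACGAAGAGGATTCA"
cds_end = "CCGACCTGGCTTTAA"

print(translate(cds_start))  # compare with the first seven residues, MMDEEDS
print(translate(cds_end))    # compare with the last four residues, PTWL
```

Running the same function over the full coding sequence and comparing the result with the full protein sequence would complete the check.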
Similar Transcription Factors
Clustered by sequence similarity using MMseqs2
- 100% Identity
- iTF_00246529;
- 90% Identity
- iTF_00246529;
- 80% Identity
- iTF_00246529;