Bcop013201.1
Basic Information
- Insect
- Bradysia coprophila
- Gene Symbol
- -
- Assembly
- GCA_014529535.1
- Location
- NW:20056687-20065989[-]
Transcription Factor Domain
- TF Family
- zf-GAGA
- Domain
- zf-GAGA domain
- PFAM
- PF09237
- TF Group
- Zinc-Coordinating Group
- Description
- Members of this family bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [1].
- Hmmscan Out
-
 #  of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to   acc
 1  21      0.91   1.6e+03   -0.9   0.0        23      46       208     231       204     237  0.86
 2  21       2.7   4.8e+03   -2.4   0.0        23      44       236     257       232     261  0.82
 3  21      0.56     1e+03   -0.2   0.1        26      43       303     318       299     326  0.73
 4  21      0.02        36    4.5   0.2        26      45       329     348       316     355  0.86
 5  21      0.53   9.5e+02   -0.1   0.0        26      46       357     377       348     385  0.85
 6  21       0.6   1.1e+03   -0.3   0.0        16      53       403     440       390     441  0.82
 7  21     0.093   1.7e+02    2.3   0.0        21      51       436     466       432     468  0.87
 8  21     0.063   1.1e+02    2.9   0.1        22      45       465     488       462     499  0.84
 9  21     0.019        33    4.5   0.1        11      52       516     557       508     559  0.82
10  21    0.0011         2    8.5   0.0        17      48       578     610       565     613  0.80
11  21       1.4   2.5e+03   -1.4   0.0        26      46       825     845       817     853  0.87
12  21       1.8   3.2e+03   -1.8   0.0        23      44       850     871       846     874  0.80
13  21       0.1   1.8e+02    2.2   0.4         6      45       896     938       891     946  0.73
14  21     0.023        42    4.2   0.0        20      36       949     965       942     976  0.75
15  21     0.033        59    3.7   0.0        27      45       983    1001       979    1007  0.90
16  21       1.6   2.8e+03   -1.6   0.1        22      46      1034    1058      1023    1064  0.83
17  21      0.58     1e+03   -0.2   0.1        21      53      1061    1093      1053    1094  0.82
18  21     0.011        19    5.3   0.0        21      48      1089    1116      1085    1120  0.88
19  21    0.0063        11    6.1   0.1        22      46      1118    1142      1115    1148  0.93
20  21    0.0098        18    5.4   0.1        21      52      1178    1209      1166    1210  0.92
21  21    0.0012       2.2    8.3   0.1        26      48      1241    1263      1226    1267  0.86
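The per-domain hmmscan rows above can be loaded into records for filtering or ranking. The following is a minimal Python sketch, assuming each row holds the 13 whitespace-separated fields listed in the header (hit number, total hits, c-Evalue, i-Evalue, score, bias, HMM/alignment/envelope coordinates, accuracy). The names `DomainHit`, `parse_domain_rows`, and `SAMPLE_ROWS` are hypothetical and not part of any tool; the sample rows are copied from hits 1, 10, 19, and 21 of this entry, and the i-Evalue cutoff of 10 is an arbitrary illustration, not a recommended threshold.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    """One per-domain row of the hmmscan output (field names are illustrative)."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_domain_rows(text: str) -> List[DomainHit]:
    """Parse whitespace-separated domain rows, skipping headers and malformed lines."""
    hits = []
    for line in text.strip().splitlines():
        fields = line.split()
        # A data row has exactly 13 fields and starts with the hit number.
        if len(fields) != 13 or not fields[0].isdigit():
            continue
        hits.append(
            DomainHit(
                int(fields[0]), int(fields[1]),
                float(fields[2]), float(fields[3]),
                float(fields[4]), float(fields[5]),
                *(int(x) for x in fields[6:12]),
                float(fields[12]),
            )
        )
    return hits


# Rows 1, 10, 19 and 21 from the table in this entry.
SAMPLE_ROWS = """
1 21 0.91 1.6e+03 -0.9 0.0 23 46 208 231 204 237 0.86
10 21 0.0011 2 8.5 0.0 17 48 578 610 565 613 0.80
19 21 0.0063 11 6.1 0.1 22 46 1118 1142 1115 1148 0.93
21 21 0.0012 2.2 8.3 0.1 26 48 1241 1263 1226 1267 0.86
"""

hits = parse_domain_rows(SAMPLE_ROWS)
best = max(hits, key=lambda h: h.score)               # highest bit score
kept = [h.index for h in hits if h.i_evalue <= 10.0]  # arbitrary example cutoff
print(best.index, kept)
```

Ranking by `score` or filtering on `i_evalue` is only one possible use; the same records could equally be sorted by alignment start to list the domain hits along the protein.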
Sequence Information
- Coding Sequence
- ATGGATGAGGTATGTGACGAATTGTCCGACATTCAACAGTTACAAGAAGATCTCAGTGAGTTCCCGAACGAGGACGACGAAGACGAAGATGAAGATGAAGACGAACATAACGAAGAAGatgaagaggaagaagaaaacgATGAAAGCGgtgacgacgacgacgataACAACATTTTTACCTCAATGTCATTGGATGAAGAACCAGTGGAGGAAAAACCCAGTGCCGGTAATGGACTCGATATCGCCGTCAAACTGGAAGTCAGCATTCTAGAGgatgaagaaaatcaaaataataccAAGAATCAGCAGGAAAAAGTGCAGGTAAAGCGCGAACCTTCCGATGCTGAAAAAGTTCAATCCAGTGATAAAACGGGTGATGAAGCTCCGCCACCAGAGCCTCCCGGTTACGATATTTATAAGAATATAACCGAAGAGCAGCGTGAAGCAGCTAGGAAACATAACGAATGTTTCATTTGCCAGAAGCGGTTTGGTACTTTTGCCACCTTTAAACGGCATATAATCCGACACACTGGCTTAAAGAACTACAAGTGTCATATCTGCGGAAAAGCATTTGCCGAAGGCAGTTACTTAAAGGCCCATCTGATGACCCATAGTGGAACTACGCCGTACAGCTGTGATATTTGCctgaaaaaatttgcaaattcgTCCAGTCTCCATGGACACATTAAAATTCATACAGggGAAAAGCGATACAAATGCGAGATATGTGGGAAAGCATTTACCGAATCATCAACGCGACGCAAACATATGCTGGTCCATAATCCGAACAAGCAATTCAAATGCGAGATGTGCGACAAGGCGTTCAGTcgcaaaatcaatttgaatgttCACATCAAAACGCATTATCGTCAGCAGGGATTGCTGGCCGAAGGAGATGTGGTTCAGGAATGTCCGGTGTGCTTCAAAACAATTACGTACAATTTCAAGCAGCACATGAGAAACCATGAACGCGGTAAAGCGTACGCATGCAAATTGTGCGATGCGCGATTTCATCAGTCGAACAGTTTACGTGCCCATATGTCCAAACATACCGGTAAAAAGGAGTATTGTTGCACCGTTTGTTTGAAGGAGTTCAGCATGTCGTCGAATTTGACGAAACACATGAGAATACATACGGGCGAGAAACGATTCGTCTGTGAAGTGTGTCAGCGTGCGTTCACGGATTGTTCAACACTACGAAAACATCGCATGATTCACACCACCGAAAAGAACTTCTTGTGCGAGGTTTGCTCGAAAGCCTTCTCTCAGCAGACGAGTCTCCAGCTGCACATGAGGATACACACAGGGGAAAAACCACACGTTTGTAAAGTATGTTCCAAGGCCTTCCACGACGGTTCATCGTTGTCAAAGCACATGAACCTGCATCTTCCCGAGAAGCCTTTCCAATGTGAAATTTGCTTAAAGAAATTCACCCAAAAATACTGCCTGAAAAAGCATATGAAAACGCACGAGAACGATCACATAGTCAAGACTGGCGACGAATGCATCtgtaaaatttgttcaaagaaATTCGCCACTCCAACCGGACTGAAAATCCACCAGACCACACATTCAAGCGACAAGCAACATCAATGCAAACTTTGTTTGAAGAAATTCACATTGGCCAACAATTTGAGGACTCACATTGCTAAGGTCCACGTGGAAAAACCATTCGAATGCAGCATTTGCCAGAAATCGTTTGCTACGACGGAAAAACTGGACGAACATCGGGACAAGCACTTTTCGGAGAAAAAGGTCCTACCATGCGACATATGCCATAAAACGTTCAAAGCTGCTGGCAATTTACGGAAGCATATGTTAAAGCGTCATaatttggaaaatgaaatgaaaataccGAACGATATAGACAACATGCCCATGTCGATGTCGATGCCAATGTCGATGGCATCAATGTCAATGCCAATGGATGTGATGCCGAATGGCAATAATAAGCCGAATGTTTTGAATCTGTCGCAAACAATGTTTCAACGGCATGAA
CCGCCTAGTTGGATGATGGACGAAGAGGATTCAAATCAATCAGATGAACAATCGCAAAACGGAGATGATGACGAAGATTATGTGGAGGAACAGGATGATGAAGAGGAGGAGGACGAAGACAATAAAGATGAAGACGGTGGTGgtgatgacgatgatgacAGCCAATCCGATAATAAAAGTGATGTACTTAAACGTGAAGACAAAAATGAGTCAAACGATGATACCAGCCTAGTAACACCAAAGGACGAACCGGATCCTGATTCTGAATCTGATACGAAGCCCGAAGTCAAAGTCAAACCAGAACCAGCTCCAGAACCGGAAGAGTCATCCGGATATGACATTTATAAGAATATGACCGATGAACAGCGACAAAAAGCTTTAAAGAACAATGAATGTTTTCTCTGTGAGAAACGGTTCGCTTCATTCACATCATTCAAGAAACATATGATTCGTCATACCGGTGTCAAGAATTACAAGTGTCACATATGTGATAAGGCTTTTGCCGAAGGGAAATACCTACGTGCTCATATGAACATTCACAGTGGACGAACTCCCTATACGTGTAAAGTCTGCGaaaagaaatttgcaaGCTCATCGAGTTTACATGGCCATATGTTGATCCACACAGGTGAAAAGCGGTATAAATGTGATATTTGCAATAAAGCCTTTACGGCGTCATCCACTCGGGGCAAGCACATGAAAATGGTTCACAATCCCGAGGAACGGGCTAACCGAAAGCCATCGGAACGGAGGTTCAAATGTGAAATTTGCGAGCAGGCATTCAGCCGAAAAATGAACCTAAATGTGCACATGAAGAGTCACAGTACCTCCAGGGGTATACTAGGCCAAGACGATGAAACGGAAGAGTGCCCCATTTGCTTCAAAATTATCGCTAAGAATTCCAAATATCACATGAAAACTCACGAAAAAGGAGTGAAAGTCTACGAGTGTAAGGTGTGCGATGCCAAATTCAATCAGTCGGAAAGTCTTCGGGCCCACATGTCTCATCACACGGGTATTAAGGATTTTGTGTGCAGTGTTTGCTCAAAGGCATTTAGTGTTTCGTCACGACTAACAAAACATATGAGGATTCATACGGGCGAAAAAAGATTTGTCTGCGAAATTTGTCAGCGCGCTTTTACCGATTGTTCGGCATTACATCGCCATCGAAAGATTCACACCGCTGAAAAGAATTTCTTATGCGAGATCTGCTCCAAAGCGTTTTCACAGCCATCGAGTCTTCAACTTCATATGAGGGTGCACACAGGTGAAAAACCGCACGTTTGTAAAGTTTGCACCAAAGCGTTCCACGACAGCTCATCACTATCGAAACATATGAACCTGCATCTTCCGGAGAAACCTTTCCAATGTCAAATTTGTCTGaagaaattcaatcaaaagtACTGTCTGAAGAAGCACCTCGAAACGCATGAGAATGATCATCTCCTGAAAGATTTCGACGGGGTTTGTGAAATCTGTTCCAAAAAGTTCGGAACACCCAGCGcattgaaaattcatcaagCTGTACATTCCACTGATAAGCCCCATCAATGTAGCTATTGCTTTAAAAAGTTccgtttgatacaaaatatgaaaactcaCATCGGAAAAGTGCACATTGATAAGCCCTATGAATGCAACATTTGTCAAAAGAAAGCCTTCGCAACCATGGAAAAATTGGAGGAACATCGAGAGAAACATCTGGCCGAAAAGAAGGTACTGCCGTGCGGCATATGTCATCAGACCTTCAAAGCACCGGGCAATCTGCGCAAGCATTTAATGAAGCGTCACAATATCGAAAATGAATCGAAAGAACCAATGCACGTCGAGCAGGTGCCCAACCCAACTGTACCAATGCCACAGATGATGATGCCACCGATGCCAATGAAAGTTGTGTCCAATAATAAGGCAATGAGTGACGTTCTGAATTTATCGCATTCAATATTCCAACGGCAGGGGCATCCGAACTGGCTttag
- Protein Sequence
- MDEVCDELSDIQQLQEDLSEFPNEDDEDEDEDEDEHNEEDEEEEENDESGDDDDDNNIFTSMSLDEEPVEEKPSAGNGLDIAVKLEVSILEDEENQNNTKNQQEKVQVKREPSDAEKVQSSDKTGDEAPPPEPPGYDIYKNITEEQREAARKHNECFICQKRFGTFATFKRHIIRHTGLKNYKCHICGKAFAEGSYLKAHLMTHSGTTPYSCDICLKKFANSSSLHGHIKIHTGEKRYKCEICGKAFTESSTRRKHMLVHNPNKQFKCEMCDKAFSRKINLNVHIKTHYRQQGLLAEGDVVQECPVCFKTITYNFKQHMRNHERGKAYACKLCDARFHQSNSLRAHMSKHTGKKEYCCTVCLKEFSMSSNLTKHMRIHTGEKRFVCEVCQRAFTDCSTLRKHRMIHTTEKNFLCEVCSKAFSQQTSLQLHMRIHTGEKPHVCKVCSKAFHDGSSLSKHMNLHLPEKPFQCEICLKKFTQKYCLKKHMKTHENDHIVKTGDECICKICSKKFATPTGLKIHQTTHSSDKQHQCKLCLKKFTLANNLRTHIAKVHVEKPFECSICQKSFATTEKLDEHRDKHFSEKKVLPCDICHKTFKAAGNLRKHMLKRHNLENEMKIPNDIDNMPMSMSMPMSMASMSMPMDVMPNGNNKPNVLNLSQTMFQRHEPPSWMMDEEDSNQSDEQSQNGDDDEDYVEEQDDEEEEDEDNKDEDGGGDDDDDSQSDNKSDVLKREDKNESNDDTSLVTPKDEPDPDSESDTKPEVKVKPEPAPEPEESSGYDIYKNMTDEQRQKALKNNECFLCEKRFASFTSFKKHMIRHTGVKNYKCHICDKAFAEGKYLRAHMNIHSGRTPYTCKVCEKKFASSSSLHGHMLIHTGEKRYKCDICNKAFTASSTRGKHMKMVHNPEERANRKPSERRFKCEICEQAFSRKMNLNVHMKSHSTSRGILGQDDETEECPICFKIIAKNSKYHMKTHEKGVKVYECKVCDAKFNQSESLRAHMSHHTGIKDFVCSVCSKAFSVSSRLTKHMRIHTGEKRFVCEICQRAFTDCSALHRHRKIHTAEKNFLCEICSKAFSQPSSLQLHMRVHTGEKPHVCKVCTKAFHDSSSLSKHMNLHLPEKPFQCQICLKKFNQKYCLKKHLETHENDHLLKDFDGVCEICSKKFGTPSALKIHQAVHSTDKPHQCSYCFKKFRLIQNMKTHIGKVHIDKPYECNICQKKAFATMEKLEEHREKHLAEKKVLPCGICHQTFKAPGNLRKHLMKRHNIENESKEPMHVEQVPNPTVPMPQMMMPPMPMKVVSNNKAMSDVLNLSHSIFQRQGHPNWL
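The relationship between the Coding Sequence and the Protein Sequence above can be checked by translating the former. The sketch below is an illustration only: it assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly, and the names `CODON_TABLE` and `translate` are hypothetical. Whitespace is removed and lowercase bases are upper-cased before translation, so line breaks or mixed case in the coding sequence do not affect the result.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Whitespace is dropped and case is ignored; codons containing
    characters other than A, C, G, T are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 bases of the coding sequence in this entry.
print(translate("ATGGATGAGGTATGTGACGAATTG"))  # MDEVCDEL
```

Applied to the full coding sequence, the output of `translate` can be compared directly with the listed protein sequence to confirm that the two fields are consistent.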
Similar Transcription Factors
Clusters of similar sequences computed with MMseqs2 at the identity thresholds listed below
- 100% Identity
- iTF_00245613
- 90% Identity
- iTF_00245613
- 80% Identity
- iTF_00245613