Dhop036386.1
Basic Information
- Insect
- Dorcus hopei
- Gene Symbol
- -
- Assembly
- GCA_033060865.1
- Location
- CM065425.1:72244304-72250441[+]
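The Location field packs contig, start, end and strand into one string. A minimal sketch of splitting it apart is given here, assuming the coordinates are 1-based and inclusive (the record does not state the convention); `Locus` and `parse_location` are hypothetical names introduced for illustration.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str

    @property
    def span(self) -> int:
        # Assumes 1-based, inclusive coordinates (not stated in the record).
        return self.end - self.start + 1


# Matches strings shaped like "CONTIG:START-END[STRAND]".
LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Locus:
    """Split a location string into contig, start, end and strand."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


locus = parse_location("CM065425.1:72244304-72250441[+]")
print(locus.contig, locus.start, locus.end, locus.strand, locus.span)
```

Under the inclusive-coordinate assumption, the span is the genomic footprint of the locus, not the coding sequence length.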
Transcription Factor Domain
- TF Family
- zf-GAGA
- Domain
- zf-GAGA domain
- PFAM
- PF09237
- TF Group
- Zinc-Coordinating Group
- Description
- Members of this family bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [1].
- Hmmscan Out
 #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
 1  13       4.2   5.9e+03   -1.0   0.1        25      44        14      33         9      41  0.76
 2  13      0.91   1.3e+03    1.1   0.4        21      32        38      49        22      64  0.79
 3  13   0.00014      0.19   13.4   0.1        21      44        95     118        88     127  0.86
 4  13       2.8   3.8e+03   -0.4   0.0        21      45       123     147       120     153  0.79
 5  13     0.013        18    7.1   0.1        21      44       179     202       171     210  0.88
 6  13     0.042        59    5.4   0.2        21      45       207     231       204     238  0.84
 7  13    0.0066       9.2    8.0   0.1        23      45       237     259       232     262  0.88
 8  13   0.00081       1.1   10.9   0.0        21      48       263     289       260     294  0.89
 9  13     0.012        17    7.2   0.1        21      47       291     317       289     322  0.86
10  13     0.018        25    6.6   0.2        21      48       319     345       316     351  0.88
11  13      0.02        28    6.4   0.2        23      44       350     371       346     378  0.90
12  13       5.5   7.7e+03   -1.4   0.2        27      48       382     402       380     409  0.80
13  13       2.6   3.6e+03   -0.3   0.1        18      44       401     427       396     432  0.72
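The per-domain hmmscan rows can be read programmatically to separate stronger hits from weak ones. The following is a minimal sketch, not the database's own pipeline: the helper names are hypothetical, and the independent E-value cutoff of 10 is an arbitrary value chosen only for illustration.

```python
# The 13 per-domain rows from the Hmmscan Out field, whitespace-separated.
HMMSCAN_ROWS = """\
1 13 4.2 5.9e+03 -1.0 0.1 25 44 14 33 9 41 0.76
2 13 0.91 1.3e+03 1.1 0.4 21 32 38 49 22 64 0.79
3 13 0.00014 0.19 13.4 0.1 21 44 95 118 88 127 0.86
4 13 2.8 3.8e+03 -0.4 0.0 21 45 123 147 120 153 0.79
5 13 0.013 18 7.1 0.1 21 44 179 202 171 210 0.88
6 13 0.042 59 5.4 0.2 21 45 207 231 204 238 0.84
7 13 0.0066 9.2 8.0 0.1 23 45 237 259 232 262 0.88
8 13 0.00081 1.1 10.9 0.0 21 48 263 289 260 294 0.89
9 13 0.012 17 7.2 0.1 21 47 291 317 289 322 0.86
10 13 0.018 25 6.6 0.2 21 48 319 345 316 351 0.88
11 13 0.02 28 6.4 0.2 23 44 350 371 346 378 0.90
12 13 5.5 7.7e+03 -1.4 0.2 27 48 382 402 380 409 0.80
13 13 2.6 3.6e+03 -0.3 0.1 18 44 401 427 396 432 0.72
"""

INT_FIELDS = ["index", "of", "hmm_from", "hmm_to",
              "ali_from", "ali_to", "env_from", "env_to"]
COLUMNS = ["index", "of", "c_evalue", "i_evalue", "score", "bias",
           "hmm_from", "hmm_to", "ali_from", "ali_to",
           "env_from", "env_to", "acc"]


def parse_domain_rows(text):
    """Turn whitespace-separated rows into a list of typed dicts."""
    hits = []
    for line in text.splitlines():
        fields = line.split()
        if not fields:
            continue
        if len(fields) != len(COLUMNS):
            raise ValueError(f"expected {len(COLUMNS)} fields, got: {line!r}")
        row = dict(zip(COLUMNS, fields))
        hits.append({
            key: int(value) if key in INT_FIELDS else float(value)
            for key, value in row.items()
        })
    return hits


def filter_by_ievalue(hits, max_ievalue):
    """Keep hits whose independent E-value is below the cutoff."""
    return [hit for hit in hits if hit["i_evalue"] < max_ievalue]


hits = parse_domain_rows(HMMSCAN_ROWS)
kept = filter_by_ievalue(hits, max_ievalue=10.0)  # illustrative cutoff only
best = max(hits, key=lambda hit: hit["score"])
for hit in kept:
    print(hit["index"], hit["i_evalue"], hit["score"],
          hit["ali_from"], hit["ali_to"])
```

The alignment coordinates (`ali_from`, `ali_to`) index into the protein sequence, so the retained hits can be mapped back to residue ranges.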
Sequence Information
- Coding Sequence
- ATGACGCTGCACAAATTGAACCATACCGACGAGAAGCGGTTCAGTTGCGGCCTTTGCGAATACAAGTGCCAGCGAGCGGCACATTTGAAGCGGCACATGTTCACACACAACGACGAAAAACCGTTCACCTGTGAGGTTTGCGATCACAAATTCAATCGTCTCGAACACTTGAAGAGCCACCAATACCGGCGGATTAAGGACGAAATAACAGAAGAATGTGGTCCATGTGGTTCCAAACTTGAATCGCACAGAAGTTTGCGCGAGCACACACCGAAACGCACAGACGAGAAGCCGCTTACATGTGACGCATGCGACTGCAAATTTCGAGACGCTCGGAAATTGAAGCGGCACATGTTAACACATACTGgggagaaaccgttcagttgcgacCTTTGCACCTACGAAGGCGATCAACTGCAACAACTGAAAGCGCACATGCGAACACACACCGGCGCGAAACTATTTAAGTGCGACTTCTGTGACTACGAGTCCCGACTGTCCTCACGTTTGAAACGTCACTCATTAATACACACCGGTGAAAAGCCGTTCAGCTGCGAGCTTTGCGATTGCAGATTCCGCTTTTCGTCTAACTTGGCGCAGCACATGTTAACGCACTcaggcgagaagccgttcagttgcgatcaCTGCGCTTATAAATGCGGACAAATCACAAATTTCAGGAGGCACATGCTAACGCATACGGAAACCAAACCGTACAGTTGCGATTCTTGCAGTTACAAATCTCGACAAtcctcaaatttaaaaaagcacatGATGATACACACTGGGGAAAAACCGTTTGCTTGTAGCGTTTGCGATTTCAAGTGCCGACAATCCGGAGATATGAAGCGCCACGAATTgacgcacaccgacgagaagccgttgggttgtaatttttgcgattacaagtgccgaGGGGCCCGAAGTTTAAGGCagcacatgttaatacacaccggcgagaagccgctGAGTTGCGgcgtttgcgattataaatgtcgccATCTTGGAACCTTGAGGCGGCACATGTTAAAGCACACGGTCGACGTGAAACCGTACAGCTGTAATGTTTGCGAGTACAAGTGCCAAAAGTCCGAGGACATGAGGCGTCACTTGTTGACGCACACGGGCGGAAAGCGGTTCGGTTGTGACCTGTGCGGTTACAGATGCCGACGGCTCGCGCATTTGAAACAGCACCTGCTGAGACATTcgggcgagaaaccgttcggttgtgacctttgcgattataaatgcctaGTTAAGGCAAGTTTGAAGAAGCACTTGTTGGCACATACACTTGATAAGAAACAATAA
- Protein Sequence
- MTLHKLNHTDEKRFSCGLCEYKCQRAAHLKRHMFTHNDEKPFTCEVCDHKFNRLEHLKSHQYRRIKDEITEECGPCGSKLESHRSLREHTPKRTDEKPLTCDACDCKFRDARKLKRHMLTHTGEKPFSCDLCTYEGDQLQQLKAHMRTHTGAKLFKCDFCDYESRLSSRLKRHSLIHTGEKPFSCELCDCRFRFSSNLAQHMLTHSGEKPFSCDHCAYKCGQITNFRRHMLTHTETKPYSCDSCSYKSRQSSNLKKHMMIHTGEKPFACSVCDFKCRQSGDMKRHELTHTDEKPLGCNFCDYKCRGARSLRQHMLIHTGEKPLSCGVCDYKCRHLGTLRRHMLKHTVDVKPYSCNVCEYKCQKSEDMRRHLLTHTGGKRFGCDLCGYRCRRLAHLKQHLLRHSGEKPFGCDLCDYKCLVKASLKKHLLAHTLDKKQ
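The coding sequence mixes upper- and lower-case bases, so it needs case normalisation before translation. Below is a minimal sketch of checking that a coding sequence translates to the listed protein, assuming the standard genetic code; `translate` is a hypothetical helper, and only the first and last codons of the record are used as a demonstration.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varying fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, ignoring letter case.

    Unrecognised codons (for example those containing N) become "X";
    a single trailing stop codon is dropped.
    """
    seq = cds.strip().upper()
    if len(seq) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )
    return protein[:-1] if protein.endswith("*") else protein


# First six codons and last five codons of the record's coding sequence.
cds_start = "ATGACGCTGCACAAATTG"
cds_end = "GATAAGAAACAATAA"
print(translate(cds_start), translate(cds_end))
```

Passing the full Coding Sequence string to `translate` and comparing the result with the Protein Sequence field is a quick consistency check on the record.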
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00465303;
- 90% Identity
- iTF_00465303;
- 80% Identity
- iTF_00465303;