Basic Information

Insect
Dorcus hopei
Gene Symbol
-
Assembly
GCA_033060865.1
Location
CM065425.1:72464843-72468903[+]

Transcription Factor Domain

TF Family
zf-GAGA
Domain
zf-GAGA domain
PFAM
PF09237
TF Group
Zinc-Coordinating Group
Description
Members of this family bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 15 0.068 94 4.8 0.0 21 44 51 74 37 79 0.90
2 15 0.031 43 5.8 0.1 21 48 79 105 76 107 0.91
3 15 0.08 1.1e+02 4.5 0.2 18 44 104 130 103 136 0.88
4 15 0.91 1.3e+03 1.1 0.2 23 48 137 161 133 166 0.82
5 15 0.45 6.3e+02 2.1 0.1 20 47 162 189 159 194 0.83
6 15 0.039 55 5.5 0.2 17 44 186 214 168 223 0.80
7 15 0.049 69 5.2 0.0 15 44 227 256 225 262 0.92
8 15 0.018 25 6.6 0.0 21 51 289 319 285 320 0.87
9 15 0.038 53 5.5 0.1 21 44 317 340 315 345 0.90
10 15 0.025 35 6.1 0.1 21 49 345 372 342 376 0.89
11 15 0.031 44 5.8 0.7 21 45 373 397 370 405 0.88
12 15 0.067 94 4.8 0.1 14 44 409 436 399 441 0.83
13 15 0.016 23 6.7 0.0 21 44 441 464 437 470 0.90
14 15 0.012 17 7.1 0.4 21 44 470 493 466 501 0.87
15 15 1.2 1.6e+03 0.8 0.4 21 44 498 521 495 526 0.78

Sequence Information

Coding Sequence
ATGGCGTGTAAGTTAAGGCAAGTTCGCACGGGATCatgCCTACAAATTTCCGAACTTACACTCCTGGACAAAAGACCATACGAATGTCGCgtctgcgattataaattcaaaaatcccGGCCACAGACGTAATCACATGCTgatacataccggcgagaagccgttcaccTGCGAGTTTTGCGGTTACAAAAGCCGACTGGCCGGAAACTTGAAGCAGCACATGCTAACCCATTccgacgagaagccgtacaCTTGTAACGATTGCGGCTTCAAATGTCGACACCGCGGAAACCTCAAACAGCACATGCTGagacacaccggcgagaagccgtacaTTTGCGATCTTTGCGGCTACAAATGCCAACAGGGCACGAGTTTAAGACAGCACATGCTGATACATACCGGCGAAAAGCAATACAAGTGCGATCTTTGCGGTTACGAATGCAATCGGTCGGGAAATTTGAAGATACACTTGTTAAGGCATTCCAACGATAGGCCGTTTAAGTGCGGACGTTGCGAGCACGTGAGCAAGACGGCCGGAAACTTAAAAATACACATGCGAGTGCATTCCGATGAGAAACCGTTCAGGTGCAACGTCTGCGATTACGCGACTAAATGGTCGCCGCACTTGAAGAGGCATATGATGTCGCACGAGGAACTGCTCAGATTTCGATGGCTCCCAAAATCCGAAAGTCCCGACCAGAAACCCTTCAAATGCCACACGTGCGGTACTATGTATAAGTCTTACACCCGCATGCGCCGTCACATGCTGTCGCACACCAACGAGACGGAATTCAGCTGCGACTTTTGCGATTACAAGAACCGCCAGTTCGGAAAGATCCAGAGCcacatgttaacgcacaccggtgagaagccgttcggctgtgatctctgcgattataaatgcagggAATTGGGAAAGTTGAGGCGGCACATGTTGttgcacaccggcgagaagccgttcagttgtgacgtctgcgattacaagtgccgacAACCCGGAAGGTTGAGGGAGCACATGTtgacgcacaccggcgagaagccgttcggttgcgatctctgcgattacaagtgcAGAGAGACCGGAAAGTTGAAGCGGCACAACTTAAGGCACATccgcgagaagccgttcagctgcgatttttgcgattacaaatgccacCAACCCGTAACGTTAAGACGCCACCTATTGATACACAGCGGGCAAAAATCGGAGAAGGGGGAGAAACCGGTTAGTTATGAGAAACCGTATGGTTGTGATTTATGCGATTATAGGAGTCGAGAGGCTGCAAAAttgaaacggcacatgttgacgcacacgggcgagaagccgttcagttgtacTCTTTGCGACTATAAGTGTCGACAATCCGGGAGGTTGAAAGAGCACATGTTCATGCACGTCGGTGGCGAAAAGCCGTTAAGTTGTGACGTTTGCAGTTACAGGTGCCGACAGCCCGCTATGTTGAAGAGGCACATGTTAACGCACGGCAGCGAGAAGCCCCTCGGCTGCGAGCTCTGCGGTTACAGATGTTGTCATCCCTCAACGTTAAAACGGCACGTGTTGAAGCACGCCGGCGAAAAACGCTAG
Protein Sequence
MACKLRQVRTGSCLQISELTLLDKRPYECRVCDYKFKNPGHRRNHMLIHTGEKPFTCEFCGYKSRLAGNLKQHMLTHSDEKPYTCNDCGFKCRHRGNLKQHMLRHTGEKPYICDLCGYKCQQGTSLRQHMLIHTGEKQYKCDLCGYECNRSGNLKIHLLRHSNDRPFKCGRCEHVSKTAGNLKIHMRVHSDEKPFRCNVCDYATKWSPHLKRHMMSHEELLRFRWLPKSESPDQKPFKCHTCGTMYKSYTRMRRHMLSHTNETEFSCDFCDYKNRQFGKIQSHMLTHTGEKPFGCDLCDYKCRELGKLRRHMLLHTGEKPFSCDVCDYKCRQPGRLREHMLTHTGEKPFGCDLCDYKCRETGKLKRHNLRHIREKPFSCDFCDYKCHQPVTLRRHLLIHSGQKSEKGEKPVSYEKPYGCDLCDYRSREAAKLKRHMLTHTGEKPFSCTLCDYKCRQSGRLKEHMFMHVGGEKPLSCDVCSYRCRQPAMLKRHMLTHGSEKPLGCELCGYRCCHPSTLKRHVLKHAGEKR

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00465224;
90% Identity
iTF_00465224;
80% Identity
iTF_00465224;