Dhop036409.1
Basic Information
- Insect
- Dorcus hopei
- Gene Symbol
- -
- Assembly
- GCA_033060865.1
- Location
- CM065425.1:72486469-72509457[+]
Transcription Factor Domain
- TF Family
- zf-GAGA
- Domain
- zf-GAGA domain
- PFAM
- PF09237
- TF Group
- Zinc-Coordinating Group
- Description
- Members of this family bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 31 0.06 84 4.9 0.2 21 44 8 31 4 38 0.90 2 31 2.7 3.7e+03 -0.4 0.1 23 44 38 59 33 64 0.81 3 31 7.2e-05 0.099 14.3 0.5 21 48 64 90 60 97 0.89 4 31 4.8 6.6e+03 -1.2 0.1 19 44 90 115 88 119 0.77 5 31 0.013 17 7.1 0.1 21 44 120 143 111 148 0.91 6 31 0.035 48 5.7 0.0 21 44 148 171 144 180 0.87 7 31 2 2.8e+03 0.0 0.1 21 44 176 199 173 203 0.88 8 31 0.99 1.4e+03 1.0 0.1 22 52 233 263 229 265 0.87 9 31 0.19 2.6e+02 3.4 0.1 18 44 271 297 261 302 0.86 10 31 0.88 1.2e+03 1.2 0.0 25 44 306 325 300 329 0.84 11 31 0.00073 1 11.1 0.0 21 44 330 353 327 361 0.88 12 31 3.9 5.5e+03 -0.9 0.2 21 44 358 381 355 389 0.73 13 31 0.00041 0.56 11.9 0.1 21 44 405 428 397 437 0.87 14 31 0.011 16 7.3 0.1 21 45 433 457 430 464 0.85 15 31 0.7 9.7e+02 1.5 0.1 21 44 461 484 458 488 0.90 16 31 0.0087 12 7.6 0.0 21 45 489 513 486 522 0.85 17 31 0.7 9.8e+02 1.5 0.1 21 44 517 540 514 544 0.90 18 31 0.0043 6 8.6 0.3 21 50 545 573 542 577 0.87 19 31 0.29 4.1e+02 2.7 1.3 23 48 575 599 570 602 0.89 20 31 0.0062 8.6 8.1 0.3 18 46 598 626 597 632 0.85 21 31 0.21 2.9e+02 3.2 0.1 21 44 629 652 626 660 0.87 22 31 2.5 3.4e+03 -0.2 0.1 20 45 714 739 709 748 0.82 23 31 2.3 3.2e+03 -0.1 0.0 23 47 745 769 740 776 0.83 24 31 8.7 1.2e+04 -2.0 0.2 21 44 771 792 764 795 0.76 25 31 0.011 15 7.3 0.6 19 44 852 877 848 882 0.89 26 31 0.11 1.5e+02 4.1 0.0 21 44 882 905 879 910 0.90 27 31 0.17 2.4e+02 3.4 0.2 21 48 910 936 906 940 0.90 28 31 4.4 6.1e+03 -1.0 0.0 27 51 977 1001 968 1002 0.88 29 31 0.04 55 5.5 0.0 22 48 1180 1206 1176 1210 0.85 30 31 0.0021 2.9 9.6 0.0 22 45 1208 1231 1204 1240 0.85 31 31 1.4 2e+03 0.5 0.2 22 48 1236 1262 1232 1264 0.82
Sequence Information
- Coding Sequence
- ATGAATATTGCGACACACTCGGGCGAGAGACCGTTTACTTGCGAATTTTGCGGTTGCAAATACCTACACGCCCGAAGGCTAAAGCGGCACATGCTAACACACACCGACGAAAAATCATTCGGTTGTAGTTCCTGCGACTTTCGCTGCAAGCAGAAGCAAAACTTAACGCAGCACTTACTGTCGCACACCGCCGAGAAGCCCTTCGCCTGTAAACTTTGCGATTACAGATGCGGCCAATCCAGAAATCTGAAGCGGCACGTCTTGAGGCACTCCGACGAGAAGCCCTTCGCCTGTAAACTCTGTGATTACAAATGCCAACGGGCGGACTACATAGAACggcacatgttaatacacaccgacgagaagccctTCCGTTGCACTTTCTGCGGCTACAAGTGCCGACAATCCGAGAATTTGAAGATGCACGTGCTGACACACACCGACGAAAAACCGTTCGGCTGTAagctttgcgattataagtgccgacaAAACGGAACGCTGAAGCGTCACATGTTGACTCACACGGGCGAAAAGCCCTTCGGCtgcgacctttgcgattacaaatgccgacgaATCGAAGTCCTGAAGCGACACACGTTGATTCACACCGATGAGAAACTTTTCAGTTGCGATCTCTGCCAGTTCGAATGCCGACGGCCCGAACGATTGGACGAGCACATGTTAACGCACGGCGAGGAGAAACCGATCGgctgcgatctttgcgattacaagtgccgacAGACGGGAAGATACCAGCAGAATACTGAATATCAACATGTGACAAACACCGGCGAAAATCCGCACGTGTTAAGATACGCCAACGAGAAGCCTTTCAGTTGTACTTTCTGCAATTATAAATGCGAACAACCCGGAAAACTGAAACGGCACATGCTAACGCACACCAAGGAGAAGCGTTTCCGTTGTACCTTGTGCGATTACAAAAGCCAACAATCCGGAAACCTGAAGCagcacatgttaacgcacaccgacgagaagcctttCACTTGTACTTTGTGCGGTTGTAAATTCAAACAATCCGGGAAACtgaaacggcacatgttaaCACATACCGAGGAGAAGCCCTTCGCTTGCGTGCTCTGCAATTACCAATGCCTACGGGCGCCCAGCTTGGAACGGCACATGCTAACGCACACCAATGAGAAGCCCTTCGGATACCAGTGGAATATTAAATATCAACGTGTGACACACCCCGACGAGAAGCCCTTCAGTTGTACTTTGTGCGATTACAAATGTAATCAATCGGGAAACTTGAGGCGGCACATGTTAACACACGCCAGCGAGAAACACTTCAGCTGTACCTtgtgcgattacaaatgcaagCAGTCCGGAAACATGAAACGacacatgttaacgcacaccAACGAGAAGCCCTTCGGTTGTGTActctgcgattacaagtgcTCGCAAGCGCTAAGCTTAAAACGGCACATGTTAACACACACCAGCGAGAAACACCTCAGCTGTACCTTGTGCGATTACAAGTGCAAGCAGTCCGGAAACATGAAACGacacatgttaacgcacaccAACGAGAAGCCCTTCGGTTGTGTActctgcgattacaagtgcTCGCAAGCGCTAAGCTTAAAACggcacatgttaacgcacaccAACGAGAAGCCCCTAAGTTGCACTTTCTGCGACTACAAATGCCGACAACCCGAAAATTTGAGAGCGCACATATTGAAACACACCGGCAAAAATCCGTTCTgctgtaatctttgcgattataaatgccgttATAACGGTGCATTGAGGCGTCACATGTTAAGACATACAGATGAGAAACCTGTCAGTTGTgaactttgcgattacaagtgccgacAAAAGGAATACCTGAAGAGGCACATGTTGATACACACCGATGAGAAACCtttcagttgcgatctttgcgattgcAAGTGCCGACGATCGGATGAATTGGATCagcacatgttaacgcacaGCGGCGAGAAACCGATTGGATACCAGCAGAGTCTTGAATGTCAAGTTGTAAACAACCAGACAGAGTGTGGAATGTGCGGCCCCAAATTTAAATCTCACAAGTGTAACCACGTGTTGATACATACCGAAGAGAAACCGCTCAAAACCGGTCATCAATTCCGAGAAGCgaaacacaccgacgagaaaccatACAGCTGTGAATCTTGCGGTTGCAAATACCTGCACGCTCAGAGTTTAAAACGGCACATGTTAACACACACCGACGTGAAACCCTTCAGTTGtaacctttgcgattacaaaagcCAACAAAAGGGATACCTGAAGAGGCACATGTTGATACACTCCGATGAGAAACCTTTCAGTTGCGAGCTTTGCGATTGCAAATGTCGACGAGCATACCGGCGGAGTTTTGAATATCAAGTTGTAAACAACCTGACAGTTTGTGAAATGTGCGGCCCCAAATTTAAATCTCACAAATGTAACCACGTCTTGAAACATGCGTTCAAAATCGGTCGTCAATTCCGAGAAGCAGAACGGCACATCGTAAAACACACCGACGAAAAGCAGCACATGTTAACACACCGTGAGAGTGATACtaagaaaccgttcagttgtgatgtcTGCGATTACAGATGCCGACAACGCAGAAGGTTGAAAGAGCACTTATtaacacacaccgacgagaagccctTCAGTTGTACCTTCTGTGATTACAAATGCAGACAGTCTGGACAATTGAAAGGACACATGTTAACGCACTCCGATGAGAAGCCGATCCGCtgtgatttttgcgattataaatgccggctgAGCGGAACAATGAAGCGGCACGTGTTAagacacaccggcgagaaacgcTTCGCGTGTGGTCTTTGTGGTTACAAGAGCCCGGAAGCCGGAAGCACGCTAAACAATTCTAACGATTTTAAAAGCTCGGCCGCCGGCGAGAAATTTGTATTTCAATGTATAGTCTGCAACTATGACATCCTCAGCGAAAAAGAACTTCGATACCACGTCAAGCAAGTGCACGTCACCAAGGAATTCAAAGAGAAGGCCAAGGCCTGCGGCGAGCCGTGGTACAGATGCGTCGCTTGCGATTACACGTGTTCGAAAGCCTCGCGTTTAATCCAGCATTACTCCacgcataccggcgagaagccctTCAGGTGCAACAAGTGCACCGCGAGATTCTCCCGGATTTACTCACTGAAGATGCACCTGCTGACGCACACCGACGAGTTAAACTTCTCGTGCGTCAGTTGCGGGTTAAAATTCAAGTACCCCCAACGGCTGAGGCAGCACATGTACTCTCACATCGAGGGTGTACCCTACACGTGCGACAGGTGCAGCTTGAGCTTCGTCGAGGGCAGCGAGTTGCAGAAGCATCTGAGGATACACGCCGACGACAGGCCTTTCGCCTGTGAATTCTGCGAAGCCAAGTACAAATCCGTGGGCCTCCTGAATAAGCACATGTATACTCACACCGGTGTGAAACGTTACGAGTGCGAACTTTGTTCCTGTAAGTGTCGCACCAGGGAGCACCTGAAACTCCACATGTACGTCCACACGAACCAGAAGCCCGTCGTCTGCAGCGTTTGTAACGCGGCGTTCAAGTCCGAGCATCATTTAAGGGGGCACATGGCCGTGCACGAGGACGATAAGCCCTTTAAATGTGACGTGTGCGGTTTTGCGTTCAAGTCGTCTAGAAATTTAAGACCGCACCTGTTGACGCACACCGAGGATAAGCCCTTCAAATGTACGAGCTGCAGTTACCAATGTATTAGGGCGAAATATTTGAAAGCGCACATTGAAAAGAAACATGTAATATAA
- Protein Sequence
- MNIATHSGERPFTCEFCGCKYLHARRLKRHMLTHTDEKSFGCSSCDFRCKQKQNLTQHLLSHTAEKPFACKLCDYRCGQSRNLKRHVLRHSDEKPFACKLCDYKCQRADYIERHMLIHTDEKPFRCTFCGYKCRQSENLKMHVLTHTDEKPFGCKLCDYKCRQNGTLKRHMLTHTGEKPFGCDLCDYKCRRIEVLKRHTLIHTDEKLFSCDLCQFECRRPERLDEHMLTHGEEKPIGCDLCDYKCRQTGRYQQNTEYQHVTNTGENPHVLRYANEKPFSCTFCNYKCEQPGKLKRHMLTHTKEKRFRCTLCDYKSQQSGNLKQHMLTHTDEKPFTCTLCGCKFKQSGKLKRHMLTHTEEKPFACVLCNYQCLRAPSLERHMLTHTNEKPFGYQWNIKYQRVTHPDEKPFSCTLCDYKCNQSGNLRRHMLTHASEKHFSCTLCDYKCKQSGNMKRHMLTHTNEKPFGCVLCDYKCSQALSLKRHMLTHTSEKHLSCTLCDYKCKQSGNMKRHMLTHTNEKPFGCVLCDYKCSQALSLKRHMLTHTNEKPLSCTFCDYKCRQPENLRAHILKHTGKNPFCCNLCDYKCRYNGALRRHMLRHTDEKPVSCELCDYKCRQKEYLKRHMLIHTDEKPFSCDLCDCKCRRSDELDQHMLTHSGEKPIGYQQSLECQVVNNQTECGMCGPKFKSHKCNHVLIHTEEKPLKTGHQFREAKHTDEKPYSCESCGCKYLHAQSLKRHMLTHTDVKPFSCNLCDYKSQQKGYLKRHMLIHSDEKPFSCELCDCKCRRAYRRSFEYQVVNNLTVCEMCGPKFKSHKCNHVLKHAFKIGRQFREAERHIVKHTDEKQHMLTHRESDTKKPFSCDVCDYRCRQRRRLKEHLLTHTDEKPFSCTFCDYKCRQSGQLKGHMLTHSDEKPIRCDFCDYKCRLSGTMKRHVLRHTGEKRFACGLCGYKSPEAGSTLNNSNDFKSSAAGEKFVFQCIVCNYDILSEKELRYHVKQVHVTKEFKEKAKACGEPWYRCVACDYTCSKASRLIQHYSTHTGEKPFRCNKCTARFSRIYSLKMHLLTHTDELNFSCVSCGLKFKYPQRLRQHMYSHIEGVPYTCDRCSLSFVEGSELQKHLRIHADDRPFACEFCEAKYKSVGLLNKHMYTHTGVKRYECELCSCKCRTREHLKLHMYVHTNQKPVVCSVCNAAFKSEHHLRGHMAVHEDDKPFKCDVCGFAFKSSRNLRPHLLTHTEDKPFKCTSCSYQCIRAKYLKAHIEKKHVI
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00465176;
- 90% Identity
- iTF_00465176;
- 80% Identity
- iTF_00465176;