Dhop036640.1
Basic Information
- Insect: Dorcus hopei
- Gene Symbol: ZFX
- Assembly: GCA_033060865.1
- Location: CM065425.1:75626667-75640805[+]
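The location string packs contig, coordinates, and strand into one field. A minimal sketch of splitting it apart follows; the function name `parse_location` is hypothetical, and the span length assumes 1-based inclusive coordinates, which the record does not state.

```python
import re

# Pattern for strings of the form contig:start-end[strand], as in this record.
LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text):
    """Split a location string into contig, start, end, strand and span length."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start = int(match["start"])
    end = int(match["end"])
    return {
        "contig": match["contig"],
        "start": start,
        "end": end,
        "strand": match["strand"],
        # Assumption: coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }


loc = parse_location("CM065425.1:75626667-75640805[+]")
print(loc["contig"], loc["strand"], loc["length"])
```

Under that coordinate assumption the gene spans 14139 bp on the plus strand of CM065425.1.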
Transcription Factor Domain
- TF Family: zf-GAGA
- Domain: zf-GAGA domain
- PFAM: PF09237
- TF Group: Zinc-Coordinating Group
- Description: Members of this family bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [1].
- Hmmscan Out
   #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
   1  20      0.06        83    4.9   0.2        18      52       156     190       147     191  0.88
   2  20     0.011        16    7.2   0.0        20      45       214     239       199     245  0.90
   3  20       2.9     4e+03   -0.5   0.0        22      44       244     266       240     268  0.88
   4  20      0.46   6.4e+02    2.1   0.1        21      52       350     381       340     382  0.85
   5  20    0.0058       8.1    8.2   0.0         6      46       391     431       388     438  0.87
   6  20       4.7   6.5e+03   -1.1   0.0        22      44       435     457       432     458  0.86
   7  20     0.044        61    5.4   0.1        21      52       576     607       566     608  0.90
   8  20     0.049        68    5.2   0.0        21      45       632     656       625     662  0.92
   9  20       1.6   2.2e+03    0.4   0.0        22      47       661     686       656     691  0.80
  10  20     0.021        29    6.4   0.2        23      53       690     720       686     721  0.88
  11  20      0.68   9.4e+02    1.6   0.4        17      50       748     780       737     785  0.79
  12  20     0.024        33    6.2   0.1        21      53       787     819       773     820  0.83
  13  20    0.0051       7.1    8.3   0.6        18      52       845     879       838     881  0.87
  14  20      0.28   3.9e+02    2.8   0.0        22      44       884     906       881     910  0.88
  15  20     0.064        89    4.8   0.2        18      46       907     935       903     942  0.83
  16  20      0.51   7.1e+02    1.9   0.2        21      51      1006    1035       993    1038  0.82
  17  20   0.00094       1.3   10.7   0.0        18      47      1094    1123      1090    1128  0.89
  18  20      0.48   6.7e+02    2.0   0.0        17      44      1149    1176      1141    1181  0.84
  19  20       8.2   1.1e+04   -1.9   0.0        21      43      1181    1203      1174    1209  0.83
  20  20      0.17   2.4e+02    3.5   0.1        21      48      1209    1236      1201    1239  0.92
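The per-domain hits above can be read programmatically. The sketch below parses an excerpt of five rows copied from the table and keeps those under an i-Evalue cutoff. The field names, the helper names `parse_hits` and `filter_by_ievalue`, and the cutoff of 10 are illustrative assumptions, not values used by the database.

```python
# Excerpt of the per-domain hits listed above (rows 1, 5, 13, 17, 19).
ROWS = """\
1 20 0.06 83 4.9 0.2 18 52 156 190 147 191 0.88
5 20 0.0058 8.1 8.2 0.0 6 46 391 431 388 438 0.87
13 20 0.0051 7.1 8.3 0.6 18 52 845 879 838 881 0.87
17 20 0.00094 1.3 10.7 0.0 18 47 1094 1123 1090 1128 0.89
19 20 8.2 1.1e+04 -1.9 0.0 21 43 1181 1203 1174 1209 0.83
"""

# Hypothetical field names, in the column order of the table.
FIELDS = (
    "index", "of", "c_evalue", "i_evalue", "score", "bias",
    "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to", "acc",
)
INT_FIELDS = {
    "index", "of", "hmm_from", "hmm_to",
    "ali_from", "ali_to", "env_from", "env_to",
}


def parse_hits(text):
    """Turn whitespace-separated rows into a list of dicts, one per domain hit."""
    hits = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) != len(FIELDS):
            continue  # skip blank or malformed lines
        hits.append({
            name: int(value) if name in INT_FIELDS else float(value)
            for name, value in zip(FIELDS, parts)
        })
    return hits


def filter_by_ievalue(hits, max_ievalue):
    """Keep hits whose independent E-value is at or below the cutoff."""
    return [hit for hit in hits if hit["i_evalue"] <= max_ievalue]


hits = parse_hits(ROWS)
kept = filter_by_ievalue(hits, 10.0)  # cutoff chosen only for illustration
best = max(hits, key=lambda hit: hit["score"])
print([hit["index"] for hit in kept])
print(best["index"], best["ali_from"], best["ali_to"])
```

In this excerpt, hits 5, 13 and 17 pass the illustrative cutoff, and hit 17 (alignment positions 1094 to 1123) has the highest bit score.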
Sequence Information
- Coding Sequence
- ATGGCCGGTCAAACGGCTGCAGCCGATAACGATTCCCAGGAAGCACAGCAATGCGTAAAGCGTGAGTTGGGAGTAGGGCTACCACACGAGATTGTAACCCAGTCGCAGAGTCAATTgtTGACCTCAAAAAGTACTCGCCCCGAAGTTTCGAGAGTCTCTCTTATCAAGGACAAGCAAACGAAACTGTTTAGTTGTCAACACTGCGACTACAAATCAAACCGTTCCTATTCCGTGAAGAGACATGTTTTGACGCACTTCAAGGGAGACCTACTCGTATGCGAGTTGAAACCTTTAAAAACGCACGCGAAGGAAGTTAAAAAACAACTGGTGTGTCAGATTTGCGGTTATAAAACGGATAAACTCTCGTATCTCAAGAGGCACGTCGAGAGACATCAGAAATCGTATCAGTGCGATCGCTGCGGCAACACGTTCAACGCACCTGAAAGCTTGCAATCGCATATGAGGCGACACACCGGGGAGAAACCGTACAGGTGcaacctttgcgattataaatgtacCCAATCCAGTCACCTCTCGACACATATGAAGctgcacaccggcgagaagcctttTCAGTGTGACAAGTGTGATTACAGAGCTGCCCTGAAAGGCAACCTCACGAAACATATGGTTAAGCATTTCGCGGAGAGGCCGTTCGCGTGTAACCTCTGcgattttaaaacgaaaacCTCCAAGAGTTTGAAGAGACATTTGAAGGCGCACACGGGCGAGAGGACGATAGAATGTAGCGTTTGCACTAAGCGATTCGTTAATTCGAAGGACATAAAGAGGCATATGTTGCCCAACGTCAAAGCTATTTCAATCCCCTCATCATTTTCTTCCGTCGAAAACGAGTTGAAACACACTAGGAAACTTACGAGGGAAGTTAAAAAGCAATTCGTATGTGAGACTTGCGATTATAAAGCGGACAAACTTTGGCATCTCAGGAGGCACTTCAACAGTCACGAGAAACCGTATCAGTGCGATCGCTGCGGCAACACGTTCACCGCACTTGCCCATTTGAAGTCCCATATGAGGCTACATACCGGGGAAAAACCGTATAAGTGCAAGCTTTGCGATTATGAATCTATTCAATCCTCTCACCTCTGGACACATATGAGGCTACATTCCGGCGCGAGGCCTTTTCAGTGTGACAAGTGCGATTACAGAGCTGCTCTGAAAAGTGGCCTCACGAAACATATGGTCAGACATTCCGCGGAAAAGCCGTTCAAGTGTAAGCTTTGCGATTTTGAAACGAAAACCTCCGGGGATTTAAACAGACATTTAAAGGTGCACACGGGCGAGAAGGCTTTAGAATGTGGCGTTTGCGCCAAAAGATTCGGCAATTCGAAGGATATAAAAAGGCATATGTGCCCCAACGTCGAAGCTATTTCAATCCCCTCATCATTTTCTTCCGTCGAAAAAAtgcaaattataaaaacagatCCTGATCTACTCTTATGCGAGTTGGAAACTTCAAACACTGAAATACTGGGAATTGTAAAAACAGATCCTGATCCACTCGTACGCGAGTTGAAACCTTCAAAGAAACACACGAGGGAAGTTAAAAAGCAACTCGTATGTGAGACTTGCGATTTTAAAGCGGACAAACTTTGGCATCTCAAGAGGCACCTCAGCAGTCACGAGAAACCGTACCAGTGTGATCGCTGCGGCAACACGTTCACCGCACTTGCCCACTTGAACTCGCATATGAGGCTACACACCGGGGAAAAACCGTATAAGTGCAACCTTTGCGATTACGAATGTATTCAGTCCTCTCACCTCTGGACACATATGAAGCTACACACCGGCGCGAAGCCGTTTCAGTGTGACAAGTGCGATTACAGAGCTGCTCTGAAAAGCAACCTCACGAAACATATGGTCATACATTTCGCGGAGAAGCCGTTCGAGTGTAACTTCTGcgattttaaaacgaaaacCTCCAAGAGTTTGAAGAGACATTTGAAGGCGCACACGGGCGAGAAGGCGTTAGAATGT
GGCGTTTGCGCTAAAAGATTCGGTAATTCGAAGGACGTAAAACGGCATATGTTAATACATACGGGGAAGAAACCGTTCGCGTGCAGCCTGTGCGATTATAGATCAACTCAGGCGGCAAATCTAAAATCGCATATGAAGCACAGACACAAACAGTTGCCTAACGTCGAAGGTATTTCTATCCCTTCCGTAGAGGAAATACGCGGTTACGAAACGAATCGTGCCCGTACTCCGAAGAAGCACGTTTTCAGGCATACCGGCGAAAAGCCGTACGAATGTAAATCGTGCGACTACAGGAGTGCTATGCCTGGAAATCTCAGGAAGCACGTGAGGGCACATCACGCTAAGGCAAAAGGCAAAGCGACCGAGTCTGATTTGACATGCGAGATTTGCGATTTTAAAACGACGCAAGCCGATTACTTGAGGAAGCACATGAAGAGACACGGCGATCAGAAACCGGGCCAGAGCTTGAAGTTCAATTGTCAGTACTGTGATTATAAGACGAATCGGGCCTACACCCTCAAGAAGCATGTTTTTAGGCACACCGGCGAAAAACCGTACGCGTGTAAATTGTGCGAGTACAGGTGCACCATGTCTGGAAGCCTCAACCGGCACATCAGGGTTCGTCATGCGAATCAAAAGGGCACAGGAGCCGAATCTGGGTTCACATGTGAGATTTGTGATTTCAAAACGACGCAAGCCGATTATTTGAAGAGGCACATGAGAAAGCACGGCGATCAGCGATTTGCCTGTAAATTGTGCGATTACCGAGGCACCGAGTCCAGAAACTTCAAGAGGCACATGATAATGCACGAGGTAAAGGAGAAAGGCGAAGGATATGAATATAAATTCACGTGCAAGATTTGTGGTTTTAAAACCATTCTAGCTAGTTACCTGAACAAACACATGAAGCGACACAGTGGTCACAAACCATCTAATGGTGTGTGGCACAAATGTGACTATTGCGATTATAAGTCGAGTCTCCCCTCCACTCTCAGGAAACATATGTATACCCATAgtggtgagaagccgttcgcATGTAAATCGTGCAGTTACAGATCCTGCACGTCCGGAAGCCTCCGAAGGCACTCGAAGACGCACATGGCGAAAGAAAAAGTCCAGGATGCTGACAAGAAATTCATGTGTGATGTCTGCAGTTTCAAAACCACGACGGCCAGATACCTGGCGAAACACATGTTGATACACAGCGTTGACAAGTCGTTCCAATGCGATCGCTGCGGCGACAAATTCTCGTCGAATACCATTTTGAAAATGCATACAATGCGACATACCGGGGAGAAACCGTACGAGTGCAACATTTGCGGCTATAAGTGCACCCAAAGCGGCTCGCTGAAACGTCACCTCGTGCTGCACACCGGCGTGAAAACCTTTCAGTGTAACAAGTGCGATTATAAGGCTGCCCTAAAAGGTACCCTCACGAAGCACATGGCTAAACACTCCACGGAGAAGCCGTTCGAGTGCAACGTCTGTAAATACAAGACGAAAACCTCTAATCATTTAAAAAGCCACATGCTATCGCACACCGGCGAGCAGTTGTATAAATGTAACATCTGTAATAAAGGATTCTCTCGCAACAAAGATATGAAGAAGCATACGTTGATACACACAGGAGAGAAACCGTTCGCGTGTAATTTATGCGATTACAGATCTACGCAGAAGTCGAACGTGAAGATACATATGAAGCACAGACATAAggaatgtttttttaatgaatttagtGCAAGTAAAATTGAAGCAACGCAAGACGATTGA
- Protein Sequence
- MAGQTAAADNDSQEAQQCVKRELGVGLPHEIVTQSQSQLLTSKSTRPEVSRVSLIKDKQTKLFSCQHCDYKSNRSYSVKRHVLTHFKGDLLVCELKPLKTHAKEVKKQLVCQICGYKTDKLSYLKRHVERHQKSYQCDRCGNTFNAPESLQSHMRRHTGEKPYRCNLCDYKCTQSSHLSTHMKLHTGEKPFQCDKCDYRAALKGNLTKHMVKHFAERPFACNLCDFKTKTSKSLKRHLKAHTGERTIECSVCTKRFVNSKDIKRHMLPNVKAISIPSSFSSVENELKHTRKLTREVKKQFVCETCDYKADKLWHLRRHFNSHEKPYQCDRCGNTFTALAHLKSHMRLHTGEKPYKCKLCDYESIQSSHLWTHMRLHSGARPFQCDKCDYRAALKSGLTKHMVRHSAEKPFKCKLCDFETKTSGDLNRHLKVHTGEKALECGVCAKRFGNSKDIKRHMCPNVEAISIPSSFSSVEKMQIIKTDPDLLLCELETSNTEILGIVKTDPDPLVRELKPSKKHTREVKKQLVCETCDFKADKLWHLKRHLSSHEKPYQCDRCGNTFTALAHLNSHMRLHTGEKPYKCNLCDYECIQSSHLWTHMKLHTGAKPFQCDKCDYRAALKSNLTKHMVIHFAEKPFECNFCDFKTKTSKSLKRHLKAHTGEKALECGVCAKRFGNSKDVKRHMLIHTGKKPFACSLCDYRSTQAANLKSHMKHRHKQLPNVEGISIPSVEEIRGYETNRARTPKKHVFRHTGEKPYECKSCDYRSAMPGNLRKHVRAHHAKAKGKATESDLTCEICDFKTTQADYLRKHMKRHGDQKPGQSLKFNCQYCDYKTNRAYTLKKHVFRHTGEKPYACKLCEYRCTMSGSLNRHIRVRHANQKGTGAESGFTCEICDFKTTQADYLKRHMRKHGDQRFACKLCDYRGTESRNFKRHMIMHEVKEKGEGYEYKFTCKICGFKTILASYLNKHMKRHSGHKPSNGVWHKCDYCDYKSSLPSTLRKHMYTHSGEKPFACKSCSYRSCTSGSLRRHSKTHMAKEKVQDADKKFMCDVCSFKTTTARYLAKHMLIHSVDKSFQCDRCGDKFSSNTILKMHTMRHTGEKPYECNICGYKCTQSGSLKRHLVLHTGVKTFQCNKCDYKAALKGTLTKHMAKHSTEKPFECNVCKYKTKTSNHLKSHMLSHTGEQLYKCNICNKGFSRNKDMKKHTLIHTGEKPFACNLCDYRSTQKSNVKIHMKHRHKECFFNEFSASKIEATQDD
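The coding and protein sequences can be cross-checked by translating the former. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) and treats lowercase bases as soft-masked rather than as different bases; `translate` is a hypothetical helper, and only the first and last codons of the record are used here.

```python
# Standard genetic code (assumed), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()  # lowercase bases are treated as soft-masked
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First six and last six codons of the coding sequence above.
print(translate("ATGGCCGGTCAAACGGCT"))
print(translate("GCAACGCAAGACGATTGA"))
```

The two fragments translate to `MAGQTA` and `ATQDD`, which agree with the start and end of the protein sequence listed above.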
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity: iTF_00465153; iTF_00467099; iTF_00466583
- 90% Identity: iTF_00465153; iTF_00467099; iTF_00466583
- 80% Identity: iTF_00465153