Scau006317.2
Basic Information
- Insect
- Stenodera caucasica
- Gene Symbol
- -
- Assembly
- GCA_037414795.1
- Location
- JAZBGZ010000743.1:19993-27406[-]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 24 0.00031 0.019 15.1 0.2 5 23 21 39 20 39 0.96 2 24 4.7e-06 0.00028 20.9 1.0 1 23 45 67 45 67 0.98 3 24 2.3e-06 0.00014 21.9 3.9 1 23 122 144 122 144 0.98 4 24 1.6e-05 0.00094 19.2 2.8 1 23 150 172 150 172 0.98 5 24 4.4e-07 2.6e-05 24.1 1.3 1 23 178 200 178 200 0.98 6 24 4.9e-07 2.9e-05 24.0 2.3 1 23 206 228 206 228 0.99 7 24 0.0013 0.078 13.2 0.3 1 23 262 285 262 285 0.93 8 24 0.00016 0.0097 16.0 0.6 1 23 291 313 291 313 0.97 9 24 5.3e-05 0.0031 17.6 4.2 1 23 319 341 319 341 0.98 10 24 4.8e-06 0.00028 20.9 3.7 1 23 347 369 347 369 0.98 11 24 0.0033 0.19 11.9 6.1 1 23 382 404 382 404 0.98 12 24 7e-05 0.0042 17.2 2.6 1 23 410 432 410 432 0.99 13 24 3.9e-05 0.0023 18.0 0.4 1 23 438 460 438 460 0.98 14 24 8.9e-07 5.3e-05 23.1 4.7 1 23 531 553 531 553 0.98 15 24 1.7e-05 0.001 19.1 1.2 2 23 558 579 558 579 0.96 16 24 3.3e-07 2e-05 24.5 2.0 1 23 585 607 585 607 0.98 17 24 0.00013 0.0077 16.3 3.6 1 23 613 635 613 635 0.98 18 24 1.1e-05 0.00064 19.7 0.1 1 23 664 686 664 686 0.97 19 24 9.8e-06 0.00058 19.9 1.0 1 23 691 713 691 713 0.98 20 24 5.4e-06 0.00032 20.7 1.0 1 23 719 741 719 741 0.98 21 24 2e-05 0.0012 18.9 0.4 1 23 747 769 747 769 0.98 22 24 7.1e-06 0.00042 20.3 3.6 1 23 775 797 775 797 0.98 23 24 2.4e-06 0.00014 21.8 2.1 1 23 803 825 803 825 0.98 24 24 5.3e-05 0.0031 17.6 2.7 1 23 831 853 831 853 0.98
Sequence Information
- Coding Sequence
- ATGACCAACAAGAACGGGCTCATACGTTTTATTCAAGATAGAAGAGCAGCAACTTTAGAGATTTGTTCAAAATCGTTCACTTACATTAGTTCGTTGAAAGAACATTTGAGAATTCATACTGGCGAAAAACCATACCATTGTAGCATCTGTGGTAGAGATTTTGTATTTAAGGGAAATTTACGAGCACATTGGAGGCTTCATATTGAGCAGAAACCGAATCAATGTTGCAATTGTGGCAAAATTCTTCCAAATAGTCATAAATGTGAACATATAAGCGAAAAACCTACAATTAGTCATGGAACTTTTACATCTGAGAAAAATCCACAGAGACATCGGATAATATTGTATACAGATAGGAAATCCTATCATTGTGAAACTTGTTCGAAATCGTTCACTTCAAGTTATTCGCTGAAAATACACCAGAGAGTCCATACAGGCGAAAGACCCTATCATTGTAACATTTGTGGTAAAGATTTTGTCTATAGGCAAAATTTCCGAGAACATTTGAGACTACATACGGGACAGAAACCGTATAGATGCGATATTTGTGATAAGAGTTTCTCGAATAGTAAACGTTTGAAAAATCATCTATTGGTACATACAGGCGAGAAACCATTCAAATGTAATATTTGTGGTAGGAATTTCGCTTTCAAATATAACCTACAATATCATTTACGATCGCATAGTTTCGTTGTAAAAAGTAATAGTGATGATTCTCAGAATAGTagtttcaaaattgaagaatcaATCGAATATTCGACCAATACTAATGGAGACAAATTATTCATCTGTAGTATGTGTAAAGAATCGTATCGAGAAGAGGATGCTTTGAAAAGACATTTTCAGTGGATACATAGCGGCGAGAAAAGGTATCCATGTCCTGCTTGTTCGAAAACTTTCATATCAAAGACTACATTAAAAGGACACTATAAACTCCATACAGGTGTTAAACCGTATCACTGTGATATTTGTGGTAAAGATTTCGTATTTAAACAACACTTAAAAGAACATACAAAAGTACATTCCGACGAGAAGCCATTTAAGTGCGACATTTGTAGCAAGACTTACAAACATAATTCATCTTTGATTTATCACATGAAGGAGCATAGAAATACCGAAAATACTCTCGAAAAGGAATTCACCTATATGTGTGAAATTTGCAGTAAATTATTCCATACGAAATGCAGTTTGAATAAACATCGACAAATACATTCAACAGAAAAACCGTATCAGTGTGATCTTTGCGGTAAGAAATTCTTAACTAAACAACGTTTAAATAAACATATGACAACGCATATTGGTCTGAAACCGTATCGGTGTAATATTTGTGATGCAAGATTTACAGCTAGTGGCTCTTTGAAGAGTCATTCACGGATACATAATGATGAGAAACCGTTCAGTTGTCATATTTGTAagaATAAAGAACTCACTGCTGATAAAATTGGAGATAAAACTAACGAGCGTCGCGACTTATACGATGATGAAAAGAAATCTTTTGTAGATTCGTGTAAATTAGAAGTTGACGATCAACTTTTCCAATCTGGTGACAATCTCGCAATTTTAGACCTCAATAAAGACACTGGAGATGACTATAATTGTCATGTTTGTGGTAAATCTTTCACtcatcaatttgaattgaatcaacatttaaaaattcattatgGTTCAAATACTTGCAGTATTTGTAACAAAACATTTTCAGCTAAATCCTATCTGAAAACCCATTTAGAATTACATAAAGATGAAAAACCACATCAATGTAGTATTTGTGGTAAGACATTTGCTTTAagtaaaaatttaaaaacacaTTTACGTGTACATACCGGTGAGAAACCGCATCAATGTACGATTTGTTTGAAGAATTTCACATCTAGTACAACGTTAAAAGTGCATTTTCGAATACATACAGGTGAAAAACCGTACCATTGTAATACTTGTGatgataatagtttaattcaacattcaacaacacaaaatggtgataaattatttatttgtaatatttgtaatGAGAGTTATCCCGATCAAATCGGCCTCAAACGGCATCTACGTTTACATAGTGGTGAGCGATTTGTTTGTGAGATTTGTAGTAAAAGTTTTAAATTGCGCGGCAGTTTAAGGACACATTTGAAGCTACATAGTGGCGAAAAACCATATTCATGTAATGTTTGTAATCGACGATTTCCATCAAAGTCACGTTTAAATATCCATCTAAAAGTTCATACTGGTGAAAAACCGTTCAATTGTGATATTTGCAGTGCTAGTTTTTCCGCAAAGTCCACATTAACATCACATTTGAGGATACATAGCAATGAAAAACCGTTCAGTTGTCACATTTGCAATAAGAGTTTCCGGGTGAAATGCGCTTTAACGGTTCACATAAGGGCGCATAGTGATGAGAAGCCGTTCAGTTGTCATATTTGTAATAAGAGTTTTCGTGTAAGGGCTGTTTTGAACATACATATGAAGACGCACAGTGGGGAAAAGCcgtttatttgtgaaatttgcaATAAGGGCTTCTCTTTAAAATGGTGTTTAAATAAACATCTGAAGAAACACGGGGAGAAAAGTAGGAATGATTTAGAAGAAaagaatttgtaa
- Protein Sequence
- MTNKNGLIRFIQDRRAATLEICSKSFTYISSLKEHLRIHTGEKPYHCSICGRDFVFKGNLRAHWRLHIEQKPNQCCNCGKILPNSHKCEHISEKPTISHGTFTSEKNPQRHRIILYTDRKSYHCETCSKSFTSSYSLKIHQRVHTGERPYHCNICGKDFVYRQNFREHLRLHTGQKPYRCDICDKSFSNSKRLKNHLLVHTGEKPFKCNICGRNFAFKYNLQYHLRSHSFVVKSNSDDSQNSSFKIEESIEYSTNTNGDKLFICSMCKESYREEDALKRHFQWIHSGEKRYPCPACSKTFISKTTLKGHYKLHTGVKPYHCDICGKDFVFKQHLKEHTKVHSDEKPFKCDICSKTYKHNSSLIYHMKEHRNTENTLEKEFTYMCEICSKLFHTKCSLNKHRQIHSTEKPYQCDLCGKKFLTKQRLNKHMTTHIGLKPYRCNICDARFTASGSLKSHSRIHNDEKPFSCHICKNKELTADKIGDKTNERRDLYDDEKKSFVDSCKLEVDDQLFQSGDNLAILDLNKDTGDDYNCHVCGKSFTHQFELNQHLKIHYGSNTCSICNKTFSAKSYLKTHLELHKDEKPHQCSICGKTFALSKNLKTHLRVHTGEKPHQCTICLKNFTSSTTLKVHFRIHTGEKPYHCNTCDDNSLIQHSTTQNGDKLFICNICNESYPDQIGLKRHLRLHSGERFVCEICSKSFKLRGSLRTHLKLHSGEKPYSCNVCNRRFPSKSRLNIHLKVHTGEKPFNCDICSASFSAKSTLTSHLRIHSNEKPFSCHICNKSFRVKCALTVHIRAHSDEKPFSCHICNKSFRVRAVLNIHMKTHSGEKPFICEICNKGFSLKWCLNKHLKKHGEKSRNDLEEKNL
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -