Mdil008835.1
Basic Information
- Insect
- Megalocaria dilatata
- Gene Symbol
- GIS1
- Assembly
- GCA_034642435.1
- Location
- CM067855.1:70863702-70880352[+]
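The Location value above appears to follow a `contig:start-end[strand]` layout. A minimal sketch for splitting it into fields is shown here; the helper name `parse_location` and the reading of the coordinates as 1-based and inclusive are assumptions, not something this record states.

```python
import re

# Assumed layout: <contig>:<start>-<end>[<strand>], e.g. CM067855.1:70863702-70880352[+]
LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text):
    """Split a location string into contig, start, end and strand (hypothetical helper)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return {
        "contig": match.group("contig"),
        "start": int(match.group("start")),
        "end": int(match.group("end")),
        "strand": match.group("strand"),
    }


loc = parse_location("CM067855.1:70863702-70880352[+]")
# Genomic span of the locus, assuming 1-based inclusive coordinates.
span = loc["end"] - loc["start"] + 1
print(loc["contig"], loc["strand"], span)
```

Under the inclusive-coordinate assumption the locus spans 16651 bp on the plus strand.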
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The zinc finger follows the pattern #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and the numbers give the number of residues (ranges in parentheses). The positions marked # are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined as the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
#   of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1   35  0.0031    0.34      12.6   0.3   2         23      176       198     175       198     0.94
2   35  0.0003    0.034     15.8   0.6   1         23      204       226     204       226     0.98
3   35  0.00061   0.068     14.8   4.7   1         23      232       254     232       254     0.95
4   35  4.3e-05   0.0048    18.4   1.8   1         23      260       282     260       282     0.98
5   35  0.0031    0.34      12.6   0.3   2         23      396       418     395       418     0.94
6   35  0.0003    0.034     15.8   0.6   1         23      424       446     424       446     0.98
7   35  0.00061   0.068     14.8   4.7   1         23      452       474     452       474     0.95
8   35  0.67      76        5.2    0.6   7         23      486       502     480       502     0.93
9   35  0.0031    0.34      12.6   0.3   2         23      616       638     615       638     0.94
10  35  0.0003    0.034     15.8   0.6   1         23      644       666     644       666     0.98
11  35  0.00061   0.068     14.8   4.7   1         23      672       694     672       694     0.95
12  35  0.67      76        5.2    0.6   7         23      706       722     700       722     0.93
13  35  0.0031    0.34      12.6   0.3   2         23      836       858     835       858     0.94
14  35  0.0003    0.034     15.8   0.6   1         23      864       886     864       886     0.98
15  35  0.00061   0.068     14.8   4.7   1         23      892       914     892       914     0.95
16  35  0.67      76        5.2    0.6   7         23      926       942     920       942     0.93
17  35  0.0031    0.34      12.6   0.3   2         23      1074      1096    1073      1096    0.94
18  35  0.0003    0.034     15.8   0.6   1         23      1102      1124    1102      1124    0.98
19  35  0.00061   0.068     14.8   4.7   1         23      1130      1152    1130      1152    0.95
20  35  4.3e-05   0.0048    18.4   1.8   1         23      1158      1180    1158      1180    0.98
21  35  0.0003    0.034     15.8   0.6   1         23      1312      1334    1312      1334    0.98
22  35  0.00041   0.046     15.3   4.3   1         23      1340      1362    1340      1362    0.95
23  35  0.00012   0.014     17.0   2.2   1         23      1368      1390    1368      1390    0.97
24  35  0.0031    0.34      12.6   0.3   2         23      1522      1544    1521      1544    0.94
25  35  0.0003    0.034     15.8   0.6   1         23      1550      1572    1550      1572    0.98
26  35  0.00061   0.068     14.8   4.7   1         23      1578      1600    1578      1600    0.95
27  35  4.3e-05   0.0048    18.4   1.8   1         23      1606      1628    1606      1628    0.98
28  35  0.0031    0.34      12.6   0.3   2         23      1760      1782    1759      1782    0.94
29  35  0.0003    0.034     15.8   0.6   1         23      1788      1810    1788      1810    0.98
30  35  0.00061   0.068     14.8   4.7   1         23      1816      1838    1816      1838    0.95
31  35  4.3e-05   0.0048    18.4   1.8   1         23      1844      1866    1844      1866    0.98
32  35  0.0031    0.34      12.6   0.3   2         23      1998      2020    1997      2020    0.94
33  35  0.0003    0.034     15.8   0.6   1         23      2026      2048    2026      2048    0.98
34  35  0.00061   0.068     14.8   4.7   1         23      2054      2076    2054      2076    0.95
35  35  4.3e-05   0.0048    18.4   1.8   1         23      2082      2104    2082      2104    0.98
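The C2H2 pattern given in the Description can be approximated with a regular expression. The sketch below is illustrative only: it treats the `#` positions as "any residue" (the record does not say which residues are allowed there), drops the leading `#-X`, and is not a substitute for the Pfam HMM behind the hmmscan hits. The test fragment is copied from the Protein Sequence field of this record; the names `C2H2_CORE` and `find_c2h2` are hypothetical.

```python
import re

# C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C]
# With '#' relaxed to any residue, X3-#-X5-#-X2 collapses to exactly 12 residues.
C2H2_CORE = re.compile(r"C.{1,5}C.{12}H.{3,6}[HC]")


def find_c2h2(protein):
    """Return (0-based start, matched text) for each non-overlapping core match."""
    return [(m.start(), m.group()) for m in C2H2_CORE.finditer(protein)]


# Fragment taken from the Protein Sequence field of this record.
fragment = "FKCKRCGKRYQDNDAYEIHIAKHGDDKPHKCSNCEKSFNYKADLKRHKYLHAA"

hits = find_c2h2(fragment)
for start, text in hits:
    print(start, text)
```

On this fragment the sketch reports two candidate fingers, starting at 0-based offsets 2 and 30. A relaxed pattern like this can both miss divergent fingers and report spurious ones, so the HMM scores in the table remain the more reliable evidence.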
Sequence Information
- Coding Sequence
- atggagaataatagtgatgtccatttttctcaactagaacagaatgttcatgaacggacagattcttttctatgtgaaaaatgcggaatgatcctttactcggaaatttcatttcatttccatctgcaatattaccatcataaacccttattcaataaatggagttctcttatagatgagtgtttgcaaactgaagtttccccaaatgcaacgaaccttgacgtcttaccgacggaaacgttcggtaacattgtcgaaatgccaaaagacttcaacaagaaaaacttcgatgataactgttcgatgaaggggaataatggtatacttgtttacccatcagatgatatgaaccttcttccagattcttacacaggcttggcgtacaagtataatgtcacaaacgaagtgaaatggattgataataccaaaaatcccacgcaagttcttccaacgattcaaaatatccattcgttaagcagcaatgatcaacttatttcaattaaagaaaatgttatgtgtgatttagtctgcgtgatatgtaataaaatcttcagcaaggccatgtatctcactcaacacaataaaagtattcactctggtgaaaagcccttcaaatgtaaacgttgtggaaagagataccaagacaacgatgcatatgaaatacacatcgccaagcatggagatgataagcctcacaaatgctcaaattgcgaaaaaagtttcaattacaaggccgacttaaagagacacaagtatttacacgcagctataaagccattctcttgtgaagtatgtaataaagctttcataaggaacgatcatatgctgaaacatgtgaaatcacacatacgcagggcacataataagactagtggtgttggagggggaaatgagtgtttgcaaactgaagtttccccaaatgcaacgaaccttgacgtcttaccgacggaaacgttcggtaacattgtcgaaatgccaaaagacttcaacaagaaaaacttcgatgatatctgttcgatgaaggggaataatggcttggcgtacaagtataatgtcacaaacgaagtgaaatggattgataataccaaaaatcccacgcaagttcttccaacgattcaaaatatccattcgttaagcagcaatgatcaacttatttcaattaaagaaaatgttatgtgtgatttagtctgcgtgatatgtaataaaatcttcagcaaggccatgtatctcactcaacacaataaaagtattcactctggtgaaaagcccttcaaatgtaaacgttgtggaaagagataccaagacaacgatgcatatgaaatacacatcgccaagcatggagatgataagcctcacaaatgctcaaattgcgaaaaaagtttcaattacaaggccgacttaaagagacacaagtatttacacgcagctataaagccattctcttgtgaagtaagtaataaagctttcataaggaacgatcatatgctgaaacatgtgaaatcacacatacgcagggcacataataagactagtggtgttggagggggaaatgagtgtttgcaaactgaagtttccccaaatgcaacgaaccttgacgtcttaccgacggaaacgttcggtaacattgtcgaaatgccaaaagacttcaacaagaaaaacttcgatgatatctgttcgatgaaggggaataatggcttggcgtacaagtataatgtcacaaacgaagtgaaatggattgataataccaaaaatcccacgcaagttcttccaacgattcaaaatatccattcgttaagcagcaatgatcaacttatttcaattaaagaaaatgttatgtgtgatttagtctgcgtgatatgtaataaaatcttcagcaaggccatgtatctcactcaacacaataaaagtattcactctggtgaaaagcccttcaaatgtaaacgttgtggaaagagataccaagacaacgatgcatatgaaatacacatcgccaagcat
ggagatgataagcctcacaaatgctcaaattgcgaaaaaagtttcaattacaaggccgacttaaagagacacaagtatttacacgcagctataaagccattctcttgtgaagtaagtaataaagctttcataaggaacgatcatatgctgaaacatgtgaaatcacacatacgcagggcacataataagactagtggtgttggagggggaaatgagtgtttgcaaactgaagtttccccaaatgcaacgaaccttgacgtcttaccgacggaaacgttcggtaacattgtcgaaatgccaaaagacttcaacaagaaaaacttcgatgatatctgttcgatgaaggggaataatggcttggcgtacaagtataatgtcacaaacgaagtgaaatggattgataataccaaaaatcccacgcaagttcttccaacgattcaaaatatccattcgttaagcagcaatgatcaacttatttcaattaaagaaaatgttatgtgtgatttagtctgcgtgatatgtaataaaatcttcagcaaggccatgtatctcactcaacacaataaaagtattcactctggtgaaaagcccttcaaatgtaaacgttgtggaaagagataccaagacaacgatgcatatgaaatacacatcgccaagcatggagatgataagcctcacaaatgctcaaattgcgaaaaaagtttcaattacaaggccgacttaaagagacacaagtatttacacgcagctataaagccattctcttgtgaagtaagtaataaagctttcataaggaacgatcatatgctgaaacatgtgaaatcacacatacgcagggcacataataagactagtggtgttggagggggaaatgagtgtttgcaaactgaagtttccccaaatgcaacgaaccttgacgtcttaccgacggaaacgttcggtaacattgtcgaaatgccaaaagacttcaacaagaaaaacttcgatgataactgttcgatgaaggggaataatggtatatttgtttacccatcagatgatatgaaccttcttccagattcttacacaggcttggcgtacaagtataatgtcacaaacgaagtgaaatggattgataataccaaaaatcccacgcaagttcttccaacgattcaaaatatccattcgttaagcagcaatgatcaacttatttcaattaaagaaaatgttatgtgtgatttagtctgcgtgatatgtaataaaatcttcagcaaggccatgtatctcactcaacacaataaaagtattcactctggtgaaaagcccttcaaatgtaaacgttgtggaaagagataccaagacaacgatgcatatgaaatacacatcgccaagcatggagatgataagcctcacaaatgctcaaattgcgaaaaaagtttcaattacaaggccgacttaaagagacacaagtatttacacgcagctataaagccattctcttgtgaagtatgtaataaagctttcataaggaacgatcatatgctgaaacatgtgaaatcacacatacgcagggcacataataagactagtggtgttggagggggaaatgagtgtttgcaaactgaagtttccccaaatgcaacgaaccttgacgtcttaccgacggaaacgttcggtaacattgtcgaaatgccaaaagacttcaacaagaaaaacttcgatgataactgttcgatgaaggggaataatggtatatttgtttacccatcagatgatatgaaccttcttccagattcttacacaggcttggcgtacaagtataatgtcacaaacgaagtgaaatggattgataataccaaaaatcccacgcaagttcttccaacgattcaaaatatccattccaaggccaagtatctcactcaacacaataaaagtattcactctggtgaaaagcccttcaaatgtaaacgttgtggaaagagataccaagacaacgatgcatatgaaatacacatcgccaa
gcatggagatgataagcctcacaaatgctcaatttgcgaaaaaagattcaattacaaggccgacttaaagagacacaagtatttacacgcagctataaagccattcttttgtgaagtatgtaataaagctttcataaggaacgatcatatgctgaaacatgtgaaatcacacatacgcagggcacataataagactagtggtgttggagggggaaatgagtgtttgcaaactgaagtttccccaaatgcaacgaaccttgacgtcttaccgacggaaacgttcggtaacattgtcgaaatgccaaaagacttcaacaagaaaaacttcgatgataactgttcgatgaaggggaataatggtatatttgtttacccatcagatgatatgaaccttcttccagattcttacacaggcttggcgtacaagtataatgtcacaaacgaagtgaaatggattgataataccaaaaatcccacgcaagttcttccaacgattcaaaatatccattcgttaagcagcaatgatcaacttatttcaattaaagaaaatgttatgtgtgatttagtctgcgtgatatgtaataaaatcttcagcaaggccatgtatctcactcaacacaataaaagtattcactctggtgaaaagcccttcaaatgtaaacgttgtggaaagagataccaagacaacgatgcatatgaaatacacatcgccaagcatggagatgataagcctcacaaatgctcaaattgcgaaaaaagtttcaattacaaggccgacttaaagagacacaagtatttacacgcagctataaagccattctcttgtgaagtatgtaataaagctttcataaggaacgatcatatgctgaaacatgtgaaatcacacatacgcagggcacataataagactagtggtgttggagggggaaatgagtgtttgcaaactgaagtttccccaaatgcaacgaaccttgacgtcttaccgacggaaatgttcggtaacattgtcgaaatgccaaaagacttcaacaagaaaaacttcgatgataactgttcgatgaaggggaataatggtatatttgtttacccatcagatgatatgaaccttcttccagattcttacacaggcttggcgtacaagtataatgtcacaaacgaagtgaaatggattgataataccaaaaatcccacgcaagttcttccaacgattcaaaatatccattcgttaagcagcaatgatcaacttatttcaattaaagaaaatgttatgtgtgatttagtctgcgtgatatgtaataaaatcttcagcaaggccatgtatctcactcaacacaataaaagtattcactctggtgaaaagcccttcaaatgtaaacgttgtggaaagagataccaagacaacgatgcatatgaaatacacatcgccaagcatggagatgataagcctcacaaatgctcaaattgcgaaaaaagtttcaattacaaggccgacttaaagagacacaagtatttacacgcagctataaagccattctcttgtgaagtatgtaataaagctttcataaggaacgatcatatgctgaaacatgtgaaatcacacatacgcagggcacataataagactagtggtgttggagggggaaatgagtgtttgcaaactgaagtttccccaaatgcaacgaaccttgacgtcttaccgacggaaatgttcggtaacattgtcgaaatgccaaaagacttcaacaagaaaaacttcgatgataactgttcgatgaaggggaataatggtatatttgtttacccatcagatgatatgaaccttcttccagattcttacacaggcttggcgtacaagtataatgtcacaaacgaagtgaaatggattgataataccaaaaatcacacgcaagttcttccaacgattcaaaatatccattcgttaagcagcaatgatcaacttatttcaattaaagaaaatgttatgtgtgatttagtctgcg
tgatatgtaataaaatcttcagcaaggccatgtatctcactcaacacaataaaagtattcactctggtgaaaagcccttcaaatgtaaacgttgtggaaagagataccaagacaacgatgcatatgaaatacacatcgccaagcatggagatgataagcctcacaaatgctcaaattgcgaaaaaagtttcaattacaaggccgacttaaagagacacaagtatttacacgcagctataaagccattctcttgtgaagtatgtaataaagctttcataaggaacgatcatatgctgaaacatgtgaaatcacacatacgcagggcacataataagactagtggtgttggagggggaagtaaacgacggtga
- Protein Sequence
- MENNSDVHFSQLEQNVHERTDSFLCEKCGMILYSEISFHFHLQYYHHKPLFNKWSSLIDECLQTEVSPNATNLDVLPTETFGNIVEMPKDFNKKNFDDNCSMKGNNGILVYPSDDMNLLPDSYTGLAYKYNVTNEVKWIDNTKNPTQVLPTIQNIHSLSSNDQLISIKENVMCDLVCVICNKIFSKAMYLTQHNKSIHSGEKPFKCKRCGKRYQDNDAYEIHIAKHGDDKPHKCSNCEKSFNYKADLKRHKYLHAAIKPFSCEVCNKAFIRNDHMLKHVKSHIRRAHNKTSGVGGGNECLQTEVSPNATNLDVLPTETFGNIVEMPKDFNKKNFDDICSMKGNNGLAYKYNVTNEVKWIDNTKNPTQVLPTIQNIHSLSSNDQLISIKENVMCDLVCVICNKIFSKAMYLTQHNKSIHSGEKPFKCKRCGKRYQDNDAYEIHIAKHGDDKPHKCSNCEKSFNYKADLKRHKYLHAAIKPFSCEVSNKAFIRNDHMLKHVKSHIRRAHNKTSGVGGGNECLQTEVSPNATNLDVLPTETFGNIVEMPKDFNKKNFDDICSMKGNNGLAYKYNVTNEVKWIDNTKNPTQVLPTIQNIHSLSSNDQLISIKENVMCDLVCVICNKIFSKAMYLTQHNKSIHSGEKPFKCKRCGKRYQDNDAYEIHIAKHGDDKPHKCSNCEKSFNYKADLKRHKYLHAAIKPFSCEVSNKAFIRNDHMLKHVKSHIRRAHNKTSGVGGGNECLQTEVSPNATNLDVLPTETFGNIVEMPKDFNKKNFDDICSMKGNNGLAYKYNVTNEVKWIDNTKNPTQVLPTIQNIHSLSSNDQLISIKENVMCDLVCVICNKIFSKAMYLTQHNKSIHSGEKPFKCKRCGKRYQDNDAYEIHIAKHGDDKPHKCSNCEKSFNYKADLKRHKYLHAAIKPFSCEVSNKAFIRNDHMLKHVKSHIRRAHNKTSGVGGGNECLQTEVSPNATNLDVLPTETFGNIVEMPKDFNKKNFDDNCSMKGNNGIFVYPSDDMNLLPDSYTGLAYKYNVTNEVKWIDNTKNPTQVLPTIQNIHSLSSNDQLISIKENVMCDLVCVICNKIFSKAMYLTQHNKSIHSGEKPFKCKRCGKRYQDNDAYEIHIAKHGDDKPHKCSNCEKSFNYKADLKRHKYLHAAIKPFSCEVCNKAFIRNDHMLKHVKSHIRRAHNKTSGVGGGNECLQTEVSPNATNLDVLPTETFGNIVEMPKDFNKKNFDDNCSMKGNNGIFVYPSDDMNLLPDSYTGLAYKYNVTNEVKWIDNTKNPTQVLPTIQNIHSKAKYLTQHNKSIHSGEKPFKCKRCGKRYQDNDAYEIHIAKHGDDKPHKCSICEKRFNYKADLKRHKYLHAAIKPFFCEVCNKAFIRNDHMLKHVKSHIRRAHNKTSGVGGGNECLQTEVSPNATNLDVLPTETFGNIVEMPKDFNKKNFDDNCSMKGNNGIFVYPSDDMNLLPDSYTGLAYKYNVTNEVKWIDNTKNPTQVLPTIQNIHSLSSNDQLISIKENVMCDLVCVICNKIFSKAMYLTQHNKSIHSGEKPFKCKRCGKRYQDNDAYEIHIAKHGDDKPHKCSNCEKSFNYKADLKRHKYLHAAIKPFSCEVCNKAFIRNDHMLKHVKSHIRRAHNKTSGVGGGNECLQTEVSPNATNLDVLPTEMFGNIVEMPKDFNKKNFDDNCSMKGNNGIFVYPSDDMNLLPDSYTGLAYKYNVTNEVKWIDNTKNPTQVLPTIQNIHSLSSNDQLISIKENVMCDLVCVICNKIFSKAMYLTQHNKSIHSGEKPFKCKRCGKRYQDNDAYEIHIAKHGDDKPHKCSNCEKSFNYKADLKRHKYLHAAIKPFSCEVCNKAFIRNDHMLKHVKSHIRRAHNKTSGVGGGNECLQTEVSPNATNLDVLPTEMFGNIVEMPKDFNKKNFDDNCSMKGNNGIFVYPSDDMNLLPDSYTGLAYKYNVTNEVKWIDNTKNHTQVLPTIQNIHSLSSNDQLISIKENVMCDLV
CVICNKIFSKAMYLTQHNKSIHSGEKPFKCKRCGKRYQDNDAYEIHIAKHGDDKPHKCSNCEKSFNYKADLKRHKYLHAAIKPFSCEVCNKAFIRNDHMLKHVKSHIRRAHNKTSGVGGGSKRR
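As a quick consistency check between the Coding Sequence and Protein Sequence fields, the coding sequence can be translated and compared with the listed protein. The sketch below assumes the standard genetic code and that the sequence is read in frame from its first base; neither is stated in this record. The function name `translate` is a hypothetical helper, and only the first 24 and last 15 nucleotides of the coding sequence are used.

```python
# Standard genetic code, built from the conventional TCAG ordering (an assumption here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    seq = cds.upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 24 and last 15 nucleotides of the Coding Sequence field.
cds_start = "atggagaataatagtgatgtccat"
cds_end = "agtaaacgacggtga"

print(translate(cds_start))
print(translate(cds_end))
```

The two translations, `MENNSDVH` and `SKRR`, agree with the first eight and last four residues of the listed protein.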
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00965361;
- 90% Identity
- iTF_00965361;
- 80% Identity
- -