Avir004769.1
Basic Information
- Insect
- Altica viridicyanea
- Gene Symbol
- zfh1
- Assembly
- None
- Location
- GWHAMMQ00000182:427523-518166[-]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 47 5.1 1.6e+02 3.0 4.3 1 23 115 137 115 137 0.97 2 47 0.042 1.3 9.6 2.9 1 23 143 165 143 165 0.92 3 47 0.0001 0.0033 17.8 5.1 1 23 171 194 171 194 0.98 4 47 0.0086 0.27 11.8 1.8 1 23 206 229 206 229 0.92 5 47 0.0019 0.059 13.8 0.7 3 23 240 260 238 260 0.97 6 47 1.8e-06 5.7e-05 23.3 2.7 1 23 266 288 266 288 0.97 7 47 2.3e-05 0.00073 19.8 0.5 1 23 294 316 294 316 0.99 8 47 0.015 0.49 11.0 0.0 3 21 325 343 324 344 0.95 9 47 4.1 1.3e+02 3.3 0.9 2 21 407 426 406 428 0.78 10 47 3 93 3.8 6.6 1 20 462 481 462 484 0.80 11 47 0.00073 0.023 15.1 2.5 1 23 490 513 490 513 0.97 12 47 0.003 0.095 13.2 0.7 1 23 525 548 525 548 0.95 13 47 0.2 6.2 7.5 1.0 3 23 559 579 557 579 0.96 14 47 0.00036 0.011 16.1 0.2 1 23 585 607 585 607 0.98 15 47 4.5e-05 0.0014 19.0 3.7 1 23 613 635 613 635 0.99 16 47 0.22 6.9 7.3 0.2 2 23 680 701 679 701 0.92 17 47 0.023 0.73 10.4 5.4 1 23 707 729 707 729 0.98 18 47 0.0019 0.059 13.8 0.3 1 23 735 757 735 757 0.98 19 47 0.0006 0.019 15.4 5.8 1 23 763 786 763 786 0.98 20 47 0.032 1 10.0 1.7 1 23 792 815 792 815 0.90 21 47 0.039 1.2 9.7 1.7 3 23 824 844 822 844 0.97 22 47 4e-06 0.00012 22.3 0.3 1 23 849 871 849 871 0.98 23 47 8.5e-05 0.0027 18.1 3.5 1 23 877 899 877 899 0.98 24 47 0.13 4.1 8.0 3.0 1 23 977 999 977 999 0.96 25 47 1.2 36 5.1 2.6 1 23 1011 1033 1011 1033 0.98 26 47 1.1 35 5.1 2.1 1 11 1039 1049 1039 1053 0.88 27 47 6.6e-05 0.0021 18.4 1.7 1 23 1067 1090 1067 1090 0.97 28 47 0.02 0.61 10.6 3.8 1 23 1102 1125 1102 1125 0.95 29 47 0.00039 0.012 16.0 0.2 2 23 1135 1156 1134 1156 0.97 30 47 0.00018 0.0055 17.1 3.5 1 23 1162 1184 1162 1184 0.97 31 47 1.4 44 4.8 1.3 3 23 1273 1293 1272 1293 0.97 32 47 0.51 16 6.2 4.6 1 23 1299 1321 1299 1321 0.97 33 47 0.11 3.3 8.3 0.4 2 23 1325 1346 1324 1346 0.91 34 47 0.00031 0.0096 16.3 2.0 1 23 1352 1375 1352 1376 0.97 35 47 0.0023 0.072 13.6 1.9 1 23 1387 1410 1387 1410 0.95 36 47 2.5e-05 0.00079 19.7 0.7 3 23 1421 1441 1419 1441 0.97 37 47 0.00025 0.0078 16.6 2.4 1 23 1447 1469 1447 1469 0.98 38 47 0.0012 0.038 14.4 0.4 1 23 1475 1497 1475 1497 0.97 39 47 0.75 24 5.7 0.9 2 23 1546 1567 1545 1567 0.87 40 47 0.00022 0.0068 16.8 2.7 1 23 1573 1595 1573 1595 0.98 41 47 0.021 0.65 10.6 1.3 1 19 1598 1616 1598 1619 0.90 42 47 0.00056 0.017 15.5 0.9 1 23 1624 1647 1624 1647 0.97 43 47 0.18 5.7 7.6 1.0 1 23 1659 1682 1659 1682 0.96 44 47 0.0018 0.056 13.9 2.2 3 23 1693 1713 1691 1713 0.97 45 47 2.5e-05 0.00078 19.7 0.8 1 23 1719 1741 1719 1741 0.98 46 47 0.3 9.3 6.9 0.2 1 23 1747 1769 1747 1769 0.95 47 47 0.00026 0.008 16.6 1.0 2 21 1776 1795 1775 1796 0.93
Sequence Information
- Coding Sequence
- ATGCCTTTAAAACCACCAAAAAATGAGAGTATATCTTGTCCAATTTTAAAATTAGAAAAAGAAGATCCGGATTTTATACCACCGAAATCGAGTAATAATACAGAAGTTCCACCAGATAAACCAAAACGAGGTAGAAAGAAAGGTAGTAAAAACTTACCTAAAGACGATAAAGCGAAACAGGAAAGAAATCGACGTAAAAATCAAAATCCTAAAAAAATTATACTACACCATCCTCCACCATTAGAATTACCAGAACCAATACAATGCGATAATTGTAGTGCAGAATATACAAATAATGTAGATTTCGCTTTTCACTCATTAAGCCATAGTGATGATGGAAAATATACTTGTCATTTATGCAACTATAAAAATGGTTCAAAATATCACATGGAAATGCATGTAAGAGCACATGAAGGGACTACAAAGTACAAATGCGAAATTTGCGATAAAGCTTTTACTATTAGTACACATGCAATTGAACATAAACATTTCCATACTGGAGAAAAACCATTCCAATGTGAGATTTGTGGGAAACATTTTATGTTCTCATGGCATTTAACATCACACAGAAGAACTTCTCATTATGAAATACTTACAGGTAAACCTTTGGTAAAATTCGACTGTCTAGAATGCAATAAACACTACGAATCCGCCAGTGGACTTCGAAGACACAATATAAAAAAACACAAAACCAGCGAAATAGATTTATCAGTAATATGTGAAATCTGTGGTAAAAGATTATCGAGCAAAGAAAAACTAAAATTCCACCTTCGCACTCACACCGGTTACAAACCACACGCTTGTCACGTTTGTCCAAAAAGTTTTTCGAAAAAAGATCAACTAATAGAACACATTCGTACTCATACCGGCGAAAAACCGTACGTTTGTAAATTATGCGGCAAAGGATTCGCTCAACGAACTCCACTTAAAACTCACGAACGAACTCATAATGTTGATCGACCTGGCAGTGCTTGTACTTTTTGTGGATTAGTTTTTAGAGCAAAAATCGAATTAGAGAGTCATATGAAAGTTTTAACTACTTTTAGATTGAAAATTGAAAATTCCAATGATGTTCATCCACCGAAAAGAAAATACACCAAACGCATTAAAAAAGATCCAGAAGACACAGAAAAGAAGGCGAAAAGAGAAAGTAAAAAACGAGGAAGATATCGACAGACATCACCGTTACCAGAACCAATTAAATTAGAAGAACCATTACAATGTGAACATTGTAAAAAAGATTTTTTAACAAATCTTGATTTTGCGATGCATTCCATCGAACATAGCGCTGATGATACTTATTCCTGTCACTTATGTACATTTCAGAACACTTGTAAATATAGATTTGAAACTCATATTAGATTACATGACGGCACAACAAAACACAAATGCAAAATATGTGGTAAAGCTTTCAGGTGCAATACTCACGCAGAAGAACATAAATATTTCCACACCGGAGAACGTCCTTTTCAATGTGAAATCTGCGGCAAACATTTTATGTATTCACGAAGATTAGCAAATCATCGCAGAAATATTCATTATCAAACCATTACAGGAAAACCTAGAGTTAAATTTGATTGTGAGTATTGCGATAAAAAATTTGATTCCATTGCAGGACTAACTAGTCATAGTAATAAAAAACACTTAAATATAGTTAATGATATATCAGTTATTTGTGAAATATGCGGTAAAAGAATATCATGTAAAGGTAGATTAAAGTATCATCTTAGAGTACACAGCGGAGATAAACCGTTTTCTTGTATGGTATGTCCTAAAAAATTCGCAATGAAGGATTTATTAATTGAACACACTAGAGTTCATACCGGTGAAAAACCGTATGTTTGTAAATATTGCGGTAAAACTTTTGGACACAGATCGCCTTATAGATACCATATTAAAACACATACTGGTGAAAAAAACTATGTTTGTCCTATTTGTAGTAAAGCGAATAAAAACATATCTGATAAAAGAAAAATTAGTAAACTGCCAAAAAATAAAAGACAAACCAATGAACCATTAATATTAGATGAAGAAATTCAATGTGGAAACTGTGAGGAATCATTTAACACTAATTTATCGTTTGCTATACATTCAATAAATCATCGAAAAGATGGGTTATATCATTGCCATTTATGTGATACAACACTTAAGACTAAATACACCATAGAAACCCATGTTCGAACACATGATGGTTGTTTACGTTATAAATGTGATATTTGTGGTGATTATTTTAACGTAAAAAGCGAGTTAAGAGATCATGAAAATTCCCATTCTGGTTCTAGTCCATACCAATGTGAGGTTTGTGGTAAAAATTTCCATTTCCAAAAACATCTATCATCACATCGACGAAGTTTACATGCAAAATCAAAATCGTTTGAATGTAAAATTTGCAATATAATCTATAAATTCCCTTCAGGATTGTATCGCCATAATCATTTAAAACACAAAAACATAGACGCATCTGTTATATGTGATATTTGTGGAAAACGATTATCTTGTAGAGAAAAATTAAAATCGCATCAAAAGATACATTTAGGACAACTATTTCCTTGTTTAGTTTGCTCGAAACCCTTTACAAGAGAAGTTCAGTTACGGGTTCATATGAGAACTCATACCGGAGAAAAACCATATGTTTGTAAATATTGCGGAAAAGCGTTTAAACAACATTCTCCATATAGATATCATATTAAAAACCATATTGGATTACATACTTTGTTATGTGTTTTATATTTATCTAATACATCTTTTTTTAGGTTTGCTGATGATGATTTGAAATCTCTTTTTACACAATATCTTGACGACGAAGATTCTTTTAAAACTTTACCTCCAAAACTTAAAAACCCAATAAATTGTCCAAAATGTAATGCAACTTTTTTAACTAATTTAGATATGGGTCTTCACTCAAAAACACACAGTGAAAATAATGAATATAATTGTCATTTATGTAAGTTCAGTCTAATACAAAAATCTGGATTTGAAACACATATATTCAACCACGAGAAATTAAAAAAATATCATTATCCACATATATACATCTGTCATTTGTGTGATTTTAAAGAAAACCAAAAATTTGCCTTCGAGTTACACATTAGAAAACATGAAGGAATTAAACGATTTAAATGTCCAATTTGCAATAAACAATTTCAAGATATCTCATGCGCACTGAACACAAATACTTTCATACCGGAGAAAAAACCGTTCCAATGTGAAATTTGCGGAAAACATTTTCCTTATACCAAAAGTCTAGCTGCACATAGACGTAATACACATTATGAAATCATTACGGGAAAACCGTTACAAAAATATGATTGTAAAGTATGTAACAGACACTACGAAAGCCATTCTGGTTTAGTACGACATTATTCTTCTTCTCATAAAGAACTTGGTTACGACTATTCAGTAGTTTGTGATATTTGCGGTAAAACCATCGCTACTAGAAATAAACTAAAATACCATCTCAGAGTCCATACAGGTTATAAACCACATGAATGTAATGTATGTCATAAAGGTTTTTCGAAAAAACAGGAACTTAAAATACACGAACGAGTACATACTGGGGAAAAACCATTTACTTGTTCTTATTGTGTTGTGTGGCAAGAAAATTCTGTTTTTGTATTCAATATAAATATGTGTAACAAACATAGTAAGTATAAACATTATTTTCACATAATTTTATTAACACACACCTTTTTTAGGGATATCAAAAAATCACCATCAACTACAAAGAAAAAAACACCATTAATAATACCGGAACCTTGTATAGATGAAGAGACTACAAAACTTAGAGAAATATTAAAAATAGATAAACCACTATTATGTACATCTTGTGAAGAACCATTTACCTCACACTTAGATTTAGCATATCATTCTAAAAAACATAACGAATCTAAATTGCACAAATGTCATTTATGTCAATTTACTGTAGCTACAAAATATAGAATAAAACGACACATTAACGGACACGCCGGATGGAAATGTGATTTATGCAATAAAATCTTCAGAAAAGCCAAAGCTGCTTTAAAACATTCTTACATCCACACTGGAGAGAAATTATATCAATGTGAAATATGTGGTAAACATTTAGCAAACTCCAAAAGTCTGGATACTCATCTGAATACAATCCATCACGAAATAATAACAGGAAAACCATTAGTGAAGTTCGACTGTCCAATATGTAAAAAACATTACGAATCCGAAACGGGTCTTCGGAGACACTACTCGAGTAACCATAAAGAAATGGGTGTTGATCTAACAGTTATTTGTGAAATTTGTGGTAAAAGAATTTCGAATCGAACACGATTAACCCGACACATGCGGACTCATAACGGTGAAAAACCGTTTCCTTGTTCGGTTTGCAATCGGAATTTCGCGACAAAGTGTTTACTTACGTCGCATACGCGAGTACACACCGGGGAAAAACCGTACGCTTGTTCTTTTTGTGGTAAAAGATTCGGACAGTCAGCTCCATATCGATATCATATTAAAATACATACAGGAGATCGAAAATATCATTGTAAAATGTACGAAATGAAACGTGAACGAAAATCTAAAAAGAAAAATCTACCAAAAAACTTGAAATTCACAAAAGAAATCGAAAACAAAATCCAAAACCTTCTAGTCACAAACCAACCAATGGAATGCCAAATCTGTAAAGAATCCTTTAAAAATTTATTAGATTTCGCTTTTCATTCAAAAACTCACGAAAAGAACAATTTCTATTCCTGCCATGTTTGCAACAAAAAAGTAGCAAATAAAAAACAATTCGAACGCCACATATTATCGCACAAAAGCTTCAAATGTGAAGTTTGCAATAAGATTTTTAAACGAAAATTTACTGCCCTAAACCACGCACACAGTTGGGAAAAATTGTTTCAATGTAAAATTTGTAGTAAACATTTAAGCACAGCTTGGGCTTTAAATAATCATTTAACTACTGTTCATGAAGAGAATTTTAACGATGGACAAATATTTAAATACCAATGTACAATTTGTTTAAAATCATATTTATACGAAAGTGGTTTAAAGTTACATTATTCTAGTCAACACAAAGAATTGGGAATCGATTTTAGTGTTATTTGTGATGTTTGTGGTAAAAGTTTAAGTTGTAAAAGTAAACTAAAACAACATCTACGAACACATTCCGGTGATAAACCATTTCCTTGTCAAATTTGTTCGAGAAAATTTATCACTAAAGATCTTATGACATCACATATGCGAGTACATACCGGGGAAAAACCGTATGTTTGTATTTATTGCGGTAAAAAATTCGGACAAGGTGCTCCTTATCGTTATCATCTTAAAATACATACCGGAGAGAAATCTTGTAGTTGTCATATTTGTGGAAAAGAATTTATATCTAAGGGAAATATGAGGATACATTTGAAGGCTTGTACTATGAAGAAACAGTTTATTGAGGTTAAGGTGGAGTCACATTATGCTACTTAG
- Protein Sequence
- MPLKPPKNESISCPILKLEKEDPDFIPPKSSNNTEVPPDKPKRGRKKGSKNLPKDDKAKQERNRRKNQNPKKIILHHPPPLELPEPIQCDNCSAEYTNNVDFAFHSLSHSDDGKYTCHLCNYKNGSKYHMEMHVRAHEGTTKYKCEICDKAFTISTHAIEHKHFHTGEKPFQCEICGKHFMFSWHLTSHRRTSHYEILTGKPLVKFDCLECNKHYESASGLRRHNIKKHKTSEIDLSVICEICGKRLSSKEKLKFHLRTHTGYKPHACHVCPKSFSKKDQLIEHIRTHTGEKPYVCKLCGKGFAQRTPLKTHERTHNVDRPGSACTFCGLVFRAKIELESHMKVLTTFRLKIENSNDVHPPKRKYTKRIKKDPEDTEKKAKRESKKRGRYRQTSPLPEPIKLEEPLQCEHCKKDFLTNLDFAMHSIEHSADDTYSCHLCTFQNTCKYRFETHIRLHDGTTKHKCKICGKAFRCNTHAEEHKYFHTGERPFQCEICGKHFMYSRRLANHRRNIHYQTITGKPRVKFDCEYCDKKFDSIAGLTSHSNKKHLNIVNDISVICEICGKRISCKGRLKYHLRVHSGDKPFSCMVCPKKFAMKDLLIEHTRVHTGEKPYVCKYCGKTFGHRSPYRYHIKTHTGEKNYVCPICSKANKNISDKRKISKLPKNKRQTNEPLILDEEIQCGNCEESFNTNLSFAIHSINHRKDGLYHCHLCDTTLKTKYTIETHVRTHDGCLRYKCDICGDYFNVKSELRDHENSHSGSSPYQCEVCGKNFHFQKHLSSHRRSLHAKSKSFECKICNIIYKFPSGLYRHNHLKHKNIDASVICDICGKRLSCREKLKSHQKIHLGQLFPCLVCSKPFTREVQLRVHMRTHTGEKPYVCKYCGKAFKQHSPYRYHIKNHIGLHTLLCVLYLSNTSFFRFADDDLKSLFTQYLDDEDSFKTLPPKLKNPINCPKCNATFLTNLDMGLHSKTHSENNEYNCHLCKFSLIQKSGFETHIFNHEKLKKYHYPHIYICHLCDFKENQKFAFELHIRKHEGIKRFKCPICNKQFQDISCALNTNTFIPEKKPFQCEICGKHFPYTKSLAAHRRNTHYEIITGKPLQKYDCKVCNRHYESHSGLVRHYSSSHKELGYDYSVVCDICGKTIATRNKLKYHLRVHTGYKPHECNVCHKGFSKKQELKIHERVHTGEKPFTCSYCVVWQENSVFVFNINMCNKHSKYKHYFHIILLTHTFFRDIKKSPSTTKKKTPLIIPEPCIDEETTKLREILKIDKPLLCTSCEEPFTSHLDLAYHSKKHNESKLHKCHLCQFTVATKYRIKRHINGHAGWKCDLCNKIFRKAKAALKHSYIHTGEKLYQCEICGKHLANSKSLDTHLNTIHHEIITGKPLVKFDCPICKKHYESETGLRRHYSSNHKEMGVDLTVICEICGKRISNRTRLTRHMRTHNGEKPFPCSVCNRNFATKCLLTSHTRVHTGEKPYACSFCGKRFGQSAPYRYHIKIHTGDRKYHCKMYEMKRERKSKKKNLPKNLKFTKEIENKIQNLLVTNQPMECQICKESFKNLLDFAFHSKTHEKNNFYSCHVCNKKVANKKQFERHILSHKSFKCEVCNKIFKRKFTALNHAHSWEKLFQCKICSKHLSTAWALNNHLTTVHEENFNDGQIFKYQCTICLKSYLYESGLKLHYSSQHKELGIDFSVICDVCGKSLSCKSKLKQHLRTHSGDKPFPCQICSRKFITKDLMTSHMRVHTGEKPYVCIYCGKKFGQGAPYRYHLKIHTGEKSCSCHICGKEFISKGNMRIHLKACTMKKQFIEVKVESHYAT
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -