Cfla010855.1
Basic Information
- Insect
- Cotesia flavipes
- Gene Symbol
- zfh1
- Assembly
- None
- Location
- scaffold:102-17625[+]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 46 0.02 1.9 9.0 1.7 1 23 72 95 72 95 0.96 2 46 0.016 1.6 9.3 0.2 2 23 102 124 102 124 0.93 3 46 0.00014 0.014 15.8 2.9 1 23 130 152 130 152 0.97 4 46 0.0014 0.14 12.7 0.9 1 23 156 178 156 178 0.98 5 46 5.5e-05 0.0054 17.1 3.0 3 23 186 206 185 206 0.97 6 46 0.0012 0.12 12.9 0.1 1 23 212 234 212 234 0.98 7 46 0.0031 0.31 11.6 0.5 1 23 319 341 319 341 0.91 8 46 2.2e-07 2.1e-05 24.7 1.1 1 23 347 369 347 369 0.99 9 46 1.2e-06 0.00011 22.4 0.1 1 23 375 397 375 397 0.98 10 46 0.00042 0.041 14.3 1.7 2 23 404 426 401 426 0.93 11 46 2.2e-05 0.0022 18.3 6.7 1 23 431 453 431 453 0.97 12 46 3.4e-05 0.0034 17.7 1.5 1 23 462 484 462 484 0.98 13 46 9.1e-06 0.0009 19.5 0.9 1 23 490 512 490 512 0.97 14 46 0.0001 0.0099 16.2 1.3 2 23 519 540 518 540 0.97 15 46 6.8e-06 0.00067 19.9 3.3 1 23 546 568 546 568 0.98 16 46 4.1e-05 0.004 17.5 0.4 1 23 574 596 574 596 0.96 17 46 0.79 78 4.0 1.1 1 23 670 693 670 693 0.91 18 46 2.7e-07 2.7e-05 24.3 0.6 3 23 702 722 701 722 0.98 19 46 8e-06 0.00078 19.7 3.3 2 23 752 773 751 773 0.96 20 46 0.00024 0.024 15.0 4.6 2 23 779 800 778 800 0.97 21 46 0.00064 0.063 13.7 0.1 1 23 805 827 805 827 0.98 22 46 9.9e-07 9.7e-05 22.6 2.6 1 23 835 857 835 857 0.98 23 46 0.56 55 4.5 2.3 3 23 865 886 863 886 0.94 24 46 0.0014 0.13 12.7 0.4 1 23 994 1017 994 1017 0.93 25 46 1.8 1.7e+02 2.9 0.3 1 23 1029 1052 1029 1052 0.92 26 46 0.038 3.7 8.1 5.1 1 23 1065 1088 1065 1088 0.94 27 46 0.00045 0.044 14.2 5.4 1 23 1094 1116 1094 1117 0.95 28 46 0.012 1.2 9.7 0.6 3 23 1124 1145 1122 1145 0.95 29 46 8e-05 0.0079 16.6 0.7 1 23 1150 1173 1150 1173 0.98 30 46 6.2e-07 6.1e-05 23.2 0.4 2 23 1180 1201 1179 1201 0.96 31 46 5.7e-07 5.6e-05 23.3 0.7 1 23 1207 1229 1207 1229 0.96 32 46 6.8e-05 0.0067 16.8 1.0 1 23 1235 1257 1235 1257 0.98 33 46 0.00058 0.056 13.9 3.2 1 23 1402 1424 1402 1424 0.97 34 46 1.6e-06 0.00015 22.0 1.1 1 23 1433 1456 1433 1456 0.93 35 46 9.2e-05 0.0091 16.4 0.8 2 23 1463 1484 1462 1484 0.96 36 46 1.9e-07 1.8e-05 24.9 0.6 1 23 1490 1512 1490 1512 0.99 37 46 0.92 90 3.8 4.7 1 23 1544 1567 1544 1567 0.96 38 46 7.7e-06 0.00075 19.8 0.3 1 23 1573 1595 1573 1595 0.97 39 46 0.00041 0.04 14.3 4.3 1 23 1601 1623 1601 1623 0.98 40 46 9e-05 0.0088 16.4 0.2 1 23 1629 1651 1629 1651 0.98 41 46 0.0018 0.18 12.3 2.9 1 23 1656 1679 1656 1679 0.97 42 46 3.3e-05 0.0032 17.8 0.5 1 23 1691 1714 1691 1714 0.96 43 46 0.0052 0.51 10.9 0.1 1 23 1722 1744 1722 1744 0.96 44 46 4.8e-06 0.00047 20.4 1.4 3 23 1752 1772 1751 1772 0.98 45 46 1.5e-06 0.00015 22.0 4.6 1 23 1778 1800 1778 1800 0.98 46 46 1.9e-05 0.0019 18.5 1.6 3 23 1808 1828 1806 1828 0.97
Sequence Information
- Coding Sequence
- TATATTGGTCTGGGTCCGTCGTACGCTTACGCGAAGTATGCAACCGTAGAGCAAATACCGATAACAGCTATCAAAAGCGATACTATCCCCGCCAAGTCAAAGCCAAAGCCAAAGCCAAAGCCAAAGCCAAAGCCAGTGAAGCTGAAACTTGACCCTGAGAAGGTGAAATCCGAGCCGGAAAAAAATAAAGAAAAACCCAAAAGTGGAAAAAGGTACAAGTGCAACATCTGCGGGAAAGAGTCCTGGAATTTTCACCAGTGGGAGCGACACTACCGGCTGGACCACGGGAAGCCCAAAGCTGAGAAATGCGGGCAGTGCGATAGAATTTTTTACGACGACGAGAAACTCGCGAGACACGTGGTGAAGATGCACCAGGAGGAGCGACCGTTTCCCTGTCACATCTGCAACAAGCCCTTCAAGACCGACAAGTTCCTGAGGATCCATTTGAAGCAGCACAACAAGCGGTACTGCTGCGAGATCTGCGGGATGTCCAAGGTCTCTCCTTACGACTTGCGGCTCCACATGATGAAGCACAACAAGGAGTACTACGCGCACTGCGAGTACTGCCAGCGAGGGTTCTACACCAAGCAGTCTCTCCAGCGACACATTCTCACCCACACCGGCGAGCGGCCCTTCGTCTGCAGAATTTGCAACGCCTCCTACGCCAGTGCCTCCTACTTAAGTCTCCATGCCAAAAGCCACACGCAGAGGGAGAAATTCAAGTGCAACCTCTGCGACTACGAGACCTTCTGGAAGACCTCGCTCAAGGGCCACCATGGATCGCTCAATGCTCATATGAAGGGACACAAAAATATGCTCGAGAAGGTTGAAGCTATTGACTTTGAGGAGCTGGAGGAAGAATTGTTTCACTCGGAAATAAAGCCGGAGTTGCTAATAAACAGTACAAAAAGCCCAGGGATTCAAAAGAAGCTAAAAGAAGAAGAAGAAAAAGATTACGTCTGTAAAATCTGTAACAAGCAGTTCAGATTGAAGAACCTGTTCGAGGGGCACTTGGTGGCGCACAGCGACTTGCGACCGTACCAGTGTGACATCTGCAACAAGTCATTCAAGCGAACGAGTACCCTGGCAGTGCACCGGAGGATCCACACGCGGGAGCGGAACTTTATCTGCGATGTGTGCGGGCGCGCTTTTATCCAGGCGTCCCAGTTGGCGATGCACCAGCGAAGACACTTCGAGAAGTACATCTGCTCCTGTGAACTCTGTGGTAAGGGTTTCTTCACCAACGCGGAGTTGCACGGCCACATGAACATCAAGCACGGCGCCAAGGAGCACGTGTGCCACCACTGCGGAAAGTCCTTCCCGAACAATCACACGCTGGTCAGGCATGCCAAAGTCCACGACCCGGACTTCAAGCCCATCAAGCACCAGTGCGAGTTCTGCGGGAAGATATTCGCCTACAAAAACTCCCTGATGGTCCACGTAAAGTCCCACACTGGAGAGAACAAGTACGACTGTCATCTCTGCGGAAAGTCCGTGTCCTCCAGAGGCTCGCTCCAGGACCACTTGCGGCTCCACGGGGGAGAGAAGTCTTTAGTCTGCGATGTTTGCGGCAAGGCTTTTCACAAAAGGACCACCCTGGTGGTCCACAAGCGCACGCACACTGGAGAAAAACCCTACAGCTGTGACACCTGCGGGAAGTCTTTTACTCAACACTCGACCCTGGTCATTCACAAACGCTACCACACTGGGCAAAGACCGTATCAGTGCTCGGTTTGCACTAAATCATTTGTCTCCAGGGCGCTGCTTAATGTCCACAGCAAAGCTCACAATGTTGAACCTACATTCACTCAGCCTCCTAGCTGCAAAGTGTCTTCGTATTACCCGATTACAAATGAAGATCTAAGACTGACAAATGACCCAGCTCAACAAGAGGAAGATAAGATACTCGGCGTAGACCCTGGGTACCCCCTGAACTCGGAGGTCTGCTATAAATCAAAGCTGCGAGAAGACAGCGAGGATGCTAATCAGGAAGACATGCCTTTGGCGTACTACTGCAAGCCCTGCGCGAGGTTCTTTGCCTCTCAGTCGGTCTTTGAGAAGCACAAAGCAGAGTTTCACGGGAGAAGACCCACGAGGAACGCCTGCGAGGAGTGCGGGCGGGTCTTCAGGACCTTGTCGAGTCTTCGTAATCACATGAAGAGCCACCGGGGGAAGATTGAGTCTGACCTGGAGGCCAGCGACAAGGAAATCGACGGGAAAATCGAGGGAGAGACCCGGACCAGGCGGGAATTGGTCTGCAATATTTGCAGCAAGGTCTTCCGGCACAGGAGCAACTTCAAGAAGCACCTGGTGAGACACACCACCGGAGACCTGACTTGCAAGCACTGTCCGAAAAAATTCAGGCTGTATCGCGACTTGACTCGTCATGAAAAGACTCACTTTCTGCCCAGCTACATGTGCAAGGAGTGCGACTACGAGACCACTGTCCTGGCCGCTCTGACGGTCCACATGCTCAAGCACACCGACAACGCCGGGCTGCCCTACAAGTGTAACGACTGCGATAAGCACTTCCGGAAGGCCAGCGACCTCCAGGAGCACTACAATATCCACTCCGGGGACAAGCCGTTCGGCTGCGAGCAGTGCGGCAGCTCCTTTTACCTCCGGCGCCAGCTCTCCGCTCACTGCCGGCGCCTTCATCCAGAAATCAAGGCCAACAAAGTCACCAGCACTACTTGTGACATCTGCGGCAGGGAGGTAGACGCAGAGGGTAACATAATAGGTGAGAGTATTACAGAGGAAGAGCAGGCCCAGGAAGTCCCGCAGTACATCGAGTTTGACAGCATAGACTCCCAGGTGATTGAGTACGCTACTGAGAACGAGGAAGACGTAGATGACGTTGGTGCTGTAGAGAAATTCCAAGAAGCTCCTCAGGTGGCGACGATAGTCCCCGTCGCTGTTGCTACTAAGAACCACCGGCAGACCAGGTACCGGCCGATCTTACCGAAGAGCACGCCTACGTTTAGGTGTACGCTCTGCTCGGAGACCTTTCCCTCGAAGATCGCTTTTAGGAAACACGTCGCTTGGACCCACAAGAAGAAGGTCTGCATTCAGGAGGACGGAGCCTACGTCTGCGCGGTTTGTGATTATCGGACGCTCAAGAAGAACCTCTTCGCGGCGCATTTGGAGCGGAAGCACGAGACCTGGCCCAAGAAGGGCTCAAGCAATGTCGACTTTCCCTGCGTGGTCTGCGGATTTAAGTGTCGCTCCAAGCACTCGCTCCAGTCACATTTCATTCGCAAGCACACCGATACTTTCGAGCACCAGTGCAAGTTCTGTCCCAAGCAGTTCAAGGTCAAGGGCGACCTGACCAACCACGTGAGGTTCCATCACAAAGAGAAGCCGGTTAATTGTACTGTTTGCGGTAAACTCTGTCAGAACAGCGGCTCGCTGTATGTCCATCAGAAGTGGGCCCACTTCAAGCCCAAGTTCGAGTGCAAGATCTGCAAGCGGCGCATGGTCACTCAGGAAAATCTTGAGCAGCACATGCTCACGCAGCACGAGAAGCGGGAGAAGATCGTTTGCGCGGAATGCGGAAAGACTTTTACTAAAAAGGATTCATTTAAGCGGCACATGGCCGTGCATACTGGCAGCAAACCCCACAACTGCCCAATTTGCAATAAAGCCTTTGCCAGACAATCGCAATTGCGCCAGCATGTACTTATCCACACCGGCAAACGGCCTTTTGTGTGTGATATTTGCGGCAAGGCGTTCACACAGAAGCCCGGACTTATTTGCCACCGCAAGACTCACCCCGGATCGCATCCCCCGCTCCCAGTTATGCCTATTGCGGACCTTGGAATTCCCGGTGATCCAAGTACGGACTCGTTCATCTGTACGCACTCCAAGAGATACATAAAAAATTTAAACTGTAAATTGCACAAATGTCGCCACTGCTCGTTCAGTTCCTTGGACTTCCAGCGAATCGTTTTGCATGAAAAATACAAACACAGTGAATTATCAAAGCAGTTTAGATGTCGCAAGTGTTACAAGAACAAAATGACGCACTTGGAGTACCAGAGCCACTTTTGCAAGCCGCAAGACTTGAAAAGAAAACAAGTTTTGAAGAAAGAAGCCGAAATAAACGGGGAATTTTTGCTCTGCGAATATTGCGGGTTTTCTACAAAGAAGAAGAAGAAGAAGACTCTGCAGTACCACATCCAAGCCCACCACACCGAGCACAAGTTCCACTGCACACTCTGCAAGAAGAAATTCGTCACTGCGCTGGGGTTGAAGAGACACTTGAACAACCACGTGGACTCGATTAGCCGCGTGGATTTCTTGTGCGACTACTGCGGAAGGGGCTTTAGACATAAGGATAATTTAAGAAATCATATCATGGGAGTTCATATGCACATCAAGGACTCTGTCTGCTCGATCTGCAGCAAATCATTCGCCAAGAGAAACAGCCTCCGGCAGCATCTTCTCCTCCACTCGGGCAAGCGACCTTATACTTGTGATATTTGCGGGAAAACTTTCGTTCAGAAGCCCGCCCTGACTAGTCACAGGAAAACTCATCCTGGAAAACTCCCTCCCATGCCCGTGGTCTTCATTGACTCCTATATTAAAGAGGTTGACCCGGCCATGCGAAGAAAAAAAAATAAAACGCCGCACAAGTGTGTCAAGTGCAGCGCGTCTTTTTTCCACATGACAAAGCTGGTGGAGCACCTTAAGAATACTCACGGAATCGAGAACGCCTTCAGCTGCGACGAGTGCGGAAAAGCGTTCAAGAGCCCGATGAATATCACGCGACACAAGTTGATCCACTCGGGGACGAAAAAGTTCTCCTGCGACGTCTGCGAGTACACCTGCAATCAAAAAACAAACTTGGAGACGCACAAGCGACGGCACGCCAAGGATTTTCTCTTCAAGTGCAGCATCTGCAAGAAGGGATTCATCTCCAAGGCGGACTTTGCGGAACACGCGAACGTCCACACGAATTTCAGGTTCCGCTGCGACGCCTGCGACAAAAATTACTTCTACAAGAGGAACTTGATCGCTCATCTGAGACTTTGTCACTACGAGGTCGAGAAGAAGAAGGACAAGTCCTCGTTCAACTGTAACGAGTGCGAGCTTGCCTTTACCACTCTTGGAACCCTCAGGAACCACATTCGGAACCGCCACCGCGTCAAGAAGCGCCACCAGTACCTCTGTGACCTTTGCGGCGCAAATGTCTCTTCGAAAAAAGCTCTTGATATTCACACTCGCACTCACACTGGGGAGAGAGACGTCGCCTGTGATTTATGCCCCAAGAAGTTCACCACCAAGGAGAACTACAAAATCCACCGGAGAACTCACACTGGGGAAAAACCCTACTCTTGTCACGAGTGCGGGAAACATTTCACCCAGAGGTCTTCTCTCGTGATTCATCTGAGGTATCACACCGGCGACAGACCTTACTGGTGTCCTGATTGCGGGAAAGGGTTTGTTACTAATCATTTTCTTAAAAAACATCGCAAGGGCCATGAAAATACTAAAGTTGCTCCTCATTTTGAAGAACAACCTCGCAGGTCGAAGTCTAAA
- Protein Sequence
- YIGLGPSYAYAKYATVEQIPITAIKSDTIPAKSKPKPKPKPKPKPVKLKLDPEKVKSEPEKNKEKPKSGKRYKCNICGKESWNFHQWERHYRLDHGKPKAEKCGQCDRIFYDDEKLARHVVKMHQEERPFPCHICNKPFKTDKFLRIHLKQHNKRYCCEICGMSKVSPYDLRLHMMKHNKEYYAHCEYCQRGFYTKQSLQRHILTHTGERPFVCRICNASYASASYLSLHAKSHTQREKFKCNLCDYETFWKTSLKGHHGSLNAHMKGHKNMLEKVEAIDFEELEEELFHSEIKPELLINSTKSPGIQKKLKEEEEKDYVCKICNKQFRLKNLFEGHLVAHSDLRPYQCDICNKSFKRTSTLAVHRRIHTRERNFICDVCGRAFIQASQLAMHQRRHFEKYICSCELCGKGFFTNAELHGHMNIKHGAKEHVCHHCGKSFPNNHTLVRHAKVHDPDFKPIKHQCEFCGKIFAYKNSLMVHVKSHTGENKYDCHLCGKSVSSRGSLQDHLRLHGGEKSLVCDVCGKAFHKRTTLVVHKRTHTGEKPYSCDTCGKSFTQHSTLVIHKRYHTGQRPYQCSVCTKSFVSRALLNVHSKAHNVEPTFTQPPSCKVSSYYPITNEDLRLTNDPAQQEEDKILGVDPGYPLNSEVCYKSKLREDSEDANQEDMPLAYYCKPCARFFASQSVFEKHKAEFHGRRPTRNACEECGRVFRTLSSLRNHMKSHRGKIESDLEASDKEIDGKIEGETRTRRELVCNICSKVFRHRSNFKKHLVRHTTGDLTCKHCPKKFRLYRDLTRHEKTHFLPSYMCKECDYETTVLAALTVHMLKHTDNAGLPYKCNDCDKHFRKASDLQEHYNIHSGDKPFGCEQCGSSFYLRRQLSAHCRRLHPEIKANKVTSTTCDICGREVDAEGNIIGESITEEEQAQEVPQYIEFDSIDSQVIEYATENEEDVDDVGAVEKFQEAPQVATIVPVAVATKNHRQTRYRPILPKSTPTFRCTLCSETFPSKIAFRKHVAWTHKKKVCIQEDGAYVCAVCDYRTLKKNLFAAHLERKHETWPKKGSSNVDFPCVVCGFKCRSKHSLQSHFIRKHTDTFEHQCKFCPKQFKVKGDLTNHVRFHHKEKPVNCTVCGKLCQNSGSLYVHQKWAHFKPKFECKICKRRMVTQENLEQHMLTQHEKREKIVCAECGKTFTKKDSFKRHMAVHTGSKPHNCPICNKAFARQSQLRQHVLIHTGKRPFVCDICGKAFTQKPGLICHRKTHPGSHPPLPVMPIADLGIPGDPSTDSFICTHSKRYIKNLNCKLHKCRHCSFSSLDFQRIVLHEKYKHSELSKQFRCRKCYKNKMTHLEYQSHFCKPQDLKRKQVLKKEAEINGEFLLCEYCGFSTKKKKKKTLQYHIQAHHTEHKFHCTLCKKKFVTALGLKRHLNNHVDSISRVDFLCDYCGRGFRHKDNLRNHIMGVHMHIKDSVCSICSKSFAKRNSLRQHLLLHSGKRPYTCDICGKTFVQKPALTSHRKTHPGKLPPMPVVFIDSYIKEVDPAMRRKKNKTPHKCVKCSASFFHMTKLVEHLKNTHGIENAFSCDECGKAFKSPMNITRHKLIHSGTKKFSCDVCEYTCNQKTNLETHKRRHAKDFLFKCSICKKGFISKADFAEHANVHTNFRFRCDACDKNYFYKRNLIAHLRLCHYEVEKKKDKSSFNCNECELAFTTLGTLRNHIRNRHRVKKRHQYLCDLCGANVSSKKALDIHTRTHTGERDVACDLCPKKFTTKENYKIHRRTHTGEKPYSCHECGKHFTQRSSLVIHLRYHTGDRPYWCPDCGKGFVTNHFLKKHRKGHENTKVAPHFEEQPRRSKSK
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -