Pinq024102.1
Basic Information
- Insect
- Prosopocoilus inquinatus
- Gene Symbol
- -
- Assembly
- GCA_036172665.1
- Location
- CM069876.1:45051499-45070524[-]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 63 0.04 2.5 9.7 2.5 1 23 14 36 14 36 0.98 2 63 3.4 2.1e+02 3.6 0.2 1 12 42 53 42 61 0.78 3 63 0.0083 0.52 11.8 6.1 1 23 67 89 67 89 0.98 4 63 0.019 1.2 10.7 4.7 1 23 95 117 95 117 0.98 5 63 1 64 5.2 1.6 3 23 125 145 123 145 0.96 6 63 7.7 4.8e+02 2.5 5.9 2 23 152 173 151 173 0.96 7 63 0.11 7 8.3 2.0 1 23 179 201 179 201 0.98 8 63 2.4 1.5e+02 4.1 0.2 1 12 207 218 207 226 0.78 9 63 0.00015 0.0097 17.3 3.0 1 23 232 254 232 254 0.98 10 63 0.05 3.2 9.4 2.1 1 23 260 282 260 282 0.98 11 63 0.87 55 5.5 1.6 2 23 288 309 287 309 0.94 12 63 0.0006 0.037 15.4 4.5 1 23 315 337 315 337 0.98 13 63 0.00013 0.0079 17.5 1.6 1 23 343 365 343 365 0.98 14 63 0.0054 0.34 12.4 4.0 1 23 371 393 371 393 0.98 15 63 0.25 16 7.2 7.7 2 23 400 421 399 421 0.96 16 63 0.00024 0.015 16.7 0.8 2 23 454 475 453 475 0.97 17 63 0.0042 0.26 12.8 2.2 1 23 481 503 481 503 0.97 18 63 0.023 1.4 10.4 1.0 1 23 509 531 509 531 0.96 19 63 8.2 5.1e+02 2.4 0.1 1 11 537 547 537 556 0.81 20 63 0.35 22 6.7 3.6 1 23 562 584 562 584 0.97 21 63 0.0055 0.34 12.4 1.8 2 23 591 612 590 612 0.96 22 63 2.9e-05 0.0018 19.6 0.3 1 23 618 640 618 640 0.96 23 63 0.00056 0.035 15.5 3.5 1 23 646 668 646 668 0.97 24 63 0.0011 0.069 14.6 2.3 1 23 674 696 674 696 0.97 25 63 0.36 22 6.7 6.6 1 23 708 730 708 730 0.97 26 63 0.078 4.9 8.8 4.0 1 23 736 758 736 758 0.96 27 63 0.055 3.5 9.2 2.6 3 23 765 785 763 785 0.98 28 63 0.012 0.74 11.3 1.1 3 23 793 813 791 813 0.97 29 63 0.013 0.8 11.2 1.4 1 23 819 841 819 841 0.97 30 63 0.00021 0.013 16.9 0.3 2 23 858 879 857 879 0.96 31 63 0.00039 0.024 16.0 2.1 1 23 885 907 885 907 0.97 32 63 0.051 3.2 9.3 1.0 1 23 913 932 913 932 0.97 33 63 0.048 3 9.4 2.3 1 23 938 960 938 960 0.97 34 63 0.0026 0.16 13.4 2.7 3 23 968 988 966 988 0.98 35 63 0.0039 0.24 12.9 1.4 3 23 996 1016 994 1016 0.97 36 63 0.033 2.1 9.9 1.4 2 23 1022 1043 1021 1043 0.96 37 63 0.64 40 5.9 0.5 1 23 1049 1071 1049 1071 0.91 38 63 2e-05 0.0012 20.1 1.7 1 23 1077 1099 1077 1099 0.98 39 63 0.018 1.1 10.8 1.4 1 23 1105 1127 1105 1127 0.98 40 63 0.87 54 5.5 6.0 1 23 1133 1155 1133 1155 0.98 41 63 0.0088 0.55 11.7 1.5 2 23 1162 1183 1161 1183 0.96 42 63 4.9 3e+02 3.1 0.6 1 15 1189 1200 1189 1208 0.74 43 63 0.0081 0.5 11.9 2.5 3 23 1216 1236 1214 1236 0.98 44 63 0.00084 0.053 14.9 1.2 1 23 1242 1264 1242 1264 0.97 45 63 0.32 20 6.9 1.8 2 23 1270 1291 1269 1291 0.96 46 63 0.0069 0.43 12.1 4.0 1 23 1297 1319 1297 1319 0.98 47 63 0.0026 0.16 13.4 3.8 1 21 1325 1345 1325 1346 0.95 48 63 0.00023 0.014 16.7 2.3 2 23 1360 1381 1359 1381 0.97 49 63 0.0028 0.18 13.3 2.5 1 23 1387 1409 1387 1409 0.97 50 63 0.0016 0.097 14.1 3.1 1 23 1415 1437 1415 1437 0.97 51 63 1.2 73 5.1 0.4 1 23 1443 1462 1443 1462 0.86 52 63 0.028 1.7 10.2 6.2 1 23 1468 1490 1468 1490 0.98 53 63 0.0014 0.09 14.2 1.7 1 23 1496 1518 1496 1518 0.98 54 63 2.7 1.7e+02 3.9 3.1 3 23 1526 1546 1524 1546 0.97 55 63 0.17 11 7.7 2.6 1 23 1552 1574 1552 1574 0.98 56 63 0.062 3.8 9.1 2.2 3 23 1608 1628 1606 1628 0.98 57 63 0.0022 0.14 13.6 2.0 1 23 1634 1656 1634 1656 0.98 58 63 0.0015 0.094 14.2 3.7 3 23 1664 1684 1664 1684 0.99 59 63 0.23 15 7.3 1.0 2 23 1690 1711 1689 1711 0.95 60 63 0.0095 0.6 11.6 4.1 1 23 1717 1739 1717 1739 0.98 61 63 0.00026 0.016 16.6 0.1 1 23 1745 1767 1745 1767 0.98 62 63 0.014 0.85 11.2 0.4 3 23 1775 1795 1773 1795 0.97 63 63 0.011 0.72 11.4 6.4 2 23 1802 1823 1801 1823 0.96
Sequence Information
- Coding Sequence
- ATGAATCTGCACAAATTACGACACGCCAATGAAAAGCGATTCCGCTGTACCCTTTGCGAATACAAATGCTTCGGAGCAGCCCAATTAAAACGGCATGTCTCAACGCACGACAGCGAAAAGCCGTTCACCTGCGGAATTTGCAACAAGAAAGTCAGAAACTTGAGACCCCAcatgttaatacacaccggcgaaaagACGTTCAGTTGTaatctctgcgattacaagtgtcgGCACGGCTCAAAGTTGAAACGTCACATGTcaatacacaccggcgaaaagccATTcacttgcgatctttgcgactACAAGTGCGGAGAGACCTCGACGTTAAAACACCACATgttaacacacaccggcgaaaagccgttcggttgcgatctttgcgatttcaGGTGTCGGGGAAACGCAACGCTGAAACAGCACAACTTAATACACACCGGTGAAAAGCCACTGAGTTGCGATCTTTGCAATTTCAAATGCCGACACGTCCAAAGTTTGAAACTGCACAAGTTAAGGCACGCCGACGAGAAGCGGTTCCGTTGTACTCTTTGCGAATACAAATGCCTCAAAGCATCAGAATTAGAACGGCACGTGTTAACGCACGACGACGAAAAGCCGTTCACCTGCGGAATTTGCGATAAGAAAGTCAGAAACTTGAGACCCCAcatgttaatacacaccggcgaaaagccgTTTAGCTGcggtctttgcgattacaagtgtcgAGGCAAGTCGAATTTGAAACGGCACATGTCAGTACACACCGGCGgaaagccgttcagttgcgatctttgTAATTACGAGTGTCGAGGCATGTCAGTCTTGAAACAGCACATGTCGACACACACCGGAAACTCGTTGAGTTGCTACgcttgcgattacaagtgcgatCGCTCCGACGCTATGAAGGCGCACAAATTAAGACATGCCGACGAGAAGCGATTCAGTTGTGCTCTTTGCGAATACAGATGCGTCCGATCGTCACATTTAAAACGGCACATATTAACACACAACAACGAAAAGCCGTTCACCTGCGGGGTTTGCGATAGGAAATTCAGAGAGCTAGCGCACTTGAGACGTCACACGTTAATACACACCGGGGAGAAGCCATTcacttgcgatctttgcgatttcaGTTGTCGAGAGACCTCGTCGTTGAAGAAACACTTGTtgaaacacaccggcgagaagccgttaagttgcgatctttgcgattataaatgccgacaccACCAAAGTTTGAAACTGCACATGTCAAGGCACACCGGAAAACCAACGACGGACTGTACCGGTCACACCGAAGCaattaatttgtgCAAACCGACTCCTAATTCTAAACTTGAAGACAAAGAGAAAGTCACGTGCCGCATATGTGATTCTAGGTTTACAACTAAAGCATATTTGAAGAAGCACTTGCTgatacataccggcgagaagcccTACATTTGCGacatttgcgattataaatgtcgagaaTCCTCCGGTCTAAAATCGCACAAATTAAggcacaccaacgagaagcccTTCCGTTGTACTCTTTGCGAATACAAAAGCTTCGCAGCGTCACAACTAAAACGGCACGTGTTAACGCACAACAAGGAAAAGCCGTTCACCTGCGGAATTTGCAACAAGAAAATCAGAAACTTGAGGCCCCACATGTTGCTGCACACCGACGAAAAGCCGTTCACTTgtaatctttgcgattacaagtgccgagACCGCACGATATTGAGACAACACACGTCAGtacacaccggcgaaaagccgATCAGTTGCGGtctctgcgattacaagtgcctcAGGGCATCGCAATTAAAACGGCATATGTTAACGCACAACAACGAAAAGCCGTTTAACTGTGAGATTTGCAATAAGAAATTCAGTCAACTCGGGAACTTAAGACTTCACGCGTTagtgcacaccggcgagaagccgtttagTTGCGGCCTTTGTGATTACAAGTGTCGAAGAAAATCGTGGTTGAAACGGCACacgttaatacacaccggcgagaagccgttcgcttgCGATTTTTGCGATTACATGTGTCGGGACAAGTCAATGTTGAAAAACCACGCGTTAACGCACGGCGAAAAGTCGTTACGTACCGGCGAGAAGCGTTTCAGTTGTGacgtttgcgattacaagtgcagGCGTTCCGACTGTATGAGGCTGCACAAATTAAGACATACCAACGAGAAGCGCTTCAGTTGCACTCTTTGCGAATACAAGAGTCTCCAGTCAACGCATTTAAAACGGCATATGTTGACAcacaacgagaagccgttcggttgcgaTCTCTGCGACTACAGGAGTCGAGAGACCTCGACGTTGAAACACCACATGTTGAAACACActggcgagaagccgttcggttgcgaAGTCTGCGATTACGTGTGCCGAGGGACCAGATCGTTGAAACGTCACATGTTAGGACACACAGGCGAAAAGCCtttcagttgcgatctttgcggTTTCAAGTGCCGAGAGGAATCCATgctgaaacaacacgtgttaattcatttGTACAAGCGGACTCTTGAATCTAAACTCGAAGGGAAAGGACATGTCGTGTGCCGCATATGTGATTCCACATTCAGAACTAAAGCGTACTTGAAAAGACACGTGCTgatacataccggcgagaaacccttcaactgtgatctctgcgattacaaatgcatCCGAGCATCGCAGCTAAAACGCCACGCGTTAACACACAACGTCGAAAAGCCGTACACCTGCGGAATCTGCGACAAGAAAGTCAGAGACTTGAAGCGCCACatgttaacacacaccgacgaaaagCCATTcacttgcgatctttgcgattacaagtgtcgAGAAAACCCGATGCTGAAACAGCACaagttaatacacaccggcgaaaagccgTTTGGCTGCGATTTCTGCGACTACAAGTGCCGAGGCAGCTCAAATTTGAAACGGCACGTgttaacacacaccggcgaaaagccgTTCGGTTGCGACCTATGCGACTACAAGTGTCGAGACAAGCCGATGTTGAAACAGCACATGCTAATACACACCGGAAACTCGTTGAGTTGTGACgtctgcgattacaagtgcgaACGTCCCGACGCCATGAAAACGCACAAGTTAAGACACGCCAACGAGAAGCGGTTCAGCTGCGCACTTTGCGAATACAAGAGTCTCGAATCGACGGCATTAAAACGTCACATGTTAACCCACAACGACGAAAAGCCGTTCACCTGTGGGATCTGCGATTATAAGTTCAGGCAACTCGAACACCTGAGACGCCACGTgttgatacacaccggcgagaagccgtacagttgcgacctttgcgatttcAGGTGTCGAGAGAACGCGACGctgaagcaacacgtgttaatacacaccggcgacaagccgttcagttgtgacctctgcgatttCAAGTGCCGGCACGTTCAAAGCTTGAAACTGCACAGGTTGAGGCACACCAACGAAAAGCGATTGCGCTGTGCTCTTTGCGAATACAGGTGCGTCAGGGCGTCGCAGTTAAAACAGCACGTGTTGAAGCACAGCGACGAAAATCCGTTCACCTGCGGAATTTGCGATAAGAAAGTCAAATACTTGAGGCCCCACATGTTGACACACAAcagcgagaaaccgttcggttgcgatctttgcgattacaagtgtcgAAACAACGCGCTTTTGAAGCAGCATAGgctaacgcacaccggcgaTAAACCGTTCAGCTGCGATCTTTGTGATTACGTGTGTCGAGACAGGTCAGTGTTGAAGCGGCACGTGTTAATACACACAGGAGAGTTGTCGAGTTGTGacgtttgcgattacaagtgcgaCAGTTCCGAAGCTATGAAGATGCACAAATTACGACATGCCGACGAGAAGCGATTCAGTTGTAACCTTTGCGAATACAGATGTCTCCAGTCATCGCAATTAAAACGGCACATGTTGACGCACAACAACGAAAAGCCGTTCACCTGTGGGGTTTGCTACAAGAAATTCAGAGAGCTCGCGCACTTGCGACGGCAcatgtgCAAAATCTCTCCTGAATCTCAGCTTGAGGAGAAGGGGAAAGTCACGTGCCGCATATGCGATTCGAAATTCACAACCAAGGCACACTTGAAGAAACACTTGCTgatacataccggcgagaaaccgttcaacTGTAACCTCTGCGATTACAAGACTCGAGATTCCTCCACCCTGAAGACGCACAAAATAAggcacaccaacgagaagcgATTCCGTTGCGCCCTCTGCGAATACAAATGCTTCAGGCCGTCCCTATTAAAACGGCACATGATAAGgcacaccggcgaaaagccgTTCGCCTGCGGCATCTGCAACAAGAAAATCCGAAACCTGAAAGCCCACATGCTAATACACACCGACGAAaggccgttcagttgtgatctctgcgatcaCAAGTGCCGAACCGGCTCAAAATTGAAACAGCACATGACgacgcacaccggcgaaaagccgttcacttgcgatctttgcgattcgAGGTGCCGAGATAACGCAACGCTGAAACAACACATgttaatacacaccgacgaaaaGCCGTTCGGTTGCGACCTGTGCGATTTCAAAAGCCGACACCTCCAAAGTTTGAAACTGCACAAGTTAAGGCACACCAGCGAAAAGCGGTTCCGCTGTACTATCTGCGAATACAGATGTCTCAGAGCGACAGAATTAAAACAGCACGTGTTAAAGCACGATGACGGAAAACCGTTCAAAACATGCGCAATTTGCGATAAGAAAGTCAAACACCTGAGACCTCACATGTTAATACACACCGAAGAAAAGCCGTTCggctgcgatctttgcgattacaagtgtcgCAGCGGCTCGAAGTTGAAGCAGCACGTgctaacgcacaccggcgaaaagccgTTCACCTGCGATCTTTGTGATTACGAGTGTCGAGGCAGGTCAATGTTGAAACGGCACGCGCTAACGCACACCGGCCAAAAGCCCGTCGGTTGCGGACTTTGTGATTACAAATGTCGAGACAGGTCGAAGCTGAACCGGCACATGTTGACACACACCGGAAATTCGTTGAGTTGTGACgcttgcgattacaagtgcgtGCGTCCTGATGCCATGAGGACGCACAAATTAAGACATACCAACGAGAAGCGATTCAGTTGTACTCTTTGCGAATACCGATGTTTGCAATCGTCGCAATTAAAACGGCACATGTTGACACACAACGACGAAAAGCCGTTCACCTGCGCGGTTTGCGATAAGAAGTTCAAGGAACCGGCGCCCTTGAGACGGCACGTGTTAATACATACCGACGAGAAGCCATTCggctgcgatctttgcgatttcaGGAGCCGAGATAACTCGTCGCTGAAACAGCACgtgttaatacacaccggcgagaagccgttaaGTTGCAGTATTTGCGATTATAGATGCCGACACCGCCGAAGTTTGAAGATGCACATGTCAAGGCACGCCGGAAAATCAACGACGCGCTCTGCCGCCCACGAAAGCAATTAA
- Protein Sequence
- MNLHKLRHANEKRFRCTLCEYKCFGAAQLKRHVSTHDSEKPFTCGICNKKVRNLRPHMLIHTGEKTFSCNLCDYKCRHGSKLKRHMSIHTGEKPFTCDLCDYKCGETSTLKHHMLTHTGEKPFGCDLCDFRCRGNATLKQHNLIHTGEKPLSCDLCNFKCRHVQSLKLHKLRHADEKRFRCTLCEYKCLKASELERHVLTHDDEKPFTCGICDKKVRNLRPHMLIHTGEKPFSCGLCDYKCRGKSNLKRHMSVHTGGKPFSCDLCNYECRGMSVLKQHMSTHTGNSLSCYACDYKCDRSDAMKAHKLRHADEKRFSCALCEYRCVRSSHLKRHILTHNNEKPFTCGVCDRKFRELAHLRRHTLIHTGEKPFTCDLCDFSCRETSSLKKHLLKHTGEKPLSCDLCDYKCRHHQSLKLHMSRHTGKPTTDCTGHTEAINLCKPTPNSKLEDKEKVTCRICDSRFTTKAYLKKHLLIHTGEKPYICDICDYKCRESSGLKSHKLRHTNEKPFRCTLCEYKSFAASQLKRHVLTHNKEKPFTCGICNKKIRNLRPHMLLHTDEKPFTCNLCDYKCRDRTILRQHTSVHTGEKPISCGLCDYKCLRASQLKRHMLTHNNEKPFNCEICNKKFSQLGNLRLHALVHTGEKPFSCGLCDYKCRRKSWLKRHTLIHTGEKPFACDFCDYMCRDKSMLKNHALTHGEKSLRTGEKRFSCDVCDYKCRRSDCMRLHKLRHTNEKRFSCTLCEYKSLQSTHLKRHMLTHNEKPFGCDLCDYRSRETSTLKHHMLKHTGEKPFGCEVCDYVCRGTRSLKRHMLGHTGEKPFSCDLCGFKCREESMLKQHVLIHLYKRTLESKLEGKGHVVCRICDSTFRTKAYLKRHVLIHTGEKPFNCDLCDYKCIRASQLKRHALTHNVEKPYTCGICDKKVRDLKRHMLTHTDEKPFTCDLCDYKCRENPMLKQHKLIHTGEKPFGCDFCDYKCRGSSNLKRHVLTHTGEKPFGCDLCDYKCRDKPMLKQHMLIHTGNSLSCDVCDYKCERPDAMKTHKLRHANEKRFSCALCEYKSLESTALKRHMLTHNDEKPFTCGICDYKFRQLEHLRRHVLIHTGEKPYSCDLCDFRCRENATLKQHVLIHTGDKPFSCDLCDFKCRHVQSLKLHRLRHTNEKRLRCALCEYRCVRASQLKQHVLKHSDENPFTCGICDKKVKYLRPHMLTHNSEKPFGCDLCDYKCRNNALLKQHRLTHTGDKPFSCDLCDYVCRDRSVLKRHVLIHTGELSSCDVCDYKCDSSEAMKMHKLRHADEKRFSCNLCEYRCLQSSQLKRHMLTHNNEKPFTCGVCYKKFRELAHLRRHMCKISPESQLEEKGKVTCRICDSKFTTKAHLKKHLLIHTGEKPFNCNLCDYKTRDSSTLKTHKIRHTNEKRFRCALCEYKCFRPSLLKRHMIRHTGEKPFACGICNKKIRNLKAHMLIHTDERPFSCDLCDHKCRTGSKLKQHMTTHTGEKPFTCDLCDSRCRDNATLKQHMLIHTDEKPFGCDLCDFKSRHLQSLKLHKLRHTSEKRFRCTICEYRCLRATELKQHVLKHDDGKPFKTCAICDKKVKHLRPHMLIHTEEKPFGCDLCDYKCRSGSKLKQHVLTHTGEKPFTCDLCDYECRGRSMLKRHALTHTGQKPVGCGLCDYKCRDRSKLNRHMLTHTGNSLSCDACDYKCVRPDAMRTHKLRHTNEKRFSCTLCEYRCLQSSQLKRHMLTHNDEKPFTCAVCDKKFKEPAPLRRHVLIHTDEKPFGCDLCDFRSRDNSSLKQHVLIHTGEKPLSCSICDYRCRHRRSLKMHMSRHAGKSTTRSAAHESN
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01258844;
- 90% Identity
- iTF_01258844;
- 80% Identity
- iTF_01258844;