Psol005404.1
Basic Information
- Insect
- Phenacoccus solenopsis
- Gene Symbol
- -
- Assembly
- GCA_009761765.1
- Location
- chr3:24081027-24097166[-]
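The Location value packs the chromosome, start, end, and strand into one string. A minimal sketch of how it could be split apart is shown here; `parse_location` is a hypothetical helper, not part of the database, and the span calculation assumes 1-based inclusive coordinates, which the record does not state.

```python
import re

# Hypothetical helper: split "chrom:start-end[strand]" into its parts.
LOCATION_RE = re.compile(
    r"^(?P<chrom>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text):
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start = int(match.group("start"))
    end = int(match.group("end"))
    return {
        "chrom": match.group("chrom"),
        "start": start,
        "end": end,
        "strand": match.group("strand"),
        # Assumption: coordinates are 1-based and inclusive.
        "span": end - start + 1,
    }


loc = parse_location("chr3:24081027-24097166[-]")
print(loc)
```

Under that coordinate assumption the gene spans 16,140 bp on the minus strand of chr3.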
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The zinc finger is described by the pattern #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and the numbers give the number of residues (ranges are shown in brackets). The positions marked # are important for the stable fold of the zinc finger, and the final position can be either histidine or cysteine. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined as the asymmetric hexanucleotide core GGGCGG, but this definition excludes, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
#   of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1   39  5.8e-06   0.00039   20.1   4.9   1         23      121       144     121       144     0.97
2   39  0.48      33        4.6    0.2   1         23      162       185     162       185     0.93
3   39  0.13      8.5       6.5    4.5   1         23      249       272     249       272     0.93
4   39  0.025     1.7       8.7    1.4   1         20      290       309     290       311     0.93
5   39  0.0044    0.3       11.1   2.1   1         23      325       348     325       348     0.98
6   39  0.0012    0.082     12.8   2.9   1         23      363       386     363       386     0.98
7   39  3.5e-05   0.0024    17.7   1.1   1         23      394       417     394       417     0.92
8   39  0.14      9.2       6.4    2.6   1         23      443       465     443       465     0.97
9   39  9.1e-05   0.0061    16.4   1.8   1         23      471       494     471       494     0.97
10  39  0.0017    0.12      12.3   3.5   1         23      500       522     500       522     0.98
11  39  0.00083   0.056     13.3   0.2   1         23      528       550     528       550     0.96
12  39  2.4e-05   0.0016    18.2   5.2   1         23      556       578     556       578     0.98
13  39  0.00027   0.018     14.9   3.6   1         23      584       607     584       607     0.97
14  39  0.00092   0.062     13.2   3.4   1         23      613       635     613       635     0.97
15  39  1.3       89        3.3    12.0  1         23      744       767     744       767     0.97
16  39  1.6e-05   0.0011    18.7   1.9   2         21      783       802     782       805     0.89
17  39  0.0006    0.04      13.8   0.2   1         23      820       843     820       843     0.94
18  39  0.89      60        3.8    1.2   3         21      864       882     863       883     0.91
19  39  9.3       6.3e+02   0.6    6.0   1         23      902       925     902       925     0.90
20  39  0.0048    0.32      10.9   3.9   1         23      976       999     976       999     0.96
21  39  0.019     1.3       9.1    0.2   2         23      1022      1044    1021      1044    0.96
22  39  0.36      25        5.0    0.1   6         23      1057      1075    1057      1075    0.89
23  39  4.5       3e+02     1.6    1.6   1         23      1101      1123    1101      1123    0.91
24  39  2.2e-05   0.0015    18.3   1.7   1         23      1129      1152    1129      1152    0.97
25  39  0.064     4.3       7.4    7.6   1         23      1158      1180    1158      1180    0.97
26  39  0.0013    0.09      12.7   0.4   1         23      1186      1208    1186      1208    0.95
27  39  7.3e-06   0.00049   19.8   2.1   1         23      1214      1236    1214      1236    0.97
28  39  0.00057   0.038     13.9   3.0   1         23      1242      1265    1242      1265    0.97
29  39  0.00031   0.021     14.7   3.5   1         23      1271      1293    1271      1293    0.98
30  39  6.5e-05   0.0044    16.8   1.9   1         23      1313      1336    1313      1336    0.97
31  39  0.0047    0.32      11.0   1.1   1         23      1355      1377    1355      1377    0.96
32  39  0.6       41        4.3    5.5   2         23      1384      1405    1383      1405    0.97
33  39  0.0017    0.11      12.4   1.1   1         23      1409      1432    1409      1432    0.93
34  39  1.5e-06   0.0001    22.0   0.9   1         23      1438      1460    1438      1460    0.97
35  39  1.6       1.1e+02   3.0    2.1   1         11      1466      1476    1466      1480    0.85
36  39  0.071     4.8       7.3    3.4   1         23      1514      1536    1514      1537    0.95
37  39  7.3       4.9e+02   0.9    6.7   2         23      1544      1566    1543      1566    0.94
38  39  1.1e-06   7.7e-05   22.4   0.2   1         23      1570      1593    1570      1593    0.95
39  39  3.2e-06   0.00021   21.0   2.9   1         23      1599      1621    1599      1621    0.98
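Each domain hit in the hmmscan output above consists of 13 whitespace-separated values (hit number, total hits, c-Evalue, i-Evalue, score, bias, hmm from/to, ali from/to, env from/to, acc). As a rough sketch, the values can be regrouped into records and filtered by independent E-value. `DomainHit`, `parse_domain_rows`, and the 0.01 cutoff are illustrative assumptions, not part of the database or of HMMER; the sample text is the first three hits of this record.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


FIELDS_PER_ROW = 13


def parse_domain_rows(text):
    """Group a whitespace-separated value stream into 13-field hits."""
    tokens = text.split()
    if len(tokens) % FIELDS_PER_ROW != 0:
        raise ValueError("token count is not a multiple of 13")
    hits = []
    for i in range(0, len(tokens), FIELDS_PER_ROW):
        t = tokens[i:i + FIELDS_PER_ROW]
        hits.append(DomainHit(
            int(t[0]), int(t[1]),
            float(t[2]), float(t[3]), float(t[4]), float(t[5]),
            *(int(x) for x in t[6:12]),
            float(t[12]),
        ))
    return hits


# First three hits from this record (header line omitted).
SAMPLE = (
    "1 39 5.8e-06 0.00039 20.1 4.9 1 23 121 144 121 144 0.97 "
    "2 39 0.48 33 4.6 0.2 1 23 162 185 162 185 0.93 "
    "3 39 0.13 8.5 6.5 4.5 1 23 249 272 249 272 0.93"
)

hits = parse_domain_rows(SAMPLE)

# Illustrative cutoff only; choose a threshold suited to the analysis.
I_EVALUE_CUTOFF = 0.01
significant = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]

for h in significant:
    print(h.index, h.ali_from, h.ali_to, h.i_evalue)
```

Because the parser only splits on whitespace, it accepts the values either as one long line or as one hit per line, provided the header is removed first.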
Sequence Information
- Coding Sequence
- ATGGAGTTAGACGATAGCGACGTGGAAAACGAAAGAAAACCGGCTAAGCGTAGCAGACGCGACAGCGAAAGCGACCAAGGGTTCGAAGCGCCGTTGGCCAAGAAGAAAAAAGAGGCTAGAAGTAAGAAGCAATCGATCGCCGCTGTGTCGGAGAACGACGACAGCTCCAGTAACGAGGTGAGTTTGGGGATACCCGAATTACAATTCGAAGTCATCGAAGAGATGATTGTGCCGGACAGTGTGACTAGGAAGAACGTTCAAGCGGTGGTATCCGATGCGACGGTGAAACGTCGTAAACCGAGGTTGAAGCGTTTGAGGCGAAATAATGGAGAGGACGACGACGATAAGAGTCAACTTATTTATACTTGCGCTCACTGCTTGCGTAATTTCCGGCGTAAGTCGAACCTGAAAAGACATATGTCCAAGTTGCATCTTGGTAAGGAAAAAGACGAAGGGCCTGATTCGTCGAAATGGTTCACCGGCTATTGTTGCGACGTATGCGGCGAAGGGTTACAGCTGCGGTCTACGATGATGGCGCATCGCGACGTAGTACACGGCAAGTACCCGGAAATGGATTGGTGCAAATACCAGACCTCTTTAGCTAGAGTGAAGTTCTGTGGTGAATGTAAAAAGTACTTCAGAGCTGGGAAACTGTACGACGAGCACAAGTGTGATAAAGGTACCAGTATTGTTGATCAAGAGCTGGTCAGATTGATGAAAGAAGCAGAAGAGGTGATGCCCACCTTTACCTGCCATATTTGCAATTTGTGTTTCCGTTGGCGATGGGACTACCGATCGCATCGTGAAAACGAGCATAAAGACGCACCACCTATGGACTGGATAGCGTTACAACCGATTAAGCCGGAGTACTTTTGCGAAAAGTGTTTCAAAGCGTACACTGAAGAGGACGAATTGCAAAATCATGAGTGTAGCGCGCAGCAACAGGCAGCCAACGACACGGTGGTGAAGCCATTCCGATGCGATTTGTGCGGAAACGATTATTTCTGGAAATCTGATTTCCGTCGGCATATGCGTACGAAGCATCCGAAAGAAAATTACAGACCGACTGAACACGATACCATCATATACTCGTGCCCATATTGCGAGGAGAAATTCTCGATGAAGAAGCGTATACTCCATCATATGCGTAAGGTGCACAGTATCGCATCTGATTCGCCTTTTGTATGCGTACAGTGTAATAAGGTGTTCAGACGACGCGATAATCTTGATCGGCACAATGAGTCGTATCATCCGGCCCTAAAGGACGAAGAAGAAGCGAATAAGATATTAGCCTCGGCCGAGATCAAGATAAACGGCGAGATCGCTTATCATTGTCAAATGTGTAATCGTAATATCACGAATCCAAATCGTTTCATATCCCATTACCGCGGTCACTATTCGGAAACGAAGTTCACGTGCGATTTGTGCGGTAAACAGGCGAAAACGCAGCATCAGCTGAATACGCACATAAAAAATATTCACTTGAATATACGTAACTACAAATGCGACATTTGTAGCAAGAGCTTTTACACGAAACAAGCGTGCGAAGAACATCGACGTATTCACACCGGCGAACGACCGTTTTCGTGCGAAATTTGCGGAAAAACGTTCGTAGCCGGGAACGCGTTGATATCGCACAAACGCTTTCACAACGATTTCTATCCGCACTCGTGTCATATGTGTCCGAAAAAATTCAAAGTACGGAGATCGCTTATTAACCACATACGAACGCACACCGGCGAACGGCCTTTCAAATGCGATTTGTGCTCGAAAACGTTCAACAACTCGTCGCAGTATTCGTACCACAAAAAGGTAACGCATTCGGACGCGCGACCCTTCACGTGCTCGTTATGCGGTAACTGCTTCAAAGCTAATAAGTTCTTGACAAGGCATATGGAGCTGCATACGGTACGTAGCCAAATACAGAGTCGTAAACCGAATAATCCGGCGCATTACGTTACGTCACAAATAAAAACCACTCAGGAACCGGTTGCCACC
ACAACGATGGTTACGCAGCAGCGTGTCAAAAATGTGGGTAACGCGTCTTCGTCTAACCTTAAAACATCGTCTAGTAAACCATCGTTGAATTATGTGCAATTGAAAGTCGGTACTCATTGTTTCATTTGTGGCGGTAAACTTGCTAAAGATAACGATTACAAAAGTCATCGTTGCCCACCGCATGATCAGATTGCGAAAGTGACTCGGGAAAGGAAACCGCACGATGATACCTTTTATTGTCATCACTGCGAATATGCTTGTAAGAGGAAGTATACTTTGAAATGCCATATACAAAGGTCTCACATGGATTTGAAAGGTCGTTCTTGTGATACTAGTAATTCGATGAAGACTTGCGCTATGTGCCAAAGGTCATTTTCGAGGAAATCGAACCTACGCAGGCACGTGTTTGATCAACATTTGGCTCGTGCCGGTGGACCGAAGTGTGTAATGAAAAAGGGTTATTGCTGTGATATTTGTGGTGAAGGGTTCAACGATCAAAGAGAAATGTTAGCTCATCGCGATGTCTTGCACGAGAATGCATCACCTATGTCGTGGGAGTCGTTTTGCGCTAGTATGAAAAAAGTGATGTTTTGCTACGCTTGCGGTAGATATTTCAGCTACGAAAAGGAATTCTCGCAGCATTTATGCGAGAGACCCGAGTACGCGGAGAAAAAAGTAATTTGTATATTGGAGTCAAAACCGTTGTACAAGTGTGACAGCTGCTCGCGCTGTTTTCATTGGAAATGGATGTACCGTTCACATCGTGAAAGCTCGCATCCACACGCTGAGCCGTTGGTATGGGATACGCTAGCCATGGATAACATAGAGCATTGCTGTTTAATTTGTTGCGAAAATTTTAATAATTCCGTTGATTTGCAACTCCATGAACCATATTGTATGAAGAAAGCTCAAAGTAAGCTGAAATCTTTCGTGTGTGACCTGTGCGGTAGCAGTTTCGTCCGCAAGTCGGATTGCAGAAGGCATAAACGAAATAAACACGCCGATGGTAAAGACAGAATATCCGATGAAGAAGGGTTTTCATCGTCGAGAAAAAAATGCGTCATCAGATGTCCTTATTGCGATATTTCCATGACCACCCGTAACAGTATATTGACTCATATCAAAAGCGTCCACAATATCGAGACCGATACGCCTTTTTTATGTATTCCATGTAATAAGGTTTTCAAACGTAAAGCGACGATGGATGCGCATAATGAAGTGTATCATCCGGAAAAGGAAGATACCGAAGAAACCAATAAGATACTGAAAGAATCTGAAATTTTACTCAATGGCGAAACGGCTTATCATTGCAATGTCTGTAATCGTAATGTTCTGAACTCTATACGTTTCTTAGCGCATTACAGGTTGCATTATGTAGAGCGTAAATTCACGTGCGATTTATGCGGCAAGCAGACGCGTACGCAACACCAGCTGAATATGCATATAAAGATTATCCACCTGAATATTCGAAACCACAAATGCGACATTTGCGATAAAACGTTCCACTCCAGGCAATCGTGTGAAGAACACAGGCGCATTCATACCGGAGAACGACCCTTTTCGTGCGAAATATGTGGTAAAACGTTTATCGCCATGAATGCCTTATTGACACACAAAAAGTTCCACAACGATTTTTATGCCCATCCTTGTTCCATGTGCCCGAAGAAATTCAAAGTTCGAAGATCGCTTATTAATCACATTCGCACGCACACAGGAGAAAGACCATTCAAATGCGAACTATGCCCGAAAACATTCAACAATTCTTCTAGGTTCTCGTATCATAAAAAAGTAACACATTCGGATAATCGTCCATTCACTTGTACTGAATGCGGTAGCTGCTTCAAAGCGAATAAGTTCTTAATTCGTCATATGAAATTGCACAAAATTCATAGTTTACAAAAGCATGCTGCAAATGCGCATAACGAACGTTTGGAACAGTTCGTGTGCGATTTATGTGATCGAACGTTTAATAGTAAGAATAAACTAGCGGGGCATATACG
TAGCTGTCACGTGGATATAACCGACTTGATAGAAAATCCAATTAAAAAACTAATTAACGGGAAATTTAAATGCCCTATTTGTAATAAAGAATTCCTGCGTAAGACAAATGCTAATCTGCACTTCCTATCGCATGTGACAGAAAAGAGCTTACAATGCGAGAAATGCCAGTTCAGATGTCATACGCAAGCCCAACTGGATGTGCATAAGAACAAACACGAGAAACGATTTTGTTGCGAAATATGTAGTAAGATGTTCGCGTATAAGTTTCAGTTGGATGTACACGTGCAAGGAGTGCATTATAATTTACGACCTTTCCCTTGTAACATATGTGGAAAAACGTTCAAAACTAGGTATAACTATGGATCGCACATGGCTCAGCACAAAGATATCAGGAATTTTCAATGTCCACATTGCCCTAGAAGGTGTCAACACGAACGATCACACGAGGACGTAGTTGATGATACCATAGAGCTGTCGCAAGAATCTAGTCATCTGCGAAAAAAGGCTGTATCTTACGTCGGTAAAATCGAATCCGGCTTGTTCGAGTGTACCATTTGTAAAGAAAAATTTCCACTCAAAAATACCGTCCGCGAACATTATTTTAAACATCATTGTGGATCTGAACAACAGCAGTGTCCGCATTGTGAGTTTGCGTGTTATTCGAAATGTGCATTGGATGTGCATTTAGTGAAATGCCACACCAAAAGGTACGCTTGCGAAATATGCGGTAAAATGTTCCCATATAAGTATCAGCTGAAGACACACATTAACGCGGTGCATTTAAATATTAGAAATCATACGTGCGACGTATGCGGGAAGAGTTTCAAAACGAGAGCAAACTTTGATACACATATGTCTCGCCATATGGACTCGCAGTTTGCGATCCTTATTGTTTTGGATGATTAA
- Protein Sequence
- MELDDSDVENERKPAKRSRRDSESDQGFEAPLAKKKKEARSKKQSIAAVSENDDSSSNEVSLGIPELQFEVIEEMIVPDSVTRKNVQAVVSDATVKRRKPRLKRLRRNNGEDDDDKSQLIYTCAHCLRNFRRKSNLKRHMSKLHLGKEKDEGPDSSKWFTGYCCDVCGEGLQLRSTMMAHRDVVHGKYPEMDWCKYQTSLARVKFCGECKKYFRAGKLYDEHKCDKGTSIVDQELVRLMKEAEEVMPTFTCHICNLCFRWRWDYRSHRENEHKDAPPMDWIALQPIKPEYFCEKCFKAYTEEDELQNHECSAQQQAANDTVVKPFRCDLCGNDYFWKSDFRRHMRTKHPKENYRPTEHDTIIYSCPYCEEKFSMKKRILHHMRKVHSIASDSPFVCVQCNKVFRRRDNLDRHNESYHPALKDEEEANKILASAEIKINGEIAYHCQMCNRNITNPNRFISHYRGHYSETKFTCDLCGKQAKTQHQLNTHIKNIHLNIRNYKCDICSKSFYTKQACEEHRRIHTGERPFSCEICGKTFVAGNALISHKRFHNDFYPHSCHMCPKKFKVRRSLINHIRTHTGERPFKCDLCSKTFNNSSQYSYHKKVTHSDARPFTCSLCGNCFKANKFLTRHMELHTVRSQIQSRKPNNPAHYVTSQIKTTQEPVATTTMVTQQRVKNVGNASSSNLKTSSSKPSLNYVQLKVGTHCFICGGKLAKDNDYKSHRCPPHDQIAKVTRERKPHDDTFYCHHCEYACKRKYTLKCHIQRSHMDLKGRSCDTSNSMKTCAMCQRSFSRKSNLRRHVFDQHLARAGGPKCVMKKGYCCDICGEGFNDQREMLAHRDVLHENASPMSWESFCASMKKVMFCYACGRYFSYEKEFSQHLCERPEYAEKKVICILESKPLYKCDSCSRCFHWKWMYRSHRESSHPHAEPLVWDTLAMDNIEHCCLICCENFNNSVDLQLHEPYCMKKAQSKLKSFVCDLCGSSFVRKSDCRRHKRNKHADGKDRISDEEGFSSSRKKCVIRCPYCDISMTTRNSILTHIKSVHNIETDTPFLCIPCNKVFKRKATMDAHNEVYHPEKEDTEETNKILKESEILLNGETAYHCNVCNRNVLNSIRFLAHYRLHYVERKFTCDLCGKQTRTQHQLNMHIKIIHLNIRNHKCDICDKTFHSRQSCEEHRRIHTGERPFSCEICGKTFIAMNALLTHKKFHNDFYAHPCSMCPKKFKVRRSLINHIRTHTGERPFKCELCPKTFNNSSRFSYHKKVTHSDNRPFTCTECGSCFKANKFLIRHMKLHKIHSLQKHAANAHNERLEQFVCDLCDRTFNSKNKLAGHIRSCHVDITDLIENPIKKLINGKFKCPICNKEFLRKTNANLHFLSHVTEKSLQCEKCQFRCHTQAQLDVHKNKHEKRFCCEICSKMFAYKFQLDVHVQGVHYNLRPFPCNICGKTFKTRYNYGSHMAQHKDIRNFQCPHCPRRCQHERSHEDVVDDTIELSQESSHLRKKAVSYVGKIESGLFECTICKEKFPLKNTVREHYFKHHCGSEQQQCPHCEFACYSKCALDVHLVKCHTKRYACEICGKMFPYKYQLKTHINAVHLNIRNHTCDVCGKSFKTRANFDTHMSRHMDSQFAILIVLDD
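The consensus pattern given in the Description, #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], can be transcribed into a regular expression and scanned against the protein sequence. The sketch below is only a coarse approximation of the Pfam profile HMM: it treats the # positions as any residue, because the Description does not list which residues are allowed there. `find_c2h2` is a hypothetical helper, and the fragment is residues 116-150 of the protein sequence above.

```python
import re

# Regex transcription of #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C].
# Assumption: '#' positions are matched as any residue.
C2H2_PATTERN = re.compile(
    r"."        # '#'
    r"."        # X
    r"C"
    r".{1,5}"   # X(1-5)
    r"C"
    r".{3}"     # X3
    r"."        # '#'
    r".{5}"     # X5
    r"."        # '#'
    r".{2}"     # X2
    r"H"
    r".{3,6}"   # X(3-6)
    r"[HC]"     # final histidine or cysteine
)


def find_c2h2(sequence, first_residue=1):
    """Return (start, end, text) tuples with 1-based inclusive coordinates.

    first_residue is the protein position of sequence[0]. finditer
    reports non-overlapping matches only.
    """
    found = []
    for m in C2H2_PATTERN.finditer(sequence):
        start = m.start() + first_residue
        end = m.end() + first_residue - 1
        found.append((start, end, m.group()))
    return found


# Residues 116-150 of the protein sequence in this record.
FRAGMENT = "KSQLIYTCAHCLRNFRRKSNLKRHMSKLHLGKEKD"
FRAGMENT_START = 116

matches = find_c2h2(FRAGMENT, FRAGMENT_START)
for start, end, text in matches:
    print(start, end, text)
```

For this fragment the sketch reports a single match at residues 121-144, the same span as the first hit in the hmmscan output. Agreement on one finger does not imply the regex will recover all 39 hits, since several of them are partial or weak matches to the profile.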
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -