Opun000109.1
Basic Information
- Insect
- Othius punctulatus
- Gene Symbol
- -
- Assembly
- GCA_951805005.1
- Location
- CATORJ010000308.1:92202-102570[+]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 49 0.11 18 7.8 1.0 1 23 88 111 88 111 0.97 2 49 0.0016 0.26 13.6 1.2 1 23 143 165 143 165 0.98 3 49 0.0023 0.38 13.1 11.5 1 23 171 193 171 193 0.98 4 49 0.0018 0.29 13.5 0.6 1 23 199 221 199 221 0.97 5 49 0.00021 0.034 16.4 2.4 1 23 227 249 227 249 0.99 6 49 0.0035 0.56 12.6 4.8 1 23 255 277 255 277 0.98 7 49 0.00052 0.084 15.2 4.4 1 23 284 306 284 306 0.99 8 49 0.0033 0.54 12.6 2.9 1 23 312 334 312 334 0.98 9 49 0.00015 0.024 16.9 4.2 1 23 340 362 340 362 0.97 10 49 0.0038 0.62 12.4 0.4 1 23 368 390 368 390 0.97 11 49 0.067 11 8.5 1.7 1 23 396 418 396 418 0.97 12 49 0.0043 0.7 12.3 1.6 1 23 424 446 424 446 0.99 13 49 0.02 3.3 10.2 1.0 1 23 452 474 452 474 0.98 14 49 2e-05 0.0032 19.6 2.9 1 23 480 502 480 502 0.98 15 49 0.22 36 6.9 0.7 1 23 554 577 554 577 0.96 16 49 0.0016 0.26 13.6 1.2 1 23 609 631 609 631 0.98 17 49 0.00012 0.019 17.2 7.5 1 23 637 659 637 659 0.99 18 49 0.00015 0.025 16.8 3.0 1 23 665 687 665 687 0.99 19 49 0.0016 0.26 13.6 2.8 1 23 693 715 693 715 0.99 20 49 1.6e-05 0.0026 19.9 1.7 1 23 722 744 722 744 0.99 21 49 0.00014 0.022 17.0 3.4 1 23 750 772 750 772 0.98 22 49 6.4e-06 0.001 21.2 4.2 1 23 778 800 778 800 0.98 23 49 0.01 1.7 11.1 0.3 1 23 806 828 806 828 0.97 24 49 0.00021 0.034 16.4 1.3 1 23 834 856 834 856 0.98 25 49 0.0061 1 11.8 1.7 1 23 862 884 862 884 0.98 26 49 0.02 3.3 10.2 1.0 1 23 890 912 890 912 0.98 27 49 2e-05 0.0032 19.6 2.9 1 23 918 940 918 940 0.98 28 49 0.07 11 8.4 1.3 1 23 992 1015 992 1015 0.97 29 49 0.00041 0.066 15.5 3.7 1 23 1047 1069 1047 1069 0.98 30 49 0.00019 0.031 16.5 7.1 1 23 1075 1097 1075 1097 0.99 31 49 0.00017 0.028 16.7 1.0 1 23 1103 1125 1103 1125 0.97 32 49 0.00043 0.069 15.4 3.4 1 23 1131 1153 1131 1153 0.99 33 49 0.013 2.1 10.8 4.6 1 23 1159 1181 1159 1181 0.98 34 49 1.7e-05 0.0028 19.8 1.6 1 23 1188 1210 1188 1210 0.99 35 49 0.002 0.33 13.3 4.2 1 23 1216 1238 1216 1238 0.98 36 49 0.0022 0.35 13.2 1.8 1 23 1244 1266 1244 1266 0.98 37 49 0.0043 0.7 12.3 0.6 1 23 1272 1294 1272 1294 0.97 38 49 0.00054 0.088 15.1 0.8 1 23 1300 1322 1300 1322 0.98 39 49 0.0058 0.94 11.9 2.1 1 23 1328 1350 1328 1350 0.99 40 49 0.036 5.9 9.4 1.0 1 23 1356 1378 1356 1378 0.98 41 49 2e-05 0.0032 19.6 2.9 1 23 1384 1406 1384 1406 0.98 42 49 0.0032 0.52 12.7 4.2 1 23 1412 1434 1412 1434 0.97 43 49 0.0006 0.097 15.0 4.8 1 23 1440 1462 1440 1462 0.98 44 49 0.0011 0.18 14.1 2.1 1 23 1468 1490 1468 1490 0.98 45 49 0.57 93 5.6 2.4 1 23 1496 1518 1496 1518 0.97 46 49 0.02 3.2 10.2 1.2 1 23 1524 1546 1524 1546 0.96 47 49 0.0037 0.6 12.5 1.7 1 23 1552 1574 1552 1574 0.98 48 49 1.5e-05 0.0025 20.0 0.9 1 23 1580 1602 1580 1602 0.97 49 49 0.0021 0.33 13.3 0.5 1 23 1608 1630 1608 1630 0.98
Sequence Information
- Coding Sequence
- ATGGATTtgattaaattagaaattaaggATGAAATAGAATTAGATTTCGAAGAAATCTACATTAAAACAGAAAAACACTCTAGTGATGAGGACAGTAATGACCTATTGAATGCAGATGATATAAAGCAGGTTCTGGATCAGCCACAATATGTTTTTGATTGTGAAACTAAGACAAACACAATTGAATATATTAAACCTGAGGTCAACACTTGCATAGTAAAGATAGAAGAAGAGACTGTGCAGATCAGCGCAGCTATACCGCACAGTTGTGAGACGTGTAATATTACATATGATGGTCTGGCGAGTTATGAGATGCATTTAAAGACTAAACATCGTAGTCATATCAGAAGTAGGGATGTTTCTAAACGGTTCGCCAAGAAGAAGAGTCTAAAGAGGTGCAAGGTGATCGATCCTGAAGAGAGTTGTTACAAATGTGTCTCATGCACATCAACTTTCAAACAGAGATGGCTTTTAAATCGTCACCAAATAATTCACACTGGAGAGAAGCCTTACAAATGTCACATATGTAACTGTTGCTACACCCAAAGAAGTCATTTAAGACAGCATGAGAAGATTCACAGTGAAGAGAAgcctttcaaatgtgatgagtgcGCCATGGCCTTTAGTCttaatacttatttgaagaGGCACAAAATAATTCACTCTGGCGAgaggccttacaaatgtgacgtgTGCACCTCAAGCTTCTTCACGCCCAGTACTTTGACCCAGCACAAGAAAACTCATACTGGTGGAAAGCCGCACAAATGTGACATTTGCAACTGTAGTTACACAACAAAACGTATCTTAGAGAGACACATAATGACCCACACTGGAGCGGAAAAGCCTTATCAGTGTGGTTTGTGCAGCGTAAGATTCACCCAGAAATGTCATTTGAACGAACACCTGAGGATACACACTGGAGAGAAGCGGCACAAATGTAATTTATGTTCCTCGCAATTCAGCACAGCTGGCAGCTTGACCACCCACAAGATGACCCATACTGGCGAGAAGCCACATAAATGTGACATTTGCAACTCGAgattcaaaagaaaacaaatgttAAAGAGGCACATGGTGATGCACACTGGAGAGAAACCATACCAGTGTGATGTATGCAAGTCACGTTTCATTTCAAAAGTAGAGTTGAAACTGCACGAGACAGTCCACACTGGAGACAAGCctcacaaatgtgatgaatgcatgTCAAGCTTCAGCACATCAGGTCGTTTGACGGtgcacaagatgatccacacTGGAGAGAAGCCGTACACATGTGATTTATGTATGTCCAGCTTCAGTACACCAGGTAAACTGAAACAACACAAGAAGACCCACTCTGGTGACAAGCCACACAAATGTGATATTTGCAACTCAGCCTTCATAGCAAAAAATATCTTGAAGAGACACATGATGATGCACACTGGAGAAAAACCACACAAATGCGATGTATGCACCATAAGCTTCGCTCAGAAAGGTCATTTAAAGGAACACCAGAGGATACACACGGGAGAGAAGCCACacaaatTTCTGGATCAGCCACAATATGTTTTTGATTGTGTAGCTAAGACAAACACAATTGAATATATTAAACCTGAGgtCAACACTTGCATAGTAAAGATAGAAGAAGAGACTGTCCAGATCAGCGCAGTTACACCGCACAGTTGTGAGACGTGTAATATTACATATGACGGTTTGGCGAGTTATGAGATGCATTTAAAGATTAAACATTGTAGTCATATTAGAAGTAGGGATGTTTCTAAAAGGTTCGCCAAGAAGAAGAGTCAAAAGAGGTGTAAGGTGATCGATCATGAAGAGAGTTGTTACAAATGTGTCTCATGCACATCAACTTTCAAACAAAGATGGCTTTTGAATCGTCACCAAATAATTCACACTGGAGagaagccttacaaatgtgacatATGTAACTCGTGCTACACCCAAAAAAGTCATTTAAGACAGCATCAAAAGAGTCACACTGAAGAGAAgcctttcaaatgtgatgagtgcacctCGAGCTTCTTCACTGCTAGTAGTTTGACGAAACACAAGAAAACCCACAGCGGTGGAAAGCCATACAAATGTGACATTTGCAACTGTAGATACACAACAAAACGTATCTTAGAGAGACACATAATAACCCACACTGGAGTGGAAAAACCTTACCAGTGTGATTTGTGCAGCGTAAGGTTCACCCAGAAAGGTCATTTGAATGAACATCTGAGGATACACACTGGAGAGAAGCCACACAAGTGTGATTTATGTTCCTTGCAATTCAGCACATCTGGCAATTTGACCAAGCACAAGATGACCCATACTGGTGAGAAACCACATAAATGTGATATTTGCAACTTGAGATTCACAAGAAAACAAATGTTAAAGAGGCACATGATGATGCACACTGGAGAGAAACCTTACCAGTGTGATGTATGCACCTCACGTTTCATTTCAAAAATAGAGTTGAAACTACACGAGACAGTCCATACTGGAGacaagccttacaaatgtgatgaatgcacatcaagctTCAGCACATCAAGTCGTTTGACGGtgcacaagatgatccacacTGGAGAGAAGCCGTTCACATGTGATTTATGTATGTCCAGCTTCAGTACACCAGGTAAACTGAAACAACACAAGAAGACCCACTCTGGTGACAAGCCACACAAATGTGATATTTGCAACTCAGCCTTCATAGCAAAAAATATCTTGAAGAGACACATGATGATGCACACGGGAGAAAAACCACACAAATGCGATGTATGCACCATAAGCTTCGCTCAGAAAGGTCATTTAAAGGAACACCAGAGGATACACATGGGAGAGAAACCACacaaatTTCTGGATCAGCCACAATACATTTTTGATTGTGAAACTAAGCCAAGCACTGTTCAATATGTTATACCTGAGGACAACACTTGCATAGTAAAGAtagaagaagagacagtgcagatcaGCGCAGTTACACCGCACAGTTGTGATAAGTGCAATATTACATATGATGGTTTGGCAAGTTATGAGATGCATTTAAAGACTAAACATCGTAGTCGTATCAAAAGTAGAGAAGTTTCTAAATGGTTCGCCAAGAAGAAGAGTCTAAAAAAGCACAAGGTGATCGACCCTGAAGAGAGTTGTTACAAGTGTGACTCATGCACATCAACATTTAAACGAAGATGGCATTTGAATCGTCACCAAATAATTCACACTGGAGAGAAGCCTTACAAATGTCAAATATGTAACTCATGCTACTCCCAAAGAAGTCATTTAAGACAGCATCAGAAGATTCACACTGAAGAGAAgcctttcaaatgtgatgagtgcaccATGGCTTTCAGTCTtaatagttatttgaaaaagcacaaaatCATTCACTCTGGGGAGAGGCCTTACAAATGCGATGTGTGCACCTCAAGCTTTTTCACTTCTAGTACTTTGACCCAGCACAAGAAAACCCACAGTGGTGGAAAGCCGCACAAATGTGACATTTGCAACTGTAGCTACACAACAAAACGTATCTTAGAGAGACACATAATGAACCACACTGGAGTGGAAAAGCCATATCAGTGTGATTTGTGCAGCGTAAAATTCTCCCAGAAAGGTCATTTGAACGAACACCTGAGGATACACACTGGAGAGAAGCCGCACAAATGTAATTTATGTTCCTCGCAATTCAGCACATCTGGCAGCTTGACTAAGCACAAGATGACCCATACTGGCGAGAAGCCACATAAATGTGATATTTGCAACTCGGCCTTCATAGCAAAACAAAGATTAAAGAGACACATGATGATGCACACTGGAGAGAAACCTTACCAGTGTGACGTATGCACCTCACGTTTCATTTCAAAAACAGAGTTGAAACTACACGAGACAGTCCACACTGGAGacaagccttacaaatgtgatgaatgcacgtcAAGCTTCAGCACATCGGGTCGATTGACAGtgcacaagatgatccacacTGGAGAGAAGCCATACACATGTGATTTATGCCTGTCAAGCTACAGCACATCCGGTAAATTGAAACAGCACAAGAAGACCCACTCTGGTGACAAGCCATACAAATGTGATATTTGCAACTCAGCCTTTGtagcaaaacatattttaaagaGGCACACAATGATACATACTGGAGAAAAACCTCACAAATGCGATGTATGCACCATAAGCTTTGCCCAGAAAGGTCATTTGAAGGAACACCAAAGGATACACACGGGAGAGAAGCCACacaaatgtaatttatgtaCATCACAATTCAGCACATCTAGTAGTTTAAATAAACACAGTGAGACCCACACGGGTGCAAAGCCACATAAATGTGATATTTGCAACTCATGCTTTCGAGCAAAGACTTTATTACAGAGGCACATAATGAcgcacacaggagaaaaaccttacaaatgtgatgtgTGCACCTCATGTTTCATTTCAAAAGTCGAGTTGAAAATGCACAGGATGATCCACACCGGAGAGAGGCCCCACAAATGTGATCTATGTATTTCAAGCTTCATCACAAAAAGTCGCCTGACAATGCACAAAATGATTCACTTTGGAGAGAAGCCACACAATTGTGATGTATGCTCGTCGACCTTCACCCTAAAAAGTGGTTTAAACGTTCACAAGTTGATCCATACCGGGGAGAAGTCTTTCACGTGCAGTATATGCCCTTCGAGCTTCATCACGAAATGTCGTTTGACGGCGCACGAGAAGGTCCACACGGGAGAGAAACCTTTcgaatgtgacgaatgcaccaCGAGCTTTGCTACAAAGGGTCAACTGAAGAGACACAAATCAATTCACACTGAAGAgaaacctttcaaatgtgacGCTTGCACCTCGAGCTTTGCCACATCTGCCGGCTTGACTCGGCACAAGAAGATTCACTTCGGCGAGAAACCACACGAATGA
- Protein Sequence
- MDLIKLEIKDEIELDFEEIYIKTEKHSSDEDSNDLLNADDIKQVLDQPQYVFDCETKTNTIEYIKPEVNTCIVKIEEETVQISAAIPHSCETCNITYDGLASYEMHLKTKHRSHIRSRDVSKRFAKKKSLKRCKVIDPEESCYKCVSCTSTFKQRWLLNRHQIIHTGEKPYKCHICNCCYTQRSHLRQHEKIHSEEKPFKCDECAMAFSLNTYLKRHKIIHSGERPYKCDVCTSSFFTPSTLTQHKKTHTGGKPHKCDICNCSYTTKRILERHIMTHTGAEKPYQCGLCSVRFTQKCHLNEHLRIHTGEKRHKCNLCSSQFSTAGSLTTHKMTHTGEKPHKCDICNSRFKRKQMLKRHMVMHTGEKPYQCDVCKSRFISKVELKLHETVHTGDKPHKCDECMSSFSTSGRLTVHKMIHTGEKPYTCDLCMSSFSTPGKLKQHKKTHSGDKPHKCDICNSAFIAKNILKRHMMMHTGEKPHKCDVCTISFAQKGHLKEHQRIHTGEKPHKFLDQPQYVFDCVAKTNTIEYIKPEVNTCIVKIEEETVQISAVTPHSCETCNITYDGLASYEMHLKIKHCSHIRSRDVSKRFAKKKSQKRCKVIDHEESCYKCVSCTSTFKQRWLLNRHQIIHTGEKPYKCDICNSCYTQKSHLRQHQKSHTEEKPFKCDECTSSFFTASSLTKHKKTHSGGKPYKCDICNCRYTTKRILERHIITHTGVEKPYQCDLCSVRFTQKGHLNEHLRIHTGEKPHKCDLCSLQFSTSGNLTKHKMTHTGEKPHKCDICNLRFTRKQMLKRHMMMHTGEKPYQCDVCTSRFISKIELKLHETVHTGDKPYKCDECTSSFSTSSRLTVHKMIHTGEKPFTCDLCMSSFSTPGKLKQHKKTHSGDKPHKCDICNSAFIAKNILKRHMMMHTGEKPHKCDVCTISFAQKGHLKEHQRIHMGEKPHKFLDQPQYIFDCETKPSTVQYVIPEDNTCIVKIEEETVQISAVTPHSCDKCNITYDGLASYEMHLKTKHRSRIKSREVSKWFAKKKSLKKHKVIDPEESCYKCDSCTSTFKRRWHLNRHQIIHTGEKPYKCQICNSCYSQRSHLRQHQKIHTEEKPFKCDECTMAFSLNSYLKKHKIIHSGERPYKCDVCTSSFFTSSTLTQHKKTHSGGKPHKCDICNCSYTTKRILERHIMNHTGVEKPYQCDLCSVKFSQKGHLNEHLRIHTGEKPHKCNLCSSQFSTSGSLTKHKMTHTGEKPHKCDICNSAFIAKQRLKRHMMMHTGEKPYQCDVCTSRFISKTELKLHETVHTGDKPYKCDECTSSFSTSGRLTVHKMIHTGEKPYTCDLCLSSYSTSGKLKQHKKTHSGDKPYKCDICNSAFVAKHILKRHTMIHTGEKPHKCDVCTISFAQKGHLKEHQRIHTGEKPHKCNLCTSQFSTSSSLNKHSETHTGAKPHKCDICNSCFRAKTLLQRHIMTHTGEKPYKCDVCTSCFISKVELKMHRMIHTGERPHKCDLCISSFITKSRLTMHKMIHFGEKPHNCDVCSSTFTLKSGLNVHKLIHTGEKSFTCSICPSSFITKCRLTAHEKVHTGEKPFECDECTTSFATKGQLKRHKSIHTEEKPFKCDACTSSFATSAGLTRHKKIHFGEKPHE
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -