Ssin010435.1
Basic Information
- Insect
- Stratiomys singularior
- Gene Symbol
- -
- Assembly
- GCA_954870665.1
- Location
- OX940878.1:12844228-12860785[-]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 35 0.023 3.8 9.3 0.4 2 23 287 309 286 309 0.95 2 35 1.5 2.5e+02 3.6 0.8 6 23 318 335 318 335 0.98 3 35 0.0014 0.23 13.1 3.5 2 23 342 364 341 364 0.95 4 35 0.017 2.9 9.7 2.8 1 23 372 394 372 394 0.98 5 35 1.9e-06 0.00032 22.1 3.2 1 23 402 425 402 425 0.97 6 35 0.11 17 7.2 2.7 1 23 431 454 431 454 0.94 7 35 0.16 26 6.7 0.7 2 23 463 484 462 484 0.97 8 35 3.4e-05 0.0057 18.2 0.1 1 23 491 514 491 514 0.95 9 35 0.00018 0.03 15.9 3.6 1 23 520 542 520 542 0.98 10 35 0.0001 0.017 16.7 6.1 1 23 548 571 548 571 0.98 11 35 0.53 88 5.0 1.0 2 19 797 814 796 819 0.89 12 35 9.4 1.6e+03 1.1 1.3 6 23 828 845 828 845 0.93 13 35 6.4e-05 0.011 17.3 1.9 2 23 852 874 851 874 0.89 14 35 0.013 2.1 10.1 3.5 1 23 882 905 882 905 0.96 15 35 4.3e-06 0.00071 21.0 0.9 1 23 913 936 913 936 0.97 16 35 0.0076 1.3 10.8 0.4 1 23 942 964 942 964 0.95 17 35 0.056 9.3 8.1 0.6 2 23 973 994 973 994 0.97 18 35 0.00034 0.056 15.1 0.3 1 23 1002 1025 1002 1025 0.96 19 35 0.091 15 7.4 1.4 1 16 1031 1046 1031 1047 0.87 20 35 0.12 21 7.0 0.7 2 23 1255 1277 1254 1277 0.96 21 35 0.18 30 6.5 2.0 6 23 1286 1303 1286 1303 0.98 22 35 1.6e-05 0.0027 19.2 1.0 2 23 1310 1332 1309 1332 0.95 23 35 0.0014 0.22 13.2 6.8 1 23 1340 1362 1340 1362 0.98 24 35 8e-07 0.00013 23.3 0.3 1 23 1370 1393 1370 1393 0.96 25 35 0.1 17 7.2 0.1 3 23 1401 1421 1400 1421 0.95 26 35 0.0055 0.92 11.2 1.2 2 23 1430 1451 1430 1451 0.98 27 35 2.8e-05 0.0047 18.5 0.2 1 23 1459 1482 1459 1482 0.97 28 35 4.3e-06 0.00071 21.0 1.4 1 23 1488 1510 1488 1510 0.99 29 35 0.0001 0.017 16.7 6.1 1 23 1516 1539 1516 1539 0.98 30 35 1.4 2.3e+02 3.7 0.5 10 23 1589 1603 1586 1603 0.91 31 35 0.12 20 7.0 0.1 3 23 1611 1631 1610 1631 0.95 32 35 0.0023 0.37 12.5 1.0 2 23 1640 1661 1640 1661 0.98 33 35 0.00037 0.062 14.9 0.1 1 23 1669 1692 1669 1692 0.96 34 35 1.3e-06 0.00021 22.7 0.8 1 23 1698 1720 1698 1720 0.99 35 35 4e-05 0.0066 18.0 5.5 1 23 1726 1749 1726 1749 0.98
Sequence Information
- Coding Sequence
- ATGATTTGTCGCATGTGTGTGCTTGAATCGACAAATCTACTATCCATTTTTGAAACAGAAGCGACACAGCTAAACAAAGCGGGAATAATTACACTACACTTTGGATTTGAGCCGATGAAGGACGATCCGGTAACTACTATGGTGTGTTCCACATGCTGGTTACAAATCGAAAGTTTTCATACGTACTATTTAAACGTCGAGGAAGCACAGCGTAAATTTTCTAAACTACAACTAGGTTGCAAAATCAAGGAAGATTATGCGATCTACGAtgatattattgataaaaaggagCAGTTCTTATTTTCAATCGAAGAAGTAGATGATAAATTATTTGCCACACCACTTGCAGAAGTGCCTAATGAATCGGAAAATTTAACAGTGCCAGATGTGATTCCGATTTGTGCGGGAGAAACAAGAGCGAATTCTGACAACGAAagtttaaaaaacgaaaaaaatggagGCAGTGTGCAAGTTAACGAAACAGAACAGGGATATGATTCCGAGGACGCTGACGATATTGAAGATGCAACTTACGAAGATCGTTCAGCTTGCAATGACTCCGATAACGATAGCGATTCGTTCGACTCTGATGATGACAAAAAACTAATTGACTTTACGAGGTTGCGCCAACCGTtacaattgaaagaaaacgaaatcaaatTAGAAACGCCATCAACGGATTCTCCTGAAACTACTCCATCTCATAGAGCATCCCACAGAACGGAAAACAAACTACCTCCGAAAAGACGAGGTAGAAAAAGAAAGTCAGAGAAGCCAAATCCTACCAATCCTCGTACATCAGAATATCGAATTAAATCCAAGCAATATGATGAAGAAATCGCAAAAAGAACCAAACTGAAATGTGATTTGTGTGAGAAAGAGTTTACAATATTTAATGTGTTACAAAGGCACTACCGTGCGGAACATAAGGTCAAAGGATATGTGGTGTGCTGTACGAGGAAATTCTACAAGCGTGGAATGTTTCTAGATCACATAATGAAACATTTGGAACCTGATCGGCTAAAATGTGATATTTGCAAGAAGACTTACTGTAGCAGACGAGCGTTACATGACCATATATTAGTATATCATGTGTCTGGTGATGTTAAGTCTTTCCAGTGCGATCAGTGTCCTCGTAAGTACTTGAAACAATATCAACTTAACATCCATAAAAAGAAACATGTACCGAAAGATGCGTCTCAATTTAAATGCGAAGAATGTGGAAAATGTTTCAATACGAACAACGCTCTGCAAACTCATCGGCGTTAcattcatgaaaataaatatgccCACATGTGCCATATTTGTGCGAAAGTTTTGCGGAGCAGGACATTATATTTAAAGCACAAATTAGTAGAACATGAAGGATATGTCGAACCGAAGCAACAGTGCAAAGAGTGTGGTGCATGGCTCAAGAATGCTTTTCGATTAAAACTGCATATGAAAAAgcacaaagaaagagaaaacgACTTTATTTGTAACATATGTGGCAAGTCTTCCCCATCGAGTAGTGCGTTACAAAGTCACGTAAAGTATGTTCACGAAGCACAAAGACTTCACCAATGTACATTTTGTAAGAAGGCGTTTAAGAGGGCAATCACATTGAAGGAGCATTTAACAACGCACACAGGGGATGTACTTTATACATGTCCCCACTGTCCAAAAACCTTCAACTCAAGAGCGAATATGCACTCGCACCGGAAGAAAAAACACCCGAAGGAATGGGAAGAAGCGCGACGTATTCGAGGTGCTCCTCTTGGCACTATAGGAAAAATTCGAATAGACGATGATACGAATGCAGTTGTTGAAGAAGATAGtgtaaaaaataatCCAACGGAAAACGATTCAATTTCAACGTACATATGCATATCCTGTTGGTCACGTATTGAAACATTTcatatatattatttgaaaattgaagaagcGCAACGTACACTCACAGAAACGGGCGCCAATACGGAAAAGTGTTGCATATACACAGAAGAAACTAAAACTGTGCATCTAGAAAATCGAAACGATTCCGAGAACAGTGCAATCAAGTTGGAGTTACTCAAACCAGATTATTCAACCACCGATCATGATCGGATGAGCAGTGAATCTTCAATTGAGGATAAGGTGCAAGATGCATACTATGATTCAGAGGATAGCTTTGGAACTTTCGATAGCGTACCGCTAGCAGTTATTAAAGCAGAAGCAGAACAcagaaaaaatcaaaggaataTAATCGATGTACGAGAAGAAAGTAATATTGAAAAGCTTAAATCATCCAATGATGACAAAAATGAAGCAACTAGAAAGATAGGTACACAAAAACCGAAAAAGCCGAGGGGAGAGCCACGAAGAATAGTAAATGAAAATCATGAGAAAATTgctaaaataataacaataacatgCGATCTATGCACGTTAGCATTTAAAACGTTTGCTGATATGAAGCAACATTGTATTTCTGAGCACAAAGTTAAAGGATATGCTGTGTGTTGTTCAAGTAAATTTTATagacgaaataaatttttagaacaTATGATTAGACATGTAGCTCCTGACATGCTGAAATGTGAAGAATGTGATAAATGTTTCTCCAGTAAACGAGCGCTGAAAGACCACACTGTACTGCTCCATAATACAGTCgatgtaaaaatatttcattgcaatATGTGCCCGAAACGATACTTTAAAGAATATCTCTTGAACCAACATAAAAACATGGTTCACAAAGAGCTGGATAAAACTGCTTTTATATGTGACGATTGTGAAAAAAGTTTCAGGACCGAGTCTTTACTGAAAATGCATAAACGCAGTGTTCACGAAAAAAGAAACCGCTTTATATGTGATATATGTGCAAAAGTTTCACGAAGCAAAGCACTTTTTACGAAACACAAGTTAGAGCATGACGGGATTTTCGAGCCAAAAGCGCAATGTAAAGAATGTGGCTCGTGGCTTAAGAatgcttttaatttaaaattacataTGAAATTGCATAATCGGCCAGAAGGGCAGCTATTTGAATGTGATCAATGTGGAAAAAAATCAGCAACGAGAAGTGCCATACAAAATCATATTCGATTTGTTCATTCAGACGAAAGACCATTTCAATGTACATTTTGTGATAAAGCATTTAAAAAGCGACTTACATTGAAACCCAGACAAGACGATTCGCTTTCAACACTCATATGCCTTTGCTGTTGgtcgaaaattcaaagtttccATCTGTATTATCTGAGTGTGGAGGAGGCACAGCGCAAGTTGACAGAAATCGCCGTAATCAAGGAAGAAGTTCAGTCGGAATTTTTTATATACGAGCCCAGTGAATTTGAGCCTGCTCCAAATACATCTAACAAagcagaagaaaatgaagaatatcAACCAATCAAAGAATATCCAGAAAGTACAACCggtagatttgaatttattggacAAGACTATGCGACGGTTGAGGATGACGGTCAGGAAGACGATGATCCAAGAGATGAAGATTTCAATGCAGCAGACGGCTTCGATACTTCCGATGATGAACCGCTAGAATCACTTAAAATAGAAACCAAACATCGACCGAAAAGTAAACATAATTGCAGTGAAATACAAGTGAAAACAGAAGATATTGAGGCCgcttcatcaaaagaaaatacagAAACAGTAACTACAAAAAAGAGAGGGAGAAAGAAAAAGTCTGACACAAGTCCGAAAAAATCAGCCGAAGAGACGGAACagcgtaaaaaaataaatgaatatgatGAAGAGATCGCTAGACAAACTACATTAAAATGTGATCTGTGCACATctgaatttaaattatttgcggAAATGCAACGACATTATCGGTCTGTGCACAAAGTTAAAGGTTACGCAGTATGTTGTTCGACGAAATTTTATAATCGTAGTACATTTTTTAATCACATGATAAAACATGTGGCCCCGGAAAGGCTGAAGTGTGACAAATGcgacaaaattttctcaaataaacGTGCATTGCAAAATCACACGCTTATATTCCATGATCCAGTCGATATAAAAATCTTCCATTGTGATCAATGTCCAAAACGCTACGCTAAACAATATCAACTTAATCATCACAAAAAAACCCATTTAGAAAAGAATAGGACTGATTTCACGTGTGACGAGTGCGGGAAAGGTTTCAGCAATAAATACGCACTGAATGCACACAAACGCGGCATACATGAGAAGGGTTATTGCCGCATGTGTGATATATGTGCTAAAGTTTTACCGAGCAGAGCACTCTTCATCAAACACAGATTGGAGCATGAAGGCATTTTCGAACCGAAAGCACAATGTAAAGACTGTGGCAGGTGGCTTAAAAATGCATTTAGATTAAAATTACATATGAGAACGCACAATCGACCGGacgtaaataaatttacatgCAAAGAATGTGGGAAAACTGCACCATCGAGAAGTGCATTGCAGAGTCATGTGCGATATGTTCATACAGTAGAGCGACAATATCAATGTACATTCTGTGATAAGGCGTTTAAAAAGCCAATTACATTGAAAGAACACATGACTACCCATACCGGAGACGTTCTATACACCTGTCCACATTGTCCAAAAACATTTAACTCACGAGCAAATATGCACTCGCATCGAAAAAAGAAGCATCCGAAAGAATGGGAGGAGGCTCGAAGAACTCGTGGAATGCCTGAAGAATTATTGCCTAAAACTAATctaagtgaaacaaaaattgtgaataattttGACACAGAAAATTCTGCAAGTAATCAAGAAAAAACGAATTATACTATGTATTTCAGCAATAAACATGCACTGAAAACGCATGTGCGTGGTATACATGAAAATGCTTACTCGCGAATATGTGATATTTGTGCGAAAGTTTTGCCAAGTAAAGGGCTGTTCATAAAGCATAAATTGGAGCATGAAGGAATAATCGAACCGAGAGCACAATGTAGCGAATGTGAGAAATGGCTAAAAAATGCTTACCAATTGAAGTTACATATGAGGAAGCACACAAGGCCAGCTGTAAATCAATTTGCCTGTAATGAATGTGGTAAAACGGCACCATCAAAAAATGCACTACGCGGACATATACGATATGTGCATAAATCTGGCCGAGAGTATCAATGCACGTATTGTGAAAAAGCATTTAAGCGGCCGCTAACGCTGaaaGAACATATGGCAACTCATACGGGAGACATCCTCTACACATGTCAGCATTGCTCGAAGACGTTCAATTCAAATGCGAATTTGCATGCACATCGGAAAAAAGTACATCCAAAAGAGTGGGAGCAAGCACGCAAAATGCGTGGTATGCCGGAAGAACTACTGAAGAAAGTTTTAATGGAAGCTGAGTCGAACAGCGGAAATGATGTCGAAAACTCGATAGTGAATACTCATGAGCCCACTGCATTCTATAAACGCAATTAA
- Protein Sequence
- MICRMCVLESTNLLSIFETEATQLNKAGIITLHFGFEPMKDDPVTTMVCSTCWLQIESFHTYYLNVEEAQRKFSKLQLGCKIKEDYAIYDDIIDKKEQFLFSIEEVDDKLFATPLAEVPNESENLTVPDVIPICAGETRANSDNESLKNEKNGGSVQVNETEQGYDSEDADDIEDATYEDRSACNDSDNDSDSFDSDDDKKLIDFTRLRQPLQLKENEIKLETPSTDSPETTPSHRASHRTENKLPPKRRGRKRKSEKPNPTNPRTSEYRIKSKQYDEEIAKRTKLKCDLCEKEFTIFNVLQRHYRAEHKVKGYVVCCTRKFYKRGMFLDHIMKHLEPDRLKCDICKKTYCSRRALHDHILVYHVSGDVKSFQCDQCPRKYLKQYQLNIHKKKHVPKDASQFKCEECGKCFNTNNALQTHRRYIHENKYAHMCHICAKVLRSRTLYLKHKLVEHEGYVEPKQQCKECGAWLKNAFRLKLHMKKHKERENDFICNICGKSSPSSSALQSHVKYVHEAQRLHQCTFCKKAFKRAITLKEHLTTHTGDVLYTCPHCPKTFNSRANMHSHRKKKHPKEWEEARRIRGAPLGTIGKIRIDDDTNAVVEEDSVKNNPTENDSISTYICISCWSRIETFHIYYLKIEEAQRTLTETGANTEKCCIYTEETKTVHLENRNDSENSAIKLELLKPDYSTTDHDRMSSESSIEDKVQDAYYDSEDSFGTFDSVPLAVIKAEAEHRKNQRNIIDVREESNIEKLKSSNDDKNEATRKIGTQKPKKPRGEPRRIVNENHEKIAKIITITCDLCTLAFKTFADMKQHCISEHKVKGYAVCCSSKFYRRNKFLEHMIRHVAPDMLKCEECDKCFSSKRALKDHTVLLHNTVDVKIFHCNMCPKRYFKEYLLNQHKNMVHKELDKTAFICDDCEKSFRTESLLKMHKRSVHEKRNRFICDICAKVSRSKALFTKHKLEHDGIFEPKAQCKECGSWLKNAFNLKLHMKLHNRPEGQLFECDQCGKKSATRSAIQNHIRFVHSDERPFQCTFCDKAFKKRLTLKPRQDDSLSTLICLCCWSKIQSFHLYYLSVEEAQRKLTEIAVIKEEVQSEFFIYEPSEFEPAPNTSNKAEENEEYQPIKEYPESTTGRFEFIGQDYATVEDDGQEDDDPRDEDFNAADGFDTSDDEPLESLKIETKHRPKSKHNCSEIQVKTEDIEAASSKENTETVTTKKRGRKKKSDTSPKKSAEETEQRKKINEYDEEIARQTTLKCDLCTSEFKLFAEMQRHYRSVHKVKGYAVCCSTKFYNRSTFFNHMIKHVAPERLKCDKCDKIFSNKRALQNHTLIFHDPVDIKIFHCDQCPKRYAKQYQLNHHKKTHLEKNRTDFTCDECGKGFSNKYALNAHKRGIHEKGYCRMCDICAKVLPSRALFIKHRLEHEGIFEPKAQCKDCGRWLKNAFRLKLHMRTHNRPDVNKFTCKECGKTAPSRSALQSHVRYVHTVERQYQCTFCDKAFKKPITLKEHMTTHTGDVLYTCPHCPKTFNSRANMHSHRKKKHPKEWEEARRTRGMPEELLPKTNLSETKIVNNFDTENSASNQEKTNYTMYFSNKHALKTHVRGIHENAYSRICDICAKVLPSKGLFIKHKLEHEGIIEPRAQCSECEKWLKNAYQLKLHMRKHTRPAVNQFACNECGKTAPSKNALRGHIRYVHKSGREYQCTYCEKAFKRPLTLKEHMATHTGDILYTCQHCSKTFNSNANLHAHRKKVHPKEWEQARKMRGMPEELLKKVLMEAESNSGNDVENSIVNTHEPTAFYKRN
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -