Gpol003321.1
Basic Information
- Insect
- Gastrophysa polygoni
- Gene Symbol
- stc
- Assembly
- GCA_963576655.1
- Location
- OY755082.1:48379251-48401874[-]
Transcription Factor Domain
- TF Family
- zf-NF-X1
- Domain
- zf-NF-X1 domain
- PFAM
- PF01422
- TF Group
- Zinc-Coordinating Group
- Description
- This domain is presumed to be a zinc binding domain. The following pattern describes the zinc finger. C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C Where X can be any amino acid, and numbers in brackets indicate the number of residues. Two position can be either his or cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind to DNA [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 12 2 2.7e+04 -5.7 3.4 6 10 1307 1311 1304 1311 0.69 2 12 6.6e-06 0.087 13.4 13.7 4 19 1352 1367 1350 1367 0.93 3 12 1.1e-07 0.0014 19.1 13.3 1 18 1403 1420 1403 1421 0.98 4 12 2.3e-08 0.00031 21.3 13.6 1 18 1461 1478 1461 1479 0.97 5 12 2.4e-06 0.032 14.8 13.1 4 19 1526 1541 1519 1541 0.87 6 12 2 2.7e+04 -4.5 2.0 5 10 1570 1575 1570 1575 0.93 7 12 0.072 9.6e+02 0.5 11.5 1 11 1581 1591 1581 1593 0.96 8 12 2 2.7e+04 -4.5 1.1 4 10 1596 1602 1595 1602 0.73 9 12 2e-10 2.6e-06 27.9 15.2 1 19 1608 1626 1608 1626 0.98 10 12 0.00057 7.6 7.2 14.8 1 19 1665 1682 1665 1682 0.94 11 12 2.5e-08 0.00034 21.1 18.0 1 19 1718 1737 1718 1737 0.97 12 12 0.4 5.4e+03 -1.9 15.7 4 18 1749 1765 1747 1766 0.89
Sequence Information
- Coding Sequence
- atgGAACCTGAAGATGCCAAGTTTATGCGAACGGAACGTAAGCCAGGTTCAATTGATGTTCATCCTAATGTTGATGCAATCATTTTAAATTATGATTTAGATATACAAATACTCGCATCCAAAGACAATGTAATTTATGAAGAGAAGAAGggTTTGAAGAAAGTTATAGAATTACCAATGATCAACAGCAGAACAGACTGCCATGTTTTGTCAAAAGAAGTTGTTGGTCAGTGTGATCTTATTCACCATTCTCGTTTATCGGAAGTGGAACAGATAATTTATTATCTGAAAAAAAGAAAGCTTCAAACAGGTGTGAAATCAGGGTCTGCCCTACTTATAGATGctgataaaaattcaacagacaAAAATGAAGCAAACTACAACAGTTTGTCTGACTACATTGATTTGCTGTATGAAGGCATGACTGAGAAAATTAAAGGGGCACAGATGATTCAGTTACTGGCTAGAGATCCCCAGAATTTAGAAGCACTTGCTAAAAatgaaattttgattaGTGCACTTGGTCGTGTTCTGAGAGAAGATTGGAAGAGAAGCATTGCTTTGACCACCCATCTAGTATTcacatttttctgtttttcaatgtattcatgCTTCCATGATGTCATTTTGAAGTGTAAGGTTGGTTCTATTTGCATGGATATCTTAGACTATGAACTCCGTCGCTATGACAAGTGGAAGGCCGATTTCGAAGGGTCTGAGGCGCCATCTGACACCCCCATAGTCCCAGTAACTCGAAAGCCTTGCCCTAGCAGTGCTAGCATGTCCGAAATTCCCAGAAGTCGTATACCCGAGCCAGTCCGACCAAAATCTGGCAATTTTTCAGACACCAACATCAGTCATATAATGGAAGGCAGCATATATGATGATCTCACCAATTCGATGGAATGTATTGATGATAAAAAACTCACAGATGAACAGAAACTGAAGAGATTTCGTACACTTGTTAAAAAACAAGAGCACTTGCTACGTGTTGCATTCTATTTACTATTGAATATATCTGAGGATGAAACTGTGGAAGAGAAGATGTCTAAAAGAAACATCATTGGGCTTCTGGGCAAAGCTCTGGAAAGGGAAAATGAAGATTTGCTGATATTGGTTGccacatttctgaaaaaattgtcCATAATGCAATGCAATAAAGAAGCAATGATAGGAAGTTTGAATATTGTTGAGAAACTTCCCAGGTTGTTGGAGTCAAATAGTGCGGATTTGGTCCATTTGACACTAAAACTAATATTCAATCTGTCCTTTGATAGTTCTGTTAGGGCAAGAATTGTTCAAGCTGGAATGCTGCCCAAAATCAGAAGTCTTCTAAgtGATGATAAACATCAAGAAATTGTTCTCAAAATCCTTTACCACTTGAGCTATGAAAAtgagaacaaacatcaattcaTGGATTGCATTGGATTGATCACAGATATGCTTGTTCTAAATGTTGGAAGTGAAAGTGACAAATCAATGGTTGCGCTTTGTATAAATCTGGCAACTGATCCCTCTAATGCCCAGCACATCATCAAGAAAAATAGGCTGCAGGCCTTGATGATGCGAGCTTTCACTTATCAGGATGCCATGTTGATGAAAATGCTGAGGAATATCTCTGACTATCCAGCTTCTGCACCATCGTTCATTGAATTTGTGGGAGATATAGCAAAAGCGGTAGTAGAATCAAGAGATGAAGATTTTATCAGGGAATGTATTGGTATTCTGAGCAATATTTACTTACCTGATCTCGATTGGGCGGagatattcaaacattttgataTGGTTAAATGGGTGAAAAATGTTATAACCAGTAACAACGCCGATGCTGAGCTAGTCTTACATGTTGTGGTTTTGTTAGGCACGGCAGCTTCAGATGAGGGATGTGCAAAATTATTCTGTGAAACAAATATTCTAGGACTcctcattgatttattgaaaacacATCAGGAAGATGATGAAATCGTCTTACAAATTGTGTATATATTTTACGTTACTTTGGCTCATGGTtgtaatattgaatacattgtGAATAAAACAGAGGCACCTGCTTATCTAATCGATCTCCTCCAAGACAACAACAAACTGATTCGAAAGCTGAGCAACACGTGCCTCAACATCATTTCCGATCATAACAAAACTTGGGCGGAGCGAATTCGAATCGAAAAATTTCGCAACCACAATGCCCAGTGGCTTGCTCTAGTCGATTCTCAACAGCTCGGCCTCGAAGATGACGAGGACGAAGAAGACGAGTTGCCACCGTACTTGAATACAGAATATCTGAGTACAGCTGTGGTGCCCCCATTATCAGCACAGTTGAGACCAGATGCAATGTCCTTCCAAAATAGGCAAGACCCAGAACAAAACTTTCCTAGTAACCCATCTGGACCCTATGAGAATTCCAGGAGTTCTTTTAGGAATCTTGGATATGGAAATAACAGTGAAAGTTCCAGTTACTCATCATTAGAACCAACTGCTCAGGAGTTTTATCCaagtaataatgaaaatttaccAAGTGGGACCAtcaagaaaaaatttcaaaataatccaaGAATTGCTGGAAGAGGTGGAAAAAGTAGATATGGAACTGGAAGGACATTCAGTGGACACAGAGGTGATTATAATGATTTCAGTAATACCTACTATCAAGAagatgaatacatgaataatcATCCAAGAAATCATACAAAGAAGTCATATAGTAGGGGCAACACTCAAAATCTAACCAGTTCATCTAATTGTAATTATGACCGTGATGAACCTCAAAGCTGTGGTAACCAGGGAAATATAATTCCACAAACCCAGAAACATAGGCCACAATCATATCCTAATTCTGGCCATGGTAAAACAAATGgttatgaaaaatcatcatccTTATTTGATGGAGAATTTCCTGAAGGTGATTtgggaaatccaaatgaattcaatggaGGCCGACACCCTAGTTCACAAAACTTCAGAAGGCACAATAAGAAAAATTACTATGCTAAACCAGGAAGAAAAGACAATGATAAATATAATCATAGCACTATTATTGGGGAAAGAAGTGATGATTCTGAACACCATAAATCTGAGGATATTGGTCCAAATGCCAATTTTGTAGATGCAAAAAATAATGTATCCAATCCAAATTTGTCTAAGAACTATTCACAGAACTACAATAAATATGATGGGCACTATTTCACAAAACCAGTGAGAAACAACCATAATAAGCACAATGGAAAGGAACAAAATGATGATTCCCaagatcaaaaattcaataagaatagtTTTGGTGAATATGGTATGCCAAGTGGTAGAGGTGCACTTTCAGGTATCAATCAAAGAAATAGTATTTCTGATGTGAATTGGGCcagaagaaatgaaagaaattactATGAAAAATCTGATAGAAATACCCATGGAAAATATAATTACAATAATGAGGATAGAAAAActaatggaaaatattatgatgGGGGTTCAAGATCAAGAAACAGCTATGATAAATACAATCATAGAAATGACAACAATTTTGATGATAGTAGTTCCAGATCAAGAGACTTCAATGATTATTCTGAGACTAACAAAATGAAGAGGTCCTATGATAACAGAAGAACTGATATTAATAAATCCTATGACAATAGGTACCAGAAATATGATCAAGAAGGTCAACCAATTGATAGGAAAAAATCAACTGCTAGTAAAGCCAGTAAATCCATCAACAGTTACAACAAAAAGAtggATGCAGCCTCGCAGAGAGAAAGACTGGAAGAAATGCTAAACCACAGAACTCTTGAATGTCTTGTTTgctgtgaaaaaataaaacacacTGATAAAGTCTGGTCATGTATGCTTTGCTACAACATCCTACATCTCTACTGTGTTGCTTCATGGGCCAAATCttctaaaattgatgaaaactgGAGATGCCCAGcctgccaaaatgtttgcagtgAATTCCCCCGTAAATACACATGCTACTGTGGAAAGACAGTTGAGCCTAAATTTGATCCTGGTGCTGTCCCACACGGGTGTGGAGAAATGTGTTTACGCAAGGGTCGTAACTGTGACCACAAATGTACATTATTATGCCATCCAGGTCCATGTCCTGATTGTACAATCATGGTAGCCAAAGAATGTGGCTGTAGAGCTACAAAACCTGTAGTAAAGTGCAACACAGACATTGAAATTGTCTGTGGAGCAACCTGCAATAAATTGCTTGAATGTGGAGTTCATAACTGCAAACTGAGCTGTCATTCTGGAGAATGTGGGCCTTGTGATAAGATGATCATTCAGGAATGTTATTGTGGAAAGGAGGGCAGAAAAGTTCCATGTCGTTTGGAACGTCAAGGTGAGGTCCAGTACGAATGTGGAGATGTCTGTGGGAAAATGTTATCTTGTGGAAATCATAAATGCGAATCATCGTGTCACGAGGGTTCATGTGGCATATGTGAGAGGGACGTTATGGTCGTGCATACCTGTCCTTGTGGAAAGAATGAGCTGAAGGAGAAGAGAACATCTTGTTTGAACCCAATCCCTTGTTGTGGTCAACTATGCAGTAAGCCTCTCAAATGTGGACAACCCAGTGCTCCACACAAATGTGAACAATACTGCCATGAAGGAAACTGTCCTCTTTGTCCTTTGACGACATTAGTAAGATGTCGCTGCGGACATATGGACAAGGAGATACCTTGTCAAAAGTTAACTACCAAAGCTGATGATGCTAGATGTGAGAAAAAATGTACCAAGAAACGTCTGTGCGGCAAGCACAAGTGCAACCAGCACTGCTGCATCGAAATCGAGCACGTGTGCCCGCTGCCGTGCAACCGTCAGCTGTCGTGCGGCAACCACCGCTGCGAACGCACCTGCCACTCGGGCAGATGCCCGCCCTGCTCGGAGACCAGCTTCGACGAGCTGTATTGCGAGTGCGGCGTTAGCGTGCTGTACCCGCCGGTGGCGTGCGGCACCAGGCCGCCCGCCTGCAGCGCGCCCTGCTCGAGAGGGCGGCCGTGCGGCCACGAGACAAACCACACTTGCCACGTGGGGCCTTGTCCGCCCTGCACGGTGCTCTGCAAGCGATGGTGCTACGGTGGCCACGAGCAACGGTCGGCCATCCCCTGCCACCAGGACAACTTCAGCTGCGGCAAGCCGTGCGGCCGCGACATGCCCTGCGGCAGGCACAAGTGCCCGAAGCCGTGCCACGACGGGCCGTGTCCGACGCCCTGCAAGCTGCCCTGCACCGTCCCCAGGAGCCTGTGCGACCATCCGTGCGGCAGGCCGTGCCACGAGCCGCCTTGCCCCGAGACCAGCTGCCGGCACACTGTCCAAGTGTCATGCCCGTGCGGCCTGCAGAAGGCGACCAAACTTTGCATAGAGGTGGAAGGCGAATTCAAGAATTTACAGATGTCTCAGATCAAGGACAAAATGGGAGTCTTATCTAACAGTCACTCAGTGGATATTTCGGATATATTGAGTGCCCCCAAGAAGCCAACTATCCTCAAGATACTCGAGTGCTCAGAAGAATGCAGAGTCCTAGAACGCAACCGTCGCCTAGCCATAGGGCTCCAAATCCGCAACCCGGACCTGAGCCAGAAACTGACCCCTCGGTACTCCGACTTCATGCGGCAGTGGGCCAAGAAGGACCCGCATTTCTGCCAACGCGTCCACGAAAAACTCTCCGACCTCGTACAGCTGGCTAAGGCCAGCAAGCAGAAGAGCAGGGCCTACTCGTTCGAGTCCATGAACAGGGACAAGCGCCACTTCGTGCACGAGTACTGCGAGCACTTCGGCGTCGACAGCGCCGCCTACGACACGGAGCCGAACCGGAACATCGTGGCAACGGCGCTCAAGGACAAGTCGTGGCTGCCGAGCATGAGTCTGCTGGAGTTGCTGCAGAGAGAGAACGGGCAGCGGCGCGTGCCAGGGCCTGTCTCGGTGATGGGCAGGACTGTGGCCGGCAAGCCCGAGACGGTGTCGCTGCGGCTGCCCGGGAAGGCGCCGAGAGACAGCACGCCGGAAGCATTTGACCCGTTCTCCTAG
- Protein Sequence
- MEPEDAKFMRTERKPGSIDVHPNVDAIILNYDLDIQILASKDNVIYEEKKGLKKVIELPMINSRTDCHVLSKEVVGQCDLIHHSRLSEVEQIIYYLKKRKLQTGVKSGSALLIDADKNSTDKNEANYNSLSDYIDLLYEGMTEKIKGAQMIQLLARDPQNLEALAKNEILISALGRVLREDWKRSIALTTHLVFTFFCFSMYSCFHDVILKCKVGSICMDILDYELRRYDKWKADFEGSEAPSDTPIVPVTRKPCPSSASMSEIPRSRIPEPVRPKSGNFSDTNISHIMEGSIYDDLTNSMECIDDKKLTDEQKLKRFRTLVKKQEHLLRVAFYLLLNISEDETVEEKMSKRNIIGLLGKALERENEDLLILVATFLKKLSIMQCNKEAMIGSLNIVEKLPRLLESNSADLVHLTLKLIFNLSFDSSVRARIVQAGMLPKIRSLLSDDKHQEIVLKILYHLSYENENKHQFMDCIGLITDMLVLNVGSESDKSMVALCINLATDPSNAQHIIKKNRLQALMMRAFTYQDAMLMKMLRNISDYPASAPSFIEFVGDIAKAVVESRDEDFIRECIGILSNIYLPDLDWAEIFKHFDMVKWVKNVITSNNADAELVLHVVVLLGTAASDEGCAKLFCETNILGLLIDLLKTHQEDDEIVLQIVYIFYVTLAHGCNIEYIVNKTEAPAYLIDLLQDNNKLIRKLSNTCLNIISDHNKTWAERIRIEKFRNHNAQWLALVDSQQLGLEDDEDEEDELPPYLNTEYLSTAVVPPLSAQLRPDAMSFQNRQDPEQNFPSNPSGPYENSRSSFRNLGYGNNSESSSYSSLEPTAQEFYPSNNENLPSGTIKKKFQNNPRIAGRGGKSRYGTGRTFSGHRGDYNDFSNTYYQEDEYMNNHPRNHTKKSYSRGNTQNLTSSSNCNYDRDEPQSCGNQGNIIPQTQKHRPQSYPNSGHGKTNGYEKSSSLFDGEFPEGDLGNPNEFNGGRHPSSQNFRRHNKKNYYAKPGRKDNDKYNHSTIIGERSDDSEHHKSEDIGPNANFVDAKNNVSNPNLSKNYSQNYNKYDGHYFTKPVRNNHNKHNGKEQNDDSQDQKFNKNSFGEYGMPSGRGALSGINQRNSISDVNWARRNERNYYEKSDRNTHGKYNYNNEDRKTNGKYYDGGSRSRNSYDKYNHRNDNNFDDSSSRSRDFNDYSETNKMKRSYDNRRTDINKSYDNRYQKYDQEGQPIDRKKSTASKASKSINSYNKKMDAASQRERLEEMLNHRTLECLVCCEKIKHTDKVWSCMLCYNILHLYCVASWAKSSKIDENWRCPACQNVCSEFPRKYTCYCGKTVEPKFDPGAVPHGCGEMCLRKGRNCDHKCTLLCHPGPCPDCTIMVAKECGCRATKPVVKCNTDIEIVCGATCNKLLECGVHNCKLSCHSGECGPCDKMIIQECYCGKEGRKVPCRLERQGEVQYECGDVCGKMLSCGNHKCESSCHEGSCGICERDVMVVHTCPCGKNELKEKRTSCLNPIPCCGQLCSKPLKCGQPSAPHKCEQYCHEGNCPLCPLTTLVRCRCGHMDKEIPCQKLTTKADDARCEKKCTKKRLCGKHKCNQHCCIEIEHVCPLPCNRQLSCGNHRCERTCHSGRCPPCSETSFDELYCECGVSVLYPPVACGTRPPACSAPCSRGRPCGHETNHTCHVGPCPPCTVLCKRWCYGGHEQRSAIPCHQDNFSCGKPCGRDMPCGRHKCPKPCHDGPCPTPCKLPCTVPRSLCDHPCGRPCHEPPCPETSCRHTVQVSCPCGLQKATKLCIEVEGEFKNLQMSQIKDKMGVLSNSHSVDISDILSAPKKPTILKILECSEECRVLERNRRLAIGLQIRNPDLSQKLTPRYSDFMRQWAKKDPHFCQRVHEKLSDLVQLAKASKQKSRAYSFESMNRDKRHFVHEYCEHFGVDSAAYDTEPNRNIVATALKDKSWLPSMSLLELLQRENGQRRVPGPVSVMGRTVAGKPETVSLRLPGKAPRDSTPEAFDPFS
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -