Gspi031238.1
Basic Information
- Insect
- Geotrupes spiniger
- Gene Symbol
- -
- Assembly
- GCA_959613385.1
- Location
- OY390740.1:1452099-1460198[-]
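The Location value packs a sequence accession, a coordinate range, and a strand into one string. A minimal sketch of how it could be split into fields is shown here; the names `Location`, `parse_location`, and `span_length` are hypothetical helpers, and the span calculation assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str   # sequence accession, e.g. "OY390740.1"
    start: int   # lower coordinate
    end: int     # upper coordinate
    strand: str  # "+" or "-"


# Matches strings shaped like "ACCESSION:START-END[STRAND]".
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string from the record into its four fields."""
    m = _LOCATION_RE.match(text.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(m["seqid"], int(m["start"]), int(m["end"]), m["strand"])


def span_length(loc: Location) -> int:
    """Genomic span in bp, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("OY390740.1:1452099-1460198[-]")
print(loc.seqid, loc.strand, span_length(loc))  # OY390740.1 - 8100
```

The span covers the whole locus on the genome, so it need not equal the length of the coding sequence.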
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The zinc finger follows the pattern #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
#   of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1   42  1e-05     0.00071   20.9   0.9   1         23      96        118     96        118     0.99
2   42  0.00086   0.061     14.8   3.5   1         23      124       146     124       146     0.99
3   42  6.5e-05   0.0046    18.4   3.2   1         23      152       174     152       174     0.98
4   42  6.8e-05   0.0048    18.3   1.4   1         23      180       202     180       202     0.95
5   42  0.00067   0.047     15.2   0.2   1         23      208       230     208       230     0.98
6   42  5.5e-06   0.00039   21.7   1.8   1         23      236       258     236       258     0.99
7   42  0.021     1.5       10.5   2.6   1         23      270       291     270       291     0.96
8   42  6.2e-05   0.0044    18.4   1.5   1         23      296       318     296       318     0.98
9   42  8.6e-05   0.0061    18.0   2.0   1         23      324       346     324       346     0.99
10  42  0.0018    0.13      13.8   0.1   1         23      352       374     352       374     0.98
11  42  8.4e-05   0.0059    18.0   2.0   1         23      380       402     380       402     0.98
12  42  0.011     0.8       11.3   3.6   1         23      408       430     408       430     0.98
13  42  3.1e-05   0.0022    19.4   2.8   1         23      434       456     434       456     0.99
14  42  3.9e-05   0.0028    19.1   1.0   1         23      462       484     462       484     0.97
15  42  0.0047    0.33      12.5   2.2   1         23      490       512     490       512     0.97
16  42  0.35      25        6.6    0.7   5         23      662       680     661       680     0.94
17  42  0.0019    0.13      13.8   0.9   1         23      697       719     697       719     0.99
18  42  0.00021   0.015     16.8   0.9   1         23      723       745     723       745     0.99
19  42  0.00013   0.0092    17.4   1.9   1         23      751       773     751       773     0.98
20  42  2.9e-05   0.002     19.5   5.1   1         23      779       801     779       801     0.98
21  42  7.5e-06   0.00053   21.3   2.4   1         23      807       829     807       829     0.99
22  42  0.005     0.36      12.4   1.4   1         23      835       857     835       857     0.97
23  42  3.4e-05   0.0024    19.3   0.6   1         23      863       885     863       885     0.98
24  42  3.7e-05   0.0026    19.1   3.1   1         23      891       913     891       913     0.99
25  42  0.0069    0.49      12.0   3.1   1         23      919       941     919       941     0.98
26  42  0.00058   0.041     15.4   2.8   1         23      947       969     947       969     0.97
27  42  0.68      48        5.7    0.5   6         23      978       995     978       995     0.99
28  42  1.5e-06   0.0001    23.6   1.9   1         23      1001      1023    1001      1023    0.99
29  42  4.6e-05   0.0032    18.8   1.5   1         23      1029      1051    1029      1051    0.98
30  42  2e-06     0.00014   23.1   0.8   1         23      1120      1142    1120      1142    0.99
31  42  9.2e-06   0.00065   21.0   0.6   1         23      1146      1168    1146      1168    0.99
32  42  0.0024    0.17      13.4   8.8   1         23      1174      1196    1174      1196    0.97
33  42  1.4e-06   9.9e-05   23.6   3.1   1         23      1202      1224    1202      1224    0.99
34  42  0.0023    0.16      13.5   2.3   1         23      1230      1252    1230      1252    0.99
35  42  0.00055   0.039     15.5   4.2   1         23      1258      1280    1258      1280    0.98
36  42  2.2e-05   0.0016    19.8   2.9   1         23      1286      1308    1286      1308    0.99
37  42  0.0019    0.14      13.7   3.9   1         23      1314      1336    1314      1336    0.98
38  42  0.00058   0.041     15.4   5.8   1         23      1342      1364    1342      1364    0.97
39  42  4.2e-05   0.003     19.0   2.9   1         23      1370      1392    1370      1392    0.99
40  42  2         1.4e+02   4.2    0.4   6         23      1401      1418    1401      1418    0.98
41  42  9.7e-05   0.0068    17.8   4.6   1         23      1424      1446    1424      1446    0.99
42  42  0.0001    0.0074    17.7   2.5   1         23      1452      1474    1452      1474    0.99
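The residue pattern given in the Description can be approximated with a regular expression. The sketch below is only an illustration and not the method used to produce the hmmscan hits above: it treats the fold-stabilising `#` positions as any residue (an assumption, since the Description does not list which residues are allowed there), and `find_c2h2` is a hypothetical helper name. A regular expression is also much less sensitive than a profile HMM, so its hits should not be expected to coincide with the table.

```python
import re

# #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C]
# "#" and "X" are both written as "." here (any residue; an assumption).
# Between the second Cys and the first His: X3 + # + X5 + # + X2 = 12 residues.
C2H2_RE = re.compile(r"..C.{1,5}C.{12}H.{3,6}[HC]")


def find_c2h2(protein: str) -> list[tuple[int, int, str]]:
    """Return (start, end, matched_text) with 1-based inclusive coordinates.

    re.finditer reports non-overlapping matches, scanning left to right.
    """
    seq = protein.upper()
    return [(m.start() + 1, m.end(), m.group()) for m in C2H2_RE.finditer(seq)]


# A short stretch taken from the protein sequence of this record.
fragment = "YKCDICSKLFTKPDNLLRHTKVHT"
for start, end, text in find_c2h2(fragment):
    print(start, end, text)  # 1 23 YKCDICSKLFTKPDNLLRHTKVH
```

Coordinates are reported relative to whatever string is passed in, so running the function on the full protein sequence gives positions that can be compared with the ali columns of the table.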
Sequence Information
- Coding Sequence
- ATGGAGACGAACATATGCGATTCTAATATCGCGATTGTTGAAGTGAAAACTGAGCCGACGACTTTAATTCCGAAAGTGGACTTTTGTTGTGCGCAGTTTCCCGTTCAAATTGATCACAAAAACGCACTAAATGCTGTTACAATAAAGGAAGAGGAATTCGAGGAAGTCGtgaataatatttctaataaagtAAATACTTGCAGtgatcttaaaattaaaatagatcaCAGCGATGACAACGAGATATGTAACaatactgaaaataatattttacaacaagaaaattacaaatgcgacaTTTGCAGTAAATTATTCACAAAGCCGGACAACCTTTTAAGACACACGAAAGTCCACACCCACGCGAGGAACttcaaatgcgaaatttgcacgaaacGCTTCTTCACCAAACAAGACGCGTCGAAACACACGAGAACGCACATCGCTGAAAAAaaccacaaatgtgaaatttgcggcaAACGCTTcaaactgcaaaaatatttggCGGAACACACGAAAACGCACGCCGATCGAAGGAACCACAAGTGCGACGTTTGCGCGAAATCGTTCACGACGAAACAAAACCTGACGACGCACGTCATAGCGCACACGGCCGGGCGAAactacaaatgcaaaatttgcgcGAAATCTTTGGCGACGGAAGACAGCCTGTCGGCCCACTTGAAATTGCACGCCGgcgaaaaaaactacaaatgtgaaatttgcaacaggAGCTTCGTAAGGAGACGGACTTTGTTGGACCACCTGAGGAAACACGCCCGATGGATGCTGCAGATCGACGAAGGATACTTCAATTgtgaaatatgcaataaattcaCGAAGAAGGAAAACATGTCGCAGCACATGAAAACGCACGGGGAGAAAAACcacaaatgcgagatttgcgCCAAGCGCTTCGTAACGAAGGCGAACATGCTGGAACATCTCAGAACGCACACCGGCGAACGGAAGTATCGGTGCGACGTTTGCGACAAACGCTTCAAACGGAAGCAGACGATGTTGGAACACGCGAAGACGCACCGCGGCGAAAGGAACTATTCCTGTCGAATTTGCGGCAAAGTTTACGCGAAGGAGCAGAGCATGCTGGAGCACGTGAGAATACACGCCGGCGAAGGAAgctacaaatgcgaaatttgcggcaAAAATTTCGCGACCGAGAAAAGTATGTGCAAACACGTCAAGGCGCACACCGCCGAGAGAAACTACAACTGTCACGTTTGCAATAAATCCTTGAAGACGAAACAAAGCGTGCTGTATCACGTGAAGACGCACACGAGAAACTACAAGTGTGAAGTTTGCTCGGCGACGTTCACGACAAAAGCGCATCTGACGACTCACGTGAAAAAGCACGTCGGCGAGAAGAACTACGAATGCGACGTTTGCGGCAAACGATTCACGACGAAACAATACATGTCGAGGCACGCGAAGGCGCACACCGCCGAAAGAAACCACGAATGCGAAATCTGCAACAAACTTTTTACAAGGAAGCCCGTTATGTTGAAGCACTTGAAGAGGCACGCAAATTACAAAGTTGAAATTTGCAAGGAGTACTGTGGAATGAGtacattaaaacataaaaataaaaaacgtcaCGGTGAAGTAAACGCAGACGAGGACAATACTGTGAAGATTGAAAAACAATTAGATAAGGAAGAGACTGAATCGTTTGTAAACGTAAAAGTGGACATTGACGGTGACAAAATGGAGACCGATTCTAACATCCCAATTACTGACATAAAACCTGAACCAACAACTGTAAATCCTAAACTTTGCTACGTTGAAAATATCACGATAAAGGAAGAGGTTGATTTCTTCGTACCACCATCAGAAATAAAACTTGACACTGTCGGCGAGACGACAAATGAAATCGTGCACGAcgttaaaatgaaattggaaCCCAGTGATTTTTCAACCGGTGTTTTAAAACACACGACCACACCGAGCAAAGTAAGACATAAACGCAAAGTTTGCTCCGGACGT
TTTAaatcgagaaaaaatataacgGAACATATGAAATCGCACGCAAACGGCgccaaaagaaaaaattacgttcAGACCAACGGGGGAAATTACAAGTGCGACGTTTGTTCGAAACTGTTCACGAGAACGGCGATTTTATCGAGACACATGAAAATACACCGGAGAAACTACAAGTGCGGCGTTTGCGCTAAATTTTTCACGACGCGAGGATACCTGACGATACACGCGAAGAGACACGCCGGTCAAACGAACTATAAATGCGAGATTTGCACCAAGCGCTTTACGACTAAGCAACGCTTGTCGGAACACGTGAAGATCCACTCCGGCGATCGAAACCACAAATGCGACGTCTGCAACAAATCTTTTACGATGAAACGATATCTGTCGAACCACATGAAAACCCACTCAGGCAAGAAAAACTACAAATGCGAGCTTTGCGGCAAGCCCTTCGCGAACAAACACAACATGTCGAGGCACGAAAAAACACACTCCGACGAAAAAAAccacaaatgcgacgtttgcgcGAAACGCTTCCGCACGGAAGTTTTTTTGTCGAAACACGCGATGATCCATTCGGGCGCGGGGAAATACGAATGCGACGTTTGCGCGAAAAGCTTCACGACGAAAGGATATCTGGCGAATCATTCGAAGACGCACGCCACCGACAAGAAGTTCAAATGCGAGGTTTGCGACAAGCGTTTCACGACGAAGCAGTACATGTTGAGGCACGAGAAGACGCACGCCAGCGAgcgaaactacaaatgtgaaatttgcgccAAGCGTTTTACGACGAAACACGTCATGTTGCAGCACGTTAGGTTCCACTCGGGCGATCGAAACTACAAGTGCGACGTTTGCACGAAGAGCTTTTCGTCGAGGCAGCACATGTTGACGCACGCGAAAGCGCACGACGGCGAAGTTAACTACAAGTGTTGCCGCCGACGGTTCGAAACGAGGGAAGAGATGTCGGAGCACGTGAGGACGCACGCCGCGggaagaaactacaaatgtgaaatttgcacgaaGGGGTTCAAAACGAGGGAGTATTTGACGAGACACGCGAGAACGCACACCGCCGAGAGGAAGtttaaatgtgaagtttgcggGAGACGCTTCGCGACGAAGCAGTACATGTTGAGACACACAAAAGTACACACTGGGGGTGATGTGTTCGTAAATGAAGAAAAGGTGGAGGTTGGCAATGaTATTTCGGAGAACGGCGTAAATGATTGGAACGTCCCGATTACGGACGTGAAGCCCGAACCGACGACTTTAATCCCAAACTTGCAGTTGTGCTGCGCGGGAGATGTTACGATGAAGGAAGAAGAACGAGCAGTTTCTTTCACCGCCGaaggaaattacaaatgcgacaTTTGCTCGAAAGCGTTTGCGAGAACGTCGCTTCTTTTGCGACACATAAAACGACACGAGCGAaactacaagtgcgaaatttgcgccaAATTCTTCGCGACGAAAGCCTATCTGATCATCCACATGAAACGACACACCGGCGAGACGAACCACGAGTGCAAGATGTGCACGAAGTGCTTCACGACGAAACAACATTTACTCGAGCACACGAAGATCCACTCGGACGACCGAAACTACAAATGCGACGTCTGCGACAAACGCTTCACAACGAAGCGATATCTGGTGACGCACATGAAGAGACACTCGGGCGAAAGAAACTTCAAATGCGAGCTCTGCGCGAAATATTTCGCGAACAAACACGACGTGCTGCGACACGTCAAGACGCACACCGACGAGAAAACGTTCAAATGCGAATCTTGCGACAAGTGCTTTCGAACGGAGACGTTTCTGTTGAAACACGTGAAGATCCACGCCGGCGAGGGCaattacaaatgcgacgtttgcacGAAGCGCTTTACGACGAAGGGGTATCTGATGAACCACTCGAAGACGCACGCCACCGAGAAGAAccacaaatgcgacgtttgcgcGAAACGCTTCACGACCAAACAGTACATGCTGAGGCACGC
GAAGACGCACGCCGGCGAACGGAACTATAAATGCGAGATTTGCGGCAAGTGCTTCTCGACGAAGCACGTCATGTTGAAGCACGTCAAGTTCCACTCGGGCGACAGGAactacaaatgcgacgtttgcacGAAGAGCTTCACGACGAAGCAGTACATGTTGAGGCACGCGAAGACGCACGACGGCGAAATCGACTACAAGTGTTGCAACCAACGGTTCACGACGAGGCAGCTCATGTTGGAACACGTGGAGACGCACGCGgccgaaagaaattacaaatgtgaagtttgcacgAAGCGCTATCGTACGAAACGACATCTGGCGATCCACACGAAGACGCACAGCAGCGAAAGGAACTACAGATGCGAGATGTGTGGGAAATGTTTGGCGACGAAGCGATCCATGTTGAAACACATGGAAACGCACGCCGAGGCTGATGCGTTCATAAGTGGAACGCGACGGACGTCACTTTACACGACCGGCGATGAACGTCAAAAGAACGATCCGTTCGAACGGAAGAATTTCCGGCGTTTTTCGGGTGACGTAAGTGGTTGTCGTTTGGCGATTCGCGACTCACGTCGAACTTGCAGGGCGCCGCGATCGACGTGGAACGCACGCAACGATTCCACGAACGTGTCGGATTTGCTGGCGCCGGAGCTGAGAGAGTACCGGTCCGCGGAAATCCCGGGAAAAGTGACTTCGTTTTCCGACGCCGGTGAAATGGTCGCGTCGGACGCGCACTGCGAGAGTTGTTATTGTAGGGGGCCGATCGTGGCGCCGCGGTTGCTCCAAACGCAACGTCTGTCGCCGCTGAAACACCGACATGTACGTCGATGTGTCGCCCCGATAGATGGCGACGGTGGTTGTAAGTTATCGCGAAACGAGTCGCCGTTGAAAAGTGAACTGTCAAAGTCGGACTTGGAATACACGCCCGACAGATTGAATTACGAACATTTCAAGTCGTCGTTGTTGGACGGCGACGGCGAAACCGCAACTGGAACCCACGGCGACGTCATCGGTTACTCGCCGCCGGACGAATTTAAGGTTGACGGCGACGTCGCGCTAAAAGACGTTTCGGATTTTCGCGCAAGTAACTACTTCGATTGTCACGCGTACGAAACTAAGCCCTGTTATCACGGGGGCCACACATGCGTGCACAAGTTCAGGATCAACGAACGGCTGATGCCCGAGCCGGTCAACTGCGACATGTACGGGGTGAGCCggtgcgaaatttgcaacaagCCGATGGAAAAATTCAAACGCGACGCGGCGTCGCCGAAAAGTAAAAACGTGCGCAAGACTCGCGTGCAGAGGGTGCACGAGTGCAGTTTGGCGCCGCCCACGTTGAACGTGGGCACCGGGAAGGAGCGCATCGTGGTCGAAGTCCCCGAAAGTGTGCGGCGGGAATGTCTTGGGAAGATCGGGAAGGAGGTCGACGCGAAAATGAGGAAGAACGTGCCTTGCAACAGTCTGGCGTTGCGGTTCCAAAAAGGCGTCGTTTGA
- Protein Sequence
- METNICDSNIAIVEVKTEPTTLIPKVDFCCAQFPVQIDHKNALNAVTIKEEEFEEVVNNISNKVNTCSDLKIKIDHSDDNEICNNTENNILQQENYKCDICSKLFTKPDNLLRHTKVHTHARNFKCEICTKRFFTKQDASKHTRTHIAEKNHKCEICGKRFKLQKYLAEHTKTHADRRNHKCDVCAKSFTTKQNLTTHVIAHTAGRNYKCKICAKSLATEDSLSAHLKLHAGEKNYKCEICNRSFVRRRTLLDHLRKHARWMLQIDEGYFNCEICNKFTKKENMSQHMKTHGEKNHKCEICAKRFVTKANMLEHLRTHTGERKYRCDVCDKRFKRKQTMLEHAKTHRGERNYSCRICGKVYAKEQSMLEHVRIHAGEGSYKCEICGKNFATEKSMCKHVKAHTAERNYNCHVCNKSLKTKQSVLYHVKTHTRNYKCEVCSATFTTKAHLTTHVKKHVGEKNYECDVCGKRFTTKQYMSRHAKAHTAERNHECEICNKLFTRKPVMLKHLKRHANYKVEICKEYCGMSTLKHKNKKRHGEVNADEDNTVKIEKQLDKEETESFVNVKVDIDGDKMETDSNIPITDIKPEPTTVNPKLCYVENITIKEEVDFFVPPSEIKLDTVGETTNEIVHDVKMKLEPSDFSTGVLKHTTTPSKVRHKRKVCSGRFKSRKNITEHMKSHANGAKRKNYVQTNGGNYKCDVCSKLFTRTAILSRHMKIHRRNYKCGVCAKFFTTRGYLTIHAKRHAGQTNYKCEICTKRFTTKQRLSEHVKIHSGDRNHKCDVCNKSFTMKRYLSNHMKTHSGKKNYKCELCGKPFANKHNMSRHEKTHSDEKNHKCDVCAKRFRTEVFLSKHAMIHSGAGKYECDVCAKSFTTKGYLANHSKTHATDKKFKCEVCDKRFTTKQYMLRHEKTHASERNYKCEICAKRFTTKHVMLQHVRFHSGDRNYKCDVCTKSFSSRQHMLTHAKAHDGEVNYKCCRRRFETREEMSEHVRTHAAGRNYKCEICTKGFKTREYLTRHARTHTAERKFKCEVCGRRFATKQYMLRHTKVHTGGDVFVNEEKVEVGNDISENGVNDWNVPITDVKPEPTTLIPNLQLCCAGDVTMKEEERAVSFTAEGNYKCDICSKAFARTSLLLRHIKRHERNYKCEICAKFFATKAYLIIHMKRHTGETNHECKMCTKCFTTKQHLLEHTKIHSDDRNYKCDVCDKRFTTKRYLVTHMKRHSGERNFKCELCAKYFANKHDVLRHVKTHTDEKTFKCESCDKCFRTETFLLKHVKIHAGEGNYKCDVCTKRFTTKGYLMNHSKTHATEKNHKCDVCAKRFTTKQYMLRHAKTHAGERNYKCEICGKCFSTKHVMLKHVKFHSGDRNYKCDVCTKSFTTKQYMLRHAKTHDGEIDYKCCNQRFTTRQLMLEHVETHAAERNYKCEVCTKRYRTKRHLAIHTKTHSSERNYRCEMCGKCLATKRSMLKHMETHAEADAFISGTRRTSLYTTGDERQKNDPFERKNFRRFSGDVSGCRLAIRDSRRTCRAPRSTWNARNDSTNVSDLLAPELREYRSAEIPGKVTSFSDAGEMVASDAHCESCYCRGPIVAPRLLQTQRLSPLKHRHVRRCVAPIDGDGGCKLSRNESPLKSELSKSDLEYTPDRLNYEHFKSSLLDGDGETATGTHGDVIGYSPPDEFKVDGDVALKDVSDFRASNYFDCHAYETKPCYHGGHTCVHKFRINERLMPEPVNCDMYGVSRCEICNKPMEKFKRDAASPKSKNVRKTRVQRVHECSLAPPTLNVGTGKERIVVEVPESVRRECLGKIGKEVDAKMRKNVPCNSLALRFQKGVV
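The two sequences above can be checked against each other by translating the coding sequence. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly; `translate` is a hypothetical helper. Lowercase bases and line breaks in the coding sequence are normalised before translation.

```python
from itertools import product

# Standard genetic code (assumed), codons ordered with bases T, C, A, G.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is removed and lowercase bases are upper-cased.
    Codons containing unexpected characters are translated as "X".
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last four codons of the coding sequence above.
print(translate("ATGGAGACGAACATATGCGAT"))  # METNICD
print(translate("GGCGTCGTTTGA"))           # GVV
```

Passing the full coding sequence to `translate` and comparing the result with the listed protein sequence is a quick consistency check for the record.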
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -