Gqui032646.1
Basic Information
- Insect
- Gonioctena quinquepunctata
- Gene Symbol
- -
- Assembly
- GCA_018342105.1
- Location
- JAFIRS010000143.1:1104313-1134695[+]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 45 6.6e-06 0.00077 20.8 0.3 1 23 254 276 254 276 0.97 2 45 0.49 57 5.5 4.7 1 23 364 387 364 387 0.92 3 45 0.0075 0.87 11.2 5.3 1 23 397 419 397 419 0.97 4 45 0.002 0.23 13.0 0.4 3 23 426 446 424 446 0.97 5 45 0.0015 0.17 13.4 0.8 1 23 452 474 452 474 0.97 6 45 8.7e-07 0.0001 23.6 0.4 1 23 480 502 480 502 0.98 7 45 0.008 0.92 11.1 0.1 1 21 508 528 508 529 0.95 8 45 2.6 3e+02 3.2 0.1 3 13 615 625 614 634 0.86 9 45 0.0054 0.62 11.7 0.4 3 23 639 659 638 659 0.95 10 45 0.0017 0.2 13.2 0.2 1 23 664 686 664 686 0.95 11 45 0.0024 0.28 12.8 1.3 1 23 690 712 690 712 0.98 12 45 0.096 11 7.7 0.2 1 23 718 740 718 740 0.93 13 45 0.0013 0.14 13.7 0.8 1 23 746 769 746 769 0.91 14 45 3.2e-05 0.0037 18.7 2.1 1 23 780 803 780 804 0.96 15 45 6.8e-05 0.0078 17.7 0.8 2 23 812 833 811 833 0.97 16 45 0.0032 0.37 12.4 2.2 1 23 839 861 839 861 0.98 17 45 0.022 2.5 9.8 1.5 1 23 885 907 885 907 0.98 18 45 0.0012 0.13 13.8 0.3 2 23 974 996 973 996 0.96 19 45 0.016 1.9 10.2 0.3 1 23 1021 1043 1021 1043 0.98 20 45 0.017 2 10.1 0.8 1 23 1047 1069 1047 1069 0.98 21 45 0.0058 0.67 11.6 0.7 1 23 1075 1098 1075 1098 0.97 22 45 0.0021 0.25 12.9 4.5 1 23 1104 1126 1104 1126 0.96 23 45 0.092 11 7.8 0.2 1 23 1132 1155 1132 1155 0.90 24 45 0.00022 0.025 16.1 1.2 2 23 1167 1189 1166 1189 0.93 25 45 0.00018 0.02 16.3 2.0 3 23 1203 1223 1201 1223 0.96 26 45 0.00014 0.017 16.6 0.8 1 23 1229 1251 1229 1251 0.98 27 45 0.071 8.2 8.1 1.1 2 23 1341 1363 1340 1363 0.96 28 45 0.17 19 7.0 0.2 1 23 1382 1405 1382 1405 0.97 29 45 0.0037 0.42 12.2 2.2 1 23 1409 1431 1409 1431 0.99 30 45 0.0019 0.22 13.1 2.1 1 23 1437 1460 1437 1460 0.98 31 45 0.011 1.2 10.8 0.4 1 23 1466 1488 1466 1488 0.88 32 45 0.036 4.2 9.1 0.8 1 23 1494 1517 1494 1517 0.95 33 45 0.0071 0.82 11.3 0.7 2 23 1529 1551 1528 1551 0.96 34 45 2.1e-05 0.0024 19.3 1.0 3 23 1565 1585 1563 1585 0.96 35 45 0.055 6.3 8.5 0.3 2 23 1668 1690 1667 1690 0.96 36 45 0.75 86 4.9 0.1 1 23 1709 1731 1709 1731 0.96 37 45 0.0023 0.26 12.8 2.4 1 23 1735 1757 1735 1757 0.96 38 45 0.089 10 7.8 1.2 1 23 1763 1786 1763 1786 0.96 39 45 0.0074 0.86 11.2 0.2 1 23 1792 1814 1792 1814 0.94 40 45 0.088 10 7.9 1.6 1 23 1820 1843 1820 1843 0.91 41 45 0.0019 0.22 13.1 0.5 2 23 1855 1877 1854 1877 0.93 42 45 7e-05 0.008 17.6 1.6 3 23 1890 1910 1888 1910 0.96 43 45 4.1e-05 0.0047 18.4 1.5 1 23 1916 1938 1916 1938 0.98 44 45 4.3e-06 0.0005 21.4 1.2 1 23 1944 1966 1944 1966 0.99 45 45 0.16 19 7.0 0.7 1 19 1972 1990 1972 1992 0.97
Sequence Information
- Coding Sequence
- ATGGCGGAAATCGAGGAGAAAGAGTTCCCTATGGATTTTTTCACACCAATCGTTACAATTAATGAAGCACCTCTCAAGCCTCCAACTCCAAAGGCATCACCAAAAATAAAGaagatcaaaataaaaattgaaggCACCGAAGAAGGGGCGATTTCTGGGGAGATCGTGAAGCCGAAAAGAGGTAGACCAAAGAAAATTAAAATAGAAGTCGAAACTGAAGACACCGAGGAAGCTGAAGAATCAGAAGTCATGAAACCAAAAAGAGGCAGGCCAAAAAAAGTCAGCACAGATTCTATTGAAAGCAAAGACAAAGAAGACACAAAATCGGATAAATATATTGATGAAGACGATGATCCATTATGGTCTGCATCAGCTTCtcataagaagaaaaaaaagaaaaaagataaagttaaaaacaaaaaatctccaAAGAAATCAAAAACGGGAGATAAAAAGCAAGAAGTTGTTGTAAAATTGGAACCACCAGATGAAAATGCAGTTGATGCTATCAAAGATGAAATCAAAGAAGAAATCAAGGAAGAACCTGTAGACCCAGTCGAATTGCCCGTAATAGTACCATTGACGGAAGAAGAATTCAAAGCAATGTTGCCTCAGGAGGACCACAGGAAGTTCAAATGCCCTCTCTGTCATAAAGAAAAACAAGATGAAACTGAAAGAACTCCCAAGGAACTCAAACTACACTACAAAGATGACCATCCGGGAAAGAGGCTGAGGCAGTCTCGCTTCTCCAATGAGGTGCACCCCTGCGATATATGTGGCAAAGAGTTTCGGACTAACAGTGCTGTCAAAGATCACATAGAAACCCACAACAACTACTTCTATTGTGAGATTTGTAACgcttcccagaaaaaaatcctgGATCATATAATACATCTGAGGATACACTCAGAACCAGGCCTGTTCCATTGTTTGATGTGTGATCTCAACACCCCTGATATCAACAAAATCACGGAGCATGTGAACAACCACGAGGATTTATTGAAGTATTGGTGTCAGCCGTGCAAAAAAGGCTTCCAGATCCTGCCGTGGTTTCAAGAACACGACAACTACCACACCGGGTTGAAACCATTCGACTGCGAATTTTGTGGAAAGTGTTTCTTGTACTCTCGCTACCTCCACGCCCATAAAATCAACATGCACAAGGAAGATATGATCTTCCCCAGCTTGCACGAATGTGTCATCTGTAAGAAGCAGTACCAACACAAAAACAGCTTGAAGTTGCACATGAACTCCCACACGGGCAACTTTTCCATCTGTGATATATGCGGTAAGATGCTTTCAAGCAAGGAGAAGCTCAAGTTCCACATACGGATCCATACGGGGTACAAGCCCTATAGCTGTTCGTATTGTGAGAAGTCCTTCACCAAGAAACCTATTTTGGTGGAGCATACCAGGATCCATACCGGGGAACGACCTTATATTTGCGAGTACTGCACCAAGGCGTTTTCGCAAAGATCTAGTCTTGTGATACATATGAGGGGGCATACGGGGGAGAGGCCCTATGTGTGTCAGTTCTGTACTAAAGCGTTTGTTGCAAAAGCCATGCTGAATATTCATCTTAAGTCTTACGATGATTCCAATGAATATTCAGTCGTCGACTCCAACTTCACTGACTTCAAAAATGTCTTTGGTGATTTGAAAACGGAACCTAAAGATGAAATGGATACCTTTGAACCGGAAAATATTGACCCCTCTTCATTAGAATATTTAATTATAAAAGATGAAGTTGAGGCAGTACTTTCCTCTGATGCCATGTCCAAACCTGATATTAGTAAAACTAGATTGAAGAGGAAGCGGAAAGCTGTTCCCCGTGACAGCTTGTGTCCTATCTGTGGTAAAATTGTCAAAGACAAAAATCTGCCAAAGCATTTTGCCAGTCACGGTCATAAAAACTGTGACATTTGCACTGCTTCCTTCAAAACAATGGAGGAATTGGAAAAGCATAAAGAACTCCACCTGGAAAATGAATATCCTTGTGGAGAATGCAACCTAAGTTTCAAGCGAGCAATCGATCTTGCTACACATTCATTCAAACACGCTGGAGAATATCCCTGTCCTAAGTGTACTTTCAGTACAAAGAGCAAAAGCTCAATCAAGGGACATATTAAAAGGCACGAGAGGAGCTACACTCATTATTGCACTGTATGTGGTAAAGGTTTTTTGGGAAAGGCGTTGCTAGCAACTCACGAAGAAATCCATCTAGATATAAAGCGTTATGCTTGTGACGTTTGTGGGAAGAAATTCTCCGTCAAAAGGTACCTCGATGTCCATCGTTCTTTCAACCACAAGAAAGAACTGTATGGCATCGAACAACTCTACAAGTGTGAGGTTTGTGGGAGGGATTTCACTTTCGAGAAAAGCCTGAAGCGCCACCAAAGTGTCATACATCACATGGGTGAGGACCTAACTGTCGAATGCAAAGTATGCCACAAAATAATTGCAAACAATTACAACCTCAAAATGCACATGCGCATTCACACCGGCGAGAAAAAGTACTGTTGCGACTTGTGTGGAAAAGCTTTTGCAGCGTATAAGTATTGGAATAGACACAAGCAGACGCACGAGAAACAGGAATCTGAATTGAAGGATTTTGAAGAAGGGGGGAATGAAAGTGAGGTTGCCGATACAGTTTATTGCTGTGAATTGTGTGGGCAAGAATTTCCTTCTTTGGAATGTTGGGAAGAGCATGAAAAGACACATACGTTTttggaagaagaagaagaggaGACTGGCATCAAAATAGAAGATGAAAACTGTGGAGAGgataaaaagaaaattaaaaaaattaagattaagCAAAAAAAACTCAAACCGAGAACATCTCCCAAAGAAAAGAAATCTAAACAAATTTCGAAGTATATTCCTAAATTGAAACCCAATCACAAAGCAAGAGCCTGGGTGTGTAAAATATGCTTTGAAGAGTTCAGTACCAGAAAAGCACTATTTGAACATAGAAAAATTATACATGAAGACATCGAAGATTCATTCAAAGAAGAGTTTGAAGGTGAAAAATATACCTTTGATGAAGTTCTCGAATTTTTTACGTGCAGCAATTGCTCTGCAGAGTTCCAGACAAGAGAAGAAGTAGAAAAACATGTGGAAACGCACGAAGAAAAGTATGATTGTGAACTCTGCCACAAATCCGTAAACGGGGCCTTACAATTTTCTGCTCACATGCAGAAACATCGGGAAGACAAAAACTTCCCTTGTCCTATGTGCACACATATGACTGCCAGAAAATCCGCCATGTTGACCCACATCCAGAGGATGCATTATAGGAAGTACGATTTTCAGTGCCGTACCTGTGGCAAATGCTTCAATGACGCTACTACTTTCAAAGAACACGAAAACTTCCACTTAGGTGTCAAACCTTTCATTTGTATTGTGTGTAATAGGGAGTTCATTTATTCGCGATATTTGATAGCCCATCAAGTAAGGAATCACAGGGTCAGAGTTCTCGATAAGGAGTCGAAGACTCAGTGCCACATGTGCCCGAAGATGTTTGCTAGGAACGAAACATTGGTCAAGCACGTTGTCAGTAAACATTTGACGTTCCATGAGGGGCCGCACGAGAAAAAGCATTTGTGTGATGTATGTGGGCAAGGATTTTCTAGAACTGATAAATTGAAAATCCACTATAGAAAGCACACTGGAGAAAAACCGTATTCTTGCGTGTACTGCAGTAAAAGCTTCATCAAACGAGATTATTTGATCATGCACGAAAGGATACACAATGGGGAGAAACCgttttttcccGATTTTGAGTTGGAAATGAAAGTGGGCGAAGTGGAATTCAGTCTAAACGACGCTTACGAGAAggagaaaaaaatcaagacaccCGAGAAGCCATCATCTCCTTTGAAGAAGAAAATCAAAGAAGAAGTTGATGATGAATACACCCCACCGGggcagaaaaagaaaaaatctcttACAGCAAAACCCAAGAAACCTAGAAAACCGAGGCCGAGTAAGGTGAAGAAGCAAGAACACAAAATATGGACATGCAGGCGATGTTTGGAAGAATTTGATTCACGTCGAGATCTTACCGATCATACCAAAACCTATCATCCTGAGGAACCTAAAGATGCACCCGCCTACAAGTTCGATGAAGAGCAAGAACTCTACACTTGTGGAACTTGTTCAGCAGAGTACCAAACAAAAGAAGAAGTCGAAACTCACATTTCGAAGGTACATGAGGAATTTTACACCTGTGAGGTCTGTAAGCACACCTCGAAAAAAGCTTACAGCTTTGCTGTACACATGAAGATCCACAGCTCCGATGGTACTTACACCTGTCCTTTATGCAATTACAACACGCCCAGAAGAACCTGCTTGCAGACCCATATCAACAGAGTCCACTACCACAAGTTTTACTACACCTGCCCGACTTGTGGTAAAGGCTTCAACGACTCTGTCATATTCAAAGAACACAACAACGAGCACTTGGGGATCAAACcttttatttgtgttgtttgtaACAAGGACTTCGTCTACTCCCGCTACATGCTGATTCACCAAACGAGATACCACACAGTTCACATAGAAGGGACCCTCCACAAGACTCAGTGCAGCATCTGCATGAAGGTCTTCAGCAAAGTGGCGACTTTATTGAAACATATCACGACCAAGCACACTTCCGGAAGTATGGACAAACCCGAAAAACGCCATCTATGCGACATGTGCGGCAAAGGGTTCGCCACATCAGACAAACTCAAGATTCACTATAGAATACACACGGGCGATAAGCCTTTTGCTTGCCGGAAAAAAGCAGCTGAAAAAGACAACCTCAGGAGCACCAGATCGCCGAGAAAAAGGGCATTTCCCATCAAAATCTACTATACAGAATCCGAAGACTTCGATGACGATTACGAATATGTACCAGTTACCAaaagaagtaaacaaaaaacttcAAAAGAAGTAGTAGAAATAGTGGTTAAAAACAGTGGTAGCCGAAGAAGGTATGACTCAAAGAAACAATGGCCCTGTAAGAAGTGCAACGAAGTTATGCCAACtaaaagaaaattaataatcCACCGAAAAATGGAGCATTCGAAAGTAGACGTGAACGAACACACTTACAAATTCGACGAGATCCAAGAACTCTTTGTATGCAATACTTGTTCCGCCGAGTACATAGAAAAGAGTGAAATCGATAAACACGTCAAAGCTCACGAAGAAAAATTCGAATGTAACATATGCAACAAGAAGTTCAAAAAGGCCTATGACTTCGGAACACACAACTACACACACGACCCCGACAAAATGTTCCGTTGTCCACTTTGTTCGTATAATACAACGAAAAGAACCGGCTTCTTGGTACACATAAACTACACACATCTGAAAAAGTTCGGTTACGTATGCGAAACCTGCGGAAAAGGCTTCAACGACATCGTTTTGTACAAAGAACACAACAACGAACACCTGGGCGTCAAACCTTTTTCTTGTATAGTCTGCTCAAAAAATTTCACCTACTCCAGGTACCTCCTCACCCATCAAGTACGGTCACACAGAGTGGGCATAGATGGTCAACTACTACCGAATCAATGCTCCGTTTGTAGCAAAGTTTTCAGCAAGATGGTTACCCTTGAAAAGCACTTGGAGGAACGCCACGTCAAACAACAACTACCTCACGTCAAGAAGCATCTCTGCGACACGTGTGGCAAAGGCTTCGCCCAAAAAAATAAGCTGAGAGTGCACTACAGGGTCCATACTGGCTTCAAACCGTACACTTGTACCTACTGCGCCAAAAGCTTCACCAAGAAAGATTATTTGGTAATGCACGAAAGGGTGCACAGCGGTGAGAAACCGTATTCTTGCGAGTATTGTGGAAAGTGTTTCAGCCAGGGGGCACCGCTTAGGATACATCTTAGAACGCATACCGGAGAAAGACCTTATGTCTGTCAGTTTTGCAGTGCGGGGTTTACATCTCGAGGGGCACTCAATATACACTGCAAAAACTGCACCGGATCATCATGA
- Protein Sequence
- MAEIEEKEFPMDFFTPIVTINEAPLKPPTPKASPKIKKIKIKIEGTEEGAISGEIVKPKRGRPKKIKIEVETEDTEEAEESEVMKPKRGRPKKVSTDSIESKDKEDTKSDKYIDEDDDPLWSASASHKKKKKKKDKVKNKKSPKKSKTGDKKQEVVVKLEPPDENAVDAIKDEIKEEIKEEPVDPVELPVIVPLTEEEFKAMLPQEDHRKFKCPLCHKEKQDETERTPKELKLHYKDDHPGKRLRQSRFSNEVHPCDICGKEFRTNSAVKDHIETHNNYFYCEICNASQKKILDHIIHLRIHSEPGLFHCLMCDLNTPDINKITEHVNNHEDLLKYWCQPCKKGFQILPWFQEHDNYHTGLKPFDCEFCGKCFLYSRYLHAHKINMHKEDMIFPSLHECVICKKQYQHKNSLKLHMNSHTGNFSICDICGKMLSSKEKLKFHIRIHTGYKPYSCSYCEKSFTKKPILVEHTRIHTGERPYICEYCTKAFSQRSSLVIHMRGHTGERPYVCQFCTKAFVAKAMLNIHLKSYDDSNEYSVVDSNFTDFKNVFGDLKTEPKDEMDTFEPENIDPSSLEYLIIKDEVEAVLSSDAMSKPDISKTRLKRKRKAVPRDSLCPICGKIVKDKNLPKHFASHGHKNCDICTASFKTMEELEKHKELHLENEYPCGECNLSFKRAIDLATHSFKHAGEYPCPKCTFSTKSKSSIKGHIKRHERSYTHYCTVCGKGFLGKALLATHEEIHLDIKRYACDVCGKKFSVKRYLDVHRSFNHKKELYGIEQLYKCEVCGRDFTFEKSLKRHQSVIHHMGEDLTVECKVCHKIIANNYNLKMHMRIHTGEKKYCCDLCGKAFAAYKYWNRHKQTHEKQESELKDFEEGGNESEVADTVYCCELCGQEFPSLECWEEHEKTHTFLEEEEEETGIKIEDENCGEDKKKIKKIKIKQKKLKPRTSPKEKKSKQISKYIPKLKPNHKARAWVCKICFEEFSTRKALFEHRKIIHEDIEDSFKEEFEGEKYTFDEVLEFFTCSNCSAEFQTREEVEKHVETHEEKYDCELCHKSVNGALQFSAHMQKHREDKNFPCPMCTHMTARKSAMLTHIQRMHYRKYDFQCRTCGKCFNDATTFKEHENFHLGVKPFICIVCNREFIYSRYLIAHQVRNHRVRVLDKESKTQCHMCPKMFARNETLVKHVVSKHLTFHEGPHEKKHLCDVCGQGFSRTDKLKIHYRKHTGEKPYSCVYCSKSFIKRDYLIMHERIHNGEKPFFPDFELEMKVGEVEFSLNDAYEKEKKIKTPEKPSSPLKKKIKEEVDDEYTPPGQKKKKSLTAKPKKPRKPRPSKVKKQEHKIWTCRRCLEEFDSRRDLTDHTKTYHPEEPKDAPAYKFDEEQELYTCGTCSAEYQTKEEVETHISKVHEEFYTCEVCKHTSKKAYSFAVHMKIHSSDGTYTCPLCNYNTPRRTCLQTHINRVHYHKFYYTCPTCGKGFNDSVIFKEHNNEHLGIKPFICVVCNKDFVYSRYMLIHQTRYHTVHIEGTLHKTQCSICMKVFSKVATLLKHITTKHTSGSMDKPEKRHLCDMCGKGFATSDKLKIHYRIHTGDKPFACRKKAAEKDNLRSTRSPRKRAFPIKIYYTESEDFDDDYEYVPVTKRSKQKTSKEVVEIVVKNSGSRRRYDSKKQWPCKKCNEVMPTKRKLIIHRKMEHSKVDVNEHTYKFDEIQELFVCNTCSAEYIEKSEIDKHVKAHEEKFECNICNKKFKKAYDFGTHNYTHDPDKMFRCPLCSYNTTKRTGFLVHINYTHLKKFGYVCETCGKGFNDIVLYKEHNNEHLGVKPFSCIVCSKNFTYSRYLLTHQVRSHRVGIDGQLLPNQCSVCSKVFSKMVTLEKHLEERHVKQQLPHVKKHLCDTCGKGFAQKNKLRVHYRVHTGFKPYTCTYCAKSFTKKDYLVMHERVHSGEKPYSCEYCGKCFSQGAPLRIHLRTHTGERPYVCQFCSAGFTSRGALNIHCKNCTGSS
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -