Gcor002793.1
Basic Information
- Insect
- Gnatocerus cornutus
- Gene Symbol
- znf711
- Assembly
- GCA_029298725.1
- Location
- CM055510.1:5840174-5854467[+]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 55 0.59 46 5.1 0.8 2 18 45 58 45 64 0.68 2 55 3.4e-05 0.0027 18.4 0.1 1 23 70 92 70 92 0.97 3 55 1.8e-05 0.0015 19.3 0.5 1 23 98 120 98 120 0.97 4 55 4.7e-05 0.0038 18.0 2.7 1 23 133 155 133 155 0.97 5 55 6e-07 4.8e-05 23.9 0.6 1 23 161 183 161 183 0.98 6 55 7.7e-05 0.0061 17.3 0.8 1 23 189 211 189 211 0.98 7 55 0.00049 0.039 14.8 0.7 1 23 217 239 217 239 0.99 8 55 4.2e-05 0.0033 18.1 2.4 1 23 245 267 245 267 0.98 9 55 6.4e-05 0.0051 17.5 1.7 1 23 273 296 273 296 0.95 10 55 0.2 16 6.5 0.2 2 23 337 359 336 359 0.96 11 55 0.0058 0.46 11.4 2.7 2 23 366 388 365 388 0.96 12 55 4.4e-05 0.0035 18.1 3.6 1 23 393 415 393 415 0.98 13 55 1.4 1.1e+02 3.9 0.9 2 23 429 450 428 450 0.94 14 55 0.00067 0.053 14.3 1.1 1 23 456 478 456 478 0.94 15 55 3.5e-05 0.0028 18.4 0.9 1 23 485 507 485 507 0.98 16 55 0.00031 0.025 15.4 1.5 1 23 513 535 513 535 0.98 17 55 0.0019 0.15 12.9 1.6 1 23 563 585 563 585 0.98 18 55 2.3e-05 0.0018 18.9 1.9 2 23 595 616 594 616 0.96 19 55 4.7e-05 0.0037 18.0 1.1 1 23 622 644 622 644 0.97 20 55 6.2e-06 0.00049 20.8 0.3 1 23 650 672 650 672 0.98 21 55 6.6 5.3e+02 1.8 0.2 1 9 678 686 678 689 0.88 22 55 0.00083 0.065 14.1 0.2 1 20 696 715 696 716 0.95 23 55 0.046 3.6 8.6 3.2 1 23 723 745 723 745 0.97 24 55 0.00015 0.012 16.4 1.4 2 23 901 922 900 922 0.96 25 55 7.4e-05 0.0059 17.3 2.7 1 23 928 950 928 950 0.98 26 55 0.0006 0.047 14.5 1.5 2 23 957 978 956 978 0.97 27 55 0.0079 0.63 11.0 0.1 2 23 985 1007 984 1007 0.90 28 55 9.9e-05 0.0078 17.0 1.7 1 23 1012 1034 1012 1034 0.96 29 55 0.007 0.55 11.1 4.6 1 23 1043 1066 1043 1066 0.95 30 55 0.0002 0.016 16.0 0.5 1 23 1073 1095 1073 1095 0.97 31 55 5.4e-06 0.00043 20.9 0.3 2 23 1102 1123 1101 1123 0.97 32 55 8.9e-06 0.0007 20.3 0.2 3 22 1131 1150 1129 1150 0.95 33 55 0.002 0.16 12.8 1.7 1 19 1194 1212 1194 1212 0.98 34 55 0.00031 0.024 15.4 0.6 1 20 1223 1242 1223 1245 0.92 35 55 0.0064 0.51 11.3 4.0 1 23 1250 1272 1250 1273 0.95 36 55 0.14 11 7.0 2.8 3 23 1402 1422 1401 1422 0.99 37 55 0.00018 0.014 16.1 0.2 2 23 1425 1446 1424 1446 0.96 38 55 0.013 1 10.3 1.6 1 23 1451 1473 1451 1473 0.97 39 55 0.0047 0.38 11.7 2.1 1 23 1479 1501 1479 1501 0.97 40 55 0.0015 0.12 13.3 2.4 1 23 1507 1529 1507 1529 0.97 41 55 0.00041 0.032 15.0 4.4 1 23 1535 1558 1535 1559 0.94 42 55 3.1e-06 0.00024 21.7 0.5 1 23 1569 1592 1569 1592 0.97 43 55 0.0002 0.016 16.0 0.6 2 23 1601 1622 1600 1622 0.97 44 55 0.13 11 7.1 0.3 2 16 1629 1643 1629 1644 0.90 45 55 0.1 8.2 7.5 0.6 2 23 1676 1697 1675 1697 0.95 46 55 0.011 0.87 10.5 0.3 1 23 1716 1739 1716 1739 0.97 47 55 5.1 4.1e+02 2.1 5.9 1 23 1743 1765 1743 1765 0.97 48 55 0.035 2.8 8.9 1.6 1 23 1771 1794 1771 1794 0.96 49 55 0.0058 0.46 11.4 0.9 1 23 1800 1822 1800 1822 0.96 50 55 0.00077 0.061 14.2 0.6 1 23 1828 1851 1828 1851 0.96 51 55 0.00086 0.068 14.0 0.6 2 23 1863 1885 1862 1885 0.96 52 55 0.00098 0.078 13.8 2.6 3 23 1898 1918 1896 1918 0.96 53 55 4.5e-05 0.0035 18.0 1.0 1 23 1924 1946 1924 1946 0.98 54 55 6.7e-05 0.0053 17.5 0.1 1 23 1952 1974 1952 1974 0.98 55 55 0.0043 0.34 11.8 0.2 1 21 1980 2000 1980 2001 0.95
Sequence Information
- Coding Sequence
- ATGTGTGAACTCTGTGGACGAATTGAGCCACTGGGAGATGAGGAAAGCAAAAGAAAACTATCTTTTCTTCTCGTAACTGAGGATTTACCGCCAGCTACAAAAAAACAGACCACTCGTGACAGATCAACAAGAGTTTGCGAGGAATGTGGACGAACTTACAAAACTCTAAACACTCACAAAATGCGTTATCACTCTGCAGAAAGAAGTTTCATGTGTGATTATTGCGGAAAAGCCTTTTGGAGCCCCCAGAACCTGAGCAGACATATACTTATCCATACAGGCGAGAAGCCCTTCATGTGTGAACTTTGTGGAAGAGCTTTTAACCAAAAGACCACTTTACAAGACCACATTGATCGACACGCCGGAATCAAGAAAAAACCAAGAAACAGTAAAAACCACAATTGTCAACTTTGCGGGAAATGTTTTTGGAATCCTTCCCTTCTAAAAATTCATATGCGCGTACACACAGGAGAAAAACCATTTACATGTAAAATTTGCGACAAGTCTTTTTCGCAGGAAGGTGCCATGAAAATGCATATGAGAATTCATACGGAGGATAAACCATTTGTTTGTTCAATTTGTAACAAGTCGTTGTGTAACGATGGGGCTTTGAAATTACATATGAGAATTCACACGGGGGAAAAACCGTACAAATGCGAATTTTGCGACGCGTGTTTTGCGTATAAAGGTGTCTTGATTCTACACCTCAGGATTCATACGAAGGAACGACCTTTCAAGTGTGAGCTGTGCGAGAAGTCTTTTCGGGGCAAAAGTGCGTTAGATAGGCATCATAAGGTCCACACAAAGGAAAAACCGTATAAATGCGAGTTGTGCGAGACTTGTTTTAGTCAGAGCAGTGGTTTAAAGACTCATGTGTTAAATATTCATATGAAGAAGAAAAAACTGGAAACGTCTGAGTCCTCTTCTCCATTGTCTGTGAAAAGAGAACCATTAGATACCTCTGAAGATTTAAAAATCCTCTTGTCTCCTTCTAGGCAATACTTACTTTCTTGTTCCCAATGCAGTTATGGCACCAACATACCTAGAAACCTTGACAACCACATGATCACTTTACACACCAAAAACTACCACTTGAAATGTGATGTTTGCGACCGTGGTTTTGTTTCCAGCCATCGGCTGAACTATCATcacaaaatagagcacgaaaatCTCCGTTTTCCTTGTAAATTCTGCAAGAAAATCTTCAAATACCAAGAATCCTTAAAATACCACATGAAATCCCACAAACCCGACTATACTGCGGACAAAGAAAAAGAACCATGCGTTTGCGAAGTCTGCAACAAAATTCTAGGTTCTAAAATCATCTTCCAAAGTCACATGAAACTCCACACAAATCCCCAATCTTTCGTTTGTGATATTTGCGGTAAAACCTTAACCACGAAAGTGAGTCTCGTGAACCATTGCAAGATCCACACAGGTGAGAAATCCACGTTTAGTTGTGATATTTGTGGAAAATCTGTGACAACTAAACAGAAATTGATGGATCACAAAATGACACACACGGGAGAAAAACCATTTGTTTGTGAATTCTGTGGATGTGGGTTCATTAGGAGAGCCACGTTGACTGAACACTCCAGGACTCATACAAAGGAAAAACCGTTCAAATGTGAGGACGATGTGTCTTGGAAGGAACATGAAGAGAGGCACAATTCTAGCCCAGCGTCGTACCAATGTAATGTTTGTTTGAAGCAGTTGCGTTCAAAAACAGGTTATTCTCGTCATATGAGAGAACATATCggtaaaaagaagaaaaagagtTGCGTTTGTGAGGACTGTGGTAAAGTGTATTCATCGCTTGATAGTTTTCGTGACCACCGGAAGACTCACGCTGGTGTGAAGAACCACGTGTGTGAAACTTGTGGGAAAGCTTTTTACAGACGACAACAATTAATCATACATAACAGAATTCACACGAACGAAAAACCTTACGAATGTTGGGTTTGTGGAAAGAAATTCGCGCAGAAACCGTCTCTTAATGTCCACCTGAAGTATCACACTGGCAACATGCCGCACAAGTGTCATATTTGCCCTCGAGCatccgACGAGGAGACAGAAATCGTGAAGTACATTTGTAATATATGCGGTCAACCATTCGCGGATAAAAACGAGTTTCTTCGACACCACGTGGATAAAAACAGTCAAGCCTTCAAGTGCTGTAACTGCGCGAAACGTTTCAAAACCATTGCCCTCCTCTATACGCATCTCGAACAACACTCAACCGTTGAAGTGAACGTCAGCGATGGTGAAACCGTAGAAGAAGTTGTGTTAAACGAGCAAGAAATCAGTGACACAAACGAACTGTTGTACATCATCGATGATTCTCAGTGTGTCAACGAAGTGGTCATTGAAGAAGACGAGGTGCTGCAACCAAAATCGAAGAAAGTCGATTATAAAATTGTACAACTCGAACACGGTTATGTTATTCCATCGGCTGTAAACAAGACACCTAAAGAGGTGGTTATATCAAACGGAAAGCAAAAGATTGCGCAGGCTTCCACCAGTAAACTGGTGCCGCAGTCTAAGCCACGGACGCCACAAGCTAAGCGTCGTCACGTAATTCCGGATTTTTCTTCCGCAAACTACGAGTTTGTGAACGCTAATGAGGAAATTGAGATCCCGCATTATAAGTCATGGTGGTTTCCAATGAGTCCAACAAACCAAAAAGACGACAAATACAAACGCTGCCCTCACTGCCCTAAAACGTTTCCAACGAACCGAATTCTCAAACGCCACATGATAGCGCACAGTGATAGACGCCCATTCTCTTGTGATATCTGTTCAAGAACGTTTAAGCGAAAGTATGAAATCAAAAGTCACAAACGTGTTCACGAAGATCCCCTTTCTCTTCAATGCGATCTTTGTGATAGAAAACTGAAATCGAAAGGATCCTTACTAACACACCGAAGGCGACATCTTAAACAGTACGTTAAAGTTTGTCAAATATGTAATATTGGATTCGTTACCAATCAGGAATATAATAATCACATGGGAGCCAGACATGGAACCAGTTCTCACATTTGTGATGTTTGTGGAAGAAGCTGCTATGATAAAGCAGCGCTGCAAAGTCACATGGAAAAGCACGCTGAAGGTTATgagaacaataaatttaaatgtgAATACTGTGATAAGACGTTCCTACAAGAGAAATATTTAAAGCACCACTTTTTGAGGATACATAAAGATGGAGGGCGGAAGTTCATGTGCGATTTGTGCGGGAAGAAAGTTAACTCGAAAACAAGTCTTCGGGATCATTTGATAATGCACTCGGGGTCGAAACCGATAGAGTGTAAAGAATGCGGGAAAGGGTTTGCTTTGAAGACTACCTTGAAAGCGCACTTGAGAACGCATTCCGGGGAGCGACCTTATGGGTGCAAACAGTGTGGAAAAGCGTTTACGCAGAAGACGGCGTTGACGATACATATGAGATCAACAGTTCGTTTACCTAGTGGAATCACTATAAAGATATTGGCACCGTCGAATTCATCTCCAATAAAAAAGGATTCTTCCAAACCAATTCTAAAGAAACCAGCGTCACTTTTTAACCAAAATAGGACGTGTTACTACTGCTCCAAATGCAAGAAAGTCTTTCCTTCTTTCTCCGAACTAACAACTCACGATGCTGGCTGCGAAAAACCCAAACCGCGTCACGTCTGCAAATACTGCGACAAAGTTTTCGCATCAATGGGTGCTCTTAAAACACATCTTCCACGCCACGAAAAAACAGATCATCGCTGCAATGTCTGTGGGAGGGTCTTTAATAATTCTGCATCGAAAGACATGCATAGAAAAATGCACCACACTGAGTTTTTTAGTGAAAGTATTGATGGTGATTACCTTTGTAAGATatctgACTGGGATGACTGCAACGAAATGATAATAATCACACTAACTTCAGATGATTTAAATGAAACTACAGAATTTGTGATAGAAGAGAGTTTAGAATTAGACGAAACGGAAGCAGACAACAATGAAGAAGTCTCTCCAGACTGTACAAGCACAACTGTAAGCAAATTTAGTGACGACGGGAGCACCAAGAGcaccaaaaaaaatgtcaaactaAGGAAGAAGCATCAACAAACAATCTGCCATGAATGTGGAATTCTAGTAACAACAGCGTACCTCCCCACACACATCTCGGAAATGCATTATTATACTGAGAACGTTCAATCGCCTCATGTCCTTTGCCCTGATTGTGGTATTTTATGCAGAGATCATCGCTCGTACGCTTTACATAAACGAACCCATTTGGTGTCTTGTAAACAGTGTCCTGCTGCGTTCAAATCGCAAGAACAATTAAAGAAACATGCTCTGGTCCATGAAGAGCAACACTACGAGTGTAATAAGTGTGAAGCAACATTTAGCGGTTGCTACGAGTTCATCGTCCACAAAGAAAGTCACGAAGGTACCGGGATTTACGCTTGTGCGAAATGTGACCATACCACTTCAAAATTAGAGTTGTTGAGAAGACATTTGAAGCAGCACGATGGTCCCAAAGTTTACACTTGTGAGGTCTGTGATAAGAAGTTTTTTGTACATGCCATGTTTAAAACTCACATGGAGTTACATAGTGGAGTTAAAAAGTATTCTTGTGAACATTGCCCAAAGACGTTTTCTACGATAAATTATCTTAAAAATCATCGGGAGTTGAATCACCATAAGGAACTTTACGGGTATGATTTAACATATCCGTGTCCAGAATGTAAACGGGTATTTACTTTCGCGAAGAGTTTGAGGCGACACTTGAGCACTATTCATAAGATTGGGGAGAACACCACTGTTTCCTGCTCGATTTGTTTGAAGAAACTTGCTAATAATTACAATCTGAAGGTGCATATGAGGCAACATACTGGGGAGACTCCCAACACGTGTGAAGGCTGTGGAAAAGGATTTGTGACGCTCTCGTCCTTTCACCCAAAAACCCGTTCGAAATGCTACATAAAACAAGAAGAAGATAACGATAACAAGTCTCCTATAACAAGATCTTCTAAAACTCCCTCTACTTGGGTttgtaaaaaatgttttgtttcaTTTCCAACAAGGAAAGCTCGACAGGCTCATAACAAGATTCATAAAACTCAAGAGCGGGACCATCAAACTTACAAGTATGATGACAATTTAGAGTTATATATTTGCAACACTTGctcggctgaattccaagacaAAGAAGAGGTGGAGAATCACATCAAAAAAACTCACATCGAAACGTATTCGTGCGAAGCTTGCAACTGTAAGTTTAGTGATCCGTACAAATATTCTTGCCACATGCAACAACACAACGATAAAGGTTCCTTTGCTTGCCCTATGTGCTCTTATGCGACTAAAAGAAGGAGTTGCATTCTTACCCACATAAATAGGATGCATTACCATAAATTCTATTATTACTGTAATACTTGTGGCAAAGGTTTCAACGATACCGTTCGATTTAAGGAACACGAAAACGAACATTTAGGGATTAAACCCTTCGTATGCGTGGTCTGTACAAAGGCTTTTGTATACTCAAGATACCTTCAGCTGCATCAGATTCGTTACCACACTGTGGGCATAACAGGACAGTTATGTAAAAATCAGTGCTCAGTTTGCTTGAGAATTTTTTCAAGGAATGGAACTTTGGACAGACACATGGTCACCAGACATTCCAATAAGTTGGGACCACGTGTTAAACGACATCTTTGTGATATTTGTGGAAAAGGATTTGCCACCAACTGTAAGATGCAAATTCATTATAGAATACATACTGGGATTAAACCGTACGCTTGCAAGTACTGCACGAAGAGCTTTATTAAAAGAGATTATTTAATTATGCATGAAAGAGTGCATACAGGAGAGAAACCTTACGTTTGTCAGTATTGCGGAAAGTCTTTCAATCAAGGAGCTCCGTTGAGGATCCACGTTCGGGGGCACACAGGGGAGCGACCTTATATTTGccagttttgtaatttagggTTTATATCTAAGGGATCTCTCAATATGCACCAAAAAGTGTGCAGAGTCGAGTAG
- Protein Sequence
- MCELCGRIEPLGDEESKRKLSFLLVTEDLPPATKKQTTRDRSTRVCEECGRTYKTLNTHKMRYHSAERSFMCDYCGKAFWSPQNLSRHILIHTGEKPFMCELCGRAFNQKTTLQDHIDRHAGIKKKPRNSKNHNCQLCGKCFWNPSLLKIHMRVHTGEKPFTCKICDKSFSQEGAMKMHMRIHTEDKPFVCSICNKSLCNDGALKLHMRIHTGEKPYKCEFCDACFAYKGVLILHLRIHTKERPFKCELCEKSFRGKSALDRHHKVHTKEKPYKCELCETCFSQSSGLKTHVLNIHMKKKKLETSESSSPLSVKREPLDTSEDLKILLSPSRQYLLSCSQCSYGTNIPRNLDNHMITLHTKNYHLKCDVCDRGFVSSHRLNYHHKIEHENLRFPCKFCKKIFKYQESLKYHMKSHKPDYTADKEKEPCVCEVCNKILGSKIIFQSHMKLHTNPQSFVCDICGKTLTTKVSLVNHCKIHTGEKSTFSCDICGKSVTTKQKLMDHKMTHTGEKPFVCEFCGCGFIRRATLTEHSRTHTKEKPFKCEDDVSWKEHEERHNSSPASYQCNVCLKQLRSKTGYSRHMREHIGKKKKKSCVCEDCGKVYSSLDSFRDHRKTHAGVKNHVCETCGKAFYRRQQLIIHNRIHTNEKPYECWVCGKKFAQKPSLNVHLKYHTGNMPHKCHICPRASDEETEIVKYICNICGQPFADKNEFLRHHVDKNSQAFKCCNCAKRFKTIALLYTHLEQHSTVEVNVSDGETVEEVVLNEQEISDTNELLYIIDDSQCVNEVVIEEDEVLQPKSKKVDYKIVQLEHGYVIPSAVNKTPKEVVISNGKQKIAQASTSKLVPQSKPRTPQAKRRHVIPDFSSANYEFVNANEEIEIPHYKSWWFPMSPTNQKDDKYKRCPHCPKTFPTNRILKRHMIAHSDRRPFSCDICSRTFKRKYEIKSHKRVHEDPLSLQCDLCDRKLKSKGSLLTHRRRHLKQYVKVCQICNIGFVTNQEYNNHMGARHGTSSHICDVCGRSCYDKAALQSHMEKHAEGYENNKFKCEYCDKTFLQEKYLKHHFLRIHKDGGRKFMCDLCGKKVNSKTSLRDHLIMHSGSKPIECKECGKGFALKTTLKAHLRTHSGERPYGCKQCGKAFTQKTALTIHMRSTVRLPSGITIKILAPSNSSPIKKDSSKPILKKPASLFNQNRTCYYCSKCKKVFPSFSELTTHDAGCEKPKPRHVCKYCDKVFASMGALKTHLPRHEKTDHRCNVCGRVFNNSASKDMHRKMHHTEFFSESIDGDYLCKISDWDDCNEMIIITLTSDDLNETTEFVIEESLELDETEADNNEEVSPDCTSTTVSKFSDDGSTKSTKKNVKLRKKHQQTICHECGILVTTAYLPTHISEMHYYTENVQSPHVLCPDCGILCRDHRSYALHKRTHLVSCKQCPAAFKSQEQLKKHALVHEEQHYECNKCEATFSGCYEFIVHKESHEGTGIYACAKCDHTTSKLELLRRHLKQHDGPKVYTCEVCDKKFFVHAMFKTHMELHSGVKKYSCEHCPKTFSTINYLKNHRELNHHKELYGYDLTYPCPECKRVFTFAKSLRRHLSTIHKIGENTTVSCSICLKKLANNYNLKVHMRQHTGETPNTCEGCGKGFVTLSSFHPKTRSKCYIKQEEDNDNKSPITRSSKTPSTWVCKKCFVSFPTRKARQAHNKIHKTQERDHQTYKYDDNLELYICNTCSAEFQDKEEVENHIKKTHIETYSCEACNCKFSDPYKYSCHMQQHNDKGSFACPMCSYATKRRSCILTHINRMHYHKFYYYCNTCGKGFNDTVRFKEHENEHLGIKPFVCVVCTKAFVYSRYLQLHQIRYHTVGITGQLCKNQCSVCLRIFSRNGTLDRHMVTRHSNKLGPRVKRHLCDICGKGFATNCKMQIHYRIHTGIKPYACKYCTKSFIKRDYLIMHERVHTGEKPYVCQYCGKSFNQGAPLRIHVRGHTGERPYICQFCNLGFISKGSLNMHQKVCRVE
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -