Qclu001256.1
Basic Information
- Insect
- Quadriops clusia
- Gene Symbol
- -
- Assembly
- GCA_033557875.1
- Location
- JAWQRP010000187.1:23080-32591[-]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 49 0.35 29 6.2 1.7 2 20 75 93 74 95 0.91 2 49 0.0025 0.21 13.0 1.6 1 23 159 181 159 181 0.99 3 49 0.011 0.95 10.9 0.9 1 23 184 206 184 206 0.98 4 49 9.7e-06 0.0008 20.6 0.8 2 20 213 231 212 233 0.95 5 49 1.4e-06 0.00012 23.2 0.4 1 23 240 263 240 263 0.97 6 49 0.0032 0.26 12.7 0.9 1 23 268 291 268 291 0.97 7 49 0.2 17 7.0 0.1 3 23 328 345 327 345 0.82 8 49 8.4e-07 7e-05 23.9 1.6 1 23 351 373 351 373 0.99 9 49 0.0069 0.57 11.6 2.4 1 23 379 401 379 401 0.99 10 49 0.2 17 7.0 0.1 3 23 481 498 480 498 0.82 11 49 0.00038 0.031 15.6 1.9 1 23 504 526 504 526 0.99 12 49 0.017 1.4 10.3 2.3 1 23 532 554 532 554 0.98 13 49 0.0029 0.24 12.8 3.0 1 23 684 706 684 706 0.99 14 49 0.0091 0.75 11.2 1.2 1 23 709 731 709 731 0.98 15 49 4e-05 0.0033 18.6 1.9 2 20 738 756 737 758 0.95 16 49 0.00027 0.022 16.1 0.7 1 23 765 788 765 788 0.97 17 49 0.00033 0.027 15.8 0.7 1 23 793 816 793 816 0.98 18 49 3.9 3.2e+02 3.0 0.1 2 23 851 871 850 871 0.80 19 49 8.7e-06 0.00072 20.7 3.9 1 23 877 899 877 899 0.99 20 49 0.00029 0.024 16.0 5.0 1 23 905 927 905 927 0.99 21 49 0.25 20 6.7 0.6 1 23 945 967 945 967 0.97 22 49 7.4 6.1e+02 2.1 3.0 1 23 1049 1073 1049 1073 0.89 23 49 0.028 2.3 9.7 2.0 1 23 1076 1098 1076 1098 0.97 24 49 0.0016 0.13 13.6 2.2 2 20 1105 1123 1104 1125 0.94 25 49 4.8e-06 0.00039 21.6 0.5 1 23 1132 1155 1132 1155 0.96 26 49 0.00058 0.048 15.0 0.4 1 23 1160 1183 1160 1183 0.97 27 49 0.77 64 5.2 0.5 2 23 1192 1212 1191 1212 0.94 28 49 0.21 17 7.0 0.1 2 23 1218 1238 1217 1238 0.95 29 49 1.7e-06 0.00014 23.0 2.1 1 23 1244 1266 1244 1266 0.99 30 49 0.00025 0.021 16.1 5.1 1 23 1272 1294 1272 1294 0.99 31 49 0.81 67 5.1 1.1 1 23 1326 1348 1326 1348 0.82 32 49 0.12 9.6 7.8 4.2 2 23 1548 1569 1547 1569 0.97 33 49 0.022 1.8 10.0 0.5 1 23 1572 1594 1572 1594 0.94 34 49 0.042 3.5 9.1 0.1 3 21 1602 1620 1600 1621 0.93 35 49 0.0091 0.75 11.2 1.6 1 23 1627 1650 1627 1650 0.93 36 49 0.54 45 5.7 4.8 1 23 1655 1677 1655 1677 0.97 37 49 0.064 5.3 8.6 0.2 3 23 1687 1706 1686 1706 0.97 38 49 1.7e-05 0.0014 19.8 0.1 2 23 1712 1733 1712 1733 0.96 39 49 0.0031 0.25 12.7 0.4 1 23 1739 1761 1739 1761 0.98 40 49 0.00025 0.02 16.2 2.0 1 23 1767 1789 1767 1789 0.99 41 49 0.011 0.93 10.9 0.1 2 23 1866 1887 1865 1887 0.95 42 49 0.00012 0.0097 17.2 0.9 1 23 1890 1912 1890 1912 0.95 43 49 0.0071 0.59 11.6 0.1 2 22 1919 1939 1918 1939 0.94 44 49 1.5e-05 0.0012 20.0 1.8 1 23 1944 1967 1944 1967 0.96 45 49 0.076 6.3 8.3 5.3 1 23 1972 1994 1972 1994 0.98 46 49 0.00062 0.051 14.9 0.2 2 23 2030 2051 2029 2051 0.95 47 49 0.00054 0.044 15.1 0.3 1 23 2057 2079 2057 2079 0.97 48 49 0.00025 0.021 16.1 2.0 1 23 2085 2107 2085 2107 0.97 49 49 0.0049 0.4 12.1 2.9 3 23 2115 2135 2113 2136 0.95
Sequence Information
- Coding Sequence
- ATGACAGACAGAATCCTGAAAAGTGACCTGGAGGATTATCGATCTGACATTTTCTGCGATTTGTTTGGGGATGGTGCAGAATTATTCGAACACAAGCACGAGCATTCTCTGCAAAAGGAGAAAGTAACTTCACCATCGACCGCCGAATCGGAAAATGCAATTTCCtcaaacaagaaaacaaagaaaaagccAGTTACAAAATCccgaacaaagaaaatttggaCTTGTAAACACTGCTTCGTAAGCTTCACTAGCAAGCTTCTTTTGGACGCACACGTATGCAACAAGGAGGGTCCCATTGCCAATTCTGAATCAGAAACTACAACTTccccaaaaaagaaaacaaggaGACAGTTAACTAAAAAATGGACATGTGAAAGCTGCTCTATAACCTGGCACAACAAGCTTTTGTTTAATACACACATATGCAGCACAGAGGGCGACATCGCCAATTTCGGTAAACAGATACCGTCCTTCAAGTGTGAGTTCTGCCCAGAAGTTTACACAGAAAAAAATGAGTTCTTCGAACACCGAAAAACGCATGAAGGTTACAGTTGTCCTCATTGCGATAGAAAACTTCTAAACCCGGATTTGTTCGAACGCCACGTCCAATCTCACACTAAAGATAATTACGTAAAGTGTGGTGAATGCGGGAAATCTTTTAACCATCAAACTAATTTGCAAAAGCACCTCGACTACAAGCTTTGCAGAGGCGGCTTCGTATGCGAAGTgtgcaacaaaatatttacccAGAAGGGAACTTTGACTAATCATATGAACTCTGTGCACATAGGAATCAGATTCGAGTGCAAAccatgtaacaaaaaattctcAATGGAACAAAGTTACAAAGCTCATATGCTAACAATCCACGATCCTAATTTTGTCAAGAAGACGAAAAAATGTCCTTTTTGTGAAAAGGAAATACCCTCTCGTTCTTTCGGGAGTCATGTTACAAGACACAAGAATGGTTTGCGCGTTTGCCCGGAATGCGGAAAACGTGTGAACAATTTAAAGGTGCATCTAATGGTTCATCAAGACCTTAAACCCTACAAGTGCGACGAATGCGGAAAAGAATTCAGATCATCAAAATACTTAAAGGAGCACCAGAAGATTCATGCCAACGATAAACGATATGAATGTCCCTTATGTTTTAAGAAATTCGTTCAATGTAATGCTAAGACTATTCATGTCAGAACACACACTGGCGAAAAACCTTCCAGATGTGACGAAAtagatttgaaatataattcgATCAAAGACACAATCCTGAAAAGTGAACTGGGGGATTATCGATGTGACCTTTGTTGCGATTTGTTTGAGGAAGAGGCAAAATTCCTGGAACACAAGCATGAGCATTCTCTGCAAAAGGAGTCTACACAGAAAGTAGTATCTTCTTTCGGGAGTCACGTTACAAAACACAAGAATGGTTTGCGCGTTTGCCCAGAATGCGGAAAACGTgtgaacaatttaaaagtgCATCTAATGGTCCATCAGGACCTTAAGCCTTACAAGTGCGACGAATGCCGAAAAGAATTAAGATCATCAAAATACTTGAAGGAGCACCAGAAGATTCATGCAAACGATAAACGATATGAATGTCCTATATGTTTTAAGAAATTCGTACAGTGTAATGTTAAGACTATTCATGTCAGAACACACACGGGCGAAAAACCTTCCAGATGTGACGAAATAGAGGTGAAATATAACCCGATCTCAGCAGGAACCGTTAAAAGTGACCCGGAGGATTATCAATGCGACATCTGCTGTGATTTGTTTGAGGATGAGGCAGAATTCTTCCAACACATGCATGACCATTCTTTGCAAAAGAAATCTACGCGGAAAGTTGGTTCATCATTGGCCATCAAACGGGAAAATACAACTTCctcaataaagaaaaaaaagaaaaagcaaGTTAGAAAACAcccaggaaataaaaaatggactTGTGAAAGGTGTTACATCAGCTTGAGTAACAAGCTTCTGTTTGATACACACGTGTGCAATACAGAAGGCCAAATTGTCAATTCCAAAAAGAAGACACTGTCCTTCAAGTGTGAGTTTTGCCCAGAAATTTACACTGAAAAGAAAGAGTTCTTCAAACATCGAAAAACGCACGAGGGGTACAAGTGTCCCCACTGCGATAGAAAAATTCTAAGGCCTGATTTGTTCGAACGCCACGTCCAATCTCATACTAAAGACAACTACGTGAAGTGTAACGAATGCGGGCAATCTTTTAACCACCAAACTAATTTGCAAAAGCACTTAGACCACAAACTTTGCAGAGGAGGCTTCATGTGCGaaatttgcaagaaaatatttactcaAATGACCGGCTGGAAGACCCATATGAAATCTGCACATTTAGGGGTGAAATTCGAGTGCAAAATATGTAAGAAATCTTACGCATTAGAGCAGAGTTACAAGGCGCACATGAAGATGGTCCATGATCCTGACTTTGTAAAAACCGAAAAATGTCCTGTTTGTGAGAAGGACATAAGTAAGCGTACGATAAGGAGCCACGTAGCGAAGCACAAGAGAGGGAAGACCGTTTGCCCTGAATGTGGAAAACTATTGACGGCAGCCAGTCTGAAGCCCCATTTGCTGAACCACCAGGGGCTTAAGCCGTATAAATGCGATGAATGCGGGAAAGCGTTTAGTTCGTCGTGTTATTTGAAGGATCACAAGAAGACTCACTCGAAGGATAAACGTTATGAATGTCCTATATGCCACAAGAAGTTCATACAGTGTAGCACGAAGAATATTCATTTGAGAACGCATACTGGTGACAAGCCTCACGGATGCAATATATCCTACAATAATCTAGAGGACTATCAGTGTAACATTTGCTTCGATGTGTTGAACGATGAGACAGAATTGTTCGTTCACATGCACGAACATTCCCAGGAGCCTACAAAGTCGAAGAACAAAGTTGAAGAAACTTTAAAGGCTTCTTCAGAGAAAGATTCAGATAATTCAACAAAAGATAATGATCGTACCCcgaaagaaaagaaagcaaGGAAGAGAACCAGGACCATACATACCAAGCTCAAATGCGAAACTTGTTCAGGAATTTTCTATAATCGAGTTCTGTTTGATACGCATGTCTGCGAAGGTAAACGACAACAACGATCCTTACTTACATTTAAATGTGACTTCTGCCcggaaatttataattataaggaaAAAACCAAATTTTTCGAACACAGAAAGACTCATGGAGGATTCAGCTGTCCACATTGTGACAAAAAGATCCTAAGGCGGGATTTGTTCGAGCGACACGTCCAATCGCACactaaaagtaattatatcaAGTGCGAGACTTGCGGACAATCTTTTAACCACGAGACCAATTTCCGAAAGCACTTAGATCACAAACTTTGTAGGGGCGGCTTCGTGTGcgaaatttgcaacaaaatgttttcccaGAAGATTGGCTTGAACAAGCACATGGATTCTACTCACTTAGGGGTGAAGTTTGAGTGCGTGATATGCAATAAAGCTTACACTGTTGAGCAATCATATAAGACACACATGAAAATGGTCCATGatccaaattttgtaaaaaccaaaaagtgtCCTGTTTGTAAGAAGGACATTAGCAAAACTTCATTAAGGACCCACGTTGCTTTGCACAAGAAAGGTAAAAGTGTTTGTCCTGAGTGTGGCAAGAGATTATGGCCGGAGTCTTTAAAAACGCATCTGTTGACCCACCGGGGTCTTAAACCTTATCAGTGCGAGCATTGCGGGAAGGCCTTTGCTTCGAAACCTTACTTGAAGGATCACAGGAAGACTCATACGAATGATAAGCGTTATGAGTGCCATATATGCCATAAGAAGTTCACACAGGCCACCAGCAGGAATATACATGTTAGGACTCATACTGGCGATAAGCCTTATGGTTGTAGCTTGTGCGAGAAGAAGTTTTATACTGGATACAAACAAGTGAATTTGGAACCAACTATAATCATTCTGTACAAATGCGGCATCTGCAGTGAAACGTTCGAAAACGAGCAATTGCACTCTGAGCACGTAAAACAACATTCTCTTCAGAAATGCTCCAGTAAAAAGCAGAAGGCTGGGAATTTGAAAAGCAAACTCAAAGAGAAGAAAGGTAAAAAGTCTGGCAAGAAGGCAGAAAATACGAAAAGGACAAGGAAGTCCAGAAATGGTGGCCCGTCTAAATCCATGGAATCCAAATCGTTAGAATCTAAACCTTCAGATTCTAAATCATCGGAATCTATATCATCAGATTCTAAAGCATCAGATTCTAAAGCATCAGATTCTAAAGCATCAGATTCTAAAGCATCAGATTTTAAAGCATCAGAATCTAAATCATCACAATCTCAAGCATCAAAAGTGAAATCATCAAAAGTAAAATCTTCAAAAGTGAAatcatcaaattttaaatcatcaGAATCCGAATCCTCTGAATCCGAATCCTCAGAACCCGACTCTTTGGAACTCGGATCCTCAGAATCCGATTCGTGGGATCTCGAACCAAAGAAGAACAccgaaagaaagaaaaaggcAAAAAAAGCCCAAAAATCCAGAAAAACCGATATATCCAAGCCAGTAATACGAACGGGAAAACGGGGAAGGCCCAGAAAAGCCCCAGACCCCACCCAACCAGAAGTTTGGACTTGTCACTTCTGCCCGGCGAAATTCAACAATGAGAAGAAGTACTACACCCATCGCAAAACCCATGGCCCTTACAGTTGTCCTCATTGCGACAAAAAACTGGCCATCTTCAGTGCCTACACAAGACACGTGGCCGCCCACACGAAATCAAATTACTTAGCCTGCGAGAGATGCGGGAAGCTCTTTCCTTTCGACAAGAACTTAGAAAAGCATCAGCTTATATGCCTCAAAGGCGGCCATGCTTGCGAACATTGCGAGAAGGTGTTCTCCACAAGAGCTGCCAAGCAAGAACACACGGAATCCGTCCACTTGGGCGTCTTGCACTCTTGCAAATTCTGCGACcaacaattcaaaataaagCAGTCGCTCGTCTTCCACATGAAGACCCATGAGCCGGGATACAATGGCCCTGAACCTTGCCCAATTTGCGATAAGCCAGTCAACGCAAATTACAGACAAAAGCACATGCAAAGACATAAGATGGAGAGGCGAGTCTGTCCTGAATGTGGGAAGAGTTTCGCGAAGTTAGAGAGTCTAAAAACTCATCTGAATGGTCACAGAGGTATTCGTCCTTTTCAGTGCGATCTTTGCGAGGCTTCGTTTACTGCGAAGAAAACGTTGGTTGACCACAGGAAGATCCACACTGGGGAGCGGAATTACGAATGCGAGTTTTGTTACAAGCGGTTTATTCAAAGAACTAGTTGGAAAATTCATTTGAGAACGCATATGAAGGATAGGGGTTTCTTCCAATATGAGTTTTTGAAACTGGAACCTCCTTTAGAGATAAAATGCGAATTCTTAACTGAATACGAGGAACTGGGCCAAGAGCCCGAACTCATGGATCCGCTTAAAAGGAGCAAAGAGGACATTCCGAGGAAAGCTCGCAAAAAAGgatcaaagaaaaaacatcAACCTTCAGTCAAGAATCTGAGTCTTATTCCACCTCTTTTGTGGACTTGTAAAATTTGTAACGAAGCATTCCCAACGAAGACGATCCTTACAGCTCATTGGAAAGCTCACGGCGGGTACAGTTGTCCTCATTGCGACAAAAAGATTAACAATAGTTCTGCATTGAAGAAGCACGTGGACGCCCACAAGAAAGATACTTATCTAGAATGCGATTTGTGCAAAAAGAAGTATCCTAATGAAGAGGAACTGAAAGAGCACCAACTCACTTGTATAGGAGGGTTTGTGTGCGAACATTGCAATAAGAAATTTGTGCAAAAGGGAACTTTGAGGGCGCACATAGACTCGAAGCATTTAGGGGTAAAGTACCATTGCACGATCTGCGACAAGTACTTAACCCATATTGATTCGTTTAAAGCTCACAAGAAGAGCCATGACCCAGACTACGAAAAGTTCTACTGGTGCGACATCTGCGAGAAGAAGTTGAGGGTCACCTCGAGGAAAACTCATTCAAAACTGCACAAAAGCGACTCGAGGCTCATTTGCCAGGAATGTGGCAAAAGCTTGGCCACAAAGGAAAGCTTGAAGTTGCATTTAGGGAAGCATAGGGGGGAGAAACCATTCCAGTGCGAAGTGTGCGGGAAAGGGTTCACCGCGAAGAAATACTTGGAAGGGCACAAGGTCACGCACACGAAGGAGAAAAGCTTCGCTTGCAACGTGTGCAGCAAGAAGTTCACGCAGAGGACACCTCTGGTCATTCACATGCGCCACCATACGGGCGAGAGGCCGTACGGGTGCGACATTTGCAGCAAAAAGTTTGTCACCAAGACATTGTTGGTTTATCACAAAAAGAAACACCATTGCCCTTCCGATAGCGAGTAA
- Protein Sequence
- MTDRILKSDLEDYRSDIFCDLFGDGAELFEHKHEHSLQKEKVTSPSTAESENAISSNKKTKKKPVTKSRTKKIWTCKHCFVSFTSKLLLDAHVCNKEGPIANSESETTTSPKKKTRRQLTKKWTCESCSITWHNKLLFNTHICSTEGDIANFGKQIPSFKCEFCPEVYTEKNEFFEHRKTHEGYSCPHCDRKLLNPDLFERHVQSHTKDNYVKCGECGKSFNHQTNLQKHLDYKLCRGGFVCEVCNKIFTQKGTLTNHMNSVHIGIRFECKPCNKKFSMEQSYKAHMLTIHDPNFVKKTKKCPFCEKEIPSRSFGSHVTRHKNGLRVCPECGKRVNNLKVHLMVHQDLKPYKCDECGKEFRSSKYLKEHQKIHANDKRYECPLCFKKFVQCNAKTIHVRTHTGEKPSRCDEIDLKYNSIKDTILKSELGDYRCDLCCDLFEEEAKFLEHKHEHSLQKESTQKVVSSFGSHVTKHKNGLRVCPECGKRVNNLKVHLMVHQDLKPYKCDECRKELRSSKYLKEHQKIHANDKRYECPICFKKFVQCNVKTIHVRTHTGEKPSRCDEIEVKYNPISAGTVKSDPEDYQCDICCDLFEDEAEFFQHMHDHSLQKKSTRKVGSSLAIKRENTTSSIKKKKKKQVRKHPGNKKWTCERCYISLSNKLLFDTHVCNTEGQIVNSKKKTLSFKCEFCPEIYTEKKEFFKHRKTHEGYKCPHCDRKILRPDLFERHVQSHTKDNYVKCNECGQSFNHQTNLQKHLDHKLCRGGFMCEICKKIFTQMTGWKTHMKSAHLGVKFECKICKKSYALEQSYKAHMKMVHDPDFVKTEKCPVCEKDISKRTIRSHVAKHKRGKTVCPECGKLLTAASLKPHLLNHQGLKPYKCDECGKAFSSSCYLKDHKKTHSKDKRYECPICHKKFIQCSTKNIHLRTHTGDKPHGCNISYNNLEDYQCNICFDVLNDETELFVHMHEHSQEPTKSKNKVEETLKASSEKDSDNSTKDNDRTPKEKKARKRTRTIHTKLKCETCSGIFYNRVLFDTHVCEGKRQQRSLLTFKCDFCPEIYNYKEKTKFFEHRKTHGGFSCPHCDKKILRRDLFERHVQSHTKSNYIKCETCGQSFNHETNFRKHLDHKLCRGGFVCEICNKMFSQKIGLNKHMDSTHLGVKFECVICNKAYTVEQSYKTHMKMVHDPNFVKTKKCPVCKKDISKTSLRTHVALHKKGKSVCPECGKRLWPESLKTHLLTHRGLKPYQCEHCGKAFASKPYLKDHRKTHTNDKRYECHICHKKFTQATSRNIHVRTHTGDKPYGCSLCEKKFYTGYKQVNLEPTIIILYKCGICSETFENEQLHSEHVKQHSLQKCSSKKQKAGNLKSKLKEKKGKKSGKKAENTKRTRKSRNGGPSKSMESKSLESKPSDSKSSESISSDSKASDSKASDSKASDSKASDFKASESKSSQSQASKVKSSKVKSSKVKSSNFKSSESESSESESSEPDSLELGSSESDSWDLEPKKNTERKKKAKKAQKSRKTDISKPVIRTGKRGRPRKAPDPTQPEVWTCHFCPAKFNNEKKYYTHRKTHGPYSCPHCDKKLAIFSAYTRHVAAHTKSNYLACERCGKLFPFDKNLEKHQLICLKGGHACEHCEKVFSTRAAKQEHTESVHLGVLHSCKFCDQQFKIKQSLVFHMKTHEPGYNGPEPCPICDKPVNANYRQKHMQRHKMERRVCPECGKSFAKLESLKTHLNGHRGIRPFQCDLCEASFTAKKTLVDHRKIHTGERNYECEFCYKRFIQRTSWKIHLRTHMKDRGFFQYEFLKLEPPLEIKCEFLTEYEELGQEPELMDPLKRSKEDIPRKARKKGSKKKHQPSVKNLSLIPPLLWTCKICNEAFPTKTILTAHWKAHGGYSCPHCDKKINNSSALKKHVDAHKKDTYLECDLCKKKYPNEEELKEHQLTCIGGFVCEHCNKKFVQKGTLRAHIDSKHLGVKYHCTICDKYLTHIDSFKAHKKSHDPDYEKFYWCDICEKKLRVTSRKTHSKLHKSDSRLICQECGKSLATKESLKLHLGKHRGEKPFQCEVCGKGFTAKKYLEGHKVTHTKEKSFACNVCSKKFTQRTPLVIHMRHHTGERPYGCDICSKKFVTKTLLVYHKKKHHCPSDSE
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -