Basic Information

Gene Symbol
wek
Assembly
GCA_963942485.1
Location
OZ012630.1:6920696-6927905[+]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 51 0.018 1.7 9.7 3.8 1 23 34 56 34 56 0.98
2 51 0.29 26 6.0 0.3 1 21 58 78 58 79 0.92
3 51 1.9e-05 0.0017 19.1 1.0 1 23 86 108 86 108 0.98
4 51 0.021 1.9 9.5 1.2 1 23 113 136 113 136 0.91
5 51 0.22 21 6.3 0.1 1 23 186 208 186 208 0.96
6 51 2.8e-05 0.0025 18.6 1.1 1 23 506 528 506 528 0.98
7 51 2e-05 0.0018 19.0 1.2 1 23 534 556 534 556 0.98
8 51 9.2e-06 0.00084 20.1 6.9 1 23 561 583 561 583 0.99
9 51 0.0054 0.49 11.4 2.6 3 23 591 611 590 612 0.95
10 51 0.0025 0.23 12.4 0.2 2 23 618 639 617 639 0.96
11 51 0.00018 0.016 16.1 1.6 1 23 649 671 649 671 0.97
12 51 0.00025 0.022 15.6 0.4 1 23 677 699 677 699 0.98
13 51 2.3e-06 0.00021 22.0 1.9 1 23 705 727 705 727 0.98
14 51 3.9e-07 3.6e-05 24.4 2.0 1 23 733 755 733 755 0.98
15 51 0.00011 0.0098 16.7 3.5 5 23 764 782 761 782 0.95
16 51 0.0004 0.036 14.9 0.8 1 23 830 853 830 853 0.93
17 51 0.02 1.8 9.6 3.9 2 23 866 887 865 887 0.97
18 51 0.0089 0.81 10.7 2.5 5 23 895 914 893 914 0.95
19 51 0.0094 0.85 10.6 3.9 1 23 919 941 919 941 0.95
20 51 0.00048 0.044 14.7 1.3 1 23 949 972 949 972 0.97
21 51 0.00027 0.024 15.5 4.9 1 23 979 1001 979 1001 0.96
22 51 5.5e-06 0.0005 20.8 0.8 1 23 1007 1029 1007 1029 0.98
23 51 0.04 3.7 8.6 1.0 2 23 1057 1077 1056 1077 0.93
24 51 9.7e-05 0.0089 16.9 2.6 1 23 1083 1105 1083 1105 0.98
25 51 0.0015 0.13 13.2 0.5 1 23 1108 1130 1108 1130 0.99
26 51 0.019 1.7 9.7 0.4 1 20 1138 1157 1138 1158 0.98
27 51 7.9e-05 0.0072 17.2 2.3 1 23 1166 1189 1166 1189 0.96
28 51 0.0016 0.15 13.0 1.5 1 23 1197 1220 1197 1220 0.98
29 51 5.1e-05 0.0046 17.8 0.2 1 23 1226 1248 1226 1248 0.97
30 51 0.21 19 6.4 3.4 1 23 1254 1277 1254 1277 0.90
31 51 1.3e-05 0.0012 19.7 1.8 1 23 1284 1306 1284 1306 0.98
32 51 0.012 1.1 10.3 2.6 1 21 1312 1332 1312 1333 0.92
33 51 0.0013 0.12 13.3 4.2 2 23 1344 1365 1344 1365 0.98
34 51 0.011 1 10.4 2.6 1 23 1371 1393 1371 1393 0.96
35 51 0.00017 0.016 16.1 1.0 1 23 1399 1421 1399 1421 0.99
36 51 0.016 1.4 9.9 0.3 2 23 1428 1450 1428 1450 0.91
37 51 0.00021 0.019 15.8 3.4 1 23 1455 1477 1455 1477 0.96
38 51 0.0026 0.24 12.4 1.9 2 23 1488 1510 1487 1510 0.95
39 51 0.00016 0.015 16.2 0.4 1 23 1517 1539 1517 1539 0.97
40 51 2.6e-05 0.0024 18.7 0.2 2 23 1546 1567 1545 1567 0.97
41 51 2.4e-07 2.2e-05 25.1 1.2 1 23 1573 1595 1573 1595 0.98
42 51 0.00091 0.083 13.8 0.1 1 23 1601 1624 1601 1624 0.97
43 51 4.9 4.5e+02 2.1 0.3 1 21 1644 1664 1644 1665 0.91
44 51 0.0029 0.27 12.2 0.5 1 23 1672 1694 1672 1694 0.99
45 51 0.022 2 9.4 0.1 1 23 1700 1723 1700 1723 0.93
46 51 3.9 3.5e+02 2.4 0.4 1 23 1735 1758 1735 1758 0.89
47 51 0.00022 0.02 15.8 0.8 1 23 1764 1786 1764 1786 0.93
48 51 7.1e-05 0.0065 17.3 0.5 3 23 1794 1814 1792 1814 0.98
49 51 0.0002 0.019 15.9 0.1 1 22 1820 1841 1820 1841 0.95
50 51 2.2 2e+02 3.2 0.3 1 23 1875 1897 1875 1897 0.90
51 51 0.75 69 4.6 3.6 1 23 1902 1924 1902 1924 0.98

Sequence Information

Coding Sequence
ATGTGTAAAAAATGCCACGTAGATGTAGACAATATTCTATTAAACAAATCATACGTCTTTGTAAACAACACAGTAAACCTACAGAAAAAACGCAAAGTGTTCACCTGCGAAATATGCGGTGACACCTTCACAAACTTCAATCAATTCGGACGCCACAAACGTCATCACATCTTCCGCTGCGTTCGCTGCAACATGTTCTTCTCCAATGCCCAAGACGTGATGTACCACGTGTCTACTACGAAACTCGTCTGCTCTTTTGAGTGTCTTAAGTGCCCTAAAACTTTTACTAGTCGACAGGCGTTTAGTGTCCATATGCGATGGCATAAACTGTTCGAGTATTACTGTTCTCAATGTAATGCAGGGTTTGAGGATTATTTCGAACTGAAGAGGCATTCAGTGAGAAAGCACTCTGGTGTTGTTAAATATAAGAAGCACCCGGGTACtgataaagttaaaaaacagTTGGGTACTGTTAAACTTGAAAAGTTGCAGAAACAAATCGCGGCGAAACTACATAAGATCCTGATACCTAAGGAGGGTGAAGCAGAAGTGAGGTTTAGCTGCGAGGAGTGTTTCGACGAGTTTTCAGATCAAGAGGCATTGGATGAGCACACGATTTTGCACAAGGAGCTAAACGAGGCCACCCAGAGGAAATCCATAGCCACAGACGATGAAGTTGAAGTTATTGAGGACACCTCAAATGTTCACGAGGTGGTCACCACTGTAAAACTAGCAAGTTACAAGGGGCCCCAAACATCTAAAACCATAACCATGAACGCTGCTGAATCCCCTAGTTGCAGCACCACCAACAAAAGCGGCCTAACTGCCAAAATAACCGGGGTAATTCCAGCTGTGGAAGGAGTAGTCAAACCACAACCCTCACTTGCTGTTGCAAGCTTCAATATTCAACCTCAAGTGGTGCCACCTTTATCTCCAATTCGATATCAAATACAACAAACCTCCAGCCCACCGAAAATGTTGCTGATTGTGCATCAGAAACCATCAAACACTTTTACTGCGGTGCCTGTGATAAGATTGCCATCAGCTCCAAGTGCCTCTAGTCAAATTATTGATCCAACCCCAGTTGTGATTAGTGATTCGTCTAGTGACTCAGAGGAGGAAGGGAAGCTTGTTGTGGTCACAGAAAGTGAAGCTACTGCCAGTTCTATGGTTTCGTCGACAACTAAACCAAAGATTTTTGTTAGAACTGACCTCGGGAAtgaGTTATACTCCACAACTTTCGGTTTGGAATGCAATTATTTCATGGACACCCCCAACTTGCCCGATCTAGACCCATCGGAGCTCAACGAATCCGACGCTGAGCTCCTAAACCTCCTAACAGTCGAAGAAGACTGCTCAGCTATTCTACACAACATCGCCACTAACGAGCAATTCCAAAGTATCATAAGCAACGAGGCTCATCTTCTACCAAACGATAACAACCTGGAGCAATTCATGTGTCCTGTTGAGGTGCAACCTCCTGTTGCTCCTACTATTTTGGAAGCTCCAAAAGGTTTCAAGTGTGATCAGTGTGGAAGAGTTAGTAATACGAAGAAGCTACTAAAGAAGCATTTGTTAATCCACACGGCTGAGCGTAGGTTTCCTTGCGAGGTTTGCGGGAAGCTGTTCAGGCACCGCTATGAGGTCACTGCGCACCAACGCAGCCATAATAAACCGAGTTTTCAATGTGAGATTTGCTCGAAGATGTTCATACACAAATCCCACTTGACTACGCATCGTCGTAAGCATTTGGGTGAGTTTGTAGCGTTTTGTACTCAGTGCCAAATTGGGTTTGTCTCGAATAGCTCCTTCAAAACTCATCAGAGTATCCACCATGACAACCAACAGCTCATCTGCGAAACTTGCGGAGCACGATTGAGCTCCAATAGCGCTTTGAAGGAACACAAACTCACCCATGACAAGAATTATGGGCAACAAAGATCGCACATTTGCGAAATCTGCGGGAAAAGCTATCTAACAACTCGAAATTTGAGGAGCCACATGAAGGTCCACGCTAAAGTCAAACCGTACATCTGCTCAATCTGTGGCAAGTCGGTTAGTAGTAAGAAGATTTTAGAAACGCACCAGAAGATGCATACTGGCGTGCGGGATTTTCATTGTGAGGTATGCAATAAGAGTTTTGCTTCCAAGGAGTACTTGACAGTTCATCAGAGGATCCACGTGGGGGATAAACCGTATGAGTGTACCATGTGTGGGAAGCGCTTCACGCAGAAAACTAGCTTGACTGTACATTTGAGGTATCACACTGGACAAAGACCGTATAAGTGTGAATGTGGTAAGGAGTTTACGACGAAGAGTCACTTGATGACGCATTATAAGACTCATGATGTGGGAGGGGTGGAAATTGAGAACACAAACGACGAATCCGACCAAATCATAAACTACGAGTGTCCTCTTTGTAGCTCCATCGAACTCACCAAAGCAACCCTCTTTGCCCACCTTCTAAAACACTGTACCACTCTTCCGTTTATTTGCCCAATTTGCAATCGCATGTTTTGCGATAGTACCAAGTTTGCATCTCACATAATCGACGAGCATTTCGACTTTCAACCTACACAGTCTGAGGCAATGGTCCAATGTGACCTTTGTCATAAGCAGCTGAAGAATTCCCGAAGCTTAAGGTTGCATAAACAGCGCCATAACCAGAAACAACCAAATTGTAAATGCGGGAAAAGCTTCAAGTCTCACAAAGAGTACCAAATTCACTTGGCTACCAACCATAATGAAGGTCCCTATTCGTGTGGGGTGTGCCACAAGGCTTGCTACACAAAGTACTCGTTGAAACTCCATGAGGAGTTGCATGTTGAAGGGAAATGTCAAGAGTTTTCGTGTGGGAGGTGtgggaaaatttttgatacaGTCTGTTTGTTGAAGAATCACATTCGACAGGTGCACTTGAAGCAAGTGAGGAAGCATATATGTGCGACTTGTGGCAAATGCTTTACTACTAATCACAGTTTGAAGGGGCATTTGTTGGTTCATAGTGGCGAGTTGCCGTTTGAGTGTTCGATTTGCGATAAGCGGTTTAGATTGAAGACTAGTTTGACTATTCATGTTAGGGTTCATACTGGGGAAAAACCGTACAAGCGAGTCGAAGATAAATCCGGCATTCTTATAGCGGAACCTCAATCTAAAACTCATGAAGTCACGTGTGAAGTATGcggaaaaatatattctacGCTCTATTTTAAAAAGCACTTAAAAAAACACCAAGGCCAAGCTGGTTTTTCTTGTGGCCACTGCTTGGAGACCTTCAGTTTAAAAAGCGAGCTAACCAAACACCAGCTGACGCACAAAACATACCAATGCCAACTATGTCCTAAAGTCTTACAAAACTTGCAGTCGCTTTCTTGGCACACCAGAAGACATTCAGAAGGCAACTCAAAAGATTACCAATGCGACCAGTGCAACCTTCTCTTCACCAACAAAGAAGTCTTCCGACAACACATGTGGGCCAATCACAAGTTACGAATCCACAACTGTCATATTTGTGGCCAAGGTTATTATAACGAGTCCACTTTGAAGGTTCACATCAGGAGTATGCATGAGGACTCCTCCAACAACCTTTTCAAGTGCAACAGTTGCGAGAAAACCTTTCGTTTGAAGCTCTATCTCGACAAACATGTTAGGATAATCCataagaattacggaaatcaCGTTTGCGATATTTGCGGTAAAACCTTGACTACCTACGCGGGTTTGAAAGCACATCTTTTGGTACATTCCGGGAAAAAACCACACGTTTGTCATATTTGTTCGAAGGGGTATAATAAGAggcaaaatttgaatttccatGTGAAGGCCAGTCATATGGGTTTGGAGAAGAAGTATCGATGTACGGAGTGTGGTATGTGCTTTGCTAGAAATCCGGACTTGAAGTTGCATACTATGCGACACACGAAGAAATATTCGCATTGTTGTGAGAATTGTGGGAGGGGGTTCACCTCGAATGCGTCGAAAAAGAAGCATATTGAGAAGAGGGTTTGTATTAGGTTACAATTGCAACCTCAATGCGGCGACTGTGGGAAAATTTGCAAGAACACTAGAAAGTTGAACCAGCATCGCCGCTCTCATTCCACAGCTCGTCCTTTCCCATGCCCTCTTTGTCCTAAACGCTTTAAACTCCAACAGCAACTAAATTGCCATAAGAAAATCCACGACGGTACCTCAAGCTTCCAATGCGAAATCTGCCTGAAAACTCTTAAGTCTAAAGGTTCTTTACTAATCCACCGCCGTCGCCATTTAAAAGACTTCGTCGCCAAGTGCAAACTATGCGATTTAGGTTTCGTAACAAATCAAGAGTACCTCAACCACATGGGAGCCAAACATGGCCAAAGCCAGCATATTTGCACAGTTTGCAATCGAAGTTGCTACGATAAATCGGCGTTGCAAAGCCACATGGCTAGACACGCCCCTAACTACAACACTAACACTGATATTAAGTGTCCTTTATGTGATAAGACTTTCCTACAGAACCACTATTTGAAAGAACATTATACACGAGTGCACAAAGATGGTGGTCAGAATTTCATGTGTGATTTGTGTGGGAAAAACGTGAGGAGTAAGAGGAGTTTGCGTGATCATTTATTGATACATCAAGGCTTGAAGCCTTTAGAGTGTCCGGAGTGTGGACGAGGGTTTGCGTTGAGGACTACGCTTAAATTGCATTTGAGGACGCATACTGGACAAAGACCGTACGTTTGTAGTGAGTGTCAGAAGAGTTTTACGCAAAAAACGGCGTTGAATGTACATATGAGGTATCATACGGGGGAAAGACCGTATGTCTGTGATATTTGTAGTGGGGGGTTTGTGACTAAAGGTGCCATGAAGGTTCACATGAGGAACAAACATGGGAAGATGCTTTTGGCAAAACCCAAATCTGGTTTCGTTCCCATCACTTACACCAAGTACCTTTGTTCTATGTGCCTTAACTACTTTCAAAGTCTCAACGAGTTAAACCAGCACATATCCAAAGTGTGCACTAGGAGGAAGTACAAATGTTCCCAGTGTGGCTCTGTTTGGAAACTACGAAGCGACCTTAAGGCCCACTTGAAGACCCATCAAGCCAAAACGCGTTACCCTTGCACTAAATGTGGTAAACCTCTTGCGAATAATCTAGCGTTAGATGCACACTTATACAACTATCatactaaatattttgacttaacaCCTTCAGGAGTGTTCCAGTGTAAGTTGTGCGATGTTCATAAGGCTACTAAGGGTGATGTTCTAGAACACATCAACGCAAATCACTTGCAAGTTACCACGTTTCTTTGTGATATCTGTGggaagaatctgaagaataaGAAGAATTTGTTGAGGCATAAGGTGGTACACTCCAAAGAGAAACCTTATGGATGTAGTGTTTGTTCTAAGGTTTTTAAAGGAGCTGCGCATTTGAGACAGCATATTATAACGCATAATCCTGACAGGAGTTTCATTTGTGAGGAGTGTGGGAAGGCGTTTAAGGGTAGTCGTGGGTTGCTAGCGCATCAAAAGAGGGTAAACTGTGTTATATACGAGTATCTAGACGATGATGACGAAGAGGAGGAAGATGAAAACCAAGACACCACTAATCATGTGCCTACAAATTTACAAACCTACACTTGCGAAATGTGCGAGGAAGTCTTAACTTCCCACATGACTATAGCCTTACATTCCATAAAACACAATACAGATCGCTACTACTGCCACTTTTGCTTGTACAAAACGCCGTCTAAGAAACTAATACGAGATCACATGATCAAACATGTTGATACCCGAGGCAAGCGATAG
Protein Sequence
MCKKCHVDVDNILLNKSYVFVNNTVNLQKKRKVFTCEICGDTFTNFNQFGRHKRHHIFRCVRCNMFFSNAQDVMYHVSTTKLVCSFECLKCPKTFTSRQAFSVHMRWHKLFEYYCSQCNAGFEDYFELKRHSVRKHSGVVKYKKHPGTDKVKKQLGTVKLEKLQKQIAAKLHKILIPKEGEAEVRFSCEECFDEFSDQEALDEHTILHKELNEATQRKSIATDDEVEVIEDTSNVHEVVTTVKLASYKGPQTSKTITMNAAESPSCSTTNKSGLTAKITGVIPAVEGVVKPQPSLAVASFNIQPQVVPPLSPIRYQIQQTSSPPKMLLIVHQKPSNTFTAVPVIRLPSAPSASSQIIDPTPVVISDSSSDSEEEGKLVVVTESEATASSMVSSTTKPKIFVRTDLGNELYSTTFGLECNYFMDTPNLPDLDPSELNESDAELLNLLTVEEDCSAILHNIATNEQFQSIISNEAHLLPNDNNLEQFMCPVEVQPPVAPTILEAPKGFKCDQCGRVSNTKKLLKKHLLIHTAERRFPCEVCGKLFRHRYEVTAHQRSHNKPSFQCEICSKMFIHKSHLTTHRRKHLGEFVAFCTQCQIGFVSNSSFKTHQSIHHDNQQLICETCGARLSSNSALKEHKLTHDKNYGQQRSHICEICGKSYLTTRNLRSHMKVHAKVKPYICSICGKSVSSKKILETHQKMHTGVRDFHCEVCNKSFASKEYLTVHQRIHVGDKPYECTMCGKRFTQKTSLTVHLRYHTGQRPYKCECGKEFTTKSHLMTHYKTHDVGGVEIENTNDESDQIINYECPLCSSIELTKATLFAHLLKHCTTLPFICPICNRMFCDSTKFASHIIDEHFDFQPTQSEAMVQCDLCHKQLKNSRSLRLHKQRHNQKQPNCKCGKSFKSHKEYQIHLATNHNEGPYSCGVCHKACYTKYSLKLHEELHVEGKCQEFSCGRCGKIFDTVCLLKNHIRQVHLKQVRKHICATCGKCFTTNHSLKGHLLVHSGELPFECSICDKRFRLKTSLTIHVRVHTGEKPYKRVEDKSGILIAEPQSKTHEVTCEVCGKIYSTLYFKKHLKKHQGQAGFSCGHCLETFSLKSELTKHQLTHKTYQCQLCPKVLQNLQSLSWHTRRHSEGNSKDYQCDQCNLLFTNKEVFRQHMWANHKLRIHNCHICGQGYYNESTLKVHIRSMHEDSSNNLFKCNSCEKTFRLKLYLDKHVRIIHKNYGNHVCDICGKTLTTYAGLKAHLLVHSGKKPHVCHICSKGYNKRQNLNFHVKASHMGLEKKYRCTECGMCFARNPDLKLHTMRHTKKYSHCCENCGRGFTSNASKKKHIEKRVCIRLQLQPQCGDCGKICKNTRKLNQHRRSHSTARPFPCPLCPKRFKLQQQLNCHKKIHDGTSSFQCEICLKTLKSKGSLLIHRRRHLKDFVAKCKLCDLGFVTNQEYLNHMGAKHGQSQHICTVCNRSCYDKSALQSHMARHAPNYNTNTDIKCPLCDKTFLQNHYLKEHYTRVHKDGGQNFMCDLCGKNVRSKRSLRDHLLIHQGLKPLECPECGRGFALRTTLKLHLRTHTGQRPYVCSECQKSFTQKTALNVHMRYHTGERPYVCDICSGGFVTKGAMKVHMRNKHGKMLLAKPKSGFVPITYTKYLCSMCLNYFQSLNELNQHISKVCTRRKYKCSQCGSVWKLRSDLKAHLKTHQAKTRYPCTKCGKPLANNLALDAHLYNYHTKYFDLTPSGVFQCKLCDVHKATKGDVLEHINANHLQVTTFLCDICGKNLKNKKNLLRHKVVHSKEKPYGCSVCSKVFKGAAHLRQHIITHNPDRSFICEECGKAFKGSRGLLAHQKRVNCVIYEYLDDDDEEEEDENQDTTNHVPTNLQTYTCEMCEEVLTSHMTIALHSIKHNTDRYYCHFCLYKTPSKKLIRDHMIKHVDTRGKR

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-