Basic Information

Gene Symbol
-
Assembly
GCA_963971405.1
Location
OZ020490.1:30079051-30093547[-]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 32 0.065 4.3 8.5 5.4 1 23 126 148 126 148 0.97
2 32 0.0058 0.38 11.8 0.2 1 23 154 176 154 176 0.97
3 32 0.00036 0.024 15.6 3.6 1 23 182 204 182 204 0.97
4 32 1.5e-05 0.001 19.9 1.0 2 23 211 232 210 232 0.97
5 32 2e-05 0.0013 19.5 1.3 1 23 238 260 238 260 0.99
6 32 0.00016 0.011 16.7 1.5 1 23 266 288 266 288 0.98
7 32 9e-07 6e-05 23.8 1.0 1 23 294 316 294 316 0.99
8 32 0.00013 0.0084 17.0 0.4 1 23 322 344 322 344 0.99
9 32 0.3 20 6.4 0.7 3 23 1146 1167 1145 1167 0.97
10 32 0.00031 0.021 15.8 3.3 1 23 1178 1200 1178 1200 0.98
11 32 1.6e-05 0.0011 19.8 2.7 1 23 1228 1250 1228 1250 0.99
12 32 3.4e-05 0.0023 18.8 1.7 1 23 1256 1278 1256 1278 0.98
13 32 0.00017 0.011 16.6 2.2 1 23 1284 1306 1284 1306 0.99
14 32 1.2e-05 0.00078 20.2 1.8 1 23 1311 1333 1311 1333 0.99
15 32 0.0058 0.38 11.8 4.4 1 23 1339 1361 1339 1361 0.99
16 32 0.00024 0.016 16.2 1.1 1 23 1367 1389 1367 1389 0.99
17 32 0.00054 0.036 15.0 1.5 1 23 1395 1417 1395 1417 0.99
18 32 0.027 1.8 9.7 0.9 1 23 1423 1445 1423 1445 0.99
19 32 2.6e-06 0.00017 22.3 2.1 1 23 1451 1474 1451 1474 0.98
20 32 0.12 8.1 7.6 1.2 3 23 1576 1597 1575 1597 0.98
21 32 0.00031 0.021 15.8 3.3 1 23 1608 1630 1608 1630 0.98
22 32 1.6e-05 0.0011 19.8 2.7 1 23 1658 1680 1658 1680 0.99
23 32 3.4e-05 0.0023 18.8 1.7 1 23 1686 1708 1686 1708 0.98
24 32 0.00017 0.011 16.6 2.2 1 23 1714 1736 1714 1736 0.99
25 32 1.7e-05 0.0011 19.7 1.9 1 23 1741 1763 1741 1763 0.98
26 32 0.02 1.3 10.1 4.3 1 23 1769 1791 1769 1791 0.99
27 32 5.3e-05 0.0035 18.2 2.5 1 23 1797 1819 1797 1819 0.99
28 32 0.00056 0.037 15.0 1.0 1 23 1825 1847 1825 1847 0.98
29 32 7.3e-05 0.0048 17.8 1.6 1 23 1853 1875 1853 1875 0.99
30 32 0.021 1.4 10.1 3.5 2 23 1882 1903 1881 1903 0.98
31 32 0.038 2.5 9.2 1.0 1 23 1909 1931 1909 1931 0.98
32 32 1.8e-05 0.0012 19.7 1.7 1 23 1937 1960 1937 1960 0.95

Sequence Information

Coding Sequence
ATGGATATAGTTATAAAATTGGAGCCAGATGATTCTGAGGTGTCATTTGATACTGAACCTGGCCATCTCGTCGATGGCGTAAATAAAGAAGAAAGCGTTAAGTGTGAAATAGCAGGAGCCCTGTGTGATTTAACGGAAGAGAAAGCATTTCAAATAACAGACAAGTGCCTAATAATGGAAGGAGACGAGGGTGTAATACCTAAAGGAGCCAAGCTGGAAATTGCGGAAGGAGAAAAGTGTAATATAAAGGAAGAAGACGACGAGGATTCAAATTTCCCCTCTCCGGGGGCACCTGAAGACGTGAAAGTGCTGACTATCGATGTGAGCATCAGGAATAGTGGAAGGGGCAGAAAGCCCATAACTGATTCACAACAGCATCAGTGCAAAGTCTGTCAAAAATATTTCGCCAGTGGCCATAAGCTTAAGCTACACGTAAGGTTTCATACCGGAGAGAAACCTTACGAGTGCATTATATGCGCAAAACGCTTTGCGCTAAGCGGCACTCTCCAAGCCCATACAAGAACCCATACCGGGGAAAAACGGCACCAGTGTTCTGTTTGCATGAAGTCATTTACACAGAGTACTAACTTACGTAGACATTCCCAAATTCATATAGGAGAGAAGCCTTTACAGTGCACGGTATGTTCGAAATATTTTCGCCAGAGCGACGATCTACGGAAACACGCGAGGATCCATACAGGGGAGAAAGCTTATAAGTGCATGGTTTGCTCGAAGGGTTTCACGTCGAGTAGCGAAGTTCGGAGGCACGAGAGGACGCACACTCGGGAGAAGCCGCACCAGTGTCAGGTCTGTCTGAAGTCGTTTACGCAGGCGATTACGTTGCAGAAGCATGGGAGAATACACACCGGGGAAAAGCGTTACGAGTGTACGGTTTGTTCGAAGAAGTTTACGGACAGTGGGGACTTACGGAAGCACGCGAGAACCCATACTGGGGAGAAGCCCTTCCAGTGTAAAATATGTGGACAATGGTTTGGGGATAGGAGTAATTTGGGTAGGCACGTGAGGAGGCATAACATGGGGGTGGAAGGAGAAAACATTGTTGACTGTGACATGTCAGGAGATGTGCCAGCATTTGTTGAAGGAGATGTACCCGAGGCCAACTTAGAGGCGATCGAAGGGACTGGAGCAGCTATCAAAAGTGAACCAGGCAAACGTTTCATCACTTACACCAGTCCACGGGAACatgaaattggaaatcatggTGGTGTAgaggtaaaaaaagaaaacattgttgACTGTGACATGTCAGGAGATGTGCCAGCATTTGTTGAAGGAGATGTACCCGAGGCCAACTTAGAGGCGATCGAAGGGACTGGAGCAGCTATCAAAAGTGCACCAGGCAAACGTTTCATCGCTTATGCCAGTCCACGGGAACatgaaattggaaatcatggTGGTGTAgaggtaaaaaaagaaaacattgttgACTGTGACATGTCAGGAGATGTGCCAGCATTTGTTGAAGGAGATGTACCCGAGGCCAACTTAGAGGCGATCGAAGGGACTGGAGCAGCTATCAAAAGTGAACCAGGCAAACGTTTCATCACTTATGCCAGTCCACGGGAACatgaaattggaaatcatggTGGTGTAgaggtaaaaaaagaaaacattgttgACTGTGACATGTCAGGAGATGTGCCAGCATTTGTTGAAGGAGATGTACCCGAGGCCAACTTAGAGGCGATCGAAGGGACTGGAGCAGCTATCAAAAGTGCACCAGGCAAACGTTTCATCACTTATGCCAGTCCACGGGAACatgaaattggaaatcatggTGGTGTAgaggtaaaaaaagaaaacattgttgACTGTGACATGTCAGGAGATGTGCCAGCATTTGTTGAAGGAGATGTACCCGAGGCCAACTTAGAGGCGATCGAAGGGACTGGAGCAGCTATCAAAAGTGAACCAGGCAAACGTTTCATCACTTATGCCAGTCCACGGGAACatgaaattggaaatcatggTGGTGTAgaggtaaaaaaagaaaacattgttgACTGTGACATGTCAGGAGATGTGCCAGCATTTGTTGAAGGAGATGTACCCGAGGCCAACTTAGAGGCGATCGAAGGGACTGGAGCAGCTATCAAAAGTGCACCAGGCAAACGTTTCATCGCTTATGCCAGTCCACGGGAACatgaaattggaaatcatggTGGTGTAgaggtaaaaaaagaaaacattgttgACTGTGACATGTCAGGAGATGTGCCAGCATTTGTTGAAGGAGATGTACCCGAGGCCAACTTAGAGGCGATCGAAGGGACTGGAGCAGCTATCAAAAGTGAACCAGGCAAACGTTTCATCACTTATGCCAGTCCACGGGAACatgaaattggaaatcatggTGGTGTAgaggtaaaaaaagaaaacattgttgACTGTGACATGTCAGGAGATGTGCCAGCATTTGTTGAAGGAGATGTACCCGAGGCCAACTTAGAGGCGATCGAAGGGACTGGAGCAGCTATCAAAAGTGCACCAGGCAAACGTTTCATCACTTATGCCAGTCCACGGGAACatgaaattggaaatcatggTGGTGTAgaggtaaaaaaagaaaacattgttgACTGTGACATGTCAGGAGATGTGCCAGCATTTGTTGAAGGAGATGTACCCGAGGCCAACTTAGAGGCGATCGAAGGGACTGGAGCAGCTATCAAAAGTGAACCAGGCAAACGTTTCATCACTTATGCCAGTCCACGGGAACatgaaattggaaatcatggTGGTGTAgaggtaaaaaaagaaaacattgttgACTGTGACATGTCAGGAGATGTGCCAGCATTTGTTGAAGGAGATGTACCCGAGGCCAACTTAGAGGCGATCGAAGGGACTGGAGCAGCTATCAAAAGTGAACCAGGCAAACGTTTCATCACTTATGCCAGTCCACGGGAACatgaaattggaaatcatggTGGTGTAgaggtaaaaaaagaaaacattgttgACTGTGACATGTCAGGAGATGTGCCAGCATTTGTTGAAGGAGATGTACCCGAGGCCAACTTAGAGGCGATCGAAGGGACTGGAGCAGCTATCACAAGTGAACCAGGCAAACGTTTCATCACTTATGCCAGTCCACGGGAACatgaaattggaaatcatggTGGTGTAgaggtaaaaaaagaaaacattgttgACTGTGACATGTCAGGAGATGTGCCAGCATTTGATGAAGGAGACACCAACCAAGAGGCGGTAGACGTGATTGGAGTATCAATCAAAAGTGAGCTTGGCCCAACGACTGACGCAGCTAGGGCCCTGCACGATCCTGAAAATACAGACGAAAAACTATTTCAATGCTTTGTCTGTAGGGTTAGATTTCCTTGCGAGGGAGACTTACTTGAACACAAGGTAAGAGAAAAACACGGCCGTAATATAGGAGAAAAACCCCATGGGTGCAGTCAATGCAGGGCACGTTTCTTCACTTACGCGAGTCTGCAGGAACATGAAAAATCAACACATAACGACGTACGAAAACAATCTACTCAGACTCACCAGTGTCCTGTGTGTTCGAAATGTTTGCCTTCGAAGTATTCATTGGAGAAGCATACAATAACGCATAAGGGAAAGAAGCTTGAAGAATGTTTGTCTTCCAGTGGTTCGCTGGAGAACCATAGAATAACAAATACAGAAGAAAACCCTTACCAGTGTCCTATTTGTTCGAAATGTTTGTCTTCAAAGCGTTCACTGGAAGAGCATACAAGAACACATACAGGAGAGAAGACATATCAGTGTACTGTTTGTTCAAAAACCTTTTCTGCAAAACGTTCGCTGAAGAAGCATGCCGAAACTCATACCGGAAAGAAGCCTTACCAGTGCGTGATATGCTCCAAGTGTTATAAATCGAGTGATGGGCTATGGTTACATAAAAGATCGCATACGACTGAGCGTTATACGTGCGAGACATGCTCAAAACAGTTCACTACTGAGAAGTCGCTCCTAAGGCATATGCAAATACATTCAGGGGAGGCCCCGTACCAATGTCCTGTTTGTTCGAAATGTTTTCTACAGAAGTGCACGTTGCAATTGCACGCGAGAATACATTCGGGGgagaagccttacaagtgtgaTGTATGCTCGAGGGGCTTCGGTTATAAAAATAACTTGCTAACGCATAAGCGAGTGCACACTGGAgagaagccttacaagtgtgaAGTCTGCCTGAAATCGTTTACGCAAGTGAATACATTACTGAATCATAGTCGAAGACATTCAGGCGAGAGGCGTTATGAGTGTAAGGTGTGCTTGAAGAAGTTCCTGGTTAGTGGGGATATGCGGAAACATGAGAGAACCCATACTGGAGAGAGGCCTTTCCAGTGTAACATATGTGGGCAATGGTTTAGAGATAGGAGTAATTTGCGCAAGCACATGAAGAGGAAACATAATAAGGAGGTGTTTGCCGGGGTTAAAGAAGAAAACATTGCTGACTCTGACATGTCAGAAGATGTGCCAGCATTTGTTGAAGGAGCCGCCAACTTAGAGGCGGTAGACGTGATTGGAGTATCAATCAAAAGTGAGCTTGGCCCAACCACTGACGTGGCCAGGGCCCTGCACGATCCTGAAAATGCAGATGAAAAACTATTTCAATGCTTTGTCTGTAGGGATAGATTTCCTTGCGAGGGAGACTTACTGGAACACAAGATAAGAGAAAAACACGGCCGTGACAGAGGAGAAAAACCCCATGGGTGCAGTCAATGCAGGGCACGTTTCTTCACTTACGCGAGTCTGCAGGAACATAAAAAGACAACACATAACGACGTACGAAAACAATCTACTCAGACTCACCAGTGTCCTGTGTGTTCGAAATGTTTGCCTTCGAAGTATTCATTGGAGAAGCATACAATAACGCATAAGGGAAAGAAGCTTGAAGAATGTTTGTCTTCCAGTGGTTCGCTGGAGAACCATAGAATAACAAATACAGAAGAAAACCCTTACCAGTGTCCTATTTGTTCGAAATGTTTGTCTTCAAAGCGTTCACTGGAAGAGCATACAAGAACACATACAGGAGAGAAGACATATCAGTGTACTGTTTGTTCAAAAACCTTTTCTGCAAAACGTTCGCTGAAGAAGCATGCCGAAACTCATTCTGAGAAGAAGCCTTACCAGTGCGTTATATGCTCCAAGTGTTATAAATCGAGTGATGGGCTATGGCTACATAAAAGATCGCATATGACTGAGCATTTTACGTGCGAGACATGCTCAAAACAGTTCACTACTGAGAAGTCGCTCCTAAGGCACATGCAAATACATTCAGGGAAGGCCCCGTACCAATGTCCTGTTTGTTTGAAATGTTTTCTGCGGAAGTGCACGTTGCAACTACACGCTAGAATACATTCGGGGgagaagccttacaagtgtgaTGTATGCTCCAGAAGCTTCGGTTATAAACATAACCTGCTGGCGCATAAAAGAGTGCATACTGGAgagaagccttacaagtgtgaAGTCTGTCTGAAATCGTTTAGACAAGTGAATACATTACTGAATCATAGTCGAATACATTCAGGGgagaagccttacaagtgtgaTGTATGCTCGAGGAGCTTCGGTTATAAAAACAACTTGCTGACGCATAAAAGAGTGCATACTGGAGAGAAGCCTAACAAGTGTGAAGTCTGCCTGAAATCGTTTAAGCATGTGAATACATTGTTGAATCATAGTCGAAGACATTCAGGGGAGAGGCGATTTGAGTGTAAGGTGTGCTTGAAGAAGTTCCTGGTTAGTGGGGACATGCGGAAACATGAGAGAACCCATACTGGAGAGAGGACTTTCCTATGTAACATATGTGGGCAATTGTTTAGGGGTAAGAGTAATTTGCGGAAGCACCTGAGGAGGAAACATATTAAGGTACGGTTGGCCGGTGAGAATGCAGCCGTTACAGCGTCGGCCACGGATGATTGA
Protein Sequence
MDIVIKLEPDDSEVSFDTEPGHLVDGVNKEESVKCEIAGALCDLTEEKAFQITDKCLIMEGDEGVIPKGAKLEIAEGEKCNIKEEDDEDSNFPSPGAPEDVKVLTIDVSIRNSGRGRKPITDSQQHQCKVCQKYFASGHKLKLHVRFHTGEKPYECIICAKRFALSGTLQAHTRTHTGEKRHQCSVCMKSFTQSTNLRRHSQIHIGEKPLQCTVCSKYFRQSDDLRKHARIHTGEKAYKCMVCSKGFTSSSEVRRHERTHTREKPHQCQVCLKSFTQAITLQKHGRIHTGEKRYECTVCSKKFTDSGDLRKHARTHTGEKPFQCKICGQWFGDRSNLGRHVRRHNMGVEGENIVDCDMSGDVPAFVEGDVPEANLEAIEGTGAAIKSEPGKRFITYTSPREHEIGNHGGVEVKKENIVDCDMSGDVPAFVEGDVPEANLEAIEGTGAAIKSAPGKRFIAYASPREHEIGNHGGVEVKKENIVDCDMSGDVPAFVEGDVPEANLEAIEGTGAAIKSEPGKRFITYASPREHEIGNHGGVEVKKENIVDCDMSGDVPAFVEGDVPEANLEAIEGTGAAIKSAPGKRFITYASPREHEIGNHGGVEVKKENIVDCDMSGDVPAFVEGDVPEANLEAIEGTGAAIKSEPGKRFITYASPREHEIGNHGGVEVKKENIVDCDMSGDVPAFVEGDVPEANLEAIEGTGAAIKSAPGKRFIAYASPREHEIGNHGGVEVKKENIVDCDMSGDVPAFVEGDVPEANLEAIEGTGAAIKSEPGKRFITYASPREHEIGNHGGVEVKKENIVDCDMSGDVPAFVEGDVPEANLEAIEGTGAAIKSAPGKRFITYASPREHEIGNHGGVEVKKENIVDCDMSGDVPAFVEGDVPEANLEAIEGTGAAIKSEPGKRFITYASPREHEIGNHGGVEVKKENIVDCDMSGDVPAFVEGDVPEANLEAIEGTGAAIKSEPGKRFITYASPREHEIGNHGGVEVKKENIVDCDMSGDVPAFVEGDVPEANLEAIEGTGAAITSEPGKRFITYASPREHEIGNHGGVEVKKENIVDCDMSGDVPAFDEGDTNQEAVDVIGVSIKSELGPTTDAARALHDPENTDEKLFQCFVCRVRFPCEGDLLEHKVREKHGRNIGEKPHGCSQCRARFFTYASLQEHEKSTHNDVRKQSTQTHQCPVCSKCLPSKYSLEKHTITHKGKKLEECLSSSGSLENHRITNTEENPYQCPICSKCLSSKRSLEEHTRTHTGEKTYQCTVCSKTFSAKRSLKKHAETHTGKKPYQCVICSKCYKSSDGLWLHKRSHTTERYTCETCSKQFTTEKSLLRHMQIHSGEAPYQCPVCSKCFLQKCTLQLHARIHSGEKPYKCDVCSRGFGYKNNLLTHKRVHTGEKPYKCEVCLKSFTQVNTLLNHSRRHSGERRYECKVCLKKFLVSGDMRKHERTHTGERPFQCNICGQWFRDRSNLRKHMKRKHNKEVFAGVKEENIADSDMSEDVPAFVEGAANLEAVDVIGVSIKSELGPTTDVARALHDPENADEKLFQCFVCRDRFPCEGDLLEHKIREKHGRDRGEKPHGCSQCRARFFTYASLQEHKKTTHNDVRKQSTQTHQCPVCSKCLPSKYSLEKHTITHKGKKLEECLSSSGSLENHRITNTEENPYQCPICSKCLSSKRSLEEHTRTHTGEKTYQCTVCSKTFSAKRSLKKHAETHSEKKPYQCVICSKCYKSSDGLWLHKRSHMTEHFTCETCSKQFTTEKSLLRHMQIHSGKAPYQCPVCLKCFLRKCTLQLHARIHSGEKPYKCDVCSRSFGYKHNLLAHKRVHTGEKPYKCEVCLKSFRQVNTLLNHSRIHSGEKPYKCDVCSRSFGYKNNLLTHKRVHTGEKPNKCEVCLKSFKHVNTLLNHSRRHSGERRFECKVCLKKFLVSGDMRKHERTHTGERTFLCNICGQLFRGKSNLRKHLRRKHIKVRLAGENAAVTASATDD

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-