Basic Information

Gene Symbol
-
Assembly
GCA_958510855.1
Location
OY294038.1:11301508-11327299[-]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 49 0.032 2.1 9.9 1.2 1 23 47 70 47 70 0.95
2 49 7e-05 0.0046 18.3 0.6 1 23 75 97 75 97 0.94
3 49 0.55 37 6.0 0.5 2 23 104 125 103 125 0.93
4 49 2.1e-05 0.0014 19.9 0.4 1 23 131 154 131 154 0.98
5 49 3.3e-05 0.0022 19.3 3.5 2 23 159 180 158 180 0.97
6 49 0.004 0.26 12.8 1.1 1 23 189 211 189 211 0.97
7 49 0.00059 0.039 15.4 2.8 1 23 217 239 217 239 0.95
8 49 3.1e-06 0.00021 22.5 3.0 1 23 245 267 245 267 0.98
9 49 4.2e-06 0.00028 22.1 0.2 1 23 358 380 358 380 0.98
10 49 0.0003 0.02 16.3 2.8 1 21 386 406 386 407 0.95
11 49 0.016 1.1 10.9 2.9 2 23 415 436 414 436 0.96
12 49 2.9 1.9e+02 3.8 6.2 1 23 544 566 544 567 0.95
13 49 0.016 1.1 10.8 4.9 1 23 573 595 573 595 0.98
14 49 0.00064 0.043 15.3 0.6 1 23 627 650 627 650 0.98
15 49 9.8 6.5e+02 2.1 0.1 2 13 655 666 654 669 0.88
16 49 0.00015 0.0098 17.3 2.2 1 23 756 778 756 778 0.97
17 49 6.5e-05 0.0043 18.4 1.8 2 23 785 806 784 806 0.97
18 49 0.0059 0.39 12.2 0.2 3 23 811 831 810 831 0.97
19 49 6.2e-05 0.0041 18.4 0.3 1 23 837 860 837 860 0.98
20 49 0.0036 0.24 12.9 0.2 2 23 865 887 864 887 0.96
21 49 0.013 0.85 11.2 2.3 1 23 893 916 893 916 0.97
22 49 2.2 1.5e+02 4.1 0.2 1 23 922 944 922 944 0.77
23 49 1.5e-05 0.001 20.4 0.6 1 23 950 972 950 972 0.97
24 49 3.8e-05 0.0025 19.1 3.7 1 23 978 1000 978 1000 0.96
25 49 3.9 2.6e+02 3.4 0.1 1 10 1006 1015 1006 1017 0.92
26 49 1.7e-05 0.0011 20.2 1.7 1 23 1143 1165 1143 1165 0.98
27 49 0.0016 0.1 14.0 3.4 1 23 1171 1193 1171 1193 0.97
28 49 0.016 1 10.9 4.0 2 23 1200 1222 1199 1222 0.93
29 49 0.00014 0.0091 17.4 1.3 1 23 1228 1250 1228 1250 0.97
30 49 6.1e-06 0.00041 21.6 3.4 1 23 1256 1278 1256 1278 0.97
31 49 1.9e-05 0.0013 20.0 1.5 1 23 1389 1411 1389 1411 0.98
32 49 0.073 4.8 8.8 3.6 1 23 1417 1439 1417 1439 0.97
33 49 0.0015 0.098 14.1 3.0 2 23 1446 1467 1445 1467 0.97
34 49 0.00085 0.056 14.9 1.1 1 23 1473 1496 1473 1496 0.96
35 49 0.0011 0.074 14.5 0.5 2 23 1501 1522 1500 1522 0.95
36 49 0.0016 0.11 14.0 1.9 1 23 1528 1551 1528 1551 0.96
37 49 3 2e+02 3.7 3.1 1 23 1557 1579 1557 1579 0.89
38 49 0.059 3.9 9.1 5.0 1 23 1585 1607 1585 1607 0.88
39 49 4.4e-06 0.00029 22.1 3.9 1 23 1613 1635 1613 1635 0.97
40 49 1.9e-05 0.0013 20.0 1.5 1 23 1746 1768 1746 1768 0.98
41 49 0.00044 0.03 15.8 3.7 1 23 1774 1796 1774 1796 0.97
42 49 3.1e-05 0.0021 19.4 2.5 2 23 1803 1824 1802 1824 0.97
43 49 0.0027 0.18 13.3 1.3 1 23 1830 1853 1830 1853 0.96
44 49 0.0011 0.074 14.5 0.5 2 23 1858 1879 1857 1879 0.95
45 49 0.00093 0.062 14.8 0.8 1 23 1885 1908 1885 1908 0.96
46 49 3 2e+02 3.7 3.1 1 23 1914 1936 1914 1936 0.89
47 49 0.03 2 10.0 5.1 1 23 1942 1964 1942 1964 0.89
48 49 4.4e-06 0.00029 22.1 3.9 1 23 1970 1992 1970 1992 0.97
49 49 3.3e-05 0.0022 19.3 1.2 1 23 1998 2020 1998 2020 0.98

Sequence Information

Coding Sequence
ATGACCATCAAAACAGAAGAATTTGATGGTGCAATCCCACCACATACCGAAATTAGATGCAAGATGTGCAACAAACTCCTCCCTTTGACAGAATTGGAAAGTCACGTGATTAAAGAGCACGCAGATAAAGAAAAGAATTTTGCATGTGACATTTGTGGAAAACTGTTTAATAAAAACAAGCAACGGAATTATCACAAAAAAATTGTTCACGGGGATCGGAAACATATCTGTTCCGTTTGTAATAAAGCTTTCGCCACACCTAGTGGATTGCGACGTCACGAATCTTACCATAGCTCCGAACGGCCATGGATATGCGATTTCTGCGAAAGCGCTTTTCAGAATAAATCTGCTTTATACTCCCATAAAACACAGCATTCCGAATCAAAAAAATACGTGTGTGATCTTTGCTATAAATCCTTTTCGATTCCATCTCTTTTGAAAAACCACATTCGAAGTTCTCACATTACAAGTGTCGAATGCAAACTTTGCAATGAAACGTTCACCAACTTATACCATTTGAAGAAGCACACCCGATCACACCGAGAGTCCAATAATCTTCAAAGCCATGTTTGCAAAATTTGTGATAAAGTTTTTGATATGGATCTCAGGTTACAGAGGCATCTACTAACTCATAGAAATAAAGATAAGCATATGTGCGATTTGTGCGGAAAAACTTTTAGCACCACCACGAGCTTAAGAGATCACAAGTTTATTCATCTCGATATTAAGCCTTTCGAATGCTATAATTGTGGGAAAaagtttaacaataaaaatactCTCAAGAGCCACATACGAACACATACAGAGAGTAACGTTTTGGAAAATCCTACGGCTCCAGGATTATTAAATATTCCGAAACCTGATGGTAGTATGGTATATGTCGTTCTAAATAATCACAATTCTGATTCGGTGAAGTCGCCTAAATTATATCCACAGCAAGATAAACTTGAAAAGTGGCTGCAAGGTTTGAAACGACTCCAACCTAATTCTTTCGTCACTTGTATATTCTGCAAAGGACTGCTTACGGTTGAGCGAGTGGCTCAACACGTAAACTTAATACATTCAGTATTTTACAATTGTGAGGAATGTGGGAAGTCGTTTCCAAAAAAGTCTGCGCTTATTCTCCATAGAATAACGCATGATGACACCAGAAACTATGAATGTGGAGTTTGTTTGAAACGTTTTAAAACAAAGTATAATTTAAAACATCACACGAAGAGTTACTGTAAGAAAAGAAACTGTGTTTGCCATATTTGCCAAGCCGAATTTAAATATAAACGTTATTTGGCAATTCACGCTCGAAATCATTCTAACGATAGACCGTCCGAATGTAATATTTGTGCTGAAAGGTTATTAAAGATCAAATCAGAGAATAGCACCGCAAAAAGTCTTAAAACTCCTGGATTCGTATATTTTCCAAATACTAATAAAGTTTATCTGGTTTCAAATCCACAAAACTCGGATTCGGTGAATttgccaaaatttgaagtaaaccAAACAAATATTGAAGAATGGATGCAAAGCTTAAACCGGCTGGAACCTAATTCTCTTACCACTTGTGTATTCTGCAAAGGAATTTTTACTGTTGAAAATATGATCCAACATGTGCACTTAGAACATAAAGTAGGGTACAATTGCGAAAAATGTGGAAAGCATTGTTTAACCCCGACTGCGCTTACTTTCCATATAAAAACACATCACGATCACACAATAAGACATAAATGTGGAATTTGTTTCAAACGTATCAAATCAAAATATGCTTTCAAAACTCACATGAAGAACCACCGTGAGAAAGAAAAGTGGGATTGTAATATTTGCTCAGCTTCGTTTAATTGTAAAAGTAATTTGCAATGTCATCTTGGAAAGCACAATGGAAAGTACCAATGTGACCAGTGTGATAAGAACTTCAATGCTCAATGGGCATTAAATGATCATTTTAGGAGCACGCATGCCACAAATGTTGAGTGTAACACCTGCAAGGAAGTATTCCCAAGCAAGTTATTAAAGATCAAATCAGAGAATGACGTTCCGGAAAATCTTACAGCTCCTGGATTCTTATATTTTCCAAATTCTAATAAAGTTTACCTGATTGCAAATCAACAAAACTCTGTTACGGTAAATTTGCCGCCATTGGAActaaatcaagaaaatattgaagaatggATGAAAAGCTTAAAACGGCTGGAACATAATTCTCTTACCACTTGCATATTCTGCCAAGGACTGTTTACTGTTGAAATTATGGCCAAACATGTATACTTAGAACATATGTGCGAAAAATGTGGACAGCAATGTTCTAACCAGGCTGAGCTTATTCTCCATATAAGAACACACGATGACACCAGAATCATCGAATGTGAAGTTTGTTTCAAGCGATTTAAATCAAAAGCTCAGTTAAAAAAGCACAAGAAGATCCACATTGAGTGGCTTTGCTATATTTGCAAAGCATCGTTTAATGATAAAAGTTATTTGGTTGACCACATTGGAAGTCACTGCAGTAATGAAGAGTACGAATGTAATCTTTGTGATAAAAAGTTCAGTTATGCATTGCTCTTAAACGCACATGTTAGGAGTATGCATGCCACCAATCTGAAGTGTAACATCTGCAAAGAAATATTACCAAACTACTATTATTTACAAGAACATATCAGAAGCGtccataaaaatccaatttgCCATGTCTGTCCAGTTTGTAAGAAAGAATTTAATGCGAAACATTTGGCCAGCCGACATTTGAAAACAGTTCATGGAGACAAACAAAAGTTTGTTTGCGATATCTGTGGTAAAATTGTTGGGACTGGTACCATTTTGAGAGACCATAAATTTACTCATTTTGGAATTAAGCCTTTTGAATGTTACGATTGCGGAAAAAAGTTTACTAAGAAAGGTATGTTAAAGATACATGTACTGGTTCACACCGGGTTAAAACCACATGCTTGTAAAATGTGCAACAAGAAATTTTCACAAAGAAATACATTGACGATTCATATCAGAAGTCATACCGGTGAGAGGCCTTACAAATGCGAGTTATGTGATGAAGCTTTTGTTATGGTTCGGATTGGATCTTTCCCATCACGATACGACGCTACGTTACGATTGGCATGCGAAGACCTTGCAAGAATGTTAGTAACGATCAAAACAGAGAATGACGTTTCGGAAAATATTACAGTTCCTGGattattatattttccaaaatctaATAAGGTTTATGTCGTTACAAATCACCAAAACTCTGCTTCGTTGGATTTACCAATTTACGGATTTACGGATAATTATCAATTAGAACCAAACCAAGTCCATATTGATGAATGGACGCAAAGCTTAACCGGTCTTCAACCTAATTCTCTTACCACTTGTATATTCTGCCAGGGAGTGTTTCTTGTTGAAAATATGAGCCAACATGTTTACTTAAAACATACAGTCGGGTACAATTGCGGAAAATGTGGAAAAGAGTGTTCAAACCAGACTGCGCTTACTCTCCATATTAGAACACATGATGAGACCAGAAACCACGAATGTGGAGTTTGTTTCAAacgtttcaaattaaaatatgctttaaatTATCACATGAAGAGCCACAATAAGAAAAAGGAGTGGATGTGCCATATTTGCAAAGCTTcgtttaaatataaaagttctTTGGATAGTCACCATAAATTTATTCATTTGAGCATTAAGCCCTTTGAATGTCATGTTTGCGGGAAAAAGTTTAATACGAAAAATGGGGTGAGGTCCCATGTACTGGTTCATACCAGATTAAAACCACATTCGTGTCAAATGTGCGATAAGAAATTTACGCAGAGAAGTACTTTGGTCATTCATATGAGAAGCCATACCGGGGAGAGGCCTTACCAATGCGACTTGTTAGTAAATATCAAATCAGAGAATAACGTTCCGAAAAATCTTACAGCTCCTGGATTCTTATATTTTCCAAAAGCTAATTTGGTTTATGATATTCCAAATCTGCAAAACTCTGCTGAGCAGAGTTTTGGAGCAGAGTCTTCTCCGGTGGATTTGCCAAAATTGGAACCACACCAAGTTGAAGAATGGATGCAAAGCTTAAAAAAACTAGAACCTAATTCTCTTACCACTTGCATATTCTGCAAACGACTGTTTACGGTTGAAAAAGTTATCCAACATGCCCACTTAAAACATAAAGTAGGGTACAATTGCGAAAAATGTGGAAAGCCGTGTTCAAACCAGACTGCACTTACTCTCCATGTAAGAACACATGATGAGACCAGAAACCACGAATGTGGAGTTTGTTTCAAacgtttcaaattaaaatatgctttaaatTTTCACATGAAGAGCCACAGTGAAAGTAGAGAGTGGGTTTGCCACATTTGCCAAGCGtcatttaaatataaaagttgcATGGTTATTCACATTCGAAGGCACTCCAATACTGGAGACTACGAATGTAATCTTTGCAATAAAAAGTTCAGTAACCCATCGATCTTAAACGAACATTTTAGGAATACCCATGCAACAAATATTAAGTGTAACACCTGCAAAGAAGTATTCCCAAACTACTATTATCTACAAAAGCATAGCATAGTCCATAAAAATCCCAGTCGCCATGTCTGCCCAATTTGTAAGAAGGAATTTAATGTGAAACATTTGGCCACAAGACATTTGAAAGTGGTTCACGGAAACAAGGAAAAGTTTGTTTGCGATATCTGTGGTAAAATTTGTTCGTGTGGTACCAGTTTAAGGGACCATAAATTTATTCATTTGGGCATTAAGCCTTTTGAATGCCACGATTGCGGAAAGaagtttaataacaaaaatcgGATGAGGTGCCATGTGCTGGTTCATACCGGGCTGAAGCCACATTCGTGTAAAATGTGCGATAAGAAATTTACGCAGAGAAGTACTTTGGTCATTCATATGAGAAGCCATACTGGAGAGAAGCCTTACAAATGCGACTTGTTAGTAAATATCAAATCAGAGAATAACGTTTCGAAAAATCTTACAGCTCCTGGATTCTTATATTTTCCAAAAGATAATTTGGTTTATGTCATTCCAAATCTGCAAAACTCTGCTGATCAGAGTTTTGTAGCAGAGTCTGCTCCGGTGGATTTGCCAAAATTGGAACCACAGCAAGTTGAAGAATGGATGCAAAGCTTAAAAAAACTAGAACCTAATTCTCTTACCACTTGCATATTCTGCAAACGACTGTTTACGGTTGAAAAAGTTATCCAACATGCCCACTTAAAACATAAAGTAGGGTACAATTGCGAAAAATGTGGAAAGCCGTGTTCAAACCAGACTGCACTTACTCTCCATGTAAGAACACATGATGAGACCAGAAACCACGAATGTGGAGTTTGTTTCAAAcgtttcaaatcaaaatatgttCTAAATAATCACATGAAGAGCCACAGTGAAAACAGAGAGTGGGTTTGCCACATTTGCCAAGCATCGTTTAAACATAAAAGTTATTTGGTTATTCACATTCGAAGGCACTCCAATACTGGAGACTACGAATGTAATCTTTGCAATAAAAAGTTCAGTTACCCATCGATCTTAAACGAACATTTTAGGAATACCCATGCAACAAATATTAAGTGTAACACCTGCAAAGAAGTATTCCCAAACTACTATTATCTACAAAAGCATAGCATAGTCCATAAAAATCCCAGTCGACATGTCTGCCCAATTTGTAAGAAGGAATTTAATGTGAAATATTTGGCCACAAAACATTTGAAAGTGGTTCACGGAAACAAGGAAAAGTTTGTTTGCGATATCTGTGGTAAAATTTGTTCGTGTGGTACCAGTTTACGGGACCATAAATTTATTCATTTGGGCATTAAGCCTTTTGAATGCCACGATTGCGGAAAAAAGTTTACTAACAAAAATCGGATGAGGTGCCATGTGTTGGTTCATACCGGGCTGAAGCCACATTCATGTAAAATGTGCGATAAGAAATTTACTCAGAGAAGTACTTTGGTCATTCATATGAGAAGCCATACTGGAGAGAAGCCTTACAAATGCGACTTGTGCGATAGGGCATTTGTTAGTCGAAGTTTATTAAATACTCATAAGAAAAATCATGactaa
Protein Sequence
MTIKTEEFDGAIPPHTEIRCKMCNKLLPLTELESHVIKEHADKEKNFACDICGKLFNKNKQRNYHKKIVHGDRKHICSVCNKAFATPSGLRRHESYHSSERPWICDFCESAFQNKSALYSHKTQHSESKKYVCDLCYKSFSIPSLLKNHIRSSHITSVECKLCNETFTNLYHLKKHTRSHRESNNLQSHVCKICDKVFDMDLRLQRHLLTHRNKDKHMCDLCGKTFSTTTSLRDHKFIHLDIKPFECYNCGKKFNNKNTLKSHIRTHTESNVLENPTAPGLLNIPKPDGSMVYVVLNNHNSDSVKSPKLYPQQDKLEKWLQGLKRLQPNSFVTCIFCKGLLTVERVAQHVNLIHSVFYNCEECGKSFPKKSALILHRITHDDTRNYECGVCLKRFKTKYNLKHHTKSYCKKRNCVCHICQAEFKYKRYLAIHARNHSNDRPSECNICAERLLKIKSENSTAKSLKTPGFVYFPNTNKVYLVSNPQNSDSVNLPKFEVNQTNIEEWMQSLNRLEPNSLTTCVFCKGIFTVENMIQHVHLEHKVGYNCEKCGKHCLTPTALTFHIKTHHDHTIRHKCGICFKRIKSKYAFKTHMKNHREKEKWDCNICSASFNCKSNLQCHLGKHNGKYQCDQCDKNFNAQWALNDHFRSTHATNVECNTCKEVFPSKLLKIKSENDVPENLTAPGFLYFPNSNKVYLIANQQNSVTVNLPPLELNQENIEEWMKSLKRLEHNSLTTCIFCQGLFTVEIMAKHVYLEHMCEKCGQQCSNQAELILHIRTHDDTRIIECEVCFKRFKSKAQLKKHKKIHIEWLCYICKASFNDKSYLVDHIGSHCSNEEYECNLCDKKFSYALLLNAHVRSMHATNLKCNICKEILPNYYYLQEHIRSVHKNPICHVCPVCKKEFNAKHLASRHLKTVHGDKQKFVCDICGKIVGTGTILRDHKFTHFGIKPFECYDCGKKFTKKGMLKIHVLVHTGLKPHACKMCNKKFSQRNTLTIHIRSHTGERPYKCELCDEAFVMVRIGSFPSRYDATLRLACEDLARMLVTIKTENDVSENITVPGLLYFPKSNKVYVVTNHQNSASLDLPIYGFTDNYQLEPNQVHIDEWTQSLTGLQPNSLTTCIFCQGVFLVENMSQHVYLKHTVGYNCGKCGKECSNQTALTLHIRTHDETRNHECGVCFKRFKLKYALNYHMKSHNKKKEWMCHICKASFKYKSSLDSHHKFIHLSIKPFECHVCGKKFNTKNGVRSHVLVHTRLKPHSCQMCDKKFTQRSTLVIHMRSHTGERPYQCDLLVNIKSENNVPKNLTAPGFLYFPKANLVYDIPNLQNSAEQSFGAESSPVDLPKLEPHQVEEWMQSLKKLEPNSLTTCIFCKRLFTVEKVIQHAHLKHKVGYNCEKCGKPCSNQTALTLHVRTHDETRNHECGVCFKRFKLKYALNFHMKSHSESREWVCHICQASFKYKSCMVIHIRRHSNTGDYECNLCNKKFSNPSILNEHFRNTHATNIKCNTCKEVFPNYYYLQKHSIVHKNPSRHVCPICKKEFNVKHLATRHLKVVHGNKEKFVCDICGKICSCGTSLRDHKFIHLGIKPFECHDCGKKFNNKNRMRCHVLVHTGLKPHSCKMCDKKFTQRSTLVIHMRSHTGEKPYKCDLLVNIKSENNVSKNLTAPGFLYFPKDNLVYVIPNLQNSADQSFVAESAPVDLPKLEPQQVEEWMQSLKKLEPNSLTTCIFCKRLFTVEKVIQHAHLKHKVGYNCEKCGKPCSNQTALTLHVRTHDETRNHECGVCFKRFKSKYVLNNHMKSHSENREWVCHICQASFKHKSYLVIHIRRHSNTGDYECNLCNKKFSYPSILNEHFRNTHATNIKCNTCKEVFPNYYYLQKHSIVHKNPSRHVCPICKKEFNVKYLATKHLKVVHGNKEKFVCDICGKICSCGTSLRDHKFIHLGIKPFECHDCGKKFTNKNRMRCHVLVHTGLKPHSCKMCDKKFTQRSTLVIHMRSHTGEKPYKCDLCDRAFVSRSLLNTHKKNHD

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-