Basic Information

Gene Symbol
-
Assembly
GCA_030247185.2
Location
CM058071.2:157118128-157140071[-]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The zinc finger is described by the pattern #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and the numbers give the number of residues (ranges are shown in parentheses). The positions marked # are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
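The pattern in the description can be approximated with a regular expression. The sketch below is illustrative only and is not how the hits on this page were produced (those come from the PF00096 profile HMM). It assumes the # positions can be treated as "any residue", so the stretch between the second Cys and the first His collapses to 12 arbitrary residues. The names `C2H2_PATTERN` and `find_c2h2` are hypothetical, and the test fragment is copied from the start of the protein sequence in this record.

```python
import re

# Pfam-style pattern: C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C].
# Assumption: '#' positions are treated as "any residue", so the
# X3-#-X5-#-X2 stretch (3 + 1 + 5 + 1 + 2 residues) becomes .{12}.
C2H2_PATTERN = re.compile(r"C.{1,5}C.{12}H.{3,6}[HC]")


def find_c2h2(protein):
    """Return (start, end, motif) tuples using 1-based inclusive coordinates.

    Matches are non-overlapping, as produced by re.finditer.
    """
    hits = []
    for m in C2H2_PATTERN.finditer(protein.upper()):
        hits.append((m.start() + 1, m.end(), m.group()))
    return hits


if __name__ == "__main__":
    # Fragment taken from the beginning of the protein sequence in this record.
    fragment = "YKCTECGKDFHTQINLNRHMYIH"
    for start, end, motif in find_c2h2(fragment):
        print(start, end, motif)
```

A regular expression like this is much less sensitive and less specific than a profile HMM, so it is best used as a quick sanity check rather than as a replacement for hmmscan.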
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 55 4.8e-06 0.00037 22.0 2.6 1 23 37 59 37 59 0.98
2 55 0.00088 0.069 14.8 2.5 1 23 65 87 65 87 0.98
3 55 0.0014 0.11 14.2 0.9 1 23 93 115 93 115 0.96
4 55 0.031 2.4 10.0 0.2 1 23 121 143 121 143 0.97
5 55 0.00033 0.026 16.2 4.4 1 23 149 171 149 171 0.98
6 55 0.00043 0.034 15.8 0.4 1 23 177 199 177 199 0.95
7 55 8.4e-06 0.00066 21.2 1.2 1 23 205 227 205 227 0.98
8 55 0.024 1.9 10.3 2.8 1 21 232 252 232 253 0.93
9 55 0.00046 0.036 15.7 2.3 1 23 324 346 324 346 0.96
10 55 7.3e-06 0.00057 21.4 0.5 1 23 352 374 352 374 0.97
11 55 0.003 0.24 13.1 1.2 1 23 382 404 382 404 0.89
12 55 0.58 46 6.0 0.7 1 13 410 422 410 424 0.88
13 55 0.0014 0.11 14.2 1.5 1 23 505 527 505 527 0.96
14 55 7.9e-05 0.0062 18.1 1.4 1 23 533 555 533 555 0.97
15 55 0.36 28 6.6 2.6 2 23 568 589 567 589 0.86
16 55 0.00029 0.023 16.4 0.7 1 23 595 617 595 617 0.99
17 55 0.011 0.87 11.4 2.4 1 23 623 645 623 645 0.95
18 55 0.00016 0.012 17.2 0.5 1 23 651 673 651 673 0.97
19 55 0.0028 0.22 13.2 0.9 1 23 679 701 679 701 0.98
20 55 0.0014 0.11 14.2 1.5 1 23 724 746 724 746 0.96
21 55 7.8e-05 0.0061 18.1 1.0 1 23 752 774 752 774 0.96
22 55 8.8 6.9e+02 2.2 2.5 2 23 787 808 786 808 0.90
23 55 9.7e-05 0.0076 17.8 1.6 1 23 895 917 895 917 0.96
24 55 0.0018 0.14 13.8 0.4 1 23 923 945 923 945 0.96
25 55 0.92 72 5.3 2.5 2 23 958 979 957 979 0.90
26 55 1.1e-05 0.00083 20.9 0.5 1 23 985 1007 985 1007 0.99
27 55 0.0049 0.38 12.5 1.6 1 23 1013 1035 1013 1035 0.96
28 55 0.018 1.4 10.7 1.2 1 23 1041 1063 1041 1063 0.96
29 55 0.016 1.3 10.8 0.8 1 23 1069 1092 1069 1092 0.96
30 55 0.0014 0.11 14.2 1.5 1 23 1183 1205 1183 1205 0.96
31 55 0.0018 0.14 13.8 0.4 1 23 1211 1233 1211 1233 0.96
32 55 0.067 5.3 8.9 1.8 2 23 1246 1267 1245 1267 0.90
33 55 8.4e-06 0.00066 21.2 0.3 1 23 1273 1295 1273 1295 0.99
34 55 0.011 0.86 11.4 3.0 1 23 1301 1323 1301 1323 0.95
35 55 5.2e-05 0.0041 18.7 0.9 1 23 1329 1351 1329 1351 0.97
36 55 0.0042 0.33 12.7 0.5 1 23 1357 1380 1357 1380 0.96
37 55 0.0015 0.11 14.1 3.0 1 23 1471 1493 1471 1493 0.96
38 55 4.5e-06 0.00035 22.0 0.4 1 23 1499 1521 1499 1521 0.97
39 55 0.0014 0.11 14.2 0.7 1 23 1530 1552 1530 1552 0.93
40 55 4.9e-07 3.9e-05 25.1 1.9 1 23 1558 1580 1558 1580 0.99
41 55 0.0033 0.26 13.0 1.6 1 23 1586 1608 1586 1608 0.96
42 55 0.0067 0.52 12.1 1.5 1 23 1614 1636 1614 1636 0.97
43 55 0.0046 0.36 12.6 0.6 1 23 1642 1665 1642 1665 0.96
44 55 0.037 2.9 9.7 0.2 5 23 1758 1776 1757 1776 0.95
45 55 4.5e-05 0.0035 18.9 0.9 1 23 1782 1804 1782 1804 0.97
46 55 0.00027 0.021 16.5 1.4 2 23 1817 1838 1816 1838 0.95
47 55 3.2e-05 0.0025 19.4 0.6 1 23 1844 1866 1844 1866 0.99
48 55 0.0049 0.38 12.5 1.6 1 23 1872 1894 1872 1894 0.96
49 55 5.7e-06 0.00045 21.7 0.4 1 23 1900 1922 1900 1922 0.98
50 55 0.00028 0.022 16.4 0.5 1 23 1928 1950 1928 1950 0.98
51 55 0.00071 0.055 15.1 2.4 1 23 1956 1978 1956 1978 0.98
52 55 0.079 6.2 8.7 0.1 1 23 1984 2006 1984 2006 0.97
53 55 0.00029 0.023 16.4 1.9 1 23 2012 2034 2012 2034 0.98
54 55 0.0023 0.18 13.5 3.4 1 23 2038 2060 2038 2060 0.97
55 55 7.2e-06 0.00056 21.4 5.2 1 23 2065 2087 2065 2087 0.99
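The table above can be loaded programmatically to separate strong domain hits from weak ones. The following is a minimal sketch that assumes the whitespace-separated, 13-column layout shown on this page (raw hmmscan output carries extra marker columns that are absent here). The field names, the helper names `parse_domain_rows` and `confident_domains`, and the 0.01 i-Evalue cutoff are illustrative assumptions, not values taken from this record's pipeline.

```python
# Column names for the 13 whitespace-separated fields of each table row.
# These names are assumptions chosen for readability.
FIELDS = ("index", "of", "c_evalue", "i_evalue", "score", "bias",
          "hmm_from", "hmm_to", "ali_from", "ali_to",
          "env_from", "env_to", "acc")
INT_FIELDS = {"index", "of", "hmm_from", "hmm_to",
              "ali_from", "ali_to", "env_from", "env_to"}

# Three rows copied from the table above (domains 1, 12 and 22).
SAMPLE = """\
1 55 4.8e-06 0.00037 22.0 2.6 1 23 37 59 37 59 0.98
12 55 0.58 46 6.0 0.7 1 13 410 422 410 424 0.88
22 55 8.8 6.9e+02 2.2 2.5 2 23 787 808 786 808 0.90
"""


def parse_domain_rows(text):
    """Parse table rows into dicts; header and malformed lines are skipped."""
    rows = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) != len(FIELDS) or not parts[0].isdigit():
            continue
        row = {}
        for name, value in zip(FIELDS, parts):
            row[name] = int(value) if name in INT_FIELDS else float(value)
        rows.append(row)
    return rows


def confident_domains(rows, max_i_evalue=0.01):
    """Keep rows whose independent E-value is at or below the cutoff."""
    return [r for r in rows if r["i_evalue"] <= max_i_evalue]


if __name__ == "__main__":
    for row in confident_domains(parse_domain_rows(SAMPLE)):
        print(row["index"], row["ali_from"], row["ali_to"], row["i_evalue"])
```

Filtering on the independent E-value rather than the conditional one is a common choice when each domain is judged on its own, but the appropriate cutoff depends on the analysis.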

Sequence Information

Coding Sequence
ATGTTCGTCGATACCGGAGTAAAGCCGTTCGAAAGTGATGTGTGTGACAATGGAATTCGCAACACTGAAAAGGAACGCCAGCTCAATGTTCATACCGTCAAACGATTGTACAAATGCACTGAATGCGGAAAAGATTTCCACACTCAAATAAATCTTAATCGGCATATGTACATTCACACTGGCAAGAGGCCGTTCAAATGCACTGTATGCGGAAAATGCTTTGTTTCAAACGGTGTCCGTAGCACGCACATGTCTACACACACGGGTGATTATAAGTATGTTTGTGATGTATGTGGCAAAGGATTTGTGCACAGACGGCGTTTGGAGGACCACAAATACATTCATGCCGGAGAGCGGCCTTACAAATGCGCGGTATGTGAAAAAGATTTTAAAACCAAACCGATTTTGGACATACATATGCTGAATCACACAGTGGATCGTCCACACAAATGCACTGTATGTGGTAAAAGTTTTGCCATTAAACATCGCCTAATTAGACACCAATCCACACATACGGGCGATTATAAGTATACTTGTGAAGTGTGTGGCAAAGGATTCGTGAACAGTTCACGCTTGGCAGATCACAAGTTCAACCATACGAAAGAGCGGTCTTATCAATGCACTATATGCGAAAAAGATTTTACTACTAACAATTACCTAACCCAACACATGTTGCTTCACACTGGTGCTCGACATCAATGTGAAGTGTGTGGCAAAGAATACTCTAGCcacatttatttgaaattgcACAAGTGCATTCTTACATATGATCAGTCGATTGATCCGTTACCAATGGAAACAACAGATCAGGATAGCGACATTCTAACCAAACAGGAGAATATAACAATTGAACCAACGAAACTCGATAACAATGACAACGTAAATAAACAAGCTTATACCAATGGTTACTGCGATGATCTAAATGATAAAGTATCCTGTCAGCTACAAGATGTTATGCCAGAGATGGAACACAAGTGTCAGATTTGTGATAAATACTTTGCTAACAAAGATGATCTTCATGAGCATGCCTCAGTTCATGAAAGCGAACGACCATTTAGTTGTGATATTTGCGGACAAAGTTATGAATCGAAGGATCAGCTAACCAAGCATATGACCGAGCATATTGAGTGTCCTAATGATAAATATAAATGTGATGTGTGTGGCAAAGAGTTTCGTGACAACCAAACATTAAAGTTCCACAACAGTATTCACACCGGAGAGCGACCGTACGAATGCACAGTATGCGAAAAAAGCTTTGGTTCCAAAAATCAGTCGATTGATCCGTTACCAATGGAAACAACAGACCAGGATAGCGACATTCTAGCCAAACAGAAGAATATAACAATTGAACCAACAAAACTCGATGACAAGCTAAATGAACAAACAAACACCAATGGCAACAGCGATGCAATTGAATTACAACCAAACTCACACGATCCAGTTAGCAGTAATGAAATTGATCGAGATGAAAAAGTATCCTGCCAGCTACAGGATGTTATGTCAGAGATGGAACACAAGTGTCAGATTTGTGATGAATACTTTGCTAACAAGGATGATCTCCATGAGCATGCCTCAGTTCATGAAAGCGAACGACCATTTAGTTGTGATATTTGCGGGCAAAGTTACGAATTGAAGGATCACCTAACCAAGCATATGACCGAGCATAATGAGAAGAAACCGTTCTCCTGTCCTAGTGATTGCAAATGTGATGTGTGTAGCAAAGAATTTCGTGACAATCAAGCATTGAAGTTCCACAACAGTATTCACACCGGAGAGCGACCGTACGAATGCACTGTATGCGGAAAAAACTTTAGTTTCAAAGGTATTCTAAGCACACACATGTCTACACACACGGGTAATTATAAGCATGTGTGTGATGTATGTGGCAAAGGATTTGTGCACAGATCGCGTCTGGATGACCACAAATACAGTCATACCGGAGAGCAACCTTACAAATGCACTGTGTGTGGAAAGATTTTTCCGACTAACACTGCTCTAAG
TTTACACTTTTCCTCACACACGGGTAATTATAAGTACGAGTGTGATGTTTGTGGCAAAGGATTCATGATACAATCCCGTTTGAAATACCACAAAAACAAGCATACGGGAGAGCGACCTTACAAATGCACGGTATCCAACCAGCTACAGGATGTCATGTCAGAGAAGGAACACAAGTGTCAGATTTGTGATGAATACTTTGCTAATAAAGATGATCTCCATGAGCATGCCTCAGTTCATGAAAGCGAGCGACAATTTAATTGTGATATTTGCGGGAAAAGTTATGAATTGGAGGATCACCTAACCAAGCATATGACCGAGCATAATGAGGAAAAAACAATATCCTGTCCTAGTGATTGCAAATGTGATGTGTGTAGCAAAAAAATTCGTGACAATCAAGCATTGAAGTTCCACAACAATATTCATTCTGGAGATCGGCAGTACGAATGCACATCGATTGATCCGTTACCAATCGAACCAACAGATCACGATCCCTTTATTCTAGCCAAACAGGAGAATATAACAATTGAACCAACGAAACTCGATGACAATGACAAGATCAATGAGCAAACAACCATCAATAGCAACAGCGACGCATTCGATTTACAACCAAACCCACACGATCCTGTTGGCAGCAATGAAATTGATCGAATCCAGCTACAGGATGTTATGTCAAAGAATGAACACAAGTGTCCGATTTGTGATAAATACTTTGTAAACAAGGATGATCTTCATGAGCATGCCTCAGTTCATGAAAGCGAGCGACCATTCAGTTGTGATATTTGCGGGAAAAATTATGTATTGATAGATCACCTGGCCAGGCATTTGACCGAGCATATTAAGGAAAAAACGTTCTTCTGTCCTAGGGATTGCCAATGTGATAAGTGTGGCAAAAAAATTCGTGACAATCAAGCATTGAAGTTTCACAACAATATTCATACCGGAGAGAGGCAGTACGAATGCACAGTATGTGGAAAAAAACTTGTTACGAAGACTAATCTAAGCGTGCACATGTCTACACATACGGGTAATTATAAATATACGTGTGACGTGTGCGGCAGAGGATTTGTGCACAGATCGCGTTTGGATGACCACAAATACAGTCATACCGGAGAGCGGCCTTATAAATGCACTGTGTGTGGAAAGATTTTTTTGTCTAACACTACTCTAAGTTTACACTTTTCCTCACACACGGGAAACTATAAGTACGAATGTGATGTTTGTGGCAGAGGATTCATGCTAAGATCACGATTGAAATACCACAAATTAAAATTGCACAAACGCATTCATTCATCCGATCAGTCGATTGATTCGTTACAAATCAAACCAGCAGATCACGATAGCGTCAATCTAGCCAAACAGAAGAATACAACAAATGAATCCACAAAACTCGATGACAATGACAAGGTAAATGAACAAACAAACACCAATAGAAACAGCGATGCAATTGATACACAACCAAACTCACACGATCCAGTTAGCAGTAATGAAATTGATCGAGCTGAAAAAGTATCCTGCCAGCTACAGGATGTTATGTCAGAGATGGAACACAAGTGTCAGATTTGTGATGAATACTTTGCTAATAAAGATGATCTCCATGAGCATGCCTCAGTTCATGAAAGCGAGCGACCATTCAGTTGTGATATTTGCGGGAAAAATTATGTATTGATAGATCACCTGGCCAGGCATTTGACCGAGCATATTAAGGAAAAAACGTTCTTCTGTCCTAGGGATTGCCAATGTGATAAGTGTGGCGAAGAATTCCGTGACAATCAAGCATTGAAGTTCCACAACAATATTCATTCTGGAGATCGGCAGTACGAATGCAAAGTATGTGGAAAAAGACTTGTTACGAAGACTAATCTAAGCGTACACATGGCTACACATACGGGTAATTATAAACATAGTTGTGATATATGTGGCAAAGGATTTGTGCACAGATCACGTTTGGATGATCACAAATACAGTCATACCGGAGAGCGGCCTTACAAATGCACTGTGT
GTGGAAAGATTTTTTCGACTAACGCTACTCTAAGTTTACACTTCTCCTCACACACGGGTGATTATAAGTACGAATGTGATGTTTGTGGCAGAGGATTCATGCTAAGATCACGattgaaaaatcacaaattaaaAGTGCACAAACGCATTCATACATCTGATCAGTCGATTGATCCGTTACAAATCAAACCAACAGATGACGATAGCGACATTCTAGCAAAACAGGAGAATATAACAATTGAACCAACAAAACTCGATGACAATGACAAGGTAAATGAACAAACAAACACCAATGGCAACAGCGATGCAATTGAATTACAACCAAACCCACACGATCCTGTTAGCAGCAATGAAATTGATCAAGATGAAAAGGTATCCAACCAGCTACAGGATGTTATGCCAAAGAAGGAACACAAATGTCAGATTTGTGATGAACACTTTGCTAACAAAGATGATCTCCATGAGCATGCCTCAGTTCATGAAAGTGAGCAAGCATTTAGTTGTGATATTTGCGGGCAAAGTTATGCATCGAAGGATCAGCTAACCAAGCATATGACCGAGCATATTGAGTGGCCTAGTGATTACAAGTATAAATGTGATGTGTGTGGCAAAGAATTTCGTGACAATCAAGCATTGAAGTTTCACAACAATATTCATACCGGAGGGCGGCCATACAAATGCACTGTATGcggaaaaatatttgttaccAACAGTAGTCTATGCGTACACATGTCTACGCATACGGGTAATTATAGGTATAGTTGTGATGTATGTGGTAAAGGATTTGTGCACAGATCACGTTTGGATGATCACAAATACAGTCATACCGGAGAGCGTCCTTATAAATGCACTTTGTGTGGAAAGATTTTTTTGTCTAAAACCACTTTAAACATACACTTCTCCTCACACACGGGTGATTATAAGTACGAATGTGATGTTTGTGGCAGAGGATTCATGCTAAGATCACGattgaaaaatcacaaattgaaATTGCACAAACGCATTCATACATTTGACCAGTCGAATGATCCGTTACAAATCAAACCAAAAGATCACGATAGCGACATTCTAGTAAAACAGGAGAATATAACAATTGAACCAACAAAACTCGATGACAAGGTAAATGAACAAACAACCATCAATAGCAACAGCGACGCATTCGATTTACAACCAAACCCACGCGATCCTGGTAGCAGTAATGAAATTGATCGAGatgaaaaattATCCTGCCAGCTTCAGAATGTTATGTCAGAGAAGGAATACAAGTATCAGATTTGTGATGAACACTTTACTGACAAGAACGACCTCCGTGACCATGCCTCAGTTCATGAAAGCGAGCGACCATTTAGTTGTGATATTTGCGGGCAAAGTTATGAATTGAAGGATCACCTAACCAAGCATATGATCGAGCATAATGAGGGAAAAACATTCTCCTGTCCTAGTGATTGCAAATGTGATGTGTGTGGCAAAGAATTTCGTGACAATCAAGCATTGAAGTTGCACAACAATATTCATTCTGGAGATCGGCAGTACGAGTGCACAGTATGCGGAAAAAGCTTTGGTACAAAAAGTATCCTAAGCACACACATGTCTACACATACGGGTGATTATAAGTATACGTGTGACGTGTGCGGTAGAGGTTTTGTGCACAGATCGCGTTTAGATGACCACAAATACAGTCATACCGGAGAACGGCCTTACAAATGCACTGTGTGTGGAAAGATTTTTTCGACTAACGCTACTCTAAGCATACACTTATCGTCACATACAGGTGATTATAGGTACGAGTGTGATGTGTGTGGCAAAGGATTTGCGCTAAGATCGCGTTTGAAAAGTCACAAAAATAGTCACATGGAAGAGCGACCTCACAAATGCACGGTATGCGAAAAAGATTTCAAAATCAAACGGAATCTTGACGCACATATGACGACTCACACAGCCGAGCGGCCGTACAGGTGCACTGTATGTGGAGATAGTTTCGTCgctatcaatcaactaact
gcacACGAATCCAAACACACTGGTGAATATAAGCATACGTGTGACTTGTGCGGCAAAGGATTCATATACAGTTCATGGTTCAAAGAACACAGAAACAGTCATGAGCGGTCTTATCACTGCACGACATGCGGAAAAGACTTTGCCACTAACAAGAAACTATCCCAGCACAAGTCAATTCACTCGGGTATTCGGTATAAGTGCGAAATGTGCGGCAAAGAATTCACTAGGAAATTTAACTTAAAAGTGCACAAATGCACACATACATCTGATCAGTTAACGAATCCAAGTGGAACACTGTCTTCGTGTGAAACACAACCGTAA
Protein Sequence
MFVDTGVKPFESDVCDNGIRNTEKERQLNVHTVKRLYKCTECGKDFHTQINLNRHMYIHTGKRPFKCTVCGKCFVSNGVRSTHMSTHTGDYKYVCDVCGKGFVHRRRLEDHKYIHAGERPYKCAVCEKDFKTKPILDIHMLNHTVDRPHKCTVCGKSFAIKHRLIRHQSTHTGDYKYTCEVCGKGFVNSSRLADHKFNHTKERSYQCTICEKDFTTNNYLTQHMLLHTGARHQCEVCGKEYSSHIYLKLHKCILTYDQSIDPLPMETTDQDSDILTKQENITIEPTKLDNNDNVNKQAYTNGYCDDLNDKVSCQLQDVMPEMEHKCQICDKYFANKDDLHEHASVHESERPFSCDICGQSYESKDQLTKHMTEHIECPNDKYKCDVCGKEFRDNQTLKFHNSIHTGERPYECTVCEKSFGSKNQSIDPLPMETTDQDSDILAKQKNITIEPTKLDDKLNEQTNTNGNSDAIELQPNSHDPVSSNEIDRDEKVSCQLQDVMSEMEHKCQICDEYFANKDDLHEHASVHESERPFSCDICGQSYELKDHLTKHMTEHNEKKPFSCPSDCKCDVCSKEFRDNQALKFHNSIHTGERPYECTVCGKNFSFKGILSTHMSTHTGNYKHVCDVCGKGFVHRSRLDDHKYSHTGEQPYKCTVCGKIFPTNTALSLHFSSHTGNYKYECDVCGKGFMIQSRLKYHKNKHTGERPYKCTVSNQLQDVMSEKEHKCQICDEYFANKDDLHEHASVHESERQFNCDICGKSYELEDHLTKHMTEHNEEKTISCPSDCKCDVCSKKIRDNQALKFHNNIHSGDRQYECTSIDPLPIEPTDHDPFILAKQENITIEPTKLDDNDKINEQTTINSNSDAFDLQPNPHDPVGSNEIDRIQLQDVMSKNEHKCPICDKYFVNKDDLHEHASVHESERPFSCDICGKNYVLIDHLARHLTEHIKEKTFFCPRDCQCDKCGKKIRDNQALKFHNNIHTGERQYECTVCGKKLVTKTNLSVHMSTHTGNYKYTCDVCGRGFVHRSRLDDHKYSHTGERPYKCTVCGKIFLSNTTLSLHFSSHTGNYKYECDVCGRGFMLRSRLKYHKLKLHKRIHSSDQSIDSLQIKPADHDSVNLAKQKNTTNESTKLDDNDKVNEQTNTNRNSDAIDTQPNSHDPVSSNEIDRAEKVSCQLQDVMSEMEHKCQICDEYFANKDDLHEHASVHESERPFSCDICGKNYVLIDHLARHLTEHIKEKTFFCPRDCQCDKCGEEFRDNQALKFHNNIHSGDRQYECKVCGKRLVTKTNLSVHMATHTGNYKHSCDICGKGFVHRSRLDDHKYSHTGERPYKCTVCGKIFSTNATLSLHFSSHTGDYKYECDVCGRGFMLRSRLKNHKLKVHKRIHTSDQSIDPLQIKPTDDDSDILAKQENITIEPTKLDDNDKVNEQTNTNGNSDAIELQPNPHDPVSSNEIDQDEKVSNQLQDVMPKKEHKCQICDEHFANKDDLHEHASVHESEQAFSCDICGQSYASKDQLTKHMTEHIEWPSDYKYKCDVCGKEFRDNQALKFHNNIHTGGRPYKCTVCGKIFVTNSSLCVHMSTHTGNYRYSCDVCGKGFVHRSRLDDHKYSHTGERPYKCTLCGKIFLSKTTLNIHFSSHTGDYKYECDVCGRGFMLRSRLKNHKLKLHKRIHTFDQSNDPLQIKPKDHDSDILVKQENITIEPTKLDDKVNEQTTINSNSDAFDLQPNPRDPGSSNEIDRDEKLSCQLQNVMSEKEYKYQICDEHFTDKNDLRDHASVHESERPFSCDICGQSYELKDHLTKHMIEHNEGKTFSCPSDCKCDVCGKEFRDNQALKLHNNIHSGDRQYECTVCGKSFGTKSILSTHMSTHTGDYKYTCDVCGRGFVHRSRLDDHKYSHTGERPYKCTVCGKIFSTNATLSIHLSSHTGDYRYECDVCGKGFALRSRLKSHKNSHMEERPHKCTVCEKDFKIKRNLDAHMTTHTAERPYRCTVCGDSFVAINQLT
AHESKHTGEYKHTCDLCGKGFIYSSWFKEHRNSHERSYHCTTCGKDFATNKKLSQHKSIHSGIRYKCEMCGKEFTRKFNLKVHKCTHTSDQLTNPSGTLSSCETQP

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-