Basic Information

Gene Symbol
-
Assembly
GCA_963422695.1
Location
OY730478.1:62312186-62321664[-]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 46 0.00017 0.012 16.8 2.0 2 23 185 206 184 206 0.96
2 46 0.00029 0.021 16.1 2.7 1 23 211 233 211 233 0.99
3 46 7.3e-05 0.0053 18.0 1.9 1 23 239 261 239 261 0.98
4 46 0.043 3.1 9.3 1.5 2 23 267 287 266 287 0.96
5 46 3.5e-05 0.0025 19.0 0.9 1 23 294 316 294 316 0.99
6 46 0.00031 0.022 16.0 1.1 1 23 322 344 322 344 0.99
7 46 2.9e-06 0.00021 22.4 1.4 2 23 354 375 353 375 0.97
8 46 0.0017 0.13 13.7 1.9 1 23 381 403 381 403 0.99
9 46 0.00097 0.071 14.4 0.8 1 23 409 431 409 431 0.98
10 46 2.4e-05 0.0017 19.5 1.1 3 23 506 526 504 526 0.97
11 46 3.7e-05 0.0027 18.9 2.0 1 23 532 554 532 554 0.98
12 46 0.00042 0.03 15.6 0.2 2 23 560 581 560 581 0.97
13 46 1.2e-06 8.8e-05 23.6 1.7 1 23 588 610 588 610 0.99
14 46 0.00072 0.052 14.9 6.4 1 23 616 638 616 638 0.99
15 46 1.1e-05 0.00082 20.5 1.0 2 23 648 669 647 669 0.97
16 46 0.0005 0.036 15.4 1.8 1 23 675 697 675 697 0.99
17 46 0.00048 0.035 15.4 2.0 1 23 703 725 703 725 0.98
18 46 0.002 0.15 13.5 5.0 3 23 818 838 816 838 0.96
19 46 0.0011 0.082 14.2 1.4 1 23 843 865 843 865 0.98
20 46 2.2e-05 0.0016 19.6 1.6 1 23 871 893 871 893 0.98
21 46 0.00013 0.0095 17.2 0.8 1 23 924 946 924 946 0.98
22 46 0.0019 0.14 13.5 1.2 1 23 952 974 952 974 0.98
23 46 4.4e-06 0.00032 21.8 0.7 2 23 984 1005 983 1005 0.97
24 46 0.00031 0.023 16.0 0.4 1 23 1011 1033 1011 1033 0.98
25 46 0.00087 0.063 14.6 0.3 1 23 1039 1061 1039 1061 0.98
26 46 0.0049 0.36 12.2 0.5 1 23 1067 1089 1067 1089 0.95
27 46 0.00039 0.028 15.7 0.5 3 23 1126 1146 1124 1146 0.96
28 46 8.8e-05 0.0064 17.7 3.1 1 23 1151 1173 1151 1173 0.99
29 46 0.00083 0.061 14.7 1.2 1 23 1179 1201 1179 1201 0.98
30 46 0.14 10 7.7 0.5 2 23 1207 1227 1206 1227 0.96
31 46 0.0038 0.28 12.6 1.3 3 23 1235 1255 1233 1255 0.95
32 46 0.00018 0.013 16.8 1.7 1 23 1261 1283 1261 1283 0.99
33 46 2.2e-05 0.0016 19.6 1.6 2 23 1293 1314 1292 1314 0.97
34 46 0.48 35 6.0 4.5 1 21 1320 1340 1320 1342 0.95
35 46 2.3 1.7e+02 3.8 0.0 1 13 1348 1360 1348 1363 0.88
36 46 2.6e-05 0.0019 19.4 0.9 3 23 1451 1471 1449 1471 0.96
37 46 6.3e-06 0.00046 21.3 4.0 1 23 1477 1499 1477 1499 0.99
38 46 0.00077 0.056 14.8 3.8 1 23 1505 1527 1505 1527 0.98
39 46 0.00039 0.028 15.7 0.3 1 23 1533 1555 1533 1555 0.98
40 46 9.2e-06 0.00067 20.8 2.9 2 23 1563 1584 1562 1584 0.96
41 46 0.00024 0.017 16.4 1.4 1 23 1591 1613 1591 1613 0.99
42 46 0.0009 0.065 14.6 0.5 1 23 1619 1641 1619 1641 0.98
43 46 4.5e-06 0.00033 21.8 0.4 2 23 1651 1672 1650 1672 0.97
44 46 0.02 1.5 10.3 5.9 1 21 1678 1698 1678 1700 0.96
45 46 0.002 0.15 13.5 0.8 1 23 1706 1728 1706 1728 0.98
46 46 0.078 5.7 8.5 0.2 1 23 1734 1756 1734 1756 0.89

Sequence Information

Coding Sequence
atGGATGGAATTTGTCGGTGTTGCTTGCTAGAAAGTTCCGAAATGGAATCAATTTTTAAgtcaaaaaatttattgaaattactaACTTGTGCAAAAATAACAATTTCTGAGAATGATGGATTACCTACTTTGATATGTAAAGTTTGTTGTCAAAAAATGAATGATTCTTATGAATTTCGGAAAGAATGTGAAAAgtctcaaacaattttaaaacaatgggttaaaaataataaaagtattcCACCAGAAACTGATGTAATTCCACAGGAACTACCTGCTCTAATAGAAAACGAattgtgtgtaaaaaaaattgttgttaaaCGAAGTTCCAGGATCAGTTCTGAGGGAGATTTTTCAAACGAAATCAAAATTGAATCGAATCCTGAAAAAGCAACAAAATTTAGGAGTGAAGAATttgaaaaagctttaaaaatcgaaattaaaCAGGAAGAAGAAATGAGCATTGCTAGTACTGAAGATATTAAACTAACAAATGATCTTGAAGAAAATACTATTCCAGAAGTTAATCGAAATACCAGTGCTAGTGGTATAAGTTGTAAATTGTGTggaaaaaagtttgaaaaatcGAGATCATTATCTAGGCACTTGAGCCGACATAAGGAAAAGACATTTAAATGTAGTTATTGCCCGAGTAGTTTTCACCAAATGTATGAATTAACTAATCATATACGGATACACACCGGAGAAAAACCTTATGAATGTTCAATTTGTTCAAAACGTTTTTCAAGTACTTCATTATTGTATAATCATAAGAAGGTCCATTTAGCGAAACAAACAGtttgtaaaatttgtaataaaacattaatggAAACAACGCTGAAAAAACACATGAAATTACATACCGATCGAGAAAGAAATTTCCAATGTACAGAATGTGGTAAAACATTTTTGCAAAAGGCAACTTTAATATATCATGTACGTACACATGGTACTGAGAGGCCATATAAATGTACAGAATGTCCAAAAGGTTACTTTGTGAAATATCAATTAGATCTCCATATGAGACAACATTTGGGAATTCGTCCACATTTAAGATTGAAATGTACAGTTTGTGAAAAGAGTTTTTCACATAAAGGCGAATTAGGCGTTCATATGCGAACTCACACAGGTGAACGTCCCTATGAATGTTCTGTATGCAAATACCGATTTCTAATTAATAGTCATCTTGTTGTTCATATGAGAACTCATACGGGCGAGAAACCATATCCATGTACAGTCTGTAATATGCAGTTCGCTAGATTGGGATCACGTAAAAAACATATGCTGAAACACGCTGAAACACATGTACAAGAAATTGCGGTTAAACAAAACACTAGCATGTATGACAGTGCCGAGGACGACTTTgcaaacaaaattgaaagcaaaTCGAATTCCAATTTGAATAGTTCTCAAGAAGTAAACATTTCTACTGTTGAAGACATTGAATTGACCAATGATCCCAAAGGAAATACTATTCCAGAAGATAATAAAAATACAGATTCCAGTGATATAATTTGTAAACTTTGTGGAAAAAATTTTCATCAACCAAAACAATTGATGTCTCATATTCGGGTACATGCCGAGGAAAAGCCTTATGAATGTCAAATTTGTTCGAAACGTTTTACAAGTAATTCATTATTGTATAgccataaaaaaattcatttaccaAAGCAAAATGTATGTTCAATATGTAGTAAAGCATTTATTGCGAAAGAAGCATTAAGATCACATATGAAATTGCATACGAACAGAGAAAGAAGTTTTCAATGTACAGAATGTGGTAAAACGTTTTTCCAAAAATCGTCTTTAGCATATCATGCACGTACACATGGTACTGAGAAGCAATATAAATGTACGGAATGCCCAAAAAGTTACTTTGTTAAACATCATTTAAATGCTCATTTTAGAAAGCATTTGGGAATCCGACCATCCTTAATATTGAAATGTACAATTtgtgaaaagaatttttcacataaagGGGAATTAGGCGTTCATATGCGAATACATACAGGTGAACGTCCTTATGAATGTTCTGTGTGTAAAAAACGTTTTCTAGTTAATAGTCATCTTGTTGGTCATATGAGAACTCATACGGGTGAAAAACCCTATGCATGTCCAATTTGTAGTATGCATTTCACTAGATTGGGATCACGTAAAAGACATATGCTGAAACACGCTGAAACGTGTGTACAGGAAATTGTTAAACAAAGTGCCTACGACAGTTCCggggaaaatttttcaaatcaaattaaaattaaatcgaatttaaatttgaatagtCCTGAAAGTGACAACGAAATTAAGAATAGAGAATTTGAAAAACCTTTAGAAATCGAAATTTACCAGGAAGGAATAAGCATTCCTCCTATTGAAGAGACTGAATTGATAAATGATTCCAAGGGAAATGTTATTCCAGAAGATAATCAAAATACCGGTTCCAGTGATATAATTTGTAAACTTTGTAAGAAAAAGTGTACTTCATCGAAGTCATTATCTAGGCACATGAGCCGacataaagaaaagaaattcaaatgtaCTCAGTGTGAATCTAGTTTTCATCAACCTAACCAATTGGCATCTCATATTCGAGCACATACCGGAGAAAAACCTTATGAATGTCAAATTTGTTCGAAACGTTTTACAAGTAATTCATTATTGTATGCTCATAACAAGACTCATTTAGCAAAGCAAAAAGTATGTCCaatatgtaataaaacattattagACGTTAGAAGGCATATGAAACTTCATAATGAgcgagaaaaaaattttcaatgtacAGAATGTGGTAAAACATTTTTACAGAAAGCGTCTTTAACATATCATGCACTTACACATGGCACTGAGAAGGAATTTAAATGTACAGAATGCCCAAAaagtttctttattaaaaatcaattagatCTTCATTTGAATCAACATTTGGGAATTCGTCCAGCCTTAAGATTGAAATGTACAATCTGCGAAAAGAGGTTTGCAAGCAAAGGTGAATTAACCATTCATACGCGAACTCATACTGGCGAGCGTCCTTTCGAATGTTCTATATGTAAAAGACGTTTTCTAGTTAATGGTAATCTTATTATACATATGAGATCTCATACGGGTGAAAAACCGTATGCATGTCCAATTTGTAAAATATCATTCCCTGCATTAAAATCACGTAAAAGGCATATGCAAACGCATACcgggaaaaataattttctatgtgATGTTTGTGGGGAAAGTTTTAAAGATATTAAAagtagaaataatcatcgacaaattcatattCAAGAAGTAAACATTCCTACTGTTGAAGACATTGAATTCACCAATGATCCCAAAGGAAATACTATTCCAGAAGATAATCAAAATACCAATTCCAGTGATATAATTTGTAAACTTTGTGGGAAAAAGTTTGCATCATTGAAGTCATTATATGGGCATATGAGTcgacataaagaaaaaaaatttaaatgtagtCATTGTGGAATTGGATTTCACAAACTGGACCAATTAACATCTCATATTCGAGTACATACCGGAGAAAAACCGTATGAATGCTCCGTTTGCTCAAAACGTTTtacaaatatgaaattattatctGGTCATAGGAAAGTTCATTTAGCAAAACAAACCGTATGTCCCgtatgtaataaaacattattggACACAACATTAAGGAGACACATGAAATTGCACACCGAACGTGAAAAACATTTATGTACAGAATGCGgtaaaacatttttacaaaaatggGATTTAACATATCATGCACTTACACATGGTACTGAGAAGAAATTTAAATGTACAGAATGTCCAAAAAGTTTCTTTCTTAAAAATCAGTTAGATGCTCATTTGAAAAAGCATTTAGGAATTCGTCCACCCTTAAGATTGAAATGTACAAATTGTGAAAAAAGGTTTGCACATAAAGGCGAGTTAGGTGTTCATATGCGAACTCATACCGGTGAACGTCCCTATGAATGTACTCTGtgtaaaaaacgttttttagtTAGTAATCATCTTGTTCTTCATATGAGATGTCATACGGGTGAAAAACCGTATGCGTGTTCAATATGTAATATGCATTTTGCTAGATTAGGATCACaaatatatatacaagaaaTTTCTGTTAGGCAAGATACCGACACGTACGATGGTTCCGATGACAAAACTAAACCGAATTCTAATTTAACTAGTCCTGAAAATGAAACAAGAATAAAGAGTGAAGAATTCAAAGAAACTTTAGAAATCGAAATTTAccaagaagaagaaataaacaATTCTATTGAACGCGTTGAACTGACCAATGATCCTAAAGCAAATAATATTCTAGAAGATAATCAAAATACCAATTTCAATGATATAATTTGTAAGCTTTGTGGGAAAAAGTTTGCATCATCAAAATCGTTATATAATCACATGAGTCGACATACTAAggaaaagaaattcaaatgtaCTCATTGTGGAATTAGTTTTCATCAACCGAAACAATTAACATCTCATATTCGAGTGCATACCGGAGAAAAACCTTATGAATGTTCAGTTTGTTACAAATGTTTTCGAAAACGTGGTCAATTGACTCTCCACGAAACTACTCATAGTAATGACAAACCCTATGAATGTCCAGTTTGTTTGAAACGTTTCGCTAACAATTCATTATTGTATGGTcataagaaaattcatttaccaaaacaaaaagaatgtcAATGTACAATATGTAATAAAGCATTCACATCGAAAGCAACATTAAGGTTACATATGAAATTGCATActaaacgagaaaaaaaattccaatgtaCAAAATGTGGTAAAACGTTTTTAGAAAGATCTTCTTTAATATATCATGCACGTACACATGGaactgaaaaacaatttaaatgtaCAGAATGTCCAAAGGGTTTCTTTGCTAATAATAAATTGGTTGCCCATTTGAGACAACATATGGGAATCCGTCCACCATTAAGATTGGAATGTACAATTTGTGAAAAAAGATTTGCAACAAAAGGTGAATTGACCATTCATACACGAACTCATACCGGTGAACGTCCTTTTGAATGTTCTGTATGTAAAAAACGTTTTCAAGGACATAGTCATCTTGTTGTTCATATGAGATGTCATACAGGTGAAAAACCGTATGCATGTCCAATTTGTCATATACTGTTCCCGAGATTAGAATCACGTAAAAGACATATGCTGAAACACGCTGGTAAAAATACCTTTTTATGTGATGTCTGCGGGGAAAGTTTTAAAGATGCTGAaattagaaataatcatcgacAAATCCATATTGTAAATGGAAAAATTGCaataatagataaaaaaaacaaatcatag
Protein Sequence
MDGICRCCLLESSEMESIFKSKNLLKLLTCAKITISENDGLPTLICKVCCQKMNDSYEFRKECEKSQTILKQWVKNNKSIPPETDVIPQELPALIENELCVKKIVVKRSSRISSEGDFSNEIKIESNPEKATKFRSEEFEKALKIEIKQEEEMSIASTEDIKLTNDLEENTIPEVNRNTSASGISCKLCGKKFEKSRSLSRHLSRHKEKTFKCSYCPSSFHQMYELTNHIRIHTGEKPYECSICSKRFSSTSLLYNHKKVHLAKQTVCKICNKTLMETTLKKHMKLHTDRERNFQCTECGKTFLQKATLIYHVRTHGTERPYKCTECPKGYFVKYQLDLHMRQHLGIRPHLRLKCTVCEKSFSHKGELGVHMRTHTGERPYECSVCKYRFLINSHLVVHMRTHTGEKPYPCTVCNMQFARLGSRKKHMLKHAETHVQEIAVKQNTSMYDSAEDDFANKIESKSNSNLNSSQEVNISTVEDIELTNDPKGNTIPEDNKNTDSSDIICKLCGKNFHQPKQLMSHIRVHAEEKPYECQICSKRFTSNSLLYSHKKIHLPKQNVCSICSKAFIAKEALRSHMKLHTNRERSFQCTECGKTFFQKSSLAYHARTHGTEKQYKCTECPKSYFVKHHLNAHFRKHLGIRPSLILKCTICEKNFSHKGELGVHMRIHTGERPYECSVCKKRFLVNSHLVGHMRTHTGEKPYACPICSMHFTRLGSRKRHMLKHAETCVQEIVKQSAYDSSGENFSNQIKIKSNLNLNSPESDNEIKNREFEKPLEIEIYQEGISIPPIEETELINDSKGNVIPEDNQNTGSSDIICKLCKKKCTSSKSLSRHMSRHKEKKFKCTQCESSFHQPNQLASHIRAHTGEKPYECQICSKRFTSNSLLYAHNKTHLAKQKVCPICNKTLLDVRRHMKLHNEREKNFQCTECGKTFLQKASLTYHALTHGTEKEFKCTECPKSFFIKNQLDLHLNQHLGIRPALRLKCTICEKRFASKGELTIHTRTHTGERPFECSICKRRFLVNGNLIIHMRSHTGEKPYACPICKISFPALKSRKRHMQTHTGKNNFLCDVCGESFKDIKSRNNHRQIHIQEVNIPTVEDIEFTNDPKGNTIPEDNQNTNSSDIICKLCGKKFASLKSLYGHMSRHKEKKFKCSHCGIGFHKLDQLTSHIRVHTGEKPYECSVCSKRFTNMKLLSGHRKVHLAKQTVCPVCNKTLLDTTLRRHMKLHTEREKHLCTECGKTFLQKWDLTYHALTHGTEKKFKCTECPKSFFLKNQLDAHLKKHLGIRPPLRLKCTNCEKRFAHKGELGVHMRTHTGERPYECTLCKKRFLVSNHLVLHMRCHTGEKPYACSICNMHFARLGSQIYIQEISVRQDTDTYDGSDDKTKPNSNLTSPENETRIKSEEFKETLEIEIYQEEEINNSIERVELTNDPKANNILEDNQNTNFNDIICKLCGKKFASSKSLYNHMSRHTKEKKFKCTHCGISFHQPKQLTSHIRVHTGEKPYECSVCYKCFRKRGQLTLHETTHSNDKPYECPVCLKRFANNSLLYGHKKIHLPKQKECQCTICNKAFTSKATLRLHMKLHTKREKKFQCTKCGKTFLERSSLIYHARTHGTEKQFKCTECPKGFFANNKLVAHLRQHMGIRPPLRLECTICEKRFATKGELTIHTRTHTGERPFECSVCKKRFQGHSHLVVHMRCHTGEKPYACPICHILFPRLESRKRHMLKHAGKNTFLCDVCGESFKDAEIRNNHRQIHIVNGKIAIIDKKNKS

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-