Basic Information

Gene Symbol
-
Assembly
GCA_905333055.1
Location
HG995236.1:5065515-5077654[-]
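
The location string above packs contig, coordinates and strand into one token. A minimal sketch of how it could be split into fields is shown here; the names `Location`, `parse_location` and `span_length` are hypothetical, and the 1-based inclusive coordinate convention is an assumption that the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# contig:start-end[strand], e.g. HG995236.1:5065515-5077654[-]
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a 'contig:start-end[strand]' string into typed fields."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


def span_length(loc: Location) -> int:
    """Genomic span in bp, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("HG995236.1:5065515-5077654[-]")
print(loc.contig, loc.start, loc.end, loc.strand, span_length(loc))
```

Under that assumption the locus spans 12140 bp on the minus strand.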

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. Its two conserved cysteines and two conserved histidines coordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and the numbers (or bracketed ranges) give the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
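
The pattern in the description can be written as a regular expression. The sketch below is a literal transcription for illustration only, not the profile HMM that produced the hits in this record. It assumes that the `#` positions accept any residue, since the description does not say which residues are allowed there. The function name `find_c2h2_candidates` is hypothetical, and the test peptide is a finger copied from the protein sequence of this entry with a few flanking residues.

```python
import re
from typing import List, Tuple

# #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C]
# ASSUMPTION: '#' (fold-stabilising positions) is treated as "any residue".
C2H2_PATTERN = re.compile(
    r"""
    .{2}      # '#' followed by X
    C         # first zinc-coordinating cysteine
    .{1,5}    # X(1-5)
    C         # second zinc-coordinating cysteine
    .{3}      # X3
    .         # '#'
    .{5}      # X5
    .         # '#'
    .{2}      # X2
    H         # first zinc-coordinating histidine
    .{3,6}    # X(3-6)
    [HC]      # final His or Cys
    """,
    re.VERBOSE,
)


def find_c2h2_candidates(protein: str) -> List[Tuple[int, int, str]]:
    """Return (start, end, match) with 1-based inclusive coordinates.

    Only non-overlapping matches are reported, as re.finditer scans
    left to right and resumes after each match.
    """
    return [
        (m.start() + 1, m.end(), m.group())
        for m in C2H2_PATTERN.finditer(protein.upper())
    ]


peptide = "MKFTCELCDYKCTYSSKLVQHMRTHTGEKP"
for start, end, finger in find_c2h2_candidates(peptide):
    print(start, end, finger)
```

A regular expression of this kind is far more permissive than a profile search, so it is best treated as a quick sanity check on finger positions rather than as a domain caller.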
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 36 0.001 0.048 14.5 0.8 1 23 296 318 296 318 0.99
2 36 4.6e-05 0.0021 18.8 3.5 1 23 532 554 532 554 0.98
3 36 0.0059 0.27 12.1 4.3 1 23 560 582 560 582 0.99
4 36 0.00015 0.0068 17.2 4.2 1 23 588 610 588 610 0.99
5 36 9.7e-05 0.0045 17.7 3.7 1 23 616 638 616 638 0.99
6 36 0.00078 0.036 14.9 3.0 1 23 644 666 644 666 0.98
7 36 9.7e-05 0.0045 17.7 3.7 1 23 672 694 672 694 0.99
8 36 9.1e-06 0.00042 21.0 2.4 1 23 700 722 700 722 0.99
9 36 0.0033 0.15 12.9 0.8 1 23 890 912 890 912 0.98
10 36 0.03 1.4 9.9 4.4 1 23 918 940 918 940 0.98
11 36 0.00015 0.0068 17.2 4.2 1 23 946 968 946 968 0.99
12 36 9.6e-05 0.0044 17.8 4.4 1 23 974 996 974 996 0.98
13 36 0.0013 0.059 14.2 2.2 1 23 1002 1024 1002 1024 0.98
14 36 0.00026 0.012 16.4 3.2 1 23 1192 1214 1192 1214 0.98
15 36 0.02 0.91 10.5 7.5 1 23 1220 1242 1220 1242 0.99
16 36 0.02 0.91 10.5 7.9 1 23 1248 1270 1248 1270 0.99
17 36 7.6e-05 0.0035 18.1 1.3 1 23 1276 1298 1276 1298 0.98
18 36 0.00019 0.0088 16.8 2.3 1 23 1304 1326 1304 1326 0.97
19 36 1.1 53 4.9 6.9 1 23 1332 1354 1332 1354 0.98
20 36 7.5e-06 0.00035 21.2 3.7 1 23 1360 1382 1360 1382 0.99
21 36 1.2e-05 0.00058 20.5 2.9 1 23 1388 1410 1388 1410 0.98
22 36 8.6e-05 0.004 17.9 2.3 1 23 1416 1438 1416 1438 0.99
23 36 5e-06 0.00023 21.8 3.2 1 23 1445 1467 1445 1467 0.99
24 36 1.2e-05 0.00056 20.6 2.9 1 23 1474 1496 1474 1496 0.99
25 36 1.2e-05 0.00056 20.6 2.9 1 23 1503 1525 1503 1525 0.99
26 36 5e-06 0.00023 21.8 3.2 1 23 1532 1554 1532 1554 0.99
27 36 5e-06 0.00023 21.8 3.2 1 23 1560 1582 1560 1582 0.99
28 36 5e-06 0.00023 21.8 3.2 1 23 1588 1610 1588 1610 0.99
29 36 0.0023 0.11 13.4 2.4 1 23 1616 1638 1616 1638 0.97
30 36 0.00056 0.026 15.3 2.0 1 23 1644 1666 1644 1666 0.98
31 36 0.00019 0.0088 16.8 2.3 1 23 1672 1694 1672 1694 0.97
32 36 0.00016 0.0074 17.1 2.4 1 23 1700 1722 1700 1722 0.98
33 36 3.9e-05 0.0018 19.0 2.7 1 23 1728 1750 1728 1750 0.98
34 36 0.00022 0.01 16.6 3.7 1 23 1756 1778 1756 1778 0.98
35 36 1.8e-05 0.00084 20.0 3.4 1 23 1784 1806 1784 1806 0.99
36 36 0.00068 0.031 15.1 2.9 1 23 1812 1834 1812 1834 0.98
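
Each row of the table above has 13 whitespace-separated fields: domain number, total domain count, conditional E-value, independent E-value, score, bias, then the start and end of the HMM, alignment and envelope coordinates, and finally the accuracy. A minimal sketch of parsing such rows and filtering on the independent E-value follows. The names `DomainHit`, `parse_hit` and `confident_hits` are hypothetical, and the 0.01 cutoff is an arbitrary illustration, not a threshold used by this record. The three sample rows are copied from the table.

```python
from typing import Iterable, List, NamedTuple


class DomainHit(NamedTuple):
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# One converter per column, in the same order as the DomainHit fields.
_CONVERTERS = (int, int, float, float, float, float,
               int, int, int, int, int, int, float)


def parse_hit(line: str) -> DomainHit:
    """Parse one whitespace-separated domain row into a DomainHit."""
    fields = line.split()
    if len(fields) != len(_CONVERTERS):
        raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
    return DomainHit(*(conv(val) for conv, val in zip(_CONVERTERS, fields)))


def confident_hits(hits: Iterable[DomainHit],
                   max_i_evalue: float = 0.01) -> List[DomainHit]:
    """Keep hits at or below an (assumed, illustrative) i-Evalue cutoff."""
    return [h for h in hits if h.i_evalue <= max_i_evalue]


ROWS = [
    "1 36 0.001 0.048 14.5 0.8 1 23 296 318 296 318 0.99",
    "19 36 1.1 53 4.9 6.9 1 23 1332 1354 1332 1354 0.98",
    "23 36 5e-06 0.00023 21.8 3.2 1 23 1445 1467 1445 1467 0.99",
]

hits = [parse_hit(row) for row in ROWS]
for hit in confident_hits(hits):
    print(hit.index, hit.ali_from, hit.ali_to, hit.i_evalue)
```

With these three rows and the illustrative cutoff, only domain 23 is retained; domain 19 (i-Evalue 53) is the weakest hit in the table.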

Sequence Information

Coding Sequence
ATGCGTTGTTGTGTGCCATTCTGCACAAATACTCCCACCAATGTGTCCACATCTGAGTCGAAGGGGATCAGTTTTCACGCTCTCCCCAGTGATGTGCGTCTGCGCGCTGCTTGGCTCAGAGCCCTCGGCAAACAAGACAGTCAGCTACCAGACTCTGCTGTGGTCTGCTCCCAGCATTTTCTTGATGATGAAATGTATGAAACAGAAAGTGGCTTCAGGCAGATTGCTACTGGTGCTGTTCCTTCAACAGTGCAGGTTTGCATGATATGTCTAAACACTGACAACAAGCTGTATCTAATGAGTAAACACAAATTGGAAGAAACATATGAGCGGTTAACCGGATATCCATTGTATGATCAAGGAAACCTGAAACGAACACTTTGTGTACAATGTGCTCAGAGACTAATAAACTTTAGTAGATTCAGAGACAAGAGCTTGAGAGCCCGTGCACTGATGATGGACTTAGTTGAAAAACATGAATTAATAACACAGCGACATATCAAAGGGATAAACCGCACAAAGAACCAACTAAAGAGTAATTTTGTGGTGAAAACTCTAGGACCTGACCACTGTGACTTATACATACTGGACTCCTCAGACAAACAGACAGAATCAGAGCCAATCGGCTATAGTGCTGTTGTGAAGAATGAAGAAAGTTTTGACTCGCCGGTTGATGAGGACATGGAAGTGGCAGAGGAAGATGACAATACTGCAGACAATGTCAATGATGAGTTTGTTGCATGTAACGAGGAGTGCTTCTCTGACGATAGCATAATGTTGGAAACGAAATTGCTGGATGATGTTATTTGCAAAGCTCTGGAGAACCCAGTATTTGCAGAACAAGTGGCTGAAATACCGGATTTGTTAAAGTGCGAAAGTGCTGCCTTCCAATGTGCTTTATGTTCTGAGGAGTTTGTCAGTGAACATGCGTACATGCAACATATGCGCATGCATGACCAGAACAGTGACGGTTATGCTGACTGTGGCACGTCTCAAGTATGCAAGCCTCATACAGCCGTGAGCTCCTCACACTCTCCACACCTCACTGAGAACAAGCAGGCCATTCAGCTGAGGCAGGCCCCAAGCCCGTCCGCACACTCGCCTCGAACTGCAGTCGCTCCTCTGTCCGCGAGACTTGCGACGAGTAACGGAAACAACGTGCAAGCTGAAGACGCCGATACAGTCCAGAAGAGTGAACCACTATTTAAAATTAACAATGTCGGATTAAACAACCAATTATACAATACCAACAATATAACTGACATTAATAGATTAACAGATTGTGTGGTCATATTATATGATATTTTCAAAAAGCCGCAAGAAGGTGTACCAATACAAGGTGAATCACTGGCGACAATGCGACCAAGTGTTAGTTCTTGGGACAGTCAGCGTCATATTGAAAATGCCTCTAAAGATTTTAACTGTCAGGCAGTTAGTCGGAATGAAGACACTAGACATGCGATAGGAGACATTCGACCAGTTACTAGTAAACTGAAAGTTACTGAGCTTAATGCTACGAAAACTTCACAAAGTAAAACTTATTGTTCAAATAATTACAACAGTGAAGATAGTTTGCTTCATTACAAAAATGGGTATATTTGTGATGTATGTAGACATAAGTTTAAAAGGAAAAGTCTCTTAGTTAAACACATAAAGACTCACAGTGAAGTAGAACCGTTCACTTGCAAGTATTGCCAATTCAAAGCTATATATAAATTTAGTTTAAAGAAACATATGCAAACTCACACGGGTGACAAACCATTTACTTGCGAGTTATGCGACTACAAATGTACATACAGTAGTAAACTAGTGCAACACATGAGAACTCACACGGGTGAAAAACCTTACGTTTGTAAGTTATGTGATCATAAATTTGCACTATCAAGTCATTTAGTGAGACACATGAGTACTCACACGGGTGAAAAACCTTTTACTTGCGAGTTATGCGATTACAAATGTGCATTTAAGTCTAATCTGGCGAGGCACAAAATAACTCACAC
GGGTGAAAAACCTTATGTTTGTAAGTTATGTGATCACAAATTTGCACTATCGAGTCATTTAGTGAGACACATGAGTACTCACACGGGTGAAAAACCTTTTACTTGCGAGTTATGCGACTACAAATGTGCACAAAATAGTAATTTAGTGCAACACATGAGAACTCACACGGGTGAAAAACCTTACATTTTCGCTCCTCTGTCCGCGAGACTTGCGACGAGTAACGGAAACAACGTGCAAGCTGAAGACGCCGATACAGTCCAGAAGAGTAAACCACTATTTAAAATTAACAATGTCGAATTAAACAACCAATTATTCAATGCTAACAATATAACTGACATTAATAGATTGACTGATTGTGTGGTCATATTATATGATATTTTCAAAAAGCCCGAAGTAGGTGTACCAATACAAGGTGAATCACTGGCGACAATGCGACCAAGTATTAGTTCTTGGGACAGTCAGCGTCATATTGAACAGGCCTCTAAAGATTTTAACTGTCAGGCAGTTAGTCGGAATGAAAACACTAGACATACGATAGGAGACATTCGACTAGTTACTAGTGAACTGAAAGTTACTGAGCTTAATGCTACGAAAACTTCACAAAGTAAAGCTTATTGTTCAAATGATTACAATAGTGAAGATAGTTTGCTTAATTACAAAAATGGGTATATTTGTGATGTATGCAGAAATACGTTTAAATGGAAAAGTCTCTTAGTTAAACACATAAAGACTCACAGTGAAGTGAAACCGTTCACTTGCAAGTTTTGCCAATTCAAAGCTATATATAAATTTAGTTTAAAGAAACATATGCAAACTCACACGGTTGACAAACCATTTACTTGCGAGTTATGCGACTACAAATGTACATACAGTAGTAAACTAGTGCAACACATGAGAACTCACACGGGTGAAAAACCTTTTACTTGCGAGTTATGCGATTACAAATGTGCACAATCAAGTCATTTAGTGAGACACATGAGTACTCACACGGGTGAAAAACCTTTTACTTGCGAGTTATGCGATTACAAATGTGCACTTAAGTCTAATCTAGCGAGGCACAAAATAACTCACACAGGTGAAAAACCTTACGTTTTCGCTCCTCTGTCCGCGAGACTTGCGACGAGTAACGGAAACAACGTGCAAGCTGAAGACGCCGATACAGTCCAGAAGAGTGAACCACTATTTATAATTAACAATGTCGGATTAAACAACCAATTATTCGATACCAACAATATAACTGACATTAATAGATTGACTGATTGTGTGGTCATATTATATGATATTTTCAAAAAGCCCCAAGAAGGTGTACCAATACAAGGTGAATCACTGGCGACAATGCGACCAAGTGTTAGTTCTTGGGACAGTCAGCGTTATATTGAAAATGCCTCTAAAGATTTTAACTGTCAGGCAGTTAGTCGGAATGAAAACACTAGACATACGATAGGAGACATTCGACTAGTTACTAGTGAACTGAAAGTTACTGAGCTTAATGCTACGAAAACTTCACAAAGTAAAACTTATTGTTCAAATAATTACAATATCGAAGATAGTTTGCCTTATGACAAAAATGGGTATATTTGTGATGTATGTAGACATAAGTTTAAACGGAAAAGTGTCTTAGTTAAACACATAAAGACTCACAGTGAAGTGAAACCGTTCACTTGCGAGTTTTGCCAATTCAAAACTAAATATAAATGGTGTTTAAAGAAACATATGCGAACTCACACTGGTGACAAACCTTTTACTTGCGAGCTATGCGATTACAAATGTGCATGTAAAAGTCATTTAGTGAGACACATGAGAACTCACACGGGTGAAAAGCCTTACATTTGTAAGTTATGTGATTATAAATTTGCAGTAACTGGTCATTTAGTGAGCCACATGAGAACTCACACGGGTGAAAAACCTTTTGCTTGCGAGCTATGCGATTACAAATGTGCACGTAAGTCTAGTCTGGCGAGCCACAGGAGAACTCACACGGGTGAAAAACCTTTTAATT
GCACATTATGCAAATATAAATGTGCGGATTCTACAAATTTTGTGTGCCACATGAGAACTCACACTGGTGAAAAACCTTACACTTGCGAGTTATGCGACTACAAATGTACACAAAGTAGTAGTCTAGTGAAACACATGAGAACTCACACGGGTGAAAAACCTTTTATTTGCGAGTTATGCGACTACAAATGTACACAAAGAAGTAATCTAGTGCAACACATGAGAACTCACACGGGTGAAAAACCTTTTACTTGCGAGTTATGTGACTACAAATGTGCACAAAGTAGAGATCTAGTGCAACACATGAGAACCCACACGGGTGAAAAAAAACCTTTTACTTGCGAGTTATGCGATTACAAATGTACACAAAGTAGTGATCTAGTGCGACACATGAGAACCCACACGGGTGAAAAAAAACCTTTTACTTGCGAGTTATGCGATTACAAATGTACACAAAGTAGTGATCTAGTGCAACACATGAGAACCCACACGGGTGAAAAAAAACCTTTTACTTGCGAGTTATGCGATTACAAATGTACACAAAGTAGTGATCTAGTGCAACACATGAGAACCCACACGGGTGAAAAAAAACCTTTTACTTGCGAGTTATGCGATTACAAATGTACACAAAGTAGTGATCTAGTGCGACACATGAGAACTCACACGGGTGAAAAACCTTTTACTTGCGAGTTATGCGACTACAAATGTACACAAAGTAGTGATCTAGTGCGACACATGAGAACTCACACGGGTGAAAAACCTTTTACTTGCGAGTTATGCGACTACAAATGTACACAAAGTAGTGATCTAGTGCGACACATGAGAACTCACACGGGTGAAAAACCTTTTGCTTGCGAGCTATGCGATTACAAATGTGCAAGTAAGTTTAGTCTGGCGAGCCACAGGAGAACTCACACGGGTGAAAAACCTTACATTTGTAAGTTATGTGATCATAAATTTGCAGTAACTGGTCATTTAGTGAGCCACATGAGAATTCACACGGGTGAAAAACCTTTTGCTTGCGAGCTATGCGATTACAAATGTGCACGTAAGTCTAGTCTGGCGAGCCACAGGAGAACTCACACGGGTGAAAAACCTTTTAATTGCACATTATGCAAATATAAATGTGCGGATTCTAGCAATTTAGTGAGCCACATGAGAATTCACACTGGTGAAAAACCTTACATTTGTAAGTTATGTGGTTATAAATGTGCACGAAGTAGTTATCTAGTGCAACACATGAAAACTCACACGGGTGAAAAACCTTTTACTTGCGAGTTATGCGACTACAAATGTACACAAAGTAAGAATCTAGTGCAACACATGACAACTCACACGGGTGAAAAAACTTTTACTTGCGAGTTATGCGACTACAAATGTACACAAAGTAGTAGTCTAGTGCAACACATGAGAACTCACACGGGTGAAAAACGTTTTTCTTGTGAGTTGTGCGAGTACAAGTGTGTAGACAAAAGAAATTTAGTGAGGCACTTGAAAACACACTTCTAA
Protein Sequence
MRCCVPFCTNTPTNVSTSESKGISFHALPSDVRLRAAWLRALGKQDSQLPDSAVVCSQHFLDDEMYETESGFRQIATGAVPSTVQVCMICLNTDNKLYLMSKHKLEETYERLTGYPLYDQGNLKRTLCVQCAQRLINFSRFRDKSLRARALMMDLVEKHELITQRHIKGINRTKNQLKSNFVVKTLGPDHCDLYILDSSDKQTESEPIGYSAVVKNEESFDSPVDEDMEVAEEDDNTADNVNDEFVACNEECFSDDSIMLETKLLDDVICKALENPVFAEQVAEIPDLLKCESAAFQCALCSEEFVSEHAYMQHMRMHDQNSDGYADCGTSQVCKPHTAVSSSHSPHLTENKQAIQLRQAPSPSAHSPRTAVAPLSARLATSNGNNVQAEDADTVQKSEPLFKINNVGLNNQLYNTNNITDINRLTDCVVILYDIFKKPQEGVPIQGESLATMRPSVSSWDSQRHIENASKDFNCQAVSRNEDTRHAIGDIRPVTSKLKVTELNATKTSQSKTYCSNNYNSEDSLLHYKNGYICDVCRHKFKRKSLLVKHIKTHSEVEPFTCKYCQFKAIYKFSLKKHMQTHTGDKPFTCELCDYKCTYSSKLVQHMRTHTGEKPYVCKLCDHKFALSSHLVRHMSTHTGEKPFTCELCDYKCAFKSNLARHKITHTGEKPYVCKLCDHKFALSSHLVRHMSTHTGEKPFTCELCDYKCAQNSNLVQHMRTHTGEKPYIFAPLSARLATSNGNNVQAEDADTVQKSKPLFKINNVELNNQLFNANNITDINRLTDCVVILYDIFKKPEVGVPIQGESLATMRPSISSWDSQRHIEQASKDFNCQAVSRNENTRHTIGDIRLVTSELKVTELNATKTSQSKAYCSNDYNSEDSLLNYKNGYICDVCRNTFKWKSLLVKHIKTHSEVKPFTCKFCQFKAIYKFSLKKHMQTHTVDKPFTCELCDYKCTYSSKLVQHMRTHTGEKPFTCELCDYKCAQSSHLVRHMSTHTGEKPFTCELCDYKCALKSNLARHKITHTGEKPYVFAPLSARLATSNGNNVQAEDADTVQKSEPLFIINNVGLNNQLFDTNNITDINRLTDCVVILYDIFKKPQEGVPIQGESLATMRPSVSSWDSQRYIENASKDFNCQAVSRNENTRHTIGDIRLVTSELKVTELNATKTSQSKTYCSNNYNIEDSLPYDKNGYICDVCRHKFKRKSVLVKHIKTHSEVKPFTCEFCQFKTKYKWCLKKHMRTHTGDKPFTCELCDYKCACKSHLVRHMRTHTGEKPYICKLCDYKFAVTGHLVSHMRTHTGEKPFACELCDYKCARKSSLASHRRTHTGEKPFNCTLCKYKCADSTNFVCHMRTHTGEKPYTCELCDYKCTQSSSLVKHMRTHTGEKPFICELCDYKCTQRSNLVQHMRTHTGEKPFTCELCDYKCAQSRDLVQHMRTHTGEKKPFTCELCDYKCTQSSDLVRHMRTHTGEKKPFTCELCDYKCTQSSDLVQHMRTHTGEKKPFTCELCDYKCTQSSDLVQHMRTHTGEKKPFTCELCDYKCTQSSDLVRHMRTHTGEKPFTCELCDYKCTQSSDLVRHMRTHTGEKPFTCELCDYKCTQSSDLVRHMRTHTGEKPFACELCDYKCASKFSLASHRRTHTGEKPYICKLCDHKFAVTGHLVSHMRIHTGEKPFACELCDYKCARKSSLASHRRTHTGEKPFNCTLCKYKCADSSNLVSHMRIHTGEKPYICKLCGYKCARSSYLVQHMKTHTGEKPFTCELCDYKCTQSKNLVQHMTTHTGEKTFTCELCDYKCTQSSSLVQHMRTHTGEKRFSCELCEYKCVDKRNLVRHLKTHF
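
The coding and protein sequences above are related by codon translation. The sketch below translates a nucleotide string in frame, assuming the standard genetic code (the record does not state which translation table was used). The function name `translate` is hypothetical. The two test fragments are the first 21 nt and the last 9 nt of the coding sequence in this entry.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (third base varies fastest). ASSUMPTION: the standard table applies.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate an in-frame coding sequence into one-letter amino acids.

    Unknown codons (e.g. containing N) become 'X'. With to_stop=True,
    translation ends at the first stop codon, which is not emitted.
    """
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCGTTGTTGTGTGCCATTC"))  # start of the CDS
print(translate("CACTTCTAA"))              # end of the CDS, incl. stop
```

The first fragment yields MRCCVPF and the second yields HF, which agree with the first and last residues of the protein sequence listed above.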

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-
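
The three identity levels above correspond to clustering runs at different minimum sequence identities. The record does not say how MMseqs2 was invoked, so the following is only a sketch of how such runs might be assembled with the `mmseqs easy-cluster` workflow and its `--min-seq-id` option. The helper name `easy_cluster_command`, the input file `tf_proteins.fasta`, the output prefixes and the `tmp` directory are all hypothetical. The code only builds and prints the command lines; it does not run MMseqs2.

```python
import shlex
from typing import List


def easy_cluster_command(fasta: str, out_prefix: str, tmp_dir: str,
                         min_seq_id: float) -> List[str]:
    """Build an argv list for 'mmseqs easy-cluster' at one identity cutoff."""
    if not 0.0 <= min_seq_id <= 1.0:
        raise ValueError("min_seq_id must be a fraction between 0 and 1")
    return [
        "mmseqs", "easy-cluster",
        fasta, out_prefix, tmp_dir,
        "--min-seq-id", str(min_seq_id),
    ]


# Hypothetical file names; thresholds mirror the 100/90/80% levels above.
for threshold in (1.0, 0.9, 0.8):
    cmd = easy_cluster_command(
        "tf_proteins.fasta",
        f"clusters_{round(threshold * 100)}",
        "tmp",
        threshold,
    )
    print(shlex.join(cmd))
```

A "-" at a given level, as in this entry, indicates that no other transcription factor is listed for that identity threshold.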