Basic Information

Gene Symbol
-
Assembly
GCA_907164805.1
Location
OU015470.1:25302020-25359096[-]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 47 0.021 1.8 10.1 7.2 1 23 33 55 33 55 0.98
2 47 0.00091 0.08 14.4 1.1 1 20 62 81 62 83 0.96
3 47 3.1e-05 0.0027 19.0 0.3 1 21 272 292 272 293 0.96
4 47 1.9e-05 0.0017 19.6 8.1 1 23 301 324 301 324 0.97
5 47 0.00038 0.033 15.6 2.1 1 21 343 363 343 364 0.96
6 47 3.4e-05 0.0031 18.8 4.2 1 23 372 395 372 395 0.97
7 47 0.0056 0.5 11.9 3.4 1 21 418 438 418 439 0.93
8 47 0.0014 0.12 13.8 2.2 1 21 467 487 467 488 0.93
9 47 0.00018 0.016 16.6 1.0 1 21 523 543 523 544 0.96
10 47 0.018 1.6 10.3 0.4 1 23 552 575 552 575 0.96
11 47 0.0019 0.17 13.3 2.5 1 22 598 619 598 619 0.95
12 47 1.4e-05 0.0013 20.0 1.7 1 21 625 645 625 646 0.95
13 47 0.027 2.4 9.7 1.3 1 19 693 711 693 711 0.98
14 47 0.013 1.2 10.7 0.0 2 20 721 739 720 741 0.93
15 47 5e-05 0.0044 18.3 0.7 1 21 749 769 749 770 0.94
16 47 0.00015 0.014 16.8 2.0 1 21 790 810 790 811 0.96
17 47 9.8e-06 0.00087 20.5 2.0 1 21 819 839 819 840 0.96
18 47 1.7 1.5e+02 4.1 1.9 1 21 869 889 869 890 0.91
19 47 2.1e-05 0.0019 19.5 5.6 1 21 898 918 898 919 0.96
20 47 0.0098 0.87 11.1 0.2 3 20 951 968 950 970 0.92
21 47 0.00018 0.016 16.5 1.4 2 21 979 998 978 999 0.94
22 47 0.00057 0.051 15.0 0.4 2 21 1006 1025 1005 1026 0.94
23 47 0.019 1.7 10.2 0.5 1 20 1058 1077 1058 1079 0.93
24 47 0.0096 0.85 11.1 3.7 1 21 1088 1108 1088 1109 0.95
25 47 0.00036 0.031 15.6 0.3 1 20 1133 1152 1133 1154 0.94
26 47 0.00094 0.084 14.3 4.3 1 21 1163 1183 1163 1184 0.96
27 47 0.0014 0.12 13.8 0.7 1 21 1213 1233 1213 1234 0.90
28 47 0.2 18 7.0 0.3 2 21 1268 1287 1267 1288 0.93
29 47 6.5e-05 0.0057 18.0 3.5 1 23 1296 1319 1296 1319 0.96
30 47 0.26 23 6.6 0.2 2 21 1344 1363 1343 1364 0.93
31 47 0.0004 0.036 15.5 1.7 1 23 1372 1395 1372 1395 0.95
32 47 0.0085 0.75 11.3 5.6 1 21 1418 1438 1418 1439 0.95
33 47 0.34 30 6.2 0.5 2 11 1446 1455 1445 1459 0.87
34 47 0.11 10 7.7 3.1 2 21 1479 1498 1478 1499 0.94
35 47 0.19 17 7.0 3.3 2 21 1539 1558 1538 1559 0.94
36 47 0.14 12 7.5 5.5 1 21 1598 1618 1598 1619 0.95
37 47 0.21 18 6.9 5.2 1 21 1658 1678 1658 1679 0.96
38 47 2 1.8e+02 3.8 0.2 2 10 1686 1694 1685 1699 0.88
39 47 0.14 12 7.5 5.5 1 21 1718 1738 1718 1739 0.95
40 47 9.6 8.5e+02 1.7 0.5 2 10 1746 1754 1745 1754 0.92
41 47 0.053 4.7 8.8 1.8 1 21 1778 1798 1778 1799 0.96
42 47 0.00077 0.069 14.6 3.5 2 21 1806 1825 1805 1826 0.94
43 47 0.00012 0.01 17.2 1.6 3 21 1852 1870 1851 1871 0.96
44 47 0.00039 0.035 15.5 2.3 1 20 1879 1898 1879 1900 0.95
45 47 4.4e-05 0.0039 18.5 1.8 1 23 1938 1961 1938 1961 0.97
46 47 0.0013 0.11 13.9 1.2 1 21 1987 2007 1987 2008 0.95
47 47 0.0048 0.42 12.1 0.9 1 23 2016 2038 2016 2038 0.98

Sequence Information

Coding Sequence
ATGTGTAATAATGATGTTCCATTCTTTGTTTCAGAATCGCCTTCAGTATGGAATCCTTATATCAACCCGAGTGGAAGGCAGGACGAAGCGTTTCGCTATCACTGCCCACGTTGCAAGCGAAGCTACAAAGCGAAGAAGACACTCAACAGTCATATGAAACATCACTGCGGTCAAGAGCCTCGCTTCCAATGCCCACATTGCCCTTATCGCTCCAAGCAGAAGGGGCACGTGAAACGTCATATCATTGAAGAGTTCCGCCAGATATTGAGCATGCTCCGAGACAACCAGAAGAAACTTGACTACATCATCCAACTCGTGGGACCAGCTGACGGGGTTTCTCTGTCTTCATCTGCCAGTGTGGTGGCTCCGGACTTCCCACACAACTTACCGTTCCAGTCCGATGAGCAACTAGTGGAATTTGAAGAATATTTGAAAAATGGCGAGAACTTGAAAAAATTGTGCTGTCGAATCGGCTTGGTTGGCGGCGCCGACCTGAGAGACTGCGTACGCCGCGTCATGTGCTGCATCATAACGACCGAACTGGCCAAGTCTCTGAATTGGAAGAATCGACTGAAGCCTGTTAGCCTCATTGATCTCATCTATGTTAAAAAGCTGATTCGAGGAGCTGTGCGCCAAAATACCACGTTCAATGCAGCTACAGATGCCGAGCTTGACACCTTATGCAAGATTTACTTGCAGAATGCGGTTCATAAAGGAAAGAGCCGCAATAAAACAAAGGAACCGGCTAAAACCTCCACTTCTGCGTGGTGTCGGTCTCGCGCGTCTCTTGACCGCAGCGGAGGTCCGTTCCGCTTCCCGTGCCCGAACTGCGGCAAGGACTACAAGTACAAGCCGGATCTTAACAGACACATGAAGCAGGAATGCGGAAAAGAACCGCAATTCGCGTGTCAACATTGTCACAGGCGGTTCAGACAGAAGTCGAACTTGACTACGCACATGAAGAAAACTCACAGTACGTCCACTGTCTGGTTTCCGTCGCGCGATCGCAGTTGTGACGGGTTTCGCTTTATTTGTCCGAATTGCTACAAGAACTACAAGCACAAGAAAGATCTCAACAGGCATCTGAGAGACGAATGCGGGAAAGAAGCGCAATACGCGTGTCAACATTGCCAGAAACGATACAAACAGAAGTCGACCTTGAATACACATATAAAGAGAGCTCACAGCGCGTCGAATGTCTTTTCACTGCACATGCTTCCGTCGCGCGATCGCAGCTGTGAAGAATCTCGTTTCCTCTGTCCAAGTTGTCACAAGCACTACAAGTACAGGAAAGATCTCAACAGACATCTGAGAGATGAATGCGGGAAAGAACCACAATTCGCGTGTACGTCCTCTGTCTGGCTCCCGTCGCGCGACCGCAGCTGTGACGCGACTCGTTTTCTGTGCCCGACTTGCGACAGACATTACAAGTACAAGAAAGATCTCAACAGGCATCTCAAACAAGATTGCGGGAAAGACCCACAGTTCTCCTGTCCTGGGATTTTTGTTCCGTTTCCGTACGAAGATGACGGCGAGTTTTCCGGATCGCAGGACCTTGACATCCGGTTCCGGTGCCCGACGTGTGACCGCACGTACCAGAGGAAACGCACGTTGGCACGACATATCAAGTACGAGTGCAACAAGGAACCACAATTCAGATGCCCGTTGTGCGACTTCTCTGCGAAACGAAAAGTGGTTTTGAAGTCACATTTACTCAACTTGCATAGGGATCGGGCTACGAATCCCAGCGCCCTGCAGCCCGTAGTCTCTGGCCAAGCATGGTACTCACGCTTCCACTGCGAGCGTTGTGACAAGAGCTACACTCAGAGGAGCAGATTGGCCTACCACATGAAGCGGGAATGTGGACGTAGCTTTCAATGTCCGTCGTGTAATCGCAGCTATTCACGCAAATCTCACCTGACCAGTCATATCCTCAAGCGTGATTGCGAAAAGCCGCTGAATTGTTCCAAGTGCATGACGTTGTTCCAGACTCAGACAGGCCTCAGGAATCACTATAGCGTGAACGTGCCTGCCGTAAGTATCGGCCCCGTATCAAACACGGTTCACTCATCGCCATACCGCTGCGCGCGCTGCAACATCAGCTACCCGCACAAGGAGCACCTGCGCTGGCATTGCGACCATGAGTGCGGCAAATGCCTGGAGTGTCCTACTTGTAAGAAACCGTACAAGTCCATAGCAGCGTTGAACAACCATATCGTCAAGAAGGAGTGTGGAAAAGCATCGTTCGTGTGTCCAGTGTGTAGTAAAGTGTATTCGCATAAAGAGAATCTCGTTCGGCATGTGAATCATGAGTGTGGCAAGGAGCTGCAATGTCCTCGTTTTGATGTTCGTCCTGTGAGCACACGGTTCCGGTGTGAACGCTGCAGCAGGACCTACTCACGCAAGGACACGCTCAACAATCACATGAAATACGAGTGCGGACAGAAGACTTTGTTCCAATGTCCCACATGCGGCAAGAAATACTCGCATAAGAATAACCTGTCGAAGCACATTAACCATGAGTGTGGAAAACGACTTGAATCAGCTGTTGAAATGAAACGGATATTTGTCGCACCGTTTCCGAGTGTGTTGGATGTGCCACGCTTCCTCTGCCCACGCTGTCACGCTCGGTTCCAACATAGAATCACGTTCACGGAACACGTCAAGAGTGAATGTGGTCAAAAGAATGTTTTTTCTTGTCACCAGTGTAATAAGACGTTCACGCACAAATATAATCTGTCCAGGCATATCAAGTATGAGTGTGGAAAAGAGCTGCAATTTGCTGCTGGCACACGACACATCACTGTCCGTGCCTTGGACACTCTCACGTATGGAGCTTCACTGCCTTCGTGTGCCCGCTGCGACAAAACCTTCACCACCACGCGAGGTCTCGTCCGTCACGTGTCTGTGTGTGGTCTGCCCAAGAAATCCCTCACATGCCCGTCGTGCAACAAGTGCTATTCACGTAAAGGCGATCTAGAGGCTCATATCAAACACGCGTGTGGGAAATTTCTGCAATGCCGTGATTGCAAGAAGACGTTCACTTCTAAAACTGGATTGGATGCCCACGTCAGAAGTTGTTGTGATAACCAATCATTTCCGTGCATTGGTGTTTGGGCTCAACCCATTACTGTCCGTGCATTGGACAGCCTCACATATGATACGCCACAATTCCACTGTGACCGCTGCAGCAAGGGCTTCAAAACCAAGCAGGGTCTCCTCATACACGTGTCAATCGTGTGTGGTGTGCCCAAGCAATCTTTCATATGTCCGTCATGTGAGAAATGCTTCAACCGTGAGAAGTGTTTAATAGCTCATATTAAACACGAGTGTGTTGGTGTTTGGTCTCAGCGTATCGCTGTGCGTCATGTAGAAAGTCTGACCGAGCCTCAGCGATTCAACTGCGAGCGATGCGGTAAGAGTTACAAGAACAAGCAGTCTCTCACCATCCACGTGTCGGCCGAGTGCGGGGCGACCAAGAAGTCGTTCCGTTGTCCAACGTGCGACAAATGCTATTCACGAGAGAAACATCTGATGGGTCATATTAAACATGAGTGTGATGATTCTCCTGAGTTTCTGTTTGTTGGCACATCAGCCGGCCTTGAGACGTCAGTGGTACAGATGCCGCCTCTGGTATTCCCCTGTGTGCCGTGTGGCAGATCGTACAAACGTAAATCAGATCTCAATAGGCACAAGAAGTTCGAGTGCGGCAAACCAGCCACATTTTGCTGTAGTTTTTGTCCGTATCGCGGTTTTGGTGCTGGTTGGTGCATGCCCAACGCAAATATGAGGAAATCGCTTCAGTGTTCAGTTTGCTTCAAGAAGTATCAGTCTGTCGAGAGTTTAGTTCGTCATGTCAAGTATGAGTGTGCGAAACAGCCGCAATTTCCGTGTCCTCATTGCGATTATATTTGCAAGCGGAAGGATAACCTTAAATTACACATAATTAGAAAACATGGTTTAAATAGCATTCATTATGGTGTTGATGCGGGTCGGTGCTTGTCCTACGCGAGTAGCAGTAAACTTCTTGAATGTTCAGTTTGCTTCAAGAAGTATCAGTCTGTCGAGAGTTTAGCTCGTCATGTCAAGTATGAGTGTGCGAAAGAGCCGCAATTTCCGTGTCCTCATTGCGTTTACATTTCCAAGCGAAAGGATAATCTTAAGCGACACATAGTTAGAAAGCATGATAATGGTCACTCGATGCACATGTTCATCGCCTCGGTTGCTAGTCGTCCACCACCAGCATTGCGCTTCCATTGTCCGAACTGCAAGAAAAGCTACACACAGAAGAAAAAGTTGAATTACCATCTGAAACACGAATGTGGAAAAGACTTGCGTTGTCCTTACTGTGATAAAAAATTTGTTTATGATGGTCACTCGATGCACATGTTCAACACCTCGGTTGCCAGTCGTCAACCACCAGCATCTCCTCTCCATTGTCCGAACTGCAAGAAACAATTCGTGCTGAAGAAGAAGTTGACGTACCATATAAAACACGAATGTGGAAAGGACTTACGTTGTCCTCGTTGTAAAAAAAAGTCTCTTTATGATGGTCACTCGATGCACATGTTCAACACCTCGGTTGCCAGTCGTCAACCACCAGCATCTCCTCTCCATTGTCCGAACTGCAAGAAACAATTCGAGCTGAAGAAGAAGTTGACGTACCATATAAAACACGAATGTGGAAAGGACTTAAGTTGTCCTCGTTGTAAAAAAGCATTCCTTCATGATGGTCACTCGATGCACATGTTCAACACCTCGGTTGCCAGTCGTCAACCACCAGCATTTCCCTTCCATTGTCCGAACTGCAAGAAACGGTACAAGCTGAAGAAGAAGTTGACGTACCATATAAAACACGAATGTGGAAAGGACTCAAGTTGTCCTCGTTGTAAAAAAAAGTTTCTTTATAATGGCCAGTCAATGAAAATTGTCATCGCCTCGATTGCTAGTCTTCAAAAACCAGCATTTCCCTTCCATTGTCCGAACTGCAAGCAACGGTACAAGCTGAAGAAGACGTTGAATTACCATATGAAACACGAATGTGGAAAGGACTTACGTTGTCCTCGTTGTGATAAAACATTTCTTCATAATGGTCACTCGATGCACATGTTCATCTCCTCGGTTTCTAGTCTTCAACCACCAGCATTTCCCTTCCATTGTCCGAACTGCAAGAAAAGATACAAGCTGAAGAAGAAGTTGACGTACCATATAAAACACGAATGTGGAAAGGACTTACGTTGTCCTCGTTGTAATAAAAAGTTTCTTTGTTGGTCCAACTCGCTACCGATTGTTTACGGCTCGGTTGTCAGTCAGGAGGAAATATCCCAACGTTTTCAGTGTTTAACCTGCACCAAGAGTTATAACGCCGAGAGAAACCTTATGTATCACATGAAGCATGAGTGCGGGAAAGTTCTACGTTGTCCAACGTGCAGTAAACAGTTCAAGTACAAATCTCATTTGTTTACACACATCAACCACGATTGTGATCGGTTGCAGTTTTCTTGGAATGTAGCTCGTCAGCTAATACCCGAAGAGGACCTAAGCCGTGCCTTTTGCCCCACTTGTCCCAAGAGTTATAAGAATAGAAATCACTTGCTACGGCACATGAAGTATGAGTGTGGTAAAGAACCTCAGTTTCAGTGTCCGCATTGCAATCATCGGACGAAACACAAAGATAATTTGAAAGCACATGTTGTAGTTTGGAACATTGTTATTGGCCAGAGATTTCATTGTCTCTGTTGCCAGCGTAGTTATCGCCATCACCGTGACCTCAAGAGGCACCTGTTGTACGAGTGCGGAGTGGCACCGCGATTCAACTGCACGATATGTTTCCGCGGTTTCAAACAGAAAGTCTCACTTAAAAGACATATCAACGTCAAGCACAGGGAAGATTTTAATGCTTGTCTGCCATCACTCTCGTCATCGTTCGACAACCAGGGCTTGTTTTACGGGTTCGGTTTCCAGTGCGGTAAGTGCAGTAAAGTCTACAAGGAGAAGAAAACGCTGAACCGCCATGTCAACTACGAGTGTCAGAAGGAGCCGCAGTTCCAATGTCCTCTGTGTCCTCATCGGGCCAAGCAGAAGGCAGCGATGAAAAAACATTTGATAAGACATTCGAATCTCGACAAAATGTTGAACCAATGA
Protein Sequence
MCNNDVPFFVSESPSVWNPYINPSGRQDEAFRYHCPRCKRSYKAKKTLNSHMKHHCGQEPRFQCPHCPYRSKQKGHVKRHIIEEFRQILSMLRDNQKKLDYIIQLVGPADGVSLSSSASVVAPDFPHNLPFQSDEQLVEFEEYLKNGENLKKLCCRIGLVGGADLRDCVRRVMCCIITTELAKSLNWKNRLKPVSLIDLIYVKKLIRGAVRQNTTFNAATDAELDTLCKIYLQNAVHKGKSRNKTKEPAKTSTSAWCRSRASLDRSGGPFRFPCPNCGKDYKYKPDLNRHMKQECGKEPQFACQHCHRRFRQKSNLTTHMKKTHSTSTVWFPSRDRSCDGFRFICPNCYKNYKHKKDLNRHLRDECGKEAQYACQHCQKRYKQKSTLNTHIKRAHSASNVFSLHMLPSRDRSCEESRFLCPSCHKHYKYRKDLNRHLRDECGKEPQFACTSSVWLPSRDRSCDATRFLCPTCDRHYKYKKDLNRHLKQDCGKDPQFSCPGIFVPFPYEDDGEFSGSQDLDIRFRCPTCDRTYQRKRTLARHIKYECNKEPQFRCPLCDFSAKRKVVLKSHLLNLHRDRATNPSALQPVVSGQAWYSRFHCERCDKSYTQRSRLAYHMKRECGRSFQCPSCNRSYSRKSHLTSHILKRDCEKPLNCSKCMTLFQTQTGLRNHYSVNVPAVSIGPVSNTVHSSPYRCARCNISYPHKEHLRWHCDHECGKCLECPTCKKPYKSIAALNNHIVKKECGKASFVCPVCSKVYSHKENLVRHVNHECGKELQCPRFDVRPVSTRFRCERCSRTYSRKDTLNNHMKYECGQKTLFQCPTCGKKYSHKNNLSKHINHECGKRLESAVEMKRIFVAPFPSVLDVPRFLCPRCHARFQHRITFTEHVKSECGQKNVFSCHQCNKTFTHKYNLSRHIKYECGKELQFAAGTRHITVRALDTLTYGASLPSCARCDKTFTTTRGLVRHVSVCGLPKKSLTCPSCNKCYSRKGDLEAHIKHACGKFLQCRDCKKTFTSKTGLDAHVRSCCDNQSFPCIGVWAQPITVRALDSLTYDTPQFHCDRCSKGFKTKQGLLIHVSIVCGVPKQSFICPSCEKCFNREKCLIAHIKHECVGVWSQRIAVRHVESLTEPQRFNCERCGKSYKNKQSLTIHVSAECGATKKSFRCPTCDKCYSREKHLMGHIKHECDDSPEFLFVGTSAGLETSVVQMPPLVFPCVPCGRSYKRKSDLNRHKKFECGKPATFCCSFCPYRGFGAGWCMPNANMRKSLQCSVCFKKYQSVESLVRHVKYECAKQPQFPCPHCDYICKRKDNLKLHIIRKHGLNSIHYGVDAGRCLSYASSSKLLECSVCFKKYQSVESLARHVKYECAKEPQFPCPHCVYISKRKDNLKRHIVRKHDNGHSMHMFIASVASRPPPALRFHCPNCKKSYTQKKKLNYHLKHECGKDLRCPYCDKKFVYDGHSMHMFNTSVASRQPPASPLHCPNCKKQFVLKKKLTYHIKHECGKDLRCPRCKKKSLYDGHSMHMFNTSVASRQPPASPLHCPNCKKQFELKKKLTYHIKHECGKDLSCPRCKKAFLHDGHSMHMFNTSVASRQPPAFPFHCPNCKKRYKLKKKLTYHIKHECGKDSSCPRCKKKFLYNGQSMKIVIASIASLQKPAFPFHCPNCKQRYKLKKTLNYHMKHECGKDLRCPRCDKTFLHNGHSMHMFISSVSSLQPPAFPFHCPNCKKRYKLKKKLTYHIKHECGKDLRCPRCNKKFLCWSNSLPIVYGSVVSQEEISQRFQCLTCTKSYNAERNLMYHMKHECGKVLRCPTCSKQFKYKSHLFTHINHDCDRLQFSWNVARQLIPEEDLSRAFCPTCPKSYKNRNHLLRHMKYECGKEPQFQCPHCNHRTKHKDNLKAHVVVWNIVIGQRFHCLCCQRSYRHHRDLKRHLLYECGVAPRFNCTICFRGFKQKVSLKRHINVKHREDFNACLPSLSSSFDNQGLFYGFGFQCGKCSKVYKEKKTLNRHVNYECQKEPQFQCPLCPHRAKQKAAMKKHLIRHSNLDKMLNQ

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-