Basic Information

Gene Symbol
-
Assembly
GCA_907164805.1
Location
OU015470.1:25127484-25176879[-]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 49 0.00061 0.054 14.9 1.0 1 21 8 28 8 29 0.96
2 49 0.00075 0.067 14.6 4.2 1 21 66 86 66 87 0.96
3 49 0.00045 0.04 15.3 6.4 1 21 95 115 95 121 0.97
4 49 0.013 1.1 10.7 1.2 1 21 122 142 122 143 0.95
5 49 0.035 3.1 9.3 1.2 1 21 151 171 151 172 0.92
6 49 0.00018 0.016 16.6 0.2 1 21 178 198 178 199 0.95
7 49 0.1 8.9 7.9 0.9 1 22 212 233 212 233 0.95
8 49 0.00012 0.01 17.2 1.7 1 21 240 260 240 261 0.96
9 49 0.3 27 6.4 5.4 1 20 267 286 267 288 0.94
10 49 0.87 77 5.0 4.0 1 21 318 338 318 339 0.91
11 49 1e-05 0.00088 20.5 0.7 1 21 346 366 346 367 0.97
12 49 0.14 13 7.4 0.7 1 21 389 409 389 410 0.95
13 49 4e-05 0.0036 18.6 0.3 1 21 417 437 417 438 0.97
14 49 0.0074 0.66 11.5 2.3 1 20 472 491 472 493 0.94
15 49 0.00026 0.023 16.1 0.5 1 21 501 521 501 522 0.94
16 49 8.1 7.1e+02 1.9 2.3 1 19 546 564 546 567 0.93
17 49 0.00078 0.069 14.6 1.1 1 21 638 658 638 659 0.94
18 49 0.00043 0.038 15.4 1.9 2 21 665 684 664 685 0.94
19 49 0.008 0.71 11.4 1.4 1 19 706 724 706 726 0.95
20 49 0.11 9.9 7.8 4.5 1 20 746 765 746 767 0.93
21 49 0.24 21 6.7 1.0 2 13 776 787 775 788 0.89
22 49 0.011 0.95 11.0 1.7 1 21 807 827 807 828 0.94
23 49 0.065 5.7 8.5 3.5 1 21 836 856 836 857 0.95
24 49 0.0016 0.14 13.6 0.7 1 21 863 883 863 884 0.94
25 49 0.00014 0.012 16.9 1.8 1 22 907 928 907 928 0.96
26 49 0.0085 0.76 11.3 2.8 1 21 935 955 935 956 0.96
27 49 0.00031 0.027 15.8 2.4 1 21 1006 1026 1006 1027 0.96
28 49 0.001 0.091 14.2 0.7 1 21 1035 1055 1035 1056 0.96
29 49 0.11 9.5 7.8 0.4 1 21 1062 1082 1062 1083 0.95
30 49 1e-05 0.00093 20.4 3.5 1 21 1091 1111 1091 1112 0.96
31 49 0.0049 0.43 12.0 7.7 1 23 1118 1141 1118 1141 0.97
32 49 0.01 0.89 11.1 2.7 2 21 1165 1184 1164 1185 0.94
33 49 0.013 1.2 10.7 0.4 2 21 1219 1238 1218 1239 0.93
34 49 0.077 6.9 8.3 4.1 1 21 1259 1279 1259 1280 0.95
35 49 7.6 6.7e+02 2.0 1.8 2 11 1287 1296 1286 1302 0.87
36 49 0.0067 0.59 11.6 1.7 1 21 1319 1339 1319 1340 0.94
37 49 0.0099 0.88 11.1 0.6 2 21 1347 1366 1346 1367 0.94
38 49 0.0098 0.86 11.1 0.8 1 21 1396 1416 1396 1417 0.96
39 49 0.0086 0.76 11.3 1.4 2 21 1424 1443 1423 1444 0.94
40 49 2.8e-05 0.0025 19.1 2.0 1 21 1470 1490 1470 1491 0.96
41 49 0.63 56 5.4 4.3 1 21 1507 1527 1507 1528 0.93
42 49 1.3e-06 0.00011 23.3 1.6 1 23 1535 1557 1535 1557 0.98
43 49 0.014 1.2 10.6 3.3 2 21 1582 1601 1581 1602 0.89
44 49 0.018 1.6 10.2 5.2 1 21 1628 1648 1628 1649 0.96
45 49 0.083 7.4 8.2 1.1 2 21 1658 1677 1657 1678 0.94
46 49 0.0087 0.77 11.3 0.3 1 21 1684 1704 1684 1705 0.95
47 49 0.013 1.1 10.7 1.9 2 21 1719 1738 1718 1739 0.91
48 49 1.8 1.6e+02 4.0 4.6 1 23 1747 1770 1747 1770 0.90
49 49 0.0085 0.76 11.3 10.0 1 23 1796 1819 1796 1819 0.96

Sequence Information

Coding Sequence
ATGGTATCTTCTGCATCACGCTTCAACTGCACCATTTGCTACAAGAGCTACAAGTACAAATACGATCTAGGCAAACACATCAGAGAGGAGTGTGGGAAGAACAATTCAGCGAAACCGTTCGCTTGCACAGCCTATCTTGAGGATGTTCAGTCGATTATTGTCACTCCATCTCTGGACTTGTTTCAAGAATCTCGCTTCCAATGCGAGCGCTGCAACAAGAGCTATAAACACAAACCCACCCTCAACTCACACATGAAACACGAGTGTGGTAAACGCACGTCGTTCCAGTGTTCCTCGTGCAACAAGTGTTTCACCCTGAAGAAGAACTTGACTCGTCACATGAACCACGACTGCGGCAAGTCGTACAAGTGCATGTCTTGCCAGAAGAGTTACGAGACGTACAGTGCTCTGAATAGTCATGTGAATCACGAGTGTGGCCGGAGTGACAGATACAAGTGTGTGAATTGTAGCAAAAGTTTCAAGCAGAAGATTCGGTTGGCGTTTCATATGAGGAACGAGTGTGGTGTGATGTTTGTATGTCCGTCTTGTAATGTGAGATTTGGCACTAAAAGTGATCTGAAACGGCATATCATCAAGTGTGATGCTGCTAGAGCCGAGTCTGACTCGTCACTCTTCCACTGCGCGCGCTGCAATAGTGATTTCGATGACAAGACCGGCCTATCCCGGCACATTAGGCGAGTGTGCGGGAAGACAACATTCCAGTGTCCTACATGTACCAATCGCTTCACCCGAACGGACAGTTTGAAACGTCACATGAAATATGACTGTGGCAAGAACTTCATGTGTCCCAAATGCATGAAGTGTTATCAGACGAAGCATAGTTTAAGGCATCACGTTTCTAATGTGTGTGGTGAAGATGGATTCCACCCTTATGCTGCTGAAAACGAGTCAATATACCTTGCTCCACCCGATCTCTCTGACACGTCACTCTTCCACTGCTTTTGCTGCAATAGGAATTTCGATGACAAGACTGGCCTATCCCGGCACATTAGACAAGTGTGCGGGAAGACAACATTCCAGTGCCCCACATGTACTAAACGCTTCACGCGAACGGAAGGCTTGAGGCGGCACATGAAGGACGACTGTGGCAAGAATTTCGTGTGCCCTAGATATGCTGCTAGAGCCGAGGCTAACTCGTCACTATTCCACTGCGCGCGCTGCAATAGTGATTTCGATGACAAGACCGGCCTATCCAGGCACATTAGACAAGTGTGCGGGAAGACATCATTCCAGTGCCCCACATGTACCAAACTCTTCGCCCGAAAGGAAGGCTTGAGGCGGCACATGAAGGACGACTGTGGGAAGAATTTCATGTGTCCCAGATACGTTGCCGGCATTTGGTCCACTGTTGACAGTACTTCCTCGAACCCGTTCGAGAACAACTCCGCTCGTTTCCACTGCGCACGCTGCACGCGCAGCTACAAACGTAAACAGGATCTGGTCAAGCATATGACTTACAATTGTGGGAAGATCGAATCATTTCCATGTCCCAAATGTAACAGTTCGTTCTCGAGAAAGAAATCTTTGGAGACTCATGTAAATCAAAATTGCGATGGTGCCGACCCGCAGCCCATTGCTATAAGCTCAGTCATGAGTCTGCGCACTGGTTCACGCTTCTGTTGCGCGCGCTGCAAGAGTAGCTACAAATACAAGCAAGATCTAGTCACACATTCCGTGACGTCATTCGGGGTGAAGTGGAGCGACCTTGAAGTGGCCAAGCGGGATAGGAGAGAGGTGTCAAAAACAATGCTGATCAAAACAATATGCATTGTTCATGTAGCCTACCATCCTCAACAACATATATGGGTTCCAGTAGCGAATCGTCGGGGTGGTGATTGTCCGCAGCTTTTCTCCATCAGCTCAGTGAGGAGTCTGAGTGAAGCTCCACGCTTCAATTGCACGCGCTGCAAGAAAACGTACAAACGTAAGGTGGACCTAGTCAGGCACGTGAACAACGTATGCGGGATACTCAAATGCCCGGTGTGCTCACTTAAATTCAGGTTTAAAAGCAACTTGATTCGTCATAGGAAGCACGCTGATGAATTCGAATTTGTTGTTGTCACTCCGGCGTTGGACGTGTATGATGCTTCTCGTTTCCACTGCGATATGTGTAACAAGAGCTACAAGTACAAGCGTGACCTTGTCAGTCACAGCGCTGATGAATTCCAATGTGTTGCTGTCACTCCGGCGTTGGACGTGTATGATGCTTCTCGTTTCCACTGCGATAGGTGTAACAAGAGCTACAAGTACAAGCGTGACCTTGTCAGTCATAGCAAACACGATTGTGGGGAACAGAAACCATTACAATGTCCGACATGTAATAAGTGTTTCGCAGACAGTGGTGTAACTCAGTCTATCGTTGTCACTCCGGAACTGGACCTATACGACACTTCAGGCTACCACTGTCCCCGCTGCAACAAGGACTTCGAGGACAAGTACCTGTTCTTCACTCACGTCCAGCTCGAGTGCGGGAAGAAAGAAATCTTCCGATGTCCAACTTGCTTCTCGTATTTCTCTCAGAAGTGCAGTCTAACTCGCCACGTGAAGGAGGACTGTGGCAAGACGTACGCGTGTCCCTCGTGCCAGAAGACCTACAAGACCAAGAAGACGCTCAGCGATCACCTCAACAATGAGTGTGGTGTAGTCGACACTGAGCACATCTTCGTCACCCCTGCGTGGAGCCTTTACGCGCATCGCTACCACTGCGCACTCTGCAAGAAGAGTTACACGCAGAAGAGCAGTCTCACCGTGCATCTCAAGACTTGCGGCAAGAACGCGGTGTTCAAGTGCGGCAAATGTAACTCTCATTTCTCGTTCAAGAGCGGATTAATGCATCACATCAAATGGGACTGCGGTAAAACGGAGCACTGTACTCTACCACTGCGAGCGCTGTCACAAGATCTATACAAGCAAGCCCTCTTATTACAGACATTTGAAGCCAAGGCGCAGTCCATAATCGTCTCCCCCGACTTGGGCCTCTTCGATTCTCCGCGCTACCAATGCGCGCGCTGCAACAAAAGCTACAAACACAACAAACATCTCATCAGCCACATCAAAGCCGAGTGTGGAATGCAAGGCCTGTTCAAATGTGCTTCGTGCTTGAAAACATTCTCGCGTGAGAGAAATCTCACCGATCATATCAAACAGGAATGTGGTAAAACATTCATTTGTCCCAAATGTAATAAACTTTACCAATCGATGACGACATTCAACGCGCACATCAAGTATGAGTGTGGTAAACAGGCGTCGTTCGTTTGTCCGGATTGCAGTAAGTGCTTCAAGCAGAAGCCGAATCTGACGTATCACATGAGACACGAGTGTGGAGTCTCGTTCCATTGCTCGGCCTGCACGAAGAGCTTCGTTGGTAAGAGCCACTTGAACCGTCACGTGAGGAAGTGTCATAGCCAAGATATACCCTCACTGGTGCCGTTCACGTTCTGCTCGGTCGCCAGTCACCAGGACTCTTCTCTCCAGTGCCCGAACTGCCACAGGCGCTTCAAGCTGCAGAAAGCGCTCACTTACCACTTGAAACACGACTGTGGCAAGAACAACCAATGTCCGACGTGCTTGAAACGCTACTTGGAGAGGAAGCAAGTCCTCTATCACATCAAACACGAGTGTGGTAAAGATCTTCGCTGTCCAACTTGCAACAAACAATTCAAACTTATGCGTTATCTGAAAGCTCACGTACAAAGTGTTGTGTTCTCGGTCGCTGTTCCTAGAGGCCGTGCCCAGGAAGTGATATCACAGCGCTTCAAATGTCGCAACTGCAAGAGGAGCTACAAGGAACAGAAGAGCCTGACTTACCACGTAAAACATGAGTGTGGCAAGAAGATGCAATGCTCCAAATGTTCTAAACGCTTCACATGCAATTCTTCTTGGACACCTACGGCCGTCACGTCGGTTCTCCGTGTTCTTGATACGCCGAAGCGCTTCATATGTCTAGCCTGCAACAGAAGTTATACGCGACAGAAAATGCTCACGTATCATATGCGGCATGAATGTGGGAAGAAGCTGCAATGTTCTGTTTGTTTGAAGCTTTTCTCGTTCAAAGGACATTTGAATGTTCATATCAAGTACGAATGCGGCAAAGAGATACATCTTTCTTCTTCGGTAAATACTGTAACCAGTTCGATGGTCACTCTTCCAGTCGGAAAACAGCGCTTTGAATGTTCAGCCTGTGATAAGAGCTACGTGGCGAAGAAGCTTCTCACTTATCACATGCGACATGAGTGTGGCAAAGATCTACGGTGTACAGCTTGCAATAAACAGTTTGCATATCTCTCCAAGTTAACAACTCATATGAAACATGAATGTGATGTTCATTCTTCAATGCCTATGGTGGTTATTACTCCTATAGACACTCTTCTGGAGAAAACACAGCGCTTTCAATGCCCAGCCTGCGACAAGAGCTACACGAAGAAGAGTAACCTCACGTATCACATGAAACACGAGTGTGGTAAAGAGATACGCTGTGCGAGCGCAGAGTTACCACGCTACCACTGCGCGCTCTGTGACAGAAACTATAAAAACTACTGCGATCTGGTCAAGCATTTTAAACGCTGCGGCAAGAATATCTCGTACAACTGTCCAACGTGTGACAAGAGTTATACTCGCAAGGATTCTGTTATCAGGCACATGAAGAAACATATGCCGAGAATATTTGGTGTTCCTCTCCTCATCAGCTCAGTTGTCAGTCTTCAGGAGCCAGTGCCACGTTTACATTGTCCGACTTGTGACAAGAGTTACAAGCGTAAAAGGCACTTGACTTTCCATCTGAAATACGAGTGTGGTAAAGAGATGCAATACATCTCAGTCAGCCCGTACTTCACGATCAGCCTGGATCGAGATCCGTCGCGTTTCTACTGCCCGACGTGTTACCGCCATTACGCCCACAAACACTCGCTCACCAACCACTTGCGGTACGAGTGCGGCAAAAACGAGCCCGTGCAATGCGGCTCGTGCGTCATGAAGTTCAAGCAGAAAGCACAATTGACTTATCATCAGAAGCACGAGTGTGGACGTGTATTCGAGTGTGAAATCTGTAACGATAAGTTTAATCGAAAAGTAAATTACAATGCTCATGTGAAAAACAAGTTGTACTTGTGTGCTGGTTTGTCGCGAGATCGGTGTGTCTGCTTCGCCTGTGGCAAGAGCTACACGTTCATGTACAACCTGAAGAGGCATGTGAAATACGAGTGTGGGAAAGATCCACAATTCTCTTGCATGTTTTGCCCATATCGATCCAAGCAGAAAGCTCATGTCAAAAGACACACAGTTCATTTACACAAGATTGGCGGAACGTTCTGGGTGCCGACACCGGAACTACTACGTGTCACTGTGAGTCGTTCCCCAAGAGTCCGCTTCCAGTGTCCACATTGCAGCAAAGACTACAAGCACAAGTGTCATCTGAAAGATCACATCAAGGCCAAACACCAAGGTCTTCACAAGGCAATAACCGCGTAA
Protein Sequence
MVSSASRFNCTICYKSYKYKYDLGKHIREECGKNNSAKPFACTAYLEDVQSIIVTPSLDLFQESRFQCERCNKSYKHKPTLNSHMKHECGKRTSFQCSSCNKCFTLKKNLTRHMNHDCGKSYKCMSCQKSYETYSALNSHVNHECGRSDRYKCVNCSKSFKQKIRLAFHMRNECGVMFVCPSCNVRFGTKSDLKRHIIKCDAARAESDSSLFHCARCNSDFDDKTGLSRHIRRVCGKTTFQCPTCTNRFTRTDSLKRHMKYDCGKNFMCPKCMKCYQTKHSLRHHVSNVCGEDGFHPYAAENESIYLAPPDLSDTSLFHCFCCNRNFDDKTGLSRHIRQVCGKTTFQCPTCTKRFTRTEGLRRHMKDDCGKNFVCPRYAARAEANSSLFHCARCNSDFDDKTGLSRHIRQVCGKTSFQCPTCTKLFARKEGLRRHMKDDCGKNFMCPRYVAGIWSTVDSTSSNPFENNSARFHCARCTRSYKRKQDLVKHMTYNCGKIESFPCPKCNSSFSRKKSLETHVNQNCDGADPQPIAISSVMSLRTGSRFCCARCKSSYKYKQDLVTHSVTSFGVKWSDLEVAKRDRREVSKTMLIKTICIVHVAYHPQQHIWVPVANRRGGDCPQLFSISSVRSLSEAPRFNCTRCKKTYKRKVDLVRHVNNVCGILKCPVCSLKFRFKSNLIRHRKHADEFEFVVVTPALDVYDASRFHCDMCNKSYKYKRDLVSHSADEFQCVAVTPALDVYDASRFHCDRCNKSYKYKRDLVSHSKHDCGEQKPLQCPTCNKCFADSGVTQSIVVTPELDLYDTSGYHCPRCNKDFEDKYLFFTHVQLECGKKEIFRCPTCFSYFSQKCSLTRHVKEDCGKTYACPSCQKTYKTKKTLSDHLNNECGVVDTEHIFVTPAWSLYAHRYHCALCKKSYTQKSSLTVHLKTCGKNAVFKCGKCNSHFSFKSGLMHHIKWDCGKTEHCTLPLRALSQDLYKQALLLQTFEAKAQSIIVSPDLGLFDSPRYQCARCNKSYKHNKHLISHIKAECGMQGLFKCASCLKTFSRERNLTDHIKQECGKTFICPKCNKLYQSMTTFNAHIKYECGKQASFVCPDCSKCFKQKPNLTYHMRHECGVSFHCSACTKSFVGKSHLNRHVRKCHSQDIPSLVPFTFCSVASHQDSSLQCPNCHRRFKLQKALTYHLKHDCGKNNQCPTCLKRYLERKQVLYHIKHECGKDLRCPTCNKQFKLMRYLKAHVQSVVFSVAVPRGRAQEVISQRFKCRNCKRSYKEQKSLTYHVKHECGKKMQCSKCSKRFTCNSSWTPTAVTSVLRVLDTPKRFICLACNRSYTRQKMLTYHMRHECGKKLQCSVCLKLFSFKGHLNVHIKYECGKEIHLSSSVNTVTSSMVTLPVGKQRFECSACDKSYVAKKLLTYHMRHECGKDLRCTACNKQFAYLSKLTTHMKHECDVHSSMPMVVITPIDTLLEKTQRFQCPACDKSYTKKSNLTYHMKHECGKEIRCASAELPRYHCALCDRNYKNYCDLVKHFKRCGKNISYNCPTCDKSYTRKDSVIRHMKKHMPRIFGVPLLISSVVSLQEPVPRLHCPTCDKSYKRKRHLTFHLKYECGKEMQYISVSPYFTISLDRDPSRFYCPTCYRHYAHKHSLTNHLRYECGKNEPVQCGSCVMKFKQKAQLTYHQKHECGRVFECEICNDKFNRKVNYNAHVKNKLYLCAGLSRDRCVCFACGKSYTFMYNLKRHVKYECGKDPQFSCMFCPYRSKQKAHVKRHTVHLHKIGGTFWVPTPELLRVTVSRSPRVRFQCPHCSKDYKHKCHLKDHIKAKHQGLHKAITA

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-