Basic Information

Gene Symbol
-
Assembly
GCA_947086415.1
Location
OX352294.1:38257188-38269346[-]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 29 0.0038 0.42 12.6 0.2 1 23 27 49 27 49 0.98
2 29 0.087 9.7 8.3 4.7 1 20 55 74 55 76 0.96
3 29 0.16 18 7.5 0.1 5 23 84 102 83 102 0.92
4 29 0.094 11 8.2 0.4 2 23 112 134 111 134 0.96
5 29 1.7 1.9e+02 4.2 0.2 2 23 158 180 157 180 0.93
6 29 0.094 11 8.2 0.4 2 23 250 272 249 272 0.96
7 29 0.094 11 8.2 0.4 2 23 296 318 295 318 0.96
8 29 0.12 14 7.8 0.5 2 23 342 364 341 364 0.95
9 29 1.1 1.2e+02 4.9 0.2 2 20 480 498 479 500 0.90
10 29 1.1 1.2e+02 4.9 0.2 2 20 740 758 739 760 0.90
11 29 0.094 11 8.2 0.4 2 23 862 884 861 884 0.96
12 29 0.094 11 8.2 0.4 2 23 908 930 907 930 0.96
13 29 1.1 1.2e+02 4.9 0.2 2 20 954 972 953 974 0.90
14 29 0.094 11 8.2 0.4 2 23 1122 1144 1121 1144 0.96
15 29 0.094 11 8.2 0.4 2 23 1168 1190 1167 1190 0.96
16 29 0.094 11 8.2 0.4 2 23 1214 1236 1213 1236 0.96
17 29 0.094 11 8.2 0.4 2 23 1260 1282 1259 1282 0.96
18 29 0.094 11 8.2 0.4 2 23 1306 1328 1305 1328 0.96
19 29 1.1 1.2e+02 4.9 0.2 2 20 1352 1370 1351 1372 0.90
20 29 0.094 11 8.2 0.4 2 23 1520 1542 1519 1542 0.96
21 29 0.094 11 8.2 0.4 2 23 1566 1588 1565 1588 0.96
22 29 0.96 1.1e+02 5.0 0.2 2 21 1612 1631 1611 1632 0.91
23 29 0.00068 0.075 14.9 0.3 1 23 1852 1874 1852 1874 0.98
24 29 0.00061 0.068 15.1 2.2 1 23 1880 1903 1880 1903 0.96
25 29 4.2e-05 0.0047 18.7 2.0 1 23 1909 1931 1909 1931 0.98
26 29 1.5e-05 0.0017 20.1 1.5 1 23 1937 1959 1937 1959 0.98
27 29 3.5 3.9e+02 3.2 0.8 1 14 1965 1978 1965 1979 0.89
28 29 7e-07 7.8e-05 24.3 1.0 1 23 1986 2008 1986 2008 0.99
29 29 3.8 4.2e+02 3.1 0.3 1 13 2014 2026 2014 2027 0.91

Sequence Information

Coding Sequence
ATGCAAGGAGTGTGCTTCCCAGAGGTAGTGTGCCTATCAGTGCACAAGCTGCGCCTCCACAACCCGTCGGGCATGAGTTACAGCTGTGCGACGTGCGAGCGCCGCTTCGTGTCGCAGTCCGTGCTGGACCTCCACATGGTCACGCACACTAAGGACTACCAGTTCCAGTGTGATGTCTGCCAGAAGAAGTTCAGCTGCCAGGGTTTCCTGACCAAACATATGACACACACAGATGGAAAAGACCGAGAGATTTGCGATCAGTGCGGCGTCTCCGTTGCTAACCTGAAAGCCCACATGCTCCGCCACCGGCCCAAGGAAGAGCGCCGGCTGCTACAGTGCAAGGCGTGCCCGAAGATGTTCTACACCATCGGTGGAAGGAAGAGACATTACAGTACCGCGCATTTGGGGAAGAAGTACAAGTGTGAAGTTTGTAATAAAGGTATGTTGAGATTTGAAGAGCGCCGGCTGCTACAATGCAAGGCGTGCCCGAAGATGTTCTACGCCATCGGTGGAAGGAAGAGACATTACAGTACCGCGCATTTGGGGAATAAGTACAAGTGTGAAGTTTGTAATAAAGGTATGTTGAGATTTGAAGAGCGCCTGCTGCTGCAGTGCAAGGTGTGGTCTAAGATGTTCTGCACCATCGGTGGAAGAAAGAGACATTACAGTACCGCGCATTTGGGGAAGAAATACAAGTGTGAAGTTTGTGATAAAGGTATGTTGAGATTTGAAGAGCGCCTGCTGCTGCAGTGCAAGGCGTGCCCGAAGATGTTCTACACCATCGGTGGAAGAAAGAGACATTACAGTACCGCGCATTTGGGGAAGAAATACAAGTGTGAAGATTGTGATAAAGGTATGTTGAGATTTGAAGAGCGCCTGCTGCTGCAGTGCAAGGCGTGCCCGAAGATGTTCTACACCATCGGTGGAAGAAAGAGACATTACAGTACCGCGCATTTGGGGAAGAAGTACAAGTGTGAAGTTTGTGATAAAGGTATGTTGAGATTTGAAGAGCGCCTGCTGCTGCAGTGCAAGGCGTGCCCGAAGATGTTCTACACCATCGGTGGGAAGAAGAGACATTACAGCACCGTGCATTTGGGGAAGAAATACAAGTGTGAAGTTTGTGATAAAGGTATGTTGAGATTTGAAGAGCGCCTGCTGCTGCAGTGCAAGGCGTGGCCTAAGATGCTCTACACCATCGGTGGAAGAAAGAGACATTACAGTACCGCGCATTTGGGGAAGAAGTACAAGTGTAAAGTTTGTAATAAAGGTATGTTGAAATTTGAAGAGCGCCGGCTGCTGCAGTGCAAGGCGTGGCCTAAGATGCTCTACACCATCGGTGGAAGGAAGAGATATTACGGTACCGCGCATTTGGGGAAGAAATACAAGTGTGAAGTGTGTGATAAAGGTATGTTGAGATTTGAAGAGCGCCTGCTGCTGCAGTGCAAGGCGTGCCCGAAAATGTTCTACACCATCGGTGGGAAGAAGAGACATTACAGTATGTTGAGATTTGAAGAGCGCCGGCTGCTGCAGTGCAAGGCGTGGCCTAAGATGTTCTACACCATCGGTGGAGAGAAGAGACAACACAGTACCGCGCATTTGGGGAAGAAATACAAGTATGAAGTTTGTAACAAAGGTATGTTGAGACTTGAAGAGCGCCTGCTGCTGCAGTGCAAGGCGTGCCCGAAGATGTTCTACACCATCGGTGGAAGGAAGAGACGTTACAGTACCGCGCATTTGGGGAAGAAGTACAATTGTGAAGATTGTAATAGGGGTATGTTGAGATTTGAAGAGCGCCGGCTGCTGCAGTGCAAGGCGTGGCCGAAGATGTTCTACACCATCGGTGGAAGAAAGAGACATTACAGTACCGCGCATTTGGGGAAGAAATACAAGTGTGAAGTTTGTGATAAAGGTATGTTGAGATTTGAAGAGCGCCTGCTGCTGCAGTGCAAGGCGTGGCCTAAGATGCTCTACACCATCGGTGGAAGAAAGAGACATTACAGTACCGCGCATTTGGGGAAGAAGTACAAGTGTAAAGTTTGTAATAAAGGTATGTTGAAATTTGAAGAGCGCCGGCTGCTGCAGTGCAAGGCGTGGCCTAAGATGCTCTACACCATCGGTGGAAGGAAGAGATATTACGGTACCGCGCATTTGGGGAAGAAATACAAGTGTGAAGTTTGTGATAAAGGTATGTTGAGATTTGAAGAGCGCCTGCTGCTGCAGTGCAAGGCGTGCCCGAAGATGTTCTACACCATCGGTGGGAAGAAGAGACATTACAGTATGTTGAGATTTGAAGAGCGCCGGCTGCTGCAGTGCAAGGCGTGGCCTAAGATGTTCTACACCATCGGTGGAAAGAAGAGACAACACAGTACCGCGCATTTGGGGAAGAAATACAAGTGTGAAGTTTGTAACAAAGGTATGTTGAGACTTGAAGAGCGCCTGCTGCTGCAGTGCAAGGCGTGCCCGAAGATGTTCTACACCATCGGTGGAAGGAAGAGACAACACAGTACTGCGCATTTGGGGAAGAAATACAAGTGTGACGTTTGTAATAAAGGTATGTTGAGATTTGAAGAGCGCCTGCTGCTGCAGTGCAAGGCGTGCCCGAAGATGTTCTACACCATCGGTGGAAGAAAGAGACATTACAGTACCGCGCATTTGGGGAAGAAATACAAGTGTGAAGTTTGTGATAAAGGTATGTTGAGATTTGAAGAGCGCCTGCTGCTGCAGTGCAAGGCGTGCCCGAAGATGTTCTACACCATCGGTGGAAGAAAGAGACATTACAGTACCGCGCATTTGGGGAAGAAATACAAGTGTGAAGTTTGTGATAAAGGTATGTTGAGATTTGAAGAGCGCCTGCTGCTGCAGTGCAAGGCGTGCCCGAAGATGTTCTACACCATCGGTGGGAAGAAGAGACATTACAGTATGTTGAGATTTGAAGAGCGCCGGCTGCTGCAGTGCAAGGCGTGGCCTAAGATGCTCTACACCATCGGTGGAAGAAAGAGACATTACAGTACCGCGCATTTGGGGAAGAAGTACAAGTGTAAAGTTTGTAATAAAGGTATGTTGAGATTTGAAGAGCGCCGGCTGCTGCAGTGCAAGGCGTGGCCTAAGATGTTCTACACCATCGGTGGAAAGAAGAGACAACACAGTACCGCGCATTTGGGGACGAAATACAAGTGTGAAGTTTGTAACAAAGGTATGTTGAGACTTGAAGAGCGCCTGCTGCTGCAGTGCAAGGCGTGCCCGAAGATGTTCTACACCATCGGTGGAAGGAAGAGACAACACAGTACTGCGCATTTGGGGAAGAAATACAAGTGTGACGTTTGTAATAAAGGTATGTTGAGATTTGAAGAGCGCCTGCTGCTGCAGTGCAAGGCGTGCCCGAAGATGTTCTACACCATCGGTGGAAGAAAGAGACATTACAGTACCGCGCATTTGGGGAAGAAATACAAGTGTGAAGTTTGTGATAAAGGTATGTTGAGATTTGAAGAGCGCCTGCTGCTGCAGTGCAAGGCGTGCCCGAAGATGTTCTACACCATCGGTGGAAGAAAGAGACATTACAGTACCGCGCATTTGGGGAAGAAATACAAGTGTGAAGTTTGTGATAAAGGTATGTTGAGATTTGAAGAGCGCCTGCTGCTGCAGTGCAAGGCGTGCCCGAAGATGTTCTACACCATCGGTGGAAGAAAGAGACATTACAGTACCGCGCATTTGGGGAAGAAATACAAGTGTGAAGTTTGTGATAAAGGTATGTTGAGATTTGAAGAGCGCCTGCTGCTGCAGTGCAAGGCGTGCCCGAAGATGTTCTACACCATCGGTGGAAGAAAGAGACATTACAGTACCGCGCATTTGGGGAAGAAATACAAGTGTGAAGTTTGTGATAAAGGTATGTTGAGATTTGAAGAGCGCCTGCTGCTGCAGTGCAAGGCGTGCCCGAAGATGTTCTACACCATCGGTGGAAGAAAGAGACATTACAGTACCGCGCATTTGGGGAAGAAATACAAGTGTGAAGTTTGTGATAAAGGTATGTTGAGATTTGAAGAGCGCCTGCTGCTGCAGTGCAAGGCGTGCCCGAAGATGTTCTACACCATCGGTGGGAAGAAGAGACATTACAGTATGTTGAGATTTGAAGAGCGCCGGCTGCTGCAGTGCAAGGCGTGGCCTAAGATGCTCTACACCATCGGTGGAAGAAAGAGACATTACAGTACCGCGCATTTGGGGAAGAAGTACAAGTGTAAAGTTTGTAATAAAGGTATGTTGAGATTTGAAGAGCGCCGGCTGCTGCAGTGCAAGGCGTGGCCTAAGATGTTCTACACCATCGGTGGAAAGAAGAGACAACACAGTACCGCGCATTTGGGGACGAAATACAAGTGTGAAGTTTGTAACAAAGGTATGTTGAGACTTGAAGAGCGCCTGCTGCTGCAGTGCAAGGCGTGCCCGAAGATGTTCTACACCATCGGTGGAAGGAAGAGACAACACAGTACTGCGCATTTGGGGAAGAAATACAAGTGTGACGTTTGTAATAAAGGTATGTTGAGATTTGAAGAGCGCCTGCTGCTGCAGTGCAAGGCGTGCCCGAAGATGTTCTACACCATCGGTGGAAGAAAGAGACATTACAGTACCGCGCATTTGGGGAAGAAATACAAGTGTGAAGTTTGTGATAAAGGTATGTTGAGATTTGAAGAGCGCCTGCTGCTGCAGTGCAAGGCGTGCCCGAAGATGTTCTACACCATCGGTGGAAGAAAGAGACATTACAGTACCGCGCATTTGGGGAAGAAATACAAGTGTGAAGTTTGTGATAAAGGTATGTTGAGATTTGAAGAGCGCCTGCTGCTGCAGTGCAAGGCGTGCCCGAAGATGTTCTACACCATCGGTGGGAAGAAGAGACATTACATGCAAGGCGTGCCCGAAGATGTTCTACACCATCGGTGGGAAGAAGAGACATTACAGTACCGTGCATTTGGGAAGAAGTACAAGTGTGAAGTTTGTAATAAAGGTATGTTGAGATTTGAAGAGCGCCGGCTGCTGCAGTGCAAGGCGTGGCCTAAGATGTTCTACACCATCGGTGGAAAGAAGAGACAACACAGTACCGCGCATTTGGGGAAGAAATACAAGTGTGAAGTTTGTAACAAAGGTATGTTGAGACTTGAAGAGCGCCTGCTGCTGCAGTGCAAGGCGTGCCCGAAGATGTTCTACACCATCGGTGGAAGGAAGAGACAACACAGTACTGCGCATTTGGGGAAGAAATACAAGTGTGACGTTTGTAATAAAGGTATGTTGAGATTTGAAGAGCGCCTGCTGCTGCAGTGCAAGGCGTGCCCGAAGATGTTCTACACCATCGGTGGGAAGAAGAGACGTTACAGTACCGCGCATTTGGGGAAGAAGTACAATTGTGAAGATTGTAATAGGGGTATGTTGAGATTTGAAGAGCGCCGGCTGCTGCAGTGCAAGGCGTGGCCTAAGATGTTCTACACCATCGGTGGAAGAAAGAGACATTACAAGGTACAGTGCCTGGCGGTACATAGAAAGCGAATGCACGCTCAGAACGAGCTCCGCTACTCGTGCCTGGAGTGCGGGCGGCGCTGCGTGTCGCAGTCCGTGCTGGCCGAGCACATGCTCATACACACCAGGGACTTCCAGCACGAGTGCCAAATCTGCCAGAAGAAGTTCCGCGCCCAGATTTTCCTACAGAGACATATCCGGCAGGTGCATGAAAAAGAAAGGAACCATCAATGTGAACTCTGTCCAAAGGCATTCTTCAGTAAAGATAAACTACAAGATCATTTAAGATCCCACTCTAACGAACGGCCGTACACCTGCAACGAATGCGGGCTGTCGTTCAAAGTGAAGCACGCGCTCACAGCCCACCTGTTCACGCACAGCGTCGACAAGAAGTTCCACTGTCAACAATGTAGCATGAGCTTCAAGTCCCCCGGATCGCATACAAACGAACGGCCGTACACCTGCAGCGAGTGCGGGCAGTCGTTCACGATGAATAACGCGCTCACGAGGCACCTGCGCACGCACAGCGACGTCAAGAACTACCACTGTCAACTGTGTAGCATGAGCTTCAAATCCCCCGGGTAA
Protein Sequence
MQGVCFPEVVCLSVHKLRLHNPSGMSYSCATCERRFVSQSVLDLHMVTHTKDYQFQCDVCQKKFSCQGFLTKHMTHTDGKDREICDQCGVSVANLKAHMLRHRPKEERRLLQCKACPKMFYTIGGRKRHYSTAHLGKKYKCEVCNKGMLRFEERRLLQCKACPKMFYAIGGRKRHYSTAHLGNKYKCEVCNKGMLRFEERLLLQCKVWSKMFCTIGGRKRHYSTAHLGKKYKCEVCDKGMLRFEERLLLQCKACPKMFYTIGGRKRHYSTAHLGKKYKCEDCDKGMLRFEERLLLQCKACPKMFYTIGGRKRHYSTAHLGKKYKCEVCDKGMLRFEERLLLQCKACPKMFYTIGGKKRHYSTVHLGKKYKCEVCDKGMLRFEERLLLQCKAWPKMLYTIGGRKRHYSTAHLGKKYKCKVCNKGMLKFEERRLLQCKAWPKMLYTIGGRKRYYGTAHLGKKYKCEVCDKGMLRFEERLLLQCKACPKMFYTIGGKKRHYSMLRFEERRLLQCKAWPKMFYTIGGEKRQHSTAHLGKKYKYEVCNKGMLRLEERLLLQCKACPKMFYTIGGRKRRYSTAHLGKKYNCEDCNRGMLRFEERRLLQCKAWPKMFYTIGGRKRHYSTAHLGKKYKCEVCDKGMLRFEERLLLQCKAWPKMLYTIGGRKRHYSTAHLGKKYKCKVCNKGMLKFEERRLLQCKAWPKMLYTIGGRKRYYGTAHLGKKYKCEVCDKGMLRFEERLLLQCKACPKMFYTIGGKKRHYSMLRFEERRLLQCKAWPKMFYTIGGKKRQHSTAHLGKKYKCEVCNKGMLRLEERLLLQCKACPKMFYTIGGRKRQHSTAHLGKKYKCDVCNKGMLRFEERLLLQCKACPKMFYTIGGRKRHYSTAHLGKKYKCEVCDKGMLRFEERLLLQCKACPKMFYTIGGRKRHYSTAHLGKKYKCEVCDKGMLRFEERLLLQCKACPKMFYTIGGKKRHYSMLRFEERRLLQCKAWPKMLYTIGGRKRHYSTAHLGKKYKCKVCNKGMLRFEERRLLQCKAWPKMFYTIGGKKRQHSTAHLGTKYKCEVCNKGMLRLEERLLLQCKACPKMFYTIGGRKRQHSTAHLGKKYKCDVCNKGMLRFEERLLLQCKACPKMFYTIGGRKRHYSTAHLGKKYKCEVCDKGMLRFEERLLLQCKACPKMFYTIGGRKRHYSTAHLGKKYKCEVCDKGMLRFEERLLLQCKACPKMFYTIGGRKRHYSTAHLGKKYKCEVCDKGMLRFEERLLLQCKACPKMFYTIGGRKRHYSTAHLGKKYKCEVCDKGMLRFEERLLLQCKACPKMFYTIGGRKRHYSTAHLGKKYKCEVCDKGMLRFEERLLLQCKACPKMFYTIGGKKRHYSMLRFEERRLLQCKAWPKMLYTIGGRKRHYSTAHLGKKYKCKVCNKGMLRFEERRLLQCKAWPKMFYTIGGKKRQHSTAHLGTKYKCEVCNKGMLRLEERLLLQCKACPKMFYTIGGRKRQHSTAHLGKKYKCDVCNKGMLRFEERLLLQCKACPKMFYTIGGRKRHYSTAHLGKKYKCEVCDKGMLRFEERLLLQCKACPKMFYTIGGRKRHYSTAHLGKKYKCEVCDKGMLRFEERLLLQCKACPKMFYTIGGKKRHYMQGVPEDVLHHRWEEETLQYRAFGKKYKCEVCNKGMLRFEERRLLQCKAWPKMFYTIGGKKRQHSTAHLGKKYKCEVCNKGMLRLEERLLLQCKACPKMFYTIGGRKRQHSTAHLGKKYKCDVCNKGMLRFEERLLLQCKACPKMFYTIGGKKRRYSTAHLGKKYNCEDCNRGMLRFEERRLLQCKAWPKMFYTIGGRKRHYKVQCLAVHRKRMHAQNELRYSCLECGRRCVSQSVLAEHMLIHTRDFQHECQICQKKFRAQIFLQRHIRQVHEKERNHQCELCPKAFFSKDKLQDHLRSHSNERPYTCNECGLSFKVKHALTAHLFTHSVDKKFHCQQCSMSFKSPGSHTNERPYTCSECGQSFTMNNALTRHLRTHSDVKNYHCQLCSMSFKSPG

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-