Basic Information

Gene Symbol
-
Assembly
GCA_963675205.1
Location
OY776092.1:26481525-26503366[+]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 43 0.066 12 8.4 3.0 1 23 271 293 271 293 0.96
2 43 0.00047 0.083 15.1 1.3 2 23 301 322 300 322 0.94
3 43 2.9 5.1e+02 3.2 0.3 1 23 326 348 326 348 0.94
4 43 0.0033 0.58 12.5 0.9 1 23 353 376 353 376 0.98
5 43 0.011 1.9 10.9 0.2 1 23 380 403 380 403 0.94
6 43 0.013 2.3 10.6 0.3 2 20 419 437 419 443 0.93
7 43 0.035 6.2 9.2 1.9 3 23 453 474 453 474 0.97
8 43 2e-05 0.0035 19.5 0.5 1 23 527 549 527 549 0.99
9 43 1.3e-06 0.00024 23.1 2.3 1 23 555 577 555 577 0.99
10 43 0.031 5.5 9.4 1.4 1 21 583 603 583 604 0.95
11 43 0.031 5.5 9.4 1.4 1 21 629 649 629 650 0.95
12 43 0.031 5.5 9.4 1.4 1 21 675 695 675 696 0.95
13 43 0.031 5.5 9.4 1.4 1 21 721 741 721 742 0.95
14 43 0.031 5.5 9.4 1.4 1 21 767 787 767 788 0.95
15 43 0.031 5.5 9.4 1.4 1 21 813 833 813 834 0.95
16 43 0.031 5.5 9.4 1.4 1 21 859 879 859 880 0.95
17 43 0.031 5.5 9.4 1.4 1 21 905 925 905 926 0.95
18 43 0.031 5.5 9.4 1.4 1 21 951 971 951 972 0.95
19 43 0.094 17 7.9 0.8 1 21 997 1017 997 1018 0.94
20 43 0.031 5.5 9.4 1.4 1 21 1043 1063 1043 1064 0.95
21 43 0.031 5.5 9.4 1.4 1 21 1089 1109 1089 1110 0.95
22 43 0.031 5.5 9.4 1.4 1 21 1135 1155 1135 1156 0.95
23 43 0.031 5.5 9.4 1.4 1 21 1181 1201 1181 1202 0.95
24 43 0.031 5.5 9.4 1.4 1 21 1227 1247 1227 1248 0.95
25 43 0.094 17 7.9 0.8 1 21 1273 1293 1273 1294 0.94
26 43 0.031 5.5 9.4 1.4 1 21 1319 1339 1319 1340 0.95
27 43 0.031 5.5 9.4 1.4 1 21 1365 1385 1365 1386 0.95
28 43 0.094 17 7.9 0.8 1 21 1411 1431 1411 1432 0.94
29 43 0.031 5.5 9.4 1.4 1 21 1457 1477 1457 1478 0.95
30 43 0.031 5.5 9.4 1.4 1 21 1503 1523 1503 1524 0.95
31 43 0.031 5.5 9.4 1.4 1 21 1549 1569 1549 1570 0.95
32 43 0.031 5.5 9.4 1.4 1 21 1595 1615 1595 1616 0.95
33 43 0.031 5.5 9.4 1.4 1 21 1641 1661 1641 1662 0.95
34 43 0.094 17 7.9 0.8 1 21 1687 1707 1687 1708 0.94
35 43 0.031 5.5 9.4 1.4 1 21 1733 1753 1733 1754 0.95
36 43 0.031 5.5 9.4 1.4 1 21 1779 1799 1779 1800 0.95
37 43 0.031 5.5 9.4 1.4 1 21 1825 1845 1825 1846 0.95
38 43 0.031 5.5 9.4 1.4 1 21 1871 1891 1871 1892 0.95
39 43 0.031 5.5 9.4 1.4 1 21 1917 1937 1917 1938 0.95
40 43 0.031 5.5 9.4 1.4 1 21 1963 1983 1963 1984 0.95
41 43 0.031 5.5 9.4 1.4 1 21 2009 2029 2009 2030 0.95
42 43 0.0072 1.3 11.4 3.2 1 23 2055 2077 2055 2077 0.98
43 43 0.00018 0.032 16.4 0.3 1 23 2083 2106 2083 2106 0.97

Sequence Information

Coding Sequence
atgGAGCATGTGAAAGTATGTCATATATGTTTGATCATGGACGTAAAAATGTACGACTTGCAGTCATAtccattaaatacatattttgagcCTATTACAGGAcgaaatcTGTTAAGAATTGTCAACTCTCCACCGCATGCCTGTTTTGAGTGTGCATTACTTGTGAAAAAGTATTACTTTTTTAGACAGAAATGTCTCAGAAGCCAAGATATACTTTATGGATTAATTGATCACAGTGGCAAGATAGCTTCAGAAGACATAAAACAAGTAGACCGAGACGGTCTAGTCTTATCTTCGAATCTGGCACCATATACTGTAGTAGGATTATCAACACAGACTCTCTTTGACTCCAAAACTCCTATCAAAGAAGAACAGGATGGTGAGGATGTTCAGAATCATTGTGATGCAGAAGAAAGTGGTATTAAAGAAGAAGATGAATATGAAATGCTGTCTCCAGAAAGGAGTTTCTCTAGTGATGAAGATGATGAACCATTGTCGCTACATAAAGAAAAGAAGATAAAAGAATGTGAGGAGAACCAGGTCAAAGAGGATGAAGATCTTGTTCTGAAATTAGAGCCCACGGAAGAAGAGATATGCGATGAACTTCCCACCAAAAGGAAAAGAGGCCGGGCTAAGAAGTTAAAAGTTCCTAAGAAAACTATTAAACCGAATGGAAAAGCTGTTGGTAGAAAAACATTCAGTGAGAGTATTGAACTGGGCGATTATATTACAGTTATTAACCTATCGATGGAAGAACAAATGGAAGAGATAAACAAACGGAAGGAGTCATCCAACTACCTGAATTCGCCGTTCCAGTGTAACCTGTGCTATAAAGGCTTCATCGACACGCATGCGTGGAAACACCACGTCGGGAAACATTCGCCGAATGCGGGTGATATAGAGTGTCCTATATGCAAGTTCAGATTTAAAACGAAGCGTGCCTTGCAGAAGCATTCGTCTAATCATGAAAAGAAATACGCGTGCAAATCGTGTCCATACATATCTTCGAATacGACTCAGGCCAAGCAACACCAAGGGTGGCACAAGGGCGTCACATACAAATGCAAGTACTGCGATGACATTTTCACTGTGTGGACTTCGTACATGAGTCACGTGCGAATCAAACACCCGTCCGAGTTCATATGCGGTATCTGCGGGTACTCGTACATTAGCAAGCTGGGGCTCACTATGCACAAGACTATGATGCACAAGGATGTCATGGAAAAAGGCGAAGATATCGTAAAAGAAGGCCCGTACTGTGAAGATTGTGACGTCAAGTTCGTGTCTATGGAAGCGTTTAAACGACACATGGTGATGTCCACCAAGCACACTAGGACTACTAAGActatAAATGGTTGTCGCACATGCGGTGATACTTTTAAGAACGCGGAAGACTTGAGACTGCATCATCGGCAAGAGCATGCGAGAAAAAGACCGAAGAACTACGGAAAGAAACCACCTAATATTAGTTGGCCGGCGAAGTGTGAACACTGTTCAGAAGAGATCCCGAACGGTCGCGAGCACTGGTCCCACTACCGGCGCATGCACCCCGACAAGAAGTACCCCATACAGAAGAACTACGTGTGTGATATATGTGGGAAGAGCTTTAGGGGTAACGCGTTCCTCGTATACCACAAGCGCACGCACACGGAAGAACGAGCGTACAAATGTAACCAGTGCGACAAAGCGTTCTTCAACCGCACCAACCTGCAGGTGCATCTAAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTCTGCTTCAAGGCCTTCAAGTGTAAAGGGGCGCTCAACAGACACTTTAGGGTAAGTGTGCTATGCTCACGACACAACCGCACCAACCTGCAGGTGCATCTGAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTCTGCTTCAAGGCGTTCAAGTGTAAAGGGGCGCTCAACAGACACTTTAGGGTAAGTGTGCTATGCTCACGACACAACCGCACCAACCTGCAGGTGCATCTGAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTCTGCTTCAAGGCGTTCAAGTGTAAAGGGGCGCTCAACAGACACTTTAGGGTAAGTGTGCTATGCTCACGACACAACCGCACCAACCTGCAGGTGCATCTGAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTCTGCTTCAAGGCGTTCAAGTGTAAAGGGGCGCTCAACAGACACTTTAGGGTAAGTGTGCTATGCTCACGACACAACCGCACCAACCTGCAGGTGCATCTAAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTCTGCTTCAAGGCCTTCAAGTGTAAAGGGGCGCTCAACAGACACTTTAGGGTAAGTGTGCTATGCTCACGACACAACCGCACCAACCTGCAGGTGCATCTGAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTCTGCTTCAAGGCCTTCAAGTGTAAAGGGGCGCTCAACAGACACTTTAGGGTAAGTGTGCTATGCTCACGACACAACCGCACCAAGCTGCAGGTGCATCTGAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTCTGCTTCAAGGCGTTCAAGTGTAAAGGGGCGCTCAACAGACACTTTAGGGTAAGTGTGCTATGCTCACGACACAACCGCACCAACCTGCAGGTGCATCTGAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTCTGCTTCAAGGCGTTCAAGTGTAAAGGGGCGCTCAACAGACACTTTAGGGTAAGTGTGCTATGCTCACGACACAACCGCACCAACCTGCAGGTGCATCTAAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTCTGCTTCAAGGCCTTCAAGTGTAAAGGGGCGCTCAACAGACACTTTAGGGTAAGTGTGCTATGCTCACGACACAACCGCACCAACCTGCAGGTGCATCTGAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTCTGCTTCAAGGCGTTCAAGTGTAAAGGGGCGCTCAACATACACTTTAGGGTAAGTGTGCTATGCTCACGACACAACCGCACCAACCTGCAGGTGCATCTGAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTGTGCTTCAAGGCGTTCAAGTGTAAAGGGGCGCTCAACAGACACTTTAGGGTAAGTGTGCTATGCTCACGACACAACCGCACCAACCTGCAGGTGCATCTAAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTCTGCTTCAAGGCCTTCAAGTGTAAAGGGGCGCTCAACAGACACTTTAGGGTAAGTGTGCTATGCTCACGACACAACCGCACCAACCTGCAGGTGCATCTGAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTCTGCTTCAAGGCCTTCAAGTGTAAAGGGGCGCTCAACAGACACTTTAGGGTAAGTGTGCTATGCTCACGACACAACCGCACCAAGCTGCAGGTGCATCTGAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTCTGCTTCAAGGCGTTCAAGTGTAAAGGGGCGCTCAACAGACACTTTAGGGTAAGTGTGCTATGCTCACGACACAACCGCACCAACCTGCAGGTGCATCTGAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTCTGCTTCAAGGCGTTCAAGTGTAAAGGGGCGCTCAACAGACACTTTAGGGTAAGTGTGCTATGCTCACGACACAACCGCACCAACCTGCAGGTGCATCTGAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTCTGCTTCAAGGCGTTCAAGTGTAAAGGGGCGCTCAACATACACTTTAGGGTAAGTGTGCTATGCTCACGACACAACCGCACCAACCTGCAGGTGCATCTGAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTCTGCTTCAAGGCGTTCAAGTGTAAAGGGGCGCTCAACAGACACTTTAGGGTAAGTGTGCTATGCTCACGACACAACCGCACCAACCTGCAGGTGCATCTAAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTCTGCTTCAAGGCCTTCAAGTGTAAAGGGGCGCTCAACAGACACTTTAGGGTAAGTGTGCTATGCTCACGACACAACCGCACCAACCTGCAGGTGCATCTGAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTCTGCTTCAAGGCGTTCAAGTGTAAAGGGGCGCTCAACATACACTTTAGGGTAAGTGTGCTATGCTCACGACACAACCGCACCAACCTGCAGGTGCATCTGAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTGTGCTTCAAGGCGTTCAAGTGTAAAGGGGCGCTCAACAGACACTTTAGGGTAAGTGTGCTATGCTCACGACACAACCGCACCAACCTGCAGGTGCATCTAAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTCTGCTTCAAGGCCTTCAAGTGTAAAGGGGCGCTCAACAGACACTTTAGGGTAAGTGTGCTATGCTCACGACACAACCGCACCAACCTGCAGGTGCATCTGAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTCTGCTTCAAGGCCTTCAAGTGTAAAGGGGCGCTCAACAGACACTTTAGGGTAAGTGTGCTATGCTCACGACACAACCGCACCAAGCTGCAGGTGCATCTGAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTCTGCTTCAAGGCGTTCAAGTGTAAAGGGGCGCTCAACAGACACTTTAGGGTAAGTGTGCTATGCTCACGACACAACCGCACCAACCTGCAGGTGCATCTGAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTCTGCTTCAAGGCGTTCAAGTGTAAAGGGGCGCTCAACAGACACTTTAGGGTAAGTGTGCTATGCTCACGACACAACCGCACCAACCTGCAGGTGCATCTGAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTCTGCTTCAAGGCGTTCAAGTGTAAAGGGGCGCTCAACATACACTTTAGGGTAAGTGTGCTATGCTCACGACACAACCGCACCAACCTGCAGGTGCATCTGAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTGTGCTTCAAGGCGTTCAAGTGTAAAGGGGCGCTCAACAGACACTTTAGGGTAAGTGTGCTATGCTCACGACACAACCGCACCAACCTGCAGGTGCATCTAAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTCTGCTTCAAGGCCTTCAAGTGTAAAGGGGCGCTCAACAGACACTTTAGGGTAAGTGTGCTATGCTCACGACACAACCGCACCAACCTGCAGGTGCATCTGAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTCTGCTTCAAGGCCTTCAAGTGTAAAGGGGCGCTCAACAGACACTTTAGGGTAAGTGTGCTATGCTCACGACACAACCGCACCAAGCTGCAGGTGCATCTGAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTCTGCTTCAAGGCGTTCAAGTGTAAAGGGGCGCTCAACAGACACTTTAGGGTAAGTGTGCTATGCTCACGACACAACCGCACCAACCTGCAGGTGCATCTGAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTCTGCTTCAAGGCCTTCAAGTGTAAAGGGGCGCTCAACAGACACTTTAGGGTAAGTGTGCTATGCTCACGACACAACCGCACCAACCTGCAGGTGCATCTGAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTCTGCTTCAAGGCCTTCAAGTGTAAAGGGGCGCTCAACAGACACTTTAGGGTAAGTGTGCTATGCTCACGACACAACCGCACCAACCTGCAGGTGCATCTAAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTCTGCTTCAAGGCCTTCAAGTGTAAAGGGGCGCTCAACAGACACTTTAGGGTAAGTGTGCTATGCTCACGACACAACCGCACCAACCTGCAGGTGCATCTGAAGACTCACAGCGACTCGCGCCCCTACCCCTGCACCGTCTGCTTCAAGGCCTTCAAGTGTAAAGGGGCGCTCAACAGACACTTTAGGAGTCACACCGGCCAGAAACCGTACGAGTGTGAAGTATGCGGCAAGGCGTTCACTCAATCCAACAGTCGCAAACTTCACGTCACCACTGTGCATCTTAAACAACCGGCGCCGTACGTCAGCCGGAGAAGGAACAAATCTGCCGACAAAGAACCGTCACACCAACAGTTTATATACTGA
Protein Sequence
MEHVKVCHICLIMDVKMYDLQSYPLNTYFEPITGRNLLRIVNSPPHACFECALLVKKYYFFRQKCLRSQDILYGLIDHSGKIASEDIKQVDRDGLVLSSNLAPYTVVGLSTQTLFDSKTPIKEEQDGEDVQNHCDAEESGIKEEDEYEMLSPERSFSSDEDDEPLSLHKEKKIKECEENQVKEDEDLVLKLEPTEEEICDELPTKRKRGRAKKLKVPKKTIKPNGKAVGRKTFSESIELGDYITVINLSMEEQMEEINKRKESSNYLNSPFQCNLCYKGFIDTHAWKHHVGKHSPNAGDIECPICKFRFKTKRALQKHSSNHEKKYACKSCPYISSNTTQAKQHQGWHKGVTYKCKYCDDIFTVWTSYMSHVRIKHPSEFICGICGYSYISKLGLTMHKTMMHKDVMEKGEDIVKEGPYCEDCDVKFVSMEAFKRHMVMSTKHTRTTKTINGCRTCGDTFKNAEDLRLHHRQEHARKRPKNYGKKPPNISWPAKCEHCSEEIPNGREHWSHYRRMHPDKKYPIQKNYVCDICGKSFRGNAFLVYHKRTHTEERAYKCNQCDKAFFNRTNLQVHLKTHSDSRPYPCTVCFKAFKCKGALNRHFRVSVLCSRHNRTNLQVHLKTHSDSRPYPCTVCFKAFKCKGALNRHFRVSVLCSRHNRTNLQVHLKTHSDSRPYPCTVCFKAFKCKGALNRHFRVSVLCSRHNRTNLQVHLKTHSDSRPYPCTVCFKAFKCKGALNRHFRVSVLCSRHNRTNLQVHLKTHSDSRPYPCTVCFKAFKCKGALNRHFRVSVLCSRHNRTNLQVHLKTHSDSRPYPCTVCFKAFKCKGALNRHFRVSVLCSRHNRTKLQVHLKTHSDSRPYPCTVCFKAFKCKGALNRHFRVSVLCSRHNRTNLQVHLKTHSDSRPYPCTVCFKAFKCKGALNRHFRVSVLCSRHNRTNLQVHLKTHSDSRPYPCTVCFKAFKCKGALNRHFRVSVLCSRHNRTNLQVHLKTHSDSRPYPCTVCFKAFKCKGALNIHFRVSVLCSRHNRTNLQVHLKTHSDSRPYPCTVCFKAFKCKGALNRHFRVSVLCSRHNRTNLQVHLKTHSDSRPYPCTVCFKAFKCKGALNRHFRVSVLCSRHNRTNLQVHLKTHSDSRPYPCTVCFKAFKCKGALNRHFRVSVLCSRHNRTKLQVHLKTHSDSRPYPCTVCFKAFKCKGALNRHFRVSVLCSRHNRTNLQVHLKTHSDSRPYPCTVCFKAFKCKGALNRHFRVSVLCSRHNRTNLQVHLKTHSDSRPYPCTVCFKAFKCKGALNIHFRVSVLCSRHNRTNLQVHLKTHSDSRPYPCTVCFKAFKCKGALNRHFRVSVLCSRHNRTNLQVHLKTHSDSRPYPCTVCFKAFKCKGALNRHFRVSVLCSRHNRTNLQVHLKTHSDSRPYPCTVCFKAFKCKGALNIHFRVSVLCSRHNRTNLQVHLKTHSDSRPYPCTVCFKAFKCKGALNRHFRVSVLCSRHNRTNLQVHLKTHSDSRPYPCTVCFKAFKCKGALNRHFRVSVLCSRHNRTNLQVHLKTHSDSRPYPCTVCFKAFKCKGALNRHFRVSVLCSRHNRTKLQVHLKTHSDSRPYPCTVCFKAFKCKGALNRHFRVSVLCSRHNRTNLQVHLKTHSDSRPYPCTVCFKAFKCKGALNRHFRVSVLCSRHNRTNLQVHLKTHSDSRPYPCTVCFKAFKCKGALNIHFRVSVLCSRHNRTNLQVHLKTHSDSRPYPCTVCFKAFKCKGALNRHFRVSVLCSRHNRTNLQVHLKTHSDSRPYPCTVCFKAFKCKGALNRHFRVSVLCSRHNRTNLQVHLKTHSDSRPYPCTVCFKAFKCKGALNRHFRVSVLCSRHNRTKLQVHLKTHSDSRPYPCTVCFKAFKCKGALNRHFRVSVLCSRHNRTNLQVHLKTHSDSRPYPCTVCFKAFKCKGALNRHFRVSVLCSRHNRTNLQVHLKTHSDSRPYPCTVCFKAFKCKGALNRHFRVSVLCSRHNRTNLQVHLKTHSDSRPYPCTVCFKAFKCKGALNRHFRVSVLCSRHNRTNLQVHLKTHSDSRPYPCTVCFKAFKCKGALNRHFRSHTGQKPYECEVCGKAFTQSNSRKLHVTTVHLKQPAPYVSRRRNKSADKEPSHQQFIY

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-