Basic Information

Gene Symbol
-
Assembly
GCA_036419045.1
Location
CM071634.1:10651856-10660587[+]
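The Location value packs a sequence accession, a coordinate range, and a strand into one string. A minimal sketch of splitting it apart follows. The names `Location`, `parse_location`, and `span_length` are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str   # sequence accession, e.g. "CM071634.1"
    start: int   # first coordinate as written in the record
    end: int     # second coordinate as written in the record
    strand: str  # "+" or "-"


# Matches strings shaped like "ACCESSION:START-END[STRAND]".
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into accession, start, end, and strand."""
    m = _LOCATION_RE.match(text.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(m["seqid"], int(m["start"]), int(m["end"]), m["strand"])


def span_length(loc: Location) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("CM071634.1:10651856-10660587[+]")
print(loc, span_length(loc))
```

Under the inclusive-coordinate assumption, the span for this record is 8732 bases on the forward strand.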

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and the numbers (with ranges in brackets) indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
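The pattern in the description can be written as a regular expression. The sketch below is one possible reading, not the method used to produce this record (the hits listed under Hmmscan Out come from a profile HMM). It assumes that the positions marked # accept any residue, since the description does not say which residues are allowed there. The names `C2H2_PATTERN` and `find_c2h2` are hypothetical.

```python
import re

# #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], piece by piece.
# ASSUMPTION: '#' positions are matched as "any residue".
C2H2_PATTERN = re.compile(
    r"."        # '#'  fold-stabilising position
    r"."        # X
    r"C"        # first zinc-coordinating cysteine
    r".{1,5}"   # X(1-5)
    r"C"        # second zinc-coordinating cysteine
    r".{3}"     # X3
    r"."        # '#'
    r".{5}"     # X5
    r"."        # '#'
    r".{2}"     # X2
    r"H"        # first zinc-coordinating histidine
    r".{3,6}"   # X(3-6)
    r"[HC]"     # final position: His or Cys
)


def find_c2h2(protein: str):
    """Return (start, end, text) for non-overlapping matches, 1-based inclusive."""
    return [
        (m.start() + 1, m.end(), m.group())
        for m in C2H2_PATTERN.finditer(protein.upper())
    ]


# One copy of the repeat unit seen in this record's protein sequence,
# with a few flanking residues added for illustration.
print(find_c2h2("MA" + "HKCPHCARAFSDASNLNKHRKVSH" + "TGERP"))
```

A regular expression of this kind is far more permissive than the PF00096 profile and reports only non-overlapping matches, so it is best treated as a quick sanity check alongside the HMM results.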
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 52 4.3e-06 0.00071 21.3 2.8 1 23 90 113 90 113 0.97
2 52 4e-05 0.0066 18.3 2.7 1 21 119 139 119 140 0.95
3 52 4.3e-06 0.00071 21.3 2.8 1 23 151 174 151 174 0.97
4 52 4.3e-06 0.00071 21.3 2.8 1 23 180 203 180 203 0.97
5 52 4.3e-06 0.00071 21.3 2.8 1 23 209 232 209 232 0.97
6 52 4e-05 0.0066 18.3 2.7 1 21 238 258 238 259 0.95
7 52 4.3e-06 0.00071 21.3 2.8 1 23 270 293 270 293 0.97
8 52 4.3e-06 0.00071 21.3 2.8 1 23 299 322 299 322 0.97
9 52 4.3e-06 0.00071 21.3 2.8 1 23 328 351 328 351 0.97
10 52 0.0025 0.4 12.7 0.1 1 17 357 373 357 374 0.94
11 52 0.086 14 7.8 0.1 9 23 394 409 393 409 0.93
12 52 4.3e-06 0.00071 21.3 2.8 1 23 415 438 415 438 0.97
13 52 4.3e-06 0.00071 21.3 2.8 1 23 444 467 444 467 0.97
14 52 4.3e-06 0.00071 21.3 2.8 1 23 473 496 473 496 0.97
15 52 4.3e-06 0.00071 21.3 2.8 1 23 502 525 502 525 0.97
16 52 4.3e-06 0.00071 21.3 2.8 1 23 531 554 531 554 0.97
17 52 4e-05 0.0066 18.3 2.7 1 21 560 580 560 581 0.95
18 52 4.3e-06 0.00071 21.3 2.8 1 23 592 615 592 615 0.97
19 52 4.3e-06 0.00071 21.3 2.8 1 23 621 644 621 644 0.97
20 52 4.3e-06 0.00071 21.3 2.8 1 23 650 673 650 673 0.97
21 52 4.3e-06 0.00071 21.3 2.8 1 23 679 702 679 702 0.97
22 52 4.3e-06 0.00071 21.3 2.8 1 23 708 731 708 731 0.97
23 52 4.3e-06 0.00071 21.3 2.8 1 23 737 760 737 760 0.97
24 52 4.3e-06 0.00071 21.3 2.8 1 23 766 789 766 789 0.97
25 52 4.3e-06 0.00071 21.3 2.8 1 23 795 818 795 818 0.97
26 52 3e-05 0.0048 18.7 2.2 1 21 824 844 824 846 0.94
27 52 4.3e-06 0.00071 21.3 2.8 1 23 967 990 967 990 0.97
28 52 4.3e-06 0.00071 21.3 2.8 1 23 996 1019 996 1019 0.97
29 52 4.3e-06 0.00071 21.3 2.8 1 23 1025 1048 1025 1048 0.97
30 52 4.3e-06 0.00071 21.3 2.8 1 23 1054 1077 1054 1077 0.97
31 52 4.3e-06 0.00071 21.3 2.8 1 23 1083 1106 1083 1106 0.97
32 52 4.3e-06 0.00071 21.3 2.8 1 23 1112 1135 1112 1135 0.97
33 52 4.3e-06 0.00071 21.3 2.8 1 23 1141 1164 1141 1164 0.97
34 52 4.3e-06 0.00071 21.3 2.8 1 23 1170 1193 1170 1193 0.97
35 52 4.3e-06 0.00071 21.3 2.8 1 23 1199 1222 1199 1222 0.97
36 52 4.3e-06 0.00071 21.3 2.8 1 23 1228 1251 1228 1251 0.97
37 52 4.3e-06 0.00071 21.3 2.8 1 23 1257 1280 1257 1280 0.97
38 52 4.3e-06 0.00071 21.3 2.8 1 23 1286 1309 1286 1309 0.97
39 52 4.3e-06 0.00071 21.3 2.8 1 23 1315 1338 1315 1338 0.97
40 52 4.3e-06 0.00071 21.3 2.8 1 23 1344 1367 1344 1367 0.97
41 52 4.3e-06 0.00071 21.3 2.8 1 23 1373 1396 1373 1396 0.97
42 52 4.3e-06 0.00071 21.3 2.8 1 23 1402 1425 1402 1425 0.97
43 52 4.3e-06 0.00071 21.3 2.8 1 23 1431 1454 1431 1454 0.97
44 52 4.3e-06 0.00071 21.3 2.8 1 23 1460 1483 1460 1483 0.97
45 52 4e-05 0.0066 18.3 2.7 1 21 1489 1509 1489 1510 0.95
46 52 4.3e-06 0.00071 21.3 2.8 1 23 1521 1544 1521 1544 0.97
47 52 4.3e-06 0.00071 21.3 2.8 1 23 1550 1573 1550 1573 0.97
48 52 4.3e-06 0.00071 21.3 2.8 1 23 1579 1602 1579 1602 0.97
49 52 4.3e-06 0.00071 21.3 2.8 1 23 1608 1631 1608 1631 0.97
50 52 4.3e-06 0.00071 21.3 2.8 1 23 1637 1660 1637 1660 0.97
51 52 4.3e-06 0.00071 21.3 2.8 1 23 1666 1689 1666 1689 0.97
52 52 3.5e-05 0.0058 18.4 4.8 1 23 1695 1718 1695 1718 0.96
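Each row of the table above holds thirteen whitespace-separated values, in the order given by the header line. A minimal sketch of loading such rows into typed records follows. The class `DomainHit`, its field names, and the helper `parse_row` are hypothetical and chosen only for illustration. The 0.01 cutoff in the usage lines is likewise an arbitrary example, not a threshold taken from this record.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    index: int       # '#'  : domain number
    total: int       # 'of' : total number of domains reported
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float     # bit score
    bias: float
    hmm_from: int    # start of the match on the profile HMM
    hmm_to: int      # end of the match on the profile HMM
    ali_from: int    # start of the alignment on the protein
    ali_to: int      # end of the alignment on the protein
    env_from: int    # start of the domain envelope on the protein
    env_to: int      # end of the domain envelope on the protein
    acc: float       # mean posterior probability of aligned residues


def parse_row(line: str) -> DomainHit:
    """Parse one whitespace-separated row of the domain table."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


# Three rows copied from the table above.
rows = [
    "1 52 4.3e-06 0.00071 21.3 2.8 1 23 90 113 90 113 0.97",
    "10 52 0.0025 0.4 12.7 0.1 1 17 357 373 357 374 0.94",
    "11 52 0.086 14 7.8 0.1 9 23 394 409 393 409 0.93",
]
hits = [parse_row(r) for r in rows]
confident = [h.index for h in hits if h.i_evalue < 0.01]  # example cutoff only
print(confident)
```

Filtering on the independent E-value separates the strongly supported fingers from partial matches such as domains 10 and 11, which cover only part of the 23-position model.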

Sequence Information

Coding Sequence
ATGAGAAAGCGAACGTGTCCGAGGACAGGTGGTGGACGGGCAGACGCCGGCGGGCCAGGCGCAGGCGACGCCGGCGATGCAGGCGGCGGGCGCGCGGCACTCGTGCGGCGAGTGCGGCCGCGGGTTCGGCGCGCGCGCGGCGCTGCGGCGGCACGCGCGGCGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCTAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCACACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCTAACCTCAACAAGCACCGCAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCTAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCTAACCTCAACACAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGCCTTCTCCGACGCGTCTAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCTAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCTAACCTCAACAAGCACCGCAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAA
GCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCTAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACC
GCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCACACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGGTGAGCCACACGGGCGAGCGGCCGCACAAGTGCCCGCACTGCGCGCGGGCCTTCTCCGACGCGTCCAACCTCAACAAGCACCGCAAGcACGTGCACAAGCAAGTTAGCCTACTATCGACTGGGGCAATGGAAAATGTCGTGGGCACTTCTCTGGGCGCAGTACTGGCGCCCCCCATAAGCTCTGTTGTTGGAACTACAATAAGTTCTATGGTGGGCACAGTGGTGGGAACTATGGGCTCAGTGGCAAACATGGTGGAAACTCCGATGGGCACTTTAGTGAGCGCTCCTGTGGGCGTGATGAGCAGTCCCATGGGTGTGATAGGCAGTCCTATAGGAGTGGTGGGCGGAACCATGGGTGTGATGGGCAGCCCCGTGGGGCTGTTGGACGCGGCCGCGGGGCCCGTGGTGGAGGTGACGGACGAAGAAGGCAACCCCTTGCATTTCACAATGGAGGACGGCACTCAACTCGCAATCACCCTGGCCGGCGGCAAGTCGTTACAGGAGGTGGTCAACACAGAAGCAATAGTCGAGCAGTTGGAAATGCCCAAAGAGGAGGAGCCCACTGCGCAAGTCGGTCATTTTTTCACTATTGGCAGTTTAAGTAAAATGTGA
Protein Sequence
MRKRTCPRTGGGRADAGGPGAGDAGDAGGGRAALVRRVRPRVRRARGAAAARAAPHGRAAAQVPALRAGLLRRVQPQQAPQGEPHGRAAHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNTAPQGEPHGRAAAQVPALRAAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKPHGRAAAQVPALRAGLLRRVQPQQAPQGEPHGRAAAQVPALRAGLLRRVQPQQAPQGEPHGRAAAQVPALRAGLLRRVQPQQAPQGEPHGRAAAQVPALRAGLLRRVQPQQAPQGEPHGRAPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKVSHTGERPHKCPHCARAFSDASNLNKHRKHVHKQVSLLSTGAMENVVGTSLGAVLAPPISSVVGTTISSMVGTVVGTMGSVANMVETPMGTLVSAPVGVMSSPMGVIGSPIGVVGGTMGVMGSPVGLLDAAAGPVVEVTDEEGNPLHFTMEDGTQLAITLAGGKSLQEVVNTEAIVEQLEMPKEEEPTAQVGHFFTIGSLSKM

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-