Basic Information

Gene Symbol
ZIPIC
Assembly
GCA_964007245.1
Location
OZ023282.1:25133104-25151909[+]
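
The location value packs the sequence accession, the coordinates, and the strand into a single token. The sketch below splits it into parts, assuming the layout `accession:start-end[strand]` and 1-based inclusive coordinates. Both are assumptions, since the record does not state its coordinate convention. The names `Location`, `parse_location`, and `span_length` are hypothetical helpers, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str


# Assumed layout: accession:start-end[strand], e.g. OZ023282.1:25133104-25151909[+]
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:\s]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into accession, start, end, and strand."""
    m = _LOCATION_RE.match(text.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(m["seqid"], int(m["start"]), int(m["end"]), m["strand"])


def span_length(loc: Location) -> int:
    """Genomic span in bases, assuming 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("OZ023282.1:25133104-25151909[+]")
print(loc.seqid, loc.strand, span_length(loc))
```

Under these assumptions the locus spans 18806 bases on the forward strand, which is the genomic span and not the length of the coding sequence.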

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. Its two conserved cysteines and two conserved histidines coordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
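
The pattern in the description can be written as a regular expression. The sketch below is an illustration, not the method used to produce this record (the record relies on the Pfam HMM). It treats each `#` position as any residue, because the description does not list which residues are allowed there, so it will accept more sequences than a real C2H2 profile would. The function name `find_c2h2_candidates` is hypothetical.

```python
import re
from typing import List, Tuple

# One regex fragment per element of #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C].
# Assumption: '#' positions are matched as any residue ('.').
_PARTS = [
    ".",       # #
    ".",       # X
    "C",       # first conserved Cys
    ".{1,5}",  # X(1-5)
    "C",       # second conserved Cys
    ".{3}",    # X3
    ".",       # #
    ".{5}",    # X5
    ".",       # #
    ".{2}",    # X2
    "H",       # first conserved His
    ".{3,6}",  # X(3-6)
    "[HC]",    # final His or Cys
]
C2H2_RE = re.compile("".join(_PARTS))


def find_c2h2_candidates(protein: str) -> List[Tuple[int, str]]:
    """Return (1-based start, matched text) for non-overlapping pattern hits."""
    return [(m.start() + 1, m.group()) for m in C2H2_RE.finditer(protein.upper())]


# Fragment copied from the protein sequence in this record.
fragment = "YICEVCGAHYKHKRACDIHIALH"
print(find_c2h2_candidates(fragment))
```

Because `finditer` reports non-overlapping hits only, closely spaced fingers may be under-counted; a profile search such as the hmmscan run in this record is the more reliable way to count domains.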
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 57 0.12 11 7.9 0.8 1 23 274 296 274 296 0.96
2 57 3.3 3e+02 3.3 2.1 1 23 303 325 303 325 0.86
3 57 0.00043 0.04 15.6 0.4 2 23 332 353 331 353 0.96
4 57 0.00015 0.014 17.0 3.5 1 23 359 381 359 381 0.98
5 57 0.0037 0.34 12.6 2.0 1 23 387 409 387 409 0.96
6 57 0.00019 0.018 16.7 0.1 1 21 415 435 415 436 0.94
7 57 0.21 19 7.1 0.5 2 21 439 458 438 467 0.95
8 57 0.031 2.8 9.8 1.0 2 21 470 489 469 498 0.95
9 57 0.031 2.8 9.8 1.0 2 21 501 520 500 529 0.95
10 57 0.0026 0.24 13.1 1.5 2 23 532 553 531 553 0.97
11 57 0.00019 0.018 16.7 0.1 1 21 559 579 559 580 0.94
12 57 0.019 1.7 10.4 1.1 2 21 583 602 582 611 0.95
13 57 0.031 2.8 9.8 1.0 2 21 614 633 613 642 0.95
14 57 0.0026 0.24 13.1 1.5 2 23 645 666 644 666 0.97
15 57 0.00019 0.018 16.7 0.1 1 21 672 692 672 693 0.94
16 57 0.031 2.8 9.8 1.0 2 21 696 715 695 724 0.95
17 57 0.031 2.8 9.8 1.0 2 21 727 746 726 755 0.95
18 57 0.031 2.8 9.8 1.0 2 21 758 777 757 786 0.95
19 57 0.36 33 6.4 0.5 2 21 789 808 788 817 0.95
20 57 0.031 2.8 9.8 1.0 2 21 820 839 819 848 0.95
21 57 0.031 2.8 9.8 1.0 2 21 851 870 850 879 0.95
22 57 0.0017 0.15 13.7 2.2 2 23 882 903 881 903 0.97
23 57 0.00019 0.018 16.7 0.1 1 21 909 929 909 930 0.94
24 57 0.0026 0.24 13.1 1.5 2 23 933 954 932 954 0.97
25 57 0.00019 0.018 16.7 0.1 1 21 960 980 960 981 0.94
26 57 0.019 1.7 10.4 1.1 2 21 984 1003 983 1012 0.95
27 57 0.031 2.8 9.8 1.0 2 21 1015 1034 1014 1043 0.95
28 57 0.019 1.7 10.4 1.7 2 21 1046 1065 1045 1074 0.95
29 57 0.031 2.8 9.8 1.0 2 21 1077 1096 1076 1105 0.95
30 57 0.019 1.7 10.4 1.7 2 21 1108 1127 1107 1136 0.95
31 57 0.031 2.8 9.8 1.0 2 21 1139 1158 1138 1167 0.95
32 57 0.0026 0.24 13.1 1.5 2 23 1170 1191 1169 1191 0.97
33 57 0.00019 0.018 16.7 0.1 1 21 1197 1217 1197 1218 0.94
34 57 0.031 2.8 9.8 1.0 2 21 1221 1240 1220 1249 0.95
35 57 0.0026 0.24 13.1 1.5 2 23 1252 1273 1251 1273 0.97
36 57 0.00019 0.018 16.7 0.1 1 21 1279 1299 1279 1300 0.94
37 57 0.0026 0.24 13.1 1.5 2 23 1303 1324 1302 1324 0.97
38 57 0.00019 0.018 16.7 0.1 1 21 1330 1350 1330 1351 0.94
39 57 0.031 2.8 9.8 1.0 2 21 1354 1373 1353 1382 0.95
40 57 0.035 3.2 9.6 0.4 2 21 1385 1404 1384 1413 0.95
41 57 0.031 2.8 9.8 1.0 2 21 1416 1435 1415 1444 0.95
42 57 0.031 2.8 9.8 1.0 2 21 1447 1466 1446 1475 0.95
43 57 0.0026 0.24 13.1 1.5 2 23 1478 1499 1477 1499 0.97
44 57 0.00019 0.018 16.7 0.1 1 21 1505 1525 1505 1526 0.94
45 57 0.0026 0.24 13.1 1.5 2 23 1529 1550 1528 1550 0.97
46 57 0.00019 0.018 16.7 0.1 1 21 1556 1576 1556 1577 0.94
47 57 0.031 2.8 9.8 1.0 2 21 1580 1599 1579 1608 0.95
48 57 0.035 3.2 9.6 0.4 2 21 1611 1630 1610 1639 0.95
49 57 0.031 2.8 9.8 1.0 2 21 1642 1661 1641 1670 0.95
50 57 0.031 2.8 9.8 1.0 2 21 1673 1692 1672 1701 0.95
51 57 0.0026 0.24 13.1 1.5 2 23 1704 1725 1703 1725 0.97
52 57 0.00019 0.018 16.7 0.1 1 21 1731 1751 1731 1752 0.94
53 57 0.0026 0.24 13.1 1.5 2 23 1755 1776 1754 1776 0.97
54 57 0.00019 0.018 16.7 0.1 1 21 1782 1802 1782 1803 0.94
55 57 0.0026 0.24 13.1 1.5 2 23 1806 1827 1805 1827 0.97
56 57 1.8e-05 0.0016 19.9 0.7 1 23 1833 1855 1833 1855 0.97
57 57 0.00065 0.059 15.0 0.5 1 23 1876 1899 1876 1899 0.92
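
The domain rows above are whitespace-separated with 13 values each, so they can be loaded into records and filtered. The sketch below assumes that column order (index, total, c-Evalue, i-Evalue, score, bias, then HMM, alignment, and envelope coordinates, then accuracy). The field names, the function `parse_domain_rows`, and the i-Evalue cutoff of 0.01 are illustrative choices, not values taken from this record.

```python
from typing import Dict, List, Union

# Assumed column order for each domain row (13 whitespace-separated values).
FIELDS = [
    "index", "of", "c_evalue", "i_evalue", "score", "bias",
    "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to", "acc",
]
INT_FIELDS = {
    "index", "of", "hmm_from", "hmm_to",
    "ali_from", "ali_to", "env_from", "env_to",
}

Row = Dict[str, Union[int, float]]


def parse_domain_rows(text: str) -> List[Row]:
    """Parse domain rows, skipping the header and any malformed lines."""
    rows: List[Row] = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) != len(FIELDS) or not parts[0].isdigit():
            continue  # header line or something that is not a data row
        rows.append({
            name: int(value) if name in INT_FIELDS else float(value)
            for name, value in zip(FIELDS, parts)
        })
    return rows


# Two rows copied from the table above.
sample = """\
1 57 0.12 11 7.9 0.8 1 23 274 296 274 296 0.96
56 57 1.8e-05 0.0016 19.9 0.7 1 23 1833 1855 1833 1855 0.97
"""
rows = parse_domain_rows(sample)
confident = [r for r in rows if r["i_evalue"] <= 0.01]  # illustrative cutoff
print([(r["index"], r["ali_from"], r["ali_to"]) for r in confident])
```

Filtering on the independent E-value is one common way to separate stronger hits from marginal ones in a long tandem array like this; the appropriate threshold depends on the analysis.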

Sequence Information

Coding Sequence
ATGTCTCAATCGAGTGCAATGAACTCCACGGAACCCACGGAGGAGAAGCAAACAGTGCAGGAAGGTGTCGTGAAGCTGGAGGTGACGGTGAAGGAAGAGCCGCTGGAGTGCGTGGTGTGCGGCACCAAGACCCGCGAGTGCCTCGACATCTACACGACGCGGACGCAGTCCGGTGCCTCGCTGCATCACTTCCTCGCCAAATACACACAAGTCGATCTCGCATTTACCGAATCCTGCTCGAAACAATCATGCAAGAGCTGTTACGATTTAATCACCGTGCTGGAACGAGCAGAGCTCGAATATGTGAAGCTAAAAGAGGCATTCGAAGCCATCATCAGCAAGAACCCTCTGTTCGAGTCGTCGGTGGTGCCGGCGCCGATGGAGATGTCGACCGTCAAACGGGAGATAGAGCCTCCCGACGACTCGGAGGACGAGCCGCTGGCGAGACGGAAATCTACCAGATCTATAAAACCagttaagaaaaaaaagaaattagggAGACCTAAGCGAAAGAAAAGCGTCGAAAAAAACAGGTGGGAGTGTGAGGAGTGCGGCGCGCGCGGGGAGGAGGGCGGCGCTGTCGCCCTCGCCGGACACAAGATCACCATGCATGCAGAGGTTCCTCCCGTCAAGATAGAGGACGGGTCCCAAAGTCCATACAACAACGGCAATGATGAGCTATTACAGATAACATTTGGGCATTTccatgatgatgacgatgacagCAATTTCAACCCCGATGAGTGCAACGACTCTAACGATGGAAACAAAAAGAAGGCAAAGAGAAAACCTATAGAACGGAAGAAACCTAAAGTGATGCACTCGTGTGACCAGTGTAGTGCTAGATATTCTTCATTAGCGCGGCTGGAGGCGCACCGGGCGAAGCACGACGGGTCGAAGCCGCCCTACATCTGCGAGGTGTGCGGCGCGCACTACAAGCACAAGCGCGCCTGTGACATACACATCGCGCTGCACAAAGGTATAAGCGACTGGAAGTGTGAAGAGTGCAACAAACTGTTCCCCTCCAAAAACGCGCTGCAGAGGCATAACAACATTCACACCGGAAAACTCAATTATCAGTGCGACCTGTGCGGCAAGTCGTTCATCCACACGTCGTCGTTCAAGATGCACAAGCTGTCGCACTCTGGCGTGAAGCCGCACGCGTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGCATCCACTCCGGCGAGAAGCGGCACGAGTGCGGCGCCTGCGGCAAGCGCTTCTCCGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGCCGCGCTCGCACCTCAAGCGGCACGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGCATCCACTCCGGCGAGAAGCGGCACGAGTGCGGCGCCTGCGGCAAGCGCTTCTCCGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGTGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGCATCCACTC
CGGCGAGAAGCGGCACGAGTGCGGCGCCTGCGGCAAGCGCTTCTCCGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCGCCTCAAGCGGCACGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCTCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGCATCCACTCCGGCGAGAAGCGGCACGAGTGCGGCGCCTGCGGCAAGCGCTTCTCCGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGCATCCACTCCGGCGAGAAGCGGCACGAGTGCGGCGCCTGCGGCAAGCGCTTCTCCGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGTGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCTCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCTCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGCATCCACTCCGGCGAGAAGCGGCACGAGTGCGGCGCCTGCGGCAAGCGCTTCTCCGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGCATCCACTCCGGCGAGAAGCGGCACGAGTGCGGCGCCTGCGGCAAGCGCTTCTCCGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGCATCCACTCCGGCGAGAAGCGGCACGAGTGCGGCG
CCTGCGGCAAGCGCTTCTCCGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCAACTCAAGCGGCACGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGCATCCACTCCGGCGAGAAGCGGCACGAGTGCGGCGCCTGCGGCAAGCGCTTCTCCGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGCATCCACTCCGGCGAGAAGCGGCACGAGTGCGGCGCCTGCGGCAAGCGCTTCTCCGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCAACTCAAGCGGCACGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGCATCCACTCCGGCGAGAAGCGGCACGAGTGCGGCGCCTGCGGCAAGCGCTTCTCCGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGCATCCACTCCGGCGAGAAGCGGCACGAGTGCGGCGCCTGCGGCAAGCGCTTCTCCGAGCGATACAACCTGGCCGCGCACATGAAGGTAACTACAACCTGCCGCGTGTGCGGGCTCGCGCTCATGACGCGCTCGCACCTCAAGCGGCACGAGCGCATCCACTCCGGCGAGAAGCGGCACGAGTGCGGCGCCTGCGGCAAGCGCTTCTCCGAGCGATACAACCTGGCCGCGCACATGAAGTCGCACGAgccggcgggcggcgcggcggctgCTACAGCGGCGAGCGCGGAGCGGCGGCGGCCCATGTTCCGCTGCCAGTTCTGCCCGGAGCGGTTCGAGCGGCGCTACATGCTGGAGCGGCACACGGCGGCGGCGCACGGCCGCACGCTGGAGCGGCCGCCGCCCACGCCGCGCAACACCATGAGCAAGCTGCTGAAGCAGCGCGCGCAGCGgcccgcccccgcccccgcccccaGCCCGCCCGCCACGCCCGCCAGCCCGCCGGCCCCGCccccgcccgccgccgcgccctcCAAGCTGCTCGCGCACCTCACGTCGTCGTCGTCGATAATAACGACGGGCTCGTGGTCGGGCGCGTACGCCGCCGAGTTCGGCCTGCGACCCGACTTCCCGCACTGA
Protein Sequence
MSQSSAMNSTEPTEEKQTVQEGVVKLEVTVKEEPLECVVCGTKTRECLDIYTTRTQSGASLHHFLAKYTQVDLAFTESCSKQSCKSCYDLITVLERAELEYVKLKEAFEAIISKNPLFESSVVPAPMEMSTVKREIEPPDDSEDEPLARRKSTRSIKPVKKKKKLGRPKRKKSVEKNRWECEECGARGEEGGAVALAGHKITMHAEVPPVKIEDGSQSPYNNGNDELLQITFGHFHDDDDDSNFNPDECNDSNDGNKKKAKRKPIERKKPKVMHSCDQCSARYSSLARLEAHRAKHDGSKPPYICEVCGAHYKHKRACDIHIALHKGISDWKCEECNKLFPSKNALQRHNNIHTGKLNYQCDLCGKSFIHTSSFKMHKLSHSGVKPHACRVCGLALMTRSHLKRHERIHSGEKRHECGACGKRFSERYNLAAHMKVTTTCRVCGLALMPRSHLKRHERYNLAAHMKVTTTCRVCGLALMTRSHLKRHERYNLAAHMKVTTTCRVCGLALMTRSHLKRHERYNLAAHMKVTTTCRVCGLALMTRSHLKRHERIHSGEKRHECGACGKRFSERYNLAAHMKVTTTCRVCGLVLMTRSHLKRHERYNLAAHMKVTTTCRVCGLALMTRSHLKRHERYNLAAHMKVTTTCRVCGLALMTRSHLKRHERIHSGEKRHECGACGKRFSERYNLAAHMKVTTTCRVCGLALMTRSHLKRHERYNLAAHMKVTTTCRVCGLALMTRSHLKRHERYNLAAHMKVTTTCRVCGLALMTRSHLKRHERYNLAAHMKVTTTCRVCGLALMTRSRLKRHERYNLAAHMKVTTTCRVCGLALMTRSHLKRHERYNLAAHMKVTTTCRVCGLALMTRSHLKRHERYNLAAHMKVTTTCRVCGLSLMTRSHLKRHERIHSGEKRHECGACGKRFSERYNLAAHMKVTTTCRVCGLALMTRSHLKRHERIHSGEKRHECGACGKRFSERYNLAAHMKVTTTCRVCGLVLMTRSHLKRHERYNLAAHMKVTTTCRVCGLALMTRSHLKRHERYNLAAHMKVTTTCRVCGLSLMTRSHLKRHERYNLAAHMKVTTTCRVCGLALMTRSHLKRHERYNLAAHMKVTTTCRVCGLSLMTRSHLKRHERYNLAAHMKVTTTCRVCGLALMTRSHLKRHERYNLAAHMKVTTTCRVCGLALMTRSHLKRHERIHSGEKRHECGACGKRFSERYNLAAHMKVTTTCRVCGLALMTRSHLKRHERYNLAAHMKVTTTCRVCGLALMTRSHLKRHERIHSGEKRHECGACGKRFSERYNLAAHMKVTTTCRVCGLALMTRSHLKRHERIHSGEKRHECGACGKRFSERYNLAAHMKVTTTCRVCGLALMTRSHLKRHERYNLAAHMKVTTTCRVCGLALMTRSQLKRHERYNLAAHMKVTTTCRVCGLALMTRSHLKRHERYNLAAHMKVTTTCRVCGLALMTRSHLKRHERYNLAAHMKVTTTCRVCGLALMTRSHLKRHERIHSGEKRHECGACGKRFSERYNLAAHMKVTTTCRVCGLALMTRSHLKRHERIHSGEKRHECGACGKRFSERYNLAAHMKVTTTCRVCGLALMTRSHLKRHERYNLAAHMKVTTTCRVCGLALMTRSQLKRHERYNLAAHMKVTTTCRVCGLALMTRSHLKRHERYNLAAHMKVTTTCRVCGLALMTRSHLKRHERYNLAAHMKVTTTCRVCGLALMTRSHLKRHERIHSGEKRHECGACGKRFSERYNLAAHMKVTTTCRVCGLALMTRSHLKRHERIHSGEKRHECGACGKRFSERYNLAAHMKVTTTCRVCGLALMTRSHLKRHERIHSGEKRHECGACGKRFSERYNLAAHMKSHEPAGGAAAATAASAERRRPMFRCQFCPERFERRYMLERHTAAAHGRTLERPPPTPRNTMSKLLKQRAQRPAPAPAPSPPATPASPPAPPPPAAAPSKLLAHLTSSSSIITTGSWSGAYAAEFGLRPDFPH
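
The relationship between the coding sequence and the protein sequence can be checked by translating codons. The sketch below assumes the standard genetic code and treats lowercase (soft-masked) bases the same as uppercase ones. The function name `translate` is a hypothetical helper, and the two inputs are short excerpts from the start and end of the coding sequence in this record, not the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order at each position.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; unknown codons become 'X'."""
    seq = "".join(cds.split()).upper()  # drop line breaks, ignore soft-masking
    usable = len(seq) - len(seq) % 3    # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


print(translate("ATGTCTCAATCGAGT"))  # first five codons of the coding sequence
print(translate("CCGCACTGA"))        # last three codons, ending in a stop
```

The first excerpt translates to the opening residues of the protein sequence and the second ends in a stop codon (`*`), which the protein sequence omits.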

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-