Basic Information

Gene Symbol
ZIPIC
Assembly
GCA_963978885.1
Location
OZ022266.1:65545573-65550589[+]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 50 2.4e-05 0.0064 19.0 0.5 2 23 98 119 98 119 0.97
2 50 5e-05 0.013 18.0 0.1 1 23 125 147 125 147 0.98
3 50 3.5e-05 0.0094 18.5 2.7 1 23 151 173 151 173 0.99
4 50 1.2e-05 0.0031 20.0 4.7 1 23 179 201 179 201 0.99
5 50 0.0093 2.5 10.9 1.0 1 23 207 229 207 229 0.98
6 50 0.00021 0.057 16.0 1.1 1 21 237 257 237 258 0.97
7 50 1.9 4.9e+02 3.6 0.2 1 21 270 291 270 294 0.88
8 50 0.00057 0.15 14.7 0.4 1 23 301 323 301 323 0.96
9 50 1.1e-05 0.0029 20.1 0.2 1 23 329 351 329 351 0.98
10 50 0.0035 0.93 12.2 3.3 1 23 361 383 361 383 0.99
11 50 0.0001 0.027 17.1 1.2 1 23 389 411 389 411 0.98
12 50 1.1 2.8e+02 4.4 0.2 1 21 431 452 431 455 0.89
13 50 4.7e-06 0.0012 21.3 1.7 1 23 462 484 462 484 0.96
14 50 8.9e-05 0.024 17.2 0.3 1 23 490 512 490 512 0.98
15 50 0.00039 0.1 15.2 5.5 1 23 525 547 525 547 0.97
16 50 5.2e-06 0.0014 21.1 0.4 1 23 553 575 553 575 0.99
17 50 0.62 1.7e+02 5.1 3.1 2 23 582 603 581 603 0.97
18 50 3.6 9.5e+02 2.7 0.9 1 19 610 628 610 630 0.80
19 50 0.22 57 6.6 0.1 1 23 643 667 643 667 0.94
20 50 0.00057 0.15 14.7 0.4 1 23 674 696 674 696 0.96
21 50 1.1e-05 0.0029 20.1 0.2 1 23 702 724 702 724 0.98
22 50 0.0035 0.93 12.2 3.3 1 23 734 756 734 756 0.99
23 50 0.0001 0.027 17.1 1.2 1 23 762 784 762 784 0.98
24 50 1.1 2.8e+02 4.4 0.2 1 21 804 825 804 828 0.89
25 50 4.7e-06 0.0012 21.3 1.7 1 23 835 857 835 857 0.96
26 50 8.9e-05 0.024 17.2 0.3 1 23 863 885 863 885 0.98
27 50 0.00038 0.1 15.3 3.9 1 23 895 917 895 917 0.96
28 50 5.2e-06 0.0014 21.1 0.4 1 23 923 945 923 945 0.99
29 50 0.34 91 5.9 2.8 1 23 951 973 951 973 0.98
30 50 2.2 5.8e+02 3.4 1.9 1 20 980 999 980 1000 0.88
31 50 3.2 8.5e+02 2.9 0.1 1 19 1013 1032 1013 1034 0.93
32 50 0.049 13 8.6 3.8 1 23 1044 1066 1044 1066 0.96
33 50 0.00046 0.12 15.0 0.7 1 23 1072 1094 1072 1094 0.96
34 50 0.00015 0.04 16.5 2.5 1 23 1107 1129 1107 1129 0.96
35 50 2.2e-06 0.0006 22.3 1.5 1 23 1135 1157 1135 1157 0.99
36 50 0.61 1.6e+02 5.2 3.9 1 23 1163 1185 1163 1185 0.98
37 50 3.6 9.5e+02 2.7 0.9 1 19 1192 1210 1192 1212 0.80
38 50 1.3 3.4e+02 4.1 0.3 1 21 1225 1246 1225 1249 0.89
39 50 4.7e-06 0.0012 21.3 1.7 1 23 1256 1278 1256 1278 0.96
40 50 0.1 28 7.6 3.3 1 23 1284 1306 1284 1306 0.98
41 50 0.0026 0.7 12.6 2.3 1 23 1319 1341 1319 1341 0.96
42 50 4.8e-06 0.0013 21.2 0.4 1 23 1347 1369 1347 1369 0.99
43 50 0.61 1.6e+02 5.2 3.9 1 23 1375 1397 1375 1397 0.98
44 50 5 1.3e+03 2.3 1.0 1 19 1404 1422 1404 1424 0.79
45 50 0.13 34 7.3 0.1 1 21 1437 1458 1437 1461 0.90
46 50 0.00016 0.043 16.4 0.8 1 23 1468 1490 1468 1490 0.96
47 50 0.003 0.8 12.4 0.7 1 23 1496 1518 1496 1518 0.98
48 50 2.5e-05 0.0068 18.9 3.2 1 23 1523 1545 1523 1545 0.99
49 50 5.9 1.6e+03 2.0 2.4 1 21 1552 1572 1552 1573 0.89
50 50 0.051 14 8.6 4.5 1 23 1615 1637 1615 1637 0.98

Sequence Information

Coding Sequence
ATGAGTGATTTGGATGATTTATCCTTTGCGTGCAAAAAAGTTATATCTGTAAAAGAGGAAGTTGAAATCTGCGAAACACAGTTACTAGAACAGATTTTCGATAGATTCGAAGTGCCACATGTTGACCCTTGTGTTAAAATCGAAAAAGGGAATACAGAAGATAATCAAACCGAATTAAACGATTGTGAAGGGATTAGGGAGCAACCATACGAGTGCTACTTACGACCGGATGATTCCACCAGTTCAGCTCGAAAACATGACGATATTAGGCCATCCTTGAAAAATAATCGTACATGCTCGATTTGTGCGAAATCATTCAGGTCCAGTTCAAGGCTTGAAGTGCATATGAATGTTCATACTGGCAGTCGGCTGTATGCATGCACAATATGCCCTCGAAAGTACGCCAGTAATAATGGCTTACGCGCGCACATTTTAACTCACGGTTCCGGATTCAAATGTACGTTTTGTGAAAAGTCTTACAAATCCAGATCCGGAATGAATAAGCACATTAAAATTCATCGTGGCGAACCTAATTTCCAGTGTGAAATCTGTTCCAAATCATTCACGAGCAAGTATCTACTCTCGTCACACCATAAAACGCATtcggataaaaaaaaatatgagtgCACGCTGTGCAGTGAAAAATTCGCAGAGAATAGCACGCGGCTTAGGCACGAAAAGACGCACACAAATAACCATAGTAATTATTATGAatgtgaaaattgtaaaaagtatTTTAGAACAGAGGACCAGTATCGCAGTCATATGAAAGGTAATAAGTGCAAAGGATATGCTTCAGAAAAACCATTTGAATGTTCCAAGTGTGAAAATGAATCATTCGTAAGCATTGAACTCCTAAACCAGCACAAAGTCGAAGTTCATCCGCCCTCTAAAAAAAACCATGCATGCTCAGTTTGTACGAAAGCCTACAGATATGATGCTGAACTTCGCGTGCATATGAACGTTCATACTGGCAAGCGGCCGTACGCATGCACAATATGTTCAAAGGCGTTCATCTGGCCCAGTAACTTACGCAAGCACATGTTAACTCACAGCGCAGAAGCGAAGAATCGCTCCAAGTTCAAATGTACACTCTGCGAAAAGCTTTGTAGTTCAAGGGGCCGACTGAATAGTCACATGAATATGCACCGAGGCGAACGTGCTTTTAAGTGCGACACCTGTCCCAAAACGTTTATGAGCCCAGATACACTCTCGTCACACAAAAAACTGCATGCAGATGATAGGGAAAATGAGTTATGCAATCAAGAATCTACAGACAATAACAAGCCATTTCAATGTTCCAAATGTGAAAATGAATCATTCGTAAGCATTGAGCTCCTAAACCAGCACAAAGTCGAAGTTCATCCACCcccgaaaaaaaatcatgcatgCTCGATTTGTGAGAAATCATTCAGGTCGAGTTCAAGGCTTAAAGTGCATATGAACGTTCATACCGGCAATCGACCGTACGCATGCACAATATGTTCAAAGGCGTTCATCTGGCCCAGTAACTTACGGCGGCACAAGTTAACTCACAACTTCTTAAGAGCAGAAAGGAAGATTCGTCACAGATTCAAATGTACGCTCTGTGAAAAGTCATACTACTCAAAAACCAATCTGAATAGTCACATGCATTTTCATCGTGGCGAACGTCCTTTCAAATGCGAAATATGTCCCAAGTCATATACGAGCGCAAAATCACTCGGATCACACATTAATACGCATTCTGATGAGAAGAAAGTTGAGTGCACGTTATGTAATTTAAAATGCCGAGCCAATAGTGAGCGGCGTAAGCACGAGAAAACGCATACAAATCGCCGTACGAATTATCAATGTGAAAAGTgtaaattgtattttagattggAGGACCAGCACCGCAGCCATTTGGAAGATAATAAATGCAGAGGATATGCTTCAGAAAAGCCTTTTGAATGTTCCAAGTGTGAAAATGAATCATTCGTAAGCATAGAGCTCCTAAACCAGCACATACTCGAAGTTCATCCACCCTCTAAAAAAAACCATGCATGCTCAGTTTGTACGAAAGCCTACAGATATGATGCTGAACTTCGCGTGCATATGAACGTTCATACTGGCAAGCGGCCGTACGCATGCACAATATGTTCAAAGGCGTTCATCTGGCCCAGTAACTTACGCAAGCACATGTTAACTCACAGCGCAGAAGCGAAGAATCGCTCCAAGTTCAAATGTACACTCTGCGAAAAGCTTTGTAGTTCAAGGGGCCGACTGAATAGTCACATGAATATGCATCGAGGCGAACGTCCTTTCAAGTGCGACACCTGTCCCAAAACGTTTATGAGCCCAGATACACTCTCGTCACACAAAAAACTGCATGCAGATGATAGGGAAAATGAGTTATGCAATCAAGAATCTACAGACAATAACAAGCCATTTCAATGTTCCAAATGTGAAAATGAATCATTCGTAAGCATTGAGCTCCTAAACCAGCACAAAGTCGAAGTTCATCCACCcccgaaaaaaaatcatgcatgCTCGATTTGTGAGAAATCATTCAGGTCGAGTTCAAGGCTTAAAGTGCATATGAACGTTCATACCGGCAATCGACCGTACGCATGCACAATATGTTCAAAGGCGTTCATCTGGCCCAGTAACTTACGGCGGCACAAGTTAACTCACAACGCAGAAAGGAAGATTCGTCACAGATTCAAATGTACGCTCTGTGAAAAGTCTTACTTCTCAAAAACCAATCTGAATAGTCACATGTATTTTCATCGTGGCGAACGTCCTTTCAAATGCGAAATATGTCCCAAGTCATATACGAGCGCAAAATCACTCGGATCACACATTAATACGCATTCTGATGAGAAGAAATTTGAGTGCACGTTATGTAATTTAAAATGCAGAGCCAATAGTGAGCGGCGTATTCACGAGAAAACGCATACAAATCGCCGTACGAATTATCAATGTGAAAAGTgtaaattgtattttagattgaAGGACCAGCACCGCAGCCATTTGGAAGATAATAAGTGCAAAGGATATGCTTCAGAAAAGCCATTTGAATGTTCTGAGTGTGAAAATGAATCATTCGTAAGTATTGAGCTCCTAAACCAGCACAAAGTCGAATTTAACCCACCCTCGAAGAAAAATCACATATGCTTGATTTGCACGAAAACATACAAGCTTAAGTGCTTGCTTGACATACATATGAACTCTCACACTGGTAGTCGGCCTTATACATGCCCAATTTGTCCAAAACGGTTCGCCTGCCCCGGCAGCTTAAACCGGCACAAGTTAGATCACAACTTCTTAAGCGCAGATGGGAAGATTCGTGGCAGATTCAAATGTACGCTCTGTGAAAAGTCTTACTCATCAAAAATCAGTCTGAATAGTCACATGTATTTTCATCGTGGCGAACGTCCTTTCAAGTGCGAAATATGTCCCAAGTCGTATATGAACCCACACACACTCAAATGGCACATTAAAACGCATTTTGATGAAAAGAAATTTGAGTGCACGTTATGTAATTTAAAATGCAGAGGCAATAGTAAGCGGCGTATTCACGAGAAAACGCATACAAATCGCCGTATGAATTATCAATGTGAAAAGTgtaaattgtattttagattggAGGACCAGCACCGCAGCCATTTGGAAGATAATAAGTGCAAAGGATATGCTTCAGAAAAGCCATTTGAATGTTCTAAGTGTAAAAATGAATCATTCGTAAGTATTGAGCTCCTAAACCAGCACAAAGTCGAAGTTCATCGACCCCCGAAAAGAAATCATGCATGCTCGATTTGTGAGAAATCATTCAGGTCGAGTTCAAGGCTTAAAGTGCATATGAACGTTCATACCGGCAATCGACCGTATAAGTGCACAGTGTGCCCTAAAAAGTACGGCTGCCCCACGAGCTTGCGCTATCACAAGTTAACTCACAGCTTATTAAGCGCAGATAGGAAGATTCGTGGCAGATTCAAATGTACGCTCTGTGAAAAGTCTTACTTCTCAAAAATCGGTCTGAATAGTCACATGTATTTTCATCGTGGCGAACGTCCTTTTAAATGCGAAATATGTCCCAAGTCATATAGGAGCGCCATCTCACTCAGATCACACGTTAATACGCATTCTGATGAAAAGAAATTTGAGTGCACGTTATGTAATTTAAAATGCAGAGGCAATAGTAAGCGGCGTATTCACGAGAAAACGCATACAAATCGCCGTATGAATTTTCAATGTGAAAAGTgtaaattgtattttagattggAGGACCAGCACCGCAGCCATTTGGAAGATAATAAGTGCAAAGGATATGCTAGCGAAAACCCATTTCAATGTCCCAAGTGTGACAATGAAACATTCGCAAGCATTGAACTTCTCAACCAGCACAAAGTCGAAGTCCATCCACCCCCGAAAAAAGACCATGCATGTTCAATTTGTGCGAAAGCCTACAGGCGCAAATCTGAACTAGACATCCACATGAATTATCATCGTGGCAAACGTCCTTTCAAATGTGAAGTCTGTTCCAAGCCGTTTTGGAGCAACGAGCTacttttaaaacacaataaaacACATTTCGGCAGGAAGTATGAATGCACAATgtgcaataaaaaattcactGACAATAGCAATCGGCGTCAGCACGAGAAAACGCACTCAAACCGCCGTGAGCGTTATCATTGCATCACCtgtaaaaagtattttaagcTAGAAGATCAATTTCGTAGCCATTACCAAGATAATAAGTGCGAAGGAAATTCACAAGAGCTCCATGCTCATGAAAAAGGGATTCAAGCTCaaaatgaaatagaaaatattggAACTGCTGGAAAATCCCAGAACGATGATATTTTCGAGAAGAAATACAAATGTGACGAATGTAACAAAGAGTTTATttgtaaacgaaatttgaaTTTCCATTTGAGATCACATCTTCCAATTTCGTCAACTTAG
Protein Sequence
MSDLDDLSFACKKVISVKEEVEICETQLLEQIFDRFEVPHVDPCVKIEKGNTEDNQTELNDCEGIREQPYECYLRPDDSTSSARKHDDIRPSLKNNRTCSICAKSFRSSSRLEVHMNVHTGSRLYACTICPRKYASNNGLRAHILTHGSGFKCTFCEKSYKSRSGMNKHIKIHRGEPNFQCEICSKSFTSKYLLSSHHKTHSDKKKYECTLCSEKFAENSTRLRHEKTHTNNHSNYYECENCKKYFRTEDQYRSHMKGNKCKGYASEKPFECSKCENESFVSIELLNQHKVEVHPPSKKNHACSVCTKAYRYDAELRVHMNVHTGKRPYACTICSKAFIWPSNLRKHMLTHSAEAKNRSKFKCTLCEKLCSSRGRLNSHMNMHRGERAFKCDTCPKTFMSPDTLSSHKKLHADDRENELCNQESTDNNKPFQCSKCENESFVSIELLNQHKVEVHPPPKKNHACSICEKSFRSSSRLKVHMNVHTGNRPYACTICSKAFIWPSNLRRHKLTHNFLRAERKIRHRFKCTLCEKSYYSKTNLNSHMHFHRGERPFKCEICPKSYTSAKSLGSHINTHSDEKKVECTLCNLKCRANSERRKHEKTHTNRRTNYQCEKCKLYFRLEDQHRSHLEDNKCRGYASEKPFECSKCENESFVSIELLNQHILEVHPPSKKNHACSVCTKAYRYDAELRVHMNVHTGKRPYACTICSKAFIWPSNLRKHMLTHSAEAKNRSKFKCTLCEKLCSSRGRLNSHMNMHRGERPFKCDTCPKTFMSPDTLSSHKKLHADDRENELCNQESTDNNKPFQCSKCENESFVSIELLNQHKVEVHPPPKKNHACSICEKSFRSSSRLKVHMNVHTGNRPYACTICSKAFIWPSNLRRHKLTHNAERKIRHRFKCTLCEKSYFSKTNLNSHMYFHRGERPFKCEICPKSYTSAKSLGSHINTHSDEKKFECTLCNLKCRANSERRIHEKTHTNRRTNYQCEKCKLYFRLKDQHRSHLEDNKCKGYASEKPFECSECENESFVSIELLNQHKVEFNPPSKKNHICLICTKTYKLKCLLDIHMNSHTGSRPYTCPICPKRFACPGSLNRHKLDHNFLSADGKIRGRFKCTLCEKSYSSKISLNSHMYFHRGERPFKCEICPKSYMNPHTLKWHIKTHFDEKKFECTLCNLKCRGNSKRRIHEKTHTNRRMNYQCEKCKLYFRLEDQHRSHLEDNKCKGYASEKPFECSKCKNESFVSIELLNQHKVEVHRPPKRNHACSICEKSFRSSSRLKVHMNVHTGNRPYKCTVCPKKYGCPTSLRYHKLTHSLLSADRKIRGRFKCTLCEKSYFSKIGLNSHMYFHRGERPFKCEICPKSYRSAISLRSHVNTHSDEKKFECTLCNLKCRGNSKRRIHEKTHTNRRMNFQCEKCKLYFRLEDQHRSHLEDNKCKGYASENPFQCPKCDNETFASIELLNQHKVEVHPPPKKDHACSICAKAYRRKSELDIHMNYHRGKRPFKCEVCSKPFWSNELLLKHNKTHFGRKYECTMCNKKFTDNSNRRQHEKTHSNRRERYHCITCKKYFKLEDQFRSHYQDNKCEGNSQELHAHEKGIQAQNEIENIGTAGKSQNDDIFEKKYKCDECNKEFICKRNLNFHLRSHLPISST

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-