Basic Information

Gene Symbol
ZEB2
Assembly
GCA_947425015.1
Location
OX380338.1:30978791-31007673[-]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The zinc finger follows the pattern #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and the numbers give the number of residues (ranges are shown in brackets). The positions marked # are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
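The pattern in the description can be approximated with a regular expression. The following is a minimal sketch, not the Pfam HMM itself: it assumes the positions marked # may be any residue (the description does not list which residues are allowed there), and the names C2H2_PATTERN and looks_like_c2h2 are hypothetical.

```python
import re

# Simplified form of #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C].
# Assumption: '#' positions are treated as "any residue".
C2H2_PATTERN = re.compile(
    r".{2}"        # '#' followed by X
    r"C.{1,5}C"    # C-X(1-5)-C
    r".{12}"       # X3-#-X5-#-X2 (3 + 1 + 5 + 1 + 2 = 12 residues)
    r"H.{3,6}[HC]" # H-X(3-6)-[H/C]
)


def looks_like_c2h2(fragment: str) -> bool:
    """Return True if the whole fragment fits the simplified pattern."""
    return C2H2_PATTERN.fullmatch(fragment.upper()) is not None
```

For example, looks_like_c2h2("YCCKKCGERFPTPEKLGNHGRKH") returns True; this fragment is residues 22-44 of the protein sequence in this record, the span of the first hmmscan hit. A regular expression is far less sensitive than the profile HMM, so it should only be read as an illustration of the residue spacing.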
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 45 0.00032 0.045 15.7 1.7 1 23 22 44 22 44 0.98
2 45 0.052 7.3 8.8 0.4 2 23 50 72 49 72 0.90
3 45 0.0067 0.95 11.6 5.6 3 23 96 117 94 117 0.96
4 45 0.47 66 5.8 4.5 3 23 123 144 122 144 0.89
5 45 0.044 6.2 9.0 1.2 1 23 154 176 154 176 0.98
6 45 0.0016 0.23 13.5 2.0 1 21 181 201 181 202 0.94
7 45 0.015 2.1 10.5 0.5 6 23 214 231 210 231 0.92
8 45 0.00054 0.076 15.0 1.6 2 23 264 285 263 285 0.95
9 45 1e-05 0.0014 20.4 0.7 1 23 291 314 291 314 0.96
10 45 0.002 0.28 13.2 0.6 1 23 320 342 320 342 0.98
11 45 0.00024 0.034 16.1 3.7 1 23 348 371 348 371 0.98
12 45 0.0078 1.1 11.3 0.6 1 23 376 398 376 398 0.94
13 45 0.0059 0.83 11.7 0.1 2 23 408 429 407 429 0.97
14 45 0.012 1.7 10.7 1.1 2 23 441 461 440 461 0.96
15 45 1.2e-05 0.0017 20.2 0.7 1 23 467 490 467 490 0.97
16 45 0.0016 0.22 13.5 0.4 1 23 496 518 496 518 0.99
17 45 5.4e-05 0.0076 18.2 3.1 1 23 524 547 524 547 0.97
18 45 0.19 27 7.0 0.3 1 23 552 574 552 574 0.93
19 45 0.0006 0.085 14.9 0.6 2 23 584 605 583 605 0.97
20 45 5.6 7.8e+02 2.4 0.2 2 23 615 635 614 635 0.85
21 45 0.00017 0.024 16.6 2.0 1 23 641 664 641 664 0.90
22 45 0.0048 0.67 12.0 0.7 1 23 670 692 670 692 0.98
23 45 0.00014 0.02 16.8 2.7 1 23 698 721 698 721 0.97
24 45 0.031 4.4 9.5 0.2 1 23 726 748 726 748 0.93
25 45 0.0086 1.2 11.2 0.3 2 23 757 778 756 778 0.97
26 45 0.41 57 5.9 0.8 2 23 790 810 789 810 0.95
27 45 0.0094 1.3 11.1 0.1 1 17 816 832 816 833 0.94
28 45 0.00099 0.14 14.2 1.0 1 23 846 868 846 868 0.97
29 45 0.021 3 10.0 0.2 2 23 874 896 873 896 0.96
30 45 0.83 1.2e+02 5.0 2.6 1 11 901 911 901 915 0.94
31 45 0.043 6.1 9.0 1.6 3 23 936 957 935 957 0.97
32 45 0.00061 0.086 14.8 3.1 2 23 963 984 962 984 0.92
33 45 3.9 5.5e+02 2.9 1.5 3 16 991 1001 990 1009 0.72
34 45 5.5e-05 0.0078 18.1 2.9 1 23 1051 1074 1051 1074 0.95
35 45 0.0097 1.4 11.0 1.7 2 23 1131 1152 1130 1152 0.95
36 45 0.058 8.2 8.6 3.4 1 23 1157 1179 1157 1179 0.95
37 45 5.9e-06 0.00083 21.2 0.3 2 23 1186 1207 1185 1207 0.97
38 45 0.00011 0.015 17.2 0.3 1 23 1213 1236 1213 1236 0.96
39 45 2.1e-06 0.0003 22.6 0.3 1 23 1241 1263 1241 1263 0.98
40 45 0.00068 0.096 14.7 0.5 1 23 1272 1295 1272 1295 0.96
41 45 2.9e-05 0.0041 19.0 1.3 1 20 1331 1350 1331 1353 0.93
42 45 4.1e-05 0.0057 18.5 1.2 2 20 1361 1379 1361 1381 0.96
43 45 0.00062 0.087 14.8 0.7 1 20 1390 1409 1390 1411 0.94
44 45 0.0018 0.25 13.4 0.8 1 20 1418 1437 1418 1439 0.94
45 45 4.3 6e+02 2.7 0.0 2 20 1446 1464 1445 1468 0.86
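The rows above are whitespace-delimited with 13 fields each, so they can be read into typed records and filtered. The sketch below assumes that layout; the names DomainHit, parse_hit, and significant are hypothetical, and the 0.01 i-Evalue cutoff is only an illustrative threshold, not one stated by this record.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# One converter per column, in the same order as the fields above.
_TYPES = (int, int, float, float, float, float,
          int, int, int, int, int, int, float)


def parse_hit(line: str) -> DomainHit:
    """Parse one whitespace-delimited domain row into a DomainHit."""
    fields = line.split()
    if len(fields) != len(_TYPES):
        raise ValueError(f"expected {len(_TYPES)} fields, got {len(fields)}")
    return DomainHit(*(conv(f) for conv, f in zip(_TYPES, fields)))


def significant(hits: List[DomainHit], max_i_evalue: float = 0.01) -> List[DomainHit]:
    """Keep hits whose independent E-value is below the (assumed) cutoff."""
    return [h for h in hits if h.i_evalue < max_i_evalue]
```

Calling parse_hit on each data row and passing the result to significant gives the subset of domains with the strongest individual support; weaker rows may still be genuine fingers that score poorly on their own.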

Sequence Information

Coding Sequence
ATGATGCCAGCTCCTGGGTATGAAATTATTGGTTACGTGCCAATGAATGTCCTACTAAATCCCTACTGCTGTAAAAAGTGTGGAGAAAGATTTCCAACCCCGGAAAAATTGGGTAATCACGGCCGTAAACATAAGCCTTGTGATAAACCATGCAAAGTTTGCGGGAAATTATTTAAATCAGAATCTGCGGCTCGATATCACTCACAAAATGTACATGAAAAGATACGGATATGTTCAAATGGAAGAAAGGATTCCGTGAAAGAAGTTGAAGACAATAACCTCGACTGCCCAGATTGCTCTCTATCATGCAGGGATAATGCTCATTTGAAACAACACCACAAACGTTACCATTCTGAACTAGAAGCCTGTACTGAATGCGGTAAACTATTCGTAACAAAACAGTTTCTTCATAATCACTGTCAAAAGATGCACCCTAAGGTGCAATCTAAAGATATTAATTACAAATGTGTAATCTGCGGACAAGAATTTGAATGTGACAAATCTTTAAAAGAGCATATATTACGTCATGCTAATAATAAGTTTCATTGTGAACAGTGCAAGCAAGGATTTAATCAAAAATCGTCTCTTGAAGAACATATTTGTGTAAAAAAAGATGATCTTAGGAACATTTGTTTCCCATGTGACAAAACTTATGCTAATAAAACATCATATCTGTTGCATATGAATTCACATAAACCTTCCGAATTAAAACGTATCGTTTGTAATGAAGTTGTAGCAAAAGAGGGAAATTTATCAATAGATAAAAGTGTGCATAATAAAGAGATGTGTACATGTGATGTTTGTGGAAAAACAATGTCGACTCCAGGAAATTTGCGCACTCATAAAATGCAACATAATCCAGAAACTTGTTTTTCTTGTGCGGAATGTGGAAAAAAGTTTACAAAAAAAACCGATCTTACCCTTCATATTAATTTACTTCACTTAAAAACAATTAAATTTGAATGCACAATATGTGGACGTCAGCTATCCAGTAAGAAAAGTTTAGGTTTGCATACATTACGTCATACTAATAATTACAAATTTCGTTGTGAACAGTGCAAACAAGGATTTACTCAAAAATGTCTTCTTAAAGAACATATTCGCGTGAAACATGAAGGCTTTAGATACATTTGTTTTCCGTGTGATAAAACTTATTCTGATAAAAAATGGTATATGTTACATATGGATTCGCATAATCCAGCAACAAAAGATTTAGGTTTAGAATGTACCATTTGTAATGTAGTTTTACCAAATAAAAGGAAATTAATTTTGCACAGAAGAATACATTCAGAATCCGAGAAAGCTAAACGTGAATCTTGCGTATGTGATATATGTGGAATAGTATTACGGAAATCATCTTTACAGCTACATAAAAAGTCACATGACAGTGAAACTAATTTTCCTTGTTCCATATGTGATATGGTATTCAAGCGAAAAGAAGGACTTAAAACTCATACTCAAAAGCTACACTGGAATATAGCTAAATTCAAATGCTTGATTTGTGAACGTCAGCTAGCCAGTAAAAGAAGTTTAAATCTGCATATATTAACTCATTCTAATAATTGCAAATTTAGTTGTGAACAGTGCAAACAAGAATTTACTCGAAAACATCTTCTTCAAGAGCACATTCGTGTGAAGCATGAAAACTTTAGATACATTTGTTTCCCGTGTGATAAAGTTTATTCTGATAAAGCATGGTATAGGTTCCATATGGAATCGCATAATCCGGCAACATCAGGTTCTGGGCTAGAATGTACTGTTTGCAAAAAAGTTTTCCCAAATGATAGGAAATTAGATGTACACAGAAGAACACATACAGATAATAACGTGAAACCCAAGAGTAAATGTGATATATGCGGAATATCGTTACTTACAGTGTCTTTAGCACAGCATAAAAAACTACACGATGAAAACATTAAATTTCCTTGTTCCTTATGTGATAAGGTATTCAAGCGAAATGATGCACTTAAGATTCACTGTCAAAAGTTACACTGGAATAT
AGGTAAATTTAAATGCTTGATCTGTGGTCGTCAGCTAGCCAGTAAGAGAAGTTTAGATCAGCATACTCTACGTCATACTGATAAAAACAAATTTAGTTGTGAACAGTGCAACCAAGGATTTACTCAAAAATGTCTGCTTAAGGAGCATATTCGCGTGAGACATGATGGCTTTAGATACATTTGTTTCCCTTGTGATAAAACTTATGCTGATAAAACATGGTATATGTTACATATGGAGTTACATAATCCAGAAACAGCCTCAAGATTAGAATGTACCATTTGTAATGCAGTTTTACCAAATAGAAGGAAATTAATTCTGCACAGAAGAAAACATACAGAGTACGGGAAAACCAGAAGAGAAACTTGCGTATGTGATATATGCGGAATATCGTTACTAAAATCTTCTTTGCTACTGCATAAAAGGCTACATGACAGTAAAGCTAATTTTCCTTGTTCCACATGTGATATGGTTTTCAAGCGAAAAGATGCACTTAAGATTGTATTCGAATGGGTTCCAGTCAAACAAGAGCAGCAGTTCATCTGCAGCGTCTGCGGGTTCACTTTCGAAAATGAGACCAAATACAATCAACATCAAAAAGGCCACAATACCAAGACAGTGCCTTGTGAAGTTTGCAACAGAGTCTTCCTCAACTCTAAGCGTCTTGAAATGCATATAAAGAAAATCCACAAGCCAGTTAAATATCAGTGCGATCAATGCGGTAAAGAATTTAGTTGTCCATATTCGAGTACCAGTCAACAACAGATGCAAGATCCATTGCAAAAAGATTCGAAGACAAATAAATTCTGTACGATTTGCAATCAAGAATTCCTGACTCGTCGTAAGTACATGGAACATATGCGAACTCTTCACGCACCAAAGATTTTGCCATGCAAATACTGTGGAAAGAAATTCAATATGCGATCCCAATTGAAATTACATGAACATTTTCACGAGAGAGTTCATGTTGCTTGTGAAATATGCCATAAAAAATTTGGAAATGTTTATGAACACAAACGTCGTCAACATGCAAAAGAATTTCCTCACCAGACATCCGCTGAGCAGTGGCTTCAAGGTCTACGAAAAATTGCACCGAAGTTAAACCCCAGTGCTATACTAATAATTAATGAACATGATTACGTATTAAAAGAATACAAGTGTAAACAGTGTAAAGAATGTTTTACCTCTCTAGAGTATTTACAAAAACATATTGAAGAAATCCATGAAGATGTAGCCAAAGATACAAATATTTCAAAATCATCTGATACCAGTatgtgtgatatatgcgTGCTTATTTCCAGGCTTCCCAATCTTCATCCGTTTCAAGATCCATTAAAAGTCGAACCACTCTCAGCTGAGACAGATATGTCGATGGAAGATCCACTATACAAGTACTGTACCATTTGTCAACGAGAATTTCCTACCAGACGTAAATTTGTAAGGCACAATCGAGCACACAGACCTAAAATTTATTCGTGTAAACTTTGCGACCGAAAATTTGGCTTTATTTTTCAACTAAAACATCATGAAGAAAATCATGATAACATTCGAAATATTCCTTGCGATTTGTGTGGAAAGAAATTCTCTAAGCAAGCATATCTACTTGTTCATCAAAGACGCCACGAAAAAAACTTTAGATTCAATTGCAAGGAATGTGATAAAGGATATATTTCTAATCAAGAGCTAAGCAGACATATTGAAGTGACTCATCGTACTGTTTCCTATCCATGCTCGATTTGTGGCAAAGGTTTTAGAAGTCGCGCTTATTTACTGTTACATTTAAAAACTCATGGCACTGACAACGAACGGCCAGCATATAAATGCGAACACTGTGATAAAGTGTATGCATCTGTGAAAGGTGTCCAAAGACATACTGAAGTAGTTCATGAAAACTTAACGTTCATGtgtgatgaatgtgATCAGAAGAAGTCTGAAGATAAAGAATGGAACAATCATAGAAAACAAAATGGCATAACGACATTACGTAAAGAATATAAATGCC
CATCATGTTTTAAAACGTTCAAAAACAAGGGTTCTCTGATACGTCATATTCCCAGACACGTGGAAGAAAACCCTAAACCAGAGTGCAACATTTGTAGCAAAACCTTCAAAAACTCACATAGTTTAAAGAAACATATTTGGCTGGTACACGAATCGAAAAGCGAAATATATCAGTGTAAAGTCTGTGAACGACAATTCAACTATAAAACAAATCTTGCTCAGCATGTTACTCGCTACCCTCCGGAAAAGGACTTCTCTTGTCCTTTGTGTGACAAAAAATTTCACTATGAAAAGACATTGCAAGTACATCTTACCATTGAAAGATTAGGAGCTCTTCCATGCTATGCTTGTGAAAGTAGTTTCACAACCGTAGCAAGTTTAAAACTCCATGCTGTTGAAGTGCATGATCAGAAGAGGGGAATGGAGACCTCGAAATGA
Protein Sequence
MMPAPGYEIIGYVPMNVLLNPYCCKKCGERFPTPEKLGNHGRKHKPCDKPCKVCGKLFKSESAARYHSQNVHEKIRICSNGRKDSVKEVEDNNLDCPDCSLSCRDNAHLKQHHKRYHSELEACTECGKLFVTKQFLHNHCQKMHPKVQSKDINYKCVICGQEFECDKSLKEHILRHANNKFHCEQCKQGFNQKSSLEEHICVKKDDLRNICFPCDKTYANKTSYLLHMNSHKPSELKRIVCNEVVAKEGNLSIDKSVHNKEMCTCDVCGKTMSTPGNLRTHKMQHNPETCFSCAECGKKFTKKTDLTLHINLLHLKTIKFECTICGRQLSSKKSLGLHTLRHTNNYKFRCEQCKQGFTQKCLLKEHIRVKHEGFRYICFPCDKTYSDKKWYMLHMDSHNPATKDLGLECTICNVVLPNKRKLILHRRIHSESEKAKRESCVCDICGIVLRKSSLQLHKKSHDSETNFPCSICDMVFKRKEGLKTHTQKLHWNIAKFKCLICERQLASKRSLNLHILTHSNNCKFSCEQCKQEFTRKHLLQEHIRVKHENFRYICFPCDKVYSDKAWYRFHMESHNPATSGSGLECTVCKKVFPNDRKLDVHRRTHTDNNVKPKSKCDICGISLLTVSLAQHKKLHDENIKFPCSLCDKVFKRNDALKIHCQKLHWNIGKFKCLICGRQLASKRSLDQHTLRHTDKNKFSCEQCNQGFTQKCLLKEHIRVRHDGFRYICFPCDKTYADKTWYMLHMELHNPETASRLECTICNAVLPNRRKLILHRRKHTEYGKTRRETCVCDICGISLLKSSLLLHKRLHDSKANFPCSTCDMVFKRKDALKIVFEWVPVKQEQQFICSVCGFTFENETKYNQHQKGHNTKTVPCEVCNRVFLNSKRLEMHIKKIHKPVKYQCDQCGKEFSCPYSSTSQQQMQDPLQKDSKTNKFCTICNQEFLTRRKYMEHMRTLHAPKILPCKYCGKKFNMRSQLKLHEHFHERVHVACEICHKKFGNVYEHKRRQHAKEFPHQTSAEQWLQGLRKIAPKLNPSAILIINEHDYVLKEYKCKQCKECFTSLEYLQKHIEEIHEDVAKDTNISKSSDTSMCDICVLISRLPNLHPFQDPLKVEPLSAETDMSMEDPLYKYCTICQREFPTRRKFVRHNRAHRPKIYSCKLCDRKFGFIFQLKHHEENHDNIRNIPCDLCGKKFSKQAYLLVHQRRHEKNFRFNCKECDKGYISNQELSRHIEVTHRTVSYPCSICGKGFRSRAYLLLHLKTHGTDNERPAYKCEHCDKVYASVKGVQRHTEVVHENLTFMCDECDQKKSEDKEWNNHRKQNGITTLRKEYKCPSCFKTFKNKGSLIRHIPRHVEENPKPECNICSKTFKNSHSLKKHIWLVHESKSEIYQCKVCERQFNYKTNLAQHVTRYPPEKDFSCPLCDKKFHYEKTLQVHLTIERLGALPCYACESSFTTVASLKLHAVEVHDQKRGMETSK
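The protein sequence can be checked against the coding sequence by translating it. The sketch below assumes the standard genetic code (NCBI translation table 1) and a coding sequence that starts in frame; the function name translate is hypothetical. Input is upper-cased first because the coding sequence contains lowercase stretches.

```python
from itertools import product

_BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks, ignore soft-masking
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)
```

For instance, translate("ATGATGCCAGCTCCTGGGTATGAAATTATT") returns "MMPAPGYEII", the first ten residues of the protein sequence above. Whitespace is removed before translating, so a coding sequence that has been wrapped over several lines can be passed in as is.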

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-