Basic Information

Gene Symbol
ZFX
Assembly
GCA_905333045.1
Location
HG995321.1:15542765-15548732[-]
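The Location value packs the sequence accession, coordinates, and strand into one string. The following is a minimal Python sketch for splitting it into fields. The names `Location`, `parse_location`, and `span_length` are hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that this record does not confirm.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str


# Hypothetical pattern for strings shaped like "ACCESSION:START-END[STRAND]".
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into accession, start, end, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["seqid"], int(match["start"]), int(match["end"]), match["strand"]
    )


def span_length(loc: Location) -> int:
    # Assumes 1-based, inclusive coordinates (not confirmed by the record).
    return loc.end - loc.start + 1


loc = parse_location("HG995321.1:15542765-15548732[-]")
print(loc.seqid, loc.start, loc.end, loc.strand, span_length(loc))
```

The `[-]` suffix is read here as the minus strand, so the genomic span would need reverse-complementing before comparison with the coding sequence.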

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The following pattern describes the zinc finger:
#-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C]
Here X can be any amino acid, and the numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
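The residue pattern in the description can be approximated with a regular expression. The sketch below is one possible rendering, not the method used to annotate this record (which relies on the Pfam HMM). It treats the `#` positions as any residue, which is a simplification, and the names `C2H2_RE` and `find_c2h2_candidates` are hypothetical.

```python
import re

# '#-X-C' becomes '..C'; 'X(1-5)' becomes '.{1,5}';
# 'X3-#-X5-#-X2' is 3 + 1 + 5 + 1 + 2 = 12 residues between the second Cys
# and the first His; 'X(3-6)-[H/C]' becomes '.{3,6}[HC]'.
C2H2_RE = re.compile(r"..C.{1,5}C.{12}H.{3,6}[HC]")


def find_c2h2_candidates(protein: str):
    """Return (start, end, text) tuples using 1-based inclusive coordinates.

    Only non-overlapping matches are reported, scanning left to right.
    """
    return [(m.start() + 1, m.end(), m.group()) for m in C2H2_RE.finditer(protein)]


# Fragment copied from the Protein Sequence field of this record.
fragment = "YFCPYCDSAFSTPTRLIHHINRH"
print(find_c2h2_candidates(fragment))
```

A regex like this is much less sensitive than a profile HMM: it ignores the hydrophobic preferences at the `#` positions and cannot score partial matches, so it is best used as a quick sanity check.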
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 27 9.1e-05 0.0075 17.3 1.2 1 23 249 271 249 271 0.98
2 27 1.6 1.3e+02 4.0 0.4 6 23 283 300 283 300 0.96
3 27 1.2 1e+02 4.3 1.5 1 23 309 332 309 332 0.90
4 27 0.00011 0.0087 17.1 3.0 3 23 353 373 352 373 0.98
5 27 0.00017 0.014 16.5 0.7 1 23 379 402 379 402 0.96
6 27 0.076 6.2 8.1 0.1 1 23 408 430 408 430 0.94
7 27 0.0013 0.11 13.7 0.1 1 23 436 458 436 458 0.94
8 27 3 2.4e+02 3.1 1.0 1 23 579 601 579 601 0.91
9 27 3.2e-05 0.0026 18.8 3.3 2 23 608 630 607 630 0.92
10 27 6.7e-05 0.0055 17.7 1.7 1 23 636 658 636 658 0.96
11 27 1.1e-05 0.00089 20.2 0.6 1 23 664 686 664 686 0.97
12 27 0.0024 0.2 12.8 0.1 1 23 692 714 692 714 0.98
13 27 0.02 1.7 9.9 1.7 1 23 720 742 720 742 0.95
14 27 0.34 28 6.1 6.0 1 23 748 771 748 771 0.92
15 27 1.4 1.2e+02 4.1 0.1 2 23 830 850 829 850 0.94
16 27 0.00024 0.02 16.0 0.3 1 23 856 879 856 879 0.89
17 27 3.8e-05 0.0031 18.5 1.5 1 23 885 907 885 907 0.94
18 27 0.00065 0.053 14.6 0.3 1 23 913 935 913 935 0.97
19 27 0.00068 0.056 14.6 1.0 1 23 941 963 941 963 0.96
20 27 0.0024 0.2 12.8 0.3 6 23 973 990 972 990 0.99
21 27 3.1 2.5e+02 3.1 0.5 1 23 1056 1077 1056 1077 0.86
22 27 0.022 1.8 9.8 0.5 3 23 1196 1217 1195 1217 0.95
23 27 0.0019 0.16 13.2 0.1 1 23 1223 1245 1223 1245 0.97
24 27 0.0064 0.52 11.5 0.0 1 23 1251 1273 1251 1273 0.97
25 27 0.00021 0.017 16.2 3.1 1 23 1279 1301 1279 1301 0.96
26 27 0.00014 0.011 16.8 0.4 1 23 1307 1329 1307 1329 0.96
27 27 0.00019 0.016 16.3 0.5 2 23 1335 1357 1334 1357 0.95
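Each row of the table above has 13 whitespace-separated fields: domain index, domain count, conditional E-value, independent E-value, bit score, bias, then start and end coordinates on the HMM, the alignment, and the envelope, and finally the alignment accuracy. The following Python sketch parses a few rows copied from the table and keeps those below an independent E-value cutoff. The cutoff of 0.01 and the names `DomainHit` and `parse_hit` are illustrative assumptions, not values or tools taken from this record.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# One converter per column, in table order.
_CASTS = (int, int, float, float, float, float, int, int, int, int, int, int, float)


def parse_hit(line: str) -> DomainHit:
    """Convert one whitespace-separated table row into a DomainHit."""
    fields = line.split()
    if len(fields) != len(_CASTS):
        raise ValueError(f"expected {len(_CASTS)} fields, got {len(fields)}")
    return DomainHit(*(cast(value) for cast, value in zip(_CASTS, fields)))


# Rows 1, 2, 9, and 11 copied from the table above.
ROWS = """\
1 27 9.1e-05 0.0075 17.3 1.2 1 23 249 271 249 271 0.98
2 27 1.6 1.3e+02 4.0 0.4 6 23 283 300 283 300 0.96
9 27 3.2e-05 0.0026 18.8 3.3 2 23 608 630 607 630 0.92
11 27 1.1e-05 0.00089 20.2 0.6 1 23 664 686 664 686 0.97
"""

hits = [parse_hit(line) for line in ROWS.splitlines() if line.strip()]

I_EVALUE_CUTOFF = 0.01  # illustrative threshold, not taken from the record
confident = [h for h in hits if h.i_evalue < I_EVALUE_CUTOFF]
print([(h.index, h.ali_from, h.ali_to) for h in confident])
```

Weak individual hits such as row 2 are common in long C2H2 arrays, so a per-domain cutoff like this may discard genuine fingers that sit next to strong ones.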

Sequence Information

Coding Sequence
ATGAATATTGAGAGTGAAAAAATTCTTGTAAATTCAAGTGCTAATTTGTCGCAAGAGAGCCTAAATTTCGCGGAAACCTGCAATGTGTTGAACGGTCTAGAAAATTTGTCAAATGTCTGCCGAATATGTGCAAATGTAACTGAGTTTGTTATACCCATATTCATCGGTGAGGGTCTACACAATAACTTGGCGCAAAAAATTCAGCAACATCTTCCAATAAAGATATCAGAGGATGATGTTCTGCCGCATGTGGTGTGCTACCAGTGTGCCAGCACAATCCTCGCCTGGCATGACCTCGTGGAGTGTTGCCTCCAGGCTGACGTGACACTGCAGCAGCGAATATCTTCACTTACTAATGAAACAGAGCTGAAAAAAAATTCAAACCAAAGATTAGAGGATAATGAAGAAAGTTCTGCCAAAAAAGGGGAGGATAAACTTATAAGCCTAGTGTATGACATTCTTCGTGAGTACCTACAACTTATGGAATTAGATGAAAATAATATTATGGATTTAGATTATGTGTGCCAGGTCTGCACCCACAAACCTGCAGCTAAAACAATAGGCAACTTGGTGAAACACCTCCGGTTGTCTCATAGCACGGAACTCTGCAACTCGGAGTCTATTCGGAACTTCATCAGGAACAATATAACTTTTGAAGAAACTTTAAGCGCGGAAGATAAAGACGACCATGGGAGTGATACAGACAAAGGAATCGATAAAGAGGCCGGCACCGACTTGCCGGCATACTTCTGCCCCTACTGCGACAGTGCCTTCTCCACCCCCACTAGGCTGATACATCACATCAACAGACACATAGATATATCGACAGAGAACGTGCTAACATGCTGCGGGGAAAGCTTCGACAAAAAATCCTTTGCGCTCCACCTGCAGCGCAGGCACGTGGCGCGCGTCGTCGCGGGCCCGCACGTGTGCCGGAGCTGCTCCCACAGCGCGCGCACCGAGCACGATCTACAGGCGCACGTGGCCGCCGAGCACGGCGCGCCGCCCGTCGCCCGCCCCGAGCCGCGCCGTCGCCGCTCGCACGCGCCCGCCGCCTGCCCACGGTGCCGCAAGACATTCTCCAACAAACACAACATGCTGGTGCATCTGAGGTCCCACGACGCGAGCGAGCCGTTCGCGTGCGACAGATGCGACAGGCGCTACAGGACCCGGAGCGGCCTGAGCGAGCACCGCCGGATCGTCCACGAGGGCGTGATGCGGCACGTGTGCGCGACGTGCGGCGAGGCCTTCCCCACGCGCGGCGCGCGCGACGTGCACGCCCGCCTGCACTCCGGCGCGCGCCCCTTCGCCTGCGACTACTGCGGCAAGGCCTACCGCGCCAAGAACACGCTGGACCGCCACGTCGAGGCGCACCTGGACGTCCGGAAGTACCCGTGCGCGCCCCGCCCCGGCTCCGAGCCGGCGCCGAGCGAGGCGGCCGGGGAGCGCGCCGATCGCGACGACGACGACGAGCCCCCGGCGAGCGGGGTGGGCAGGAGGGCCGCCGAGCTGTACGGAAACTTCTACGGCGCCCTCGTAAATTTTCGGGACCATCTTGCCACCGAGCACGAGGGGAGCTCCCCGGACTCCGGCGACTCCGACGAGGGCGGGAGCGACGAGGCGGTCGGCGACGAGTACGACGACCTCACCCGGGGCGCGAGGAGGGCGGGGGGGAGCGCTGAGACCGAGCTCGAGCTCGCCCGCGCGCAGACGACGGTCGACGGGCGGGCGTACTACGCGTGCGGGACGTGCGGCAAGCACCTCGGCTCGGCGCAGACCTACCTGTTCCACCGGCGCATCCACACGGACGAGCGGCCCTGCGTGTGCCACGTCTGCGGCAAACGGTTCCGCACGCCCAACGGCCTGCGCCGCCACCTCACCGAGACCCACGAGCGCCGCCGGCTCCACGCCTGCGCGCTCTGCCACAGGACCTTCGCCAACTCCCAGAACCTCCGGGCGCACGCGCGCATCCACACCGGCGAGAAGCCGTTCGTGTGCGC
GCAGTGCGGGAAGCGCTTCACGCAGAGTGGCTCCCTACACGCGCACGCGCGCGGCCACAGCGCGCTGTACCCGTACCGCTGCGCCGAGTGCGGGGCACAGTTCAAGCTGCGCGCCGGCCTCGCGCGCCACGCGCTGCGGCACACGGGCGAGCGGCCGCACGCCTGCGACGCGTGCGCGCGCGCCTTCCGCCACCGCCACGAGCTCGCCGCGCACGCGCTCACGCACACCGACGCCCGTCCGCACGCCTGTCGCCTGTGCGGCTCCGCCTTCCGCCAGCGCAAGGCGCTGCGCCACCACTGCAGGCGCGTCCACGAGACCGACGTCGCCGACGCGCCCGGGCCCGTCTTCGACCACGTCGGCCCGTACGATTCGAACGTAAAGGACGAGATCGGGGACGAGAGTCGATCGAGCGGCCCCGCGCAGCGAAAGCCGGCCGACGAGTCGAGGAACCCGACTACGCTCCCGAGCGACGGAAAAGAATACGTGGAGTGCACACTGTGCGACCAAAACGTCCCGGCCGCGTCCTGGCGCCGCCACCGGCGGTCCCATCAGGGCGGGCGGCCGCACAGCTGCCACGTGTGCGGGCTGGGGTTCGCCGACCGCGGGAACCTCGCGCGCCACGCGGCGGAGCGCCACCGCCGCCTGCGCCCGCACGCCTGCGCCCTCTGCGGCCGGACGTTCGCGCGCAAGTCGCACCTGCGGGACCACGCGGCGTCGCACTCGGCCGTGCGCGACCACGTGTGCGACGTGTGCGGCCGCGCCTCCAAGTCCGCCGCCGCGCTGCGCATGCACAAGAAGATGCACGACGACACGCTCGGGTTTCACTGCGCGTTCTGCGACGCTCGATTCAGGAGGAGGAGCCAGCTCGTGGCGCACGCGTCCGTGCACACCGGGGAGAGGGCCCACGCGTGCGCCTGCGGCCGGACCTTCCGTCTGCGCAGCCAGCTCTACGCGCACGCCCGCACGCACGACGACGGCCGACCGCGGGGAGACGGGCCGGGCCCCGATCCGCGGGACGATGTCGCGGGGCCGATCGACCGGCCCCACCCCGCCGAGGCGGCGGCCCCCCCTCCGGTGCCCCCGCGGACCTGCGCCTATTGCCGGGCGGAGTGCCCCGAGCCCCAGCTAAAGGAGCACCTGATATCGACGCACTCGGAGCTGCTGTTCCGCTGCGACGAGTGCGACAGCTACGTCGACCGCCAGGACTTCGTCGAGCACATGTCGCAGCACGCCGTCGAGTACGCCGCCGACGCGGAGGAGGCCCGCCCCGCGCCCCCGGACCCCGGGGGCCGGGAGGGCGAGCGCGCGGCGGACGGGGTCGAAAACGAGGACGGAGAAGCGACGGGAGAGGAGAGAGACGGGGGCGGCTCGGTGAACGGGGACGAGTTCTCGGACCGCAGCGACGCGGAGTACTTCGGCGAGCTGCCCGAATCCGTGTTCGAGGCCATCGAGGACTCGCAGGACACGCAGGCGGCCGACGACGGCGACCGCCTCTCGCGGGCCGACGCCGCGACCCCCGCCGGCCCCGCGGCGCCGCTCGCGGGACCCCCCATCGACGGGAGGAAAACGAAAAAGGAGCGGGCGTGCCCCGTCTGCTCGAAGGTGTACACCGCCAGCTCCAGCTACTTCTATCACGTGAAGCACGCGCACGGCGCCGCCCGGCAGCACGAGTGTGACGTCTGCGGGCGGAGGTTCGGCGCCAAGCCGACGCTGGCGCAGCACCTGCTCATCCACGCGGCGGAGCGGCCGTACGCGTGCGCGCAGTGCGGCGCCACGTTCCGCGCGCGCGCCGCGCTCTACGTGCACGAGCAGACGCACGGCGGGCGCAGGACGCACGCGTGCGGGGAGTGCGGGCGCTCGTTCCGCTGGCGCGCGCACCTCGAGCGGCACCGGCGGCGTCACGCGGGCGGGCGCGAGCACGCGTGCGACGCCTGCGGGCGGAGCTTCAACGTGCGCGCCGACCTGCTGCGGCACGCGCGCACGCACGCGGCGGCCGCGT
TGCCGTGCCCGCACTGCGAGCTGCGGTTCGCGCAGCCCCGCTACCTGCGCGCGCACCTCAACAACAAGCACGCCGCCGAGGGAGGAGGAGTCGTGAAATGA
Protein Sequence
MNIESEKILVNSSANLSQESLNFAETCNVLNGLENLSNVCRICANVTEFVIPIFIGEGLHNNLAQKIQQHLPIKISEDDVLPHVVCYQCASTILAWHDLVECCLQADVTLQQRISSLTNETELKKNSNQRLEDNEESSAKKGEDKLISLVYDILREYLQLMELDENNIMDLDYVCQVCTHKPAAKTIGNLVKHLRLSHSTELCNSESIRNFIRNNITFEETLSAEDKDDHGSDTDKGIDKEAGTDLPAYFCPYCDSAFSTPTRLIHHINRHIDISTENVLTCCGESFDKKSFALHLQRRHVARVVAGPHVCRSCSHSARTEHDLQAHVAAEHGAPPVARPEPRRRRSHAPAACPRCRKTFSNKHNMLVHLRSHDASEPFACDRCDRRYRTRSGLSEHRRIVHEGVMRHVCATCGEAFPTRGARDVHARLHSGARPFACDYCGKAYRAKNTLDRHVEAHLDVRKYPCAPRPGSEPAPSEAAGERADRDDDDEPPASGVGRRAAELYGNFYGALVNFRDHLATEHEGSSPDSGDSDEGGSDEAVGDEYDDLTRGARRAGGSAETELELARAQTTVDGRAYYACGTCGKHLGSAQTYLFHRRIHTDERPCVCHVCGKRFRTPNGLRRHLTETHERRRLHACALCHRTFANSQNLRAHARIHTGEKPFVCAQCGKRFTQSGSLHAHARGHSALYPYRCAECGAQFKLRAGLARHALRHTGERPHACDACARAFRHRHELAAHALTHTDARPHACRLCGSAFRQRKALRHHCRRVHETDVADAPGPVFDHVGPYDSNVKDEIGDESRSSGPAQRKPADESRNPTTLPSDGKEYVECTLCDQNVPAASWRRHRRSHQGGRPHSCHVCGLGFADRGNLARHAAERHRRLRPHACALCGRTFARKSHLRDHAASHSAVRDHVCDVCGRASKSAAALRMHKKMHDDTLGFHCAFCDARFRRRSQLVAHASVHTGERAHACACGRTFRLRSQLYAHARTHDDGRPRGDGPGPDPRDDVAGPIDRPHPAEAAAPPPVPPRTCAYCRAECPEPQLKEHLISTHSELLFRCDECDSYVDRQDFVEHMSQHAVEYAADAEEARPAPPDPGGREGERAADGVENEDGEATGEERDGGGSVNGDEFSDRSDAEYFGELPESVFEAIEDSQDTQAADDGDRLSRADAATPAGPAAPLAGPPIDGRKTKKERACPVCSKVYTASSSYFYHVKHAHGAARQHECDVCGRRFGAKPTLAQHLLIHAAERPYACAQCGATFRARAALYVHEQTHGGRRTHACGECGRSFRWRAHLERHRRRHAGGREHACDACGRSFNVRADLLRHARTHAAAALPCPHCELRFAQPRYLRAHLNNKHAAEGGGVVK
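The Protein Sequence should be the conceptual translation of the Coding Sequence. The following Python sketch can be used to check that, assuming the standard genetic code applies to this gene, which this record does not state. The function name `translate` is hypothetical. Whitespace is stripped first because the coding sequence may be wrapped over several lines.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, amino acids listed in TCAG codon order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # remove line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the Coding Sequence field of this record.
print(translate("ATGAATATTGAGAGTGAAAAAATTCTTGTA"))
```

Comparing `translate(cds)` with the stored protein string is a cheap way to detect truncation or a frame shift introduced when the record was copied.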

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01217675;
90% Identity
iTF_00935010;
80% Identity
-