Basic Information

Gene Symbol
-
Assembly
GCA_949825065.1
Location
OX463868.1:1829600-1840692[+]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 34 5e-06 0.00039 21.7 0.7 1 23 160 182 160 182 0.96
2 34 0.0012 0.091 14.3 1.6 2 23 216 237 215 237 0.96
3 34 0.0015 0.12 13.9 1.1 3 23 245 265 243 265 0.97
4 34 0.0038 0.3 12.6 0.6 1 21 333 353 333 354 0.95
5 34 0.00025 0.019 16.4 0.3 1 23 514 536 514 536 0.96
6 34 6.5e-05 0.005 18.2 0.4 1 23 542 564 542 564 0.97
7 34 2.3 1.8e+02 3.9 0.8 1 19 570 588 570 590 0.93
8 34 1.1e-05 0.00082 20.7 0.1 1 23 647 669 647 669 0.97
9 34 0.0028 0.21 13.1 6.3 1 21 693 713 693 715 0.94
10 34 0.00031 0.024 16.1 1.1 1 23 721 743 721 743 0.98
11 34 0.00023 0.018 16.5 1.9 1 21 801 821 801 823 0.94
12 34 6.9 5.4e+02 2.4 0.2 7 23 844 860 844 860 0.96
13 34 0.0023 0.18 13.3 1.7 1 23 886 908 886 908 0.95
14 34 5e-06 0.00039 21.7 0.4 1 21 914 934 914 935 0.94
15 34 0.048 3.7 9.2 2.8 1 23 1103 1125 1103 1125 0.97
16 34 0.0046 0.36 12.4 1.7 1 23 1131 1153 1131 1153 0.97
17 34 1.8e-05 0.0014 20.0 0.2 1 23 1205 1227 1205 1227 0.98
18 34 2.8e-05 0.0022 19.4 1.9 1 23 1283 1305 1283 1305 0.95
19 34 9.1e-06 0.0007 20.9 1.0 1 23 1311 1333 1311 1333 0.98
20 34 1.7e-05 0.0013 20.0 0.4 1 23 1352 1374 1352 1374 0.98
21 34 5.3e-05 0.0041 18.5 0.2 2 23 1414 1435 1413 1435 0.97
22 34 0.00032 0.025 16.0 0.2 1 23 1463 1485 1463 1485 0.98
23 34 0.00023 0.018 16.5 1.3 1 23 1491 1513 1491 1513 0.97
24 34 3.1e-06 0.00024 22.3 2.1 1 23 1519 1541 1519 1541 0.99
25 34 4.4e-05 0.0034 18.7 0.7 1 23 1566 1588 1566 1588 0.98
26 34 6.8e-06 0.00053 21.3 1.3 1 23 1594 1616 1594 1616 0.97
27 34 5e-05 0.0039 18.6 3.4 1 23 1622 1644 1622 1644 0.97
28 34 0.0018 0.14 13.6 1.6 2 23 1653 1674 1652 1674 0.95
29 34 0.00013 0.01 17.3 0.4 1 23 1715 1737 1715 1737 0.97
30 34 1.3e-05 0.001 20.4 0.4 1 23 1743 1765 1743 1765 0.97
31 34 0.023 1.8 10.2 2.3 1 23 1771 1793 1771 1793 0.95
32 34 1.1e-05 0.00083 20.7 1.7 1 23 1799 1821 1799 1821 0.99
33 34 0.47 37 6.0 1.6 1 23 1827 1849 1827 1849 0.96
34 34 8.3e-05 0.0065 17.9 2.1 1 23 1855 1877 1855 1877 0.92

Sequence Information

Coding Sequence
ATGAGTTCTCAGGAGCGAGTCCAGGTCTCGGATTCAACTAAGCTAATACACGTAATTGTGAAAGACGAAAAGCATAAGGAGAAAGATGGTGCTGCAGTGGAGTACATCAtcgaaatacataaaataatcaaagacGACAGCTCCGTAGTCACCAGTACAGAGCCTGAAGAGCAGTATGCTCGTGAAGAAATcgAAGCACCTGATCCGAGTGCCGTTGAAGATGAAGAAATCGAACAAATATTCTTCGTCACCCGACTTACCGAATTTACCAAACTCGAAACGACAACTCCAGAAGTTCGGCTTGATGAAACGGTCCAACGTGGTAACGTGCAAGAAACGTTTCTGGACGGATCAAAGGCTGCTGGCGATGGAGGCCAATCTCAGAAGAGATGCGAGATTTGTGGAAAGACCTGCGGGGGAAAGTGCAGTTTAAAAGAGAGTGACAAGAGGAAACCTTCAACTCACGGGAATAAACCGTATCTATGTAAAGATTGTGGAAAGCGTTTTGTTCGGAAGTCCGAGTTAGGATTACACAGAAGAGTTCACGGTCGATTGAAGCCATATGAATGTGCTCAGTGTGGAAGTAGGTTTTTAGATAAAACCAGCTTTGATTACCATCTTTCTGTACGTAAAGGATCGACCCTCATTTGCGATATATGCGGaaagttttttgataaaatttgtctTCTAAATCGACATAAGAGAATTCATATTGGAACGAAACCATATGGGTGTGATCTATGTGGGAAGAGATTCATAAAGAAACACCAAGCCATGAACCACATGTTAATCCATAATGATGGGGTTAGCGGCGAGGCGTTTGATAATAGTTACGAGGTAGCCGAAAGCAAAATCCAatgtaaattatacaataagacatttttaagaaaaactcAATTGACGTGCCATAAGTCTCGAGTTCATGCCAAAAAGAAACAGTTTGATTGTGACGATGATTTGATTGCACATAGAATCGTTCATAGTTCAGGTAAACTGTTCACATGTGATGAATGCGACCAGAGTTTTGCGAACGAAAATAAGTTCGTCAGACATAAGAAGGACTGTGAAAAGGAACTCATTGAAAAGAGTCCTTATGCAAGATtggttgaaaagaaaaaaacgctTTTGTCTGTAGATAATTCAAGAGTTATAACGAATGAAAGTGTTCCTTCGAAAAAAGGTTCTGATTTTAGATTGTCTGAAGCAGAAACGACTTTTCTGTCCGGAGAAAGCTTGAATGAAATAACTAGGGAATACGTAGCTTCAAAAAAGGGACCTGATTTAAGATTGTACGGAACGGAAACAATTCTTTTACATAGAGAAAGCTTTGATGATGAATTAACGAGGGAATGTGAAGCTTCAAAAGACGGCGCTGATTCAAGATTGTCTGATGCGGAAATAAGTAGAAAATGTATAGCTTCAAAAAAGGAATCTGATTTAAGATCGGCTAAAACACGAACAGTTTTATCCGGGAAAAACGTTAAGTTAAAGAGGGAAGGTGTACGTGTAAATAGGGGAATTGAAAGTCTTTCGAAACATTTTGAATGTGACGAATGTGGTAATAAATACTCCAGCAAGGGTAACCTAGCCAAGCACAAGGTTATCCACAGTGGAATAAAACCTTTTGCCTGTAATGTATGCTGGAAATATTTTGCTAGGAAGGGTAATTTAAGGGAGCACTCGAGGATTCATGCTAAAGTGTCCCCGTTTAGTTGCGCCAAGTGTGAGAGACGGTTTTTATCTAGAAGTCTTTGGATGAAACATAAATATGGTAACTGCGATGAAAAACCTTGGGTAGAGTTAGAATCTGGGAATGGTGGTTCGAAAGAGCAAATAATACAACCAATCGTAGAAGAATCAGATCCTCTTTGGTTAGGAGATAAAGATGTAGACGTGGATGACCGAGAAGACAACATTGAAAGTAATGAAGGAACTAAATATCGTTGTGAAGAATGTGGAGAGACGTTTGTTGATTTAACTAATTTGGCACGACACAAAGAGATTCATAAACGTGGAAGATCTTTACGAAGAAGGAATCTGAGGGATCGTCTCGTGGCAAACAGCGATTTGAGGCTATTTTGTTGCGATCACTGTGAGAAAAGTTTCACGGTTAAGGATGAGTTGAAAATACATATGCTGTGTCATGTAATGGAGAAACCACATCGTTGTGAACAGTGTGACGAGATATTTCCAACGAGTAGCGTTTTAAAGAATCATACGAAGATTCACGGTAGTGTTAAGTGTTCGATGCGGAGAATACATTTAGAACAACGCCAATTGTTGCATTATAATAAGATTCACAATGATACTAAGTCACCTGTCTGTGATGATCGCTTTAAAGGGTTTGGTTTGGAAATGAAAAGGGACAAACGTAAAGTAGTCTGCACTAAGGAGAAACCATTTAAGTGCGAGCTATGTGGGATAGAATTCGTCACGAAATCGAATTTGAAAGTTcacgaaaaatgtcataaaaaagaCGAAATAACCCATTCTACGTTAAAGGATACACCGTTCAGTTATGATATACGTGGAAAGAAATATTCGACGGAGAAAACTTTATCGTCGCATCGAATAGTTCATAGCGGAGAACGAAAAAAGGTTCTTCGAAGTGTTGAAAAATCGATACGCGACGGTTTGAATAGCGAAGATAAGCCGTTTGCCTGTGATGTTTGCTTTAAGAGGTTTAAGTTGAAGAGTAATATGAACAAACATAAAGTTGTTCATAGTAAAGAGAGACCGTTTAAGTGTTCGCAGTGTGGGAAGGCATTCGCCAGGAAAGGGAATTTGAAACAACACGAAAAGATATATTCGAAaggcaaaataattaatttggagAGTCACGGAAGTGTAcatgtgaagaagaaaaattctaaaattgAAAAAGGTACGAGTAAATCAATGCAAAGTGATCAGCGAGGATGTGACATTGTTATCAATTATGAAAGTTCTTCGAACGTTGGGGTCGTTGAGCCTCAAAGTAGTTCTTTGAACGTAGGCGAAACTCCAGGGTATGGTACAAAGGATATGAATTTTGCGCAGGTGATTCGTATATCACAACATGCTGTCATTGATACGGAAAATACTTGCGTACTttctaacgaaataaataacgCAAATATATTGGTAACTTCTCCTATTTATAACCCGGAAGAACATAGCCACTCTGAAATTGATAACGCGGAAGAAATTCATTATTCTGACTTCGACAATGCAGAAGAAACGACACGCAATTCTTATATTGCGAACTCGAAAACTGATAATGCGGCGAGCCTCTTTGTTCATAATACACATACTGCGAAGAAACCCTTCTTCTGCGGAGAATGCGGCAAACAGTTCATGAAGAAACACGTTCTGTATATTCACAAAAAGATCCACACAGGTGAGAGACGTTTTCCTTGCGATAAGTGCAGCAGAAGTTTTCGAATCAAGCACGCCTTGGAAATGCACAGTATGATGCACGGCGGAGGGAAGCCGTATTGCTGCGAACAGTATGGGAAGAGGTTTTCGTACGGCGATCATATGAAAAAACGCACGAAAGTGCGTaggaatgattcttttatttccGATACTGAGAAACCCTTTGTTTCTAACATTAATATCGTGAATAAATCCTTCGTTTGCGAAGAATGCGGCAAACAGTTTGCGAAGAAATATGGTCTGATTAATCACGTAAAAATTCACAGTGCGGAGAAACTCTCTTTTACCAACGTTAATACCGAGGAAAAACCCTCTATTCGTAATACAAATACCGCAGATAAACCCGTCGTTGGTGAAGAACGCGCCAAACGTTTCGTTAAGAAACATTCTCTGAATGTCCACGAAAAGACCCGCAGCGCAGAGAGACTGTTTTTTTGCGAAAAGTGTGGGAAAAGCTTTCGCCGCAAGTATTCATTGGTATCGCACAATTTGATTCACACTGGAGAGAAGCCGTATATCTGTGCACAGTGTGGGCGGAGGTTCGCTCAAAAGAGTAATATGAATAAACACACAAAAGTGCATAGGGGTGAGGTTTAtaccacaaatattaataaCGCAGAGAAACCAAATGAATCGTTCGTTTGCGATGAATGCGGTCAACAGTTCACGAGGAAACGTGCTATTATCAATCACATGAACACTCACAGTgcgaaaaaactctttatttccaTTATAAATACGGTGGATAAACCATCTATTCCTAATACAAATACGATGGATAAACCATCTGTTCCTAATAAAAATACCGTGGATAAATCTATCGTTTGCGAAGAATGCGGCGAACAATTTGTGACGAAACACGCTTTAATCAATCACGTAAACACTCACGGTGTGAAGGAACTTTCTGTTTCCAATATTAATAACGGAGAGACACCTTTTAATACTGATATAAAAACAGCGGTTCGACCCTTCGTTTGTGGAGAATGCGGGAAACGTTTCGTTAAGCAACTAGCTTTGTATGCCCACAAAAAGATTCACAGTACTGCGAGACCTTTTTCTTGTGACAATTGCGGGAAAAGCTTTCGCCTCAAGCATGCATTGATGATTCACAACATGATTCACACTGGAGAGAAGCCGTATCGTTGCGTACAGTGCGAGAGGAGTTTCGTTCAAAAGAGTAACATGAAAAAACACATGAAAGTGCATAAAAATATCTCCGATATTGATAAAGGGAAGAAACTGTTGGCTATCGATGATCTAAATACCGTGGATAAATCCTTCATTTGCGAAGAATGCGGCAAACAGTTTGCGACGAAACACGTTCTGATTAATCACGTAAATACCCATAAACCGGAAAGACCTTTTTCTTGTGAGAAGTGCGGGAAAAGTTTTCGAACTAAGTATTCATTGTTAGTTCACAATATGATTCACACCGGAGAGAAGCCGTATTGTTGTGCACAGTGTGGAAAGAGGTTCATTCAAatgagtaatatgaagaaacaCATAAAGATGCATAAAGAAACGaaggaattaaaattaaattgtgaaGAATGCGGTAAGGTGTGCGGTTTGAAGTCAACTTTGACCAGACACAGACTGTATCATAGACGAGATAAAATGATTATGGCTTTTATTGCCGCCGACAAAGGAATCTCTTTTTCTACGGCTGATTGTAAGATGAACGATGGAACGATAGGTCCCAAAACGGAATCGACGTTCGATGGTTTCATGTGCGAGGAATGTGGtcgaaagtttaataaaataagcgGTCTCGAAAGACATAAACTCGTTCATAATGCTTCGAAAGTCTTTCCTTGCAGACATTGCGACGAAGTTTTCACGCGCCAATCAGACTTGATAGATCACGTGTCGATTCATATCGGCGATAAAGTGTTCCCTTGCGGGTATTGCGGGAAGAAATTCGTGAAGAAAAGCGTTTGTGAGAAACACGAAAAAATACACGCGAGAGACGAGCCGTATCAATGCGGACAGTGTACGAAAAAATTCTCATCCAGGACCAATTTGCGTATTCACGAAAGGGTGCACAGTGATTTGAAagcgtacgagtgtgtacagtGTGCGAAGAGATTCAGATGGAGGTCCAATTTGACGTGTCACATGGGGGTACACGCGAAAAATAAACGTTTCAAGTGTAATTTTTGCAAGAAGTCATTCTCGACCAAGGAATATTTAGCGAAACACGAGGTCGCGCATAGTGGCGCGAAGCTTCACGCCTGTGATCTTTgcGAGTCAGATTGTGGGGAATCTGAAGAGGAAAGAATCTTCTATAGTGAGGATGGAGAAGAAACAGTTTCAGAGGAGGATGCTTCAGGTTGGGAAGATGGATCGGATGAAGGGAACATTTCAAGAAATAGTCGCTGTCCAGGATCCCACACTCGGCAGCTGGTGAATAAGGCAGGGACCTCCGACACTCCTCTCAAGCTGCAAGGAAGAAACTCCCACTTTCCTTATAAAATCCTTGACAATGCTGGTGGAGAAAAATATGTAAGGCTAAAGCAAAGGCCTGAACAAGGAGTGTGTGTGGGAATGTAG
Protein Sequence
MSSQERVQVSDSTKLIHVIVKDEKHKEKDGAAVEYIIEIHKIIKDDSSVVTSTEPEEQYAREEIEAPDPSAVEDEEIEQIFFVTRLTEFTKLETTTPEVRLDETVQRGNVQETFLDGSKAAGDGGQSQKRCEICGKTCGGKCSLKESDKRKPSTHGNKPYLCKDCGKRFVRKSELGLHRRVHGRLKPYECAQCGSRFLDKTSFDYHLSVRKGSTLICDICGKFFDKICLLNRHKRIHIGTKPYGCDLCGKRFIKKHQAMNHMLIHNDGVSGEAFDNSYEVAESKIQCKLYNKTFLRKTQLTCHKSRVHAKKKQFDCDDDLIAHRIVHSSGKLFTCDECDQSFANENKFVRHKKDCEKELIEKSPYARLVEKKKTLLSVDNSRVITNESVPSKKGSDFRLSEAETTFLSGESLNEITREYVASKKGPDLRLYGTETILLHRESFDDELTRECEASKDGADSRLSDAEISRKCIASKKESDLRSAKTRTVLSGKNVKLKREGVRVNRGIESLSKHFECDECGNKYSSKGNLAKHKVIHSGIKPFACNVCWKYFARKGNLREHSRIHAKVSPFSCAKCERRFLSRSLWMKHKYGNCDEKPWVELESGNGGSKEQIIQPIVEESDPLWLGDKDVDVDDREDNIESNEGTKYRCEECGETFVDLTNLARHKEIHKRGRSLRRRNLRDRLVANSDLRLFCCDHCEKSFTVKDELKIHMLCHVMEKPHRCEQCDEIFPTSSVLKNHTKIHGSVKCSMRRIHLEQRQLLHYNKIHNDTKSPVCDDRFKGFGLEMKRDKRKVVCTKEKPFKCELCGIEFVTKSNLKVHEKCHKKDEITHSTLKDTPFSYDIRGKKYSTEKTLSSHRIVHSGERKKVLRSVEKSIRDGLNSEDKPFACDVCFKRFKLKSNMNKHKVVHSKERPFKCSQCGKAFARKGNLKQHEKIYSKGKIINLESHGSVHVKKKNSKIEKGTSKSMQSDQRGCDIVINYESSSNVGVVEPQSSSLNVGETPGYGTKDMNFAQVIRISQHAVIDTENTCVLSNEINNANILVTSPIYNPEEHSHSEIDNAEEIHYSDFDNAEETTRNSYIANSKTDNAASLFVHNTHTAKKPFFCGECGKQFMKKHVLYIHKKIHTGERRFPCDKCSRSFRIKHALEMHSMMHGGGKPYCCEQYGKRFSYGDHMKKRTKVRRNDSFISDTEKPFVSNINIVNKSFVCEECGKQFAKKYGLINHVKIHSAEKLSFTNVNTEEKPSIRNTNTADKPVVGEERAKRFVKKHSLNVHEKTRSAERLFFCEKCGKSFRRKYSLVSHNLIHTGEKPYICAQCGRRFAQKSNMNKHTKVHRGEVYTTNINNAEKPNESFVCDECGQQFTRKRAIINHMNTHSAKKLFISIINTVDKPSIPNTNTMDKPSVPNKNTVDKSIVCEECGEQFVTKHALINHVNTHGVKELSVSNINNGETPFNTDIKTAVRPFVCGECGKRFVKQLALYAHKKIHSTARPFSCDNCGKSFRLKHALMIHNMIHTGEKPYRCVQCERSFVQKSNMKKHMKVHKNISDIDKGKKLLAIDDLNTVDKSFICEECGKQFATKHVLINHVNTHKPERPFSCEKCGKSFRTKYSLLVHNMIHTGEKPYCCAQCGKRFIQMSNMKKHIKMHKETKELKLNCEECGKVCGLKSTLTRHRLYHRRDKMIMAFIAADKGISFSTADCKMNDGTIGPKTESTFDGFMCEECGRKFNKISGLERHKLVHNASKVFPCRHCDEVFTRQSDLIDHVSIHIGDKVFPCGYCGKKFVKKSVCEKHEKIHARDEPYQCGQCTKKFSSRTNLRIHERVHSDLKAYECVQCAKRFRWRSNLTCHMGVHAKNKRFKCNFCKKSFSTKEYLAKHEVAHSGAKLHACDLCESDCGESEEERIFYSEDGEETVSEEDASGWEDGSDEGNISRNSRCPGSHTRQLVNKAGTSDTPLKLQGRNSHFPYKILDNAGGEKYVRLKQRPEQGVCVGM

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-