Basic Information

Gene Symbol
-
Assembly
GCA_959613385.1
Location
OY390740.1:4333587-4344739[+]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. Its two conserved cysteines and two conserved histidines coordinate a zinc ion. The zinc finger is described by the following pattern: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and the numbers (including the ranges in parentheses) give the number of residues. The positions marked # are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this definition excludes other sites, among them the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
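The pattern in the description can be written as a regular expression. The following is a minimal sketch, not the method used to annotate this record (that was hmmscan against PF00096). It assumes that both `#` and `X` can be treated as "any residue", since the description does not say which residues `#` permits. The names `C2H2_PATTERN` and `looks_like_c2h2` are hypothetical.

```python
import re

# Assumption: '#' (fold-stabilising position) and 'X' are both matched as
# "any residue"; the description does not restrict the residues allowed at '#'.
C2H2_PATTERN = re.compile(
    r"."        # '#'
    r"."        # X
    r"C"        # first conserved cysteine
    r".{1,5}"   # X(1-5)
    r"C"        # second conserved cysteine
    r".{3}"     # X3
    r"."        # '#'
    r".{5}"     # X5
    r"."        # '#'
    r".{2}"     # X2
    r"H"        # first conserved histidine
    r".{3,6}"   # X(3-6)
    r"[HC]"     # final position: His or Cys
)


def looks_like_c2h2(segment: str) -> bool:
    """Return True if the whole segment fits the C2H2 pattern."""
    return C2H2_PATTERN.fullmatch(segment.upper()) is not None


# Residues 68-90 of this record's protein sequence (hmmscan domain 2).
print(looks_like_c2h2("HKCEICPKTFKKKNYLLQHMKIH"))
```

A plain pattern match of this kind is much less sensitive than a profile HMM, so it is best read as an illustration of the motif rather than a replacement for the hmmscan results.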
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 62 0.63 45 5.8 1.3 1 15 42 56 42 62 0.78
2 62 1.2e-05 0.00085 20.7 3.6 1 23 68 90 68 90 0.98
3 62 0.022 1.5 10.4 5.3 1 23 95 117 95 117 0.93
4 62 2.4e-06 0.00017 22.9 3.3 1 23 123 145 123 145 0.99
5 62 2.8e-06 0.0002 22.7 5.5 1 23 151 173 151 173 0.99
6 62 0.00015 0.011 17.2 7.6 1 23 179 201 179 201 0.99
7 62 0.00045 0.032 15.7 0.9 2 23 208 229 207 229 0.98
8 62 0.00011 0.0074 17.7 3.2 1 23 235 257 235 257 0.99
9 62 0.029 2 10.0 3.8 1 23 263 285 263 285 0.99
10 62 0.011 0.79 11.3 1.0 1 23 291 313 291 313 0.99
11 62 1.4e-05 0.001 20.4 1.7 1 23 319 341 319 341 0.99
12 62 0.0023 0.16 13.5 3.0 1 23 347 369 347 369 0.98
13 62 0.00029 0.021 16.3 1.9 1 23 375 397 375 397 0.98
14 62 5.1e-05 0.0036 18.7 6.0 1 23 403 425 403 425 0.99
15 62 5.6e-05 0.0039 18.6 1.3 1 23 431 453 431 453 0.98
16 62 6e-05 0.0042 18.5 4.7 1 23 459 481 459 481 0.99
17 62 5.4e-07 3.8e-05 24.9 1.8 1 23 487 509 487 509 0.99
18 62 2e-06 0.00014 23.1 2.1 1 23 515 537 515 537 0.98
19 62 1.1e-05 0.00075 20.8 2.4 1 23 543 565 543 565 0.99
20 62 0.067 4.7 8.9 5.9 1 23 571 593 571 593 0.98
21 62 0.0011 0.081 14.4 5.7 1 23 765 787 765 787 0.98
22 62 5.5e-05 0.0039 18.6 1.0 1 23 815 837 815 837 0.98
23 62 0.0016 0.11 14.0 6.9 1 23 845 867 845 867 0.98
24 62 0.44 31 6.3 0.5 1 23 873 895 873 895 0.98
25 62 1.5e-05 0.0011 20.4 2.7 1 23 901 923 901 923 0.99
26 62 0.00098 0.069 14.6 2.0 1 23 929 951 929 951 0.95
27 62 0.00096 0.068 14.7 0.7 1 23 957 979 957 979 0.98
28 62 0.00029 0.02 16.3 0.7 1 23 985 1007 985 1007 0.98
29 62 0.12 8.4 8.1 2.0 1 23 1013 1035 1013 1035 0.94
30 62 0.00095 0.067 14.7 4.2 1 23 1041 1063 1041 1063 0.98
31 62 0.01 0.73 11.4 0.3 1 19 1069 1087 1069 1091 0.79
32 62 4.9e-05 0.0035 18.8 0.5 1 23 1097 1119 1097 1119 0.98
33 62 9.3e-05 0.0066 17.9 3.2 1 23 1125 1147 1125 1147 0.98
34 62 2.2e-06 0.00016 23.0 0.8 1 23 1153 1175 1153 1175 0.99
35 62 0.0028 0.2 13.2 2.9 1 23 1181 1203 1181 1203 0.98
36 62 9.6e-05 0.0068 17.8 1.1 1 23 1209 1231 1209 1231 0.99
37 62 8.3e-07 5.9e-05 24.3 2.0 1 23 1252 1274 1252 1274 0.98
38 62 0.0082 0.58 11.8 4.1 2 23 1281 1302 1280 1302 0.96
39 62 2.6e-05 0.0019 19.6 3.6 1 23 1312 1334 1312 1334 0.99
40 62 0.0037 0.26 12.8 1.1 1 23 1340 1362 1340 1362 0.99
41 62 0.00012 0.0086 17.5 4.0 1 23 1368 1390 1368 1390 0.98
42 62 0.00028 0.02 16.3 0.5 1 23 1396 1418 1396 1418 0.97
43 62 0.00022 0.016 16.7 1.0 1 23 1424 1446 1424 1446 0.98
44 62 0.00039 0.027 15.9 3.1 1 23 1452 1474 1452 1474 0.98
45 62 0.00077 0.054 15.0 2.8 1 23 1480 1502 1480 1502 0.99
46 62 1.3e-07 9.2e-06 26.9 1.0 1 23 1508 1530 1508 1530 0.99
47 62 2.6e-05 0.0019 19.6 1.3 1 23 1536 1558 1536 1558 0.98
48 62 8.8e-08 6.2e-06 27.4 1.2 1 23 1564 1586 1564 1586 0.99
49 62 4.8e-06 0.00034 21.9 0.8 1 23 1592 1614 1592 1614 0.99
50 62 0.021 1.5 10.5 1.9 1 23 1620 1642 1620 1642 0.97
51 62 0.2 14 7.4 1.1 2 23 1714 1734 1714 1734 0.93
52 62 0.00018 0.013 17.0 1.2 1 23 1740 1762 1740 1762 0.99
53 62 0.0085 0.6 11.7 0.6 2 23 1765 1786 1764 1786 0.98
54 62 3.1e-05 0.0022 19.4 1.2 1 23 1792 1814 1792 1814 0.99
55 62 1.6e-06 0.00011 23.4 0.3 1 23 1820 1842 1820 1842 0.98
56 62 9.5e-07 6.7e-05 24.1 0.4 1 23 1848 1870 1848 1870 0.99
57 62 0.00018 0.013 17.0 3.7 1 23 1876 1898 1876 1898 0.99
58 62 2.1e-06 0.00015 23.1 0.4 1 23 1904 1926 1904 1926 0.98
59 62 6.6e-05 0.0047 18.3 0.7 1 23 1932 1954 1932 1954 0.98
60 62 0.0013 0.091 14.3 3.5 1 23 1960 1982 1960 1982 0.99
61 62 6.4e-07 4.5e-05 24.7 2.4 1 23 1988 2010 1988 2010 0.98
62 62 0.0031 0.22 13.1 5.2 1 23 2016 2038 2016 2038 0.99
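Each row of the table above has 13 whitespace-separated values, in the column order of the header. The following is a minimal sketch of reading such rows and keeping the domains below an i-Evalue cutoff. The field names, the `DomainHit` class, and the cutoff of 0.01 are illustrative assumptions and are not taken from this record.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    index: int       # '#'  : domain number
    total: int       # 'of' : total domains reported
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_row(line: str) -> DomainHit:
    """Parse one whitespace-separated row of the domain table."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 columns, got {len(f)}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


# First three rows of the table in this record.
ROWS = [
    "1 62 0.63 45 5.8 1.3 1 15 42 56 42 62 0.78",
    "2 62 1.2e-05 0.00085 20.7 3.6 1 23 68 90 68 90 0.98",
    "3 62 0.022 1.5 10.4 5.3 1 23 95 117 95 117 0.93",
]

MAX_I_EVALUE = 0.01  # assumed cutoff, for illustration only

hits = [parse_row(row) for row in ROWS]
confident = [h for h in hits if h.i_evalue <= MAX_I_EVALUE]
for h in confident:
    print(h.index, h.ali_from, h.ali_to, h.i_evalue)
```

Individual fingers in a long tandem array often score weakly on their own, so a per-domain cutoff like this one may discard fingers that are plausible in context.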

Sequence Information

Coding Sequence
ATGAATGGTTCATTGAGATCTTCTTGCAACATAGAACCACAAGATGCACCTGCAGATTTAAAGACTCGTATCGAAACGCACAGCGAGGGATGTCAACACTTTAAAACACACGACGAACAAAAATACAACTGTAAAATCTGCAACAGGGATTTCAAGTCTGAAAAAAATCAGCACGTGAAATTGCACGACGAGGAAAAAAATCACAAGTGCGAGATTTGCCCGAagacgtttaaaaaaaaaaattatttactgcaGCACATGAAAATTCACGGCGAGAGAAATCACAAGTGTAACGTTTGCGGAAAATGCTTCACGTCGAAGCATATTCTGGTCGGGCACGTGAAACTCCACGCCGATGAAggtaattacaaatgtgaaatttgctcgAAATGCTTTACGAGAAAGGGCACTCTTTTGCGACATTTGAACGTACACTcgagtgaaaaaaattacaaatgcagCGGTTGCACGAAGAGCTTCACGAGAAAAGAACACTTGTCGAGTCATATGAAGAAGCACACCGGTGAACGAaactacaagtgcgaaatttgcacgaaatgcTTTACGACGAAGCATTACATGTCGCAGCACATGAAGATACACACCGGCGAAGGAAACTCGAAATGCGAAGTGTGTGGTAAGTGGTTCTCGTCGAAGCAAAATATGTCGCAACACATGAAAATACATTCGGGtaacagaaattataaatgtgagaTTTGTACCAAATGTTTCCTGACGAAGGGCAATTTGTTCGAACACATGAAGATACACActggtgaaaaaaattacaagtgcaAGATTTGCAGTAAATGGTTTAGAACGAAGTATTACACTCTGCAGCACTTGAAAATGCACTCGGGCGAgggaaattacaaatgtgaaatatgtTCGAAGTGGTCCACCTCGAAGCAAAATATGTTGCAgcacatgaaaatacacaccggtgaaagaaactataaatgtgaaatttgcagcaaatgctTCATAAGACGGGGAGATCTCCTGCAACACGTGAAAATACATTCCgccgaaaaaaattacaagtgcgacGTCTGCACGAAACGCTTCACGACGAAACGAATCATGTCGAGGCACATGAAACTCCACACCGGCGAAGGCAACTACGAATGCGAAATTTGTTCCAAGTGCTACCCGACGAAATCCTATATGCTGCAACACGTCAAAATTCACACCGtcgagaaaaatttcaaatgcgacgtttgcacGAAGTGCTTCACGACGAAGCCGTACCTGCTGCACCACATGAAGATACACGCAACCGAAAGGAACCACAAGTGCGAAGTTTGCGCGAAAAGTTTCACGTCGAACGGGACTCTCCTGCGACACGTGAAaatacacaccggtgaaagaaattacaaatgcgacgtCTGCGCCAAATGCTTCACGACGAAGGGACACCTGACGGAACACACGAAGATACACACCGGCGAGAGAAATTACGAgtgcaaaatttgcaataaaagttCAACCACGAAAGGAAATCTGTTGAAGCACATGAAGACGCACAGCGGCGAAAAAAaccataaatgtgaaatttgcgacaagAGTTTAACGACGAAGGGAAACCTCTCGaaacacatgaaaatacacactggtgaaagaaattataagTGCGAGATTTGCACGAAGTGCTTCATACGAAAGGGGGATCTATTGCAGCATGCGAAAACGCACGTCGGCGAGAAACACCACAAATGCAAGTTTTGCGTTCGCGATTTCAAAACGAGGAAAATGTTATTGGAACATAAGAAGACGCATGTGGAAGGGAGCGGTGATGAACAGCACGATCGAAAATTGCCACAAGATGTACGAACCACGGATGAAACCTGCAAGATTAAGGAAGAGACTAATAGATTAATAAAGGATGAAAAATTGGAAACCGCCTTTGacAATATGAAAACCCATGACGTCTTGATCACTGAAGTGAAACCCGAACCAACCGAAATTATTTCCTTAAATCCAACTCAATTTTGCTCCGG
ATATTCCGATGctgttgaaattaattacaagatTGCAGGAAATACTAATACCATAAAGAAAGAGGAATGCGTAGAGGCTTCTGCCCTAACTGACATTGAAGTAAATGCAATTGGCGCGACACAATTACCTGATCTGGTTGATCTGTATGGTTTGTTGAACTGCAGCCCCGTAGAGACAAGCGAATTTAATTACACAACCGGCTTAAAGGAACCCATCGATCCACGCCGTGATTTGAACCGCGCGAGAATACACAGCGATGAAAGAATTAAATCACACACCGAAGATAAAATTcacaaatgcgagatttgctTCAAGATTTTTCAGAGAAAAGATCACGTGCTGAAACACATGAAAATGcacgaaagaaattacaaatgtaagtTATTCACAAGAAAGGAACGTCTGTTAAGTCACAAGAAAGTACACGTTGACGAAGGAAacttcaaatgtaaaatatgcgATAAACGGTTTGCATCGGAGCGAAATATGTTAGTGCACTTGAAACTTCACATACgtaatagtaatagaaaatACAAGTGCAAAACGTGCACGAAATgtttctcaacaaaatattacatgttGCAACACGCGAAAATGCACACCGGCGAAGGAAGCTACAAATGCGAAATATGCGCTAAATGGTCCACGTCAAAGCAAAACATGTTAGAACACACGAAAGTGCACACGAccgaaagaaactacaaatgcgaaatttgcacgaaGCGCTTCGCACGGAAAGGATATCTACTGCACCACTTGAAAATACACAGCAccgaaagaaatttcaaatgcgAATCGTGCGACAAGACCTTCATAAGAAAATGCGACCTGTCGCAACACGCGGTGatacacaccggcgaaagaaactACGAATGCGAAATTTGCTCGAAACGCTTCACCACGAACCAAATTCTGCTGCAGCACCTGAAAATACACAACGCCGAGAGAAACTACGAATGCGAAACGTGCGCCAAACGCTTCACAAGAAAGTTCGACCTGCTGCAACACGCGATGatacacaccggcgaaagaaaccacaaatgcgaaatttgcgccaAGCGTTTCACGACGAAGAAAATCATGCTGCAGCATGCGAAAGTACACAACGCCGAAATCACCCACAAGTGCGAGATGTGCGCCAAATCGTTCGTGAGAAAGTGCGATTTGTTGAAACACGTGAAGATACACACCGGCGAGAGAAACCACgagtgcgaaatttgcggcaAACGTTTCACGACGAAAGCAATTATGCTGAGTCACGCGGAGCTGCACTTGGACGAAAGGACGcacgaatgcgagatctgcgcTAAACGTTTCATAACGAAAGCCGATCTGCTCAAGCACGTGAGGATACACACggttgaaaaaaatcacaaatgtgaaatttgcggcaAGTATTTCACGACGAAGAATTATCTGCTGCTGCACACGAAAATACACGCGACCGAAAGAAActataaatgcgaaatttgcgcgaAAAGCTTCAAAACAAACGGGACGCTGCAGAGACACGTGAAAATACACGTCGGCGaaggaaactacaaatgcgcCACTTGCACTAAATGCTTTTTGAGAAAAGGAGATTTGTTGAAACATATTAAAGGGCAcggtgttgaaaaaaattacaaatgcgaaatttgcggcaGAGGTTtcacgacgaaaaatattttattgaaacacttGAAGAAACATACGGGCGAAAGGAAAGATATTTTATCGAGACACGCGAAGATACACACCAGCGAGAGAAATTTTAAATGCGTCATTTGCGGCAAACGCTTCATAAGGAAAGAACATCTGTCGAATCATATGAACGTACACACCGGAGAAGGTAACTGCAAGTGCGGAACGTGCAACAAATGGTTCGcatcaaaacaaaatatgacCCAACACATGAAAACGCACGAAACCACCAAAAAACGCGCCGATAATTACAGATGTAAAATCTGCGTTAAGTCTTTCACTACAAAGTATTATATGTCGCAACACATGAAAATGC
ACGCCGGCGACGAaaactacaaatgcgaaatATGCGCCAAAAAGTCAACCTCGAACCAGTACATGTTGCAACACATGAAGATACACGACGGCGAAAGGAACCACAAATGCGAACTTTGCAGCAAGTCGTTCAAACGAAAAGGAGATCTGTCgcaacacaaaaaaatacacACCGGCGAACGGAACcacaaatgcgagatttgcgCGAAACGCTTCGTACGTAACGGCGATCTGCTGCAACACGCGAACatacacaccggtgaaagaaacTTCGAATGCGAAACTTGCGCCAAACGCTTCGCGACCAAAAAAAACCTGCTGCTGCACACGAAAGTGCACACCGGCGGAggaaatcacaaatgtgaaatttgcagcaaacgCTACCCCTCGAAGTATTACTTGTCGCAACACGTGAATACGCACGCCGACGAGAAACGCTACAAATGCGAAACTTGCGCAAAGCGCTTCGCGACAAAACAATATCTGCTGCACCACGTGAAGATACACACgaacgaaagaaattacaaatgcgaaatctgcGGTAAGAGTTTCACGATGAAAGGGAGTTTGACGAAACACGTGAGAatgcacaccggcgaaagaaactACGAGTGTGATATTTGCGGCAAGCGCTTCACCACGAAAGGGCATCAGATGGAACACGTGAAGatacacaccggcgaaagaaattacaaatgcaaaatttgcggTAAAAGTTCCACGACGAAGGTGAATCTGTTGAGGCACATGAGAGTGCACACGGGCGAAAggaattacaaatgtgaaatctGTAGTAAAAGATCTGCAACGAAAGGAAATCTGATGAAACATATGAAAGTACACGCCGGTGAAACGAATTATAACTGCGATATTTGCCgacgatatttcaaaaatactaaaCTGTTGTTGGAACACAAGGACACGCACAAGGGCAGCGAGGAAATTGATCGAAAATTTACGTTACTAATAGACGAAAGTAAGATTGGAAAACCGTTGATTAATGAACCATGTAATACgttaataaaagaagaaaagttGGAAATCATCGATGAAAATGTGAAAGAAGAGAAATGTGTAATTGCTGGTTTGACTCAAAGTGTTTCTTGTAACGATACCAAAACCACTGGTGAGAAATTGcggaaatgtgaaatttgtgcGAGGAGTTTTCCAACGAAAGATTACAAGCAACACTTGAAAACGCATGTCGATGAGAAAAActataaatgcgaaatttgcgccaAAGGCTTCGCCTTTAAGAAGAAATTACTGAGACACGTGAAAACGCACACGAACAAATGCGAAATATGCCGGCGGAACTTCGAAAACGAGACGGCCCTGTCGGAACACGAAAAGACGCACATCGGTGGTAAAatctacaaatgtgaaatttgctccAAGTTCATCTCGTCGAAACCGAACTTGTCCAAACACATGCAaacgcacaccggcgagaaGAACTTCACTTGCGGAATTTGCGCCCGGAGCTTCACGAGGAAATCAAATCTGTTGGCGCACATTATCACGCACACCGGCAAGAACAACTACAAGTGCGAGATCTGCGAACAGTCGTTTGCGACAAAGGCGAGCCTTTTGCGACACATGAAAATACACGCGGCCGAAATAATTTACAGGTGCGATATTTGCAACAGGCACTTCGCGAAGAGGAAATCGCTGTGGGAGCATAAGAGAACGCACGCCGGCGCCAGGAACTACAAATGCGAAATATGCGGCGCGTCCTTCGTGAAGAAGGAAACCCTGACGAAGCACATGAGCATTCACACCGGGGAAAGGAAGTACGTatgtgaaatttgcaccaaATGTTTCAGAACTGGAGCCAATCTGTGGTCGCACGCTAAAATACACACCGgcgagaaaaaattcaaatgcgaTATTTGTTCGAAACGCTTCTTGAGGAAACCTCACTTGATGCAGCACGTGAAAACACACGCCGACGAACGAAATTACAAGTGCGACGTTTGCAATAAATGCTTCAggacaaat
tcaaatttattggaacacaTGGACGTGCACAGCGAGGAAAGAAAATACCAATGCAAACTTTGCAATCGAcgttacaaaaacaaaaaaatgttgtatacaCATAATAAAACACATACTAAAGAAAATAACGAGGAAGGGGCTGTTAAGTTCATAAAGGAGGAACAGTTGGAAATCGACAATGacatGGATTTTAATCTCTCGACGAATTCAATCAGAAATTATCTTTGCGGCGCACCAACTGAAGTGAAATCGGAACCAACGGAACTTTGGCATCCGTCGCAAAGATTGAAATTAGGAGACGGATTCGAATAA
Protein Sequence
MNGSLRSSCNIEPQDAPADLKTRIETHSEGCQHFKTHDEQKYNCKICNRDFKSEKNQHVKLHDEEKNHKCEICPKTFKKKNYLLQHMKIHGERNHKCNVCGKCFTSKHILVGHVKLHADEGNYKCEICSKCFTRKGTLLRHLNVHSSEKNYKCSGCTKSFTRKEHLSSHMKKHTGERNYKCEICTKCFTTKHYMSQHMKIHTGEGNSKCEVCGKWFSSKQNMSQHMKIHSGNRNYKCEICTKCFLTKGNLFEHMKIHTGEKNYKCKICSKWFRTKYYTLQHLKMHSGEGNYKCEICSKWSTSKQNMLQHMKIHTGERNYKCEICSKCFIRRGDLLQHVKIHSAEKNYKCDVCTKRFTTKRIMSRHMKLHTGEGNYECEICSKCYPTKSYMLQHVKIHTVEKNFKCDVCTKCFTTKPYLLHHMKIHATERNHKCEVCAKSFTSNGTLLRHVKIHTGERNYKCDVCAKCFTTKGHLTEHTKIHTGERNYECKICNKSSTTKGNLLKHMKTHSGEKNHKCEICDKSLTTKGNLSKHMKIHTGERNYKCEICTKCFIRKGDLLQHAKTHVGEKHHKCKFCVRDFKTRKMLLEHKKTHVEGSGDEQHDRKLPQDVRTTDETCKIKEETNRLIKDEKLETAFDNMKTHDVLITEVKPEPTEIISLNPTQFCSGYSDAVEINYKIAGNTNTIKKEECVEASALTDIEVNAIGATQLPDLVDLYGLLNCSPVETSEFNYTTGLKEPIDPRRDLNRARIHSDERIKSHTEDKIHKCEICFKIFQRKDHVLKHMKMHERNYKCKLFTRKERLLSHKKVHVDEGNFKCKICDKRFASERNMLVHLKLHIRNSNRKYKCKTCTKCFSTKYYMLQHAKMHTGEGSYKCEICAKWSTSKQNMLEHTKVHTTERNYKCEICTKRFARKGYLLHHLKIHSTERNFKCESCDKTFIRKCDLSQHAVIHTGERNYECEICSKRFTTNQILLQHLKIHNAERNYECETCAKRFTRKFDLLQHAMIHTGERNHKCEICAKRFTTKKIMLQHAKVHNAEITHKCEMCAKSFVRKCDLLKHVKIHTGERNHECEICGKRFTTKAIMLSHAELHLDERTHECEICAKRFITKADLLKHVRIHTVEKNHKCEICGKYFTTKNYLLLHTKIHATERNYKCEICAKSFKTNGTLQRHVKIHVGEGNYKCATCTKCFLRKGDLLKHIKGHGVEKNYKCEICGRGFTTKNILLKHLKKHTGERKDILSRHAKIHTSERNFKCVICGKRFIRKEHLSNHMNVHTGEGNCKCGTCNKWFASKQNMTQHMKTHETTKKRADNYRCKICVKSFTTKYYMSQHMKMHAGDENYKCEICAKKSTSNQYMLQHMKIHDGERNHKCELCSKSFKRKGDLSQHKKIHTGERNHKCEICAKRFVRNGDLLQHANIHTGERNFECETCAKRFATKKNLLLHTKVHTGGGNHKCEICSKRYPSKYYLSQHVNTHADEKRYKCETCAKRFATKQYLLHHVKIHTNERNYKCEICGKSFTMKGSLTKHVRMHTGERNYECDICGKRFTTKGHQMEHVKIHTGERNYKCKICGKSSTTKVNLLRHMRVHTGERNYKCEICSKRSATKGNLMKHMKVHAGETNYNCDICRRYFKNTKLLLEHKDTHKGSEEIDRKFTLLIDESKIGKPLINEPCNTLIKEEKLEIIDENVKEEKCVIAGLTQSVSCNDTKTTGEKLRKCEICARSFPTKDYKQHLKTHVDEKNYKCEICAKGFAFKKKLLRHVKTHTNKCEICRRNFENETALSEHEKTHIGGKIYKCEICSKFISSKPNLSKHMQTHTGEKNFTCGICARSFTRKSNLLAHIITHTGKNNYKCEICEQSFATKASLLRHMKIHAAEIIYRCDICNRHFAKRKSLWEHKRTHAGARNYKCEICGASFVKKETLTKHMSIHTGERKYVCEICTKCFRTGANLWSHAKIHTGEKKFKCDICSKRFLRKPHLMQHVKTHADERNYKCDVCNKCFRTN
SNLLEHMDVHSEERKYQCKLCNRRYKNKKMLYTHNKTHTKENNEEGAVKFIKEEQLEIDNDMDFNLSTNSIRNYLCGAPTEVKSEPTELWHPSQRLKLGDGFE
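The coding sequence and protein sequence above can be checked against each other by translation. The following is a minimal sketch that assumes the standard genetic code and treats lower-case (presumably soft-masked) bases the same as upper-case ones. The function name `translate` is hypothetical, and the two fragments used are the first 21 and last 12 nucleotides of the coding sequence in this record. To check the full record, the wrapped sequence lines would first have to be joined into one string.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(zip(("".join(p) for p in product(BASES, repeat=3)), AMINO))


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()  # lower-case bases are translated like upper-case ones
    protein = []
    for i in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for non-ACGT codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Start and end of the coding sequence in this record.
print(translate("ATGAATGGTTCATTGAGATCT"))
print(translate("GGATTCGAATAA"))
```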

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-