Basic Information

Gene Symbol
-
Assembly
GCA_963422695.1
Location
OY730484.1:47028331-47033142[-]
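
The Location field packs the sequence accession, the coordinate range, and the strand into a single string. A minimal sketch of splitting it apart, assuming the layout `accession:start-end[strand]` seen above and assuming the coordinates are 1-based and inclusive (the record does not state this); `Location` and `parse_location` are hypothetical names used only for illustration:

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str


# Layout assumed from the record: accession:start-end[strand]
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into accession, start, end, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["seqid"], int(match["start"]), int(match["end"]), match["strand"]
    )


loc = parse_location("OY730484.1:47028331-47033142[-]")
# Span length under the 1-based, inclusive assumption.
span = loc.end - loc.start + 1
print(loc, span)
```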

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The following pattern describes the zinc finger:
#-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C]
Here X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
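
The pattern in the description can be written as a regular expression for a quick scan of a protein sequence. The sketch below is only an approximation: it treats both X and the # positions as "any residue" (the description does not say which residues are allowed at #), and it is not a substitute for the profile HMM search reported under Hmmscan Out. The function name `find_c2h2_candidates` is hypothetical; the test fragment is copied from the protein sequence of this record.

```python
import re

# #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C]
# '#' and 'X' are both treated as any residue, so the stretch
# X3-#-X5-#-X2 between the second Cys and the first His is 12 residues.
C2H2_RE = re.compile(r"..C.{1,5}C.{12}H.{3,6}[HC]")


def find_c2h2_candidates(protein: str) -> list[tuple[int, str]]:
    """Return (0-based start, matched text) for non-overlapping matches."""
    return [(m.start(), m.group()) for m in C2H2_RE.finditer(protein.upper())]


# Short stretch taken from the protein sequence of this record.
fragment = "ALSHSKKRFKCSQCKKVYSSKYSLIAHSRIHTRERR"
hits = find_c2h2_candidates(fragment)
print(hits)
```

Because `finditer` reports non-overlapping matches only, a scan of tandem fingers with this expression can miss candidates that overlap an earlier match.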
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 52 4.5 3.2e+02 2.9 0.2 2 23 83 104 82 104 0.90
2 52 0.0033 0.24 12.8 1.8 2 23 125 147 124 147 0.95
3 52 1.7e-05 0.0013 20.0 1.6 1 23 166 188 166 188 0.98
4 52 2e-05 0.0015 19.8 4.6 1 23 194 216 194 216 0.99
5 52 0.00079 0.057 14.7 5.5 1 23 221 243 221 243 0.98
6 52 8.1e-06 0.00059 21.0 5.2 1 23 249 271 249 271 0.98
7 52 5.1e-06 0.00037 21.6 2.3 1 23 277 299 277 299 0.99
8 52 4.1e-05 0.003 18.8 5.2 1 23 306 328 306 328 0.99
9 52 1.4e-06 0.0001 23.4 2.1 1 23 334 356 334 356 0.99
10 52 5.2e-06 0.00038 21.6 0.4 1 23 362 384 362 384 0.93
11 52 1.9e-07 1.3e-05 26.2 3.2 1 23 390 412 390 412 0.99
12 52 0.00082 0.06 14.7 4.2 1 23 418 440 418 440 0.99
13 52 8.8e-07 6.4e-05 24.0 2.6 1 23 446 468 446 468 0.99
14 52 2.3e-05 0.0017 19.5 5.7 1 23 474 496 474 496 0.99
15 52 0.00019 0.014 16.7 6.3 1 23 502 524 502 524 0.98
16 52 1.9e-05 0.0014 19.8 2.9 1 23 530 552 530 552 0.99
17 52 2.9e-05 0.0021 19.2 1.6 1 23 558 580 558 580 0.98
18 52 3.4e-05 0.0025 19.0 1.9 1 23 586 608 586 608 0.99
19 52 8.3e-07 6e-05 24.1 4.5 1 23 614 636 614 636 0.99
20 52 2.4e-07 1.8e-05 25.8 1.5 1 23 642 664 642 664 0.99
21 52 0.00031 0.022 16.0 3.6 1 23 670 692 670 692 0.98
22 52 0.0013 0.097 14.0 5.3 1 23 698 720 698 720 0.99
23 52 0.00024 0.018 16.3 4.3 1 23 726 748 726 748 0.98
24 52 1.5e-05 0.0011 20.2 1.9 1 23 754 776 754 776 0.99
25 52 3e-06 0.00022 22.4 4.1 1 23 782 804 782 804 0.98
26 52 6.1e-05 0.0045 18.2 0.8 1 23 810 832 810 832 0.99
27 52 3.6e-06 0.00026 22.1 4.7 1 23 838 860 838 860 0.99
28 52 1.6e-06 0.00012 23.2 0.8 1 23 866 888 866 888 0.99
29 52 1.9e-07 1.3e-05 26.2 3.2 1 23 894 916 894 916 0.99
30 52 0.00051 0.037 15.3 2.2 1 23 922 944 922 944 0.99
31 52 0.0011 0.077 14.3 2.4 1 23 950 972 950 972 0.98
32 52 4.9e-05 0.0035 18.5 2.7 1 23 978 1000 978 1000 0.99
33 52 6.3e-07 4.6e-05 24.5 4.7 1 23 1006 1028 1006 1028 0.99
34 52 4.8e-06 0.00035 21.7 0.9 1 23 1034 1056 1034 1056 0.99
35 52 1.9e-07 1.3e-05 26.2 3.2 1 23 1062 1084 1062 1084 0.99
36 52 0.00051 0.037 15.3 2.2 1 23 1090 1112 1090 1112 0.99
37 52 0.00018 0.013 16.8 1.7 1 23 1118 1140 1118 1140 0.98
38 52 4.9e-05 0.0035 18.5 2.7 1 23 1146 1168 1146 1168 0.99
39 52 6.3e-07 4.6e-05 24.5 4.7 1 23 1174 1196 1174 1196 0.99
40 52 8.5e-08 6.2e-06 27.2 0.6 1 23 1202 1224 1202 1224 0.99
41 52 4e-07 2.9e-05 25.1 3.8 1 23 1230 1252 1230 1252 0.99
42 52 0.00039 0.028 15.7 2.2 1 23 1258 1280 1258 1280 0.99
43 52 0.0087 0.63 11.5 4.1 1 23 1286 1308 1286 1308 0.95
44 52 0.00031 0.023 16.0 4.4 1 23 1314 1336 1314 1336 0.99
45 52 0.00078 0.057 14.7 5.0 1 23 1342 1364 1342 1364 0.98
46 52 0.00035 0.026 15.8 4.8 1 23 1370 1392 1370 1392 0.99
47 52 1.5e-06 0.00011 23.3 3.9 1 23 1398 1420 1398 1420 0.99
48 52 5.3e-06 0.00038 21.6 0.7 1 23 1426 1448 1426 1448 0.98
49 52 1.1e-05 0.0008 20.6 1.0 1 23 1454 1476 1454 1476 0.98
50 52 8.3e-06 0.0006 21.0 2.1 1 23 1482 1504 1482 1504 0.99
51 52 0.00061 0.044 15.1 1.0 1 19 1511 1529 1511 1532 0.96
52 52 0.032 2.3 9.7 4.9 2 23 1560 1582 1559 1582 0.96
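
Each row of the table above has 13 whitespace-separated fields, so it can be read into a small record type and filtered. The following is a minimal sketch, assuming the column order given in the table header; the `DomainHit` class, the function names, and the i-Evalue cutoff of 0.01 are illustrative assumptions, not values taken from this record or from HMMER. The sample rows are copied from the table.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    index: int       # "#": domain number
    total: int       # "of": number of domains reported
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_row(line: str) -> DomainHit:
    """Parse one whitespace-separated row of the domain table."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


def filter_hits(hits: list[DomainHit], max_i_evalue: float) -> list[DomainHit]:
    """Keep hits whose independent E-value is at or below the cutoff."""
    return [h for h in hits if h.i_evalue <= max_i_evalue]


rows = """\
1 52 4.5 3.2e+02 2.9 0.2 2 23 83 104 82 104 0.90
2 52 0.0033 0.24 12.8 1.8 2 23 125 147 124 147 0.95
3 52 1.7e-05 0.0013 20.0 1.6 1 23 166 188 166 188 0.98
52 52 0.032 2.3 9.7 4.9 2 23 1560 1582 1559 1582 0.96
"""

hits = [parse_row(line) for line in rows.splitlines()]
# 0.01 is an arbitrary illustrative cutoff, not a recommended threshold.
kept = filter_hits(hits, max_i_evalue=0.01)
for h in kept:
    print(h.index, h.ali_from, h.ali_to, h.ali_to - h.ali_from + 1)
```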

Sequence Information

Coding Sequence
atgaaAACTAATTTCGCTAATAATTGTGATTTCTTAATTGATGGAGAAAACCAATCTGAATTTGCTTtgataaaatttccaattaaggAAGAATCTGAAGTAATTAATGAAGATGGAATTGAGactgaaaaaattaaacaagaacctgAATTAATAATAGAAGATGAAATTGAGACTGCAAAAATTcggaaatatttacaaaaatctgATATAACTCCAGAAGATCGATTGAATTTAATTTGGCATAGCTGTATAACATGTGAAGAACGTTTTTTAACTCCCGAAGATTTGTCGCTGCACATCGAAATTCATGAAAAAACTAAAGACGAGTCTTTCATAAAAGTTTTAAGACATTGCGACGCCCGTACACTGCACTGTGATCTCTGTGAAGCAAAATTTGTTAGTTCGAACTCATTGTTTAAACATCAACGCATTTTCCACCCAAACTTTTGTAAATCTAAAAAAGCAAAAGCACTTTCACATTCTAAAAAAAGATTCAAATGTTCACAATGTAAGAAAGTATACTCTTCCAAGTATAGTTTAATAGCTCATTCCCGAATCCATACAAGAGAACGACGTTTTAAATGTAAACACTGTAACAAGGTTTTTCGATCTCAAATAAAATTAGTTACACATCGTCGAGTCCATGCGGAAGAACAATTTAAATGTGGTACTTGTGGTCGTTCTTGTAGATCAAAGCAGCATCTGGAAATACATAATCGATCGCATACAGGAGAACGTCCTTTCAAATGTGAACATTGTAACAAATCTTTTCGATCTTCGAGTAACTTAGTTATACATCGTCATATTCATTCTaacgaaaaaccatttaaatgcaaatATTGTGATTATGCAACAAAACAGTCGGGTAATTTATCATCCCATATGAGAAAACATATTACTGGTGAgagaccattcaaatgtgaccgCTGCGATCGTTCCTTCACTACAAAGCAGTGTCTAGAAATACATTATAGAACACATACAGGAGAACGTCCTTTTCAATGTGAAGTATGTAACAAGTATTTTCGATCAAAATCTAACTTAGTTACACATCGTCGTATTCATAATGAGGAAGAGCAATTTCAATGTGATATCTGTGGTCGTTCCTTTACTACACAATACAGCCTAGAAATACATGATCGAATACATACAGGGGAACGTCCTTTTAAATGTGAAGAATGTAACAAATGTTTTCGATCACAATCTAACTTAGTTGCACATCGTCGTATACATAATGAAGCAGAGCAATTTCAATGTGATAGATGTGCTCGTTCCTTTAATACAAAGCAATGTTTAGAAATACATTATCGAACACATACAGGAGAACGTCCTTTTCAATGTTCTCATTGTGATAAACAATTTATCCGAAAAGCCAACCTTCGAATACATATTATGAGACATACCGGCGAGAAACCCTTCAAATGCGATATATGTGATCGTTCTTTTACGACAAAATATTGCCTAGAATTACATCATCGGATACATACAGGAGAACGTCCTTTTAAATGTGAAGAATGTAACAAATGCTTTCGAGCACAGTCTAGCTTAGTAACACATCGCCGTCATCATAATGAGGAAGAGAAATTTCAATGCGATAAGTGTGCTCGTTCCTTTACTACAAAGTACTACTTAGAAATACATTATCGAAAACATACAGGAGAACGTCCTTATAAATGTGAACAATGTAACAAATCTTTTCGATCTCAAACTGTTTTAATTACACATCGTCAGATCCATAATGAAGTAAAGAAATATAAATGCGATAGTTGTGCTCGTTCATTTACTACCAAGCAATCTTTAGAAATACATTATCGAACACATACAGGAGAACGTCCTTTTAAATGTGAAGTATGTAACAAGTGTTTTCGATCAAAATCTAACTTAGTTACACATCGTCGTATTCATAATGAGGAAGAGCAATTTCAATGTGATATCTGTGATCGTTCCTTTACTACACAATACAGCCTAGAAATACATTATCGAAGGCATACAGGAGA
ACGTCCTTTTAAATGTGAAGATTGTAACAAACGTTTTCTATCACAATCTAACTTACTTGCACATCGTCATAGGCATAATGAAGCAGAGCAATTTAAATGTGATAGGTGTGCTCGTTCCTTTAATACAAAGTACTGCCTGGAAATACATTATCGAAAACATACAGGAGAACGTCCTTATAAATGTGAACATTGTAACAAATCTTTTCGATCTCAAGCTGTCTTAGTTACACATCGTCATATCCATAATGAAATAAGGAAATATAAATGCGATTCTTGTGATCGTTCCTTTTCTACAAAATACAGGCTAGAAATACATTATCGAATACATACAGGGGAACGTCCATTTAAATGCGAAGAATGCAACAAATGTTTTCGAACACAATCTAGCTTAGTTACACATCGTCAGATCCATAATGAAGTAAGGGAATATGAATGCAATAGTTGTGCTCGTTCATTTGCTACCAAGCAATCTTTAGAAATACATTACCGAACACATACAGGAGAACGTCCTTTTAAATGTGAAGTATGTAACAAGTGTTTTCGATCAAAAACTAACTTAGTTACACATCGTCGTATTCATAATGAAGAAGAGCAATTTCAGTGTGATGTCTGTGATCGCTCCTATACTACACAATACAGCCTAGAAATACATTATCGAATACATACAGGAGAACGTCCTTTTAAATGTGAAGAATGTAACAAATGTTTTCGATCACAATCTAACTTAGTTGCACATCGTCGTATACATAATGAAGCAGAGCAATTTCAATGTGATAGATGTGCTCGTTCCTTTAATACAAAGCAACGTTTAGAAATACATTATCGAACACATACAGGAGAACGTCCTTTTAAATGTGAAGAATGTAACAGATCTTTTCGATCTCAAGCTATCTTAGCTACACACCGTCATACCCATAATGAAGTAAGGGAATATAAATGCGATAATTGTGCTCGTTCATTTACTACCAAGCAATCTTTTGAAATACATTACCGAACACATACAGGAGAACGTCCTTTTAAATGTGAAGAATGTAACAAGTGTTTTCGATCAAAATCTAACTTAGTTACACATCGTCGTATTCATAATGAAGAAGAGCAATTTCAGTGTGATGTCTGTGATCGCTCCTATACTACACAATTCAGCCTAGAAATACATTATCGAATACATACAGGAGAACGTCCTTTTAAATGTGAAGAATGTAACAAATGTTTTCGATCACAATCTAACTTAGTTGCACATCGTCGTATACATAATGAAGCAGAGCAATTTCAATGTGATAGATGTGCTCGTTCCTTTAATACAAAGCAACGTTTAGAAATACATTATCGAACACATACAGGAGAACGTCCTTTTAAATGTGAAGAATGTAACAGATCTTTTCGATCTCAAGCTGTCTTAGCTACACACCGTCATATCCATAATGAAGTAAGGGAATATAAATGCGATAATTGTGCTCGTTCATTTACTACCAAGCAATCTTTTGAAATACATTACCGAACACATACAGGAGAACGTCCTTTTAAATGTGAAGAATGTAACAAGTGTTTTCGATCAAAATCTAATTTAGTTACACATCGTCGTATTCATAATGAAGAAGAGCAATTTCAATGTGATATCTGTGGTCGATCTTTTAGTACACAATACGGCCTAGAAATACATTATCGAACACATACAGGGGAACGTCCTTTTAAATGCGAAGAATGTAACAAATGTTTTCGATCAAAATCTAACTTAGTTGTACATCGTCGTATCCATAATAAAGAAGAGAAATTTAAATGTGATAGGTGTTCTCGTTCGTTTGCTGCAAAATTCAACCTAAAAATTCATTCTCGGACACATACAGGAGAACGTCCTTTTAAATGTGAACAGTGTAACAAATCTTTCCTATCTCAATCTAACTTAGTTACACATTGTCGTCTACATAATGAAGTAAAGCAATTTAAATGTGATACCTGTGATCATTcctttattacaaaatataggCTAGAAATACATTATC
GAAAACATACAGGAGAACGTCCTTTTAAATGTGAACATTGTAACAAATCTTTTCGATCTCAAACTGTTTTAGTTGCACATAGTCATACCCATAAAGAAGTAAGGAAATATAAATGCGATAGGTGTGCTCGGTCTTTTAATACAAAGTACTGCTTAGAAATACATTATCGAACACATACTGGAGAACGTCCTTTTAAATGTGAAGAATGTTACAAATGTTTTCGATCAAAATCTAACTTAGTTGCACATCGTCGTATCCATAATGAAGAAGGGAAATTCCAATGTGATAGCTGTGATCGTTCGTTTGCTACAAAATACAACTTAGAAATTCATTCTCGGGTACATACAGGAGAACGTCCttttaaatgtgaaaaatgtAGCAAATCTTTTCGATCTATGTCGAATTTAGTAGTACATCGTATAGTTCATACCAGAAGGAAAGGATACAAATGCAAATATTGTGATTTCGCTACAAAACGACCAAGTAATTTAAGTAGTCATATAAAAAAGCATTTTATTGGAGAAAACCCGTTTAAATGTGACCGATGTGGTCGTTCGTATGCTACAAAACACCTTCTAGCAAGACATTCCGATAAATGTGTGGTTTCTGAACGTAGGTTCCCACGACGTGGTAATTTAAGGAGTCAACAGAAAATACATAATCCAATTCTGCGTGATCAATGTCATTTTTGTAATGAAAGTTTTACATGTAGCCAACAACTTAAACAACATATTCAAATATATCATAAGAAGCTAGTTTgcaaaattgaatatttgaaatcaaaTGTAAAAGGTATTCAtgatatttcataa
Protein Sequence
MKTNFANNCDFLIDGENQSEFALIKFPIKEESEVINEDGIETEKIKQEPELIIEDEIETAKIRKYLQKSDITPEDRLNLIWHSCITCEERFLTPEDLSLHIEIHEKTKDESFIKVLRHCDARTLHCDLCEAKFVSSNSLFKHQRIFHPNFCKSKKAKALSHSKKRFKCSQCKKVYSSKYSLIAHSRIHTRERRFKCKHCNKVFRSQIKLVTHRRVHAEEQFKCGTCGRSCRSKQHLEIHNRSHTGERPFKCEHCNKSFRSSSNLVIHRHIHSNEKPFKCKYCDYATKQSGNLSSHMRKHITGERPFKCDRCDRSFTTKQCLEIHYRTHTGERPFQCEVCNKYFRSKSNLVTHRRIHNEEEQFQCDICGRSFTTQYSLEIHDRIHTGERPFKCEECNKCFRSQSNLVAHRRIHNEAEQFQCDRCARSFNTKQCLEIHYRTHTGERPFQCSHCDKQFIRKANLRIHIMRHTGEKPFKCDICDRSFTTKYCLELHHRIHTGERPFKCEECNKCFRAQSSLVTHRRHHNEEEKFQCDKCARSFTTKYYLEIHYRKHTGERPYKCEQCNKSFRSQTVLITHRQIHNEVKKYKCDSCARSFTTKQSLEIHYRTHTGERPFKCEVCNKCFRSKSNLVTHRRIHNEEEQFQCDICDRSFTTQYSLEIHYRRHTGERPFKCEDCNKRFLSQSNLLAHRHRHNEAEQFKCDRCARSFNTKYCLEIHYRKHTGERPYKCEHCNKSFRSQAVLVTHRHIHNEIRKYKCDSCDRSFSTKYRLEIHYRIHTGERPFKCEECNKCFRTQSSLVTHRQIHNEVREYECNSCARSFATKQSLEIHYRTHTGERPFKCEVCNKCFRSKTNLVTHRRIHNEEEQFQCDVCDRSYTTQYSLEIHYRIHTGERPFKCEECNKCFRSQSNLVAHRRIHNEAEQFQCDRCARSFNTKQRLEIHYRTHTGERPFKCEECNRSFRSQAILATHRHTHNEVREYKCDNCARSFTTKQSFEIHYRTHTGERPFKCEECNKCFRSKSNLVTHRRIHNEEEQFQCDVCDRSYTTQFSLEIHYRIHTGERPFKCEECNKCFRSQSNLVAHRRIHNEAEQFQCDRCARSFNTKQRLEIHYRTHTGERPFKCEECNRSFRSQAVLATHRHIHNEVREYKCDNCARSFTTKQSFEIHYRTHTGERPFKCEECNKCFRSKSNLVTHRRIHNEEEQFQCDICGRSFSTQYGLEIHYRTHTGERPFKCEECNKCFRSKSNLVVHRRIHNKEEKFKCDRCSRSFAAKFNLKIHSRTHTGERPFKCEQCNKSFLSQSNLVTHCRLHNEVKQFKCDTCDHSFITKYRLEIHYRKHTGERPFKCEHCNKSFRSQTVLVAHSHTHKEVRKYKCDRCARSFNTKYCLEIHYRTHTGERPFKCEECYKCFRSKSNLVAHRRIHNEEGKFQCDSCDRSFATKYNLEIHSRVHTGERPFKCEKCSKSFRSMSNLVVHRIVHTRRKGYKCKYCDFATKRPSNLSSHIKKHFIGENPFKCDRCGRSYATKHLLARHSDKCVVSERRFPRRGNLRSQQKIHNPILRDQCHFCNESFTCSQQLKQHIQIYHKKLVCKIEYLKSNVKGIHDIS

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-