Basic Information

Gene Symbol
-
Assembly
GCA_901482245.1
Location
CABEEJ010000073.1:212007-230986[+]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 52 0.0084 2 11.3 0.6 2 23 9 30 8 30 0.93
2 52 1.2e-07 2.9e-05 26.5 1.1 1 23 36 58 36 58 0.98
3 52 0.13 31 7.5 1.1 1 23 68 90 68 90 0.96
4 52 7.7e-06 0.0018 20.9 0.7 1 23 96 118 96 118 0.99
5 52 2.2e-07 5.2e-05 25.7 1.3 1 23 124 146 124 146 0.99
6 52 2.9e-05 0.0068 19.1 0.8 1 23 152 174 152 174 0.98
7 52 0.0044 1.1 12.2 0.4 2 23 244 266 243 266 0.95
8 52 0.11 27 7.7 2.2 1 21 271 291 271 293 0.93
9 52 0.014 3.4 10.6 0.4 1 23 299 321 299 321 0.98
10 52 0.032 7.5 9.5 0.3 1 23 327 349 327 349 0.95
11 52 5.3e-07 0.00013 24.5 1.7 1 23 355 377 355 377 0.98
12 52 0.14 34 7.4 1.6 1 23 387 409 387 409 0.94
13 52 6.9e-06 0.0016 21.0 0.1 1 23 415 437 415 437 0.98
14 52 1e-07 2.4e-05 26.7 1.3 1 23 443 465 443 465 0.99
15 52 8.8e-05 0.021 17.5 3.0 1 23 471 493 471 493 0.99
16 52 3.4 8e+02 3.1 0.2 1 13 499 511 499 512 0.90
17 52 0.011 2.6 10.9 0.9 1 23 572 595 572 595 0.89
18 52 0.0023 0.56 13.0 7.0 1 23 601 623 601 623 0.97
19 52 0.005 1.2 12.0 0.2 1 23 633 655 633 655 0.96
20 52 0.016 3.9 10.4 0.0 2 23 662 683 661 683 0.97
21 52 2.6e-07 6.2e-05 25.5 1.4 1 23 689 711 689 711 0.98
22 52 6.7e-06 0.0016 21.0 1.4 1 23 717 739 717 739 0.99
23 52 2 4.8e+02 3.8 0.6 1 13 745 757 745 758 0.90
24 52 0.0006 0.14 14.9 0.2 2 23 766 787 765 787 0.97
25 52 0.0011 0.26 14.1 1.4 2 23 794 815 793 815 0.97
26 52 0.21 49 6.9 3.2 1 23 821 843 821 843 0.98
27 52 0.089 21 8.1 1.7 2 23 850 871 850 871 0.97
28 52 0.00051 0.12 15.1 3.3 1 23 877 899 877 899 0.98
29 52 0.055 13 8.7 4.7 1 23 910 932 910 932 0.96
30 52 0.00053 0.13 15.1 0.1 2 23 939 960 938 960 0.97
31 52 7.2e-06 0.0017 20.9 0.5 1 23 966 988 966 988 0.98
32 52 4.1e-05 0.0097 18.6 4.9 1 23 994 1016 994 1016 0.98
33 52 0.00022 0.053 16.2 4.6 1 23 1022 1044 1022 1044 0.97
34 52 4.4e-05 0.01 18.5 0.5 2 23 1090 1111 1089 1111 0.97
35 52 0.00089 0.21 14.4 4.9 1 23 1116 1138 1116 1138 0.97
36 52 1.7 3.9e+02 4.1 5.2 1 23 1144 1166 1144 1166 0.98
37 52 0.05 12 8.9 1.5 2 23 1173 1194 1172 1194 0.97
38 52 0.00074 0.18 14.6 5.0 1 23 1200 1222 1200 1222 0.98
39 52 0.75 1.8e+02 5.2 6.4 1 23 1233 1255 1233 1255 0.97
40 52 0.001 0.25 14.2 0.1 1 23 1261 1283 1261 1283 0.98
41 52 1.8e-06 0.00042 22.9 0.3 1 23 1289 1311 1289 1311 0.98
42 52 4.2e-06 0.001 21.7 0.6 1 23 1317 1339 1317 1339 0.98
43 52 0.00045 0.11 15.3 4.5 2 20 1392 1410 1391 1413 0.94
44 52 0.017 3.9 10.4 4.6 1 23 1419 1441 1419 1441 0.98
45 52 0.0055 1.3 11.9 2.7 1 23 1447 1469 1447 1469 0.99
46 52 0.0046 1.1 12.1 1.0 2 23 1476 1497 1475 1497 0.95
47 52 0.00043 0.1 15.4 2.3 1 23 1503 1525 1503 1525 0.96
48 52 0.22 52 6.8 6.6 1 23 1536 1558 1536 1558 0.96
49 52 0.00046 0.11 15.3 0.0 1 23 1564 1586 1564 1586 0.98
50 52 2.3e-05 0.0055 19.3 0.1 1 23 1592 1614 1592 1614 0.97
51 52 2.1e-06 0.00051 22.6 2.1 3 23 1622 1642 1620 1642 0.97
52 52 5e-06 0.0012 21.4 1.2 1 23 1648 1670 1648 1670 0.98

Sequence Information

Coding Sequence
atgcattataaTGTCTACAAGCTGTATTGTGGCACCTGCTCGAAAGGTTTCAATTCCGAGAAACTTCTAAAGAAACACGAAGAATTACATTCCGGTATAAAGCCGTTTCAATGCGAAGAGTGCGGTAAACCGTTTGCTTCAAAGTATAACCTGCAGAAACATAAAAGGCTTCACGATCCAGTGTTTAGAAACAATCCAAAATACTATTGCGAGTTTTGCGGTAAAGGCTTCTTGTACACGACTGTACTAAACAAACACAGAAGCATCCATATGGGCGAGAACAAGTTTCAGTGCGACCTGTGCGGAAAAACGCTCAACTCCAAGGCGTACCTAAGGACGCACTTACGCATACACAGTGGTGAGAAGCCTTACGTTTGTGACGTGTGCGGCAAGGCTTTTGTGACGAACAAGCACTTGACGATCCACCGTCGCACACATACCGGCGAGAAGCTATATGTGTGTAATGTATGTAACAAGTCTTACACGCAGGTGGGTTACTTGACCATCCACAAGCGCTCGCATACGGGCGAGCGACCGTACCACTGCGAGTCGTGCGGCAAGGACTTTGCTTCCAAGTGTATTTCAAGACAGAAACCGAAATTTGTAAGGATCGCTCCGAAGCCAGCTCCAAGACCAGCTCCAAGAGCAGCTCCACGGCCGGCTCCAAGACTGAATCAAGTTATCACTCTTCTCAAACCCAAGCAACCAGAACCTAAAAGGGACCTCTGGAAGTGCACGCCGTGCGGTCAGAGCTTTTCCCTCCTCTCGGAGTTTAAAAGCCACATAATGTCGCAGCACCGACGAAACTCGCACCCGTGCGATATGTGCGACCTCAAGTTCATATCCCCGGTACGACTCGCCAAACATAGGAAATGTCACGGGGTGATGATAAGGTACTCGTGCGATATATGCGGCTATTCCGGGCAGAGCAAGACGTTGGTGGAAACGCACCGGAAGATGCATTACAACGAGTACAAGTATTACTGCAGTATCTGTTCTCGAGGTTTTAATGTGGTGAAACTCTTGAAGGAACATGAGCTGTTGCATACCGGAATCAAACCGTACCAGTGTGAGGAGTGCGGGAGACAATTCACCTCAAAGTACAATCTCGCCAAGCATAAGCGGCTTCACGAGCCGGGCTACAAAGACAATCCAAAGTTCTTTTGCGAGTTTTGCGGTCGGTCGTTCTTGTACAGCACGGTCTTGAACAAGCACCGCAGTGGTCACCTGGGTGAGAACCGGTTCGAGTGCGACGTTTGCGGTAAGGTTGTCAACTCCAAGGAGTACCTGAAGGTCCACCTTCGTATCCACAACGGTGACAAACCCTACGTGTGTGACATCTGCGGCAAGGCGTTTGTGACCAACAAGCACCTGACCGTTCACAGACGCACGCATACGGGGGAGAAACTTTACACGTGTAACGTATGTCACAAGTCATACACGCAGGTGGGGTACCTCACCATCCACAAGCGTTCACACACGGGAGAGCGCCCGTATCATTGTGTCGTTTGCGGTAGGGACTTTGCCTCCAAGTTGGTAAACTTGGTTACGGGTAGCGCGCAGGCCAACCCGGCCTTGATGCCAAACGTAGCTTCAGTTGTAAAACAGAACATTGCTCGCGATAAGACCCCCGAGAAGGTTTCGGTGAAACGGTCTTTATTCGGCGACGGGATTCAATGGAGGTGCAAGAAAGCTCATTTCTCTAAAGTGAAGCATGTTTGCAGTACTTGTTCTAAAGAGTTTAAAACCGAGCAGTCACTGATCGATCACAACTTGTTAGCACACACCGGGATCAATCCGTTTAAATGCGATCACTGCGGGAAAAGGTAcaagtcaaaatattatttaaaattccaccAGAAATTACACGAGCCAGGTTTCAAGGACAACCCAAAGTATTACTGTGAGATTTGCGGGAAAGGTTTCTTGTACATATCGGCGTTGAATAATCACCGAGAGAACCATCTGGGCAGCAACAAGGTCGTGTGCGCCATTTGTGATGTGCCGCTTGCGTCGAAAAGATACATGACGGTCCACATGCGAATACACAGCGGGGAGAAACCGTTTGTTTGTGACGTTTGTGGTAAGGCATTTGTCACCAACAAGCATTTAACGGTTCACCGACGCACGCATACGGGGGAGAAACTGTACACGTGTAACGTGTGTAACAAATCATTCACGCAAGTGGGTTATTTGACCATTCACAAGCGCTCTCATACCGGCGAGCGCCCGTATCGATGTGAGACGTGCCGTAAAGGCTTTGCTTCCAGGCAAGTCCCCTCAGACTCGACCATAAGATGTGAAATATGCGGTAAAGAATACAAGCTCGATATCAATCTGAGCCGCCACATGGCAAAGCACAACCCGGATAACCAGATTAAATGTGATGTGTGTGACAAAACATGTCCGTCTGGGTTTCAACTGAAGCAACACTTGAAAATCCACGAGACGGCGAAACTATACTCGTGCGATATGTGTTCATATGCGAGCCACGTACAAGTGAACCTGCACGCTCATAGAAAGAGGCACTTACACGATTACAAGGCGAGGTGTGACGTGTGTAACAGAGGTTTCTACTCCATGTGTTCGTACAAGTTACACCAGGCCGTCCATAAAGACGAGATGCCCTTCAGTTGCGACGTCTGCGGTAAGCTGTACAAGTTCAAACACTACCTAACCGACCATCTTCAGACGCACAAACCAGACTATGAGGAAACCGTCTTAAAGTTCAAGTGCCGATATTGCGGCAAGGGTTTCCGCTACCGGGGCAAGTGCGAGGTCCACGAGAAGCGGCATCAGGGACACAAAGAAGTGATGTGCGACATATGCGGCAAAGTGCTGAGCACGGTCCGCGAGCTGGGCCTCCACATGCGCCGTCATACCGGGGAGAAACCGTACGTCTGCAGCTATTGCGGAAAGTTGTTCAGCCGACTGAACTCGTTGACTCTCCACGTACGTGTCCATACGGGCGAGCGCCCGTACACTTGCGACGAATGTGGGAAGTCGTTCACGCAGCACTCGAGCTATGCTATCCACAAGCGCCATCATACCGGTGAGAGACCTTACTGTTGTTTGATCTGTCAGAAAACCTTCGTGTCCAAGTCTTTGATGAAGTACCACATGAAGTCTCACGAAGAACACAAgtCCCCAAGTAGTTTCTTTGTCGAACAATCCGAGGAAAAATCTAAAAGCTCCAAGGTTAACAGGAAAGCTTCAAACTCCGTTTCTGGACCTGTTAAGAGGAAGGTATCTAACAAAAAGTCAATCAAATGTAAAATTTGCGGAAATGAGTACAAGCGCAATAGCACCCTGCAGAATCACCTGCTTGTCCACGGCCAGGATTCCCACGAGTGCGACCTTTGTAACAAACCGTTCAAGTCCGAGTGGCATCTGAAACAGCACAAGAAATACCACAGCAAAGCGAGACAGTACAAATGTGACATGTGTGGATACGAGACGTGCATCAAGGGGAACCTCCTCTGTCACAAAAGGAAACACTTGAAACATTACATTGTCAAGTGTGACGCGTGTAATAAAGGCTTCTATAGGACATGTGATTACCAAGAACACGTTAAAATACACACGGGCGAGAAACCCCACCGATGCGACAAGTGCGGAAAGTCGTTCCCCGGCAAACACCGGCTGGCCAATCACAAGAAAACCCACGAACCGGACTACGAAGAGACAGTTCTTCGATTCAGGTGCAAGTTTTGTAACAAAGGTTTCCCCGGTCAGAGTCTGTGCACGAGGCATGAGAAACACCACCTGGGTGTCAAGGAGTTCATCTGCGACGTTTGCGGCAAAGCCCTCAGCTCGGCGCACGCCATGAGTCTCCACCTCCGCATCCACACCGGGGAAAAACCGTTTGTGTGCGACTTCTGCGGTAAGATGTTTGGCCGGTCGTCTGCTCTCACTATGCACTTACGCGTTCACACGGGTGAGCGCCCTTATGTGTGCAACGTGTGCGAGAAGGGGTTCACGCAGCCGTCGACGCTCACGATTCACAAGCGTTACCACACGGGCCAGAAACCGTTCACAAGCGATAACATTGTTTCTAAATCGTCAGAGAAATCCAAACGCCACAGCAAGAACTCTTTGATTTCGCCAAAGCCGCTATCTAGTCCTATAGAACTTATGGAATTGGAGTGGGGGAGCGCAGAGAGCCCCAAATGGAAATGTAAAATATGCGAGAAAGTGTTCAAACATAAATGTTCTCTGAAGAACCACATGCCGCTTCATAACACCGAGGGCTCTTTCAAATGCGACATCTGCCATAAGTTGCTCAAATCAGATTATCAGTTGACTCGACATAAAAAGAACCACAGCCAACTGAAAGACTACAAATGCGGCATCTGCGGCTATGAGACGTGTTCCAACAGTAAAATCGAGAGCCATAAGAGGAGACATCTGAATGAGTACAAAATGAAGTGTGAGCTGTGTAATAAAGGTTTCTACACCTCACGAGAGTTCCAGGAGCACTCCATGCTCCACACAGGAGAAAGGCCCCACTCCTGCGAGGTCTGCGGCAAGTCCTACCCCAACAAATACCGGCTGACGGACCACAAGAAGCTGCACGAGCCCAACTACGAGGAGATGTTCCTCAGGTTCAAGTGCCGGTTCTGCGACAAGGGCTTCCCCCGGCAAAGCTCTTGCGTCCGTCACGAGAAGCACCACCTGGGCGTCCGGGAATACGCCTGCGATGTGTGCGGCAAGGCGCTCAGTTCGGGGCCCCTCCTCAAGCTCCACATGCGCATCCACACGGGGGAGAAACCATTTGCATGCGACGTGTGCGGTAAATCGTTTGGGCGCGTGTATGCCTTGACCATGCACTACCGTGTCCACACGGGTGAGCGCCCGTACGGTTGCTCTGTGTGTGGAAAGAGGTTCACACAACACTCGAGTCTAACAATACACATGCGCTATCATACGGGGCAGAAGCCGTACGTGTGTAGCACATGTACGAGGTCGTTTGTTTCCAAAACTTTGCTGACGGCCCACATGAAGAGTCACGCGCACTAA
Protein Sequence
MHYNVYKLYCGTCSKGFNSEKLLKKHEELHSGIKPFQCEECGKPFASKYNLQKHKRLHDPVFRNNPKYYCEFCGKGFLYTTVLNKHRSIHMGENKFQCDLCGKTLNSKAYLRTHLRIHSGEKPYVCDVCGKAFVTNKHLTIHRRTHTGEKLYVCNVCNKSYTQVGYLTIHKRSHTGERPYHCESCGKDFASKCISRQKPKFVRIAPKPAPRPAPRAAPRPAPRLNQVITLLKPKQPEPKRDLWKCTPCGQSFSLLSEFKSHIMSQHRRNSHPCDMCDLKFISPVRLAKHRKCHGVMIRYSCDICGYSGQSKTLVETHRKMHYNEYKYYCSICSRGFNVVKLLKEHELLHTGIKPYQCEECGRQFTSKYNLAKHKRLHEPGYKDNPKFFCEFCGRSFLYSTVLNKHRSGHLGENRFECDVCGKVVNSKEYLKVHLRIHNGDKPYVCDICGKAFVTNKHLTVHRRTHTGEKLYTCNVCHKSYTQVGYLTIHKRSHTGERPYHCVVCGRDFASKLVNLVTGSAQANPALMPNVASVVKQNIARDKTPEKVSVKRSLFGDGIQWRCKKAHFSKVKHVCSTCSKEFKTEQSLIDHNLLAHTGINPFKCDHCGKRYKSKYYLKFHQKLHEPGFKDNPKYYCEICGKGFLYISALNNHRENHLGSNKVVCAICDVPLASKRYMTVHMRIHSGEKPFVCDVCGKAFVTNKHLTVHRRTHTGEKLYTCNVCNKSFTQVGYLTIHKRSHTGERPYRCETCRKGFASRQVPSDSTIRCEICGKEYKLDINLSRHMAKHNPDNQIKCDVCDKTCPSGFQLKQHLKIHETAKLYSCDMCSYASHVQVNLHAHRKRHLHDYKARCDVCNRGFYSMCSYKLHQAVHKDEMPFSCDVCGKLYKFKHYLTDHLQTHKPDYEETVLKFKCRYCGKGFRYRGKCEVHEKRHQGHKEVMCDICGKVLSTVRELGLHMRRHTGEKPYVCSYCGKLFSRLNSLTLHVRVHTGERPYTCDECGKSFTQHSSYAIHKRHHTGERPYCCLICQKTFVSKSLMKYHMKSHEEHKSPSSFFVEQSEEKSKSSKVNRKASNSVSGPVKRKVSNKKSIKCKICGNEYKRNSTLQNHLLVHGQDSHECDLCNKPFKSEWHLKQHKKYHSKARQYKCDMCGYETCIKGNLLCHKRKHLKHYIVKCDACNKGFYRTCDYQEHVKIHTGEKPHRCDKCGKSFPGKHRLANHKKTHEPDYEETVLRFRCKFCNKGFPGQSLCTRHEKHHLGVKEFICDVCGKALSSAHAMSLHLRIHTGEKPFVCDFCGKMFGRSSALTMHLRVHTGERPYVCNVCEKGFTQPSTLTIHKRYHTGQKPFTSDNIVSKSSEKSKRHSKNSLISPKPLSSPIELMELEWGSAESPKWKCKICEKVFKHKCSLKNHMPLHNTEGSFKCDICHKLLKSDYQLTRHKKNHSQLKDYKCGICGYETCSNSKIESHKRRHLNEYKMKCELCNKGFYTSREFQEHSMLHTGERPHSCEVCGKSYPNKYRLTDHKKLHEPNYEEMFLRFKCRFCDKGFPRQSSCVRHEKHHLGVREYACDVCGKALSSGPLLKLHMRIHTGEKPFACDVCGKSFGRVYALTMHYRVHTGERPYGCSVCGKRFTQHSSLTIHMRYHTGQKPYVCSTCTRSFVSKTLLTAHMKSHAH*

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01450784;
90% Identity
iTF_01450784;
80% Identity
-