Basic Information

Gene Symbol
-
Assembly
GCA_964007535.1
Location
OZ023333.1:38049326-38097642[-]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 55 0.0057 0.51 12.0 0.3 2 20 55 73 54 75 0.94
2 55 0.11 9.9 7.9 1.7 2 21 82 101 81 102 0.92
3 55 3 2.7e+02 3.4 0.3 2 11 140 149 139 152 0.90
4 55 0.0013 0.12 14.0 0.5 2 20 191 209 190 211 0.93
5 55 4.8e-05 0.0042 18.5 2.4 3 21 219 237 217 238 0.94
6 55 0.00041 0.036 15.6 5.3 1 21 244 264 244 265 0.95
7 55 0.00025 0.022 16.2 2.5 2 21 299 318 298 319 0.94
8 55 0.00014 0.013 17.0 1.5 1 21 354 374 354 375 0.94
9 55 9e-05 0.008 17.7 0.1 1 20 391 410 391 412 0.95
10 55 0.0085 0.75 11.5 0.2 3 21 420 438 418 439 0.93
11 55 0.00035 0.031 15.8 1.2 2 21 446 465 445 466 0.94
12 55 0.00015 0.013 17.0 0.5 1 21 492 512 492 513 0.96
13 55 0.00027 0.024 16.2 1.0 2 21 520 539 519 540 0.94
14 55 0.00016 0.014 16.9 4.7 2 21 547 566 546 567 0.94
15 55 0.00024 0.021 16.3 1.6 1 21 583 603 583 604 0.96
16 55 0.0024 0.21 13.2 1.6 2 21 611 630 610 631 0.94
17 55 0.00089 0.079 14.5 3.6 2 21 638 657 637 658 0.94
18 55 0.0052 0.46 12.1 2.3 2 21 665 684 664 685 0.93
19 55 0.00013 0.012 17.1 2.1 2 21 692 711 691 712 0.94
20 55 0.0012 0.11 14.1 1.8 1 21 738 758 738 759 0.95
21 55 0.0018 0.16 13.6 2.3 2 21 766 785 765 786 0.95
22 55 0.0023 0.21 13.2 1.4 2 21 793 812 792 813 0.94
23 55 0.00051 0.045 15.3 1.1 2 21 820 839 819 840 0.93
24 55 0.00015 0.013 17.0 0.5 1 21 866 886 866 887 0.96
25 55 0.00022 0.019 16.5 1.4 2 21 894 913 893 914 0.94
26 55 0.00029 0.026 16.1 0.7 2 21 921 940 920 941 0.94
27 55 0.00029 0.025 16.1 2.3 2 21 948 967 947 968 0.94
28 55 0.13 12 7.7 0.2 2 14 975 987 974 993 0.80
29 55 2.5e-05 0.0022 19.4 1.3 1 21 1021 1041 1021 1042 0.96
30 55 0.0027 0.24 13.0 4.2 2 21 1049 1068 1048 1069 0.94
31 55 2e-05 0.0017 19.7 2.5 2 21 1076 1095 1075 1096 0.94
32 55 0.018 1.6 10.4 0.2 2 20 1103 1121 1102 1122 0.82
33 55 0.0016 0.14 13.8 0.5 1 21 1147 1167 1147 1168 0.96
34 55 0.006 0.53 11.9 1.9 2 21 1175 1194 1174 1195 0.92
35 55 1.1 99 4.8 4.7 1 21 1201 1221 1195 1222 0.89
36 55 0.064 5.7 8.7 3.6 3 21 1230 1248 1222 1249 0.85
37 55 0.00053 0.047 15.2 1.5 2 21 1256 1275 1255 1276 0.94
38 55 0.00066 0.058 14.9 3.1 2 21 1283 1302 1282 1303 0.93
39 55 0.069 6.1 8.6 3.8 2 21 1310 1329 1303 1330 0.86
40 55 0.29 25 6.6 0.0 2 19 1337 1354 1336 1356 0.93
41 55 0.0016 0.14 13.8 0.5 1 21 1381 1401 1381 1402 0.96
42 55 0.0059 0.52 11.9 1.9 2 21 1409 1428 1408 1429 0.92
43 55 1 90 4.9 4.5 1 21 1435 1455 1429 1456 0.89
44 55 0.00053 0.047 15.2 1.5 2 21 1463 1482 1462 1483 0.94
45 55 0.00065 0.058 14.9 3.1 2 21 1490 1509 1489 1510 0.93
46 55 0.078 6.9 8.4 3.8 3 21 1518 1536 1510 1537 0.85
47 55 0.29 25 6.6 0.0 2 19 1544 1561 1543 1563 0.93
48 55 0.00066 0.059 14.9 1.9 1 21 1588 1608 1588 1609 0.96
49 55 0.0066 0.58 11.8 2.4 2 21 1616 1635 1615 1636 0.93
50 55 0.00095 0.084 14.4 4.8 2 21 1643 1662 1642 1663 0.94
51 55 0.21 19 7.1 1.1 3 21 1671 1689 1663 1690 0.85
52 55 0.00038 0.034 15.7 1.0 1 21 1716 1736 1716 1737 0.96
53 55 0.016 1.5 10.5 6.5 2 21 1744 1763 1743 1764 0.94
54 55 0.0035 0.31 12.7 3.6 2 21 1771 1790 1770 1791 0.93
55 55 2.2e-05 0.002 19.6 0.9 2 21 1798 1817 1797 1818 0.93

Sequence Information

Coding Sequence
ATGTCCCGCATGGGACAAACGCTTCACGCACAAGAGAACGAGGTGCAATGTTGTTGTGGTGAATTTTCCAACACTCTCGAGGATTATTCTGTCTGGTACCGGTACTTGCCTGAGATTGGCAGTGTTGCTTATTCTAATCAGAATTCTGATAGCGGGAAGCTCCAGTGCCCGACTTGCAGTAAGGTTGTAAGAGGAAAGTCTAGTCTAACCAGTCACATCACTCATGATTGCGGTCAAGATATCAGATGCCATGTATGCAACAAAGGTTTCAAGGCTAGGCAGACTCTTGTCATGCACAAGAAGCATGAGTGTGGTAAAGAGTTACAATGTCAAGGCTGTGATGATGCTGACTGGCATTTGCCTGTAATTGTCAAAGTTGCTACTCTTCTCGATCAGATTTCTGGTGTCGAAAATCTCCCGTGCCCGAAGTGCAAAAAGAATTTCAAATTGGGGAGTAGTAGTTTTCAAACCAGACACGCTCCTCATGAGTGTGACAATGCTGACTGGAACTTGCCTGAAATTAGTAATGTCGTTAGCCTTCCtattcaaaattctgatgaCAAGAAGCTCCAGTGCCCAACGTGCAATAAGGTATTCAAAATCAGGAGTCTTCTTAACAGTCACGTTAGTCATGACTGTGGTAAAGAAATAATATGCACTCTTTGCAACAAACATTTCACGCGTAAGAGCAATCTCGTTACACACATGAAGCATGAGTGTGGTAAAGATTTTCGTTGCACAGTTTGCCACAAAAGCTTCAAGCACCAGCGTACTCTCACCACACACACAAAGCATGAATGTGGTAAAGAATTACTATGCATTGTGTGCAACAAACGCTTCAAGTTCAAGAGGGATCACGTTGCGCACGTAAACTACGAGTGTGGTAAAGTTATCCGATGCAATGTGTGCAACAAATGCTTCAAGTATAAGACTGGTCTCCAAAGGCACGTAAAGTACCGTTGTGGTAAAAAGCTGCGATGCCAAGGCTGTGATGATGCTGCCTGGCCTATGATTAGCAATGTCGTTACTCTTCCTGTTCGGAATTCTGATGACAAGAAGTTCCAGTGTCCTCAGTGCAGTAAGGTTTTCAAGGTGAAACGTGAACTGACGAGACACTTCAAATATGACTGCGGTAAAGATATTCGATGCAAAATTTCTGATGGCATGATCTTCCAGTGCCCGACTTGTAATAAGGTGTTCGAGGAGAGAAGTGATCTCGCCAGACATGTTATTTATGAGTGTGGTAAAGATATTATATGCACTGTATGCAATAAACGCTTTGCATTTAAAACAGCGTTAATTTCGCACAAGAAGTATGAATGTGGTAAAGATATTCGTTGTACAGTTTGCAACAAAGGTTTCAAGCACAATAGTACTCTTATCACGCACACAAAGTATGAGTGTGATGATGCTGCCTGGCCTGTGATTAGCAATGTCGTTACTCTTCGTGTTCAGAATTCTGATGTCAAGACGTTCCAGTGTCCTCAGTGCAGTGAGGTTTTCAAGGTGAAACGTGCTCTAACCAGACACATAAATCATGAGTGCGGTAAAGAGATTCGATGCTCAGTGTGCAACAAACAATTTGCGCGCAAAACAACACTAACTAGGCACACGAAGTATGAGTGTGGAAAAGATCTTCGATGCAAAGTATGCCACAAATCATTCACGCACAAGACTAATCTAACAACTCACACGAAGTATGAGTGTGGAAAAGACATTCGATGCAAAAATTCTGATGTCAAAAAGTTCCAGTGTCTTCAGTGCAGTAAGGTTTTCAAGGTGAAACGTGCTCTAACCAGACACATGAATCATGAGTGCGGTAAAGAAATTCGATGTACAGTATGCAACAAACAATTCAAGTACCAAAGGGATCTCATTACTCATACAAAGCATGAGTGTGGTAAAGAACTTCAATGCAAAGGGTGCAACAAACACTTCAAGTACCAAAGAGATCTCATCAGACATACAAAGCATGAGTGTGGTAAAGATATCCGATGCAAAGTATGCGACAAACACTTCAAGTATATGACACATCTTATTACACACACGAATTATGAGTGCGGTAAAGACATTCGATGCAATGTTTGCGACAAATCCTTCACGCGGAAGACTAGTCTGAGTACTCACACAAAGCATGAGTGTGATGCTGCTGCCTTGCCTGTGATTAGCAATGTCGTTTCTCTTGGTGTTCCGAATTCTGCCGACAAGAAGTTTAGGTGTCCTCAGTGCAGTAAGGTTTTCAAGATAAAAACTGAACTACAGACTCACACAAAGCATGAATGTGGTAAAGATATCCGATGCAAAGTGTGCGACAAACAATTCAAGCTCAAAAAAACACTAATGACACACATGAAGCATGAATGTGGTAAAGATATCCGATGCGAAGTGTGCGACAAACGCTTCAAGTACAGGAAACGTCTAATTACACACATCAAACATGAGTGCAGTAAAGTTATTCGATGCAATGTGTGCGATAAACACTTCAAGTATCAGAAAGATCTTATCACTCACACGAATTATGAGTGCGATGATGCTGCAAGGCTTGTGGTTAGCAATGTCGTTACTCATCGTGTTCAGAATTCTGATGACAAGACGTTCCAGTGTCCTCAGTGCAGTGAGGTTTTCAAGGTGAAACGTGCTCTAACCAGACACATAAATCATGAGTGTGGTAAAGATATTCGATGCACAGTGTGCGATAAACGATTCGCGCGCAAAAGCACACTATTAACACACACGAAACACGAGTGTGGTAAAGATATTCGATGCAGAGTGTGCAACAAACAATTCGAGCGACAATCATCACTAATTAGGCACACGAAGTATGAGTGTGGTAAAGTAAATCGATGCAAAGTGTGCAACAAATCCTACACGCACAAGTCTTCTCTAATCACTCACACAAAGTATGAGTGTGGTAAAAATTTGCGATGTCGAGTATGTGGTAAATCTTTCAAAACTATCCGGGGTCAAAACATCCATAATGCTGCGTGTAAAGATGATGCTGCCTTGCCTGTGATCAGCAATGTCGTTATTCTTCGTGTTCAGAATTCTGCTGGCAAGAAGTTCCAGTGTCCTCAGTGCAGTAAAGTTTTCAAAGTGAAAACTGAACTAAACCGACACATCAAGCATGATTGTGGGAAAGATATTCGATGCACAGTGTGCGGCAAACACTTCAAGTTCCAGAGACATCTTATTACACACACGAAACATGAGTGTGGTAAAGATATCCGATGCACAGTCTGCGACAAATCGTTCACGCACAAGTCTAATCTAATCACTCACACAAAGCATGAGTGTAGTAAAAATTTGCGCTGTCAATACTGTGGTAAATCTTTCAAAACTATCCAGGGTCAAAGCCAGCATATCGCCGCTTATGAAGCAGCCTGGCCCGTGATTAGCAATGTCGTTACTCTTCGTGTTCCGAATTCTGCTGCAAAGAAGTTCCAGTGTTCTCAGTGCAGCAAGGTTTTCGAGGTGAAAACTAAACTGATGAGTCACATCAAGGATGAGTGTGGTAAAGAACTACGATGCAGAGTGTGCAACAGACAATTCAAATACCAGAGACATCTCACTGCGCATGAAAAATATGAGTGTGGTAAAGATTTTCGATGCATAGTGTGCTACAAACAATTCAAGTACAGGAAACATCTTATTACACACACGAAGCATGAGTGTGGTAAAGACATTCGATGCATAGTATGCAACAAATCCTTCACTCACAAGTCCACTCTGGATACGCACACAAAACATGAGTGTGGTAAAGATATTCGATGCACTGTGTGTAACAAGAGTTTCAAGTACCAAAGGGATCTCATAAGTCATACAAAGCATGAGTGTGGTAAAGAACTTCAATGCAAAGGATGCAACAAACACTTCAAGTACCAAAGAGATCTCAATAGACATGCGAAGCATGAGTGTGGTAAAGATATTCAATGCATAGTATGCAACAAATCCTTCACTCACAAGTCCACTCTGGACACGCACACAAAACATGAGTGTGGAAAAACTTTGCGATGTCAAGGCTGTGGTAAATCTCTCGAAACTATCCGGGGTCTAAACAGTCATAGCGCTCTCTATGAAGCAGCCTGGCCCGTGATTAGCAATGTCGTTACTCTTCGTGTTCCGAATTCTGCTGCAAAGAAGTTCCAGTGTTCTCAGTGCAGCAAGGTTTTCGAGGTGAAAACTAAACTGATGAGTCACATCAAGGATGAGTGTGGTAAAGAACTACGATGCAGAGTGTGCAACAGACAATTCAAATACCAGAGACATCTCACTGCGCATGAAAAATATGAGTGTGGTAAAGATTTTCGATGCATAGTGTGCTACAAACAATTCAAGTACAGGAAACATCTTATTACACACACGAAGCATGAGTGTGGTAAAGATATTCGATGCACTGTGTGTAACAAGAGTTTCAAGTACCAAAGGGATCTCATAAGTCATACAAAGCATGAGTGTGGTAAAGAACTTCAATGCAAAGGATGCAACAAACACTTCAAGTACCAAAGAGATCTCAATAGACATGCGAAGCATGAGTGTGGTAAAGATATTCGATGCATAGTATGCAACAAATCCTTCACTCACAAGTCCACTCTGGACACGCACACAAAACATGAGTGTGGAAAAACTTTGCGATGTCAAGGCTGTGGTAAATCTCTCGAAACTATCCGGGGTCTAAACAGTCATAGCGCTCTCTATGATGCTGCCTGGCCTGTGATTAGCAATGTCGTCACTCTTCCTGTTCGGAATTCTGATGACACGACGTTCCAGTGTTCTAGGTGCAGTAAGGTTTTCAAGGTGAAGACTGAACTAAACACTCACATTAAACATGAGTGCGGTAAAGATCTTCGTTGCTCGATGTGCAACAAGCAATTCGCGCGCAAGACAACACTTTCTACGCATACAAAGCATGAATGTGGAAAAGATCTCCGATGCACAATATGCAACAAACACTTCAAGTACCGGAGACATCTCACTACGCATATAAAGCATGAGTGTGGTAAAGATATTCGATGTATTGTGTGCAACAAATCTTTTGCGCTCAAGACCTCTCTAATCAATCACACAAAGCATGAGTGTGATGGTGCTTCCTGGTCTGTGACTAGCAATGACGTTACTCTTCGTGTTCCGAATTCTGCTGGCAAAAAGTTCCAGTGTCTTCGGTGCAGTAAGGTTTTCAAGGTGAAAACTGAACTGAACCGTCACATCAAGTATGAGTGTGGTAAAGAGCTCCGATGCAAAGTGTGCGACAAACACTTCAAGTCCCAGAGACAATTTTGTACACACAGGAAATATGAGTGTGGTAAAGATATTCGATGTGAAGTGTGCGACAAACACTTCAAGTACCAGAGACATCTTATTACACACAAGAACCATGAGTGTGGTAAAGATATTCGATGCAAAGTGTGCAACAAATCATTTACGCACAAGTCTTCTCTAATCGCTCACGCAAAGCGTGTGTGTGGAAGAAATTTGCGGTGTCGAAGCAACGCTTACAACTATGAAGATAACGAAAAGAGCTTTGTCTACTTGAAGAAGGGATTTGATGGATTCTTAGCCAGGTATCTGCCCGCTTAA
Protein Sequence
MSRMGQTLHAQENEVQCCCGEFSNTLEDYSVWYRYLPEIGSVAYSNQNSDSGKLQCPTCSKVVRGKSSLTSHITHDCGQDIRCHVCNKGFKARQTLVMHKKHECGKELQCQGCDDADWHLPVIVKVATLLDQISGVENLPCPKCKKNFKLGSSSFQTRHAPHECDNADWNLPEISNVVSLPIQNSDDKKLQCPTCNKVFKIRSLLNSHVSHDCGKEIICTLCNKHFTRKSNLVTHMKHECGKDFRCTVCHKSFKHQRTLTTHTKHECGKELLCIVCNKRFKFKRDHVAHVNYECGKVIRCNVCNKCFKYKTGLQRHVKYRCGKKLRCQGCDDAAWPMISNVVTLPVRNSDDKKFQCPQCSKVFKVKRELTRHFKYDCGKDIRCKISDGMIFQCPTCNKVFEERSDLARHVIYECGKDIICTVCNKRFAFKTALISHKKYECGKDIRCTVCNKGFKHNSTLITHTKYECDDAAWPVISNVVTLRVQNSDVKTFQCPQCSEVFKVKRALTRHINHECGKEIRCSVCNKQFARKTTLTRHTKYECGKDLRCKVCHKSFTHKTNLTTHTKYECGKDIRCKNSDVKKFQCLQCSKVFKVKRALTRHMNHECGKEIRCTVCNKQFKYQRDLITHTKHECGKELQCKGCNKHFKYQRDLIRHTKHECGKDIRCKVCDKHFKYMTHLITHTNYECGKDIRCNVCDKSFTRKTSLSTHTKHECDAAALPVISNVVSLGVPNSADKKFRCPQCSKVFKIKTELQTHTKHECGKDIRCKVCDKQFKLKKTLMTHMKHECGKDIRCEVCDKRFKYRKRLITHIKHECSKVIRCNVCDKHFKYQKDLITHTNYECDDAARLVVSNVVTHRVQNSDDKTFQCPQCSEVFKVKRALTRHINHECGKDIRCTVCDKRFARKSTLLTHTKHECGKDIRCRVCNKQFERQSSLIRHTKYECGKVNRCKVCNKSYTHKSSLITHTKYECGKNLRCRVCGKSFKTIRGQNIHNAACKDDAALPVISNVVILRVQNSAGKKFQCPQCSKVFKVKTELNRHIKHDCGKDIRCTVCGKHFKFQRHLITHTKHECGKDIRCTVCDKSFTHKSNLITHTKHECSKNLRCQYCGKSFKTIQGQSQHIAAYEAAWPVISNVVTLRVPNSAAKKFQCSQCSKVFEVKTKLMSHIKDECGKELRCRVCNRQFKYQRHLTAHEKYECGKDFRCIVCYKQFKYRKHLITHTKHECGKDIRCIVCNKSFTHKSTLDTHTKHECGKDIRCTVCNKSFKYQRDLISHTKHECGKELQCKGCNKHFKYQRDLNRHAKHECGKDIQCIVCNKSFTHKSTLDTHTKHECGKTLRCQGCGKSLETIRGLNSHSALYEAAWPVISNVVTLRVPNSAAKKFQCSQCSKVFEVKTKLMSHIKDECGKELRCRVCNRQFKYQRHLTAHEKYECGKDFRCIVCYKQFKYRKHLITHTKHECGKDIRCTVCNKSFKYQRDLISHTKHECGKELQCKGCNKHFKYQRDLNRHAKHECGKDIRCIVCNKSFTHKSTLDTHTKHECGKTLRCQGCGKSLETIRGLNSHSALYDAAWPVISNVVTLPVRNSDDTTFQCSRCSKVFKVKTELNTHIKHECGKDLRCSMCNKQFARKTTLSTHTKHECGKDLRCTICNKHFKYRRHLTTHIKHECGKDIRCIVCNKSFALKTSLINHTKHECDGASWSVTSNDVTLRVPNSAGKKFQCLRCSKVFKVKTELNRHIKYECGKELRCKVCDKHFKSQRQFCTHRKYECGKDIRCEVCDKHFKYQRHLITHKNHECGKDIRCKVCNKSFTHKSSLIAHAKRVCGRNLRCRSNAYNYEDNEKSFVYLKKGFDGFLARYLPA

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-