Basic Information

Gene Symbol
-
Assembly
GCA_963693495.1
Location
OY856401.1:2771920-2788178[-]
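The Location value appears to follow a `sequence:start-end[strand]` layout. The sketch below splits it into fields. The helper name `parse_location` is hypothetical, and the span length assumes 1-based inclusive coordinates, which the record does not state.

```python
import re

# Assumed layout: <sequence accession>:<start>-<end>[<strand>]
LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(location):
    """Split a location string into its parts (hypothetical helper)."""
    match = LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    start, end = int(match["start"]), int(match["end"])
    return {
        "seqid": match["seqid"],
        "start": start,
        "end": end,
        "strand": match["strand"],
        # Span length, ASSUMING 1-based inclusive coordinates.
        "length": end - start + 1,
    }


print(parse_location("OY856401.1:2771920-2788178[-]"))
```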

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The zinc finger is described by the following pattern: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and the numbers indicate the number of residues (a range where given in brackets). The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
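The pattern in the description can be written as a regular expression. This is a rough sketch, not the profile HMM that hmmscan uses: the `#` positions are treated as any residue (the description does not restrict them further), and the names `C2H2_PATTERN` and `find_c2h2_candidates` are illustrative.

```python
import re

# Literal transcription of #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C].
# ASSUMPTION: '#' (fold-stabilising) positions are matched as any residue.
C2H2_PATTERN = re.compile(
    r"""
    .        # '#'  position important for the fold
    .        # X
    C        # first zinc-coordinating cysteine
    .{1,5}   # X(1-5)
    C        # second zinc-coordinating cysteine
    .{3}     # X3
    .        # '#'
    .{5}     # X5
    .        # '#'
    .{2}     # X2
    H        # first zinc-coordinating histidine
    .{3,6}   # X(3-6)
    [HC]     # final position: His or Cys
    """,
    re.VERBOSE,
)


def find_c2h2_candidates(protein):
    """Return (start, end, segment) for non-overlapping pattern matches.

    Coordinates are 1-based and inclusive.
    """
    return [
        (m.start() + 1, m.end(), m.group(0))
        for m in C2H2_PATTERN.finditer(protein.upper())
    ]


# Fragment taken from the protein sequence of this record.
print(find_c2h2_candidates("YICAICGKGYMTATRLRKHSEVH"))
```

A plain pattern match like this is stricter in some places and looser in others than a profile search, so its hits need not coincide with the domains reported by hmmscan.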
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 38 0.038 3.6 9.0 0.6 1 21 128 148 128 149 0.95
2 38 0.64 61 5.1 1.1 3 23 178 198 177 198 0.96
3 38 0.017 1.6 10.1 0.4 2 23 221 242 220 242 0.96
4 38 0.005 0.47 11.8 0.1 1 23 246 268 246 268 0.96
5 38 0.021 2 9.8 0.1 1 23 273 296 273 296 0.93
6 38 0.0051 0.49 11.7 0.3 1 23 302 325 302 325 0.95
7 38 0.0019 0.18 13.1 1.3 1 23 332 355 332 355 0.97
8 38 5e-05 0.0048 18.1 0.9 3 23 364 384 362 384 0.97
9 38 1e-05 0.00098 20.2 1.6 1 23 390 412 390 412 0.98
10 38 0.0086 0.82 11.0 0.2 1 23 490 513 490 513 0.95
11 38 0.0026 0.25 12.7 1.1 1 23 582 604 582 604 0.99
12 38 0.0001 0.0097 17.1 1.0 1 23 608 630 608 630 0.97
13 38 0.0094 0.9 10.9 0.9 1 23 664 687 664 687 0.93
14 38 0.25 23 6.4 0.3 1 23 751 773 751 773 0.95
15 38 0.19 18 6.8 0.2 2 23 800 822 799 822 0.94
16 38 0.0011 0.11 13.8 1.0 1 23 844 866 844 866 0.98
17 38 7.4e-06 0.00071 20.7 1.1 1 23 870 892 870 892 0.98
18 38 1.8 1.7e+02 3.8 1.3 1 21 897 917 897 920 0.93
19 38 0.22 21 6.6 1.3 1 19 926 944 926 946 0.82
20 38 0.24 23 6.5 0.2 1 23 1007 1029 1007 1029 0.96
21 38 0.0061 0.58 11.5 0.0 2 23 1052 1074 1051 1074 0.96
22 38 0.00043 0.041 15.1 1.1 2 23 1097 1118 1096 1118 0.96
23 38 0.99 95 4.5 0.2 3 23 1124 1144 1122 1144 0.95
24 38 0.0093 0.88 10.9 0.2 2 23 1152 1174 1151 1174 0.94
25 38 0.0034 0.33 12.3 1.0 1 23 1180 1203 1180 1203 0.92
26 38 0.028 2.6 9.4 2.0 1 23 1209 1232 1209 1232 0.96
27 38 0.00095 0.091 14.0 1.0 1 23 1238 1260 1238 1260 0.96
28 38 9.3e-06 0.00089 20.4 2.8 1 23 1266 1288 1266 1288 0.98
29 38 0.00029 0.028 15.7 4.7 1 23 1294 1316 1294 1317 0.96
30 38 0.046 4.4 8.7 0.8 1 23 1405 1427 1405 1427 0.95
31 38 0.046 4.4 8.7 0.1 3 23 1449 1469 1448 1469 0.95
32 38 0.00063 0.06 14.6 2.5 1 23 1491 1513 1491 1513 0.97
33 38 0.00028 0.026 15.7 0.4 1 23 1517 1539 1517 1539 0.98
34 38 0.00014 0.013 16.7 0.6 1 23 1544 1567 1544 1567 0.95
35 38 0.23 22 6.6 0.4 1 23 1573 1596 1573 1596 0.88
36 38 2.4e-05 0.0023 19.1 0.4 1 23 1603 1626 1603 1626 0.96
37 38 9.9e-05 0.0095 17.1 3.9 1 23 1632 1654 1632 1654 0.98
38 38 0.00011 0.011 17.0 5.1 1 23 1661 1684 1661 1684 0.97
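Each row of the table above has thirteen whitespace-separated fields, so it can be read into records and filtered. This is a minimal sketch: the field names and the helpers `parse_domain_rows` and `confident_domains` are hypothetical, and the 0.01 i-Evalue cutoff is an arbitrary illustration, not a threshold used by this resource.

```python
# Hypothetical field names for the thirteen columns of each row.
FIELDS = (
    "index", "of", "c_evalue", "i_evalue", "score", "bias",
    "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to", "acc",
)
INT_FIELDS = {
    "index", "of", "hmm_from", "hmm_to",
    "ali_from", "ali_to", "env_from", "env_to",
}


def parse_domain_rows(text):
    """Parse table rows into dicts, skipping header and blank lines."""
    rows = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) != len(FIELDS) or not parts[0].isdigit():
            continue  # not a data row
        rows.append({
            name: int(value) if name in INT_FIELDS else float(value)
            for name, value in zip(FIELDS, parts)
        })
    return rows


def confident_domains(rows, max_i_evalue=0.01):
    """Keep rows whose independent E-value is at or below the cutoff.

    ASSUMPTION: 0.01 is only an example cutoff.
    """
    return [row for row in rows if row["i_evalue"] <= max_i_evalue]


# Three rows copied from the table above.
example = """\
8 38 5e-05 0.0048 18.1 0.9 3 23 364 384 362 384 0.97
9 38 1e-05 0.00098 20.2 1.6 1 23 390 412 390 412 0.98
18 38 1.8 1.7e+02 3.8 1.3 1 21 897 917 897 920 0.93
"""
rows = parse_domain_rows(example)
print([row["index"] for row in confident_domains(rows)])
```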

Sequence Information

Coding Sequence
ATGGACGACCTAGCCACGTCGTGTTCTACGCCTTCGGAGAGCATTAAGTTGGCCGAAGAAGCCATTGCCATGTTTCGCGCTGGAGGTTACGACTTAGTGAAATTCTCCAGCAACTTTCCTGAGGTTATGAATGCAATTCCTGCCTCACATCGGATCTCCGATGTCGTGGAAATAAATCCCGAAAGCCACCTGAAAATACTAGGAAACCGCGCCCAAACGAATGAAATAACTATGATACAAAATGGCGATATAGAAACTAAATTGAACAAGTCCACCTATAAAAGGAGCGCGAGGTCTGAAGCCAGGATAGCGATAAAGAATAATGCGACATCTATACTTCAATGTTGGACTCTATGCCCGTTTAGATGGAAGAAGAATAGATTTAAATGTGCCTATTGTGAAGAAGACTTCACCGAATGTACTTCGCTTAGAGGTCATGTGAGAGTGTGTTCAACTTTGCACACCGTTGCTGATATATATCGAAAATATAGAGAGCTGCCAGTCATAAACATAGATACGATTGATGCCATATGCTGTTTCTGCTCCACGCCTTTCACTGATGTCATTCAGATGAGAGCCCACACAATCAGACACGGCTTCGAATTCAACACCGAGCAGCCAGATGGCGTGTTGCCGTTCTCCCTTGACAAGGACGCTTGGAACTGCGTCATATGTAACGAGAAGTTCAATAATTTCCTCAAACTGTACGAACACATGAATGTCCATTATCAACACTATATTTGCGCTATCTGCGGGAAAGGGTACATGACAGCAACGAGATTGAGAAAACACTCGGAGGTGCACGTGACGGGATCATTTCCATGCAACGAGTGCCCTAAAATATTCACAATGAGAGCATCTAGAGATGCCCATAAAGCAACAGTCCACCCGAAAACTCCAAGATATGAGTGCCCGTACTGTAATATGCGTTTTGACGGATACTACAACCGAATGGCACATCTGAAGGACGCGCACAGAGAAAAGGAAGTGTCTTACGATTGTGGTCATTGTGAATTATCATTCAAAACGTCAGGGAAGAGGTCTATACATATACGTACCGTGCACTTTCCGCCTAAGTCTGAATTTGGATGCATGTACTGCGAGTGGCAGTTTAAGACCAACTATGAGTTGAAGAGGCACATGATAAAGCACACTGGGGAAAAGAGTTTCCATTGCTCGGTGTGCGGTAGTTCCTTCTCAAGGAACAAAGCGTTGATTAACCACATGAAAGTCCATCAGAAATTGATCTGTTTTAATATAGAAGCCGAGGGAAATGTAGGCCAAATAGTATCGGGCAGGAAATTAAAGGAAGTCGTGTGCGCACGCCAAATGCGCAGACGACGTCGGGCGAACAATGAAATTCCAGATCATCTCAAGAAACGTATATCCAAACAAATGATGCGGAAGAATGCATTGACCCTTCTAGAATGCTCTACCGCGTGGGCGTTTAGATGGTTCCACGGGATGTTCTATTGTTCTTATTGTGACGCTAAATTCCCTGACTCGATCCCACTACGTGAACACGTAGAGAAATTCCATCTAACCGAATTACCGAGTAAAATTGTGTTTTCGAAACTCTCGGAAAATAATATGGTCAAAATTGACATTGCCAAACTAAGTTGTAGGTTCTGTAAATGTGAATTAGGTTGCTTAGACTCCCTTAAAATGCACATGAAAATCCACGAGAGACATTTGTGCGAAGGTTACAGTGATGGCATCCTGCCTTTCAAGCTAAGTGAAGAGTTTCAATGTCAAATCTGTTTTAAGATTTTCTCGAATTTCCCAAGAATAAATGAGCACATGAACACACATTATCAAAATCATGTTTGCGACGAGTGTGGTAAGGCTTTTATATCCGAATCTAGATTTCGAAAGCATGCGCAATCCCATGAGTCTGGAAGTTTCCCTTGTGCTCTCTGCGAATTAGTTCTACCGACTAGAGTGGCTCGAATGTGTCATAGACAAAAAGTACACCGGAAAGGGATAAGGTACAACTGTCC
GAGGTGTCCGGAGGTATTCACGTCGTATCATGGGAGAACGAAGCATTTATCGGAGGCTCATAGACAAAGAAGAGCATTCAACTGTAACATTTGCGGAACTGAAGAGGGAAGAAGTAAAGCCAGTTCCGAGAAAAAGCCTATTAAAGTTGTGTGGAAACGCAAATATGGTGCCTTAAGCGAAGATAAAGCGAATGCTGCTTTGCTTCTAGAACATTCCAACGCTGTTGCTTTCAGATGGCAACGCGGAAAGTTTATGTGCGCTTATTGTCCTAAAATATGCGCCAACGTTGCCGAAGTTCGTTCACATTCCGTTACGCATAGCCAACTAGATTTACTGTTAAAACCTGTTAAAGAAGTCAGAAATAGCTTTCCTCTGCGTGTAGATATAACCAATCTAACTTGCTCTATATGCGAGTCTCTAGTCAAGACGCTAGATAATCTTAAACTCCATCTAAAGGAAATACACGGAAAGAATATAAACTCTAATTACACTGACGGTTTGATACCTTTTGTCTTGACTGGAAAAAAGTATAGTTGCCTACACTGCGGCGAGAGTTTCGCAGGATTTATGACTCTATTTGTGCATATGAACCAGCATTATCAAAGTTTCGTCTGCGATGAATGCGGTAAAGGATATTCTGCAAATCACAAACTCAGAGCGCATCAGAGGACTCATGATAATGGAAAGTTTGTTTGTTCCAAATGTGAATTGGATTTTCCTAATCGTGTTGTCAGGAACAGGCATATCAAATGTGTTCATGGTCGGAAACAGCGCTATCGATGTCCTATTTGCGATTCACACTTTGACTCTTATCACTTGAGACTTAAGCACTTAGATTGCGTTCATGGCCAGAAGTCTGAATATAAGTGCAGCTTGTGTCCTTCAGTGTTTGGCAGTGGGCCAACCAACTACAAAGTACAATGGACATCGACGCGCAAGTTTTATGATCAAAGAGACAATGCTGCAATTATTATAGACTGTACAAATGTCATACCGTTTCGTTGGAGAACAGGTGCGTTCAAATGCGCATACTGTACTCTAGCTTTCGGtgattttgaattaataaaagcACACGCGACGAGTCATTCCAATGCAATTAAAACTATACGCCTCCACAGTGATGTTAACATGAAAATAGAAATATCTAACTTACGCTGTGAGATTTGTTATCAACCCTTCGAAGATTTGGATTCTTTAGTTGAACATCTGATGGTAACACATCAGAAAAACATCGACAAAGAGCACGGTTTAGGTGTGTCGCCGTTCATTATTACACCTAAAGAGTGTATTTGCCCATATTGCAAAGAGTCTTTCGATTTATTCTCTgctttaaataaacatttgaatATTCATAATCCCACCAGTATATGCTTTTTGTGTGCAAAATTATTTGCAACAATGACGAGGCTCAAGGCTCATTTGGCGATTCATGATTCTGAAGTTGTCCACCTTAAATGTGGTAAATGTAATGAGACTTTCGAGAGCAGGGCTTTAAAGAACGCTCACATTAAGGCCGTGCACAAAAAGGAATGTAGATACAAATGTCAAGATTGTAACATTTCGTTTAAAAGTTATTCAGATCGCGGGAAACATTTGGCGGAGTACCACGATAGAAAGATTGAGTATCCGTGTAACTTGTGTTCTAGAGTGTTCGCGATGTGTAACCAAAGAACGAAGCATATCAAACAAGTGCACACTAGGCCAAAGATACACAACTGCACGTATTGTTCATATTCGACTGTGACGGCGGGACAATTGCGCGCGCATCTGATCCGCCATACTGGCGAGAGGACGTATCATTGTGAAGTGTGTAAGAAGTCTTACGCGCGATCGAAAACTTTGCGAGAGCATATGAGAAtacataataatgataaaagaTTTGTCTGTGATTATTGTAATAGTGCATATGTACAGAAATGTAGTCTCCAGAGTCATATGAGAACGCACCATCCAGACGCTTTACCTGTAAAGAAAATGACGCTTGTTGgaagAAAATCCG
AAAgaaaacaaagagaacaaaaacaaaataccgatgatgataatgatgatgatgacgatgaagATGAAGAGGACGAAGCAAAAAAAGCGGCTGACGCTGCAAATCTGTCGAAAAAGATGCTTAAAAGGGCAAAGAAGGAAGTGTTCAGAAAAAATATTGGAGTTATAATGGATTGTACCACAGCTTATCCCTTTAAGTACGGCAAAGGGCTGTTCCTCTGCTTCTACTGCGAGAGCACTTTCATTGAGCCAGAGTACTTGAGGTATCATACGCGTTTACATGAGAAAACCAGCGTGAGAACCAAAGGCTACGAGccattaaaatttgattttgaaGGAAGTGCTTGCAAATTGTGTGGCTTAGAAACTGAAGACTATTCAACATTGAAGACTCATTTAGCAGAGCATGGAAAAGAATTGGATAACACGCACGGAGAAAGTGTGCTTCCGTACAAACTGAGTATCAACGAATACATTTGTCAGATATGTGGGAAGAAGTACGAGAAGTTCCTCAACTTGCATAAGCACATGAACGAGCATTTCCAACAGTTTGTGTGCGAAACGTGCGGCAAGAGGTTTGCATCGTACCAAAGAGTCGCCAATCACATACGAATGCACGAAATGGGACAGTTTCCGTGCAAGAAGTGCGGTGAAAGCTTTCCAACTTACGGATCTCTCCATACGCACGTAGCGAAAGTACATCGGGATAATAATCGGTACAAATGCCCGCTTTGTGATGCGAAGTTCTCGTGGTACAAGCAGAGAGTTGTACACCTGAACAAAGTGCACGGAGACACAAGTACGGTGTTCCCTTGCCCGTCTTGTCCGAAAGTTTTCAACATTCCTGGCCATTTACAGCGGCATATTAAATATACGCACCTCCATGAAAGGAAATTTAGTTGCACGGAATGCGGTAAACATTTCTTCGCCAACTACGAGCTGCAGAAGCATTTGAAGAAACACGCCGGCGAGACATTTTTGCACCAATGCCACTTCTGCGAGAAGGCGTACACCCGTTCGAACAATTTGAAGGAGCATATTATGAAAAATCATGACTATAGTGCGATACCTGAAACGAAATTAGAACAAGATGCAAGTCTCAGTCAAATGCAGTAG
Protein Sequence
MDDLATSCSTPSESIKLAEEAIAMFRAGGYDLVKFSSNFPEVMNAIPASHRISDVVEINPESHLKILGNRAQTNEITMIQNGDIETKLNKSTYKRSARSEARIAIKNNATSILQCWTLCPFRWKKNRFKCAYCEEDFTECTSLRGHVRVCSTLHTVADIYRKYRELPVINIDTIDAICCFCSTPFTDVIQMRAHTIRHGFEFNTEQPDGVLPFSLDKDAWNCVICNEKFNNFLKLYEHMNVHYQHYICAICGKGYMTATRLRKHSEVHVTGSFPCNECPKIFTMRASRDAHKATVHPKTPRYECPYCNMRFDGYYNRMAHLKDAHREKEVSYDCGHCELSFKTSGKRSIHIRTVHFPPKSEFGCMYCEWQFKTNYELKRHMIKHTGEKSFHCSVCGSSFSRNKALINHMKVHQKLICFNIEAEGNVGQIVSGRKLKEVVCARQMRRRRRANNEIPDHLKKRISKQMMRKNALTLLECSTAWAFRWFHGMFYCSYCDAKFPDSIPLREHVEKFHLTELPSKIVFSKLSENNMVKIDIAKLSCRFCKCELGCLDSLKMHMKIHERHLCEGYSDGILPFKLSEEFQCQICFKIFSNFPRINEHMNTHYQNHVCDECGKAFISESRFRKHAQSHESGSFPCALCELVLPTRVARMCHRQKVHRKGIRYNCPRCPEVFTSYHGRTKHLSEAHRQRRAFNCNICGTEEGRSKASSEKKPIKVVWKRKYGALSEDKANAALLLEHSNAVAFRWQRGKFMCAYCPKICANVAEVRSHSVTHSQLDLLLKPVKEVRNSFPLRVDITNLTCSICESLVKTLDNLKLHLKEIHGKNINSNYTDGLIPFVLTGKKYSCLHCGESFAGFMTLFVHMNQHYQSFVCDECGKGYSANHKLRAHQRTHDNGKFVCSKCELDFPNRVVRNRHIKCVHGRKQRYRCPICDSHFDSYHLRLKHLDCVHGQKSEYKCSLCPSVFGSGPTNYKVQWTSTRKFYDQRDNAAIIIDCTNVIPFRWRTGAFKCAYCTLAFGDFELIKAHATSHSNAIKTIRLHSDVNMKIEISNLRCEICYQPFEDLDSLVEHLMVTHQKNIDKEHGLGVSPFIITPKECICPYCKESFDLFSALNKHLNIHNPTSICFLCAKLFATMTRLKAHLAIHDSEVVHLKCGKCNETFESRALKNAHIKAVHKKECRYKCQDCNISFKSYSDRGKHLAEYHDRKIEYPCNLCSRVFAMCNQRTKHIKQVHTRPKIHNCTYCSYSTVTAGQLRAHLIRHTGERTYHCEVCKKSYARSKTLREHMRIHNNDKRFVCDYCNSAYVQKCSLQSHMRTHHPDALPVKKMTLVGRKSERKQREQKQNTDDDNDDDDDEDEEDEAKKAADAANLSKKMLKRAKKEVFRKNIGVIMDCTTAYPFKYGKGLFLCFYCESTFIEPEYLRYHTRLHEKTSVRTKGYEPLKFDFEGSACKLCGLETEDYSTLKTHLAEHGKELDNTHGESVLPYKLSINEYICQICGKKYEKFLNLHKHMNEHFQQFVCETCGKRFASYQRVANHIRMHEMGQFPCKKCGESFPTYGSLHTHVAKVHRDNNRYKCPLCDAKFSWYKQRVVHLNKVHGDTSTVFPCPSCPKVFNIPGHLQRHIKYTHLHERKFSCTECGKHFFANYELQKHLKKHAGETFLHQCHFCEKAYTRSNNLKEHIMKNHDYSAIPETKLEQDASLSQMQ
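One way to check that the coding sequence and the protein sequence agree is to translate the former and compare the result with the latter. The sketch below assumes the standard genetic code (the record does not say which table applies). The name `translate` is illustrative. Whitespace is removed and lower-case bases are upper-cased before translation.

```python
from itertools import product

# Standard genetic code (ASSUMED), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence up to, but not including, the first stop."""
    seq = "".join(cds.split()).upper()  # drop line breaks, ignore case
    protein = []
    # A trailing incomplete codon, if any, is ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the coding sequence in this record.
print(translate("ATGGACGACCTAGCCACGTCG"))  # MDDLATS
```

Passing the full coding sequence and comparing the result with the protein sequence would flag any mismatch between the two fields.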

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-