Basic Information

Gene Symbol
-
Assembly
GCA_033807575.1
Location
CM066368.1:10239698-10244982[-]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The zinc finger is described by the pattern #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and the numbers (or ranges in parentheses) give the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
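The pattern in the description can be turned into a regular expression for a quick scan of a protein sequence. The sketch below is illustrative only: it assumes the '#' positions can be matched as any residue (the description does not list which residues are allowed there), and the names `C2H2_PATTERN` and `find_c2h2` are hypothetical. A regex scan of this kind is much cruder than the profile HMM search reported under Hmmscan Out and is not expected to reproduce it.

```python
import re

# Pattern from the description:
#   #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C]
# Assumption: '#' (fold-stabilising positions) is matched as any residue,
# because the description does not say which residues are permitted there.
_PARTS = [
    ".",       # '#'
    ".",       # X
    "C",       # first zinc-coordinating Cys
    ".{1,5}",  # X(1-5)
    "C",       # second zinc-coordinating Cys
    ".{3}",    # X3
    ".",       # '#'
    ".{5}",    # X5
    ".",       # '#'
    ".{2}",    # X2
    "H",       # first zinc-coordinating His
    ".{3,6}",  # X(3-6)
    "[HC]",    # final His or Cys
]
C2H2_PATTERN = re.compile("".join(_PARTS))


def find_c2h2(protein: str):
    """Return (start, end, match) tuples, 1-based inclusive, for non-overlapping hits."""
    return [
        (m.start() + 1, m.end(), m.group())
        for m in C2H2_PATTERN.finditer(protein.upper())
    ]


# Residues 43-66 of the protein sequence in this record (hmmscan domain 1).
fragment = "FQCYYCTDAFCDPTEMRNHNRDRH"
hits = find_c2h2(fragment)
```

Because `finditer` returns non-overlapping matches and the spacers are greedy, closely packed fingers may be merged or missed; treat the hits as candidates to check, not as domain calls.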
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 34 0.43 36 6.3 2.8 1 23 43 66 43 66 0.94
2 34 0.02 1.7 10.5 1.6 2 23 133 154 133 154 0.96
3 34 0.0044 0.37 12.6 1.1 1 21 158 178 158 179 0.96
4 34 7 5.8e+02 2.5 2.0 1 23 247 270 247 270 0.88
5 34 8.1 6.8e+02 2.3 0.2 3 23 294 314 293 314 0.93
6 34 0.016 1.3 10.9 1.6 2 23 337 358 336 358 0.97
7 34 1.9e-05 0.0016 20.0 2.1 3 23 364 384 363 384 0.98
8 34 0.037 3.1 9.7 1.7 1 23 389 412 389 412 0.92
9 34 0.07 5.8 8.8 0.4 1 23 421 444 421 444 0.91
10 34 0.0016 0.13 14.0 2.5 1 23 451 474 451 474 0.97
11 34 0.00028 0.023 16.4 1.0 1 23 481 503 481 503 0.96
12 34 1.1e-05 0.00094 20.7 0.5 1 23 509 531 509 531 0.99
13 34 5.4 4.5e+02 2.9 0.6 1 13 537 549 537 550 0.89
14 34 2.7 2.3e+02 3.8 2.9 1 23 596 619 596 619 0.94
15 34 0.15 12 7.8 3.5 1 23 688 710 688 710 0.96
16 34 0.0029 0.24 13.2 0.2 1 21 714 734 714 735 0.93
17 34 0.041 3.4 9.5 0.5 1 23 787 810 787 810 0.92
18 34 0.048 4 9.3 0.1 2 23 831 852 831 852 0.95
19 34 0.00095 0.079 14.7 2.1 1 23 875 897 875 897 0.97
20 34 1.5e-05 0.0012 20.4 0.5 1 23 901 923 901 923 0.98
21 34 0.00029 0.025 16.3 1.8 1 23 931 954 931 954 0.96
22 34 0.0022 0.18 13.5 0.6 2 23 961 983 960 983 0.98
23 34 0.0054 0.45 12.3 0.4 1 23 990 1013 990 1013 0.96
24 34 0.029 2.4 10.0 3.3 1 23 1019 1041 1019 1041 0.96
25 34 0.056 4.7 9.1 0.4 1 23 1106 1129 1106 1129 0.88
26 34 4.5 3.8e+02 3.1 0.1 3 23 1159 1180 1159 1180 0.93
27 34 0.00023 0.019 16.7 0.5 1 23 1202 1224 1202 1224 0.98
28 34 0.006 0.5 12.2 0.3 1 23 1228 1250 1228 1250 0.94
29 34 0.52 43 6.1 0.2 1 23 1255 1278 1255 1278 0.87
30 34 0.008 0.67 11.8 4.4 1 23 1284 1307 1284 1307 0.95
31 34 0.00013 0.011 17.4 0.2 1 23 1314 1337 1314 1337 0.98
32 34 5.2e-05 0.0044 18.6 0.9 1 23 1343 1365 1343 1365 0.96
33 34 4.8e-05 0.004 18.8 5.0 1 23 1371 1393 1371 1393 0.98
34 34 0.00085 0.071 14.8 4.6 1 23 1399 1421 1399 1422 0.95
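The domain rows above are whitespace-separated with 13 fields each, so they can be parsed with plain string splitting. The following is a minimal sketch, not part of any official tooling: the `DomainHit` type, the `parse_row` helper, and the i-Evalue cutoff of 0.01 are all assumptions made for illustration. Three rows are copied from the table as sample input.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """One row of the per-domain table (field names are illustrative)."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_row(line: str) -> DomainHit:
    """Parse one whitespace-separated domain row into a DomainHit."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


# Three rows copied from the table above (domains 4, 7 and 12).
ROWS = """\
4 34 7 5.8e+02 2.5 2.0 1 23 247 270 247 270 0.88
7 34 1.9e-05 0.0016 20.0 2.1 3 23 364 384 363 384 0.98
12 34 1.1e-05 0.00094 20.7 0.5 1 23 509 531 509 531 0.99
"""

hits = [parse_row(line) for line in ROWS.splitlines()]

# Assumed, illustrative cutoff: keep domains with independent E-value <= 0.01.
I_EVALUE_CUTOFF = 0.01
confident = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]
```

With these sample rows, domains 7 and 12 pass the assumed cutoff and domain 4 does not; a different threshold may be more appropriate depending on the analysis.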

Sequence Information

Coding Sequence
ATGACCGCAGCGGAAAACACTGAGGATTCTCTCTGGCGCGCCAACCTATGGAATAACGTTAATACTATCATCGCTATGACCACAGTGTATCCATTCAAGTACAAAGCCAACCTAAACGGCGCGATATTTCAATGTTATTATTGCACAGATGCATTCTGCGATCCAACAGAAATGAGAAATCATAATCGTGACCGTCATTCTGACAACAAAAAAGTTCCCAGACCTAAATGCTATCAACCCGTTAAAATGGACTTCACAGCTGCATCGTGTAAAGTGTGTGCTGCTGACATTCAGAACTATGCTATGCTAAAAACCCACTTGGCGGAACACGGTAAAGTCATCGACACAAAGCGTGGAGAGAGTATCTTGCCGTATTACCTGGGATCAGAGTCAAACGTCTGCCAGATATGCGAGAAGAAGTTTCATAAATTCGTGGGTTTACTCAGACATATGTCAGAGCATTCCGATCGTCATCGATGCAAAAAATGTGGGCAAAGATTAAACTCTTATGCTCAGCTTATGACTCATATCCGACAAAATAAATGTAGCACGCGCCAGAAGAAGTCATTTGATATCGAAAATGCTAGTGCATCGGCAGATAGATTTATGGAAGAACTTGAAGATGAAGCTTTTGCTAGACACGCCGCCGAGCAGACTAAGTTTGCTAACTTCAGAAATAACATTTCTATAATAACAACCATGACTACAGTCTACCCCTTTAAGTATAAAAGGGGCTTATTTCAATGTTACTACTGTGACACGACGTCTCTAAACCCACACGATATTCGAAGTCACAACCGTGATCAACATTATGATATTAAATCATTTCCAAACTTTAGAGTTTATCAGCCTTTAAAAATGGACTTTGCGGATGCTTCTTGCAAGCTTTGTTTAGCCGAGATTCGCGACTATCCTACGTTAAAAAACCATCTAGCCAAACACGGCAATATCATAGACACCTCTCTTAGCGAGAGCGTGATTCCCTACAACTTGAACAAAGATGAGATCGTTTGTCAAATATGCAATACAACCTACCAATCGTTTTTGACTCTACACAGACACATGAACGTACATTGCGCCTATAACATTTGTGATCAATGTGGCAAAGCTTTTAAAACTCACTCACAGCTATCCATTCATCGCCAAACTCATGAGAGCGGCGTATTCCCTTGCAAGGAATGCAATCAAACATTCCCGTTCCTTTCACGTCTACATTCTCACAGTGCCAAAGTACATAAACGCAAGATACCCCACACCGTATACAAATGCCCCGTTTGTAAAGAGGAATTCCCTACATACAGAAAAAGGTTGATACACTTGTCCCAAGTCCATGGCCACAAAATGCCAGAGTTCCCCTGTCCCTCCTGCAATAAGGTCTTCAATTTGTGCAGCAGTATGACCAACCATGTAAAAACGAAACActtacaaaaagagaaaaatcaCCCATGCACCTTGTGTGACATGAAGTTCTCTAACAGTACTGCGCTTAAAGATCATATCGTTAAACATAACGGCGAAAGGAAATTTCAGTGTGATATTTGCAAGAAGGCGTATGCCAGGATTAAGACCCTGAGGGAACACATGCGTATTCATAACGACGACCGACGGTTCGCTTGTGCTGTGTGTGGGCTGACTTTCATACAGAAGTGCAgaCGGGAAAGTGAAGAACCCATACAGGCTAAGAAAATGAGCGAAGCAGAGCGCTGGCGCGCCAATTTAAGAGATAACGTTTCTGCTATTATATCTACTACAACAGCGTAtccatttaaatacaaaaaaggaCAATACCTTTGTTTCTTCTGTAAAGACGAATTTTCAGAAGCAGAAAAGCTTCGCGAACACCATCGAAAGCAACATTATGATGAGATACAACTGCCTAAGCTGAAGGCGTATGATCCAACAAAAGTGGACATAGCGGATGCTTCGTGCAAACTGTGCTTAGCCCAAATTACAGACTATACCATGTTAAAAACCCATTTAGCCGAACACGGCAAACAGTT
AGTTACTACAAACTCTGCGCAGGGCGAGACTGTGCTTCCATTCAAAATGCTTAGAAACAGCCACATTTGCCAACTATGCGGAAAGCCGTGTGAGTCACTAATGAGATTACTTAATCACATGCACGTACACTCTACTCGTCATGTGTGTGATAAATGCGGAAGTCGATTCAGGAGTTCGGTCCTGCTGGCTAAACATGTTCAAATGAAAGAGTGCGGTACATATAAAAAGGATTCCGTTGAGGCAGACGATGCGGATGACCGATTCCCCGGCGAACGAAGACGCATTAGTCTAATAAACAATCTCTCAATTATCATAGCGATGACAACAGTTCAGCCATTCCGTTTTGACAAAGGAGTCTATCTCTGTTACTATTGCAAAGAAACTTTCACAGAGCCAGCGGTGCTTCGACAACATAATCGCGACCAACAcactgaaattaaaaaaatacccaaATATTATGAACCCGCTAAAGTTGACTTTGTAGACGCTTCATGCAAGATTTGTTCGAAAGAAATTCCCGATAATCAAACATTGAAAGCCCACCTGGCGGAACATGGGAAGGAATTGGACATGACTCACGGCGATAGCGTTATCCCCTATAATCTAAACAGTTCTTCTGAACATGTGTGTCAAATATGTGGAAAGAAGTATGAAACATTCGTAGGCTTACATAGACATATCAACGAGCATTATGACAGCTATATATGTGATCAATGTGGAAAGAAGTTTATCACGTCTCGGCAAATGATCAGGCATGCTCGCATCCACGACGCCCAGGAACAGGGCTTGTTTCCTTGCAAGAACTGCGATCAAACATTCACAATTCGCGCTCATCTGACTTCGCACATTACCAAAGAACACAAATCCAAAGAGCGGAATAAATGTCCTATTTGCAACGAGAAATTTGCCTTTTACGGACAGAGATTGAAACACCTTCGTACTGTCCATGATCAGAAAACCGCCGCTTATCCTTGTCCCTCGTGTGACAAGGTCTTTGATCTGAGTAGTAAACGATCTGAACATATAAGGTATCAACACTTGCAAGTGAAGAAACATGAGTGTCATATTTGTGGCATGAAGTTCTTTGGTAAACATGGGCTCGACGCTCACTTGGTTAAGCATGGCGGTGATGAGACCTCACCCACCGACAGTGAGAAACAGGTTGCTGGCTTGAAAACCGGCGGAATTAAAGAAAACGaagacaatttaaaaaaatctaatgtcGATGTCGACGACCTGAAAGCTAGGAAGAAGAACGCTACAATGTTACTACAATTCACAAAAATATGTCCGTTTAGATGGATGAAGAATTTATACTTGTGTTTTTACTGTGATAACCAATTTAACAATCCAGCCTGGCTGAGGACGCATAATGAAAATGAGCATTGCGTCAAAACTGCTGCACAAGTTAGATGTGCTCTCTCAAAACTGAAGAAATACGAACTTGCCAAAGTAGATGTGACTAACACTGGATGCGCTATTTGCGACGATGTTATAACTGATTTCAATGAGTTAAAAAAGCATCTGCtagaaaaacataacaaaaacattGACCCGAAATCAAATGACGGTATTCTCCCATTTAAAGTAAGGAAAGATGAGTTTAAATGCGCAATTTGTTCTGAAACGTTTGCTGAATTCAAAAGTTTAAACCAGCATATGAACGTGCACTATCAGAATTTTATATGCGAGCAGTGCGGAACGGGTTTTATCACCCCAGAGCGTTTGAGGACGCACTCATTTTCGCACGAGACGGGTTCTTTCCCATGTGACGCTTGTGAAAAAGTTTTTAGATCAACAAATGCCAAGAACGAACATTACGCTCTCGTTCACATGCGGGTTAAGCGACATCGATGCCCCCATTGCCCTGAGACATTTAGAAATTACTTTCAAAGAAACAAACATGTTTCGTCCGTCCACGGTCTGAAACTAAAAGAGTTTAAGTGTAATTTGTGTCCTAAAGTTTTCACACTAAGTGGGAAACTGGGAGTTCATGTCA
GGACGGTGCATTTGAAGCTCAAAAGGCATGCGTGCGAAGTATGCGAATGGAAGTTCTATTCTAAATCTGAGCTAAAGGAGCATATGATCCGTCACGGAGGAGAGAGGAAGCATCAGTGTGCAGTGTGTAAAAAGTGCTATGCAAGAAAATATACTCTGCGGGAACATATGCGTATACACGAGGACGACAGGAGATTCATTTGTGCGACTTGCGGGGAGTCATTTGTGCAAAACTGTAGTTTGAAACATCACGTAAAAGTACACCATCCGGTTAAATATGAATTTATGTCATGA
Protein Sequence
MTAAENTEDSLWRANLWNNVNTIIAMTTVYPFKYKANLNGAIFQCYYCTDAFCDPTEMRNHNRDRHSDNKKVPRPKCYQPVKMDFTAASCKVCAADIQNYAMLKTHLAEHGKVIDTKRGESILPYYLGSESNVCQICEKKFHKFVGLLRHMSEHSDRHRCKKCGQRLNSYAQLMTHIRQNKCSTRQKKSFDIENASASADRFMEELEDEAFARHAAEQTKFANFRNNISIITTMTTVYPFKYKRGLFQCYYCDTTSLNPHDIRSHNRDQHYDIKSFPNFRVYQPLKMDFADASCKLCLAEIRDYPTLKNHLAKHGNIIDTSLSESVIPYNLNKDEIVCQICNTTYQSFLTLHRHMNVHCAYNICDQCGKAFKTHSQLSIHRQTHESGVFPCKECNQTFPFLSRLHSHSAKVHKRKIPHTVYKCPVCKEEFPTYRKRLIHLSQVHGHKMPEFPCPSCNKVFNLCSSMTNHVKTKHLQKEKNHPCTLCDMKFSNSTALKDHIVKHNGERKFQCDICKKAYARIKTLREHMRIHNDDRRFACAVCGLTFIQKCRRESEEPIQAKKMSEAERWRANLRDNVSAIISTTTAYPFKYKKGQYLCFFCKDEFSEAEKLREHHRKQHYDEIQLPKLKAYDPTKVDIADASCKLCLAQITDYTMLKTHLAEHGKQLVTTNSAQGETVLPFKMLRNSHICQLCGKPCESLMRLLNHMHVHSTRHVCDKCGSRFRSSVLLAKHVQMKECGTYKKDSVEADDADDRFPGERRRISLINNLSIIIAMTTVQPFRFDKGVYLCYYCKETFTEPAVLRQHNRDQHTEIKKIPKYYEPAKVDFVDASCKICSKEIPDNQTLKAHLAEHGKELDMTHGDSVIPYNLNSSSEHVCQICGKKYETFVGLHRHINEHYDSYICDQCGKKFITSRQMIRHARIHDAQEQGLFPCKNCDQTFTIRAHLTSHITKEHKSKERNKCPICNEKFAFYGQRLKHLRTVHDQKTAAYPCPSCDKVFDLSSKRSEHIRYQHLQVKKHECHICGMKFFGKHGLDAHLVKHGGDETSPTDSEKQVAGLKTGGIKENEDNLKKSNVDVDDLKARKKNATMLLQFTKICPFRWMKNLYLCFYCDNQFNNPAWLRTHNENEHCVKTAAQVRCALSKLKKYELAKVDVTNTGCAICDDVITDFNELKKHLLEKHNKNIDPKSNDGILPFKVRKDEFKCAICSETFAEFKSLNQHMNVHYQNFICEQCGTGFITPERLRTHSFSHETGSFPCDACEKVFRSTNAKNEHYALVHMRVKRHRCPHCPETFRNYFQRNKHVSSVHGLKLKEFKCNLCPKVFTLSGKLGVHVRTVHLKLKRHACEVCEWKFYSKSELKEHMIRHGGERKHQCAVCKKCYARKYTLREHMRIHEDDRRFICATCGESFVQNCSLKHHVKVHHPVKYEFMS
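A simple consistency check between the coding sequence and the protein sequence is to translate one into the other. The sketch below assumes the standard genetic code and uses a hypothetical `translate` helper. It uppercases the input because the coding sequence in this record contains lowercase stretches, and it strips whitespace so a sequence wrapped over several lines can be passed in directly.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; '*' marks stop codons, 'X' unknown codons."""
    seq = "".join(cds.split()).upper()  # drop line breaks, ignore case
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


# First 30 and last 18 nucleotides of the coding sequence in this record.
n_term = translate("ATGACCGCAGCGGAAAACACTGAGGATTCT")
c_term = translate("TATGAATTTATGTCATGA")
```

Here `n_term` corresponds to the first ten residues of the protein sequence and `c_term` to its last five residues followed by the stop codon. To compare the full record, remove the trailing `*` from the translation before checking it against the protein sequence.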

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-