Basic Information

Gene Symbol
ZEB2
Assembly
GCA_949128085.1
Location
OX421882.1:4710647-4888487[-]
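
The Location field packs the sequence accession, coordinates, and strand into one string. The sketch below shows one way to split it apart. The `Location` type and the `parse_location` and `span_length` helpers are hypothetical names introduced here, and the length calculation assumes the coordinates are 1-based and inclusive, which the entry itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'seqid:start-end[strand]' string."""
    seqid: str
    start: int
    end: int
    strand: str


# seqid, then ':', start, '-', end, and the strand in square brackets.
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string such as 'OX421882.1:4710647-4888487[-]'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        seqid=match["seqid"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


def span_length(loc: Location) -> int:
    """Genomic span in bp, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("OX421882.1:4710647-4888487[-]")
print(loc.seqid, loc.strand, span_length(loc))
```

Under that coordinate assumption the locus spans 177841 bp on the minus strand.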

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. Its two conserved cysteines and two conserved histidines coordinate a zinc ion. The finger is described by the pattern #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and the numbers give how many residues occupy that stretch (a range where shown in parentheses). The positions marked # are important for the stable fold of the zinc finger. The final position can be either histidine or cysteine. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this core does not cover every Sp1 site: for example, it excludes the GAG (=CTC) repeat that forms a high-affinity site for Sp1 binding to the wt1 promoter [1].
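
The pattern in the description can be written as a regular expression. This is a minimal sketch, not the Pfam model: the description does not say which residues the `#` positions allow, so they are treated here as "any residue" (an assumption), which makes the expression looser than the HMM behind PF00096. `C2H2_PATTERN` and `find_c2h2_candidates` are hypothetical names.

```python
import re

# #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C]
C2H2_PATTERN = re.compile(
    r"."       # '#': fold-stabilising position (ASSUMED to accept any residue)
    r"."       # X
    r"C"       # first zinc-coordinating cysteine
    r".{1,5}"  # X(1-5)
    r"C"       # second zinc-coordinating cysteine
    r".{3}"    # X3
    r"."       # '#'
    r".{5}"    # X5
    r"."       # '#'
    r".{2}"    # X2
    r"H"       # first zinc-coordinating histidine
    r".{3,6}"  # X(3-6)
    r"[HC]"    # final histidine or cysteine
)


def find_c2h2_candidates(protein: str) -> list[tuple[int, int, str]]:
    """Return (start, end, text) for non-overlapping candidate fingers.

    Coordinates are 1-based and inclusive, counted along the input string.
    """
    return [
        (m.start() + 1, m.end(), m.group())
        for m in C2H2_PATTERN.finditer(protein.upper())
    ]


# Residues 56-78 of the protein sequence in this entry (hmmscan domain 1).
finger = "HDCDICEKSFCSAFDLHMHKRIH"
print(find_c2h2_candidates(finger))
```

A pattern match of this kind only flags candidates. The hmmscan scores and E-values in the table remain the evidence for whether a stretch is a plausible domain.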
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 84 0.012 2.1 10.5 5.6 1 23 56 78 56 78 0.96
2 84 0.00094 0.17 14.0 0.1 1 23 84 106 84 106 0.98
3 84 6e-05 0.011 17.7 1.2 1 23 112 134 112 134 0.96
4 84 1.9e-05 0.0036 19.3 1.6 1 23 140 162 140 162 0.98
5 84 5.4e-05 0.01 17.9 1.5 1 23 168 190 168 190 0.98
6 84 3.6 6.6e+02 2.7 1.9 1 13 196 208 196 210 0.91
7 84 0.046 8.4 8.6 2.3 5 23 328 346 325 346 0.96
8 84 3.6 6.6e+02 2.7 1.6 1 23 472 494 472 494 0.97
9 84 0.074 14 8.0 3.4 2 23 501 522 500 522 0.97
10 84 0.0066 1.2 11.3 0.1 1 23 528 550 528 550 0.93
11 84 0.00016 0.029 16.4 0.8 1 23 556 578 556 578 0.98
12 84 0.0013 0.24 13.5 0.4 1 23 627 650 627 650 0.97
13 84 0.00014 0.025 16.6 4.7 1 23 656 678 656 678 0.97
14 84 0.11 20 7.5 4.5 1 23 684 706 684 707 0.96
15 84 0.0005 0.091 14.8 1.1 1 23 712 734 712 734 0.97
16 84 0.59 1.1e+02 5.2 0.6 1 23 741 763 741 763 0.97
17 84 0.00034 0.063 15.3 0.5 1 23 769 791 769 791 0.98
18 84 9.4e-05 0.017 17.1 0.2 1 23 797 819 797 819 0.98
19 84 0.081 15 7.9 0.6 1 23 825 847 825 847 0.98
20 84 6.2 1.1e+03 1.9 9.5 1 21 853 873 853 874 0.93
21 84 0.11 21 7.4 3.7 1 23 965 987 965 987 0.96
22 84 0.0024 0.45 12.7 0.5 1 23 996 1018 996 1018 0.98
23 84 2.1 3.9e+02 3.4 0.0 1 17 1024 1040 1024 1040 0.92
24 84 0.001 0.19 13.9 1.0 1 23 1119 1141 1119 1141 0.98
25 84 0.16 30 6.9 0.0 1 17 1147 1163 1147 1163 0.91
26 84 0.0004 0.073 15.1 1.3 1 23 1214 1236 1214 1236 0.98
27 84 0.81 1.5e+02 4.7 4.1 1 23 1242 1264 1242 1264 0.98
28 84 3.1e-05 0.0057 18.6 1.2 1 23 1270 1292 1270 1292 0.97
29 84 0.00076 0.14 14.2 1.3 1 23 1298 1320 1298 1320 0.99
30 84 4e-05 0.0073 18.3 0.2 1 23 1326 1348 1326 1348 0.98
31 84 3 5.5e+02 2.9 0.6 3 17 1485 1499 1484 1505 0.90
32 84 5.3e-06 0.00097 21.0 3.2 1 23 1511 1533 1511 1533 0.99
33 84 2.4 4.3e+02 3.3 0.3 1 16 1826 1841 1826 1843 0.80
34 84 0.028 5.2 9.3 0.5 1 23 1901 1924 1901 1924 0.95
35 84 2.8 5.2e+02 3.0 6.7 1 23 1930 1952 1930 1952 0.96
36 84 0.00038 0.07 15.2 0.0 1 23 2298 2320 2298 2320 0.98
37 84 0.002 0.36 12.9 0.5 1 23 2387 2410 2387 2410 0.98
38 84 0.0042 0.77 11.9 0.6 1 23 2490 2513 2490 2513 0.97
39 84 0.05 9.1 8.5 0.1 1 23 2610 2633 2610 2633 0.96
40 84 1.1 2.1e+02 4.2 11.5 1 23 2639 2661 2639 2662 0.96
41 84 0.0017 0.3 13.2 2.0 1 23 2708 2731 2708 2731 0.97
42 84 0.002 0.36 12.9 0.5 1 23 2820 2843 2820 2843 0.98
43 84 0.012 2.1 10.5 0.3 1 23 2880 2902 2880 2902 0.98
44 84 2.8 5.2e+02 3.0 0.2 1 16 2983 2998 2983 3000 0.92
45 84 7e-05 0.013 17.5 5.3 1 23 3035 3057 3035 3057 0.95
46 84 0.066 12 8.2 2.7 1 23 3063 3085 3063 3085 0.98
47 84 0.0039 0.71 12.0 0.4 1 23 3091 3113 3091 3113 0.99
48 84 0.012 2.3 10.4 0.1 1 21 3119 3139 3119 3140 0.92
49 84 0.0008 0.15 14.2 0.4 1 23 3423 3445 3423 3445 0.98
50 84 0.065 12 8.2 0.3 1 23 3608 3630 3608 3630 0.97
51 84 0.55 1e+02 5.3 0.6 1 23 3924 3946 3924 3946 0.94
52 84 0.00044 0.081 15.0 2.0 1 23 3954 3976 3954 3976 0.99
53 84 0.00031 0.057 15.5 0.4 1 23 3982 4004 3982 4004 0.98
54 84 4.3 7.9e+02 2.4 2.8 1 23 4053 4076 4053 4076 0.90
55 84 0.021 3.8 9.7 3.3 1 23 4082 4104 4082 4104 0.97
56 84 0.14 25 7.1 2.8 1 23 4148 4170 4148 4171 0.94
57 84 0.41 75 5.7 5.7 1 23 4177 4200 4177 4200 0.96
58 84 0.0038 0.7 12.0 0.6 1 23 4206 4228 4206 4229 0.95
59 84 2.9e-06 0.00052 21.9 2.7 1 23 4234 4256 4234 4256 0.96
60 84 0.025 4.5 9.5 1.2 1 23 4262 4284 4262 4284 0.98
61 84 0.001 0.19 13.8 3.4 1 23 4290 4312 4290 4312 0.98
62 84 0.015 2.7 10.2 0.1 1 23 4447 4470 4447 4470 0.95
63 84 0.0015 0.27 13.3 0.7 1 23 4528 4551 4528 4551 0.97
64 84 0.021 3.8 9.7 0.2 2 23 4616 4638 4616 4638 0.97
65 84 0.00016 0.03 16.3 0.5 5 23 4992 5010 4991 5010 0.96
66 84 0.00069 0.13 14.4 1.1 1 20 5016 5035 5016 5037 0.94
67 84 7.6 1.4e+03 1.7 0.2 13 23 5132 5142 5127 5142 0.89
68 84 0.00016 0.03 16.3 0.5 5 23 5150 5168 5149 5168 0.96
69 84 0.00069 0.13 14.4 1.1 1 20 5174 5193 5174 5195 0.94
70 84 3.8e-05 0.0069 18.4 0.7 1 23 5376 5398 5376 5398 0.98
71 84 4.8 8.8e+02 2.3 2.0 1 13 5404 5416 5404 5417 0.91
72 84 0.52 96 5.3 1.4 1 23 5475 5498 5475 5498 0.95
73 84 0.00053 0.097 14.7 0.2 1 23 5733 5756 5733 5756 0.96
74 84 6.9 1.3e+03 1.8 2.9 1 23 5797 5820 5797 5820 0.95
75 84 0.0012 0.21 13.7 0.6 1 23 5825 5847 5825 5847 0.95
76 84 0.57 1e+02 5.2 4.7 1 23 5853 5875 5853 5875 0.98
77 84 0.00036 0.065 15.3 0.2 1 23 5884 5906 5884 5906 0.97
78 84 0.00028 0.051 15.6 0.5 1 23 5912 5934 5912 5934 0.98
79 84 0.019 3.5 9.8 0.3 5 23 5944 5962 5943 5962 0.95
80 84 0.0065 1.2 11.3 2.8 1 23 6487 6509 6487 6509 0.97
81 84 3e-05 0.0055 18.7 1.0 2 23 6539 6560 6538 6560 0.96
82 84 1.1 2e+02 4.3 4.2 1 23 6567 6589 6567 6589 0.98
83 84 0.0026 0.47 12.6 0.3 1 23 6595 6617 6595 6617 0.97
84 84 1.1e-05 0.0021 20.0 0.4 1 23 6623 6645 6623 6645 0.98
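
Each row of the table above has 13 whitespace-separated fields: domain index, domain count, conditional and independent E-values, score, bias, the HMM, alignment and envelope coordinate pairs, and the mean posterior accuracy. The sketch below parses rows in that layout and keeps the better-supported domains. `DomainHit`, `parse_domain_row`, and `filter_hits` are hypothetical names, and the 0.01 independent E-value cutoff is only an illustrative assumption, not a threshold taken from this entry.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    """One row of the per-domain hmmscan table shown above."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_domain_row(line: str) -> DomainHit:
    """Parse a 13-column row; raise ValueError on any other layout."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        int(f[6]), int(f[7]), int(f[8]), int(f[9]), int(f[10]), int(f[11]),
        float(f[12]),
    )


def filter_hits(hits: list[DomainHit], max_i_evalue: float = 0.01) -> list[DomainHit]:
    """Keep hits at or below the cutoff (0.01 is an illustrative assumption)."""
    return [h for h in hits if h.i_evalue <= max_i_evalue]


# Three rows copied from the table above.
rows = [
    "1 84 0.012 2.1 10.5 5.6 1 23 56 78 56 78 0.96",
    "4 84 1.9e-05 0.0036 19.3 1.6 1 23 140 162 140 162 0.98",
    "6 84 3.6 6.6e+02 2.7 1.9 1 13 196 208 196 210 0.91",
]
hits = [parse_domain_row(r) for r in rows]
for hit in filter_hits(hits):
    print(hit.index, hit.ali_from, hit.ali_to, hit.i_evalue)
```

Of the three sample rows, only domain 4 passes the illustrative cutoff. Domain 6 is also a partial match, covering HMM positions 1 to 13 of 23.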

Sequence Information

Coding Sequence
ATGCCATCATGCATAAAAAATGAGGATATAGAAGAGGGTACCTATTTCGTAACATCGTTAGATATTGAACAAGCAATGTCGAAACGATACAAGAATGGGTTAACGTCACTATGTGTCAAGAAAGAAGAGGGAAGCCACATAGAGAGTCTCCGAAGTACCGAGCAACATGATTGCGATATATGCGAGAAATCGTTTTGTTCAGCTTTTGATTTACATATGCACAAACGCATACATACTGGAGAAAAGTCTTTCGTTTGTGCTATTTGCGAAGAAGTCTTTGGAACCAAACAATTGCTTCGGTCACACATTTTAAGTCACACGAACGAGAAGCCTTACTTATGTAAATATTGCGACAAAGGTTTTGCACGTAAATCGTACCTAACgccacatttgcgtacgcatactggAGAAAAGCCATACATATGTGGATTTTGCAGCAAGGGTTTTGCGCAACCTACACACTTAACGAGGCacttgcgtacgcataccggtgaaaagccttatatatgtaaattttgcgacaagggttttgcgcacACTACAAACCTAGCagtacatttgcgtacgcataccggtgaaaagccttacgtatgtaaattttgcgataaggATTTCGCGCATCAGTTTACACCATCATGCTTCAAAACTGAGGACATGGAAGAGGGTCCTTATATCATAACATCATTAGATGTTAAACAAGAGAAATCGCAACGACAGAACAATAAGATAACGCAACTCTGGGTTAAAACAGAGTTACCTGATCGCAATGAGGAGAAAATGAAACCTTTTAGTGTAGTGTACACTTGTGACGAACACATAAACGCCAGTCACGAAAAGAGTAATATGTACAAATGCAAATGTTGCTGCAAACACTTCTCCAGTAAGCAGGACTTGTCCGTATATGGAACAGACCAAATGAAGGAGTTCAGATATAAGTGCAATGATTGCGAACCTCACCATGGTACAGGACAGCACGAGTGCGAATGCGGGAAATCTTTTGCCTCGCTCCAGTGTTTGAATGTGCATAAACGCATACATACCGGAGAGAAGCCGTTTAAACCATCATGCTTCAAAAGCGAGGATATAGAAGAGGATTCTTGTATCATAACACCGTTAGATGTCAAACAAGCAATTTCGAAACGATACAAAAATGAGGCAACGCCACTATGCGTAAAGAAGCAGGTATTAGATCATTATACGGAGGAAACGAAACACAATAGTACATCATATGCTTGTGAAGAGTGCGGGAACAATTACAGTATCAAAGGAGGGTGCGGGAAACACTTAAAGACTCTTAACGAAGAGGGTGAAATCTATAAATGTGAATTTTGCTGCGATAAATGCGTGAAAGATTTCAAAGCCAAACTATATTTAGACCGACAAATGGCGACACACGGTTCCAGCAAGGAGGAGTATGTGTGCAAGGTGTGTTCGACCATACTCCATGGAAAGGGAAGCTTTCATGTCCACATGGAGCTACACCAAAGTGCAGCGCAACTTAAGTGTGATATATGCGAGAAGTGGTTTTGTTCAGCCTCTGATATGCATGTGCACAAGCGTACACATACCGAAGAGATGCCCTTCCTTTGTGTTATTTGCGAGGAAGCTTTTAGAACTAAAGAATTGCTTCGATCGCATGTTTTGGGACACACAAAcgagaagccttacatatgcaaattttgcggGAAGGTTTTCTGGCGAACTACAGACCTGGCGAGACATTTACGTACGCACAGCGGggaaaagccttacatGTTCACACCAGCAGGCTTCAAGAGCGAGGATATAGAAGTGAGTATTCCTGAAGTACCCGTATTATGGATTAAGGATGAGATTTCGGATAGTCATGAGAAGCAAAAGCAGCAGAATAGTGCAGCTTTTGCTTGTTACGACTGCGGAAACGGTTACAAGAGCAAAGGAAGCTACGAGAAGCACATAAAAACCGTCCACGAAAatgataaattttacaaatgtgAATTTTGTGGCAAGCACTTTTCAAG
TAAATACAACTTGTCCGTTCATAGAACACAGCACACGAAAGAGTTTAGATATAGGTGTGATAAATGTGAGCGCGGTTACCTGAGGTTGTACGACCTAAAACACCATCAGAACGTTCACCATAATACGCCCAATTTCTTTTGCGGTCAATGCGGAAAAGCTTTCAAGATCAAACGTTATTTAAAGGAACACATGACGATTCACGATTCCAGCAAGGAGAAGTACGCGTGCGGTGTGTGCTCGGCCGTACTCCATCAGGAGAAATCTTATAATCGCCACATGGACCGCCACCAGGGTAAAGGGCAACATAAATGCGATATATGCGAGAAATCGTTATCCTCGGCATCTGGCTTGCTTGAGCATAGACGCATACATACCTGCGAGAAACCCTTCATTTGCGCTATTTGCGAGAGAACTTTTGCAGCCAAAGGATCGCTTCGGTTACATATGCGGACGCACACGAAGgaaaagccttacgtatgtaaattttgcgacaagtaTTTTGCATGGACTACCAGCCTAGCGATCCATTTGCGTCAGCATTCCGGCGAAAAGCCTCATAAATGTTTACATTGCGGAAGGAGATTTACATGCACTTCAAATTTGAACCAACATAAATGCAAGAGGGTGCAGTTTACACCAGCATGCCTCAAAATCGAGGATGTAGAAGAGGATCCCTATATCATAACACCATTTGATGTTACACAAGAAAGTTCACAACGATACAATGATAAGGTAATACCACTATGTGTTAAGAAGGAGCAATGGGATCATCAGGACGAAGAAGCGAAACTCTTCAGTGTGGCGTACACCCATGAAGTTCGCGGAAACGATTACAAGATCAAGGAAAGCTACGACAAAGACTTAAAGCCCATTCACGAAAAGGGTAAAGTttacaaatgcaaattttgcggCATGCAGTTCTTCAGTGAGCACTACTTATCCGTACATAGAACAGGGCACTTGAAGGATGGTGGTACAGTGCAGCATAAGTGCGATGTATGCGGGAAACCTTTGGCTTCACTCCAGAGCTTGAATATGCATAAACGCATACACACCGGAGAGAAGCCATTCGTTTGTGCCATTTGCGAGAAAGGTTTCGCAGTCAAAGGATCGCTTCGGTTTACACCATCACGCCTCAAAAACGAGGATGTAGAAGAAGGTCCTTATATCATAACACCATTAGATATTAAACAAGAAAGTTCACAACCATACAATAATAAGGTAACACCACTATATGTTAAGAAGGAGCAATGGGATCGTCAGGAGGAGGAATTTTGCAGCATGCAGTTCTTCAGTAAGCACTACTTGTCCATACATAGAACAAAGCACACGGAAGGTGGTGGTATAGTGCAGCACAAGTGCGATGTATGCGGGAAGTCTTTGGCTTCACTCCAGAGCTTGAATATGCATAAACGCATACACACCGGAGAGAAGCCATTCATTTGCCCCATTTGCAAGAAAGCTTTCGCAGTCAAAGGATCGCTTCGGTTCATACCGACGTGCGTCAAAAGCGAGGATATAGAAGCGGATCCATGCGGTATAACATCAATATATATCAAGCAAGAGAATTTTGGGAGTCACAATATCGAAATGAATGATTCAGCGTACCCTTGTGAACAGTGTGGAAAAAACCTAGACTACAAATGCGAATTCTGTGGCAAATGTTTGGCCAGTAAGAGCGGCCTGGCCATTCACAGATTGCGACACAcgaaagaatttaaatatagGTGCGATAAATGCGAACGCGATTTCCTTCGATTGTGCGACCTAAAAAGCCACCAGTATTCTCACCAGAGTACGCTTAATTTCTTTTGCGATAAGTGCGGGAAAGGGTTCAAGGCCAAGAGTTATTTAAAGGCGCATATGGACCGTCATCTGAGTATGGAGCGGTACAAGTGCGATATATGCGGGAAATGTATGTCCTCACCCCAGTATTTGCTTATCCATAAGCGCATACACACCGGCGAGAAGCCTTTCGTATGTGGTatttgcgagaaag
cttttgcaCTCAAACAAATGCTCAAGATACACATGCGAACCCACACGAAAGAGaagccttacGGTATAGAAGAGGGCCCCTACATCATAACACCATTAGATATTAAACAAGAAGCAGCATACACTTGCGAAGTATCGAGCAAAGAGAACTACGACAAATACACAAAGGTCAGTCACGAAAATGGTATAGACTGCAAACGCGAATGTTGTGGCAAGCAGTCTTCCAACAAGCAAAACTTTTCGATACACAGAACAAAGCACACTAAAAGAATTCAATACAAGTTCGATAAATACGAACGTCGCCATGGCATAGGACAGAATAAATATGATGTATGCGGGAAATCTTTGGCTTCCCTCCAGAAGCCCTTCATTTGTGCCGTCGCCAAGAAGGCTTTTGCAGTCAAAGGATCGTTTTGGTTACATACACGTACTCACGCGAAAGAAAAGCCTAACTGTTTTTTTTGCGACAAGAATTTTACGTATGCTAGGAGTTTAACGACACCTTTGAATACGCATGCCCGTGAAAAgctttacacatgtaaattCTGCGAAAAGGGCTTTACGCGTCCTGCACACCTGACGACACATTTGCGAACGCATAGgtttatacCAGCATGCGTCAAGAACGAGGGTACAGAAGAAGGTCCCTATATCATAACACCATTACGTATTAAACAAGAAAGTTCGGAACATCACAGTAATAAAGTAACATCACTATGTGTTAGGAAGGAGCTATTGCATCGCAGTCAGGAAACGATTGAACCTTTTAGTGCAGTGTATACTAGCTACGACAAACACATAAAGGCTAGTCACGAAAGGGGTGGCTGTGAGCAATTCCCCAGTGAGCAGTACTTGTCCATACATAGAATAGAGCACACGGAGAGGTTTATACCATCATCCGTCAAAAACGAGGATACAGAAGAGGGTCCCTATATCATAACAccattacatattaaacaagaaAGTTGGGAACAATACAGTAACAGCGTAACACCACTACGTGTGAGGAAGGAGCTATTGGATCGCAGTGAGGAAACGATGGAACCTTTCACTGCAGTGTACACTAGCTACGACAAACACATAAAGCCCAGTCACGAAAAGGGTGAATTCTCCAGTGAGGAgTCTTCAACATCACGCTGCAAAAACGAGGATGTAGAAGCGGATCCCTATATCATAACACCATTTGATGTTACACAAAAAATTTCACAACGATACAATAGTAAGGTAACCTCACTATGTGTTAAGAAGGTGCAATGGGATTGTCATGAGCAGAGAATTAATATTCGCGGTGCAGGGCACAGTTGTGAAGAGGACCGAAACGGTTGCAAGACTAAACAAACCTACGACAAACACATAAAGACTAGTCACGAAAGTGATAAAGTCTgcaaatgcaaattttgcagCAAGCAGTTATCCAGTAAGCAGTACTTGTCCATACTTAGAACAAAGCATACGAAAGCTGGTGGTATAGAAGGGCATAAGTGCGACGTGTGCGGGAAATCTTTGTATTCACTCCGGTTTTTAACATCATGCTTCAGAAAGGAGGATGTCGAAGTGGGACCTTATATCATAACACCATTAGATGTTAGACAAGATAATTCGCAATCacacaataataagaataataaggTAGCACCACTATGTATTAGTAAGGAACAATGGGATCGTCATGAGGACAAAGTGAAACTTGTCAGTGCAGCGTACACTTGTGAAGTGTGCGGAGACAGTTACAAGACCCAAGAAAGCTACGACAAACACAGAATGGCCAGTCACGAAAGGGGTAAAGGctacaaatgcaaattttgctgCAAGCAATTCTCCAGTAAGCAGTACTTGTCCATACATAGAACAGAGCACACGAAAGCTAGTGGTATAGGACAGCATAAGTTTTCAACATCATGCGTCAAAAACGAGGATGTAGAAGAGGGTCCCTATATCATAACACCATTAGATATTAGGCAAGAAAATTCACAACGATACAATGATAAGGTAGTGCCACCA
TGTGTCAAGGAGCAAAGGAATCGTCAGGAGGAGAAAATGAAAAGTCTCAGTGCAGCGTACACTTGTGACGAGTACGGTAACGGTTACAAGATCAAACAAAGCTACGACAAACACATAAAGGCCAGTCACGAAAAGGGCGAAGTCCAAATGCAAAAGTGCGATTCGGCGGGTGCACAACATCCTGGAGAAGGAAACCACTCTGAAGGTCAATGCGGCCCTCTGTGGGGAGTTTGTTGTAATCAATGGGATGAGGAGATTTTCGAATTCAAGTACTTGAAGCCACAGAATTCACCCATCTATCGCGACGCCGGCCTGGAGGATTGGTTCGAGGGAGATGTTTGCATTCCCATCTTCACACAGAGAATTTCAACCCCAAATTCCGGCGGTATTTACGCCTCCATAATAGCTACGACTTTTTTGACACAAAAAAAAGAGGAGAGTCCCGGCGTGGAGCACTTGCTATTCAAATTTCGTGCTTCCAAGCCGATATACTCATCGGCCTATCAGAGAAGTCCAGGTGGGCGTGACCTAGGTGAAAGGCCTTCAACATCACGCTTCAAAAACGAGGATGTAGAAGAGGGTCCTTATATTATAACACCATTAGATATTAGGCAAGAAAAATCACAACGATACAATGATAAGGTAGTGCCGCTACGTATCAGGAAGCAGCAATGGAATCGTCAGGAGGAGAAAATGAAACGTCTCAGTACAGCACACGCTCAGGAAGTGTGCAAAAATGGTTACAAGATTAAAGAAAGCTACGACAAACACATAAAGGCCAGTCACGAAAACGGCAAAGTCCAAATGCGAAAGTGCGATGTAAGCGGGAAACCTTTAGCTTTACCACAGCGCTTGAATTTGCACAAACGCATACACACCCGAGAGGAGCCTTTCATTTGTGCCATTTGCGAGAAAGCCTTCGCGGTCGAAGAATCGCTTCAGTTACATATGCAGACTCAcgcgaaagaaaaaaaacggaaaCTTAGCCAGAAGGGCTCTGCACGTAAGTCGCGCTTAACAAGACATTTGCCTAAGTTCATACCATCAGGGTTCAGAAAGGAAGATACAGAAGTGGGTATTTATGAGGTACCGGTATTATGGATTAAAGATGAAGTTTCGGATGGTTATGAGGAGGAAAAGAAACACACTGGTGCAACTTATACTTGTGACGAGTGCGGAAGCAGTTACGACACCAAAGAGAGCTACGACAAACACATAAAGGTCAGTCACGAAAAGGCTAAAGTctacaaatgcaaattttgctgCAAGAAATTCTCCAGTGAGCACTGCTTGTCCATACATAGATCAGAGCACACGAAAGCTAGTGGTATACTACCGCATAAGTTCATACCATCAGGGTTCAGAAAGGAAGATACAGAAGTGGGTATTTATGAGGTACCGGTATTATGGATTAAAGATGAAGTTTCGGATGGTTATGAGGAGGAAAAGAAACACACTGGTGCAACTTATACTTGTGACGAGTGCGGAAGCAGTTACGACACCAAAGAGAGCTACGACAAACACATAAAGCTCAGTCACGAAAAGGCTAAAGTctacaaatgcaaattttgctgCAAGAAATTCTCCAGTGAGCGCTGCTTGTCCATACATAGATCAGAGCACACGAAAGCTAGTGGTATACTACCGCATAAgtTTTTAACATCATACGTCAAATACGAGGATATAGAAGAGGGTCCCCATATCATAACACCATTAGACCTTAAACAAGAAGATCCGCAACAACACAATAAGAAGGTGACGCCATTGTATGTTAAGAAGAAACAATGGGAAGCTCAGGAGGAGAACATGGAATCCTTTAGTGCAGCCTACACATGCGAAGTGTGCGGAAGCGGTTACAAGACCGAAGAGAGCTATGGCAAACACATAAAGGCCGCTCACGAGATGGGTGAAATATACAAATGCGAATTTTGCAGCAAGCAGTACTCCAGTGAGCACCATTTATGCATACACAGAACAAAGCACCATGGTATAGAGCAGTT
CATACCACCAGGGTTCAAAAACGAAGATATAGAAGCCGGCATCTATGAGGTACCTGTATCGTGGATAAAAAATGACGTTTTGGATGGTTATGAGCATGAAAAGAAATACATTAGCGCAGCTTACACTTGCTACGAGTGTGAAAAGGGTTACAACACCAAAAAGAGCTACAACAAACACTTAAACGTCAGCCACGAAAAGGCTAAAGTCTACAGATGCAAATTTTGCTGCAAGCAGTCCATGATTCACAAAGGATCGCATCGCTTACATATGCGGACACACGCGAAAGAAAAGCCTTGCATATGTCAATTTTGCGACAAGCGTTTTGGGCATAAGTTCACACAATCAGGCTTCAGAAACGAAGATATAAAAGTGGACTCTTATGAGGTTCCGGTATTATGGATTAAAGATGAAGTTTCGGCTGGTTATGAGGAGGAAAAGAAACACATTGGTGTAATTTATACTTGTGACGAGTGCGGAAGCAGTTACGACACCAAAGAGAGCTACGATAAACACATAAAAGTCAGTCACGAAAAGGCTAAAGTCTACAAATGGAAATTGTGCTGCAAGCAGTCTTCTAGTAAGCATAACTTGTCCATACATAGAACAGAGGACACAAAAGCTGGTGGTATAGGACAGCACAAATGCGATGGATGTGGGAGACCTTTGGCTTCGCCCCAGCGCTTGAATCTACACAAACGCATACACACTGGAGAGAAGCCTTTCATTCACAAAGGATCGCATCGGTTTATACTATCTCAGACCAAGAGCGAGGACATAGAAGACGATCCCTATACCGTAACGCCATTAGAGATTAAAGAGGAAGAATTGGAGCCACACAGTAGAGGAATTCACCACAATATCGGAGAGTACAGTTGTGAACAGTGCGGAAACAGTTGGAATGATTACGTTGGTCACAGACAGGCGCATCAACAAAAGAAAGCGGGCCACAAATGCGAATTTTGTGACATGTATTTTGCTAGTAGGCATACTTTGGTTGTATCACAAAAGCATGCGAAAGAGTTTAAACACAACTGTAATAAATGCGAGGGCGGTCACTATAAATTGTGCGACCAGCAGGTTCACCAGGGTGGTACATCCAATTTCTGTTGCGATAAATGCGGGAAGAGTTTCAAGACCAGGTATTATTTAAAACGACATACGGTCGTGCACGATGATAACAAGAAGTACGTGTGTGATGTGTGTTCCGCCGTACTTCACCACAAGGATAGTTATCGTCGCCATATGGACCGTCACCAAAATACAGACCAGTATAAGTGCGATGTATGTCGGAAATCTTTGGCTTCACTCCAGGGCTTGCGTGTGCATAAACGCATACATACCGGAGAGAAGCCTTTCCTTTGTGCTATTTGTGAGAAGGACTTTAGAAGCAAAAGATTGCTTGTGGTGCACATGATCAAGAGCGAGGATATCGAAGAGGATCTCTATACCGTAACGCCATTATACATTAAACAAGAAGATGTGGAACAACTCAGTGGAGTTAACGATACCGGTGAAGTATATAAATCTGTGGAGTGCAAAAACGGTTACAAAAGTTGGGAGGATAACGTCGGACAGGAATACGCACACGATGACAACAAAAAGTTTATATCATCACAGTTCAAGAGCGAGGATTTCGAAGAGGATCGCTATACCGTAGCGCCATTATACATTAAACAAGAAGATTTGGAACAACTCACTGGAGTTAACGATACCGGTGAAGTGTATAAATCTGTGCAGTGCAAAAATAGTTACAAAGGTTGGGAGGATAACGTCGGACAGGAATACACACACGATGACAACAAAGAGTACATGTGTGATACCGCAGAGAAGCTCTTCACTTATGGTACTTGTGAGAAGATCAAGAGCGATCATATGGAAGAGGATCCCTATACTGTAATGCcattagatattaaaaaagaagatttgGGACGATCCAGTAGAATTAACGATACTAGTGGAGAGTATAATTCTGTACAGTGCAAAACCAGTTACAGCGGTT
GGGAAGCCAACGTCAAGCAGAACTACGACATAGGCGATGACAACAAGAAGTTTATACCGTCACAGATCAAGATCGAGGATATCGAAGAGGATCCCTATACCGTAACGCCATTATACATTAAACAAGAAGATTTGGGACGACCCAGTGCAGTTAACGATACCGGTGAAGTGTATAATTCTGTACAGTGCAAAAGCAGTTACAAGAGTTGGGAGGAACACGTACACGATGACAACAAAACGTACATGTTTGCTGCCGCAGAGAAGCCTTTTACTTGTGGTACTTGTGAGAAGGTTTTTGGAAGTAAACGAATGCTTACGGTACATCTGGTGACGCACGCGAACgaaaagccttacATCAAGATCGAGGATATCGAAGAGGATCTCTATACCGTAACGCCTTTATACATTAAACAAGAAGATTTGGGACGACCCAGTGCAGTTAACGATACCGGAGAAGTGTATAATTCTGTGCAGTACAAAAACAGTTACAAGAGTTGCGAGGAACACCCACACGATGACAACAAAAAGGACCTGCTTGATACCGCAGAGAAGTCTTTTGCCTGTGGTACTTGTGAGGAGGTTTTTGGAAGTAAACGAATGCTTACGATCAAGAGCGAGGATATCGAAGAGGATCTCTATACCGTAACGCCATTATACATTAAACAAGAAGATTTGGAACAACTCAGTGGAGTTAACGATACCGGTGAAGTGTATAAATCTGTGGAGTGCAAAAACAGTTACAAAAGTTGGGAGGATAACGTCGGACAGGAATACGCACACGATGACAACAAAAAGTACATGTTTGATACCGCGGAGAAGCTCTTCACTTGTGGTACTTGTGAGAAGGTTTTTGGAAGTAAACGAATTCTTACGGAACACCTGGTGACACAtgcgaaagaaaagccttacATCAAGAGCGAGGATATCGAAGAGGATCTCTATACCGTAACGCCATTATACATTAAACAAGAAGATGTGGAACAACTCAGTGGAGTTAACGATACCGGTGAAGTGTATAAATCTGTGGAGTGCAAAAACAGTTACAAAAGTTGGGAGGATAACGTCGGACAGGAATACTCACACGATGACAACAAGAAgTTTATACCTTCCCAGATCAAAAGTGAGGATCTCGAAGAGGGTTCCTATACCGTAACGTCATCAGATATTAAAGAAGAAGATTTGGAACGGGGCGGGTTTATACCATCTCAAATCAAAAGCGAGGATCTCGACGAGGATTCCTATACCGTAACGTCATTAGATATTAAAGAAGATTTGGAACGTGGAGGTTTTGCGCGAACTGAATTTGTAGCGGTTAAGGTACATTTGTTTTTTCTCTTACTTAAAAGCGAGGACCTGGAAGAGGGTCCCTATACCGTAACGTCATTAGATATTAAAGAAGAAGATTTGGAATTGGGAGGGTTTATACCATCTCAAATCAAAAGCGAGGATCTCGACGAGGATTCCTATACCGTAACGTCATTAGATATTAAAGAAGAAGATTTGGAACGTGGAGGTTTTGCGCGAGCTGAATTTGTAGCGGATCATTTTTCTAGGTTTATACCATATCAGACGAAGATCGAGGACATAGAAGCGGATTCCTATACCGTAGCATCATTCgatataaaagaagaaaatttggaGCGAAACAGTGGAGAATTTAAATACACTAGCGCAGAGTACAGTTCTGAACAATGCGAAAATAGTTACGAGAGTTGCAATGATTACGTCGGTCATACAGAGGTGCATCTCCAAAAGGATTCAGGCCACCAATACCTCTGCGATAAATGCAACTGCGGTTATGGTACCTTATGGGACCTACAAAACCATGAGAACGCTCATTCTACTCAAGATACTGAGGAGTATAAATGCGATGTATGCGAGAAATCTTTGCGTTCAGCCCATTATTTACGTGTGCATAAACGCATACATTCCGGAGAGAAGCATTTCACTTGTGCTAGTTGTGAGAAGGTTTTTAGAAGCAAATCATTGCTTTTGGTACAC
CTTGTGACACACGGCAAAAGCGAGGGTATCGAGGAGGATTCCTATTCCGTAACGTCATTAGATATTAAAGAAGAAGATTTAGAACGATACGGTagagaaattatgaaaaatgaTTACGTAGGTCACAGACAGGTGCGTCAACAAAAGAATCCAGGCCACAAATGCGAATTTTGCGACATACACTTCGCCAGTAAGCGTATCTTGGCGGTTCATACATCACAAAAGCATGCGACAAAGTTTAAATACGTCTGCGATAAATGCAACTGCTGTTATGGTACCTTGTGGGATCTCAAAAATCATCAGAACGCTcattataaGTTTATGACGTCTCCGGCTAAAAGCGAGGGGATCGAGGTGCATCCCTACACCATAACCTCATTAGATATTAAAGAAGAAGATTTGAAAGGACGCAGTGGAGAAATGAACCAGAGTAGCGAAGAGCACAGTTGTAAACAGTGCGAAAAAGGTTACAAGAGTTGGAATGATTACGTTGGTCACAGACTGGTGCATCACCAGAAGAATCCAGGCCACAAATGCGATTTTTGTAATATGCGCTATGCCACTAAGCACACCTTGGCTCTTCATAGATCACGAAAGCACACGAAGGAGTTTAAATATATCTGCGTTAAATGCGACCGCGGTTATGGTACGCTATGGGACCTCAAAAACCATCAGAACGTTCACCATAGTACTACTAATTTCTTTTGCGATAAATGCGGGAAGAGTTTCAAGACCAAACGTTATTTACAGCGACATATGGTGATACATGATGACGACAAAAAGTACGTGTGTGATGTGTGTTCAGCTGTACTTCATCGCAAGGAAAGTTATCGTCGTCACATGGATCGTCATCAAAATACAGAGCAGCATACTTGCAGTGTATGCGAAAAATCTTTATCTTCAGCCTATCACTTGCGTGTTCATAAACGCATACATACCGGAGAGAAGCCTTTCGCTTGTGCTATTTGCGAGAAGTTAATACAATCATCTGTCAAATACGAGGATATAGAAGAGGGTCCCTACGTTGTATCACCATTAGACATTGAACGAGAAATTCCGAAACGCGGCCGTTATGAAATAACGCCATTATGTCTTCAGAAAGACATTTGTGGTTCATAtaagaagaaaatgaaacacaATGGTGCACCACATACTTGCGAAGAGTTTATACAATCATTCGTCAAATACGAGGATATCGAAGAGGGTTCATACGTGGCAACACCATTACACATTAAACTAGAAACTACGAAACCATACAATAATGAGGTAACGTCAGTATATATTCAAGAAGACGTTTGTGGTCCTCATGGAAAGAAAGTAAAACACAATAGTTCAACATATACTTGTGGGGAGTGTGGAAACGGTTACAGGACCAAAGAAAGCTACGGCAAACATATACAAGCGGCGCACGAAAACTTAACGCAACACTTGCATACGCATACCGATGAAAGAccttacacatCACTATTGGACATTAAACAAGAAAGTTcgaaacaatacaataatgagGTAACGCCATTTTGTATTGAGGAAGACGTTTGTGATCCTGATGGGAAGGAAATGAAACACAGCAGCGGAGCATATACTTGTGAAGGGTGTGGAAACAGCTACCAGACCAAACAAAGTTACACCAAACACTTGCGAGCAGCTCACGAAAACTTAAAGGTGCACATTCCGACACATGCGGAATCACCTGTCAAACACGAGGATGCAGACGAGAGTCCCTACGTTGTAACACCATTAGACATTAAACAAGAAAGTTCGGAACGCGACAATAATGAACTAACGCCATTATGTATTCCGGAAGACGTCTGTGGTCCTTGTGGAAAAAAAGTGAACGACAATACTTGTGAAGATTGCGGATACGACTACGGGACTGAAAAGAGCTACGGCAAACACATGCAAACTGTTCACGAAAGCCTAACAAGACGTTTGCGTACGCATATCGTCTTGGCATTTTTGTATTCATGGGATCATATTTCCAGGTTTATACAATCACC
TGTTAAATACGAGGATATAGAAGAGAATCCTTACCTCATAACACCATTAGACATTAAACAAGAAAGTTCGAAACGATACAACAATGACGCAACGCCATTATATATTCAGAAAGGCGTTTGTGGTTCTCATGGGAAGAAAATAAGATGTAATAGCACAGCATATATTTATGAACAGTTTATACAATTACTCGTCAAACACGAGGATATAGAAGAGAGTCCCTACGTTATAACACCATTGGACGTTAAACAAGAGTGTTCGAAACTATGCAATAATGAGGGAATGccattttgtattcagaatgatGTCTGTGATCCTGATGGAAAGAAAATGAATGAAGAGTGCGGACAAGATTACGAAGGGAGCTACAGCAAACACACACAAGCGGTTCACGAAAACTTAACGGATCATTTgtttatacAATTACTCATCAAACACGAGGATATAGAAGAGGGTCTCCACGTTGTAACACCATTAGACATTAAACAAGAAAGTTCGAAACTATGCAATAATGAGGTAACGCCATTTTGTATTCACGAAGAGGTTTGTGATCCTGATGAGAAGACAATGAAACATAATAGCGACACATATACTGGTGAAGAGTGCGGAAAAGATGGGAAAGAGAGTTACAGCAAACACATACAAGCGGTTCACGATAACTTAACGGATCATTTGTTAATACAATATTTCGTCAAACACGAGGACATAGAAGAGAGTCCTTATACTGTAACACCGTTAGACATTAAACAAGAAAGTTCGAAGCTATACAGTGATGAGGTAACGTCAGTTTGTATTCTGAAACAAGTTTGTTCTCCTCAtgggaagaaaataaaatacaatagcAGATCAGCATATACTTATGAAGAGAGCGGAAATGGTTACTGGAACAATGAAAGCTACGGCAAACACATAAAAGCTATTCACGAAAACTTAACGAAACGTTTACGTACGCATACCGCTGAGAAGGCTTatacatTTTGCGGTAAGGGCTTCGCGCATAGTACGAACCTAGCGGCGCATATGCGTACGCATACAGGTGAGAAAGCTTTTCGGTGCTTACATTGTGATAGGAGATTTGGTACCAATGCACATTTAATTCGCCACAGAGAAGTATGCAAAGGGATAGCCTCAAAAGAGGATTATGGAATTGGATCACAGTCTATACAATTACTCATCAAATACGAGGACATAGAAGAGAGTTCCTACCATGTAACACCATTAGACATTAAACAGGAATGTTCGAAGCGATACAATAATGAGATAACGCCGGTATGTGTTCAGAAGGACGTTTGTGGTCCTCATGGGAAGAAAGTGAAACACAATAGCGCAACATATGCTTCTGAAGAGGGCGACGATGGTTACAGGAACAAAAAAAGCAAACGCATAAGAACGGTTCCCAAAAACCTAGAgcaacatttgcgtacacataccgctGAGAAGCCTTatacatTTTGCGGTAAGGGCTTCGCGCATAGTACGAACCTAGCGGCGCATATGCGTACGCATACAGGTGAGAAAGCTTTTCGGTGCTTACATTGTGATAGGAGATTTGGTACCAATGCACATTTAATTCGCCACAGAGAAGTATGCAAAGGGATAGCCTCAAAAGAGGATTATGGAATTGGATCACAGTCTATACAATTACTCATCAAATACGAGGACATAGAAGAGAGTTCCTACCATGTAACACCATTAGACATTAAACAGGAATGTTCGAAGCGATACAATAATGAGATAACGCCGTTATGTGTTCAGAAGGACGTTTGTGGTCCTCATGGGAAGAAAGTGAAACACAATAGCGCAACATATGCTTCTGAAGAGGGCGACGATGGTTACAGGAACAAAAAAAGCAAACGCATAAGAACGGAGGACATAGAAGAGAGTGTCTACCTTGTAACCCCATTAGACATTAAACAGGAATGTTCGGAACGATACATAAATGAGATACCGCCATTATGTGTTCAGAAAGACGATTGTGGTCCTGCTGAGGAGAAAATGAAAC
ACAACAGTGCAACATATACTTGTGAAGAGGGCGACGATGCTTACAGAATCAAAGAAAGCAAACGCATAAGAACGGTTCACAAAAACCTAGAGCAACATTTGCGTACGGATGCCGCAGAGAAGCTTTAcgcatgtaaattttgcgaaaaggaaTTTGACCGATCAGCAAGCCTGACGATACATTTGCGCACGCATACCGGGGAAAAGCCTCACACATGTCAATTTTGCGGCAAGGGTTTTGCGCACAAgtTTATACAATCATCTGTCAAATACGAAGATATAGACGAGGGTTCCTACATTGTAACACCATTAGGCATTAAACAAGAGAGTTCTCAGCGATACAATAATGAGGTAACGCCATTATGTTTTCGAAAAGAGGTATGGGATCATTGTGAGGAGAACATGATACACAATACTACAGCATATACTTGCCAAGAGTGCGGAAACAGTCACAAGACTAAAGAAAGCTATGACAAACACATACAAGCCGTTCATGAAAATTTAACGAAAcgtttgcgtacgcatacctgTGAAAAGCCATTTATACAATCACCTCTCAAATTCGAGGATATAGAAGAGGGCCCCTACATTGTAACCCTATCAGGCATTAAACAAGAAAGTTTGAATCTATACAATAATGAGATAACGCCATTACGTATTCAGAGGGAGGGTTGTGGTTCTCGTGGGAAAAATATGAAACACAATGGCGCTACACATATTTGTGAAGAGTGTGTAAACGGttacagaagcaaaaaaagGTTTATACAACCACCTGTCAAATTCGAGGATATAGAAGAGAGTCCGTACGTTGTAACACTAACAGACAAACAACAAAGTTCGAAGCTATACAATGATGAAGTAACGCCATTACGTATTCGGAAAGACATTTATGGTTCTTATGGCAAGAAGATGAAACACAATGATGAAGCATATGCTTGTGAAGAGGGTGGAAACGGTTACAGAACCAAAAAAAGCTACGACAAACACAGACAAGTGGTTCACGATAACTTTACGATGCATTTGCGTACGCTTACCGAATCACCTGTCAAACACGAGGATGTAGAAGAGGGTCCCTATTTTATAACACCATTAGACATTATACAAGACATTTCGAAGCGATACAGTAATGAGGTAACGCCATTAAGTATTAAAAAAGGCGTTTATGGTCCTCATGGGAAGAAACTTAATAGTGCAACATATACTTGTGAAGAGTGCGGAACCAGTTATAGGACCAAAGAAAGCTACGGCGAACACATGCAGGCGGTTCATGAAAACTTGCGTACGATTATACAATCACAAGTCAAAAATGAGGACATAGAAGAGTGTGCTTATGTTTTAACATCATTAGGTATTGAACAAGATAATTTGAAACACATTAACGATATTACATACATGTGCGACAAATGCGAATGCGATTACTTCAGGTTGTCGGACCTGAAATTCCATCAGCTCATCCAGCACAGCACGCTTAATTTCTTGTGCGATAAGTGCGGGAGAGGTTTCGagatcaaattttatttaaagcgaCATATTGCAACGCACGATGGCAACAAGGAGTACGTGTGCGAGATTTGTTCGGCCATACTCCATCACAAGGACAGCTACTGTCGCCATATGGAGCGGCACCAAGATCATCAAGCTATAGGGCGGCATGAGTGCGACGTATGCGGAAAATCCTTGTCTTCACCCAATGCTTTGCTCGTACATAAGCGCATACATACCGGAGAGAAGCCTTTCGTCTGTGCTATTTGCGACAAACTTTTTAGAACCAAGCAGATGGTACAGTTACACATACGTACACACACGAAGGAAAAGCTCCACGTATACAAGCTTTGCGAGAAAGATTTTGCACAAACTACACAACTAACGAAGCATTTACGcgtacataccggtgaaaagccagccttacatatgTTTATATATTCACCCGTGAAAGGCGAGGATATAGAAGAGGGTTCCTATATCGTAACACCATTAGAAATTAAACAAGAAAGTTCAAAA
GGATATCCTAATGAGGTAACGGCATCATGTATTCTAAAGAAGGTTTGGGATCGCTATGAGGAAATGAACCACATTAGAGCAGCAAACACGCCTGAAGAGAGCGGACACGGTGACAAGACTAAAGAGACGAAGTACCTCTGCAAGTTCTGTTGGGCCGTATTCCATCAGAAGTTTATATATTCACCCGTGAAAGGCGAGGATATAGAAGAGGGTCCCTATATCGTAACACCATTAGATATTAAACAAGAAAGTTCAAAAGGATATGGTAATGAGGTAACGGCATCATGTATTCTAAAGAAGGTTTGGGATCGCTATGAGGAAATGAACCACAATAGTGCAGCAAACACGTCTGAAGAGAGCGGACACGGTAACAAGACTAAAGAGATGAAGTTCATACAATTCCCTATCAAAATGGAGGATATAAAGGAGAGTCTACACGTTATAACGTCATTAGATATTAAACAACAGCATTCGACACGACATGACAATGAGGCCTTATGTATGAAGAAAGGAGTTTGGGATCGTCATGAGGAGAAAGTGAAGCACAATGATGAAGAGTTATTACAAACAGCTGTCAAATACGAGGATATAGAAGAGGGTCCCTACGTTCTAACACCATTAGATATTAAGCAAGAAATTAGGAAACGACATAATAATGGGCTAATGCAATTATGTATTCAGAAAGACGATCGCGGTCTACATGAGACGAAAATGAAATGTTCACCGTTAGATGCTGAGCAAGAAAATTCGGAACAACGCAATAATGGGATAACCCCATTGTGTGTTAAGAAAGAGATTTGGAATCGCCACGAGAAGAAACTGAAGCGCAATGGTGCAGCCTATACTTGTGGACAGAGCGAAAATAATATGAAGGCTCGCGTAAGTTGCGACAGATATGAAAAAGCGCAACATAAAATGCACTCGCAGTATAGTCATAGCAATGGAAAGCCGTTTTTACAAGCACTCGTCAAAAATGAAGATATAGAAGAGGGTCCCTATGTTATCACACCGTTAGATGCTGAACAAGAAAATTCGAAACAAGGCAATAATGGATTAACGCCACTGTTTTTACAAGCACTCGCCAAAGAAGAAGATGTAGAAGAGGGTCCCTATGTTATAATACCGTTAGATGCTGAACAAGAGAATTCGAAACAACACAATATGACGCCATTCTCTGTTAAGAAAGAGGTTTGGAATCGTCATGAGGAGAAATTGGAGCGCAACGGTGCAGCATATGCTTGTGAACAGGACGAAAATAGTGTCAAGGCTCCCCATGTTTCCAGGTTGATATGGTCCCAAGTCAAGCACGAAAATATAGAAGAAGGTCCCTACGTTGTAATACCACTGaatatgaaacaagaaaattcaGAACGGCGCCATAGAGAAACCAACCACTTGTACACTTCTGAAGGCCACGTAGAAAGTTTTAATATTAGGAGATACGAAACGAACCAAGACTACAAGTGCGAATTCTGCGAGAAGCACTTCGCCAGCAAGATATCCCTAGCTAATCACAGAACGCAGCACGCGACAGGACTTAAATGCGAGAGCAGTTACCCTAGATTGTGCGACCTAGAAAATCATCAGAGCATTTATCAAGATACCCTCAGACTTTCTTGCGTTAAATGCGGAAAGGTTTTTAAGAGCCGGAAATATTTAAGCCAACATATGGCGACGCACCGTCCCAAAACGGAAGAGTACACGTGTGAGGTGTGCGCAACCGTGCTCCACCACAAGAAGTCCTATTATCGTCACATGGCGCGCCACCAAGGCAAGGGACAGCATCAATGCGATATCTGTGGCAAATCGCTGTCCAGAGCCGAATATTTGGCCCCCCATAAACGCATACACAGCGGTGAAAAGCCTTTCGTTTGTACTATTTGCGAGAGAGCTTTCACAACGAAACGATTGCTTGTACTACACATACGGACACATGCGAAACAGAAGCCTCATTGA
Protein Sequence
MPSCIKNEDIEEGTYFVTSLDIEQAMSKRYKNGLTSLCVKKEEGSHIESLRSTEQHDCDICEKSFCSAFDLHMHKRIHTGEKSFVCAICEEVFGTKQLLRSHILSHTNEKPYLCKYCDKGFARKSYLTPHLRTHTGEKPYICGFCSKGFAQPTHLTRHLRTHTGEKPYICKFCDKGFAHTTNLAVHLRTHTGEKPYVCKFCDKDFAHQFTPSCFKTEDMEEGPYIITSLDVKQEKSQRQNNKITQLWVKTELPDRNEEKMKPFSVVYTCDEHINASHEKSNMYKCKCCCKHFSSKQDLSVYGTDQMKEFRYKCNDCEPHHGTGQHECECGKSFASLQCLNVHKRIHTGEKPFKPSCFKSEDIEEDSCIITPLDVKQAISKRYKNEATPLCVKKQVLDHYTEETKHNSTSYACEECGNNYSIKGGCGKHLKTLNEEGEIYKCEFCCDKCVKDFKAKLYLDRQMATHGSSKEEYVCKVCSTILHGKGSFHVHMELHQSAAQLKCDICEKWFCSASDMHVHKRTHTEEMPFLCVICEEAFRTKELLRSHVLGHTNEKPYICKFCGKVFWRTTDLARHLRTHSGEKPYMFTPAGFKSEDIEVSIPEVPVLWIKDEISDSHEKQKQQNSAAFACYDCGNGYKSKGSYEKHIKTVHENDKFYKCEFCGKHFSSKYNLSVHRTQHTKEFRYRCDKCERGYLRLYDLKHHQNVHHNTPNFFCGQCGKAFKIKRYLKEHMTIHDSSKEKYACGVCSAVLHQEKSYNRHMDRHQGKGQHKCDICEKSLSSASGLLEHRRIHTCEKPFICAICERTFAAKGSLRLHMRTHTKEKPYVCKFCDKYFAWTTSLAIHLRQHSGEKPHKCLHCGRRFTCTSNLNQHKCKRVQFTPACLKIEDVEEDPYIITPFDVTQESSQRYNDKVIPLCVKKEQWDHQDEEAKLFSVAYTHEVRGNDYKIKESYDKDLKPIHEKGKVYKCKFCGMQFFSEHYLSVHRTGHLKDGGTVQHKCDVCGKPLASLQSLNMHKRIHTGEKPFVCAICEKGFAVKGSLRFTPSRLKNEDVEEGPYIITPLDIKQESSQPYNNKVTPLYVKKEQWDRQEEEFCSMQFFSKHYLSIHRTKHTEGGGIVQHKCDVCGKSLASLQSLNMHKRIHTGEKPFICPICKKAFAVKGSLRFIPTCVKSEDIEADPCGITSIYIKQENFGSHNIEMNDSAYPCEQCGKNLDYKCEFCGKCLASKSGLAIHRLRHTKEFKYRCDKCERDFLRLCDLKSHQYSHQSTLNFFCDKCGKGFKAKSYLKAHMDRHLSMERYKCDICGKCMSSPQYLLIHKRIHTGEKPFVCGICEKAFALKQMLKIHMRTHTKEKPYGIEEGPYIITPLDIKQEAAYTCEVSSKENYDKYTKVSHENGIDCKRECCGKQSSNKQNFSIHRTKHTKRIQYKFDKYERRHGIGQNKYDVCGKSLASLQKPFICAVAKKAFAVKGSFWLHTRTHAKEKPNCFFCDKNFTYARSLTTPLNTHAREKLYTCKFCEKGFTRPAHLTTHLRTHRFIPACVKNEGTEEGPYIITPLRIKQESSEHHSNKVTSLCVRKELLHRSQETIEPFSAVYTSYDKHIKASHERGGCEQFPSEQYLSIHRIEHTERFIPSSVKNEDTEEGPYIITPLHIKQESWEQYSNSVTPLRVRKELLDRSEETMEPFTAVYTSYDKHIKPSHEKGEFSSEESSTSRCKNEDVEADPYIITPFDVTQKISQRYNSKVTSLCVKKVQWDCHEQRINIRGAGHSCEEDRNGCKTKQTYDKHIKTSHESDKVCKCKFCSKQLSSKQYLSILRTKHTKAGGIEGHKCDVCGKSLYSLRFLTSCFRKEDVEVGPYIITPLDVRQDNSQSHNNKNNKVAPLCISKEQWDRHEDKVKLVSAAYTCEVCGDSYKTQESYDKHRMASHERGKGYKCKFCCKQFSSKQYLSIHRTEHTKASGIGQHKFSTSCVKNEDVEEGPYIITPLDIRQENSQRYNDKVVPP
CVKEQRNRQEEKMKSLSAAYTCDEYGNGYKIKQSYDKHIKASHEKGEVQMQKCDSAGAQHPGEGNHSEGQCGPLWGVCCNQWDEEIFEFKYLKPQNSPIYRDAGLEDWFEGDVCIPIFTQRISTPNSGGIYASIIATTFLTQKKEESPGVEHLLFKFRASKPIYSSAYQRSPGGRDLGERPSTSRFKNEDVEEGPYIITPLDIRQEKSQRYNDKVVPLRIRKQQWNRQEEKMKRLSTAHAQEVCKNGYKIKESYDKHIKASHENGKVQMRKCDVSGKPLALPQRLNLHKRIHTREEPFICAICEKAFAVEESLQLHMQTHAKEKKRKLSQKGSARKSRLTRHLPKFIPSGFRKEDTEVGIYEVPVLWIKDEVSDGYEEEKKHTGATYTCDECGSSYDTKESYDKHIKVSHEKAKVYKCKFCCKKFSSEHCLSIHRSEHTKASGILPHKFIPSGFRKEDTEVGIYEVPVLWIKDEVSDGYEEEKKHTGATYTCDECGSSYDTKESYDKHIKLSHEKAKVYKCKFCCKKFSSERCLSIHRSEHTKASGILPHKFLTSYVKYEDIEEGPHIITPLDLKQEDPQQHNKKVTPLYVKKKQWEAQEENMESFSAAYTCEVCGSGYKTEESYGKHIKAAHEMGEIYKCEFCSKQYSSEHHLCIHRTKHHGIEQFIPPGFKNEDIEAGIYEVPVSWIKNDVLDGYEHEKKYISAAYTCYECEKGYNTKKSYNKHLNVSHEKAKVYRCKFCCKQSMIHKGSHRLHMRTHAKEKPCICQFCDKRFGHKFTQSGFRNEDIKVDSYEVPVLWIKDEVSAGYEEEKKHIGVIYTCDECGSSYDTKESYDKHIKVSHEKAKVYKWKLCCKQSSSKHNLSIHRTEDTKAGGIGQHKCDGCGRPLASPQRLNLHKRIHTGEKPFIHKGSHRFILSQTKSEDIEDDPYTVTPLEIKEEELEPHSRGIHHNIGEYSCEQCGNSWNDYVGHRQAHQQKKAGHKCEFCDMYFASRHTLVVSQKHAKEFKHNCNKCEGGHYKLCDQQVHQGGTSNFCCDKCGKSFKTRYYLKRHTVVHDDNKKYVCDVCSAVLHHKDSYRRHMDRHQNTDQYKCDVCRKSLASLQGLRVHKRIHTGEKPFLCAICEKDFRSKRLLVVHMIKSEDIEEDLYTVTPLYIKQEDVEQLSGVNDTGEVYKSVECKNGYKSWEDNVGQEYAHDDNKKFISSQFKSEDFEEDRYTVAPLYIKQEDLEQLTGVNDTGEVYKSVQCKNSYKGWEDNVGQEYTHDDNKEYMCDTAEKLFTYGTCEKIKSDHMEEDPYTVMPLDIKKEDLGRSSRINDTSGEYNSVQCKTSYSGWEANVKQNYDIGDDNKKFIPSQIKIEDIEEDPYTVTPLYIKQEDLGRPSAVNDTGEVYNSVQCKSSYKSWEEHVHDDNKTYMFAAAEKPFTCGTCEKVFGSKRMLTVHLVTHANEKPYIKIEDIEEDLYTVTPLYIKQEDLGRPSAVNDTGEVYNSVQYKNSYKSCEEHPHDDNKKDLLDTAEKSFACGTCEEVFGSKRMLTIKSEDIEEDLYTVTPLYIKQEDLEQLSGVNDTGEVYKSVECKNSYKSWEDNVGQEYAHDDNKKYMFDTAEKLFTCGTCEKVFGSKRILTEHLVTHAKEKPYIKSEDIEEDLYTVTPLYIKQEDVEQLSGVNDTGEVYKSVECKNSYKSWEDNVGQEYSHDDNKKFIPSQIKSEDLEEGSYTVTSSDIKEEDLERGGFIPSQIKSEDLDEDSYTVTSLDIKEDLERGGFARTEFVAVKVHLFFLLLKSEDLEEGPYTVTSLDIKEEDLELGGFIPSQIKSEDLDEDSYTVTSLDIKEEDLERGGFARAEFVADHFSRFIPYQTKIEDIEADSYTVASFDIKEENLERNSGEFKYTSAEYSSEQCENSYESCNDYVGHTEVHLQKDSGHQYLCDKCNCGYGTLWDLQNHENAHSTQDTEEYKCDVCEKSLRSAHYLRVHKRIHSGEKHFTCASCEKVFRSKSLLLVH
LVTHGKSEGIEEDSYSVTSLDIKEEDLERYGREIMKNDYVGHRQVRQQKNPGHKCEFCDIHFASKRILAVHTSQKHATKFKYVCDKCNCCYGTLWDLKNHQNAHYKFMTSPAKSEGIEVHPYTITSLDIKEEDLKGRSGEMNQSSEEHSCKQCEKGYKSWNDYVGHRLVHHQKNPGHKCDFCNMRYATKHTLALHRSRKHTKEFKYICVKCDRGYGTLWDLKNHQNVHHSTTNFFCDKCGKSFKTKRYLQRHMVIHDDDKKYVCDVCSAVLHRKESYRRHMDRHQNTEQHTCSVCEKSLSSAYHLRVHKRIHTGEKPFACAICEKLIQSSVKYEDIEEGPYVVSPLDIEREIPKRGRYEITPLCLQKDICGSYKKKMKHNGAPHTCEEFIQSFVKYEDIEEGSYVATPLHIKLETTKPYNNEVTSVYIQEDVCGPHGKKVKHNSSTYTCGECGNGYRTKESYGKHIQAAHENLTQHLHTHTDERPYTSLLDIKQESSKQYNNEVTPFCIEEDVCDPDGKEMKHSSGAYTCEGCGNSYQTKQSYTKHLRAAHENLKVHIPTHAESPVKHEDADESPYVVTPLDIKQESSERDNNELTPLCIPEDVCGPCGKKVNDNTCEDCGYDYGTEKSYGKHMQTVHESLTRRLRTHIVLAFLYSWDHISRFIQSPVKYEDIEENPYLITPLDIKQESSKRYNNDATPLYIQKGVCGSHGKKIRCNSTAYIYEQFIQLLVKHEDIEESPYVITPLDVKQECSKLCNNEGMPFCIQNDVCDPDGKKMNEECGQDYEGSYSKHTQAVHENLTDHLFIQLLIKHEDIEEGLHVVTPLDIKQESSKLCNNEVTPFCIHEEVCDPDEKTMKHNSDTYTGEECGKDGKESYSKHIQAVHDNLTDHLLIQYFVKHEDIEESPYTVTPLDIKQESSKLYSDEVTSVCILKQVCSPHGKKIKYNSRSAYTYEESGNGYWNNESYGKHIKAIHENLTKRLRTHTAEKAYTFCGKGFAHSTNLAAHMRTHTGEKAFRCLHCDRRFGTNAHLIRHREVCKGIASKEDYGIGSQSIQLLIKYEDIEESSYHVTPLDIKQECSKRYNNEITPVCVQKDVCGPHGKKVKHNSATYASEEGDDGYRNKKSKRIRTVPKNLEQHLRTHTAEKPYTFCGKGFAHSTNLAAHMRTHTGEKAFRCLHCDRRFGTNAHLIRHREVCKGIASKEDYGIGSQSIQLLIKYEDIEESSYHVTPLDIKQECSKRYNNEITPLCVQKDVCGPHGKKVKHNSATYASEEGDDGYRNKKSKRIRTEDIEESVYLVTPLDIKQECSERYINEIPPLCVQKDDCGPAEEKMKHNSATYTCEEGDDAYRIKESKRIRTVHKNLEQHLRTDAAEKLYACKFCEKEFDRSASLTIHLRTHTGEKPHTCQFCGKGFAHKFIQSSVKYEDIDEGSYIVTPLGIKQESSQRYNNEVTPLCFRKEVWDHCEENMIHNTTAYTCQECGNSHKTKESYDKHIQAVHENLTKRLRTHTCEKPFIQSPLKFEDIEEGPYIVTLSGIKQESLNLYNNEITPLRIQREGCGSRGKNMKHNGATHICEECVNGYRSKKRFIQPPVKFEDIEESPYVVTLTDKQQSSKLYNDEVTPLRIRKDIYGSYGKKMKHNDEAYACEEGGNGYRTKKSYDKHRQVVHDNFTMHLRTLTESPVKHEDVEEGPYFITPLDIIQDISKRYSNEVTPLSIKKGVYGPHGKKLNSATYTCEECGTSYRTKESYGEHMQAVHENLRTIIQSQVKNEDIEECAYVLTSLGIEQDNLKHINDITYMCDKCECDYFRLSDLKFHQLIQHSTLNFLCDKCGRGFEIKFYLKRHIATHDGNKEYVCEICSAILHHKDSYCRHMERHQDHQAIGRHECDVCGKSLSSPNALLVHKRIHTGEKPFVCAICDKLFRTKQMVQLHIRTHTKEKLHVYKLCEKDFAQTTQLTKHLRVHTGEKPALHMFIYSPVKGEDIEEGSYIVTPLEIKQESSK
GYPNEVTASCILKKVWDRYEEMNHIRAANTPEESGHGDKTKETKYLCKFCWAVFHQKFIYSPVKGEDIEEGPYIVTPLDIKQESSKGYGNEVTASCILKKVWDRYEEMNHNSAANTSEESGHGNKTKEMKFIQFPIKMEDIKESLHVITSLDIKQQHSTRHDNEALCMKKGVWDRHEEKVKHNDEELLQTAVKYEDIEEGPYVLTPLDIKQEIRKRHNNGLMQLCIQKDDRGLHETKMKCSPLDAEQENSEQRNNGITPLCVKKEIWNRHEKKLKRNGAAYTCGQSENNMKARVSCDRYEKAQHKMHSQYSHSNGKPFLQALVKNEDIEEGPYVITPLDAEQENSKQGNNGLTPLFLQALAKEEDVEEGPYVIIPLDAEQENSKQHNMTPFSVKKEVWNRHEEKLERNGAAYACEQDENSVKAPHVSRLIWSQVKHENIEEGPYVVIPLNMKQENSERRHRETNHLYTSEGHVESFNIRRYETNQDYKCEFCEKHFASKISLANHRTQHATGLKCESSYPRLCDLENHQSIYQDTLRLSCVKCGKVFKSRKYLSQHMATHRPKTEEYTCEVCATVLHHKKSYYRHMARHQGKGQHQCDICGKSLSRAEYLAPHKRIHSGEKPFVCTICERAFTTKRLLVLHIRTHAKQKPH

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-