Caur001749.1
Basic Information
- Insect
- Cetonia aurata
- Gene Symbol
- ZEB2
- Assembly
- GCA_949128085.1
- Location
- OX421882.1:4710647-4888487[-]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The zinc finger follows the pattern #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and the numbers (including the ranges in brackets) give the number of residues. The positions marked # are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined as the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
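| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 84 | 0.012 | 2.1 | 10.5 | 5.6 | 1 | 23 | 56 | 78 | 56 | 78 | 0.96 |
| 2 | 84 | 0.00094 | 0.17 | 14.0 | 0.1 | 1 | 23 | 84 | 106 | 84 | 106 | 0.98 |
| 3 | 84 | 6e-05 | 0.011 | 17.7 | 1.2 | 1 | 23 | 112 | 134 | 112 | 134 | 0.96 |
| 4 | 84 | 1.9e-05 | 0.0036 | 19.3 | 1.6 | 1 | 23 | 140 | 162 | 140 | 162 | 0.98 |
| 5 | 84 | 5.4e-05 | 0.01 | 17.9 | 1.5 | 1 | 23 | 168 | 190 | 168 | 190 | 0.98 |
| 6 | 84 | 3.6 | 6.6e+02 | 2.7 | 1.9 | 1 | 13 | 196 | 208 | 196 | 210 | 0.91 |
| 7 | 84 | 0.046 | 8.4 | 8.6 | 2.3 | 5 | 23 | 328 | 346 | 325 | 346 | 0.96 |
| 8 | 84 | 3.6 | 6.6e+02 | 2.7 | 1.6 | 1 | 23 | 472 | 494 | 472 | 494 | 0.97 |
| 9 | 84 | 0.074 | 14 | 8.0 | 3.4 | 2 | 23 | 501 | 522 | 500 | 522 | 0.97 |
| 10 | 84 | 0.0066 | 1.2 | 11.3 | 0.1 | 1 | 23 | 528 | 550 | 528 | 550 | 0.93 |
| 11 | 84 | 0.00016 | 0.029 | 16.4 | 0.8 | 1 | 23 | 556 | 578 | 556 | 578 | 0.98 |
| 12 | 84 | 0.0013 | 0.24 | 13.5 | 0.4 | 1 | 23 | 627 | 650 | 627 | 650 | 0.97 |
| 13 | 84 | 0.00014 | 0.025 | 16.6 | 4.7 | 1 | 23 | 656 | 678 | 656 | 678 | 0.97 |
| 14 | 84 | 0.11 | 20 | 7.5 | 4.5 | 1 | 23 | 684 | 706 | 684 | 707 | 0.96 |
| 15 | 84 | 0.0005 | 0.091 | 14.8 | 1.1 | 1 | 23 | 712 | 734 | 712 | 734 | 0.97 |
| 16 | 84 | 0.59 | 1.1e+02 | 5.2 | 0.6 | 1 | 23 | 741 | 763 | 741 | 763 | 0.97 |
| 17 | 84 | 0.00034 | 0.063 | 15.3 | 0.5 | 1 | 23 | 769 | 791 | 769 | 791 | 0.98 |
| 18 | 84 | 9.4e-05 | 0.017 | 17.1 | 0.2 | 1 | 23 | 797 | 819 | 797 | 819 | 0.98 |
| 19 | 84 | 0.081 | 15 | 7.9 | 0.6 | 1 | 23 | 825 | 847 | 825 | 847 | 0.98 |
| 20 | 84 | 6.2 | 1.1e+03 | 1.9 | 9.5 | 1 | 21 | 853 | 873 | 853 | 874 | 0.93 |
| 21 | 84 | 0.11 | 21 | 7.4 | 3.7 | 1 | 23 | 965 | 987 | 965 | 987 | 0.96 |
| 22 | 84 | 0.0024 | 0.45 | 12.7 | 0.5 | 1 | 23 | 996 | 1018 | 996 | 1018 | 0.98 |
| 23 | 84 | 2.1 | 3.9e+02 | 3.4 | 0.0 | 1 | 17 | 1024 | 1040 | 1024 | 1040 | 0.92 |
| 24 | 84 | 0.001 | 0.19 | 13.9 | 1.0 | 1 | 23 | 1119 | 1141 | 1119 | 1141 | 0.98 |
| 25 | 84 | 0.16 | 30 | 6.9 | 0.0 | 1 | 17 | 1147 | 1163 | 1147 | 1163 | 0.91 |
| 26 | 84 | 0.0004 | 0.073 | 15.1 | 1.3 | 1 | 23 | 1214 | 1236 | 1214 | 1236 | 0.98 |
| 27 | 84 | 0.81 | 1.5e+02 | 4.7 | 4.1 | 1 | 23 | 1242 | 1264 | 1242 | 1264 | 0.98 |
| 28 | 84 | 3.1e-05 | 0.0057 | 18.6 | 1.2 | 1 | 23 | 1270 | 1292 | 1270 | 1292 | 0.97 |
| 29 | 84 | 0.00076 | 0.14 | 14.2 | 1.3 | 1 | 23 | 1298 | 1320 | 1298 | 1320 | 0.99 |
| 30 | 84 | 4e-05 | 0.0073 | 18.3 | 0.2 | 1 | 23 | 1326 | 1348 | 1326 | 1348 | 0.98 |
| 31 | 84 | 3 | 5.5e+02 | 2.9 | 0.6 | 3 | 17 | 1485 | 1499 | 1484 | 1505 | 0.90 |
| 32 | 84 | 5.3e-06 | 0.00097 | 21.0 | 3.2 | 1 | 23 | 1511 | 1533 | 1511 | 1533 | 0.99 |
| 33 | 84 | 2.4 | 4.3e+02 | 3.3 | 0.3 | 1 | 16 | 1826 | 1841 | 1826 | 1843 | 0.80 |
| 34 | 84 | 0.028 | 5.2 | 9.3 | 0.5 | 1 | 23 | 1901 | 1924 | 1901 | 1924 | 0.95 |
| 35 | 84 | 2.8 | 5.2e+02 | 3.0 | 6.7 | 1 | 23 | 1930 | 1952 | 1930 | 1952 | 0.96 |
| 36 | 84 | 0.00038 | 0.07 | 15.2 | 0.0 | 1 | 23 | 2298 | 2320 | 2298 | 2320 | 0.98 |
| 37 | 84 | 0.002 | 0.36 | 12.9 | 0.5 | 1 | 23 | 2387 | 2410 | 2387 | 2410 | 0.98 |
| 38 | 84 | 0.0042 | 0.77 | 11.9 | 0.6 | 1 | 23 | 2490 | 2513 | 2490 | 2513 | 0.97 |
| 39 | 84 | 0.05 | 9.1 | 8.5 | 0.1 | 1 | 23 | 2610 | 2633 | 2610 | 2633 | 0.96 |
| 40 | 84 | 1.1 | 2.1e+02 | 4.2 | 11.5 | 1 | 23 | 2639 | 2661 | 2639 | 2662 | 0.96 |
| 41 | 84 | 0.0017 | 0.3 | 13.2 | 2.0 | 1 | 23 | 2708 | 2731 | 2708 | 2731 | 0.97 |
| 42 | 84 | 0.002 | 0.36 | 12.9 | 0.5 | 1 | 23 | 2820 | 2843 | 2820 | 2843 | 0.98 |
| 43 | 84 | 0.012 | 2.1 | 10.5 | 0.3 | 1 | 23 | 2880 | 2902 | 2880 | 2902 | 0.98 |
| 44 | 84 | 2.8 | 5.2e+02 | 3.0 | 0.2 | 1 | 16 | 2983 | 2998 | 2983 | 3000 | 0.92 |
| 45 | 84 | 7e-05 | 0.013 | 17.5 | 5.3 | 1 | 23 | 3035 | 3057 | 3035 | 3057 | 0.95 |
| 46 | 84 | 0.066 | 12 | 8.2 | 2.7 | 1 | 23 | 3063 | 3085 | 3063 | 3085 | 0.98 |
| 47 | 84 | 0.0039 | 0.71 | 12.0 | 0.4 | 1 | 23 | 3091 | 3113 | 3091 | 3113 | 0.99 |
| 48 | 84 | 0.012 | 2.3 | 10.4 | 0.1 | 1 | 21 | 3119 | 3139 | 3119 | 3140 | 0.92 |
| 49 | 84 | 0.0008 | 0.15 | 14.2 | 0.4 | 1 | 23 | 3423 | 3445 | 3423 | 3445 | 0.98 |
| 50 | 84 | 0.065 | 12 | 8.2 | 0.3 | 1 | 23 | 3608 | 3630 | 3608 | 3630 | 0.97 |
| 51 | 84 | 0.55 | 1e+02 | 5.3 | 0.6 | 1 | 23 | 3924 | 3946 | 3924 | 3946 | 0.94 |
| 52 | 84 | 0.00044 | 0.081 | 15.0 | 2.0 | 1 | 23 | 3954 | 3976 | 3954 | 3976 | 0.99 |
| 53 | 84 | 0.00031 | 0.057 | 15.5 | 0.4 | 1 | 23 | 3982 | 4004 | 3982 | 4004 | 0.98 |
| 54 | 84 | 4.3 | 7.9e+02 | 2.4 | 2.8 | 1 | 23 | 4053 | 4076 | 4053 | 4076 | 0.90 |
| 55 | 84 | 0.021 | 3.8 | 9.7 | 3.3 | 1 | 23 | 4082 | 4104 | 4082 | 4104 | 0.97 |
| 56 | 84 | 0.14 | 25 | 7.1 | 2.8 | 1 | 23 | 4148 | 4170 | 4148 | 4171 | 0.94 |
| 57 | 84 | 0.41 | 75 | 5.7 | 5.7 | 1 | 23 | 4177 | 4200 | 4177 | 4200 | 0.96 |
| 58 | 84 | 0.0038 | 0.7 | 12.0 | 0.6 | 1 | 23 | 4206 | 4228 | 4206 | 4229 | 0.95 |
| 59 | 84 | 2.9e-06 | 0.00052 | 21.9 | 2.7 | 1 | 23 | 4234 | 4256 | 4234 | 4256 | 0.96 |
| 60 | 84 | 0.025 | 4.5 | 9.5 | 1.2 | 1 | 23 | 4262 | 4284 | 4262 | 4284 | 0.98 |
| 61 | 84 | 0.001 | 0.19 | 13.8 | 3.4 | 1 | 23 | 4290 | 4312 | 4290 | 4312 | 0.98 |
| 62 | 84 | 0.015 | 2.7 | 10.2 | 0.1 | 1 | 23 | 4447 | 4470 | 4447 | 4470 | 0.95 |
| 63 | 84 | 0.0015 | 0.27 | 13.3 | 0.7 | 1 | 23 | 4528 | 4551 | 4528 | 4551 | 0.97 |
| 64 | 84 | 0.021 | 3.8 | 9.7 | 0.2 | 2 | 23 | 4616 | 4638 | 4616 | 4638 | 0.97 |
| 65 | 84 | 0.00016 | 0.03 | 16.3 | 0.5 | 5 | 23 | 4992 | 5010 | 4991 | 5010 | 0.96 |
| 66 | 84 | 0.00069 | 0.13 | 14.4 | 1.1 | 1 | 20 | 5016 | 5035 | 5016 | 5037 | 0.94 |
| 67 | 84 | 7.6 | 1.4e+03 | 1.7 | 0.2 | 13 | 23 | 5132 | 5142 | 5127 | 5142 | 0.89 |
| 68 | 84 | 0.00016 | 0.03 | 16.3 | 0.5 | 5 | 23 | 5150 | 5168 | 5149 | 5168 | 0.96 |
| 69 | 84 | 0.00069 | 0.13 | 14.4 | 1.1 | 1 | 20 | 5174 | 5193 | 5174 | 5195 | 0.94 |
| 70 | 84 | 3.8e-05 | 0.0069 | 18.4 | 0.7 | 1 | 23 | 5376 | 5398 | 5376 | 5398 | 0.98 |
| 71 | 84 | 4.8 | 8.8e+02 | 2.3 | 2.0 | 1 | 13 | 5404 | 5416 | 5404 | 5417 | 0.91 |
| 72 | 84 | 0.52 | 96 | 5.3 | 1.4 | 1 | 23 | 5475 | 5498 | 5475 | 5498 | 0.95 |
| 73 | 84 | 0.00053 | 0.097 | 14.7 | 0.2 | 1 | 23 | 5733 | 5756 | 5733 | 5756 | 0.96 |
| 74 | 84 | 6.9 | 1.3e+03 | 1.8 | 2.9 | 1 | 23 | 5797 | 5820 | 5797 | 5820 | 0.95 |
| 75 | 84 | 0.0012 | 0.21 | 13.7 | 0.6 | 1 | 23 | 5825 | 5847 | 5825 | 5847 | 0.95 |
| 76 | 84 | 0.57 | 1e+02 | 5.2 | 4.7 | 1 | 23 | 5853 | 5875 | 5853 | 5875 | 0.98 |
| 77 | 84 | 0.00036 | 0.065 | 15.3 | 0.2 | 1 | 23 | 5884 | 5906 | 5884 | 5906 | 0.97 |
| 78 | 84 | 0.00028 | 0.051 | 15.6 | 0.5 | 1 | 23 | 5912 | 5934 | 5912 | 5934 | 0.98 |
| 79 | 84 | 0.019 | 3.5 | 9.8 | 0.3 | 5 | 23 | 5944 | 5962 | 5943 | 5962 | 0.95 |
| 80 | 84 | 0.0065 | 1.2 | 11.3 | 2.8 | 1 | 23 | 6487 | 6509 | 6487 | 6509 | 0.97 |
| 81 | 84 | 3e-05 | 0.0055 | 18.7 | 1.0 | 2 | 23 | 6539 | 6560 | 6538 | 6560 | 0.96 |
| 82 | 84 | 1.1 | 2e+02 | 4.3 | 4.2 | 1 | 23 | 6567 | 6589 | 6567 | 6589 | 0.98 |
| 83 | 84 | 0.0026 | 0.47 | 12.6 | 0.3 | 1 | 23 | 6595 | 6617 | 6595 | 6617 | 0.97 |
| 84 | 84 | 1.1e-05 | 0.0021 | 20.0 | 0.4 | 1 | 23 | 6623 | 6645 | 6623 | 6645 | 0.98 |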
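
The pattern quoted in the Description, #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], can be written as a regular expression and scanned against the protein sequence. The following is a minimal sketch, not the method used to produce the hmmscan results. It assumes that the # positions accept any residue, because the description does not say which residues are allowed there. The names `C2H2_PATTERN`, `find_c2h2`, and `FRAGMENT` are hypothetical, and `FRAGMENT` is residues 56-134 copied from the protein sequence of this record.

```python
import re

# Regex form of #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C].
# Assumption: each '#' is matched as "any residue" because the description
# does not specify a residue class for those positions.
C2H2_PATTERN = re.compile(
    r"."       # '#'
    r"."       # X
    r"C"
    r".{1,5}"  # X(1-5)
    r"C"
    r".{3}"    # X3
    r"."       # '#'
    r".{5}"    # X5
    r"."       # '#'
    r".{2}"    # X2
    r"H"
    r".{3,6}"  # X(3-6)
    r"[HC]"    # final His or Cys
)


def find_c2h2(seq, offset=0):
    """Return (start, end, text) for non-overlapping pattern matches.

    Coordinates are 1-based and inclusive; `offset` is added so that a
    fragment can be reported in full-length protein coordinates.
    """
    return [
        (m.start() + 1 + offset, m.end() + offset, m.group())
        for m in C2H2_PATTERN.finditer(seq)
    ]


# Residues 56-134 of the protein sequence in this record.
FRAGMENT = (
    "HDCDICEKSFCSAFDLHMHKRIH"
    "TGEKS"
    "FVCAICEEVFGTKQLLRSHILSH"
    "TNEKP"
    "YLCKYCDKGFARKSYLTPHLRTH"
)

for start, end, text in find_c2h2(FRAGMENT, offset=55):
    print(start, end, text)
```

On this fragment the matches fall at 56-78, 84-106, and 112-134, which are the ali coord ranges of the first three domains in the hmmscan output. A plain regex is stricter and less sensitive than the profile HMM, so it should not be expected to recover every one of the 84 reported domains, particularly the truncated or low-scoring ones.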
Sequence Information
- Coding Sequence
- ATGCCATCATGCATAAAAAATGAGGATATAGAAGAGGGTACCTATTTCGTAACATCGTTAGATATTGAACAAGCAATGTCGAAACGATACAAGAATGGGTTAACGTCACTATGTGTCAAGAAAGAAGAGGGAAGCCACATAGAGAGTCTCCGAAGTACCGAGCAACATGATTGCGATATATGCGAGAAATCGTTTTGTTCAGCTTTTGATTTACATATGCACAAACGCATACATACTGGAGAAAAGTCTTTCGTTTGTGCTATTTGCGAAGAAGTCTTTGGAACCAAACAATTGCTTCGGTCACACATTTTAAGTCACACGAACGAGAAGCCTTACTTATGTAAATATTGCGACAAAGGTTTTGCACGTAAATCGTACCTAACgccacatttgcgtacgcatactggAGAAAAGCCATACATATGTGGATTTTGCAGCAAGGGTTTTGCGCAACCTACACACTTAACGAGGCacttgcgtacgcataccggtgaaaagccttatatatgtaaattttgcgacaagggttttgcgcacACTACAAACCTAGCagtacatttgcgtacgcataccggtgaaaagccttacgtatgtaaattttgcgataaggATTTCGCGCATCAGTTTACACCATCATGCTTCAAAACTGAGGACATGGAAGAGGGTCCTTATATCATAACATCATTAGATGTTAAACAAGAGAAATCGCAACGACAGAACAATAAGATAACGCAACTCTGGGTTAAAACAGAGTTACCTGATCGCAATGAGGAGAAAATGAAACCTTTTAGTGTAGTGTACACTTGTGACGAACACATAAACGCCAGTCACGAAAAGAGTAATATGTACAAATGCAAATGTTGCTGCAAACACTTCTCCAGTAAGCAGGACTTGTCCGTATATGGAACAGACCAAATGAAGGAGTTCAGATATAAGTGCAATGATTGCGAACCTCACCATGGTACAGGACAGCACGAGTGCGAATGCGGGAAATCTTTTGCCTCGCTCCAGTGTTTGAATGTGCATAAACGCATACATACCGGAGAGAAGCCGTTTAAACCATCATGCTTCAAAAGCGAGGATATAGAAGAGGATTCTTGTATCATAACACCGTTAGATGTCAAACAAGCAATTTCGAAACGATACAAAAATGAGGCAACGCCACTATGCGTAAAGAAGCAGGTATTAGATCATTATACGGAGGAAACGAAACACAATAGTACATCATATGCTTGTGAAGAGTGCGGGAACAATTACAGTATCAAAGGAGGGTGCGGGAAACACTTAAAGACTCTTAACGAAGAGGGTGAAATCTATAAATGTGAATTTTGCTGCGATAAATGCGTGAAAGATTTCAAAGCCAAACTATATTTAGACCGACAAATGGCGACACACGGTTCCAGCAAGGAGGAGTATGTGTGCAAGGTGTGTTCGACCATACTCCATGGAAAGGGAAGCTTTCATGTCCACATGGAGCTACACCAAAGTGCAGCGCAACTTAAGTGTGATATATGCGAGAAGTGGTTTTGTTCAGCCTCTGATATGCATGTGCACAAGCGTACACATACCGAAGAGATGCCCTTCCTTTGTGTTATTTGCGAGGAAGCTTTTAGAACTAAAGAATTGCTTCGATCGCATGTTTTGGGACACACAAAcgagaagccttacatatgcaaattttgcggGAAGGTTTTCTGGCGAACTACAGACCTGGCGAGACATTTACGTACGCACAGCGGggaaaagccttacatGTTCACACCAGCAGGCTTCAAGAGCGAGGATATAGAAGTGAGTATTCCTGAAGTACCCGTATTATGGATTAAGGATGAGATTTCGGATAGTCATGAGAAGCAAAAGCAGCAGAATAGTGCAGCTTTTGCTTGTTACGACTGCGGAAACGGTTACAAGAGCAAAGGAAGCTACGAGAAGCACATAAAAACCGTCCACGAAAatgataaattttacaaatgtgAATTTTGTGGCAAGCACTTTTCA
AGTAAATACAACTTGTCCGTTCATAGAACACAGCACACGAAAGAGTTTAGATATAGGTGTGATAAATGTGAGCGCGGTTACCTGAGGTTGTACGACCTAAAACACCATCAGAACGTTCACCATAATACGCCCAATTTCTTTTGCGGTCAATGCGGAAAAGCTTTCAAGATCAAACGTTATTTAAAGGAACACATGACGATTCACGATTCCAGCAAGGAGAAGTACGCGTGCGGTGTGTGCTCGGCCGTACTCCATCAGGAGAAATCTTATAATCGCCACATGGACCGCCACCAGGGTAAAGGGCAACATAAATGCGATATATGCGAGAAATCGTTATCCTCGGCATCTGGCTTGCTTGAGCATAGACGCATACATACCTGCGAGAAACCCTTCATTTGCGCTATTTGCGAGAGAACTTTTGCAGCCAAAGGATCGCTTCGGTTACATATGCGGACGCACACGAAGgaaaagccttacgtatgtaaattttgcgacaagtaTTTTGCATGGACTACCAGCCTAGCGATCCATTTGCGTCAGCATTCCGGCGAAAAGCCTCATAAATGTTTACATTGCGGAAGGAGATTTACATGCACTTCAAATTTGAACCAACATAAATGCAAGAGGGTGCAGTTTACACCAGCATGCCTCAAAATCGAGGATGTAGAAGAGGATCCCTATATCATAACACCATTTGATGTTACACAAGAAAGTTCACAACGATACAATGATAAGGTAATACCACTATGTGTTAAGAAGGAGCAATGGGATCATCAGGACGAAGAAGCGAAACTCTTCAGTGTGGCGTACACCCATGAAGTTCGCGGAAACGATTACAAGATCAAGGAAAGCTACGACAAAGACTTAAAGCCCATTCACGAAAAGGGTAAAGTttacaaatgcaaattttgcggCATGCAGTTCTTCAGTGAGCACTACTTATCCGTACATAGAACAGGGCACTTGAAGGATGGTGGTACAGTGCAGCATAAGTGCGATGTATGCGGGAAACCTTTGGCTTCACTCCAGAGCTTGAATATGCATAAACGCATACACACCGGAGAGAAGCCATTCGTTTGTGCCATTTGCGAGAAAGGTTTCGCAGTCAAAGGATCGCTTCGGTTTACACCATCACGCCTCAAAAACGAGGATGTAGAAGAAGGTCCTTATATCATAACACCATTAGATATTAAACAAGAAAGTTCACAACCATACAATAATAAGGTAACACCACTATATGTTAAGAAGGAGCAATGGGATCGTCAGGAGGAGGAATTTTGCAGCATGCAGTTCTTCAGTAAGCACTACTTGTCCATACATAGAACAAAGCACACGGAAGGTGGTGGTATAGTGCAGCACAAGTGCGATGTATGCGGGAAGTCTTTGGCTTCACTCCAGAGCTTGAATATGCATAAACGCATACACACCGGAGAGAAGCCATTCATTTGCCCCATTTGCAAGAAAGCTTTCGCAGTCAAAGGATCGCTTCGGTTCATACCGACGTGCGTCAAAAGCGAGGATATAGAAGCGGATCCATGCGGTATAACATCAATATATATCAAGCAAGAGAATTTTGGGAGTCACAATATCGAAATGAATGATTCAGCGTACCCTTGTGAACAGTGTGGAAAAAACCTAGACTACAAATGCGAATTCTGTGGCAAATGTTTGGCCAGTAAGAGCGGCCTGGCCATTCACAGATTGCGACACAcgaaagaatttaaatatagGTGCGATAAATGCGAACGCGATTTCCTTCGATTGTGCGACCTAAAAAGCCACCAGTATTCTCACCAGAGTACGCTTAATTTCTTTTGCGATAAGTGCGGGAAAGGGTTCAAGGCCAAGAGTTATTTAAAGGCGCATATGGACCGTCATCTGAGTATGGAGCGGTACAAGTGCGATATATGCGGGAAATGTATGTCCTCACCCCAGTATTTGCTTATCCATAAGCGCATACACACCGGCGAGAAGCCTTTCGTATGTGGTatttgcgagaa
agcttttgcaCTCAAACAAATGCTCAAGATACACATGCGAACCCACACGAAAGAGaagccttacGGTATAGAAGAGGGCCCCTACATCATAACACCATTAGATATTAAACAAGAAGCAGCATACACTTGCGAAGTATCGAGCAAAGAGAACTACGACAAATACACAAAGGTCAGTCACGAAAATGGTATAGACTGCAAACGCGAATGTTGTGGCAAGCAGTCTTCCAACAAGCAAAACTTTTCGATACACAGAACAAAGCACACTAAAAGAATTCAATACAAGTTCGATAAATACGAACGTCGCCATGGCATAGGACAGAATAAATATGATGTATGCGGGAAATCTTTGGCTTCCCTCCAGAAGCCCTTCATTTGTGCCGTCGCCAAGAAGGCTTTTGCAGTCAAAGGATCGTTTTGGTTACATACACGTACTCACGCGAAAGAAAAGCCTAACTGTTTTTTTTGCGACAAGAATTTTACGTATGCTAGGAGTTTAACGACACCTTTGAATACGCATGCCCGTGAAAAgctttacacatgtaaattCTGCGAAAAGGGCTTTACGCGTCCTGCACACCTGACGACACATTTGCGAACGCATAGgtttatacCAGCATGCGTCAAGAACGAGGGTACAGAAGAAGGTCCCTATATCATAACACCATTACGTATTAAACAAGAAAGTTCGGAACATCACAGTAATAAAGTAACATCACTATGTGTTAGGAAGGAGCTATTGCATCGCAGTCAGGAAACGATTGAACCTTTTAGTGCAGTGTATACTAGCTACGACAAACACATAAAGGCTAGTCACGAAAGGGGTGGCTGTGAGCAATTCCCCAGTGAGCAGTACTTGTCCATACATAGAATAGAGCACACGGAGAGGTTTATACCATCATCCGTCAAAAACGAGGATACAGAAGAGGGTCCCTATATCATAACAccattacatattaaacaagaaAGTTGGGAACAATACAGTAACAGCGTAACACCACTACGTGTGAGGAAGGAGCTATTGGATCGCAGTGAGGAAACGATGGAACCTTTCACTGCAGTGTACACTAGCTACGACAAACACATAAAGCCCAGTCACGAAAAGGGTGAATTCTCCAGTGAGGAgTCTTCAACATCACGCTGCAAAAACGAGGATGTAGAAGCGGATCCCTATATCATAACACCATTTGATGTTACACAAAAAATTTCACAACGATACAATAGTAAGGTAACCTCACTATGTGTTAAGAAGGTGCAATGGGATTGTCATGAGCAGAGAATTAATATTCGCGGTGCAGGGCACAGTTGTGAAGAGGACCGAAACGGTTGCAAGACTAAACAAACCTACGACAAACACATAAAGACTAGTCACGAAAGTGATAAAGTCTgcaaatgcaaattttgcagCAAGCAGTTATCCAGTAAGCAGTACTTGTCCATACTTAGAACAAAGCATACGAAAGCTGGTGGTATAGAAGGGCATAAGTGCGACGTGTGCGGGAAATCTTTGTATTCACTCCGGTTTTTAACATCATGCTTCAGAAAGGAGGATGTCGAAGTGGGACCTTATATCATAACACCATTAGATGTTAGACAAGATAATTCGCAATCacacaataataagaataataaggTAGCACCACTATGTATTAGTAAGGAACAATGGGATCGTCATGAGGACAAAGTGAAACTTGTCAGTGCAGCGTACACTTGTGAAGTGTGCGGAGACAGTTACAAGACCCAAGAAAGCTACGACAAACACAGAATGGCCAGTCACGAAAGGGGTAAAGGctacaaatgcaaattttgctgCAAGCAATTCTCCAGTAAGCAGTACTTGTCCATACATAGAACAGAGCACACGAAAGCTAGTGGTATAGGACAGCATAAGTTTTCAACATCATGCGTCAAAAACGAGGATGTAGAAGAGGGTCCCTATATCATAACACCATTAGATATTAGGCAAGAAAATTCACAACGATACAATGATAAGGTAGTGCCAC
CATGTGTCAAGGAGCAAAGGAATCGTCAGGAGGAGAAAATGAAAAGTCTCAGTGCAGCGTACACTTGTGACGAGTACGGTAACGGTTACAAGATCAAACAAAGCTACGACAAACACATAAAGGCCAGTCACGAAAAGGGCGAAGTCCAAATGCAAAAGTGCGATTCGGCGGGTGCACAACATCCTGGAGAAGGAAACCACTCTGAAGGTCAATGCGGCCCTCTGTGGGGAGTTTGTTGTAATCAATGGGATGAGGAGATTTTCGAATTCAAGTACTTGAAGCCACAGAATTCACCCATCTATCGCGACGCCGGCCTGGAGGATTGGTTCGAGGGAGATGTTTGCATTCCCATCTTCACACAGAGAATTTCAACCCCAAATTCCGGCGGTATTTACGCCTCCATAATAGCTACGACTTTTTTGACACAAAAAAAAGAGGAGAGTCCCGGCGTGGAGCACTTGCTATTCAAATTTCGTGCTTCCAAGCCGATATACTCATCGGCCTATCAGAGAAGTCCAGGTGGGCGTGACCTAGGTGAAAGGCCTTCAACATCACGCTTCAAAAACGAGGATGTAGAAGAGGGTCCTTATATTATAACACCATTAGATATTAGGCAAGAAAAATCACAACGATACAATGATAAGGTAGTGCCGCTACGTATCAGGAAGCAGCAATGGAATCGTCAGGAGGAGAAAATGAAACGTCTCAGTACAGCACACGCTCAGGAAGTGTGCAAAAATGGTTACAAGATTAAAGAAAGCTACGACAAACACATAAAGGCCAGTCACGAAAACGGCAAAGTCCAAATGCGAAAGTGCGATGTAAGCGGGAAACCTTTAGCTTTACCACAGCGCTTGAATTTGCACAAACGCATACACACCCGAGAGGAGCCTTTCATTTGTGCCATTTGCGAGAAAGCCTTCGCGGTCGAAGAATCGCTTCAGTTACATATGCAGACTCAcgcgaaagaaaaaaaacggaaaCTTAGCCAGAAGGGCTCTGCACGTAAGTCGCGCTTAACAAGACATTTGCCTAAGTTCATACCATCAGGGTTCAGAAAGGAAGATACAGAAGTGGGTATTTATGAGGTACCGGTATTATGGATTAAAGATGAAGTTTCGGATGGTTATGAGGAGGAAAAGAAACACACTGGTGCAACTTATACTTGTGACGAGTGCGGAAGCAGTTACGACACCAAAGAGAGCTACGACAAACACATAAAGGTCAGTCACGAAAAGGCTAAAGTctacaaatgcaaattttgctgCAAGAAATTCTCCAGTGAGCACTGCTTGTCCATACATAGATCAGAGCACACGAAAGCTAGTGGTATACTACCGCATAAGTTCATACCATCAGGGTTCAGAAAGGAAGATACAGAAGTGGGTATTTATGAGGTACCGGTATTATGGATTAAAGATGAAGTTTCGGATGGTTATGAGGAGGAAAAGAAACACACTGGTGCAACTTATACTTGTGACGAGTGCGGAAGCAGTTACGACACCAAAGAGAGCTACGACAAACACATAAAGCTCAGTCACGAAAAGGCTAAAGTctacaaatgcaaattttgctgCAAGAAATTCTCCAGTGAGCGCTGCTTGTCCATACATAGATCAGAGCACACGAAAGCTAGTGGTATACTACCGCATAAgtTTTTAACATCATACGTCAAATACGAGGATATAGAAGAGGGTCCCCATATCATAACACCATTAGACCTTAAACAAGAAGATCCGCAACAACACAATAAGAAGGTGACGCCATTGTATGTTAAGAAGAAACAATGGGAAGCTCAGGAGGAGAACATGGAATCCTTTAGTGCAGCCTACACATGCGAAGTGTGCGGAAGCGGTTACAAGACCGAAGAGAGCTATGGCAAACACATAAAGGCCGCTCACGAGATGGGTGAAATATACAAATGCGAATTTTGCAGCAAGCAGTACTCCAGTGAGCACCATTTATGCATACACAGAACAAAGCACCATGGTATAGAGCAG
TTCATACCACCAGGGTTCAAAAACGAAGATATAGAAGCCGGCATCTATGAGGTACCTGTATCGTGGATAAAAAATGACGTTTTGGATGGTTATGAGCATGAAAAGAAATACATTAGCGCAGCTTACACTTGCTACGAGTGTGAAAAGGGTTACAACACCAAAAAGAGCTACAACAAACACTTAAACGTCAGCCACGAAAAGGCTAAAGTCTACAGATGCAAATTTTGCTGCAAGCAGTCCATGATTCACAAAGGATCGCATCGCTTACATATGCGGACACACGCGAAAGAAAAGCCTTGCATATGTCAATTTTGCGACAAGCGTTTTGGGCATAAGTTCACACAATCAGGCTTCAGAAACGAAGATATAAAAGTGGACTCTTATGAGGTTCCGGTATTATGGATTAAAGATGAAGTTTCGGCTGGTTATGAGGAGGAAAAGAAACACATTGGTGTAATTTATACTTGTGACGAGTGCGGAAGCAGTTACGACACCAAAGAGAGCTACGATAAACACATAAAAGTCAGTCACGAAAAGGCTAAAGTCTACAAATGGAAATTGTGCTGCAAGCAGTCTTCTAGTAAGCATAACTTGTCCATACATAGAACAGAGGACACAAAAGCTGGTGGTATAGGACAGCACAAATGCGATGGATGTGGGAGACCTTTGGCTTCGCCCCAGCGCTTGAATCTACACAAACGCATACACACTGGAGAGAAGCCTTTCATTCACAAAGGATCGCATCGGTTTATACTATCTCAGACCAAGAGCGAGGACATAGAAGACGATCCCTATACCGTAACGCCATTAGAGATTAAAGAGGAAGAATTGGAGCCACACAGTAGAGGAATTCACCACAATATCGGAGAGTACAGTTGTGAACAGTGCGGAAACAGTTGGAATGATTACGTTGGTCACAGACAGGCGCATCAACAAAAGAAAGCGGGCCACAAATGCGAATTTTGTGACATGTATTTTGCTAGTAGGCATACTTTGGTTGTATCACAAAAGCATGCGAAAGAGTTTAAACACAACTGTAATAAATGCGAGGGCGGTCACTATAAATTGTGCGACCAGCAGGTTCACCAGGGTGGTACATCCAATTTCTGTTGCGATAAATGCGGGAAGAGTTTCAAGACCAGGTATTATTTAAAACGACATACGGTCGTGCACGATGATAACAAGAAGTACGTGTGTGATGTGTGTTCCGCCGTACTTCACCACAAGGATAGTTATCGTCGCCATATGGACCGTCACCAAAATACAGACCAGTATAAGTGCGATGTATGTCGGAAATCTTTGGCTTCACTCCAGGGCTTGCGTGTGCATAAACGCATACATACCGGAGAGAAGCCTTTCCTTTGTGCTATTTGTGAGAAGGACTTTAGAAGCAAAAGATTGCTTGTGGTGCACATGATCAAGAGCGAGGATATCGAAGAGGATCTCTATACCGTAACGCCATTATACATTAAACAAGAAGATGTGGAACAACTCAGTGGAGTTAACGATACCGGTGAAGTATATAAATCTGTGGAGTGCAAAAACGGTTACAAAAGTTGGGAGGATAACGTCGGACAGGAATACGCACACGATGACAACAAAAAGTTTATATCATCACAGTTCAAGAGCGAGGATTTCGAAGAGGATCGCTATACCGTAGCGCCATTATACATTAAACAAGAAGATTTGGAACAACTCACTGGAGTTAACGATACCGGTGAAGTGTATAAATCTGTGCAGTGCAAAAATAGTTACAAAGGTTGGGAGGATAACGTCGGACAGGAATACACACACGATGACAACAAAGAGTACATGTGTGATACCGCAGAGAAGCTCTTCACTTATGGTACTTGTGAGAAGATCAAGAGCGATCATATGGAAGAGGATCCCTATACTGTAATGCcattagatattaaaaaagaagatttgGGACGATCCAGTAGAATTAACGATACTAGTGGAGAGTATAATTCTGTACAGTGCAAAACCAGTTACAGCGG
TTGGGAAGCCAACGTCAAGCAGAACTACGACATAGGCGATGACAACAAGAAGTTTATACCGTCACAGATCAAGATCGAGGATATCGAAGAGGATCCCTATACCGTAACGCCATTATACATTAAACAAGAAGATTTGGGACGACCCAGTGCAGTTAACGATACCGGTGAAGTGTATAATTCTGTACAGTGCAAAAGCAGTTACAAGAGTTGGGAGGAACACGTACACGATGACAACAAAACGTACATGTTTGCTGCCGCAGAGAAGCCTTTTACTTGTGGTACTTGTGAGAAGGTTTTTGGAAGTAAACGAATGCTTACGGTACATCTGGTGACGCACGCGAACgaaaagccttacATCAAGATCGAGGATATCGAAGAGGATCTCTATACCGTAACGCCTTTATACATTAAACAAGAAGATTTGGGACGACCCAGTGCAGTTAACGATACCGGAGAAGTGTATAATTCTGTGCAGTACAAAAACAGTTACAAGAGTTGCGAGGAACACCCACACGATGACAACAAAAAGGACCTGCTTGATACCGCAGAGAAGTCTTTTGCCTGTGGTACTTGTGAGGAGGTTTTTGGAAGTAAACGAATGCTTACGATCAAGAGCGAGGATATCGAAGAGGATCTCTATACCGTAACGCCATTATACATTAAACAAGAAGATTTGGAACAACTCAGTGGAGTTAACGATACCGGTGAAGTGTATAAATCTGTGGAGTGCAAAAACAGTTACAAAAGTTGGGAGGATAACGTCGGACAGGAATACGCACACGATGACAACAAAAAGTACATGTTTGATACCGCGGAGAAGCTCTTCACTTGTGGTACTTGTGAGAAGGTTTTTGGAAGTAAACGAATTCTTACGGAACACCTGGTGACACAtgcgaaagaaaagccttacATCAAGAGCGAGGATATCGAAGAGGATCTCTATACCGTAACGCCATTATACATTAAACAAGAAGATGTGGAACAACTCAGTGGAGTTAACGATACCGGTGAAGTGTATAAATCTGTGGAGTGCAAAAACAGTTACAAAAGTTGGGAGGATAACGTCGGACAGGAATACTCACACGATGACAACAAGAAgTTTATACCTTCCCAGATCAAAAGTGAGGATCTCGAAGAGGGTTCCTATACCGTAACGTCATCAGATATTAAAGAAGAAGATTTGGAACGGGGCGGGTTTATACCATCTCAAATCAAAAGCGAGGATCTCGACGAGGATTCCTATACCGTAACGTCATTAGATATTAAAGAAGATTTGGAACGTGGAGGTTTTGCGCGAACTGAATTTGTAGCGGTTAAGGTACATTTGTTTTTTCTCTTACTTAAAAGCGAGGACCTGGAAGAGGGTCCCTATACCGTAACGTCATTAGATATTAAAGAAGAAGATTTGGAATTGGGAGGGTTTATACCATCTCAAATCAAAAGCGAGGATCTCGACGAGGATTCCTATACCGTAACGTCATTAGATATTAAAGAAGAAGATTTGGAACGTGGAGGTTTTGCGCGAGCTGAATTTGTAGCGGATCATTTTTCTAGGTTTATACCATATCAGACGAAGATCGAGGACATAGAAGCGGATTCCTATACCGTAGCATCATTCgatataaaagaagaaaatttggaGCGAAACAGTGGAGAATTTAAATACACTAGCGCAGAGTACAGTTCTGAACAATGCGAAAATAGTTACGAGAGTTGCAATGATTACGTCGGTCATACAGAGGTGCATCTCCAAAAGGATTCAGGCCACCAATACCTCTGCGATAAATGCAACTGCGGTTATGGTACCTTATGGGACCTACAAAACCATGAGAACGCTCATTCTACTCAAGATACTGAGGAGTATAAATGCGATGTATGCGAGAAATCTTTGCGTTCAGCCCATTATTTACGTGTGCATAAACGCATACATTCCGGAGAGAAGCATTTCACTTGTGCTAGTTGTGAGAAGGTTTTTAGAAGCAAATCATTGCTTTTGGTAC
ACCTTGTGACACACGGCAAAAGCGAGGGTATCGAGGAGGATTCCTATTCCGTAACGTCATTAGATATTAAAGAAGAAGATTTAGAACGATACGGTagagaaattatgaaaaatgaTTACGTAGGTCACAGACAGGTGCGTCAACAAAAGAATCCAGGCCACAAATGCGAATTTTGCGACATACACTTCGCCAGTAAGCGTATCTTGGCGGTTCATACATCACAAAAGCATGCGACAAAGTTTAAATACGTCTGCGATAAATGCAACTGCTGTTATGGTACCTTGTGGGATCTCAAAAATCATCAGAACGCTcattataaGTTTATGACGTCTCCGGCTAAAAGCGAGGGGATCGAGGTGCATCCCTACACCATAACCTCATTAGATATTAAAGAAGAAGATTTGAAAGGACGCAGTGGAGAAATGAACCAGAGTAGCGAAGAGCACAGTTGTAAACAGTGCGAAAAAGGTTACAAGAGTTGGAATGATTACGTTGGTCACAGACTGGTGCATCACCAGAAGAATCCAGGCCACAAATGCGATTTTTGTAATATGCGCTATGCCACTAAGCACACCTTGGCTCTTCATAGATCACGAAAGCACACGAAGGAGTTTAAATATATCTGCGTTAAATGCGACCGCGGTTATGGTACGCTATGGGACCTCAAAAACCATCAGAACGTTCACCATAGTACTACTAATTTCTTTTGCGATAAATGCGGGAAGAGTTTCAAGACCAAACGTTATTTACAGCGACATATGGTGATACATGATGACGACAAAAAGTACGTGTGTGATGTGTGTTCAGCTGTACTTCATCGCAAGGAAAGTTATCGTCGTCACATGGATCGTCATCAAAATACAGAGCAGCATACTTGCAGTGTATGCGAAAAATCTTTATCTTCAGCCTATCACTTGCGTGTTCATAAACGCATACATACCGGAGAGAAGCCTTTCGCTTGTGCTATTTGCGAGAAGTTAATACAATCATCTGTCAAATACGAGGATATAGAAGAGGGTCCCTACGTTGTATCACCATTAGACATTGAACGAGAAATTCCGAAACGCGGCCGTTATGAAATAACGCCATTATGTCTTCAGAAAGACATTTGTGGTTCATAtaagaagaaaatgaaacacaATGGTGCACCACATACTTGCGAAGAGTTTATACAATCATTCGTCAAATACGAGGATATCGAAGAGGGTTCATACGTGGCAACACCATTACACATTAAACTAGAAACTACGAAACCATACAATAATGAGGTAACGTCAGTATATATTCAAGAAGACGTTTGTGGTCCTCATGGAAAGAAAGTAAAACACAATAGTTCAACATATACTTGTGGGGAGTGTGGAAACGGTTACAGGACCAAAGAAAGCTACGGCAAACATATACAAGCGGCGCACGAAAACTTAACGCAACACTTGCATACGCATACCGATGAAAGAccttacacatCACTATTGGACATTAAACAAGAAAGTTcgaaacaatacaataatgagGTAACGCCATTTTGTATTGAGGAAGACGTTTGTGATCCTGATGGGAAGGAAATGAAACACAGCAGCGGAGCATATACTTGTGAAGGGTGTGGAAACAGCTACCAGACCAAACAAAGTTACACCAAACACTTGCGAGCAGCTCACGAAAACTTAAAGGTGCACATTCCGACACATGCGGAATCACCTGTCAAACACGAGGATGCAGACGAGAGTCCCTACGTTGTAACACCATTAGACATTAAACAAGAAAGTTCGGAACGCGACAATAATGAACTAACGCCATTATGTATTCCGGAAGACGTCTGTGGTCCTTGTGGAAAAAAAGTGAACGACAATACTTGTGAAGATTGCGGATACGACTACGGGACTGAAAAGAGCTACGGCAAACACATGCAAACTGTTCACGAAAGCCTAACAAGACGTTTGCGTACGCATATCGTCTTGGCATTTTTGTATTCATGGGATCATATTTCCAGGTTTATACAATCA
CCTGTTAAATACGAGGATATAGAAGAGAATCCTTACCTCATAACACCATTAGACATTAAACAAGAAAGTTCGAAACGATACAACAATGACGCAACGCCATTATATATTCAGAAAGGCGTTTGTGGTTCTCATGGGAAGAAAATAAGATGTAATAGCACAGCATATATTTATGAACAGTTTATACAATTACTCGTCAAACACGAGGATATAGAAGAGAGTCCCTACGTTATAACACCATTGGACGTTAAACAAGAGTGTTCGAAACTATGCAATAATGAGGGAATGccattttgtattcagaatgatGTCTGTGATCCTGATGGAAAGAAAATGAATGAAGAGTGCGGACAAGATTACGAAGGGAGCTACAGCAAACACACACAAGCGGTTCACGAAAACTTAACGGATCATTTgtttatacAATTACTCATCAAACACGAGGATATAGAAGAGGGTCTCCACGTTGTAACACCATTAGACATTAAACAAGAAAGTTCGAAACTATGCAATAATGAGGTAACGCCATTTTGTATTCACGAAGAGGTTTGTGATCCTGATGAGAAGACAATGAAACATAATAGCGACACATATACTGGTGAAGAGTGCGGAAAAGATGGGAAAGAGAGTTACAGCAAACACATACAAGCGGTTCACGATAACTTAACGGATCATTTGTTAATACAATATTTCGTCAAACACGAGGACATAGAAGAGAGTCCTTATACTGTAACACCGTTAGACATTAAACAAGAAAGTTCGAAGCTATACAGTGATGAGGTAACGTCAGTTTGTATTCTGAAACAAGTTTGTTCTCCTCAtgggaagaaaataaaatacaatagcAGATCAGCATATACTTATGAAGAGAGCGGAAATGGTTACTGGAACAATGAAAGCTACGGCAAACACATAAAAGCTATTCACGAAAACTTAACGAAACGTTTACGTACGCATACCGCTGAGAAGGCTTatacatTTTGCGGTAAGGGCTTCGCGCATAGTACGAACCTAGCGGCGCATATGCGTACGCATACAGGTGAGAAAGCTTTTCGGTGCTTACATTGTGATAGGAGATTTGGTACCAATGCACATTTAATTCGCCACAGAGAAGTATGCAAAGGGATAGCCTCAAAAGAGGATTATGGAATTGGATCACAGTCTATACAATTACTCATCAAATACGAGGACATAGAAGAGAGTTCCTACCATGTAACACCATTAGACATTAAACAGGAATGTTCGAAGCGATACAATAATGAGATAACGCCGGTATGTGTTCAGAAGGACGTTTGTGGTCCTCATGGGAAGAAAGTGAAACACAATAGCGCAACATATGCTTCTGAAGAGGGCGACGATGGTTACAGGAACAAAAAAAGCAAACGCATAAGAACGGTTCCCAAAAACCTAGAgcaacatttgcgtacacataccgctGAGAAGCCTTatacatTTTGCGGTAAGGGCTTCGCGCATAGTACGAACCTAGCGGCGCATATGCGTACGCATACAGGTGAGAAAGCTTTTCGGTGCTTACATTGTGATAGGAGATTTGGTACCAATGCACATTTAATTCGCCACAGAGAAGTATGCAAAGGGATAGCCTCAAAAGAGGATTATGGAATTGGATCACAGTCTATACAATTACTCATCAAATACGAGGACATAGAAGAGAGTTCCTACCATGTAACACCATTAGACATTAAACAGGAATGTTCGAAGCGATACAATAATGAGATAACGCCGTTATGTGTTCAGAAGGACGTTTGTGGTCCTCATGGGAAGAAAGTGAAACACAATAGCGCAACATATGCTTCTGAAGAGGGCGACGATGGTTACAGGAACAAAAAAAGCAAACGCATAAGAACGGAGGACATAGAAGAGAGTGTCTACCTTGTAACCCCATTAGACATTAAACAGGAATGTTCGGAACGATACATAAATGAGATACCGCCATTATGTGTTCAGAAAGACGATTGTGGTCCTGCTGAGGAGAAAATGAA
ACACAACAGTGCAACATATACTTGTGAAGAGGGCGACGATGCTTACAGAATCAAAGAAAGCAAACGCATAAGAACGGTTCACAAAAACCTAGAGCAACATTTGCGTACGGATGCCGCAGAGAAGCTTTAcgcatgtaaattttgcgaaaaggaaTTTGACCGATCAGCAAGCCTGACGATACATTTGCGCACGCATACCGGGGAAAAGCCTCACACATGTCAATTTTGCGGCAAGGGTTTTGCGCACAAgtTTATACAATCATCTGTCAAATACGAAGATATAGACGAGGGTTCCTACATTGTAACACCATTAGGCATTAAACAAGAGAGTTCTCAGCGATACAATAATGAGGTAACGCCATTATGTTTTCGAAAAGAGGTATGGGATCATTGTGAGGAGAACATGATACACAATACTACAGCATATACTTGCCAAGAGTGCGGAAACAGTCACAAGACTAAAGAAAGCTATGACAAACACATACAAGCCGTTCATGAAAATTTAACGAAAcgtttgcgtacgcatacctgTGAAAAGCCATTTATACAATCACCTCTCAAATTCGAGGATATAGAAGAGGGCCCCTACATTGTAACCCTATCAGGCATTAAACAAGAAAGTTTGAATCTATACAATAATGAGATAACGCCATTACGTATTCAGAGGGAGGGTTGTGGTTCTCGTGGGAAAAATATGAAACACAATGGCGCTACACATATTTGTGAAGAGTGTGTAAACGGttacagaagcaaaaaaagGTTTATACAACCACCTGTCAAATTCGAGGATATAGAAGAGAGTCCGTACGTTGTAACACTAACAGACAAACAACAAAGTTCGAAGCTATACAATGATGAAGTAACGCCATTACGTATTCGGAAAGACATTTATGGTTCTTATGGCAAGAAGATGAAACACAATGATGAAGCATATGCTTGTGAAGAGGGTGGAAACGGTTACAGAACCAAAAAAAGCTACGACAAACACAGACAAGTGGTTCACGATAACTTTACGATGCATTTGCGTACGCTTACCGAATCACCTGTCAAACACGAGGATGTAGAAGAGGGTCCCTATTTTATAACACCATTAGACATTATACAAGACATTTCGAAGCGATACAGTAATGAGGTAACGCCATTAAGTATTAAAAAAGGCGTTTATGGTCCTCATGGGAAGAAACTTAATAGTGCAACATATACTTGTGAAGAGTGCGGAACCAGTTATAGGACCAAAGAAAGCTACGGCGAACACATGCAGGCGGTTCATGAAAACTTGCGTACGATTATACAATCACAAGTCAAAAATGAGGACATAGAAGAGTGTGCTTATGTTTTAACATCATTAGGTATTGAACAAGATAATTTGAAACACATTAACGATATTACATACATGTGCGACAAATGCGAATGCGATTACTTCAGGTTGTCGGACCTGAAATTCCATCAGCTCATCCAGCACAGCACGCTTAATTTCTTGTGCGATAAGTGCGGGAGAGGTTTCGagatcaaattttatttaaagcgaCATATTGCAACGCACGATGGCAACAAGGAGTACGTGTGCGAGATTTGTTCGGCCATACTCCATCACAAGGACAGCTACTGTCGCCATATGGAGCGGCACCAAGATCATCAAGCTATAGGGCGGCATGAGTGCGACGTATGCGGAAAATCCTTGTCTTCACCCAATGCTTTGCTCGTACATAAGCGCATACATACCGGAGAGAAGCCTTTCGTCTGTGCTATTTGCGACAAACTTTTTAGAACCAAGCAGATGGTACAGTTACACATACGTACACACACGAAGGAAAAGCTCCACGTATACAAGCTTTGCGAGAAAGATTTTGCACAAACTACACAACTAACGAAGCATTTACGcgtacataccggtgaaaagccagccttacatatgTTTATATATTCACCCGTGAAAGGCGAGGATATAGAAGAGGGTTCCTATATCGTAACACCATTAGAAATTAAACAAGAAAGTTCAA
AAGGATATCCTAATGAGGTAACGGCATCATGTATTCTAAAGAAGGTTTGGGATCGCTATGAGGAAATGAACCACATTAGAGCAGCAAACACGCCTGAAGAGAGCGGACACGGTGACAAGACTAAAGAGACGAAGTACCTCTGCAAGTTCTGTTGGGCCGTATTCCATCAGAAGTTTATATATTCACCCGTGAAAGGCGAGGATATAGAAGAGGGTCCCTATATCGTAACACCATTAGATATTAAACAAGAAAGTTCAAAAGGATATGGTAATGAGGTAACGGCATCATGTATTCTAAAGAAGGTTTGGGATCGCTATGAGGAAATGAACCACAATAGTGCAGCAAACACGTCTGAAGAGAGCGGACACGGTAACAAGACTAAAGAGATGAAGTTCATACAATTCCCTATCAAAATGGAGGATATAAAGGAGAGTCTACACGTTATAACGTCATTAGATATTAAACAACAGCATTCGACACGACATGACAATGAGGCCTTATGTATGAAGAAAGGAGTTTGGGATCGTCATGAGGAGAAAGTGAAGCACAATGATGAAGAGTTATTACAAACAGCTGTCAAATACGAGGATATAGAAGAGGGTCCCTACGTTCTAACACCATTAGATATTAAGCAAGAAATTAGGAAACGACATAATAATGGGCTAATGCAATTATGTATTCAGAAAGACGATCGCGGTCTACATGAGACGAAAATGAAATGTTCACCGTTAGATGCTGAGCAAGAAAATTCGGAACAACGCAATAATGGGATAACCCCATTGTGTGTTAAGAAAGAGATTTGGAATCGCCACGAGAAGAAACTGAAGCGCAATGGTGCAGCCTATACTTGTGGACAGAGCGAAAATAATATGAAGGCTCGCGTAAGTTGCGACAGATATGAAAAAGCGCAACATAAAATGCACTCGCAGTATAGTCATAGCAATGGAAAGCCGTTTTTACAAGCACTCGTCAAAAATGAAGATATAGAAGAGGGTCCCTATGTTATCACACCGTTAGATGCTGAACAAGAAAATTCGAAACAAGGCAATAATGGATTAACGCCACTGTTTTTACAAGCACTCGCCAAAGAAGAAGATGTAGAAGAGGGTCCCTATGTTATAATACCGTTAGATGCTGAACAAGAGAATTCGAAACAACACAATATGACGCCATTCTCTGTTAAGAAAGAGGTTTGGAATCGTCATGAGGAGAAATTGGAGCGCAACGGTGCAGCATATGCTTGTGAACAGGACGAAAATAGTGTCAAGGCTCCCCATGTTTCCAGGTTGATATGGTCCCAAGTCAAGCACGAAAATATAGAAGAAGGTCCCTACGTTGTAATACCACTGaatatgaaacaagaaaattcaGAACGGCGCCATAGAGAAACCAACCACTTGTACACTTCTGAAGGCCACGTAGAAAGTTTTAATATTAGGAGATACGAAACGAACCAAGACTACAAGTGCGAATTCTGCGAGAAGCACTTCGCCAGCAAGATATCCCTAGCTAATCACAGAACGCAGCACGCGACAGGACTTAAATGCGAGAGCAGTTACCCTAGATTGTGCGACCTAGAAAATCATCAGAGCATTTATCAAGATACCCTCAGACTTTCTTGCGTTAAATGCGGAAAGGTTTTTAAGAGCCGGAAATATTTAAGCCAACATATGGCGACGCACCGTCCCAAAACGGAAGAGTACACGTGTGAGGTGTGCGCAACCGTGCTCCACCACAAGAAGTCCTATTATCGTCACATGGCGCGCCACCAAGGCAAGGGACAGCATCAATGCGATATCTGTGGCAAATCGCTGTCCAGAGCCGAATATTTGGCCCCCCATAAACGCATACACAGCGGTGAAAAGCCTTTCGTTTGTACTATTTGCGAGAGAGCTTTCACAACGAAACGATTGCTTGTACTACACATACGGACACATGCGAAACAGAAGCCTCATTGA
- Protein Sequence
- MPSCIKNEDIEEGTYFVTSLDIEQAMSKRYKNGLTSLCVKKEEGSHIESLRSTEQHDCDICEKSFCSAFDLHMHKRIHTGEKSFVCAICEEVFGTKQLLRSHILSHTNEKPYLCKYCDKGFARKSYLTPHLRTHTGEKPYICGFCSKGFAQPTHLTRHLRTHTGEKPYICKFCDKGFAHTTNLAVHLRTHTGEKPYVCKFCDKDFAHQFTPSCFKTEDMEEGPYIITSLDVKQEKSQRQNNKITQLWVKTELPDRNEEKMKPFSVVYTCDEHINASHEKSNMYKCKCCCKHFSSKQDLSVYGTDQMKEFRYKCNDCEPHHGTGQHECECGKSFASLQCLNVHKRIHTGEKPFKPSCFKSEDIEEDSCIITPLDVKQAISKRYKNEATPLCVKKQVLDHYTEETKHNSTSYACEECGNNYSIKGGCGKHLKTLNEEGEIYKCEFCCDKCVKDFKAKLYLDRQMATHGSSKEEYVCKVCSTILHGKGSFHVHMELHQSAAQLKCDICEKWFCSASDMHVHKRTHTEEMPFLCVICEEAFRTKELLRSHVLGHTNEKPYICKFCGKVFWRTTDLARHLRTHSGEKPYMFTPAGFKSEDIEVSIPEVPVLWIKDEISDSHEKQKQQNSAAFACYDCGNGYKSKGSYEKHIKTVHENDKFYKCEFCGKHFSSKYNLSVHRTQHTKEFRYRCDKCERGYLRLYDLKHHQNVHHNTPNFFCGQCGKAFKIKRYLKEHMTIHDSSKEKYACGVCSAVLHQEKSYNRHMDRHQGKGQHKCDICEKSLSSASGLLEHRRIHTCEKPFICAICERTFAAKGSLRLHMRTHTKEKPYVCKFCDKYFAWTTSLAIHLRQHSGEKPHKCLHCGRRFTCTSNLNQHKCKRVQFTPACLKIEDVEEDPYIITPFDVTQESSQRYNDKVIPLCVKKEQWDHQDEEAKLFSVAYTHEVRGNDYKIKESYDKDLKPIHEKGKVYKCKFCGMQFFSEHYLSVHRTGHLKDGGTVQHKCDVCGKPLASLQSLNMHKRIHTGEKPFVCAICEKGFAVKGSLRFTPSRLKNEDVEEGPYIITPLDIKQESSQPYNNKVTPLYVKKEQWDRQEEEFCSMQFFSKHYLSIHRTKHTEGGGIVQHKCDVCGKSLASLQSLNMHKRIHTGEKPFICPICKKAFAVKGSLRFIPTCVKSEDIEADPCGITSIYIKQENFGSHNIEMNDSAYPCEQCGKNLDYKCEFCGKCLASKSGLAIHRLRHTKEFKYRCDKCERDFLRLCDLKSHQYSHQSTLNFFCDKCGKGFKAKSYLKAHMDRHLSMERYKCDICGKCMSSPQYLLIHKRIHTGEKPFVCGICEKAFALKQMLKIHMRTHTKEKPYGIEEGPYIITPLDIKQEAAYTCEVSSKENYDKYTKVSHENGIDCKRECCGKQSSNKQNFSIHRTKHTKRIQYKFDKYERRHGIGQNKYDVCGKSLASLQKPFICAVAKKAFAVKGSFWLHTRTHAKEKPNCFFCDKNFTYARSLTTPLNTHAREKLYTCKFCEKGFTRPAHLTTHLRTHRFIPACVKNEGTEEGPYIITPLRIKQESSEHHSNKVTSLCVRKELLHRSQETIEPFSAVYTSYDKHIKASHERGGCEQFPSEQYLSIHRIEHTERFIPSSVKNEDTEEGPYIITPLHIKQESWEQYSNSVTPLRVRKELLDRSEETMEPFTAVYTSYDKHIKPSHEKGEFSSEESSTSRCKNEDVEADPYIITPFDVTQKISQRYNSKVTSLCVKKVQWDCHEQRINIRGAGHSCEEDRNGCKTKQTYDKHIKTSHESDKVCKCKFCSKQLSSKQYLSILRTKHTKAGGIEGHKCDVCGKSLYSLRFLTSCFRKEDVEVGPYIITPLDVRQDNSQSHNNKNNKVAPLCISKEQWDRHEDKVKLVSAAYTCEVCGDSYKTQESYDKHRMASHERGKGYKCKFCCKQFSSKQYLSIHRTEHTKASGIGQHKFSTSCVKNEDVEEGPYIITPLDIRQENSQRYNDKVV
PPCVKEQRNRQEEKMKSLSAAYTCDEYGNGYKIKQSYDKHIKASHEKGEVQMQKCDSAGAQHPGEGNHSEGQCGPLWGVCCNQWDEEIFEFKYLKPQNSPIYRDAGLEDWFEGDVCIPIFTQRISTPNSGGIYASIIATTFLTQKKEESPGVEHLLFKFRASKPIYSSAYQRSPGGRDLGERPSTSRFKNEDVEEGPYIITPLDIRQEKSQRYNDKVVPLRIRKQQWNRQEEKMKRLSTAHAQEVCKNGYKIKESYDKHIKASHENGKVQMRKCDVSGKPLALPQRLNLHKRIHTREEPFICAICEKAFAVEESLQLHMQTHAKEKKRKLSQKGSARKSRLTRHLPKFIPSGFRKEDTEVGIYEVPVLWIKDEVSDGYEEEKKHTGATYTCDECGSSYDTKESYDKHIKVSHEKAKVYKCKFCCKKFSSEHCLSIHRSEHTKASGILPHKFIPSGFRKEDTEVGIYEVPVLWIKDEVSDGYEEEKKHTGATYTCDECGSSYDTKESYDKHIKLSHEKAKVYKCKFCCKKFSSERCLSIHRSEHTKASGILPHKFLTSYVKYEDIEEGPHIITPLDLKQEDPQQHNKKVTPLYVKKKQWEAQEENMESFSAAYTCEVCGSGYKTEESYGKHIKAAHEMGEIYKCEFCSKQYSSEHHLCIHRTKHHGIEQFIPPGFKNEDIEAGIYEVPVSWIKNDVLDGYEHEKKYISAAYTCYECEKGYNTKKSYNKHLNVSHEKAKVYRCKFCCKQSMIHKGSHRLHMRTHAKEKPCICQFCDKRFGHKFTQSGFRNEDIKVDSYEVPVLWIKDEVSAGYEEEKKHIGVIYTCDECGSSYDTKESYDKHIKVSHEKAKVYKWKLCCKQSSSKHNLSIHRTEDTKAGGIGQHKCDGCGRPLASPQRLNLHKRIHTGEKPFIHKGSHRFILSQTKSEDIEDDPYTVTPLEIKEEELEPHSRGIHHNIGEYSCEQCGNSWNDYVGHRQAHQQKKAGHKCEFCDMYFASRHTLVVSQKHAKEFKHNCNKCEGGHYKLCDQQVHQGGTSNFCCDKCGKSFKTRYYLKRHTVVHDDNKKYVCDVCSAVLHHKDSYRRHMDRHQNTDQYKCDVCRKSLASLQGLRVHKRIHTGEKPFLCAICEKDFRSKRLLVVHMIKSEDIEEDLYTVTPLYIKQEDVEQLSGVNDTGEVYKSVECKNGYKSWEDNVGQEYAHDDNKKFISSQFKSEDFEEDRYTVAPLYIKQEDLEQLTGVNDTGEVYKSVQCKNSYKGWEDNVGQEYTHDDNKEYMCDTAEKLFTYGTCEKIKSDHMEEDPYTVMPLDIKKEDLGRSSRINDTSGEYNSVQCKTSYSGWEANVKQNYDIGDDNKKFIPSQIKIEDIEEDPYTVTPLYIKQEDLGRPSAVNDTGEVYNSVQCKSSYKSWEEHVHDDNKTYMFAAAEKPFTCGTCEKVFGSKRMLTVHLVTHANEKPYIKIEDIEEDLYTVTPLYIKQEDLGRPSAVNDTGEVYNSVQYKNSYKSCEEHPHDDNKKDLLDTAEKSFACGTCEEVFGSKRMLTIKSEDIEEDLYTVTPLYIKQEDLEQLSGVNDTGEVYKSVECKNSYKSWEDNVGQEYAHDDNKKYMFDTAEKLFTCGTCEKVFGSKRILTEHLVTHAKEKPYIKSEDIEEDLYTVTPLYIKQEDVEQLSGVNDTGEVYKSVECKNSYKSWEDNVGQEYSHDDNKKFIPSQIKSEDLEEGSYTVTSSDIKEEDLERGGFIPSQIKSEDLDEDSYTVTSLDIKEDLERGGFARTEFVAVKVHLFFLLLKSEDLEEGPYTVTSLDIKEEDLELGGFIPSQIKSEDLDEDSYTVTSLDIKEEDLERGGFARAEFVADHFSRFIPYQTKIEDIEADSYTVASFDIKEENLERNSGEFKYTSAEYSSEQCENSYESCNDYVGHTEVHLQKDSGHQYLCDKCNCGYGTLWDLQNHENAHSTQDTEEYKCDVCEKSLRSAHYLRVHKRIHSGEKHFTCASCEKVFRSKSLLL
VHLVTHGKSEGIEEDSYSVTSLDIKEEDLERYGREIMKNDYVGHRQVRQQKNPGHKCEFCDIHFASKRILAVHTSQKHATKFKYVCDKCNCCYGTLWDLKNHQNAHYKFMTSPAKSEGIEVHPYTITSLDIKEEDLKGRSGEMNQSSEEHSCKQCEKGYKSWNDYVGHRLVHHQKNPGHKCDFCNMRYATKHTLALHRSRKHTKEFKYICVKCDRGYGTLWDLKNHQNVHHSTTNFFCDKCGKSFKTKRYLQRHMVIHDDDKKYVCDVCSAVLHRKESYRRHMDRHQNTEQHTCSVCEKSLSSAYHLRVHKRIHTGEKPFACAICEKLIQSSVKYEDIEEGPYVVSPLDIEREIPKRGRYEITPLCLQKDICGSYKKKMKHNGAPHTCEEFIQSFVKYEDIEEGSYVATPLHIKLETTKPYNNEVTSVYIQEDVCGPHGKKVKHNSSTYTCGECGNGYRTKESYGKHIQAAHENLTQHLHTHTDERPYTSLLDIKQESSKQYNNEVTPFCIEEDVCDPDGKEMKHSSGAYTCEGCGNSYQTKQSYTKHLRAAHENLKVHIPTHAESPVKHEDADESPYVVTPLDIKQESSERDNNELTPLCIPEDVCGPCGKKVNDNTCEDCGYDYGTEKSYGKHMQTVHESLTRRLRTHIVLAFLYSWDHISRFIQSPVKYEDIEENPYLITPLDIKQESSKRYNNDATPLYIQKGVCGSHGKKIRCNSTAYIYEQFIQLLVKHEDIEESPYVITPLDVKQECSKLCNNEGMPFCIQNDVCDPDGKKMNEECGQDYEGSYSKHTQAVHENLTDHLFIQLLIKHEDIEEGLHVVTPLDIKQESSKLCNNEVTPFCIHEEVCDPDEKTMKHNSDTYTGEECGKDGKESYSKHIQAVHDNLTDHLLIQYFVKHEDIEESPYTVTPLDIKQESSKLYSDEVTSVCILKQVCSPHGKKIKYNSRSAYTYEESGNGYWNNESYGKHIKAIHENLTKRLRTHTAEKAYTFCGKGFAHSTNLAAHMRTHTGEKAFRCLHCDRRFGTNAHLIRHREVCKGIASKEDYGIGSQSIQLLIKYEDIEESSYHVTPLDIKQECSKRYNNEITPVCVQKDVCGPHGKKVKHNSATYASEEGDDGYRNKKSKRIRTVPKNLEQHLRTHTAEKPYTFCGKGFAHSTNLAAHMRTHTGEKAFRCLHCDRRFGTNAHLIRHREVCKGIASKEDYGIGSQSIQLLIKYEDIEESSYHVTPLDIKQECSKRYNNEITPLCVQKDVCGPHGKKVKHNSATYASEEGDDGYRNKKSKRIRTEDIEESVYLVTPLDIKQECSERYINEIPPLCVQKDDCGPAEEKMKHNSATYTCEEGDDAYRIKESKRIRTVHKNLEQHLRTDAAEKLYACKFCEKEFDRSASLTIHLRTHTGEKPHTCQFCGKGFAHKFIQSSVKYEDIDEGSYIVTPLGIKQESSQRYNNEVTPLCFRKEVWDHCEENMIHNTTAYTCQECGNSHKTKESYDKHIQAVHENLTKRLRTHTCEKPFIQSPLKFEDIEEGPYIVTLSGIKQESLNLYNNEITPLRIQREGCGSRGKNMKHNGATHICEECVNGYRSKKRFIQPPVKFEDIEESPYVVTLTDKQQSSKLYNDEVTPLRIRKDIYGSYGKKMKHNDEAYACEEGGNGYRTKKSYDKHRQVVHDNFTMHLRTLTESPVKHEDVEEGPYFITPLDIIQDISKRYSNEVTPLSIKKGVYGPHGKKLNSATYTCEECGTSYRTKESYGEHMQAVHENLRTIIQSQVKNEDIEECAYVLTSLGIEQDNLKHINDITYMCDKCECDYFRLSDLKFHQLIQHSTLNFLCDKCGRGFEIKFYLKRHIATHDGNKEYVCEICSAILHHKDSYCRHMERHQDHQAIGRHECDVCGKSLSSPNALLVHKRIHTGEKPFVCAICDKLFRTKQMVQLHIRTHTKEKLHVYKLCEKDFAQTTQLTKHLRVHTGEKPALHMFIYSPVKGEDIEEGSYIVTPLEIKQES
SKGYPNEVTASCILKKVWDRYEEMNHIRAANTPEESGHGDKTKETKYLCKFCWAVFHQKFIYSPVKGEDIEEGPYIVTPLDIKQESSKGYGNEVTASCILKKVWDRYEEMNHNSAANTSEESGHGNKTKEMKFIQFPIKMEDIKESLHVITSLDIKQQHSTRHDNEALCMKKGVWDRHEEKVKHNDEELLQTAVKYEDIEEGPYVLTPLDIKQEIRKRHNNGLMQLCIQKDDRGLHETKMKCSPLDAEQENSEQRNNGITPLCVKKEIWNRHEKKLKRNGAAYTCGQSENNMKARVSCDRYEKAQHKMHSQYSHSNGKPFLQALVKNEDIEEGPYVITPLDAEQENSKQGNNGLTPLFLQALAKEEDVEEGPYVIIPLDAEQENSKQHNMTPFSVKKEVWNRHEEKLERNGAAYACEQDENSVKAPHVSRLIWSQVKHENIEEGPYVVIPLNMKQENSERRHRETNHLYTSEGHVESFNIRRYETNQDYKCEFCEKHFASKISLANHRTQHATGLKCESSYPRLCDLENHQSIYQDTLRLSCVKCGKVFKSRKYLSQHMATHRPKTEEYTCEVCATVLHHKKSYYRHMARHQGKGQHQCDICGKSLSRAEYLAPHKRIHSGEKPFVCTICERAFTTKRLLVLHIRTHAKQKPH
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -