Tuni066734.2
Basic Information
- Insect
- Thereva unica
- Gene Symbol
- -
- Assembly
- GCA_949987705.1
- Location
- OX465285.1:95381885-95406328[+]
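The location string above packs a sequence accession, a coordinate range, and a strand into one token. A minimal sketch of splitting it apart follows. The helper name `parse_location` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re

# Assumed layout: "seqid:start-end[strand]", as seen in this record.
LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text):
    """Split a location string into its parts (hypothetical helper)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start = int(match["start"])
    end = int(match["end"])
    return {
        "seqid": match["seqid"],
        "start": start,
        "end": end,
        "strand": match["strand"],
        # Assumption: coordinates are 1-based and inclusive.
        "span": end - start + 1,
    }


loc = parse_location("OX465285.1:95381885-95406328[+]")
print(loc["seqid"], loc["strand"], loc["span"])
```

Under the inclusive-coordinate assumption, the locus spans 24,444 bp on the plus strand of OX465285.1.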
Transcription Factor Domain
- TF Family
- E2F
- Domain
- E2F_TDP domain
- PFAM
- PF02319
- TF Group
- Helix-turn-helix
- Description
- This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in association with TDP1/2, the heterodimer having increased binding efficiency. The crystal structure of an E2F4-DP2-DNA complex shows that the DNA-binding domains of the E2F and DP proteins both have a fold related to the winged-helix DNA-binding motif. Recognition of the central c/gGCGCg/c sequence of the consensus DNA-binding site is symmetric, and amino acids that contact these bases are conserved among all known E2F and DP proteins.
- Hmmscan Out
-
 #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
 1  35      0.16     3e+03    0.0   0.0        12      43       165     199       162     201  0.73
 2  35     0.017   3.2e+02    3.2   0.0        12      43       352     386       349     388  0.73
 3  35     0.013   2.4e+02    3.6   0.0        12      43       539     573       536     575  0.75
 4  35     0.017   3.2e+02    3.2   0.0        12      43       726     760       723     762  0.73
 5  35     0.017   3.2e+02    3.2   0.0        12      43       913     947       910     949  0.73
 6  35     0.017   3.2e+02    3.2   0.0        12      43      1100    1134      1097    1136  0.73
 7  35     0.017   3.2e+02    3.2   0.0        12      43      1287    1321      1284    1323  0.73
 8  35     0.017   3.2e+02    3.2   0.0        12      43      1474    1508      1471    1510  0.73
 9  35     0.017   3.2e+02    3.2   0.0        12      43      1661    1695      1658    1697  0.73
10  35     0.017   3.2e+02    3.2   0.0        12      43      1848    1882      1845    1884  0.73
11  35     0.017   3.2e+02    3.2   0.0        12      43      2035    2069      2032    2071  0.73
12  35     0.017   3.2e+02    3.2   0.0        12      43      2222    2256      2219    2258  0.73
13  35     0.017   3.2e+02    3.2   0.0        12      43      2409    2443      2406    2445  0.73
14  35     0.017   3.2e+02    3.2   0.0        12      43      2596    2630      2593    2632  0.73
15  35     0.017   3.2e+02    3.2   0.0        12      43      2783    2817      2780    2819  0.73
16  35     0.017   3.2e+02    3.2   0.0        12      43      2970    3004      2967    3006  0.73
17  35     0.017   3.2e+02    3.2   0.0        12      43      3157    3191      3154    3193  0.73
18  35     0.017   3.2e+02    3.2   0.0        12      43      3344    3378      3341    3380  0.73
19  35     0.015   2.8e+02    3.3   0.0        12      43      3531    3565      3528    3567  0.74
20  35     0.015   2.8e+02    3.3   0.0        20      43      3729    3752      3713    3754  0.72
21  35     0.017   3.2e+02    3.2   0.0        12      43      3905    3939      3902    3941  0.73
22  35     0.017   3.2e+02    3.2   0.0        12      43      4092    4126      4089    4128  0.73
23  35     0.017   3.2e+02    3.2   0.0        12      43      4279    4313      4276    4315  0.73
24  35     0.017   3.2e+02    3.2   0.0        12      43      4392    4426      4389    4428  0.73
25  35     0.015   2.8e+02    3.3   0.0        20      43      4590    4613      4574    4615  0.72
26  35     0.017   3.2e+02    3.2   0.0        12      43      4766    4800      4763    4802  0.73
27  35     0.017   3.2e+02    3.2   0.0        12      43      4953    4987      4950    4989  0.73
28  35     0.017   3.2e+02    3.2   0.0        12      43      5140    5174      5137    5176  0.73
29  35     0.017   3.2e+02    3.2   0.0        12      43      5327    5361      5324    5363  0.73
30  35     0.017   3.2e+02    3.2   0.0        12      43      5514    5548      5511    5550  0.73
31  35     0.017   3.2e+02    3.2   0.0        12      43      5701    5735      5698    5737  0.73
32  35     0.024   4.6e+02    2.6   0.0        21      43      5900    5922      5885    5924  0.78
33  35     0.017   3.2e+02    3.2   0.0        12      43      6075    6109      6072    6111  0.73
34  35     0.017   3.2e+02    3.2   0.0        12      43      6262    6296      6259    6298  0.73
35  35    0.0042        80    5.1   0.0        12      43      6449    6483      6446    6485  0.73
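The per-domain hits listed above are whitespace-separated records with thirteen fields each. A minimal sketch of reading them into dictionaries and measuring the distance between consecutive alignment starts follows. Only the first four rows are embedded here for brevity, and the field names and the `parse_rows` helper are illustrative choices, not part of HMMER.

```python
# First four rows of the hit table, copied from this record.
ROWS = """\
1 35 0.16 3e+03 0.0 0.0 12 43 165 199 162 201 0.73
2 35 0.017 3.2e+02 3.2 0.0 12 43 352 386 349 388 0.73
3 35 0.013 2.4e+02 3.6 0.0 12 43 539 573 536 575 0.75
4 35 0.017 3.2e+02 3.2 0.0 12 43 726 760 723 762 0.73
"""

# Illustrative field names and the type used to parse each column.
FIELDS = (
    ("index", int), ("of", int),
    ("c_evalue", float), ("i_evalue", float),
    ("score", float), ("bias", float),
    ("hmm_from", int), ("hmm_to", int),
    ("ali_from", int), ("ali_to", int),
    ("env_from", int), ("env_to", int),
    ("acc", float),
)


def parse_rows(text):
    """Turn whitespace-separated hit rows into a list of dicts."""
    hits = []
    for line in text.splitlines():
        tokens = line.split()
        if len(tokens) != len(FIELDS):
            continue  # skip blank or malformed lines
        hits.append({name: cast(tok) for (name, cast), tok in zip(FIELDS, tokens)})
    return hits


hits = parse_rows(ROWS)
starts = [hit["ali_from"] for hit in hits]
# Distance in residues between consecutive alignment starts.
spacing = [later - earlier for earlier, later in zip(starts, starts[1:])]
print(spacing)  # [187, 187, 187]
```

For these four rows the alignment starts are 187 residues apart. Running the same calculation over all 35 rows would show where the spacing deviates.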
Sequence Information
- Coding Sequence
- ATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGAAGATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGaaaaccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGCTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGAGATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGTCACCTTCAACTACGaaaaccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGGCCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGAGATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGaaaaccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACAGCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAACACCCCAGAAGCCACCTTCAACTACGaaaaccataaaataaatatgttggaGCTCGAC
ATTATAAACGACATTCAGAGCCTCGTCCACAAAAAGGGTATGCCAGCTtctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAATTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGaaaaccataaaataaatatgttggaGCTCGCCATTATAAACGACATTCAGAGCCTCGTCCACAAAAAGGGTATGCCAGCTtctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAATTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGAGATACAAGAGGAGCTCAAGTTAGAGAAAAATCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAGCCATAATATAAATATGTTGGAGCTCGCCATTATAAACGATATTCAGAGCCTCGTCCACAAAAAGGATATGCCAGCTtctgaaaaacgaaaaatgaaGAACGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAgccataaaataaatatgttggaGCTCGCCATTATAAACGACATTCAGAGCCTCGTCCACAAAAAGGGTATGCCAGCTtctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAATTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGA
CAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAgccataaaataaatatgttggagctcgccattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaatgaaGAACGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTATGAGTACAAGGAGATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGaaaaccataaaataaatatgttggaGCTCGCCATTATAAACGACATTCAGAGCCTCGTCCACAAAAAGGGTATGGCAGCTtctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAgccataaaataaatatgttggagctcgccattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaatgaaGAACGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGAGATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGaaaaccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaataaagaacG
TGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTATGATTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAGGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAgccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaatgaaGAACGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTATGATTACAAGGACATACAAGAGGAGCTCAAGATAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGaaaaccataaaataaatatgttggaGCTCGCCATTATAAACGACATTCAGAGCCTCGTCCACAAAAAGGGTATGCCAGCTtctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAATTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGAGATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAgccataaaataaatatgttggagctcgccattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaatgaaGAACGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAA
GATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAgccataaaataaatatgttggagctcgccattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaatgaaGAACGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGTGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTATGAGTACAAGGAGATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATAACCCAGAAGCCACCTTCAACTACGaaaaccataaaataaatatgttggaGCTCGCCATTATAAACGACATTCAGAGCCTCGTCCACAAAAAGGGTATGCCAGCTtctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAgccataaaataaatatgttggagctcgccattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaatgaaGAACGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGAGATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGaaaaccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATT
GGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGAGATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCACGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGaaaaccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGAAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTATGATTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAGGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAgccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAACTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTATGATTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAGGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAgccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaatgaaGAACGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTATGATTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAAT
CTCAAGAGAAGACCCTTCGCGAGACCATGAAGGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAgccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaatgaaGAACGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGAGATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGaaaaccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTATGATTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAGGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAgccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAACTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTATGATTACAAGGACATACAA
GAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAGGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAgccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaatgaaGAACGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTATGATTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAGGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAgccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTATGATTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAGGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAgccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaatgaaGAACGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTATGATTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAGGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAgccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaatgaaGAACGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACA
CCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTATGATTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGaaaaccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTGTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGAGATACAAGAGGAGCTCAAGTTAGAGAAATCACAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGaaaaccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGATGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTGTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGAGATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGaaaaccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAATACCCCGAGAAGGTCATTGAAAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACAACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCTAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGAGATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAG
CCACCTTCAACTACGaaaaccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGAGATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGTGAGACCATGAAGGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAgccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAAAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGAGATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGaaaaccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCAAGAAGGTCATTGAGAACCCGACTACAATAGAAGAATTTTTGGAAAATCTGATAAAGGCCTTCAGAAGAATTTACGATATTAAGGAATCTGGCGTTATATCGAAACAATTTACTGAAAATGCAACACATGAGCTAATCGATATTGAGCATGGAGGTACAGATCGTCATGGTATGCACATTCAAGTGGTACAGAGTGTTCTTTACCCTCGAAATTACAGTGAActacaaaaagaatttaaaagcGTGAAGACACGCGATGAAACTCTTTTAGATGAGAAGAAGGAGAAGAATAATGAAGTTGCCAGCTTCAACTATGAAAATCACAAATTGACTGTGTTGGAAATTGACATTTTAAAAGACTTACAAAGCCTTGTTCAGGCAAACAAAACACCAGCGTCAGAAAAGAGTAAGATCAAGAACTTGATAGAAAAGAAGATGAAAGAACTTCGAGAAGCGCACAGTAACTTCAAGGAAAAGTTCCCTAGAGACCGTTCAAGTCATGCAATTTTGAAACGAGTAGAATTTTTGGTAAAGGAAGCCAAGTCTTACGAATTTTGGGAGGTTGAACACTTGAAGACGGAGGATCCCGTTGAAACTGTTGAAGGATCGTCTTTGAATTCATTGGATAAA
GTTAAAGAAATGTTcgaaaaagaagtaagaaaagtATTCCAAATTCAAGAACCTTTAGATATTTTAGTCGAAACCACTAGAAATTGTACACATTCTGTAACCACCTTTAAAACTCAAGGGGTTGAAAGGCAAGCCATTGATTTTCAAGTTTTACCTGCAGCAACTATTCGCCGTATGAGTTTGGCTGAGTATAGGAAACTGCAGAAGCAGTATGAAAACGAAAAATCGCAGGAAAGTCTACTGCtggaaaaaagaaaggaaaatcgATCtattgaagcgtacttcaactATGAAATGCACAGAGTGAGTATGATGGAGTTGGACATAATCACAATCGTTCAAAGTATTATTCATGAAACTAATATGTCGACCTCGCAAATGACTGAAATTAAACGAGAGGTGGAGCAGAAGTCCACGGATCTACGAAATCTTTATGCAAGGTTTTATGAAGTGTTCCCAAGGAACAAACTGCAAAACCCGAGATGGCGGCTGATTATGACTTTAGTGGAATCTTTTGTCGACTCTGAAACTACAACGGCAGAACCCAATAACCTAGTTGATGTAACTATGAATAACGGAGTGACTATATCAAAACAAGTCGTTCAAATTAGTGAAAAACCGTCAATTAAGACGATAGAAGACGCCAAGCAATTCCTGGAAGATGAAGTTAgaaaaatttatcaaatttcAGGACCTATGATTGTAACAGAAAAGCACATTGAAAATCGAACTCACCTGGTTGTTGAATGTGAAATACGAGGATTGGAAAGGTTGGCGAAAGAACTTACAGTAAAACCGCATTCCAGGATTCGTTGCATTGATGGCTCGGAGTACACTGGAATGCTAGAGCAGTTCGAAGCAGAAAAGGCTCGAGAGAAAGAGTTAAAATTCAATATGATAAATAACCGTTCTGAGGACGCGACATTTGAATATCAAAGACACGTTCTAAGTATTTTAGAATTTGATATATTAGTGAACATTCAAAGTATAGCTCACAACAAGGATATGTCGGCAACTGGAAAGGATAAAATTGAGAAATACGTTGAGCAAAAACTTAGAGAACTTTCGGAACTCTACGAGAAATTCTATAAAAGATTTCCAAGGAGTAATTTGAAAAAACCGAAATGGCGACAAATTAAAGTGTTCATTTGGCTTAAACCAACAACAGACTATGAAGTTACGTCTGAAACTTCAAAAGTAACTGATCTTGAAACAGCAACTGGAGGAGAGAACATTGTCTCAGAACAGCCCCCAATAGATCAGCCTGCCCTGGAAATTCCACAGCAAGGACCAATGTTTTCGCCCGAACAACAACCTGAGTCGACTATGGTAACAGAACCGTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCTATCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCATCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCATCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCAACCCCAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTAAAGAGCTATCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCATCCACAGAGGCTCCAGAAAAACAACCTCAGTCGACTACTGTAAAAGAGCCATCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAACAGAACCGTCCACAGAGGCTCCAGAAAAACAACCTCAGTCGGCTACTGTAACAAAACCGTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCAACCCCAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCATCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCTTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCC
GTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGCAAAAGAGCCGTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCTTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCGTCCACAGAGGCTCCAGAAAAACAACCTCAGTCGGCTACTGTAACAAAACCGTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCAACCCCAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCATCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCTTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCGTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCATCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCTTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCGTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCTTTCACAGAGGCTCCACAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCGTCCACAGAGGCTCCAGAAAAACAACCTCAGTCGGCTACTGTAACAAAACCGTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCAACCCCAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCATCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCTTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCGTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCATCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCTTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCGTCCACAGAGGCTCCAGAAAAACAACCTCAGTCGGCTACTGTAACAAAACCGTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTATAACGGAACAGCCCACAGCCCCTCCAAAACAACAATCGGAGTCGGCTAGTATAACAGAACCGTCCACAGAGGCTCCAGAAAAACTACCTGAGTCGGCTACTATAACGGAACCGTCCACAGCCCCTCCAAAACAACAATCTGAGTCGACTAGTATAACAAAACCGTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTATAACGGAACCGTCCACAGCCCCTCCAAAACAACAATCTGAGTCGGCTAGTATAACAGAACCGTTCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTATAACGGAACCGTCCACAGATGCCCCAGATTCGACCCCATCTAGAGACTGCGAGGGATCTGCTATTGCTTGCGTTTCACTTACACCTGATTCAGATGCCTTGATCAGTCCTGTGATTGATTTAAATGGTTATGGAAACGTAGAAAATGTAGATCAATACCCTCCAATATATGACGGTAGTCCCTACTGGCCACCTCAGTACCCAGACGAGTATCCCGAAATGTACCATCCGGAGTCACCTCAGTATGCAGTAGACTATCCTGCATATGGTGAAATGTACCATCCAGCGCCTCCTCAGTATCCAGAACAGTATCCTGCCTACAGTGAAATGTATCACCCGTCATATCTTCCCCAGGAATCACCGTATTATCCCATTTATGACCATTACCCAGTTCTAGATGGCTACCAACCTCCGCCCTATCCTTATGACTACACAGGTTATCCTCACCAGCTGCCAGATGAAGCTTATCTAAATACTTACCCAGAACAATATCCATCCGTCATCAGTCCAAGTGCAGAAGTATTGCCAGATG
ACTTACAATGCGTTATCCGACTATTGTCAAGCTTTTGCAGTTGGACAAAGGACAAGAGTCAACTTCATATATCGGAGCAGTGCTCACATGTTGAGAATCTGAATTCAGAAGAAGTACTTACAATCGCCCATAATTTACGAGAAAATCTGCAAGCATTTTGCAGCTATGTAGGTTTTAGGCATGCTGATTTGAGGTATGCCTGCAGCAATGTCTTGGAATTCTAA
- Protein Sequence
- MLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKKIQEELKLEKSQEKTLRETMKAKNTPEATFNYENHKINMLELDIINDIQSLVHKKDMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQAYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKEIQEELKLEKSQEKTLRETMKAKNTPEVTFNYENHKINMLELDIINDIQSLVHKKDMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKEIQEELKLEKSQEKTLRETMKAKNTPEATFNYENHKINMLELDIINDIQSLVHKKDMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEQPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYENHKINMLELDIINDIQSLVHKKGMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYENHKINMLELAIINDIQSLVHKKGMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKEIQEELKLEKNQEKTLRETMKAKNTPEATFNYESHNINMLELAIINDIQSLVHKKDMPASEKRKMKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYESHKINMLELAIINDIQSLVHKKGMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYESHKINMLELAIINDIQSLVHKKDMPASEKRKMKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSYEYKEIQEELKLEKSQEKTLRETMKAKNTPEATFNYENHKINMLELAIINDIQSLVHKKGMAASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYESHKINMLELAIINDIQSLVHKKDMPASEKRKMKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKEIQEELKLEKSQEKTLRETMKAKNTPEATFNYENHKINMLELDIINDIQSLVHKKDMPASEKRKIK
NVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSYDYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYESHKINMLELDIINDIQSLVHKKDMPASEKRKMKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSYDYKDIQEELKIEKSQEKTLRETMKAKNTPEATFNYENHKINMLELAIINDIQSLVHKKGMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKEIQEELKLEKSQEKTLRETMKAKNTPEATFNYESHKINMLELAIINDIQSLVHKKDMPASEKRKMKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYESHKINMLELAIINDIQSLVHKKDMPASEKRKMKNVVEKKKDELKKLYERFNVQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSYEYKEIQEELKLEKSQEKTLRETMKAKNNPEATFNYENHKINMLELAIINDIQSLVHKKGMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYESHKINMLELAIINDIQSLVHKKDMPASEKRKMKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKEIQEELKLEKSQEKTLRETMKAKNTPEATFNYENHKINMLELDIINDIQSLVHKKDMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKEIQEELKLEKSQEKTLRETTKAKNTPEATFNYENHKINMLELDIINDIQSLVHKKDMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLKHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSYDYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYESHKINMLELDIINDIQSLVHKKDMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQLKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSYDYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYESHKINMLELDIINDIQSLVHKKDMPASEKRKMKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSYDYKDIQEELKLE
KSQEKTLRETMKAKNTPEATFNYESHKINMLELDIINDIQSLVHKKDMPASEKRKMKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKEIQEELKLEKSQEKTLRETMKAKNTPEATFNYENHKINMLELDIINDIQSLVHKKDMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSYDYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYESHKINMLELDIINDIQSLVHKKDMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQLKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSYDYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYESHKINMLELDIINDIQSLVHKKDMPASEKRKMKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSYDYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYESHKINMLELDIINDIQSLVHKKDMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSYDYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYESHKINMLELDIINDIQSLVHKKDMPASEKRKMKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSYDYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYESHKINMLELDIINDIQSLVHKKDMPASEKRKMKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSYDYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYENHKINMLELDIINDIQSLVHKKDMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVCDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKEIQEELKLEKSQEKTLRETMKAKNTPEATFNYENHKINMLELDIINDIQSLVHKKDMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVCDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKEIQEELKLEKSQEKTLRETMKAKNTPEATFNYENHKINMLELDIINDIQSLVHKKDMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEYPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKEIQEELKLEKSQEKTLRETMKAKNTP
EATFNYENHKINMLELDIINDIQSLVHKKDMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKEIQEELKLEKSQEKTLRETMKAKNTPEATFNYESHKINMLELDIINDIQSLVHKKDMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKEIQEELKLEKSQEKTLRETMKAKNTPEATFNYENHKINMLELDIINDIQSLVHKKDMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPKKVIENPTTIEEFLENLIKAFRRIYDIKESGVISKQFTENATHELIDIEHGGTDRHGMHIQVVQSVLYPRNYSELQKEFKSVKTRDETLLDEKKEKNNEVASFNYENHKLTVLEIDILKDLQSLVQANKTPASEKSKIKNLIEKKMKELREAHSNFKEKFPRDRSSHAILKRVEFLVKEAKSYEFWEVEHLKTEDPVETVEGSSLNSLDKVKEMFEKEVRKVFQIQEPLDILVETTRNCTHSVTTFKTQGVERQAIDFQVLPAATIRRMSLAEYRKLQKQYENEKSQESLLLEKRKENRSIEAYFNYEMHRVSMMELDIITIVQSIIHETNMSTSQMTEIKREVEQKSTDLRNLYARFYEVFPRNKLQNPRWRLIMTLVESFVDSETTTAEPNNLVDVTMNNGVTISKQVVQISEKPSIKTIEDAKQFLEDEVRKIYQISGPMIVTEKHIENRTHLVVECEIRGLERLAKELTVKPHSRIRCIDGSEYTGMLEQFEAEKAREKELKFNMINNRSEDATFEYQRHVLSILEFDILVNIQSIAHNKDMSATGKDKIEKYVEQKLRELSELYEKFYKRFPRSNLKKPKWRQIKVFIWLKPTTDYEVTSETSKVTDLETATGGENIVSEQPPIDQPALEIPQQGPMFSPEQQPESTMVTEPSTEAPEKQPESTTVKELSTEAPEKQPESTTVKEPSTEAPEKQPESTTVKEPSTEAPEKQPESTTVKEPTPEAPEKQPESTTKELSTEAPEKQPESTTVKEPSTEAPEKQPQSTTVKEPSTEAPEKQPESTTVTEPSTEAPEKQPQSATVTKPSTEAPEKQPESTTVKEPTPEAPEKQPESTTVKEPSTEAPEKQPESTTVKEPSTEAPEKQPESTTVKEPSTEAPEKQPESTTAKEPSTEAPEKQPESTTVKEPSTEAPEKQPESTTVKEPSTEAPEKQPQSATVTKPSTEAPEKQPESTTVKEPTPEAPEKQPESTTVKEPSTEAPEKQPESTTVKEPSTEAPEKQPESTTVKEPSTEAPEKQPESTTVKEPSTEAPEKQPESTTVKEPSTEAPEKQPESTTVKEPSTEAPEKQPESTTVKEPFTEAPQKQPESTTVKEPSTEAPEKQPQSATVTKPSTEAPEKQPESTTVKEPTPEAPEKQPESTTVKEPSTEAPEKQPESTTVKEPSTEAPEKQPESTTVKEPSTEAPEKQPESTTVKEPSTEAPEKQPESTTVKEPSTEAPEKQPESTTVKEPSTEAPEKQPQSATVTKPSTEAPEKQPESTTITEQPTAPPKQQSESASITEPSTEAPEKLPESATITEPSTAPPKQQSESTSITKPSTEAPEKQPESTTITEPSTAPPKQQSESASITEPFTEAPEKQPESTTITEPSTDAPDSTPSRDCEGSAIACVSLTPDSDALISPVIDLNGYGNVENVDQYPPIYDGSPYWPPQYPDEYPEMYHPESPQYAVDYPAYGEMYHPAPPQYPEQYPAYSEMYHPSYLPQESPYYPIYDHYPVLDGYQPPPYPYDYTGYPHQLPDEAYLNTYPEQYPSVISPSAEVLP
DDLQCVIRLLSSFCSWTKDKSQLHISEQCSHVENLNSEEVLTIAHNLRENLQAFCSYVGFRHADLRYACSNVLEF
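The coding and protein sequences above can be cross-checked by translating the coding sequence. A minimal sketch follows, assuming the standard genetic code (the record does not name a translation table). It upper-cases the input because the coding sequence in this record mixes upper and lower case. Only the first and last few codons are used here; the `translate` helper is illustrative.

```python
from itertools import product

# Standard genetic code (assumed), laid out in the conventional
# T, C, A, G base order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with non-ACGT bases
        for i in range(0, usable, 3)
    )


# First 11 codons and last 6 codons of the coding sequence in this record.
head = translate("ATGCTGGCCGCCGAAGTCAGGCAAGTTTACGAT")
tail = translate("AATGTCTTGGAATTCTAA")
print(head)  # MLAAEVRQVYD
print(tail)  # NVLEF*
```

Both fragments agree with the start (`MLAAEVRQVYD`) and end (`NVLEF`) of the protein sequence, with the final `TAA` read as a stop codon.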
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -