Basic Information

Insect
Thereva unica
Gene Symbol
-
Assembly
GCA_949987705.1
Location
OX465285.1:95381885-95406328[+]

Transcription Factor Domain

TF Family
E2F
Domain
E2F_TDP domain
PFAM
PF02319
TF Group
Helix-turn-helix
Description
This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in association with TDP1/2, the heterodimer having increased binding efficiency. The crystal structure of an E2F4-DP2-DNA complex shows that the DNA-binding domains of the E2F and DP proteins both have a fold related to the winged-helix DNA-binding motif. Recognition of the central c/gGCGCg/c sequence of the consensus DNA-binding site is symmetric, and amino acids that contact these bases are conserved among all known E2F and DP proteins.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 35 0.16 3e+03 0.0 0.0 12 43 165 199 162 201 0.73
2 35 0.017 3.2e+02 3.2 0.0 12 43 352 386 349 388 0.73
3 35 0.013 2.4e+02 3.6 0.0 12 43 539 573 536 575 0.75
4 35 0.017 3.2e+02 3.2 0.0 12 43 726 760 723 762 0.73
5 35 0.017 3.2e+02 3.2 0.0 12 43 913 947 910 949 0.73
6 35 0.017 3.2e+02 3.2 0.0 12 43 1100 1134 1097 1136 0.73
7 35 0.017 3.2e+02 3.2 0.0 12 43 1287 1321 1284 1323 0.73
8 35 0.017 3.2e+02 3.2 0.0 12 43 1474 1508 1471 1510 0.73
9 35 0.017 3.2e+02 3.2 0.0 12 43 1661 1695 1658 1697 0.73
10 35 0.017 3.2e+02 3.2 0.0 12 43 1848 1882 1845 1884 0.73
11 35 0.017 3.2e+02 3.2 0.0 12 43 2035 2069 2032 2071 0.73
12 35 0.017 3.2e+02 3.2 0.0 12 43 2222 2256 2219 2258 0.73
13 35 0.017 3.2e+02 3.2 0.0 12 43 2409 2443 2406 2445 0.73
14 35 0.017 3.2e+02 3.2 0.0 12 43 2596 2630 2593 2632 0.73
15 35 0.017 3.2e+02 3.2 0.0 12 43 2783 2817 2780 2819 0.73
16 35 0.017 3.2e+02 3.2 0.0 12 43 2970 3004 2967 3006 0.73
17 35 0.017 3.2e+02 3.2 0.0 12 43 3157 3191 3154 3193 0.73
18 35 0.017 3.2e+02 3.2 0.0 12 43 3344 3378 3341 3380 0.73
19 35 0.015 2.8e+02 3.3 0.0 12 43 3531 3565 3528 3567 0.74
20 35 0.015 2.8e+02 3.3 0.0 20 43 3729 3752 3713 3754 0.72
21 35 0.017 3.2e+02 3.2 0.0 12 43 3905 3939 3902 3941 0.73
22 35 0.017 3.2e+02 3.2 0.0 12 43 4092 4126 4089 4128 0.73
23 35 0.017 3.2e+02 3.2 0.0 12 43 4279 4313 4276 4315 0.73
24 35 0.017 3.2e+02 3.2 0.0 12 43 4392 4426 4389 4428 0.73
25 35 0.015 2.8e+02 3.3 0.0 20 43 4590 4613 4574 4615 0.72
26 35 0.017 3.2e+02 3.2 0.0 12 43 4766 4800 4763 4802 0.73
27 35 0.017 3.2e+02 3.2 0.0 12 43 4953 4987 4950 4989 0.73
28 35 0.017 3.2e+02 3.2 0.0 12 43 5140 5174 5137 5176 0.73
29 35 0.017 3.2e+02 3.2 0.0 12 43 5327 5361 5324 5363 0.73
30 35 0.017 3.2e+02 3.2 0.0 12 43 5514 5548 5511 5550 0.73
31 35 0.017 3.2e+02 3.2 0.0 12 43 5701 5735 5698 5737 0.73
32 35 0.024 4.6e+02 2.6 0.0 21 43 5900 5922 5885 5924 0.78
33 35 0.017 3.2e+02 3.2 0.0 12 43 6075 6109 6072 6111 0.73
34 35 0.017 3.2e+02 3.2 0.0 12 43 6262 6296 6259 6298 0.73
35 35 0.0042 80 5.1 0.0 12 43 6449 6483 6446 6485 0.73

Sequence Information

Coding Sequence
ATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGAAGATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGaaaaccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGCTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGAGATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGTCACCTTCAACTACGaaaaccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGGCCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGAGATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGaaaaccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACAGCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAACACCCCAGAAGCCACCTTCAACTACGaaaaccataaaataaatatgttggaGCTCGACATTATAAACGACATTCAGAGCCTCGTCCACAAAAAGGGTATGCCAGCTtctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAATTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGaaaaccataaaataaatatgttggaGCTCGCCATTATAAACGACATTCAGAGCCTCGTCCACAAAAAGGGTATGCCAGCTtctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAATTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGAGATACAAGAGGAGCTCAAGTTAGAGAAAAATCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAGCCATAATATAAATATGTTGGAGCTCGCCATTATAAACGATATTCAGAGCCTCGTCCACAAAAAGGATATGCCAGCTtctgaaaaacgaaaaatgaaGAACGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAgccataaaataaatatgttggaGCTCGCCATTATAAACGACATTCAGAGCCTCGTCCACAAAAAGGGTATGCCAGCTtctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAATTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAgccataaaataaatatgttggagctcgccattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaatgaaGAACGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTATGAGTACAAGGAGATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGaaaaccataaaataaatatgttggaGCTCGCCATTATAAACGACATTCAGAGCCTCGTCCACAAAAAGGGTATGGCAGCTtctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAgccataaaataaatatgttggagctcgccattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaatgaaGAACGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGAGATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGaaaaccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTATGATTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAGGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAgccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaatgaaGAACGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTATGATTACAAGGACATACAAGAGGAGCTCAAGATAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGaaaaccataaaataaatatgttggaGCTCGCCATTATAAACGACATTCAGAGCCTCGTCCACAAAAAGGGTATGCCAGCTtctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAATTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGAGATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAgccataaaataaatatgttggagctcgccattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaatgaaGAACGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAgccataaaataaatatgttggagctcgccattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaatgaaGAACGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGTGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTATGAGTACAAGGAGATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATAACCCAGAAGCCACCTTCAACTACGaaaaccataaaataaatatgttggaGCTCGCCATTATAAACGACATTCAGAGCCTCGTCCACAAAAAGGGTATGCCAGCTtctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAgccataaaataaatatgttggagctcgccattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaatgaaGAACGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGAGATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGaaaaccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGAGATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCACGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGaaaaccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGAAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTATGATTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAGGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAgccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAACTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTATGATTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAGGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAgccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaatgaaGAACGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTATGATTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAGGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAgccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaatgaaGAACGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGAGATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGaaaaccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTATGATTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAGGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAgccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAACTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTATGATTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAGGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAgccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaatgaaGAACGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTATGATTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAGGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAgccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTATGATTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAGGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAgccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaatgaaGAACGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTATGATTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAGGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAgccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaatgaaGAACGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTATGATTACAAGGACATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGaaaaccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTGTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGAGATACAAGAGGAGCTCAAGTTAGAGAAATCACAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGaaaaccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGATGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTGTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGAGATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGaaaaccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAATACCCCGAGAAGGTCATTGAAAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACAACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCTAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGAGATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGaaaaccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAGAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGAGATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGTGAGACCATGAAGGCTAAGAATACCCCAGAAGCCACCTTCAACTACGAAAgccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCGAGAAGGTCATTGAAAACCCGACTACAATAGAGGAAGCTCAAAAAATGCTGGCCGCCGAAGTCAGGCAAGTTTACGATATTGAGGGACCCATGATCATTACGACCAAAGTTTCTGACAATGCCACGCACCACCTGAACAAATTGGAAGTCAGAGGAATGGAGCGTCCAGCGGCCTACTTCGAAGTCGAAGATGAATCCAAGATTCAGGTATTGCCTAGCTCTGAGTACAAGGAGATACAAGAGGAGCTCAAGTTAGAGAAATCTCAAGAGAAGACCCTTCGCGAGACCATGAAAGCTAAGAATACCCCAGAAGCCACCTTCAACTACGaaaaccataaaataaatatgttggagctcgacattataaacgacattcagagcctcgtccacaaaaaggatatgccagcttctgaaaaacgaaaaataaagaacGTGGTAGAAAAGAAGAAGGACGAGCTGAAGAAACTGTACGAGCGATTCAATGAGCAGTTCCCACGAAATACATTGGAAGAACCAAGATGGAAGCAAGTGAAAGTGTTTGCTGAAAAACTGGAACACCCCAAGAAGGTCATTGAGAACCCGACTACAATAGAAGAATTTTTGGAAAATCTGATAAAGGCCTTCAGAAGAATTTACGATATTAAGGAATCTGGCGTTATATCGAAACAATTTACTGAAAATGCAACACATGAGCTAATCGATATTGAGCATGGAGGTACAGATCGTCATGGTATGCACATTCAAGTGGTACAGAGTGTTCTTTACCCTCGAAATTACAGTGAActacaaaaagaatttaaaagcGTGAAGACACGCGATGAAACTCTTTTAGATGAGAAGAAGGAGAAGAATAATGAAGTTGCCAGCTTCAACTATGAAAATCACAAATTGACTGTGTTGGAAATTGACATTTTAAAAGACTTACAAAGCCTTGTTCAGGCAAACAAAACACCAGCGTCAGAAAAGAGTAAGATCAAGAACTTGATAGAAAAGAAGATGAAAGAACTTCGAGAAGCGCACAGTAACTTCAAGGAAAAGTTCCCTAGAGACCGTTCAAGTCATGCAATTTTGAAACGAGTAGAATTTTTGGTAAAGGAAGCCAAGTCTTACGAATTTTGGGAGGTTGAACACTTGAAGACGGAGGATCCCGTTGAAACTGTTGAAGGATCGTCTTTGAATTCATTGGATAAAGTTAAAGAAATGTTcgaaaaagaagtaagaaaagtATTCCAAATTCAAGAACCTTTAGATATTTTAGTCGAAACCACTAGAAATTGTACACATTCTGTAACCACCTTTAAAACTCAAGGGGTTGAAAGGCAAGCCATTGATTTTCAAGTTTTACCTGCAGCAACTATTCGCCGTATGAGTTTGGCTGAGTATAGGAAACTGCAGAAGCAGTATGAAAACGAAAAATCGCAGGAAAGTCTACTGCtggaaaaaagaaaggaaaatcgATCtattgaagcgtacttcaactATGAAATGCACAGAGTGAGTATGATGGAGTTGGACATAATCACAATCGTTCAAAGTATTATTCATGAAACTAATATGTCGACCTCGCAAATGACTGAAATTAAACGAGAGGTGGAGCAGAAGTCCACGGATCTACGAAATCTTTATGCAAGGTTTTATGAAGTGTTCCCAAGGAACAAACTGCAAAACCCGAGATGGCGGCTGATTATGACTTTAGTGGAATCTTTTGTCGACTCTGAAACTACAACGGCAGAACCCAATAACCTAGTTGATGTAACTATGAATAACGGAGTGACTATATCAAAACAAGTCGTTCAAATTAGTGAAAAACCGTCAATTAAGACGATAGAAGACGCCAAGCAATTCCTGGAAGATGAAGTTAgaaaaatttatcaaatttcAGGACCTATGATTGTAACAGAAAAGCACATTGAAAATCGAACTCACCTGGTTGTTGAATGTGAAATACGAGGATTGGAAAGGTTGGCGAAAGAACTTACAGTAAAACCGCATTCCAGGATTCGTTGCATTGATGGCTCGGAGTACACTGGAATGCTAGAGCAGTTCGAAGCAGAAAAGGCTCGAGAGAAAGAGTTAAAATTCAATATGATAAATAACCGTTCTGAGGACGCGACATTTGAATATCAAAGACACGTTCTAAGTATTTTAGAATTTGATATATTAGTGAACATTCAAAGTATAGCTCACAACAAGGATATGTCGGCAACTGGAAAGGATAAAATTGAGAAATACGTTGAGCAAAAACTTAGAGAACTTTCGGAACTCTACGAGAAATTCTATAAAAGATTTCCAAGGAGTAATTTGAAAAAACCGAAATGGCGACAAATTAAAGTGTTCATTTGGCTTAAACCAACAACAGACTATGAAGTTACGTCTGAAACTTCAAAAGTAACTGATCTTGAAACAGCAACTGGAGGAGAGAACATTGTCTCAGAACAGCCCCCAATAGATCAGCCTGCCCTGGAAATTCCACAGCAAGGACCAATGTTTTCGCCCGAACAACAACCTGAGTCGACTATGGTAACAGAACCGTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCTATCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCATCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCATCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCAACCCCAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTAAAGAGCTATCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCATCCACAGAGGCTCCAGAAAAACAACCTCAGTCGACTACTGTAAAAGAGCCATCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAACAGAACCGTCCACAGAGGCTCCAGAAAAACAACCTCAGTCGGCTACTGTAACAAAACCGTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCAACCCCAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCATCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCTTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCGTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGCAAAAGAGCCGTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCTTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCGTCCACAGAGGCTCCAGAAAAACAACCTCAGTCGGCTACTGTAACAAAACCGTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCAACCCCAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCATCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCTTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCGTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCATCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCTTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCGTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCTTTCACAGAGGCTCCACAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCGTCCACAGAGGCTCCAGAAAAACAACCTCAGTCGGCTACTGTAACAAAACCGTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCAACCCCAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCATCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCTTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCGTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCATCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCTTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTGTAAAAGAGCCGTCCACAGAGGCTCCAGAAAAACAACCTCAGTCGGCTACTGTAACAAAACCGTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTATAACGGAACAGCCCACAGCCCCTCCAAAACAACAATCGGAGTCGGCTAGTATAACAGAACCGTCCACAGAGGCTCCAGAAAAACTACCTGAGTCGGCTACTATAACGGAACCGTCCACAGCCCCTCCAAAACAACAATCTGAGTCGACTAGTATAACAAAACCGTCCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTATAACGGAACCGTCCACAGCCCCTCCAAAACAACAATCTGAGTCGGCTAGTATAACAGAACCGTTCACAGAGGCTCCAGAAAAACAACCTGAGTCGACTACTATAACGGAACCGTCCACAGATGCCCCAGATTCGACCCCATCTAGAGACTGCGAGGGATCTGCTATTGCTTGCGTTTCACTTACACCTGATTCAGATGCCTTGATCAGTCCTGTGATTGATTTAAATGGTTATGGAAACGTAGAAAATGTAGATCAATACCCTCCAATATATGACGGTAGTCCCTACTGGCCACCTCAGTACCCAGACGAGTATCCCGAAATGTACCATCCGGAGTCACCTCAGTATGCAGTAGACTATCCTGCATATGGTGAAATGTACCATCCAGCGCCTCCTCAGTATCCAGAACAGTATCCTGCCTACAGTGAAATGTATCACCCGTCATATCTTCCCCAGGAATCACCGTATTATCCCATTTATGACCATTACCCAGTTCTAGATGGCTACCAACCTCCGCCCTATCCTTATGACTACACAGGTTATCCTCACCAGCTGCCAGATGAAGCTTATCTAAATACTTACCCAGAACAATATCCATCCGTCATCAGTCCAAGTGCAGAAGTATTGCCAGATGACTTACAATGCGTTATCCGACTATTGTCAAGCTTTTGCAGTTGGACAAAGGACAAGAGTCAACTTCATATATCGGAGCAGTGCTCACATGTTGAGAATCTGAATTCAGAAGAAGTACTTACAATCGCCCATAATTTACGAGAAAATCTGCAAGCATTTTGCAGCTATGTAGGTTTTAGGCATGCTGATTTGAGGTATGCCTGCAGCAATGTCTTGGAATTCTAA
Protein Sequence
MLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKKIQEELKLEKSQEKTLRETMKAKNTPEATFNYENHKINMLELDIINDIQSLVHKKDMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQAYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKEIQEELKLEKSQEKTLRETMKAKNTPEVTFNYENHKINMLELDIINDIQSLVHKKDMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKEIQEELKLEKSQEKTLRETMKAKNTPEATFNYENHKINMLELDIINDIQSLVHKKDMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEQPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYENHKINMLELDIINDIQSLVHKKGMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYENHKINMLELAIINDIQSLVHKKGMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKEIQEELKLEKNQEKTLRETMKAKNTPEATFNYESHNINMLELAIINDIQSLVHKKDMPASEKRKMKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYESHKINMLELAIINDIQSLVHKKGMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYESHKINMLELAIINDIQSLVHKKDMPASEKRKMKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSYEYKEIQEELKLEKSQEKTLRETMKAKNTPEATFNYENHKINMLELAIINDIQSLVHKKGMAASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYESHKINMLELAIINDIQSLVHKKDMPASEKRKMKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKEIQEELKLEKSQEKTLRETMKAKNTPEATFNYENHKINMLELDIINDIQSLVHKKDMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSYDYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYESHKINMLELDIINDIQSLVHKKDMPASEKRKMKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSYDYKDIQEELKIEKSQEKTLRETMKAKNTPEATFNYENHKINMLELAIINDIQSLVHKKGMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKEIQEELKLEKSQEKTLRETMKAKNTPEATFNYESHKINMLELAIINDIQSLVHKKDMPASEKRKMKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYESHKINMLELAIINDIQSLVHKKDMPASEKRKMKNVVEKKKDELKKLYERFNVQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSYEYKEIQEELKLEKSQEKTLRETMKAKNNPEATFNYENHKINMLELAIINDIQSLVHKKGMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYESHKINMLELAIINDIQSLVHKKDMPASEKRKMKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKEIQEELKLEKSQEKTLRETMKAKNTPEATFNYENHKINMLELDIINDIQSLVHKKDMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKEIQEELKLEKSQEKTLRETTKAKNTPEATFNYENHKINMLELDIINDIQSLVHKKDMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLKHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSYDYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYESHKINMLELDIINDIQSLVHKKDMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQLKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSYDYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYESHKINMLELDIINDIQSLVHKKDMPASEKRKMKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSYDYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYESHKINMLELDIINDIQSLVHKKDMPASEKRKMKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKEIQEELKLEKSQEKTLRETMKAKNTPEATFNYENHKINMLELDIINDIQSLVHKKDMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSYDYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYESHKINMLELDIINDIQSLVHKKDMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQLKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSYDYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYESHKINMLELDIINDIQSLVHKKDMPASEKRKMKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSYDYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYESHKINMLELDIINDIQSLVHKKDMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSYDYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYESHKINMLELDIINDIQSLVHKKDMPASEKRKMKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSYDYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYESHKINMLELDIINDIQSLVHKKDMPASEKRKMKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSYDYKDIQEELKLEKSQEKTLRETMKAKNTPEATFNYENHKINMLELDIINDIQSLVHKKDMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVCDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKEIQEELKLEKSQEKTLRETMKAKNTPEATFNYENHKINMLELDIINDIQSLVHKKDMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVCDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKEIQEELKLEKSQEKTLRETMKAKNTPEATFNYENHKINMLELDIINDIQSLVHKKDMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEYPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKEIQEELKLEKSQEKTLRETMKAKNTPEATFNYENHKINMLELDIINDIQSLVHKKDMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKEIQEELKLEKSQEKTLRETMKAKNTPEATFNYESHKINMLELDIINDIQSLVHKKDMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPEKVIENPTTIEEAQKMLAAEVRQVYDIEGPMIITTKVSDNATHHLNKLEVRGMERPAAYFEVEDESKIQVLPSSEYKEIQEELKLEKSQEKTLRETMKAKNTPEATFNYENHKINMLELDIINDIQSLVHKKDMPASEKRKIKNVVEKKKDELKKLYERFNEQFPRNTLEEPRWKQVKVFAEKLEHPKKVIENPTTIEEFLENLIKAFRRIYDIKESGVISKQFTENATHELIDIEHGGTDRHGMHIQVVQSVLYPRNYSELQKEFKSVKTRDETLLDEKKEKNNEVASFNYENHKLTVLEIDILKDLQSLVQANKTPASEKSKIKNLIEKKMKELREAHSNFKEKFPRDRSSHAILKRVEFLVKEAKSYEFWEVEHLKTEDPVETVEGSSLNSLDKVKEMFEKEVRKVFQIQEPLDILVETTRNCTHSVTTFKTQGVERQAIDFQVLPAATIRRMSLAEYRKLQKQYENEKSQESLLLEKRKENRSIEAYFNYEMHRVSMMELDIITIVQSIIHETNMSTSQMTEIKREVEQKSTDLRNLYARFYEVFPRNKLQNPRWRLIMTLVESFVDSETTTAEPNNLVDVTMNNGVTISKQVVQISEKPSIKTIEDAKQFLEDEVRKIYQISGPMIVTEKHIENRTHLVVECEIRGLERLAKELTVKPHSRIRCIDGSEYTGMLEQFEAEKAREKELKFNMINNRSEDATFEYQRHVLSILEFDILVNIQSIAHNKDMSATGKDKIEKYVEQKLRELSELYEKFYKRFPRSNLKKPKWRQIKVFIWLKPTTDYEVTSETSKVTDLETATGGENIVSEQPPIDQPALEIPQQGPMFSPEQQPESTMVTEPSTEAPEKQPESTTVKELSTEAPEKQPESTTVKEPSTEAPEKQPESTTVKEPSTEAPEKQPESTTVKEPTPEAPEKQPESTTKELSTEAPEKQPESTTVKEPSTEAPEKQPQSTTVKEPSTEAPEKQPESTTVTEPSTEAPEKQPQSATVTKPSTEAPEKQPESTTVKEPTPEAPEKQPESTTVKEPSTEAPEKQPESTTVKEPSTEAPEKQPESTTVKEPSTEAPEKQPESTTAKEPSTEAPEKQPESTTVKEPSTEAPEKQPESTTVKEPSTEAPEKQPQSATVTKPSTEAPEKQPESTTVKEPTPEAPEKQPESTTVKEPSTEAPEKQPESTTVKEPSTEAPEKQPESTTVKEPSTEAPEKQPESTTVKEPSTEAPEKQPESTTVKEPSTEAPEKQPESTTVKEPSTEAPEKQPESTTVKEPFTEAPQKQPESTTVKEPSTEAPEKQPQSATVTKPSTEAPEKQPESTTVKEPTPEAPEKQPESTTVKEPSTEAPEKQPESTTVKEPSTEAPEKQPESTTVKEPSTEAPEKQPESTTVKEPSTEAPEKQPESTTVKEPSTEAPEKQPESTTVKEPSTEAPEKQPQSATVTKPSTEAPEKQPESTTITEQPTAPPKQQSESASITEPSTEAPEKLPESATITEPSTAPPKQQSESTSITKPSTEAPEKQPESTTITEPSTAPPKQQSESASITEPFTEAPEKQPESTTITEPSTDAPDSTPSRDCEGSAIACVSLTPDSDALISPVIDLNGYGNVENVDQYPPIYDGSPYWPPQYPDEYPEMYHPESPQYAVDYPAYGEMYHPAPPQYPEQYPAYSEMYHPSYLPQESPYYPIYDHYPVLDGYQPPPYPYDYTGYPHQLPDEAYLNTYPEQYPSVISPSAEVLPDDLQCVIRLLSSFCSWTKDKSQLHISEQCSHVENLNSEEVLTIAHNLRENLQAFCSYVGFRHADLRYACSNVLEF

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-