Pven007150.1
Basic Information
- Insect
- Pachypsylla venusta
- Gene Symbol
- -
- Assembly
- GCA_012654025.1
- Location
- CM022880.1:25055982-25080521[-]
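The Location value above packs a contig accession, a coordinate range, and a strand into one string. A minimal sketch of splitting it into fields follows; the `Location` type and `parse_location` helper are hypothetical names introduced here for illustration, and treating the coordinates as 1-based and inclusive is an assumption that the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'contig:start-end[strand]' string."""
    contig: str
    start: int
    end: int
    strand: str


# Layout inferred from the record: accession, colon, start-end, strand in brackets.
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        contig=match["contig"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("CM022880.1:25055982-25080521[-]")
# Genomic span, ASSUMING 1-based inclusive coordinates (not stated in the record).
span = loc.end - loc.start + 1
```

Under that assumption the gene spans 24,540 bp on the minus strand of CM022880.1.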
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines co-ordinate a zinc ion. The zinc finger is described by the following pattern: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and the numbers give the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
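| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 24 | 0.17 | 16 | 6.4 | 0.6 | 2 | 22 | 1383 | 1403 | 1382 | 1405 | 0.88 |
| 2 | 24 | 0.64 | 60 | 4.6 | 0.3 | 1 | 22 | 1410 | 1431 | 1410 | 1434 | 0.88 |
| 3 | 24 | 0.44 | 41 | 5.1 | 1.6 | 2 | 19 | 6067 | 6084 | 6066 | 6089 | 0.93 |
| 4 | 24 | 0.076 | 7.1 | 7.5 | 1.2 | 3 | 23 | 6097 | 6118 | 6095 | 6118 | 0.94 |
| 5 | 24 | 0.26 | 24 | 5.8 | 0.6 | 3 | 21 | 6138 | 6156 | 6136 | 6157 | 0.92 |
| 6 | 24 | 0.17 | 16 | 6.4 | 0.5 | 2 | 23 | 6165 | 6187 | 6164 | 6187 | 0.94 |
| 7 | 24 | 2.9 | 2.7e+02 | 2.5 | 0.1 | 1 | 20 | 6335 | 6354 | 6335 | 6354 | 0.96 |
| 8 | 24 | 0.075 | 7 | 7.5 | 2.1 | 2 | 19 | 6363 | 6380 | 6362 | 6383 | 0.92 |
| 9 | 24 | 0.011 | 0.98 | 10.2 | 2.7 | 2 | 23 | 6417 | 6438 | 6416 | 6439 | 0.92 |
| 10 | 24 | 0.0011 | 0.1 | 13.3 | 1.9 | 1 | 23 | 6516 | 6539 | 6516 | 6539 | 0.95 |
| 11 | 24 | 0.002 | 0.19 | 12.5 | 0.3 | 2 | 23 | 6546 | 6568 | 6545 | 6568 | 0.93 |
| 12 | 24 | 0.00092 | 0.086 | 13.5 | 1.8 | 2 | 23 | 6573 | 6595 | 6572 | 6595 | 0.95 |
| 13 | 24 | 0.0016 | 0.15 | 12.8 | 0.5 | 1 | 23 | 6862 | 6885 | 6862 | 6885 | 0.94 |
| 14 | 24 | 0.035 | 3.3 | 8.5 | 3.7 | 2 | 23 | 6933 | 6955 | 6932 | 6955 | 0.96 |
| 15 | 24 | 0.0022 | 0.21 | 12.3 | 0.9 | 1 | 23 | 7142 | 7165 | 7142 | 7165 | 0.95 |
| 16 | 24 | 0.039 | 3.6 | 8.4 | 2.0 | 2 | 23 | 7187 | 7209 | 7186 | 7209 | 0.94 |
| 17 | 24 | 0.47 | 44 | 5.0 | 1.5 | 2 | 23 | 7232 | 7254 | 7231 | 7254 | 0.91 |
| 18 | 24 | 0.0046 | 0.43 | 11.3 | 0.1 | 1 | 23 | 7509 | 7532 | 7509 | 7532 | 0.97 |
| 19 | 24 | 0.0071 | 0.66 | 10.7 | 1.2 | 2 | 23 | 7545 | 7567 | 7544 | 7567 | 0.94 |
| 20 | 24 | 0.0022 | 0.21 | 12.3 | 1.3 | 5 | 23 | 7575 | 7594 | 7572 | 7594 | 0.91 |
| 21 | 24 | 0.0076 | 0.71 | 10.6 | 0.2 | 1 | 23 | 7598 | 7621 | 7598 | 7621 | 0.95 |
| 22 | 24 | 9.9 | 9.3e+02 | 0.8 | 8.6 | 1 | 23 | 7644 | 7667 | 7644 | 7667 | 0.94 |
| 23 | 24 | 0.019 | 1.8 | 9.4 | 0.9 | 2 | 23 | 7714 | 7736 | 7713 | 7736 | 0.96 |
| 24 | 24 | 0.29 | 27 | 5.6 | 3.0 | 1 | 23 | 7741 | 7763 | 7741 | 7763 | 0.97 |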
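The C2H2 pattern quoted in the Description can be written as a regular expression. The sketch below is only an illustration and is not how the hits in the Hmmscan Out table were produced (those come from a profile HMM search against PF00096). It assumes that the fold-critical `#` positions may be any residue, which is looser than the biology, and `find_c2h2` is a hypothetical helper name.

```python
import re

# Direct transcription of #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C].
# ASSUMPTION: '#' (fold-critical) positions are matched as any residue.
C2H2_PATTERN = re.compile(
    r"."          # '#'
    r"."          # X
    r"C.{1,5}C"   # C-X(1-5)-C
    r".{3}"       # X3
    r"."          # '#'
    r".{5}"       # X5
    r"."          # '#'
    r".{2}"       # X2
    r"H.{3,6}"    # H-X(3-6)
    r"[HC]"       # final His or Cys
)


def find_c2h2(protein: str):
    """Return (start, end, motif) for non-overlapping matches, 1-based inclusive."""
    return [
        (m.start() + 1, m.end(), m.group())
        for m in C2H2_PATTERN.finditer(protein.upper())
    ]


# A fragment taken from the Protein Sequence of this record.
fragment = "YKCDTCRILFDNIKLLEDHIPVC"
hits = find_c2h2(fragment)
```

For this fragment the whole 23-residue string matches, ending in Cys rather than His. A plain pattern match like this is far less sensitive and less specific than the profile HMM search, so its hits should not be expected to coincide with the 24 domains listed in the table.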
Sequence Information
- Coding Sequence
- ATGAAGAAAGGTGTTGGGGATACAAATCCAACATCACCCCATCTGACCAACATCATTCAAGATAATGTCCAGTCTTCACTCTCgccaaaaatatatcaaaactcAACACCTCAGCATGTTCTGCATACATCACCCCATGGTGTTTCTCCCATGCACCAAGAGTTTTCTGGATTGCAACAAGTCAGACATATTTACCCAGAGATAAGTATCATTAGGGAACAAACAAATCTTCCTTCGAGTAGGGGTAAACCTTTAATACACCGAACGAAACCAAAGTCAGGTGTCGTTGATTCGAATCCAAATATTCATGAACAAATGAACGCAAACGTATCATCAATTCGGAATGATTCAATTCGCGCAATGAACAAAGACGTCAGAAGTATCCCTCAATCCAGTGTTACTGTAAATCAAACCAATGCTACCCCAAATATGTTAGGTCAAGATAGAAATGATCCTCATTTAGGAAACTCATTTTCAGGTGGTAAAAGTAAACTACCTGAATTCAGTCAAATTAAACAAAGGAAAGTTTCGCCTTCCTTGTCGTCAACCCCTCAACATGATACGAACTATCCAGGTGGGGATTCCGTCATCCACCTTCTGGATAACAATATCATGGCTGAACAGAATctcattgaatttgaaaagtCTTTCAACGATGTTATTAAAGGTAACTTAAACATTTTAGAGTCCTCCATTATTGCAAGTGATGGTCAGGAGTATACTAGTTCTAATTTTTATAGGAATACTTCCCAGTGTAACGTTGACCCCAACTTGCAACGTAGTCCACATTTGAGTAGTCCCCATGTTCATAACAGTCCTCATTTACAAAATAGTCCTTTACAGAGTAGTCCACATTTAATTGTGGAAAGCTCCTCTAGCTTTAACAGTCCTGATAGTCACCAGTCTCTCACCAGTTCGGATTACTATCCAAGTAAAGATAGTGACAGTTTGGAATCAGGTTCGGTGATAACTAATTCAATATCTGGAATGATGTTAGGACATGAAGATAGGTCAAAACACATAGAACCAGGCCAAATACGAAATCGTCTTTTCGAAGAAATTATGAATGAAAAGATTCATATTGAAACAAACATTGCAGCTATGAGCTTCACTCCTTTAGAAGAGTGTTTACTTCAAGGCACATTTAATTTGCCTACAGACATTCCCTCCGCTTTAGAGAAAAGTATATCTCTGGATATACAGAGGGAAACTATGAAAGCAAATACTGAGTATGATGATAAAAAGATAGGATCAAAGAATTCTGCAGAGTCTTCCTTAGAAAAtaaccagaaaaaaataattcaagtaAGAAGTGATTTATTTCATCacaataaaacaacaaaaacagataGTTTACCatcaaaaactcaaaataaagtaAAGTTGGTAAGATCTCAAACGCCTCAAAATATTGCTACAAAAAACAAAGGTCAAAATATTAATGttgataaaaattcaaatacaattcAAAATACAACTTCTCTTTTAGcgtccaaaaaatcaaatagaaaattaaacaaCCTGAATCTTAACCAACCTAACAAAATGTTAGAACTGATCAAATCAGcatctaatataaaaataagatcGAATGCAAAGACACAAAGTATCAATAAACCCGTCCCCAGCTATAAAAAAATCCCTcccaatatacctaataattcTGGTAATCGAAGTGTAGCTAGTACACCTTTGAAAGTCAAAGAACAAGCCACCCCCCTAATGGGAACCATTCAAAGTGTAAAAAACAGTAGCTTAAATATAATTGGAAGTTCTACGACACCTTCTCCACAGCCAAACCTACATAGGAAAGAACATGGTAAAATTGAGGGTAAAGAATGTAccataaaaaatgttaaacaaATATCAACTAATCAAGAAAAATCAACCATTAACTATTCTACTCTTCAGTCAAATTCTGACTTAAAGAGGGATAATCTAAGTGGCGATAAAGGTAAAACAGTGAATGAGGTAATATCATTCAAAAATGtgatcatt
aaaaataattcttataatAAAGTTGATAGAAATACAACTGAtatcaatattaataaaaatattcaaacaaaggAAGTTAATAGTACTGTGATTAGCCAAAATGTTCATAAGGGTGAACAAGAATGTAGTAGAATCTGCCAAACTAATAGTCTGCCAGTCAAAACCCTTAGTTACTGTGATAAACCTGAAGTAATTAGTAGACTAAAGCTGACAGGTAAGACCGCAGACCCGTACTCATTTGATTCCGAATCAGATAAATTACCACAACGTAAAGTTTCAGATGATATTTCTGATGACCAAACAATTGTTACATGTCCTaaacaacaaaatattgaacaagacaaaaataaaaatataaatgatcaGGAAACATCtggtttcaaaataaaagtaattaatAAAACCGATTTAAGTTTGATTTTACACAAGGAAGATACTGCATGTTTGAATGAAGAGAGATCTGCGAAAAAGTCCTTAGAGAAAATGAAGGAAAGTAAACAACAAAACACTATTGATCAACTAGCGAGCCCAGAAGAAATATTGAGGCAAAGTGTATCATCCAACCCACCCGTAGAATCTCATTCAGAAACCCGTGATTTATCTTCAACTAAAAAACATTTGTCATACAAAGAAGATAGTCCTAGTTTAATGACGTGCTCAATGGGAGGAAATCCAACTCGCCTGAAATTGAAATATGTGAAAAATTCTACTTCGTCTGAATCTAATTCTGATAAAGGCTCAGATAGTCCCAATATGGATAGTAGTGAAAGAATGAAAACTTTAAGAATAAAAATTCCGTCCTATTCTGTAGAAAAAGATCACTCTCCGAAGCGTTCAAATAAAAGTAGTCCATTACCCAAAGATGTGAAATCTGACCATAGAGttcaagaaaaattaattataaatttaaaaacaaatcaaataacaagaacaaaagAGGACGTTATCAATAATAATGACTCTGAAGACAATCTACCCACTGCAATAGATTCAAATAACACATCAATCATCCAAGGTCCTAATTTAGCTTTATCTCCTaatgttttaaaagaaaaacgaaaaagttttCGTAAAATCGATGAAATATGTCAAAATTTACGGCGAAAAAGTCACGAAAAAGAAGGAACAAATATTGACAATGTATCAAATGATTGTGCCAAAGAAGGATTGGAAATTGAAAAACTAGTTGACAGTTCCATTGATAGTGGTGATATTACTCaaagaaaagtgaataatttaAATACCGAAAAGTTGtcagatttaaaaataagtgATAGTGTAAATAGAGAGAAAGGAATAGAGAAATCGAAAAGTTTTGTTGTTAAATTTAGAAGAATGTCAAGAagtggtgaaaatattattgtcaaaCAAATTGTCGACAGAAAGAATGTAACGGATGAAACTGACGATGAATACCAAACAGAAAGGGAAGACAATACAGACACAGAAAATGACTTGGAAACAGGAAATCTAAATACGGCGATTGTGCGTCATGCAGAGAACGATAATAGTCATAATACGAAAACagattttaaacttaaaataagaaTTGGTAGTGAAATAATGTCAGAACAAAacgtaaataaaaagaaagataagaagaaaaagaaatcgaaaaaaCACAAAGATAAATCTAAAAAATCGCGTCACAGATCTGGAGTTTCTGAAGAAATCGACCAAGCGCATGCTAGTAAATCTGACGATCACGTATTTATCAAACCTCTAatattgaaaaggaaaaatgacgtttcactttttactttatcacaAAATGACACAAATTCCTCCAAAGATAATAATACTGAATCTgaagaaaatacaaatttaagtgACAGATTATGTTTATTGAAGCAGAAAGATAGTTCTTCTGATGAGATGAATGTAAGCAATGAAATAACAGAAGAACTAGAATTGAAGCAACCCATTCCCGACCGATTGAAGGaagaaagtttaaaaatttGTAGCAAGAGTAATAAGAGAAAATCAAAACAGTCTGTACA
AGAACACCATGAAAGTGAGCCATCTTCTCATTTACTTGGTCAGATAGTTTCATCTGAAAACACTTCATTGAAAGAGCATAAAAATGATATCAAATCAGAGTGTGATGCGTATGCAGAAAAAATTTCCTCACTTTTGACGGAAACACAAGTTTGTGAAAGTTGTTCCAAGTCTTTTGAAAACGAATTTGAGTACAATAAACATAAGGTTTTAATTCACAATTATAATGCATTTTTATGTCATATATGTTATGTATCGTTTGTAGAGAAATGGAAATTAGATATACATTTAATGTCTGCAGAACATATGGCAATGATAAATAGTGGATCCGATCAAGTTACAGATGTTATAACTACATCTCGTACAACCAGacgaagtaaaaataatttaggaAATGAAACATTGTCACCAAAacatgaaaacaaaaagaaaagcaataaaaccaaaattaaagaagaagataaatctAACATTGACGTTGCTAGTTCTAGTCTCACAGTCTCTACAGATTCTAAAAACCTCAAAGATGATAAGGACAAtgttaaaaatcaaatcttaCAATCGATGGACAGTAACATAGTTGAATCTAAAGAATCAACGGGACAGATGGActtaaaaacaatcaaaaactCGGAAATGTCTTTGAGTGCTGAAACaagtcttttaaaaaataacacaaatgtATGCAATGAAGATAATTTTACTTTGAACAGTGAAAATCAAGCACTTGTTAATACCCAttcaaaagataattttgatgaaatcgaaAGGCGTGATGATTTAAAATCtccaaacaaagaaaaaaatacaagtaaggaaattaacaaaacaaaagaaagaaaaaatgattttgaaaaatatttggctATTGAAGATTCCTGTTCAAAAGATAGTGACGCCAGTCAAAGTGTACCTGCGTTGAACACTGTAATATCTTCTAGTCTCGACTCTAAAAGTGCAATAAATAATGATCAGATATCTGAACACGAAAGTGGAATATCAACTTGTTCAAGTACCTCGGAAATAGATTGTAGCAGCTTTTCACAATCTGATAATCTATTAAGTCCAGATTCAGTAGGGGAAAAAAGTAAAGCAACAGAATTACAACATCCGAAttccttattaattaaaaacagAGTTATTGAAAGtgtaactgaaaatattttaccagaACAAAACACCAGCTTAATTGTAGATTTAGAAGAAAACATACCTTCTTGTATAATACGaaacagaaaattaaataaaaaaacaaatgaaaaagtccAATTAGACAGCGGAAGAAAAGCTGCCTATAAAGATGAATTAAAGACTGAAATTATCAAATCAAAGAATGAAGAGCAgaaattaattgatttaaatGTTACTACCCTCAGTAAAACTACTCGACCTGTACCAAAGCTAAGACCATTACCTGGTTTGATAAAAATCACTCCTGAAGTTTTACCAATCATTCCAAATCTACAAATCGATGGTTTGGATAGAATAAGAAAAATTCAGCAAACAGTACATTCTAAAACAATTGACATTCTTGAAATTGGAAAGGTTCAAGAATTTCAAACCACAGATAAAGAAATAGTCGttccaaaaaaatcattgaaaaagaaaagagattGCTGTGAAAGTGAATCCGATAAGAACGAAAAGGAAATTATGAACTTTAAGGCGTATCCTCAGAAAAACGATCAAGTTTGTCATGCAGATGGTAATTTAAATGAACATATAAACTCTGATGATGGGAATAATGAGAGGATTCATGAAGTAACTGAAACATGTGATAAAAAAGTctcgaaaaataaacaaaaagtaaatggTGATGTGGAAATCAATATTATAGGCAAAGACGAAATTGCAGACAGACCTTGTCGagttttaaggaaaaaaaatattcctcctAAAAGTGAAACTATTGTAAAAGAAGTTACGgaaaatttaattgaacccaTTACAGttataaatgaaaaagtaagtttgctttttgataaaatagaaaCAGAAAATA
CATCTGAAGACTTGAAAGTTAAAGGGGACGTTCTGATCAATGAATGCACACGCCTCAGAAGAAGTACTTATAAAAAATCTAGTGTTTTATATGAAAGACGAAAAATCCCTAGAAGTACATCcaagaataagaagaagaatgtaAATTATGATGAAAATATAGTAGACAAAatactagaaaataataatgaccattCAGAGCTCATTGACGAAATAAATATTCCTGGCGCCCGTATAAGTACCGAAAGTGAGATAAATGAACTTTGTATAAATCCTGAAagtctaaataaaaatattaaatccaaaGAAACATTATCAGTTTGTAATAAAATCACCACATGTGAGGAAAAAATCAGTAGCAGCAgtcaaacaaatgatgagagTAGTACAACAGTTATCAATAGTGAGACAATGTCTGAGACAGAAACTGAAACTCTTAATCAAAAAAGATCTGCTGATAGTGTGAGTAAACTGAAAGAAACAAATGAATCTTTTGACGAAGATGATATTCCGTTAGACATCAGACAAATTTCAAAAGCGAAAAgtcatgaaaatttaatttcttctcaAACTGAAATTGATTTCAAAGAGAACGACGAAGATGATATACCCttagaaataagaaaaactaaTTTATCTAAAAGTGTGGAAAATTTGAACTCTTTAGAAACTGTATGTTCAAGTAAAACTGAGCTTATAAAAACCATTAACTATATTAATAATGATGAAAATGAAGcattagcaaataaaaaaatgtgccGTGCTAAAAGCCACGAAAATGTTTCTTCTAAAACTCATtcagaaatcgaaaaaaaatcttcgacgGTTTTCCTATCGAATAAAACTCcaattgaacaaaatattcgATTAAGAAAACCAATTTCTATGGATAAGAATTCAGTTGatgacaaaaatatgaaagattgTTCTACCACCAGATCtttgatagaaaataaaaaaaatcgtaaaataaaaactcaCGAATGTTTGTCTTCTCTTAAAGATGCTGACAAGTTAACAGATATAAAAGGTGTTATTCAAGAGGATTTGTCatcagaaagaagaaaaagtctTAGGGTTAAAACTCGCGAGCATATATCGTCCGCTGAAATCTCAAATAAAACTACATTAGAAGAAGTATGTCTCACAGATAAAAAGACGTCAAAAAAACAGAAAACTATCCAACAGACTGATGAGACTAATATGTTGGAGAAAGTTAGTGTGCCGTCCGAAGTAAGAAAGAGTCTAAGAATTAAAGTTATTGACCAAACTGTTTCTAACTTAGAAAATAGTCCACAGAAGGCTCTTCCAGGTATTTTAAATGAAGCGACGTTCAACTCAAGAACTCGTCTTAGAGGAAAAAGCCAAGATAATTTTTCTTCATCCTTGGTATCATTGGAATTTCCTAATCCAAactcaaatcaaattttagaaaaggaTGGAACCAAAATTTCGAATCAAGACTTAAttacaaaagagaatggaagtCATATTTCAATAGTTCAAGTTTTAGAGCATGAAACTATCGAAGtgaacactgaaaaaaaatatgacatcaATTGCCGCGATCAAGAAAATGTTTCAATAGATATTGAGGACAAAATGTCATCTTTAGTGACAGAAAAGAGCGAACGGCATTTAGAAGCTAAAGTTGCCACCGTGGAAACATCTATACAAGACACACAAAAAGATCAGATTCttattgaaaattctaaaacaGCGCATTGGAATGAGTTAAATCTATCTTCTCCaactaaaaatttcaaaaaaagagaaaggaataaaaaatatcttgttaaAAAGCAGCTTGAAGAACTCTCTCAAATTGCAGCCCAAAATTCtttacttgaaaataataaaaaagagttgattgaaaaacaaaatagtgTACCATgtgatatcaaaaaaaatagATCTAGAAGAAAAACTCTTGAAGAATGCACAAACCATAAAAATGAAGATGATACCATTCCTGAATCCCCTAATGAAACAGTCAATATC
ACAGAAAATAATTTGCCAATGAGAAAAAGTctgagaaatagaaaaattaaaaagtttcctgatgaagaaattgaaatatttaatggGAAAAGACggaatcaaaagaaaatcaataataataataaagttactCCTTTGAATACAATGTTAGACTTTGAGAAAGTCGAAAATGAACCAAAGTTAACTAGAAATAGAAGTCATGATTCTTTACGTTCCATAGATAACCCTCATGTTGAACATGAGAAAGATAGACATGCTGTCTCTAAAACTATGGAAGTTGAACACAATCAGGTTTCTACTTTGaatgaaacaatgaaaaataaaaaatatgcaaacAAATCCGACGACTTAACTGATGATCTGACAAATAAATCACAAATCGATCACAAACGTAACATAAAACATGTAAATAACGACTTAGAAAAGGAGGGCCAACAAAGTGTTCATAACCACATTACATCGGATATCAATTTGCAATCAGATAGTATTACAATTTTTAAAGAGAGTAGTgttgtcaaaaataataaaaatggcaaAAGTAAACTAAGACTATCAGGGTCAACTGATAAATCTATTCATAAGGAAACTATAACTGAACTCACCAAATTTTTGCCATCACCAGAgaagtttgaaaaacaaaatttactgACAGATAAATTGATTGGATTGCAAGAATCCAGCCAAGCACCTTCTAAGGATTGTGTCCAAAATAAAGAGAATGATCTGGAAGAAGTTGTTGAACAGGGCGAGGAAACTTCTGAGCAATTTGACGATGACCTcgaaaacaaattaaaagaattCTTAAAAAAACCAATCCTCGATTCTGCAGTTAGTATAATCAAACCAATCGTTCATTCCAgcgaaaatatttataaagaaacgaACAAacctttcaataataaattagataaaattgaacttattaaacaaaatattgaccctaaaataacaattaaaactttGCATGCCATAGAACCCACTAAACCGGAATCTAGCCAGACTGAAACTTCTGTTGGGATGGTGGCAAAACCTATTGAAGATATAGGTAAAGAAAAAACTGGGCGTATTAAATGGAAGGCTGAAactactgaaaaaaatattgatgactTAGAATCCAACACCAATAATGCTTCAAAAGATTTTGAGCTTGAATTCATTCAAGCAAAGAGGGCAGAAGGGTCTAGTCCTATTAATCTTTTATCTCAAGGAAAATTTAGCCGTATTGAGAGTAAAGCTGAAATCGTAAACAAACTTACTGATTTGATTGTTGAAAAGGGAACTAGTCCTACTGAAGATACAACTGAATCTATTTTAGGGTTCAAAGACAAGAAAATCCATAGAAAATCAAGTCCAATGGAAGATGTTTTTGTTGACAAGGTGTCTATCCCAACTGAAACCGGAACCGAAATTTCGGTTGATCACATTGAAACTGTACGTGAAGAGGGATCCAGTCTTTTTGAACTCAAAGCTGATTTTTTAGTTGAATCGACGGAAACTATTGTTGGACTTGTGTCCAACCCAATTGATACCAAAATAGAAATAGTTGAAAACACTGAAACTACAAATAAAGCATCCAGGTCATatgaatcaaaattaaaaaattgtattgaacACACCACCACTGCAGTCGATGGTGTATCCAGTTTATCTAAAGTAATTACTGCAATTGTAGGTAAAGTTAGTGAAACTGTAGTGAACGAGGTGTCTAACCCCATCTCATTTGAATCTCAAACTGAGACTATGGTTGACATAATGACTGAATCAACTGAAATAGTTTCTGAAGTGTCCGAACAAATTGTTACAATTGATGAAGTAAAGGAAGCCGTAGATCATATAACTAAAAAGGTGTCTGAACGTATTGAAGTGTTAAAAAGCCCACAAACTATTATTAGTAAGGTGACTGAAAATGTTGACAATTTTACTGAAGTGTTAGaaacttttgaaattattgATGAGGCATCTGAGAACCATAAAGTAATTACAGAAGagtataaaattat
tgaagagGGACATGGAATTGACGAATCTGATAATGTCGGTATAATTTCTGATTTATCCAAGGAAATTGAAATTACAACTGAAAACATCTCCCAAACTATTAACACCGTCATAGAGGTGACAGGGATCGTTAATACATTTACTGAAGTAACTGAAACTATTAAACAAATGACCGAAGAAGTGTccgaatttattgaaaattgtgaTGAAATATCCAAATCCGCTgagaaagtaataaaaccaGTAGCTGAGAGAATAACGCATGCATCTGTTGAAGAATTATttgatgaaataattgaaatagtTGTTGAAAAAACTGAACAAACTGCTGAGCATATACCTGAATCAGTTGATGGTGAAATAACTGAACATTGTGCTGAAGAAATATCTGAACTATCTGCTGAGGAAATTCATGAATCAGCAGTTGAAGATATATACGAACTATCAGCTGAGGAAATACATGAATCAGCAGTTGAAGAAATATCTGAAAAATCTACTGTGGAAATAACTGAACTATCTGCTGAAAAAATACATGAATCAGCAGTTGAAGCAAAATCCGAACAATCAGCTGGGGAAATACATGAATCAGTAGTTGTAGAAATATCTGAAAAATCTACCGAGGAAATAACTAAACCAGCAGctgaagaaataaaagaacTAACTGCTGAAGAAATACATGAACAAGCAGTTAAAGATATATCTGAACAATCAGCTGAGGAAATACATGAATCAGCAGTTGTAGAAATATCTGAACAATCTGCTGAAAAAATACATGAATCAGCAGTTGAAGCAATATCCGAACAATCAGCTGAGGAAATACATGAAACAGCAGTTGTAGAAATATCTGAAAAATCTACGGAGGAAATAACGGAACCAGCAGCTGAACAAATAAATGAACTATCTGCTGAAGAAATGCATGGACCAGCAGTTGAAGAAATGTCTGAACAATCTGCTGAAAGAATACATGAATCAGCAGTTGAAAATATATCCGAACAATCAGCTGAGAAAATACATGAATCAGCTGTTCtagaaatatatgaaaaatctTCTGTGGAAATAACTGAACTATCTGCTGAAAAAATACGTGAATCAGCAGTTGAAGCCATATCCGAACAGTCAACTGAGGAAATACATGAATCAGTAGTTGTAGAAATATCTGAAAAATCTACCGAGGAAATAACTAAACCAGCAGCTGAACAAATAAGTGAACTAACTGCTGAAGAAATACATGAACAAGCAGTTGAAGAAATATCTGAACAATCTATTGAGAAAATACATGAATCAGCAGTTGAAGATATATCCGAACAATCAGCTGAGGAAATACATGAATCATTAGTTGAAGAAATATCTGAAAAATCTACCGAGGAAATAACTAAATCAGCAGCTGAACAAATTAATCAACTAACTGCTGAAGAAATACATGAACAAGCAGTTGAAGAAATGTCTGAACAATCTGCTGAAGAAATACATGAATCAGCAGTTGAAGATATATCTGACCAATCAGCTGAGGAAATACATGAATCAGCAGTTGTAGAAATATCTGAACAATCAGCTGAGAAAATGCATGAATCAGCAGTTGAAGATATATACGAACAATCAGCTGAGGAAATACATGAATCAGCAGTTGAAGAAATATCTGAAAAATCTACTGTGGAAATAACTGAactattttctgaaaaaatacaTGAATCAGCAGTTGAAGCAATATCAGAACAATCAGCTGAGGAAATACATGAATCAGCAGTTGTAGAAATATCTGAAAAATGTACCGAGGAAATAACTGAACCAGCAGCTGAACAAATAAATGAACTACCTGCTGAAGAAATACATGCACTAGCAGTTGAAGAAATGTCTGAACAATCTGCTGAAAGAATACATGAATCAGCAGTTGAAGATATAACGGAACAATCAGCTGAGGAATTACAAGAATCAGCAGTTGAAAAAATATCCGAAAAATCTACTGTGGAAATAACTGAACTATCTTCTGAAGAAATACATGAAT
CAGCAGTGGAAGCAATATCAGAACAATCAGCTGAGGAAATACATGAATCAGCAGTTGtagaaatatatgaaaaatctACCGAGGAAATAACTGTACCAGCAGTTGACAAAATTACTGAACTATCCGCTGAAAAAATACATGAATCAGCAGTTGAAGAAATATCTGAACAATCTGTTGAAAAAATACATGAATTAGCAgttgaagaaatatttgaacaagCTGTTGAAGAAATGCATGAATCAGCTGTTGAAGCAATATACGAAACACCTGCTGAGGAAATAACTGAACCAGCAATTGAAGAAATATCTGAACAATCTGCTGGGGAAATAACTGAACAACATGTTGAGAAAATAACAGAATCAGCATCTGAAGATATGTCTGCACAATCTTCTGAGGCAATAACTGAATCAGCAGTAGAAAAAATGTCTGAACAACCCTCCGAGCAAACAACTAAACCAGCAGTTGAAGAAATGTCTGAACAATCTGCTGCAGAAATAACAGAACAACCTTCTGAGCAAATAACTGAGCCAGCAGTTGAGGAAATGTCTGAACAATTTGCTTCAGAAATAACTGAACAACCTTCTGATGAAATAACTGAACACGTAGTTGGAGGAATATATGAGGCAACAGTTGGAGAAATAAGTGAACCAATAGTTGAAGGAATAGCTGAACTAACTGCTGAGAAACTAACTGAACCAATAGTAGAAGAAATATTTGAACTATCAATTGAAGAAATGCATCAATCAGCAATTGAAGCAATACCCGAAAAACCTGCTGAGGAAATAACTGTATCAGCAGTTGAAGAAATATCTAAACAACCTGCTAAGCATATAACTGAATCATCAATTGAAGAAATAACTGAACTATCTGCTGGAGAAATACATGAACCAACAGTTGAAGAATCAGCTGCAGAAATAACAGAACAACCTGCTGCAGAAATAACTAAATCAGCAGTTGAAGAAATTTTTAAACAATCTGCTGAGGAAATAACTGCACAAGCATTTGAAGGAAGGTCTGAACAAACTGTTGCGGAAATAACTGAATCAGCAGTTGAAGATATGTCTGAAAAATCTGCTGAAACAATAATTGAATCAgcagttgaaaaaatatttcaacaaacTTCTGTAGAAAAAACTGAATTATCAGTTGAAGAAATATCTGAAAAAGCTGAGAAAATAACAGAACAACCTGCTACAGGAATAACTAAATCAGCAGGggaagaattttttaaacaCTCTGCTGAGGAAAAAACTGAATCTACAGTAGTAGAAATGTCTGAGGAAATAACTGTACTAGCAGATGAAAAACTACAGAAAAGATCTGAATCTGTTGCGGAAATAACTGCTAAAGGAATAACTGAATCGGCTGTTGAAGAAATTTCTGAAccatttgatgaaattgaatCAGCAGTTGAAGAAATGTCTGAACTATATGCTGAGGAAATAACTGAACCAGTAGTTGAAGGAATGTATGTAGCAGTTGGAGAAATAACTGAGCCAATAGTTGATGGAATATCTGAAACAACTGCTGAGAAAATTGAACCAATAGATGAAGGAATATCTGAAACAACTGCTGAGAAAATAACTGAACCAATAgttgaagaaatatttgaacCACCAATTGAAGAAATTCATGAATTAGCTGAACAACCTGCTGAGCATATAACTGTATTAGCAGTTGAAGGACTAACTGAACCAACTACTGAGGAAATAAGTAAATCAATAGTTGAAGAAATATCTAAAAAATCTGCTGAGGAAATCACCGAATCTGTTGTTGACGCAATATACGAAAAACATCCTGGGAAAATAACAGAAACAGCAATTGAAGAAATACATTATTCAGCAGTTGAAGAAATATCTGTACATTCTGCTGAGGAAATAACTGTATCAGCGGTTGAAGAAATAGCTTTACAATCTACTGAGGTAATAACTGAACCAGCATTTGATAAAGTATCTCAACAATCTGCTGAGGTAATAACTGTATCAGCAGTTGAAAAAATA
TCGGATCAATCCactaaagaaatatttgaagaaacTCTTGGGAAAATGGCTAGACCAGCAATTGAAGAAATACTCGAACAATCTGCAAAGGAAATATCTGAACCAACTGTTGGGGAAAGAGCTAAACCATCAGTTGAAGAAATGCCTGAACAATCTCCTGAGGAAATATCTGTAGGGGAAATAGTTAAACCGGCAGTTGAAGAAATACCTGAACAATCTTCTGAGGAAACTGTTGGGCAAATAACTGAaccaaaagttgaaaatatatatgaacTAACTGCTCTGGAAATAACTGAACCAATAGTGGGAGGAATATCTGAACCAACTTCTGGGGAAATAACTAAATCACTAGTTGCTGAAATAGCCAAAAAACGTTGTGAAATAACTGAAACATACAATGAAGAAATATCTGAACCAACTGCTGGGGAAATAAGTGAATCAATAGttgaagaaatatataaaaaatctgttaaGGAAATCACCGAATCAGCAGTTGACGCAATATCCGAAAAGCATCCTGGGGAAATAACAGAAACAGCAATTGAAGAAATGCATGATTCAGCAGTTGAAGAAATATCTGAACAATCTGCTGAGGAAATAACTGTATCAGCGGTTGAAGAAATATCTATACAATCTACTGAGGTAATAACTGTATCAGcagttgaaaaaatattggatCAATCTACTAAAGGAATATTTGAAGAAACTGTTGGGAAAATTGATAAACCAGCAATTGAAGAATCTATAAAGAAAACTCAGCAACTCGAACAATCTACAAAGGAAATCTCTGAACCAACTGTTGGGGAAATTCTCAAACCAGCAGTTGAAGAAATACCTGAACAAACTACCGAGGAAACATTTGGGCAAATAACTGAaccaaaagttgaaaatataattgaacTAACTGCTCTGGAAATAACTGAACCAACAGCTGAAGGAATATCTGAACTAACTTCTGAGGATATAACTGAACCAATAGctgatgaaatatttgaaccAACTACTCAGGAAATAACTGAACCAATAGTCGGAGGAATATCTGAACCAACTGCTGAGGAAATAACTGAATCACTAGTTGCTGAAATAGCCAAAAAATGTTGTGAAATAACTGAAACATACAATGAAGAAATATCTGAAGCTGCTGATGATGGAAACAACAAATCTTCTCCTCGGGAAATAACTGGACACTGTGGTGAAATAACTAATAAACCATCTGCTGGGGTAGCAAACGAAAAAAATGGTGATGAAATAATTGAAGTGGGTGAAGTAATAATCCCACCTTTCGAAGTTAAAACTGAACCATCggattatgaaatatttgaacaagGAGATGATGAAATAACAGAAATACCTGATAATGATATAATTGAATCAGCTAATGatgaaataattgaagaaaTGTCTGAACCAACAGCTCAGGAAATAACTGAACAAACGGCTCAGGAAATAACTGAACTAATTGCTCTGGAAATAACTAAATCGGTTGATAGTGAAATAACTGAACcaatatttaaagaaatttcTGAACCAACTGCTGAGGAAATAACTGAACAAACTGCTCAGGAAATATCTGAACAATGTGATAGAATAACTAAACAATCTGCAGGAGAAATAATCGAACAAAGTGGTAATGTATTAGAAGCAGCTGATGATGAAAAAACTAAATCATCTTCGGAGGAAATAACTGAACAATGTGATGGAATAACAAAACCATCTGGAGGGGAAATACCCGAACAAAACGGcgatgaaataattgaaatagaTGATTGTGATATAATTGAACCAGCTGGTGAAATAATTATTCCACCTTTTGAAGTTAAAACTGAACCATCAGATGAAGAAATAACCGAAACATTTTGTGATGAAATAACTGATGCAGCTGATGATGAAAAAACTAGATCTTTGGATGAGGAAATAAATGAACAACGTGATGAAATAAAACAATCCAGTGGGGAAATACCTGAACAAAACGGtgatgaaataattgaaatggaTGATTT
TGATCTAATTGAAGCACCTGGTGAAGGTATAATTCCTCCTTTCGAAGTTAAAACTGAACCCACGgatgaagaaatatttgaacaaactGGTGATGAAATAACTGAGGCAGTTgatgatgaaataattgaaTCAGCTGATGAAGATATTAAACTACACCTTGaagataaaactaaaaaatttgCTGAAGACGAAATATCTGAACTATCAGTGGAAAAAACTGAGAAATGTGATGAAATAACTAAACCATTTACAGAGGACGTAACCAAGTTAGCTGCTGATGATATAAGTGAAACAGCAGGAGAAAAAACATCTAAACCTGAAATATCTGAACCAACTGCTCAGGAAATATTTGGATCAGTTGATGAAGATAAATCTGAAACAGCAATTGAACTAACATCTGAACCACCTGCTGAGGAATTAACGGAAACATCGAATGATGATAAAATCAAACTAATTgataaaaattttaaaccaCACGATAAAGATAAAACTGAATCAGGTGGCGATAAAAGAGCTAAATCATCCCTTGACAAATCAACTGAATTAGGAGTTGAAATAATATCTGTAAAATCTGTTTCGACAATATATGAATCTACTGAAACAATTATCGAAAAATGTGATGAAATAACTGAACCATTTACAGAGGAAATAACTGAGGCAGCTCATATAAGTAATCCAGCTGATCAAGATAATAAATCAccttttaaagataaaattaaaacagccGCTGAAATGAGTGAATCAGTTGGGAATGAAGCTAAACAATTAACTGACAAAATAACTGAGACAACAGTTGAtagtgaaaaaattaaagaagctATTGAGAAAATAATTGAATCTTCTGATGAAGAAATTGGTGATATAATTGAACCAGCTCTTAATGATAAGCCTGAATTAGTTGATAgtgaaaaaattgaagaagcTACTGAGGAAATAATTAAACCGGCTGGTGCAGAAATAACAGAAGCAGCTGGTGATATAAATGAACCAGCTGTTAAAGATAAGACTGAATCAGTTGAtagtgaaaaaacaaaacaagataTTGAGGAAATAATTAAACCTTCTGGTGCAGAAATAACACAAGCAATTGGTGATATAAATGAACCAGCTGTTAAAGATAAGACTGAATCAATTGATAGTAAAAAAACGAAAGAAGCGATTGACGAAATAACTGAAACATCTGATGCAGAAGTAACAGAGGCAACTGGTGATATAAATGAACCAGCTGTTAAAGATAAGACCGAATCAGTTGAtagtgaaaaaattaaagaagctATTGAGGAAATAATTGAACCTTCTGGTGCAAAAATAATACAAGCAGCTGGTGATATAATTGAACCAGCTGTTAAAAATAAGACTGAATCAGTTGATAGTAAAAAAACGAAAGAAGCGATTGAGGGAATAACTGAAACATCTGGTGCAGAAGTAACAGAAGCAACTGGTGATATAAATGAACCAGCTGTTAAAGATAAGAATGAATCAGTCGATAGTGAAAAAACGAAACAAGATATTGAGGAAATAATTGAACCTTCTGATGCAGAAATAACAGAAGGTGATATAACTGAACCAGCTTTTAAAGATAAAACTGAATCAGTTGATAGTGATAAAACGAAAGAAGCTATTGAGGATATAACTGAACCGTCTGGTACAGAAGTAACAGAAGCAACTGGTCTTATAATTGAACCAGctgttaaatataaaattgaattagcTACAGAAGAAATATCTCAACGAGCAGTTGTAGATGTAGATAAGCCAAAATATGAAGAACTAAATGAATCCAGTATGCAATTAATAAATGAAACAAACGATGTGGATTTTACTACTACTGTTGTGAAGGAAAACCTGGACGAAAAAACTGAACCAGTCGTAGAGGAAATTATGAAGTTATCTGTTAAAACATCACCTGATATCACTGCAACGAACACATTAGAGCCAACAGACACAATTCTTCTGACGCACAATACTGAATCTTTGGAGATTGCTTCTAATCTTGATA
CATTAGTGTCAAAAGATGTCCAGACGTCTGAAGTCTCAGAGAATAAACCTTTATCTCTGAAGGAATCTTCAGAGCAAAAAGGAACATATGGTCCAATAAAAGATGGCCCAAATCCTCAAAGTGTGCCTGTACCAATGAAAAGTGATCAAACTACTTCAGATCCTAATCGGTCTCCAAAGAAGAAACCTCTCCAACAAATTCAAtgtgaaaaatgttttgaatattTCGGTAACAAAGAAAATTTACTGAAACATGATAAACATGTTCATAATACTACCCAGGCAACATTCTGTAATATTTGTAAGAGAAACTTCCGTAATGTTAATGCACTTAGAATGCATAATGctacatttcataaaaataaaaacgagaCCTCCAAAAAAGAcgctaataataaaaagaaatcaataaattgtgaaaaatgtaaaacaacTTTTAAAATTCAGAAGGATTTTGAGATGCATGTAAAATCTTGTAAGAGAAATAGTGAACTTCAATGTGATATATGTTTAGAAATATTTCGGTCCAAGTATAtcttacaaaaacataaaaataatgaccACACCGCCCGTGATCACGAAAATAGGAATATTGATGACAACAAAAGAACAAACAATGgggacaaaaaagaaaagaaaaatctaaaattagaaACAATACCTCAAAATACATCCATTACTACAAGACgatcaaaaagaaatatacaaaatgagGAAGATgacgaaaagaaaaaagatgatGTTGATAGTAATAAACTGAATGAACAAAAGCCAAACCAAACTCCTGAGgacaaatcgaataaaagaaactCCGAAGAATCCAATTCTGCAACGAATGGTGAAATACCAACTCCTAATAATGAATCTCAAGCTCAACAACATTCCCCTATTCAAAGTGGAACATCAAATCATCCGCCCACTCCAAATATAACTTCCCAACAACCGCTTACTAAAATTGAAACATCACAAAGTCCTCCTAACGAATCTCCAAGTAAAAAAGTTGAATATAAATGCGACACCTGTCGTATTCTTTTcgacaatataaaattattggaaGATCATATTCCGGTTTGTcggaatataaataaaaaatgtttgaagtgTGGGGAAACCTTTAAAACAAAACTGTTATTAAAAATGCATTCTTGTATAAAACCAAAACCTATCCAATGTAGATTTTGTCAGAAGAATATTCCGAAACACAAATTTCAAGATCATATAAAAATATGTGCACTACCAAAAGATGATTTGATTTGTAATTATTGTAAACAGTCATTTAGTGGTAAATTTTTGTTACAAAGGCATTTTGTTAGTCACCACAATATTGAAACAGGgatcagaaaaagaaaaagtcttGAAATGAACAATGATgacatcaaaaaaataaaagcagaaCATAAGGAAAGTTCTTCATCAGATGGAAAATTGATTGAAGACATAAATGAAATCAGAAAAGATTCAGTCAAATCCAAAGTTACTACACCAGTAAACGATGATTTAAAGATAACCAAAATGATTATAGACACGGATACGAATAGCAACAAAGGCTACCAATGTAAgaaatgtaatgaaaaattaagtAATCAACGCGATCTTACAAAGCACTATCTTCAAATCCATAGTAATGTTGAAGATATAAAATGCGACGAGTGCTCAAAAACATTTAGAAACCTAACTTTGCTTGATAAACATGTTATCAATGCACACAAAAATGCAATCAaatgtaatatttgtaataaagtttttaaaacactttatttataCAGTAAACATAATCGtctttttcataataataaaagtagtGAAACAAATTCTAAGCCgaatgaaaatcaaattgaaacctCTCCAGAAATTAGTGTAGATAAAGAATCTTTAATAGAAAAATGTCCTTTACCTCGTCAGTACTCTAGATTCCACAATGCGAAGAAACCAGCTGTTGAAAGTTCAGCTAGCattgaaaatgaaagtaaacCAATGTCTAATACAATTCAGTGTGTTGCT
GAAGTACATTGTAGTGGTACGCCTGATGAACCTGATGAAACTATTGTTAAGCCACCTGATCTTATAAAGCCAGGATCAAGTGAAAATAACTCTCCCAAACAAGTTCATACGAATTCCAAATCAGTCAAAGAAAATGAATCATCTATGCTTCAcgaaaatgaagaaagaaagaaaatagtttgtaaaaagtttgaaGCAAGTCCCgataaaaaattgatcactttcataaatttattgaaagaagaaaacaaaatgaatgaaAGACGAAAACTTGGCGATGCTAAAATCAGTTCCGCCACAAGTGTAATCAAAGAAGTTACTCCTAATCCAAATCAGGAATTAAAagcagaaataaataaaagctctaaactgaaaaatatagaaactcCGTCTCTAGAGAAACAACGCGATGCACAATCGAATTCTACACATATTGGTCcatttcagaataaaaaatcaagacTCCTTCCCAAAGTTGATGTCAAACTTAACTTTGttgaacaatttcaaaaaacaataccTTCACAGATCCAGAGTACACAAAAACCGCAGCATCTAGAGCCTGAGCCAACagaagttattttttataaatgtgcTCGATGTGATAAAGAATTCCCGGATAAATTAGAACTCAGACAACACATTTCTGATAAACATATTGCTTTACCCCCATTGGAAGAAGTGTTtagaaaagagaaaaagttcATTCGAAAAGTTTCTAAAGATACGTATGTTCTAGTGCAAAAAGTCGGTCAGCCAAAGGAAAAAGTTTACTATTCATTAAATATACCTGAACTTAAATGTTGTCAATGCGAAGCTAAATTTACATCTCAGAAGAGCTTAGAATTGCACACGAAAACGTCTCACATTAATACCAAATTTCCgccaattaatttaaaatcaaaatcagcATCGACCAGTAATATCAACGTTAATATTTCATCCATTCAAGGTTTGGAGTCTACAAATACTCCTGCAAAACGCAGACATTCCGAAACGGCCATTATTTCTTCCGCAACTGCTGAGCCAAGTACGCTTATTCCAAGACATCCTTCACCTATCACGATTCCTACAACATCTGAAAATATTAATACGCCTATCCAAAAAAATTCTGACGCTACCAACATGCCCACAACTTCCGAAAGTTCGAGTACGTCAATCTTATCGAATGTCTTACAGTATGGCAGTAACTTTGCAGACGGCGATCCTCTAGCGCCTGTCGTTTCCGATTTCACCCAGGATGAAAATAACTCAAGTTGTGACATTTTCAACCCTAACGATATTCTGAATTccaataattttgtcaaaactgATACCAATACCAGTGCAACGGATAGTGCAATAAACGGTCCTCTTGCCGATTTGCAGTGCTTCAGATGTGGGGTTAAGTTTAATAATGGACCTGAACGACATTACTGTACTGTATTTAAATGCGATAAGTGTTTCCAAGCATTTGTGTCTAAAGAATTGCTCAATGCCCATTATTTACTCAAGCACGACCTCTACCTAGTCCCCAAgcatgttaaaaatataaatgaattcactaatattaaaaatctgaaTTGTAAATTTTGTTCAGCCATATTCACCAAAAATAAAGATCTAAATAAACATTATAAAGAATTCCATAAATACAATCCCAGCCAAATGAAGAGTTCGGTgatcaaaaattataaagacGTATCGTCGTTCATTCACTGTACAATGTgtaatgatattttcaattctaaCTTTGAGCTTCAGAAACATTATCTGGACTATCATAACTACGATAGTTCGGTCAAAACATTTGAAATGACCCCCAGCACTAGCACTGAAAAGAGACTAGAGGAAACTAATAGTAATACCGTACAAACGATTTCCAATACAGGCCTTCCTCAGACTAATATTAAGTATAGCATTGTCTCTAAAAGAGGGAGAAAACGAAAGAGTGCACAAGATGACACGATGAAGAATATCAATAGTTtaaattcaatcaatagtgttttgaatgatGCACT
GGGATACACCGTCAGTAGTACCgtactaaataatattttacctaaTCAAAATACAATTACGAGTGACATTATTATCAAAAATCCGGCCATACCTCCTCTAACATCAACTAGTGCTGCCATATTGCCCACTGCCGTTACAGTACCCAGTTTAGTCCCTACTAATACTCAGATATTACCTGATTTAGTTTTCACAAATACAGCCACAATAACTAAGAAAGTGCCAACTAATGATGTTAATGACATTCCACGTTTAATTTCATTTAGTAATCCAATTGTAGGATCGCTTCCTAGTATCAAGCAATCACAACTTGATCCAGTACCTGAAGAAGACGTACCCTCTGAGAACATTGTAAATCAGACCAATCTCGTAAAAGACAGCCTGTCCTCAAATGATCTCGCAGATAATTCAAACACCGATCTGAATAATTTGTTTGTGTCACATCCCGGCACTTCGTCCACAGTTGATTTATCTGTGATTCAACCCGATTCTTTTCAAATGCCTTCAGACCAACTTGTCCATCCTCAGTTTTCTTGTGACTTTTGTCCATTGACCTTTGTCTCAGAGTCTGCTCTCTCATTACATTTATTAAGAGATCACAAAACACCAAGTTCAGAAGCCGGTGGCGGATTTATCAAATGTGACTTTTGTTCTTTTgagtttgaaaataaagataagCTGAAAACGCACATGCAAGACATGCACAAAGATTCTCAAATATGTATTCAATGCAATCGTAAGTTCTCAAATAAATATAACTTGAAACGCCATGTTACTGTCACCCATGATGCGAGGTATAATTGCACAATGTGTGCCGAAGTGTTCTTTAGAAAAGCAGACTACACAAAACATTTAGCCGTCGTGCATTCTGCATCAATGAGAGATAATAATATTGGTCCTACGCCAGCCAATCACCACCAAATTCCTACGGATCATATGTGTAAAGTTTGCTGCAAATACCTTTCGAGTAAATACCATTTGAAGAGACATATTGTTAAATTTCATGCAAACGTCACATCCGTTCTTAATAGTTGTAAACAATGTATGAAAGGAGCAAATGGTATCAACAGATTCTATTCTCATCTCGTATATGAACATAAGGTCAAGTCAAGTATTGTAGAAGGCATGTCTTCAGCCGCAGTCGTCTGTAATGAATGTAGAAGAATATTTGTCAATAAGAAAATGATtaagatacatttgaaaaagaaacatgATTTTAAACCTCATAGATGTGCCATTTGCTTTAGAGGCTATgcctttaaaaaacatattctaAGACATATGGAAATACATGACAGTCTTAAAGACTTTCTTTCGTTTTCTGCACCCCAAGACGATATGAGTGAATGGGGAGATTATGGTGACTTTACAACTGATGAAAATGGAGTTGTCTATCAGATGGGCCAAGTTGCTTATGAACATGCTCAGAGTGTAGTTCAAGAATCACAAAACGTTCAAACATCATGTATGTCAACACTAATTGACGACAGTTTTGACGAATCTCTACACCTGCTGAGTAATGAAAACTCCAAACAAATATctcagaaaaaaacaaaaaattatagaaCACAAGTTATAAAAAGTACCAACAAACAAACAATGGACACCAAGTTCCATGAAATAGAAACAACAAGTCAGTCTCTAGAAGAAATACAAAAAGAGAGAGAACGACAAATGTTAGCACAAATGGAAGAAAATATGAAGGAAATCAGCAGCAACAACCAAACAACTCCTACAGTAACCAACTATGATCCAACACTGTACCATAttaacaatactgaaaacaatTTTCAATCTTTACAACCAATCATTCAAACAGTTGGTAATGAGGAATCTATGCAATTTGTAAACGAAACCAATATGGAGATCAGTGATCCAAACTTGAATCAGAATCAAACGTTACATATTATAAACAATCCAGATGAATCTAATTATGATCTACAATCCGAAATGCAAACCAACCATGAATATCAACAAATTGACGAATCCCAACACTTGGCAG
TcatgcaaaatttaaatcaaCACCAATTTCAGCAAATCCCAGAACAAGAACAAGTATGTAAGACATTGATATTACAAAGTGCAGATATAAGCAATCCTTATGAAACGGATTATAATGAAACCATTGACCCATCTTCGTATCAAGCACAGGTTATCAGTGATTCCACAACGTACTTGACACCAAACCAAATGGCGTACGTTCCACAAATGATGGATACTGACATGTCTACAGAAATGAATTTATCAAATATGGGAGACTCAGTTGTGTTACAGCCAATGGAATCCTCTTATGAAACAATGCAACATGCCATGCCAACAGCATACGGTATGGGATCCCAAGGAGCAACAACATTCGATATGGGAAACCAAGGGACAGCAACCTTTGAAATTGGAAATGATGGAACAACGACATATGAAATGAGTACCCAAGGTTCTAATAATGGTGCCCCAATTTTTGTCTTATTACTGAAACCTCCCGATCATCTGCCATCTTGA
- Protein Sequence
- MKKGVGDTNPTSPHLTNIIQDNVQSSLSPKIYQNSTPQHVLHTSPHGVSPMHQEFSGLQQVRHIYPEISIIREQTNLPSSRGKPLIHRTKPKSGVVDSNPNIHEQMNANVSSIRNDSIRAMNKDVRSIPQSSVTVNQTNATPNMLGQDRNDPHLGNSFSGGKSKLPEFSQIKQRKVSPSLSSTPQHDTNYPGGDSVIHLLDNNIMAEQNLIEFEKSFNDVIKGNLNILESSIIASDGQEYTSSNFYRNTSQCNVDPNLQRSPHLSSPHVHNSPHLQNSPLQSSPHLIVESSSSFNSPDSHQSLTSSDYYPSKDSDSLESGSVITNSISGMMLGHEDRSKHIEPGQIRNRLFEEIMNEKIHIETNIAAMSFTPLEECLLQGTFNLPTDIPSALEKSISLDIQRETMKANTEYDDKKIGSKNSAESSLENNQKKIIQVRSDLFHHNKTTKTDSLPSKTQNKVKLVRSQTPQNIATKNKGQNINVDKNSNTIQNTTSLLASKKSNRKLNNLNLNQPNKMLELIKSASNIKIRSNAKTQSINKPVPSYKKIPPNIPNNSGNRSVASTPLKVKEQATPLMGTIQSVKNSSLNIIGSSTTPSPQPNLHRKEHGKIEGKECTIKNVKQISTNQEKSTINYSTLQSNSDLKRDNLSGDKGKTVNEVISFKNVIIKNNSYNKVDRNTTDININKNIQTKEVNSTVISQNVHKGEQECSRICQTNSLPVKTLSYCDKPEVISRLKLTGKTADPYSFDSESDKLPQRKVSDDISDDQTIVTCPKQQNIEQDKNKNINDQETSGFKIKVINKTDLSLILHKEDTACLNEERSAKKSLEKMKESKQQNTIDQLASPEEILRQSVSSNPPVESHSETRDLSSTKKHLSYKEDSPSLMTCSMGGNPTRLKLKYVKNSTSSESNSDKGSDSPNMDSSERMKTLRIKIPSYSVEKDHSPKRSNKSSPLPKDVKSDHRVQEKLIINLKTNQITRTKEDVINNNDSEDNLPTAIDSNNTSIIQGPNLALSPNVLKEKRKSFRKIDEICQNLRRKSHEKEGTNIDNVSNDCAKEGLEIEKLVDSSIDSGDITQRKVNNLNTEKLSDLKISDSVNREKGIEKSKSFVVKFRRMSRSGENIIVKQIVDRKNVTDETDDEYQTEREDNTDTENDLETGNLNTAIVRHAENDNSHNTKTDFKLKIRIGSEIMSEQNVNKKKDKKKKKSKKHKDKSKKSRHRSGVSEEIDQAHASKSDDHVFIKPLILKRKNDVSLFTLSQNDTNSSKDNNTESEENTNLSDRLCLLKQKDSSSDEMNVSNEITEELELKQPIPDRLKEESLKICSKSNKRKSKQSVQEHHESEPSSHLLGQIVSSENTSLKEHKNDIKSECDAYAEKISSLLTETQVCESCSKSFENEFEYNKHKVLIHNYNAFLCHICYVSFVEKWKLDIHLMSAEHMAMINSGSDQVTDVITTSRTTRRSKNNLGNETLSPKHENKKKSNKTKIKEEDKSNIDVASSSLTVSTDSKNLKDDKDNVKNQILQSMDSNIVESKESTGQMDLKTIKNSEMSLSAETSLLKNNTNVCNEDNFTLNSENQALVNTHSKDNFDEIERRDDLKSPNKEKNTSKEINKTKERKNDFEKYLAIEDSCSKDSDASQSVPALNTVISSSLDSKSAINNDQISEHESGISTCSSTSEIDCSSFSQSDNLLSPDSVGEKSKATELQHPNSLLIKNRVIESVTENILPEQNTSLIVDLEENIPSCIIRNRKLNKKTNEKVQLDSGRKAAYKDELKTEIIKSKNEEQKLIDLNVTTLSKTTRPVPKLRPLPGLIKITPEVLPIIPNLQIDGLDRIRKIQQTVHSKTIDILEIGKVQEFQTTDKEIVVPKKSLKKKRDCCESESDKNEKEIMNFKAYPQKNDQVCHADGNLNEHINSDDGNNERIHEVTETCDKKVSKNKQKVNGDVEINIIGKDEIADRPCRVLRKKNIPPKSETIVKEVTENLIEPITVINEKVSLLFDKIETE
NTSEDLKVKGDVLINECTRLRRSTYKKSSVLYERRKIPRSTSKNKKKNVNYDENIVDKILENNNDHSELIDEINIPGARISTESEINELCINPESLNKNIKSKETLSVCNKITTCEEKISSSSQTNDESSTTVINSETMSETETETLNQKRSADSVSKLKETNESFDEDDIPLDIRQISKAKSHENLISSQTEIDFKENDEDDIPLEIRKTNLSKSVENLNSLETVCSSKTELIKTINYINNDENEALANKKMCRAKSHENVSSKTHSEIEKKSSTVFLSNKTPIEQNIRLRKPISMDKNSVDDKNMKDCSTTRSLIENKKNRKIKTHECLSSLKDADKLTDIKGVIQEDLSSERRKSLRVKTREHISSAEISNKTTLEEVCLTDKKTSKKQKTIQQTDETNMLEKVSVPSEVRKSLRIKVIDQTVSNLENSPQKALPGILNEATFNSRTRLRGKSQDNFSSSLVSLEFPNPNSNQILEKDGTKISNQDLITKENGSHISIVQVLEHETIEVNTEKKYDINCRDQENVSIDIEDKMSSLVTEKSERHLEAKVATVETSIQDTQKDQILIENSKTAHWNELNLSSPTKNFKKRERNKKYLVKKQLEELSQIAAQNSLLENNKKELIEKQNSVPCDIKKNRSRRKTLEECTNHKNEDDTIPESPNETVNITENNLPMRKSLRNRKIKKFPDEEIEIFNGKRRNQKKINNNNKVTPLNTMLDFEKVENEPKLTRNRSHDSLRSIDNPHVEHEKDRHAVSKTMEVEHNQVSTLNETMKNKKYANKSDDLTDDLTNKSQIDHKRNIKHVNNDLEKEGQQSVHNHITSDINLQSDSITIFKESSVVKNNKNGKSKLRLSGSTDKSIHKETITELTKFLPSPEKFEKQNLLTDKLIGLQESSQAPSKDCVQNKENDLEEVVEQGEETSEQFDDDLENKLKEFLKKPILDSAVSIIKPIVHSSENIYKETNKPFNNKLDKIELIKQNIDPKITIKTLHAIEPTKPESSQTETSVGMVAKPIEDIGKEKTGRIKWKAETTEKNIDDLESNTNNASKDFELEFIQAKRAEGSSPINLLSQGKFSRIESKAEIVNKLTDLIVEKGTSPTEDTTESILGFKDKKIHRKSSPMEDVFVDKVSIPTETGTEISVDHIETVREEGSSLFELKADFLVESTETIVGLVSNPIDTKIEIVENTETTNKASRSYESKLKNCIEHTTTAVDGVSSLSKVITAIVGKVSETVVNEVSNPISFESQTETMVDIMTESTEIVSEVSEQIVTIDEVKEAVDHITKKVSERIEVLKSPQTIISKVTENVDNFTEVLETFEIIDEASENHKVITEEYKIIEEGHGIDESDNVGIISDLSKEIEITTENISQTINTVIEVTGIVNTFTEVTETIKQMTEEVSEFIENCDEISKSAEKVIKPVAERITHASVEELFDEIIEIVVEKTEQTAEHIPESVDGEITEHCAEEISELSAEEIHESAVEDIYELSAEEIHESAVEEISEKSTVEITELSAEKIHESAVEAKSEQSAGEIHESVVVEISEKSTEEITKPAAEEIKELTAEEIHEQAVKDISEQSAEEIHESAVVEISEQSAEKIHESAVEAISEQSAEEIHETAVVEISEKSTEEITEPAAEQINELSAEEMHGPAVEEMSEQSAERIHESAVENISEQSAEKIHESAVLEIYEKSSVEITELSAEKIRESAVEAISEQSTEEIHESVVVEISEKSTEEITKPAAEQISELTAEEIHEQAVEEISEQSIEKIHESAVEDISEQSAEEIHESLVEEISEKSTEEITKSAAEQINQLTAEEIHEQAVEEMSEQSAEEIHESAVEDISDQSAEEIHESAVVEISEQSAEKMHESAVEDIYEQSAEEIHESAVEEISEKSTVEITELFSEKIHESAVEAISEQSAEEIHESAVVEISEKCTEEITEPAAEQINELPAEEIHALAVEEMSEQSAERIHESAVEDITEQSAEELQESAVEKISEKSTVEITELSSEEIH
ESAVEAISEQSAEEIHESAVVEIYEKSTEEITVPAVDKITELSAEKIHESAVEEISEQSVEKIHELAVEEIFEQAVEEMHESAVEAIYETPAEEITEPAIEEISEQSAGEITEQHVEKITESASEDMSAQSSEAITESAVEKMSEQPSEQTTKPAVEEMSEQSAAEITEQPSEQITEPAVEEMSEQFASEITEQPSDEITEHVVGGIYEATVGEISEPIVEGIAELTAEKLTEPIVEEIFELSIEEMHQSAIEAIPEKPAEEITVSAVEEISKQPAKHITESSIEEITELSAGEIHEPTVEESAAEITEQPAAEITKSAVEEIFKQSAEEITAQAFEGRSEQTVAEITESAVEDMSEKSAETIIESAVEKIFQQTSVEKTELSVEEISEKAEKITEQPATGITKSAGEEFFKHSAEEKTESTVVEMSEEITVLADEKLQKRSESVAEITAKGITESAVEEISEPFDEIESAVEEMSELYAEEITEPVVEGMYVAVGEITEPIVDGISETTAEKIEPIDEGISETTAEKITEPIVEEIFEPPIEEIHELAEQPAEHITVLAVEGLTEPTTEEISKSIVEEISKKSAEEITESVVDAIYEKHPGKITETAIEEIHYSAVEEISVHSAEEITVSAVEEIALQSTEVITEPAFDKVSQQSAEVITVSAVEKISDQSTKEIFEETLGKMARPAIEEILEQSAKEISEPTVGERAKPSVEEMPEQSPEEISVGEIVKPAVEEIPEQSSEETVGQITEPKVENIYELTALEITEPIVGGISEPTSGEITKSLVAEIAKKRCEITETYNEEISEPTAGEISESIVEEIYKKSVKEITESAVDAISEKHPGEITETAIEEMHDSAVEEISEQSAEEITVSAVEEISIQSTEVITVSAVEKILDQSTKGIFEETVGKIDKPAIEESIKKTQQLEQSTKEISEPTVGEILKPAVEEIPEQTTEETFGQITEPKVENIIELTALEITEPTAEGISELTSEDITEPIADEIFEPTTQEITEPIVGGISEPTAEEITESLVAEIAKKCCEITETYNEEISEAADDGNNKSSPREITGHCGEITNKPSAGVANEKNGDEIIEVGEVIIPPFEVKTEPSDYEIFEQGDDEITEIPDNDIIESANDEIIEEMSEPTAQEITEQTAQEITELIALEITKSVDSEITEPIFKEISEPTAEEITEQTAQEISEQCDRITKQSAGEIIEQSGNVLEAADDEKTKSSSEEITEQCDGITKPSGGEIPEQNGDEIIEIDDCDIIEPAGEIIIPPFEVKTEPSDEEITETFCDEITDAADDEKTRSLDEEINEQRDEIKQSSGEIPEQNGDEIIEMDDFDLIEAPGEGIIPPFEVKTEPTDEEIFEQTGDEITEAVDDEIIESADEDIKLHLEDKTKKFAEDEISELSVEKTEKCDEITKPFTEDVTKLAADDISETAGEKTSKPEISEPTAQEIFGSVDEDKSETAIELTSEPPAEELTETSNDDKIKLIDKNFKPHDKDKTESGGDKRAKSSLDKSTELGVEIISVKSVSTIYESTETIIEKCDEITEPFTEEITEAAHISNPADQDNKSPFKDKIKTAAEMSESVGNEAKQLTDKITETTVDSEKIKEAIEKIIESSDEEIGDIIEPALNDKPELVDSEKIEEATEEIIKPAGAEITEAAGDINEPAVKDKTESVDSEKTKQDIEEIIKPSGAEITQAIGDINEPAVKDKTESIDSKKTKEAIDEITETSDAEVTEATGDINEPAVKDKTESVDSEKIKEAIEEIIEPSGAKIIQAAGDIIEPAVKNKTESVDSKKTKEAIEGITETSGAEVTEATGDINEPAVKDKNESVDSEKTKQDIEEIIEPSDAEITEGDITEPAFKDKTESVDSDKTKEAIEDITEPSGTEVTEATGLIIEPAVKYKIELATEEISQRAVVDVDKPKYEELNESSMQLINETNDVDFTTTVVKENLDEKTEPVVEEIMKLSVKTSPDITATNTLEPTDTILLTHNTESLEIASNL
DTLVSKDVQTSEVSENKPLSLKESSEQKGTYGPIKDGPNPQSVPVPMKSDQTTSDPNRSPKKKPLQQIQCEKCFEYFGNKENLLKHDKHVHNTTQATFCNICKRNFRNVNALRMHNATFHKNKNETSKKDANNKKKSINCEKCKTTFKIQKDFEMHVKSCKRNSELQCDICLEIFRSKYILQKHKNNDHTARDHENRNIDDNKRTNNGDKKEKKNLKLETIPQNTSITTRRSKRNIQNEEDDEKKKDDVDSNKLNEQKPNQTPEDKSNKRNSEESNSATNGEIPTPNNESQAQQHSPIQSGTSNHPPTPNITSQQPLTKIETSQSPPNESPSKKVEYKCDTCRILFDNIKLLEDHIPVCRNINKKCLKCGETFKTKLLLKMHSCIKPKPIQCRFCQKNIPKHKFQDHIKICALPKDDLICNYCKQSFSGKFLLQRHFVSHHNIETGIRKRKSLEMNNDDIKKIKAEHKESSSSDGKLIEDINEIRKDSVKSKVTTPVNDDLKITKMIIDTDTNSNKGYQCKKCNEKLSNQRDLTKHYLQIHSNVEDIKCDECSKTFRNLTLLDKHVINAHKNAIKCNICNKVFKTLYLYSKHNRLFHNNKSSETNSKPNENQIETSPEISVDKESLIEKCPLPRQYSRFHNAKKPAVESSASIENESKPMSNTIQCVAEVHCSGTPDEPDETIVKPPDLIKPGSSENNSPKQVHTNSKSVKENESSMLHENEERKKIVCKKFEASPDKKLITFINLLKEENKMNERRKLGDAKISSATSVIKEVTPNPNQELKAEINKSSKLKNIETPSLEKQRDAQSNSTHIGPFQNKKSRLLPKVDVKLNFVEQFQKTIPSQIQSTQKPQHLEPEPTEVIFYKCARCDKEFPDKLELRQHISDKHIALPPLEEVFRKEKKFIRKVSKDTYVLVQKVGQPKEKVYYSLNIPELKCCQCEAKFTSQKSLELHTKTSHINTKFPPINLKSKSASTSNINVNISSIQGLESTNTPAKRRHSETAIISSATAEPSTLIPRHPSPITIPTTSENINTPIQKNSDATNMPTTSESSSTSILSNVLQYGSNFADGDPLAPVVSDFTQDENNSSCDIFNPNDILNSNNFVKTDTNTSATDSAINGPLADLQCFRCGVKFNNGPERHYCTVFKCDKCFQAFVSKELLNAHYLLKHDLYLVPKHVKNINEFTNIKNLNCKFCSAIFTKNKDLNKHYKEFHKYNPSQMKSSVIKNYKDVSSFIHCTMCNDIFNSNFELQKHYLDYHNYDSSVKTFEMTPSTSTEKRLEETNSNTVQTISNTGLPQTNIKYSIVSKRGRKRKSAQDDTMKNINSLNSINSVLNDALGYTVSSTVLNNILPNQNTITSDIIIKNPAIPPLTSTSAAILPTAVTVPSLVPTNTQILPDLVFTNTATITKKVPTNDVNDIPRLISFSNPIVGSLPSIKQSQLDPVPEEDVPSENIVNQTNLVKDSLSSNDLADNSNTDLNNLFVSHPGTSSTVDLSVIQPDSFQMPSDQLVHPQFSCDFCPLTFVSESALSLHLLRDHKTPSSEAGGGFIKCDFCSFEFENKDKLKTHMQDMHKDSQICIQCNRKFSNKYNLKRHVTVTHDARYNCTMCAEVFFRKADYTKHLAVVHSASMRDNNIGPTPANHHQIPTDHMCKVCCKYLSSKYHLKRHIVKFHANVTSVLNSCKQCMKGANGINRFYSHLVYEHKVKSSIVEGMSSAAVVCNECRRIFVNKKMIKIHLKKKHDFKPHRCAICFRGYAFKKHILRHMEIHDSLKDFLSFSAPQDDMSEWGDYGDFTTDENGVVYQMGQVAYEHAQSVVQESQNVQTSCMSTLIDDSFDESLHLLSNENSKQISQKKTKNYRTQVIKSTNKQTMDTKFHEIETTSQSLEEIQKERERQMLAQMEENMKEISSNNQTTPTVTNYDPTLYHINNTENNFQSLQPIIQTVGNEESMQFVNETNMEISDPNLNQNQTLHIINNPDESNYDLQSEMQTNHEYQQIDESQHL
AVMQNLNQHQFQQIPEQEQVCKTLILQSADISNPYETDYNETIDPSSYQAQVISDSTTYLTPNQMAYVPQMMDTDMSTEMNLSNMGDSVVLQPMESSYETMQHAMPTAYGMGSQGATTFDMGNQGTATFEIGNDGTTTYEMSTQGSNNGAPIFVLLLKPPDHLPS
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -