Basic Information

Gene Symbol
-
Assembly
GCA_012654025.1
Location
CM022880.1:25055982-25080521[-]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The zinc finger is described by the following pattern: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
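The pattern in the description can be sketched as a regular expression. This is only a rough illustration, not the method used to produce this record: it assumes that both the `#` and `X` positions accept any residue, and the names `C2H2_PATTERN` and `find_c2h2_candidates` are hypothetical. A pattern scan of this kind is far less sensitive than the profile HMM (PF00096) reported under Hmmscan Out.

```python
import re

# Regex rendering of #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C].
# Assumption: '#' and 'X' positions are both treated as "any residue".
C2H2_PATTERN = re.compile(
    r"..C"         # #-X-C
    r".{1,5}C"     # X(1-5)-C
    r".{3}."       # X3-#
    r".{5}."       # X5-#
    r".{2}H"       # X2-H
    r".{3,6}[HC]"  # X(3-6)-[H/C]
)


def find_c2h2_candidates(protein):
    """Return (start, end, text) for non-overlapping pattern matches.

    Coordinates are 1-based and inclusive, relative to the input string.
    """
    return [
        (m.start() + 1, m.end(), m.group())
        for m in C2H2_PATTERN.finditer(protein.upper())
    ]


# A stretch taken from the protein sequence of this record.
print(find_c2h2_candidates("FLCHICYVSFVEKWKLDIHLMSAEH"))
```

Because the regex ignores the hydrophobic character of the `#` positions, it is best treated as a quick filter for candidate fingers rather than as evidence of a functional domain.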
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 24 0.17 16 6.4 0.6 2 22 1383 1403 1382 1405 0.88
2 24 0.64 60 4.6 0.3 1 22 1410 1431 1410 1434 0.88
3 24 0.44 41 5.1 1.6 2 19 6067 6084 6066 6089 0.93
4 24 0.076 7.1 7.5 1.2 3 23 6097 6118 6095 6118 0.94
5 24 0.26 24 5.8 0.6 3 21 6138 6156 6136 6157 0.92
6 24 0.17 16 6.4 0.5 2 23 6165 6187 6164 6187 0.94
7 24 2.9 2.7e+02 2.5 0.1 1 20 6335 6354 6335 6354 0.96
8 24 0.075 7 7.5 2.1 2 19 6363 6380 6362 6383 0.92
9 24 0.011 0.98 10.2 2.7 2 23 6417 6438 6416 6439 0.92
10 24 0.0011 0.1 13.3 1.9 1 23 6516 6539 6516 6539 0.95
11 24 0.002 0.19 12.5 0.3 2 23 6546 6568 6545 6568 0.93
12 24 0.00092 0.086 13.5 1.8 2 23 6573 6595 6572 6595 0.95
13 24 0.0016 0.15 12.8 0.5 1 23 6862 6885 6862 6885 0.94
14 24 0.035 3.3 8.5 3.7 2 23 6933 6955 6932 6955 0.96
15 24 0.0022 0.21 12.3 0.9 1 23 7142 7165 7142 7165 0.95
16 24 0.039 3.6 8.4 2.0 2 23 7187 7209 7186 7209 0.94
17 24 0.47 44 5.0 1.5 2 23 7232 7254 7231 7254 0.91
18 24 0.0046 0.43 11.3 0.1 1 23 7509 7532 7509 7532 0.97
19 24 0.0071 0.66 10.7 1.2 2 23 7545 7567 7544 7567 0.94
20 24 0.0022 0.21 12.3 1.3 5 23 7575 7594 7572 7594 0.91
21 24 0.0076 0.71 10.6 0.2 1 23 7598 7621 7598 7621 0.95
22 24 9.9 9.3e+02 0.8 8.6 1 23 7644 7667 7644 7667 0.94
23 24 0.019 1.8 9.4 0.9 2 23 7714 7736 7713 7736 0.96
24 24 0.29 27 5.6 3.0 1 23 7741 7763 7741 7763 0.97
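The domain table above can be read programmatically, for example to keep only the better-supported hits. The sketch below assumes whitespace-separated rows with the thirteen columns shown in the header. The column names, the helper names, and the independent E-value cutoff of 1.0 are illustrative choices, not values taken from this record or from HMMER.

```python
# Column names are assumptions chosen to mirror the table header.
HMMSCAN_COLUMNS = [
    "index", "of", "c_evalue", "i_evalue", "score", "bias",
    "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to", "acc",
]
INT_COLUMNS = {
    "index", "of", "hmm_from", "hmm_to",
    "ali_from", "ali_to", "env_from", "env_to",
}


def parse_domain_row(line):
    """Turn one whitespace-separated table row into a dict of numbers."""
    fields = line.split()
    if len(fields) != len(HMMSCAN_COLUMNS):
        raise ValueError(f"expected {len(HMMSCAN_COLUMNS)} fields, got {len(fields)}")
    return {
        name: int(value) if name in INT_COLUMNS else float(value)
        for name, value in zip(HMMSCAN_COLUMNS, fields)
    }


def significant_domains(rows, max_i_evalue=1.0):
    """Keep rows whose independent E-value is at or below the cutoff."""
    return [row for row in rows if row["i_evalue"] <= max_i_evalue]


# Two rows copied from the table above.
rows = [
    parse_domain_row("10 24 0.0011 0.1 13.3 1.9 1 23 6516 6539 6516 6539 0.95"),
    parse_domain_row("7 24 2.9 2.7e+02 2.5 0.1 1 20 6335 6354 6335 6354 0.96"),
]
for row in significant_domains(rows):
    print(row["index"], row["ali_from"], row["ali_to"], row["i_evalue"])
```

The same filter can be applied to all 24 rows to see which predicted fingers remain under a chosen cutoff.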

Sequence Information

Coding Sequence
ATGAAGAAAGGTGTTGGGGATACAAATCCAACATCACCCCATCTGACCAACATCATTCAAGATAATGTCCAGTCTTCACTCTCgccaaaaatatatcaaaactcAACACCTCAGCATGTTCTGCATACATCACCCCATGGTGTTTCTCCCATGCACCAAGAGTTTTCTGGATTGCAACAAGTCAGACATATTTACCCAGAGATAAGTATCATTAGGGAACAAACAAATCTTCCTTCGAGTAGGGGTAAACCTTTAATACACCGAACGAAACCAAAGTCAGGTGTCGTTGATTCGAATCCAAATATTCATGAACAAATGAACGCAAACGTATCATCAATTCGGAATGATTCAATTCGCGCAATGAACAAAGACGTCAGAAGTATCCCTCAATCCAGTGTTACTGTAAATCAAACCAATGCTACCCCAAATATGTTAGGTCAAGATAGAAATGATCCTCATTTAGGAAACTCATTTTCAGGTGGTAAAAGTAAACTACCTGAATTCAGTCAAATTAAACAAAGGAAAGTTTCGCCTTCCTTGTCGTCAACCCCTCAACATGATACGAACTATCCAGGTGGGGATTCCGTCATCCACCTTCTGGATAACAATATCATGGCTGAACAGAATctcattgaatttgaaaagtCTTTCAACGATGTTATTAAAGGTAACTTAAACATTTTAGAGTCCTCCATTATTGCAAGTGATGGTCAGGAGTATACTAGTTCTAATTTTTATAGGAATACTTCCCAGTGTAACGTTGACCCCAACTTGCAACGTAGTCCACATTTGAGTAGTCCCCATGTTCATAACAGTCCTCATTTACAAAATAGTCCTTTACAGAGTAGTCCACATTTAATTGTGGAAAGCTCCTCTAGCTTTAACAGTCCTGATAGTCACCAGTCTCTCACCAGTTCGGATTACTATCCAAGTAAAGATAGTGACAGTTTGGAATCAGGTTCGGTGATAACTAATTCAATATCTGGAATGATGTTAGGACATGAAGATAGGTCAAAACACATAGAACCAGGCCAAATACGAAATCGTCTTTTCGAAGAAATTATGAATGAAAAGATTCATATTGAAACAAACATTGCAGCTATGAGCTTCACTCCTTTAGAAGAGTGTTTACTTCAAGGCACATTTAATTTGCCTACAGACATTCCCTCCGCTTTAGAGAAAAGTATATCTCTGGATATACAGAGGGAAACTATGAAAGCAAATACTGAGTATGATGATAAAAAGATAGGATCAAAGAATTCTGCAGAGTCTTCCTTAGAAAAtaaccagaaaaaaataattcaagtaAGAAGTGATTTATTTCATCacaataaaacaacaaaaacagataGTTTACCatcaaaaactcaaaataaagtaAAGTTGGTAAGATCTCAAACGCCTCAAAATATTGCTACAAAAAACAAAGGTCAAAATATTAATGttgataaaaattcaaatacaattcAAAATACAACTTCTCTTTTAGcgtccaaaaaatcaaatagaaaattaaacaaCCTGAATCTTAACCAACCTAACAAAATGTTAGAACTGATCAAATCAGcatctaatataaaaataagatcGAATGCAAAGACACAAAGTATCAATAAACCCGTCCCCAGCTATAAAAAAATCCCTcccaatatacctaataattcTGGTAATCGAAGTGTAGCTAGTACACCTTTGAAAGTCAAAGAACAAGCCACCCCCCTAATGGGAACCATTCAAAGTGTAAAAAACAGTAGCTTAAATATAATTGGAAGTTCTACGACACCTTCTCCACAGCCAAACCTACATAGGAAAGAACATGGTAAAATTGAGGGTAAAGAATGTAccataaaaaatgttaaacaaATATCAACTAATCAAGAAAAATCAACCATTAACTATTCTACTCTTCAGTCAAATTCTGACTTAAAGAGGGATAATCTAAGTGGCGATAAAGGTAAAACAGTGAATGAGGTAATATCATTCAAAAATGtgatcattaa
aaataattcttataatAAAGTTGATAGAAATACAACTGAtatcaatattaataaaaatattcaaacaaaggAAGTTAATAGTACTGTGATTAGCCAAAATGTTCATAAGGGTGAACAAGAATGTAGTAGAATCTGCCAAACTAATAGTCTGCCAGTCAAAACCCTTAGTTACTGTGATAAACCTGAAGTAATTAGTAGACTAAAGCTGACAGGTAAGACCGCAGACCCGTACTCATTTGATTCCGAATCAGATAAATTACCACAACGTAAAGTTTCAGATGATATTTCTGATGACCAAACAATTGTTACATGTCCTaaacaacaaaatattgaacaagacaaaaataaaaatataaatgatcaGGAAACATCtggtttcaaaataaaagtaattaatAAAACCGATTTAAGTTTGATTTTACACAAGGAAGATACTGCATGTTTGAATGAAGAGAGATCTGCGAAAAAGTCCTTAGAGAAAATGAAGGAAAGTAAACAACAAAACACTATTGATCAACTAGCGAGCCCAGAAGAAATATTGAGGCAAAGTGTATCATCCAACCCACCCGTAGAATCTCATTCAGAAACCCGTGATTTATCTTCAACTAAAAAACATTTGTCATACAAAGAAGATAGTCCTAGTTTAATGACGTGCTCAATGGGAGGAAATCCAACTCGCCTGAAATTGAAATATGTGAAAAATTCTACTTCGTCTGAATCTAATTCTGATAAAGGCTCAGATAGTCCCAATATGGATAGTAGTGAAAGAATGAAAACTTTAAGAATAAAAATTCCGTCCTATTCTGTAGAAAAAGATCACTCTCCGAAGCGTTCAAATAAAAGTAGTCCATTACCCAAAGATGTGAAATCTGACCATAGAGttcaagaaaaattaattataaatttaaaaacaaatcaaataacaagaacaaaagAGGACGTTATCAATAATAATGACTCTGAAGACAATCTACCCACTGCAATAGATTCAAATAACACATCAATCATCCAAGGTCCTAATTTAGCTTTATCTCCTaatgttttaaaagaaaaacgaaaaagttttCGTAAAATCGATGAAATATGTCAAAATTTACGGCGAAAAAGTCACGAAAAAGAAGGAACAAATATTGACAATGTATCAAATGATTGTGCCAAAGAAGGATTGGAAATTGAAAAACTAGTTGACAGTTCCATTGATAGTGGTGATATTACTCaaagaaaagtgaataatttaAATACCGAAAAGTTGtcagatttaaaaataagtgATAGTGTAAATAGAGAGAAAGGAATAGAGAAATCGAAAAGTTTTGTTGTTAAATTTAGAAGAATGTCAAGAagtggtgaaaatattattgtcaaaCAAATTGTCGACAGAAAGAATGTAACGGATGAAACTGACGATGAATACCAAACAGAAAGGGAAGACAATACAGACACAGAAAATGACTTGGAAACAGGAAATCTAAATACGGCGATTGTGCGTCATGCAGAGAACGATAATAGTCATAATACGAAAACagattttaaacttaaaataagaaTTGGTAGTGAAATAATGTCAGAACAAAacgtaaataaaaagaaagataagaagaaaaagaaatcgaaaaaaCACAAAGATAAATCTAAAAAATCGCGTCACAGATCTGGAGTTTCTGAAGAAATCGACCAAGCGCATGCTAGTAAATCTGACGATCACGTATTTATCAAACCTCTAatattgaaaaggaaaaatgacgtttcactttttactttatcacaAAATGACACAAATTCCTCCAAAGATAATAATACTGAATCTgaagaaaatacaaatttaagtgACAGATTATGTTTATTGAAGCAGAAAGATAGTTCTTCTGATGAGATGAATGTAAGCAATGAAATAACAGAAGAACTAGAATTGAAGCAACCCATTCCCGACCGATTGAAGGaagaaagtttaaaaatttGTAGCAAGAGTAATAAGAGAAAATCAAAACAGTCTGTACAAG
AACACCATGAAAGTGAGCCATCTTCTCATTTACTTGGTCAGATAGTTTCATCTGAAAACACTTCATTGAAAGAGCATAAAAATGATATCAAATCAGAGTGTGATGCGTATGCAGAAAAAATTTCCTCACTTTTGACGGAAACACAAGTTTGTGAAAGTTGTTCCAAGTCTTTTGAAAACGAATTTGAGTACAATAAACATAAGGTTTTAATTCACAATTATAATGCATTTTTATGTCATATATGTTATGTATCGTTTGTAGAGAAATGGAAATTAGATATACATTTAATGTCTGCAGAACATATGGCAATGATAAATAGTGGATCCGATCAAGTTACAGATGTTATAACTACATCTCGTACAACCAGacgaagtaaaaataatttaggaAATGAAACATTGTCACCAAAacatgaaaacaaaaagaaaagcaataaaaccaaaattaaagaagaagataaatctAACATTGACGTTGCTAGTTCTAGTCTCACAGTCTCTACAGATTCTAAAAACCTCAAAGATGATAAGGACAAtgttaaaaatcaaatcttaCAATCGATGGACAGTAACATAGTTGAATCTAAAGAATCAACGGGACAGATGGActtaaaaacaatcaaaaactCGGAAATGTCTTTGAGTGCTGAAACaagtcttttaaaaaataacacaaatgtATGCAATGAAGATAATTTTACTTTGAACAGTGAAAATCAAGCACTTGTTAATACCCAttcaaaagataattttgatgaaatcgaaAGGCGTGATGATTTAAAATCtccaaacaaagaaaaaaatacaagtaaggaaattaacaaaacaaaagaaagaaaaaatgattttgaaaaatatttggctATTGAAGATTCCTGTTCAAAAGATAGTGACGCCAGTCAAAGTGTACCTGCGTTGAACACTGTAATATCTTCTAGTCTCGACTCTAAAAGTGCAATAAATAATGATCAGATATCTGAACACGAAAGTGGAATATCAACTTGTTCAAGTACCTCGGAAATAGATTGTAGCAGCTTTTCACAATCTGATAATCTATTAAGTCCAGATTCAGTAGGGGAAAAAAGTAAAGCAACAGAATTACAACATCCGAAttccttattaattaaaaacagAGTTATTGAAAGtgtaactgaaaatattttaccagaACAAAACACCAGCTTAATTGTAGATTTAGAAGAAAACATACCTTCTTGTATAATACGaaacagaaaattaaataaaaaaacaaatgaaaaagtccAATTAGACAGCGGAAGAAAAGCTGCCTATAAAGATGAATTAAAGACTGAAATTATCAAATCAAAGAATGAAGAGCAgaaattaattgatttaaatGTTACTACCCTCAGTAAAACTACTCGACCTGTACCAAAGCTAAGACCATTACCTGGTTTGATAAAAATCACTCCTGAAGTTTTACCAATCATTCCAAATCTACAAATCGATGGTTTGGATAGAATAAGAAAAATTCAGCAAACAGTACATTCTAAAACAATTGACATTCTTGAAATTGGAAAGGTTCAAGAATTTCAAACCACAGATAAAGAAATAGTCGttccaaaaaaatcattgaaaaagaaaagagattGCTGTGAAAGTGAATCCGATAAGAACGAAAAGGAAATTATGAACTTTAAGGCGTATCCTCAGAAAAACGATCAAGTTTGTCATGCAGATGGTAATTTAAATGAACATATAAACTCTGATGATGGGAATAATGAGAGGATTCATGAAGTAACTGAAACATGTGATAAAAAAGTctcgaaaaataaacaaaaagtaaatggTGATGTGGAAATCAATATTATAGGCAAAGACGAAATTGCAGACAGACCTTGTCGagttttaaggaaaaaaaatattcctcctAAAAGTGAAACTATTGTAAAAGAAGTTACGgaaaatttaattgaacccaTTACAGttataaatgaaaaagtaagtttgctttttgataaaatagaaaCAGAAAATACA
TCTGAAGACTTGAAAGTTAAAGGGGACGTTCTGATCAATGAATGCACACGCCTCAGAAGAAGTACTTATAAAAAATCTAGTGTTTTATATGAAAGACGAAAAATCCCTAGAAGTACATCcaagaataagaagaagaatgtaAATTATGATGAAAATATAGTAGACAAAatactagaaaataataatgaccattCAGAGCTCATTGACGAAATAAATATTCCTGGCGCCCGTATAAGTACCGAAAGTGAGATAAATGAACTTTGTATAAATCCTGAAagtctaaataaaaatattaaatccaaaGAAACATTATCAGTTTGTAATAAAATCACCACATGTGAGGAAAAAATCAGTAGCAGCAgtcaaacaaatgatgagagTAGTACAACAGTTATCAATAGTGAGACAATGTCTGAGACAGAAACTGAAACTCTTAATCAAAAAAGATCTGCTGATAGTGTGAGTAAACTGAAAGAAACAAATGAATCTTTTGACGAAGATGATATTCCGTTAGACATCAGACAAATTTCAAAAGCGAAAAgtcatgaaaatttaatttcttctcaAACTGAAATTGATTTCAAAGAGAACGACGAAGATGATATACCCttagaaataagaaaaactaaTTTATCTAAAAGTGTGGAAAATTTGAACTCTTTAGAAACTGTATGTTCAAGTAAAACTGAGCTTATAAAAACCATTAACTATATTAATAATGATGAAAATGAAGcattagcaaataaaaaaatgtgccGTGCTAAAAGCCACGAAAATGTTTCTTCTAAAACTCATtcagaaatcgaaaaaaaatcttcgacgGTTTTCCTATCGAATAAAACTCcaattgaacaaaatattcgATTAAGAAAACCAATTTCTATGGATAAGAATTCAGTTGatgacaaaaatatgaaagattgTTCTACCACCAGATCtttgatagaaaataaaaaaaatcgtaaaataaaaactcaCGAATGTTTGTCTTCTCTTAAAGATGCTGACAAGTTAACAGATATAAAAGGTGTTATTCAAGAGGATTTGTCatcagaaagaagaaaaagtctTAGGGTTAAAACTCGCGAGCATATATCGTCCGCTGAAATCTCAAATAAAACTACATTAGAAGAAGTATGTCTCACAGATAAAAAGACGTCAAAAAAACAGAAAACTATCCAACAGACTGATGAGACTAATATGTTGGAGAAAGTTAGTGTGCCGTCCGAAGTAAGAAAGAGTCTAAGAATTAAAGTTATTGACCAAACTGTTTCTAACTTAGAAAATAGTCCACAGAAGGCTCTTCCAGGTATTTTAAATGAAGCGACGTTCAACTCAAGAACTCGTCTTAGAGGAAAAAGCCAAGATAATTTTTCTTCATCCTTGGTATCATTGGAATTTCCTAATCCAAactcaaatcaaattttagaaaaggaTGGAACCAAAATTTCGAATCAAGACTTAAttacaaaagagaatggaagtCATATTTCAATAGTTCAAGTTTTAGAGCATGAAACTATCGAAGtgaacactgaaaaaaaatatgacatcaATTGCCGCGATCAAGAAAATGTTTCAATAGATATTGAGGACAAAATGTCATCTTTAGTGACAGAAAAGAGCGAACGGCATTTAGAAGCTAAAGTTGCCACCGTGGAAACATCTATACAAGACACACAAAAAGATCAGATTCttattgaaaattctaaaacaGCGCATTGGAATGAGTTAAATCTATCTTCTCCaactaaaaatttcaaaaaaagagaaaggaataaaaaatatcttgttaaAAAGCAGCTTGAAGAACTCTCTCAAATTGCAGCCCAAAATTCtttacttgaaaataataaaaaagagttgattgaaaaacaaaatagtgTACCATgtgatatcaaaaaaaatagATCTAGAAGAAAAACTCTTGAAGAATGCACAAACCATAAAAATGAAGATGATACCATTCCTGAATCCCCTAATGAAACAGTCAATATCAC
AGAAAATAATTTGCCAATGAGAAAAAGTctgagaaatagaaaaattaaaaagtttcctgatgaagaaattgaaatatttaatggGAAAAGACggaatcaaaagaaaatcaataataataataaagttactCCTTTGAATACAATGTTAGACTTTGAGAAAGTCGAAAATGAACCAAAGTTAACTAGAAATAGAAGTCATGATTCTTTACGTTCCATAGATAACCCTCATGTTGAACATGAGAAAGATAGACATGCTGTCTCTAAAACTATGGAAGTTGAACACAATCAGGTTTCTACTTTGaatgaaacaatgaaaaataaaaaatatgcaaacAAATCCGACGACTTAACTGATGATCTGACAAATAAATCACAAATCGATCACAAACGTAACATAAAACATGTAAATAACGACTTAGAAAAGGAGGGCCAACAAAGTGTTCATAACCACATTACATCGGATATCAATTTGCAATCAGATAGTATTACAATTTTTAAAGAGAGTAGTgttgtcaaaaataataaaaatggcaaAAGTAAACTAAGACTATCAGGGTCAACTGATAAATCTATTCATAAGGAAACTATAACTGAACTCACCAAATTTTTGCCATCACCAGAgaagtttgaaaaacaaaatttactgACAGATAAATTGATTGGATTGCAAGAATCCAGCCAAGCACCTTCTAAGGATTGTGTCCAAAATAAAGAGAATGATCTGGAAGAAGTTGTTGAACAGGGCGAGGAAACTTCTGAGCAATTTGACGATGACCTcgaaaacaaattaaaagaattCTTAAAAAAACCAATCCTCGATTCTGCAGTTAGTATAATCAAACCAATCGTTCATTCCAgcgaaaatatttataaagaaacgaACAAacctttcaataataaattagataaaattgaacttattaaacaaaatattgaccctaaaataacaattaaaactttGCATGCCATAGAACCCACTAAACCGGAATCTAGCCAGACTGAAACTTCTGTTGGGATGGTGGCAAAACCTATTGAAGATATAGGTAAAGAAAAAACTGGGCGTATTAAATGGAAGGCTGAAactactgaaaaaaatattgatgactTAGAATCCAACACCAATAATGCTTCAAAAGATTTTGAGCTTGAATTCATTCAAGCAAAGAGGGCAGAAGGGTCTAGTCCTATTAATCTTTTATCTCAAGGAAAATTTAGCCGTATTGAGAGTAAAGCTGAAATCGTAAACAAACTTACTGATTTGATTGTTGAAAAGGGAACTAGTCCTACTGAAGATACAACTGAATCTATTTTAGGGTTCAAAGACAAGAAAATCCATAGAAAATCAAGTCCAATGGAAGATGTTTTTGTTGACAAGGTGTCTATCCCAACTGAAACCGGAACCGAAATTTCGGTTGATCACATTGAAACTGTACGTGAAGAGGGATCCAGTCTTTTTGAACTCAAAGCTGATTTTTTAGTTGAATCGACGGAAACTATTGTTGGACTTGTGTCCAACCCAATTGATACCAAAATAGAAATAGTTGAAAACACTGAAACTACAAATAAAGCATCCAGGTCATatgaatcaaaattaaaaaattgtattgaacACACCACCACTGCAGTCGATGGTGTATCCAGTTTATCTAAAGTAATTACTGCAATTGTAGGTAAAGTTAGTGAAACTGTAGTGAACGAGGTGTCTAACCCCATCTCATTTGAATCTCAAACTGAGACTATGGTTGACATAATGACTGAATCAACTGAAATAGTTTCTGAAGTGTCCGAACAAATTGTTACAATTGATGAAGTAAAGGAAGCCGTAGATCATATAACTAAAAAGGTGTCTGAACGTATTGAAGTGTTAAAAAGCCCACAAACTATTATTAGTAAGGTGACTGAAAATGTTGACAATTTTACTGAAGTGTTAGaaacttttgaaattattgATGAGGCATCTGAGAACCATAAAGTAATTACAGAAGagtataaaattattg
aagagGGACATGGAATTGACGAATCTGATAATGTCGGTATAATTTCTGATTTATCCAAGGAAATTGAAATTACAACTGAAAACATCTCCCAAACTATTAACACCGTCATAGAGGTGACAGGGATCGTTAATACATTTACTGAAGTAACTGAAACTATTAAACAAATGACCGAAGAAGTGTccgaatttattgaaaattgtgaTGAAATATCCAAATCCGCTgagaaagtaataaaaccaGTAGCTGAGAGAATAACGCATGCATCTGTTGAAGAATTATttgatgaaataattgaaatagtTGTTGAAAAAACTGAACAAACTGCTGAGCATATACCTGAATCAGTTGATGGTGAAATAACTGAACATTGTGCTGAAGAAATATCTGAACTATCTGCTGAGGAAATTCATGAATCAGCAGTTGAAGATATATACGAACTATCAGCTGAGGAAATACATGAATCAGCAGTTGAAGAAATATCTGAAAAATCTACTGTGGAAATAACTGAACTATCTGCTGAAAAAATACATGAATCAGCAGTTGAAGCAAAATCCGAACAATCAGCTGGGGAAATACATGAATCAGTAGTTGTAGAAATATCTGAAAAATCTACCGAGGAAATAACTAAACCAGCAGctgaagaaataaaagaacTAACTGCTGAAGAAATACATGAACAAGCAGTTAAAGATATATCTGAACAATCAGCTGAGGAAATACATGAATCAGCAGTTGTAGAAATATCTGAACAATCTGCTGAAAAAATACATGAATCAGCAGTTGAAGCAATATCCGAACAATCAGCTGAGGAAATACATGAAACAGCAGTTGTAGAAATATCTGAAAAATCTACGGAGGAAATAACGGAACCAGCAGCTGAACAAATAAATGAACTATCTGCTGAAGAAATGCATGGACCAGCAGTTGAAGAAATGTCTGAACAATCTGCTGAAAGAATACATGAATCAGCAGTTGAAAATATATCCGAACAATCAGCTGAGAAAATACATGAATCAGCTGTTCtagaaatatatgaaaaatctTCTGTGGAAATAACTGAACTATCTGCTGAAAAAATACGTGAATCAGCAGTTGAAGCCATATCCGAACAGTCAACTGAGGAAATACATGAATCAGTAGTTGTAGAAATATCTGAAAAATCTACCGAGGAAATAACTAAACCAGCAGCTGAACAAATAAGTGAACTAACTGCTGAAGAAATACATGAACAAGCAGTTGAAGAAATATCTGAACAATCTATTGAGAAAATACATGAATCAGCAGTTGAAGATATATCCGAACAATCAGCTGAGGAAATACATGAATCATTAGTTGAAGAAATATCTGAAAAATCTACCGAGGAAATAACTAAATCAGCAGCTGAACAAATTAATCAACTAACTGCTGAAGAAATACATGAACAAGCAGTTGAAGAAATGTCTGAACAATCTGCTGAAGAAATACATGAATCAGCAGTTGAAGATATATCTGACCAATCAGCTGAGGAAATACATGAATCAGCAGTTGTAGAAATATCTGAACAATCAGCTGAGAAAATGCATGAATCAGCAGTTGAAGATATATACGAACAATCAGCTGAGGAAATACATGAATCAGCAGTTGAAGAAATATCTGAAAAATCTACTGTGGAAATAACTGAactattttctgaaaaaatacaTGAATCAGCAGTTGAAGCAATATCAGAACAATCAGCTGAGGAAATACATGAATCAGCAGTTGTAGAAATATCTGAAAAATGTACCGAGGAAATAACTGAACCAGCAGCTGAACAAATAAATGAACTACCTGCTGAAGAAATACATGCACTAGCAGTTGAAGAAATGTCTGAACAATCTGCTGAAAGAATACATGAATCAGCAGTTGAAGATATAACGGAACAATCAGCTGAGGAATTACAAGAATCAGCAGTTGAAAAAATATCCGAAAAATCTACTGTGGAAATAACTGAACTATCTTCTGAAGAAATACATGAATCA
GCAGTGGAAGCAATATCAGAACAATCAGCTGAGGAAATACATGAATCAGCAGTTGtagaaatatatgaaaaatctACCGAGGAAATAACTGTACCAGCAGTTGACAAAATTACTGAACTATCCGCTGAAAAAATACATGAATCAGCAGTTGAAGAAATATCTGAACAATCTGTTGAAAAAATACATGAATTAGCAgttgaagaaatatttgaacaagCTGTTGAAGAAATGCATGAATCAGCTGTTGAAGCAATATACGAAACACCTGCTGAGGAAATAACTGAACCAGCAATTGAAGAAATATCTGAACAATCTGCTGGGGAAATAACTGAACAACATGTTGAGAAAATAACAGAATCAGCATCTGAAGATATGTCTGCACAATCTTCTGAGGCAATAACTGAATCAGCAGTAGAAAAAATGTCTGAACAACCCTCCGAGCAAACAACTAAACCAGCAGTTGAAGAAATGTCTGAACAATCTGCTGCAGAAATAACAGAACAACCTTCTGAGCAAATAACTGAGCCAGCAGTTGAGGAAATGTCTGAACAATTTGCTTCAGAAATAACTGAACAACCTTCTGATGAAATAACTGAACACGTAGTTGGAGGAATATATGAGGCAACAGTTGGAGAAATAAGTGAACCAATAGTTGAAGGAATAGCTGAACTAACTGCTGAGAAACTAACTGAACCAATAGTAGAAGAAATATTTGAACTATCAATTGAAGAAATGCATCAATCAGCAATTGAAGCAATACCCGAAAAACCTGCTGAGGAAATAACTGTATCAGCAGTTGAAGAAATATCTAAACAACCTGCTAAGCATATAACTGAATCATCAATTGAAGAAATAACTGAACTATCTGCTGGAGAAATACATGAACCAACAGTTGAAGAATCAGCTGCAGAAATAACAGAACAACCTGCTGCAGAAATAACTAAATCAGCAGTTGAAGAAATTTTTAAACAATCTGCTGAGGAAATAACTGCACAAGCATTTGAAGGAAGGTCTGAACAAACTGTTGCGGAAATAACTGAATCAGCAGTTGAAGATATGTCTGAAAAATCTGCTGAAACAATAATTGAATCAgcagttgaaaaaatatttcaacaaacTTCTGTAGAAAAAACTGAATTATCAGTTGAAGAAATATCTGAAAAAGCTGAGAAAATAACAGAACAACCTGCTACAGGAATAACTAAATCAGCAGGggaagaattttttaaacaCTCTGCTGAGGAAAAAACTGAATCTACAGTAGTAGAAATGTCTGAGGAAATAACTGTACTAGCAGATGAAAAACTACAGAAAAGATCTGAATCTGTTGCGGAAATAACTGCTAAAGGAATAACTGAATCGGCTGTTGAAGAAATTTCTGAAccatttgatgaaattgaatCAGCAGTTGAAGAAATGTCTGAACTATATGCTGAGGAAATAACTGAACCAGTAGTTGAAGGAATGTATGTAGCAGTTGGAGAAATAACTGAGCCAATAGTTGATGGAATATCTGAAACAACTGCTGAGAAAATTGAACCAATAGATGAAGGAATATCTGAAACAACTGCTGAGAAAATAACTGAACCAATAgttgaagaaatatttgaacCACCAATTGAAGAAATTCATGAATTAGCTGAACAACCTGCTGAGCATATAACTGTATTAGCAGTTGAAGGACTAACTGAACCAACTACTGAGGAAATAAGTAAATCAATAGTTGAAGAAATATCTAAAAAATCTGCTGAGGAAATCACCGAATCTGTTGTTGACGCAATATACGAAAAACATCCTGGGAAAATAACAGAAACAGCAATTGAAGAAATACATTATTCAGCAGTTGAAGAAATATCTGTACATTCTGCTGAGGAAATAACTGTATCAGCGGTTGAAGAAATAGCTTTACAATCTACTGAGGTAATAACTGAACCAGCATTTGATAAAGTATCTCAACAATCTGCTGAGGTAATAACTGTATCAGCAGTTGAAAAAATATC
GGATCAATCCactaaagaaatatttgaagaaacTCTTGGGAAAATGGCTAGACCAGCAATTGAAGAAATACTCGAACAATCTGCAAAGGAAATATCTGAACCAACTGTTGGGGAAAGAGCTAAACCATCAGTTGAAGAAATGCCTGAACAATCTCCTGAGGAAATATCTGTAGGGGAAATAGTTAAACCGGCAGTTGAAGAAATACCTGAACAATCTTCTGAGGAAACTGTTGGGCAAATAACTGAaccaaaagttgaaaatatatatgaacTAACTGCTCTGGAAATAACTGAACCAATAGTGGGAGGAATATCTGAACCAACTTCTGGGGAAATAACTAAATCACTAGTTGCTGAAATAGCCAAAAAACGTTGTGAAATAACTGAAACATACAATGAAGAAATATCTGAACCAACTGCTGGGGAAATAAGTGAATCAATAGttgaagaaatatataaaaaatctgttaaGGAAATCACCGAATCAGCAGTTGACGCAATATCCGAAAAGCATCCTGGGGAAATAACAGAAACAGCAATTGAAGAAATGCATGATTCAGCAGTTGAAGAAATATCTGAACAATCTGCTGAGGAAATAACTGTATCAGCGGTTGAAGAAATATCTATACAATCTACTGAGGTAATAACTGTATCAGcagttgaaaaaatattggatCAATCTACTAAAGGAATATTTGAAGAAACTGTTGGGAAAATTGATAAACCAGCAATTGAAGAATCTATAAAGAAAACTCAGCAACTCGAACAATCTACAAAGGAAATCTCTGAACCAACTGTTGGGGAAATTCTCAAACCAGCAGTTGAAGAAATACCTGAACAAACTACCGAGGAAACATTTGGGCAAATAACTGAaccaaaagttgaaaatataattgaacTAACTGCTCTGGAAATAACTGAACCAACAGCTGAAGGAATATCTGAACTAACTTCTGAGGATATAACTGAACCAATAGctgatgaaatatttgaaccAACTACTCAGGAAATAACTGAACCAATAGTCGGAGGAATATCTGAACCAACTGCTGAGGAAATAACTGAATCACTAGTTGCTGAAATAGCCAAAAAATGTTGTGAAATAACTGAAACATACAATGAAGAAATATCTGAAGCTGCTGATGATGGAAACAACAAATCTTCTCCTCGGGAAATAACTGGACACTGTGGTGAAATAACTAATAAACCATCTGCTGGGGTAGCAAACGAAAAAAATGGTGATGAAATAATTGAAGTGGGTGAAGTAATAATCCCACCTTTCGAAGTTAAAACTGAACCATCggattatgaaatatttgaacaagGAGATGATGAAATAACAGAAATACCTGATAATGATATAATTGAATCAGCTAATGatgaaataattgaagaaaTGTCTGAACCAACAGCTCAGGAAATAACTGAACAAACGGCTCAGGAAATAACTGAACTAATTGCTCTGGAAATAACTAAATCGGTTGATAGTGAAATAACTGAACcaatatttaaagaaatttcTGAACCAACTGCTGAGGAAATAACTGAACAAACTGCTCAGGAAATATCTGAACAATGTGATAGAATAACTAAACAATCTGCAGGAGAAATAATCGAACAAAGTGGTAATGTATTAGAAGCAGCTGATGATGAAAAAACTAAATCATCTTCGGAGGAAATAACTGAACAATGTGATGGAATAACAAAACCATCTGGAGGGGAAATACCCGAACAAAACGGcgatgaaataattgaaatagaTGATTGTGATATAATTGAACCAGCTGGTGAAATAATTATTCCACCTTTTGAAGTTAAAACTGAACCATCAGATGAAGAAATAACCGAAACATTTTGTGATGAAATAACTGATGCAGCTGATGATGAAAAAACTAGATCTTTGGATGAGGAAATAAATGAACAACGTGATGAAATAAAACAATCCAGTGGGGAAATACCTGAACAAAACGGtgatgaaataattgaaatggaTGATTTTG
ATCTAATTGAAGCACCTGGTGAAGGTATAATTCCTCCTTTCGAAGTTAAAACTGAACCCACGgatgaagaaatatttgaacaaactGGTGATGAAATAACTGAGGCAGTTgatgatgaaataattgaaTCAGCTGATGAAGATATTAAACTACACCTTGaagataaaactaaaaaatttgCTGAAGACGAAATATCTGAACTATCAGTGGAAAAAACTGAGAAATGTGATGAAATAACTAAACCATTTACAGAGGACGTAACCAAGTTAGCTGCTGATGATATAAGTGAAACAGCAGGAGAAAAAACATCTAAACCTGAAATATCTGAACCAACTGCTCAGGAAATATTTGGATCAGTTGATGAAGATAAATCTGAAACAGCAATTGAACTAACATCTGAACCACCTGCTGAGGAATTAACGGAAACATCGAATGATGATAAAATCAAACTAATTgataaaaattttaaaccaCACGATAAAGATAAAACTGAATCAGGTGGCGATAAAAGAGCTAAATCATCCCTTGACAAATCAACTGAATTAGGAGTTGAAATAATATCTGTAAAATCTGTTTCGACAATATATGAATCTACTGAAACAATTATCGAAAAATGTGATGAAATAACTGAACCATTTACAGAGGAAATAACTGAGGCAGCTCATATAAGTAATCCAGCTGATCAAGATAATAAATCAccttttaaagataaaattaaaacagccGCTGAAATGAGTGAATCAGTTGGGAATGAAGCTAAACAATTAACTGACAAAATAACTGAGACAACAGTTGAtagtgaaaaaattaaagaagctATTGAGAAAATAATTGAATCTTCTGATGAAGAAATTGGTGATATAATTGAACCAGCTCTTAATGATAAGCCTGAATTAGTTGATAgtgaaaaaattgaagaagcTACTGAGGAAATAATTAAACCGGCTGGTGCAGAAATAACAGAAGCAGCTGGTGATATAAATGAACCAGCTGTTAAAGATAAGACTGAATCAGTTGAtagtgaaaaaacaaaacaagataTTGAGGAAATAATTAAACCTTCTGGTGCAGAAATAACACAAGCAATTGGTGATATAAATGAACCAGCTGTTAAAGATAAGACTGAATCAATTGATAGTAAAAAAACGAAAGAAGCGATTGACGAAATAACTGAAACATCTGATGCAGAAGTAACAGAGGCAACTGGTGATATAAATGAACCAGCTGTTAAAGATAAGACCGAATCAGTTGAtagtgaaaaaattaaagaagctATTGAGGAAATAATTGAACCTTCTGGTGCAAAAATAATACAAGCAGCTGGTGATATAATTGAACCAGCTGTTAAAAATAAGACTGAATCAGTTGATAGTAAAAAAACGAAAGAAGCGATTGAGGGAATAACTGAAACATCTGGTGCAGAAGTAACAGAAGCAACTGGTGATATAAATGAACCAGCTGTTAAAGATAAGAATGAATCAGTCGATAGTGAAAAAACGAAACAAGATATTGAGGAAATAATTGAACCTTCTGATGCAGAAATAACAGAAGGTGATATAACTGAACCAGCTTTTAAAGATAAAACTGAATCAGTTGATAGTGATAAAACGAAAGAAGCTATTGAGGATATAACTGAACCGTCTGGTACAGAAGTAACAGAAGCAACTGGTCTTATAATTGAACCAGctgttaaatataaaattgaattagcTACAGAAGAAATATCTCAACGAGCAGTTGTAGATGTAGATAAGCCAAAATATGAAGAACTAAATGAATCCAGTATGCAATTAATAAATGAAACAAACGATGTGGATTTTACTACTACTGTTGTGAAGGAAAACCTGGACGAAAAAACTGAACCAGTCGTAGAGGAAATTATGAAGTTATCTGTTAAAACATCACCTGATATCACTGCAACGAACACATTAGAGCCAACAGACACAATTCTTCTGACGCACAATACTGAATCTTTGGAGATTGCTTCTAATCTTGATACA
TTAGTGTCAAAAGATGTCCAGACGTCTGAAGTCTCAGAGAATAAACCTTTATCTCTGAAGGAATCTTCAGAGCAAAAAGGAACATATGGTCCAATAAAAGATGGCCCAAATCCTCAAAGTGTGCCTGTACCAATGAAAAGTGATCAAACTACTTCAGATCCTAATCGGTCTCCAAAGAAGAAACCTCTCCAACAAATTCAAtgtgaaaaatgttttgaatattTCGGTAACAAAGAAAATTTACTGAAACATGATAAACATGTTCATAATACTACCCAGGCAACATTCTGTAATATTTGTAAGAGAAACTTCCGTAATGTTAATGCACTTAGAATGCATAATGctacatttcataaaaataaaaacgagaCCTCCAAAAAAGAcgctaataataaaaagaaatcaataaattgtgaaaaatgtaaaacaacTTTTAAAATTCAGAAGGATTTTGAGATGCATGTAAAATCTTGTAAGAGAAATAGTGAACTTCAATGTGATATATGTTTAGAAATATTTCGGTCCAAGTATAtcttacaaaaacataaaaataatgaccACACCGCCCGTGATCACGAAAATAGGAATATTGATGACAACAAAAGAACAAACAATGgggacaaaaaagaaaagaaaaatctaaaattagaaACAATACCTCAAAATACATCCATTACTACAAGACgatcaaaaagaaatatacaaaatgagGAAGATgacgaaaagaaaaaagatgatGTTGATAGTAATAAACTGAATGAACAAAAGCCAAACCAAACTCCTGAGgacaaatcgaataaaagaaactCCGAAGAATCCAATTCTGCAACGAATGGTGAAATACCAACTCCTAATAATGAATCTCAAGCTCAACAACATTCCCCTATTCAAAGTGGAACATCAAATCATCCGCCCACTCCAAATATAACTTCCCAACAACCGCTTACTAAAATTGAAACATCACAAAGTCCTCCTAACGAATCTCCAAGTAAAAAAGTTGAATATAAATGCGACACCTGTCGTATTCTTTTcgacaatataaaattattggaaGATCATATTCCGGTTTGTcggaatataaataaaaaatgtttgaagtgTGGGGAAACCTTTAAAACAAAACTGTTATTAAAAATGCATTCTTGTATAAAACCAAAACCTATCCAATGTAGATTTTGTCAGAAGAATATTCCGAAACACAAATTTCAAGATCATATAAAAATATGTGCACTACCAAAAGATGATTTGATTTGTAATTATTGTAAACAGTCATTTAGTGGTAAATTTTTGTTACAAAGGCATTTTGTTAGTCACCACAATATTGAAACAGGgatcagaaaaagaaaaagtcttGAAATGAACAATGATgacatcaaaaaaataaaagcagaaCATAAGGAAAGTTCTTCATCAGATGGAAAATTGATTGAAGACATAAATGAAATCAGAAAAGATTCAGTCAAATCCAAAGTTACTACACCAGTAAACGATGATTTAAAGATAACCAAAATGATTATAGACACGGATACGAATAGCAACAAAGGCTACCAATGTAAgaaatgtaatgaaaaattaagtAATCAACGCGATCTTACAAAGCACTATCTTCAAATCCATAGTAATGTTGAAGATATAAAATGCGACGAGTGCTCAAAAACATTTAGAAACCTAACTTTGCTTGATAAACATGTTATCAATGCACACAAAAATGCAATCAaatgtaatatttgtaataaagtttttaaaacactttatttataCAGTAAACATAATCGtctttttcataataataaaagtagtGAAACAAATTCTAAGCCgaatgaaaatcaaattgaaacctCTCCAGAAATTAGTGTAGATAAAGAATCTTTAATAGAAAAATGTCCTTTACCTCGTCAGTACTCTAGATTCCACAATGCGAAGAAACCAGCTGTTGAAAGTTCAGCTAGCattgaaaatgaaagtaaacCAATGTCTAATACAATTCAGTGTGTTGCTGA
AGTACATTGTAGTGGTACGCCTGATGAACCTGATGAAACTATTGTTAAGCCACCTGATCTTATAAAGCCAGGATCAAGTGAAAATAACTCTCCCAAACAAGTTCATACGAATTCCAAATCAGTCAAAGAAAATGAATCATCTATGCTTCAcgaaaatgaagaaagaaagaaaatagtttgtaaaaagtttgaaGCAAGTCCCgataaaaaattgatcactttcataaatttattgaaagaagaaaacaaaatgaatgaaAGACGAAAACTTGGCGATGCTAAAATCAGTTCCGCCACAAGTGTAATCAAAGAAGTTACTCCTAATCCAAATCAGGAATTAAAagcagaaataaataaaagctctaaactgaaaaatatagaaactcCGTCTCTAGAGAAACAACGCGATGCACAATCGAATTCTACACATATTGGTCcatttcagaataaaaaatcaagacTCCTTCCCAAAGTTGATGTCAAACTTAACTTTGttgaacaatttcaaaaaacaataccTTCACAGATCCAGAGTACACAAAAACCGCAGCATCTAGAGCCTGAGCCAACagaagttattttttataaatgtgcTCGATGTGATAAAGAATTCCCGGATAAATTAGAACTCAGACAACACATTTCTGATAAACATATTGCTTTACCCCCATTGGAAGAAGTGTTtagaaaagagaaaaagttcATTCGAAAAGTTTCTAAAGATACGTATGTTCTAGTGCAAAAAGTCGGTCAGCCAAAGGAAAAAGTTTACTATTCATTAAATATACCTGAACTTAAATGTTGTCAATGCGAAGCTAAATTTACATCTCAGAAGAGCTTAGAATTGCACACGAAAACGTCTCACATTAATACCAAATTTCCgccaattaatttaaaatcaaaatcagcATCGACCAGTAATATCAACGTTAATATTTCATCCATTCAAGGTTTGGAGTCTACAAATACTCCTGCAAAACGCAGACATTCCGAAACGGCCATTATTTCTTCCGCAACTGCTGAGCCAAGTACGCTTATTCCAAGACATCCTTCACCTATCACGATTCCTACAACATCTGAAAATATTAATACGCCTATCCAAAAAAATTCTGACGCTACCAACATGCCCACAACTTCCGAAAGTTCGAGTACGTCAATCTTATCGAATGTCTTACAGTATGGCAGTAACTTTGCAGACGGCGATCCTCTAGCGCCTGTCGTTTCCGATTTCACCCAGGATGAAAATAACTCAAGTTGTGACATTTTCAACCCTAACGATATTCTGAATTccaataattttgtcaaaactgATACCAATACCAGTGCAACGGATAGTGCAATAAACGGTCCTCTTGCCGATTTGCAGTGCTTCAGATGTGGGGTTAAGTTTAATAATGGACCTGAACGACATTACTGTACTGTATTTAAATGCGATAAGTGTTTCCAAGCATTTGTGTCTAAAGAATTGCTCAATGCCCATTATTTACTCAAGCACGACCTCTACCTAGTCCCCAAgcatgttaaaaatataaatgaattcactaatattaaaaatctgaaTTGTAAATTTTGTTCAGCCATATTCACCAAAAATAAAGATCTAAATAAACATTATAAAGAATTCCATAAATACAATCCCAGCCAAATGAAGAGTTCGGTgatcaaaaattataaagacGTATCGTCGTTCATTCACTGTACAATGTgtaatgatattttcaattctaaCTTTGAGCTTCAGAAACATTATCTGGACTATCATAACTACGATAGTTCGGTCAAAACATTTGAAATGACCCCCAGCACTAGCACTGAAAAGAGACTAGAGGAAACTAATAGTAATACCGTACAAACGATTTCCAATACAGGCCTTCCTCAGACTAATATTAAGTATAGCATTGTCTCTAAAAGAGGGAGAAAACGAAAGAGTGCACAAGATGACACGATGAAGAATATCAATAGTTtaaattcaatcaatagtgttttgaatgatGCACTGG
GATACACCGTCAGTAGTACCgtactaaataatattttacctaaTCAAAATACAATTACGAGTGACATTATTATCAAAAATCCGGCCATACCTCCTCTAACATCAACTAGTGCTGCCATATTGCCCACTGCCGTTACAGTACCCAGTTTAGTCCCTACTAATACTCAGATATTACCTGATTTAGTTTTCACAAATACAGCCACAATAACTAAGAAAGTGCCAACTAATGATGTTAATGACATTCCACGTTTAATTTCATTTAGTAATCCAATTGTAGGATCGCTTCCTAGTATCAAGCAATCACAACTTGATCCAGTACCTGAAGAAGACGTACCCTCTGAGAACATTGTAAATCAGACCAATCTCGTAAAAGACAGCCTGTCCTCAAATGATCTCGCAGATAATTCAAACACCGATCTGAATAATTTGTTTGTGTCACATCCCGGCACTTCGTCCACAGTTGATTTATCTGTGATTCAACCCGATTCTTTTCAAATGCCTTCAGACCAACTTGTCCATCCTCAGTTTTCTTGTGACTTTTGTCCATTGACCTTTGTCTCAGAGTCTGCTCTCTCATTACATTTATTAAGAGATCACAAAACACCAAGTTCAGAAGCCGGTGGCGGATTTATCAAATGTGACTTTTGTTCTTTTgagtttgaaaataaagataagCTGAAAACGCACATGCAAGACATGCACAAAGATTCTCAAATATGTATTCAATGCAATCGTAAGTTCTCAAATAAATATAACTTGAAACGCCATGTTACTGTCACCCATGATGCGAGGTATAATTGCACAATGTGTGCCGAAGTGTTCTTTAGAAAAGCAGACTACACAAAACATTTAGCCGTCGTGCATTCTGCATCAATGAGAGATAATAATATTGGTCCTACGCCAGCCAATCACCACCAAATTCCTACGGATCATATGTGTAAAGTTTGCTGCAAATACCTTTCGAGTAAATACCATTTGAAGAGACATATTGTTAAATTTCATGCAAACGTCACATCCGTTCTTAATAGTTGTAAACAATGTATGAAAGGAGCAAATGGTATCAACAGATTCTATTCTCATCTCGTATATGAACATAAGGTCAAGTCAAGTATTGTAGAAGGCATGTCTTCAGCCGCAGTCGTCTGTAATGAATGTAGAAGAATATTTGTCAATAAGAAAATGATtaagatacatttgaaaaagaaacatgATTTTAAACCTCATAGATGTGCCATTTGCTTTAGAGGCTATgcctttaaaaaacatattctaAGACATATGGAAATACATGACAGTCTTAAAGACTTTCTTTCGTTTTCTGCACCCCAAGACGATATGAGTGAATGGGGAGATTATGGTGACTTTACAACTGATGAAAATGGAGTTGTCTATCAGATGGGCCAAGTTGCTTATGAACATGCTCAGAGTGTAGTTCAAGAATCACAAAACGTTCAAACATCATGTATGTCAACACTAATTGACGACAGTTTTGACGAATCTCTACACCTGCTGAGTAATGAAAACTCCAAACAAATATctcagaaaaaaacaaaaaattatagaaCACAAGTTATAAAAAGTACCAACAAACAAACAATGGACACCAAGTTCCATGAAATAGAAACAACAAGTCAGTCTCTAGAAGAAATACAAAAAGAGAGAGAACGACAAATGTTAGCACAAATGGAAGAAAATATGAAGGAAATCAGCAGCAACAACCAAACAACTCCTACAGTAACCAACTATGATCCAACACTGTACCATAttaacaatactgaaaacaatTTTCAATCTTTACAACCAATCATTCAAACAGTTGGTAATGAGGAATCTATGCAATTTGTAAACGAAACCAATATGGAGATCAGTGATCCAAACTTGAATCAGAATCAAACGTTACATATTATAAACAATCCAGATGAATCTAATTATGATCTACAATCCGAAATGCAAACCAACCATGAATATCAACAAATTGACGAATCCCAACACTTGGCAGTc
atgcaaaatttaaatcaaCACCAATTTCAGCAAATCCCAGAACAAGAACAAGTATGTAAGACATTGATATTACAAAGTGCAGATATAAGCAATCCTTATGAAACGGATTATAATGAAACCATTGACCCATCTTCGTATCAAGCACAGGTTATCAGTGATTCCACAACGTACTTGACACCAAACCAAATGGCGTACGTTCCACAAATGATGGATACTGACATGTCTACAGAAATGAATTTATCAAATATGGGAGACTCAGTTGTGTTACAGCCAATGGAATCCTCTTATGAAACAATGCAACATGCCATGCCAACAGCATACGGTATGGGATCCCAAGGAGCAACAACATTCGATATGGGAAACCAAGGGACAGCAACCTTTGAAATTGGAAATGATGGAACAACGACATATGAAATGAGTACCCAAGGTTCTAATAATGGTGCCCCAATTTTTGTCTTATTACTGAAACCTCCCGATCATCTGCCATCTTGA
Protein Sequence
MKKGVGDTNPTSPHLTNIIQDNVQSSLSPKIYQNSTPQHVLHTSPHGVSPMHQEFSGLQQVRHIYPEISIIREQTNLPSSRGKPLIHRTKPKSGVVDSNPNIHEQMNANVSSIRNDSIRAMNKDVRSIPQSSVTVNQTNATPNMLGQDRNDPHLGNSFSGGKSKLPEFSQIKQRKVSPSLSSTPQHDTNYPGGDSVIHLLDNNIMAEQNLIEFEKSFNDVIKGNLNILESSIIASDGQEYTSSNFYRNTSQCNVDPNLQRSPHLSSPHVHNSPHLQNSPLQSSPHLIVESSSSFNSPDSHQSLTSSDYYPSKDSDSLESGSVITNSISGMMLGHEDRSKHIEPGQIRNRLFEEIMNEKIHIETNIAAMSFTPLEECLLQGTFNLPTDIPSALEKSISLDIQRETMKANTEYDDKKIGSKNSAESSLENNQKKIIQVRSDLFHHNKTTKTDSLPSKTQNKVKLVRSQTPQNIATKNKGQNINVDKNSNTIQNTTSLLASKKSNRKLNNLNLNQPNKMLELIKSASNIKIRSNAKTQSINKPVPSYKKIPPNIPNNSGNRSVASTPLKVKEQATPLMGTIQSVKNSSLNIIGSSTTPSPQPNLHRKEHGKIEGKECTIKNVKQISTNQEKSTINYSTLQSNSDLKRDNLSGDKGKTVNEVISFKNVIIKNNSYNKVDRNTTDININKNIQTKEVNSTVISQNVHKGEQECSRICQTNSLPVKTLSYCDKPEVISRLKLTGKTADPYSFDSESDKLPQRKVSDDISDDQTIVTCPKQQNIEQDKNKNINDQETSGFKIKVINKTDLSLILHKEDTACLNEERSAKKSLEKMKESKQQNTIDQLASPEEILRQSVSSNPPVESHSETRDLSSTKKHLSYKEDSPSLMTCSMGGNPTRLKLKYVKNSTSSESNSDKGSDSPNMDSSERMKTLRIKIPSYSVEKDHSPKRSNKSSPLPKDVKSDHRVQEKLIINLKTNQITRTKEDVINNNDSEDNLPTAIDSNNTSIIQGPNLALSPNVLKEKRKSFRKIDEICQNLRRKSHEKEGTNIDNVSNDCAKEGLEIEKLVDSSIDSGDITQRKVNNLNTEKLSDLKISDSVNREKGIEKSKSFVVKFRRMSRSGENIIVKQIVDRKNVTDETDDEYQTEREDNTDTENDLETGNLNTAIVRHAENDNSHNTKTDFKLKIRIGSEIMSEQNVNKKKDKKKKKSKKHKDKSKKSRHRSGVSEEIDQAHASKSDDHVFIKPLILKRKNDVSLFTLSQNDTNSSKDNNTESEENTNLSDRLCLLKQKDSSSDEMNVSNEITEELELKQPIPDRLKEESLKICSKSNKRKSKQSVQEHHESEPSSHLLGQIVSSENTSLKEHKNDIKSECDAYAEKISSLLTETQVCESCSKSFENEFEYNKHKVLIHNYNAFLCHICYVSFVEKWKLDIHLMSAEHMAMINSGSDQVTDVITTSRTTRRSKNNLGNETLSPKHENKKKSNKTKIKEEDKSNIDVASSSLTVSTDSKNLKDDKDNVKNQILQSMDSNIVESKESTGQMDLKTIKNSEMSLSAETSLLKNNTNVCNEDNFTLNSENQALVNTHSKDNFDEIERRDDLKSPNKEKNTSKEINKTKERKNDFEKYLAIEDSCSKDSDASQSVPALNTVISSSLDSKSAINNDQISEHESGISTCSSTSEIDCSSFSQSDNLLSPDSVGEKSKATELQHPNSLLIKNRVIESVTENILPEQNTSLIVDLEENIPSCIIRNRKLNKKTNEKVQLDSGRKAAYKDELKTEIIKSKNEEQKLIDLNVTTLSKTTRPVPKLRPLPGLIKITPEVLPIIPNLQIDGLDRIRKIQQTVHSKTIDILEIGKVQEFQTTDKEIVVPKKSLKKKRDCCESESDKNEKEIMNFKAYPQKNDQVCHADGNLNEHINSDDGNNERIHEVTETCDKKVSKNKQKVNGDVEINIIGKDEIADRPCRVLRKKNIPPKSETIVKEVTENLIEPITVINEKVSLLFDKIETENT
SEDLKVKGDVLINECTRLRRSTYKKSSVLYERRKIPRSTSKNKKKNVNYDENIVDKILENNNDHSELIDEINIPGARISTESEINELCINPESLNKNIKSKETLSVCNKITTCEEKISSSSQTNDESSTTVINSETMSETETETLNQKRSADSVSKLKETNESFDEDDIPLDIRQISKAKSHENLISSQTEIDFKENDEDDIPLEIRKTNLSKSVENLNSLETVCSSKTELIKTINYINNDENEALANKKMCRAKSHENVSSKTHSEIEKKSSTVFLSNKTPIEQNIRLRKPISMDKNSVDDKNMKDCSTTRSLIENKKNRKIKTHECLSSLKDADKLTDIKGVIQEDLSSERRKSLRVKTREHISSAEISNKTTLEEVCLTDKKTSKKQKTIQQTDETNMLEKVSVPSEVRKSLRIKVIDQTVSNLENSPQKALPGILNEATFNSRTRLRGKSQDNFSSSLVSLEFPNPNSNQILEKDGTKISNQDLITKENGSHISIVQVLEHETIEVNTEKKYDINCRDQENVSIDIEDKMSSLVTEKSERHLEAKVATVETSIQDTQKDQILIENSKTAHWNELNLSSPTKNFKKRERNKKYLVKKQLEELSQIAAQNSLLENNKKELIEKQNSVPCDIKKNRSRRKTLEECTNHKNEDDTIPESPNETVNITENNLPMRKSLRNRKIKKFPDEEIEIFNGKRRNQKKINNNNKVTPLNTMLDFEKVENEPKLTRNRSHDSLRSIDNPHVEHEKDRHAVSKTMEVEHNQVSTLNETMKNKKYANKSDDLTDDLTNKSQIDHKRNIKHVNNDLEKEGQQSVHNHITSDINLQSDSITIFKESSVVKNNKNGKSKLRLSGSTDKSIHKETITELTKFLPSPEKFEKQNLLTDKLIGLQESSQAPSKDCVQNKENDLEEVVEQGEETSEQFDDDLENKLKEFLKKPILDSAVSIIKPIVHSSENIYKETNKPFNNKLDKIELIKQNIDPKITIKTLHAIEPTKPESSQTETSVGMVAKPIEDIGKEKTGRIKWKAETTEKNIDDLESNTNNASKDFELEFIQAKRAEGSSPINLLSQGKFSRIESKAEIVNKLTDLIVEKGTSPTEDTTESILGFKDKKIHRKSSPMEDVFVDKVSIPTETGTEISVDHIETVREEGSSLFELKADFLVESTETIVGLVSNPIDTKIEIVENTETTNKASRSYESKLKNCIEHTTTAVDGVSSLSKVITAIVGKVSETVVNEVSNPISFESQTETMVDIMTESTEIVSEVSEQIVTIDEVKEAVDHITKKVSERIEVLKSPQTIISKVTENVDNFTEVLETFEIIDEASENHKVITEEYKIIEEGHGIDESDNVGIISDLSKEIEITTENISQTINTVIEVTGIVNTFTEVTETIKQMTEEVSEFIENCDEISKSAEKVIKPVAERITHASVEELFDEIIEIVVEKTEQTAEHIPESVDGEITEHCAEEISELSAEEIHESAVEDIYELSAEEIHESAVEEISEKSTVEITELSAEKIHESAVEAKSEQSAGEIHESVVVEISEKSTEEITKPAAEEIKELTAEEIHEQAVKDISEQSAEEIHESAVVEISEQSAEKIHESAVEAISEQSAEEIHETAVVEISEKSTEEITEPAAEQINELSAEEMHGPAVEEMSEQSAERIHESAVENISEQSAEKIHESAVLEIYEKSSVEITELSAEKIRESAVEAISEQSTEEIHESVVVEISEKSTEEITKPAAEQISELTAEEIHEQAVEEISEQSIEKIHESAVEDISEQSAEEIHESLVEEISEKSTEEITKSAAEQINQLTAEEIHEQAVEEMSEQSAEEIHESAVEDISDQSAEEIHESAVVEISEQSAEKMHESAVEDIYEQSAEEIHESAVEEISEKSTVEITELFSEKIHESAVEAISEQSAEEIHESAVVEISEKCTEEITEPAAEQINELPAEEIHALAVEEMSEQSAERIHESAVEDITEQSAEELQESAVEKISEKSTVEITELSSEEIHES
AVEAISEQSAEEIHESAVVEIYEKSTEEITVPAVDKITELSAEKIHESAVEEISEQSVEKIHELAVEEIFEQAVEEMHESAVEAIYETPAEEITEPAIEEISEQSAGEITEQHVEKITESASEDMSAQSSEAITESAVEKMSEQPSEQTTKPAVEEMSEQSAAEITEQPSEQITEPAVEEMSEQFASEITEQPSDEITEHVVGGIYEATVGEISEPIVEGIAELTAEKLTEPIVEEIFELSIEEMHQSAIEAIPEKPAEEITVSAVEEISKQPAKHITESSIEEITELSAGEIHEPTVEESAAEITEQPAAEITKSAVEEIFKQSAEEITAQAFEGRSEQTVAEITESAVEDMSEKSAETIIESAVEKIFQQTSVEKTELSVEEISEKAEKITEQPATGITKSAGEEFFKHSAEEKTESTVVEMSEEITVLADEKLQKRSESVAEITAKGITESAVEEISEPFDEIESAVEEMSELYAEEITEPVVEGMYVAVGEITEPIVDGISETTAEKIEPIDEGISETTAEKITEPIVEEIFEPPIEEIHELAEQPAEHITVLAVEGLTEPTTEEISKSIVEEISKKSAEEITESVVDAIYEKHPGKITETAIEEIHYSAVEEISVHSAEEITVSAVEEIALQSTEVITEPAFDKVSQQSAEVITVSAVEKISDQSTKEIFEETLGKMARPAIEEILEQSAKEISEPTVGERAKPSVEEMPEQSPEEISVGEIVKPAVEEIPEQSSEETVGQITEPKVENIYELTALEITEPIVGGISEPTSGEITKSLVAEIAKKRCEITETYNEEISEPTAGEISESIVEEIYKKSVKEITESAVDAISEKHPGEITETAIEEMHDSAVEEISEQSAEEITVSAVEEISIQSTEVITVSAVEKILDQSTKGIFEETVGKIDKPAIEESIKKTQQLEQSTKEISEPTVGEILKPAVEEIPEQTTEETFGQITEPKVENIIELTALEITEPTAEGISELTSEDITEPIADEIFEPTTQEITEPIVGGISEPTAEEITESLVAEIAKKCCEITETYNEEISEAADDGNNKSSPREITGHCGEITNKPSAGVANEKNGDEIIEVGEVIIPPFEVKTEPSDYEIFEQGDDEITEIPDNDIIESANDEIIEEMSEPTAQEITEQTAQEITELIALEITKSVDSEITEPIFKEISEPTAEEITEQTAQEISEQCDRITKQSAGEIIEQSGNVLEAADDEKTKSSSEEITEQCDGITKPSGGEIPEQNGDEIIEIDDCDIIEPAGEIIIPPFEVKTEPSDEEITETFCDEITDAADDEKTRSLDEEINEQRDEIKQSSGEIPEQNGDEIIEMDDFDLIEAPGEGIIPPFEVKTEPTDEEIFEQTGDEITEAVDDEIIESADEDIKLHLEDKTKKFAEDEISELSVEKTEKCDEITKPFTEDVTKLAADDISETAGEKTSKPEISEPTAQEIFGSVDEDKSETAIELTSEPPAEELTETSNDDKIKLIDKNFKPHDKDKTESGGDKRAKSSLDKSTELGVEIISVKSVSTIYESTETIIEKCDEITEPFTEEITEAAHISNPADQDNKSPFKDKIKTAAEMSESVGNEAKQLTDKITETTVDSEKIKEAIEKIIESSDEEIGDIIEPALNDKPELVDSEKIEEATEEIIKPAGAEITEAAGDINEPAVKDKTESVDSEKTKQDIEEIIKPSGAEITQAIGDINEPAVKDKTESIDSKKTKEAIDEITETSDAEVTEATGDINEPAVKDKTESVDSEKIKEAIEEIIEPSGAKIIQAAGDIIEPAVKNKTESVDSKKTKEAIEGITETSGAEVTEATGDINEPAVKDKNESVDSEKTKQDIEEIIEPSDAEITEGDITEPAFKDKTESVDSDKTKEAIEDITEPSGTEVTEATGLIIEPAVKYKIELATEEISQRAVVDVDKPKYEELNESSMQLINETNDVDFTTTVVKENLDEKTEPVVEEIMKLSVKTSPDITATNTLEPTDTILLTHNTESLEIASNLDT
LVSKDVQTSEVSENKPLSLKESSEQKGTYGPIKDGPNPQSVPVPMKSDQTTSDPNRSPKKKPLQQIQCEKCFEYFGNKENLLKHDKHVHNTTQATFCNICKRNFRNVNALRMHNATFHKNKNETSKKDANNKKKSINCEKCKTTFKIQKDFEMHVKSCKRNSELQCDICLEIFRSKYILQKHKNNDHTARDHENRNIDDNKRTNNGDKKEKKNLKLETIPQNTSITTRRSKRNIQNEEDDEKKKDDVDSNKLNEQKPNQTPEDKSNKRNSEESNSATNGEIPTPNNESQAQQHSPIQSGTSNHPPTPNITSQQPLTKIETSQSPPNESPSKKVEYKCDTCRILFDNIKLLEDHIPVCRNINKKCLKCGETFKTKLLLKMHSCIKPKPIQCRFCQKNIPKHKFQDHIKICALPKDDLICNYCKQSFSGKFLLQRHFVSHHNIETGIRKRKSLEMNNDDIKKIKAEHKESSSSDGKLIEDINEIRKDSVKSKVTTPVNDDLKITKMIIDTDTNSNKGYQCKKCNEKLSNQRDLTKHYLQIHSNVEDIKCDECSKTFRNLTLLDKHVINAHKNAIKCNICNKVFKTLYLYSKHNRLFHNNKSSETNSKPNENQIETSPEISVDKESLIEKCPLPRQYSRFHNAKKPAVESSASIENESKPMSNTIQCVAEVHCSGTPDEPDETIVKPPDLIKPGSSENNSPKQVHTNSKSVKENESSMLHENEERKKIVCKKFEASPDKKLITFINLLKEENKMNERRKLGDAKISSATSVIKEVTPNPNQELKAEINKSSKLKNIETPSLEKQRDAQSNSTHIGPFQNKKSRLLPKVDVKLNFVEQFQKTIPSQIQSTQKPQHLEPEPTEVIFYKCARCDKEFPDKLELRQHISDKHIALPPLEEVFRKEKKFIRKVSKDTYVLVQKVGQPKEKVYYSLNIPELKCCQCEAKFTSQKSLELHTKTSHINTKFPPINLKSKSASTSNINVNISSIQGLESTNTPAKRRHSETAIISSATAEPSTLIPRHPSPITIPTTSENINTPIQKNSDATNMPTTSESSSTSILSNVLQYGSNFADGDPLAPVVSDFTQDENNSSCDIFNPNDILNSNNFVKTDTNTSATDSAINGPLADLQCFRCGVKFNNGPERHYCTVFKCDKCFQAFVSKELLNAHYLLKHDLYLVPKHVKNINEFTNIKNLNCKFCSAIFTKNKDLNKHYKEFHKYNPSQMKSSVIKNYKDVSSFIHCTMCNDIFNSNFELQKHYLDYHNYDSSVKTFEMTPSTSTEKRLEETNSNTVQTISNTGLPQTNIKYSIVSKRGRKRKSAQDDTMKNINSLNSINSVLNDALGYTVSSTVLNNILPNQNTITSDIIIKNPAIPPLTSTSAAILPTAVTVPSLVPTNTQILPDLVFTNTATITKKVPTNDVNDIPRLISFSNPIVGSLPSIKQSQLDPVPEEDVPSENIVNQTNLVKDSLSSNDLADNSNTDLNNLFVSHPGTSSTVDLSVIQPDSFQMPSDQLVHPQFSCDFCPLTFVSESALSLHLLRDHKTPSSEAGGGFIKCDFCSFEFENKDKLKTHMQDMHKDSQICIQCNRKFSNKYNLKRHVTVTHDARYNCTMCAEVFFRKADYTKHLAVVHSASMRDNNIGPTPANHHQIPTDHMCKVCCKYLSSKYHLKRHIVKFHANVTSVLNSCKQCMKGANGINRFYSHLVYEHKVKSSIVEGMSSAAVVCNECRRIFVNKKMIKIHLKKKHDFKPHRCAICFRGYAFKKHILRHMEIHDSLKDFLSFSAPQDDMSEWGDYGDFTTDENGVVYQMGQVAYEHAQSVVQESQNVQTSCMSTLIDDSFDESLHLLSNENSKQISQKKTKNYRTQVIKSTNKQTMDTKFHEIETTSQSLEEIQKERERQMLAQMEENMKEISSNNQTTPTVTNYDPTLYHINNTENNFQSLQPIIQTVGNEESMQFVNETNMEISDPNLNQNQTLHIINNPDESNYDLQSEMQTNHEYQQIDESQHLAV
MQNLNQHQFQQIPEQEQVCKTLILQSADISNPYETDYNETIDPSSYQAQVISDSTTYLTPNQMAYVPQMMDTDMSTEMNLSNMGDSVVLQPMESSYETMQHAMPTAYGMGSQGATTFDMGNQGTATFEIGNDGTTTYEMSTQGSNNGAPIFVLLLKPPDHLPS
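As a consistency check, the coding sequence can be translated and compared with the protein sequence. The sketch below assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly. The helper name `translate` is hypothetical. Lowercase bases and line breaks in the coding sequence are normalized before translation.

```python
from itertools import product

# Standard genetic code (assumed), listed in T, C, A, G order per codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is removed and case is ignored. Codons containing
    characters outside ACGT are rendered as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 and last 15 bases of the coding sequence in this record.
print(translate("ATGAAGAAAGGTGTTGGGGATACA"))
print(translate("CATCTGCCATCTTGA"))
```

Translating the full coding sequence and comparing the result with the protein sequence, after joining the wrapped lines of each, would confirm that the two fields agree.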

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-