Basic Information

Gene Symbol
Wars2
Assembly
GCA_958510855.1
Location
OY294036.1:105593138-105608707[-]

Transcription Factor Domain

TF Family
HSF
Domain
HSF_DNA-bind domain
PFAM
PF00447
TF Group
Helix-turn-helix
Description
Heat shock factor (HSF) is a transcriptional activator of heat shock genes [1, 4]: it binds specifically to heat shock promoter elements, which are palindromic sequences rich with repetitive purine and pyrimidine motifs [1]. Under normal conditions, HSF is a homo-trimeric cytoplasmic protein, but heat shock activation results in relocalisation to the nucleus [2]. Each HSF monomer contains one C-terminal and three N-terminal leucine zipper repeats [3]. Point mutations in these regions result in disruption of cellular localisation, rendering the protein constitutively nuclear [2]. Two sequences flanking the N-terminal zippers fit the consensus of a bi- partite nuclear localisation signal (NLS). Interaction between the N- and C-terminal zippers may result in a structure that masks the NLS sequences: following activation of HSF, these may then be unmasked, resulting in relocalisation of the protein to the nucleus [3]. The DNA-binding component of HSF lies to the N terminus of the first NLS region, and is referred to as the HSF domain.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 20 0.88 1.6e+04 -2.6 0.0 51 64 761 778 759 805 0.70
2 20 0.0035 63 5.1 0.0 47 65 895 913 877 950 0.72
3 20 0.0028 50 5.5 0.0 42 64 954 978 938 1015 0.72
4 20 0.004 72 5.0 0.1 47 64 1023 1042 1003 1072 0.77
5 20 0.005 90 4.7 0.0 47 64 1087 1106 1068 1133 0.74
6 20 0.0063 1.1e+02 4.3 0.0 49 65 1153 1169 1141 1192 0.80
7 20 0.00075 14 7.3 0.0 33 64 1201 1234 1191 1268 0.76
8 20 0.0036 66 5.1 0.0 42 65 1274 1297 1255 1330 0.72
9 20 0.0045 81 4.8 0.0 45 65 1337 1357 1318 1391 0.74
10 20 0.0034 61 5.2 0.0 37 65 1393 1421 1382 1451 0.72
11 20 0.005 91 4.6 0.0 46 65 1462 1481 1445 1512 0.75
12 20 0.0031 57 5.3 0.0 37 65 1517 1545 1506 1578 0.71
13 20 0.0057 1e+02 4.5 0.0 46 65 1586 1605 1569 1624 0.73
14 20 0.0038 69 5.0 0.0 37 65 1641 1669 1630 1696 0.75
15 20 0.0025 45 5.6 0.0 35 64 1699 1730 1692 1761 0.71
16 20 0.0045 82 4.8 0.0 49 65 1777 1793 1757 1827 0.76
17 20 0.0057 1e+02 4.5 0.0 47 65 1839 1857 1821 1891 0.76
18 20 0.0067 1.2e+02 4.2 0.0 49 64 1905 1922 1883 1958 0.77
19 20 0.0039 71 5.0 0.1 47 64 1967 1986 1946 2021 0.77
20 20 0.002 37 5.9 0.0 47 64 2031 2054 2008 2105 0.71

Sequence Information

Coding Sequence
ATGTATTACTTGATTCAATCGAGCAATGTTCCTGATAAAGTGTTTTCTGGAATTCAACCAACGGGTAGCGTGCACCTGGGAAATTATTTAGGTGCCATAAATCAATgggtaaaattacaaaattctgACCAGGACGTTATTCTTAGTATTGTCGATTTGCATGCTATGACGTTACCTTTtGATCCCAAAGAACTATCTAGGAGTATTTTGGAAGTAACGGCCACTTTACTGGCTTGCGGTGTCGATCCAAATAAAACAACTTTATTTCAACAATCAAATGTATATCAGCACACCGAATTGTGGCGATACTTAAGAACAGTTTCCACTATGGCAAGGCTGGCTCATTtaccacaatttaaaacaaaatcttCTACTTTACAAGATGTACCCCTAGCACTATATTTCTATCCCGTATTGCAAAGTGCCGACATTTTGTTGTACAAGGCTACTCATGTACCGGTCGGCGAGGATCAATTACAACACATACAGTTGTGTCGAGAATTAGCTAGAATGTTTAACAAGAGATATGGTCAAACGTTTCCAATTCCTCAAGCTATTTTATCTGatgacaccgtttcaaaaataaagagTTTAAGAGATCCCCTTAAAAAAATGTCCAAGTCTGATTCTAATTTTGACAGTAGAATATCCTTAACAGACAGTCCCGACGAAATAGTACATAAAATTGAAGGTGCCGTTTGTGGTAATAACTCCAATatGACTCTCCAGCTAGAAGGCAAACCCGAACTTTCAAATTTGGTATTTATCCATTCCATGTTATCAGAAAAAGGTGTCGAAAGTATTATTAAGGAATACGAGGGATTAGATATTTTACGGTATAAGTCATTATTGGCCGAATGTATTATAGAACATCTGAGTCCAATTCGTACTAAGTTGTTAAATTTCATTGCCAATCCTGAATATTTAATTAGTGTCTTGGAGGATGGAGCTAAGAAAGCTTCAAAAGTTGCTGCAATAACTGTACGGGAGGTTCATTCCAAATTAGGTGTCAACCACAgcaatatttgttttaataatagtcaaaatattttaaaggcGAACGATGCTAATCTCAAACCaggagaaaaaatatttttaggacTTGAGGCTACAGGAAATATCCATTTGGGAAATTACTTAAATACTATTAAGGAATGTGTAAAGTTACAAAATTTGGGCAAAAATGTTATGCTTAGTATCGAAAATTCTACGACACCTTGTGAACCTAAAgtacaaaagaaaaatattttggaagcaaCCGCAACTTTATTGGCTTGTGGTATCAACCCCAACAAATCGGTTTTGTTTCAACAGTCACATATTTATCAccacaacgatttgtgttggcACACAACTTGCATTGCAACAATGGAAAGAGCCACCCACGTACCAGCAGGAGGTTACCCAACGGAACATCTACGGTTGGCTCAAGAAATAGCAAAATTGTTTAACCATAAATACGGCTTAAAGTTCCCGATACCACAGCTGGTTGGGGGCGATGgtgCAGGCATTTCTAAAATACTTAGTTTGAAGGATCCGATGAAAGAAATGCACAACTCTGATTCTAATCctaaaagtataatttatttaacagatACTCCTGACGTTatcgttaaaaaaattaaaggagCCGTGAGTGATTTTAATTCCACGATAACTTTTGAGCCAAGCAGCAGGCCTGGAATATCAAATTTGGTTCGTATTCATTCTTTATTAACAGAAAAAAGCGTGGAAGATATTTGTCGAGAATGTGAAGGATTAGATACTTTACGgtATAAGTTACAATTAGCTGAATGTGTTATCGAACACCTTACTCCAATTCGTACAAAATTGATAGATTTGCGTGCCAATCCTGATCATTTATTAAAAGTGTTGGAAAATGGAGCAGAAAAAGCTTCAAAAGTTGCCCAAGTTACAGCCAACgaagtttgttttaatttaggTTTACAACTTGGGAATATAagcaACTTAAATAGAAgcaaaaaacaaattaatttcggTTCTGATAACGAAAGACAAGGCTACTTTTATGATAAACCATCTATTTCGTTTGAGCTACCAACACGAAAGCCAACATCAACTCCGTTTGCAGCGGCCACCACAATAGCACCAGTTCCAACGGGATACAGTTATCCAAAACCAAGCATTGCATTTGAAGAAAATCCAAAACCAGTAGGAATAAGACATGAACAAATATCTTCTACATTGTTTGGAAGCGGTGGTGGATCGtcaatttcttcaaaaaccCCATCGATCAATATAGGCGGAGGTGGACAAACTTCAGTCACCACTACCTTTAATAAATATGAATTTAATAAACCAAGGGTGGACATTGAAGTACAAAAGCCATCATCTTTCGGCATTTTTGGTAATGGAGCATCTACTATTACAACAAACGTAAATAGATACAATACCTTTAAACCGAGTGTTACAACTATCACGGAGAAAACCGCCCCACCATCTACATTTACCTCCAAAATACCTACTGGTGGAATATCCGGTTATGGACAAACTTCGTTTGTATCAAACGTCAATAAAGTTCAATAcactaaaccaaaaattgaaattactactTTCAAACCTGCGCCTCAAGCATCTTTTGGAATTCTTGGTAGTGGTGTACAAAAAACCACTTCTCAGTCTACCTTTGAATCTAAAGTTCCATCTGTAGGAATAAGTGGAGGAGGTCAAACTTCATTTGTGTCTACCGTCAACAAATATGATTTTAAGAAGCCTACAATACAATTTGAGGTACCTAAACCGACCCCTAAACCAGTTGGTCAGTCTTCATTTGGTGTTTTTGGTAGTGGCGTACAAAAAACTACTTCCCAGTCTGCATTTGGTTCAAAAGTTCCATCTGCAGGAATTAGTGGAAGTGGCCAAACTTCATTTGTGTCTACCGTCAACAAATACGATTTTCAAAAGCCTACGGTACAATTTGATATACCTAAACCAACCCCTAAACCAGTTGGTCAGTCTTCATTTGGTGTTTTTGGTAGTGGTGTACAAAAAACTTCTTCACAGGCAACATTCGGTTCTCAATTTCCATCATCAGGAATAAGTGGAAGTGGTCAGTCTTCATTTGTATCAACCGTGAACAAGTACGATTTTAACAAACCCGCAGTACAATTTGATGTACCTAAACCGACATCTAAACCAGTTGGTCAGTCTTCATTTGGTGTTTTTGGTGGTGGTGTTCAAAAAACGACTTCCCAGTCTGCGTTTGGCTCTAAAGTTCCATCTGGAGGAATTAGTGGAAGTGGTCAGACTTCATTCGTGTCTACTGTCAACAAGTACGATTTTAACAAGCCCACAATACAATTTGATGTACCTAAACCAACCCCTAAACCAGTTAGTCAGTCTTCATTTGGTATTTTTGGCAGCAGTGTACAAAAAACGACTTCCCAATCAACCTATGGCTCTAAAGTTCCATCTGCAGGAACAAGTGGAAGTGGCCAAACTTCATTTGTATCTAACGTCAACAAATATGATTTTAACAAACCAACAGTGCAATTTGATATACCCAAACCGACCCCTAAACCAGTTGGTCAGTCTTCATTTGCTACTTTTGGCAGTGGTGTACAAAAAGCCACTTCTCAGTCTGCTTTTGGCTCTAAAGTTCCATCTACAGGGATAAGTGGAAATAGTCAGTCTTCATTTGTATCAACCGTTAACAAGTACGATTTTAACAAGCCTACAATACAATTTGATGTACCTAAACCGACACCTAAACCCGTTGGACAGTCTTCATTTGGAATTCTTGGTAGTAGTGTACAAAAAACGACTTCCCAGACAACCTTTGGCTCTAAAGTTCCATCTGCAGGAGTTAGTGGAAGTGGCCAGACTTCATTCGTATCTACTGTCAACAAGTACGATTTTAACAAGCCCACAGTACAATTTGATGTACCTAAACCAGTTGGTCAGTCTTCATTTGGAATCTTTGGCAGTGGTGTACAAAAAACGACTTCCCAGTCAACCTTTGGCTCTAAAGTTCCATCTACAGGAATTGGTGGAAGTGGCCAGACTTCATTTGTATCGACTGTCAACAAGTACGATTTTAACAAACCCACAGTACAATTTGATATACCTAAACCGACACCAAAGCCAGTTGGTCAGTCTTCATTTGGTGCTTTTGGTGGTGGTGTACAAAAAACTAGTTCCCAGTCTACCTTTGGCTCTAATGTTCCATCTACAGGAATTAGTGGAAATGGCCAGACGTCATTCGTGTCTACTGTCAACAAGTACGATTTTAACAAGCCCACGGTACAATTTGATGTACCTAAACCAGTTGGTCAGTCTTCATTTGGAATCTTTGGCAGTGGTGTACAAAAAACGACTTCCCAGTCAACCTTTGGCTCTAAAGTTCCATCTACAGGAATTGGTGGAAGTGGCCAGACTTCATTTGTATCGACTGTCAACAAGTACGATTTTAACAAACCCACAGTACAATTTGATATACCTAAACCGACACCAAAGCCAGTTGGTCAGTCTTCATTTGGTGCTTTTGGTGGTGGTGTACAAAAAACTAGTTCCCAGTCTACCTTTGGCTCTAATGTTCCATCTACAGGAATTAGTGGAAATGGCCAGACGTCATTCGTGTCTACTGTCAACAAGTACGATTTTAACAAGCCCACGGTACAATTTGATGTACCTAAACCAGTTGGTCAGTCTTCATTTGGAATCTTTGGCAGTGGTGTACAAAAAACGACTTCCCAGTCAACCTTTGGCTCTAAAGTTCCATCTACAGGAATTGGTGGAAGTGGCCAGACTTCATTTGTATCGACTGTCAACAAGTACGATTTTAACAAACCCACAGTACAATTTGATATACCTAAACCGACACCAAAGCCAGTTGGTCAGTCTTCATTTGGTGCTTTTGGTGGTGGTGTACAAAAAACTAGTTCCCAGTCTACCTTTGGCTCTAATGTTCCATCTACAGGAATTAGTGGAAATGGCCAGACGTCATTCGTGTCTACTGTCAACAAGTACGATTTTAACAAGCCCACGGTACAATTTGATGTACCTAAACCAGTTGGTCAGTCTTCATTTGGAATCTTTGGCAGTGGTGTACAAAAAACGACTTCCCAGTCAACCTTTGGCTCTAAAGTTCCATCTACAGGAATTAGTGGAAGTGGCCAGACTTCATTTGTATCGACTGTCAACAAGTACGATTTTAACAAACCCACAGTACAATTTGATGTGCCTAAACCGACATCTAAACCAGTTGGTCACTCTTCATTTGGTAGTTTTGGTAGTGGTGTCCAAAAAACGACTTCCCAGTCTGCATTTGGCTCTAAAGTTCCATCTGCAGGAATAAGTGGAGGAGGTCAAACTTCATTTGTGTCTACCGTCAAcaaatatgattttaaaaagCCCTCAGTGCAATTTGATGTGCCTAAACCGACCCTTAAACCAGTTGGTCAGTCTTCATTTGGTGTATTTGGTGGTGGTGTACAAAAAACGACTTCCCAGTCTTCGTTTGGCTCTAAAGTTCCCTCTGGAGGAGTTAGTGGAAGTGGTCAGACTTCATTCGTCTCTACTGTTAACAAGTACGATTTTAACAAGCCTACAATACAATTTGATGTACCTAAACCGACACCTAAACCAGTTGGGCAGTCTTCATTTGGTATATTGGGTGGTGGTGTACAAAAAACTACTTCCCAGGCAACCTTTGGCTCTAAAGTTCCATCTGCAGGAGTTATTGGAAGCGGTCAGACTTCATTCGTGTCTACTGTCAACAAGTACGATTTTAACAAGCCTACAATACAATTTGATGTACCTAAACCGACACCTAAACCAGTTGGGCAGTCTTCATTTGGCATTTTCGGTAGTGGTGTACAGAAAACATCTTCACAGTCAACATTCGGTTCTCAATTTCCATCTTCAGGAATTAGTGGAAGTGGGCAGTCTTCATTTGTATCAACCGTGAACAAGTACGATTTTAACAAACCCACAGTACAATTTGATGTACCTAAACCGACACCAAAGCCAGTTGGTCAATCTTCATTTGGTGTTTTTGGTGGTGGTGTACAAAAGACGACTTCCCAGTCTGCCTTTGGCTCTACAGTTCCGTCAGCAGGAATTAGTGGAAGTGGCCAAACATCATTCGTGTCTACCGTCAACAAATACGATTTTAAGAAACCCACAATACAATTTGATGTACCTAAACCGACACCTAAACCAGTTGATCAATCATCATTTGGAATTTTTGGAAGTGGTGGATCTTCATTGAATACACAAAAAACAACTTCTACATTTTCGTCAAAGTTTCCATCTACAGGAAACGGTCAAACTGGGGTGTTTTCAAATGTGAACAGATATGAAgcaaataaaccaaaaattgagtTTTCCTCATCCAAACCTACCGGTGGATCATTTGGAATTTACTCTACGCAACGTCCGTCTACTAACGCGCCCGAATATCTACCTCCACTTGGTTCAACATTCCAGAAAGTAAAATATAACGATGacatcaaatatttttga
Protein Sequence
MYYLIQSSNVPDKVFSGIQPTGSVHLGNYLGAINQWVKLQNSDQDVILSIVDLHAMTLPFDPKELSRSILEVTATLLACGVDPNKTTLFQQSNVYQHTELWRYLRTVSTMARLAHLPQFKTKSSTLQDVPLALYFYPVLQSADILLYKATHVPVGEDQLQHIQLCRELARMFNKRYGQTFPIPQAILSDDTVSKIKSLRDPLKKMSKSDSNFDSRISLTDSPDEIVHKIEGAVCGNNSNMTLQLEGKPELSNLVFIHSMLSEKGVESIIKEYEGLDILRYKSLLAECIIEHLSPIRTKLLNFIANPEYLISVLEDGAKKASKVAAITVREVHSKLGVNHSNICFNNSQNILKANDANLKPGEKIFLGLEATGNIHLGNYLNTIKECVKLQNLGKNVMLSIENSTTPCEPKVQKKNILEATATLLACGINPNKSVLFQQSHIYHHNDLCWHTTCIATMERATHVPAGGYPTEHLRLAQEIAKLFNHKYGLKFPIPQLVGGDGAGISKILSLKDPMKEMHNSDSNPKSIIYLTDTPDVIVKKIKGAVSDFNSTITFEPSSRPGISNLVRIHSLLTEKSVEDICRECEGLDTLRYKLQLAECVIEHLTPIRTKLIDLRANPDHLLKVLENGAEKASKVAQVTANEVCFNLGLQLGNISNLNRSKKQINFGSDNERQGYFYDKPSISFELPTRKPTSTPFAAATTIAPVPTGYSYPKPSIAFEENPKPVGIRHEQISSTLFGSGGGSSISSKTPSINIGGGGQTSVTTTFNKYEFNKPRVDIEVQKPSSFGIFGNGASTITTNVNRYNTFKPSVTTITEKTAPPSTFTSKIPTGGISGYGQTSFVSNVNKVQYTKPKIEITTFKPAPQASFGILGSGVQKTTSQSTFESKVPSVGISGGGQTSFVSTVNKYDFKKPTIQFEVPKPTPKPVGQSSFGVFGSGVQKTTSQSAFGSKVPSAGISGSGQTSFVSTVNKYDFQKPTVQFDIPKPTPKPVGQSSFGVFGSGVQKTSSQATFGSQFPSSGISGSGQSSFVSTVNKYDFNKPAVQFDVPKPTSKPVGQSSFGVFGGGVQKTTSQSAFGSKVPSGGISGSGQTSFVSTVNKYDFNKPTIQFDVPKPTPKPVSQSSFGIFGSSVQKTTSQSTYGSKVPSAGTSGSGQTSFVSNVNKYDFNKPTVQFDIPKPTPKPVGQSSFATFGSGVQKATSQSAFGSKVPSTGISGNSQSSFVSTVNKYDFNKPTIQFDVPKPTPKPVGQSSFGILGSSVQKTTSQTTFGSKVPSAGVSGSGQTSFVSTVNKYDFNKPTVQFDVPKPVGQSSFGIFGSGVQKTTSQSTFGSKVPSTGIGGSGQTSFVSTVNKYDFNKPTVQFDIPKPTPKPVGQSSFGAFGGGVQKTSSQSTFGSNVPSTGISGNGQTSFVSTVNKYDFNKPTVQFDVPKPVGQSSFGIFGSGVQKTTSQSTFGSKVPSTGIGGSGQTSFVSTVNKYDFNKPTVQFDIPKPTPKPVGQSSFGAFGGGVQKTSSQSTFGSNVPSTGISGNGQTSFVSTVNKYDFNKPTVQFDVPKPVGQSSFGIFGSGVQKTTSQSTFGSKVPSTGIGGSGQTSFVSTVNKYDFNKPTVQFDIPKPTPKPVGQSSFGAFGGGVQKTSSQSTFGSNVPSTGISGNGQTSFVSTVNKYDFNKPTVQFDVPKPVGQSSFGIFGSGVQKTTSQSTFGSKVPSTGISGSGQTSFVSTVNKYDFNKPTVQFDVPKPTSKPVGHSSFGSFGSGVQKTTSQSAFGSKVPSAGISGGGQTSFVSTVNKYDFKKPSVQFDVPKPTLKPVGQSSFGVFGGGVQKTTSQSSFGSKVPSGGVSGSGQTSFVSTVNKYDFNKPTIQFDVPKPTPKPVGQSSFGILGGGVQKTTSQATFGSKVPSAGVIGSGQTSFVSTVNKYDFNKPTIQFDVPKPTPKPVGQSSFGIFGSGVQKTSSQSTFGSQFPSSGISGSGQSSFVSTVNKYDFNKPTVQFDVPKPTPKPVGQSSFGVFGGGVQKTTSQSAFGSTVPSAGISGSGQTSFVSTVNKYDFKKPTIQFDVPKPTPKPVDQSSFGIFGSGGSSLNTQKTTSTFSSKFPSTGNGQTGVFSNVNRYEANKPKIEFSSSKPTGGSFGIYSTQRPSTNAPEYLPPLGSTFQKVKYNDDIKYF

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-