Bros031082.1
Basic Information
- Insect
- Bacillus rossius
- Gene Symbol
- sup-9
- Assembly
- GCA_032445375.1
- Location
- CM063641.1:126184210-126207577[-]
Transcription Factor Domain
- TF Family
- STAT
- Domain
- STAT_bind domain
- PFAM
- PF02864
- TF Group
- Beta-Scaffold Factors
- Description
- STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. This family represents the DNA binding domain of STAT, which has an ig-like fold. STAT proteins also include an SH2 domain Pfam:PF00017.
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 15 0.0014 75 6.4 0.0 91 132 31 70 17 71 0.88 2 15 0.22 1.2e+04 -0.7 4.9 96 132 100 137 72 186 0.63 3 15 0.011 5.6e+02 3.6 6.0 34 132 115 208 102 209 0.71 4 15 0.044 2.4e+03 1.5 9.4 33 131 185 300 169 302 0.63 5 15 0.05 2.7e+03 1.4 3.9 95 132 263 301 235 324 0.67 6 15 0.013 7e+02 3.2 4.5 61 131 329 393 283 395 0.62 7 15 0.0043 2.3e+02 4.8 0.6 85 132 420 464 394 465 0.76 8 15 0.0034 1.8e+02 5.2 0.1 97 132 473 508 465 509 0.91 9 15 0.0016 84 6.2 1.7 96 131 571 606 536 630 0.56 10 15 0.00064 34 7.5 0.3 95 132 636 673 609 674 0.73 11 15 0.00023 12 8.9 1.5 87 132 718 761 674 762 0.63 12 15 0.00031 16 8.5 0.1 96 132 769 805 761 806 0.90 13 15 0.032 1.7e+03 2.0 0.0 96 132 813 853 807 854 0.88 14 15 0.0035 1.9e+02 5.1 0.8 95 132 860 897 824 924 0.61 15 15 0.0016 84 6.2 0.1 95 132 930 967 900 968 0.85
Sequence Information
- Coding Sequence
- ATGTTACACTCCAGAGTTCATTGGGGTCAAGTATCAGGTCGCTCTCGGACACGACTCTGCAAGTGCCGACCTCGCGTCCGAGTCGCTCGGTGCCGGCGGAACCAGAAGGTACACGTGACAGAGCGCAGTCCCTCAGTTGTCATGGCGACTGTGCTCACTCAGCGGAACCAGAAGGTACACGTGACAGAGCGCAGTCGCCcagttgtcatggcaactgaaCTCACTCAGCGGAACCAGAAGGTACACGTGACAGAGCGCAGTCCCTCAGTTGTCATGGCGACTGTGCTCACTCAGCGGAACCAGAAGGTACACGTGACAGAGCGCAGTCCCTCAGTTGTTGTCATGGCAACTGTGCTCACTCAGCGGAACCAGAAGGTACACGTGACAGAGCGCAGTCGCCCTGTTGTCATGGCAACTGAACTCACTCAGCGGAACCGGAAGGTACACGTTACAGAGCGCAGTCGCtcagttgtcatggcaactgtGCTCACTCAGCGGCACCAGAAGCGGAACCTGAAGGTACACGTGACAGAGCGCAGTCCCTCAGTTGTTGTCATGGCAACTGTGCTCACTCAGCGGAACCAGAAGGTACACGTGACAGAGCGCAGTCGCCCTGTTGTCATGGCAACTGAACTCACTCAGCGGAACCGGAAGGTACACGTTACAGAGCGCAGTCGCtcagttgtcatggcaactgtGCTCACTCAGCGGCACCAGAAGCGGAACCAGAAGGTACACGTTACAGAGCGCAGTCCCtcagttgtcatggcaactgtGCTCACTCAGCGGCACCAGAAGGTACACGTTACAGAGCGCAGTCGCTCAGTTGTTGTCATGGCAACTGTGCTCACTCAGCGGAACCAGAAGGTACACGTGACAGAGCGCAGTCGCCCTGTTGTCATGGCAACAGAACTCACTCAGCGGAACCGGAAGGTACACGTGACAGAGCGCAGTCCCTCAGTTGTTGTCATGGCAACTGTGCTCACTCAGCGGAACCAGAAGGTACACGTTACAGAGCGCAGTCGCtcagttgtcatggcaactgtGCTCACTCAGCGGCACCAGAAGCGGAACCAGAAGGTACACGTGACAGAGCGCAGTCCCTCAGTTGTCAAGGCAACTGAACTCACTCAGCGGAACCGGAAGGTACACGTGACAGAGCGCAGTCGCCCAGTTGTCAAGGCAACTGAACTCACTCAGCGGAACCGGAAGGTACACGTGACAGAGCGCAGTCCCtcagttgtcatggcaactgaaCTCACTCAGCGGAACCGGAAGCGGAACCGGAAGGTACACGTGACGGAGCGCAGTCGCtcagttgtcatggcaactgaaCTCACTCAGCGGAACCGGAAGGTACACGTGACCGAGCGCAGTCGCtcagttgtcatggcaactgaaCTCACTCAGTGGAACCGGAAGGTACACGTGACAGAGCGCAGTCGCCAAGTTGTCATGGCAACAGAACTAACTCGGCGGAACCAGAAGGTACACGTGACGGAGCGCAGTCGCCcagttgtcatggcaactgaaCTCACTCAGCGGAACAttgtcatggcaactgaaCTCACTCAGCGGAACCAGAAGGTACACGTGACAGAGCGCAGTCGCCCAGTTGTAATGGCAACTGAACTCACTCAGCGGAGCCGGAAGGTACACGTGACAGAGCGCAGTCGCCCTGTTGTCATGGCAACTGAACTCACTCAGCGGAACCGGAAGGTACACGTGACAGAGCGCAGTCGCCcagttgtcatggcaactgaactcactcagcggaaccagaaggtacacgtgacagagcgcagtcgcccagttgtcatggcaactgaaCTCACTCAGCGGAGCCGGAAGGTACACGTGACAGAGCGCAGTCGCCCTGTTGTCATGGCAACTGAACTCACTCAGCGGAACCGGAAGGTACACGTGACAGAGCGCAGTCGCCcagttgtcatggcaactgaaCTCACTCTGCGGAACCGGAAGGTACACGTGACGGAGCGCAGTCGCCCAGTTGTCATGGCAATTGAACTCACTCAGCGGAACCGGAAGGTACACGTGACAGAACGCAGTCGCtcagttgtcatggcaactgaaCTCACTCAGCGGAACCGGAAGGTACACGTGACAGAGCGCAGTCCCTCAGTTGTCATGGCAATTGAACTCACTCAGCGGAACCGGAAGGTACACGTGACAGAACGCAGTCGCtcagttgtcatggcaactgaaCTCACTCAGCGGAACCGGAAGGTACACGTGACAGAACGCAGTCGCCcagttgtcatggcaactgaaCTCACTCAGCGGAACCAGAAGGTACACGTGACAGAACGCAGTCGCtcagttgtcatggcaactgaaCTCACTCAGCGGAACCGGAAGGTACACGTGACGGAGCGCAGTCGGCcagttgtcatggcaactgaaCTCACTCAGCGGAACCGGAAGGTACACGTGACAGAGCGCTGTCCCtcagttgtcatggcaactgaaCTCACTCAGCGGAACCGGAAGCGGAACCGGAAGGTACACGTGACGGAGCGCAGTCGCCcagttgtcatggcaactgaaCTCACTCAGCGGAACCGGAAGGTACACGTGACAGAGCGCAGTCCCTCAGCTGTCATGGCAACTGAACTCACTCAGCGGAACCAGAAGGTACACGTGACAGAGCGCAGTCGCCcagttgtcatggcaactgaaCTCACTCAGCGGAACCGGAAGCGGAACCGGAAGGTACACGTGACAGAGCGCAGTCGCCcagttgtcatggcaactgaaCTCACTCAGCGGAACCGTAAGGTACACGTGACAGAGGGCCGTCGCCCAGTTTTCATGGCAACTGAACTCACTCAGCGGAACCGGAAGGTACACGTGACGGAGCGCAGTCGCCCAGTTGTCATTGCAACTGAACTCAGCGGAACCGGAACGTACACTGAAGAGAAGGCTTGCGGGCGGCTGGTGTCCGCAGGCTGCAAGGAGCCGGTGCTGTGCAACAAGTACGCCGTGGAGCTGGAGGACGCCGCGAGGTCACCGAGGTCACCGCGCAGGCCCAAGGAGCGGCCGAGACCGCCGGCCTCCCAGCAGGGCGGCTGCATCGCGCGCCAGGCGTCCTTCAAGCAGCGCCACCAGCGGCTGTCGCGCCTGCGCGAGGACCCCGACTACGAGGACTACGACGAGCTGGGCCTGGTCGGCTACGACTACGACGACCCCGCCTCGCCGCCCACGCGCACCCGGCCCGTGCCCATCTGGCTGTGCGTGTTCCTCGTCGTCACCTACATCTTCTGCGGCGCGCTGCTCTTCATGAACTGGGAGGAGTGGAGCTTCCTCGACGCGGCCTACTTCTGCTTCATCACGCTCACCACCATCGGCTTCGGCGACTTCGTGCCGGCCAGGCGGGTGCAGAAGACCGACGCCGGAGTCAGCATCGCGCTGTGCTCCTTGTACCTGCTGTTCGGCATAGCGCTGCTGGCCATGAGCTTCAACCTCGTGCAGGAGGAGGTCATCAACAACGTCAAGGCCGTCGCAAAACATTTGGGCATTGTGAAGGACGACGAAGAAGAAGATGAGGGTGACTAa
- Protein Sequence
- MLHSRVHWGQVSGRSRTRLCKCRPRVRVARCRRNQKVHVTERSPSVVMATVLTQRNQKVHVTERSRPVVMATELTQRNQKVHVTERSPSVVMATVLTQRNQKVHVTERSPSVVVMATVLTQRNQKVHVTERSRPVVMATELTQRNRKVHVTERSRSVVMATVLTQRHQKRNLKVHVTERSPSVVVMATVLTQRNQKVHVTERSRPVVMATELTQRNRKVHVTERSRSVVMATVLTQRHQKRNQKVHVTERSPSVVMATVLTQRHQKVHVTERSRSVVVMATVLTQRNQKVHVTERSRPVVMATELTQRNRKVHVTERSPSVVVMATVLTQRNQKVHVTERSRSVVMATVLTQRHQKRNQKVHVTERSPSVVKATELTQRNRKVHVTERSRPVVKATELTQRNRKVHVTERSPSVVMATELTQRNRKRNRKVHVTERSRSVVMATELTQRNRKVHVTERSRSVVMATELTQWNRKVHVTERSRQVVMATELTRRNQKVHVTERSRPVVMATELTQRNIVMATELTQRNQKVHVTERSRPVVMATELTQRSRKVHVTERSRPVVMATELTQRNRKVHVTERSRPVVMATELTQRNQKVHVTERSRPVVMATELTQRSRKVHVTERSRPVVMATELTQRNRKVHVTERSRPVVMATELTLRNRKVHVTERSRPVVMAIELTQRNRKVHVTERSRSVVMATELTQRNRKVHVTERSPSVVMAIELTQRNRKVHVTERSRSVVMATELTQRNRKVHVTERSRPVVMATELTQRNQKVHVTERSRSVVMATELTQRNRKVHVTERSRPVVMATELTQRNRKVHVTERCPSVVMATELTQRNRKRNRKVHVTERSRPVVMATELTQRNRKVHVTERSPSAVMATELTQRNQKVHVTERSRPVVMATELTQRNRKRNRKVHVTERSRPVVMATELTQRNRKVHVTEGRRPVFMATELTQRNRKVHVTERSRPVVIATELSGTGTYTEEKACGRLVSAGCKEPVLCNKYAVELEDAARSPRSPRRPKERPRPPASQQGGCIARQASFKQRHQRLSRLREDPDYEDYDELGLVGYDYDDPASPPTRTRPVPIWLCVFLVVTYIFCGALLFMNWEEWSFLDAAYFCFITLTTIGFGDFVPARRVQKTDAGVSIALCSLYLLFGIALLAMSFNLVQEEVINNVKAVAKHLGIVKDDEEEDEGD
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -