Bsom039751.1
Basic Information
- Insect
- Bedellia somnulentella
- Gene Symbol
- STAT5A_1
- Assembly
- GCA_963576735.1
- Location
- OY755198.1:10580332-10610888[-]
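The Location field packs the sequence accession, coordinates, and strand into one string. A minimal sketch of splitting it into parts, assuming the layout is `seqid:start-end[strand]` and that the coordinates are 1-based and inclusive (the record does not state the coordinate convention; `parse_location` and `LOCATION_RE` are illustrative names, not part of any database API):

```python
import re

# Assumed layout: <seqid>:<start>-<end>[<strand>], e.g. OY755198.1:10580332-10610888[-]
LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(location: str) -> dict:
    """Split a location string into seqid, start, end, strand, and span length."""
    match = LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"Unrecognised location string: {location!r}")
    start = int(match.group("start"))
    end = int(match.group("end"))
    return {
        "seqid": match.group("seqid"),
        "start": start,
        "end": end,
        "strand": match.group("strand"),
        # Span length assumes 1-based, inclusive coordinates (an assumption).
        "length": end - start + 1,
    }


locus = parse_location("OY755198.1:10580332-10610888[-]")
print(locus)
```

Under the inclusive-coordinate assumption, the locus spans 30,557 bp on the minus strand of OY755198.1.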
Transcription Factor Domain
- TF Family
- STAT
- Domain
- STAT_bind domain
- PFAM
- PF02864
- TF Group
- Beta-Scaffold Factors
- Description
- STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. This family represents the DNA-binding domain of STAT, which has an Ig-like fold. STAT proteins also include an SH2 domain (Pfam: PF00017).
- Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to  acc
1  4   0.54      2.4e+04   -3.0   0.4   39        67      32        60      3         117     0.55
2  4   0.21      9.4e+03   -1.6   0.1   62        130     100       177     49        178     0.69
3  4   0.32      1.4e+04   -2.2   0.1   44        82      598       635     577       644     0.73
4  4   2.1e-46   9.6e-42   144.0  0.4   1         133     1814      1953    1814      1953    0.97
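Only one of the four domain hits in the Hmmscan output has a small E-value. A minimal sketch of picking out the significant hits, assuming the thirteen whitespace-separated fields per hit follow the column order of the header and using an illustrative independent E-value cutoff of 1e-5 (the cutoff, `DomainHit`, and `parse_hits` are assumptions for this example, not values or names taken from the record):

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    index: int
    c_evalue: float
    i_evalue: float
    score: float
    ali_from: int
    ali_to: int
    acc: float


# The four domain rows copied from the Hmmscan output of this record.
ROWS = """\
1 4 0.54 2.4e+04 -3.0 0.4 39 67 32 60 3 117 0.55
2 4 0.21 9.4e+03 -1.6 0.1 62 130 100 177 49 178 0.69
3 4 0.32 1.4e+04 -2.2 0.1 44 82 598 635 577 644 0.73
4 4 2.1e-46 9.6e-42 144.0 0.4 1 133 1814 1953 1814 1953 0.97
"""


def parse_hits(text: str) -> list[DomainHit]:
    """Parse rows of 13 whitespace-separated fields into DomainHit objects."""
    hits = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 13:
            continue  # skip blank or malformed lines
        hits.append(
            DomainHit(
                index=int(fields[0]),
                c_evalue=float(fields[2]),
                i_evalue=float(fields[3]),
                score=float(fields[4]),
                ali_from=int(fields[8]),
                ali_to=int(fields[9]),
                acc=float(fields[12]),
            )
        )
    return hits


I_EVALUE_CUTOFF = 1e-5  # illustrative threshold (assumption)

significant = [h for h in parse_hits(ROWS) if h.i_evalue <= I_EVALUE_CUTOFF]
for hit in significant:
    span = hit.ali_to - hit.ali_from + 1
    print(f"domain {hit.index}: residues {hit.ali_from}-{hit.ali_to} ({span} aa), score {hit.score}")
```

With this cutoff only hit 4 is retained, which places the STAT_bind domain at residues 1814-1953 of the protein.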
Sequence Information
- Coding Sequence
- ATGGAAAAGGAAATAAAGGTGAGATCCGTTATCAGACGACAATTTTCAAAATCGGCCAATAATTTAGATTCATTActcaaagaatataaaataggtACTTCTACAAAAAATAGTTTAGAAGCAAGCTATAATTtaacaaataatgtttttgCTGAGCTTAAAAACCACGATGAAAAGGTGATGACGCACATGTTGGAAAACGAACAATTCGATGATATAATGTTAGAAAAAGAATTGGATGAAGTCGATAAATACCGATTACGATGGGAATCACTGAAACTAACATTTTCATCTATTTTAAGTACGTCAAACCGAGAAACGAAAATTCAAGTAAACgaaatgaaatttaaattaccaaTGATGGAATTAAAAAAGTTCGATGGTAATATTAGAAATTGGTTACTTTTCTGGGGTCAATTCAAACGTATTGACGAAGATATTAATATTGATGAAATAGACAAACTACAATATTTAACACAATTTATGACACCAAACACCGAAGTCAAAGCATTCGTAGAAAGCTTTCCCCTAATACCTGAAAGTTACAATTTATGTCTCGCCGAACTAAAATCGAGGTATGCCAGAGAAGATGTACTTATTCAAATCTATGTTCGTGAACTACTAAGTTTGATACTCACAAAAGATAAAAATGTTGATTTGACTAGTTTATACGATAATTTGAGCAGCCACTTACGATCACTTGAAGCTCTAAAAGTTACAAAGGAAAAATACTCTATATTCCTGTTGCCAATGGTGGAGTCTGCGTTGCCCTCAGAAATAATTAAAGCTTGGGAAAGAACACGATCTTCTAAGACCACCGATGACCTGACAAGCCTACTTGAGTTTGTAAAGAAAGAAGTCGAGAGTGATCAAAGAGTAACCTTAGCTAGTGAATTTAAGATTCAGTCCAGTTCATCGGAGCCAGCAGCAACCTGTCTTGCAGTTCACGATAGCGTGATAAATAAACgtgaaattaaatgtatatggTGTGATAAGGCACACGCAAGCACAGAATGTTACAGGCTTGCAAAGATGACCATTGATGACAGAAAGGAGTTCCTGAAAAAGAAAAAGGCTTGTGCATTATGCCTCAAAACAGGTCACCACGCGAAGTTTTGCAAGAATTATATAAAGTGTTTTGTATGCGGTCAACGTCATTCTCCTGTTCTATGTTCTGGCATTATaagtacaaatattaataaagtagAGATGAAATCTATTTCAAGCTATTCACAAACATCCTTTACTACATTATTACAAACTGTCAATGTCAACGTACGTTATGGCCAGCAAACAGTGACAGTAAGGGCATTACTTGACAGCGGTTCCCAACGTTCGTATATCAAGGAAGATGTTATAAGGAAACTACACGCACAACCTACTGCCACTGAAAGTCTGGGACATTCATTATTTGGAGGTAATACTGtatcatataaaaaatacaatgtctACGATGTGAGTATTGAAAATATAGATAAATCTTTTCAAATTAATATGTTGGTGTTAGGTCAGGCAGTAATTTGTACTAATGTTCCTCAAGTAAGTAACAATAATATGTCTATCTCGTTACTACTGgataaacataatattcatcTGACAGACACGGAAGGTGAAAGTTCTGATAAAGATATAGGTCTTCTAATAGGAGCAGATTTTCTAGGACTTTTACTGACTAATAATATTGTTCAGTTAGGTGAAGGTTTAACAGCAATTAATACTAAACTGGGATGGACATTACAAGGGAAAATAAATGATACGTTAGTTAATACTAGTATGGTAACGTCTTTGTTTTGTTCACAgaatataagtgatttttggaatTTAGAAATATTAGGTATCCAAGATCCAGTAGAAATGCagttaaaacaacaaaaagaacaagaaatcatatttaattttgaagaaacGATTCAAGTTAACAACGAAGGAAGGTATGAAGTACAACTTCCATGGAAAACAGGATATAATGATCTTCTTAGCAATCGTGAACTCGCCTCG
CGCCGATTACAATCTGCAACAAACAAAATGGTGGTGAATAACAAGTTTTATGATTACCATAAGGTTTTAAAAGAATGGGAACAAAGTGGCATAATAGAACCAGTACCAGCTACAGATTTAGATAATAAGAGTCACTACTTACCCCATAGAGCAGTTTATAAAGATACTAGCCTTACTTCAAAAATACGCCCCGTGTTCGACGCATCTGCAAAAGATAAAAGAGGTAGATCATTAAATGCGTGTTTAGAAAAGGGTGTAAACTTTTTAGATAAAATACCTGATTTATTAATAGGTTTTAGAAAAGGATATATAGGTCTCATAGCTGATATAGCAAAAGCATTTCTACAAATATCATTAAACCCTAAAGAGAGAGATAATTTAAGATTTCTTTGGTGGAGAGATGAGATGCGACAAGATTTCATAGAATATAGACACTGCCGAGTTGTTTTTGGACTAACTTCGAGTCCATTCTTGTTGTCAGCCACAATATTACATCATTTAAATCAGATCAAAGGTAAGCACGAAGAGACTGCCAAAATTTTAGCCAGCTCATTTTATGTTGATAATTTAGTAACAAGCGTAGACTCAAAAGAAGATGCTGTGAGGTTTATTCGAGAGGCAAAAGATATTATGATGGAAGCTAAGTTCGATCTGAGAATGTGGGTCACatctacattaaaaataaatgaaactaaTCAAAATATTGTGTCCGTATTAGGTTTATTATGGGATACAACTAGTGATGATATATTTTGCAACCTAAATTCATTACTTAGCCCTAttgattatgtttttaataaaagaataattttGTCACTTACTCAGAAGATTTTCGATCCAGTTGGAATGTTATGTCCAGTCACCCTTATTCCAAGGTTAATATTACAAGACATATGGAATAGGAAAATAAACTGGGATGAAGATTTACCTGATGATATAATAAAGCAATTTCGGGAATGGTATAGCGATATTAACTTGTTAATAAATTGTAGAATTCCTAGAAGATTCTCACTAGTATCATTAGAGAGTTGTAAGAACGTCAGCATTCATATGTTTTGTGACGCTAGCAAAAATGGATATGCAGCATGCGTTTTTATAAGAACTGAAGATAAGAATGGTAATATAACAATTAGATTAGTGTGTGCAAAGTCTAGAATATCACCTGACAAAGCAACAATACCACGTCTCGAGCTATTAGGGGCTCTTATGGGTTCAAGGTTGATAAAAGCggcaagaaataaaataagtgaAAACAAAATACACAACGAGTTTTGCTGGTGTGATTCGGCAGTAGCATTATGTTGGATTAAACGACAACTAGACTGGAATACTTACATAGGAAACCGCGTAAAAGAAATAAGATTAAACACTGATGTCAATAGTTGGCAACACATACCAGGGTGTGACAACTGGGCCGATTTACCATCGAGAGGTTGCTATCCTCACCAATTTCTTGAATCCAGATGGTGGGAAGGTCCACCATGGCTATTTCTGCGTTCCGAACATTGGCCACAATCCACAATGGTTGTTGATGAAAAGGAGGCAATCAAAGAACTGAAAAGGTCTGCAGTAATAAACATCAGTTGTGATGGGAGTTTAACAaCACCCTTACCTTTAGAACGTATGCAAATGAATGCACCATTTGAAGCCGTAGGAATTGATCTTGCCGGTCCCTTATACTTACGTGACGGTAGTAAAAGCTGGATAGTGGTTTACACGTGCGCGGTATACCGTGCAATACATTTAGAACTATCACTATCTACCGAGGCATTTATGATGACGTTAAGACGCTTTATCGCTCGAAGAGGGCGGCCAAGTATAATTATTTCAGATAATGGTACCAATTTCATAGGAGCAAAGAATTTGTTGTCTGCTGTCGACTGGGATGAAGTACAGCGTCAGTCAACAATCATGCGCATAAAATGGAAACTCAACGTTCCCGCAGCACCTTGGTGGGGTGGATTTTTTGAacgtatgataaaaataattaaaatgttgtt
GAGACGGGTTTTAGGTAAGTCTATAGTATCCTATGAAGAAATGGAAACTCTGTTGTGTGACTGTGAATCTGTAGTCAATGATAGACCTCTTACCTATGTTGATGCCGATGACGTGAATTCCCTAGAACCCATTAAACCTTCTTGTTTTTTAAACCCGCTTCCTACAACACAAGTGACTGATTTAGATGTGATTGATAGTCAAATTTGTAATAGAAGGTTGCGATACTTACAGAGGCTAAGAGAAGATTTTAGAAACCGTTTTAAGAATGAATACCTTACTGAACTCATACAAAAAGGGAAGGAAACAAATTCTCCTATTAGAGTTGGTGACATTGTTCTTGTTGAAACCGAAGAGAGAAGGGTAAAGTGGCCTTTGGGTcttgttttagaaatatttaccGGTAAGGACGGCATAAATAGAACTGCAAAGGTGAAAACCTCAACTGGTTTTAAAACACGTCCTTTTCAAAGATTGTACCAGTTAGAACTTTCTTCTTCGAAAGTTGAAGGTCCATCTACTGATCCTATAACTAAAGCATTTTTCCGTAATAAAAAATCTTTGGAACCAACAACGACGAGgGCCAACGAATCCGAGGACCAACAACGCATGTTTCTAGACGACCTGTGTCGCGAGATAACGGTGACCGCGGACATGATGACCACGCAGGACACGTTTGTGACCAAGATGAAGTTGTTTGATGCGGCCAAACAACTGCAGTTGCAGTATGGGCACAGTCCTCACGAGTTGTACGCGCACGTAAAGAGTTGTTTTCACGCGGAGATGGAGATCATAGAGAGTTGTCTGCAAGGATACGGACAGTCCATGCAGACAGAGATGAAGTATAGTGAGTTAATAAACGGCCTAAAAACAATAAGCCAAGAGGTGGCGCTAGTGAGCGAGCAACTGAGAGTCACTCAAGCCAAGATCGAGTCTTTCTCTATACACTATCACGAGTGCGGGAAGCATAAAGtcCATCTAAACTTCATGCAATCCCAAACACCGACGATGCAACCGAACCACGAAATGGTCGCGAGACTGAGACAGACTATTGAAAATTTGGAAGTTAACCTCAACGATCTGATGATGCAAATAAAGCAGTCGATGATGATGTTGGTTGACCGACTGAAGGCTAATATAGACACGCTGAGAAAGCTGCAGAGCGAAGTTTTGGATAGAGAACTTATTagATGGAAACGCGAGCAGCAATTAAACGGCAATGGTGTCCGAATGTCGTGCAACTTAGACGCCGTACAAGAATGGTGCGAGAAATTAGCAGAACTTATTTGGGTCACTCGCACTCAAGTCAACAACGCTTTGGCAGTCAACGCGGCTTGTATGCCCGAATTGGCCTCGCAGCATTTGAATCCTGTGTTGAAAGATGCTGCTCAGCAGATCACGTGTCTGCTAACGACTCTAGTGACATCTACATTCGTGATTGAGAAACAACCGCCTCAGGTCATGAAGACTAACACGAGGTTTACCGCCACAGTCCGTCTTTTAGTCGGAGGTCAGTTAAACGTGCACATGACCCCTCCGCGGGTTTCCGTCACCATCATATCGGAACAACAAGCGCAGTTGTTACTAAATAGGGAGCAGTGTTCCGGAAGTCCGGTTAAGGAACCGGAACCGTGTGGTGATATACTGAATAATACCGGAACGATGGAGTTTCCGCAGCCGACTAAACATCTTTCGGTTACTTTTAGAAACATGCAGCTACGTAAAATCAAGCGCGCAGAGAAGAAAGGAACCGAAAGCGTGATGGATGAAAAATTGACCTTGTTCTTTAGTTCCACTTTCAAAGTGGGTGGTGGAGAACTTGATTTTCAGGTATGGACTTTATCCCTGCCGGTGGTGGTTATAGTCCATGGAAACCAAGAACCCCACGGCTGGGCCACGGTCACGTGGGACAACGCGTTCAGCCAGCTCGATAGAGTGCCCTTTCATGTGCCGGAGAAGGTGTCCTGGGGTCAACTAGCCGAAACTTTGTGCGTAAAATTCA
AACATCAGACGGGCGGGGATTTGACACAGGATAATTTGAGATTCCTAGCTGAGAAGATATTTAGAGCCTCAGTACCCGCCAACGTGAACGAGCTGAACCAAATGCTAGTGTCCTGGACGCAATTCTGCAAGGACGCTTTGCCCGAACGCAACTTTACGTTCTGGGAATGGTTCTATATGGTCATGAAGGTCACTAGGGACCATTTGAAACAGCTGTGGAGCGAACGCCTAATAATTGGTTTCATCCAAAAGAAGCAAGCAGAAGAGATGTTATCCGAATGTCCGTCCGGTACTTTCCTACTAAGGTTCTCTGACTCGGAGCTAGGTGGCGTTACTATCGCGTGGGTGGCTCCTGGGGCCGATGGAGACAACGAAGTATACAGTCTCCAACCATTCATTACCCGGGATCTCTTGCTACGTTCCGTGGCCGATAGAGTGATGGATCTCCCACAACTGCAATACTTATACCCGAATATACCTAAAGATGCCGTGTTCTCCAAGTATTATACTAAGCCTGACGAGCCCCAGAAGAATGGCTACGTAAAGACCGGTTTAGTTCTCGCGATACCCGGGTTCATTCCCCACGCACAGTCCCCGGACTCGCATCGTAACACTCCTAGTGTTAACTCCAGGTAA
- Protein Sequence
- MEKEIKVRSVIRRQFSKSANNLDSLLKEYKIGTSTKNSLEASYNLTNNVFAELKNHDEKVMTHMLENEQFDDIMLEKELDEVDKYRLRWESLKLTFSSILSTSNRETKIQVNEMKFKLPMMELKKFDGNIRNWLLFWGQFKRIDEDINIDEIDKLQYLTQFMTPNTEVKAFVESFPLIPESYNLCLAELKSRYAREDVLIQIYVRELLSLILTKDKNVDLTSLYDNLSSHLRSLEALKVTKEKYSIFLLPMVESALPSEIIKAWERTRSSKTTDDLTSLLEFVKKEVESDQRVTLASEFKIQSSSSEPAATCLAVHDSVINKREIKCIWCDKAHASTECYRLAKMTIDDRKEFLKKKKACALCLKTGHHAKFCKNYIKCFVCGQRHSPVLCSGIISTNINKVEMKSISSYSQTSFTTLLQTVNVNVRYGQQTVTVRALLDSGSQRSYIKEDVIRKLHAQPTATESLGHSLFGGNTVSYKKYNVYDVSIENIDKSFQINMLVLGQAVICTNVPQVSNNNMSISLLLDKHNIHLTDTEGESSDKDIGLLIGADFLGLLLTNNIVQLGEGLTAINTKLGWTLQGKINDTLVNTSMVTSLFCSQNISDFWNLEILGIQDPVEMQLKQQKEQEIIFNFEETIQVNNEGRYEVQLPWKTGYNDLLSNRELASRRLQSATNKMVVNNKFYDYHKVLKEWEQSGIIEPVPATDLDNKSHYLPHRAVYKDTSLTSKIRPVFDASAKDKRGRSLNACLEKGVNFLDKIPDLLIGFRKGYIGLIADIAKAFLQISLNPKERDNLRFLWWRDEMRQDFIEYRHCRVVFGLTSSPFLLSATILHHLNQIKGKHEETAKILASSFYVDNLVTSVDSKEDAVRFIREAKDIMMEAKFDLRMWVTSTLKINETNQNIVSVLGLLWDTTSDDIFCNLNSLLSPIDYVFNKRIILSLTQKIFDPVGMLCPVTLIPRLILQDIWNRKINWDEDLPDDIIKQFREWYSDINLLINCRIPRRFSLVSLESCKNVSIHMFCDASKNGYAACVFIRTEDKNGNITIRLVCAKSRISPDKATIPRLELLGALMGSRLIKAARNKISENKIHNEFCWCDSAVALCWIKRQLDWNTYIGNRVKEIRLNTDVNSWQHIPGCDNWADLPSRGCYPHQFLESRWWEGPPWLFLRSEHWPQSTMVVDEKEAIKELKRSAVINISCDGSLTTPLPLERMQMNAPFEAVGIDLAGPLYLRDGSKSWIVVYTCAVYRAIHLELSLSTEAFMMTLRRFIARRGRPSIIISDNGTNFIGAKNLLSAVDWDEVQRQSTIMRIKWKLNVPAAPWWGGFFERMIKIIKMLLRRVLGKSIVSYEEMETLLCDCESVVNDRPLTYVDADDVNSLEPIKPSCFLNPLPTTQVTDLDVIDSQICNRRLRYLQRLREDFRNRFKNEYLTELIQKGKETNSPIRVGDIVLVETEERRVKWPLGLVLEIFTGKDGINRTAKVKTSTGFKTRPFQRLYQLELSSSKVEGPSTDPITKAFFRNKKSLEPTTTRANESEDQQRMFLDDLCREITVTADMMTTQDTFVTKMKLFDAAKQLQLQYGHSPHELYAHVKSCFHAEMEIIESCLQGYGQSMQTEMKYSELINGLKTISQEVALVSEQLRVTQAKIESFSIHYHECGKHKVHLNFMQSQTPTMQPNHEMVARLRQTIENLEVNLNDLMMQIKQSMMMLVDRLKANIDTLRKLQSEVLDRELIRWKREQQLNGNGVRMSCNLDAVQEWCEKLAELIWVTRTQVNNALAVNAACMPELASQHLNPVLKDAAQQITCLLTTLVTSTFVIEKQPPQVMKTNTRFTATVRLLVGGQLNVHMTPPRVSVTIISEQQAQLLLNREQCSGSPVKEPEPCGDILNNTGTMEFPQPTKHLSVTFRNMQLRKIKRAEKKGTESVMDEKLTLFFSSTFKVGGGELDFQVWTLSLPVVVIVHGNQEPHGWATVTWDNAFSQLDRVPFHVPEKVSWGQLAETLCVK
FKHQTGGDLTQDNLRFLAEKIFRASVPANVNELNQMLVSWTQFCKDALPERNFTFWEWFYMVMKVTRDHLKQLWSERLIIGFIQKKQAEEMLSECPSGTFLLRFSDSELGGVTIAWVAPGADGDNEVYSLQPFITRDLLLRSVADRVMDLPQLQYLYPNIPKDAVFSKYYTKPDEPQKNGYVKTGLVLAIPGFIPHAQSPDSHRNTPSVNSR
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -