Basic Information

Gene Symbol
STAT5B
Assembly
GCA_963576445.1
Location
OY754902.1:5923838-5961056[-]

Transcription Factor Domain

TF Family
STAT
Domain
STAT_bind domain
PFAM
PF02864
TF Group
Beta-Scaffold Factors
Description
STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. This family represents the DNA binding domain of STAT, which has an ig-like fold. STAT proteins also include an SH2 domain Pfam:PF00017.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 2 1.1 1.4e+04 -3.0 0.2 58 119 806 870 793 900 0.65
2 2 5.5e-47 7e-43 146.9 0.9 1 133 998 1136 998 1136 0.94

Sequence Information

Coding Sequence
ATGTCTCTTTGGGCTAGGGCTCAACAGTTACCGCCAGAAAGTCTACAGAAGGTGAGAACCATCTATGGAGACCACTTCCCCATCGAGGTGAGGCACTGCCTCGCTCCATGGATCGAAAGCAGGATATGGCCCCCAGTTCCCGGTGGTCCCGCTATTCCCGGTTACGGTAGCGGCCGCGGTAACGGCGGGGCGGGGGGTGCTAAGAATCTCCGGGTTACGGGAGGCTACCAACAACGACTACAGCTGGCAGCGTTTAACTGCAACACACTTCGGCTGGACCATCACCTTACCGAACTTGAGGTAGAACTAGGTCGCATAAACTGGCACATTTTGGGGTTGTCCGAGATCCGAAGACAGGGGGAGGACACGATAACCCTTGAGTCAGGCGACCTGTTCTACTTCAGAGAAGGTGATCAGCTCTCCCAGGGAGGTGTCGGTTTTCTGATCAAGAGGGACCTCATTAGCAGCGTTGTCGAAGTCGGAAGTGTGTCGAGTCGGGTAGCGTACCTTGTTCTGAAAATCACCAACAGGCGTTCCCTGAAGGTGGTTCAGGTATATGCGCCGACCTCGGCTCACCCTGACGAAACTGCCGAGGCTGTGTATGAGGACATTGTCAAGGCAATGAATCATACCACCAAGGCCACCTACAGCGTTGTTATGGGGGACTTCAATGCTAAAGTGGGAGTACAAGAGCGCGGTGAAACGAGAATCGGACCCTATGGAGTGGGGCGCCGAAACCACCGGGGCCAAATGCTGGTCAACTTTCTAGAATCCCAAGGGCTCTTCTTAATGAATACCTTCTTTAAGAAGAAGCCCCACAGGAAGTGGACTTGGCAAAGGCGGGATTCCATGGTAAGGAACGAGATAGACTTTTTCATCACGGATAGTAAGCACATATTTAGAGATGTCTCCGTGATCAACAGGTTTAAAACCGGAAGTGATCACAGGCTTATCAGAGGCTCTCTAAATATCAACCTTAAGGCTGAAAGATTCCGGATGATACGGTCTACTCTCCGACCTACGCCGCCCCAGGTAGCTATAGGGTCCGAAAAGTTTCAACttgaacttcaaaatcgtttcgcaatgCTGGAAACCACTGGCAGCATTGATGAGAAAACCGACACAGTGGTCAAAACTCTGCAGGAAGTGTCCCGAAAGTTTTTTCCCGGAAAACGCAAACAGCTTGATTCCAAACTCTCTTCCGAGACTCTCGAACTtatgaggaggaggcgtgatGAGCCCTCCACTCAGTCGTCTTCGTCAGGTGGGAGAGCCTTAAaccatcaaattaaaaagttgatacgacgcgacctccgacgctCCAACACACGCGCCATCCAGGATGCGATCGAGCGTAATAGGGGGTCGATAGTGTTCGCTCAGCAATTAGGAAGGCCCCATCTGACGAAACTCAAAACTGAAGATGGCAGAACCATCGCCTCCAAGTTCGAGATACTCGAAGAGGTCGAGAAGTTCTATGGACAGCTGTATGCTTCACGGACGACAAAACCGACGGGACACAGCgatgaagaccacagagccccaCTCACGCGCCATTACACCGAGGAGATACCCGATGTCGAGCAGTGCGAGATAGAGGCGGCTCTCAAACAAATGAAGATTAATAAGTGCCCTGGTGATGACGGAATTACCACTGAGCTCCTAAAGGCAGGcgggaccccagtcctgaaagaGCTAGCGAGCCTATTTAATTCCGTCATCCAACAGGGCTTAACACCCAGGGCATGGAGCGGGAGTGCGGTGGTgttgttcttcaaaaaaggtgATAAGGCCTTGCTGAAGAACTATagacccatctcactcctgagccatATCTACAAACTGTTTTCGAGAGTCGTCACAAATCGCCTCGCCAGTAAGCTCGACGAGTTCCAGCCACcggagcaagccgggtttcgaaaaggctacagcaccgtggaccacatccatactgttcggcagattgtacAGAAGACTGAGGAGTACAGTCTGCCGCTATGTATGGCTTTCGTGGACTACgaaaaagccttcgactccatcgaaacctgggcagttctggatgctctgcagcGGTGCCGTGTCGACTGGCGATACATCGAAGTGCTGAGATGCTTATACGAGACCACCactatGACCGGTGAGCCGGAAGAACAGCAGAGGTTCTTCGTGGATGAGCTGGTTCGGGAGATTCAGGCGCACGCAGACCTCATGCTGTCCCCAGACATGTTCGTCACCAAGATGAAACTCCTGGACGCGGCCAAGAACTTCCACATGACATACAGCCACAGTCCCCACGAGCTGTTCCAGTACATGCGTCGCTCGCTCGCTATGGAGATGGATGTGATCCAGAACGCCATGGGTGCCCCCTACGTGGCGCCCCCTCAGACCGAGAGGAAGTACAGCGAGCTGATCACGGGGCTGCAGACGGTACGCCAGAAAGTGAACATGGTCGGGGAGGAGATCCGCGGCTTGCAGGCCAACATCGAGTCCTTCTCTATCGAGTACCACGAGTGTCTTAAGCATAAAGGTCATATGAACTACCTCCAGCAGTCCATGACCAACGATCGTCGAGACCTGGTGGCCTGCCTCCGCGTACAGATTGAGGAGACGGAGAGGAAACTCAACGCCAAGGTCGCGCAAATTACGCAGTCGCAACTGGAGCTAGTGGACCATATGAAGGAGAACATCACGAACCTCCGCCAGCTGCAGAGCCAGGTCTTGGATGAGGAGCTTATCAAGTGGAAACGTGAACAGCAGCTATCTGGCAACGGAGTACCCATGCAATCGAACCTGAACACCATCCAGGAGTGGTGCGAACTGCTCGCTGACCTCATCTGGACCACTCGCCAGCAAGTCAACAATGTGGCCCGCATCAACACCAAGACCGTGGTGGAACTGCGCCAGCCGCACCTGGCCGAGATGCTGGACGAGATGAGCAAGACGGTGACTGGTCTGCTGTCGACCCTGGTGACTTCGACGTTCGTGATTGAGAAGCAGCCGCCGCAGGTGATGAAGACTAACACGCGCTTCACGGCTACTGTCCGTTTGCTGGTCGGGGGTCAGCTAAACGTATACATGACACCGCCCAGAGTCAGCGTGGTGATAATCTCGGAGCAGCAGGCGCAGCTGCTGCTGAAGAGCGAGACCCAGGCCGGCAAGGGCAAGCAGCCCGTGGAGTGCGGAGACATCCTCAACAACACCGGCACCATGGAGTACCAGCCCACCAGCAGGCAGCTCAGCGTCAGCTTCAGAAACATGCAACTCCGCAAGATCAAGCGCGCCGAGAAGAAAGGCACGGAGAGCGTGATGGACGAGAAGCTGACGCTGCTGTTCCAGTCTCAATTCAACGTCGGAGGGGGGGAGCTAGTCTTCCAGGTGTGGACGCTGTCCCTGCCGGTGGTGGTGATCGTCCACGGCAACCAGGAGCCGCACGGCTGGGCCACGGTCACGTGGGACAACGCCTTCAGCCCGCCCGGCCGCGTGCCCTTCGCCGTGCCCGACAAGGTGACATGGGGCCAGTTAGCCGAGACGCTCCGCATCAAATTCGGCTCAGCCACTGGCGGCGATCTGTCCGAAGACAACCTCCGGTTCTTGGCCGAAAAGATATTTAGGACTTCCCTGCCGATGTCAACCATGGACCTTAACGCGATGGCGGTCTCGTGGACGCAATACTGCAAGGACGCGCTTCCCGAGAGGAACTTCACTTTCTGGGAGTGGTTCTACATGGTCGTCAAGGTCACCAGAGACTACCTTCGGACGCTCTGGTGCGACCATTTAATCATGGGCTTCATTCAGAAGAAGCAAGCCGAAGACATGCTCTCGAAGTGTCCCCCCGGCACGTTCCTGCTGAGGTTCTCTGACTCTGAACTTGGAGGGATCACCATCGCTTGGGTTGGAGAAGGCAACGAGGTGTTCAGCCTCCAGCCGTTCACATCCCGGGACCTGATGCTGCGCTCCCTGGCCGACCGGGTACTAGACCTGACGCAGCTGCAGTTCCTGTACCCCAACATCGCCAAAGATGATGTTTTCTCCAAGTACTACACTAAACCAGAGAGCGATATGCGGACGAACGGCTACGTGAAGCCGGTCCTTGTGACGACGCTGCCGTCCTACATCTCGTCGTCCCCCGCCTACGCGCACTCGCCCGACTCACACCGCAACACGCCCAGCGTGCAGAGCAGGTACGACTCACGCTTCACTCACTACACACTACACGCGAACAGACCACGGCGCGTGGCGGACAAAATGTCCGCcatgcactcttatttgtacgatcataAGAACGACAGTTTCATGGACAGCGAGCTGTTTGAGCAGATCCGCGCGTTCGAACCTGAGGGGCTCGACGACTTTGACATATACAATAATATGGCCATGAAGTGA
Protein Sequence
MSLWARAQQLPPESLQKVRTIYGDHFPIEVRHCLAPWIESRIWPPVPGGPAIPGYGSGRGNGGAGGAKNLRVTGGYQQRLQLAAFNCNTLRLDHHLTELEVELGRINWHILGLSEIRRQGEDTITLESGDLFYFREGDQLSQGGVGFLIKRDLISSVVEVGSVSSRVAYLVLKITNRRSLKVVQVYAPTSAHPDETAEAVYEDIVKAMNHTTKATYSVVMGDFNAKVGVQERGETRIGPYGVGRRNHRGQMLVNFLESQGLFLMNTFFKKKPHRKWTWQRRDSMVRNEIDFFITDSKHIFRDVSVINRFKTGSDHRLIRGSLNINLKAERFRMIRSTLRPTPPQVAIGSEKFQLELQNRFAMLETTGSIDEKTDTVVKTLQEVSRKFFPGKRKQLDSKLSSETLELMRRRRDEPSTQSSSSGGRALNHQIKKLIRRDLRRSNTRAIQDAIERNRGSIVFAQQLGRPHLTKLKTEDGRTIASKFEILEEVEKFYGQLYASRTTKPTGHSDEDHRAPLTRHYTEEIPDVEQCEIEAALKQMKINKCPGDDGITTELLKAGGTPVLKELASLFNSVIQQGLTPRAWSGSAVVLFFKKGDKALLKNYRPISLLSHIYKLFSRVVTNRLASKLDEFQPPEQAGFRKGYSTVDHIHTVRQIVQKTEEYSLPLCMAFVDYEKAFDSIETWAVLDALQRCRVDWRYIEVLRCLYETTTMTGEPEEQQRFFVDELVREIQAHADLMLSPDMFVTKMKLLDAAKNFHMTYSHSPHELFQYMRRSLAMEMDVIQNAMGAPYVAPPQTERKYSELITGLQTVRQKVNMVGEEIRGLQANIESFSIEYHECLKHKGHMNYLQQSMTNDRRDLVACLRVQIEETERKLNAKVAQITQSQLELVDHMKENITNLRQLQSQVLDEELIKWKREQQLSGNGVPMQSNLNTIQEWCELLADLIWTTRQQVNNVARINTKTVVELRQPHLAEMLDEMSKTVTGLLSTLVTSTFVIEKQPPQVMKTNTRFTATVRLLVGGQLNVYMTPPRVSVVIISEQQAQLLLKSETQAGKGKQPVECGDILNNTGTMEYQPTSRQLSVSFRNMQLRKIKRAEKKGTESVMDEKLTLLFQSQFNVGGGELVFQVWTLSLPVVVIVHGNQEPHGWATVTWDNAFSPPGRVPFAVPDKVTWGQLAETLRIKFGSATGGDLSEDNLRFLAEKIFRTSLPMSTMDLNAMAVSWTQYCKDALPERNFTFWEWFYMVVKVTRDYLRTLWCDHLIMGFIQKKQAEDMLSKCPPGTFLLRFSDSELGGITIAWVGEGNEVFSLQPFTSRDLMLRSLADRVLDLTQLQFLYPNIAKDDVFSKYYTKPESDMRTNGYVKPVLVTTLPSYISSSPAYAHSPDSHRNTPSVQSRYDSRFTHYTLHANRPRRVADKMSAMHSYLYDHKNDSFMDSELFEQIRAFEPEGLDDFDIYNNMAMK

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-