Ecen000956.2
Basic Information
- Insect
- Eupithecia centaureata
- Gene Symbol
- Stat5b
- Assembly
- GCA_944548335.1
- Location
- CALYMU010000032.1:1123598-1156206[+]
Transcription Factor Domain
- TF Family
- STAT
- Domain
- STAT_bind domain
- PFAM
- PF02864
- TF Group
- Beta-Scaffold Factors
- Description
- STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. This family represents the DNA binding domain of STAT, which has an ig-like fold. STAT proteins also include an SH2 domain Pfam:PF00017.
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 3 0.045 7.1e+02 1.5 0.2 48 125 128 208 110 213 0.81 2 3 1.4e-22 2.2e-18 67.9 0.1 1 79 330 414 330 427 0.89 3 3 1.3e-28 2e-24 87.4 0.1 54 133 428 507 413 507 0.91
Sequence Information
- Coding Sequence
- ATGTCGCTCTGGGCCAGAGCTCAGCAGCTCCCACCAGAGAGTTTGCAGAAGGTGCGTACAATCTACGTGGATCATTTTCCCATCGAAGTTCGCCATTGCCTAGCGCCATGGATCGAAAGCAGGATTTGGACGGCAGAGCCAGAGGAACAGCAGCGGTTCTTCGTGGAGGAACTGGTCCAGGAGATCCAGACGCACGCCGACTTGATGCTGTCTCCAGACATGTTCGTCACCAAAATGAAGCTTCTGGAAGCGGCCAAGAACTTCCACATGCAGTACAGCCACGCTCCCCATGAACTGTACGCCTACATGCGCCGCTGCCTGGGACTGGAGATGGAGGTCATCCAAGCGGCCATGGGCGGCCAGTATCCGGCCCAACCTCAGACCGAGCGCAAGTACAGCGAGCTTATAACGGGCCTACAAACAGTACGCCAGAAGGTGAATGTAGCCGGAGAAGAGATTAGAAGCTTGCAGGCTAACATCGAGTCCTTCTCTCTGCAATACCACGAGTGTCTCAAGAACAAAGGTCACATGAACTATCTACAGCAGAGTATGACGAACGAGCGCCGCGACCTGGTGGCCTGTCTCCGAGTACAGATTGAGGATACTGAGAGGAAACTTAACGCTTTGGTGGCTCAAATCAGCCAATCCCAAATGGAGCTCGTCGATCACATGAAAGAGAACATCGCTAACTTGAGACAACTGCAGAGTCAAGTCTTAGACGAGGAACTCATCAAATGGAAACGAGAACAACAGCTGACAGGCAACGGAGTCCCCATGCAATCCAACCTGAACACCATCCAGGAGTGGTGCGAGCTGCTCGCAGACCTTATATGGAACACCAGGCAACAGGTGAACAACGTGGCCCGAATCAACACTAAGACGATAGTGGAGCTCCGACAGCCGCATTTAGCTGATATGCTCGACGATATGAGCAAgcagGTGACTGGATTGTTATCTACACTAGTGACATCTACATTCGTGATCGAGAAACAACCGCCGCAGGTTATGAAGACTAATACACGTTTCACGGCTACAGTCCGCTTGCTGGTGGGGGGTCAGCTCAACGTGTACATGACTCCGCCCAGAGTCAGCGTGGCAATAATATCAGAGCAGCAAGCGCAGTTGTTGCTAAAAAGCGAGACCCAAGCCGGCAAGGGCAAGCAGCCGGTGGAGTGTGGAGAGATCCTCAATAACTCTGGCGCTATGGAGTACCAGCCGACCAGTCGCCAACTCTGTACGAGTGTAGCTGTAGCAGAGACACAAGCCGGCAAGGGCAAGCAGCCGGTGGAGTGTGGAGAGATCCTCAATAACTCTGGCGCTATGGAGTACCAGCCGACCAGTCGCCAACTCTGTGTGAGCTTCAGAAACATGCAGCTTCGCAAGATAAAGCGCGCCGAGAAGAAAGGCACCGAGAGCGTAATGGATGAGAAATTGACTTTGCTTTTCCAATCGCAGTTCAATGTCGGCGGGGGCGAACTGGTGTTCCAGGTGTGGACGCTTTCCCTGCCCGTGGTTGTAATCGTCCACGGGAACCAAGAGCCCCACGGCTGGGCAACCGTCACGTGGGACAACGCCTTCTCCGCGCCGGGACGAGTGCCTTTCCACGTGCCCGACAAGGTAACATGGGGCCTATTAGCTGAGACGCTCCGCATCAAATTCTGTTCAGCCACTGGCGGGGATCTATCAGAGGACAACCTACGCTTCCTAGCTGAGAAGATCTTCAGGACCAACCTCCCAATCAACACGCTAGAACTGAACGGCATGGCGGTGTCCTGGACCCAGTTCTGTAAGGACGCGCTGCCGGAGAGGAACTTCACCTTCTGGGAGTGGTTCTACATGGTTGTGAAGGTCACCAGGGACTATCTGCGGACGCTCTGGTGCGACCGCCTGATTCGCGGCTTCATCCAGAAGAAAGGCGCTGAGGACATGCTCTCCAAGTGTCCCCCGGGGACGTTCCTGCTGCGCTTCTCAGACTCTGAGCTAGGGGGCATCACCATCGCCTGGGTTGGTGAGGGGAACGAGGTGTTCAGTCTCCAGCCGTTCACCTCCCGAGACCTGATGCTTCGCTCGCTCGCAGACCGCGTCCTGGATTTACCCCAGCTGCAGTTTCTGTACCCTAACATAGCCAAGGACGATGTCTTCTCCAAGTACTACACCAAACCGGaGAACGAGATGCTCAAAAACGGCTACGTGAAGCCAGTTTTAGTGACCACCTTGCCTCCCTACATGTCGTCCGCTTCCCCTGCGTACGCACACTCGCCCGACTCCCATCGCAACACGCCTTCCGTCACCAGCAGCTACTTCAGCGCTCAGACGCCGGCTACAGTAGACACATTCATGGACAGCGAGCTTTTTGAACAGATACGTGCCTTTGAGCCGGAGGGACTCGACGATTTGGACTTTTACAACAATGTCGCTATGAAGTAA
- Protein Sequence
- MSLWARAQQLPPESLQKVRTIYVDHFPIEVRHCLAPWIESRIWTAEPEEQQRFFVEELVQEIQTHADLMLSPDMFVTKMKLLEAAKNFHMQYSHAPHELYAYMRRCLGLEMEVIQAAMGGQYPAQPQTERKYSELITGLQTVRQKVNVAGEEIRSLQANIESFSLQYHECLKNKGHMNYLQQSMTNERRDLVACLRVQIEDTERKLNALVAQISQSQMELVDHMKENIANLRQLQSQVLDEELIKWKREQQLTGNGVPMQSNLNTIQEWCELLADLIWNTRQQVNNVARINTKTIVELRQPHLADMLDDMSKQVTGLLSTLVTSTFVIEKQPPQVMKTNTRFTATVRLLVGGQLNVYMTPPRVSVAIISEQQAQLLLKSETQAGKGKQPVECGEILNNSGAMEYQPTSRQLCTSVAVAETQAGKGKQPVECGEILNNSGAMEYQPTSRQLCVSFRNMQLRKIKRAEKKGTESVMDEKLTLLFQSQFNVGGGELVFQVWTLSLPVVVIVHGNQEPHGWATVTWDNAFSAPGRVPFHVPDKVTWGLLAETLRIKFCSATGGDLSEDNLRFLAEKIFRTNLPINTLELNGMAVSWTQFCKDALPERNFTFWEWFYMVVKVTRDYLRTLWCDRLIRGFIQKKGAEDMLSKCPPGTFLLRFSDSELGGITIAWVGEGNEVFSLQPFTSRDLMLRSLADRVLDLPQLQFLYPNIAKDDVFSKYYTKPENEMLKNGYVKPVLVTTLPPYMSSASPAYAHSPDSHRNTPSVTSSYFSAQTPATVDTFMDSELFEQIRAFEPEGLDDLDFYNNVAMK
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00000961;
- 90% Identity
- iTF_00701737; iTF_00704728; iTF_00698048; iTF_00696282; iTF_00361390; iTF_01219562; iTF_01170348; iTF_01171242; iTF_01182446; iTF_00376879; iTF_00662864; iTF_00810776; iTF_00856381; iTF_00638430; iTF_01428830;
- 80% Identity
- -