Scos004569.1
Basic Information
- Insect
- Scrobipalpa costella
- Gene Symbol
- TSPAN5
- Assembly
- GCA_949820665.1
- Location
- OX463296.1:17879109-17890846[+]
Transcription Factor Domain
- TF Family
- GTF2I
- Domain
- GTF2I domain
- PFAM
- PF02946
- TF Group
- Other Alpha-Helix Group
- Description
- This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain [2, 1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 4 0.001 45 5.7 0.0 9 38 170 199 165 203 0.88 2 4 0.0011 48 5.6 0.0 10 38 218 246 209 249 0.86 3 4 0.0012 54 5.5 0.0 11 38 266 293 260 296 0.86 4 4 0.0011 48 5.6 0.0 11 38 313 340 307 353 0.88
Sequence Information
- Coding Sequence
- ATGCCAGGAATACGGAAATACCGACGCGACACCAGCGAAGTCAGCTGTTGTttgaaatatgttatttttggaGTAAATGTTTTGTTCTGGttccttggtctagtggtgcTGGCAGTGGGGATATGGGCCTGGTCGGAGAAGGACACCTTCAACAACCTGTCCAGATTAACCAACATAGCCTTGGATCCGGCATTCATATTGATATGTGTTGGTACAATCACCTTCATAATCGGGTTCACCGGATGCGTCGGCGCCTTGCGTGAGAACACCTGCCTTTTGGCTTGCTATGCGGTGTTCCTAGCGTTACTGCTTCTAGCTGAGATGACCGTAGGAATTCTCTTCTTTGTTTTCAAAGACTGGATAAAGCAGCAAGCAACCAGTGGCTTCCAGACGTTCATCACGCATTACAGAGAAGACCCTGATCAGCAGAATCTCATTGACTGGATACAAGAAGATTGGTTGCAATGCTGCGGCGTGGAAGGGCCGCGCGACTGGGACCGCAATGCGTACTTCAACTGCTCGTCGGGCGCGGTCGGGTCGCGGGAGGCGTGCGGGGTGCCCTTCAGCTGCTGCCGAGCCAAGCCCACTGACGTCATCCGCAACAAGCAGTGCGGCTACGACGTGCGGAAACCCACTTATCTGAATAAGTACTTCAACTGCTCGTCGGGCGCGGTCGGGTCGCGGGAGGCGTGCGGGGTGCCCTTCAGCTGTTGCCGAGCCAAGCCCACTGACGTCATCCGCAACAAGCAGTGCGGATACGACGTGTGCAAGCCCACTTACCTGAATAAGTACTTCAACTGTTCGTCGGGCGCGGTCGGGTCGCGGGAGGCGTGCGGGGTGCCCTTCAGCTGTTGCCGAGCCAAGCCCACTGACGTCATCCGCAACAAGCAGTGCGGATACGACGTGTGCAAGCCCACTTACCTGAATAAGTACTTCAACTGTTCGTCGGGCGCGGTCGGGTCGCGGGAGGCGTGCGGGGTGCCCTTCAGCTGCTGCCGAGCCAAGCCCACTGACGTCATCCGCAACAAGCAGTGCGGCTACGACGTGCGGAAACCCACTTATGGTGTTAGCGAGCGCGTGATCCACGAGCACGGCTGCCTGGCGGCGGGCGAGGATTGGCTGCAGAGGAACTTCCTACCGGTGGCTGCGACGGTACTTGCTCCGGGCGCGCTAAAGAACTATGATATAGCTAAGATAATATATGACAAGGGTTGCCTCGAGGCGGCGGAGGAGTGGTTCGACCACAACTTGTTGATAGTAGCCACGTCAGCTGTTTGTACCGCTTTTGCACAAATCCTAGGCATCTGCTTTGCGCAGAACCTCCGCGCAGACATCTTCGCGCAGAAGGCGAAGTGGCACTGA
- Protein Sequence
- MPGIRKYRRDTSEVSCCLKYVIFGVNVLFWFLGLVVLAVGIWAWSEKDTFNNLSRLTNIALDPAFILICVGTITFIIGFTGCVGALRENTCLLACYAVFLALLLLAEMTVGILFFVFKDWIKQQATSGFQTFITHYREDPDQQNLIDWIQEDWLQCCGVEGPRDWDRNAYFNCSSGAVGSREACGVPFSCCRAKPTDVIRNKQCGYDVRKPTYLNKYFNCSSGAVGSREACGVPFSCCRAKPTDVIRNKQCGYDVCKPTYLNKYFNCSSGAVGSREACGVPFSCCRAKPTDVIRNKQCGYDVCKPTYLNKYFNCSSGAVGSREACGVPFSCCRAKPTDVIRNKQCGYDVRKPTYGVSERVIHEHGCLAAGEDWLQRNFLPVAATVLAPGALKNYDIAKIIYDKGCLEAAEEWFDHNLLIVATSAVCTAFAQILGICFAQNLRADIFAQKAKWH
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -