Dsan007994.1
Basic Information
- Insect
- Drosophila santomea
- Gene Symbol
- dsx
- Assembly
- GCA_016746245.1
- Location
- NC:6546601-6585552[+]
Transcription Factor Domain
- TF Family
- DM
- Domain
- DM domain
- PFAM
- PF00751
- TF Group
- Zinc-Coordinating Group
- Description
- The DM domain is named after dsx and mab-3 [2]. dsx contains a single amino-terminal DM domain, whereas mab-3 contains two amino-terminal domains. The DM domain has a pattern of conserved zinc chelating residues C2H2C4 [1]. The dsx DM domain has been shown to dimerise and bind palindromic DNA [3].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 1 1.6e-24 5.6e-21 73.8 9.8 2 47 41 86 40 86 0.97
Sequence Information
- Coding Sequence
- ATGGTTTCGGAGGAGAACTGGAATAGCGACACGATGTCCGACTCGGACATGATCGACTCAAAGAACGACGTATGTGGCGGTGCCTCCAGTTCCAGTGGCAGCTCGATTTCGCCGAGGACGCCACCCAACTGCGCCCGCTGCCGCAATCATGGCCTGAAGATTACACTGAAGGGACACAAGAGGTACTGCAAGTTCCGCTACTGTACGTGCGAAAAGTGCCGGCTGACGGCGGACCGCCAGCGGGTGATGGCTCTGCAAACGGCCTTGAGGCGAGCCCAGGCGCAGGACGAGCAGCGGGCACTGCACATGCACGAGGTGCCGCCTGCCAATACGGCGGCCACAACTTTGCTGAGTCATCACCATCATGTCGCAGCTCCCGCCCATGTCCACGCCCACCATGTGCATGCCCATCACGCACACGGACACCACTCGCATCACGGTCATGTCCTGCACCACCAGCAGGCAGCGgcggccgcagcagcagctccgtCGGCTCCAGCCTCCCATCTTGGTGGATCCACCACGGCCACCTCCAGCCTCCACGGTCACGCCCACGCGCACCACGTCCACATGgcagccgctgccgccgcttCGGTGGCTCAGCACCAGCACCAAAGCCACCCACActcgcaccaccaccaccaccagcagaaCCATCACCAGCATTCGCATCAACAACCGGCCACGCAGACCGCTTTGCGATCTCCGCCGCACAGCGACCACGGGGGCAATGTGGGCccggccagcagcagctctGGCGGTGGAGCACCCAGTTCCAGCAATGCGGCAGCTGCCACTTCGAGCAGCGGATCCAGcagtggaggaggaggaggaggaggaggaggcgcaGGGGGCAGTTCGGGAGGCGGAGCAGGAGGTGGTAGATCGTCGGGGACATCGGTGATCACTAGCGCCGATCATCACATGACCACGGTGCCTACGCCCGCCCAATCGCTGGAGGGGTCCTGCGACTCGTCATCTCCATCGCCGTCGTCCACTTCCGGTGCCGCCATTTTGCCGATCTCAGTTTCCGTCAATCGCAAGAACGGCGCCAACGTGCCCTTGGGCCAAGACGTTTTCCTAGACTATTGCCAAAAGCTATTAGAAAAATTCCGCTATCCTTGGGAGCTGATGCCGCTCATGTATGTGATATTGAAGGACGCAGACGCCAACATTGAAGAGGCTTCCCGGCGAATCGAAGAGGGTAAGTCTGCTGATAACCTTATAGCACCTTACTGCCAAGCCCTTAAACTGGCTGTGAAGTAA
- Protein Sequence
- MVSEENWNSDTMSDSDMIDSKNDVCGGASSSSGSSISPRTPPNCARCRNHGLKITLKGHKRYCKFRYCTCEKCRLTADRQRVMALQTALRRAQAQDEQRALHMHEVPPANTAATTLLSHHHHVAAPAHVHAHHVHAHHAHGHHSHHGHVLHHQQAAAAAAAAPSAPASHLGGSTTATSSLHGHAHAHHVHMAAAAAASVAQHQHQSHPHSHHHHHQQNHHQHSHQQPATQTALRSPPHSDHGGNVGPASSSSGGGAPSSSNAAAATSSSGSSSGGGGGGGGGAGGSSGGGAGGGRSSGTSVITSADHHMTTVPTPAQSLEGSCDSSSPSPSSTSGAAILPISVSVNRKNGANVPLGQDVFLDYCQKLLEKFRYPWELMPLMYVILKDADANIEEASRRIEEGKSADNLIAPYCQALKLAVK
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00571916; iTF_00525896; iTF_00491531; iTF_00590256; iTF_00531675; iTF_00480185; iTF_00581549; iTF_00536785; iTF_00578556; iTF_00505422; iTF_00605587; iTF_00515611; iTF_00529582; iTF_00494404; iTF_00524432; iTF_00609186; iTF_00541092; iTF_00485158; iTF_00569703; iTF_00489391; iTF_00478034; iTF_00490099; iTF_00561779; iTF_00561778; iTF_00613526; iTF_00483000; iTF_00492250; iTF_00606332; iTF_00533117; iTF_00593854; iTF_00548082; iTF_00602610; iTF_00596675; iTF_00488682; iTF_00512023; iTF_00604858; iTF_00604859; iTF_00543133; iTF_00579997; iTF_00504704;
- 90% Identity
- iTF_00543133;
- 80% Identity
- -