Basic Information

Gene Symbol
-
Assembly
None
Location
ML762991.1:533983-544828[-]
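
The location string above can be split into its parts programmatically. The following is a minimal sketch, assuming the layout is scaffold accession, colon, start-end, and strand in square brackets, and that the coordinates are 1-based and inclusive (neither is stated on this page). The names `LOCATION_RE` and `parse_location` are hypothetical.

```python
import re

# Assumed layout: <scaffold>:<start>-<end>[<strand>]
LOCATION_RE = re.compile(
    r"^(?P<scaffold>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text):
    """Split a location string into scaffold, start, end and strand."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return {
        "scaffold": match.group("scaffold"),
        "start": int(match.group("start")),
        "end": int(match.group("end")),
        "strand": match.group("strand"),
    }


loc = parse_location("ML762991.1:533983-544828[-]")
# Genomic span, assuming 1-based inclusive coordinates.
span = loc["end"] - loc["start"] + 1
print(loc, span)
```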

Transcription Factor Domain

TF Family
DM
Domain
DM domain
PFAM
PF00751
TF Group
Zinc-Coordinating Group
Description
The DM domain is named after dsx and mab-3 [2]. The dsx protein contains a single amino-terminal DM domain, whereas mab-3 contains two amino-terminal DM domains. The DM domain has a conserved pattern of zinc-chelating residues, C2H2C4 [1]. The dsx DM domain has been shown to dimerise and bind palindromic DNA [3].
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1  5   2.9e-08   9.7e-05   22.1   6.3   2         40      429       468     427       472     0.82
2  5   1.2e-09   3.9e-06   26.5   10.3  4         42      478       517     475       523     0.90
3  5   1.2e-05   0.038     13.8   7.6   4         36      525       558     522       561     0.91
4  5   7e-11     2.3e-07   30.4   11.7  3         40      926       964     924       972     0.83
5  5   2.6       8.5e+03   -3.4   1.7   20        36      1705      1719    1700      1722    0.63
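
The per-domain rows above can be read into records and filtered by independent E-value. This is a minimal sketch, not the database's own pipeline: the field names, the helper functions, and the 0.01 cutoff are assumptions chosen for illustration.

```python
# Domain rows copied from the hmmscan output above.
HMMSCAN_ROWS = """\
1 5 2.9e-08 9.7e-05 22.1 6.3 2 40 429 468 427 472 0.82
2 5 1.2e-09 3.9e-06 26.5 10.3 4 42 478 517 475 523 0.90
3 5 1.2e-05 0.038 13.8 7.6 4 36 525 558 522 561 0.91
4 5 7e-11 2.3e-07 30.4 11.7 3 40 926 964 924 972 0.83
5 5 2.6 8.5e+03 -3.4 1.7 20 36 1705 1719 1700 1722 0.63
"""

# Hypothetical field names, one per column in the header row.
FIELDS = (
    "index", "of", "c_evalue", "i_evalue", "score", "bias",
    "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to", "acc",
)
INT_FIELDS = {
    "index", "of", "hmm_from", "hmm_to",
    "ali_from", "ali_to", "env_from", "env_to",
}


def parse_rows(text):
    """Turn whitespace-separated domain rows into a list of dicts."""
    rows = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) != len(FIELDS):
            continue  # skip blank or malformed lines
        rows.append({
            name: int(value) if name in INT_FIELDS else float(value)
            for name, value in zip(FIELDS, parts)
        })
    return rows


def significant_domains(rows, max_i_evalue=0.01):
    """Keep domains at or below the cutoff (0.01 is an assumed threshold)."""
    return [row for row in rows if row["i_evalue"] <= max_i_evalue]


hits = significant_domains(parse_rows(HMMSCAN_ROWS))
for hit in hits:
    print(hit["index"], hit["ali_from"], hit["ali_to"], hit["i_evalue"])
```

With this assumed cutoff, domains 3 and 5 are dropped; a different threshold would change which hits are retained.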

Sequence Information

Coding Sequence
ATGCAGCGAGTCCAGCTGTCGGTACGGGCGCTAaggcccgccctgcagcgcagcCCCCTGTGTGACCTTGGCGTCGCCGGAgccgcgagcgacgcgagcgGGGACGAGTCGGACGAGTTCAAGgatgaggaggagccCGCTCCCCCGGAGCCCCCtgaccgcggcgcccgcgctcgtggtggtggcgccccgctgccgcccctggcgccctggccgccgctgccaccaccaaTGCCGCCGCTACCCAaggaggcgccccccgcgctgccgcccctgccgcccctgccagcGTCACCGCCCCTCGACGAGAATCACAACAAGCCGAAAGGTGGTGTGATCTGGGACGACGACATGGGCTTGGAGTCGCTTGGGGACGCCCTCAACAGGGAGAAAGAGCTGATCGCCCGCTGGGACATGGCGGACCTGGTGGTGCTCTCTGACCTGAAGGACGACGTCGAGGCCGCCTCCCCGTCCAGGaagcccccggccaccccggagccgcccggctggccgcgctccggCTCTGACTCGAGGAGCGCCAAGGGGTCTGGAACTTCGGCGTGCAgccggagcggcagcaggaagCGAAGCCGCAGCACGAGCCGCAGTAAGAGGTGCAGTCGGAGCCGAAGTAGGAGCCGCTGCAGGAAGCGCAGTGCGAGTCCCAGCCCTACCAGGAGacggagccgcagccgcagcaggaggcgaagccgcagccgcagcagaagGCGCAGCCCCGTCGGCAGCTCCGACGGACGCagaggccgacgacgccgacgccgccgtcgtcgcagctccagtagcagcagcagaagtttGAGCCGCAGCAGAAGCCGGAGCAGAAGCCGTCtgagcagccccagccccagtcgcagccgcgggccgggtcggggatggcgcggggcgcgaggccgcGGACGGAGCCGCCACCGCTCCGGGGACTCTCGCGGCCGACGGAGTCGAAGCCGCAGCACCTGCATCGTGTCGCGCTCCGTCGTGGATGtcaagcgcggccgcggcgccgccccgtcagccaggggcggccgcggcgacctgCTCCgggtggccgtcgccgccgtcgccgctgtcgccgcgcccgctcgctcccggTCGCCGCCATCTCAGTcgtcctcgcgctcgccgccctccttgCCGTCTACACGGCCTAGCAGCGCCGAGGGCACGACGGGCTTCACCATGCAGAACGACGCCGACACCGTGGAGGAGTTCGACCTCCAGGACAGGCTGGTGAAGGGAGCCTGGCGCAagagcggcggcaggcggccgacGCCCTGCAAGATGTGCATGAACCACGGCCTGCTGCGCATGTTCGTGGGCGCGCACCGGGCGCACTGCCGCTACCTCGGCTGCGCCTGCGACCTCTGCGTGGAGAGCCGGCGCCGGCTGGGCTCTTCGGAGGGCAAGCTCATCAAGTGCACCATGTGCGAGAACCACGGCGAGGACGTCCTCTTCTCCCAGGAGCACAAGTTCCGGTGCAAGTACTTCAAGTGCATGTGCCGCCTATGCCGCGGCGTGCGCACCCGCTTCAAGGCCCATATGCAGAACAAGACGCTGTGTCCGCTGTGCGAGCGCCACCAGGTGTTCACTTTTGATGACAACGTGCACCTCAAGTTGTGCAGGTTCCGCGGGTGTCGCTGTCACCTCTGCCAGGCGGCGTCGCCCCCGCACCGGCTCGGCCTGGTGGCATCGTACGGCAGCGACGGTGACTCCAGGGACGGCCTCGAGGGGCCGCTCTCCAGGAAGGGTCGAGACAGCCCGGATCCTGGAGAAGCAGAACCGCCTCCTCCTGGTGTGGATGAAGAGCCGCCCGGGCCCTTAGTGACTCTCCCGGGTATGGACAAGTATCGTGACGTCACTCCCTCTCCTAGCCCGGATGCAGATGAAGATGTTGCCCCGACTGAAGTGGATGCAACCAACAAAGTCGCTTCCTCGGCGACGAATCAGGGGGAAGGAATTGGAAGTGAGGCGACGCCCCCAAC
AAAGAGTGATCCCGGGGAAAGGAACAGCAGTGCTGGGCCCTTCTCAGGTCTGGAGGAAACCGTCGACGCCATTTCAGAATTTTTGCGCAATTCTGACAGCGGCCCAATGTCACCGTCTCCAACTACCGCAGCCCATCACAATGAAGACACGGATGACACTCTTGAAAATATTGATTTGGAGGCCCTGCTTGACGACTTGATGGAAGAAGATGCACCCGCTGAGGAGCACACCTCCCCAAATGTGTCCTCCCAAGCCTCACCAAGTATCTCGCGCACTAAATCTGAAACTACCCCCTCACTTCCCAGCCTTAAAACGACGATGCCGTCCACGCCTCTGCGTTCTAGCTCGCAACGGAGTTTGGAGCCAATGCTAGGGAAACCTAGTGAAGACTCACCCTCCAGTGACAACAGTTACCgGTTGCGGTTGACACCCTCGCCTGACGAGAGGCACTCAGGGACAGAGAGCTTGCATCAGTCTCACGGCGGCGCCTCTTCCAACCTGGTGGTATTCGAGTTGGAGGCATCCCTagatgatgacgacgaggaggacctCGACCTGGACGACACCCTCTCTGATCGGCTGGTGATCGACCTTGACGCTGCGTCGGACCTGACAGTGACACCGGCGGGCACGCCCACGCCGCTGGACGATGCTGCGAGGGTGAGCCCCTCTAGGATGATTCCGGACCCTACAACAAAGACTGAATCCTCCGCGGTCACGGCCGAGGCCAagaccgagccgggcgggactgCGCAAACCTCGCCCGCAAACCAGCGATATTGCGCTTTTTGCCGGAACCACGGCAAGTTGCGGGTGCTCAAAGGCCACAAGACGTCGTGCGAGTACCGTCAGAAGTGCAAGTGTCCTCCCTGTGAGAAAAGGTCCCAACAAAACCAGCGACTGGCCTCCGGCCAACCAtctccgcgggggagggggcgtccccgTATCCGCGATGTCGGTTTCGGTGGCATTCCCCGCACCGTTGAGAGTTCTCAGGTTCATCTTGGAGACCAAATATCTCAGGAGGCTAAAATAAAAGTTGAGGATTACGGAGTCACACCAGTTTCAGTGAAACTAGAAGAGATTAACTCAGTTCTTGTAAAGCAAGATGAGAGTAACGCCAAGACAAAGgaCCCAGAAGACATTCCTTTAATTTTCCGATTACAAAGTAGCATTAATGAAAAGATAAAGAAATTGGGTATCAAACCCTTGAGTGATAACCTAGATGTAAAACCCACAGAATCAATGAAACTTGAAGATAAGCCACTCTATGTTAGACTTCAAGAATTTAGGATGAAGTCAAAGGACAACGCAAAGCCTGGGCCATTATGTAGTAAACTCTTAAAAAAGAcaattaagaaaaagaaaattaagtcATCTCCAGCTGACAAGAGTCAAAAAGAAGTGAAATCTCAACCATTAGTTGGAATTCCCTCCGATCAAATAGCTAAAGTCTGCTCCGAAGTACAATCTCAAGCTTTATACAAAGAAACCAAAGTAATGTCAAGTAATAATGCCAAAGAAGATAAATTTAAATGTGAAACCTTAAATGAATCTAAATTGGTTTCTGCAAAGGTCGATGAATGCATGGAACCACTGAAAAAGATAAAAGTAAGTCCAGAATCAGGAACCAAATCAACAAAATTTCAAGTTATCACAATACTGAACAATACAGTAATAGGGAAAGTCACTGTACGCAATGAGGCTGAAATGTTAGCCACCCCTGAAGATTTTTCCGAACCCAAAATGTTAGAAAAGGTTTCAAATAACATTAGTGCTCTCCCTCTTGAGCACACTCACAGCTTGTCACAATTAGAGCATCTAGCTGTGTCCCCACCATCAGTCCAAAGCTCCCCTCAGGCCCTGCTGCGGAAACAAGCCACTCCTACCACCCCAACTAGTCCAATGGATTTGAGTGTTGCTCGGAGCTCTCAATATACTTTGAGATCTCCTGAAGTTACAGTACTCTCACCTCCTGACGCTAAGCCCCGGCTTCCCC
TGAAGTTGAAGCTGAAGTTGACGGCGAGCCCGTCGAGTTCACCACAGGATCGTTCATCACCATCTAGTACAACATCTGCACCTAAAGTGTACTGCGTCGTACCGTCTGCTTCTGCACCGTCACCGAGTGAACCTGAACCCCAAGCCATCCCAGCCTGTGTTAGTGCGCCGCCAGCCTTCCCACCCTCTGCCAAAGTACCTCAATCGATCCCACCCTCTGCAAATGCACCTCAATCCCTCCCACCCTTTTCAAATGCACCTCAATCCCTCCCACCCTTTTCAAATGCACCTCAATCCCTTCCACCCTCTGCAAATGTACCTCAATCCCTTCCACCCTCTGCAAATGTACCTCAACCCTTCCCAACCTGTTCAAATGTGCGTCAAGCCCACATACCCTCTTCTAACGTACATCAACCTTTCCCACCCTCTTCAAATGCCTCTCATGCCTTCCCAGCCGCTACTAATTCGCATCAACCTCTCGCGACGTCTGCGAGTCCAAACTTCACGGGGTCTGCAAGTCTTCCTCAAGCCTACAAGCCTTCAACGAGTGCCCCTCCAGCCTACTTCTCCTCACCGGTGTCTCAGGTGCCTCCAGGCAGGGCCACGGCGTCGTCTGCGCCCCCATGCTTCGGCCCCGCACCTTCTGCCACCGCCCGGCTCTCTGCCTGCGAGGCCGAGTCTGCGCTGCCCGAGctcgtcgacctggaggacGTGTTGGGGGTCGGCGATGGcacgctggcggccgcggccaaacCTGTCCCGGCGCCgagcacggtggcggcggggacgtggaaggtcccggcggcgcccgtaCCACAGGTCGCCGCTCGCCGGGCGCAACAACTCGCCTCCTGGGGAAGTGTCAAGAAGTCGGCCGACCCCATGGCCTCGCTGGCCAGGATGGCGGCCTCttggggcggcagcgcggcggttgTGCCTTCGATCCCCGCCAGCCCGGTCGGCCCTGGGCCGCCCACGTTCAGGgcgcccgctgcacccgccccggcgcggccctgggagTCGACTTCAGTGATCATTAACAAAACCCACCCTTGCGGCGACTGCGGCCTCAGGCAGCCATCtgccctcgccgcggcgcggcaccgcgtcgACCACCACACGGCGTGCACGAGGCGAGAGTGCGACAAGTGCAGCGGCGTGTCCTGGAGTAAGCTGCAGAGTGACGTCCACCAAGCGCTGGCGCACCCGCCGGGCGTGCAGCACCGCTGCCCCGTGTGCGCCGCCTGCTTCCCCTCGGACGTGATGCTCACCACCCACCTGCTGACCATGAAGGACCCCCCTCACGTGGACATGCGCTCAAGCCTGCCCGCGGAGCAGGCCAAACTCACCTGCCCCATCTGTTGGAGGTTCCGAGCCAAGGACCCCAAGGTGCTGGGCAGCCACGTTCGCCTGGAGCACGCCAACTGCACGCCCTTCGCCTGCGACGAGCCGGGCTGCAGCTTGGCCTTTGACGCGCCCTTGGCTCTGCAGACGCATCGACAGAGGCACAACCTGGACAACGCTCAGGAGCGCTGCCTGGCATGCTGCGTGCCCTTCGTGACGCGGGAGCTGCTCAACGACCACTGGCGGCAGAAGCACTCCGGCGTCTCCCCCTTCGCCTGCCCGTTGTGCTCCAAGTCTTtccgcgcgcctgccggccTCACCCAGCACATCCGAAGGCAGCACCCCGGCAGCACTCCCTTCCGATGCCGCGAGGAGGGCTGCAACCGCAGCTTCCCCTGCGTCGCGCTCCTGCACGAGCACGCCTGCGAGAGCAGGCACATGTCCCTGCCTGCGCCCAGCCACCTGTTCACGCAGCCCAAGCTGTGGCAGCCCCAGCTCGACCCtgtcagcagcgccgccaccagcacgcccagcacctGCTCCGTGTGTCCGGCGTCCCTGCCCGACGAGTGGTCCCTCATTGACCACCAGGATGTGGAGCATATGAACTTGGTGCCCATAGCGCGTGCGACTCAGCTGCTCTCCTTGACAAGAGAT
GCAGAACTTCTGCGTCCCCGGCAAGGgtattttcccccctctccttatgCTTCTGACTCTGTAACTACCTCGCGTGCACCGTACTCCACTGCACAAACAAGTGTGCAACCCCAGGTCCAGCCCCAAATCACACCCCATTACTCATCCCAATCCCAAGACCCATCCCAACCCCAAATCCCACCCCATTACCCATCCCAACCCCAAATCCCACCCCATTACCCATCCCAACCCCAAACCCAACTTCCTAGTTTTACTAACCGACCAGTCTTTGCGCAACCACCGCAAACTTTTGTTCAATCTACCTCTTATGTACAAACTGCTTCTTTTGTACAACCCCCTTTGTACTCTGAGTTTAAGCAAACTTGA
Protein Sequence
MQRVQLSVRALRPALQRSPLCDLGVAGAASDASGDESDEFKDEEEPAPPEPPDRGARARGGGAPLPPLAPWPPLPPPMPPLPKEAPPALPPLPPLPASPPLDENHNKPKGGVIWDDDMGLESLGDALNREKELIARWDMADLVVLSDLKDDVEAASPSRKPPATPEPPGWPRSGSDSRSAKGSGTSACSRSGSRKRSRSTSRSKRCSRSRSRSRCRKRSASPSPTRRRSRSRSRRRSRSRSRRRSPVGSSDGRRGRRRRRRRRRSSSSSSRSLSRSRSRSRSRLSSPSPSRSRGPGRGWRGARGRGRSRHRSGDSRGRRSRSRSTCIVSRSVVDVKRGRGAAPSARGGRGDLLRVAVAAVAAVAAPARSRSPPSQSSSRSPPSLPSTRPSSAEGTTGFTMQNDADTVEEFDLQDRLVKGAWRKSGGRRPTPCKMCMNHGLLRMFVGAHRAHCRYLGCACDLCVESRRRLGSSEGKLIKCTMCENHGEDVLFSQEHKFRCKYFKCMCRLCRGVRTRFKAHMQNKTLCPLCERHQVFTFDDNVHLKLCRFRGCRCHLCQAASPPHRLGLVASYGSDGDSRDGLEGPLSRKGRDSPDPGEAEPPPPGVDEEPPGPLVTLPGMDKYRDVTPSPSPDADEDVAPTEVDATNKVASSATNQGEGIGSEATPPTKSDPGERNSSAGPFSGLEETVDAISEFLRNSDSGPMSPSPTTAAHHNEDTDDTLENIDLEALLDDLMEEDAPAEEHTSPNVSSQASPSISRTKSETTPSLPSLKTTMPSTPLRSSSQRSLEPMLGKPSEDSPSSDNSYRLRLTPSPDERHSGTESLHQSHGGASSNLVVFELEASLDDDDEEDLDLDDTLSDRLVIDLDAASDLTVTPAGTPTPLDDAARVSPSRMIPDPTTKTESSAVTAEAKTEPGGTAQTSPANQRYCAFCRNHGKLRVLKGHKTSCEYRQKCKCPPCEKRSQQNQRLASGQPSPRGRGRPRIRDVGFGGIPRTVESSQVHLGDQISQEAKIKVEDYGVTPVSVKLEEINSVLVKQDESNAKTKDPEDIPLIFRLQSSINEKIKKLGIKPLSDNLDVKPTESMKLEDKPLYVRLQEFRMKSKDNAKPGPLCSKLLKKTIKKKKIKSSPADKSQKEVKSQPLVGIPSDQIAKVCSEVQSQALYKETKVMSSNNAKEDKFKCETLNESKLVSAKVDECMEPLKKIKVSPESGTKSTKFQVITILNNTVIGKVTVRNEAEMLATPEDFSEPKMLEKVSNNISALPLEHTHSLSQLEHLAVSPPSVQSSPQALLRKQATPTTPTSPMDLSVARSSQYTLRSPEVTVLSPPDAKPRLPLKLKLKLTASPSSSPQDRSSPSSTTSAPKVYCVVPSASAPSPSEPEPQAIPACVSAPPAFPPSAKVPQSIPPSANAPQSLPPFSNAPQSLPPFSNAPQSLPPSANVPQSLPPSANVPQPFPTCSNVRQAHIPSSNVHQPFPPSSNASHAFPAATNSHQPLATSASPNFTGSASLPQAYKPSTSAPPAYFSSPVSQVPPGRATASSAPPCFGPAPSATARLSACEAESALPELVDLEDVLGVGDGTLAAAAKPVPAPSTVAAGTWKVPAAPVPQVAARRAQQLASWGSVKKSADPMASLARMAASWGGSAAVVPSIPASPVGPGPPTFRAPAAPAPARPWESTSVIINKTHPCGDCGLRQPSALAAARHRVDHHTACTRRECDKCSGVSWSKLQSDVHQALAHPPGVQHRCPVCAACFPSDVMLTTHLLTMKDPPHVDMRSSLPAEQAKLTCPICWRFRAKDPKVLGSHVRLEHANCTPFACDEPGCSLAFDAPLALQTHRQRHNLDNAQERCLACCVPFVTRELLNDHWRQKHSGVSPFACPLCSKSFRAPAGLTQHIRRQHPGSTPFRCREEGCNRSFPCVALLHEHACESRHMSLPAPSHLFTQPKLWQPQLDPVSSAATSTPSTCSVCPASLPDEWSLIDHQDVEHMNLVPIARATQLLSLTRD
AELLRPRQGYFPPSPYASDSVTTSRAPYSTAQTSVQPQVQPQITPHYSSQSQDPSQPQIPPHYPSQPQIPPHYPSQPQTQLPSFTNRPVFAQPPQTFVQSTSYVQTASFVQPPLYSEFKQT
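
The relationship between the coding sequence and the protein sequence above can be spot-checked by translating the first codons. This is a minimal sketch, assuming the standard genetic code applies to this organism and treating lowercase bases the same as uppercase ones; the `translate` helper is hypothetical and is not the tool used to produce this record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks, ignore case
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 42 bases of the coding sequence listed above.
prefix = translate("ATGCAGCGAGTCCAGCTGTCGGTACGGGCGCTAaggcccgcc")
print(prefix)
```

Passing the full coding sequence to the same function, with its line breaks included, allows the result to be compared against the complete protein sequence.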

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-