Basic Information

Gene Symbol
-
Assembly
GCA_029379305.1
Location
JAHWGI010000295.1:1728715-1732695[+]
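
The Location field packs contig, coordinates and strand into one string. A minimal sketch of how it could be split apart is given here; the names `Location`, `parse_location` and `span_length` are hypothetical, and the 1-based inclusive coordinate convention is an assumption that the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Matches strings shaped like "contig:start-end[strand]".
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


def span_length(loc: Location) -> int:
    """Genomic span, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("JAHWGI010000295.1:1728715-1732695[+]")
print(loc.contig, loc.start, loc.end, loc.strand, span_length(loc))
```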

Transcription Factor Domain

TF Family
DM
Domain
DM domain
PFAM
PF00751
TF Group
Zinc-Coordinating Group
Description
The DM domain is named after the genes doublesex (dsx) and mab-3 [2]. dsx contains a single amino-terminal DM domain, whereas mab-3 contains two amino-terminal DM domains. The DM domain has a conserved pattern of zinc-chelating residues, C2H2C4 [1]. The dsx DM domain has been shown to dimerise and bind palindromic DNA [3].
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1  6   1.4e-05   0.043     14.5   7.8   4         35      7         40      5         48      0.84
2  6   1e-07     0.00031   21.3   8.5   5         35      58        89      55        91      0.88
3  6   0.00076   2.4       8.9    8.9   5         40      92        129     89        133     0.87
4  6   0.00029   0.92      10.2   6.7   5         43      186       226     183       230     0.84
5  6   1.8e-05   0.058     14.1   7.9   2         43      277       318     276       323     0.85
6  6   2.2e-09   6.8e-06   26.6   8.2   1         39      395       434     395       440     0.92
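
The six domain rows above can be read programmatically and filtered by independent E-value. The following is a minimal sketch, not part of the original record: `DomainHit`, `parse_hits` and `significant` are hypothetical names, and the 0.01 cutoff is an illustrative assumption rather than a threshold used by the database.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    index: int        # domain number ("#")
    total: int        # number of domains reported ("of")
    c_evalue: float   # conditional E-value
    i_evalue: float   # independent E-value
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# Data rows copied from the Hmmscan Out table (header omitted).
ROWS = """\
1 6 1.4e-05 0.043 14.5 7.8 4 35 7 40 5 48 0.84
2 6 1e-07 0.00031 21.3 8.5 5 35 58 89 55 91 0.88
3 6 0.00076 2.4 8.9 8.9 5 40 92 129 89 133 0.87
4 6 0.00029 0.92 10.2 6.7 5 43 186 226 183 230 0.84
5 6 1.8e-05 0.058 14.1 7.9 2 43 277 318 276 323 0.85
6 6 2.2e-09 6.8e-06 26.6 8.2 1 39 395 434 395 440 0.92
"""


def parse_hits(text: str) -> List[DomainHit]:
    """Parse whitespace-separated domain rows; skip lines without 13 fields."""
    hits = []
    for line in text.splitlines():
        f = line.split()
        if len(f) != 13:
            continue
        hits.append(
            DomainHit(
                int(f[0]), int(f[1]),
                float(f[2]), float(f[3]), float(f[4]), float(f[5]),
                int(f[6]), int(f[7]), int(f[8]), int(f[9]),
                int(f[10]), int(f[11]), float(f[12]),
            )
        )
    return hits


def significant(hits: List[DomainHit], max_i_evalue: float = 0.01) -> List[DomainHit]:
    """Keep hits whose independent E-value is at or below the (assumed) cutoff."""
    return [h for h in hits if h.i_evalue <= max_i_evalue]


hits = parse_hits(ROWS)
for h in significant(hits):
    print(h.index, h.ali_from, h.ali_to, h.i_evalue)
```

Under this assumed cutoff only some of the six matches are retained; a different threshold would change the selection.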

Sequence Information

Coding Sequence
ATGCCAGCACCCCAGAATTACTGCACTTGGTGCCGGAGACACGGCGAATGGAGCATCTTCAATGGAGGCCATCTCAGAATCTGCCGCTTCTACCGATGTAAATCCTGCAATATTTGTGATGGCAATGCGCAGTCGTCCAAAGCGGCGCTACCCAAGGATGTCAGTGGGTTCTGTAGAAGGTGTGGGAATCACGGAATGCAGGTGCTTGACAAGCCTTGGCACGCAACGAATTGCAGATACGGCGCGTGCCCATGCACCCGCTGCATGGGATTCTGCGTCTTCTGCATGTTCCATGATATCGAGAAGTACGACCACAGGCAGCACTCTCAAGCCTGCAAGTTCCGTGGCTCGTGTCTGTGCTCCCGTTGCGCCGGAGAGCGTCGACGGATGGGTGTGGACTTGAAAGAATCCCCCACTAAGGCTGCTCAGGAAGCAGAGAACAAGAAAGAGAAAACAGTGACGGATGAGACGCGTGTAATTGAGCCAGGGAGGACATCAGCCACCACGATCAAGAATGTGCCGCCCAGTCAGTCGAGTGCGTGTTTTCGAGAACTCTGCGTCTTCTGCAAGTTGCACGGCCAAGAAAAGGAAGATAATGGGCAGCACGCACAGTTCTGCAAGTTGCGGAATCAATGTCCGTGTGGCCTGTGCCTCGGAGAGCGTCGGCGGCAGGGGCTCGTTGTAAAAGACTCGTCAGCTGAGGTGGAAAATCTTATTGCGACAGACCAGCCACGCCAGATCGAGCAGGGAAAAACACAAGCTGAGGAGATCAAGAACGTGAGGAATGTTTCGTCTAGTCAGCCGAGTGGCTGCCGCAGTAGTGGATCTCGACCACTCTGCGTATTCTGCAAGTTTCATGACCAGGAAAAGTTCGATGTTGGAATGCATGCGAAGTTCTGCAAGTTTCGAGGCGAGTGCGCATGCAGCTGTTGCGTTGAAGAGCGCCAGCGGCTGGGTCTGGACGCGAAAGACTCATCCGCGAAGGCCGAGAAACCTGCAGAAGAAAAACCAGAGGAACCTACGGTTGACCAGCCACGATGCAGTCAACAGAAGGCACAAGCTTCGCTCCACACTACCAAAACCGACACTAGTAAAACTAAGAACCAACCTGCTCTTCAAACGGCAGACAAGCGCAAGGGCGATAGAATTGTGTTGCCGGAGCCAAAACGAGCTCACATCGAACGGAAACCTTACTGCCTGTATTGCCTTAATCACGGTTGGCAATTTCCAATGAAAAGTCACCATGTACTTGTCTGTATGTACAGAAACTGTCGATGTTTTGCTTGTCGTACAGTGCGTTCTGAACTTAAGGCCCTCAAGACTTGTAATTACACGTTTTCTCCACCTAAGAAATCCGTTGCTGTTGCTCCCCCTAATCCAAGGACTGTCCCAAACCAATCTTTaatgaattcagaaaaatcagccAATGAACCAACTCTTGGGGAAAAAGCTTCCTCAGTCAAACCAGCCTTGAAGGACTCAGCGTTCAAGTCCGACACGAATACGGCCGAGGAACCTGCTCCAACTGTGAAGCTTGATAAATCTGCTCCAACTTGCGAGGCAAGAAACGTATCAGTGTCTCTTCATACAGCCGAACAATCTACATCCGATAAGATAGACAATCACATAGTGCCCAAAGCAAAGAAAATGTTAAAAGACTCCTCGCTTCAAGAAAACCTGAATGGAGAGGGTTTACCCGAAGAGGAAGCCGCGGAGACAACCATGAGAGAGAAGGCCGCTGTTTGCGAAGTTGTCGTCGTCAACGGAGAAGACTTTGCCGTTGTGGACGAGGTCGGCGCAGACACATCATCTGATGAGATCTCAAGCAGCCAAGAGACCCTGCCAGACCTAAGCACCTCAAATTCTAACACTATTTTGTCCCATGATTTAGCAAAGCAGACGCCTTTCAAGGACGAAGAAGTAAATGAATGTCCTGCGGGTAGCTCACTAGAACATGCATTGAACAGCGTGGCTGGGTTAAAGGAAATTTCTGATATCAA
CCCGAATCTGAATGGAACGTCGGCTTTCTGTACGAAACCCCAAGGTGACACTACTAGTACAGATCGGGTGGTAAGGGAATCGGGAGAATCTCGAGGAGATTTATCGGATATACCTGGTACGCGTATGACAGGTTCCCTACGGAATGCAGAGGTTGTGGACTGCACTATGAAACATTCTACAAACCTCGGAAGCACCCAAAAAACGGACATTGCTGGTGACGGGCACCATGGAACGATTGAGAAAGGCAATGAAGACGCCTTATTAATAATCTCTAAAGCTAAGCAATCAGATAACAGCATAGGAGAAGacttggaaaaaagaaaatccgCAAGTGGTCCACTGTTTACGAAAATTAGCGATGAACGAGAAGTTCCTTCCGCTTCCTCTCTTCTTTCCCTCGCCATAAATGAAGTTGTTGGGACATTCTACAAATCGCCTGCAGAAGAAGATGGGAGGGCGTCGAATGAAGACAATGCCCAGGGAGTCAAAGACAATGGAAAAGAAATTCAGGCCCGTGCAGCTTCCATTGTTGACGATGTAGGGAATGTGCTGGGCTCTGCAATCGGCGTTAGTGAAATTGATGGTACATCATTGTCTACTAAACCGTCTACAGTAGATGGGAAGACGCCGAATGAAAGCACCGCCAAGATGGAGGGAGAGGAATGTGGGACTGAAAAAGAAACTCGGGGCAGCACATCATCACCTACGGTTGTCGAAGTTAGTAGTGAAGTGTCCCCTGCTGCAAGCGATGTCCTGAACGAAGTTGATGGCACGTTATCTACCAAACCGCCAACAGAAGATAGGAAGATGTCAAATGAAGATCTTGCCCATGAAGTGGAAGAAAAATTTGAGACTGAAAGAGGAATCCAGGCGAGCTGCCCCTCTCCCATTGTTGACGAAGCAAGCAATGTAGCGTCTTCTATTACAACGGACGTCTTAAGTGAAGTTGATGGAACATTCTATACTAACCCGTCCACAGTTGATGGAAATCCACCGATAGAAGGCATTTCCCAGGGAGATGTAGAGAAATGTGGAACTGAAAAAGAAATCCAGGGCAGAACATTAACTCCCACCGTTGTCGAAGTAAGAAATGAAGTGCCTCCTCCTGCTTCGAAGACTGGTTACGAGAAGACTCAGAGTGAAGCTTCTGCACAAAGAGGCGCCGAGGAATGTGATCCTGTTAAGGAAATCCAGGCCTTAACAGCTCCCCCCAGTCGCGAGCTGAGTAATGTAGCGCCCCCTGCTTCAGCTGACTGTTTCGATAAAGCTTCTTCCAAGCTATGTGCTGAAGACAGTGCGAAGACATCAACCAATCCAACCTCAGAGCACCATGGAAAGGGGACACCCGAATGCTCAACGGAAAATGTCAGTGAGATTGAGAGCCAAACACCTGGAGACAAGCCAAGTGCAGAACAGTGTGAAATAGCGAGCTTGCTGAACGAGCTAGCAGAGTTCGGCAAAAATCTTCGGCCGTACGAACGATACGCACGTTCAGCTAACTTGGGAAACTTGTTTACAAATTTACCCGAGAAACCGAAAGCGCCAGAGAAGAGTACATCTGTGCTGCCTAATGTATCACCAGGAAAGAAAGAAACTCCAAATCCTCTGCCTGTCTCAGGAAACAGTTCAAGTGACAAGTGTGCAGAAAACAGTCCCCGTACGAACAAAATGGAAGAACAAATAGAATCATCCAACAAAAAGAAGCAAAGCACAAATGCTTGCACAACATGCAACAAGGAATTCCCAAGTTCACAACAGCTGCGGCTGCACACTTACTTCCAGCACCAACTCGACCTTGAATCAAAGGACCCAAAAATTAAATGTCGTGAATGTTCGTTTGAATCTTCAAACCAGTTGGTTCTCTGTGATCATGAAAGTACCCACATGGGCCTAGTACCCCTTCAAAGACTGTTTTGCCTCTTTGGTCAAGTTAGACAGGTTTTAAGATTTCCAAAAGACGGTAGTGCAGAAACAAAGTAG
Protein Sequence
MPAPQNYCTWCRRHGEWSIFNGGHLRICRFYRCKSCNICDGNAQSSKAALPKDVSGFCRRCGNHGMQVLDKPWHATNCRYGACPCTRCMGFCVFCMFHDIEKYDHRQHSQACKFRGSCLCSRCAGERRRMGVDLKESPTKAAQEAENKKEKTVTDETRVIEPGRTSATTIKNVPPSQSSACFRELCVFCKLHGQEKEDNGQHAQFCKLRNQCPCGLCLGERRRQGLVVKDSSAEVENLIATDQPRQIEQGKTQAEEIKNVRNVSSSQPSGCRSSGSRPLCVFCKFHDQEKFDVGMHAKFCKFRGECACSCCVEERQRLGLDAKDSSAKAEKPAEEKPEEPTVDQPRCSQQKAQASLHTTKTDTSKTKNQPALQTADKRKGDRIVLPEPKRAHIERKPYCLYCLNHGWQFPMKSHHVLVCMYRNCRCFACRTVRSELKALKTCNYTFSPPKKSVAVAPPNPRTVPNQSLMNSEKSANEPTLGEKASSVKPALKDSAFKSDTNTAEEPAPTVKLDKSAPTCEARNVSVSLHTAEQSTSDKIDNHIVPKAKKMLKDSSLQENLNGEGLPEEEAAETTMREKAAVCEVVVVNGEDFAVVDEVGADTSSDEISSSQETLPDLSTSNSNTILSHDLAKQTPFKDEEVNECPAGSSLEHALNSVAGLKEISDINPNLNGTSAFCTKPQGDTTSTDRVVRESGESRGDLSDIPGTRMTGSLRNAEVVDCTMKHSTNLGSTQKTDIAGDGHHGTIEKGNEDALLIISKAKQSDNSIGEDLEKRKSASGPLFTKISDEREVPSASSLLSLAINEVVGTFYKSPAEEDGRASNEDNAQGVKDNGKEIQARAASIVDDVGNVLGSAIGVSEIDGTSLSTKPSTVDGKTPNESTAKMEGEECGTEKETRGSTSSPTVVEVSSEVSPAASDVLNEVDGTLSTKPPTEDRKMSNEDLAHEVEEKFETERGIQASCPSPIVDEASNVASSITTDVLSEVDGTFYTNPSTVDGNPPIEGISQGDVEKCGTEKEIQGRTLTPTVVEVRNEVPPPASKTGYEKTQSEASAQRGAEECDPVKEIQALTAPPSRELSNVAPPASADCFDKASSKLCAEDSAKTSTNPTSEHHGKGTPECSTENVSEIESQTPGDKPSAEQCEIASLLNELAEFGKNLRPYERYARSANLGNLFTNLPEKPKAPEKSTSVLPNVSPGKKETPNPLPVSGNSSSDKCAENSPRTNKMEEQIESSNKKKQSTNACTTCNKEFPSSQQLRLHTYFQHQLDLESKDPKIKCRECSFESSNQLVLCDHESTHMGLVPLQRLFCLFGQVRQVLRFPKDGSAETK*
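
The relationship between the coding sequence and the protein sequence can be checked with a short translation routine. This is a minimal sketch assuming the standard genetic code; `translate` is a hypothetical helper, and the input is upper-cased first because the coding sequence contains lower-case stretches. Only the first twelve codons of the coding sequence are used here for brevity.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order for each position.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; unknown codons become 'X'."""
    seq = "".join(cds.split()).upper()  # drop line breaks, ignore soft-masking
    usable = len(seq) - len(seq) % 3    # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 36 nucleotides of the Coding Sequence field.
prefix = "ATGCCAGCACCCCAGAATTACTGCACTTGGTGCCGG"
print(translate(prefix))
```

The same function applied to the full coding sequence (with its two lines joined) should reproduce the protein sequence, including the terminal `*` for the stop codon.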

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-