Basic Information

Gene Symbol
dsf
Assembly
GCA_014622435.1
Location
JACWBX010004182.1:39343-47534[+]

Transcription Factor Domain

TF Family
RXR-like
Domain
zf-C4|RXR-like
PFAM
AnimalTFDB
TF Group
Zinc-Coordinating Group
Description
DNA-binding domain of retinoid X receptor (RXR) is composed of two C4-type zinc fingers. Each zinc finger contains a group of four Cys residues which co-ordinates a single zinc atom. RXR functions as a DNA binding partner by forming heterodimers with other nuclear receptors including CAR, FXR, LXR, PPAR, PXR, RAR, TR, and VDR. All RXR heterodimers preferentially bind response elements composed of direct repeats of two AGGTCA sites with a 1-5 bp spacer. RXRs can play different roles in these heterodimers. RXR acts either as a structural component of the heterodimer complex, required for DNA binding but not acting as a receptor, or as both a structural and a functional component of the heterodimer, allowing 9-cis RA to signal through the corresponding heterodimer. In addition, RXR can also form homodimers, functioning as a receptor for 9-cis RA, independently of other nuclear receptors. Like other members of the nuclear receptor (NR) superfamily of ligand-activated transcription factors, RXR has a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a flexible hinge and a C-terminal ligand binding domain (LBD). [cite:PUB00121610], [cite:PUB00121611], [cite:PUB00121612], [cite:PUB00025660], [cite:PUB00121613], [cite:PUB00092048], [cite:PUB00092725], [cite:PUB00092726], [cite:PUB00016724], [cite:PUB00059514PMID:18971932
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 2 0.085 2.1e+02 1.5 0.1 128 164 136 172 114 199 0.75
2 2 1.6e-55 3.9e-52 176.9 0.9 2 199 378 567 377 570 0.92

Sequence Information

Coding Sequence
ATGGGAACAGGCGATAGACTGTTAGATATTCCATGCAAAGTATGTGGTGATAGAAGCAGTGGAAAGCATTATGGAATTTATTCATGTGATGGTTGTTCAGGTTTCTTCAAGAGAAGCATTCATAGAAATAGAATCTACACATGTAAAGCAGCTGGAGAACTAAAAGGTCGTTGTCCAGTTGATAAAACTCATAGAAACCAGTGTAGAGCTTGTCGTTTAGCAAAATGTTTCCAATCAGCTATGAATAAAGATGCTGTTCAACATGAACGCGGTCCCAGGAAGCCAAAATTACAACATTTGCATACACAATTAGCTCATCATCCACCGCATCATCATCACCATCATCATCATGGTTTCAATCCAATGGCAATAAATTCACAGTCATATAATCATCACAATCATCATGCAAGTGCAACACATTTCACGCAATTAAATACTTTTCAACAACACTCAACAATTCTGCATCATCCTCAACCACTTTCTTTTGGAACTCTTCTTCATCCGACACCAATTTTACCCAATCAACATGCATCATTCACAACTGCCACAAGCATTATGTCAAAGTTTACTGAACAACCAAAAATTTCAACAAATTTCGACTTTCCATCAATCAATAGCAATTTTTACAAACCACTCAATCAGCAAATGCAGCAACTTTCACCAACAAATTCAAAAAGCACCATCAGTATCGATTCATCGTCTTCTGCATCAACTGTTTCTGAGCGAATAATGTCTCCACATTCGACTGTTGATTCACACTCTGGCAATAATGTAACTCCACCACCACCACCCATTTCAACAACACCTAATGCTAATGCGAATCATAGCCAAAGTATTACAGCTATTGAACCAAAAGTGTCAGCACAATATTCTGTCTCATCTACATCATCCATTTCGCCATTAGCGTCGATTACAAATGGAAATGTGTCGTCACCAGTATCGAGTCAACAAACCGCACGAGCCAACATGCAAAATGGCTTGTTGGACATTCTAATGTGTCCGGATAAATATCAAGAGCTGATTCAGTATCATCAAGTCCAAAATTCAATTATCTTTCCATCCTTGCATCATGCTGCGTATCCAATTGATGCTGCTGCATTAAATCCAAGATTACCAACGTGGGAATTTCTGCAGGAAACAACTGCGCGTCTTTTATTCATGGCAGTACGTTGGGTTCGTTGTTTAGTGCCCTTTCAAACACTTTCAAAAAGCGACCAACAGCTGCTGCTTCAAGAATCATGGAAGGAACTTTTTCTATTAAATTTTGCTCAATGGAGTGTTCCTTGGGATTTAGGTACGCTTTTTGATAGTCCACAAGTTCGTGAGAGATTGCCAGATGATACAACAACTTTAGTTGAAATTAAAACTATTCAGGAAATTCTTTGTCGATTTAGACAATTATCACCTGATTCAAGTGAAGTTGGATGCATGAAAGCTGTCATTCTTTTTTCACCAGAAACGACAAGTTTATGTGACGTTCAACCAGTTGAAATGCTACAAGATCAAGCACAATGCATTCTCTCAGATCATGTGAGGTTGAGATATCCAAGGCAGCCAACTCGATTTGGTCGATTGCTTCTACTACTTCCATCACTTCGGGCAATTCGTTCAACATCAATTGAAATGTTGTTTTTCAAGGAAACAATTGGAAATGTGCCGATCACTAGATTATTGGGAGACATGTATTTAATGGAGAAATATAATAATAACAATAGCAATGTTAATAATAGCAGCAGTCCAAGCACACAAAGCGATAATAATAATAATACAAATAATAGCAATGCAAACAGCGTTATCAATAGTGAAGCATATTAA
Protein Sequence
MGTGDRLLDIPCKVCGDRSSGKHYGIYSCDGCSGFFKRSIHRNRIYTCKAAGELKGRCPVDKTHRNQCRACRLAKCFQSAMNKDAVQHERGPRKPKLQHLHTQLAHHPPHHHHHHHHGFNPMAINSQSYNHHNHHASATHFTQLNTFQQHSTILHHPQPLSFGTLLHPTPILPNQHASFTTATSIMSKFTEQPKISTNFDFPSINSNFYKPLNQQMQQLSPTNSKSTISIDSSSSASTVSERIMSPHSTVDSHSGNNVTPPPPPISTTPNANANHSQSITAIEPKVSAQYSVSSTSSISPLASITNGNVSSPVSSQQTARANMQNGLLDILMCPDKYQELIQYHQVQNSIIFPSLHHAAYPIDAAALNPRLPTWEFLQETTARLLFMAVRWVRCLVPFQTLSKSDQQLLLQESWKELFLLNFAQWSVPWDLGTLFDSPQVRERLPDDTTTLVEIKTIQEILCRFRQLSPDSSEVGCMKAVILFSPETTSLCDVQPVEMLQDQAQCILSDHVRLRYPRQPTRFGRLLLLLPSLRAIRSTSIEMLFFKETIGNVPITRLLGDMYLMEKYNNNNSNVNNSSSPSTQSDNNNNTNNSNANSVINSEAY

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01352691;
90% Identity
iTF_01249176;
80% Identity
iTF_01249176;