Basic Information

Insect
Agrotis puta
Gene Symbol
Hr78
Assembly
GCA_943137145.2
Location
CALPBO020000074.1:101321-114956[+]
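The Location value packs contig, coordinates and strand into one string. A minimal sketch of splitting it apart follows; the `Locus` type and both function names are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "CONTIG:START-END[STRAND]".
_LOCUS_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_locus(text: str) -> Locus:
    """Split a location string into contig, start, end and strand."""
    match = _LOCUS_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(
        contig=match["contig"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


def locus_length(locus: Locus) -> int:
    """Span in bp, ASSUMING 1-based inclusive coordinates."""
    return locus.end - locus.start + 1


locus = parse_locus("CALPBO020000074.1:101321-114956[+]")
print(locus, locus_length(locus))
```

Under that assumption the gene spans 13636 bp on the forward strand of the contig.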

Transcription Factor Domain

TF Family
RXR-like
Domain
zf-C4|RXR-like
PFAM
AnimalTFDB
TF Group
Zinc-Coordinating Group
Description
The DNA-binding domain of the retinoid X receptor (RXR) is composed of two C4-type zinc fingers. Each zinc finger contains a group of four Cys residues that coordinates a single zinc atom. RXR functions as a DNA-binding partner by forming heterodimers with other nuclear receptors, including CAR, FXR, LXR, PPAR, PXR, RAR, TR, and VDR. All RXR heterodimers preferentially bind response elements composed of direct repeats of two AGGTCA sites separated by a 1-5 bp spacer. RXR can play different roles in these heterodimers: it acts either as a structural component of the complex, required for DNA binding but not acting as a receptor, or as both a structural and a functional component, allowing 9-cis RA to signal through the corresponding heterodimer. In addition, RXR can form homodimers, functioning as a receptor for 9-cis RA independently of other nuclear receptors. Like other members of the nuclear receptor (NR) superfamily of ligand-activated transcription factors, RXR has a central, well-conserved DNA-binding domain (DBD), a variable N-terminal domain, a flexible hinge, and a C-terminal ligand-binding domain (LBD). [cite:PUB00121610], [cite:PUB00121611], [cite:PUB00121612], [cite:PUB00025660], [cite:PUB00121613], [cite:PUB00092048], [cite:PUB00092725], [cite:PUB00092726], [cite:PUB00016724], [cite:PUB00059514] PMID:18971932
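The response-element layout described above (two AGGTCA half-sites in direct repeat, 1-5 bp apart) can be illustrated with a small scanner. This is only a sketch under simplifying assumptions: it requires exact matches to the consensus half-site, searches the given strand only, and the function name `find_direct_repeats` is hypothetical. Real binding sites tolerate mismatches, so this is not a binding-site predictor.

```python
import re

HALF_SITE = "AGGTCA"


def find_direct_repeats(seq: str, min_spacer: int = 1, max_spacer: int = 5):
    """Return sorted (start, spacer_length) pairs for exact direct repeats.

    A hit is HALF_SITE + N bases + HALF_SITE with N in the spacer range.
    Lookahead matching is used so overlapping hits are all reported.
    """
    seq = seq.upper()
    hits = []
    for n in range(min_spacer, max_spacer + 1):
        pattern = re.compile(f"(?={HALF_SITE}[ACGT]{{{n}}}{HALF_SITE})")
        hits.extend((m.start(), n) for m in pattern.finditer(seq))
    return sorted(hits)


# Synthetic test sequences, not taken from the record.
print(find_direct_repeats("TTAGGTCAAAGGTCACC"))   # DR1 starting at offset 2
print(find_direct_repeats("AGGTCACTGAAGGTCA"))    # DR4 starting at offset 0
```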
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 15 0.048 1.3e+02 2.5 0.0 1 26 270 295 270 297 0.94
2 15 0.048 1.3e+02 2.5 0.0 1 26 397 422 397 424 0.94
3 15 0.048 1.3e+02 2.5 0.0 1 26 524 549 524 551 0.94
4 15 0.048 1.3e+02 2.5 0.0 1 26 651 676 651 678 0.94
5 15 0.048 1.3e+02 2.5 0.0 1 26 778 803 778 805 0.94
6 15 0.048 1.3e+02 2.5 0.0 1 26 905 930 905 932 0.94
7 15 0.048 1.3e+02 2.5 0.0 1 26 1032 1057 1032 1059 0.94
8 15 0.048 1.3e+02 2.5 0.0 1 26 1159 1184 1159 1186 0.94
9 15 0.048 1.3e+02 2.5 0.0 1 26 1286 1311 1286 1313 0.94
10 15 0.048 1.3e+02 2.5 0.0 1 26 1413 1438 1413 1440 0.94
11 15 0.048 1.3e+02 2.5 0.0 1 26 1540 1565 1540 1567 0.94
12 15 0.048 1.3e+02 2.5 0.0 1 26 1667 1692 1667 1694 0.94
13 15 0.048 1.3e+02 2.5 0.0 1 26 1794 1819 1794 1821 0.94
14 15 0.048 1.3e+02 2.5 0.0 1 26 1921 1946 1921 1948 0.94
15 15 1.1e-24 3.1e-21 76.4 0.8 25 196 1992 2180 1987 2182 0.79
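Each row of the domain table has 13 whitespace-separated fields, so it can be read into a typed record and filtered by independent E-value. The sketch below uses two rows copied from the table; the `DomainHit` type, the function names and the 1e-3 cutoff are illustrative assumptions, not settings used by the database.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    index: int
    of: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_row(line: str) -> DomainHit:
    """Parse one 13-field row of the per-domain table."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        int(f[6]), int(f[7]), int(f[8]), int(f[9]), int(f[10]), int(f[11]),
        float(f[12]),
    )


def significant(hits, max_i_evalue: float = 1e-3):
    """Keep hits at or below an i-Evalue cutoff (1e-3 is an assumed default)."""
    return [h for h in hits if h.i_evalue <= max_i_evalue]


rows = [
    "1 15 0.048 1.3e+02 2.5 0.0 1 26 270 295 270 297 0.94",
    "15 15 1.1e-24 3.1e-21 76.4 0.8 25 196 1992 2180 1987 2182 0.79",
]
hits = [parse_row(r) for r in rows]
for hit in significant(hits):
    print(hit.index, hit.ali_from, hit.ali_to, hit.i_evalue)
```

With this cutoff only domain 15 (alignment positions 1992-2180) is retained; the short hits with an i-Evalue of 1.3e+02 are dropped.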

Sequence Information

Coding Sequence
ATGGAAGGTCAGGACCAAATGGATGTGAAGTTTGGCGGTGAGGTAGGGGGGTTAGAGCTCTGCATTGTATGCGGAGACAGGGCTTCTGGAAGACACTATGGAGCCATCAGCTGTGAAGGCTGCAAAGGTTTCTTCAAGCGCTCAATCCGCAAGAAGCTGGGCTACCAGTGCCGCGGAAGCATGAACTGTGAGGTGACAAAGCACCACCGGAATCGCTGCCAGTACTGCAGGCTGCAGAAATGCCTCGCGTGCGGCATGAGAAGCGACTCGGTCCAGCATGAACGCAAGCCGATCGTGGACAAGTCCAAAGGGGAGCAGCGGGAAGCCTTGCACGACAGGCAGGCCGTTTACTCCAAATTCTTGGGGCTGGCGGGACAGGCTCAAGGTGGACAGATCAATCCTAAGGAAGAGCCGAGCGACTCTGCGTTCGGCGCGGTGTCTCCTGCGCCTGCGATCAACTTCGCGCTCGCTGCAGCGGTCGCGTTCAACAAGGGTAATCCAGTGTCCCCATACCTAACTCCAGGCAACGCGGGTGATATCGAAGGCGCTCGCAGACAACAACAGTTGATGCTGCAACAACAACTAGCTAAGAATCTCTTCAAGATGGGGCAGTTTGGTGCAATCAATGAGTATCTTCAAGCTTACGGCGCGACCCCGCCAGAGGTGCCACTCGCTGCTCTACACGCCGCCGCTGAGACACAACAGACTGAAGACGCTCCAGGCGGAGGTATCCTGAGTACTCCAGAGTCACTCCAGCTGCTGGTGTCCCTCCCGGGCGCGGCCCCGGCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCG
AGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCC
AGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAGTACAGCTGCTGGTGTCACACCTGCGCCTGCACGCCGTGTGCGAGGCAGGAGCCAGGCTGCTGGCAGCGGCAGCCAGGTGGCTCCGAGCAGTGCCTGCTGCACACGCGCTACCGTATGTATCACTCAATTCGAGGTCCAAGTGACCCTC
CTAAAGAAGTGCTGGCCGGAGCTGTTCGTGCTGTCCCTGTCGATGTTCTCCCAGCAGCTGTCGCTGAACAGTCTGCTGCCCCAGATGGTGAGCCATCTGCAGGCAGTGCTGCGGGACCGAGCCAGCAACGACTGCGTGGACCGGTCACATGATATGACCATGCCagagATCACAGCGTCGGACTACTCTGACGAGCGCATAGCCGAGGTGAGCAGTAGTCTGTCGCGGCTGCAGCAGTTCATCACCAACATGGAGCAGCTGCGCGTGCAGCAGCGGGAACACGCGCATCTCAGGGCGCTGTGCTTGTTCTCACCAGACGGCGTTCCGGATTTCCTGACGCGCAAACTCCAAGACTATCAGATCAAGGTCTTACGTTCCCTCCGAGCATCATGCCAGCCGGATGAAGATCGTCTGGCTACCTTACTGCTGCAGCTCCCAGTCTTGAGAACCTTCTCTGGACCCTTCCTTGAAGATGTGTTCTTTGTCGGCTTTGTAGGAGATGTCAGTATTGATGATGTTATTCCATATTTGTTGAATGCTGAGCGCTAG
Protein Sequence
MEGQDQMDVKFGGEVGGLELCIVCGDRASGRHYGAISCEGCKGFFKRSIRKKLGYQCRGSMNCEVTKHHRNRCQYCRLQKCLACGMRSDSVQHERKPIVDKSKGEQREALHDRQAVYSKFLGLAGQAQGGQINPKEEPSDSAFGAVSPAPAINFALAAAVAFNKGNPVSPYLTPGNAGDIEGARRQQQLMLQQQLAKNLFKMGQFGAINEYLQAYGATPPEVPLAALHAAAETQQTEDAPGGGILSTPESLQLLVSLPGAAPAHLRLHAVCEAGARLLAAAARWLRAVPAAHALPYVSLSTAAGVTPAPARRVRGRSQAAGSGSQVAPSSACCTRATVCITQYSCWCHTCACTPCARQEPGCWQRQPGGSEQCLLHTRYRMYHSVQLLVSHLRLHAVCEAGARLLAAAARWLRAVPAAHALPYVSLSTAAGVTPAPARRVRGRSQAAGSGSQVAPSSACCTRATVCITQYSCWCHTCACTPCARQEPGCWQRQPGGSEQCLLHTRYRMYHSVQLLVSHLRLHAVCEAGARLLAAAARWLRAVPAAHALPYVSLSTAAGVTPAPARRVRGRSQAAGSGSQVAPSSACCTRATVCITQYSCWCHTCACTPCARQEPGCWQRQPGGSEQCLLHTRYRMYHSVQLLVSHLRLHAVCEAGARLLAAAARWLRAVPAAHALPYVSLSTAAGVTPAPARRVRGRSQAAGSGSQVAPSSACCTRATVCITQYSCWCHTCACTPCARQEPGCWQRQPGGSEQCLLHTRYRMYHSVQLLVSHLRLHAVCEAGARLLAAAARWLRAVPAAHALPYVSLSTAAGVTPAPARRVRGRSQAAGSGSQVAPSSACCTRATVCITQYSCWCHTCACTPCARQEPGCWQRQPGGSEQCLLHTRYRMYHSVQLLVSHLRLHAVCEAGARLLAAAARWLRAVPAAHALPYVSLSTAAGVTPAPARRVRGRSQAAGSGSQVAPSSACCTRATVCITQYSCWCHTCACTPCARQEPGCWQRQPGGSEQCLLHTRYRMYHSVQLLVSHLRLHAVCEAGARLLAAAARWLRAVPAAHALPYVSLSTAAGVTPAPARRVRGRSQAAGSGSQVAPSSACCTRATVCITQYSCWCHTCACTPCARQEPGCWQRQPGGSEQCLLHTRYRMYHSVQLLVSHLRLHAVCEAGARLLAAAARWLRAVPAAHALPYVSLSTAAGVTPAPARRVRGRSQAAGSGSQVAPSSACCTRATVCITQYSCWCHTCACTPCARQEPGCWQRQPGGSEQCLLHTRYRMYHSVQLLVSHLRLHAVCEAGARLLAAAARWLRAVPAAHALPYVSLSTAAGVTPAPARRVRGRSQAAGSGSQVAPSSACCTRATVCITQYSCWCHTCACTPCARQEPGCWQRQPGGSEQCLLHTRYRMYHSVQLLVSHLRLHAVCEAGARLLAAAARWLRAVPAAHALPYVSLSTAAGVTPAPARRVRGRSQAAGSGSQVAPSSACCTRATVCITQYSCWCHTCACTPCARQEPGCWQRQPGGSEQCLLHTRYRMYHSVQLLVSHLRLHAVCEAGARLLAAAARWLRAVPAAHALPYVSLSTAAGVTPAPARRVRGRSQAAGSGSQVAPSSACCTRATVCITQYSCWCHTCACTPCARQEPGCWQRQPGGSEQCLLHTRYRMYHSVQLLVSHLRLHAVCEAGARLLAAAARWLRAVPAAHALPYVSLSTAAGVTPAPARRVRGRSQAAGSGSQVAPSSACCTRATVCITQYSCWCHTCACTPCARQEPGCWQRQPGGSEQCLLHTRYRMYHSVQLLVSHLRLHAVCEAGARLLAAAARWLRAVPAAHALPYVSLSTAAGVTPAPARRVRGRSQAAGSGSQVAPSSACCTRATVCITQYSCWCHTCACTPCARQEPGCWQRQPGGSEQCLLHTRYRMYHSVQLLVSHLRLHAVCEAGARLLAAAARWLRAVPAAHALPYVSLSTAAGVTPAPARRVRGRSQAAGSGSQVAPSSACCTRATVCITQFEVQVTL
LKKCWPELFVLSLSMFSQQLSLNSLLPQMVSHLQAVLRDRASNDCVDRSHDMTMPEITASDYSDERIAEVSSSLSRLQQFITNMEQLRVQQREHAHLRALCLFSPDGVPDFLTRKLQDYQIKVLRSLRASCQPDEDRLATLLLQLPVLRTFSGPFLEDVFFVGFVGDVSIDDVIPYLLNAER
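The relationship between the coding and protein sequences can be spot-checked by translating a short stretch of the CDS. The sketch below assumes the standard genetic code and uses only the first 30 nt and the last 12 nt of the coding sequence shown above; `translate` is a hypothetical helper, not part of the record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a CDS in frame 0; unknown codons become 'X'.

    Whitespace is removed and case is ignored, so wrapped or
    soft-masked (lowercase) sequence can be passed in directly.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGAAGGTCAGGACCAAATGGATGTGAAG"))  # start of the CDS
print(translate("GCTGAGCGCTAG"))                    # end of the CDS, with stop codon
```

The two fragments translate to MEGQDQMDVK and AER, which match the first ten and last three residues of the protein sequence.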

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-