Mmin003021.2
Basic Information
- Insect
- Mycalesis mineus
- Gene Symbol
- USP
- Assembly
- GCA_034621355.1
- Location
- CM067883.1:12025738-12056423[+]
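The Location value above can be split into its parts programmatically. The following is a minimal sketch, assuming the string follows a `seqid:start-end[strand]` layout (inferred from this one record) and that coordinates are 1-based and inclusive, which the page does not state. The function name `parse_location` and the `span` field are hypothetical.

```python
import re

# Assumed layout: <sequence accession>:<start>-<end>[<strand>]
LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)

def parse_location(text):
    """Split a location string into seqid, start, end, strand and span."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start = int(match["start"])
    end = int(match["end"])
    return {
        "seqid": match["seqid"],
        "start": start,
        "end": end,
        "strand": match["strand"],
        # Span assumes 1-based inclusive coordinates (an assumption).
        "span": end - start + 1,
    }

loc = parse_location("CM067883.1:12025738-12056423[+]")
print(loc)
```

Under the inclusive-coordinate assumption, the locus spans 30,686 bp on the plus strand of CM067883.1.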
Transcription Factor Domain
- TF Family
- RXR-like
- Domain
- zf-C4|RXR-like
- PFAM
- AnimalTFDB
- TF Group
- Zinc-Coordinating Group
- Description
- The DNA-binding domain of the retinoid X receptor (RXR) is composed of two C4-type zinc fingers. Each zinc finger contains a group of four Cys residues that coordinates a single zinc atom. RXR functions as a DNA-binding partner by forming heterodimers with other nuclear receptors, including CAR, FXR, LXR, PPAR, PXR, RAR, TR, and VDR. All RXR heterodimers preferentially bind response elements composed of direct repeats of two AGGTCA sites with a 1-5 bp spacer. RXR can play different roles in these heterodimers: it acts either as a structural component of the heterodimer complex, required for DNA binding but not acting as a receptor, or as both a structural and a functional component of the heterodimer, allowing 9-cis RA to signal through the corresponding heterodimer. In addition, RXR can form homodimers, functioning as a receptor for 9-cis RA independently of other nuclear receptors. Like other members of the nuclear receptor (NR) superfamily of ligand-activated transcription factors, RXR has a central, well-conserved DNA-binding domain (DBD), a variable N-terminal domain, a flexible hinge, and a C-terminal ligand-binding domain (LBD). [cite:PUB00121610], [cite:PUB00121611], [cite:PUB00121612], [cite:PUB00025660], [cite:PUB00121613], [cite:PUB00092048], [cite:PUB00092725], [cite:PUB00092726], [cite:PUB00016724], [cite:PUB00059514], PMID:18971932
- Hmmscan Out
-
 #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
 1  23   4.1e-33   2.1e-29  103.5   0.3         1     115       195     321       195     326  0.96
 2  23   1.6e-12   8.2e-09   36.3   0.0        61     115       327     378       323     384  0.94
 3  23   3.9e-12     2e-08   35.0   0.0        61     115       384     435       380     439  0.95
 4  23   3.6e-12   1.8e-08   35.1   0.0        61     115       441     492       438     499  0.94
 5  23   3.4e-12   1.7e-08   35.2   0.0        61     115       498     549       493     555  0.94
 6  23   3.9e-12     2e-08   35.0   0.0        61     115       555     606       551     610  0.95
 7  23   3.4e-12   1.7e-08   35.2   0.0        61     115       612     663       607     669  0.94
 8  23   3.6e-12   1.8e-08   35.2   0.0        61     115       669     720       665     726  0.94
 9  23   3.6e-12   1.8e-08   35.2   0.0        61     115       726     777       721     782  0.94
10  23   3.6e-12   1.8e-08   35.2   0.0        61     115       783     834       778     839  0.94
11  23   3.7e-12   1.8e-08   35.1   0.0        61     115       840     891       836     896  0.95
12  23   3.7e-12   1.8e-08   35.1   0.0        61     115       897     948       893     953  0.95
13  23   3.3e-12   1.6e-08   35.3   0.0        61     115       954    1005       949    1012  0.94
14  23   2.5e-12   1.2e-08   35.7   0.0        61     115      1011    1062      1006    1081  0.91
15  23      0.02        99    3.3   0.0       100     115      1088    1103      1085    1108  0.91
16  23   3.4e-12   1.7e-08   35.2   0.0        61     115      1109    1160      1105    1167  0.94
17  23   2.5e-12   1.2e-08   35.7   0.0        61     115      1166    1217      1161    1236  0.91
18  23   9.9e-10   4.9e-06   27.2   0.1        73     114      1243    1285      1220    1286  0.71
19  23   0.00018      0.88   10.0   0.0        94     115      1284    1305      1283    1309  0.93
20  23   3.3e-12   1.6e-08   35.3   0.0        61     115      1311    1362      1306    1369  0.94
21  23   3.4e-12   1.7e-08   35.2   0.0        61     115      1368    1419      1363    1423  0.94
22  23   2.9e-12   1.4e-08   35.5   0.0        59     115      1416    1469      1416    1474  0.95
23  23   5.3e-38   2.6e-34  119.5   0.0        61     199      1475    1608      1470    1610  0.96
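The per-domain hmmscan rows above are whitespace-separated records of 13 fields each, so they can be read back into typed records even when the layout has been flattened. The sketch below is illustrative only: the `DomainHit` type, the `parse_hits` helper, and the `1e-5` i-Evalue cutoff are assumptions, not part of the database or of HMMER. Only three rows from this record (1, 15 and 23) are embedded to keep the example short.

```python
from typing import NamedTuple

class DomainHit(NamedTuple):
    index: int       # "#" column
    total: int       # "of" column
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float

FIELDS_PER_ROW = 13

def parse_hits(rows_text):
    """Group whitespace-separated tokens into 13-field domain records."""
    tokens = rows_text.split()
    if len(tokens) % FIELDS_PER_ROW != 0:
        raise ValueError("token count is not a multiple of 13")
    hits = []
    for i in range(0, len(tokens), FIELDS_PER_ROW):
        t = tokens[i:i + FIELDS_PER_ROW]
        hits.append(DomainHit(
            int(t[0]), int(t[1]),
            float(t[2]), float(t[3]), float(t[4]), float(t[5]),
            *(int(x) for x in t[6:12]),
            float(t[12]),
        ))
    return hits

# Rows 1, 15 and 23 copied from the table in this record.
rows = """
1 23 4.1e-33 2.1e-29 103.5 0.3 1 115 195 321 195 326 0.96
15 23 0.02 99 3.3 0.0 100 115 1088 1103 1085 1108 0.91
23 23 5.3e-38 2.6e-34 119.5 0.0 61 199 1475 1608 1470 1610 0.96
"""

hits = parse_hits(rows)
I_EVALUE_CUTOFF = 1e-5  # hypothetical threshold, chosen for illustration
significant = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]
for h in significant:
    print(h.index, h.ali_from, h.ali_to, h.ali_to - h.ali_from + 1)
```

With this cutoff, domains 1 and 23 pass and domain 15 (i-Evalue 99) is dropped; a different threshold may suit other analyses.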
Sequence Information
- Coding Sequence
- ATGCGTTCAGTAGTTAACCGGCTGAACATAGAGGCGGGGTTCATGTCGCCCATGTCGCCGCCGGAGATGAAGCCGGACACAGCGATGCTGGACGGCATGCGCGACGACGCCACGTCCCCGCCGGCCATGAGGAACTACCCCCCGAACCACCCCCTCAGCGGCTCCAAACACCTCTGCTCTATATGCGGAGACAGGGCGTCGGGGAAACACTACGGAGTTTATAGTTGCGAAGGCTGCAAGGGCTTCTTCAAGAGGACCGTGAGGAAGGACCTGACGTACGCGTGCCGCGAGGAGAGGAACTGTATAATCGACAAGCGGCAGCGGAATCGATGTCAGTACTGCCGCTATCAGAAGTGCCTGGTTTGCGGCATGAAGCGCGAGGCCGTGCAAGAGGAGCGGCAGCGGGCGGCCCGCGGCGCCGAGGACGCGCACCCGAGCAGTTCTGTACAGGTAACGGAGTTATCAATCGAGCGGTTGCTAGAGATGGAGTCGCTGGTTGCTGACCCTCCCGAGGAGTTTCAGTTCCTGCGCGTGGGCCCCGACAGCAACGTGCCGGCGCGGTACCGAGCGCCCGTCTCCAGTTTGTGTCAAATAGGGAACAAGCAGATCGCCGCACTAATGGTGTGGGCGCGCGATATTCCTCACTTCAGCCAACTGGAGCTAGACGACCAGGTGTTGCTGCTCAAAGCGTCGTGGAACGAACTACTGCTGTTCGCCTTCGCCTGGCGCTCTATGGAGTACCTTGAAGATGAACGAGAGAACATGGACGGCACGAGaagcgccgcgccgccgcagtTGATGTGCCTGATGCCTGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGACCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTAC
AACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGT
GCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGCCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGATGTGAAAGGGCTGAAAAGCCGTTCAGACGTCGATGTTTTAAGAGAAAAGATATTCTCATGCTTGGACGAATACTGTCGTCGAGCACACAGTTCAGAGGAAGGCAGGTTCGCTTCGCTGCTGCTGAGGCTACCGGCGTTGCGCTCCATCTCGCTGAAGAGCTTCGAGCATCTGTTCTTCTTCCACCTGATCGCGGAGGGGAGCATCGGGAACTACATCCGGGAGGCGCTGCGCAACCACGCGCCGCCCATCGACACCAGCACCATGTTGTAA
- Protein Sequence
- MRSVVNRLNIEAGFMSPMSPPEMKPDTAMLDGMRDDATSPPAMRNYPPNHPLSGSKHLCSICGDRASGKHYGVYSCEGCKGFFKRTVRKDLTYACREERNCIIDKRQRNRCQYCRYQKCLVCGMKREAVQEERQRAARGAEDAHPSSSVQVTELSIERLLEMESLVADPPEEFQFLRVGPDSNVPARYRAPVSSLCQIGNKQIAALMVWARDIPHFSQLELDDQVLLLKASWNELLLFAFAWRSMEYLEDERENMDGTRSAAPPQLMCLMPGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQTEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAAVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPDVKGLKSRSDVDVLREKIFSCLDEYCRRAHSSEEGRFASLLLRLPALRSISLKSFEHLFFFHLIAEGSIGNYIREALRNHAPPIDTSTML
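The coding and protein sequences above can be cross-checked by translating the CDS. The sketch below assumes the standard genetic code (the record does not name a translation table) and uses a hypothetical `translate` helper; it uppercases its input so that lowercase bases, as seen in part of the CDS, are handled. Only the first 39 and last 21 bases of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption:
# the record uses the standard table).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGCGTTCAGTAGTTAACCGGCTGAACATAGAGGCGGGG"  # first 39 bases
cds_end = "GACACCAGCACCATGTTGTAA"                      # last 21 bases

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first 13 and last 6 residues of the listed protein sequence, which suggests the two fields are consistent at both ends.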
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01021283;
- 90% Identity
- iTF_01021283;
- 80% Identity
- iTF_01021283;