Basic Information

Gene Symbol
USP
Assembly
GCA_034621355.1
Location
CM067883.1:12025738-12056423[+]

Transcription Factor Domain

TF Family
THR-like
Domain
zf-C4|THR-like
PFAM
AnimalTFDB
TF Group
Zinc-Coordinating Group
Description
DNA-binding domain of thyroid hormone receptors (TRs) is composed of two C4-type zinc fingers. Each zinc finger contains a group of four Cys residues which co-ordinates a single zinc atom. TR interacts with the thyroid response element, which is a DNA site with direct repeats of the consensus sequence 5'-AGGTCA-3' separated by one to five base pairs, upstream of target genes and modulates the rate of transcriptional initiation. Thyroid hormone receptor (TR) mediates the actions of thyroid hormones, which play critical roles in growth, development, and homeostasis in mammals. They regulate overall metabolic rate, cholesterol and triglyceride levels, and heart rate, and affect mood. TRs are expressed from two separate genes (alpha and beta) in human and each gene generates two isoforms of the receptor through differential promoter usage or splicing. TRalpha functions in the heart to regulate heart rate and rhythm and TRbeta is active in the liver and other tissues. The unliganded TRs function as transcription repressors, by binding to thyroid hormone response elements (TRE) predominantly as homodimers, or as heterodimers with retinoid X-receptors (RXR), and being associated with a complex of proteins containing corepressor proteins. Ligand binding promotes corepressor dissociation and binding of a coactivator to activate transcription. Like other members of the nuclear receptor (NR) superfamily of ligand-activated transcription factors, TR has a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a flexible hinge and a C-terminal ligand binding domain (LBD). [4, 3, 5, 2, 7, 1, 6]
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 22 1.7e-10 2.4e-07 31.2 0.0 14 131 198 328 191 340 0.78
2 22 0.0034 4.8 7.4 0.0 97 131 351 385 329 408 0.82
3 22 0.0073 10 6.3 0.0 97 131 408 442 386 464 0.83
4 22 0.007 10 6.3 0.0 97 132 465 500 444 523 0.82
5 22 0.0076 11 6.2 0.0 97 131 522 556 501 578 0.83
6 22 0.0056 8 6.6 0.0 96 132 578 614 551 637 0.79
7 22 0.0063 9 6.5 0.0 97 132 636 671 612 694 0.81
8 22 0.0078 11 6.2 0.0 97 132 693 728 675 751 0.83
9 22 0.0081 11 6.1 0.0 97 132 750 785 733 808 0.84
10 22 0.0077 11 6.2 0.0 97 131 807 841 787 864 0.83
11 22 0.0075 11 6.2 0.0 97 131 864 898 838 910 0.81
12 22 0.0073 10 6.3 0.0 97 131 921 955 899 977 0.83
13 22 0.0093 13 5.9 0.0 97 131 978 1012 959 1026 0.85
14 22 0.0084 12 6.1 0.0 97 131 1035 1069 1013 1081 0.83
15 22 0.0049 6.9 6.8 0.0 96 131 1132 1167 1071 1189 0.81
16 22 0.0055 7.8 6.7 0.1 97 132 1190 1225 1167 1275 0.79
17 22 0.15 2.1e+02 2.0 0.1 97 123 1259 1285 1233 1293 0.85
18 22 0.039 56 3.9 0.0 99 131 1280 1312 1276 1333 0.85
19 22 0.0065 9.2 6.4 0.0 97 132 1335 1370 1312 1393 0.81
20 22 0.018 26 5.0 0.0 96 124 1391 1419 1365 1433 0.79
21 22 0.0091 13 6.0 0.0 97 131 1442 1476 1421 1484 0.83
22 22 8e-14 1.1e-10 42.0 0.1 98 183 1500 1582 1479 1592 0.90

Sequence Information

Coding Sequence
ATGCGTTCAGTAGTTAACCGGCTGAACATAGAGGCGGGGTTCATGTCGCCCATGTCGCCGCCGGAGATGAAGCCGGACACAGCGATGCTGGACGGCATGCGCGACGACGCCACGTCCCCGCCGGCCATGAGGAACTACCCCCCGAACCACCCCCTCAGCGGCTCCAAACACCTCTGCTCTATATGCGGAGACAGGGCGTCGGGGAAACACTACGGAGTTTATAGTTGCGAAGGCTGCAAGGGCTTCTTCAAGAGGACCGTGAGGAAGGACCTGACGTACGCGTGCCGCGAGGAGAGGAACTGTATAATCGACAAGCGGCAGCGGAATCGATGTCAGTACTGCCGCTATCAGAAGTGCCTGGTTTGCGGCATGAAGCGCGAGGCCGTGCAAGAGGAGCGGCAGCGGGCGGCCCGCGGCGCCGAGGACGCGCACCCGAGCAGTTCTGTACAGGTAACGGAGTTATCAATCGAGCGGTTGCTAGAGATGGAGTCGCTGGTTGCTGACCCTCCCGAGGAGTTTCAGTTCCTGCGCGTGGGCCCCGACAGCAACGTGCCGGCGCGGTACCGAGCGCCCGTCTCCAGTTTGTGTCAAATAGGGAACAAGCAGATCGCCGCACTAATGGTGTGGGCGCGCGATATTCCTCACTTCAGCCAACTGGAGCTAGACGACCAGGTGTTGCTGCTCAAAGCGTCGTGGAACGAACTACTGCTGTTCGCCTTCGCCTGGCGCTCTATGGAGTACCTTGAAGATGAACGAGAGAACATGGACGGCACGAGaagcgccgcgccgccgcagtTGATGTGCCTGATGCCTGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGACCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGCCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGATGTGAAAGGGCTGAAAAGCCGTTCAGACGTCGATGTTTTAAGAGAAAAGATATTCTCATGCTTGGACGAATACTGTCGTCGAGCACACAGTTCAGAGGAAGGCAGGTTCGCTTCGCTGCTGCTGAGGCTACCGGCGTTGCGCTCCATCTCGCTGAAGAGCTTCGAGCATCTGTTCTTCTTCCACCTGATCGCGGAGGGGAGCATCGGGAACTACATCCGGGAGGCGCTGCGCAACCACGCGCCGCCCATCGACACCAGCACCATGTTGTAA
Protein Sequence
MRSVVNRLNIEAGFMSPMSPPEMKPDTAMLDGMRDDATSPPAMRNYPPNHPLSGSKHLCSICGDRASGKHYGVYSCEGCKGFFKRTVRKDLTYACREERNCIIDKRQRNRCQYCRYQKCLVCGMKREAVQEERQRAARGAEDAHPSSSVQVTELSIERLLEMESLVADPPEEFQFLRVGPDSNVPARYRAPVSSLCQIGNKQIAALMVWARDIPHFSQLELDDQVLLLKASWNELLLFAFAWRSMEYLEDERENMDGTRSAAPPQLMCLMPGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQTEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAAVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPDVKGLKSRSDVDVLREKIFSCLDEYCRRAHSSEEGRFASLLLRLPALRSISLKSFEHLFFFHLIAEGSIGNYIREALRNHAPPIDTSTML

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01021283;
90% Identity
iTF_01021283;
80% Identity
iTF_01021283;