Basic Information

Gene Symbol
USP
Assembly
GCA_034621355.1
Location
CM067883.1:12025738-12056423[+]
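
The Location value packs a sequence accession, a coordinate range, and a strand into one string. The following is a minimal sketch of how such a string might be split into fields. The `Location` type, the `parse_location` helper, and the regular expression are illustrative assumptions, not part of this database. The span calculation also assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a parsed location string."""
    seqid: str
    start: int
    end: int
    strand: str


# Assumed layout: <accession>:<start>-<end>[<strand>]
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string such as 'CM067883.1:12025738-12056423[+]'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        seqid=match["seqid"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("CM067883.1:12025738-12056423[+]")
# Genomic span, assuming 1-based inclusive coordinates.
span = loc.end - loc.start + 1
print(loc, span)
```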

Transcription Factor Domain

TF Family
NGFIB-like
Domain
zf-C4|NGFIB-like
PFAM
AnimalTFDB
TF Group
Zinc-Coordinating Group
Description
During the development of the vertebrate nervous system, many neurons become redundant (because they have died, failed to connect to target cells, etc.) and are eliminated. At the same time, developing neurons send out axon outgrowths that contact their target cells [1]. These target cells control their degree of innervation (the number of axon connections) by secreting specific neurotrophic factors that are essential for neuron survival. One of these is nerve growth factor (NGF), which is involved in the survival of some classes of embryonic neurons (e.g., peripheral sympathetic neurons) [1]. NGF is mostly found outside the central nervous system (CNS), but trace amounts have been detected in adult CNS tissues, although its physiological role there is unknown [1]; it has also been found in several snake venoms [2, 3]. Proteins similar to NGF include brain-derived neurotrophic factor (BDNF) and neurotrophins 3 to 7, all of which show neuron survival and outgrowth activities. Although NGF was originally identified in snake venom, its most abundant and best-studied source is the submaxillary gland of adult male mice [4]. Mouse NGF is a high-molecular-weight hexamer composed of two subunits each of the alpha, beta and gamma polypeptides. The beta subunit (NGF-beta) is responsible for the physiological activity of the complex [4].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 21 1.6e-08 7.8e-05 22.9 0.3 74 175 200 319 167 323 0.78
2 21 0.0032 16 5.5 0.0 129 175 328 376 324 380 0.87
3 21 0.0035 17 5.4 0.0 129 175 385 433 381 437 0.87
4 21 0.0035 17 5.4 0.0 129 175 442 490 438 494 0.87
5 21 0.0035 17 5.4 0.0 129 175 499 547 495 551 0.87
6 21 0.0035 17 5.4 0.0 129 175 556 604 552 608 0.87
7 21 0.0035 17 5.4 0.0 129 175 613 661 609 665 0.87
8 21 0.0035 17 5.4 0.0 129 175 670 718 666 722 0.87
9 21 0.0035 17 5.4 0.0 129 175 727 775 723 779 0.87
10 21 0.0035 17 5.4 0.0 129 175 784 832 780 836 0.87
11 21 0.0035 17 5.4 0.0 129 175 841 889 837 893 0.87
12 21 0.0035 17 5.4 0.0 129 175 898 946 894 950 0.87
13 21 0.0035 17 5.4 0.0 129 175 955 1003 951 1007 0.87
14 21 0.0029 14 5.7 0.0 129 176 1012 1061 1008 1075 0.87
15 21 0.0035 17 5.4 0.0 129 175 1110 1158 1106 1162 0.87
16 21 0.0029 14 5.7 0.0 129 176 1167 1216 1163 1231 0.87
17 21 0.17 8.3e+02 -0.1 0.0 136 175 1244 1284 1233 1289 0.85
18 21 0.0034 17 5.4 0.0 129 175 1312 1360 1308 1366 0.87
19 21 0.0039 19 5.2 0.0 129 175 1369 1417 1364 1419 0.87
20 21 0.0054 27 4.8 0.1 129 175 1419 1467 1416 1471 0.85
21 21 4.4e-14 2.2e-10 41.1 0.1 129 242 1476 1590 1472 1595 0.88
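
Each row of the table above has thirteen whitespace-separated fields in the order given by the header. The sketch below shows one way to read such rows into typed records and keep only the hits under an independent E-value cutoff. The `DomainHit` type, the `parse_hit` helper, and the cutoff of 1e-3 are assumptions made for illustration; the record itself does not say which threshold was used. The sample rows are copied from the table.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """Hypothetical record for one row of the per-domain table."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hit(line: str) -> DomainHit:
    """Convert one whitespace-separated row into a DomainHit."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
    return DomainHit(
        int(fields[0]),
        int(fields[1]),
        float(fields[2]),
        float(fields[3]),
        float(fields[4]),
        float(fields[5]),
        *(int(value) for value in fields[6:12]),
        float(fields[12]),
    )


# Rows 1, 2, 17 and 21 of the table above.
rows = [
    "1 21 1.6e-08 7.8e-05 22.9 0.3 74 175 200 319 167 323 0.78",
    "2 21 0.0032 16 5.5 0.0 129 175 328 376 324 380 0.87",
    "17 21 0.17 8.3e+02 -0.1 0.0 136 175 1244 1284 1233 1289 0.85",
    "21 21 4.4e-14 2.2e-10 41.1 0.1 129 242 1476 1590 1472 1595 0.88",
]
hits = [parse_hit(row) for row in rows]

# Assumed cutoff on the independent E-value; adjust as needed.
I_EVALUE_CUTOFF = 1e-3
significant = [hit for hit in hits if hit.i_evalue <= I_EVALUE_CUTOFF]
for hit in significant:
    print(hit.index, hit.i_evalue, hit.ali_from, hit.ali_to)
```

With this cutoff only the first and last domains pass among the sample rows; the intermediate repeats have independent E-values well above it.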

Sequence Information

Coding Sequence
ATGCGTTCAGTAGTTAACCGGCTGAACATAGAGGCGGGGTTCATGTCGCCCATGTCGCCGCCGGAGATGAAGCCGGACACAGCGATGCTGGACGGCATGCGCGACGACGCCACGTCCCCGCCGGCCATGAGGAACTACCCCCCGAACCACCCCCTCAGCGGCTCCAAACACCTCTGCTCTATATGCGGAGACAGGGCGTCGGGGAAACACTACGGAGTTTATAGTTGCGAAGGCTGCAAGGGCTTCTTCAAGAGGACCGTGAGGAAGGACCTGACGTACGCGTGCCGCGAGGAGAGGAACTGTATAATCGACAAGCGGCAGCGGAATCGATGTCAGTACTGCCGCTATCAGAAGTGCCTGGTTTGCGGCATGAAGCGCGAGGCCGTGCAAGAGGAGCGGCAGCGGGCGGCCCGCGGCGCCGAGGACGCGCACCCGAGCAGTTCTGTACAGGTAACGGAGTTATCAATCGAGCGGTTGCTAGAGATGGAGTCGCTGGTTGCTGACCCTCCCGAGGAGTTTCAGTTCCTGCGCGTGGGCCCCGACAGCAACGTGCCGGCGCGGTACCGAGCGCCCGTCTCCAGTTTGTGTCAAATAGGGAACAAGCAGATCGCCGCACTAATGGTGTGGGCGCGCGATATTCCTCACTTCAGCCAACTGGAGCTAGACGACCAGGTGTTGCTGCTCAAAGCGTCGTGGAACGAACTACTGCTGTTCGCCTTCGCCTGGCGCTCTATGGAGTACCTTGAAGATGAACGAGAGAACATGGACGGCACGAGaagcgccgcgccgccgcagtTGATGTGCCTGATGCCTGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGACCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAA
CTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGC
TGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGGTAATTACAACTCCGTCACAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGCCGTGGGGCAGATCTTCGACCGCGTGCTGTCCGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTGGCGCTCAAGGCCATCGTGCTGCTCAACCCCGATGTGAAAGGGCTGAAAAGCCGTTCAGACGTCGATGTTTTAAGAGAAAAGATATTCTCATGCTTGGACGAATACTGTCGTCGAGCACACAGTTCAGAGGAAGGCAGGTTCGCTTCGCTGCTGCTGAGGCTACCGGCGTTGCGCTCCATCTCGCTGAAGAGCTTCGAGCATCTGTTCTTCTTCCACCTGATCGCGGAGGGGAGCATCGGGAACTACATCCGGGAGGCGCTGCGCAACCACGCGCCGCCCATCGACACCAGCACCATGTTGTAA
Protein Sequence
MRSVVNRLNIEAGFMSPMSPPEMKPDTAMLDGMRDDATSPPAMRNYPPNHPLSGSKHLCSICGDRASGKHYGVYSCEGCKGFFKRTVRKDLTYACREERNCIIDKRQRNRCQYCRYQKCLVCGMKREAVQEERQRAARGAEDAHPSSSVQVTELSIERLLEMESLVADPPEEFQFLRVGPDSNVPARYRAPVSSLCQIGNKQIAALMVWARDIPHFSQLELDDQVLLLKASWNELLLFAFAWRSMEYLEDERENMDGTRSAAPPQLMCLMPGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQTEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPGNYNSVTGMTLHRNSALQAAVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPDVKGLKSRSDVDVLREKIFSCLDEYCRRAHSSEEGRFASLLLRLPALRSISLKSFEHLFFFHLIAEGSIGNYIREALRNHAPPIDTSTML
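
The protein sequence should correspond to a codon-by-codon translation of the coding sequence. A minimal sketch of that check follows, using only the first and last few codons of the coding sequence above. It assumes the standard genetic code (NCBI translation table 1); the record does not state which table was used, and the `translate` helper is hypothetical. Lowercase (soft-masked) bases are uppercased before lookup.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break  # the stop codon is not part of the protein
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons and last six codons of the coding sequence above.
cds_start = "ATGCGTTCAGTAGTTAACCGG"
cds_end = "ACCAGCACCATGTTGTAA"
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first seven and last five residues of the protein sequence listed above.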

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2