Basic Information

Gene Symbol
HR38
Assembly
GCA_963971235.1
Location
OZ020238.1:6310295-6332531[-]

Transcription Factor Domain

TF Family
NGFIB-like
Domain
zf-C4|NGFIB-like
PFAM
AnimalTFDB
TF Group
Zinc-Coordinating Group
Description
During the development of the vertebrate nervous system, many neurons become redundant (because they have died, failed to connect to target cells, etc.) and are eliminated. At the same time, developing neurons send out axon outgrowths that contact their target cells [1]. Such cells control their degree of innervation (the number of axon connections) by the secretion of various specific neurotrophic factors that are essential for neuron survival. One of these is nerve growth factor (NGF), which is involved in the survival of some classes of embryonic neuron (e.g., peripheral sympathetic neurons) [1]. NGF is mostly found outside the central nervous system (CNS), but slight traces have been detected in adult CNS tissues, although a physiological role for this is unknown [1]; it has also been found in several snake venoms [2, 3]. Proteins similar to NGF include brain-derived neurotrophic factor (BDNF) and neurotrophins 3 to 7, all of which demonstrate neuron survival and outgrowth activities. Although NGF was originally identified in snake venom, its most abundant and best studied source is the submaxillary gland of adult male mice [4]. Mouse NGF is a high molecular weight hexamer, composed of 2 subunits each of alpha, beta and gamma polypeptides. The beta subunit (NGF-beta) is responsible for the physiological activity of the complex [4].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 2 1 3.8e+04 -7.7 4.7 35 68 313 346 310 351 0.75
2 2 2e-118 7.5e-114 380.6 0.0 1 261 617 877 617 877 0.99

Sequence Information

Coding Sequence
ATGCGTACTGCCGGGAACGCGGCGGGGACAAAAACAACGGTGCAACCGTGTCCTGGATCGACGGTCGGGAGAACGGTTAATATCGAGGATCGCGTCAACGATCGAGATCGAGAGTGCAGCGTGGCTGCGCACGTTGGACCGAGGAGTCCGTACTCGCAGCTACAAACAACGTTGCAAGCCGATACGAATAACAATCACGATCATCATCGGAATCTCGAGTCGACGGAACATCATCTCAATCATCATCGTACCGACGATCACTCATGGACCAAGTGGCAACTGCACACGCCGAGCGGCGACAGCGACTGCGAGAACGGGAAAAGTTACTGCAACAGCCCCCAACAGCAAATATCCGTGGAGCAAGTGTCGAGAAGTGAGGTCGCTGAACATCGGGGTATCGCGTTGGTAGGCTGCAACGAGACTACCGGTACCACCGCAATCACGACGACCTCGACGACTACAACGGCTACTGCCTTCACCGTTGCATCAAGTATGCTTCTGCTCCAAACACAGagcCCATTCGGATGTAGTAGCTTCGCAGACTTGCTTAGCGCCCCTTATACAGATCCCGCGGACACGGGTAGTCTACCCGAGGAACTCGATCCCTTTCCGGAATTACAATTGGGCAATCCTCAGGGTGTACCGAACGCCGAGGAACCGGCGCAGCCACAGCAAGCAATTCATCATCACCTTCAGCATCATCAGCACAGACCCTTATCCGCTGGCGACGTACAAACCGATTCGCTCAGTCCAACGCCATTGCCGAGTTTCCAAGAGACCTATACTGTGCATCAGCACCGTTACACGCGTCAGGAACTCCAGGGACTCGGTATAAAAATGGACGACGAGTGTTTCGATTCGTTGCAATTCACGTGCAATGACGTTCCCTACACGGCTGACTTTACCACAACGGTTCCATATCACAGTCATCATCAGGCGCAGCATCATCATCAGCCGCAGCAGACGCACCTTCATCATCCCCATCACCAACAGCAACAACATCACCTTcaccagcaacagcaacagcaacagcagcaacatcCACATCACCACTTGCAACATCATCACCACCATACGAATCAGAATGACAACCACAGTCATCAGAATATAACTCCCGGCGCTTTGAGTCCCGCATCGCCGGCAGTTTCGACGAGCGCAGGCAGCGCGACGAGATCGCCACCGCCGGGTTCGCCTCCGCCAACTCAAGCGCCGCCACCTCCACCACCTCCGCCACCTCCACCTCCGGGTTATTTCACCGCTGTATCTTCAAATAATTCTGTAGCGAACGTGCAAACTCCCGTCAGCATGGGACATCAACGAGACCTTAGTAGTTACGCGACTGTACCGATGAGTCCGGTTTACGGACAACCGAGTAATCTCGTCAGTGGACCGATAACTAGTGTCGGTGTACCGAACGAACTAAGATCAACGCTTCCTTCAAATTCTGTTCTCGTTGCCAATACCGGCAGAACACGACCTCCGATGCCACTTCAGAGATCGGACTCTACTAGTAGCGGAAGTAATCAAGAATCACCGAAGCCTCGTGGAAGCGGTGGAAGCGTAACATTGGGATCCGTACCGTCGCCCGGCGGAACAGAACGCGCCCCGCCGAGTCCGAGTCAATTGTGCGCGGTATGCGGCGACACAGCGGCTTGTCAGCACTACGGTGTACGCACATGCGAGGGCTGCAAAGGATTCTTCAAGCGAACCGTGCAAAAAGGATCGAAATACGTATGCCTCGCGGAAAAAGCGTGTCCCGTCGACAAACGCCGACGAAATCGTTGCCAGTTCTGTCGTTTTCAGAAGTGTCTGATGGTCGGAATGGTTAAAGAGgttGTAAGAACGGATTCGCTGAAAGGACGTCGAGGGCGTTTGCCATCGAAGCCAAAATCACCGCAAGAATCACCACCGAGTCCACCGATATCCCTGATAACGGCTCTGGTACGCGCACACGTCGACACGACACCCGATTTAGCGAATCTCGATTATTCGCAATATCGGGAACCCGGTCCGTCCGATCTGCCGATCAGTGAAGccgaaaaaattcaacaattctACAATCTTCTCATGACGTCCGTCGACGTTATACGCAACTTCGCGGATAAAATTCCCGGCTTTGCGGATCTCACGAGAGAGGATCAGGAGCTACTCTTCCAATCGGCGAGCTTAGAATTGTTCGTACTGAGGCTCGCGTATCGCACGAGAGCCGACGACACGAGTCTAACATTCTGCAACGGTGTGGTACTCGCACGCGCTCAGTGTCAACGAAGTTTCGGCGATTGGTTGCACGGTATTCTGGACTTTTGTCAAGCACTGCGGGTGCTCGATGTCGATATAAGCGCTTTCGCCTGTCTGTGCGCTCTCACCCTCGTTACCGAGAGATACGGCCTGAAAGAGCCGCATCGCGTGGAGTTACTTCAGACAAAAATAATATCGTCGTTGCGCGATCACGTTACATACAACGCCGAGGCGCAACGAAAGACGCAGTACCTGTCGAGGCTTTTGGGCAAATTGCCGGAGCTTCGTAGTCTCTCGGTTCAGGGACTACAGAgaattttttatctcaaattGGAAGATCTCGTACCGGCACCGCCTCTCATCGAGACTATGTTCGTCGGTAGTCTacctttttaa
Protein Sequence
MRTAGNAAGTKTTVQPCPGSTVGRTVNIEDRVNDRDRECSVAAHVGPRSPYSQLQTTLQADTNNNHDHHRNLESTEHHLNHHRTDDHSWTKWQLHTPSGDSDCENGKSYCNSPQQQISVEQVSRSEVAEHRGIALVGCNETTGTTAITTTSTTTTATAFTVASSMLLLQTQSPFGCSSFADLLSAPYTDPADTGSLPEELDPFPELQLGNPQGVPNAEEPAQPQQAIHHHLQHHQHRPLSAGDVQTDSLSPTPLPSFQETYTVHQHRYTRQELQGLGIKMDDECFDSLQFTCNDVPYTADFTTTVPYHSHHQAQHHHQPQQTHLHHPHHQQQQHHLHQQQQQQQQQHPHHHLQHHHHHTNQNDNHSHQNITPGALSPASPAVSTSAGSATRSPPPGSPPPTQAPPPPPPPPPPPPGYFTAVSSNNSVANVQTPVSMGHQRDLSSYATVPMSPVYGQPSNLVSGPITSVGVPNELRSTLPSNSVLVANTGRTRPPMPLQRSDSTSSGSNQESPKPRGSGGSVTLGSVPSPGGTERAPPSPSQLCAVCGDTAACQHYGVRTCEGCKGFFKRTVQKGSKYVCLAEKACPVDKRRRNRCQFCRFQKCLMVGMVKEVVRTDSLKGRRGRLPSKPKSPQESPPSPPISLITALVRAHVDTTPDLANLDYSQYREPGPSDLPISEAEKIQQFYNLLMTSVDVIRNFADKIPGFADLTREDQELLFQSASLELFVLRLAYRTRADDTSLTFCNGVVLARAQCQRSFGDWLHGILDFCQALRVLDVDISAFACLCALTLVTERYGLKEPHRVELLQTKIISSLRDHVTYNAEAQRKTQYLSRLLGKLPELRSLSVQGLQRIFYLKLEDLVPAPPLIETMFVGSLPF

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00397380;
90% Identity
iTF_00414338;
80% Identity
iTF_00397380;