Basic Information

Gene Symbol
HR38
Assembly
GCA_905220475.1
Location
HG992025.1:7194379-7197126[-]

Transcription Factor Domain

TF Family
NGFIB-like
Domain
zf-C4|NGFIB-like
PFAM
AnimalTFDB
TF Group
Zinc-Coordinating Group
Description
During the development of the vertebrate nervous system, many neurons become redundant (because they have died, failed to connect to target cells, etc.) and are eliminated. At the same time, developing neurons send out axon outgrowths that contact their target cells [1]. Such cells control their degree of innervation (the number of axon connections) by the secretion of various specific neurotrophic factors that are essential for neuron survival. One of these is nerve growth factor (NGF), which is involved in the survival of some classes of embryonic neuron (e.g., peripheral sympathetic neurons) [1]. NGF is mostly found outside the central nervous system (CNS), but slight traces have been detected in adult CNS tissues, although a physiological role for this is unknown [1]; it has also been found in several snake venoms [2, 3]. Proteins similar to NGF include brain-derived neurotrophic factor (BDNF) and neurotrophins 3 to 7, all of which demonstrate neuron survival and outgrowth activities. Although NGF was originally identified in snake venom, its most abundant and best studied source is the submaxillary gland of adult male mice [4]. Mouse NGF is a high molecular weight hexamer, composed of 2 subunits each of alpha, beta and gamma polypeptides. The beta subunit (NGF-beta) is responsible for the physiological activity of the complex [4].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 1 8.2e-109 1.2e-104 350.1 0.0 1 261 331 593 331 593 0.97

Sequence Information

Coding Sequence
ATGTTCGCCGAGGGGCCGAGCGCCTTAGGGTCGCGCTCGCCAAGCGCCTTCTCGCCATCCTCCTCCAGCATGCTACTGCTGCAGACACAAAGCAACTACGGTTCGTCCTTCACTGATCTATTGTCGCCACAATATCAAGAAGACTCGCCTGAGATTTTAGAAGAAAATCTTGATCCTTTCCCTGACGTCGAATTCCACGCTCCAGTGCCCTGTGAAGTTAAGTCGCAACGCACAACCCCCATTAGCGAATCGTCATCACCAACTCCCGGCCCTGCACTACCTAGTTTCGAAGAAACCTATTCCGTTCGCTACCCTAAACAAGAGATGGCGGAGTTCGGGATCAAGATGGACGAGGACTGCTACAATGTCAGCGCATATTCGCACCAAGGACATGCATCGACGCAGCTCTTATATCAATACCACCAACCTTCTATCCCATATGTTCCTTCTTCATACTATGTACCACCGCAACAGTGCAGTCCTACTTTTGATACAGGTGGAATCACCCCTAACCAAGACTCTTACTCGTTGCCGCCGTTTTCAAGCTCGGTGGATTTACATATTTCGGCTGAGCAGAGACATCGACGGGCCTCACTGCCTGTACAAAGGTCCGAATCTAACAGTTCAAATGATAGCCCAAAAGTACATGGTAGCAGAGTGCACTGCATGCAAGCTTCGGCACCCAGCTCTGCGTCTAGTTCACCTGGTATAGCACCGCCGGAAGGTGTCATCCCACCACGAGCCGCGCCGTCGTCTCCTAGCCAGCTCTGCGCCGTCTGTGGTGATACAGCTGCCTGCCAACACTACGGAGTACGCACTTGTGAAGGGTGCAAAGGATTTTTTAAAAGAACCGTTCAGAAAGGTTCTAAATATGTGTGCCTAGCGGAAAAATCTTGCCCCGTCGATAAACGAAGACGAAACCGGTGTCAATTTTGTCGTTTTCAAAAGTGTCTCGTCGTTGGAATGGTGAAGGAAGTTGTAAGAACCGATTCATTAAAAGGAAGACGTGGTAGACTACCGTCTAAGCCGAAATGTCCTCAGGAATCACCACCCAGCCCCCCAATTTCACTCATTACCGCTCTCGTGCGAGCTCACGTCGATACGTCACCCGACTTTGCTAATCTAGACTACTCTCAATACCGAGAACCAAATCCTTTGGAGCCACCGATGTCCGATTTAGAAGTGATTCAACAGTTTTATTCATTGTTGACGACGTCAATCGACATGATCAAGGTATTTGCCGAGAAAGTGCCGGGGTACGGGGACCTTTGCGCGGAGGACCGCGAGCAGCTGTTCGCGTCCGCACGGTTGGAGCTGTTCGTACTGCGGCTAGCGTACCGCACGCGCCCTGAGGACACCAAGCTCACGTTCTGCAATGGACTCGTGCTGGACAAGCGCCAGTGCCAGCGCTCTTTCGGTGACTGGCTTCACGCAGTTCTAGACTTCAGCAACACCTTGCACTCCATGGACATCGACATTTCCACATTTGCCTGCCTTTGCGCCCTCACTCTAATTACCGATCGGCATGGTCTTAAGGAGCCTCATCGAGTGGAGCACCTACAGATGAAGATCATAGGGTGTCTCCGCGCGCACATGCCTGGCGGGGGCGGCGGTGGTGGCGTCAGCGGTGCGCCACACTTCAGCCGTGTGCTAGGTGCACTGCCGGAGCTGCGCTCGCTCTCAGTGCAGGGCATGCAGCGCATCTTTTACCTGAAACTGGAAGACCTGGTGCCAGCCCCGCCACTTATTGAGAACATGTTCCGCGCTAGCCTGCCTTTCTAG
Protein Sequence
MFAEGPSALGSRSPSAFSPSSSSMLLLQTQSNYGSSFTDLLSPQYQEDSPEILEENLDPFPDVEFHAPVPCEVKSQRTTPISESSSPTPGPALPSFEETYSVRYPKQEMAEFGIKMDEDCYNVSAYSHQGHASTQLLYQYHQPSIPYVPSSYYVPPQQCSPTFDTGGITPNQDSYSLPPFSSSVDLHISAEQRHRRASLPVQRSESNSSNDSPKVHGSRVHCMQASAPSSASSSPGIAPPEGVIPPRAAPSSPSQLCAVCGDTAACQHYGVRTCEGCKGFFKRTVQKGSKYVCLAEKSCPVDKRRRNRCQFCRFQKCLVVGMVKEVVRTDSLKGRRGRLPSKPKCPQESPPSPPISLITALVRAHVDTSPDFANLDYSQYREPNPLEPPMSDLEVIQQFYSLLTTSIDMIKVFAEKVPGYGDLCAEDREQLFASARLELFVLRLAYRTRPEDTKLTFCNGLVLDKRQCQRSFGDWLHAVLDFSNTLHSMDIDISTFACLCALTLITDRHGLKEPHRVEHLQMKIIGCLRAHMPGGGGGGGVSGAPHFSRVLGALPELRSLSVQGMQRIFYLKLEDLVPAPPLIENMFRASLPF

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00818391;
90% Identity
iTF_00827019;
80% Identity
iTF_00650130;