Basic Information

Gene Symbol
Tulp4
Assembly
GCA_003118335.3
Location
JAOPJP010000031.1:2027079-2044702[+]
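
The Location value packs four fields into one string: contig accession, start, end and strand. A minimal Python sketch of splitting it is shown below. The names `Locus`, `parse_locus` and `span_length` are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the record does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Pattern for strings such as "JAOPJP010000031.1:2027079-2044702[+]".
_LOCUS_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_locus(text: str) -> Locus:
    """Split a 'contig:start-end[strand]' string into its four fields."""
    match = _LOCUS_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


def span_length(locus: Locus) -> int:
    """Genomic span in bp, ASSUMING 1-based inclusive coordinates."""
    return locus.end - locus.start + 1


locus = parse_locus("JAOPJP010000031.1:2027079-2044702[+]")
print(locus.contig, locus.strand, span_length(locus))
```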

Transcription Factor Domain

TF Family
Tub
Domain
Tub domain
PFAM
PF01167
TF Group
Unclassified Structure
Description
Tubby, an autosomal recessive mutation mapping to mouse chromosome 7, was found to result from a splicing defect in a novel gene of unknown function, the tub gene [3, 4]. The mouse tubby mutation causes maturity-onset obesity, insulin resistance and sensory deficits. In contrast to the rapid juvenile-onset weight gain seen in diabetes (db) and obese (ob) mice, obesity in tubby mice develops gradually and strongly resembles the late-onset obesity observed in the human population. Excessive deposition of adipose tissue culminates in a two-fold increase in body weight. Tubby mice also suffer retinal degeneration and neurosensory hearing loss. The tripartite character of the tubby phenotype is highly similar to human obesity syndromes such as Alström and Bardet-Biedl. Although these phenotypes indicate a vital role for tubby proteins, no biochemical function has yet been ascribed to any family member [2]. It has been suggested, however, that the phenotypic features of tubby mice may result from cellular apoptosis triggered by expression of the mutated tub gene.
TUB is the founding member of the tubby-like proteins (TULPs), which are found in multicellular organisms from both the plant and animal kingdoms. Ablation of members of this protein family causes disease phenotypes that indicate their importance in nervous-system function and development [1]. Mammalian TUB is a hydrophilic protein of ~500 residues. The N-terminal portion of the protein (IPR005398) is conserved in neither length nor sequence but, in TUB, contains the nuclear localisation signal and may have transcriptional-activation activity. The C-terminal 250 residues are highly conserved. The C-terminal extremity contains a cysteine residue that might play an important role in the normal functioning of these proteins. The crystal structure of the C-terminal core domain from mouse tubby has been determined to 1.9 Å resolution. This domain is arranged as a 12-stranded, all anti-parallel, closed β-barrel that surrounds a central α helix (located at the extreme carboxyl terminus of the protein), which forms most of the hydrophobic core. Structural analyses suggest that TULPs constitute a unique family of bipartite transcription factors [2].
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1  4   0.025     1.4e+02   2.3    0.0   6         67      418       474     415       487     0.78
2  4   0.0003    1.7       8.6    0.1   116       158     603       643     580       699     0.70
3  4   2.2       1.2e+04   -4.1   0.3   42        91      989       1039    978       1064    0.46
4  4   1.8e-30   1e-26     94.5   0.0   171       253     1285      1361    1218      1364    0.86
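
Of the four domain rows above, only the last has a small independent E-value. As a rough sketch of how such rows could be read and filtered in Python, the example below parses the thirteen whitespace-separated columns and keeps hits under an i-Evalue cutoff. The `DomainHit` class, the helper names and the `1e-5` cutoff are illustrative assumptions, not values taken from the record or from HMMER itself.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class DomainHit:
    index: int      # "#": domain number
    total: int      # "of": number of domains reported
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# Rows copied from the Hmmscan Out table in this record.
ROWS = """\
1 4 0.025 1.4e+02 2.3 0.0 6 67 418 474 415 487 0.78
2 4 0.0003 1.7 8.6 0.1 116 158 603 643 580 699 0.70
3 4 2.2 1.2e+04 -4.1 0.3 42 91 989 1039 978 1064 0.46
4 4 1.8e-30 1e-26 94.5 0.0 171 253 1285 1361 1218 1364 0.86
"""


def parse_row(line: str) -> DomainHit:
    """Convert one whitespace-separated row into a DomainHit."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 columns, got {len(f)}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


def significant(hits: List[DomainHit], max_i_evalue: float = 1e-5) -> List[DomainHit]:
    """Keep hits at or below the cutoff (1e-5 is an assumed threshold)."""
    return [h for h in hits if h.i_evalue <= max_i_evalue]


hits = [parse_row(line) for line in ROWS.splitlines() if line.strip()]
for hit in significant(hits):
    print(hit.index, hit.ali_from, hit.ali_to, hit.score)
```

Under this assumed cutoff, only domain 4 (alignment coordinates 1285 to 1361 on the protein) is retained.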

Sequence Information

Coding Sequence
ATGCATCTACACTTCGAGCGGAATGTTAACGCTAAATGCGATTGTACTATACTGTCGCTGTCGTGGATGGGAAAGGTGCCCGACGAGTTGCCAGAGGAGGAAGGTTGGAAGTTGAACCGCAACAATTACTACCAAGAGGGTTGGTTGGCGACGGGCAACGTGCGTGGAGTGGTCGGCGTGACTTTTACGTCATCACACGCGCGGCGGCCACACGAGCTACCGCTGCGTACCAACTACAACCTACGAGGGCATAGATCTGATGTAATTCTGGTAAAATGGAACGAACCATATCAGAAGTTGGCGTCGTGCGACAGCTCGGGCGTCATCTTCGTATGGATCAAGTACGAGGGCCGATGGAGTATCGAACTCATAAACGACAGAAGCACACCTGTCACACACTTCTCCTGGTCTCACGATGGCAGGATGGCCCTTATTTGTTACCAGGATGGTTTCGTGTTGGTGGGTTCGGTGGCAGGTCAGCGCTACTGGTCTTCGATGCTATCGTTAGACGCGAGGATCACGTGCGGCTGCTGGACGCCTGACGACGGCCAGGTGTACCTCGGCACAGCCTCTGCGCAGCTGGTCGTTATGGACGTGCACGGCGCTATGGTCTCGCAGGTGCAATTGGTAGCTGAAGGTGGCATAACATCTATGGCATGGTCGTGCGAGAAGTTCAAGATGGAGGAAGGCGAGGAGGGAGGCGAGAGTAGCGGTGGCCACGTACTGGCCGTCGCGCTGGGCAACGGAGAGATAGTGCTTTTACATGGCCACGACGACGTTAGTCCAGTGCGTGTGACCACTGGCCTCAGAGGCAGCACACTGGCCATGGAATGGGCTAACTCACGAGAACTGCTTGCAGTTGCGGGAACCTTGAACGCGGAGACAAACGAGTCCGCTGAAGCGCCTCCGTTCAACAATGTGGTCAAGTTCTATTCCGACGCCGGGGTGCTAATATACACTGTGCCCATACCTTATACACAGGCCCGTGTGAGTGCGCTAACGTGGGGCCACGCGGCGCGGCGCCTGTTCGTGGGCGTGGGCGGCGCGGTGTGCACGGCGCGCGTGTGGCGCGCGGTGGCGCCGCTGCAGCTGCTGGCGCGCGTGCGGGCCGCGCAGGCGCTGCGCCATGCGCGCCTCGCCGCCGCGCTGCCCCTCCCGCCGCGCCTGCAGCCCGCACTCGCCAGCCTCTTCGCACACACCATCCGGTGCAACGTGCCGGAGGCGAGCGAGCTGCGGCGGTTCGTGTCACGTCCGCCGGGCGGGCGGCTGCACTGCACCATGCTGCGACACGACGACGAGGAGGCCGGCGCCTACACGCTTTACCTGGAGCATCTCGGCGGCCTCGTGCCGCTGCTCAAGGGACGCCGCACCAGCAAGATACGACCCGAGTTCGTCATATTCGACCCACAGGCCGAAGTGTGCGCAGATGGCGCTGCGTGCGCGGTGAGCAGCAGCAGCAGCAGTAGCAGCAGCAGCGGCGCGGGGCTGGCGGGTGGCTCCCCCGCCCCCCGCACGCCGCGACCCGCGCCCCCCGCCACCCCCGCACACCGCCCCCGTCTGCCGCCCCCGCAACACTCCTCCTGCTCCTCCGACAGCGAGAGAGAAGACGGTTGTACAGGATCTCCCAGACTACAACGACGCAGGCGCGCTCGGGAGCGGCGCAAAGCGAGGAAATCGGGGGACAAGGACGACCCGCCCGACGAACTGGCTTATATCGACTCCTTGCCTGAGGATGTTCGTTTAGTTGAAGTCACATCGAATATTTGGGGTACCAAATTTAAAATGCACGGGCTGGCTAAAAACGTTCCTGATAACTTAGGTCAAGTGACTTACAAAACGTCTTTGCTTCATTTGCAACCGAGACAAATGACGCTAATGATAACCGAACTGCGGGACGACCACCCAGTAGATCCCGATCCGAATTTTAACCCAAATATATTTTCCGAAGACGAGGAAGAAGTGTTTCAGTCAAACGACTCGAGTTCACCTTCGCATACGAA
TAACAACATTAGGCGGAAATTAACTATCAACGACCGAATCAATAATTCAAATAATGTTTTATCTCAAAATGAAACGTATTCCAGTAATAACAACAATTCGAACCCATCTATAGCGCGAGCGGAATCGTACGACGAATTCCCATACATAGATACAAACGACACCGTCAACAACGTGTCAGAGAGCGTGTACTCGACGGGCGTGCGGCACGCCGCGCCCGCGCCCGAGCGCAGGGTGAACAGCTCCGCCGGCGTCCCCCACCGGCACGCCATCTCCCCGCTGCGGTGCGAGAGCTCCGTCCCTACCTTGCAGTCCCCGAAGAATGCCGTCGCGCCGACAGACATCATCTTCGAAAGGCCCTCCCCGCAAACGGTGACCTGCGGAGGTCGCGGTGACTTCTGCGGCGGCAGGACCGACTACAGTGTTCGCGGTGATAACGTAACCCTAAAAAATAATATACCTAACATCGAACAACAATCTTTCAATTTAAACCTTAGCGTAGAACCCAGTAGGCAGGTTACAATCAAAAAATGCGATAACCACGTTTCGGACAATTGTCTCTCAAAGTTACGCAAAAACATTTGCAACCGAGGCGAATCGGCGTCATTCGAAGTTAATACTAGAATACTGAAATCTTTATGTAACAAAAATTTCGAACACGAGGTGCATCCGGATGCGATGACACGGCGCTCAGATAATGTAAAAAATATCCAAAAGGGCGAGGATTTGAAATTCATCGACGAGGAAACCCCCATTGAAACCACGATAAATAATTTGAGCGAGAGACCTCGGGAGATGAGGGTGCAGCGGACGACGACCGTGGTGCCCATCAGCCCGGTGTGCGCCAGCGTTCCCGTCTACGACACGATGACGCGCAGCTGCAGCGTGGGTTACCTAGACCTCGTCGACCCGCAAGTGCTGCGCGCCCACAGCCTTACCGCCCTGCGCGGCGAGCAGCCCCGAAGACTGGTCCTGGTGAATAACAAACGTCACAGGAGAGCGAAGCGACACTTCCGGACCAACGACATCAAGCAAATGGAAACCAAAACGCCGAGCTTAAAGAAATGTGGGAAATCGAGGAGCCTCGATTCGGGCGAGCTGTCTATAACGGTGGAGAAATGCAAGCGGCAGTCGCGGTCCGACAGCAACCAGAGGAGCGAGGCGGCGTCGGCGCAGTCGAGCAGCCGCTACACTAGCACAGGGGAGGACAATAGCGGCACGAGCACGGAGGAGGGCGGAGGGCGACGCGCGCGGCGCGACTTTCCGGTGTGCACGCGCTGCCGCCTCGTGTCGCCGTACGACCGGCGGGACGTGGACGCCGCCACGTACGTGTGCGTGGCATGCAGCTCGCGTGCCGCCGCACCGCCCCCCGCGCCGCAGCCGCCCCCGCCCCCGCCCCCCGTACTGCACACCGACAGCGATTCCGACTACAGCAAGTATTATAGTTCACTTGAGCAGTTGGCGTTGCGGTTGCTGGCCGCTAGGGGTCGCAGCGGACGTGGGGGCGGGGGCGGGGGCGGTGCGCGCGAGCGGCCGCCGCGCGCGCAACCCGCCCCCGCCGCCGCCTCGCCCGCCCCCGCACGCCGCACGCGCTACTCCTCCGCCTCACCTATACGACAACTGCTGAACTCTCCGCTGTTGAACCGGCGGCGTAACAAGAAACCGTCGGAAAGCTCGGATGACGAGTTCTCCAACGGCTACAGTGAAGTCAACGGCAAGAACTATAGAGATCTCGAGAGCTTCCAGAAAGCACAGCTCAGGAACAAGTTGAAGCGCGCGGGGGGCGTGCCGACGGGCGCGGCTGTGGGCGCAGGGGGTGCTGCGGGCGCGGCGGGGGGCGCAGGGGGCGGGCAGGCGCGCCGGCAGCTGCTGATGCACAACAAGGCGCCGATGTGGAACGAGAACAGCCAGGTGTACCAGCTGGACTTCGGCGGCCGCGTCACGCAGGAGTCCGCCAAGAACTTCCAGATCGAGTATCATGGGAAACAGGTGATGCAGTTCG
GGCGCATCGACGGCAACGCGTACACGCTGGACTTCCAGTATCCGTTTTCCGCGCTGCAGGCGTTCGCCGTCGCGCTGGCCAACGTCACGCAGCGCCTCAAGTAG
Protein Sequence
MHLHFERNVNAKCDCTILSLSWMGKVPDELPEEEGWKLNRNNYYQEGWLATGNVRGVVGVTFTSSHARRPHELPLRTNYNLRGHRSDVILVKWNEPYQKLASCDSSGVIFVWIKYEGRWSIELINDRSTPVTHFSWSHDGRMALICYQDGFVLVGSVAGQRYWSSMLSLDARITCGCWTPDDGQVYLGTASAQLVVMDVHGAMVSQVQLVAEGGITSMAWSCEKFKMEEGEEGGESSGGHVLAVALGNGEIVLLHGHDDVSPVRVTTGLRGSTLAMEWANSRELLAVAGTLNAETNESAEAPPFNNVVKFYSDAGVLIYTVPIPYTQARVSALTWGHAARRLFVGVGGAVCTARVWRAVAPLQLLARVRAAQALRHARLAAALPLPPRLQPALASLFAHTIRCNVPEASELRRFVSRPPGGRLHCTMLRHDDEEAGAYTLYLEHLGGLVPLLKGRRTSKIRPEFVIFDPQAEVCADGAACAVSSSSSSSSSSGAGLAGGSPAPRTPRPAPPATPAHRPRLPPPQHSSCSSDSEREDGCTGSPRLQRRRRARERRKARKSGDKDDPPDELAYIDSLPEDVRLVEVTSNIWGTKFKMHGLAKNVPDNLGQVTYKTSLLHLQPRQMTLMITELRDDHPVDPDPNFNPNIFSEDEEEVFQSNDSSSPSHTNNNIRRKLTINDRINNSNNVLSQNETYSSNNNNSNPSIARAESYDEFPYIDTNDTVNNVSESVYSTGVRHAAPAPERRVNSSAGVPHRHAISPLRCESSVPTLQSPKNAVAPTDIIFERPSPQTVTCGGRGDFCGGRTDYSVRGDNVTLKNNIPNIEQQSFNLNLSVEPSRQVTIKKCDNHVSDNCLSKLRKNICNRGESASFEVNTRILKSLCNKNFEHEVHPDAMTRRSDNVKNIQKGEDLKFIDEETPIETTINNLSERPREMRVQRTTTVVPISPVCASVPVYDTMTRSCSVGYLDLVDPQVLRAHSLTALRGEQPRRLVLVNNKRHRRAKRHFRTNDIKQMETKTPSLKKCGKSRSLDSGELSITVEKCKRQSRSDSNQRSEAASAQSSSRYTSTGEDNSGTSTEEGGGRRARRDFPVCTRCRLVSPYDRRDVDAATYVCVACSSRAAAPPPAPQPPPPPPPVLHTDSDSDYSKYYSSLEQLALRLLAARGRSGRGGGGGGGARERPPRAQPAPAAASPAPARRTRYSSASPIRQLLNSPLLNRRRNKKPSESSDDEFSNGYSEVNGKNYRDLESFQKAQLRNKLKRAGGVPTGAAVGAGGAAGAAGGAGGGQARRQLLMHNKAPMWNENSQVYQLDFGGRVTQESAKNFQIEYHGKQVMQFGRIDGNAYTLDFQYPFSALQAFAVALANVTQRLK
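
The Protein Sequence is the conceptual translation of the Coding Sequence. A minimal Python sketch of that relationship follows, assuming the standard genetic code (the record does not name a translation table). The `translate` helper is hypothetical; it strips whitespace first, so a coding sequence wrapped over several lines can be passed in directly.

```python
from itertools import product

# Standard genetic code (an assumption), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence up to, but not including, the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the Coding Sequence above.
print(translate("ATGCATCTACACTTCGAGCGGAATGTTAAC"))
```

Applied to the first ten codons, the sketch reproduces the start of the listed protein (`MHLHFERNVN`).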

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01147413;
90% Identity
iTF_01149635;
80% Identity
-