Basic Information

Gene Symbol
Tulp4
Assembly
GCA_035042405.1
Location
JAWNLI010000485.1:4932096-4939916[-]

Transcription Factor Domain

TF Family
Tub
Domain
Tub domain
PFAM
PF01167
TF Group
Unclassified Structure
Description
Tubby, an autosomal recessive mutation, mapping to mouse chromosome 7, was recently found to be the result of a splicing defect in a novel gene with unknown function. This mutation maps to the tub gene [3, 4]. The mouse tubby mutation is the cause of maturity-onset obesity, insulin resistance and sensory deficits. By contrast with the rapid juvenile-onset weight gain seen in diabetes (db) and obese (ob) mice, obesity in tubby mice develops gradually, and strongly resembles the late-onset obesity observed in the human population. Excessive deposition of adipose tissue culminates in a two-fold increase of body weight. Tubby mice also suffer retinal degeneration and neurosensory hearing loss. The tripartite character of the tubby phenotype is highly similar to human obesity syndromes, such as Alstrom and Bardet-Biedl. Although these phenotypes indicate a vital role for tubby proteins, no biochemical function has yet been ascribed to any family member [2], although it has been suggested that the phenotypic features of tubby mice may be the result of cellular apoptosis triggered by expression of the mutated tub gene. TUB is the founding-member of the tubby-like proteins, the TULPs. TULPs are found in multicellular organisms from both the plant and animal kingdoms. Ablation of members of this protein family cause disease phenotypes that are indicative of their importance in nervous-system function and development [1]. Mammalian TUB is a hydrophilic protein of ~500 residues. The N-terminal (IPR005398) portion of the protein is conserved neither in length nor sequence, but, in TUB, contains the nuclear localisation signal and may have transcriptional-activation activity. The C-terminal 250 residues are highly conserved. The C-terminal extremity contains a cysteine residue that might play an important role in the normal functioning of these proteins. The crystal structure of the C-terminal core domain from mouse tubby has been determined to 1.9A resolution. This domain is arranged as a 12-stranded, all anti-parallel, closed β-barrel that surrounds a central α helix, (which is at the extreme carboxyl terminus of the protein) that forms most of the hydrophobic core. Structural analyses suggest that TULPs constitute a unique family of bipartite transcription factors [2].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 4 0.49 2.8e+03 -2.4 0.0 10 59 424 472 417 497 0.70
2 4 0.00031 1.8 8.1 0.0 115 148 593 625 562 675 0.71
3 4 0.6 3.4e+03 -2.6 0.2 130 171 969 1010 960 1034 0.51
4 4 1.3e-30 7.1e-27 94.6 0.2 132 253 1284 1431 1244 1434 0.71

Sequence Information

Coding Sequence
ATGGTTGGGGTTGGGAATTGGGATGGCGGATGGCTGATTGGGTATGGAGATGAAGGCTGGAAGCTGAACCGCACGAACTACTACCAGGAGGGCTGGCTGGCCACGGGCAACATACGCGGCATTGTGGGCGTCACATTCACCACCTCGCATTGCCGCAAGAACATGGACTATCCACTGCGCACCAACTACAATCTGCGGGGACACAGATCGGATGTTATTCTCGTCAAGTGGAATGAGCCGTATCAGAAGCTGGCCTCCTGCGACAGTTCTGGCATCATTTTTGTGTGGATCAAATACGAGGGTCGCTGGTCAATTGAGCTGATCAATGATAGAAACACACCGGTGACGCATTTCTCCTGGTCGCATGATGGTCGCATGGCGCTCATCTGCTATCAGGATGGGTTTGTACTCGTTGGATCGGTGGCTGGCCAACGATATTGGTCATCGATGTTGAATCTAGAGTCGACCATTACGTGCGGCATTTGGACGCCCGACGATCAGCAGGTTTACTTTGGCACCACCCAGGGCCAGGTGATTGTCATGGATGTACATGGTGCGATGGTGTCACAGGTGCAATTATCCAACGATGTTCCCATCACATCCATGGCCTGGTCTTGCGAGAAATTCAAAATGGAGGAGGGCGAGGAAGCGGAGCCCGGTGTAACCAATGCCGcCAAGCGCTCGTTTGTGCTGGCGGTTAGCTTCCAGAATGGATATATCTATTTGCTAAAGTCGTTTGACGACGTTTCACCCGCGCATATCAACACCTGCCTTAATGGGGCTCTGGGCATGGTCATGGAGTGGAGCAACTCCCGGGAGCTGCTCGCCGTCGCTGGCACATTGCGCACCAGTAATTCCAATGGCGTCGGCCCAGATGGCAAGCTGGATGAGCTGGGCACGGCCAGCTGCTATAACAATCTGGTCAAGTTCTATACGGAGTCGGGCACGTGCCTCTACCAGGCGCACATACCCTGCAGCAATGCCACCGTCTCGGCCATCACTTGGGGCCACAACGATAAGCGACTGTTTATTGCCACCGGCACCCAGGTGCACATTGCCTGGATATCGAGGCGGGTGGCATCGTTGCAGTTGCTCTGCCGGCTGCAGATCCAGGCTAGCGTTGGCTCcgagttgctgttgccgctgctgccgttgccttCTAGGATTAAATCGCTCATTGGCAACCTGTTTGCTCAAACCATTCGATGCTGTGTACCTGATCTGAAGTCGCTGCGCGATTTTGTGTCGCGTCCACCGCTCTGCTCGACGCGGCTGCACTGCACCATGATACGGCACGATGATGACTCAAATCTCAGCTCGGGCATATGCTACACACTCTACTTGGAGTATCTGGGTGGTCTGGTGCCGCTGCTCAAGGGCAAACGCACCTCGAAGATCCGACCGGAATTTGTCATCTTTGATCCACAAGTTAATGACAATTCGCTGTATTTTCAATATGCAGCGGAGGCGAAGAGCTCCTCGGGCTCCAGTCAATCCACGACGACGGGGAACAGCGGACGCACTGATTCCTCCGACAGCGATTTCGAGGAGCGTTCACGCTTTGGCTCACCCCGCACGCCCCGCAAGCGTCGTGTGCGCCCCAAGCGACGCAACCAGAATGGAGATCGTGCGGCGAGTGGCAGTGGGGGCAACGATCCGGATAGCCTGGATGAGCTGGCCTATGTGGATACGCTGCCCGAGCAGGAAGTCAAGCTGGTGGAGGTGACCTCGAATATATGGGGCACAAAATTCAAGATTCATGGACTTGCCAAAACCGTCCCAGCCAATTTAGGCCAAGTAACTTATAAGACGTCCCTACTGCATTTGCAGCCGCGTCAAATGACGCTGGTCATCACGGAGCTGCGAGACGATTTTCCAACCGGACCCGATCCCAGCTTCAATCCGAATCTCTTCTCCGAGGACGAAGAGGAGCATAACAATTACAATCACAGTCATAATCAACACAATTCGGATGCAGTGCCGCATGTGAGTGTCACCAGTGCTGCGCCCGATGCCGCCCAGTTGGCCGCATTGAAGCCACCAATTATGCCACAACGTCGCCTCAACGAAGGCGCCTCTGCGCCGCCCATTGCGCCCATGTCACCACGACCCAATCGCATTCTGGCTCGTCACAAGAACTCCCATTCGCTGAGCGTTAATGGTGCCTCCTCGGTGGGCCTGAGTCCATTGGCACGTGCCGAGAGTTACGACGATGACTCCTCCAACGAATCTCAGGAGGCGGCAGCGGCGACGGCCAGCACTACAGTGCTGCTCCACCAAGCGCCCTCATCCAGCTCGAACGCCGGACCCAGCTGCAGCAAGAATCTCAGCCGGCCCAAGACCATAAGCAGCTTCAAGAACAGCTACAGCCGCTCCAGCTCCAATTCCAGCTGCCAGTCACGTCATGCCATTTCGCCGCTCTACTGCGACGGATCCGTGCCAACGTTGCAGTCACCCAAAAATGCTGTTGCCCCCTCGGACATTATATTCGAGCGACCGGCTGTGCCGGCTGCTGGTCAGACCACACTGATGTCGTACTCGAGCAATGCGGACTATGCCAACAATGTGGTGCAGGTGAAGAATGCGTTGATGTCGGAACCGGTGCGGTCGGCGAATAGTCATGTGAATCCGGTGCCACTTAACCTCAATCTCAATTTGGAGCGCATGGACGCAAGGGCTGCCAAGTGCGCGACAGCGAAGCGACGCGACATGCTCTACATTGATGAGGAGACGCAGTCGCCGACacccaccaacaacaccacTAACACCACCTCAAATATGAAACGCACTCCGACTGTGGTATCCATTGCGCCCGCTCTGCCCGATTCCATAACGCGCAGCTGCAGCGTTGGGTATCTGGATTCGGTGGCCATAACACCCTCGGATGAGGCACTCTCCGCCCTGCGCAAAGATGCGCCCAACAAGCGACTGATACTGGTGGACAAGCGgcgcaatcgcaatcgcaaaaGACAGCAGCAGACGGATGTGAGGCGCCAGAAGCTGCAGCAGACGGGCAAATCGAAGAGTCTGGACTCCTGTGATCTGCTATCGCTGCAAACGAAATTGTCGAGCAAGGAGCACGAGCAGGTGGTGCGTAAGCTGCAGGAGATCTCCGATAGCAGCGCTTGCAGTAGCGCTGCAAACACTCTGTGCTTCAAGTGTCGCAACAACATGAATCCCGCCAGCATCTGCAAACGTTGTCAGCCGGCTGCAGCGAATAGTTCGGCGCTTGACGAGATTACCACGGTTGCGGCGTCTGTTGTCGTTGAGCCCGCCAAGGAGGCGATTCCAGTGACACCAGTTGCCGCTGTGCCAAGCAAACCCACGCCCAAGAAACGTTTCGATGTCATCACTAGCTTTACGGACAGTCCGCTCTTCACGCGCAAACATCGCTTTGGCTATGGACGCAGCAAGGATCTGGCCGCAGCTGCAACGGGCAGCAGCACAGAGAACTCCACCCCAATCCTGGCACGCAAGCACGAGAACAGCTTCAGCTTTGTCAAGCAGCTATCCGAGGTGCGCTGGCGCCGCAAAGAGCCAACTCCAAGCCAGAGCCAACAGCAGGGCGCAAGCAATGCGAGCACTTTGGAAAGACAACATTCCTGCGGCACTGTGGAAGCCACTCCGGTTGAGGCCAAGGCATCCGTATCGCTGCACACTCAGgCTCTGACCACGTTGGAGAACATCATCAGTCGTTTGCGTGATTTGGACGAGGGTCGCTTGACGCCACCAACGACGCCGCAGCGCTTGCCACGCAGTTCGCCCGCCTCACCGGCGGCCAGCAAAAAGAACAAGCGCCAGCAGAGCAACTCGCCAATAAGACACATCTTGAACTCACCACTGCTGAATCGGCGACAACGCAAGAAACCGAGCATCATCGAGAGCTCCGACGATGAGGGCAATCAGACGAATGGCTCCGGCGAGGAAATGTGCAGCaccagcaatggcaacggcaaacaGTACCGCGATCTGGAGACATTCCAAAAGGCCCAGTTGCGCCAGAAGCTTAAGCGTGGCAAGATTGAGCCAAATGGTAGCGCCAGTTGCGCCATTCCGGCGCCAGTGCGTCGCGAGTTTGTCATGCACAACAAGGCGCCCATGTGGAACGAGATGAGTCAAGTCTATCAGCTGGACTTTGGCGGACGCGTCACCCAGGAGTCGGCCaagaattttcaaattgagttCCGTGGCAAACAGGTGATGCAATTTGGTCGCATTGATGGCAATGCATATACCCTGGATTTTCAGTATCCCTTCTCGGCGCTACAGGCATTTGCCGTGGCGCTGGCCAATGTTACGCAGCGTCTCAAGTAA
Protein Sequence
MVGVGNWDGGWLIGYGDEGWKLNRTNYYQEGWLATGNIRGIVGVTFTTSHCRKNMDYPLRTNYNLRGHRSDVILVKWNEPYQKLASCDSSGIIFVWIKYEGRWSIELINDRNTPVTHFSWSHDGRMALICYQDGFVLVGSVAGQRYWSSMLNLESTITCGIWTPDDQQVYFGTTQGQVIVMDVHGAMVSQVQLSNDVPITSMAWSCEKFKMEEGEEAEPGVTNAAKRSFVLAVSFQNGYIYLLKSFDDVSPAHINTCLNGALGMVMEWSNSRELLAVAGTLRTSNSNGVGPDGKLDELGTASCYNNLVKFYTESGTCLYQAHIPCSNATVSAITWGHNDKRLFIATGTQVHIAWISRRVASLQLLCRLQIQASVGSELLLPLLPLPSRIKSLIGNLFAQTIRCCVPDLKSLRDFVSRPPLCSTRLHCTMIRHDDDSNLSSGICYTLYLEYLGGLVPLLKGKRTSKIRPEFVIFDPQVNDNSLYFQYAAEAKSSSGSSQSTTTGNSGRTDSSDSDFEERSRFGSPRTPRKRRVRPKRRNQNGDRAASGSGGNDPDSLDELAYVDTLPEQEVKLVEVTSNIWGTKFKIHGLAKTVPANLGQVTYKTSLLHLQPRQMTLVITELRDDFPTGPDPSFNPNLFSEDEEEHNNYNHSHNQHNSDAVPHVSVTSAAPDAAQLAALKPPIMPQRRLNEGASAPPIAPMSPRPNRILARHKNSHSLSVNGASSVGLSPLARAESYDDDSSNESQEAAAATASTTVLLHQAPSSSSNAGPSCSKNLSRPKTISSFKNSYSRSSSNSSCQSRHAISPLYCDGSVPTLQSPKNAVAPSDIIFERPAVPAAGQTTLMSYSSNADYANNVVQVKNALMSEPVRSANSHVNPVPLNLNLNLERMDARAAKCATAKRRDMLYIDEETQSPTPTNNTTNTTSNMKRTPTVVSIAPALPDSITRSCSVGYLDSVAITPSDEALSALRKDAPNKRLILVDKRRNRNRKRQQQTDVRRQKLQQTGKSKSLDSCDLLSLQTKLSSKEHEQVVRKLQEISDSSACSSAANTLCFKCRNNMNPASICKRCQPAAANSSALDEITTVAASVVVEPAKEAIPVTPVAAVPSKPTPKKRFDVITSFTDSPLFTRKHRFGYGRSKDLAAAATGSSTENSTPILARKHENSFSFVKQLSEVRWRRKEPTPSQSQQQGASNASTLERQHSCGTVEATPVEAKASVSLHTQALTTLENIISRLRDLDEGRLTPPTTPQRLPRSSPASPAASKKNKRQQSNSPIRHILNSPLLNRRQRKKPSIIESSDDEGNQTNGSGEEMCSTSNGNGKQYRDLETFQKAQLRQKLKRGKIEPNGSASCAIPAPVRREFVMHNKAPMWNEMSQVYQLDFGGRVTQESAKNFQIEFRGKQVMQFGRIDGNAYTLDFQYPFSALQAFAVALANVTQRLK

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00473793;
90% Identity
iTF_01322659;
80% Identity
-