Ctra030803.1
Basic Information
- Insect
- Cosmia trapezina
- Gene Symbol
- Glmp
- Assembly
- GCA_905163495.1
- Location
- LR991037.1:3520379-3538144[+]
Transcription Factor Domain
- TF Family
- NCU-G1
- Domain
- NCU-G1 domain
- PFAM
- PF15065
- TF Group
- Unclassified Structure
- Description
- NCU-G1 is a set of highly conserved nuclear proteins rich in proline with a molecular weight of approximately 44 kDa. Especially high levels are detected in human prostate, liver and kidney. NCU-G1 is a dual-function family capable of functioning as a transcription factor as well as a nuclear receptor co-activator by stimulating the transcriptional activity of peroxisome proliferator-activated receptor-alpha (PPAR-alpha) [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 1 1.7e-104 8.8e-100 335.6 0.2 1 352 38 379 38 380 0.95
Sequence Information
- Coding Sequence
- ATGCATTTACAGActttattaattgtttttttactaTCTTTCGCGTGTGGACAAGACAGACAGATAATCCCAAAACTGAACCCAGGATGTGAAAACTGTGATTCAAGTACAACCCTGGTATACATCAAAGCGGAAGGCACCCATGATACCATCCACCAGCTGTGGGACTTCACGAGAGGTGTGCCCACTATCATCTATGTGATCACCAAACTCAACTCTTCGTTAAATATAACTTGGGACTATGAAGAACCTGTTAAATTTACGCTCACCGAAAAACCGCTGTATAGTTTTGCTGCAACAATTGacaagCTACAAGAATACAATGACATCGAGAACAACGGGCACATTGACAACAAGAGCCCACAGAGGCAGCTGTCGTTGCGTCACGTGGTGTGGAAACGCATCGACAGCAAGCTAACTGACAAGGAAGCCATGCTGCACGTGCATGGGCATTTCGAGGATCAGAGGAGGCAGAGAGGCGTTCTTGATATGAGACTAGACCTCCTCCCCTTCAAAGACTACGCGGTAGACCTGCCCCACCTCATCCACACAGCGAACTCGACGCTCATCGACGTCAGCCTAGTGAACCTGACGACGTCCCCTGACTATAACGCGTCACGATTCGCGATACACTTCATGATGGCCAGCACCGATGCATGGAGCGATACCATGAGTTACAACATGAGGAAGAGCTTGGATGATGAACATACGCCGGGAGTTTTTGAGATATTTGAGATAAAGACGCCCAAGTCCAGTCGGTCAGACGACGGCGGTTTCATGCAGTTCCGTCCGGTGTGCTACACGGAGGCAGAGCGCAGCGTGTCCTCTTCGACCAACGCGTACATTACGAACTTTACCAGATCGGATCTGCCCAAGTCGGGTACTTTGCGCGATTTCTACCGCGACTATGACCGGAACAACTTACTCATTCAAGATATGTTCATATCGTTCGGTCTGCCCGGTGACGGCTTCTATAGACAACATAACTATACTTCTTGGTCATTCACTATAGGCTACGGAGTGCCGCCTGTAGAGAACTTCTCCctcttcgtcatcatcatcatatccaTCGGGCTAGGTGTTCCCGTACTGCTGGCACTATCAGGCATCACCTATGTACTGGTCCGAAGGTGTAAGCAGAGGAATGCTCCAGCGAGGTTTACTGATGATGACGAATAa
- Protein Sequence
- MHLQTLLIVFLLSFACGQDRQIIPKLNPGCENCDSSTTLVYIKAEGTHDTIHQLWDFTRGVPTIIYVITKLNSSLNITWDYEEPVKFTLTEKPLYSFAATIDKLQEYNDIENNGHIDNKSPQRQLSLRHVVWKRIDSKLTDKEAMLHVHGHFEDQRRQRGVLDMRLDLLPFKDYAVDLPHLIHTANSTLIDVSLVNLTTSPDYNASRFAIHFMMASTDAWSDTMSYNMRKSLDDEHTPGVFEIFEIKTPKSSRSDDGGFMQFRPVCYTEAERSVSSSTNAYITNFTRSDLPKSGTLRDFYRDYDRNNLLIQDMFISFGLPGDGFYRQHNYTSWSFTIGYGVPPVENFSLFVIIIISIGLGVPVLLALSGITYVLVRRCKQRNAPARFTDDDE*
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00771959;
- 90% Identity
- iTF_00374150; iTF_00907078; iTF_00906186; iTF_00425421; iTF_00907918; iTF_00124305; iTF_00123410; iTF_00449139; iTF_00726393; iTF_00172992;
- 80% Identity
- -