Adir008726.1
Basic Information
- Insect
- Anopheles dirus
- Gene Symbol
- glmp-b
- Assembly
- GCA_000349145.1
- Location
- KB672813.1:1782399-1785171[-]
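For downstream use, the Location field can be split into its parts. The sketch below assumes the layout is `scaffold:start-end[strand]` with 1-based inclusive coordinates. Both are inferred from this record alone, not from a documented specification, and `parse_location` is a hypothetical helper name.

```python
import re

# Assumed layout: scaffold:start-end[strand], inferred from this record only.
LOCATION_RE = re.compile(
    r"^(?P<scaffold>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text):
    """Split a location string into scaffold, start, end, strand and span."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    return {
        "scaffold": match["scaffold"],
        "start": start,
        "end": end,
        "strand": match["strand"],
        # Assumption: coordinates are 1-based and inclusive.
        "span": end - start + 1,
    }


loc = parse_location("KB672813.1:1782399-1785171[-]")
print(loc)
```

Under the inclusive-coordinate assumption the locus spans 2773 bases on the minus strand of scaffold KB672813.1.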
Transcription Factor Domain
- TF Family
- NCU-G1
- Domain
- NCU-G1 domain
- PFAM
- PF15065
- TF Group
- Unclassified Structure
- Description
- NCU-G1 is a family of highly conserved, proline-rich nuclear proteins with a molecular weight of approximately 44 kDa. Expression is especially high in human prostate, liver and kidney. NCU-G1 is a dual-function family: it can act as a transcription factor and as a nuclear receptor co-activator that stimulates the transcriptional activity of peroxisome proliferator-activated receptor-alpha (PPAR-alpha) [1].
- Hmmscan Out
-
#  of  c-Evalue  i-Evalue  score  bias  hmm coord from  hmm coord to  ali coord from  ali coord to  env coord from  env coord to  acc
1  1   1.1e-119  1.3e-115  385.6  1.1   1               352           58              396           58              397           0.94
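The hmmscan row above can be read into named fields to check how much of the profile and of the protein the hit covers. This is a minimal sketch: the column names are shortened versions of the header in this record, `#` and `of` are read as domain index and domain count following the usual HMMER per-domain layout, and `parse_domain_row` is a hypothetical helper.

```python
# Column names are shortened from the header in this record (assumption:
# "#" and "of" are the domain index and the total number of domains).
COLUMNS = [
    "domain_index", "domain_count", "c_evalue", "i_evalue", "score", "bias",
    "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to", "acc",
]
INT_COLUMNS = {
    "domain_index", "domain_count",
    "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to",
}

# Values copied from the record.
ROW = "1 1 1.1e-119 1.3e-115 385.6 1.1 1 352 58 396 58 397 0.94"


def parse_domain_row(row):
    """Turn one whitespace-separated domain row into a dict of typed values."""
    fields = row.split()
    if len(fields) != len(COLUMNS):
        raise ValueError(f"expected {len(COLUMNS)} fields, got {len(fields)}")
    return {
        name: int(value) if name in INT_COLUMNS else float(value)
        for name, value in zip(COLUMNS, fields)
    }


hit = parse_domain_row(ROW)
# Inclusive spans along the profile HMM and along the protein sequence.
hmm_span = hit["hmm_to"] - hit["hmm_from"] + 1
ali_span = hit["ali_to"] - hit["ali_from"] + 1
print(hmm_span, ali_span)
```

With these values the hit covers 352 profile positions and 339 residues of the protein (positions 58 to 396).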
Sequence Information
- Coding Sequence
- ATGTTTTcgtcaaacaaaacgaatccgTGCTCCGCCATCTTCAGCATCACCGTTTTTCTGCTGTGCCTGCTTAGCGGAGCCCTTTGTGATGAGCCCAGCAAATTGCAGCGAAAACTAACGGCATCGCTCAATCCCGGGTGCAAGGGCATTTGCGAGAATAACACCGCCATCACACTGGTGCACATTGCGGCCGAATCCGACACGGATACGATTCACTACGTGTGGGACTTCACGGGCAAGCCAACGATCCTGGTGGCCCTCACGAGCAAGCATGCCGAATTCCATATCGACTGGCAGAATCTGATGGACAGCCAGCCGGAATCGGTGCGCTTCAGTGAGGTGCCGCAGTACACTTTCATGGCGGTTATCAATCGGATTTTCCAGTACGATGACGCCAACGATCGTGCGATGCTGGATGATGGCTCTAACGTGTTCGTATATGACCCGCATAATTTCACCTGGAACCGGAGCCTGCTGTGGTCGAACGAAAAGGACGTTATGATGGCAATCAACGCGGGCGACGATTTCCTGTTCAAGCTTAATGCGTACTCAACGAAAGATCACGGCATGGACTTTCCCCATCTGCTACACTCGTCGAACTCCACGCAAATCGACATAGTGTTCAACAACATAACGAACCGATTCGTAAATCCACGCTTCGCGATCGAACTGCTGTTCGTCGTGTCGGAACAGGCCGTCGTCGGCTCAGACTTCGAGGTGACGAAGCGCAAAACACTGGACGACGAACACACACCGGGCATTTTCGAAATCGTCGATGTCCTGTCGCCAGGCGCGTTCACCTTTTCGGCCGGCGGTTTTATCGAGTATCGGCCCGTGTCGTACACGCACCCGGAACGAGACGTCGCGACGTCGACCGAAACGCGTCAGAGCCAACCGGTCACAGTGCCGTCACCGGCGGCAACCCTTCAGACGACGCTGGCCTACGCGGTGTACGGCACGAAGCTGGACAGCTTGCTCGTGCAAGGCATGAACGTGTCGTTCGGCGTTAGCGAGGATGGATTCTACCGCAAGACGAACTACACCACCATGACGTTCCAGGTCGGGTACGGTATGCCGCCGGTTGAGGAGCTGTCCGCGTTCGTGCTGATTGTGGCCGGCATCGGTATCGGCAttccgctggtggtgctggtggcgagCGTCATTTACGTATGCGTAAAGAAACTCCGCAACCAGGACCGGTTCCAGTTGCAGCGGTTGTAA
- Protein Sequence
- MFSSNKTNPCSAIFSITVFLLCLLSGALCDEPSKLQRKLTASLNPGCKGICENNTAITLVHIAAESDTDTIHYVWDFTGKPTILVALTSKHAEFHIDWQNLMDSQPESVRFSEVPQYTFMAVINRIFQYDDANDRAMLDDGSNVFVYDPHNFTWNRSLLWSNEKDVMMAINAGDDFLFKLNAYSTKDHGMDFPHLLHSSNSTQIDIVFNNITNRFVNPRFAIELLFVVSEQAVVGSDFEVTKRKTLDDEHTPGIFEIVDVLSPGAFTFSAGGFIEYRPVSYTHPERDVATSTETRQSQPVTVPSPAATLQTTLAYAVYGTKLDSLLVQGMNVSFGVSEDGFYRKTNYTTMTFQVGYGMPPVEELSAFVLIVAGIGIGIPLVVLVASVIYVCVKKLRNQDRFQLQRL
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00106407; iTF_00103020; iTF_00094484; iTF_00095080; iTF_00097781; iTF_00100963; iTF_00095654; iTF_00106991; iTF_00108400; iTF_00099665; iTF_00101684; iTF_00093461; iTF_00100313; iTF_00105146; iTF_00104381; iTF_00103678; iTF_00102368; iTF_00105836; iTF_00096392; iTF_00109151; iTF_00094016;
- 90% Identity
- iTF_00103020;
- 80% Identity
- -