Pnir065575.1
Basic Information
- Insect
- Pleistodontes nigriventris
- Gene Symbol
- CAMTA2
- Assembly
- GCA_903653215.1
- Location
- CAHLDJ010000234.1:90479-114714[-]
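The Location field packs the contig accession, the coordinate range, and the strand into one string. A minimal sketch of splitting it apart is shown here. The `Location` type and the `parse_location` and `span_length` helpers are hypothetical names introduced for illustration, and the length calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Assumed layout: <contig>:<start>-<end>[<strand>], as in the Location field.
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


def span_length(loc: Location) -> int:
    """Genomic span in bp, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("CAHLDJ010000234.1:90479-114714[-]")
print(loc.contig, loc.strand, span_length(loc))
```

The span covers the whole gene locus on the contig, so it is expected to be much longer than the coding sequence whenever introns are present.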
Transcription Factor Domain
- TF Family
- CG-1
- Domain
- CG-1 domain
- PFAM
- PF03859
- TF Group
- Unclassified Structure
- Description
- CG-1 domains are highly conserved domains of about 130 amino acid residues that contain a predicted bipartite nuclear localization signal (NLS). They are named after a partial cDNA clone, isolated from parsley, that encodes a sequence-specific DNA-binding protein [2]. CG-1 domains are found in CAMTA proteins (CAlModulin-binding Transcription Activators), transcription factors that also contain a calmodulin-binding domain and ankyrin (ANK) motifs [1].
- Hmmscan Out
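| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 1 | 9.2e-44 | 6.2e-39 | 134.2 | 0.6 | 11 | 116 | 4 | 107 | 1 | 107 | 0.98 |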
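The single domain hit reported above can be read into named fields to make the coordinates easier to work with. This is a rough sketch that assumes the 13 whitespace-separated values appear in the same order as the column header. The `DomainHit` class and its field names are hypothetical labels chosen for illustration, not part of any HMMER API.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    index: int        # "#"  : domain number
    total: int        # "of" : number of domains reported
    c_evalue: float   # conditional E-value
    i_evalue: float   # independent E-value
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float

    @property
    def ali_length(self) -> int:
        # Aligned residues on the protein, coordinates taken as inclusive.
        return self.ali_to - self.ali_from + 1

    @property
    def hmm_length(self) -> int:
        # Positions of the profile covered by the alignment.
        return self.hmm_to - self.hmm_from + 1


def parse_domain_row(row: str) -> DomainHit:
    """Parse one whitespace-separated domain row (assumed column order)."""
    f = row.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}")
    return DomainHit(
        int(f[0]), int(f[1]), float(f[2]), float(f[3]), float(f[4]),
        float(f[5]), int(f[6]), int(f[7]), int(f[8]), int(f[9]),
        int(f[10]), int(f[11]), float(f[12]),
    )


hit = parse_domain_row("1 1 9.2e-44 6.2e-39 134.2 0.6 11 116 4 107 1 107 0.98")
print(hit.ali_length, hit.hmm_length)
```

Under these assumptions, the alignment places the CG-1 domain at residues 4 to 107 of the protein, matched against profile positions 11 to 116.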
Sequence Information
- Coding Sequence
- atgtccATAGAGATCGCAGCGATACTAATAAGCTTTCAGCGTCACGCAGAATGGCAAAGTCGGGAGGTGAAGGTGCGACCTCGAAGTGGCTCGATGCTACTTTACTCAAGGAAGAAAGTTCGTTATCGAAGAGATGGCTACTGTTGGAAAAAGCGAAAAGATGGAAAAACCACGAGAGAAGATCACATGAAACTCAAGGTCCAAGGCGTGGAGTGCATCTACGGCTGTTACGTGCATTCGGCGATCCTGCCGACGTTTCACCGGCGTTGCTACTGGCTCCTCCAGAACCCGGACGTCGTGCTCGTGCACTACCTGAACGTACCCTACCCGGACGGGGATGCGAAGCTGGCGGCCCTGCCGCCGTGTCTCGCCCTGCCGCCCGACAAGAAGGAGTGGACGCGCGACGAGTTGGCCTCCCAGCTGAGGCCTATGTTCCTGGGCGGCGACGAGGCCGAGCAGCCCGGCAACGCGCACGGCCTTGGCGGACACTCCGGCCATCCCGTCGACATGATCGTGTCCCAGCTGCTGGACCGTCAAAGGGCTTCCTCGGCAACCTCCAGCACCGCCGCCCAGCTCGCACCCAGGAGACTAACACCCGACAATCAGgTCTCTTCGACTACCGGAGGTCAACAGCCAAGTACGACGGCGTCGACGACTCCGCGAGTTTACTCGAGACATTCCCACTCCACTCAGAGCCAGCAGCCGGCTCCACTAGTTTTAAGTTTACAACAAATCCAAGGCGGCGGGGGTCTACTCATCCTCAACAGTCAGCCATATCATCaccagcagcaacaacaacaacagcagcagcagcagcaggtcCAGCAGCAACcgtcgcagcagcagcagcagcagagcCAGACACAAAGCCAGGTGGATATGCAACAGGTGACggaacagcagcagcagcagcagcacgtGATTCAGCAAAACAGTATCGACCGCGAGCAGCAGACTCAACAACACGAGATCGAATCCAAGGATAACAGCAGTCATGGCAATGGAACTGACTGTACATCCCAGCAATCCGTACCTCCGCCGCCAGTCGTAGCGGACTTCGTCGAGACGCTGGACCTCAGCCAGGAGGACATCCAGCGCACGCTCTCCGCGAATATGGTGCATCCGTCGCCGTCGCCCTCCCCAGCCGATAACAATATCATCAATCCCATGGACTTCATCGACTCGACCGACGATGTCCTCGTGAATCTCGACGCGTTCGACATGTTCAGCGATCTGCCCGAGCTACATGACTTCGAGGCCGCGGCGAGCGCCGCCGAGGCCGAGACCAAGACTGATATCATGCGAATAAGCCCGGAGAGTGAGGGACACTGTCATCCGGGTACCACCGTCCACATCGCCGAGTACAGCCCCGAGTGGAGTTACACCGAGGGTGGCGTTAAGGTGCTGGTGGCTGGACCGTGGACCGGTGGTGGTTCCCAGTCGTACTCTATCCTGTTCGACGGTGAGCCCGTGGAGGCTTGTCTGGTGCAACCGGGTGTGTTGCGCTGCCGGTGTCCGGCTCACGCGGCCGGAGTCGCCTCGTTGCAAGTCGCGTGCGACGGCTTCGTCGTCTCCGACAGCGTGGCTTTCGAATACCGCAGACCACCCCAGAACGAGCCCAGCCCCGAGAAGGCTTTGCTCGATCGACTGGCCGACGTGGAAACGCGGCTCCAGGGGCCGGGACCACCCTCGCCAGCCGCGCACCTCGAGGAGCGACTCGTCGCCTACTGCCAGGACGCGGTGGTGAGGCCTTGGCGCACGGGAGCGGAGCCGCTGCAGTCGGGGGGGCCGACGCTGCTGCACCTCGCCGCGGGCCTCGGCTACTCGAGGCTCGCCTGCGCCCTGCTGCACTGGCGCGCCGAGAACCCCAGCAGCGTCCTCGACGCCGAGGTCGACGCGCTGCGCCAGGACGCGGCCGGACTCACGCCCCTCGCCTGGGCCTGCGCGGCCGGCCACGCCGACACAGCCAGGATCCTCTATCGATGGAATGCGATGGCT
CTGCGCGTGCGAGATTGTCAGAACAGAAGTGCGACCGAGCTGGCCGCGGAGAACGGCCACAGCGCTATCGCCGAAGAGCTGAATCGGTTGGAGGCTCGCCGGCAAGACGAGAGGCTCTTCTTGAGACCGGCCAGCCCGAGCCCTAGGAGGCCCTCCCAAGACAGCGGTCTCGATCTGGCTCTCTGTGGTTCACCGCTCCTGGACAATATGGAGCTGTTGCAGGATGATGAGTCGTCATTAGACCTCGGCCAGCAGGGCATGGAAAGCGCCCCGAGTCCCTCGGAGTCCGTAGGGGAGGAGGACGCGAGGGTCTTGACCTTGGCCGAGCAAATTATAGCCGCTCTACCCGAGAGGATCAAGAGAGAGCAATGTGACCCCCCTTCTCCGTCCTCGCCGCCACCGCCACCGCCGTTGCCGCAGCTGGAGGATGCTTTCATGGAACAAATGCCCCTCGACACCGGCGAGCTGTTCGACTCGTACCGCGAGTGCAGTGGCGGCGCGGCCTCCGTCTCCGACGCGGACGCTGAGGCGAGCCCGTCCTCGCCGTCCAGCAGCTGCTTGACCCCGGACTCGCCCTCGCCCCCGCCGACCACCGCCGACTTCTGCGACTTCCTCCAGCTCCAGCTGCAGCTCGACAGCGGGGTCGGCTCCTCGGGCAGCAACTGCTCGACCGACAAGGTGATCGGGAGCGCGGCGATCGCGGCGAGCAGCCCGGCCAACCCCTCGGCCGCGGCCGGGGCGAGCCTCGCCGGCGGCGACGGCGAGGCCGACCTCAGCAGACTCACCCTCTCGGACAGGGAGCAGCGCGAGCTTTACCAGGCGGCGCGCATGATCCAGAAGGCGTATCGCAGCTACAAGGGCAGGCAGAGGCAGGAGGAGGCGGAGAGGCACGCGGCTGTGCTCATACAGCAGTATTATCGCCGGCACAAGCAGTACGCTTATCACAGGCAGGCGACGAAGGCCGCGCTGGTGATTCAGAACAACTATAGGAATTACCGGGCGCGGCCGAGCGCCGCGGCCAGCGCGAGGCAGCAGGCGGTGCACCAGCAAGCGGCGCATCAGGCGGCGCGGAAGATCCAGCAGTTCATGCGGCAATCAAAAATCAAACTGCAGAACGCCAGGGCCGTCGCAAGCGCGAGCGCGAGGCCGGGACAGGTGCCCACTTCCAGGGTGGCTGCTGCCCCCCCAAGCTCGCACTTATCTAGCCCAGGTGCCAGCCTGGCAGCCGGCAATTGTCCCGAGACGACCTAG
- Protein Sequence
- MSIEIAAILISFQRHAEWQSREVKVRPRSGSMLLYSRKKVRYRRDGYCWKKRKDGKTTREDHMKLKVQGVECIYGCYVHSAILPTFHRRCYWLLQNPDVVLVHYLNVPYPDGDAKLAALPPCLALPPDKKEWTRDELASQLRPMFLGGDEAEQPGNAHGLGGHSGHPVDMIVSQLLDRQRASSATSSTAAQLAPRRLTPDNQVSSTTGGQQPSTTASTTPRVYSRHSHSTQSQQPAPLVLSLQQIQGGGGLLILNSQPYHHQQQQQQQQQQQQVQQQPSQQQQQQSQTQSQVDMQQVTEQQQQQQHVIQQNSIDREQQTQQHEIESKDNSSHGNGTDCTSQQSVPPPPVVADFVETLDLSQEDIQRTLSANMVHPSPSPSPADNNIINPMDFIDSTDDVLVNLDAFDMFSDLPELHDFEAAASAAEAETKTDIMRISPESEGHCHPGTTVHIAEYSPEWSYTEGGVKVLVAGPWTGGGSQSYSILFDGEPVEACLVQPGVLRCRCPAHAAGVASLQVACDGFVVSDSVAFEYRRPPQNEPSPEKALLDRLADVETRLQGPGPPSPAAHLEERLVAYCQDAVVRPWRTGAEPLQSGGPTLLHLAAGLGYSRLACALLHWRAENPSSVLDAEVDALRQDAAGLTPLAWACAAGHADTARILYRWNAMALRVRDCQNRSATELAAENGHSAIAEELNRLEARRQDERLFLRPASPSPRRPSQDSGLDLALCGSPLLDNMELLQDDESSLDLGQQGMESAPSPSESVGEEDARVLTLAEQIIAALPERIKREQCDPPSPSSPPPPPPLPQLEDAFMEQMPLDTGELFDSYRECSGGAASVSDADAEASPSSPSSSCLTPDSPSPPPTTADFCDFLQLQLQLDSGVGSSGSNCSTDKVIGSAAIAASSPANPSAAAGASLAGGDGEADLSRLTLSDREQRELYQAARMIQKAYRSYKGRQRQEEAERHAAVLIQQYYRRHKQYAYHRQATKAALVIQNNYRNYRARPSAAASARQQAVHQQAAHQAARKIQQFMRQSKIKLQNARAVASASARPGQVPTSRVAAAPPSSHLSSPGASLAAGNCPETT
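The coding and protein sequences above can be checked against each other by translating the former. The sketch below uses the standard genetic code and plain Python rather than a bioinformatics library. The `translate` helper is a hypothetical name, and the choice of the standard code (NCBI translation table 1) is an assumption, since the record does not say which table was used. Mixed-case input is upper-cased before lookup.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks, ignore case
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last four codons of the coding sequence above.
print(translate("atgtccATAGAGATCGCAGCGATACTAATA"))
print(translate("GAGACGACCTAG"))
```

Passing the full coding sequence, with its line break removed, to `translate` gives a direct comparison against the listed protein sequence.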
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00668417; iTF_00305635; iTF_00734564; iTF_00326375; iTF_00714373; iTF_00739080; iTF_01381324; iTF_00712846; iTF_01380599; iTF_01469671; iTF_01099178; iTF_01112587; iTF_01113417; iTF_01110110; iTF_01110950; iTF_00969607; iTF_00968815; iTF_01111759; iTF_00845263; iTF_01428272; iTF_00079921; iTF_00463985; iTF_00877845; iTF_00073720; iTF_01035414; iTF_01037003; iTF_00286329; iTF_01488569; iTF_00690171; iTF_00309500; iTF_01190333; iTF_01003518; iTF_00738340; iTF_01006056; iTF_01004239; iTF_01005076; iTF_00756632; iTF_01465182; iTF_01466207; iTF_01468850; iTF_01524021;
- 90% Identity
- iTF_01113417;
- 80% Identity
- -
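The cluster member lists above are semicolon-separated with a trailing semicolon, and an empty cluster is shown as `-`. A small sketch of turning one such line into a list of identifiers follows. The `parse_cluster` helper is a hypothetical name, and treating `-` as "no members" is an assumption based on how the 80% Identity entry is displayed.

```python
def parse_cluster(value: str) -> list[str]:
    """Split a semicolon-separated cluster line into member identifiers."""
    value = value.strip()
    if value in ("", "-"):  # assumed placeholder for an empty cluster
        return []
    return [item.strip() for item in value.split(";") if item.strip()]


print(parse_cluster("iTF_01113417;"))
print(parse_cluster("iTF_00668417; iTF_00305635; iTF_00734564;"))
print(parse_cluster("-"))
```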