Ocom074270.2
Basic Information
- Insect
- Ophraella communa
- Gene Symbol
- CAMTA1
- Assembly
- GCA_035357415.1
- Location
- CM068992.1:25007559-25042523[+]
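The location string above packs the sequence accession, start, end, and strand into one field. A minimal sketch of how it could be split apart follows; the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

# Pattern for strings shaped like "SEQID:START-END[STRAND]".
LOCATION_RE = re.compile(
    r"(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]"
)

def parse_location(location: str) -> dict:
    """Split a location string into its parts (hypothetical helper)."""
    match = LOCATION_RE.fullmatch(location)
    if match is None:
        raise ValueError(f"unrecognized location format: {location!r}")
    start, end = int(match["start"]), int(match["end"])
    return {
        "seqid": match["seqid"],
        "start": start,
        "end": end,
        "strand": match["strand"],
        # Assumption: 1-based inclusive coordinates, so length is end - start + 1.
        "span": end - start + 1,
    }

loc = parse_location("CM068992.1:25007559-25042523[+]")
print(loc["seqid"], loc["strand"], loc["span"])
```

Under that coordinate assumption, the gene spans roughly 35 kb on the forward strand of CM068992.1.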
Transcription Factor Domain
- TF Family
- CG-1
- Domain
- CG-1 domain
- PFAM
- PF03859
- TF Group
- Unclassified Structure
- Description
- CG-1 domains are highly conserved domains of about 130 amino-acid residues that contain a predicted bipartite nuclear localization signal (NLS). They are named after a partial cDNA clone isolated from parsley that encodes a sequence-specific DNA-binding protein [2]. CG-1 domains are associated with CAMTA proteins (CAlModulin-binding Transcription Activators), which are transcription factors containing a calmodulin-binding domain and ankyrin (ANK) motifs [1].
- Hmmscan Out
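| # | of | c-Evalue | i-Evalue | score | bias | hmm from | hmm to | ali from | ali to | env from | env to | acc |
|---|----|----------|----------|-------|------|----------|--------|----------|--------|----------|--------|------|
| 1 | 3 | 4.5e-15 | 2.3e-10 | 42.6 | 0.2 | 78 | 116 | 16 | 53 | 3 | 53 | 0.91 |
| 2 | 3 | 0.4 | 2e+04 | -2.3 | 0.0 | 27 | 60 | 187 | 220 | 180 | 227 | 0.78 |
| 3 | 3 | 0.41 | 2.1e+04 | -2.4 | 3.9 | 12 | 87 | 1172 | 1253 | 1166 | 1263 | 0.62 |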
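Of the three domain hits reported above, only the first has a small independent E-value. The sketch below shows one way to separate it from the other two. The values are copied from the hmmscan output; the `DomainHit` type and the `1e-3` cutoff are illustrative assumptions, not settings documented for this database.

```python
from typing import NamedTuple

class DomainHit(NamedTuple):
    """One per-domain row from the hmmscan output (subset of columns)."""
    index: int
    i_evalue: float
    score: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    acc: float

# Values copied from the three rows reported for this protein.
hits = [
    DomainHit(1, 2.3e-10, 42.6, 78, 116, 16, 53, 0.91),
    DomainHit(2, 2e+04, -2.3, 27, 60, 187, 220, 0.78),
    DomainHit(3, 2.1e+04, -2.4, 12, 87, 1172, 1253, 0.62),
]

# Assumption: a hypothetical independent E-value cutoff chosen for illustration.
I_EVALUE_CUTOFF = 1e-3

significant = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]

for h in significant:
    print(f"domain {h.index}: protein residues {h.ali_from}-{h.ali_to}, "
          f"HMM positions {h.hmm_from}-{h.hmm_to}, score {h.score}")
```

With this cutoff, only the hit aligned to protein residues 16-53 is kept. It covers HMM positions 78-116, so it matches only part of the CG-1 profile.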
Sequence Information
- Coding Sequence
- ATGGAAGATATTACGTTGAGCGAAAAAGCGGAAAATTTTACGTACAGGCAaTGTATCTATGGATGTTACGTACATTCAGCTATTTTGCCAACGTTCCATCGCAGATGCTACTGGCTCCTGCAAAACCCAGACATCGTTCTTGTCCATTATTTGAACGTTCCTTATCCAGACGATAACAAATTGGCCGTTATAACCCCGAGTTTGGCCCTTTGGGCCGACAAAAAAGAATGGACGAAAGAAGAGCTAGTTTCTCAATTAAAACCGATGTTTTTTAGCGAAGACGAACCGGACATTAACAACGAACTAGAAATATCGaattttaaattacaGACTGCAGAAACAGTGGAAGCTATCGTAAGTCAATTAATGGAGAAACAAAGGGTAGCCAGACAAGCAGCTTTAGTTAAACAATTAGAATGTGGTTGTCCGGATTCGACTTGCGCTGATGGAAAAAGTTGTTCCCATCCAATGAGGAGGATAACTGCTGCCAAAGAGGTTGCTAGCTCACCCTTACAGAGTTCAAGCTCGAATTCAAGACCACCCAGCAATTCTGATAATAACAATCAGGTTTCTTCCACTACTGGTTCCGGTTCCATGTTGTTGGGTAACAGTAACTCTCCAAGGGTTTATTCGAGAGACTCTAGAAATCAACAAGTTAAGACGGTTCAGAATAACACAAATTGTTCATCCAGTTCAGGAAACACACCTCCTTTAGTATTAAGTCTGTCTCAAATTCAAGGAGGTGGAGGTTTACTCATTCTTAATAGCAATTCAGGCAACAACGCTAACAACCATCAGAACCTTGTAAATCCTGTGTCAGTAGCAAATTTCGTTTGTAATTCAAATCGTGTGATTCCCAAAGACAGAGCTAGTCATCTGGTACTTAAACAAGAAATCATGGATACGAATCCTTCGTGTCTGCACGCCTCTAAACAAAATTCGAAAGGAGTACCAAGAGAAATGAAGATGGAATTAGGAGAAAATCACAGACAATCAGTTTTTCATGATGGAAATATGTATAACAATAGaaatcaCCATCAACAAACAGAAGTAGTAATGTCAAGTGCTCCATCAACTCCTTCAAAACATATGGATACGACACAGGAAGATGTCGTAGATGAGTACAAACATCAAAATTTCTGTAACGAAACAGTGGTACTGTTGGGTACAGATTCTAGTGGTTCTCTAGTTAGTAAAAGCGATGGTTCTTCAATCATAAATGGAGGGTTCTTTAATGAGACTTTAGATCTATCTCAGGAGGATATCCAAAGAACTCTCTCCGCCAACATGCCATTGTGTTCAACTGAACTTAATAATCAAAACTCTGCATCCGAAACGAACGGATCCGACCATCATCAAAACAAACCGCAGTCCCAACAGCAACCTCTAACTAATGAAATCAATCCAATGGACTTTATAGATAGTTGCGATGTTGTTGTATCACCGACACATGTTGTAGATGATGACGTTTTCGTCAATTTAGATGCTTTCGATATGCTCGGCGAGTTTCCAGATTTAGAAAGTTTAGATTCTGGTCATGCCGGACTCCTAGATGTTAATCCGTCCGAAAATAGAAGTAACACTAAGGCACAACCAGAACAACAAAACAATCATCATATGGAAGGTTCGGCAAAAATAACCGATTATTCCCCTGAATGGGCTTATCCGGAAGGGGGTGTTAAAGTTTTGGTCACAGGACCATGGCATTCATCTGGACCATATACAGTACTTTTTGATACATTCCCTGTACCAACCACTTTAGTCCAAAGTGGTGTGTTACGATGTTATTGTCCAgCTCACGAAGCTGGTTTAGCAACTTTACAAGTAGCATGCGACGGCTATGTGATTTCCAACTCTGTGATATTCGAGTACAAGTTGCCACCGAGAGAAGAACAAGTGGCTGCACCTGAACCGAAAGTAGAAAGATGTAATGATAATTTACTTAAATTCACTCTGCTTCAAAGACTAGAAGCTATGGATGACAGACTGCAGATCAAA
CAGGAACCTGTTGATAACGATGTGGTCGAAGACACGGCTCTTTTTGGacaaccaaattttgaagacaggCTGGTTACCTTTTGCCAAAATATGACGTCGAGAATATGGAGACACGGCGAAGAACTTAGTGTCTCATGGTTTGCCAGCCACAGAGGGATGACGCTATTACATCTAGCAGCTTCCTTAGGTTATTCTCGTTTAGTATGCGCCATGTTGCACTGGCGAGCTGAAAATTCGTCTTTACTACTTGAAACTGAAGTCGATGCACTAAGTCAAGATGAAGATGGATTTACGCCACTGATGTGGGCGTGTGCTAGAGGTCATATTGAAACAGCAGTTATGCTCTATAAATGGAACCATAACGCCCTCTATATGAAAAACATTTCAAATCAAACTGCTCTTGAATGTGCCAGAGTAACTAATCACACAGATCTCGttaaagaaatagaaaaattagaatTGAGGAGAGACAAAGCAAATTTGATGCTCCAATCGAATCAATCCTCCACAGAAGTCATGTCGCCTACAGTAATATCACCCGCCAGTTCAATTGGATCTCTTATTTCTATAGCTTCAACTAGTAAATCTCATGATGGTGTATTCTTAAGGCCTGGAGCAGTTACAAGAAGCGAAATTAACAAGTTCAAGATTTTGAACGTCGACTTAGACTCGGATGTCAACAGTAAACTATTAACTTCAGCTCCAAGTCCCATGCTTTCGGATTTTTCATCAAGTGTTAGAGGTAGTAATGGTCAAAAATTGGTCAAGAGACCGTCGATCGACAGTGGAATCAATATGAGCTGTGGACCAGCTTCGGAATGTTATAGACCCAAGAGTTCAAAAGTATGGAACTCACGTGAAACACCAAAACTATCCAAGTTTGATCGAAGTATGTCTTTACCTCTTCAGACGTCCAATCTAAAAGATAACTCATTTGATAATGAGAGTTTGGAATCCCACAAAAAGATGGATTTTGCTTTATgCGAAATAGGGAGTGGCCAACGAAGCAGTAGTCCACTGATAGACGTAGAAGGTGTTTCGGACGATGAGAATGAGCTCCCCAACAGTACTGTAGgGGAGCAAGATGCTAGAGTTTTGACATTGGCTGAACAAATCATTGCTGCAATGCCTGAAAGAATCAAGaatGAAAGCGAAGAAATGTTGATGGACAAGAATCCGTGTGCTTCAGATAATCTACAAACATCTGATACGCTATCAGATGTTTTTATGGAACCATTATTGGATCAATCATCATCTAGTTTTGAATCTACTGAATTCAGCTTTGAATTTTCCGATAATAATTATAGatACTATGATGTTAGCACACCCTGTTCTAGTTTGAGTCCAGCTTCTTCCAGTTGTTTGCAGTCACCTTGCTCCTTTACACTCGATTCTCCATCTCCACCACCTACTACGGCCGACTTTTGTGAGTTCTTCCAAGCTTCTGGTACTGTTTTTGAAAACGATTTCTCAAATCTCACATTATCagATCGTGAACAGAGGGAATTGTATGAAGCTGCAAAAATAATTCAGAAAGCATATCGTTCATACAAGGGACGTCAACAACAAGAGCAAAATAAAGAGAGACAAGCAGCTGTTGTCATCCAGAATTACTATAGACGTTATAAGCAATACGCTTACTACAAACAAATGACTCATGCTGCTATGGTGATTCAGAATGGTTACCGCTCCTACTGTGAACATAAAAGGTTTAAGAAAAGCCAAGAGGCTGCTGTGTGCATTCAAAACTATTATCGAAATTACAAGGAGCAGGGGGGGAGAAATAGTAGAGAAGGGACTCCTGCCACGGCAGGGTTGAAGTAA
- Protein Sequence
- MEDITLSEKAENFTYRQCIYGCYVHSAILPTFHRRCYWLLQNPDIVLVHYLNVPYPDDNKLAVITPSLALWADKKEWTKEELVSQLKPMFFSEDEPDINNELEISNFKLQTAETVEAIVSQLMEKQRVARQAALVKQLECGCPDSTCADGKSCSHPMRRITAAKEVASSPLQSSSSNSRPPSNSDNNNQVSSTTGSGSMLLGNSNSPRVYSRDSRNQQVKTVQNNTNCSSSSGNTPPLVLSLSQIQGGGGLLILNSNSGNNANNHQNLVNPVSVANFVCNSNRVIPKDRASHLVLKQEIMDTNPSCLHASKQNSKGVPREMKMELGENHRQSVFHDGNMYNNRNHHQQTEVVMSSAPSTPSKHMDTTQEDVVDEYKHQNFCNETVVLLGTDSSGSLVSKSDGSSIINGGFFNETLDLSQEDIQRTLSANMPLCSTELNNQNSASETNGSDHHQNKPQSQQQPLTNEINPMDFIDSCDVVVSPTHVVDDDVFVNLDAFDMLGEFPDLESLDSGHAGLLDVNPSENRSNTKAQPEQQNNHHMEGSAKITDYSPEWAYPEGGVKVLVTGPWHSSGPYTVLFDTFPVPTTLVQSGVLRCYCPAHEAGLATLQVACDGYVISNSVIFEYKLPPREEQVAAPEPKVERCNDNLLKFTLLQRLEAMDDRLQIKQEPVDNDVVEDTALFGQPNFEDRLVTFCQNMTSRIWRHGEELSVSWFASHRGMTLLHLAASLGYSRLVCAMLHWRAENSSLLLETEVDALSQDEDGFTPLMWACARGHIETAVMLYKWNHNALYMKNISNQTALECARVTNHTDLVKEIEKLELRRDKANLMLQSNQSSTEVMSPTVISPASSIGSLISIASTSKSHDGVFLRPGAVTRSEINKFKILNVDLDSDVNSKLLTSAPSPMLSDFSSSVRGSNGQKLVKRPSIDSGINMSCGPASECYRPKSSKVWNSRETPKLSKFDRSMSLPLQTSNLKDNSFDNESLESHKKMDFALCEIGSGQRSSSPLIDVEGVSDDENELPNSTVGEQDARVLTLAEQIIAAMPERIKNESEEMLMDKNPCASDNLQTSDTLSDVFMEPLLDQSSSSFESTEFSFEFSDNNYRYYDVSTPCSSLSPASSSCLQSPCSFTLDSPSPPPTTADFCEFFQASGTVFENDFSNLTLSDREQRELYEAAKIIQKAYRSYKGRQQQEQNKERQAAVVIQNYYRRYKQYAYYKQMTHAAMVIQNGYRSYCEHKRFKKSQEAAVCIQNYYRNYKEQGGRNSREGTPATAGLK
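The coding and protein sequences above can be checked against each other by translating the coding sequence. The sketch below assumes the standard genetic code (the record does not name a translation table) and uses only the first 30 and last 12 nucleotides of the coding sequence as input; `translate` is a hypothetical helper name.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()  # the record mixes upper- and lower-case bases
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt and last 12 nt of the coding sequence listed in this record.
cds_start = "ATGGAAGATATTACGTTGAGCGAAAAAGCG"
cds_end = "GGGTTGAAGTAA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first ten and last three residues of the listed protein sequence. Running the same function on the full coding sequence would give a complete consistency check.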
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01289983;
- 90% Identity
- iTF_00387235; iTF_00056581; iTF_00792170; iTF_00458916; iTF_00735555; iTF_00910738; iTF_01047269; iTF_00026041;
- 80% Identity
- -