Sinv008953.1
Basic Information
- Insect
- Solenopsis invicta
- Gene Symbol
- CAMTA2
- Assembly
- GCA_016802725.1
- Location
- NC:6464129-6529550[-]
Transcription Factor Domain
- TF Family
- CG-1
- Domain
- CG-1 domain
- PFAM
- PF03859
- TF Group
- Unclassified Structure
- Description
- CG-1 domains are highly conserved domains of about 130 amino acid residues that contain a predicted bipartite nuclear localization signal (NLS). They are named after a partial cDNA clone, isolated from parsley, that encodes a sequence-specific DNA-binding protein [2]. CG-1 domains occur in CAMTA proteins (CAlModulin-binding Transcription Activators), which are transcription factors that also contain a calmodulin-binding domain and ankyrin (ANK) motifs [1].
- Hmmscan Out
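| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|----|----------|----------|-------|------|----------------|--------------|----------------|--------------|----------------|--------------|-----|
| 1 | 2 | 1.7e-47 | 2.4e-43 | 146.2 | 2.5 | 2 | 116 | 200 | 312 | 199 | 312 | 0.98 |
| 2 | 2 | 1 | 1.4e+04 | -6.4 | 3.8 | 34 | 38 | 480 | 484 | 463 | 504 | 0.50 |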
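The two domain hits reported by hmmscan differ greatly in significance, and a short script can make that explicit. The following is a minimal sketch, not part of the original record: the `DomainHit` class, the `parse_row` helper, and the `1e-5` i-Evalue cutoff are illustrative assumptions, and the rows are copied from the values above. It assumes the alignment coordinates are 1-based and inclusive, as in standard HMMER output.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    """One row of the per-domain hmmscan table (hypothetical container)."""
    index: int
    of: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float

    @property
    def ali_length(self) -> int:
        # Assumes 1-based, inclusive alignment coordinates.
        return self.ali_to - self.ali_from + 1


def parse_row(line: str) -> DomainHit:
    """Parse one whitespace-separated domain row with 13 fields."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


# Rows copied from the hmmscan output of this record.
ROWS = [
    "1 2 1.7e-47 2.4e-43 146.2 2.5 2 116 200 312 199 312 0.98",
    "2 2 1 1.4e+04 -6.4 3.8 34 38 480 484 463 504 0.50",
]

hits = [parse_row(r) for r in ROWS]

# Illustrative cutoff (an assumption, not a threshold stated in the record).
I_EVALUE_CUTOFF = 1e-5
significant = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]

for h in significant:
    print(f"domain {h.index}: residues {h.ali_from}-{h.ali_to} "
          f"({h.ali_length} aa), i-Evalue {h.i_evalue:g}")
```

Under this cutoff only the first hit survives, which places the CG-1 domain at residues 200 to 312 of the protein. The second hit (i-Evalue 1.4e+04, negative score) is best treated as noise.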
Sequence Information
- Coding Sequence
- ATGAGGCAGGTAGTCGTGCTGGAGCCAGCTGGCAGCGTCCTGGTGATCAGACAGACCTCTCTGGGTGGTGGCGGCGTCGCCACTACCGGCAATGTCAACTCTAACCCTACGGGCAGCAGCGACACCCACGCCTTGGTCAATCACGCCGCTGTCTCCACGATCGCCTCCAACGATCACAATCAAAATAGGATAGTCGTGGTGCGCTCGTCTACCTCATCCACCTGCATGACCAGCTCGGCCGACAATCACGCATCGACCAACGGCAATAGCAACATCAACGTCAACGCGAACATTAATGGGAATCCGGGCAGTATCAGCTGTAAGCCGTCGTCGAAGGCCGGAATTGGCGGCGGTGCTTGTAAGCCGAATGAGAGAAACGACCACCTACGAAACGAGCACAAAAATCAGGACGGAGTTGCCAACAGCAACGAAACGAACCCTGCCGCTGGATGTCGTCCAAGGACGAGCAAGGAAGCCGCTCACGAGAACGTCGCCTCCGACAGGAAGCTCGATGGCGCGGATCCCATCTCCCATGGCGACCCGATAAAGCTGCCCGAAAATCTGGAAACCTTGCCGCGAGCTGAACACTTTCCAACTCAGAGGCATCGATGGAACACCAACGAGgAAATCGCCGCCATCCTAATAAGTTTTCAAAGGCACGCCGAATGGCAAAGTCGGGAGGTAAAAGTACGACCACGAAGTGGTTCAATGTTACTGTACTCGAGGAAGAAAGTACGTTACCGGAGGGACGGTTATTGttggaagaaaagaaaagatggaAAGACTACACGAGAGGATCACATGAAACTGAAGGTTCAGGGAGTCGAGTGCATCTACGGCTGTTACGTGCACTCGGCAATCCTTCCGACGTTTCATCGCCGGTGTTACTGGCTACTGCAAAATCCGGACGTCGTCCTCGTCCATTATTTGAACGTCCCTTATCCGGACGGTGATGCGAAGCTAGCGGCCCTGCCACCTTGCCTCGCACTGCCGCCAGACAAAAAGGAGTGGACGCGAGACGAGTTAGCGTCGCAATTAAGGCCCATGTTCCTCGGTGGAGATGACGATCCGAACAATCCTCATCTCACTCAGCACTCGAACCATCCCGTCGACATGATAGTCTCTCAACTGCTCGATAGGCAGAGGGCATCCTCTACTTCCACCACCAGCAACACTCAACTCGCACCTAGGAGACTTACGCCCGACAATCAAGTCTCTTCAACAACGGGAGGTCAACAACAATCAACGACAGCGGCCCCGGCTCCCCGCGTGTACTCAAGACATTCCCATTCCACTCAGAGTCAACAGCCGGCTCCTCTAGTTTTAAGTTTGCAACAAATCCAAGGCGGTGGTGGTCTCCTGATACTCAACAGTCAACCATATCATCatcagcaacaacagcaacagcagcagcagcagcagcaacagcagcagcagccacaacaacaacagcagcagcagcaacaacagcagcagcagcagccgcaaCAACAGCCACAACCGCAACAGCAGCAGAGTCAGCAGGTGGAGATGCAACAGGTCACAGAGCAACAAATCGTACCGCAGACTAGCGTCGACCGAGAACAGCAAACGCAGCAAGAAATTGACGCGCAGGAGAGTATGGACAGGTCTGCCGTGCAACCTCTGCCGATGGGCGGCTCCGGCTCCGAGGTGACCGATTTTGCGGAGACACTGGATCTTAGTCAGGAAGACATTCAGAGGACACTGTCGGCCAACATGGTGCCCCCATCGCCGTCGCCTTCGCCGGCGGACAACAGCATGATCAATCCGATGGACTTTATTGACTCGTCGGACGACGTGCTGGTCAATCTGGACGCGTTCGACGTATTTGGCGATTTGCCGGAGCTGCACGACTTCGAAGCCGAGCAGACGAAGCACGAGGAACAGCGAGGTGGTCCCGAGAATAACGTCGGATGTCATCCtggtacaaccgtccatattgCAGAATATAGTCCCGAGTGGAGTTACACCGAGGGT
GGTGTAAAGGTATTAGTTGCTGGTCCGTGGACCGGCGGTAGTGGTTCCCAATCGTATTCGGTACTTTTCGATGCGGAACCGGTCGAGGCATGCCTCGTGCAACCAGGTGTATTGCGTTGTCGATGTCCCGCGCACGCACCAGGAATAGCGTCCCTTCAGGTGGCCTGTGACGGTTTTGTCGTGTCCGATAGCGTTGCCTTCGAATATCGTAGAGCGCCGACAAGCGAACCTAGCCCAGAGAAAGCTCTGCTGGATCGTCTTGCGGATGTCGAGGCCCGTCTGCAGGGTCCCGGTCCGCCGTCTCCCGCAGCTCATCTGGAAGAGCGACTGGTTGCTTATTGCCAGGATGCTGTTGTCCGTCCTTGGCGAACTGGAGCGGAACCTCTTCAATCCGGTGGGCCTACCCTCTTGCATCTGGCGGCCGGGTTGGGATACTCCAGGTTAGCCTGTGCACTCCTTCACTGGAGAGCAGAAAATCCTAGTAGCATTTTGGACGCCGAGGTCGATGCTCTCAGGCAGGATAGCGCAGGTCTCACGCCACTTGCTTGGGCTTGTGCGGCAGGGCACGCCGACACCGCCAGGATTCTTTACAGATGGAATGCTATGGCACTACGCGTGAGGGACTGTCAGAATAGAACGGCGACCGAGCTGGCCGCGGAGAACGGTCACACGGCAATCGCCGAAGAACTGAATCAACTCGAAACCCGAAGGCAAGACGAGAGGCTTTTCTTGCGACCCGCCAGCCCTAGCCCTAGGCGGCCATCTCAAGACAGCGGTCTCGATCTGGCGTTGTGTGGCTCGCCGCTGCTGGACAACATGGAGTTGTTGCAAGAAGATGAGTCGTCGTTAGGCCTCAACGAACAGGGAATGGAGAGCGCTCCGACCCCTCAGGAGACCGTAGGGGAAGAAGACGCGAGGGTGCTGACATTGGCAGAACAAATTATAGCGGCGTTACCGGAAAGGATCAAGAGGGCGGAGGGTGAGTCTCCGTCTTCTTCCTCGCCACCACCACCCACGGCGCCTCTGTCACCCTTGGAAGACGCTCTCATGGAACAAATGCCTCTCGACTCCGGGGAACTGTTTGACTCGTATCGCGACTGCAGCGGCGGCGCCGCTTCGGTCTCGGATGCCGATGCCGACGCCAGTCCCTCGAGTCCCTCCAGCAGTTGTTTGACACCGGACTCGCCGTCTCCGCCGCCCACCACCGCCGATTTCTGCGAGTTTCTGCAGCTGCAGCTGCAGCTCGACGGCAATGGCAATGCTCACAGCGGTCAGTACTACTCGCAGAACGGCGGCGGTGAGCGCAAGTTGAACGGCATGATCGGTTCAGGGGCTATGACGGCGTTTGGCGGAAACGGCGACGGTAGCGAGGCGGATCTCAGCAGACTAACATTGTCTGATCGTGAACAGAGAGAGCTTTATCACGCCGCCAGGATGATTCAGAAAGCGTACAGGAATTACAAGGGTCGTCAGAGGCAGGAGGAGGCCGAGAGACATGCCGCCGTTCTGATCCAACAGTACTATCGTCGTCACAAACAGTACGCTTACCACAGGCAAGCCACGAAGGCAGCCTTGGTGATCCAGAACAACTATCGAAGTTATCGCGCACGACCGGGCTCGGCCAGCGCGAGACAGCAGGCGGTTCATCAGCAGGCGGCACATCAGGCAGCGAGGAAGATTCAACAGTTCATGAGGCAGTCTAAGATCAAACTGCAGAACGCCAAGGCCGTCGCAAACGGGAACGGGAGGCCGCCAGTGGTCACTTCACGGGCGGCTGCTGTCCCCCAAAGCTCGCCCTCATCTAGCCCAGGGGCCAGCCTAGTAGCCGCCAGCCCGGGGGTCACCTAG
- Protein Sequence
- MRQVVVLEPAGSVLVIRQTSLGGGGVATTGNVNSNPTGSSDTHALVNHAAVSTIASNDHNQNRIVVVRSSTSSTCMTSSADNHASTNGNSNINVNANINGNPGSISCKPSSKAGIGGGACKPNERNDHLRNEHKNQDGVANSNETNPAAGCRPRTSKEAAHENVASDRKLDGADPISHGDPIKLPENLETLPRAEHFPTQRHRWNTNEEIAAILISFQRHAEWQSREVKVRPRSGSMLLYSRKKVRYRRDGYCWKKRKDGKTTREDHMKLKVQGVECIYGCYVHSAILPTFHRRCYWLLQNPDVVLVHYLNVPYPDGDAKLAALPPCLALPPDKKEWTRDELASQLRPMFLGGDDDPNNPHLTQHSNHPVDMIVSQLLDRQRASSTSTTSNTQLAPRRLTPDNQVSSTTGGQQQSTTAAPAPRVYSRHSHSTQSQQPAPLVLSLQQIQGGGGLLILNSQPYHHQQQQQQQQQQQQQQQQPQQQQQQQQQQQQQQPQQQPQPQQQQSQQVEMQQVTEQQIVPQTSVDREQQTQQEIDAQESMDRSAVQPLPMGGSGSEVTDFAETLDLSQEDIQRTLSANMVPPSPSPSPADNSMINPMDFIDSSDDVLVNLDAFDVFGDLPELHDFEAEQTKHEEQRGGPENNVGCHPGTTVHIAEYSPEWSYTEGGVKVLVAGPWTGGSGSQSYSVLFDAEPVEACLVQPGVLRCRCPAHAPGIASLQVACDGFVVSDSVAFEYRRAPTSEPSPEKALLDRLADVEARLQGPGPPSPAAHLEERLVAYCQDAVVRPWRTGAEPLQSGGPTLLHLAAGLGYSRLACALLHWRAENPSSILDAEVDALRQDSAGLTPLAWACAAGHADTARILYRWNAMALRVRDCQNRTATELAAENGHTAIAEELNQLETRRQDERLFLRPASPSPRRPSQDSGLDLALCGSPLLDNMELLQEDESSLGLNEQGMESAPTPQETVGEEDARVLTLAEQIIAALPERIKRAEGESPSSSSPPPPTAPLSPLEDALMEQMPLDSGELFDSYRDCSGGAASVSDADADASPSSPSSSCLTPDSPSPPPTTADFCEFLQLQLQLDGNGNAHSGQYYSQNGGGERKLNGMIGSGAMTAFGGNGDGSEADLSRLTLSDREQRELYHAARMIQKAYRNYKGRQRQEEAERHAAVLIQQYYRRHKQYAYHRQATKAALVIQNNYRSYRARPGSASARQQAVHQQAAHQAARKIQQFMRQSKIKLQNAKAVANGNGRPPVVTSRAAAVPQSSPSSSPGASLVAASPGVT
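The coding and protein sequences above can be cross-checked by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly. The `translate` function is an illustrative helper rather than part of the database. It uppercases the input first, because the coding sequence contains lowercase (presumably soft-masked) stretches. Only the first 36 bases of the coding sequence are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with T, C, A, G
# at each position and the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Whitespace is removed and case is ignored. Ambiguous bases such as N
    are not handled and raise KeyError.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 36 bases of the coding sequence in this record.
cds_prefix = "ATGAGGCAGGTAGTCGTGCTGGAGCCAGCTGGCAGC"
print(translate(cds_prefix))  # MRQVVVLEPAGS
```

The output matches the first 12 residues of the protein sequence. Passing the full coding sequence, with its two lines joined, should reproduce the whole protein in the same way.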
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00393592;
- 90% Identity
- iTF_00730049; iTF_00015362; iTF_01254786; iTF_01476164; iTF_00129734; iTF_00128966; iTF_00126704; iTF_01520218; iTF_00730728; iTF_01099178; iTF_01099177; iTF_01354602; iTF_00885099; iTF_00765572; iTF_01228364; iTF_00898733; iTF_00417550; iTF_01423649; iTF_01408200; iTF_01407273; iTF_01421549; iTF_01409105; iTF_01405687; iTF_00729294; iTF_00769614; iTF_00279954; iTF_01087192; iTF_00452569; iTF_00125917; iTF_00128191; iTF_01476911; iTF_00181296; iTF_00181954; iTF_01077457; iTF_01477556; iTF_00868789; iTF_00127446; iTF_01523350; iTF_00264987; iTF_00264263; iTF_01245212; iTF_01270492; iTF_01267556; iTF_01271205; iTF_01268263; iTF_01269048; iTF_01269790; iTF_00016029; iTF_00014710; iTF_00385309; iTF_01015805; iTF_00867390; iTF_01261823; iTF_00109819; iTF_00016672;
- 80% Identity
- -