Spol001427.1
Basic Information
- Insect
- Scaptomyza polygonia
- Gene Symbol
- cnc
- Assembly
- GCA_035044585.1
- Location
- JAWNNU010000151.1:6515857-6531229[+]
Transcription Factor Domain
- TF Family
- TF_bZIP
- Domain
- bZIP domain
- PFAM
- AnimalTFDB
- TF Group
- Basic Domians group
- Description
- bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 1 2.8e-17 4.1e-14 52.9 5.6 3 62 762 821 760 824 0.94
Sequence Information
- Coding Sequence
- ATGAACAAAGGCAGcgtcgcagcagcagccacagcattGTCTGCTTTGCACTCACACATGCACAGTGAATACATACGCTTACCTTTGGATGAGCTGCTCAACGACGTGCTGCAACAATTTCCACtcgaagacgacgacgacgaaTTAGCCAACGATTCGGTTGCATCCACATCACaggctgctgcagctgccgctTTAAATAGCCAACCAGCAACGCGTATTGTTTCCGAAACTGGTGAGGATTTGTCATTCATTTCCGATATCGATCTTGAGTGCAGCGACAAGGATAACGAGGCGAGTTTTTCGGCAAGCGATTTTGAGGATCTGCAAGATTCGGTCGATTCGAATCTGTTCGATTTAGATGAGGAGGCCAAAAAAGAATTAGATGAAATGTTGCAATCCACAGCACCACCATATCACCATCACGCTCCCCATCCCCATGCCCATCATTCGCACCACCATGCCGCCGCCCACCACCACGCTcatcatcaagcggtggtcgcGCATCAGCGTGCGGTGCATGCCAGCGCCAACTATGCCACCATGGGCAGCTCAACGGGCAGCGCGTTCCAACGTCAGCCCCCAACTTCAGCCGGATTCCATCATGGTCATCATCAGGGCCGCATGCAGCGTTTGAATCGTGGCGTCTCCATGGATCTGGCCACCTATTTTAGCCCCATACCGAGCATGGGCGTCACCGACATGCCCCCGTATCCACCCCACTATACTGGGTATTCGTATCAGGGTCCGGTTGGTGCTGCCGGTCCGGGAATGCCACCCAGTGCCCAACAATATGGACAGGCAACGGTTGCACCACCAACATCATTgccgccaccaccgccgccgcatCACAGTCACGGTCATGGTCACAGTCACGGTCACCATGCGGCAATGTTGCATGCAAATTCGACATTGGGCGATCTAGGCTCGAGTCAACCTCACTATGGCCACAATCTGGGATCAGCGGTCTCGTCCAGCATGCATTTGACCAATTCCAGCCATGAGTCTGATGGCGCTGCtgcgtctgctgctgctgccgccgctgctgctgctgctgctgcaagcgCCAACTACAAAATGGAACATGAGATGATGTATTATGCGAACACCTCTTCGGACATGAATCACACGGATGGCTTAATGAATTCCTTTTTCAACGATGAGGATCTGCATTTGATGGATATGACAGAAAGTTTCTGTCGCATGGTGGATAATAGCACAAGCAACAACTCTTCAGTTTTGGGCTTGCCCAGCAGTGGACACGTCAGCAATGCTGGCAGCTCAACTTTGAATGTTGGCAATCATggaaatggcaatggcaatggtgTAGCTGCTGTATCGGGCGCTGTACCGGTTGGCATCACATCGATGAGTGGTGCAGCAGCGGCTGCTGTCACTGGAGCGACTGGTGGCATGACCAGCGATCTATTGGCCAACAGTGGTGCTGGCGCTCAGGGTGGCGCACAGGATCGTTTGGACGCGTCCAGTGACAGTGCGGTTAGTTCGATGGGCTCCGAACGTGTGCCATCGCTGTCCGATGGCGAATGGGGTGAGGGTAGCGATTCGGCACAGGATTATCATCAGGCCAAATATGTGGGACCATATGATTTTagttacaacaacagcaacaacaacaatagtgtCAATCGTCAACCGCCTGTGGCACAAAAGAAACATCAGCTCTATGGCAAAAGGGATCTGCACAAACAGACGCCGAGTGGTGCAGCCCAACAAACACCAGtggtgcaacaacaacaacaacaacagcagcagcagcaacaacagcaggcaGCTCATTTGCAACAAAGCATCAAATATGAATACGAAACAAATGCGGCAGCAGAGTTTATTGTTAATGAAGCAGCCGGATTGGCGCCAGCCCATCAGGCCAAGGATTATCATCAGGCGTATGGCATGAGTGCGGCCAGTGCATTCACCGCCGATTATGGCATGCCGCCACGCCCAAACCCACTAGCGCACCAGGGCATTATTCACCTCAATCACACCTATTCCCTGCCCGAGGGCACAGGCGGAGCTCTGCCCCAAGGCAGTGGATCACAAAGCAGGCCGCATCCACGCGACAAGAAGCTGTCCACAGGCAGCAAACATGGCTCGAAATCCGGTGACGACAATCTAACCGAGGACGAGCATCTGTCCAGGGATGAGAAGCGTGCACGCGCTTTGAATATCCCAATTCCTGTTTTGGATATCATCAATCTGCCCATGGATGAGTTCAACGAACGTTTGTCCAAATACGATCTGAGCGAGAATCAATTGTCGCTGATTCGTGACATTCGAAGACGTGGCAAGAACAAGGTTGCCGCACAAAATTGTCGCAAGCGCAAGCTGGATCAAATACTAACCCTGGAGGACGAGGTAAATACGGTGGTGAAGCGTAAGGCGCATCTTAATAACGAACGTGATCATCTCGAAAGCGAACGCAAGCGCATCTCCAATAAGTTCTCCTTGCTGCATCGTCACGTGTTTCAGtATCTTCGCGATCCTGAAGGCAATCCTTGCTCGCCAGCTGATTTTAGCTTGCAGCAGGCAGCCGATGGTTCGGTTTATTTGCTGCCACGTGACAAGAACGATAATGGCacgacagcaacagctgctggcAGCGCCGTTTCGGGCAGCAgtcccagcagcagcagcagcagcatgaatggtcaacaacagcagcagactccgctgcagcagctgcaacaacataTGCAACAACATCAAGCGCGACTGCCGCCAcacttgcaacagcagcaacaagcaacgCATAATCAACTgttgccacaacaacagcagcagcagcaacaacaacaacaacagcagcagcagcagcagcaacaggcgcCGCATCAGCATCACAAGGAATGA
- Protein Sequence
- MNKGSVAAAATALSALHSHMHSEYIRLPLDELLNDVLQQFPLEDDDDELANDSVASTSQAAAAAALNSQPATRIVSETGEDLSFISDIDLECSDKDNEASFSASDFEDLQDSVDSNLFDLDEEAKKELDEMLQSTAPPYHHHAPHPHAHHSHHHAAAHHHAHHQAVVAHQRAVHASANYATMGSSTGSAFQRQPPTSAGFHHGHHQGRMQRLNRGVSMDLATYFSPIPSMGVTDMPPYPPHYTGYSYQGPVGAAGPGMPPSAQQYGQATVAPPTSLPPPPPPHHSHGHGHSHGHHAAMLHANSTLGDLGSSQPHYGHNLGSAVSSSMHLTNSSHESDGAAASAAAAAAAAAAAASANYKMEHEMMYYANTSSDMNHTDGLMNSFFNDEDLHLMDMTESFCRMVDNSTSNNSSVLGLPSSGHVSNAGSSTLNVGNHGNGNGNGVAAVSGAVPVGITSMSGAAAAAVTGATGGMTSDLLANSGAGAQGGAQDRLDASSDSAVSSMGSERVPSLSDGEWGEGSDSAQDYHQAKYVGPYDFSYNNSNNNNSVNRQPPVAQKKHQLYGKRDLHKQTPSGAAQQTPVVQQQQQQQQQQQQQQAAHLQQSIKYEYETNAAAEFIVNEAAGLAPAHQAKDYHQAYGMSAASAFTADYGMPPRPNPLAHQGIIHLNHTYSLPEGTGGALPQGSGSQSRPHPRDKKLSTGSKHGSKSGDDNLTEDEHLSRDEKRARALNIPIPVLDIINLPMDEFNERLSKYDLSENQLSLIRDIRRRGKNKVAAQNCRKRKLDQILTLEDEVNTVVKRKAHLNNERDHLESERKRISNKFSLLHRHVFQYLRDPEGNPCSPADFSLQQAADGSVYLLPRDKNDNGTTATAAGSAVSGSSPSSSSSSMNGQQQQQTPLQQLQQHMQQHQARLPPHLQQQQQATHNQLLPQQQQQQQQQQQQQQQQQQQAPHQHHKE
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01325409; iTF_01324646; iTF_01323856; iTF_00496338; iTF_00558403; iTF_00521094; iTF_00597103; iTF_00570145; iTF_00619586; iTF_00497128; iTF_00559971; iTF_00516740; iTF_00548550; iTF_00556966; iTF_00566534; iTF_00494855; iTF_00592839; iTF_00485608; iTF_00499336; iTF_00542185; iTF_00497898; iTF_00511018; iTF_00615946; iTF_00552780; iTF_00598626; iTF_00576813; iTF_00501551; iTF_00527791; iTF_00564400; iTF_00500050; iTF_00582707; iTF_00535060; iTF_00552053; iTF_00513206; iTF_00498615; iTF_00481976; iTF_00609624; iTF_00576092; iTF_00573790; iTF_00543488; iTF_01327635; iTF_01321661; iTF_01320939; iTF_01320218; iTF_01326911;
- 90% Identity
- iTF_01321661;
- 80% Identity
- -