Dneo009454.1
Basic Information
- Insect
- Drosophila neoperkinsi
- Gene Symbol
- cnc
- Assembly
- GCA_037043555.1
- Location
- JBAMBG010005998.1:24054-39010[+]
Transcription Factor Domain
- TF Family
- TF_bZIP
- Domain
- bZIP domain
- PFAM
- AnimalTFDB
- TF Group
- Basic Domians group
- Description
- bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 1 2.1e-16 2.3e-13 49.9 3.2 3 62 748 807 746 810 0.93
Sequence Information
- Coding Sequence
- ATGCTGACACGCCTGAGCACAAGTGTCTATTGGCTTGAGGGTGAATACATACGCTTACCGTTGGATGAGTTGCTCAACGACGTGCTGCAACAATTTCCACTCGAAGACGAGGAGAACGATTCGGTTGCATCCACATcgcaggctgctgctgctgccgctttaATCAGTCAACCGGCGACGCGTATTGTGTCCGAGACTGGTGAGGATTTACCATTTGGATCGGATCCGGATCTTGAGTGCAGCGACAAGGATAACGAGGCCAGTTTCTCGGCGAGTGATTTTGAGGATCTGCAGGATTCGGTCGGTTCGAATCTGTTCGATTTAGATGAGGAGGCCAAAAAGGAATTAGATGAAATGTTGCAATCCACAGCACCGCCATACCATCACGCCCCCCATCCCCATGCCCATCACTCGCACCACCATGCTGCCGCCCACCATCATGCCCACCACCAAGCGGTGGTTGCCCATCAGCGGGCGGTGCAGGCGAGCGCCAACTATGCCAGCATGGGCAGCTCCACCGGCAGCGCCTTTCAGCGTCAGCCGCCAACTTCAGCCGGATTCCATCATCAGgGCCGCATGCAGCGCTTGAATCGTAGCGTCTCCATGGATCTGGCCACCTATTTCAGCCCCATACCGAGCATGGGCGTGGTCGGTGGAGTATCCGATATGCCGCCGTATGCATCCCACTACACCGGCTACTCGTATCAGGGGCCAACTGGCGGTGCTGCTCCAGGAATGCCACCCAGTGCTGCCCAGCAATATGGAcaggctgctgttgcagcaccATCtttgccgccaccgccgccgccccATCACAGTCACGGACACAGTCACGGTCACCATGCGGCAATGTTGCACGCGAATTCCACTTTGGGCGATCTCTGCTCCAGCCAGCCGCACTATGGCCACAATCTCGGCTCTGCGGTCACATCCAGCATGCATTTGACCAACGCCAGCCACGAGGCTGATgcagctgcctctgctgctgccgccgccgctgccgcagcTAGTAGCAACTACAAAATGGAACATGAGATGATGTATTATGCGAACACCTCGTCGGACATGAATCACACGGATGGCCTAATGAATTCCTTTTTTAACGATGAGGATATGCACTTGATGGATATGACGGAAAGTTTCTGTCGCATGGTGGACAACAGCACAAGCAACAATTCGTCGGTGCTGGGTCTGCCCAGCAGTGGACATGTCAGCAACGCGGGCAGCTCATCATTGAATGGGGGGAGTCATGCGAATCCAAATAGTGTAGCTGCTGTTCCCGgtgctattgttgctggtggCATCACATCGATGAgtggtgcaacagcagctgctgctgctggggtcGTTGGTGCCACTGGTGGCATGACCAGCGATCTATTGGccaacgctgctgctggcgctcagggtggtggtggtggtgcacAGGATCGCTTGGACGCGTCCAGTGACAGTGCCGTTAGCTCGATGGGATCCGAGCGTGTTCCATCCCTTTCCGACGGCGAGTGGGGCGAGGGCAGCGATTCGGCACAGGACTATCATCAGGCCAAGTACGTTGGTCCCTATGATTTcagttacaacaacaacaacaacaacaccaacacacgcCAACCGCCCGTGGCACAGAAGAAGCATCAATTGTATGGCAAGAGGGATCTGCACAAACATAACCCAACCGGAGCAACGCAGCAGCCACCAGtggtgcagcaacagcaacagcagcaacaacaacaacaacagcagcagcagcaacaacagcagccgccgccgcaggTGCAACAGAGCATCAAGTATGAGTATGAGGCAGGTGCCGGTGCGGCATTCAGCGTTGGCGAGGCAGCCGCCATGGCGCCCACTCTGGCCAAGGATTATCATCAGGCGTATGGCATGAGTGCGGCGAGTGCATTCACCGCTGACTATGGCATGCCACGCCCCGCGCAGGGCTTGGTGAACCTCAATCACACCTATGCGCTGCCCCAAGGAACGGGCGGAACTCTGCCCCAGGGCGGTGGATCGCTAAGCAGACCGCATCTGCGGGACAAGAAGCTCATTGCAGGCAGCAAACATTCATCGAAATCGGGCGAGGATAATCTCACCGAGGATGAGCATCTGTCGAGGGATGAGAAGCGGGCACGTTCCTTGAACATACCCATTCCGGTGGGCGATATTATCAATCTGCCGATGGATGAGTTCAATGAGCGTCTGTCCAAATATGATCTGAGCGAGAATCAGTTGTCTTTGATCCGTGACATACGCAGACGTGGCAAGAACAAGGTTGCCGCACAGAATTGCCGCAAGCGCAAGCTCGACCAGATCCTGACGCTCGAGGATGAGGTGAATACGGTGGTGAAGCGCAAGTCGCAGTTGAATCACGATCGCGATCATCTCGAGGGCGAACGCAAACGCATCTCCAACAAGTTCTCCATGCTGCATCGCCATGTTTTCCAGTATCTGCGCGATCCCGAAGGCAATCCTTGCTCACCGGCTGATTACAGCTTGCAGCAGGCTGCTGATGGTTCCGTTTACTTGCTGCCACGCGACAAGGTCGACAACggaacgacagcaacagctgctgccagcgccgtttcggccagcagcagcagcgttaatggccaacaacaacagccgccgcagcagcagcagcagcagtcgtcgCTGCACAATCATCAGGGTCATCATCAGGCGGCACAGGCAATGCCgccattgcagcagcagcaacaacaatcgcgCCTGCCGCCACActtgcaccagcagcagcagcagcagcaagcgggTCATCATCAActgttgccacagcagcagcaacagcaacaacagcaggcgccacatcagcatcagcatcacaAGGAATGA
- Protein Sequence
- MLTRLSTSVYWLEGEYIRLPLDELLNDVLQQFPLEDEENDSVASTSQAAAAAALISQPATRIVSETGEDLPFGSDPDLECSDKDNEASFSASDFEDLQDSVGSNLFDLDEEAKKELDEMLQSTAPPYHHAPHPHAHHSHHHAAAHHHAHHQAVVAHQRAVQASANYASMGSSTGSAFQRQPPTSAGFHHQGRMQRLNRSVSMDLATYFSPIPSMGVVGGVSDMPPYASHYTGYSYQGPTGGAAPGMPPSAAQQYGQAAVAAPSLPPPPPPHHSHGHSHGHHAAMLHANSTLGDLCSSQPHYGHNLGSAVTSSMHLTNASHEADAAASAAAAAAAAASSNYKMEHEMMYYANTSSDMNHTDGLMNSFFNDEDMHLMDMTESFCRMVDNSTSNNSSVLGLPSSGHVSNAGSSSLNGGSHANPNSVAAVPGAIVAGGITSMSGATAAAAAGVVGATGGMTSDLLANAAAGAQGGGGGAQDRLDASSDSAVSSMGSERVPSLSDGEWGEGSDSAQDYHQAKYVGPYDFSYNNNNNNTNTRQPPVAQKKHQLYGKRDLHKHNPTGATQQPPVVQQQQQQQQQQQQQQQQQQQPPPQVQQSIKYEYEAGAGAAFSVGEAAAMAPTLAKDYHQAYGMSAASAFTADYGMPRPAQGLVNLNHTYALPQGTGGTLPQGGGSLSRPHLRDKKLIAGSKHSSKSGEDNLTEDEHLSRDEKRARSLNIPIPVGDIINLPMDEFNERLSKYDLSENQLSLIRDIRRRGKNKVAAQNCRKRKLDQILTLEDEVNTVVKRKSQLNHDRDHLEGERKRISNKFSMLHRHVFQYLRDPEGNPCSPADYSLQQAADGSVYLLPRDKVDNGTTATAAASAVSASSSSVNGQQQQPPQQQQQQSSLHNHQGHHQAAQAMPPLQQQQQQSRLPPHLHQQQQQQQAGHHQLLPQQQQQQQQQAPHQHQHHKE
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01326138;
- 90% Identity
- iTF_00499336; iTF_00542185; iTF_00497898; iTF_00570145; iTF_00559971; iTF_00516740; iTF_00548550; iTF_00485608; iTF_00497128; iTF_00566534; iTF_00511018; iTF_00615946; iTF_00501551; iTF_00576813; iTF_00496338; iTF_00558403; iTF_00521094; iTF_00597103; iTF_00619586; iTF_00494855; iTF_00592839; iTF_00527791; iTF_00564400; iTF_00598626; iTF_00552780; iTF_00552053; iTF_00481976; iTF_00609624; iTF_00573790; iTF_00500050; iTF_00582707; iTF_00513206; iTF_00498615; iTF_00576092; iTF_00543488; iTF_00535060;
- 80% Identity
- -