g1285.t1
Basic Information
- Insect
- Drosophila pseudoobscura
- Gene Symbol
- cnc
- Assembly
- GCA_009870125.2
- Location
- CM020868.1:10000478-10012973[+]
Transcription Factor Domain
- TF Family
- TF_bZIP
- Domain
- bZIP domain
- PFAM
- AnimalTFDB
- TF Group
- Basic Domians group
- Description
- bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 1 6.8e-17 6.9e-14 51.8 6.2 3 62 767 826 765 829 0.94
Sequence Information
- Coding Sequence
- ATGAAACAAAGAAAGATCATTCTTGGGAAATGGGAAAGTGAATACATTCGCTTGCCCCTGGATGAGCTGTTGAACGACGTGCTCAAGCTCTCTGAATTCCCCCTCGAAGACGAGTTAACCAACGATCCGGATGCTTCGACATCGCAGGCTGCTGCCGCTTTGAACGAGAACCAATCGCAACGGATCACCTCCGAGACGGGTGAGGATCTGCTGACTGCCGAAGGAGTTGCTGTCCAGCAGCGAAAGATCAATACAGACGGCAGGAGTACCCGCACCAACAGCGGCCAGAGCGACTTCAACGATTCAAGCGACAATTTCCCCCTGTGCGACTTTGACGATCTGCAGAGCTCCGTGGGCTCGCCCCTCTTCGACTTAGATGACGACGCCAAGAAGGAGCTAGACGAGATGTTGCAATCCAAGGCACCGCCCTacccccatccccatgcccatgACCATCCCCACGCCCACGGCCACCCCTACGCGCATCCGCACAGCCACCACcatgccgctgcagccgccgcgCATCACCACCACGCCCATGCCCACCatgccgcagccgcagctgccgCCCACCAGCGGGCCGTGCAGCAGGCCAACGCCAACTATGGCGGCGTGGGCAGTGCCCCGGGCAGCGCCTTCCAGCGCCAGCCATCAGCTGCCGGTGGATTCCATCATGGCCATCATCAGAGCCGCATGCCGCGCCTGAACCACAGCGTGTCGATGGAGCGTCTTCAGGACTTTGCCACTTACTTCAGTCCCATACCCAGCATGGTGGGCGGGGTCTCGGACATGCCGCCCTACCCCCATCATCCGCACTATCCCGGTTACTCGTACCAGAGCAGTCCTTCGAACGGCGCCGCCGGCGGTGGTCCCATCCCTCCAGTGCCCGGACAGCACGGCCAGTACGGCTCCCCGGCCACTGCGGCGCTGCagcccccaccaccaccgccgccaccacaCCATGCGGCCATGTTGCACCACCCGAATGCGGCGCTGGGCGACCTCTgctccgccaccgccgccagcgGACAGCCCCACTACGGGCACAATCTCGGATCGGCCGTCACCTCCAGCATGCACCTGACCAACTCCAGCCACGAGGCGGAGggagccgctgccgcagcggcCGCCGCCGGTAATGCCTACAAGATGGAGCACGACCTGATGTACTACGCGAACACTTCCTCGGACATGAATCAAACGGATGGCTTCATGAACTCGATTTTCACCGATGAGGATTTGCAAATGATGGACATGAATGAGAGCTTCTGTCGCATGGTGgacaacagcaccagcaacaactcCTCGGTCCTGGGCCTGCCCGTCAGCGGACATGTCAGCCACGCGGCCGGCTCCGCACAGCTGGTTCCGGGGAATCACGGCACTGCTAATGGCGGATCATCCGTTGGCGGCGGTGTGGCATCTATGAGCGGTGGAGCTTCGTCGGTTGGCGCTACGGGTGGCATGACCGCCGATATCCTGGCCAGCGGCGGAGGTGCAGCACAGGGCGGAACGGATCGATTGGACGCCAGCAGTGACAGTGCCGTCAGCTCGATGGGCTCCGAGCGGGTGCCGTCCCTCTCCGATGGGGAGTGGGGCGAGGGCAGCGACTCGGCCCAGGACTACCACCAGGGCAAGTACGGCGGCCCCTACGACTTTAGTTACAACCACTCGCGCATCAGCACGGCCACCCGCCAGCCGCCGGTGGCCCAGAAGAAGCACCAGCTCTACGGCAAGCGCGATCCCCATAAGCAGGTGCCCAGCGCCCTGCCGCCCACAGCGCCTCCGGCCACGGCCCACgcgcaggcgcaggcacaGAGCatcaaatacgagtacgatgCCGGATACGCGGGCATGGCCAGCGGAGGAGTCGCCGGACTGCAGCACAGCGAGCCGGGCGCCATGGGGCCTGCCCTCTCCAAGGAGTACCATCAGCAAGCCTACGGCATGGGCGCTAGCAGCAACTTTCCCGGCGACTACGGGGTTCGTCCGCCGCCTCGCACCTCCGAGGACCTGGTGCAGCTGAATCACACGTACTCGCTGCCCCAGGGCAGTGGCTCGCTCCCCAGACCCCAGGCGCGCGACAAGAAGCCTCTGGTGGCCACCAAGAACGCATCGAAGGCGGCTGCCGGCAgctctgctgccgccaccgcgGAGGACGAACATCTGACGCGCGACGAGAAGCGCGCCCGCTCCCTGAACATACCCATCTCGGTGCAGGACATCATCAATCTGCCGATGGACGAGTTCAACGAGCGCCTGTCCAAGTACGACTTGAGCGAGAACCAGCTGTCCCTCATCCGGGACATTCGTCGCCGTGGCAAGAACAAGGTGGCCGCCCAGAACTGCCGCAAGCGCAAGCTGGATCAGATCCTCTCCCTGGAAGATGAGGTGAATGCGGTCGTCAAGCGCAAGACGGAGCTCAGCCAGGCCCGCGCCCACCTGGAGTCCGAGCGCAAGCGCATCTCTAATAAGTTTGCAATGCTCCATCGCCACGTGTTCCAGTACCTGCGCGATCCTGATGGCAATCCCTGCTCGCCGACCGACTACAGTCTGCAGCAGGCGGCCGATGGTTCGGTGTATTTGCTGCCACGCGACAAGTCCGAGGGCAACAGCACGGCCACGAACGCCTCGAACGCAGTGTCCAGCGCTGGAACGAGTAACCTGAACGGGCATGGCCCGGTGGCGCCTCCCATGCACGGAGGCCACCACAGCCAGCACCATCAGCCGCAGCATGTGGACACCGCAATGTCCCAGCACGTGGCCAGGATGCCGCCGCatctgcagcaacagcagcagcagtcgtcgcagcatcaccatcagcagcagccaggaggaggaggcggatcgcaacagcaacagcaccacaaGGAATGA
- Protein Sequence
- MKQRKIILGKWESEYIRLPLDELLNDVLKLSEFPLEDELTNDPDASTSQAAAALNENQSQRITSETGEDLLTAEGVAVQQRKINTDGRSTRTNSGQSDFNDSSDNFPLCDFDDLQSSVGSPLFDLDDDAKKELDEMLQSKAPPYPHPHAHDHPHAHGHPYAHPHSHHHAAAAAAHHHHAHAHHAAAAAAAHQRAVQQANANYGGVGSAPGSAFQRQPSAAGGFHHGHHQSRMPRLNHSVSMERLQDFATYFSPIPSMVGGVSDMPPYPHHPHYPGYSYQSSPSNGAAGGGPIPPVPGQHGQYGSPATAALQPPPPPPPPHHAAMLHHPNAALGDLCSATAASGQPHYGHNLGSAVTSSMHLTNSSHEAEGAAAAAAAAGNAYKMEHDLMYYANTSSDMNQTDGFMNSIFTDEDLQMMDMNESFCRMVDNSTSNNSSVLGLPVSGHVSHAAGSAQLVPGNHGTANGGSSVGGGVASMSGGASSVGATGGMTADILASGGGAAQGGTDRLDASSDSAVSSMGSERVPSLSDGEWGEGSDSAQDYHQGKYGGPYDFSYNHSRISTATRQPPVAQKKHQLYGKRDPHKQVPSALPPTAPPATAHAQAQAQSIKYEYDAGYAGMASGGVAGLQHSEPGAMGPALSKEYHQQAYGMGASSNFPGDYGVRPPPRTSEDLVQLNHTYSLPQGSGSLPRPQARDKKPLVATKNASKAAAGSSAAATAEDEHLTRDEKRARSLNIPISVQDIINLPMDEFNERLSKYDLSENQLSLIRDIRRRGKNKVAAQNCRKRKLDQILSLEDEVNAVVKRKTELSQARAHLESERKRISNKFAMLHRHVFQYLRDPDGNPCSPTDYSLQQAADGSVYLLPRDKSEGNSTATNASNAVSSAGTSNLNGHGPVAPPMHGGHHSQHHQPQHVDTAMSQHVARMPPHLQQQQQQSSQHHHQQQPGGGGGSQQQQHHKE*
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00535807; iTF_00611141; iTF_00474255; iTF_00487052; iTF_00480625; iTF_00473527; iTF_00471354; iTF_00484191; iTF_00517453;
- 90% Identity
- iTF_00535807; iTF_00611141; iTF_00474255; iTF_00487052; iTF_00517453; iTF_00484191; iTF_00471354; iTF_00480625; iTF_00473527;
- 80% Identity
- -