Basic Information

Gene Symbol
cnc
Assembly
GCA_036785405.1
Location
CM072081.1:5582125-5615949[+]
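
The Location value packs the contig, coordinates, and strand into one string. The following is a minimal sketch of how it could be split into fields. The names `Locus` and `parse_locus` are hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "contig:start-end[strand]".
LOCUS_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_locus(text: str) -> Locus:
    """Split a location string into contig, start, end, and strand."""
    match = LOCUS_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Locus(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


locus = parse_locus("CM072081.1:5582125-5615949[+]")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = locus.end - locus.start + 1
print(locus, span)
```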

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domains group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA-binding regions adjacent to α-helical regions that fold together as coiled coils.
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1  8   1.8e-12   2.1e-09   38.1   8.6   3         64      413       474     411       479     0.91
2  8   0.9       1.1e+03   0.6    0.1   29        60      479       510     473       514     0.83
3  8   1.5       1.8e+03   -0.1   0.0   33        60      530       557     524       563     0.78
4  8   1.6       1.9e+03   -0.2   0.1   33        60      577       604     571       608     0.78
5  8   1.6       1.9e+03   -0.2   0.1   33        60      624       651     618       655     0.78
6  8   1.6       1.9e+03   -0.2   0.1   33        60      671       698     665       702     0.78
7  8   1.6       1.9e+03   -0.2   0.1   33        60      718       745     712       749     0.78
8  8   1.7       2e+03     -0.3   0.0   33        60      765       792     759       795     0.78
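
Only the first of the eight domain rows has a small independent E-value; the rest score near zero. As a rough sketch of how such rows might be filtered, the snippet below parses the values copied from the table and keeps hits under an i-Evalue cutoff. The 1e-3 cutoff and the names `DomainHit`, `parse_hits`, and `significant` are illustrative assumptions, not part of the record or of HMMER.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    index: int
    i_evalue: float
    score: float
    ali_from: int
    ali_to: int


# Rows copied from the Hmmscan Out table (13 whitespace-separated fields each).
ROWS = """\
1 8 1.8e-12 2.1e-09 38.1 8.6 3 64 413 474 411 479 0.91
2 8 0.9 1.1e+03 0.6 0.1 29 60 479 510 473 514 0.83
3 8 1.5 1.8e+03 -0.1 0.0 33 60 530 557 524 563 0.78
4 8 1.6 1.9e+03 -0.2 0.1 33 60 577 604 571 608 0.78
5 8 1.6 1.9e+03 -0.2 0.1 33 60 624 651 618 655 0.78
6 8 1.6 1.9e+03 -0.2 0.1 33 60 671 698 665 702 0.78
7 8 1.6 1.9e+03 -0.2 0.1 33 60 718 745 712 749 0.78
8 8 1.7 2e+03 -0.3 0.0 33 60 765 792 759 795 0.78
"""


def parse_hits(text: str) -> List[DomainHit]:
    """Parse domain rows; fields 4, 5, 9, 10 are i-Evalue, score, ali from, ali to."""
    hits = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 13:
            continue  # skip blank or malformed lines
        hits.append(
            DomainHit(
                index=int(fields[0]),
                i_evalue=float(fields[3]),
                score=float(fields[4]),
                ali_from=int(fields[8]),
                ali_to=int(fields[9]),
            )
        )
    return hits


def significant(hits: List[DomainHit], max_i_evalue: float = 1e-3) -> List[DomainHit]:
    """Keep hits at or below the cutoff (the 1e-3 default is an assumed value)."""
    return [hit for hit in hits if hit.i_evalue <= max_i_evalue]


hits = parse_hits(ROWS)
best = significant(hits)
for hit in best:
    print(hit.index, hit.i_evalue, hit.ali_from, hit.ali_to)
```

With this cutoff only domain 1 (alignment positions 413 to 474) is retained; a different threshold would be a judgment call for the analyst.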

Sequence Information

Coding Sequence
ATGTTCCAGTCGCTGATGCGATCGATGTCGGTGGAGCAGCGGTGGCAGGACCTGGCGTCGCTGCTGACCATCCCTCCCCCGCCCGAGCAGTACCAGCACTACCACCAGCACCCGCACGCGCACCCGCacccgcacgcgcacgcgcacccGCACAACATCAGCGGCCACGGCGCGGCCGGCTACGCGCCCAACTACCACGCCCCCATAGCCGCTCCCGTGCCCGAGAAACACCATGAGCCCTACGGTGCGGCCGCGCCACTGGAGGGCGCTTACAAGGTGGAGTCAGCCCACCACCCTCAGCACCACGACACGCTGTATTACCAGAACTCCACCAGCGAGATGGCTCCACCGAACCAGGACGGGTTCCTGCAGTCCATCCTCAACGACGAGGACCTGCAGCTCATGGACATGGCGATGAACGAGGGCATGTACACGATGCGCATGCTGGACGGCGCGTCGTCCGCGCACGTGGCTGCGCACACGCACCCGCACACCACGCACATGCCCATCACCACCGAGCGAGACTCTGCGTCAGACAGCGCGGTGTCTTCGATGGGCTCGGAGCGGGTTCCGTCGCTCTCAGATGGAGAATGGTGCGACGGCAGCGATTCCGCTCAGGAGTTCCACAGCTCCAAGTTCCGCCCATACGACAGCGGGTTCGTGCGCGAGCGCGCGTCGCACGCGCCGCAGAAGAAGCACCACATGTTCGGGAAACGATGCTTTCAGGAGCAAGCAGCCGCGCCGGTGGAGCCGCTGGCGGCGACGCGCGCACCCGGCGTCATCAAGTACGAGTGCGAGCAGCCCTACCACGACGCCATGCACATGCACAACGTGGAGTACAACTCTCGCGCGCAATTGCCGCAGCCGCACGTGCCGGTGCTGCAGCCGGCGCTGGACATCAGCGCACCGCACTCCAGCCACGCGTTGCTGCAGAGCACAGTGCCGAGCCCCGGCCCTCGGTTCGGCTTCGCGTCGGGCGACCGCGTGCGACACAACCACACGTACAGCGCGCCGCTGCCCGTCGAGCGCCCGCCGACGAGGGACAAGAGAGTGCGTCGGCTGACGGACGGCAGTGCGTCGGACAGCGGCTGCGGCGGTTCGCATCTCACGAGGGACGAGAAGCGAGCCAAGGCACTCGGAATTCCTCTGGAGGTTCAAGACATTATCAATCTACCGATGGACGAGTTCAACGAGCGGTTGTCGAAACACGACCTCAGTGAAGCACAGCTGTCCCTTATAAGGGACATACGACGGCGGGGGAAGAACAAGgtGGCAGCCCAGAACTGCCGCAAGCGGAAGCTGGACCAGATAACGTCCCTGGCGGACGAAGTGCGCACGGTGCGTGACCGCAaggcgcgcacgcagcgcgacCACCACTCGCTGCTGGCCGAGCGGCAGCGCGTCAAGGAGCGGTTCGCTGCGCTCTACCGGCACGTGTTCCAGGACCGCAAGGCGCACACGCAGCGCGACCACCACTCGCTGCTGGCCGAGCGGCAGCGCGTCAAGGAGCGGTTCGCTGCGCTCTACCGGCACGTGTTCCAGATTATACACTCATGTCATACACTCGAAGTGCGCACGGTGCGCGACCGCAaggcgcgcacgcagcgcgacCACCACTCGCTGCTGGCCGAGCGGCAGCGCGTCAAGGAGCGGTTCGCTGCGCTCTACCGGCACGTGTTCCAGACTATACACTCGTGTCATACACTCGAAGTGCGCACGGTGCGGGACCGCAaggcgcgcacgcagcgcgacCACCACTCGCTGCTGGCCGAGCGACAGCGCGTGAAGGAGCGGTTCGCTGCGCTCTACCGGCACGTGTTCCAGATTATACACTCATGTCATACACTCGAAGTGCGCACGGTGCGCGACCGCAaggcgcgcacgcagcgcgacCACCACTCGCTGCTGGCCGAGCGGCAGCGCGTCAAGGAGCGGTTCGCTGCGCTCTACCGGCACGTGTTCCAGATTATACACTCATGTCATACACTCGAAGTGCGCAC
GGTGCGCGACCGCAaggcgcgcacgcagcgcgacCACCACTCGCTGCTGGCCGAGCGGCAGCGCGTCAAGGAGCGGTTCGCTGCGCTCTACCGGCACGTGTTCCAGATTATACACTCATGTCATACACTCGAAGTGCGCACGGTGCGCGACCGCAaggcgcgcacgcagcgcgacCACCACTCGCTGCTGGCCGAGCGGCAGCGCGTCAAGGAGCGGTTCGCTGCGCTCTACCGGCACGTGTTCCAGATTATACACTCATGTCATACACTCGAAGTGCGCACGGTGCGCGACCGCAaggcgcgcacgcagcgcgacCACCACTCGCTGCTGGCCGAGCGGCAGCGCGTCAAGGAGCGGTTCGCTGCGCTCTACCGGCACGTGTTCCAGAACCTGCGCGACCAGGAGGGTCGGCCGCTGTCGTCGAACCAGTACTCGCTGCAGCAGGCGGCCGACGGCAACGTGGTGCTGGTGCCCAAGATGCAGCACAACGATCACCCCATGAACCGCAGCTCCGACGACGACCTGGACCGCAAGGCGAAGAACTACGAGCAGTGA
Protein Sequence
MFQSLMRSMSVEQRWQDLASLLTIPPPPEQYQHYHQHPHAHPHPHAHAHPHNISGHGAAGYAPNYHAPIAAPVPEKHHEPYGAAAPLEGAYKVESAHHPQHHDTLYYQNSTSEMAPPNQDGFLQSILNDEDLQLMDMAMNEGMYTMRMLDGASSAHVAAHTHPHTTHMPITTERDSASDSAVSSMGSERVPSLSDGEWCDGSDSAQEFHSSKFRPYDSGFVRERASHAPQKKHHMFGKRCFQEQAAAPVEPLAATRAPGVIKYECEQPYHDAMHMHNVEYNSRAQLPQPHVPVLQPALDISAPHSSHALLQSTVPSPGPRFGFASGDRVRHNHTYSAPLPVERPPTRDKRVRRLTDGSASDSGCGGSHLTRDEKRAKALGIPLEVQDIINLPMDEFNERLSKHDLSEAQLSLIRDIRRRGKNKVAAQNCRKRKLDQITSLADEVRTVRDRKARTQRDHHSLLAERQRVKERFAALYRHVFQDRKAHTQRDHHSLLAERQRVKERFAALYRHVFQIIHSCHTLEVRTVRDRKARTQRDHHSLLAERQRVKERFAALYRHVFQTIHSCHTLEVRTVRDRKARTQRDHHSLLAERQRVKERFAALYRHVFQIIHSCHTLEVRTVRDRKARTQRDHHSLLAERQRVKERFAALYRHVFQIIHSCHTLEVRTVRDRKARTQRDHHSLLAERQRVKERFAALYRHVFQIIHSCHTLEVRTVRDRKARTQRDHHSLLAERQRVKERFAALYRHVFQIIHSCHTLEVRTVRDRKARTQRDHHSLLAERQRVKERFAALYRHVFQNLRDQEGRPLSSNQYSLQQAADGNVVLVPKMQHNDHPMNRSSDDDLDRKAKNYEQ
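
The protein sequence appears to be a direct translation of the coding sequence. The sketch below illustrates this on the first and last few codons only, assuming the standard genetic code (NCBI translation table 1), which the record does not state explicitly. The function name `translate` is hypothetical, and lowercase bases are simply uppercased before lookup.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (NCBI translation table 1).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    seq = cds.upper()  # lowercase bases in the record are treated like uppercase
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First eight codons and last five codons of the Coding Sequence above.
head = translate("ATGTTCCAGTCGCTGATGCGATCG")
tail = translate("AACTACGAGCAGTGA")
print(head, tail)
```

The two fragments match the start (`MFQSLMRS`) and end (`NYEQ`) of the listed protein sequence; the same function could be applied to the full coding sequence once its two lines are joined.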

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-