Basic Information

Gene Symbol
cnc
Assembly
GCA_949319315.1
Location
OX439379.1:21225298-21237909[+]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domians group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 8 9.2 9.1e+03 -2.8 2.8 15 26 299 310 290 313 0.84
2 8 9.9 9.8e+03 -2.9 2.7 8 21 338 351 335 354 0.80
3 8 2.3 2.3e+03 -0.9 2.8 8 23 384 399 381 404 0.81
4 8 5.3 5.2e+03 -2.1 0.1 17 38 408 429 407 433 0.75
5 8 7.8 7.7e+03 -2.6 2.4 15 23 437 445 429 461 0.80
6 8 6.8 6.7e+03 -2.4 2.2 15 23 466 474 457 488 0.80
7 8 4.9 4.9e+03 -2.0 1.6 15 23 495 503 486 513 0.81
8 8 3.6e-12 3.5e-09 36.9 7.9 3 61 601 659 599 667 0.77

Sequence Information

Coding Sequence
ATGGATCTCGGAATAGCATCCTACGGGTTGGCCGCCCGCCATTATTACGGTCTGCCACCATACTTGTCCGGGCCCATACTGTATCCGGTCCCGCCCGAGTACTTGAGCTCTTACTACAAACTTTACGATGGGATGTACACGATGCGGATACTGGACGGCGGCGCGCCGCAGCACCCGGGTCCGCATCACACGCACACGCATCACATGATGACCACTGAGCGTGACTCCGCGTCCGACAGCGCCGTGTCGTCAATGGGGTCGGAGCGGGTGCCCTCGCTGTCCGACGGCGAGTGGTGCGACGCCGGCAGCGACTCCGCGCAGGAGTACCACAGCTCTAAGTTCCGTCCATACGACAGTGGCTACGGGCGCAACCGCTCCGCGCCCACGCATCCGCAGAAAAAGCACCATATGTTCGGCAAGCGCTGCTTCCAGGAgcagccgcagccgcagccgcagccgcagccgcagccgcagccgcTGGAGCCGCGCGCGCACAGCGTTATCAAGTACGAGTGCCCGGACCAACCAGCCGCATACCCGCACGATCACATGCATCTGCACAACGTAGAATTCAACGCGcgccaacaggtgcaccacgggGCTGGACTGCGTCCGTCCGCCGACCTGCACCACGCCCCGCCGCCGCTGCAGCCCGCCCCCGACCTGCTGGCTCCACACTCCTCACATGCGCTGCTGCAGAGCGGCGTCCCGAGCCCCGCCCCCCACCCCGGCCGATACACGTACGCGACCCCCGAGCGAGTGCGACACAACCACACTTACAGCGCTCCCGCTCTAGCAGCCGAGCCGCGCGCCGCTCGAGACAAGCGAGGTATGTACACCGAGCGAGTGAGCGAGACCCAACCACACTTACAGCGCTCCCGCTATAGCAGCCGAGCCGCGCGCCGCTCAAGACAAGCGCGCGCTCCCGCTATAGCAGCCGAGCCGCGCGCCGCTCAAGACAAGCGCGGTATGTGCACCAAGCGAGTGAGCGAGACACAACCACACTTACAGCGCTCCTGCTCTAACAGCCGAGCCGCGCGCCGCTCGAGACAAGCGAGCGCTCCTGCTCTAGCAGCCGAGCCGCGCGCCGCTCGAGACAAGCGCGGTATGTGCACCGAGCGAGTGAGCGAGACACAACCACACTTACAGCGCTCCTGCTCTAACAGCCGAGCCGCGCGCCGCTCGAGACAAGCGCGCGCTCCTGCTCTAGCAGCCGAGCCGCGCGCCGCTCGAGACAAGCGAGGTATGTGCACCGAGCGATTGAGCGAGACACAACCACACTTACAGCGCTCCCGCTATAGCAGCCGAGCCGCGCGCCGCTCGAGACAAGCGAGGTGCACCGAGCGAGTGAGCGAGACACAACCACACTTACAGCGCTCCCGCTATAGCAGCCGAGCCGCGCGCCGCTCGAGACAAGCGAGGTGCACCGAGCGAGTGAGCGAGACACAACCACACTTACAGCGCTCCCGCTATAGCAGCCGAGCCGCGCGCCGCTCGAGACAAGCGCGGTGCACCGAGCGAGTGAGCGAGACACAACCACACTTACAGCGCTCCTGCTCTAGCAGCCGAGCCGCGCGCCGCTCGAGACAAGCGCGGTATGTGCACCGAGCGATGCGCAGACTGACCGATGGCAGCATGTCGGACGCGGGGTCGACGACCAGCGGACACATGTCGCGCGACGAGAAGAGGGCCAAGGCGCTAGGAATCCCGATGGAGGTCCACGACATAATCAACCTTCCGATGGATGAGTTCAACGAGAGGCTCTCCAAGCACGACCTCAGCGAGGCACAGCTGTCCCTAATAAGGGACATCCGGCGTCGGGGCAAGAACAAGGTGGCAGCGCAGAATTGCCGCAAGCGCAAGCTGGACCAGATCACGTCCCTTGCCGACGAAGTGCGTACCGTGCGTGATCGCAAGCAACGCTCCGTGCGCGACCGACACCAGCTCATGGCCGAGCGACAGCGCGTCAAGGAACGCTTCGCAGCGCTCTACAGACACGTGTTCCAGAACCTTCGCGACCCGGAGGGCCGGCCGCTGTCCTCCAGCCAGTACTCGCTGCAGCAAGCCGCGGATGGTAACGTCGTGTTGGTCCCCAAGATGCCCCACCACCCCGACCAGTCGATGAACCGCACCGACGACGAGCTCGAACGCAAGGCGAAGAGCTACGACCAGTAG
Protein Sequence
MDLGIASYGLAARHYYGLPPYLSGPILYPVPPEYLSSYYKLYDGMYTMRILDGGAPQHPGPHHTHTHHMMTTERDSASDSAVSSMGSERVPSLSDGEWCDAGSDSAQEYHSSKFRPYDSGYGRNRSAPTHPQKKHHMFGKRCFQEQPQPQPQPQPQPQPLEPRAHSVIKYECPDQPAAYPHDHMHLHNVEFNARQQVHHGAGLRPSADLHHAPPPLQPAPDLLAPHSSHALLQSGVPSPAPHPGRYTYATPERVRHNHTYSAPALAAEPRAARDKRGMYTERVSETQPHLQRSRYSSRAARRSRQARAPAIAAEPRAAQDKRGMCTKRVSETQPHLQRSCSNSRAARRSRQASAPALAAEPRAARDKRGMCTERVSETQPHLQRSCSNSRAARRSRQARAPALAAEPRAARDKRGMCTERLSETQPHLQRSRYSSRAARRSRQARCTERVSETQPHLQRSRYSSRAARRSRQARCTERVSETQPHLQRSRYSSRAARRSRQARCTERVSETQPHLQRSCSSSRAARRSRQARYVHRAMRRLTDGSMSDAGSTTSGHMSRDEKRAKALGIPMEVHDIINLPMDEFNERLSKHDLSEAQLSLIRDIRRRGKNKVAAQNCRKRKLDQITSLADEVRTVRDRKQRSVRDRHQLMAERQRVKERFAALYRHVFQNLRDPEGRPLSSSQYSLQQAADGNVVLVPKMPHHPDQSMNRTDDELERKAKSYDQ

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-