Basic Information

Gene Symbol
cnc
Assembly
GCA_947623375.1
Location
OX392523.1:9335289-9356641[+]
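
The Location field packs the sequence accession, start, end, and strand into one string. A minimal sketch of splitting it apart is shown here; the `Locus` type and the `parse_location` helper are hypothetical names introduced for illustration, and the span calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """Hypothetical container for one Location field."""
    seqid: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Locus:
    """Parse a string of the form 'SEQID:START-END[STRAND]'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)-(\d+)\[([+-])\]", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Locus(seqid, int(start), int(end), strand)


locus = parse_location("OX392523.1:9335289-9356641[+]")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = locus.end - locus.start + 1
print(locus, span)
```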

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domains group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA-binding regions adjacent to α-helical regions that fold together as coiled coils.
Hmmscan Out
 #  of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to   acc
 1  12   5.6e-13   8.6e-10   39.5   8.1         3      64       312     373       310     374  0.93
 2  12       0.5   7.7e+02    1.2   0.1        25      54       399     428       395     437  0.60
 3  12       0.5   7.7e+02    1.2   0.1        25      54       450     479       446     488  0.60
 4  12       0.5   7.7e+02    1.2   0.1        25      54       501     530       497     539  0.60
 5  12       0.5   7.7e+02    1.2   0.1        25      54       552     581       548     590  0.60
 6  12       0.5   7.7e+02    1.2   0.1        25      54       603     632       599     641  0.60
 7  12       0.5   7.7e+02    1.2   0.1        25      54       654     683       650     692  0.60
 8  12       0.5   7.7e+02    1.2   0.1        25      54       705     734       701     743  0.60
 9  12       0.5   7.7e+02    1.2   0.1        25      54       756     785       752     794  0.60
10  12       0.5   7.7e+02    1.2   0.1        25      54       807     836       803     845  0.60
11  12       0.5   7.7e+02    1.2   0.1        25      54       858     887       854     896  0.60
12  12      0.76   1.2e+03    0.6   0.1        26      59       910     936       904     941  0.58
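
Each Hmmscan Out row carries thirteen whitespace-separated values, so the rows can be read mechanically and filtered by significance. The sketch below is illustrative only: the `DomainHit` field names, the `parse_row` helper, and the 1e-3 i-Evalue cutoff are assumptions made here, not settings documented by this record. Three rows are copied from the table above.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """Hypothetical field names for the 13 values of one domain row."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# Converter for each column, in row order.
_TYPES = (int, int, float, float, float, float,
          int, int, int, int, int, int, float)


def parse_row(line: str) -> DomainHit:
    """Convert one whitespace-separated domain row into a DomainHit."""
    fields = line.split()
    if len(fields) != len(_TYPES):
        raise ValueError(f"expected {len(_TYPES)} fields, got {len(fields)}")
    return DomainHit(*(convert(value) for convert, value in zip(_TYPES, fields)))


rows = [
    "1 12 5.6e-13 8.6e-10 39.5 8.1 3 64 312 373 310 374 0.93",
    "2 12 0.5 7.7e+02 1.2 0.1 25 54 399 428 395 437 0.60",
    "12 12 0.76 1.2e+03 0.6 0.1 26 59 910 936 904 941 0.58",
]
hits = [parse_row(row) for row in rows]

# Assumed cutoff: keep domains whose independent E-value is below 1e-3.
confident = [hit for hit in hits if hit.i_evalue < 1e-3]
for hit in confident:
    length = hit.ali_to - hit.ali_from + 1
    print(f"domain {hit.index}: residues {hit.ali_from}-{hit.ali_to} ({length} aa)")
```

Under that assumed cutoff only the first domain is retained; the remaining rows have i-Evalues in the hundreds or higher.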

Sequence Information

Coding Sequence
ATGAATCTTTCGGTGTCGCCTTTCGCCTACGGCGCGCCGTACTTGCCGCCACTGTTGACGAGTCACCTACTGTACCCCGACCCCGCGGAATACCTGAGTTCCTACTACAAATTATATGATGGCATGTACACGATGCGGATGCTGGACGGGGCCGCTGGCGGTCACCACGCGCCGCACAACCACTCGCACATGATGATTGCTGAGCGTGACTCGGCGTCGGACAGCGCCGTTTCGTCCATGGGATCTGAGCGCGTGCCGTCTCTGTCTGACGGAGAATGGTGCGACGGCAGCGACTCGGCGCAGGAGTTCCACAGTTCAAAATTTCGTCCCTACGACGGATCGTACGGCCGCGAGCGAGCCCCGCACCAGCCCCAAAAGAAACACCACATGTTCGGAAAGCGCTGCTTCCAGGAACAGAACCAGCCGGCGCCGTCGCTGGAGACGCTGACGCCTCCACGGCCGGTCGTCAAGTACGAGTGCCCCGAGCAGGCCTACCCGCATGAACCCATGCACATGCACAACGTGGAGTTCGGCGCACGGCAGCAATTGCACGCGCCCGCGCCGCCGCTCGACCTCAACACCGCGCACTCCAGCCACGCTCTACTACAGAATGGCCTAGCCGGTAGCGCAGCTCGCTTCGCATACGCGACGCCAGAGCGCGTGCGCCACAACCACACCTACAGTGCGCCTGCGCAGGCGCCGGAGCGGCCTGCTGCCGTGCGCGACAAGAGAGTTCGACGGTTGACGGATGGCAGTATATCGGACGGCGGGTCGACGACGAGCGCTGGACACCTGTCGCGCGACGAGAAACGCGCCAAAGCATTAGTGGTCGCAGGCATCCCCATGGAAGTGCACGACATCATCAACCTGCCGATGGACGAGTTCAACGAGCGGCTCTCCAAGCACGACCTCAGCGAGGCGCAGCTCTCGCTCATCCGCGACATCCGGCGCCGCGGCAAGAACAAGGTTGCAGCGCAGAACTGCCGCAAGCGCAAGCTGGACCAGATCACGTCGTTGGCGGACGAGGTGCGCACGGTGCGCGACCGCAAGCAGCGCACGCAGCGCGACCACCACACGCTCACCGCCGAGCGGCAGCGCGTCAAGGAGCGCTTCGCCGCGCTCTACCGACACGTGTTCCAGGTGAGACACCCGCTCACGTCACAGTCGCCGGCGGACGAGGTGCGCACGGTGCGCGACCGCAAGCAGCGCACGCAGCGCGACCACCACACGCTCACCGCCGAGCGGCAGCGCGTCAAGGAGCGCTTCGCCGCGCTCTACCGACACGTGTTCCAGGTGAGACACCCGCTCACGTCACAGTCGCCGGCGGACGAGGTGCGCACGGTGCGCGACCGCAAGCAGCGCACGCAGCGCGACCACCACACGCTCACCGCCGAGCGGCAGCGCGTCAAGGAGCGCTTCGCCGCGCTCTACCGACACGTGTTCCAGGTGAGACACCCGCTCACGTCACAGTCGCCGGCGGACGAGGTGCGCACGGTGCGCGACCGCAAGCAGCGCACGCAGCGCGACCACCACACGCTCACCGCCGAGCGGCAGCGCGTCAAGGAGCGCTTCGCCGCGCTCTACCGACACGTGTTCCAGGTGAGACACCCGCTCACGTCACAGTCGCCGGCGGACGAGGTGCGCACGGTGCGCGACCGCAAGCAGCGCACGCAGCGCGACCACCACACGCTCACCGCCGAGCGGCAGCGCGTCAAGGAGCGCTTCGCCGCGCTCTACCGACACGTGTTCCAGGTGAGACACCCGCTCACGTCACAGTCGCCGGCGGACGAGGTGCGCACGGTGCGCGACCGCAAGCAGCGCACGCAGCGCGACCACCACACGCTCACCGCCGAGCGGCAGCGCGTCAAGGAGCGCTTCGCCGCGCTCTACCGACACGTGTTCCAGGTGAGACACCCGCTCACGTCACAGTCGCCGGCGGACGAGGTGCGCACGGTGCGCGACCGCAAGCAGCGCACGCAGCGCGACCACCACACGCTCAC
CGCCGAGCGGCAGCGCGTCAAGGAGCGCTTCGCCGCGCTCTACCGACACGTGTTCCAGGTGAGACACCCGCTCACGTCACAGTCGCCGGCGGACGAGGTGCGCACGGTGCGCGACCGCAAGCAGCGCACGCAGCGCGACCACCACACGCTCACCGCCGAGCGGCAGCGCGTCAAGGAGCGCTTCGCCGCGCTCTACCGACACGTGTTCCAGGTGAGACACCCGCTCACGTCACAGTCGCCGGCGGACGAGGTGCGCACGGTGCGCGACCGCAAGCAGCGCACGCAGCGCGACCACCACACGCTCACCGCCGAGCGGCAGCGCGTCAAGGAGCGCTTCGCCGCGCTCTACCGACACGTGTTCCAGGTGAGACACCCGCTCACGTCACAGTCGCCGGCGGACGAGGTGCGCACGGTGCGCGACCGCAAGCAGCGCACGCAGCGCGACCACCACACGCTCACCGCCGAGCGGCAGCGCGTCAAGGAGCGCTTCGCCGCGCTCTACCGACACGTGTTCCAGGTGAGACACCCGCTCACGTCACAGTCGCCGGCGGACGAGGTGCGCACGGTGCGCGACCGCAAGCAGCGCACGCAGCGCGACCACCACACGCTCACCGCCGAGCGGCAGCGCGTCAAGGAGCGCTTCGCCGCGCTCTACCGACACGTGTTCCAGGTGAGACACCCGCTCACGTCACAGTCGCCGGCGGACGAGGTGCGCACGGTGCGCGACCGCAAGCAGCGCACGCAGCGCGACCACCACACGCTCACCGCCGAGCGGCAGCGCGTCAAGGAGCGCTTCGCCGCGCTCTACCGACACGTGTTCCAGAACCTGCGCGACGCGTCGGGTCGGCCGCTGTCGTCGGCGCAGTACTCGCTGCAGCAGGCCGCCGACGGGAATGTCGTGCTCGTGCCCAAGATGAACCAGCACCCTGACCACCCAATGAACCGCACCGATGAGGACATAGACCGGAAAACCAAAAACTACGAACAGTGA
Protein Sequence
MNLSVSPFAYGAPYLPPLLTSHLLYPDPAEYLSSYYKLYDGMYTMRMLDGAAGGHHAPHNHSHMMIAERDSASDSAVSSMGSERVPSLSDGEWCDGSDSAQEFHSSKFRPYDGSYGRERAPHQPQKKHHMFGKRCFQEQNQPAPSLETLTPPRPVVKYECPEQAYPHEPMHMHNVEFGARQQLHAPAPPLDLNTAHSSHALLQNGLAGSAARFAYATPERVRHNHTYSAPAQAPERPAAVRDKRVRRLTDGSISDGGSTTSAGHLSRDEKRAKALVVAGIPMEVHDIINLPMDEFNERLSKHDLSEAQLSLIRDIRRRGKNKVAAQNCRKRKLDQITSLADEVRTVRDRKQRTQRDHHTLTAERQRVKERFAALYRHVFQVRHPLTSQSPADEVRTVRDRKQRTQRDHHTLTAERQRVKERFAALYRHVFQVRHPLTSQSPADEVRTVRDRKQRTQRDHHTLTAERQRVKERFAALYRHVFQVRHPLTSQSPADEVRTVRDRKQRTQRDHHTLTAERQRVKERFAALYRHVFQVRHPLTSQSPADEVRTVRDRKQRTQRDHHTLTAERQRVKERFAALYRHVFQVRHPLTSQSPADEVRTVRDRKQRTQRDHHTLTAERQRVKERFAALYRHVFQVRHPLTSQSPADEVRTVRDRKQRTQRDHHTLTAERQRVKERFAALYRHVFQVRHPLTSQSPADEVRTVRDRKQRTQRDHHTLTAERQRVKERFAALYRHVFQVRHPLTSQSPADEVRTVRDRKQRTQRDHHTLTAERQRVKERFAALYRHVFQVRHPLTSQSPADEVRTVRDRKQRTQRDHHTLTAERQRVKERFAALYRHVFQVRHPLTSQSPADEVRTVRDRKQRTQRDHHTLTAERQRVKERFAALYRHVFQVRHPLTSQSPADEVRTVRDRKQRTQRDHHTLTAERQRVKERFAALYRHVFQNLRDASGRPLSSAQYSLQQAADGNVVLVPKMNQHPDHPMNRTDEDIDRKTKNYEQ
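
The Coding Sequence and Protein Sequence fields can be checked against each other by translating the nucleotides codon by codon. The following is a minimal sketch assuming the standard genetic code (NCBI translation table 1); the `translate` helper is a hypothetical name, and only the first and last few codons of the record are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence; unknown codons become 'X'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon reached; do not emit it
        protein.append(aa)
    return "".join(protein)


# First four and last five codons of the Coding Sequence above.
head = translate("ATGAATCTTTCG")
tail = translate("AACTACGAACAGTGA")
print(head, tail)
```

The two fragments translate to `MNLS` and `NYEQ`, matching the start and end of the Protein Sequence field.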

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-