Basic Information

Gene Symbol
-
Assembly
GCA_949628255.1
Location
OX451211.1:14990383-14995158[-]
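
The location string above packs a sequence accession, a coordinate range and a strand into one field. A minimal sketch of splitting it apart follows. The names `GenomicLocation` and `parse_location` are hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from dataclasses import dataclass


@dataclass
class GenomicLocation:
    seqid: str
    start: int
    end: int
    strand: str

    def span_length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Pattern for strings of the form "<seqid>:<start>-<end>[<strand>]".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$")


def parse_location(text: str) -> GenomicLocation:
    """Parse a location field such as 'OX451211.1:14990383-14995158[-]'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return GenomicLocation(
        seqid=match.group("seqid"),
        start=int(match.group("start")),
        end=int(match.group("end")),
        strand=match.group("strand"),
    )


loc = parse_location("OX451211.1:14990383-14995158[-]")
print(loc.seqid, loc.strand, loc.span_length())
```

The span covers the whole locus on the genome, so it can be longer than the coding sequence if the gene contains introns.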

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domains group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA-binding regions adjacent to α-helical regions that fold together as coiled coils.
Hmmscan Out
#   of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to  acc
1   13  0.00039   0.48      11.3   7.5   25        63      537       575     535       577     0.91
2   13  0.00049   0.61      11.0   7.1   25        63      593       631     592       633     0.92
3   13  0.00035   0.44      11.5   4.9   24        61      655       692     652       696     0.91
4   13  0.0026    3.3       8.7    8.8   25        64      705       744     694       745     0.85
5   13  0.0078    9.7       7.2    11.6  25        63      719       757     713       762     0.80
6   13  0.00035   0.44      11.5   4.9   24        61      781       818     778       822     0.91
7   13  0.0026    3.3       8.7    8.8   25        64      831       870     820       871     0.85
8   13  0.0078    9.7       7.2    11.6  25        63      845       883     839       888     0.80
9   13  0.00073   0.91      10.4   5.3   25        63      901       939     900       941     0.79
10  13  0.0019    2.4       9.1    3.0   32        63      957       988     953       990     0.93
11  13  0.017     21        6.1    7.1   25        56      1006      1037    1005      1044    0.58
12  13  0.12      1.4e+02   3.4    2.6   29        64      1089      1124    1084      1132    0.69
13  13  0.15      1.9e+02   3.0    4.6   25        61      1127      1163    1120      1180    0.65
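
Each hmmscan row above carries 13 whitespace-separated fields: domain index, domain count, conditional and independent E-values, score, bias, the HMM, alignment and envelope coordinate pairs, and the accuracy. A minimal sketch of reading such rows and keeping the stronger hits follows. The names `DomainHit`, `parse_domain_row` and `filter_hits` are hypothetical, and the i-Evalue cutoff of 1.0 is an illustrative assumption, not a threshold used by this record.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class DomainHit:
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_domain_row(line: str) -> DomainHit:
    """Parse one whitespace-separated per-domain row into a DomainHit."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
    coords = [int(value) for value in fields[6:12]]
    return DomainHit(
        index=int(fields[0]),
        total=int(fields[1]),
        c_evalue=float(fields[2]),
        i_evalue=float(fields[3]),  # float() also accepts forms like "1.4e+02"
        score=float(fields[4]),
        bias=float(fields[5]),
        hmm_from=coords[0],
        hmm_to=coords[1],
        ali_from=coords[2],
        ali_to=coords[3],
        env_from=coords[4],
        env_to=coords[5],
        acc=float(fields[6 + 6]),
    )


def filter_hits(hits: List[DomainHit], max_i_evalue: float) -> List[DomainHit]:
    """Keep hits whose independent E-value is at or below the cutoff."""
    return [hit for hit in hits if hit.i_evalue <= max_i_evalue]


# Three rows copied from the table above.
rows = [
    "1 13 0.00039 0.48 11.3 7.5 25 63 537 575 535 577 0.91",
    "11 13 0.017 21 6.1 7.1 25 56 1006 1037 1005 1044 0.58",
    "12 13 0.12 1.4e+02 3.4 2.6 29 64 1089 1124 1084 1132 0.69",
]
hits = [parse_domain_row(row) for row in rows]
kept = filter_hits(hits, max_i_evalue=1.0)  # illustrative cutoff only
print([hit.index for hit in kept])
```

Of the three sample rows, only domain 1 has an independent E-value at or below the illustrative cutoff.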

Sequence Information

Coding Sequence
ATGGCTCCCGCGgtggtggcaatttttttaatcactctATTCGTGGAAGGATTATCAGCTCCAAGTTGGTGTGGAGATTGCCAAACATGGGAATCGCACAGTGGCTCTCAGACTCATAGAGGATTTGGGAGGCAAATAAATCCCGAAAATTTGTCTCAGAGGTCAGAAAACTTGGAAGATTTAACACAACAAGCCGAAACCGAGTTAAACAGATCTCCCAATCAATTACCTTTCGATAATACAAGGCCTGGAAATTGGACCGATGTTAATCATTACAGAACATCGGATGGCCATGGAAGAGTATACGAAGAACAAGGCCAGCGTGTAGATGGATCAAATCGAATAAGATTCTCTAGAAGAAATTTCACTTCCAGTTATAGCAGTGGAAGCTTAGGTTCCTTTGGAGAAACTAATCTGGGACATATATATCCCAACGTTAGACAAGATGAGAGCCAGTTATTGAACCGCGAATCTTTGGATCAGTCACAAAATTCGGCTTATGATCGATTCGCCACTGGACGAAATTTCCATACTACACAGGACTCTTTACACTCTACAGAAAGGGTGAACAGTCATAACGATGCATCCAGATATTATGAAAATCATGGCAATAGTGGTCGGATCAGTGGAATTACTTCTGGCCAATCATCGCAACAAGGAATCAATGTATTGGATCGAACAAGACCAGGAAATTGGAGCACGGTTAATACTTTTAGAACCAATGAGGGTAATGGCAGAGTTTACGAAGAACGAGGGCAGATTGTAACAGGGCCGAGGCGAGTTCATTTTTATGCAAGAAATTACACTTCAAGTTATGCCTCTGGCGGAGGTATTCCGACTCTTGGTTTGGAGGGCGACAATACAAGGAACATCGAGAGTACCGTGCAGCAGATGCAGAGACAATTTGATAGTTATGGAAGAGAGCTTCATCAAAGTACTGAAGGTTCAACAAATGGTGATTACACTCAGCATTATCCTGGAGATTATACATCACCTAGTCAAACGTCGAGACAAACAAACTATAGATATGTATCAAGACCCAGTAACTATGAATCGCAAAATCAGAATACTTTGGATTCAAATTCTCACCAAACGTACCAACACACAACTAACTTAGGAAATCGGCATGTGTCTCAGTCTAGCAGTAGTTCTTTTGGTGGATCTGGACAACTTAATGGAAGAAATCCAGATTCTGGATACTCAACGGGCAGTTATACCAGTGGTAGTGGATACAATCATCAGGGAACTTTGGAAATACCATCTTCAGGACACACTGGACACCAAATTCCATATTACAATCAATTTCAAACCACCTCTGACTCCTCCTCTGCTGCCTTTGTATCTCGTCCCGATACCGATCTAAGAACTATTCAATCTGGTAGCGATCTAGAAACACAGCGAACGCTTAATACACACAACAGCTTTGATCAAACTACTAATAATAATCGGATTTATAGGATACAGAATGGGCAACTAGTTACACAGGGTATTGATTTGGGACAAATAGCACAAGCTCCTGATTGTGCAGAAGGTACAAATGGATATAGCTCATACGAACAGTCCTCCCGTAGAATCTATAGAGGGGCTGCCGAGCCTCATGATCTTTCGCAACAAGTGCAAGATCTTACCCAGCAGACGGAGGATCTTACCCAGCAAACGGAAGATCTAACTCAGCAGACACAGGATCTTACACAACAAACAGAAGATCTTACACAGCAAAATCAAGATTTTGGACAACAATCTTCTTGGAGACCAGGTAAATTGGAGGTTGGCAGTCAGCAGGTTGAAGATCTCACCCAGCAAACAGAGGATCTTACCCAACAAACGGAAGATCTTACCCAGCAATCGCAGGATCTTACACAACAAACAGAAGATCTTACACAACAAAATCAAGATTTTGGACAGCAATCTTCTTGGAGGCCAGGTAAATTGGAAGTTGGTAGTCAGCAGATTGAAAATCTCCCACAACAAACCGAAGGTCTTACTCAGCAAACGGAAGA
TCTTACCCAGCAGACGGAGGATCTTACTCAACAAGCTGAAGATCTGACACAACAAAATCAAGATTTCGGACAGCAACCTTCTCGGCGACCAGGTAAATTGGAAGTTGGTAGTCAACAGGTTGAAGATCTCACCCAGCAAACAGAGGATCTTACCCAACAAACGGAAGATCTTACCCAGCAAACGGAAGATCTTACCCAGCAAACAGAAGATCTCACTCAGCAAACGGAGGATCTTACACAACAAACAGAAGATCTTACACAACAAAATCAAGATTTTGGACAGCAATCTTTTTGGAGGCCAGGAAAATTGGAAGTTGGTAGTCAGCAGATTGAAAATCTCCCACAACAAACCGAAGGTCTTACTCAGCAAACGGAAGATCTTACCCAGCAGACGGAGGATCTTACTCAACAAGCTGAAGATCTGACACAACAAAATCAAGATTTCGGACAGCAACCTTCTCGGCGACCAGGTAAATTGGAAGTTGGTAGTCAACAGGTTGAAGATCTCACCCAGCAAACAGAGGATCTTACCCAACAAACGGAAGATCTTACCCAGCAAACGGAAGATCTTACCCAGCAAACAGAAGATCTCACTCAGCAAACGGAGGATCTTACACAACAAACAGAAGATCTTACACAACAAAATCAAGATTTTGGACAGCAATCTTTTTGGAGGCCAGGAAAATTGGAAGTTGGTAGTCAGCAGATTGAAAATCTCCCACAACAAACCGAAGGTCTTACCCAGCAAACAGAAGATCTTACTCAGCAAACGGAGGATCTTACACAACAAACGGAAGATCTGACACAACAAAATCAAGATTTCGGACCGCAATCTTCTTGGAGACCAGGTAAATTGGAAGTTGGTAGTCAACAGGTTGAAGATCTCACCCAGCAAACAGAGCATCTTACCCAGCAAACGGAGGATGTTACACAACAAACGGAAGATCTGACACAACAAAATCAAGATTTCGGACCGCAATCTTTTTGGCGACCAGGTAAATTAGAAGTTGGTAGTCAGCAGGTTGAAGATCTCACCCAACAAACGGAAGATCTTACGCAACAAACGGAAGATCTTACGCAACAAACAGAAGACCTTACTCAACAAACAGAAGGCGAGACGCAACAAAATCTTATACCACCTCATTTCCAACCTTGGCACCATGAAAGATGGCAAGCTGCAGACCCAAATTATGTACCCCTACAAACAGTGTACGAAGGAGCAGTGAGACCTGAAAACTCAGATATCACCAGTCAACAACCGGAGGATCTAACTCAACAAACTGAAGATTTTGGGCAACAAACACAGGATCTTACTCAGCAAAGTGAAGATCTTGGTCAACAAACTGAAGATTTTGGTCAACAAACACAGGATCTTACTCAAAAAACTGAAGATCTTGGTCAACAAACTGAAGATTTTGGTCAACAAACACAGGATCTTACTCAACAAACTGAAGATCTTGGTCAACAAACTGAATATTTTGGTCAACAAACACAGGATCTTGGTCAACAAACTGAAGATCTTGGTCAGCAAACGCAAGGTATCATCCAAGAAACTGATGGCCAATCACagcaaaatgaaaattttaatggtTGGAGGGAACAGATAACAAGTGGCCCAGGATTTGGACAGGAGTCTCCTTGGAACTCTGACAATCTGGAAATTGGAGGTCAACaaaccgaaaatttttatcaagaaaatcaatttggcAAACACCAAACAATCATCCATCCCGGACAACCGACAAGACCAGCACCAAAGCCTGCACCTAAACCAAAACGTCCAAGTCACGGGAATTTCCATCATACTCAAGAGATTAATATAGAGATTGAAGAACCAACTGTATCTAATGCAGATAGTCATACGGTGCAACATAATGATCAGcaaaatagtgaaaaatggGTATCAACGGGTGTTCCTCCCACTCCACAAAGAGGTGATCAAGGTATCAACGTAAACTCTAACGAACCTGAAGAAGCAGATATTAATATTGAATCAGATA
TACCCAAGATACCTGAACATCAAATTCAATATGTGTACCCGTACCCGGATTCATCCAGCCAACAAACTACAAGTGGAAATCAATTCAGAGAAACTCAACCTACTAAAACTAAGACAAGTCGCCGAAGAGGGAATAATGCTGTTCAATACCAAGGTCCACAAGGATGGCATTCTCGTGATTTGTCAATCAGTCAAGACCCAACAATCAGACTTGTTGATAGACGTATAAACTCAGGTGACTTAAACTTGCCGCAATCAGCAAATACCGGACAAGTTATACAAGACTTTCAACAACATTTGACTAATCCTAAAGAAATTGAACAACTTGAATCTGGGCAAACAGTTCAGAGAATTCAACCTCTTGGTGCGGCTATAGAATCAAGACAACGGAGCAGTGGTCAATcagaaaaaattgtctttcCCGAATCTTCAGAAGTCTCTTTTAGTCCTAGAATTTTAGAGGCATTTGGAGCGAATGGACCATACGGCGAACATGATTTGGATATATTTGATTCTGCCAAACAGTATCCTGACACTACAACAGTTTTAACACCGCCTGAAAATGGAAATGATTGGGATATTCGTGAGGTTGATCGGATAGTTACAACCACAACTGAGGCTCCAACTCCTTTAccatcaacaacaacaactcCTCTGCCAACAACAACACCACCTCCGCCTCCAACTCCGGCTCCtggattttggaaaaaactgGGTAACACGTTTAGTACTACCGTAGAAAAAGCCAAGGACAAGGCGAGAGACTGGTTCGGTTAA
Protein Sequence
MAPAVVAIFLITLFVEGLSAPSWCGDCQTWESHSGSQTHRGFGRQINPENLSQRSENLEDLTQQAETELNRSPNQLPFDNTRPGNWTDVNHYRTSDGHGRVYEEQGQRVDGSNRIRFSRRNFTSSYSSGSLGSFGETNLGHIYPNVRQDESQLLNRESLDQSQNSAYDRFATGRNFHTTQDSLHSTERVNSHNDASRYYENHGNSGRISGITSGQSSQQGINVLDRTRPGNWSTVNTFRTNEGNGRVYEERGQIVTGPRRVHFYARNYTSSYASGGGIPTLGLEGDNTRNIESTVQQMQRQFDSYGRELHQSTEGSTNGDYTQHYPGDYTSPSQTSRQTNYRYVSRPSNYESQNQNTLDSNSHQTYQHTTNLGNRHVSQSSSSSFGGSGQLNGRNPDSGYSTGSYTSGSGYNHQGTLEIPSSGHTGHQIPYYNQFQTTSDSSSAAFVSRPDTDLRTIQSGSDLETQRTLNTHNSFDQTTNNNRIYRIQNGQLVTQGIDLGQIAQAPDCAEGTNGYSSYEQSSRRIYRGAAEPHDLSQQVQDLTQQTEDLTQQTEDLTQQTQDLTQQTEDLTQQNQDFGQQSSWRPGKLEVGSQQVEDLTQQTEDLTQQTEDLTQQSQDLTQQTEDLTQQNQDFGQQSSWRPGKLEVGSQQIENLPQQTEGLTQQTEDLTQQTEDLTQQAEDLTQQNQDFGQQPSRRPGKLEVGSQQVEDLTQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQNQDFGQQSFWRPGKLEVGSQQIENLPQQTEGLTQQTEDLTQQTEDLTQQAEDLTQQNQDFGQQPSRRPGKLEVGSQQVEDLTQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQNQDFGQQSFWRPGKLEVGSQQIENLPQQTEGLTQQTEDLTQQTEDLTQQTEDLTQQNQDFGPQSSWRPGKLEVGSQQVEDLTQQTEHLTQQTEDVTQQTEDLTQQNQDFGPQSFWRPGKLEVGSQQVEDLTQQTEDLTQQTEDLTQQTEDLTQQTEGETQQNLIPPHFQPWHHERWQAADPNYVPLQTVYEGAVRPENSDITSQQPEDLTQQTEDFGQQTQDLTQQSEDLGQQTEDFGQQTQDLTQKTEDLGQQTEDFGQQTQDLTQQTEDLGQQTEYFGQQTQDLGQQTEDLGQQTQGIIQETDGQSQQNENFNGWREQITSGPGFGQESPWNSDNLEIGGQQTENFYQENQFGKHQTIIHPGQPTRPAPKPAPKPKRPSHGNFHHTQEINIEIEEPTVSNADSHTVQHNDQQNSEKWVSTGVPPTPQRGDQGINVNSNEPEEADINIESDIPKIPEHQIQYVYPYPDSSSQQTTSGNQFRETQPTKTKTSRRRGNNAVQYQGPQGWHSRDLSISQDPTIRLVDRRINSGDLNLPQSANTGQVIQDFQQHLTNPKEIEQLESGQTVQRIQPLGAAIESRQRSSGQSEKIVFPESSEVSFSPRILEAFGANGPYGEHDLDIFDSAKQYPDTTTVLTPPENGNDWDIREVDRIVTTTTEAPTPLPSTTTTPLPTTTPPPPPTPAPGFWKKLGNTFSTTVEKAKDKARDWFG
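
The coding and protein sequences above can be cross-checked by translating the coding sequence. The sketch below assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly. The function name `translate` is hypothetical. Lowercase bases are uppercased before lookup, and any line breaks inside the sequence are removed.

```python
# Standard genetic code (NCBI table 1), laid out in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop whitespace, ignore letter case
    protein = []
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    for pos in range(0, usable, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X for ambiguous codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 21 and last 15 bases of the coding sequence above.
print(translate("ATGGCTCCCGCGgtggtggca"))  # MAPAVVA
print(translate("GACTGGTTCGGTTAA"))        # DWFG
```

Both fragments agree with the start (MAPAVVA) and end (DWFG) of the protein sequence listed in this record.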

Similar Transcription Factors

Clustering by sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-