Basic Information

Gene Symbol
-
Assembly
GCA_945859575.2
Location
OX243848.1:24485772-24509684[+]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domians group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 6 0.077 67 4.6 3.5 30 62 186 218 178 221 0.83
2 6 0.00019 0.17 13.0 0.8 27 62 583 618 579 620 0.89
3 6 0.0085 7.3 7.7 0.0 31 60 1012 1041 1008 1045 0.85
4 6 0.00079 0.68 11.0 1.3 29 61 1547 1579 1541 1582 0.88
5 6 0.0039 3.4 8.8 0.7 28 64 1883 1919 1880 1922 0.85
6 6 6.8 5.9e+03 -1.6 2.2 3 18 2079 2094 2077 2101 0.79

Sequence Information

Coding Sequence
ATGAAGGATACCGATTTCCAAGAAGAGGGTGTTGAGTGTGGCCGAAGCCGAAGTCGCTCATCTTCGGTAGGTGATAAATCTGTAAGAGAGAGCGGGAGTGACGCCGAGAGCGATACCCCGCAGAGATTATGGCGTAGCTGTGTGGGGTCGGGCTCGACCTCGGAAGACTCCGCGGCACCTATAACGCCAGCTGTAAAACGCGGCAGAGGCCGGCCACCCACTACGGGCCAGTATGTGGGTATTGGGAAAGCCCGCCGCGAAAAAGCTGACGCCCTTCGCGCCGACGAAGAAAAAAACAGAACGGCTGAGCTCCGTGTATCCGAGCTCATTAATAGAGTGCGAACAGAGAGGCCATTACCGAGTTCCATCACGCACAGAGCAGTAGACGACGAAGACCCGTCTACTAAGGAGTTGGACACCAAACTCAAAGCAGCTCTCGCTGCCATCACGGAGGTGTCCAAAATTTCCAAAGGGttgaaaggcacttgccagaaggcccttaaagaggcagtcgcgtgcatatcggagtgcacaaaaacgctcctccaaagatcgacctcagaagaggttaacatactgcaggcgtctaatacGCGGCTAAGGGCCGATATCGACGAACTCCAAAGAGAGCAGAGTGAGCTCAGGCTAGAAGTGCGGAAGCAACGCGCCCTCAACTGCCAGCCTCTCGAGGCTCCACCATCGGTGCAGCCGTCAGATGAGGCGATGGGGAGATTCATGGACCGTATTGAGGCGCGCTTCGCTGTTTTGGAGGCCAAGTTATTAGCGGAACCAATCGCACCTAAACCTGCCCATAAAGATCAACCTACGGCCATAAAACCACCGCGGTCAAAAAAGACGAAGAAGGGGGAAAAGGCTCAGAAGTCTAAGCAGGACCTTATCCCTCCCGCTCCGGACGCCCTACCATCGCCCGGTCCAGCAACGGCTCCTAATGCTCCCGAACGGCCGGAAAAATCCACACAGGaaaggtggcagaaggagggtcccgagcgcacttcgctcggggaatcccttgcggagccagtcggctccgtcagtcccgcgtactctgggggggacaaCCAACGACACATCCGCCCGTTGCCGAGCGGTAAATCCGGCACAAAGAAGAGCGCGAAGGTCGCGCTCCGAACCTCGAGCCGTATTTCTGCGGTTCGCGAGAAGGGTCGTAAGACCGTGAGAGGGGCTAAGGCTAGTGGGGCTCCATCCCCGTTGCCGCCTAAGGTCCCGCGTACGGGCTCTTTGACCGCCTCGGAGTCCTCCATTACGGACGTGGACGTAATGGAGGCTACTGAAGTCCACGACGACAAAGCGGTGCGTGACGAGCAGCCAGGCAGGCTCTGGGTTTCCCAGAAGCGTGGCCGCAGCCGCTCGCCGCCGCAGGAGGTGCCGGGGGTCTCCCAGACCCTCAGCAAAATACCTACAGCGCGTAGGGGAGGAGCCAGTGGCTCTGTTTTCTCCAGCAGGGAGAAATACCTGCGCACGCGCACCGAGGGCGAGCGCACCGCCGAAGACCGGGTGGTGCAGCTCCTCGCCGAGGACCAACGGTCGGGGGGGGACCTTGAATCCCCTCCTACTACGTCGGAACTCAAGGCGAGGGCCGGGGATGCGgtagccgcgatttacgaggtggctaccaaatccactaatttgaagggcccctttgtgcgcggccttaaggaggcgggcgctaaactgctcgtagtggtggacgacctggccggtcgcacggaagccACCACCAGGGAGATACAGCAGTTGCGCTCAGAGAACGCCACTCTGCGGTCGGACATGGCGGAGCTCCGCGCCGAGCTCGCCAGACTGCGGGCCGACCTAGCCTCCTCTAGACCCGGTTCATCGCGGGCTGTTTTGGCCAGTCCTGCGAAAACGCAGCCTTCACCCTCCTCCGGAGGTATTGAGGCGCAGTTGCGGCAGCTGATGGGCTCTTTTGAGTCGAAAATCGCTGCCCGCATCGGGAGTCTGGAGGCAAGATTGCCTCCGGTGCCGACCTCCCGCACACCGCCGGTCGCAAAGCCGTACACGGCTACCGCTCCGACGCCACTACCCTGTTCCGGGGGACGCGTTGGGACCTACGCGACACGCGCGGCGTCTGCGCCCATGCAGAAGACGGCACGAGCTACGGCCCCTCCAGCGGTCGAGAGGACGCCAGCCCAGTCCCGGGGAGGCCCCCAGCCACCCCAGGTTTTGACGGCGGCACCATCTCCCGCGCTGTCGGTCGCTCCACGTGCCCAGAAGGCTCGTGTCCCTGCGCCCGCACCTGCTCCcgcccccctcgtccccgctgacgagggctggcagaagAGGAGGAGCCCAAGGCTCGTGTTGAAAGATGTGGTTACTGATGGAGAAGGAAAGGACGCAGCAATGAGTGGGGGCAGTTTACCCCAAGAAGTGGCGGAGTCCGGCCTTTTGAGCAGTTCCAGAATCTCAGTGACGTCTGCGTCACTGGACAGTCTGTTGCGCTCTGGCCGGTTCGGCTCAGAATCCTCGCTAAGCTCAGCGCTTAGCGATATGGATCAGTTGGTGGATGAGTGCCGCGCTCGGCAGCCGAAGCGTGGAGCCTCTCCCGGAGACGGTGAGAACCTTCCTGAGGTTGACCTTGATGTCCCTAGGAAGGTTCACTCCTCCCAAAGAGGGGGATCTGTGGCTAGGTTTGCTCGGCCGCAGACACCCCCTGCAAAGACGGCTGCCAATGTGACGCCGGTCCCTTCAACTTCCAGCGTTTTCTCTAGGCGGGAGGCCTATAATCGAGGCCGAACAGAAAGGGAACAGGTTGCTGAAGACCAGGTTCAGCAGCTGTTGGGCGGGCTTTGTTCTTCGGGAGGGGCTATTGCCCCGAAAACCGTGACGGAGCTCGTAGAATCGGCGACCGCATCGGTAAAGGTCATTGCAATGGTGGCAGCCAAAAGCAAGAACCTAAAGGGTACCTTTGCTAAGGGCTTGAGGGAGTCTGCCTCCGTGCTAAAGGAGGTGGTCGAGAGTTTCACCGCCACACAGGTGGAATTGGAGAAGTTGCGGGCTGACAATTCCCGCCTAGAGGCAGAGGTCAAGGCACTTCGCGCAGACATCCTAGCATGGCGGGCCGATTTTAGAGCGGCCGAATCTGGATCTGTCCTGGTTAACGTGGGGGGTAGTGAGCAGCTATCAGACGCCACCTTCACGGAGGCTCAGATGAGGCAGCTCAGTCTTCTCATAGGGGGTCTAATTGATGCACGCCTTGAAGGCATTAAGGCGCGTCTTCCCCCGGAGCCTAGCTGTCGGCCGCCGCTCGCAGCGTCTAGGCCAAAGGCCCCAAGGCCCCAAGGGCCATCGCAGGCACCATCGGTGCCTCTACCCCGTACAGGGGGACGCGCCGGGACCCTTGCCGCGGTTGTTGCCTCAGTGCCGACAGGCCGAGGAACGCCACGGGACGGTGTGCGGCCGAAGGGTGGAGATCCCCATCCCGCGCTGCAGCGTGCGCCAAAAAGGGCATCTCGTGCTGCCGTAGCGGGCACCTCAAACGCCCCAACGGCTCAGATCCGTCTCCCGGTGTCAGCATCTGCACCCGCCACTGGCCCCTCTGGTGAGGAATGGCAGGTGGCGGAGAGCAAGGGATCGCGCCGGAAGAAGAAGAGGgccgctatcgtcgcttctacgcaggaggtcactacgacaagaggtagcggcaggaagaggccgaaactgtctccgccgaaaacggccgccgtcctggtgacggtgtcgccggaggcggagcgcaacggcataagctacgcggacctcttccgcgagacgaaggcgcggctcaccgccgccaaccaaagcgttgaagtgagcgtccgacgctcccagaccggggctcgattgctcgagtgccccggtcaagacggcgccaaagttgcagatatcgtggcggggaggaggagCCCAAGGCTCGTGTTGAAAGATGTGGCTGTTGAAGGAGAGGACGCAGCAAAGAGTGGGGGCAGTTTGCCCCGAGAAGTGGCGGAGGCCGGCCCTTTGAGCGGTTCCAGAACTTCGGTGACGTCTGGGTCACTGGATAGCCTGTTGCGCTCGGGCCGGTTCGGCTCCGATTCATCGCTGAGCTCAGTCAGCGACATGGACTTGTTGGTGGGTCCACAGACCCTGTCAGCTGAGCACCGCCGCGCTCAACAACAAAAGCGCGGCGCATCTCCCGGAGGTGACGGAAACCTTCCTGAGGTCGACCTCGAGGCACCCAGGAAGGTTCATTCTTCCCAAAGAGGGGGGTCTGTGGCGAGGTTCGCTCGCCCGAAGACACCCCCTCCTGCGGTGACGGCACCGAACGTTCCTACGTCCGGTGTGTTCTCCAGGCGGGAGGCCTACAACAGAGGCCGCACGGAGGGTGAAGTAGTTGCTGAGGAGCAGGTCCAGCAGCTACTGGATGAGCTCCGCCCGTCGGAGGGAGTGAGCTCCACGAAGACCGTGGCGGAGCTCATAGAATCGGCGGCTACCTCCGTGAAGGTTATAACGAACGTGGCTCAGAAGAGTAAAAACCTTAAGGGCACGTTCGCCAAGGGCCTGAAGGCGTCGGCCTCGACATTAAAGGAGGTGGTGGAGGCGCTATTTGAGCGCTCCACAGCCAACGAGGTGGAGACTCAGAGGTTGCGCGCTGACAACGCCCGTCTGGAGAAGGAGCTGAAGGCACTTCGCTCCGAGATGGCTCTGCTCCGTGCCGATCTTGGATCGACTAGAGCCGGTTCTGCCCCGGCCGTAGTCAGTGCTGTGCCGGCCGGTTCTGTCCCGGCCCATAGTGGAGAGCGGCCATCTGGCGACTCCTTTACGGAGGCGCAGATGAGGCAGCTCTCACTTCTTATCGGAGGCCTCGTCGACGTTCGCCTTGAGGGCATCAAGGCGCGTCTACCCCCGGAACCGAGCTGTCGGCCGCCGCTCGCGGCACCCAAGACCACGCAGGGGCAACTGATCGCACGCCCGGTGCCACCACCCCGAACAGGGGGACGCGCTGGGTCTTTTGCGACAGTGGTTGCCTCAGGGCCGGGCGGGCGAGGGGCGCAGCGTGCGCCGAGGAAAGCACCCCGTGTCGCCGAAGTGGGCGCCTCAGGCGCCCCAACGGCCCAAGCCTGTCCTCCGGCGCCAGCGTCTGCTCCCGCTTCTGCCCCCTCTGGAGAGGAATGGCGAACGGCGGAAAGCAAAAAATCGCGGCGCCGGCAGAAGAAGAAGgccactcccgccgctcccgtgcaggaggttgccccggcaagggagaaaggggggggatcagcggcgggaacgcgtgctaagacaccgtTGTCCCGTCCACAAACTCCGACGACGTCAACACCGGCCGATAACACCTTGGCTGGTTCGGCGTCACCTTTCAGTAGACGCGAGGCCTATGTCCGCACCCGCACGGAGGGAGAACTGGTCGCCGAGAACCAGGTGGACGGGTTCCTGGAGGAACTTCGATCTGTGGGAGGGGATAGTGCCCCGAAAACCGGATCGGAGCTTGTAAAGGCGGCGGCCACCTCCTTGTCGGTTATTGTGAATGTTGCCACTAAGAGCAAAAATCTTAAGGGCACATTCGCAAAAGGCCTGCGGACGTCCGCCCAGGTTTTGGGGGAAGTGGTGGATGCGCTGTTCGAGCGTGTCACTGCCACCGAGGCGGAATCGGAAAGGCTGCGGGCGAGCAACATCCGCATGGAAGAGGAGTTGGGCAAGCTACGTGCTGAAGTAGCAAAATTACGTGCCGACTTTGCGTCGGCCAGAGCCGGTTTTGCCCCGGGGTTGGTTGATTCTGCTGGGGCTCATAGTGTAGGGCGCCCATCTGACGACTCCTTTACGGAGGCGCAGATGAGGCAGCTCGCACTTCTTTTCGGGGGCCTCTTCGATGCGCGCTTTGAGGGAATCAAGGCGCGCCTGCCCCCGGAACCGAGCTGTCGGCCGCCGCTTGCGGCGCATAAGCCCCCTCAGGGGCCAAAGATGCCACCCCCGGCACCACCCCGTAAAAAAGGCCACGCCGGGTCTTTTGCTACAGTGGCTGCCTCAGGGCAAGGTGCGCGGCCGCGGGGTGAAGATCCCCCGCCGCAGCGTACGTTGAGAGGAGCACCCCGAGCTTCCATAGCGGGCACCTCGGAGGCTCCAGCCACCCAGGTCTGTCCCCCGGCGCCAGCGTCTGCCCCCGCCTCTGCCCCCTCTGGTGAGGAATGGCGAACGGCGGAGAGTAGGAGGTCGCGGCGCCGGAAGCATAAGGAGgccgcttccgccgcttctgtgcgggaggctactctggtcagggataaggttaggcgaaagccaaaactatctccgccgaaaacggccgccgttctcgtgacggtgtcgccggaggcggaaagtcgaggcgttacgtacgccaacgcccagcggggcgagggccgctcttctggcagctggggggcgtcgaagtga
Protein Sequence
MKDTDFQEEGVECGRSRSRSSSVGDKSVRESGSDAESDTPQRLWRSCVGSGSTSEDSAAPITPAVKRGRGRPPTTGQYVGIGKARREKADALRADEEKNRTAELRVSELINRVRTERPLPSSITHRAVDDEDPSTKELDTKLKAALAAITEVSKISKGLKGTCQKALKEAVACISECTKTLLQRSTSEEVNILQASNTRLRADIDELQREQSELRLEVRKQRALNCQPLEAPPSVQPSDEAMGRFMDRIEARFAVLEAKLLAEPIAPKPAHKDQPTAIKPPRSKKTKKGEKAQKSKQDLIPPAPDALPSPGPATAPNAPERPEKSTQERWQKEGPERTSLGESLAEPVGSVSPAYSGGDNQRHIRPLPSGKSGTKKSAKVALRTSSRISAVREKGRKTVRGAKASGAPSPLPPKVPRTGSLTASESSITDVDVMEATEVHDDKAVRDEQPGRLWVSQKRGRSRSPPQEVPGVSQTLSKIPTARRGGASGSVFSSREKYLRTRTEGERTAEDRVVQLLAEDQRSGGDLESPPTTSELKARAGDAVAAIYEVATKSTNLKGPFVRGLKEAGAKLLVVVDDLAGRTEATTREIQQLRSENATLRSDMAELRAELARLRADLASSRPGSSRAVLASPAKTQPSPSSGGIEAQLRQLMGSFESKIAARIGSLEARLPPVPTSRTPPVAKPYTATAPTPLPCSGGRVGTYATRAASAPMQKTARATAPPAVERTPAQSRGGPQPPQVLTAAPSPALSVAPRAQKARVPAPAPAPAPLVPADEGWQKRRSPRLVLKDVVTDGEGKDAAMSGGSLPQEVAESGLLSSSRISVTSASLDSLLRSGRFGSESSLSSALSDMDQLVDECRARQPKRGASPGDGENLPEVDLDVPRKVHSSQRGGSVARFARPQTPPAKTAANVTPVPSTSSVFSRREAYNRGRTEREQVAEDQVQQLLGGLCSSGGAIAPKTVTELVESATASVKVIAMVAAKSKNLKGTFAKGLRESASVLKEVVESFTATQVELEKLRADNSRLEAEVKALRADILAWRADFRAAESGSVLVNVGGSEQLSDATFTEAQMRQLSLLIGGLIDARLEGIKARLPPEPSCRPPLAASRPKAPRPQGPSQAPSVPLPRTGGRAGTLAAVVASVPTGRGTPRDGVRPKGGDPHPALQRAPKRASRAAVAGTSNAPTAQIRLPVSASAPATGPSGEEWQVAESKGSRRKKKRAAIVASTQEVTTTRGSGRKRPKLSPPKTAAVLVTVSPEAERNGISYADLFRETKARLTAANQSVEVSVRRSQTGARLLECPGQDGAKVADIVAGRRSPRLVLKDVAVEGEDAAKSGGSLPREVAEAGPLSGSRTSVTSGSLDSLLRSGRFGSDSSLSSVSDMDLLVGPQTLSAEHRRAQQQKRGASPGGDGNLPEVDLEAPRKVHSSQRGGSVARFARPKTPPPAVTAPNVPTSGVFSRREAYNRGRTEGEVVAEEQVQQLLDELRPSEGVSSTKTVAELIESAATSVKVITNVAQKSKNLKGTFAKGLKASASTLKEVVEALFERSTANEVETQRLRADNARLEKELKALRSEMALLRADLGSTRAGSAPAVVSAVPAGSVPAHSGERPSGDSFTEAQMRQLSLLIGGLVDVRLEGIKARLPPEPSCRPPLAAPKTTQGQLIARPVPPPRTGGRAGSFATVVASGPGGRGAQRAPRKAPRVAEVGASGAPTAQACPPAPASAPASAPSGEEWRTAESKKSRRRQKKKATPAAPVQEVAPAREKGGGSAAGTRAKTPLSRPQTPTTSTPADNTLAGSASPFSRREAYVRTRTEGELVAENQVDGFLEELRSVGGDSAPKTGSELVKAAATSLSVIVNVATKSKNLKGTFAKGLRTSAQVLGEVVDALFERVTATEAESERLRASNIRMEEELGKLRAEVAKLRADFASARAGFAPGLVDSAGAHSVGRPSDDSFTEAQMRQLALLFGGLFDARFEGIKARLPPEPSCRPPLAAHKPPQGPKMPPPAPPRKKGHAGSFATVAASGQGARPRGEDPPPQRTLRGAPRASIAGTSEAPATQVCPPAPASAPASAPSGEEWRTAESRRSRRRKHKEAASAASVREATLVRDKVRRKPKLSPPKTAAVLVTVSPEAESRGVTYANAQRGEGRSSGSWGASK

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-