Basic Information

Gene Symbol
-
Assembly
GCA_945859575.2
Location
OX243843.1:50183-68298[-]
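
The location string above packs the sequence accession, coordinates, and strand into one field. A minimal sketch of how it could be split into parts follows. The helper name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

# Pattern for strings of the form "<accession>:<start>-<end>[<strand>]".
LOCATION_RE = re.compile(
    r"(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]"
)

def parse_location(loc):
    """Split a location string into accession, coordinates, and strand.

    Hypothetical helper, not part of any database API.
    """
    match = LOCATION_RE.fullmatch(loc.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    start = int(match["start"])
    end = int(match["end"])
    return {
        "seqid": match["seqid"],
        "start": start,
        "end": end,
        "strand": match["strand"],
        # Assumption: 1-based, inclusive coordinates.
        "length": end - start + 1,
    }

if __name__ == "__main__":
    print(parse_location("OX243843.1:50183-68298[-]"))
```

Under the inclusive-coordinate assumption, the locus spans 18116 bp on the minus strand.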

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domains Group
Description
bZIP proteins form homo- or heterodimers. Each monomer contains a highly basic DNA-binding region adjacent to an α-helical region, and the α-helical regions of the two monomers fold together as a coiled coil.
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1  7   3.7e-05   0.032     15.3   0.8   29        62      266       299     262       301     0.89
2  7   1.8       1.5e+03   0.3    1.5   4         19      433       448     431       450     0.89
3  7   0.00015   0.13      13.3   0.4   27        62      701       736     699       738     0.90
4  7   0.0044    3.8       8.6    0.2   25        60      1129      1164    1128      1167    0.92
5  7   0.0058    5         8.2    0.6   28        63      1528      1563    1523      1565    0.88
6  7   8.9       7.6e+03   -2.0   2.8   20        39      1594      1613    1592      1624    0.88
7  7   0.0027    2.3       9.3    0.0   30        59      1825      1854    1813      1856    0.73
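
The seven domain hits above differ widely in significance. The following is a minimal sketch of how the rows could be read and filtered by independent E-value. The field names, the `parse_domain_rows` and `filter_by_ievalue` helpers, and the cutoff of 1.0 are illustrative assumptions, not the criteria used by the database.

```python
# Domain rows copied from the Hmmscan Out table in this record.
ROWS = """\
1 7 3.7e-05 0.032 15.3 0.8 29 62 266 299 262 301 0.89
2 7 1.8 1.5e+03 0.3 1.5 4 19 433 448 431 450 0.89
3 7 0.00015 0.13 13.3 0.4 27 62 701 736 699 738 0.90
4 7 0.0044 3.8 8.6 0.2 25 60 1129 1164 1128 1167 0.92
5 7 0.0058 5 8.2 0.6 28 63 1528 1563 1523 1565 0.88
6 7 8.9 7.6e+03 -2.0 2.8 20 39 1594 1613 1592 1624 0.88
7 7 0.0027 2.3 9.3 0.0 30 59 1825 1854 1813 1856 0.73
"""

# Column order follows the table header; the names themselves are our own.
INT_FIELDS = ("index", "of")
FLOAT_FIELDS = ("c_evalue", "i_evalue", "score", "bias")
COORD_FIELDS = ("hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to")

def parse_domain_rows(text):
    """Turn whitespace-separated domain rows into a list of dicts."""
    domains = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) != 13:
            continue  # skip blank or malformed lines
        row = {}
        for name, value in zip(INT_FIELDS, parts[0:2]):
            row[name] = int(value)
        for name, value in zip(FLOAT_FIELDS, parts[2:6]):
            row[name] = float(value)
        for name, value in zip(COORD_FIELDS, parts[6:12]):
            row[name] = int(value)
        row["acc"] = float(parts[12])
        domains.append(row)
    return domains

def filter_by_ievalue(domains, cutoff=1.0):
    """Keep domains whose independent E-value is at or below the cutoff.

    The default cutoff is an arbitrary illustrative value.
    """
    return [d for d in domains if d["i_evalue"] <= cutoff]

if __name__ == "__main__":
    for d in filter_by_ievalue(parse_domain_rows(ROWS)):
        print(d["index"], d["ali_from"], d["ali_to"], d["i_evalue"])
```

With this illustrative cutoff, only domains 1 and 3 (alignment coordinates 266-299 and 701-736) are retained. A stricter or looser threshold would change the selection.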

Sequence Information

Coding Sequence
atggacagtaacaaactacgaaatatttttaacaagaggagcacaacgctatcggagaaggagaaggagaaggagaaggagaaagagaaggagagagagggaGAGAACGTACTGCAAAGGAGTGCGGGGAGTTTGCCCCTAGAGGCGGCTGATTCCGGCCGGCTTAGCCGTTCCAGGACTTCGGTGACCTCTGCGTCGCTGGAGAGTTTGCTGCGCTCGGGCCAGTTCGGCTCTGAGTCATCCGTTAGCTCAGCGGTTAGCGACCTGGACCAGATGGTGGCTGGGTTAGGGAACGCTCCAGAGACCTCGACGGCTGGCGAACGCCGCGCTCAACATCAAAAGCGCGGCGTCTCTCCCAGTGATGACGAAAACCTTCCTGGGCTTGACCTTGAGGCCCCCAGGAAGATACATACATCCCAAAGGGGGGGATCTGTGGCGAGGTTTGCTCGCCCGAAAACACCCCCTGCAGTGACGGCTACTAGTGCGGCTACTAAGGCGGTCAATAAAACGTCTGGCGTATTTTCTAGGCGGGAGGCCTACAACAGGGGCCGCTCAGAGGGCGAAATAGTTGCTGAGGAGCGGGTGCAGCAGCTGTTGGGTGGGCTCTGCCCGTCGGGAGGGGTTAGCTCCCCGAAAACCACGGAAGAGCTCATGCGCTCGGCGACCACCTCCGTGGCGGTCATCACAAACGTGGCCGTAAAAAGTAAAAACCTCAAGGGCACCTTCGCCAAGGGCCTGAAGGAGTCCGCCACGGTTTTAAAGGAGGTGGTAGAGGCGTTTTTTGAGCGGTCCACTGCCAACGAGACGGAGACCAAGAGTCTGCGGGCGGATAACACCCGCCTGAAGGAAGAAATAAAGGTGCTCCGCGCTGAAGTAGCACAGTTGCGTGCCGATTTCAGATCGGCTAGAGGCGGTTCTGTCCCGGTCTTAGTCAGGACCGGGCCTGTCGGTTCTGTCCCGGCCCATAGTGGAGAGCGCCCATCTGACGACTCCTTTACGGAGGCGCAGATGAGGCAGCTCTCACGTCTTATCGGGGGCATCGTCGACGCTCGCCTTGAGGGCGTTAAGGCGCGTCTGCCCCCAGAACCGAGCTGTCGGCCGCCGCTCGCGGGGCCCAAGCCCCCTAAGGGGCCACTGATGGCACGCCCGGCGCCACCACCGCGTACAGCGGGACGCGCTGGGTCTCTTAGAGCAGCGGGCACCTCAAGCGCCCAAACTGCTCAAGTCCGTCCCCCGGCGCCTGCGCCTGCTCCCGCCTCTGCTCCCTCTGGTGAGGAATGGCAAACGGCAGGGAGCAAGAAATCGCGGCGCCGGCATAAGCAGAGGgccgctgccgcctcctctgtgcaggaggagggtcccgagcgcacttcgctcggggaatcccttgcggagccagtcggctccgtctgtcccacgtactctgggggggacaCCCAACGACGCACCCGCCCTGTGCCGAGCGGTAAATCCGGCACCGAGAAAAGCGCGAAGGTCGCGCTTCGATCTTCCAACCGCAGTGCGGTTCTTAAGAAGGGTCGAAAGACCACAAGGAGGGCTAAGGCGAGTGGGGATCCATCCCCGTTGCCGCCTAAAGTCCCGCGGAAGGGGTCTCTGACCGCTTCTGAGTCCTCCATTACGGACGTGGACGTAATGGAGGATACCGAAGTCCACGAAGGGGGTGTAGTGTGTGGCGAGCAGCCAGGTAGGCTCTGGGCTTCCCAGAAGCGTAACCGCAGCCGTTCGCCGCATCAGGAAGTGCCGGGGGTCTCTCAGACTCCTAGCAAAATTCCGACCGCCCGCAGAGGAGGAATTGGTAGTTCAGTGTTCTCCAGTAGGGAGAAATACCTACGCACGCGCACGGAGGGTGAGCGCACCGCCGAAGACCGGGTGGTGCAGCTCCTCGCCGAGGACCAACGGTCGGGGGGGGACCTTGAATCCCCTCCTACTACGTCGGAACTCAAGGCGAGGGCCGGGGATGCGGTAGCCGCGATTTACGA
GGTGGCTACCAAATCCACCAATTTGAAGGGCCCCTTTGTGCGGGGCCTTAAAGAGGCAGGCGCAAAACTGCTTGTAGTGGTGGACGACCTGGCTGGTCGAACAGACGCCACCACCAGGGAGATACAGCAGTTGCGCTCAGAGAACGCCACTCTGCGGTCGGACATGGCGGAGCTCCGCGCCGAGCTCGCCAGATTGCGGGCTGACTTGGCCTCCTCCCAAGCCGGTCCACCCCGGGCTGTATTGACCAGTCCGGGCAAGACGCAGCCTTCCACCTCTCCTGGAGGCATTGAGGCGCAGTTGCGGCAGTTAATGGGCTCATTCGAGTCTAAGATCGCTGCTCGCATCGGGAGTCTGGAGGCAAGATTGCCTCCGGTGCCGACCTCCCGCACACCGCCGGTCGCAAAGCCGTATACGGCTACCGCTCCGACGCCCCTACCCTGTTCTGGGGGACGCGTTGGGACCTACGCGACACGCGCGGCGTCTGCGCCTAAGAAGACGACGGCACGGGCTACGGCCCCTCCAGCGGTCGAGAGAACGCCAGCCCAGTCCCGGGGAGGCCCCCAGCCAACCCAGGTTTTGACGGCAGCACCATCTCCCGCTCCGTCGGTCACTCCACGTGCCCGGGAAAACGAGGTTCGTGCCCCAGCGCCCGCACCTGCTCCTGGCCCCCTCGTCTCCGCTGACGAGGGCTGGCAGAAGgagacggtaacagatggagagggagaAGATGCTGTAAGGAGTGGGGGCAGATTGTCCCGAGAGGTGGCTGAATCCGGCCTTTTGAGCGGTTCCAGAACTTCAGTGACGTCTGCGTCGCTGGATAGCCTGTTGCGCTCGGGTCGGTTCGGCTCAGATTCCTCGCTTAGCTCAGTAGTAAGCGATATGGACTCGCTGGCGGGTGGGTCAAGTACCGGTCCACGAACGTTGACAGCTGAGGATAGCCGCGCTCAACAACAAAAGCGCGGCGCATCACCCGGATTTGACGAGAGTCTTCCTGAGGTTGACCTTGACGTCCCCAGGAAGGTTCATTCCTCCCAAAGGGGGGGGTCTGTAGCGAGGTTTACCCGCCCAAAGACACCCCCTGCAGTGACGGCCCCGACCGTTTCAACGGCCGATGTGTTTTCCAGGCGTGAGGCCTATAACAGAGGCCGCACAGAGGGAGAAATAGCTGCTGAGGAGAAAGTCCAGCAGCTGTTGGGTGGGCTCTGCTCGTCGGGAGCGGTTGGCACCCCAAAATCCATCGAGGAACTCGTCCAATCGGCGACCAAATCAGtggcggttattgcgaatgtggccacgaaaagcaaaaacttgaagggcacattcgcaaaaggtttgaaggaagcagcctccatcctaaaggaggtggtaaacacgttctccgaacgtactgctgccactgaggtGGAGATGGAGAAGATGCGGGCCGACAACACCCGACTGAAGGATGAACTAAAGACGGTCCGGGCAGAGATGTCTCAACTGCGTGCCGACTTCGGATCGCCTAGAGCCGGTTCTGTCCCGGCCCATAGTGGAGAGCGCCCACCTGACGACTCCTTTACGGAGGCGCAGATGAGGCAGCTCTCACGTCTTATCGGGGGCATCGTCGACGCTCGCCTTGAGGGCGTTAAGGCGCGTCTGCCCCCGGAACCGAGTTGCCGGCCGCCACTCGCGGCGCCAAAGCCCCCAAAGGGGCGATTGATGGCACTCCCGGCGCCACCAACCCAAACAGGGGGACGCGCCGGGTCTTTTGCGGCAGTGGTTGCCTCAGGACCAGTCGGCCGAAGTGCGCGGAGTGGAGATTCCCATCCCCCGCAGCGTGTGCCAAGAGGAGCACCCCGTGTTGCCGTAGTGGGCGCCTCAAGCGCCCCATCGGCCCAAGTCTGCCACCAAGCGCCAGCGTCTGCTCCCGCCTCCGCCCCCTCTGGAGAGGACGCAGCAAGGAGTGGGGGCAGTTTGCCCCGAGAAGTGGCGGAGGCCGGCCCTTTGAGCGGTTCCA
GAACTTCGGTGACGTCTGGGTCACTGGATAGCCTGTTGCGCTCGGGCCGGTTGGGCTCCGATTCATCGCTGAGCTCAGTCAGTGATATGGACTTGTTAGTGGGTCCACAGACCCTGTCAGCTGAGCACCGCCGCGCTCAACAACAAAAGCGCGGCGCATCTCCCGGAGGTGACGGAAACCTTCCTGAGGTTGACCTCGAGGTCCCCAGGAAGGTTCATTCTGCTCAAAGAGGGGGTTCCGTGGCGAGGTTCGCTCGCCCGAAAACACCCCCTGCAGTGACGGCACCGGTCGTTCCAACGTCCGGCGTGTTTTCTAGGCGGGACGCCTACAACAGAGGCCGCACGGAGGGGGAAATAGTTGCTGAGGAGCGGGTCCAGCAACTGTTGGGTAAGCTTCGCCCGTCGGAGGGAGTTAGCTCTCCGAAGACCATGGAGGAGCTAATGGAATCGGCGGCCACCTCCGTGAAGGTCATAGCGAACGTGGCTCAAAAGAGCAAGaaccttaagggcacgttcgccaagggcctgaagacgtcggcctcgacattaaaagaggtggtggaggcgatgtttgagcgcttcagtgccaacgaggtggagacacggaggttgcgggccgacaactcccgcttgGAGGAAGACCTGAAGGCACTTCGCTCCGAGTTAGCTCTGTTGCGTGCCGATCTTCGATCGACTGGAGCTGGTTCTGCCCCGGCCGTAGTCAGTGCTGTGTCGGTCGGTTCTGTCCAGGCCCATAGtggagccggaggcggagcacagagggagaaggaaaaggagaAAGAGAAGGAGAAGGAGAAAGAGGTGGAGGACGTAACGCGAAGGAGCGCGGGCAGTTTGCCTCAAGAAGCGGCCAATTCCGGCCGGCTAAGCGGTTCCAGGACTTCGGTGACCTCTGCGTCACTGGAGAGTTTGTTGCGCTCGGGCCGGTTCGGCTCGGAATCGTCGCTCAGCTCAGCGATGAGCGATTTGGACCAGTCGATGGGTGGACCAGTCATCGGTCCAGACGCGCCTACGGCTGATGAGTGCCGCGCTAGGCAGCAGAAGCGCGGTGCTTCGCCTGGAAGCGGTGAGAACCTTCCTGAGGTTGACGTTGGAGTCCCCAGGAAGGTTCACTCTTCCCAGAGAGGGGGATCTGTGACTAGATTTGCTCGGCCGCAGACACCCCCTGCAAAGACGCCGGTCGCAACAACGAGTGGCGTTTTTTCTAGGCGGGAGGCCTACAACCGAGGCCGCACAGAAGGAGAGTTAGTCGCTGAGGACCAAGTCCAGCAACTGATGGACGGGCTCCACTCGTCAGGAGGGATAATTGCCCCGAAAACCGTGACGGAGCTCGTGGAGTCGGCGACTGCTTCAATAGCGGTCATTGCGAAGggcaccttcgcaaaaggcttgaaggagtcagcctccgtccttaaagaggtggtggacaccttctccgaccgttgcATGGAGTTGCGGGCCAATGGCTCCCGCTTAGAGGCTGAAGTCGGGGCGCTCCGCGCCGAGATCAGTCAGTTGCGGGCCGAAGTTAAATCGGTCAAGGCCGGTTCTGTCCCGGCCAGTGTAGAGGCTGTGCCAACAGGTTCCGTCCTGAATGTGCCAGCCCAAACTAGAGAGCTGTCGCCTGGCGACTCCTTTACGGAGGCGCAAATGAGACAGCTCACAATTCTTTTCGGGAGCCTTATTGACGCGCACCTTGAGGGCATCAAGGCGCGTCTGCCCCCGGAGCCGAGCTGTCGGCCGCCGCTCGCAGTGCCCAAGCCGAAGCCTTCTCAGGGGCCACCAATGGGGCCGCCACCACCCCGTACAGGGGGACGGGCCGGGTCTTTTGCAGCAGCGGCTGCCTCAAGACCGGTTGAGCAAGGTGTGCGGCCGCGGAGGGGAGATCCCAAACCCCCGCCGCAGCGTGCGCCGAGAGGAGCCGTTGTAGATGCTTCATGCGCCCCAGCGGCTCAGGTCCATCACCCGGCGCCTGCGCCT
GCTCCCGCCACTGCCACCTCTTGTGAGGAATGGCAGTAG
Protein Sequence
MDSNKLRNIFNKRSTTLSEKEKEKEKEKEKEREGENVLQRSAGSLPLEAADSGRLSRSRTSVTSASLESLLRSGQFGSESSVSSAVSDLDQMVAGLGNAPETSTAGERRAQHQKRGVSPSDDENLPGLDLEAPRKIHTSQRGGSVARFARPKTPPAVTATSAATKAVNKTSGVFSRREAYNRGRSEGEIVAEERVQQLLGGLCPSGGVSSPKTTEELMRSATTSVAVITNVAVKSKNLKGTFAKGLKESATVLKEVVEAFFERSTANETETKSLRADNTRLKEEIKVLRAEVAQLRADFRSARGGSVPVLVRTGPVGSVPAHSGERPSDDSFTEAQMRQLSRLIGGIVDARLEGVKARLPPEPSCRPPLAGPKPPKGPLMARPAPPPRTAGRAGSLRAAGTSSAQTAQVRPPAPAPAPASAPSGEEWQTAGSKKSRRRHKQRAAAASSVQEEGPERTSLGESLAEPVGSVCPTYSGGDTQRRTRPVPSGKSGTEKSAKVALRSSNRSAVLKKGRKTTRRAKASGDPSPLPPKVPRKGSLTASESSITDVDVMEDTEVHEGGVVCGEQPGRLWASQKRNRSRSPHQEVPGVSQTPSKIPTARRGGIGSSVFSSREKYLRTRTEGERTAEDRVVQLLAEDQRSGGDLESPPTTSELKARAGDAVAAIYEVATKSTNLKGPFVRGLKEAGAKLLVVVDDLAGRTDATTREIQQLRSENATLRSDMAELRAELARLRADLASSQAGPPRAVLTSPGKTQPSTSPGGIEAQLRQLMGSFESKIAARIGSLEARLPPVPTSRTPPVAKPYTATAPTPLPCSGGRVGTYATRAASAPKKTTARATAPPAVERTPAQSRGGPQPTQVLTAAPSPAPSVTPRARENEVRAPAPAPAPGPLVSADEGWQKETVTDGEGEDAVRSGGRLSREVAESGLLSGSRTSVTSASLDSLLRSGRFGSDSSLSSVVSDMDSLAGGSSTGPRTLTAEDSRAQQQKRGASPGFDESLPEVDLDVPRKVHSSQRGGSVARFTRPKTPPAVTAPTVSTADVFSRREAYNRGRTEGEIAAEEKVQQLLGGLCSSGAVGTPKSIEELVQSATKSVAVIANVATKSKNLKGTFAKGLKEAASILKEVVNTFSERTAATEVEMEKMRADNTRLKDELKTVRAEMSQLRADFGSPRAGSVPAHSGERPPDDSFTEAQMRQLSRLIGGIVDARLEGVKARLPPEPSCRPPLAAPKPPKGRLMALPAPPTQTGGRAGSFAAVVASGPVGRSARSGDSHPPQRVPRGAPRVAVVGASSAPSAQVCHQAPASAPASAPSGEDAARSGGSLPREVAEAGPLSGSRTSVTSGSLDSLLRSGRLGSDSSLSSVSDMDLLVGPQTLSAEHRRAQQQKRGASPGGDGNLPEVDLEVPRKVHSAQRGGSVARFARPKTPPAVTAPVVPTSGVFSRRDAYNRGRTEGEIVAEERVQQLLGKLRPSEGVSSPKTMEELMESAATSVKVIANVAQKSKNLKGTFAKGLKTSASTLKEVVEAMFERFSANEVETRRLRADNSRLEEDLKALRSELALLRADLRSTGAGSAPAVVSAVSVGSVQAHSGAGGGAQREKEKEKEKEKEKEVEDVTRRSAGSLPQEAANSGRLSGSRTSVTSASLESLLRSGRFGSESSLSSAMSDLDQSMGGPVIGPDAPTADECRARQQKRGASPGSGENLPEVDVGVPRKVHSSQRGGSVTRFARPQTPPAKTPVATTSGVFSRREAYNRGRTEGELVAEDQVQQLMDGLHSSGGIIAPKTVTELVESATASIAVIAKGTFAKGLKESASVLKEVVDTFSDRCMELRANGSRLEAEVGALRAEISQLRAEVKSVKAGSVPASVEAVPTGSVLNVPAQTRELSPGDSFTEAQMRQLTILFGSLIDAHLEGIKARLPPEPSCRPPLAVPKPKPSQGPPMGPPPPRTGGRAGSFAAAAASRPVEQGVRPRRGDPKPPPQRAPRGAVVDASCAPAAQVHHPAPAP
APATATSCEEWQ
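
The protein sequence corresponds to the coding sequence read codon by codon. The sketch below shows that relationship using the standard genetic code, which is an assumption here because the record does not name the translation table it used. The `translate` helper is hypothetical and accepts mixed-case input, since the coding sequence above mixes upper and lower case.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is removed and case is ignored. Codons containing
    characters other than A, C, G, T are rendered as 'X'. A trailing
    partial codon is ignored.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

if __name__ == "__main__":
    # First seven and last four codons of the coding sequence above.
    print(translate("atggacagtaacaaactacga"))
    print(translate("GAATGGCAGTAG"))
```

The first seven codons give MDSNKLR and the final codons give EWQ followed by a stop, which agrees with the start and end of the protein sequence listed above.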

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-