Bhor018529.1
Basic Information
- Insect
- Bombus hortorum
- Gene Symbol
- -
- Assembly
- GCA_905332935.1
- Location
- HG995195.1:9259-34689[-]
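The location string above packs a sequence accession, a coordinate range, and a strand into one field. The following is a minimal sketch of splitting it into its parts. The names `Location`, `parse_location`, and `span_length` are hypothetical helpers, not part of any database API, and the length calculation assumes 1-based inclusive coordinates, which the record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str


# Matches strings shaped like "HG995195.1:9259-34689[-]".
LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split an 'accession:start-end[strand]' string into typed fields."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["seqid"], int(match["start"]), int(match["end"]), match["strand"]
    )


def span_length(loc: Location) -> int:
    """Genomic span in bases, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("HG995195.1:9259-34689[-]")
print(loc.seqid, loc.start, loc.end, loc.strand)
print(span_length(loc))
```

Under the inclusive-coordinate assumption, this entry spans 25431 bases on the minus strand of HG995195.1.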
Transcription Factor Domain
- TF Family
- TF_bZIP
- Domain
- bZIP domain
- PFAM
- AnimalTFDB
- TF Group
- Basic Domains group
- Description
- bZIP proteins form homo- or heterodimers. Each monomer contains a highly basic DNA-binding region adjacent to an α-helical region, and the helices of the two monomers fold together as a coiled coil.
- Hmmscan Out
-
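| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|----|----------|----------|-------|------|----------------|--------------|----------------|--------------|----------------|--------------|------|
| 1 | 4 | 0.074 | 32 | 4.9 | 0.0 | 26 | 46 | 146 | 166 | 144 | 173 | 0.87 |
| 2 | 4 | 0.074 | 32 | 4.9 | 0.0 | 26 | 46 | 711 | 731 | 709 | 738 | 0.87 |
| 3 | 4 | 0.074 | 32 | 4.9 | 0.0 | 26 | 46 | 1276 | 1296 | 1274 | 1303 | 0.87 |
| 4 | 4 | 0.074 | 32 | 4.9 | 0.0 | 26 | 46 | 1914 | 1934 | 1912 | 1941 | 0.87 |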
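The four domain rows above share identical scores and HMM coordinates and differ only in where they fall on the protein. A minimal sketch of reading the rows into typed records and measuring the distance between consecutive hits follows. `DomainHit` and `parse_hits` are hypothetical names, and the thirteen-column layout is assumed from the header of this record rather than taken from a parser library.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    index: int
    of: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# One converter per column, in the order given by the record's header.
CONVERTERS = (int, int, float, float, float, float,
              int, int, int, int, int, int, float)

# Rows copied from the Hmmscan Out field of this record.
ROWS = """\
1 4 0.074 32 4.9 0.0 26 46 146 166 144 173 0.87
2 4 0.074 32 4.9 0.0 26 46 711 731 709 738 0.87
3 4 0.074 32 4.9 0.0 26 46 1276 1296 1274 1303 0.87
4 4 0.074 32 4.9 0.0 26 46 1914 1934 1912 1941 0.87
"""


def parse_hits(text: str) -> List[DomainHit]:
    """Parse whitespace-separated per-domain rows, skipping malformed lines."""
    hits = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != len(CONVERTERS):
            continue
        hits.append(DomainHit(*(conv(v) for conv, v in zip(CONVERTERS, fields))))
    return hits


hits = parse_hits(ROWS)
# Aligned length of each hit on the protein (inclusive coordinates).
lengths = [h.ali_to - h.ali_from + 1 for h in hits]
# Distance between the starts of consecutive hits.
spacing = [b.ali_from - a.ali_from for a, b in zip(hits, hits[1:])]
print(lengths)
print(spacing)
```

Each hit covers 21 residues, and the starts are 565, 565, and 638 residues apart. This regular spacing may reflect the repeated sequence blocks visible in the protein sequence, though the record itself does not say so.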
Sequence Information
- Coding Sequence
- ATGCCTCACGCTAGAGCGGACCGAAGGACAACAGAGGTCCCGGAAGTAGCCCCAAGAAGAGTGACCAGGGGCTCGTTGAAGGCCGCGGAGAACGTGTTCCTTTCTCCGCCTATGGTGATTGCCGACACACCGTCGATCCCAACCGCAGGTGTGTTTCGCATGCGAGTGCGATCCGACGACGAAGAGTCGGTCGCCTCGTCGCTGGCCTCGTCGGTTTCGAGAGGCAAGAAGCGGCGAATCGTGTCGACGGTCGCGAGCGCGTCGGAGGAGCTCGCGATCGACGTGCGGACTTGCAGTCCGGCGGACGCTCACGTCGAGTTTACGAGGCAAGCGGCACGAATCATATACGTGTCGTCGTCATCGTCAAACATGAAGGGGACGCATATAAAGATCCTCAGGGACGCGTCCAAGGTAATGCAGGAGCTGTGGACGACGAAGATCGCGGAAATGGAAGCGCGGCTGTTCGCGCTCGAGAAAGAGAACGCAGCCCTTCGCGAAGGGACGTCCAGAATGGCAGTTTGCGCCAACGTGTGCGCGCAGTGCGGCGGCCCGGCTTCCGAGCCCGGCCGTCCTCCGCGAGAGCGCAACAACGACGGTGCGACCGAAGATGACCTGGAAAGGAGGTTTCGGCAGCTAGAAGCCTCCCTGAAGAGAATCGAGGAGCGTCTCGGGGAACGACAACCGTGCGTCCCAGAGCCCCGACCCGTCCCGGAACCCCGACGCGTCGCGGATCCCCCCGCCACTCGCCGCGCCACCCCGACAACTGCGCCCCCGAGGGAGCAGGCGGATGGCGAATGGCGAGTGGTCGAAAGGAGGAGGAACAGGAGAAGAACGGCCGCCGCCGCCGCGGTCGGCAAAGAAGTATCGACAACATCAAGAAGAGGGGCGAACGCGGCCCCACCGCCGTCGACGAGGAGTACAGGCGCGCCAAAACCGAGCGTCGCCGCCACCGCCACGAGGAAGGGGCCGATACCGGCACCGAGGACAATGCCGCCGCCCCCCCGGTCACCGCAGACATCCGCGGTGACCGTAACGTTGAAAGAAGGATCCAGCATGTCGTACGCCGACGTGCTGGCAACGGCCCGAAGGACGATCCCGCTCGCGGAGATCGGCGTGGGGAAACTCGGAATGAAAAAGAGTATGACGGGCGGCATTATAATCGAAGTGCCCGGGGACAAGGACAGAGAAAAAGCGGAGCGGCTAGCGACGCGTCTGTCCGAGGTGCTAGGCCCGGCCACGGCAAAGGTCGCAGCTCCGACGAAGACCGCGGAGCTGAGAGTATCCGGGCTAGATATCTCGGTTACCAAAGAGGAGTTGCGGCAAGCATTGGCCGCAGCGGCGGGTTGCAGAGCCGCCGATATACGAACGGGAGACATCGGCATCGCCAGAGACGGCCTCGCAATGGCCTGGATCAAGTGCCCGGAGGTCGCAGCCTGGAAGTTGGCGCAGGCAAGAAGAGTGCCTATGGGGTGGTCGAAGGCAAAGATCAGGGCCATCCCAAAGAAACCCCTGCAGTGCTACAAATGCCTGCAGTACGGTCACGTGCGAGCCACCTGCACGTCCACCGTGAACCGAGAGCACCTGTGCTTCAGGTGCGGCGGAGCCGGACACCGCGCCAAAGGCTGTCCCGCCTCCGCACCCAAGTGCCTCCTGTGCGAGTCGCTCGGGGTGCCCGCCAACCACAAGATGGGCGGAGACGCGTACGCTAGAGCGGACCAAAGGACAACAGAGGTCCCGGAAGTAGCCCCAAGAAGAGTGACGAGGGGCTCGTTGAAGGCCGCGGAGAACGTGTTCCTTTCTCCGCCTATGGTGATTGCCGACACACCGTCGATCCCAACCGCAGGTGTGTTTCGCATGCGAGTGCGATCCGACGACGAAGAGTCGGTCGCCTCGTCGCTGGCCTCGTCGGTTTCGAGAGGCAAGAAGCGGCGAATCGTGTCGACGGTCGCGAGCGCGTCGGAGGAGCTCGCGATCGACGTGAGGACTTGCAGTCCGGCG
GACGCTCACGTCGAGTTTACGAGGCAAGCGGCACGAATCATATACGTGTCGTCGTCATCGTCAAACATGAAGGGGACGCATATAAAGATCCTCAGGGACGCGTCCAAGGTAATGCAGGAGCTGTGGACGACGAAGATCGCGGAAATGGAAGCGCGGCTGTTCGCGCTCGAGAAAGAGAACGCAGCCCTTCGCGAAGGGACGTCCAGAATGGCAGTTTGCGCCAACGTGTGCGCGCAGTGCGGCGGCCCGGCTTCCGAGCCCGGCCGTCCTCCGCGAGAACGCAACAACGACGGTGCGACCGAAGAGGACCTGGAAAGGAGGTTTCGGCAGCTCGAAGCCTCCCTGAAGAGAATCGAGGAGCGTCTCGGGGAACGACAACCGCGCGTCCCAGAGCCCCGACCCGTCCCGGAACCCCGACGCGTCGCGGATCCCCCCGCCACTCGCCGCGCCACCCCGACAACTGCGACCCCGAGGGAGCAGGCGGATGGCGAATGGCGAGTGGTCGAAAGGAGGAGGAACAGGAGAAGAACGGCCGCCGCCGCCGCGGTCGGCAAAGAAGTATCGACAACATCAAGAAGAGGGGCGAACGCGGCCCCACCGCCGTCGACGAGGAGTACAGGCGCGCCAAAACCGAGCGTCGCCGCCACCGCCACGAGGAAGGGGCCGATACCGGCACCGAGGACAATGCCGCCGCCCCCCCGGTCACCGCAGACATCCGCGGTGACCGTAACGTTGAAAGAAGGATCCAGCATGTCGTACGCCGACGTGCTGGCAACGGCCCGAAGGACGATCCCGCTCGCGGAGATCGGCGTGGGGAAACTCGGAATGAAAAAGAGTATGACGGGCGGCATTATAATCGAAGTGCCCGGGGACAAGGACAGAGAAAAAGCGGAGCGGCTAGCGACGCGTCTGTCCGAGGTGCTAGGCCCGGCCACGGCAAAGGTCGCAGCCCCGACGAAGACCGCGGAGCTGAGAGTATCCGGGCTAGATATCTCGGTTACCAAAGAGGAGTTGCGGCAAGCATTGGCCGCAGCGGCGGGTTGCAGAGCCGCCGATATACGAACGGGAGACATCGGCATCGCCAGAGACGGCCTCGCAATGGCCTGGATCAAGTGCCCGGAGGTCGCAGCCTGGAAGTTGGCGCAGGCAAGAAGAGTGCCTATGGGGTGGTCGAAGGCAAAGATCAGGGCCATCCCAAAGAAACCCCTGCAGTGCTACAAATGCCTGCAGTACGGTCACGTGCGAGCCACCTGCACGTCCACCGTGAACCGAGAGCACCTGTGCTTCAGGTGCGGCGGAGCCGGACACCGCGCCAAAGGCTGTCCCGCCTCCGCACCCAAGTGCCTCCTGTGCGAGTCGCTCGGGGTGCCCGCCAACCACAAGATGGGCGGAGACGCGTACGCTAGAGCGGACCAAAGGACAACAGAGGTCCCGGAAGTAGCCCCAAGAAGAGTGACCAGGGGCTCGTTGAAGGCCGCGGAGAACGTGTTCCTTTCTCCGCCTATGGTGATTGCCGACACACCGTCGATCCCAACCGCAGGTGTGTTTCGCATGCGAGTGCGATCCGACGACGAAGAGTCGGTCGCCTCGTCGCTGGCCTCGTCGGTTTCGAGAGGCAAGAAGCGGCGAATCGTGTCGACGGTCGCGAGCGCGTCGGAGGAGCTCGCGATCGACGTGAGGACTTGCAGTCCGGCGGACGCTCACGTCGAGTTTACGAGGCAAGCGGCACGAATCATATACGTGTCGTCGTCATCGTCAAACATGAAGGGGACGCATATAAAGATCCTCAGGGACGCGTCCAAGGTAATGCAGGAGCTGTGGACGACGAAGATCGCGGAAATGGAAGCGCGGCTGTTCGCGCTCGAGAAAGAGAACGCAGCCCTTCGCGAAGGGACGTCCAGAATGGCAGTTTGCGCCAACGTGTGCGCGCAGTGCGGCGGCCCGGCTTCCGAGCCCGGCCGTCCTCCGCGAGAGCGCAACAACGACGGTGCGACCGAAGA
TGACCTGGAAAGGAGGTTTCGGCAGCTAGAAGCCTCCCTGAAGAGAATCGAGGAGCGTCTCGGGGAACGACAACCGTGCGTCCCAGAGCCCCGACCCGTCCCGGAACCCCGACGCGTCGCGGATCCCCCCGCCACTCGCCGCGCCACCCCGACAACTGCGCCCCCGAGGGAGCAGGCGGATGGCGAATGGCGAGTGGTCGAAAGGAGGAGGAACAGGAGAAGAACGGCCGCCGCCGCCGCGGTCGGCAAAGAAGTATCGACAACATCAAGAAGAGGGGCGAACGCGGCCCCACCGCCGTCGACGAGGAGTACAGGCGCGCCAAAACCGAGCGTCGCCGCCACCGCCACGAGGAAGGGGCCGATACCGGCACCGAGGACAATGCCGCCGCCCCCCCGGTCACCGCAGACATCCGCGGTGACCGTAACGTTGAAAGAAGGATCCAGCATGTCGTACGCCGACGTGCTGGCAACGGCCCGAAGGACGATCCCGCTCGCGGAGATCGGCGTGGGGAAACTCGGAATGAAAAAGAGTATGACGGGCGGCATTATAATCGAAGTGCCCGGGGACAAGGACAGAGAAAAAGCGGAGCGGCTAGCGACGCGTCTGTCCGAGGTGCTAGGCCCGGCCACGGCAAAGGTCGCAGCCCCGACGAAGACCGCGGAGCTGAGAGTATCCGGGCTAGATATCTCGGTTACCAAAGAGGAGTTGCGGCAAGCATTGGCCGCAGCGGCGGGTTGCAGAGCCGCCGATATACGAACGGGAGACATCGGCATCGCCAGAGACGGCCTCGCAATGGCCTGGATCAAGTGCCCGGAGGTCGCAGCCTGGAAGTTGGCGCAGGCAAGAAGAGTGCCTATGGGGTGGTCGAAGGCAAAGATCAGGGCCATCCCGAAGAAACCCCTGCAGTGCTACAAATGCCTGCAGTACGGTCACGTGCGAGCCACCTGCACGTCCACCGTGAACCGAGAGCACCTGTGCTTCAGGTGCGGCGGAGCCGGACACCGCGCCAAAGGCTGTCCCGCCTCCGCACCCAAGTGCCTCCTGTGCGAGTCGCTCGGGGTGCCCGCCAACCACAAGATGGGCGGAGACGCGTGGGGTAGCACCACTCCCTGGGAAACCCCGATGGATCTGATATTCCCCTATAGCCGGGTGGCGGCTGATCGGTGCCTCTGGCACCCGTCAGTTTGCCGATCCGGTGCGGAGAATTTCGGAGAAGTTGGTGGGCCCATGTCTGCCAAGACCACTCACCTCACCTTCCCGTGGGGCTCCCCGTCACCCTGCACCGGCTTTGACCGGGACAGGGTGCGAAAGAACGCTAGAGCGGACCAAAGGACAACAGAGGTCCCGGAAGTAGCCCCAAGAAGAGTGACGAGGGGCTCGTTGAAGGCCGCGGAGAACGTGTTCCTTTCTCCGCCTATGGTGATTGCCGACACACCGTCGATCCCAACCGCAGGTGTGTTTCGCATGCGAGTGCGATCCGACGACGAAGAGTCGGTCGCCTCGTCGCTGGCCTCGTCGGTTTCGAGAGGCAAGAAGCGGCGAATCGTGTCGACGGTCGCGAGCGCGTCGGAGGAGCTCGCGATCGACGTGAGGACTTGCAGTCCGGCGGACGCTCACGTCGAGTTTACGAGGCAAGCGGCACGAATCATATACGTGTCGTCGTCATCGTCAAACATGAAGGGGACGCATATAAAGATCCTCAGGGACGCGTCCAAGGTAATGCAGGAGCTGTGGACGACGAAGATCGCGGAAATGGAAGCGCGGCTGTTCGCGCTCGAGAAAGAGAACGCAGCCCTTCGCGAAGGGACGTCCAGAATGGCAGTTTGCGCCAACGTGTGCGCGCAGTGCGGCGGCCCGGCTTCCGAGCCCGGCCGTCCTCCGCGAGAGCGCAACAACGACGGTGCGACCGAAGATGACCTGGAAAGGAGGTTTCGGCAGCTAGAAGCCTCCCTGAAGAGAATCGAGGAGCGTCTCGGGGAACGACAACCGTGCGTCCCAG
AGCCCCGACCCGTCCCGGAACCCCGACGCGTCGCGGATCCCCCCGCCACTCGCCGCGCCACCCCGACAACTGCGCCCCCGAGGGAGCAGGCGGATGGCGAATGGCGAGTGGTCGAAAGGAGGAGGAACAGGAGAAGAACGGCCGCCGCCGCCGCGGTCGGCAAAGAAGTATCGACAACATCAAGAAGAGGGGCGAACGCGGCCCCACCGCCGTCGACGAGGAGTACAGGCGCGCCAAAACCGAGCGTCGCCGCCACCGCCACGAGGAAGGGGCCGATACCGGCACCGAGGACAATGCCGCCGCCCCCCCGGTCACCGCAGACATCCGCGGTGACCGTAACGTTGAAAGAAGGATCCAGCATGTCGTACGCCGACGTGCTGGCAACGGCCCGAAGGACGATCCCGCTCGCGGAGATCGGCGTGGGGAAACTCGGAATGAAAAAGAGTATGACGGGCGGCATTATAATCGAAGTGCCCGGGGACAAGGACAGAGAAAAAGCGGAGCGGCTAGCGACGCGTCTGTCCGAGGTGCTAGGCCCGGCCACGGCAAAGGTCGCAGCCCCGACGAAGACCGCGGAGCTGAGAGTATCCGGGCTAGATATCTCGGTTACCAAAGAGGAGTTGCGGCAAGCATTGGCCGCAGCGGCGGGTTGCAGAGCCGCCGATATACGAACGGGAGACATCGGCATCGCCAGAGACGGCCTCGCAATGGCCTGGATCAAGTGCCCGGAGGTCGCAGCCTGGAAGTTGGCGCAGGCAAGAAGAGTGCCTATGGGGTGGTCGAAGGCAAAGATCAGGGCCATCCCAAAGAAACCCCTGCAGTGCTACAAATGCCTGCAGTACGGTCACGTGCGAGCCACCTGCACGTCCACCGTGAACCGAGAGCACCTGTGCTTCAGGTGCGGCGGAGCCGGACACCGCGCCAAAGGCTGTCCCGCCTCCGCACCCAAGTGCCTCCTGTGCGAGTCGCTCGGGGTGCCCGCCAACCACAAGATGGGCGGAGACGCGTATCGTCGTCGTCGTCGCGACGCAGAAGAGACACCGCGCGCCGCAGGGTTCGCGGCGCTGCCGCCGCCGCCGCCACCCCAAGAGCGTCGGACCAAGATCGAGGCGCTAGGGGGAGTGGTCCCCCCTAGCGACCAAACCAACAAGTGA
- Protein Sequence
- MPHARADRRTTEVPEVAPRRVTRGSLKAAENVFLSPPMVIADTPSIPTAGVFRMRVRSDDEESVASSLASSVSRGKKRRIVSTVASASEELAIDVRTCSPADAHVEFTRQAARIIYVSSSSSNMKGTHIKILRDASKVMQELWTTKIAEMEARLFALEKENAALREGTSRMAVCANVCAQCGGPASEPGRPPRERNNDGATEDDLERRFRQLEASLKRIEERLGERQPCVPEPRPVPEPRRVADPPATRRATPTTAPPREQADGEWRVVERRRNRRRTAAAAAVGKEVSTTSRRGANAAPPPSTRSTGAPKPSVAATATRKGPIPAPRTMPPPPRSPQTSAVTVTLKEGSSMSYADVLATARRTIPLAEIGVGKLGMKKSMTGGIIIEVPGDKDREKAERLATRLSEVLGPATAKVAAPTKTAELRVSGLDISVTKEELRQALAAAAGCRAADIRTGDIGIARDGLAMAWIKCPEVAAWKLAQARRVPMGWSKAKIRAIPKKPLQCYKCLQYGHVRATCTSTVNREHLCFRCGGAGHRAKGCPASAPKCLLCESLGVPANHKMGGDAYARADQRTTEVPEVAPRRVTRGSLKAAENVFLSPPMVIADTPSIPTAGVFRMRVRSDDEESVASSLASSVSRGKKRRIVSTVASASEELAIDVRTCSPADAHVEFTRQAARIIYVSSSSSNMKGTHIKILRDASKVMQELWTTKIAEMEARLFALEKENAALREGTSRMAVCANVCAQCGGPASEPGRPPRERNNDGATEEDLERRFRQLEASLKRIEERLGERQPRVPEPRPVPEPRRVADPPATRRATPTTATPREQADGEWRVVERRRNRRRTAAAAAVGKEVSTTSRRGANAAPPPSTRSTGAPKPSVAATATRKGPIPAPRTMPPPPRSPQTSAVTVTLKEGSSMSYADVLATARRTIPLAEIGVGKLGMKKSMTGGIIIEVPGDKDREKAERLATRLSEVLGPATAKVAAPTKTAELRVSGLDISVTKEELRQALAAAAGCRAADIRTGDIGIARDGLAMAWIKCPEVAAWKLAQARRVPMGWSKAKIRAIPKKPLQCYKCLQYGHVRATCTSTVNREHLCFRCGGAGHRAKGCPASAPKCLLCESLGVPANHKMGGDAYARADQRTTEVPEVAPRRVTRGSLKAAENVFLSPPMVIADTPSIPTAGVFRMRVRSDDEESVASSLASSVSRGKKRRIVSTVASASEELAIDVRTCSPADAHVEFTRQAARIIYVSSSSSNMKGTHIKILRDASKVMQELWTTKIAEMEARLFALEKENAALREGTSRMAVCANVCAQCGGPASEPGRPPRERNNDGATEDDLERRFRQLEASLKRIEERLGERQPCVPEPRPVPEPRRVADPPATRRATPTTAPPREQADGEWRVVERRRNRRRTAAAAAVGKEVSTTSRRGANAAPPPSTRSTGAPKPSVAATATRKGPIPAPRTMPPPPRSPQTSAVTVTLKEGSSMSYADVLATARRTIPLAEIGVGKLGMKKSMTGGIIIEVPGDKDREKAERLATRLSEVLGPATAKVAAPTKTAELRVSGLDISVTKEELRQALAAAAGCRAADIRTGDIGIARDGLAMAWIKCPEVAAWKLAQARRVPMGWSKAKIRAIPKKPLQCYKCLQYGHVRATCTSTVNREHLCFRCGGAGHRAKGCPASAPKCLLCESLGVPANHKMGGDAWGSTTPWETPMDLIFPYSRVAADRCLWHPSVCRSGAENFGEVGGPMSAKTTHLTFPWGSPSPCTGFDRDRVRKNARADQRTTEVPEVAPRRVTRGSLKAAENVFLSPPMVIADTPSIPTAGVFRMRVRSDDEESVASSLASSVSRGKKRRIVSTVASASEELAIDVRTCSPADAHVEFTRQAARIIYVSSSSSNMKGTHIKILRDASKVMQELWTTKIAEMEARLFALEKENAALREGTSRMAVCANVCAQCGGPASEPGRPPRERNNDGATEDDLERRFRQLEASLKRIEERLGERQPCV
PEPRPVPEPRRVADPPATRRATPTTAPPREQADGEWRVVERRRNRRRTAAAAAVGKEVSTTSRRGANAAPPPSTRSTGAPKPSVAATATRKGPIPAPRTMPPPPRSPQTSAVTVTLKEGSSMSYADVLATARRTIPLAEIGVGKLGMKKSMTGGIIIEVPGDKDREKAERLATRLSEVLGPATAKVAAPTKTAELRVSGLDISVTKEELRQALAAAAGCRAADIRTGDIGIARDGLAMAWIKCPEVAAWKLAQARRVPMGWSKAKIRAIPKKPLQCYKCLQYGHVRATCTSTVNREHLCFRCGGAGHRAKGCPASAPKCLLCESLGVPANHKMGGDAYRRRRRDAEETPRAAGFAALPPPPPPQERRTKIEALGGVVPPSDQTNK
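One quick consistency check on a record like this is to translate the coding sequence and compare it with the listed protein. The sketch below does this for the first nine and last six codons only. It assumes the standard genetic code, which the record does not state explicitly, and `translate` is a hypothetical helper rather than a library call. Whitespace is removed first because the sequences in this record wrap across lines.

```python
from itertools import product

# Standard genetic code (ASSUMED), laid out in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 27 and last 18 nucleotides of the Coding Sequence field.
cds_start = "ATGCCTCACGCTAGAGCGGACCGAAGG"
cds_end = "GACCAAACCAACAAGTGA"

print(translate(cds_start))  # MPHARADRR
print(translate(cds_end))    # DQTNK*
```

Both fragments agree with the start (`MPHARADRR`) and end (`DQTNK`) of the listed protein sequence, with the final `TGA` read as the stop codon.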
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00220268;
- 90% Identity
- iTF_00220268;
- 80% Identity
- iTF_00220290;