Bros078396.1
Basic Information
- Insect
- Bacillus rossius
- Gene Symbol
- -
- Assembly
- GCA_032445375.1
- Location
- CM063651.1:1937337-1952490[-]
Transcription Factor Domain
- TF Family
- TF_bZIP
- Domain
- bZIP domain
- PFAM
- AnimalTFDB
- TF Group
- Basic Domians group
- Description
- bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 11 0.0052 28 7.3 0.5 28 57 594 623 583 629 0.55 2 11 0.0012 6.2 9.4 1.0 31 59 618 646 603 666 0.52 3 11 0.0022 12 8.5 0.2 34 63 663 692 653 708 0.53 4 11 0.0012 6.5 9.3 0.6 27 58 712 743 695 750 0.57 5 11 0.0018 9.4 8.8 0.3 29 62 721 754 715 764 0.56 6 11 0.004 21 7.7 0.5 29 55 756 782 735 792 0.56 7 11 0.0054 29 7.2 1.4 30 55 771 796 757 809 0.48 8 11 0.0028 15 8.2 0.3 28 63 783 818 779 820 0.78 9 11 0.0025 13 8.3 0.6 29 63 812 846 806 851 0.73 10 11 0.0014 7.5 9.1 0.4 29 58 847 876 835 883 0.53 11 11 0.0011 6.1 9.4 0.3 29 64 882 917 876 918 0.78
Sequence Information
- Coding Sequence
- ATGTCAGCGGGAGGTAGCACATCGGTGGAGCGCGCGCTCGTTAACTTCACGGGCGTATCGCACCCGTGGGAGCCAGGCGCGTACATCACGAGCACGCTCCACGCTCTACGCTCTACGCTCTACGCTCTACGCTCTCGTGATGTGCGGGTGGCACTTTGGGGAGCCTCTGGCGCCATTACGGCCAGTTGTCCCGCTGTCCTGCCCGCTCTTGTCGCTACTAGTGTCGCTACTGGCCAGTGCGATTACTGTCAGTCTTCCCGCTGGCAGCAACTGTCCTGCACGCTCTTGTCGCTACTAGTGTCGCTACTGGCCATGCGATTACTGTCAGTCTTCCCGCTGGCAGCAACTGTCCTGCCCGCTCTTGTCGCTACTAGTGTCGCTACTGGCCAGTGCGATTACTGTCAGTCTTCCCGCTGGCAGCAACTGTCCTGCCCGCTCTTGTCGCTACTAGTGTCGCTACTGGCCAGTGCGATTACTGTCAGTCTTCTCGCTGGCAGCAACTGTCCTGCCCGCTCTTGTCGCTACTGGTGTCGCTACTGGCCATGCGATTACTGTCAGTCTTCCCGCTGGCAGCAACTGTCCTGCCCGCTCTTGTCGCTACTGGTGTCGCTACTGGCCAGTGCGATTACTGTCAGTCTTCCCGCTGGCAGCAACTGTCCTGCCCGCTCTTGTCGCTACTGGTGTCGCTACTGGCTAGTGCGATTACTGTCAGTCTTCCCGCTGGCAGCAACTGTCCTGCCCGCTCTTGTCGCTACTGTGCGATTACTGTCAGTCTTCCCGCTGGCAGCAACTGTCCTGCCCGCTCTTGTCGCTACTAGTATCGCTACTGGCCAGTGCGATTACTGTCAGTCTTCCCGCTGGCAGCAACTGTCCTGCCCGCTCTTGTCGCTACTAGTGTCGCTACTGGCCAGTGCGATTACTGTCAGTCTTCCCGCTGGCAGCAACTGTCCTGTCCGCTCTTGTCGCTACTGGTGTCGCTACTGGCCAGTGCGATTACTGTCAGTCTTCCCGCTGGCAGCAACTGTCCTGTCCGCTCTTGTCGCTACTGGTGTCGCTACTGGCCAGTGCGATTACTGTCAGTCTTCCCGCTGGCAGCAACTGTCCTGTCCGCTCTTGTCGCTACTGGTGTCGCTACTGGCCAGTGCGATTACTGTCAGTCTTCCCGCTGGCAGCAACTGTCCTGTCCGCTCTTGTCGCTACTGGTGTCGCTACTGGCTAGTGCGATTACTGTCAGTCTTCCCGCTGGCAGCAACTGTCCTGTCCGCTCGTGTCGCTACTAGTGTCGCTACTGGCCAGTGCGATTACTTGCGATTACTGTCAGTCTTCCTGCAGGCAGCAACTGTCCTGTCCGCTCTTGTCGCTACTAGTGTCGCTACTGGCCAGTGCGATTACTGTCAGTCTTCCCGCTGGCAGCAACTGTCCTGTCCGCTCTTGTCGCTACTAGTGTCGCTACTGGCCAGTGCGTTTACTAGCAGCGCCTACACGCGCCGAGAGATCAAAACAAGCTCGCAGCGCTTCCGTTCCGTTCCTCCTCCCTGCCCCCTCCATCCTGTTCGCCCCGCCCATCGCTTACCTCCCCTCCCTACCCTGTTTCCCCCCTCTCCGCTACAAGCGCTCGTGTGCAGGCAGAGTATCGCGAGCTTGTTTAACTTCGGGTCGGATTCCATTCCCGCGGAGCAGTACCGCGCCCGCCCGCCAGACTGCGCTGGTGTGCCCGCCCTCCCCTCACCGCGCGCATTAGCCCTCAATCCACACTCCGCGAGCCTCACGCGCCAGTCCGCGAGCCTCACGCGCCAGTCCGCGAGCCTCACGCGCCAGTCCGCGAGCCTCACGCGCCAGTCCGCGAGCCTCACGCGCCAGTCCGCGAGCCTCACGCGCCAGTCCGCGAGCCTCACGCGCCAGTCCGCGAGCCTCACGCGCCAGTACGCGAGCCTCACGCGCCAGTACGCGAGCCTCACGCGCCAGTACGCGAGCCTCACGCGCCAGTACGCGAGCCTCACGCGCCAGTACGCGAGCCTCACGCGCCAGTCCGCGAGCCTCACGCGCCAGTACGCGAGCCTCACGCGCCAGTACGCGAGCCTCACGCGCCAGTCCGCGAGCCTCACGCGCCAGTACGCGAGCCTCACGCGCCAGTCCGCGAGCCTCACGCGCCAGTCCGCGAGCCTCACGCGCCAGTCCGCGAGCCTCACGCGCCAGTCCGCGAGCCTCACGCGCCAGTACGCGAGCCTCACGCGCCAGTACGCGAGCCTCACGCGCCAGTACGCGAGCCTCACGCGCCAGTACGCGAGCCTCACGCGCCAGTCCGCGAGCCTCACGCGCCAGTCCGCGAGCCTCACGCGCCAGTCCGCGAGCCTCACGCGCCAGTCCGCGAGCCTCACGCGCCAGTCCGCGAGCCTCACGCGCCAGTCCGCGAGCCTCACGCGCCAGTCCGCGAGCCTCACGCGCCAGTACGCGAGCCTCACGCGCCAGTCCGCGAGCCTCACGCGCCAGTCCGCGAGCCTCACGCGCCAGTCCGCGAGCCTCACGCGCCAGTACGCGAGCCTCACGCGCCAGTCCGCGAGCCTCACGCGCCAGTCCGCGAGCCTCACGCGCCAGTCCGCGAGCCTCACGCGCCAGTACGCGAGCCTCACGCGCCAGTACGCGAGCCTCACGCGCCAGTCCGCGAGCCTCACGCGCCAGTCCGCGAGCCTCACGCGCCAGTACGCGAGCCTCACGCGCCAGTACGCGAGCCTCACGCGCAAGCTCACGAGCCTGGGCGGTGCCGACTCAAGCCCCAGTCTCCAGGTGTACGCGCGGAGGCGCAATGCAAGTTGCCTAGCTGCGGGAGCGCAGGGGCAGGCTCGCTGCGGCCGGACGTACACAGCACTGGGATTGGGCGGGAACTGCGGCAGCGAGCGAGGGAACCAGCAGCCGGGCACTAACACTGTGTGCGAGCTGCCACTGACTGGTCTCAGCGGACACGCGCGCTCGCAGCTATGGCGGACCCTGTCCCCGGAGCGGCTGTGGTATACTACTGACAGGGCAGAGGCCGCTGCGGGCGCGCAGTTATATTTGCCCTCGATTACCCGGCCACTCGCGCGACAGGTCAAGTGGCTCGCTCATCCATCAGGCGCGCCCATGCGCGGTGTAGCGCGGCGATCCAGCGGGCAAACAAATAACATCCAGTGA
- Protein Sequence
- MSAGGSTSVERALVNFTGVSHPWEPGAYITSTLHALRSTLYALRSRDVRVALWGASGAITASCPAVLPALVATSVATGQCDYCQSSRWQQLSCTLLSLLVSLLAMRLLSVFPLAATVLPALVATSVATGQCDYCQSSRWQQLSCPLLSLLVSLLASAITVSLLAGSNCPARSCRYWCRYWPCDYCQSSRWQQLSCPLLSLLVSLLASAITVSLPAGSNCPARSCRYWCRYWLVRLLSVFPLAATVLPALVATVRLLSVFPLAATVLPALVATSIATGQCDYCQSSRWQQLSCPLLSLLVSLLASAITVSLPAGSNCPVRSCRYWCRYWPVRLLSVFPLAATVLSALVATGVATGQCDYCQSSRWQQLSCPLLSLLVSLLASAITVSLPAGSNCPVRSCRYWCRYWLVRLLSVFPLAATVLSARVATSVATGQCDYLRLLSVFLQAATVLSALVATSVATGQCDYCQSSRWQQLSCPLLSLLVSLLASAFTSSAYTRREIKTSSQRFRSVPPPCPLHPVRPAHRLPPLPTLFPPSPLQALVCRQSIASLFNFGSDSIPAEQYRARPPDCAGVPALPSPRALALNPHSASLTRQSASLTRQSASLTRQSASLTRQSASLTRQSASLTRQSASLTRQSASLTRQYASLTRQYASLTRQYASLTRQYASLTRQYASLTRQSASLTRQYASLTRQYASLTRQSASLTRQYASLTRQSASLTRQSASLTRQSASLTRQSASLTRQYASLTRQYASLTRQYASLTRQYASLTRQSASLTRQSASLTRQSASLTRQSASLTRQSASLTRQSASLTRQSASLTRQYASLTRQSASLTRQSASLTRQSASLTRQYASLTRQSASLTRQSASLTRQSASLTRQYASLTRQYASLTRQSASLTRQSASLTRQYASLTRQYASLTRKLTSLGGADSSPSLQVYARRRNASCLAAGAQGQARCGRTYTALGLGGNCGSERGNQQPGTNTVCELPLTGLSGHARSQLWRTLSPERLWYTTDRAEAAAGAQLYLPSITRPLARQVKWLAHPSGAPMRGVARRSSGQTNNIQ
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -