Bros065790.1
Basic Information
- Insect
- Bacillus rossius
- Gene Symbol
- -
- Assembly
- GCA_032445375.1
- Location
- CM063647.1:43954687-43964323[+]
Transcription Factor Domain
- TF Family
- HMGA
- Domain
- HMGA domain
- PFAM
- AnimalTFDB
- TF Group
- Unclassified Structure
- Description
- This entry represents the HMGA family, whose members contain DNA-binding domains, also known as AT hooks due to their ability to interact with the narrow minor groove of AT-rich DNA sequences. They play an important role in chromatin organisation [1]. The high mobility group (HMG) proteins are the most abundant and ubiquitous nonhistone chromosomal proteins. They bind to DNA and to nucleosomes and are involved in the regulation of DNA-dependent processes such as transcription, replication, recombination, and DNA repair. They can be grouped into three families: HMGB (HMG 1/2), HMGN (HMG 14/17) and HMGA (HMG I/Y). The characteristic domains are: AT-hook for the HMGA family, the HMG Box for the HMGB family, and the nucleosome-binding domain (NBD) for the members of the HMGN family [2].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 23 0.0029 3.1e+02 3.6 2.2 3 11 21 29 17 29 0.90 2 23 0.00076 81 5.5 0.2 3 11 35 43 34 43 0.89 3 23 0.00076 81 5.5 0.2 3 11 49 57 48 57 0.89 4 23 0.00076 81 5.5 0.2 3 11 63 71 62 71 0.89 5 23 0.00076 81 5.5 0.2 3 11 77 85 76 85 0.89 6 23 0.00076 81 5.5 0.2 3 11 91 99 90 99 0.89 7 23 0.00076 81 5.5 0.2 3 11 105 113 104 113 0.89 8 23 0.00076 81 5.5 0.2 3 11 119 127 118 127 0.89 9 23 0.00076 81 5.5 0.2 3 11 133 141 132 141 0.89 10 23 0.00076 81 5.5 0.2 3 11 147 155 146 155 0.89 11 23 0.00076 81 5.5 0.2 3 11 161 169 160 169 0.89 12 23 0.00076 81 5.5 0.2 3 11 175 183 174 183 0.89 13 23 0.00076 81 5.5 0.2 3 11 189 197 188 197 0.89 14 23 0.00076 81 5.5 0.2 3 11 203 211 202 211 0.89 15 23 0.00076 81 5.5 0.2 3 11 217 225 216 225 0.89 16 23 0.00076 81 5.5 0.2 3 11 231 239 230 239 0.89 17 23 0.00076 81 5.5 0.2 3 11 245 253 244 253 0.89 18 23 0.00076 81 5.5 0.2 3 11 259 267 258 267 0.89 19 23 0.00076 81 5.5 0.2 3 11 273 281 272 281 0.89 20 23 0.00076 81 5.5 0.2 3 11 287 295 286 295 0.89 21 23 0.00076 81 5.5 0.2 3 11 301 309 300 309 0.89 22 23 0.00076 81 5.5 0.2 3 11 315 323 314 323 0.89 23 23 0.00076 81 5.5 0.2 3 11 329 337 328 337 0.89
Sequence Information
- Coding Sequence
- ATGACAGCGGCTGAAATTCGTCAGATCACAAACCCCAAGTCCCCACTCCCAAGTCCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTCCCAAGTGCCCACTATTGA
- Protein Sequence
- MTAAEIRQITNPKSPLPSPHSQVPTPKCPLPSAHSQVPTPKCPLPSAHSQVPTPKCPLPSAHSQVPTPKCPLPSAHSQVPTPKCPLPSAHSQVPTPKCPLPSAHSQVPTPKCPLPSAHSQVPTPKCPLPSAHSQVPTPKCPLPSAHSQVPTPKCPLPSAHSQVPTPKCPLPSAHSQVPTPKCPLPSAHSQVPTPKCPLPSAHSQVPTPKCPLPSAHSQVPTPKCPLPSAHSQVPTPKCPLPSAHSQVPTPKCPLPSAHSQVPTPKCPLPSAHSQVPTPKCPLPSAHSQVPTPKCPLPSAHSQVPTPKCPLPSAHSQVPTPKCPLPSAHSQVPTPKCPLPSAHY
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -