Basic Information

Gene Symbol
-
Assembly
GCA_907165275.1
Location
OU015652.1:14994650-14997436[-]
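The Location value packs a sequence accession, a coordinate range, and a strand into a single string. The sketch below shows one way it could be split into fields. The `Location` type and `parse_location` helper are hypothetical names introduced here for illustration, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a parsed Location field."""
    seqid: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "SEQID:START-END[STRAND]".
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into accession, start, end, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        seqid=match["seqid"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("OU015652.1:14994650-14997436[-]")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = loc.end - loc.start + 1
print(loc, span)
```

Under that assumption the locus spans 2787 bases on the minus strand of OU015652.1.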

Transcription Factor Domain

TF Family
HMGA
Domain
HMGA domain
PFAM
AnimalTFDB
TF Group
Unclassified Structure
Description
This entry represents the HMGA family, whose members contain DNA-binding domains known as AT-hooks because they bind the narrow minor groove of AT-rich DNA sequences. They play an important role in chromatin organisation [1]. The high mobility group (HMG) proteins are the most abundant and ubiquitous nonhistone chromosomal proteins. They bind to DNA and to nucleosomes and are involved in the regulation of DNA-dependent processes such as transcription, replication, recombination, and DNA repair. They can be grouped into three families: HMGB (HMG 1/2), HMGN (HMG 14/17) and HMGA (HMG I/Y). The characteristic domains are the AT-hook for the HMGA family, the HMG box for the HMGB family, and the nucleosome-binding domain (NBD) for the HMGN family [2].
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1  9   0.00033   0.73      10.1   4.0   5         13      118       126     116       127     0.89
2  9   0.00033   0.73      10.1   4.0   5         13      168       176     166       177     0.89
3  9   0.00033   0.73      10.1   4.0   5         13      218       226     216       227     0.89
4  9   0.00014   0.32      11.2   2.8   5         13      275       283     273       284     0.89
5  9   0.00033   0.73      10.1   4.0   5         13      463       471     461       472     0.89
6  9   0.00033   0.73      10.1   4.0   5         13      513       521     511       522     0.89
7  9   0.00033   0.73      10.1   4.0   5         13      563       571     561       572     0.89
8  9   0.00033   0.73      10.1   4.0   5         13      613       621     611       622     0.89
9  9   0.00014   0.32      11.2   2.8   5         13      670       678     668       679     0.89
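Each row of the Hmmscan Out table holds thirteen whitespace-separated values in the column order given by its header. As a rough sketch, the rows could be loaded into typed records as shown below. The `DomainHit` class and `parse_hit` function are hypothetical names, not part of HMMER or of this database, and the two sample rows are copied from the table.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    """Hypothetical record for one per-domain row of the table."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float

    @property
    def ali_length(self) -> int:
        # Number of residues covered by the alignment, counting both ends.
        return self.ali_to - self.ali_from + 1


def parse_hit(line: str) -> DomainHit:
    """Parse one whitespace-separated row into a DomainHit."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 columns, got {len(fields)}")
    # Columns 0-1 and 6-11 are integers; the remaining columns are floats.
    int_columns = {0, 1, 6, 7, 8, 9, 10, 11}
    values = [
        int(value) if i in int_columns else float(value)
        for i, value in enumerate(fields)
    ]
    return DomainHit(*values)


rows = [
    "1 9 0.00033 0.73 10.1 4.0 5 13 118 126 116 127 0.89",
    "4 9 0.00014 0.32 11.2 2.8 5 13 275 283 273 284 0.89",
]
hits = [parse_hit(row) for row in rows]
for hit in hits:
    print(hit.index, hit.ali_from, hit.ali_to, hit.ali_length, hit.score)
```

Keeping the coordinates as integers makes it straightforward to slice the matching residues out of the protein sequence or to compare the spacing between successive hits.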

Sequence Information

Coding Sequence
ATGACCGGCGCCCAAGGCCCCCTGCGACGGGTCAGCCCCTCCACGCCGCCAATCACCGGGGCGGCCCCGTCGAGGCCGCAGAGGAAAAGCATCTCCACGGCCTCTAACACTCAGCCGCGGTGCTCCCCGAGCCACCGCGGCCCGCCTCAGTGGCGCCGCCGCCCCGGAGGACGACAGCGCCACCTACATGGGCCGCCTCGACCACGTGCTTGCTGCCTGCAAGTGGGTGAGACGGCCCCCTCCCGGTGCCTCATGCCGGGCGCGGCGTCGGATGAAATCGACGGCGCCGCACCCGAACATTCCGGCCCTTCTCCCGGCCACGGTTTTTACGCCGCGACCCGCCTCAGCCGCGCCCCCACGCCAAAGCGCCCGAGAGGCACGGCCTACCCGTCCCGGTGCCGCATGCCGGGCGCGGCGTCGGATGAAATCGACGGCGCCGCACCCGAGCATTCCGGCCCTTCTCCCGGCCACGGTTTTTACGCCGCGACCCGCCTCAGCCGCGCCCCCACGCCAAAGCGCCCGAGAGGCACGGCCTACCCGTCCCGGTGCCGCATGCCGGGCGCGGCGTCGGATGAAATCGACGGCGCCGCACCCGAACATTCCGGCCCTTCTCCCGGCCACGGTTTTTACGCCGCGACCCGCCTCAGCCGCGCCCCCACGCCAAAGCGCCCGAGAGGCACGGCCTACGCTCCCCTCTCCCGGCCACGGTTTTTTACGCCGCGGCCCACCTCAGTTGCACCCCCACGCCAAAGCGCCCGAGAGGCGCAACCTACCGTCCCCTCTCCCGGCCACGGTTTTTACGCCGCGACCCGCCTCAGTCGCGCCCCCACGCCAAAGCGCCCGAGAGGCGCAACCTACCCGTCTCGGTGCCTCATGTCGGGCGCAGCGCTGGCCCTTCTCAGCGCCGCACTCACCGGCATTCCGGCCCGCCTCAGGGTTCCTCTTGGCTTCACCAATGACGACGATGCGCCGGAACACGCGACGCACCGCCGCCCTCCTCCCTCTCTCGGGGACCTTGTGCAGAGGACGCCGGTCGGGCCCATGACCGGCGCCCAAGGCCCCCTGCGACGGCCCCTCCACGCCGCCACTCACCGGGGCGGCCCCGTCGAGGCCGCGGAGGAAAAACACCCCCACGGCCTCCAACCCCCGGCCGCGGCGCTCGAGCTACGCCGCAGCCCGCCTCAGTGGCGCCGCCGCCCCGGAGGACGACAGCGCCACCTACATGGGCCGCCTCGACCACGTGCTTGCTGCCTGCAAGTGGGTGAGACGGCCCCCTCCCGGTGCCTCATGCCGGGCGCGGCGTCGGATGAAATCGACGGCGCCGCACCCGAACATTCCGGCCCTTCTCCCGGCCACGGTTTTTACGCCGCGACCCGCCTCAGCCGCGCCCCCACGCCAAAGCGCCCGAGAGGCACGGCCTACCCGTCCCGGTGCCGCATGCCGGGCGCGGCGTCGGATGAAATCGACGGCGCCGCACCCGAGCATTCCGGCCCTTCTCCCGGCCACGGTTTTTACGCCGCGACCCGCCTCAGCCGCGCCCCCACGCCAAAGCGCCCGAGAGGCACGGCCTACCCGTCCCGGTGCCGCATGCCGGGCGCGGCGTCGGATGAAATCGACGGCGCCGCACCCGAACATTCCGGCCCTTCTCCCGGCCACGGTTTTTACGCCGCGACCCGCCTCAGCCGCGCCCCCACGCCAAAGCGCCCGAGAGGCACGGCCTACCCGTCCCGGTGCCGCATGCCGGGCGCGGCGTCGGATGAAATCGACGGCGCCGCACCCGAACATTCCGGCCCTTCTCCCGGCCACGGTTTTTACGCCGCGACCCGCCTCAGCCGAGCCCCCACGCCAAAGCGCCCGAGAGGCACGGCCTACGCTCCCCTCTCCCGGCCACGGTTTTTTACGCCGCGGCCCACCTCAGTTGCACCCCCACGCCAAAGCGCCCGAGAGGCGCAACCTACCGTCCCCTCTCCCGGCCACGGTTTTTACGCCGCGACCCGCCT
CAGTCGCGCCCCCACGCCAAAGCGCCCGAGAGGCGCAACCTACCCGTCTCGGTGCCTCATGTCGGGCGCAGCGCTGGCCCTTCTCAGCGCCGCACTCACCGGCATTCCGGCCCGCCTCAGGGTTCCTCTTGGCTTCACCGTGGGTGGACAGTACCCACGGCTACTAAGCCCCCCCGCCACGACAAGGCGGGAACCCAATTCGGGGGGGAACCTAACCTAA
Protein Sequence
MTGAQGPLRRVSPSTPPITGAAPSRPQRKSISTASNTQPRCSPSHRGPPQWRRRPGGRQRHLHGPPRPRACCLQVGETAPSRCLMPGAASDEIDGAAPEHSGPSPGHGFYAATRLSRAPTPKRPRGTAYPSRCRMPGAASDEIDGAAPEHSGPSPGHGFYAATRLSRAPTPKRPRGTAYPSRCRMPGAASDEIDGAAPEHSGPSPGHGFYAATRLSRAPTPKRPRGTAYAPLSRPRFFTPRPTSVAPPRQSAREAQPTVPSPGHGFYAATRLSRAPTPKRPRGATYPSRCLMSGAALALLSAALTGIPARLRVPLGFTNDDDAPEHATHRRPPPSLGDLVQRTPVGPMTGAQGPLRRPLHAATHRGGPVEAAEEKHPHGLQPPAAALELRRSPPQWRRRPGGRQRHLHGPPRPRACCLQVGETAPSRCLMPGAASDEIDGAAPEHSGPSPGHGFYAATRLSRAPTPKRPRGTAYPSRCRMPGAASDEIDGAAPEHSGPSPGHGFYAATRLSRAPTPKRPRGTAYPSRCRMPGAASDEIDGAAPEHSGPSPGHGFYAATRLSRAPTPKRPRGTAYPSRCRMPGAASDEIDGAAPEHSGPSPGHGFYAATRLSRAPTPKRPRGTAYAPLSRPRFFTPRPTSVAPPRQSAREAQPTVPSPGHGFYAATRLSRAPTPKRPRGATYPSRCLMSGAALALLSAALTGIPARLRVPLGFTVGGQYPRLLSPPATTRREPNSGGNLT
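The protein sequence corresponds to a codon-by-codon translation of the coding sequence. The sketch below illustrates that relationship on the first 30 and last 12 bases of the coding sequence. It assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly, and `translate` is a hypothetical helper written for this example.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), with codons
# enumerated in TCAG order for each of the three positions.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence in frame 0, optionally halting at a stop codon."""
    cds = cds.upper()
    protein = []
    # Step through complete codons only; trailing partial codons are ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks ambiguous codons
        if amino_acid == "*" and to_stop:
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the coding sequence in this entry.
cds_start = "ATGACCGGCGCCCAAGGCCCCCTGCGACGG"
cds_end = "AACCTAACCTAA"
print(translate(cds_start))  # MTGAQGPLRR
print(translate(cds_end))    # NLT
```

The two outputs match the first ten and last three residues of the protein sequence listed in this entry.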

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01569354;
90% Identity
iTF_01569354;
80% Identity
-