Basic Information

Transcription Factor Domain

TF Family: HMGA
Domain: HMGA domain
PFAM: AnimalTFDB
TF Group: Unclassified Structure
Description: This entry represents the HMGA family, whose members contain DNA-binding domains known as AT-hooks because they interact with the narrow minor groove of AT-rich DNA sequences. They play an important role in chromatin organisation [1]. The high mobility group (HMG) proteins are the most abundant and ubiquitous non-histone chromosomal proteins. They bind to DNA and to nucleosomes and are involved in the regulation of DNA-dependent processes such as transcription, replication, recombination, and DNA repair. They can be grouped into three families: HMGB (HMG 1/2), HMGN (HMG 14/17) and HMGA (HMG I/Y). The characteristic domains are the AT-hook for the HMGA family, the HMG box for the HMGB family, and the nucleosome-binding domain (NBD) for the HMGN family [2].
Hmmscan Out:
#	of	c-Evalue	i-Evalue	score	bias	hmm coord from	hmm coord to	ali coord from	ali coord to	env coord from	env coord to	acc
1	4	0.048	4.5e+03	-0.3	0.5	12	15	97	100	96	101	0.93
2	4	7.5e-10	7e-05	24.7	2.2	4	21	117	134	115	135	0.87
3	4	1	9.3e+04	-4.6	6.9	10	15	146	151	145	152	0.83
4	4	0.27	2.6e+04	-2.7	1.5	4	9	154	159	153	160	0.83
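
The domain table above can be read programmatically to see which of the four reported domain hits is well supported. The following is a minimal sketch, not part of the database itself: the snake_case column names, the helper functions `parse_domains` and `significant`, and the i-Evalue cutoff of 1e-3 are all assumptions made for illustration.

```python
# Column names follow the table header; the snake_case spellings are our own.
COLUMNS = ["index", "of", "c_evalue", "i_evalue", "score", "bias",
           "hmm_from", "hmm_to", "ali_from", "ali_to",
           "env_from", "env_to", "acc"]
INT_COLUMNS = {"index", "of", "hmm_from", "hmm_to",
               "ali_from", "ali_to", "env_from", "env_to"}

# Rows copied from the "Hmmscan Out" field (whitespace-separated).
ROWS = """
1 4 0.048   4.5e+03 -0.3 0.5 12 15  97 100  96 101 0.93
2 4 7.5e-10 7e-05   24.7 2.2  4 21 117 134 115 135 0.87
3 4 1       9.3e+04 -4.6 6.9 10 15 146 151 145 152 0.83
4 4 0.27    2.6e+04 -2.7 1.5  4  9 154 159 153 160 0.83
"""


def parse_domains(text):
    """Turn each non-empty row into a dict keyed by column name."""
    domains = []
    for line in text.strip().splitlines():
        row = {}
        for name, value in zip(COLUMNS, line.split()):
            row[name] = int(value) if name in INT_COLUMNS else float(value)
        domains.append(row)
    return domains


def significant(domains, max_i_evalue=1e-3):
    """Keep hits whose independent E-value is at or below the cutoff.

    The default cutoff is an illustrative assumption, not a database setting.
    """
    return [d for d in domains if d["i_evalue"] <= max_i_evalue]


domains = parse_domains(ROWS)
hits = significant(domains)
for d in hits:
    print(f"domain {d['index']}/{d['of']}: "
          f"residues {d['ali_from']}-{d['ali_to']}, "
          f"i-Evalue {d['i_evalue']:g}")
```

With this cutoff only the second hit (alignment coordinates 117 to 134) is retained; the other three rows have i-Evalues far above 1.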

Coding Sequence: atgagtCGTAGACATTTACCACGAAGTGCAATACTCGAGGAATTACTAAGAGAAGACGAAGTAGAGAGTGAATCAGAGCACAGCGACCAAGGCTCTGAAACTTCCGACCACATCGTTTCAGAGGCAACCCAATCTGCTAAGTTTGATTCCTCAGCAGAGAACTGCTTGAACTCAGATGAGGATGACTTGCCCCTTTCAGAAATAAATTGCTGTTTTACAGGGCGCGATATAATAACGACGTGGCAAAAATCTAACAAAAACAGAGAGGCGAAAGGAGACACCAAGAAGAGAGGAAGACCAGCAGTGCCTGCTAAAACAAAAGAGTCTACAAAATCCTCTGATGATGAACAGGCACCGGTAGCAAAACGAGGGCGAGGCCGGCCTAAAGGCTCTAAGAAAAAGGCTGCGAAGGCAAAGAGTGCACCTGTTGAGGGACGGGGTCGTGGCAGACCACGCAAGGACGTACCTCCTCCAAAGAAGGATGCGGCATCAACTGAAGAGGAACAAGAAGACGATGACGAAGAGGAGGGGTCTGACcagtaa
Protein Sequence: MSRRHLPRSAILEELLREDEVESESEHSDQGSETSDHIVSEATQSAKFDSSAENCLNSDEDDLPLSEINCCFTGRDIITTWQKSNKNREAKGDTKKRGRPAVPAKTKESTKSSDDEQAPVAKRGRGRPKGSKKKAAKAKSAPVEGRGRGRPRKDVPPPKKDAASTEEEQEDDDEEEGSDQ
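
The relationship between the two sequence fields can be checked by translating the coding sequence with the standard genetic code. The sketch below is illustrative only: the `translate` helper is a hypothetical name, it uppercases the input (the record mixes lower- and uppercase letters), and it assumes the sequence starts in frame and contains only A, C, G and T. For brevity it uses just the first eight codons of the coding sequence and the first eight residues of the protein.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon.

    Raises KeyError on codons containing letters other than A, C, G or T.
    """
    seq = cds.upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nucleotides of the Coding Sequence field and the first
# 8 residues of the Protein Sequence field.
cds_prefix = "atgagtCGTAGACATTTACCACGA"
protein_prefix = "MSRRHLPR"

print(translate(cds_prefix) == protein_prefix)
```

Passing the full Coding Sequence string to `translate` and comparing the result with the full Protein Sequence field extends the same check to the whole record.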

Sequence clustering based on sequence similarity using MMseqs2