Basic Information

Transcription Factor Domain

TF Family: HMGA
Domain: HMGA domain
PFAM: AnimalTFDB
TF Group: Unclassified Structure
Description: This entry represents the HMGA family, whose members contain DNA-binding domains, also known as AT hooks due to their ability to interact with the narrow minor groove of AT-rich DNA sequences. They play an important role in chromatin organisation [1]. The high mobility group (HMG) proteins are the most abundant and ubiquitous nonhistone chromosomal proteins. They bind to DNA and to nucleosomes and are involved in the regulation of DNA-dependent processes such as transcription, replication, recombination, and DNA repair. They can be grouped into three families: HMGB (HMG 1/2), HMGN (HMG 14/17) and HMGA (HMG I/Y). The characteristic domains are: AT-hook for the HMGA family, the HMG Box for the HMGB family, and the nucleosome-binding domain (NBD) for the members of the HMGN family [2].
Hmmscan Out: # of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc

1 6 0.0016 19 4.5 2.5 3 16 2 16 2 18 0.78

2 6 0.00057 6.7 5.9 8.7 4 15 83 94 82 96 0.92

3 6 0.0034 40 3.4 1.1 8 15 135 142 130 143 0.89

4 6 0.089 1e+03 -1.1 0.2 6 9 159 162 157 163 0.89

5 6 2.7e-10 3.1e-06 26.1 7.5 4 21 180 197 178 198 0.93

6 6 1 1.2e+04 -7.4 6.9 13 18 213 218 212 221 0.49

#	of	c-Evalue	i-Evalue	score	bias	hmm coord from	hmm coord to	ali coord from	ali coord to	env coord from	env coord to	acc
1	6	0.0016	19	4.5	2.5	3	16	2	16	2	18	0.78
2	6	0.00057	6.7	5.9	8.7	4	15	83	94	82	96	0.92
3	6	0.0034	40	3.4	1.1	8	15	135	142	130	143	0.89
4	6	0.089	1e+03	-1.1	0.2	6	9	159	162	157	163	0.89
5	6	2.7e-10	3.1e-06	26.1	7.5	4	21	180	197	178	198	0.93
6	6	1	1.2e+04	-7.4	6.9	13	18	213	218	212	221	0.49

Coding Sequence: ATGACCGAGGAAACATCTCCAGTTAAGAAGGGACGTGGACGCCCGAAAAAGAGCGATGTGTCAGTGAAAGAATCACCAAAAGAGAAAAAGGTGGTGCCACCAAAAGAAGTGGTCGATTCCGACTCAGAGGAGATAGACTCAGAACCGGAGGAAAGGAGCCCGGTTACGACTCCCAAACCTGCTAAAAAGCGtgctgccgcagcggccgccgccgccgctgccgccgATAATGAGACGGCCGATGGGGGAACTCCAGCGCCGAAACGAGGTCGCGGCCGCCCGCCAAAGTACTCGAACATTAATTACTGCACGTCTTGGTTATACGAAAACATACTAAATTGTTTGGTATGGGAAATAGCTGAGCAAGGACGACACACAATGTCTGATGCAGAGGTCGTTGAACCGAAGAAGGGTCGTGGACGCCCACCGAATCCGGACAAGAACAGTGTGGCACCGAAAAAGCGGGCCCGCGCTCCCTCGCCGAAGGAGTCCAAGTTGGTGGAGGAAAAGGAGAAACCCGTTGCGTCGGACGACGGTGAAGAGCCGTCGCCGAAGCGTGGCCGGGGCCGACCGAAGGGCAGCACCAAGAAGGTAGCGAAAGCGGCCAAGAAAGCTCCGGCAGCGCCGGTCGGCCGGGGTCGAGGACGAGGCCGCAAGAAACCGGTGAAAGAGGAATCTTCcgaagaggaggaggacgacgaggacgaggacgaggacgaggaggatgacgagcaggaggatagcgagggcaatgaggaAAATTACGGAAACGAAGAATCCGATTCGTAA
Protein Sequence: MTEETSPVKKGRGRPKKSDVSVKESPKEKKVVPPKEVVDSDSEEIDSEPEERSPVTTPKPAKKRAAAAAAAAAAADNETADGGTPAPKRGRGRPPKYSNINYCTSWLYENILNCLVWEIAEQGRHTMSDAEVVEPKKGRGRPPNPDKNSVAPKKRARAPSPKESKLVEEKEKPVASDDGEEPSPKRGRGRPKGSTKKVAKAAKKAPAAPVGRGRGRGRKKPVKEESSEEEEDDEDEDEDEEDDEQEDSEGNEENYGNEESDS

Sequence clustering based on sequence similarity using MMseqs2