Basic Information

Transcription Factor Domain

TF Family: HMGA
Domain: HMGA domain
PFAM: AnimalTFDB
TF Group: Unclassified Structure
Description: This entry represents the HMGA family, whose members contain DNA-binding domains known as AT hooks, named for their ability to bind the narrow minor groove of AT-rich DNA sequences. HMGA proteins play an important role in chromatin organisation [1]. The high mobility group (HMG) proteins are the most abundant and ubiquitous non-histone chromosomal proteins. They bind to DNA and to nucleosomes and are involved in the regulation of DNA-dependent processes such as transcription, replication, recombination, and DNA repair. They can be grouped into three families: HMGB (HMG 1/2), HMGN (HMG 14/17) and HMGA (HMG I/Y). The characteristic domains are the AT hook for the HMGA family, the HMG box for the HMGB family, and the nucleosome-binding domain (NBD) for the HMGN family [2].
Hmmscan Out:
#	of	c-Evalue	i-Evalue	score	bias	hmm coord from	hmm coord to	ali coord from	ali coord to	env coord from	env coord to	acc
1	4	0.7	3.2e+04	-3.0	0.4	17	21	98	102	98	102	0.89
2	4	0.14	6.6e+03	-0.8	0.5	12	15	171	174	170	175	0.93
3	4	2.1e-09	9.4e-05	24.2	2.1	4	21	191	208	189	209	0.88
4	4	2	9.2e+04	-5.1	6.9	10	15	222	227	221	228	0.83
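
The domain table above can be read programmatically. The following is a minimal sketch, not the pipeline used to produce this record: it parses the four rows as whitespace-separated fields and keeps the hits whose independent E-value (i-Evalue) falls below a cutoff. The field names in `DomainHit` and the cutoff of 0.01 are assumptions chosen for illustration.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    """One row of the per-domain table (field names are illustrative)."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# Column types, in the same order as the table header.
FIELD_TYPES = (int, int, float, float, float, float,
               int, int, int, int, int, int, float)

# Rows copied from the record above.
ROWS = """\
1 4 0.7 3.2e+04 -3.0 0.4 17 21 98 102 98 102 0.89
2 4 0.14 6.6e+03 -0.8 0.5 12 15 171 174 170 175 0.93
3 4 2.1e-09 9.4e-05 24.2 2.1 4 21 191 208 189 209 0.88
4 4 2 9.2e+04 -5.1 6.9 10 15 222 227 221 228 0.83
"""


def parse_hits(text: str) -> List[DomainHit]:
    """Parse whitespace-separated domain rows, skipping malformed lines."""
    hits = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != len(FIELD_TYPES):
            continue
        hits.append(DomainHit(*(cast(v) for cast, v in zip(FIELD_TYPES, fields))))
    return hits


def significant_hits(hits: List[DomainHit], max_i_evalue: float = 0.01) -> List[DomainHit]:
    """Keep hits below the (assumed) i-Evalue cutoff."""
    return [h for h in hits if h.i_evalue < max_i_evalue]


hits = parse_hits(ROWS)
kept = significant_hits(hits)
for h in kept:
    print(f"domain {h.index}/{h.total}: residues {h.ali_from}-{h.ali_to}, "
          f"i-Evalue {h.i_evalue:g}, score {h.score}")
```

With this cutoff only the third domain passes, covering aligned residues 191 to 208 of the protein; the other three rows have i-Evalues in the thousands and negative bit scores.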

Coding Sequence: ATGTCGGAGAAAACCGTGAGTAGAATATCTAAAGAAGGTGAAGCTGCTGCCTCTACATCTCAAAAACTCAAGTCTCCTGGAAAACATCGTCAAAGACGTAAAACGGTGGATCTTGATGATTTCGATTTATgcgcaataagaaacaaaatccatgaaatgtatactgtaagaaaagttgttcctactttaaataaattacttgtagaattaaagaatgacataaactttggtggaggtcgaacgactttgtggaaaattttaaaacaaattggatttcagtttaaaaagtgtggttcgaaacggaaaattttaatggaaaggcacgacatagctgcatggagacgcaagtatatcgttactatgaggcaaaatcgagcagatggtcggccaatagttttcttagatgaaacatacattcacgcatcgtatgcagtgaaaaaatgttggcaaaaagATGACGAGGACGGACTACTAATAAGTGATTCTGATGCCAAAGCAGAAACAAAAAAGAGGGGTAGACCTGCAGCACCTGCTAAAACTAAAGAGGCAAAAAATTCATCTGATGATGAGCAGGCACCAGCAGTTAAGAGAGGAAGAGGTAGACCTAAAGGATCCAAAAAGAAAGCTTCAGCTCCTAAAGCAAAGagTGGGTCTGGTGAAGGAAGAGGTAGAGGTAGGCCACGTAAAGAAGCAGCACCTCCTAAAAAGGATGCGGCTTCCACGGACGAGGAACAAgatgaagatgaagaagaagaagaagagggaTCTGACCAGTAA
Protein Sequence: MSEKTVSRISKEGEAAASTSQKLKSPGKHRQRRKTVDLDDFDLCAIRNKIHEMYTVRKVVPTLNKLLVELKNDINFGGGRTTLWKILKQIGFQFKKCGSKRKILMERHDIAAWRRKYIVTMRQNRADGRPIVFLDETYIHASYAVKKCWQKDDEDGLLISDSDAKAETKKRGRPAAPAKTKEAKNSSDDEQAPAVKRGRGRPKGSKKKASAPKAKSGSGEGRGRGRPRKEAAPPKKDAASTDEEQDEDEEEEEEGSDQ
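
The relationship between the two sequences above can be checked by translating the coding sequence with the standard genetic code. The sketch below assumes the standard code applies to this entry and that the lowercase stretches of the coding sequence are part of the reading frame (the function uppercases its input). The name `translate` is a hypothetical helper, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W"
               "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR"
               "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Case is ignored; codons containing unexpected characters become 'X'.
    A trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 39 nucleotides of the coding sequence listed above.
cds_start = "ATGTCGGAGAAAACCGTGAGTAGAATATCTAAAGAAGGT"
print(translate(cds_start))  # MSEKTVSRISKEG
```

Passing the full coding sequence to `translate` and comparing the result with the listed protein sequence is a quick consistency check for the record.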

Sequence clustering based on sequence similarity using MMseqs2