Basic Information

Gene Symbol
-
Assembly
GCA_000349105.1
Location
KB672031:50830-54371[-]

Transcription Factor Domain

TF Family
HMGA
Domain
HMGA domain
PFAM
AnimalTFDB
TF Group
Unclassified Structure
Description
This entry represents the HMGA family, whose members contain DNA-binding domains, also known as AT hooks due to their ability to interact with the narrow minor groove of AT-rich DNA sequences. They play an important role in chromatin organisation [1]. The high mobility group (HMG) proteins are the most abundant and ubiquitous nonhistone chromosomal proteins. They bind to DNA and to nucleosomes and are involved in the regulation of DNA-dependent processes such as transcription, replication, recombination, and DNA repair. They can be grouped into three families: HMGB (HMG 1/2), HMGN (HMG 14/17) and HMGA (HMG I/Y). The characteristic domains are: AT-hook for the HMGA family, the HMG Box for the HMGB family, and the nucleosome-binding domain (NBD) for the members of the HMGN family [2].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 6 0.0016 19 4.5 2.5 3 16 2 16 2 18 0.78
2 6 0.00057 6.7 5.9 8.7 4 15 83 94 82 96 0.92
3 6 0.0034 40 3.4 1.1 8 15 135 142 130 143 0.89
4 6 0.089 1e+03 -1.1 0.2 6 9 159 162 157 163 0.89
5 6 2.7e-10 3.1e-06 26.1 7.5 4 21 180 197 178 198 0.93
6 6 1 1.2e+04 -7.4 6.9 13 18 213 218 212 221 0.49

Sequence Information

Coding Sequence
ATGACCGAGGAAACATCTCCAGTTAAGAAGGGACGTGGACGCCCGAAAAAGAGCGATGTGTCAGTGAAAGAATCACCAAAAGAGAAAAAGGTGGTGCCACCAAAAGAAGTGGTCGATTCCGACTCAGAGGAGATAGACTCAGAACCGGAGGAAAGGAGCCCGGTTACGACTCCCAAACCTGCTAAAAAGCGtgctgccgcagcggccgccgccgccgctgccgccgATAATGAGACGGCCGATGGGGGAACTCCAGCGCCGAAACGAGGTCGCGGCCGCCCGCCAAAGTACTCGAACATTAATTACTGCACGTCTTGGTTATACGAAAACATACTAAATTGTTTGGTATGGGAAATAGCTGAGCAAGGACGACACACAATGTCTGATGCAGAGGTCGTTGAACCGAAGAAGGGTCGTGGACGCCCACCGAATCCGGACAAGAACAGTGTGGCACCGAAAAAGCGGGCCCGCGCTCCCTCGCCGAAGGAGTCCAAGTTGGTGGAGGAAAAGGAGAAACCCGTTGCGTCGGACGACGGTGAAGAGCCGTCGCCGAAGCGTGGCCGGGGCCGACCGAAGGGCAGCACCAAGAAGGTAGCGAAAGCGGCCAAGAAAGCTCCGGCAGCGCCGGTCGGCCGGGGTCGAGGACGAGGCCGCAAGAAACCGGTGAAAGAGGAATCTTCcgaagaggaggaggacgacgaggacgaggacgaggacgaggaggatgacgagcaggaggatagcgagggcaatgaggaAAATTACGGAAACGAAGAATCCGATTCGTAA
Protein Sequence
MTEETSPVKKGRGRPKKSDVSVKESPKEKKVVPPKEVVDSDSEEIDSEPEERSPVTTPKPAKKRAAAAAAAAAAADNETADGGTPAPKRGRGRPPKYSNINYCTSWLYENILNCLVWEIAEQGRHTMSDAEVVEPKKGRGRPPNPDKNSVAPKKRARAPSPKESKLVEEKEKPVASDDGEEPSPKRGRGRPKGSTKKVAKAAKKAPAAPVGRGRGRGRKKPVKEESSEEEEDDEDEDEDEEDDEQEDSEGNEENYGNEESDS

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-