Dchr011928.1
Basic Information
- Insect
- Danaus chrysippus
- Gene Symbol
- -
- Assembly
- GCA_004959915.1
- Location
- RPCZ01000060.1:59673-66284[+]
Transcription Factor Domain
- TF Family
- HMGA
- Domain
- HMGA domain
- PFAM
- AnimalTFDB
- TF Group
- Unclassified Structure
- Description
- This entry represents the HMGA family, whose members contain DNA-binding domains, also known as AT hooks due to their ability to interact with the narrow minor groove of AT-rich DNA sequences. They play an important role in chromatin organisation [1]. The high mobility group (HMG) proteins are the most abundant and ubiquitous nonhistone chromosomal proteins. They bind to DNA and to nucleosomes and are involved in the regulation of DNA-dependent processes such as transcription, replication, recombination, and DNA repair. They can be grouped into three families: HMGB (HMG 1/2), HMGN (HMG 14/17) and HMGA (HMG I/Y). The characteristic domains are: AT-hook for the HMGA family, the HMG Box for the HMGB family, and the nucleosome-binding domain (NBD) for the members of the HMGN family [2].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 9 2e-05 0.41 10.5 4.7 9 22 103 116 101 116 0.87 2 9 2e-05 0.41 10.5 4.7 9 22 271 284 269 284 0.87 3 9 2e-05 0.41 10.5 4.7 9 22 439 452 437 452 0.87 4 9 2e-05 0.41 10.5 4.7 9 22 607 620 605 620 0.87 5 9 2e-05 0.41 10.5 4.7 9 22 775 788 773 788 0.87 6 9 2e-05 0.41 10.5 4.7 9 22 943 956 941 956 0.87 7 9 2e-05 0.41 10.5 4.7 9 22 1111 1124 1109 1124 0.87 8 9 2e-05 0.41 10.5 4.7 9 22 1279 1292 1277 1292 0.87 9 9 2e-05 0.41 10.5 4.7 9 22 1447 1460 1445 1460 0.87
Sequence Information
- Coding Sequence
- atgtcTGTTTTATTCGAAGCGTTAATGTTTGAGACTGCGGGTCTTGAAGGGAAGCCGACGTTGGACAAATGTATAGCGTTGAAACAAGCCAAGCTAGGCAAGAGAGAACAAGAGATCCTGGCTAGGAAAGAGGCACGCGCCAAACATAAAGATGAAGACGAGGTGGAGAGTGAGTCTTCAGCTCGTGGCATGACGCGAGGCGCGAGGGGCGTCAAGCCTAGGCAGCGTATAGTGATATTTTCAGACGAGGAGGAAGGAACCCCGGCCGCGTCTGTTCAGTGTGACAGTCAAACGGATATGTCATTAAAGCGTGGTCCAGGTCGGCCGAAAGGCTCCAAGAACGCGTCACCCCGGACTCTCGGGAGCGGGAGTGGTCGCCGGGGCAGGCCGCCGGTCCCGCCGGAACTACGCCTGCCTGGCATAACTGAAATGAAGAAGTTCTGCAAAGCGGCCGGCATTAGATTCGACTATAAGAAACTCGTTGAAGGATGCACAAATAACAAGGAGCGTGTGGCTAAGATGTTGGATCTACTGACAGCTGCGGGTCTTGAAGGGAAGCCGACGTTGGACAAATGTATAGCGTTGAAACAAGCCAAGCTAGGCAAGAGAGAACAAGAGATCCTGGCTAGGAAAGAGGCACGCGCCAAACATAAAGATGAAGACGAGGTGGAGAGTGAGTCTTCAGCTCGTGGCATGACGCGAGGCGCGAGGGGCGTCAAGCCTAGGCAGCGTATAGTGATATTTTCAGACGAGGAGGAAGGAACCCCGGCCGCGTCTGTTCAGTGTGACAGTCAAACGGATATGTCATTAAAGCGTGGTCCAGGTCGGCCGAAAGGCTCCAAGAACGCGTCACCCCGGACTCTCGGGAGCGGGAGTGGTCGCCGGGGCAGGCCGCCGGTCCCGCCGGAACTACGCCTGCCTGGCATAACTGAAATGAAGAAGTTCTGCAAAGCGGCCGGCATTAGATTCGACTATAAGAAACTCGTTGAAGGATGCACAAATAACAAGGAGCGTGTGGCTAAGATGTTGGATCTACTGACAGCTGCGGGTCTTGAAGGGAAGCCGACGTTGGACAAATGTATAGCGTTGAAACAAGCCAAGCTAGGCAAGAGAGAACAAGAGATCCTGGCTAGGAAAGAGGCACGCGCCAAACATAAAGATGAAGACGAGGTGGAGAGTGAGTCTTCAGCTCGTGGCATGACGCGAGGCGCGAGGGGCGTCAAGCCTAGGCAGCGTATAGTGATATTTTCAGACGAGGAGGAAGGAACCCCGGCCGCGTCTGTTCAGTGTGACAGTCAAACGGATATGTCATTAAAGCGTGGTCCAGGTCGGCCGAAAGGCTCCAAGAACGCGTCACCCCGGACTCTCGGGAGCGGGAGTGGTCGCCGGGGCAGGCCGCCGGTCCCGCCGGAACTACGCCTGCCTGGCATAACTGAAATGAAGAAGTTCTGCAAAGCGGCCGGCATTAGATTCGACTATAAGAAACTCGTTGAAGGATGCACAAATAACAAGGAGCGTGTGGCTAAGATGTTGGATCTACTGACAGCTGCGGGTCTTGAAGGGAAGCCGACGTTGGACAAATGTATAGCGTTGAAACAAGCCAAGCTAGGCAAGAGAGAACAAGAGATCCTGGCTAGGAAAGAGGCACGCGCCAAACATAAAGATGAAGACGAGGTGGAGAGTGAGTCTTCAGCTCGTGGCATGACGCGAGGCGCGAGGGGCGTCAAGCCTAGGCAGCGTATAGTGATATTTTCAGACGAGGAGGAAGGAACCCCGGCCGCGTCTGTTCAGTGTGACAGTCAAACGGATATGTCATTAAAGCGTGGTCCAGGTCGGCCGAAAGGCTCCAAGAACGCGTCACCCCGGACTCTCGGGAGCGGGAGTGGTCGCCGGGGCAGGCCGCCGGTCCCGCCGGAACTACGCCTGCCTGGCATAACTGAAATGAAGAAGTTCTGCAAAGCGGCCGGCATTAGATTCGACTATAAGAAACTCGTTGAAGGATGCACAAATAACAAGGAGCGTGTGGCTAAGATGTTGGATCTACTGACAGCTGCGGGTCTTGAAGGGAAGCCGACGTTGGACAAATGTATAGCGTTGAAACAAGCCAAGCTAGGCAAGAGAGAACAAGAGATCCTGGCTAGGAAAGAGGCACGCGCCAAACATAAAGATGAAGACGAGGTGGAGAGTGAGTCTTCAGCTCGTGGCATGACGCGAGGCGCGAGGGGCGTCAAGCCTAGGCAGCGTATAGTGATATTTTCAGACGAGGAGGAAGGAACCCCGGCCGCGTCTGTTCAGTGTGACAGTCAAACGGATATGTCATTAAAGCGTGGTCCAGGTCGGCCGAAAGGCTCCAAGAACGCGTCACCCCGGACTCTCGGGAGCGGGAGTGGTCGCCGGGGCAGGCCGCCGGTCCCGCCGGAACTACGCCTGCCTGGCATAACTGAAATGAAGAAGTTCTGCAAAGCGGCCGGCATTAGATTCGACTATAAGAAACTCGTTGAAGGATGCACAAATAACAAGGAGCGTGTGGCTAAGATGTTGGATCTACTGACAGCTGCGGGTCTTGAAGGGAAGCCGACGTTGGACAAATGTATAGCGTTGAAACAAGCCAAGCTAGGCAAGAGAGAACAAGAGATCCTGGCTAGGAAAGAGGCACGCGCCAAACATAAAGATGAAGACGAGGTGGAGAGTGAGTCTTCAGCTCGTGGCATGACGCGAGGCGCGAGGGGCGTCAAGCCTAGGCAGCGTATAGTGATATTTTCAGACGAGGAGGAAGGAACCCCGGCCGCGTCTGTTCAGTGTGACAGTCAAACGGATATGTCATTAAAGCGTGGTCCAGGTCGGCCGAAAGGCTCCAAGAACGCGTCACCCCGGACTCTCGGGAGCGGGAGTGGTCGCCGGGGCAGGCCGCCGGTCCCGCCGGAACTACGCCTGCCTGGCATAACTGAAATGAAGAAGTTCTGCAAAGCGGCCGGCATTAGATTCGACTATAAGAAACTCGTTGAAGGATGCACAAATAACAAGGAGCGTGTGGCTAAGATGTTGGATCTACTGACAGCTGCGGGTCTTGAAGGGAAGCCGACGTTGGACAAATGTATAGCGTTGAAACAAGCCAAGCTAGGCAAGAGAGAACAAGAGATCCTGGCTAGGAAAGAGGCACGCGCCAAACATAAAGATGAAGACGAGGTGGAGAGTGAGTCTTCAGCTCGTGGCATGACGCGAGGCGCGAGGGGCGTCAAGCCTAGGCAGCGTATAGTGATATTTTCAGACGAGGAGGAAGGAACCCCGGCCGCGTCTGTTCAGTGTGACAGTCAAACGGATATGTCATTAAAGCGTGGTCCAGGTCGGCCGAAAGGCTCCAAGAACGCGTCACCCCGGACTCTCGGGAGCGGGAGTGGTCGCCGGGGCAGGCCGCCGGTCCCGCCGGAACTACGCCTGCCTGGCATAACTGAAATGAAGAAGTTCTGCAAAGCGGCCGGCATTAGATTCGACTATAAGAAACTCGTTGAAGGATGCACAAATAACAAGGAGCGTGTGGCTAAGATGTTGGATCTACTGACAGCTGCGGGTCTTGAAGGGAAGCCGACGTTGGACAAATGTATAGCGTTGAAACAAGCCAAGCTAGGCAAGAGAGAACAAGAGATCCTGGCTAGGAAAGAGGCACGCGCCAAACATAAAGATGAAGACGAGGTGGAGAGTGAGTCTTCAGCTCGTGGCATGACGCGAGGCGCGAGGGGCGTCAAGCCTAGGCAGCGTATAGTGATATTTTCAGACGAGGAGGAAGGAACCCCGGCCGCGTCTGTTCAGTGTGACAGTCAAACGGATATGTCATTAAAGCGTGGTCCAGGTCGGCCGAAAGGCTCCAAGAACGCGTCACCCCGGACTCTCGGGAGCGGGAGTGGTCGCCGGGGCAGGCCGCCGGTCCCGCCGGAACTACGCCTGCCTGGCATAACTGAAATGAAGAAGTTCTGCAAAGCGGCCGGCATTAGATTCGACTATAAGAAACTCGTTGAAGGATGCACAAATAACAAGGAGCGTGTGGCTAAGATGTTGGATCTACTGACAGCTGCGGGTCTTGAAGGGAAGCCGACGTTGGACAAATGTATAGCGTTGAAACAAGCCAAGCTAGGCAAGAGAGAACAAGAGATCCTGGCTAGGAAAGAGGCACGCGCCAAACATAAAGATGAAGACGAGGTGGAGAGTGAGTCTTCAGCTCGTGGCATGACGCGAGGCGCGAGGGGCGTCAAGCCTAGGCAGCGTATAGTGATATTTTCAGACGAGGAGGAAGGAACCCCGGCCGCGTCTGTTCAGTGTGACAGTCAAACGGATATGTCATTAAAGCGTGGTCCAGGTCGGCCGAAAGGCTCCAAGAACGCGTCACCCCGGACTCTCGGGAGCGGGAGTGGTCGCCGGGGCAGGCCGCCGGTCCCGCCGGAACTACGCCTGCCTGGCATAACTGAAATGAAGAAGTTCTGCAAAGCGGCCGGCATTAGATTCGACTATAAGAAACTCGTTGAAGGATGCACAAATAACAAGGAGCGTGTGGCTAAGATGTTGGATCTACTGACAGCTGCGGGTCTTGAAGGGAAGCCGACGTTGGACAAATGTATAGCGTTGAAACAAGCCAAGCTAGGCAAGAGAGAACAAGAGATCCTGGCTAGGAAAGAGGCACGCGCCAAACATAAAGATGAAGACGAGGTGGAGAGTGAGTCTTCAGCTCGTGGCATGACGCGAGGCGCGAGGGGCGTCAAGCCTAGGCAGCAACCTTCCTCGAGCATTACACTGACGAGGGACGGCTGTTGTTCACGAGCAGGGGACGGGGATTGA
- Protein Sequence
- MSVLFEALMFETAGLEGKPTLDKCIALKQAKLGKREQEILARKEARAKHKDEDEVESESSARGMTRGARGVKPRQRIVIFSDEEEGTPAASVQCDSQTDMSLKRGPGRPKGSKNASPRTLGSGSGRRGRPPVPPELRLPGITEMKKFCKAAGIRFDYKKLVEGCTNNKERVAKMLDLLTAAGLEGKPTLDKCIALKQAKLGKREQEILARKEARAKHKDEDEVESESSARGMTRGARGVKPRQRIVIFSDEEEGTPAASVQCDSQTDMSLKRGPGRPKGSKNASPRTLGSGSGRRGRPPVPPELRLPGITEMKKFCKAAGIRFDYKKLVEGCTNNKERVAKMLDLLTAAGLEGKPTLDKCIALKQAKLGKREQEILARKEARAKHKDEDEVESESSARGMTRGARGVKPRQRIVIFSDEEEGTPAASVQCDSQTDMSLKRGPGRPKGSKNASPRTLGSGSGRRGRPPVPPELRLPGITEMKKFCKAAGIRFDYKKLVEGCTNNKERVAKMLDLLTAAGLEGKPTLDKCIALKQAKLGKREQEILARKEARAKHKDEDEVESESSARGMTRGARGVKPRQRIVIFSDEEEGTPAASVQCDSQTDMSLKRGPGRPKGSKNASPRTLGSGSGRRGRPPVPPELRLPGITEMKKFCKAAGIRFDYKKLVEGCTNNKERVAKMLDLLTAAGLEGKPTLDKCIALKQAKLGKREQEILARKEARAKHKDEDEVESESSARGMTRGARGVKPRQRIVIFSDEEEGTPAASVQCDSQTDMSLKRGPGRPKGSKNASPRTLGSGSGRRGRPPVPPELRLPGITEMKKFCKAAGIRFDYKKLVEGCTNNKERVAKMLDLLTAAGLEGKPTLDKCIALKQAKLGKREQEILARKEARAKHKDEDEVESESSARGMTRGARGVKPRQRIVIFSDEEEGTPAASVQCDSQTDMSLKRGPGRPKGSKNASPRTLGSGSGRRGRPPVPPELRLPGITEMKKFCKAAGIRFDYKKLVEGCTNNKERVAKMLDLLTAAGLEGKPTLDKCIALKQAKLGKREQEILARKEARAKHKDEDEVESESSARGMTRGARGVKPRQRIVIFSDEEEGTPAASVQCDSQTDMSLKRGPGRPKGSKNASPRTLGSGSGRRGRPPVPPELRLPGITEMKKFCKAAGIRFDYKKLVEGCTNNKERVAKMLDLLTAAGLEGKPTLDKCIALKQAKLGKREQEILARKEARAKHKDEDEVESESSARGMTRGARGVKPRQRIVIFSDEEEGTPAASVQCDSQTDMSLKRGPGRPKGSKNASPRTLGSGSGRRGRPPVPPELRLPGITEMKKFCKAAGIRFDYKKLVEGCTNNKERVAKMLDLLTAAGLEGKPTLDKCIALKQAKLGKREQEILARKEARAKHKDEDEVESESSARGMTRGARGVKPRQRIVIFSDEEEGTPAASVQCDSQTDMSLKRGPGRPKGSKNASPRTLGSGSGRRGRPPVPPELRLPGITEMKKFCKAAGIRFDYKKLVEGCTNNKERVAKMLDLLTAAGLEGKPTLDKCIALKQAKLGKREQEILARKEARAKHKDEDEVESESSARGMTRGARGVKPRQQPSSSITLTRDGCCSRAGDGD
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -