Esep015862.1
Basic Information
- Insect
- Eristalinus sepulchralis
- Gene Symbol
- PAX6
- Assembly
- GCA_944738805.1
- Location
- CALYJE010000177.1:812458-824651[-]
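The Location field packs contig, start, end, and strand into one string. A minimal sketch of how it could be split apart follows; the `Location` type and `parse_location` helper are hypothetical names, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a 'contig:start-end[strand]' string as shown in the Location field."""
    match = re.fullmatch(r"(\S+):(\d+)-(\d+)\[([+-])\]", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = match.groups()
    return Location(contig, int(start), int(end), strand)


loc = parse_location("CALYJE010000177.1:812458-824651[-]")

# Genomic span of the locus, ASSUMING 1-based inclusive coordinates.
span = loc.end - loc.start + 1
print(loc, span)
```

A "-" strand means the gene is read from the reverse complement of the contig sequence, so any sequence extracted from these coordinates would need to be reverse-complemented before comparison with the coding sequence.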
Transcription Factor Domain
- TF Family
- PAX
- Domain
- PAX domain
- PFAM
- PF00292
- TF Group
- Helix-turn-helix
- Description
- The paired domain, a ~126 amino acid DNA-binding domain, is found in eukaryotic transcription regulatory proteins involved in embryogenesis. Initially identified in Drosophila’s paired (prd) protein, it typically resides in the N-terminal region and may be followed by an octapeptide, a homeodomain, or a Pro-Ser-Thr-rich C terminus. Paired domain proteins act as transcription repressors or activators, with DNA-binding specificity mediated by three subdomains. Crystal structures reveal that the DNA-bound paired domain is bipartite: an N-terminal subdomain (PAI) and a C-terminal subdomain (RED), connected by a flexible linker. Both subdomains contain a helix-turn-helix motif that binds the major groove of DNA, while the linker may bind the minor groove. Differences in which subdomains are used across Pax proteins and their isoforms determine sequence specificity.
- Hmmscan Out
- Domain 1 of 1: c-Evalue 4.4e-37; i-Evalue 8.5e-34; score 116.3; bias 0.0; hmm coord 12 to 125; ali coord 56 to 169; env coord 47 to 169; acc 0.88
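The hmmscan hit above is a single row of per-domain values. As a rough sketch, the row can be mapped onto named fields and the aligned lengths derived from the coordinates. The field names below are hypothetical labels chosen to mirror the column headers in this record, and the lengths assume inclusive coordinates.

```python
# Hypothetical field names mirroring the column headers of the hmmscan row.
FIELDS = [
    "domain_index", "domain_count", "c_evalue", "i_evalue", "score", "bias",
    "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to", "acc",
]
INT_FIELDS = {
    "domain_index", "domain_count",
    "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to",
}


def parse_domain_row(row: str) -> dict:
    """Split one whitespace-separated domain row into typed, named values."""
    values = row.split()
    if len(values) != len(FIELDS):
        raise ValueError(f"expected {len(FIELDS)} columns, got {len(values)}")
    return {
        name: int(value) if name in INT_FIELDS else float(value)
        for name, value in zip(FIELDS, values)
    }


hit = parse_domain_row("1 1 4.4e-37 8.5e-34 116.3 0.0 12 125 56 169 47 169 0.88")

# Lengths of the aligned region on the protein and on the profile HMM,
# ASSUMING both coordinate pairs are inclusive.
ali_len = hit["ali_to"] - hit["ali_from"] + 1
hmm_len = hit["hmm_to"] - hit["hmm_from"] + 1
print(ali_len, hmm_len)
```

Under that assumption, the PAX domain alignment covers residues 56 to 169 of the protein, and the envelope (47 to 169) extends slightly further toward the N terminus.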
Sequence Information
- Coding Sequence
- ATGTTGATTGAATCGGGTGTGAGTCGCGAATTTGGATCGTTACCAACATCAGCTCAGTGGAAAATGGAGCCTCCAAGGATACCAATGACAGGTGGCGCTTCTATACCGCCACCCCCACCAACTGTTGGTCTCTCACCGTCAGTTCAAATGCCTGGATCAACGCTCTTCACGGGGGGATCACCTTCGCCGACAACACTCAGCGCGCTAGTATCGCAGCAACGGATTCTCGAGCTCTCGCGCTTCGGACTGCGGGGCTACGACATCGCCCAACACATGCTGACCCAGCAAGGCGCGGTCTCCAAATTACTCGGTTCGCTGCGACCACCTGGCCTAATTGGTGGCTCGAAGCCAAAGGTAGCTACTCCGACGGTGGTGTCCAAGATCGAGCAGTACAAGCGGGAGAACCCGACCATATTCGCCTGGGAAATTCGTGAGCGCTTGATCTCGGAAGGTGTCTGCACGAATGCCACGGCCCCGTCGGTGTCGTCCATTAATCGTATTCTGCGAAACCGTGCCGCGGAACGCGTTGCCAGTGAATTTGCTCGGACTGCGGCCTATGGGCTCTATCCTCCTCATCCCTACGGTGGATTCACGTGGCATCCGGCGGCAGCGGCAGCTGCTGCAGCCTCCAGTCAACCGCACTTCTGGCCGACCGGACCCAGCTTACCGTCAGTCCCAGTTGGTGGGTCGCCTCTCAATGTGTCCAGACCCATTTCACCTGGTTCTGGGAGCCACGACACTCTGGAGTCTCCCGACGAGAATCGGCTAATCGATTCTGACTACTTGGACGATGATGACGAGCCGAAGTTTCGCCGGAATCGCACAACCTTCAGTCCCGAGCAGCTGGAGGAGCTAGAGAAGGAGTTCGACAAGTCGCACTATCCGTGTGTCAGCACAAGAGAACGGCTCTCTAGTCGCACGTCGCTGAGCGAAGCCCGAGTTCAGGTTTGGTTTTCCAACAGAAGAGCGAAGTGGCGCCGGCACCAACGCATGAATCTGTTGAAGCGACGTTCCAGTCCAACTCTGGGCCAGCAGCAGCGGGCTGTCGCCGATTCGCCACCCATCAGCAACAGCAGCAGCATCACCAGTCACCAGCATACCACCCGTACTTCCGGCAGCGAGGAGCACCCGCATGGCGGGCACCCATCCCAGCAGCACCATCAGGCGCAGTCCTCCGAAGCTTCGGCCAGCATCCGGTCCTCATTGACCCATCCCATGCAGCACCACCACCACCATCAGCACCCACCTTCGCAGCCATTGCTTCTGAACCATCACCACTATCCCCACCAGCATTATCCCCATCCTCCACCAAGAGCGTCCAGCCTCAGCCCGAAGTCGCCACCGGCAGCGGCGCCACAATCACCGATGACGGTGGCAGCGGAGAAATCTCCCAAGGCGGCAGTCACACAGCACAACCAGTCATCAGCCACAGCATCAGCTCCATTATCAATGGGCGGTGAGCGCAGTGCGTTCCGCTCTCTGGTCGCTGGCTCGCCATCGGCCGCTGCCTTTGCACTCGGCCTGGCGCGACAATACGCCGTTGCTGCCTCACTTTCGCAGCAACAGCAGCAGCAACATCATCAACACCAGCCGCACCAGCAGCAGGCTGACGACTCGGACTCCGACGAAGAAATCAACGTCCACGACGACTCGGATGATGACTCATTGCCGGCTGCTGCGGCGGCCGCCGGCCAAAGCTCCAAAGCGCCTCTCCAGTTGACCAAGCATGACCGTTAG
- Protein Sequence
- MLIESGVSREFGSLPTSAQWKMEPPRIPMTGGASIPPPPPTVGLSPSVQMPGSTLFTGGSPSPTTLSALVSQQRILELSRFGLRGYDIAQHMLTQQGAVSKLLGSLRPPGLIGGSKPKVATPTVVSKIEQYKRENPTIFAWEIRERLISEGVCTNATAPSVSSINRILRNRAAERVASEFARTAAYGLYPPHPYGGFTWHPAAAAAAAASSQPHFWPTGPSLPSVPVGGSPLNVSRPISPGSGSHDTLESPDENRLIDSDYLDDDDEPKFRRNRTTFSPEQLEELEKEFDKSHYPCVSTRERLSSRTSLSEARVQVWFSNRRAKWRRHQRMNLLKRRSSPTLGQQQRAVADSPPISNSSSITSHQHTTRTSGSEEHPHGGHPSQQHHQAQSSEASASIRSSLTHPMQHHHHHQHPPSQPLLLNHHHYPHQHYPHPPPRASSLSPKSPPAAAPQSPMTVAAEKSPKAAVTQHNQSSATASAPLSMGGERSAFRSLVAGSPSAAAFALGLARQYAVAASLSQQQQQQHHQHQPHQQQADDSDSDEEINVHDDSDDDSLPAAAAAAGQSSKAPLQLTKHDR
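The coding and protein sequences above can be checked against each other by translating the coding sequence. The sketch below assumes the standard nuclear genetic code (the record does not name the translation table used), and `translate` is a hypothetical helper. For brevity it is applied only to the first and last few codons of the coding sequence listed above.

```python
from itertools import product

# Standard genetic code, amino acids listed in TCAG codon order.
# ASSUMPTION: the record was translated with the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, dropping one trailing stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First six codons and last four codons of the coding sequence in this record.
head = translate("ATGTTGATTGAATCGGGT")
tail = translate("CATGACCGTTAG")
print(head, tail)
```

The translated fragments correspond to the start (`MLIESG`) and end (`HDR`) of the listed protein sequence; passing the full coding sequence to `translate` would allow the same comparison over the whole protein.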
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00671450; iTF_00672043; iTF_00672806; iTF_00672176; iTF_00672692; iTF_00670903; iTF_00670782; iTF_01541391; iTF_01541262; iTF_01542007; iTF_01542160; iTF_00389580; iTF_00389450; iTF_01223373; iTF_01223374; iTF_01223216; iTF_01223217; iTF_00240494; iTF_00240674; iTF_01521843; iTF_01521710; iTF_01395799; iTF_01395665; iTF_00984057; iTF_00984195; iTF_01116282; iTF_01116409; iTF_01002588; iTF_01002732; iTF_01520796; iTF_01521004; iTF_00976285; iTF_00976438; iTF_01044671; iTF_01044562;
- 90% Identity
- iTF_00670782;
- 80% Identity
- iTF_00671450;
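The cluster lists above are semicolon-separated identifiers with a trailing separator. A small sketch of how they could be turned into clean lists follows; `parse_ids` is a hypothetical helper, and the example uses only the first three identifiers from the 100% list together with the 90% and 80% entries.

```python
def parse_ids(field: str) -> list[str]:
    """Split a semicolon-separated ID field, ignoring blanks and duplicates."""
    seen = set()
    ids = []
    for token in field.split(";"):
        token = token.strip()
        if token and token not in seen:
            seen.add(token)
            ids.append(token)
    return ids


# Abbreviated example: the first three IDs of the 100% list in this record.
ids_100 = parse_ids("iTF_00671450; iTF_00672043; iTF_00672806;")
ids_90 = parse_ids("iTF_00670782;")
ids_80 = parse_ids("iTF_00671450;")

# Check whether the 80% entry also appears among the 100% list members.
shared = [tf_id for tf_id in ids_80 if tf_id in ids_100]
print(ids_100, ids_90, shared)
```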