Tact027054.1
Basic Information
- Insect
- Thymelicus acteon
- Gene Symbol
- PAX6
- Assembly
- GCA_951805285.1
- Location
- OX638234.1:5018012-5082656[+]
Transcription Factor Domain
- TF Family
- PAX
- Domain
- PAX domain
- PFAM
- PF00292
- TF Group
- Helix-turn-helix
- Description
- The paired domain, a ~126 amino acid DNA-binding domain, is found in eukaryotic transcription regulatory proteins involved in embryogenesis. Initially identified in Drosophila’s paired (prd) protein, it typically resides in the N-terminal region and may be followed by an octapeptide, a homeodomain, or a Pro-Ser-Thr-rich C terminus. Paired domain proteins act as transcription repressors or activators, with DNA-binding specificity mediated by three subdomains. Crystal structures reveal a bipartite DNA-binding paired domain: an N-terminal subdomain (PAI) and a C-terminal subdomain (RED), linked by a flexible linker. Both subdomains contain a helix-turn-helix motif that binds DNA's major groove, while the linker may bind the minor groove. Variations in domain usage across Pax proteins and isoforms determine sequence specificity.
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 1 4.1e-73 7.5e-70 233.6 0.7 1 125 30 154 30 154 0.99
Sequence Information
- Coding Sequence
- ATGCTGCCGGCGCAGCAGCTGCCTCCAATCAGCCCCTGGCCGCCGGACGCCAACCTCCTGGACCGCATGGATGACCTGGCTCACAAAGGTCACAGCGGTGTTAACCAGCTCGGCGGCGTGTTTGTAGGAGGGAGACCCCTGCCCGACTCCACGCGGCAGAAGATCGTGGAGCTAGCCCACTCCGGAGCCCGGCCCTGCGACATCAGCCGCATCCTCCAGGTCTCCAACGGCTGCGTCTCCAAGATACTCGGCAGGTATTACGAGACGGGGTCAATAAGACCCCGAGCGATCGGCGGGTCAAAACCGAGAGTAGCAACTGCGGAAGTGGTCAGCAAGATCGCGCAATACAAGCGCGAATGCCCGTCGATCTTCGCTTGGGAGATCAGAGATCGTCTGCTGAGTGAAGGAGTCTGCTCATCGGATAATATACCTAGCGTTTCCTCAATCAACCGCGTACTCCGCAACCTAGCAGCTCAGAAGGAGAAGTCCAGCAACCAGCAACCCTCAGACTGCACCACGCCTGTGTACGAGAGGCTGCGACTCCTGGGAACACCAGGGACAACCCCCAGCTGGCCTCGGGCACCCTGGCCCACGCAGATAGACACCAGGACGCCCCCCTACCAGCTCCATAGTCTCAGTCCTGGGCCGCAGGCTATTGGATGCAATGGAGGGGAAATTCCAGGGATGAAGAAAGGTGAGGAGCCACTCGAAGGTCTGGAAGGACTGCACTCGGACGAGACGGGGTCCGGAGACAACTCGAACGCGGGCTCCAGCGGTGCGGACGACGACGCAGCCCGACTGCGCCTGAAGAGGAAGCTGCAGCGCAACCGCACCAGCTTCACCAACGAGCAGATTGATAGCTTGGAGAGAGAATTCGAGCGTACCCACTACCCTGATGTGTTCGCCCGAGAACGACTGGCGTCCAAGATAGGGCTCCCGGAGGCACGGATACAGGTGTGGTTCTCGAATCGGCGCGCGAAGTGGCGTCGCGAGGAGAAGATCCGGTCCCAGCGGCGGAGCCCCGAGTGCTACGGGGGGCCCGTGCCGCCCCCCGCGGGGGTGCAGCCCCCCGCGCCGCACCCCCACGCGCCCGCGCACCCCACGCACCATGCGTACCAGCCGCCGCTCAATGTTGACACGTATAGCCCGCTAACCCCGATGGGGTACGGCGGCGGCATGGTGGCGGAGACGCCGGTGTGCGAGGGCGGCGAGGCGGAGCTCTGCAAGGGCGGCTTCCTGGGGTACCCGCGCTACGAGCTGGGGTACCGCCAGCCACACCCCTACGTCAACCACGGCTACCAGCATGAACCGCCCTCACCTGCGCCAGAGCTCGGAGGCCTCCTGGGCGCCGGCGTGTCGGTCCCGCTGGCGGTCCCGGGACAGGACTCGCAGCAGTACTGGTCCCGGCTGCAGTGA
- Protein Sequence
- MLPAQQLPPISPWPPDANLLDRMDDLAHKGHSGVNQLGGVFVGGRPLPDSTRQKIVELAHSGARPCDISRILQVSNGCVSKILGRYYETGSIRPRAIGGSKPRVATAEVVSKIAQYKRECPSIFAWEIRDRLLSEGVCSSDNIPSVSSINRVLRNLAAQKEKSSNQQPSDCTTPVYERLRLLGTPGTTPSWPRAPWPTQIDTRTPPYQLHSLSPGPQAIGCNGGEIPGMKKGEEPLEGLEGLHSDETGSGDNSNAGSSGADDDAARLRLKRKLQRNRTSFTNEQIDSLEREFERTHYPDVFARERLASKIGLPEARIQVWFSNRRAKWRREEKIRSQRRSPECYGGPVPPPAGVQPPAPHPHAPAHPTHHAYQPPLNVDTYSPLTPMGYGGGMVAETPVCEGGEAELCKGGFLGYPRYELGYRQPHPYVNHGYQHEPPSPAPELGGLLGAGVSVPLAVPGQDSQQYWSRLQ
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01437807; iTF_01125283; iTF_01124560; iTF_01124438; iTF_01125431; iTF_01506307; iTF_01506179; iTF_00985064; iTF_00984929; iTF_00077322; iTF_00077447; iTF_01203884; iTF_01204004; iTF_00953357; iTF_00953480; iTF_00357456; iTF_00357263; iTF_00705968; iTF_00705827; iTF_00445137; iTF_01377441; iTF_00042467; iTF_00042597; iTF_01363078; iTF_01425031; iTF_00809897; iTF_01342057; iTF_01342185; iTF_01533769; iTF_01230346; iTF_01533900; iTF_00810029; iTF_01230541; iTF_00808944; iTF_00446099; iTF_01362932; iTF_01424884; iTF_00445944; iTF_00444953; iTF_01377309; iTF_00809088; iTF_00785857; iTF_00783268; iTF_00783432; iTF_00785972; iTF_00823867; iTF_00823826; iTF_00185277; iTF_00185153; iTF_01147348; iTF_01147250; iTF_01203092; iTF_01203206; iTF_00819894; iTF_00820039; iTF_00827926; iTF_00827817; iTF_01071479; iTF_01071589; iTF_01144503; iTF_01144395; iTF_01140782; iTF_01149420; iTF_01140883; iTF_01149558; iTF_00185976; iTF_00186145; iTF_00697304; iTF_00697167; iTF_00878712; iTF_00878819; iTF_01151064; iTF_01151174; iTF_00710169; iTF_00710024; iTF_01281305; iTF_01281144; iTF_00842491; iTF_00842631; iTF_01143067; iTF_01142953; iTF_00150493; iTF_00150318; iTF_01083293; iTF_01083139; iTF_00318782; iTF_00318830; iTF_00235711; iTF_00235825;
- 90% Identity
- iTF_01125283;
- 80% Identity
- iTF_01437807;