Csup006877.1
Basic Information
- Insect
- Chilo suppressalis
- Gene Symbol
- PAX1
- Assembly
- None
- Location
- chr15:1874187-1942119[+]
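
The location string can be split into its parts programmatically. The following is a minimal sketch; `parse_locus` is a hypothetical helper, not part of any database API, and the span calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re

def parse_locus(locus):
    """Split a 'chrom:start-end[strand]' string into typed fields."""
    match = re.fullmatch(r"(\w+):(\d+)-(\d+)\[([+-])\]", locus)
    if match is None:
        raise ValueError(f"unrecognised locus string: {locus!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_locus("chr15:1874187-1942119[+]")

# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the gene spans 67,933 bp on the plus strand, which is far longer than the coding sequence and so suggests that most of the locus is intronic.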
Transcription Factor Domain
- TF Family
- PAX
- Domain
- PAX domain
- PFAM
- PF00292
- TF Group
- Helix-turn-helix
- Description
- The paired domain, a ~126 amino acid DNA-binding domain, is found in eukaryotic transcription regulatory proteins involved in embryogenesis. Initially identified in Drosophila’s paired (prd) protein, it typically resides in the N-terminal region and may be followed by an octapeptide, a homeodomain, or a Pro-Ser-Thr-rich C terminus. Paired domain proteins act as transcription repressors or activators, with DNA-binding specificity mediated by three subdomains. Crystal structures reveal a bipartite DNA-binding paired domain: an N-terminal subdomain (PAI) and a C-terminal subdomain (RED), linked by a flexible linker. Both subdomains contain a helix-turn-helix motif that binds DNA's major groove, while the linker may bind the minor groove. Variations in domain usage across Pax proteins and isoforms determine sequence specificity.
- Hmmscan Out
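| # | of | c-Evalue | i-Evalue | score | bias | hmm from | hmm to | ali from | ali to | env from | env to | acc |
|---|----|----------|----------|-------|------|----------|--------|----------|--------|----------|--------|-----|
| 1 | 1 | 2.5e-69 | 6.2e-66 | 219.7 | 0.2 | 3 | 125 | 49 | 171 | 47 | 171 | 0.99 |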
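
The alignment coordinates in the hmmscan row above can be used to cut the PAX domain out of the protein sequence. This is a minimal sketch: the field names in `FIELDS` are labels chosen here for illustration, and the slice assumes the usual HMMER convention of 1-based, inclusive coordinates.

```python
# Column labels chosen for this sketch; they mirror the hmmscan header order.
FIELDS = [
    "domain", "of", "c_evalue", "i_evalue", "score", "bias",
    "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to", "acc",
]

row = "1 1 2.5e-69 6.2e-66 219.7 0.2 3 125 49 171 47 171 0.99"
hit = dict(zip(FIELDS, row.split()))

# Protein sequence copied from this record.
protein = (
    "MMLGGMDGKEYGALHAAAAAAGYPMETDPSGGGMGGVGGVGGAGGQQYGEVNQLGGVFVNGRPLPNAVRLRIV"
    "ELAQLGIRPCDISRQLRVSHGCVSKILARYHETGSILPGAIGGSKPRVTTPKVVSYIKQLKAKDPGIFAWEIR"
    "DRLLADGVCDKYNVPSVSSISRILRNKLGGGGLYPVPPLYPVPAQRCWPLHQPYDYYVYLQGRQHAGGAPHGL"
    "PHPHQQGPHHAHAHAL"
)

ali_from = int(hit["ali_from"])
ali_to = int(hit["ali_to"])

# Convert 1-based inclusive coordinates to a Python slice.
domain_seq = protein[ali_from - 1:ali_to]

# Number of profile positions covered by the alignment.
hmm_span = int(hit["hmm_to"]) - int(hit["hmm_from"]) + 1

print(len(domain_seq), hmm_span)
print(domain_seq)
```

Both the aligned region and the matched profile segment are 123 positions long, so the hit covers the PF00292 model from position 3 to 125.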
Sequence Information
- Coding Sequence
- ATGATGCTGGGCGGCATGGACGGCAAGGAGTACGGCGCGCTGCACGCCGCCGCGGCCGCCGCTGGATACCCTATGGAAACCGACCCGAGTGGCGGTGGCATGGGCGGCGTGGGTGGCGTGGGCGGCGCGGGAGGGCAGCAGTATGGGGAGGTGAATCAGCTGGGCGGCGTGTTCGTCAACGGCCGCCCACTGCCCAATGCCGTGCGACTACGTATCGTAGAACTGGCGCAACTTGGGATCAGGCCGTGTGATATCAGCCGGCAGCTGCGGGTGTCCCACGGCTGCGTGTCCAAGATCCTGGCGCGGTACCACGAAACTGGCTCCATCCTGCCGGGAGCCATCGGAGGCTCAAAGCCTAGGGTCACCACTCCTAAGGTGGTGTCATACATAAAGCAATTGAAGGCGAAGGACCCTGGAATCTTCGCGTGGGAGATCCGCGACCGGCTGCTGGCAGACGGCGTGTGCGACAAGTACAACGTGCCCTCAGTATCCAGCATCAGCCGAATCCTACGCAATAAGCTAGGCGGCGGCGGGCTATACCCCGTGCCGCCGCTATACCCCGTGCCGGCTCAGCGGTGTTGGCCGCTACACCAACCATACGACTACTATGTGTACCTACAGGGCCGTCAGCATGCGGGAGGGGCCCCGCACGGGTTGCCGCACCCGCACCAGCAGGGGCCTCACCACGCGCACGCACATGCGCTCTGA
- Protein Sequence
- MMLGGMDGKEYGALHAAAAAAGYPMETDPSGGGMGGVGGVGGAGGQQYGEVNQLGGVFVNGRPLPNAVRLRIVELAQLGIRPCDISRQLRVSHGCVSKILARYHETGSILPGAIGGSKPRVTTPKVVSYIKQLKAKDPGIFAWEIRDRLLADGVCDKYNVPSVSSISRILRNKLGGGGLYPVPPLYPVPAQRCWPLHQPYDYYVYLQGRQHAGGAPHGLPHPHQQGPHHAHAHAL
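
The coding and protein sequences can be checked against each other by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1) and, to stay short, translates only the first 30 nucleotides of the coding sequence. `translate` is a hypothetical helper written for this example.

```python
# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon, stopping at a stop codon."""
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

# First 30 nucleotides of the coding sequence in this record.
cds_prefix = "ATGATGCTGGGCGGCATGGACGGCAAGGAG"
print(translate(cds_prefix))
```

Passing the full coding sequence to `translate` and comparing the result with the listed protein sequence gives a quick consistency check for the record.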
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00924629;
- 90% Identity
- iTF_00409250; iTF_01367874; iTF_00275235; iTF_00356165; iTF_00012990; iTF_00876134; iTF_01529160; iTF_00661250; iTF_01429016; iTF_00267153; iTF_00662193; iTF_01528256; iTF_00125098; iTF_01161322; iTF_00077451; iTF_00382571; iTF_00418956; iTF_00419725; iTF_00421304; iTF_00420512; iTF_01336812; iTF_00806702; iTF_01171438; iTF_01170568; iTF_00119567; iTF_00715759; iTF_00947856; iTF_01209232; iTF_00032881; iTF_00390340; iTF_00895912; iTF_00649065; iTF_00666505; iTF_00825705; iTF_00649978; iTF_00826940; iTF_00143038; iTF_00033775; iTF_01438896; iTF_00651716; iTF_01220622; iTF_00935699; iTF_01437945; iTF_00791164; iTF_00790233; iTF_00318066; iTF_00345778; iTF_01245906; iTF_00357459; iTF_00358532; iTF_00377075; iTF_00663044; iTF_00737451; iTF_00856596; iTF_00810999; iTF_00701939; iTF_00836531; iTF_00835566; iTF_00323478; iTF_01081558; iTF_00448104; iTF_00006249; iTF_00035627; iTF_00034700; iTF_01490097; iTF_01334810; iTF_00284507; iTF_01561974; iTF_00236599; iTF_00235828; iTF_00683287; iTF_00682378; iTF_00361592; iTF_00827928; iTF_01117183; iTF_00375175; iTF_00374065; iTF_00461529; iTF_00647961; iTF_00640290; iTF_00720263; iTF_00408425; iTF_00960896; iTF_01302307; iTF_00752041; iTF_00736590; iTF_00072785; iTF_00011957; iTF_01281308; iTF_00858511; iTF_00953483; iTF_01091883; iTF_00801818; iTF_00954432; iTF_00467381; iTF_00468144; iTF_01279223; iTF_01358841; iTF_00341287; iTF_00859423; iTF_00673472; iTF_00010013; iTF_01464129; iTF_00008951; iTF_01463224; iTF_01034558; iTF_01335903; iTF_01547519; iTF_00150496;
- 80% Identity
- -