Tdiv018019.1
Basic Information
- Insect
- Tetrapedia diversipes
- Gene Symbol
- PAX6_5
- Assembly
- GCA_033822845.1
- Location
- JAOPTO010002141.1:1253204-1271037[-]
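The Location value above packs contig, coordinates, and strand into one string. A minimal sketch of splitting it into fields follows. The function name `parse_location` and the `GenomicLocation` type are hypothetical, and the span calculation assumes 1-based inclusive coordinates, which the record does not state.

```python
import re
from typing import NamedTuple


class GenomicLocation(NamedTuple):
    """Hypothetical container for a 'contig:start-end[strand]' string."""
    contig: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "JAOPTO010002141.1:1253204-1271037[-]".
LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> GenomicLocation:
    """Split a location string into contig, start, end, and strand."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return GenomicLocation(
        contig=match["contig"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("JAOPTO010002141.1:1253204-1271037[-]")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = loc.end - loc.start + 1
print(loc, span)
```

The genomic span is much longer than the coding sequence listed further down, which would be consistent with an intron-containing gene model on the minus strand.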
Transcription Factor Domain
- TF Family
- PAX
- Domain
- PAX domain
- PFAM
- PF00292
- TF Group
- Helix-turn-helix
- Description
- The paired domain, a ~126 amino acid DNA-binding domain, is found in eukaryotic transcription regulatory proteins involved in embryogenesis. Initially identified in Drosophila’s paired (prd) protein, it typically resides in the N-terminal region and may be followed by an octapeptide, a homeodomain, or a Pro-Ser-Thr-rich C terminus. Paired domain proteins act as transcription repressors or activators, with DNA-binding specificity mediated by three subdomains. Crystal structures reveal a bipartite DNA-binding paired domain: an N-terminal subdomain (PAI) and a C-terminal subdomain (RED), linked by a flexible linker. Both subdomains contain a helix-turn-helix motif that binds DNA's major groove, while the linker may bind the minor groove. Variations in domain usage across Pax proteins and isoforms determine sequence specificity.
- Hmmscan Out
-
  #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
  1  1   1.6e-30   1.9e-27   96.4   0.0   61        125     218       283     179       283     0.93
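The single domain hit in the Hmmscan Out row can be read into named fields, which makes the coordinate pairs easier to work with. This is a minimal sketch: `DomainHit` and `parse_domain_row` are hypothetical names, and the column order is taken from the header shown in this record, not from any parser library.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """One row of the per-domain hmmscan table, in the record's column order."""
    index: int
    of: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# Columns that hold whole numbers; every other column is parsed as a float.
INT_FIELDS = {
    "index", "of", "hmm_from", "hmm_to",
    "ali_from", "ali_to", "env_from", "env_to",
}


def parse_domain_row(row: str) -> DomainHit:
    """Convert a whitespace-separated value row into a DomainHit."""
    values = row.split()
    if len(values) != len(DomainHit._fields):
        raise ValueError(
            f"expected {len(DomainHit._fields)} values, got {len(values)}"
        )
    converted = [
        int(value) if name in INT_FIELDS else float(value)
        for name, value in zip(DomainHit._fields, values)
    ]
    return DomainHit(*converted)


hit = parse_domain_row("1 1 1.6e-30 1.9e-27 96.4 0.0 61 125 218 283 179 283 0.93")

# Lengths of the matched regions, assuming 1-based inclusive coordinates.
hmm_span = hit.hmm_to - hit.hmm_from + 1
ali_span = hit.ali_to - hit.ali_from + 1
print(hit.i_evalue, hmm_span, ali_span)
```

Under the same 1-based inclusive assumption, the aligned region of the protein would be the slice `protein[hit.ali_from - 1:hit.ali_to]`, where `protein` is the Protein Sequence string from this record.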
Sequence Information
- Coding Sequence
- ATGATGATCGGTGGCGGCGTCGCGACCCCTCCTGGTACTGCCAAGTCGGCGGCTGCAATCGACAGCGAAATGGTGCAAGTCGATGGCATTGGCAGTGGTAGTGTCAGTGGCAGTGCTAGTGGTAGTGGTAGCCCTGCGACAGGCGCAACTACTACGCGTCTAACCAATAGCGGCAATACTGGCAGCAACAACAATAACGGGGATGTTGCgaacaataataacaacaacaacaacgagGGACATTCGGTGGTGAACTGCGCCAGTGGCCCGGCTGCGGCTACCGATAAATTCTGTTGTCACGACGACGATGCCCCCGCGAGTACCGGCGTCGCGCGACCTTGGGAACCACCCTCGCCGACCTTCTCGCAAACGAGGTCGCATCTGCTCCAGGTCCATCCGCCGCACCATCATCAGCATCCAGCTGCgcatcatcatcatcaacaGATGACCGTGCCGCCAGTTTCTTTGATCGACAGCGTGAGCTTGCAAAACTGTACAGCCGGGCCGTTCGCCAGCGGCAACGGCGGTGGGACCAGTAGACAACGATTGCTCGAGCTATCGCACGGATTAGGGGCCCTGAGGCATTACAACGACCTGGCCAATCACGTGCTGTCGCTGAATCAACAGGGCGCGGTCGTCACCAAACTCCTGGGTACGCTACGCCCGCCGGGTTTAATCGGTGGCAGCAAGCCAAAAGTCGCGACGCCGGCCGTCGTCGCCAAGATCGAGCAATACAAGAGGGAGAACCCGACGATTTTCGCCTGGGAGATTCGAGAGCGACTCATCTCTGAAGGCGTGTGCAGCAACGCGACGGCCCCATCCGTCAGCAGCATCAACCGAATCCTGAGGAATCGAGCTGCTGAACGGGCAGCTGCAGAATTTGCAAGGGCGGCAGGTTACGGACTGTACGCAGCGGGTCCGCATCCGTACTTCAACAGCGCTCATCAGCATCCAACAACCAGCCATCATCTACCCGCCGGCTGGCCAGCACCGGGAGCGGCGGGCCACCCTTGGATGTTGCCTCCGTTGGCTACCGGCATCTCCGGCGCGGCTTCCGCTCTGCTTCTGCCGCCGTCCTTGAGCCCAGGAGCAGCGGCAGCCGCAGCCGCAGCCGCCTCCGCGTCGGCGGCCGGGACCACGGACCATTCTCTGCACGCCGACGCCATTGCTCGAGGCTATCTACAAGATGGCGACGGTGACGAGGGAAGCCTGGACGGATCGGAACAGCCAAAGTTCCGAAGGAACCGCACCACCTTCAGCCCGGAACAGCTCGAGGAGCTCGAAAAAGAATTCGAGCGATCCCATTATCCTTGCGTGTCTACCCGCGAACGGCTAGCTTCGAAGACCTCTCTGTCGGAAGCTCGTGTACAGGTTTGGTTTTCCAACAGACGAGCAAAATGGCGTCGTCACCAACGAATGAACCTCTTAAAGCGTTCGCCGCCGCCACCACCGCCACCTCCGCCGCCACCACCGCTGccacagcagcagcagcaacagcagcagcagccgcCGCAGCAGCCGCAACCGCATTCCGCATCCGGTATGGAAATAAACCGTGCGTCAAGCTGCTCAATCGCCGGAATGGGAGGAGAAAGTAGCGCGTTCCGGGCCGTCGTCACGAATTCCTCCTCCAGAGAGGCCGCAGAAAGGAACGAGAGGATCGACAGGGTTGACTCGATCGCTAAACAGCCGGAAAGAAAGCCTAGCGCCTTCAGGATGATCAGCCAATTGGTTGGCGAGGATTCGCCTAGCATACCGAGGTCCTCGAGTCCGTCGTACGAGCAACAGAGGCACGAGTCCAACATTGGGTCCGCGGTTGGATCCGAGTCGACGAATAAAAGCGAATTCGAGCGCGTCGAGGATGAAGAAGAGGACGAGGAAATCGACGTACAGGACTCGGACCAGGATGTTCCTTCTTCTCCGACTGTCGCTTGGAGGGATCACTGGACGGACCAGCAACCCTTGGAGTTAACCAAACATGATCGTTGA
- Protein Sequence
- MMIGGGVATPPGTAKSAAAIDSEMVQVDGIGSGSVSGSASGSGSPATGATTTRLTNSGNTGSNNNNGDVANNNNNNNNEGHSVVNCASGPAAATDKFCCHDDDAPASTGVARPWEPPSPTFSQTRSHLLQVHPPHHHQHPAAHHHHQQMTVPPVSLIDSVSLQNCTAGPFASGNGGGTSRQRLLELSHGLGALRHYNDLANHVLSLNQQGAVVTKLLGTLRPPGLIGGSKPKVATPAVVAKIEQYKRENPTIFAWEIRERLISEGVCSNATAPSVSSINRILRNRAAERAAAEFARAAGYGLYAAGPHPYFNSAHQHPTTSHHLPAGWPAPGAAGHPWMLPPLATGISGAASALLLPPSLSPGAAAAAAAAASASAAGTTDHSLHADAIARGYLQDGDGDEGSLDGSEQPKFRRNRTTFSPEQLEELEKEFERSHYPCVSTRERLASKTSLSEARVQVWFSNRRAKWRRHQRMNLLKRSPPPPPPPPPPPPLPQQQQQQQQQPPQQPQPHSASGMEINRASSCSIAGMGGESSAFRAVVTNSSSREAAERNERIDRVDSIAKQPERKPSAFRMISQLVGEDSPSIPRSSSPSYEQQRHESNIGSAVGSESTNKSEFERVEDEEEDEEIDVQDSDQDVPSSPTVAWRDHWTDQQPLELTKHDR
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01424199; iTF_01068805; iTF_01066891; iTF_01067563; iTF_01067439; iTF_01066775; iTF_01068925; iTF_01065520; iTF_01065383; iTF_00302865; iTF_00302981; iTF_00965880; iTF_00965987; iTF_01066205; iTF_01066071; iTF_00088775; iTF_00088611; iTF_00982793; iTF_00982919; iTF_01539434; iTF_01539554; iTF_00085323; iTF_00087112; iTF_00085994; iTF_00086930; iTF_00086230; iTF_00085113; iTF_01498027; iTF_01498147; iTF_01255501; iTF_01255367; iTF_00087758; iTF_00087949; iTF_00360046; iTF_00360163; iTF_00391122; iTF_00391243; iTF_00633458; iTF_00633581; iTF_00625332; iTF_00625448; iTF_00760820; iTF_00760939; iTF_00873748; iTF_00873636; iTF_00142421; iTF_00142311; iTF_00964310; iTF_00964430; iTF_00084503; iTF_00084358; iTF_01122264; iTF_01122384; iTF_00963801; iTF_00963696; iTF_00733914; iTF_00733810; iTF_01453914; iTF_01454038; iTF_00683997; iTF_00684118; iTF_00216336; iTF_00225187; iTF_00227108; iTF_00216901; iTF_00217701; iTF_00224389; iTF_00229861; iTF_00214854; iTF_00231270; iTF_00233062; iTF_00225865; iTF_00230537; iTF_00214346; iTF_00228618; iTF_00215659; iTF_00231147; iTF_00225740; iTF_00227916; iTF_00227789; iTF_00215534; iTF_00222478; iTF_00228494; iTF_00217583; iTF_00225071; iTF_00231824; iTF_00223825; iTF_00220530; iTF_00226546; iTF_00230644; iTF_00223702; iTF_00229302; iTF_00219063; iTF_00224503; iTF_00226423; iTF_00229978; iTF_00214235; iTF_00216219; iTF_00218939; iTF_00220411; iTF_00227227; iTF_00233179; iTF_00214972; iTF_00221212; iTF_00222364; iTF_00231935; iTF_00217024; iTF_00221088; iTF_00229184; iTF_01418301; iTF_01418192; iTF_00140414; iTF_00140534; iTF_00862398; iTF_00862511; iTF_00218368; iTF_00218249; iTF_00232448; iTF_00232558; iTF_00982240; iTF_00982119; iTF_01070880; iTF_01070978;
- 90% Identity
- iTF_00220411;
- 80% Identity
- iTF_01424199;