Tcly015485.1
Basic Information
- Insect
- Tetragonula clypearis
- Gene Symbol
- pax2a
- Assembly
- GCA_010645135.1
- Location
- WIUT01003137.1:4605-12277[+]
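The Location field packs the scaffold accession, coordinates, and strand into one string. The sketch below shows one way to split it. The `parse_location` helper and its regular expression are illustrative assumptions, not part of the database. The span length further assumes 1-based, inclusive coordinates, which the record does not state.

```python
import re

# Assumed layout: <sequence id>:<start>-<end>[<strand>], as in the Location field.
LOCATION_RE = re.compile(
    r"(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]"
)


def parse_location(text):
    """Split a location string into its parts (hypothetical helper)."""
    match = LOCATION_RE.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start = int(match["start"])
    end = int(match["end"])
    return {
        "seqid": match["seqid"],
        "start": start,
        "end": end,
        "strand": match["strand"],
        # Assumes 1-based, inclusive coordinates (not stated in the record).
        "span": end - start + 1,
    }


locus = parse_location("WIUT01003137.1:4605-12277[+]")
print(locus)
```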
Transcription Factor Domain
- TF Family
- PAX
- Domain
- PAX domain
- PFAM
- PF00292
- TF Group
- Helix-turn-helix
- Description
- The paired domain, a ~126 amino acid DNA-binding domain, is found in eukaryotic transcription regulatory proteins involved in embryogenesis. First identified in the Drosophila paired (prd) protein, it typically lies in the N-terminal region and may be followed by an octapeptide, a homeodomain, or a Pro-Ser-Thr-rich C terminus. Paired domain proteins act as transcription repressors or activators, with DNA-binding specificity mediated by three subdomains. Crystal structures reveal a bipartite DNA-binding paired domain: an N-terminal subdomain (PAI) and a C-terminal subdomain (RED), joined by a flexible linker. Each subdomain contains a helix-turn-helix motif that binds the major groove of DNA, while the linker may bind the minor groove. Differences in subdomain usage across Pax proteins and isoforms determine sequence specificity.
- Hmmscan Out
-
#  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1  1   5.3e-73   1.7e-69   232.3  0.2   1         122     50        171     50        171     0.99
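The alignment coordinates in the hmmscan row (ali from 50, ali to 171) locate the PAX domain on the protein. The sketch below slices that region from the record's protein sequence. The `extract_region` helper is an illustrative assumption, and it treats the coordinates as 1-based and inclusive, which is how HMMER domain tables report them.

```python
# Protein sequence copied from this record (171 residues).
PROTEIN = (
    "MNGLASQTVSRLALRIRNSDDWIIESSVHWHVHEDICRGPGPSITILDRS"
    "HGGVNQLGGVFVNGRPLPDVVRQRIVELAHSGVRPCDISRQLRVSHGCVS"
    "KILSRYYETGSFKAGVIGGSKPKVATPPVVEAIANYKRDNPTMFAWEIRD"
    "RLLAEGICSQDNVPSVSSINR"
)


def extract_region(sequence, start, end):
    """Return residues start..end, 1-based and inclusive (hypothetical helper)."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    # Convert the 1-based inclusive start to a 0-based slice index.
    return sequence[start - 1:end]


# ali coord from / ali coord to, taken from the hmmscan row above.
domain = extract_region(PROTEIN, 50, 171)
print(len(domain), domain)
```

In this record the aligned region runs to the last residue of the protein, so the predicted domain occupies the entire C-terminal portion of the sequence.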
Sequence Information
- Coding Sequence
- ATGAACGGACTTGCGTCACAGACTGTTAGTCGACTGGCGTTAAGGATTAGAAACAGCGACGATTGGATCATCGAATCATCGGTGCACTGGCACGTACACGAGGACATATGTCGCGGTCCAGGACCCTCCATTACGATATTAGATCGAAGCCACGGCGGAGTTAACCAGCTCGGCGGCGTCTTCGTGAACGGAAGGCCCCTACCGGATGTAGTGAGGCAGAGGATCGTCGAGCTAGCGCACAGCGGCGTCCGACCGTGCGACATTTCCAGACAACTCAGGGTATCTCACGGCTGTGTATCCAAGATACTGTCGAGGTATTACGAAACCGGCAGCTTCAAGGCTGGCGTGATAGGTGGCTCGAAGCCGAAGGTGGCAACGCCGCCGGTTGTCGAGGCGATCGCTAATTACAAGAGGGACAATCCGACGATGTTCGCGTGGGAGATCAGGGATCGATTGCTCGCTGAAGGTATTTGCTCGCAGGACAACGTGCCATCCGTCTCTTCGATCAATCGGTGA
- Protein Sequence
- MNGLASQTVSRLALRIRNSDDWIIESSVHWHVHEDICRGPGPSITILDRSHGGVNQLGGVFVNGRPLPDVVRQRIVELAHSGVRPCDISRQLRVSHGCVSKILSRYYETGSFKAGVIGGSKPKVATPPVVEAIANYKRDNPTMFAWEIRDRLLAEGICSQDNVPSVSSINR
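The coding and protein sequences can be checked against each other by translating the former. The sketch below is a minimal translator that assumes the standard genetic code (NCBI translation table 1), since the record does not say which table was used. The `translate` helper is illustrative. For brevity the example runs on the first 30 and last 9 nucleotides of the coding sequence above, not the full sequence.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nucleotides and last 9 nucleotides of the record's coding sequence.
head = translate("ATGAACGGACTTGCGTCACAGACTGTTAGT")
tail = translate("AATCGGTGA")
print(head, tail)
```

Passing the full coding sequence to `translate` and comparing the result with the listed protein sequence applies the same check to the whole record.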
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00293666; iTF_01475395; iTF_00228613; iTF_01473125; iTF_00227910; iTF_00253511; iTF_00633575; iTF_00964423; iTF_01474631; iTF_00087944; iTF_01130761; iTF_01255497; iTF_01473865; iTF_00862507; iTF_00226540; iTF_00141773; iTF_00298253; iTF_00860333; iTF_01420877; iTF_01417647; iTF_00252016; iTF_00086225; iTF_00229297; iTF_00738289; iTF_00255026; iTF_00229972; iTF_00233172; iTF_00294420; iTF_00227221; iTF_00221206; iTF_00295190; iTF_01198614; iTF_01418296; iTF_00297497; iTF_00306201; iTF_00117946; iTF_00217018; iTF_00252762; iTF_00291482; iTF_00292890; iTF_00982234; iTF_00218363; iTF_00863899; iTF_01493531; iTF_00773756; iTF_01539548; iTF_00360157; iTF_01498141; iTF_00254270; iTF_00963135; iTF_01395071; iTF_01492831; iTF_00982912; iTF_01419584;
- 90% Identity
- iTF_00306201;
- 80% Identity
- -