Imas011251.1
Basic Information
- Insect
- Incurvaria masculella
- Gene Symbol
- Poxn
- Assembly
- GCA_946894085.1
- Location
- CAMPPJ020000095.1:869330-888491[-]
Transcription Factor Domain
- TF Family
- PAX
- Domain
- PAX domain
- PFAM
- PF00292
- TF Group
- Helix-turn-helix
- Description
- The paired domain, a ~126 amino acid DNA-binding domain, is found in eukaryotic transcription regulatory proteins involved in embryogenesis. Initially identified in Drosophila’s paired (prd) protein, it typically resides in the N-terminal region and may be followed by an octapeptide, a homeodomain, or a Pro-Ser-Thr-rich C terminus. Paired domain proteins act as transcription repressors or activators, with DNA-binding specificity mediated by three subdomains. Crystal structures reveal a bipartite DNA-binding paired domain: an N-terminal subdomain (PAI) and a C-terminal subdomain (RED), linked by a flexible linker. Both subdomains contain a helix-turn-helix motif that binds DNA's major groove, while the linker may bind the minor groove. Variations in domain usage across Pax proteins and isoforms determine sequence specificity.
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 22 9.7e-42 3e-38 131.5 0.6 1 74 36 109 36 111 0.98 2 22 7.5e-14 2.4e-10 41.3 0.2 45 74 112 141 109 143 0.94 3 22 1.2e-14 3.9e-11 43.9 0.3 45 74 144 173 140 176 0.93 4 22 1.4e-14 4.4e-11 43.7 0.3 45 74 176 205 173 208 0.93 5 22 4.4e-13 1.4e-09 38.9 0.2 45 74 208 237 205 239 0.94 6 22 1.2e-14 3.9e-11 43.9 0.3 45 74 240 269 236 272 0.93 7 22 5.2e-13 1.6e-09 38.6 0.2 45 74 272 301 269 304 0.93 8 22 1.2e-14 3.9e-11 43.9 0.3 45 74 304 333 300 336 0.93 9 22 3.8e-13 1.2e-09 39.1 0.2 45 74 336 365 333 368 0.93 10 22 1.2e-14 3.9e-11 43.9 0.3 45 74 368 397 364 400 0.93 11 22 1.2e-14 3.9e-11 43.9 0.3 45 74 400 429 396 432 0.93 12 22 4.6e-13 1.4e-09 38.8 0.2 45 74 432 461 428 464 0.93 13 22 1.4e-14 4.4e-11 43.7 0.3 45 74 464 493 461 496 0.93 14 22 3.7e-13 1.2e-09 39.1 0.2 45 74 496 525 493 528 0.93 15 22 1.6e-14 5.1e-11 43.5 0.3 45 74 528 557 525 559 0.94 16 22 1.4e-14 4.5e-11 43.7 0.3 45 74 560 589 556 591 0.93 17 22 3.7e-13 1.2e-09 39.1 0.2 45 74 592 621 589 624 0.93 18 22 1.2e-14 3.9e-11 43.9 0.3 45 74 624 653 620 656 0.93 19 22 6.1e-13 1.9e-09 38.4 0.2 45 74 656 685 653 687 0.94 20 22 2.5e-13 7.8e-10 39.7 0.2 45 72 688 715 684 725 0.91 21 22 1.4 4.5e+03 -1.6 0.0 23 61 857 896 850 901 0.82 22 22 2.1e-05 0.066 14.1 0.0 39 74 1008 1037 990 1047 0.80
Sequence Information
- Coding Sequence
- ATGCCTGCGCCAACACGTCACTCGAGCAAAACTATCCTCGGCCCCTGTGGTAAATTACCTCGCGGGTTAACAGGAAACGTCTCAGTGAATGTACGTTTGCCGGCAGGTCAAGCGGGCGTCAATCAGCTCGGCGGCGTGTTCGTGAACGGTCGGCCGCTGCCGGACGTGGTACGCCGGCGCATCGTGGAGCTCGCCCTCCACGGCGTGAGACCCTGCGACATCAGCCGGCAGCTGCTCGTCTCGCACGGCTGCGTCTCCAAGATCCTCACCCGCTTCTACGAGACCGGCTCCATCCGGCCCGGCTCCATTGGTGGCAGCAAGACTAAGACGCTCGTCTCGGACGGCTGCGTCTCCAAGATCCTCACCCGCTTCTATGAGACCGGCTCCATCCGTCCCGGCTCCATCGGCGGCAGCAAGACCAAGACGCTCGTCTCGCACGGCTGCGTCTCCAAGATCCTCACCCGCTTCTATGAGACCGGCTCCATCCGTCCCGGCTCCATCGGCGGCAGCAAGACCAAGACGCTCGTCTCGCACGGCTGCGTCTCCAAGATCCTCACCCGCTTCTACGAGACCGGCTCCATCCGGCCCGGCTCCATTGGTGGCAGCAAGACTAAGACGCTCGTCTCGGACGGCTGCGTCTCCAATATCCTCACCCGCTTCTATGAGACCGGCTCCATCCGTCCCGGCTCCATCGGCGGCAGCAAGACCAAGACGCTCGTCTCGCACGGCTGCGTCTCCAAGATCCTCACCCGCTTCTACGAGACCGGCTCCATCCGGCCCGGCTCCATTGGTGGCAGCAAGACTAAGACGCTCGTATCGGACGGCTGCGTCTCCAAGATCCTCACCCGCTTCTATGAGACCGGCTCCATCCGTCCCGGCTCCATCAGCGGCAGCAAGACCAAGACGCTCGTCTCGCACGGCTGCGTCTCCAAGATCCTCACCCGCTTCTACGAGACCGGCTCCATCCGGCCCGGCTCCATTGGTGGCAGCAAGACTAAGACGCTCGTCTCGGACGGCTGCGTCTCCAATATCCTCACCCGCTTCTATGAGACCGGCTCCATCCGTCCCGGCTCCATCGGCGGCAGCAAGACCAAGACGCTCGTCTCGCACGGCTGCGTCTCCAAGATCCTCACCCGCTTCTATGAGACCGGCTCCATCCGTCCCGGCTCCATCGGCGGCAGCAAGACCAAGACGCTCGTCTCGCACGGCTGCGTCTCCAAGATCCTCACCCGCTTTTACGAGACCGGCTCCATCCGGCCCGGCTCCATTGGTGGCAGCAAGACTAAGACGCTCGTATCGGACGGCTGCGTCTCCAAGATCCTCACCCGCTTCTATGAGACCGGCTCCATCCGTCCCGGCTCCATCAGCGGCAGCAAGACCAAGACGCTCGTCTCGCACGGCTGCGTCTCCAAGATCCTCACCCGCTTCTACGAGACCGGCTCCATCCGGCCCGGCTCCATTGGTGGCAGCAAGACTAAGACGCTCGTCTCGGACGGCTGCGTCTCCAATATCCTCACCCGCTTCTATGAGACCGGCTCCATCCGTCCCGGCTCCATCGGCGGCAGCAAGACCAAGACGCTCGTCTCGCACGGCTGCGTCTCCAAGATCCTCACCCGCTTCTATGAGACCGGCTCCATCCGTCCCGGCTCCATCGGCGGCAGCAAGACCAAGACGCTCGTCTCGCACGGCTGCGTCTCCAAGATCCTCACCCGCTTTTACGAGACCGGCTCCATCCGGCCCGGCTCCATTGGTGGCAGCAAGACTAAGACGCTCGTCTCGGACGGCTGCGTCTCCAATATCCTCACCCGCTTCTATGAGACCGGCTCCATCCGTCCCGGCTCCATCGGCGGCAGCAAGACCAAGACGCTCGTCTCGCACGGCTGCGTCTCCAAGATCCTCACCCGCTTCTACGAGACCGGCTCCATCCGGCCCGGCTCCATTGGTGGCAGCAAGACTAAGACGCTCGTATCGGACGGCTGCGTCTCCAAGATCCTCACCCGCTTCTATGAGACCGGCTCCATCCGTCCCGGCTCCATCAGCGGCAGCAAGACCAAGACGCTCGTCTCGCACGGCTGCGTCTCCAAGATCCTCACCCGCTTCTACGAGACCGGCTCCATCCGGCCCGGCTCCATCGGCGGCAGTGTGACGTTCGACCGGAACCCTCGCTTTTGCGGCGGCTCCGTCCGCGTCATCACGCATGGTGGGTGCACGCGCGTGCGGTGCGGCGCGCGGGGCGTCGGCTCAGAGCAGCGGCGCCGGCGTGTCGGCCGCGTCGTCAACGCCGGCCGACGGCGTGGCGGCGTGGCTCAGCGCTGCTCGGCGCGGGGCGCATCGGCGCGGCTGCGCGGATGGCCGGCCAGCGTGCTCGGCTACCTCGCCGCGTCGCTGTCGGCGACGGCGACCCATCGGCGAGTCGTGCCTCGGGTGAACGGAAGACATAACTGGGGCCCACTTGCGGAAGATCAGACATCACCTTGCAGTGGCCGTCCAGACCCTCGAAGGCGCGAGCCGGACGACCGGTGCGGGAGCGGGCCGCTCCAGACGACTTCCGCTGTGACGCGAACGGAACGAGACGAATCGATTTCGGTGGCGTTCGCCAGCGTGCGCGCCGCCGTGATCGCGTCGCGTGTCCGAACGGCCGAGCAGTACATAGCTCGGTTTGGAGTGGGGCGCAAAATAGAAACAGGATTCCTAGTCCTGCTCGTCCCCGAACCAGTGGATGCCAGTGTGACGTTCGACCGGAACCCTCGCTTTTGCGGCGGCTCCGTCCGCGTCATCACGCGCCACCACTCCAAATTAAATCATGGTGGGTGCACGCGCGTGCGGTGCGGCGCGCGGGGCGTCGGCTCAGAGCAGCGGCGCCGGCGTGTCGGCCGCGTCGTCAACGCCGGCCGACGGCGTGGCGGCGTGGCTCAGCGCTGCTCGGCGCGGGGCGCATCGGCGCGGCTGCGCGGATGGCCGGCCAGCGTGCTCGGCTACCTCGCCGCGTCGCTGTCGGCGACGGCGACCCATCGGCGAGTCGTGCCTCGGACGCTCGTCTCGCACTACTGGGTCTCCAAGATCCTCACCCGCTTCGACGAGACCGGCTCCATCGGCGGCAGCAAGACCAAGAGATTCTCCGTGCGGGCATCGTTCGCCATGGTCCTCCTCGGTCCGGCCGCCCCAGGTGCGTTTACGATCCGCGATGATGCCGCGCCGTAA
- Protein Sequence
- MPAPTRHSSKTILGPCGKLPRGLTGNVSVNVRLPAGQAGVNQLGGVFVNGRPLPDVVRRRIVELALHGVRPCDISRQLLVSHGCVSKILTRFYETGSIRPGSIGGSKTKTLVSDGCVSKILTRFYETGSIRPGSIGGSKTKTLVSHGCVSKILTRFYETGSIRPGSIGGSKTKTLVSHGCVSKILTRFYETGSIRPGSIGGSKTKTLVSDGCVSNILTRFYETGSIRPGSIGGSKTKTLVSHGCVSKILTRFYETGSIRPGSIGGSKTKTLVSDGCVSKILTRFYETGSIRPGSISGSKTKTLVSHGCVSKILTRFYETGSIRPGSIGGSKTKTLVSDGCVSNILTRFYETGSIRPGSIGGSKTKTLVSHGCVSKILTRFYETGSIRPGSIGGSKTKTLVSHGCVSKILTRFYETGSIRPGSIGGSKTKTLVSDGCVSKILTRFYETGSIRPGSISGSKTKTLVSHGCVSKILTRFYETGSIRPGSIGGSKTKTLVSDGCVSNILTRFYETGSIRPGSIGGSKTKTLVSHGCVSKILTRFYETGSIRPGSIGGSKTKTLVSHGCVSKILTRFYETGSIRPGSIGGSKTKTLVSDGCVSNILTRFYETGSIRPGSIGGSKTKTLVSHGCVSKILTRFYETGSIRPGSIGGSKTKTLVSDGCVSKILTRFYETGSIRPGSISGSKTKTLVSHGCVSKILTRFYETGSIRPGSIGGSVTFDRNPRFCGGSVRVITHGGCTRVRCGARGVGSEQRRRRVGRVVNAGRRRGGVAQRCSARGASARLRGWPASVLGYLAASLSATATHRRVVPRVNGRHNWGPLAEDQTSPCSGRPDPRRREPDDRCGSGPLQTTSAVTRTERDESISVAFASVRAAVIASRVRTAEQYIARFGVGRKIETGFLVLLVPEPVDASVTFDRNPRFCGGSVRVITRHHSKLNHGGCTRVRCGARGVGSEQRRRRVGRVVNAGRRRGGVAQRCSARGASARLRGWPASVLGYLAASLSATATHRRVVPRTLVSHYWVSKILTRFDETGSIGGSKTKRFSVRASFAMVLLGPAAPGAFTIRDDAAP
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -