Lhir003116.1
Basic Information
- Insect
- Lagria hirta
- Gene Symbol
- -
- Assembly
- GCA_947359425.1
- Location
- OX375803.1:7963490-7992841[+]
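
The Location value above packs a sequence accession, a coordinate range, and a strand into one string. A minimal sketch of splitting it into fields follows. The names `Location`, `parse_location`, and `span_length` are hypothetical helpers, not part of the database, and the span calculation assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "ACCESSION:START-END[STRAND]".
_LOC_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into accession, start, end, and strand."""
    match = _LOC_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        seqid=match["seqid"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


def span_length(loc: Location) -> int:
    """Genomic span in bp, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("OX375803.1:7963490-7992841[+]")
print(loc.seqid, loc.strand, span_length(loc))
```

Under the inclusive-coordinate assumption, the locus spans 29,352 bp on the forward strand.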
Transcription Factor Domain
- TF Family
- POU
- Domain
- Homeobox|Pou
- PFAM
- PF00157
- TF Group
- Helix-turn-helix
- Description
- The POU domain is a bipartite domain composed of two subunits separated by a non-conserved region of 15-55 aa. The N-terminal subunit is known as the POU-specific (POUs) domain (this entry), while the C-terminal subunit is a homeobox domain (IPR001356). Both subdomains contain the structural motif 'helix-turn-helix', which directly associates with the two components of bipartite DNA binding sites, and both are required for high-affinity, sequence-specific DNA binding. 3D structures of complexes including both POU subdomains bound to DNA are available. The domain may also be involved in protein-protein interactions [6]. The subdomains are connected by a flexible linker [7, 5, 8]. Despite the lack of sequence homology, the three-dimensional structure of POUs is similar to that of the bacteriophage lambda repressor and other members of the HTH_3 family [7, 5]. POU proteins are eukaryotic transcription factors containing a bipartite DNA binding domain referred to as the POU domain. The acronym POU (pronounced 'pow') is derived from the names of three mammalian transcription factors, the pituitary-specific Pit-1, the octamer-binding proteins Oct-1 and Oct-2, and the neural Unc-86 from Caenorhabditis elegans. POU domain genes have been identified in diverse organisms including nematodes, flies, amphibians, fish and mammals, but have not yet been identified in plants or fungi. The various members of the POU family have a wide variety of functions, all of which are related to the function of the neuroendocrine system [4] and the development of an organism [1]. Some other genes are also regulated, including those for immunoglobulin light and heavy chains (Oct-2) [3, 2], and trophic hormone genes, such as those for prolactin and growth hormone (Pit-1).
- Hmmscan Out
-
#   of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1   12  0.0034    17        6.1    0.1   42        68      188       214     168       218     0.82
2   12  0.29      1.4e+03   -0.0   0.0   33        59      229       253     224       257     0.80
3   12  0.0046    23        5.7    0.2   38        71      646       677     629       678     0.76
4   12  0.6       3e+03     -1.1   0.0   43        60      697       714     685       717     0.79
5   12  0.0026    13        6.5    0.2   42        71      744       773     726       774     0.83
6   12  0.6       3e+03     -1.1   0.0   43        60      793       810     781       813     0.79
7   12  0.0044    22        5.8    0.2   42        68      840       866     822       870     0.82
8   12  0.18      9.1e+02   0.6    0.0   34        61      882       907     876       915     0.78
9   12  0.012     59        4.4    0.1   42        68      1189      1215    1178      1219    0.80
10  12  1.3       6.6e+03   -2.2   0.0   44        59      1239      1254    1228      1257    0.78
11  12  0.0014    6.9       7.4    0.2   39        68      1621      1648    1601      1652    0.83
12  12  0.52      2.6e+03   -0.9   0.0   42        59      1670      1687    1659      1691    0.79
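
The per-domain hits listed above each carry 13 whitespace-separated fields. A minimal sketch of reading such rows into records and filtering them by independent E-value follows. `DomainHit`, `parse_row`, and `filter_hits` are hypothetical names, the three sample rows are copied from the table, and the cutoff of 20 is an arbitrary illustration rather than a threshold used by the database.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class DomainHit:
    index: int       # "#": domain number
    total: int       # "of": number of domains reported
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_row(line: str) -> DomainHit:
    """Parse one 13-field per-domain row into a DomainHit."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


def filter_hits(hits: List[DomainHit], max_i_evalue: float) -> List[DomainHit]:
    """Keep hits whose independent E-value is at or below the cutoff."""
    return [h for h in hits if h.i_evalue <= max_i_evalue]


# Rows 1, 2 and 11 from the table above.
ROWS = [
    "1 12 0.0034 17 6.1 0.1 42 68 188 214 168 218 0.82",
    "2 12 0.29 1.4e+03 -0.0 0.0 33 59 229 253 224 257 0.80",
    "11 12 0.0014 6.9 7.4 0.2 39 68 1621 1648 1601 1652 0.83",
]

hits = [parse_row(r) for r in ROWS]
for h in filter_hits(hits, max_i_evalue=20.0):  # hypothetical cutoff
    print(h.index, h.i_evalue, h.ali_from, h.ali_to)
```

With this illustrative cutoff, rows 1 and 11 are kept and row 2 is dropped.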
Sequence Information
- Coding Sequence
- ATGATAATTGGAATCTGCAAAGCTTGCGTTTTACCATGGACTGATATCAAAGGAACAAGTTCCATAAATTTATATTACGCTGACAAATGTCCAATATTTACTGATCCATACCAGTTAAAAATAGTAGGAGTTAATGAACCAATAAATTTACCGGAGTTCAAAATATTTAAAAAAAAAGTCGACGAATATTTTCTAAATTATAACTCACAGTCTTTAAATTTCGCTGATTTTTCAAACAATAACATAGCAGAAATTGACTTAACATCTTTAAGATATTTCGAAAATCTAAAATCTTTAAATGTCTCATTTTGTCAATTAAAACAGTTGTCAGACTCAGAAAATACTTCCTCAACAGTTTCATCTTTAAAATACCTTTATCTCAATAATAATAACATTAGTAATGCAATTGTATTCTCAAAATATTTTCCACATTTAGAAAAATTATATTTACAGCATAATAATTTTAGCCAAATTAATTTTAATGTTTTTCATAATTTAACAAATCTTCAACAACTTAATTTATCCTACAACCACATTGAGCTTCTACATTATGAAACATTTCAATCGCTGCAAAGGTTACAAAATCTTGATTTAAGTCACAATAGAATCTTACAAATTGTACCAAATCTTTTCCATAACCTATTAGAACTTCAATTGTTAGCTCTATCTTACAACAACATTCAAACTATTTACCGTGATACATTTTACAGTTTGCAACACTTGCAAAACCTGGACTTAAGCCACAATAACATCATTAGTTTCCACGACTCTACTTTTGAGTACTTACCTGCTCTAAGGTCATTAGCAATAAATGACAATCAGATACAAGATTTATACATTGATGACCTTTTACTCTACTCTAAACAAATTCATACAATTTCCCTAGATAACAATCCATGGAATTGTAAATCAATTGAAAACATTGCTTCAAAATGTCACCAAGGGCAAATTCTAATAAAACCTGGTCAAAATTATAACGCTAGTAATGTCTTAGGTATTTCTTGTAAAACTCCAACTGTTAAAGATGCAGAAAAAAATCTATCAAATGTAACGATAGCAAGAGATGTAAGCTCTGTAAATACTTCAGATAACTACAGAAAGGTAAATTTTGAAGACAGTGAGACAAAAAAAATTTGTAATTTCTTAGGTATTTCTTGTAAAACTCAAACTGTTAAAGATGCAGAAAAAAATCTATCAAATGTAACGATAGCAAAAGATGTAAACTCTGTAAATACTTCAGATAACTGCAGAAAGGTTGGTAGAAACTTCCGGTTTGATAAAATTGTTGCATGTAATGATAAAATTGTTATCATTGGCAATTCATTTCTTTCTGTGTGTTCGTGTGGTATCTACTTCCAATACGTTACCATTTATGTATCTACATTTGAAGAGGCTGTGCTAAGAAAAATGGGGAATACAAAGATGATTGTTTGTCTTGGTTTGACTTCCGTGATGATAATTGGAATCTGCAAAGCTTGCGTTTTATCATGGACTGATATCAAAGGAACAAGTTCCATAAATTTATATTACGCTGGCAAATGTCAAACATATGGTTATTCAAACGAGTTAAAAATAGTAGGAGTTAATGAACCAATAAATTCACCGGAGTTCAAAATATTTCAAAAAAAAGTCGGCGAATATTTTGGAAGTTATAAGTCACAGGCTTTACGTTTCGCTGATTTTTCAAACAATAACATAGCAGAAATTGACTTAACACTTTTAGGATATTTCCAACATCTAAAATCTTTAAATTTCTCATTTTGTCAATTAAAACAGTTGTCAGGCTCAGAAAATACTTTCTCAACAGTTTCATCTTTAAGATACCTTTATCTCAATAATAATAACATTAGTAATGCAATTATATTCTCAAAATATTTTCCTTATTTAGAAAAATTATATTTACAGTATAATAATTTTAGCCAAATTAATTTTAATGTTTTTCATAATTTAACAAATCTTCAACAACTTGATTTAAGTCACAATAGAATCTTACAAATT
TTACCAAATCTTTTCCATAACCTATTAGAACTTGAATTGTTAACTCTATCTTACAACAATATTGAAACTTTTCACCGTGATACATTTTACAGTTTGCCACACTTGCAAAAGCTGGACTTAAGCCACAATAACATCATTAGCATTAATTTTAATGTTTTTCATAATTTAACAAATCTTCAACAATTTAATTTATCCTACAACCACATTGAGCTTCTACATTATGAAACATTTCAATCGCTGCAAAAGTTACAAAATCTTGATTTAAGTCACAATAGAATCTTACAAATTTTACCAAATCTTTTCCATAACCTATTAGAACTTGAATTGTTAACTCTATCTTACAACAATATTGAAACTTTTCACCGTGATACATTTTACAGTTTGCCACACTTGCAAAAGCTGGACTTAAGCCACAATAACATCATTAGCATTAATTTTAATGTTTTTCATAATTTAACAAATCTTCAACAATTTAATTTATCCTACAACCACATTGAGCTTCTACATTATGAAACATTTCAATCGCTGCAAAAGTTACAAAATCTTGATTTAAGTCACAATAGAATCTTACAAATTCTACCAAATCTTTTCCATAACCTATTAGAACTTCAATTGTTAGCTCTATCTTACAACAACATTGAAACTATTTACCGTGATACATTTTACAGTTTGCAACACTTGCAAAACCTGGACTTAAGCCACAATAACATCATTAGTTTCCACGACTCTATTTTTGAGTATTTACCTGCTCTAAGGTCATTAGCAATAAATGACAATCAGATACAAAATTTATACATTGATGACCTTTTACTCTACTCTAAACAAATTCATACAATTTCCCTAGATAACAATCCATGGAATTGTAAATCAATTGAAAACATTGCTTCAAAATGTCACCAAAGGCAAATTCTAATAAAACGTGGTCAAAATTATAACGCTAGTAATGTCTTAGGTATTCCTTGTAAAACTCCAACTGTTAAAGATGCAGAAAAAAATCTATCAAATGTAACGATAGCAAAAGATGTAAACTCTGTAAATACTTCAGATAACTACAGAAAAGTAAATTTTGAAGACATGATGATAATTGGAATCTGCAAAGCTTGCGTTTTACCATGGACTGATATCAAAGGAACAAGTTCCATAAATTTATATTACGCTGACGAATGTGTAGAATATAGTTATCCAAACGAGTTAAAAATAGTAGGAGTTAATGAACCAATAAATTCACCGGAGTTCAAAATATTTAAAAAAAAAGTCGACGAATATTTTGTAAACTATAACTCACAGCACATTAATTTCGCTGATTTTTCAAACAATAACATAGCAGAAATTGACTTAACATCTTTAAGATATTTCGAAAAGCTATATTCTTTAAATGTCTCATTTTGTCAATTAAAACAGTTGTCAGACTCAGAAAATACTTCCTCAACAGTTTCATCTTTAAGATACCTTTATCTCAATAATAATAACATTAGTAATGCAATTGTATTCTCAAAATATTTTCCACATTTACAAAAATTATACTTACAGTATAATAATTTTAGCCATATTAATTTTAATGTTTTTCATAATTTAACAAATCTTCAACAACTTGATTTAAGTCACAACAGAATCTTACAAATTTTACCAAATCTTTTCCATAACCTATTAGAACTTCAATTGTTAGCTCTATCTTACAACAATATTGAAACTTTTCACCGTGATACATTTTACAGTTTGCCACACTTGCAAAACCTGGACTTAAGCCACAATAACATCATTAGCTTCCACGATTCTACTTTTGAGTACTTACCTGCTCTAAGATCATTAGCAATAAATGACAATCAGATACAAGATTTATACATTGATGACCTTTTACTCTACTCTAAACAAATTCATACAATTTCCCTAGATAACAATCCATGGAATTGTAAAACAATTAAAAACATTACTTCAAAATGTCACCAAAGGCAAATTCTAATAAAACCTGGTCAAAATTATAACGCTAGTAATTTCTTAGG
TATTTCTTGTAAAACTCCAACTGTTAAAGATGCAGAAAAAAATCTATTAAATGTAACGATAGCAAAAGATGTAAACTCTGTAAATACTTCAGATAACTACAGAAAGGTTGGTAGAAACTTCCGGTTTGATAAAATTGTTGCATGTAATGATAAAATTGTTATCATTGGCAATTCATTTCTTTCTGTGTGTTCGTGTGGTATCGACTTCCAATACGTTACCATTTATGTATTTACATTTGAAGAGGCTGTGCTAAGAAAAATGTGGAATACAAAGATGATTGTTTGTCTTGGTTTGACTTCAGTGATGATAATTGGAATCTGCAAAGCTTGCGTTTTACCATGGACTGATATCAAAGGAACAAATTCCATAAATTTATATTACGCTGACGAATGTAACGTATATAGTGATCCAAACGAGTTAAAAATAGTAGGAGTTAATGAACCAATAAATTCACCGGAGTTCAAAATATTTAAAAAAAAAGTCGACGAATATTTTGTAAACTATAACTCACAGTCTTTATATTTCGCTGATTTTTCAAACAATAACATAGCAGAAATTGACTTAACACCTTTAGGATATTTCGGAACTCTAGAATCTTTAAATTTCTCATTTTGTCAATTAAAACAGTTGTCAGGCTCAGAAAATACTTCCTCAACAGTTTCATATTTAAGATACCTTTATCTCAATAATAATATCATTAGTAATGCAATTATATTCTCAAAATATTTTCCACATTTAGAAAAATTATATTTACAGTATAATAATTTTAGCCAAATTAATTTTAATGTTTTTCATAATTTAACAAATCTTCAACAACTTGATTTATCCTACAACCACATTGAGCTTCTACATTATGAAACATTTCAGTCGCTGCAAAGGTTACAAAATCTTGATTTAAGTCACAATAGAATCTTACAAATTTTACCAAATCTTTTCCATAGCCTATTAGAACTTCAATTGTTAGCTCTATCTTACAACAACATTGAAACTTTTCACCGTGATACATTTTACAGTTTGCAACACTTGCAAAACCTGGACTTAAGCCACAATAACATCATTAGTTTCCACGATTCTACTTTTGAGTACTTACCTGCTCTAAGGTCATTAGCAATAAATGACAATCAGATACAAGATTTATACATTGATGACCTTTTACTCTACTCTAAACAAATTCATACAATTTCCCTAGATAACAATCCATGGAATTGTAAATCAATTGAAAACATTACTTCAAAATGTCACCAAAGGCAAATTCTAATAAAACCTGGTCAAAATTATAACGCTAGTAATGTCTTAGGTATTTCTTGTAAAACTCAAATTGTTAAAGATGCAGAAAAAAATCTATCAAATGCAACGATAGCAAAAGATGTAAACTCTGTAAATACTTCAGATAACTACAGAAAGGTAAATTTTGAAGACAAAAACGATATTTGGCTTTTGATTCAATGGAGAGAAGCTGGGGGAGTTTGTGTTCGAATTGGAGAAGACAATAGGGAGAAAACGAGCATGTTGCACAATATCACAGTATATTGCTGA
- Protein Sequence
- MIIGICKACVLPWTDIKGTSSINLYYADKCPIFTDPYQLKIVGVNEPINLPEFKIFKKKVDEYFLNYNSQSLNFADFSNNNIAEIDLTSLRYFENLKSLNVSFCQLKQLSDSENTSSTVSSLKYLYLNNNNISNAIVFSKYFPHLEKLYLQHNNFSQINFNVFHNLTNLQQLNLSYNHIELLHYETFQSLQRLQNLDLSHNRILQIVPNLFHNLLELQLLALSYNNIQTIYRDTFYSLQHLQNLDLSHNNIISFHDSTFEYLPALRSLAINDNQIQDLYIDDLLLYSKQIHTISLDNNPWNCKSIENIASKCHQGQILIKPGQNYNASNVLGISCKTPTVKDAEKNLSNVTIARDVSSVNTSDNYRKVNFEDSETKKICNFLGISCKTQTVKDAEKNLSNVTIAKDVNSVNTSDNCRKVGRNFRFDKIVACNDKIVIIGNSFLSVCSCGIYFQYVTIYVSTFEEAVLRKMGNTKMIVCLGLTSVMIIGICKACVLSWTDIKGTSSINLYYAGKCQTYGYSNELKIVGVNEPINSPEFKIFQKKVGEYFGSYKSQALRFADFSNNNIAEIDLTLLGYFQHLKSLNFSFCQLKQLSGSENTFSTVSSLRYLYLNNNNISNAIIFSKYFPYLEKLYLQYNNFSQINFNVFHNLTNLQQLDLSHNRILQILPNLFHNLLELELLTLSYNNIETFHRDTFYSLPHLQKLDLSHNNIISINFNVFHNLTNLQQFNLSYNHIELLHYETFQSLQKLQNLDLSHNRILQILPNLFHNLLELELLTLSYNNIETFHRDTFYSLPHLQKLDLSHNNIISINFNVFHNLTNLQQFNLSYNHIELLHYETFQSLQKLQNLDLSHNRILQILPNLFHNLLELQLLALSYNNIETIYRDTFYSLQHLQNLDLSHNNIISFHDSIFEYLPALRSLAINDNQIQNLYIDDLLLYSKQIHTISLDNNPWNCKSIENIASKCHQRQILIKRGQNYNASNVLGIPCKTPTVKDAEKNLSNVTIAKDVNSVNTSDNYRKVNFEDMMIIGICKACVLPWTDIKGTSSINLYYADECVEYSYPNELKIVGVNEPINSPEFKIFKKKVDEYFVNYNSQHINFADFSNNNIAEIDLTSLRYFEKLYSLNVSFCQLKQLSDSENTSSTVSSLRYLYLNNNNISNAIVFSKYFPHLQKLYLQYNNFSHINFNVFHNLTNLQQLDLSHNRILQILPNLFHNLLELQLLALSYNNIETFHRDTFYSLPHLQNLDLSHNNIISFHDSTFEYLPALRSLAINDNQIQDLYIDDLLLYSKQIHTISLDNNPWNCKTIKNITSKCHQRQILIKPGQNYNASNFLGISCKTPTVKDAEKNLLNVTIAKDVNSVNTSDNYRKVGRNFRFDKIVACNDKIVIIGNSFLSVCSCGIDFQYVTIYVFTFEEAVLRKMWNTKMIVCLGLTSVMIIGICKACVLPWTDIKGTNSINLYYADECNVYSDPNELKIVGVNEPINSPEFKIFKKKVDEYFVNYNSQSLYFADFSNNNIAEIDLTPLGYFGTLESLNFSFCQLKQLSGSENTSSTVSYLRYLYLNNNIISNAIIFSKYFPHLEKLYLQYNNFSQINFNVFHNLTNLQQLDLSYNHIELLHYETFQSLQRLQNLDLSHNRILQILPNLFHSLLELQLLALSYNNIETFHRDTFYSLQHLQNLDLSHNNIISFHDSTFEYLPALRSLAINDNQIQDLYIDDLLLYSKQIHTISLDNNPWNCKSIENITSKCHQRQILIKPGQNYNASNVLGISCKTQIVKDAEKNLSNATIAKDVNSVNTSDNYRKVNFEDKNDIWLLIQWREAGGVCVRIGEDNREKTSMLHNITVYC
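
The coding sequence and protein sequence above can be checked against each other by translating codons. The following is a sketch only: it assumes the standard genetic code (NCBI translation table 1) and reading frame 1, neither of which the record states, and `translate` is a hypothetical helper. The two short test strings are the first 39 and last 9 nucleotides of the coding sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered by bases T, C, A, G.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Whitespace is removed first, so a sequence wrapped over several lines
    can be passed in directly. Codons with unexpected characters become 'X';
    a trailing partial codon is ignored.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 39 nt and last 9 nt of the coding sequence above.
print(translate("ATGATAATTGGAATCTGCAAAGCTTGCGTTTTACCATGG"))
print(translate("TATTGCTGA"))
```

The two calls give `MIIGICKACVLPW` and `YC`, matching the start and end of the protein sequence listed above.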
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -