Mpro041203.1
Basic Information
- Insect
- Melinopterus prodromus
- Gene Symbol
- -
- Assembly
- GCA_964023965.1
- Location
- OZ030395.1:30670070-30684208[+]
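The Location value above packs contig, start, end and strand into one string. A minimal sketch of splitting it into fields follows; the `GenomicLocation` type and `parse_location` helper are hypothetical names introduced here, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class GenomicLocation(NamedTuple):
    """Hypothetical container for a 'contig:start-end[strand]' string."""
    contig: str
    start: int
    end: int
    strand: str


# Matches strings shaped like "OZ030395.1:30670070-30684208[+]".
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> GenomicLocation:
    """Split a location string into contig, start, end and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return GenomicLocation(
        contig=match["contig"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("OZ030395.1:30670070-30684208[+]")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = loc.end - loc.start + 1
print(loc, span)
```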
Transcription Factor Domain
- TF Family
- BTB
- Domain
- zf-C2H2|ZBTB
- PFAM
- PF00651
- TF Group
- Zinc-Coordinating Group
- Description
- The BTB (for BR-C, ttk and bab) [6] or POZ (for Pox virus and Zinc finger) [1] domain is present near the N-terminus of a fraction of zinc finger (Pfam:PF00096) proteins and in proteins that contain the Pfam:PF01344 motif, such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and, in some instances, heteromeric dimerisation [1]. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule [2]. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes, including N-CoR and SMRT [3, 4, 5]. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.
- Hmmscan Out
-
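| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|----|----------|----------|-------|------|----------------|--------------|----------------|--------------|----------------|--------------|-----|
| 1 | 26 | 1.9e-05 | 0.0083 | 17.9 | 0.0 | 48 | 89 | 119 | 160 | 107 | 163 | 0.82 |
| 2 | 26 | 4.3e-07 | 0.00019 | 23.2 | 0.0 | 53 | 89 | 167 | 203 | 159 | 206 | 0.87 |
| 3 | 26 | 5.4e-11 | 2.4e-08 | 35.8 | 0.0 | 53 | 105 | 210 | 262 | 202 | 267 | 0.90 |
| 4 | 26 | 8.2e-11 | 3.7e-08 | 35.2 | 0.0 | 53 | 105 | 268 | 320 | 260 | 324 | 0.90 |
| 5 | 26 | 1.1e-06 | 0.00049 | 21.9 | 0.0 | 53 | 88 | 326 | 361 | 319 | 365 | 0.88 |
| 6 | 26 | 1.9e-06 | 0.00083 | 21.2 | 0.0 | 53 | 89 | 369 | 405 | 362 | 408 | 0.88 |
| 7 | 26 | 0.00016 | 0.069 | 15.0 | 0.0 | 53 | 89 | 412 | 448 | 404 | 453 | 0.87 |
| 8 | 26 | 4.8e-07 | 0.00021 | 23.1 | 0.0 | 53 | 89 | 455 | 491 | 448 | 494 | 0.88 |
| 9 | 26 | 8.6e-07 | 0.00038 | 22.2 | 0.0 | 53 | 88 | 498 | 533 | 490 | 537 | 0.86 |
| 10 | 26 | 2.9e-11 | 1.3e-08 | 36.7 | 0.0 | 53 | 105 | 541 | 593 | 533 | 598 | 0.91 |
| 11 | 26 | 9.6e-11 | 4.3e-08 | 35.0 | 0.0 | 53 | 105 | 599 | 651 | 592 | 656 | 0.91 |
| 12 | 26 | 9.6e-11 | 4.3e-08 | 35.0 | 0.0 | 53 | 105 | 657 | 709 | 650 | 714 | 0.91 |
| 13 | 26 | 0.00036 | 0.16 | 13.8 | 0.0 | 54 | 88 | 716 | 749 | 709 | 753 | 0.83 |
| 14 | 26 | 3e-11 | 1.4e-08 | 36.6 | 0.0 | 53 | 105 | 757 | 809 | 749 | 813 | 0.91 |
| 15 | 26 | 7.8e-11 | 3.5e-08 | 35.3 | 0.0 | 53 | 105 | 815 | 867 | 807 | 872 | 0.90 |
| 16 | 26 | 7.8e-11 | 3.5e-08 | 35.3 | 0.0 | 53 | 105 | 873 | 925 | 865 | 930 | 0.90 |
| 17 | 26 | 8.9e-07 | 0.0004 | 22.2 | 0.0 | 53 | 88 | 931 | 966 | 923 | 970 | 0.86 |
| 18 | 26 | 2.9e-11 | 1.3e-08 | 36.7 | 0.0 | 53 | 105 | 974 | 1026 | 966 | 1031 | 0.91 |
| 19 | 26 | 7.8e-11 | 3.5e-08 | 35.3 | 0.0 | 53 | 105 | 1032 | 1084 | 1024 | 1089 | 0.90 |
| 20 | 26 | 8.2e-11 | 3.7e-08 | 35.2 | 0.1 | 53 | 105 | 1090 | 1142 | 1082 | 1146 | 0.90 |
| 21 | 26 | 2.8e-11 | 1.2e-08 | 36.7 | 0.0 | 53 | 105 | 1148 | 1200 | 1140 | 1205 | 0.90 |
| 22 | 26 | 8.2e-11 | 3.7e-08 | 35.2 | 0.1 | 53 | 105 | 1206 | 1258 | 1198 | 1262 | 0.90 |
| 23 | 26 | 6.3e-11 | 2.8e-08 | 35.5 | 0.0 | 53 | 105 | 1264 | 1316 | 1256 | 1320 | 0.90 |
| 24 | 26 | 3.7e-05 | 0.016 | 17.0 | 0.0 | 53 | 90 | 1322 | 1358 | 1314 | 1361 | 0.85 |
| 25 | 26 | 4.9e-11 | 2.2e-08 | 35.9 | 0.1 | 53 | 105 | 1364 | 1416 | 1356 | 1420 | 0.91 |
| 26 | 26 | 5.1e-09 | 2.3e-06 | 29.4 | 0.0 | 53 | 101 | 1422 | 1470 | 1414 | 1476 | 0.88 |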
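Each domain hit in the Hmmscan output carries thirteen whitespace-separated values in the column order given by its header. The sketch below reads a few of those rows into records and keeps hits under an i-Evalue cutoff. The `DomainHit` class, the `parse_hit` helper and the `1e-3` cutoff are illustrative assumptions, not part of this record or of HMMER itself.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    """One per-domain row; field names mirror the header columns (hypothetical class)."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hit(line: str) -> DomainHit:
    """Parse one whitespace-separated row of thirteen values."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        int(f[6]), int(f[7]), int(f[8]), int(f[9]), int(f[10]), int(f[11]),
        float(f[12]),
    )


# A handful of rows copied from the record (hits 1, 2, 3, 7 and 13).
rows = [
    "1 26 1.9e-05 0.0083 17.9 0.0 48 89 119 160 107 163 0.82",
    "2 26 4.3e-07 0.00019 23.2 0.0 53 89 167 203 159 206 0.87",
    "3 26 5.4e-11 2.4e-08 35.8 0.0 53 105 210 262 202 267 0.90",
    "7 26 0.00016 0.069 15.0 0.0 53 89 412 448 404 453 0.87",
    "13 26 0.00036 0.16 13.8 0.0 54 88 716 749 709 753 0.83",
]
hits = [parse_hit(r) for r in rows]

# Assumed cutoff for illustration only; pick a threshold suited to the analysis.
I_EVALUE_CUTOFF = 1e-3
strong = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]

for h in strong:
    # Alignment length on the protein, treating ali coordinates as inclusive.
    print(h.index, h.i_evalue, h.ali_to - h.ali_from + 1)
```

Filtering on the independent E-value rather than the score is only one possible choice; the conditional E-value or the bit score could be used in the same way.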
Sequence Information
- Coding Sequence
- ATGGACATGATGTGGAAGAgGTTTCCTTCAAATATTCCTTCAACTGACCGACCAAAAACGTTTACCGGAGAGCTTTGGACATTTGCGATAGGCAATTTTATAAACACATACATCGATTTCTGCGAATCGGTGCTGCGCTGCCAGTTACAGACTGGAGAAGAGCGATTGAAAGGATTGGCGCTACTCAATATTCATAGAGATGTGGAAATCAGTGAAAAGGTATTGGATATTCTGGCAAAGAATATCATCCAATACCACCCTACAAAGCAATACACTACCGAAACCAGTATCCGACCAGAAACAACCAAAACATTCAAATTCCATACGTTCGTTCGTGACTGGACCGCACAAGatgAAGAAGGTAAAGGTGATATCCACATTGACTACATATCATCTACCGCCGTCCGATCAATATTGCTGTTTTTCTATACAGGAGAAATAACAACAACCGCGTCTAACGTGGAAGATTTGCTATTGGCTAATGAAGAAGGTAAAGGTGATATCCACATCGACGACATATCACCTACCGCCGTCCGATCAATATTGCTGTTTTTCTATACAGGAGAAATAACAATAACCGCGTCTAACGTGGAAGATTTGCTATTGGCTAatgAAGAAGGTAAAGGTGATATCCACATCGACGACATATCATTTACCGCCGTCCGATCAATATTGCTGTTTTTCTATACAGGAGAAATAACAGTAACCGCGTCTAACGTGGAAGATTTGCTATTGGCTAGCAAGAAATTTAAGATTGCAAAGTTGGAAACAGCATGCAGCCAAtatgAAGAAGGTAAAGGTGATATCCACATCGACGACATATCATCTACCGCCGTCCGATCAATATTGCTGTTTTTCTATACAGGAGAAATAACAGTAACCGCGTCTAACGTGGAAGATTTGCTATTAGCTAGCAAGAAATTTAAGATTGCAAAGTTGGAAACAGCATGCAGCCAAtatgAAGAAGGTAAAGGTGATATCCACATCGACGACATATCATCTACCGCCGTCCGATCAATATTGAAGTTTTTCTACACAGGAGAAATAACAATAACCGCGTCTAACGTGGAAGATTTGCTATGGGCTAATGAAGAAGGTAAAGGTGATATCCACATCGACGACATATCATCTACCGCCGTCCGATCAATATTGCTGTTTTTCTATACAGGAGAAATAACAATAACCGCGTCTAACGTGGAAGATTTGCTATTGGCTAatgAAGAAGGTAAAGGTGATATCCACATTGACTACATACCATCTACCGCCGTCCGATCAATATTGCTGTTTTTCTATACAGGAGAAATAACAACAACCGCGTCTAACGTGGAAGATTTGCTATTGGCTAATGAAGAAGGTAAAGGTGATATCCACATCGACGACATATCACCTACCGCCGTCCGATCAATATTGCTGTTTTTCTATACAGGAGAAATAACAATAACCGCGTCTAACGTGGAAGATTTGCTATTGGCTAatgAAGAAGGTAAAGGTGATATCCACATCGACGACATATCATCTACCGCCGTCCGATCAATATTGAAGTTTTTCTACACAGGAGAAATAACAATAACCGCGTCTAACGTGGAAGATTTGCTATGGGCTAatgAAGAAGGTAAAGGTGATATCCACATCGACGACATATCATCTACCGCCGTCCGATCAATATTGAAGTTTTTCTACACAGGAGACATAACAATAACCGCGTCTAACGTGGAAGATTTGCTATTGGCTAGCAAGAAATTTAAGATTGCAAAGTTGGAAACAGCATGCAGCCAAtatgAAGAAGGTAAAGGTGATATCCACATCGACGACATATCATCCACCGCCGTCCGATCAATATTGCTGTTTTTCTATACAGGAGAAATAACAGTAACCGCGTCTAACGTGGAAGATTTGCTATTGGCTAGCAAGAAATTTAAGATTGCAAAGTTGGAAACAGCATGCAGCCAAtatgAAGAAGGTAAAGGTGATATCCACATCGACGACATATCATCTACC
GCCGTCCGATCAATATTGCTGTTTTTCTATACAGGAGAAATAACAGTAACCGCGTCTAACGTGGAAGATTTGCTATTAGCTAGCAAGAAATTTAAGATTGCAAAGTTGGAAACAGCATGCAGCCAAtatgAAGAAGGTAAAGGTGATATCCACATCGACGACATATCTACCGCCGTCCGATCAATATTGAAGTTTTTCTACACAGGAGAAATAACAATAACCGCGTCTAACGTGGAAGATTTGCTATGGGCTAatgAAGAAGGTAAAGGTGATATCCACATCGACGACATATCATCTACCGCCGTCCGATCAATATTGAAGTTTTTCTACACAGGAGACATAACAATAACCGCGTCTAACGTGGAAGATTTGCTATTGGCTAGCAAGAAATTTAAGATTGCAAAGTTGGAAACAGCATGCAGCCAAtatgAAGAAGGTAAAGGTGATATCCACATCGACGACATATCATCCACCGCCGTCCGATCAATATTGCTGTTTTTCTATACAGGAGAAATAACAGTAACCGCGTCTAACGTGGAAGATTTGCTATTGGCTAGCAAGAAATTTAAGATTGCAAAGTTGGAAACAGCATGCAGCCAAtatgAAGAAGGTAAAGGTGATATCCACATCGACGACATATCATCTACCGCCGTCCGATCAATATTGCTGTTTTTCTATACAGGAGAAATAACAGTAACCGCGTCTAACGTGGAAGATTTGCTATTAGCTAGCAAGAAATTTAAGATTGCAAAGTTGGAAACAGCATGCAGCCAAtATGAAGAAGGTAAAGGTGATATCCACATCGACGACATATCATCTACCGCCGTCCGATCAATATTGAAGTTTTTCTACACAGGAGAAATAACAATAACCGCGTCTAACGTGGAAGATTTGCTATGGGCTAatgAAGAAGGTAAAGGTGATATCCACATCGACGACATATCATCTACCGCCGTCCGATCAATATTGAAGTTTTTCTACACAGGAGACATAACAATAACCGCGTCTAACGTGGAAGATTTGCTATTGGCTAGCAAGAAATTTAAGATTGCAAAGTTGGAAACAGCATGCAGCCAAtatgAAGAAGGTAAAGGTGATATCCACATCGACGACATATCATCCACCGCCGTCCGATCAATATTGCTGTTTTTCTATACAGGAGAAATAACAGTAACCGCGTCTAACGTGGAAGATTTGCTATTGGCTAGCAAGAAATTTAAGATTGCAAAGTTGGAAACAGCATGCAGCCAAtatgAAGAAGGTAAAGGTGATATCCACATCGACGACATATCATCTACCGCCGTCCGATCAATATTGCTGTTTTTCTATACAGGAGAAATAACAGTAACCGCGTCTAACGTGGAAGATTTGCTATTAGCTAGCAAGAAATTTAAGATTGCAAAGTTGGAAACAGCATGCAGCCAAtatgAAGAAGGTAAAGGTGATATCCACATCGACGACATATCATCTACCGCCGTCCGATCAATATTGAAGTTTTTCTACACAGGAGACATAACAATAACCGCGTCTAACGTGGAAGATTTGCTATTGGCTAGCAAGAAATTTAAGATTGCAAAGTTGGAAACAGCATGCAGCCAAtatgAAGAAGGTAAAGGTGATATCCACATCGACGACATATCATCCACCGCCGTCCGATCAATATTGCTGTTTTTCTATACAGGAGAAATAACAGTAACCGCGTCTAACGTGGAAGATTTGCTATTGGCTAGCAAGAAATTTAAGATTGCAAAGTTGGAAACAGCATGCAGCCAAtatgAAGAAGGTAAAGGTGATATCCACATCGACGACATATCATCTACCGCCGTCCGATCAATATTGCTGTTTTTCTATACAGGAGAAATAACAGTAACCGCGTCTAACGTGGAAGATTTGCTATTAGCTGGCAAGAAATTTAAGATTGCAAAGTTGGAAACAGCATGCAGCCAAtatgAAGAAGGTAAAGGTGATATCCACATTGACGACATATCATCCACCGCCGT
CCGATCAATATTGCTGTTTTTCTATACAGGAGAAATAACAACCGCGTCTAACGTGGAAGATTTGCTATTGGCTAgtGAAGAAGGTGAAGGTGATATACAAATCGACGACATATCATCTACCGCCGTCCGATCAATATTGCTGTTTTTCTATACaggagaaataataataaccgcGTCTAACGTGGAAGATTTGCTATTGGCTAGTAAGAAGTTTAAGATTGCAAAGTTGGAAACAGCATGCAGCCAATATGAAGAAGGTAAAGGTGATATCCACATCGACGACATATCATCTACCGCCGTCCGATCAATATTGCTGTTTTTCTATACAGGAGAAATAACAGTAACCGCGTCTAACGTGGAAGATTTGCTATTGGCTAGCAAGAAATTTAAGATTGCAAAGTTGGAAACAGCACGCAGCCAATGTCAAAACTACTAA
- Protein Sequence
- MDMMWKRFPSNIPSTDRPKTFTGELWTFAIGNFINTYIDFCESVLRCQLQTGEERLKGLALLNIHRDVEISEKVLDILAKNIIQYHPTKQYTTETSIRPETTKTFKFHTFVRDWTAQDEEGKGDIHIDYISSTAVRSILLFFYTGEITTTASNVEDLLLANEEGKGDIHIDDISPTAVRSILLFFYTGEITITASNVEDLLLANEEGKGDIHIDDISFTAVRSILLFFYTGEITVTASNVEDLLLASKKFKIAKLETACSQYEEGKGDIHIDDISSTAVRSILLFFYTGEITVTASNVEDLLLASKKFKIAKLETACSQYEEGKGDIHIDDISSTAVRSILKFFYTGEITITASNVEDLLWANEEGKGDIHIDDISSTAVRSILLFFYTGEITITASNVEDLLLANEEGKGDIHIDYIPSTAVRSILLFFYTGEITTTASNVEDLLLANEEGKGDIHIDDISPTAVRSILLFFYTGEITITASNVEDLLLANEEGKGDIHIDDISSTAVRSILKFFYTGEITITASNVEDLLWANEEGKGDIHIDDISSTAVRSILKFFYTGDITITASNVEDLLLASKKFKIAKLETACSQYEEGKGDIHIDDISSTAVRSILLFFYTGEITVTASNVEDLLLASKKFKIAKLETACSQYEEGKGDIHIDDISSTAVRSILLFFYTGEITVTASNVEDLLLASKKFKIAKLETACSQYEEGKGDIHIDDISTAVRSILKFFYTGEITITASNVEDLLWANEEGKGDIHIDDISSTAVRSILKFFYTGDITITASNVEDLLLASKKFKIAKLETACSQYEEGKGDIHIDDISSTAVRSILLFFYTGEITVTASNVEDLLLASKKFKIAKLETACSQYEEGKGDIHIDDISSTAVRSILLFFYTGEITVTASNVEDLLLASKKFKIAKLETACSQYEEGKGDIHIDDISSTAVRSILKFFYTGEITITASNVEDLLWANEEGKGDIHIDDISSTAVRSILKFFYTGDITITASNVEDLLLASKKFKIAKLETACSQYEEGKGDIHIDDISSTAVRSILLFFYTGEITVTASNVEDLLLASKKFKIAKLETACSQYEEGKGDIHIDDISSTAVRSILLFFYTGEITVTASNVEDLLLASKKFKIAKLETACSQYEEGKGDIHIDDISSTAVRSILKFFYTGDITITASNVEDLLLASKKFKIAKLETACSQYEEGKGDIHIDDISSTAVRSILLFFYTGEITVTASNVEDLLLASKKFKIAKLETACSQYEEGKGDIHIDDISSTAVRSILLFFYTGEITVTASNVEDLLLAGKKFKIAKLETACSQYEEGKGDIHIDDISSTAVRSILLFFYTGEITTASNVEDLLLASEEGEGDIQIDDISSTAVRSILLFFYTGEIIITASNVEDLLLASKKFKIAKLETACSQYEEGKGDIHIDDISSTAVRSILLFFYTGEITVTASNVEDLLLASKKFKIAKLETARSQCQNY
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -