Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For:	[Subscribe] [Scholar Register]

Number

Cited by Other Article(s)

Huang F, Gao Q, Zhou X, Guo W, Feng K, Zhu L, Huang T, Cai YD. Prediction of Solubility of Proteins in Escherichia coli Based on Functional and Structural Features Using Machine Learning Methods. Protein J 2024;43:983-996. [PMID: 39243320 DOI: 10.1007/s10930-024-10230-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/21/2024] [Indexed: 09/09/2024]

Watthanasakphuban N, Ninchan B, Pinmanee P, Rattanaporn K, Keawsompong S. In Silico Analysis and Development of the Secretory Expression of D-Psicose-3-Epimerase in Escherichia coli. Microorganisms 2024;12:1574. [PMID: 39203416 PMCID: PMC11356227 DOI: 10.3390/microorganisms12081574] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2024] [Revised: 07/27/2024] [Accepted: 07/28/2024] [Indexed: 09/03/2024] Open

Wei H, Lunin VV, Alahuhta M, Himmel ME, Huang S, Bomble YJ, Zhang M. Streamlining heterologous expression of top carbonic anhydrases in Escherichia coli: bioinformatic and experimental approaches. Microb Cell Fact 2024;23:190. [PMID: 38956607 PMCID: PMC11218372 DOI: 10.1186/s12934-024-02463-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2024] [Accepted: 06/18/2024] [Indexed: 07/04/2024] Open

Abstract

BACKGROUND

Carbonic anhydrase (CA) enzymes facilitate the reversible hydration of CO2 to bicarbonate ions and protons. Identifying efficient and robust CAs and expressing them in model host cells, such as Escherichia coli, enables more efficient engineering of these enzymes for industrial CO2 capture. However, expression of CAs in E. coli is challenging due to the possible formation of insoluble protein aggregates, or inclusion bodies. This makes the production of soluble and active CA protein a prerequisite for downstream applications.

RESULTS

In this study, we streamlined the process of CA expression by selecting seven top CA candidates and used two bioinformatic tools to predict their solubility for expression in E. coli. The prediction results place these enzymes in two categories: low and high solubility. Our expression of high solubility score CAs (namely CA5-SspCA, CA6-SazCAtrunc, CA7-PabCA and CA8-PhoCA) led to significantly higher protein yields (5 to 75 mg purified protein per liter) in flask cultures, indicating a strong correlation between the solubility prediction score and protein expression yields. Furthermore, phylogenetic tree analysis demonstrated CA class-specific clustering patterns for protein solubility and production yields. Unexpectedly, we also found that the unique N-terminal, 11-amino acid segment found after the signal sequence (not present in its homologs), was essential for CA6-SazCA activity.

CONCLUSIONS

Overall, this work demonstrated that protein solubility prediction, phylogenetic tree analysis, and experimental validation are potent tools for identifying top CA candidates and then producing soluble, active forms of these enzymes in E. coli. The comprehensive approaches we report here should be extendable to the expression of other heterogeneous proteins in E. coli.

Collapse

Li B, Ming D. GATSol, an enhanced predictor of protein solubility through the synergy of 3D structure graph and large language modeling. BMC Bioinformatics 2024;25:204. [PMID: 38824535 DOI: 10.1186/s12859-024-05820-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Accepted: 05/29/2024] [Indexed: 06/03/2024] Open

Abstract

BACKGROUND

Protein solubility is a critically important physicochemical property closely related to protein expression. For example, it is one of the main factors to be considered in the design and production of antibody drugs and a prerequisite for realizing various protein functions. Although several solubility prediction models have emerged in recent years, many of these models are limited to capturing information embedded in one-dimensional amino acid sequences, resulting in unsatisfactory predictive performance.

RESULTS

In this study, we introduce a novel Graph Attention network-based protein Solubility model, GATSol, which represents the 3D structure of proteins as a protein graph. In addition to the node features of amino acids extracted by the state-of-the-art protein large language model, GATSol utilizes amino acid distance maps generated using the latest AlphaFold technology. Rigorous testing on independent eSOL and the Saccharomyces cerevisiae test datasets has shown that GATSol outperforms most recently introduced models, especially with respect to the coefficient of determination R2, which reaches 0.517 and 0.424, respectively. It outperforms the current state-of-the-art GraphSol by 18.4% on the S. cerevisiae_test set.

CONCLUSIONS

GATSol captures 3D dimensional features of proteins by building protein graphs, which significantly improves the accuracy of protein solubility prediction. Recent advances in protein structure modeling allow our method to incorporate spatial structure features extracted from predicted structures into the model by relying only on the input of protein sequences, which simplifies the entire graph neural network prediction process, making it more user-friendly and efficient. As a result, GATSol may help prioritize highly soluble proteins, ultimately reducing the cost and effort of experimental work. The source code and data of the GATSol model are freely available at https://github.com/binbinbinv/GATSol .

Collapse

Gonçalves AAM, Ribeiro AJ, Resende CAA, Couto CAP, Gandra IB, Dos Santos Barcelos IC, da Silva JO, Machado JM, Silva KA, Silva LS, Dos Santos M, da Silva Lopes L, de Faria MT, Pereira SP, Xavier SR, Aragão MM, Candida-Puma MA, de Oliveira ICM, Souza AA, Nogueira LM, da Paz MC, Coelho EAF, Giunchetti RC, de Freitas SM, Chávez-Fumagalli MA, Nagem RAP, Galdino AS. Recombinant multiepitope proteins expressed in Escherichia coli cells and their potential for immunodiagnosis. Microb Cell Fact 2024;23:145. [PMID: 38778337 PMCID: PMC11110257 DOI: 10.1186/s12934-024-02418-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Accepted: 05/07/2024] [Indexed: 05/25/2024] Open

Abstract

Recombinant multiepitope proteins (RMPs) are a promising alternative for application in diagnostic tests and, given their wide application in the most diverse diseases, this review article aims to survey the use of these antigens for diagnosis, as well as discuss the main points surrounding these antigens. RMPs usually consisting of linear, immunodominant, and phylogenetically conserved epitopes, has been applied in the experimental diagnosis of various human and animal diseases, such as leishmaniasis, brucellosis, cysticercosis, Chagas disease, hepatitis, leptospirosis, leprosy, filariasis, schistosomiasis, dengue, and COVID-19. The synthetic genes for these epitopes are joined to code a single RMP, either with spacers or fused, with different biochemical properties. The epitopes' high density within the RMPs contributes to a high degree of sensitivity and specificity. The RMPs can also sidestep the need for multiple peptide synthesis or multiple recombinant proteins, reducing costs and enhancing the standardization conditions for immunoassays. Methods such as bioinformatics and circular dichroism have been widely applied in the development of new RMPs, helping to guide their construction and better understand their structure. Several RMPs have been expressed, mainly using the Escherichia coli expression system, highlighting the importance of these cells in the biotechnological field. In fact, technological advances in this area, offering a wide range of different strains to be used, make these cells the most widely used expression platform. RMPs have been experimentally used to diagnose a broad range of illnesses in the laboratory, suggesting they could also be useful for accurate diagnoses commercially. On this point, the RMP method offers a tempting substitute for the production of promising antigens used to assemble commercial diagnostic kits.

Collapse

Affiliation(s)

Ana Alice Maia Gonçalves Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
Anna Julia Ribeiro Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
Carlos Ananias Aparecido Resende Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
Carolina Alves Petit Couto Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
Isadora Braga Gandra Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
Isabelle Caroline Dos Santos Barcelos Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
Jonatas Oliveira da Silva Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
Juliana Martins Machado Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
Kamila Alves Silva Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
Líria Souza Silva Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
Michelli Dos Santos Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
Lucas da Silva Lopes Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
Mariana Teixeira de Faria Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
Sabrina Paula Pereira Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
Sandra Rodrigues Xavier Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
Matheus Motta Aragão Department of Biochemistry and Immunology, Federal University of Minas Gerais, Belo Horizonte, 31270-901, Brazil
Mayron Antonio Candida-Puma Computational Biology and Chemistry Research Group, Vicerrectorado de Investigación, Universidad Católica de Santa María, Arequipa, 04000, Peru
Izadora Cristina Moreira de Oliveira Biophysics Laboratory, Institute of Biological Sciences, Department of Cell Biology, University of Brasilia, Brasília, 70910-900, Brazil
Amanda Araujo Souza Biophysics Laboratory, Institute of Biological Sciences, Department of Cell Biology, University of Brasilia, Brasília, 70910-900, Brazil
Lais Moreira Nogueira Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
Mariana Campos da Paz Bioactives and Nanobiotechnology Laboratory, Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil
Eduardo Antônio Ferraz Coelho Postgraduate Program in Health Sciences, Infectious Diseases and Tropical Medicine, Faculty of Medicine, Federal University of Minas Gerais, Belo Horizonte, 30130-100, Brazil
Rodolfo Cordeiro Giunchetti Laboratory of Biology of Cell Interactions, National Institute of Science and Technology on Tropical Diseases (INCT-DT), Department of Morphology, Federal University of Minas Gerais, Belo Horizonte, 31270-901, Brazil
Sonia Maria de Freitas Biophysics Laboratory, Institute of Biological Sciences, Department of Cell Biology, University of Brasilia, Brasília, 70910-900, Brazil
Miguel Angel Chávez-Fumagalli Computational Biology and Chemistry Research Group, Vicerrectorado de Investigación, Universidad Católica de Santa María, Arequipa, 04000, Peru
Ronaldo Alves Pinto Nagem Department of Biochemistry and Immunology, Federal University of Minas Gerais, Belo Horizonte, 31270-901, Brazil
Alexsandro Sobreira Galdino Microorganism Biotechnology Laboratory, National Institute of Science and Technology on Industrial Biotechnology (INCT-BI), Federal University of São João Del-Rei, Midwest Campus, Divinópolis, 35501-296, Brazil.

Collapse

Taheri-Anganeh M, Nezafat N, Gharibi S, Khatami SH, Vahedi F, Shabaninejad Z, Asadi M, Savardashtaki A, Movahedpour A, Ghasemi H. Designing a Secretory form of RTX-A as an Anticancer Toxin: An In Silico Approach. Recent Pat Biotechnol 2024;18:332-343. [PMID: 38817010 DOI: 10.2174/0118722083267796231210060150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Revised: 10/29/2023] [Accepted: 11/17/2023] [Indexed: 06/01/2024]

Abstract

BACKGROUND

Cancer is a leading cause of death and a significant public health issue worldwide. Standard treatment methods such as chemotherapy, radiotherapy, and surgery are only sometimes effective. Therefore, new therapeutic approaches are needed for cancer treatment. Sea anemone actinoporins are pore-forming toxins (PFTs) with membranolytic activities. RTX-A is a type of PFT that interacts with membrane phospholipids, resulting in pore formation. The synthesis of recombinant proteins in a secretory form has several advantages, including protein solubility and easy purification. In this study, we aimed to discover suitable signal peptides for producing RTX-A in Bacillus subtilis in a secretory form.

METHODS

Signal peptides were selected from the Signal Peptide Web Server. The probability and secretion pathways of the selected signal peptides were evaluated using the SignalP server. ProtParam and Protein-sol were used to predict the physico-chemical properties and solubility. AlgPred was used to predict the allergenicity of RTX-A linked to suitable signal peptides. Non-allergenic, stable, and soluble signal peptides fused to proteins were chosen, and their secondary and tertiary structures were predicted using GOR IV and I-TASSER, respectively. The PROCHECK server performed the validation of 3D structures.

RESULTS

According to bioinformatics analysis, the fusion forms of OSMY_ECOLI and MALE_ECOLI linked to RTX-A were identified as suitable signal peptides. The final proteins with signal peptides were stable, soluble, and non-allergenic for the human body. Moreover, they had appropriate secondary and tertiary structures.

CONCLUSION

The signal above peptides appears ideal for rationalizing secretory and soluble RTX-A. Therefore, the signal peptides found in this study should be further investigated through experimental researches and patents.

Collapse

Chen Z, Wang X, Chen X, Huang J, Wang C, Wang J, Wang Z. Accelerating therapeutic protein design with computational approaches toward the clinical stage. Comput Struct Biotechnol J 2023;21:2909-2926. [PMID: 38213894 PMCID: PMC10781723 DOI: 10.1016/j.csbj.2023.04.027] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2023] [Revised: 04/11/2023] [Accepted: 04/27/2023] [Indexed: 01/13/2024] Open

Hashemzaei M, Nezafat N, Ghoshoon MB, Negahdaripour M. In-silico selection of appropriate signal peptides for romiplostim secretory production in Escherichia coli. INFORMATICS IN MEDICINE UNLOCKED 2022. [DOI: 10.1016/j.imu.2022.101146] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open

Tamehri M, Rasooli I, Pishgahi M, Jahangiri A, Ramezanalizadeh F, Banisaeed Langroodi SR. Combination of BauA and OmpA elicit immunoprotection against Acinetobacter baumannii in a murine sepsis model. Microb Pathog 2022;173:105874. [DOI: 10.1016/j.micpath.2022.105874] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2022] [Revised: 09/18/2022] [Accepted: 11/05/2022] [Indexed: 11/09/2022]

Rahmatabadi SS, Mobini K, Askari S, Najafian J, Karami K, Soleymani B, Mostafaie A. In silico characterization of fructosyl peptide oxidase properties from Eupenicillium terrenum. J Mol Recognit 2022;35:e2980. [PMID: 35657361 DOI: 10.1002/jmr.2980] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2022] [Revised: 05/23/2022] [Accepted: 06/01/2022] [Indexed: 12/24/2022]

Yi W, Sun A, Liu M, Liu X, Zhang W, Dai Q. Comparative Study on Feature Selection in Protein Structure and Function Prediction. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2022;2022:1650693. [PMID: 36267316 PMCID: PMC9578875 DOI: 10.1155/2022/1650693] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/27/2022] [Accepted: 09/14/2022] [Indexed: 11/18/2022]

Novel multi epitope-based vaccine against monkeypox virus: vaccinomic approach. Sci Rep 2022;12:15983. [PMID: 36156077 PMCID: PMC9510130 DOI: 10.1038/s41598-022-20397-z] [Citation(s) in RCA: 30] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2022] [Accepted: 09/13/2022] [Indexed: 11/30/2022] Open

Abstract

While mankind is still dealing with the COVID-19 pandemic, a case of monkeypox virus (MPXV) has been reported to the WHO on May 7, 2022. Monkeypox is a viral zoonotic disease that has been a public health threat, particularly in Africa. However, it has recently expanded to other parts of the world, so it may soon become a global issue. Thus, the current work was planned and then designed a multi-epitope vaccine against MPXV utilizing the cell surface-binding protein as a target in order to develop a novel and safe vaccine that can evoke the desirable immunological response. The proposed MHC-I, MHC-II, and B-cell epitopes were selected to design multi-epitope vaccine constructs linked with suitable linkers in combination with different adjuvants to enhance the immune responses for the vaccine constructs. The proposed vaccine was composed of 275 amino acids and was shown to be antigenic in Vaxijen server (0.5311) and non-allergenic in AllerTop server. The 3D structure of the designed vaccine was predicted, refined and validated by various in silico tools to assess the stability of the vaccine. Moreover, the solubility of the vaccine construct was found greater than the average solubility provided by protein-Sol server which indicating the solubility of the vaccine construct. Additionally, the most promising epitopes bound to MHC I and MHC II alleles were found having good binding affinities with low energies ranging between − 7.0 and − 8.6 kcal/mol. According to the immunological simulation research, the vaccine was found to elicit a particular immune reaction against the monkeypox virus. Finally, the molecular dynamic study shows that the designed vaccine is stable with minimum RMSF against MHC I allele. We conclude from our research that the cell surface-binding protein is one of the primary proteins involved in MPXV pathogenesis. As a result, our study will aid in the development of appropriate therapeutics and prompt the development of future vaccines against MPXV.

Collapse

Abuei H, Pirouzfar M, Mojiri A, Behzad-Behbahani A, Kalantari T, Bemani P, Farhadi A. Maximizing the recovery of the native p28 bacterial peptide with improved activity and maintained solubility and stability in Escherichia coli BL21 (DE3). METHODS IN MICROBIOLOGY 2022;200:106560. [PMID: 36031157 DOI: 10.1016/j.mimet.2022.106560] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/19/2022] [Revised: 08/10/2022] [Accepted: 08/20/2022] [Indexed: 02/06/2023]

Wang H, Kwong CF, Liu Q, Liu Z, Chen Z. A Novel Artificial Intelligence System in Formulation Dissolution Prediction. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022;2022:8640115. [PMID: 35978897 PMCID: PMC9377879 DOI: 10.1155/2022/8640115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/01/2022] [Revised: 06/20/2022] [Accepted: 06/24/2022] [Indexed: 11/29/2022]

Karaiyan P, Chang CCH, Chan ES, Tey BT, Ramanan RN, Ooi CW. In silico screening and heterologous expression of soluble dimethyl sulfide monooxygenases of microbial origin in Escherichia coli. Appl Microbiol Biotechnol 2022;106:4523-4537. [PMID: 35713659 PMCID: PMC9259527 DOI: 10.1007/s00253-022-12008-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2022] [Revised: 05/30/2022] [Accepted: 06/01/2022] [Indexed: 11/28/2022]

Abstract

Abstract

Sequence-based screening has been widely applied in the discovery of novel microbial enzymes. However, majority of the sequences in the genomic databases were annotated using computational approaches and lacks experimental characterization. Hence, the success in obtaining the functional biocatalysts with improved characteristics requires an efficient screening method that considers a wide array of factors. Recombinant expression of microbial enzymes is often hampered by the undesirable formation of inclusion body. Here, we present a systematic in silico screening method to identify the proteins expressible in soluble form and with the desired biological properties. The screening approach was adopted in the recombinant expression of dimethyl sulfide (DMS) monooxygenase in Escherichia coli. DMS monooxygenase, a two-component enzyme consisting of DmoA and DmoB subunits, was used as a model protein. The success rate of producing soluble and active DmoA is 71% (5 out of 7 genes). Interestingly, the soluble recombinant DmoA enzymes exhibited the NADH:FMN oxidoreductase activity in the absence of DmoB (second subunit), and the cofactor FMN, suggesting that DmoA is also an oxidoreductase. DmoA originated from Janthinobacterium sp. AD80 showed the maximum NADH oxidation activity (maximum reaction rate: 6.6 µM/min; specific activity: 133 µM/min/mg). This novel finding may allow DmoA to be used as an oxidoreductase biocatalyst for various industrial applications. The in silico gene screening methodology established from this study can increase the success rate of producing soluble and functional enzymes while avoiding the laborious trial and error involved in the screening of a large pool of genes available.

Key points

• A systematic gene screening method was demonstrated.

• DmoA is also an oxidoreductase capable of oxidizing NADH and reducing FMN.

• DmoA oxidizes NADH in the absence of external FMN.

Supplementary Information

The online version contains supplementary material available at 10.1007/s00253-022-12008-8.

Collapse

Santos-Junior MN, Neves WS, Santos RS, Almeida PP, Fernandes JM, Guimarães BCDB, Barbosa MS, da Silva LSC, Gomes CP, Sampaio BA, Rezende IDS, Correia TML, Neres NSDM, Campos GB, Bastos BL, Timenetsky J, Marques LM. Heterologous Expression, Purification, and Immunomodulatory Effects of Recombinant Lipoprotein GUDIV-103 Isolated from Ureaplasma diversum. Microorganisms 2022;10:microorganisms10051032. [PMID: 35630474 PMCID: PMC9147684 DOI: 10.3390/microorganisms10051032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2022] [Revised: 05/05/2022] [Accepted: 05/07/2022] [Indexed: 02/06/2023] Open

Affiliation(s)

Manoel Neres Santos-Junior Department of Biointeraction, Multidisciplinary Institute of Health, Federal University of Bahia, Vitória da Conquista 40170-110, Brazil; (M.N.S.-J.); (W.S.N.); (R.S.S.); (J.M.F.); (T.M.L.C.); (N.S.d.M.N.) Department of Biology, and Biotechnology of Microorganisms, State University of Santa Cruz (UESC), Ilhéus 45662-900, Brazil; (B.C.d.B.G.); (L.S.C.d.S.); (C.P.G.); (B.A.S.); (G.B.C.); (B.L.B.)
Wanderson Souza Neves Department of Biointeraction, Multidisciplinary Institute of Health, Federal University of Bahia, Vitória da Conquista 40170-110, Brazil; (M.N.S.-J.); (W.S.N.); (R.S.S.); (J.M.F.); (T.M.L.C.); (N.S.d.M.N.)
Ronaldo Silva Santos Department of Biointeraction, Multidisciplinary Institute of Health, Federal University of Bahia, Vitória da Conquista 40170-110, Brazil; (M.N.S.-J.); (W.S.N.); (R.S.S.); (J.M.F.); (T.M.L.C.); (N.S.d.M.N.)
Palloma Porto Almeida Bioinformatics and Computational Biology Lab, Division of Experimental and Translational Research, Brazilian National Cancer Institute (INCA), Rio de Janeiro 20231-050, Brazil;
Janaina Marinho Fernandes Department of Biointeraction, Multidisciplinary Institute of Health, Federal University of Bahia, Vitória da Conquista 40170-110, Brazil; (M.N.S.-J.); (W.S.N.); (R.S.S.); (J.M.F.); (T.M.L.C.); (N.S.d.M.N.)
Bruna Carolina de Brito Guimarães Department of Biology, and Biotechnology of Microorganisms, State University of Santa Cruz (UESC), Ilhéus 45662-900, Brazil; (B.C.d.B.G.); (L.S.C.d.S.); (C.P.G.); (B.A.S.); (G.B.C.); (B.L.B.)
Maysa Santos Barbosa Department of Microbiology, Institute of Biomedical Science, University of São Paulo, São Paulo 05508-000, Brazil; (M.S.B.); (I.d.S.R.); (J.T.)
Lucas Santana Coelho da Silva Department of Biology, and Biotechnology of Microorganisms, State University of Santa Cruz (UESC), Ilhéus 45662-900, Brazil; (B.C.d.B.G.); (L.S.C.d.S.); (C.P.G.); (B.A.S.); (G.B.C.); (B.L.B.)
Camila Pacheco Gomes Department of Biology, and Biotechnology of Microorganisms, State University of Santa Cruz (UESC), Ilhéus 45662-900, Brazil; (B.C.d.B.G.); (L.S.C.d.S.); (C.P.G.); (B.A.S.); (G.B.C.); (B.L.B.)
Beatriz Almeida Sampaio Department of Biology, and Biotechnology of Microorganisms, State University of Santa Cruz (UESC), Ilhéus 45662-900, Brazil; (B.C.d.B.G.); (L.S.C.d.S.); (C.P.G.); (B.A.S.); (G.B.C.); (B.L.B.)
Izadora de Souza Rezende Department of Microbiology, Institute of Biomedical Science, University of São Paulo, São Paulo 05508-000, Brazil; (M.S.B.); (I.d.S.R.); (J.T.)
Thiago Macedo Lopes Correia Department of Biointeraction, Multidisciplinary Institute of Health, Federal University of Bahia, Vitória da Conquista 40170-110, Brazil; (M.N.S.-J.); (W.S.N.); (R.S.S.); (J.M.F.); (T.M.L.C.); (N.S.d.M.N.)
Nayara Silva de Macedo Neres Department of Biointeraction, Multidisciplinary Institute of Health, Federal University of Bahia, Vitória da Conquista 40170-110, Brazil; (M.N.S.-J.); (W.S.N.); (R.S.S.); (J.M.F.); (T.M.L.C.); (N.S.d.M.N.)
Guilherme Barreto Campos Department of Biology, and Biotechnology of Microorganisms, State University of Santa Cruz (UESC), Ilhéus 45662-900, Brazil; (B.C.d.B.G.); (L.S.C.d.S.); (C.P.G.); (B.A.S.); (G.B.C.); (B.L.B.)
Bruno Lopes Bastos Department of Biology, and Biotechnology of Microorganisms, State University of Santa Cruz (UESC), Ilhéus 45662-900, Brazil; (B.C.d.B.G.); (L.S.C.d.S.); (C.P.G.); (B.A.S.); (G.B.C.); (B.L.B.)
Jorge Timenetsky Department of Microbiology, Institute of Biomedical Science, University of São Paulo, São Paulo 05508-000, Brazil; (M.S.B.); (I.d.S.R.); (J.T.)
Lucas Miranda Marques Department of Biointeraction, Multidisciplinary Institute of Health, Federal University of Bahia, Vitória da Conquista 40170-110, Brazil; (M.N.S.-J.); (W.S.N.); (R.S.S.); (J.M.F.); (T.M.L.C.); (N.S.d.M.N.) Department of Biology, and Biotechnology of Microorganisms, State University of Santa Cruz (UESC), Ilhéus 45662-900, Brazil; (B.C.d.B.G.); (L.S.C.d.S.); (C.P.G.); (B.A.S.); (G.B.C.); (B.L.B.) Department of Microbiology, Institute of Biomedical Science, University of São Paulo, São Paulo 05508-000, Brazil; (M.S.B.); (I.d.S.R.); (J.T.) Correspondence:

Collapse

Thumuluri V, Martiny HM, Almagro Armenteros JJ, Salomon J, Nielsen H, Johansen AR. NetSolP: predicting protein solubility in Escherichia coli using language models. Bioinformatics 2022;38:941-946. [PMID: 35088833 DOI: 10.1093/bioinformatics/btab801] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2021] [Revised: 10/13/2021] [Accepted: 11/23/2021] [Indexed: 11/13/2022] Open

Caetano BDL, Domingos MDO, da Silva MA, da Silva JCA, Polatto JM, Montoni F, Iwai LK, Pimenta DC, Vigerelli H, Vieira PCG, Ruiz RDC, Patané JS, Piazza RMF. In Silico Prediction and Design of Uropathogenic Escherichia coli Alpha-Hemolysin Generate a Soluble and Hemolytic Recombinant Toxin. Microorganisms 2022;10:microorganisms10010172. [PMID: 35056621 PMCID: PMC8778037 DOI: 10.3390/microorganisms10010172] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2021] [Revised: 01/01/2022] [Accepted: 01/08/2022] [Indexed: 01/27/2023] Open

Affiliation(s)

Bruna De Lucca Caetano Laboratório de Bacteriologia, Instituto Butantan, Av. Vital Brazil, São Paulo 1500-05503-900, SP, Brazil; (B.D.L.C.); (M.d.O.D.); (M.A.d.S.); (J.C.A.d.S.); (J.M.P.); (P.C.G.V.); (R.d.C.R.)
Marta de Oliveira Domingos Laboratório de Bacteriologia, Instituto Butantan, Av. Vital Brazil, São Paulo 1500-05503-900, SP, Brazil; (B.D.L.C.); (M.d.O.D.); (M.A.d.S.); (J.C.A.d.S.); (J.M.P.); (P.C.G.V.); (R.d.C.R.)
Miriam Aparecida da Silva Laboratório de Bacteriologia, Instituto Butantan, Av. Vital Brazil, São Paulo 1500-05503-900, SP, Brazil; (B.D.L.C.); (M.d.O.D.); (M.A.d.S.); (J.C.A.d.S.); (J.M.P.); (P.C.G.V.); (R.d.C.R.)
Jessika Cristina Alves da Silva Laboratório de Bacteriologia, Instituto Butantan, Av. Vital Brazil, São Paulo 1500-05503-900, SP, Brazil; (B.D.L.C.); (M.d.O.D.); (M.A.d.S.); (J.C.A.d.S.); (J.M.P.); (P.C.G.V.); (R.d.C.R.)
Juliana Moutinho Polatto Laboratório de Bacteriologia, Instituto Butantan, Av. Vital Brazil, São Paulo 1500-05503-900, SP, Brazil; (B.D.L.C.); (M.d.O.D.); (M.A.d.S.); (J.C.A.d.S.); (J.M.P.); (P.C.G.V.); (R.d.C.R.)
Fabio Montoni Laboratório de Toxinologia Aplicada, Instituto Butantan, Av. Vital Brazil, São Paulo 1500-05503-900, SP, Brazil; (F.M.); (L.K.I.)
Leo Kei Iwai Laboratório de Toxinologia Aplicada, Instituto Butantan, Av. Vital Brazil, São Paulo 1500-05503-900, SP, Brazil; (F.M.); (L.K.I.)
Daniel Carvalho Pimenta Laboratório de Biofísica e Bioquímica, Instituto Butantan, Av. Vital Brazil, São Paulo 1500-05503-900, SP, Brazil; (D.C.P.); (H.V.)
Hugo Vigerelli Laboratório de Biofísica e Bioquímica, Instituto Butantan, Av. Vital Brazil, São Paulo 1500-05503-900, SP, Brazil; (D.C.P.); (H.V.)
Paulo Cesar Gomes Vieira Laboratório de Bacteriologia, Instituto Butantan, Av. Vital Brazil, São Paulo 1500-05503-900, SP, Brazil; (B.D.L.C.); (M.d.O.D.); (M.A.d.S.); (J.C.A.d.S.); (J.M.P.); (P.C.G.V.); (R.d.C.R.)
Rita de Cassia Ruiz Laboratório de Bacteriologia, Instituto Butantan, Av. Vital Brazil, São Paulo 1500-05503-900, SP, Brazil; (B.D.L.C.); (M.d.O.D.); (M.A.d.S.); (J.C.A.d.S.); (J.M.P.); (P.C.G.V.); (R.d.C.R.)
José Salvatore Patané Laboratório de Ciclo Celular, Instituto Butantan, Av. Vital Brazil, São Paulo 1500-05503-900, SP, Brazil Correspondence: (J.S.P.); (R.M.F.P.)
Roxane Maria Fontes Piazza Laboratório de Bacteriologia, Instituto Butantan, Av. Vital Brazil, São Paulo 1500-05503-900, SP, Brazil; (B.D.L.C.); (M.d.O.D.); (M.A.d.S.); (J.C.A.d.S.); (J.M.P.); (P.C.G.V.); (R.d.C.R.) Correspondence: (J.S.P.); (R.M.F.P.)

Collapse

Mustafa MI, Shantier SW. Next generation multi epitope based peptide vaccine against Marburg Virus disease combined with molecular docking studies. INFORMATICS IN MEDICINE UNLOCKED 2022. [DOI: 10.1016/j.imu.2022.101087] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open

Madani M, Lin K, Tarakanova A. DSResSol: A Sequence-Based Solubility Predictor Created with Dilated Squeeze Excitation Residual Networks. Int J Mol Sci 2021;22:13555. [PMID: 34948354 PMCID: PMC8704505 DOI: 10.3390/ijms222413555] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2021] [Revised: 12/13/2021] [Accepted: 12/14/2021] [Indexed: 11/16/2022] Open

Abstract

Protein solubility is an important thermodynamic parameter that is critical for the characterization of a protein's function, and a key determinant for the production yield of a protein in both the research setting and within industrial (e.g., pharmaceutical) applications. Experimental approaches to predict protein solubility are costly, time-consuming, and frequently offer only low success rates. To reduce cost and expedite the development of therapeutic and industrially relevant proteins, a highly accurate computational tool for predicting protein solubility from protein sequence is sought. While a number of in silico prediction tools exist, they suffer from relatively low prediction accuracy, bias toward the soluble proteins, and limited applicability for various classes of proteins. In this study, we developed a novel deep learning sequence-based solubility predictor, DSResSol, that takes advantage of the integration of squeeze excitation residual networks with dilated convolutional neural networks and outperforms all existing protein solubility prediction models. This model captures the frequently occurring amino acid k-mers and their local and global interactions and highlights the importance of identifying long-range interaction information between amino acid k-mers to achieve improved accuracy, using only protein sequence as input. DSResSol outperforms all available sequence-based solubility predictors by at least 5% in terms of accuracy when evaluated by two different independent test sets. Compared to existing predictors, DSResSol not only reduces prediction bias for insoluble proteins but also predicts soluble proteins within the test sets with an accuracy that is at least 13% higher than existing models. We derive the key amino acids, dipeptides, and tripeptides contributing to protein solubility, identifying glutamic acid and serine as critical amino acids for protein solubility prediction. Overall, DSResSol can be used for the fast, reliable, and inexpensive prediction of a protein's solubility to guide experimental design.

Collapse

Mital S, Christie G, Dikicioglu D. Recombinant expression of insoluble enzymes in Escherichia coli: a systematic review of experimental design and its manufacturing implications. Microb Cell Fact 2021;20:208. [PMID: 34717620 PMCID: PMC8557517 DOI: 10.1186/s12934-021-01698-w] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2021] [Accepted: 10/22/2021] [Indexed: 02/06/2023] Open

Vakili O, Khatami SH, Maleksabet A, Movahedpour A, Fana SE, Sadegh R, Salmanzadeh AH, Razeghifam H, Nourdideh S, Tehrani SS, Taheri-Anganeh M. Finding Appropriate Signal Peptides for Secretory Production of Recombinant Glucarpidase: An In SilicoMethod. Recent Pat Biotechnol 2021;15:302-315. [PMID: 34547999 DOI: 10.2174/1872208315666210921095420] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2021] [Revised: 06/16/2021] [Accepted: 08/02/2021] [Indexed: 11/22/2022]

Prediction of Protein Solubility Based on Sequence Feature Fusion and DDcCNN. Interdiscip Sci 2021;13:703-716. [PMID: 34236625 DOI: 10.1007/s12539-021-00456-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2021] [Revised: 06/21/2021] [Accepted: 06/23/2021] [Indexed: 10/20/2022]

Abstract

BACKGROUND

Prediction of protein solubility is an indispensable prerequisite for pharmaceutical research and production. The general and specific objective of this work is to design a new model for predicting protein solubility by using protein sequence feature fusion and deep dual-channel convolutional neural networks (DDcCNN) to improve the performance of existing prediction models.

METHODS

The redundancy of raw protein is reduced by CD-HIT. The four subsequences are built from protein sequence: one global and three locals. The global subsequence is the entire protein sequence, and these local subsequences are obtained by moving a sliding window with some rules. Using G-gap to extract the features of the above four subsequences, a mixed matrix is constructed as the input of one channel which is composed of three-layer convolutional operating. Additional features are extracted by SCRATCH tool as input of another channel, which is consist of a single convolution in order to find hidden relationships and improve the accuracy of predictor. The outputs of two parallel channels are concatenated as the input of the hidden layer. And the prediction of protein solubility is obtained in the output layer. The best protein solubility prediction model is obtained by doing some comparative experiments of different frameworks.

RESULTS

The performance indicators of DDcCNN model (our designed) are as follows: accuracy of 77.82%, Matthew's correlation coefficient of 0.57, sensitivity of 76.13% and specificity of 79.32%. The results of some comparative experiments show that the overall performance of DDcCNN model is better than existing models (GCNN, LCNN and PCNN). The related models and data are publicly deposited at http://www.ddccnn.wang .

CONCLUSION

The satisfactory performance of DDcCNN model reveals that these features and flexible computational methodologies can reinforce the existing prediction models for better prediction of protein solubility could be applied in several applications, such as to preselect initial targets that are soluble or to alter solubility of target proteins, thus can help to reduce the production cost.

Collapse

Wu X, Yu L. EPSOL: sequence-based protein solubility prediction using multidimensional embedding. Bioinformatics 2021;37:4314-4320. [PMID: 34145885 DOI: 10.1093/bioinformatics/btab463] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2021] [Revised: 05/18/2021] [Accepted: 06/17/2021] [Indexed: 11/14/2022] Open

Barrett R, White AD. Investigating Active Learning and Meta-Learning for Iterative Peptide Design. J Chem Inf Model 2021;61:95-105. [PMID: 33350829 PMCID: PMC7842147 DOI: 10.1021/acs.jcim.0c00946] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2020] [Indexed: 01/14/2023]

Santos Junior MN, Santos RS, Neves WS, Fernandes JM, de Brito Guimarães BC, Barbosa MS, Silva LSC, Gomes CP, Rezende IS, Oliveira CNT, de Macêdo Neres NS, Campos GB, Bastos BL, Timenetsky J, Marques LM. Immunoinformatics and analysis of antigen distribution of Ureaplasma diversum strains isolated from different Brazilian states. BMC Vet Res 2020;16:379. [PMID: 33028315 PMCID: PMC7542862 DOI: 10.1186/s12917-020-02602-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2020] [Accepted: 09/30/2020] [Indexed: 01/29/2023] Open

Affiliation(s)

Manoel Neres Santos Junior Department of Biointeraction, Multidisciplinary Institute of Health, Universidade Federal da Bahia, Rua Hormindo Barros, 58 - Quadra 17 - Lote 58, Bairro Candeias - CEP: 45.029-094, Vitória da Conquista, BA, Brazil.,Department of Microbiology, State University of Santa Cruz (UESC), Ilhéus, Brazil
Ronaldo Silva Santos Department of Biointeraction, Multidisciplinary Institute of Health, Universidade Federal da Bahia, Rua Hormindo Barros, 58 - Quadra 17 - Lote 58, Bairro Candeias - CEP: 45.029-094, Vitória da Conquista, BA, Brazil
Wanderson Souza Neves Department of Biointeraction, Multidisciplinary Institute of Health, Universidade Federal da Bahia, Rua Hormindo Barros, 58 - Quadra 17 - Lote 58, Bairro Candeias - CEP: 45.029-094, Vitória da Conquista, BA, Brazil
Janaina Marinho Fernandes Department of Biointeraction, Multidisciplinary Institute of Health, Universidade Federal da Bahia, Rua Hormindo Barros, 58 - Quadra 17 - Lote 58, Bairro Candeias - CEP: 45.029-094, Vitória da Conquista, BA, Brazil
Bruna Carolina de Brito Guimarães Department of Microbiology, State University of Santa Cruz (UESC), Ilhéus, Brazil
Maysa Santos Barbosa Department of Microbiology, Institute of Biomedical Science, University of São Paulo, São Paulo, Brazil
Lucas Santana Coelho Silva Department of Microbiology, State University of Santa Cruz (UESC), Ilhéus, Brazil
Camila Pacheco Gomes Department of Microbiology, State University of Santa Cruz (UESC), Ilhéus, Brazil
Izadora Souza Rezende Department of Microbiology, Institute of Biomedical Science, University of São Paulo, São Paulo, Brazil
Caline Novaes Teixeira Oliveira Department of Biointeraction, Multidisciplinary Institute of Health, Universidade Federal da Bahia, Rua Hormindo Barros, 58 - Quadra 17 - Lote 58, Bairro Candeias - CEP: 45.029-094, Vitória da Conquista, BA, Brazil.,Department of Microbiology, State University of Santa Cruz (UESC), Ilhéus, Brazil
Nayara Silva de Macêdo Neres Department of Biointeraction, Multidisciplinary Institute of Health, Universidade Federal da Bahia, Rua Hormindo Barros, 58 - Quadra 17 - Lote 58, Bairro Candeias - CEP: 45.029-094, Vitória da Conquista, BA, Brazil
Guilherme Barreto Campos Department of Biointeraction, Multidisciplinary Institute of Health, Universidade Federal da Bahia, Rua Hormindo Barros, 58 - Quadra 17 - Lote 58, Bairro Candeias - CEP: 45.029-094, Vitória da Conquista, BA, Brazil
Bruno Lopes Bastos Department of Biointeraction, Multidisciplinary Institute of Health, Universidade Federal da Bahia, Rua Hormindo Barros, 58 - Quadra 17 - Lote 58, Bairro Candeias - CEP: 45.029-094, Vitória da Conquista, BA, Brazil
Jorge Timenetsky Department of Microbiology, Institute of Biomedical Science, University of São Paulo, São Paulo, Brazil
Lucas Miranda Marques Department of Biointeraction, Multidisciplinary Institute of Health, Universidade Federal da Bahia, Rua Hormindo Barros, 58 - Quadra 17 - Lote 58, Bairro Candeias - CEP: 45.029-094, Vitória da Conquista, BA, Brazil. .,Department of Microbiology, State University of Santa Cruz (UESC), Ilhéus, Brazil. .,Department of Microbiology, Institute of Biomedical Science, University of São Paulo, São Paulo, Brazil.

Collapse

Raimondi D, Orlando G, Fariselli P, Moreau Y. Insight into the protein solubility driving forces with neural attention. PLoS Comput Biol 2020;16:e1007722. [PMID: 32352965 PMCID: PMC7217484 DOI: 10.1371/journal.pcbi.1007722] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2019] [Revised: 05/12/2020] [Accepted: 02/10/2020] [Indexed: 12/29/2022] Open

Abstract

Protein solubility is a key aspect for many biotechnological, biomedical and industrial processes, such as the production of active proteins and antibodies. In addition, understanding the molecular determinants of the solubility of proteins may be crucial to shed light on the molecular mechanisms of diseases caused by aggregation processes such as amyloidosis. Here we present SKADE, a novel Neural Network protein solubility predictor and we show how it can provide novel insight into the protein solubility mechanisms, thanks to its neural attention architecture. First, we show that SKADE positively compares with state of the art tools while using just the protein sequence as input. Then, thanks to the neural attention mechanism, we use SKADE to investigate the patterns learned during training and we analyse its decision process. We use this peculiarity to show that, while the attention profiles do not correlate with obvious sequence aspects such as biophysical properties of the aminoacids, they suggest that N- and C-termini are the most relevant regions for solubility prediction and are predictive for complex emergent properties such as aggregation-prone regions involved in beta-amyloidosis and contact density. Moreover, SKADE is able to identify mutations that increase or decrease the overall solubility of the protein, allowing it to be used to perform large scale in-silico mutagenesis of proteins in order to maximize their solubility.

The solubility of proteins is a crucial biophysical aspect when it comes to understanding many human diseases and to improve the industrial processes for protein production. Due to its relevance, computational methods have been devised in order to study and possibly optimize the solubility of proteins. In this work we apply a deep-learning technique, called neural attention to predict protein solubility while “opening” the model itself to interpretability, even though Machine Learning models are usually considered black boxes. Thank to the attention mechanism, we show that i) our model implicitly learns complex patterns related to emergent, protein folding-related, aspects such as to recognize β-amyloidosis regions and that ii) the N-and C-termini are the regions with the highes signal fro solubility prediction. When it comes to enhancing the solubility of proteins, we, for the first time, propose to investigate the synergistic effects of tandem mutations instead of “single” mutations, suggesting that this could minimize the number of required proposed mutations.

Collapse

Stepwise optimization of recombinant protein production in Escherichia coli utilizing computational and experimental approaches. Appl Microbiol Biotechnol 2020;104:3253-3266. [DOI: 10.1007/s00253-020-10454-w] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2019] [Revised: 01/28/2020] [Accepted: 02/07/2020] [Indexed: 12/14/2022]

In Silico Study of Different Signal Peptides to Express Recombinant Glutamate Decarboxylase in the Outer Membrane of Escherichia coli. Int J Pept Res Ther 2019. [DOI: 10.1007/s10989-019-09986-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

Han X, Zhang L, Zhou K, Wang X. ProGAN: Protein solubility generative adversarial nets for data augmentation in DNN framework. Comput Chem Eng 2019. [DOI: 10.1016/j.compchemeng.2019.106533] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]

Sadeghian-Rizi T, Ebrahimi A, Moazzen F, Yousefian H, Jahanian-Najafabadi A. Improvement of solubility and yield of recombinant protein expression in E. coli using a two-step system. Res Pharm Sci 2019;14:400-407. [PMID: 31798656 PMCID: PMC6827196 DOI: 10.4103/1735-5362.268200] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open

Khurana S, Rawi R, Kunji K, Chuang GY, Bensmail H, Mall R. DeepSol: a deep learning framework for sequence-based protein solubility prediction. Bioinformatics 2019;34:2605-2613. [PMID: 29554211 DOI: 10.1093/bioinformatics/bty166] [Citation(s) in RCA: 97] [Impact Index Per Article: 19.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2017] [Accepted: 03/13/2018] [Indexed: 01/09/2023] Open

Ghaheh HS, Ganjalikhany MR, Yaghmaei P, Pourfarzam M, Mir Mohammad Sadeghi H. Improving the solubility, activity, and stability of reteplase using in silico design of new variants. Res Pharm Sci 2019;14:359-368. [PMID: 31516513 PMCID: PMC6714118 DOI: 10.4103/1735-5362.263560] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022] Open

In silico analysis of different signal peptides for the secretory production of recombinant human keratinocyte growth factor in Escherichia coli. Comput Biol Chem 2019;80:225-233. [DOI: 10.1016/j.compbiolchem.2019.03.003] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2018] [Revised: 01/23/2019] [Accepted: 03/11/2019] [Indexed: 12/31/2022]

Amin SA, Endalur Gopinarayanan V, Nair NU, Hassoun S. Establishing synthesis pathway-host compatibility via enzyme solubility. Biotechnol Bioeng 2019;116:1405-1416. [PMID: 30802311 DOI: 10.1002/bit.26959] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2018] [Revised: 12/18/2018] [Accepted: 02/21/2019] [Indexed: 12/12/2022]

de Marco A, Ferrer-Miralles N, Garcia-Fruitós E, Mitraki A, Peternel S, Rinas U, Trujillo-Roldán MA, Valdez-Cruz NA, Vázquez E, Villaverde A. Bacterial inclusion bodies are industrially exploitable amyloids. FEMS Microbiol Rev 2019;43:53-72. [PMID: 30357330 DOI: 10.1093/femsre/fuy038] [Citation(s) in RCA: 66] [Impact Index Per Article: 13.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2018] [Accepted: 10/23/2018] [Indexed: 12/13/2022] Open

Affiliation(s)

Ario de Marco Laboratory for Environmental and Life Sciences, University of Nova Gorica, Vipavska Cesta 13, 5000 Nova Gorica, Slovenia
Neus Ferrer-Miralles Institut de Biotecnologia i de Biomedicina (IBB), Carrer de la Vall Moronta s/n, Universitat Autònoma de Barcelona, 08193 Cerdanyola del Vallès, Spain.,Departament de Genètica i de Microbiologia, Carrer de la Vall Moronta s/n, Universitat Autònoma de Barcelona, 08193 Cerdanyola del Vallès, Spain.,CIBER de Bioingeniería, Biomateriales y Nanomedicina (CIBER-BBN), Carrer de la Vall Moronta s/n, 08193 Cerdanyola del Vallès, Spain
Elena Garcia-Fruitós Department of Ruminant Production, Institut de Recerca i Tecnologia Agroalimentàries (IRTA), Torre Marimon, 08140 Caldes de Montbui, Barcelona, Spain
Anna Mitraki Department of Materials Science and Technology, University of Crete, Vassilika Vouton, 70013 Heraklion, Crete, Greece.,Institute of Electronic Structure and Laser (IESL), Foundation for Research and Technology Hellas (FORTH), N. Plastira 100, Vassilika Vouton, 70013 Heraklion, Crete, Greece
Spela Peternel Lupinica, Alešovceva 6, 1000 Ljubljana, Slovenia
Ursula Rinas Leibniz University of Hannover, Technical Chemistry and Life Science, 30167 Hannover, Germany.,Helmholtz Centre for Infection Research, 38124 Braunschweig, Germany
Mauricio A Trujillo-Roldán Programa de Investigación de Producción de Biomoléculas, Unidad de Bioprocesos, Departamento de Biología Molecular y Biotecnología, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, 04510 Ciudad de México, México
Norma A Valdez-Cruz Programa de Investigación de Producción de Biomoléculas, Departamento de Biología Molecular y Biotecnología, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, 04510 Ciudad de México, México
Esther Vázquez Institut de Biotecnologia i de Biomedicina (IBB), Carrer de la Vall Moronta s/n, Universitat Autònoma de Barcelona, 08193 Cerdanyola del Vallès, Spain.,Departament de Genètica i de Microbiologia, Carrer de la Vall Moronta s/n, Universitat Autònoma de Barcelona, 08193 Cerdanyola del Vallès, Spain.,CIBER de Bioingeniería, Biomateriales y Nanomedicina (CIBER-BBN), Carrer de la Vall Moronta s/n, 08193 Cerdanyola del Vallès, Spain
Antonio Villaverde Institut de Biotecnologia i de Biomedicina (IBB), Carrer de la Vall Moronta s/n, Universitat Autònoma de Barcelona, 08193 Cerdanyola del Vallès, Spain.,Departament de Genètica i de Microbiologia, Carrer de la Vall Moronta s/n, Universitat Autònoma de Barcelona, 08193 Cerdanyola del Vallès, Spain.,CIBER de Bioingeniería, Biomateriales y Nanomedicina (CIBER-BBN), Carrer de la Vall Moronta s/n, 08193 Cerdanyola del Vallès, Spain

Collapse

Rawi R, Mall R, Kunji K, Shen CH, Kwong PD, Chuang GY. PaRSnIP: sequence-based protein solubility prediction using gradient boosting machine. Bioinformatics 2019;34:1092-1098. [PMID: 29069295 DOI: 10.1093/bioinformatics/btx662] [Citation(s) in RCA: 58] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2017] [Accepted: 10/17/2017] [Indexed: 11/13/2022] Open

Owji H, Hemmati S. A comprehensive in silico characterization of bacterial signal peptides for the excretory production of Anabaena variabilis phenylalanine ammonia lyase in Escherichia coli. 3 Biotech 2018;8:488. [PMID: 30498661 DOI: 10.1007/s13205-018-1517-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2018] [Accepted: 11/13/2018] [Indexed: 12/30/2022] Open

In silico Analysis of Different Signal Peptides for the Excretory Production of Recombinant NS3-GP96 Fusion Protein in Escherichia coli. Int J Pept Res Ther 2018. [DOI: 10.1007/s10989-018-9775-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]

Pellizza L, Smal C, Rodrigo G, Arán M. Codon usage clusters correlation: towards protein solubility prediction in heterologous expression systems in E. coli. Sci Rep 2018;8:10618. [PMID: 30006617 PMCID: PMC6045634 DOI: 10.1038/s41598-018-29035-z] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2018] [Accepted: 06/21/2018] [Indexed: 12/15/2022] Open

Yang Y, Liu G, Liu M, Bai Z, Liu X, Dai X, Guo W. Correlation Between Protein Primary Structure and Soluble Expression Level of HSA dAb in Escherichia coli. Food Technol Biotechnol 2018;56:101-109. [PMID: 29796003 DOI: 10.17113/ftb.56.01.18.5445] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Affiliation(s)

Yankun Yang The Key Laboratory of Carbohydrate Chemistry and Biotechnology, School of Biotechnology, Jiangnan University, Ministry of Education, 1800 Lihu Avenue, 214122 Wuxi, PR China.,National Engineering Laboratory for Cereal Fermentation Technology, Jiangnan University, 1800 Lihu Avenue, 214122 Wuxi, PR China
Guoqiang Liu The Key Laboratory of Carbohydrate Chemistry and Biotechnology, School of Biotechnology, Jiangnan University, Ministry of Education, 1800 Lihu Avenue, 214122 Wuxi, PR China.,National Engineering Laboratory for Cereal Fermentation Technology, Jiangnan University, 1800 Lihu Avenue, 214122 Wuxi, PR China
Meng Liu National Engineering Laboratory for Cereal Fermentation Technology, Jiangnan University, 1800 Lihu Avenue, 214122 Wuxi, PR China
Zhonghu Bai National Engineering Laboratory for Cereal Fermentation Technology, Jiangnan University, 1800 Lihu Avenue, 214122 Wuxi, PR China.,Jiangsu Provincial Research Center for Bioactive Product Processing Technology, Jiangnan University, 1800 Lihu Avenue, 214122 Wuxi, PR China
Xiuxia Liu National Engineering Laboratory for Cereal Fermentation Technology, Jiangnan University, 1800 Lihu Avenue, 214122 Wuxi, PR China.,Jiangsu Provincial Research Center for Bioactive Product Processing Technology, Jiangnan University, 1800 Lihu Avenue, 214122 Wuxi, PR China
Xiaofeng Dai National Engineering Laboratory for Cereal Fermentation Technology, Jiangnan University, 1800 Lihu Avenue, 214122 Wuxi, PR China.,Jiangsu Provincial Research Center for Bioactive Product Processing Technology, Jiangnan University, 1800 Lihu Avenue, 214122 Wuxi, PR China
Wenwen Guo Jiangsu Provincial Research Center for Bioactive Product Processing Technology, Jiangnan University, 1800 Lihu Avenue, 214122 Wuxi, PR China.,The Key Laboratory of Industrial Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, 1800 Lihu Avenue, 214122 Wuxi, PR China

Collapse

Assessment of Prokaryotic Signal Peptides for Secretion of Tumor Necrosis Factor Related Apoptosis Inducing Ligand (TRAIL) in E. coli: An in silico Approach. JOURNAL OF PURE AND APPLIED MICROBIOLOGY 2016. [DOI: 10.22207/jpam.10.4.22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Yang Y, Wu X, Xuan H, Gao Z. Functional analysis of plant NB-LRR gene L3 by using E. coli. Biochem Biophys Res Commun 2016;478:1569-74. [PMID: 27586278 DOI: 10.1016/j.bbrc.2016.08.154] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2016] [Accepted: 08/27/2016] [Indexed: 11/19/2022]

Cong C, Yu X, He Y, Dai Y, Zhang Y, Wang M, He M. Cell-free ribosome display and selection of antibodies on arrayed antigens. Proteomics 2016;16:1291-6. [PMID: 26899874 DOI: 10.1002/pmic.201500412] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2015] [Revised: 01/05/2016] [Accepted: 02/16/2016] [Indexed: 11/09/2022]

Chang CCH, Li C, Webb GI, Tey B, Song J, Ramanan RN. Periscope: quantitative prediction of soluble protein expression in the periplasm of Escherichia coli. Sci Rep 2016;6:21844. [PMID: 26931649 PMCID: PMC4773868 DOI: 10.1038/srep21844] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2015] [Accepted: 01/28/2016] [Indexed: 12/20/2022] Open

Li C, Ching Han Chang C, Nagel J, Porebski BT, Hayashida M, Akutsu T, Song J, Buckle AM. Critical evaluation of in silico methods for prediction of coiled-coil domains in proteins. Brief Bioinform 2016;17:270-82. [PMID: 26177815 PMCID: PMC6078162 DOI: 10.1093/bib/bbv047] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2015] [Revised: 05/29/2015] [Indexed: 12/19/2022] Open

Soluble expression and stability enhancement of transcription factors using 30Kc19 cell-penetrating protein. Appl Microbiol Biotechnol 2015;100:3523-32. [DOI: 10.1007/s00253-015-7199-4] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2015] [Revised: 11/15/2015] [Accepted: 11/23/2015] [Indexed: 12/20/2022]

Habibi N, Norouzi A, Mohd Hashim SZ, Shamsir MS, Samian R. Prediction of recombinant protein overexpression in Escherichia coli using a machine learning based model (RPOLP). Comput Biol Med 2015;66:330-6. [DOI: 10.1016/j.compbiomed.2015.09.015] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2015] [Revised: 09/18/2015] [Accepted: 09/19/2015] [Indexed: 01/28/2023]

Zamani M, Nezafat N, Negahdaripour M, Dabbagh F, Ghasemi Y. In Silico Evaluation of Different Signal Peptides for the Secretory Production of Human Growth Hormone in E. coli. Int J Pept Res Ther 2015. [DOI: 10.1007/s10989-015-9454-z] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Wang H, Wang M, Tan H, Li Y, Zhang Z, Song J. PredPPCrys: accurate prediction of sequence cloning, protein production, purification and crystallization propensity from protein sequences using multi-step heterogeneous feature fusion and selection. PLoS One 2014;9:e105902. [PMID: 25148528 PMCID: PMC4141844 DOI: 10.1371/journal.pone.0105902] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2014] [Accepted: 07/25/2014] [Indexed: 01/14/2023] Open

Abstract

X-ray crystallography is the primary approach to solve the three-dimensional structure of a protein. However, a major bottleneck of this method is the failure of multi-step experimental procedures to yield diffraction-quality crystals, including sequence cloning, protein material production, purification, crystallization and ultimately, structural determination. Accordingly, prediction of the propensity of a protein to successfully undergo these experimental procedures based on the protein sequence may help narrow down laborious experimental efforts and facilitate target selection. A number of bioinformatics methods based on protein sequence information have been developed for this purpose. However, our knowledge on the important determinants of propensity for a protein sequence to produce high diffraction-quality crystals remains largely incomplete. In practice, most of the existing methods display poorer performance when evaluated on larger and updated datasets. To address this problem, we constructed an up-to-date dataset as the benchmark, and subsequently developed a new approach termed ‘PredPPCrys’ using the support vector machine (SVM). Using a comprehensive set of multifaceted sequence-derived features in combination with a novel multi-step feature selection strategy, we identified and characterized the relative importance and contribution of each feature type to the prediction performance of five individual experimental steps required for successful crystallization. The resulting optimal candidate features were used as inputs to build the first-level SVM predictor (PredPPCrys I). Next, prediction outputs of PredPPCrys I were used as the input to build second-level SVM classifiers (PredPPCrys II), which led to significantly enhanced prediction performance. Benchmarking experiments indicated that our PredPPCrys method outperforms most existing procedures on both up-to-date and previous datasets. In addition, the predicted crystallization targets of currently non-crystallizable proteins were provided as compendium data, which are anticipated to facilitate target selection and design for the worldwide structural genomics consortium. PredPPCrys is freely available at http://www.structbioinfor.org/PredPPCrys.

Collapse