1
|
Albayati SH, Nezhad NG, Taki AG, Rahman RNZRA. Efficient and easible biocatalysts: Strategies for enzyme improvement. A review. Int J Biol Macromol 2024; 276:133978. [PMID: 39038570 DOI: 10.1016/j.ijbiomac.2024.133978] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2024] [Revised: 06/19/2024] [Accepted: 07/16/2024] [Indexed: 07/24/2024]
Abstract
Owing to the environmental friendliness and vast advantages that enzymes offer in the biotechnology and industry fields, biocatalysts are a prolific investigation field. However, the low catalytic activity, stability, and specific selectivity of the enzyme limit the range of the reaction enzymes involved in. A comprehensive understanding of the protein structure and dynamics in terms of molecular details enables us to tackle these limitations effectively and enhance the catalytic activity by enzyme engineering or modifying the supports and solvents. Along with different strategies including computational, enzyme engineering based on DNA recombination, enzyme immobilization, additives, chemical modification, and physicochemical modification approaches can be promising for the wide spread of industrial enzyme usage. This is attributed to the successful application of biocatalysts in industrial and synthetic processes requires a system that exhibits stability, activity, and reusability in a continuous flow process, thereby reducing the production cost. The main goal of this review is to display relevant approaches for improving enzyme characteristics to overcome their industrial application.
Collapse
Affiliation(s)
- Samah Hashim Albayati
- Enzyme and Microbial Technology Research Centre, Faculty of Biotechnology and Biomolecular Sciences, Universiti Putra Malaysia, Serdang 43400, Selangor, Malaysia; Department of Microbiology, Faculty of Biotechnology and Biomolecular Sciences, Universiti Putra Malaysia, Serdang 43400, Selangor, Malaysia
| | - Nima Ghahremani Nezhad
- Enzyme and Microbial Technology Research Centre, Faculty of Biotechnology and Biomolecular Sciences, Universiti Putra Malaysia, Serdang 43400, Selangor, Malaysia; Department of Microbiology, Faculty of Biotechnology and Biomolecular Sciences, Universiti Putra Malaysia, Serdang 43400, Selangor, Malaysia
| | - Anmar Ghanim Taki
- Department of Radiology Techniques, Health and Medical Techniques College, Alnoor University, Mosul, Iraq
| | - Raja Noor Zaliha Raja Abd Rahman
- Enzyme and Microbial Technology Research Centre, Faculty of Biotechnology and Biomolecular Sciences, Universiti Putra Malaysia, Serdang 43400, Selangor, Malaysia; Department of Microbiology, Faculty of Biotechnology and Biomolecular Sciences, Universiti Putra Malaysia, Serdang 43400, Selangor, Malaysia; Institute Bioscience, Universiti Putra Malaysia, Serdang 43400, Selangor, Malaysia.
| |
Collapse
|
2
|
Umerenkov D, Nikolaev F, Shashkova TI, Strashnov PV, Sindeeva M, Shevtsov A, Ivanisenko NV, Kardymon OL. PROSTATA: a framework for protein stability assessment using transformers. Bioinformatics 2023; 39:btad671. [PMID: 37935419 PMCID: PMC10651431 DOI: 10.1093/bioinformatics/btad671] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Revised: 10/25/2023] [Accepted: 11/02/2023] [Indexed: 11/09/2023] Open
Abstract
MOTIVATION Accurate prediction of change in protein stability due to point mutations is an attractive goal that remains unachieved. Despite the high interest in this area, little consideration has been given to the transformer architecture, which is dominant in many fields of machine learning. RESULTS In this work, we introduce PROSTATA, a predictive model built in a knowledge-transfer fashion on a new curated dataset. PROSTATA demonstrates advantage over existing solutions based on neural networks. We show that the large improvement margin is due to both the architecture of the model and the quality of the new training dataset. This work opens up opportunities to develop new lightweight and accurate models for protein stability assessment. AVAILABILITY AND IMPLEMENTATION PROSTATA is available at https://github.com/AIRI-Institute/PROSTATA and https://prostata.airi.net.
Collapse
Affiliation(s)
| | | | | | - Pavel V Strashnov
- Bioinformatics Group, AIRI, Moscow 121170, Russia
- Department of Computer Design and Technology, Bauman Moscow State Technical University, Moscow 105005, Russia
| | | | - Andrey Shevtsov
- Bioinformatics Group, AIRI, Moscow 121170, Russia
- Regulatory Transcriptomics and Epigenomics Group, Institute of Bioengineering, Research Center of Biotechnology RAS, Moscow 117036, Russia
| | - Nikita V Ivanisenko
- Bioinformatics Group, AIRI, Moscow 121170, Russia
- Laboratory of Computational Proteomics, Institute of Cytology and Genetics SB RAS, Novosibirsk 630090, Russia
| | | |
Collapse
|
3
|
Chen Z, Wang X, Chen X, Huang J, Wang C, Wang J, Wang Z. Accelerating therapeutic protein design with computational approaches toward the clinical stage. Comput Struct Biotechnol J 2023; 21:2909-2926. [PMID: 38213894 PMCID: PMC10781723 DOI: 10.1016/j.csbj.2023.04.027] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2023] [Revised: 04/11/2023] [Accepted: 04/27/2023] [Indexed: 01/13/2024] Open
Abstract
Therapeutic protein, represented by antibodies, is of increasing interest in human medicine. However, clinical translation of therapeutic protein is still largely hindered by different aspects of developability, including affinity and selectivity, stability and aggregation prevention, solubility and viscosity reduction, and deimmunization. Conventional optimization of the developability with widely used methods, like display technologies and library screening approaches, is a time and cost-intensive endeavor, and the efficiency in finding suitable solutions is still not enough to meet clinical needs. In recent years, the accelerated advancement of computational methodologies has ushered in a transformative era in the field of therapeutic protein design. Owing to their remarkable capabilities in feature extraction and modeling, the integration of cutting-edge computational strategies with conventional techniques presents a promising avenue to accelerate the progression of therapeutic protein design and optimization toward clinical implementation. Here, we compared the differences between therapeutic protein and small molecules in developability and provided an overview of the computational approaches applicable to the design or optimization of therapeutic protein in several developability issues.
Collapse
Affiliation(s)
- Zhidong Chen
- Department of Pathology, The Eighth Affiliated Hospital, Sun Yat-sen University, Shenzhen 518033, China
- School of Pharmaceutical Sciences, Shenzhen Campus of Sun Yat-sen University, Shenzhen 518107, China
| | - Xinpei Wang
- School of Pharmaceutical Sciences, Shenzhen Campus of Sun Yat-sen University, Shenzhen 518107, China
| | - Xu Chen
- School of Pharmaceutical Sciences, Shenzhen Campus of Sun Yat-sen University, Shenzhen 518107, China
| | - Juyang Huang
- School of Pharmaceutical Sciences, Shenzhen Campus of Sun Yat-sen University, Shenzhen 518107, China
| | - Chenglin Wang
- Shenzhen Qiyu Biotechnology Co., Ltd, Shenzhen 518107, China
| | - Junqing Wang
- School of Pharmaceutical Sciences, Shenzhen Campus of Sun Yat-sen University, Shenzhen 518107, China
| | - Zhe Wang
- Department of Pathology, The Eighth Affiliated Hospital, Sun Yat-sen University, Shenzhen 518033, China
| |
Collapse
|
4
|
Benevenuta S, Birolo G, Sanavia T, Capriotti E, Fariselli P. Challenges in predicting stabilizing variations: An exploration. Front Mol Biosci 2023; 9:1075570. [PMID: 36685278 PMCID: PMC9849384 DOI: 10.3389/fmolb.2022.1075570] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2022] [Accepted: 12/15/2022] [Indexed: 01/06/2023] Open
Abstract
An open challenge of computational and experimental biology is understanding the impact of non-synonymous DNA variations on protein function and, subsequently, human health. The effects of these variants on protein stability can be measured as the difference in the free energy of unfolding (ΔΔG) between the mutated structure of the protein and its wild-type form. Throughout the years, bioinformaticians have developed a wide variety of tools and approaches to predict the ΔΔG. Although the performance of these tools is highly variable, overall they are less accurate in predicting ΔΔG stabilizing variations rather than the destabilizing ones. Here, we analyze the possible reasons for this difference by focusing on the relationship between experimentally-measured ΔΔG and seven protein properties on three widely-used datasets (S2648, VariBench, Ssym) and a recently introduced one (S669). These properties include protein structural information, different physical properties and statistical potentials. We found that two highly used input features, i.e., hydrophobicity and the Blosum62 substitution matrix, show a performance close to random choice when trying to separate stabilizing variants from either neutral or destabilizing ones. We then speculate that, since destabilizing variations are the most abundant class in the available datasets, the overall performance of the methods is higher when including features that improve the prediction for the destabilizing variants at the expense of the stabilizing ones. These findings highlight the need of designing predictive methods able to exploit also input features highly correlated with the stabilizing variants. New tools should also be tested on a not-artificially balanced dataset, reporting the performance on all the three classes (i.e., stabilizing, neutral and destabilizing variants) and not only the overall results.
Collapse
Affiliation(s)
| | - Giovanni Birolo
- Department of Medical Sciences, University of Torino, Torino, Italy
| | - Tiziana Sanavia
- Department of Medical Sciences, University of Torino, Torino, Italy
| | - Emidio Capriotti
- Department of Pharmacy and Biotechnology (FaBiT), University of Bologna, Bologna, Italy
| | - Piero Fariselli
- Department of Medical Sciences, University of Torino, Torino, Italy,*Correspondence: Piero Fariselli,
| |
Collapse
|
5
|
Zhang W, Wang H, Feng N, Li Y, Gu J, Wang Z. Developability assessment at early-stage discovery to enable development of antibody-derived therapeutics. Antib Ther 2022; 6:13-29. [PMID: 36683767 PMCID: PMC9847343 DOI: 10.1093/abt/tbac029] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Revised: 11/01/2022] [Accepted: 11/02/2022] [Indexed: 11/13/2022] Open
Abstract
Developability refers to the likelihood that an antibody candidate will become a manufacturable, safe and efficacious drug. Although the safety and efficacy of a drug candidate will be well considered by sponsors and regulatory agencies, developability in the narrow sense can be defined as the likelihood that an antibody candidate will go smoothly through the chemistry, manufacturing and control (CMC) process at a reasonable cost and within a reasonable timeline. Developability in this sense is the focus of this review. To lower the risk that an antibody candidate with poor developability will move to the CMC stage, the candidate's developability-related properties should be screened, assessed and optimized as early as possible. Assessment of developability at the early discovery stage should be performed in a rapid and high-throughput manner while consuming small amounts of testing materials. In addition to monoclonal antibodies, bispecific antibodies, multispecific antibodies and antibody-drug conjugates, as the derivatives of monoclonal antibodies, should also be assessed for developability. Moreover, we propose that the criterion of developability is relative: expected clinical indication, and the dosage and administration route of the antibody could affect this criterion. We also recommend a general screening process during the early discovery stage of antibody-derived therapeutics. With the advance of artificial intelligence-aided prediction of protein structures and features, computational tools can be used to predict, screen and optimize the developability of antibody candidates and greatly reduce the risk of moving a suboptimal candidate to the development stage.
Collapse
Affiliation(s)
- Weijie Zhang
- Biologicals Innovation and Discovery, WuXi Biologicals, 1951 Huifeng West Road, Fengxian District, Shanghai 201400, China
| | - Hao Wang
- Biologicals Innovation and Discovery, WuXi Biologicals, 1951 Huifeng West Road, Fengxian District, Shanghai 201400, China
| | - Nan Feng
- Biologicals Innovation and Discovery, WuXi Biologicals, 1951 Huifeng West Road, Fengxian District, Shanghai 201400, China
| | - Yifeng Li
- Technology and Process Development, WuXi Biologicals, 288 Fute Zhong Road, Waigaoqiao Free Trade Zone, Shanghai 200131, China
| | - Jijie Gu
- Biologicals Innovation and Discovery, WuXi Biologicals, 1951 Huifeng West Road, Fengxian District, Shanghai 201400, China
| | - Zhuozhi Wang
- To whom correspondence should be addressed. Biologics Innovation and Discovery, WuXi Biologicals, 1951 Huifeng West Road, Fengxian District, Shanghai 201400, China, Phone number: +86-21-50518899
| |
Collapse
|
6
|
Construction of Mitochondrial Protection and Monitoring Model of Lon Protease Based on Machine Learning under Myocardial Ischemia Environment. JOURNAL OF ENVIRONMENTAL AND PUBLIC HEALTH 2022; 2022:4805009. [PMID: 36254306 PMCID: PMC9569194 DOI: 10.1155/2022/4805009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/20/2022] [Revised: 09/17/2022] [Accepted: 09/21/2022] [Indexed: 11/24/2022]
Abstract
The localization of a protein's submitochondrial structure is important for therapeutic design of associated disorders caused by mitochondrial abnormalities because many human diseases are directly tied to mitochondria. When Lon protease expression changes, glycolysis replaces respiratory metabolism in the cell, which is a common occurrence in cancer cells. The fact that protein formation is a dynamic research object makes it impossible to reproduce the unique living environment of proteins in an experimental setting, which surely makes it more challenging to determine protein function through experiments. This research suggests a model of Lon protease-based mitochondrial protection under myocardial ischemia based on ML (machine learning). To ensure the balance of all submitochondrial proteins, the data set is processed using a random oversampling method, each overlapping fixed-length subsequence that is created from the protein sequence functions as a channel in the convolution layer. The results demonstrate that applying the oversampling strategy increases the ROC value by 17.6%-21.3%. Our prediction method is successful as evidenced by the fact that ML prediction outperforms the predictions of other conventional classifiers.
Collapse
|
7
|
Miralavy I, Bricco AR, Gilad AA, Banzhaf W. Using genetic programming to predict and optimize protein function. PEERJ PHYSICAL CHEMISTRY 2022. [DOI: 10.7717/peerj-pchem.24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Protein engineers conventionally use tools such as Directed Evolution to find new proteins with better functionalities and traits. More recently, computational techniques and especially machine learning approaches have been recruited to assist Directed Evolution, showing promising results. In this article, we propose POET, a computational Genetic Programming tool based on evolutionary computation methods to enhance screening and mutagenesis in Directed Evolution and help protein engineers to find proteins that have better functionality. As a proof-of-concept, we use peptides that generate MRI contrast detected by the Chemical Exchange Saturation Transfer contrast mechanism. The evolutionary methods used in POET are described, and the performance of POET in different epochs of our experiments with Chemical Exchange Saturation Transfer contrast are studied. Our results indicate that a computational modeling tool like POET can help to find peptides with 400% better functionality than used before.
Collapse
Affiliation(s)
- Iliya Miralavy
- BEACON Center of Evolution in Action, Michigan State University, East Lansing, MI, United States of America
- Department of Computer Science and Engineering, Michigan State University, East Lansing, MI, United States of America
| | - Alexander R. Bricco
- BEACON Center of Evolution in Action, Michigan State University, East Lansing, MI, United States of America
- Department of Biomedical Engineering, Michigan State University, East Lansing, MI, United States of America
| | - Assaf A. Gilad
- BEACON Center of Evolution in Action, Michigan State University, East Lansing, MI, United States of America
- Department of Chemical Engineering, Michigan State University, East Lansing, MI, United States of America
| | - Wolfgang Banzhaf
- BEACON Center of Evolution in Action, Michigan State University, East Lansing, MI, United States of America
- Department of Computer Science and Engineering, Michigan State University, East Lansing, MI, United States of America
- Department of Computer Science, Memorial University of Newfoundland, St. John’s, Newfoundland and Labrador, Canada
| |
Collapse
|
8
|
Horne J, Shukla D. Recent Advances in Machine Learning Variant Effect Prediction Tools for Protein Engineering. Ind Eng Chem Res 2022; 61:6235-6245. [DOI: 10.1021/acs.iecr.1c04943] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Affiliation(s)
- Jesse Horne
- Department of Chemical and Biomolecular Engineering, University of Illinois Urbana−Champaign, Champaign, Illinois 61801, United States
| | - Diwakar Shukla
- Department of Chemical and Biomolecular Engineering, University of Illinois Urbana−Champaign, Champaign, Illinois 61801, United States
- Department of Bioengineering, University of Illinois Urbana−Champaign, Champaign, Illinois 61801, United States
- Department of Plant Biology, University of Illinois Urbana−Champaign, Champaign, Illinois 61801, United States
- Cancer Center at Illinois, University of Illinois Urbana−Champaign, Champaign, Illinois 61801, United States
- Center for Biophysics and Quantitative Biology, University of Illinois Urbana−Champaign, Champaign, Illinois 61801, United States
| |
Collapse
|
9
|
Kunka A, Lacko D, Stourac J, Damborsky J, Prokop Z, Mazurenko S. CalFitter 2.0: Leveraging the power of singular value decomposition to analyse protein thermostability. Nucleic Acids Res 2022; 50:W145-W151. [PMID: 35580052 PMCID: PMC9252748 DOI: 10.1093/nar/gkac378] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Revised: 04/20/2022] [Accepted: 05/14/2022] [Indexed: 11/24/2022] Open
Abstract
The importance of the quantitative description of protein unfolding and aggregation for the rational design of stability or understanding the molecular basis of protein misfolding diseases is well established. Protein thermostability is typically assessed by calorimetric or spectroscopic techniques that monitor different complementary signals during unfolding. The CalFitter webserver has already proved integral to deriving invaluable energy parameters by global data analysis. Here, we introduce CalFitter 2.0, which newly incorporates singular value decomposition (SVD) of multi-wavelength spectral datasets into the global fitting pipeline. Processed time- or temperature-evolved SVD components can now be fitted together with other experimental data types. Moreover, deconvoluted basis spectra provide spectral fingerprints of relevant macrostates populated during unfolding, which greatly enriches the information gains of the CalFitter output. The SVD analysis is fully automated in a highly interactive module, providing access to the results to users without any prior knowledge of the underlying mathematics. Additionally, a novel data uploading wizard has been implemented to facilitate rapid and easy uploading of multiple datasets. Together, the newly introduced changes significantly improve the user experience, making this software a unique, robust, and interactive platform for the analysis of protein thermal denaturation data. The webserver is freely accessible at https://loschmidt.chemi.muni.cz/calfitter.
Collapse
Affiliation(s)
- Antonin Kunka
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic.,International Centre for Clinical Research, St. Anne's University Hospital Brno, Brno, Czech Republic
| | - David Lacko
- Faculty of Information Technology, Brno University of Technology, Brno, Czech Republic
| | - Jan Stourac
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic.,International Centre for Clinical Research, St. Anne's University Hospital Brno, Brno, Czech Republic
| | - Jiri Damborsky
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic.,International Centre for Clinical Research, St. Anne's University Hospital Brno, Brno, Czech Republic
| | - Zbynek Prokop
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic.,International Centre for Clinical Research, St. Anne's University Hospital Brno, Brno, Czech Republic
| | - Stanislav Mazurenko
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic.,International Centre for Clinical Research, St. Anne's University Hospital Brno, Brno, Czech Republic
| |
Collapse
|
10
|
Derat E, Kamerlin SCL. Computational Advances in Protein Engineering and Enzyme Design. J Phys Chem B 2022; 126:2449-2451. [PMID: 35387452 DOI: 10.1021/acs.jpcb.2c01198] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Affiliation(s)
- Etienne Derat
- Institut Parisien de Chimie Moléculaire, UMR 8232 CNRS, Sorbonne Université, 75005 Paris, France
| | | |
Collapse
|
11
|
Karthikeyan A, Priyakumar UD. Artificial intelligence: machine learning for chemical sciences. J CHEM SCI 2021; 134:2. [PMID: 34955617 PMCID: PMC8691161 DOI: 10.1007/s12039-021-01995-2] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2021] [Revised: 09/08/2021] [Accepted: 09/14/2021] [Indexed: 12/05/2022]
Abstract
Research in molecular sciences witnessed the rise and fall of Artificial Intelligence (AI)/ Machine Learning (ML) methods, especially artificial neural networks, few decades ago. However, we see a major resurgence in the use of modern ML methods in scientific research during the last few years. These methods have had phenomenal success in the areas of computer vision, speech recognition, natural language processing (NLP), etc. This has inspired chemists and biologists to apply these algorithms to problems in natural sciences. Availability of high performance Graphics Processing Unit (GPU) accelerators, large datasets, new algorithms, and libraries has enabled this surge. ML algorithms have successfully been applied to various domains in molecular sciences by providing much faster and sometimes more accurate solutions compared to traditional methods like Quantum Mechanical (QM) calculations, Density Functional Theory (DFT) or Molecular Mechanics (MM) based methods, etc. Some of the areas where the potential of ML methods are shown to be effective are in drug design, prediction of high-level quantum mechanical energies, molecular design, molecular dynamics materials, and retrosynthesis of organic compounds, etc. This article intends to conceptually introduce various modern ML methods and their relevance and applications in computational natural sciences. Synopsis Recent surge in the application of machine learning (ML) methods in fundamental sciences has led to a perspective that these methods may become important tools in chemical science. This perspective provides an overview of the modern ML methods and their successful applications in chemistry during the last few years.
Collapse
Affiliation(s)
- Akshaya Karthikeyan
- Center for Computational Natural Sciences and Bioinformatics, International Institute of Information Technology, Hyderabad, 500 032 India
| | - U Deva Priyakumar
- Center for Computational Natural Sciences and Bioinformatics, International Institute of Information Technology, Hyderabad, 500 032 India
| |
Collapse
|