1
|
Razaghi-Moghadam Z, Soleymani Babadi F, Nikoloski Z. Harnessing the optimization of enzyme catalytic rates in engineering of metabolic phenotypes. PLoS Comput Biol 2024; 20:e1012576. [PMID: 39495797 PMCID: PMC11563432 DOI: 10.1371/journal.pcbi.1012576] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2024] [Revised: 11/14/2024] [Accepted: 10/21/2024] [Indexed: 11/06/2024] Open
Abstract
The increasing availability of enzyme turnover number measurements from experiments and of turnover number predictions from deep learning models prompts the use of these enzyme parameters in precise metabolic engineering. Yet, there is no computational approach that allows the prediction of metabolic engineering strategies that rely on the modification of turnover numbers. It is also unclear if modifications of turnover numbers without alterations in the host's transcriptional regulatory machinery suffice to increase the production of chemicals of interest. Here, we present a constraint-based modeling approach, termed Overcoming Kinetic rate Obstacles (OKO), that uses enzyme-constrained metabolic models to predict in silico strategies to increase the production of a given chemical, while ensuring specified cell growth. We demonstrate that the application of OKO to enzyme-constrained metabolic models of Escherichia coli and Saccharomyces cerevisiae results in strategies that can at least double the production of over 40 compounds with little penalty to growth. Interestingly, we show that the overproduction of compounds of interest does not entail only an increase in the values of turnover numbers. Lastly, we demonstrate that a refinement of OKO, allowing also for manipulation of enzyme abundance, facilitates the usage of the available compendia and deep learning models of turnover numbers in the design of precise metabolic engineering strategies. Our results expand the usage of genome-scale metabolic models toward the identification of targets for protein engineering, allowing their direct usage in the generation of innovative metabolic engineering designs for various biotechnological applications.
Collapse
Affiliation(s)
- Zahra Razaghi-Moghadam
- Systems Biology and Mathematical Modeling Group, Max Planck Institute of Molecular Plant Physiology, Potsdam, Germany
- Bioinformatics Department, Institute of Biochemistry and Biology, University of Potsdam, Potsdam, Germany
| | - Fayaz Soleymani Babadi
- Systems Biology and Mathematical Modeling Group, Max Planck Institute of Molecular Plant Physiology, Potsdam, Germany
- Bioinformatics Department, Institute of Biochemistry and Biology, University of Potsdam, Potsdam, Germany
| | - Zoran Nikoloski
- Systems Biology and Mathematical Modeling Group, Max Planck Institute of Molecular Plant Physiology, Potsdam, Germany
- Bioinformatics Department, Institute of Biochemistry and Biology, University of Potsdam, Potsdam, Germany
| |
Collapse
|
2
|
Alazmi M. Enzyme catalytic efficiency prediction: employing convolutional neural networks and XGBoost. Front Artif Intell 2024; 7:1446063. [PMID: 39498388 PMCID: PMC11532030 DOI: 10.3389/frai.2024.1446063] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2024] [Accepted: 10/07/2024] [Indexed: 11/07/2024] Open
Abstract
Introduction In the intricate realm of enzymology, the precise quantification of enzyme efficiency, epitomized by the turnover number (k cat), is a paramount yet elusive objective. Existing methodologies, though sophisticated, often grapple with the inherent stochasticity and multifaceted nature of enzymatic reactions. Thus, there arises a necessity to explore avant-garde computational paradigms. Methods In this context, we introduce "enzyme catalytic efficiency prediction (ECEP)," leveraging advanced deep learning techniques to enhance the previous implementation, TurNuP, for predicting the enzyme catalase k cat. Our approach significantly outperforms prior methodologies, incorporating new features derived from enzyme sequences and chemical reaction dynamics. Through ECEP, we unravel the intricate enzyme-substrate interactions, capturing the nuanced interplay of molecular determinants. Results Preliminary assessments, compared against established models like TurNuP and DLKcat, underscore the superior predictive capabilities of ECEP, marking a pivotal shift in silico enzymatic turnover number estimation. This study enriches the computational toolkit available to enzymologists and lays the groundwork for future explorations in the burgeoning field of bioinformatics. This paper suggested a multi-feature ensemble deep learning-based approach to predict enzyme kinetic parameters using an ensemble convolution neural network and XGBoost by calculating weighted-average of each feature-based model's output to outperform traditional machine learning methods. The proposed "ECEP" model significantly outperformed existing methodologies, achieving a mean squared error (MSE) reduction of 0.35 from 0.81 to 0.46 and R-squared score from 0.44 to 0.54, thereby demonstrating its superior accuracy and effectiveness in enzyme catalytic efficiency prediction. Discussion This improvement underscores the model's potential to enhance the field of bioinformatics, setting a new benchmark for performance.
Collapse
Affiliation(s)
- Meshari Alazmi
- College of Computer Science and Engineering, University of Ha’il, Ha’il, Saudi Arabia
| |
Collapse
|
3
|
Ferreira MADM, Silveira WBD, Nikoloski Z. Protein constraints in genome-scale metabolic models: Data integration, parameter estimation, and prediction of metabolic phenotypes. Biotechnol Bioeng 2024; 121:915-930. [PMID: 38178617 DOI: 10.1002/bit.28650] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Revised: 10/24/2023] [Accepted: 12/18/2023] [Indexed: 01/06/2024]
Abstract
Genome-scale metabolic models provide a valuable resource to study metabolism and cell physiology. These models are employed with approaches from the constraint-based modeling framework to predict metabolic and physiological phenotypes. The prediction performance of genome-scale metabolic models can be improved by including protein constraints. The resulting protein-constrained models consider data on turnover numbers (kcat ) and facilitate the integration of protein abundances. In this systematic review, we present and discuss the current state-of-the-art regarding the estimation of kinetic parameters used in protein-constrained models. We also highlight how data-driven and constraint-based approaches can aid the estimation of turnover numbers and their usage in improving predictions of cellular phenotypes. Finally, we identify standing challenges in protein-constrained metabolic models and provide a perspective regarding future approaches to improve the predictive performance.
Collapse
Affiliation(s)
| | | | - Zoran Nikoloski
- Bioinformatics, Institute of Biochemistry and Biology, University of Potsdam, Potsdam, Germany
- Systems Biology and Mathematical Modeling, Max Planck Institute of Molecular Plant Physiology, Potsdam, Germany
| |
Collapse
|
4
|
Kugler A, Stensjö K. Machine learning predicts system-wide metabolic flux control in cyanobacteria. Metab Eng 2024; 82:171-182. [PMID: 38395194 DOI: 10.1016/j.ymben.2024.02.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2023] [Revised: 02/14/2024] [Accepted: 02/20/2024] [Indexed: 02/25/2024]
Abstract
Metabolic fluxes and their control mechanisms are fundamental in cellular metabolism, offering insights for the study of biological systems and biotechnological applications. However, quantitative and predictive understanding of controlling biochemical reactions in microbial cell factories, especially at the system level, is limited. In this work, we present ARCTICA, a computational framework that integrates constraint-based modelling with machine learning tools to address this challenge. Using the model cyanobacterium Synechocystis sp. PCC 6803 as chassis, we demonstrate that ARCTICA effectively simulates global-scale metabolic flux control. Key findings are that (i) the photosynthetic bioproduction is mainly governed by enzymes within the Calvin-Benson-Bassham (CBB) cycle, rather than by those involve in the biosynthesis of the end-product, (ii) the catalytic capacity of the CBB cycle limits the photosynthetic activity and downstream pathways and (iii) ribulose-1,5-bisphosphate carboxylase/oxygenase (RuBisCO) is a major, but not the most, limiting step within the CBB cycle. Predicted metabolic reactions qualitatively align with prior experimental observations, validating our modelling approach. ARCTICA serves as a valuable pipeline for understanding cellular physiology and predicting rate-limiting steps in genome-scale metabolic networks, and thus provides guidance for bioengineering of cyanobacteria.
Collapse
Affiliation(s)
- Amit Kugler
- Microbial Chemistry, Department of Chemistry-Ångström Laboratory, Uppsala University, Box 523, SE-751 20, Uppsala, Sweden
| | - Karin Stensjö
- Microbial Chemistry, Department of Chemistry-Ångström Laboratory, Uppsala University, Box 523, SE-751 20, Uppsala, Sweden.
| |
Collapse
|
5
|
Han Y, Tafur Rangel A, Pomraning KR, Kerkhoven EJ, Kim J. Advances in genome-scale metabolic models of industrially important fungi. Curr Opin Biotechnol 2023; 84:103005. [PMID: 37797483 DOI: 10.1016/j.copbio.2023.103005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Revised: 09/04/2023] [Accepted: 09/05/2023] [Indexed: 10/07/2023]
Abstract
Many fungal species have been used industrially for production of biofuels and bioproducts. Developing strains with better performance in biomanufacturing contexts requires a systematic understanding of cellular metabolism. Genome-scale metabolic models (GEMs) offer a comprehensive view of interconnected pathways and a mathematical framework for downstream analysis. Recently, GEMs have been developed or updated for several industrially important fungi. Some of them incorporate enzyme constraints, enabling improved predictions of cell states and proteome allocation. Here, we provide an overview of these newly developed GEMs and computational methods that facilitate construction of enzyme-constrained GEMs and utilize flux predictions from GEMs. Furthermore, we highlight the pivotal roles of these GEMs in iterative design-build-test-learn cycles, ultimately advancing the field of fungal biomanufacturing.
Collapse
Affiliation(s)
- Yichao Han
- Energy and Environment Directorate, Pacific Northwest National Laboratory, Richland, WA, USA; Agile BioFoundry, Department of Energy, Emeryville, CA, USA
| | - Albert Tafur Rangel
- Department of Life Sciences, Chalmers University of Technology, SE-412 96 Gothenburg, Sweden; Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, DK-2800 Kgs. Lyngby, Denmark
| | - Kyle R Pomraning
- Energy and Environment Directorate, Pacific Northwest National Laboratory, Richland, WA, USA; Agile BioFoundry, Department of Energy, Emeryville, CA, USA
| | - Eduard J Kerkhoven
- Department of Life Sciences, Chalmers University of Technology, SE-412 96 Gothenburg, Sweden; Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, DK-2800 Kgs. Lyngby, Denmark; SciLifeLab, Chalmers University of Technology, SE-412 96 Gothenburg, Sweden
| | - Joonhoon Kim
- Energy and Environment Directorate, Pacific Northwest National Laboratory, Richland, WA, USA; Agile BioFoundry, Department of Energy, Emeryville, CA, USA; Joint BioEnergy Institute, Department of Energy, Emeryville, CA, USA.
| |
Collapse
|
6
|
Qiu S, Zhao S, Yang A. DLTKcat: deep learning-based prediction of temperature-dependent enzyme turnover rates. Brief Bioinform 2023; 25:bbad506. [PMID: 38189538 PMCID: PMC10772988 DOI: 10.1093/bib/bbad506] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Revised: 11/29/2023] [Accepted: 12/08/2023] [Indexed: 01/09/2024] Open
Abstract
The enzyme turnover rate, ${k}_{cat}$, quantifies enzyme kinetics by indicating the maximum efficiency of enzyme catalysis. Despite its importance, ${k}_{cat}$ values remain scarce in databases for most organisms, primarily because of the cost of experimental measurements. To predict ${k}_{cat}$ and account for its strong temperature dependence, DLTKcat was developed in this study and demonstrated superior performance (log10-scale root mean squared error = 0.88, R-squared = 0.66) than previously published models. Through two case studies, DLTKcat showed its ability to predict the effects of protein sequence mutations and temperature changes on ${k}_{cat}$ values. Although its quantitative accuracy is not high enough yet to model the responses of cellular metabolism to temperature changes, DLTKcat has the potential to eventually become a computational tool to describe the temperature dependence of biological systems.
Collapse
Affiliation(s)
- Sizhe Qiu
- Department of Engineering Science, University of Oxford, OX1 3PJ, United Kingdom
| | - Simiao Zhao
- Radcliffe Department of Medicine, University of Oxford, OX3 9DU, United Kingdom
| | - Aidong Yang
- Department of Engineering Science, University of Oxford, OX1 3PJ, United Kingdom
| |
Collapse
|
7
|
Key A, Haiman Z, Palsson BO, D’Alessandro A. Modeling Red Blood Cell Metabolism in the Omics Era. Metabolites 2023; 13:1145. [PMID: 37999241 PMCID: PMC10673375 DOI: 10.3390/metabo13111145] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2023] [Revised: 10/23/2023] [Accepted: 11/08/2023] [Indexed: 11/25/2023] Open
Abstract
Red blood cells (RBCs) are abundant (more than 80% of the total cells in the human body), yet relatively simple, as they lack nuclei and organelles, including mitochondria. Since the earliest days of biochemistry, the accessibility of blood and RBCs made them an ideal matrix for the characterization of metabolism. Because of this, investigations into RBC metabolism are of extreme relevance for research and diagnostic purposes in scientific and clinical endeavors. The relative simplicity of RBCs has made them an eligible model for the development of reconstruction maps of eukaryotic cell metabolism since the early days of systems biology. Computational models hold the potential to deepen knowledge of RBC metabolism, but also and foremost to predict in silico RBC metabolic behaviors in response to environmental stimuli. Here, we review now classic concepts on RBC metabolism, prior work in systems biology of unicellular organisms, and how this work paved the way for the development of reconstruction models of RBC metabolism. Translationally, we discuss how the fields of metabolomics and systems biology have generated evidence to advance our understanding of the RBC storage lesion, a process of decline in storage quality that impacts over a hundred million blood units transfused every year.
Collapse
Affiliation(s)
- Alicia Key
- Department of Biochemistry and Molecular Genetics, University of Colorado Denver, Anschutz Medical Campus, Aurora, CO 80045, USA;
| | - Zachary Haiman
- Department of Bioengineering, University of California, San Diego, CA 92093, USA (B.O.P.)
- Bioinformatics and Systems Biology Program, University of California, San Diego, CA 92093, USA
- Department of Pediatrics, University of California, San Diego, CA 92161, USA
| | - Bernhard O. Palsson
- Department of Bioengineering, University of California, San Diego, CA 92093, USA (B.O.P.)
- Bioinformatics and Systems Biology Program, University of California, San Diego, CA 92093, USA
- Department of Pediatrics, University of California, San Diego, CA 92161, USA
| | - Angelo D’Alessandro
- Department of Biochemistry and Molecular Genetics, University of Colorado Denver, Anschutz Medical Campus, Aurora, CO 80045, USA;
| |
Collapse
|
8
|
Arend M, Zimmer D, Xu R, Sommer F, Mühlhaus T, Nikoloski Z. Proteomics and constraint-based modelling reveal enzyme kinetic properties of Chlamydomonas reinhardtii on a genome scale. Nat Commun 2023; 14:4781. [PMID: 37553325 PMCID: PMC10409818 DOI: 10.1038/s41467-023-40498-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Accepted: 08/01/2023] [Indexed: 08/10/2023] Open
Abstract
Metabolic engineering of microalgae offers a promising solution for sustainable biofuel production, and rational design of engineering strategies can be improved by employing metabolic models that integrate enzyme turnover numbers. However, the coverage of turnover numbers for Chlamydomonas reinhardtii, a model eukaryotic microalga accessible to metabolic engineering, is 17-fold smaller compared to the heterotrophic cell factory Saccharomyces cerevisiae. Here we generate quantitative protein abundance data of Chlamydomonas covering 2337 to 3708 proteins in various growth conditions to estimate in vivo maximum apparent turnover numbers. Using constrained-based modeling we provide proxies for in vivo turnover numbers of 568 reactions, representing a 10-fold increase over the in vitro data for Chlamydomonas. Integration of the in vivo estimates instead of in vitro values in a metabolic model of Chlamydomonas improved the accuracy of enzyme usage predictions. Our results help in extending the knowledge on uncharacterized enzymes and improve biotechnological applications of Chlamydomonas.
Collapse
Affiliation(s)
- Marius Arend
- Bioinformatics, Institute of Biochemistry and Biology, University of Potsdam, 14476, Potsdam, Germany
- Systems Biology and Mathematical Modelling, Max Planck Institute of Molecular Plant Physiology, 14476, Potsdam, Germany
- Bioinformatics and Mathematical Modeling Department, Center of Plant Systems Biology and Biotechnology, 4000, Plovdiv, Bulgaria
| | - David Zimmer
- Computational Systems Biology, TU Kaiserslautern, 67663, Kaiserslautern, Germany
| | - Rudan Xu
- Bioinformatics, Institute of Biochemistry and Biology, University of Potsdam, 14476, Potsdam, Germany
- Systems Biology and Mathematical Modelling, Max Planck Institute of Molecular Plant Physiology, 14476, Potsdam, Germany
| | - Frederik Sommer
- Molecular Biotechnology & Systems Biology, TU Kaiserslautern, 67663, Kaiserslautern, Germany
| | - Timo Mühlhaus
- Computational Systems Biology, TU Kaiserslautern, 67663, Kaiserslautern, Germany
| | - Zoran Nikoloski
- Bioinformatics, Institute of Biochemistry and Biology, University of Potsdam, 14476, Potsdam, Germany.
- Systems Biology and Mathematical Modelling, Max Planck Institute of Molecular Plant Physiology, 14476, Potsdam, Germany.
- Bioinformatics and Mathematical Modeling Department, Center of Plant Systems Biology and Biotechnology, 4000, Plovdiv, Bulgaria.
| |
Collapse
|