1
|
Chen Y, Gustafsson J, Tafur Rangel A, Anton M, Domenzain I, Kittikunapong C, Li F, Yuan L, Nielsen J, Kerkhoven EJ. Reconstruction, simulation and analysis of enzyme-constrained metabolic models using GECKO Toolbox 3.0. Nat Protoc 2024; 19:629-667. [PMID: 38238583 DOI: 10.1038/s41596-023-00931-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2023] [Accepted: 10/13/2023] [Indexed: 03/10/2024]
Abstract
Genome-scale metabolic models (GEMs) are computational representations that enable mathematical exploration of metabolic behaviors within cellular and environmental constraints. Despite their wide usage in biotechnology, biomedicine and fundamental studies, there are many phenotypes that GEMs are unable to correctly predict. GECKO is a method to improve the predictive power of a GEM by incorporating enzymatic constraints using kinetic and omics data. GECKO has enabled reconstruction of enzyme-constrained metabolic models (ecModels) for diverse organisms, which show better predictive performance than conventional GEMs. In this protocol, we describe how to use the latest version GECKO 3.0; the procedure has five stages: (1) expansion from a starting metabolic model to an ecModel structure, (2) integration of enzyme turnover numbers into the ecModel structure, (3) model tuning, (4) integration of proteomics data into the ecModel and (5) simulation and analysis of ecModels. GECKO 3.0 incorporates deep learning-predicted enzyme kinetics, paving the way for improved metabolic models for virtually any organism and cell line in the absence of experimental data. The time of running the whole protocol is organism dependent, e.g., ~5 h for yeast.
Collapse
Affiliation(s)
- Yu Chen
- Department of Life Sciences, Chalmers University of Technology, Gothenburg, Sweden
- Key Laboratory of Quantitative Synthetic Biology, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Johan Gustafsson
- Department of Life Sciences, Chalmers University of Technology, Gothenburg, Sweden
| | - Albert Tafur Rangel
- Department of Life Sciences, Chalmers University of Technology, Gothenburg, Sweden
- Novo Nordisk Foundation Center for Biosustainability, Technology University of Denmark, Lyngby, Denmark
| | - Mihail Anton
- Department of Life Sciences, National Bioinformatics Infrastructure Sweden, Science for Life Laboratory, Chalmers University of Technology, Gothenburg, Sweden
| | - Iván Domenzain
- Department of Life Sciences, Chalmers University of Technology, Gothenburg, Sweden
| | | | - Feiran Li
- Department of Life Sciences, Chalmers University of Technology, Gothenburg, Sweden
- Institute of Biopharmaceutical and Health Engineering, Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen, China
| | - Le Yuan
- Department of Life Sciences, Chalmers University of Technology, Gothenburg, Sweden
| | - Jens Nielsen
- Department of Life Sciences, Chalmers University of Technology, Gothenburg, Sweden
- BioInnovation Institute, Copenhagen, Denmark
| | - Eduard J Kerkhoven
- Department of Life Sciences, Chalmers University of Technology, Gothenburg, Sweden.
- Novo Nordisk Foundation Center for Biosustainability, Technology University of Denmark, Lyngby, Denmark.
- SciLifeLab, Chalmers University of Technology, SE-412 96, Gothenburg, Sweden.
| |
Collapse
|
2
|
Kaizu K, Takahashi K. Technologies for whole-cell modeling: Genome-wide reconstruction of a cell in silico. Dev Growth Differ 2023; 65:554-564. [PMID: 37856476 DOI: 10.1111/dgd.12897] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2022] [Revised: 09/06/2023] [Accepted: 10/14/2023] [Indexed: 10/21/2023]
Abstract
With advances in high-throughput, large-scale in vivo measurement and genome modification techniques at the single-nucleotide level, there is an increasing demand for the development of new technologies for the flexible design and control of cellular systems. Computer-aided design is a powerful tool to design new cells. Whole-cell modeling aims to integrate various cellular subsystems, determine their interactions and cooperative mechanisms, and predict comprehensive cellular behaviors by computational simulations on a genome-wide scale. It has been applied to prokaryotes, yeasts, and higher eukaryotic cells, and utilized in a wide range of applications, including production of valuable substances, drug discovery, and controlled differentiation. Whole-cell modeling, consisting of several thousand elements with diverse scales and properties, requires innovative model construction, simulation, and analysis techniques. Furthermore, whole-cell modeling has been extended to multiple scales, including high-resolution modeling at the single-nucleotide and single-amino acid levels and multicellular modeling of tissues and organs. This review presents an overview of the current state of whole-cell modeling, discusses the novel computational and experimental technologies driving it, and introduces further developments toward multihierarchical modeling on a whole-genome scale.
Collapse
Affiliation(s)
- Kazunari Kaizu
- RIKEN Center for Biosystems Dynamics Research, Osaka, Japan
| | | |
Collapse
|
3
|
Chen Y, Li F, Nielsen J. Genome-scale modeling of yeast metabolism: retrospectives and perspectives. FEMS Yeast Res 2022; 22:foac003. [PMID: 35094064 PMCID: PMC8862083 DOI: 10.1093/femsyr/foac003] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2021] [Revised: 01/06/2022] [Accepted: 01/27/2022] [Indexed: 11/30/2022] Open
Abstract
Yeasts have been widely used for production of bread, beer and wine, as well as for production of bioethanol, but they have also been designed as cell factories to produce various chemicals, advanced biofuels and recombinant proteins. To systematically understand and rationally engineer yeast metabolism, genome-scale metabolic models (GEMs) have been reconstructed for the model yeast Saccharomyces cerevisiae and nonconventional yeasts. Here, we review the historical development of yeast GEMs together with their recent applications, including metabolic flux prediction, cell factory design, culture condition optimization and multi-yeast comparative analysis. Furthermore, we present an emerging effort, namely the integration of proteome constraints into yeast GEMs, resulting in models with improved performance. At last, we discuss challenges and perspectives on the development of yeast GEMs and the integration of proteome constraints.
Collapse
Affiliation(s)
- Yu Chen
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE412 96 Gothenburg, Sweden
| | - Feiran Li
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE412 96 Gothenburg, Sweden
| | - Jens Nielsen
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE412 96 Gothenburg, Sweden
- BioInnovation Institute, DK2200 Copenhagen N, Denmark
| |
Collapse
|
4
|
Abstract
Metal ions are essential to all living cells, as they can serve as cofactors of enzymes to drive catalysis of biochemical reactions. We present a constraint-based model of yeast that relates metabolism with metal ions via enzymes. The model is able to capture responses of metabolism and gene expression upon iron depletion, suggesting that yeast cells allocate iron resource in the way abiding to optimization principles. Interestingly, the model predicts up-regulation of several iron-containing enzymes that coincide with experiments, which raises the possibility that the decrease in activity due to limited iron could be compensated by elevated enzyme abundance. Moreover, the model paves the way for guiding biosynthesis of high-value compounds (e.g., p-coumaric acid) that relies on iron-containing enzymes. Metal ions are vital to metabolism, as they can act as cofactors on enzymes and thus modulate individual enzymatic reactions. Although many enzymes have been reported to interact with metal ions, the quantitative relationships between metal ions and metabolism are lacking. Here, we reconstructed a genome-scale metabolic model of the yeast Saccharomyces cerevisiae to account for proteome constraints and enzyme cofactors such as metal ions, named CofactorYeast. The model is able to estimate abundances of metal ions binding on enzymes in cells under various conditions, which are comparable to measured metal ion contents in biomass. In addition, the model predicts distinct metabolic flux distributions in response to reduced levels of various metal ions in the medium. Specifically, the model reproduces changes upon iron deficiency in metabolic and gene expression levels, which could be interpreted by optimization principles (i.e., yeast optimizes iron utilization based on metabolic network and enzyme kinetics rather than preferentially targeting iron to specific enzymes or pathways). At last, we show the potential of using the model for understanding cell factories that harbor heterologous iron-containing enzymes to synthesize high-value compounds such as p-coumaric acid. Overall, the model demonstrates the dependence of enzymes on metal ions and links metal ions to metabolism on a genome scale.
Collapse
|
5
|
Schinn SM, Morrison C, Wei W, Zhang L, Lewis NE. Systematic evaluation of parameters for genome-scale metabolic models of cultured mammalian cells. Metab Eng 2021; 66:21-30. [PMID: 33771719 DOI: 10.1016/j.ymben.2021.03.013] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2020] [Revised: 11/25/2020] [Accepted: 03/17/2021] [Indexed: 10/21/2022]
Abstract
Genome-scale metabolic models describe cellular metabolism with mechanistic detail. Given their high complexity, such models need to be parameterized correctly to yield accurate predictions and avoid overfitting. Effective parameterization has been well-studied for microbial models, but it remains unclear for higher eukaryotes, including mammalian cells. To address this, we enumerated model parameters that describe key features of cultured mammalian cells - including cellular composition, bioprocess performance metrics, mammalian-specific pathways, and biological assumptions behind model formulation approaches. We tested these parameters by building thousands of metabolic models and evaluating their ability to predict the growth rates of a panel of phenotypically diverse Chinese Hamster Ovary cell clones. We found the following considerations to be most critical for accurate parameterization: (1) cells limit metabolic activity to maintain homeostasis, (2) cell morphology and viability change dynamically during a growth curve, and (3) cellular biomass has a particular macromolecular composition. Depending on parameterization, models predicted different metabolic phenotypes, including contrasting mechanisms of nutrient utilization and energy generation, leading to varying accuracies of growth rate predictions. Notably, accurate parameter values broadly agreed with experimental measurements. These insights will guide future investigations of mammalian metabolism.
Collapse
Affiliation(s)
- Song-Min Schinn
- Department of Pediatrics, University of California, San Diego, USA
| | - Carly Morrison
- Pfizer, Biotherapeutics Pharmaceutical Sciences, Andover, MA, USA
| | - Wei Wei
- Pfizer, Biotherapeutics Pharmaceutical Sciences, Andover, MA, USA
| | - Lin Zhang
- Pfizer, Biotherapeutics Pharmaceutical Sciences, Andover, MA, USA
| | - Nathan E Lewis
- Department of Pediatrics, University of California, San Diego, USA; Department of Bioengineering, University of California, San Diego, USA; Novo Nordisk Foundation Center for Biosustainability at UC, San Diego, USA.
| |
Collapse
|
6
|
Domenzain I, Li F, Kerkhoven EJ, Siewers V. Evaluating accessibility, usability and interoperability of genome-scale metabolic models for diverse yeasts species. FEMS Yeast Res 2021; 21:foab002. [PMID: 33428734 PMCID: PMC7943257 DOI: 10.1093/femsyr/foab002] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2020] [Accepted: 01/08/2021] [Indexed: 12/18/2022] Open
Abstract
Metabolic network reconstructions have become an important tool for probing cellular metabolism in the field of systems biology. They are used as tools for quantitative prediction but also as scaffolds for further knowledge contextualization. The yeast Saccharomyces cerevisiae was one of the first organisms for which a genome-scale metabolic model (GEM) was reconstructed, in 2003, and since then 45 metabolic models have been developed for a wide variety of relevant yeasts species. A systematic evaluation of these models revealed that-despite this long modeling history-the sequential process of tracing model files, setting them up for basic simulation purposes and comparing them across species and even different versions, is still not a generalizable task. These findings call the yeast modeling community to comply to standard practices on model development and sharing in order to make GEMs accessible and useful for a wider public.
Collapse
Affiliation(s)
- Iván Domenzain
- Department of Biology and Biological Engineering, Chalmers University of Technology, Kemivägen 10, SE-412 96, Gothenburg, Sweden
- Novo Nordisk Foundation Center for Biosustainability, Chalmers University of Technology, Kemivägen 10, SE-412 96, Gothenburg, Sweden
| | - Feiran Li
- Department of Biology and Biological Engineering, Chalmers University of Technology, Kemivägen 10, SE-412 96, Gothenburg, Sweden
- Novo Nordisk Foundation Center for Biosustainability, Chalmers University of Technology, Kemivägen 10, SE-412 96, Gothenburg, Sweden
| | - Eduard J Kerkhoven
- Department of Biology and Biological Engineering, Chalmers University of Technology, Kemivägen 10, SE-412 96, Gothenburg, Sweden
- Novo Nordisk Foundation Center for Biosustainability, Chalmers University of Technology, Kemivägen 10, SE-412 96, Gothenburg, Sweden
| | - Verena Siewers
- Department of Biology and Biological Engineering, Chalmers University of Technology, Kemivägen 10, SE-412 96, Gothenburg, Sweden
- Novo Nordisk Foundation Center for Biosustainability, Chalmers University of Technology, Kemivägen 10, SE-412 96, Gothenburg, Sweden
| |
Collapse
|
7
|
Li G, Hu Y, Jan Zrimec, Luo H, Wang H, Zelezniak A, Ji B, Nielsen J. Bayesian genome scale modelling identifies thermal determinants of yeast metabolism. Nat Commun 2021; 12:190. [PMID: 33420025 PMCID: PMC7794507 DOI: 10.1038/s41467-020-20338-2] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2020] [Accepted: 11/25/2020] [Indexed: 12/05/2022] Open
Abstract
The molecular basis of how temperature affects cell metabolism has been a long-standing question in biology, where the main obstacles are the lack of high-quality data and methods to associate temperature effects on the function of individual proteins as well as to combine them at a systems level. Here we develop and apply a Bayesian modeling approach to resolve the temperature effects in genome scale metabolic models (GEM). The approach minimizes uncertainties in enzymatic thermal parameters and greatly improves the predictive strength of the GEMs. The resulting temperature constrained yeast GEM uncovers enzymes that limit growth at superoptimal temperatures, and squalene epoxidase (ERG1) is predicted to be the most rate limiting. By replacing this single key enzyme with an ortholog from a thermotolerant yeast strain, we obtain a thermotolerant strain that outgrows the wild type, demonstrating the critical role of sterol metabolism in yeast thermosensitivity. Therefore, apart from identifying thermal determinants of cell metabolism and enabling the design of thermotolerant strains, our Bayesian GEM approach facilitates modelling of complex biological systems in the absence of high-quality data and therefore shows promise for becoming a standard tool for genome scale modeling. While temperature impacts the function of all cellular components, it’s hard to rule out how the temperature dependence of cell phenotypes emerged from the dependence of individual components. Here, the authors develop a Bayesian genome scale modelling approach to identify thermal determinants of yeast metabolism.
Collapse
Affiliation(s)
- Gang Li
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE-412 96, Gothenburg, Sweden
| | - Yating Hu
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE-412 96, Gothenburg, Sweden
| | - Jan Zrimec
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE-412 96, Gothenburg, Sweden
| | - Hao Luo
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE-412 96, Gothenburg, Sweden
| | - Hao Wang
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE-412 96, Gothenburg, Sweden.,National Bioinformatics Infrastructure Sweden, Science for Life Laboratory, Chalmers University of Technology, SE-41258, Gothenburg, Sweden.,Wallenberg Center for Molecular and Translational Medicine, University of Gothenburg, SE-41258, Gothenburg, Sweden
| | - Aleksej Zelezniak
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE-412 96, Gothenburg, Sweden.,Science for Life Laboratory, Tomtebodavägen 23a, SE-171 65, Stockholm, Sweden
| | - Boyang Ji
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE-412 96, Gothenburg, Sweden.,Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, DK-2800, Kgs. Lyngby, Denmark
| | - Jens Nielsen
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE-412 96, Gothenburg, Sweden. .,Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, DK-2800, Kgs. Lyngby, Denmark. .,BioInnovation Institute, Ole Måløes Vej 3, DK2200, Copenhagen N, Denmark.
| |
Collapse
|
8
|
A mechanism-aware and multiomic machine-learning pipeline characterizes yeast cell growth. Proc Natl Acad Sci U S A 2020; 117:18869-18879. [PMID: 32675233 DOI: 10.1073/pnas.2002959117] [Citation(s) in RCA: 54] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Metabolic modeling and machine learning are key components in the emerging next generation of systems and synthetic biology tools, targeting the genotype-phenotype-environment relationship. Rather than being used in isolation, it is becoming clear that their value is maximized when they are combined. However, the potential of integrating these two frameworks for omic data augmentation and integration is largely unexplored. We propose, rigorously assess, and compare machine-learning-based data integration techniques, combining gene expression profiles with computationally generated metabolic flux data to predict yeast cell growth. To this end, we create strain-specific metabolic models for 1,143 Saccharomyces cerevisiae mutants and we test 27 machine-learning methods, incorporating state-of-the-art feature selection and multiview learning approaches. We propose a multiview neural network using fluxomic and transcriptomic data, showing that the former increases the predictive accuracy of the latter and reveals functional patterns that are not directly deducible from gene expression alone. We test the proposed neural network on a further 86 strains generated in a different experiment, therefore verifying its robustness to an additional independent dataset. Finally, we show that introducing mechanistic flux features improves the predictions also for knockout strains whose genes were not modeled in the metabolic reconstruction. Our results thus demonstrate that fusing experimental cues with in silico models, based on known biochemistry, can contribute with disjoint information toward biologically informed and interpretable machine learning. Overall, this study provides tools for understanding and manipulating complex phenotypes, increasing both the prediction accuracy and the extent of discernible mechanistic biological insights.
Collapse
|
9
|
Chen Y, Sun Y, Liu Z, Dong F, Li Y, Wang Y. Genome-scale modeling for Bacillus coagulans to understand the metabolic characteristics. Biotechnol Bioeng 2020; 117:3545-3558. [PMID: 32648961 DOI: 10.1002/bit.27488] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2020] [Revised: 06/01/2020] [Accepted: 07/09/2020] [Indexed: 12/14/2022]
Abstract
Lactic acid is widely used in many industries, especially in the production of poly-lactic acid. Bacillus coagulans is a promising lactic acid producer in industrial fermentation due to its thermophilic property. In this study, we developed the first genome-scale metabolic model (GEM) of B. coagulans iBag597, together with an enzyme-constrained model ec-iBag597. We measured strain-specific biomass composition and integrated the data into a biomass equation. Then, we validated iBag597 against experimental data generated in this study, including amino acid requirements and carbon source utilization, showing that simulations were generally consistent with the experimental results. Subsequently, we carried out chemostats to investigate the effects of specific growth rate and culture pH on metabolism of B. coagulans. Meanwhile, we used iBag597 to estimate the intracellular metabolic fluxes for those conditions. The results showed that B. coagulans was capable of generating ATP via multiple pathways, and switched among them in response to various conditions. With ec-iBag597, we estimated the protein cost and protein efficiency for each ATP-producing pathway to investigate the switches. Our models pave the way for systems biology of B. coagulans, and our findings suggest that maintaining a proper growth rate and selecting an optimal pH are beneficial for lactate fermentation.
Collapse
Affiliation(s)
- Yu Chen
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, Shanghai, China
| | - Yan Sun
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, Shanghai, China
| | - Zhihao Liu
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, Shanghai, China
| | - Fengqing Dong
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, Shanghai, China
| | - Yuanyuan Li
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, Shanghai, China
| | - Yonghong Wang
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, Shanghai, China
| |
Collapse
|
10
|
Chanda P, Costa E, Hu J, Sukumar S, Van Hemert J, Walia R. Information Theory in Computational Biology: Where We Stand Today. ENTROPY (BASEL, SWITZERLAND) 2020; 22:E627. [PMID: 33286399 PMCID: PMC7517167 DOI: 10.3390/e22060627] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Revised: 05/31/2020] [Accepted: 06/03/2020] [Indexed: 12/30/2022]
Abstract
"A Mathematical Theory of Communication" was published in 1948 by Claude Shannon to address the problems in the field of data compression and communication over (noisy) communication channels. Since then, the concepts and ideas developed in Shannon's work have formed the basis of information theory, a cornerstone of statistical learning and inference, and has been playing a key role in disciplines such as physics and thermodynamics, probability and statistics, computational sciences and biological sciences. In this article we review the basic information theory based concepts and describe their key applications in multiple major areas of research in computational biology-gene expression and transcriptomics, alignment-free sequence comparison, sequencing and error correction, genome-wide disease-gene association mapping, metabolic networks and metabolomics, and protein sequence, structure and interaction analysis.
Collapse
Affiliation(s)
- Pritam Chanda
- Corteva Agriscience™, Indianapolis, IN 46268, USA
- Computer and Information Science, Indiana University-Purdue University, Indianapolis, IN 46202, USA
| | - Eduardo Costa
- Corteva Agriscience™, Mogi Mirim, Sao Paulo 13801-540, Brazil
| | - Jie Hu
- Corteva Agriscience™, Indianapolis, IN 46268, USA
| | | | | | - Rasna Walia
- Corteva Agriscience™, Johnston, IA 50131, USA
| |
Collapse
|