1
|
Stapor P, Schmiester L, Wierling C, Merkt S, Pathirana D, Lange BMH, Weindl D, Hasenauer J. Mini-batch optimization enables training of ODE models on large-scale datasets. Nat Commun 2022; 13:34. [PMID: 35013141 PMCID: PMC8748893 DOI: 10.1038/s41467-021-27374-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2019] [Accepted: 11/11/2021] [Indexed: 11/09/2022] Open
Abstract
Quantitative dynamic models are widely used to study cellular signal processing. A critical step in modelling is the estimation of unknown model parameters from experimental data. As model sizes and datasets are steadily growing, established parameter optimization approaches for mechanistic models become computationally extremely challenging. Mini-batch optimization methods, as employed in deep learning, have better scaling properties. In this work, we adapt, apply, and benchmark mini-batch optimization for ordinary differential equation (ODE) models, thereby establishing a direct link between dynamic modelling and machine learning. On our main application example, a large-scale model of cancer signaling, we benchmark mini-batch optimization against established methods, achieving better optimization results and reducing computation by more than an order of magnitude. We expect that our work will serve as a first step towards mini-batch optimization tailored to ODE models and enable modelling of even larger and more complex systems than what is currently possible.
Collapse
Affiliation(s)
- Paul Stapor
- Helmholtz Zentrum München - German Research Center for Environmental Health, Institute of Computational Biology, 85764, Neuherberg, Germany
- Technische Universität München, Center for Mathematics, Chair of Mathematical Modeling of Biological Systems, 85748, Garching, Germany
| | - Leonard Schmiester
- Helmholtz Zentrum München - German Research Center for Environmental Health, Institute of Computational Biology, 85764, Neuherberg, Germany
- Technische Universität München, Center for Mathematics, Chair of Mathematical Modeling of Biological Systems, 85748, Garching, Germany
| | | | - Simon Merkt
- Universität Bonn, Faculty of Mathematics and Natural Sciences, 53115, Bonn, Germany
| | - Dilan Pathirana
- Universität Bonn, Faculty of Mathematics and Natural Sciences, 53115, Bonn, Germany
| | | | - Daniel Weindl
- Helmholtz Zentrum München - German Research Center for Environmental Health, Institute of Computational Biology, 85764, Neuherberg, Germany
| | - Jan Hasenauer
- Helmholtz Zentrum München - German Research Center for Environmental Health, Institute of Computational Biology, 85764, Neuherberg, Germany.
- Technische Universität München, Center for Mathematics, Chair of Mathematical Modeling of Biological Systems, 85748, Garching, Germany.
- Universität Bonn, Faculty of Mathematics and Natural Sciences, 53115, Bonn, Germany.
| |
Collapse
|
2
|
Schmiester L, Weindl D, Hasenauer J. Efficient gradient-based parameter estimation for dynamic models using qualitative data. BIOINFORMATICS (OXFORD, ENGLAND) 2021. [PMID: 34260697 DOI: 10.1101/2021.02.06.430039] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
MOTIVATION Unknown parameters of dynamical models are commonly estimated from experimental data. However, while various efficient optimization and uncertainty analysis methods have been proposed for quantitative data, methods for qualitative data are rare and suffer from bad scaling and convergence. RESULTS Here, we propose an efficient and reliable framework for estimating the parameters of ordinary differential equation models from qualitative data. In this framework, we derive a semi-analytical algorithm for gradient calculation of the optimal scaling method developed for qualitative data. This enables the use of efficient gradient-based optimization algorithms. We demonstrate that the use of gradient information improves performance of optimization and uncertainty quantification on several application examples. On average, we achieve a speedup of more than one order of magnitude compared to gradient-free optimization. In addition, in some examples, the gradient-based approach yields substantially improved objective function values and quality of the fits. Accordingly, the proposed framework substantially improves the parameterization of models from qualitative data. AVAILABILITY AND IMPLEMENTATION The proposed approach is implemented in the open-source Python Parameter EStimation TOolbox (pyPESTO). pyPESTO is available at https://github.com/ICB-DCM/pyPESTO. All application examples and code to reproduce this study are available at https://doi.org/10.5281/zenodo.4507613. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Leonard Schmiester
- Institute of Computational Biology, Helmholtz Zentrum München - German Research Center for Environmental Health, Neuherberg 85764, Germany
- Center for Mathematics, Technische Universität München, Garching 85748, Germany
| | - Daniel Weindl
- Institute of Computational Biology, Helmholtz Zentrum München - German Research Center for Environmental Health, Neuherberg 85764, Germany
| | - Jan Hasenauer
- Institute of Computational Biology, Helmholtz Zentrum München - German Research Center for Environmental Health, Neuherberg 85764, Germany
- Center for Mathematics, Technische Universität München, Garching 85748, Germany
- Faculty of Mathematics and Natural Sciences, University of Bonn, Bonn 53113, Germany
| |
Collapse
|
3
|
Schmiester L, Weindl D, Hasenauer J. Efficient gradient-based parameter estimation for dynamic models using qualitative data. Bioinformatics 2021; 37:4493-4500. [PMID: 34260697 PMCID: PMC8652033 DOI: 10.1093/bioinformatics/btab512] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2021] [Revised: 07/02/2021] [Accepted: 07/08/2021] [Indexed: 11/22/2022] Open
Abstract
Motivation Unknown parameters of dynamical models are commonly estimated from experimental data. However, while various efficient optimization and uncertainty analysis methods have been proposed for quantitative data, methods for qualitative data are rare and suffer from bad scaling and convergence. Results Here, we propose an efficient and reliable framework for estimating the parameters of ordinary differential equation models from qualitative data. In this framework, we derive a semi-analytical algorithm for gradient calculation of the optimal scaling method developed for qualitative data. This enables the use of efficient gradient-based optimization algorithms. We demonstrate that the use of gradient information improves performance of optimization and uncertainty quantification on several application examples. On average, we achieve a speedup of more than one order of magnitude compared to gradient-free optimization. In addition, in some examples, the gradient-based approach yields substantially improved objective function values and quality of the fits. Accordingly, the proposed framework substantially improves the parameterization of models from qualitative data. Availability and implementation The proposed approach is implemented in the open-source Python Parameter EStimation TOolbox (pyPESTO). pyPESTO is available at https://github.com/ICB-DCM/pyPESTO. All application examples and code to reproduce this study are available at https://doi.org/10.5281/zenodo.4507613. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Leonard Schmiester
- Institute of Computational Biology, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, 85764, Germany.,Center for Mathematics, Technische Universität München, Garching, 85748, Germany
| | - Daniel Weindl
- Institute of Computational Biology, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, 85764, Germany
| | - Jan Hasenauer
- Institute of Computational Biology, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, 85764, Germany.,Center for Mathematics, Technische Universität München, Garching, 85748, Germany.,Faculty of Mathematics and Natural Sciences, University of Bonn, Bonn, 53113, Germany
| |
Collapse
|
4
|
Mitra ED, Hlavacek WS. Bayesian inference using qualitative observations of underlying continuous variables. Bioinformatics 2020; 36:3177-3184. [PMID: 32049328 PMCID: PMC7214020 DOI: 10.1093/bioinformatics/btaa084] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2019] [Revised: 01/08/2020] [Accepted: 02/03/2020] [Indexed: 01/28/2023] Open
Abstract
MOTIVATION Recent work has demonstrated the feasibility of using non-numerical, qualitative data to parameterize mathematical models. However, uncertainty quantification (UQ) of such parameterized models has remained challenging because of a lack of a statistical interpretation of the objective functions used in optimization. RESULTS We formulated likelihood functions suitable for performing Bayesian UQ using qualitative observations of underlying continuous variables or a combination of qualitative and quantitative data. To demonstrate the resulting UQ capabilities, we analyzed a published model for immunoglobulin E (IgE) receptor signaling using synthetic qualitative and quantitative datasets. Remarkably, estimates of parameter values derived from the qualitative data were nearly as consistent with the assumed ground-truth parameter values as estimates derived from the lower throughput quantitative data. These results provide further motivation for leveraging qualitative data in biological modeling. AVAILABILITY AND IMPLEMENTATION The likelihood functions presented here are implemented in a new release of PyBioNetFit, an open-source application for analyzing Systems Biology Markup Language- and BioNetGen Language-formatted models, available online at www.github.com/lanl/PyBNF. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Eshan D Mitra
- Theoretical Biology and Biophysics Group, Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA
| | - William S Hlavacek
- Theoretical Biology and Biophysics Group, Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA
| |
Collapse
|
5
|
Mitra ED, Hlavacek WS. Parameter Estimation and Uncertainty Quantification for Systems Biology Models. CURRENT OPINION IN SYSTEMS BIOLOGY 2019; 18:9-18. [PMID: 32719822 PMCID: PMC7384601 DOI: 10.1016/j.coisb.2019.10.006] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Mathematical models can provide quantitative insights into immunoreceptor signaling, and other biological processes, but require parameterization and uncertainty quantification before reliable predictions become possible. We review currently available methods and software tools to address these problems. We consider gradient-based and gradient-free methods for point estimation of parameter values, and methods of profile likelihood, bootstrapping, and Bayesian inference for uncertainty quantification. We consider recent and potential future applications of these methods to systems-level modeling of immune-related phenomena.
Collapse
Affiliation(s)
- Eshan D. Mitra
- Theoretical Biology and Biophysics Group, Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA
| | - William S. Hlavacek
- Theoretical Biology and Biophysics Group, Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA
| |
Collapse
|
6
|
Mitra ED, Suderman R, Colvin J, Ionkov A, Hu A, Sauro HM, Posner RG, Hlavacek WS. PyBioNetFit and the Biological Property Specification Language. iScience 2019; 19:1012-1036. [PMID: 31522114 PMCID: PMC6744527 DOI: 10.1016/j.isci.2019.08.045] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2019] [Revised: 06/21/2019] [Accepted: 08/22/2019] [Indexed: 02/07/2023] Open
Abstract
In systems biology modeling, important steps include model parameterization, uncertainty quantification, and evaluation of agreement with experimental observations. To help modelers perform these steps, we developed the software PyBioNetFit, which in addition supports checking models against known system properties and solving design problems. PyBioNetFit introduces Biological Property Specification Language (BPSL) for the formal declaration of system properties. BPSL allows qualitative data to be used alone or in combination with quantitative data. PyBioNetFit performs parameterization with parallelized metaheuristic optimization algorithms that work directly with existing model definition standards: BioNetGen Language (BNGL) and Systems Biology Markup Language (SBML). We demonstrate PyBioNetFit's capabilities by solving various example problems, including the challenging problem of parameterizing a 153-parameter model of cell cycle control in yeast based on both quantitative and qualitative data. We demonstrate the model checking and design applications of PyBioNetFit and BPSL by analyzing a model of targeted drug interventions in autophagy signaling.
Collapse
Affiliation(s)
- Eshan D Mitra
- Theoretical Biology and Biophysics Group, Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Ryan Suderman
- Theoretical Biology and Biophysics Group, Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Joshua Colvin
- Department of Biological Sciences, Northern Arizona University, Flagstaff, AZ, USA
| | - Alexander Ionkov
- Theoretical Biology and Biophysics Group, Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Andrew Hu
- Department of Bioengineering, University of Washington, Seattle, WA, USA
| | - Herbert M Sauro
- Department of Bioengineering, University of Washington, Seattle, WA, USA
| | - Richard G Posner
- Department of Biological Sciences, Northern Arizona University, Flagstaff, AZ, USA
| | - William S Hlavacek
- Theoretical Biology and Biophysics Group, Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM, USA.
| |
Collapse
|
7
|
Chen M, Amos BD, Watson LT, Tyson JJ, Cao Y, Shaffer CA, Trosset MW, Oguz C, Kakoti G. Quasi-Newton Stochastic Optimization Algorithm for Parameter Estimation of a Stochastic Model of the Budding Yeast Cell Cycle. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2019; 16:301-311. [PMID: 29990127 DOI: 10.1109/tcbb.2017.2773083] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Parameter estimation in discrete or continuous deterministic cell cycle models is challenging for several reasons, including the nature of what can be observed, and the accuracy and quantity of those observations. The challenge is even greater for stochastic models, where the number of simulations and amount of empirical data must be even larger to obtain statistically valid parameter estimates. The two main contributions of this work are (1) stochastic model parameter estimation based on directly matching multivariate probability distributions, and (2) a new quasi-Newton algorithm class QNSTOP for stochastic optimization problems. QNSTOP directly uses the random objective function value samples rather than creating ensemble statistics. QNSTOP is used here to directly match empirical and simulated joint probability distributions rather than matching summary statistics. Results are given for a current state-of-the-art stochastic cell cycle model of budding yeast, whose predictions match well some summary statistics and one-dimensional distributions from empirical data, but do not match well the empirical joint distributions. The nature of the mismatch provides insight into the weakness in the stochastic model.
Collapse
|
8
|
Using both qualitative and quantitative data in parameter identification for systems biology models. Nat Commun 2018; 9:3901. [PMID: 30254246 PMCID: PMC6156341 DOI: 10.1038/s41467-018-06439-z] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2018] [Accepted: 09/04/2018] [Indexed: 11/28/2022] Open
Abstract
In systems biology, qualitative data are often generated, but rarely used to parameterize models. We demonstrate an approach in which qualitative and quantitative data can be combined for parameter identification. In this approach, qualitative data are converted into inequality constraints imposed on the outputs of the model. These inequalities are used along with quantitative data points to construct a single scalar objective function that accounts for both datasets. To illustrate the approach, we estimate parameters for a simple model describing Raf activation. We then apply the technique to a more elaborate model characterizing cell cycle regulation in yeast. We incorporate both quantitative time courses (561 data points) and qualitative phenotypes of 119 mutant yeast strains (1647 inequalities) to perform automated identification of 153 model parameters. We quantify parameter uncertainty using a profile likelihood approach. Our results indicate the value of combining qualitative and quantitative data to parameterize systems biology models. Much of the data generated in biology is qualitative, but exploiting such data to inform models of biological systems remains a challenge. Here, the authors demonstrate an approach that allows use of both quantitative and qualitative data for parameterising dynamical models.
Collapse
|
9
|
Roy M, Finley SD. Computational Model Predicts the Effects of Targeting Cellular Metabolism in Pancreatic Cancer. Front Physiol 2017; 8:217. [PMID: 28446878 PMCID: PMC5388762 DOI: 10.3389/fphys.2017.00217] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2016] [Accepted: 03/27/2017] [Indexed: 12/13/2022] Open
Abstract
Reprogramming of energy metabolism is a hallmark of cancer that enables the cancer cells to meet the increased energetic requirements due to uncontrolled proliferation. One prominent example is pancreatic ductal adenocarcinoma, an aggressive form of cancer with an overall 5-year survival rate of 5%. The reprogramming mechanism in pancreatic cancer involves deregulated uptake of glucose and glutamine and other opportunistic modes of satisfying energetic demands in a hypoxic and nutrient-poor environment. In the current study, we apply systems biology approaches to enable a better understanding of the dynamics of the distinct metabolic alterations in KRAS-mediated pancreatic cancer, with the goal of impeding early cell proliferation by identifying the optimal metabolic enzymes to target. We have constructed a kinetic model of metabolism represented as a set of ordinary differential equations that describe time evolution of the metabolite concentrations in glycolysis, glutaminolysis, tricarboxylic acid cycle and the pentose phosphate pathway. The model is comprised of 46 metabolites and 53 reactions. The mathematical model is fit to published enzyme knockdown experimental data. We then applied the model to perform in silico enzyme modulations and evaluate the effects on cell proliferation. Our work identifies potential combinations of enzyme knockdown, metabolite inhibition, and extracellular conditions that impede cell proliferation. Excitingly, the model predicts novel targets that can be tested experimentally. Therefore, the model is a tool to predict the effects of inhibiting specific metabolic reactions within pancreatic cancer cells, which is difficult to measure experimentally, as well as test further hypotheses toward targeted therapies.
Collapse
Affiliation(s)
- Mahua Roy
- Biomedical Engineering, University of Southern CaliforniaLos Angeles, CA, USA
| | - Stacey D Finley
- Biomedical Engineering, University of Southern CaliforniaLos Angeles, CA, USA.,Chemical Engineering, University of Southern CaliforniaLos Angeles, CA, USA
| |
Collapse
|
10
|
Oguz C, Watson LT, Baumann WT, Tyson JJ. Predicting network modules of cell cycle regulators using relative protein abundance statistics. BMC SYSTEMS BIOLOGY 2017; 11:30. [PMID: 28241833 PMCID: PMC5329933 DOI: 10.1186/s12918-017-0409-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/23/2016] [Accepted: 02/17/2017] [Indexed: 11/10/2022]
Abstract
BACKGROUND Parameter estimation in systems biology is typically done by enforcing experimental observations through an objective function as the parameter space of a model is explored by numerical simulations. Past studies have shown that one usually finds a set of "feasible" parameter vectors that fit the available experimental data equally well, and that these alternative vectors can make different predictions under novel experimental conditions. In this study, we characterize the feasible region of a complex model of the budding yeast cell cycle under a large set of discrete experimental constraints in order to test whether the statistical features of relative protein abundance predictions are influenced by the topology of the cell cycle regulatory network. RESULTS Using differential evolution, we generate an ensemble of feasible parameter vectors that reproduce the phenotypes (viable or inviable) of wild-type yeast cells and 110 mutant strains. We use this ensemble to predict the phenotypes of 129 mutant strains for which experimental data is not available. We identify 86 novel mutants that are predicted to be viable and then rank the cell cycle proteins in terms of their contributions to cumulative variability of relative protein abundance predictions. Proteins involved in "regulation of cell size" and "regulation of G1/S transition" contribute most to predictive variability, whereas proteins involved in "positive regulation of transcription involved in exit from mitosis," "mitotic spindle assembly checkpoint" and "negative regulation of cyclin-dependent protein kinase by cyclin degradation" contribute the least. These results suggest that the statistics of these predictions may be generating patterns specific to individual network modules (START, S/G2/M, and EXIT). To test this hypothesis, we develop random forest models for predicting the network modules of cell cycle regulators using relative abundance statistics as model inputs. Predictive performance is assessed by the areas under receiver operating characteristics curves (AUC). Our models generate an AUC range of 0.83-0.87 as opposed to randomized models with AUC values around 0.50. CONCLUSIONS By using differential evolution and random forest modeling, we show that the model prediction statistics generate distinct network module-specific patterns within the cell cycle network.
Collapse
Affiliation(s)
- Cihan Oguz
- Department of Biological Sciences, Virginia Tech, Blacksburg VA, 24061, USA.
| | - Layne T Watson
- Department of Computer Science, Virginia Tech, Blacksburg VA, 24061, USA.,Department of Mathematics, Virginia Tech, Blacksburg VA, 24061, USA.,Department of Aerospace and Ocean Engineering, Virginia Tech, Blacksburg VA, 24061, USA
| | - William T Baumann
- Department of Electrical and Computer Engineering, Virginia Tech, Blacksburg VA, 24061, USA
| | - John J Tyson
- Department of Biological Sciences, Virginia Tech, Blacksburg VA, 24061, USA
| |
Collapse
|
11
|
Abstract
The cell division cycle is controlled by a complex regulatory network which ensures that the phases of the cell cycle are executed in the right order. This regulatory network receives signals from the environment, monitors the state of the DNA, and decides timings of cell cycle events. The underlying transcriptional and post-translational regulatory interactions lead to complex dynamical responses, such as the oscillations in the levels of cell cycle proteins driven by intertwined biochemical reactions. A cell moves between different phases of its cycle similar to a dynamical system switching between its steady states. The complex molecular network driving these phases has been investigated in previous computational systems biology studies. Here, we review the critical physiological and molecular transitions that occur in the cell cycle and discuss the role of mathematical modeling in elucidating these transitions and understand cell cycle synchronization.
Collapse
|
12
|
Laomettachit T, Chen KC, Baumann WT, Tyson JJ. A Model of Yeast Cell-Cycle Regulation Based on a Standard Component Modeling Strategy for Protein Regulatory Networks. PLoS One 2016; 11:e0153738. [PMID: 27187804 PMCID: PMC4871373 DOI: 10.1371/journal.pone.0153738] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2015] [Accepted: 04/04/2016] [Indexed: 12/14/2022] Open
Abstract
To understand the molecular mechanisms that regulate cell cycle progression in eukaryotes, a variety of mathematical modeling approaches have been employed, ranging from Boolean networks and differential equations to stochastic simulations. Each approach has its own characteristic strengths and weaknesses. In this paper, we propose a “standard component” modeling strategy that combines advantageous features of Boolean networks, differential equations and stochastic simulations in a framework that acknowledges the typical sorts of reactions found in protein regulatory networks. Applying this strategy to a comprehensive mechanism of the budding yeast cell cycle, we illustrate the potential value of standard component modeling. The deterministic version of our model reproduces the phenotypic properties of wild-type cells and of 125 mutant strains. The stochastic version of our model reproduces the cell-to-cell variability of wild-type cells and the partial viability of the CLB2-dbΔ clb5Δ mutant strain. Our simulations show that mathematical modeling with “standard components” can capture in quantitative detail many essential properties of cell cycle control in budding yeast.
Collapse
Affiliation(s)
- Teeraphan Laomettachit
- Genetics, Bioinformatics, and Computational Biology Program, Virginia Polytechnic Institute and State University, Blacksburg, Virginia, United States of America
| | - Katherine C. Chen
- Department of Biological Sciences, Virginia Polytechnic Institute and State University, Blacksburg, Virginia, United States of America
| | - William T. Baumann
- Department of Electrical and Computer Engineering, Virginia Polytechnic Institute and State University, Blacksburg, Virginia, United States of America
| | - John J. Tyson
- Department of Biological Sciences, Virginia Polytechnic Institute and State University, Blacksburg, Virginia, United States of America
- * E-mail:
| |
Collapse
|
13
|
Abstract
Structural and dynamical fingerprints of evolutionary optimization in biological networks are still unclear. Here we analyze the dynamics of genetic regulatory networks responsible for the regulation of cell cycle and cell differentiation in three organisms or cell types each, and show that they follow a version of Hebb's rule which we have termed coherence. More precisely, we find that simultaneously expressed genes with a common target are less likely to act antagonistically at the attractors of the regulatory dynamics. We then investigate the dependence of coherence on structural parameters, such as the mean number of inputs per node and the activatory/repressory interaction ratio, as well as on dynamically determined quantities, such as the basin size and the number of expressed genes.
Collapse
Affiliation(s)
- Neşe Aral
- Department of Physics, Koç University, Rumelifeneri Yolu Sarıyer 34450, Istanbul, Turkey
| | | |
Collapse
|
14
|
Model-Based Analysis of Cell Cycle Responses to Dynamically Changing Environments. PLoS Comput Biol 2016; 12:e1004604. [PMID: 26741131 PMCID: PMC4704810 DOI: 10.1371/journal.pcbi.1004604] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2015] [Accepted: 10/14/2015] [Indexed: 11/19/2022] Open
Abstract
Cell cycle progression is carefully coordinated with a cell's intra- and extracellular environment. While some pathways have been identified that communicate information from the environment to the cell cycle, a systematic understanding of how this information is dynamically processed is lacking. We address this by performing dynamic sensitivity analysis of three mathematical models of the cell cycle in Saccharomyces cerevisiae. We demonstrate that these models make broadly consistent qualitative predictions about cell cycle progression under dynamically changing conditions. For example, it is shown that the models predict anticorrelated changes in cell size and cell cycle duration under different environments independently of the growth rate. This prediction is validated by comparison to available literature data. Other consistent patterns emerge, such as widespread nonmonotonic changes in cell size down generations in response to parameter changes. We extend our analysis by investigating glucose signalling to the cell cycle, showing that known regulation of Cln3 translation and Cln1,2 transcription by glucose is sufficient to explain the experimentally observed changes in cell cycle dynamics at different glucose concentrations. Together, these results provide a framework for understanding the complex responses the cell cycle is capable of producing in response to dynamic environments.
Collapse
|
15
|
Bianconi F, Baldelli E, Ludovini V, Luovini V, Petricoin EF, Crinò L, Valigi P. Conditional robustness analysis for fragility discovery and target identification in biochemical networks and in cancer systems biology. BMC SYSTEMS BIOLOGY 2015; 9:70. [PMID: 26482604 PMCID: PMC4617482 DOI: 10.1186/s12918-015-0216-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/04/2015] [Accepted: 07/16/2015] [Indexed: 12/14/2022]
Abstract
Background The study of cancer therapy is a key issue in the field of oncology research and the development of target therapies is one of the main problems currently under investigation. This is particularly relevant in different types of tumor where traditional chemotherapy approaches often fail, such as lung cancer. Results We started from the general definition of robustness introduced by Kitano and applied it to the analysis of dynamical biochemical networks, proposing a new algorithm based on moment independent analysis of input/output uncertainty. The framework utilizes novel computational methods which enable evaluating the model fragility with respect to quantitative performance measures and parameters such as reaction rate constants and initial conditions. The algorithm generates a small subset of parameters that can be used to act on complex networks and to obtain the desired behaviors. We have applied the proposed framework to the EGFR-IGF1R signal transduction network, a crucial pathway in lung cancer, as an example of Cancer Systems Biology application in drug discovery. Furthermore, we have tested our framework on a pulse generator network as an example of Synthetic Biology application, thus proving the suitability of our methodology to the characterization of the input/output synthetic circuits. Conclusions The achieved results are of immediate practical application in computational biology, and while we demonstrate their use in two specific examples, they can in fact be used to study a wider class of biological systems. Electronic supplementary material The online version of this article (doi:10.1186/s12918-015-0216-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Fortunato Bianconi
- Dept of Experimental Medicine, University of Perugia, Polo Unico Sant'Andrea delle Fratte, Via Gambuli, 1, Perugia, 06156, IT.
| | - Elisa Baldelli
- Center for Applied Proteomics and Molecular Medicine George Mason University, 10900 University Blvd, Manassas, 20110, USA.
| | | | - Vienna Luovini
- Dept of Medical Oncology, Santa Maria della Misericordia Hospital, Azienda Ospedaliera di Perugia, Piazzale Menghini, 1, Loc. Sant'Andrea delle Fratte, Perugia, 06156, IT.
| | - Emanuel F Petricoin
- Center for Applied Proteomics and Molecular Medicine George Mason University, 10900 University Blvd, Manassas, 20110, USA.
| | - Lucio Crinò
- Dept of Medical Oncology, Santa Maria della Misericordia Hospital, Azienda Ospedaliera di Perugia, Piazzale Menghini, 1, Loc. Sant'Andrea delle Fratte, Perugia, 06156, IT.
| | - Paolo Valigi
- Dept of Engineering, University of Perugia, G. Duranti, 93, Perugia, 06125, IT.
| |
Collapse
|
16
|
Abstract
Mechanistic modeling has the potential to transform how cell biologists contend with the inescapable complexity of modern biology. I am a physiologist–electrical engineer–systems biologist who has been working at the level of cell biology for the past 24 years. This perspective aims 1) to convey why we build models, 2) to enumerate the major approaches to modeling and their philosophical differences, 3) to address some recurrent concerns raised by experimentalists, and then 4) to imagine a future in which teams of experimentalists and modelers build—and subject to exhaustive experimental tests—models covering the entire spectrum from molecular cell biology to human pathophysiology. There is, in my view, no technical obstacle to this future, but it will require some plasticity in the biological research mind-set.
Collapse
Affiliation(s)
- Robert D Phair
- Integrative Bioinformatics, Inc., Mountain View, CA 94041
| |
Collapse
|
17
|
Summary of the DREAM8 Parameter Estimation Challenge: Toward Parameter Identification for Whole-Cell Models. PLoS Comput Biol 2015; 11:e1004096. [PMID: 26020786 PMCID: PMC4447414 DOI: 10.1371/journal.pcbi.1004096] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
Whole-cell models that explicitly represent all cellular components at the molecular level have the potential to predict phenotype from genotype. However, even for simple bacteria, whole-cell models will contain thousands of parameters, many of which are poorly characterized or unknown. New algorithms are needed to estimate these parameters and enable researchers to build increasingly comprehensive models. We organized the Dialogue for Reverse Engineering Assessments and Methods (DREAM) 8 Whole-Cell Parameter Estimation Challenge to develop new parameter estimation algorithms for whole-cell models. We asked participants to identify a subset of parameters of a whole-cell model given the model’s structure and in silico “experimental” data. Here we describe the challenge, the best performing methods, and new insights into the identifiability of whole-cell models. We also describe several valuable lessons we learned toward improving future challenges. Going forward, we believe that collaborative efforts supported by inexpensive cloud computing have the potential to solve whole-cell model parameter estimation. Whole-cell models promise to enable rational bioengineering by predicting how cells behave. Even for simple bacteria, whole-cell models require thousands of parameters, many of which are poorly characterized or unknown. New approaches are needed to estimate these parameters. We organized the Dialogue for Reverse Engineering Assessments and Methods (DREAM) 8 Whole-Cell Parameter Estimation Challenge to develop new approaches for whole-cell model parameter identification. Here we describe the challenge, the best performing methods, new insights into the identifiability of whole-cell models, and several lessons we learned for improving future challenges. Going forward, we believe that collaborative efforts have the potential to produce powerful tools for identifying whole-cell models.
Collapse
|
18
|
A Mathematical Framework for Understanding Four-Dimensional Heterogeneous Differentiation of CD4+ T Cells. Bull Math Biol 2015; 77:1046-64. [PMID: 25779890 DOI: 10.1007/s11538-015-0076-6] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2014] [Accepted: 03/02/2015] [Indexed: 12/24/2022]
Abstract
At least four distinct lineages of CD4+ T cells play diverse roles in the immune system. Both in vivo and in vitro, naïve CD4+ T cells often differentiate into a variety of cellular phenotypes. Previously, we developed a mathematical framework to study heterogeneous differentiation of two lineages governed by a mutual-inhibition motif. To understand heterogeneous differentiation of CD4+ T cells involving more than two lineages, we present here a mathematical framework for the analysis of multiple stable steady states in dynamical systems with multiple state variables interacting through multiple mutual-inhibition loops. A mathematical model for CD4+ T cells based on this framework can reproduce known properties of heterogeneous differentiation involving multiple lineages of this cell differentiation system, such as heterogeneous differentiation of TH1-TH2, TH1-TH17 and iTReg-TH17 under single or mixed types of differentiation stimuli. The model shows that high concentrations of differentiation stimuli favor the formation of phenotypes with co-expression of lineage-specific master regulators.
Collapse
|
19
|
Shestov AA, Liu X, Ser Z, Cluntun AA, Hung YP, Huang L, Kim D, Le A, Yellen G, Albeck JG, Locasale JW. Quantitative determinants of aerobic glycolysis identify flux through the enzyme GAPDH as a limiting step. eLife 2014; 3. [PMID: 25009227 PMCID: PMC4118620 DOI: 10.7554/elife.03342] [Citation(s) in RCA: 178] [Impact Index Per Article: 17.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2014] [Accepted: 07/08/2014] [Indexed: 12/12/2022] Open
Abstract
Aerobic glycolysis or the Warburg Effect (WE) is characterized by the increased metabolism of glucose to lactate. It remains unknown what quantitative changes to the activity of metabolism are necessary and sufficient for this phenotype. We developed a computational model of glycolysis and an integrated analysis using metabolic control analysis (MCA), metabolomics data, and statistical simulations. We identified and confirmed a novel mode of regulation specific to aerobic glycolysis where flux through GAPDH, the enzyme separating lower and upper glycolysis, is the rate-limiting step in the pathway and the levels of fructose (1,6) bisphosphate (FBP), are predictive of the rate and control points in glycolysis. Strikingly, negative flux control was found and confirmed for several steps thought to be rate-limiting in glycolysis. Together, these findings enumerate the biochemical determinants of the WE and suggest strategies for identifying the contexts in which agents that target glycolysis might be most effective. DOI:http://dx.doi.org/10.7554/eLife.03342.001 Cells generate energy from a sugar called glucose via a process called glycolysis. This process involves many enzymes that catalyze 10 different chemical reactions, and it essentially converts glucose step-by-step into a simpler chemical called pyruvate. Pyruvate is then normally transported into structures within the cell called mitochondria, where it is further broken down using oxygen to release more energy. However, in cells that are rapidly dividing, pyruvate is converted into another chemical called lactate—which releases energy more quickly, but releases less energy overall. Cancer cells often convert most of their glucose into lactate, rather than breaking down pyruvate in their mitochondria: an observation known as the ‘Warburg effect’. And while many factors affect how a cell releases energy from pyruvate, it remains unclear what regulates which of these biochemical processes is most common in a living cell. In this study, Shestov et al. have developed a computational model for the process of glycolysis and used this to investigate the causes of the Warburg Effect. The model was based on the known characteristics of the enzymes and chemical reactions involved at each step. It predicted that the activity of the enzyme called GAPDH, which carries out the sixth step in glycolysis, in many cases affects how much lactate is produced. This suggests that this enzyme represents a bottleneck in the pathway. Next, Shestov et al. performed experiments where they used drugs to block different stages of the glycolysis pathway, and confirmed that the GAPDH enzyme is important for regulating this pathway in living cancer cells too. In these treated cells, the levels of a chemical called fructose-1,6-biphosphate (which is made in a step in the pathway between glucose and pyruvate) were either very high or very low. Shestov et al. proposed that the flow of chemicals through the glycolysis pathway is controlled by the GAPDH enzyme when the chemicals used by the enzymes upstream of GAPDH in the pathway (which includes fructose-1,6-biphosphate) are plentiful. However, if these chemicals are limited, other enzymes that are involved in earlier steps of the pathway regulate the process instead. The findings of Shestov et al. reveal that the regulation of glycolysis is more complex than previously thought, and is also very different when cells are undergoing the Warburg Effect. In the future, these findings might help to identify the types of cancer that could be effectively treated using drugs that target the glycolysis process, which are currently being tested in pre-clinical studies. DOI:http://dx.doi.org/10.7554/eLife.03342.002
Collapse
Affiliation(s)
| | - Xiaojing Liu
- Division of Nutritional Sciences, Cornell University, Ithaca, United States
| | - Zheng Ser
- Division of Nutritional Sciences, Cornell University, Ithaca, United States
| | - Ahmad A Cluntun
- Field of Biochemistry and Molecular Cell Biology, Department of Molecular Biology and Genetics, Cornell University, Ithaca, United States
| | - Yin P Hung
- Department of Neurobiology, Harvard Medical School, Boston, United States
| | - Lei Huang
- Field of Computational Biology, Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, United States
| | - Dongsung Kim
- Field of Biochemistry and Molecular Cell Biology, Department of Molecular Biology and Genetics, Cornell University, Ithaca, United States
| | - Anne Le
- Department of Pathology and Oncology, Johns Hopkins University School of Medicine, Baltimore, United States
| | - Gary Yellen
- Department of Neurobiology, Harvard Medical School, Boston, United States
| | - John G Albeck
- Department of Cell Biology, Harvard Medical School, Boston, United States
| | - Jason W Locasale
- Division of Nutritional Sciences, Cornell University, Ithaca, United States
| |
Collapse
|
20
|
Oguz C, Palmisano A, Laomettachit T, Watson LT, Baumann WT, Tyson JJ. A stochastic model correctly predicts changes in budding yeast cell cycle dynamics upon periodic expression of CLN2. PLoS One 2014; 9:e96726. [PMID: 24816736 PMCID: PMC4016136 DOI: 10.1371/journal.pone.0096726] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2013] [Accepted: 04/11/2014] [Indexed: 01/30/2023] Open
Abstract
In this study, we focus on a recent stochastic budding yeast cell cycle model. First, we estimate the model parameters using extensive data sets: phenotypes of 110 genetic strains, single cell statistics of wild type and cln3 strains. Optimization of stochastic model parameters is achieved by an automated algorithm we recently used for a deterministic cell cycle model. Next, in order to test the predictive ability of the stochastic model, we focus on a recent experimental study in which forced periodic expression of CLN2 cyclin (driven by MET3 promoter in cln3 background) has been used to synchronize budding yeast cell colonies. We demonstrate that the model correctly predicts the experimentally observed synchronization levels and cell cycle statistics of mother and daughter cells under various experimental conditions (numerical data that is not enforced in parameter optimization), in addition to correctly predicting the qualitative changes in size control due to forced CLN2 expression. Our model also generates a novel prediction: under frequent CLN2 expression pulses, G1 phase duration is bimodal among small-born cells. These cells originate from daughters with extended budded periods due to size control during the budded period. This novel prediction and the experimental trends captured by the model illustrate the interplay between cell cycle dynamics, synchronization of cell colonies, and size control in budding yeast.
Collapse
Affiliation(s)
- Cihan Oguz
- Department of Biological Sciences, Virginia Tech, Blacksburg, Virginia, United States of America
- * E-mail:
| | - Alida Palmisano
- Department of Biological Sciences, Virginia Tech, Blacksburg, Virginia, United States of America
- Department of Computer Science, Virginia Tech, Blacksburg, Virginia, United States of America
| | - Teeraphan Laomettachit
- Bioinformatics and Systems Biology Program, School of Bioresources and Technology, King Mongkut's University of Technology Thonburi, Bangkok, Thailand
| | - Layne T. Watson
- Department of Computer Science, Virginia Tech, Blacksburg, Virginia, United States of America
- Department of Mathematics, Virginia Tech, Blacksburg, Virginia, United States of America
- Department of Aerospace and Ocean Engineering, Virginia Tech, Blacksburg, Virginia, United States of America
| | - William T. Baumann
- Department of Electrical and Computer Engineering, Virginia Tech, Blacksburg, Virginia, United States of America
| | - John J. Tyson
- Department of Biological Sciences, Virginia Tech, Blacksburg, Virginia, United States of America
| |
Collapse
|