1
|
Karabekmez ME. Insights into yeast response to chemotherapeutic agent through time series genome-scale metabolic models. Biotechnol Bioeng 2024; 121:3351-3359. [PMID: 39199017 DOI: 10.1002/bit.28833] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2024] [Revised: 07/17/2024] [Accepted: 08/20/2024] [Indexed: 09/01/2024]
Abstract
Organism-specific genome-scale metabolic models (GSMMs) can unveil molecular mechanisms within cells and are commonly used in diverse applications, from synthetic biology, biotechnology, and systems biology to metabolic engineering. There are limited studies incorporating time-series transcriptomics in GSMM simulations. Yeast is an easy-to-manipulate model organism for tumor research. Here, a novel approach (TS-GSMM) was proposed to integrate time-series transcriptomics with GSMMs to narrow down the feasible solution space of all possible flux distributions and attain time-series flux samples. The flux samples were clustered using machine learning techniques, and the clusters' functional analysis was performed using reaction set enrichment analysis. A time series transcriptomics response of Yeast cells to a chemotherapeutic reagent-doxorubicin-was mapped onto a Yeast GSMM. Eleven flux clusters were obtained with our approach, and pathway dynamics were displayed. Induction of fluxes related to bicarbonate formation and transport, ergosterol and spermidine transport, and ATP production were captured. Integrating time-series transcriptomics data with GSMMs is a promising approach to reveal pathway dynamics without any kinetic modeling and detects pathways that cannot be identified through transcriptomics-only analysis. The codes are available at https://github.com/karabekmez/TS-GSMM.
Collapse
|
2
|
Fonseca LL, Böttcher L, Mehrad B, Laubenbacher RC. Surrogate modeling and control of medical digital twins. ARXIV 2024:arXiv:2402.05750v2. [PMID: 38827450 PMCID: PMC11142319] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 06/04/2024]
Abstract
The vision of personalized medicine is to identify interventions that maintain or restore a person's health based on their individual biology. Medical digital twins, computational models that integrate a wide range of health-related data about a person and can be dynamically updated, are a key technology that can help guide medical decisions. Such medical digital twin models can be high-dimensional, multi-scale, and stochastic. To be practical for healthcare applications, they often need to be simplified into low-dimensional surrogate models that can be used for optimal design of interventions. This paper introduces surrogate modeling algorithms for the purpose of optimal control applications. As a use case, we focus on agent-based models (ABMs), a common model type in biomedicine for which there are no readily available optimal control algorithms. By deriving surrogate models that are based on systems of ordinary differential equations, we show how optimal control methods can be employed to compute effective interventions, which can then be lifted back to a given ABM. The relevance of the methods introduced here extends beyond medical digital twins to other complex dynamical systems.
Collapse
Affiliation(s)
- Luis L. Fonseca
- Laboratory for Systems Medicine, Department of Medicine, University of Florida, Gainesville, FL, USA
| | - Lucas Böttcher
- Laboratory for Systems Medicine, Department of Medicine, University of Florida, Gainesville, FL, USA
- Department of Computational Science and Philosophy, Frankfurt School of Finance and Management, 60322 Frankfurt am Main, Germany
| | - Borna Mehrad
- Laboratory for Systems Medicine, Department of Medicine, University of Florida, Gainesville, FL, USA
| | - Reinhard C. Laubenbacher
- Laboratory for Systems Medicine, Department of Medicine, University of Florida, Gainesville, FL, USA
| |
Collapse
|
3
|
Lee JY, Han Y, Styczynski MP. Towards inferring absolute concentrations from relative abundance in time-course GC-MS metabolomics data. Mol Omics 2023; 19:126-136. [PMID: 36374123 PMCID: PMC9974747 DOI: 10.1039/d2mo00168c] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Metabolomics, the large-scale study of metabolites, has significant appeal as a source of information for metabolic modeling and other scientific applications. One common approach for measuring metabolomics data is gas chromatography-mass spectrometry (GC-MS). However, GC-MS metabolomics data are typically reported as relative abundances, precluding their use with approaches and tools where absolute concentrations are necessary. While chemical standards can be used to help provide quantification, their use is time-consuming, expensive, or even impossible due to their limited availability. The ability to infer absolute concentrations from GC-MS metabolomics data without chemical standards would have significant value. We hypothesized that when analyzing time-course metabolomics datasets, the mass balances of metabolism and other biological information could provide sufficient information towards inference of absolute concentrations. To demonstrate this, we developed and characterized MetaboPAC, a computational framework that uses two approaches-one based on kinetic equations and another using biological heuristics-to predict the most likely response factors that allow translation between relative abundances and absolute concentrations. When used to analyze noiseless synthetic data generated from multiple types of kinetic rate laws, MetaboPAC performs significantly better than negative control approaches when 20% of kinetic terms are known a priori. Under conditions of lower sampling frequency and high noise, MetaboPAC is still able to provide significant inference of concentrations in 3 of 4 models studied. This provides a starting point for leveraging biological knowledge to extract concentration information from time-course intracellular GC-MS metabolomics datasets, particularly for systems that are well-studied and have partially known kinetic structures.
Collapse
Affiliation(s)
- Justin Y Lee
- School of Chemical & Biomolecular Engineering, Georgia Institute of Technology, Atlanta, GA, USA.
| | - Yue Han
- School of Chemical & Biomolecular Engineering, Georgia Institute of Technology, Atlanta, GA, USA.
| | - Mark P Styczynski
- School of Chemical & Biomolecular Engineering, Georgia Institute of Technology, Atlanta, GA, USA.
| |
Collapse
|
4
|
Zhang H, Chen J, Tian T. Bayesian Inference of Stochastic Dynamic Models Using Early-Rejection Methods Based on Sequential Stochastic Simulations. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:1484-1494. [PMID: 33216717 DOI: 10.1109/tcbb.2020.3039490] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Stochastic modelling is an important method to investigate the functions of noise in a wide range of biological systems. However, the parameter inference for stochastic models is still a challenging problem partially due to the large computing time required for stochastic simulations. To address this issue, we propose a novel early-rejection method by using sequential stochastic simulations. We first show that a large number of stochastic simulations are required to obtain reliable inference results. Instead of generating a large number of simulations for each parameter sample, we propose to generate these simulations in a number of stages. The simulation process will go to the next stage only if the accuracy of simulations at the current stage satisfies a given error criterion. We propose a formula to determine the error criterion and use a stochastic differential equation model to examine the effects of different criteria. Three biochemical network models are used to evaluate the efficiency and accuracy of the proposed method. Numerical results suggest the proposed early-rejection method achieves substantial improvement in the efficiency for the inference of stochastic models.
Collapse
|
5
|
Lee JY, Styczynski MP. Diverse classes of constraints enable broader applicability of a linear programming-based dynamic metabolic modeling framework. Sci Rep 2022; 12:762. [PMID: 35031616 PMCID: PMC8760257 DOI: 10.1038/s41598-021-03934-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2021] [Accepted: 12/08/2021] [Indexed: 11/29/2022] Open
Abstract
Current metabolic modeling tools suffer from a variety of limitations, from scalability to simplifying assumptions, that preclude their use in many applications. We recently created a modeling framework, Linear Kinetics-Dynamic Flux Balance Analysis (LK-DFBA), that addresses a key gap: capturing metabolite dynamics and regulation while retaining a potentially scalable linear programming structure. Key to this framework's success are the linear kinetics and regulatory constraints imposed on the system. However, while the linearity of these constraints reduces computational complexity, it may not accurately capture the behavior of many biochemical systems. Here, we developed three new classes of LK-DFBA constraints to better model interactions between metabolites and the reactions they regulate. We tested these new approaches on several synthetic and biological systems, and also performed the first-ever comparison of LK-DFBA predictions to experimental data. We found that no single constraint approach was optimal across all systems examined, and systems with the same topological structure but different parameters were often best modeled by different types of constraints. However, we did find that when genetic perturbations were implemented in the systems, the optimal constraint approach typically remained the same as for the wild-type regardless of the model topology or parameterization, indicating that just a single wild-type dataset could allow identification of the ideal constraint to enable model predictivity for a given system. These results suggest that the availability of multiple constraint approaches will allow LK-DFBA to model a wider range of metabolic systems.
Collapse
Affiliation(s)
- Justin Y. Lee
- grid.213917.f0000 0001 2097 4943School of Chemical & Biomolecular Engineering, Georgia Institute of Technology, Atlanta, GA USA
| | - Mark P. Styczynski
- grid.213917.f0000 0001 2097 4943School of Chemical & Biomolecular Engineering, Georgia Institute of Technology, Atlanta, GA USA
| |
Collapse
|
6
|
Ferreira AE, Sousa Silva M, Cordeiro C. Metabolic Network Inference from Time Series. SYSTEMS MEDICINE 2021. [DOI: 10.1016/b978-0-12-801238-3.11347-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022] Open
|
7
|
Voit EO. Metabolic Systems. SYSTEMS MEDICINE 2021. [DOI: 10.1016/b978-0-12-801238-3.11619-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
|
8
|
Kuriya Y, Araki M. Dynamic Flux Balance Analysis to Evaluate the Strain Production Performance on Shikimic Acid Production in Escherichia coli. Metabolites 2020; 10:E198. [PMID: 32429049 PMCID: PMC7281464 DOI: 10.3390/metabo10050198] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2020] [Revised: 05/11/2020] [Accepted: 05/12/2020] [Indexed: 11/18/2022] Open
Abstract
Flux balance analysis (FBA) is used to improve the microbial production of useful compounds. However, a large gap often exists between the FBA solution and the experimental yield, because of growth and byproducts. FBA has been extended to dynamic FBA (dFBA), which is applicable to time-varying processes, such as batch or fed-batch cultures, and has significantly contributed to metabolic and cultural engineering applications. On the other hand, the performance of the experimental strains has not been fully evaluated. In this study, we applied dFBA to the production of shikimic acid from glucose in Escherichia coli, to evaluate the production performance of the strain as a case study. The experimental data of glucose consumption and cell growth were used as FBA constraints. Bi-level FBA optimization with maximized growth and shikimic acid production were the objective functions. Results suggest that the shikimic acid concentration in the high-shikimic-acid-producing strain constructed in the experiment reached up to 84% of the maximum value by simulation. Thus, this method can be used to evaluate the performance of strains and estimate the milestones of strain improvement.
Collapse
Affiliation(s)
- Yuki Kuriya
- Graduate School of Medicine, Kyoto University, 54 ShogoinKawahara-cho, Sakyo-ku, Kyoto 606-8507, Japan;
| | - Michihiro Araki
- Graduate School of Medicine, Kyoto University, 54 ShogoinKawahara-cho, Sakyo-ku, Kyoto 606-8507, Japan;
- National Institutes of Biomedical Innovation, Health and Nutrition, 1-23-1 Toyama, Shinjuku-ku, Tokyo 162-8636, Japan
- Graduate School of Science, Technology and Innovation, Kobe University, 1-1 Rokkodai, Nada, Kobe, Hyogo 657-8501, Japan
| |
Collapse
|
9
|
Dattner I, Ship H, Voit EO. Separable Nonlinear Least-Squares Parameter Estimation for Complex Dynamic Systems. COMPLEXITY 2020; 2020:6403641. [PMID: 34113070 PMCID: PMC8188859 DOI: 10.1155/2020/6403641] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Nonlinear dynamic models are widely used for characterizing processes that govern complex biological pathway systems. Over the past decade, validation and further development of these models became possible due to data collected via high-throughput experiments using methods from molecular biology. While these data are very beneficial, they are typically incomplete and noisy, which renders the inference of parameter values for complex dynamic models challenging. Fortunately, many biological systems have embedded linear mathematical features, which may be exploited, thereby improving fits and leading to better convergence of optimization algorithms. In this paper, we explore options of inference for dynamic models using a novel method of separable nonlinear least-squares optimization and compare its performance to the traditional nonlinear least-squares method. The numerical results from extensive simulations suggest that the proposed approach is at least as accurate as the traditional nonlinear least-squares, but usually superior, while also enjoying a substantial reduction in computational time.
Collapse
Affiliation(s)
- Itai Dattner
- Department of Statistics, University of Haifa, 199 Aba Khoushy Ave., Mount Carmel, Haifa 3498838, Israel
| | - Harold Ship
- Department of Statistics, University of Haifa, 199 Aba Khoushy Ave., Mount Carmel, Haifa 3498838, Israel
| | - Eberhard O. Voit
- The Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology and Emory University, 950 Atlantic Drive, Atslanta, GA 30332–2000, USA
| |
Collapse
|
10
|
Dromms RA, Lee JY, Styczynski MP. LK-DFBA: a linear programming-based modeling strategy for capturing dynamics and metabolite-dependent regulation in metabolism. BMC Bioinformatics 2020; 21:93. [PMID: 32122331 PMCID: PMC7053146 DOI: 10.1186/s12859-020-3422-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2019] [Accepted: 02/17/2020] [Indexed: 01/11/2023] Open
Abstract
BACKGROUND The systems-scale analysis of cellular metabolites, "metabolomics," provides data ideal for applications in metabolic engineering. However, many of the computational tools for strain design are built around Flux Balance Analysis (FBA), which makes assumptions that preclude direct integration of metabolomics data into the underlying models. Finding a way to retain the advantages of FBA's linear structure while relaxing some of its assumptions could allow us to account for metabolite levels and metabolite-dependent regulation in strain design tools built from FBA, improving the accuracy of predictions made by these tools. We designed, implemented, and characterized a modeling strategy based on Dynamic FBA (DFBA), called Linear Kinetics-Dynamic Flux Balance Analysis (LK-DFBA), to satisfy these specifications. Our strategy adds constraints describing the dynamics and regulation of metabolism that are strictly linear. We evaluated LK-DFBA against alternative modeling frameworks using simulated noisy data from a small in silico model and a larger model of central carbon metabolism in E. coli, and compared each framework's ability to recapitulate the original system. RESULTS In the smaller model, we found that we could use regression from a dynamic flux estimation (DFE) with an optional non-linear parameter optimization to reproduce metabolite concentration dynamic trends more effectively than an ordinary differential equation model with generalized mass action rate laws when tested under realistic data sampling frequency and noise levels. We observed detrimental effects across all tested modeling approaches when metabolite time course data were missing, but found these effects to be smaller for LK-DFBA in most cases. With the E. coli model, we produced qualitatively reasonable results with similar properties to the smaller model and explored two different parameterization structures that yield trade-offs in computation time and accuracy. CONCLUSIONS LK-DFBA allows for calculation of metabolite concentrations and considers metabolite-dependent regulation while still retaining many computational advantages of FBA. This provides the proof-of-principle for a new metabolic modeling framework with the potential to create genome-scale dynamic models and the potential to be applied in strain engineering tools that currently use FBA.
Collapse
Affiliation(s)
- Robert A Dromms
- School of Chemical & Biomolecular Engineering, Georgia Institute of Technology, Atlanta, GA, USA
| | - Justin Y Lee
- School of Chemical & Biomolecular Engineering, Georgia Institute of Technology, Atlanta, GA, USA
| | - Mark P Styczynski
- School of Chemical & Biomolecular Engineering, Georgia Institute of Technology, Atlanta, GA, USA.
| |
Collapse
|
11
|
Chung M, Binois M, Gramacy RB, Bardsley JM, Moquin DJ, Smith AP, Smith AM. PARAMETER AND UNCERTAINTY ESTIMATION FOR DYNAMICAL SYSTEMS USING SURROGATE STOCHASTIC PROCESSES. SIAM JOURNAL ON SCIENTIFIC COMPUTING : A PUBLICATION OF THE SOCIETY FOR INDUSTRIAL AND APPLIED MATHEMATICS 2019; 41:A2212-A2238. [PMID: 31749599 PMCID: PMC6867882 DOI: 10.1137/18m1213403] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Inference on unknown quantities in dynamical systems via observational data is essential for providing meaningful insight, furnishing accurate predictions, enabling robust control, and establishing appropriate designs for future experiments. Merging mathematical theory with empirical measurements in a statistically coherent way is critical and challenges abound, e.g., ill-posedness of the parameter estimation problem, proper regularization and incorporation of prior knowledge, and computational limitations. To address these issues, we propose a new method for learning parameterized dynamical systems from data. We first customize and fit a surrogate stochastic process directly to observational data, front-loading with statistical learning to respect prior knowledge (e.g., smoothness), cope with challenging data features like heteroskedasticity, heavy tails, and censoring. Then, samples of the stochastic process are used as "surrogate data" and point estimates are computed via ordinary point estimation methods in a modular fashion. Attractive features of this two-step approach include modularity and trivial parallelizability. We demonstrate its advantages on a predator-prey simulation study and on a real-world application involving within-host influenza virus infection data paired with a viral kinetic model, with comparisons to a more conventional Markov chain Monte Carlo (MCMC) based Bayesian approach.
Collapse
Affiliation(s)
- Matthias Chung
- Department of Mathematics, Computational Modeling and Data Analytics Division, Academy of Integrated Science, Virginia Tech, Blacksburg, VA 24061
| | - Mickaël Binois
- Booth School of Business, University of Chicago, Chicago, IL 60637
| | | | | | - David J Moquin
- Department of Internal Medicine, University of Tennessee Health Science Center, Memphis, TN 38103
| | - Amanda P Smith
- Department of Pediatrics, University of Tennessee Health Science Center, Memphis, TN 38103
| | - Amber M Smith
- Department of Pediatrics, University of Tennessee Health Science Center, Memphis, TN 38103
| |
Collapse
|
12
|
Dattner I, Gugushvili S. Application of one-step method to parameter estimation in ODE models. STAT NEERL 2018; 72:126-156. [PMID: 29937593 PMCID: PMC5993282 DOI: 10.1111/stan.12124] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2016] [Revised: 11/23/2017] [Indexed: 11/27/2022]
Abstract
In this paper, we study application of Le Cam's one-step method to parameter estimation in ordinary differential equation models. This computationally simple technique can serve as an alternative to numerical evaluation of the popular non-linear least squares estimator, which typically requires the use of a multistep iterative algorithm and repetitive numerical integration of the ordinary differential equation system. The one-step method starts from a preliminary n -consistent estimator of the parameter of interest and next turns it into an asymptotic (as the sample size n→∞) equivalent of the least squares estimator through a numerically straightforward procedure. We demonstrate performance of the one-step estimator via extensive simulations and real data examples. The method enables the researcher to obtain both point and interval estimates. The preliminary n -consistent estimator that we use depends on non-parametric smoothing, and we provide a data-driven methodology for choosing its tuning parameter and support it by theory. An easy implementation scheme of the one-step method for practical use is pointed out.
Collapse
Affiliation(s)
- Itai Dattner
- Department of StatisticsUniversity of Haifa199 Aba Khoushy Ave., Mount CarmelHaifa3498838Israel
| | - Shota Gugushvili
- Mathematical InstituteLeiden UniversityP.O. Box 95122300 RALeidenThe Netherlands
| |
Collapse
|
13
|
Downs DM, Bazurto JV, Gupta A, Fonseca LL, Voit EO. The three-legged stool of understanding metabolism: integrating metabolomics with biochemical genetics and computational modeling. AIMS Microbiol 2018; 4:289-303. [PMID: 31294216 PMCID: PMC6604926 DOI: 10.3934/microbiol.2018.2.289] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2018] [Accepted: 04/02/2018] [Indexed: 12/23/2022] Open
Abstract
Traditional biochemical research has resulted in a good understanding of many aspects of metabolism. However, this reductionist approach is time consuming and requires substantial resources, thus raising the question whether modern metabolomics and genomics should take over and replace the targeted experiments of old. We proffer that such a replacement is neither feasible not desirable and propose instead the tight integration of modern, system-wide omics with traditional experimental bench science and dedicated computational approaches. This integration is an important prerequisite toward the optimal acquisition of knowledge regarding metabolism and physiology in health and disease. The commentary describes advantages and drawbacks of current approaches to assessing metabolism and highlights the challenges to be overcome as we strive to achieve a deeper level of metabolic understanding in the future.
Collapse
Affiliation(s)
- Diana M Downs
- Department of Microbiology, University of Georgia, Athens, GA, 30602, USA
| | - Jannell V Bazurto
- Department of Biological Sciences, University of Idaho, Moscow, ID, 83844, USA
| | - Anuj Gupta
- Department of Biomedical Engineering, Georgia Institute of Technology, 950 Atlantic Drive, Suite 2115, Atlanta, GA, 30332-2000, USA
| | - Luis L Fonseca
- Department of Biomedical Engineering, Georgia Institute of Technology, 950 Atlantic Drive, Suite 2115, Atlanta, GA, 30332-2000, USA
| | - Eberhard O Voit
- Department of Biomedical Engineering, Georgia Institute of Technology, 950 Atlantic Drive, Suite 2115, Atlanta, GA, 30332-2000, USA
| |
Collapse
|
14
|
Voit EO. The best models of metabolism. WILEY INTERDISCIPLINARY REVIEWS. SYSTEMS BIOLOGY AND MEDICINE 2017; 9:10.1002/wsbm.1391. [PMID: 28544810 PMCID: PMC5643013 DOI: 10.1002/wsbm.1391] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/16/2017] [Revised: 03/31/2017] [Accepted: 04/01/2017] [Indexed: 12/25/2022]
Abstract
Biochemical systems are among of the oldest application areas of mathematical modeling. Spanning a time period of over one hundred years, the repertoire of options for structuring a model and for formulating reactions has been constantly growing, and yet, it is still unclear whether or to what degree some models are better than others and how the modeler is to choose among them. In fact, the variety of options has become overwhelming and difficult to maneuver for novices and experts alike. This review outlines the metabolic model design process and discusses the numerous choices for modeling frameworks and mathematical representations. It tries to be inclusive, even though it cannot be complete, and introduces the various modeling options in a manner that is as unbiased as that is feasible. However, the review does end with personal recommendations for the choices of default models. WIREs Syst Biol Med 2017, 9:e1391. doi: 10.1002/wsbm.1391 For further resources related to this article, please visit the WIREs website.
Collapse
Affiliation(s)
- Eberhard O Voit
- Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, GA, USA
| |
Collapse
|
15
|
|
16
|
Chung M, Krueger J, Pop M. Identification of microbiota dynamics using robust parameter estimation methods. Math Biosci 2017; 294:71-84. [PMID: 29030152 DOI: 10.1016/j.mbs.2017.09.009] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2016] [Revised: 09/25/2017] [Accepted: 09/28/2017] [Indexed: 01/25/2023]
Abstract
The compositions of in-host microbial communities (microbiota) play a significant role in host health, and a better understanding of the microbiota's role in a host's transition from health to disease or vice versa could lead to novel medical treatments. One of the first steps toward this understanding is modeling interaction dynamics of the microbiota, which can be exceedingly challenging given the complexity of the dynamics and difficulties in collecting sufficient data. Methods such as principal differential analysis, dynamic flux estimation, and others have been developed to overcome these challenges. Despite their advantages, these methods are still vastly underutilized in fields such as mathematical biology, and one potential reason for this is their sophisticated implementation. While this paper focuses on applying principal differential analysis to microbiota data, we also provide comprehensive details regarding the derivation and numerics of this method and include a functional implementation for readers' benefit. For further validation of these methods, we demonstrate the feasibility of principal differential analysis using simulation studies and then apply the method to intestinal and vaginal microbiota data. In working with these data, we capture experimentally confirmed dynamics while also revealing potential new insights into the system dynamics.
Collapse
Affiliation(s)
- Matthias Chung
- Virginia Tech, Department of Mathematics, 225 Stanger St, Blacksburg, VA, United States; Virginia Tech, Computational Modeling and Data Analytics, Academy of Integrated Science, Blacksburg, VA, United States.
| | - Justin Krueger
- Virginia Tech, Department of Mathematics, 225 Stanger St, Blacksburg, VA, United States.
| | - Mihai Pop
- University of Maryland, Center for Bioinformatics and Computational Biology, 8314 Paint Branch Dr., College Park, MD, United States.
| |
Collapse
|
17
|
Schmitz AC, Hartline CJ, Zhang F. Engineering Microbial Metabolite Dynamics and Heterogeneity. Biotechnol J 2017; 12. [PMID: 28901715 DOI: 10.1002/biot.201700422] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2017] [Revised: 09/06/2017] [Indexed: 11/09/2022]
Abstract
As yields for biological chemical production in microorganisms approach their theoretical maximum, metabolic engineering requires new tools, and approaches for improvements beyond what traditional strategies can achieve. Engineering metabolite dynamics and metabolite heterogeneity is necessary to achieve further improvements in product titers, productivities, and yields. Metabolite dynamics, the ensemble change in metabolite concentration over time, arise from the need for microbes to adapt their metabolism in response to the extracellular environment and are important for controlling growth and productivity in industrial fermentations. Metabolite heterogeneity, the cell-to-cell variation in a metabolite concentration in an isoclonal population, has a significant impact on ensemble productivity. Recent advances in single cell analysis enable a more complete understanding of the processes driving metabolite heterogeneity and reveal metabolic engineering targets. The authors present an overview of the mechanistic origins of metabolite dynamics and heterogeneity, why they are important, their potential effects in chemical production processes, and tools and strategies for engineering metabolite dynamics and heterogeneity. The authors emphasize that the ability to control metabolite dynamics and heterogeneity will bring new avenues of engineering to increase productivity of microbial strains.
Collapse
Affiliation(s)
- Alexander C Schmitz
- Department of Energy, Environmental and Chemical Engineering, Washington University in St. Louis, St. Louis, USA
| | - Christopher J Hartline
- Department of Energy, Environmental and Chemical Engineering, Washington University in St. Louis, St. Louis, USA
| | - Fuzhong Zhang
- Department of Energy, Environmental and Chemical Engineering, Washington University in St. Louis, St. Louis, USA.,Division of Biological and Biomedical Sciences, and Institute of Materials Science and Engineering, Washington University in St. Louis, St. Louis, USA
| |
Collapse
|
18
|
Faraji M, Voit EO. Stepwise inference of likely dynamic flux distributions from metabolic time series data. Bioinformatics 2017; 33:2165-2172. [PMID: 28334199 PMCID: PMC5860468 DOI: 10.1093/bioinformatics/btx126] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2016] [Accepted: 03/03/2017] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Most metabolic pathways contain more reactions than metabolites and therefore have a wide stoichiometric matrix that corresponds to infinitely many possible flux distributions that are perfectly compatible with the dynamics of the metabolites in a given dataset. This under-determinedness poses a challenge for the quantitative characterization of flux distributions from time series data and thus for the design of adequate, predictive models. Here we propose a method that reduces the degrees of freedom in a stepwise manner and leads to a dynamic flux distribution that is, in a statistical sense, likely to be close to the true distribution. RESULTS We applied the proposed method to the lignin biosynthesis pathway in switchgrass. The system consists of 16 metabolites and 23 enzymatic reactions. It has seven degrees of freedom and therefore admits a large space of dynamic flux distributions that all fit a set of metabolic time series data equally well. The proposed method reduces this space in a systematic and biologically reasonable manner and converges to a likely dynamic flux distribution in just a few iterations. The estimated solution and the true flux distribution, which is known in this case, show excellent agreement and thereby lend support to the method. AVAILABILITY AND IMPLEMENTATION The computational model was implemented in MATLAB (version R2014a, The MathWorks, Natick, MA). The source code is available at https://github.gatech.edu/VoitLab/Stepwise-Inference-of-Likely-Dynamic-Flux-Distributions and www.bst.bme.gatech.edu/research.php . CONTACT mojdeh@gatech.edu or eberhard.voit@bme.gatech.edu. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Mojdeh Faraji
- Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, GA, USA
| | - Eberhard O Voit
- Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, GA, USA
| |
Collapse
|
19
|
Faraji M, Voit EO. Nonparametric dynamic modeling. Math Biosci 2017; 287:130-146. [PMID: 27590775 PMCID: PMC5706552 DOI: 10.1016/j.mbs.2016.08.004] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2016] [Revised: 08/17/2016] [Accepted: 08/17/2016] [Indexed: 01/11/2023]
Abstract
Challenging as it typically is, the estimation of parameter values seems to be an unavoidable step in the design and implementation of any dynamic model. Here, we demonstrate that it is possible to set up, diagnose, and simulate dynamic models without the need to estimate parameter values, if the situation is favorable. Specifically, it is possible to establish nonparametric models for nonlinear compartment models, including metabolic pathway models, if sufficiently many high-quality time series data are available that describe the biological phenomenon under investigation in an appropriate and representative manner. The proposed nonparametric strategy is a variant of the method of Dynamic Flux Estimation (DFE), which permits the estimation of numerical flux profiles from metabolic time series data. However, instead of attempting to formulate these numerical profiles as explicit functions and to optimize their parameter values, as it is done in DFE, the metabolite and flux profiles are used here directly as a scaffold for a library from which values are interpolated and retrieved for the simulation of the differential equations describing the model. Beyond simulations, the proposed methods render it possible to determine steady states from non-steady state data, perform sensitivity analyses, and estimate the Jacobian of the system at a steady state.
Collapse
Affiliation(s)
- Mojdeh Faraji
- Department of Biomedical Engineering, Georgia Institute of Technology, 950 Atlantic Drive, Suite 2115, Atlanta, GA 30332-2000, USA.
| | - Eberhard O Voit
- Department of Biomedical Engineering, Georgia Institute of Technology, 950 Atlantic Drive, Suite 2115, Atlanta, GA 30332-2000, USA.
| |
Collapse
|
20
|
Qi Z, Roback JD, Voit EO. Effects of Storage Time on Glycolysis in Donated Human Blood Units. Metabolites 2017; 7:metabo7020012. [PMID: 28353627 PMCID: PMC5487983 DOI: 10.3390/metabo7020012] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2016] [Revised: 03/06/2017] [Accepted: 03/23/2017] [Indexed: 12/26/2022] Open
Abstract
Background: Donated blood is typically stored before transfusions. During storage, the metabolism of red blood cells changes, possibly causing storage lesions. The changes are storage time dependent and exhibit donor-specific variations. It is necessary to uncover and characterize the responsible molecular mechanisms accounting for such biochemical changes, qualitatively and quantitatively; Study Design and Methods: Based on the integration of metabolic time series data, kinetic models, and a stoichiometric model of the glycolytic pathway, a customized inference method was developed and used to quantify the dynamic changes in glycolytic fluxes during the storage of donated blood units. The method provides a proof of principle for the feasibility of inferences regarding flux characteristics from metabolomics data; Results: Several glycolytic reaction steps change substantially during storage time and vary among different fluxes and donors. The quantification of these storage time effects, which are possibly irreversible, allows for predictions of the transfusion outcome of individual blood units; Conclusion: The improved mechanistic understanding of blood storage, obtained from this computational study, may aid the identification of blood units that age quickly or more slowly during storage, and may ultimately improve transfusion management in clinics.
Collapse
Affiliation(s)
- Zhen Qi
- Department of Biomedical Engineering, Georgia Institute of Technology and Emory University School of Medicine, Atlanta, GA 30332, USA.
| | - John D Roback
- Center for Transfusion and Cellular Therapy, Department of Pathology and Laboratory, Emory University School of Medicine, Atlanta, GA 30322, USA.
- Emory University Hospital, Blood Bank, Atlanta, GA 30322, USA.
| | - Eberhard O Voit
- Department of Biomedical Engineering, Georgia Institute of Technology and Emory University School of Medicine, Atlanta, GA 30332, USA.
| |
Collapse
|
21
|
Dolatshahi S, Fonseca LL, Voit EO. New insights into the complex regulation of the glycolytic pathway in Lactococcus lactis. II. Inference of the precisely timed control system regulating glycolysis. MOLECULAR BIOSYSTEMS 2016; 12:37-47. [PMID: 26609780 DOI: 10.1039/c5mb00726g] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
The dairy bacterium Lactococcus lactis has to master a complicated task. It must control its essentially linear glycolytic pathway in such a fashion that, when the substrate, glucose, runs out, it retains enough phosphoenolpyruvate and fructose-1,6-bisphosphate to be able to restart glycolysis as soon as new glucose becomes available. Although glycolysis is arguably the best-studied metabolic pathway, its details in L. lactis are still unclear, and it is, in particular, not understood how the bacterium manages the stop-and-start task. The primary purpose of this paper and its companion is a clarification of some of the details of the governing regulatory strategies with which L. lactis manages to retain the necessary metabolites for the restart of glycolysis after periods of starvation. The paper furthermore discusses how the bacterium changes these strategies when it is subjected to aerobic rather than anaerobic conditions.
Collapse
Affiliation(s)
- Sepideh Dolatshahi
- Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA.
| | - Luis L Fonseca
- Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA.
| | - Eberhard O Voit
- Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA.
| |
Collapse
|
22
|
Dolatshahi S, Voit EO. Identification of Metabolic Pathway Systems. Front Genet 2016; 7:6. [PMID: 26904095 PMCID: PMC4748741 DOI: 10.3389/fgene.2016.00006] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2015] [Accepted: 01/18/2016] [Indexed: 01/22/2023] Open
Abstract
The estimation of parameters in even moderately large biological systems is a significant challenge. This challenge is greatly exacerbated if the mathematical formats of appropriate process descriptions are unknown. To address this challenge, the method of dynamic flux estimation (DFE) was proposed for the analysis of metabolic time series data. Under ideal conditions, the first phase of DFE yields numerical representations of all fluxes within a metabolic pathway system, either as values at each time point or as plots against their substrates and modulators. However, this numerical result does not reveal the mathematical format of each flux. Thus, the second phase of DFE selects functional formats that are consistent with the numerical trends obtained from the first phase. While greatly facilitating metabolic data analysis, DFE is only directly applicable if the pathway system contains as many dependent variables as fluxes. Because most actual systems contain more fluxes than metabolite pools, this requirement is seldom satisfied. Auxiliary methods have been proposed to alleviate this issue, but they are not general. Here we propose strategies that extend DFE toward general, slightly underdetermined pathway systems.
Collapse
Affiliation(s)
- Sepideh Dolatshahi
- Department of Biomedical Engineering, Georgia Institute of Technology Atlanta, GA, USA
| | - Eberhard O Voit
- Department of Biomedical Engineering, Georgia Institute of Technology Atlanta, GA, USA
| |
Collapse
|
23
|
Bayesian Computation Methods for Inferring Regulatory Network Models Using Biomedical Data. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2016; 939:289-307. [DOI: 10.1007/978-981-10-1503-8_12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
24
|
Dolatshahi S, Fonseca LL, Voit EO. New insights into the complex regulation of the glycolytic pathway in Lactococcus lactis. I. Construction and diagnosis of a comprehensive dynamic model. MOLECULAR BIOSYSTEMS 2016; 12:23-36. [DOI: 10.1039/c5mb00331h] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
This article and the companion paper use computational systems modeling to decipher the complex coordination of regulatory signals controlling the glycolytic pathway in the dairy bacterium Lactococcus lactis.
Collapse
Affiliation(s)
- Sepideh Dolatshahi
- Department of Biomedical Engineering
- Georgia Institute of Technology
- Atlanta
- USA
| | - Luis L. Fonseca
- Department of Biomedical Engineering
- Georgia Institute of Technology
- Atlanta
- USA
| | - Eberhard O. Voit
- Department of Biomedical Engineering
- Georgia Institute of Technology
- Atlanta
- USA
| |
Collapse
|
25
|
Schumacher R, Wahl SA. Effective Estimation of Dynamic Metabolic Fluxes Using (13)C Labeling and Piecewise Affine Approximation: From Theory to Practical Applicability. Metabolites 2015; 5:697-719. [PMID: 26690237 PMCID: PMC4693191 DOI: 10.3390/metabo5040697] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2015] [Revised: 11/11/2015] [Accepted: 11/26/2015] [Indexed: 11/25/2022] Open
Abstract
The design of microbial production processes relies on rational choices for metabolic engineering of the production host and the process conditions. These require a systematic and quantitative understanding of cellular regulation. Therefore, a novel method for dynamic flux identification using quantitative metabolomics and 13C labeling to identify piecewise-affine (PWA) flux functions has been described recently. Obtaining flux estimates nevertheless still required frequent manual reinitalization to obtain a good reproduction of the experimental data and, moreover, did not optimize on all observables simultaneously (metabolites and isotopomer concentrations). In our contribution we focus on measures to achieve faster and robust dynamic flux estimation which leads to a high dimensional parameter estimation problem. Specifically, we address the following challenges within the PWA problem formulation: (1) Fast selection of sufficient domains for the PWA flux functions, (2) Control of over-fitting in the concentration space using shape-prescriptive modeling and (3) robust and efficient implementation of the parameter estimation using the hybrid implicit filtering algorithm. With the improvements we significantly speed up the convergence by efficiently exploiting that the optimization problem is partly linear. This allows application to larger-scale metabolic networks and demonstrates that the proposed approach is not purely theoretical, but also applicable in practice.
Collapse
Affiliation(s)
- Robin Schumacher
- Department of Biotechnology, Delft University of Technology, Julianalaan 67, 2628 BC Delft, The Netherlands.
| | - S Aljoscha Wahl
- Department of Biotechnology, Delft University of Technology, Julianalaan 67, 2628 BC Delft, The Netherlands.
| |
Collapse
|
26
|
Sun X, Länger B, Weckwerth W. Challenges of Inversely Estimating Jacobian from Metabolomics Data. Front Bioeng Biotechnol 2015; 3:188. [PMID: 26636075 PMCID: PMC4649029 DOI: 10.3389/fbioe.2015.00188] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2015] [Accepted: 11/02/2015] [Indexed: 12/02/2022] Open
Abstract
Inferring dynamics of metabolic networks directly from metabolomics data provides a promising way to elucidate the underlying mechanisms of biological systems, as reported in our previous studies (Weckwerth, 2011; Sun and Weckwerth, 2012; Nägele et al., 2014) by a differential Jacobian approach. The Jacobian is solved from an overdetermined system of equations as JC + CJT = −2D, called Lyapunov Equation in its generic form,1 where J is the Jacobian, C is the covariance matrix of metabolomics data, and D is the fluctuation matrix. Lyapunov Equation can be further simplified as the linear form Ax = b. Frequently, this linear equation system is ill-conditioned, i.e., a small variation in the right side b results in a big change in the solution x, thus making the solution unstable and error-prone. At the same time, inaccurate estimation of covariance matrix and uncertainties in the fluctuation matrix bring biases to the solution x. Here, we first reviewed common approaches to circumvent the ill-conditioned problems, including total least squares, Tikhonov regularization, and truncated singular value decomposition. Then, we benchmarked these methods on several in silico kinetic models with small to large perturbations on the covariance and fluctuation matrices. The results identified that the accuracy of the reverse Jacobian is mainly dependent on the condition number of A, the perturbation amplitude of C, and the stiffness of the kinetic models. Our research contributes a systematical comparison of methods to inversely solve Jacobian from metabolomics data.
Collapse
Affiliation(s)
- Xiaoliang Sun
- Department of Ecogenomics and Systems Biology, University of Vienna , Vienna , Austria ; Institute of Integrative Biology, University of Liverpool , Liverpool , United Kingdom
| | - Bettina Länger
- Department of Ecogenomics and Systems Biology, University of Vienna , Vienna , Austria
| | - Wolfram Weckwerth
- Department of Ecogenomics and Systems Biology, University of Vienna , Vienna , Austria ; Vienna Metabolomics Center (VIME), University of Vienna , Vienna , Austria
| |
Collapse
|
27
|
Sehr C, Kremling A, Marin-Sanguino A. Design Principles as a Guide for Constraint Based and Dynamic Modeling: Towards an Integrative Workflow. Metabolites 2015; 5:601-35. [PMID: 26501332 PMCID: PMC4693187 DOI: 10.3390/metabo5040601] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2015] [Accepted: 10/10/2015] [Indexed: 01/19/2023] Open
Abstract
During the last 10 years, systems biology has matured from a fuzzy concept combining omics, mathematical modeling and computers into a scientific field on its own right. In spite of its incredible potential, the multilevel complexity of its objects of study makes it very difficult to establish a reliable connection between data and models. The great number of degrees of freedom often results in situations, where many different models can explain/fit all available datasets. This has resulted in a shift of paradigm from the initially dominant, maybe naive, idea of inferring the system out of a number of datasets to the application of different techniques that reduce the degrees of freedom before any data set is analyzed. There is a wide variety of techniques available, each of them can contribute a piece of the puzzle and include different kinds of experimental information. But the challenge that remains is their meaningful integration. Here we show some theoretical results that enable some of the main modeling approaches to be applied sequentially in a complementary manner, and how this workflow can benefit from evolutionary reasoning to keep the complexity of the problem in check. As a proof of concept, we show how the synergies between these modeling techniques can provide insight into some well studied problems: Ammonia assimilation in bacteria and an unbranched linear pathway with end-product inhibition.
Collapse
Affiliation(s)
- Christiana Sehr
- Specialty Division for Systems Biotechnology, Technische Universität München, Boltzmannstraße 15, Garching 85748, Germany.
| | - Andreas Kremling
- Specialty Division for Systems Biotechnology, Technische Universität München, Boltzmannstraße 15, Garching 85748, Germany.
| | - Alberto Marin-Sanguino
- Specialty Division for Systems Biotechnology, Technische Universität München, Boltzmannstraße 15, Garching 85748, Germany.
| |
Collapse
|
28
|
Hartmann A, Lemos JM, Vinga S. Modeling multiple experiments using regularized optimization: A case study on bacterial glucose utilization dynamics. Comput Biol Med 2015; 63:301-9. [DOI: 10.1016/j.compbiomed.2014.08.027] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2014] [Revised: 06/27/2014] [Accepted: 08/28/2014] [Indexed: 11/26/2022]
|
29
|
Chen PW, Fonseca LL, Hannun YA, Voit EO. Dynamics of the Heat Stress Response of Ceramides with Different Fatty-Acyl Chain Lengths in Baker's Yeast. PLoS Comput Biol 2015; 11:e1004373. [PMID: 26241868 PMCID: PMC4524633 DOI: 10.1371/journal.pcbi.1004373] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2014] [Accepted: 06/02/2015] [Indexed: 11/19/2022] Open
Abstract
The article demonstrates that computational modeling has the capacity to convert metabolic snapshots, taken sequentially over time, into a description of cellular, dynamic strategies. The specific application is a detailed analysis of a set of actions with which Saccharomyces cerevisiae responds to heat stress. Using time dependent metabolic concentration data, we use a combination of mathematical modeling, reverse engineering, and optimization to infer dynamic changes in enzyme activities within the sphingolipid pathway. The details of the sphingolipid responses to heat stress are important, because they guide some of the longer-term alterations in gene expression, with which the cells adapt to the increased temperature. The analysis indicates that all enzyme activities in the system are affected and that the shapes of the time trends in activities depend on the fatty-acyl CoA chain lengths of the different ceramide species in the system.
Collapse
Affiliation(s)
- Po-Wei Chen
- Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, Georgia, United States of America
- The George W. Woodruff School of Mechanical Engineering, Georgia Institute of Technology, Atlanta, Georgia, United States of America
| | - Luis L. Fonseca
- Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, Georgia, United States of America
| | - Yusuf A. Hannun
- The Cancer Center at Stony Brook Medicine, Stony Brook University, Health Science Center, Stony Brook, New York, United States of America
| | - Eberhard O. Voit
- Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, Georgia, United States of America
| |
Collapse
|
30
|
Dromms RA, Styczynski MP. Improved metabolite profile smoothing for flux estimation. MOLECULAR BIOSYSTEMS 2015; 11:2394-405. [PMID: 26172986 DOI: 10.1039/c5mb00165j] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
As genome-scale metabolic models become more sophisticated and dynamic, one significant challenge in using these models is to effectively integrate increasingly prevalent systems-scale metabolite profiling data into them. One common data processing step when integrating metabolite data is to smooth experimental time course measurements: the smoothed profiles can be used to estimate metabolite accumulation (derivatives), and thus the flux distribution of the metabolic model. However, this smoothing step is susceptible to the (often significant) noise in experimental measurements, limiting the accuracy of downstream model predictions. Here, we present several improvements to current approaches for smoothing metabolite time course data using defined functions. First, we use a biologically-inspired mathematical model function taken from transcriptional profiling and clustering literature that captures the dynamics of many biologically relevant transient processes. We demonstrate that it is competitive with, and often superior to, previously described fitting schemas, and may serve as an effective single option for data smoothing in metabolic flux applications. We also implement a resampling-based approach to buffer out sensitivity to specific data sets and allow for more accurate fitting of noisy data. We found that this method, as well as the addition of parameter space constraints, yielded improved estimates of concentrations and derivatives (fluxes) in previously described fitting functions. These methods have the potential to improve the accuracy of existing and future dynamic metabolic models by allowing for the more effective integration of metabolite profiling data.
Collapse
Affiliation(s)
- Robert A Dromms
- School of Chemical & Biomolecular Engineering, Georgia Institute of Technology, 311 Ferst Drive, Atlanta, GA 30332-0100, USA.
| | | |
Collapse
|
31
|
Castellini A, Paltrinieri D, Manca V. MP-GeneticSynth: inferring biological network regulations from time series. Bioinformatics 2015; 31:785-7. [PMID: 25344496 DOI: 10.1093/bioinformatics/btu694] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
MP-GeneticSynth is a Java tool for discovering the logic and regulation mechanisms responsible for observed biological dynamics in terms of finite difference recurrent equations. The software makes use of: (i) metabolic P systems as a modeling framework, (ii) an evolutionary approach to discover flux regulation functions as linear combinations of given primitive functions, (iii) a suitable reformulation of the least squares method to estimate function parameters considering simultaneously all the reactions involved in complex dynamics. The tool is available as a plugin for the virtual laboratory MetaPlab. It has graphical and interactive interfaces for data preparation, a priori knowledge integration, and flux regulator analysis. Availability and implementation: Source code, binaries, documentation (including quick start guide and videos) and case studies are freely available at http://mplab.sci.univr.it/plugins/mpgs/index.html.
Collapse
Affiliation(s)
- Alberto Castellini
- Center for BioMedical Computing, Verona University, 37134 Verona, Italy, Max Planck Institute for Molecular Plant Physiology-Syst Bio & Math Modelling Group, 14476, Germany, University of Potsdam, Institute for Biochemistry and Biology - Bioinformatics Group, 14476, Germany and Department of Computer Science, Verona University, 37134 Verona, Italy Center for BioMedical Computing, Verona University, 37134 Verona, Italy, Max Planck Institute for Molecular Plant Physiology-Syst Bio & Math Modelling Group, 14476, Germany, University of Potsdam, Institute for Biochemistry and Biology - Bioinformatics Group, 14476, Germany and Department of Computer Science, Verona University, 37134 Verona, Italy Center for BioMedical Computing, Verona University, 37134 Verona, Italy, Max Planck Institute for Molecular Plant Physiology-Syst Bio & Math Modelling Group, 14476, Germany, University of Potsdam, Institute for Biochemistry and Biology - Bioinformatics Group, 14476, Germany and Department of Computer Science, Verona University, 37134 Verona, Italy
| | - Daniele Paltrinieri
- Center for BioMedical Computing, Verona University, 37134 Verona, Italy, Max Planck Institute for Molecular Plant Physiology-Syst Bio & Math Modelling Group, 14476, Germany, University of Potsdam, Institute for Biochemistry and Biology - Bioinformatics Group, 14476, Germany and Department of Computer Science, Verona University, 37134 Verona, Italy
| | - Vincenzo Manca
- Center for BioMedical Computing, Verona University, 37134 Verona, Italy, Max Planck Institute for Molecular Plant Physiology-Syst Bio & Math Modelling Group, 14476, Germany, University of Potsdam, Institute for Biochemistry and Biology - Bioinformatics Group, 14476, Germany and Department of Computer Science, Verona University, 37134 Verona, Italy Center for BioMedical Computing, Verona University, 37134 Verona, Italy, Max Planck Institute for Molecular Plant Physiology-Syst Bio & Math Modelling Group, 14476, Germany, University of Potsdam, Institute for Biochemistry and Biology - Bioinformatics Group, 14476, Germany and Department of Computer Science, Verona University, 37134 Verona, Italy
| |
Collapse
|
32
|
Abstract
This year we celebrate the 150th anniversary of the law of mass action. This law is often assumed to have been “there” forever, but it has its own history, background, and a definite starting point. The law has had an impact on chemistry, biochemistry, biomathematics, and systems biology that is difficult to overestimate. It is easily recognized that it is the direct basis for computational enzyme kinetics, ecological systems models, and models for the spread of diseases. The article reviews the explicit and implicit role of the law of mass action in systems biology and reveals how the original, more general formulation of the law emerged one hundred years later ab initio as a very general, canonical representation of biological processes.
Collapse
|
33
|
|
34
|
Parameter estimation of dynamic biological network models using integrated fluxes. BMC SYSTEMS BIOLOGY 2014; 8:127. [PMID: 25403239 PMCID: PMC4241227 DOI: 10.1186/s12918-014-0127-x] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/07/2014] [Accepted: 10/29/2014] [Indexed: 11/29/2022]
Abstract
Background Parameter estimation is often the bottlenecking step in biological system modeling. For ordinary differential equation (ODE) models, the challenge in this estimation has been attributed to not only the lack of parameter identifiability, but also computational issues such as finding globally optimal parameter estimates over highly multidimensional search space. Recent methods using incremental estimation approach could alleviate the computational difficulty by performing the parameter estimation one-reaction-at-a-time. However, incremental estimation strategies usually require data smoothing and are known to produce biased parameter estimates. Results In this article, we presented a new parameter estimation method called integrated flux parameter estimation (IFPE). We employed the integral form of the ODE such that we could compute the integral of reaction fluxes from time-series concentration data without data smoothing. Here, we formulated the parameter estimation as a nested optimization problem. In the outer optimization, we performed a minimization of model prediction errors over parameters associated with a subset of reactions labeled as independent. The dimension of the independent reaction subset was equal to the degrees of freedom in the calculation of integrated fluxes (IF) from concentration data. We selected the independent reactions such that given their IF values, the IFs of the remaining (dependent) reactions could be uniquely determined. Meanwhile, in the inner optimization, we estimated the model parameters associated with the dependent reactions, one-reaction-at-a-time, by minimizing the dependent IF prediction errors. We demonstrated the performance of the IFPE method using two case studies: a generalized mass action model of a branched pathway and a lin-log ODE model of Lactococcus lactis glycolytic pathway. Conclusions The IFPE significantly outperformed standard simultaneous parameter estimation in terms of computational efficiency and scaling. In comparison to incremental parameter estimation (IPE) method, the IFPE produced parameter estimates with significantly lower bias and did not require time-series data smoothing. The advantages of IFPE over the IPE however came at the cost of a small increase in the computational time. Electronic supplementary material The online version of this article (doi:10.1186/s12918-014-0127-x) contains supplementary material, which is available to authorized users.
Collapse
|
35
|
Wu Q, Smith-Miles K, Tian T. Approximate Bayesian computation schemes for parameter inference of discrete stochastic models using simulated likelihood density. BMC Bioinformatics 2014; 15 Suppl 12:S3. [PMID: 25473744 PMCID: PMC4243104 DOI: 10.1186/1471-2105-15-s12-s3] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Mathematical modeling is an important tool in systems biology to study the dynamic property of complex biological systems. However, one of the major challenges in systems biology is how to infer unknown parameters in mathematical models based on the experimental data sets, in particular, when the data are sparse and the regulatory network is stochastic. RESULTS To address this issue, this work proposed a new algorithm to estimate parameters in stochastic models using simulated likelihood density in the framework of approximate Bayesian computation. Two stochastic models were used to demonstrate the efficiency and effectiveness of the proposed method. In addition, we designed another algorithm based on a novel objective function to measure the accuracy of stochastic simulations. CONCLUSIONS Simulation results suggest that the usage of simulated likelihood density improves the accuracy of estimates substantially. When the error is measured at each observation time point individually, the estimated parameters have better accuracy than those obtained by a published method in which the error is measured using simulations over the entire observation time period.
Collapse
|
36
|
Khodayari A, Zomorrodi AR, Liao JC, Maranas CD. A kinetic model of Escherichia coli core metabolism satisfying multiple sets of mutant flux data. Metab Eng 2014; 25:50-62. [DOI: 10.1016/j.ymben.2014.05.014] [Citation(s) in RCA: 145] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2014] [Revised: 04/17/2014] [Accepted: 05/28/2014] [Indexed: 01/27/2023]
|
37
|
Hatakeyama Y, Kataoka H, Nakajima N, Watabe T, Fujimoto S, Okuhara Y. Prediction model for glucose metabolism based on lipid metabolism. Methods Inf Med 2014; 53:357-63. [PMID: 24986162 DOI: 10.3414/me14-01-0034] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2014] [Accepted: 04/18/2014] [Indexed: 11/09/2022]
Abstract
OBJECTIVES We developed a robust, long-term clinical prediction model to predict conditions leading to early diabetes using laboratory values other than blood glucose and insulin levels. Our model protects against missing data and noise that occur during long-term analysis. METHODS RESULTS of a 75-g oral glucose tolerance test (OGTT) were divided into three groups: diabetes, impaired glucose tolerance (IGT), and normal (n = 114, 235, and 325, respectively). For glucose metabolic and lipid metabolic parameters, near 30-day mean values and 10-year integrated values were compared. The relation between high-density lipoprotein cholesterol (HDL-C) and variations in HbA1c was analyzed in 158 patients. We also constructed a state space model consisting of an observation model (HDL-C and HbA1c) and an internal model (disorders of lipid metabolism and glucose metabolism) and applied this model to 116 cases. RESULTS The root mean square error between the observed HbA1c and predicted HbA1c was 0.25. CONCLUSIONS In the observation model, HDL-C levels were useful for prediction of increases in HbA1c. Even with numerous missing values over time, as occurs in clinical practice, clinically valid predictions can be made using this state space model.
Collapse
Affiliation(s)
- Y Hatakeyama
- Yutaka Hatakeyama, Center of Medical Information Science, Kochi University Medical School, Oko-cho Kohasu, Nankoku, Kochi, Kochi 783-8505, Japan, E-mail:
| | | | | | | | | | | |
Collapse
|
38
|
Identification of crucial parameters in a mathematical multiscale model of glioblastoma growth. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2014; 2014:437094. [PMID: 24899919 PMCID: PMC4034489 DOI: 10.1155/2014/437094] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/16/2014] [Accepted: 03/22/2014] [Indexed: 02/08/2023]
Abstract
Glioblastomas are highly malignant brain tumours. Mathematical models and their analysis
provide a tool to support the understanding of the development of these tumours as well as
the design of more effective treatment strategies. We have previously developed a multiscale
model of glioblastoma progression that covers processes on the cellular and molecular scale.
Here, we present a novel nutrient-dependent multiscale sensitivity analysis of this model
that helps to identify those reaction parameters of the molecular interaction network that influence
the tumour progression on the cellular scale the most. In particular, those parameters
are identified that essentially determine tumour expansion and could be therefore used as potential
therapy targets. As indicators for the success of a potential therapy target, a deceleration
of the tumour expansion and a reduction of the tumour volume are employed.
From the results, it can be concluded that no single parameter variation results in a less
aggressive tumour. However, it can be shown that a few combined perturbations of two
systematically selected parameters cause a slow-down of the tumour expansion velocity accompanied
with a decrease of the tumour volume. Those parameters are primarily linked to
the reactions that involve the microRNA-451 and the thereof regulated protein MO25.
Collapse
|
39
|
Zurauskienė J, Kirk P, Thorne T, Pinney J, Stumpf M. Derivative processes for modelling metabolic fluxes. ACTA ACUST UNITED AC 2014; 30:1892-8. [PMID: 24578401 PMCID: PMC4071196 DOI: 10.1093/bioinformatics/btu069] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
Motivation: One of the challenging questions in modelling biological systems is to characterize the functional forms of the processes that control and orchestrate molecular and cellular phenotypes. Recently proposed methods for the analysis of metabolic pathways, for example, dynamic flux estimation, can only provide estimates of the underlying fluxes at discrete time points but fail to capture the complete temporal behaviour. To describe the dynamic variation of the fluxes, we additionally require the assumption of specific functional forms that can capture the temporal behaviour. However, it also remains unclear how to address the noise which might be present in experimentally measured metabolite concentrations. Results: Here we propose a novel approach to modelling metabolic fluxes: derivative processes that are based on multiple-output Gaussian processes (MGPs), which are a flexible non-parametric Bayesian modelling technique. The main advantages that follow from MGPs approach include the natural non-parametric representation of the fluxes and ability to impute the missing data in between the measurements. Our derivative process approach allows us to model changes in metabolite derivative concentrations and to characterize the temporal behaviour of metabolic fluxes from time course data. Because the derivative of a Gaussian process is itself a Gaussian process, we can readily link metabolite concentrations to metabolic fluxes and vice versa. Here we discuss how this can be implemented in an MGP framework and illustrate its application to simple models, including nitrogen metabolism in Escherichia coli. Availability and implementation: R code is available from the authors upon request. Contact:j.norkunaite@imperial.ac.uk; m.stumpf@imperial.ac.uk Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Justina Zurauskienė
- Theoretical Systems Biology Group, Centre for Bioinformatics, Imperial College London, South Kensington Campus, London SW7 2AZ, UK
| | - Paul Kirk
- Theoretical Systems Biology Group, Centre for Bioinformatics, Imperial College London, South Kensington Campus, London SW7 2AZ, UK
| | - Thomas Thorne
- Theoretical Systems Biology Group, Centre for Bioinformatics, Imperial College London, South Kensington Campus, London SW7 2AZ, UK
| | - John Pinney
- Theoretical Systems Biology Group, Centre for Bioinformatics, Imperial College London, South Kensington Campus, London SW7 2AZ, UK
| | - Michael Stumpf
- Theoretical Systems Biology Group, Centre for Bioinformatics, Imperial College London, South Kensington Campus, London SW7 2AZ, UK
| |
Collapse
|
40
|
Voit EO. Characterizability of metabolic pathway systems from time series data. Math Biosci 2013; 246:315-25. [PMID: 23391489 PMCID: PMC3709000 DOI: 10.1016/j.mbs.2013.01.008] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2012] [Revised: 01/11/2013] [Accepted: 01/20/2013] [Indexed: 11/28/2022]
Abstract
Over the past decade, the biomathematical community has devoted substantial effort to the complicated challenge of estimating parameter values for biological systems models. An even more difficult issue is the characterization of functional forms for the processes that govern these systems. Most parameter estimation approaches tacitly assume that these forms are known or can be assumed with some validity. However, this assumption is not always true. The recently proposed method of Dynamic Flux Estimation (DFE) addresses this problem in a genuinely novel fashion for metabolic pathway systems. Specifically, DFE allows the characterization of fluxes within such systems through an analysis of metabolic time series data. Its main drawback is the fact that DFE can only directly be applied if the pathway system contains as many metabolites as unknown fluxes. This situation is unfortunately rare. To overcome this roadblock, earlier work in this field had proposed strategies for augmenting the set of unknown fluxes with independent kinetic information, which however is not always available. Employing Moore-Penrose pseudo-inverse methods of linear algebra, the present article discusses an approach for characterizing fluxes from metabolic time series data that is applicable even if the pathway system is underdetermined and contains more fluxes than metabolites. Intriguingly, this approach is independent of a specific modeling framework and unaffected by noise in the experimental time series data. The results reveal whether any fluxes may be characterized and, if so, which subset is characterizable. They also help with the identification of fluxes that, if they could be determined independently, would allow the application of DFE.
Collapse
Affiliation(s)
- Eberhard O Voit
- The Wallace H. Coulter, Department of Biomedical Engineering at Georgia Tech. and Emory University, 313 Ferst Drive, Suite 4103, Atlanta, GA 30332-0535, United States.
| |
Collapse
|
41
|
Voit EO. Mesoscopic modeling as a starting point for computational analyses of cystic fibrosis as a systemic disease. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2013; 1844:258-70. [PMID: 23570976 DOI: 10.1016/j.bbapap.2013.03.023] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/19/2012] [Revised: 03/01/2013] [Accepted: 03/25/2013] [Indexed: 12/18/2022]
Abstract
Probably the most prominent expectation associated with systems biology is the computational support of personalized medicine and predictive health. At least some of this anticipated support is envisioned in the form of disease simulators that will take hundreds of personalized biomarker data as input and allow the physician to explore and optimize possible treatment regimens on a computer before the best treatment is applied to the actual patient in a custom-tailored manner. The key prerequisites for such simulators are mathematical and computational models that not only manage the input data and implement the general physiological and pathological principles of organ systems but also integrate the myriads of details that affect their functionality to a significant degree. Obviously, the construction of such models is an overwhelming task that suggests the long-term development of hierarchical or telescopic approaches representing the physiology of organs and their diseases, first coarsely and over time with increased granularity. This article illustrates the rudiments of such a strategy in the context of cystic fibrosis (CF) of the lung. The starting point is a very simplistic, generic model of inflammation, which has been shown to capture the principles of infection, trauma, and sepsis surprisingly well. The adaptation of this model to CF contains as variables healthy and damaged cells, as well as different classes of interacting cytokines and infectious microbes that are affected by mucus formation, which is the hallmark symptom of the disease (Perez-Vilar and Boucher, 2004) [1]. The simple model represents the overall dynamics of the disease progression, including so-called acute pulmonary exacerbations, quite well, but of course does not provide much detail regarding the specific processes underlying the disease. In order to launch the next level of modeling with finer granularity, it is desirable to determine which components of the coarse model contribute most to the disease dynamics. The article introduces for this purpose the concept of module gains or ModGains, which quantify the sensitivity of key disease variables in the higher-level system. In reality, these variables represent complex modules at the next level of granularity, and the computation of ModGains therefore allows an importance ranking of variables that should be replaced with more detailed models. The "hot-swapping" of such detailed modules for former variables is greatly facilitated by the architecture and implementation of the overarching, coarse model structure, which is here formulated with methods of biochemical systems theory (BST). This article is part of a Special Issue entitled: Computational Proteomics, Systems Biology & Clinical Implications. Guest Editor: Yudong Cai.
Collapse
Affiliation(s)
- Eberhard O Voit
- Department of Biomedical Engineering, Georgia Tech, 313 Ferst Drive, Suite 4103, Atlanta, GA 30332-0535, USA.
| |
Collapse
|
42
|
Scotti M, Stella L, Shearer EJ, Stover PJ. Modeling cellular compartmentation in one-carbon metabolism. WILEY INTERDISCIPLINARY REVIEWS-SYSTEMS BIOLOGY AND MEDICINE 2013; 5:343-65. [PMID: 23408533 DOI: 10.1002/wsbm.1209] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
Folate-mediated one-carbon metabolism (FOCM) is associated with risk for numerous pathological states including birth defects, cancers, and chronic diseases. Although the enzymes that constitute the biological pathways have been well described and their interdependency through the shared use of folate cofactors appreciated, the biological mechanisms underlying disease etiologies remain elusive. The FOCM network is highly sensitive to nutritional status of several B-vitamins and numerous penetrant gene variants that alter network outputs, but current computational approaches do not fully capture the dynamics and stochastic noise of the system. Combining the stochastic approach with a rule-based representation will help model the intrinsic noise displayed by FOCM, address the limited flexibility of standard simulation methods for coarse-graining the FOCM-associated biochemical processes, and manage the combinatorial complexity emerging from reactions within FOCM that would otherwise be intractable.
Collapse
Affiliation(s)
- Marco Scotti
- The Microsoft Research-University of Trento Centre for Computational and Systems Biology (COSBI), Rovereto, Italy
| | | | | | | |
Collapse
|
43
|
Abstract
Biochemical systems theory (BST) is the foundation for a set of analytical andmodeling tools that facilitate the analysis of dynamic biological systems. This paper depicts major developments in BST up to the current state of the art in 2012. It discusses its rationale, describes the typical strategies and methods of designing, diagnosing, analyzing, and utilizing BST models, and reviews areas of application. The paper is intended as a guide for investigators entering the fascinating field of biological systems analysis and as a resource for practitioners and experts.
Collapse
|
44
|
Jia G, Stephanopoulos G, Gunawan R. Incremental parameter estimation of kinetic metabolic network models. BMC SYSTEMS BIOLOGY 2012; 6:142. [PMID: 23171810 PMCID: PMC3568022 DOI: 10.1186/1752-0509-6-142] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/26/2012] [Accepted: 11/07/2012] [Indexed: 11/10/2022]
Abstract
BACKGROUND An efficient and reliable parameter estimation method is essential for the creation of biological models using ordinary differential equation (ODE). Most of the existing estimation methods involve finding the global minimum of data fitting residuals over the entire parameter space simultaneously. Unfortunately, the associated computational requirement often becomes prohibitively high due to the large number of parameters and the lack of complete parameter identifiability (i.e. not all parameters can be uniquely identified). RESULTS In this work, an incremental approach was applied to the parameter estimation of ODE models from concentration time profiles. Particularly, the method was developed to address a commonly encountered circumstance in the modeling of metabolic networks, where the number of metabolic fluxes (reaction rates) exceeds that of metabolites (chemical species). Here, the minimization of model residuals was performed over a subset of the parameter space that is associated with the degrees of freedom in the dynamic flux estimation from the concentration time-slopes. The efficacy of this method was demonstrated using two generalized mass action (GMA) models, where the method significantly outperformed single-step estimations. In addition, an extension of the estimation method to handle missing data is also presented. CONCLUSIONS The proposed incremental estimation method is able to tackle the issue on the lack of complete parameter identifiability and to significantly reduce the computational efforts in estimating model parameters, which will facilitate kinetic modeling of genome-scale cellular metabolism in the future.
Collapse
Affiliation(s)
- Gengjie Jia
- Chemical and Pharmaceutical Engineering, Singapore-MIT Alliance, Singapore 117576, Singapore
| | | | | |
Collapse
|
45
|
Ensemble kinetic modeling of metabolic networks from dynamic metabolic profiles. Metabolites 2012; 2:891-912. [PMID: 24957767 PMCID: PMC3901226 DOI: 10.3390/metabo2040891] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2012] [Revised: 11/02/2012] [Accepted: 11/05/2012] [Indexed: 01/21/2023] Open
Abstract
Kinetic modeling of metabolic pathways has important applications in metabolic engineering, but significant challenges still remain. The difficulties faced vary from finding best-fit parameters in a highly multidimensional search space to incomplete parameter identifiability. To meet some of these challenges, an ensemble modeling method is developed for characterizing a subset of kinetic parameters that give statistically equivalent goodness-of-fit to time series concentration data. The method is based on the incremental identification approach, where the parameter estimation is done in a step-wise manner. Numerical efficacy is achieved by reducing the dimensionality of parameter space and using efficient random parameter exploration algorithms. The shift toward using model ensembles, instead of the traditional "best-fit" models, is necessary to directly account for model uncertainty during the application of such models. The performance of the ensemble modeling approach has been demonstrated in the modeling of a generic branched pathway and the trehalose pathway in Saccharomyces cerevisiae using generalized mass action (GMA) kinetics.
Collapse
|
46
|
|
47
|
Chou IC, Voit EO. Estimation of dynamic flux profiles from metabolic time series data. BMC SYSTEMS BIOLOGY 2012; 6:84. [PMID: 22776140 PMCID: PMC3495652 DOI: 10.1186/1752-0509-6-84] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/10/2012] [Accepted: 05/05/2012] [Indexed: 11/25/2022]
Abstract
Background Advances in modern high-throughput techniques of molecular biology have enabled top-down approaches for the estimation of parameter values in metabolic systems, based on time series data. Special among them is the recent method of dynamic flux estimation (DFE), which uses such data not only for parameter estimation but also for the identification of functional forms of the processes governing a metabolic system. DFE furthermore provides diagnostic tools for the evaluation of model validity and of the quality of a model fit beyond residual errors. Unfortunately, DFE works only when the data are more or less complete and the system contains as many independent fluxes as metabolites. These drawbacks may be ameliorated with other types of estimation and information. However, such supplementations incur their own limitations. In particular, assumptions must be made regarding the functional forms of some processes and detailed kinetic information must be available, in addition to the time series data. Results The authors propose here a systematic approach that supplements DFE and overcomes some of its shortcomings. Like DFE, the approach is model-free and requires only minimal assumptions. If sufficient time series data are available, the approach allows the determination of a subset of fluxes that enables the subsequent applicability of DFE to the rest of the flux system. The authors demonstrate the procedure with three artificial pathway systems exhibiting distinct characteristics and with actual data of the trehalose pathway in Saccharomyces cerevisiae. Conclusions The results demonstrate that the proposed method successfully complements DFE under various situations and without a priori assumptions regarding the model representation. The proposed method also permits an examination of whether at all, to what degree, or within what range the available time series data can be validly represented in a particular functional format of a flux within a pathway system. Based on these results, further experiments may be designed to generate data points that genuinely add new information to the structure identification and parameter estimation tasks at hand.
Collapse
Affiliation(s)
- I-Chun Chou
- Integrative BioSystems Institute and The Wallace H, Coulter Department of Biomedical Engineering, Georgia Institute of Technology and Emory University, 313 Ferst Drive, Atlanta, GA 30332, USA
| | | |
Collapse
|
48
|
Construction of a genome-scale kinetic model of mycobacterium tuberculosis using generic rate equations. Metabolites 2012; 2:382-97. [PMID: 24957639 PMCID: PMC3901218 DOI: 10.3390/metabo2030382] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2012] [Revised: 06/07/2012] [Accepted: 06/25/2012] [Indexed: 12/17/2022] Open
Abstract
The study of biological systems at the genome scale helps us understand fundamental biological processes that govern the activity of living organisms and regulate their interactions with the environment. Genome-scale metabolic models are usually analysed using constraint-based methods, since detailed rate equations and kinetic parameters are often missing. However, constraint-based analysis is limited in capturing the dynamics of cellular processes. In this paper, we present an approach to build a genome-scale kinetic model of Mycobacterium tuberculosis metabolism using generic rate equations. M. tuberculosis causes tuberculosis which remains one of the largest killer infectious diseases. Using a genetic algorithm, we estimated kinetic parameters for a genome-scale metabolic model of M. tuberculosis based on flux distributions derived from Flux Balance Analysis. Our results show that an excellent agreement with flux values is obtained under several growth conditions, although kinetic parameters may vary in different conditions. Parameter variability analysis indicates that a high degree of redundancy remains present in model parameters, which suggests that the integration of other types of high-throughput datasets will enable the development of better constrained models accounting for a variety of in vivo phenotypes.
Collapse
|
49
|
Yang X, Dent JE, Nardini C. An S-System Parameter Estimation Method (SPEM) for biological networks. J Comput Biol 2012; 19:175-87. [PMID: 22300319 DOI: 10.1089/cmb.2011.0269] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Advances in experimental biology, coupled with advances in computational power, bring new challenges to the interdisciplinary field of computational biology. One such broad challenge lies in the reverse engineering of gene networks, and goes from determining the structure of static networks, to reconstructing the dynamics of interactions from time series data. Here, we focus our attention on the latter area, and in particular, on parameterizing a dynamic network of oriented interactions between genes. By basing the parameterizing approach on a known power-law relationship model between connected genes (S-system), we are able to account for non-linearity in the network, without compromising the ability to analyze network characteristics. In this article, we introduce the S-System Parameter Estimation Method (SPEM). SPEM, a freely available R software package (http://www.picb.ac.cn/ClinicalGenomicNTW/temp3.html), takes gene expression data in time series and returns the network of interactions as a set of differential equations. The methods, which are presented and tested here, are shown to provide accurate results not only on synthetic data, but more importantly on real and therefore noisy by nature, biological data. In summary, SPEM shows high sensitivity and positive predicted values, as well as free availability and expansibility (because based on open source software). We expect these characteristics to make it a useful and broadly applicable software in the challenging reconstruction of dynamic gene networks.
Collapse
Affiliation(s)
- Xinyi Yang
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, PR China
| | | | | |
Collapse
|
50
|
Pozo C, Marín-Sanguino A, Alves R, Guillén-Gosálbez G, Jiménez L, Sorribas A. Steady-state global optimization of metabolic non-linear dynamic models through recasting into power-law canonical models. BMC SYSTEMS BIOLOGY 2011; 5:137. [PMID: 21867520 PMCID: PMC3201032 DOI: 10.1186/1752-0509-5-137] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/05/2011] [Accepted: 08/25/2011] [Indexed: 01/18/2023]
Abstract
Background Design of newly engineered microbial strains for biotechnological purposes would greatly benefit from the development of realistic mathematical models for the processes to be optimized. Such models can then be analyzed and, with the development and application of appropriate optimization techniques, one could identify the modifications that need to be made to the organism in order to achieve the desired biotechnological goal. As appropriate models to perform such an analysis are necessarily non-linear and typically non-convex, finding their global optimum is a challenging task. Canonical modeling techniques, such as Generalized Mass Action (GMA) models based on the power-law formalism, offer a possible solution to this problem because they have a mathematical structure that enables the development of specific algorithms for global optimization. Results Based on the GMA canonical representation, we have developed in previous works a highly efficient optimization algorithm and a set of related strategies for understanding the evolution of adaptive responses in cellular metabolism. Here, we explore the possibility of recasting kinetic non-linear models into an equivalent GMA model, so that global optimization on the recast GMA model can be performed. With this technique, optimization is greatly facilitated and the results are transposable to the original non-linear problem. This procedure is straightforward for a particular class of non-linear models known as Saturable and Cooperative (SC) models that extend the power-law formalism to deal with saturation and cooperativity. Conclusions Our results show that recasting non-linear kinetic models into GMA models is indeed an appropriate strategy that helps overcoming some of the numerical difficulties that arise during the global optimization task.
Collapse
Affiliation(s)
- Carlos Pozo
- Departament de Ciències Mèdiques Bàsiques, Institut de Recerca Biomèdica de Lleida (IRBLLEIDA), Universitat de Lleida, Spain
| | | | | | | | | | | |
Collapse
|