1
|
Thompson JC, Zavala VM, Venturelli OS. Integrating a tailored recurrent neural network with Bayesian experimental design to optimize microbial community functions. PLoS Comput Biol 2023; 19:e1011436. [PMID: 37773951 PMCID: PMC10540976 DOI: 10.1371/journal.pcbi.1011436] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Accepted: 08/16/2023] [Indexed: 10/01/2023] Open
Abstract
Microbiomes interact dynamically with their environment to perform exploitable functions such as production of valuable metabolites and degradation of toxic metabolites for a wide range of applications in human health, agriculture, and environmental cleanup. Developing computational models to predict the key bacterial species and environmental factors to build and optimize such functions are crucial to accelerate microbial community engineering. However, there is an unknown web of interactions that determine the highly complex and dynamic behavior of these systems, which precludes the development of models based on known mechanisms. By contrast, entirely data-driven machine learning models can produce physically unrealistic predictions and often require significant amounts of experimental data to learn system behavior. We develop a physically-constrained recurrent neural network that preserves model flexibility but is constrained to produce physically consistent predictions and show that it can outperform existing machine learning methods in the prediction of certain experimentally measured species abundance and metabolite concentrations. Further, we present a closed-loop, Bayesian experimental design algorithm to guide data collection by selecting experimental conditions that simultaneously maximize information gain and target microbial community functions. Using a bioreactor case study, we demonstrate how the proposed framework can be used to efficiently navigate a large design space to identify optimal operating conditions. The proposed methodology offers a flexible machine learning approach specifically tailored to optimize microbiome target functions through the sequential design of informative experiments that seek to explore and exploit community functions.
Collapse
Affiliation(s)
- Jaron C. Thompson
- Department of Chemical and Biological Engineering, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
- Department of Biochemistry, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
| | - Victor M. Zavala
- Department of Chemical and Biological Engineering, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
| | - Ophelia S. Venturelli
- Department of Chemical and Biological Engineering, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
- Department of Biochemistry, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
- Department of Bacteriology, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
| |
Collapse
|
2
|
Vo HD, Forero-Quintero LS, Aguilera LU, Munsky B. Analysis and design of single-cell experiments to harvest fluctuation information while rejecting measurement noise. Front Cell Dev Biol 2023; 11:1133994. [PMID: 37305680 PMCID: PMC10250612 DOI: 10.3389/fcell.2023.1133994] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2022] [Accepted: 05/10/2023] [Indexed: 06/13/2023] Open
Abstract
Introduction: Despite continued technological improvements, measurement errors always reduce or distort the information that any real experiment can provide to quantify cellular dynamics. This problem is particularly serious for cell signaling studies to quantify heterogeneity in single-cell gene regulation, where important RNA and protein copy numbers are themselves subject to the inherently random fluctuations of biochemical reactions. Until now, it has not been clear how measurement noise should be managed in addition to other experiment design variables (e.g., sampling size, measurement times, or perturbation levels) to ensure that collected data will provide useful insights on signaling or gene expression mechanisms of interest. Methods: We propose a computational framework that takes explicit consideration of measurement errors to analyze single-cell observations, and we derive Fisher Information Matrix (FIM)-based criteria to quantify the information value of distorted experiments. Results and Discussion: We apply this framework to analyze multiple models in the context of simulated and experimental single-cell data for a reporter gene controlled by an HIV promoter. We show that the proposed approach quantitatively predicts how different types of measurement distortions affect the accuracy and precision of model identification, and we demonstrate that the effects of these distortions can be mitigated through explicit consideration during model inference. We conclude that this reformulation of the FIM could be used effectively to design single-cell experiments to optimally harvest fluctuation information while mitigating the effects of image distortion.
Collapse
Affiliation(s)
- Huy D. Vo
- Department of Chemical and Biological Engineering, Colorado State University, Fort Collins, CO, United States
| | - Linda S. Forero-Quintero
- Department of Chemical and Biological Engineering, Colorado State University, Fort Collins, CO, United States
| | - Luis U. Aguilera
- Department of Chemical and Biological Engineering, Colorado State University, Fort Collins, CO, United States
| | - Brian Munsky
- Department of Chemical and Biological Engineering, Colorado State University, Fort Collins, CO, United States
- School of Biomedical Engineering, Colorado State University, Fort Collins, CO, United States
| |
Collapse
|
3
|
Wang J, Dowling AW. Pyomo.
DOE
: An
Open‐Source
Package for
Model‐Based
Design of Experiments in Python. AIChE J 2022. [DOI: 10.1002/aic.17813] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Affiliation(s)
- Jialu Wang
- Department of Chemical and Biomolecular Engineering University of Notre Dame Notre Dame IN
| | - Alexander W. Dowling
- Department of Chemical and Biomolecular Engineering University of Notre Dame Notre Dame IN
| |
Collapse
|
4
|
Mester R, Landeros A, Rackauckas C, Lange K. Differential methods for assessing sensitivity in biological models. PLoS Comput Biol 2022; 18:e1009598. [PMID: 35696417 PMCID: PMC9232177 DOI: 10.1371/journal.pcbi.1009598] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2021] [Revised: 06/24/2022] [Accepted: 05/12/2022] [Indexed: 11/18/2022] Open
Abstract
Differential sensitivity analysis is indispensable in fitting parameters, understanding uncertainty, and forecasting the results of both thought and lab experiments. Although there are many methods currently available for performing differential sensitivity analysis of biological models, it can be difficult to determine which method is best suited for a particular model. In this paper, we explain a variety of differential sensitivity methods and assess their value in some typical biological models. First, we explain the mathematical basis for three numerical methods: adjoint sensitivity analysis, complex perturbation sensitivity analysis, and forward mode sensitivity analysis. We then carry out four instructive case studies. (a) The CARRGO model for tumor-immune interaction highlights the additional information that differential sensitivity analysis provides beyond traditional naive sensitivity methods, (b) the deterministic SIR model demonstrates the value of using second-order sensitivity in refining model predictions, (c) the stochastic SIR model shows how differential sensitivity can be attacked in stochastic modeling, and (d) a discrete birth-death-migration model illustrates how the complex perturbation method of differential sensitivity can be generalized to a broader range of biological models. Finally, we compare the speed, accuracy, and ease of use of these methods. We find that forward mode automatic differentiation has the quickest computational time, while the complex perturbation method is the simplest to implement and the most generalizable.
Collapse
Affiliation(s)
- Rachel Mester
- Department of Computational Medicine, University of California Los Angeles, Los Angeles, California, United States of America
- * E-mail:
| | - Alfonso Landeros
- Department of Computational Medicine, University of California Los Angeles, Los Angeles, California, United States of America
| | - Chris Rackauckas
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Pumas-AI, Annapolis, Maryland, United States of America
- Julia Computing, Cambridge, Massachusetts, United States of America
| | - Kenneth Lange
- Department of Computational Medicine, University of California Los Angeles, Los Angeles, California, United States of America
- Department of Human Genetics, University of California Los Angeles, Los Angeles, California, United States of America
- Department of Statistics, University of California Los Angeles, Los Angeles, California, United States of America
| |
Collapse
|
5
|
Ghosh D, Mandal C. Clustering Based Parameter Estimation of Thyroid Hormone Pathway. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:343-354. [PMID: 32750849 DOI: 10.1109/tcbb.2020.2995589] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
An ordinary differential equation (ODE) model of the working of the thyroid system for euthyroidism has been presented. As clinical data for thyroid hormones is relatively scarce, such modelling offers potential benefits over wet lab procedures. Genetic algorithms developed for determining of parameters of the ODE system using the available data have been presented and evaluated. This approach enables subject specific parameter estimation towards characterisation of individual thyroid operation. Initially, a simple steady state model was used. Later a cosinor model for the circadian variation of thyroid hormones was used to obtain more reliable results, as indicated through sensitivity analysis in conjunction with other statistical methods. Our parameter determination method has been tested on groups of patients with similar observed values of thyroid stimulating hormone (TSH), free T 3 and free T 4 (identified through clustering) to determine their parameter values jointly. This approach appears to produce parameter sets with lower variation than parameters determined independently, thus leading to better parameter determination.
Collapse
|
6
|
Mikhalychev A, Zhevno K, Vlasenko S, Benediktovitch A, Ulyanenkova T, Ulyanenkov A. Fisher information for optimal planning of X-ray diffraction experiments. J Appl Crystallogr 2021. [DOI: 10.1107/s1600576721009869] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Fisher information is a powerful mathematical tool suitable for quantification of data `informativity' and optimization of the experimental setup and measurement conditions. Here, it is applied to X-ray diffraction and an informational approach to choosing the optimal measurement configuration is proposed. The core idea is maximization of the information which can be extracted from the measured data set by the selected analysis technique, over the sets of accessible reflections and measurement geometries. The developed approach is applied to high-resolution X-ray diffraction measurements and microstructure analysis of multilayer samples, and its efficiency and consistency are demonstrated with the results of more straightforward Monte Carlo simulations.
Collapse
|
7
|
Pal K, Szilagyi B, Burcham CL, Jarmer DJ, Nagy ZK. Iterative model‐based experimental design for spherical agglomeration processes. AIChE J 2021. [DOI: 10.1002/aic.17178] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Affiliation(s)
- Kanjakha Pal
- Davidson School of Chemical Engineering Purdue University West Lafayette Indiana USA
| | - Botond Szilagyi
- Davidson School of Chemical Engineering Purdue University West Lafayette Indiana USA
| | | | - Daniel J. Jarmer
- Eli Lilly and Company Lilly Technology Center Indianapolis Indiana USA
| | - Zoltan K. Nagy
- Davidson School of Chemical Engineering Purdue University West Lafayette Indiana USA
- Department of Chemical Engineering Loughborough University Loughborough UK
| |
Collapse
|
8
|
Lee D, Jayaraman A, Kwon JS. Development of a hybrid model for a partially known intracellular signaling pathway through correction term estimation and neural network modeling. PLoS Comput Biol 2020; 16:e1008472. [PMID: 33315899 PMCID: PMC7769624 DOI: 10.1371/journal.pcbi.1008472] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2020] [Revised: 12/28/2020] [Accepted: 10/26/2020] [Indexed: 12/30/2022] Open
Abstract
Developing an accurate first-principle model is an important step in employing systems biology approaches to analyze an intracellular signaling pathway. However, an accurate first-principle model is difficult to be developed since it requires in-depth mechanistic understandings of the signaling pathway. Since underlying mechanisms such as the reaction network structure are not fully understood, significant discrepancy exists between predicted and actual signaling dynamics. Motivated by these considerations, this work proposes a hybrid modeling approach that combines a first-principle model and an artificial neural network (ANN) model so that predictions of the hybrid model surpass those of the original model. First, the proposed approach determines an optimal subset of model states whose dynamics should be corrected by the ANN by examining the correlation between each state and outputs through relative order. Second, an L2-regularized least-squares problem is solved to infer values of the correction terms that are necessary to minimize the discrepancy between the model predictions and available measurements. Third, an ANN is developed to generalize relationships between the values of the correction terms and the system dynamics. Lastly, the original first-principle model is coupled with the developed ANN to finalize the hybrid model development so that the model will possess generalized prediction capabilities while retaining the model interpretability. We have successfully validated the proposed methodology with two case studies, simplified apoptosis and lipopolysaccharide-induced NFκB signaling pathways, to develop hybrid models with in silico and in vitro measurements, respectively. An intracellular signaling pathway is often represented by a set of nonlinear ordinary differential equations, which translate our current knowledge about the signaling pathway into a testable mathematical model. However, predictions from such models are often subject to high uncertainty since many signaling pathways are only partially known beforehand. In this study, we propose a systematic approach to develop a hybrid model to improve model accuracy by combining machine learning and the first-principle modeling. Specifically, model correction terms are learned from discrepancy between model predictions and measurements, and these terms are added to the first-principle model to enhance the prediction accuracy. Once these correction terms are learned from the data, an artificial neural network (ANN) model is developed to find an empirical relation between the model and the correction terms so that the developed ANN can be used to posses improved predictive capabilities even in new operating conditions (i.e., generalizability). The final hybrid model is then constructed by coupling the first-principle model with the developed ANN.
Collapse
Affiliation(s)
- Dongheon Lee
- Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, Texas, USA
- Texas A&M Energy Institute, Texas A&M University, College Station, Texas, USA
| | - Arul Jayaraman
- Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, Texas, USA
- Department of Biomedical Engineering, Texas A&M University, College Station, Texas, USA
| | - Joseph S. Kwon
- Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, Texas, USA
- Texas A&M Energy Institute, Texas A&M University, College Station, Texas, USA
- * E-mail:
| |
Collapse
|
9
|
Application of Systems Engineering Principles and Techniques in Biological Big Data Analytics: A Review. Processes (Basel) 2020. [DOI: 10.3390/pr8080951] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
In the past few decades, we have witnessed tremendous advancements in biology, life sciences and healthcare. These advancements are due in no small part to the big data made available by various high-throughput technologies, the ever-advancing computing power, and the algorithmic advancements in machine learning. Specifically, big data analytics such as statistical and machine learning has become an essential tool in these rapidly developing fields. As a result, the subject has drawn increased attention and many review papers have been published in just the past few years on the subject. Different from all existing reviews, this work focuses on the application of systems, engineering principles and techniques in addressing some of the common challenges in big data analytics for biological, biomedical and healthcare applications. Specifically, this review focuses on the following three key areas in biological big data analytics where systems engineering principles and techniques have been playing important roles: the principle of parsimony in addressing overfitting, the dynamic analysis of biological data, and the role of domain knowledge in biological data analytics.
Collapse
|
10
|
Srinivasan S, Cluett WR, Mahadevan R. A scalable method for parameter identification in kinetic models of metabolism using steady-state data. Bioinformatics 2020; 35:5216-5225. [PMID: 31197317 DOI: 10.1093/bioinformatics/btz445] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2018] [Revised: 04/26/2019] [Accepted: 06/05/2019] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION In kinetic models of metabolism, the parameter values determine the dynamic behaviour predicted by these models. Estimating parameters from in vivo experimental data require the parameters to be structurally identifiable, and the data to be informative enough to estimate these parameters. Existing methods to determine the structural identifiability of parameters in kinetic models of metabolism can only be applied to models of small metabolic networks due to their computational complexity. Additionally, a priori experimental design, a necessity to obtain informative data for parameter estimation, also does not account for using steady-state data to estimate parameters in kinetic models. RESULTS Here, we present a scalable methodology to structurally identify parameters for each flux in a kinetic model of metabolism based on the availability of steady-state data. In doing so, we also address the issue of determining the number and nature of experiments for generating steady-state data to estimate these parameters. By using a small metabolic network as an example, we show that most parameters in fluxes expressed by mechanistic enzyme kinetic rate laws can be identified using steady-state data, and the steady-state data required for their estimation can be obtained from selective experiments involving both substrate and enzyme level perturbations. The methodology can be used in combination with other identifiability and experimental design algorithms that use dynamic data to determine the most informative experiments requiring the least resources to perform. AVAILABILITY AND IMPLEMENTATION https://github.com/LMSE/ident. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Shyam Srinivasan
- Department of Chemical Engineering and Applied Chemistry, 200 College Street, University of Toronto, Toronto, ON, M5S3E5, Canada
| | - William R Cluett
- Department of Chemical Engineering and Applied Chemistry, 200 College Street, University of Toronto, Toronto, ON, M5S3E5, Canada
| | - Radhakrishnan Mahadevan
- Department of Chemical Engineering and Applied Chemistry, 200 College Street, University of Toronto, Toronto, ON, M5S3E5, Canada.,Institute of Biomaterials and Biomedical Engineering, 164 College Street, University of Toronto, Toronto, ON, M5S 3G9, Canada
| |
Collapse
|
11
|
Lee D, Jayaraman A, Sang-Il Kwon J. Identification of a time-varying intracellular signalling model through data clustering and parameter selection: application to NF-[inline-formula removed]B signalling pathway induced by LPS in the presence of BFA. IET Syst Biol 2019; 13:169-179. [PMID: 31318334 PMCID: PMC8687386 DOI: 10.1049/iet-syb.2018.5079] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2018] [Revised: 02/07/2019] [Accepted: 02/14/2019] [Indexed: 01/02/2023] Open
Abstract
Developing a model for a signalling pathway requires several iterations of experimentation and model refinement to obtain an accurate model. However, the implementation of such an approach to model a signalling pathway induced by a poorly-known stimulus can become labour intensive because only limited information on the pathway is available beforehand to formulate an initial model. Therefore, a large number of iterations are required since the initial model is likely to be erroneous. In this work, a numerical scheme is proposed to construct a time-varying model for a signalling pathway induced by a poorly-known stimulus when its nominal model is available in the literature. Here, the nominal model refers to one that describes the signalling dynamics under a well-characterised stimulus. First, global sensitivity analysis is implemented on the nominal model to identify the most important parameters, which are assumed to be piecewise constants. Second, measurement data are clustered to determine temporal subdomains where the parameters take different values. Finally, a least-squares problem is solved to estimate the parameter values in each temporal subdomain. The effectiveness of this approach is illustrated by developing a time-varying model for NF-[inline-formula removed]B signalling dynamics induced by lipopolysaccharide in the presence of brefeldin A.
Collapse
Affiliation(s)
- Dongheon Lee
- Texas A&M Energy Institute, Texas A&M University, College Station, TX 77843, USA
| | - Arul Jayaraman
- Department of Biomedical Engineering, Texas A&M University, College Station, TX 77843, USA
| | - Joseph Sang-Il Kwon
- Texas A&M Energy Institute, Texas A&M University, College Station, TX 77843, USA.
| |
Collapse
|
12
|
Pitt JA, Banga JR. Parameter estimation in models of biological oscillators: an automated regularised estimation approach. BMC Bioinformatics 2019; 20:82. [PMID: 30770736 PMCID: PMC6377730 DOI: 10.1186/s12859-019-2630-y] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2018] [Accepted: 01/14/2019] [Indexed: 12/24/2022] Open
Abstract
BACKGROUND Dynamic modelling is a core element in the systems biology approach to understanding complex biosystems. Here, we consider the problem of parameter estimation in models of biological oscillators described by deterministic nonlinear differential equations. These problems can be extremely challenging due to several common pitfalls: (i) a lack of prior knowledge about parameters (i.e. massive search spaces), (ii) convergence to local optima (due to multimodality of the cost function), (iii) overfitting (fitting the noise instead of the signal) and (iv) a lack of identifiability. As a consequence, the use of standard estimation methods (such as gradient-based local ones) will often result in wrong solutions. Overfitting can be particularly problematic, since it produces very good calibrations, giving the impression of an excellent result. However, overfitted models exhibit poor predictive power. Here, we present a novel automated approach to overcome these pitfalls. Its workflow makes use of two sequential optimisation steps incorporating three key algorithms: (1) sampling strategies to systematically tighten the parameter bounds reducing the search space, (2) efficient global optimisation to avoid convergence to local solutions, (3) an advanced regularisation technique to fight overfitting. In addition, this workflow incorporates tests for structural and practical identifiability. RESULTS We successfully evaluate this novel approach considering four difficult case studies regarding the calibration of well-known biological oscillators (Goodwin, FitzHugh-Nagumo, Repressilator and a metabolic oscillator). In contrast, we show how local gradient-based approaches, even if used in multi-start fashion, are unable to avoid the above-mentioned pitfalls. CONCLUSIONS Our approach results in more efficient estimations (thanks to the bounding strategy) which are able to escape convergence to local optima (thanks to the global optimisation approach). Further, the use of regularisation allows us to avoid overfitting, resulting in more generalisable calibrated models (i.e. models with greater predictive power).
Collapse
Affiliation(s)
- Jake Alan Pitt
- (Bio)Process Engineering Group, IIM-CSIC, Eduardo Cabello 6, Vigo, 36208 Spain
- RWTH Aachen University, Faculty of Medicine, Joint Research Centre for Computational Biomedicine (JRC-COMBINE), Aachen, Germany
| | - Julio R. Banga
- (Bio)Process Engineering Group, IIM-CSIC, Eduardo Cabello 6, Vigo, 36208 Spain
| |
Collapse
|
13
|
Multi-Objective Optimization of Experiments Using Curvature and Fisher Information Matrix. Processes (Basel) 2017. [DOI: 10.3390/pr5040063] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
|
14
|
A parallel metaheuristic for large mixed-integer dynamic optimization problems, with applications in computational biology. PLoS One 2017; 12:e0182186. [PMID: 28813442 PMCID: PMC5557587 DOI: 10.1371/journal.pone.0182186] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2017] [Accepted: 07/13/2017] [Indexed: 11/24/2022] Open
Abstract
Background We consider a general class of global optimization problems dealing with nonlinear dynamic models. Although this class is relevant to many areas of science and engineering, here we are interested in applying this framework to the reverse engineering problem in computational systems biology, which yields very large mixed-integer dynamic optimization (MIDO) problems. In particular, we consider the framework of logic-based ordinary differential equations (ODEs). Methods We present saCeSS2, a parallel method for the solution of this class of problems. This method is based on an parallel cooperative scatter search metaheuristic, with new mechanisms of self-adaptation and specific extensions to handle large mixed-integer problems. We have paid special attention to the avoidance of convergence stagnation using adaptive cooperation strategies tailored to this class of problems. Results We illustrate its performance with a set of three very challenging case studies from the domain of dynamic modelling of cell signaling. The simpler case study considers a synthetic signaling pathway and has 84 continuous and 34 binary decision variables. A second case study considers the dynamic modeling of signaling in liver cancer using high-throughput data, and has 135 continuous and 109 binaries decision variables. The third case study is an extremely difficult problem related with breast cancer, involving 690 continuous and 138 binary decision variables. We report computational results obtained in different infrastructures, including a local cluster, a large supercomputer and a public cloud platform. Interestingly, the results show how the cooperation of individual parallel searches modifies the systemic properties of the sequential algorithm, achieving superlinear speedups compared to an individual search (e.g. speedups of 15 with 10 cores), and significantly improving (above a 60%) the performance with respect to a non-cooperative parallel scheme. The scalability of the method is also good (tests were performed using up to 300 cores). Conclusions These results demonstrate that saCeSS2 can be used to successfully reverse engineer large dynamic models of complex biological pathways. Further, these results open up new possibilities for other MIDO-based large-scale applications in the life sciences such as metabolic engineering, synthetic biology, drug scheduling.
Collapse
|
15
|
Malwade A, Nguyen A, Sadat-Mousavi P, Ingalls BP. Predictive Modeling of a Batch Filter Mating Process. Front Microbiol 2017; 8:461. [PMID: 28377756 PMCID: PMC5359259 DOI: 10.3389/fmicb.2017.00461] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2016] [Accepted: 03/06/2017] [Indexed: 01/21/2023] Open
Abstract
Quantitative characterizations of horizontal gene transfer are needed to accurately describe gene transfer processes in natural and engineered systems. A number of approaches to the quantitative description of plasmid conjugation have appeared in the literature. In this study, we seek to extend that work, motivated by the question of whether a mathematical model can accurately predict growth and conjugation dynamics in a batch process. We used flow cytometry to make time-point observations of a filter-associated mating between two E. coli strains, and fit ordinary differential equation models to the data. A model comparison analysis identified the model formulation that is best supported by the data. Identifiability analysis revealed that the parameters were estimated with acceptable accuracy. The predictive power of the model was assessed by comparison with test data that demanded extrapolation from the training experiments. This study represents the first attempt to assess the quality of model predictions for plasmid conjugation. Our successful application of this approach lays a foundation for predictive modeling that can be used both in the study of natural plasmid transmission and in model-based design of engineering approaches that employ conjugation, such as plasmid-mediated bioaugmentation.
Collapse
Affiliation(s)
- Akshay Malwade
- Department of Applied Mathematics, University of Waterloo Waterloo, ON, Canada
| | - Angel Nguyen
- Department of Applied Mathematics, University of Waterloo Waterloo, ON, Canada
| | | | - Brian P Ingalls
- Department of Applied Mathematics, University of Waterloo Waterloo, ON, Canada
| |
Collapse
|
16
|
Penas DR, González P, Egea JA, Doallo R, Banga JR. Parameter estimation in large-scale systems biology models: a parallel and self-adaptive cooperative strategy. BMC Bioinformatics 2017; 18:52. [PMID: 28109249 PMCID: PMC5251293 DOI: 10.1186/s12859-016-1452-4] [Citation(s) in RCA: 52] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2016] [Accepted: 12/24/2016] [Indexed: 12/02/2022] Open
Abstract
Background The development of large-scale kinetic models is one of the current key issues in computational systems biology and bioinformatics. Here we consider the problem of parameter estimation in nonlinear dynamic models. Global optimization methods can be used to solve this type of problems but the associated computational cost is very large. Moreover, many of these methods need the tuning of a number of adjustable search parameters, requiring a number of initial exploratory runs and therefore further increasing the computation times. Here we present a novel parallel method, self-adaptive cooperative enhanced scatter search (saCeSS), to accelerate the solution of this class of problems. The method is based on the scatter search optimization metaheuristic and incorporates several key new mechanisms: (i) asynchronous cooperation between parallel processes, (ii) coarse and fine-grained parallelism, and (iii) self-tuning strategies. Results The performance and robustness of saCeSS is illustrated by solving a set of challenging parameter estimation problems, including medium and large-scale kinetic models of the bacterium E. coli, bakerés yeast S. cerevisiae, the vinegar fly D. melanogaster, Chinese Hamster Ovary cells, and a generic signal transduction network. The results consistently show that saCeSS is a robust and efficient method, allowing very significant reduction of computation times with respect to several previous state of the art methods (from days to minutes, in several cases) even when only a small number of processors is used. Conclusions The new parallel cooperative method presented here allows the solution of medium and large scale parameter estimation problems in reasonable computation times and with small hardware requirements. Further, the method includes self-tuning mechanisms which facilitate its use by non-experts. We believe that this new method can play a key role in the development of large-scale and even whole-cell dynamic models. Electronic supplementary material The online version of this article (doi:10.1186/s12859-016-1452-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- David R Penas
- BioProcess Engineering Group, IIM-CSIC, Eduardo Cabello 6, Vigo, 36208, Spain
| | - Patricia González
- Computer Architecture Group, Universidade da Coruña, Campus de Elviña s/n, Coruña, 15071 A, Spain
| | - Jose A Egea
- Department of Applied Mathematics and Statistics, Universidad Politécnica de Cartagena, c/ Dr. Fleming s/n, Cartagena, 30202, Spain
| | - Ramón Doallo
- Computer Architecture Group, Universidade da Coruña, Campus de Elviña s/n, Coruña, 15071 A, Spain
| | - Julio R Banga
- BioProcess Engineering Group, IIM-CSIC, Eduardo Cabello 6, Vigo, 36208, Spain.
| |
Collapse
|
17
|
Structured model and parameter estimation in plant cell cultures of Thevetia peruviana. Bioprocess Biosyst Eng 2016; 40:573-587. [PMID: 27987091 DOI: 10.1007/s00449-016-1722-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2016] [Accepted: 12/06/2016] [Indexed: 10/20/2022]
Abstract
In this work, a mechanistic model for predicting the dynamic behavior of extracellular and intracellular nutrients, biomass production, and the main metabolites involved in the central carbon metabolism in plant cell cultures of Thevetia peruviana is presented. The proposed model is the first mechanistic model implemented for plant cell cultures of this species, and includes 28 metabolites, 33 metabolic reactions, and 61 parameters. Given the over-parametrization of the model, its nonlinear nature and the strong correlation among the effects of the parameters, a parameter estimation routine based on identifiability analysis was implemented. This routine reduces the parameter's search space by selecting the most sensitive and linearly independent parameters. Results have shown that only 19 parameters are identifiable. Finally, the model was used for analyzing the fluxes distribution in plant cell cultures of T. peruviana. This analysis shows high uptake of phosphates and parallel uptake of glucose and fructose. Furthermore, it has pointed out the main central carbon metabolism routes for promoting biomass production in this cell culture.
Collapse
|
18
|
Villiger TK, Scibona E, Stettler M, Broly H, Morbidelli M, Soos M. Controlling the time evolution of mAb N-linked glycosylation - Part II: Model-based predictions. Biotechnol Prog 2016; 32:1135-1148. [PMID: 27273889 DOI: 10.1002/btpr.2315] [Citation(s) in RCA: 43] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2015] [Revised: 03/24/2016] [Indexed: 01/04/2023]
Abstract
N-linked glycosylation is known to be a crucial factor for the therapeutic efficacy and safety of monoclonal antibodies (mAbs) and many other glycoproteins. The nontemplate process of glycosylation is influenced by external factors which have to be tightly controlled during the manufacturing process. In order to describe and predict mAb N-linked glycosylation patterns in a CHO-S cell fed-batch process, an existing dynamic mathematical model has been refined and coupled to an unstructured metabolic model. High-throughput cell culture experiments carried out in miniaturized bioreactors in combination with intracellular measurements of nucleotide sugars were used to tune the parameter configuration of the coupled models as a function of extracellular pH, manganese and galactose addition. The proposed modeling framework is able to predict the time evolution of N-linked glycosylation patterns during a fed-batch process as a function of time as well as the manipulated variables. A constant and varying mAb N-linked glycosylation pattern throughout the culture were chosen to demonstrate the predictive capability of the modeling framework, which is able to quantify the interconnected influence of media components and cell culture conditions. Such a model-based evaluation of feeding regimes using high-throughput tools and mathematical models gives rise to a more rational way to control and design cell culture processes with defined glycosylation patterns. © 2016 American Institute of Chemical Engineers Biotechnol. Prog., 32:1135-1148, 2016.
Collapse
Affiliation(s)
- Thomas K Villiger
- Dept. of Chemistry and Applied Biosciences, Inst. for Chemical and Bioengineering, ETH Zurich, Zurich, Switzerland
| | - Ernesto Scibona
- Dept. of Chemistry and Applied Biosciences, Inst. for Chemical and Bioengineering, ETH Zurich, Zurich, Switzerland
| | - Matthieu Stettler
- Biotech Process Sciences, Merck-Serono S.A., Corsier-sur-Vevey, 1809, Switzerland
| | - Hervé Broly
- Biotech Process Sciences, Merck-Serono S.A., Corsier-sur-Vevey, 1809, Switzerland
| | - Massimo Morbidelli
- Dept. of Chemistry and Applied Biosciences, Inst. for Chemical and Bioengineering, ETH Zurich, Zurich, Switzerland
| | - Miroslav Soos
- Dept. of Chemical Engineering, University of Chemistry and Technology, Prague, Czech Republic.
| |
Collapse
|
19
|
Efficient Optimization of Stimuli for Model-Based Design of Experiments to Resolve Dynamical Uncertainty. PLoS Comput Biol 2015; 11:e1004488. [PMID: 26379275 PMCID: PMC4574939 DOI: 10.1371/journal.pcbi.1004488] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2014] [Accepted: 08/05/2015] [Indexed: 11/19/2022] Open
Abstract
This model-based design of experiments (MBDOE) method determines the input magnitudes of an experimental stimuli to apply and the associated measurements that should be taken to optimally constrain the uncertain dynamics of a biological system under study. The ideal global solution for this experiment design problem is generally computationally intractable because of parametric uncertainties in the mathematical model of the biological system. Others have addressed this issue by limiting the solution to a local estimate of the model parameters. Here we present an approach that is independent of the local parameter constraint. This approach is made computationally efficient and tractable by the use of: (1) sparse grid interpolation that approximates the biological system dynamics, (2) representative parameters that uniformly represent the data-consistent dynamical space, and (3) probability weights of the represented experimentally distinguishable dynamics. Our approach identifies data-consistent representative parameters using sparse grid interpolants, constructs the optimal input sequence from a greedy search, and defines the associated optimal measurements using a scenario tree. We explore the optimality of this MBDOE algorithm using a 3-dimensional Hes1 model and a 19-dimensional T-cell receptor model. The 19-dimensional T-cell model also demonstrates the MBDOE algorithm’s scalability to higher dimensions. In both cases, the dynamical uncertainty region that bounds the trajectories of the target system states were reduced by as much as 86% and 99% respectively after completing the designed experiments in silico. Our results suggest that for resolving dynamical uncertainty, the ability to design an input sequence paired with its associated measurements is particularly important when limited by the number of measurements. Many mathematical models that have been developed for biological systems are limited because the complex systems are not well understood, the parameters are not known, and available data is limited and noisy. On the other hand, experiments to support model development are limited in terms of costs and time, feasible inputs and feasible measurements. MBDOE combines the mathematical models with experiment design to strategically design optimal experiments to obtain data that will contribute to the understanding of the systems. Our approach extends current capabilities of existing MBDOE techniques to make them more useful for scientists to resolve the trajectories of the system under study. It identifies the optimal conditions for stimuli and measurements that yield the most information about the system given the practical limitations. Exploration of the input space is not a trivial extension to MBDOE methods used for determining optimal measurements due to the nonlinear nature of many biological system models. The exploration of the system dynamics elicited by different inputs requires a computationally efficient and tractable approach. Our approach plans optimal experiments to reduce dynamical uncertainty in the output of selected target states of the biological system.
Collapse
|
20
|
Difference kernel iterative method for linear and nonlinear partial differential equations. Neural Comput Appl 2015. [DOI: 10.1007/s00521-015-1886-z] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
21
|
Dynamic Modeling of Cell-Free Biochemical Networks Using Effective Kinetic Models. Processes (Basel) 2015. [DOI: 10.3390/pr3010138] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
|
22
|
BioPreDyn-bench: a suite of benchmark problems for dynamic modelling in systems biology. BMC SYSTEMS BIOLOGY 2015; 9:8. [PMID: 25880925 PMCID: PMC4342829 DOI: 10.1186/s12918-015-0144-4] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/21/2014] [Accepted: 01/15/2015] [Indexed: 11/21/2022]
Abstract
Background Dynamic modelling is one of the cornerstones of systems biology. Many research efforts are currently being invested in the development and exploitation of large-scale kinetic models. The associated problems of parameter estimation (model calibration) and optimal experimental design are particularly challenging. The community has already developed many methods and software packages which aim to facilitate these tasks. However, there is a lack of suitable benchmark problems which allow a fair and systematic evaluation and comparison of these contributions. Results Here we present BioPreDyn-bench, a set of challenging parameter estimation problems which aspire to serve as reference test cases in this area. This set comprises six problems including medium and large-scale kinetic models of the bacterium E. coli, baker’s yeast S. cerevisiae, the vinegar fly D. melanogaster, Chinese Hamster Ovary cells, and a generic signal transduction network. The level of description includes metabolism, transcription, signal transduction, and development. For each problem we provide (i) a basic description and formulation, (ii) implementations ready-to-run in several formats, (iii) computational results obtained with specific solvers, (iv) a basic analysis and interpretation. Conclusions This suite of benchmark problems can be readily used to evaluate and compare parameter estimation methods. Further, it can also be used to build test problems for sensitivity and identifiability analysis, model reduction and optimal experimental design methods. The suite, including codes and documentation, can be freely downloaded from the BioPreDyn-bench website, https://sites.google.com/site/biopredynbenchmarks/. Electronic supplementary material The online version of this article (doi:10.1186/s12918-015-0144-4) contains supplementary material, which is available to authorized users.
Collapse
|
23
|
Akabua E, Inanc T, Gaweda A, Brier ME, Kim S, Zurada JM. Individualized model discovery: the case of anemia patients. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2015; 118:23-33. [PMID: 25459523 DOI: 10.1016/j.cmpb.2014.10.001] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/03/2013] [Revised: 09/20/2014] [Accepted: 10/01/2014] [Indexed: 06/04/2023]
Abstract
The universal sequel to chronic kidney condition (CKD) is anemia. Patients of anemia have kidneys that are incapable of performing certain basic functions such as sensing of oxygen levels to secrete erythropoietin when red blood cell counts are low. Under such conditions, external administration of human recombinant erythropoietin (EPO) is administered as alternative to improve conditions of CKD patients by increasing their hemoglobin (Hb) levels to a given therapeutic range. Presently, EPO dosing strategies extensively depend on packet inserts and on "average" responses to the medication from previous patients. Clearly dosage strategies based on these approaches are, at best, nonoptimal to EPO medication and potentially dangerous to patients that do not adhere to the notion of expected "average" response. In this work, a technique called semi-blind robust identification is provided to uniquely identify models of the individual patients of anemia based on their actual Hb responses and EPO administration. Using the a priori information and the measured input-output data of the individual patients, the procedure identifies a unique model consisting of a nominal model and the associated model uncertainty for the patients. By incorporating the effects of unknown system initial conditions, considerably small measurement samples can be used in the modeling process.
Collapse
Affiliation(s)
- Elom Akabua
- Department of Electrical and Computer Engineering, University of Louisville, Louisville, KY 40292, United States.
| | - Tamer Inanc
- Department of Electrical and Computer Engineering, University of Louisville, Louisville, KY 40292, United States
| | - Adam Gaweda
- Department of Electrical and Computer Engineering, University of Louisville, Louisville, KY 40292, United States; Department of Medicine, University of Louisville, Louisville, KY 40292, United States
| | - Michael E Brier
- Department of Medicine, University of Louisville, Louisville, KY 40292, United States; Department of Veteran Affairs, University of Louisville, Louisville, KY 40292, United States
| | - Seongho Kim
- Biostatistics Core, Karmanos Cancer Institute, Wayne State University, Detroit, MI 48201, United States
| | - Jacek M Zurada
- Department of Electrical and Computer Engineering, University of Louisville, Louisville, KY 40292, United States; Spoleczna Akademia Nauk, 90-011 Lodz, Poland
| |
Collapse
|
24
|
Wu Q, Smith-Miles K, Tian T. Approximate Bayesian computation schemes for parameter inference of discrete stochastic models using simulated likelihood density. BMC Bioinformatics 2014; 15 Suppl 12:S3. [PMID: 25473744 PMCID: PMC4243104 DOI: 10.1186/1471-2105-15-s12-s3] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Mathematical modeling is an important tool in systems biology to study the dynamic property of complex biological systems. However, one of the major challenges in systems biology is how to infer unknown parameters in mathematical models based on the experimental data sets, in particular, when the data are sparse and the regulatory network is stochastic. RESULTS To address this issue, this work proposed a new algorithm to estimate parameters in stochastic models using simulated likelihood density in the framework of approximate Bayesian computation. Two stochastic models were used to demonstrate the efficiency and effectiveness of the proposed method. In addition, we designed another algorithm based on a novel objective function to measure the accuracy of stochastic simulations. CONCLUSIONS Simulation results suggest that the usage of simulated likelihood density improves the accuracy of estimates substantially. When the error is measured at each observation time point individually, the estimated parameters have better accuracy than those obtained by a published method in which the error is measured using simulations over the entire observation time period.
Collapse
|
25
|
Pauwels E, Lajaunie C, Vert JP. A Bayesian active learning strategy for sequential experimental design in systems biology. BMC SYSTEMS BIOLOGY 2014; 8:102. [PMID: 25256134 PMCID: PMC4181721 DOI: 10.1186/s12918-014-0102-6] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/10/2014] [Accepted: 08/14/2014] [Indexed: 11/23/2022]
Abstract
BackgroundDynamical models used in systems biology involve unknown kinetic parameters. Setting these parameters is a bottleneck in many modeling projects. This motivates the estimation of these parameters from empirical data. However, this estimation problem has its own difficulties, the most important one being strong ill-conditionedness. In this context, optimizing experiments to be conducted in order to better estimate a system¿s parameters provides a promising direction to alleviate the difficulty of the task.ResultsBorrowing ideas from Bayesian experimental design and active learning, we propose a new strategy for optimal experimental design in the context of kinetic parameter estimation in systems biology. We describe algorithmic choices that allow to implement this method in a computationally tractable way and make it fully automatic. Based on simulation, we show that it outperforms alternative baseline strategies, and demonstrate the benefit to consider multiple posterior modes of the likelihood landscape, as opposed to traditional schemes based on local and Gaussian approximations.ConclusionThis analysis demonstrates that our new, fully automatic Bayesian optimal experimental design strategy has the potential to support the design of experiments for kinetic parameter estimation in systems biology.
Collapse
Affiliation(s)
- Edouard Pauwels
- />CNRS, LAAS, 7 Avenue du Colonel Roche, Toulouse, F-31400 France
- />Université de Toulouse LAAS, Toulouse, F-31400 France
| | - Christian Lajaunie
- />MINES ParisTech, PSL-Research University, CBIO-Centre for Computational Biology, 35 rue Saint-Honoré, Fontainebleau, 77300 France
- />Institut Curie, 26 rue d’Ulm, F-75248, Paris, France
- />INSERM U900, Paris, F-75248 France
| | - Jean-Philippe Vert
- />MINES ParisTech, PSL-Research University, CBIO-Centre for Computational Biology, 35 rue Saint-Honoré, Fontainebleau, 77300 France
- />Institut Curie, 26 rue d’Ulm, F-75248, Paris, France
- />INSERM U900, Paris, F-75248 France
| |
Collapse
|
26
|
Ud-Dean SMM, Gunawan R. Ensemble inference and inferability of gene regulatory networks. PLoS One 2014; 9:e103812. [PMID: 25093509 PMCID: PMC4122380 DOI: 10.1371/journal.pone.0103812] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2014] [Accepted: 07/05/2014] [Indexed: 01/05/2023] Open
Abstract
The inference of gene regulatory network (GRN) from gene expression data is an unsolved problem of great importance. This inference has been stated, though not proven, to be underdetermined implying that there could be many equivalent (indistinguishable) solutions. Motivated by this fundamental limitation, we have developed new framework and algorithm, called TRaCE, for the ensemble inference of GRNs. The ensemble corresponds to the inherent uncertainty associated with discriminating direct and indirect gene regulations from steady-state data of gene knock-out (KO) experiments. We applied TRaCE to analyze the inferability of random GRNs and the GRNs of E. coli and yeast from single- and double-gene KO experiments. The results showed that, with the exception of networks with very few edges, GRNs are typically not inferable even when the data are ideal (unbiased and noise-free). Finally, we compared the performance of TRaCE with top performing methods of DREAM4 in silico network inference challenge.
Collapse
Affiliation(s)
| | - Rudiyanto Gunawan
- Institute for Chemical and Bioengineering, ETH Zurich, Zurich, Switzerland
- * E-mail:
| |
Collapse
|
27
|
Perumal TM, Gunawan R. pathPSA: A Dynamical Pathway-Based Parametric Sensitivity Analysis. Ind Eng Chem Res 2014. [DOI: 10.1021/ie403277d] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Affiliation(s)
- Thanneer Malai Perumal
- Luxembourg
Centre for Systems Biomedicine, University of Luxembourg, Esch/Alzette 4362, Luxembourg
| | - Rudiyanto Gunawan
- Institute
for Chemical and Bioengineering, ETH Zurich, Zurich 8093, Switzerland
| |
Collapse
|
28
|
Becker K, Balsa-Canto E, Cicin-Sain D, Hoermann A, Janssens H, Banga JR, Jaeger J. Reverse-engineering post-transcriptional regulation of gap genes in Drosophila melanogaster. PLoS Comput Biol 2013; 9:e1003281. [PMID: 24204230 PMCID: PMC3814631 DOI: 10.1371/journal.pcbi.1003281] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2013] [Accepted: 09/02/2013] [Indexed: 12/19/2022] Open
Abstract
Systems biology proceeds through repeated cycles of experiment and modeling. One way to implement this is reverse engineering, where models are fit to data to infer and analyse regulatory mechanisms. This requires rigorous methods to determine whether model parameters can be properly identified. Applying such methods in a complex biological context remains challenging. We use reverse engineering to study post-transcriptional regulation in pattern formation. As a case study, we analyse expression of the gap genes Krüppel, knirps, and giant in Drosophila melanogaster. We use detailed, quantitative datasets of gap gene mRNA and protein expression to solve and fit a model of post-transcriptional regulation, and establish its structural and practical identifiability. Our results demonstrate that post-transcriptional regulation is not required for patterning in this system, but is necessary for proper control of protein levels. Our work demonstrates that the uniqueness and specificity of a fitted model can be rigorously determined in the context of spatio-temporal pattern formation. This greatly increases the potential of reverse engineering for the study of development and other, similarly complex, biological processes. The analysis of pattern-forming gene networks is largely focussed on transcriptional regulation. However, post-transcriptional events, such as translation and regulation of protein stability also play important roles in the establishment of protein expression patterns and levels. In this study, we use a reverse-engineering approach—fitting mathematical models to quantitative expression data—to analyse post-transcriptional regulation of the Drosophila gap genes Krüppel, knirps and giant, involved in segment determination during early embryogenesis. Rigorous fitting requires us to establish whether our models provide a robust and unique solution. We demonstrate, for the first time, that this can be done in the context of a complex spatio-temporal regulatory system. This is an important methodological advance for reverse-engineering developmental processes. Our results indicate that post-transcriptional regulation is not required for pattern formation, but is necessary for proper regulation of gap protein levels. Specifically, we predict that translation rates must be tuned for rapid early accumulation, and protein stability must be increased for persistence of high protein levels at late stages of gap gene expression.
Collapse
Affiliation(s)
- Kolja Becker
- EMBL/CRG Research Unit in Systems Biology, Centre de Regulació Genòmica, and Universitat Pombeu Fabra (UPF), Barcelona, Spain
- Institute of Genetics, Johannes Gutenberg University, Mainz, Germany
| | | | - Damjan Cicin-Sain
- EMBL/CRG Research Unit in Systems Biology, Centre de Regulació Genòmica, and Universitat Pombeu Fabra (UPF), Barcelona, Spain
| | - Astrid Hoermann
- EMBL/CRG Research Unit in Systems Biology, Centre de Regulació Genòmica, and Universitat Pombeu Fabra (UPF), Barcelona, Spain
| | - Hilde Janssens
- EMBL/CRG Research Unit in Systems Biology, Centre de Regulació Genòmica, and Universitat Pombeu Fabra (UPF), Barcelona, Spain
| | | | - Johannes Jaeger
- EMBL/CRG Research Unit in Systems Biology, Centre de Regulació Genòmica, and Universitat Pombeu Fabra (UPF), Barcelona, Spain
- * E-mail:
| |
Collapse
|
29
|
Rodriguez-Fernandez M, Rehberg M, Kremling A, Banga JR. Simultaneous model discrimination and parameter estimation in dynamic models of cellular systems. BMC SYSTEMS BIOLOGY 2013; 7:76. [PMID: 23938131 PMCID: PMC3765209 DOI: 10.1186/1752-0509-7-76] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/08/2012] [Accepted: 08/08/2013] [Indexed: 01/06/2023]
Abstract
Background Model development is a key task in systems biology, which typically starts from an initial model candidate and, involving an iterative cycle of hypotheses-driven model modifications, leads to new experimentation and subsequent model identification steps. The final product of this cycle is a satisfactory refined model of the biological phenomena under study. During such iterative model development, researchers frequently propose a set of model candidates from which the best alternative must be selected. Here we consider this problem of model selection and formulate it as a simultaneous model selection and parameter identification problem. More precisely, we consider a general mixed-integer nonlinear programming (MINLP) formulation for model selection and identification, with emphasis on dynamic models consisting of sets of either ODEs (ordinary differential equations) or DAEs (differential algebraic equations). Results We solved the MINLP formulation for model selection and identification using an algorithm based on Scatter Search (SS). We illustrate the capabilities and efficiency of the proposed strategy with a case study considering the KdpD/KdpE system regulating potassium homeostasis in Escherichia coli. The proposed approach resulted in a final model that presents a better fit to the in silico generated experimental data. Conclusions The presented MINLP-based optimization approach for nested-model selection and identification is a powerful methodology for model development in systems biology. This strategy can be used to perform model selection and parameter estimation in one single step, thus greatly reducing the number of experiments and computations of traditional modeling approaches.
Collapse
|
30
|
|
31
|
Chakrabarty A, Buzzard GT, Rundell AE. Model-based design of experiments for cellular processes. WILEY INTERDISCIPLINARY REVIEWS-SYSTEMS BIOLOGY AND MEDICINE 2013; 5:181-203. [PMID: 23293047 DOI: 10.1002/wsbm.1204] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Affiliation(s)
- Ankush Chakrabarty
- School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN, USA
| | | | | |
Collapse
|
32
|
Chu Y, Hahn J. Necessary condition for applying experimental design criteria to global sensitivity analysis results. Comput Chem Eng 2013. [DOI: 10.1016/j.compchemeng.2012.09.007] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
33
|
Tian LP, Shi ZK, Wu FX. Complexity analysis and parameter estimation of dynamic metabolic systems. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2013; 2013:698341. [PMID: 24233242 PMCID: PMC3819894 DOI: 10.1155/2013/698341] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/24/2013] [Revised: 08/18/2013] [Accepted: 09/05/2013] [Indexed: 11/25/2022]
Abstract
A metabolic system consists of a number of reactions transforming molecules of one kind into another to provide the energy that living cells need. Based on the biochemical reaction principles, dynamic metabolic systems can be modeled by a group of coupled differential equations which consists of parameters, states (concentration of molecules involved), and reaction rates. Reaction rates are typically either polynomials or rational functions in states and constant parameters. As a result, dynamic metabolic systems are a group of differential equations nonlinear and coupled in both parameters and states. Therefore, it is challenging to estimate parameters in complex dynamic metabolic systems. In this paper, we propose a method to analyze the complexity of dynamic metabolic systems for parameter estimation. As a result, the estimation of parameters in dynamic metabolic systems is reduced to the estimation of parameters in a group of decoupled rational functions plus polynomials (which we call improper rational functions) or in polynomials. Furthermore, by taking its special structure of improper rational functions, we develop an efficient algorithm to estimate parameters in improper rational functions. The proposed method is applied to the estimation of parameters in a dynamic metabolic system. The simulation results show the superior performance of the proposed method.
Collapse
Affiliation(s)
- Li-Ping Tian
- 1School of Information, Beijing Wuzi University, Beijing 101149, China
| | - Zhong-Ke Shi
- 2School of Atuomation, Northwestern Polytechnical University, Xi'an, Shaanxi 710072, China
| | - Fang-Xiang Wu
- 3Department of Mechanical Engineering, University of Saskatchewan, 57 Campus Drive, Saskatoon, SK, Canada S7N 5A9
- 4Division of Biomedical Engineering, University of Saskatchewan, 57 Campus Drive, Saskatoon, SK, Canada S7N 5A9
- *Fang-Xiang Wu:
| |
Collapse
|
34
|
Ensemble kinetic modeling of metabolic networks from dynamic metabolic profiles. Metabolites 2012; 2:891-912. [PMID: 24957767 PMCID: PMC3901226 DOI: 10.3390/metabo2040891] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2012] [Revised: 11/02/2012] [Accepted: 11/05/2012] [Indexed: 01/21/2023] Open
Abstract
Kinetic modeling of metabolic pathways has important applications in metabolic engineering, but significant challenges still remain. The difficulties faced vary from finding best-fit parameters in a highly multidimensional search space to incomplete parameter identifiability. To meet some of these challenges, an ensemble modeling method is developed for characterizing a subset of kinetic parameters that give statistically equivalent goodness-of-fit to time series concentration data. The method is based on the incremental identification approach, where the parameter estimation is done in a step-wise manner. Numerical efficacy is achieved by reducing the dimensionality of parameter space and using efficient random parameter exploration algorithms. The shift toward using model ensembles, instead of the traditional "best-fit" models, is necessary to directly account for model uncertainty during the application of such models. The performance of the ensemble modeling approach has been demonstrated in the modeling of a generic branched pathway and the trehalose pathway in Saccharomyces cerevisiae using generalized mass action (GMA) kinetics.
Collapse
|
35
|
Gidvani RD, Sudmant P, Li G, DaSilva LF, McConkey BJ, Duncker BP, Ingalls BP. A quantitative model of the initiation of DNA replication in Saccharomyces cerevisiae predicts the effects of system perturbations. BMC SYSTEMS BIOLOGY 2012; 6:78. [PMID: 22738223 PMCID: PMC3439281 DOI: 10.1186/1752-0509-6-78] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/07/2012] [Accepted: 06/05/2012] [Indexed: 11/17/2022]
Abstract
Background Eukaryotic cell proliferation involves DNA replication, a tightly regulated process mediated by a multitude of protein factors. In budding yeast, the initiation of replication is facilitated by the heterohexameric origin recognition complex (ORC). ORC binds to specific origins of replication and then serves as a scaffold for the recruitment of other factors such as Cdt1, Cdc6, the Mcm2-7 complex, Cdc45 and the Dbf4-Cdc7 kinase complex. While many of the mechanisms controlling these associations are well documented, mathematical models are needed to explore the network’s dynamic behaviour. We have developed an ordinary differential equation-based model of the protein-protein interaction network describing replication initiation. Results The model was validated against quantified levels of protein factors over a range of cell cycle timepoints. Using chromatin extracts from synchronized Saccharomyces cerevisiae cell cultures, we were able to monitor the in vivo fluctuations of several of the aforementioned proteins, with additional data obtained from the literature. The model behaviour conforms to perturbation trials previously reported in the literature, and accurately predicts the results of our own knockdown experiments. Furthermore, we successfully incorporated our replication initiation model into an established model of the entire yeast cell cycle, thus providing a comprehensive description of these processes. Conclusions This study establishes a robust model of the processes driving DNA replication initiation. The model was validated against observed cell concentrations of the driving factors, and characterizes the interactions between factors implicated in eukaryotic DNA replication. Finally, this model can serve as a guide in efforts to generate a comprehensive model of the mammalian cell cycle in order to explore cancer-related phenotypes.
Collapse
Affiliation(s)
- Rohan D Gidvani
- Department of Biology, University of Waterloo, Waterloo, ON, Canada
| | | | | | | | | | | | | |
Collapse
|
36
|
Bazil JN, Buzzard GT, Rundell AE. A Global Parallel Model Based Design of Experiments Method to Minimize Model Output Uncertainty. Bull Math Biol 2011; 74:688-716. [DOI: 10.1007/s11538-011-9686-9] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2011] [Accepted: 08/04/2011] [Indexed: 01/14/2023]
|
37
|
Chu Y, Hahn J. Generalization of a parameter set selection procedure based on orthogonal projections and the D-optimality criterion. AIChE J 2011. [DOI: 10.1002/aic.12727] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
38
|
Tian LP, Liu L, Wu FX. Parameter estimation method for improper fractional models and its application to molecular biological systems. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2011; 2010:1477-80. [PMID: 21096361 DOI: 10.1109/iembs.2010.5626849] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Derived from biochemical principles, molecular biological systems can be described by a group of differential equations. Generally these differential equations contain fractional functions plus polynomials (which we call improper fractional model) as reaction rates. As a result, molecular biological systems are nonlinear in both parameters and states. It is well known that it is challenging to estimate parameters nonlinear in a model. However, in fractional functions both the denominator and numerator are linear in the parameters while polynomials are also linear in parameters. Based on this observation, we develop an iterative linear least squares method for estimating parameters in biological systems modeled by improper fractional functions. The basic idea is to transfer optimizing a nonlinear least squares objective function into iteratively solving a sequence of linear least squares problems. The developed method is applied to the estimation of parameters in a metabolism system. The simulation results show the superior performance of the proposed method for estimating parameters in such molecular biological systems.
Collapse
Affiliation(s)
- Li-Ping Tian
- School of Information, Beijing Wuzi University, No.1 Fuhe Street, Tongzhou District, China
| | | | | |
Collapse
|
39
|
Zhan C, Yeung LF. Parameter estimation in systems biology models using spline approximation. BMC SYSTEMS BIOLOGY 2011; 5:14. [PMID: 21255466 PMCID: PMC3750107 DOI: 10.1186/1752-0509-5-14] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/18/2010] [Accepted: 01/24/2011] [Indexed: 11/24/2022]
Abstract
Background Mathematical models for revealing the dynamics and interactions properties of biological systems play an important role in computational systems biology. The inference of model parameter values from time-course data can be considered as a "reverse engineering" process and is still one of the most challenging tasks. Many parameter estimation methods have been developed but none of these methods is effective for all cases and can overwhelm all other approaches. Instead, various methods have their advantages and disadvantages. It is worth to develop parameter estimation methods which are robust against noise, efficient in computation and flexible enough to meet different constraints. Results Two parameter estimation methods of combining spline theory with Linear Programming (LP) and Nonlinear Programming (NLP) are developed. These methods remove the need for ODE solvers during the identification process. Our analysis shows that the augmented cost function surfaces used in the two proposed methods are smoother; which can ease the optima searching process and hence enhance the robustness and speed of the search algorithm. Moreover, the cores of our algorithms are LP and NLP based, which are flexible and consequently additional constraints can be embedded/removed easily. Eight system biology models are used for testing the proposed approaches. Our results confirm that the proposed methods are both efficient and robust. Conclusions The proposed approaches have general application to identify unknown parameter values of a wide range of systems biology models.
Collapse
Affiliation(s)
- Choujun Zhan
- Department of Electronic Engineering, City University of Hong Kong, PR China.
| | | |
Collapse
|
40
|
MIAO HONGYU, XIA XIAOHUA, PERELSON ALANS, WU HULIN. ON IDENTIFIABILITY OF NONLINEAR ODE MODELS AND APPLICATIONS IN VIRAL DYNAMICS. SIAM REVIEW. SOCIETY FOR INDUSTRIAL AND APPLIED MATHEMATICS 2011; 53:3-39. [PMID: 21785515 PMCID: PMC3140286 DOI: 10.1137/090757009] [Citation(s) in RCA: 267] [Impact Index Per Article: 20.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2023]
Abstract
Ordinary differential equations (ODE) are a powerful tool for modeling dynamic processes with wide applications in a variety of scientific fields. Over the last 2 decades, ODEs have also emerged as a prevailing tool in various biomedical research fields, especially in infectious disease modeling. In practice, it is important and necessary to determine unknown parameters in ODE models based on experimental data. Identifiability analysis is the first step in determing unknown parameters in ODE models and such analysis techniques for nonlinear ODE models are still under development. In this article, we review identifiability analysis methodologies for nonlinear ODE models developed in the past one to two decades, including structural identifiability analysis, practical identifiability analysis and sensitivity-based identifiability analysis. Some advanced topics and ongoing research are also briefly reviewed. Finally, some examples from modeling viral dynamics of HIV, influenza and hepatitis viruses are given to illustrate how to apply these identifiability analysis methods in practice.
Collapse
Affiliation(s)
- HONGYU MIAO
- Department of Biostatistics and Computational Biology, University of Rochester School of Medicine and Dentistry, 601 Elmwood Avenue, Box 630, Rochester, New York 14642, USA
| | - XIAOHUA XIA
- Department of Electrical, Electronic and Computer Engineering, University of Pretoria, Lynnwood Road, Pretoria 0002, South Africa
| | - ALAN S. PERELSON
- Theoretical Biology and Biophysics Group, MS-K710, Los Alamos National Laboratory, Los Alamos, NM 87545, USA
| | - HULIN WU
- Department of Biostatistics and Computational Biology, University of Rochester School of Medicine and Dentistry, 601 Elmwood Avenue, Box 630, Rochester, New York 14642, USA
| |
Collapse
|
41
|
Apgar JF, Witmer DK, White FM, Tidor B. Sloppy models, parameter uncertainty, and the role of experimental design. MOLECULAR BIOSYSTEMS 2010; 6:1890-900. [PMID: 20556289 PMCID: PMC3505121 DOI: 10.1039/b918098b] [Citation(s) in RCA: 76] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Computational models are increasingly used to understand and predict complex biological phenomena. These models contain many unknown parameters, at least some of which are difficult to measure directly, and instead are estimated by fitting to time-course data. Previous work has suggested that even with precise data sets, many parameters are unknowable by trajectory measurements. We examined this question in the context of a pathway model of epidermal growth factor (EGF) and neuronal growth factor (NGF) signaling. Computationally, we examined a palette of experimental perturbations that included different doses of EGF and NGF as well as single and multiple gene knockdowns and overexpressions. While no single experiment could accurately estimate all of the parameters, experimental design methodology identified a set of five complementary experiments that could. These results suggest optimism for the prospects for calibrating even large models, that the success of parameter estimation is intimately linked to the experimental perturbations used, and that experimental design methodology is important for parameter fitting of biological models and likely for the accuracy that can be expected from them.
Collapse
Affiliation(s)
- Joshua F. Apgar
- Department of Biological Engineering
- Computer Science and Artificial Intelligence Laboratory
| | - David K. Witmer
- Computer Science and Artificial Intelligence Laboratory
- Department of Electrical Engineering and Computer Science
| | - Forest M. White
- Department of Biological Engineering
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
| | - Bruce Tidor
- Department of Biological Engineering
- Computer Science and Artificial Intelligence Laboratory
- Department of Electrical Engineering and Computer Science
| |
Collapse
|
42
|
Poovathingal SK, Gunawan R. Global parameter estimation methods for stochastic biochemical systems. BMC Bioinformatics 2010; 11:414. [PMID: 20691037 PMCID: PMC2928803 DOI: 10.1186/1471-2105-11-414] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2010] [Accepted: 08/06/2010] [Indexed: 11/16/2022] Open
Abstract
Background The importance of stochasticity in cellular processes having low number of molecules has resulted in the development of stochastic models such as chemical master equation. As in other modelling frameworks, the accompanying rate constants are important for the end-applications like analyzing system properties (e.g. robustness) or predicting the effects of genetic perturbations. Prior knowledge of kinetic constants is usually limited and the model identification routine typically includes parameter estimation from experimental data. Although the subject of parameter estimation is well-established for deterministic models, it is not yet routine for the chemical master equation. In addition, recent advances in measurement technology have made the quantification of genetic substrates possible to single molecular levels. Thus, the purpose of this work is to develop practical and effective methods for estimating kinetic model parameters in the chemical master equation and other stochastic models from single cell and cell population experimental data. Results Three parameter estimation methods are proposed based on the maximum likelihood and density function distance, including probability and cumulative density functions. Since stochastic models such as chemical master equations are typically solved using a Monte Carlo approach in which only a finite number of Monte Carlo realizations are computationally practical, specific considerations are given to account for the effect of finite sampling in the histogram binning of the state density functions. Applications to three practical case studies showed that while maximum likelihood method can effectively handle low replicate measurements, the density function distance methods, particularly the cumulative density function distance estimation, are more robust in estimating the parameters with consistently higher accuracy, even for systems showing multimodality. Conclusions The parameter estimation methodologies described in this work have provided an effective and practical approach in the estimation of kinetic parameters of stochastic systems from either sparse or dense cell population data. Nevertheless, similar to kinetic parameter estimation in other modelling frameworks, not all parameters can be estimated accurately, which is a common problem arising from the lack of complete parameter identifiability from the available data.
Collapse
Affiliation(s)
- Suresh Kumar Poovathingal
- Department of Chemical and Biomolecular Engineering, National University of Singapore, 4 Engineering Drive 4, 117576, Singapore
| | | |
Collapse
|
43
|
Kriete A, Bosl WJ, Booker G. Rule-based cell systems model of aging using feedback loop motifs mediated by stress responses. PLoS Comput Biol 2010; 6:e1000820. [PMID: 20585546 PMCID: PMC2887462 DOI: 10.1371/journal.pcbi.1000820] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2009] [Accepted: 05/18/2010] [Indexed: 01/01/2023] Open
Abstract
Investigating the complex systems dynamics of the aging process requires integration of a broad range of cellular processes describing damage and functional decline co-existing with adaptive and protective regulatory mechanisms. We evolve an integrated generic cell network to represent the connectivity of key cellular mechanisms structured into positive and negative feedback loop motifs centrally important for aging. The conceptual network is casted into a fuzzy-logic, hybrid-intelligent framework based on interaction rules assembled from a priori knowledge. Based upon a classical homeostatic representation of cellular energy metabolism, we first demonstrate how positive-feedback loops accelerate damage and decline consistent with a vicious cycle. This model is iteratively extended towards an adaptive response model by incorporating protective negative-feedback loop circuits. Time-lapse simulations of the adaptive response model uncover how transcriptional and translational changes, mediated by stress sensors NF-κB and mTOR, counteract accumulating damage and dysfunction by modulating mitochondrial respiration, metabolic fluxes, biosynthesis, and autophagy, crucial for cellular survival. The model allows consideration of lifespan optimization scenarios with respect to fitness criteria using a sensitivity analysis. Our work establishes a novel extendable and scalable computational approach capable to connect tractable molecular mechanisms with cellular network dynamics underlying the emerging aging phenotype. The global process of aging disturbs a broad range of cellular mechanisms in a complex fashion and is not well understood. One important goal of computational approaches in aging is to develop integrated models in terms of a unifying aging theory, predicting progression of aging phenotypes grounded on molecular mechanisms. However, current experimental data incoherently reflects many isolated processes from a large diversity of approaches, biological model systems, and species, which makes such integration a challenging task. In an attempt to close this gap, we iteratively develop a fuzzy-logic cell systems model considering the interplay of damage, metabolism, and signaling by positive and negative feedback-loop motifs using relationships drawn from literature data. Because cellular biodynamics may be considered a complex control system, this approach seems particularly suitable. Here, we demonstrate that rule-based fuzzy-logic models provide semi-quantitative predictions that enhance our understanding of complex and interlocked molecular mechanisms and their implications on the aging physiome.
Collapse
Affiliation(s)
- Andres Kriete
- School of Biomedical Engineering, Science and Health Systems, Drexel University, Bossone Research Center, Philadelphia, Pennsylvania, United States of America.
| | | | | |
Collapse
|
44
|
Srinath S, Gunawan R. Parameter identifiability of power-law biochemical system models. J Biotechnol 2010; 149:132-40. [PMID: 20197073 DOI: 10.1016/j.jbiotec.2010.02.019] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2009] [Revised: 02/19/2010] [Accepted: 02/22/2010] [Indexed: 11/20/2022]
Abstract
Mathematical modeling has become an integral component in biotechnology, in which these models are frequently used to design and optimize bioprocesses. Canonical models, like power-laws within the Biochemical Systems Theory, offer numerous mathematical and numerical advantages, including built-in flexibility to simulate general nonlinear behavior. The construction of such models relies on the estimation of unknown case-specific model parameters by way of experimental data fitting, also known as inverse modeling. Despite the large number of publications on this topic, this task remains the bottleneck in canonical modeling of biochemical systems. The focus of this paper concerns with the question of identifiability of power-law models from dynamic data, that is, whether the parameter values can be uniquely and accurately identified from time-series data. Existing and newly developed parameter identifiability methods were applied to two power-law models of biochemical systems, and the results pointed to the lack of parametric identifiability as the root cause of the difficulty faced in the inverse modeling. Despite the focus on power-law models, the analyses and conclusions are extendable to other canonical models, and the issue of parameter identifiability is expected to be a common problem in biochemical system modeling.
Collapse
Affiliation(s)
- Sridharan Srinath
- Department of Chemical and Biomolecular Engineering, National University of Singapore, Blk E5, 4 Engineering Drive 4, #02-16, Singapore 117576, Singapore
| | | |
Collapse
|
45
|
Balsa-Canto E, Alonso AA, Banga JR. An iterative identification procedure for dynamic modeling of biochemical networks. BMC SYSTEMS BIOLOGY 2010; 4:11. [PMID: 20163703 PMCID: PMC2838808 DOI: 10.1186/1752-0509-4-11] [Citation(s) in RCA: 90] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/14/2009] [Accepted: 02/17/2010] [Indexed: 12/16/2022]
Abstract
Background Mathematical models provide abstract representations of the information gained from experimental observations on the structure and function of a particular biological system. Conferring a predictive character on a given mathematical formulation often relies on determining a number of non-measurable parameters that largely condition the model's response. These parameters can be identified by fitting the model to experimental data. However, this fit can only be accomplished when identifiability can be guaranteed. Results We propose a novel iterative identification procedure for detecting and dealing with the lack of identifiability. The procedure involves the following steps: 1) performing a structural identifiability analysis to detect identifiable parameters; 2) globally ranking the parameters to assist in the selection of the most relevant parameters; 3) calibrating the model using global optimization methods; 4) conducting a practical identifiability analysis consisting of two (a priori and a posteriori) phases aimed at evaluating the quality of given experimental designs and of the parameter estimates, respectively and 5) optimal experimental design so as to compute the scheme of experiments that maximizes the quality and quantity of information for fitting the model. Conclusions The presented procedure was used to iteratively identify a mathematical model that describes the NF-κB regulatory module involving several unknown parameters. We demonstrated the lack of identifiability of the model under typical experimental conditions and computed optimal dynamic experiments that largely improved identifiability properties.
Collapse
Affiliation(s)
- Eva Balsa-Canto
- Bioprocess Engineering Group, Spanish National Research Council, IIM-CSIC, 36208 Vigo-Spain.
| | | | | |
Collapse
|
46
|
Niemi J, West M. Adaptive Mixture Modeling Metropolis Methods for Bayesian Analysis of Nonlinear State-Space Models. J Comput Graph Stat 2010; 19:260-280. [DOI: 10.1198/jcgs.2010.08117] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
47
|
Improving prediction capabilities of complex dynamic models via parameter selection and estimation. Chem Eng Sci 2009. [DOI: 10.1016/j.ces.2009.06.057] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
48
|
Thingnes J, Oyehaug L, Hovig E, Omholt SW. The mathematics of tanning. BMC SYSTEMS BIOLOGY 2009; 3:60. [PMID: 19505344 PMCID: PMC2714304 DOI: 10.1186/1752-0509-3-60] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/24/2008] [Accepted: 06/09/2009] [Indexed: 11/10/2022]
Abstract
BACKGROUND The pigment melanin is produced by specialized cells, called melanocytes. In healthy skin, melanocytes are sparsely spread among the other cell types in the basal layer of the epidermis. Sun tanning results from an UV-induced increase in the release of melanin to neighbouring keratinocytes, the major cell type component of the epidermis as well as redistribution of melanin among these cells. Here we provide a mathematical conceptualization of our current knowledge of the tanning response, in terms of a dynamic model. The resolution level of the model is tuned to available data, and its primary focus is to describe the tanning response following UV exposure. RESULTS The model appears capable of accounting for available experimental data on the tanning response in different skin and photo types. It predicts that the thickness of the epidermal layer and how far the melanocyte dendrites grow out in the epidermal layers after UV exposure influence the tanning response substantially. CONCLUSION Despite the paucity of experimental validation data the model is constrained enough to serve as a foundation for the establishment of a theoretical-experimental research programme aimed at elucidating the more fine-grained regulatory anatomy underlying the tanning response.
Collapse
Affiliation(s)
- Josef Thingnes
- Centre for Integrative Genetics (CIGENE), Norwegian University of Life Sciences (UMB), PO Box 5003, 1432 As, Norway.
| | | | | | | |
Collapse
|
49
|
Jiménez-Hornero JE, Santos-Dueñas IM, García-García I. Optimization of biotechnological processes. The acetic acid fermentation. Part II: Practical identifiability analysis and parameter estimation. Biochem Eng J 2009. [DOI: 10.1016/j.bej.2009.01.010] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
50
|
Quaiser T, Mönnigmann M. Systematic identifiability testing for unambiguous mechanistic modeling--application to JAK-STAT, MAP kinase, and NF-kappaB signaling pathway models. BMC SYSTEMS BIOLOGY 2009; 3:50. [PMID: 19426527 PMCID: PMC2714303 DOI: 10.1186/1752-0509-3-50] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/14/2008] [Accepted: 05/09/2009] [Indexed: 11/20/2022]
Abstract
Background When creating mechanistic mathematical models for biological signaling processes it is tempting to include as many known biochemical interactions into one large model as possible. For the JAK-STAT, MAP kinase, and NF-κB pathways a lot of biological insight is available, and as a consequence, large mathematical models have emerged. For large models the question arises whether unknown model parameters can uniquely be determined by parameter estimation from measured data. Systematic approaches to answering this question are indispensable since the uniqueness of model parameter values is essential for predictive mechanistic modeling. Results We propose an eigenvalue based method for efficiently testing identifiability of large ordinary differential models and compare this approach to three existing ones. The methods are benchmarked by applying them to models of the signaling pathways mentioned above. In all cases the eigenvalue method proposed here and the orthogonal method find the largest set of identifiable parameters, thus clearly outperforming the other approaches. The identifiability analysis shows that the pathway models are not identifiable, even under the strong assumption that all system state variables are measurable. We demonstrate how the results of the identifiability analysis can be used for model simplification. Conclusion While it has undoubtedly contributed to recent advances in systems biology, mechanistic modeling by itself does not guarantee unambiguous descriptions of biological processes. We show that some recent signal transduction pathway models have reached a level of detail that is not warranted. Rigorous identifiability tests reveal that even if highly idealized experiments could be carried out to measure all state variables of these signaling pathways, some unknown parameters could still not be estimated. The identifiability tests therefore show that the level of detail of the investigated models is too high in principle, not just because too little experimental information is available. We demonstrate how the proposed method can be combined with biological insight, however, to simplify these models.
Collapse
Affiliation(s)
- Tom Quaiser
- Institute for Heat and Fuel Technology, Technical University Braunschweig, D-38106 Braunschweig, Germany.
| | | |
Collapse
|