1
|
Klumpe HE, Lugagne JB, Khalil AS, Dunlop MJ. Deep Neural Networks for Predicting Single-Cell Responses and Probability Landscapes. ACS Synth Biol 2023; 12:2367-2381. [PMID: 37467372 DOI: 10.1021/acssynbio.3c00203] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/21/2023]
Abstract
Engineering biology relies on the accurate prediction of cell responses. However, making these predictions is challenging for a variety of reasons, including the stochasticity of biochemical reactions, variability between cells, and incomplete information about underlying biological processes. Machine learning methods, which can model diverse input-output relationships without requiring a priori mechanistic knowledge, are an ideal tool for this task. For example, such approaches can be used to predict gene expression dynamics given time-series data of past expression history. To explore this application, we computationally simulated single-cell responses, incorporating different sources of noise and alternative genetic circuit designs. We showed that deep neural networks trained on these simulated data were able to correctly infer the underlying dynamics of a cell response even in the presence of measurement noise and stochasticity in the biochemical reactions. The training set size and the amount of past data provided as inputs both affected prediction quality, with cascaded genetic circuits that introduce delays requiring more past data. We also tested prediction performance on a bistable auto-activation circuit, finding that our initial method for predicting a single trajectory was fundamentally ill-suited for multimodal dynamics. To address this, we updated the network architecture to predict the entire distribution of future states, showing it could accurately predict bimodal expression distributions. Overall, these methods can be readily applied to the diverse prediction tasks necessary to predict and control a variety of biological circuits, a key aspect of many synthetic biology applications.
Collapse
Affiliation(s)
- Heidi E Klumpe
- Biomedical Engineering, Boston University, Boston, Massachusetts 02215, United States
- Biological Design Center, Boston University, Boston, Massachusetts 02215, United States
| | - Jean-Baptiste Lugagne
- Biomedical Engineering, Boston University, Boston, Massachusetts 02215, United States
- Biological Design Center, Boston University, Boston, Massachusetts 02215, United States
| | - Ahmad S Khalil
- Biomedical Engineering, Boston University, Boston, Massachusetts 02215, United States
- Biological Design Center, Boston University, Boston, Massachusetts 02215, United States
- Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, Massachusetts 02115, United States
| | - Mary J Dunlop
- Biomedical Engineering, Boston University, Boston, Massachusetts 02215, United States
- Biological Design Center, Boston University, Boston, Massachusetts 02215, United States
| |
Collapse
|
2
|
Murali A, Sarkar RR. Mechano-immunology in microgravity. LIFE SCIENCES IN SPACE RESEARCH 2023; 37:50-64. [PMID: 37087179 DOI: 10.1016/j.lssr.2023.03.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Revised: 02/16/2023] [Accepted: 03/05/2023] [Indexed: 05/03/2023]
Abstract
Life on Earth has evolved to thrive in the Earth's natural gravitational field; however, as space technology advances, we must revisit and investigate the effects of unnatural conditions on human health, such as gravitational change. Studies have shown that microgravity has a negative impact on various systemic parts of humans, with the effects being more severe in the human immune system. Increasing costs, limited experimental time, and sample handling issues hampered our understanding of this field. To address the existing knowledge gap and provide confidence in modelling the phenomena, in this review, we highlight experimental works in mechano-immunology under microgravity and different computational modelling approaches that can be used to address the existing problems.
Collapse
Affiliation(s)
- Anirudh Murali
- Chemical Engineering and Process Development, CSIR - National Chemical Laboratory, Pune, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, 201002, India
| | - Ram Rup Sarkar
- Chemical Engineering and Process Development, CSIR - National Chemical Laboratory, Pune, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, 201002, India.
| |
Collapse
|
3
|
Sarmah D, Smith GR, Bouhaddou M, Stern AD, Erskine J, Birtwistle MR. Network inference from perturbation time course data. NPJ Syst Biol Appl 2022; 8:42. [PMID: 36316338 PMCID: PMC9622863 DOI: 10.1038/s41540-022-00253-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2022] [Accepted: 10/18/2022] [Indexed: 11/05/2022] Open
Abstract
Networks underlie much of biology from subcellular to ecological scales. Yet, understanding what experimental data are needed and how to use them for unambiguously identifying the structure of even small networks remains a broad challenge. Here, we integrate a dynamic least squares framework into established modular response analysis (DL-MRA), that specifies sufficient experimental perturbation time course data to robustly infer arbitrary two and three node networks. DL-MRA considers important network properties that current methods often struggle to capture: (i) edge sign and directionality; (ii) cycles with feedback or feedforward loops including self-regulation; (iii) dynamic network behavior; (iv) edges external to the network; and (v) robust performance with experimental noise. We evaluate the performance of and the extent to which the approach applies to cell state transition networks, intracellular signaling networks, and gene regulatory networks. Although signaling networks are often an application of network reconstruction methods, the results suggest that only under quite restricted conditions can they be robustly inferred. For gene regulatory networks, the results suggest that incomplete knockdown is often more informative than full knockout perturbation, which may change experimental strategies for gene regulatory network reconstruction. Overall, the results give a rational basis to experimental data requirements for network reconstruction and can be applied to any such problem where perturbation time course experiments are possible.
Collapse
Affiliation(s)
- Deepraj Sarmah
- Department of Chemical and Biomolecular Engineering, Clemson University, Clemson, SC, USA
| | - Gregory R Smith
- Department of Neurology, Center for Advanced Research on Diagnostic Assays, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Mehdi Bouhaddou
- J. David Gladstone Institutes, San Francisco, CA, 94158, USA
- Department of Cellular and Molecular Pharmacology, University of California San Francisco, San Francisco, CA, 94158, USA
| | - Alan D Stern
- Department of Pharmacological Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - James Erskine
- Department of Chemical and Biomolecular Engineering, Clemson University, Clemson, SC, USA
| | - Marc R Birtwistle
- Department of Chemical and Biomolecular Engineering, Clemson University, Clemson, SC, USA.
- Department of Bioengineering, Clemson University, Clemson, SC, USA.
| |
Collapse
|
4
|
Kariya Y, Honma M, Tokuda K, Konagaya A, Suzuki H. Utility of constraints reflecting system stability on analyses for biological models. PLoS Comput Biol 2022; 18:e1010441. [PMID: 36084151 PMCID: PMC9491612 DOI: 10.1371/journal.pcbi.1010441] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2021] [Revised: 09/21/2022] [Accepted: 07/26/2022] [Indexed: 12/03/2022] Open
Abstract
Simulating complex biological models consisting of multiple ordinary differential equations can aid in the prediction of the pharmacological/biological responses; however, they are often hampered by the availability of reliable kinetic parameters. In the present study, we aimed to discover the properties of behaviors without determining an optimal combination of kinetic parameter values (parameter set). The key idea was to collect as many parameter sets as possible. Given that many systems are biologically stable and resilient (BSR), we focused on the dynamics around the steady state and formulated objective functions for BSR by partial linear approximation of the focused region. Using the objective functions and modified global cluster Newton method, we developed an algorithm for a thorough exploration of the allowable parameter space for biological systems (TEAPS). We first applied TEAPS to the NF-κB signaling model. This system shows a damped oscillation after stimulation and seems to fit the BSR constraint. By applying TEAPS, we found several directions in parameter space which stringently determines the BSR property. In such directions, the experimentally fitted parameter values were included in the range of the obtained parameter sets. The arachidonic acid metabolic pathway model was used as a model related to pharmacological responses. The pharmacological effects of nonsteroidal anti-inflammatory drugs were simulated using the parameter sets obtained by TEAPS. The structural properties of the system were partly extracted by analyzing the distribution of the obtained parameter sets. In addition, the simulations showed inter-drug differences in prostacyclin to thromboxane A2 ratio such that aspirin treatment tends to increase the ratio, while rofecoxib treatment tends to decrease it. These trends are comparable to the clinical observations. These results on real biological models suggest that the parameter sets satisfying the BSR condition can help in finding biologically plausible parameter sets and understanding the properties of biological systems. We propose a new method to analyze the properties of biological dynamic models, which we named TEAPS (Thorough Exploration of Allowable Parameter Space). TEAPS can thoroughly determine combinations of parameter values for ordinary differential equations with which an initial state in a certain range converges to a particular fixed point. This stable and resilient behavior is a characteristic shared with many biological systems, including metabolic systems and intracellular signaling systems. Therefore, this thorough search outlined the possible parameter space as biological systems for target models, which helps to understand the system constraints when the target systems behave dynamically. The obtained parameter space can be used as an initial space for parameter tuning. For models that include a large number of parameters, the parameter space to be searched in the parameter tuning process is too large; therefore, narrowing down the space by TEAPS potentially contributes to the analysis of the dynamics of complicated biological models. Thus, our approach can partly overcome the current problem in parameter tuning and can advance the computational dynamic analyses of biological systems.
Collapse
Affiliation(s)
- Yoshiaki Kariya
- Department of Pharmacy, The University of Tokyo Hospital, Faculty of Medicine, The University of Tokyo, Bunkyo-ku, Tokyo, Japan
| | - Masashi Honma
- Department of Pharmacy, The University of Tokyo Hospital, Faculty of Medicine, The University of Tokyo, Bunkyo-ku, Tokyo, Japan
- * E-mail:
| | - Keita Tokuda
- Department of Computer Science, University of Tsukuba, Tsukuba, Ibaraki, Japan
| | - Akihiko Konagaya
- Molecular Robotics Research Institute, Limited, Kyowa Create Dai-ichi, Minato-ku, Tokyo, Japan
| | - Hiroshi Suzuki
- Department of Pharmacy, The University of Tokyo Hospital, Faculty of Medicine, The University of Tokyo, Bunkyo-ku, Tokyo, Japan
| |
Collapse
|
5
|
Abstract
![]()
AlphaFold has burst into our lives. A powerful algorithm
that underscores
the strength of biological sequence data and artificial intelligence
(AI). AlphaFold has appended projects and research directions. The
database it has been creating promises an untold number of applications
with vast potential impacts that are still difficult to surmise. AI
approaches can revolutionize personalized treatments and usher in
better-informed clinical trials. They promise to make giant leaps
toward reshaping and revamping drug discovery strategies, selecting
and prioritizing combinations of drug targets. Here, we briefly overview
AI in structural biology, including in molecular dynamics simulations
and prediction of microbiota–human protein–protein interactions.
We highlight the advancements accomplished by the deep-learning-powered
AlphaFold in protein structure prediction and their powerful impact
on the life sciences. At the same time, AlphaFold does not resolve
the decades-long protein folding challenge, nor does it identify the
folding pathways. The models that AlphaFold provides do not capture
conformational mechanisms like frustration and allostery, which are
rooted in ensembles, and controlled by their dynamic distributions.
Allostery and signaling are properties of populations. AlphaFold also
does not generate ensembles of intrinsically disordered proteins and
regions, instead describing them by their low structural probabilities.
Since AlphaFold generates single ranked structures, rather than conformational
ensembles, it cannot elucidate the mechanisms of allosteric activating
driver hotspot mutations nor of allosteric drug resistance. However,
by capturing key features, deep learning techniques can use the single
predicted conformation as the basis for generating a diverse ensemble.
Collapse
Affiliation(s)
- Ruth Nussinov
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research, Frederick, Maryland 21702, United States.,Department of Human Molecular Genetics and Biochemistry, Sackler School of Medicine, Tel Aviv University, Tel Aviv 69978, Israel
| | - Mingzhen Zhang
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research, Frederick, Maryland 21702, United States
| | - Yonglan Liu
- Cancer Innovation Laboratory, National Cancer Institute, Frederick, Maryland 21702, United States
| | - Hyunbum Jang
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research, Frederick, Maryland 21702, United States
| |
Collapse
|
6
|
Villaverde AF, Pathirana D, Fröhlich F, Hasenauer J, Banga JR. A protocol for dynamic model calibration. Brief Bioinform 2022; 23:bbab387. [PMID: 34619769 PMCID: PMC8769694 DOI: 10.1093/bib/bbab387] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2021] [Revised: 08/06/2021] [Accepted: 08/29/2021] [Indexed: 12/23/2022] Open
Abstract
Ordinary differential equation models are nowadays widely used for the mechanistic description of biological processes and their temporal evolution. These models typically have many unknown and nonmeasurable parameters, which have to be determined by fitting the model to experimental data. In order to perform this task, known as parameter estimation or model calibration, the modeller faces challenges such as poor parameter identifiability, lack of sufficiently informative experimental data and the existence of local minima in the objective function landscape. These issues tend to worsen with larger model sizes, increasing the computational complexity and the number of unknown parameters. An incorrectly calibrated model is problematic because it may result in inaccurate predictions and misleading conclusions. For nonexpert users, there are a large number of potential pitfalls. Here, we provide a protocol that guides the user through all the steps involved in the calibration of dynamic models. We illustrate the methodology with two models and provide all the code required to reproduce the results and perform the same analysis on new models. Our protocol provides practitioners and researchers in biological modelling with a one-stop guide that is at the same time compact and sufficiently comprehensive to cover all aspects of the problem.
Collapse
Affiliation(s)
- Alejandro F Villaverde
- Universidade de Vigo, Department of Systems Engineering & Control, Vigo 36310, Galicia, Spain
| | - Dilan Pathirana
- Faculty of Mathematics and Natural Sciences, University of Bonn, Bonn 53115, Germany
| | - Fabian Fröhlich
- Institute of Computational Biology, Helmholtz Zentrum München, Neuherberg 85764, Germany
| | - Jan Hasenauer
- Center for Mathematics, Technische Universität München, Garching 85748, Germany
- Harvard Medical School, Cambridge, MA 02115, USA
| | - Julio R Banga
- Bioprocess Engineering Group, IIM-CSIC, Vigo 36208, Galicia, Spain
| |
Collapse
|
7
|
Stapor P, Schmiester L, Wierling C, Merkt S, Pathirana D, Lange BMH, Weindl D, Hasenauer J. Mini-batch optimization enables training of ODE models on large-scale datasets. Nat Commun 2022; 13:34. [PMID: 35013141 PMCID: PMC8748893 DOI: 10.1038/s41467-021-27374-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2019] [Accepted: 11/11/2021] [Indexed: 11/09/2022] Open
Abstract
Quantitative dynamic models are widely used to study cellular signal processing. A critical step in modelling is the estimation of unknown model parameters from experimental data. As model sizes and datasets are steadily growing, established parameter optimization approaches for mechanistic models become computationally extremely challenging. Mini-batch optimization methods, as employed in deep learning, have better scaling properties. In this work, we adapt, apply, and benchmark mini-batch optimization for ordinary differential equation (ODE) models, thereby establishing a direct link between dynamic modelling and machine learning. On our main application example, a large-scale model of cancer signaling, we benchmark mini-batch optimization against established methods, achieving better optimization results and reducing computation by more than an order of magnitude. We expect that our work will serve as a first step towards mini-batch optimization tailored to ODE models and enable modelling of even larger and more complex systems than what is currently possible.
Collapse
Affiliation(s)
- Paul Stapor
- Helmholtz Zentrum München - German Research Center for Environmental Health, Institute of Computational Biology, 85764, Neuherberg, Germany
- Technische Universität München, Center for Mathematics, Chair of Mathematical Modeling of Biological Systems, 85748, Garching, Germany
| | - Leonard Schmiester
- Helmholtz Zentrum München - German Research Center for Environmental Health, Institute of Computational Biology, 85764, Neuherberg, Germany
- Technische Universität München, Center for Mathematics, Chair of Mathematical Modeling of Biological Systems, 85748, Garching, Germany
| | | | - Simon Merkt
- Universität Bonn, Faculty of Mathematics and Natural Sciences, 53115, Bonn, Germany
| | - Dilan Pathirana
- Universität Bonn, Faculty of Mathematics and Natural Sciences, 53115, Bonn, Germany
| | | | - Daniel Weindl
- Helmholtz Zentrum München - German Research Center for Environmental Health, Institute of Computational Biology, 85764, Neuherberg, Germany
| | - Jan Hasenauer
- Helmholtz Zentrum München - German Research Center for Environmental Health, Institute of Computational Biology, 85764, Neuherberg, Germany.
- Technische Universität München, Center for Mathematics, Chair of Mathematical Modeling of Biological Systems, 85748, Garching, Germany.
- Universität Bonn, Faculty of Mathematics and Natural Sciences, 53115, Bonn, Germany.
| |
Collapse
|
8
|
Boolean function metrics can assist modelers to check and choose logical rules. J Theor Biol 2022; 538:111025. [DOI: 10.1016/j.jtbi.2022.111025] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2021] [Revised: 12/07/2021] [Accepted: 01/10/2022] [Indexed: 12/25/2022]
|
9
|
Integrating Patient-Specific Information into Logic Models of Complex Diseases: Application to Acute Myeloid Leukemia. J Pers Med 2021; 11:jpm11020117. [PMID: 33578936 PMCID: PMC7916657 DOI: 10.3390/jpm11020117] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Revised: 02/05/2021] [Accepted: 02/05/2021] [Indexed: 12/12/2022] Open
Abstract
High throughput technologies such as deep sequencing and proteomics are increasingly becoming mainstream in clinical practice and support diagnosis and patient stratification. Developing computational models that recapitulate cell physiology and its perturbations in disease is a required step to help with the interpretation of results of high content experiments and to devise personalized treatments. As complete cell-models are difficult to achieve, given limited experimental information and insurmountable computational problems, approximate approaches should be considered. We present here a general approach to modeling complex diseases by embedding patient-specific genomics data into actionable logic models that take into account prior knowledge. We apply the strategy to acute myeloid leukemia (AML) and assemble a network of logical relationships linking most of the genes that are found frequently mutated in AML patients. We derive Boolean models from this network and we show that by priming the model with genomic data we can infer relevant patient-specific clinical features. Here we propose that the integration of literature-derived causal networks with patient-specific data should be explored to help bedside decisions.
Collapse
|
10
|
Abstract
Motivation Cells regulate themselves via dizzyingly complex biochemical processes called signaling pathways. These are usually depicted as a network, where nodes represent proteins and edges indicate their influence on each other. In order to understand diseases and therapies at the cellular level, it is crucial to have an accurate understanding of the signaling pathways at work. Since signaling pathways can be modified by disease, the ability to infer signaling pathways from condition- or patient-specific data is highly valuable. A variety of techniques exist for inferring signaling pathways. We build on past works that formulate signaling pathway inference as a Dynamic Bayesian Network structure estimation problem on phosphoproteomic time course data. We take a Bayesian approach, using Markov Chain Monte Carlo to estimate a posterior distribution over possible Dynamic Bayesian Network structures. Our primary contributions are (i) a novel proposal distribution that efficiently samples sparse graphs and (ii) the relaxation of common restrictive modeling assumptions. Results We implement our method, named Sparse Signaling Pathway Sampling, in Julia using the Gen probabilistic programming language. Probabilistic programming is a powerful methodology for building statistical models. The resulting code is modular, extensible and legible. The Gen language, in particular, allows us to customize our inference procedure for biological graphs and ensure efficient sampling. We evaluate our algorithm on simulated data and the HPN-DREAM pathway reconstruction challenge, comparing our performance against a variety of baseline methods. Our results demonstrate the vast potential for probabilistic programming, and Gen specifically, for biological network inference. Availability and implementation Find the full codebase at https://github.com/gitter-lab/ssps. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- David Merrell
- Department of Computer Sciences, University of Wisconsin-Madison, Madison, WI 53706, USA.,Morgridge Institute for Research, Madison, WI 53715, USA
| | - Anthony Gitter
- Department of Computer Sciences, University of Wisconsin-Madison, Madison, WI 53706, USA.,Morgridge Institute for Research, Madison, WI 53715, USA.,Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI 53726, USA
| |
Collapse
|
11
|
Khalilimeybodi A, Paap AM, Christiansen SLM, Saucerman JJ. Context-specific network modeling identifies new crosstalk in β-adrenergic cardiac hypertrophy. PLoS Comput Biol 2020; 16:e1008490. [PMID: 33338038 PMCID: PMC7781532 DOI: 10.1371/journal.pcbi.1008490] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2020] [Revised: 01/04/2021] [Accepted: 11/05/2020] [Indexed: 11/25/2022] Open
Abstract
Cardiac hypertrophy is a context-dependent phenomenon wherein a myriad of biochemical and biomechanical factors regulate myocardial growth through a complex large-scale signaling network. Although numerous studies have investigated hypertrophic signaling pathways, less is known about hypertrophy signaling as a whole network and how this network acts in a context-dependent manner. Here, we developed a systematic approach, CLASSED (Context-specific Logic-bASed Signaling nEtwork Development), to revise a large-scale signaling model based on context-specific data and identify main reactions and new crosstalks regulating context-specific response. CLASSED involves four sequential stages with an automated validation module as a core which builds a logic-based ODE model from the interaction graph and outputs the model validation percent. The context-specific model is developed by estimation of default parameters, classified qualitative validation, hybrid Morris-Sobol global sensitivity analysis, and discovery of missing context-dependent crosstalks. Applying this pipeline to our prior-knowledge hypertrophy network with context-specific data revealed key signaling reactions which distinctly regulate cell response to isoproterenol, phenylephrine, angiotensin II and stretch. Furthermore, with CLASSED we developed a context-specific model of β-adrenergic cardiac hypertrophy. The model predicted new crosstalks between calcium/calmodulin-dependent pathways and upstream signaling of Ras in the ISO-specific context. Experiments in cardiomyocytes validated the model’s predictions on the role of CaMKII-Gβγ and CaN-Gβγ interactions in mediating hypertrophic signals in ISO-specific context and revealed a difference in the phosphorylation magnitude and translocation of ERK1/2 between cardiac myocytes and fibroblasts. CLASSED is a systematic approach for developing context-specific large-scale signaling networks, yielding insights into new-found crosstalks in β-adrenergic cardiac hypertrophy. Pathological cardiac hypertrophy is a disease in which the heart grows abnormally in response to different motivators such as high blood pressure or variations in hormones and growth factors. The shape of the heart after its growth depends on the context in which it grows. Since cell signaling in the cardiac cells plays a key role in the determination of heart shape, a thorough understanding of cardiac cells signaling in each context enlightens the mechanisms which control response of cardiac cells. However, cell signaling in cardiac hypertrophy comprises a complex web of pathways with numerous interactions, and predicting how these interactions control the hypertrophic signal in each context is not achievable by only experiments or general computational models. To address this need, we developed an approach to bring together the experimental data of each context with a signaling network curated from literature to identify the main players of cardiac cells response in each context and attain the context-specific models of cardiac hypertrophy. By utilizing our approach, we identified the main regulators of cardiac hypertrophy in four important contexts. We developed a network model of β-adrenergic cardiac hypertrophy, and predicted and validated new interactions that regulate cardiac cells response in this context.
Collapse
Affiliation(s)
- Ali Khalilimeybodi
- Department of Biomedical Engineering, University of Virginia, Charlottesville, Virginia, United States of America
| | - Alexander M. Paap
- Department of Biomedical Engineering, University of Virginia, Charlottesville, Virginia, United States of America
| | - Steven L. M. Christiansen
- Department of Biomedical Engineering, University of Virginia, Charlottesville, Virginia, United States of America
| | - Jeffrey J. Saucerman
- Department of Biomedical Engineering, University of Virginia, Charlottesville, Virginia, United States of America
- Robert M. Berne Cardiovascular Research Center, University of Virginia, Charlottesville, Virginia, United States of America
- * E-mail:
| |
Collapse
|
12
|
Lee D, Jayaraman A, Kwon JS. Development of a hybrid model for a partially known intracellular signaling pathway through correction term estimation and neural network modeling. PLoS Comput Biol 2020; 16:e1008472. [PMID: 33315899 PMCID: PMC7769624 DOI: 10.1371/journal.pcbi.1008472] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2020] [Revised: 12/28/2020] [Accepted: 10/26/2020] [Indexed: 12/30/2022] Open
Abstract
Developing an accurate first-principle model is an important step in employing systems biology approaches to analyze an intracellular signaling pathway. However, an accurate first-principle model is difficult to be developed since it requires in-depth mechanistic understandings of the signaling pathway. Since underlying mechanisms such as the reaction network structure are not fully understood, significant discrepancy exists between predicted and actual signaling dynamics. Motivated by these considerations, this work proposes a hybrid modeling approach that combines a first-principle model and an artificial neural network (ANN) model so that predictions of the hybrid model surpass those of the original model. First, the proposed approach determines an optimal subset of model states whose dynamics should be corrected by the ANN by examining the correlation between each state and outputs through relative order. Second, an L2-regularized least-squares problem is solved to infer values of the correction terms that are necessary to minimize the discrepancy between the model predictions and available measurements. Third, an ANN is developed to generalize relationships between the values of the correction terms and the system dynamics. Lastly, the original first-principle model is coupled with the developed ANN to finalize the hybrid model development so that the model will possess generalized prediction capabilities while retaining the model interpretability. We have successfully validated the proposed methodology with two case studies, simplified apoptosis and lipopolysaccharide-induced NFκB signaling pathways, to develop hybrid models with in silico and in vitro measurements, respectively. An intracellular signaling pathway is often represented by a set of nonlinear ordinary differential equations, which translate our current knowledge about the signaling pathway into a testable mathematical model. However, predictions from such models are often subject to high uncertainty since many signaling pathways are only partially known beforehand. In this study, we propose a systematic approach to develop a hybrid model to improve model accuracy by combining machine learning and the first-principle modeling. Specifically, model correction terms are learned from discrepancy between model predictions and measurements, and these terms are added to the first-principle model to enhance the prediction accuracy. Once these correction terms are learned from the data, an artificial neural network (ANN) model is developed to find an empirical relation between the model and the correction terms so that the developed ANN can be used to posses improved predictive capabilities even in new operating conditions (i.e., generalizability). The final hybrid model is then constructed by coupling the first-principle model with the developed ANN.
Collapse
Affiliation(s)
- Dongheon Lee
- Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, Texas, USA
- Texas A&M Energy Institute, Texas A&M University, College Station, Texas, USA
| | - Arul Jayaraman
- Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, Texas, USA
- Department of Biomedical Engineering, Texas A&M University, College Station, Texas, USA
| | - Joseph S. Kwon
- Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, Texas, USA
- Texas A&M Energy Institute, Texas A&M University, College Station, Texas, USA
- * E-mail:
| |
Collapse
|
13
|
Kumar P, Sinha R, Shukla P. Artificial intelligence and synthetic biology approaches for human gut microbiome. Crit Rev Food Sci Nutr 2020; 62:2103-2121. [PMID: 33249867 DOI: 10.1080/10408398.2020.1850415] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
The gut microbiome comprises a variety of microorganisms whose genes encode proteins to carry out crucial metabolic functions that are responsible for the majority of health-related issues in human beings. The advent of the technological revolution in artificial intelligence (AI) assisted synthetic biology (SB) approaches will play a vital role in the modulating the therapeutic and nutritive potential of probiotics. This can turn human gut as a reservoir of beneficial bacterial colonies having an immense role in immunity, digestion, brain function, and other health benefits. Hence, in the present review, we have discussed the role of several gene editing tools and approaches in synthetic biology that have equipped us with novel tools like Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR-Cas) systems to precisely engineer probiotics for diagnostic, therapeutic and nutritive value. A brief discussion over the AI techniques to understand the metagenomic data from the healthy and diseased gut microbiome is also presented. Further, the role of AI in potentially impacting the pace of developments in SB and its current challenges is also discussed. The review also describes the health benefits conferred by engineered microbes through the production of biochemicals, nutraceuticals, drugs or biotherapeutics molecules etc. Finally, the review concludes with the challenges and regulatory concerns in adopting synthetic biology engineered microbes for clinical applications. Thus, the review presents a synergistic approach of AI and SB toward human gut microbiome for better health which will provide interesting clues to researchers working in the area of rapidly evolving food and nutrition science.
Collapse
Affiliation(s)
- Prasoon Kumar
- Department of Biotechnology and Medical Engineering, National Institute of Technology, Rourkela, India.,Department of Medical Devices, National Institute of Pharmaceutical Education and Research, Ahmedabad, India
| | | | - Pratyoosh Shukla
- School of Biotechnology, Institute of Science, Banaras Hindu University, Varanasi, India.,Enzyme Technology and Protein Bioinformatics Laboratory, Department of Microbiology, Maharshi Dayanand University, Rohtak, India
| |
Collapse
|
14
|
Rybiński M, Möller S, Sunnåker M, Lormeau C, Stelling J. TopoFilter: a MATLAB package for mechanistic model identification in systems biology. BMC Bioinformatics 2020; 21:34. [PMID: 31996136 PMCID: PMC6990465 DOI: 10.1186/s12859-020-3343-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Accepted: 01/08/2020] [Indexed: 12/27/2022] Open
Abstract
Background To develop mechanistic dynamic models in systems biology, one often needs to identify all (or minimal) representations of the biological processes that are consistent with experimental data, out of a potentially large set of hypothetical mechanisms. However, a simple enumeration of all alternatives becomes quickly intractable when the number of model parameters grows. Selecting appropriate dynamic models out of a large ensemble of models, taking the uncertainty in our biological knowledge and in the experimental data into account, is therefore a key current problem in systems biology. Results The TopoFilter package addresses this problem in a heuristic and automated fashion by implementing the previously described topological filtering method for Bayesian model selection. It includes a core heuristic for searching the space of submodels of a parametrized model, coupled with a sampling-based exploration of the parameter space. Recent developments of the method allow to balance exhaustiveness and speed of the model space search, to efficiently re-sample parameters, to parallelize the search, and to use custom scoring functions. We use a theoretical example to motivate these features and then demonstrate TopoFilter’s applicability for a yeast signaling network with more than 250’000 possible model structures. Conclusions TopoFilter is a flexible software framework that makes Bayesian model selection and reduction efficient and scalable to network models of a complexity that represents contemporary problems in, for example, cell signaling. TopoFilter is open-source, available under the GPL-3.0 license at https://gitlab.com/csb.ethz/TopoFilter. It includes installation instructions, a quickstart guide, a description of all package options, and multiple examples.
Collapse
Affiliation(s)
- Mikołaj Rybiński
- Department of Biosystems Science and Engineering and SIB Swiss Institute of Bioinformatics, ETH Zurich, Mattenstr. 26, Basel, 4058, Switzerland.,ID Scientific IT Services, ETH Zurich, Zurich, 8092, Switzerland
| | - Simon Möller
- Department of Biosystems Science and Engineering and SIB Swiss Institute of Bioinformatics, ETH Zurich, Mattenstr. 26, Basel, 4058, Switzerland
| | - Mikael Sunnåker
- Department of Biosystems Science and Engineering and SIB Swiss Institute of Bioinformatics, ETH Zurich, Mattenstr. 26, Basel, 4058, Switzerland
| | - Claude Lormeau
- Department of Biosystems Science and Engineering and SIB Swiss Institute of Bioinformatics, ETH Zurich, Mattenstr. 26, Basel, 4058, Switzerland.,Life Science Zurich Ph.D. program "Systems Biology", Zurich, 8092, Switzerland
| | - Jörg Stelling
- Department of Biosystems Science and Engineering and SIB Swiss Institute of Bioinformatics, ETH Zurich, Mattenstr. 26, Basel, 4058, Switzerland.
| |
Collapse
|
15
|
Meier-Schellersheim M, Varma R, Angermann BR. Mechanistic Models of Cellular Signaling, Cytokine Crosstalk, and Cell-Cell Communication in Immunology. Front Immunol 2019; 10:2268. [PMID: 31681261 PMCID: PMC6798038 DOI: 10.3389/fimmu.2019.02268] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2019] [Accepted: 09/09/2019] [Indexed: 12/21/2022] Open
Abstract
The cells of the immune system respond to a great variety of different signals that frequently reach them simultaneously. Computational models of signaling pathways and cellular behavior can help us explore the biochemical mechanisms at play during such responses, in particular when those models aim at incorporating molecular details of intracellular reaction networks. Such detailed models can encompass hypotheses about the interactions among molecular binding domains and how these interactions are modulated by, for instance, post-translational modifications, or steric constraints in multi-molecular complexes. In this way, the models become formal representations of mechanistic immunological hypotheses that can be tested through quantitative simulations. Due to the large number of parameters (molecular abundances, association-, dissociation-, and enzymatic transformation rates) the goal of simulating the models can, however, in many cases no longer be the fitting of particular parameter values. Rather, the simulations perform sweeps through parameter space to test whether a model can account for certain experimentally observed features when allowing the parameter values to vary within experimentally determined or physiologically reasonable ranges. We illustrate how this approach can be used to explore possible mechanisms of immunological pathway crosstalk. Probing the input-output behavior of mechanistic pathway models through systematic simulated variations of receptor stimuli will soon allow us to derive cell population behavior from single-cell models, thereby bridging a scale gap that currently still is frequently addressed through heuristic phenomenological multi-scale models.
Collapse
Affiliation(s)
- Martin Meier-Schellersheim
- Laboratory of Immune System Biology, National Institute of Allergy and Infectious Diseases (NIAID), National Institutes of Health (NIH), Bethesda, MD, United States
| | | | - Bastian R Angermann
- Translational Science and Experimental Medicine, Early Respiratory, Inflammation and Autoimmunity, BioPharmaceuticals, AstraZeneca, Gothenburg, Sweden
| |
Collapse
|
16
|
Gabel M, Hohl T, Imle A, Fackler OT, Graw F. FAMoS: A Flexible and dynamic Algorithm for Model Selection to analyse complex systems dynamics. PLoS Comput Biol 2019; 15:e1007230. [PMID: 31419221 PMCID: PMC6697322 DOI: 10.1371/journal.pcbi.1007230] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2019] [Accepted: 06/30/2019] [Indexed: 01/12/2023] Open
Abstract
Most biological systems are difficult to analyse due to a multitude of interacting components and the concomitant lack of information about the essential dynamics. Finding appropriate models that provide a systematic description of such biological systems and that help to identify their relevant factors and processes can be challenging given the sheer number of possibilities. Model selection algorithms that evaluate the performance of a multitude of different models against experimental data provide a useful tool to identify appropriate model structures. However, many algorithms addressing the analysis of complex dynamical systems, as they are often used in biology, compare a preselected number of models or rely on exhaustive searches of the total model space which might be unfeasible dependent on the number of possibilities. Therefore, we developed an algorithm that is able to perform model selection on complex systems and searches large model spaces in a dynamical way. Our algorithm includes local and newly developed non-local search methods that can prevent the algorithm from ending up in local minima of the model space by accounting for structurally similar processes. We tested and validated the algorithm based on simulated data and showed its flexibility for handling different model structures. We also used the algorithm to analyse experimental data on the cell proliferation dynamics of CD4+ and CD8+ T cells that were cultured under different conditions. Our analyses indicated dynamical changes within the proliferation potential of cells that was reduced within tissue-like 3D ex vivo cultures compared to suspension. Due to the flexibility in handling various model structures, the algorithm is applicable to a large variety of different biological problems and represents a useful tool for the data-oriented evaluation of complex model spaces. Identifying the systematic interactions of multiple components within a complex biological system can be challenging due to the number of potential processes and the concomitant lack of information about the essential dynamics. Selection algorithms that allow an automated evaluation of a large number of different models provide a useful tool in identifying the systematic relationships between experimental data. However, many of the existing model selection algorithms are not able to address complex model structures, such as systems of differential equations, and partly rely on local or exhaustive search methods which are inappropriate for the analysis of various biological systems. Therefore, we developed a flexible model selection algorithm that performs a robust and dynamical search of large model spaces to identify complex systems dynamics and applied it to the analysis of T cell proliferation dynamics within different culture conditions. The algorithm, which is available as an R-package, provides an advanced tool for the analysis of complex systems behaviour and, due to its flexible structure, can be applied to a large variety of biological problems.
Collapse
Affiliation(s)
- Michael Gabel
- Center for Modelling and Simulation in the Biosciences, BioQuant-Center, Heidelberg University, Heidelberg, Germany
- * E-mail: (MG); (FG)
| | - Tobias Hohl
- Center for Modelling and Simulation in the Biosciences, BioQuant-Center, Heidelberg University, Heidelberg, Germany
| | - Andrea Imle
- Department of Infectious Diseases, Centre for Integrative Infectious Disease Research (CIID), Integrative Virology, University Hospital Heidelberg, Heidelberg, Germany
| | - Oliver T. Fackler
- Department of Infectious Diseases, Centre for Integrative Infectious Disease Research (CIID), Integrative Virology, University Hospital Heidelberg, Heidelberg, Germany
| | - Frederik Graw
- Center for Modelling and Simulation in the Biosciences, BioQuant-Center, Heidelberg University, Heidelberg, Germany
- * E-mail: (MG); (FG)
| |
Collapse
|
17
|
Tian S, Wang C, Wang B. Incorporating Pathway Information into Feature Selection towards Better Performed Gene Signatures. BIOMED RESEARCH INTERNATIONAL 2019; 2019:2497509. [PMID: 31073522 PMCID: PMC6470448 DOI: 10.1155/2019/2497509] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/23/2018] [Accepted: 03/07/2019] [Indexed: 12/29/2022]
Abstract
To analyze gene expression data with sophisticated grouping structures and to extract hidden patterns from such data, feature selection is of critical importance. It is well known that genes do not function in isolation but rather work together within various metabolic, regulatory, and signaling pathways. If the biological knowledge contained within these pathways is taken into account, the resulting method is a pathway-based algorithm. Studies have demonstrated that a pathway-based method usually outperforms its gene-based counterpart in which no biological knowledge is considered. In this article, a pathway-based feature selection is firstly divided into three major categories, namely, pathway-level selection, bilevel selection, and pathway-guided gene selection. With bilevel selection methods being regarded as a special case of pathway-guided gene selection process, we discuss pathway-guided gene selection methods in detail and the importance of penalization in such methods. Last, we point out the potential utilizations of pathway-guided gene selection in one active research avenue, namely, to analyze longitudinal gene expression data. We believe this article provides valuable insights for computational biologists and biostatisticians so that they can make biology more computable.
Collapse
Affiliation(s)
- Suyan Tian
- Division of Clinical Research, The First Hospital of Jilin University, 71 Xinmin Street, Changchun, Jilin 130021, China
| | - Chi Wang
- Department of Biostatistics, Markey Cancer Center, The University of Kentucky, 800 Rose St., Lexington, KY 40536, USA
| | - Bing Wang
- School of Life Science, Jilin University, 2699 Qianjin Street, Changchun, Jilin 130012, China
| |
Collapse
|
18
|
Cova TFGG, Bento DJ, Nunes SCC. Computational Approaches in Theranostics: Mining and Predicting Cancer Data. Pharmaceutics 2019; 11:E119. [PMID: 30871264 PMCID: PMC6471740 DOI: 10.3390/pharmaceutics11030119] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2019] [Revised: 02/26/2019] [Accepted: 03/07/2019] [Indexed: 02/02/2023] Open
Abstract
The ability to understand the complexity of cancer-related data has been prompted by the applications of (1) computer and data sciences, including data mining, predictive analytics, machine learning, and artificial intelligence, and (2) advances in imaging technology and probe development. Computational modelling and simulation are systematic and cost-effective tools able to identify important temporal/spatial patterns (and relationships), characterize distinct molecular features of cancer states, and address other relevant aspects, including tumor detection and heterogeneity, progression and metastasis, and drug resistance. These approaches have provided invaluable insights for improving the experimental design of therapeutic delivery systems and for increasing the translational value of the results obtained from early and preclinical studies. The big question is: Could cancer theranostics be determined and controlled in silico? This review describes the recent progress in the development of computational models and methods used to facilitate research on the molecular basis of cancer and on the respective diagnosis and optimized treatment, with particular emphasis on the design and optimization of theranostic systems. The current role of computational approaches is providing innovative, incremental, and complementary data-driven solutions for the prediction, simplification, and characterization of cancer and intrinsic mechanisms, and to promote new data-intensive, accurate diagnostics and therapeutics.
Collapse
Affiliation(s)
- Tânia F G G Cova
- Coimbra Chemistry Centre, Department of Chemistry, Faculty of Sciences and Technology, University of Coimbra, 3004-535 Coimbra, Portugal.
| | - Daniel J Bento
- Coimbra Chemistry Centre, Department of Chemistry, Faculty of Sciences and Technology, University of Coimbra, 3004-535 Coimbra, Portugal.
| | - Sandra C C Nunes
- Coimbra Chemistry Centre, Department of Chemistry, Faculty of Sciences and Technology, University of Coimbra, 3004-535 Coimbra, Portugal.
| |
Collapse
|
19
|
|
20
|
Efficient Parameter Estimation Enables the Prediction of Drug Response Using a Mechanistic Pan-Cancer Pathway Model. Cell Syst 2018; 7:567-579.e6. [DOI: 10.1016/j.cels.2018.10.013] [Citation(s) in RCA: 74] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2018] [Revised: 09/07/2018] [Accepted: 10/29/2018] [Indexed: 12/25/2022]
|
21
|
Reconstructing phosphorylation signalling networks from quantitative phosphoproteomic data. Essays Biochem 2018; 62:525-534. [PMID: 30072490 PMCID: PMC6204553 DOI: 10.1042/ebc20180019] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2018] [Revised: 06/25/2018] [Accepted: 06/26/2018] [Indexed: 12/25/2022]
Abstract
Cascades of phosphorylation between protein kinases comprise a core mechanism in the integration and propagation of intracellular signals. Although we have accumulated a wealth of knowledge around some such pathways, this is subject to study biases and much remains to be uncovered. Phosphoproteomics, the identification and quantification of phosphorylated proteins on a proteomic scale, provides a high-throughput means of interrogating the state of intracellular phosphorylation, both at the pathway level and at the whole-cell level. In this review, we discuss methods for using human quantitative phosphoproteomic data to reconstruct the underlying signalling networks that generated it. We address several challenges imposed by the data on such analyses and we consider promising advances towards reconstructing unbiased, kinome-scale signalling networks.
Collapse
|
22
|
Barbosa S, Niebel B, Wolf S, Mauch K, Takors R. A guide to gene regulatory network inference for obtaining predictive solutions: Underlying assumptions and fundamental biological and data constraints. Biosystems 2018; 174:37-48. [PMID: 30312740 DOI: 10.1016/j.biosystems.2018.10.008] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2018] [Revised: 10/05/2018] [Accepted: 10/08/2018] [Indexed: 02/07/2023]
Abstract
The study of biological systems at a system level has become a reality due to the increasing powerful computational approaches able to handle increasingly larger datasets. Uncovering the dynamic nature of gene regulatory networks in order to attain a system level understanding and improve the predictive power of biological models is an important research field in systems biology. The task itself presents several challenges, since the problem is of combinatorial nature and highly depends on several biological constraints and also the intended application. Given the intrinsic interdisciplinary nature of gene regulatory network inference, we present a review on the currently available approaches, their challenges and limitations. We propose guidelines to select the most appropriate method considering the underlying assumptions and fundamental biological and data constraints.
Collapse
Affiliation(s)
- Sara Barbosa
- Insilico Biotechnology AG, Meitnerstrasse 9, 70563 Stuttgart, Germany.
| | - Bastian Niebel
- Insilico Biotechnology AG, Meitnerstrasse 9, 70563 Stuttgart, Germany
| | - Sebastian Wolf
- Insilico Biotechnology AG, Meitnerstrasse 9, 70563 Stuttgart, Germany
| | - Klaus Mauch
- Insilico Biotechnology AG, Meitnerstrasse 9, 70563 Stuttgart, Germany
| | - Ralf Takors
- Institute of Biochemical Engineering, University of Stuttgart, Allmandring 31, 70569 Stuttgart, Germany
| |
Collapse
|
23
|
Inferring dynamic topology for decoding spatiotemporal structures in complex heterogeneous networks. Proc Natl Acad Sci U S A 2018; 115:9300-9305. [PMID: 30150403 PMCID: PMC6140519 DOI: 10.1073/pnas.1721286115] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Inferring connections forms a critical step toward understanding large and diverse complex networks. To date, reliable and efficient methods for the reconstruction of network topology from measurement data remain a challenge due to the high complexity and nonlinearity of the system dynamics. These obstacles also form a bottleneck for analyzing and controlling the dynamic structures (e.g., synchrony) and collective behavior in such complex networks. The novel contribution of this work is to develop a unified data-driven approach to reliably and efficiently reveal the dynamic topology of complex networks in different scales—from cells to societies. The developed technique provides guidelines for the refinement of experimental designs toward a comprehensive understanding of complex heterogeneous networks. Extracting complex interactions (i.e., dynamic topologies) has been an essential, but difficult, step toward understanding large, complex, and diverse systems including biological, financial, and electrical networks. However, reliable and efficient methods for the recovery or estimation of network topology remain a challenge due to the tremendous scale of emerging systems (e.g., brain and social networks) and the inherent nonlinearity within and between individual units. We develop a unified, data-driven approach to efficiently infer connections of networks (ICON). We apply ICON to determine topology of networks of oscillators with different periodicities, degree nodes, coupling functions, and time scales, arising in silico, and in electrochemistry, neuronal networks, and groups of mice. This method enables the formulation of these large-scale, nonlinear estimation problems as a linear inverse problem that can be solved using parallel computing. Working with data from networks, ICON is robust and versatile enough to reliably reveal full and partial resonance among fast chemical oscillators, coherent circadian rhythms among hundreds of cells, and functional connectivity mediating social synchronization of circadian rhythmicity among mice over weeks.
Collapse
|
24
|
Henriques D, Alonso-Del-Real J, Querol A, Balsa-Canto E. Saccharomyces cerevisiae and S. kudriavzevii Synthetic Wine Fermentation Performance Dissected by Predictive Modeling. Front Microbiol 2018; 9:88. [PMID: 29456524 PMCID: PMC5801724 DOI: 10.3389/fmicb.2018.00088] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2017] [Accepted: 01/15/2018] [Indexed: 12/22/2022] Open
Abstract
Wineries face unprecedented challenges due to new market demands and climate change effects on wine quality. New yeast starters including non-conventional Saccharomyces species, such as S. kudriavzevii, may contribute to deal with some of these challenges. The design of new fermentations using non-conventional yeasts requires an improved understanding of the physiology and metabolism of these cells. Dynamic modeling brings the potential of exploring the most relevant mechanisms and designing optimal processes more systematically. In this work we explore mechanisms by means of a model selection, reduction and cross-validation pipeline which enables to dissect the most relevant fermentation features for the species under consideration, Saccharomyces cerevisiae T73 and Saccharomyces kudriavzevii CR85. The pipeline involved the comparison of a collection of models which incorporate several alternative mechanisms with emphasis on the inhibitory effects due to temperature and ethanol. We focused on defining a minimal model with the minimum number of parameters, to maximize the identifiability and the quality of cross-validation. The selected model was then used to highlight differences in behavior between species. The analysis of model parameters would indicate that the specific growth rate and the transport of hexoses at initial times are higher for S. cervisiae T73 while S. kudriavzevii CR85 diverts more flux for glycerol production and cellular maintenance. As a result, the fermentations with S. kudriavzevii CR85 are typically slower; produce less ethanol but higher glycerol. Finally, we also explored optimal initial inoculation and process temperature to find the best compromise between final product characteristics and fermentation duration. Results reveal that the production of glycerol is distinctive in S. kudriavzevii CR85, it was not possible to achieve the same production of glycerol with S. cervisiae T73 in any of the conditions tested. This result brings the idea that the optimal design of mixed cultures may have an enormous potential for the improvement of final wine quality.
Collapse
Affiliation(s)
| | - Javier Alonso-Del-Real
- Grupo de Biología de Sistemas en Levaduras de Interés Biotecnológico, IATA-CSIC, Valencia, Spain
| | - Amparo Querol
- Grupo de Biología de Sistemas en Levaduras de Interés Biotecnológico, IATA-CSIC, Valencia, Spain
| | | |
Collapse
|
25
|
Cuperlovic-Culf M. Machine Learning Methods for Analysis of Metabolic Data and Metabolic Pathway Modeling. Metabolites 2018; 8:E4. [PMID: 29324649 PMCID: PMC5875994 DOI: 10.3390/metabo8010004] [Citation(s) in RCA: 80] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2017] [Revised: 01/08/2018] [Accepted: 01/09/2018] [Indexed: 01/15/2023] Open
Abstract
Machine learning uses experimental data to optimize clustering or classification of samples or features, or to develop, augment or verify models that can be used to predict behavior or properties of systems. It is expected that machine learning will help provide actionable knowledge from a variety of big data including metabolomics data, as well as results of metabolism models. A variety of machine learning methods has been applied in bioinformatics and metabolism analyses including self-organizing maps, support vector machines, the kernel machine, Bayesian networks or fuzzy logic. To a lesser extent, machine learning has also been utilized to take advantage of the increasing availability of genomics and metabolomics data for the optimization of metabolic network models and their analysis. In this context, machine learning has aided the development of metabolic networks, the calculation of parameters for stoichiometric and kinetic models, as well as the analysis of major features in the model for the optimal application of bioreactors. Examples of this very interesting, albeit highly complex, application of machine learning for metabolism modeling will be the primary focus of this review presenting several different types of applications for model optimization, parameter determination or system analysis using models, as well as the utilization of several different types of machine learning technologies.
Collapse
Affiliation(s)
- Miroslava Cuperlovic-Culf
- Digital Technologies Research Center, National Research Council of Canada, 1200 Montreal Road, Ottawa, ON K1A 0R6, Canada.
| |
Collapse
|
26
|
Choi K. Robust Approaches to Generating Reliable Predictive Models in Systems Biology. RNA TECHNOLOGIES 2018. [DOI: 10.1007/978-3-319-92967-5_15] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
|
27
|
Ogilvie LA, Kovachev A, Wierling C, Lange BMH, Lehrach H. Models of Models: A Translational Route for Cancer Treatment and Drug Development. Front Oncol 2017; 7:219. [PMID: 28971064 PMCID: PMC5609574 DOI: 10.3389/fonc.2017.00219] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2017] [Accepted: 09/01/2017] [Indexed: 12/12/2022] Open
Abstract
Every patient and every disease is different. Each patient therefore requires a personalized treatment approach. For technical reasons, a personalized approach is feasible for treatment strategies such as surgery, but not for drug-based therapy or drug development. The development of individual mechanistic models of the disease process in every patient offers the possibility of attaining truly personalized drug-based therapy and prevention. The concept of virtual clinical trials and the integrated use of in silico, in vitro, and in vivo models in preclinical development could lead to significant gains in efficiency and order of magnitude increases in the cost effectiveness of drug development and approval. We have developed mechanistic computational models of large-scale cellular signal transduction networks for prediction of drug effects and functional responses, based on patient-specific multi-level omics profiles. However, a major barrier to the use of such models in a clinical and developmental context is the reliability of predictions. Here we detail how the approach of using “models of models” has the potential to impact cancer treatment and drug development. We describe the iterative refinement process that leverages the flexibility of experimental systems to generate highly dimensional data, which can be used to train and validate computational model parameters and improve model predictions. In this way, highly optimized computational models with robust predictive capacity can be generated. Such models open up a number of opportunities for cancer drug treatment and development, from enhancing the design of experimental studies, reducing costs, and improving animal welfare, to increasing the translational value of results generated.
Collapse
Affiliation(s)
| | | | | | | | - Hans Lehrach
- Alacris Theranostics GmbH, Berlin, Germany.,Max Planck Institute for Molecular Genetics, Berlin, Germany
| |
Collapse
|
28
|
A parallel metaheuristic for large mixed-integer dynamic optimization problems, with applications in computational biology. PLoS One 2017; 12:e0182186. [PMID: 28813442 PMCID: PMC5557587 DOI: 10.1371/journal.pone.0182186] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2017] [Accepted: 07/13/2017] [Indexed: 11/24/2022] Open
Abstract
Background We consider a general class of global optimization problems dealing with nonlinear dynamic models. Although this class is relevant to many areas of science and engineering, here we are interested in applying this framework to the reverse engineering problem in computational systems biology, which yields very large mixed-integer dynamic optimization (MIDO) problems. In particular, we consider the framework of logic-based ordinary differential equations (ODEs). Methods We present saCeSS2, a parallel method for the solution of this class of problems. This method is based on an parallel cooperative scatter search metaheuristic, with new mechanisms of self-adaptation and specific extensions to handle large mixed-integer problems. We have paid special attention to the avoidance of convergence stagnation using adaptive cooperation strategies tailored to this class of problems. Results We illustrate its performance with a set of three very challenging case studies from the domain of dynamic modelling of cell signaling. The simpler case study considers a synthetic signaling pathway and has 84 continuous and 34 binary decision variables. A second case study considers the dynamic modeling of signaling in liver cancer using high-throughput data, and has 135 continuous and 109 binaries decision variables. The third case study is an extremely difficult problem related with breast cancer, involving 690 continuous and 138 binary decision variables. We report computational results obtained in different infrastructures, including a local cluster, a large supercomputer and a public cloud platform. Interestingly, the results show how the cooperation of individual parallel searches modifies the systemic properties of the sequential algorithm, achieving superlinear speedups compared to an individual search (e.g. speedups of 15 with 10 cores), and significantly improving (above a 60%) the performance with respect to a non-cooperative parallel scheme. The scalability of the method is also good (tests were performed using up to 300 cores). Conclusions These results demonstrate that saCeSS2 can be used to successfully reverse engineer large dynamic models of complex biological pathways. Further, these results open up new possibilities for other MIDO-based large-scale applications in the life sciences such as metabolic engineering, synthetic biology, drug scheduling.
Collapse
|
29
|
Babtie AC, Stumpf MPH. How to deal with parameters for whole-cell modelling. J R Soc Interface 2017; 14:20170237. [PMID: 28768879 PMCID: PMC5582120 DOI: 10.1098/rsif.2017.0237] [Citation(s) in RCA: 52] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2017] [Accepted: 06/22/2017] [Indexed: 11/12/2022] Open
Abstract
Dynamical systems describing whole cells are on the verge of becoming a reality. But as models of reality, they are only useful if we have realistic parameters for the molecular reaction rates and cell physiological processes. There is currently no suitable framework to reliably estimate hundreds, let alone thousands, of reaction rate parameters. Here, we map out the relative weaknesses and promises of different approaches aimed at redressing this issue. While suitable procedures for estimation or inference of the whole (vast) set of parameters will, in all likelihood, remain elusive, some hope can be drawn from the fact that much of the cellular behaviour may be explained in terms of smaller sets of parameters. Identifying such parameter sets and assessing their behaviour is now becoming possible even for very large systems of equations, and we expect such methods to become central tools in the development and analysis of whole-cell models.
Collapse
Affiliation(s)
- Ann C Babtie
- Department of Life Sciences, Imperial College London, London, UK
| | | |
Collapse
|