1
|
Nilsson A, Peters JM, Meimetis N, Bryson B, Lauffenburger DA. Artificial neural networks enable genome-scale simulations of intracellular signaling. Nat Commun 2022; 13:3069. [PMID: 35654811 PMCID: PMC9163072 DOI: 10.1038/s41467-022-30684-y] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2021] [Accepted: 05/11/2022] [Indexed: 12/14/2022] Open
Abstract
Mammalian cells adapt their functional state in response to external signals in form of ligands that bind receptors on the cell-surface. Mechanistically, this involves signal-processing through a complex network of molecular interactions that govern transcription factor activity patterns. Computer simulations of the information flow through this network could help predict cellular responses in health and disease. Here we develop a recurrent neural network framework constrained by prior knowledge of the signaling network with ligand-concentrations as input and transcription factor-activity as output. Applied to synthetic data, it predicts unseen test-data (Pearson correlation r = 0.98) and the effects of gene knockouts (r = 0.8). We stimulate macrophages with 59 different ligands, with and without the addition of lipopolysaccharide, and collect transcriptomics data. The framework predicts this data under cross-validation (r = 0.8) and knockout simulations suggest a role for RIPK1 in modulating the lipopolysaccharide response. This work demonstrates the feasibility of genome-scale simulations of intracellular signaling.
Collapse
Affiliation(s)
- Avlant Nilsson
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
- Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, SE 41296, Sweden
- Ragon Institute of MGH, MIT, and Harvard, Cambridge, MA, 02139, USA
| | - Joshua M Peters
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
- Ragon Institute of MGH, MIT, and Harvard, Cambridge, MA, 02139, USA
| | - Nikolaos Meimetis
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
| | - Bryan Bryson
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
- Ragon Institute of MGH, MIT, and Harvard, Cambridge, MA, 02139, USA
| | - Douglas A Lauffenburger
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA.
- Ragon Institute of MGH, MIT, and Harvard, Cambridge, MA, 02139, USA.
| |
Collapse
|
2
|
Baltussen MG, van de Wiel J, Fernández Regueiro CL, Jakštaitė M, Huck WTS. A Bayesian Approach to Extracting Kinetic Information from Artificial Enzymatic Networks. Anal Chem 2022; 94:7311-7318. [PMID: 35549162 PMCID: PMC9134183 DOI: 10.1021/acs.analchem.2c00659] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
In order to create artificial enzymatic networks capable of increasingly complex behavior, an improved methodology in understanding and controlling the kinetics of these networks is needed. Here, we introduce a Bayesian analysis method allowing for the accurate inference of enzyme kinetic parameters and determination of most likely reaction mechanisms, by combining data from different experiments and network topologies in a single probabilistic analysis framework. This Bayesian approach explicitly allows us to continuously improve our parameter estimates and behavior predictions by iteratively adding new data to our models, while automatically taking into account uncertainties introduced by the experimental setups or the chemical processes in general. We demonstrate the potential of this approach by characterizing systems of enzymes compartmentalized in beads inside flow reactors. The methods we introduce here provide a new approach to the design of increasingly complex artificial enzymatic networks, making the design of such networks more efficient, and robust against the accumulation of experimental errors.
Collapse
Affiliation(s)
- Mathieu G Baltussen
- Institute for Molecules and Materials, Radboud University Nijmegen, 6525 AJ, Nijmegen, The Netherlands
| | - Jeroen van de Wiel
- Institute for Molecules and Materials, Radboud University Nijmegen, 6525 AJ, Nijmegen, The Netherlands
| | | | - Miglė Jakštaitė
- Institute for Molecules and Materials, Radboud University Nijmegen, 6525 AJ, Nijmegen, The Netherlands
| | - Wilhelm T S Huck
- Institute for Molecules and Materials, Radboud University Nijmegen, 6525 AJ, Nijmegen, The Netherlands
| |
Collapse
|
3
|
Chen Y, Cheng J, Gupta A, Huang H, Xu S. Numerical method for parameter inference of systems of nonlinear ordinary differential equations with partial observations. ROYAL SOCIETY OPEN SCIENCE 2021; 8:210171. [PMID: 34350015 PMCID: PMC8316824 DOI: 10.1098/rsos.210171] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Accepted: 07/06/2021] [Indexed: 06/13/2023]
Abstract
Parameter inference of dynamical systems is a challenging task faced by many researchers and practitioners across various fields. In many applications, it is common that only limited variables are observable. In this paper, we propose a method for parameter inference of a system of nonlinear coupled ordinary differential equations with partial observations. Our method combines fast Gaussian process-based gradient matching and deterministic optimization algorithms. By using initial values obtained by Bayesian steps with low sampling numbers, our deterministic optimization algorithm is both accurate, robust and efficient with partial observations and large noise.
Collapse
Affiliation(s)
- Yu Chen
- School of Mathematics, Shanghai University of Finance and Economics, Shanghai, People’s Republic of China
- Centre for Quantitative Analysis and Modeling (CQAM), The Fields Institute for Research in Mathematical Sciences, 222 College Street, Toronto, Ontario, Canada
| | - Jin Cheng
- School of Mathematics, Shanghai University of Finance and Economics, Shanghai, People’s Republic of China
- School of Mathematical Sciences, Fudan University, Shanghai 200433, People’s Republic of China
| | - Arvind Gupta
- Computer Science, University of Toronto, Toronto, Ontario, Canada
| | - Huaxiong Huang
- Centre for Quantitative Analysis and Modeling (CQAM), The Fields Institute for Research in Mathematical Sciences, 222 College Street, Toronto, Ontario, Canada
- Computer Science, University of Toronto, Toronto, Ontario, Canada
- Joint Mathematical Research Centre of Beijing Normal University and BNU-HKBU United International College, Zhuhai, People’s Republic of China
- Department of Mathematics and Statistics, York University, Toronto, Ontario, Canada
| | - Shixin Xu
- Duke Kunshan University, 8 Duke Ave, Kunshan, Jiangsu, People’s Republic of China
| |
Collapse
|
4
|
Newmiwaka T, Engelhardt B, Wendland P, Kahl D, Fröhlich H, Kschischo M. SEEDS: data driven inference of structural model errors and unknown inputs for dynamic systems biology. Bioinformatics 2021; 37:1330-1331. [PMID: 32931565 DOI: 10.1093/bioinformatics/btaa786] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2020] [Revised: 08/05/2020] [Accepted: 09/03/2020] [Indexed: 11/14/2022] Open
Abstract
SUMMARY Dynamic models formulated as ordinary differential equations can provide information about the mechanistic and causal interactions in biological systems to guide targeted interventions and to design further experiments. Inaccurate knowledge about the structure, functional form and parameters of interactions is a major obstacle to mechanistic modeling. A further challenge is the open nature of biological systems which receive unknown inputs from their environment. The R-package SEEDS implements two recently developed algorithms to infer structural model errors and unknown inputs from output measurements. This information can facilitate efficient model recalibration as well as experimental design in the case of misfits between the initial model and data. AVAILABILITY AND IMPLEMENTATION For the R-package seeds, see the CRAN server https://cran.r-project.org/package=seeds.
Collapse
Affiliation(s)
- Tobias Newmiwaka
- Department of Mathematics and Technology, University of Applied Sciences Koblenz, RheinAhrCampus, Remagen 53424, Germany
| | - Benjamin Engelhardt
- Department of Mathematics and Technology, University of Applied Sciences Koblenz, RheinAhrCampus, Remagen 53424, Germany.,Bonn-Aachen International Center for IT (b-it), Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn 53115, Germany.,AbbVie Deutschland GmbH & Co. KG, Clinical Pharmacology and Pharmacometrics, Knollstrasse, Ludwigshafen 67061, Germany
| | - Philipp Wendland
- Department of Mathematics and Technology, University of Applied Sciences Koblenz, RheinAhrCampus, Remagen 53424, Germany
| | - Dominik Kahl
- Department of Mathematics and Technology, University of Applied Sciences Koblenz, RheinAhrCampus, Remagen 53424, Germany
| | - Holger Fröhlich
- Bonn-Aachen International Center for IT (b-it), Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn 53115, Germany
| | - Maik Kschischo
- Department of Mathematics and Technology, University of Applied Sciences Koblenz, RheinAhrCampus, Remagen 53424, Germany
| |
Collapse
|
5
|
Kahl D, Kschischo M. Searching for Errors in Models of Complex Dynamic Systems. Front Physiol 2021; 11:612590. [PMID: 33505318 PMCID: PMC7830364 DOI: 10.3389/fphys.2020.612590] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2020] [Accepted: 12/11/2020] [Indexed: 11/13/2022] Open
Abstract
Mathematical modeling is seen as a key step to understand, predict, and control the temporal dynamics of interacting systems in such diverse areas like physics, biology, medicine, and economics. However, for large and complex systems we usually have only partial knowledge about the network, the coupling functions, and the interactions with the environment governing the dynamic behavior. This incomplete knowledge induces structural model errors which can in turn be the cause of erroneous model predictions or misguided interpretations. Uncovering the location of such structural model errors in large networks can be a daunting task for a modeler. Here, we present a data driven method to search for structural model errors and to confine their position in large and complex dynamic networks. We introduce a coherence measure for pairs of network nodes, which indicates, how difficult it is to distinguish these nodes as sources of an error. By clustering network nodes into coherence groups and inferring the cluster inputs we can decide, which cluster is affected by an error. We demonstrate the utility of our method for the C. elegans neural network, for a signal transduction model for UV-B light induced morphogenesis and for synthetic examples.
Collapse
Affiliation(s)
- Dominik Kahl
- Mathematics and Technology, University of Applied Sciences Koblenz, Koblenz, Germany
| | - Maik Kschischo
- Mathematics and Technology, University of Applied Sciences Koblenz, Koblenz, Germany
| |
Collapse
|
6
|
Lee D, Jayaraman A, Kwon JS. Development of a hybrid model for a partially known intracellular signaling pathway through correction term estimation and neural network modeling. PLoS Comput Biol 2020; 16:e1008472. [PMID: 33315899 PMCID: PMC7769624 DOI: 10.1371/journal.pcbi.1008472] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2020] [Revised: 12/28/2020] [Accepted: 10/26/2020] [Indexed: 12/30/2022] Open
Abstract
Developing an accurate first-principle model is an important step in employing systems biology approaches to analyze an intracellular signaling pathway. However, an accurate first-principle model is difficult to be developed since it requires in-depth mechanistic understandings of the signaling pathway. Since underlying mechanisms such as the reaction network structure are not fully understood, significant discrepancy exists between predicted and actual signaling dynamics. Motivated by these considerations, this work proposes a hybrid modeling approach that combines a first-principle model and an artificial neural network (ANN) model so that predictions of the hybrid model surpass those of the original model. First, the proposed approach determines an optimal subset of model states whose dynamics should be corrected by the ANN by examining the correlation between each state and outputs through relative order. Second, an L2-regularized least-squares problem is solved to infer values of the correction terms that are necessary to minimize the discrepancy between the model predictions and available measurements. Third, an ANN is developed to generalize relationships between the values of the correction terms and the system dynamics. Lastly, the original first-principle model is coupled with the developed ANN to finalize the hybrid model development so that the model will possess generalized prediction capabilities while retaining the model interpretability. We have successfully validated the proposed methodology with two case studies, simplified apoptosis and lipopolysaccharide-induced NFκB signaling pathways, to develop hybrid models with in silico and in vitro measurements, respectively. An intracellular signaling pathway is often represented by a set of nonlinear ordinary differential equations, which translate our current knowledge about the signaling pathway into a testable mathematical model. However, predictions from such models are often subject to high uncertainty since many signaling pathways are only partially known beforehand. In this study, we propose a systematic approach to develop a hybrid model to improve model accuracy by combining machine learning and the first-principle modeling. Specifically, model correction terms are learned from discrepancy between model predictions and measurements, and these terms are added to the first-principle model to enhance the prediction accuracy. Once these correction terms are learned from the data, an artificial neural network (ANN) model is developed to find an empirical relation between the model and the correction terms so that the developed ANN can be used to posses improved predictive capabilities even in new operating conditions (i.e., generalizability). The final hybrid model is then constructed by coupling the first-principle model with the developed ANN.
Collapse
Affiliation(s)
- Dongheon Lee
- Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, Texas, USA
- Texas A&M Energy Institute, Texas A&M University, College Station, Texas, USA
| | - Arul Jayaraman
- Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, Texas, USA
- Department of Biomedical Engineering, Texas A&M University, College Station, Texas, USA
| | - Joseph S. Kwon
- Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, Texas, USA
- Texas A&M Energy Institute, Texas A&M University, College Station, Texas, USA
- * E-mail:
| |
Collapse
|
7
|
Yazdani A, Lu L, Raissi M, Karniadakis GE. Systems biology informed deep learning for inferring parameters and hidden dynamics. PLoS Comput Biol 2020; 16:e1007575. [PMID: 33206658 PMCID: PMC7710119 DOI: 10.1371/journal.pcbi.1007575] [Citation(s) in RCA: 48] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2019] [Revised: 12/02/2020] [Accepted: 10/11/2020] [Indexed: 01/23/2023] Open
Abstract
Mathematical models of biological reactions at the system-level lead to a set of ordinary differential equations with many unknown parameters that need to be inferred using relatively few experimental measurements. Having a reliable and robust algorithm for parameter inference and prediction of the hidden dynamics has been one of the core subjects in systems biology, and is the focus of this study. We have developed a new systems-biology-informed deep learning algorithm that incorporates the system of ordinary differential equations into the neural networks. Enforcing these equations effectively adds constraints to the optimization procedure that manifests itself as an imposed structure on the observational data. Using few scattered and noisy measurements, we are able to infer the dynamics of unobserved species, external forcing, and the unknown model parameters. We have successfully tested the algorithm for three different benchmark problems.
Collapse
Affiliation(s)
- Alireza Yazdani
- Division of Applied Mathematics, Brown University, Providence, Rhode Island, USA
| | - Lu Lu
- Department of Mathematics, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
| | - Maziar Raissi
- Department of Applied Mathematics, University of Colorado, Boulder, Colorado, USA
| | | |
Collapse
|
8
|
Villaverde AF, Tsiantis N, Banga JR. Full observability and estimation of unknown inputs, states and parameters of nonlinear biological models. J R Soc Interface 2019; 16:20190043. [PMID: 31266417 PMCID: PMC6685009 DOI: 10.1098/rsif.2019.0043] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
In this paper, we address the system identification problem in the context of biological modelling. We present and demonstrate a methodology for (i) assessing the possibility of inferring the unknown quantities in a dynamic model and (ii) effectively estimating them from output data. We introduce the term Full Input-State-Parameter Observability (FISPO) analysis to refer to the simultaneous assessment of state, input and parameter observability (note that parameter observability is also known as identifiability). This type of analysis has often remained elusive in the presence of unmeasured inputs. The method proposed in this paper can be applied to a general class of nonlinear ordinary differential equations models. We apply this approach to three models from the recent literature. First, we determine whether it is theoretically possible to infer the states, parameters and inputs, taking only the model equations into account. When this analysis detects deficiencies, we reformulate the model to make it fully observable. Then we move to numerical scenarios and apply an optimization-based technique to estimate the states, parameters and inputs. The results demonstrate the feasibility of an integrated strategy for (i) analysing the theoretical possibility of determining the states, parameters and inputs to a system and (ii) solving the practical problem of actually estimating their values.
Collapse
Affiliation(s)
| | - Nikolaos Tsiantis
- 1 Bioprocess Engineering Group , IIM-CSIC , Vigo , Galicia 36208 , Spain.,2 Department of Chemical Engineering , University of Vigo , Vigo , Galicia 36310 , Spain
| | - Julio R Banga
- 1 Bioprocess Engineering Group , IIM-CSIC , Vigo , Galicia 36208 , Spain
| |
Collapse
|
9
|
Computational Methods for Estimating Molecular System from Membrane Potential Recordings in Nerve Growth Cone. Sci Rep 2018. [PMID: 29540815 PMCID: PMC5852145 DOI: 10.1038/s41598-018-22506-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Biological cells express intracellular biomolecular information to the extracellular environment as various physical responses. We show a novel computational approach to estimate intracellular biomolecular pathways from growth cone electrophysiological responses. Previously, it was shown that cGMP signaling regulates membrane potential (MP) shifts that control the growth cone turning direction during neuronal development. We present here an integrated deterministic mathematical model and Bayesian reversed-engineering framework that enables estimation of the molecular signaling pathway from electrical recordings and considers both the system uncertainty and cell-to-cell variability. Our computational method selects the most plausible molecular pathway from multiple candidates while satisfying model simplicity and considering all possible parameter ranges. The model quantitatively reproduces MP shifts depending on cGMP levels and MP variability potential in different experimental conditions. Lastly, our model predicts that chloride channel inhibition by cGMP-dependent protein kinase (PKG) is essential in the core system for regulation of the MP shifts.
Collapse
|
10
|
Tsiantis N, Balsa-Canto E, Banga JR. Optimality and identification of dynamic models in systems biology: an inverse optimal control framework. Bioinformatics 2018. [DOI: 10.1093/bioinformatics/bty139] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Affiliation(s)
- Nikolaos Tsiantis
- Bioprocess Engineering Group, Spanish National Research Council, IIM-CSIC Vigo, Spain
- Department of Chemical Engineering, University of Vigo Vigo, Spain
| | - Eva Balsa-Canto
- Bioprocess Engineering Group, Spanish National Research Council, IIM-CSIC Vigo, Spain
| | - Julio R Banga
- Bioprocess Engineering Group, Spanish National Research Council, IIM-CSIC Vigo, Spain
| |
Collapse
|