1
|
Jia C, Grima R. Holimap: an accurate and efficient method for solving stochastic gene network dynamics. Nat Commun 2024; 15:6557. [PMID: 39095346 PMCID: PMC11297302 DOI: 10.1038/s41467-024-50716-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2024] [Accepted: 07/13/2024] [Indexed: 08/04/2024] Open
Abstract
Gene-gene interactions are crucial to the control of sub-cellular processes but our understanding of their stochastic dynamics is hindered by the lack of simulation methods that can accurately and efficiently predict how the distributions of gene product numbers vary across parameter space. To overcome these difficulties, here we present Holimap (high-order linear-mapping approximation), an approach that approximates the protein or mRNA number distributions of a complex gene regulatory network by the distributions of a much simpler reaction system. We demonstrate Holimap's computational advantages over conventional methods by applying it to predict the stochastic time-dependent dynamics of various gene networks, including transcriptional networks ranging from simple autoregulatory loops to complex randomly connected networks, post-transcriptional networks, and post-translational networks. Holimap is ideally suited to study how the intricate network of gene-gene interactions results in precise coordination and control of gene expression.
Collapse
Affiliation(s)
- Chen Jia
- Applied and Computational Mathematics Division, Beijing Computational Science Research Center, Beijing, China
| | - Ramon Grima
- School of Biological Sciences, University of Edinburgh, Edinburgh, UK.
| |
Collapse
|
2
|
Wagner V, Strässer R, Allgöwer F, Radde N. A Provably Convergent Control Closure Scheme for the Method of Moments of the Chemical Master Equation. J Chem Theory Comput 2023; 19:9049-9059. [PMID: 38051675 DOI: 10.1021/acs.jctc.3c00548] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/07/2023]
Abstract
In this article, we introduce a novel moment closure scheme based on concepts from model predictive control (MPC) to accurately describe the time evolution of the statistical moments of the solution of the chemical master equation (CME). The method of moments, a set of ordinary differential equations frequently used to calculate the first nm moments, is generally not closed since lower-order moments depend on higher-order moments. To overcome this limitation, we interpret the moment equations as a nonlinear dynamical system, where the first nm moments serve as states, and the closing moments serve as the control input. We demonstrate the efficacy of our approach using three example systems and show that it outperforms existing closure schemes. For polynomial systems, which encompass all mass-action systems, we provide probability bounds for the error between true and estimated moment trajectories. We achieve this by combining the convergence properties of a priori moment estimates from stochastic simulations with guarantees for nonlinear reference tracking MPC. Our proposed method offers an effective solution to accurately predict the time evolution of moments of the CME, which has wide-ranging implications for many fields, including biology, chemistry, and engineering.
Collapse
Affiliation(s)
- Vincent Wagner
- University of Stuttgart, Institute for Stochastics and Applications, 70550 Stuttgart, Germany
| | - Robin Strässer
- University of Stuttgart, Institute for Systems Theory and Automatic Control, 70550 Stuttgart, Germany
| | - Frank Allgöwer
- University of Stuttgart, Institute for Systems Theory and Automatic Control, 70550 Stuttgart, Germany
| | - Nicole Radde
- University of Stuttgart, Institute for Stochastics and Applications, 70550 Stuttgart, Germany
| |
Collapse
|
3
|
Sinzger-D’Angelo M, Startceva S, Koeppl H. Bye bye, linearity, bye: quantification of the mean for linear CRNs in a random environment. J Math Biol 2023; 87:43. [PMID: 37573263 PMCID: PMC10423146 DOI: 10.1007/s00285-023-01973-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2022] [Revised: 07/04/2023] [Accepted: 07/22/2023] [Indexed: 08/14/2023]
Abstract
Molecular reactions within a cell are inherently stochastic, and cells often differ in morphological properties or interact with a heterogeneous environment. Consequently, cell populations exhibit heterogeneity both due to these intrinsic and extrinsic causes. Although state-of-the-art studies that focus on dissecting this heterogeneity use single-cell measurements, the bulk data that shows only the mean expression levels is still in routine use. The fingerprint of the heterogeneity is present also in bulk data, despite being hidden from direct measurement. In particular, this heterogeneity can affect the mean expression levels via bimolecular interactions with low-abundant environment species. We make this statement rigorous for the class of linear reaction systems that are embedded in a discrete state Markov environment. The analytic expression that we provide for the stationary mean depends on the reaction rate constants of the linear subsystem, as well as the generator and stationary distribution of the Markov environment. We demonstrate the effect of the environment on the stationary mean. Namely, we show how the heterogeneous case deviates from the quasi-steady state (Q.SS) case when the embedded system is fast compared to the environment.
Collapse
Affiliation(s)
- Mark Sinzger-D’Angelo
- Electrical Engineering and Information Technology, Technische Universität Darmstadt, Darmstadt, Germany
- Centre for Synthetic Biology, Technische Universität Darmstadt, Darmstadt, Germany
| | - Sofia Startceva
- Electrical Engineering and Information Technology, Technische Universität Darmstadt, Darmstadt, Germany
- Centre for Synthetic Biology, Technische Universität Darmstadt, Darmstadt, Germany
| | - Heinz Koeppl
- Electrical Engineering and Information Technology, Technische Universität Darmstadt, Darmstadt, Germany
- Centre for Synthetic Biology, Technische Universität Darmstadt, Darmstadt, Germany
| |
Collapse
|
4
|
Davies K, Lenhart S, Day J, Lloyd AL, Lanzas C. Extensions of mean-field approximations for environmentally-transmitted pathogen networks. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2023; 20:1637-1673. [PMID: 36899502 DOI: 10.3934/mbe.2023075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
Many pathogens spread via environmental transmission, without requiring host-to-host direct contact. While models for environmental transmission exist, many are simply constructed intuitively with structures analogous to standard models for direct transmission. As model insights are generally sensitive to the underlying model assumptions, it is important that we are able understand the details and consequences of these assumptions. We construct a simple network model for an environmentally-transmitted pathogen and rigorously derive systems of ordinary differential equations (ODEs) based on different assumptions. We explore two key assumptions, namely homogeneity and independence, and demonstrate that relaxing these assumptions can lead to more accurate ODE approximations. We compare these ODE models to a stochastic implementation of the network model over a variety of parameters and network structures, demonstrating that with fewer restrictive assumptions we are able to achieve higher accuracy in our approximations and highlighting more precisely the errors produced by each assumption. We show that less restrictive assumptions lead to more complicated systems of ODEs and the potential for unstable solutions. Due to the rigour of our derivation, we are able to identify the reason behind these errors and propose potential resolutions.
Collapse
Affiliation(s)
- Kale Davies
- Department of Mathematics, University of Chicago, Chicago, IL, USA
- Department of Population Health and Pathobiology, North Carolina State University, Raleigh, NC, USA
| | - Suzanne Lenhart
- Department of Mathematics, University of Tennessee, Knoxville, TN, USA
| | - Judy Day
- Department of Mathematics, University of Tennessee, Knoxville, TN, USA
- Department of Electrical Engineering and Computer Science, University of Tennessee, Knoxville, TN, USA
| | - Alun L Lloyd
- Biomathematics Graduate Program and Department of Mathematics, North Carolina State University, Raleigh, NC, USA
| | - Cristina Lanzas
- Department of Population Health and Pathobiology, North Carolina State University, Raleigh, NC, USA
| |
Collapse
|
5
|
Braniff N, Pearce T, Lu Z, Astwood M, Forrest WSR, Receno C, Ingalls B. NLoed: A Python Package for Nonlinear Optimal Experimental Design in Systems Biology. ACS Synth Biol 2022; 11:3921-3928. [PMID: 36473701 PMCID: PMC9765746 DOI: 10.1021/acssynbio.2c00131] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2022] [Indexed: 12/12/2022]
Abstract
Modeling in systems and synthetic biology relies on accurate parameter estimates and predictions. Accurate model calibration relies, in turn, on data and on how well suited the available data are to a particular modeling task. Optimal experimental design (OED) techniques can be used to identify experiments and data collection procedures that will most efficiently contribute to a given modeling objective. However, implementation of OED is limited by currently available software tools that are not well suited for the diversity of nonlinear models and non-normal data commonly encountered in biological research. Moreover, existing OED tools do not make use of the state-of-the-art numerical tools, resulting in inefficient computation. Here, we present the NLoed software package and demonstrate its use with in vivo data from an optogenetic system in Escherichia coli. NLoed is an open-source Python library providing convenient access to OED methods, with particular emphasis on experimental design for systems biology research. NLoed supports a wide variety of nonlinear, multi-input/output, and dynamic models and facilitates modeling and design of experiments over a wide variety of data types. To support OED investigations, the NLoed package implements maximum likelihood fitting and diagnostic tools, providing a comprehensive modeling workflow. NLoed offers an accessible, modular, and flexible OED tool set suited to the wide variety of experimental scenarios encountered in systems biology research. We demonstrate NLoed's capabilities by applying it to experimental design for characterization of a bacterial optogenetic system.
Collapse
Affiliation(s)
- Nathan Braniff
- Department of Applied Mathematics, University of Waterloo, Waterloo, OntarioN2L 3G1, Canada
| | - Taylor Pearce
- Department of Applied Mathematics, University of Waterloo, Waterloo, OntarioN2L 3G1, Canada
| | - Zixuan Lu
- Department of Applied Mathematics, University of Waterloo, Waterloo, OntarioN2L 3G1, Canada
| | - Michael Astwood
- Department of Applied Mathematics, University of Waterloo, Waterloo, OntarioN2L 3G1, Canada
| | - William S. R. Forrest
- Department of Applied Mathematics, University of Waterloo, Waterloo, OntarioN2L 3G1, Canada
| | - Cody Receno
- Department of Applied Mathematics, University of Waterloo, Waterloo, OntarioN2L 3G1, Canada
| | - Brian Ingalls
- Department of Applied Mathematics, University of Waterloo, Waterloo, OntarioN2L 3G1, Canada
| |
Collapse
|
6
|
Wagner V, Castellaz B, Oesting M, Radde N. Quasi-Entropy Closure: a fast and reliable approach to close the moment equations of the Chemical Master Equation. Bioinformatics 2022; 38:4352-4359. [PMID: 35916726 DOI: 10.1093/bioinformatics/btac501] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 05/13/2022] [Accepted: 08/01/2022] [Indexed: 12/24/2022] Open
Abstract
MOTIVATION The Chemical Master Equation is a stochastic approach to describe the evolution of a (bio)chemical reaction system. Its solution is a time-dependent probability distribution on all possible configurations of the system. As this number is typically large, the Master Equation is often practically unsolvable. The Method of Moments reduces the system to the evolution of a few moments, which are described by ordinary differential equations. Those equations are not closed, since lower order moments generally depend on higher order moments. Various closure schemes have been suggested to solve this problem. Two major problems with these approaches are first that they are open loop systems, which can diverge from the true solution, and second, some of them are computationally expensive. RESULTS Here we introduce Quasi-Entropy Closure, a moment-closure scheme for the Method of Moments. It estimates higher order moments by reconstructing the distribution that minimizes the distance to a uniform distribution subject to lower order moment constraints. Quasi-Entropy Closure can be regarded as an advancement of Zero-Information Closure, which similarly maximizes the information entropy. Results show that both approaches outperform truncation schemes. Quasi-Entropy Closure is computationally much faster than Zero-Information Closure, although both methods consider solutions on the space of configurations and hence do not completely overcome the curse of dimensionality. In addition, our scheme includes a plausibility check for the existence of a distribution satisfying a given set of moments on the feasible set of configurations. All results are evaluated on different benchmark problems. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Vincent Wagner
- Institute for Systems Theory and Automatic Control, University of Stuttgart, 70569 Stuttgart, Germany
| | - Benjamin Castellaz
- Institute for Systems Theory and Automatic Control, University of Stuttgart, 70569 Stuttgart, Germany
| | - Marco Oesting
- Stuttgart Center for Simulation Science, University of Stuttgart, 70569 Stuttgart, Germany.,Institute for Stochastics and Applications, University of Stuttgart, 70569 Stuttgart, Germany
| | - Nicole Radde
- Institute for Systems Theory and Automatic Control, University of Stuttgart, 70569 Stuttgart, Germany.,Stuttgart Center for Simulation Science, University of Stuttgart, 70569 Stuttgart, Germany
| |
Collapse
|
7
|
Ham L, Coomer M, Stumpf M. The chemical Langevin equation for biochemical systems in dynamic environments. J Chem Phys 2022; 157:094105. [DOI: 10.1063/5.0095840] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Modelling and simulation of complex biochemical reaction networks form cornerstones of modern biophysics. Many of the approaches developed so far capture temporal fluctuations due to the inherent stochasticity of the biophysical processes, referred to as intrinsic noise. Stochastic fluctuations, however, predominantly stem from the interplay of the network with many other - and mostly unknown - fluctuating processes, as well as with various random signals arising from the extracellular world; these sources contribute extrinsic noise. Here we provide a computational simulation method to probe the stochastic dynamics of biochemical systems subject to both intrinsic and extrinsic noise. We develop an extrinsic chemical Langevin equation-a physically motivated extension of the chemical Langevin equation- to model intrinsically noisy reaction networks embedded in a stochastically fluctuating environment. The extrinsic CLE is a continuous approximation to the Chemical Master Equation (CME) with time-varying propensities. In our approach, noise is incorporated at the level of the CME, and can account for the full dynamics of the exogenous noise process, irrespective of timescales and their mismatches. We show that our method accurately captures the first two moments of the stationary probability density when compared with exact stochastic simulation methods, while reducing the computational runtime by several orders of magnitude. Our approach provides a method that is practical, computationally efficient and physically accurate to study systems that are simultaneously subject to a variety of noise sources.
Collapse
Affiliation(s)
- Lucy Ham
- The University of Melbourne, University of Melbourne, Australia
| | | | | |
Collapse
|
8
|
Lunz D, Bonnans JF, Ruess J. Revisiting moment-closure methods with heterogeneous multiscale population models. Math Biosci 2022; 350:108866. [PMID: 35753520 DOI: 10.1016/j.mbs.2022.108866] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2021] [Revised: 04/10/2022] [Accepted: 06/08/2022] [Indexed: 11/29/2022]
Abstract
Stochastic chemical kinetics at the single-cell level give rise to heterogeneous populations of cells even when all individuals are genetically identical. This heterogeneity can lead to nonuniform behaviour within populations, including different growth characteristics, cell-fate dynamics, and response to stimuli. Ultimately, these diverse behaviours lead to intricate population dynamics that are inherently multiscale: the population composition evolves based on population-level processes that interact with stochastically distributed single-cell states. Therefore, descriptions that account for this heterogeneity are essential to accurately model and control chemical processes. However, for real-world systems such models are computationally expensive to simulate, which can make optimisation problems, such as optimal control or parameter inference, prohibitively challenging. Here, we consider a class of multiscale population models that incorporate population-level mechanisms while remaining faithful to the underlying stochasticity at the single-cell level and the interplay between these two scales. To address the complexity, we study an order-reduction approximations based on the distribution moments. Since previous moment-closure work has focused on the single-cell kinetics, extending these techniques to populations models prompts us to revisit old observations as well as tackle new challenges. In this extended multiscale context, we encounter the previously established observation that the simplest closure techniques can lead to non-physical system trajectories. Despite their poor performance in some systems, we provide an example where these simple closures outperform more sophisticated closure methods in accurately, efficiently, and robustly solving the problem of optimal control of bioproduction in a microbial consortium model.
Collapse
Affiliation(s)
- Davin Lunz
- Inria Paris, 2 rue Simone Iff, 75012 Paris, France; Institut Pasteur, 28 rue du Docteur Roux, 75015 Paris, France.
| | - J Frédéric Bonnans
- Université Paris-Saclay, CNRS, CentraleSupélec, Inria, Laboratory of signals and systems, 91190, Gif-sur-Yvette, France
| | - Jakob Ruess
- Inria Paris, 2 rue Simone Iff, 75012 Paris, France; Institut Pasteur, 28 rue du Docteur Roux, 75015 Paris, France
| |
Collapse
|
9
|
Quantifying biochemical reaction rates from static population variability within incompletely observed complex networks. PLoS Comput Biol 2022; 18:e1010183. [PMID: 35731728 PMCID: PMC9216546 DOI: 10.1371/journal.pcbi.1010183] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2021] [Accepted: 05/07/2022] [Indexed: 11/19/2022] Open
Abstract
Quantifying biochemical reaction rates within complex cellular processes remains a key challenge of systems biology even as high-throughput single-cell data have become available to characterize snapshots of population variability. That is because complex systems with stochastic and non-linear interactions are difficult to analyze when not all components can be observed simultaneously and systems cannot be followed over time. Instead of using descriptive statistical models, we show that incompletely specified mechanistic models can be used to translate qualitative knowledge of interactions into reaction rate functions from covariability data between pairs of components. This promises to turn a globally intractable problem into a sequence of solvable inference problems to quantify complex interaction networks from incomplete snapshots of their stochastic fluctuations.
Collapse
|
10
|
Li Y, Jiang DQ, Jia C. Steady-state joint distribution for first-order stochastic reaction kinetics. Phys Rev E 2021; 104:024408. [PMID: 34525607 DOI: 10.1103/physreve.104.024408] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2021] [Accepted: 07/19/2021] [Indexed: 11/07/2022]
Abstract
While the analytical solution for the marginal distribution of a stochastic chemical reaction network has been extensively studied, its joint distribution, i.e., the solution of a high-dimensional chemical master equation, has received much less attention. Here we develop an alternative method of computing the exact joint distributions of a wide class of first-order stochastic reaction systems in steady-state conditions. The effectiveness of our method is validated by applying it to four gene expression models of biological significance, including models with 2A peptides, nascent mRNA, gene regulation, translational bursting, and alternative splicing.
Collapse
Affiliation(s)
- Youming Li
- LMAM, School of Mathematical Sciences, Peking University, Beijing 100871, China.,Applied and Computational Mathematics Division, Beijing Computational Science Research Center, Beijing 100193, China
| | - Da-Quan Jiang
- LMAM, School of Mathematical Sciences, Peking University, Beijing 100871, China.,Center for Statistical Science, Peking University, Beijing 100871, China
| | - Chen Jia
- Applied and Computational Mathematics Division, Beijing Computational Science Research Center, Beijing 100193, China
| |
Collapse
|
11
|
Abstract
The interaction between a consumer (such as, a predator or a parasitoid) and a resource (such as, a prey or a host) forms an integral motif in ecological food webs, and has been modeled since the early 20th century starting from the seminal work of Lotka and Volterra. While the Lotka-Volterra predator-prey model predicts a neutrally stable equilibrium with oscillating population densities, a density-dependent predator attack rate is known to stabilize the equilibrium. Here, we consider a stochastic formulation of the Lotka-Volterra model where the prey's reproduction rate is a random process, and the predator's attack rate depends on both the prey and predator population densities. Analysis shows that increasing the sensitivity of the attack rate to the prey density attenuates the magnitude of stochastic fluctuations in the population densities. In contrast, these fluctuations vary non-monotonically with the sensitivity of the attack rate to the predator density with an optimal level of sensitivity minimizing the magnitude of fluctuations. Interestingly, our systematic study of the predator-prey correlations reveals distinct signatures depending on the form of the density-dependent attack rate. In summary, stochastic dynamics of nonlinear Lotka-Volterra models can be harnessed to infer density-dependent mechanisms regulating predator-prey interactions. Moreover, these mechanisms can have contrasting consequences on population density fluctuations, with predator-dependent attack rates amplifying stochasticity, while prey-dependent attack rates countering to buffer fluctuations.
Collapse
Affiliation(s)
- Abhyudai Singh
- Departments of Electrical and Computer Engineering, Biomedical Engineering and Mathematical Sciences, University of Delaware, Newark, DE, United States of America
| |
Collapse
|
12
|
Kurdyaeva T, Milias-Argeitis A. Uncertainty propagation for deterministic models of biochemical networks using moment equations and the extended Kalman filter. J R Soc Interface 2021; 18:20210331. [PMID: 34343452 DOI: 10.1098/rsif.2021.0331] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Differential equation models of biochemical networks are frequently associated with a large degree of uncertainty in parameters and/or initial conditions. However, estimating the impact of this uncertainty on model predictions via Monte Carlo simulation is computationally demanding. A more efficient approach could be to track a system of low-order statistical moments of the state. Unfortunately, when the underlying model is nonlinear, the system of moment equations is infinite-dimensional and cannot be solved without a moment closure approximation which may introduce bias in the moment dynamics. Here, we present a new method to study the time evolution of the desired moments for nonlinear systems with polynomial rate laws. Our approach is based on solving a system of low-order moment equations by substituting the higher-order moments with Monte Carlo-based estimates from a small number of simulations, and using an extended Kalman filter to counteract Monte Carlo noise. Our algorithm provides more accurate and robust results compared to traditional Monte Carlo and moment closure techniques, and we expect that it will be widely useful for the quantification of uncertainty in biochemical model predictions.
Collapse
Affiliation(s)
- Tamara Kurdyaeva
- Molecular Systems Biology, Groningen Biomolecular Sciences & Biotechnology Institute, University of Groningen, Groningen, Netherlands
| | - Andreas Milias-Argeitis
- Molecular Systems Biology, Groningen Biomolecular Sciences & Biotechnology Institute, University of Groningen, Groningen, Netherlands
| |
Collapse
|
13
|
Waizmann T, Bortolussi L, Vandin A, Tribastone M. Improved estimations of stochastic chemical kinetics by finite-state expansion. Proc Math Phys Eng Sci 2021. [DOI: 10.1098/rspa.2020.0964] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Stochastic reaction networks are a fundamental model to describe interactions between species where random fluctuations are relevant. The master equation provides the evolution of the probability distribution across the discrete state space consisting of vectors of population counts for each species. However, since its exact solution is often elusive, several analytical approximations have been proposed. The deterministic rate equation (DRE) gives a macroscopic approximation as a compact system of differential equations that estimate the average populations for each species, but it may be inaccurate in the case of nonlinear interaction dynamics. Here we propose finite-state expansion (FSE), an analytical method mediating between the microscopic and the macroscopic interpretations of a stochastic reaction network by coupling the master equation dynamics of a chosen subset of the discrete state space with the mean population dynamics of the DRE. An algorithm translates a network into an expanded one where each discrete state is represented as a further distinct species. This translation exactly preserves the stochastic dynamics, but the DRE of the expanded network can be interpreted as a correction to the original one. The effectiveness of FSE is demonstrated in models that challenge state-of-the-art techniques due to intrinsic noise, multi-scale populations and multi-stability.
Collapse
Affiliation(s)
| | - Luca Bortolussi
- Department of Mathematics and Geosciences, University of Trieste, Trieste 34127, Italy
| | - Andrea Vandin
- Sant’Anna School of Advanced Studies, Pisa 56127, Italy
- Department of Applied Mathematics and Computer Science, DTU Technical University of Denmark, Kgs. Lyngby 2800, Denmark
| | | |
Collapse
|
14
|
Sukys A, Grima R. MomentClosure.jl: automated moment closure approximations in Julia. Bioinformatics 2021; 38:289-290. [PMID: 34170295 PMCID: PMC8696096 DOI: 10.1093/bioinformatics/btab469] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2021] [Revised: 06/15/2021] [Accepted: 06/22/2021] [Indexed: 02/03/2023] Open
Abstract
SUMMARY MomentClosure.jl is a Julia package providing automated derivation of the time-evolution equations of the moments of molecule numbers for virtually any chemical reaction network using a wide range of moment closure approximations. It extends the capabilities of modelling stochastic biochemical systems in Julia and can be particularly useful when exact analytic solutions of the chemical master equation are unavailable and when Monte Carlo simulations are computationally expensive. AVAILABILITY AND IMPLEMENTATION MomentClosure.jl is freely accessible under the MIT licence. Source code and documentation are available at https://github.com/augustinas1/MomentClosure.jl.
Collapse
Affiliation(s)
| | - Ramon Grima
- To whom correspondence should be addressed. or
| |
Collapse
|
15
|
Pietzsch T, Duso L, Zechner C. Compartor: A toolbox for the automatic generation of moment equations for dynamic compartment populations. Bioinformatics 2021; 37:2782-2784. [PMID: 33538766 PMCID: PMC8428613 DOI: 10.1093/bioinformatics/btab058] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2020] [Revised: 01/21/2021] [Accepted: 01/25/2021] [Indexed: 01/13/2023] Open
Abstract
Summary Many biochemical processes in living organisms take place inside compartments that can interact with each other and remodel over time. In a recent work, we have shown how the stochastic dynamics of a compartmentalized biochemical system can be effectively studied using moment equations. With this technique, the time evolution of a compartment population is summarized using a finite number of ordinary differential equations, which can be analyzed very efficiently. However, the derivation of moment equations by hand can become time-consuming for systems comprising multiple reactants and interactions. Here we present Compartor, a toolbox that automatically generates the moment equations associated with a user-defined compartmentalized system. Through the moment equation method, Compartor renders the analysis of stochastic population models accessible to a broader scientific community. Availability and implementation Compartor is provided as a Python package and is available at https://pypi.org/project/compartor/. Source code and usage tutorials for Compartor are available at https://github.com/zechnerlab/Compartor.
Collapse
Affiliation(s)
- Tobias Pietzsch
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, 01307, Germany.,Center for Systems Biology Dresden, Dresden, 01307, Germany
| | - Lorenzo Duso
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, 01307, Germany.,Center for Systems Biology Dresden, Dresden, 01307, Germany
| | - Christoph Zechner
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, 01307, Germany.,Center for Systems Biology Dresden, Dresden, 01307, Germany.,Cluster of Excellence Physics of Life, TU Dresden, Dresden, 01062, Germany
| |
Collapse
|
16
|
Browning AP, Warne DJ, Burrage K, Baker RE, Simpson MJ. Identifiability analysis for stochastic differential equation models in systems biology. J R Soc Interface 2020; 17:20200652. [PMID: 33323054 PMCID: PMC7811582 DOI: 10.1098/rsif.2020.0652] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2020] [Accepted: 11/24/2020] [Indexed: 12/26/2022] Open
Abstract
Mathematical models are routinely calibrated to experimental data, with goals ranging from building predictive models to quantifying parameters that cannot be measured. Whether or not reliable parameter estimates are obtainable from the available data can easily be overlooked. Such issues of parameter identifiability have important ramifications for both the predictive power of a model, and the mechanistic insight that can be obtained. Identifiability analysis is well-established for deterministic, ordinary differential equation (ODE) models, but there are no commonly adopted methods for analysing identifiability in stochastic models. We provide an accessible introduction to identifiability analysis and demonstrate how existing ideas for analysis of ODE models can be applied to stochastic differential equation (SDE) models through four practical case studies. To assess structural identifiability, we study ODEs that describe the statistical moments of the stochastic process using open-source software tools. Using practically motivated synthetic data and Markov chain Monte Carlo methods, we assess parameter identifiability in the context of available data. Our analysis shows that SDE models can often extract more information about parameters than deterministic descriptions. All code used to perform the analysis is available on Github.
Collapse
Affiliation(s)
- Alexander P. Browning
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Australia
- ARC Centre of Excellence for Mathematical and Statistical Frontiers, Queensland University of Technology, Brisbane, Australia
| | - David J. Warne
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Australia
- ARC Centre of Excellence for Mathematical and Statistical Frontiers, Queensland University of Technology, Brisbane, Australia
| | - Kevin Burrage
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Australia
- ARC Centre of Excellence for Mathematical and Statistical Frontiers, Queensland University of Technology, Brisbane, Australia
- ARC Centre of Excellence for Plant Success in Nature and Agriculture, Queensland University of Technology, Brisbane, Australia
- Department of Computer Science, University of Oxford, Oxford, UK
| | - Ruth E. Baker
- Mathematical Institute, University of Oxford, Oxford, UK
| | - Matthew J. Simpson
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Australia
- ARC Centre of Excellence for Mathematical and Statistical Frontiers, Queensland University of Technology, Brisbane, Australia
| |
Collapse
|
17
|
Kang YM, Liu RN. Moment dynamics for gene regulation with rational rate laws. Phys Rev E 2020; 102:042407. [PMID: 33212610 DOI: 10.1103/physreve.102.042407] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2020] [Accepted: 09/09/2020] [Indexed: 06/11/2023]
Abstract
This aim of this paper is mainly to investigate the performance of two typical moment closure schemes in gene regulatory master equations of rational rate laws. When the reaction rate is polynomial, the error bounds between the authentic and approximate moments obtained by schemes of Gaussian moment closure and log-normal moment closure are explicitly given. When the reaction rate is not polynomial, it is shown that the two schemes both behave well in the absence of active-inactive state switch, but in the presence of active-inactive state switch the log-normal closure scheme is far superior to the Gaussian closure scheme in capturing the asymptotic ensemble statistics. Moreover, the accuracy of the log-normal closure method is further confirmed by steady-state analytic results and the conditional Gaussian closure method. It is also disclosed that optimal negative feedback exists in suppressing protein noise in the presence of the on-off switch control.
Collapse
Affiliation(s)
- Yan-Mei Kang
- School of Mathematics and Statistics, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China
| | - Ruo-Nan Liu
- School of Mathematics and Statistics, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China
| |
Collapse
|
18
|
Abstract
Many biochemical processes in living systems take place in compartmentalized environments, where individual compartments can interact with each other and undergo dynamic remodeling. Studying such processes through mathematical models poses formidable challenges because the underlying dynamics involve a large number of states, which evolve stochastically with time. Here we propose a mathematical framework to study stochastic biochemical networks in compartmentalized environments. We develop a generic population model, which tracks individual compartments and their molecular composition. We then show how the time evolution of this system can be studied effectively through a small number of differential equations, which track the statistics of the population. Our approach is versatile and renders an important class of biological systems computationally accessible. Compartmentalization of biochemical processes underlies all biological systems, from the organelle to the tissue scale. Theoretical models to study the interplay between noisy reaction dynamics and compartmentalization are sparse, and typically very challenging to analyze computationally. Recent studies have made progress toward addressing this problem in the context of specific biological systems, but a general and sufficiently effective approach remains lacking. In this work, we propose a mathematical framework based on counting processes that allows us to study dynamic compartment populations with arbitrary interactions and internal biochemistry. We derive an efficient description of the dynamics in terms of differential equations which capture the statistics of the population. We demonstrate the relevance of our approach by analyzing models inspired by different biological processes, including subcellular compartmentalization and tissue homeostasis.
Collapse
|
19
|
Liu RN, Kang YM. Stochastic master equation for early protein aggregation in the transthyretin amyloid disease. Sci Rep 2020; 10:12437. [PMID: 32709875 PMCID: PMC7381670 DOI: 10.1038/s41598-020-69319-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2020] [Accepted: 07/10/2020] [Indexed: 11/09/2022] Open
Abstract
It is significant to understand the earliest molecular events occurring in the nucleation of the amyloid aggregation cascade for the prevention of amyloid related diseases such as transthyretin amyloid disease. We develop chemical master equation for the aggregation of monomers into oligomers using reaction rate law in chemical kinetics. For this stochastic model, lognormal moment closure method is applied to track the evolution of relevant statistical moments and its high accuracy is confirmed by the results obtained from Gillespie's stochastic simulation algorithm. Our results show that the formation of oligomers is highly dependent on the number of monomers. Furthermore, the misfolding rate also has an important impact on the process of oligomers formation. The quantitative investigation should be helpful for shedding more light on the mechanism of amyloid fibril nucleation.
Collapse
Affiliation(s)
- Ruo-Nan Liu
- School of Mathematics and Statistics, Xi'an Jiaotong University, Xi'an, 710049, Shaanxi, China
| | - Yan-Mei Kang
- School of Mathematics and Statistics, Xi'an Jiaotong University, Xi'an, 710049, Shaanxi, China.
| |
Collapse
|
20
|
Enhancement of gene expression noise from transcription factor binding to genomic decoy sites. Sci Rep 2020; 10:9126. [PMID: 32499583 PMCID: PMC7272470 DOI: 10.1038/s41598-020-65750-2] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2019] [Accepted: 05/08/2020] [Indexed: 12/29/2022] Open
Abstract
The genome contains several high-affinity non-functional binding sites for transcription factors (TFs) creating a hidden and unexplored layer of gene regulation. We investigate the role of such “decoy sites” in controlling noise (random fluctuations) in the level of a TF that is synthesized in stochastic bursts. Prior studies have assumed that decoy-bound TFs are protected from degradation, and in this case decoys function to buffer noise. Relaxing this assumption to consider arbitrary degradation rates for both bound/unbound TF states, we find rich noise behaviors. For low-affinity decoys, noise in the level of unbound TF always monotonically decreases to the Poisson limit with increasing decoy numbers. In contrast, for high-affinity decoys, noise levels first increase with increasing decoy numbers, before decreasing back to the Poisson limit. Interestingly, while protection of bound TFs from degradation slows the time-scale of fluctuations in the unbound TF levels, the decay of bound TFs leads to faster fluctuations and smaller noise propagation to downstream target proteins. In summary, our analysis reveals stochastic dynamics emerging from nonspecific binding of TFs and highlights the dual role of decoys as attenuators or amplifiers of gene expression noise depending on their binding affinity and stability of the bound TF.
Collapse
|
21
|
Ham L, Brackston RD, Stumpf MPH. Extrinsic Noise and Heavy-Tailed Laws in Gene Expression. PHYSICAL REVIEW LETTERS 2020; 124:108101. [PMID: 32216388 DOI: 10.1103/physrevlett.124.108101] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/29/2019] [Accepted: 02/12/2020] [Indexed: 06/10/2023]
Abstract
Noise in gene expression is one of the hallmarks of life at the molecular scale. Here we derive analytical solutions to a set of models describing the molecular mechanisms underlying transcription of DNA into RNA. Our ansatz allows us to incorporate the effects of extrinsic noise-encompassing factors external to the transcription of the individual gene-and discuss the ramifications for heterogeneity in gene product abundance that has been widely observed in single cell data. Crucially, we are able to show that heavy-tailed distributions of RNA copy numbers cannot result from the intrinsic stochasticity in gene expression alone, but must instead reflect extrinsic sources of variability.
Collapse
Affiliation(s)
- Lucy Ham
- School of BioSciences and School of Mathematics and Statistics, University of Melbourne, Parkville VIC 3010, Australia
| | - Rowan D Brackston
- Department Life Sciences, Imperial College London, SW7 2AZ, United Kingdom
| | - Michael P H Stumpf
- School of BioSciences and School of Mathematics and Statistics, University of Melbourne, Parkville VIC 3010, Australia
- Department Life Sciences, Imperial College London, SW7 2AZ, United Kingdom
| |
Collapse
|
22
|
Kuntz J, Thomas P, Stan GB, Barahona M. Bounding the stationary distributions of the chemical master equation via mathematical programming. J Chem Phys 2019; 151:034109. [PMID: 31325941 DOI: 10.1063/1.5100670] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
The stochastic dynamics of biochemical networks are usually modeled with the chemical master equation (CME). The stationary distributions of CMEs are seldom solvable analytically, and numerical methods typically produce estimates with uncontrolled errors. Here, we introduce mathematical programming approaches that yield approximations of these distributions with computable error bounds which enable the verification of their accuracy. First, we use semidefinite programming to compute increasingly tighter upper and lower bounds on the moments of the stationary distributions for networks with rational propensities. Second, we use these moment bounds to formulate linear programs that yield convergent upper and lower bounds on the stationary distributions themselves, their marginals, and stationary averages. The bounds obtained also provide a computational test for the uniqueness of the distribution. In the unique case, the bounds form an approximation of the stationary distribution with a computable bound on its error. In the nonunique case, our approach yields converging approximations of the ergodic distributions. We illustrate our methodology through several biochemical examples taken from the literature: Schlögl's model for a chemical bifurcation, a two-dimensional toggle switch, a model for bursty gene expression, and a dimerization model with multiple stationary distributions.
Collapse
Affiliation(s)
- Juan Kuntz
- Department of Mathematics, Imperial College London, London, United Kingdom
| | - Philipp Thomas
- Department of Mathematics, Imperial College London, London, United Kingdom
| | - Guy-Bart Stan
- Department of Bioengineering, Imperial College London, London, United Kingdom
| | - Mauricio Barahona
- Department of Mathematics, Imperial College London, London, United Kingdom
| |
Collapse
|
23
|
Loskot P, Atitey K, Mihaylova L. Comprehensive Review of Models and Methods for Inferences in Bio-Chemical Reaction Networks. Front Genet 2019; 10:549. [PMID: 31258548 PMCID: PMC6588029 DOI: 10.3389/fgene.2019.00549] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2019] [Accepted: 05/24/2019] [Indexed: 01/30/2023] Open
Abstract
The key processes in biological and chemical systems are described by networks of chemical reactions. From molecular biology to biotechnology applications, computational models of reaction networks are used extensively to elucidate their non-linear dynamics. The model dynamics are crucially dependent on the parameter values which are often estimated from observations. Over the past decade, the interest in parameter and state estimation in models of (bio-) chemical reaction networks (BRNs) grew considerably. The related inference problems are also encountered in many other tasks including model calibration, discrimination, identifiability, and checking, and optimum experiment design, sensitivity analysis, and bifurcation analysis. The aim of this review paper is to examine the developments in literature to understand what BRN models are commonly used, and for what inference tasks and inference methods. The initial collection of about 700 documents concerning estimation problems in BRNs excluding books and textbooks in computational biology and chemistry were screened to select over 270 research papers and 20 graduate research theses. The paper selection was facilitated by text mining scripts to automate the search for relevant keywords and terms. The outcomes are presented in tables revealing the levels of interest in different inference tasks and methods for given models in the literature as well as the research trends are uncovered. Our findings indicate that many combinations of models, tasks and methods are still relatively unexplored, and there are many new research opportunities to explore combinations that have not been considered-perhaps for good reasons. The most common models of BRNs in literature involve differential equations, Markov processes, mass action kinetics, and state space representations whereas the most common tasks are the parameter inference and model identification. The most common methods in literature are Bayesian analysis, Monte Carlo sampling strategies, and model fitting to data using evolutionary algorithms. The new research problems which cannot be directly deduced from the text mining data are also discussed.
Collapse
Affiliation(s)
- Pavel Loskot
- College of Engineering, Swansea University, Swansea, United Kingdom
| | - Komlan Atitey
- College of Engineering, Swansea University, Swansea, United Kingdom
| | - Lyudmila Mihaylova
- Department of Automatic Control and Systems Engineering, University of Sheffield, Sheffield, United Kingdom
| |
Collapse
|
24
|
Solving Stochastic Reaction Networks with Maximum Entropy Lagrange Multipliers. ENTROPY 2018; 20:e20090700. [PMID: 33265789 PMCID: PMC7513230 DOI: 10.3390/e20090700] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/03/2018] [Revised: 09/05/2018] [Accepted: 09/06/2018] [Indexed: 11/16/2022]
Abstract
The time evolution of stochastic reaction networks can be modeled with the chemical master equation of the probability distribution. Alternatively, the numerical problem can be reformulated in terms of probability moment equations. Herein we present a new alternative method for numerically solving the time evolution of stochastic reaction networks. Based on the assumption that the entropy of the reaction network is maximum, Lagrange multipliers are introduced. The proposed method derives equations that model the time derivatives of these Lagrange multipliers. We present detailed steps to transform moment equations to Lagrange multiplier equations. In order to demonstrate the method, we present examples of non-linear stochastic reaction networks of varying degrees of complexity, including multistable and oscillatory systems. We find that the new approach is as accurate and significantly more efficient than Gillespie’s original exact algorithm for systems with small number of interacting species. This work is a step towards solving stochastic reaction networks accurately and efficiently.
Collapse
|
25
|
Linear mapping approximation of gene regulatory networks with stochastic dynamics. Nat Commun 2018; 9:3305. [PMID: 30120244 PMCID: PMC6098115 DOI: 10.1038/s41467-018-05822-0] [Citation(s) in RCA: 66] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2018] [Accepted: 07/20/2018] [Indexed: 01/09/2023] Open
Abstract
The presence of protein-DNA binding reactions often leads to analytically intractable models of stochastic gene expression. Here we present the linear-mapping approximation that maps systems with protein-promoter interactions onto approximately equivalent systems with no binding reactions. This is achieved by the marriage of conditional mean-field approximation and the Magnus expansion, leading to analytic or semi-analytic expressions for the approximate time-dependent and steady-state protein number distributions. Stochastic simulations verify the method's accuracy in capturing the changes in the protein number distributions with time for a wide variety of networks displaying auto- and mutual-regulation of gene expression and independently of the ratios of the timescales governing the dynamics. The method is also used to study the first-passage time distribution of promoter switching, the sensitivity of the size of protein number fluctuations to parameter perturbation and the stochastic bifurcation diagram characterizing the onset of multimodality in protein number distributions.
Collapse
|
26
|
Pischel D, Sundmacher K, Flassig RJ. Efficient simulation of intrinsic, extrinsic and external noise in biochemical systems. Bioinformatics 2018; 33:i319-i324. [PMID: 28881987 PMCID: PMC5870780 DOI: 10.1093/bioinformatics/btx253] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Motivation Biological cells operate in a noisy regime influenced by intrinsic, extrinsic and external noise, which leads to large differences of individual cell states. Stochastic effects must be taken into account to characterize biochemical kinetics accurately. Since the exact solution of the chemical master equation, which governs the underlying stochastic process, cannot be derived for most biochemical systems, approximate methods are used to obtain a solution. Results In this study, a method to efficiently simulate the various sources of noise simultaneously is proposed and benchmarked on several examples. The method relies on the combination of the sigma point approach to describe extrinsic and external variability and the τ-leaping algorithm to account for the stochasticity due to probabilistic reactions. The comparison of our method to extensive Monte Carlo calculations demonstrates an immense computational advantage while losing an acceptable amount of accuracy. Additionally, the application to parameter optimization problems in stochastic biochemical reaction networks is shown, which is rarely applied due to its huge computational burden. To give further insight, a MATLAB script is provided including the proposed method applied to a simple toy example of gene expression. Availability and implementation MATLAB code is available at Bioinformatics online. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Dennis Pischel
- Otto-von-Guericke-University Magdeburg, Process Systems Engineering, Magdeburg, Germany
| | - Kai Sundmacher
- Otto-von-Guericke-University Magdeburg, Process Systems Engineering, Magdeburg, Germany.,Max Planck Institute for Dynamics of Complex Technical Systems, Process Systems Engineering, Magdeburg, Germany
| | - Robert J Flassig
- Max Planck Institute for Dynamics of Complex Technical Systems, Process Systems Engineering, Magdeburg, Germany
| |
Collapse
|
27
|
Bronstein L, Koeppl H. A variational approach to moment-closure approximations for the kinetics of biomolecular reaction networks. J Chem Phys 2018; 148:014105. [DOI: 10.1063/1.5003892] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Affiliation(s)
- Leo Bronstein
- Department of Electrical Engineering and Information Technology, Technische Universität Darmstadt, 64283 Darmstadt, Germany
| | - Heinz Koeppl
- Department of Electrical Engineering and Information Technology, Technische Universität Darmstadt, 64283 Darmstadt, Germany
| |
Collapse
|
28
|
Sakurai Y, Hori Y. Optimization-based synthesis of stochastic biocircuits with statistical specifications. J R Soc Interface 2018; 15:20170709. [PMID: 29321266 PMCID: PMC5805972 DOI: 10.1098/rsif.2017.0709] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2017] [Accepted: 12/08/2017] [Indexed: 01/19/2023] Open
Abstract
Model-guided design has become a standard approach to engineering biomolecular circuits in synthetic biology. However, the stochastic nature of biomolecular reactions is often overlooked in the design process. As a result, cell-cell heterogeneity causes unexpected deviation of biocircuit behaviours from model predictions and requires additional iterations of design-build-test cycles. To enhance the design process of stochastic biocircuits, this paper presents a computational framework to systematically specify the level of intrinsic noise using well-defined metrics of statistics and design highly heterogeneous biocircuits based on the specifications. Specifically, we use descriptive statistics of population distributions as an intuitive specification language of stochastic biocircuits and develop an optimization-based computational tool that explores parameter configurations satisfying design requirements. Sensitivity analysis methods are also performed to ensure the robustness of a biocircuit design against extrinsic perturbations. These design tools are formulated with convex optimization programs to enable rigorous and efficient quantification of the statistics. We demonstrate these features by designing a stochastic negative feedback biocircuit that satisfies multiple statistical constraints and perform an in-depth study of noise propagation and regulation in negative feedback pathways.
Collapse
Affiliation(s)
- Yuta Sakurai
- Department of Applied Physics and Physico-Informatics, Keio University, Yokohama, Kanagawa, Japan
| | - Yutaka Hori
- Department of Applied Physics and Physico-Informatics, Keio University, Yokohama, Kanagawa, Japan
| |
Collapse
|
29
|
Price DJ, Breuzé A, Dybowski R, Mastroeni P, Restif O. An efficient moments-based inference method for within-host bacterial infection dynamics. PLoS Comput Biol 2017; 13:e1005841. [PMID: 29155811 PMCID: PMC5714343 DOI: 10.1371/journal.pcbi.1005841] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2017] [Revised: 12/04/2017] [Accepted: 10/22/2017] [Indexed: 11/18/2022] Open
Abstract
Over the last ten years, isogenic tagging (IT) has revolutionised the study of bacterial infection dynamics in laboratory animal models. However, quantitative analysis of IT data has been hindered by the piecemeal development of relevant statistical models. The most promising approach relies on stochastic Markovian models of bacterial population dynamics within and among organs. Here we present an efficient numerical method to fit such stochastic dynamic models to in vivo experimental IT data. A common approach to statistical inference with stochastic dynamic models relies on producing large numbers of simulations, but this remains a slow and inefficient method for all but simple problems, especially when tracking bacteria in multiple locations simultaneously. Instead, we derive and solve the systems of ordinary differential equations for the two lower-order moments of the stochastic variables (mean, variance and covariance). For any given model structure, and assuming linear dynamic rates, we demonstrate how the model parameters can be efficiently and accurately estimated by divergence minimisation. We then apply our method to an experimental dataset and compare the estimates and goodness-of-fit to those obtained by maximum likelihood estimation. While both sets of parameter estimates had overlapping confidence regions, the new method produced lower values for the division and death rates of bacteria: these improved the goodness-of-fit at the second time point at the expense of that of the first time point. This flexible framework can easily be applied to a range of experimental systems. Its computational efficiency paves the way for model comparison and optimal experimental design.
Collapse
Affiliation(s)
- David J. Price
- Department of Veterinary Medicine, University of Cambridge, Cambridge, United Kingdom
| | - Alexandre Breuzé
- Department of Veterinary Medicine, University of Cambridge, Cambridge, United Kingdom
- ENSTA-ParisTech, Palaiseau, France
| | - Richard Dybowski
- Department of Veterinary Medicine, University of Cambridge, Cambridge, United Kingdom
| | - Piero Mastroeni
- Department of Veterinary Medicine, University of Cambridge, Cambridge, United Kingdom
| | - Olivier Restif
- Department of Veterinary Medicine, University of Cambridge, Cambridge, United Kingdom
- * E-mail:
| |
Collapse
|
30
|
Lakatos E, Stumpf MPH. Control mechanisms for stochastic biochemical systems via computation of reachable sets. ROYAL SOCIETY OPEN SCIENCE 2017; 4:160790. [PMID: 28878957 PMCID: PMC5579072 DOI: 10.1098/rsos.160790] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/15/2017] [Accepted: 07/21/2017] [Indexed: 06/07/2023]
Abstract
Controlling the behaviour of cells by rationally guiding molecular processes is an overarching aim of much of synthetic biology. Molecular processes, however, are notoriously noisy and frequently nonlinear. We present an approach to studying the impact of control measures on motifs of molecular interactions that addresses the problems faced in many biological systems: stochasticity, parameter uncertainty and nonlinearity. We show that our reachability analysis formalism can describe the potential behaviour of biological (naturally evolved as well as engineered) systems, and provides a set of bounds on their dynamics at the level of population statistics: for example, we can obtain the possible ranges of means and variances of mRNA and protein expression levels, even in the presence of uncertainty about model parameters.
Collapse
Affiliation(s)
- Eszter Lakatos
- Centre for Integrative Systems Biology and Bioinformatics, Department of Life Sciences, Biosciences, Imperial College London, London SW7 2AZ, UK
| | - Michael P. H. Stumpf
- Centre for Integrative Systems Biology and Bioinformatics, Department of Life Sciences, Biosciences, Imperial College London, London SW7 2AZ, UK
| |
Collapse
|
31
|
Bravi B, Sollich P. Statistical physics approaches to subnetwork dynamics in biochemical systems. Phys Biol 2017; 14:045010. [PMID: 28510539 DOI: 10.1088/1478-3975/aa7363] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
We apply a Gaussian variational approximation to model reduction in large biochemical networks of unary and binary reactions. We focus on a small subset of variables (subnetwork) of interest, e.g. because they are accessible experimentally, embedded in a larger network (bulk). The key goal is to write dynamical equations reduced to the subnetwork but still retaining the effects of the bulk. As a result, the subnetwork-reduced dynamics contains a memory term and an extrinsic noise term with non-trivial temporal correlations. We first derive expressions for this memory and noise in the linearized (Gaussian) dynamics and then use a perturbative power expansion to obtain first order nonlinear corrections. For the case of vanishing intrinsic noise, our description is explicitly shown to be equivalent to projection methods up to quadratic terms, but it is applicable also in the presence of stochastic fluctuations in the original dynamics. An example from the epidermal growth factor receptor signalling pathway is provided to probe the increased prediction accuracy and computational efficiency of our method.
Collapse
Affiliation(s)
- B Bravi
- Current affiliation: Institute of Theoretical Physics, Ecole Polytechnique Fédérale de Lausanne (EPFL), CH-1015 Lausanne, Switzerland
| | | |
Collapse
|
32
|
Ghusinga KR, Vargas-Garcia CA, Lamperski A, Singh A. Exact lower and upper bounds on stationary moments in stochastic biochemical systems. Phys Biol 2017; 14:04LT01. [PMID: 28661893 DOI: 10.1088/1478-3975/aa75c6] [Citation(s) in RCA: 46] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
In the stochastic description of biochemical reaction systems, the time evolution of statistical moments for species population counts is described by a linear dynamical system. However, except for some ideal cases (such as zero- and first-order reaction kinetics), the moment dynamics is underdetermined as lower-order moments depend upon higher-order moments. Here, we propose a novel method to find exact lower and upper bounds on stationary moments for a given arbitrary system of biochemical reactions. The method exploits the fact that statistical moments of any positive-valued random variable must satisfy some constraints that are compactly represented through the positive semidefiniteness of moment matrices. Our analysis shows that solving moment equations at steady state in conjunction with constraints on moment matrices provides exact lower and upper bounds on the moments. These results are illustrated by three different examples-the commonly used logistic growth model, stochastic gene expression with auto-regulation and an activator-repressor gene network motif. Interestingly, in all cases the accuracy of the bounds is shown to improve as moment equations are expanded to include higher-order moments. Our results provide avenues for development of approximation methods that provide explicit bounds on moments for nonlinear stochastic systems that are otherwise analytically intractable.
Collapse
Affiliation(s)
- Khem Raj Ghusinga
- Department of Electrical and Computer Engineering, University of Delaware, Newark, DE, United States of America
| | | | | | | |
Collapse
|
33
|
Weber MF, Frey E. Master equations and the theory of stochastic path integrals. REPORTS ON PROGRESS IN PHYSICS. PHYSICAL SOCIETY (GREAT BRITAIN) 2017; 80:046601. [PMID: 28306551 DOI: 10.1088/1361-6633/aa5ae2] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
This review provides a pedagogic and self-contained introduction to master equations and to their representation by path integrals. Since the 1930s, master equations have served as a fundamental tool to understand the role of fluctuations in complex biological, chemical, and physical systems. Despite their simple appearance, analyses of master equations most often rely on low-noise approximations such as the Kramers-Moyal or the system size expansion, or require ad-hoc closure schemes for the derivation of low-order moment equations. We focus on numerical and analytical methods going beyond the low-noise limit and provide a unified framework for the study of master equations. After deriving the forward and backward master equations from the Chapman-Kolmogorov equation, we show how the two master equations can be cast into either of four linear partial differential equations (PDEs). Three of these PDEs are discussed in detail. The first PDE governs the time evolution of a generalized probability generating function whose basis depends on the stochastic process under consideration. Spectral methods, WKB approximations, and a variational approach have been proposed for the analysis of the PDE. The second PDE is novel and is obeyed by a distribution that is marginalized over an initial state. It proves useful for the computation of mean extinction times. The third PDE describes the time evolution of a 'generating functional', which generalizes the so-called Poisson representation. Subsequently, the solutions of the PDEs are expressed in terms of two path integrals: a 'forward' and a 'backward' path integral. Combined with inverse transformations, one obtains two distinct path integral representations of the conditional probability distribution solving the master equations. We exemplify both path integrals in analysing elementary chemical reactions. Moreover, we show how a well-known path integral representation of averaged observables can be recovered from them. Upon expanding the forward and the backward path integrals around stationary paths, we then discuss and extend a recent method for the computation of rare event probabilities. Besides, we also derive path integral representations for processes with continuous state spaces whose forward and backward master equations admit Kramers-Moyal expansions. A truncation of the backward expansion at the level of a diffusion approximation recovers a classic path integral representation of the (backward) Fokker-Planck equation. One can rewrite this path integral in terms of an Onsager-Machlup function and, for purely diffusive Brownian motion, it simplifies to the path integral of Wiener. To make this review accessible to a broad community, we have used the language of probability theory rather than quantum (field) theory and do not assume any knowledge of the latter. The probabilistic structures underpinning various technical concepts, such as coherent states, the Doi-shift, and normal-ordered observables, are thereby made explicit.
Collapse
Affiliation(s)
- Markus F Weber
- Arnold Sommerfeld Center for Theoretical Physics and Center for NanoScience, Department of Physics, Ludwig-Maximilians-Universität München, Theresienstraße 37, 80333 München, Germany
| | - Erwin Frey
- Arnold Sommerfeld Center for Theoretical Physics and Center for NanoScience, Department of Physics, Ludwig-Maximilians-Universität München, Theresienstraße 37, 80333 München, Germany
| |
Collapse
|
34
|
Fan S, Geissmann Q, Lakatos E, Lukauskas S, Ale A, Babtie AC, Kirk PDW, Stumpf MPH. MEANS: python package for Moment Expansion Approximation, iNference and Simulation. Bioinformatics 2016; 32:2863-5. [PMID: 27153663 PMCID: PMC5018365 DOI: 10.1093/bioinformatics/btw229] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2015] [Revised: 03/23/2016] [Accepted: 04/21/2016] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Many biochemical systems require stochastic descriptions. Unfortunately these can only be solved for the simplest cases and their direct simulation can become prohibitively expensive, precluding thorough analysis. As an alternative, moment closure approximation methods generate equations for the time-evolution of the system's moments and apply a closure ansatz to obtain a closed set of differential equations; that can become the basis for the deterministic analysis of the moments of the outputs of stochastic systems. RESULTS We present a free, user-friendly tool implementing an efficient moment expansion approximation with parametric closures that integrates well with the IPython interactive environment. Our package enables the analysis of complex stochastic systems without any constraints on the number of species and moments studied and the type of rate laws in the system. In addition to the approximation method our package provides numerous tools to help non-expert users in stochastic analysis. AVAILABILITY AND IMPLEMENTATION https://github.com/theosysbio/means CONTACTS m.stumpf@imperial.ac.uk or e.lakatos13@imperial.ac.uk SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Sisi Fan
- Centre for Integrative Systems Biology and Bioinformatics, Department of Life Sciences, Imperial College London, London SW7 2AZ, UK
| | - Quentin Geissmann
- Centre for Integrative Systems Biology and Bioinformatics, Department of Life Sciences, Imperial College London, London SW7 2AZ, UK
| | - Eszter Lakatos
- Centre for Integrative Systems Biology and Bioinformatics, Department of Life Sciences, Imperial College London, London SW7 2AZ, UK
| | - Saulius Lukauskas
- Centre for Integrative Systems Biology and Bioinformatics, Department of Life Sciences, Imperial College London, London SW7 2AZ, UK
| | - Angelique Ale
- Centre for Integrative Systems Biology and Bioinformatics, Department of Life Sciences, Imperial College London, London SW7 2AZ, UK
| | - Ann C Babtie
- Centre for Integrative Systems Biology and Bioinformatics, Department of Life Sciences, Imperial College London, London SW7 2AZ, UK
| | | | - Michael P H Stumpf
- Centre for Integrative Systems Biology and Bioinformatics, Department of Life Sciences, Imperial College London, London SW7 2AZ, UK
| |
Collapse
|
35
|
Huang L, Pauleve L, Zechner C, Unger M, Hansen AS, Koeppl H. Reconstructing dynamic molecular states from single-cell time series. J R Soc Interface 2016; 13:20160533. [PMID: 27605167 PMCID: PMC5046952 DOI: 10.1098/rsif.2016.0533] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2016] [Accepted: 08/09/2016] [Indexed: 11/12/2022] Open
Abstract
The notion of state for a system is prevalent in the quantitative sciences and refers to the minimal system summary sufficient to describe the time evolution of the system in a self-consistent manner. This is a prerequisite for a principled understanding of the inner workings of a system. Owing to the complexity of intracellular processes, experimental techniques that can retrieve a sufficient summary are beyond our reach. For the case of stochastic biomolecular reaction networks, we show how to convert the partial state information accessible by experimental techniques into a full system state using mathematical analysis together with a computational model. This is intimately related to the notion of conditional Markov processes and we introduce the posterior master equation and derive novel approximations to the corresponding infinite-dimensional posterior moment dynamics. We exemplify this state reconstruction approach using both in silico data and single-cell data from two gene expression systems in Saccharomyces cerevisiae, where we reconstruct the dynamic promoter and mRNA states from noisy protein abundance measurements.
Collapse
Affiliation(s)
- Lirong Huang
- Institute of Molecular Systems Biology, D-BIOL, ETH Zurich, Zurich, Switzerland
| | - Loic Pauleve
- Laboratoire de Recherche en Informatique, Université Paris-Sud, Orsay, France
| | - Christoph Zechner
- Department of Biosystems Science and Engineering, ETH Zurich, Zurich, Switzerland
| | - Michael Unger
- Institute of Biochemistry, ETH Zurich, Zurich, Switzerland
| | - Anders S Hansen
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA, USA
| | - Heinz Koeppl
- Department of Electrical Engineering and Information Technology, Technische Universität Darmstadt, Darmstadt, Germany Department of Biology, Technische Universität Darmstadt, Darmstadt, Germany
| |
Collapse
|
36
|
Lenive O, W Kirk PD, H Stumpf MP. Inferring extrinsic noise from single-cell gene expression data using approximate Bayesian computation. BMC SYSTEMS BIOLOGY 2016; 10:81. [PMID: 27549182 PMCID: PMC4994381 DOI: 10.1186/s12918-016-0324-x] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/28/2015] [Accepted: 07/22/2016] [Indexed: 12/29/2022]
Abstract
Background Gene expression is known to be an intrinsically stochastic process which can involve single-digit numbers of mRNA molecules in a cell at any given time. The modelling of such processes calls for the use of exact stochastic simulation methods, most notably the Gillespie algorithm. However, this stochasticity, also termed “intrinsic noise”, does not account for all the variability between genetically identical cells growing in a homogeneous environment. Despite substantial experimental efforts, determining appropriate model parameters continues to be a challenge. Methods based on approximate Bayesian computation can be used to obtain posterior parameter distributions given the observed data. However, such inference procedures require large numbers of simulations of the model and exact stochastic simulation is computationally costly. In this work we focus on the specific case of trying to infer model parameters describing reaction rates and extrinsic noise on the basis of measurements of molecule numbers in individual cells at a given time point. Results To make the problem computationally tractable we develop an exact, model-specific, stochastic simulation algorithm for the commonly used two-state model of gene expression. This algorithm relies on certain assumptions and favourable properties of the model to forgo the simulation of the whole temporal trajectory of protein numbers in the system, instead returning only the number of protein and mRNA molecules present in the system at a specified time point. The computational gain is proportional to the number of protein molecules created in the system and becomes significant for systems involving hundreds or thousands of protein molecules. Conclusions We employ this simulation algorithm with approximate Bayesian computation to jointly infer the model’s rate and noise parameters from published gene expression data. Our analysis indicates that for most genes the extrinsic contributions to noise will be small to moderate but certainly are non-negligible. Electronic supplementary material The online version of this article (doi:10.1186/s12918-016-0324-x) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | - Paul D W Kirk
- MRC Biostatistics Unit, Cambridge Institute of Public Health, Cambridge, UK
| | - Michael P H Stumpf
- Imperial College, London, Centre for Integrative Systems Biology and Bioinformatics, London, SW7 2AZ, UK.
| |
Collapse
|
37
|
Schnoerr D, Sanguinetti G, Grima R. Comparison of different moment-closure approximations for stochastic chemical kinetics. J Chem Phys 2015; 143:185101. [DOI: 10.1063/1.4934990] [Citation(s) in RCA: 67] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Affiliation(s)
- David Schnoerr
- School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
- School of Informatics, University of Edinburgh, Edinburgh, United Kingdom
| | - Guido Sanguinetti
- School of Informatics, University of Edinburgh, Edinburgh, United Kingdom
| | - Ramon Grima
- School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| |
Collapse
|