1
|
Fronk C, Yun J, Singh P, Petzold L. Bayesian polynomial neural networks and polynomial neural ordinary differential equations. PLoS Comput Biol 2024; 20:e1012414. [PMID: 39388392 PMCID: PMC11476690 DOI: 10.1371/journal.pcbi.1012414] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Accepted: 08/14/2024] [Indexed: 10/12/2024] Open
Abstract
Symbolic regression with polynomial neural networks and polynomial neural ordinary differential equations (ODEs) are two recent and powerful approaches for equation recovery of many science and engineering problems. However, these methods provide point estimates for the model parameters and are currently unable to accommodate noisy data. We address this challenge by developing and validating the following Bayesian inference methods: the Laplace approximation, Markov Chain Monte Carlo (MCMC) sampling methods, and variational inference. We have found the Laplace approximation to be the best method for this class of problems. Our work can be easily extended to the broader class of symbolic neural networks to which the polynomial neural network belongs.
Collapse
Affiliation(s)
- Colby Fronk
- Department of Chemical Engineering, University of California, Santa Barbara, California; United States of America
| | - Jaewoong Yun
- Department of Statistics and Applied Probability, University of California, Santa Barbara, California; United States of America
- Department of Geography, University of California, Santa Barbara, California; United States of America
| | - Prashant Singh
- Science for Life Laboratory, Department of Information Technology, Uppsala University, Uppsala, Sweden
| | - Linda Petzold
- Department of Mechanical Engineering, University of California, Santa Barbara, California; United States of America
- Department of Computer Science, University of California, Santa Barbara, California; United States of America
| |
Collapse
|
2
|
Saldaña F, Stollenwerk N, Van Dierdonck JB, Aguiar M. Modeling spillover dynamics: understanding emerging pathogens of public health concern. Sci Rep 2024; 14:9823. [PMID: 38684927 PMCID: PMC11058258 DOI: 10.1038/s41598-024-60661-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Accepted: 04/25/2024] [Indexed: 05/02/2024] Open
Abstract
The emergence of infectious diseases with pandemic potential is a major public health threat worldwide. The World Health Organization reports that about 60% of emerging infectious diseases are zoonoses, originating from spillover events. Although the mechanisms behind spillover events remain unclear, mathematical modeling offers a way to understand the intricate interactions among pathogens, wildlife, humans, and their shared environment. Aiming at gaining insights into the dynamics of spillover events and the outcome of an eventual disease outbreak in a population, we propose a continuous time stochastic modeling framework. This framework links the dynamics of animal reservoirs and human hosts to simulate cross-species disease transmission. We conduct a thorough analysis of the model followed by numerical experiments that explore various spillover scenarios. The results suggest that although most epidemic outbreaks caused by novel zoonotic pathogens do not persist in the human population, the rising number of spillover events can avoid long-lasting extinction and lead to unexpected large outbreaks. Hence, global efforts to reduce the impacts of emerging diseases should not only address post-emergence outbreak control but also need to prevent pandemics before they are established.
Collapse
Affiliation(s)
| | - Nico Stollenwerk
- Basque Center for Applied Mathematics (BCAM), Bilbao, Spain
- Ikerbasque, Basque Foundation for Science, Bilbao, Spain
| | | | - Maíra Aguiar
- Basque Center for Applied Mathematics (BCAM), Bilbao, Spain.
- Ikerbasque, Basque Foundation for Science, Bilbao, Spain.
- Dipartimento di Matematica, Università degli Studi di Trento, Trento, Italy.
| |
Collapse
|
3
|
Loman TE, Ma Y, Ilin V, Gowda S, Korsbo N, Yewale N, Rackauckas C, Isaacson SA. Catalyst: Fast and flexible modeling of reaction networks. PLoS Comput Biol 2023; 19:e1011530. [PMID: 37851697 PMCID: PMC10584191 DOI: 10.1371/journal.pcbi.1011530] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2023] [Accepted: 09/19/2023] [Indexed: 10/20/2023] Open
Abstract
We introduce Catalyst.jl, a flexible and feature-filled Julia library for modeling and high-performance simulation of chemical reaction networks (CRNs). Catalyst supports simulating stochastic chemical kinetics (jump process), chemical Langevin equation (stochastic differential equation), and reaction rate equation (ordinary differential equation) representations for CRNs. Through comprehensive benchmarks, we demonstrate that Catalyst simulation runtimes are often one to two orders of magnitude faster than other popular tools. More broadly, Catalyst acts as both a domain-specific language and an intermediate representation for symbolically encoding CRN models as Julia-native objects. This enables a pipeline of symbolically specifying, analyzing, and modifying CRNs; converting Catalyst models to symbolic representations of concrete mathematical models; and generating compiled code for numerical solvers. Leveraging ModelingToolkit.jl and Symbolics.jl, Catalyst models can be analyzed, simplified, and compiled into optimized representations for use in numerical solvers. Finally, we demonstrate Catalyst's broad extensibility and composability by highlighting how it can compose with a variety of Julia libraries, and how existing open-source biological modeling projects have extended its intermediate representation.
Collapse
Affiliation(s)
- Torkel E. Loman
- Sainsbury Laboratory, University of Cambridge, Cambridge, United Kingdom
- Computer Science and AI Laboratory (CSAIL), Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Yingbo Ma
- JuliaHub, Cambridge, Massachusetts, United States of America
| | - Vasily Ilin
- Department of Mathematics, University of Washington, Seattle, Washington, United States of America
| | - Shashi Gowda
- Department of Mathematics, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Niklas Korsbo
- Pumas-AI, Baltimore, Maryland, United States of America
| | - Nikhil Yewale
- Department of Applied Mechanics, Indian Institute of Technology Madras, Chennai, India
| | - Chris Rackauckas
- Computer Science and AI Laboratory (CSAIL), Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- JuliaHub, Cambridge, Massachusetts, United States of America
- Pumas-AI, Baltimore, Maryland, United States of America
| | - Samuel A. Isaacson
- Department of Mathematics and Statistics, Boston University, Boston, Massachusetts, United States of America
| |
Collapse
|
4
|
Melo MCR, Bernardi RC. Fostering discoveries in the era of exascale computing: How the next generation of supercomputers empowers computational and experimental biophysics alike. Biophys J 2023; 122:2833-2840. [PMID: 36738105 PMCID: PMC10398237 DOI: 10.1016/j.bpj.2023.01.042] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Revised: 01/24/2023] [Accepted: 01/30/2023] [Indexed: 02/05/2023] Open
Abstract
Over a century ago, physicists started broadly relying on theoretical models to guide new experiments. Soon thereafter, chemists began doing the same. Now, biological research enters a new era when experiment and theory walk hand in hand. Novel software and specialized hardware became essential to understand experimental data and propose new models. In fact, current petascale computing resources already allow researchers to reach unprecedented levels of simulation throughput to connect in silico and in vitro experiments. The reduction in cost and improved access allowed a large number of research groups to adopt supercomputing resources and techniques. Here, we outline how large-scale computing has evolved to expand decades-old research, spark new research efforts, and continuously connect simulation and observation. For instance, multiple publicly and privately funded groups have dedicated extensive resources to develop artificial intelligence tools for computational biophysics, from accelerating quantum chemistry calculations to proposing protein structure models. Moreover, advances in computer hardware have accelerated data processing from single-molecule experimental observations and simulations of chemical reactions occurring throughout entire cells. The combination of software and hardware has opened the way for exascale computing and the production of the first public exascale supercomputer, Frontier, inaugurated by the Oak Ridge National Laboratory in 2022. Ultimately, the popularization and development of computational techniques and the training of researchers to use them will only accelerate the diversification of tools and learning resources for future generations.
Collapse
Affiliation(s)
- Marcelo C R Melo
- Auburn University, Department of Physics, Auburn University, Auburn, Alabama
| | - Rafael C Bernardi
- Auburn University, Department of Physics, Auburn University, Auburn, Alabama.
| |
Collapse
|
5
|
Matthew S, Carter F, Cooper J, Dippel M, Green E, Hodges S, Kidwell M, Nickerson D, Rumsey B, Reeve J, Petzold LR, Sanft KR, Drawert B. GillesPy2: A Biochemical Modeling Framework for Simulation Driven Biological Discovery. LETTERS IN BIOMATHEMATICS 2023; 10:87-103. [PMID: 37655179 PMCID: PMC10470263] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Subscribe] [Scholar Register] [Indexed: 09/02/2023]
Abstract
Stochastic modeling has become an essential tool for studying biochemical reaction networks. There is a growing need for user-friendly and feature-complete software for model design and simulation. To address this need, we present GillesPy2, an open-source framework for building and simulating mathematical and biochemical models. GillesPy2, a major upgrade from the original GillesPy package, is now a stand-alone Python 3 package. GillesPy2 offers an intuitive interface for robust and reproducible model creation, facilitating rapid and iterative development. In addition to expediting the model creation process, GillesPy2 offers efficient algorithms to simulate stochastic, deterministic, and hybrid stochastic-deterministic models.
Collapse
Affiliation(s)
- Sean Matthew
- National Environmental Modeling and Analysis Center (NEMAC), University of North Carolina, Asheville, NC 28804
| | - Fin Carter
- Department of Computer Science, University of North Carolina, Asheville, NC 28804
| | - Joshua Cooper
- Department of Computer Science, University of North Carolina, Asheville, NC 28804
| | - Matthew Dippel
- Department of Computer Science, University of North Carolina, Asheville, NC 28804
| | - Ethan Green
- Department of Computer Science, University of North Carolina, Asheville, NC 28804
| | - Samuel Hodges
- Department of Computer Science, North Carolina State University, NC 27695
| | - Mason Kidwell
- Department of Computer Science, University of North Carolina, Asheville, NC 28804
| | - Dalton Nickerson
- Department of Computer Science, University of North Carolina, Asheville, NC 28804
| | - Bryan Rumsey
- National Environmental Modeling and Analysis Center (NEMAC), University of North Carolina, Asheville, NC 28804
| | - Jesse Reeve
- Department of Computer Science, University of North Carolina, Asheville, NC 28804
| | - Linda R. Petzold
- Department of Computer Science and Department of Mechanical Engineering, University of California, Santa Barbara, CA 93106
| | - Kevin R. Sanft
- Department of Computer Science, University of North Carolina, Asheville, NC 28804
| | - Brian Drawert
- National Environmental Modeling and Analysis Center (NEMAC), University of North Carolina, Asheville, NC 28804
| |
Collapse
|
6
|
Coulier A, Singh P, Sturrock M, Hellander A. Systematic comparison of modeling fidelity levels and parameter inference settings applied to negative feedback gene regulation. PLoS Comput Biol 2022; 18:e1010683. [PMID: 36520957 PMCID: PMC9799300 DOI: 10.1371/journal.pcbi.1010683] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2022] [Revised: 12/29/2022] [Accepted: 10/25/2022] [Indexed: 12/23/2022] Open
Abstract
Quantitative stochastic models of gene regulatory networks are important tools for studying cellular regulation. Such models can be formulated at many different levels of fidelity. A practical challenge is to determine what model fidelity to use in order to get accurate and representative results. The choice is important, because models of successively higher fidelity come at a rapidly increasing computational cost. In some situations, the level of detail is clearly motivated by the question under study. In many situations however, many model options could qualitatively agree with available data, depending on the amount of data and the nature of the observations. Here, an important distinction is whether we are interested in inferring the true (but unknown) physical parameters of the model or if it is sufficient to be able to capture and explain available data. The situation becomes complicated from a computational perspective because inference needs to be approximate. Most often it is based on likelihood-free Approximate Bayesian Computation (ABC) and here determining which summary statistics to use, as well as how much data is needed to reach the desired level of accuracy, are difficult tasks. Ultimately, all of these aspects-the model fidelity, the available data, and the numerical choices for inference-interplay in a complex manner. In this paper we develop a computational pipeline designed to systematically evaluate inference accuracy for a wide range of true known parameters. We then use it to explore inference settings for negative feedback gene regulation. In particular, we compare a detailed spatial stochastic model, a coarse-grained compartment-based multiscale model, and the standard well-mixed model, across several data-scenarios and for multiple numerical options for parameter inference. Practically speaking, this pipeline can be used as a preliminary step to guide modelers prior to gathering experimental data. By training Gaussian processes to approximate the distance function values, we are able to substantially reduce the computational cost of running the pipeline.
Collapse
Affiliation(s)
- Adrien Coulier
- Department of Information Technology, Uppsala University, Uppsala, Sweden
| | - Prashant Singh
- Science for Life Laboratory, Department of Information Technology, Uppsala University, Uppsala, Sweden
| | - Marc Sturrock
- Department of Physiology, Royal College of Surgeons in Ireland, Dublin, Ireland
| | - Andreas Hellander
- Department of Information Technology, Uppsala University, Uppsala, Sweden
- * E-mail:
| |
Collapse
|
7
|
AKesson M, Singh P, Wrede F, Hellander A. Convolutional Neural Networks as Summary Statistics for Approximate Bayesian Computation. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:3353-3365. [PMID: 34460381 PMCID: PMC9847490 DOI: 10.1109/tcbb.2021.3108695] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
Approximate Bayesian Computation is widely used in systems biology for inferring parameters in stochastic gene regulatory network models. Its performance hinges critically on the ability to summarize high-dimensional system responses such as time series into a few informative, low-dimensional summary statistics. The quality of those statistics acutely impacts the accuracy of the inference task. Existing methods to select the best subset out of a pool of candidate statistics do not scale well with large pools of several tens to hundreds of candidate statistics. Since high quality statistics are imperative for good performance, this becomes a serious bottleneck when performing inference on complex and high-dimensional problems. This paper proposes a convolutional neural network architecture for automatically learning informative summary statistics of temporal responses. We show that the proposed network can effectively circumvent the statistics selection problem of the preprocessing step for ABC inference. The proposed approach is demonstrated on two benchmark problem and one challenging inference problem learning parameters in a high-dimensional stochastic genetic oscillator. We also study the impact of experimental design on network performance by comparing different data richness and data acquisition strategies.
Collapse
|
8
|
Kundu S. TemporalGSSA: a numerically robust R-wrapper to facilitate computation of a metabolite-specific and simulation time-dependent trajectory from stochastic simulation algorithm (SSA)-generated datasets. J Bioinform Comput Biol 2022; 20:2250018. [DOI: 10.1142/s0219720022500184] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
9
|
Drawert B, Flies AS, Matthew S, Powell M, Rumsey B. Saving the Devils Is in the Details: Tasmanian Devil Facial Tumor Disease Can Be Eliminated with Interventions. LETTERS IN BIOMATHEMATICS 2022; 9:121-140. [PMID: 37655234 PMCID: PMC10470821 DOI: 10.30707/lib9.1.1681913305.269822] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/02/2023]
Abstract
Tasmanian Devils facial tumor disease (DFTD) is severely impacting the population of this wild animal. We developed a computational model of the population of Tasmanian Devils, and the change induced by DFTD. We use this model to test possible intervention strategies Tasmanian conservationists could do. We investigate bait drop vaccination programs, diseased animal removals programs, and evolution of natural immunity. We conclude that a combination of intervention strategies gives the most favorable outcome. An additional goal of this paper is reproducibility of our results. Our StochSS software platform features the ability to share and reproduce the computational notebooks that created all of the results in the paper. We endeavor that all readers should be able to reproduce our results with minimum effort.
Collapse
Affiliation(s)
- Brian Drawert
- National Environmental Modeling and Analysis Center, University of North Carolina Asheville, Asheville, NC, USA
| | - Andrew S. Flies
- Menzies Institute for Medical Research, College of Health and Medicine, University of Tasmania, Hobart, TAS, 7000, Australia
| | - Sean Matthew
- National Environmental Modeling and Analysis Center, University of North Carolina Asheville, Asheville, NC, USA
| | - Megan Powell
- Department of Mathematics and Statistics, University of North Carolina Asheville, Asheville, NC, USA
| | - Bryan Rumsey
- National Environmental Modeling and Analysis Center, University of North Carolina Asheville, Asheville, NC, USA
| |
Collapse
|
10
|
Song YM, Hong H, Kim JK. Universally valid reduction of multiscale stochastic biochemical systems using simple non-elementary propensities. PLoS Comput Biol 2021; 17:e1008952. [PMID: 34662330 PMCID: PMC8562860 DOI: 10.1371/journal.pcbi.1008952] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2021] [Revised: 11/02/2021] [Accepted: 10/04/2021] [Indexed: 11/19/2022] Open
Abstract
Biochemical systems consist of numerous elementary reactions governed by the law of mass action. However, experimentally characterizing all the elementary reactions is nearly impossible. Thus, over a century, their deterministic models that typically contain rapid reversible bindings have been simplified with non-elementary reaction functions (e.g., Michaelis-Menten and Morrison equations). Although the non-elementary reaction functions are derived by applying the quasi-steady-state approximation (QSSA) to deterministic systems, they have also been widely used to derive propensities for stochastic simulations due to computational efficiency and simplicity. However, the validity condition for this heuristic approach has not been identified even for the reversible binding between molecules, such as protein-DNA, enzyme-substrate, and receptor-ligand, which is the basis for living cells. Here, we find that the non-elementary propensities based on the deterministic total QSSA can accurately capture the stochastic dynamics of the reversible binding in general. However, serious errors occur when reactant molecules with similar levels tightly bind, unlike deterministic systems. In that case, the non-elementary propensities distort the stochastic dynamics of a bistable switch in the cell cycle and an oscillator in the circadian clock. Accordingly, we derive alternative non-elementary propensities with the stochastic low-state QSSA, developed in this study. This provides a universally valid framework for simplifying multiscale stochastic biochemical systems with rapid reversible bindings, critical for efficient stochastic simulations of cell signaling and gene regulation. To facilitate the framework, we provide a user-friendly open-source computational package, ASSISTER, that automatically performs the present framework.
Collapse
Affiliation(s)
- Yun Min Song
- Department of Mathematical Sciences, Korea Advanced Institute of Science and Technology, Daejeon, Republic of Korea
- Biomedical Mathematics Group, Institute for Basic Science, Daejeon, Republic of Korea
| | - Hyukpyo Hong
- Department of Mathematical Sciences, Korea Advanced Institute of Science and Technology, Daejeon, Republic of Korea
- Biomedical Mathematics Group, Institute for Basic Science, Daejeon, Republic of Korea
| | - Jae Kyoung Kim
- Department of Mathematical Sciences, Korea Advanced Institute of Science and Technology, Daejeon, Republic of Korea
- Biomedical Mathematics Group, Institute for Basic Science, Daejeon, Republic of Korea
| |
Collapse
|
11
|
Lubbock AL, Lopez CF. Programmatic modeling for biological systems. CURRENT OPINION IN SYSTEMS BIOLOGY 2021; 27:100343. [PMID: 34485764 PMCID: PMC8411905 DOI: 10.1016/j.coisb.2021.05.004] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/30/2023]
Abstract
Computational modeling has become an established technique to encode mathematical representations of cellular processes and gain mechanistic insights that drive testable predictions. These models are often constructed using graphical user interfaces or domain-specific languages, with community standards used for interchange. Models undergo steady state or dynamic analysis, which can include simulation and calibration within a single application, or transfer across various tools. Here, we describe a novel programmatic modeling paradigm, whereby modeling is augmented with software engineering best practices. We focus on Python - a popular programming language with a large scientific package ecosystem. Models can be encoded as programs, adding benefits such as modularity, testing, and automated documentation generators, while still being extensible and exportable to standardized formats for use with external tools if desired. Programmatic modeling is a key technology to enable collaborative model development and enhance dissemination, transparency, and reproducibility.
Collapse
Affiliation(s)
- Alexander L.R. Lubbock
- Department of Biochemistry, Vanderbilt University, Nashville, Tennessee 37212, United States of America
- Vanderbilt-Ingram Cancer Center, Vanderbilt University, Nashville Tennessee 37212, United States of America
| | - Carlos F. Lopez
- Department of Biochemistry, Vanderbilt University, Nashville, Tennessee 37212, United States of America
- Vanderbilt-Ingram Cancer Center, Vanderbilt University, Nashville Tennessee 37212, United States of America
- Department of Biomedical Informatics, Vanderbilt University, Nashville, Tennessee 37212, United States of America
| |
Collapse
|
12
|
Tangherloni A, Nobile MS, Cazzaniga P, Capitoli G, Spolaor S, Rundo L, Mauri G, Besozzi D. FiCoS: A fine-grained and coarse-grained GPU-powered deterministic simulator for biochemical networks. PLoS Comput Biol 2021; 17:e1009410. [PMID: 34499658 PMCID: PMC8476010 DOI: 10.1371/journal.pcbi.1009410] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2021] [Revised: 09/27/2021] [Accepted: 08/28/2021] [Indexed: 11/19/2022] Open
Abstract
Mathematical models of biochemical networks can largely facilitate the comprehension of the mechanisms at the basis of cellular processes, as well as the formulation of hypotheses that can be tested by means of targeted laboratory experiments. However, two issues might hamper the achievement of fruitful outcomes. On the one hand, detailed mechanistic models can involve hundreds or thousands of molecular species and their intermediate complexes, as well as hundreds or thousands of chemical reactions, a situation generally occurring in rule-based modeling. On the other hand, the computational analysis of a model typically requires the execution of a large number of simulations for its calibration, or to test the effect of perturbations. As a consequence, the computational capabilities of modern Central Processing Units can be easily overtaken, possibly making the modeling of biochemical networks a worthless or ineffective effort. To the aim of overcoming the limitations of the current state-of-the-art simulation approaches, we present in this paper FiCoS, a novel "black-box" deterministic simulator that effectively realizes both a fine-grained and a coarse-grained parallelization on Graphics Processing Units. In particular, FiCoS exploits two different integration methods, namely, the Dormand-Prince and the Radau IIA, to efficiently solve both non-stiff and stiff systems of coupled Ordinary Differential Equations. We tested the performance of FiCoS against different deterministic simulators, by considering models of increasing size and by running analyses with increasing computational demands. FiCoS was able to dramatically speedup the computations up to 855×, showing to be a promising solution for the simulation and analysis of large-scale models of complex biological processes.
Collapse
Affiliation(s)
- Andrea Tangherloni
- Department of Human and Social Sciences, University of Bergamo, Bergamo, Italy
| | - Marco S. Nobile
- Department of Industrial Engineering & Innovation Sciences, Eindhoven University of Technology, Eindhoven, The Netherlands
- SYSBIO/ISBE.IT Centre of Systems Biology, Milan, Italy
- Bicocca Bioinformatics Biostatistics and Bioimaging Centre (B4), University of Milano-Bicocca, Vedano al Lambro, Italy
| | - Paolo Cazzaniga
- Department of Human and Social Sciences, University of Bergamo, Bergamo, Italy
- SYSBIO/ISBE.IT Centre of Systems Biology, Milan, Italy
- Bicocca Bioinformatics Biostatistics and Bioimaging Centre (B4), University of Milano-Bicocca, Vedano al Lambro, Italy
| | - Giulia Capitoli
- Bicocca Bioinformatics Biostatistics and Bioimaging Centre (B4), University of Milano-Bicocca, Vedano al Lambro, Italy
- School of Medicine and Surgery, University of Milano-Bicocca, Monza, Italy
| | - Simone Spolaor
- Bicocca Bioinformatics Biostatistics and Bioimaging Centre (B4), University of Milano-Bicocca, Vedano al Lambro, Italy
- Department of Informatics, Systems and Communication, University of Milano-Bicocca, Milan, Italy
| | - Leonardo Rundo
- Department of Radiology, University of Cambridge, Cambridge, United Kingdom
- Cancer Research UK Cambridge Centre, University of Cambridge, Cambridge, United Kingdom
| | - Giancarlo Mauri
- SYSBIO/ISBE.IT Centre of Systems Biology, Milan, Italy
- Bicocca Bioinformatics Biostatistics and Bioimaging Centre (B4), University of Milano-Bicocca, Vedano al Lambro, Italy
- Department of Informatics, Systems and Communication, University of Milano-Bicocca, Milan, Italy
| | - Daniela Besozzi
- SYSBIO/ISBE.IT Centre of Systems Biology, Milan, Italy
- Bicocca Bioinformatics Biostatistics and Bioimaging Centre (B4), University of Milano-Bicocca, Vedano al Lambro, Italy
- Department of Informatics, Systems and Communication, University of Milano-Bicocca, Milan, Italy
| |
Collapse
|
13
|
Coulier A, Hellander S, Hellander A. A multiscale compartment-based model of stochastic gene regulatory networks using hitting-time analysis. J Chem Phys 2021; 154:184105. [PMID: 34241042 PMCID: PMC8116061 DOI: 10.1063/5.0010764] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Spatial stochastic models of single cell kinetics are capable of capturing both fluctuations in molecular numbers and the spatial dependencies of the key steps of intracellular regulatory networks. The spatial stochastic model can be simulated both on a detailed microscopic level using particle tracking and on a mesoscopic level using the reaction-diffusion master equation. However, despite substantial progress on simulation efficiency for spatial models in the last years, the computational cost quickly becomes prohibitively expensive for tasks that require repeated simulation of thousands or millions of realizations of the model. This limits the use of spatial models in applications such as multicellular simulations, likelihood-free parameter inference, and robustness analysis. Further approximation of the spatial dynamics is needed to accelerate such computational engineering tasks. We here propose a multiscale model where a compartment-based model approximates a detailed spatial stochastic model. The compartment model is constructed via a first-exit time analysis on the spatial model, thus capturing critical spatial aspects of the fine-grained simulations, at a cost close to the simple well-mixed model. We apply the multiscale model to a canonical model of negative-feedback gene regulation, assess its accuracy over a range of parameters, and demonstrate that the approximation can yield substantial speedups for likelihood-free parameter inference.
Collapse
Affiliation(s)
- Adrien Coulier
- Department of Information Technology, Uppsala University, Box 337, SE-755 01 Uppsala, Sweden
| | - Stefan Hellander
- Department of Information Technology, Uppsala University, Box 337, SE-755 01 Uppsala, Sweden
| | - Andreas Hellander
- Department of Information Technology, Uppsala University, Box 337, SE-755 01 Uppsala, Sweden
| |
Collapse
|
14
|
Shaikh B, Marupilla G, Wilson M, Blinov ML, Moraru II, Karr JR. RunBioSimulations: an extensible web application that simulates a wide range of computational modeling frameworks, algorithms, and formats. Nucleic Acids Res 2021; 49:W597-W602. [PMID: 34019658 PMCID: PMC8262693 DOI: 10.1093/nar/gkab411] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2021] [Revised: 04/18/2021] [Accepted: 04/30/2021] [Indexed: 11/13/2022] Open
Abstract
Comprehensive, predictive computational models have significant potential for science, bioengineering, and medicine. One promising way to achieve more predictive models is to combine submodels of multiple subsystems. To capture the multiple scales of biology, these submodels will likely require multiple modeling frameworks and simulation algorithms. Several community resources are already available for working with many of these frameworks and algorithms. However, the variety and sheer number of these resources make it challenging to find and use appropriate tools for each model, especially for novice modelers and experimentalists. To make these resources easier to use, we developed RunBioSimulations (https://run.biosimulations.org), a single web application for executing a broad range of models. RunBioSimulations leverages community resources, including BioSimulators, a new open registry of simulation tools. These resources currently enable RunBioSimulations to execute nine frameworks and 44 algorithms, and they make RunBioSimulations extensible to additional frameworks and algorithms. RunBioSimulations also provides features for sharing simulations and interactively visualizing their results. We anticipate that RunBioSimulations will foster reproducibility, stimulate collaboration, and ultimately facilitate the creation of more predictive models.
Collapse
Affiliation(s)
- Bilal Shaikh
- Icahn Institute for Data Science & Genomic Technology and Department of Genetics & Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1425 Madison Avenue, New York, NY 10029, USA
| | - Gnaneswara Marupilla
- Center for Cell Analysis and Modeling, University of Connecticut School of Medicine, 263 Farmington Avenue, Farmington, CT 06030, USA
| | - Mike Wilson
- Center for Cell Analysis and Modeling, University of Connecticut School of Medicine, 263 Farmington Avenue, Farmington, CT 06030, USA
| | - Michael L Blinov
- Center for Cell Analysis and Modeling, University of Connecticut School of Medicine, 263 Farmington Avenue, Farmington, CT 06030, USA
| | - Ion I Moraru
- Center for Cell Analysis and Modeling, University of Connecticut School of Medicine, 263 Farmington Avenue, Farmington, CT 06030, USA
| | - Jonathan R Karr
- Icahn Institute for Data Science & Genomic Technology and Department of Genetics & Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1425 Madison Avenue, New York, NY 10029, USA
| |
Collapse
|
15
|
Jiang RM, Wrede F, Singh P, Hellander A, Petzold LR. Accelerated regression-based summary statistics for discrete stochastic systems via approximate simulators. BMC Bioinformatics 2021; 22:339. [PMID: 34162329 PMCID: PMC8220802 DOI: 10.1186/s12859-021-04255-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2020] [Accepted: 06/10/2021] [Indexed: 12/04/2022] Open
Abstract
BACKGROUND Approximate Bayesian Computation (ABC) has become a key tool for calibrating the parameters of discrete stochastic biochemical models. For higher dimensional models and data, its performance is strongly dependent on having a representative set of summary statistics. While regression-based methods have been demonstrated to allow for the automatic construction of effective summary statistics, their reliance on first simulating a large training set creates a significant overhead when applying these methods to discrete stochastic models for which simulation is relatively expensive. In this τ work, we present a method to reduce this computational burden by leveraging approximate simulators of these systems, such as ordinary differential equations and τ-Leaping approximations. RESULTS We have developed an algorithm to accelerate the construction of regression-based summary statistics for Approximate Bayesian Computation by selectively using the faster approximate algorithms for simulations. By posing the problem as one of ratio estimation, we use state-of-the-art methods in machine learning to show that, in many cases, our algorithm can significantly reduce the number of simulations from the full resolution model at a minimal cost to accuracy and little additional tuning from the user. We demonstrate the usefulness and robustness of our method with four different experiments. CONCLUSIONS We provide a novel algorithm for accelerating the construction of summary statistics for stochastic biochemical systems. Compared to the standard practice of exclusively training from exact simulator samples, our method is able to dramatically reduce the number of required calls to the stochastic simulator at a minimal loss in accuracy. This can immediately be implemented to increase the overall speed of the ABC workflow for estimating parameters in complex systems.
Collapse
Affiliation(s)
- Richard M. Jiang
- Department of Computer Science, University of California, Santa Barbara, Santa Barbara, USA
| | - Fredrik Wrede
- Department of Information Technology, Uppsala University, Uppsala, Sweden
| | - Prashant Singh
- Department of Information Technology, Uppsala University, Uppsala, Sweden
| | - Andreas Hellander
- Department of Information Technology, Uppsala University, Uppsala, Sweden
| | - Linda R. Petzold
- Department of Computer Science, University of California, Santa Barbara, Santa Barbara, USA
| |
Collapse
|
16
|
Singh P, Wrede F, Hellander A. Scalable machine learning-assisted model exploration and inference using Sciope. Bioinformatics 2021; 37:279-281. [PMID: 32706854 PMCID: PMC8055224 DOI: 10.1093/bioinformatics/btaa673] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2020] [Revised: 05/18/2020] [Accepted: 07/20/2020] [Indexed: 11/13/2022] Open
Abstract
SUMMARY Discrete stochastic models of gene regulatory networks are fundamental tools for in silico study of stochastic gene regulatory networks. Likelihood-free inference and model exploration are critical applications to study a system using such models. However, the massive computational cost of complex, high-dimensional and stochastic modelling currently limits systematic investigation to relatively simple systems. Recently, machine-learning-assisted methods have shown great promise to handle larger, more complex models. To support both ease-of-use of this new class of methods, as well as their further development, we have developed the scalable inference, optimization and parameter exploration (Sciope) toolbox. Sciope is designed to support new algorithms for machine-learning-assisted model exploration and likelihood-free inference. Moreover, it is built ground up to easily leverage distributed and heterogeneous computational resources for convenient parallelism across platforms from workstations to clouds. AVAILABILITY AND IMPLEMENTATION The Sciope Python3 toolbox is freely available on https://github.com/Sciope/Sciope, and has been tested on Linux, Windows and macOS platforms. SUPPLEMENTARY INFORMATION Supplementary information is available at Bioinformatics online.
Collapse
Affiliation(s)
- Prashant Singh
- Department of Information Technology, Uppsala University, 751 05 Uppsala, Sweden
| | - Fredrik Wrede
- Department of Information Technology, Uppsala University, 751 05 Uppsala, Sweden
| | - Andreas Hellander
- Department of Information Technology, Uppsala University, 751 05 Uppsala, Sweden
| |
Collapse
|
17
|
Jiang R, Jacob B, Geiger M, Matthew S, Rumsey B, Singh P, Wrede F, Yi TM, Drawert B, Hellander A, Petzold L. Epidemiological modeling in StochSS Live! Bioinformatics 2021; 37:2787-2788. [PMID: 33512399 PMCID: PMC7929247 DOI: 10.1093/bioinformatics/btab061] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2020] [Revised: 01/20/2021] [Accepted: 01/26/2021] [Indexed: 11/16/2022] Open
Abstract
Summary We present StochSS Live!, a web-based service for modeling, simulation and analysis of a wide range of mathematical, biological and biochemical systems. Using an epidemiological model of COVID-19, we demonstrate the power of StochSS Live! to enable researchers to quickly develop a deterministic or a discrete stochastic model, infer its parameters and analyze the results. Availability and implementation StochSS Live! is freely available at https://live.stochss.org/ Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Richard Jiang
- Department of Computer Science, University of California-Santa Barbara, Santa Barbara, CA, 93117, USA
| | - Bruno Jacob
- Department of Mechanical Engineering, University of California-Santa Barbara, Santa Barbara, California, 93106, USA
| | - Matthew Geiger
- National Environmental Modeling and Analysis Center (NEMAC), University of North Carolina at Asheville, Asheville, NC, 28804
| | - Sean Matthew
- National Environmental Modeling and Analysis Center (NEMAC), University of North Carolina at Asheville, Asheville, NC, 28804
| | - Bryan Rumsey
- National Environmental Modeling and Analysis Center (NEMAC), University of North Carolina at Asheville, Asheville, NC, 28804
| | - Prashant Singh
- Department of Information Technology, Uppsala University, Box 337, 751 05 Uppsala, Sweden
| | - Fredrik Wrede
- Department of Information Technology, Uppsala University, Box 337, 751 05 Uppsala, Sweden
| | - Tau-Mu Yi
- Department of Molecular, Cellular, and Developmental Biology, University of California-Santa Barbara, Santa Barbara, CA 93106, USA
| | - Brian Drawert
- Department of Computer Science, University of North Carolina at Asheville, Asheville, NC, 28804, USA
| | - Andreas Hellander
- Department of Information Technology, Uppsala University, Box 337, 751 05 Uppsala, Sweden
| | - Linda Petzold
- Department of Computer Science, University of California-Santa Barbara, Santa Barbara, CA, 93117, USA.,Department of Mechanical Engineering, University of California-Santa Barbara, Santa Barbara, California, 93106, USA
| |
Collapse
|
18
|
Johnson ME, Chen A, Faeder JR, Henning P, Moraru II, Meier-Schellersheim M, Murphy RF, Prüstel T, Theriot JA, Uhrmacher AM. Quantifying the roles of space and stochasticity in computer simulations for cell biology and cellular biochemistry. Mol Biol Cell 2021; 32:186-210. [PMID: 33237849 PMCID: PMC8120688 DOI: 10.1091/mbc.e20-08-0530] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2020] [Revised: 10/13/2020] [Accepted: 11/17/2020] [Indexed: 12/29/2022] Open
Abstract
Most of the fascinating phenomena studied in cell biology emerge from interactions among highly organized multimolecular structures embedded into complex and frequently dynamic cellular morphologies. For the exploration of such systems, computer simulation has proved to be an invaluable tool, and many researchers in this field have developed sophisticated computational models for application to specific cell biological questions. However, it is often difficult to reconcile conflicting computational results that use different approaches to describe the same phenomenon. To address this issue systematically, we have defined a series of computational test cases ranging from very simple to moderately complex, varying key features of dimensionality, reaction type, reaction speed, crowding, and cell size. We then quantified how explicit spatial and/or stochastic implementations alter outcomes, even when all methods use the same reaction network, rates, and concentrations. For simple cases, we generally find minor differences in solutions of the same problem. However, we observe increasing discordance as the effects of localization, dimensionality reduction, and irreversible enzymatic reactions are combined. We discuss the strengths and limitations of commonly used computational approaches for exploring cell biological questions and provide a framework for decision making by researchers developing new models. As computational power and speed continue to increase at a remarkable rate, the dream of a fully comprehensive computational model of a living cell may be drawing closer to reality, but our analysis demonstrates that it will be crucial to evaluate the accuracy of such models critically and systematically.
Collapse
Affiliation(s)
- M. E. Johnson
- Thomas C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, MD, 21218
| | - A. Chen
- Thomas C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, MD, 21218
| | - J. R. Faeder
- Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15260
| | - P. Henning
- Institute for Visual and Analytic Computing, University of Rostock, 18055 Rostock, Germany
| | - I. I. Moraru
- Department of Cell Biology, Center for Cell Analysis and Modeling, University of Connecticut Health Center, Farmington, CT 06030
| | - M. Meier-Schellersheim
- Laboratory of Immune System Biology, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD 20892
| | - R. F. Murphy
- Computational Biology Department, Department of Biological Sciences, Department of Biomedical Engineering, Machine Learning Department, Carnegie Mellon University, Pittsburgh, PA 15289
| | - T. Prüstel
- Laboratory of Immune System Biology, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD 20892
| | - J. A. Theriot
- Department of Biology and Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195
| | - A. M. Uhrmacher
- Institute for Visual and Analytic Computing, University of Rostock, 18055 Rostock, Germany
| |
Collapse
|
19
|
Abstract
This book chapter is drafted for biologists with experimental experiences in ROS biology but being newcomers in the field of modeling. We start with a general introduction about computational modeling in biology and an overview of software tools suitable for beginners. This chapter encompasses an introduction to computational models with special focus on simulation of ROS dynamics. A step-by-step tutorial follows providing guidance for all relevant model development processes. This course of action gives a comprehensible way to understand the benefits of computational models and to gain the necessary knowledge to build own small equation-based models. Small models can be created without any special programming expertise or in-depth technical and mathematical knowledge. Afterward in the final section, a short overview of pitfalls, challenges, and limitations is provided, combined with suggestions for further reading to improve and expand modeling skills of biologists.
Collapse
Affiliation(s)
- Jana Schleicher
- Experimental Transplantation Surgery, Department of General, Visceral and Vascular Surgery, University Hospital Jena, Jena, Germany.
| |
Collapse
|
20
|
Irurzun-Arana I, Rackauckas C, McDonald TO, Trocóniz IF. Beyond Deterministic Models in Drug Discovery and Development. Trends Pharmacol Sci 2020; 41:882-895. [PMID: 33032836 PMCID: PMC7534664 DOI: 10.1016/j.tips.2020.09.005] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2020] [Revised: 07/28/2020] [Accepted: 09/10/2020] [Indexed: 02/06/2023]
Abstract
The model-informed drug discovery and development paradigm is now well established among the pharmaceutical industry and regulatory agencies. This success has been mainly due to the ability of pharmacometrics to bring together different modeling strategies, such as population pharmacokinetics/pharmacodynamics (PK/PD) and systems biology/pharmacology. However, there are promising quantitative approaches that are still seldom used by pharmacometricians and that deserve consideration. One such case is the stochastic modeling approach, which can be important when modeling small populations because random events can have a huge impact on these systems. In this review, we aim to raise awareness of stochastic models and how to combine them with existing modeling techniques, with the ultimate goal of making future drug-disease models more versatile and realistic.
Collapse
Affiliation(s)
- Itziar Irurzun-Arana
- Pharmacometrics and Systems Pharmacology, Department of Pharmaceutical Technology and Chemistry, School of Pharmacy and Nutrition, University of Navarra, Pamplona, 31008, Spain; Navarra Institute for Health Research (IdisNA), University of Navarra, 31080, Pamplona, Spain.
| | - Christopher Rackauckas
- Department of Mathematics, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Thomas O McDonald
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA 02115, USA; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA; Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA 02138, USA; Center for Cancer Evolution, Dana-Farber Cancer Institute, Boston, MA 02115, USA
| | - Iñaki F Trocóniz
- Pharmacometrics and Systems Pharmacology, Department of Pharmaceutical Technology and Chemistry, School of Pharmacy and Nutrition, University of Navarra, Pamplona, 31008, Spain; Navarra Institute for Health Research (IdisNA), University of Navarra, 31080, Pamplona, Spain; Institute of Data Science and Artificial Intelligence, DATAI, University of Navarra, Pamplona, 31080, Spain.
| |
Collapse
|
21
|
Porubsky VL, Goldberg AP, Rampadarath AK, Nickerson DP, Karr JR, Sauro HM. Best Practices for Making Reproducible Biochemical Models. Cell Syst 2020; 11:109-120. [PMID: 32853539 DOI: 10.1016/j.cels.2020.06.012] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2019] [Revised: 05/15/2020] [Accepted: 06/24/2020] [Indexed: 01/03/2023]
Abstract
Like many scientific disciplines, dynamical biochemical modeling is hindered by irreproducible results. This limits the utility of biochemical models by making them difficult to understand, trust, or reuse. We comprehensively list the best practices that biochemical modelers should follow to build reproducible biochemical model artifacts-all data, model descriptions, and custom software used by the model-that can be understood and reused. The best practices provide advice for all steps of a typical biochemical modeling workflow in which a modeler collects data; constructs, trains, simulates, and validates the model; uses the predictions of a model to advance knowledge; and publicly shares the model artifacts. The best practices emphasize the benefits obtained by using standard tools and formats and provides guidance to modelers who do not or cannot use standards in some stages of their modeling workflow. Adoption of these best practices will enhance the ability of researchers to reproduce, understand, and reuse biochemical models.
Collapse
Affiliation(s)
- Veronica L Porubsky
- Department of Bioengineering, University of Washington, Seattle, WA 98105, USA.
| | - Arthur P Goldberg
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA; Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA.
| | - Anand K Rampadarath
- Auckland Bioengineering Institute, University of Auckland, Auckland, New Zealand
| | - David P Nickerson
- Auckland Bioengineering Institute, University of Auckland, Auckland, New Zealand
| | - Jonathan R Karr
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA; Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Herbert M Sauro
- Department of Bioengineering, University of Washington, Seattle, WA 98105, USA
| |
Collapse
|
22
|
Wrede F, Hellander A. Smart computational exploration of stochastic gene regulatory network models using human-in-the-loop semi-supervised learning. Bioinformatics 2020; 35:5199-5206. [PMID: 31141124 DOI: 10.1093/bioinformatics/btz420] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2018] [Revised: 04/24/2019] [Accepted: 05/26/2019] [Indexed: 01/09/2023] Open
Abstract
MOTIVATION Discrete stochastic models of gene regulatory network models are indispensable tools for biological inquiry since they allow the modeler to predict how molecular interactions give rise to nonlinear system output. Model exploration with the objective of generating qualitative hypotheses about the workings of a pathway is usually the first step in the modeling process. It involves simulating the gene network model under a very large range of conditions, due to the large uncertainty in interactions and kinetic parameters. This makes model exploration highly computational demanding. Furthermore, with no prior information about the model behavior, labor-intensive manual inspection of very large amounts of simulation results becomes necessary. This limits systematic computational exploration to simplistic models. RESULTS We have developed an interactive, smart workflow for model exploration based on semi-supervised learning and human-in-the-loop labeling of data. The workflow lets a modeler rapidly discover ranges of interesting behaviors predicted by the model. Utilizing that similar simulation output is in proximity of each other in a feature space, the modeler can focus on informing the system about what behaviors are more interesting than others by labeling, rather than analyzing simulation results with custom scripts and workflows. This results in a large reduction in time-consuming manual work by the modeler early in a modeling project, which can substantially reduce the time needed to go from an initial model to testable predictions and downstream analysis. AVAILABILITY AND IMPLEMENTATION A python-package is available at https://github.com/Wrede/mio.git. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Fredrik Wrede
- Department of Information Technology, Uppsala University, Uppsala SE-75105, Sweden
| | - Andreas Hellander
- Department of Information Technology, Uppsala University, Uppsala SE-75105, Sweden
| |
Collapse
|
23
|
Valderrama-Gómez MÁ, Lomnitz JG, Fasani RA, Savageau MA. Mechanistic Modeling of Biochemical Systems without A Priori Parameter Values Using the Design Space Toolbox v.3.0. iScience 2020; 23:101200. [PMID: 32531747 PMCID: PMC7287267 DOI: 10.1016/j.isci.2020.101200] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2020] [Revised: 05/04/2020] [Accepted: 05/21/2020] [Indexed: 01/24/2023] Open
Abstract
Mechanistic models of biochemical systems provide a rigorous description of biological phenomena. They are indispensable for making predictions and elucidating biological design principles. To date, mathematical analysis and characterization of these models encounter a bottleneck consisting of large numbers of unknown parameter values. Here, we introduce the Design Space Toolbox v.3.0 (DST3), a software implementation of the Design Space formalism enabling mechanistic modeling without requiring previous knowledge of parameter values. This is achieved by using a phenotype-centric modeling approach, in which the system is first decomposed into a series of biochemical phenotypes. Parameter values realizing phenotypes of interest are subsequently predicted. DST3 represents the most generally applicable implementation of the Design Space formalism and offers unique advantages over earlier versions. By expanding the Design Space formalism and streamlining its distribution, DST3 represents a valuable tool for elucidating biological design principles and designing novel synthetic circuits. DST3 extends the Design Space formalism by identifying additional phenotypes Additional phenotypes arise from cycles, conservations, and metabolic imbalances DST3 enables mechanistic modeling without previous knowledge of parameter values It fully unlocks the potential of the novel phenotype-centric modeling strategy
Collapse
Affiliation(s)
| | | | | | - Michael A Savageau
- Department of Microbiology & Molecular Genetics, University of California, Davis, CA 95616, USA; Department of Biomedical Engineering, University of California, Davis, CA 95616, USA.
| |
Collapse
|
24
|
Valderrama-Gómez MÁ, Schomer RA, Savageau MA, Parales RE. TaxisPy: A Python-based software for the quantitative analysis of bacterial chemotaxis. J Microbiol Methods 2020; 175:105918. [PMID: 32512119 DOI: 10.1016/j.mimet.2020.105918] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2020] [Revised: 04/09/2020] [Accepted: 04/10/2020] [Indexed: 10/24/2022]
Abstract
Several species of bacteria are able to modify their swimming behavior in response to chemical attractants or repellents. Methods for the quantitative analysis of bacterial chemotaxis such as quantitative capillary assays are tedious and time-consuming. Computer-based video analysis of swimming bacteria represents a valuable method to directly assess their chemotactic response. Even though multiple studies have used this approach to elucidate various aspects of bacterial chemotaxis, to date, no computer software for such analyses is freely available. Here, we introduce TaxisPy, a Python-based software for the quantitative analysis of bacterial chemotaxis. The software comes with an intuitive graphical user interface and can be accessed easily through Docker on any operating system. Using a video of freely swimming cells as input, TaxisPy estimates the culture's average tumbling frequency over time. We demonstrate the utility of the software by assessing the effect of different concentrations of the attractant shikimate on the swimming behavior of Pseudomonas putida F1 and by capturing the adaptation process that Escherichia coli undergoes after being exposed to l-aspartate.
Collapse
Affiliation(s)
- Miguel Á Valderrama-Gómez
- Department of Microbiology & Molecular Genetics, College of Biological Sciences, University of California, Davis, USA.
| | - Rebecca A Schomer
- Department of Microbiology & Molecular Genetics, College of Biological Sciences, University of California, Davis, USA
| | - Michael A Savageau
- Department of Microbiology & Molecular Genetics, College of Biological Sciences, University of California, Davis, USA; Department of Biomedical Engineering, University of California, Davis, USA
| | - Rebecca E Parales
- Department of Microbiology & Molecular Genetics, College of Biological Sciences, University of California, Davis, USA
| |
Collapse
|
25
|
Hellander S, Hellander A. Hierarchical algorithm for the reaction-diffusion master equation. J Chem Phys 2020; 152:034104. [PMID: 31968960 PMCID: PMC6964990 DOI: 10.1063/1.5095075] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Abstract
We have developed an algorithm coupling mesoscopic simulations on different levels in a hierarchy of Cartesian meshes. Based on the multiscale nature of the chemical reactions, some molecules in the system will live on a fine-grained mesh, while others live on a coarse-grained mesh. By allowing molecules to transfer from the fine levels to the coarse levels when appropriate, we show that we can save up to three orders of magnitude of computational time compared to microscopic simulations or highly resolved mesoscopic simulations, without losing significant accuracy. We demonstrate this in several numerical examples with systems that cannot be accurately simulated with a coarse-grained mesoscopic model.
Collapse
Affiliation(s)
- Stefan Hellander
- Department of Information Technology, Uppsala University, Box 337, SE-755 01 Uppsala, Sweden
| | - Andreas Hellander
- Department of Information Technology, Uppsala University, Box 337, SE-755 01 Uppsala, Sweden
| |
Collapse
|
26
|
Cornell SJ, Suprunenko YF, Finkelshtein D, Somervuo P, Ovaskainen O. A unified framework for analysis of individual-based models in ecology and beyond. Nat Commun 2019; 10:4716. [PMID: 31624268 PMCID: PMC6797757 DOI: 10.1038/s41467-019-12172-y] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2017] [Accepted: 08/22/2019] [Indexed: 11/20/2022] Open
Abstract
Individual-based models, ‘IBMs’, describe naturally the dynamics of interacting organisms or social or financial agents. They are considered too complex for mathematical analysis, but computer simulations of them cannot give the general insights required. Here, we resolve this problem with a general mathematical framework for IBMs containing interactions of an unlimited level of complexity, and derive equations that reliably approximate the effects of space and stochasticity. We provide software, specified in an accessible and intuitive graphical way, so any researcher can obtain analytical and simulation results for any particular IBM without algebraic manipulation. We illustrate the framework with examples from movement ecology, conservation biology, and evolutionary ecology. This framework will provide unprecedented insights into a hitherto intractable panoply of complex models across many scientific fields. Individual-based models are widely used to simulate complex systems of interacting agents. Here the authors provide a mathematical framework that automates the analysis of any model in a wide class, facilitating a deeper understanding of the scientific questions these models are used to address.
Collapse
Affiliation(s)
- Stephen J Cornell
- Institute of Integrative Biology, University of Liverpool, Liverpool, L69 7ZB, UK.
| | - Yevhen F Suprunenko
- Institute of Integrative Biology, University of Liverpool, Liverpool, L69 7ZB, UK.,Department of Plant Sciences, University of Cambridge, Downing Street, Cambridge, CB2 3EA, UK
| | - Dmitri Finkelshtein
- Department of Mathematics, Swansea University, Fabian Way, Swansea, SA1 8EN, UK
| | - Panu Somervuo
- Organismal and Evolutionary Biology Research Programme, University of Helsinki, Helsinki, P.O. Box 65, FI-00014, Finland
| | - Otso Ovaskainen
- Organismal and Evolutionary Biology Research Programme, University of Helsinki, Helsinki, P.O. Box 65, FI-00014, Finland.,Centre for Biodiversity Dynamics, Department of Biology, Norwegian University of Science and Technology, N-7491, Trondheim, Norway
| |
Collapse
|
27
|
Multiscale Modelling Tool: Mathematical modelling of collective behaviour without the maths. PLoS One 2019; 14:e0222906. [PMID: 31568526 PMCID: PMC6768458 DOI: 10.1371/journal.pone.0222906] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2019] [Accepted: 09/10/2019] [Indexed: 12/28/2022] Open
Abstract
Collective behaviour is of fundamental importance in the life sciences, where it appears at levels of biological complexity from single cells to superorganisms, in demography and the social sciences, where it describes the behaviour of populations, and in the physical and engineering sciences, where it describes physical phenomena and can be used to design distributed systems. Reasoning about collective behaviour is inherently difficult, as the non-linear interactions between individuals give rise to complex emergent dynamics. Mathematical techniques have been developed to analyse systematically collective behaviour in such systems, yet these frequently require extensive formal training and technical ability to apply. Even for those with the requisite training and ability, analysis using these techniques can be laborious, time-consuming and error-prone. Together these difficulties raise a barrier-to-entry for practitioners wishing to analyse models of collective behaviour. However, rigorous modelling of collective behaviour is required to make progress in understanding and applying it. Here we present an accessible tool which aims to automate the process of modelling and analysing collective behaviour, as far as possible. We focus our attention on the general class of systems described by reaction kinetics, involving interactions between components that change state as a result, as these are easily understood and extracted from data by natural, physical and social scientists, and correspond to algorithms for component-level controllers in engineering applications. By providing simple automated access to advanced mathematical techniques from statistical physics, nonlinear dynamical systems analysis, and computational simulation, we hope to advance standards in modelling collective behaviour. At the same time, by providing expert users with access to the results of automated analyses, sophisticated investigations that could take significant effort are substantially facilitated. Our tool can be accessed online without installing software, uses a simple programmatic interface, and provides interactive graphical plots for users to develop understanding of their models.
Collapse
|
28
|
Lötstedt P. The Linear Noise Approximation for Spatially Dependent Biochemical Networks. Bull Math Biol 2019; 81:2873-2901. [PMID: 29644520 PMCID: PMC6677697 DOI: 10.1007/s11538-018-0428-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2018] [Accepted: 03/29/2018] [Indexed: 10/26/2022]
Abstract
An algorithm for computing the linear noise approximation (LNA) of the reaction-diffusion master equation (RDME) is developed and tested. The RDME is often used as a model for biochemical reaction networks. The LNA is derived for a general discretization of the spatial domain of the problem. If M is the number of chemical species in the network and N is the number of nodes in the discretization in space, then the computational work to determine approximations of the mean and the covariances of the probability distributions is proportional to [Formula: see text] in a straightforward implementation. In our LNA algorithm, the work is proportional to [Formula: see text]. Since N usually is larger than M, this is a significant reduction. The accuracy of the approximation in the algorithm is estimated analytically and evaluated in numerical experiments.
Collapse
Affiliation(s)
- Per Lötstedt
- Division of Scientific Computing, Department of Information Technology, Uppsala University, SE-75105, Uppsala, Sweden.
| |
Collapse
|
29
|
Morton EL, Forst CV, Zheng Y, DePaula-Silva AB, Ramirez NGP, Planelles V, D'Orso I. Transcriptional Circuit Fragility Influences HIV Proviral Fate. Cell Rep 2019; 27:154-171.e9. [PMID: 30943398 PMCID: PMC6461408 DOI: 10.1016/j.celrep.2019.03.007] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2018] [Revised: 11/14/2018] [Accepted: 02/28/2019] [Indexed: 01/12/2023] Open
Abstract
Transcriptional circuit architectures in several organisms have been evolutionarily selected to dictate precise given responses. Unlike these cellular systems, HIV is regulated through a complex circuit composed of two successive phases (host and viral), which create a positive feedback loop facilitating viral replication. However, it has long remained unclear whether both phases operate identically and to what extent the host phase influences the entire circuit. Here, we report that, although the host phase is regulated by a checkpoint whereby KAP1 mediates transcription activation, the virus evolved a minimalist system bypassing KAP1. Given the complex circuit's architecture, cell-to-cell KAP1 fluctuations impart heterogeneity in the host transcriptional responses, thus affecting the feedback loop. Mathematical modeling of a complete circuit reveals how these oscillations ultimately influence homogeneous reactivation potential of a latent virus. Thus, although HIV drives molecular innovation to fuel robust gene activation, it experiences transcriptional fragility, thereby influencing viral fate and cure efforts.
Collapse
Affiliation(s)
- Emily L Morton
- Department of Microbiology, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - Christian V Forst
- Department of Genetics and Genomic Sciences, Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Yue Zheng
- Division of Microbiology and Immunology, Department of Pathology, University of Utah School of Medicine, Salt Lake City, UT 84112, USA
| | - Ana B DePaula-Silva
- Division of Microbiology and Immunology, Department of Pathology, University of Utah School of Medicine, Salt Lake City, UT 84112, USA
| | - Nora-Guadalupe P Ramirez
- Department of Microbiology, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - Vicente Planelles
- Division of Microbiology and Immunology, Department of Pathology, University of Utah School of Medicine, Salt Lake City, UT 84112, USA
| | - Iván D'Orso
- Department of Microbiology, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA.
| |
Collapse
|
30
|
Sokolowski TR, Paijmans J, Bossen L, Miedema T, Wehrens M, Becker NB, Kaizu K, Takahashi K, Dogterom M, Ten Wolde PR. eGFRD in all dimensions. J Chem Phys 2019; 150:054108. [PMID: 30736681 DOI: 10.1063/1.5064867] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Biochemical reactions often occur at low copy numbers but at once in crowded and diverse environments. Space and stochasticity therefore play an essential role in biochemical networks. Spatial-stochastic simulations have become a prominent tool for understanding how stochasticity at the microscopic level influences the macroscopic behavior of such systems. While particle-based models guarantee the level of detail necessary to accurately describe the microscopic dynamics at very low copy numbers, the algorithms used to simulate them typically imply trade-offs between computational efficiency and biochemical accuracy. eGFRD (enhanced Green's Function Reaction Dynamics) is an exact algorithm that evades such trade-offs by partitioning the N-particle system into M ≤ N analytically tractable one- and two-particle systems; the analytical solutions (Green's functions) then are used to implement an event-driven particle-based scheme that allows particles to make large jumps in time and space while retaining access to their state variables at arbitrary simulation times. Here we present "eGFRD2," a new eGFRD version that implements the principle of eGFRD in all dimensions, thus enabling efficient particle-based simulation of biochemical reaction-diffusion processes in the 3D cytoplasm, on 2D planes representing membranes, and on 1D elongated cylinders representative of, e.g., cytoskeletal tracks or DNA; in 1D, it also incorporates convective motion used to model active transport. We find that, for low particle densities, eGFRD2 is up to 6 orders of magnitude faster than conventional Brownian dynamics. We exemplify the capabilities of eGFRD2 by simulating an idealized model of Pom1 gradient formation, which involves 3D diffusion, active transport on microtubules, and autophosphorylation on the membrane, confirming recent experimental and theoretical results on this system to hold under genuinely stochastic conditions.
Collapse
Affiliation(s)
| | - Joris Paijmans
- FOM Institute AMOLF, Science Park 104, 1098 XG Amsterdam, The Netherlands
| | - Laurens Bossen
- FOM Institute AMOLF, Science Park 104, 1098 XG Amsterdam, The Netherlands
| | - Thomas Miedema
- FOM Institute AMOLF, Science Park 104, 1098 XG Amsterdam, The Netherlands
| | - Martijn Wehrens
- FOM Institute AMOLF, Science Park 104, 1098 XG Amsterdam, The Netherlands
| | - Nils B Becker
- FOM Institute AMOLF, Science Park 104, 1098 XG Amsterdam, The Netherlands
| | - Kazunari Kaizu
- Center for Biosystems Dynamics Research (BDR), RIKEN, 6-2-3 Furuedai, Suita, Osaka 565-0874, Japan
| | - Koichi Takahashi
- Center for Biosystems Dynamics Research (BDR), RIKEN, 6-2-3 Furuedai, Suita, Osaka 565-0874, Japan
| | - Marileen Dogterom
- FOM Institute AMOLF, Science Park 104, 1098 XG Amsterdam, The Netherlands
| | | |
Collapse
|
31
|
Brian D, Bruno J, Zhen L, Tau-Mu Y, Linda P. A hybrid smoothed dissipative particle dynamics (SDPD) spatial stochastic simulation algorithm (sSSA) for advection-diffusion-reaction problems. JOURNAL OF COMPUTATIONAL PHYSICS 2019; 378:1-17. [PMID: 31031417 PMCID: PMC6481948 DOI: 10.1016/j.jcp.2018.10.043] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
We have developed a new algorithm which merges discrete stochastic simulation, using the spatial stochastic simulation algorithm (sSSA), with the particle based fluid dynamics simulation framework of smoothed dissipative particle dynamics (SDPD). This hybrid algorithm enables discrete stochastic simulation of spatially resolved chemically reacting systems on a mesh-free dynamic domain with a Lagrangian frame of reference. SDPD combines two popular mesoscopic techniques: smoothed particle hydrodynamics and dissipative particle dynamics (DPD), linking the macroscopic and mesoscopic hydrodynamics effects of these two methods. We have implemented discrete stochastic simulation using the reaction-diffusion master equations (RDME) formalism, and deterministic reaction-diffusion equations based on the SDPD method. We validate the new method by comparing our results to four canonical models, and demonstrate the versatility of our method by simulating a flow containing a chemical gradient past a yeast cell in a microfluidics chamber.
Collapse
Affiliation(s)
- Drawert Brian
- Department of Computer Science, University of North Carolina at Asheville, Asheville, North Carolina, 28804, USA
| | - Jacob Bruno
- Department of Mechanical Engineering, University of California-Santa Barbara, Santa Barbara, California, 93106, USA
| | - Li Zhen
- Division of Applied Mathematics, Brown University, Providence, Rhode Island 02912, USA
| | - Yi Tau-Mu
- Department of Molecular, Cellular, and Developmental Biology, University of California-Santa Barbara, Santa Barbara, California 93106, USA
| | - Petzold Linda
- Department of Mechanical Engineering, University of California-Santa Barbara, Santa Barbara, California, 93106, USA
- Department of Computer Science, University of California-Santa Barbara, Santa Barbara, California, 93106, USA
| |
Collapse
|
32
|
Tapia JJ, Saglam AS, Czech J, Kuczewski R, Bartol TM, Sejnowski TJ, Faeder JR. MCell-R: A Particle-Resolution Network-Free Spatial Modeling Framework. Methods Mol Biol 2019; 1945:203-229. [PMID: 30945248 PMCID: PMC6580425 DOI: 10.1007/978-1-4939-9102-0_9] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
Spatial heterogeneity can have dramatic effects on the biochemical networks that drive cell regulation and decision-making. For this reason, a number of methods have been developed to model spatial heterogeneity and incorporated into widely used modeling platforms. Unfortunately, the standard approaches for specifying and simulating chemical reaction networks become untenable when dealing with multistate, multicomponent systems that are characterized by combinatorial complexity. To address this issue, we developed MCell-R, a framework that extends the particle-based spatial Monte Carlo simulator, MCell, with the rule-based model specification and simulation capabilities provided by BioNetGen and NFsim. The BioNetGen syntax enables the specification of biomolecules as structured objects whose components can have different internal states that represent such features as covalent modification and conformation and which can bind components of other molecules to form molecular complexes. The network-free simulation algorithm used by NFsim enables efficient simulation of rule-based models even when the size of the network implied by the biochemical rules is too large to enumerate explicitly, which frequently occurs in detailed models of biochemical signaling. The result is a framework that can efficiently simulate systems characterized by combinatorial complexity at the level of spatially resolved individual molecules over biologically relevant time and length scales.
Collapse
Affiliation(s)
- Jose-Juan Tapia
- Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA, USA
| | - Ali Sinan Saglam
- Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA, USA
| | - Jacob Czech
- Pittsburgh Supercomputing Center, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Robert Kuczewski
- Howard Hughes Medical Institute, The Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Thomas M. Bartol
- Howard Hughes Medical Institute, The Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Terrence J. Sejnowski
- Howard Hughes Medical Institute, The Salk Institute for Biological Studies, La Jolla, CA, USA
| | - James R. Faeder
- Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA, USA
| |
Collapse
|
33
|
Landeros A, Stutz T, Keys KL, Alekseyenko A, Sinsheimer JS, Lange K, Sehl ME. BioSimulator.jl: Stochastic simulation in Julia. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2018; 167:23-35. [PMID: 30501857 PMCID: PMC6388686 DOI: 10.1016/j.cmpb.2018.09.009] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/20/2018] [Revised: 09/11/2018] [Accepted: 09/26/2018] [Indexed: 06/09/2023]
Abstract
BACKGROUND AND OBJECTIVES Biological systems with intertwined feedback loops pose a challenge to mathematical modeling efforts. Moreover, rare events, such as mutation and extinction, complicate system dynamics. Stochastic simulation algorithms are useful in generating time-evolution trajectories for these systems because they can adequately capture the influence of random fluctuations and quantify rare events. We present a simple and flexible package, BioSimulator.jl, for implementing the Gillespie algorithm, τ-leaping, and related stochastic simulation algorithms. The objective of this work is to provide scientists across domains with fast, user-friendly simulation tools. METHODS We used the high-performance programming language Julia because of its emphasis on scientific computing. Our software package implements a suite of stochastic simulation algorithms based on Markov chain theory. We provide the ability to (a) diagram Petri Nets describing interactions, (b) plot average trajectories and attached standard deviations of each participating species over time, and (c) generate frequency distributions of each species at a specified time. RESULTS BioSimulator.jl's interface allows users to build models programmatically within Julia. A model is then passed to the simulate routine to generate simulation data. The built-in tools allow one to visualize results and compute summary statistics. Our examples highlight the broad applicability of our software to systems of varying complexity from ecology, systems biology, chemistry, and genetics. CONCLUSION The user-friendly nature of BioSimulator.jl encourages the use of stochastic simulation, minimizes tedious programming efforts, and reduces errors during model specification.
Collapse
Affiliation(s)
- Alfonso Landeros
- Department of Biomathematics, David Geffen School of Medicine at UCLA, USA.
| | - Timothy Stutz
- Department of Biomathematics, David Geffen School of Medicine at UCLA, USA.
| | - Kevin L Keys
- Department of Medicine, University of California, San Francisco, CA, USA.
| | | | - Janet S Sinsheimer
- Department of Human Genetics, David Geffen School of Medicine at UCLA, USA.
| | - Kenneth Lange
- Department of Biomathematics, David Geffen School of Medicine at UCLA, USA.
| | - Mary E Sehl
- Department of Biomathematics, David Geffen School of Medicine at UCLA, USA.
| |
Collapse
|
34
|
Medley JK, Choi K, König M, Smith L, Gu S, Hellerstein J, Sealfon SC, Sauro HM. Tellurium notebooks-An environment for reproducible dynamical modeling in systems biology. PLoS Comput Biol 2018; 14:e1006220. [PMID: 29906293 PMCID: PMC6021116 DOI: 10.1371/journal.pcbi.1006220] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2017] [Revised: 06/27/2018] [Accepted: 05/20/2018] [Indexed: 01/26/2023] Open
Abstract
The considerable difficulty encountered in reproducing the results of published dynamical models limits validation, exploration and reuse of this increasingly large biomedical research resource. To address this problem, we have developed Tellurium Notebook, a software system for model authoring, simulation, and teaching that facilitates building reproducible dynamical models and reusing models by 1) providing a notebook environment which allows models, Python code, and narrative to be intermixed, 2) supporting the COMBINE archive format during model development for capturing model information in an exchangeable format and 3) enabling users to easily simulate and edit public COMBINE-compliant models from public repositories to facilitate studying model dynamics, variants and test cases. Tellurium Notebook, a Python–based Jupyter–like environment, is designed to seamlessly inter-operate with these community standards by automating conversion between COMBINE standards formulations and corresponding in–line, human–readable representations. Thus, Tellurium brings to systems biology the strategy used by other literate notebook systems such as Mathematica. These capabilities allow users to edit every aspect of the standards–compliant models and simulations, run the simulations in–line, and re–export to standard formats. We provide several use cases illustrating the advantages of our approach and how it allows development and reuse of models without requiring technical knowledge of standards. Adoption of Tellurium should accelerate model development, reproducibility and reuse. There is considerable value to systems and synthetic biology in creating reproducible models. An essential element of reproducibility is the use of community standards, an often challenging undertaking for modelers. This article describes Tellurium Notebook, a tool for developing dynamical models that provides an intuitive approach to building and reusing models built with community standards. Tellurium automates embedding human–readable representations of COMBINE archives in literate coding notebooks, bringing to systems biology this strategy central to other literate notebook systems such as Mathematica. We show that the ability to easily edit this human–readable representation enables users to test models under a variety of conditions, thereby providing a way to create, reuse, and modify standard–encoded models and simulations, regardless of the user’s level of technical knowledge of said standards.
Collapse
Affiliation(s)
- J. Kyle Medley
- Department of Bioengineering, University of Washington, Seattle, Washington, United States of America
- * E-mail:
| | - Kiri Choi
- Department of Bioengineering, University of Washington, Seattle, Washington, United States of America
| | - Matthias König
- Institute for Theoretical Biology, Humboldt University of Berlin, Berlin, Germany
| | - Lucian Smith
- Department of Bioengineering, University of Washington, Seattle, Washington, United States of America
| | - Stanley Gu
- Department of Bioengineering, University of Washington, Seattle, Washington, United States of America
| | - Joseph Hellerstein
- eScience Institute, University of Washington, Seattle, Washington, United States of America
| | - Stuart C. Sealfon
- Department of Neurology and Center for Advanced Research on Diagnostic Assays Icahn School of Medicine at Mount Sinai, New York, New York, United States of America
| | - Herbert M. Sauro
- Department of Bioengineering, University of Washington, Seattle, Washington, United States of America
| |
Collapse
|
35
|
Pischel D, Sundmacher K, Flassig RJ. Efficient simulation of intrinsic, extrinsic and external noise in biochemical systems. Bioinformatics 2018; 33:i319-i324. [PMID: 28881987 PMCID: PMC5870780 DOI: 10.1093/bioinformatics/btx253] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Motivation Biological cells operate in a noisy regime influenced by intrinsic, extrinsic and external noise, which leads to large differences of individual cell states. Stochastic effects must be taken into account to characterize biochemical kinetics accurately. Since the exact solution of the chemical master equation, which governs the underlying stochastic process, cannot be derived for most biochemical systems, approximate methods are used to obtain a solution. Results In this study, a method to efficiently simulate the various sources of noise simultaneously is proposed and benchmarked on several examples. The method relies on the combination of the sigma point approach to describe extrinsic and external variability and the τ-leaping algorithm to account for the stochasticity due to probabilistic reactions. The comparison of our method to extensive Monte Carlo calculations demonstrates an immense computational advantage while losing an acceptable amount of accuracy. Additionally, the application to parameter optimization problems in stochastic biochemical reaction networks is shown, which is rarely applied due to its huge computational burden. To give further insight, a MATLAB script is provided including the proposed method applied to a simple toy example of gene expression. Availability and implementation MATLAB code is available at Bioinformatics online. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Dennis Pischel
- Otto-von-Guericke-University Magdeburg, Process Systems Engineering, Magdeburg, Germany
| | - Kai Sundmacher
- Otto-von-Guericke-University Magdeburg, Process Systems Engineering, Magdeburg, Germany.,Max Planck Institute for Dynamics of Complex Technical Systems, Process Systems Engineering, Magdeburg, Germany
| | - Robert J Flassig
- Max Planck Institute for Dynamics of Complex Technical Systems, Process Systems Engineering, Magdeburg, Germany
| |
Collapse
|
36
|
Hellander S, Hellander A, Petzold L. Mesoscopic-microscopic spatial stochastic simulation with automatic system partitioning. J Chem Phys 2017; 147:234101. [PMID: 29272930 PMCID: PMC5732015 DOI: 10.1063/1.5002773] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2017] [Accepted: 11/27/2017] [Indexed: 01/02/2023] Open
Abstract
The reaction-diffusion master equation (RDME) is a model that allows for efficient on-lattice simulation of spatially resolved stochastic chemical kinetics. Compared to off-lattice hard-sphere simulations with Brownian dynamics or Green's function reaction dynamics, the RDME can be orders of magnitude faster if the lattice spacing can be chosen coarse enough. However, strongly diffusion-controlled reactions mandate a very fine mesh resolution for acceptable accuracy. It is common that reactions in the same model differ in their degree of diffusion control and therefore require different degrees of mesh resolution. This renders mesoscopic simulation inefficient for systems with multiscale properties. Mesoscopic-microscopic hybrid methods address this problem by resolving the most challenging reactions with a microscale, off-lattice simulation. However, all methods to date require manual partitioning of a system, effectively limiting their usefulness as "black-box" simulation codes. In this paper, we propose a hybrid simulation algorithm with automatic system partitioning based on indirect a priori error estimates. We demonstrate the accuracy and efficiency of the method on models of diffusion-controlled networks in 3D.
Collapse
Affiliation(s)
- Stefan Hellander
- Department of Information Technology, Uppsala University, P.O.Box 337, SE-75105 Uppsala, Sweden
| | - Andreas Hellander
- Department of Information Technology, Uppsala University, P.O.Box 337, SE-75105 Uppsala, Sweden
| | - Linda Petzold
- Department of Computer Science, University of California, Santa Barbara, California 93106-5070, USA
| |
Collapse
|