1
|
Nguyen T, Narayanareddy BRJ, Gross SP, Miles CE. Competition between physical search and a weak-to-strong transition rate-limits kinesin binding times. PLoS Comput Biol 2024; 20:e1012158. [PMID: 38768214 PMCID: PMC11142708 DOI: 10.1371/journal.pcbi.1012158] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Revised: 05/31/2024] [Accepted: 05/10/2024] [Indexed: 05/22/2024] Open
Abstract
The self-organization of cells relies on the profound complexity of protein-protein interactions. Challenges in directly observing these events have hindered progress toward understanding their diverse behaviors. One notable example is the interaction between molecular motors and cytoskeletal systems that combine to perform a variety of cellular functions. In this work, we leverage theory and experiments to identify and quantify the rate-limiting mechanism of the initial association between a cargo-bound kinesin motor and a microtubule track. Recent advances in optical tweezers provide binding times for several lengths of kinesin motors trapped at varying distances from a microtubule, empowering the investigation of competing models. We first explore a diffusion-limited model of binding. Through Brownian dynamics simulations and simulation-based inference, we find this simple diffusion model fails to explain the experimental binding times, but an extended model that accounts for the ADP state of the molecular motor agrees closely with the data, even under the scrutiny of penalizing for additional model complexity. We provide quantification of both kinetic rates and biophysical parameters underlying the proposed binding process. Our model suggests that a typical binding event is limited by ADP state rather than physical search. Lastly, we predict how these association rates can be modulated in distinct ways through variation of environmental concentrations and physical properties.
Collapse
Affiliation(s)
- Trini Nguyen
- Center for Complex Biological Systems, University of California, Irvine, Irvine, California, United States of America
| | | | - Steven P. Gross
- Department of Developmental and Cell Biology, University of California, Irvine, Irvine, California, United States of America
- Department of Physics, University of California, Irvine, Irvine, California, United States of America
| | - Christopher E. Miles
- Department of Mathematics, University of California, Irvine, Irvine, California, United States of America
| |
Collapse
|
2
|
Anderson HG, Takacs GP, Harris DC, Kuang Y, Harrison JK, Stepien TL. Global stability and parameter analysis reinforce therapeutic targets of PD-L1-PD-1 and MDSCs for glioblastoma. J Math Biol 2023; 88:10. [PMID: 38099947 PMCID: PMC10724342 DOI: 10.1007/s00285-023-02027-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2022] [Revised: 08/30/2023] [Accepted: 11/05/2023] [Indexed: 12/18/2023]
Abstract
Glioblastoma (GBM) is an aggressive primary brain cancer that currently has minimally effective treatments. Like other cancers, immunosuppression by the PD-L1-PD-1 immune checkpoint complex is a prominent axis by which glioma cells evade the immune system. Myeloid-derived suppressor cells (MDSCs), which are recruited to the glioma microenviroment, also contribute to the immunosuppressed GBM microenvironment by suppressing T cell functions. In this paper, we propose a GBM-specific tumor-immune ordinary differential equations model of glioma cells, T cells, and MDSCs to provide theoretical insights into the interactions between these cells. Equilibrium and stability analysis indicates that there are unique tumorous and tumor-free equilibria which are locally stable under certain conditions. Further, the tumor-free equilibrium is globally stable when T cell activation and the tumor kill rate by T cells overcome tumor growth, T cell inhibition by PD-L1-PD-1 and MDSCs, and the T cell death rate. Bifurcation analysis suggests that a treatment plan that includes surgical resection and therapeutics targeting immune suppression caused by the PD-L1-PD1 complex and MDSCs results in the system tending to the tumor-free equilibrium. Using a set of preclinical experimental data, we implement the approximate Bayesian computation (ABC) rejection method to construct probability density distributions that estimate model parameters. These distributions inform an appropriate search curve for global sensitivity analysis using the extended fourier amplitude sensitivity test. Sensitivity results combined with the ABC method suggest that parameter interaction is occurring between the drivers of tumor burden, which are the tumor growth rate and carrying capacity as well as the tumor kill rate by T cells, and the two modeled forms of immunosuppression, PD-L1-PD-1 immune checkpoint and MDSC suppression of T cells. Thus, treatment with an immune checkpoint inhibitor in combination with a therapeutic targeting the inhibitory mechanisms of MDSCs should be explored.
Collapse
Affiliation(s)
- Hannah G Anderson
- Department of Mathematics, University of Florida, Gainesville, FL, USA
| | - Gregory P Takacs
- Department of Pharmacology and Therapeutics, University of Florida, Gainesville, FL, USA
| | - Duane C Harris
- School of Mathematical and Statistical Sciences, Arizona State University, Tempe, AZ, USA
| | - Yang Kuang
- School of Mathematical and Statistical Sciences, Arizona State University, Tempe, AZ, USA
| | - Jeffrey K Harrison
- Department of Pharmacology and Therapeutics, University of Florida, Gainesville, FL, USA
| | - Tracy L Stepien
- Department of Mathematics, University of Florida, Gainesville, FL, USA.
| |
Collapse
|
3
|
Nguyen T, Narayanareddy BJ, Gross SP, Miles CE. ADP release can explain spatially-dependent kinesin binding times. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.08.563482. [PMID: 37986962 PMCID: PMC10659338 DOI: 10.1101/2023.11.08.563482] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/22/2023]
Abstract
The self-organization of cells relies on the profound complexity of protein-protein interactions. Challenges in directly observing these events have hindered progress toward understanding their diverse behaviors. One notable example is the interaction between molecular motors and cytoskeletal systems that combine to perform a variety of cellular functions. In this work, we leverage theory and experiments to identify and quantify the rate-limiting mechanism of the initial association between a cargo-bound kinesin motor and a microtubule track. Recent advances in optical tweezers provide binding times for several lengths of kinesin motors trapped at varying distances from a microtubule, empowering the investigation of competing models. We first explore a diffusion-limited model of binding. Through Brownian dynamics simulations and simulation-based inference, we find this simple diffusion model fails to explain the experimental binding times, but an extended model that accounts for the ADP state of the molecular motor agrees closely with the data, even under the scrutiny of penalizing for additional model complexity. We provide quantification of both kinetic rates and biophysical parameters underlying the proposed binding process. Our model suggests that most but not every motor binding event is limited by their ADP state. Lastly, we predict how these association rates can be modulated in distinct ways through variation of environmental concentrations and spatial distances.
Collapse
Affiliation(s)
- Trini Nguyen
- Center for Complex Biological Systems, University of California, Irvine, Irvine, CA 92697
| | | | - Steven P. Gross
- Center for Complex Biological Systems, University of California, Irvine, Irvine, CA 92697
- Department of Developmental and Cell Biology, University of California, Irvine, Irvine, CA 92697
- Department of Physics and Astronomy, University of California, Irvine, Irvine, CA 92697
- Department of Biomedical Engineering, University of California, Irvine, Irvine, CA 92697
| | - Christopher E. Miles
- Center for Complex Biological Systems, University of California, Irvine, Irvine, CA 92697
- Department of Mathematics, University of California, Irvine, Irvine, CA 92697
- Center for Multiscale Cell Fate, University of California, Irvine, Irvine, CA 92697
| |
Collapse
|
4
|
Carvalho J, Morales HE, Faria R, Butlin RK, Sousa VC. Integrating Pool-seq uncertainties into demographic inference. Mol Ecol Resour 2023; 23:1737-1755. [PMID: 37475177 DOI: 10.1111/1755-0998.13834] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2022] [Revised: 06/16/2023] [Accepted: 06/30/2023] [Indexed: 07/22/2023]
Abstract
Next-generation sequencing of pooled samples (Pool-seq) is a popular method to assess genome-wide diversity patterns in natural and experimental populations. However, Pool-seq is associated with specific sources of noise, such as unequal individual contributions. Consequently, using Pool-seq for the reconstruction of evolutionary history has remained underexplored. Here we describe a novel Approximate Bayesian Computation (ABC) method to infer demographic history, explicitly modelling Pool-seq sources of error. By jointly modelling Pool-seq data, demographic history and the effects of selection due to barrier loci, we obtain estimates of demographic history parameters accounting for technical errors associated with Pool-seq. Our ABC approach is computationally efficient as it relies on simulating subsets of loci (rather than the whole-genome) and on using relative summary statistics and relative model parameters. Our simulation study results indicate Pool-seq data allows distinction between general scenarios of ecotype formation (single versus parallel origin) and to infer relevant demographic parameters (e.g. effective sizes and split times). We exemplify the application of our method to Pool-seq data from the rocky-shore gastropod Littorina saxatilis, sampled on a narrow geographical scale at two Swedish locations where two ecotypes (Wave and Crab) are found. Our model choice and parameter estimates show that ecotypes formed before colonization of the two locations (i.e. single origin) and are maintained despite gene flow. These results indicate that demographic modelling and inference can be successful based on pool-sequencing using ABC, contributing to the development of suitable null models that allow for a better understanding of the genetic basis of divergent adaptation.
Collapse
Affiliation(s)
- João Carvalho
- cE3c - Centre for Ecology, Evolution and Environmental Changes & CHANGE - Global Change and Sustainability Institute, Departamento de Biologia Animal, Faculdade de Ciências, Universidade de Lisboa, Campo Grande, Portugal
| | - Hernán E Morales
- Section for Hologenomics, Globe Institute, University of Copenhagen, Copenhagen, Denmark
| | - Rui Faria
- CIBIO - Centro de Investigação em Biodiversidade e Recursos Genéticos, InBIO, Laboratório Associado, Universidade do Porto, Vairão, Portugal
- BIOPOLIS Program in Genomics, Biodiversity and Land Planning, CIBIO, Vairão, Portugal
| | - Roger K Butlin
- Ecology and Evolutionary Biology, School of Biosciences, University of Sheffield, Sheffield, UK
- Department of Marine Sciences, University of Gothenburg, Gothenburg, Sweden
| | - Vítor C Sousa
- cE3c - Centre for Ecology, Evolution and Environmental Changes & CHANGE - Global Change and Sustainability Institute, Departamento de Biologia Animal, Faculdade de Ciências, Universidade de Lisboa, Campo Grande, Portugal
| |
Collapse
|
5
|
Anderson HG, Takacs GP, Harris DC, Kuang Y, Harrison JK, Stepien TL. Global stability and parameter analysis reinforce therapeutic targets of PD-L1-PD-1 and MDSCs for glioblastoma. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.15.540846. [PMID: 37292799 PMCID: PMC10245580 DOI: 10.1101/2023.05.15.540846] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Glioblastoma (GBM) is an aggressive primary brain cancer that currently has minimally effective treatments. Like other cancers, immunosuppression by the PD-L1-PD-1 immune checkpoint complex is a prominent axis by which glioma cells evade the immune system. Myeloid-derived suppressor cells (MDSCs), which are recruited to the glioma microenviroment, also contribute to the immunosuppressed GBM microenvironment by suppressing T cell functions. In this paper, we propose a GBM-specific tumor-immune ordinary differential equations model of glioma cells, T cells, and MDSCs to provide theoretical insights into the interactions between these cells. Equilibrium and stability analysis indicates that there are unique tumorous and tumor-free equilibria which are locally stable under certain conditions. Further, the tumor-free equilibrium is globally stable when T cell activation and the tumor kill rate by T cells overcome tumor growth, T cell inhibition by PD-L1-PD-1 and MDSCs, and the T cell death rate. Bifurcation analysis suggests that a treatment plan that includes surgical resection and therapeutics targeting immune suppression caused by the PD-L1-PD1 complex and MDSCs results in the system tending to the tumor-free equilibrium. Using a set of preclinical experimental data, we implement the Approximate Bayesian Computation (ABC) rejection method to construct probability density distributions that estimate model parameters. These distributions inform an appropriate search curve for global sensitivity analysis using the extended Fourier Amplitude Sensitivity Test (eFAST). Sensitivity results combined with the ABC method suggest that parameter interaction is occurring between the drivers of tumor burden, which are the tumor growth rate and carrying capacity as well as the tumor kill rate by T cells, and the two modeled forms of immunosuppression, PD-L1-PD-1 immune checkpoint and MDSC suppression of T cells. Thus, treatment with an immune checkpoint inhibitor in combination with a therapeutic targeting the inhibitory mechanisms of MDSCs should be explored.
Collapse
|
6
|
Kim S, Abdulali A, Lee S. Heterogeneity is a key factor describing the initial outbreak of COVID-19. APPLIED MATHEMATICAL MODELLING 2023; 117:714-725. [PMID: 36643779 PMCID: PMC9827748 DOI: 10.1016/j.apm.2023.01.005] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/30/2022] [Revised: 11/11/2022] [Accepted: 01/03/2023] [Indexed: 06/17/2023]
Abstract
Assessing the transmission potential of emerging infectious diseases, such as COVID-19, is crucial for implementing prompt and effective intervention policies. The basic reproduction number is widely used to measure the severity of the early stages of disease outbreaks. The basic reproduction number of standard ordinary differential equation models is computed for homogeneous contact patterns; however, realistic contact patterns are far from homogeneous, specifically during the early stages of disease transmission. Heterogeneity of contact patterns can lead to superspreading events that show a significantly high level of heterogeneity in generating secondary infections. This is primarily due to the large variance in the contact patterns of complex human behaviours. Hence, in this work, we investigate the impacts of heterogeneity in contact patterns on the basic reproduction number by developing two distinct model frameworks: 1) an SEIR-Erlang ordinary differential equation model and 2) an SEIR stochastic agent-based model. Furthermore, we estimated the transmission probability of both models in the context of COVID-19 in South Korea. Our results highlighted the importance of heterogeneity in contact patterns and indicated that there should be more information than one quantity (the basic reproduction number as the mean quantity), such as a degree-specific basic reproduction number in the distributional sense when the contact pattern is highly heterogeneous.
Collapse
Affiliation(s)
- Sungchan Kim
- Department of Applied Mathematics, Kyung Hee University, Republic of Korea
| | - Arsen Abdulali
- Department of Engineering, University of Cambridge, United Kingdom
| | - Sunmi Lee
- Department of Applied Mathematics, Kyung Hee University, Republic of Korea
| |
Collapse
|
7
|
Roesch E, Greener JG, MacLean AL, Nassar H, Rackauckas C, Holy TE, Stumpf MPH. Julia for biologists. Nat Methods 2023; 20:655-664. [PMID: 37024649 PMCID: PMC10216852 DOI: 10.1038/s41592-023-01832-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2021] [Accepted: 02/27/2023] [Indexed: 04/08/2023]
Abstract
Major computational challenges exist in relation to the collection, curation, processing and analysis of large genomic and imaging datasets, as well as the simulation of larger and more realistic models in systems biology. Here we discuss how a relative newcomer among programming languages-Julia-is poised to meet the current and emerging demands in the computational biosciences and beyond. Speed, flexibility, a thriving package ecosystem and readability are major factors that make high-performance computing and data analysis available to an unprecedented degree. We highlight how Julia's design is already enabling new ways of analyzing biological data and systems, and we provide a list of resources that can facilitate the transition into Julian computing.
Collapse
Affiliation(s)
- Elisabeth Roesch
- School of Mathematics and Statistics, University of Melbourne, Melbourne, Victoria, Australia
- Melbourne Integrative Genomics, University of Melbourne, Melbourne, Victoria, Australia
- JuliaHub, Somerville, MA, USA
| | - Joe G Greener
- Medical Research Council Laboratory of Molecular Biology, Cambridge, UK
| | - Adam L MacLean
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | | | - Christopher Rackauckas
- JuliaHub, Somerville, MA, USA
- Department of Mathematics, Massachusetts Institute of Technology, Cambridge, MA, USA
- Pumas-AI, Centreville, VA, USA
| | - Timothy E Holy
- Departments of Neuroscience and Biomedical Engineering, Washington University in St. Louis, St. Louis, MO, USA
| | - Michael P H Stumpf
- School of Mathematics and Statistics, University of Melbourne, Melbourne, Victoria, Australia.
- Melbourne Integrative Genomics, University of Melbourne, Melbourne, Victoria, Australia.
- School of BioSciences, The University of Melbourne, Melbourne, Victoria, Australia.
- ARC Centre of Excellence for the Mathematical Analysis of Cellular Systems, Melbourne, Victoria, Australia.
| |
Collapse
|
8
|
Browning AP, Simpson MJ. Geometric analysis enables biological insight from complex non-identifiable models using simple surrogates. PLoS Comput Biol 2023; 19:e1010844. [PMID: 36662831 PMCID: PMC9891533 DOI: 10.1371/journal.pcbi.1010844] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2022] [Revised: 02/01/2023] [Accepted: 12/26/2022] [Indexed: 01/22/2023] Open
Abstract
An enduring challenge in computational biology is to balance data quality and quantity with model complexity. Tools such as identifiability analysis and information criterion have been developed to harmonise this juxtaposition, yet cannot always resolve the mismatch between available data and the granularity required in mathematical models to answer important biological questions. Often, it is only simple phenomenological models, such as the logistic and Gompertz growth models, that are identifiable from standard experimental measurements. To draw insights from complex, non-identifiable models that incorporate key biological mechanisms of interest, we study the geometry of a map in parameter space from the complex model to a simple, identifiable, surrogate model. By studying how non-identifiable parameters in the complex model quantitatively relate to identifiable parameters in surrogate, we introduce and exploit a layer of interpretation between the set of non-identifiable parameters and the goodness-of-fit metric or likelihood studied in typical identifiability analysis. We demonstrate our approach by analysing a hierarchy of mathematical models for multicellular tumour spheroid growth experiments. Typical data from tumour spheroid experiments are limited and noisy, and corresponding mathematical models are very often made arbitrarily complex. Our geometric approach is able to predict non-identifiabilities, classify non-identifiable parameter spaces into identifiable parameter combinations that relate to features in the data characterised by parameters in a surrogate model, and overall provide additional biological insight from complex non-identifiable models.
Collapse
Affiliation(s)
- Alexander P. Browning
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Australia
- QUT Centre for Data Science, Queensland University of Technology, Brisbane, Australia
- Mathematical Institute, University of Oxford, Oxford, United Kingdom
| | - Matthew J. Simpson
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Australia
- QUT Centre for Data Science, Queensland University of Technology, Brisbane, Australia
- * E-mail:
| |
Collapse
|
9
|
Vittadello ST, Stumpf MPH. Open problems in mathematical biology. Math Biosci 2022; 354:108926. [PMID: 36377100 DOI: 10.1016/j.mbs.2022.108926] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2022] [Revised: 10/21/2022] [Accepted: 10/21/2022] [Indexed: 11/06/2022]
Abstract
Biology is data-rich, and it is equally rich in concepts and hypotheses. Part of trying to understand biological processes and systems is therefore to confront our ideas and hypotheses with data using statistical methods to determine the extent to which our hypotheses agree with reality. But doing so in a systematic way is becoming increasingly challenging as our hypotheses become more detailed, and our data becomes more complex. Mathematical methods are therefore gaining in importance across the life- and biomedical sciences. Mathematical models allow us to test our understanding, make testable predictions about future behaviour, and gain insights into how we can control the behaviour of biological systems. It has been argued that mathematical methods can be of great benefit to biologists to make sense of data. But mathematics and mathematicians are set to benefit equally from considering the often bewildering complexity inherent to living systems. Here we present a small selection of open problems and challenges in mathematical biology. We have chosen these open problems because they are of both biological and mathematical interest.
Collapse
Affiliation(s)
- Sean T Vittadello
- Melbourne Integrative Genomics, University of Melbourne, Australia; School of BioSciences, University of Melbourne, Australia
| | - Michael P H Stumpf
- Melbourne Integrative Genomics, University of Melbourne, Australia; School of BioSciences, University of Melbourne, Australia; School of Mathematics and Statistics, University of Melbourne, Australia.
| |
Collapse
|
10
|
Treloar NJ, Braniff N, Ingalls B, Barnes CP. Deep reinforcement learning for optimal experimental design in biology. PLoS Comput Biol 2022; 18:e1010695. [PMID: 36409776 PMCID: PMC9721483 DOI: 10.1371/journal.pcbi.1010695] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2022] [Revised: 12/05/2022] [Accepted: 10/31/2022] [Indexed: 11/22/2022] Open
Abstract
The field of optimal experimental design uses mathematical techniques to determine experiments that are maximally informative from a given experimental setup. Here we apply a technique from artificial intelligence-reinforcement learning-to the optimal experimental design task of maximizing confidence in estimates of model parameter values. We show that a reinforcement learning approach performs favourably in comparison with a one-step ahead optimisation algorithm and a model predictive controller for the inference of bacterial growth parameters in a simulated chemostat. Further, we demonstrate the ability of reinforcement learning to train over a distribution of parameters, indicating that this approach is robust to parametric uncertainty.
Collapse
Affiliation(s)
- Neythen J. Treloar
- Department of Cell and Developmental Biology, University College London, London, United Kingdom
| | - Nathan Braniff
- Department of Applied Mathematics, University of Waterloo, Waterloo, Canada
| | - Brian Ingalls
- Department of Applied Mathematics, University of Waterloo, Waterloo, Canada
| | - Chris P. Barnes
- Department of Cell and Developmental Biology, University College London, London, United Kingdom
- UCL Genetics Institute, University College London, London, United Kingdom
| |
Collapse
|
11
|
Linden NJ, Kramer B, Rangamani P. Bayesian parameter estimation for dynamical models in systems biology. PLoS Comput Biol 2022; 18:e1010651. [PMID: 36269772 PMCID: PMC9629650 DOI: 10.1371/journal.pcbi.1010651] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2022] [Revised: 11/02/2022] [Accepted: 10/12/2022] [Indexed: 11/07/2022] Open
Abstract
Dynamical systems modeling, particularly via systems of ordinary differential equations, has been used to effectively capture the temporal behavior of different biochemical components in signal transduction networks. Despite the recent advances in experimental measurements, including sensor development and '-omics' studies that have helped populate protein-protein interaction networks in great detail, modeling in systems biology lacks systematic methods to estimate kinetic parameters and quantify associated uncertainties. This is because of multiple reasons, including sparse and noisy experimental measurements, lack of detailed molecular mechanisms underlying the reactions, and missing biochemical interactions. Additionally, the inherent nonlinearities with respect to the states and parameters associated with the system of differential equations further compound the challenges of parameter estimation. In this study, we propose a comprehensive framework for Bayesian parameter estimation and complete quantification of the effects of uncertainties in the data and models. We apply these methods to a series of signaling models of increasing mathematical complexity. Systematic analysis of these dynamical systems showed that parameter estimation depends on data sparsity, noise level, and model structure, including the existence of multiple steady states. These results highlight how focused uncertainty quantification can enrich systems biology modeling and enable additional quantitative analyses for parameter estimation.
Collapse
Affiliation(s)
- Nathaniel J. Linden
- Department of Mechanical and Aerospace Engineering, University of California San Diego, San Diego, California, United States of America
| | - Boris Kramer
- Department of Mechanical and Aerospace Engineering, University of California San Diego, San Diego, California, United States of America
- * E-mail: (BK); (PR)
| | - Padmini Rangamani
- Department of Mechanical and Aerospace Engineering, University of California San Diego, San Diego, California, United States of America
- * E-mail: (BK); (PR)
| |
Collapse
|
12
|
Solomatina A, Cezanne A, Kalaidzidis Y, Zerial M, Sbalzarini IF. Design centering enables robustness screening of pattern formation models. Bioinformatics 2022; 38:ii134-ii140. [PMID: 36124805 PMCID: PMC9486588 DOI: 10.1093/bioinformatics/btac480] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
MOTIVATION Access to unprecedented amounts of quantitative biological data allows us to build and test biochemically accurate reaction-diffusion models of intracellular processes. However, any increase in model complexity increases the number of unknown parameters and, thus, the computational cost of model analysis. To efficiently characterize the behavior and robustness of models with many unknown parameters remains, therefore, a key challenge in systems biology. RESULTS We propose a novel computational framework for efficient high-dimensional parameter space characterization of reaction-diffusion models in systems biology. The method leverages the Lp-Adaptation algorithm, an adaptive-proposal statistical method for approximate design centering and robustness estimation. Our approach is based on an oracle function, which predicts for any given point in parameter space whether the model fulfills given specifications. We propose specific oracles to efficiently predict four characteristics of Turing-type reaction-diffusion models: bistability, instability, capability of spontaneous pattern formation and capability of pattern maintenance. We benchmark the method and demonstrate that it enables global exploration of a model's ability to undergo pattern-forming instabilities and to quantify robustness for model selection in polynomial time with dimensionality. We present an application of the framework to pattern formation on the endosomal membrane by the small GTPase Rab5 and its effectors, and we propose molecular mechanisms underlying this system. AVAILABILITY AND IMPLEMENTATION Our code is implemented in MATLAB and is available as open source under https://git.mpi-cbg.de/mosaic/software/black-box-optimization/rd-parameter-space-screening. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Anastasia Solomatina
- Faculty of Computer Science, Technische Universität Dresden, Dresden D-01187, Germany,Max Planck Institute of Molecular Cell Biology and Genetics, Dresden D-01307, Germany,Center for Systems Biology Dresden, Dresden D-01307, Germany
| | - Alice Cezanne
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden D-01307, Germany
| | - Yannis Kalaidzidis
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden D-01307, Germany
| | - Marino Zerial
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden D-01307, Germany,Center for Systems Biology Dresden, Dresden D-01307, Germany,Cluster of Excellence Physics of Life, TU Dresden, Dresden D-01187, Germany
| | | |
Collapse
|
13
|
Cárdenas SD, Reznik CJ, Ranaweera R, Song F, Chung CH, Fertig EJ, Gevertz JL. Model-informed experimental design recommendations for distinguishing intrinsic and acquired targeted therapeutic resistance in head and neck cancer. NPJ Syst Biol Appl 2022; 8:32. [PMID: 36075912 PMCID: PMC9458753 DOI: 10.1038/s41540-022-00244-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2022] [Accepted: 08/05/2022] [Indexed: 11/09/2022] Open
Abstract
The promise of precision medicine has been limited by the pervasive resistance to many targeted therapies for cancer. Inferring the timing (i.e., pre-existing or acquired) and mechanism (i.e., drug-induced) of such resistance is crucial for designing effective new therapeutics. This paper studies cetuximab resistance in head and neck squamous cell carcinoma (HNSCC) using tumor volume data obtained from patient-derived tumor xenografts. We ask if resistance mechanisms can be determined from this data alone, and if not, what data would be needed to deduce the underlying mode(s) of resistance. To answer these questions, we propose a family of mathematical models, with each member of the family assuming a different timing and mechanism of resistance. We present a method for fitting these models to individual volumetric data, and utilize model selection and parameter sensitivity analyses to ask: which member(s) of the family of models best describes HNSCC response to cetuximab, and what does that tell us about the timing and mechanisms driving resistance? We find that along with time-course volumetric data to a single dose of cetuximab, the initial resistance fraction and, in some instances, dose escalation volumetric data are required to distinguish among the family of models and thereby infer the mechanisms of resistance. These findings can inform future experimental design so that we can best leverage the synergy of wet laboratory experimentation and mathematical modeling in the study of novel targeted cancer therapeutics.
Collapse
Affiliation(s)
- Santiago D Cárdenas
- Department of Mathematics and Statistics, The College of New Jersey, Ewing, NJ, USA
| | - Constance J Reznik
- Department of Mathematics and Statistics, The College of New Jersey, Ewing, NJ, USA.,Datacor, Inc., Florham Park, NJ, USA
| | - Ruchira Ranaweera
- Department of Head and Neck-Endocrine Oncology, Moffitt Cancer Center, Tampa, FL, USA
| | - Feifei Song
- Department of Head and Neck-Endocrine Oncology, Moffitt Cancer Center, Tampa, FL, USA
| | - Christine H Chung
- Department of Head and Neck-Endocrine Oncology, Moffitt Cancer Center, Tampa, FL, USA
| | - Elana J Fertig
- Convergence Institute, Department of Oncology, Department of Biomedical Engineering, Department of Applied Mathematics and Statistics, Johns Hopkins University, Baltimore, MD, USA.
| | - Jana L Gevertz
- Department of Mathematics and Statistics, The College of New Jersey, Ewing, NJ, USA.
| |
Collapse
|
14
|
Quantifying biochemical reaction rates from static population variability within incompletely observed complex networks. PLoS Comput Biol 2022; 18:e1010183. [PMID: 35731728 PMCID: PMC9216546 DOI: 10.1371/journal.pcbi.1010183] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2021] [Accepted: 05/07/2022] [Indexed: 11/19/2022] Open
Abstract
Quantifying biochemical reaction rates within complex cellular processes remains a key challenge of systems biology even as high-throughput single-cell data have become available to characterize snapshots of population variability. That is because complex systems with stochastic and non-linear interactions are difficult to analyze when not all components can be observed simultaneously and systems cannot be followed over time. Instead of using descriptive statistical models, we show that incompletely specified mechanistic models can be used to translate qualitative knowledge of interactions into reaction rate functions from covariability data between pairs of components. This promises to turn a globally intractable problem into a sequence of solvable inference problems to quantify complex interaction networks from incomplete snapshots of their stochastic fluctuations.
Collapse
|
15
|
Mester R, Landeros A, Rackauckas C, Lange K. Differential methods for assessing sensitivity in biological models. PLoS Comput Biol 2022; 18:e1009598. [PMID: 35696417 PMCID: PMC9232177 DOI: 10.1371/journal.pcbi.1009598] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2021] [Revised: 06/24/2022] [Accepted: 05/12/2022] [Indexed: 11/18/2022] Open
Abstract
Differential sensitivity analysis is indispensable in fitting parameters, understanding uncertainty, and forecasting the results of both thought and lab experiments. Although there are many methods currently available for performing differential sensitivity analysis of biological models, it can be difficult to determine which method is best suited for a particular model. In this paper, we explain a variety of differential sensitivity methods and assess their value in some typical biological models. First, we explain the mathematical basis for three numerical methods: adjoint sensitivity analysis, complex perturbation sensitivity analysis, and forward mode sensitivity analysis. We then carry out four instructive case studies. (a) The CARRGO model for tumor-immune interaction highlights the additional information that differential sensitivity analysis provides beyond traditional naive sensitivity methods, (b) the deterministic SIR model demonstrates the value of using second-order sensitivity in refining model predictions, (c) the stochastic SIR model shows how differential sensitivity can be attacked in stochastic modeling, and (d) a discrete birth-death-migration model illustrates how the complex perturbation method of differential sensitivity can be generalized to a broader range of biological models. Finally, we compare the speed, accuracy, and ease of use of these methods. We find that forward mode automatic differentiation has the quickest computational time, while the complex perturbation method is the simplest to implement and the most generalizable.
Collapse
Affiliation(s)
- Rachel Mester
- Department of Computational Medicine, University of California Los Angeles, Los Angeles, California, United States of America
- * E-mail:
| | - Alfonso Landeros
- Department of Computational Medicine, University of California Los Angeles, Los Angeles, California, United States of America
| | - Chris Rackauckas
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Pumas-AI, Annapolis, Maryland, United States of America
- Julia Computing, Cambridge, Massachusetts, United States of America
| | - Kenneth Lange
- Department of Computational Medicine, University of California Los Angeles, Los Angeles, California, United States of America
- Department of Human Genetics, University of California Los Angeles, Los Angeles, California, United States of America
- Department of Statistics, University of California Los Angeles, Los Angeles, California, United States of America
| |
Collapse
|
16
|
Baltussen MG, van de Wiel J, Fernández Regueiro CL, Jakštaitė M, Huck WTS. A Bayesian Approach to Extracting Kinetic Information from Artificial Enzymatic Networks. Anal Chem 2022; 94:7311-7318. [PMID: 35549162 PMCID: PMC9134183 DOI: 10.1021/acs.analchem.2c00659] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
In order to create artificial enzymatic networks capable of increasingly complex behavior, an improved methodology in understanding and controlling the kinetics of these networks is needed. Here, we introduce a Bayesian analysis method allowing for the accurate inference of enzyme kinetic parameters and determination of most likely reaction mechanisms, by combining data from different experiments and network topologies in a single probabilistic analysis framework. This Bayesian approach explicitly allows us to continuously improve our parameter estimates and behavior predictions by iteratively adding new data to our models, while automatically taking into account uncertainties introduced by the experimental setups or the chemical processes in general. We demonstrate the potential of this approach by characterizing systems of enzymes compartmentalized in beads inside flow reactors. The methods we introduce here provide a new approach to the design of increasingly complex artificial enzymatic networks, making the design of such networks more efficient, and robust against the accumulation of experimental errors.
Collapse
Affiliation(s)
- Mathieu G Baltussen
- Institute for Molecules and Materials, Radboud University Nijmegen, 6525 AJ, Nijmegen, The Netherlands
| | - Jeroen van de Wiel
- Institute for Molecules and Materials, Radboud University Nijmegen, 6525 AJ, Nijmegen, The Netherlands
| | | | - Miglė Jakštaitė
- Institute for Molecules and Materials, Radboud University Nijmegen, 6525 AJ, Nijmegen, The Netherlands
| | - Wilhelm T S Huck
- Institute for Molecules and Materials, Radboud University Nijmegen, 6525 AJ, Nijmegen, The Netherlands
| |
Collapse
|
17
|
Zhang H, Chen J, Tian T. Bayesian Inference of Stochastic Dynamic Models Using Early-Rejection Methods Based on Sequential Stochastic Simulations. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:1484-1494. [PMID: 33216717 DOI: 10.1109/tcbb.2020.3039490] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Stochastic modelling is an important method to investigate the functions of noise in a wide range of biological systems. However, the parameter inference for stochastic models is still a challenging problem partially due to the large computing time required for stochastic simulations. To address this issue, we propose a novel early-rejection method by using sequential stochastic simulations. We first show that a large number of stochastic simulations are required to obtain reliable inference results. Instead of generating a large number of simulations for each parameter sample, we propose to generate these simulations in a number of stages. The simulation process will go to the next stage only if the accuracy of simulations at the current stage satisfies a given error criterion. We propose a formula to determine the error criterion and use a stochastic differential equation model to examine the effects of different criteria. Three biochemical network models are used to evaluate the efficiency and accuracy of the proposed method. Numerical results suggest the proposed early-rejection method achieves substantial improvement in the efficiency for the inference of stochastic models.
Collapse
|
18
|
Sharp JA, Browning AP, Burrage K, Simpson MJ. Parameter estimation and uncertainty quantification using information geometry. J R Soc Interface 2022; 19:20210940. [PMID: 35472269 PMCID: PMC9042578 DOI: 10.1098/rsif.2021.0940] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
In this work, we: (i) review likelihood-based inference for parameter estimation and the construction of confidence regions; and (ii) explore the use of techniques from information geometry, including geodesic curves and Riemann scalar curvature, to supplement typical techniques for uncertainty quantification, such as Bayesian methods, profile likelihood, asymptotic analysis and bootstrapping. These techniques from information geometry provide data-independent insights into uncertainty and identifiability, and can be used to inform data collection decisions. All code used in this work to implement the inference and information geometry techniques is available on GitHub.
Collapse
Affiliation(s)
- Jesse A Sharp
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Queensland, Australia.,ARC Centre of Excellence for Mathematical and Statistical Frontiers, Queensland University of Technology, Brisbane, Queensland, Australia
| | - Alexander P Browning
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Queensland, Australia.,ARC Centre of Excellence for Mathematical and Statistical Frontiers, Queensland University of Technology, Brisbane, Queensland, Australia
| | - Kevin Burrage
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Queensland, Australia.,ARC Centre of Excellence for Mathematical and Statistical Frontiers, Queensland University of Technology, Brisbane, Queensland, Australia.,Department of Computer Science, University of Oxford, Oxford, UK
| | - Matthew J Simpson
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Queensland, Australia.,Centre for Data Science, Queensland University of Technology, Brisbane, Queensland, Australia
| |
Collapse
|
19
|
Stimulating at the right time to recover network states in a model of the cortico-basal ganglia-thalamic circuit. PLoS Comput Biol 2022; 18:e1009887. [PMID: 35245281 PMCID: PMC8939795 DOI: 10.1371/journal.pcbi.1009887] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2021] [Revised: 03/22/2022] [Accepted: 01/31/2022] [Indexed: 11/26/2022] Open
Abstract
Synchronization of neural oscillations is thought to facilitate communication in the brain. Neurodegenerative pathologies such as Parkinson’s disease (PD) can result in synaptic reorganization of the motor circuit, leading to altered neuronal dynamics and impaired neural communication. Treatments for PD aim to restore network function via pharmacological means such as dopamine replacement, or by suppressing pathological oscillations with deep brain stimulation. We tested the hypothesis that brain stimulation can operate beyond a simple “reversible lesion” effect to augment network communication. Specifically, we examined the modulation of beta band (14–30 Hz) activity, a known biomarker of motor deficits and potential control signal for stimulation in Parkinson’s. To do this we setup a neural mass model of population activity within the cortico-basal ganglia-thalamic (CBGT) circuit with parameters that were constrained to yield spectral features comparable to those in experimental Parkinsonism. We modulated the connectivity of two major pathways known to be disrupted in PD and constructed statistical summaries of the spectra and functional connectivity of the resulting spontaneous activity. These were then used to assess the network-wide outcomes of closed-loop stimulation delivered to motor cortex and phase locked to subthalamic beta activity. Our results demonstrate that the spatial pattern of beta synchrony is dependent upon the strength of inputs to the STN. Precisely timed stimulation has the capacity to recover network states, with stimulation phase inducing activity with distinct spectral and spatial properties. These results provide a theoretical basis for the design of the next-generation brain stimulators that aim to restore neural communication in disease. Diseases of the brain lead to a wide range of disabling symptoms for patients, by affecting their ability to move or think properly. These symptoms arise from disruption to both the organization of networks in the brain, but also the timing of neural activity that propagates around it. Treatments for disease with drugs can restore the organization of these networks to some extent, yet it is very difficult to deliver drugs with good spatial or temporal selectivity. Brain stimulation provides one way in which to improve the spatial specificity of treatment, yet understanding how to stimulate at the right time to achieve the best outcome for patients, remains an outstanding question. In this work we use simulations of an important circuit involved in Parkinson’s disease that has parameters chosen to reflect recordings made in animal models of the disease. Using this computer model, we show how brain rhythms can act as signatures of underlying changes in networks. Further, we simulate intervention with temporally precise stimulation to show how future approaches to brain stimulation can act to restore or even augment neural networks following their degeneration in disease.
Collapse
|
20
|
Parameter inference for stochastic biochemical models from perturbation experiments parallelised at the single cell level. PLoS Comput Biol 2022; 18:e1009950. [PMID: 35303737 PMCID: PMC8967023 DOI: 10.1371/journal.pcbi.1009950] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Revised: 03/30/2022] [Accepted: 02/21/2022] [Indexed: 01/30/2023] Open
Abstract
Understanding and characterising biochemical processes inside single cells requires experimental platforms that allow one to perturb and observe the dynamics of such processes as well as computational methods to build and parameterise models from the collected data. Recent progress with experimental platforms and optogenetics has made it possible to expose each cell in an experiment to an individualised input and automatically record cellular responses over days with fine time resolution. However, methods to infer parameters of stochastic kinetic models from single-cell longitudinal data have generally been developed under the assumption that experimental data is sparse and that responses of cells to at most a few different input perturbations can be observed. Here, we investigate and compare different approaches for calculating parameter likelihoods of single-cell longitudinal data based on approximations of the chemical master equation (CME) with a particular focus on coupling the linear noise approximation (LNA) or moment closure methods to a Kalman filter. We show that, as long as cells are measured sufficiently frequently, coupling the LNA to a Kalman filter allows one to accurately approximate likelihoods and to infer model parameters from data even in cases where the LNA provides poor approximations of the CME. Furthermore, the computational cost of filtering-based iterative likelihood evaluation scales advantageously in the number of measurement times and different input perturbations and is thus ideally suited for data obtained from modern experimental platforms. To demonstrate the practical usefulness of these results, we perform an experiment in which single cells, equipped with an optogenetic gene expression system, are exposed to various different light-input sequences and measured at several hundred time points and use parameter inference based on iterative likelihood evaluation to parameterise a stochastic model of the system. A common result for the modelling of cellular processes is that available data is not sufficiently rich to uniquely determine the biological mechanism or even just to ensure identifiability of parameters of a given model. Perturbing cellular processes with informative input stimuli and measuring dynamical responses may alleviate this problem. With the development of novel experimental platforms, we are now in a position to parallelise such perturbation experiments at the single cell level. This raises a plethora of new questions. Is it more informative to diversify input perturbations but to observe only few cells for each input or should we rather ensure that many cells are observed for only few inputs? How can we calculate likelihoods and infer parameters of stochastic kinetic models from data sets in which each cell receives a different input perturbation? How does the computational efficiency of parameter inference methods scale with the number of inputs and the number of measurement times? Are there approaches that are particularly well-suited for such data sets? In this paper, we investigate these questions using the CcaS/CcaR optogenetic system driving the expression of a fluorescent reporter protein as primary case study.
Collapse
|
21
|
Villaverde AF, Pathirana D, Fröhlich F, Hasenauer J, Banga JR. A protocol for dynamic model calibration. Brief Bioinform 2022; 23:bbab387. [PMID: 34619769 PMCID: PMC8769694 DOI: 10.1093/bib/bbab387] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2021] [Revised: 08/06/2021] [Accepted: 08/29/2021] [Indexed: 12/23/2022] Open
Abstract
Ordinary differential equation models are nowadays widely used for the mechanistic description of biological processes and their temporal evolution. These models typically have many unknown and nonmeasurable parameters, which have to be determined by fitting the model to experimental data. In order to perform this task, known as parameter estimation or model calibration, the modeller faces challenges such as poor parameter identifiability, lack of sufficiently informative experimental data and the existence of local minima in the objective function landscape. These issues tend to worsen with larger model sizes, increasing the computational complexity and the number of unknown parameters. An incorrectly calibrated model is problematic because it may result in inaccurate predictions and misleading conclusions. For nonexpert users, there are a large number of potential pitfalls. Here, we provide a protocol that guides the user through all the steps involved in the calibration of dynamic models. We illustrate the methodology with two models and provide all the code required to reproduce the results and perform the same analysis on new models. Our protocol provides practitioners and researchers in biological modelling with a one-stop guide that is at the same time compact and sufficiently comprehensive to cover all aspects of the problem.
Collapse
Affiliation(s)
- Alejandro F Villaverde
- Universidade de Vigo, Department of Systems Engineering & Control, Vigo 36310, Galicia, Spain
| | - Dilan Pathirana
- Faculty of Mathematics and Natural Sciences, University of Bonn, Bonn 53115, Germany
| | - Fabian Fröhlich
- Institute of Computational Biology, Helmholtz Zentrum München, Neuherberg 85764, Germany
| | - Jan Hasenauer
- Center for Mathematics, Technische Universität München, Garching 85748, Germany
- Harvard Medical School, Cambridge, MA 02115, USA
| | - Julio R Banga
- Bioprocess Engineering Group, IIM-CSIC, Vigo 36208, Galicia, Spain
| |
Collapse
|
22
|
Thorne T, Kirk PDW, Harrington HA. OUP accepted manuscript. Bioinformatics 2022; 38:2529-2535. [PMID: 35191485 PMCID: PMC9048691 DOI: 10.1093/bioinformatics/btac118] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2021] [Revised: 02/07/2022] [Accepted: 02/18/2022] [Indexed: 12/03/2022] Open
Abstract
Motivation Inferring the parameters of models describing biological systems is an important problem in the reverse engineering of the mechanisms underlying these systems. Much work has focused on parameter inference of stochastic and ordinary differential equation models using Approximate Bayesian Computation (ABC). While there is some recent work on inference in spatial models, this remains an open problem. Simultaneously, advances in topological data analysis (TDA), a field of computational mathematics, have enabled spatial patterns in data to be characterized. Results Here, we focus on recent work using TDA to study different regimes of parameter space for a well-studied model of angiogenesis. We propose a method for combining TDA with ABC to infer parameters in the Anderson–Chaplain model of angiogenesis. We demonstrate that this topological approach outperforms ABC approaches that use simpler statistics based on spatial features of the data. This is a first step toward a general framework of spatial parameter inference for biological systems, for which there may be a variety of filtrations, vectorizations and summary statistics to be considered. Availability and implementation All code used to produce our results is available as a Snakemake workflow from github.com/tt104/tabc_angio.
Collapse
Affiliation(s)
| | - Paul D W Kirk
- MRC Biostatistics Unit, University of Cambridge, Cambridge CB2 0SR, UK
- Cambridge Institute of Therapeutic Immunology & Infectious Disease (CITIID), University of Cambridge, Cambridge CB2 0AW, UK
- Cancer Research UK Cambridge Centre, Ovarian Cancer Programme, Cambridge CB2 0RE, UK
| | | |
Collapse
|
23
|
Androulakis IP. Teaching computational systems biology with an eye on quantitative systems pharmacology at the undergraduate level: Why do it, who would take it, and what should we teach? FRONTIERS IN SYSTEMS BIOLOGY 2022; 2:1044281. [PMID: 36866242 PMCID: PMC9977321 DOI: 10.3389/fsysb.2022.1044281] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
Computational systems biology (CSB) is a field that emerged primarily as the product of research activities. As such, it grew in several directions in a distributed and uncoordinated manner making the area appealing and fascinating. The idea of not having to follow a specific path but instead creating one fueled innovation. As the field matured, several interdisciplinary graduate programs emerged attempting to educate future generations of computational systems biologists. These educational initiatives coordinated the dissemination of information across student populations that had already decided to specialize in this field. However, we are now entering an era where CSB, having established itself as a valuable research discipline, is attempting the next major step: Entering undergraduate curricula. As interesting as this endeavor may sound, it has several difficulties, mainly because the field is not uniformly defined. In this manuscript, we argue that this diversity is a significant advantage and that several incarnations of an undergraduate-level CSB biology course could, and should, be developed tailored to programmatic needs. In this manuscript, we share our experiences creating a course as part of a Biomedical Engineering program.
Collapse
Affiliation(s)
- Ioannis P Androulakis
- Biomedical Engineering Department, New Brunswick, NJ, United States.,Chemical and Biochemical Engineering Department, Rutgers University, New Brunswick, NJ, United States
| |
Collapse
|
24
|
Relationship Between Dimensionality and Convergence of Optimization Algorithms: A Comparison Between Data-Driven Normalization and Scaling Factor-Based Methods Using PEPSSBI. Methods Mol Biol 2022; 2385:91-115. [PMID: 34888717 PMCID: PMC9446379 DOI: 10.1007/978-1-0716-1767-0_5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
Ordinary differential equation models are used to represent intracellular signaling pathways in silico, aiding and guiding biological experiments to elucidate intracellular regulation. To construct such quantitative and predictive models of intracellular interactions, unknown parameters need to be estimated. Most of biological data are expressed in relative or arbitrary units, raising the question of how to compare model simulations with data. It has recently been shown that for models with large number of unknown parameters, fitting algorithms using a data-driven normalization of the simulations (DNS) performs best in terms of the convergence time and parameter identifiability. DNS approach compares model simulations and corresponding data both normalized by the same normalization procedure, without requiring additional parameters to be estimated, as necessary for widely used scaling factor-based methods. However, currently there is no parameter estimation software that directly supports DNS. In this chapter, we show how to apply DNS to dynamic models of systems and synthetic biology using PEPSSBI (Parameter Estimation Pipeline for Systems and Synthetic Biology). PEPSSBI is the first software that supports DNS, through algorithmically supported data normalization and objective function construction. PEPSSBI also supports model import using SBML and repeated parameter estimation runs executed in parallel either on a personal computer or a multi-CPU cluster.
Collapse
|
25
|
Rocca A, Kholodenko BN. Can Systems Biology Advance Clinical Precision Oncology? Cancers (Basel) 2021; 13:6312. [PMID: 34944932 PMCID: PMC8699328 DOI: 10.3390/cancers13246312] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2021] [Accepted: 12/10/2021] [Indexed: 12/13/2022] Open
Abstract
Precision oncology is perceived as a way forward to treat individual cancer patients. However, knowing particular cancer mutations is not enough for optimal therapeutic treatment, because cancer genotype-phenotype relationships are nonlinear and dynamic. Systems biology studies the biological processes at the systems' level, using an array of techniques, ranging from statistical methods to network reconstruction and analysis, to mathematical modeling. Its goal is to reconstruct the complex and often counterintuitive dynamic behavior of biological systems and quantitatively predict their responses to environmental perturbations. In this paper, we review the impact of systems biology on precision oncology. We show examples of how the analysis of signal transduction networks allows to dissect resistance to targeted therapies and inform the choice of combinations of targeted drugs based on tumor molecular alterations. Patient-specific biomarkers based on dynamical models of signaling networks can have a greater prognostic value than conventional biomarkers. These examples support systems biology models as valuable tools to advance clinical and translational oncological research.
Collapse
Affiliation(s)
- Andrea Rocca
- Hygiene and Public Health, Local Health Unit of Romagna, 47121 Forlì, Italy
| | - Boris N. Kholodenko
- Systems Biology Ireland, School of Medicine, University College Dublin, Belfield, D04 V1W8 Dublin, Ireland
- Conway Institute of Biomolecular and Biomedical Research, University College Dublin, Belfield, D04 V1W8 Dublin, Ireland
- Department of Pharmacology, Yale University School of Medicine, New Haven, CT 06520, USA
| |
Collapse
|
26
|
Martina-Perez S, Simpson MJ, Baker RE. Bayesian uncertainty quantification for data-driven equation learning. Proc Math Phys Eng Sci 2021; 477:20210426. [PMID: 35153587 PMCID: PMC8548080 DOI: 10.1098/rspa.2021.0426] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2021] [Accepted: 09/30/2021] [Indexed: 12/27/2022] Open
Abstract
Equation learning aims to infer differential equation models from data. While a number of studies have shown that differential equation models can be successfully identified when the data are sufficiently detailed and corrupted with relatively small amounts of noise, the relationship between observation noise and uncertainty in the learned differential equation models remains unexplored. We demonstrate that for noisy datasets there exists great variation in both the structure of the learned differential equation models and their parameter values. We explore how to exploit multiple datasets to quantify uncertainty in the learned models, and at the same time draw mechanistic conclusions about the target differential equations. We showcase our results using simulation data from a relatively straightforward agent-based model (ABM) which has a well-characterized partial differential equation description that provides highly accurate predictions of averaged ABM behaviours in relevant regions of parameter space. Our approach combines equation learning methods with Bayesian inference approaches so that a quantification of uncertainty can be given by the posterior parameter distribution of the learned model.
Collapse
Affiliation(s)
| | - Matthew J Simpson
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Australia
| | - Ruth E Baker
- Mathematical Institute, University of Oxford, Oxford, UK
| |
Collapse
|
27
|
Burton J, Manning CS, Rattray M, Papalopulu N, Kursawe J. Inferring kinetic parameters of oscillatory gene regulation from single cell time-series data. J R Soc Interface 2021; 18:20210393. [PMID: 34583566 PMCID: PMC8479358 DOI: 10.1098/rsif.2021.0393] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2021] [Accepted: 08/26/2021] [Indexed: 11/19/2022] Open
Abstract
Gene expression dynamics, such as stochastic oscillations and aperiodic fluctuations, have been associated with cell fate changes in multiple contexts, including development and cancer. Single cell live imaging of protein expression with endogenous reporters is widely used to observe such gene expression dynamics. However, the experimental investigation of regulatory mechanisms underlying the observed dynamics is challenging, since these mechanisms include complex interactions of multiple processes, including transcription, translation and protein degradation. Here, we present a Bayesian method to infer kinetic parameters of oscillatory gene expression regulation using an auto-negative feedback motif with delay. Specifically, we use a delay-adapted nonlinear Kalman filter within a Metropolis-adjusted Langevin algorithm to identify posterior probability distributions. Our method can be applied to time-series data on gene expression from single cells and is able to infer multiple parameters simultaneously. We apply it to published data on murine neural progenitor cells and show that it outperforms alternative methods. We further analyse how parameter uncertainty depends on the duration and time resolution of an imaging experiment, to make experimental design recommendations. This work demonstrates the utility of parameter inference on time course data from single cells and enables new studies on cell fate changes and population heterogeneity.
Collapse
Affiliation(s)
- Joshua Burton
- Division of Informatics, Imaging and Data Sciences, Faculty of Biology Medicine and Health, The University of Manchester, Oxford Road, Manchester M13 9PT, UK
| | - Cerys S. Manning
- Division of Developmental Biology and Medicine, School of Medical Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, Oxford Road, Manchester M13 9PT, UK
| | - Magnus Rattray
- Division of Informatics, Imaging and Data Sciences, Faculty of Biology Medicine and Health, The University of Manchester, Oxford Road, Manchester M13 9PT, UK
| | - Nancy Papalopulu
- Division of Developmental Biology and Medicine, School of Medical Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, Oxford Road, Manchester M13 9PT, UK
| | - Jochen Kursawe
- School of Mathematics and Statistics, University of St Andrews, North Haugh, St Andrews, KY16 9SS, UK
| |
Collapse
|
28
|
New strategies for targeting kinase networks in cancer. J Biol Chem 2021; 297:101128. [PMID: 34461089 PMCID: PMC8449055 DOI: 10.1016/j.jbc.2021.101128] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2021] [Revised: 08/23/2021] [Accepted: 08/26/2021] [Indexed: 12/18/2022] Open
Abstract
Targeted strategies against specific driver molecules of cancer have brought about many advances in cancer treatment since the early success of the first small-molecule inhibitor Gleevec. Today, there are a multitude of targeted therapies approved by the Food and Drug Administration for the treatment of cancer. However, the initial efficacy of virtually every targeted treatment is often reversed by tumor resistance to the inhibitor through acquisition of new mutations in the target molecule, or reprogramming of the epigenome, transcriptome, or kinome of the tumor cells. At the core of this clinical problem lies the assumption that targeted treatments will only be efficacious if the inhibitors are used at their maximum tolerated doses. Such aggressive regimens create strong selective pressure on the evolutionary progression of the tumor, resulting in resistant cells. High-dose single agent treatments activate alternative mechanisms that bypass the inhibitor, while high-dose combinatorial treatments suffer from increased toxicity resulting in treatment cessation. Although there is an arsenal of targeted agents being tested clinically and preclinically, identifying the most effective combination treatment plan remains a challenge. In this review, we discuss novel targeted strategies with an emphasis on the recent cross-disciplinary studies demonstrating that it is possible to achieve antitumor efficacy without increasing toxicity by adopting low-dose multitarget approaches to treatment of cancer and metastasis.
Collapse
|
29
|
West TO, Berthouze L, Farmer SF, Cagnan H, Litvak V. Inference of brain networks with approximate Bayesian computation - assessing face validity with an example application in Parkinsonism. Neuroimage 2021; 236:118020. [PMID: 33839264 PMCID: PMC8270890 DOI: 10.1016/j.neuroimage.2021.118020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2020] [Revised: 03/16/2021] [Accepted: 03/21/2021] [Indexed: 11/21/2022] Open
Abstract
This paper describes and validates a novel framework using the Approximate Bayesian Computation (ABC) algorithm for parameter estimation and model selection in models of mesoscale brain network activity. We provide a proof of principle, first pass validation of this framework using a set of neural mass models of the cortico-basal ganglia thalamic circuit inverted upon spectral features from experimental, in vivo recordings. This optimization scheme relaxes an assumption of fixed-form posteriors (i.e. the Laplace approximation) taken in previous approaches to inverse modelling of spectral features. This enables the exploration of model dynamics beyond that approximated from local linearity assumptions and so fit to explicit, numerical solutions of the underlying non-linear system of equations. In this first paper, we establish a face validation of the optimization procedures in terms of: (i) the ability to approximate posterior densities over parameters that are plausible given the known causes of the data; (ii) the ability of the model comparison procedures to yield posterior model probabilities that can identify the model structure known to generate the data; and (iii) the robustness of these procedures to local minima in the face of different starting conditions. Finally, as an illustrative application we show (iv) that model comparison can yield plausible conclusions given the known neurobiology of the cortico-basal ganglia-thalamic circuit in Parkinsonism. These results lay the groundwork for future studies utilizing highly nonlinear or brittle models that can explain time dependant dynamics, such as oscillatory bursts, in terms of the underlying neural circuits.
Collapse
Affiliation(s)
- Timothy O West
- Nuffield Department of Clinical Neurosciences, Medical Sciences Division, University of Oxford, Oxford OX3 9DU, United Kingdom; Medical Research Council Brain Network Dynamics Unit, University of Oxford, Oxford OX1 3TH, United Kingdom; Wellcome Trust Centre for Human Neuroimaging, UCL Institute of Neurology, Queen Square, London WC1N 3BG, United Kingdom.
| | - Luc Berthouze
- Centre for Computational Neuroscience and Robotics, University of Sussex, Falmer, United Kingdom; UCL Great Ormond Street Institute of Child Health, Guildford St., London WC1N 1EH, United Kingdom
| | - Simon F Farmer
- Department of Neurology, National Hospital for Neurology & Neurosurgery, Queen Square, London WC1N 3BG, United Kingdom; Department of Clinical and Movement Neurosciences, Institute of Neurology, Queen Square, UCL, London WC1N 3BG, United Kingdom
| | - Hayriye Cagnan
- Nuffield Department of Clinical Neurosciences, Medical Sciences Division, University of Oxford, Oxford OX3 9DU, United Kingdom; Medical Research Council Brain Network Dynamics Unit, University of Oxford, Oxford OX1 3TH, United Kingdom; Wellcome Trust Centre for Human Neuroimaging, UCL Institute of Neurology, Queen Square, London WC1N 3BG, United Kingdom
| | - Vladimir Litvak
- Wellcome Trust Centre for Human Neuroimaging, UCL Institute of Neurology, Queen Square, London WC1N 3BG, United Kingdom
| |
Collapse
|
30
|
Bittner SR, Palmigiano A, Piet AT, Duan CA, Brody CD, Miller KD, Cunningham J. Interrogating theoretical models of neural computation with emergent property inference. eLife 2021; 10:e56265. [PMID: 34323690 PMCID: PMC8321557 DOI: 10.7554/elife.56265] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2020] [Accepted: 06/30/2021] [Indexed: 11/13/2022] Open
Abstract
A cornerstone of theoretical neuroscience is the circuit model: a system of equations that captures a hypothesized neural mechanism. Such models are valuable when they give rise to an experimentally observed phenomenon -- whether behavioral or a pattern of neural activity -- and thus can offer insights into neural computation. The operation of these circuits, like all models, critically depends on the choice of model parameters. A key step is then to identify the model parameters consistent with observed phenomena: to solve the inverse problem. In this work, we present a novel technique, emergent property inference (EPI), that brings the modern probabilistic modeling toolkit to theoretical neuroscience. When theorizing circuit models, theoreticians predominantly focus on reproducing computational properties rather than a particular dataset. Our method uses deep neural networks to learn parameter distributions with these computational properties. This methodology is introduced through a motivational example of parameter inference in the stomatogastric ganglion. EPI is then shown to allow precise control over the behavior of inferred parameters and to scale in parameter dimension better than alternative techniques. In the remainder of this work, we present novel theoretical findings in models of primary visual cortex and superior colliculus, which were gained through the examination of complex parametric structure captured by EPI. Beyond its scientific contribution, this work illustrates the variety of analyses possible once deep learning is harnessed towards solving theoretical inverse problems.
Collapse
Affiliation(s)
- Sean R Bittner
- Department of Neuroscience, Columbia UniversityNew YorkUnited States
| | | | - Alex T Piet
- Princeton Neuroscience InstitutePrincetonUnited States
- Princeton UniversityPrincetonUnited States
- Allen Institute for Brain ScienceSeattleUnited States
| | - Chunyu A Duan
- Institute of Neuroscience, Chinese Academy of SciencesShanghaiChina
| | - Carlos D Brody
- Princeton Neuroscience InstitutePrincetonUnited States
- Princeton UniversityPrincetonUnited States
- Howard Hughes Medical InstituteChevy ChaseUnited States
| | - Kenneth D Miller
- Department of Neuroscience, Columbia UniversityNew YorkUnited States
| | - John Cunningham
- Department of Statistics, Columbia UniversityNew YorkUnited States
| |
Collapse
|
31
|
Roesch E, Rackauckas C, Stumpf MPH. Collocation based training of neural ordinary differential equations. Stat Appl Genet Mol Biol 2021; 20:37-49. [PMID: 34237805 DOI: 10.1515/sagmb-2020-0025] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2020] [Accepted: 05/04/2021] [Indexed: 11/15/2022]
Abstract
The predictive power of machine learning models often exceeds that of mechanistic modeling approaches. However, the interpretability of purely data-driven models, without any mechanistic basis is often complicated, and predictive power by itself can be a poor metric by which we might want to judge different methods. In this work, we focus on the relatively new modeling techniques of neural ordinary differential equations. We discuss how they relate to machine learning and mechanistic models, with the potential to narrow the gulf between these two frameworks: they constitute a class of hybrid model that integrates ideas from data-driven and dynamical systems approaches. Training neural ODEs as representations of dynamical systems data has its own specific demands, and we here propose a collocation scheme as a fast and efficient training strategy. This alleviates the need for costly ODE solvers. We illustrate the advantages that collocation approaches offer, as well as their robustness to qualitative features of a dynamical system, and the quantity and quality of observational data. We focus on systems that exemplify some of the hallmarks of complex dynamical systems encountered in systems biology, and we map out how these methods can be used in the analysis of mathematical models of cellular and physiological processes.
Collapse
Affiliation(s)
- Elisabeth Roesch
- Melbourne Integrative Genomics, University of Melbourne, 30 Royal Parade, Parkville, VIC3052, Australia.,School of Mathematics and Statistics, University of Melbourne, 813 Swanston Street, Parkville, VIC3010, Australia
| | - Christopher Rackauckas
- Department of Mathematics, Massachusetts Institute of Technology, 182 Memorial Dr, Cambridge, MA02142, USA.,Julia Computing, 240 Elm Street, 2nd Floor, Somerville, Massachusetts02144, USA.,Pumas-AI, 14711 Kamputa Drive, Centerville, VA20120, USA
| | - Michael P H Stumpf
- Melbourne Integrative Genomics, University of Melbourne, 30 Royal Parade, Parkville, VIC3052, Australia.,School of Mathematics and Statistics, University of Melbourne, 813 Swanston Street, Parkville, VIC3010, Australia
| |
Collapse
|
32
|
Camacho-Aguilar E, Warmflash A, Rand DA. Quantifying cell transitions in C. elegans with data-fitted landscape models. PLoS Comput Biol 2021; 17:e1009034. [PMID: 34061834 PMCID: PMC8195438 DOI: 10.1371/journal.pcbi.1009034] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2021] [Revised: 06/11/2021] [Accepted: 05/03/2021] [Indexed: 12/19/2022] Open
Abstract
Increasing interest has emerged in new mathematical approaches that simplify the study of complex differentiation processes by formalizing Waddington's landscape metaphor. However, a rational method to build these landscape models remains an open problem. Here we study vulval development in C. elegans by developing a framework based on Catastrophe Theory (CT) and approximate Bayesian computation (ABC) to build data-fitted landscape models. We first identify the candidate qualitative landscapes, and then use CT to build the simplest model consistent with the data, which we quantitatively fit using ABC. The resulting model suggests that the underlying mechanism is a quantifiable two-step decision controlled by EGF and Notch-Delta signals, where a non-vulval/vulval decision is followed by a bistable transition to the two vulval states. This new model fits a broad set of data and makes several novel predictions.
Collapse
Affiliation(s)
- Elena Camacho-Aguilar
- Mathematics Institute, University of Warwick, Coventry, United Kingdom
- Department of Biosciences, Rice University, Houston, Texas, United States of America
| | - Aryeh Warmflash
- Department of Biosciences, Rice University, Houston, Texas, United States of America
- Department of Bioengineering, Rice University, Houston, Texas, United States of America
| | - David A. Rand
- Mathematics Institute, University of Warwick, Coventry, United Kingdom
- Zeeman Institute for Systems Biology & Infectious Disease Epidemiology Research, University of Warwick, Coventry, United Kingdom
| |
Collapse
|
33
|
Yesilkanal AE, Yang D, Valdespino A, Tiwari P, Sabino AU, Nguyen LC, Lee J, Xie XH, Sun S, Dann C, Robinson-Mailman L, Steinberg E, Stuhlmiller T, Frankenberger C, Goldsmith E, Johnson GL, Ramos AF, Rosner MR. Limited inhibition of multiple nodes in a driver network blocks metastasis. eLife 2021; 10:59696. [PMID: 33973518 PMCID: PMC8128439 DOI: 10.7554/elife.59696] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2020] [Accepted: 04/29/2021] [Indexed: 12/14/2022] Open
Abstract
Metastasis suppression by high-dose, multi-drug targeting is unsuccessful due to network heterogeneity and compensatory network activation. Here, we show that targeting driver network signaling capacity by limited inhibition of core pathways is a more effective anti-metastatic strategy. This principle underlies the action of a physiological metastasis suppressor, Raf Kinase Inhibitory Protein (RKIP), that moderately decreases stress-regulated MAP kinase network activity, reducing output to transcription factors such as pro-metastastic BACH1 and motility-related target genes. We developed a low-dose four-drug mimic that blocks metastatic colonization in mouse breast cancer models and increases survival. Experiments and network flow modeling show limited inhibition of multiple pathways is required to overcome variation in MAPK network topology and suppress signaling output across heterogeneous tumor cells. Restricting inhibition of individual kinases dissipates surplus signal, preventing threshold activation of compensatory kinase networks. This low-dose multi-drug approach to decrease signaling capacity of driver networks represents a transformative, clinically relevant strategy for anti-metastatic treatment.
Collapse
Affiliation(s)
- Ali Ekrem Yesilkanal
- Ben May Department for Cancer Research, University of Chicago, Chicago, United States
| | - Dongbo Yang
- Ben May Department for Cancer Research, University of Chicago, Chicago, United States
| | - Andrea Valdespino
- Ben May Department for Cancer Research, University of Chicago, Chicago, United States
| | - Payal Tiwari
- Ben May Department for Cancer Research, University of Chicago, Chicago, United States
| | - Alan U Sabino
- Instituto do Câncer do Estado de São Paulo, Faculdade de Medicina and Escola de Artes, Ciências e Humanidades; University of São Paulo, São Paulo, Brazil
| | - Long Chi Nguyen
- Ben May Department for Cancer Research, University of Chicago, Chicago, United States
| | - Jiyoung Lee
- Ben May Department for Cancer Research, University of Chicago, Chicago, United States
| | - Xiao-He Xie
- Ben May Department for Cancer Research, University of Chicago, Chicago, United States
| | - Siqi Sun
- Ben May Department for Cancer Research, University of Chicago, Chicago, United States
| | - Christopher Dann
- Ben May Department for Cancer Research, University of Chicago, Chicago, United States
| | | | - Ethan Steinberg
- Ben May Department for Cancer Research, University of Chicago, Chicago, United States
| | - Timothy Stuhlmiller
- Department of Pharmacology, University of North Carolina at Chapel Hill, Chapel Hill, United States
| | - Casey Frankenberger
- Ben May Department for Cancer Research, University of Chicago, Chicago, United States
| | | | - Gary L Johnson
- Department of Pharmacology, University of North Carolina at Chapel Hill, Chapel Hill, United States
| | - Alexandre F Ramos
- Instituto do Câncer do Estado de São Paulo, Faculdade de Medicina and Escola de Artes, Ciências e Humanidades; University of São Paulo, São Paulo, Brazil
| | - Marsha R Rosner
- Ben May Department for Cancer Research, University of Chicago, Chicago, United States
| |
Collapse
|
34
|
Fisher HF, Boys RJ, Gillespie CS, Proctor CJ, Golightly A. Parameter inference for a stochastic kinetic model of expanded polyglutamine proteins. Biometrics 2021; 78:1195-1208. [PMID: 33837525 DOI: 10.1111/biom.13467] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2019] [Revised: 03/21/2021] [Accepted: 03/24/2021] [Indexed: 11/30/2022]
Abstract
The presence of protein aggregates in cells is a known feature of many human age-related diseases, such as Huntington's disease. Simulations using fixed parameter values in a model of the dynamic evolution of expanded polyglutaime (PolyQ) proteins in cells have been used to gain a better understanding of the biological system. However, there is considerable uncertainty about the values of some of the parameters governing the system. Currently, appropriate values are chosen by ad hoc attempts to tune the parameters so that the model output matches experimental data. The problem is further complicated by the fact that the data only offer a partial insight into the underlying biological process: the data consist only of the proportions of cell death and of cells with inclusion bodies at a few time points, corrupted by measurement error. Developing inference procedures to estimate the model parameters in this scenario is a significant task. The model probabilities corresponding to the observed proportions cannot be evaluated exactly, and so they are estimated within the inference algorithm by repeatedly simulating realizations from the model. In general such an approach is computationally very expensive, and we therefore construct Gaussian process emulators for the key quantities and reformulate our algorithm around these fast stochastic approximations. We conclude by highlighting appropriate values of the model parameters leading to new insights into the underlying biological processes.
Collapse
Affiliation(s)
- H F Fisher
- School of Mathematics, Statistics & Physics, Newcastle University, Newcastle Upon Tyne, UK.,Population Health Sciences Institute, Newcastle University, Newcastle Upon Tyne, UK
| | - R J Boys
- School of Mathematics, Statistics & Physics, Newcastle University, Newcastle Upon Tyne, UK
| | - C S Gillespie
- School of Mathematics, Statistics & Physics, Newcastle University, Newcastle Upon Tyne, UK
| | - C J Proctor
- Institute of Cellular Medicine, Newcastle University, Newcastle Upon Tyne, UK
| | - A Golightly
- School of Mathematics, Statistics & Physics, Newcastle University, Newcastle Upon Tyne, UK
| |
Collapse
|
35
|
Goyal A, Reeves DB, Cardozo-Ojeda EF, Schiffer JT, Mayer BT. Viral load and contact heterogeneity predict SARS-CoV-2 transmission and super-spreading events. eLife 2021; 10:e63537. [PMID: 33620317 PMCID: PMC7929560 DOI: 10.7554/elife.63537] [Citation(s) in RCA: 101] [Impact Index Per Article: 33.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2020] [Accepted: 02/22/2021] [Indexed: 12/22/2022] Open
Abstract
SARS-CoV-2 is difficult to contain because many transmissions occur during pre-symptomatic infection. Unlike influenza, most SARS-CoV-2-infected people do not transmit while a small percentage infect large numbers of people. We designed mathematical models which link observed viral loads with epidemiologic features of each virus, including distribution of transmissions attributed to each infected person and duration between symptom onset in the transmitter and secondarily infected person. We identify that people infected with SARS-CoV-2 or influenza can be highly contagious for less than 1 day, congruent with peak viral load. SARS-CoV-2 super-spreader events occur when an infected person is shedding at a very high viral load and has a high number of exposed contacts. The higher predisposition of SARS-CoV-2 toward super-spreading events cannot be attributed to additional weeks of shedding relative to influenza. Rather, a person infected with SARS-CoV-2 exposes more people within equivalent physical contact networks, likely due to aerosolization.
Collapse
Affiliation(s)
- Ashish Goyal
- Vaccine and Infectious Diseases Division, Fred Hutchinson Cancer Research CenterSeattleUnited States
| | - Daniel B Reeves
- Vaccine and Infectious Diseases Division, Fred Hutchinson Cancer Research CenterSeattleUnited States
| | - E Fabian Cardozo-Ojeda
- Vaccine and Infectious Diseases Division, Fred Hutchinson Cancer Research CenterSeattleUnited States
| | - Joshua T Schiffer
- Vaccine and Infectious Diseases Division, Fred Hutchinson Cancer Research CenterSeattleUnited States
- Department of Medicine, University of WashingtonSeattleUnited States
- Clinical Research Division, Fred Hutchinson Cancer Research CenterSeattleUnited States
| | - Bryan T Mayer
- Vaccine and Infectious Diseases Division, Fred Hutchinson Cancer Research CenterSeattleUnited States
| |
Collapse
|
36
|
Browning AP, Warne DJ, Burrage K, Baker RE, Simpson MJ. Identifiability analysis for stochastic differential equation models in systems biology. J R Soc Interface 2020; 17:20200652. [PMID: 33323054 PMCID: PMC7811582 DOI: 10.1098/rsif.2020.0652] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2020] [Accepted: 11/24/2020] [Indexed: 12/26/2022] Open
Abstract
Mathematical models are routinely calibrated to experimental data, with goals ranging from building predictive models to quantifying parameters that cannot be measured. Whether or not reliable parameter estimates are obtainable from the available data can easily be overlooked. Such issues of parameter identifiability have important ramifications for both the predictive power of a model, and the mechanistic insight that can be obtained. Identifiability analysis is well-established for deterministic, ordinary differential equation (ODE) models, but there are no commonly adopted methods for analysing identifiability in stochastic models. We provide an accessible introduction to identifiability analysis and demonstrate how existing ideas for analysis of ODE models can be applied to stochastic differential equation (SDE) models through four practical case studies. To assess structural identifiability, we study ODEs that describe the statistical moments of the stochastic process using open-source software tools. Using practically motivated synthetic data and Markov chain Monte Carlo methods, we assess parameter identifiability in the context of available data. Our analysis shows that SDE models can often extract more information about parameters than deterministic descriptions. All code used to perform the analysis is available on Github.
Collapse
Affiliation(s)
- Alexander P. Browning
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Australia
- ARC Centre of Excellence for Mathematical and Statistical Frontiers, Queensland University of Technology, Brisbane, Australia
| | - David J. Warne
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Australia
- ARC Centre of Excellence for Mathematical and Statistical Frontiers, Queensland University of Technology, Brisbane, Australia
| | - Kevin Burrage
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Australia
- ARC Centre of Excellence for Mathematical and Statistical Frontiers, Queensland University of Technology, Brisbane, Australia
- ARC Centre of Excellence for Plant Success in Nature and Agriculture, Queensland University of Technology, Brisbane, Australia
- Department of Computer Science, University of Oxford, Oxford, UK
| | - Ruth E. Baker
- Mathematical Institute, University of Oxford, Oxford, UK
| | - Matthew J. Simpson
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Australia
- ARC Centre of Excellence for Mathematical and Statistical Frontiers, Queensland University of Technology, Brisbane, Australia
| |
Collapse
|
37
|
Gonçalves PJ, Lueckmann JM, Deistler M, Nonnenmacher M, Öcal K, Bassetto G, Chintaluri C, Podlaski WF, Haddad SA, Vogels TP, Greenberg DS, Macke JH. Training deep neural density estimators to identify mechanistic models of neural dynamics. eLife 2020; 9:e56261. [PMID: 32940606 PMCID: PMC7581433 DOI: 10.7554/elife.56261] [Citation(s) in RCA: 57] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2020] [Accepted: 09/16/2020] [Indexed: 01/27/2023] Open
Abstract
Mechanistic modeling in neuroscience aims to explain observed phenomena in terms of underlying causes. However, determining which model parameters agree with complex and stochastic neural data presents a significant challenge. We address this challenge with a machine learning tool which uses deep neural density estimators-trained using model simulations-to carry out Bayesian inference and retrieve the full space of parameters compatible with raw data or selected data features. Our method is scalable in parameters and data features and can rapidly analyze new data after initial training. We demonstrate the power and flexibility of our approach on receptive fields, ion channels, and Hodgkin-Huxley models. We also characterize the space of circuit configurations giving rise to rhythmic activity in the crustacean stomatogastric ganglion, and use these results to derive hypotheses for underlying compensation mechanisms. Our approach will help close the gap between data-driven and theory-driven models of neural dynamics.
Collapse
Affiliation(s)
- Pedro J Gonçalves
- Computational Neuroengineering, Department of Electrical and Computer Engineering, Technical University of MunichMunichGermany
- Max Planck Research Group Neural Systems Analysis, Center of Advanced European Studies and Research (caesar)BonnGermany
| | - Jan-Matthis Lueckmann
- Computational Neuroengineering, Department of Electrical and Computer Engineering, Technical University of MunichMunichGermany
- Max Planck Research Group Neural Systems Analysis, Center of Advanced European Studies and Research (caesar)BonnGermany
| | - Michael Deistler
- Computational Neuroengineering, Department of Electrical and Computer Engineering, Technical University of MunichMunichGermany
- Machine Learning in Science, Excellence Cluster Machine Learning, Tübingen UniversityTübingenGermany
| | - Marcel Nonnenmacher
- Computational Neuroengineering, Department of Electrical and Computer Engineering, Technical University of MunichMunichGermany
- Max Planck Research Group Neural Systems Analysis, Center of Advanced European Studies and Research (caesar)BonnGermany
- Model-Driven Machine Learning, Institute of Coastal Research, Helmholtz Centre GeesthachtGeesthachtGermany
| | - Kaan Öcal
- Max Planck Research Group Neural Systems Analysis, Center of Advanced European Studies and Research (caesar)BonnGermany
- Mathematical Institute, University of BonnBonnGermany
| | - Giacomo Bassetto
- Computational Neuroengineering, Department of Electrical and Computer Engineering, Technical University of MunichMunichGermany
- Max Planck Research Group Neural Systems Analysis, Center of Advanced European Studies and Research (caesar)BonnGermany
| | - Chaitanya Chintaluri
- Centre for Neural Circuits and Behaviour, University of OxfordOxfordUnited Kingdom
- Institute of Science and Technology AustriaKlosterneuburgAustria
| | - William F Podlaski
- Centre for Neural Circuits and Behaviour, University of OxfordOxfordUnited Kingdom
| | - Sara A Haddad
- Max Planck Institute for Brain ResearchFrankfurtGermany
| | - Tim P Vogels
- Centre for Neural Circuits and Behaviour, University of OxfordOxfordUnited Kingdom
- Institute of Science and Technology AustriaKlosterneuburgAustria
| | - David S Greenberg
- Computational Neuroengineering, Department of Electrical and Computer Engineering, Technical University of MunichMunichGermany
- Model-Driven Machine Learning, Institute of Coastal Research, Helmholtz Centre GeesthachtGeesthachtGermany
| | - Jakob H Macke
- Computational Neuroengineering, Department of Electrical and Computer Engineering, Technical University of MunichMunichGermany
- Max Planck Research Group Neural Systems Analysis, Center of Advanced European Studies and Research (caesar)BonnGermany
- Machine Learning in Science, Excellence Cluster Machine Learning, Tübingen UniversityTübingenGermany
- Max Planck Institute for Intelligent SystemsTübingenGermany
| |
Collapse
|
38
|
Harrison JU, Baker RE. An automatic adaptive method to combine summary statistics in approximate Bayesian computation. PLoS One 2020; 15:e0236954. [PMID: 32760106 PMCID: PMC7410215 DOI: 10.1371/journal.pone.0236954] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2020] [Accepted: 07/16/2020] [Indexed: 11/18/2022] Open
Abstract
To infer the parameters of mechanistic models with intractable likelihoods, techniques such as approximate Bayesian computation (ABC) are increasingly being adopted. One of the main disadvantages of ABC in practical situations, however, is that parameter inference must generally rely on summary statistics of the data. This is particularly the case for problems involving high-dimensional data, such as biological imaging experiments. However, some summary statistics contain more information about parameters of interest than others, and it is not always clear how to weight their contributions within the ABC framework. We address this problem by developing an automatic, adaptive algorithm that chooses weights for each summary statistic. Our algorithm aims to maximize the distance between the prior and the approximate posterior by automatically adapting the weights within the ABC distance function. Computationally, we use a nearest neighbour estimator of the distance between distributions. We justify the algorithm theoretically based on properties of the nearest neighbour distance estimator. To demonstrate the effectiveness of our algorithm, we apply it to a variety of test problems, including several stochastic models of biochemical reaction networks, and a spatial model of diffusion, and compare our results with existing algorithms.
Collapse
Affiliation(s)
- Jonathan U. Harrison
- Mathematical Institute, Mathematical Sciences Building, University of Warwick, Coventry, United Kingdom
- * E-mail:
| | - Ruth E. Baker
- Mathematical Institute, Andrew Wiles Building, University of Oxford, Oxford, United Kingdom
| |
Collapse
|
39
|
Shafiekhani S, Shafiekhani M, Rahbar S, Jafari AH. Extended Robust Boolean Network of Budding Yeast Cell Cycle. JOURNAL OF MEDICAL SIGNALS & SENSORS 2020; 10:94-104. [PMID: 32676445 PMCID: PMC7359953 DOI: 10.4103/jmss.jmss_40_19] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2019] [Revised: 10/22/2019] [Accepted: 10/20/2019] [Indexed: 11/17/2022]
Abstract
Background: How to explore the dynamics of transition probabilities between phases of budding yeast cell cycle (BYCC) network based on the dynamics of protein activities that control this network? How to identify the robust structure of protein interactions of BYCC Boolean network (BN)? Budding yeast allows scientists to put experiments into effect in order to discover the intracellular cell cycle regulating structures which are well simulated by mathematical modeling. Methods: We extended an available deterministic BN of proteins responsible for the cell cycle to a Markov chain model containing apoptosis besides G1, S, G2, M, and stationary G1. Using genetic algorithm (GA), we estimated the kinetic parameters of the extended BN model so that the subsequent transition probabilities derived using Markov chain model of cell states as normal cell cycle becomes the maximum while the structure of chemical interactions of extended BN of cell cycle becomes more stable. Results: Using kinetic parameters optimized by GA, the probability of the subsequent transitions between cell cycle phases is maximized. The relative basin size of stationary G1 increased from 86% to 96.48% while the number of attractors decreased from 7 in the original model to 5 in the extended one. Hence, an increase in the robustness of the system has been achieved. Conclusion: The structure of interacting proteins in cell cycle network affects its robustness and probabilities of transitions between different cell cycle phases. Markov chain and BN are good approaches to study the stability and dynamics of the cell cycle network.
Collapse
Affiliation(s)
- Sajad Shafiekhani
- Department of Biomedical Engineering, School of Medicine, Tehran University of Medical Sciences, Tehran, Iran.,Research Center for Biomedical Technologies and Robotics, Tehran University of Medical Sciences, Tehran, Iran.,Students' Scientific Research Center, Tehran University of Medical Sciences, Tehran, Iran
| | - Mojtaba Shafiekhani
- Department of Biomedical Engineering, Amirkabir University of Technology, Tehran, Iran
| | - Sara Rahbar
- Department of Biomedical Engineering, School of Medicine, Tehran University of Medical Sciences, Tehran, Iran.,Research Center for Biomedical Technologies and Robotics, Tehran University of Medical Sciences, Tehran, Iran
| | - Amir Homayoun Jafari
- Department of Biomedical Engineering, School of Medicine, Tehran University of Medical Sciences, Tehran, Iran.,Research Center for Biomedical Technologies and Robotics, Tehran University of Medical Sciences, Tehran, Iran
| |
Collapse
|
40
|
Bianconi F, Antonini C, Tomassoni L, Valigi P. Application of conditional robust calibration to ordinary differential equations models in computational systems biology: a comparison of two sampling strategies. IET Syst Biol 2020; 14:107-119. [PMID: 32406375 PMCID: PMC8687221 DOI: 10.1049/iet-syb.2018.5091] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2018] [Revised: 07/30/2019] [Accepted: 11/15/2019] [Indexed: 01/01/2023] Open
Abstract
Mathematical modelling is a widely used technique for describing the temporal behaviour of biological systems. One of the most challenging topics in computational systems biology is the calibration of non-linear models; i.e. the estimation of their unknown parameters. The state-of-the-art methods in this field are the frequentist and Bayesian approaches. For both of them, the performance and accuracy of results greatly depend on the sampling technique employed. Here, the authors test a novel Bayesian procedure for parameter estimation, called conditional robust calibration (CRC), comparing two different sampling techniques: uniform and logarithmic Latin hypercube sampling. CRC is an iterative algorithm based on parameter space sampling and on the estimation of parameter density functions. They apply CRC with both sampling strategies to the three ordinary differential equations (ODEs) models of increasing complexity. They obtain a more precise and reliable solution through logarithmically spaced samples.
Collapse
Affiliation(s)
| | - Chiara Antonini
- Department of Engineering, University of Perugia, G. Duranti 95, 06132 Perugia, Italy
| | - Lorenzo Tomassoni
- Department of Engineering, University of Perugia, G. Duranti 95, 06132 Perugia, Italy
| | - Paolo Valigi
- Department of Engineering, University of Perugia, G. Duranti 95, 06132 Perugia, Italy
| |
Collapse
|
41
|
Bowyer JE, Chakravarti D, Wong WW, Bates DG. Mechanistic modelling of tyrosine recombination reveals key parameters determining the performance of a CAR T cell switching circuit. ENGINEERING BIOLOGY 2020; 4:10-19. [PMID: 36970230 PMCID: PMC9996713 DOI: 10.1049/enb.2019.0020] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2019] [Revised: 01/30/2020] [Accepted: 02/10/2020] [Indexed: 12/30/2022] Open
Abstract
Inducible genetic switches based on tyrosine recombinase-based DNA excision are a promising platform for the regulation and control of chimeric antigen receptor (CAR) T cell activity in cancer immunotherapy. These switches exploit the increased stability of DNA excision in tyrosine recombinases through an inversion-excision circuit design. Here, the authors develop the first mechanistic mathematical model of switching dynamics in tyrosine recombinases and validate it against experimental data through both global optimisation and statistical approximation approaches. Analysis of this model provides guidelines regarding which system parameters are best suited to experimental tuning in order to establish optimal switch performance in vivo. In particular, they find that the switching response can be made significantly faster by increasing the concentration of the inducer drug 4-OHT and/or by using promoters generating higher expression levels of the FlpO recombinase.
Collapse
Affiliation(s)
- Jack E. Bowyer
- School of Engineering University of Warwick Coventry CV4 7AL UK
- Warwick Integrative Synthetic Biology Centre Coventry CV4 7AL UK
| | - Deboki Chakravarti
- Department of Biomedical Engineering Boston University Boston MA 02215 USA
- Biological Design Center Boston University Boston MA 02215 USA
| | - Wilson W. Wong
- Department of Biomedical Engineering Boston University Boston MA 02215 USA
- Biological Design Center Boston University Boston MA 02215 USA
| | - Declan G. Bates
- School of Engineering University of Warwick Coventry CV4 7AL UK
- Warwick Integrative Synthetic Biology Centre Coventry CV4 7AL UK
| |
Collapse
|
42
|
Gupta S, Lee REC, Faeder JR. Parallel Tempering with Lasso for model reduction in systems biology. PLoS Comput Biol 2020; 16:e1007669. [PMID: 32150537 PMCID: PMC7082068 DOI: 10.1371/journal.pcbi.1007669] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2019] [Revised: 03/19/2020] [Accepted: 01/20/2020] [Indexed: 01/08/2023] Open
Abstract
Systems Biology models reveal relationships between signaling inputs and observable molecular or cellular behaviors. The complexity of these models, however, often obscures key elements that regulate emergent properties. We use a Bayesian model reduction approach that combines Parallel Tempering with Lasso regularization to identify minimal subsets of reactions in a signaling network that are sufficient to reproduce experimentally observed data. The Bayesian approach finds distinct reduced models that fit data equivalently. A variant of this approach that uses Lasso to perform selection at the level of reaction modules is applied to the NF-κB signaling network to test the necessity of feedback loops for responses to pulsatile and continuous pathway stimulation. Taken together, our results demonstrate that Bayesian parameter estimation combined with regularization can isolate and reveal core motifs sufficient to explain data from complex signaling systems. Cells respond to diverse environmental cues using complex networks of interacting proteins and other biomolecules. Mathematical and computational models have become invaluable tools to understand these networks and make informed predictions to rationally perturb cell behavior. However, the complexity of detailed models that try to capture all known biochemical elements of signaling networks often makes it difficult to determine the key regulatory elements that are responsible for specific cell behaviors. Here, we present a Bayesian computational approach, PTLasso, to automatically extract minimal subsets of detailed models that are sufficient to explain experimental data. The method simultaneously calibrates and reduces models, and the Bayesian approach samples globally, allowing us to find alternate mechanistic explanations for the data if present. We demonstrate the method on both synthetic and real biological data and show that PTLasso is an effective method to isolate distinct parts of a larger signaling model that are sufficient for specific data.
Collapse
Affiliation(s)
- Sanjana Gupta
- Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania, United States of America
| | - Robin E C Lee
- Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania, United States of America
| | - James R Faeder
- Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania, United States of America
| |
Collapse
|
43
|
Warne DJ, Baker RE, Simpson MJ. Simulation and inference algorithms for stochastic biochemical reaction networks: from basic concepts to state-of-the-art. J R Soc Interface 2020; 16:20180943. [PMID: 30958205 DOI: 10.1098/rsif.2018.0943] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Stochasticity is a key characteristic of intracellular processes such as gene regulation and chemical signalling. Therefore, characterizing stochastic effects in biochemical systems is essential to understand the complex dynamics of living things. Mathematical idealizations of biochemically reacting systems must be able to capture stochastic phenomena. While robust theory exists to describe such stochastic models, the computational challenges in exploring these models can be a significant burden in practice since realistic models are analytically intractable. Determining the expected behaviour and variability of a stochastic biochemical reaction network requires many probabilistic simulations of its evolution. Using a biochemical reaction network model to assist in the interpretation of time-course data from a biological experiment is an even greater challenge due to the intractability of the likelihood function for determining observation probabilities. These computational challenges have been subjects of active research for over four decades. In this review, we present an accessible discussion of the major historical developments and state-of-the-art computational techniques relevant to simulation and inference problems for stochastic biochemical reaction network models. Detailed algorithms for particularly important methods are described and complemented with Matlab® implementations. As a result, this review provides a practical and accessible introduction to computational methods for stochastic models within the life sciences community.
Collapse
Affiliation(s)
- David J Warne
- 1 School of Mathematical Sciences, Queensland University of Technology , Brisbane, Queensland 4001 , Australia
| | - Ruth E Baker
- 2 Mathematical Institute, University of Oxford , Oxford OX2 6GG , UK
| | - Matthew J Simpson
- 1 School of Mathematical Sciences, Queensland University of Technology , Brisbane, Queensland 4001 , Australia
| |
Collapse
|
44
|
Alahmadi AA, Flegg JA, Cochrane DG, Drovandi CC, Keith JM. A comparison of approximate versus exact techniques for Bayesian parameter inference in nonlinear ordinary differential equation models. ROYAL SOCIETY OPEN SCIENCE 2020; 7:191315. [PMID: 32269786 PMCID: PMC7137938 DOI: 10.1098/rsos.191315] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/08/2019] [Accepted: 01/27/2020] [Indexed: 05/05/2023]
Abstract
The behaviour of many processes in science and engineering can be accurately described by dynamical system models consisting of a set of ordinary differential equations (ODEs). Often these models have several unknown parameters that are difficult to estimate from experimental data, in which case Bayesian inference can be a useful tool. In principle, exact Bayesian inference using Markov chain Monte Carlo (MCMC) techniques is possible; however, in practice, such methods may suffer from slow convergence and poor mixing. To address this problem, several approaches based on approximate Bayesian computation (ABC) have been introduced, including Markov chain Monte Carlo ABC (MCMC ABC) and sequential Monte Carlo ABC (SMC ABC). While the system of ODEs describes the underlying process that generates the data, the observed measurements invariably include errors. In this paper, we argue that several popular ABC approaches fail to adequately model these errors because the acceptance probability depends on the choice of the discrepancy function and the tolerance without any consideration of the error term. We observe that the so-called posterior distributions derived from such methods do not accurately reflect the epistemic uncertainties in parameter values. Moreover, we demonstrate that these methods provide minimal computational advantages over exact Bayesian methods when applied to two ODE epidemiological models with simulated data and one with real data concerning malaria transmission in Afghanistan.
Collapse
Affiliation(s)
- Amani A. Alahmadi
- School of Mathematics, Monash University, Clayton, Victoria, Australia
- College of Science and Humanities, Shaqra University, Shaqra, Saudi Arabia
- Author for correspondence: Amani A. Alahmadi e-mail:
| | - Jennifer A. Flegg
- School of Mathematics and Statistics, University of Melbourne, Parkville, Victoria, Australia
| | - Davis G. Cochrane
- School of Mathematics, Monash University, Clayton, Victoria, Australia
| | - Christopher C. Drovandi
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Queensland, Australia
| | - Jonathan M. Keith
- School of Mathematics, Monash University, Clayton, Victoria, Australia
| |
Collapse
|
45
|
Lee D, Jayaraman A, Kwon JS. Identification of cell‐to‐cell heterogeneity through systems engineering approaches. AIChE J 2020. [DOI: 10.1002/aic.16925] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Affiliation(s)
- Dongheon Lee
- Artie McFerrin Department of Chemical EngineeringTexas A&M University Texas
| | - Arul Jayaraman
- Artie McFerrin Department of Chemical EngineeringTexas A&M University Texas
| | - Joseph S.‐I. Kwon
- Artie McFerrin Department of Chemical EngineeringTexas A&M University Texas
| |
Collapse
|
46
|
Rybiński M, Möller S, Sunnåker M, Lormeau C, Stelling J. TopoFilter: a MATLAB package for mechanistic model identification in systems biology. BMC Bioinformatics 2020; 21:34. [PMID: 31996136 PMCID: PMC6990465 DOI: 10.1186/s12859-020-3343-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Accepted: 01/08/2020] [Indexed: 12/27/2022] Open
Abstract
Background To develop mechanistic dynamic models in systems biology, one often needs to identify all (or minimal) representations of the biological processes that are consistent with experimental data, out of a potentially large set of hypothetical mechanisms. However, a simple enumeration of all alternatives becomes quickly intractable when the number of model parameters grows. Selecting appropriate dynamic models out of a large ensemble of models, taking the uncertainty in our biological knowledge and in the experimental data into account, is therefore a key current problem in systems biology. Results The TopoFilter package addresses this problem in a heuristic and automated fashion by implementing the previously described topological filtering method for Bayesian model selection. It includes a core heuristic for searching the space of submodels of a parametrized model, coupled with a sampling-based exploration of the parameter space. Recent developments of the method allow to balance exhaustiveness and speed of the model space search, to efficiently re-sample parameters, to parallelize the search, and to use custom scoring functions. We use a theoretical example to motivate these features and then demonstrate TopoFilter’s applicability for a yeast signaling network with more than 250’000 possible model structures. Conclusions TopoFilter is a flexible software framework that makes Bayesian model selection and reduction efficient and scalable to network models of a complexity that represents contemporary problems in, for example, cell signaling. TopoFilter is open-source, available under the GPL-3.0 license at https://gitlab.com/csb.ethz/TopoFilter. It includes installation instructions, a quickstart guide, a description of all package options, and multiple examples.
Collapse
Affiliation(s)
- Mikołaj Rybiński
- Department of Biosystems Science and Engineering and SIB Swiss Institute of Bioinformatics, ETH Zurich, Mattenstr. 26, Basel, 4058, Switzerland.,ID Scientific IT Services, ETH Zurich, Zurich, 8092, Switzerland
| | - Simon Möller
- Department of Biosystems Science and Engineering and SIB Swiss Institute of Bioinformatics, ETH Zurich, Mattenstr. 26, Basel, 4058, Switzerland
| | - Mikael Sunnåker
- Department of Biosystems Science and Engineering and SIB Swiss Institute of Bioinformatics, ETH Zurich, Mattenstr. 26, Basel, 4058, Switzerland
| | - Claude Lormeau
- Department of Biosystems Science and Engineering and SIB Swiss Institute of Bioinformatics, ETH Zurich, Mattenstr. 26, Basel, 4058, Switzerland.,Life Science Zurich Ph.D. program "Systems Biology", Zurich, 8092, Switzerland
| | - Jörg Stelling
- Department of Biosystems Science and Engineering and SIB Swiss Institute of Bioinformatics, ETH Zurich, Mattenstr. 26, Basel, 4058, Switzerland.
| |
Collapse
|
47
|
Xu J, Koelle S, Guttorp P, Wu C, Dunbar C, Abkowitz JL, Minin VN. Statistical inference for partially observed branching processes with application to cell lineage tracking of in vivo hematopoiesis. Ann Appl Stat 2019. [DOI: 10.1214/19-aoas1272] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
48
|
Tomczak JM, Węglarz‐Tomczak E. Estimating kinetic constants in the Michaelis–Menten model from one enzymatic assay using Approximate Bayesian Computation. FEBS Lett 2019; 593:2742-2750. [DOI: 10.1002/1873-3468.13531] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2019] [Revised: 06/05/2019] [Accepted: 06/27/2019] [Indexed: 01/04/2023]
Affiliation(s)
- Jakub M. Tomczak
- Institute of Informatics, Faculty of Science University of Amsterdam The Netherlands
| | - Ewelina Węglarz‐Tomczak
- Swammerdam Institute for Life Sciences, Faculty of Science University of Amsterdam The Netherlands
| |
Collapse
|
49
|
Stepien TL, Lynch HE, Yancey SX, Dempsey L, Davidson LA. Using a continuum model to decipher the mechanics of embryonic tissue spreading from time-lapse image sequences: An approximate Bayesian computation approach. PLoS One 2019; 14:e0218021. [PMID: 31246967 PMCID: PMC6597152 DOI: 10.1371/journal.pone.0218021] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2018] [Accepted: 05/24/2019] [Indexed: 11/18/2022] Open
Abstract
Advanced imaging techniques generate large datasets capable of describing the structure and kinematics of tissue spreading in embryonic development, wound healing, and the progression of many diseases. These datasets can be integrated with mathematical models to infer biomechanical properties of the system, typically identifying an optimal set of parameters for an individual experiment. However, these methods offer little information on the robustness of the fit and are generally ill-suited for statistical tests of multiple experiments. To overcome this limitation and enable efficient use of large datasets in a rigorous experimental design, we use the approximate Bayesian computation rejection algorithm to construct probability density distributions that estimate model parameters for a defined theoretical model and set of experimental data. Here, we demonstrate this method with a 2D Eulerian continuum mechanical model of spreading embryonic tissue. The model is tightly integrated with quantitative image analysis of different sized embryonic tissue explants spreading on extracellular matrix (ECM) and is regulated by a small set of parameters including forces on the free edge, tissue stiffness, strength of cell-ECM adhesions, and active cell shape changes. We find statistically significant trends in key parameters that vary with initial size of the explant, e.g., for larger explants cell-ECM adhesion forces are weaker and free edge forces are stronger. Furthermore, we demonstrate that estimated parameters for one explant can be used to predict the behavior of other similarly sized explants. These predictive methods can be used to guide further experiments to better understand how collective cell migration is regulated during development.
Collapse
Affiliation(s)
- Tracy L. Stepien
- Department of Mathematics, University of Arizona, Tucson, AZ, United States of America
- * E-mail: (LAD); (TLS); (HEL)
| | - Holley E. Lynch
- Department of Physics, Stetson University, DeLand, FL, United States of America
- * E-mail: (LAD); (TLS); (HEL)
| | - Shirley X. Yancey
- Department of Bioengineering, University of Pittsburgh, Pittsburgh, PA, United States of America
| | - Laura Dempsey
- Department of Bioengineering, University of Pittsburgh, Pittsburgh, PA, United States of America
| | - Lance A. Davidson
- Department of Bioengineering, University of Pittsburgh, Pittsburgh, PA, United States of America
- Department of Developmental Biology, University of Pittsburgh, Pittsburgh, PA, United States of America
- Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA, United States of America
- * E-mail: (LAD); (TLS); (HEL)
| |
Collapse
|
50
|
Loskot P, Atitey K, Mihaylova L. Comprehensive Review of Models and Methods for Inferences in Bio-Chemical Reaction Networks. Front Genet 2019; 10:549. [PMID: 31258548 PMCID: PMC6588029 DOI: 10.3389/fgene.2019.00549] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2019] [Accepted: 05/24/2019] [Indexed: 01/30/2023] Open
Abstract
The key processes in biological and chemical systems are described by networks of chemical reactions. From molecular biology to biotechnology applications, computational models of reaction networks are used extensively to elucidate their non-linear dynamics. The model dynamics are crucially dependent on the parameter values which are often estimated from observations. Over the past decade, the interest in parameter and state estimation in models of (bio-) chemical reaction networks (BRNs) grew considerably. The related inference problems are also encountered in many other tasks including model calibration, discrimination, identifiability, and checking, and optimum experiment design, sensitivity analysis, and bifurcation analysis. The aim of this review paper is to examine the developments in literature to understand what BRN models are commonly used, and for what inference tasks and inference methods. The initial collection of about 700 documents concerning estimation problems in BRNs excluding books and textbooks in computational biology and chemistry were screened to select over 270 research papers and 20 graduate research theses. The paper selection was facilitated by text mining scripts to automate the search for relevant keywords and terms. The outcomes are presented in tables revealing the levels of interest in different inference tasks and methods for given models in the literature as well as the research trends are uncovered. Our findings indicate that many combinations of models, tasks and methods are still relatively unexplored, and there are many new research opportunities to explore combinations that have not been considered-perhaps for good reasons. The most common models of BRNs in literature involve differential equations, Markov processes, mass action kinetics, and state space representations whereas the most common tasks are the parameter inference and model identification. The most common methods in literature are Bayesian analysis, Monte Carlo sampling strategies, and model fitting to data using evolutionary algorithms. The new research problems which cannot be directly deduced from the text mining data are also discussed.
Collapse
Affiliation(s)
- Pavel Loskot
- College of Engineering, Swansea University, Swansea, United Kingdom
| | - Komlan Atitey
- College of Engineering, Swansea University, Swansea, United Kingdom
| | - Lyudmila Mihaylova
- Department of Automatic Control and Systems Engineering, University of Sheffield, Sheffield, United Kingdom
| |
Collapse
|