1
|
Bayer D, Goldstein IH, Fintzi J, Lumbard K, Ricotta E, Warner S, Busch LM, Strich JR, Chertow DS, Parker DM, Boden-Albala B, Dratch A, Chhuon R, Quick N, Zahn M, Minin VM. Semi-parametric modeling of SARS-CoV-2 transmission using tests, cases, deaths, and seroprevalence data. ARXIV 2023:arXiv:2009.02654v3. [PMID: 32908946 PMCID: PMC7480029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Mechanistic models fit to streaming surveillance data are critical to understanding the transmission dynamics of an outbreak as it unfolds in real-time. However, transmission model parameter estimation can be imprecise, and sometimes even impossible, because surveillance data are noisy and not informative about all aspects of the mechanistic model. To partially overcome this obstacle, Bayesian models have been proposed to integrate multiple surveillance data streams. We devised a modeling framework for integrating SARS-CoV-2 diagnostics test and mortality time series data, as well as seroprevalence data from cross-sectional studies, and tested the importance of individual data streams for both inference and forecasting. Importantly, our model for incidence data accounts for changes in the total number of tests performed. We model the transmission rate, infection-to-fatality ratio, and a parameter controlling a functional relationship between the true case incidence and the fraction of positive tests as time-varying quantities and estimate changes of these parameters nonparametrically. We compare our base model against modified versions which do not use diagnostics test counts or seroprevalence data to demonstrate the utility of including these often unused data streams. We apply our Bayesian data integration method to COVID-19 surveillance data collected in Orange County, California between March 2020 and February 2021 and find that 32-72% of the Orange County residents experienced SARS-CoV-2 infection by mid-January, 2021. Despite this high number of infections, our results suggest that the abrupt end of the winter surge in January 2021 was due to both behavioral changes and a high level of accumulated natural immunity.
Collapse
Affiliation(s)
- Damon Bayer
- Department of Statistics, University of California, Irvine, California, U.S.A
| | - Isaac H. Goldstein
- Department of Statistics, University of California, Irvine, California, U.S.A
| | - Jonathan Fintzi
- Biostatistics Research Branch, National Institute of Allergy and Infectious Diseases, Rockville, Maryland, U.S.A
| | - Keith Lumbard
- Clinical Monitoring Research Program Directorate, Frederick National Laboratory for Cancer Research, Frederick, Maryland, U.S.A
| | - Emily Ricotta
- Epidemiology Unit, National Institute of Allergy and Infectious Diseases, Bethesda, Maryland, U.S.A
| | - Sarah Warner
- Critical Care Medicine Department, Clinical Center, National Institutes of Health, Bethesda, Maryland, U.S.A
| | - Lindsay M. Busch
- Division of Infectious Diseases, Emory University School of Medicine, Atlanta, Georgia, U.S.A
| | - Jeffrey R. Strich
- Critical Care Medicine Department, Clinical Center, National Institutes of Health, Bethesda, Maryland, U.S.A
| | - Daniel S. Chertow
- Critical Care Medicine Department, Clinical Center, National Institutes of Health, Bethesda, Maryland, U.S.A
| | - Daniel M. Parker
- Susan and Henry Samueli College of Health Sciences, University of California, Irvine, California, U.S.A
| | - Bernadette Boden-Albala
- Susan and Henry Samueli College of Health Sciences, University of California, Irvine, California, U.S.A
| | - Alissa Dratch
- Orange County Health Care Agency, Santa Ana, California, U.S.A
| | - Richard Chhuon
- Orange County Health Care Agency, Santa Ana, California, U.S.A
| | | | - Matthew Zahn
- Orange County Health Care Agency, Santa Ana, California, U.S.A
| | - Volodymyr M. Minin
- Department of Statistics, University of California, Irvine, California, U.S.A
| |
Collapse
|
2
|
Fintzi J, Wakefield J, Minin VN. A linear noise approximation for stochastic epidemic models fit to partially observed incidence counts. Biometrics 2022; 78:1530-1541. [PMID: 34374071 DOI: 10.1111/biom.13538] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2020] [Revised: 06/10/2021] [Accepted: 06/17/2021] [Indexed: 12/30/2022]
Abstract
Stochastic epidemic models (SEMs) fit to incidence data are critical to elucidating outbreak dynamics, shaping response strategies, and preparing for future epidemics. SEMs typically represent counts of individuals in discrete infection states using Markov jump processes (MJPs), but are computationally challenging as imperfect surveillance, lack of subject-level information, and temporal coarseness of the data obscure the true epidemic. Analytic integration over the latent epidemic process is impossible, and integration via Markov chain Monte Carlo (MCMC) is cumbersome due to the dimensionality and discreteness of the latent state space. Simulation-based computational approaches can address the intractability of the MJP likelihood, but are numerically fragile and prohibitively expensive for complex models. A linear noise approximation (LNA) that approximates the MJP transition density with a Gaussian density has been explored for analyzing prevalence data in large-population settings, but requires modification for analyzing incidence counts without assuming that the data are normally distributed. We demonstrate how to reparameterize SEMs to appropriately analyze incidence data, and fold the LNA into a data augmentation MCMC framework that outperforms deterministic methods, statistically, and simulation-based methods, computationally. Our framework is computationally robust when the model dynamics are complex and applies to a broad class of SEMs. We evaluate our method in simulations that reflect Ebola, influenza, and SARS-CoV-2 dynamics, and apply our method to national surveillance counts from the 2013-2015 West Africa Ebola outbreak.
Collapse
Affiliation(s)
- Jonathan Fintzi
- Biostatistics Research Branch, National Institute of Allergy and Infectious Diseases, Rockville, Maryland, USA
| | - Jon Wakefield
- Departments of Biostatistics and Statistics, University of Washington, Seattle, Washington, USA
| | - Vladimir N Minin
- Department of Statistics, University of California, Irvine, California, USA
| |
Collapse
|
3
|
Pooley CM, Doeschl-Wilson AB, Marion G. Estimation of age-stratified contact rates during the COVID-19 pandemic using a novel inference algorithm. PHILOSOPHICAL TRANSACTIONS. SERIES A, MATHEMATICAL, PHYSICAL, AND ENGINEERING SCIENCES 2022; 380:20210298. [PMID: 35965466 PMCID: PMC9376725 DOI: 10.1098/rsta.2021.0298] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/20/2021] [Accepted: 03/10/2022] [Indexed: 05/08/2023]
Abstract
Well parameterized epidemiological models including accurate representation of contacts are fundamental to controlling epidemics. However, age-stratified contacts are typically estimated from pre-pandemic/peace-time surveys, even though interventions and public response likely alter contacts. Here, we fit age-stratified models, including re-estimation of relative contact rates between age classes, to public data describing the 2020-2021 COVID-19 outbreak in England. This data includes age-stratified population size, cases, deaths, hospital admissions and results from the Coronavirus Infection Survey (almost 9000 observations in all). Fitting stochastic compartmental models to such detailed data is extremely challenging, especially considering the large number of model parameters being estimated (over 150). An efficient new inference algorithm ABC-MBP combining existing approximate Bayesian computation (ABC) methodology with model-based proposals (MBPs) is applied. Modified contact rates are inferred alongside time-varying reproduction numbers that quantify changes in overall transmission due to pandemic response, and age-stratified proportions of asymptomatic cases, hospitalization rates and deaths. These inferences are robust to a range of assumptions including the values of parameters that cannot be estimated from available data. ABC-MBP is shown to enable reliable joint analysis of complex epidemiological data yielding consistent parametrization of dynamic transmission models that can inform data-driven public health policy and interventions. This article is part of the theme issue 'Technical challenges of modelling real-life epidemics and examples of overcoming these'.
Collapse
Affiliation(s)
- Christopher M. Pooley
- Biomathematics and Statistics Scotland, James Clerk Maxwell Building, The King's Buildings, Peter Guthrie Tait Road, Edinburgh EH9 3FD, UK
| | | | - Glenn Marion
- Biomathematics and Statistics Scotland, James Clerk Maxwell Building, The King's Buildings, Peter Guthrie Tait Road, Edinburgh EH9 3FD, UK
| |
Collapse
|
4
|
Swallow B, Birrell P, Blake J, Burgman M, Challenor P, Coffeng LE, Dawid P, De Angelis D, Goldstein M, Hemming V, Marion G, McKinley TJ, Overton CE, Panovska-Griffiths J, Pellis L, Probert W, Shea K, Villela D, Vernon I. Challenges in estimation, uncertainty quantification and elicitation for pandemic modelling. Epidemics 2022; 38:100547. [PMID: 35180542 PMCID: PMC7612598 DOI: 10.1016/j.epidem.2022.100547] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2021] [Revised: 12/22/2021] [Accepted: 02/09/2022] [Indexed: 12/15/2022] Open
Abstract
The estimation of parameters and model structure for informing infectious disease response has become a focal point of the recent pandemic. However, it has also highlighted a plethora of challenges remaining in the fast and robust extraction of information using data and models to help inform policy. In this paper, we identify and discuss four broad challenges in the estimation paradigm relating to infectious disease modelling, namely the Uncertainty Quantification framework, data challenges in estimation, model-based inference and prediction, and expert judgement. We also postulate priorities in estimation methodology to facilitate preparation for future pandemics.
Collapse
Affiliation(s)
- Ben Swallow
- School of Mathematics and Statistics, University of Glasgow, Glasgow, UK; Scottish COVID-19 Response Consortium, UK.
| | - Paul Birrell
- Analytics & Data Science, UKHSA, UK; MRC Biostatistics Unit, University of Cambridge, Cambridge, UK
| | - Joshua Blake
- MRC Biostatistics Unit, University of Cambridge, Cambridge, UK
| | - Mark Burgman
- Centre for Environmental Policy, Imperial College London, London, UK
| | - Peter Challenor
- The Alan Turing Institute, London, UK; College of Engineering, Mathematics and Physical Sciences, University of Exeter, Exeter, UK
| | - Luc E Coffeng
- Department of Public Health, Erasmus MC, University Medical Center Rotterdam, Rotterdam, The Netherlands
| | - Philip Dawid
- Statistical Laboratory, University of Cambridge, Cambridge, UK
| | - Daniela De Angelis
- MRC Biostatistics Unit, University of Cambridge, Cambridge, UK; Joint UNIversities Pandemic and Epidemiological Research, UK
| | - Michael Goldstein
- Department of Mathematical Sciences, Durham University, Stockton Road, Durham, UK
| | - Victoria Hemming
- Department of Forest and Conservation Sciences, University of British Columbia, Vancouver, Canada
| | - Glenn Marion
- Scottish COVID-19 Response Consortium, UK; Biomathematics and Statistics Scotland, Edinburgh, UK
| | - Trevelyan J McKinley
- College of Medicine and Health, University of Exeter, Exeter, UK; Joint UNIversities Pandemic and Epidemiological Research, UK
| | - Christopher E Overton
- Department of Mathematics, University of Manchester, Manchester, UK; Clinical Data Science Unit, Manchester University NHS Foundation Trust, Manchester, UK; Joint UNIversities Pandemic and Epidemiological Research, UK
| | - Jasmina Panovska-Griffiths
- The Big Data Institute, University of Oxford, Oxford, UK; The Queen's College, University of Oxford, Oxford, UK
| | - Lorenzo Pellis
- Department of Mathematics, University of Manchester, Manchester, UK; Joint UNIversities Pandemic and Epidemiological Research, UK; The Alan Turing Institute, London, UK
| | - Will Probert
- The Big Data Institute, University of Oxford, Oxford, UK
| | - Katriona Shea
- Department of Biology and Centre for Infectious Disease Dynamics, The Pennsylvania State University, PA 16802, USA
| | - Daniel Villela
- Program of Scientific Computing, Fundação Oswaldo Cruz, Rio de Janeiro, Brazil
| | - Ian Vernon
- Department of Mathematical Sciences, Durham University, Stockton Road, Durham, UK
| |
Collapse
|
5
|
Nguyen-Van-Yen B, Del Moral P, Cazelles B. Stochastic Epidemic Models inference and diagnosis with Poisson Random Measure Data Augmentation. Math Biosci 2021; 335:108583. [PMID: 33713696 DOI: 10.1016/j.mbs.2021.108583] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2020] [Revised: 12/22/2020] [Accepted: 02/28/2021] [Indexed: 11/24/2022]
Abstract
We present a new Bayesian inference method for compartmental models that takes into account the intrinsic stochasticity of the process. We show how to formulate a SIR-type Markov jump process as the solution of a stochastic differential equation with respect to a Poisson Random Measure (PRM), and how to simulate the process trajectory deterministically from a parameter value and a PRM realization. This forms the basis of our Data Augmented MCMC, which consists of augmenting parameter space with the unobserved PRM value. The resulting simple Metropolis-Hastings sampler acts as an efficient simulation-based inference method, that can easily be transferred from model to model. Compared with a recent Data Augmentation method based on Gibbs sampling of individual infection histories, PRM-augmented MCMC scales much better with epidemic size and is far more flexible. It is also found to be competitive with Particle MCMC for moderate epidemics when using approximate simulations. PRM-augmented MCMC also yields a posteriori estimates of the PRM, that represent process stochasticity, and which can be used to validate the model. A pattern of deviation from the PRM prior distribution will indicate that the model underfits the data and help to understand the cause. We illustrate this by fitting a non-seasonal model to some simulated seasonal case count data. Applied to the Zika epidemic of 2013 in French Polynesia, our approach shows that a simple SEIR model cannot correctly reproduce both the initial sharp increase in the number of cases as well as the final proportion of seropositive. PRM augmentation thus provides a coherent story for Stochastic Epidemic Model inference, where explicitly inferring process stochasticity helps with model validation.
Collapse
Affiliation(s)
- Benjamin Nguyen-Van-Yen
- Institut Pasteur, Unité de Génétique Fonctionnelle des Maladies Infectieuses, UMR 2000 CNRS, Paris, France; Institut de Biologie de l'ENS (IBENS), Ecole Normale Supérieure, CNRS, INSERM, Université PSL, 75005 Paris, France.
| | | | - Bernard Cazelles
- Institut de Biologie de l'ENS (IBENS), Ecole Normale Supérieure, CNRS, INSERM, Université PSL, 75005 Paris, France; International Center for Mathematical and Computational Modeling of Complex Systems (UMMISCO), UMI 209, Sorbonne Université, France; iGLOBE, UMI CNRS 3157, University of Arizona, Tucson, AZ, United States of America
| |
Collapse
|
6
|
Browning AP, Warne DJ, Burrage K, Baker RE, Simpson MJ. Identifiability analysis for stochastic differential equation models in systems biology. J R Soc Interface 2020; 17:20200652. [PMID: 33323054 PMCID: PMC7811582 DOI: 10.1098/rsif.2020.0652] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2020] [Accepted: 11/24/2020] [Indexed: 12/26/2022] Open
Abstract
Mathematical models are routinely calibrated to experimental data, with goals ranging from building predictive models to quantifying parameters that cannot be measured. Whether or not reliable parameter estimates are obtainable from the available data can easily be overlooked. Such issues of parameter identifiability have important ramifications for both the predictive power of a model, and the mechanistic insight that can be obtained. Identifiability analysis is well-established for deterministic, ordinary differential equation (ODE) models, but there are no commonly adopted methods for analysing identifiability in stochastic models. We provide an accessible introduction to identifiability analysis and demonstrate how existing ideas for analysis of ODE models can be applied to stochastic differential equation (SDE) models through four practical case studies. To assess structural identifiability, we study ODEs that describe the statistical moments of the stochastic process using open-source software tools. Using practically motivated synthetic data and Markov chain Monte Carlo methods, we assess parameter identifiability in the context of available data. Our analysis shows that SDE models can often extract more information about parameters than deterministic descriptions. All code used to perform the analysis is available on Github.
Collapse
Affiliation(s)
- Alexander P. Browning
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Australia
- ARC Centre of Excellence for Mathematical and Statistical Frontiers, Queensland University of Technology, Brisbane, Australia
| | - David J. Warne
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Australia
- ARC Centre of Excellence for Mathematical and Statistical Frontiers, Queensland University of Technology, Brisbane, Australia
| | - Kevin Burrage
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Australia
- ARC Centre of Excellence for Mathematical and Statistical Frontiers, Queensland University of Technology, Brisbane, Australia
- ARC Centre of Excellence for Plant Success in Nature and Agriculture, Queensland University of Technology, Brisbane, Australia
- Department of Computer Science, University of Oxford, Oxford, UK
| | - Ruth E. Baker
- Mathematical Institute, University of Oxford, Oxford, UK
| | - Matthew J. Simpson
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Australia
- ARC Centre of Excellence for Mathematical and Statistical Frontiers, Queensland University of Technology, Brisbane, Australia
| |
Collapse
|
7
|
Bu F, Aiello AE, Xu J, Volfovsky A. Likelihood-Based Inference for Partially Observed Epidemics on Dynamic Networks. J Am Stat Assoc 2020. [DOI: 10.1080/01621459.2020.1790376] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Affiliation(s)
- Fan Bu
- Department of Statistical Science, Duke University, Durham, NC
| | - Allison E. Aiello
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC
| | - Jason Xu
- Department of Statistical Science, Duke University, Durham, NC
| | | |
Collapse
|
8
|
Warne DJ, Baker RE, Simpson MJ. A practical guide to pseudo-marginal methods for computational inference in systems biology. J Theor Biol 2020; 496:110255. [PMID: 32223995 DOI: 10.1016/j.jtbi.2020.110255] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2020] [Revised: 03/11/2020] [Accepted: 03/18/2020] [Indexed: 01/07/2023]
Abstract
For many stochastic models of interest in systems biology, such as those describing biochemical reaction networks, exact quantification of parameter uncertainty through statistical inference is intractable. Likelihood-free computational inference techniques enable parameter inference when the likelihood function for the model is intractable but the generation of many sample paths is feasible through stochastic simulation of the forward problem. The most common likelihood-free method in systems biology is approximate Bayesian computation that accepts parameters that result in low discrepancy between stochastic simulations and measured data. However, it can be difficult to assess how the accuracy of the resulting inferences are affected by the choice of acceptance threshold and discrepancy function. The pseudo-marginal approach is an alternative likelihood-free inference method that utilises a Monte Carlo estimate of the likelihood function. This approach has several advantages, particularly in the context of noisy, partially observed, time-course data typical in biochemical reaction network studies. Specifically, the pseudo-marginal approach facilitates exact inference and uncertainty quantification, and may be efficiently combined with particle filters for low variance, high-accuracy likelihood estimation. In this review, we provide a practical introduction to the pseudo-marginal approach using inference for biochemical reaction networks as a series of case studies. Implementations of key algorithms and examples are provided using the Julia programming language; a high performance, open source programming language for scientific computing (https://github.com/davidwarne/Warne2019_GuideToPseudoMarginal).
Collapse
Affiliation(s)
- David J Warne
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Queensland 4001, Australia.
| | - Ruth E Baker
- Mathematical Institute, University of Oxford, Oxford, OX2 6GG, United Kingdom
| | - Matthew J Simpson
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Queensland 4001, Australia
| |
Collapse
|
9
|
Warne DJ, Baker RE, Simpson MJ. Simulation and inference algorithms for stochastic biochemical reaction networks: from basic concepts to state-of-the-art. J R Soc Interface 2020; 16:20180943. [PMID: 30958205 DOI: 10.1098/rsif.2018.0943] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Stochasticity is a key characteristic of intracellular processes such as gene regulation and chemical signalling. Therefore, characterizing stochastic effects in biochemical systems is essential to understand the complex dynamics of living things. Mathematical idealizations of biochemically reacting systems must be able to capture stochastic phenomena. While robust theory exists to describe such stochastic models, the computational challenges in exploring these models can be a significant burden in practice since realistic models are analytically intractable. Determining the expected behaviour and variability of a stochastic biochemical reaction network requires many probabilistic simulations of its evolution. Using a biochemical reaction network model to assist in the interpretation of time-course data from a biological experiment is an even greater challenge due to the intractability of the likelihood function for determining observation probabilities. These computational challenges have been subjects of active research for over four decades. In this review, we present an accessible discussion of the major historical developments and state-of-the-art computational techniques relevant to simulation and inference problems for stochastic biochemical reaction network models. Detailed algorithms for particularly important methods are described and complemented with Matlab® implementations. As a result, this review provides a practical and accessible introduction to computational methods for stochastic models within the life sciences community.
Collapse
Affiliation(s)
- David J Warne
- 1 School of Mathematical Sciences, Queensland University of Technology , Brisbane, Queensland 4001 , Australia
| | - Ruth E Baker
- 2 Mathematical Institute, University of Oxford , Oxford OX2 6GG , UK
| | - Matthew J Simpson
- 1 School of Mathematical Sciences, Queensland University of Technology , Brisbane, Queensland 4001 , Australia
| |
Collapse
|
10
|
Pooley CM, Bishop SC, Doeschl-Wilson A, Marion G. Posterior-based proposals for speeding up Markov chain Monte Carlo. ROYAL SOCIETY OPEN SCIENCE 2019; 6:190619. [PMID: 31827823 PMCID: PMC6894579 DOI: 10.1098/rsos.190619] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/13/2019] [Accepted: 10/23/2019] [Indexed: 05/23/2023]
Abstract
Markov chain Monte Carlo (MCMC) is widely used for Bayesian inference in models of complex systems. Performance, however, is often unsatisfactory in models with many latent variables due to so-called poor mixing, necessitating the development of application-specific implementations. This paper introduces 'posterior-based proposals' (PBPs), a new type of MCMC update applicable to a huge class of statistical models (whose conditional dependence structures are represented by directed acyclic graphs). PBPs generate large joint updates in parameter and latent variable space, while retaining good acceptance rates (typically 33%). Evaluation against other approaches (from standard Gibbs/random walk updates to state-of-the-art Hamiltonian and particle MCMC methods) was carried out for widely varying model types: an individual-based model for disease diagnostic test data, a financial stochastic volatility model, a mixed model used in statistical genetics and a population model used in ecology. While different methods worked better or worse in different scenarios, PBPs were found to be either near to the fastest or significantly faster than the next best approach (by up to a factor of 10). PBPs, therefore, represent an additional general purpose technique that can be usefully applied in a wide variety of contexts.
Collapse
Affiliation(s)
- C. M. Pooley
- The Roslin Institute, The University of Edinburgh, Midlothian EH25 9RG, UK
- Biomathematics and Statistics Scotland, James Clerk Maxwell Building, The King's Buildings, Peter Guthrie Tait Road, Edinburgh EH9 3FD, UK
| | - S. C. Bishop
- The Roslin Institute, The University of Edinburgh, Midlothian EH25 9RG, UK
| | - A. Doeschl-Wilson
- The Roslin Institute, The University of Edinburgh, Midlothian EH25 9RG, UK
| | - G. Marion
- Biomathematics and Statistics Scotland, James Clerk Maxwell Building, The King's Buildings, Peter Guthrie Tait Road, Edinburgh EH9 3FD, UK
| |
Collapse
|
11
|
Walker JN, Black AJ, Ross JV. Bayesian model discrimination for partially-observed epidemic models. Math Biosci 2019; 317:108266. [PMID: 31589881 DOI: 10.1016/j.mbs.2019.108266] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2019] [Revised: 08/22/2019] [Accepted: 09/27/2019] [Indexed: 10/25/2022]
Abstract
An efficient method for Bayesian model selection is presented for a broad class of continuous-time Markov chain models and is subsequently applied to two important problems in epidemiology. The first problem is to identify the shape of the infectious period distribution; the second problem is to determine whether individuals display symptoms before, at the same time, or after they become infectious. In both cases we show that the correct model can be identified, in the majority of cases, from symptom onset data generated from multiple outbreaks in small populations. The method works by evaluating the likelihood using a particle filter that incorporates a novel importance sampling algorithm designed for partially-observed continuous-time Markov chains. This is combined with another importance sampling method to unbiasedly estimate the model evidence. These come with estimates of precision, which allow for stopping criterion to be employed. Our method is general and can be applied to a wide range of model selection problems in biological and epidemiological systems with intractable likelihood functions.
Collapse
Affiliation(s)
- James N Walker
- School of Mathematical Sciences, University of Adelaide, Adelaide, SA 5005, Australia; ACEMS, School of Mathematical Sciences, University of Adelaide, Adelaide, SA 5005, Australia.
| | - Andrew J Black
- School of Mathematical Sciences, University of Adelaide, Adelaide, SA 5005, Australia; ACEMS, School of Mathematical Sciences, University of Adelaide, Adelaide, SA 5005, Australia
| | - Joshua V Ross
- School of Mathematical Sciences, University of Adelaide, Adelaide, SA 5005, Australia; ACEMS, School of Mathematical Sciences, University of Adelaide, Adelaide, SA 5005, Australia
| |
Collapse
|
12
|
Warne DJ, Baker RE, Simpson MJ. Multilevel rejection sampling for approximate Bayesian computation. Comput Stat Data Anal 2018. [DOI: 10.1016/j.csda.2018.02.009] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
|
13
|
Walker JN, Ross JV, Black AJ. Inference of epidemiological parameters from household stratified data. PLoS One 2017; 12:e0185910. [PMID: 29045456 PMCID: PMC5646782 DOI: 10.1371/journal.pone.0185910] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2017] [Accepted: 09/21/2017] [Indexed: 11/18/2022] Open
Abstract
We consider a continuous-time Markov chain model of SIR disease dynamics with two levels of mixing. For this so-called stochastic households model, we provide two methods for inferring the model parameters-governing within-household transmission, recovery, and between-household transmission-from data of the day upon which each individual became infectious and the household in which each infection occurred, as might be available from First Few Hundred studies. Each method is a form of Bayesian Markov Chain Monte Carlo that allows us to calculate a joint posterior distribution for all parameters and hence the household reproduction number and the early growth rate of the epidemic. The first method performs exact Bayesian inference using a standard data-augmentation approach; the second performs approximate Bayesian inference based on a likelihood approximation derived from branching processes. These methods are compared for computational efficiency and posteriors from each are compared. The branching process is shown to be a good approximation and remains computationally efficient as the amount of data is increased.
Collapse
Affiliation(s)
- James N. Walker
- Stochastic Modelling and Operations Research Group, School of Mathematical Sciences, University of Adelaide, Adelaide, SA 5005, Australia
- ACEMS, School of Mathematical Sciences, University of Adelaide, Adelaide, SA 5005, Australia
| | - Joshua V. Ross
- Stochastic Modelling and Operations Research Group, School of Mathematical Sciences, University of Adelaide, Adelaide, SA 5005, Australia
- ACEMS, School of Mathematical Sciences, University of Adelaide, Adelaide, SA 5005, Australia
| | - Andrew J. Black
- Stochastic Modelling and Operations Research Group, School of Mathematical Sciences, University of Adelaide, Adelaide, SA 5005, Australia
- ACEMS, School of Mathematical Sciences, University of Adelaide, Adelaide, SA 5005, Australia
- * E-mail:
| |
Collapse
|
14
|
Fintzi J, Cui X, Wakefield J, Minin VN. Efficient Data Augmentation for Fitting Stochastic Epidemic Models to Prevalence Data. J Comput Graph Stat 2017; 26:918-929. [PMID: 30515026 PMCID: PMC6275108 DOI: 10.1080/10618600.2017.1328365] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2016] [Revised: 02/01/2017] [Indexed: 10/19/2022]
Abstract
Stochastic epidemic models describe the dynamics of an epidemic as a disease spreads through a population. Typically, only a fraction of cases are observed at a set of discrete times. The absence of complete information about the time evolution of an epidemic gives rise to a complicated latent variable problem in which the state space size of the epidemic grows large as the population size increases. This makes analytically integrating over the missing data infeasible for populations of even moderate size. We present a data augmentation Markov chain Monte Carlo (MCMC) framework for Bayesian estimation of stochastic epidemic model parameters, in which measurements are augmented with subject-level disease histories. In our MCMC algorithm, we propose each new subject-level path, conditional on the data, using a time-inhomogeneous continuous-time Markov process with rates determined by the infection histories of other individuals. The method is general, and may be applied to a broad class of epidemic models with only minimal modifications to the model dynamics and/or emission distribution. We present our algorithm in the context of multiple stochastic epidemic models in which the data are binomially sampled prevalence counts, and apply our method to data from an outbreak of influenza in a British boarding school.
Collapse
Affiliation(s)
- Jonathan Fintzi
- Department of Biostatistics, University of Washington, Seattle
| | - Xiang Cui
- Department of Statistics, University of Washington, Seattle
| | - Jon Wakefield
- Department of Biostatistics, University of Washington, Seattle
- Department of Statistics, University of Washington, Seattle
| | - Vladimir N Minin
- Department of Statistics, University of Washington, Seattle
- Department of Biology, University of Washington, Seattle
| |
Collapse
|