1
|
Andronico A, Paireau J, Cauchemez S. Integrating information from historical data into mechanistic models for influenza forecasting. PLoS Comput Biol 2024; 20:e1012523. [PMID: 39475955 PMCID: PMC11524484 DOI: 10.1371/journal.pcbi.1012523] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Accepted: 09/27/2024] [Indexed: 11/02/2024] Open
Abstract
Seasonal influenza causes significant annual morbidity and mortality worldwide. In France, it is estimated that, on average, 2 million individuals consult their GP for influenza-like-illness (ILI) every year. Traditionally, mathematical models used for epidemic forecasting can either include parameters capturing the infection process (mechanistic or compartmental models) or rely on time series analysis approaches that do not make mechanistic assumptions (statistical or phenomenological models). While the latter make extensive use of past epidemic data, mechanistic models are usually independently initialized in each season. As a result, forecasts from such models can contain trajectories that are vastly different from past epidemics. We developed a mechanistic model that takes into account epidemic data from training seasons when producing forecasts. The parameters of the model are estimated via a first particle filter running on the observed data. A second particle filter is then used to produce forecasts compatible with epidemic trajectories from the training set. The model was calibrated and tested on 35 years' worth of surveillance data from the French Sentinelles Network, representing the weekly number of patients consulting for ILI over the period 1985-2019. Our results show that the new method improves upon standard mechanistic approaches. In particular, when retrospectively tested on the available data, our model provides increased accuracy for short-term forecasts (from one to four weeks into the future) and peak timing and intensity. Our new approach for epidemic forecasting allows the integration of key strengths of the statistical approach into the mechanistic modelling framework and represents an attempt to provide accurate forecasts by making full use of the rich surveillance dataset collected in France since 1985.
Collapse
Affiliation(s)
- Alessio Andronico
- Mathematical Modelling of Infectious Diseases Unit, Institut Pasteur, Université Paris Cité, UMR2000 CNRS, Paris, France
| | - Juliette Paireau
- Mathematical Modelling of Infectious Diseases Unit, Institut Pasteur, Université Paris Cité, UMR2000 CNRS, Paris, France
- Infectious Diseases Department, Santé publique France, Saint-Maurice, France
| | - Simon Cauchemez
- Mathematical Modelling of Infectious Diseases Unit, Institut Pasteur, Université Paris Cité, UMR2000 CNRS, Paris, France
| |
Collapse
|
2
|
de Jong SP, Conlan A, Han AX, Russell CA. Commuting-driven competition between transmission chains shapes seasonal influenza virus epidemics in the United States. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.08.09.24311720. [PMID: 39148829 PMCID: PMC11326338 DOI: 10.1101/2024.08.09.24311720] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 08/17/2024]
Abstract
Despite intensive study, much remains unknown about the dynamics of seasonal influenza virus epidemic establishment and spread in the United States (US) each season. By reconstructing transmission lineages from seasonal influenza virus genomes collected in the US from 2014 to 2023, we show that most epidemics consisted of multiple distinct transmission lineages. Spread of these lineages exhibited strong spatiotemporal hierarchies and lineage size was correlated with timing of lineage establishment in the US. Mechanistic epidemic simulations suggest that mobility-driven competition between lineages determined the extent of individual lineages' geographical spread. Based on phylogeographic analyses and epidemic simulations, lineage-specific movement patterns were dominated by human commuting behavior. These results suggest that given the locations of early-season epidemic sparks, the topology of inter-state human mobility yields repeatable patterns of which influenza viruses will circulate where, but the importance of short-term processes limits predictability of regional and national epidemics.
Collapse
Affiliation(s)
- Simon P.J. de Jong
- Department of Medical Microbiology & Infection Prevention, Amsterdam University Medical Centers, University of Amsterdam; Amsterdam, The Netherlands
| | - Andrew Conlan
- Department of Veterinary Medicine, University of Cambridge; Cambridge, United Kingdom
| | - Alvin X. Han
- Department of Medical Microbiology & Infection Prevention, Amsterdam University Medical Centers, University of Amsterdam; Amsterdam, The Netherlands
| | - Colin A. Russell
- Department of Medical Microbiology & Infection Prevention, Amsterdam University Medical Centers, University of Amsterdam; Amsterdam, The Netherlands
| |
Collapse
|
3
|
Turtle J, Ben-Nun M, Riley P. Enhancing seasonal influenza projections: A mechanistic metapopulation model for long-term scenario planning. Epidemics 2024; 47:100758. [PMID: 38574441 DOI: 10.1016/j.epidem.2024.100758] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Revised: 11/17/2023] [Accepted: 02/26/2024] [Indexed: 04/06/2024] Open
Abstract
In temperate regions, annual preparation by public health officials for seasonal influenza requires early-season long-term projections. These projections are different from short-term (e.g., 1-4 weeks ahead) forecasts that are typically updated weekly. Whereas short-term forecasts estimate what "will" likely happen in the near term, the goal of scenario projections is to guide long-term decision-making using "what if" scenarios. We developed a mechanistic metapopulation model and used it to provide long-term influenza projections to the Flu Scenario Modeling Hub. The scenarios differed in their assumptions about influenza vaccine effectiveness and prior immunity. The parameters of the model were inferred from early season hospitalization data and then simulated forward in time until June 3, 2023. We submitted two rounds of projections (mid-November and early December), with the second round being a repeat of the first with three more weeks of data (and consequently different model parameters). In this study, we describe the model, its calibration, and projections targets. The scenario projection outcomes for two rounds are compared with each other at state and national level reported daily hospitalizations. We show that although Rounds 2 and 3 were identical in definition, the addition of three weeks of data produced an improvement to model fits. These changes resulted in earlier projections for peak incidence, lower projections for peak magnitude and relatively small changes to cumulative projections. In both rounds, all four scenarios presented conceivable outcomes, with some scenarios agreeing well with observations. We discuss how to interpret this agreement, emphasizing that this does not imply that one scenario or another provides the ground truth. Our model's performance suggests that its underlying assumptions provided plausible bounds for what could happen during an influenza season following two seasons of low circulation. We suggest that such projections would provide actionable estimates for public health officials.
Collapse
Affiliation(s)
- James Turtle
- Infectious Disease Group, Predictive Science Inc., San Diego, CA, United States.
| | - Michal Ben-Nun
- Infectious Disease Group, Predictive Science Inc., San Diego, CA, United States
| | - Pete Riley
- Infectious Disease Group, Predictive Science Inc., San Diego, CA, United States
| |
Collapse
|
4
|
Lopez VK, Cramer EY, Pagano R, Drake JM, O’Dea EB, Adee M, Ayer T, Chhatwal J, Dalgic OO, Ladd MA, Linas BP, Mueller PP, Xiao J, Bracher J, Castro Rivadeneira AJ, Gerding A, Gneiting T, Huang Y, Jayawardena D, Kanji AH, Le K, Mühlemann A, Niemi J, Ray EL, Stark A, Wang Y, Wattanachit N, Zorn MW, Pei S, Shaman J, Yamana TK, Tarasewicz SR, Wilson DJ, Baccam S, Gurung H, Stage S, Suchoski B, Gao L, Gu Z, Kim M, Li X, Wang G, Wang L, Wang Y, Yu S, Gardner L, Jindal S, Marshall M, Nixon K, Dent J, Hill AL, Kaminsky J, Lee EC, Lemaitre JC, Lessler J, Smith CP, Truelove S, Kinsey M, Mullany LC, Rainwater-Lovett K, Shin L, Tallaksen K, Wilson S, Karlen D, Castro L, Fairchild G, Michaud I, Osthus D, Bian J, Cao W, Gao Z, Lavista Ferres J, Li C, Liu TY, Xie X, Zhang S, Zheng S, Chinazzi M, Davis JT, Mu K, Pastore y Piontti A, Vespignani A, Xiong X, Walraven R, Chen J, Gu Q, Wang L, Xu P, Zhang W, Zou D, Gibson GC, Sheldon D, Srivastava A, Adiga A, Hurt B, Kaur G, Lewis B, Marathe M, Peddireddy AS, Porebski P, Venkatramanan S, Wang L, Prasad PV, Walker JW, Webber AE, Slayton RB, Biggerstaff M, Reich NG, Johansson MA. Challenges of COVID-19 Case Forecasting in the US, 2020-2021. PLoS Comput Biol 2024; 20:e1011200. [PMID: 38709852 PMCID: PMC11098513 DOI: 10.1371/journal.pcbi.1011200] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Revised: 05/16/2024] [Accepted: 04/01/2024] [Indexed: 05/08/2024] Open
Abstract
During the COVID-19 pandemic, forecasting COVID-19 trends to support planning and response was a priority for scientists and decision makers alike. In the United States, COVID-19 forecasting was coordinated by a large group of universities, companies, and government entities led by the Centers for Disease Control and Prevention and the US COVID-19 Forecast Hub (https://covid19forecasthub.org). We evaluated approximately 9.7 million forecasts of weekly state-level COVID-19 cases for predictions 1-4 weeks into the future submitted by 24 teams from August 2020 to December 2021. We assessed coverage of central prediction intervals and weighted interval scores (WIS), adjusting for missing forecasts relative to a baseline forecast, and used a Gaussian generalized estimating equation (GEE) model to evaluate differences in skill across epidemic phases that were defined by the effective reproduction number. Overall, we found high variation in skill across individual models, with ensemble-based forecasts outperforming other approaches. Forecast skill relative to the baseline was generally higher for larger jurisdictions (e.g., states compared to counties). Over time, forecasts generally performed worst in periods of rapid changes in reported cases (either in increasing or decreasing epidemic phases) with 95% prediction interval coverage dropping below 50% during the growth phases of the winter 2020, Delta, and Omicron waves. Ideally, case forecasts could serve as a leading indicator of changes in transmission dynamics. However, while most COVID-19 case forecasts outperformed a naïve baseline model, even the most accurate case forecasts were unreliable in key phases. Further research could improve forecasts of leading indicators, like COVID-19 cases, by leveraging additional real-time data, addressing performance across phases, improving the characterization of forecast confidence, and ensuring that forecasts were coherent across spatial scales. In the meantime, it is critical for forecast users to appreciate current limitations and use a broad set of indicators to inform pandemic-related decision making.
Collapse
Affiliation(s)
- Velma K. Lopez
- COVID-19 Response, Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America
| | - Estee Y. Cramer
- University of Massachusetts, Amherst, Amherst, Massachusetts, United States of America
| | - Robert Pagano
- Unaffiliated, Tucson, Arizona, United States of America
| | - John M. Drake
- University of Georgia, Athens, Georgia, United States of America
| | - Eamon B. O’Dea
- University of Georgia, Athens, Georgia, United States of America
| | - Madeline Adee
- Massachusetts General Hospital, Boston, Massachusetts, United States of America
| | - Turgay Ayer
- Georgia Institute of Technology, Atlanta, Georgia, United States of America
| | - Jagpreet Chhatwal
- Massachusetts General Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Ozden O. Dalgic
- Value Analytics Labs, Boston, Massachusetts, United States of America
| | - Mary A. Ladd
- Massachusetts General Hospital, Boston, Massachusetts, United States of America
| | - Benjamin P. Linas
- Boston University School of Medicine, Boston, Massachusetts, United States of America
| | - Peter P. Mueller
- Massachusetts General Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Jade Xiao
- Georgia Institute of Technology, Atlanta, Georgia, United States of America
| | - Johannes Bracher
- Chair of Econometrics and Statistics, Karlsruhe Institute of Technology, Karlsruhe, Germany
| | | | - Aaron Gerding
- University of Massachusetts, Amherst, Amherst, Massachusetts, United States of America
| | - Tilmann Gneiting
- Heidelberg Institute for Theoretical Studies, Heidelberg, Germany
| | - Yuxin Huang
- University of Massachusetts, Amherst, Amherst, Massachusetts, United States of America
| | - Dasuni Jayawardena
- University of Massachusetts, Amherst, Amherst, Massachusetts, United States of America
| | - Abdul H. Kanji
- University of Massachusetts, Amherst, Amherst, Massachusetts, United States of America
| | - Khoa Le
- University of Massachusetts, Amherst, Amherst, Massachusetts, United States of America
| | - Anja Mühlemann
- Institute of Mathematical Statistics and Actuarial Science, University of Bern, Bern, Switzerland
| | - Jarad Niemi
- Iowa State University, Ames, Iowa, United States of America
| | - Evan L. Ray
- University of Massachusetts, Amherst, Amherst, Massachusetts, United States of America
| | - Ariane Stark
- University of Massachusetts, Amherst, Amherst, Massachusetts, United States of America
| | - Yijin Wang
- University of Massachusetts, Amherst, Amherst, Massachusetts, United States of America
| | - Nutcha Wattanachit
- University of Massachusetts, Amherst, Amherst, Massachusetts, United States of America
| | - Martha W. Zorn
- University of Massachusetts, Amherst, Amherst, Massachusetts, United States of America
| | - Sen Pei
- Mailman School of Public Health, Columbia University, New York, New York, United States of America
| | - Jeffrey Shaman
- Mailman School of Public Health, Columbia University, New York, New York, United States of America
| | - Teresa K. Yamana
- Mailman School of Public Health, Columbia University, New York, New York, United States of America
| | - Samuel R. Tarasewicz
- Federal Reserve Bank of San Francisco, San Francisco, California, United States of America
| | - Daniel J. Wilson
- Federal Reserve Bank of San Francisco, San Francisco, California, United States of America
| | - Sid Baccam
- IEM, Bel Air, Maryland, United States of America
| | - Heidi Gurung
- IEM, Bel Air, Maryland, United States of America
| | - Steve Stage
- IEM, Baton Rouge, Louisiana, United States of America
| | | | - Lei Gao
- George Mason University, Fairfax, Virginia, United States of America
| | - Zhiling Gu
- Iowa State University, Ames, Iowa, United States of America
| | - Myungjin Kim
- Kyungpook National University, Bukgu, Daegu, Republic of Korea
| | - Xinyi Li
- Clemson University, Clemson, South Carolina, United States of America
| | - Guannan Wang
- College of William & Mary, Williamsburg, Virginia, United States of America
| | - Lily Wang
- George Mason University, Fairfax, Virginia, United States of America
| | - Yueying Wang
- Amazon, Seattle, Washington, United States of America
| | - Shan Yu
- University of Virginia, Charlottesville, Virginia, United States of America
| | - Lauren Gardner
- Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Sonia Jindal
- Johns Hopkins University, Baltimore, Maryland, United States of America
| | | | - Kristen Nixon
- Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Juan Dent
- Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, United States of America
| | - Alison L. Hill
- Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Joshua Kaminsky
- Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, United States of America
| | - Elizabeth C. Lee
- Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, United States of America
| | | | - Justin Lessler
- Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Claire P. Smith
- Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, United States of America
| | - Shaun Truelove
- Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, United States of America
| | - Matt Kinsey
- Johns Hopkins University Applied Physics Lab, Baltimore, Maryland, United States of America
| | - Luke C. Mullany
- Johns Hopkins University Applied Physics Lab, Baltimore, Maryland, United States of America
| | | | - Lauren Shin
- Johns Hopkins University Applied Physics Lab, Baltimore, Maryland, United States of America
| | - Katharine Tallaksen
- Johns Hopkins University Applied Physics Lab, Baltimore, Maryland, United States of America
| | - Shelby Wilson
- Johns Hopkins University Applied Physics Lab, Baltimore, Maryland, United States of America
| | - Dean Karlen
- University of Victoria and TRIUMF, Victoria, British Columbia, Canada
| | - Lauren Castro
- Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
| | - Geoffrey Fairchild
- Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
| | - Isaac Michaud
- Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
| | - Dave Osthus
- Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
| | - Jiang Bian
- Microsoft, Redmond, Washington, United States of America
| | - Wei Cao
- Microsoft, Redmond, Washington, United States of America
| | - Zhifeng Gao
- Microsoft, Redmond, Washington, United States of America
| | | | - Chaozhuo Li
- Microsoft, Redmond, Washington, United States of America
| | - Tie-Yan Liu
- Microsoft, Redmond, Washington, United States of America
| | - Xing Xie
- Microsoft, Redmond, Washington, United States of America
| | - Shun Zhang
- Microsoft, Redmond, Washington, United States of America
| | - Shun Zheng
- Microsoft, Redmond, Washington, United States of America
| | - Matteo Chinazzi
- Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, Massachusetts, United States of America
| | - Jessica T. Davis
- Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, Massachusetts, United States of America
| | - Kunpeng Mu
- Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, Massachusetts, United States of America
| | - Ana Pastore y Piontti
- Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, Massachusetts, United States of America
| | - Alessandro Vespignani
- Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, Massachusetts, United States of America
| | - Xinyue Xiong
- Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, Massachusetts, United States of America
| | | | - Jinghui Chen
- University of California, Los Angeles, Los Angeles, California, United States of America
| | - Quanquan Gu
- University of California, Los Angeles, Los Angeles, California, United States of America
| | - Lingxiao Wang
- University of California, Los Angeles, Los Angeles, California, United States of America
| | - Pan Xu
- University of California, Los Angeles, Los Angeles, California, United States of America
| | - Weitong Zhang
- University of California, Los Angeles, Los Angeles, California, United States of America
| | - Difan Zou
- University of California, Los Angeles, Los Angeles, California, United States of America
| | - Graham Casey Gibson
- Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
| | - Daniel Sheldon
- University of Massachusetts, Amherst, Amherst, Massachusetts, United States of America
| | - Ajitesh Srivastava
- University of Southern California, Los Angeles, California, United States of America
| | - Aniruddha Adiga
- University of Virginia, Charlottesville, Virginia, United States of America
| | - Benjamin Hurt
- University of Virginia, Charlottesville, Virginia, United States of America
| | - Gursharn Kaur
- University of Virginia, Charlottesville, Virginia, United States of America
| | - Bryan Lewis
- University of Virginia, Charlottesville, Virginia, United States of America
| | - Madhav Marathe
- University of Virginia, Charlottesville, Virginia, United States of America
| | | | | | | | - Lijing Wang
- New Jersey Institute of Technology, Newark, New Jersey, United States of America
| | - Pragati V. Prasad
- COVID-19 Response, Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America
| | - Jo W. Walker
- COVID-19 Response, Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America
| | - Alexander E. Webber
- COVID-19 Response, Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America
| | - Rachel B. Slayton
- COVID-19 Response, Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America
| | - Matthew Biggerstaff
- COVID-19 Response, Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America
| | - Nicholas G. Reich
- University of Massachusetts, Amherst, Amherst, Massachusetts, United States of America
| | - Michael A. Johansson
- COVID-19 Response, Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America
| |
Collapse
|
5
|
Bay C, St-Onge G, Davis JT, Chinazzi M, Howerton E, Lessler J, Runge MC, Shea K, Truelove S, Viboud C, Vespignani A. Ensemble 2: Scenarios ensembling for communication and performance analysis. Epidemics 2024; 46:100748. [PMID: 38394928 DOI: 10.1016/j.epidem.2024.100748] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Revised: 12/19/2023] [Accepted: 02/06/2024] [Indexed: 02/25/2024] Open
Abstract
Throughout the COVID-19 pandemic, scenario modeling played a crucial role in shaping the decision-making process of public health policies. Unlike forecasts, scenario projections rely on specific assumptions about the future that consider different plausible states-of-the-world that may or may not be realized and that depend on policy interventions, unpredictable changes in the epidemic outlook, etc. As a consequence, long-term scenario projections require different evaluation criteria than the ones used for traditional short-term epidemic forecasts. Here, we propose a novel ensemble procedure for assessing pandemic scenario projections using the results of the Scenario Modeling Hub (SMH) for COVID-19 in the United States (US). By defining a "scenario ensemble" for each model and the ensemble of models, termed "Ensemble2", we provide a synthesis of potential epidemic outcomes, which we use to assess projections' performance, bypassing the identification of the most plausible scenario. We find that overall the Ensemble2 models are well-calibrated and provide better performance than the scenario ensemble of individual models. The ensemble procedure accounts for the full range of plausible outcomes and highlights the importance of scenario design and effective communication. The scenario ensembling approach can be extended to any scenario design strategy, with potential refinements including weighting scenarios and allowing the ensembling process to evolve over time.
Collapse
Affiliation(s)
- Clara Bay
- Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Network Science Institute, Boston, MA, USA
| | - Guillaume St-Onge
- Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Network Science Institute, Boston, MA, USA
| | - Jessica T Davis
- Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Network Science Institute, Boston, MA, USA
| | - Matteo Chinazzi
- Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Network Science Institute, Boston, MA, USA; The Roux Institute, Northeastern University, Portland, ME, USA
| | - Emily Howerton
- Department of Biology and Center for Infectious Disease Dynamics, The Pennsylvania State University, University Park, PA, USA
| | - Justin Lessler
- Department of Epidemiology, University of North Carolina Gillings School of Public Health, Chapel Hill, NC, USA; Carolina Population Center, University of North Carolina, Chapel Hill, NC, USA; Department of Epidemiology, Bloomberg School of Public Health, Johns Hopkins University, Baltimore, MD, USA
| | - Michael C Runge
- U.S. Geological Survey, Eastern Ecological Science Center, Laurel, MD, USA
| | - Katriona Shea
- Department of Biology and Center for Infectious Disease Dynamics, The Pennsylvania State University, University Park, PA, USA
| | - Shaun Truelove
- Department of Epidemiology, Bloomberg School of Public Health, Johns Hopkins University, Baltimore, MD, USA; Department of International Health, Bloomberg School of Public Health, Johns Hopkins University, Baltimore, MD, USA
| | - Cecile Viboud
- Division of International Epidemiology and Population Studies, Fogarty International Center, National Institutes of Health, Bethesda, MD, USA
| | - Alessandro Vespignani
- Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Network Science Institute, Boston, MA, USA; The Roux Institute, Northeastern University, Portland, ME, USA.
| |
Collapse
|
6
|
Holcomb KM, Staples JE, Nett RJ, Beard CB, Petersen LR, Benjamin SG, Green BW, Jones H, Johansson MA. Multi-Model Prediction of West Nile Virus Neuroinvasive Disease With Machine Learning for Identification of Important Regional Climatic Drivers. GEOHEALTH 2023; 7:e2023GH000906. [PMID: 38023388 PMCID: PMC10654557 DOI: 10.1029/2023gh000906] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Revised: 09/15/2023] [Accepted: 10/21/2023] [Indexed: 12/01/2023]
Abstract
West Nile virus (WNV) is the leading cause of mosquito-borne illness in the continental United States (CONUS). Spatial heterogeneity in historical incidence, environmental factors, and complex ecology make prediction of spatiotemporal variation in WNV transmission challenging. Machine learning provides promising tools for identification of important variables in such situations. To predict annual WNV neuroinvasive disease (WNND) cases in CONUS (2015-2021), we fitted 10 probabilistic models with variation in complexity from naïve to machine learning algorithm and an ensemble. We made predictions in each of nine climate regions on a hexagonal grid and evaluated each model's predictive accuracy. Using the machine learning models (random forest and neural network), we identified the relative importance and variation in ranking of predictors (historical WNND cases, climate anomalies, human demographics, and land use) across regions. We found that historical WNND cases and population density were among the most important factors while anomalies in temperature and precipitation often had relatively low importance. While the relative performance of each model varied across climatic regions, the magnitude of difference between models was small. All models except the naïve model had non-significant differences in performance relative to the baseline model (negative binomial model fit per hexagon). No model, including the ensemble or more complex machine learning models, outperformed models based on historical case counts on the hexagon or region level; these models are good forecasting benchmarks. Further work is needed to assess if predictive capacity can be improved beyond that of these historical baselines.
Collapse
Affiliation(s)
- Karen M. Holcomb
- Global Systems LaboratoryNational Oceanic and Atmospheric AdministrationBoulderCOUSA
- Now at Division of Vector‐Borne DiseasesCenters for Disease Control and PreventionFort CollinsCOUSA
| | - J. Erin Staples
- Division of Vector‐Borne DiseasesCenters for Disease Control and PreventionFort CollinsCOUSA
| | - Randall J. Nett
- Division of Vector‐Borne DiseasesCenters for Disease Control and PreventionFort CollinsCOUSA
| | - Charles B. Beard
- Division of Vector‐Borne DiseasesCenters for Disease Control and PreventionFort CollinsCOUSA
| | - Lyle R. Petersen
- Division of Vector‐Borne DiseasesCenters for Disease Control and PreventionFort CollinsCOUSA
| | - Stanley G. Benjamin
- Global Systems LaboratoryNational Oceanic and Atmospheric AdministrationBoulderCOUSA
- Cooperative Institute for Research in Environmental SciencesUniversity of Colorado BoulderBoulderCOUSA
| | - Benjamin W. Green
- Global Systems LaboratoryNational Oceanic and Atmospheric AdministrationBoulderCOUSA
- Cooperative Institute for Research in Environmental SciencesUniversity of Colorado BoulderBoulderCOUSA
| | - Hunter Jones
- Climate Prediction OfficeNational Oceanic and Atmospheric AdministrationSilver SpringMDUSA
| | - Michael A. Johansson
- Division of Vector‐Borne DiseasesCenters for Disease Control and PreventionSan JuanPRUSA
| |
Collapse
|
7
|
Bhatia S, Parag KV, Wardle J, Nash RK, Imai N, Elsland SLV, Lassmann B, Brownstein JS, Desai A, Herringer M, Sewalk K, Loeb SC, Ramatowski J, Cuomo-Dannenburg G, Jauneikaite E, Unwin HJT, Riley S, Ferguson N, Donnelly CA, Cori A, Nouvellet P. Retrospective evaluation of real-time estimates of global COVID-19 transmission trends and mortality forecasts. PLoS One 2023; 18:e0286199. [PMID: 37851661 PMCID: PMC10584190 DOI: 10.1371/journal.pone.0286199] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Accepted: 05/11/2023] [Indexed: 10/20/2023] Open
Abstract
Since 8th March 2020 up to the time of writing, we have been producing near real-time weekly estimates of SARS-CoV-2 transmissibility and forecasts of deaths due to COVID-19 for all countries with evidence of sustained transmission, shared online. We also developed a novel heuristic to combine weekly estimates of transmissibility to produce forecasts over a 4-week horizon. Here we present a retrospective evaluation of the forecasts produced between 8th March to 29th November 2020 for 81 countries. We evaluated the robustness of the forecasts produced in real-time using relative error, coverage probability, and comparisons with null models. During the 39-week period covered by this study, both the short- and medium-term forecasts captured well the epidemic trajectory across different waves of COVID-19 infections with small relative errors over the forecast horizon. The model was well calibrated with 56.3% and 45.6% of the observations lying in the 50% Credible Interval in 1-week and 4-week ahead forecasts respectively. The retrospective evaluation of our models shows that simple transmission models calibrated using routine disease surveillance data can reliably capture the epidemic trajectory in multiple countries. The medium-term forecasts can be used in conjunction with the short-term forecasts of COVID-19 mortality as a useful planning tool as countries continue to relax public health measures.
Collapse
Affiliation(s)
- Sangeeta Bhatia
- MRC Centre for Global Infectious Disease Analysis, School of Public Health, Imperial College London, London, United Kingdom
- NIHR Health Protection Research Unit in Modelling and Health Economics, Modelling & Economics Unit, UK Health Security Agency, London, United Kingdom
| | - Kris V. Parag
- MRC Centre for Global Infectious Disease Analysis, School of Public Health, Imperial College London, London, United Kingdom
| | - Jack Wardle
- MRC Centre for Global Infectious Disease Analysis, School of Public Health, Imperial College London, London, United Kingdom
| | - Rebecca K. Nash
- MRC Centre for Global Infectious Disease Analysis, School of Public Health, Imperial College London, London, United Kingdom
| | - Natsuko Imai
- MRC Centre for Global Infectious Disease Analysis, School of Public Health, Imperial College London, London, United Kingdom
| | - Sabine L. Van Elsland
- MRC Centre for Global Infectious Disease Analysis, School of Public Health, Imperial College London, London, United Kingdom
| | - Britta Lassmann
- ProMED-mail, International Society for Infectious Diseases, Brookline, MA, United States of America
| | - John S. Brownstein
- Boston Children’s Hospital, Computational Epidemiology Lab, Boston, MA, United States of America
| | - Angel Desai
- ProMED-mail, International Society for Infectious Diseases, Brookline, MA, United States of America
- Division of Infectious Diseases, Department of Internal Medicine, University of California Davis, Sacramento, California, United States of America
| | - Mark Herringer
- Healthsites.io, The Global Healthsites Mapping Project, London, United Kingdom
| | - Kara Sewalk
- Boston Children’s Hospital, Computational Epidemiology Lab, Boston, MA, United States of America
| | - Sarah Claire Loeb
- ProMED-mail, International Society for Infectious Diseases, Brookline, MA, United States of America
| | - John Ramatowski
- ProMED-mail, International Society for Infectious Diseases, Brookline, MA, United States of America
| | - Gina Cuomo-Dannenburg
- MRC Centre for Global Infectious Disease Analysis, School of Public Health, Imperial College London, London, United Kingdom
| | - Elita Jauneikaite
- MRC Centre for Global Infectious Disease Analysis, School of Public Health, Imperial College London, London, United Kingdom
| | - H. Juliette T. Unwin
- MRC Centre for Global Infectious Disease Analysis, School of Public Health, Imperial College London, London, United Kingdom
| | - Steven Riley
- MRC Centre for Global Infectious Disease Analysis, School of Public Health, Imperial College London, London, United Kingdom
| | - Neil Ferguson
- MRC Centre for Global Infectious Disease Analysis, School of Public Health, Imperial College London, London, United Kingdom
| | - Christl A. Donnelly
- MRC Centre for Global Infectious Disease Analysis, School of Public Health, Imperial College London, London, United Kingdom
- Department of Statistics, University of Oxford, Oxford, United Kingdom
| | - Anne Cori
- MRC Centre for Global Infectious Disease Analysis, School of Public Health, Imperial College London, London, United Kingdom
| | - Pierre Nouvellet
- MRC Centre for Global Infectious Disease Analysis, School of Public Health, Imperial College London, London, United Kingdom
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
| |
Collapse
|
8
|
Morris M, Hayes P, Cox IJ, Lampos V. Neural network models for influenza forecasting with associated uncertainty using Web search activity trends. PLoS Comput Biol 2023; 19:e1011392. [PMID: 37639427 PMCID: PMC10491400 DOI: 10.1371/journal.pcbi.1011392] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Revised: 09/08/2023] [Accepted: 07/26/2023] [Indexed: 08/31/2023] Open
Abstract
Influenza affects millions of people every year. It causes a considerable amount of medical visits and hospitalisations as well as hundreds of thousands of deaths. Forecasting influenza prevalence with good accuracy can significantly help public health agencies to timely react to seasonal or novel strain epidemics. Although significant progress has been made, influenza forecasting remains a challenging modelling task. In this paper, we propose a methodological framework that improves over the state-of-the-art forecasting accuracy of influenza-like illness (ILI) rates in the United States. We achieve this by using Web search activity time series in conjunction with historical ILI rates as observations for training neural network (NN) architectures. The proposed models incorporate Bayesian layers to produce associated uncertainty intervals to their forecast estimates, positioning themselves as legitimate complementary solutions to more conventional approaches. The best performing NN, referred to as the iterative recurrent neural network (IRNN) architecture, reduces mean absolute error by 10.3% and improves skill by 17.1% on average in nowcasting and forecasting tasks across 4 consecutive flu seasons.
Collapse
Affiliation(s)
- Michael Morris
- University College London, Centre for Artificial Intelligence, Department of Computer Science, London, United Kingdom
| | - Peter Hayes
- University College London, Centre for Artificial Intelligence, Department of Computer Science, London, United Kingdom
| | - Ingemar J. Cox
- University College London, Centre for Artificial Intelligence, Department of Computer Science, London, United Kingdom
- University of Copenhagen, Department of Computer Science, Copenhagen, Denmark
| | - Vasileios Lampos
- University College London, Centre for Artificial Intelligence, Department of Computer Science, London, United Kingdom
| |
Collapse
|
9
|
Aronis JM, Ye Y, Espino J, Hochheiser H, Michaels MG, Cooper GF. A Bayesian System to Track Outbreaks of Influenza-Like Illnesses Including Novel Diseases. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.05.10.23289799. [PMID: 37293033 PMCID: PMC10246032 DOI: 10.1101/2023.05.10.23289799] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
It would be highly desirable to have a tool that detects the outbreak of a new influenza-like illness, such as COVID-19, accurately and early. This paper describes the ILI Tracker algorithm that first models the daily occurrence of a set of known influenza-like illnesses in a hospital emergency department using findings extracted from patient-care reports using natural language processing. We include results based on modeling the diseases influenza, respiratory syncytial virus, human metapneumovirus, and parainfluenza for five emergency departments in Allegheny County Pennsylvania from June 1, 2010 through May 31, 2015. We then show how the algorithm can be extended to detect the presence of an unmodeled disease which may represent a novel disease outbreak. We also include results for detecting an outbreak of an unmodeled disease during the mentioned time period, which in retrospect was very likely an outbreak of Enterovirus D68.
Collapse
|
10
|
Pei S, Blumberg S, Vega JC, Robin T, Zhang Y, Medford RJ, Adhikari B, Shaman J. Challenges in Forecasting Antimicrobial Resistance. Emerg Infect Dis 2023; 29:679-685. [PMID: 36958029 PMCID: PMC10045679 DOI: 10.3201/eid2904.221552] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/25/2023] Open
Abstract
Antimicrobial resistance is a major threat to human health. Since the 2000s, computational tools for predicting infectious diseases have been greatly advanced; however, efforts to develop real-time forecasting models for antimicrobial-resistant organisms (AMROs) have been absent. In this perspective, we discuss the utility of AMRO forecasting at different scales, highlight the challenges in this field, and suggest future research priorities. We also discuss challenges in scientific understanding, access to high-quality data, model calibration, and implementation and evaluation of forecasting models. We further highlight the need to initiate research on AMRO forecasting using currently available data and resources to galvanize the research community and address initial practical questions.
Collapse
|
11
|
Bleichrodt A, Dahal S, Maloney K, Casanova L, Luo R, Chowell G. Real-time forecasting the trajectory of monkeypox outbreaks at the national and global levels, July-October 2022. BMC Med 2023; 21:19. [PMID: 36647108 PMCID: PMC9841951 DOI: 10.1186/s12916-022-02725-2] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Accepted: 12/28/2022] [Indexed: 01/17/2023] Open
Abstract
BACKGROUND Beginning May 7, 2022, multiple nations reported an unprecedented surge in monkeypox cases. Unlike past outbreaks, differences in affected populations, transmission mode, and clinical characteristics have been noted. With the existing uncertainties of the outbreak, real-time short-term forecasting can guide and evaluate the effectiveness of public health measures. METHODS We obtained publicly available data on confirmed weekly cases of monkeypox at the global level and for seven countries (with the highest burden of disease at the time this study was initiated) from the Our World in Data (OWID) GitHub repository and CDC website. We generated short-term forecasts of new cases of monkeypox across the study areas using an ensemble n-sub-epidemic modeling framework based on weekly cases using 10-week calibration periods. We report and assess the weekly forecasts with quantified uncertainty from the top-ranked, second-ranked, and ensemble sub-epidemic models. Overall, we conducted 324 weekly sequential 4-week ahead forecasts across the models from the week of July 28th, 2022, to the week of October 13th, 2022. RESULTS The last 10 of 12 forecasting periods (starting the week of August 11th, 2022) show either a plateauing or declining trend of monkeypox cases for all models and areas of study. According to our latest 4-week ahead forecast from the top-ranked model, a total of 6232 (95% PI 487.8, 12,468.0) cases could be added globally from the week of 10/20/2022 to the week of 11/10/2022. At the country level, the top-ranked model predicts that the USA will report the highest cumulative number of new cases for the 4-week forecasts (median based on OWID data: 1806 (95% PI 0.0, 5544.5)). The top-ranked and weighted ensemble models outperformed all other models in short-term forecasts. CONCLUSIONS Our top-ranked model consistently predicted a decreasing trend in monkeypox cases on the global and country-specific scale during the last ten sequential forecasting periods. Our findings reflect the potential impact of increased immunity, and behavioral modification among high-risk populations.
Collapse
Affiliation(s)
- Amanda Bleichrodt
- Department of Population Health Sciences, School of Public Health, Georgia State University, Atlanta, GA, USA
| | - Sushma Dahal
- Department of Population Health Sciences, School of Public Health, Georgia State University, Atlanta, GA, USA
| | - Kevin Maloney
- Department of Population Health Sciences, School of Public Health, Georgia State University, Atlanta, GA, USA
| | - Lisa Casanova
- Department of Population Health Sciences, School of Public Health, Georgia State University, Atlanta, GA, USA
| | - Ruiyan Luo
- Department of Population Health Sciences, School of Public Health, Georgia State University, Atlanta, GA, USA
| | - Gerardo Chowell
- Department of Population Health Sciences, School of Public Health, Georgia State University, Atlanta, GA, USA.
| |
Collapse
|
12
|
Iftikhar H, Daniyal M, Qureshi M, Tawaiah K, Ansah RK, Afriyie JK. A hybrid forecasting technique for infection and death from the mpox virus. Digit Health 2023; 9:20552076231204748. [PMID: 37799502 PMCID: PMC10548807 DOI: 10.1177/20552076231204748] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2023] [Accepted: 09/14/2023] [Indexed: 10/07/2023] Open
Abstract
Objectives The rising of new cases and death counts from the mpox virus (MPV) is alarming. In order to mitigate the impact of the MPV it is essential to have information of the virus's future position using more precise time series and stochastic models. In this present study, a hybrid forecasting system has been developed for new cases and death counts for MPV infection using the world daily cumulative confirmed and death series. Methods The original cumulative series was decomposed into new two subseries, such as a trend component and a stochastic series using the Hodrick-Prescott filter. To assess the efficacy of the proposed models, a comparative analysis with several widely recognized benchmark models, including auto-regressive (AR) model, auto-regressive moving average (ARMA) model, non-parametric auto-regressive (NPAR) model and artificial neural network (ANN), was performed. Results The introduction of two novel hybrid models, HPF 1 1 and HPF 3 4 , which demonstrated superior performance compared to all other models, as evidenced by their remarkable results in key performance indicators such as root mean square error (RMSE), mean absolute error (MAE) and mean absolute percentage error (MAPE), is a significant advancement in disease prediction. Conclusion The new models developed can be implemented in forecasting other diseases in the future. To address the current situation effectively, governments and stakeholders must implement significant changes to ensure strict adherence to standard operating procedures (SOPs) by the public. Given the anticipated continuation of increasing trends in the coming days, these measures are essential for mitigating the impact of the outbreak.
Collapse
Affiliation(s)
- Hasnain Iftikhar
- Department of Statistics, Quaid-i-Azam University, Islamabad, Pakistan
| | - Muhammad Daniyal
- Department of Statistics, The Islamia University of Bahawalpur, Bahawalpur, Pakistan
| | - Moiz Qureshi
- Department of Statistics, Shaheed Benazir Bhutto University, Shaheed Benazirabad, Pakistan
| | - Kassim Tawaiah
- Department of Mathematics and Statistics, University of Energy and Natural Resources, Sunyani, Ghana
- Department of Statistics and Actuarial Science, Kwame Nkrumah University of Science and Technology, Kumasi, Ghana
| | - Richard Kwame Ansah
- Department of Mathematics and Statistics, University of Energy and Natural Resources, Sunyani, Ghana
- Department of Mathematics, Kwame Nkrumah University of Science and Technology, Kumasi, Ghana
| | - Jonathan Kwaku Afriyie
- Department of Statistics and Actuarial Science, Kwame Nkrumah University of Science and Technology, Kumasi, Ghana
| |
Collapse
|
13
|
Osthus D. Fast and accurate influenza forecasting in the United States with Inferno. PLoS Comput Biol 2022; 18:e1008651. [PMID: 35100253 PMCID: PMC8830797 DOI: 10.1371/journal.pcbi.1008651] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Revised: 02/10/2022] [Accepted: 01/02/2022] [Indexed: 01/15/2023] Open
Abstract
Infectious disease forecasting is an emerging field and has the potential to improve public health through anticipatory resource allocation, situational awareness, and mitigation planning. By way of exploring and operationalizing disease forecasting, the U.S. Centers for Disease Control and Prevention (CDC) has hosted FluSight since the 2013/14 flu season, an annual flu forecasting challenge. Since FluSight’s onset, forecasters have developed and improved forecasting models in an effort to provide more timely, reliable, and accurate information about the likely progression of the outbreak. While improving the predictive performance of these forecasting models is often the primary objective, it is also important for a forecasting model to run quickly, facilitating further model development and improvement while providing flexibility when deployed in a real-time setting. In this vein I introduce Inferno, a fast and accurate flu forecasting model inspired by Dante, the top performing model in the 2018/19 FluSight challenge. When pseudoprospectively compared to all models that participated in FluSight 2018/19, Inferno would have placed 2nd in the national and regional challenge as well as the state challenge, behind only Dante. Inferno, however, runs in minutes and is trivially parallelizable, while Dante takes hours to run, representing a significant operational improvement with minimal impact to performance. Forecasting challenges like FluSight should continue to monitor and evaluate how they can be modified and expanded to incentivize the development of forecasting models that benefit public health. Infectious disease forecasting, if accurate, timely, and reliable, can assist decision makers with resource allocation planning in an attempt to curb the negative impacts of an outbreak. Forecasting challenges, like the U.S. Centers for Disease Control and Prevention’s flu forecasting challenge, FluSight, provide a space for teams to develop and operationalize real-time forecasting models that benefit public health, with weekly forecasts made at the state-level, Health and Human Services region-level, and the United States. The ultimate goal of these models is to produce accurate forecasts within the constraints of the forecasting challenge. Having a forecasting model that runs quickly is also important for future scalability, model development, and operational flexibility. In this paper, I present a fast and accurate flu forecasting model, Inferno. Through retrospective comparisons with FluSight-participating models, Inferno was shown to be a leading forecasting model in the field. Inferno, however, runs in minutes not hours, as other leading forecasting models do. This reduction in runtime constitutes an advancement in flu forecasting, positioning Inferno to scale to more granular geographic units, like counties or health care providers.
Collapse
Affiliation(s)
- Dave Osthus
- Statistical Sciences Group, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
- * E-mail:
| |
Collapse
|
14
|
McAndrew T, Reich NG. Adaptively stacking ensembles for influenza forecasting. Stat Med 2021; 40:6931-6952. [PMID: 34647627 PMCID: PMC8671371 DOI: 10.1002/sim.9219] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2021] [Revised: 09/13/2021] [Accepted: 09/14/2021] [Indexed: 01/01/2023]
Abstract
Seasonal influenza infects between 10 and 50 million people in the United States every year. Accurate forecasts of influenza and influenza-like illness (ILI) have been named by the CDC as an important tool to fight the damaging effects of these epidemics. Multi-model ensembles make accurate forecasts of seasonal influenza, but current operational ensemble forecasts are static: they require an abundance of past ILI data and assign fixed weights to component models at the beginning of a season, but do not update weights as new data on component model performance is collected. We propose an adaptive ensemble that (i) does not initially need data to combine forecasts and (ii) finds optimal weights which are updated week-by-week throughout the influenza season. We take a regularized likelihood approach and investigate this regularizer's ability to impact adaptive ensemble performance. After finding an optimal regularization value, we compare our adaptive ensemble to an equal-weighted and static ensemble. Applied to forecasts of short-term ILI incidence at the regional and national level, our adaptive model outperforms an equal-weighted ensemble and has similar performance to the static ensemble using only a fraction of the data available to the static ensemble. Needing no data at the beginning of an epidemic, an adaptive ensemble can quickly train and forecast an outbreak, providing a practical tool to public health officials looking for a forecast to conform to unique features of a specific season.
Collapse
Affiliation(s)
- Thomas McAndrew
- Department of Biostatistics and Epidemiology, School of Public Health and Health Sciences, University of Massachusetts Amherst, Amherst, Massachusetts, United States,College of Health, Lehigh University, Bethlehem, Pennsylvania, United States,Correspondence: Thomas McAndrew, Lehigh University Bethlehem, Pennsylvania, United States of America.
| | - Nicholas G. Reich
- College of Health, Lehigh University, Bethlehem, Pennsylvania, United States
| |
Collapse
|
15
|
Epidemic tracking and forecasting: Lessons learned from a tumultuous year. Proc Natl Acad Sci U S A 2021; 118:2111456118. [PMID: 34903658 PMCID: PMC8713795 DOI: 10.1073/pnas.2111456118] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/23/2021] [Indexed: 01/15/2023] Open
|
16
|
Abstract
Influenza is a common respiratory infection that causes considerable morbidity and mortality worldwide each year. In recent years, along with the improvement in computational resources, there have been a number of important developments in the science of influenza surveillance and forecasting. Influenza surveillance systems have been improved by synthesizing multiple sources of information. Influenza forecasting has developed into an active field, with annual challenges in the United States that have stimulated improved methodologies. Work continues on the optimal approaches to assimilating surveillance data and information on relevant driving factors to improve estimates of the current situation (nowcasting) and to forecast future dynamics.
Collapse
Affiliation(s)
- Sheikh Taslim Ali
- World Health Organization Collaborating Centre for Infectious Disease Epidemiology and Control, School of Public Health, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong Special Administrative Region, China;
| | - Benjamin J Cowling
- World Health Organization Collaborating Centre for Infectious Disease Epidemiology and Control, School of Public Health, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong Special Administrative Region, China;
| |
Collapse
|
17
|
St-Onge G, Sun H, Allard A, Hébert-Dufresne L, Bianconi G. Universal Nonlinear Infection Kernel from Heterogeneous Exposure on Higher-Order Networks. PHYSICAL REVIEW LETTERS 2021; 127:158301. [PMID: 34678024 PMCID: PMC9199393 DOI: 10.1103/physrevlett.127.158301] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/02/2021] [Revised: 07/26/2021] [Accepted: 08/25/2021] [Indexed: 06/13/2023]
Abstract
The collocation of individuals in different environments is an important prerequisite for exposure to infectious diseases on a social network. Standard epidemic models fail to capture the potential complexity of this scenario by (1) neglecting the higher-order structure of contacts that typically occur through environments like workplaces, restaurants, and households, and (2) assuming a linear relationship between the exposure to infected contacts and the risk of infection. Here, we leverage a hypergraph model to embrace the heterogeneity of environments and the heterogeneity of individual participation in these environments. We find that combining heterogeneous exposure with the concept of minimal infective dose induces a universal nonlinear relationship between infected contacts and infection risk. Under nonlinear infection kernels, conventional epidemic wisdom breaks down with the emergence of discontinuous transitions, superexponential spread, and hysteresis.
Collapse
Affiliation(s)
- Guillaume St-Onge
- Département de physique, de génie physique et d’optique, Université Laval, Québec (Québec) G1V 0A6, Canada
- Centre interdisciplinaire en modélisation mathématique, Université Laval, Québec (Québec) G1V 0A6, Canada
| | - Hanlin Sun
- School of Mathematical Sciences, Queen Mary University of London, London E1 4NS, United Kingdom
| | - Antoine Allard
- Département de physique, de génie physique et d’optique, Université Laval, Québec (Québec) G1V 0A6, Canada
- Centre interdisciplinaire en modélisation mathématique, Université Laval, Québec (Québec) G1V 0A6, Canada
- Vermont Complex Systems Center, University of Vermont, Burlington, Vermont 05405, USA
| | - Laurent Hébert-Dufresne
- Département de physique, de génie physique et d’optique, Université Laval, Québec (Québec) G1V 0A6, Canada
- Vermont Complex Systems Center, University of Vermont, Burlington, Vermont 05405, USA
- Department of Computer Science, University of Vermont, Burlington, Vermont 05405, USA
| | - Ginestra Bianconi
- School of Mathematical Sciences, Queen Mary University of London, London E1 4NS, United Kingdom
- The Alan Turing Institute, 96 Euston Road, London NW1 2DB, United Kingdom
| |
Collapse
|
18
|
Pollett S, Johansson MA, Reich NG, Brett-Major D, Del Valle SY, Venkatramanan S, Lowe R, Porco T, Berry IM, Deshpande A, Kraemer MUG, Blazes DL, Pan-ngum W, Vespigiani A, Mate SE, Silal SP, Kandula S, Sippy R, Quandelacy TM, Morgan JJ, Ball J, Morton LC, Althouse BM, Pavlin J, van Panhuis W, Riley S, Biggerstaff M, Viboud C, Brady O, Rivers C. Recommended reporting items for epidemic forecasting and prediction research: The EPIFORGE 2020 guidelines. PLoS Med 2021; 18:e1003793. [PMID: 34665805 PMCID: PMC8525759 DOI: 10.1371/journal.pmed.1003793] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND The importance of infectious disease epidemic forecasting and prediction research is underscored by decades of communicable disease outbreaks, including COVID-19. Unlike other fields of medical research, such as clinical trials and systematic reviews, no reporting guidelines exist for reporting epidemic forecasting and prediction research despite their utility. We therefore developed the EPIFORGE checklist, a guideline for standardized reporting of epidemic forecasting research. METHODS AND FINDINGS We developed this checklist using a best-practice process for development of reporting guidelines, involving a Delphi process and broad consultation with an international panel of infectious disease modelers and model end users. The objectives of these guidelines are to improve the consistency, reproducibility, comparability, and quality of epidemic forecasting reporting. The guidelines are not designed to advise scientists on how to perform epidemic forecasting and prediction research, but rather to serve as a standard for reporting critical methodological details of such studies. CONCLUSIONS These guidelines have been submitted to the EQUATOR network, in addition to hosting by other dedicated webpages to facilitate feedback and journal endorsement.
Collapse
Affiliation(s)
- Simon Pollett
- Walter Reed Army Institute of Research, Silver Spring, Maryland, United States of America
| | - Michael A. Johansson
- Division of Vector-Borne Diseases, Centers for Disease Control & Prevention, San Juan, Puerto Rico, United States of America
| | - Nicholas G. Reich
- University of Massachusetts–Amherst, School of Public Health and Health Sciences, Amherst, Massachusetts, United States of America
| | - David Brett-Major
- University of Nebraska Medical Center, Omaha, Nebraska, United States of America
| | - Sara Y. Del Valle
- Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
| | - Srinivasan Venkatramanan
- Biocomplexity Institute and Initiative, University of Virginia, Charlottesville, Virginia, United States of America
| | - Rachel Lowe
- Centre for Mathematical Modelling of Infectious Diseases and Centre on Climate Change and Planetary Health, London School of Hygiene & Tropical Medicine, London, United Kingdom
- Barcelona Institute for Global Health, Barcelona, Spain
| | - Travis Porco
- University of California at San Francisco, San Francisco, California, United States of America
| | - Irina Maljkovic Berry
- Walter Reed Army Institute of Research, Silver Spring, Maryland, United States of America
| | - Alina Deshpande
- Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
| | | | - David L. Blazes
- Bill and Melinda Gates Foundation, Seattle, Washington, United States of America
| | - Wirichada Pan-ngum
- Mahidol-Oxford Tropical Medicine Research Unit and Department of Tropical Hygiene, Mahidol University, Thailand
| | - Alessandro Vespigiani
- Network Science Institute, Northeastern University, Boston, Massachusetts, United States of America
| | - Suzanne E. Mate
- Walter Reed Army Institute of Research, Silver Spring, Maryland, United States of America
| | - Sheetal P. Silal
- Modelling and Simulation Hub, Africa, Department of Statistical Sciences, University of Cape Town, Cape Town, South Africa
- Centre for Tropical Medicine and Global Health, Nuffield Department of Medicine, University of Oxford, Oxford, United Kingdom
| | - Sasikiran Kandula
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York City, New York, United States of America
| | - Rachel Sippy
- Institute for Global Health and Translational Science, State University of New York Upstate Medical University, Syracuse, New York, United States of America
| | - Talia M. Quandelacy
- Division of Vector-Borne Diseases, Centers for Disease Control & Prevention, San Juan, Puerto Rico, United States of America
| | - Jeffrey J. Morgan
- Catholic University of America, Washington, DC, United States of America
| | - Jacob Ball
- U.S. Army Public Health Center, Edgewood, Maryland, United States of America
| | - Lindsay C. Morton
- Armed Forces Health Surveillance Division, Global Emerging Infections Surveillance, Silver Spring, Maryland, United States of America
- George Washington University, Milken Institute School of Public Health, Washington, DC, United States of America
| | - Benjamin M. Althouse
- University of Washington, Seattle, Washington, United States of America
- Institute for Disease Modeling, Bellevue, Washington, United States of America
- New Mexico State University, Las Cruces, New Mexico, United States of America
| | - Julie Pavlin
- National Academies of Sciences, Engineering, and Medicine, Washington, DC, United States of America
| | - Wilbert van Panhuis
- University of Pittsburgh Graduate School of Public Health, Pittsburgh, Pennsylvania, United States of America
| | - Steven Riley
- MRC Centre for Global Infectious Disease Analysis, School of Public Health, Imperial College, London, United Kingdom
| | - Matthew Biggerstaff
- Influenza Division, Centers for Disease Control & Prevention, Atlanta, Georgia, United States of America
| | - Cecile Viboud
- Fogarty International Center, National Institutes for Health, Bethesda, Maryland, United States of America
| | - Oliver Brady
- Centre for the Mathematical Modelling of Infectious Diseases, London School of Hygiene & Tropical Medicine, London, United Kingdom
| | - Caitlin Rivers
- Johns Hopkins Center for Health Security, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, United States of America
| |
Collapse
|
19
|
De Toni G, Consonni C, Montresor A. A general method for estimating the prevalence of influenza-like-symptoms with Wikipedia data. PLoS One 2021; 16:e0256858. [PMID: 34464416 PMCID: PMC8407583 DOI: 10.1371/journal.pone.0256858] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2020] [Accepted: 08/17/2021] [Indexed: 12/04/2022] Open
Abstract
Influenza is an acute respiratory seasonal disease that affects millions of people worldwide and causes thousands of deaths in Europe alone. Estimating in a fast and reliable way the impact of an illness on a given country is essential to plan and organize effective countermeasures, which is now possible by leveraging unconventional data sources like web searches and visits. In this study, we show the feasibility of exploiting machine learning models and information about Wikipedia’s page views of a selected group of articles to obtain accurate estimates of influenza-like illnesses incidence in four European countries: Italy, Germany, Belgium, and the Netherlands. We propose a novel language-agnostic method, based on two algorithms, Personalized PageRank and CycleRank, to automatically select the most relevant Wikipedia pages to be monitored without the need for expert supervision. We then show how our model can reach state-of-the-art results by comparing it with previous solutions.
Collapse
Affiliation(s)
- Giovanni De Toni
- Department of Information Engineering and Computer Science (DISI), University of Trento, Trento, Italy
| | - Cristian Consonni
- Big Data and Data Science Unit, Eurecat - Centre Tecnològic de Catalunya, Barcelona, Spain
| | - Alberto Montresor
- Department of Information Engineering and Computer Science (DISI), University of Trento, Trento, Italy
| |
Collapse
|
20
|
Murphy C, Laurence E, Allard A. Deep learning of contagion dynamics on complex networks. Nat Commun 2021; 12:4720. [PMID: 34354055 PMCID: PMC8342694 DOI: 10.1038/s41467-021-24732-2] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Accepted: 07/05/2021] [Indexed: 11/26/2022] Open
Abstract
Forecasting the evolution of contagion dynamics is still an open problem to which mechanistic models only offer a partial answer. To remain mathematically or computationally tractable, these models must rely on simplifying assumptions, thereby limiting the quantitative accuracy of their predictions and the complexity of the dynamics they can model. Here, we propose a complementary approach based on deep learning where effective local mechanisms governing a dynamic on a network are learned from time series data. Our graph neural network architecture makes very few assumptions about the dynamics, and we demonstrate its accuracy using different contagion dynamics of increasing complexity. By allowing simulations on arbitrary network structures, our approach makes it possible to explore the properties of the learned dynamics beyond the training data. Finally, we illustrate the applicability of our approach using real data of the COVID-19 outbreak in Spain. Our results demonstrate how deep learning offers a new and complementary perspective to build effective models of contagion dynamics on networks.
Collapse
Affiliation(s)
- Charles Murphy
- Département de physique, de génie physique et d'optique, Université Laval, Québec, Québec, Canada
- Centre interdisciplinaire en modélisation mathématique, Université Laval, Québec, Québec, Canada
| | - Edward Laurence
- Département de physique, de génie physique et d'optique, Université Laval, Québec, Québec, Canada
- Centre interdisciplinaire en modélisation mathématique, Université Laval, Québec, Québec, Canada
| | - Antoine Allard
- Département de physique, de génie physique et d'optique, Université Laval, Québec, Québec, Canada.
- Centre interdisciplinaire en modélisation mathématique, Université Laval, Québec, Québec, Canada.
| |
Collapse
|
21
|
|
22
|
Miliou I, Xiong X, Rinzivillo S, Zhang Q, Rossetti G, Giannotti F, Pedreschi D, Vespignani A. Predicting seasonal influenza using supermarket retail records. PLoS Comput Biol 2021; 17:e1009087. [PMID: 34252075 PMCID: PMC8297944 DOI: 10.1371/journal.pcbi.1009087] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2020] [Revised: 07/22/2021] [Accepted: 05/15/2021] [Indexed: 11/19/2022] Open
Abstract
Increased availability of epidemiological data, novel digital data streams, and the rise of powerful machine learning approaches have generated a surge of research activity on real-time epidemic forecast systems. In this paper, we propose the use of a novel data source, namely retail market data to improve seasonal influenza forecasting. Specifically, we consider supermarket retail data as a proxy signal for influenza, through the identification of sentinel baskets, i.e., products bought together by a population of selected customers. We develop a nowcasting and forecasting framework that provides estimates for influenza incidence in Italy up to 4 weeks ahead. We make use of the Support Vector Regression (SVR) model to produce the predictions of seasonal flu incidence. Our predictions outperform both a baseline autoregressive model and a second baseline based on product purchases. The results show quantitatively the value of incorporating retail market data in forecasting models, acting as a proxy that can be used for the real-time analysis of epidemics.
Collapse
Affiliation(s)
- Ioanna Miliou
- University of Pisa, Pisa, Italy
- ISTI-CNR, Pisa, Italy
| | - Xinyue Xiong
- Northeastern University, Boston, Massachusetts, United States of America
| | | | - Qian Zhang
- Northeastern University, Boston, Massachusetts, United States of America
| | | | | | | | | |
Collapse
|
23
|
Muchnik L, Yom-Tov E, Levy N, Rubin A, Louzoun Y. Initial growth rates of malware epidemics fail to predict their reach. Sci Rep 2021; 11:11750. [PMID: 34083697 PMCID: PMC8175743 DOI: 10.1038/s41598-021-91321-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2020] [Accepted: 05/18/2021] [Indexed: 11/09/2022] Open
Abstract
Empirical studies show that epidemiological models based on an epidemic’s initial spread rate often fail to predict the true scale of that epidemic. Most epidemics with a rapid early rise die out before affecting a significant fraction of the population, whereas the early pace of some pandemics is rather modest. Recent models suggest that this could be due to the heterogeneity of the target population’s susceptibility. We study a computer malware ecosystem exhibiting spread mechanisms resembling those of biological systems while offering details unavailable for human epidemics. Rather than comparing models, we directly estimate reach from a new and vastly more complete data from a parallel domain, that offers superior details and insight as concerns biological outbreaks. We find a highly heterogeneous distribution of computer susceptibilities, with nearly all outbreaks initially over-affecting the tail of the distribution, then collapsing quickly once this tail is depleted. This mechanism restricts the correlation between an epidemic’s initial growth rate and its total reach, thus preventing the majority of epidemics, including initially fast-growing outbreaks, from reaching a macroscopic fraction of the population. The few pervasive malwares distinguish themselves early on via the following key trait: they avoid infecting the tail, while preferentially targeting computers unaffected by typical malware.
Collapse
Affiliation(s)
- Lev Muchnik
- School of Business Administration, Hebrew University of Jerusalem, Jerusalem, Israel.,Microsoft Research, Herzliya, Israel
| | | | - Nir Levy
- Microsoft Israel Research and Development Center, Tel Aviv, Israel
| | - Amir Rubin
- Microsoft Israel Research and Development Center, Tel Aviv, Israel.,Department of Computer Science, Ben-Gurion University of The Negev, Be'er Sheva, Israel
| | - Yoram Louzoun
- Department of Mathematics, Bar Ilan University, Ramat Gan, Israel.
| |
Collapse
|
24
|
Abstract
Influenza forecasting in the United States (US) is complex and challenging due to spatial and temporal variability, nested geographic scales of interest, and heterogeneous surveillance participation. Here we present Dante, a multiscale influenza forecasting model that learns rather than prescribes spatial, temporal, and surveillance data structure and generates coherent forecasts across state, regional, and national scales. We retrospectively compare Dante's short-term and seasonal forecasts for previous flu seasons to the Dynamic Bayesian Model (DBM), a leading competitor. Dante outperformed DBM for nearly all spatial units, flu seasons, geographic scales, and forecasting targets. Dante's sharper and more accurate forecasts also suggest greater public health utility. Dante placed 1st in the Centers for Disease Control and Prevention's prospective 2018/19 FluSight challenge in both the national and regional competition and the state competition. The methodology underpinning Dante can be used in other seasonal disease forecasting contexts having nested geographic scales of interest.
Collapse
Affiliation(s)
- Dave Osthus
- Los Alamos National Laboratory, Statistical Sciences Group, Los Alamos, NM, USA.
| | - Kelly R Moran
- Los Alamos National Laboratory, Statistical Sciences Group, Los Alamos, NM, USA.,Department of Statistical Science, Duke University, Durham, NC, USA
| |
Collapse
|
25
|
Poirier C, Hswen Y, Bouzillé G, Cuggia M, Lavenu A, Brownstein JS, Brewer T, Santillana M. Influenza forecasting for French regions combining EHR, web and climatic data sources with a machine learning ensemble approach. PLoS One 2021; 16:e0250890. [PMID: 34010293 PMCID: PMC8133501 DOI: 10.1371/journal.pone.0250890] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2021] [Accepted: 04/16/2021] [Indexed: 11/25/2022] Open
Abstract
Effective and timely disease surveillance systems have the potential to help public health officials design interventions to mitigate the effects of disease outbreaks. Currently, healthcare-based disease monitoring systems in France offer influenza activity information that lags real-time by one to three weeks. This temporal data gap introduces uncertainty that prevents public health officials from having a timely perspective on the population-level disease activity. Here, we present a machine-learning modeling approach that produces real-time estimates and short-term forecasts of influenza activity for the twelve continental regions of France by leveraging multiple disparate data sources that include, Google search activity, real-time and local weather information, flu-related Twitter micro-blogs, electronic health records data, and historical disease activity synchronicities across regions. Our results show that all data sources contribute to improving influenza surveillance and that machine-learning ensembles that combine all data sources lead to accurate and timely predictions.
Collapse
Affiliation(s)
- Canelle Poirier
- INSERM, U1099, Rennes, France
- Université de Rennes 1, LTSI, Rennes, France
- Department of Pediatrics, Harvard Medical School, Boston, MA, United States of America
- Computational Health Informatics Program, Boston Children’s Hospital, Boston, MA, United States of America
- * E-mail: (CP); (MS)
| | - Yulin Hswen
- Department of Social and Behavioral Sciences, Harvard T.H. Chan School of Public Health, Boston, MA, United States of America
- Innovation Program, Boston Children’s Hospital, Boston, MA, United States of America
| | - Guillaume Bouzillé
- INSERM, U1099, Rennes, France
- Université de Rennes 1, LTSI, Rennes, France
- CHU Rennes, Centre de Données Cliniques, Rennes, France
| | - Marc Cuggia
- INSERM, U1099, Rennes, France
- Université de Rennes 1, LTSI, Rennes, France
- CHU Rennes, Centre de Données Cliniques, Rennes, France
| | - Audrey Lavenu
- Université de Rennes 1, Faculté de médecine, Rennes, France
- INSERM CIC 1414, Université de Rennes 1, Rennes, France
- IRMAR, Institut de Recherche Mathématique de Rennes, Rennes, France
| | - John S. Brownstein
- Innovation Program, Boston Children’s Hospital, Boston, MA, United States of America
- Department of Pediatrics, Harvard Medical School, Boston, MA, United States of America
| | - Thomas Brewer
- Innovation Program, Boston Children’s Hospital, Boston, MA, United States of America
| | - Mauricio Santillana
- Department of Pediatrics, Harvard Medical School, Boston, MA, United States of America
- Computational Health Informatics Program, Boston Children’s Hospital, Boston, MA, United States of America
- * E-mail: (CP); (MS)
| |
Collapse
|
26
|
Biegel HR, Lega J. EpiCovDA: a mechanistic COVID-19 forecasting model with data assimilation. ARXIV 2021:arXiv:2105.05471v2. [PMID: 34012991 PMCID: PMC8132228] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Revised: 10/09/2021] [Indexed: 11/04/2022]
Abstract
We introduce a minimalist outbreak forecasting model that combines data-driven parameter estimation with variational data assimilation. By focusing on the fundamental components of nonlinear disease transmission and representing data in a domain where model stochasticity simplifies into a process with independent increments, we design an approach that only requires four core parameters to be estimated. We illustrate this novel methodology on COVID-19 forecasts. Results include case count and deaths predictions for the US and all of its 50 states, the District of Columbia, and Puerto Rico. The method is computationally efficient and is not disease- or location-specific. It may therefore be applied to other outbreaks or other countries, provided case counts and/or deaths data are available.
Collapse
Affiliation(s)
- Hannah R Biegel
- Department of Mathematics, University of Arizona, 617 N. Santa Rita Avenue, Tucson, AZ 85721
| | - Joceline Lega
- Department of Mathematics, University of Arizona, 617 N. Santa Rita Avenue, Tucson, AZ 85721
| |
Collapse
|
27
|
Keyes KM, Kandula S, Olfson M, Gould MS, Martínez-Alés G, Rutherford C, Shaman J. Suicide and the agent-host-environment triad: leveraging surveillance sources to inform prevention. Psychol Med 2021; 51:529-537. [PMID: 33663629 PMCID: PMC8020492 DOI: 10.1017/s003329172000536x] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/29/2020] [Revised: 12/14/2020] [Accepted: 01/06/2021] [Indexed: 12/11/2022]
Abstract
Suicide in the US has increased in the last decade, across virtually every age and demographic group. Parallel increases have occurred in non-fatal self-harm as well. Research on suicide across the world has consistently demonstrated that suicide shares many properties with a communicable disease, including person-to-person transmission and point-source outbreaks. This essay illustrates the communicable nature of suicide through analogy to basic infectious disease principles, including evidence for transmission and vulnerability through the agent-host-environment triad. We describe how mathematical modeling, a suite of epidemiological methods, which the COVID-19 pandemic has brought into renewed focus, can and should be applied to suicide in order to understand the dynamics of transmission and to forecast emerging risk areas. We describe how new and innovative sources of data, including social media and search engine data, can be used to augment traditional suicide surveillance, as well as the opportunities and challenges for modeling suicide as a communicable disease process in an effort to guide clinical and public health suicide prevention efforts.
Collapse
Affiliation(s)
| | - Sasikiran Kandula
- Department of Environmental Health Sciences, Columbia University, New York, NY, USA
| | - Mark Olfson
- Department of Epidemiology, Columbia University, New York, NY, USA
- Department of Psychiatry, Columbia University, New York, NY, USA
| | - Madelyn S. Gould
- Department of Epidemiology, Columbia University, New York, NY, USA
- Department of Psychiatry, Columbia University, New York, NY, USA
| | - Gonzalo Martínez-Alés
- Department of Epidemiology, Columbia University, New York, NY, USA
- Universidad Autónoma de Madrid School of Medicine, Madrid, Spain
| | | | - Jeffrey Shaman
- Department of Environmental Health Sciences, Columbia University, New York, NY, USA
| |
Collapse
|
28
|
Forecasting influenza activity using machine-learned mobility map. Nat Commun 2021; 12:726. [PMID: 33563980 PMCID: PMC7873234 DOI: 10.1038/s41467-021-21018-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2019] [Accepted: 01/07/2021] [Indexed: 11/18/2022] Open
Abstract
Human mobility is a primary driver of infectious disease spread. However, existing data is limited in availability, coverage, granularity, and timeliness. Data-driven forecasts of disease dynamics are crucial for decision-making by health officials and private citizens alike. In this work, we focus on a machine-learned anonymized mobility map (hereon referred to as AMM) aggregated over hundreds of millions of smartphones and evaluate its utility in forecasting epidemics. We factor AMM into a metapopulation model to retrospectively forecast influenza in the USA and Australia. We show that the AMM model performs on-par with those based on commuter surveys, which are sparsely available and expensive. We also compare it with gravity and radiation based models of mobility, and find that the radiation model’s performance is quite similar to AMM and commuter flows. Additionally, we demonstrate our model’s ability to predict disease spread even across state boundaries. Our work contributes towards developing timely infectious disease forecasting at a global scale using human mobility datasets expanding their applications in the area of infectious disease epidemiology. Human mobility plays a central role in the spread of infectious diseases and can help in forecasting incidence. Here the authors show a comparison of multiple mobility benchmarks in forecasting influenza, and demonstrate the value of a machine-learned mobility map with global coverage at multiple spatial scales.
Collapse
|
29
|
Pei S, Teng X, Lewis P, Shaman J. Optimizing respiratory virus surveillance networks using uncertainty propagation. Nat Commun 2021; 12:222. [PMID: 33431854 PMCID: PMC7801666 DOI: 10.1038/s41467-020-20399-3] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2019] [Accepted: 12/01/2020] [Indexed: 02/07/2023] Open
Abstract
Infectious disease prevention, control and forecasting rely on sentinel observations; however, many locations lack the capacity for routine surveillance. Here we show that, by using data from multiple sites collectively, accurate estimation and forecasting of respiratory diseases for locations without surveillance is feasible. We develop a framework to optimize surveillance sites that suppresses uncertainty propagation in a networked disease transmission model. Using influenza outbreaks from 35 US states, the optimized system generates better near-term predictions than alternate systems designed using population and human mobility. We also find that monitoring regional population centers serves as a reasonable proxy for the optimized network and could direct surveillance for diseases with limited records. The proxy method is validated using model simulations for 3,108 US counties and historical data for two other respiratory pathogens - human metapneumovirus and seasonal coronavirus - from 35 US states and can be used to guide systemic allocation of surveillance efforts.
Collapse
Affiliation(s)
- Sen Pei
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, 10032, USA.
| | - Xian Teng
- School of Computing and Information, University of Pittsburgh, Pittsburgh, PA, 15260, USA
| | - Paul Lewis
- Integrated Biosurveillance Section, Armed Forces Health Surveillance Branch, Silver Spring, MD, 20904, USA
| | - Jeffrey Shaman
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, 10032, USA.
| |
Collapse
|
30
|
Gibson GC, Moran KR, Reich NG, Osthus D. Improving probabilistic infectious disease forecasting through coherence. PLoS Comput Biol 2021; 17:e1007623. [PMID: 33406068 PMCID: PMC7837472 DOI: 10.1371/journal.pcbi.1007623] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2019] [Revised: 01/26/2021] [Accepted: 09/14/2020] [Indexed: 11/19/2022] Open
Abstract
With an estimated $10.4 billion in medical costs and 31.4 million outpatient visits each year, influenza poses a serious burden of disease in the United States. To provide insights and advance warning into the spread of influenza, the U.S. Centers for Disease Control and Prevention (CDC) runs a challenge for forecasting weighted influenza-like illness (wILI) at the national and regional level. Many models produce independent forecasts for each geographical unit, ignoring the constraint that the national wILI is a weighted sum of regional wILI, where the weights correspond to the population size of the region. We propose a novel algorithm that transforms a set of independent forecast distributions to obey this constraint, which we refer to as probabilistically coherent. Enforcing probabilistic coherence led to an increase in forecast skill for 79% of the models we tested over multiple flu seasons, highlighting the importance of respecting the forecasting system’s geographical hierarchy. Seasonal influenza causes a significant public health burden nationwide. Accurate influenza forecasting may help public health officials allocate resources and plan responses to emerging outbreaks. The U.S. Centers for Disease Control and Prevention (CDC) reports influenza data at multiple geographical units, including regionally and nationally, where the national data are by construction a weighted sum of the regional data. In an effort to improve influenza forecast accuracy across all models submitted to the CDC’s annual flu forecasting challenge, we examined the effect of imposing this geographical constraint on the set of independent forecasts, made publicly available by the CDC. We developed a novel method to transform forecast densities to obey the geographical constraint that respects the correlation structure between geographical units. This method showed consistent improvement across 79% of models and that held when stratified by targets and test seasons. Our method can be applied to other forecasting systems both within and outside an infectious disease context that have a geographical hierarchy.
Collapse
Affiliation(s)
- Graham Casey Gibson
- Statistical Sciences Group, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
- Department of Biostatistics and Epidemiology, University of Massachusetts-Amherst, Amherst, Massachusetts, United States of America
- * E-mail:
| | - Kelly R. Moran
- Statistical Sciences Group, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
- Department of Statistical Science, Duke University, Durham, North Carolina, United States of America
| | - Nicholas G. Reich
- Department of Biostatistics and Epidemiology, University of Massachusetts-Amherst, Amherst, Massachusetts, United States of America
| | - Dave Osthus
- Statistical Sciences Group, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
| |
Collapse
|
31
|
Sachak-Patwa R, Byrne HM, Thompson RN. Accounting for cross-immunity can improve forecast accuracy during influenza epidemics. Epidemics 2020; 34:100432. [PMID: 33360870 DOI: 10.1016/j.epidem.2020.100432] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2020] [Revised: 12/11/2020] [Accepted: 12/15/2020] [Indexed: 11/17/2022] Open
Abstract
Previous exposure to influenza viruses confers cross-immunity against future infections with related strains. However, this is not always accounted for explicitly in mathematical models used for forecasting during influenza outbreaks. We show that, if an influenza outbreak is due to a strain that is similar to one that has emerged previously, then accounting for cross-immunity explicitly can improve the accuracy of real-time forecasts. To do this, we consider two infectious disease outbreak forecasting models. In the first (the "1-group model"), all individuals are assumed to be identical and cross-immunity is not accounted for. In the second (the "2-group model"), individuals who have previously been infected by a related strain are assumed to be less likely to experience severe disease, and therefore recover more quickly, than immunologically naive individuals. We fit both models to estimated case notification data (including symptomatic individuals as well as laboratory-confirmed cases) from Japan from the 2009 H1N1 influenza pandemic, and then generate synthetic data for a future outbreak by assuming that the 2-group model represents the epidemiology of influenza infections more accurately. We use the 1-group model (as well as the 2-group model for comparison) to generate forecasts that would be obtained in real-time as the future outbreak is ongoing, using parameter values estimated from the 2009 epidemic as informative priors, motivated by the fact that without using prior information from 2009, the forecasts are highly uncertain. In the scenario that we consider, the 1-group model only produces accurate outbreak forecasts once the peak of the epidemic has passed, even when the values of important epidemiological parameters such as the lengths of the mean incubation and infectious periods are known exactly. As a result, it is necessary to use the more epidemiologically realistic 2-group model to generate accurate forecasts. Accounting for cross-immunity driven by exposures in previous outbreaks explicitly is expected to improve the accuracy of epidemiological modelling forecasts during influenza outbreaks.
Collapse
Affiliation(s)
- Rahil Sachak-Patwa
- Mathematical Institute, University of Oxford, Andrew Wiles Building, Radcliffe Observatory Quarter, Woodstock Road, Oxford, OX2 6GG, UK.
| | - Helen M Byrne
- Mathematical Institute, University of Oxford, Andrew Wiles Building, Radcliffe Observatory Quarter, Woodstock Road, Oxford, OX2 6GG, UK
| | - Robin N Thompson
- Mathematical Institute, University of Oxford, Andrew Wiles Building, Radcliffe Observatory Quarter, Woodstock Road, Oxford, OX2 6GG, UK; Christ Church, University of Oxford, St Aldates, Oxford, OX1 1DP, UK; Present address: Mathematics Institute, University of Warwick, Zeeman Building, Coventry, CV4 7AL, UK
| |
Collapse
|
32
|
Bomfim R, Pei S, Shaman J, Yamana T, Makse HA, Andrade JS, Lima Neto AS, Furtado V. Predicting dengue outbreaks at neighbourhood level using human mobility in urban areas. J R Soc Interface 2020; 17:20200691. [PMID: 33109025 DOI: 10.1098/rsif.2020.0691] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Dengue is a vector-borne disease transmitted by the Aedes genus mosquito. It causes financial burdens on public health systems and considerable morbidity and mortality. Tropical regions in the Americas and Asia are the areas most affected by the virus. Fortaleza is a city with approximately 2.6 million inhabitants in northeastern Brazil that, during the recent decades, has been suffering from endemic dengue transmission, interspersed with larger epidemics. The objective of this paper is to study the impact of human mobility in urban areas on the spread of the dengue virus, and to test whether human mobility data can be used to improve predictions of dengue virus transmission at the neighbourhood level. We present two distinct forecasting systems for dengue transmission in Fortaleza: the first using artificial neural network methods and the second developed using a mechanistic model of disease transmission. We then present enhanced versions of the two forecasting systems that incorporate bus transportation data cataloguing movement among 119 neighbourhoods in Fortaleza. Each forecasting system was used to perform retrospective forecasts for historical dengue outbreaks from 2007 to 2015. Results show that both artificial neural networks and mechanistic models can accurately forecast dengue cases, and that the inclusion of human mobility data substantially improves the performance of both forecasting systems. While the mechanistic models perform better in capturing seasons with large-scale outbreaks, the neural networks more accurately forecast outbreak peak timing, peak intensity and annual dengue time series. These results have two practical implications: they support the creation of public policies from the use of the models created here to combat the disease and help to understand the impact of urban mobility on the epidemic in large cities.
Collapse
Affiliation(s)
- Rafael Bomfim
- Programa de Pós Graduação em Informática Aplicada Universidade de Fortaleza, Fortaleza, Brazil
| | - Sen Pei
- Department of Environmental Health Sciences, Columbia University, New York, NY 10032, USA
| | - Jeffrey Shaman
- Department of Environmental Health Sciences, Columbia University, New York, NY 10032, USA
| | - Teresa Yamana
- Department of Environmental Health Sciences, Columbia University, New York, NY 10032, USA
| | - Hernán A Makse
- Levich Institute and Physics Department, City College of New York, New York, NY 10031, USA
| | - José S Andrade
- Departamento de Física, Universidade Federal do Ceará, Campus do Pici, 60451-970 Fortaleza, Ceará, Brazil
| | - Antonio S Lima Neto
- Secretaria Municipal de Saúde de Fortaleza (SMS-Fortaleza), Fortaleza, Ceará, Brazil.,Centro de Ciências da Saúde, Universidade de Fortaleza (UNIFOR), Fortaleza, Ceará, Brazil
| | - Vasco Furtado
- Programa de Pós Graduação em Informática Aplicada Universidade de Fortaleza, Fortaleza, Brazil
| |
Collapse
|
33
|
Pei S, Shaman J. Aggregating forecasts of multiple respiratory pathogens supports more accurate forecasting of influenza-like illness. PLoS Comput Biol 2020; 16:e1008301. [PMID: 33090997 PMCID: PMC7608986 DOI: 10.1371/journal.pcbi.1008301] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2019] [Revised: 11/03/2020] [Accepted: 09/02/2020] [Indexed: 12/12/2022] Open
Abstract
Influenza-like illness (ILI) is a commonly measured syndromic signal representative of a range of acute respiratory infections. Reliable forecasts of ILI can support better preparation for patient surges in healthcare systems. Although ILI is an amalgamation of multiple pathogens with variable seasonal phasing and attack rates, most existing process-based forecasting systems treat ILI as a single infectious agent. Here, using ILI records and virologic surveillance data, we show that ILI signal can be disaggregated into distinct viral components. We generate separate predictions for six contributing pathogens (influenza A/H1, A/H3, B, respiratory syncytial virus, and human parainfluenza virus types 1-2 and 3), and develop a method to forecast ILI by aggregating these predictions. The relative contribution of each pathogen to the total ILI signal is estimated using a Markov Chain Monte Carlo (MCMC) method upon forecast aggregation. We find highly variable overall contributions from influenza type A viruses across seasons, but relatively stable contributions for the other pathogens. Using historical data from 1997 to 2014 at US national and regional levels, the proposed forecasting system generates improved predictions of both seasonal and near-term targets relative to a baseline method that simulates ILI as a single pathogen. The hierarchical forecasting system can generate predictions for each viral component, as well as infer and predict their contributions to ILI, which may additionally help physicians determine the etiological causes of ILI in clinical settings.
Collapse
Affiliation(s)
- Sen Pei
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, United States of America
| | - Jeffrey Shaman
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, United States of America
| |
Collapse
|
34
|
Kramer SC, Pei S, Shaman J. Forecasting influenza in Europe using a metapopulation model incorporating cross-border commuting and air travel. PLoS Comput Biol 2020; 16:e1008233. [PMID: 33052907 PMCID: PMC7588111 DOI: 10.1371/journal.pcbi.1008233] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2020] [Revised: 10/26/2020] [Accepted: 08/10/2020] [Indexed: 11/18/2022] Open
Abstract
Past work has shown that models incorporating human travel can improve the quality of influenza forecasts. Here, we develop and validate a metapopulation model of twelve European countries, in which international translocation of virus is driven by observed commuting and air travel flows, and use this model to generate influenza forecasts in conjunction with incidence data from the World Health Organization. We find that, although the metapopulation model fits the data well, it offers no improvement over isolated models in forecast quality. We discuss several potential reasons for these results. In particular, we note the need for data that are more comparable from country to country, and offer suggestions as to how surveillance systems might be improved to achieve this goal.
Collapse
Affiliation(s)
- Sarah C Kramer
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, New York, United States of America
| | - Sen Pei
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, New York, United States of America
| | - Jeffrey Shaman
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, New York, United States of America
| |
Collapse
|
35
|
Haw DJ, Pung R, Read JM, Riley S. Strong spatial embedding of social networks generates nonstandard epidemic dynamics independent of degree distribution and clustering. Proc Natl Acad Sci U S A 2020; 117:23636-23642. [PMID: 32900923 PMCID: PMC7519285 DOI: 10.1073/pnas.1910181117] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
Some directly transmitted human pathogens, such as influenza and measles, generate sustained exponential growth in incidence and have a high peak incidence consistent with the rapid depletion of susceptible individuals. Many do not. While a prolonged exponential phase typically arises in traditional disease-dynamic models, current quantitative descriptions of nonstandard epidemic profiles are either abstract, phenomenological, or rely on highly skewed offspring distributions in network models. Here, we create large socio-spatial networks to represent contact behavior using human population-density data, a previously developed fitting algorithm, and gravity-like mobility kernels. We define a basic reproductive number [Formula: see text] for this system, analogous to that used for compartmental models. Controlling for [Formula: see text], we then explore networks with a household-workplace structure in which between-household contacts can be formed with varying degrees of spatial correlation, determined by a single parameter from the gravity-like kernel. By varying this single parameter and simulating epidemic spread, we are able to identify how more frequent local movement can lead to strong spatial correlation and, thus, induce subexponential outbreak dynamics with lower, later epidemic peaks. Also, the ratio of peak height to final size was much smaller when movement was highly spatially correlated. We investigate the topological properties of our networks via a generalized clustering coefficient that extends beyond immediate neighborhoods, identifying very strong correlations between fourth-order clustering and nonstandard epidemic dynamics. Our results motivate the observation of both incidence and socio-spatial human behavior during epidemics that exhibit nonstandard incidence patterns.
Collapse
Affiliation(s)
- David J Haw
- Medical Research Council Centre for Global Infectious Disease Analysis, Department of Infectious Disease Epidemiology, School of Public Health, Imperial College London, London W2 1PG, United Kingdom
| | - Rachael Pung
- Medical Research Council Centre for Global Infectious Disease Analysis, Department of Infectious Disease Epidemiology, School of Public Health, Imperial College London, London W2 1PG, United Kingdom
| | - Jonathan M Read
- Centre for Health Informatics Computing and Statistics, Lancaster Medical School, Lancaster University, Lancaster LA1 4YW, United Kingdom
| | - Steven Riley
- Medical Research Council Centre for Global Infectious Disease Analysis, Department of Infectious Disease Epidemiology, School of Public Health, Imperial College London, London W2 1PG, United Kingdom;
| |
Collapse
|
36
|
Galanti M, Comito D, Ligon C, Lane B, Matienzo N, Ibrahim S, Shittu A, Tagne E, Birger R, Ud-Dean M, Filip I, Morita H, Rabadan R, Anthony S, Freyer GA, Dayan P, Shopsin B, Shaman J. Active surveillance documents rates of clinical care seeking due to respiratory illness. Influenza Other Respir Viruses 2020; 14:499-506. [PMID: 32415751 PMCID: PMC7276732 DOI: 10.1111/irv.12753] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2019] [Revised: 02/04/2020] [Accepted: 04/12/2020] [Indexed: 12/17/2022] Open
Abstract
Background Respiratory viral infections are a leading cause of disease worldwide. However, the overall community prevalence of infections has not been properly assessed, as standard surveillance is typically acquired passively among individuals seeking clinical care. Methods We conducted a prospective cohort study in which participants provided daily diaries and weekly nasopharyngeal specimens that were tested for respiratory viruses. These data were used to analyze healthcare seeking behavior, compared with cross‐sectional ED data and NYC surveillance reports, and used to evaluate biases of medically attended ILI as signal for population respiratory disease and infection. Results The likelihood of seeking medical attention was virus‐dependent: higher for influenza and metapneumovirus (19%‐20%), lower for coronavirus and RSV (4%), and 71% of individuals with self‐reported ILI did not seek care and half of medically attended symptomatic manifestations did not meet the criteria for ILI. Only 5% of cohort respiratory virus infections and 21% of influenza infections were medically attended and classifiable as ILI. We estimated 1 ILI event per person/year but multiple respiratory infections per year. Conclusion Standard, healthcare‐based respiratory surveillance has multiple limitations. Specifically, ILI is an incomplete metric for quantifying respiratory disease, viral respiratory infection, and influenza infection. The prevalence of respiratory viruses, as reported by standard, healthcare‐based surveillance, is skewed toward viruses producing more severe symptoms. Active, longitudinal studies are a helpful supplement to standard surveillance, can improve understanding of the overall circulation and burden of respiratory viruses, and can aid development of more robust measures for controlling the spread of these pathogens.
Collapse
Affiliation(s)
- Marta Galanti
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, USA
| | - Devon Comito
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, USA.,Department of Integrative Biology, University of California, Berkeley, CA, USA
| | - Chanel Ligon
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, USA
| | - Benjamin Lane
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, USA
| | - Nelsa Matienzo
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, USA
| | - Sadiat Ibrahim
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, USA
| | - Atinuke Shittu
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, USA
| | - Eudosie Tagne
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, USA
| | - Ruthie Birger
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, USA.,Department of Epidemiology of Microbial Diseases, Yale School of Public Health, Yale University, New Haven, CT, USA.,The Earth Institute, Columbia University, New York, NY, USA
| | - Minhaz Ud-Dean
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, USA
| | - Ioan Filip
- Department of Systems Biology, Columbia University, New York, NY, USA
| | - Haruka Morita
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, USA
| | - Raul Rabadan
- Department of Systems Biology, Columbia University, New York, NY, USA
| | - Simon Anthony
- Department of Epidemiology, Columbia University, New York, NY, USA
| | - Greg A Freyer
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, USA
| | - Peter Dayan
- Department of Pediatrics, Columbia University, New York, NY, USA
| | - Bo Shopsin
- Departments of Medicine and Microbiology, New York University School of Medicine, New York, NY, USA
| | - Jeffrey Shaman
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, USA
| |
Collapse
|
37
|
Forecasting the 2017/2018 seasonal influenza epidemic in England using multiple dynamic transmission models: a case study. BMC Public Health 2020; 20:486. [PMID: 32293372 PMCID: PMC7158152 DOI: 10.1186/s12889-020-8455-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2019] [Accepted: 03/04/2020] [Indexed: 01/13/2023] Open
Abstract
Background Since the 2009 A/H1N1 pandemic, Public Health England have developed a suite of real-time statistical models utilising enhanced pandemic surveillance data to nowcast and forecast a future pandemic. Their ability to track seasonal influenza and predict heightened winter healthcare burden in the light of high activity in Australia in 2017 was untested. Methods Four transmission models were used in forecasting the 2017/2018 seasonal influenza epidemic in England: a stratified primary care model using daily, region-specific, counts and virological swab positivity of influenza-like illness consultations in general practice (GP); a strain-specific (SS) model using weekly, national GP ILI and virological data; an intensive care model (ICU) using reports of ICU influenza admissions; and a synthesis model that included all data sources. For the first 12 weeks of 2018, each model was applied to the latest data to provide estimates of epidemic parameters and short-term influenza forecasts. The added value of pre-season population susceptibility data was explored. Results The combined results provided valuable nowcasts of the state of the epidemic. Short-term predictions of burden on primary and secondary health services were initially highly variable before reaching consensus beyond the observed peaks in activity between weeks 3–4 of 2018. Estimates for R0 were consistent over time for three of the four models until week 12 of 2018, and there was consistency in the estimation of R0 across the SPC and SS models, and in the ICU attack rates estimated by the ICU and the synthesis model. Estimation and predictions varied according to the assumed levels of pre-season immunity. Conclusions This exercise successfully applied a range of pandemic models to seasonal influenza. Forecasting early in the season remains challenging but represents a crucially important activity to inform planning. Improved knowledge of pre-existing levels of immunity would be valuable.
Collapse
|
38
|
Behrend MR, Basáñez MG, Hamley JID, Porco TC, Stolk WA, Walker M, de Vlas SJ. Modelling for policy: The five principles of the Neglected Tropical Diseases Modelling Consortium. PLoS Negl Trop Dis 2020; 14:e0008033. [PMID: 32271755 PMCID: PMC7144973 DOI: 10.1371/journal.pntd.0008033] [Citation(s) in RCA: 42] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Affiliation(s)
- Matthew R. Behrend
- Neglected Tropical Diseases, Bill & Melinda Gates Foundation, Seattle, Washington, United States of America
- Blue Well 8, Seattle, Washington, United States of America
- * E-mail:
| | - María-Gloria Basáñez
- MRC Centre for Global Infectious Disease Analysis and London Centre for Neglected Tropical Disease Research, Department of Infectious Disease Epidemiology, Imperial College London, London, United Kingdom
| | - Jonathan I. D. Hamley
- MRC Centre for Global Infectious Disease Analysis and London Centre for Neglected Tropical Disease Research, Department of Infectious Disease Epidemiology, Imperial College London, London, United Kingdom
| | - Travis C. Porco
- Francis I. Proctor Foundation for Research in Ophthalmology, Department of Epidemiology and Biostatistics, and Department of Ophthalmology, University of California, San Francisco, United States of America
| | - Wilma A. Stolk
- Department of Public Health, Erasmus MC, University Medical Center Rotterdam, Rotterdam, the Netherlands
| | - Martin Walker
- London Centre for Neglected Tropical Disease Research, Department of Pathobiology and Population Sciences, Royal Veterinary College, Hatfield, Hertfordshire, United Kingdom
- London Centre for Neglected Tropical Disease Research and Department of Infectious Disease Epidemiology, Imperial College London, London, United Kingdom
| | - Sake J. de Vlas
- Department of Public Health, Erasmus MC, University Medical Center Rotterdam, Rotterdam, the Netherlands
| | | |
Collapse
|
39
|
Lu J, Meyer S. Forecasting Flu Activity in the United States: Benchmarking an Endemic-Epidemic Beta Model. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2020; 17:E1381. [PMID: 32098038 PMCID: PMC7068443 DOI: 10.3390/ijerph17041381] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/31/2019] [Revised: 02/07/2020] [Accepted: 02/15/2020] [Indexed: 11/25/2022]
Abstract
Accurate prediction of flu activity enables health officials to plan disease prevention and allocate treatment resources. A promising forecasting approach is to adapt the well-established endemic-epidemic modeling framework to time series of infectious disease proportions. Using U.S. influenza-like illness surveillance data over 18 seasons, we assessed probabilistic forecasts of this new beta autoregressive model with proper scoring rules. Other readily available forecasting tools were used for comparison, including Prophet, (S)ARIMA and kernel conditional density estimation (KCDE). Short-term flu activity was equally well predicted up to four weeks ahead by the beta model with four autoregressive lags and by KCDE; however, the beta model runs much faster. Non-dynamic Prophet scored worst. Relative performance differed for seasonal peak prediction. Prophet produced the best peak intensity forecasts in seasons with standard epidemic curves; otherwise, KCDE outperformed all other methods. Peak timing was best predicted by SARIMA, KCDE or the beta model, depending on the season. The best overall performance when predicting peak timing and intensity was achieved by KCDE. Only KCDE and naive historical forecasts consistently outperformed the equal-bin reference approach for all test seasons. We conclude that the endemic-epidemic beta model is a performant and easy-to-implement tool to forecast flu activity a few weeks ahead. Real-time forecasting of the seasonal peak, however, should consider outputs of multiple models simultaneously, weighing their usefulness as the season progresses.
Collapse
Affiliation(s)
| | - Sebastian Meyer
- Institute of Medical Informatics, Biometry, and Epidemiology, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91054 Erlangen, Germany;
| |
Collapse
|
40
|
Brown AC, Lauer SA, Robinson CC, Nyquist AC, Rao S, Reich NG. Evaluating the ALERT algorithm for local outbreak onset detection in seasonal infectious disease surveillance data. Stat Med 2020; 39:1145-1155. [PMID: 31985869 PMCID: PMC7169531 DOI: 10.1002/sim.8467] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2018] [Revised: 11/29/2019] [Accepted: 12/14/2019] [Indexed: 11/06/2022]
Abstract
Estimation of epidemic onset timing is an important component of controlling the spread of seasonal infectious diseases within community healthcare sites. The Above Local Elevated Respiratory Illness Threshold (ALERT) algorithm uses a threshold-based approach to suggest incidence levels that historically have indicated the transition from endemic to epidemic activity. In this paper, we present the first detailed overview of the computational approach underlying the algorithm. In the motivating example section, we evaluate the performance of ALERT in determining the onset of increased respiratory virus incidence using laboratory testing data from the Children's Hospital of Colorado. At a threshold of 10 cases per week, ALERT-selected intervention periods performed better than the observed hospital site periods (2004/2005-2012/2013) and a CUSUM method. Additional simulation studies show how data properties may effect ALERT performance on novel data. We found that the conditions under which ALERT showed ideal performance generally included high seasonality and low off-season incidence.
Collapse
Affiliation(s)
- Alexandria C Brown
- Department of Biostatistics and Epidemiology, University of Massachusetts, Amherst, Massachusetts
| | - Stephen A Lauer
- Department of Biostatistics and Epidemiology, University of Massachusetts, Amherst, Massachusetts
| | - Christine C Robinson
- Department of Pediatrics, Section of Infectious Diseases and Epidemiology, Department of Epidemiology, University of Colorado School of Medicine and Children's Hospital Colorado, Aurora, Colorado
| | - Ann-Christine Nyquist
- Department of Pediatrics, Section of Infectious Diseases and Epidemiology, Department of Epidemiology, University of Colorado School of Medicine and Children's Hospital Colorado, Aurora, Colorado
| | - Suchitra Rao
- Pediatric Infectious Diseases/Hospital Medicine/Epidemiology, Children's Hospital Colorado and University of Colorado, Aurora, Colorado
| | - Nicholas G Reich
- Department of Biostatistics and Epidemiology, University of Massachusetts, Amherst, Massachusetts
| |
Collapse
|
41
|
Darwish A, Rahhal Y, Jafar A. A comparative study on predicting influenza outbreaks using different feature spaces: application of influenza-like illness data from Early Warning Alert and Response System in Syria. BMC Res Notes 2020; 13:33. [PMID: 31948473 PMCID: PMC6964210 DOI: 10.1186/s13104-020-4889-5] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2019] [Accepted: 01/03/2020] [Indexed: 11/10/2022] Open
Abstract
Objective An accurate forecasting of outbreaks of influenza-like illness (ILI) could support public health officials to suggest public health actions earlier. We investigated the performance of three different feature spaces in different models to forecast the weekly ILI rate in Syria using EWARS data from World Health Organization (WHO). Time series feature space was first used and we applied the seven models which are Naïve, Average, Seasonal naïve, drift, dynamic harmonic regression (Dhr), seasonal and trend decomposition using loess (STL) and TBATS. The Second feature space is like some state-of-the-art, which we named \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$53-weeks-before\_52-first-order-difference$$\end{document}53-weeks-before_52-first-order-difference feature space. The third one, we proposed and named \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$n-years-before\_m-weeks-around$$\end{document}n-years-before_m-weeks-around (YnWm) feature space. Machine learning (ML) and deep learning (DL) model were applied to the second and third feature spaces (generalized linear model (GLM), support vector regression (SVR), gradient boosting (GB), random forest (RF) and long short term memory (LSTM)). Results It was indicated that the LSTM model of four layers with \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$1-year-before\_4-weeks-around$$\end{document}1-year-before_4-weeks-around feature space gave more accurate results than other models and reached the lowest MAPE of \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$3.52\%$$\end{document}3.52% and the lowest RMSE of 0.01662. I hope that this modelling methodology can be applied in other countries and therefore help prevent and control influenza worldwide.
Collapse
Affiliation(s)
- Ali Darwish
- Department of Informatics, Higher Institute for Applied Sciences and Technology, Damascus, Syria.
| | - Yasser Rahhal
- Department of Informatics, Higher Institute for Applied Sciences and Technology, Damascus, Syria
| | - Assef Jafar
- Department of Informatics, Higher Institute for Applied Sciences and Technology, Damascus, Syria
| |
Collapse
|
42
|
Aiello AE, Renson A, Zivich PN. Social Media- and Internet-Based Disease Surveillance for Public Health. Annu Rev Public Health 2020; 41:101-118. [PMID: 31905322 DOI: 10.1146/annurev-publhealth-040119-094402] [Citation(s) in RCA: 117] [Impact Index Per Article: 29.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
Disease surveillance systems are a cornerstone of public health tracking and prevention. This review addresses the use, promise, perils, and ethics of social media- and Internet-based data collection for public health surveillance. Our review highlights untapped opportunities for integrating digital surveillance in public health and current applications that could be improved through better integration, validation, and clarity on rules surrounding ethical considerations. Promising developments include hybrid systems that couple traditional surveillance data with data from search queries, social media posts, and crowdsourcing. In the future, it will be important to identify opportunities for public and private partnerships, train public health experts in data science, reduce biases related to digital data (gathered from Internet use, wearable devices, etc.), and address privacy. We are on the precipice of an unprecedented opportunity to track, predict, and prevent global disease burdens in the population using digital data.
Collapse
Affiliation(s)
- Allison E Aiello
- Department of Epidemiology, Carolina Population Center, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7435, USA; , ,
| | - Audrey Renson
- Department of Epidemiology, Carolina Population Center, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7435, USA; , ,
| | - Paul N Zivich
- Department of Epidemiology, Carolina Population Center, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7435, USA; , ,
| |
Collapse
|
43
|
Shearer FM, Moss R, McVernon J, Ross JV, McCaw JM. Infectious disease pandemic planning and response: Incorporating decision analysis. PLoS Med 2020; 17:e1003018. [PMID: 31917786 PMCID: PMC6952100 DOI: 10.1371/journal.pmed.1003018] [Citation(s) in RCA: 42] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
Freya Shearer and co-authors discuss the use of decision analysis in planning for infectious disease pandemics.
Collapse
Affiliation(s)
- Freya M. Shearer
- Modelling and Simulation Unit, Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Melbourne, Australia
| | - Robert Moss
- Modelling and Simulation Unit, Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Melbourne, Australia
| | - Jodie McVernon
- Modelling and Simulation Unit, Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Melbourne, Australia
- Peter Doherty Institute for Infection and Immunity, The Royal Melbourne Hospital and The University of Melbourne, Australia
- Murdoch Children’s Research Institute, The Royal Children’s Hospital, Melbourne, Australia
| | - Joshua V. Ross
- School of Mathematical Sciences, The University of Adelaide, Adelaide, Australia
| | - James M. McCaw
- Modelling and Simulation Unit, Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Melbourne, Australia
- Peter Doherty Institute for Infection and Immunity, The Royal Melbourne Hospital and The University of Melbourne, Australia
- Murdoch Children’s Research Institute, The Royal Children’s Hospital, Melbourne, Australia
- School of Mathematics and Statistics, The University of Melbourne, Melbourne, Australia
- * E-mail:
| |
Collapse
|
44
|
Lutz CS, Huynh MP, Schroeder M, Anyatonwu S, Dahlgren FS, Danyluk G, Fernandez D, Greene SK, Kipshidze N, Liu L, Mgbere O, McHugh LA, Myers JF, Siniscalchi A, Sullivan AD, West N, Johansson MA, Biggerstaff M. Applying infectious disease forecasting to public health: a path forward using influenza forecasting examples. BMC Public Health 2019; 19:1659. [PMID: 31823751 PMCID: PMC6902553 DOI: 10.1186/s12889-019-7966-8] [Citation(s) in RCA: 56] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2019] [Accepted: 11/19/2019] [Indexed: 01/23/2023] Open
Abstract
BACKGROUND Infectious disease forecasting aims to predict characteristics of both seasonal epidemics and future pandemics. Accurate and timely infectious disease forecasts could aid public health responses by informing key preparation and mitigation efforts. MAIN BODY For forecasts to be fully integrated into public health decision-making, federal, state, and local officials must understand how forecasts were made, how to interpret forecasts, and how well the forecasts have performed in the past. Since the 2013-14 influenza season, the Influenza Division at the Centers for Disease Control and Prevention (CDC) has hosted collaborative challenges to forecast the timing, intensity, and short-term trajectory of influenza-like illness in the United States. Additional efforts to advance forecasting science have included influenza initiatives focused on state-level and hospitalization forecasts, as well as other infectious diseases. Using CDC influenza forecasting challenges as an example, this paper provides an overview of infectious disease forecasting; applications of forecasting to public health; and current work to develop best practices for forecast methodology, applications, and communication. CONCLUSIONS These efforts, along with other infectious disease forecasting initiatives, can foster the continued advancement of forecasting science.
Collapse
Affiliation(s)
- Chelsea S Lutz
- Influenza Division, National Center for Immunization and Respiratory Diseases, Centers for Disease Control and Prevention, Atlanta, GA, 30329, USA.
- Oak Ridge Institute for Science and Education, United States Department of Energy, Oak Ridge, TN, 37830, USA.
- Department of International Health, Johns Hopkins Bloomberg School of Public Health, Johns Hopkins University, Baltimore, MD, 21205, USA.
| | - Mimi P Huynh
- Infectious Disease Program, Council of State and Territorial Epidemiologists, Atlanta, GA, 30345, USA
| | - Monica Schroeder
- Infectious Disease Program, Council of State and Territorial Epidemiologists, Atlanta, GA, 30345, USA
| | - Sophia Anyatonwu
- PHI/CDC Global Health Fellowship Program, Public Health Institute, Oakland, CA, 94607, USA
| | - F Scott Dahlgren
- Influenza Division, National Center for Immunization and Respiratory Diseases, Centers for Disease Control and Prevention, Atlanta, GA, 30329, USA
| | - Gregory Danyluk
- Florida Department of Health in Polk County, Bartow, FL, 33830, USA
| | - Danielle Fernandez
- Epidemiology, Disease Control, and Immunization Services, Florida Department of Health in Miami-Dade County, Miami, FL, 33126, USA
| | - Sharon K Greene
- Bureau of Communicable Disease, New York City Department of Health and Mental Hygiene, Queens, New York, NY, 11101, USA
| | | | - Leann Liu
- Office of Science, Surveillance, and Technology, Harris County Public Health, Houston, TX, 77027, USA
| | - Osaro Mgbere
- Disease Prevention and Control Division, Houston Health Department, Houston, TX, 77054, USA
| | - Lisa A McHugh
- Communicable Disease Service, New Jersey Department of Health, Trenton, NJ, 08608, USA
| | - Jennifer F Myers
- Infectious Diseases Branch, California Department of Public Health, Richmond, CA, 94804, USA
| | - Alan Siniscalchi
- Infectious Disease Section, Epidemiology & Emerging Infections Program, State of Connecticut Department of Health, Hartford, CT, 06134, USA
| | - Amy D Sullivan
- Division of Prevention and Community Health, Washington State Department of Health, Olympia, WA, 98504, USA
| | - Nicole West
- Acute and Communicable Disease Prevention, Oregon Health Authority, Portland, OR, 97232, USA
| | - Michael A Johansson
- Division of Vector-Borne Diseases, National Center for Emerging and Zoonotic Infectious Diseases, Centers for Disease Control and Prevention, San Juan, PR, 00920, USA
| | - Matthew Biggerstaff
- Influenza Division, National Center for Immunization and Respiratory Diseases, Centers for Disease Control and Prevention, Atlanta, GA, 30329, USA
| |
Collapse
|
45
|
Johansson MA, Apfeldorf KM, Dobson S, Devita J, Buczak AL, Baugher B, Moniz LJ, Bagley T, Babin SM, Guven E, Yamana TK, Shaman J, Moschou T, Lothian N, Lane A, Osborne G, Jiang G, Brooks LC, Farrow DC, Hyun S, Tibshirani RJ, Rosenfeld R, Lessler J, Reich NG, Cummings DAT, Lauer SA, Moore SM, Clapham HE, Lowe R, Bailey TC, García-Díez M, Carvalho MS, Rodó X, Sardar T, Paul R, Ray EL, Sakrejda K, Brown AC, Meng X, Osoba O, Vardavas R, Manheim D, Moore M, Rao DM, Porco TC, Ackley S, Liu F, Worden L, Convertino M, Liu Y, Reddy A, Ortiz E, Rivero J, Brito H, Juarrero A, Johnson LR, Gramacy RB, Cohen JM, Mordecai EA, Murdock CC, Rohr JR, Ryan SJ, Stewart-Ibarra AM, Weikel DP, Jutla A, Khan R, Poultney M, Colwell RR, Rivera-García B, Barker CM, Bell JE, Biggerstaff M, Swerdlow D, Mier-Y-Teran-Romero L, Forshey BM, Trtanj J, Asher J, Clay M, Margolis HS, Hebbeler AM, George D, Chretien JP. An open challenge to advance probabilistic forecasting for dengue epidemics. Proc Natl Acad Sci U S A 2019; 116:24268-24274. [PMID: 31712420 PMCID: PMC6883829 DOI: 10.1073/pnas.1909865116] [Citation(s) in RCA: 104] [Impact Index Per Article: 20.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
A wide range of research has promised new tools for forecasting infectious disease dynamics, but little of that research is currently being applied in practice, because tools do not address key public health needs, do not produce probabilistic forecasts, have not been evaluated on external data, or do not provide sufficient forecast skill to be useful. We developed an open collaborative forecasting challenge to assess probabilistic forecasts for seasonal epidemics of dengue, a major global public health problem. Sixteen teams used a variety of methods and data to generate forecasts for 3 epidemiological targets (peak incidence, the week of the peak, and total incidence) over 8 dengue seasons in Iquitos, Peru and San Juan, Puerto Rico. Forecast skill was highly variable across teams and targets. While numerous forecasts showed high skill for midseason situational awareness, early season skill was low, and skill was generally lowest for high incidence seasons, those for which forecasts would be most valuable. A comparison of modeling approaches revealed that average forecast skill was lower for models including biologically meaningful data and mechanisms and that both multimodel and multiteam ensemble forecasts consistently outperformed individual model forecasts. Leveraging these insights, data, and the forecasting framework will be critical to improve forecast skill and the application of forecasts in real time for epidemic preparedness and response. Moreover, key components of this project-integration with public health needs, a common forecasting framework, shared and standardized data, and open participation-can help advance infectious disease forecasting beyond dengue.
Collapse
Affiliation(s)
- Michael A Johansson
- Division of Vector-Borne Diseases, Centers for Disease Control and Prevention, San Juan 00920, Puerto Rico;
- Department of Epidemiology, Harvard T. H. Chan School of Public Health, Boston, MA 02115
| | | | - Scott Dobson
- Data Analytics, Areté Associates, Northridge, CA 91324
| | - Jason Devita
- Data Analytics, Areté Associates, Northridge, CA 91324
| | - Anna L Buczak
- Systems Integration Branch, Johns Hopkins University Applied Physics Laboratory, Laurel, MD 20723
| | - Benjamin Baugher
- Systems Integration Branch, Johns Hopkins University Applied Physics Laboratory, Laurel, MD 20723
| | - Linda J Moniz
- Systems Integration Branch, Johns Hopkins University Applied Physics Laboratory, Laurel, MD 20723
| | - Thomas Bagley
- Systems Integration Branch, Johns Hopkins University Applied Physics Laboratory, Laurel, MD 20723
| | - Steven M Babin
- Systems Integration Branch, Johns Hopkins University Applied Physics Laboratory, Laurel, MD 20723
| | - Erhan Guven
- Systems Integration Branch, Johns Hopkins University Applied Physics Laboratory, Laurel, MD 20723
| | - Teresa K Yamana
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY 10032
| | - Jeffrey Shaman
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY 10032
| | - Terry Moschou
- Data to Decisions Cooperative Research Center, Kent Town, SA 5067, Australia
| | - Nick Lothian
- Data to Decisions Cooperative Research Center, Kent Town, SA 5067, Australia
| | - Aaron Lane
- Data to Decisions Cooperative Research Center, Kent Town, SA 5067, Australia
| | - Grant Osborne
- Data to Decisions Cooperative Research Center, Kent Town, SA 5067, Australia
| | - Gao Jiang
- Heinz College Information System Management, Carnegie Mellon University, Adelaide, SA 5000, Australia
| | - Logan C Brooks
- School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213
| | - David C Farrow
- School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213
| | - Sangwon Hyun
- Department of Statistics, Carnegie Mellon University, Pittsburgh, PA 15213
| | - Ryan J Tibshirani
- School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213
- Department of Statistics, Carnegie Mellon University, Pittsburgh, PA 15213
| | - Roni Rosenfeld
- School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213
| | - Justin Lessler
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21205
| | - Nicholas G Reich
- Department of Biostatistics and Epidemiology, School of Public Health and Health Sciences, University of Massachusetts, Amherst, MA 01003
| | - Derek A T Cummings
- Department of Biology, University of Florida, Gainesville, FL 32611
- Emerging Pathogens Institute, University of Florida, Gainesville, FL 32611
| | - Stephen A Lauer
- Department of Biostatistics and Epidemiology, School of Public Health and Health Sciences, University of Massachusetts, Amherst, MA 01003
| | - Sean M Moore
- Department of Biological Sciences, University of Notre Dame, Notre Dame, IN 46556
- Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN 46556
| | - Hannah E Clapham
- Hospital for Tropical Diseases, Oxford University Clinical Research Unit, Ho Chi Minh City, Vietnam
| | - Rachel Lowe
- Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London WC1E 7HT, United Kingdom
- Climate and Health Program, Barcelona Institute for Global Health, 08003 Barcelona, Spain
| | - Trevor C Bailey
- College of Engineering, Mathematics and Physical Sciences, University of Exeter, Exeter EX4 4QF, United Kingdom
| | | | - Marilia Sá Carvalho
- Scientific Computation Program, Oswaldo Cruz Foundation, Rio de Janeiro 21040-900, Brazil
| | - Xavier Rodó
- Climate and Health Program, Barcelona Institute for Global Health, 08003 Barcelona, Spain
- Catalan Institution for Research and Advanced Studies, 08010 Barcelona, Spain
| | - Tridip Sardar
- Catalan Institution for Research and Advanced Studies, 08010 Barcelona, Spain
| | - Richard Paul
- Department of Mathematical Biology, Indian Statistical Institute, Kolkata, India 700108
- Pasteur Kyoto International Joint Research Unit for Integrative Vaccinomics, 606-8501 Kyoto, Japan
| | - Evan L Ray
- Department of Global Health, Centre National de la Recherche Scientifique, 75016 Paris, France
| | - Krzysztof Sakrejda
- Department of Biostatistics and Epidemiology, School of Public Health and Health Sciences, University of Massachusetts, Amherst, MA 01003
| | - Alexandria C Brown
- Department of Biostatistics and Epidemiology, School of Public Health and Health Sciences, University of Massachusetts, Amherst, MA 01003
| | - Xi Meng
- Department of Biostatistics and Epidemiology, School of Public Health and Health Sciences, University of Massachusetts, Amherst, MA 01003
| | - Osonde Osoba
- Department of Mathematics and Statistics, Mount Holyoke College, South Hadley, MA 01075
| | - Raffaele Vardavas
- Department of Mathematics and Statistics, Mount Holyoke College, South Hadley, MA 01075
| | | | - Melinda Moore
- Department of Mathematics and Statistics, Mount Holyoke College, South Hadley, MA 01075
| | | | - Travis C Porco
- Department of Computer Science and Software Engineering, Miami University, Oxford, OH 45056
| | - Sarah Ackley
- Department of Computer Science and Software Engineering, Miami University, Oxford, OH 45056
| | - Fengchen Liu
- Department of Computer Science and Software Engineering, Miami University, Oxford, OH 45056
| | - Lee Worden
- Department of Computer Science and Software Engineering, Miami University, Oxford, OH 45056
| | - Matteo Convertino
- F. I. Proctor Foundation for Research in Ophthalmology, University of California, San Francisco, CA 94122
| | - Yang Liu
- Information Science and Technology, Hokkaido University, Sapporo 060-0808, Japan
| | - Abraham Reddy
- Information Science and Technology, Hokkaido University, Sapporo 060-0808, Japan
| | - Eloy Ortiz
- Division of Environmental Health Sciences, School of Public Health, University of Minnesota, Twin Cities, MN 55455
| | - Jorge Rivero
- Division of Environmental Health Sciences, School of Public Health, University of Minnesota, Twin Cities, MN 55455
| | - Humberto Brito
- Division of Environmental Health Sciences, School of Public Health, University of Minnesota, Twin Cities, MN 55455
- VectorAnalytica, Washington, DC 20007
| | - Alicia Juarrero
- Division of Environmental Health Sciences, School of Public Health, University of Minnesota, Twin Cities, MN 55455
- Department of Aeronautical Engineering, Universidade de Sao Paolo, Sao Paolo 13566-590, Brazil
| | - Leah R Johnson
- Department of Philosophy, University of Miami, Coral Gables, FL 33146
| | | | - Jeremy M Cohen
- Department of Statistics, Virginia Tech, Blacksburg, VA 24060
| | - Erin A Mordecai
- Integrative Biology, University of South Florida, Tampa, FL 33620
| | - Courtney C Murdock
- Department of Biology, Stanford University, Stanford, CA 94305
- Infectious Diseases, College of Veterinary Medicine, University of Georgia, Athens, GA 30602
| | - Jason R Rohr
- Department of Biological Sciences, University of Notre Dame, Notre Dame, IN 46556
- Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN 46556
| | - Sadie J Ryan
- Emerging Pathogens Institute, University of Florida, Gainesville, FL 32611
- Odum School of Ecology, University of Georgia, Athens, GA 30602
- Department of Geography, University of Florida, Gainesville, FL 32608
| | | | - Daniel P Weikel
- Department of Medicine, State University of New York Upstate Medical University, Syracuse, NY 13421
| | - Antarpreet Jutla
- Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109
| | - Rakibul Khan
- Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109
| | - Marissa Poultney
- Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109
| | - Rita R Colwell
- Department of Civil and Environmental Engineering, West Virginia University, Morgantown, WV 26505
| | - Brenda Rivera-García
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD 20742
| | | | - Jesse E Bell
- Department of Pathology, Microbiology, and Immunology, School of Veterinary Medicine, University of California, Davis, CA 95616
| | - Matthew Biggerstaff
- Department of Environmental, Agricultural, and Occupational Health, College of Public Health, University of Nebraska Medical Center, Omaha, NE 68198
| | - David Swerdlow
- Department of Environmental, Agricultural, and Occupational Health, College of Public Health, University of Nebraska Medical Center, Omaha, NE 68198
| | - Luis Mier-Y-Teran-Romero
- Division of Vector-Borne Diseases, Centers for Disease Control and Prevention, San Juan 00920, Puerto Rico
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21205
| | - Brett M Forshey
- Influenza Division, Centers for Disease Control and Prevention, Atlanta, GA 30329
| | - Juli Trtanj
- Armed Forces Health Surveillance Branch, Department of Defense, Silver Spring, MD 20904
| | - Jason Asher
- Climate Program Office, National Oceanic and Atmospheric Administration, Silver Spring, MD 20910
| | - Matt Clay
- Climate Program Office, National Oceanic and Atmospheric Administration, Silver Spring, MD 20910
| | - Harold S Margolis
- Division of Vector-Borne Diseases, Centers for Disease Control and Prevention, San Juan 00920, Puerto Rico
| | - Andrew M Hebbeler
- Leidos supporting the Biomedical Advanced Research and Development Authority, Department of Health and Human Services, Washington, DC 20201
- Bureau of Oceans, International Environmental and Scientific Affairs, US Department of State, Washington, DC 20520
| | - Dylan George
- Bureau of Oceans, International Environmental and Scientific Affairs, US Department of State, Washington, DC 20520
- Office of Science and Technology Policy, The White House, Washington, DC 20502
| | - Jean-Paul Chretien
- Bureau of Oceans, International Environmental and Scientific Affairs, US Department of State, Washington, DC 20520
- BNext, In-Q-Tel, Arlington, VA 22201
| |
Collapse
|
46
|
Rangarajan P, Mody SK, Marathe M. Forecasting dengue and influenza incidences using a sparse representation of Google trends, electronic health records, and time series data. PLoS Comput Biol 2019; 15:e1007518. [PMID: 31751346 PMCID: PMC6894887 DOI: 10.1371/journal.pcbi.1007518] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2019] [Revised: 12/05/2019] [Accepted: 10/29/2019] [Indexed: 12/20/2022] Open
Abstract
Dengue and influenza-like illness (ILI) are two of the leading causes of viral infection in the world and it is estimated that more than half the world’s population is at risk for developing these infections. It is therefore important to develop accurate methods for forecasting dengue and ILI incidences. Since data from multiple sources (such as dengue and ILI case counts, electronic health records and frequency of multiple internet search terms from Google Trends) can improve forecasts, standard time series analysis methods are inadequate to estimate all the parameter values from the limited amount of data available if we use multiple sources. In this paper, we use a computationally efficient implementation of the known variable selection method that we call the Autoregressive Likelihood Ratio (ARLR) method. This method combines sparse representation of time series data, electronic health records data (for ILI) and Google Trends data to forecast dengue and ILI incidences. This sparse representation method uses an algorithm that maximizes an appropriate likelihood ratio at every step. Using numerical experiments, we demonstrate that our method recovers the underlying sparse model much more accurately than the lasso method. We apply our method to dengue case count data from five countries/states: Brazil, Mexico, Singapore, Taiwan, and Thailand and to ILI case count data from the United States. Numerical experiments show that our method outperforms existing time series forecasting methods in forecasting the dengue and ILI case counts. In particular, our method gives a 18 percent forecast error reduction over a leading method that also uses data from multiple sources. It also performs better than other methods in predicting the peak value of the case count and the peak time. Dengue and influenza-like illness (ILI) are leading causes of viral infection in the world and hence it is important to develop accurate methods for forecasting their incidence. We use Autoregressive Likelihood Ratio method, which is a computationally efficient implementation of the variable selection method, in order to obtain a sparse (non-lasso) representation of time series, Google Trends and electronic health records (for ILI) data. This method is used to forecast dengue incidence in five countries/states and ILI incidence in USA. We show that this method outperforms existing time series methods in forecasting these diseases. The method is general and can also be used to forecast other diseases.
Collapse
Affiliation(s)
- Prashant Rangarajan
- Departments of Computer Science and Mathematics, Birla Institute of Technology and Science, Pilani, India
| | - Sandeep K. Mody
- Department of Mathematics, Indian Institute of Science, Bangalore, India
| | - Madhav Marathe
- Department of Computer Science, Network, Simulation Science and Advanced Computing Division, Biocomplexity Institute, University of Virginia, Charlottesville, Virginia, United States of America
- * E-mail:
| |
Collapse
|
47
|
Baltrusaitis K, Vespignani A, Rosenfeld R, Gray J, Raymond D, Santillana M. Differences in Regional Patterns of Influenza Activity Across Surveillance Systems in the United States: Comparative Evaluation. JMIR Public Health Surveill 2019; 5:e13403. [PMID: 31579019 PMCID: PMC6777281 DOI: 10.2196/13403] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2019] [Revised: 07/02/2019] [Accepted: 07/19/2019] [Indexed: 01/30/2023] Open
Abstract
BACKGROUND The Centers for Disease Control and Prevention (CDC) tracks influenza-like illness (ILI) using information on patient visits to health care providers through the Outpatient Influenza-like Illness Surveillance Network (ILINet). As participation in this system is voluntary, the composition, coverage, and consistency of health care reports vary from state to state, leading to different measures of ILI activity between regions. The degree to which these measures reflect actual differences in influenza activity or systematic differences in the methods used to collect and aggregate the data is unclear. OBJECTIVE The objective of our study was to qualitatively and quantitatively compare national and region-specific ILI activity in the United States across 4 surveillance data sources-CDC ILINet, Flu Near You (FNY), athenahealth, and HealthTweets.org-to determine whether these data sources, commonly used as input in influenza modeling efforts, show geographical patterns that are similar to those observed in CDC ILINet's data. We also compared the yearly percentage of FNY participants who sought health care for ILI symptoms across geographical areas. METHODS We compared the national and regional 2018-2019 ILI activity baselines, calculated using noninfluenza weeks from previous years, for each surveillance data source. We also compared measures of ILI activity across geographical areas during 3 influenza seasons, 2015-2016, 2016-2017, and 2017-2018. Geographical differences in weekly ILI activity within each data source were also assessed using relative mean differences and time series heatmaps. National and regional age-adjusted health care-seeking percentages were calculated for each influenza season by dividing the number of FNY participants who sought medical care for ILI symptoms by the total number of ILI reports within an influenza season. Pearson correlations were used to assess the association between the health care-seeking percentages and baselines for each surveillance data source. RESULTS We observed consistent differences in ILI activity across geographical areas for CDC ILINet and athenahealth data. ILI activity for FNY displayed little variation across geographical areas, whereas differences in ILI activity for HealthTweets.org were associated with the total number of tweets within a geographical area. The percentage of FNY participants who sought health care for ILI symptoms differed slightly across geographical areas, and these percentages were positively correlated with CDC ILINet and athenahealth baselines. CONCLUSIONS Our findings suggest that differences in ILI activity across geographical areas as reported by a given surveillance system may not accurately reflect true differences in the prevalence of ILI. Instead, these differences may reflect systematic collection and aggregation biases that are particular to each system and consistent across influenza seasons. These findings are potentially relevant in the real-time analysis of the influenza season and in the definition of unbiased forecast models.
Collapse
Affiliation(s)
- Kristin Baltrusaitis
- Department of Biostatistics, Boston University School of Public Health, Boston, MA, United States
| | | | - Roni Rosenfeld
- School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, United States
| | - Josh Gray
- athenaResearch at athenahealth, Watertown, MA, United States
| | - Dorrie Raymond
- athenaResearch at athenahealth, Watertown, MA, United States
| | - Mauricio Santillana
- Computational Health Informatics Program, Boston Children's Hospital, Boston, MA, United States.,Department of Pediatrics, Harvard Medical School, Boston, MA, United States
| |
Collapse
|
48
|
George DB, Taylor W, Shaman J, Rivers C, Paul B, O'Toole T, Johansson MA, Hirschman L, Biggerstaff M, Asher J, Reich NG. Technology to advance infectious disease forecasting for outbreak management. Nat Commun 2019; 10:3932. [PMID: 31477707 PMCID: PMC6718692 DOI: 10.1038/s41467-019-11901-7] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2019] [Accepted: 08/07/2019] [Indexed: 12/17/2022] Open
Abstract
Forecasting is beginning to be integrated into decision-making processes for infectious disease outbreak response. We discuss how technologies could accelerate the adoption of forecasting among public health practitioners, improve epidemic management, save lives, and reduce the economic impact of outbreaks.
Collapse
Affiliation(s)
| | | | - Jeffrey Shaman
- Climate and Health Program, Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, USA
| | - Caitlin Rivers
- Center for Health Security, Johns Hopkins University, Baltimore, MD, USA
| | | | | | - Michael A Johansson
- Division of Vector-Borne Diseases, Centers for Disease Control and Prevention, San Juan, PR, USA
| | | | - Matthew Biggerstaff
- Influenza Division, Centers for Disease Control and Prevention, Atlanta, GA, USA
| | | | - Nicholas G Reich
- Department of Biostatistics and Epidemiology, University of Massachusetts-Amherst, Amherst, MA, USA
| |
Collapse
|
49
|
Soliman M, Lyubchich V, Gel YR. Complementing the power of deep learning with statistical model fusion: Probabilistic forecasting of influenza in Dallas County, Texas, USA. Epidemics 2019; 28:100345. [PMID: 31182294 DOI: 10.1016/j.epidem.2019.05.004] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2018] [Revised: 03/08/2019] [Accepted: 05/06/2019] [Indexed: 02/06/2023] Open
Abstract
Influenza is one of the main causes of death, not only in the USA but worldwide. Its significant economic and public health impacts necessitate development of accurate and efficient algorithms for forecasting of any upcoming influenza outbreaks. Most currently available methods for influenza prediction are based on parametric time series and regression models that impose restrictive and often unverifiable assumptions on the data. In turn, more flexible machine learning models and, particularly, deep learning tools whose utility is proven in a wide range of disciplines, remain largely under-explored in epidemiological forecasting. We study the seasonal influenza in Dallas County by evaluating the forecasting ability of deep learning with feedforward neural networks as well as performance of more conventional statistical models, such as beta regression, autoregressive integrated moving average (ARIMA), least absolute shrinkage and selection operators (LASSO), and non-parametric multivariate adaptive regression splines (MARS) models for one week and two weeks ahead forecasting. Furthermore, we assess forecasting utility of Google search queries and meteorological data as exogenous predictors of influenza activity. Finally, we develop a probabilistic forecasting of influenza in Dallas County by fusing all the considered models using Bayesian model averaging.
Collapse
Affiliation(s)
- Marwah Soliman
- Department of Mathematical Sciences, University of Texas at Dallas, Richardson, TX, USA
| | - Vyacheslav Lyubchich
- Chesapeake Biological Laboratory, University of Maryland Center for Environmental Science, Solomons, MD, USA.
| | - Yulia R Gel
- Department of Mathematical Sciences, University of Texas at Dallas, Richardson, TX, USA
| |
Collapse
|
50
|
Kim J, Ahn I. Weekly ILI patient ratio change prediction using news articles with support vector machine. BMC Bioinformatics 2019; 20:259. [PMID: 31109286 PMCID: PMC6528302 DOI: 10.1186/s12859-019-2894-2] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2019] [Accepted: 05/10/2019] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Influenza continues to pose a serious threat to human health worldwide. For this reason, detecting influenza infection patterns is critical. However, as the epidemic spread of influenza occurs sporadically and rapidly, it is not easy to estimate the future variance of influenza virus infection. Furthermore, accumulating influenza related data is not easy, because the type of data that is associated with influenza is very limited. For these reasons, identifying useful data and building a prediction model with these data are necessary steps toward predicting if the number of patients will increase or decrease. On the Internet, numerous press releases are published every day that reflect currently pending issues. RESULTS In this research, we collected Internet articles related to infectious diseases from the Centre for Health Protection (CHP), which is maintained the by Hong Kong Department of Health, to see if news text data could be used to predict the spread of influenza. In total, 7769 articles related to infectious diseases published from 2004 January to 2018 January were collected. We evaluated the predictive ability of article text data from the period of 2013-2018 for each of the weekly time horizons. The support vector machine (SVM) model was used for prediction in order to examine the use of information embedded in the web articles and detect the pattern of influenza spread variance. The prediction result using news text data with SVM exhibited a mean accuracy of 86.7 % on predicting whether weekly ILI patient ratio would increase or decrease, and a root mean square error of 0.611 on estimating the weekly ILI patient ratio. CONCLUSIONS In order to remedy the problems of conventional data, using news articles can be a suitable choice, because they can help estimate if ILI patient ratio will increase or decrease as well as how many patients will be affected, as shown in the result of research. Thus, advancements in research on using news articles for influenza prediction should continue to be pursed, as the result showed acceptable performance as compared to existing influenza prediction researches.
Collapse
Affiliation(s)
- Juhyeon Kim
- Department of data-centric problem solving research, Korea Institute of Science and Technology Information, Yuseong-gu, Daejeon, Korea
- Center for Convergent Research of Emerging Virus Infection, Korea Research Institute of Chemical Technology, Yuseong-gu, Daejeon, Korea
| | - Insung Ahn
- Department of data-centric problem solving research, Korea Institute of Science and Technology Information, Yuseong-gu, Daejeon, Korea.
- Center for Convergent Research of Emerging Virus Infection, Korea Research Institute of Chemical Technology, Yuseong-gu, Daejeon, Korea.
| |
Collapse
|