1
|
Andronico A, Paireau J, Cauchemez S. Integrating information from historical data into mechanistic models for influenza forecasting. PLoS Comput Biol 2024; 20:e1012523. [PMID: 39475955 PMCID: PMC11524484 DOI: 10.1371/journal.pcbi.1012523] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Accepted: 09/27/2024] [Indexed: 11/02/2024] Open
Abstract
Seasonal influenza causes significant annual morbidity and mortality worldwide. In France, it is estimated that, on average, 2 million individuals consult their GP for influenza-like-illness (ILI) every year. Traditionally, mathematical models used for epidemic forecasting can either include parameters capturing the infection process (mechanistic or compartmental models) or rely on time series analysis approaches that do not make mechanistic assumptions (statistical or phenomenological models). While the latter make extensive use of past epidemic data, mechanistic models are usually independently initialized in each season. As a result, forecasts from such models can contain trajectories that are vastly different from past epidemics. We developed a mechanistic model that takes into account epidemic data from training seasons when producing forecasts. The parameters of the model are estimated via a first particle filter running on the observed data. A second particle filter is then used to produce forecasts compatible with epidemic trajectories from the training set. The model was calibrated and tested on 35 years' worth of surveillance data from the French Sentinelles Network, representing the weekly number of patients consulting for ILI over the period 1985-2019. Our results show that the new method improves upon standard mechanistic approaches. In particular, when retrospectively tested on the available data, our model provides increased accuracy for short-term forecasts (from one to four weeks into the future) and peak timing and intensity. Our new approach for epidemic forecasting allows the integration of key strengths of the statistical approach into the mechanistic modelling framework and represents an attempt to provide accurate forecasts by making full use of the rich surveillance dataset collected in France since 1985.
Collapse
Affiliation(s)
- Alessio Andronico
- Mathematical Modelling of Infectious Diseases Unit, Institut Pasteur, Université Paris Cité, UMR2000 CNRS, Paris, France
| | - Juliette Paireau
- Mathematical Modelling of Infectious Diseases Unit, Institut Pasteur, Université Paris Cité, UMR2000 CNRS, Paris, France
- Infectious Diseases Department, Santé publique France, Saint-Maurice, France
| | - Simon Cauchemez
- Mathematical Modelling of Infectious Diseases Unit, Institut Pasteur, Université Paris Cité, UMR2000 CNRS, Paris, France
| |
Collapse
|
2
|
Zhang H, Yin L, Mao L, Mei S, Chen T, Liu K, Feng S. Combinational Recommendation of Vaccinations, Mask-Wearing, and Home-Quarantine to Control Influenza in Megacities: An Agent-Based Modeling Study With Large-Scale Trajectory Data. Front Public Health 2022; 10:883624. [PMID: 35719665 PMCID: PMC9204335 DOI: 10.3389/fpubh.2022.883624] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2022] [Accepted: 03/31/2022] [Indexed: 11/15/2022] Open
Abstract
The outbreak of COVID-19 stimulated a new round of discussion on how to deal with respiratory infectious diseases. Influenza viruses have led to several pandemics worldwide. The spatiotemporal characteristics of influenza transmission in modern cities, especially megacities, are not well-known, which increases the difficulty of influenza prevention and control for populous urban areas. For a long time, influenza prevention and control measures have focused on vaccination of the elderly and children, and school closure. Since the outbreak of COVID-19, the public's awareness of measures such as vaccinations, mask-wearing, and home-quarantine has generally increased in some regions of the world. To control the influenza epidemic and reduce the proportion of infected people with high mortality, the combination of these three measures needs quantitative evaluation based on the spatiotemporal transmission characteristics of influenza in megacities. Given that the agent-based model with both demographic attributes and fine-grained mobility is a key planning tool in deploying intervention strategies, this study proposes a spatially explicit agent-based influenza model for assessing and recommending the combinations of influenza control measures. This study considers Shenzhen city, China as the research area. First, a spatially explicit agent-based influenza transmission model was developed by integrating large-scale individual trajectory data and human response behavior. Then, the model was evaluated across multiple intra-urban spatial scales based on confirmed influenza cases. Finally, the model was used to evaluate the combined effects of the three interventions (V: vaccinations, M: mask-wearing, and Q: home-quarantining) under different compliance rates, and their optimal combinations for given control objectives were recommended. This study reveals that adults were a high-risk population with a low reporting rate, and children formed the lowest infected proportion and had the highest reporting rate in Shenzhen. In addition, this study systematically recommended different combinations of vaccinations, mask-wearing, and home-quarantine with different compliance rates for different control objectives to deal with the influenza epidemic. For example, the "V45%-M60%-Q20%" strategy can maintain the infection percentage below 5%, while the "V20%-M60%-Q20%" strategy can maintain the infection percentage below 15%. The model and policy recommendations from this study provide a tool and intervention reference for influenza epidemic management in the post-COVID-19 era.
Collapse
Affiliation(s)
- Hao Zhang
- Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Ling Yin
- Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Liang Mao
- Department of Geography, University of Florida, Gainesville, FL, United States
| | - Shujiang Mei
- Shenzhen Center for Disease Control and Prevention, Shenzhen, China
| | - Tianmu Chen
- State Key Laboratory of Molecular Vaccinology and Molecular Diagnostics, School of Public Health, Xiamen University, Xiamen, China
| | - Kang Liu
- Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| | | |
Collapse
|
3
|
Miliou I, Xiong X, Rinzivillo S, Zhang Q, Rossetti G, Giannotti F, Pedreschi D, Vespignani A. Predicting seasonal influenza using supermarket retail records. PLoS Comput Biol 2021; 17:e1009087. [PMID: 34252075 PMCID: PMC8297944 DOI: 10.1371/journal.pcbi.1009087] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2020] [Revised: 07/22/2021] [Accepted: 05/15/2021] [Indexed: 11/19/2022] Open
Abstract
Increased availability of epidemiological data, novel digital data streams, and the rise of powerful machine learning approaches have generated a surge of research activity on real-time epidemic forecast systems. In this paper, we propose the use of a novel data source, namely retail market data to improve seasonal influenza forecasting. Specifically, we consider supermarket retail data as a proxy signal for influenza, through the identification of sentinel baskets, i.e., products bought together by a population of selected customers. We develop a nowcasting and forecasting framework that provides estimates for influenza incidence in Italy up to 4 weeks ahead. We make use of the Support Vector Regression (SVR) model to produce the predictions of seasonal flu incidence. Our predictions outperform both a baseline autoregressive model and a second baseline based on product purchases. The results show quantitatively the value of incorporating retail market data in forecasting models, acting as a proxy that can be used for the real-time analysis of epidemics.
Collapse
Affiliation(s)
- Ioanna Miliou
- University of Pisa, Pisa, Italy
- ISTI-CNR, Pisa, Italy
| | - Xinyue Xiong
- Northeastern University, Boston, Massachusetts, United States of America
| | | | - Qian Zhang
- Northeastern University, Boston, Massachusetts, United States of America
| | | | | | | | | |
Collapse
|
4
|
Bhatia S, Lassmann B, Cohn E, Desai AN, Carrion M, Kraemer MUG, Herringer M, Brownstein J, Madoff L, Cori A, Nouvellet P. Using digital surveillance tools for near real-time mapping of the risk of infectious disease spread. NPJ Digit Med 2021; 4:73. [PMID: 33864009 PMCID: PMC8052406 DOI: 10.1038/s41746-021-00442-3] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Accepted: 03/16/2021] [Indexed: 02/02/2023] Open
Abstract
Data from digital disease surveillance tools such as ProMED and HealthMap can complement the field surveillance during ongoing outbreaks. Our aim was to investigate the use of data collected through ProMED and HealthMap in real-time outbreak analysis. We developed a flexible statistical model to quantify spatial heterogeneity in the risk of spread of an outbreak and to forecast short term incidence trends. The model was applied retrospectively to data collected by ProMED and HealthMap during the 2013-2016 West African Ebola epidemic and for comparison, to WHO data. Using ProMED and HealthMap data, the model was able to robustly quantify the risk of disease spread 1-4 weeks in advance and for countries at risk of case importations, quantify where this risk comes from. Our study highlights that ProMED and HealthMap data could be used in real-time to quantify the spatial heterogeneity in risk of spread of an outbreak.
Collapse
Affiliation(s)
- Sangeeta Bhatia
- MRC Centre for Global Infectious Disease Analysis, School of Public Health, Imperial College London, Faculty of Medicine, London, UK.
| | - Britta Lassmann
- ProMED, International Society for Infectious Diseases, Brookline, MA, USA
| | - Emily Cohn
- Computational Epidemiology Group, Division of Emergency Medicine, Boston Children's Hospital, Boston, MA, USA
| | - Angel N Desai
- ProMED, International Society for Infectious Diseases, Brookline, MA, USA
| | - Malwina Carrion
- ProMED, International Society for Infectious Diseases, Brookline, MA, USA
- Department of Health Science, Sargent College, Boston University, Boston, MA, USA
| | - Moritz U G Kraemer
- Computational Epidemiology Group, Division of Emergency Medicine, Boston Children's Hospital, Boston, MA, USA
- Department of Zoology, Tinbergen Building, Oxford University, Oxford, UK
- Department of Pediatrics, Harvard Medical School, Boston, USA
| | | | - John Brownstein
- Computational Epidemiology Group, Division of Emergency Medicine, Boston Children's Hospital, Boston, MA, USA
| | - Larry Madoff
- ProMED, International Society for Infectious Diseases, Brookline, MA, USA
| | - Anne Cori
- MRC Centre for Global Infectious Disease Analysis, School of Public Health, Imperial College London, Faculty of Medicine, London, UK
| | - Pierre Nouvellet
- MRC Centre for Global Infectious Disease Analysis, School of Public Health, Imperial College London, Faculty of Medicine, London, UK
- School of Life Sciences, University of Sussex, Brighton, UK
| |
Collapse
|
5
|
Singh RK, Drews M, De la Sen M, Srivastava PK, Trisasongko BH, Kumar M, Pandey MK, Anand A, Singh SS, Pandey AK, Dobriyal M, Rani M, Kumar P. Highlighting the compound risk of COVID-19 and environmental pollutants using geospatial technology. Sci Rep 2021; 11:8363. [PMID: 33863975 PMCID: PMC8052456 DOI: 10.1038/s41598-021-87877-6] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2020] [Accepted: 04/06/2021] [Indexed: 02/06/2023] Open
Abstract
The new COVID-19 coronavirus disease has emerged as a global threat and not just to human health but also the global economy. Due to the pandemic, most countries affected have therefore imposed periods of full or partial lockdowns to restrict community transmission. This has had the welcome but unexpected side effect that existing levels of atmospheric pollutants, particularly in cities, have temporarily declined. As found by several authors, air quality can inherently exacerbate the risks linked to respiratory diseases, including COVID-19. In this study, we explore patterns of air pollution for ten of the most affected countries in the world, in the context of the 2020 development of the COVID-19 pandemic. We find that the concentrations of some of the principal atmospheric pollutants were temporarily reduced during the extensive lockdowns in the spring. Secondly, we show that the seasonality of the atmospheric pollutants is not significantly affected by these temporary changes, indicating that observed variations in COVID-19 conditions are likely to be linked to air quality. On this background, we confirm that air pollution may be a good predictor for the local and national severity of COVID-19 infections.
Collapse
Affiliation(s)
- Ram Kumar Singh
- Department of Natural Resources, TERI School of Advanced Studies, New Delhi, 110070, India
| | - Martin Drews
- Department of Technology, Management and Economics, Technical University of Denmark, 2800, Kgs. Lyngby, Denmark
| | - Manuel De la Sen
- Department of Electricity and Electronics, Institute of Research and Development of Processes IIDP, University of the Basque Country, Campus of Leioa, PO Box 48940, Leioa (Bizkaia), Spain
| | - Prashant Kumar Srivastava
- Remote Sensing Laboratory, Institute of Environment and Sustainable Development, Banaras Hindu University, Varanasi, Uttar Pradesh, 221005, India
- DST-Mahamana Centre of Excellence in Climate Change Research, Institute of Environment and Sustainable Development, Banaras Hindu University, Varanasi, Uttar Pradesh, 221005, India
| | - Bambang H Trisasongko
- Department of Soil Science and Land Resource and Geospatial Information and Technologies for the Integrative and Intelligent Agriculture (GITIIA), Bogor Agricultural University, Bogor, 16680, Indonesia
| | - Manoj Kumar
- GIS Centre, Forest Research Institute (FRI), PO: New Forest, Dehradun, 248006, India
| | - Manish Kumar Pandey
- Remote Sensing Laboratory, Institute of Environment and Sustainable Development, Banaras Hindu University, Varanasi, Uttar Pradesh, 221005, India
| | - Akash Anand
- Remote Sensing Laboratory, Institute of Environment and Sustainable Development, Banaras Hindu University, Varanasi, Uttar Pradesh, 221005, India
| | - S S Singh
- Directorate of Extension Education, Rani Lakshmi Bai Central Agricultural University, Jhansi, 284003, India
| | - A K Pandey
- College of Horticulture and Forestry, Rani Lakshmi Bai Central Agricultural University, Jhansi, 284003, India
| | - Manmohan Dobriyal
- College of Horticulture and Forestry, Rani Lakshmi Bai Central Agricultural University, Jhansi, 284003, India
| | - Meenu Rani
- Department of Geography, Kumaun University, Nainital, Uttarakhand, 263001, India
| | - Pavan Kumar
- College of Horticulture and Forestry, Rani Lakshmi Bai Central Agricultural University, Jhansi, 284003, India.
| |
Collapse
|
6
|
Nikolopoulos K, Punia S, Schäfers A, Tsinopoulos C, Vasilakis C. Forecasting and planning during a pandemic: COVID-19 growth rates, supply chain disruptions, and governmental decisions. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH 2021; 290:99-115. [PMID: 32836717 PMCID: PMC7413852 DOI: 10.1016/j.ejor.2020.08.001] [Citation(s) in RCA: 140] [Impact Index Per Article: 46.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/05/2020] [Accepted: 08/03/2020] [Indexed: 05/03/2023]
Abstract
Policymakers during COVID-19 operate in uncharted territory and must make tough decisions. Operational Research - the ubiquitous 'science of better' - plays a vital role in supporting this decision-making process. To that end, using data from the USA, India, UK, Germany, and Singapore up to mid-April 2020, we provide predictive analytics tools for forecasting and planning during a pandemic. We forecast COVID-19 growth rates with statistical, epidemiological, machine- and deep-learning models, and a new hybrid forecasting method based on nearest neighbors and clustering. We further model and forecast the excess demand for products and services during the pandemic using auxiliary data (google trends) and simulating governmental decisions (lockdown). Our empirical results can immediately help policymakers and planners make better decisions during the ongoing and future pandemics.
Collapse
Affiliation(s)
| | | | | | | | - Chrysovalantis Vasilakis
- forLAB, Bangor Business School, Wales
- Université Catholique de Louvain (IRES) (Belgium), Institute for the Study of Labor (IZA) (Germany), University of the Aegean (Greece)
| |
Collapse
|
7
|
Forecasting Weekly Influenza Outpatient Visits Using a Two-Dimensional Hierarchical Decision Tree Scheme. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2020; 17:ijerph17134743. [PMID: 32630311 PMCID: PMC7369891 DOI: 10.3390/ijerph17134743] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/27/2020] [Revised: 06/24/2020] [Accepted: 06/27/2020] [Indexed: 01/20/2023]
Abstract
Influenza is a serious public health issue, as it can cause acute suffering and even death, social disruption, and economic loss. Effective forecasting of influenza outpatient visits is beneficial to anticipate and prevent medical resource shortages. This study uses regional data on influenza outpatient visits to propose a two-dimensional hierarchical decision tree scheme for forecasting influenza outpatient visits. The Taiwan weekly influenza outpatient visit data were collected from the national infectious disease statistics system and used for an empirical example. The 788 data points start in the first week of 2005 and end in the second week of 2020. The empirical results revealed that the proposed forecasting scheme outperformed five competing models and was able to forecast one to four weeks of anticipated influenza outpatient visits. The scheme may be an effective and promising alternative for forecasting one to four steps (weeks) ahead of nationwide influenza outpatient visits in Taiwan. Our results also suggest that, for forecasting nationwide influenza outpatient visits in Taiwan, one- and two-time lag information and regional information from the Taipei, North, and South regions are significant.
Collapse
|
8
|
The design and evaluation of a Bayesian system for detecting and characterizing outbreaks of influenza. Online J Public Health Inform 2019; 11:e6. [PMID: 31632600 DOI: 10.5210/ojphi.v11i2.9952] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
The prediction and characterization of outbreaks of infectious diseases such as influenza remains an open and important problem. This paper describes a framework for detecting and characterizing outbreaks of influenza and the results of testing it on data from ten outbreaks collected from two locations over five years. We model outbreaks with compartment models and explicitly model non-influenza influenza-like illnesses.
Collapse
|
9
|
Moa A, Muscatello D, Chughtai A, Chen X, MacIntyre CR. Flucast: A Real-Time Tool to Predict Severity of an Influenza Season. JMIR Public Health Surveill 2019; 5:e11780. [PMID: 31339102 PMCID: PMC6683655 DOI: 10.2196/11780] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2018] [Revised: 05/31/2019] [Accepted: 06/18/2019] [Indexed: 01/09/2023] Open
Abstract
Background Influenza causes serious illness requiring annual health system surge capacity, yet annual seasonal variation makes it difficult to forecast and plan for the severity of an upcoming season. Research shows that hospital and health system stakeholders indicate a preference for forecasting tools that are easy to use and understand to assist with surge capacity planning for influenza. Objective This study aimed to develop a simple risk prediction tool, Flucast, to predict the severity of an emerging influenza season. Methods Study data were obtained from the National Notifiable Diseases Surveillance System and Australian Influenza Surveillance Reports from the Department of Health, Australia. We tested Flucast using retrospective seasonal data for 11 Australian influenza seasons. We compared five different models using parameters known early in the season that may be associated with the severity of the season. To calibrate the tool, the resulting estimates of seasonal severity were validated against independent reports of influenza-attributable morbidity and mortality. The model with the highest predictive accuracy against retrospective seasonal activity was chosen as a best-fit model to develop the Flucast tool. The tool was prospectively tested against the 2018 and the emerging 2019 influenza season. Results The Flucast tool predicted the severity of all retrospectively studied years correctly for influenza seasonal activity in Australia. With the use of real-time data, the tool provided a reasonable early prediction of a low to moderate season for the 2018 and severe seasonal activity for the upcoming 2019 season. The tool meets stakeholder preferences for simplicity and ease of use to assist with surge capacity planning. Conclusions The Flucast tool may be useful to inform future health system influenza preparedness planning, surge capacity, and intervention programs in real time, and can be adapted for different settings and geographic locations.
Collapse
Affiliation(s)
- Aye Moa
- Biosecurity Program, The Kirby Institute, University of New South Wales, Sydney, Australia
| | - David Muscatello
- School of Public Health and Community Medicine, Faculty of Medicine, University of New South Wales, Sydney, Australia
| | - Abrar Chughtai
- School of Public Health and Community Medicine, Faculty of Medicine, University of New South Wales, Sydney, Australia
| | - Xin Chen
- Biosecurity Program, The Kirby Institute, University of New South Wales, Sydney, Australia
| | - C Raina MacIntyre
- Biosecurity Program, The Kirby Institute, University of New South Wales, Sydney, Australia.,College of Health Solutions and College of Public Service & Community Solutions, Arizona State University, Tempe, AZ, United States
| |
Collapse
|
10
|
Funk S, Camacho A, Kucharski AJ, Lowe R, Eggo RM, Edmunds WJ. Assessing the performance of real-time epidemic forecasts: A case study of Ebola in the Western Area region of Sierra Leone, 2014-15. PLoS Comput Biol 2019; 15:e1006785. [PMID: 30742608 PMCID: PMC6386417 DOI: 10.1371/journal.pcbi.1006785] [Citation(s) in RCA: 48] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2018] [Revised: 02/22/2019] [Accepted: 01/14/2019] [Indexed: 11/30/2022] Open
Abstract
Real-time forecasts based on mathematical models can inform critical decision-making during infectious disease outbreaks. Yet, epidemic forecasts are rarely evaluated during or after the event, and there is little guidance on the best metrics for assessment. Here, we propose an evaluation approach that disentangles different components of forecasting ability using metrics that separately assess the calibration, sharpness and bias of forecasts. This makes it possible to assess not just how close a forecast was to reality but also how well uncertainty has been quantified. We used this approach to analyse the performance of weekly forecasts we generated in real time for Western Area, Sierra Leone, during the 2013-16 Ebola epidemic in West Africa. We investigated a range of forecast model variants based on the model fits generated at the time with a semi-mechanistic model, and found that good probabilistic calibration was achievable at short time horizons of one or two weeks ahead but model predictions were increasingly unreliable at longer forecasting horizons. This suggests that forecasts may have been of good enough quality to inform decision making based on predictions a few weeks ahead of time but not longer, reflecting the high level of uncertainty in the processes driving the trajectory of the epidemic. Comparing forecasts based on the semi-mechanistic model to simpler null models showed that the best semi-mechanistic model variant performed better than the null models with respect to probabilistic calibration, and that this would have been identified from the earliest stages of the outbreak. As forecasts become a routine part of the toolkit in public health, standards for evaluation of performance will be important for assessing quality and improving credibility of mathematical models, and for elucidating difficulties and trade-offs when aiming to make the most useful and reliable forecasts.
Collapse
Affiliation(s)
- Sebastian Funk
- Centre for the Mathematical Modelling of Infectious Diseases, London School of Hygiene & Tropical Medicine, London, United Kingdom
- Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, United Kingdom
| | - Anton Camacho
- Centre for the Mathematical Modelling of Infectious Diseases, London School of Hygiene & Tropical Medicine, London, United Kingdom
- Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, United Kingdom
- Epicentre, Paris, France
| | - Adam J. Kucharski
- Centre for the Mathematical Modelling of Infectious Diseases, London School of Hygiene & Tropical Medicine, London, United Kingdom
- Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, United Kingdom
| | - Rachel Lowe
- Centre for the Mathematical Modelling of Infectious Diseases, London School of Hygiene & Tropical Medicine, London, United Kingdom
- Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, United Kingdom
- Barcelona Institute for Global Health (ISGlobal), Barcelona, Spain
| | - Rosalind M. Eggo
- Centre for the Mathematical Modelling of Infectious Diseases, London School of Hygiene & Tropical Medicine, London, United Kingdom
- Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, United Kingdom
| | - W. John Edmunds
- Centre for the Mathematical Modelling of Infectious Diseases, London School of Hygiene & Tropical Medicine, London, United Kingdom
- Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, United Kingdom
| |
Collapse
|
11
|
Osthus D, Daughton AR, Priedhorsky R. Even a good influenza forecasting model can benefit from internet-based nowcasts, but those benefits are limited. PLoS Comput Biol 2019; 15:e1006599. [PMID: 30707689 PMCID: PMC6373968 DOI: 10.1371/journal.pcbi.1006599] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2018] [Revised: 02/13/2019] [Accepted: 10/30/2018] [Indexed: 11/19/2022] Open
Abstract
The ability to produce timely and accurate flu forecasts in the United States can significantly impact public health. Augmenting forecasts with internet data has shown promise for improving forecast accuracy and timeliness in controlled settings, but results in practice are less convincing, as models augmented with internet data have not consistently outperformed models without internet data. In this paper, we perform a controlled experiment, taking into account data backfill, to improve clarity on the benefits and limitations of augmenting an already good flu forecasting model with internet-based nowcasts. Our results show that a good flu forecasting model can benefit from the augmentation of internet-based nowcasts in practice for all considered public health-relevant forecasting targets. The degree of forecast improvement due to nowcasting, however, is uneven across forecasting targets, with short-term forecasting targets seeing the largest improvements and seasonal targets such as the peak timing and intensity seeing relatively marginal improvements. The uneven forecasting improvements across targets hold even when "perfect" nowcasts are used. These findings suggest that further improvements to flu forecasting, particularly seasonal targets, will need to derive from other, non-nowcasting approaches.
Collapse
Affiliation(s)
- Dave Osthus
- Los Alamos National Laboratory, Los Alamos, New Mexico, USA
| | - Ashlynn R. Daughton
- Los Alamos National Laboratory, Los Alamos, New Mexico, USA
- University of Colorado Boulder, Boulder, Colorado, USA
| | | |
Collapse
|
12
|
Ertem Z, Raymond D, Meyers LA. Optimal multi-source forecasting of seasonal influenza. PLoS Comput Biol 2018; 14:e1006236. [PMID: 30180212 PMCID: PMC6138397 DOI: 10.1371/journal.pcbi.1006236] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2018] [Revised: 09/14/2018] [Accepted: 05/28/2018] [Indexed: 11/18/2022] Open
Abstract
Forecasting the emergence and spread of influenza viruses is an important public health challenge. Timely and accurate estimates of influenza prevalence, particularly of severe cases requiring hospitalization, can improve control measures to reduce transmission and mortality. Here, we extend a previously published machine learning method for influenza forecasting to integrate multiple diverse data sources, including traditional surveillance data, electronic health records, internet search traffic, and social media activity. Our hierarchical framework uses multi-linear regression to combine forecasts from multiple data sources and greedy optimization with forward selection to sequentially choose the most predictive combinations of data sources. We show that the systematic integration of complementary data sources can substantially improve forecast accuracy over single data sources. When forecasting the Center for Disease Control and Prevention (CDC) influenza-like-illness reports (ILINet) from week 48 through week 20, the optimal combination of predictors includes public health surveillance data and commercially available electronic medical records, but neither search engine nor social media data.
Collapse
Affiliation(s)
- Zeynep Ertem
- Department of Statistics and Data Science, The University of Texas at Austin, Austin, Texas, United States of America
- * E-mail:
| | - Dorrie Raymond
- athenaResearch, Watertown, Massachusetts, United States of America
| | - Lauren Ancel Meyers
- Departments of Integrative Biology and Statistics and Data Science, The University of Texas at Austin, Austin, Texas, United States of America
- The Santa Fe Institute, Santa Fe, New Mexico, United States of America
| |
Collapse
|
13
|
Morita H, Kramer S, Heaney A, Gil H, Shaman J. Influenza forecast optimization when using different surveillance data types and geographic scale. Influenza Other Respir Viruses 2018; 12:755-764. [PMID: 30028083 PMCID: PMC6185890 DOI: 10.1111/irv.12594] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2018] [Accepted: 07/11/2018] [Indexed: 11/15/2022] Open
Abstract
Background Advance warning of influenza incidence levels from skillful forecasts could help public health officials and healthcare providers implement more timely preparedness and intervention measures to combat outbreaks. Compared to influenza predictions generated at regional and national levels, those generated at finer scales could offer greater value in determining locally appropriate measures; however, to date, the various influenza surveillance data that are collected by state and county departments of health have not been well utilized in influenza prediction. Objectives To assess whether an influenza forecast model system can be optimized to generate accurate forecasts using novel surveillance data streams. Methods Here, we generate retrospective influenza forecasts with a dynamic, compartmental model‐inference system using surveillance data for influenza‐like illness (ILI), laboratory‐confirmed cases, and pneumonia and influenza mortality at state and county levels. We evaluate how specification of 3 system inputs—scaling, observational error variance (OEV), and filter divergence (lambda)—affects forecast accuracy. Results In retrospective forecasts, and across data types, there were no clear optimal combinations for the 3 system inputs; however, scaling was most critical to forecast accuracy, whereas OEV and lambda were not. Conclusions Forecasts using new data streams should be tested to determine an appropriate scaling value using historical data and analyzed for forecast accuracy.
Collapse
Affiliation(s)
- Haruka Morita
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York City, New York
| | - Sarah Kramer
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York City, New York
| | - Alexandra Heaney
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York City, New York
| | - Harold Gil
- Marion County Public Health Department, Indianapolis, Indiana
| | - Jeffrey Shaman
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York City, New York
| |
Collapse
|
14
|
Willem L, Verelst F, Bilcke J, Hens N, Beutels P. Lessons from a decade of individual-based models for infectious disease transmission: a systematic review (2006-2015). BMC Infect Dis 2017; 17:612. [PMID: 28893198 PMCID: PMC5594572 DOI: 10.1186/s12879-017-2699-8] [Citation(s) in RCA: 77] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2016] [Accepted: 08/22/2017] [Indexed: 02/18/2023] Open
Abstract
Background Individual-based models (IBMs) are useful to simulate events subject to stochasticity and/or heterogeneity, and have become well established to model the potential (re)emergence of pathogens (e.g., pandemic influenza, bioterrorism). Individual heterogeneity at the host and pathogen level is increasingly documented to influence transmission of endemic diseases and it is well understood that the final stages of elimination strategies for vaccine-preventable childhood diseases (e.g., polio, measles) are subject to stochasticity. Even so it appears IBMs for both these phenomena are not well established. We review a decade of IBM publications aiming to obtain insights in their advantages, pitfalls and rationale for use and to make recommendations facilitating knowledge transfer within and across disciplines. Methods We systematically identified publications in Web of Science and PubMed from 2006-2015 based on title/abstract/keywords screening (and full-text if necessary) to retrieve topics, modeling purposes and general specifications. We extracted detailed modeling features from papers on established vaccine-preventable childhood diseases based on full-text screening. Results We identified 698 papers, which applied an IBM for infectious disease transmission, and listed these in a reference database, describing their general characteristics. The diversity of disease-topics and overall publication frequency have increased over time (38 to 115 annual publications from 2006 to 2015). The inclusion of intervention strategies (8 to 52) and economic consequences (1 to 20) are increasing, to the detriment of purely theoretical explorations. Unfortunately, terminology used to describe IBMs is inconsistent and ambiguous. We retrieved 24 studies on a vaccine-preventable childhood disease (covering 7 different diseases), with publication frequency increasing from the first such study published in 2008. IBMs have been useful to explore heterogeneous between- and within-host interactions, but combined applications are still sparse. The amount of missing information on model characteristics and study design is remarkable. Conclusions IBMs are suited to combine heterogeneous within- and between-host interactions, which offers many opportunities, especially to analyze targeted interventions for endemic infections. We advocate the exchange of (open-source) platforms and stress the need for consistent “branding”. Using (existing) conventions and reporting protocols would stimulate cross-fertilization between research groups and fields, and ultimately policy making in decades to come. Electronic supplementary material The online version of this article (doi:10.1186/s12879-017-2699-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Lander Willem
- Centre for Health Economics Research & Modeling Infectious Diseases, Vaccine and Infectious Disease Institute, University of Antwerp, Antwerp, Belgium.
| | - Frederik Verelst
- Centre for Health Economics Research & Modeling Infectious Diseases, Vaccine and Infectious Disease Institute, University of Antwerp, Antwerp, Belgium
| | - Joke Bilcke
- Centre for Health Economics Research & Modeling Infectious Diseases, Vaccine and Infectious Disease Institute, University of Antwerp, Antwerp, Belgium
| | - Niel Hens
- Centre for Health Economics Research & Modeling Infectious Diseases, Vaccine and Infectious Disease Institute, University of Antwerp, Antwerp, Belgium.,Interuniversity Institute for Biostatistics and statistical Bioinformatics, UHasselt, Hasselt, Belgium
| | - Philippe Beutels
- Centre for Health Economics Research & Modeling Infectious Diseases, Vaccine and Infectious Disease Institute, University of Antwerp, Antwerp, Belgium.,School of Public Health and Community Medicine, The University of New South Wales, Sydney, Australia
| |
Collapse
|
15
|
|
16
|
Muscatello DJ, Chughtai AA, Heywood A, Gardner LM, Heslop DJ, MacIntyre CR. Translation of Real-Time Infectious Disease Modeling into Routine Public Health Practice. Emerg Infect Dis 2017; 23. [PMID: 28418309 PMCID: PMC5403034 DOI: 10.3201/eid2305.161720] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open
Abstract
Infectious disease dynamic modeling can support outbreak emergency responses. We conducted a workshop to canvas the needs of stakeholders in Australia for practical, real-time modeling tools for infectious disease emergencies. The workshop was attended by 29 participants who represented government, defense, general practice, and academia stakeholders. We found that modeling is underused in Australia and its potential is poorly understood by practitioners involved in epidemic responses. The development of better modeling tools is desired. Ideal modeling tools for operational use would be easy to use, clearly indicate underlying parameterization and assumptions, and assist with policy and decision making.
Collapse
|
17
|
Xu Q, Gel YR, Ramirez Ramirez LL, Nezafati K, Zhang Q, Tsui KL. Forecasting influenza in Hong Kong with Google search queries and statistical model fusion. PLoS One 2017; 12:e0176690. [PMID: 28464015 PMCID: PMC5413039 DOI: 10.1371/journal.pone.0176690] [Citation(s) in RCA: 67] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2017] [Accepted: 04/15/2017] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND The objective of this study is to investigate predictive utility of online social media and web search queries, particularly, Google search data, to forecast new cases of influenza-like-illness (ILI) in general outpatient clinics (GOPC) in Hong Kong. To mitigate the impact of sensitivity to self-excitement (i.e., fickle media interest) and other artifacts of online social media data, in our approach we fuse multiple offline and online data sources. METHODS Four individual models: generalized linear model (GLM), least absolute shrinkage and selection operator (LASSO), autoregressive integrated moving average (ARIMA), and deep learning (DL) with Feedforward Neural Networks (FNN) are employed to forecast ILI-GOPC both one week and two weeks in advance. The covariates include Google search queries, meteorological data, and previously recorded offline ILI. To our knowledge, this is the first study that introduces deep learning methodology into surveillance of infectious diseases and investigates its predictive utility. Furthermore, to exploit the strength from each individual forecasting models, we use statistical model fusion, using Bayesian model averaging (BMA), which allows a systematic integration of multiple forecast scenarios. For each model, an adaptive approach is used to capture the recent relationship between ILI and covariates. RESULTS DL with FNN appears to deliver the most competitive predictive performance among the four considered individual models. Combing all four models in a comprehensive BMA framework allows to further improve such predictive evaluation metrics as root mean squared error (RMSE) and mean absolute predictive error (MAPE). Nevertheless, DL with FNN remains the preferred method for predicting locations of influenza peaks. CONCLUSIONS The proposed approach can be viewed a feasible alternative to forecast ILI in Hong Kong or other countries where ILI has no constant seasonal trend and influenza data resources are limited. The proposed methodology is easily tractable and computationally efficient.
Collapse
Affiliation(s)
- Qinneng Xu
- City University of Hong Kong, Hong Kong SAR, China
| | - Yulia R. Gel
- University of Texas at Dallas, Dallas, United States of America
| | | | - Kusha Nezafati
- University of Texas at Dallas, Dallas, United States of America
| | | | | |
Collapse
|
18
|
Knight J, Wells S, Marshall R, Exeter D, Jackson R. Developing a synthetic national population to investigate the impact of different cardiovascular disease risk management strategies: A derivation and validation study. PLoS One 2017; 12:e0173170. [PMID: 28384217 PMCID: PMC5383032 DOI: 10.1371/journal.pone.0173170] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2016] [Accepted: 02/16/2017] [Indexed: 01/19/2023] Open
Abstract
BACKGROUND Many national cardiovascular disease (CVD) risk factor management guidelines now recommend that drug treatment decisions should be informed primarily by patients' multi-variable predicted risk of CVD, rather than on the basis of single risk factor thresholds. To investigate the potential impact of treatment guidelines based on CVD risk thresholds at a national level requires individual level data representing the multi-variable CVD risk factor profiles for a country's total adult population. As these data are seldom, if ever, available, we aimed to create a synthetic population, representing the joint CVD risk factor distributions of the adult New Zealand population. METHODS AND RESULTS A synthetic population of 2,451,278 individuals, representing the actual age, gender, ethnicity and social deprivation composition of people aged 30-84 years who completed the 2013 New Zealand census was generated using Monte Carlo sampling. Each 'synthetic' person was then probabilistically assigned values of the remaining cardiovascular disease (CVD) risk factors required for predicting their CVD risk, based on data from the national census national hospitalisation and drug dispensing databases and a large regional cohort study, using Monte Carlo sampling and multiple imputation. Where possible, the synthetic population CVD risk distributions for each non-demographic risk factor were validated against independent New Zealand data sources. CONCLUSIONS We were able to develop a synthetic national population with realistic multi-variable CVD risk characteristics. The construction of this population is the first step in the development of a micro-simulation model intended to investigate the likely impact of a range of national CVD risk management strategies that will inform CVD risk management guideline updates in New Zealand and elsewhere.
Collapse
Affiliation(s)
- Josh Knight
- Centre for Health Policy, School of Population and Global Health, University of Melbourne, Melbourne, Australia
- School of Population Health, Faculty of Medical and Health Sciences, University of Auckland, Auckland, New Zealand
| | - Susan Wells
- School of Population Health, Faculty of Medical and Health Sciences, University of Auckland, Auckland, New Zealand
| | - Roger Marshall
- School of Population Health, Faculty of Medical and Health Sciences, University of Auckland, Auckland, New Zealand
| | - Daniel Exeter
- School of Population Health, Faculty of Medical and Health Sciences, University of Auckland, Auckland, New Zealand
| | - Rod Jackson
- School of Population Health, Faculty of Medical and Health Sciences, University of Auckland, Auckland, New Zealand
| |
Collapse
|
19
|
Osthus D, Hickmann KS, Caragea PC, Higdon D, Del Valle SY. Forecasting seasonal influenza with a state-space SIR model. Ann Appl Stat 2017; 11:202-224. [PMID: 28979611 PMCID: PMC5623938 DOI: 10.1214/16-aoas1000] [Citation(s) in RCA: 59] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
Seasonal influenza is a serious public health and societal problem due to its consequences resulting from absenteeism, hospitalizations, and deaths. The overall burden of influenza is captured by the Centers for Disease Control and Prevention's influenza-like illness network, which provides invaluable information about the current incidence. This information is used to provide decision support regarding prevention and response efforts. Despite the relatively rich surveillance data and the recurrent nature of seasonal influenza, forecasting the timing and intensity of seasonal influenza in the U.S. remains challenging because the form of the disease transmission process is uncertain, the disease dynamics are only partially observed, and the public health observations are noisy. Fitting a probabilistic state-space model motivated by a deterministic mathematical model [a susceptible-infectious-recovered (SIR) model] is a promising approach for forecasting seasonal influenza while simultaneously accounting for multiple sources of uncertainty. A significant finding of this work is the importance of thoughtfully specifying the prior, as results critically depend on its specification. Our conditionally specified prior allows us to exploit known relationships between latent SIR initial conditions and parameters and functions of surveillance data. We demonstrate advantages of our approach relative to alternatives via a forecasting comparison using several forecast accuracy metrics.
Collapse
Affiliation(s)
- Dave Osthus
- Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA
- Department of Statistics, Iowa State University, 2409 Snedecor Hall, Ames, Iowa 50011, USA
| | - Kyle S Hickmann
- Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA
| | - Petruţa C Caragea
- Department of Statistics, Iowa State University, 2409 Snedecor Hall, Ames, Iowa 50011, USA
| | - Dave Higdon
- Social Decision Analytics Laboratory, Biocomplexity Institute of Virginia Tech, 900 N Glebe Rd., Arlington, Virginia 22203, USA
- Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA
| | - Sara Y Del Valle
- Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA
| |
Collapse
|
20
|
Twitter Influenza Surveillance: Quantifying Seasonal Misdiagnosis Patterns and their Impact on Surveillance Estimates. Online J Public Health Inform 2016; 8:e198. [PMID: 28210419 PMCID: PMC5302465 DOI: 10.5210/ojphi.v8i3.7011] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
BACKGROUND Influenza (flu) surveillance using Twitter data can potentially save lives and increase efficiency by providing governments and healthcare organizations with greater situational awareness. However, research is needed to determine the impact of Twitter users' misdiagnoses on surveillance estimates. OBJECTIVE This study establishes the importance of Twitter users' misdiagnoses by showing that Twitter flu surveillance in the United States failed during the 2011-2012 flu season, estimates the extent of misdiagnoses, and tests several methods for reducing the adverse effects of misdiagnoses. METHODS Metrics representing flu prevalence, seasonal misdiagnosis patterns, diagnosis uncertainty, flu symptoms, and noise were produced using Twitter data in conjunction with OpenSextant for geo-inferencing, and a maximum entropy classifier for identifying tweets related to illness. These metrics were tested for correlations with World Health Organization (WHO) positive specimen counts of flu from 2011 to 2014. RESULTS Twitter flu surveillance erroneously indicated a typical flu season during 2011-2012, even though the flu season peaked three months late, and erroneously indicated plateaus of flu tweets before the 2012-2013 and 2013-2014 flu seasons. Enhancements based on estimates of misdiagnoses removed the erroneous plateaus and increased the Pearson correlation coefficients by .04 and .23, but failed to correct the 2011-2012 flu season estimate. A rough estimate indicates that approximately 40% of flu tweets reflected misdiagnoses. CONCLUSIONS Further research into factors affecting Twitter users' misdiagnoses, in conjunction with data from additional atypical flu seasons, is needed to enable Twitter flu surveillance systems to produce reliable estimates during atypical flu seasons.
Collapse
|
21
|
Yang W, Olson DR, Shaman J. Forecasting Influenza Outbreaks in Boroughs and Neighborhoods of New York City. PLoS Comput Biol 2016; 12:e1005201. [PMID: 27855155 PMCID: PMC5113861 DOI: 10.1371/journal.pcbi.1005201] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2016] [Accepted: 10/05/2016] [Indexed: 11/26/2022] Open
Abstract
The ideal spatial scale, or granularity, at which infectious disease incidence should be monitored and forecast has been little explored. By identifying the optimal granularity for a given disease and host population, and matching surveillance and prediction efforts to this scale, response to emergent and recurrent outbreaks can be improved. Here we explore how granularity and representation of spatial structure affect influenza forecast accuracy within New York City. We develop network models at the borough and neighborhood levels, and use them in conjunction with surveillance data and a data assimilation method to forecast influenza activity. These forecasts are compared to an alternate system that predicts influenza for each borough or neighborhood in isolation. At the borough scale, influenza epidemics are highly synchronous despite substantial differences in intensity, and inclusion of network connectivity among boroughs generally improves forecast accuracy. At the neighborhood scale, we observe much greater spatial heterogeneity among influenza outbreaks including substantial differences in local outbreak timing and structure; however, inclusion of the network model structure generally degrades forecast accuracy. One notable exception is that local outbreak onset, particularly when signal is modest, is better predicted with the network model. These findings suggest that observation and forecast at sub-municipal scales within New York City provides richer, more discriminant information on influenza incidence, particularly at the neighborhood scale where greater heterogeneity exists, and that the spatial spread of influenza among localities can be forecast. Influenza, or the flu, causes significant morbidity and mortality during both seasonal and pandemic outbreaks. Recently developed influenza forecast systems have the potential to aid public health planning for and mitigation of the burden of this disease. However, current forecasts are often generated at spatial scales (e.g. national level) that are coarser than the scales at which public health measures and interventions are implemented (e.g. community level). Here we build and test influenza forecast systems at county and community levels, which either include spatial connectivity among locations or are run in isolation. We test these four flu forecast systems (i.e. 2 models × 2 spatial scales) using data collected from 2008 to 2013, including the 2009 pandemic, for the five boroughs (corresponding to county level) and 42 neighborhoods (corresponding to community level) in New York City. We compare the performance of the four forecast systems in predicting the onset, duration, and intensity of flu outbreaks and found that the performance varied by spatial scale (borough vs. neighborhood), season (non-pandemic vs. pandemic) and metric (onset, duration, and intensity). In general, the inclusion of spatial network connectivity in the forecast model improves forecast accuracy at the borough scale but degrades accuracy at the neighborhood scale.
Collapse
Affiliation(s)
- Wan Yang
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, New York, United States of America
- * E-mail:
| | - Donald R. Olson
- New York City Department of Health and Mental Hygiene, New York, New York, United States of America
| | - Jeffrey Shaman
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, New York, United States of America
| |
Collapse
|
22
|
Yamana TK, Kandula S, Shaman J. Superensemble forecasts of dengue outbreaks. J R Soc Interface 2016; 13:20160410. [PMID: 27733698 PMCID: PMC5095208 DOI: 10.1098/rsif.2016.0410] [Citation(s) in RCA: 60] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2016] [Accepted: 09/14/2016] [Indexed: 11/12/2022] Open
Abstract
In recent years, a number of systems capable of predicting future infectious disease incidence have been developed. As more of these systems are operationalized, it is important that the forecasts generated by these different approaches be formally reconciled so that individual forecast error and bias are reduced. Here we present a first example of such multi-system, or superensemble, forecast. We develop three distinct systems for predicting dengue, which are applied retrospectively to forecast outbreak characteristics in San Juan, Puerto Rico. We then use Bayesian averaging methods to combine the predictions from these systems and create superensemble forecasts. We demonstrate that on average, the superensemble approach produces more accurate forecasts than those made from any of the individual forecasting systems.
Collapse
Affiliation(s)
- Teresa K Yamana
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY 10032, US
| | - Sasikiran Kandula
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY 10032, US
| | - Jeffrey Shaman
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY 10032, US
| |
Collapse
|
23
|
Accurate estimation of influenza epidemics using Google search data via ARGO. Proc Natl Acad Sci U S A 2015; 112:14473-8. [PMID: 26553980 DOI: 10.1073/pnas.1515373112] [Citation(s) in RCA: 164] [Impact Index Per Article: 18.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Accurate real-time tracking of influenza outbreaks helps public health officials make timely and meaningful decisions that could save lives. We propose an influenza tracking model, ARGO (AutoRegression with GOogle search data), that uses publicly available online search data. In addition to having a rigorous statistical foundation, ARGO outperforms all previously available Google-search-based tracking models, including the latest version of Google Flu Trends, even though it uses only low-quality search data as input from publicly available Google Trends and Google Correlate websites. ARGO not only incorporates the seasonality in influenza epidemics but also captures changes in people's online search behavior over time. ARGO is also flexible, self-correcting, robust, and scalable, making it a potentially powerful tool that can be used for real-time tracking of other social events at multiple temporal and spatial resolutions.
Collapse
|
24
|
Deodhar S, Bisset K, Chen J, Barrett C, Wilson M, Marathe M. EpiCaster: An Integrated Web Application For Situation Assessment and Forecasting of Global Epidemics. ACM-BCB ... ... : THE ... ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND BIOMEDICINE. ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND BIOMEDICINE 2015; 2015:156-165. [PMID: 27796009 PMCID: PMC5082988 DOI: 10.1145/2808719.2808735] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Abstract
Public health decision makers need access to high resolution situation assessment tools for understanding the extent of various epidemics in different regions of the world. In addition, they need insights into the future course of epidemics by way of forecasts. Such forecasts are essential for planning the allocation of limited resources and for implementing several policy-level and behavioral intervention strategies. The need for such forecasting systems became evident in the wake of the recent Ebola outbreak in West Africa. We have developed EpiCaster, an integrated Web application for situation assessment and forecasting of various epidemics, such as Flu and Ebola, that are prevalent in different regions of the world. Using EpiCaster, users can assess the magnitude and severity of different epidemics at highly resolved spatio-temporal levels. EpiCaster provides time-varying heat maps and graphical plots to view trends in the disease dynamics. EpiCaster also allows users to visualize data gathered through surveillance mechanisms, such as Google Flu Trends (GFT) and the World Health Organization (WHO). The forecasts provided by EpiCaster are generated using different epidemiological models, and the users can select the models through the interface to filter the corresponding forecasts. EpiCaster also allows the users to study epidemic propagation in the presence of a number of intervention strategies specific to certain diseases. Here we describe the modeling techniques, methodologies and computational infrastructure that EpiCaster relies on to support large-scale predictive analytics for situation assessment and forecasting of global epidemics.
Collapse
Affiliation(s)
- Suruchi Deodhar
- Network Dynamics and Simulation Science Laboratory, VBI, Virginia Tech, Blacksburg, VA
| | - Keith Bisset
- Network Dynamics and Simulation Science Laboratory, VBI, Virginia Tech, Blacksburg, VA
| | - Jiangzhuo Chen
- Network Dynamics and Simulation Science Laboratory, VBI, Virginia Tech, Blacksburg, VA
| | - Chris Barrett
- Network Dynamics and Simulation Science Laboratory, VBI, Virginia Tech, Blacksburg, VA
| | - Mandy Wilson
- Network Dynamics and Simulation Science Laboratory, VBI, Virginia Tech, Blacksburg, VA
| | - Madhav Marathe
- Network Dynamics and Simulation Science Laboratory, VBI, Virginia Tech, Blacksburg, VA
| |
Collapse
|
25
|
Flexible Modeling of Epidemics with an Empirical Bayes Framework. PLoS Comput Biol 2015; 11:e1004382. [PMID: 26317693 PMCID: PMC4552841 DOI: 10.1371/journal.pcbi.1004382] [Citation(s) in RCA: 78] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2014] [Accepted: 06/05/2015] [Indexed: 11/19/2022] Open
Abstract
Seasonal influenza epidemics cause consistent, considerable, widespread loss annually in terms of economic burden, morbidity, and mortality. With access to accurate and reliable forecasts of a current or upcoming influenza epidemic’s behavior, policy makers can design and implement more effective countermeasures. This past year, the Centers for Disease Control and Prevention hosted the “Predict the Influenza Season Challenge”, with the task of predicting key epidemiological measures for the 2013–2014 U.S. influenza season with the help of digital surveillance data. We developed a framework for in-season forecasts of epidemics using a semiparametric Empirical Bayes framework, and applied it to predict the weekly percentage of outpatient doctors visits for influenza-like illness, and the season onset, duration, peak time, and peak height, with and without using Google Flu Trends data. Previous work on epidemic modeling has focused on developing mechanistic models of disease behavior and applying time series tools to explain historical data. However, tailoring these models to certain types of surveillance data can be challenging, and overly complex models with many parameters can compromise forecasting ability. Our approach instead produces possibilities for the epidemic curve of the season of interest using modified versions of data from previous seasons, allowing for reasonable variations in the timing, pace, and intensity of the seasonal epidemics, as well as noise in observations. Since the framework does not make strict domain-specific assumptions, it can easily be applied to some other diseases with seasonal epidemics. This method produces a complete posterior distribution over epidemic curves, rather than, for example, solely point predictions of forecasting targets. We report prospective influenza-like-illness forecasts made for the 2013–2014 U.S. influenza season, and compare the framework’s cross-validated prediction error on historical data to that of a variety of simpler baseline predictors. Influenza epidemics occur annually, and incur significant losses in terms of lost productivity, sickness, and death. Policy makers employ countermeasures, such as vaccination campaigns, to combat the occurrence and spread of infectious diseases, but epidemics exhibit a wide range of behavior, which makes designing and planning these efforts difficult. Accurate and reliable numerical forecasts of how an epidemic will behave, as well as advance notice of key events, could enable policy makers to further specialize countermeasures for a particular season. While a large amount of work already exists on modeling epidemics in past seasons, work on forecasting is relatively sparse. Specially tailored models for historical data may be overly strict and fail to produce behavior similar to the current season. We designed a framework for predicting epidemics without making strong assumptions about how the disease propagates by relying on slightly modified versions of past epidemics to form possibilities for the current season. We report forecasts generated for the 2013–2014 Centers for Disease Control and Prevention (CDC) “Predict the Influenza Season Challenge”, and assess its accuracy retrospectively.
Collapse
|
26
|
Gambhir M, Bozio C, O'Hagan JJ, Uzicanin A, Johnson LE, Biggerstaff M, Swerdlow DL. Infectious disease modeling methods as tools for informing response to novel influenza viruses of unknown pandemic potential. Clin Infect Dis 2015; 60 Suppl 1:S11-9. [PMID: 25878297 DOI: 10.1093/cid/civ083] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The rising importance of infectious disease modeling makes this an appropriate time for a guide for public health practitioners tasked with preparing for, and responding to, an influenza pandemic. We list several questions that public health practitioners commonly ask about pandemic influenza and match these with analytical methods, giving details on when during a pandemic the methods can be used, how long it might take to implement them, and what data are required. Although software to perform these tasks is available, care needs to be taken to understand: (1) the type of data needed, (2) the implementation of the methods, and (3) the interpretation of results in terms of model uncertainty and sensitivity. Public health leaders can use this article to evaluate the modeling literature, determine which methods can provide appropriate evidence for decision-making, and to help them request modeling work from in-house teams or academic groups.
Collapse
Affiliation(s)
- Manoj Gambhir
- Epidemiological Modelling Unit, Department of Epidemiology and Preventive Medicine, Monash University, Melbourne, Australia Modeling Unit, National Center for Immunization and Respiratory Diseases (NCIRD), Centers for Disease Control and Prevention (CDC) IHRC Inc
| | - Catherine Bozio
- Graduate Program in Epidemiology and Molecules to Mankind, Laney Graduate School, Emory University
| | - Justin J O'Hagan
- Modeling Unit, National Center for Immunization and Respiratory Diseases (NCIRD), Centers for Disease Control and Prevention (CDC) IHRC Inc
| | - Amra Uzicanin
- Division of Global Migration and Quarantine, National Center for Emerging and Zoonotic Infectious Diseases
| | | | | | - David L Swerdlow
- Modeling Unit and Office of the Director, NCIRD, CDC, Atlanta, Georgia
| |
Collapse
|
27
|
Hickmann KS, Fairchild G, Priedhorsky R, Generous N, Hyman JM, Deshpande A, Del Valle SY. Forecasting the 2013-2014 influenza season using Wikipedia. PLoS Comput Biol 2015; 11:e1004239. [PMID: 25974758 PMCID: PMC4431683 DOI: 10.1371/journal.pcbi.1004239] [Citation(s) in RCA: 106] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2014] [Accepted: 03/13/2015] [Indexed: 11/18/2022] Open
Abstract
Infectious diseases are one of the leading causes of morbidity and mortality around the world; thus, forecasting their impact is crucial for planning an effective response strategy. According to the Centers for Disease Control and Prevention (CDC), seasonal influenza affects 5% to 20% of the U.S. population and causes major economic impacts resulting from hospitalization and absenteeism. Understanding influenza dynamics and forecasting its impact is fundamental for developing prevention and mitigation strategies. We combine modern data assimilation methods with Wikipedia access logs and CDC influenza-like illness (ILI) reports to create a weekly forecast for seasonal influenza. The methods are applied to the 2013-2014 influenza season but are sufficiently general to forecast any disease outbreak, given incidence or case count data. We adjust the initialization and parametrization of a disease model and show that this allows us to determine systematic model bias. In addition, we provide a way to determine where the model diverges from observation and evaluate forecast accuracy. Wikipedia article access logs are shown to be highly correlated with historical ILI records and allow for accurate prediction of ILI data several weeks before it becomes available. The results show that prior to the peak of the flu season, our forecasting method produced 50% and 95% credible intervals for the 2013-2014 ILI observations that contained the actual observations for most weeks in the forecast. However, since our model does not account for re-infection or multiple strains of influenza, the tail of the epidemic is not predicted well after the peak of flu season has passed.
Collapse
Affiliation(s)
- Kyle S. Hickmann
- Theoretical Division Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
- * E-mail:
| | - Geoffrey Fairchild
- Defense Systems Analysis Division Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
| | - Reid Priedhorsky
- High Performance Computing Division Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
| | - Nicholas Generous
- Defense Systems Analysis Division Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
| | - James M. Hyman
- Department of Mathematics, Tulane University, New Orleans, Louisiana, United States of America
| | - Alina Deshpande
- Defense Systems Analysis Division Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
| | - Sara Y. Del Valle
- Defense Systems Analysis Division Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
| |
Collapse
|
28
|
Martin LJ, Xu B, Yasui Y. Improving Google Flu Trends estimates for the United States through transformation. PLoS One 2014; 9:e109209. [PMID: 25551391 PMCID: PMC4281210 DOI: 10.1371/journal.pone.0109209] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2014] [Accepted: 08/28/2014] [Indexed: 11/18/2022] Open
Abstract
Google Flu Trends (GFT) uses Internet search queries in an effort to provide early warning of increases in influenza-like illness (ILI). In the United States, GFT estimates the percentage of physician visits related to ILI (%ILINet) reported by the Centers for Disease Control and Prevention (CDC). However, during the 2012-13 influenza season, GFT overestimated %ILINet by an appreciable amount and estimated the peak in incidence three weeks late. Using data from 2010-14, we investigated the relationship between GFT estimates (%GFT) and %ILINet. Based on the relationship between the relative change in %GFT and the relative change in %ILINet, we transformed %GFT estimates to better correspond with %ILINet values. In 2010-13, our transformed %GFT estimates were within ± 10% of %ILINet values for 17 of the 29 weeks that %ILINet was above the seasonal baseline value determined by the CDC; in contrast, the original %GFT estimates were within ± 10% of %ILINet values for only two of these 29 weeks. Relative to the %ILINet peak in 2012-13, the peak in our transformed %GFT estimates was 2% lower and one week later, whereas the peak in the original %GFT estimates was 74% higher and three weeks later. The same transformation improved %GFT estimates using the recalibrated 2013 GFT model in early 2013-14. Our transformed %GFT estimates can be calculated approximately one week before %ILINet values are reported by the CDC and the transformation equation was stable over the time period investigated (2010-13). We anticipate our results will facilitate future use of GFT.
Collapse
Affiliation(s)
- Leah J. Martin
- School of Public Health, University of Alberta, Edmonton, Alberta, Canada
| | - Biying Xu
- School of Public Health, University of Alberta, Edmonton, Alberta, Canada
- Zhejiang University, Hangzhou, China
| | - Yutaka Yasui
- School of Public Health, University of Alberta, Edmonton, Alberta, Canada
| |
Collapse
|
29
|
Abstract
Accurate disease forecasts are imperative when preparing for influenza epidemic outbreaks; nevertheless, these forecasts are often limited by the time required to collect new, accurate data. In this paper, we show that data from the microblogging community Twitter significantly improves influenza forecasting. Most prior influenza forecast models are tested against historical influenza-like illness (ILI) data from the U.S. Centers for Disease Control and Prevention (CDC). These data are released with a one-week lag and are often initially inaccurate until the CDC revises them weeks later. Since previous studies utilize the final, revised data in evaluation, their evaluations do not properly determine the effectiveness of forecasting. Our experiments using ILI data available at the time of the forecast show that models incorporating data derived from Twitter can reduce forecasting error by 17-30% over a baseline that only uses historical data. For a given level of accuracy, using Twitter data produces forecasts that are two to four weeks ahead of baseline models. Additionally, we find that models using Twitter data are, on average, better predictors of influenza prevalence than are models using data from Google Flu Trends, the leading web data source.
Collapse
Affiliation(s)
- Michael J Paul
- Department of Computer Science, Johns Hopkins University, Baltimore, Maryland, USA
| | - Mark Dredze
- Human Language Technology Center of Excellence, Johns Hopkins University, Baltimore, Maryland, USA
| | - David Broniatowski
- Engineering Management and Systems Engineering, The George Washington University, Washington, District of Columbia, USA
| |
Collapse
|
30
|
Lee JL, DeCamp M, Dredze M, Chisolm MS, Berger ZD. What are health-related users tweeting? A qualitative content analysis of health-related users and their messages on twitter. J Med Internet Res 2014; 16:e237. [PMID: 25591063 PMCID: PMC4296104 DOI: 10.2196/jmir.3765] [Citation(s) in RCA: 79] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2014] [Revised: 09/12/2014] [Accepted: 09/16/2014] [Indexed: 11/24/2022] Open
Abstract
Background Twitter is home to many health professionals who send messages about a variety of health-related topics. Amid concerns about physicians posting inappropriate content online, more in-depth knowledge about these messages is needed to understand health professionals’ behavior on Twitter. Objective Our goal was to characterize the content of Twitter messages, specifically focusing on health professionals and their tweets relating to health. Methods We performed an in-depth content analysis of 700 tweets. Qualitative content analysis was conducted on tweets by health users on Twitter. The primary objective was to describe the general type of content (ie, health-related versus non-health related) on Twitter authored by health professionals and further to describe health-related tweets on the basis of the type of statement made. Specific attention was given to whether a tweet was personal (as opposed to professional) or made a claim that users would expect to be supported by some level of medical evidence (ie, a “testable” claim). A secondary objective was to compare content types among different users, including patients, physicians, nurses, health care organizations, and others. Results Health-related users are posting a wide range of content on Twitter. Among health-related tweets, 53.2% (184/346) contained a testable claim. Of health-related tweets by providers, 17.6% (61/346) were personal in nature; 61% (59/96) made testable statements. While organizations and businesses use Twitter to promote their services and products, patient advocates are using this tool to share their personal experiences with health. Conclusions Twitter users in health-related fields tweet about both testable claims and personal experiences. Future work should assess the relationship between testable tweets and the actual level of evidence supporting them, including how Twitter users—especially patients—interpret the content of tweets posted by health providers.
Collapse
Affiliation(s)
- Joy L Lee
- Johns Hopkins Bloomberg School of Public Health, Department of Health Policy & Management, Baltimore, MD, United States.
| | | | | | | | | |
Collapse
|
31
|
Chretien JP, George D, Shaman J, Chitale RA, McKenzie FE. Influenza forecasting in human populations: a scoping review. PLoS One 2014; 9:e94130. [PMID: 24714027 PMCID: PMC3979760 DOI: 10.1371/journal.pone.0094130] [Citation(s) in RCA: 110] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2013] [Accepted: 03/12/2014] [Indexed: 11/18/2022] Open
Abstract
Forecasts of influenza activity in human populations could help guide key preparedness tasks. We conducted a scoping review to characterize these methodological approaches and identify research gaps. Adapting the PRISMA methodology for systematic reviews, we searched PubMed, CINAHL, Project Euclid, and Cochrane Database of Systematic Reviews for publications in English since January 1, 2000 using the terms “influenza AND (forecast* OR predict*)”, excluding studies that did not validate forecasts against independent data or incorporate influenza-related surveillance data from the season or pandemic for which the forecasts were applied. We included 35 publications describing population-based (N = 27), medical facility-based (N = 4), and regional or global pandemic spread (N = 4) forecasts. They included areas of North America (N = 15), Europe (N = 14), and/or Asia-Pacific region (N = 4), or had global scope (N = 3). Forecasting models were statistical (N = 18) or epidemiological (N = 17). Five studies used data assimilation methods to update forecasts with new surveillance data. Models used virological (N = 14), syndromic (N = 13), meteorological (N = 6), internet search query (N = 4), and/or other surveillance data as inputs. Forecasting outcomes and validation metrics varied widely. Two studies compared distinct modeling approaches using common data, 2 assessed model calibration, and 1 systematically incorporated expert input. Of the 17 studies using epidemiological models, 8 included sensitivity analysis. This review suggests need for use of good practices in influenza forecasting (e.g., sensitivity analysis); direct comparisons of diverse approaches; assessment of model calibration; integration of subjective expert input; operational research in pilot, real-world applications; and improved mutual understanding among modelers and public health officials.
Collapse
Affiliation(s)
- Jean-Paul Chretien
- Division of Integrated Biosurveillance, Armed Forces Health Surveillance Center, Silver Spring, Maryland, United States of America
- * E-mail:
| | - Dylan George
- Division of Analytic Decision Support, Biomedical Advanced Research and Development Authority, Department of Health and Human Services, Washington, DC, United States of America
| | - Jeffrey Shaman
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, New York, United States of America
| | - Rohit A. Chitale
- Division of Integrated Biosurveillance, Armed Forces Health Surveillance Center, Silver Spring, Maryland, United States of America
| | - F. Ellis McKenzie
- Fogarty International Center, National Institutes of Health, Bethesda, Maryland, United States of America
| |
Collapse
|
32
|
Prieto D, Das TK. An operational epidemiological model for calibrating agent-based simulations of pandemic influenza outbreaks. Health Care Manag Sci 2014; 19:1-19. [PMID: 24710651 DOI: 10.1007/s10729-014-9273-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2013] [Accepted: 02/12/2014] [Indexed: 12/20/2022]
Abstract
Uncertainty of pandemic influenza viruses continue to cause major preparedness challenges for public health policymakers. Decisions to mitigate influenza outbreaks often involve tradeoff between the social costs of interventions (e.g., school closure) and the cost of uncontrolled spread of the virus. To achieve a balance, policymakers must assess the impact of mitigation strategies once an outbreak begins and the virus characteristics are known. Agent-based (AB) simulation is a useful tool for building highly granular disease spread models incorporating the epidemiological features of the virus as well as the demographic and social behavioral attributes of tens of millions of affected people. Such disease spread models provide excellent basis on which various mitigation strategies can be tested, before they are adopted and implemented by the policymakers. However, to serve as a testbed for the mitigation strategies, the AB simulation models must be operational. A critical requirement for operational AB models is that they are amenable for quick and simple calibration. The calibration process works as follows: the AB model accepts information available from the field and uses those to update its parameters such that some of its outputs in turn replicate the field data. In this paper, we present our epidemiological model based calibration methodology that has a low computational complexity and is easy to interpret. Our model accepts a field estimate of the basic reproduction number, and then uses it to update (calibrate) the infection probabilities in a way that its effect combined with the effects of the given virus epidemiology, demographics, and social behavior results in an infection pattern yielding a similar value of the basic reproduction number. We evaluate the accuracy of the calibration methodology by applying it for an AB simulation model mimicking a regional outbreak in the US. The calibrated model is shown to yield infection patterns closely replicating the input estimates of the basic reproduction number. The calibration method is also tested to replicate an initial infection incidence trend for a H1N1 outbreak like that of 2009.
Collapse
Affiliation(s)
- D Prieto
- Department of Industrial and Manufacturing Engineering, Western Michigan University, 1903 W. Michigan Ave., Kalamazoo, MI, 49008-5336, USA.
| | - T K Das
- Department of Industrial and Management Systems Engineering, University of South Florida, Tampa, FL, 33620, USA
| |
Collapse
|
33
|
Nsoesie EO, Brownstein JS, Ramakrishnan N, Marathe MV. A systematic review of studies on forecasting the dynamics of influenza outbreaks. Influenza Other Respir Viruses 2013; 8:309-16. [PMID: 24373466 PMCID: PMC4181479 DOI: 10.1111/irv.12226] [Citation(s) in RCA: 108] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/24/2013] [Indexed: 12/24/2022] Open
Abstract
Forecasting the dynamics of influenza outbreaks could be useful for decision-making regarding the allocation of public health resources. Reliable forecasts could also aid in the selection and implementation of interventions to reduce morbidity and mortality due to influenza illness. This paper reviews methods for influenza forecasting proposed during previous influenza outbreaks and those evaluated in hindsight. We discuss the various approaches, in addition to the variability in measures of accuracy and precision of predicted measures. PubMed and Google Scholar searches for articles on influenza forecasting retrieved sixteen studies that matched the study criteria. We focused on studies that aimed at forecasting influenza outbreaks at the local, regional, national, or global level. The selected studies spanned a wide range of regions including USA, Sweden, Hong Kong, Japan, Singapore, United Kingdom, Canada, France, and Cuba. The methods were also applied to forecast a single measure or multiple measures. Typical measures predicted included peak timing, peak height, daily/weekly case counts, and outbreak magnitude. Due to differences in measures used to assess accuracy, a single estimate of predictive error for each of the measures was difficult to obtain. However, collectively, the results suggest that these diverse approaches to influenza forecasting are capable of capturing specific outbreak measures with some degree of accuracy given reliable data and correct disease assumptions. Nonetheless, several of these approaches need to be evaluated and their performance quantified in real-time predictions.
Collapse
Affiliation(s)
- Elaine O Nsoesie
- Children's Hospital Informatics Program, Boston Children's Hospital, Boston, MA, USA; Department of Pediatrics, Harvard Medical School, Boston, MA, USA; Network Dynamics and Simulation Science Laboratory, Virginia Bioinformatics Institute, Virginia Tech, Blacksburg, VA, USA
| | | | | | | |
Collapse
|
34
|
Broniatowski DA, Paul MJ, Dredze M. National and local influenza surveillance through Twitter: an analysis of the 2012-2013 influenza epidemic. PLoS One 2013; 8:e83672. [PMID: 24349542 PMCID: PMC3857320 DOI: 10.1371/journal.pone.0083672] [Citation(s) in RCA: 230] [Impact Index Per Article: 20.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2013] [Accepted: 11/14/2013] [Indexed: 11/18/2022] Open
Abstract
Social media have been proposed as a data source for influenza surveillance because they have the potential to offer real-time access to millions of short, geographically localized messages containing information regarding personal well-being. However, accuracy of social media surveillance systems declines with media attention because media attention increases "chatter" - messages that are about influenza but that do not pertain to an actual infection - masking signs of true influenza prevalence. This paper summarizes our recently developed influenza infection detection algorithm that automatically distinguishes relevant tweets from other chatter, and we describe our current influenza surveillance system which was actively deployed during the full 2012-2013 influenza season. Our objective was to analyze the performance of this system during the most recent 2012-2013 influenza season and to analyze the performance at multiple levels of geographic granularity, unlike past studies that focused on national or regional surveillance. Our system's influenza prevalence estimates were strongly correlated with surveillance data from the Centers for Disease Control and Prevention for the United States (r = 0.93, p < 0.001) as well as surveillance data from the Department of Health and Mental Hygiene of New York City (r = 0.88, p < 0.001). Our system detected the weekly change in direction (increasing or decreasing) of influenza prevalence with 85% accuracy, a nearly twofold increase over a simpler model, demonstrating the utility of explicitly distinguishing infection tweets from other chatter.
Collapse
Affiliation(s)
- David A. Broniatowski
- Department of Engineering Management and Systems Engineering, The George Washington University, Washington, District of Columbia, United States of America
- Center for Advanced Modeling in the Social, Behavioral, and Health Sciences, Department of Emergency Medicine, School of Medicine, Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Michael J. Paul
- Department of Computer Science and Center for Language and Speech Processing, Johns Hopkins University, Baltimore, Maryland, United States of America
- * E-mail:
| | - Mark Dredze
- Human Language Technology Center of Excellence and Department of Computer Science, Johns Hopkins University, Baltimore, Maryland, United States of America
| |
Collapse
|
35
|
Seasonal distribution of severe ADAMTS13 deficient idiopathic thrombotic thrombocytopenic purpura. J Clin Apher 2013; 29:113-9. [DOI: 10.1002/jca.21300] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2013] [Accepted: 08/12/2013] [Indexed: 12/29/2022]
|