Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Hickmann KS, Fairchild G, Priedhorsky R, Generous N, Hyman JM, Deshpande A, Del Valle SY. Forecasting the 2013-2014 influenza season using Wikipedia. PLoS Comput Biol 2015;11:e1004239. [PMID: 25974758 PMCID: PMC4431683 DOI: 10.1371/journal.pcbi.1004239] [Citation(s) in RCA: 106] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2014] [Accepted: 03/13/2015] [Indexed: 11/18/2022] Open

For:	Hickmann KS, Fairchild G, Priedhorsky R, Generous N, Hyman JM, Deshpande A, Del Valle SY. Forecasting the 2013-2014 influenza season using Wikipedia. PLoS Comput Biol 2015;11:e1004239. [PMID: 25974758 PMCID: PMC4431683 DOI: 10.1371/journal.pcbi.1004239] [Citation(s) in RCA: 106] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2014] [Accepted: 03/13/2015] [Indexed: 11/18/2022] Open

Number

Cited by Other Article(s)

Ab Rashid MA, Ahmad Zaki R, Wan Mahiyuddin WR, Yahya A. Forecasting New Tuberculosis Cases in Malaysia: A Time-Series Study Using the Autoregressive Integrated Moving Average (ARIMA) Model. Cureus 2023;15:e44676. [PMID: 37809275 PMCID: PMC10552684 DOI: 10.7759/cureus.44676] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/04/2023] [Indexed: 10/10/2023] Open

Wang Y, Zhou H, Zheng L, Li M, Hu B. Using the Baidu index to predict trends in the incidence of tuberculosis in Jiangsu Province, China. Front Public Health 2023;11:1203628. [PMID: 37533520 PMCID: PMC10390734 DOI: 10.3389/fpubh.2023.1203628] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2023] [Accepted: 07/05/2023] [Indexed: 08/04/2023] Open

Mavragani A, Fragkozidis G, Zarkogianni K, Nikita KS. Long Short-term Memory-Based Prediction of the Spread of Influenza-Like Illness Leveraging Surveillance, Weather, and Twitter Data: Model Development and Validation. J Med Internet Res 2023;25:e42519. [PMID: 36745490 PMCID: PMC9941907 DOI: 10.2196/42519] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Revised: 11/29/2022] [Accepted: 11/30/2022] [Indexed: 12/05/2022] Open

Abstract

BACKGROUND

The potential to harness the plurality of available data in real time along with advanced data analytics for the accurate prediction of influenza-like illness (ILI) outbreaks has gained significant scientific interest. Different methodologies based on the use of machine learning techniques and traditional and alternative data sources, such as ILI surveillance reports, weather reports, search engine queries, and social media, have been explored with the ultimate goal of being used in the development of electronic surveillance systems that could complement existing monitoring resources.

OBJECTIVE

The scope of this study was to investigate for the first time the combined use of ILI surveillance data, weather data, and Twitter data along with deep learning techniques toward the development of prediction models able to nowcast and forecast weekly ILI cases. By assessing the predictive power of both traditional and alternative data sources on the use case of ILI, this study aimed to provide a novel approach for corroborating evidence and enhancing accuracy and reliability in the surveillance of infectious diseases.

METHODS

The model's input space consisted of information related to weekly ILI surveillance, web-based social (eg, Twitter) behavior, and weather conditions. For the design and development of the model, relevant data corresponding to the period of 2010 to 2019 and focusing on the Greek population and weather were collected. Long short-term memory (LSTM) neural networks were leveraged to efficiently handle the sequential and nonlinear nature of the multitude of collected data. The 3 data categories were first used separately for training 3 LSTM-based primary models. Subsequently, different transfer learning (TL) approaches were explored with the aim of creating various feature spaces combining the features extracted from the corresponding primary models' LSTM layers for the latter to feed a dense layer.

RESULTS

The primary model that learned from weather data yielded better forecast accuracy (root mean square error [RMSE]=0.144; Pearson correlation coefficient [PCC]=0.801) than the model trained with ILI historical data (RMSE=0.159; PCC=0.794). The best performance was achieved by the TL-based model leveraging the combination of the 3 data categories (RMSE=0.128; PCC=0.822).

CONCLUSIONS

The superiority of the TL-based model, which considers Twitter data, weather data, and ILI surveillance data, reflects the potential of alternative public sources to enhance accurate and reliable prediction of ILI spread. Despite its focus on the use case of Greece, the proposed approach can be generalized to other locations, populations, and social media platforms to support the surveillance of infectious diseases with the ultimate goal of reinforcing preparedness for future epidemics.

Collapse

Santangelo OE, Gianfredi V, Provenzano S. Wikipedia searches and the epidemiology of infectious diseases: A systematic review. DATA KNOWL ENG 2022. [DOI: 10.1016/j.datak.2022.102093] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]

Abdulkhaleq MT, Rashid TA, Alsadoon A, Hassan BA, Mohammadi M, Abdullah JM, Chhabra A, Ali SL, Othman RN, Hasan HA, Azad S, Mahmood NA, Abdalrahman SS, Rasul HO, Bacanin N, Vimal S. Harmony search: Current studies and uses on healthcare systems. Artif Intell Med 2022;131:102348. [DOI: 10.1016/j.artmed.2022.102348] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2022] [Revised: 05/08/2022] [Accepted: 06/30/2022] [Indexed: 11/29/2022]

Beesley LJ, Osthus D, Del Valle SY. Addressing delayed case reporting in infectious disease forecast modeling. PLoS Comput Biol 2022;18:e1010115. [PMID: 35658007 PMCID: PMC9200328 DOI: 10.1371/journal.pcbi.1010115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2021] [Revised: 06/15/2022] [Accepted: 04/18/2022] [Indexed: 11/18/2022] Open

Abstract

Infectious disease forecasting is of great interest to the public health community and policymakers, since forecasts can provide insight into disease dynamics in the near future and inform interventions. Due to delays in case reporting, however, forecasting models may often underestimate the current and future disease burden.

In this paper, we propose a general framework for addressing reporting delay in disease forecasting efforts with the goal of improving forecasts. We propose strategies for leveraging either historical data on case reporting or external internet-based data to estimate the amount of reporting error. We then describe several approaches for adapting general forecasting pipelines to account for under- or over-reporting of cases. We apply these methods to address reporting delay in data on dengue fever cases in Puerto Rico from 1990 to 2009 and to reports of influenza-like illness (ILI) in the United States between 2010 and 2019. Through a simulation study, we compare method performance and evaluate robustness to assumption violations. Our results show that forecasting accuracy and prediction coverage almost always increase when correction methods are implemented to address reporting delay. Some of these methods required knowledge about the reporting error or high quality external data, which may not always be available. Provided alternatives include excluding recently-reported data and performing sensitivity analysis. This work provides intuition and guidance for handling delay in disease case reporting and may serve as a useful resource to inform practical infectious disease forecasting efforts.

The public health community and policymakers are interested in using models to predict future disease rates using information about disease rates in the past. However, our data about the recent past are less reliable than older data, due to a time lag between someone getting sick and their subsequent diagnosis being officially reported. In this paper, we describe strategies to correct reported disease rates from the recent past to account for disease diagnoses that haven’t yet been reported. Using more accurate information about the recent past, we can do a better job predicting what will happen in the future.

Collapse

Said Abasse K, Toulouse Fournier A, Paquet C, Côté A, Smith PY, Bergeron F, Archambault P. Collaborative Writing Applications in Support of Knowledge Translation and Management during Pandemics: A Scoping Review. Int J Med Inform 2022;165:104814. [DOI: 10.1016/j.ijmedinf.2022.104814] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2022] [Revised: 04/17/2022] [Accepted: 06/05/2022] [Indexed: 11/28/2022]

AlRyalat SA, Al Oweidat K, Al-Essa M, Ashouri K, El Khatib O, Al-Rawashdeh A, Yaseen A, Toumar A, Alrwashdeh A. Influenza Altmetric Attention Score and its association with the influenza season in the USA. F1000Res 2022;9:96. [PMID: 35465063 PMCID: PMC9021684 DOI: 10.12688/f1000research.22127.3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 03/04/2022] [Indexed: 11/20/2022] Open

Query-based-learning mortality-related decoders for the developed island economy. Sci Rep 2022;12:956. [PMID: 35046447 PMCID: PMC8770507 DOI: 10.1038/s41598-022-04855-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2021] [Accepted: 12/30/2021] [Indexed: 11/09/2022] Open

He Y, Zhao Y, Chen Y, Yuan H, Tsui K. Nowcasting influenza‐like illness (ILI) via a deep learning approach using google search data: An empirical study on Taiwan ILI. INT J INTELL SYST 2021. [DOI: 10.1002/int.22788] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Bannister A, Botta F. Rapid indicators of deprivation using grocery shopping data. ROYAL SOCIETY OPEN SCIENCE 2021;8:211069. [PMID: 34950487 PMCID: PMC8692957 DOI: 10.1098/rsos.211069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/27/2021] [Accepted: 11/29/2021] [Indexed: 06/14/2023]

Marmara V, Marmara D, McMenemy P, Kleczkowski A. Cross-sectional telephone surveys as a tool to study epidemiological factors and monitor seasonal influenza activity in Malta. BMC Public Health 2021;21:1828. [PMID: 34627201 PMCID: PMC8502089 DOI: 10.1186/s12889-021-11862-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2020] [Accepted: 09/27/2021] [Indexed: 11/29/2022] Open

Abstract

Background

Seasonal influenza has major implications for healthcare services as outbreaks often lead to high activity levels in health systems. Being able to predict when such outbreaks occur is vital. Mathematical models have extensively been used to predict epidemics of infectious diseases such as seasonal influenza and to assess effectiveness of control strategies. Availability of comprehensive and reliable datasets used to parametrize these models is limited. In this paper we combine a unique epidemiological dataset collected in Malta through General Practitioners (GPs) with a novel method using cross-sectional surveys to study seasonal influenza dynamics in Malta in 2014–2016, to include social dynamics and self-perception related to seasonal influenza.

Methods

Two cross-sectional public surveys (n = 406 per survey) were performed by telephone across the Maltese population in 2014–15 and 2015–16 influenza seasons. Survey results were compared with incidence data (diagnosed seasonal influenza cases) collected by GPs in the same period and with Google Trends data for Malta. Information was collected on whether participants recalled their health status in past months, occurrences of influenza symptoms, hospitalisation rates due to seasonal influenza, seeking GP advice, and other medical information.

Results

We demonstrate that cross-sectional surveys are a reliable alternative data source to medical records. The two surveys gave comparable results, indicating that the level of recollection among the public is high. Based on two seasons of data, the reporting rate in Malta varies between 14 and 22%. The comparison with Google Trends suggests that the online searches peak at about the same time as the maximum extent of the epidemic, but the public interest declines and returns to background level. We also found that the public intensively searched the Internet for influenza-related terms even when number of cases was low.

Conclusions

Our research shows that a telephone survey is a viable way to gain deeper insight into a population’s self-perception of influenza and its symptoms and to provide another benchmark for medical statistics provided by GPs and Google Trends. The information collected can be used to improve epidemiological modelling of seasonal influenza and other infectious diseases, thus effectively contributing to public health.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12889-021-11862-x.

Collapse

Li J, Sia CL, Chen Z, Huang W. Enhancing Influenza Epidemics Forecasting Accuracy in China with Both Official and Unofficial Online News Articles, 2019-2020. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021;18:ijerph18126591. [PMID: 34207479 PMCID: PMC8296334 DOI: 10.3390/ijerph18126591] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/03/2021] [Revised: 06/05/2021] [Accepted: 06/15/2021] [Indexed: 11/16/2022]

Choi H, Choi WS, Han E. Suggestion of a simpler and faster influenza-like illness surveillance system using 2014-2018 claims data in Korea. Sci Rep 2021;11:11243. [PMID: 34045533 PMCID: PMC8159991 DOI: 10.1038/s41598-021-90511-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2020] [Accepted: 05/06/2021] [Indexed: 11/10/2022] Open

Osthus D, Moran KR. Multiscale influenza forecasting. Nat Commun 2021;12:2991. [PMID: 34016992 PMCID: PMC8137955 DOI: 10.1038/s41467-021-23234-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2019] [Accepted: 04/16/2021] [Indexed: 11/09/2022] Open

Poirier C, Hswen Y, Bouzillé G, Cuggia M, Lavenu A, Brownstein JS, Brewer T, Santillana M. Influenza forecasting for French regions combining EHR, web and climatic data sources with a machine learning ensemble approach. PLoS One 2021;16:e0250890. [PMID: 34010293 PMCID: PMC8133501 DOI: 10.1371/journal.pone.0250890] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2021] [Accepted: 04/16/2021] [Indexed: 11/25/2022] Open

Chrzanowski J, Sołek J, Fendler W, Jemielniak D. Assessing Public Interest Based on Wikipedia's Most Visited Medical Articles During the SARS-CoV-2 Outbreak: Search Trends Analysis. J Med Internet Res 2021;23:e26331. [PMID: 33667176 PMCID: PMC8049630 DOI: 10.2196/26331] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2020] [Revised: 01/21/2021] [Accepted: 02/18/2021] [Indexed: 12/14/2022] Open

Abstract

Background

In the current era of widespread access to the internet, we can monitor public interest in a topic via information-targeted web browsing. We sought to provide direct proof of the global population’s altered use of Wikipedia medical knowledge resulting from the new COVID-19 pandemic and related global restrictions.

Objective

We aimed to identify temporal search trends and quantify changes in access to Wikipedia Medicine Project articles that were related to the COVID-19 pandemic.

Methods

We performed a retrospective analysis of medical articles across nine language versions of Wikipedia and country-specific statistics for registered COVID-19 deaths. The observed patterns were compared to a forecast model of Wikipedia use, which was trained on data from 2015 to 2019. The model comprehensively analyzed specific articles and similarities between access count data from before (ie, several years prior) and during the COVID-19 pandemic. Wikipedia articles that were linked to those directly associated with the pandemic were evaluated in terms of degrees of separation and analyzed to identify similarities in access counts. We assessed the correlation between article access counts and the number of diagnosed COVID-19 cases and deaths to identify factors that drove interest in these articles and shifts in public interest during the subsequent phases of the pandemic.

Results

We observed a significant (P<.001) increase in the number of entries on Wikipedia medical articles during the pandemic period. The increased interest in COVID-19–related articles temporally correlated with the number of global COVID-19 deaths and consistently correlated with the number of region-specific COVID-19 deaths. Articles with low degrees of separation were significantly similar (P<.001) in terms of access patterns that were indicative of information-seeking patterns.

Conclusions

The analysis of Wikipedia medical article popularity could be a viable method for epidemiologic surveillance, as it provides important information about the reasons behind public attention and factors that sustain public interest in the long term. Moreover, Wikipedia users can potentially be directed to credible and valuable information sources that are linked with the most prominent articles.

Collapse

Feature Selection for Colon Cancer Detection Using K-Means Clustering and Modified Harmony Search Algorithm. MATHEMATICS 2021. [DOI: 10.3390/math9050570] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]

Gianfredi V, Santangelo OE, Provenzano S. Correlation between flu and Wikipedia's pages visualization. ACTA BIO-MEDICA : ATENEI PARMENSIS 2021;92:e2021056. [PMID: 33682825 PMCID: PMC7975939 DOI: 10.23750/abm.v92i1.9790] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Subscribe] [Scholar Register] [Received: 05/13/2020] [Accepted: 12/10/2020] [Indexed: 12/17/2022]

Poulin R, Bennett J, Filion A, Bhattarai UR, Chai X, de Angeli Dutra D, Donlon E, Doherty JF, Jorge F, Milotic M, Park E, Sabadel A, Thomas LJ. iParasitology: Mining the Internet to Test Parasitological Hypotheses. Trends Parasitol 2021;37:267-272. [PMID: 33547010 DOI: 10.1016/j.pt.2021.01.003] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2020] [Revised: 01/12/2021] [Accepted: 01/13/2021] [Indexed: 12/17/2022]

Seasonality of Back Pain in Italy: An Infodemiology Study. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021;18:ijerph18031325. [PMID: 33535709 PMCID: PMC7908346 DOI: 10.3390/ijerph18031325] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/09/2020] [Revised: 01/21/2021] [Accepted: 01/28/2021] [Indexed: 12/27/2022]

Leuba SI, Yaesoubi R, Antillon M, Cohen T, Zimmer C. Tracking and predicting U.S. influenza activity with a real-time surveillance network. PLoS Comput Biol 2020;16:e1008180. [PMID: 33137088 PMCID: PMC7707518 DOI: 10.1371/journal.pcbi.1008180] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2019] [Revised: 12/01/2020] [Accepted: 07/22/2020] [Indexed: 12/29/2022] Open

Gozzi N, Tizzani M, Starnini M, Ciulla F, Paolotti D, Panisson A, Perra N. Collective Response to Media Coverage of the COVID-19 Pandemic on Reddit and Wikipedia: Mixed-Methods Analysis. J Med Internet Res 2020;22:e21597. [PMID: 32960775 PMCID: PMC7553788 DOI: 10.2196/21597] [Citation(s) in RCA: 64] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Revised: 07/31/2020] [Accepted: 09/09/2020] [Indexed: 11/29/2022] Open

Abstract

BACKGROUND

The exposure and consumption of information during epidemic outbreaks may alter people's risk perception and trigger behavioral changes, which can ultimately affect the evolution of the disease. It is thus of utmost importance to map the dissemination of information by mainstream media outlets and the public response to this information. However, our understanding of this exposure-response dynamic during the COVID-19 pandemic is still limited.

OBJECTIVE

The goal of this study is to characterize the media coverage and collective internet response to the COVID-19 pandemic in four countries: Italy, the United Kingdom, the United States, and Canada.

METHODS

We collected a heterogeneous data set including 227,768 web-based news articles and 13,448 YouTube videos published by mainstream media outlets, 107,898 user posts and 3,829,309 comments on the social media platform Reddit, and 278,456,892 views of COVID-19-related Wikipedia pages. To analyze the relationship between media coverage, epidemic progression, and users' collective web-based response, we considered a linear regression model that predicts the public response for each country given the amount of news exposure. We also applied topic modelling to the data set using nonnegative matrix factorization.

RESULTS

Our results show that public attention, quantified as user activity on Reddit and active searches on Wikipedia pages, is mainly driven by media coverage; meanwhile, this activity declines rapidly while news exposure and COVID-19 incidence remain high. Furthermore, using an unsupervised, dynamic topic modeling approach, we show that while the levels of attention dedicated to different topics by media outlets and internet users are in good accordance, interesting deviations emerge in their temporal patterns.

CONCLUSIONS

Overall, our findings offer an additional key to interpret public perception and response to the current global health emergency and raise questions about the effects of attention saturation on people's collective awareness and risk perception and thus on their tendencies toward behavioral change.

Collapse

Kramer SC, Pei S, Shaman J. Forecasting influenza in Europe using a metapopulation model incorporating cross-border commuting and air travel. PLoS Comput Biol 2020;16:e1008233. [PMID: 33052907 PMCID: PMC7588111 DOI: 10.1371/journal.pcbi.1008233] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2020] [Revised: 10/26/2020] [Accepted: 08/10/2020] [Indexed: 11/18/2022] Open

Jia Q, Guo Y, Wang G, Barnes SJ. Big Data Analytics in the Fight against Major Public Health Incidents (Including COVID-19): A Conceptual Framework. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2020;17:E6161. [PMID: 32854265 PMCID: PMC7503476 DOI: 10.3390/ijerph17176161] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/21/2020] [Revised: 08/19/2020] [Accepted: 08/21/2020] [Indexed: 11/16/2022]

Caldwell WK, Fairchild G, Del Valle SY. Surveilling Influenza Incidence With Centers for Disease Control and Prevention Web Traffic Data: Demonstration Using a Novel Dataset. J Med Internet Res 2020;22:e14337. [PMID: 32437327 PMCID: PMC7367534 DOI: 10.2196/14337] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2019] [Revised: 01/29/2020] [Accepted: 03/22/2020] [Indexed: 11/23/2022] Open

Abstract

Background

Influenza epidemics result in a public health and economic burden worldwide. Traditional surveillance techniques, which rely on doctor visits, provide data with a delay of 1 to 2 weeks. A means of obtaining real-time data and forecasting future outbreaks is desirable to provide more timely responses to influenza epidemics.

Objective

This study aimed to present the first implementation of a novel dataset by demonstrating its ability to supplement traditional disease surveillance at multiple spatial resolutions.

Methods

We used internet traffic data from the Centers for Disease Control and Prevention (CDC) website to determine the potential usability of this data source. We tested the traffic generated by 10 influenza-related pages in 8 states and 9 census divisions within the United States and compared it against clinical surveillance data.

Results

Our results yielded an r² value of 0.955 in the most successful case, promising results for some cases, and unsuccessful results for other cases. In the interest of scientific transparency to further the understanding of when internet data streams are an appropriate supplemental data source, we also included negative results (ie, unsuccessful models). Models that focused on a single influenza season were more successful than those that attempted to model multiple influenza seasons. Geographic resolution appeared to play a key role, with national and regional models being more successful, overall, than models at the state level.

Conclusions

These results demonstrate that internet data may be able to complement traditional influenza surveillance in some cases but not in others. Specifically, our results show that the CDC website traffic may inform national- and division-level models but not models for each individual state. In addition, our results show better agreement when the data were broken up by seasons instead of aggregated over several years. We anticipate that this work will lead to more complex nowcasting and forecasting models using this data stream.

Collapse

Barros JM, Duggan J, Rebholz-Schuhmann D. The Application of Internet-Based Sources for Public Health Surveillance (Infoveillance): Systematic Review. J Med Internet Res 2020;22:e13680. [PMID: 32167477 PMCID: PMC7101503 DOI: 10.2196/13680] [Citation(s) in RCA: 51] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2019] [Revised: 09/18/2019] [Accepted: 11/26/2019] [Indexed: 12/30/2022] Open

Abstract

Background

Public health surveillance is based on the continuous and systematic collection, analysis, and interpretation of data. This informs the development of early warning systems to monitor epidemics and documents the impact of intervention measures. The introduction of digital data sources, and specifically sources available on the internet, has impacted the field of public health surveillance. New opportunities enabled by the underlying availability and scale of internet-based sources (IBSs) have paved the way for novel approaches for disease surveillance, exploration of health communities, and the study of epidemic dynamics. This field and approach is also known as infodemiology or infoveillance.

Objective

This review aimed to assess research findings regarding the application of IBSs for public health surveillance (infodemiology or infoveillance). To achieve this, we have presented a comprehensive systematic literature review with a focus on these sources and their limitations, the diseases targeted, and commonly applied methods.

Methods

A systematic literature review was conducted targeting publications between 2012 and 2018 that leveraged IBSs for public health surveillance, outbreak forecasting, disease characterization, diagnosis prediction, content analysis, and health-topic identification. The search results were filtered according to previously defined inclusion and exclusion criteria.

Results

Spanning a total of 162 publications, we determined infectious diseases to be the preferred case study (108/162, 66.7%). Of the eight categories of IBSs (search queries, social media, news, discussion forums, websites, web encyclopedia, and online obituaries), search queries and social media were applied in 95.1% (154/162) of the reviewed publications. We also identified limitations in representativeness and biased user age groups, as well as high susceptibility to media events by search queries, social media, and web encyclopedias.

Conclusions

IBSs are a valuable proxy to study illnesses affecting the general population; however, it is important to characterize which diseases are best suited for the available sources; the literature shows that the level of engagement among online platforms can be a potential indicator. There is a necessity to understand the population’s online behavior; in addition, the exploration of health information dissemination and its content is significantly unexplored. With this information, we can understand how the population communicates about illnesses online and, in the process, benefit public health.

Collapse

Lu J, Meyer S. Forecasting Flu Activity in the United States: Benchmarking an Endemic-Epidemic Beta Model. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2020;17:E1381. [PMID: 32098038 PMCID: PMC7068443 DOI: 10.3390/ijerph17041381] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/31/2019] [Revised: 02/07/2020] [Accepted: 02/15/2020] [Indexed: 11/25/2022]

Darwish A, Rahhal Y, Jafar A. A comparative study on predicting influenza outbreaks using different feature spaces: application of influenza-like illness data from Early Warning Alert and Response System in Syria. BMC Res Notes 2020;13:33. [PMID: 31948473 PMCID: PMC6964210 DOI: 10.1186/s13104-020-4889-5] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2019] [Accepted: 01/03/2020] [Indexed: 11/10/2022] Open

Abstract

Objective

An accurate forecasting of outbreaks of influenza-like illness (ILI) could support public health officials to suggest public health actions earlier. We investigated the performance of three different feature spaces in different models to forecast the weekly ILI rate in Syria using EWARS data from World Health Organization (WHO). Time series feature space was first used and we applied the seven models which are Naïve, Average, Seasonal naïve, drift, dynamic harmonic regression (Dhr), seasonal and trend decomposition using loess (STL) and TBATS. The Second feature space is like some state-of-the-art, which we named \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$53-weeks-before\_52-first-order-difference$$\end{document}53-weeks-before_52-first-order-difference feature space. The third one, we proposed and named \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$n-years-before\_m-weeks-around$$\end{document}n-years-before_m-weeks-around (YnWm) feature space. Machine learning (ML) and deep learning (DL) model were applied to the second and third feature spaces (generalized linear model (GLM), support vector regression (SVR), gradient boosting (GB), random forest (RF) and long short term memory (LSTM)).

Results

It was indicated that the LSTM model of four layers with \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$1-year-before\_4-weeks-around$$\end{document}1-year-before_4-weeks-around feature space gave more accurate results than other models and reached the lowest MAPE of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$3.52\%$$\end{document}3.52% and the lowest RMSE of 0.01662. I hope that this modelling methodology can be applied in other countries and therefore help prevent and control influenza worldwide.

Collapse

Zimmer C, Leuba SI, Cohen T, Yaesoubi R. Accurate quantification of uncertainty in epidemic parameter estimates and predictions using stochastic compartmental models. Stat Methods Med Res 2019;28:3591-3608. [PMID: 30428780 PMCID: PMC6517086 DOI: 10.1177/0962280218805780] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]

Choi SB, Kim J, Ahn I. Forecasting type-specific seasonal influenza after 26 weeks in the United States using influenza activities in other countries. PLoS One 2019;14:e0220423. [PMID: 31765386 PMCID: PMC6876883 DOI: 10.1371/journal.pone.0220423] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2019] [Accepted: 11/04/2019] [Indexed: 12/21/2022] Open

Abstract

To identify countries that have seasonal patterns similar to the time series of influenza surveillance data in the United States and other countries, and to forecast the 2018-2019 seasonal influenza outbreak in the U.S., we collected the surveillance data of 164 countries using the FluNet database, search queries from Google Trends, and temperature from 2010 to 2018. Data for influenza-like illness (ILI) in the U.S. were collected from the Fluview database. We identified the time lag between two time-series which were weekly surveillances for ILI, total influenza (Total INF), influenza A (INF A), and influenza B (INF B) viruses between two countries using cross-correlation analysis. In order to forecast ILI, Total INF, INF A, and INF B of next season (after 26 weeks) in the U.S., we developed prediction models using linear regression, auto regressive integrated moving average, and an artificial neural network (ANN). As a result of cross-correlation analysis between the countries located in northern and southern hemisphere, the seasonal influenza patterns in Australia and Chile showed a high correlation with those of the U.S. 22 weeks and 28 weeks earlier, respectively. The R2 score of ANN models for ILI for validation set in 2015-2019 was 0.758 despite how hard it is to forecast 26 weeks ahead. Our prediction models forecast that the ILI for the U.S. in 2018-2019 may be later and less severe than those in 2017-2018, judging from the influenza activity for Australia and Chile in 2018. It allows to estimate peak timing, peak intensity, and type-specific influenza activities for next season at 40th week. The correlation between seasonal influenza patterns in the U.S., Australia, and Chile could be used to forecast the next seasonal influenza pattern, which can help to determine influenza vaccine strategy approximately six months ahead in the U.S.

Collapse

Rangarajan P, Mody SK, Marathe M. Forecasting dengue and influenza incidences using a sparse representation of Google trends, electronic health records, and time series data. PLoS Comput Biol 2019;15:e1007518. [PMID: 31751346 PMCID: PMC6894887 DOI: 10.1371/journal.pcbi.1007518] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2019] [Revised: 12/05/2019] [Accepted: 10/29/2019] [Indexed: 12/20/2022] Open

Abstract

Dengue and influenza-like illness (ILI) are two of the leading causes of viral infection in the world and it is estimated that more than half the world’s population is at risk for developing these infections. It is therefore important to develop accurate methods for forecasting dengue and ILI incidences. Since data from multiple sources (such as dengue and ILI case counts, electronic health records and frequency of multiple internet search terms from Google Trends) can improve forecasts, standard time series analysis methods are inadequate to estimate all the parameter values from the limited amount of data available if we use multiple sources. In this paper, we use a computationally efficient implementation of the known variable selection method that we call the Autoregressive Likelihood Ratio (ARLR) method. This method combines sparse representation of time series data, electronic health records data (for ILI) and Google Trends data to forecast dengue and ILI incidences. This sparse representation method uses an algorithm that maximizes an appropriate likelihood ratio at every step. Using numerical experiments, we demonstrate that our method recovers the underlying sparse model much more accurately than the lasso method. We apply our method to dengue case count data from five countries/states: Brazil, Mexico, Singapore, Taiwan, and Thailand and to ILI case count data from the United States. Numerical experiments show that our method outperforms existing time series forecasting methods in forecasting the dengue and ILI case counts. In particular, our method gives a 18 percent forecast error reduction over a leading method that also uses data from multiple sources. It also performs better than other methods in predicting the peak value of the case count and the peak time.

Dengue and influenza-like illness (ILI) are leading causes of viral infection in the world and hence it is important to develop accurate methods for forecasting their incidence. We use Autoregressive Likelihood Ratio method, which is a computationally efficient implementation of the variable selection method, in order to obtain a sparse (non-lasso) representation of time series, Google Trends and electronic health records (for ILI) data. This method is used to forecast dengue incidence in five countries/states and ILI incidence in USA. We show that this method outperforms existing time series methods in forecasting these diseases. The method is general and can also be used to forecast other diseases.

Collapse

A Comparative Study on the Prediction of Occupational Diseases in China with Hybrid Algorithm Combing Models. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2019;2019:8159506. [PMID: 31662788 PMCID: PMC6791229 DOI: 10.1155/2019/8159506] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/10/2019] [Revised: 08/03/2019] [Accepted: 08/27/2019] [Indexed: 11/17/2022]

Hosseini S, Karami M, Farhadian M, Mohammadi Y. Seasonal Activity of Influenza in Iran: Application of Influenza-like Illness Data from Sentinel Sites of Healthcare Centers during 2010 to 2015. J Epidemiol Glob Health 2019;8:29-33. [PMID: 30859784 PMCID: PMC7325813 DOI: 10.2991/j.jegh.2018.08.100] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2017] [Accepted: 06/21/2018] [Indexed: 11/26/2022] Open

Zhong X, Raghib M. Revisiting the use of web search data for stock market movements. Sci Rep 2019;9:13511. [PMID: 31534170 PMCID: PMC6751183 DOI: 10.1038/s41598-019-50131-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2018] [Accepted: 09/03/2019] [Indexed: 11/09/2022] Open

Leveraging Google Trends, Twitter, and Wikipedia to Investigate the Impact of a Celebrity's Death From Rheumatoid Arthritis. J Clin Rheumatol 2019;24:188-192. [PMID: 29461342 DOI: 10.1097/rhu.0000000000000692] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]

Su K, Xu L, Li G, Ruan X, Li X, Deng P, Li X, Li Q, Chen X, Xiong Y, Lu S, Qi L, Shen C, Tang W, Rong R, Hong B, Ning Y, Long D, Xu J, Shi X, Yang Z, Zhang Q, Zhuang Z, Zhang L, Xiao J, Li Y. Forecasting influenza activity using self-adaptive AI model and multi-source data in Chongqing, China. EBioMedicine 2019;47:284-292. [PMID: 31477561 PMCID: PMC6796527 DOI: 10.1016/j.ebiom.2019.08.024] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2019] [Revised: 08/09/2019] [Accepted: 08/09/2019] [Indexed: 02/05/2023] Open

Affiliation(s)

Kun Su Department of Epidemiology, College of Preventive Medicine, Army Medical University (Third Military Medical University), Chongqing, People's Republic of China; Chongqing Municipal Center for Disease Control and Prevention, Chongqing, People's Republic of China
Liang Xu Ping An Technology (Shenzhen) Co., Ltd, Shenzhen, People's Republic of China
Guanqiao Li Comprehensive AIDS Research Center and Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, School of Medicine, Tsinghua University, Beijing, People's Republic of China
Xiaowen Ruan Ping An Technology (Shenzhen) Co., Ltd, Shenzhen, People's Republic of China
Xian Li Ping An Technology (Shenzhen) Co., Ltd, Shenzhen, People's Republic of China
Pan Deng Ping An Technology (Shenzhen) Co., Ltd, Shenzhen, People's Republic of China
Xinmi Li Ping An Technology (Shenzhen) Co., Ltd, Shenzhen, People's Republic of China
Qin Li Chongqing Municipal Center for Disease Control and Prevention, Chongqing, People's Republic of China
Xianxian Chen Ping An Technology (Shenzhen) Co., Ltd, Shenzhen, People's Republic of China
Yu Xiong Chongqing Municipal Center for Disease Control and Prevention, Chongqing, People's Republic of China
Shaofeng Lu Ping An Technology (Shenzhen) Co., Ltd, Shenzhen, People's Republic of China
Li Qi Chongqing Municipal Center for Disease Control and Prevention, Chongqing, People's Republic of China
Chaobo Shen Ping An Technology (Shenzhen) Co., Ltd, Shenzhen, People's Republic of China
Wenge Tang Chongqing Municipal Center for Disease Control and Prevention, Chongqing, People's Republic of China
Rong Rong Chongqing Municipal Center for Disease Control and Prevention, Chongqing, People's Republic of China
Boran Hong Ping An Technology (Shenzhen) Co., Ltd, Shenzhen, People's Republic of China
Yi Ning Meinian Institute of Health, Beijing, People's Republic of China
Dongyan Long Ping An Technology (Shenzhen) Co., Ltd, Shenzhen, People's Republic of China
Jiaying Xu Ping An Technology (Shenzhen) Co., Ltd, Shenzhen, People's Republic of China
Xuanling Shi Comprehensive AIDS Research Center and Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, School of Medicine, Tsinghua University, Beijing, People's Republic of China
Zhihong Yang Ping An Technology (Shenzhen) Co., Ltd, Shenzhen, People's Republic of China
Qi Zhang Comprehensive AIDS Research Center and Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, School of Medicine, Tsinghua University, Beijing, People's Republic of China
Ziqi Zhuang Ping An Technology (Shenzhen) Co., Ltd, Shenzhen, People's Republic of China
Linqi Zhang Comprehensive AIDS Research Center and Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, School of Medicine, Tsinghua University, Beijing, People's Republic of China.
Jing Xiao Ping An Technology (Shenzhen) Co., Ltd, Shenzhen, People's Republic of China.
Yafei Li Department of Epidemiology, College of Preventive Medicine, Army Medical University (Third Military Medical University), Chongqing, People's Republic of China.

Collapse

Penny SG, Akella S, Balmaseda MA, Browne P, Carton JA, Chevallier M, Counillon F, Domingues C, Frolov S, Heimbach P, Hogan P, Hoteit I, Iovino D, Laloyaux P, Martin MJ, Masina S, Moore AM, de Rosnay P, Schepers D, Sloyan BM, Storto A, Subramanian A, Nam S, Vitart F, Yang C, Fujii Y, Zuo H, O’Kane T, Sandery P, Moore T, Chapman CC. Observational Needs for Improving Ocean and Coupled Reanalysis, S2S Prediction, and Decadal Prediction. FRONTIERS IN MARINE SCIENCE 2019;6:391. [PMID: 31534949 PMCID: PMC6750049 DOI: 10.3389/fmars.2019.00391] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]

Affiliation(s)

Stephen G. Penny Department of Atmospheric and Oceanic Science, University of Maryland, College Park, MD, United States
Santha Akella National Aeronautics and Space Administration, Goddard Space Flight Center, Greenbelt, MD, United States
Magdalena A. Balmaseda European Centre for Medium-Range Weather Forecasts, Reading, United Kingdom
Philip Browne European Centre for Medium-Range Weather Forecasts, Reading, United Kingdom
James A. Carton Department of Atmospheric and Oceanic Science, University of Maryland, College Park, MD, United States
Matthieu Chevallier Météo-France, Toulouse, France
Francois Counillon Nansen Environmental and Remote Sensing Center, Bergen, Norway
Catia Domingues Antarctic Climate and Ecosystems Cooperative Research Centre, Hobart, TAS, Australia
Sergey Frolov Naval Research Laboratory, Monterey, CA, United States
Patrick Heimbach The University of Texas at Austin, Austin, TX, United States
Patrick Hogan Naval Research Laboratory, Stennis Space Center, MS, United States
Ibrahim Hoteit King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
Doroteaciro Iovino Euro-Mediterranean Center on Climate Change, Lecce, Italy
Patrick Laloyaux European Centre for Medium-Range Weather Forecasts, Reading, United Kingdom
Matthew J. Martin Met Office, Exeter, United Kingdom
Simona Masina Euro-Mediterranean Center on Climate Change, Lecce, Italy
Andrew M. Moore University of California, Santa Cruz, Santa Cruz, CA, United States
Patricia de Rosnay European Centre for Medium-Range Weather Forecasts, Reading, United Kingdom
Dinand Schepers European Centre for Medium-Range Weather Forecasts, Reading, United Kingdom
Bernadette M. Sloyan Commonwealth Scientific and Industrial Research Organisation, Canberra, ACT, Australia
Andrea Storto NATO Centre for Maritime Research and Experimentation, La Spezia, Italy
Aneesh Subramanian Department of Atmospheric and Oceanic Science, University of Colorado, Boulder, Boulder, CO, United States
SungHyun Nam Seoul National University, Seoul, South Korea
Frederic Vitart European Centre for Medium-Range Weather Forecasts, Reading, United Kingdom
Chunxue Yang Istituto di Scienze Marine, Consiglio Nazionale delle Ricerche, Rome, Italy
Yosuke Fujii JMA Meteorological Research Institute, Tsukuba, Japan
Hao Zuo European Centre for Medium-Range Weather Forecasts, Reading, United Kingdom
Terry O’Kane Commonwealth Scientific and Industrial Research Organisation, Canberra, ACT, Australia
Paul Sandery Commonwealth Scientific and Industrial Research Organisation, Canberra, ACT, Australia
Thomas Moore Commonwealth Scientific and Industrial Research Organisation, Canberra, ACT, Australia
Christopher C. Chapman Commonwealth Scientific and Industrial Research Organisation, Canberra, ACT, Australia

Collapse

Clemente L, Lu F, Santillana M. Improved Real-Time Influenza Surveillance: Using Internet Search Data in Eight Latin American Countries. JMIR Public Health Surveill 2019;5:e12214. [PMID: 30946017 PMCID: PMC6470460 DOI: 10.2196/12214] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2018] [Revised: 02/11/2019] [Accepted: 02/15/2019] [Indexed: 01/18/2023] Open

Abstract

Background

Novel influenza surveillance systems that leverage Internet-based real-time data sources including Internet search frequencies, social-network information, and crowd-sourced flu surveillance tools have shown improved accuracy over the past few years in data-rich countries like the United States. These systems not only track flu activity accurately, but they also report flu estimates a week or more ahead of the publication of reports produced by healthcare-based systems, such as those implemented and managed by the Centers for Disease Control and Prevention. Previous work has shown that the predictive capabilities of novel flu surveillance systems, like Google Flu Trends (GFT), in developing countries in Latin America have not yet delivered acceptable flu estimates.

Objective

The aim of this study was to show that recent methodological improvements on the use of Internet search engine information to track diseases can lead to improved retrospective flu estimates in multiple countries in Latin America.

Methods

A machine learning-based methodology that uses flu-related Internet search activity and historical information to monitor flu activity, named ARGO (AutoRegression with Google search), was extended to generate flu predictions for 8 Latin American countries (Argentina, Bolivia, Brazil, Chile, Mexico, Paraguay, Peru, and Uruguay) for the time period: January 2012 to December of 2016. These retrospective (out-of-sample) Influenza activity predictions were compared with historically observed flu suspected cases in each country, as reported by Flunet, an influenza surveillance database maintained by the World Health Organization. For a baseline comparison, retrospective (out-of-sample) flu estimates were produced for the same time period using autoregressive models that only leverage historical flu activity information.

Results

Our results show that ARGO-like models’ predictive power outperform autoregressive models in 6 out of 8 countries in the 2012-2016 time period. Moreover, ARGO significantly improves on historical flu estimates produced by the now discontinued GFT for the time period of 2012-2015, where GFT information is publicly available.

Conclusions

We demonstrate here that a self-correcting machine learning method, leveraging Internet-based disease-related search activity and historical flu trends, has the potential to produce reliable and timely flu estimates in multiple Latin American countries. This methodology may prove helpful to local public health officials who design and implement interventions aimed at mitigating the effects of influenza outbreaks. Our methodology generally outperforms both the now-discontinued tool GFT, and autoregressive methodologies that exploit only historical flu activity to produce future disease estimates.

Collapse

Ning S, Yang S, Kou SC. Accurate regional influenza epidemics tracking using Internet search data. Sci Rep 2019;9:5238. [PMID: 30918276 PMCID: PMC6437143 DOI: 10.1038/s41598-019-41559-6] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2018] [Accepted: 03/12/2019] [Indexed: 12/12/2022] Open

A season for all things: Phenological imprints in Wikipedia usage and their relevance to conservation. PLoS Biol 2019;17:e3000146. [PMID: 30835729 PMCID: PMC6400330 DOI: 10.1371/journal.pbio.3000146] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2018] [Accepted: 01/29/2019] [Indexed: 11/19/2022] Open

Abstract

Phenology plays an important role in many human–nature interactions, but these seasonal patterns are often overlooked in conservation. Here, we provide the first broad exploration of seasonal patterns of interest in nature across many species and cultures. Using data from Wikipedia, a large online encyclopedia, we analyzed 2.33 billion pageviews to articles for 31,751 species across 245 languages. We show that seasonality plays an important role in how and when people interact with plants and animals online. In total, over 25% of species in our data set exhibited a seasonal pattern in at least one of their language-edition pages, and seasonality is significantly more prevalent in pages for plants and animals than it is in a random selection of Wikipedia articles. Pageview seasonality varies across taxonomic clades in ways that reflect observable patterns in phenology, with groups such as insects and flowering plants having higher seasonality than mammals. Differences between Wikipedia language editions are significant; pages in languages spoken at higher latitudes exhibit greater seasonality overall, and species seldom show the same pattern across multiple language editions. These results have relevance to conservation policy formulation and to improving our understanding of what drives human interest in biodiversity.

Analysis of more than two billion page views over nearly three years for Wikipedia articles for 31,751 species across 245 languages reveals that more than a quarter of species show a seasonal pattern, and several online variations mirror real-world phenology.

Digital information archives offer novel opportunities to study human attitudes towards nature and to better understand how people interact with other species of animals and plants. The insights gained from such studies may be able to inform conservation efforts. Our study uses time-series of views to pages in the online encyclopedia Wikipedia to look at how human interest in other species varies seasonally across a wide range of different languages. In total, we extracted pageviews for 31,751 species of plants and animals across 245 Wikipedia language editions. Spanning nearly three years, our data set comprises 2.33 billion pageviews across 126,697 pages. We tested each time-series in our data set to see how well it fit a seasonal pattern and in doing so found several interesting patterns. First, seasonality is a significant factor in when people view information for many plants and animals online; over 20% of all of our species pages met our criteria for seasonality. Second, the prevalence of seasonality varies across different biological classes and also across languages. These variations appear to reflect differences in the life history of species and in the geographic distribution of languages and can correspond to phenological patterns in nature. Our results are relevant to conservationists seeking to understand how interest in various plants and animals may fluctuate over time.

Collapse

Real-Time Forecasting of Hand-Foot-and-Mouth Disease Outbreaks using the Integrating Compartment Model and Assimilation Filtering. Sci Rep 2019;9:2661. [PMID: 30804467 PMCID: PMC6389963 DOI: 10.1038/s41598-019-38930-y] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2018] [Accepted: 01/15/2019] [Indexed: 11/09/2022] Open

Ferland R, Froda S. A statistical tool for comparing seasonal ILI surveillance data. Sci Rep 2019;9:1422. [PMID: 30723245 PMCID: PMC6363783 DOI: 10.1038/s41598-018-38292-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2018] [Accepted: 12/21/2018] [Indexed: 12/02/2022] Open

Kramer SC, Shaman J. Development and validation of influenza forecasting for 64 temperate and tropical countries. PLoS Comput Biol 2019;15:e1006742. [PMID: 30811396 PMCID: PMC6411231 DOI: 10.1371/journal.pcbi.1006742] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2018] [Revised: 03/11/2019] [Accepted: 12/21/2018] [Indexed: 11/19/2022] Open

Osthus D, Daughton AR, Priedhorsky R. Even a good influenza forecasting model can benefit from internet-based nowcasts, but those benefits are limited. PLoS Comput Biol 2019;15:e1006599. [PMID: 30707689 PMCID: PMC6373968 DOI: 10.1371/journal.pcbi.1006599] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2018] [Revised: 02/13/2019] [Accepted: 10/30/2018] [Indexed: 11/19/2022] Open

Khatua A, Khatua A, Cambria E. A tale of two epidemics: Contextual Word2Vec for classifying twitter streams during outbreaks. Inf Process Manag 2019. [DOI: 10.1016/j.ipm.2018.10.010] [Citation(s) in RCA: 43] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]

Poirier C, Lavenu A, Bertaud V, Campillo-Gimenez B, Chazard E, Cuggia M, Bouzillé G. Real Time Influenza Monitoring Using Hospital Big Data in Combination with Machine Learning Methods: Comparison Study. JMIR Public Health Surveill 2018;4:e11361. [PMID: 30578212 PMCID: PMC6320394 DOI: 10.2196/11361] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2018] [Revised: 09/10/2018] [Accepted: 09/10/2018] [Indexed: 11/25/2022] Open

Fairchild G, Tasseff B, Khalsa H, Generous N, Daughton AR, Velappan N, Priedhorsky R, Deshpande A. Epidemiological Data Challenges: Planning for a More Robust Future Through Data Standards. Front Public Health 2018;6:336. [PMID: 30533407 PMCID: PMC6265573 DOI: 10.3389/fpubh.2018.00336] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2018] [Accepted: 11/01/2018] [Indexed: 12/23/2022] Open

Apollonio DE, Broyde K, Azzam A, De Guia M, Heilman J, Brock T. Pharmacy students can improve access to quality medicines information by editing Wikipedia articles. BMC MEDICAL EDUCATION 2018;18:265. [PMID: 30454046 PMCID: PMC6245851 DOI: 10.1186/s12909-018-1375-z] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/18/2017] [Accepted: 11/01/2018] [Indexed: 06/09/2023]

Chakraborty P, Lewis B, Eubank S, Brownstein JS, Marathe M, Ramakrishnan N. What to know before forecasting the flu. PLoS Comput Biol 2018;14:e1005964. [PMID: 30312305 PMCID: PMC6193572 DOI: 10.1371/journal.pcbi.1005964] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open