1
|
Hajlasz M, Pei S. Predictability of human mobility during the COVID-19 pandemic in the United States. PNAS NEXUS 2024; 3:pgae308. [PMID: 39114577 PMCID: PMC11305134 DOI: 10.1093/pnasnexus/pgae308] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/19/2024] [Accepted: 07/10/2024] [Indexed: 08/10/2024]
Abstract
Human mobility is fundamental to a range of applications including epidemic control, urban planning, and traffic engineering. While laws governing individual movement trajectories and population flows across locations have been extensively studied, the predictability of population-level mobility during the COVID-19 pandemic driven by specific activities such as work, shopping, and recreation remains elusive. Here we analyze mobility data for six place categories at the US county level from 2020 February 15 to 2021 November 23 and measure how the predictability of these mobility metrics changed during the COVID-19 pandemic. We quantify the time-varying predictability in each place category using an information-theoretic metric, permutation entropy. We find disparate predictability patterns across place categories over the course of the pandemic, suggesting differential behavioral changes in human activities perturbed by disease outbreaks. Notably, predictability change in foot traffic to residential locations is mostly in the opposite direction to other mobility categories. Specifically, visits to residences had the highest predictability during stay-at-home orders in March 2020, while visits to other location types had low predictability during this period. This pattern flipped after the lifting of restrictions during summer 2020. We identify four key factors, including weather conditions, population size, COVID-19 case growth, and government policies, and estimate their nonlinear effects on mobility predictability. Our findings provide insights on how people change their behaviors during public health emergencies and may inform improved interventions in future epidemics.
Collapse
Affiliation(s)
- Michal Hajlasz
- Department of Computer Science, Columbia University, 500 W 120th St, New York, NY 10027, USA
| | - Sen Pei
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, 722 W 168th St, New York, NY 10032, USA
| |
Collapse
|
2
|
Böttcher L, Chou T, D’Orsogna MR. Forecasting drug-overdose mortality by age in the United States at the national and county levels. PNAS NEXUS 2024; 3:pgae050. [PMID: 38725534 PMCID: PMC11079616 DOI: 10.1093/pnasnexus/pgae050] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Accepted: 01/25/2024] [Indexed: 05/12/2024]
Abstract
The drug-overdose crisis in the United States continues to intensify. Fatalities have increased 5-fold since 1999 reaching a record high of 108,000 deaths in 2021. The epidemic has unfolded through distinct waves of different drug types, uniquely impacting various age, gender, race, and ethnic groups in specific geographical areas. One major challenge in designing interventions and efficiently delivering treatment is forecasting age-specific overdose patterns at the local level. To address this need, we develop a forecasting method that assimilates observational data obtained from the CDC WONDER database with an age-structured model of addiction and overdose mortality. We apply our method nationwide and to three select areas: Los Angeles County, Cook County, and the five boroughs of New York City, providing forecasts of drug-overdose mortality and estimates of relevant epidemiological quantities, such as mortality and age-specific addiction rates.
Collapse
Affiliation(s)
- Lucas Böttcher
- Department of Computational Science and Philosophy, Frankfurt School of Finance and Management, 60322 Frankfurt am Main, Germany
| | - Tom Chou
- Department of Computational Medicine, University of California, Los Angeles, Los Angeles, CA 90095-1766, USA
| | - Maria R D’Orsogna
- Department of Computational Medicine, University of California, Los Angeles, Los Angeles, CA 90095-1766, USA
- Department of Mathematics, California State University at Northridge, Los Angeles, CA 91330-8313, USA
| |
Collapse
|
3
|
Ospina-Aguirre C, Soriano-Paños D, Olivar-Tost G, Galindo-González CC, Gómez-Gardeñes J, Osorio G. Effects of human mobility on the spread of Dengue in the region of Caldas, Colombia. PLoS Negl Trop Dis 2023; 17:e0011087. [PMID: 38011274 PMCID: PMC10703399 DOI: 10.1371/journal.pntd.0011087] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2023] [Revised: 12/07/2023] [Accepted: 09/21/2023] [Indexed: 11/29/2023] Open
Abstract
According to the World Health Organization (WHO), dengue is the most common acute arthropod-borne viral infection in the world. The spread of dengue and other infectious diseases is closely related to human activity and mobility. In this paper we analyze the effect of introducing mobility restrictions as a public health policy on the total number of dengue cases within a population. To perform the analysis, we use a complex metapopulation in which we implement a compartmental propagation model coupled with the mobility of individuals between the patches. This model is used to investigate the spread of dengue in the municipalities of Caldas (CO). Two scenarios corresponding to different types of mobility restrictions are applied. In the first scenario, the effect of restricting mobility is analyzed in three different ways: a) limiting the access to the endemic node but allowing the movement of its inhabitants, b) restricting the diaspora of the inhabitants of the endemic node but allowing the access of outsiders, and c) a total isolation of the inhabitants of the endemic node. In this scenario, the best simulation results are obtained when specific endemic nodes are isolated during a dengue outbreak, obtaining a reduction of up to 2.5% of dengue cases. Finally, the second scenario simulates a total isolation of the network, i.e., mobility between nodes is completely limited. We have found that this control measure increases the number of total dengue cases in the network by 2.36%.
Collapse
Affiliation(s)
- Carolina Ospina-Aguirre
- ABCDynamics, Facultad de ciencias exactas y naturales, Universidad Nacional de Colombia - Sede Manizales, Manizales, Colombia
- Departamento de electrónica y automatización, Universidad Autonoma de Manizales, Manizales, Colombia
| | - David Soriano-Paños
- Instituto Gulbenkian de Ciência, Oeiras, Portugal
- GOTHAM lab, Institute for Biocomputation & Physics of Complex Systems (BIFI), Zaragoza, España
| | - Gerard Olivar-Tost
- Departamento de Ciencias Naturales y Tecnología, Universidad de Aysén, Coyhaique, Chile
| | - Cristian C. Galindo-González
- Percepción y Control Inteligente (PCI), Departamento de Ingeniería Eléctrica, Electrónica y Computación, Universidad Nacional de Colombia - Sede Manizales, Manizales, Colombia
| | - Jesús Gómez-Gardeñes
- Departamento de Física de la Materia Condensada Facultad de Ciencias, Universidad de Zaragoza, Zaragoza, España
- GOTHAM lab, Institute for Biocomputation & Physics of Complex Systems (BIFI), Zaragoza, España
| | - Gustavo Osorio
- Percepción y Control Inteligente (PCI), Departamento de Ingeniería Eléctrica, Electrónica y Computación, Universidad Nacional de Colombia - Sede Manizales, Manizales, Colombia
| |
Collapse
|
4
|
Cascante-Vega J, Galanti M, Schley K, Pei S, Shaman J. Inference of transmission dynamics and retrospective forecast of invasive meningococcal disease. PLoS Comput Biol 2023; 19:e1011564. [PMID: 37889910 PMCID: PMC10655980 DOI: 10.1371/journal.pcbi.1011564] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Revised: 11/17/2023] [Accepted: 10/02/2023] [Indexed: 10/29/2023] Open
Abstract
The pathogenic bacteria Neisseria meningitidis, which causes invasive meningococcal disease (IMD), predominantly colonizes humans asymptomatically; however, invasive disease occurs in a small proportion of the population. Here, we explore the seasonality of IMD and develop and validate a suite of models for simulating and forecasting disease outcomes in the United States. We combine the models into multi-model ensembles (MME) based on the past performance of the individual models, as well as a naive equally weighted aggregation, and compare the retrospective forecast performance over a six-month forecast horizon. Deployment of the complete vaccination regimen, introduced in 2011, coincided with a change in the periodicity of IMD, suggesting altered transmission dynamics. We found that a model forced with the period obtained by local power wavelet decomposition best fit and forecast observations. In addition, the MME performed the best across the entire study period. Finally, our study included US-level data until 2022, allowing study of a possible IMD rebound after relaxation of non-pharmaceutical interventions imposed in response to the COVID-19 pandemic; however, no evidence of a rebound was found. Our findings demonstrate the ability of process-based models to retrospectively forecast IMD and provide a first analysis of the seasonality of IMD before and after the complete vaccination regimen.
Collapse
Affiliation(s)
- Jaime Cascante-Vega
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, New York, United States of America
| | - Marta Galanti
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, New York, United States of America
| | | | - Sen Pei
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, New York, United States of America
| | - Jeffrey Shaman
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, New York, United States of America
- Columbia Climate School, Columbia University, New York, New York, United States of America
| |
Collapse
|
5
|
Zeng Q, Yu X, Ni H, Xiao L, Xu T, Wu H, Chen Y, Deng H, Zhang Y, Pei S, Xiao J, Guo P. Dengue transmission dynamics prediction by combining metapopulation networks and Kalman filter algorithm. PLoS Negl Trop Dis 2023; 17:e0011418. [PMID: 37285385 DOI: 10.1371/journal.pntd.0011418] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Accepted: 05/24/2023] [Indexed: 06/09/2023] Open
Abstract
Predicting the specific magnitude and the temporal peak of the epidemic of individual local outbreaks is critical for infectious disease control. Previous studies have indicated that significant differences in spatial transmission and epidemic magnitude of dengue were influenced by multiple factors, such as mosquito population density, climatic conditions, and population movement patterns. However, there is a lack of studies that combine the above factors to explain their complex nonlinear relationships in dengue transmission and generate accurate predictions. Therefore, to study the complex spatial diffusion of dengue, this research combined the above factors and developed a network model for spatiotemporal transmission prediction of dengue fever using metapopulation networks based on human mobility. For improving the prediction accuracy of the epidemic model, the ensemble adjusted Kalman filter (EAKF), a data assimilation algorithm, was used to iteratively assimilate the observed case data and adjust the model and parameters. Our study demonstrated that the metapopulation network-EAKF system provided accurate predictions for city-level dengue transmission trajectories in retrospective forecasts of 12 cities in Guangdong province, China. Specifically, the system accurately predicts local dengue outbreak magnitude and the temporal peak of the epidemic up to 10 wk in advance. In addition, the system predicted the peak time, peak intensity, and total number of dengue cases more accurately than isolated city-specific forecasts. The general metapopulation assimilation framework presented in our study provides a methodological foundation for establishing an accurate system with finer temporal and spatial resolution for retrospectively forecasting the magnitude and temporal peak of dengue fever outbreaks. These forecasts based on the proposed method can be interoperated to better support intervention decisions and inform the public of potential risks of disease transmission.
Collapse
Affiliation(s)
- Qinghui Zeng
- Department of Preventive Medicine, Shantou University Medical College, Shantou, China
| | - Xiaolin Yu
- Department of Preventive Medicine, Shantou University Medical College, Shantou, China
| | - Haobo Ni
- Department of Preventive Medicine, Shantou University Medical College, Shantou, China
| | - Lina Xiao
- Department of Preventive Medicine, Shantou University Medical College, Shantou, China
| | - Ting Xu
- Department of Preventive Medicine, Shantou University Medical College, Shantou, China
| | - Haisheng Wu
- Department of Preventive Medicine, Shantou University Medical College, Shantou, China
| | - Yuliang Chen
- Department of Medical Quality Management, Nanfang Hospital, Guangzhou, China
| | - Hui Deng
- Institute of Vector Control, Guangdong Provincial Center for Disease Control and Prevention, Guangzhou, China
| | - Yingtao Zhang
- Institute of Infectious Disease Control and Prevention, Guangdong Provincial Center for Disease Control and Prevention, Guangzhou, China
| | - Sen Pei
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, New York, United States of America
| | - Jianpeng Xiao
- Guangdong Provincial Institute of Public Health, Guangdong Provincial Center for Disease Control and Prevention, Guangzhou, China
| | - Pi Guo
- Department of Preventive Medicine, Shantou University Medical College, Shantou, China
- Guangdong Provincial Key Laboratory of Infectious Diseases and Molecular Immunopathology, Shantou, China
| |
Collapse
|
6
|
Böttcher L, Chou T, D’Orsogna MR. Modeling and forecasting age-specific drug overdose mortality in the United States. THE EUROPEAN PHYSICAL JOURNAL. SPECIAL TOPICS 2023; 232:1-10. [PMID: 37359186 PMCID: PMC10132445 DOI: 10.1140/epjs/s11734-023-00801-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Accepted: 02/24/2023] [Indexed: 06/28/2023]
Abstract
Drug overdose deaths continue to increase in the United States for all major drug categories. Over the past two decades the total number of overdose fatalities has increased more than fivefold; since 2013 the surge in overdose rates is primarily driven by fentanyl and methamphetamines. Different drug categories and factors such as age, gender, and ethnicity are associated with different overdose mortality characteristics that may also change in time. For example, the average age at death from a drug overdose has decreased from 1940 to 1990 while the overall mortality rate has steadily increased. To provide insight into the population-level dynamics of drug overdose mortality, we develop an age-structured model for drug addiction. Using an augmented ensemble Kalman filter (EnKF), we show through a simple example how our model can be combined with synthetic observation data to estimate mortality rate and an age-distribution parameter. Finally, we use an EnKF to combine our model with observation data on overdose fatalities in the United States from 1999 to 2020 to forecast the evolution of overdose trends and estimate model parameters.
Collapse
Affiliation(s)
- Lucas Böttcher
- Department of Computational Science and Philosophy, Frankfurt School of Finance and Management, 60322 Frankfurt am Main, Germany
| | - Tom Chou
- Department of Computational Medicine, University of California, Los Angeles, Los Angeles, 90095 CA USA
- Department of Mathematics, University of California, Los Angeles, Los Angeles, 90095 CA USA
| | - Maria R. D’Orsogna
- Department of Computational Medicine, University of California, Los Angeles, Los Angeles, 90095 CA USA
- Department of Mathematics, California State University at Northridge, Los Angeles, 91330 CA USA
| |
Collapse
|
7
|
Araújo JLB, Oliveira EA, Lima Neto AS, Andrade JS, Furtado V. Unveiling the paths of COVID-19 in a large city based on public transportation data. Sci Rep 2023; 13:5761. [PMID: 37031258 PMCID: PMC10082688 DOI: 10.1038/s41598-023-32786-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2023] [Accepted: 04/02/2023] [Indexed: 04/10/2023] Open
Abstract
Human mobility plays a key role in the dissemination of infectious diseases around the world. However, the complexity introduced by commuting patterns in the daily life of cities makes such a role unclear, especially at the intracity scale. Here, we propose a multiplex network fed with 9 months of mobility data with more than 107 million public bus validations in order to understand the relation between urban mobility and the spreading of COVID-19 within a large city, namely, Fortaleza in the northeast of Brazil. Our results suggest that the shortest bus rides in Fortaleza, measured in the number of daily rides among all neighborhoods, decreased [Formula: see text]% more than the longest ones after an epidemic wave. Such a result is the opposite of what has been observed at the intercity scale. We also find that mobility changes among the neighborhoods are synchronous and geographically homogeneous. Furthermore, we find that the most central neighborhoods in mobility are the first targets for infectious disease outbreaks, which is quantified here in terms of the positive linear relation between the disease arrival time and the average of the closeness centrality ranking. These central neighborhoods are also the top neighborhoods in the number of reported cases at the end of an epidemic wave as indicated by the exponential decay behavior of the disease arrival time in relation to the number of accumulated reported cases with decay constant [Formula: see text] days. We believe that these results can help in the development of new strategies to impose restriction measures in the cities guiding decision-makers with smart actions in public health policies, as well as supporting future research on urban mobility and epidemiology.
Collapse
Affiliation(s)
- Jorge L B Araújo
- Laboratório de Ciência de Dados e Inteligência Artificial Universidade de Fortaleza, Fortaleza, Ceará, 60811-905, Brazil.
| | - Erneson A Oliveira
- Laboratório de Ciência de Dados e Inteligência Artificial Universidade de Fortaleza, Fortaleza, Ceará, 60811-905, Brazil
- Programa de Pós Graduação em Informática Aplicada Universidade de Fortaleza, Fortaleza, Ceará, 60811-905, Brazil
- Mestrado Profissional em Ciências da Cidade Universidade de Fortaleza, Fortaleza, Ceará, 60811-905, Brazil
| | - Antonio S Lima Neto
- Célula de Vigilância Epidemiológica Secretaria Municipal da Saúde, Fortaleza, Ceará, 60810-670, Brazil
- Centro de Ciências da Saúde Universidade de Fortaleza, Fortaleza, Ceará, 60811-905, Brazil
| | - José S Andrade
- Departamento de Física, Universidade Federal do Ceará, Fortaleza, Ceará, 60455-760, Brazil
| | - Vasco Furtado
- Laboratório de Ciência de Dados e Inteligência Artificial Universidade de Fortaleza, Fortaleza, Ceará, 60811-905, Brazil
- Programa de Pós Graduação em Informática Aplicada Universidade de Fortaleza, Fortaleza, Ceará, 60811-905, Brazil
- Empresa de Tecnologia da Informação do Ceará Governo do Estado do Ceará, Fortaleza, Ceará, 60130-240, Brazil
| |
Collapse
|
8
|
Assad DBN, Cara J, Ortega-Mier M. Comparing Short-Term Univariate and Multivariate Time-Series Forecasting Models in Infectious Disease Outbreak. Bull Math Biol 2023; 85:9. [PMID: 36565344 PMCID: PMC9789525 DOI: 10.1007/s11538-022-01112-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2022] [Accepted: 11/29/2022] [Indexed: 12/25/2022]
Abstract
Predicting infectious disease outbreak impacts on population, healthcare resources and economics and has received a special academic focus during coronavirus (COVID-19) pandemic. Focus on human disease outbreak prediction techniques in current literature, Marques et al. (Predictive models for decision support in the COVID-19 crisis. Springer, Switzerland, 2021) state that there are four main methods to address forecasting problem: compartmental models, classic statistical models, space-state models and machine learning models. We adopt their framework to compare our research with previous works. Besides being divided by methods, forecasting problems can also be divided by the number of variables that are considered to make predictions. Considering this number of variables, forecasting problems can be classified as univariate, causal and multivariate models. Multivariate approaches have been applied in less than 10% of research found. This research is the first attempt to evaluate, over real time-series data of 3 different countries with univariate and multivariate methods to provide a short-term prediction. In literature we found no research with that scope and aim. A comparison of univariate and multivariate methods has been conducted and we concluded that besides the strong potential of multivariate methods, in our research univariate models presented best results in almost all regions' predictions.
Collapse
Affiliation(s)
- Daniel Bouzon Nagem Assad
- Universidad Politécnica de Madrid, Department of Organization Engineering, Business Administration and Statistics, Escuela Técnica Superior de Ingenieros Industriales, José Gutiérrez Abascal, 2, 28006 Madrid, Spain ,Universidade do Estado do Rio de Janeiro, Rua São Francisco Xavier, 524, Maracanã, 20550-900 Rio de Janeiro, Brazil
| | - Javier Cara
- Universidad Politécnica de Madrid, Department of Organization Engineering, Business Administration and Statistics, Escuela Técnica Superior de Ingenieros Industriales, José Gutiérrez Abascal, 2, 28006 Madrid, Spain
| | - Miguel Ortega-Mier
- Universidad Politécnica de Madrid, Department of Organization Engineering, Business Administration and Statistics, Escuela Técnica Superior de Ingenieros Industriales, José Gutiérrez Abascal, 2, 28006 Madrid, Spain
| |
Collapse
|
9
|
Keshavamurthy R, Dixon S, Pazdernik KT, Charles LE. Predicting infectious disease for biopreparedness and response: A systematic review of machine learning and deep learning approaches. One Health 2022; 15:100439. [PMID: 36277100 PMCID: PMC9582566 DOI: 10.1016/j.onehlt.2022.100439] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2022] [Revised: 09/20/2022] [Accepted: 09/30/2022] [Indexed: 11/21/2022] Open
Abstract
The complex, unpredictable nature of pathogen occurrence has required substantial efforts to accurately predict infectious diseases (IDs). With rising popularity of Machine Learning (ML) and Deep Learning (DL) techniques combined with their unique ability to uncover connections between large amounts of diverse data, we conducted a PRISMA systematic review to investigate advances in ID prediction for human and animal diseases using ML and DL. This review included the type of IDs modeled, ML and DL techniques utilized, geographical distribution, prediction tasks performed, input features utilized, spatial and temporal scales, error metrics used, computational efficiency, uncertainty quantification, and missing data handling methods. Among 237 relevant articles published between January 2001 and May 2021, highly contagious diseases in humans were most often represented, including COVID-19 (37.1%), influenza/influenza-like illnesses (9.3%), dengue (8.9%), and malaria (5.1%). Out of 37 diseases identified, 51.4% were zoonotic, 37.8% were human-only, and 8.1% were animal-only, with only 1.6% economically significant, non-zoonotic livestock diseases. Despite the number of zoonoses, 86.5% of articles modeled humans whereas only a few articles (5.1%) contained more than one host species. Eastern Asia (32.5%), North America (17.7%), and Southern Asia (13.1%) were the most represented locations. Frequent approaches included tree-based ML (38.4%) and feed-forward neural networks (26.6%). Articles predicted temporal incidence (66.7%), disease risk (38.0%), and/or spatial movement (31.2%). Less than 10% of studies addressed uncertainty quantification, computational efficiency, and missing data, which are essential to operational use and deployment. This study highlights trends and gaps in ML and DL for ID prediction, providing guidelines for future works to better support biopreparedness and response. To fully utilize ML and DL for improved ID forecasting, models should include the full disease ecology in a One-Health context, important food and agricultural diseases, underrepresented hotspots, and important metrics required for operational deployment.
Collapse
Affiliation(s)
- Ravikiran Keshavamurthy
- Pacific Northwest National Laboratory, Richland, WA 99354, USA
- Paul G. Allen School for Global Health, Washington State University, Pullman, WA 99164, USA
| | - Samuel Dixon
- Pacific Northwest National Laboratory, Richland, WA 99354, USA
| | - Karl T. Pazdernik
- Pacific Northwest National Laboratory, Richland, WA 99354, USA
- Department of Statistics, North Carolina State University, Raleigh, NC 27695, USA
| | - Lauren E. Charles
- Pacific Northwest National Laboratory, Richland, WA 99354, USA
- Paul G. Allen School for Global Health, Washington State University, Pullman, WA 99164, USA
| |
Collapse
|
10
|
Yin S, Ren C, Shi Y, Hua J, Yuan HY, Tian LW. A Systematic Review on Modeling Methods and Influential Factors for Mapping Dengue-Related Risk in Urban Settings. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022; 19:ijerph192215265. [PMID: 36429980 PMCID: PMC9690886 DOI: 10.3390/ijerph192215265] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/18/2022] [Revised: 11/11/2022] [Accepted: 11/17/2022] [Indexed: 05/12/2023]
Abstract
Dengue fever is an acute mosquito-borne disease that mostly spreads within urban or semi-urban areas in warm climate zones. The dengue-related risk map is one of the most practical tools for executing effective control policies, breaking the transmission chain, and preventing disease outbreaks. Mapping risk at a small scale, such as at an urban level, can demonstrate the spatial heterogeneities in complicated built environments. This review aims to summarize state-of-the-art modeling methods and influential factors in mapping dengue fever risk in urban settings. Data were manually extracted from five major academic search databases following a set of querying and selection criteria, and a total of 28 studies were analyzed. Twenty of the selected papers investigated the spatial pattern of dengue risk by epidemic data, whereas the remaining eight papers developed an entomological risk map as a proxy for potential dengue burden in cities or agglomerated urban regions. The key findings included: (1) Big data sources and emerging data-mining techniques are innovatively employed for detecting hot spots of dengue-related burden in the urban context; (2) Bayesian approaches and machine learning algorithms have become more popular as spatial modeling tools for predicting the distribution of dengue incidence and mosquito presence; (3) Climatic and built environmental variables are the most common factors in making predictions, though the effects of these factors vary with the mosquito species; (4) Socio-economic data may be a better representation of the huge heterogeneity of risk or vulnerability spatial distribution on an urban scale. In conclusion, for spatially assessing dengue-related risk in an urban context, data availability and the purpose for mapping determine the analytical approaches and modeling methods used. To enhance the reliabilities of predictive models, sufficient data about dengue serotyping, socio-economic status, and spatial connectivity may be more important for mapping dengue-related risk in urban settings for future studies.
Collapse
Affiliation(s)
- Shi Yin
- Faculty of Architecture, The University of Hong Kong, Pokfulam, Hong Kong SAR, China
- School of Architecture, South China University of Technology, Guangzhou 510641, China
| | - Chao Ren
- Faculty of Architecture, The University of Hong Kong, Pokfulam, Hong Kong SAR, China
- Correspondence:
| | - Yuan Shi
- Department of Geography and Planning, University of Liverpool, Liverpool L69 3BX, UK
| | - Junyi Hua
- School of International Affairs and Public Administration, Ocean University of China, Qingdao 266100, China
| | - Hsiang-Yu Yuan
- Department of Biomedical Sciences, City University of Hong Kong, Kowloon, Hong Kong SAR, China
| | - Lin-Wei Tian
- School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Pokfulam, Hong Kong SAR, China
| |
Collapse
|
11
|
Li Z. Forecasting Weekly Dengue Cases by Integrating Google Earth Engine-Based Risk Predictor Generation and Google Colab-Based Deep Learning Modeling in Fortaleza and the Federal District, Brazil. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022; 19:13555. [PMID: 36294134 PMCID: PMC9603269 DOI: 10.3390/ijerph192013555] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/02/2022] [Revised: 10/15/2022] [Accepted: 10/18/2022] [Indexed: 06/16/2023]
Abstract
Efficient and accurate dengue risk prediction is an important basis for dengue prevention and control, which faces challenges, such as downloading and processing multi-source data to generate risk predictors and consuming significant time and computational resources to train and validate models locally. In this context, this study proposed a framework for dengue risk prediction by integrating big geospatial data cloud computing based on Google Earth Engine (GEE) platform and artificial intelligence modeling on the Google Colab platform. It enables defining the epidemiological calendar, delineating the predominant area of dengue transmission in cities, generating the data of risk predictors, and defining multi-date ahead prediction scenarios. We implemented the experiments based on weekly dengue cases during 2013-2020 in the Federal District and Fortaleza, Brazil to evaluate the performance of the proposed framework. Four predictors were considered, including total rainfall (Rsum), mean temperature (Tmean), mean relative humidity (RHmean), and mean normalized difference vegetation index (NDVImean). Three models (i.e., random forest (RF), long-short term memory (LSTM), and LSTM with attention mechanism (LSTM-ATT)), and two modeling scenarios (i.e., modeling with or without dengue cases) were set to implement 1- to 4-week ahead predictions. A total of 24 models were built, and the results showed in general that LSTM and LSTM-ATT models outperformed RF models; modeling could benefit from using historical dengue cases as one of the predictors, and it makes the predicted curve fluctuation more stable compared with that only using climate and environmental factors; attention mechanism could further improve the performance of LSTM models. This study provides implications for future dengue risk prediction in terms of the effectiveness of GEE-based big geospatial data processing for risk predictor generation and Google Colab-based risk modeling and presents the benefits of using historical dengue data as one of the input features and the attention mechanism for LSTM modeling.
Collapse
Affiliation(s)
- Zhichao Li
- Key Laboratory of Land Surface Pattern and Simulation, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China
| |
Collapse
|
12
|
Time-Scale Analysis and Parameter Fitting for Vector-Borne Diseases with Spatial Dynamics. Bull Math Biol 2022; 84:124. [PMID: 36121515 DOI: 10.1007/s11538-022-01083-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2021] [Accepted: 09/07/2022] [Indexed: 11/02/2022]
Abstract
Vector-borne diseases are progressively spreading in a growing number of countries, and it has the potential to invade new areas and habitats. From the dynamical perspective, the spatial-temporal interaction of models that try to adjust to such events is rich and challenging. The first challenge is to address the dynamics of vectors (very fast and local) and the dynamics of humans (very heterogeneous and non-local). The objective of this work is to use the well-known Ross-Macdonald models, identifying different time scales, incorporating human spatial movements and estimate in a suitable way the parameters. We will concentrate on a practical example, a simplified space model, and apply it to dengue spread in the state of Rio de Janeiro, Brazil.
Collapse
|
13
|
de Lima CL, da Silva ACG, Moreno GMM, Cordeiro da Silva C, Musah A, Aldosery A, Dutra L, Ambrizzi T, Borges IVG, Tunali M, Basibuyuk S, Yenigün O, Massoni TL, Browning E, Jones K, Campos L, Kostkova P, da Silva Filho AG, dos Santos WP. Temporal and Spatiotemporal Arboviruses Forecasting by Machine Learning: A Systematic Review. Front Public Health 2022; 10:900077. [PMID: 35719644 PMCID: PMC9204152 DOI: 10.3389/fpubh.2022.900077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2022] [Accepted: 05/03/2022] [Indexed: 11/13/2022] Open
Abstract
Arboviruses are a group of diseases that are transmitted by an arthropod vector. Since they are part of the Neglected Tropical Diseases that pose several public health challenges for countries around the world. The arboviruses' dynamics are governed by a combination of climatic, environmental, and human mobility factors. Arboviruses prediction models can be a support tool for decision-making by public health agents. In this study, we propose a systematic literature review to identify arboviruses prediction models, as well as models for their transmitter vector dynamics. To carry out this review, we searched reputable scientific bases such as IEE Xplore, PubMed, Science Direct, Springer Link, and Scopus. We search for studies published between the years 2015 and 2020, using a search string. A total of 429 articles were returned, however, after filtering by exclusion and inclusion criteria, 139 were included. Through this systematic review, it was possible to identify the challenges present in the construction of arboviruses prediction models, as well as the existing gap in the construction of spatiotemporal models.
Collapse
Affiliation(s)
- Clarisse Lins de Lima
- Nucleus for Computer Engineering, Polytechnique School of the University of Pernambuco, Poli-UPE, Recife, Brazil
| | - Ana Clara Gomes da Silva
- Nucleus for Computer Engineering, Polytechnique School of the University of Pernambuco, Poli-UPE, Recife, Brazil
| | | | | | - Anwar Musah
- Centre for Digital Public Health and Emergencies, Institute for Risk and Disaster Reduction, University College London, London, United Kingdom
| | - Aisha Aldosery
- Centre for Digital Public Health and Emergencies, Institute for Risk and Disaster Reduction, University College London, London, United Kingdom
| | - Livia Dutra
- Department of Atmospheric Sciences, IAG-USP, University of São Paulo, São Paulo, Brazil
| | - Tercio Ambrizzi
- Department of Atmospheric Sciences, IAG-USP, University of São Paulo, São Paulo, Brazil
| | - Iuri V. G. Borges
- Department of Atmospheric Sciences, IAG-USP, University of São Paulo, São Paulo, Brazil
| | - Merve Tunali
- Boǧaziçi University, Institute of Environmental Sciences, Istanbul, Turkey
| | - Selma Basibuyuk
- Boǧaziçi University, Institute of Environmental Sciences, Istanbul, Turkey
| | - Orhan Yenigün
- Boǧaziçi University, Institute of Environmental Sciences, Istanbul, Turkey
| | - Tiago Lima Massoni
- Department of Systems and Computing, Federal University of Campina Grande, Campina Grande, Brazil
| | - Ella Browning
- Centre for Biodiversity and Environment Research, Department of Genetics, Evolution and Environment, University College London, London, United Kingdom
| | - Kate Jones
- Centre for Biodiversity and Environment Research, Department of Genetics, Evolution and Environment, University College London, London, United Kingdom
| | - Luiza Campos
- Department of Civil Environmental and Geomatic Engineering, University College London, London, United Kingdom
| | - Patty Kostkova
- Centre for Digital Public Health and Emergencies, Institute for Risk and Disaster Reduction, University College London, London, United Kingdom
| | | | | |
Collapse
|
14
|
Chen Y, Liu T, Yu X, Zeng Q, Cai Z, Wu H, Zhang Q, Xiao J, Ma W, Pei S, Guo P. An ensemble forecast system for tracking dynamics of dengue outbreaks and its validation in China. PLoS Comput Biol 2022; 18:e1010218. [PMID: 35759513 PMCID: PMC9269975 DOI: 10.1371/journal.pcbi.1010218] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2021] [Revised: 07/08/2022] [Accepted: 05/17/2022] [Indexed: 02/05/2023] Open
Abstract
As a common vector-borne disease, dengue fever remains challenging to predict due to large variations in epidemic size across seasons driven by a number of factors including population susceptibility, mosquito density, meteorological conditions, geographical factors, and human mobility. An ensemble forecast system for dengue fever is first proposed that addresses the difficulty of predicting outbreaks with drastically different scales. The ensemble forecast system based on a susceptible-infected-recovered (SIR) type of compartmental model coupled with a data assimilation method called the ensemble adjusted Kalman filter (EAKF) is constructed to generate real-time forecasts of dengue fever spread dynamics. The model was informed by meteorological and mosquito density information to depict the transmission of dengue virus among human and mosquito populations, and generate predictions. To account for the dramatic variations of outbreak size in different seasons, the effective population size parameter that is sequentially updated to adjust the predicted outbreak scale is introduced into the model. Before optimizing the transmission model, we update the effective population size using the most recent observations and historical records so that the predicted outbreak size is dynamically adjusted. In the retrospective forecast of dengue outbreaks in Guangzhou, China during the 2011-2017 seasons, the proposed forecast model generates accurate projections of peak timing, peak intensity, and total incidence, outperforming a generalized additive model approach. The ensemble forecast system can be operated in real-time and inform control planning to reduce the burden of dengue fever.
Collapse
Affiliation(s)
- Yuliang Chen
- Department of Preventive Medicine, Shantou University Medical College, Shantou China
| | - Tao Liu
- Guangdong Provincial Institute of Public Health, Guangdong Provincial Center for Disease Control and Prevention, Guangzhou, China
| | - Xiaolin Yu
- Department of Preventive Medicine, Shantou University Medical College, Shantou China
| | - Qinghui Zeng
- Department of Preventive Medicine, Shantou University Medical College, Shantou China
| | - Zixi Cai
- Shantou Center for Disease Control and Prevention, Shantou, China
| | - Haisheng Wu
- Department of Preventive Medicine, Shantou University Medical College, Shantou China
| | - Qingying Zhang
- Department of Preventive Medicine, Shantou University Medical College, Shantou China
| | - Jianpeng Xiao
- Guangdong Provincial Institute of Public Health, Guangdong Provincial Center for Disease Control and Prevention, Guangzhou, China
| | - Wenjun Ma
- Guangdong Provincial Institute of Public Health, Guangdong Provincial Center for Disease Control and Prevention, Guangzhou, China
- * E-mail: (WM); (SP); (PG)
| | - Sen Pei
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, United States of America
- * E-mail: (WM); (SP); (PG)
| | - Pi Guo
- Department of Preventive Medicine, Shantou University Medical College, Shantou China
- * E-mail: (WM); (SP); (PG)
| |
Collapse
|
15
|
Shragai T, Pérez-Pérez J, Del Pilar Quimbayo-Forero M, Rojo R, Harrington LC, Rúa-Uribe G. Distance to public transit predicts spatial distribution of dengue virus incidence in Medellín, Colombia. Sci Rep 2022; 12:8333. [PMID: 35585133 PMCID: PMC9117184 DOI: 10.1038/s41598-022-12115-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2021] [Accepted: 05/04/2022] [Indexed: 12/12/2022] Open
Abstract
Dengue is a growing global threat in some of the world’s most rapidly growing landscapes. Research shows that urbanization and human movement affect the spatial dynamics and magnitude of dengue outbreaks; however, precise effects of urban growth on dengue are not well understood because of a lack of sufficiently fine-scaled data. We analyzed nine years of address-level dengue case data in Medellin, Colombia during a period of public transit expansion. We correlate changes in the spread and magnitude of localized outbreaks to changes in accessibility and usage of public transit. Locations closer to and with a greater utilization of public transit had greater dengue incidence. This relationship was modulated by socioeconomic status; lower socioeconomic status locations experienced stronger effects of public transit accessibility and usage on dengue incidence. Public transit is a vital urban resource, particularly among low socioeconomic populations. These results highlight the importance of public health services concurrent with urban growth.
Collapse
Affiliation(s)
- Talya Shragai
- Department of Entomology, Cornell University, Ithaca, NY, 14853, USA
| | | | | | - Raúl Rojo
- Centro Administrativo la Alpujarra, Secretaría de Salud de Medellín, 050015, Medellín, Colombia
| | | | | |
Collapse
|
16
|
Sanchez-Gendriz I, de Souza GF, de Andrade IGM, Neto ADD, de Medeiros Tavares A, Barros DMS, de Morais AHF, Galvão-Lima LJ, de Medeiros Valentim RA. Data-driven computational intelligence applied to dengue outbreak forecasting: a case study at the scale of the city of Natal, RN-Brazil. Sci Rep 2022; 12:6550. [PMID: 35449179 PMCID: PMC9023501 DOI: 10.1038/s41598-022-10512-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2021] [Accepted: 04/08/2022] [Indexed: 01/01/2023] Open
Abstract
Dengue is recognized as a health problem that causes significant socioeconomic impacts throughout the world, affecting millions of people each year. A commonly used method for monitoring the dengue vector is to count the eggs that Aedes aegypti mosquitoes have laid in spatially distributed ovitraps. Given this approach, the present study uses a database collected from 397 ovitraps allocated across the city of Natal, RN—Brazil. The Egg Density Index for each neighborhood was computed weekly, over four complete years (from 2016 to 2019), and simultaneously analyzed with the dengue case incidence. Our results illustrate that the incidence of dengue is related to the socioeconomic level of the neighborhoods in the city of Natal. A deep learning algorithm was used to predict future dengue case incidence, either based on the previous weeks of dengue incidence or the number of eggs present in the ovitraps. The analysis reveals that ovitrap data allows earlier prediction (four to six weeks) compared to dengue incidence itself (one week). Therefore, the results validate that the quantification of Aedes aegypti eggs can be valuable for the early planning of public health interventions.
Collapse
Affiliation(s)
- Ignacio Sanchez-Gendriz
- Laboratory for Technological Innovation in Health (LAIS), Hospital Universitário Onofre Lopes, Federal University of Rio Grande Do Norte (UFRN), Natal, Rio Grande do Norte, Brazil. .,Department of Computer Engineering and Automation, UFRN, Natal, Rio Grande do Norte, Brazil.
| | - Gustavo Fontoura de Souza
- Advanced Nucleus of Technological Innovation (NAVI), Federal Institute of Rio Grande Do Norte (IFRN), Natal, Rio Grande do Norte, Brazil
| | - Ion G M de Andrade
- Laboratory for Technological Innovation in Health (LAIS), Hospital Universitário Onofre Lopes, Federal University of Rio Grande Do Norte (UFRN), Natal, Rio Grande do Norte, Brazil
| | | | | | - Daniele M S Barros
- Laboratory for Technological Innovation in Health (LAIS), Hospital Universitário Onofre Lopes, Federal University of Rio Grande Do Norte (UFRN), Natal, Rio Grande do Norte, Brazil
| | - Antonio Higor Freire de Morais
- Advanced Nucleus of Technological Innovation (NAVI), Federal Institute of Rio Grande Do Norte (IFRN), Natal, Rio Grande do Norte, Brazil
| | - Leonardo J Galvão-Lima
- Laboratory for Technological Innovation in Health (LAIS), Hospital Universitário Onofre Lopes, Federal University of Rio Grande Do Norte (UFRN), Natal, Rio Grande do Norte, Brazil
| | - Ricardo Alexsandro de Medeiros Valentim
- Laboratory for Technological Innovation in Health (LAIS), Hospital Universitário Onofre Lopes, Federal University of Rio Grande Do Norte (UFRN), Natal, Rio Grande do Norte, Brazil
| |
Collapse
|
17
|
Terroso-Saenz F, Muñoz A, Arcas F, Curado M. An analysis of twitter as a relevant human mobility proxy: A comparative approach in spain during the COVID-19 pandemic. GEOINFORMATICA 2022; 26:677-706. [PMID: 35194389 PMCID: PMC8853326 DOI: 10.1007/s10707-021-00460-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/03/2021] [Revised: 11/16/2021] [Accepted: 12/21/2021] [Indexed: 06/01/2023]
Abstract
During the last years, the analysis of spatio-temporal data extracted from Online Social Networks (OSNs) has become a prominent course of action within the human-mobility mining discipline. Due to the noisy and sparse nature of these data, an important effort has been done on validating these platforms as suitable mobility proxies. However, such a validation has been usually based on the computation of certain features from the raw spatio-temporal trajectories extracted from OSN documents. Hence, there is a scarcity of validation studies that evaluate whether geo-tagged OSN data are able to measure the evolution of the mobility in a region at multiple spatial scales. For that reason, this work proposes a comprehensive comparison of a nation-scale Twitter (TWT) dataset and an official mobility survey from the Spanish National Institute of Statistics. The target time period covers a three-month interval during which Spain was heavily affected by the COVID-19 pandemic. Both feeds have been compared in this context by considering different mobility-related features and spatial scales. The results show that TWT could capture only a limited number features of the latent mobility behaviour of Spain during the study period.
Collapse
Affiliation(s)
| | - Andres Muñoz
- Department of Computer Engineering, University of Cádiz, Puerto Real (Cádiz), Spain
| | - Francisco Arcas
- High Polytechnic School, Catholic University of Murcia, Murcia, Spain
| | - Manuel Curado
- High Polytechnic School, Catholic University of Murcia, Murcia, Spain
| |
Collapse
|
18
|
Li Z, Gurgel H, Xu L, Yang L, Dong J. Improving Dengue Forecasts by Using Geospatial Big Data Analysis in Google Earth Engine and the Historical Dengue Information-Aided Long Short Term Memory Modeling. BIOLOGY 2022; 11:biology11020169. [PMID: 35205036 PMCID: PMC8869738 DOI: 10.3390/biology11020169] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/03/2021] [Revised: 01/04/2022] [Accepted: 01/17/2022] [Indexed: 11/26/2022]
Abstract
Simple Summary Forecasting dengue cases often face challenges from (1) time-effectiveness due to time-consuming satellite data downloading and processing, (2) weak spatial representation due to data dependence on administrative unit-based statistics or weather station-based observations, and (3) stagnant accuracy without historical dengue cases. With the advance of the geospatial big data cloud computing in Google Earth Engine and deep learning, this study proposed an efficient framework of dengue prediction at an epidemiological week basis using geospatial big data analysis in Google Earth Engine and Long Short Term Memory modeling. We focused on the dengue epidemics in the Federal District of Brazil during 2007–2019. Based on Google Earth Engine and epidemiological calendar, we computed the weekly composite for each dengue driving factor, and spatially aggregated the pixel values into dengue transmission areas to generate the time series of driving factors. A multi-step-ahead Long Short Term Memory modeling was used, and the time-differenced natural log-transformed dengue cases and the time series of driving factors were considered as outcomes and explantary factors, respectively, with two modeling scenarios (with and without historical cases). The performance is better when historical cases were used, and the 5-weeks-ahead forecast has the best performance. Abstract Timely and accurate forecasts of dengue cases are of great importance for guiding disease prevention strategies, but still face challenges from (1) time-effectiveness due to time-consuming satellite data downloading and processing, (2) weak spatial representation capability due to data dependence on administrative unit-based statistics or weather station-based observations, and (3) stagnant accuracy without the application of historical case information. Geospatial big data, cloud computing platforms (e.g., Google Earth Engine, GEE), and emerging deep learning algorithms (e.g., long short term memory, LSTM) provide new opportunities for advancing these efforts. Here, we focused on the dengue epidemics in the urban agglomeration of the Federal District of Brazil (FDB) during 2007–2019. A new framework was proposed using geospatial big data analysis in the Google Earth Engine (GEE) platform and long short term memory (LSTM) modeling for dengue case forecasts over an epidemiological week basis. We first defined a buffer zone around an impervious area as the main area of dengue transmission by considering the impervious area as a human-dominated area and used the maximum distance of the flight range of Aedes aegypti and Aedes albopictus as a buffer distance. Those zones were used as units for further attribution analyses of dengue epidemics by aggregating the pixel values into the zones. The near weekly composite of potential driving factors was generated in GEE using the epidemiological weeks during 2007–2019, from the relevant geospatial data with daily or sub-daily temporal resolution. A multi-step-ahead LSTM model was used, and the time-differenced natural log-transformed dengue cases were used as outcomes. Two modeling scenarios (with and without historical dengue cases) were set to examine the potential of historical information on dengue forecasts. The results indicate that the performance was better when historical dengue cases were used and the 5-weeks-ahead forecast had the best performance, and the peak of a large outbreak in 2019 was accurately forecasted. The proposed framework in this study suggests the potential of the GEE platform, the LSTM algorithm, as well as historical information for dengue risk forecasting, which can easily be extensively applied to other regions or globally for timely and practical dengue forecasts.
Collapse
Affiliation(s)
- Zhichao Li
- Key Laboratory of Land Surface Pattern and Simulation, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China; (Z.L.); (L.Y.)
| | - Helen Gurgel
- Department of Geography, University of Brasilia (UnB), Brasilia 70910-900, Brazil;
| | - Lei Xu
- Vanke School of Public Health, Tsinghua University, Beijing 100084, China;
| | - Linsheng Yang
- Key Laboratory of Land Surface Pattern and Simulation, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China; (Z.L.); (L.Y.)
| | - Jinwei Dong
- Key Laboratory of Land Surface Pattern and Simulation, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China; (Z.L.); (L.Y.)
- Correspondence:
| |
Collapse
|
19
|
Koplewitz G, Lu F, Clemente L, Buckee C, Santillana M. Predicting dengue incidence leveraging internet-based data sources. A case study in 20 cities in Brazil. PLoS Negl Trop Dis 2022; 16:e0010071. [PMID: 35073316 PMCID: PMC8824328 DOI: 10.1371/journal.pntd.0010071] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2020] [Revised: 02/08/2022] [Accepted: 12/07/2021] [Indexed: 11/25/2022] Open
Abstract
The dengue virus affects millions of people every year worldwide, causing large epidemic outbreaks that disrupt people’s lives and severely strain healthcare systems. In the absence of a reliable vaccine against dengue or an effective treatment to manage the illness in humans, most efforts to combat dengue infections have focused on preventing its vectors, mainly the Aedes aegypti mosquito, from flourishing across the world. These mosquito-control strategies need reliable disease activity surveillance systems to be deployed. Despite significant efforts to estimate dengue incidence using a variety of data sources and methods, little work has been done to understand the relative contribution of the different data sources to improved prediction. Additionally, scholarship on the topic had initially focused on prediction systems at the national- and state-levels, and much remains to be done at the finer spatial resolutions at which health policy interventions often occur. We develop a methodological framework to assess and compare dengue incidence estimates at the city level, and evaluate the performance of a collection of models on 20 different cities in Brazil. The data sources we use towards this end are weekly incidence counts from prior years (seasonal autoregressive terms), weekly-aggregated weather variables, and real-time internet search data. We find that both random forest-based models and LASSO regression-based models effectively leverage these multiple data sources to produce accurate predictions, and that while the performance between them is comparable on average, the former method produces fewer extreme outliers, and can thus be considered more robust. For real-time predictions that assume long delays (6–8 weeks) in the availability of epidemiological data, we find that real-time internet search data are the strongest predictors of dengue incidence, whereas for predictions that assume short delays (1–3 weeks), in which the error rate is halved (as measured by relative RMSE), short-term and seasonal autocorrelation are the dominant predictors. Despite the difficulties inherent to city-level prediction, our framework achieves meaningful and actionable estimates across cities with different demographic, geographic and epidemic characteristics. As the incidence of infectious diseases like dengue continues to increase throughout the world, tracking their spread in real time poses a significant challenge to local and national health authorities. Accurate incidence data are often difficult to obtain as outbreaks emerge and unfold, both due the partial reach of serological surveillance (especially in rural areas), and due to delays in reporting, which result in post-hoc adjustments to what should have been real-time data. Thus, a range of ‘nowcasting’ tools have been developed to estimate disease trends, using different mathematical and statistical methodologies to fill the temporal data gap. Over the past several years, researchers have investigated how to best incorporate internet search data into predictive models, since these can be obtained in real-time. Still, most such models have been regression-based, and have tended to underperform in cases when epidemiological data are only available after long reporting delays. Moreover, in tropical countries, attention has increasingly turned from testing and applying models at the national level to models at higher spatial resolutions, such as states and cities. Here, we develop machine learning models based on both LASSO regression and on random forest ensembles, and proceed to apply and compare them across 20 cities in Brazil. We find that our methodology produces meaningful and actionable disease estimates at the city level with both underlying model classes, and that the two perform comparably across most metrics, although the ensemble method produces fewer outliers. We also compare model performance and the relative contribution of different data sources across diverse geographic, demographic and epidemic conditions.
Collapse
Affiliation(s)
- Gal Koplewitz
- Harvard J. A. Paulson School of Engineering and Applied Sciences, Cambridge, Massachusetts, United States of America
- Harvard T. H. Chan School of Public Health, Boston, Massachusetts, United States of America
- * E-mail: (GK); (MS)
| | - Fred Lu
- Computational Health Informatics Program, Boston Children’s Hospital, Boston, Massachusetts, United States of America
- Department of Statistics, Stanford University, California, United States of America
| | - Leonardo Clemente
- Harvard T. H. Chan School of Public Health, Boston, Massachusetts, United States of America
| | - Caroline Buckee
- Harvard T. H. Chan School of Public Health, Boston, Massachusetts, United States of America
| | - Mauricio Santillana
- Computational Health Informatics Program, Boston Children’s Hospital, Boston, Massachusetts, United States of America
- Department of Pediatrics, Harvard Medical School, Boston, Massachusetts, United States of America
- * E-mail: (GK); (MS)
| |
Collapse
|
20
|
Ponte C, Carmona HA, Oliveira EA, Caminha C, Lima AS, Andrade JS, Furtado V. Tracing contacts to evaluate the transmission of COVID-19 from highly exposed individuals in public transportation. Sci Rep 2021; 11:24443. [PMID: 34961776 PMCID: PMC8712527 DOI: 10.1038/s41598-021-03998-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2021] [Accepted: 12/06/2021] [Indexed: 11/18/2022] Open
Abstract
We investigate, through a data-driven contact tracing model, the transmission of COVID-19 inside buses during distinct phases of the pandemic in a large Brazilian city. From this microscopic approach, we recover the networks of close contacts within consecutive time windows. A longitudinal comparison is then performed by upscaling the traced contacts with the transmission computed from a mean-field compartmental model for the entire city. Our results show that the effective reproduction numbers inside the buses, [Formula: see text], and in the city, [Formula: see text], followed a compatible behavior during the first wave of the local outbreak. Moreover, by distinguishing the close contacts of healthcare workers in the buses, we discovered that their transmission, [Formula: see text], during the same period, was systematically higher than [Formula: see text]. This result reinforces the need for special public transportation policies for highly exposed groups of people.
Collapse
Affiliation(s)
- Caio Ponte
- Programa de Pós Graduação em Informática Aplicada, Universidade de Fortaleza, Fortaleza, Ceará, 60811-905, Brasil
| | - Humberto A Carmona
- Departamento de Física, Universidade Federal do Ceará, Fortaleza, Ceará, 60455-760, Brasil
| | - Erneson A Oliveira
- Programa de Pós Graduação em Informática Aplicada, Universidade de Fortaleza, Fortaleza, Ceará, 60811-905, Brasil.
- Laboratório de Ciência de Dados e Inteligência Artificial, Universidade de Fortaleza, Fortaleza, Ceará, 60811-905, Brasil.
- Mestrado Profissional em Ciências da Cidade, Universidade de Fortaleza, Fortaleza, Ceará, 60811-905, Brasil.
| | - Carlos Caminha
- Programa de Pós Graduação em Informática Aplicada, Universidade de Fortaleza, Fortaleza, Ceará, 60811-905, Brasil
| | - Antonio S Lima
- Célula de Vigilância Epidemiológica, Secretaria Municipal da Saúde, Fortaleza, Ceará, 60810-670, Brasil
- Centro de Ciências da Saúde, Universidade de Fortaleza, Fortaleza, Ceará, 60811-905, Brasil
| | - José S Andrade
- Departamento de Física, Universidade Federal do Ceará, Fortaleza, Ceará, 60455-760, Brasil
| | - Vasco Furtado
- Programa de Pós Graduação em Informática Aplicada, Universidade de Fortaleza, Fortaleza, Ceará, 60811-905, Brasil
- Empresa de Tecnologia da Informação do Ceará, Governo do Estado do Ceará, Fortaleza, Ceará, 60130-240, Brasil
| |
Collapse
|
21
|
Temperature, traveling, slums, and housing drive dengue transmission in a non-endemic metropolis. PLoS Negl Trop Dis 2021; 15:e0009465. [PMID: 34115753 PMCID: PMC8221794 DOI: 10.1371/journal.pntd.0009465] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2020] [Revised: 06/23/2021] [Accepted: 05/10/2021] [Indexed: 11/24/2022] Open
Abstract
Dengue is steadily increasing worldwide and expanding into higher latitudes. Current non-endemic areas are prone to become endemic soon. To improve understanding of dengue transmission in these settings, we assessed the spatiotemporal dynamics of the hitherto largest outbreak in the non-endemic metropolis of Buenos Aires, Argentina, based on detailed information on the 5,104 georeferenced cases registered during summer-autumn of 2016. The highly seasonal dengue transmission in Buenos Aires was modulated by temperature and triggered by imported cases coming from regions with ongoing outbreaks. However, local transmission was made possible and consolidated heterogeneously in the city due to housing and socioeconomic characteristics of the population, with 32.8% of autochthonous cases occurring in slums, which held only 6.4% of the city population. A hierarchical spatiotemporal model accounting for imperfect detection of cases showed that, outside slums, less-affluent neighborhoods of houses (vs. apartments) favored transmission. Global and local spatiotemporal point-pattern analyses demonstrated that most transmission occurred at or close to home. Additionally, based on these results, a point-pattern analysis was assessed for early identification of transmission foci during the outbreak while accounting for population spatial distribution. Altogether, our results reveal how social, physical, and biological processes shape dengue transmission in Buenos Aires and, likely, other non-endemic cities, and suggest multiple opportunities for control interventions. Dengue fever is mainly transmitted by a mosquito species that is highly urbanized, and lays eggs and develops mostly in artificial water containers. Dengue transmission is sustained year-round in most tropical regions of the world, but in many subtropical/temperate regions it occurs only in the warmest months. To improve understanding of dengue transmission in these regions, we analyzed one of the largest outbreaks in Buenos Aires city, a subtropical metropolis. Based on information on 5,104 georeferenced cases during summer-autumn 2016, we found that most transmission occurred in or near home, that slums had the highest risk of transmission, and that, outside slums, less-affluent neighborhoods of houses (vs. apartments) favored transmission. We showed that the cumulative effects of temperature over the previous few weeks set the temporal limits for transmission to occur, and that the outbreak was sparked by infected people arriving from regions with ongoing outbreaks. Additionally, we implemented a statistical method to identify transmission foci in real-time that improves targeting control interventions. Our results deepen the understanding of dengue transmission as a result of social, physical, and biological processes, and pose multiple opportunities for improving control of dengue and other mosquito-borne viruses such as Zika, chikungunya, and yellow fever.
Collapse
|
22
|
Lee SA, Jarvis CI, Edmunds WJ, Economou T, Lowe R. Spatial connectivity in mosquito-borne disease models: a systematic review of methods and assumptions. J R Soc Interface 2021; 18:20210096. [PMID: 34034534 PMCID: PMC8150046 DOI: 10.1098/rsif.2021.0096] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2021] [Accepted: 04/26/2021] [Indexed: 12/14/2022] Open
Abstract
Spatial connectivity plays an important role in mosquito-borne disease transmission. Connectivity can arise for many reasons, including shared environments, vector ecology and human movement. This systematic review synthesizes the spatial methods used to model mosquito-borne diseases, their spatial connectivity assumptions and the data used to inform spatial model components. We identified 248 papers eligible for inclusion. Most used statistical models (84.2%), although mechanistic are increasingly used. We identified 17 spatial models which used one of four methods (spatial covariates, local regression, random effects/fields and movement matrices). Over 80% of studies assumed that connectivity was distance-based despite this approach ignoring distant connections and potentially oversimplifying the process of transmission. Studies were more likely to assume connectivity was driven by human movement if the disease was transmitted by an Aedes mosquito. Connectivity arising from human movement was more commonly assumed in studies using a mechanistic model, likely influenced by a lack of statistical models able to account for these connections. Although models have been increasing in complexity, it is important to select the most appropriate, parsimonious model available based on the research question, disease transmission process, the spatial scale and availability of data, and the way spatial connectivity is assumed to occur.
Collapse
Affiliation(s)
- Sophie A. Lee
- Centre for Mathematical Modelling of Infectious Diseases, London School of Hygiene & Tropical Medicine, London, UK
- Centre on Climate Change and Planetary Health, London School of Hygiene & Tropical Medicine, London, UK
- Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK
| | - Christopher I. Jarvis
- Centre for Mathematical Modelling of Infectious Diseases, London School of Hygiene & Tropical Medicine, London, UK
- Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK
| | - W. John Edmunds
- Centre for Mathematical Modelling of Infectious Diseases, London School of Hygiene & Tropical Medicine, London, UK
- Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK
| | | | - Rachel Lowe
- Centre for Mathematical Modelling of Infectious Diseases, London School of Hygiene & Tropical Medicine, London, UK
- Centre on Climate Change and Planetary Health, London School of Hygiene & Tropical Medicine, London, UK
- Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK
| |
Collapse
|
23
|
Thiruchelvam L, Dass SC, Asirvadam VS, Daud H, Gill BS. Determine neighboring region spatial effect on dengue cases using ensemble ARIMA models. Sci Rep 2021; 11:5873. [PMID: 33712664 PMCID: PMC7955078 DOI: 10.1038/s41598-021-84176-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2020] [Accepted: 02/11/2021] [Indexed: 01/06/2023] Open
Abstract
The state of Selangor, in Malaysia consist of urban and peri-urban centres with good transportation system, and suitable temperature levels with high precipitations and humidity which make the state ideal for high number of dengue cases, annually. This study investigates if districts within the Selangor state do influence each other in determining pattern of dengue cases. Study compares two different models; the Autoregressive Integrated Moving Average (ARIMA) and Ensemble ARIMA models, using the Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC) measurement to gauge their performance tools. ARIMA model is developed using the epidemiological data of dengue cases, whereas ensemble ARIMA incorporates the neighbouring regions' dengue models as the exogenous variable (X), into traditional ARIMA model. Ensemble ARIMA models have better model fit compared to the basic ARIMA models by incorporating neighbuoring effects of seven districts which made of state of Selangor. The AIC and BIC values of ensemble ARIMA models to be smaller compared to traditional ARIMA counterpart models. Thus, study concludes that pattern of dengue cases for a district is subject to spatial effects of its neighbouring districts and number of dengue cases in the surrounding areas.
Collapse
Affiliation(s)
- Loshini Thiruchelvam
- Insititute of Autonomous Systems, Universiti Teknologi PETRONAS, Seri Iskandar, Perak, Malaysia
| | - Sarat Chandra Dass
- School of Mathematical and Computer Sciences, Heriot-Watt University Malaysia, Putrajaya, Malaysia
| | - Vijanth Sagayan Asirvadam
- Department of Electric and Electronic Engineering, Universiti Teknologi PETRONAS, Seri Iskandar, Perak, Malaysia.
| | - Hanita Daud
- Department of Fundamental and Applied Sciences, Universiti Teknologi PETRONAS, Seri Iskandar, Perak, Malaysia
| | - Balvinder Singh Gill
- Institute for Medical Research (IMR), Ministry of Health, Kuala Lumpur, Malaysia
| |
Collapse
|
24
|
Pei S, Teng X, Lewis P, Shaman J. Optimizing respiratory virus surveillance networks using uncertainty propagation. Nat Commun 2021; 12:222. [PMID: 33431854 PMCID: PMC7801666 DOI: 10.1038/s41467-020-20399-3] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2019] [Accepted: 12/01/2020] [Indexed: 02/07/2023] Open
Abstract
Infectious disease prevention, control and forecasting rely on sentinel observations; however, many locations lack the capacity for routine surveillance. Here we show that, by using data from multiple sites collectively, accurate estimation and forecasting of respiratory diseases for locations without surveillance is feasible. We develop a framework to optimize surveillance sites that suppresses uncertainty propagation in a networked disease transmission model. Using influenza outbreaks from 35 US states, the optimized system generates better near-term predictions than alternate systems designed using population and human mobility. We also find that monitoring regional population centers serves as a reasonable proxy for the optimized network and could direct surveillance for diseases with limited records. The proxy method is validated using model simulations for 3,108 US counties and historical data for two other respiratory pathogens - human metapneumovirus and seasonal coronavirus - from 35 US states and can be used to guide systemic allocation of surveillance efforts.
Collapse
Affiliation(s)
- Sen Pei
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, 10032, USA.
| | - Xian Teng
- School of Computing and Information, University of Pittsburgh, Pittsburgh, PA, 15260, USA
| | - Paul Lewis
- Integrated Biosurveillance Section, Armed Forces Health Surveillance Branch, Silver Spring, MD, 20904, USA
| | - Jeffrey Shaman
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, 10032, USA.
| |
Collapse
|