1
|
Zhang X, Liu H, Sun M, Feng S. Multimodal approach to public health interventions using EGG and mobile health technologies. Front Public Health 2025; 12:1520343. [PMID: 39911780 PMCID: PMC11794255 DOI: 10.3389/fpubh.2024.1520343] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2024] [Accepted: 12/09/2024] [Indexed: 02/07/2025] Open
Abstract
Introduction Public health interventions increasingly integrate multimodal data sources, such as Electroencephalogram (EEG) data, to enhance monitoring and predictive capabilities for mental health conditions. However, traditional models often face challenges with the complexity and high dimensionality of EEG signals. While recent advancements like Contrastive Language-lmage Pre-training(CLIP) models excel in cross-modal understanding, their application to EEG-based tasks remains limited due to the unique characteristics of EEG data. Methods In response, we introduce PH-CLIP (Public Health Contrastive Language-lmage Pretraining), a novel framework that combines CLIP's representational power with a multi-scale fusion mechanism designed specifically for EEG data within mobile health technologies. PH-CLIP employs hierarchical feature extraction to capture the temporal dynamics of EEG signals, aligning them with contextually relevant textual descriptions for improved public health insights. Through a multi-scale fusion layer, PH-CLIP enhances interpretability and robustness in EEG embeddings, thereby supporting more accurate and scalable interventions across diverse public health applications. Results and discussion Experimental results indicate that PH-CLIP achieves significant improvements in EEG classification accuracy and mental health prediction efficiency compared to leading EEG analysis models. This framework positions PH-CLIP as a transformative tool in public health monitoring, with the potential to advance large-scale mental health interventions through integrative mobile health technologies.
Collapse
Affiliation(s)
- Xiao Zhang
- School of Physical Education Institute, Yunnan Minzu University, Kunming, Yunnan, China
| | - Han Liu
- School of Physical Education Institute, Yunnan Minzu University, Kunming, Yunnan, China
| | - Mingyang Sun
- The Catholic University of Korea, Seoul, Republic of Korea
| | - Shuangyi Feng
- The Catholic University of Korea, Seoul, Republic of Korea
| |
Collapse
|
2
|
Tuan DA, Dang TN. Leveraging Climate Data for Dengue Forecasting in Ba Ria Vung Tau Province, Vietnam: An Advanced Machine Learning Approach. Trop Med Infect Dis 2024; 9:250. [PMID: 39453277 PMCID: PMC11511084 DOI: 10.3390/tropicalmed9100250] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2024] [Revised: 10/10/2024] [Accepted: 10/17/2024] [Indexed: 10/26/2024] Open
Abstract
Dengue fever is a persistent public health issue in tropical regions, including Vietnam, where climate variability plays a crucial role in disease transmission dynamics. This study focuses on developing climate-based machine learning models to forecast dengue outbreaks in Ba Ria Vung Tau (BRVT) province, Vietnam, using meteorological data from 2003 to 2022. We utilized four predictive models-Negative Binomial Regression (NBR), Seasonal AutoRegressive Integrated Moving Average with Exogenous Regressors (SARIMAX), Extreme Gradient Boosting (XGBoost) v2.0.3, and long short-term memory (LSTM)-to predict weekly dengue incidence. Key climate variables, including temperature, humidity, precipitation, and wind speed, were integrated into these models, with lagged variables included to capture delayed climatic effects on dengue transmission. The NBR model demonstrated the best performance in terms of predictive accuracy, achieving the lowest Mean Absolute Error (MAE), compared to other models. The inclusion of lagged climate variables significantly enhanced the model's ability to predict dengue cases. Although effective in capturing seasonal trends, SARIMAX and LSTM models struggled with overfitting and failed to accurately predict short-term outbreaks. XGBoost exhibited moderate predictive power but was sensitive to overfitting, particularly without fine-tuning. Our findings confirm that climate-based machine learning models, particularly the NBR model, offer valuable tools for forecasting dengue outbreaks in BRVT. However, improving the models' ability to predict short-term peaks remains a challenge. The integration of meteorological data into early warning systems is crucial for public health authorities to plan timely and effective interventions. This research contributes to the growing body of literature on climate-based disease forecasting and underscores the need for further model refinement to address the complexities of dengue transmission in highly endemic regions.
Collapse
Affiliation(s)
| | - Tran Ngoc Dang
- Faculty of Public Health, University of Medicine and Pharmacy at Ho Chi Minh City, Ho Chi Minh City 70000, Vietnam;
| |
Collapse
|
3
|
Francisco ME, Carvajal TM, Watanabe K. Hybrid Machine Learning Approach to Zero-Inflated Data Improves Accuracy of Dengue Prediction. PLoS Negl Trop Dis 2024; 18:e0012599. [PMID: 39432557 PMCID: PMC11527386 DOI: 10.1371/journal.pntd.0012599] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Revised: 10/31/2024] [Accepted: 10/01/2024] [Indexed: 10/23/2024] Open
Abstract
BACKGROUND Spatiotemporal dengue forecasting using machine learning (ML) can contribute to the development of prevention and control strategies for impending dengue outbreaks. However, training data for dengue incidence may be inflated with frequent zero values because of the rarity of cases, which lowers the prediction accuracy. This study aimed to understand the influence of spatiotemporal resolutions of data on the accuracy of dengue incidence prediction using ML models, to understand how the influence of spatiotemporal resolution differs between quantitative and qualitative predictions of dengue incidence, and to improve the accuracy of dengue incidence prediction with zero-inflated data. METHODOLOGY We predicted dengue incidence at six spatiotemporal resolutions and compared their prediction accuracy. Six ML algorithms were compared: generalized additive models, random forests, conditional inference forest, artificial neural networks, support vector machines and regression, and extreme gradient boosting. Data from 2009 to 2012 were used for training, and data from 2013 were used for model validation with quantitative and qualitative dengue variables. To address the inaccuracy in the quantitative prediction of dengue incidence due to zero-inflated data at fine spatiotemporal scales, we developed a hybrid approach in which the second-stage quantitative prediction is performed only when/where the first-stage qualitative model predicts the occurrence of dengue cases. PRINCIPAL FINDINGS At higher resolutions, the dengue incidence data were zero-inflated, which was insufficient for quantitative pattern extraction of relationships between dengue incidence and environmental variables by ML. Qualitative models, used as binary variables, eased the effect of data distribution. Our novel hybrid approach of combining qualitative and quantitative predictions demonstrated high potential for predicting zero-inflated or rare phenomena, such as dengue. SIGNIFICANCE Our research contributes valuable insights to the field of spatiotemporal dengue prediction and provides an alternative solution to enhance prediction accuracy in zero-inflated data where hurdle or zero-inflated models cannot be applied.
Collapse
Affiliation(s)
- Micanaldo Ernesto Francisco
- Center for Marine Environmental Studies (CMES), Ehime University, Matsuyama, Japan
- Graduate School of Science and Engineering, Ehime University, Matsuyama, Ehime, Japan
- Faculty of Architecture and Physical Planning (FAPF), Lurio University, Nampula, Mozambique
| | - Thaddeus M. Carvajal
- Center for Marine Environmental Studies (CMES), Ehime University, Matsuyama, Japan
- Department of Biology De La Salle University, Taft Ave Manila, Philippines
| | - Kozo Watanabe
- Center for Marine Environmental Studies (CMES), Ehime University, Matsuyama, Japan
| |
Collapse
|
4
|
El Morr C, Ozdemir D, Asdaah Y, Saab A, El-Lahib Y, Sokhn ES. AI-based epidemic and pandemic early warning systems: A systematic scoping review. Health Informatics J 2024; 30:14604582241275844. [PMID: 39172555 DOI: 10.1177/14604582241275844] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/24/2024]
Abstract
Background: Timely detection of disease outbreaks is critical in public health. Artificial Intelligence (AI) can identify patterns in data that signal the onset of epidemics and pandemics. This scoping review examines the effectiveness of AI in epidemic and pandemic early warning systems (EWS). Objective: To assess the capability of AI-based systems in predicting epidemics and pandemics and to identify challenges and strategies for improvement. Methods: A systematic scoping review was conducted. The review included studies from the last 5 years, focusing on AI and machine learning applications in EWS. After screening 1087 articles, 33 were selected for thematic analysis. Results: The review found that AI-based EWS have been effectively implemented in various contexts, using a range of algorithms. Key challenges identified include data quality, model explainability, bias, data volume, velocity, variety, availability, and granularity. Strategies for mitigating AI bias and improving system adaptability were also discussed. Conclusion: AI has shown promise in enhancing the speed and accuracy of epidemic detection. However, challenges related to data quality, bias, and model transparency need to be addressed to improve the reliability and generalizability of AI-based EWS. Continuous monitoring and improvement, as well as incorporating social and environmental data, are essential for future development.
Collapse
Affiliation(s)
- Christo El Morr
- School of Health Policy and Management, York University, Toronto, ON, Canada
| | - Deniz Ozdemir
- Department of Psychology, York University, Toronto, ON, Canada
| | - Yasmeen Asdaah
- School of Health Policy and Management, York University, Toronto, ON, Canada
| | - Antoine Saab
- Quality and Safety Department, Lebanese Hospital Geitaoui-UMC, Beirut, Lebanon
| | - Yahya El-Lahib
- Faculty of Social Work, University of Calgary, Calgary, AB, Canada
| | - Elie Salem Sokhn
- Laboratory Department, Lebanese Hospital-Geitaoui University Medical Center, Beirut, Lebanon
- Molecular testing Laboratory, Medical Laboratory Department, Faculty of Health Sciences, Beirut Arab University, Beirut, Lebanon
| |
Collapse
|
5
|
Fang L, Hu W, Pan G. Meteorological factors cannot be ignored in machine learning-based methods for predicting dengue, a systematic review. INTERNATIONAL JOURNAL OF BIOMETEOROLOGY 2024; 68:401-410. [PMID: 38150020 DOI: 10.1007/s00484-023-02605-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Revised: 09/18/2023] [Accepted: 12/13/2023] [Indexed: 12/28/2023]
Abstract
In recent years, there has been a rapid increase in the application of machine learning methods about predicting the incidence of dengue fever. However, the predictive factors and models employed in different studies vary greatly. Hence, we conducted a systematic review to summarize machine learning methods and predictors in previous studies. We searched PubMed, ScienceDirect, and Web of Science databases for articles published up to July 2023. The selected papers included not only the forecast of dengue incidence but also machine learning methods. A total of 23 papers were included in this study. Predictive factors included meteorological factors (22, 95.7%), historical dengue data (14, 60.9%), environmental factors (4, 17.4%), socioeconomic factors (4, 17.4%), vector surveillance data (2, 8.7%), and internet search data (3, 13.0%). Among meteorological factors, temperature (20, 87.0%), rainfall (20, 87.0%), and relative humidity (14, 60.9%) were the most commonly used. We found that Support Vector Machine (SVM) (6, 26.1%), Long Short-Term Memory (LSTM) (5, 21.7%), Random Forest (RF) (4, 17.4%), Least Absolute Shrinkage and Selection Operator (LASSO) (2, 8.7%), ensemble model (2, 8.7%), and other models (4, 17.4%) were identified as the best models based on evaluation metrics used in each article. These results indicate that meteorological factors are important predictors that cannot be ignored and SVM and LSTM algorithms are the most commonly used models in dengue fever prediction with good predictive performance. This review will contribute to the development of more robust early dengue warning systems and promote the application of machine learning methods in predicting climate-related infectious diseases.
Collapse
Affiliation(s)
- Lanlan Fang
- Department of Epidemiology and Biostatistics, School of Public Health, Anhui Medical University, 81 Meishan Road, Hefei, 230032, Anhui, China
| | - Wan Hu
- Department of Epidemiology and Biostatistics, School of Public Health, Anhui Medical University, 81 Meishan Road, Hefei, 230032, Anhui, China
| | - Guixia Pan
- Department of Epidemiology and Biostatistics, School of Public Health, Anhui Medical University, 81 Meishan Road, Hefei, 230032, Anhui, China.
- The Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Anhui Medical University, Hefei, China.
| |
Collapse
|
6
|
Sebastianelli A, Spiller D, Carmo R, Wheeler J, Nowakowski A, Jacobson LV, Kim D, Barlevi H, Cordero ZER, Colón-González FJ, Lowe R, Ullo SL, Schneider R. A reproducible ensemble machine learning approach to forecast dengue outbreaks. Sci Rep 2024; 14:3807. [PMID: 38360915 PMCID: PMC10869339 DOI: 10.1038/s41598-024-52796-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Accepted: 01/23/2024] [Indexed: 02/17/2024] Open
Abstract
Dengue fever, a prevalent and rapidly spreading arboviral disease, poses substantial public health and economic challenges in tropical and sub-tropical regions worldwide. Predicting infectious disease outbreaks on a countrywide scale is complex due to spatiotemporal variations in dengue incidence across administrative areas. To address this, we propose a machine learning ensemble model for forecasting the dengue incidence rate (DIR) in Brazil, with a focus on the population under 19 years old. The model integrates spatial and temporal information, providing one-month-ahead DIR estimates at the state level. Comparative analyses with a dummy model and ablation studies demonstrate the ensemble model's qualitative and quantitative efficacy across the 27 Brazilian Federal Units. Furthermore, we showcase the transferability of this approach to Peru, another Latin American country with differing epidemiological characteristics. This timely forecast system can aid local governments in implementing targeted control measures. The study advances climate services for health by identifying factors triggering dengue outbreaks in Brazil and Peru, emphasizing collaborative efforts with intergovernmental organizations and public health institutions. The innovation lies not only in the algorithms themselves but in their application to a domain marked by data scarcity and operational scalability challenges. We bridge the gap by integrating well-curated ground data with advanced analytical methods, addressing a significant deficiency in current practices. The successful transfer of the model to Peru and its consistent performance during the 2019 outbreak in Brazil showcase its scalability and practical application. While acknowledging limitations in handling extreme values, especially in regions with low DIR, our approach excels where accurate predictions are critical. The study not only contributes to advancing DIR forecasting but also represents a paradigm shift in integrating advanced analytics into public health operational frameworks. This work, driven by a collaborative spirit involving intergovernmental organizations and public health institutions, sets a precedent for interdisciplinary collaboration in addressing global health challenges. It not only enhances our understanding of factors triggering dengue outbreaks but also serves as a template for the effective implementation of advanced analytical methods in public health.
Collapse
Affiliation(s)
- Alessandro Sebastianelli
- Engineering Department, University of Sannio, Benevento, Italy.
- European Space Agency, Φ-lab, Frascati, Italy.
| | - Dario Spiller
- School of Aerospace Engineering, Sapienza University of Rome, Rome, Italy
| | | | | | - Artur Nowakowski
- Faculty of Geodesy and Cartography, Warsaw University of Technology, Warsaw, Poland
| | | | | | | | | | - Felipe J Colón-González
- Wellcome Trust, Data for Science and Health, London, UK
- Centre on Climate Change and Planetary Health and Centre for Mathematical Modelling of Infectious Diseases, London School of Hygiene and Tropical Medicine, London, UK
- Tyndall Centre for Climate Change Research, School of Environmental Sciences, University of East Anglia, Norwich, UK
| | - Rachel Lowe
- Centre on Climate Change and Planetary Health and Centre for Mathematical Modelling of Infectious Diseases, London School of Hygiene and Tropical Medicine, London, UK
- Barcelona Supercomputing Center (BSC), Barcelona, Spain
- Catalan Institution for Research and Advanced Studies (ICREA), Barcelona, Spain
| | | | | |
Collapse
|
7
|
Yu Z, Wang J, Tan Z, Luo Y. Impact of climate change on SARS-CoV-2 epidemic in China. PLoS One 2023; 18:e0285179. [PMID: 37498956 PMCID: PMC10374073 DOI: 10.1371/journal.pone.0285179] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2023] [Accepted: 04/16/2023] [Indexed: 07/29/2023] Open
Abstract
The outbreak and prevalence of SARS-CoV-2 have severely affected social security. Physical isolation is an effective control that affects the short-term human-to-human transmission of the epidemic, although weather presents a long-term effect. Understanding the effect of weather on the outbreak allow it to be contained at the earliest possible. China is selected as the study area, and six weather factors that receive the most attention from January 20, 2020 to April 30, 2020 are selected to investigate the correlation between weather and SARS-CoV-2 to provide a theoretical basis for long-term epidemic prevention and control. The results show that (1) the average growth rate (GR) of SARS-CoV-2 in each province is logarithmically distributed with a mean value of 5.15%. The GR of the southeastern region is higher than that of the northwestern region, which is consistent with the Hu Line. (2) The specific humidity, 2-m temperature (T), ultraviolet (UV) radiation, and wind speed (WS) adversely affect the GR. By contrast, the total precipitation (TP) and surface pressure (SP) promote the GR. (3) For every 1 unit increase in UV radiation, the GR decreases by 0.30% in 11 days, and the UV radiation in China is higher than that worldwide (0.92% higher per day). Higher population aggregation and urbanization directly affect the epidemic, and weather is an indirect factor.
Collapse
Affiliation(s)
- Zhenyu Yu
- School of Geography and Remote Sensing, Guangzhou University, Guangdong, China
- Innovation Center for Remote Sensing Big Data Intelligent Applications, Guangzhou University, Guangdong, China
| | - Jinnian Wang
- School of Geography and Remote Sensing, Guangzhou University, Guangdong, China
- Innovation Center for Remote Sensing Big Data Intelligent Applications, Guangzhou University, Guangdong, China
| | - Zixuan Tan
- School of Geography and Remote Sensing, Guangzhou University, Guangdong, China
- Innovation Center for Remote Sensing Big Data Intelligent Applications, Guangzhou University, Guangdong, China
| | - Yiyun Luo
- School of Geography and Remote Sensing, Guangzhou University, Guangdong, China
- Innovation Center for Remote Sensing Big Data Intelligent Applications, Guangzhou University, Guangdong, China
| |
Collapse
|
8
|
Leung XY, Islam RM, Adhami M, Ilic D, McDonald L, Palawaththa S, Diug B, Munshi SU, Karim MN. A systematic review of dengue outbreak prediction models: Current scenario and future directions. PLoS Negl Trop Dis 2023; 17:e0010631. [PMID: 36780568 PMCID: PMC9956653 DOI: 10.1371/journal.pntd.0010631] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2022] [Revised: 02/24/2023] [Accepted: 01/29/2023] [Indexed: 02/15/2023] Open
Abstract
Dengue is among the fastest-spreading vector-borne infectious disease, with outbreaks often overwhelm the health system and result in huge morbidity and mortality in its endemic populations in the absence of an efficient warning system. A large number of prediction models are currently in use globally. As such, this study aimed to systematically review the published literature that used quantitative models to predict dengue outbreaks and provide insights about the current practices. A systematic search was undertaken, using the Ovid MEDLINE, EMBASE, Scopus and Web of Science databases for published citations, without time or geographical restrictions. Study selection, data extraction and management process were devised in accordance with the 'Checklist for Critical Appraisal and Data Extraction for Systematic Reviews of Prediction Modelling Studies' ('CHARMS') framework. A total of 99 models were included in the review from 64 studies. Most models sourced climate (94.7%) and climate change (77.8%) data from agency reports and only 59.6% of the models adjusted for reporting time lag. All included models used climate predictors; 70.7% of them were built with only climate factors. Climate factors were used in combination with climate change factors (13.4%), both climate change and demographic factors (3.1%), vector factors (6.3%), and demographic factors (5.2%). Machine learning techniques were used for 39.4% of the models. Of these, random forest (15.4%), neural networks (23.1%) and ensemble models (10.3%) were notable. Among the statistical (60.6%) models, linear regression (18.3%), Poisson regression (18.3%), generalized additive models (16.7%) and time series/autoregressive models (26.7%) were notable. Around 20.2% of the models reported no validation at all and only 5.2% reported external validation. The reporting of methodology and model performance measures were inadequate in many of the existing prediction models. This review collates plausible predictors and methodological approaches, which will contribute to robust modelling in diverse settings and populations.
Collapse
Affiliation(s)
- Xing Yu Leung
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Victoria, Australia
| | - Rakibul M. Islam
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Victoria, Australia
| | - Mohammadmehdi Adhami
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Victoria, Australia
| | - Dragan Ilic
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Victoria, Australia
| | - Lara McDonald
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Victoria, Australia
| | - Shanika Palawaththa
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Victoria, Australia
| | - Basia Diug
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Victoria, Australia
| | - Saif U. Munshi
- Department of Virology, Bangabandhu Sheikh Mujib Medical University, Dhaka, Bangladesh
| | - Md Nazmul Karim
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Victoria, Australia
- * E-mail:
| |
Collapse
|
9
|
Li Z. Forecasting Weekly Dengue Cases by Integrating Google Earth Engine-Based Risk Predictor Generation and Google Colab-Based Deep Learning Modeling in Fortaleza and the Federal District, Brazil. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022; 19:13555. [PMID: 36294134 PMCID: PMC9603269 DOI: 10.3390/ijerph192013555] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/02/2022] [Revised: 10/15/2022] [Accepted: 10/18/2022] [Indexed: 06/16/2023]
Abstract
Efficient and accurate dengue risk prediction is an important basis for dengue prevention and control, which faces challenges, such as downloading and processing multi-source data to generate risk predictors and consuming significant time and computational resources to train and validate models locally. In this context, this study proposed a framework for dengue risk prediction by integrating big geospatial data cloud computing based on Google Earth Engine (GEE) platform and artificial intelligence modeling on the Google Colab platform. It enables defining the epidemiological calendar, delineating the predominant area of dengue transmission in cities, generating the data of risk predictors, and defining multi-date ahead prediction scenarios. We implemented the experiments based on weekly dengue cases during 2013-2020 in the Federal District and Fortaleza, Brazil to evaluate the performance of the proposed framework. Four predictors were considered, including total rainfall (Rsum), mean temperature (Tmean), mean relative humidity (RHmean), and mean normalized difference vegetation index (NDVImean). Three models (i.e., random forest (RF), long-short term memory (LSTM), and LSTM with attention mechanism (LSTM-ATT)), and two modeling scenarios (i.e., modeling with or without dengue cases) were set to implement 1- to 4-week ahead predictions. A total of 24 models were built, and the results showed in general that LSTM and LSTM-ATT models outperformed RF models; modeling could benefit from using historical dengue cases as one of the predictors, and it makes the predicted curve fluctuation more stable compared with that only using climate and environmental factors; attention mechanism could further improve the performance of LSTM models. This study provides implications for future dengue risk prediction in terms of the effectiveness of GEE-based big geospatial data processing for risk predictor generation and Google Colab-based risk modeling and presents the benefits of using historical dengue data as one of the input features and the attention mechanism for LSTM modeling.
Collapse
Affiliation(s)
- Zhichao Li
- Key Laboratory of Land Surface Pattern and Simulation, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China
| |
Collapse
|
10
|
How heterogeneous is the dengue transmission profile in Brazil? A study in six Brazilian states. PLoS Negl Trop Dis 2022; 16:e0010746. [PMID: 36095004 PMCID: PMC9499305 DOI: 10.1371/journal.pntd.0010746] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2022] [Revised: 09/22/2022] [Accepted: 08/17/2022] [Indexed: 11/19/2022] Open
Abstract
Dengue is a vector-borne disease present in most tropical countries, infecting an average of 50 to 100 million people per year. Socioeconomic, demographic, and environmental factors directly influence the transmission cycle of the dengue virus (DENV). In Brazil, these factors vary between regions producing different profiles of dengue transmission and challenging the epidemiological surveillance of the disease. In this article, we aimed at classifying the profiles of dengue transmission in 1,823 Brazilian municipalities, covering different climates, from 2010 to 2019. Time series data of dengue cases were obtained from six states: Ceará and Maranhão in the semiarid Northeast, Minas Gerais in the countryside, Espírito Santo and Rio de Janeiro in the tropical Atlantic coast, and Paraná in the subtropical region. To describe the time series, we proposed a set of epi-features of the magnitude and duration of the dengue epidemic cycles, totaling 13 indicators. Using these epi-features as inputs, a multivariate cluster algorithm was employed to classify the municipalities according to their dengue transmission profile. Municipalities were classified into four distinct dengue transmission profiles: persistent transmission (7.8%), epidemic (21.3%), episodic/epidemic (43.2%), and episodic transmission (27.6%). Different profiles were associated with the municipality’s population size and climate. Municipalities with higher incidence and larger populations tended to be classified as persistent transmission, suggesting the existence of critical community size. This association, however, varies depending on the state, indicating the importance of other factors. The proposed classification is useful for developing more specific and precise surveillance protocols for regions with different dengue transmission profiles, as well as more precise public policies for dengue prevention. Dengue is one of the fastest-growing vector-borne diseases in the world. Currently, vaccines are experimental and are not very effective, so prevention depends on the control of the mosquito Aedes aegypti. Health promotion campaigns aimed at encouraging people to reduce mosquito breeding sites have limited effect. In addition, the heterogeneity of the territories that have dengue becomes a major challenge for the epidemiological surveillance of the disease. Brazil has a territory of continental size, and single standardized surveillance is not very effective for monitoring this arbovirus. Classifying types of dengue dynamics based on features of the epidemiological cycle in each location has the potential to increase the precision of surveillance and control strategies. In our study, we were able to classify areas according to different dengue transmission profiles, ranging from episodic to persistent transmission. These results can provide tools to guide actions aimed at achieving the World Health Organization’s goals of eliminating neglected tropical diseases in countries that have the virus.
Collapse
|
11
|
Dey SK, Rahman MM, Howlader A, Siddiqi UR, Uddin KMM, Borhan R, Rahman EU. Prediction of dengue incidents using hospitalized patients, metrological and socio-economic data in Bangladesh: A machine learning approach. PLoS One 2022; 17:e0270933. [PMID: 35857776 PMCID: PMC9299345 DOI: 10.1371/journal.pone.0270933] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Accepted: 06/21/2022] [Indexed: 11/20/2022] Open
Abstract
Dengue fever is a severe disease spread by Aedes mosquito-borne dengue viruses (DENVs) in tropical areas such as Bangladesh. Since its breakout in the 1960s, dengue fever has been endemic in Bangladesh, with the highest concentration of infections in the capital, Dhaka. This study aims to develop a machine learning model that can use relevant information about the factors that cause Dengue outbreaks within a geographic region. To predict dengue cases in 11 different districts of Bangladesh, we created a DengueBD dataset and employed two machine learning algorithms, Multiple Linear Regression (MLR) and Support Vector Regression (SVR). This research also explores the correlation among environmental factors like temperature, rainfall, and humidity with the rise and decline trend of Dengue cases in different cities of Bangladesh. The entire dataset was divided into an 80:20 ratio, with 80 percent used for training and 20% used for testing. The research findings imply that, for both the MLR with 67% accuracy along with Mean Absolute Error (MAE) of 4.57 and SVR models with 75% accuracy along with Mean Absolute Error (MAE) of 4.95, the number of dengue cases reduces throughout the winter season in the country and increases mainly during the rainy season in the next ten months, from August 2021 to May 2022. Importantly, Dhaka, Bangladesh's capital, will see the maximum number of dengue patients during this period. Overall, the results of this data-driven analysis show that machine learning algorithms have enormous potential for predicting dengue epidemics.
Collapse
Affiliation(s)
- Samrat Kumar Dey
- School of Science and Technology (SST), Bangladesh Open University (BOU), Gazipur, Bangladesh
| | - Md. Mahbubur Rahman
- Department of Computer Science and Engineering (CSE), Military Institute of Science and Technology (MIST), Dhaka, Bangladesh
| | - Arpita Howlader
- Department of Computer and Communication Engineering (CCE), Patuakhali Science and Technology University (PSTU), Dumki, Patuakhali, Bangladesh
| | - Umme Raihan Siddiqi
- Department of Physiology, Shaheed Suhrawardy Medical College (ShSMC), Dhaka, Bangladesh
| | | | - Rownak Borhan
- Department of Computer Science and Engineering (CSE), Dhaka International University (DIU), Dhaka, Bangladesh
| | - Elias Ur Rahman
- Department of Computer Science and Engineering (CSE), Dhaka International University (DIU), Dhaka, Bangladesh
| |
Collapse
|
12
|
Li Z, Gurgel H, Xu L, Yang L, Dong J. Improving Dengue Forecasts by Using Geospatial Big Data Analysis in Google Earth Engine and the Historical Dengue Information-Aided Long Short Term Memory Modeling. BIOLOGY 2022; 11:biology11020169. [PMID: 35205036 PMCID: PMC8869738 DOI: 10.3390/biology11020169] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/03/2021] [Revised: 01/04/2022] [Accepted: 01/17/2022] [Indexed: 11/26/2022]
Abstract
Simple Summary Forecasting dengue cases often face challenges from (1) time-effectiveness due to time-consuming satellite data downloading and processing, (2) weak spatial representation due to data dependence on administrative unit-based statistics or weather station-based observations, and (3) stagnant accuracy without historical dengue cases. With the advance of the geospatial big data cloud computing in Google Earth Engine and deep learning, this study proposed an efficient framework of dengue prediction at an epidemiological week basis using geospatial big data analysis in Google Earth Engine and Long Short Term Memory modeling. We focused on the dengue epidemics in the Federal District of Brazil during 2007–2019. Based on Google Earth Engine and epidemiological calendar, we computed the weekly composite for each dengue driving factor, and spatially aggregated the pixel values into dengue transmission areas to generate the time series of driving factors. A multi-step-ahead Long Short Term Memory modeling was used, and the time-differenced natural log-transformed dengue cases and the time series of driving factors were considered as outcomes and explantary factors, respectively, with two modeling scenarios (with and without historical cases). The performance is better when historical cases were used, and the 5-weeks-ahead forecast has the best performance. Abstract Timely and accurate forecasts of dengue cases are of great importance for guiding disease prevention strategies, but still face challenges from (1) time-effectiveness due to time-consuming satellite data downloading and processing, (2) weak spatial representation capability due to data dependence on administrative unit-based statistics or weather station-based observations, and (3) stagnant accuracy without the application of historical case information. Geospatial big data, cloud computing platforms (e.g., Google Earth Engine, GEE), and emerging deep learning algorithms (e.g., long short term memory, LSTM) provide new opportunities for advancing these efforts. Here, we focused on the dengue epidemics in the urban agglomeration of the Federal District of Brazil (FDB) during 2007–2019. A new framework was proposed using geospatial big data analysis in the Google Earth Engine (GEE) platform and long short term memory (LSTM) modeling for dengue case forecasts over an epidemiological week basis. We first defined a buffer zone around an impervious area as the main area of dengue transmission by considering the impervious area as a human-dominated area and used the maximum distance of the flight range of Aedes aegypti and Aedes albopictus as a buffer distance. Those zones were used as units for further attribution analyses of dengue epidemics by aggregating the pixel values into the zones. The near weekly composite of potential driving factors was generated in GEE using the epidemiological weeks during 2007–2019, from the relevant geospatial data with daily or sub-daily temporal resolution. A multi-step-ahead LSTM model was used, and the time-differenced natural log-transformed dengue cases were used as outcomes. Two modeling scenarios (with and without historical dengue cases) were set to examine the potential of historical information on dengue forecasts. The results indicate that the performance was better when historical dengue cases were used and the 5-weeks-ahead forecast had the best performance, and the peak of a large outbreak in 2019 was accurately forecasted. The proposed framework in this study suggests the potential of the GEE platform, the LSTM algorithm, as well as historical information for dengue risk forecasting, which can easily be extensively applied to other regions or globally for timely and practical dengue forecasts.
Collapse
Affiliation(s)
- Zhichao Li
- Key Laboratory of Land Surface Pattern and Simulation, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China; (Z.L.); (L.Y.)
| | - Helen Gurgel
- Department of Geography, University of Brasilia (UnB), Brasilia 70910-900, Brazil;
| | - Lei Xu
- Vanke School of Public Health, Tsinghua University, Beijing 100084, China;
| | - Linsheng Yang
- Key Laboratory of Land Surface Pattern and Simulation, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China; (Z.L.); (L.Y.)
| | - Jinwei Dong
- Key Laboratory of Land Surface Pattern and Simulation, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China; (Z.L.); (L.Y.)
- Correspondence:
| |
Collapse
|