1
|
Chen S, Janies D, Paul R, Thill JC. Leveraging advances in data-driven deep learning methods for hybrid epidemic modeling. Epidemics 2024; 48:100782. [PMID: 38971085 DOI: 10.1016/j.epidem.2024.100782] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2023] [Revised: 04/18/2024] [Accepted: 06/18/2024] [Indexed: 07/08/2024] Open
Abstract
Mathematical modeling of epidemic dynamics is crucial to understand its underlying mechanisms, quantify important parameters, and make predictions that facilitate more informed decision-making. There are three major types of models: mechanistic models including the SEIR-type paradigm, alternative data-driven (DD) approaches, and hybrid models that combine mechanistic models with DD approaches. In this paper, we summarize our work in the COVID-19 Scenario Modeling Hub (SMH) for more than 12 rounds since early 2021 for informed decision support. We emphasize the importance of deep learning techniques for epidemic modeling via a flexible DD framework that substantially complements the mechanistic paradigm to evaluate various future epidemic scenarios. We start with a traditional curve-fitting approach to model cumulative COVID-19 based on the underlying SEIR-type mechanisms. Hospitalizations and deaths are modeled as binomial processes of cases and hospitalization, respectively. We further formulate two types of deep learning models based on multivariate long short term memory (LSTM) to address the challenges of more traditional DD models. The first LSTM is structurally similar to the curve fitting approach and assumes that hospitalizations and deaths are binomial processes of cases. Instead of using a predefined exponential curve, LSTM relies on the underlying data to identify the most appropriate functions, and is capable of capturing both long-term and short-term epidemic behaviors. We then relax the assumption of dependent inputs among cases, hospitalizations, and death. Another type of LSTM that handles all input time series as parallel signals, the independent multivariate LSTM, is developed. Independent multivariate LSTM can incorporate a wide range of data sources beyond traditional case-based epidemiological surveillance. The DD framework unleashes its potential in big data era with previously neglected heterogeneous surveillance data sources, such as syndromic, environment, genomic, serologic, infoveillance, and mobility data. DD approaches, especially LSTM, complement and integrate with the mechanistic modeling paradigm, provide a feasible alternative approach to model today's complex socio-epidemiological systems, and further leverage our ability to explore different scenarios for more informed decision-making during health emergencies.
Collapse
Affiliation(s)
- Shi Chen
- Department of Public Health Sciences, University of North Carolina at Charlotte, Charlotte, NC, United States; School of Data Science, University of North Carolina at Charlotte, Charlotte, NC, United States.
| | - Daniel Janies
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC, United States
| | - Rajib Paul
- Department of Public Health Sciences, University of North Carolina at Charlotte, Charlotte, NC, United States; School of Data Science, University of North Carolina at Charlotte, Charlotte, NC, United States
| | - Jean-Claude Thill
- School of Data Science, University of North Carolina at Charlotte, Charlotte, NC, United States; Department of Geography and Earth Sciences, University of North Carolina at Charlotte, Charlotte, NC, United States
| |
Collapse
|
2
|
Tariq MU, Ismail SB. Deep learning in public health: Comparative predictive models for COVID-19 case forecasting. PLoS One 2024; 19:e0294289. [PMID: 38483948 PMCID: PMC10939212 DOI: 10.1371/journal.pone.0294289] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2023] [Accepted: 10/28/2023] [Indexed: 03/17/2024] Open
Abstract
The COVID-19 pandemic has had a significant impact on both the United Arab Emirates (UAE) and Malaysia, emphasizing the importance of developing accurate and reliable forecasting mechanisms to guide public health responses and policies. In this study, we compared several cutting-edge deep learning models, including Long Short-Term Memory (LSTM), bidirectional LSTM, Convolutional Neural Networks (CNN), hybrid CNN-LSTM, Multilayer Perceptron's, and Recurrent Neural Networks (RNN), to project COVID-19 cases in the aforementioned regions. These models were calibrated and evaluated using a comprehensive dataset that includes confirmed case counts, demographic data, and relevant socioeconomic factors. To enhance the performance of these models, Bayesian optimization techniques were employed. Subsequently, the models were re-evaluated to compare their effectiveness. Analytic approaches, both predictive and retrospective in nature, were used to interpret the data. Our primary objective was to determine the most effective model for predicting COVID-19 cases in the United Arab Emirates (UAE) and Malaysia. The findings indicate that the selected deep learning algorithms were proficient in forecasting COVID-19 cases, although their efficacy varied across different models. After a thorough evaluation, the model architectures most suitable for the specific conditions in the UAE and Malaysia were identified. Our study contributes significantly to the ongoing efforts to combat the COVID-19 pandemic, providing crucial insights into the application of sophisticated deep learning algorithms for the precise and timely forecasting of COVID-19 cases. These insights hold substantial value for shaping public health strategies, enabling authorities to develop targeted and evidence-based interventions to manage the virus spread and its impact on the populations of the UAE and Malaysia. The study confirms the usefulness of deep learning methodologies in efficiently processing complex datasets and generating reliable projections, a skill of great importance in healthcare and professional settings.
Collapse
Affiliation(s)
- Muhammad Usman Tariq
- Abu Dhabi University, Abu Dhabi, United Arab Emirates
- Universiti Tun Hussein Onn Malaysia (UTHM), Parit Raja, Malaysia
| | | |
Collapse
|
3
|
Amoon M, Altameem T, Hashem M. Consistent Healthcare Safety Recommendation System for Preventing Contagious Disease Infections in Human Crowds. SENSORS (BASEL, SWITZERLAND) 2023; 23:9394. [PMID: 38067767 PMCID: PMC10708775 DOI: 10.3390/s23239394] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/25/2023] [Revised: 11/17/2023] [Accepted: 11/20/2023] [Indexed: 12/18/2023]
Abstract
The recent impact of COVID-19, as a contagious disease, led researchers to focus on designing and fabricating personal healthcare devices and systems. With the help of wearable sensors, sensing and communication technologies, and recommendation modules, personal healthcare systems were designed for ease of use. More specifically, personal healthcare systems were designed to provide recommendations for maintaining a safe distance and avoiding contagious disease spread after the COVID-19 pandemic. The personal recommendations are analyzed based on the wearable sensor signals and their consistency in sensing. This consistency varies with human movements or other activities that hike/cease the sensor values abruptly for a short period. Therefore, a consistency-focused recommendation system (CRS) for personal healthcare (PH) was designed in this research. The hardware sensing intervals for the system are calibrated per the conventional specifications from which abrupt changes can be observed. The changes are analyzed for their saturation and fluctuations observed from neighbors within the threshold distance. The saturation and fluctuation classifications are performed using random forest learning to differentiate the above data from the previously sensed healthy data. In this process, the saturated data and consistency data provide safety recommendations for the moving user. The consistency is verified for a series of intervals for the fluctuating sensed data. This alerts the user if the threshold distance for a contagious disease is violated. The proposed system was validated using a prototype model and experimental analysis through false rates, data analysis rates, and fluctuations.
Collapse
Affiliation(s)
- Mohammed Amoon
- Department of Computer Science, Community College, King Saud University, P.O. Box 28095, Riyadh 11437, Saudi Arabia;
| | - Torki Altameem
- Department of Computer Science, Community College, King Saud University, P.O. Box 28095, Riyadh 11437, Saudi Arabia;
| | - Mohammed Hashem
- Department of Dental Health, College of Applied Medical Sciences, King Saud University, P.O. Box 12372, Riyadh 12372, Saudi Arabia;
| |
Collapse
|
4
|
Erfani A, Frias-Martinez V. A fairness assessment of mobility-based COVID-19 case prediction models. PLoS One 2023; 18:e0292090. [PMID: 37851681 PMCID: PMC10584164 DOI: 10.1371/journal.pone.0292090] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2023] [Accepted: 09/12/2023] [Indexed: 10/20/2023] Open
Abstract
In light of the outbreak of COVID-19, analyzing and measuring human mobility has become increasingly important. A wide range of studies have explored spatiotemporal trends over time, examined associations with other variables, evaluated non-pharmacologic interventions (NPIs), and predicted or simulated COVID-19 spread using mobility data. Despite the benefits of publicly available mobility data, a key question remains unanswered: are models using mobility data performing equitably across demographic groups? We hypothesize that bias in the mobility data used to train the predictive models might lead to unfairly less accurate predictions for certain demographic groups. To test our hypothesis, we applied two mobility-based COVID infection prediction models at the county level in the United States using SafeGraph data, and correlated model performance with sociodemographic traits. Findings revealed that there is a systematic bias in models' performance toward certain demographic characteristics. Specifically, the models tend to favor large, highly educated, wealthy, young, and urban counties. We hypothesize that the mobility data currently used by many predictive models tends to capture less information about older, poorer, less educated and people from rural regions, which in turn negatively impacts the accuracy of the COVID-19 prediction in these areas. Ultimately, this study points to the need of improved data collection and sampling approaches that allow for an accurate representation of the mobility patterns across demographic groups.
Collapse
Affiliation(s)
- Abdolmajid Erfani
- Department of Civil, Environmental, and Geospatial Engineering, Michigan Technological University, Houghton, MI, United States of America
| | - Vanessa Frias-Martinez
- College of Information Studies, University of Maryland, College Park, MD, United States of America
- University of Maryland Institute for Advanced Computer Studies, University of Maryland, College Park, MD, United States of America
| |
Collapse
|
5
|
Ghosh S, Mukherjee A. STROVE: spatial data infrastructure enabled cloud-fog-edge computing framework for combating COVID-19 pandemic. INNOVATIONS IN SYSTEMS AND SOFTWARE ENGINEERING 2022:1-17. [PMID: 35677629 PMCID: PMC9162382 DOI: 10.1007/s11334-022-00458-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/30/2021] [Accepted: 05/04/2022] [Indexed: 06/15/2023]
Abstract
The outbreak of 2019 novel coronavirus (COVID-19) has triggered unprecedented challenges and put the whole world in a parlous condition. The impacts of COVID-19 is a matter of grave concern in terms of fatality rate, socio-economical condition, health infrastructure. It is obvious that only pharmaceutical solutions (vaccine) cannot eradicate this pandemic completely, and effective strategies regarding lockdown measures, restricted mobility, emergency services to users-in brief data-driven decision system is of utmost importance. This necessitates an efficient data analytics framework, data infrastructure to store, manage pandemic related information, and distributed computing platform to support such data-driven operations. In the past few decades, Internet of Things-based devices and applications have emerged significantly in various sectors including healthcare and time-critical applications. To be specific, health-sensors help to accumulate health-related parameters at different time-instances of a day, the movement sensors keep track of mobility traces of the user, and helps to assist them in varied conditions. The smartphones are equipped with several such sensors and the ability of low-cost connected sensors to cover large areas makes it the most useful component to combat pandemics such as COVID-19. However, analysing and managing the huge amount of data generated by these sensors is a big challenge. In this paper we have proposed a unified framework which has three major components: (i) Spatial Data Infrastructure to manage, store, analyse and share spatio-temporal information with stakeholders efficiently, (ii) Cloud-Fog-Edge-based hierarchical architecture to support preliminary diagnosis, monitoring patients' mobility, health parameters and activities while they are in quarantine or home-based treatment, and (iii) Assisting users in varied emergency situation leveraging efficient data-driven techniques at low-latency and energy consumption. The mobility data analytics along with SDI is required to interpret the movement dynamics of the region and correlate with COVID-19 hotspots. Further, Cloud-Fog-Edge-based system architecture is required to provision healthcare services efficiently and in timely manner. The proposed framework yields encouraging results in taking decisions based on the COVID-19 context and assisting users effectively by enhancing accuracy of detecting suspected infected people by ∼ 24% and reducing delay by ∼ 55% compared to cloud-only system.
Collapse
Affiliation(s)
- Shreya Ghosh
- Department of Computer Science and Engineering, Indian Institute of Technology Kharagpur, Kharagpur, India
- College of Information Sciences and Technology, The Pennsylvania State University, State College, USA
| | - Anwesha Mukherjee
- Department of Computer Science, Mahishadal Raj College, Mahishadal, West Bengal India
| |
Collapse
|
6
|
Zhao J, Han M, Wang Z, Wan B. Autoregressive count data modeling on mobility patterns to predict cases of COVID-19 infection. STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT : RESEARCH JOURNAL 2022; 36:4185-4200. [PMID: 35765667 PMCID: PMC9223272 DOI: 10.1007/s00477-022-02255-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 05/18/2022] [Indexed: 05/07/2023]
Abstract
At the beginning of 2022 the global daily count of new cases of COVID-19 exceeded 3.2 million, a tripling of the historical peak value reported between the initial outbreak of the pandemic and the end of 2021. Aerosol transmission through interpersonal contact is the main cause of the disease's spread, although control measures have been put in place to reduce contact opportunities. Mobility pattern is a basic mechanism for understanding how people gather at a location and how long they stay there. Due to the inherent dependencies in disease transmission, models for associating mobility data with confirmed cases need to be individually designed for different regions and time periods. In this paper, we propose an autoregressive count data model under the framework of a generalized linear model to illustrate a process of model specification and selection. By evaluating a 14-day-ahead prediction from Sweden, the results showed that for a dense population region, using mobility data with a lag of 8 days is the most reliable way of predicting the number of confirmed cases in relative numbers at a high coverage rate. It is sufficient for both of the autoregressive terms, studied variable and conditional expectation, to take one day back. For sparsely populated regions, a lag of 10 days produced the lowest error in absolute value for the predictions, where weekly periodicity on the studied variable is recommended for use. Interventions were further included to identify the most relevant mobility categories. Statistical features were also presented to verify the model assumptions.
Collapse
Affiliation(s)
- Jing Zhao
- School of Business Administration, Xi’an Eurasia University, Yanta District, Xi’an, China
| | - Mengjie Han
- School of Information and Engineering, Dalarna University, 79188 Falun, Sweden
| | - Zhenwu Wang
- Department of Computer Science and Technology, China University of Mining and Technology, Beijing, 100083 China
| | - Benting Wan
- School of Software and IoT Engineering, Jiangxi University of Finance and Economics, Nanchang, 330013 China
| |
Collapse
|
7
|
Leite GS, Albuquerque AB, Pinheiro PR. Applications of Technological Solutions in Primary Ways of Preventing Transmission of Respiratory Infectious Diseases-A Systematic Literature Review. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021; 18:10765. [PMID: 34682511 PMCID: PMC8535524 DOI: 10.3390/ijerph182010765] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/11/2021] [Revised: 10/09/2021] [Accepted: 10/11/2021] [Indexed: 12/23/2022]
Abstract
With the growing concern about the spread of new respiratory infectious diseases, several studies involving the application of technology in the prevention of these diseases have been carried out. Among these studies, it is worth highlighting the importance of those focused on the primary forms of prevention, such as social distancing, mask usage, quarantine, among others. This importance arises because, from the emergence of a new disease to the production of immunizers, preventive actions must be taken to reduce contamination and fatalities rates. Despite the considerable number of studies, no records of works aimed at the identification, registration, selection, and rigorous analysis and synthesis of the literature were found. For this purpose, this paper presents a systematic review of the literature on the application of technological solutions in the primary ways of respiratory infectious diseases transmission prevention. From the 1139 initially retrieved, 219 papers were selected for data extraction, analysis, and synthesis according to predefined inclusion and exclusion criteria. Results enabled the identification of a general categorization of application domains, as well as mapping of the adopted support mechanisms. Findings showed a greater trend in studies related to pandemic planning and, among the support mechanisms adopted, data and mathematical application-related solutions received greater attention. Topics for further research and improvement were also identified such as the need for a better description of data analysis and evidence.
Collapse
Affiliation(s)
- Gleidson Sobreira Leite
- UNIFOR, Department of Computer Science, University of Fortaleza, Fortaleza 60811-905, Ceará, Brazil; (A.B.A.); (P.R.P.)
| | | | | |
Collapse
|
8
|
Evaluation of Polycentric Spatial Structure in the Urban Agglomeration of the Pearl River Delta (PRD) Based on Multi-Source Big Data Fusion. REMOTE SENSING 2021. [DOI: 10.3390/rs13183639] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
The rapid development of the urban city has led to great changes in the urban spatial structure. Thus, analyses of polycentric urban spatial structures are important for understanding these kinds of structures. In order to accurately evaluate the polycentric spatial structure of urban agglomerations and judge the differences between the actual development situation and overall planning of urban agglomerations, this study proposes a new method to identify the polycentric spatial structure of urban agglomerations in the Pearl River Delta based on the fusion of nighttime light (NTL) data, point of interest (POI) data, and Tencent migration data (TMG). In the first step, the NTL, POI, and TMG data are fused via wavelet transform; in the second step, Anselin local Moran’s I (LMI) and geographically weighted regression (GWR) were used to identify the main centers and subcenters, respectively. In the third step, the accuracy of the results of this study was further verified and discussed in the context of overall planning. The results show that the accuracy of urban polycenter identification via LMI and GWR after data fusion was 92.84%, and the Kappa value was 0.8971, which was higher than the results of polycenter identification via the traditional relative threshold. After comparing the identification results with the overall planning, firstly, we see that the fusion of multi-source big data can help to accurately evaluate the polycentric spatial structure within the urban agglomeration. Secondly, the fusion of dynamic data and static data can help identify the polycentric spatial structure of urban space more accurately. Therefore, this study can provide a new design for urban polycentric spatial structures, and further provide a reliable reference for the spatial optimization of urban agglomeration and the formulation of regional spatial development policies.
Collapse
|