1
|
Li R, Song Y, Qu H, Li M, Jiang GP. A data-driven epidemic model with human mobility and vaccination protection for COVID-19 prediction. J Biomed Inform 2024; 149:104571. [PMID: 38092247 DOI: 10.1016/j.jbi.2023.104571] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2023] [Revised: 11/22/2023] [Accepted: 12/07/2023] [Indexed: 12/18/2023]
Abstract
Epidemiological models allow for quantifying the dynamic characteristics of large-scale outbreaks. However, capturing detailed and accurate epidemiological information often requires consideration of multiple kinetic mechanisms and parameters. Due to the uncertainty of pandemic evolution, such as pathogen variation, host immune response and changes in mitigation strategies, the parameter evaluation and state prediction of complex epidemiological models are challenging. Here, we develop a data-driven epidemic model with a generalized SEIR mechanistic structure that includes new compartments, human mobility and vaccination protection. To address the issue of model complexity, we embed the epidemiological model dynamics into physics-informed neural networks (PINN), taking the observed series of time instances as direct input of the network to simultaneously infer unknown parameters and unobserved dynamics of the underlying model. Using actual data during the COVID-19 outbreak in Australia, Israel, and Switzerland, our model framework demonstrates satisfactory performance in multi-step ahead predictions compared to several benchmark models. Moreover, our model infers time-varying parameters such as transmission rates, hospitalization ratios, and effective reproduction numbers, as well as calculates the latent period and asymptomatic infection count, which are typically unreported in public data. Finally, we employ the proposed data-driven model to analyze the impact of different mitigation strategies on COVID-19.
Collapse
Affiliation(s)
- Ruqi Li
- School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing 210003, China
| | - Yurong Song
- College of Automation and College of Artificial Intelligence, Nanjing University of Posts and Telecommunications, Nanjing 210023, China.
| | - Hongbo Qu
- School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing 210003, China
| | - Min Li
- College of Automation and College of Artificial Intelligence, Nanjing University of Posts and Telecommunications, Nanjing 210023, China
| | - Guo-Ping Jiang
- College of Automation and College of Artificial Intelligence, Nanjing University of Posts and Telecommunications, Nanjing 210023, China
| |
Collapse
|
2
|
Pinto JPG, Magalhães PC, Figueiredo GM, Alves D, Angel DMS. Local protection bubbles: an interpretation of the slowdown in the spread of coronavirus in the city of São Paulo, Brazil, in July 2020. CAD SAUDE PUBLICA 2023; 39:e00109522. [PMID: 38126417 PMCID: PMC10727033 DOI: 10.1590/0102-311xen109522] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Revised: 08/07/2023] [Accepted: 09/13/2023] [Indexed: 12/23/2023] Open
Abstract
After four months of fighting the pandemic, the city of São Paulo, Brazil, entered a phase of relaxed social distancing measures in July 2020. Simultaneously, there was a decline in the social distancing rate and a reduction in the number of cases, fatalities, and hospital bed occupancy. To understand the pandemic dynamics in the city of São Paulo, we developed a multi-agent simulation model. Surprisingly, the counter-intuitive results of the model followed the city's reality. We argue that this phenomenon could be attributed to local bubbles of protection that emerged in the absence of contagion networks. These bubbles reduced the transmission rate of the virus, causing short and temporary reductions in the epidemic curve - but manifested as an unstable equilibrium. Our hypothesis aligns with the virus spread dynamics observed thus far, without the need for ad hoc assumptions regarding the natural thresholds of collective immunity or the heterogeneity of the population's transmission rate, which may lead to erroneous predictions. Our model was designed to be user-friendly and does not require any scientific or programming expertise to generate outcomes on virus transmission in a given location. Furthermore, as an input to start our simulation model, we developed the COVID-19 Protection Index as an alternative to the Human Development Index, which measures a given territory vulnerability to the coronavirus and includes characteristics of the health system and socioeconomic development, as well as the infrastructure of the city of São Paulo.
Collapse
Affiliation(s)
| | | | | | - Domingos Alves
- Faculdade de Medicina de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto, Brasil
| | | |
Collapse
|
3
|
Heredia Cacha I, Sáinz-Pardo Díaz J, Castrillo M, López García Á. Forecasting COVID-19 spreading through an ensemble of classical and machine learning models: Spain's case study. Sci Rep 2023; 13:6750. [PMID: 37185927 PMCID: PMC10127188 DOI: 10.1038/s41598-023-33795-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2022] [Accepted: 04/19/2023] [Indexed: 05/17/2023] Open
Abstract
In this work the applicability of an ensemble of population and machine learning models to predict the evolution of the COVID-19 pandemic in Spain is evaluated, relying solely on public datasets. Firstly, using only incidence data, we trained machine learning models and adjusted classical ODE-based population models, especially suited to capture long term trends. As a novel approach, we then made an ensemble of these two families of models in order to obtain a more robust and accurate prediction. We then proceed to improve machine learning models by adding more input features: vaccination, human mobility and weather conditions. However, these improvements did not translate to the overall ensemble, as the different model families had also different prediction patterns. Additionally, machine learning models degraded when new COVID variants appeared after training. We finally used Shapley Additive Explanation values to discern the relative importance of the different input features for the machine learning models' predictions. The conclusion of this work is that the ensemble of machine learning models and population models can be a promising alternative to SEIR-like compartmental models, especially given that the former do not need data from recovered patients, which are hard to collect and generally unavailable.
Collapse
Affiliation(s)
- Ignacio Heredia Cacha
- Instituto de Física de Cantabria (IFCA), CSIC-UC, Avda. los Castros s/n., 39005, Santander, Spain
| | - Judith Sáinz-Pardo Díaz
- Instituto de Física de Cantabria (IFCA), CSIC-UC, Avda. los Castros s/n., 39005, Santander, Spain
| | - María Castrillo
- Instituto de Física de Cantabria (IFCA), CSIC-UC, Avda. los Castros s/n., 39005, Santander, Spain
| | - Álvaro López García
- Instituto de Física de Cantabria (IFCA), CSIC-UC, Avda. los Castros s/n., 39005, Santander, Spain.
| |
Collapse
|
4
|
Muñoz-Organero M. Space-Distributed Traffic-Enhanced LSTM-Based Machine Learning Model for COVID-19 Incidence Forecasting. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:4307708. [PMID: 36438691 PMCID: PMC9699744 DOI: 10.1155/2022/4307708] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/08/2022] [Revised: 10/17/2022] [Accepted: 11/10/2022] [Indexed: 09/02/2023]
Abstract
The COVID-19 virus continues to generate waves of infections around the world. With major areas in developing countries still lagging behind in vaccination campaigns, the risk of new variants that can cause re-infections worldwide makes the monitoring and forecasting of the evolution of the virus a high priority. Having accurate models able to forecast the incidence of the spread of the virus provides help to policymakers and health professionals in managing the scarce resources in an optimal way. In this paper, a new machine learning model is proposed to forecast the spread of the virus one-week ahead in a geographic area which combines mobility and COVID-19 incidence data. The area is divided into zones or districts according to the location of the COVID-19 measuring points. A traffic-driven mobility estimate among adjacent districts is proposed to capture the spatial spread of the virus. Traffic-driven mobility in adjacent districts will be used together with COVID-19 incidence data to feed a new deep learning LSTM-based model which will extract patterns from mobility-modulated COVID-19 incidence spatiotemporal data in order to optimize one-week ahead estimations. The model is trained and validated with open data available for the city of Madrid (Spain) for 3 different validation scenarios. A baseline model based on previous literature able to extract temporal patterns in COVID-19 incidence time series is also trained with the same dataset. The results show that the proposed model, based on the combination of traffic and COVID-19 incidence data, is able to outperform the baseline model in all the validation scenarios.
Collapse
Affiliation(s)
- Mario Muñoz-Organero
- Telematic Engineering Department, Universidad Carlos III de Madrid, Leganes 28911, Madrid, Spain
| |
Collapse
|
5
|
Xu Y, Ye W, Song Q, Shen L, Liu Y, Guo Y, Liu G, Wu H, Wang X, Sun X, Bai L, Luo C, Liao T, Chen H, Song C, Huang C, Wu Y, Xu Z. Using machine learning models to predict the duration of the recovery of COVID-19 patients hospitalized in Fangcang shelter hospital during the Omicron BA. 2.2 pandemic. Front Med (Lausanne) 2022; 9:1001801. [PMID: 36405610 PMCID: PMC9666500 DOI: 10.3389/fmed.2022.1001801] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2022] [Accepted: 09/30/2022] [Indexed: 11/05/2022] Open
Abstract
Background Factors that may influence the recovery of patients with confirmed SARS-CoV-2 infection hospitalized in the Fangcang shelter were explored, and machine learning models were constructed to predict the duration of recovery during the Omicron BA. 2.2 pandemic. Methods A retrospective study was conducted at Hongqiao National Exhibition and Convention Center Fangcang shelter (Shanghai, China) from April 9, 2022 to April 25, 2022. The demographics, clinical data, inoculation history, and recovery information of the 13,162 enrolled participants were collected. A multivariable logistic regression model was used to identify independent factors associated with 7-day recovery and 14-day recovery. Machine learning algorithms (DT, SVM, RF, DT/AdaBoost, AdaBoost, SMOTEENN/DT, SMOTEENN/SVM, SMOTEENN/RF, SMOTEENN+DT/AdaBoost, and SMOTEENN/AdaBoost) were used to build models for predicting 7-day and 14-day recovery. Results Of the 13,162 patients in the study, the median duration of recovery was 8 days (interquartile range IQR, 6–10 d), 41.31% recovered within 7 days, and 94.83% recovered within 14 days. Univariate analysis showed that the administrative region, age, cough medicine, comorbidities, diabetes, coronary artery disease (CAD), hypertension, number of comorbidities, CT value of the ORF gene, CT value of the N gene, ratio of ORF/IC, and ratio of N/IC were associated with a duration of recovery within 7 days. Age, gender, vaccination dose, cough medicine, comorbidities, diabetes, CAD, hypertension, number of comorbidities, CT value of the ORF gene, CT value of the N gene, ratio of ORF/IC, and ratio of N/IC were related to a duration of recovery within 14 days. In the multivariable analysis, the receipt of two doses of the vaccination vs. unvaccinated (OR = 1.118, 95% CI = 1.003–1.248; p = 0.045), receipt of three doses of the vaccination vs. unvaccinated (OR = 1.114, 95% CI = 1.004–1.236; p = 0.043), diabetes (OR = 0.383, 95% CI = 0.194–0.749; p = 0.005), CAD (OR = 0.107, 95% CI = 0.016–0.421; p = 0.005), hypertension (OR = 0.371, 95% CI = 0.202–0.674; p = 0.001), and ratio of N/IC (OR = 3.686, 95% CI = 2.939–4.629; p < 0.001) were significantly and independently associated with a duration of recovery within 7 days. Gender (OR = 0.736, 95% CI = 0.63–0.861; p < 0.001), age (30–70) (OR = 0.738, 95% CI = 0.594–0.911; p < 0.001), age (>70) (OR = 0.38, 95% CI = 0292–0.494; p < 0.001), receipt of three doses of the vaccination vs. unvaccinated (OR = 1.391, 95% CI = 1.12–1.719; p = 0.0033), cough medicine (OR = 1.509, 95% CI = 1.075–2.19; p = 0.023), and symptoms (OR = 1.619, 95% CI = 1.306–2.028; p < 0.001) were significantly and independently associated with a duration of recovery within 14 days. The SMOTEEN/RF algorithm performed best, with an accuracy of 90.32%, sensitivity of 92.22%, specificity of 88.31%, F1 score of 90.71%, and AUC of 89.75% for the 7-day recovery prediction; and an accuracy of 93.81%, sensitivity of 93.40%, specificity of 93.81%, F1 score of 93.42%, and AUC of 93.53% for the 14-day recovery prediction. Conclusion Age and vaccination dose were factors robustly associated with accelerated recovery both on day 7 and day 14 from the onset of disease during the Omicron BA. 2.2 wave. The results suggest that the SMOTEEN/RF-based model could be used to predict the probability of 7-day and 14-day recovery from the Omicron variant of SARS-CoV-2 infection for COVID-19 prevention and control policy in other regions or countries. This may also help to generate external validation for the model.
Collapse
Affiliation(s)
- Yu Xu
- Respiratory and Critical Care Medical Center, Xinqiao Hospital, Army Medical University, Chongqing, China
| | - Wei Ye
- Department of Health Statistics, Army Medical University, Chongqing, China
| | - Qiuyue Song
- Department of Health Statistics, Army Medical University, Chongqing, China
| | - Linlin Shen
- Respiratory and Critical Care Medical Center, Xinqiao Hospital, Army Medical University, Chongqing, China
| | - Yu Liu
- Respiratory and Critical Care Medical Center, Xinqiao Hospital, Army Medical University, Chongqing, China
- National Exhibition and Convention Center Fangcang Shelter Hospital, Shanghai, China
| | - Yuhang Guo
- Respiratory and Critical Care Medical Center, Xinqiao Hospital, Army Medical University, Chongqing, China
- National Exhibition and Convention Center Fangcang Shelter Hospital, Shanghai, China
| | - Gang Liu
- Respiratory and Critical Care Medical Center, Xinqiao Hospital, Army Medical University, Chongqing, China
- National Exhibition and Convention Center Fangcang Shelter Hospital, Shanghai, China
| | - Hongmei Wu
- Respiratory and Critical Care Medical Center, Xinqiao Hospital, Army Medical University, Chongqing, China
- National Exhibition and Convention Center Fangcang Shelter Hospital, Shanghai, China
| | - Xia Wang
- Respiratory and Critical Care Medical Center, Xinqiao Hospital, Army Medical University, Chongqing, China
| | - Xiaorong Sun
- Respiratory and Critical Care Medical Center, Xinqiao Hospital, Army Medical University, Chongqing, China
| | - Li Bai
- Respiratory and Critical Care Medical Center, Xinqiao Hospital, Army Medical University, Chongqing, China
| | - Chunmei Luo
- National Exhibition and Convention Center Fangcang Shelter Hospital, Shanghai, China
- Department of Orthopedics, Xinqiao Hospital, Army Medical University, Chongqing, China
| | - Tongquan Liao
- National Exhibition and Convention Center Fangcang Shelter Hospital, Shanghai, China
- Department of Medical Administration, Xinqiao Hospital, Army Medical University, Chongqing, China
- *Correspondence: Tongquan Liao
| | - Hao Chen
- National Exhibition and Convention Center Fangcang Shelter Hospital, Shanghai, China
- Academic Affairs Office, Army Medical University, Chongqing, China
- Hao Chen
| | - Caiping Song
- National Exhibition and Convention Center Fangcang Shelter Hospital, Shanghai, China
- Xinqiao Hospital, Army Medical University, Chongqing, China
- Caiping Song
| | - Chunji Huang
- National Exhibition and Convention Center Fangcang Shelter Hospital, Shanghai, China
- Army Medical University, Chongqing, China
- Chunji Huang
| | - Yazhou Wu
- Department of Health Statistics, Army Medical University, Chongqing, China
- Yazhou Wu
| | - Zhi Xu
- Respiratory and Critical Care Medical Center, Xinqiao Hospital, Army Medical University, Chongqing, China
- National Exhibition and Convention Center Fangcang Shelter Hospital, Shanghai, China
- Zhi Xu ;
| |
Collapse
|
6
|
Colnago M, Benvenuto GA, Casaca W, Negri RG, Fernandes EG, Cuminato JA. Risk Factors Associated with Mortality in Hospitalized Patients with COVID-19 during the Omicron Wave in Brazil. Bioengineering (Basel) 2022; 9:bioengineering9100584. [PMID: 36290552 PMCID: PMC9598428 DOI: 10.3390/bioengineering9100584] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2022] [Revised: 10/03/2022] [Accepted: 10/18/2022] [Indexed: 11/16/2022] Open
Abstract
Considering the imminence of new SARS-CoV-2 variants and COVID-19 vaccine availability, it is essential to understand the impact of the disease on the most vulnerable groups and those at risk of death from the disease. To this end, the odds ratio (OR) for mortality and hospitalization was calculated for different groups of patients by applying an adjusted logistic regression model based on the following variables of interest: gender, booster vaccination, age group, and comorbidity occurrence. A massive number of data were extracted and compiled from official Brazilian government resources, which include all reported cases of hospitalizations and deaths associated with severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) in Brazil during the “wave” of the Omicron variant (BA.1 substrain). Males (1.242; 95% CI 1.196–1.290) aged 60–79 (3.348; 95% CI 3.050–3.674) and 80 years or older (5.453; 95% CI 4.966–5.989), and hospitalized patients with comorbidities (1.418; 95% CI 1.355–1.483), were more likely to die. There was a reduction in the risk of death (0.907; 95% CI 0.866–0.951) among patients who had received the third dose of the anti-SARS-CoV-2 vaccine (booster). Additionally, this big data investigation has found statistical evidence that vaccination can support mitigation plans concerning the current scenario of COVID-19 in Brazil since the Omicron variant and its substrains are now prevalent across the entire country.
Collapse
Affiliation(s)
- Marilaine Colnago
- Institute of Chemistry, São Paulo State University (UNESP), Araraquara 14800-060, Brazil
| | - Giovana A. Benvenuto
- Faculty of Science and Technology (FCT), São Paulo State University (UNESP), Presidente Prudente 19060-900, Brazil
| | - Wallace Casaca
- Institute of Biosciences, Letters and Exact Sciences, São Paulo State University (UNESP), São José do Rio Preto 15054-000, Brazil
- Correspondence:
| | - Rogério G. Negri
- Science and Technology Institute, São Paulo State University (UNESP), São José dos Campos 12247-004, Brazil
| | - Eder G. Fernandes
- Immunization Division—Centre of Epidemiology Surveillance of the São Paulo State Health Department, São Paulo 01246-000, Brazil
| | - José A. Cuminato
- Institute of Mathematics and Computer Science, São Paulo University (USP), São Carlos 13566-590, Brazil
| |
Collapse
|
7
|
Pham P, Pedrycz W, Vo B. Dual attention-based sequential auto-encoder for Covid-19 outbreak forecasting: A case study in Vietnam. EXPERT SYSTEMS WITH APPLICATIONS 2022; 203:117514. [PMID: 35607612 PMCID: PMC9117090 DOI: 10.1016/j.eswa.2022.117514] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/20/2021] [Revised: 05/02/2022] [Accepted: 05/03/2022] [Indexed: 06/15/2023]
Abstract
For preventing the outbreaks of Covid-19 infection in different countries, many organizations and governments have extensively studied and applied different kinds of quarantine isolation policies, medical treatments as well as organized massive/fast vaccination strategy for over-18 citizens. There are several valuable lessons have been achieved in different countries this Covid-19 battle. These studies have presented the usefulness of prompt actions in testing, isolating confirmed infectious cases from community as well as social resource planning/optimization through data-driven anticipation. In recent times, many studies have demonstrated the effectiveness of short/long-term forecasting in number of new Covid-19 cases in forms of time-series data. These predictions have directly supported to effectively optimize the available healthcare resources as well as imposing suitable policies for slowing down the Covid-19 spreads, especially in high-populated cities/regions/nations. There are several progresses of deep neural architectures, such as recurrent neural network (RNN) have demonstrated significant improvements in analyzing and learning the time-series datasets for conducting better predictions. However, most of recent RNN-based techniques are considered as unable to handle chaotic/non-smooth sequential datasets. The consecutive disturbances and lagged observations from chaotic time-series dataset like as routine Covid-19 confirmed cases have led to the low performance in temporal feature learning process through recent RNN-based models. To meet this challenge, in this paper, we proposed a novel dual attention-based sequential auto-encoding architecture, called as: DAttAE. Our proposed model supports to effectively learn and predict the new Covid-19 cases in forms of chaotic and non-smooth time series dataset. Specifically, the integration between dual self-attention mechanism in a given Bi-LSTM based auto-encoder in our proposed model supports to directly focus the model on a specific time-range sequence in order to achieve better prediction. We evaluated the performance of our proposed DAttAE model by comparing with multiple traditional and state-of-the-art deep learning-based techniques for time-series prediction task upon different real-world datasets. Experimental outputs demonstrated the effectiveness of our proposed attention-based deep neural approach in comparing with state-of-the-art RNN-based architectures for time series based Covid-19 outbreak prediction task.
Collapse
Affiliation(s)
- Phu Pham
- Faculty of Information Technology, HUTECH University, Ho Chi Minh City, Viet Nam
| | - Witold Pedrycz
- Department of Electrical & Computer Engineering, University of Alberta, Edmonton T6R 2V4, Canada
- Warsaw School of Information Technology, Newelska 6, Warsaw, Poland
| | - Bay Vo
- Faculty of Information Technology, HUTECH University, Ho Chi Minh City, Viet Nam
| |
Collapse
|
8
|
A Novel Approach on Deep Learning—Based Decision Support System Applying Multiple Output LSTM-Autoencoder: Focusing on Identifying Variations by PHSMs’ Effect over COVID-19 Pandemic. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022; 19:ijerph19116763. [PMID: 35682349 PMCID: PMC9180123 DOI: 10.3390/ijerph19116763] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/28/2022] [Revised: 05/24/2022] [Accepted: 05/29/2022] [Indexed: 11/30/2022]
Abstract
Following the outbreak of the COVID-19 pandemic, the continued emergence of major variant viruses has caused enormous damage worldwide by generating social and economic ripple effects, and the importance of PHSMs (Public Health and Social Measures) is being highlighted to cope with this severe situation. Accordingly, there has also been an increase in research related to a decision support system based on simulation approaches used as a basis for PHSMs. However, previous studies showed limitations impeding utilization as a decision support system for policy establishment and implementation, such as the failure to reflect changes in the effectiveness of PHSMs and the restriction to short-term forecasts. Therefore, this study proposes an LSTM-Autoencoder-based decision support system for establishing and implementing PHSMs. To overcome the limitations of existing studies, the proposed decision support system used a methodology for predicting the number of daily confirmed cases over multiple periods based on multiple output strategies and a methodology for rapidly identifying varies in policy effects based on anomaly detection. It was confirmed that the proposed decision support system demonstrated excellent performance compared to models used for time series analysis such as statistical models and deep learning models. In addition, we endeavored to increase the usability of the proposed decision support system by suggesting a transfer learning-based methodology that can efficiently reflect variations in policy effects. Finally, the decision support system proposed in this study provides a methodology that provides multi-period forecasts, identifying variations in policy effects, and efficiently reflects the effects of variation policies. It was intended to provide reasonable and realistic information for the establishment and implementation of PHSMs and, through this, to yield information expected to be highly useful, which had not been provided in the decision support systems presented in previous studies.
Collapse
|
9
|
Verma H, Mandal S, Gupta A. Temporal deep learning architecture for prediction of COVID-19 cases in India. EXPERT SYSTEMS WITH APPLICATIONS 2022; 195:116611. [PMID: 35153389 PMCID: PMC8817764 DOI: 10.1016/j.eswa.2022.116611] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/28/2021] [Revised: 01/09/2022] [Accepted: 01/22/2022] [Indexed: 05/05/2023]
Abstract
To combat the recent coronavirus disease 2019 (COVID-19), academician and clinician are in search of new approaches to predict the COVID-19 outbreak dynamic trends that may slow down or stop the pandemic. Epidemiological models like Susceptible-Infected-Recovered (SIR) and its variants are helpful to understand the dynamics trend of pandemic that may be used in decision making to optimize possible controls from the infectious disease. But these epidemiological models based on mathematical assumptions may not predict the real pandemic situation. Recently the new machine learning approaches are being used to understand the dynamic trend of COVID-19 spread. In this paper, we designed the recurrent and convolutional neural network models: vanilla LSTM, stacked LSTM, ED_LSTM, BiLSTM, CNN, and hybrid CNN+LSTM model to capture the complex trend of COVID-19 outbreak and perform the forecasting of COVID-19 daily confirmed cases of 7, 14, 21 days for India and its four most affected states (Maharashtra, Kerala, Karnataka, and Tamil Nadu). The root mean square error (RMSE) and mean absolute percentage error (MAPE) evaluation metric are computed on the testing data to demonstrate the relative performance of these models. The results show that the stacked LSTM and hybrid CNN+LSTM models perform best relative to other models.
Collapse
Affiliation(s)
- Hanuman Verma
- Bareilly College, Bareilly, Uttar Pradesh, 243005, India
| | - Saurav Mandal
- School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi, 110067, India
| | - Akshansh Gupta
- CSIR-Central Electronics Engineering Research Institute, Pilani Rajasthan, 333031, India
| |
Collapse
|
10
|
Zhou B, Yang G, Shi Z, Ma S. Interpretable Temporal Attention Network for COVID-19 forecasting. Appl Soft Comput 2022; 120:108691. [PMID: 35281183 PMCID: PMC8905883 DOI: 10.1016/j.asoc.2022.108691] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2021] [Revised: 02/17/2022] [Accepted: 02/26/2022] [Indexed: 12/18/2022]
Abstract
The worldwide outbreak of coronavirus disease 2019 (COVID-19) has triggered an unprecedented global health and economic crisis. Early and accurate forecasts of COVID-19 and evaluation of government interventions are crucial for governments to take appropriate interventions to contain the spread of COVID-19. In this work, we propose the Interpretable Temporal Attention Network (ITANet) for COVID-19 forecasting and inferring the importance of government interventions. The proposed model is with an encoder–decoder architecture and employs long short-term memory (LSTM) for temporal feature extraction and multi-head attention for long-term dependency caption. The model simultaneously takes historical information, a priori known future information, and pseudo future information into consideration, where the pseudo future information is learned with the covariate forecasting network (CFN) and multi-task learning (MTL). In addition, we also propose the degraded teacher forcing (DTF) method to train the model efficiently. Compared with other models, the ITANet is more effective in the forecasting of COVID-19 new confirmed cases. The importance of government interventions against COVID-19 is further inferred by the Temporal Covariate Interpreter (TCI) of the model.
Collapse
|
11
|
Palermo MB, Policarpo LM, Costa CAD, Righi RDR. Tracking machine learning models for pandemic scenarios: a systematic review of machine learning models that predict local and global evolution of pandemics. NETWORK MODELING ANALYSIS IN HEALTH INFORMATICS AND BIOINFORMATICS 2022; 11:40. [PMID: 36249862 PMCID: PMC9553296 DOI: 10.1007/s13721-022-00384-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Revised: 09/09/2022] [Accepted: 09/20/2022] [Indexed: 11/26/2022]
Abstract
This systematic review aims to study and classify machine learning models that predict pandemics' evolution within affected regions or countries. The advantage of this systematic review is that it allows the health authorities to decide what prediction model fits best depending upon the region's criticality and optimize hospitals' approaches to preparing and anticipating patient care. We searched ACM Digital Library, Biomed Central, BioRxiv+MedRxiv, BMJ, Computers and Applied Sciences, IEEEXplore, JMIR Medical Informatics, Medline Daily Updates, Nature, Oxford Academic, PubMed, Sage Online, ScienceDirect, Scopus, SpringerLink, Web of Science, and Wiley Online Library between 1 January 2020 and 31 July 2022. We divided the interventions into similarities between cumulative COVID-19 real cases and machine learning prediction models' ability to track pandemics trending. We included 45 studies that rated low to high risk of bias. The standardized mean differences (SMD) for the two groups were 0.18, 95% CI, with interval of [0.01, 0.35], I 2 =0, and p value=0.04. We built a taxonomic analysis of the included studies and determined two domains: pandemics trending prediction models and geolocation tracking models. We performed the meta-analysis and data synthesis and got low publication bias because of missing results. The level of certainty varied from very low to high. By submitting the 45 studies on the risk of bias, the levels of certainty, the summary of findings, and the statistical analysis via the forest and funnel plots assessments, we could determine the satisfactory statistical significance homogeneity across the included studies to simulate the progress of the pandemics and help the healthcare authorities to take preventive decisions.
Collapse
Affiliation(s)
- Marcelo Benedeti Palermo
- Software Innovation Laboratory-SOFTWARELAB, Programa de Pós-Graduação em Computação Aplicada, Universidade do Vale do Rio dos Sinos, Av. Unisinos 950, São Leopoldo, RS 93022-750 Brazil
| | - Lucas Micol Policarpo
- Software Innovation Laboratory-SOFTWARELAB, Programa de Pós-Graduação em Computação Aplicada, Universidade do Vale do Rio dos Sinos, Av. Unisinos 950, São Leopoldo, RS 93022-750 Brazil
| | - Cristiano André da Costa
- Software Innovation Laboratory-SOFTWARELAB, Programa de Pós-Graduação em Computação Aplicada, Universidade do Vale do Rio dos Sinos, Av. Unisinos 950, São Leopoldo, RS 93022-750 Brazil
| | - Rodrigo da Rosa Righi
- Software Innovation Laboratory-SOFTWARELAB, Programa de Pós-Graduação em Computação Aplicada, Universidade do Vale do Rio dos Sinos, Av. Unisinos 950, São Leopoldo, RS 93022-750 Brazil
| |
Collapse
|
12
|
Data-Driven Analytics Leveraging Artificial Intelligence in the Era of COVID-19: An Insightful Review of Recent Developments. Symmetry (Basel) 2021. [DOI: 10.3390/sym14010016] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
This paper presents the role of artificial intelligence (AI) and other latest technologies that were employed to fight the recent pandemic (i.e., novel coronavirus disease-2019 (COVID-19)). These technologies assisted the early detection/diagnosis, trends analysis, intervention planning, healthcare burden forecasting, comorbidity analysis, and mitigation and control, to name a few. The key-enablers of these technologies was data that was obtained from heterogeneous sources (i.e., social networks (SN), internet of (medical) things (IoT/IoMT), cellular networks, transport usage, epidemiological investigations, and other digital/sensing platforms). To this end, we provide an insightful overview of the role of data-driven analytics leveraging AI in the era of COVID-19. Specifically, we discuss major services that AI can provide in the context of COVID-19 pandemic based on six grounds, (i) AI role in seven different epidemic containment strategies (a.k.a non-pharmaceutical interventions (NPIs)), (ii) AI role in data life cycle phases employed to control pandemic via digital solutions, (iii) AI role in performing analytics on heterogeneous types of data stemming from the COVID-19 pandemic, (iv) AI role in the healthcare sector in the context of COVID-19 pandemic, (v) general-purpose applications of AI in COVID-19 era, and (vi) AI role in drug design and repurposing (e.g., iteratively aligning protein spikes and applying three/four-fold symmetry to yield a low-resolution candidate template) against COVID-19. Further, we discuss the challenges involved in applying AI to the available data and privacy issues that can arise from personal data transitioning into cyberspace. We also provide a concise overview of other latest technologies that were increasingly applied to limit the spread of the ongoing pandemic. Finally, we discuss the avenues of future research in the respective area. This insightful review aims to highlight existing AI-based technological developments and future research dynamics in this area.
Collapse
|
13
|
Forecasting the Long-Term Trends of Coronavirus Disease 2019 (COVID-19) Epidemic Using the Susceptible-Infectious-Recovered (SIR) Model. Infect Dis Rep 2021; 13:668-684. [PMID: 34449629 PMCID: PMC8395750 DOI: 10.3390/idr13030063] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/22/2021] [Revised: 07/20/2021] [Accepted: 07/21/2021] [Indexed: 12/16/2022] Open
Abstract
A simple model for predicting Coronavirus Disease 2019 (COVID-19) epidemic is presented in this study. The prediction model is presented based on the classic Susceptible-Infectious-Recovered (SIR) model, which has been widely used to describe the epidemic time evolution of infectious diseases. The original version of the Kermack and McKendrick model is used in this study. This included the daily rates of infection spread by infected individuals when these individuals interact with a susceptible population, which is denoted by the parameter β, while the recovery rates to determine the number of recovered individuals is expressed by the parameter γ. The parameters estimation of the three-compartment SIR model is determined through using a mathematical sequential reduction process from the logistic growth model equation. As the parameters are the basic characteristics of epidemic time evolution, the model is always tested and applied to the latest actual data of confirmed COVID-19 cases. It seems that this simple model is still reliable enough to describe the dynamics of the COVID-19 epidemic, not only qualitatively but also quantitatively with a high degree of correlation between actual data and prediction results. Therefore, it is possible to apply this model to predict cases of COVID-19 in several countries. In addition, the parameter characteristics of the classic SIR model can provide information on how these parameters reflect the efforts by each country to prevent the spread of the COVID-19 outbreak. This is clearly seen from the changes of the parameters shown by the classic SIR model.
Collapse
|
14
|
Gualano B, Brito GM, Pinto AJ, Lemes IR, Matos LDNJ, de Sá Pinto AL, Loturco I. High SARS-CoV-2 infection rate after resuming professional football in São Paulo, Brazil. Br J Sports Med 2021; 56:bjsports-2021-104431. [PMID: 34226184 PMCID: PMC8260282 DOI: 10.1136/bjsports-2021-104431] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/22/2021] [Indexed: 11/08/2022]
Abstract
OBJECTIVE To examine the SARS-CoV-2 infection rate in a cohort of 6500 professional athletes and staff during the 2020 football (soccer) season in São Paulo, Brazil. METHODS This retrospective cohort study included 4269 players (87% male, age: 21.7±4.2 years) and 2231 staff (87% male, age: 42.6±11.9 years) from 122 teams (women: n=16) involved in eight leagues (women: n=2), which took place in São Paulo, Brazil. Between 4 July 2020 and 21 December 2020, swab samples were collected weekly (n=29 507) and tested for SARS-Cov-2 via reverse transcription-PCR by an accredited laboratory commissioned by the São Paulo Football Federation. We contacted the medical staff of each team with positive cases to collect information on disease severity. RESULTS Among 662 PCR-confirmed cases, 501 were athletes and 161 were staff. The new infection rate was 11.7% and 7.2% for athletes and staff, respectively. Athletes were more susceptible to infection than staff (OR: 1.71, 95% CI: 1.42, 2.06, p<0.001), although with lower chance for moderate to severe disease (OR: 0.06, 95% CI: 0.01, 0.54, p=0.012). Six teams had ≥20 individuals testing positive for SARS-CoV-2, whereas 19 teams had ≥10 confirmed cases. Twenty-five mass outbreaks were identified (≥5 infections within a team in a 2-week period). The prevalence of SARS-CoV-2 infections was similar in athletes and staff as the general population in São Paulo. CONCLUSION Despite weekly testing and other preventive measures, we found a high SARS-CoV-2 infection rate in athletes and staff after resuming football, which coincides with the high prevalence of infection in the community during the same period. These data may assist policy-makers and sports federations for determining if and when it is safe to resume competitions.
Collapse
Affiliation(s)
- Bruno Gualano
- Applied Physiology & Nutrition Research Group; Laboratory of Assessment and Conditioning in Rheumatology; Faculdade de Medicina FMUSP, Universidade de São Paulo, São Paulo, Brazil
| | - Gisele Mendes Brito
- Applied Physiology & Nutrition Research Group; Laboratory of Assessment and Conditioning in Rheumatology; Faculdade de Medicina FMUSP, Universidade de São Paulo, São Paulo, Brazil
| | - Ana Jéssica Pinto
- Applied Physiology & Nutrition Research Group; Laboratory of Assessment and Conditioning in Rheumatology; Faculdade de Medicina FMUSP, Universidade de São Paulo, São Paulo, Brazil
| | - Italo Ribeiro Lemes
- Applied Physiology & Nutrition Research Group; Laboratory of Assessment and Conditioning in Rheumatology; Faculdade de Medicina FMUSP, Universidade de São Paulo, São Paulo, Brazil
| | | | - Ana Lúcia de Sá Pinto
- Applied Physiology & Nutrition Research Group; Laboratory of Assessment and Conditioning in Rheumatology; Faculdade de Medicina FMUSP, Universidade de São Paulo, São Paulo, Brazil
| | - Irineu Loturco
- Nucleus of High Performance in Sport - São Paulo, São Paulo, Brazil
- Universidade Federal de São Paulo, São Paulo, Brazil
| |
Collapse
|
15
|
Ferrari L, Gerardi G, Manzi G, Micheletti A, Nicolussi F, Biganzoli E, Salini S. Modeling Provincial Covid-19 Epidemic Data Using an Adjusted Time-Dependent SIRD Model. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021; 18:6563. [PMID: 34207174 PMCID: PMC8296340 DOI: 10.3390/ijerph18126563] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/17/2021] [Revised: 06/12/2021] [Accepted: 06/14/2021] [Indexed: 12/27/2022]
Abstract
In this paper, we develop a forecasting model for the spread of COVID-19 infection at a provincial (i.e., EU NUTS-3) level in Italy by using official data from the Italian Ministry of Health integrated with data extracted from daily official press conferences of regional authorities and local newspaper websites. This data integration is needed as COVID-19 death data are not available at the NUTS-3 level from official open data channels. An adjusted time-dependent SIRD model is used to predict the behavior of the epidemic; specifically, the number of susceptible, infected, deceased, recovered people and epidemiological parameters. Predictive model performance is evaluated using comparison with real data.
Collapse
Affiliation(s)
- Luisa Ferrari
- Department of Statistical Science, University College London, London WC1E 6BT, UK;
| | - Giuseppe Gerardi
- Department of Economics, Management and Quantitative Methods, University of Milan, 20122 Milan, Italy;
| | - Giancarlo Manzi
- Department of Economics, Management and Quantitative Methods and Data Science Research Center, University of Milan, 20122 Milan, Italy; (F.N.); (S.S.)
| | - Alessandra Micheletti
- Department of Environmental Science and Policy and Data Science Research Center, University of Milan, 20122 Milan, Italy;
| | - Federica Nicolussi
- Department of Economics, Management and Quantitative Methods and Data Science Research Center, University of Milan, 20122 Milan, Italy; (F.N.); (S.S.)
| | - Elia Biganzoli
- Department of Clinical Sciences and Community Health and Data Science Research Center, University of Milan, 20122 Milan, Italy;
| | - Silvia Salini
- Department of Economics, Management and Quantitative Methods and Data Science Research Center, University of Milan, 20122 Milan, Italy; (F.N.); (S.S.)
| |
Collapse
|
16
|
Martin-Barreiro C, Ramirez-Figueroa JA, Cabezas X, Leiva V, Galindo-Villardón MP. Disjoint and Functional Principal Component Analysis for Infected Cases and Deaths Due to COVID-19 in South American Countries with Sensor-Related Data. SENSORS (BASEL, SWITZERLAND) 2021; 21:4094. [PMID: 34198627 PMCID: PMC8232170 DOI: 10.3390/s21124094] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/18/2021] [Revised: 06/07/2021] [Accepted: 06/11/2021] [Indexed: 02/06/2023]
Abstract
In this paper, we group South American countries based on the number of infected cases and deaths due to COVID-19. The countries considered are: Argentina, Bolivia, Brazil, Chile, Colombia, Ecuador, Peru, Paraguay, Uruguay, and Venezuela. The data used are collected from a database of Johns Hopkins University, an institution that is dedicated to sensing and monitoring the evolution of the COVID-19 pandemic. A statistical analysis, based on principal components with modern and recent techniques, is conducted. Initially, utilizing the correlation matrix, standard components and varimax rotations are calculated. Then, by using disjoint components and functional components, the countries are grouped. An algorithm that allows us to keep the principal component analysis updated with a sensor in the data warehouse is designed. As reported in the conclusions, this grouping changes depending on the number of components considered, the type of principal component (standard, disjoint or functional) and the variable to be considered (infected cases or deaths). The results obtained are compared to the k-means technique. The COVID-19 cases and their deaths vary in the different countries due to diverse reasons, as reported in the conclusions.
Collapse
Affiliation(s)
- Carlos Martin-Barreiro
- Department of Statistics, Universidad de Salamanca, 37008 Salamanca, Spain; (C.M.-B.); (J.A.R.-F.); (M.P.G.-V.)
- Faculty of Natural Sciences and Mathematics, Universidad Politécnica ESPOL, Guayaquil 090902, Ecuador;
| | - John A. Ramirez-Figueroa
- Department of Statistics, Universidad de Salamanca, 37008 Salamanca, Spain; (C.M.-B.); (J.A.R.-F.); (M.P.G.-V.)
- Faculty of Natural Sciences and Mathematics, Universidad Politécnica ESPOL, Guayaquil 090902, Ecuador;
| | - Xavier Cabezas
- Faculty of Natural Sciences and Mathematics, Universidad Politécnica ESPOL, Guayaquil 090902, Ecuador;
| | - Víctor Leiva
- School of Industrial Engineering, Pontificia Universidad Católica de Valparaíso, Valparaíso 2362807, Chile
| | | |
Collapse
|
17
|
Campos EL, Cysne RP, Madureira AL, Mendes GL. Multi-generational SIR modeling: Determination of parameters, epidemiological forecasting and age-dependent vaccination policies. Infect Dis Model 2021; 6:751-765. [PMID: 34127952 PMCID: PMC8189834 DOI: 10.1016/j.idm.2021.05.003] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2021] [Revised: 05/08/2021] [Accepted: 05/18/2021] [Indexed: 02/08/2023] Open
Abstract
We use an age-dependent SIR system of equations to model the evolution of the COVID-19. Parameters that measure the amount of interaction in different locations (home, work, school, other) are approximated from in-sample data using a random optimization scheme, and indicate changes in social distancing along the course of the pandemic. That allows the estimation of the time evolution of classical and age-dependent reproduction numbers. With those parameters we predict the disease dynamics, and compare our results with out-of-sample data from the City of Rio de Janeiro. Finally, we provide a numerical investigation regarding age-based vaccination policies, shedding some light on whether is preferable to vaccinate those at most risk (the elderly) or those who spread the disease the most (the youngest). There is no clear upshot, as the results depend on the age of those immunized, contagious parameters, vaccination schedules and efficiency.
Collapse
Affiliation(s)
- Eduardo Lima Campos
- EPGE Brazilian School of Economics and Finance (FGV EPGE), Rio de Janeiro, RJ, Brazil
- ENCE - Escola Nacional de Ciências Estatísticas (ENCE/IBGE), Rio de Janeiro, RJ, Brazil
| | - Rubens Penha Cysne
- EPGE Brazilian School of Economics and Finance (FGV EPGE), Rio de Janeiro, RJ, Brazil
| | - Alexandre L. Madureira
- EPGE Brazilian School of Economics and Finance (FGV EPGE), Rio de Janeiro, RJ, Brazil
- Laboratório Nacional de Computação Científica, Petrópolis, RJ, Brazil
| | - Gélcio L.Q. Mendes
- INCA - Brazilian National Cancer Institute, Coordination of Assistance, Rio de Janeiro, RJ, Brazil
| |
Collapse
|