1
|
Yin Y, Ahmadianfar I, Karim FK, Elmannai H. Advanced forecasting of COVID-19 epidemic: Leveraging ensemble models, advanced optimization, and decomposition techniques. Comput Biol Med 2024; 175:108442. [PMID: 38678939 DOI: 10.1016/j.compbiomed.2024.108442] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2024] [Revised: 03/25/2024] [Accepted: 04/07/2024] [Indexed: 05/01/2024]
Abstract
In the global effort to address the outbreak of the new coronavirus pneumonia (COVID-19) pandemic, accurate forecasting of epidemic patterns has become crucial for implementing successful interventions aimed at preventing and controlling the spread of the disease. The correct prediction of the course of COVID-19 outbreaks is a complex and challenging task, mainly because of the significant volatility in the data series related to COVID-19. Previous studies have been limited by the exclusive use of individual forecasting techniques in epidemic modeling, disregarding the integration of diverse prediction procedures. The lack of attention to detail in this situation can yield worse-than-ideal results. Consequently, this study introduces a novel ensemble framework that integrates three machine learning methods (kernel ridge regression (KRidge), Deep random vector functional link (dRVFL), and ridge regression) within a linear relationship (L-KRidge-dRVFL-Ridge). The optimization of this framework is accomplished through a distinctive approach, specifically adaptive differential evolution and particle swarm optimization (A-DEPSO). Moreover, an effective decomposition method, known as time-varying filter empirical mode decomposition (TVF-EMD), is employed to decompose the input variables. A feature selection technique, specifically using the light gradient boosting machine (LGBM), is also implemented to extract the most influential input variables. The daily datasets of COVID-19 collected from two countries, namely Italy and Poland, were used as the experimental examples. Additionally, all models are implemented to forecast COVID-19 at two-time horizons: 10- and 14-day ahead (t+10 and t+14). According to the results, the proposed model can yield higher correlation coefficient (R) for both case studies: Italy (t+10 = 0.965, t+14 = 0.961) and Poland (t+10 = 0.952, t+14 = 0.940) than the other models. The experimental results demonstrate that the model suggested in this paper has outstanding results in various kinds of complex epidemic prediction situations. The proposed ensemble model demonstrates exceptional accuracy and resilience, outperforming all similar models in terms of efficacy.
Collapse
Affiliation(s)
- Yingyu Yin
- School of Computer Science, Guangdong Polytechnic Normal University, Guangzhou, 510665, China.
| | - Iman Ahmadianfar
- Information and Communication Technology Research Group, Scientific Research Center, Al-Ayen University, Thi-Qar, Nasiriyah, 64001, Iraq.
| | - Faten Khalid Karim
- Department of Computer Sciences, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, P.O.BOX 84428, Riyadh 11671, Saudi Arabia.
| | - Hela Elmannai
- Department of Information Technology, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, P.O.BOX 84428, Riyadh 11671, Saudi Arabia.
| |
Collapse
|
2
|
Fonseca AU, Felix JP, Pinheiro H, Vieira GS, Mourão ÝC, Monteiro JCG, Soares F. An Intelligent System to Improve Diagnostic Support for Oral Squamous Cell Carcinoma. Healthcare (Basel) 2023; 11:2675. [PMID: 37830712 PMCID: PMC10572543 DOI: 10.3390/healthcare11192675] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2023] [Revised: 09/12/2023] [Accepted: 09/25/2023] [Indexed: 10/14/2023] Open
Abstract
Oral squamous cell carcinoma (OSCC) is one of the most-prevalent cancer types worldwide, and it poses a serious threat to public health due to its high mortality and morbidity rates. OSCC typically has a poor prognosis, significantly reducing the chances of patient survival. Therefore, early detection is crucial to achieving a favorable prognosis by providing prompt treatment and increasing the chances of remission. Salivary biomarkers have been established in numerous studies to be a trustworthy and non-invasive alternative for early cancer detection. In this sense, we propose an intelligent system that utilizes feed-forward artificial neural networks to classify carcinoma with salivary biomarkers extracted from control and OSCC patient samples. We conducted experiments using various salivary biomarkers, ranging from 1 to 51, to train the model, and we achieved excellent results with precision, sensitivity, and specificity values of 98.53%, 96.30%, and 97.56%, respectively. Our system effectively classified the initial cases of OSCC with different amounts of biomarkers, aiding medical professionals in decision-making and providing a more-accurate diagnosis. This could contribute to a higher chance of treatment success and patient survival. Furthermore, the minimalist configuration of our model presents the potential for incorporation into resource-limited devices or environments.
Collapse
Affiliation(s)
- Afonso U. Fonseca
- Institute of Informatics, Federal University of Goiás, Goiânia 74690-900, GO, Brazil; (J.P.F.); (H.P.); (G.S.V.); (F.S.)
| | - Juliana P. Felix
- Institute of Informatics, Federal University of Goiás, Goiânia 74690-900, GO, Brazil; (J.P.F.); (H.P.); (G.S.V.); (F.S.)
| | - Hedenir Pinheiro
- Institute of Informatics, Federal University of Goiás, Goiânia 74690-900, GO, Brazil; (J.P.F.); (H.P.); (G.S.V.); (F.S.)
| | - Gabriel S. Vieira
- Institute of Informatics, Federal University of Goiás, Goiânia 74690-900, GO, Brazil; (J.P.F.); (H.P.); (G.S.V.); (F.S.)
- Federal Institute Goiano, Computer Vision Lab, Urutaí 75790-000, GO, Brazil
| | | | | | - Fabrizzio Soares
- Institute of Informatics, Federal University of Goiás, Goiânia 74690-900, GO, Brazil; (J.P.F.); (H.P.); (G.S.V.); (F.S.)
| |
Collapse
|
3
|
Xu H, Buckeridge DL, Wang F, Tarczy-Hornoch P. Novel informatics approaches to COVID-19 Research: From methods to applications. J Biomed Inform 2022; 129:104028. [PMID: 35181495 PMCID: PMC8847074 DOI: 10.1016/j.jbi.2022.104028] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Accepted: 02/10/2022] [Indexed: 10/30/2022]
Affiliation(s)
- Hua Xu
- School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - David L Buckeridge
- Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montreal, Quebec, Canada
| | - Fei Wang
- Department of Population Health Sciences, Cornell University, New York, NY, USA
| | - Peter Tarczy-Hornoch
- Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA, USA
| |
Collapse
|
4
|
Abstract
The COVID-19 pandemic spread rapidly around the world and is currently one of the most leading causes of death and heath disaster in the world. Turkey, like most of the countries, has been negatively affected by COVID-19. The aim of this study is to design a predictive model based on artificial neural network (ANN) model to predict the future number of daily cases and deaths caused by COVID-19 in a generalized way to fit different countries’ spreads. In this study, we used a dataset between 11 March 2020 and 23 January 2021 for different countries. This study provides an ANN model to assist the government to take preventive action for hospitals and medical facilities. The results show that there is an 86% overall accuracy in predicting the mortality rate and 87% in predicting the number of cases.
Collapse
|
5
|
Hernández-Pereira E, Fontenla-Romero O, Bolón-Canedo V, Cancela-Barizo B, Guijarro-Berdiñas B, Alonso-Betanzos A. Machine learning techniques to predict different levels of hospital care of CoVid-19. APPL INTELL 2021; 52:6413-6431. [PMID: 34764619 PMCID: PMC8429889 DOI: 10.1007/s10489-021-02743-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/03/2021] [Indexed: 11/25/2022]
Abstract
In this study, we analyze the capability of several state of the art machine learning methods to predict whether patients diagnosed with CoVid-19 (CoronaVirus disease 2019) will need different levels of hospital care assistance (regular hospital admission or intensive care unit admission), during the course of their illness, using only demographic and clinical data. For this research, a data set of 10,454 patients from 14 hospitals in Galicia (Spain) was used. Each patient is characterized by 833 variables, two of which are age and gender and the other are records of diseases or conditions in their medical history. In addition, for each patient, his/her history of hospital or intensive care unit (ICU) admissions due to CoVid-19 is available. This clinical history will serve to label each patient and thus being able to assess the predictions of the model. Our aim is to identify which model delivers the best accuracies for both hospital and ICU admissions only using demographic variables and some structured clinical data, as well as identifying which of those are more relevant in both cases. The results obtained in the experimental study show that the best models are those based on oversampling as a preprocessing phase to balance the distribution of classes. Using these models and all the available features, we achieved an area under the curve (AUC) of 76.1% and 80.4% for predicting the need of hospital and ICU admissions, respectively. Furthermore, feature selection and oversampling techniques were applied and it has been experimentally verified that the relevant variables for the classification are age and gender, since only using these two features the performance of the models is not degraded for the two mentioned prediction problems.
Collapse
Affiliation(s)
- Elena Hernández-Pereira
- Universidade da Coruña. CITIC Research and Development Laboratory in Artificial Intelligence (LIDIA) Facultad de informática, Campus de Elviña s/n. A, Coruña, Spain
| | - Oscar Fontenla-Romero
- Universidade da Coruña. CITIC Research and Development Laboratory in Artificial Intelligence (LIDIA) Facultad de informática, Campus de Elviña s/n. A, Coruña, Spain
| | - Verónica Bolón-Canedo
- Universidade da Coruña. CITIC Research and Development Laboratory in Artificial Intelligence (LIDIA) Facultad de informática, Campus de Elviña s/n. A, Coruña, Spain
| | - Brais Cancela-Barizo
- Universidade da Coruña. CITIC Research and Development Laboratory in Artificial Intelligence (LIDIA) Facultad de informática, Campus de Elviña s/n. A, Coruña, Spain
| | - Bertha Guijarro-Berdiñas
- Universidade da Coruña. CITIC Research and Development Laboratory in Artificial Intelligence (LIDIA) Facultad de informática, Campus de Elviña s/n. A, Coruña, Spain
| | - Amparo Alonso-Betanzos
- Universidade da Coruña. CITIC Research and Development Laboratory in Artificial Intelligence (LIDIA) Facultad de informática, Campus de Elviña s/n. A, Coruña, Spain
| |
Collapse
|
6
|
Huang T, Chu Y, Shams S, Kim Y, Annapragada AV, Subramanian D, Kakadiaris I, Gottlieb A, Jiang X. Population stratification enables modeling effects of reopening policies on mortality and hospitalization rates. J Biomed Inform 2021; 119:103818. [PMID: 34022420 DOI: 10.1016/j.jbi.2021.103818] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2020] [Revised: 05/04/2021] [Accepted: 05/17/2021] [Indexed: 12/23/2022]
Abstract
OBJECTIVE Study the impact of local policies on near-future hospitalization and mortality rates. MATERIALS AND METHODS We introduce a novel risk-stratified SIR-HCD model that introduces new variables to model the dynamics of low-contact (e.g., work from home) and high-contact (e.g., work on-site) subpopulations while sharing parameters to control their respective R0(t) over time. We test our model on data of daily reported hospitalizations and cumulative mortality of COVID-19 in Harris County, Texas, from May 1, 2020, until October 4, 2020, collected from multiple sources (USA FACTS, U.S. Bureau of Labor Statistics, Southeast Texas Regional Advisory Council COVID-19 report, TMC daily news, and Johns Hopkins University county-level mortality reporting). RESULTS We evaluated our model's forecasting accuracy in Harris County, TX (the most populated county in the Greater Houston area) during Phase-I and Phase-II reopening. Not only does our model outperform other competing models, but it also supports counterfactual analysis to simulate the impact of future policies in a local setting, which is unique among existing approaches. DISCUSSION Mortality and hospitalization rates are significantly impacted by local quarantine and reopening policies. Existing models do not directly account for the effect of these policies on infection, hospitalization, and death rates in an explicit and explainable manner. Our work is an attempt to improve prediction of these trends by incorporating this information into the model, thus supporting decision-making. CONCLUSION Our work is a timely effort to attempt to model the dynamics of pandemics under the influence of local policies.
Collapse
Affiliation(s)
- Tongtong Huang
- School of Biomedical Informatics, UTHealth, Houston, TX, United States.
| | - Yan Chu
- School of Biomedical Informatics, UTHealth, Houston, TX, United States
| | - Shayan Shams
- School of Biomedical Informatics, UTHealth, Houston, TX, United States
| | - Yejin Kim
- School of Biomedical Informatics, UTHealth, Houston, TX, United States
| | - Ananth V Annapragada
- Department of Pediatric Radiology, Texas Children's Hospital, Houston, TX, United States
| | - Devika Subramanian
- Department of Computer Science & Electrical and Computer Engineering, Rice University, Houston, TX, United States
| | - Ioannis Kakadiaris
- Department of Computer Science, Electrical & Computer Engineering, and Biomedical Engineering University of Houston, Houston, TX, United States
| | - Assaf Gottlieb
- School of Biomedical Informatics, UTHealth, Houston, TX, United States
| | - Xiaoqian Jiang
- School of Biomedical Informatics, UTHealth, Houston, TX, United States
| |
Collapse
|