51
|
Fang L, Liang X. ISW-LM: An intensive symptom weight learning mechanism for early COVID-19 diagnosis. Comput Biol Med 2022; 146:105615. [PMID: 35605484 PMCID: PMC9112616 DOI: 10.1016/j.compbiomed.2022.105615] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2022] [Revised: 05/09/2022] [Accepted: 05/11/2022] [Indexed: 12/16/2022]
Abstract
The novel coronavirus disease 2019 (COVID-19) pandemic has severely impacted the world. The early diagnosis of COVID-19 and self-isolation can help curb the spread of the virus. Besides, a simple and accurate diagnostic method can help in making rapid decisions for the treatment and isolation of patients. The analysis of patient characteristics, case trajectory, comorbidities, symptoms, diagnosis, and outcomes will be performed in the model. In this paper, a symptom-based machine learning (ML) model with a new learning mechanism called Intensive Symptom Weight Learning Mechanism (ISW-LM) is proposed. The proposed model designs three new symptoms' weight functions to identify the most relevant symptoms used to diagnose and classify COVID-19. To verify the efficiency of the proposed model, multiple laboratory and clinical datasets containing epidemiological symptoms and blood tests are used. Experiments indicate that the importance of COVID-19 infection symptoms varies between countries and regions. In most datasets, the most frequent and significant predictive symptoms for diagnosing COVID-19 are fever, sore throat, and cough. The experiment also compares the state-of-the-art methods with the proposed method, which shows that the proposed model has a high accuracy rate of up to 97.1711%. The positive results indicate that the proposed learning mechanism can help clinicians quickly diagnose and screen patients for COVID-19 at an early stage.
Collapse
Affiliation(s)
- Lingling Fang
- Department of Computing and Information Technology, Liaoning Normal University, Dalian City, Liaoning Province, China.
| | - Xiyue Liang
- Department of Computing and Information Technology, Liaoning Normal University, Dalian City, Liaoning Province, China
| |
Collapse
|
52
|
Syed AH, Khan T, Alromema N. A Hybrid Feature Selection Approach to Screen a Novel Set of Blood Biomarkers for Early COVID-19 Mortality Prediction. Diagnostics (Basel) 2022; 12:1604. [PMID: 35885508 PMCID: PMC9316550 DOI: 10.3390/diagnostics12071604] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2022] [Revised: 06/22/2022] [Accepted: 06/29/2022] [Indexed: 11/16/2022] Open
Abstract
The increase in coronavirus disease 2019 (COVID-19) infection caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has placed pressure on healthcare services worldwide. Therefore, it is crucial to identify critical factors for the assessment of the severity of COVID-19 infection and the optimization of an individual treatment strategy. In this regard, the present study leverages a dataset of blood samples from 485 COVID-19 individuals in the region of Wuhan, China to identify essential blood biomarkers that predict the mortality of COVID-19 individuals. For this purpose, a hybrid of filter, statistical, and heuristic-based feature selection approach was used to select the best subset of informative features. As a result, minimum redundancy maximum relevance (mRMR), a two-tailed unpaired t-test, and whale optimization algorithm (WOA) were eventually selected as the three most informative blood biomarkers: International normalized ratio (INR), platelet large cell ratio (P-LCR), and D-dimer. In addition, various machine learning (ML) algorithms (random forest (RF), support vector machine (SVM), extreme gradient boosting (EGB), naïve Bayes (NB), logistic regression (LR), and k-nearest neighbor (KNN)) were trained. The performance of the trained models was compared to determine the model that assist in predicting the mortality of COVID-19 individuals with higher accuracy, F1 score, and area under the curve (AUC) values. In this paper, the best performing RF-based model built using the three most informative blood parameters predicts the mortality of COVID-19 individuals with an accuracy of 0.96 ± 0.062, F1 score of 0.96 ± 0.099, and AUC value of 0.98 ± 0.024, respectively on the independent test data. Furthermore, the performance of our proposed RF-based model in terms of accuracy, F1 score, and AUC was significantly better than the known blood biomarkers-based ML models built using the Pre_Surv_COVID_19 data. Therefore, the present study provides a novel hybrid approach to screen the most informative blood biomarkers to develop an RF-based model, which accurately and reliably predicts in-hospital mortality of confirmed COVID-19 individuals, during surge periods. An application based on our proposed model was implemented and deployed at Heroku.
Collapse
Affiliation(s)
- Asif Hassan Syed
- Department of Computer Science, Faculty of Computing and Information Technology Rabigh (FCITR), King Abdulaziz University, Jeddah 22254, Saudi Arabia;
| | - Tabrej Khan
- Department of Information Systems, Faculty of Computing and Information Technology Rabigh (FCITR), King Abdulaziz University, Jeddah 22254, Saudi Arabia;
| | - Nashwan Alromema
- Department of Computer Science, Faculty of Computing and Information Technology Rabigh (FCITR), King Abdulaziz University, Jeddah 22254, Saudi Arabia;
| |
Collapse
|
53
|
Ramírez-del Real T, Martínez-García M, Márquez MF, López-Trejo L, Gutiérrez-Esparza G, Hernández-Lemus E. Individual Factors Associated With COVID-19 Infection: A Machine Learning Study. Front Public Health 2022; 10:912099. [PMID: 35844896 PMCID: PMC9279686 DOI: 10.3389/fpubh.2022.912099] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2022] [Accepted: 05/24/2022] [Indexed: 11/13/2022] Open
Abstract
The fast, exponential increase of COVID-19 infections and their catastrophic effects on patients' health have required the development of tools that support health systems in the quick and efficient diagnosis and prognosis of this disease. In this context, the present study aims to identify the potential factors associated with COVID-19 infections, applying machine learning techniques, particularly random forest, chi-squared, xgboost, and rpart for feature selection; ROSE and SMOTE were used as resampling methods due to the existence of class imbalance. Similarly, machine and deep learning algorithms such as support vector machines, C4.5, random forest, rpart, and deep neural networks were explored during the train/test phase to select the best prediction model. The dataset used in this study contains clinical data, anthropometric measurements, and other health parameters related to smoking habits, alcohol consumption, quality of sleep, physical activity, and health status during confinement due to the pandemic associated with COVID-19. The results showed that the XGBoost model got the best features associated with COVID-19 infection, and random forest approximated the best predictive model with a balanced accuracy of 90.41% using SMOTE as a resampling technique. The model with the best performance provides a tool to help prevent contracting SARS-CoV-2 since the variables with the highest risk factor are detected, and some of them are, to a certain extent controllable.
Collapse
Affiliation(s)
- Tania Ramírez-del Real
- Cátedras Conacyt, National Council on Science and Technology, Mexico City, Mexico
- Center for Research in Geospatial Information Sciences, Mexico City, Mexico
| | - Mireya Martínez-García
- Clinical Research Division, National Institute of Cardiology “Ignacio Chávez”, Mexico City, Mexico
| | - Manlio F. Márquez
- Clinical Research Division, National Institute of Cardiology “Ignacio Chávez”, Mexico City, Mexico
| | - Laura López-Trejo
- Institute for Security and Social Services of State Workers, Mexico City, Mexico
| | - Guadalupe Gutiérrez-Esparza
- Cátedras Conacyt, National Council on Science and Technology, Mexico City, Mexico
- Clinical Research Division, National Institute of Cardiology “Ignacio Chávez”, Mexico City, Mexico
| | - Enrique Hernández-Lemus
- Computational Genomics Division, National Institute of Genomic Medicine, Mexico City, Mexico
- Center for Complexity Sciences, Universidad Nacional Autónoma de México, Mexico City, Mexico
| |
Collapse
|
54
|
Jiang G, Wu J, Weidhaas J, Li X, Chen Y, Mueller J, Li J, Kumar M, Zhou X, Arora S, Haramoto E, Sherchan S, Orive G, Lertxundi U, Honda R, Kitajima M, Jackson G. Artificial neural network-based estimation of COVID-19 case numbers and effective reproduction rate using wastewater-based epidemiology. WATER RESEARCH 2022; 218:118451. [PMID: 35447417 PMCID: PMC9006161 DOI: 10.1016/j.watres.2022.118451] [Citation(s) in RCA: 43] [Impact Index Per Article: 21.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/02/2022] [Revised: 04/02/2022] [Accepted: 04/10/2022] [Indexed: 05/06/2023]
Abstract
As a cost-effective and objective population-wide surveillance tool, wastewater-based epidemiology (WBE) has been widely implemented worldwide to monitor the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) RNA concentration in wastewater. However, viral concentrations or loads in wastewater often correlate poorly with clinical case numbers. To date, there is no reliable method to back-estimate the coronavirus disease 2019 (COVID-19) case numbers from SARS-CoV-2 concentrations in wastewater. This greatly limits WBE in achieving its full potential in monitoring the unfolding pandemic. The exponentially growing SARS-CoV-2 WBE dataset, on the other hand, offers an opportunity to develop data-driven models for the estimation of COVID-19 case numbers (both incidence and prevalence) and transmission dynamics (effective reproduction rate). This study developed artificial neural network (ANN) models by innovatively expanding a conventional WBE dataset to include catchment, weather, clinical testing coverage and vaccination rate. The ANN models were trained and evaluated with a comprehensive state-wide wastewater monitoring dataset from Utah, USA during May 2020 to December 2021. In diverse sewer catchments, ANN models were found to accurately estimate the COVID-19 prevalence and incidence rates, with excellent precision for prevalence rates. Also, an ANN model was developed to estimate the effective reproduction number from both wastewater data and other pertinent factors affecting viral transmission and pandemic dynamics. The established ANN model was successfully validated for its transferability to other states or countries using the WBE dataset from Wisconsin, USA.
Collapse
Affiliation(s)
- Guangming Jiang
- School of Civil, Mining and Environmental Engineering, University of Wollongong, Australia; Illawarra Health and Medical Research Institute (IHMRI), University of Wollongong, Wollongong, Australia.
| | - Jiangping Wu
- School of Civil, Mining and Environmental Engineering, University of Wollongong, Australia
| | - Jennifer Weidhaas
- University of Utah, Civil and Environmental Engineering, 110 Central Campus Drive, Suite 2000, Salt Lake City, UT, USA
| | - Xuan Li
- School of Civil, Mining and Environmental Engineering, University of Wollongong, Australia
| | - Yan Chen
- School of Civil, Mining and Environmental Engineering, University of Wollongong, Australia
| | - Jochen Mueller
- Queensland Alliance for Environmental Health Sciences, The University of Queensland, Australia
| | - Jiaying Li
- Queensland Alliance for Environmental Health Sciences, The University of Queensland, Australia
| | - Manish Kumar
- Sustainability Cluster, School of Engineering, University of Petroleum & Energy Studies, Dehradun, Uttarakhand 248007, India
| | - Xu Zhou
- Shenzhen Engineering Laboratory of Microalgal Bioenergy, Harbin Institute of Technology (Shenzhen), Shenzhen 518055, China; State Key Laboratory of Urban Water Resource and Environment, School of Environment, Harbin Institute of Technology, Harbin 150090, China
| | - Sudipti Arora
- Dr. B. Lal Institute of Biotechnology, Jaipur, India
| | - Eiji Haramoto
- Interdisciplinary Center for River Basin Environment, University of Yamanashi, Kofu, Japan
| | - Samendra Sherchan
- Department of Environmental Health Sciences, Tulane University, New Orleans, LA, USA
| | - Gorka Orive
- NanoBioCel Group, Laboratory of Pharmaceutics, School of Pharmacy, University of the Basque Country UPV/EHU, Paseo de la Universidad 7, Vitoria-Gasteiz 01006, Spain; Biomedical Research Networking Centre in Bioengineering, Biomaterials and Nanomedicine (CIBER-BBN), Vitoria-Gasteiz, Spain
| | - Unax Lertxundi
- NanoBioCel Group, Laboratory of Pharmaceutics, School of Pharmacy, University of the Basque Country UPV/EHU, Paseo de la Universidad 7, Vitoria-Gasteiz 01006, Spain; Biomedical Research Networking Centre in Bioengineering, Biomaterials and Nanomedicine (CIBER-BBN), Vitoria-Gasteiz, Spain
| | - Ryo Honda
- Faculty of Geosciences and Civil Engineering, Kanazawa University, Kanazawa 920-1192, Japan
| | - Masaaki Kitajima
- Division of Environmental Engineering, Hokkaido University, Hokkaido 060-8628, Japan
| | - Greg Jackson
- Queensland Alliance for Environmental Health Sciences (QAEHS), The University of Queensland, 4102, Brisbane, Australia
| |
Collapse
|
55
|
Raihan M, Hassan MM, Hasan T, Bulbul AAM, Hasan MK, Hossain MS, Roy DS, Awal MA. Development of a Smartphone-Based Expert System for COVID-19 Risk Prediction at Early Stage. Bioengineering (Basel) 2022; 9:bioengineering9070281. [PMID: 35877332 PMCID: PMC9311761 DOI: 10.3390/bioengineering9070281] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2022] [Revised: 06/15/2022] [Accepted: 06/23/2022] [Indexed: 01/03/2023] Open
Abstract
COVID-19 has imposed many challenges and barriers on traditional healthcare systems due to the high risk of being infected by the coronavirus. Modern electronic devices like smartphones with information technology can play an essential role in handling the current pandemic by contributing to different telemedical services. This study has focused on determining the presence of this virus by employing smartphone technology, as it is available to a large number of people. A publicly available COVID-19 dataset consisting of 33 features has been utilized to develop the aimed model, which can be collected from an in-house facility. The chosen dataset has 2.82% positive and 97.18% negative samples, demonstrating a high imbalance of class populations. The Adaptive Synthetic (ADASYN) has been applied to overcome the class imbalance problem with imbalanced data. Ten optimal features are chosen from the given 33 features, employing two different feature selection algorithms, such as K Best and recursive feature elimination methods. Mainly, three classification schemes, Random Forest (RF), eXtreme Gradient Boosting (XGB), and Support Vector Machine (SVM), have been applied for the ablation studies, where the accuracy from the XGB, RF, and SVM classifiers achieved 97.91%, 97.81%, and 73.37%, respectively. As the XGB algorithm confers the best results, it has been implemented in designing the Android operating system base and web applications. By analyzing 10 users’ questionnaires, the developed expert system can predict the presence of COVID-19 in the human body of the primary suspect. The preprocessed data and codes are available on the GitHub repository.
Collapse
Affiliation(s)
- M. Raihan
- Department of Computer Science and Engineering, North Western University, Khulna 9100, Bangladesh; (M.R.); (M.M.H.); (T.H.)
| | - Md. Mehedi Hassan
- Department of Computer Science and Engineering, North Western University, Khulna 9100, Bangladesh; (M.R.); (M.M.H.); (T.H.)
| | - Towhid Hasan
- Department of Computer Science and Engineering, North Western University, Khulna 9100, Bangladesh; (M.R.); (M.M.H.); (T.H.)
| | - Abdullah Al-Mamun Bulbul
- Electronics and Communication Engineering Discipline, Khulna University, Khulna 9208, Bangladesh;
| | - Md. Kamrul Hasan
- Department of Electrical & Electronic Engineering, Khulna University of Engineering & Technology, Khulna 9203, Bangladesh;
| | - Md. Shahadat Hossain
- Department of Quantitative Sciences, International University of Business Agriculture and Technology, Dhaka 1230, Bangladesh;
| | - Dipa Shuvo Roy
- Department of Information Science & Engineering, Visvesvaraya Technological University, Jnana Sangama, VTU Main Rd., Machhe, Belagavi 590018, India;
| | - Md. Abdul Awal
- Electronics and Communication Engineering Discipline, Khulna University, Khulna 9208, Bangladesh;
- Correspondence: ; Tel.: +880-178-619-6913
| |
Collapse
|
56
|
Diagnosis and Prognosis of COVID-19 Disease Using Routine Blood Values and LogNNet Neural Network. SENSORS 2022; 22:s22134820. [PMID: 35808317 PMCID: PMC9269123 DOI: 10.3390/s22134820] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/24/2022] [Revised: 06/16/2022] [Accepted: 06/23/2022] [Indexed: 01/08/2023]
Abstract
Since February 2020, the world has been engaged in an intense struggle with the COVID-19 disease, and health systems have come under tragic pressure as the disease turned into a pandemic. The aim of this study is to obtain the most effective routine blood values (RBV) in the diagnosis and prognosis of COVID-19 using a backward feature elimination algorithm for the LogNNet reservoir neural network. The first dataset in the study consists of a total of 5296 patients with the same number of negative and positive COVID-19 tests. The LogNNet-model achieved the accuracy rate of 99.5% in the diagnosis of the disease with 46 features and the accuracy of 99.17% with only mean corpuscular hemoglobin concentration, mean corpuscular hemoglobin, and activated partial prothrombin time. The second dataset consists of a total of 3899 patients with a diagnosis of COVID-19 who were treated in hospital, of which 203 were severe patients and 3696 were mild patients. The model reached the accuracy rate of 94.4% in determining the prognosis of the disease with 48 features and the accuracy of 82.7% with only erythrocyte sedimentation rate, neutrophil count, and C reactive protein features. Our method will reduce the negative pressures on the health sector and help doctors to understand the pathogenesis of COVID-19 using the key features. The method is promising to create mobile health monitoring systems in the Internet of Things.
Collapse
|
57
|
Gong H, Wang M, Zhang H, Elahe MF, Jin M. An Explainable AI Approach for the Rapid Diagnosis of COVID-19 Using Ensemble Learning Algorithms. Front Public Health 2022; 10:874455. [PMID: 35801239 PMCID: PMC9253566 DOI: 10.3389/fpubh.2022.874455] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2022] [Accepted: 05/19/2022] [Indexed: 11/13/2022] Open
Abstract
Background Artificial intelligence-based disease prediction models have a greater potential to screen COVID-19 patients than conventional methods. However, their application has been restricted because of their underlying black-box nature. Objective To addressed this issue, an explainable artificial intelligence (XAI) approach was developed to screen patients for COVID-19. Methods A retrospective study consisting of 1,737 participants (759 COVID-19 patients and 978 controls) admitted to San Raphael Hospital (OSR) from February to May 2020 was used to construct a diagnosis model. Finally, 32 key blood test indices from 1,374 participants were used for screening patients for COVID-19. Four ensemble learning algorithms were used: random forest (RF), adaptive boosting (AdaBoost), gradient boosting decision tree (GBDT), and extreme gradient boosting (XGBoost). Feature importance from the perspective of the clinical domain and visualized interpretations were illustrated by using local interpretable model-agnostic explanations (LIME) plots. Results The GBDT model [area under the curve (AUC): 86.4%; 95% confidence interval (CI) 0.821–0.907] outperformed the RF model (AUC: 85.7%; 95% CI 0.813–0.902), AdaBoost model (AUC: 85.4%; 95% CI 0.810–0.899), and XGBoost model (AUC: 84.9%; 95% CI 0.803–0.894) in distinguishing patients with COVID-19 from those without. The cumulative feature importance of lactate dehydrogenase, white blood cells, and eosinophil counts was 0.145, 0.130, and 0.128, respectively. Conclusions Ensemble machining learning (ML) approaches, mainly GBDT and LIME plots, are efficient for screening patients with COVID-19 and might serve as a potential tool in the auxiliary diagnosis of COVID-19. Patients with higher WBC count, higher LDH level, or higher EOT count, were more likely to have COVID-19.
Collapse
Affiliation(s)
- Houwu Gong
- Department of Software Engineering, College of Computer Science and Electronic Engineering, Hunan University, Changsha, China
- Academy of Military Sciences, Beijing, China
| | - Miye Wang
- Engineering Research Center of Medical Information Technology, Ministry of Education, West China Hospital, Chengdu, China
- Information Center, West China Hospital, Chengdu, China
| | - Hanxue Zhang
- Department of Software Engineering, College of Computer Science and Electronic Engineering, Hunan University, Changsha, China
| | - Md Fazla Elahe
- Department of Software Engineering, College of Computer Science and Electronic Engineering, Hunan University, Changsha, China
| | - Min Jin
- Department of Software Engineering, College of Computer Science and Electronic Engineering, Hunan University, Changsha, China
- *Correspondence: Min Jin
| |
Collapse
|
58
|
Comito C, Pizzuti C. Artificial intelligence for forecasting and diagnosing COVID-19 pandemic: A focused review. Artif Intell Med 2022; 128:102286. [PMID: 35534142 PMCID: PMC8958821 DOI: 10.1016/j.artmed.2022.102286] [Citation(s) in RCA: 22] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2021] [Revised: 03/16/2022] [Accepted: 03/21/2022] [Indexed: 02/05/2023]
Abstract
The outbreak of novel corona virus 2019 (COVID-19) has been treated as a public health crisis of global concern by the World Health Organization (WHO). COVID-19 pandemic hugely affected countries worldwide raising the need to exploit novel, alternative and emerging technologies to respond to the emergency created by the weak health-care systems. In this context, Artificial Intelligence (AI) techniques can give a valid support to public health authorities, complementing traditional approaches with advanced tools. This study provides a comprehensive review of methods, algorithms, applications, and emerging AI technologies that can be utilized for forecasting and diagnosing COVID-19. The main objectives of this review are summarized as follows. (i) Understanding the importance of AI approaches such as machine learning and deep learning for COVID-19 pandemic; (ii) discussing the efficiency and impact of these methods for COVID-19 forecasting and diagnosing; (iii) providing an extensive background description of AI techniques to help non-expert to better catch the underlying concepts; (iv) for each work surveyed, give a detailed analysis of the rationale behind the approach, highlighting the method used, the type and size of data analyzed, the validation method, the target application and the results achieved; (v) focusing on some future challenges in COVID-19 forecasting and diagnosing.
Collapse
Affiliation(s)
- Carmela Comito
- National Research Council of Italy (CNR), Institute for High Performance Computing and Networking (ICAR), Rende, Italy.
| | - Clara Pizzuti
- National Research Council of Italy (CNR), Institute for High Performance Computing and Networking (ICAR), Rende, Italy.
| |
Collapse
|
59
|
Meraihi Y, Gabis AB, Mirjalili S, Ramdane-Cherif A, Alsaadi FE. Machine Learning-Based Research for COVID-19 Detection, Diagnosis, and Prediction: A Survey. SN COMPUTER SCIENCE 2022; 3:286. [PMID: 35578678 PMCID: PMC9096341 DOI: 10.1007/s42979-022-01184-z] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/09/2022] [Accepted: 04/30/2022] [Indexed: 12/12/2022]
Abstract
The year 2020 experienced an unprecedented pandemic called COVID-19, which impacted the whole world. The absence of treatment has motivated research in all fields to deal with it. In Computer Science, contributions mainly include the development of methods for the diagnosis, detection, and prediction of COVID-19 cases. Data science and Machine Learning (ML) are the most widely used techniques in this area. This paper presents an overview of more than 160 ML-based approaches developed to combat COVID-19. They come from various sources like Elsevier, Springer, ArXiv, MedRxiv, and IEEE Xplore. They are analyzed and classified into two categories: Supervised Learning-based approaches and Deep Learning-based ones. In each category, the employed ML algorithm is specified and a number of used parameters is given. The parameters set for each of the algorithms are gathered in different tables. They include the type of the addressed problem (detection, diagnosis, or detection), the type of the analyzed data (Text data, X-ray images, CT images, Time series, Clinical data,...) and the evaluated metrics (accuracy, precision, sensitivity, specificity, F1-Score, and AUC). The study discusses the collected information and provides a number of statistics drawing a picture about the state of the art. Results show that Deep Learning is used in 79% of cases where 65% of them are based on the Convolutional Neural Network (CNN) and 17% use Specialized CNN. On his side, supervised learning is found in only 16% of the reviewed approaches and only Random Forest, Support Vector Machine (SVM) and Regression algorithms are employed.
Collapse
Affiliation(s)
- Yassine Meraihi
- LIST Laboratory, University of M’Hamed Bougara Boumerdes, Avenue of Independence, 35000 Boumerdes, Algeria
| | - Asma Benmessaoud Gabis
- Ecole nationale Supérieure d’Informatique, Laboratoire des Méthodes de Conception des Systèmes, BP 68 M, 16309 Oued-Smar, Alger Algeria
| | - Seyedali Mirjalili
- Centre for Artificial Intelligence Research and Optimisation, Torrens University Australia, Fortitude Valley, Brisbane, QLD 4006 Australia
- Yonsei Frontier Lab, Yonsei University, Seoul, Korea
| | - Amar Ramdane-Cherif
- LISV Laboratory, University of Versailles St-Quentin-en-Yvelines, 10-12 Avenue of Europe, 78140 Velizy, France
| | - Fawaz E. Alsaadi
- Information Technology Department, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi Arabia
| |
Collapse
|
60
|
Mazloumi R, Abazari SR, Nafarieh F, Aghsami A, Jolai F. Statistical analysis of blood characteristics of COVID-19 patients and their survival or death prediction using machine learning algorithms. Neural Comput Appl 2022; 34:14729-14743. [PMID: 35571512 PMCID: PMC9092327 DOI: 10.1007/s00521-022-07325-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2021] [Accepted: 04/18/2022] [Indexed: 11/29/2022]
Abstract
This study's main purpose is to provide helpful information using blood samples from COVID-19 patients as a non-medical approach for helping healthcare systems during the pandemic. Also, this paper aims to evaluate machine learning algorithms for predicting the survival or death of COVID-19 patients. We use a blood sample dataset of 306 infected patients in Wuhan, China, compiled by Tangji Hospital. The dataset consists of blood's clinical indicators and information about whether patients are recovering or not. The used methods include K-nearest neighbor (KNN), decision tree (DT), logistic regression (LR), support vector machine (SVM), random forest (RF), stochastic gradient descent (SGD), bagging classifier (BC), and adaptive boosting (AdaBoost). We compare the performance of machine learning algorithms using statistical hypothesis testing. The results show that the most critical feature is age, and there is a high correlation between LD and CRP, and leukocytes and CRP. Furthermore, RF, SVM, DT, AdaBoost, DT, and KNN outperform other machine learning algorithms in predicting the survival or death of COVID-19 patients.
Collapse
Affiliation(s)
- Rahil Mazloumi
- School of Industrial and Systems Engineering, College of Engineering, University of Tehran, P.O. Box, Tehran, 11155-4563 Iran
| | - Seyed Reza Abazari
- School of Industrial and Systems Engineering, College of Engineering, University of Tehran, P.O. Box, Tehran, 11155-4563 Iran
| | - Farnaz Nafarieh
- School of Industrial and Systems Engineering, College of Engineering, University of Tehran, P.O. Box, Tehran, 11155-4563 Iran
| | - Amir Aghsami
- School of Industrial and Systems Engineering, College of Engineering, University of Tehran, P.O. Box, Tehran, 11155-4563 Iran
- School of Industrial Engineering, K. N. Toosi University of Technology (KNTU), Tehran, Iran
| | - Fariborz Jolai
- School of Industrial and Systems Engineering, College of Engineering, University of Tehran, P.O. Box, Tehran, 11155-4563 Iran
| |
Collapse
|
61
|
Galetsi P, Katsaliaki K, Kumar S. The medical and societal impact of big data analytics and artificial intelligence applications in combating pandemics: A review focused on Covid-19. Soc Sci Med 2022; 301:114973. [PMID: 35452893 PMCID: PMC9001170 DOI: 10.1016/j.socscimed.2022.114973] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2021] [Revised: 02/21/2022] [Accepted: 04/08/2022] [Indexed: 12/23/2022]
Abstract
With Covid-19 impacting communities in different ways, research has increasingly turned to big data analytics (BDA) and artificial intelligence (AI) tools to track and monitor the virus's spread and its effect on humanity and the global economy. The purpose of this study is to conduct an in-depth literature review to identify how BDA and AI were involved in the management of Covid-19 (while considering diversity, equity, and inclusion (DEI)). The rigorous search resulted in a portfolio of 607 articles, retrieved from the Web of Science database, where content analysis has been conducted. This study identifies the BDA and AI applications developed to deal with the initial Covid-19 outbreak and the containment of the pandemic, along with their benefits for the social good. Moreover, this study reveals the DEI challenges related to these applications, ways to mitigate the concerns, and how to develop viable techniques to deal with similar crises in the future. The article pool recognized the high presence of machine learning (ML) and the role of mobile technology, social media and telemedicine in the use of BDA and AI during Covid-19. This study offers a collective insight into many of the key issues and underlying complexities affecting public health and society from Covid-19, and the solutions offered from information systems and technological perspectives.
Collapse
Affiliation(s)
- Panagiota Galetsi
- School of Humanities, Social Sciences and Economics, International Hellenic University, 14th Km Thessaloniki-N.Moudania, Thessaloniki, 57001, Greece
| | - Korina Katsaliaki
- School of Humanities, Social Sciences and Economics, International Hellenic University, 14th Km Thessaloniki-N.Moudania, Thessaloniki, 57001, Greece
| | - Sameer Kumar
- Opus College of Business, University of St. Thomas Minneapolis Campus 1000 LaSalle Ave, Schulze Hall 333, Minneapolis, MN, 55403, USA.
| |
Collapse
|
62
|
Kuo KM, Talley PC, Chang CS. The Accuracy of Machine Learning Approaches Using Non-image Data for the Prediction of COVID-19: A Meta-Analysis. Int J Med Inform 2022; 164:104791. [PMID: 35594810 PMCID: PMC9098530 DOI: 10.1016/j.ijmedinf.2022.104791] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2021] [Revised: 04/08/2022] [Accepted: 05/09/2022] [Indexed: 12/12/2022]
Abstract
Objective COVID-19 is a novel, severely contagious disease with enormous negative impact on humanity as well as the world economy. An expeditious, feasible tool for detecting COVID-19 remains yet elusive. Recently, there has been a surge of interest in applying machine learning techniques to predict COVID-19 using non-image data. We have therefore undertaken a meta-analysis to quantify the diagnostic performance of machine learning models facilitating the prediction of COVID-19. Materials and methods A comprehensive electronic database search for the period between January 1st, 2021 and December 3rd, 2021 was undertaken in order to identify eligible studies relevant to this meta-analysis. Summary sensitivity, specificity, and the area under receiver operating characteristic curves were used to assess potential diagnostic accuracy. Risk of bias was assessed by means of a revised Quality Assessment of Diagnostic Studies. Results A total of 30 studies, including 34 models, met all of the inclusion criteria. Summary sensitivity, specificity, and area under receiver operating characteristic curves were 0.86, 0.86, and 0.91, respectively. The purpose of machine learning models, class imbalance, and feature selection are significant covariates useful in explaining the between-study heterogeneity, in terms of both sensitivity and specificity. Conclusions Our study findings show that non-image data can be used to predict COVID-19 with an acceptable performance. Further, class imbalance and feature selection are suggested to be incorporated whenever building models for the prediction of COVID-19, thus improving further diagnostic performance.
Collapse
|
63
|
Speech as a Biomarker for COVID-19 Detection Using Machine Learning. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:6093613. [PMID: 35444694 PMCID: PMC9014833 DOI: 10.1155/2022/6093613] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/15/2022] [Revised: 03/07/2022] [Accepted: 03/21/2022] [Indexed: 11/30/2022]
Abstract
The use of speech as a biomedical signal for diagnosing COVID-19 is investigated using statistical analysis of speech spectral features and classification algorithms based on machine learning. It is established that spectral features of speech, obtained by computing the short-time Fourier Transform (STFT), get altered in a statistical sense as a result of physiological changes. These spectral features are then used as input features to machine learning-based classification algorithms to classify them as coming from a COVID-19 positive individual or not. Speech samples from healthy as well as “asymptomatic” COVID-19 positive individuals have been used in this study. It is shown that the RMS error of statistical distribution fitting is higher in the case of speech samples of COVID-19 positive speech samples as compared to the speech samples of healthy individuals. Five state-of-the-art machine learning classification algorithms have also been analyzed, and the performance evaluation metrics of these algorithms are also presented. The tuning of machine learning model parameters is done so as to minimize the misclassification of COVID-19 positive individuals as being COVID-19 negative since the cost associated with this misclassification is higher than the opposite misclassification. The best performance in terms of the “recall” metric is observed for the Decision Forest algorithm which gives a recall value of 0.7892.
Collapse
|
64
|
An Improved Machine-Learning Approach for COVID-19 Prediction Using Harris Hawks Optimization and Feature Analysis Using SHAP. Diagnostics (Basel) 2022; 12:diagnostics12051023. [PMID: 35626179 PMCID: PMC9139459 DOI: 10.3390/diagnostics12051023] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2022] [Revised: 04/02/2022] [Accepted: 04/16/2022] [Indexed: 02/01/2023] Open
Abstract
A healthcare monitoring system needs the support of recent technologies such as artificial intelligence (AI), machine learning (ML), and big data, especially during the COVID-19 pandemic. This global pandemic has already taken millions of lives. Both infected and uninfected people have generated big data where AI and ML can use to combat and detect COVID-19 at an early stage. Motivated by this, an improved ML framework for the early detection of this disease is proposed in this paper. The state-of-the-art Harris hawks optimization (HHO) algorithm with an improved objective function is proposed and applied to optimize the hyperparameters of the ML algorithms, namely HHO-based eXtreme gradient boosting (HHOXGB), light gradient boosting (HHOLGB), categorical boosting (HHOCAT), random forest (HHORF) and support vector classifier (HHOSVC). An ensemble technique was applied to these optimized ML models to improve the prediction performance. Our proposed method was applied to publicly available big COVID-19 data and yielded a prediction accuracy of 92.38% using the ensemble model. In contrast, HHOXGB provided the highest accuracy of 92.23% as a single optimized model. The performance of the proposed method was compared with the traditional algorithms and other ML-based methods. In both cases, our proposed method performed better. Furthermore, not only the classification improvement, but also the features are analyzed in terms of feature importance calculated by SHapely adaptive exPlanations (SHAP) values. A graphical user interface is also discussed as a potential tool for nonspecialist users such as clinical staff and nurses. The processed data, trained model, and codes related to this study are available at GitHub.
Collapse
|
65
|
Anand S, Sharma V, Pourush R, Jaiswal S. A comprehensive survey on the biomedical signal processing methods for the detection of COVID-19. Ann Med Surg (Lond) 2022; 76:103519. [PMID: 35401978 PMCID: PMC8975609 DOI: 10.1016/j.amsu.2022.103519] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2022] [Revised: 03/09/2022] [Accepted: 03/26/2022] [Indexed: 12/16/2022] Open
Abstract
The novel coronavirus, renamed SARS-CoV-2 and most commonly referred to as COVID-19, has infected nearly 44.83 million people in 224 countries and has been designated SARS-CoV-2. In this study, we used 'web of Science', 'Scopus' and 'goggle scholar' with the keywords of "SARS-CoV-2 detection" or "coronavirus 2019 detection" or "COVID 2019 detection" or "COVID 19 detection" "corona virus techniques for detection of COVID-19", "audio techniques for detection of COVID-19", "speech techniques for detection of COVID-19", for period of 2019-2021. Some COVID-19 instances have an impact on speech production, which suggests that researchers should look for signs of disease detection in speech utilising audio and speech recognition signals from humans to better understand the condition. It is presented in this review that an overview of human audio signals is presented using an AI (Artificial Intelligence) model to diagnose, spread awareness, and monitor COVID-19, employing bio and non-obtrusive signals that communicated human speech and non-speech audio information is presented. Development of accurate and rapid screening techniques that permit testing at a reasonable cost is critical in the current COVID-19 pandemic crisis, according to the World Health Organization. In this context, certain existing investigations have shown potential in the detection of COVID 19 diagnostic signals from relevant auditory noises, which is a promising development. According to authors, it is not a single "perfect" COVID-19 test that is required, but rather a combination of rapid and affordable tests, non-clinic pre-screening tools, and tools from a variety of supply chains and technologies that will allow us to safely return to our normal lives while we await the completion of the hassle free COVID-19 vaccination process for all ages. This review was able to gather information on biomedical signal processing in the detection of speech, coughing sounds, and breathing signals for the purpose of diagnosing and screening the COVID-19 virus.
Collapse
Affiliation(s)
- Satyajit Anand
- Electronics and Communication Engineering, Mody University of Science and Technology, India
| | - Vikrant Sharma
- Mechanical Engineering, Mody University of Science and Technology, India
| | - Rajeev Pourush
- Electronics and Communication Engineering, Mody University of Science and Technology, India
| | - Sandeep Jaiswal
- Biomedical Engineering, Mody University of Science and Technology, India
| |
Collapse
|
66
|
Roland T, Böck C, Tschoellitsch T, Maletzky A, Hochreiter S, Meier J, Klambauer G. Domain Shifts in Machine Learning Based Covid-19 Diagnosis From Blood Tests. J Med Syst 2022. [PMID: 35348909 DOI: 10.1101/2021.04.06.21254997] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
Many previous studies claim to have developed machine learning models that diagnose COVID-19 from blood tests. However, we hypothesize that changes in the underlying distribution of the data, so called domain shifts, affect the predictive performance and reliability and are a reason for the failure of such machine learning models in clinical application. Domain shifts can be caused, e.g., by changes in the disease prevalence (spreading or tested population), by refined RT-PCR testing procedures (way of taking samples, laboratory procedures), or by virus mutations. Therefore, machine learning models for diagnosing COVID-19 or other diseases may not be reliable and degrade in performance over time. We investigate whether domain shifts are present in COVID-19 datasets and how they affect machine learning methods. We further set out to estimate the mortality risk based on routinely acquired blood tests in a hospital setting throughout pandemics and under domain shifts. We reveal domain shifts by evaluating the models on a large-scale dataset with different assessment strategies, such as temporal validation. We present the novel finding that domain shifts strongly affect machine learning models for COVID-19 diagnosis and deteriorate their predictive performance and credibility. Therefore, frequent re-training and re-assessment are indispensable for robust models enabling clinical utility.
Collapse
Affiliation(s)
- Theresa Roland
- ELLIS Unit Linz, LIT AI Lab, Institute for Machine Learning, Johannes Kepler University Linz, Linz, Austria.
| | - Carl Böck
- Department of Anesthesiology and Critical Care Medicine, Kepler University Hospital GmbH, Johannes Kepler University Linz, Linz, Austria
| | - Thomas Tschoellitsch
- Department of Anesthesiology and Critical Care Medicine, Kepler University Hospital GmbH, Johannes Kepler University Linz, Linz, Austria
| | | | - Sepp Hochreiter
- ELLIS Unit Linz, LIT AI Lab, Institute for Machine Learning, Johannes Kepler University Linz, Linz, Austria
| | - Jens Meier
- Department of Anesthesiology and Critical Care Medicine, Kepler University Hospital GmbH, Johannes Kepler University Linz, Linz, Austria
| | - Günter Klambauer
- ELLIS Unit Linz, LIT AI Lab, Institute for Machine Learning, Johannes Kepler University Linz, Linz, Austria
| |
Collapse
|
67
|
Roland T, Böck C, Tschoellitsch T, Maletzky A, Hochreiter S, Meier J, Klambauer G. Domain Shifts in Machine Learning Based Covid-19 Diagnosis From Blood Tests. J Med Syst 2022; 46:23. [PMID: 35348909 PMCID: PMC8960704 DOI: 10.1007/s10916-022-01807-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2021] [Accepted: 02/10/2022] [Indexed: 12/23/2022]
Abstract
AbstractMany previous studies claim to have developed machine learning models that diagnose COVID-19 from blood tests. However, we hypothesize that changes in the underlying distribution of the data, so called domain shifts, affect the predictive performance and reliability and are a reason for the failure of such machine learning models in clinical application. Domain shifts can be caused, e.g., by changes in the disease prevalence (spreading or tested population), by refined RT-PCR testing procedures (way of taking samples, laboratory procedures), or by virus mutations. Therefore, machine learning models for diagnosing COVID-19 or other diseases may not be reliable and degrade in performance over time. We investigate whether domain shifts are present in COVID-19 datasets and how they affect machine learning methods. We further set out to estimate the mortality risk based on routinely acquired blood tests in a hospital setting throughout pandemics and under domain shifts. We reveal domain shifts by evaluating the models on a large-scale dataset with different assessment strategies, such as temporal validation. We present the novel finding that domain shifts strongly affect machine learning models for COVID-19 diagnosis and deteriorate their predictive performance and credibility. Therefore, frequent re-training and re-assessment are indispensable for robust models enabling clinical utility.
Collapse
Affiliation(s)
- Theresa Roland
- ELLIS Unit Linz, LIT AI Lab, Institute for Machine Learning, Johannes Kepler University Linz, Linz, Austria.
| | - Carl Böck
- Department of Anesthesiology and Critical Care Medicine, Kepler University Hospital GmbH, Johannes Kepler University Linz, Linz, Austria
| | - Thomas Tschoellitsch
- Department of Anesthesiology and Critical Care Medicine, Kepler University Hospital GmbH, Johannes Kepler University Linz, Linz, Austria
| | | | - Sepp Hochreiter
- ELLIS Unit Linz, LIT AI Lab, Institute for Machine Learning, Johannes Kepler University Linz, Linz, Austria
| | - Jens Meier
- Department of Anesthesiology and Critical Care Medicine, Kepler University Hospital GmbH, Johannes Kepler University Linz, Linz, Austria
| | - Günter Klambauer
- ELLIS Unit Linz, LIT AI Lab, Institute for Machine Learning, Johannes Kepler University Linz, Linz, Austria
| |
Collapse
|
68
|
Ganjali R, Eslami S, Samimi T, Sargolzaei M, Firouraghi N, MohammadEbrahimi S, Khoshrounejad F, Kheirdoust A. Clinical informatics solutions in COVID-19 pandemic: Scoping literature review. INFORMATICS IN MEDICINE UNLOCKED 2022; 30:100929. [PMID: 35350124 PMCID: PMC8949656 DOI: 10.1016/j.imu.2022.100929] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2021] [Revised: 03/06/2022] [Accepted: 03/22/2022] [Indexed: 01/11/2023] Open
Abstract
Background The global outbreak of COVID-19 (coronavirus disease 2019) disease has highlighted the importance of disease monitoring, diagnosing, treating, and screening. Technology-based instruments could efficiently assist healthcare systems during pandemics by allowing rapid and widespread transfer of information, real-time tracking of data transfer, and virtualization of meetings and patient visits. Therefore, this study was conducted to investigate the applications of clinical informatics (CI) during the COVID-19 outbreak. Methods A comprehensive search was performed on Medline and Scopus databases in September 2020. Eligible studies were selected based on the inclusion and exclusion criteria. The extracted data from the studies reviewed were about study sample, study type, objectives, clinical informatics domain, applied method, sample size, outcomes, findings, and conclusion. The risk of bias was evaluated in the studies using appropriate instruments based on the type of each study. The selected studies were then subjected to thematic synthesis. Results In this review study, 72 out of 2716 retrieved articles met the inclusion criteria for full-text analysis. Most of the articles reviewed were done in China and the United States of America. The majority of the studies were conducted in the following CI domains: prediction models (60%), telehealth (36%), and mobile health (4%). Most of the studies in telehealth domain used synchronous methods, such as online and phone- or video-call consultations. Mobile applications were developed as self-triage, self-scheduling, and information delivery tools during the COVID-19 pandemic. The most common types of prediction models among the reviewed studies were neural network (49%), classification (42%), and linear models (4.5%). Conclusion The present study showed clinical informatics applications during COVID-19 and identified current gaps in this field. Health information technology and clinical informatics seem to be useful in assisting clinicians and managers to combat COVID-19. The most common domains in clinical informatics for research on the COVID-19 crisis were prediction models and telehealth. It is suggested that future researchers conduct scoping reviews to describe and analyze other levels of medical informatics, including bioinformatics, imaging informatics, and public health informatics.
Collapse
Affiliation(s)
- Raheleh Ganjali
- Clinical Research Development Unit, Emam Reza Hospital, Mashhad University of Medical Sciences, Mashhad, Iran
- Department of Medical Informatics, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Saeid Eslami
- Department of Medical Informatics, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran
- Pharmaceutical Research Center, Mashhad University of Medical Sciences, Mashhad, Iran
- Department of Medical Informatics, University of Amsterdam, Amsterdam, the Netherlands
| | - Tahereh Samimi
- Department of Medical Informatics, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Mahdi Sargolzaei
- Department of Medical Informatics, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Neda Firouraghi
- Department of Medical Informatics, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Shahab MohammadEbrahimi
- Department of Medical Informatics, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Farnaz Khoshrounejad
- Department of Medical Informatics, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Azam Kheirdoust
- Department of Medical Informatics, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran
| |
Collapse
|
69
|
Liu S, Xu Y, Jiang X, Tan H, Ying B. Translation of aptamers toward clinical diagnosis and commercialization. Biosens Bioelectron 2022; 208:114168. [PMID: 35364525 DOI: 10.1016/j.bios.2022.114168] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2021] [Revised: 02/28/2022] [Accepted: 03/07/2022] [Indexed: 02/08/2023]
Abstract
The dominance of antibodies in diagnostics has gradually changed following the discovery of aptamers in the early 1990s. Aptamers offer inherent advantages over traditional antibodies, including higher specificity, higher affinity, smaller size, greater stability, ease of manufacture, and low immunogenicity, rendering them the best candidates for point-of-care testing (POCT). In the past 20 years, the research community and pharmaceutical companies have made great efforts to promote the development of aptamer technology. Macugen® (pegaptanib) was the first aptamer drug approved by the US Food and Drug Administration (FDA), and various aptamer-based diagnostics show great promise in preclinical research and clinical trials. In this review, we introduce recent literature, ongoing clinical trials, commercial reagents of aptamer-based diagnostics, discuss the FDA regulatory mechanisms, and highlight the prospects and challenges in translating these studies into viable clinical diagnostic tools.
Collapse
Affiliation(s)
- Shan Liu
- Sichuan Provincial Key Laboratory for Human Disease Gene Study, Department of Medical Genetics, Department of Laboratory Medicine, Sichuan Academy of Medical Sciences & Sichuan Provincial People's Hospital, University of Electronic Science and Technology, Chengdu, 610072, China
| | - Yixin Xu
- Department of Laboratory Medicine, West China Hospital, Sichuan University, Chengdu, 610041, China; Med+ Molecular Diagnostics Institute of West China Hospital/West China School of Medicine, Chengdu, 610041, China
| | - Xin Jiang
- Department of Laboratory Medicine, West China Hospital, Sichuan University, Chengdu, 610041, China; Med+ Molecular Diagnostics Institute of West China Hospital/West China School of Medicine, Chengdu, 610041, China
| | - Hong Tan
- Department of General Surgery, Chengdu Integrated TCM&Western Medicine Hospital (Chengdu First People's Hospital), Chengdu, 610041, China.
| | - Binwu Ying
- Department of Laboratory Medicine, West China Hospital, Sichuan University, Chengdu, 610041, China; Med+ Molecular Diagnostics Institute of West China Hospital/West China School of Medicine, Chengdu, 610041, China.
| |
Collapse
|
70
|
Abayomi-Alli OO, Damaševičius R, Maskeliūnas R, Misra S. An Ensemble Learning Model for COVID-19 Detection from Blood Test Samples. SENSORS 2022; 22:s22062224. [PMID: 35336395 PMCID: PMC8955536 DOI: 10.3390/s22062224] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/31/2021] [Revised: 02/28/2022] [Accepted: 03/10/2022] [Indexed: 02/04/2023]
Abstract
Current research endeavors in the application of artificial intelligence (AI) methods in the diagnosis of the COVID-19 disease has proven indispensable with very promising results. Despite these promising results, there are still limitations in real-time detection of COVID-19 using reverse transcription polymerase chain reaction (RT-PCR) test data, such as limited datasets, imbalance classes, a high misclassification rate of models, and the need for specialized research in identifying the best features and thus improving prediction rates. This study aims to investigate and apply the ensemble learning approach to develop prediction models for effective detection of COVID-19 using routine laboratory blood test results. Hence, an ensemble machine learning-based COVID-19 detection system is presented, aiming to aid clinicians to diagnose this virus effectively. The experiment was conducted using custom convolutional neural network (CNN) models as a first-stage classifier and 15 supervised machine learning algorithms as a second-stage classifier: K-Nearest Neighbors, Support Vector Machine (Linear and RBF), Naive Bayes, Decision Tree, Random Forest, MultiLayer Perceptron, AdaBoost, ExtraTrees, Logistic Regression, Linear and Quadratic Discriminant Analysis (LDA/QDA), Passive, Ridge, and Stochastic Gradient Descent Classifier. Our findings show that an ensemble learning model based on DNN and ExtraTrees achieved a mean accuracy of 99.28% and area under curve (AUC) of 99.4%, while AdaBoost gave a mean accuracy of 99.28% and AUC of 98.8% on the San Raffaele Hospital dataset, respectively. The comparison of the proposed COVID-19 detection approach with other state-of-the-art approaches using the same dataset shows that the proposed method outperforms several other COVID-19 diagnostics methods.
Collapse
Affiliation(s)
- Olusola O. Abayomi-Alli
- Department of Software Engineering, Kaunas University of Technology, 51368 Kaunas, Lithuania;
| | - Robertas Damaševičius
- Department of Software Engineering, Kaunas University of Technology, 51368 Kaunas, Lithuania;
- Correspondence:
| | - Rytis Maskeliūnas
- Department of Multimedia Engineering, Kaunas University of Technology, 51368 Kaunas, Lithuania;
| | - Sanjay Misra
- Department of Computer Science and Communication, Ostfold University College, 3001 Halden, Norway;
| |
Collapse
|
71
|
Feltes BC, Vieira IA, Parraga-Alava J, Meza J, Portmann E, Terán L, Dorn M. Feature selection reveal peripheral blood parameter's changes between COVID-19 infections patients from Brazil and Ecuador. INFECTION, GENETICS AND EVOLUTION 2022; 98:105228. [PMID: 35104680 PMCID: PMC8800568 DOI: 10.1016/j.meegid.2022.105228] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/24/2021] [Accepted: 01/23/2022] [Indexed: 11/29/2022]
Abstract
The investigation of conventional complete blood-count (CBC) data for classifying the SARS-CoV-2 infection status became a topic of interest, particularly as a complementary laboratory tool in developing and third-world countries that financially struggled to test their population. Although hematological parameters in COVID-19-affected individuals from Asian and USA populations are available, there are no descriptions of comparative analyses of CBC findings between COVID-19 positive and negative cases from Latin American countries. In this sense, machine learning techniques have been employed to examine CBC data and aid in screening patients suspected of SARS-CoV-2 infection. In this work, we used machine learning to compare CBC data between two highly genetically distinguished Latin American countries: Brazil and Ecuador. We notice a clear distribution pattern of positive and negative cases between the two countries. Interestingly, almost all red blood cell count parameters were divergent. For males, neutrophils and lymphocytes are distinct between Brazil and Ecuador, while eosinophils are distinguished for females. Finally, neutrophils, lymphocytes, and monocytes displayed a particular distribution for both genders. Therefore, our findings demonstrate that the same set of CBC features relevant to one population is unlikely to apply to another. This is the first study to compare CBC data from two genetically distinct Latin American countries.
Collapse
Affiliation(s)
- Bruno César Feltes
- Department of Genetics, Institute of Bioscience, and Department of Biophysics, Institute of Bioscience, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil
| | - Igor Araújo Vieira
- Genomic Medicine Laboratory, Experimental Research Center, Hospital de Clínicas de Porto Alegre, Porto Alegre, RS, Brazil
| | - Jorge Parraga-Alava
- Facultad de Ciencias Informaticas, Universidad Técnica de Manabí, Portoviejo, Manabí, Ecuador
| | - Jaime Meza
- Facultad de Ciencias Informaticas, Universidad Técnica de Manabí, Portoviejo, Manabí, Ecuador
| | - Edy Portmann
- Human-IST Institute, University of Fribourg, Fribourg, Switzerland
| | - Luis Terán
- Human-IST Institute, University of Fribourg, Fribourg, Switzerland
| | - Márcio Dorn
- Institute of Informatics, Center of Biotechnology, Federal University of Rio Grande do Sul, RS, Brazil; National Institute of Science and Technology - Forensic Science, Porto Alegre, RS, Brazil.
| |
Collapse
|
72
|
Hasan A, Levene M, Weston D, Fromson R, Koslover N, Levene T. Monitoring Covid-19 on social media using a novel triage and diagnosis approach. J Med Internet Res 2022; 24:e30397. [PMID: 35142636 PMCID: PMC8887561 DOI: 10.2196/30397] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2021] [Revised: 07/09/2021] [Accepted: 02/05/2022] [Indexed: 12/23/2022] Open
Abstract
Background The COVID-19 pandemic has created a pressing need for integrating information from disparate sources in order to assist decision makers. Social media is important in this respect; however, to make sense of the textual information it provides and be able to automate the processing of large amounts of data, natural language processing methods are needed. Social media posts are often noisy, yet they may provide valuable insights regarding the severity and prevalence of the disease in the population. Here, we adopt a triage and diagnosis approach to analyzing social media posts using machine learning techniques for the purpose of disease detection and surveillance. We thus obtain useful prevalence and incidence statistics to identify disease symptoms and their severities, motivated by public health concerns. Objective This study aims to develop an end-to-end natural language processing pipeline for triage and diagnosis of COVID-19 from patient-authored social media posts in order to provide researchers and public health practitioners with additional information on the symptoms, severity, and prevalence of the disease rather than to provide an actionable decision at the individual level. Methods The text processing pipeline first extracted COVID-19 symptoms and related concepts, such as severity, duration, negations, and body parts, from patients’ posts using conditional random fields. An unsupervised rule-based algorithm was then applied to establish relations between concepts in the next step of the pipeline. The extracted concepts and relations were subsequently used to construct 2 different vector representations of each post. These vectors were separately applied to build support vector machine learning models to triage patients into 3 categories and diagnose them for COVID-19. Results We reported macro- and microaveraged F1 scores in the range of 71%-96% and 61%-87%, respectively, for the triage and diagnosis of COVID-19 when the models were trained on human-labeled data. Our experimental results indicated that similar performance can be achieved when the models are trained using predicted labels from concept extraction and rule-based classifiers, thus yielding end-to-end machine learning. In addition, we highlighted important features uncovered by our diagnostic machine learning models and compared them with the most frequent symptoms revealed in another COVID-19 data set. In particular, we found that the most important features are not always the most frequent ones. Conclusions Our preliminary results show that it is possible to automatically triage and diagnose patients for COVID-19 from social media natural language narratives, using a machine learning pipeline in order to provide information on the severity and prevalence of the disease for use within health surveillance systems.
Collapse
Affiliation(s)
- Abul Hasan
- Birkbeck, University of London, Malet street, bloomsbury, London, GB
| | - Mark Levene
- Birkbeck, University of London, Malet street, bloomsbury, London, GB
| | - David Weston
- Birkbeck, University of London, Malet street, bloomsbury, London, GB
| | - Renate Fromson
- Barnet General Hospital, Wellhouse Lane, London EN5 3DJ, United Kingdom, London, GB
| | - Nicolas Koslover
- Barnet General Hospital, Wellhouse Lane, London EN5 3DJ, United Kingdom, London, GB
| | - Tamara Levene
- Barnet General Hospital, Wellhouse Lane, London EN5 3DJ, United Kingdom, London, GB
| |
Collapse
|
73
|
Fathi Karkan S, Maleki Baladi R, Shahgolzari M, Gholizadeh M, Shayegh F, Arashkia A. The evolving direct and indirect platforms for the detection of SARS-CoV-2. J Virol Methods 2022; 300:114381. [PMID: 34843826 PMCID: PMC8626143 DOI: 10.1016/j.jviromet.2021.114381] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2020] [Revised: 06/26/2021] [Accepted: 11/25/2021] [Indexed: 01/08/2023]
Abstract
Diagnosis of SARS-CoV-2 by standard screening measures can reduce the chance of COVID-19 spread before the symptoms become severe. Detecting viral RNA and antigens, anti-viral antibodies, and CT-scan are the most routine diagnostic methods. Accordingly, several diagnostic platforms including thermal and isothermal amplifications, CRISPR/Cas‑based approaches, digital PCR, ELISA, NGS, and point-of-care testing methods with variable sensitivities, have been developed that may facilitate managing and preventing the further spread of the infection. Here, we summarized the currently available direct and indirect testing platforms in research and clinical settings, including recent progress in the methods to detect viral RNA, antigens, and specific antibodies. This summary may help in selecting the effective method for a special application sucha as routine laboratory diagnosis, point-of-care tests or tracing the the virus spread and mutations.
Collapse
Affiliation(s)
- Sonia Fathi Karkan
- Student Research Committee, Tabriz University of Medical Sciences, Tabriz, Iran,Department of Medical Nanotechnology, Tabriz Medical University, Tabriz, Iran
| | - Reza Maleki Baladi
- Young Researchers and Elite Club, Tabriz Branch, Islamic Azad University, Tabriz, Iran
| | - Mehdi Shahgolzari
- Department of Medical Nanotechnology, Tabriz Medical University, Tabriz, Iran
| | - Monireh Gholizadeh
- Department of Medical Biotechnology, Faculty of Advanced Medical Sciences, Tabriz University of Medical Sciences, Tabriz, Iran,Department of Immunology, Pasteur Institute of Iran, Tehran, Iran
| | - Fahimeh Shayegh
- Department of Medical Biotechnology, Faculty of Advanced Medical Sciences, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Arash Arashkia
- Deaprtment of Molecular Virology, Pasteur Institute of Iran, Tehran, Iran.
| |
Collapse
|
74
|
Egberts G, Schaaphok M, Vermolen F, Zuijlen PV. A Bayesian finite-element trained machine learning approach for predicting post-burn contraction. Neural Comput Appl 2022; 34:8635-8642. [PMID: 35125668 PMCID: PMC8801043 DOI: 10.1007/s00521-021-06772-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2021] [Accepted: 11/21/2021] [Indexed: 11/29/2022]
Abstract
Burn injuries can decrease the quality of life of a patient tremendously, because of esthetic reasons and because of contractions that result from them. In severe case, skin contraction takes place at such a large extent that joint mobility of a patient is significantly inhibited. In these cases, one refers to a contracture. In order to predict the evolution of post-wounding skin, several mathematical model frameworks have been set up. These frameworks are based on complicated systems of partial differential equations that need finite element-like discretizations for the approximation of the solution. Since these computational frameworks can be expensive in terms of computation time and resources, we study the applicability of neural networks to reproduce the finite element results. Our neural network is able to simulate the evolution of skin in terms of contraction for over one year. The simulations are based on 25 input parameters that are characteristic for the patient and the injury. One of such input parameters is the stiffness of the skin. The neural network results have yielded an average goodness of fit (\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$R^2$$\end{document}R2) of 0.9928 (± 0.0013). Further, a tremendous speed-up of 19354X was obtained with the neural network. We illustrate the applicability by an online medical App that takes into account the age of the patient and the length of the burn.
Collapse
Affiliation(s)
- Ginger Egberts
- Delft Institute of Applied Mathematics, Delft University of Technology, Delft, The Netherlands
- Research Group Computational Mathematics(CMAT),Department of Mathematics and Statistics, University of Hasselt, Hasselt, Belgium
| | - Marianne Schaaphok
- Delft Institute of Applied Mathematics, Delft University of Technology, Delft, The Netherlands
| | - Fred Vermolen
- Research Group Computational Mathematics(CMAT),Department of Mathematics and Statistics, University of Hasselt, Hasselt, Belgium
| | - Paul van Zuijlen
- Burn Centre and Department of Plastic,Reconstructive & Hand Surgery, Red Cross Hospital, Beverwijk, Netherlands
- Department of Plastic, Reconstructive & Hand Surgery, Amsterdam UMC, location VUmc, Amsterdam Movement Sciences, Amsterdam, The Netherlands
- Pediatric Surgical Centre, Emma Children’s Hospital, Amsterdam UMC, location AMC and VUmc, Amsterdam, Netherlands
| |
Collapse
|
75
|
Lucas B, Vahedi B, Karimzadeh M. A spatiotemporal machine learning approach to forecasting COVID-19 incidence at the county level in the USA. INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS 2022; 15:247-266. [PMID: 35071733 PMCID: PMC8760128 DOI: 10.1007/s41060-021-00295-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Accepted: 11/19/2021] [Indexed: 01/06/2023]
Abstract
With COVID-19 affecting every country globally and changing everyday life, the ability to forecast the spread of the disease is more important than any previous epidemic. The conventional methods of disease-spread modeling, compartmental models, are based on the assumption of spatiotemporal homogeneity of the spread of the virus, which may cause forecasting to underperform, especially at high spatial resolutions. In this paper, we approach the forecasting task with an alternative technique-spatiotemporal machine learning. We present COVID-LSTM, a data-driven model based on a long short-term memory deep learning architecture for forecasting COVID-19 incidence at the county level in the USA. We use the weekly number of new positive cases as temporal input, and hand-engineered spatial features from Facebook movement and connectedness datasets to capture the spread of the disease in time and space. COVID-LSTM outperforms the COVID-19 Forecast Hub's Ensemble model (COVIDhub-ensemble) on our 17-week evaluation period, making it the first model to be more accurate than the COVIDhub-ensemble over one or more forecast periods. Over the 4-week forecast horizon, our model is on average 50 cases per county more accurate than the COVIDhub-ensemble. We highlight that the underutilization of data-driven forecasting of disease spread prior to COVID-19 is likely due to the lack of sufficient data available for previous diseases, in addition to the recent advances in machine learning methods for spatiotemporal forecasting. We discuss the impediments to the wider uptake of data-driven forecasting, and whether it is likely that more deep learning-based models will be used in the future.
Collapse
Affiliation(s)
- Benjamin Lucas
- Department of Geography, University of Colorado Boulder, Boulder, CO USA
| | - Behzad Vahedi
- Department of Geography, University of Colorado Boulder, Boulder, CO USA
| | - Morteza Karimzadeh
- Department of Geography, University of Colorado Boulder, Boulder, CO USA
| |
Collapse
|
76
|
Saeed U, Shah SY, Ahmad J, Imran MA, Abbasi QH, Shah SA. Machine learning empowered COVID-19 patient monitoring using non-contact sensing: An extensive review. J Pharm Anal 2022; 12:193-204. [PMID: 35003825 PMCID: PMC8724017 DOI: 10.1016/j.jpha.2021.12.006] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2021] [Revised: 12/29/2021] [Accepted: 12/30/2021] [Indexed: 12/20/2022] Open
Abstract
The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), which caused the coronavirus disease 2019 (COVID-19) pandemic, has affected more than 400 million people worldwide. With the recent rise of new Delta and Omicron variants, the efficacy of the vaccines has become an important question. The goal of various studies has been to limit the spread of the virus by utilizing wireless sensing technologies to prevent human-to-human interactions, particularly for healthcare workers. In this paper, we discuss the current literature on invasive/contact and non-invasive/non-contact technologies (including Wi-Fi, radar, and software-defined radio) that have been effectively used to detect, diagnose, and monitor human activities and COVID-19 related symptoms, such as irregular respiration. In addition, we focused on cutting-edge machine learning algorithms (such as generative adversarial networks, random forest, multilayer perceptron, support vector machine, extremely randomized trees, and k-nearest neighbors) and their essential role in intelligent healthcare systems. Furthermore, this study highlights the limitations related to non-invasive techniques and prospective research directions. This article describes cutting-edge technology (invasive/non-invasive) and its role in the recognition of COVID-19 symptoms. This article summarizes state-of-art machine-learning algorithms and their roles in modern healthcare systems. This article presents the challenges associated with wireless sensing techniques and potential future research directions.
Collapse
Affiliation(s)
- Umer Saeed
- Research Centre for Intelligent Healthcare, Coventry University, Coventry, CV1 5FB, UK
| | - Syed Yaseen Shah
- School of Computing, Engineering and Built Environment, Glasgow Caledonian University, Glasgow, G4 0BA, UK
| | - Jawad Ahmad
- School of Computing, Edinburgh Napier University, Edinburgh, EH11 4BN, UK
| | - Muhammad Ali Imran
- James Watt School of Engineering, University of Glasgow, Glasgow, G12 8QQ, UK
| | - Qammer H Abbasi
- James Watt School of Engineering, University of Glasgow, Glasgow, G12 8QQ, UK
| | - Syed Aziz Shah
- Research Centre for Intelligent Healthcare, Coventry University, Coventry, CV1 5FB, UK
| |
Collapse
|
77
|
Word2vec neural model-based technique to generate protein vectors for combating COVID-19: a machine learning approach. INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY : AN OFFICIAL JOURNAL OF BHARATI VIDYAPEETH'S INSTITUTE OF COMPUTER APPLICATIONS AND MANAGEMENT 2022; 14:3291-3299. [PMID: 35611155 PMCID: PMC9119569 DOI: 10.1007/s41870-022-00949-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/11/2022] [Accepted: 04/13/2022] [Indexed: 12/15/2022]
Abstract
The world was ambushed in 2019 by the COVID-19 virus which affected the health, economy, and lifestyle of individuals worldwide. One way of combating such a public health concern is by using appropriate, rapid, and unbiased diagnostic tools for quick detection of infected people. However, a current dearth of bioinformatics tools necessitates modeling studies to help diagnose COVID-19 cases. Molecular-based methods such as the real-time reverse transcription polymerase chain reaction (rRT-PCR) for detecting COVID-19 is time consuming and prone to contamination. Modern bioinformatics tools have made it possible to create large databases of protein sequences of various diseases, apply data mining techniques, and accurately diagnose diseases. However, the current sequence alignment tools that use these databases are not able to detect novel COVID-19 viral sequences due to high sequence dissimilarity. The objective of this study, therefore, was to develop models that can accurately classify COVID-19 viral sequences rapidly using protein vectors generated by neural word embedding technique. Five machine learning models; K nearest neighbor regression (KNN), support vector machine (SVM), random forest (RF), Linear discriminant analysis (LDA), and Logistic regression were developed using datasets from the National Center for Biotechnology. Our results suggest, the RF model performed better than all other models on the training dataset with 99% accuracy score and 99.5% accuracy on the testing dataset. The implication of this study is that, rapid detection of the COVID-19 virus in suspected cases could potentially save lives as less time will be needed to ascertain the status of a patient.
Collapse
|
78
|
|
79
|
Laatifi M, Douzi S, Bouklouz A, Ezzine H, Jaafari J, Zaid Y, El Ouahidi B, Naciri M. Machine learning approaches in Covid-19 severity risk prediction in Morocco. JOURNAL OF BIG DATA 2022; 9:5. [PMID: 35013702 PMCID: PMC8733912 DOI: 10.1186/s40537-021-00557-0] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/26/2021] [Accepted: 12/22/2021] [Indexed: 05/04/2023]
Abstract
The purpose of this study is to develop and test machine learning-based models for COVID-19 severity prediction. COVID-19 test samples from 337 COVID-19 positive patients at Cheikh Zaid Hospital were grouped according to the severity of their illness. Ours is the first study to estimate illness severity by combining biological and non-biological data from patients with COVID-19. Moreover the use of ML for therapeutic purposes in Morocco is currently restricted, and ours is the first study to investigate the severity of COVID-19. When data analysis approaches were used to uncover patterns and essential characteristics in the data, C-reactive protein, platelets, and D-dimers were determined to be the most associated to COVID-19 severity prediction. In this research, many data reduction algorithms were used, and Machine Learning models were trained to predict the severity of sickness using patient data. A new feature engineering method based on topological data analysis called Uniform Manifold Approximation and Projection (UMAP) shown that it achieves better results. It has 100% accuracy, specificity, sensitivity, and ROC curve in conducting a prognostic prediction using different machine learning classifiers such as X_GBoost, AdaBoost, Random Forest, and ExtraTrees. The proposed approach aims to assist hospitals and medical facilities in determining who should be seen first and who has a higher priority for admission to the hospital.
Collapse
Affiliation(s)
- Mariam Laatifi
- Department of Biology, Faculty of Sciences, Mohammed V University, Rabat, Morocco
| | | | - Abdelaziz Bouklouz
- Laboratory of Pharmacology and Toxicology, Faculty of Medicine and Pharmacy, Rabat, Morocco
| | - Hind Ezzine
- Department of Biology, Faculty of Sciences, Mohammed V University, Rabat, Morocco
| | | | - Younes Zaid
- Department of Biology, Faculty of Sciences, Mohammed V University, Rabat, Morocco
- Research Center of Abulcasis University of Health Sciences, Cheikh Zaïd Hospital, Rabat, Morocco
| | - Bouabid El Ouahidi
- Department of Computer Science, Faculty of Sciences, Mohammed V University, Rabat, Morocco
| | - Mariam Naciri
- Department of Biology, Faculty of Sciences, Mohammed V University, Rabat, Morocco
| |
Collapse
|
80
|
Rostami M, Oussalah M. A novel explainable COVID-19 diagnosis method by integration of feature selection with random forest. INFORMATICS IN MEDICINE UNLOCKED 2022; 30:100941. [PMID: 35399333 PMCID: PMC8985417 DOI: 10.1016/j.imu.2022.100941] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Revised: 04/01/2022] [Accepted: 04/01/2022] [Indexed: 12/12/2022] Open
Abstract
Several Artificial Intelligence-based models have been developed for COVID-19 disease diagnosis. In spite of the promise of artificial intelligence, there are very few models which bridge the gap between traditional human-centered diagnosis and the potential future of machine-centered disease diagnosis. Under the concept of human-computer interaction design, this study proposes a new explainable artificial intelligence method that exploits graph analysis for feature visualization and optimization for the purpose of COVID-19 diagnosis from blood test samples. In this developed model, an explainable decision forest classifier is employed to COVID-19 classification based on routinely available patient blood test data. The approach enables the clinician to use the decision tree and feature visualization to guide the explainability and interpretability of the prediction model. By utilizing this novel feature selection phase, the proposed diagnosis model will not only improve diagnosis accuracy but decrease the execution time as well.
Collapse
Affiliation(s)
- Mehrdad Rostami
- Centre for Machine Vision and Signal Processing, Faculty of Information Technology, University of Oulu, Oulu, Finland
| | - Mourad Oussalah
- Centre for Machine Vision and Signal Processing, Faculty of Information Technology, University of Oulu, Oulu, Finland
- Research Unit of Medical Imaging, Physics, and Technology, Faculty of Medicine, University of Oulu, Finland
| |
Collapse
|
81
|
Yu Z, He L, Luo W, Tse R, Pau G. Deep Learning Hybrid Models for COVID-19 Prediction. JOURNAL OF GLOBAL INFORMATION MANAGEMENT 2022. [DOI: 10.4018/jgim.302890] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
COVID-19 is a highly contagious virus. Blood test is one of effective method for COVID-19 diagnosis. However, the issues of blood test are time-consuming and lack of medical staffs. In this paper, four deep learning hybrid models are proposed to address these issues, i.e., CNN+GRU, CNN+Bi-RNN, CNN+Bi-LSTM, CNN+Bi-GRU. Besides, two best models CNN and CNN+LSTM from Turabieh et al. and Alakus et al. are implemented, respectively. Blood test data from Hospital Israelita Albert Einstein is used to train and test six models. The proposed best model CNN+Bi-GRU is accuracy of 0.9415, precision of 0.9417, recall of 0.9417, F1-score of 0.9417, AUC of 0.91, which outperforms the best models from Turabieh et al. and Alakus et al. Furthermore, the proposed model can help patients to get blood test results faster than traditional manual tests, and do not have errors caused by fatigue. We can envisage a wide deployment of proposed model in hospitals to alleviate the testing pressure from medical workers, especially in developing and underdeveloped countries.
Collapse
Affiliation(s)
- Ziyue Yu
- Faculty of Applied Sciences, Macao Polytechnic University, China, Macao Polytechnic University, China
| | - Lihua He
- Faculty of Applied Sciences, Macao Polytechnic University, China, Macao Polytechnic University, China
| | - Wuman Luo
- Faculty of Applied Sciences, Engineering Research Centre of Applied Technology on Machine Translation and Artificial Intelligence
| | - Rita Tse
- Faculty of Applied Sciences, Engineering Research Centre of Applied Technology on Machine Translation and Artificial Intelligence
| | - Giovanni Pau
- Department of Computer Science and Engineering, University of Bologna, Bologna, Italy, University of Bologna, Italy
| |
Collapse
|
82
|
Ilu SY, Rajesh P, Mohammed H. Prediction of COVID-19 using long short-term memory by integrating principal component analysis and clustering techniques. INFORMATICS IN MEDICINE UNLOCKED 2022; 31:100990. [PMID: 35677733 PMCID: PMC9164683 DOI: 10.1016/j.imu.2022.100990] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2022] [Revised: 06/01/2022] [Accepted: 06/01/2022] [Indexed: 11/27/2022] Open
Abstract
Severe acute respiratory syndrome coronavirus (SARS-COV) is a major family of viruses that cause infections in both animals and humans, including common cold, coronavirus disease (COVID-19), severe acute respiratory syndrome (SARS), and Middle East respiratory syndrome. This study primarily aims to predict the number of COVID-19 positive cases in 36 states of Nigeria using a long short-term memory (LSTM) algorithm of deep learning. The proposed approach employs K-means clustering to detect outliers and principal component analysis (PCA) to select important features from the dataset. The LSTM was chosen because of its non-linear characteristics to handle the dataset. As COVID-19 cases follow non-linear characteristics, LSTM is the most suitable algorithm for predicting their numbers. For comparison, several types of machine learning algorithms, such as naive Bayes, XG-boost, and SVM, were employed. After the comparison, LSTM was observed to be superior among all algorithms.
Collapse
|
83
|
Mansour NA, Saleh AI, Badawy M, Ali HA. Accurate detection of Covid-19 patients based on Feature Correlated Naïve Bayes (FCNB) classification strategy. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING 2022; 13:41-73. [PMID: 33469467 PMCID: PMC7809685 DOI: 10.1007/s12652-020-02883-2] [Citation(s) in RCA: 23] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/22/2020] [Accepted: 12/23/2020] [Indexed: 05/03/2023]
Abstract
The outbreak of Coronavirus (COVID-19) has spread between people around the world at a rapid rate so that the number of infected people and deaths is increasing quickly every day. Accordingly, it is a vital process to detect positive cases at an early stage for treatment and controlling the disease from spreading. Several medical tests had been applied for COVID-19 detection in certain injuries, but with limited efficiency. In this study, a new COVID-19 diagnosis strategy called Feature Correlated Naïve Bayes (FCNB) has been introduced. The FCNB consists of four phases, which are; Feature Selection Phase (FSP), Feature Clustering Phase (FCP), Master Feature Weighting Phase (MFWP), and Feature Correlated Naïve Bayes Phase (FCNBP). The FSP selects only the most effective features among the extracted features from laboratory tests for both COVID-19 patients and non-COVID-19 people by using the Genetic Algorithm as a wrapper method. The FCP constructs many clusters of features based on the selected features from FSP by using a novel clustering technique. These clusters of features are called Master Features (MFs) in which each MF contains a set of dependent features. The MFWP assigns a weight value to each MF by using a new weight calculation method. The FCNBP is used to classify patients based on the weighted Naïve Bayes algorithm with many modifications as the correlation between features. The proposed FCNB strategy has been compared to recent competitive techniques. Experimental results have proven the effectiveness of the FCNB strategy in which it outperforms recent competitive techniques because it achieves the maximum (99%) detection accuracy.
Collapse
Affiliation(s)
- Nehal A. Mansour
- Nile Higher Institute for Engineering and Technology, Mansoura, Egypt
| | - Ahmed I. Saleh
- Computers and Control Department, Faculty of Engineering, Mansoura University, Mansoura, Egypt
| | - Mahmoud Badawy
- Computers and Control Department, Faculty of Engineering, Mansoura University, Mansoura, Egypt
| | - Hesham A. Ali
- Computers and Control Department, Faculty of Engineering, Mansoura University, Mansoura, Egypt
| |
Collapse
|
84
|
Čolaković A, Avdagić-Golub E, Begović M, Memić B, Hasković-Džubur A. Application of machine learning in the fight against the COVID-19 pandemic: A review. ACTA FACULTATIS MEDICAE NAISSENSIS 2022. [DOI: 10.5937/afmnai39-38354] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
Introduction: Machine learning (ML) plays a significant role in the fight against the COVID-19 (officially known as SARS-CoV-2) pandemic. ML techniques enable the rapid detection of patterns and trends in large datasets. Therefore, ML provides efficient methods to generate knowledge from structured and unstructured data. This potential is particularly significant when the pandemic affects all aspects of human life. It is necessary to collect a large amount of data to identify methods to prevent the spread of infection, early detection, reduction of consequences, and finding appropriate medicine. Modern information and communication technologies (ICT) such as the Internet of Things (IoT) allow the collection of large amounts of data from various sources. Thus, we can create predictive ML-based models for assessments, predictions, and decisions. Methods: This is a review article based on previous studies and scientifically proven knowledge. In this paper, bibliometric data from authoritative databases of research publications (Web of Science, Scopus, PubMed) are combined for bibliometric analyses in the context of ML applications for COVID-19. Aim: This paper reviews some ML-based applications used for mitigating COVID-19. We aimed to identify and review ML potentials and solutions for mitigating the COVID-19 pandemic as well as to present some of the most commonly used ML techniques, algorithms, and datasets applied in the context of COVID-19. Also, we provided some insights into specific emerging ideas and open issues to facilitate future research. Conclusion: ML is an effective tool for diagnosing and early detection of symptoms, predicting the spread of a pandemic, developing medicines and vaccines, etc.
Collapse
|
85
|
Wang L, Yin Z, Puppala M, Ezeana C, Wong K, He T, Gotur D, Wong S. A Time-Series Feature-based Recursive Classification Model to Optimize Treatment Strategies for Improving Outcomes and Resource Allocations of COVID-19 Patients. IEEE J Biomed Health Inform 2021; 26:3323-3329. [PMID: 34971548 DOI: 10.1109/jbhi.2021.3139773] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
This paper presents a novel Lasso Logistic Regression model based on feature-based time series data to determine disease severity and when to administer drugs or escalate intervention procedures in patients with coronavirus disease 2019 (COVID-19). Advanced features were extracted from highly enriched and time series vital sign data of hospitalized COVID-19 patients, including oxygen saturation readings, and with a combination of patient demographic and comorbidity information, as inputs into the dynamic feature-based classification model. Such dynamic combinations brought deep insights to guide clinical decision-making of complex COVID-19 cases, including prognosis prediction, timing of drug administration, admission to intensive care units, and application of intervention procedures like ventilation and intubation. The COVID-19 patient classification model was developed utilizing 900 hospitalized COVID-19 patients in a leading multi-hospital system in Texas, United States. By providing mortality prediction based on time-series physiologic data, demographics, and clinical records of individual COVID-19 patients, the dynamic feature-based classification model can be used to improve efficacy of the COVID-19 patient treatment, prioritize medical resources, and reduce casualties. The uniqueness of our model is that it is based on just the first 24 hours of vital sign data such that clinical interventions can be decided early and applied effectively. Such a strategy could be extended to prioritize resource allocations and drug treatment for future pandemic events.
Collapse
|
86
|
Bezzan VP, Rocco CD. Using bi-dimensional representations to understand patterns in COVID-19 blood exam data. INFORMATICS IN MEDICINE UNLOCKED 2021; 28:100828. [PMID: 34981033 PMCID: PMC8716149 DOI: 10.1016/j.imu.2021.100828] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2021] [Revised: 12/15/2021] [Accepted: 12/19/2021] [Indexed: 11/30/2022] Open
Abstract
Blood tests play an essential role in everyday medicine and are used by doctors in several diagnostic procedures. Moreover, this data is multivariate - and often some diseases, such as COVID-19, could have different symptom manifestations and outcomes. This study proposes a method of extracting useful information from blood tests using UMAP technique - Uniform Manifold Approximation and Projection for Dimension Reduction combined with DBSCAN clustering and statistical approaches. The analysis performed here indicates several clusters of infection prevalence varying between 2%-37%, showing that our procedure is indeed capable of finding different patterns. A possible explanation is that COVID-19 is not just a respiratory infection but a systemic disease with critical hematological implications, primarily on white-cell fractions, as indicated by relevant statistical test p -values in the range of 0.03-0.1. The novel analysis procedure proposed could be adopted in other data-sets of different illnesses to help researchers to discover new patterns of data that could be used in various diseases and contexts.
Collapse
Affiliation(s)
- Vitor P Bezzan
- Instituto de Matemática, Estatistica e Computação Científica, Universidade Estadual de Campinas, Brazil
| | - Cleber D Rocco
- Faculdade de Ciências Aplicadas, Universidade Estadual de Campinas, Brazil
| |
Collapse
|
87
|
An Overview of Supervised Machine Learning Methods and Data Analysis for COVID-19 Detection. JOURNAL OF HEALTHCARE ENGINEERING 2021; 2021:4733167. [PMID: 34853669 PMCID: PMC8629644 DOI: 10.1155/2021/4733167] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/11/2021] [Revised: 08/16/2021] [Accepted: 10/11/2021] [Indexed: 12/16/2022]
Abstract
Methods Our analysis and machine learning algorithm is based on most cited two clinical datasets from the literature: one from San Raffaele Hospital Milan Italia and the other from Hospital Israelita Albert Einstein São Paulo Brasilia. The datasets were processed to select the best features that most influence the target, and it turned out that almost all of them are blood parameters. EDA (Exploratory Data Analysis) methods were applied to the datasets, and a comparative study of supervised machine learning models was done, after which the support vector machine (SVM) was selected as the one with the best performance. Results SVM being the best performant is used as our proposed supervised machine learning algorithm. An accuracy of 99.29%, sensitivity of 92.79%, and specificity of 100% were obtained with the dataset from Kaggle (https://www.kaggle.com/einsteindata4u/covid19) after applying optimization to SVM. The same procedure and work were performed with the dataset taken from San Raffaele Hospital (https://zenodo.org/record/3886927#.YIluB5AzbMV). Once more, the SVM presented the best performance among other machine learning algorithms, and 92.86%, 93.55%, and 90.91% for accuracy, sensitivity, and specificity, respectively, were obtained. Conclusion The obtained results, when compared with others from the literature based on these same datasets, are superior, leading us to conclude that our proposed solution is reliable for the COVID-19 diagnosis.
Collapse
|
88
|
Chang Z, Zhan Z, Zhao Z, You Z, Liu Y, Yan Z, Fu Y, Liang W, Zhao L. Application of artificial intelligence in COVID-19 medical area: a systematic review. J Thorac Dis 2021; 13:7034-7053. [PMID: 35070385 PMCID: PMC8743418 DOI: 10.21037/jtd-21-747] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2021] [Accepted: 09/02/2021] [Indexed: 01/08/2023]
Abstract
BACKGROUND Coronavirus disease 2019 (COVID-19) has caused a large-scale global epidemic, impacting international politics and the economy. At present, there is no particularly effective medicine and treatment plan. Therefore, it is urgent and significant to find new technologies to diagnose early, isolate early, and treat early. Multimodal data drove artificial intelligence (AI) can potentially be the option. During the COVID-19 Pandemic, AI provided cutting-edge applications in disease, medicine, treatment, and target recognition. This paper reviewed the literature on the intersection of AI and medicine to analyze and compare different AI model applications in the COVID-19 Pandemic, evaluate their effectiveness, show their advantages and differences, and introduce the main models and their characteristics. METHODS We searched PubMed, arXiv, medRxiv, and Google Scholar through February 2020 to identify studies on AI applications in the medical areas for the COVID-19 Pandemic. RESULTS We summarize the main AI applications in six areas: (I) epidemiology, (II) diagnosis, (III) progression, (IV) treatment, (V) psychological health impact, and (VI) data security. The ongoing development in AI has significantly improved prediction, contact tracing, screening, diagnosis, treatment, medication, and vaccine development for the COVID-19 Pandemic and reducing human intervention in medical practice. DISCUSSION This paper provides strong advice for using AI-based auxiliary tools for related applications of human diseases. We also discuss the clinicians' role in the further development of AI. They and AI researchers can integrate AI technology with current clinical processes and information systems into applications. In the future, AI personnel and medical workers will further cooperate closely.
Collapse
Affiliation(s)
- Zhoulin Chang
- College of Mechanical and Electrical Engineering, Guangdong University of Science and Technology, Dongguan, China
| | - Zhiqing Zhan
- The Third Clinical College, Guangzhou Medical University, Guangzhou, China
| | - Zifan Zhao
- Nanshan College, Guangzhou Medical University, Guangzhou, China
| | - Zhixuan You
- Nanshan College, Guangzhou Medical University, Guangzhou, China
| | - Yang Liu
- School of Information Engineering, Zhengzhou University, Zhengzhou, China
| | - Zhihong Yan
- Kuangji Medical Technology (Guangdong Hengqin) Co., Ltd., Zhuhai, China
| | - Yong Fu
- Kuangji Medical Technology (Guangdong Hengqin) Co., Ltd., Zhuhai, China
| | - Wenhua Liang
- Department of Thoracic Surgery, The First Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
| | - Lei Zhao
- Department of Physiology, School of Basic Medical Sciences, Guangzhou Medical University, Guangzhou, China
| |
Collapse
|
89
|
Campagner A, Carobene A, Cabitza F. External validation of Machine Learning models for COVID-19 detection based on Complete Blood Count. Health Inf Sci Syst 2021; 9:37. [PMID: 34721844 PMCID: PMC8540880 DOI: 10.1007/s13755-021-00167-3] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2021] [Accepted: 09/29/2021] [Indexed: 01/13/2023] Open
Abstract
PURPOSE The rRT-PCR for COVID-19 diagnosis is affected by long turnaround time, potential shortage of reagents, high false-negative rates and high costs. Routine hematochemical tests are a faster and less expensive alternative for diagnosis. Thus, Machine Learning (ML) has been applied to hematological parameters to develop diagnostic tools and help clinicians in promptly managing positive patients. However, few ML models have been externally validated, making their real-world applicability unclear. METHODS We externally validate 6 state-of-the-art diagnostic ML models, based on Complete Blood Count (CBC) and trained on a dataset encompassing 816 COVID-19 positive cases. The external validation was performed based on two datasets, collected at two different hospitals in northern Italy and encompassing 163 and 104 COVID-19 positive cases, in terms of both error rate and calibration. RESULTS AND CONCLUSION We report an average AUC of 95% and average Brier score of 0.11, out-performing existing ML methods, and showing good cross-site transportability. The best performing model (SVM) reported an average AUC of 97.5% (Sensitivity: 87.5%, Specificity: 94%), comparable with the performance of RT-PCR, and was also the best calibrated. The validated models can be useful in the early identification of potential COVID-19 patients, due to the rapid availability of CBC exams, and in multiple test settings.
Collapse
Affiliation(s)
| | - Anna Carobene
- Laboratory Medicine, IRCCS San Raffaele Scientific Institute, Milan, Italy
| | | |
Collapse
|
90
|
Pang KW, Tham SL, Ng LS. Exploring the Clinical Utility of Gustatory Dysfunction (GD) as a Triage Symptom Prior to Reverse Transcription Polymerase Chain Reaction (RT-PCR) in the Diagnosis of COVID-19: A Meta-Analysis and Systematic Review. Life (Basel) 2021; 11:1315. [PMID: 34947846 PMCID: PMC8706269 DOI: 10.3390/life11121315] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2021] [Revised: 11/25/2021] [Accepted: 11/25/2021] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND The diagnosis of COVID-19 is made using reverse transcription polymerase chain reaction (RT-PCR) but its sensitivity varies from 20 to 100%. The presence of gustatory dysfunction (GD) in a patient with upper respiratory tract symptoms might increase the clinical suspicion of COVID-19. AIMS To perform a systematic review and meta-analysis to determine the pooled sensitivity, specificity, positive likelihood ratio (LR+), negative likelihood ratio (LR-) and diagnostic odds ratio (DOR) of using GD as a triage symptom prior to RT-PCR. METHODS PubMed and Embase were searched up to 20 June 2021. Studies published in English were included if they compared the frequency of GD in COVID-19 adult patients (proven by RT-PCR) to COVID-19 negative controls in case control or cross-sectional studies. The Newcastle-Ottawa scale was used to assess the methodological quality of the included studies. RESULTS 21,272 COVID-19 patients and 52,298 COVID-19 negative patients were included across 44 studies from 21 countries. All studies were of moderate to high risk of bias. Patients with GD were more likely to test positive for COVID-19: DOR 6.39 (4.86-8.40), LR+ 3.84 (3.04-4.84), LR- 0.67 (0.64-0.70), pooled sensitivity 0.37 (0.29-0.47) and pooled specificity 0.92 (0.89-0.94). While history/questionnaire-based assessments were predictive of RT-PCR positivity (DOR 6.62 (4.95-8.85)), gustatory testing was not (DOR 3.53 (0.98-12.7)). There was significant heterogeneity among the 44 studies (I2 = 92%, p < 0.01). CONCLUSIONS GD is useful as a symptom to determine if a patient should undergo further testing, especially in resource-poor regions where COVID-19 testing is scarce. Patients with GD may be advised to quarantine while repeated testing is performed if the initial RT-PCR is negative. FUNDING None.
Collapse
Affiliation(s)
- Khang Wen Pang
- Department of Otolaryngology-Head and Neck Surgery, National University Hospital, Singapore 119228, Singapore; (S.-L.T.); (L.S.N.)
| | | | | |
Collapse
|
91
|
Dairi A, Harrou F, Sun Y. Deep Generative Learning-Based 1-SVM Detectors for Unsupervised COVID-19 Infection Detection Using Blood Tests. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT 2021; 71:2500211. [PMID: 35582656 PMCID: PMC8962827 DOI: 10.1109/tim.2021.3130675] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/28/2021] [Revised: 10/03/2021] [Accepted: 11/08/2021] [Indexed: 05/02/2023]
Abstract
A sample blood test has recently become an important tool to help identify false-positive/false-negative real-time reverse transcription polymerase chain reaction (rRT-PCR) tests. Importantly, this is mainly because it is an inexpensive and handy option to detect the potential COVID-19 patients. However, this test should be conducted by certified laboratories, expensive equipment, and trained personnel, and 3-4 h are needed to deliver results. Furthermore, it has relatively large false-negative rates around 15%-20%. Consequently, an alternative and more accessible solution, quicker and less costly, is needed. This article introduces flexible and unsupervised data-driven approaches to detect the COVID-19 infection based on blood test samples. In other words, we address the problem of COVID-19 infection detection using a blood test as an anomaly detection problem through an unsupervised deep hybrid model. Essentially, we amalgamate the features extraction capability of the variational autoencoder (VAE) and the detection sensitivity of the one-class support vector machine (1SVM) algorithm. Two sets of routine blood tests samples from the Albert Einstein Hospital, S ao Paulo, Brazil, and the San Raffaele Hospital, Milan, Italy, are used to assess the performance of the investigated deep learning models. Here, missing values have been imputed based on a random forest regressor. Compared to generative adversarial networks (GANs), deep belief network (DBN), and restricted Boltzmann machine (RBM)-based 1SVM, the traditional VAE, GAN, DBN, and RBM with softmax layer as discriminator layer, and the standalone 1SVM, the proposed VAE-based 1SVM detector offers superior discrimination performance of potential COVID-19 infections. Results also revealed that the deep learning-driven 1SVM detection approaches provide promising detection performance compared to the conventional deep learning models.
Collapse
Affiliation(s)
- Abdelkader Dairi
- Université des Sciences et de la Technologie d’Oran Mohamed-Boudiaf (USTOMB)Oran31000Algérie
- Laboratoire des Technologies de l’Environnement (LTE)Ecole Nationale Polytechnique OranOran31000Algeria
| | - Fouzi Harrou
- Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) DivisionKing Abdullah University of Science and Technology (KAUST)Thuwal23955-6900Saudi Arabia
| | - Ying Sun
- Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) DivisionKing Abdullah University of Science and Technology (KAUST)Thuwal23955-6900Saudi Arabia
| |
Collapse
|
92
|
Marateb HR, Ziaie Nezhad F, Mohebian MR, Sami R, Haghjooy Javanmard S, Dehghan Niri F, Akafzadeh-Savari M, Mansourian M, Mañanas MA, Wolkewitz M, Binder H. Automatic Classification Between COVID-19 and Non-COVID-19 Pneumonia Using Symptoms, Comorbidities, and Laboratory Findings: The Khorshid COVID Cohort Study. Front Med (Lausanne) 2021; 8:768467. [PMID: 34869483 PMCID: PMC8640954 DOI: 10.3389/fmed.2021.768467] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Accepted: 10/06/2021] [Indexed: 01/08/2023] Open
Abstract
Coronavirus disease-2019, also known as severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), was a disaster in 2020. Accurate and early diagnosis of coronavirus disease-2019 (COVID-19) is still essential for health policymaking. Reverse transcriptase-polymerase chain reaction (RT-PCR) has been performed as the operational gold standard for COVID-19 diagnosis. We aimed to design and implement a reliable COVID-19 diagnosis method to provide the risk of infection using demographics, symptoms and signs, blood markers, and family history of diseases to have excellent agreement with the results obtained by the RT-PCR and CT-scan. Our study primarily used sample data from a 1-year hospital-based prospective COVID-19 open-cohort, the Khorshid COVID Cohort (KCC) study. A sample of 634 patients with COVID-19 and 118 patients with pneumonia with similar characteristics whose RT-PCR and chest CT scan were negative (as the control group) (dataset 1) was used to design the system and for internal validation. Two other online datasets, namely, some symptoms (dataset 2) and blood tests (dataset 3), were also analyzed. A combination of one-hot encoding, stability feature selection, over-sampling, and an ensemble classifier was used. Ten-fold stratified cross-validation was performed. In addition to gender and symptom duration, signs and symptoms, blood biomarkers, and comorbidities were selected. Performance indices of the cross-validated confusion matrix for dataset 1 were as follows: sensitivity of 96% [confidence interval, CI, 95%: 94-98], specificity of 95% [90-99], positive predictive value (PPV) of 99% [98-100], negative predictive value (NPV) of 82% [76-89], diagnostic odds ratio (DOR) of 496 [198-1,245], area under the ROC (AUC) of 0.96 [0.94-0.97], Matthews Correlation Coefficient (MCC) of 0.87 [0.85-0.88], accuracy of 96% [94-98], and Cohen's Kappa of 0.86 [0.81-0.91]. The proposed algorithm showed excellent diagnosis accuracy and class-labeling agreement, and fair discriminant power. The AUC on the datasets 2 and 3 was 0.97 [0.96-0.98] and 0.92 [0.91-0.94], respectively. The most important feature was white blood cell count, shortness of breath, and C-reactive protein for datasets 1, 2, and 3, respectively. The proposed algorithm is, thus, a promising COVID-19 diagnosis method, which could be an amendment to simple blood tests and screening of symptoms. However, the RT-PCR and chest CT-scan, performed as the gold standard, are not 100% accurate.
Collapse
Affiliation(s)
- Hamid Reza Marateb
- The Biomedical Engineering Department, Engineering Faculty, University of Isfahan, Isfahan, Iran
| | - Farzad Ziaie Nezhad
- The Biomedical Engineering Department, Engineering Faculty, University of Isfahan, Isfahan, Iran
| | - Mohammad Reza Mohebian
- Department of Electrical and Computer Engineering, University of Saskatchewan, Saskatoon, SK, Canada
| | - Ramin Sami
- Department of Internal Medicine, School of Medicine, Isfahan University of Medical Sciences, Isfahan, Iran
| | - Shaghayegh Haghjooy Javanmard
- Department of Physiology, Applied Physiology Research Center, School of Medicine, Cardiovascular Research Institute, Isfahan University of Medical Sciences, Isfahan, Iran
| | | | - Mahsa Akafzadeh-Savari
- Isfahan Clinical Toxicology Research Center, Isfahan University of Medical Sciences, Isfahan, Iran
| | - Marjan Mansourian
- Automatic Control Department (ESAII), Biomedical Engineering Research Centre (CREB), Universitat Politècnica de Catalunya-Barcelona Tech (UPC), Barcelona, Spain
- Department of Epidemiology and Biostatistics, School of Health, Isfahan University of Medical Sciences, Isfahan, Iran
| | - Miquel Angel Mañanas
- Automatic Control Department (ESAII), Biomedical Engineering Research Centre (CREB), Universitat Politècnica de Catalunya-Barcelona Tech (UPC), Barcelona, Spain
- Biomedical Research Networking Center in Bioengineering, Biomaterials, and Nanomedicine (CIBER-BBN), Madrid, Spain
| | - Martin Wolkewitz
- Faculty of Medicine and Medical Center, Institute of Medical Biometry and Statistics, University of Freiburg, Freiburg, Germany
| | - Harald Binder
- Faculty of Medicine and Medical Center, Institute of Medical Biometry and Statistics, University of Freiburg, Freiburg, Germany
| |
Collapse
|
93
|
Wu J, Shen J, Xu M, Shao M. A novel combined dynamic ensemble selection model for imbalanced data to detect COVID-19 from complete blood count. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2021; 211:106444. [PMID: 34614451 PMCID: PMC8479386 DOI: 10.1016/j.cmpb.2021.106444] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/27/2021] [Accepted: 09/22/2021] [Indexed: 06/01/2023]
Abstract
BACKGROUND As blood testing is radiation-free, low-cost and simple to operate, some researchers use machine learning to detect COVID-19 from blood test data. However, few studies take into consideration the imbalanced data distribution, which can impair the performance of a classifier. METHOD A novel combined dynamic ensemble selection (DES) method is proposed for imbalanced data to detect COVID-19 from complete blood count. This method combines data preprocessing and improved DES. Firstly, we use the hybrid synthetic minority over-sampling technique and edited nearest neighbor (SMOTE-ENN) to balance data and remove noise. Secondly, in order to improve the performance of DES, a novel hybrid multiple clustering and bagging classifier generation (HMCBCG) method is proposed to reinforce the diversity and local regional competence of candidate classifiers. RESULTS The experimental results based on three popular DES methods show that the performance of HMCBCG is better than only use bagging. HMCBCG+KNE obtains the best performance for COVID-19 screening with 99.81% accuracy, 99.86% F1, 99.78% G-mean and 99.81% AUC. CONCLUSION Compared to other advanced methods, our combined DES model can improve accuracy, G-mean, F1 and AUC of COVID-19 screening.
Collapse
Affiliation(s)
- Jiachao Wu
- College of Management and Economics, Tianjin University, Tianjin, 300072, China
| | - Jiang Shen
- College of Management and Economics, Tianjin University, Tianjin, 300072, China
| | - Man Xu
- Business School, Nankai University, Tianjin, 300071, China
| | - Minglai Shao
- School of New Media and Communication, Tianjin University, Tianjin, 300072, China.
| |
Collapse
|
94
|
Babaei Rikan S, Sorayaie Azar A, Ghafari A, Bagherzadeh Mohasefi J, Pirnejad H. COVID-19 Diagnosis from Routine Blood Tests using Artificial Intelligence Techniques. Biomed Signal Process Control 2021; 72:103263. [PMID: 34745318 PMCID: PMC8559794 DOI: 10.1016/j.bspc.2021.103263] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2021] [Revised: 09/30/2021] [Accepted: 10/15/2021] [Indexed: 12/21/2022]
Abstract
Coronavirus disease (COVID-19) is a
unique worldwide pandemic. With new mutations of the virus with higher
transmission rates, it is imperative to diagnose positive cases as
quickly and accurately as possible. Therefore, a fast, accurate, and
automatic system for COVID-19 diagnosis can be very useful for
clinicians. In this study, seven machine learning and four deep learning
models were presented to diagnose positive cases of COVID-19 from three
routine laboratory blood tests datasets. Three correlation coefficient
methods, i.e., Pearson, Spearman, and Kendall, were used to demonstrate
the relevance among samples. A four-fold cross-validation method was used
to train, validate, and test the proposed models. In all three datasets,
the proposed deep neural network (DNN) model achieved the highest values
of accuracy, precision, recall or sensitivity, specificity, F1-Score,
AUC, and MCC. On average, accuracy 92.11%, specificity 84.56%, and AUC
92.20% values have been obtained in the first dataset. In the second
dataset, on average, accuracy 93.16%, specificity 93.02%, and AUC 93.20%
values have been obtained. Finally, in the third dataset, on average, the
values of accuracy 92.5%, specificity 85%, and AUC 92.20% have been
obtained. In this study, we used a statistical t-test to validate the
results. Finally, using artificial intelligence interpretation methods,
important and impactful features in the developed model were presented.
The proposed DNN model can be used as a supplementary tool for diagnosing
COVID-19, which can quickly provide clinicians with highly accurate
diagnoses of positive cases in a timely manner.
Collapse
Affiliation(s)
| | | | - Ali Ghafari
- Medical Physics and Biomedical Engineering Department, Medical Faculty, Tehran University of Medical Sciences, Tehran, Iran
| | | | - Habibollah Pirnejad
- Patient Safety Research Center, Clinical Research Institute, Urmia University of Medical Sciences, Urmia, Iran.,Erasmus School of Health Policy & Management (ESHPM), Erasmus University Rotterdam, Rotterdam, The Netherlands
| |
Collapse
|
95
|
Huyut MT, Üstündağ H. Prediction of diagnosis and prognosis of COVID-19 disease by blood gas parameters using decision trees machine learning model: a retrospective observational study. Med Gas Res 2021; 12:60-66. [PMID: 34677154 PMCID: PMC8562394 DOI: 10.4103/2045-9912.326002] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
The coronavirus disease 2019 (COVID-19) epidemic went down in history as a pandemic caused by corona-viruses that emerged in 2019 and spread rapidly around the world. The different symptoms of COVID-19 made it difficult to understand which variables were more influential on the diagnosis, course and mortality of the disease. Machine learning models can accurately assess hidden patterns among risk factors by analyzing large-datasets to quickly predict diagnosis, prognosis and mortality of diseases. Because of this advantage, the use of machine learning models as decision support systems in health services is increasing. The aim of this study is to determine the diagnosis and prognosis of COVID-19 disease with blood-gas data using the Chi-squared Automatic Interaction Detector (CHAID) decision-tree-model, one of the machine learning methods, which is a subfield of artificial intelligence. This study was carried out on a total of 686 patients with COVID-19 (n = 343) and non-COVID-19 (n = 343) treated at Erzincan-Mengücek-Gazi-Training and Research-Hospital between April 1, 2020 and March 1, 2021. Arterial blood gas values of all patients were obtained from the hospital registry system. While the total-accuracyratio of the decision-tree-model was 65.0% in predicting the prognosis of the disease, it was 68.2% in the diagnosis of the disease. According to the results obtained, the low ionized-calcium value (< 1.10 mM) significantly predicted the need for intensive care of COVID-19 patients. At admission, low-carboxyhemoglobin (< 1.00%), high-pH (> 7.43), low-sodium (< 135.0 mM), hematocrit (< 40.0%), and methemoglobin (< 1.30%) values are important biomarkers in the diagnosis of COVID-19 and the results were promising. The findings in the study may aid in the early-diagnosis of the disease and the intensive-care treatment of patients who are severe. The study was approved by the Ministry of Health and Erzincan University Faculty of Medicine Clinical Research Ethics Committee.
Collapse
Affiliation(s)
- Mehmet Tahir Huyut
- Department of Biostatistics and Medical Informatics, Faculty of Medicine, Erzincan Binali Yıldırım University, Erzincan, Turkey
| | - Hilal Üstündağ
- Department of Physiology, Faculty of Medicine, Erzincan Binali Yıldırım University, Erzincan, Turkey
| |
Collapse
|
96
|
Kaur J, Kaur P. Outbreak COVID-19 in Medical Image Processing Using Deep Learning: A State-of-the-Art Review. ARCHIVES OF COMPUTATIONAL METHODS IN ENGINEERING : STATE OF THE ART REVIEWS 2021; 29:2351-2382. [PMID: 34690493 PMCID: PMC8525064 DOI: 10.1007/s11831-021-09667-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/11/2021] [Accepted: 10/01/2021] [Indexed: 06/13/2023]
Abstract
From the month of December-19, the outbreak of Coronavirus (COVID-19) triggered several deaths and overstated every aspect of individual health. COVID-19 has been designated as a pandemic by World Health Organization. The circumstances placed serious trouble on every country worldwide, particularly with health arrangements and time-consuming responses. The increase in the positive cases of COVID-19 globally spread every day. The quantity of accessible diagnosing kits is restricted because of complications in detecting the existence of the illness. Fast and correct diagnosis of COVID-19 is a timely requirement for the prevention and controlling of the pandemic through suitable isolation and medicinal treatment. The significance of the present work is to discuss the outline of the deep learning techniques with medical imaging such as outburst prediction, virus transmitted indications, detection and treatment aspects, vaccine availability with remedy research. Abundant image resources of medical imaging as X-rays, Computed Tomography Scans, Magnetic Resonance imaging, formulate deep learning high-quality methods to fight against the pandemic COVID-19. The review presents a comprehensive idea of deep learning and its related applications in healthcare received over the past decade. At the last, some issues and confrontations to control the health crisis and outbreaks have been introduced. The progress in technology has contributed to developing individual's lives. The problems faced by the radiologists during medical imaging techniques and deep learning approaches for diagnosing the COVID-19 infections have been also discussed.
Collapse
Affiliation(s)
- Jaspreet Kaur
- Department of Computer Engineering & Technology, Guru Nanak Dev University, Amritsar, Punjab India
| | - Prabhpreet Kaur
- Department of Computer Engineering & Technology, Guru Nanak Dev University, Amritsar, Punjab India
| |
Collapse
|
97
|
Immune and cellular damage biomarkers to predict COVID-19 mortality in hospitalized patients. CURRENT RESEARCH IN IMMUNOLOGY 2021; 2:155-162. [PMID: 34545350 PMCID: PMC8444380 DOI: 10.1016/j.crimmu.2021.09.001] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2021] [Revised: 09/03/2021] [Accepted: 09/10/2021] [Indexed: 01/08/2023] Open
Abstract
Early prediction of COVID-19 in-hospital mortality relies usually on patients' preexisting comorbidities and is rarely reproducible in independent cohorts. We wanted to compare the role of routinely measured biomarkers of immunity, inflammation, and cellular damage with preexisting comorbidities in eight different machine-learning models to predict mortality, and evaluate their performance in an independent population. We recruited and followed-up consecutive adult patients with SARS-Cov-2 infection in two different Italian hospitals. We predicted 60-day mortality in one cohort (development dataset, n = 299 patients, of which 80% was allocated to the development dataset and 20% to the training set) and retested the models in the second cohort (external validation dataset, n = 402). Demographic, clinical, and laboratory features at admission, treatments and disease outcomes were significantly different between the two cohorts. Notably, significant differences were observed for %lymphocytes (p < 0.05), international-normalized-ratio (p < 0.01), platelets, alanine-aminotransferase, creatinine (all p < 0.001). The primary outcome (60-day mortality) was 29.10% (n = 87) in the development dataset, and 39.55% (n = 159) in the external validation dataset. The performance of the 8 tested models on the external validation dataset were similar to that of the holdout test dataset, indicating that the models capture the key predictors of mortality. The shap analysis in both datasets showed that age, immune features (%lymphocytes, platelets) and LDH substantially impacted on all models' predictions, while creatinine and CRP varied among the different models. The model with the better performance was model 8 (60-day mortality AUROC 0.83 ± 0.06 in holdout test set, 0.79 ± 0.02 in external validation dataset). The features that had the greatest impact on this model's prediction were age, LDH, platelets, and %lymphocytes, more than comorbidities or inflammation markers, and these findings were highly consistent in both datasets, likely reflecting the virus effect at the very beginning of the disease.
Collapse
|
98
|
Amin MS, Wozniak M, Barbaric L, Pickard S, Yerrabelli RS, Christensen A, Coiado OC. Experimental Technologies in the Diagnosis and Treatment of COVID-19 in Patients with Comorbidities. JOURNAL OF HEALTHCARE INFORMATICS RESEARCH 2021; 6:48-71. [PMID: 34541448 PMCID: PMC8442516 DOI: 10.1007/s41666-021-00106-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2020] [Revised: 08/05/2021] [Accepted: 09/01/2021] [Indexed: 01/08/2023]
Abstract
The COVID-19 pandemic has impacted the whole world and raised concerns about its effects on different human organ systems. Early detection of COVID-19 may significantly increase the rate of survival; thus, it is critical that the disease is detected early. Emerging technologies have been used to prevent, diagnose, and manage COVID-19 among the populace in the USA and globally. Numerous studies have revealed the growing implementation of novel engineered systems during the intervention at various points of the disease’s pathogenesis, especially as it relates to comorbidities and complications related to cardiovascular and respiratory organ systems. In this review, we provide a succinct, but extensive, review of the pathogenesis of COVID-19, particularly as it relates to angiotensin-converting enzyme 2 (ACE2) as a viral entry point. This is followed by a comprehensive analysis of cardiovascular and respiratory comorbidities of COVID-19 and novel technologies that are used to diagnose and manage hospitalized patients. Continuous cardiorespiratory monitoring systems, novel machine learning algorithms for rapidly triaging patients, various imaging modalities, wearable immunosensors, hotspot tracking systems, and other emerging technologies are reviewed. COVID-19 effects on the immune system, associated inflammatory biomarkers, and innovative therapies are also assessed. Finally, with emphasis on the impact of wearable and non-wearable systems, this review highlights future technologies that could help diagnose, monitor, and mitigate disease progression. Technologies that account for an individual’s health conditions, comorbidities, and even socioeconomic factors can drastically reduce the high mortality seen among many COVID-19 patients, primarily via disease prevention, early detection, and pertinent management.
Collapse
Affiliation(s)
- Md Shahnoor Amin
- Carle Illinois College of Medicine, University of Illinois At Urbana-Champaign, Champaign, IL 61820 USA
| | - Marcin Wozniak
- Beckman Institute for Advanced Science and Technology, Urbana, IL 61801 USA.,Department of Medical Laboratory Diagnostics - Biobank, Medical University of Gdansk, Gdansk, Poland
| | - Lidija Barbaric
- Carle Illinois College of Medicine, University of Illinois At Urbana-Champaign, Champaign, IL 61820 USA
| | - Shanel Pickard
- Carle Illinois College of Medicine, University of Illinois At Urbana-Champaign, Champaign, IL 61820 USA
| | - Rahul S Yerrabelli
- Carle Illinois College of Medicine, University of Illinois At Urbana-Champaign, Champaign, IL 61820 USA
| | - Anton Christensen
- Carle Illinois College of Medicine, University of Illinois At Urbana-Champaign, Champaign, IL 61820 USA
| | - Olivia C Coiado
- Carle Illinois College of Medicine, University of Illinois At Urbana-Champaign, Champaign, IL 61820 USA.,Department of Bioengineering, University of Illinois At Urbana-Champaign, Urbana, IL 61801 USA.,Carle Illinois College of Medicine, 1406 W. Green St, Urbana, IL 61801 USA
| |
Collapse
|
99
|
Elimian KO, Aderinola O, Gibson J, Myles P, Ochu CL, King C, Okwor T, Gaudenzi G, Olayinka A, Zaiyad HG, Ohonsi C, Ebhodaghe B, Dan-Nwafor C, Nwachukwu W, Abdus-Salam IA, Akande OW, Falodun O, Arinze C, Ezeokafor C, Jafiya A, Ojimba A, Aremu JT, Joseph E, Bowale A, Mutiu B, Saka B, Jinadu A, Hamza K, Ibeh C, Bello S, Asuzu M, Mba N, Oladejo J, Ilori E, Alfvén T, Igumbor E, Ihekweazu C. Assessing the capacity of symptom scores to predict COVID-19 positivity in Nigeria: a national derivation and validation cohort study. BMJ Open 2021; 11:e049699. [PMID: 34479936 PMCID: PMC8421116 DOI: 10.1136/bmjopen-2021-049699] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
OBJECTIVES This study aimed to develop and validate a symptom prediction tool for COVID-19 test positivity in Nigeria. DESIGN Predictive modelling study. SETTING All Nigeria States and the Federal Capital Territory. PARTICIPANTS A cohort of 43 221 individuals within the national COVID-19 surveillance dataset from 27 February to 27 August 2020. Complete dataset was randomly split into two equal halves: derivation and validation datasets. Using the derivation dataset (n=21 477), backward multivariable logistic regression approach was used to identify symptoms positively associated with COVID-19 positivity (by real-time PCR) in children (≤17 years), adults (18-64 years) and elderly (≥65 years) patients separately. OUTCOME MEASURES Weighted statistical and clinical scores based on beta regression coefficients and clinicians' judgements, respectively. Using the validation dataset (n=21 744), area under the receiver operating characteristic curve (AUROC) values were used to assess the predictive capacity of individual symptoms, unweighted score and the two weighted scores. RESULTS Overall, 27.6% of children (4415/15 988), 34.6% of adults (9154/26 441) and 40.0% of elderly (317/792) that had been tested were positive for COVID-19. Best individual symptom predictor of COVID-19 positivity was loss of smell in children (AUROC 0.56, 95% CI 0.55 to 0.56), either fever or cough in adults (AUROC 0.57, 95% CI 0.56 to 0.58) and difficulty in breathing in the elderly (AUROC 0.53, 95% CI 0.48 to 0.58) patients. In children, adults and the elderly patients, all scoring approaches showed similar predictive performance. CONCLUSIONS The predictive capacity of various symptom scores for COVID-19 positivity was poor overall. However, the findings could serve as an advocacy tool for more investments in resources for capacity strengthening of molecular testing for COVID-19 in Nigeria.
Collapse
Affiliation(s)
- Kelly Osezele Elimian
- Nigeria Centre for Disease Control, Abuja, Nigeria
- Department of Global Public Health, Karolinska Institutet, Stockholm, Sweden
| | | | - Jack Gibson
- Department of Epidemiology and Public Health, University of Nottingham, Nottingham, UK
| | - Puja Myles
- Department of Epidemiology and Public Health, University of Nottingham, Nottingham, UK
| | | | - Carina King
- Department of Global Public Health, Karolinska Institutet, Stockholm, Sweden
| | - Tochi Okwor
- Nigeria Centre for Disease Control, Abuja, Nigeria
| | - Giulia Gaudenzi
- Department of Global Public Health, Karolinska Institutet, Stockholm, Sweden
| | | | | | | | | | | | | | | | - Oluwatosin Wuraola Akande
- Nigeria Centre for Disease Control, Abuja, Nigeria
- Department of Epidemiology and Community Health, University of Ilorin Teaching Hospital, Ilorin, Nigeria
| | | | | | | | | | | | | | - Emmanuel Joseph
- Kaduna State Infectious Disease Control Center Community Medicine, Kaduna, Nigeria
| | | | | | - Babatunde Saka
- Lagos State Government Ministry of Health, Ikeja, Nigeria
| | | | - Khadeejah Hamza
- Department of Community Medicine, Ahmadu Bello University, Zaria, Nigeria
| | - Christian Ibeh
- Department of Community Medicine, Nnamdi Azikiwe University Teaching Hospital, Nnewi, Nigeria
| | - Shaibu Bello
- Department of Medical Education, Usmanu Danfodiyo University, Sokoto, Nigeria
| | - Michael Asuzu
- Department of Community Medicine, University College Hospital, Ibadan, Nigeria
| | - Nwando Mba
- Nigeria Centre for Disease Control, Abuja, Nigeria
| | - John Oladejo
- Nigeria Centre for Disease Control, Abuja, Nigeria
| | - Elsie Ilori
- Nigeria Centre for Disease Control, Abuja, Nigeria
| | - Tobias Alfvén
- Department of Global Public Health, Karolinska Institutet, Stockholm, Sweden
| | - Ehimario Igumbor
- Nigeria Centre for Disease Control, Abuja, Nigeria
- School of Public Health, University of the Western Cape, Bellville, South Africa
| | | |
Collapse
|
100
|
Rahman T, Al-Ishaq FA, Al-Mohannadi FS, Mubarak RS, Al-Hitmi MH, Islam KR, Khandakar A, Hssain AA, Al-Madeed S, Zughaier SM, Chowdhury MEH. Mortality Prediction Utilizing Blood Biomarkers to Predict the Severity of COVID-19 Using Machine Learning Technique. Diagnostics (Basel) 2021; 11:1582. [PMID: 34573923 PMCID: PMC8469072 DOI: 10.3390/diagnostics11091582] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2021] [Revised: 08/10/2021] [Accepted: 08/25/2021] [Indexed: 12/24/2022] Open
Abstract
Healthcare researchers have been working on mortality prediction for COVID-19 patients with differing levels of severity. A rapid and reliable clinical evaluation of disease intensity will assist in the allocation and prioritization of mortality mitigation resources. The novelty of the work proposed in this paper is an early prediction model of high mortality risk for both COVID-19 and non-COVID-19 patients, which provides state-of-the-art performance, in an external validation cohort from a different population. Retrospective research was performed on two separate hospital datasets from two different countries for model development and validation. In the first dataset, COVID-19 and non-COVID-19 patients were admitted to the emergency department in Boston (24 March 2020 to 30 April 2020), and in the second dataset, 375 COVID-19 patients were admitted to Tongji Hospital in China (10 January 2020 to 18 February 2020). The key parameters to predict the risk of mortality for COVID-19 and non-COVID-19 patients were identified and a nomogram-based scoring technique was developed using the top-ranked five parameters. Age, Lymphocyte count, D-dimer, CRP, and Creatinine (ALDCC), information acquired at hospital admission, were identified by the logistic regression model as the primary predictors of hospital death. For the development cohort, and internal and external validation cohorts, the area under the curves (AUCs) were 0.987, 0.999, and 0.992, respectively. All the patients are categorized into three groups using ALDCC score and death probability: Low (probability < 5%), Moderate (5% < probability < 50%), and High (probability > 50%) risk groups. The prognostic model, nomogram, and ALDCC score will be able to assist in the early identification of both COVID-19 and non-COVID-19 patients with high mortality risk, helping physicians to improve patient management.
Collapse
Affiliation(s)
- Tawsifur Rahman
- Department of Electrical Engineering, Qatar University, Doha 2713, Qatar; (T.R.); (K.R.I.); (A.K.)
| | - Fajer A. Al-Ishaq
- Department of Basic Medical Sciences, College of Medicine, QU Health, Qatar University, Doha 2713, Qatar; (F.A.A.-I.); (F.S.A.-M.); (R.S.M.); (M.H.A.-H.)
| | - Fatima S. Al-Mohannadi
- Department of Basic Medical Sciences, College of Medicine, QU Health, Qatar University, Doha 2713, Qatar; (F.A.A.-I.); (F.S.A.-M.); (R.S.M.); (M.H.A.-H.)
| | - Reem S. Mubarak
- Department of Basic Medical Sciences, College of Medicine, QU Health, Qatar University, Doha 2713, Qatar; (F.A.A.-I.); (F.S.A.-M.); (R.S.M.); (M.H.A.-H.)
| | - Maryam H. Al-Hitmi
- Department of Basic Medical Sciences, College of Medicine, QU Health, Qatar University, Doha 2713, Qatar; (F.A.A.-I.); (F.S.A.-M.); (R.S.M.); (M.H.A.-H.)
| | - Khandaker Reajul Islam
- Department of Electrical Engineering, Qatar University, Doha 2713, Qatar; (T.R.); (K.R.I.); (A.K.)
| | - Amith Khandakar
- Department of Electrical Engineering, Qatar University, Doha 2713, Qatar; (T.R.); (K.R.I.); (A.K.)
| | | | - Somaya Al-Madeed
- Department of Computer Science and Engineering, Qatar University, Doha 2713, Qatar;
| | - Susu M. Zughaier
- Department of Basic Medical Sciences, College of Medicine, QU Health, Qatar University, Doha 2713, Qatar; (F.A.A.-I.); (F.S.A.-M.); (R.S.M.); (M.H.A.-H.)
| | - Muhammad E. H. Chowdhury
- Department of Electrical Engineering, Qatar University, Doha 2713, Qatar; (T.R.); (K.R.I.); (A.K.)
| |
Collapse
|