101
|
Bezzan VP, Rocco CD. Predicting special care during the COVID-19 pandemic: a machine learning approach. Health Inf Sci Syst 2021; 9:34. [PMID: 34413974 PMCID: PMC8363860 DOI: 10.1007/s13755-021-00164-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2021] [Accepted: 08/05/2021] [Indexed: 12/13/2022] Open
Abstract
More than ever, COVID-19 is putting pressure on health systems worldwide, especially in Brazil. In this study, we propose a method based on statistics and machine learning that uses blood lab exam data from patients to predict whether patients will require special care (hospitalization in regular or special-care units). We also predict the number of days the patients will stay under such care. The two-step procedure developed uses Bayesian Optimisation to select the best model among several candidates. This leads us to final models that achieve 0.94 area under ROC curve performance for the first target and 1.87 root mean squared error for the second target (which is a 77% improvement over the mean baseline)—making our model ready to be deployed as a decision system that could be available for everyone interested. The analytical approach can be used in other diseases and can help to plan hospital resources in other contexts.
Collapse
Affiliation(s)
- Vitor P Bezzan
- Instituto de Matemática, Estatistica e Computação Científica - Universidade Estadual de Campinas, Campinas, Brazil
| | - Cleber D Rocco
- Faculdade de Ciências Aplicadas - Universidade Estadual de Campinas, Limeira, Brazil
| |
Collapse
|
102
|
Arora G, Joshi J, Mandal RS, Shrivastava N, Virmani R, Sethi T. Artificial Intelligence in Surveillance, Diagnosis, Drug Discovery and Vaccine Development against COVID-19. Pathogens 2021; 10:1048. [PMID: 34451513 PMCID: PMC8399076 DOI: 10.3390/pathogens10081048] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2021] [Revised: 08/11/2021] [Accepted: 08/11/2021] [Indexed: 12/15/2022] Open
Abstract
As of August 6th, 2021, the World Health Organization has notified 200.8 million laboratory-confirmed infections and 4.26 million deaths from COVID-19, making it the worst pandemic since the 1918 flu. The main challenges in mitigating COVID-19 are effective vaccination, treatment, and agile containment strategies. In this review, we focus on the potential of Artificial Intelligence (AI) in COVID-19 surveillance, diagnosis, outcome prediction, drug discovery and vaccine development. With the help of big data, AI tries to mimic the cognitive capabilities of a human brain, such as problem-solving and learning abilities. Machine Learning (ML), a subset of AI, holds special promise for solving problems based on experiences gained from the curated data. Advances in AI methods have created an unprecedented opportunity for building agile surveillance systems using the deluge of real-time data generated within a short span of time. During the COVID-19 pandemic, many reports have discussed the utility of AI approaches in prioritization, delivery, surveillance, and supply chain of drugs, vaccines, and non-pharmaceutical interventions. This review will discuss the clinical utility of AI-based models and will also discuss limitations and challenges faced by AI systems, such as model generalizability, explainability, and trust as pillars for real-life deployment in healthcare.
Collapse
Affiliation(s)
- Gunjan Arora
- Department of Internal Medicine, Yale University School of Medicine, New Haven, CT 06520, USA
| | - Jayadev Joshi
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44106, USA;
| | - Rahul Shubhra Mandal
- Department of Cancer Biology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA;
| | - Nitisha Shrivastava
- Department of Pathology, Albert Einstein College of Medicine/Montefiore Medical Center, Bronx, NY 10461, USA;
| | - Richa Virmani
- Confo Therapeutics, Technologiepark 94, 9052 Ghent, Belgium;
| | - Tavpritesh Sethi
- Indraprastha Institute of Information Technology, New Delhi 110020, India;
| |
Collapse
|
103
|
Arora G, Joshi J, Mandal RS, Shrivastava N, Virmani R, Sethi T. Artificial Intelligence in Surveillance, Diagnosis, Drug Discovery and Vaccine Development against COVID-19. Pathogens 2021; 10:1048. [PMID: 34451513 PMCID: PMC8399076 DOI: 10.3390/pathogens10081048,] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
As of August 6th, 2021, the World Health Organization has notified 200.8 million laboratory-confirmed infections and 4.26 million deaths from COVID-19, making it the worst pandemic since the 1918 flu. The main challenges in mitigating COVID-19 are effective vaccination, treatment, and agile containment strategies. In this review, we focus on the potential of Artificial Intelligence (AI) in COVID-19 surveillance, diagnosis, outcome prediction, drug discovery and vaccine development. With the help of big data, AI tries to mimic the cognitive capabilities of a human brain, such as problem-solving and learning abilities. Machine Learning (ML), a subset of AI, holds special promise for solving problems based on experiences gained from the curated data. Advances in AI methods have created an unprecedented opportunity for building agile surveillance systems using the deluge of real-time data generated within a short span of time. During the COVID-19 pandemic, many reports have discussed the utility of AI approaches in prioritization, delivery, surveillance, and supply chain of drugs, vaccines, and non-pharmaceutical interventions. This review will discuss the clinical utility of AI-based models and will also discuss limitations and challenges faced by AI systems, such as model generalizability, explainability, and trust as pillars for real-life deployment in healthcare.
Collapse
Affiliation(s)
- Gunjan Arora
- Department of Internal Medicine, Yale University School of Medicine, New Haven, CT 06520, USA
- Correspondence: or
| | - Jayadev Joshi
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44106, USA;
| | - Rahul Shubhra Mandal
- Department of Cancer Biology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA;
| | - Nitisha Shrivastava
- Department of Pathology, Albert Einstein College of Medicine/Montefiore Medical Center, Bronx, NY 10461, USA;
| | - Richa Virmani
- Confo Therapeutics, Technologiepark 94, 9052 Ghent, Belgium;
| | - Tavpritesh Sethi
- Indraprastha Institute of Information Technology, New Delhi 110020, India;
| |
Collapse
|
104
|
AlJame M, Imtiaz A, Ahmad I, Mohammed A. Deep forest model for diagnosing COVID-19 from routine blood tests. Sci Rep 2021; 11:16682. [PMID: 34404838 PMCID: PMC8371014 DOI: 10.1038/s41598-021-95957-w] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2021] [Accepted: 08/03/2021] [Indexed: 02/07/2023] Open
Abstract
The Coronavirus Disease 2019 (COVID-19) global pandemic has threatened the lives of people worldwide and posed considerable challenges. Early and accurate screening of infected people is vital for combating the disease. To help with the limited quantity of swab tests, we propose a machine learning prediction model to accurately diagnose COVID-19 from clinical and/or routine laboratory data. The model exploits a new ensemble-based method called the deep forest (DF), where multiple classifiers in multiple layers are used to encourage diversity and improve performance. The cascade level employs the layer-by-layer processing and is constructed from three different classifiers: extra trees, XGBoost, and LightGBM. The prediction model was trained and evaluated on two publicly available datasets. Experimental results show that the proposed DF model has an accuracy of 99.5%, sensitivity of 95.28%, and specificity of 99.96%. These performance metrics are comparable to other well-established machine learning techniques, and hence DF model can serve as a fast screening tool for COVID-19 patients at places where testing is scarce.
Collapse
Affiliation(s)
- Maryam AlJame
- Department of Computer Engineering, Kuwait University, Kuwait City, Kuwait.
| | | | - Imtiaz Ahmad
- Department of Computer Engineering, Kuwait University, Kuwait City, Kuwait
| | - Ameer Mohammed
- Department of Computer Engineering, Kuwait University, Kuwait City, Kuwait
| |
Collapse
|
105
|
Dorn M, Grisci BI, Narloch PH, Feltes BC, Avila E, Kahmann A, Alho CS. Comparison of machine learning techniques to handle imbalanced COVID-19 CBC datasets. PeerJ Comput Sci 2021; 7:e670. [PMID: 34458574 PMCID: PMC8372002 DOI: 10.7717/peerj-cs.670] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2021] [Accepted: 07/20/2021] [Indexed: 06/13/2023]
Abstract
The Coronavirus pandemic caused by the novel SARS-CoV-2 has significantly impacted human health and the economy, especially in countries struggling with financial resources for medical testing and treatment, such as Brazil's case, the third most affected country by the pandemic. In this scenario, machine learning techniques have been heavily employed to analyze different types of medical data, and aid decision making, offering a low-cost alternative. Due to the urgency to fight the pandemic, a massive amount of works are applying machine learning approaches to clinical data, including complete blood count (CBC) tests, which are among the most widely available medical tests. In this work, we review the most employed machine learning classifiers for CBC data, together with popular sampling methods to deal with the class imbalance. Additionally, we describe and critically analyze three publicly available Brazilian COVID-19 CBC datasets and evaluate the performance of eight classifiers and five sampling techniques on the selected datasets. Our work provides a panorama of which classifier and sampling methods provide the best results for different relevant metrics and discuss their impact on future analyses. The metrics and algorithms are introduced in a way to aid newcomers to the field. Finally, the panorama discussed here can significantly benefit the comparison of the results of new ML algorithms.
Collapse
Affiliation(s)
- Marcio Dorn
- Institute of Informatics, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil
- Center of Biotechnology, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil
- Forensic Science, National Institute of Science and Technology, Porto Alegre, RS, Brazil
| | - Bruno Iochins Grisci
- Institute of Informatics, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil
| | - Pedro Henrique Narloch
- Institute of Informatics, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil
| | - Bruno César Feltes
- Institute of Informatics, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil
- Department of Genetics, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil
| | - Eduardo Avila
- Forensic Science, National Institute of Science and Technology, Porto Alegre, RS, Brazil
- School of Health and Life Sciences, Pontifical Catholic University of Rio Grande do Sul, Porto Alegre, RS, Brazil
| | - Alessandro Kahmann
- Institute of Mathematics, Statistics and Physics, Federal University of Rio Grande, Rio Grande, RS, Brazil
| | - Clarice Sampaio Alho
- Forensic Science, National Institute of Science and Technology, Porto Alegre, RS, Brazil
- School of Health and Life Sciences, Pontifical Catholic University of Rio Grande do Sul, Porto Alegre, RS, Brazil
| |
Collapse
|
106
|
Gladding PA, Ayar Z, Smith K, Patel P, Pearce J, Puwakdandawa S, Tarrant D, Atkinson J, McChlery E, Hanna M, Gow N, Bhally H, Read K, Jayathissa P, Wallace J, Norton S, Kasabov N, Calude CS, Steel D, Mckenzie C. A machine learning PROGRAM to identify COVID-19 and other diseases from hematology data. Future Sci OA 2021; 7:FSO733. [PMID: 34254032 PMCID: PMC8204819 DOI: 10.2144/fsoa-2020-0207] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2020] [Accepted: 05/19/2021] [Indexed: 11/23/2022] Open
Abstract
AIM We propose a method for screening full blood count metadata for evidence of communicable and noncommunicable diseases using machine learning (ML). MATERIALS & METHODS High dimensional hematology metadata was extracted over an 11-month period from Sysmex hematology analyzers from 43,761 patients. Predictive models for age, sex and individuality were developed to demonstrate the personalized nature of hematology data. Both numeric and raw flow cytometry data were used for both supervised and unsupervised ML to predict the presence of pneumonia, urinary tract infection and COVID-19. Heart failure was used as an objective to prove method generalizability. RESULTS Chronological age was predicted by a deep neural network with R2: 0.59; mean absolute error: 12; sex with AUROC: 0.83, phi: 0.47; individuality with 99.7% accuracy, phi: 0.97; pneumonia with AUROC: 0.74, sensitivity 58%, specificity 79%, 95% CI: 0.73-0.75, p < 0.0001; urinary tract infection AUROC: 0.68, sensitivity 52%, specificity 79%, 95% CI: 0.67-0.68, p < 0.0001; COVID-19 AUROC: 0.8, sensitivity 82%, specificity 75%, 95% CI: 0.79-0.8, p = 0.0006; and heart failure area under the receiver operator curve (AUROC): 0.78, sensitivity 72%, specificity 72%, 95% CI: 0.77-0.78; p < 0.0001. CONCLUSION ML applied to hematology data could predict communicable and noncommunicable diseases, both at local and global levels.
Collapse
Affiliation(s)
- Patrick A Gladding
- Department of Cardiology, Waitematā District Health Board, Auckland, New Zealand
| | - Zina Ayar
- Clinical Information Services, Waitematā District Health Board, Auckland, New Zealand
| | - Kevin Smith
- Clinical laboratory, Waitematā District Health Board, Auckland, New Zealand
| | - Prashant Patel
- Clinical laboratory, Waitematā District Health Board, Auckland, New Zealand
| | - Julia Pearce
- Clinical laboratory, Waitematā District Health Board, Auckland, New Zealand
| | | | - Dianne Tarrant
- Clinical laboratory, Waitematā District Health Board, Auckland, New Zealand
| | - Jon Atkinson
- Clinical laboratory, Waitematā District Health Board, Auckland, New Zealand
| | - Elizabeth McChlery
- Clinical laboratory, Waitematā District Health Board, Auckland, New Zealand
| | - Merit Hanna
- Department of Hematology, Waitematā District Health Board, Auckland, New Zealand
| | - Nick Gow
- Department of Infectious diseases, Waitematā District Health Board, Auckland, New Zealand
| | - Hasan Bhally
- Department of Infectious diseases, Waitematā District Health Board, Auckland, New Zealand
| | - Kerry Read
- Department of Infectious diseases, Waitematā District Health Board, Auckland, New Zealand
| | - Prageeth Jayathissa
- Institute for Innovation & Improvement (i3), Waitematā District Health Board, Auckland, New Zealand
| | - Jonathan Wallace
- Institute for Innovation & Improvement (i3), Waitematā District Health Board, Auckland, New Zealand
| | | | - Nick Kasabov
- Knowledge Engineering & Discovery Research Institute (KEDRI), Auckland University of Technology, Auckland, New Zealand
| | - Cristian S Calude
- School of Computer Science, University of Auckland, Auckland, New Zealand
| | | | | |
Collapse
|
107
|
Du R, Tsougenis ED, Ho JWK, Chan JKY, Chiu KWH, Fang BXH, Ng MY, Leung ST, Lo CSY, Wong HYF, Lam HYS, Chiu LFJ, So TY, Wong KT, Wong YCI, Yu K, Yeung YC, Chik T, Pang JWK, Wai AKC, Kuo MD, Lam TPW, Khong PL, Cheung NT, Vardhanabhuti V. Machine learning application for the prediction of SARS-CoV-2 infection using blood tests and chest radiograph. Sci Rep 2021; 11:14250. [PMID: 34244563 PMCID: PMC8270945 DOI: 10.1038/s41598-021-93719-2] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2020] [Accepted: 06/21/2021] [Indexed: 01/08/2023] Open
Abstract
Triaging and prioritising patients for RT-PCR test had been essential in the management of COVID-19 in resource-scarce countries. In this study, we applied machine learning (ML) to the task of detection of SARS-CoV-2 infection using basic laboratory markers. We performed the statistical analysis and trained an ML model on a retrospective cohort of 5148 patients from 24 hospitals in Hong Kong to classify COVID-19 and other aetiology of pneumonia. We validated the model on three temporal validation sets from different waves of infection in Hong Kong. For predicting SARS-CoV-2 infection, the ML model achieved high AUCs and specificity but low sensitivity in all three validation sets (AUC: 89.9-95.8%; Sensitivity: 55.5-77.8%; Specificity: 91.5-98.3%). When used in adjunction with radiologist interpretations of chest radiographs, the sensitivity was over 90% while keeping moderate specificity. Our study showed that machine learning model based on readily available laboratory markers could achieve high accuracy in predicting SARS-CoV-2 infection.
Collapse
Affiliation(s)
- Richard Du
- Department of Diagnostic Radiology, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, SAR, China
- Artificial Intelligence Lab, Head Office Information Technology and Health Informatics Division, Hospital Authority, Hong Kong, SAR, China
| | - Efstratios D Tsougenis
- Artificial Intelligence Lab, Head Office Information Technology and Health Informatics Division, Hospital Authority, Hong Kong, SAR, China
| | - Joshua W K Ho
- The School of Biomedical Sciences, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, SAR, China
| | - Joyce K Y Chan
- Clinical Systems, Information Technology and Health Informatics Division, Hospital Authority, Hong Kong, SAR, China
| | - Keith W H Chiu
- Department of Diagnostic Radiology, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, SAR, China
| | | | - Ming Yen Ng
- Department of Diagnostic Radiology, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, SAR, China
- Department of Medical Imaging, The University of Hong Kong-Shenzhen Hospital, Shenzhen, China
| | - Siu-Ting Leung
- Department of Radiology, Pamela Youde Nethersole Eastern Hospital, Hong Kong, SAR, China
| | - Christine S Y Lo
- Department of Radiology, Hong Kong Sanatorium & Hospital, Hong Kong, SAR, China
| | - Ho-Yuen F Wong
- Department of Radiology, Queen Mary Hospital, Hong Kong, SAR, China
| | - Hiu-Yin S Lam
- Department of Radiology, Queen Mary Hospital, Hong Kong, SAR, China
| | - Long-Fung J Chiu
- Department of Radiology and Imaging, Queen Elizabeth Hospital, Hong Kong, SAR, China
| | - Tiffany Y So
- Department of Imaging and Interventional Radiology, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong, China
| | - Ka Tak Wong
- Department of Imaging and Interventional Radiology, Prince of Wales Hospital, Hong Kong, SAR, China
| | - Yiu Chung I Wong
- Department of Radiology, Tuen Muen Hospital, Hong Kong, SAR, China
| | - Kevin Yu
- Department of Radiology, Tuen Muen Hospital, Hong Kong, SAR, China
| | - Yiu-Cheong Yeung
- Department of Medicine, Princess Margaret Hospital, Hong Kong, SAR, China
| | - Thomas Chik
- Department of Medicine, Princess Margaret Hospital, Hong Kong, SAR, China
| | - Joanna W K Pang
- Health Informatics, Information Technology and Health Informatics Division, Hospital Authority, Hong Kong, SAR, China
| | - Abraham Ka-Chung Wai
- Emergency Medicine Unit, Li Ka Shing, Faculty of Medicine, The University of Hong Kong, Hong Kong, China
| | - Michael D Kuo
- Department of Diagnostic Radiology, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, SAR, China
| | - Tina P W Lam
- Department of Radiology, Queen Mary Hospital, Hong Kong, SAR, China
| | - Pek-Lan Khong
- Department of Diagnostic Radiology, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, SAR, China
| | - Ngai-Tseung Cheung
- Information Technology and Health Informatics Division, Hospital Authority, Hong Kong, SAR, China
| | - Varut Vardhanabhuti
- Department of Diagnostic Radiology, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, SAR, China.
| |
Collapse
|
108
|
Syrowatka A, Kuznetsova M, Alsubai A, Beckman AL, Bain PA, Craig KJT, Hu J, Jackson GP, Rhee K, Bates DW. Leveraging artificial intelligence for pandemic preparedness and response: a scoping review to identify key use cases. NPJ Digit Med 2021; 4:96. [PMID: 34112939 PMCID: PMC8192906 DOI: 10.1038/s41746-021-00459-8] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2020] [Accepted: 04/26/2021] [Indexed: 02/05/2023] Open
Abstract
Artificial intelligence (AI) represents a valuable tool that could be widely used to inform clinical and public health decision-making to effectively manage the impacts of a pandemic. The objective of this scoping review was to identify the key use cases for involving AI for pandemic preparedness and response from the peer-reviewed, preprint, and grey literature. The data synthesis had two parts: an in-depth review of studies that leveraged machine learning (ML) techniques and a limited review of studies that applied traditional modeling approaches. ML applications from the in-depth review were categorized into use cases related to public health and clinical practice, and narratively synthesized. One hundred eighty-three articles met the inclusion criteria for the in-depth review. Six key use cases were identified: forecasting infectious disease dynamics and effects of interventions; surveillance and outbreak detection; real-time monitoring of adherence to public health recommendations; real-time detection of influenza-like illness; triage and timely diagnosis of infections; and prognosis of illness and response to treatment. Data sources and types of ML that were useful varied by use case. The search identified 1167 articles that reported on traditional modeling approaches, which highlighted additional areas where ML could be leveraged for improving the accuracy of estimations or projections. Important ML-based solutions have been developed in response to pandemics, and particularly for COVID-19 but few were optimized for practical application early in the pandemic. These findings can support policymakers, clinicians, and other stakeholders in prioritizing research and development to support operationalization of AI for future pandemics.
Collapse
Affiliation(s)
- Ania Syrowatka
- Division of General Internal Medicine, Brigham and Women's Hospital, Boston, MA, USA.
- Harvard Medical School, Boston, MA, USA.
| | | | - Ava Alsubai
- Division of General Internal Medicine, Brigham and Women's Hospital, Boston, MA, USA
| | - Adam L Beckman
- Harvard Medical School, Boston, MA, USA
- Harvard Business School, Boston, MA, USA
| | - Paul A Bain
- Countway Library of Medicine, Harvard Medical School, Boston, MA, USA
| | | | - Jianying Hu
- IBM Research, Center for Computational Health, Yorktown Heights, NY, USA
| | - Gretchen Purcell Jackson
- IBM Watson Health, Cambridge, MA, USA
- Department of Pediatric Surgery, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Kyu Rhee
- IBM Watson Health, Cambridge, MA, USA
- CVS Health, Wellesley Hills, MA, USA
| | - David W Bates
- Division of General Internal Medicine, Brigham and Women's Hospital, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
- Harvard T. H. Chan School of Public Health, Boston, MA, USA
| |
Collapse
|
109
|
Rasheed J, Jamil A, Hameed AA, Al-Turjman F, Rasheed A. COVID-19 in the Age of Artificial Intelligence: A Comprehensive Review. Interdiscip Sci 2021; 13:153-175. [PMID: 33886097 PMCID: PMC8060789 DOI: 10.1007/s12539-021-00431-w] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2020] [Revised: 04/03/2021] [Accepted: 04/09/2021] [Indexed: 12/23/2022]
Abstract
The recent COVID-19 pandemic, which broke at the end of the year 2019 in Wuhan, China, has infected more than 98.52 million people by today (January 23, 2021) with over 2.11 million deaths across the globe. To combat the growing pandemic on urgent basis, there is need to design effective solutions using new techniques that could exploit recent technology, such as machine learning, deep learning, big data, artificial intelligence, Internet of Things, for identification and tracking of COVID-19 cases in near real time. These technologies have offered inexpensive and rapid solution for proper screening, analyzing, prediction and tracking of COVID-19 positive cases. In this paper, a detailed review of the role of AI as a decisive tool for prognosis, analyze, and tracking the COVID-19 cases is performed. We searched various databases including Google Scholar, IEEE Library, Scopus and Web of Science using a combination of different keywords consisting of COVID-19 and AI. We have identified various applications, where AI can help healthcare practitioners in the process of identification and monitoring of COVID-19 cases. A compact summary of the corona virus cases are first highlighted, followed by the application of AI. Finally, we conclude the paper by highlighting new research directions and discuss the research challenges. Even though scientists and researchers have gathered and exchanged sufficient knowledge over last couple of months, but this structured review also examined technological perspectives while encompassing the medical aspect to help the healthcare practitioners, policymakers, decision makers, policymakers, AI scientists and virologists to quell this infectious COVID-19 pandemic outbreak.
Collapse
Affiliation(s)
- Jawad Rasheed
- Department of Computer Engineering, Istanbul Aydin University, Istanbul, 34295, Turkey.
| | - Akhtar Jamil
- Department of Computer Engineering, Istanbul Sabahattin Zaim University, Istanbul, 34303, Turkey
| | - Alaa Ali Hameed
- Department of Computer Engineering, Istanbul Sabahattin Zaim University, Istanbul, 34303, Turkey
| | - Fadi Al-Turjman
- Artificial Intelligence Engineering Department, Research Center for AI and IoT, Near East University, Nicosia, Mersin 10, Turkey
| | - Ahmad Rasheed
- Department of Electrical and Electronics Engineering, Eastern Mediterranean University, Famagusta, Mersin 10, Turkey
| |
Collapse
|
110
|
Soui M, Mansouri N, Alhamad R, Kessentini M, Ghedira K. NSGA-II as feature selection technique and AdaBoost classifier for COVID-19 prediction using patient's symptoms. NONLINEAR DYNAMICS 2021; 106:1453-1475. [PMID: 34025034 PMCID: PMC8129611 DOI: 10.1007/s11071-021-06504-1] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/02/2021] [Accepted: 04/28/2021] [Indexed: 05/20/2023]
Abstract
Nowadays, humanity is facing one of the most dangerous pandemics known as COVID-19. Due to its high inter-person contagiousness, COVID-19 is rapidly spreading across the world. Positive patients are often suffering from different symptoms that can vary from mild to severe including cough, fever, sore throat, and body aches. In more dire cases, infected patients can experience severe symptoms that can cause breathing difficulties which lead to stern organ failure and die. The medical corps all over the world are overloaded because of the exponentially myriad number of contagions. Therefore, screening for the disease becomes overwrought with the limited tools of test. Additionally, test results may take a long time to acquire, leaving behind a higher potential for the prevalence of the virus among other individuals by the patients. To reduce the chances of infection, we suggest a prediction model that distinguishes the infected COVID-19 cases based on clinical symptoms and features. This model can be helpful for citizens to catch their infection without the need for visiting the hospital. Also, it helps the medical staff in triaging patients in case of a deficiency of medical amenities. In this paper, we use the non-dominated sorting genetic algorithm (NSGA-II) to select the interesting features by finding the best trade-offs between two conflicting objectives: minimizing the number of features and maximizing the weights of selected features. Then, a classification phase is conducted using an AdaBoost classifier. The proposed model is evaluated using two different datasets. To maximize results, we performed a natural selection of hyper-parameters of the classifier using the genetic algorithm. The obtained results prove the efficiency of NSGA-II as a feature selection algorithm combined with AdaBoost classifier. It exhibits higher classification results that outperformed the existing methods.
Collapse
Affiliation(s)
- Makram Soui
- College of Computing and Informatics, Saudi Electronic University, Riyadh, Saudi Arabia
| | | | - Raed Alhamad
- College of Computing and Informatics, Saudi Electronic University, Riyadh, Saudi Arabia
| | | | - Khaled Ghedira
- Private Higher School of Engineering and Technology, Ariana, Tunisia
| |
Collapse
|
111
|
Rehman MU, Shafique A, Khalid S, Driss M, Rubaiee S. Future Forecasting of COVID-19: A Supervised Learning Approach. SENSORS (BASEL, SWITZERLAND) 2021; 21:3322. [PMID: 34064735 PMCID: PMC8150959 DOI: 10.3390/s21103322] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/05/2021] [Revised: 05/01/2021] [Accepted: 05/06/2021] [Indexed: 12/18/2022]
Abstract
A little over a year after the official announcement from the WHO, the COVID-19 pandemic has led to dramatic consequences globally. Today, millions of doses of vaccines have already been administered in several countries. However, the positive effect of these vaccines will probably be seen later than expected. In these circumstances, the rapid diagnosis of COVID-19 still remains the only way to slow the spread of this virus. However, it is difficult to predict whether a person is infected or not by COVID-19 while relying only on apparent symptoms. In this context, we propose to use machine learning (ML) algorithms in order to diagnose COVID-19 infected patients more effectively. The proposed diagnosis method takes into consideration several symptoms, such as flu symptoms, throat pain, immunity status, diarrhea, voice type, body temperature, joint pain, dry cough, vomiting, breathing problems, headache, and chest pain. Based on these symptoms that are modelled as ML features, our proposed method is able to predict the probability of contamination with the COVID-19 virus. This method is evaluated using different experimental analysis metrics such as accuracy, precision, recall, and F1-score. The obtained experimental results have shown that the proposed method can predict the presence of COVID-19 with over 97% accuracy.
Collapse
Affiliation(s)
- Mujeeb Ur Rehman
- Department of Electrical Engineering, Riphah International University, Islamabad 46000, Pakistan; (A.S.); (S.K.)
| | - Arslan Shafique
- Department of Electrical Engineering, Riphah International University, Islamabad 46000, Pakistan; (A.S.); (S.K.)
| | - Sohail Khalid
- Department of Electrical Engineering, Riphah International University, Islamabad 46000, Pakistan; (A.S.); (S.K.)
| | - Maha Driss
- RIADI Laboratory, University of Manouba, Manouba 2010, Tunisia;
- IS Department, College of Computer Science and Engineering, Taibah University, Medina 42353, Saudi Arabia
| | - Saeed Rubaiee
- Department of Industrial and Systems Engineering, College of Engineering, University of Jeddah, Jeddah 21589, Saudi Arabia;
| |
Collapse
|
112
|
Lybarger K, Ostendorf M, Thompson M, Yetisgen M. Extracting COVID-19 diagnoses and symptoms from clinical text: A new annotated corpus and neural event extraction framework. J Biomed Inform 2021; 117:103761. [PMID: 33781918 PMCID: PMC7997694 DOI: 10.1016/j.jbi.2021.103761] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2020] [Revised: 03/02/2021] [Accepted: 03/20/2021] [Indexed: 12/29/2022]
Abstract
Coronavirus disease 2019 (COVID-19) is a global pandemic. Although much has been learned about the novel coronavirus since its emergence, there are many open questions related to tracking its spread, describing symptomology, predicting the severity of infection, and forecasting healthcare utilization. Free-text clinical notes contain critical information for resolving these questions. Data-driven, automatic information extraction models are needed to use this text-encoded information in large-scale studies. This work presents a new clinical corpus, referred to as the COVID-19 Annotated Clinical Text (CACT) Corpus, which comprises 1,472 notes with detailed annotations characterizing COVID-19 diagnoses, testing, and clinical presentation. We introduce a span-based event extraction model that jointly extracts all annotated phenomena, achieving high performance in identifying COVID-19 and symptom events with associated assertion values (0.83-0.97 F1 for events and 0.73-0.79 F1 for assertions). Our span-based event extraction model outperforms an extractor built on MetaMapLite for the identification of symptoms with assertion values. In a secondary use application, we predicted COVID-19 test results using structured patient data (e.g. vital signs and laboratory results) and automatically extracted symptom information, to explore the clinical presentation of COVID-19. Automatically extracted symptoms improve COVID-19 prediction performance, beyond structured data alone.
Collapse
Affiliation(s)
- Kevin Lybarger
- Biomedical & Health Informatics, University of Washington, Box 358047, Seattle, WA 98109, USA.
| | - Mari Ostendorf
- Department of Electrical & Computer Engineering, University of Washington, Campus Box 352500 185, Seattle, WA 98195-2500, USA
| | - Matthew Thompson
- Department of Family Medicine, University of Washington, Box 354696, Seattle, WA 98195-2500, USA
| | - Meliha Yetisgen
- Biomedical & Health Informatics, University of Washington, Box 358047, Seattle, WA 98109, USA
| |
Collapse
|
113
|
Standard blood laboratory values as a clinical support tool to distinguish between SARS-CoV-2 positive and negative patients. Sci Rep 2021; 11:9365. [PMID: 33931692 PMCID: PMC8087776 DOI: 10.1038/s41598-021-88844-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2020] [Accepted: 04/16/2021] [Indexed: 02/08/2023] Open
Abstract
Standard blood laboratory parameters may have diagnostic potential, if polymerase-chain-reaction (PCR) tests are not available on time. We evaluated standard blood laboratory parameters of 655 COVID-19 patients suspected to be infected with SARS-CoV-2, who underwent PCR testing in one of five hospitals in Vienna, Austria. We compared laboratory parameters, clinical characteristics, and outcomes between positive and negative PCR-tested patients and evaluated the ability of those parameters to distinguish between groups. Of the 590 patients (20-100 years, 276 females and 314 males), 208 were PCR-positive. Positive compared to negative PCR-tested patients had significantly lower levels of leukocytes, neutrophils, basophils, eosinophils, lymphocytes, neutrophil-to-lymphocyte ratio, monocytes, and thrombocytes; while significantly higher levels were detected with erythrocytes, hemoglobin, hematocrit, C-reactive-protein, ferritin, activated-partial-thromboplastin-time, alanine-aminotransferase, aspartate-aminotransferase, lipase, creatine-kinase, and lactate-dehydrogenase. From all blood parameters, eosinophils, ferritin, leukocytes, and erythrocytes showed the highest ability to distinguish between COVID-19 positive and negative patients (area-under-curve, AUC: 72.3-79.4%). The AUC of our model was 0.915 (95% confidence intervals, 0.876-0.955). Leukopenia, eosinopenia, elevated erythrocytes, and hemoglobin were among the strongest markers regarding accuracy, sensitivity, specificity, positive and negative predictive value, positive and negative likelihood ratio, and post-test probabilities. Our findings suggest that especially leukopenia, eosinopenia, and elevated hemoglobin are helpful to distinguish between COVID-19 positive and negative tested patients.
Collapse
|
114
|
Chee ML, Ong MEH, Siddiqui FJ, Zhang Z, Lim SL, Ho AFW, Liu N. Artificial Intelligence Applications for COVID-19 in Intensive Care and Emergency Settings: A Systematic Review. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021; 18:ijerph18094749. [PMID: 33947006 PMCID: PMC8125462 DOI: 10.3390/ijerph18094749] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/25/2021] [Revised: 04/20/2021] [Accepted: 04/21/2021] [Indexed: 02/07/2023]
Abstract
Background: Little is known about the role of artificial intelligence (AI) as a decisive technology in the clinical management of COVID-19 patients. We aimed to systematically review and critically appraise the current evidence on AI applications for COVID-19 in intensive care and emergency settings. Methods: We systematically searched PubMed, Embase, Scopus, CINAHL, IEEE Xplore, and ACM Digital Library databases from inception to 1 October 2020, without language restrictions. We included peer-reviewed original studies that applied AI for COVID-19 patients, healthcare workers, or health systems in intensive care, emergency, or prehospital settings. We assessed predictive modelling studies and critically appraised the methodology and key findings of all other studies. Results: Of fourteen eligible studies, eleven developed prognostic or diagnostic AI predictive models, all of which were assessed to be at high risk of bias. Common pitfalls included inadequate sample sizes, poor handling of missing data, failure to account for censored participants, and weak validation of models. Conclusions: Current AI applications for COVID-19 are not ready for deployment in acute care settings, given their limited scope and poor quality. Our findings underscore the need for improvements to facilitate safe and effective clinical adoption of AI applications, for and beyond the COVID-19 pandemic.
Collapse
Affiliation(s)
- Marcel Lucas Chee
- Faculty of Medicine, Nursing and Health Sciences, Monash University, Victoria 3800, Australia;
| | - Marcus Eng Hock Ong
- Duke-NUS Medical School, National University of Singapore, Singapore 169857, Singapore; (M.E.H.O.); (F.J.S.); (A.F.W.H.)
- Department of Emergency Medicine, Singapore General Hospital, Singapore 169608, Singapore
| | - Fahad Javaid Siddiqui
- Duke-NUS Medical School, National University of Singapore, Singapore 169857, Singapore; (M.E.H.O.); (F.J.S.); (A.F.W.H.)
| | - Zhongheng Zhang
- Department of Emergency Medicine, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou 310016, China;
| | - Shir Lynn Lim
- Department of Cardiology, National University Heart Centre, Singapore 119074, Singapore;
| | - Andrew Fu Wah Ho
- Duke-NUS Medical School, National University of Singapore, Singapore 169857, Singapore; (M.E.H.O.); (F.J.S.); (A.F.W.H.)
- Department of Emergency Medicine, Singapore General Hospital, Singapore 169608, Singapore
| | - Nan Liu
- Duke-NUS Medical School, National University of Singapore, Singapore 169857, Singapore; (M.E.H.O.); (F.J.S.); (A.F.W.H.)
- Health Service Research Centre, Singapore Health Services, Singapore 169856, Singapore
- Institute of Data Science, National University of Singapore, Singapore 117602, Singapore
- Correspondence:
| |
Collapse
|
115
|
Chung H, Ko H, Kang WS, Kim KW, Lee H, Park C, Song HO, Choi TY, Seo JH, Lee J. Prediction and Feature Importance Analysis for Severity of COVID-19 in South Korea Using Artificial Intelligence: Model Development and Validation. J Med Internet Res 2021; 23:e27060. [PMID: 33764883 PMCID: PMC8057199 DOI: 10.2196/27060] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2021] [Revised: 02/18/2021] [Accepted: 03/24/2021] [Indexed: 12/29/2022] Open
Abstract
Background The number of deaths from COVID-19 continues to surge worldwide. In particular, if a patient’s condition is sufficiently severe to require invasive ventilation, it is more likely to lead to death than to recovery. Objective The goal of our study was to analyze the factors related to COVID-19 severity in patients and to develop an artificial intelligence (AI) model to predict the severity of COVID-19 at an early stage. Methods We developed an AI model that predicts severity based on data from 5601 COVID-19 patients from all national and regional hospitals across South Korea as of April 2020. The clinical severity of COVID-19 was divided into two categories: low and high severity. The condition of patients in the low-severity group corresponded to no limit of activity, oxygen support with nasal prong or facial mask, and noninvasive ventilation. The condition of patients in the high-severity group corresponded to invasive ventilation, multi-organ failure with extracorporeal membrane oxygenation required, and death. For the AI model input, we used 37 variables from the medical records, including basic patient information, a physical index, initial examination findings, clinical findings, comorbid diseases, and general blood test results at an early stage. Feature importance analysis was performed with AdaBoost, random forest, and eXtreme Gradient Boosting (XGBoost); the AI model for predicting COVID-19 severity among patients was developed with a 5-layer deep neural network (DNN) with the 20 most important features, which were selected based on ranked feature importance analysis of 37 features from the comprehensive data set. The selection procedure was performed using sensitivity, specificity, accuracy, balanced accuracy, and area under the curve (AUC). Results We found that age was the most important factor for predicting disease severity, followed by lymphocyte level, platelet count, and shortness of breath or dyspnea. Our proposed 5-layer DNN with the 20 most important features provided high sensitivity (90.2%), specificity (90.4%), accuracy (90.4%), balanced accuracy (90.3%), and AUC (0.96). Conclusions Our proposed AI model with the selected features was able to predict the severity of COVID-19 accurately. We also made a web application so that anyone can access the model. We believe that sharing the AI model with the public will be helpful in validating and improving its performance.
Collapse
Affiliation(s)
- Heewon Chung
- Department of Artificial Intelligence, The Catholic University of Korea, Bucheon, Republic of Korea
| | - Hoon Ko
- Department of Artificial Intelligence, The Catholic University of Korea, Bucheon, Republic of Korea
| | - Wu Seong Kang
- Department of Trauma Surgery, Jeju Regional Trauma Center, Cheju Halla General Hospital, Jeju, Republic of Korea
| | - Kyung Won Kim
- Radiology and Research Institute of Radiology, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Republic of Korea
| | - Hooseok Lee
- Department of Artificial Intelligence, The Catholic University of Korea, Bucheon, Republic of Korea
| | - Chul Park
- Department of Internal Medicine, Wonkwang University School of Medicine, Iksan, Republic of Korea
| | - Hyun-Ok Song
- Department of Infection Biology, Wonkwang University School of Medicine, Iksan, Republic of Korea
| | - Tae-Young Choi
- Department of Pathology, Wonkwang University School of Medicine, Iksan, Republic of Korea
| | - Jae Ho Seo
- Department of Biochemistry, Wonkwang University School of Medicine, Iksan, Republic of Korea
| | - Jinseok Lee
- Department of Artificial Intelligence, The Catholic University of Korea, Bucheon, Republic of Korea
| |
Collapse
|
116
|
Huang S, Yang J, Fong S, Zhao Q. Artificial intelligence in the diagnosis of COVID-19: challenges and perspectives. Int J Biol Sci 2021; 17:1581-1587. [PMID: 33907522 PMCID: PMC8071762 DOI: 10.7150/ijbs.58855] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2021] [Accepted: 03/06/2021] [Indexed: 12/11/2022] Open
Abstract
Artificial intelligence (AI) is being used to aid in various aspects of the COVID-19 crisis, including epidemiology, molecular research and drug development, medical diagnosis and treatment, and socioeconomics. The association of AI and COVID-19 can accelerate to rapidly diagnose positive patients. To learn the dynamics of a pandemic with relevance to AI, we search the literature using the different academic databases (PubMed, PubMed Central, Scopus, Google Scholar) and preprint servers (bioRxiv, medRxiv, arXiv). In the present review, we address the clinical applications of machine learning and deep learning, including clinical characteristics, electronic medical records, medical images (CT, X-ray, ultrasound images, etc.) in the COVID-19 diagnosis. The current challenges and future perspectives provided in this review can be used to direct an ideal deployment of AI technology in a pandemic.
Collapse
Affiliation(s)
- Shigao Huang
- Cancer Centre, Institute of Translational Medicine, Faculty of Health Sciences, University of Macau 999078, Macau SAR, China
| | - Jie Yang
- Department of Computer and Information Science, University of Macau 999078, Macau SAR, China
- Chongqing Industry & Trade Polytechnic 408000, Chongqing, China
| | - Simon Fong
- Department of Computer and Information Science, University of Macau 999078, Macau SAR, China
| | - Qi Zhao
- Cancer Centre, Institute of Translational Medicine, Faculty of Health Sciences, University of Macau 999078, Macau SAR, China
| |
Collapse
|
117
|
Viana Dos Santos Santana Í, Cm da Silveira A, Sobrinho Á, Chaves E Silva L, Dias da Silva L, Santos DFS, Gurjão EC, Perkusich A. Classification Models for COVID-19 Test Prioritization in Brazil: Machine Learning Approach. J Med Internet Res 2021; 23:e27293. [PMID: 33750734 PMCID: PMC8034680 DOI: 10.2196/27293] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2021] [Revised: 03/08/2021] [Accepted: 03/21/2021] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND Controlling the COVID-19 outbreak in Brazil is a challenge due to the population's size and urban density, inefficient maintenance of social distancing and testing strategies, and limited availability of testing resources. OBJECTIVE The purpose of this study is to effectively prioritize patients who are symptomatic for testing to assist early COVID-19 detection in Brazil, addressing problems related to inefficient testing and control strategies. METHODS Raw data from 55,676 Brazilians were preprocessed, and the chi-square test was used to confirm the relevance of the following features: gender, health professional, fever, sore throat, dyspnea, olfactory disorders, cough, coryza, taste disorders, and headache. Classification models were implemented relying on preprocessed data sets; supervised learning; and the algorithms multilayer perceptron (MLP), gradient boosting machine (GBM), decision tree (DT), random forest (RF), extreme gradient boosting (XGBoost), k-nearest neighbors (KNN), support vector machine (SVM), and logistic regression (LR). The models' performances were analyzed using 10-fold cross-validation, classification metrics, and the Friedman and Nemenyi statistical tests. The permutation feature importance method was applied for ranking the features used by the classification models with the highest performances. RESULTS Gender, fever, and dyspnea were among the highest-ranked features used by the classification models. The comparative analysis presents MLP, GBM, DT, RF, XGBoost, and SVM as the highest performance models with similar results. KNN and LR were outperformed by the other algorithms. Applying the easy interpretability as an additional comparison criterion, the DT was considered the most suitable model. CONCLUSIONS The DT classification model can effectively (with a mean accuracy≥89.12%) assist COVID-19 test prioritization in Brazil. The model can be applied to recommend the prioritizing of a patient who is symptomatic for COVID-19 testing.
Collapse
Affiliation(s)
| | | | - Álvaro Sobrinho
- Federal University of the Agreste of Pernambuco, Garanhuns, Brazil.,Federal University of Alagoas, Maceió, Brazil
| | | | | | | | - Edmar C Gurjão
- Federal University of Campina Grande, Campina Grande, Brazil
| | | |
Collapse
|
118
|
Alballa N, Al-Turaiki I. Machine learning approaches in COVID-19 diagnosis, mortality, and severity risk prediction: A review. INFORMATICS IN MEDICINE UNLOCKED 2021; 24:100564. [PMID: 33842685 PMCID: PMC8018906 DOI: 10.1016/j.imu.2021.100564] [Citation(s) in RCA: 80] [Impact Index Per Article: 26.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2021] [Revised: 03/26/2021] [Accepted: 03/27/2021] [Indexed: 02/06/2023] Open
Abstract
The existence of widespread COVID-19 infections has prompted worldwide efforts to control and manage the virus, and hopefully curb it completely. One important line of research is the use of machine learning (ML) to understand and fight COVID-19. This is currently an active research field. Although there are already many surveys in the literature, there is a need to keep up with the rapidly growing number of publications on COVID-19-related applications of ML. This paper presents a review of recent reports on ML algorithms used in relation to COVID-19. We focus on the potential of ML for two main applications: diagnosis of COVID-19 and prediction of mortality risk and severity, using readily available clinical and laboratory data. Aspects related to algorithm types, training data sets, and feature selection are discussed. As we cover work published between January 2020 and January 2021, a few key points have come to light. The bulk of the machine learning algorithms used in these two applications are supervised learning algorithms. The established models are yet to be used in real-world implementations, and much of the associated research is experimental. The diagnostic and prognostic features discovered by ML models are consistent with results presented in the medical literature. A limitation of the existing applications is the use of imbalanced data sets that are prone to selection bias.
Collapse
Affiliation(s)
- Norah Alballa
- Computer Science Department, College of Computer and Information Sciences, King Saud University, Saudi Arabia
| | - Isra Al-Turaiki
- Information Technology Department, College of Computer and Information Sciences, King Saud University, Saudi Arabia
| |
Collapse
|
119
|
Chen Z, Russo NW, Miller MM, Murphy RX, Burmeister DB. An observational study to develop a scoring system and model to detect risk of hospital admission due to COVID-19. J Am Coll Emerg Physicians Open 2021; 2:e12406. [PMID: 33817689 PMCID: PMC8011617 DOI: 10.1002/emp2.12406] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2020] [Revised: 01/28/2021] [Accepted: 02/24/2021] [Indexed: 12/20/2022] Open
Abstract
BACKGROUND COVID-19 has caused an unprecedented global health emergency. The strains of such a pandemic can overwhelm hospital capacity. Efficient clinical decision-making is crucial for proper healthcare resource utilization in this crisis. Using observational study data, we set out to create a predictive model that could anticipate which COVID-19 patients would likely be admitted and developed a scoring tool that could be used in the clinical setting and for population risk stratification. METHODS We retrospectively evaluated data from COVID-19 patients across a network of 6 hospitals in northeastern Pennsylvania. Analysis was limited to age, gender, and historical variables. After creating a variable importance plot, we chose a selection of the best predictors to train a logistic regression model. Variable selection was done using a lasso regularization technique. Using the coefficients in our logistic regression model, we then created a scoring tool and validated the score on a test set data. RESULTS A total of 6485 COVID-19 patients were included in our analysis, of which 707 were hospitalized. The biggest predictors of patient hospitalization included age, a history of hypertension, diabetes, chronic heart disease, gender, tobacco use, and chronic kidney disease. The logistic regression model demonstrated an AUC of 0.81. The coefficients for our logistic regression model were used to develop a scoring tool. Low-, intermediate-, and high-risk patients were deemed to have a 3.5%, 26%, and 38% chance of hospitalization, respectively. The best predictors of hospitalization included age (odds ratio [OR] = 1.03, confidence interval [CI] = 1.02-1.03), diabetes (OR = 2.08, CI = 1.69-2.57), hypertension (OR = 2.36, CI = 1.90-2.94), chronic heart disease (OR = 1.53, CI = 1.22-1.91), and male gender (OR = 1.32, CI = 1.11-1.58). CONCLUSIONS Using retrospective observational data from a 6-hospital network, we determined risk factors for admission and developed a predictive model and scoring tool for use in the clinical and population setting that could anticipate admission for COVID-19 patients.
Collapse
Affiliation(s)
- Zhe Chen
- Department of Emergency and Hospital Medicine/USF Morsani College of Medicine, Lehigh Valley Health NetworkAllentownPennsylvaniaUSA
| | - Nicholas W. Russo
- Department of Emergency and Hospital Medicine/USF Morsani College of Medicine, Lehigh Valley Health NetworkAllentownPennsylvaniaUSA
| | - Matthew M. Miller
- Department of Emergency and Hospital Medicine/USF Morsani College of Medicine, Lehigh Valley Health NetworkAllentownPennsylvaniaUSA
| | - Robert X. Murphy
- Department of SurgeryDivision of Plastic Surgery, Lehigh Valley Health NetworkAllentownPennsylvaniaUSA
| | - David B. Burmeister
- Department of Emergency and Hospital Medicine/USF Morsani College of Medicine, Lehigh Valley Health NetworkAllentownPennsylvaniaUSA
| |
Collapse
|
120
|
Elkorany AS, Elsharkawy ZF. COVIDetection-Net: A tailored COVID-19 detection from chest radiography images using deep learning. OPTIK 2021; 231:166405. [PMID: 33551492 PMCID: PMC7848537 DOI: 10.1016/j.ijleo.2021.166405] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/22/2020] [Accepted: 01/25/2021] [Indexed: 05/22/2023]
Abstract
In this study, a medical system based on Deep Learning (DL) which we called "COVIDetection-Net" is proposed for automatic detection of new corona virus disease 2019 (COVID-19) infection from chest radiography images (CRIs). The proposed system is based on ShuffleNet and SqueezeNet architecture to extract deep learned features and Multiclass Support Vector Machines (MSVM) for detection and classification. Our dataset contains 1200 CRIs that collected from two different publicly available databases. Extensive experiments were carried out using the proposed model. The highest detection accuracy of 100 % for COVID/NonCOVID, 99.72 % for COVID/Normal/pneumonia and 94.44 % for COVID/Normal/Bacterial pneumonia/Viral pneumonia have been obtained. The proposed system superior all published methods in recall, specificity, precision, F1-Score and accuracy. Confusion Matrix (CM) and Receiver Operation Characteristics (ROC) analysis are also used to depict the performance of the proposed model. Hence the proposed COVIDetection-Net can serve as an efficient system in the current state of COVID-19 pandemic and can be used in everywhere that are facing shortage of test kits.
Collapse
Affiliation(s)
- Ahmed S Elkorany
- Dept. of Electronics and Electrical Comm. Eng., Faculty of Electronic Engineering, Menouf, 32952, Menoufia University, Egypt
- High Institute of Electronic Engineering, Ministry of Higher Education and Scientific Research, Belbeis, Elsharkia, Egypt
| | - Zeinab F Elsharkawy
- Engineering Department, Nuclear Research Center, Atomic Energy Authority, Cairo, Egypt
| |
Collapse
|
121
|
A novel classifier architecture based on deep neural network for COVID-19 detection using laboratory findings. Appl Soft Comput 2021; 106:107329. [PMID: 33758581 PMCID: PMC7972831 DOI: 10.1016/j.asoc.2021.107329] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2021] [Revised: 02/15/2021] [Accepted: 03/07/2021] [Indexed: 01/08/2023]
Abstract
Unfortunately, Coronavirus disease 2019 (COVID-19) is spreading rapidly all over the world. Along with causing many deaths, it has substantially affected the social life, economics, and infrastructure worldwide in a negative manner. Therefore, it is very important to be able to diagnose the COVID-19 quickly and correctly. In this study, a new feature group based on laboratory findings was obtained considering ethnical and genetic differences for interpretation of blood data. Then, using this feature group, a new hybrid classifier architecture based on deep learning was designed and COVID-19 detection was made. Classification performance indicators were obtained as accuracy of 94.95%, F1-score of 94.98%, precision of 94.98%, recall of 94.98% and AUC of 100%. Achieved results were compared with those of the deep learning classifiers suggested in literature. According to these results, proposed method shows superior performance and can provide more convenience and precision to experts for diagnosis of COVID-19 disease.
Collapse
|
122
|
Goodswen SJ, Barratt JLN, Kennedy PJ, Kaufer A, Calarco L, Ellis JT. Machine learning and applications in microbiology. FEMS Microbiol Rev 2021; 45:6174022. [PMID: 33724378 PMCID: PMC8498514 DOI: 10.1093/femsre/fuab015] [Citation(s) in RCA: 56] [Impact Index Per Article: 18.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2021] [Accepted: 02/28/2021] [Indexed: 12/15/2022] Open
Abstract
To understand the intricacies of microorganisms at the molecular level requires making sense of copious volumes of data such that it may now be humanly impossible to detect insightful data patterns without an artificial intelligence application called machine learning. Applying machine learning to address biological problems is expected to grow at an unprecedented rate, yet it is perceived by the uninitiated as a mysterious and daunting entity entrusted to the domain of mathematicians and computer scientists. The aim of this review is to identify key points required to start the journey of becoming an effective machine learning practitioner. These key points are further reinforced with an evaluation of how machine learning has been applied so far in a broad scope of real-life microbiology examples. This includes predicting drug targets or vaccine candidates, diagnosing microorganisms causing infectious diseases, classifying drug resistance against antimicrobial medicines, predicting disease outbreaks and exploring microbial interactions. Our hope is to inspire microbiologists and other related researchers to join the emerging machine learning revolution.
Collapse
Affiliation(s)
- Stephen J Goodswen
- School of Life Sciences, University of Technology Sydney (UTS), Ultimo, NSW, Australia
| | - Joel L N Barratt
- Parasitic Diseases Branch, Division of Parasitic Diseases and Malaria, Center for Global Health, Centers for Disease Control and Prevention, Atlanta, GA, USA
| | - Paul J Kennedy
- School of Computer Science, Faculty of Engineering and Information Technology and the Australian Artificial Intelligence Institute, University of Technology Sydney (UTS), Ultimo, NSW, Australia
| | - Alexa Kaufer
- School of Life Sciences, University of Technology Sydney (UTS), Ultimo, NSW, Australia
| | - Larissa Calarco
- School of Life Sciences, University of Technology Sydney (UTS), Ultimo, NSW, Australia
| | - John T Ellis
- School of Life Sciences, University of Technology Sydney (UTS), Ultimo, NSW, Australia
| |
Collapse
|
123
|
Alves MA, Castro GZ, Oliveira BAS, Ferreira LA, Ramírez JA, Silva R, Guimarães FG. Explaining machine learning based diagnosis of COVID-19 from routine blood tests with decision trees and criteria graphs. Comput Biol Med 2021; 132:104335. [PMID: 33812263 PMCID: PMC7962588 DOI: 10.1016/j.compbiomed.2021.104335] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2020] [Revised: 02/25/2021] [Accepted: 03/10/2021] [Indexed: 01/08/2023]
Abstract
The sudden outbreak of coronavirus disease 2019 (COVID-19) revealed the need for fast and reliable automatic tools to help health teams. This paper aims to present understandable solutions based on Machine Learning (ML) techniques to deal with COVID-19 screening in routine blood tests. We tested different ML classifiers in a public dataset from the Hospital Albert Einstein, São Paulo, Brazil. After cleaning and pre-processing the data has 608 patients, of which 84 are positive for COVID-19 confirmed by RT-PCR. To understand the model decisions, we introduce (i) a local Decision Tree Explainer (DTX) for local explanation and (ii) a Criteria Graph to aggregate these explanations and portrait a global picture of the results. Random Forest (RF) classifier achieved the best results (accuracy 0.88, F1-score 0.76, sensitivity 0.66, specificity 0.91, and AUROC 0.86). By using DTX and Criteria Graph for cases confirmed by the RF, it was possible to find some patterns among the individuals able to aid the clinicians to understand the interconnection among the blood parameters either globally or on a case-by-case basis. The results are in accordance with the literature and the proposed methodology may be embedded in an electronic health record system.
Collapse
Affiliation(s)
- Marcos Antonio Alves
- Graduate Program in Electrical Engineering, Universidade Federal de Minas Gerais, Av. Antônio Carlos 6627, 31270-901, Belo Horizonte, MG, Brazil
| | - Giulia Zanon Castro
- Graduate Program in Electrical Engineering, Universidade Federal de Minas Gerais, Av. Antônio Carlos 6627, 31270-901, Belo Horizonte, MG, Brazil.
| | - Bruno Alberto Soares Oliveira
- Graduate Program in Electrical Engineering, Universidade Federal de Minas Gerais, Av. Antônio Carlos 6627, 31270-901, Belo Horizonte, MG, Brazil
| | - Leonardo Augusto Ferreira
- Graduate Program in Electrical Engineering, Universidade Federal de Minas Gerais, Av. Antônio Carlos 6627, 31270-901, Belo Horizonte, MG, Brazil
| | - Jaime Arturo Ramírez
- Department of Electrical Engineering, Federal University of Minas Gerais, Brazil.
| | - Rodrigo Silva
- Department of Computer Science, Federal University of Ouro Preto, Brazil.
| | | |
Collapse
|
124
|
Shaban WM, Rabie AH, Saleh AI, Abo-Elsoud MA. Detecting COVID-19 patients based on fuzzy inference engine and Deep Neural Network. Appl Soft Comput 2021; 99:106906. [PMID: 33204229 PMCID: PMC7659585 DOI: 10.1016/j.asoc.2020.106906] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2020] [Revised: 11/02/2020] [Accepted: 11/10/2020] [Indexed: 12/23/2022]
Abstract
COVID-19, as an infectious disease, has shocked the world and still threatens the lives of billions of people. Recently, the detection of coronavirus (COVID-19) is a critical task for the medical practitioner. Unfortunately, COVID-19 spreads so quickly between people and approaches millions of people worldwide in few months. It is very much essential to quickly and accurately identify the infected people so that prevention of spread can be taken. Although several medical tests have been used to detect certain injuries, the hopefully detection efficiency has not been accomplished yet. In this paper, a new Hybrid Diagnose Strategy (HDS) has been introduced. HDS relies on a novel technique for ranking selected features by projecting them into a proposed Patient Space (PS). A Feature Connectivity Graph (FCG) is constructed which indicates both the weight of each feature as well as the binding degree to other features. The rank of a feature is determined based on two factors; the first is the feature weight, while the second is its binding degree to its neighbors in PS. Then, the ranked features are used to derive the classification model that can classify new persons to decide whether they are infected or not. The classification model is a hybrid model that consists of two classifiers; fuzzy inference engine and Deep Neural Network (DNN). The proposed HDS has been compared against recent techniques. Experimental results have shown that the proposed HDS outperforms the other competitors in terms of the average value of accuracy, precision, recall, and F-measure in which it provides about of 97.658%, 96.756%, 96.55%, and 96.615% respectively. Additionally, HDS provides the lowest error value of 2.342%. Further, the results were validated statistically using Wilcoxon Signed Rank Test and Friedman Test.
Collapse
Affiliation(s)
- Warda M Shaban
- Nile higher institute for engineering and technology, Egypt
| | - Asmaa H Rabie
- Computers and Control Dept. Faculty of engineering, Mansoura University, Egypt
| | - Ahmed I Saleh
- Computers and Control Dept. Faculty of engineering, Mansoura University, Egypt
| | - M A Abo-Elsoud
- Electronics and Communication Dept. Faculty of engineering, Mansoura University, Egypt
| |
Collapse
|
125
|
Bhattacharya S, Reddy Maddikunta PK, Pham QV, Gadekallu TR, Krishnan S SR, Chowdhary CL, Alazab M, Jalil Piran M. Deep learning and medical image processing for coronavirus (COVID-19) pandemic: A survey. SUSTAINABLE CITIES AND SOCIETY 2021; 65:102589. [PMID: 33169099 PMCID: PMC7642729 DOI: 10.1016/j.scs.2020.102589] [Citation(s) in RCA: 109] [Impact Index Per Article: 36.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
Since December 2019, the coronavirus disease (COVID-19) outbreak has caused many death cases and affected all sectors of human life. With gradual progression of time, COVID-19 was declared by the world health organization (WHO) as an outbreak, which has imposed a heavy burden on almost all countries, especially ones with weaker health systems and ones with slow responses. In the field of healthcare, deep learning has been implemented in many applications, e.g., diabetic retinopathy detection, lung nodule classification, fetal localization, and thyroid diagnosis. Numerous sources of medical images (e.g., X-ray, CT, and MRI) make deep learning a great technique to combat the COVID-19 outbreak. Motivated by this fact, a large number of research works have been proposed and developed for the initial months of 2020. In this paper, we first focus on summarizing the state-of-the-art research works related to deep learning applications for COVID-19 medical image processing. Then, we provide an overview of deep learning and its applications to healthcare found in the last decade. Next, three use cases in China, Korea, and Canada are also presented to show deep learning applications for COVID-19 medical image processing. Finally, we discuss several challenges and issues related to deep learning implementations for COVID-19 medical image processing, which are expected to drive further studies in controlling the outbreak and controlling the crisis, which results in smart healthy cities.
Collapse
Affiliation(s)
- Sweta Bhattacharya
- School of Information Technology and Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu, India
| | | | - Quoc-Viet Pham
- Research Institute of Computer, Information and Communication, Pusan National University, Busan 46241, Republic of Korea
| | - Thippa Reddy Gadekallu
- School of Information Technology and Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu, India
| | - Siva Rama Krishnan S
- School of Information Technology and Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu, India
| | - Chiranji Lal Chowdhary
- School of Information Technology and Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu, India
| | - Mamoun Alazab
- College of Engineering, IT & Environment, Charles Darwin University, Australia
| | - Md Jalil Piran
- Department of Computer Science and Engineering, Sejong University, 05006, Seoul, Republic of Korea
| |
Collapse
|
126
|
Soltan AAS, Kouchaki S, Zhu T, Kiyasseh D, Taylor T, Hussain ZB, Peto T, Brent AJ, Eyre DW, Clifton DA. Rapid triage for COVID-19 using routine clinical data for patients attending hospital: development and prospective validation of an artificial intelligence screening test. Lancet Digit Health 2021; 3:e78-e87. [PMID: 33509388 PMCID: PMC7831998 DOI: 10.1016/s2589-7500(20)30274-0] [Citation(s) in RCA: 67] [Impact Index Per Article: 22.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2020] [Revised: 10/20/2020] [Accepted: 11/10/2020] [Indexed: 01/19/2023]
Abstract
BACKGROUND The early clinical course of COVID-19 can be difficult to distinguish from other illnesses driving presentation to hospital. However, viral-specific PCR testing has limited sensitivity and results can take up to 72 h for operational reasons. We aimed to develop and validate two early-detection models for COVID-19, screening for the disease among patients attending the emergency department and the subset being admitted to hospital, using routinely collected health-care data (laboratory tests, blood gas measurements, and vital signs). These data are typically available within the first hour of presentation to hospitals in high-income and middle-income countries, within the existing laboratory infrastructure. METHODS We trained linear and non-linear machine learning classifiers to distinguish patients with COVID-19 from pre-pandemic controls, using electronic health record data for patients presenting to the emergency department and admitted across a group of four teaching hospitals in Oxfordshire, UK (Oxford University Hospitals). Data extracted included presentation blood tests, blood gas testing, vital signs, and results of PCR testing for respiratory viruses. Adult patients (>18 years) presenting to hospital before Dec 1, 2019 (before the first COVID-19 outbreak), were included in the COVID-19-negative cohort; those presenting to hospital between Dec 1, 2019, and April 19, 2020, with PCR-confirmed severe acute respiratory syndrome coronavirus 2 infection were included in the COVID-19-positive cohort. Patients who were subsequently admitted to hospital were included in their respective COVID-19-negative or COVID-19-positive admissions cohorts. Models were calibrated to sensitivities of 70%, 80%, and 90% during training, and performance was initially assessed on a held-out test set generated by an 80:20 split stratified by patients with COVID-19 and balanced equally with pre-pandemic controls. To simulate real-world performance at different stages of an epidemic, we generated test sets with varying prevalences of COVID-19 and assessed predictive values for our models. We prospectively validated our 80% sensitivity models for all patients presenting or admitted to the Oxford University Hospitals between April 20 and May 6, 2020, comparing model predictions with PCR test results. FINDINGS We assessed 155 689 adult patients presenting to hospital between Dec 1, 2017, and April 19, 2020. 114 957 patients were included in the COVID-negative cohort and 437 in the COVID-positive cohort, for a full study population of 115 394 patients, with 72 310 admitted to hospital. With a sensitive configuration of 80%, our emergency department (ED) model achieved 77·4% sensitivity and 95·7% specificity (area under the receiver operating characteristic curve [AUROC] 0·939) for COVID-19 among all patients attending hospital, and the admissions model achieved 77·4% sensitivity and 94·8% specificity (AUROC 0·940) for the subset of patients admitted to hospital. Both models achieved high negative predictive values (NPV; >98·5%) across a range of prevalences (≤5%). We prospectively validated our models for all patients presenting and admitted to Oxford University Hospitals in a 2-week test period. The ED model (3326 patients) achieved 92·3% accuracy (NPV 97·6%, AUROC 0·881), and the admissions model (1715 patients) achieved 92·5% accuracy (97·7%, 0·871) in comparison with PCR results. Sensitivity analyses to account for uncertainty in negative PCR results improved apparent accuracy (ED model 95·1%, admissions model 94·1%) and NPV (ED model 99·0%, admissions model 98·5%). INTERPRETATION Our models performed effectively as a screening test for COVID-19, excluding the illness with high-confidence by use of clinical data routinely available within 1 h of presentation to hospital. Our approach is rapidly scalable, fitting within the existing laboratory testing infrastructure and standard of care of hospitals in high-income and middle-income countries. FUNDING Wellcome Trust, University of Oxford, Engineering and Physical Sciences Research Council, National Institute for Health Research Oxford Biomedical Research Centre.
Collapse
Affiliation(s)
- Andrew A S Soltan
- John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, UK; Division of Cardiovascular Medicine, Radcliffe Department of Medicine, University of Oxford, Oxford, UK.
| | - Samaneh Kouchaki
- Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, UK; Centre for Vision, Speech and Signal Processing, University of Surrey, Guildford, UK
| | - Tingting Zhu
- Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, UK
| | - Dani Kiyasseh
- Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, UK
| | - Thomas Taylor
- Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, UK
| | - Zaamin B Hussain
- Harvard Graduate School of Education and Harvard T H Chan School of Public Health, Harvard University, Boston MA, USA
| | - Tim Peto
- John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, UK; Nuffield Department of Medicine, University of Oxford, Oxford, UK; NIHR Health Protection Research Unit in Healthcare Associated Infections and Antimicrobial Resistance, University of Oxford and Public Health England, Oxford, UK
| | - Andrew J Brent
- John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, UK; Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - David W Eyre
- John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, UK; Big Data Institute, Nuffield Department of Population Health, University of Oxford, Oxford, UK; NIHR Health Protection Research Unit in Healthcare Associated Infections and Antimicrobial Resistance, University of Oxford and Public Health England, Oxford, UK
| | - David A Clifton
- Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, UK
| |
Collapse
|
127
|
Awal MA, Masud M, Hossain MS, Bulbul AAM, Mahmud SMH, Bairagi AK. A Novel Bayesian Optimization-Based Machine Learning Framework for COVID-19 Detection From Inpatient Facility Data. IEEE ACCESS : PRACTICAL INNOVATIONS, OPEN SOLUTIONS 2021; 9:10263-10281. [PMID: 34786301 PMCID: PMC8545233 DOI: 10.1109/access.2021.3050852] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/17/2020] [Accepted: 01/05/2021] [Indexed: 05/04/2023]
Abstract
The whole world faces a pandemic situation due to the deadly virus, namely COVID-19. It takes considerable time to get the virus well-matured to be traced, and during this time, it may be transmitted among other people. To get rid of this unexpected situation, quick identification of COVID-19 patients is required. We have designed and optimized a machine learning-based framework using inpatient's facility data that will give a user-friendly, cost-effective, and time-efficient solution to this pandemic. The proposed framework uses Bayesian optimization to optimize the hyperparameters of the classifier and ADAptive SYNthetic (ADASYN) algorithm to balance the COVID and non-COVID classes of the dataset. Although the proposed technique has been applied to nine state-of-the-art classifiers to show the efficacy, it can be used to many classifiers and classification problems. It is evident from this study that eXtreme Gradient Boosting (XGB) provides the highest Kappa index of 97.00%. Compared to without ADASYN, our proposed approach yields an improvement in the kappa index of 96.94%. Besides, Bayesian optimization has been compared to grid search, random search to show efficiency. Furthermore, the most dominating features have been identified using SHapely Adaptive exPlanations (SHAP) analysis. A comparison has also been made among other related works. The proposed method is capable enough of tracing COVID patients spending less time than that of the conventional techniques. Finally, two potential applications, namely, clinically operable decision tree and decision support system, have been demonstrated to support clinical staff and build a recommender system.
Collapse
Affiliation(s)
- Md. Abdul Awal
- Electronics and Communication Engineering DisciplineKhulna UniversityKhulna9208Bangladesh
| | - Mehedi Masud
- Department of Computer ScienceCollege of Computers and Information TechnologyTaif UniversityTaif21944Saudi Arabia
| | - Md. Shahadat Hossain
- Department of Quantitative SciencesInternational University of Business Agriculture and TechnologyDhaka1230Bangladesh
| | | | - S. M. Hasan Mahmud
- School of Computer Science and EngineeringUniversity of Electronic Science and Technology of ChinaChengdu611731China
| | - Anupam Kumar Bairagi
- Computer Science and Engineering DisciplineKhulna UniversityKhulna9208Bangladesh
| |
Collapse
|
128
|
Xu M, Ouyang L, Han L, Sun K, Yu T, Li Q, Tian H, Safarnejad L, Zhang H, Gao Y, Bao FS, Chen Y, Robinson P, Ge Y, Zhu B, Liu J, Chen S. Accurately Differentiating Between Patients With COVID-19, Patients With Other Viral Infections, and Healthy Individuals: Multimodal Late Fusion Learning Approach. J Med Internet Res 2021; 23:e25535. [PMID: 33404516 PMCID: PMC7790733 DOI: 10.2196/25535] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2020] [Revised: 12/07/2020] [Accepted: 12/17/2020] [Indexed: 12/11/2022] Open
Abstract
BACKGROUND Effectively identifying patients with COVID-19 using nonpolymerase chain reaction biomedical data is critical for achieving optimal clinical outcomes. Currently, there is a lack of comprehensive understanding in various biomedical features and appropriate analytical approaches for enabling the early detection and effective diagnosis of patients with COVID-19. OBJECTIVE We aimed to combine low-dimensional clinical and lab testing data, as well as high-dimensional computed tomography (CT) imaging data, to accurately differentiate between healthy individuals, patients with COVID-19, and patients with non-COVID viral pneumonia, especially at the early stage of infection. METHODS In this study, we recruited 214 patients with nonsevere COVID-19, 148 patients with severe COVID-19, 198 noninfected healthy participants, and 129 patients with non-COVID viral pneumonia. The participants' clinical information (ie, 23 features), lab testing results (ie, 10 features), and CT scans upon admission were acquired and used as 3 input feature modalities. To enable the late fusion of multimodal features, we constructed a deep learning model to extract a 10-feature high-level representation of CT scans. We then developed 3 machine learning models (ie, k-nearest neighbor, random forest, and support vector machine models) based on the combined 43 features from all 3 modalities to differentiate between the following 4 classes: nonsevere, severe, healthy, and viral pneumonia. RESULTS Multimodal features provided substantial performance gain from the use of any single feature modality. All 3 machine learning models had high overall prediction accuracy (95.4%-97.7%) and high class-specific prediction accuracy (90.6%-99.9%). CONCLUSIONS Compared to the existing binary classification benchmarks that are often focused on single-feature modality, this study's hybrid deep learning-machine learning framework provided a novel and effective breakthrough for clinical applications. Our findings, which come from a relatively large sample size, and analytical workflow will supplement and assist with clinical decision support for current COVID-19 diagnostic methods and other clinical applications with high-dimensional multimodal biomedical features.
Collapse
Affiliation(s)
- Ming Xu
- Department of Occupational Disease Prevention, Jiangsu Provincial Center for Disease Control and Prevention, Nanjing, China.,Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, China.,Department of Public Health Sciences, College of Health and Human Services, University of North Carolina Charlotte, Charlotte, NC, United States
| | - Liu Ouyang
- Department of Orthopedics, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Lei Han
- Department of Occupational Disease Prevention, Jiangsu Provincial Center for Disease Control and Prevention, Nanjing, China.,Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, China
| | - Kai Sun
- Department of Emergency Medicine, The First Hospital of Nanjing Medical University, Nanjing, China
| | - Tingting Yu
- Department of Medical Genetics, School of Basic Medical Science Jiangsu Key Laboratory of Xenotransplantation, Nanjing Medical University, Nanjing, China
| | - Qian Li
- Department of Pediatrics, Affiliated Kunshan Hospital of Jiangsu University, Kunshan, China
| | - Hua Tian
- Department of Acute Infectious Disease Control and Prevention, Jiangsu Provincial Center for Disease Control and Prevention, Nanjing, China
| | - Lida Safarnejad
- School of Medicine, Stanford University, Stanford, CA, United States.,Department of Software and Information Systems, College of Computing and Informatics, University of North Carolina Charlotte, Charlotte, NC, United States
| | - Hengdong Zhang
- Department of Occupational Disease Prevention, Jiangsu Provincial Center for Disease Control and Prevention, Nanjing, China.,Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, China
| | - Yue Gao
- Department of Occupational Disease Prevention, Jiangsu Provincial Center for Disease Control and Prevention, Nanjing, China
| | - Forrest Sheng Bao
- Department of Computer Science, Iowa State University, Ames, IA, United States
| | - Yuanfang Chen
- Institute of HIV/AIDS/STI Prevention and Control, Jiangsu Provincial Center for Disease Control and Prevention, Nanjing, China
| | - Patrick Robinson
- Department of Public Health Sciences, College of Health and Human Services, University of North Carolina Charlotte, Charlotte, NC, United States
| | - Yaorong Ge
- Department of Software and Information Systems, College of Computing and Informatics, University of North Carolina Charlotte, Charlotte, NC, United States
| | - Baoli Zhu
- Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, China
| | - Jie Liu
- Department of Radiology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Shi Chen
- Department of Public Health Sciences, College of Health and Human Services, University of North Carolina Charlotte, Charlotte, NC, United States.,School of Data Science, University of North Carolina Charlotte, Charlotte, NC, United States
| |
Collapse
|
129
|
Nayak J, Naik B, Dinesh P, Vakula K, Rao BK, Ding W, Pelusi D. Intelligent system for COVID-19 prognosis: a state-of-the-art survey. APPL INTELL 2021; 51:2908-2938. [PMID: 34764577 PMCID: PMC7786871 DOI: 10.1007/s10489-020-02102-7] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/27/2020] [Indexed: 01/31/2023]
Abstract
This 21st century is notable for experiencing so many disturbances at economic, social, cultural, and political levels in the entire world. The outbreak of novel corona virus 2019 (COVID-19) has been treated as a Public Health crisis of global Concern by the World Health Organization (WHO). Various outbreak models for COVID-19 are being utilized by researchers throughout the world to get well-versed decisions and impose significant control measures. Amid the standard methods for COVID-19 worldwide epidemic prediction, easy statistical, as well as epidemiological methods have got more consideration by researchers and authorities. One main difficulty in controlling the spreading of COVID-19 is the inadequacy and lack of medical tests for detecting as well as identifying a solution. To solve this problem, a few statistical-based advances are being enhanced and turn into a partial resolution up-to some level. To deal with the challenges of the medical field, a broad range of intelligent based methods, frameworks, and equipment have been recommended by Machine Learning (ML) and Deep Learning. As ML and DL have the ability of identifying and predicting patterns in complex large datasets, they are recognized as a suitable procedure for producing effective solutions for the diagnosis of COVID-19. In this paper, a perspective research has been conducted in the applicability of intelligent systems such as ML, DL and others in solving COVID-19 related outbreak issues. The main intention behind this study is (i) to understand the importance of intelligent approaches such as ML and DL for COVID-19 pandemic, (ii) discussing the efficiency and impact of these methods in the prognosis of COVID-19, (iii) the growth in the development of type of ML and advanced ML methods for COVID-19 prognosis,(iv) analyzing the impact of data types and the nature of data along with challenges in processing the data for COVID-19,(v) to focus on some future challenges in COVID-19 prognosis to inspire the researchers for innovating and enhancing their knowledge and research on other impacted sectors due to COVID-19.
Collapse
Affiliation(s)
- Janmenjoy Nayak
- Department of Computer Science and Engineering, Aditya Institute of Technology and Management (AITAM), K Kotturu, Tekkali, AP 532201 India
| | - Bighnaraj Naik
- Department of Computer Application, Veer Surendra Sai University of Technology, Burla, Odisha 768018 India
| | - Paidi Dinesh
- Department of Computer Science and Engineering, Sri Sivani College of Engineering, Srikakulam, AP 532402 India
| | - Kanithi Vakula
- Department of Computer Science and Engineering, Sri Sivani College of Engineering, Srikakulam, AP 532402 India
| | - B. Kameswara Rao
- Department of Computer Science and Engineering, Aditya Institute of Technology and Management (AITAM), K Kotturu, Tekkali, AP 532201 India
| | - Weiping Ding
- School of Information Science and Technology, Nantong University, Nantong, China
| | - Danilo Pelusi
- Faculty of Communication Sciences, University of Teramo, Coste Sant', Agostino Campus, Teramo, Italy
| |
Collapse
|
130
|
Tayarani N MH. Applications of artificial intelligence in battling against covid-19: A literature review. CHAOS, SOLITONS, AND FRACTALS 2021; 142:110338. [PMID: 33041533 PMCID: PMC7532790 DOI: 10.1016/j.chaos.2020.110338] [Citation(s) in RCA: 59] [Impact Index Per Article: 19.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/21/2020] [Accepted: 10/01/2020] [Indexed: 05/14/2023]
Abstract
Colloquially known as coronavirus, the Severe Acute Respiratory Syndrome CoronaVirus 2 (SARS-CoV-2), that causes CoronaVirus Disease 2019 (COVID-19), has become a matter of grave concern for every country around the world. The rapid growth of the pandemic has wreaked havoc and prompted the need for immediate reactions to curb the effects. To manage the problems, many research in a variety of area of science have started studying the issue. Artificial Intelligence is among the area of science that has found great applications in tackling the problem in many aspects. Here, we perform an overview on the applications of AI in a variety of fields including diagnosis of the disease via different types of tests and symptoms, monitoring patients, identifying severity of a patient, processing covid-19 related imaging tests, epidemiology, pharmaceutical studies, etc. The aim of this paper is to perform a comprehensive survey on the applications of AI in battling against the difficulties the outbreak has caused. Thus we cover every way that AI approaches have been employed and to cover all the research until the writing of this paper. We try organize the works in a way that overall picture is comprehensible. Such a picture, although full of details, is very helpful in understand where AI sits in current pandemonium. We also tried to conclude the paper with ideas on how the problems can be tackled in a better way and provide some suggestions for future works.
Collapse
Affiliation(s)
- Mohammad-H Tayarani N
- Biocomputation Group, School of Computer Science, University of Hertfordshire, Hatfield, AL10 9AB, United Kingdom
| |
Collapse
|
131
|
Huang D, Miao H, Zhang Z, Yang Y, Zhang L, Lure FYM, Wang Z, Jaeger S, Guo L, Xu T, Liu J. Longitudinal changes of laboratory measurements after discharged from hospital in 268 COVID-19 pneumonia patients. JOURNAL OF X-RAY SCIENCE AND TECHNOLOGY 2021; 29:741-762. [PMID: 34397444 DOI: 10.3233/xst-210920] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
BACKGROUND AND OBJECTIVE Monitoring recovery process of coronavirus disease 2019 (COVID-19) patients released from hospital is crucial for exploring residual effects of COVID-19 and beneficial for clinical care. In this study, a comprehensive analysis was carried out to clarify residual effects of COVID-19 on hospital discharged patients. METHODS Two hundred sixty-eight cases with laboratory measured data at hospital discharge record and five follow-up visits were retrospectively collected to carry out statistical data analysis comprehensively, which includes multiple statistical methods (e.g., chi-square, T-test and regression) used in this study. RESULTS Study found that 13 of 21 hematologic parameters in laboratory measured dataset and volume ratio of right lung lesions on CT images highly associated with COVID-19. Moderate patients had statistically significant lower neutrophils than mild and severe patients after hospital discharge, which is probably caused by more efforts on severe patients and slightly neglection of moderate patients. COVID-19 has residual effects on neutrophil-to-lymphocyte ratio (NLR) of patients who have hypertension or chronic obstructive pulmonary disease (COPD). After released from hospital, female showed better performance in T lymphocytes subset cells, especially T helper lymphocyte% (16% higher than male). According to this sex-based differentiation of COVID-19, male should be recommended to take clinical test more frequently to monitor recovery of immune system. Patients over 60 years old showed unstable recovery process of immune cells (e.g., CD45 + lymphocyte) within 75 days after discharge requiring longer clinical care. Additionally, right lung was vulnerable to COVID-19 and required more time to recover than left lung. CONCLUSIONS Criterion of hospital discharge and strategy of clinical care should be flexible in different cases due to residual effects of COVID-19, which depend on several impact factors. Revealing remaining effects of COVID-19 is an effective way to eliminate disorder of mental health caused by COVID-19 infection.
Collapse
Affiliation(s)
- Deyang Huang
- Guangzhou Eighth People's Hospital, Guangdong, China
| | - Hengyuan Miao
- Tsinghua-Berkeley Shenzhen Institute, Tsinghua University, Shenzhen, China
| | - Ziqi Zhang
- Tsinghua-Berkeley Shenzhen Institute, Tsinghua University, Shenzhen, China
| | - Yanhong Yang
- Guangzhou Eighth People's Hospital, Guangdong, China
| | | | | | - Zixian Wang
- Tsinghua-Berkeley Shenzhen Institute, Tsinghua University, Shenzhen, China
| | - Stefan Jaeger
- National Library of Medicine, National Institutes of Health, Rockville Pike, Bethesda, MD, USA
| | - Lin Guo
- Shenzhen Zhiying Medical Co., Ltd, Shenzhen, China
| | - Tao Xu
- Tsinghua-Berkeley Shenzhen Institute, Tsinghua University, Shenzhen, China
- Biomanufacturing and Rapid Forming Technology Key Laboratory of Beijings, Department of Mechanical Engineering, Tsinghua University, Beijing, China
- Key Laboratory for Advanced Materials Processing Technology, Ministry of Education, Department of Mechanical Engineering, Tsinghua University, Beijing, China
| | - Jinxin Liu
- Guangzhou Eighth People's Hospital, Guangdong, China
| |
Collapse
|
132
|
Arpaci I, Huang S, Al-Emran M, Al-Kabi MN, Peng M. Predicting the COVID-19 infection with fourteen clinical features using machine learning classification algorithms. MULTIMEDIA TOOLS AND APPLICATIONS 2021; 80:11943-11957. [PMID: 33437173 PMCID: PMC7790521 DOI: 10.1007/s11042-020-10340-7] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/21/2020] [Revised: 10/23/2020] [Accepted: 12/22/2020] [Indexed: 05/07/2023]
Abstract
While the RT-PCR is the silver bullet test for confirming the COVID-19 infection, it is limited by the lack of reagents, time-consuming, and the need for specialized labs. As an alternative, most of the prior studies have focused on Chest CT images and Chest X-Ray images using deep learning algorithms. However, these two approaches cannot always be used for patients' screening due to the radiation doses, high costs, and the low number of available devices. Hence, there is a need for a less expensive and faster diagnostic model to identify the positive and negative cases of COVID-19. Therefore, this study develops six predictive models for COVID-19 diagnosis using six different classifiers (i.e., BayesNet, Logistic, IBk, CR, PART, and J48) based on 14 clinical features. This study retrospected 114 cases from the Taizhou hospital of Zhejiang Province in China. The results showed that the CR meta-classifier is the most accurate classifier for predicting the positive and negative COVID-19 cases with an accuracy of 84.21%. The results could help in the early diagnosis of COVID-19, specifically when the RT-PCR kits are not sufficient for testing the infection and assist countries, specifically the developing ones that suffer from the shortage of RT-PCR tests and specialized laboratories.
Collapse
Affiliation(s)
- Ibrahim Arpaci
- Department of Computer Education and Instructional Technology, Tokat Gaziosmanpasa University, Tokat, Turkey
| | - Shigao Huang
- Cancer Centre, Institute of Translational Medicine, Faculty of Health Sciences, University of Macau, Taipa, Macao SAR China
| | - Mostafa Al-Emran
- Faculty of Engineering & IT, The British University in Dubai, Dubai, UAE
| | - Mohammed N. Al-Kabi
- Department of Information Technology, Al Buraimi University College, Al Buraimi, Oman
| | - Minfei Peng
- Zhejiang Taizhou Hospital, Wenzhou Medical University, Taizhou, China
| |
Collapse
|
133
|
Plante TB, Blau AM, Berg AN, Weinberg AS, Jun IC, Tapson VF, Kanigan TS, Adib AB. Development and External Validation of a Machine Learning Tool to Rule Out COVID-19 Among Adults in the Emergency Department Using Routine Blood Tests: A Large, Multicenter, Real-World Study. J Med Internet Res 2020; 22:e24048. [PMID: 33226957 PMCID: PMC7713695 DOI: 10.2196/24048] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2020] [Revised: 10/14/2020] [Accepted: 11/19/2020] [Indexed: 12/21/2022] Open
Abstract
Background Conventional diagnosis of COVID-19 with reverse transcription polymerase chain reaction (RT-PCR) testing (hereafter, PCR) is associated with prolonged time to diagnosis and significant costs to run the test. The SARS-CoV-2 virus might lead to characteristic patterns in the results of widely available, routine blood tests that could be identified with machine learning methodologies. Machine learning modalities integrating findings from these common laboratory test results might accelerate ruling out COVID-19 in emergency department patients. Objective We sought to develop (ie, train and internally validate with cross-validation techniques) and externally validate a machine learning model to rule out COVID 19 using only routine blood tests among adults in emergency departments. Methods Using clinical data from emergency departments (EDs) from 66 US hospitals before the pandemic (before the end of December 2019) or during the pandemic (March-July 2020), we included patients aged ≥20 years in the study time frame. We excluded those with missing laboratory results. Model training used 2183 PCR-confirmed cases from 43 hospitals during the pandemic; negative controls were 10,000 prepandemic patients from the same hospitals. External validation used 23 hospitals with 1020 PCR-confirmed cases and 171,734 prepandemic negative controls. The main outcome was COVID 19 status predicted using same-day routine laboratory results. Model performance was assessed with area under the receiver operating characteristic (AUROC) curve as well as sensitivity, specificity, and negative predictive value (NPV). Results Of 192,779 patients included in the training, external validation, and sensitivity data sets (median age decile 50 [IQR 30-60] years, 40.5% male [78,249/192,779]), AUROC for training and external validation was 0.91 (95% CI 0.90-0.92). Using a risk score cutoff of 1.0 (out of 100) in the external validation data set, the model achieved sensitivity of 95.9% and specificity of 41.7%; with a cutoff of 2.0, sensitivity was 92.6% and specificity was 59.9%. At the cutoff of 2.0, the NPVs at a prevalence of 1%, 10%, and 20% were 99.9%, 98.6%, and 97%, respectively. Conclusions A machine learning model developed with multicenter clinical data integrating commonly collected ED laboratory data demonstrated high rule-out accuracy for COVID-19 status, and might inform selective use of PCR-based testing.
Collapse
Affiliation(s)
- Timothy B Plante
- Larner College of Medicine at the University of Vermont, Colchester, VT, United States.,University of Vermont Medical Center, Burlington, VT, United States
| | - Aaron M Blau
- University of Vermont Medical Center, Burlington, VT, United States
| | - Adrian N Berg
- Larner College of Medicine at the University of Vermont, Burlington, VT, United States.,Biocogniv Inc, South Burlington, VT, United States
| | | | - Ik C Jun
- Cedars-Sinai Medical Center, Los Angeles, CA, United States
| | | | | | - Artur B Adib
- Biocogniv Inc, South Burlington, VT, United States
| |
Collapse
|
134
|
Rasheed J, Jamil A, Hameed AA, Aftab U, Aftab J, Shah SA, Draheim D. A survey on artificial intelligence approaches in supporting frontline workers and decision makers for the COVID-19 pandemic. CHAOS, SOLITONS, AND FRACTALS 2020; 141:110337. [PMID: 33071481 PMCID: PMC7547637 DOI: 10.1016/j.chaos.2020.110337] [Citation(s) in RCA: 42] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/21/2020] [Accepted: 10/01/2020] [Indexed: 05/04/2023]
Abstract
While the world has experience with many different types of infectious diseases, the current crisis related to the spread of COVID-19 has challenged epidemiologists and public health experts alike, leading to a rapid search for, and development of, new and innovative solutions to combat its spread. The transmission of this virus has infected more than 18.92 million people as of August 6, 2020, with over half a million deaths across the globe; the World Health Organization (WHO) has declared this a global pandemic. A multidisciplinary approach needs to be followed for diagnosis, treatment and tracking, especially between medical and computer sciences, so, a common ground is available to facilitate the research work at a faster pace. With this in mind, this survey paper aimed to explore and understand how and which different technological tools and techniques have been used within the context of COVID-19. The primary contribution of this paper is in its collation of the current state-of-the-art technological approaches applied to the context of COVID-19, and doing this in a holistic way, covering multiple disciplines and different perspectives. The analysis is widened by investigating Artificial Intelligence (AI) approaches for the diagnosis, anticipate infection and mortality rate by tracing contacts and targeted drug designing. Moreover, the impact of different kinds of medical data used in diagnosis, prognosis and pandemic analysis is also provided. This review paper covers both medical and technological perspectives to facilitate the virologists, AI researchers and policymakers while in combating the COVID-19 outbreak.
Collapse
Affiliation(s)
- Jawad Rasheed
- Department of Computer Engineering, Istanbul Sabahattin Zaim University, Istanbul 34303, Turkey
| | - Akhtar Jamil
- Department of Computer Engineering, Istanbul Sabahattin Zaim University, Istanbul 34303, Turkey
| | - Alaa Ali Hameed
- Department of Computer Engineering, Istanbul Sabahattin Zaim University, Istanbul 34303, Turkey
| | - Usman Aftab
- Department of Pharmacology, University of Health Sciences, Lahore 54700, Pakistan
| | - Javaria Aftab
- Department of Chemistry, Istanbul Technical University, Istanbul 34467, Turkey
| | - Syed Attique Shah
- Department of IT, Balochistan University of Information Technology, Engineering and Management Sciences, Quetta 87300, Pakistan
| | - Dirk Draheim
- Information Systems Group, Tallinn University of Technology, Akadeemia tee 15a, 12618, Tallinn, Estonia
| |
Collapse
|
135
|
Ferrari D, Carobene A, Campagner A, Cabitza F, Sabetta E, Ceriotti D, Di Resta C, Locatelli M. Evidence of significant difference in key COVID-19 biomarkers during the Italian lockdown strategy. A retrospective study on patients admitted to a hospital emergency department in Northern Italy. ACTA BIO-MEDICA : ATENEI PARMENSIS 2020; 91:e2020156. [PMID: 33525206 PMCID: PMC7927476 DOI: 10.23750/abm.v91i4.10371] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/30/2020] [Accepted: 07/30/2020] [Indexed: 12/24/2022]
Abstract
BACKGROUND The Lombardy region, Italy, has been severely affected by COVID-19. During the epidemic peak, in March 2020, patients needing intensive care unit treatments were approximately 10% of those infected. This fraction decreased to approximately 2% in the second part of April, and to 0.4% at the beginning of July. COVID-19 is characterized by several biochemical abnormalities whose discrepancy from normal values was associated to the severity of the disease. The aim of this retrospective study was to compare the biochemical patterns of patients during and after the pandemic peak in order to verify whether later patients were experiencing a milder COVID-19 course, as anecdotally observed by several clinicians of the same Hospital. MATERIAL AND METHODS The laboratory findings of two equivalent groups of 84 patients each, admitted at the emergency department of the San Raffaele Hospital (Milan, Italy), during March and April respectively, were analyzed and compared. Results. White blood cell, platelets, lymphocytes and lactate dehydrogenase showed a statistically significant improvement (i.e. closer or within the normal clinical range) in the April group compared to March. Creatinine, C-reactive protein, Calcium and liver enzymes, were also pointing in that direction, although the differences were not significant. DISCUSSION The laboratory findings analyzed in this study were consistent with a milder COVID-19 course in the April group. After excluding several hypotheses, we concluded that our observation was likely the consequence of the lockdown strategy enforcement, which, by imposing social distancing and the use of respiratory protective devices, reduced viral loads upon infection.
Collapse
|
136
|
Cabitza F, Campagner A, Ferrari D, Di Resta C, Ceriotti D, Sabetta E, Colombini A, De Vecchi E, Banfi G, Locatelli M, Carobene A. Development, evaluation, and validation of machine learning models for COVID-19 detection based on routine blood tests. Clin Chem Lab Med 2020; 59:421-431. [PMID: 33079698 DOI: 10.1515/cclm-2020-1294] [Citation(s) in RCA: 70] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2020] [Accepted: 10/07/2020] [Indexed: 02/07/2023]
Abstract
Objectives The rRT-PCR test, the current gold standard for the detection of coronavirus disease (COVID-19), presents with known shortcomings, such as long turnaround time, potential shortage of reagents, false-negative rates around 15-20%, and expensive equipment. The hematochemical values of routine blood exams could represent a faster and less expensive alternative. Methods Three different training data set of hematochemical values from 1,624 patients (52% COVID-19 positive), admitted at San Raphael Hospital (OSR) from February to May 2020, were used for developing machine learning (ML) models: the complete OSR dataset (72 features: complete blood count (CBC), biochemical, coagulation, hemogasanalysis and CO-Oxymetry values, age, sex and specific symptoms at triage) and two sub-datasets (COVID-specific and CBC dataset, 32 and 21 features respectively). 58 cases (50% COVID-19 positive) from another hospital, and 54 negative patients collected in 2018 at OSR, were used for internal-external and external validation. Results We developed five ML models: for the complete OSR dataset, the area under the receiver operating characteristic curve (AUC) for the algorithms ranged from 0.83 to 0.90; for the COVID-specific dataset from 0.83 to 0.87; and for the CBC dataset from 0.74 to 0.86. The validations also achieved good results: respectively, AUC from 0.75 to 0.78; and specificity from 0.92 to 0.96. Conclusions ML can be applied to blood tests as both an adjunct and alternative method to rRT-PCR for the fast and cost-effective identification of COVID-19-positive patients. This is especially useful in developing countries, or in countries facing an increase in contagions.
Collapse
Affiliation(s)
| | - Andrea Campagner
- IRCCS Istituto Ortopedico Galeazzi, Laboratory of Clinical Chemistry and Microbiology, Milan, Italy
| | | | - Chiara Di Resta
- Vita-Salute San Raffaele University; Unit of Genomics for Human Disease Diagnosis, Division of Genetics and Cell Biology, Milan, Italy
| | - Daniele Ceriotti
- Laboratory Medicine, IRCCS San Raffaele Scientific Institute, Milan, Italy
| | - Eleonora Sabetta
- Laboratory Medicine, IRCCS San Raffaele Scientific Institute, Milan, Italy
| | - Alessandra Colombini
- IRCCS Istituto Ortopedico Galeazzi, Laboratory of Clinical Chemistry and Microbiology, Milan, Italy
| | - Elena De Vecchi
- IRCCS Istituto Ortopedico Galeazzi, Laboratory of Clinical Chemistry and Microbiology, Milan, Italy
| | - Giuseppe Banfi
- IRCCS Istituto Ortopedico Galeazzi, Laboratory of Clinical Chemistry and Microbiology, Milan, Italy
| | - Massimo Locatelli
- Laboratory Medicine, IRCCS San Raffaele Scientific Institute, Milan, Italy
| | - Anna Carobene
- Laboratory Medicine, IRCCS San Raffaele Scientific Institute, Milan, Italy
| |
Collapse
|
137
|
AlJame M, Ahmad I, Imtiaz A, Mohammed A. Ensemble learning model for diagnosing COVID-19 from routine blood tests. INFORMATICS IN MEDICINE UNLOCKED 2020; 21:100449. [PMID: 33102686 PMCID: PMC7572278 DOI: 10.1016/j.imu.2020.100449] [Citation(s) in RCA: 51] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2020] [Revised: 09/28/2020] [Accepted: 10/07/2020] [Indexed: 12/19/2022] Open
Abstract
BACKGROUND AND OBJECTIVES The pandemic of novel coronavirus disease 2019 (COVID-19) has severely impacted human society with a massive death toll worldwide. There is an urgent need for early and reliable screening of COVID-19 patients to provide better and timely patient care and to combat the spread of the disease. In this context, recent studies have reported some key advantages of using routine blood tests for initial screening of COVID-19 patients. In this article, first we present a review of the emerging techniques for COVID-19 diagnosis using routine laboratory and/or clinical data. Then, we propose ERLX which is an ensemble learning model for COVID-19 diagnosis from routine blood tests. METHOD The proposed model uses three well-known diverse classifiers, extra trees, random forest and logistic regression, which have different architectures and learning characteristics at the first level, and then combines their predictions by using a second level extreme gradient boosting (XGBoost) classifier to achieve a better performance. For data preparation, the proposed methodology employs a KNNImputer algorithm to handle null values in the dataset, isolation forest (iForest) to remove outlier data, and a synthetic minority oversampling technique (SMOTE) to balance data distribution. For model interpretability, features importance are reported by using the SHapley Additive exPlanations (SHAP) technique. RESULTS The proposed model was trained and evaluated by using a publicly available data set from Albert Einstein Hospital in Brazil, which consisted of 5644 data samples with 559 confirmed COVID-19 cases. The ensemble model achieved outstanding performance with an overall accuracy of 99.88% [95% CI: 99.6-100], AUC of 99.38% [95% CI: 97.5-100], a sensitivity of 98.72% [95% CI: 94.6-100] and a specificity of 99.99% [95% CI: 99.99-100]. DISCUSSION The proposed model revealed better performance when compared against existing state-of-the-art studies (Banerjee et al., 2020; de Freitas Barbosa et al., 2020; de Moraes Batista et al., 2020; Soares et al., 2020) [3,22,56,71] for the same set of features employed by them. As compared to the best performing Bayes Net model (de Freitas Barbosa et al., 2020) [22] average accuracy of 95.159%, ERLX achieved an average accuracy of 99.94%. In comparison with AUC of 85% reported by the SVM model (de Moraes Batista et al., 2020) [56], ERLX obtained AUC of 99.77% in addition to improvements in sensitivity, and specificity. As compared with ER-COV model (Soares et al., 2020) [71] average sensitivity of 70.25% and specificity of 85.98%, ERLX model achieved sensitivity of 99.47% and specificity of 99.99%. The ERLX model obtained a considerably higher score as compared with ANN model (Banerjee et al., 2020) [3] in all performance metrics. Therefore, the model presented is robust and can be deployed for reliable early and rapid screening of COVID-19 patients.
Collapse
Affiliation(s)
- Maryam AlJame
- Computer Engineering Department, Kuwait University, Kuwait
| | - Imtiaz Ahmad
- Computer Engineering Department, Kuwait University, Kuwait
| | | | - Ameer Mohammed
- Computer Engineering Department, Kuwait University, Kuwait
| |
Collapse
|
138
|
Das AK, Mishra S, Saraswathy Gopalan S. Predicting CoVID-19 community mortality risk using machine learning and development of an online prognostic tool. PeerJ 2020; 8:e10083. [PMID: 33062451 PMCID: PMC7528809 DOI: 10.7717/peerj.10083] [Citation(s) in RCA: 43] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2020] [Accepted: 09/11/2020] [Indexed: 01/08/2023] Open
Abstract
BACKGROUND The recent pandemic of CoVID-19 has emerged as a threat to global health security. There are very few prognostic models on CoVID-19 using machine learning. OBJECTIVES To predict mortality among confirmed CoVID-19 patients in South Korea using machine learning and deploy the best performing algorithm as an open-source online prediction tool for decision-making. MATERIALS AND METHODS Mortality for confirmed CoVID-19 patients (n = 3,524) between January 20, 2020 and May 30, 2020 was predicted using five machine learning algorithms (logistic regression, support vector machine, K nearest neighbor, random forest and gradient boosting). The performance of the algorithms was compared, and the best performing algorithm was deployed as an online prediction tool. RESULTS The logistic regression algorithm was the best performer in terms of discrimination (area under ROC curve = 0.830), calibration (Matthews Correlation Coefficient = 0.433; Brier Score = 0.036) and. The best performing algorithm (logistic regression) was deployed as the online CoVID-19 Community Mortality Risk Prediction tool named CoCoMoRP (https://ashis-das.shinyapps.io/CoCoMoRP/). CONCLUSIONS We describe the development and deployment of an open-source machine learning tool to predict mortality risk among CoVID-19 confirmed patients using publicly available surveillance data. This tool can be utilized by potential stakeholders such as health providers and policymakers to triage patients at the community level in addition to other approaches.
Collapse
|
139
|
Ferrari D, Sabetta E, Ceriotti D, Motta A, Strollo M, Banfi G, Locatelli M. Routine blood analysis greatly reduces the false-negative rate of RT-PCR testing for Covid-19. ACTA BIO-MEDICA : ATENEI PARMENSIS 2020; 91:e2020003. [PMID: 32921701 PMCID: PMC7717005 DOI: 10.23750/abm.v91i3.9843] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/16/2020] [Accepted: 05/18/2020] [Indexed: 12/28/2022]
Abstract
BACKGROUND The COVID-19 outbreak is now a pandemic disease reaching as much as 210 countries worldwide with more than 2.5 million infected people and nearly 200.000 deaths. Amplification of viral RNA by RT-PCR represents the gold standard for confirmation of infection, yet it showed false-negative rates as large as 15-20% which may jeopardize the effect of the restrictive measures taken by governments. We previously showed that several hematological parameters were significantly different between COVID-19 positive and negative patients. Among them aspartate aminotransferase and lactate dehydrogenase had predictive values as large as 90%. Thus a combination of RT-PCR and blood tests could reduce the false-negative rate of the genetic test. METHODS We retrospectively analyzed 24 patients showing multiple and inconsistent RT-PCR, test during their first hospitalization period, and compared the genetic tests results with their AST and LDH levels. RESULTS We showed that when considering the hematological parameters, the RT-PCR false-negative rates were reduced by almost 4-fold. CONCLUSIONS The study represents a preliminary work aiming at the development of strategies that, by combining RT-PCR tests with routine blood tests, will lower or even abolish the rate of RT-PCR false-negative results and thus will identify, with high accuracy, patients infected by COVID-19.
Collapse
Affiliation(s)
- Davide Ferrari
- SCVSA Department, University of Parma, Parma, Italy; 2 Laboratory Medicine Service, San Raffaele Hospital, Milano, Italy; 3 Vita-Salute University, San Raffaele, Milano, Italy; IRCCS Istituto Ortopedico Galeazzi, Milano, Italy.
| | | | | | | | | | | | | |
Collapse
|
140
|
Ferrari D, Cabitza F, Carobene A, Locatelli M. Routine blood tests as an active surveillance to monitor COVID-19 prevalence. A retrospective study. ACTA BIO-MEDICA : ATENEI PARMENSIS 2020; 91:e2020009. [PMID: 32921707 PMCID: PMC7716996 DOI: 10.23750/abm.v91i3.10218] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/13/2020] [Accepted: 07/14/2020] [Indexed: 12/23/2022]
Abstract
BACKGROUND In Italy, one of the country most affected by the COVID-19 pandemic, the first autochthonous case appeared in Lombardy on February 20th, 2020. One month later, the number of -COVID-19 patients in Lombardy exceeded 17000 and about 3500 had died. Because of this rapid increase in infected people scientists wonder whether SARS-CoV-2 was already highly circulating in Lombardy before such date. Plasma levels of aspartate aminotransferase (AST) and lactate dehydrogenase (LDH) were shown to be -highly increased in COVID-19 patients. Monitoring their levels in Emergency Room patients during the months preceding February 20th, 2020, might shade light on the prevalence of the disease in the pre-COVID-19 period. METHODS We retrospectively analyzed the AST and LDH levels from more than 30.000 patients admitted to the San Raffaele Hospital Emergency Room (ER) between September 2019 and May 2020 as well as between September 2018 and May 2019. The number of patients diagnosed with respiratory tract diseases were also analyzed. RESULTS Data showed that the ER averaged AST and LDH levels are highly sensitive to the presence of COVID-19 patients. During, the months preceding February 20th, 2020, AST and LDH levels, as well as the number of patients diagnosed with respiratory tract diseases were similar to their 2019 counterparts. CONCLUSIONS No significant evidence showing that a large number of COVID-19 patients were admitted to the San Raffaele Hospital ER before February 20th, 2020, was found. Thus, the virus was likely circulating, within the Hospital catchment area, either in low amounts or through asymptomatic individuals. Because of the high LDH and AST levels' variations induced by COVID-19, routine blood tests might be exploited as a surveillance indicator for a possible second wave.
Collapse
Affiliation(s)
- Davide Ferrari
- SCVSA Department, University of Parma, Parma, Italy; 2 Laboratory Medicine Service, San Raffaele Hospital, Milano, Italy; 3 Vita-Salute University, San Raffaele, Milano, Italy; IRCCS Istituto Ortopedico Galeazzi, Milano, Italy.
| | | | | | | |
Collapse
|
141
|
Diagnostic significance of C-reactive protein and hematological parameters in acute toxoplasmosis. J Parasit Dis 2020; 44:785-793. [PMID: 32904402 PMCID: PMC7456360 DOI: 10.1007/s12639-020-01262-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2020] [Accepted: 08/17/2020] [Indexed: 01/08/2023] Open
Abstract
Abstract Toxoplasmosis is a zoonosis caused by Toxoplasma gondii, which can be acquired by oral contact and may cause severe health problems especially for pregnant (congenital toxoplasmosis) and immunocompromised patients. This study aimed to verify the diagnostic significance of hematological parameters and C-reactive protein (CRP) for toxoplasmosis acute detection. A case-control study was carried out between December 2017 and May 2018, in samples of convenience independent of age and sex. The case group was formed by 25 patients with positive anti-Toxoplasma gondii IgG/IgM antibody and the control group was formed by 21 patients with non-positive anti-Toxoplasma gondii IgG/IgM antibody. The results of the hematological parameters and CRP were analyzed in these patients. The patients with Toxoplasma gondii IgM antibody reagent showed higher lymphocytes counting and lower neutrophils counting than the control group. C-reactive protein levels were not different between the groups case and control. ROC curve analysis highlighted that the cut-off value of > 32.00% for lymphocytes and < 57.50% for neutrophils were able to produce specificity higher than 90% for IgM antibody detection. The Naïve Bayes classifier was considered suitable (AUC ≈ 0.700) to separate both groups according to their white cell counting. Changes in lymphocytes and neutrophils may be useful parameters for toxoplasmosis identification and may be used as a tool in the complementary diagnosis of toxoplasmosis. Graphic abstract ![]()
Collapse
|
142
|
Das AK, Mishra S, Saraswathy Gopalan S. Predicting CoVID-19 community mortality risk using machine learning and development of an online prognostic tool. PeerJ 2020. [PMID: 33062451 DOI: 10.1101/2020.04.27.20081794] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/03/2023] Open
Abstract
BACKGROUND The recent pandemic of CoVID-19 has emerged as a threat to global health security. There are very few prognostic models on CoVID-19 using machine learning. OBJECTIVES To predict mortality among confirmed CoVID-19 patients in South Korea using machine learning and deploy the best performing algorithm as an open-source online prediction tool for decision-making. MATERIALS AND METHODS Mortality for confirmed CoVID-19 patients (n = 3,524) between January 20, 2020 and May 30, 2020 was predicted using five machine learning algorithms (logistic regression, support vector machine, K nearest neighbor, random forest and gradient boosting). The performance of the algorithms was compared, and the best performing algorithm was deployed as an online prediction tool. RESULTS The logistic regression algorithm was the best performer in terms of discrimination (area under ROC curve = 0.830), calibration (Matthews Correlation Coefficient = 0.433; Brier Score = 0.036) and. The best performing algorithm (logistic regression) was deployed as the online CoVID-19 Community Mortality Risk Prediction tool named CoCoMoRP (https://ashis-das.shinyapps.io/CoCoMoRP/). CONCLUSIONS We describe the development and deployment of an open-source machine learning tool to predict mortality risk among CoVID-19 confirmed patients using publicly available surveillance data. This tool can be utilized by potential stakeholders such as health providers and policymakers to triage patients at the community level in addition to other approaches.
Collapse
|