1
|
Lee CT, Zhang K, Li W, Tang K, Ling Y, Walji MF, Jiang X. Identifying predictors of the tooth loss phenotype in a large periodontitis patient cohort using a machine learning approach. J Dent 2024; 144:104921. [PMID: 38437976 DOI: 10.1016/j.jdent.2024.104921] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2023] [Revised: 02/17/2024] [Accepted: 03/01/2024] [Indexed: 03/06/2024] Open
Abstract
OBJECTIVES This study aimed to identify predictors associated with the tooth loss phenotype in a large periodontitis patient cohort in the university setting. METHODS Information on periodontitis patients and nineteen factors identified at the initial visit was extracted from electronic health records. The primary outcome is tooth loss phenotype (presence or absence of tooth loss). Prediction models were built on significant factors (single or combinatory) selected by the RuleFit algorithm, and these factors were further adopted by regression models. Model performance was evaluated by Area Under the Receiver Operating Characteristic Curve (AUROC) and Area Under the Precision-Recall Curve (AUPRC). Associations between predictors and the tooth loss phenotype were also evaluated by classical statistical approaches to validate the performance of machine learning models. RESULTS In total, 7840 patients were included. The machine learning model predicting the tooth loss phenotype achieved AUROC of 0.71 and AUPRC of 0.66. Age, periodontal diagnosis, number of missing teeth at baseline, furcation involvement, and tooth mobility were associated with the tooth loss phenotype in both machine learning and classical statistical models. CONCLUSIONS The rule-based machine learning approach improves model explainability compared to classical statistical methods. However, the model's generalizability needs to be further validated by external datasets. CLINICAL SIGNIFICANCE Predictors identified by the current machine learning approach using the RuleFit algorithm had clinically relevant thresholds in predicting the tooth loss phenotype in a large and diverse periodontitis patient cohort. The results of this study will assist clinicians in performing risk assessment for periodontitis at the initial visit.
Collapse
Affiliation(s)
- Chun-Teh Lee
- Department of Periodontics and Dental Hygiene, The University of Texas Health Science Center at Houston School of Dentistry, 7500 Cambridge Street, Houston, TX 77054, USA
| | - Kai Zhang
- The University of Texas Health Science Center at Houston School of Biomedical Informatics, 7000 Fannin St, Houston, Texas 77030, USA
| | - Wen Li
- Division of Clinical and Translational Sciences, Department of Internal Medicine, the University of Texas McGovern Medical School at Houston, 6431 Fannin St, Houston, Texas, USA; Biostatistics/Epidemiology/Research Design (BERD) Component, Center for Clinical and Translational Sciences (CCTS), University of Texas Health Science Center at Houston, 7000 Fannin St, Houston, Houston, Texas 77030, USA
| | - Kaichen Tang
- The University of Texas Health Science Center at Houston School of Biomedical Informatics, 7000 Fannin St, Houston, Texas 77030, USA
| | - Yaobin Ling
- The University of Texas Health Science Center at Houston School of Biomedical Informatics, 7000 Fannin St, Houston, Texas 77030, USA
| | - Muhammad F Walji
- The University of Texas Health Science Center at Houston School of Biomedical Informatics, 7000 Fannin St, Houston, Texas 77030, USA; Department of Diagnostic and Biomedical Sciences, The University of Texas Health Science Center at Houston School of Dentistry, 7000 Fannin St., Houston, Texas 77030, USA
| | - Xiaoqian Jiang
- The University of Texas Health Science Center at Houston School of Biomedical Informatics, 7000 Fannin St, Houston, Texas 77030, USA.
| |
Collapse
|
2
|
Singh K, Kaur N, Prabhu A. Combating COVID-19 Crisis using Artificial Intelligence (AI) Based Approach: Systematic Review. Curr Top Med Chem 2024; 24:737-753. [PMID: 38318824 DOI: 10.2174/0115680266282179240124072121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Revised: 12/19/2023] [Accepted: 12/27/2023] [Indexed: 02/07/2024]
Abstract
BACKGROUND SARS-CoV-2, the unique coronavirus that causes COVID-19, has wreaked damage around the globe, with victims displaying a wide range of difficulties that have encouraged medical professionals to look for innovative technical solutions and therapeutic approaches. Artificial intelligence-based methods have contributed a significant part in tackling complicated issues, and some institutions have been quick to embrace and tailor these solutions in response to the COVID-19 pandemic's obstacles. Here, in this review article, we have covered a few DL techniques for COVID-19 detection and diagnosis, as well as ML techniques for COVID-19 identification, severity classification, vaccine and drug development, mortality rate prediction, contact tracing, risk assessment, and public distancing. This review illustrates the overall impact of AI/ML tools on tackling and managing the outbreak. PURPOSE The focus of this research was to undertake a thorough evaluation of the literature on the part of Artificial Intelligence (AI) as a complete and efficient solution in the battle against the COVID-19 epidemic in the domains of detection and diagnostics of disease, mortality prediction and vaccine as well as drug development. METHODS A comprehensive exploration of PubMed, Web of Science, and Science Direct was conducted using PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analysis) regulations to find all possibly suitable papers conducted and made publicly available between December 1, 2019, and August 2023. COVID-19, along with AI-specific words, was used to create the query syntax. RESULTS During the period covered by the search strategy, 961 articles were published and released online. Out of these, a total of 135 papers were chosen for additional investigation. Mortality rate prediction, early detection and diagnosis, vaccine as well as drug development, and lastly, incorporation of AI for supervising and controlling the COVID-19 pandemic were the four main topics focused entirely on AI applications used to tackle the COVID-19 crisis. Out of 135, 60 research papers focused on the detection and diagnosis of the COVID-19 pandemic. Next, 19 of the 135 studies applied a machine-learning approach for mortality rate prediction. Another 22 research publications emphasized the vaccine as well as drug development. Finally, the remaining studies were concentrated on controlling the COVID-19 pandemic by applying AI AI-based approach to it. CONCLUSION We compiled papers from the available COVID-19 literature that used AI-based methodologies to impart insights into various COVID-19 topics in this comprehensive study. Our results suggest crucial characteristics, data types, and COVID-19 tools that can aid in medical and translational research facilitation.
Collapse
Affiliation(s)
- Kavya Singh
- Department of Biotechnology, Banasthali University, Banasthali Vidyapith, Banasthali, 304022, Rajasthan, India
| | - Navjeet Kaur
- Department of Chemistry & Division of Research and Development, Lovely Professional University, Phagwara, 144411, Punjab, India
| | - Ashish Prabhu
- Biotechnology Department, NIT Warangal, Warangal, 506004, Telangana, India
| |
Collapse
|
3
|
Huang Y, Li J, Li M, Aparasu RR. Application of machine learning in predicting survival outcomes involving real-world data: a scoping review. BMC Med Res Methodol 2023; 23:268. [PMID: 37957593 PMCID: PMC10641971 DOI: 10.1186/s12874-023-02078-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Accepted: 10/20/2023] [Indexed: 11/15/2023] Open
Abstract
BACKGROUND Despite the interest in machine learning (ML) algorithms for analyzing real-world data (RWD) in healthcare, the use of ML in predicting time-to-event data, a common scenario in clinical practice, is less explored. ML models are capable of algorithmically learning from large, complex datasets and can offer advantages in predicting time-to-event data. We reviewed the recent applications of ML for survival analysis using RWD in healthcare. METHODS PUBMED and EMBASE were searched from database inception through March 2023 to identify peer-reviewed English-language studies of ML models for predicting time-to-event outcomes using the RWD. Two reviewers extracted information on the data source, patient population, survival outcome, ML algorithms, and the Area Under the Curve (AUC). RESULTS Of 257 citations, 28 publications were included. Random survival forests (N = 16, 57%) and neural networks (N = 11, 39%) were the most popular ML algorithms. There was variability across AUC for these ML models (median 0.789, range 0.6-0.950). ML algorithms were predominately considered for predicting overall survival in oncology (N = 12, 43%). ML survival models were often used to predict disease prognosis or clinical events (N = 27, 96%) in the oncology, while less were used for treatment outcomes (N = 1, 4%). CONCLUSIONS The ML algorithms, random survival forests and neural networks, are mainly used for RWD to predict survival outcomes such as disease prognosis or clinical events in the oncology. This review shows that more opportunities remain to apply these ML algorithms to inform treatment decision-making in clinical practice. More methodological work is also needed to ensure the utility and applicability of ML models in survival outcomes.
Collapse
Affiliation(s)
- Yinan Huang
- Department of Pharmacy Administration, School of Pharmacy, University of Mississippi, University, MS, 38677, USA
| | - Jieni Li
- Department of Pharmaceutical Health Outcomes and Policy, College of Pharmacy, University of Houston, Houston, TX, 77204, USA
| | - Mai Li
- Department of Industrial Engineering, Cullen College of Engineering, University of Houston, Houston, TX, USA
| | - Rajender R Aparasu
- Department of Pharmaceutical Health Outcomes and Policy, College of Pharmacy, University of Houston, Houston, TX, 77204, USA.
| |
Collapse
|
4
|
Lee CT, Zhang K, Li W, Tang K, Ling Y, Walji MF, Jiang X. Identifying predictors of tooth loss using a rule-based machine learning approach: A retrospective study at university-setting clinics. J Periodontol 2023; 94:1231-1242. [PMID: 37063053 DOI: 10.1002/jper.23-0030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2023] [Revised: 03/18/2023] [Accepted: 04/12/2023] [Indexed: 04/18/2023]
Abstract
BACKGROUND This study aimed to identify predictors associated with tooth loss in a large periodontitis patient cohort in the university setting using the machine learning approach. METHODS Information on periodontitis patients and 18 factors identified at the initial visit was extracted from electronic health records. A two-step machine learning pipeline was proposed to develop the tooth loss prediction model. The primary outcome is tooth loss count. The prediction model was built on significant factors (single or combination) selected by the RuleFit algorithm, and these factors were further adopted by the count regression model. Model performance was evaluated by root-mean-squared error (RMSE). Associations between predictors and tooth loss were also assessed by a classical statistical approach to validate the performance of the machine learning model. RESULTS In total, 7840 patients were included. The machine learning model predicting tooth loss count achieved RMSE of 2.71. Age, smoking, frequency of brushing, frequency of flossing, periodontal diagnosis, bleeding on probing percentage, number of missing teeth at baseline, and tooth mobility were associated with tooth loss in both machine learning and classical statistical models. CONCLUSION The two-step machine learning pipeline is feasible to predict tooth loss in periodontitis patients. Compared to classical statistical methods, this rule-based machine learning approach improves model explainability. However, the model's generalizability needs to be further validated by external datasets.
Collapse
Affiliation(s)
- Chun-Teh Lee
- Department of Periodontics and Dental Hygiene, The University of Texas Health Science Center at Houston School of Dentistry, Houston, Texas, USA
| | - Kai Zhang
- The University of Texas Health Science Center at Houston, McWilliams School of Biomedical Informatics, Houston, Texas, USA
| | - Wen Li
- Division of Clinical and Translational Sciences, Department of Internal Medicine, The University of Texas McGovern Medical School at Houston, Houston, Texas, USA
- Biostatistics/Epidemiology/Research Design (BERD) Component, Center for Clinical and Translational Sciences (CCTS), The University of Texas Health Science Center at Houston, Houston, Texas, USA
| | - Kaichen Tang
- The University of Texas Health Science Center at Houston, McWilliams School of Biomedical Informatics, Houston, Texas, USA
| | - Yaobin Ling
- The University of Texas Health Science Center at Houston, McWilliams School of Biomedical Informatics, Houston, Texas, USA
| | - Muhammad F Walji
- Department of Diagnostic and Biomedical Sciences, The University of Texas Health Science Center at Houston School of Dentistry, Houston, Texas, USA
| | - Xiaoqian Jiang
- The University of Texas Health Science Center at Houston, McWilliams School of Biomedical Informatics, Houston, Texas, USA
| |
Collapse
|
5
|
Chen R, Chen J, Yang S, Luo S, Xiao Z, Lu L, Liang B, Liu S, Shi H, Xu J. Prediction of prognosis in COVID-19 patients using machine learning: A systematic review and meta-analysis. Int J Med Inform 2023; 177:105151. [PMID: 37473658 DOI: 10.1016/j.ijmedinf.2023.105151] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Revised: 07/07/2023] [Accepted: 07/08/2023] [Indexed: 07/22/2023]
Abstract
BACKGROUND Accurate prediction of prognostic outcomes in patients with COVID-19 could facilitate clinical decision-making and medical resource allocation. However, little is known about the ability of machine learning (ML) to predict prognosis in COVID-19 patients. OBJECTIVE This study aimed to systematically examine the prognostic value of ML in patients with COVID-19. METHODS A systematic search was conducted in PubMed, Web of Science, Embase, Cochrane Library, and IEEE Xplore up to December 15, 2021. Studies predicting the prognostic outcomes of COVID-19 patients using ML were eligible for inclusion. Risk of bias was evaluated by a tailored checklist based on Quality Assessment of Diagnostic Accuracy Studies 2 (QUADAS-2). Pooled sensitivity, specificity, and area under the receiver operating curve (AUC) were calculated to evaluate model performance. RESULTS A total of 33 studies that described 35 models were eligible for inclusion, with 27 models presenting mortality, four intensive care unit (ICU) admission, and four use of ventilation. For predicting mortality, ML gave a pooled sensitivity of 0.86 (95% CI, 0.79-0.90), a specificity of 0.87 (95% CI, 0.80-0.92), and an AUC of 0.93 (95% CI, 0.90-0.95). For the prediction of ICU admission, ML had a sensitivity of 0.86 (95% CI, 0.78-0.92), a specificity of 0.81 (95% CI, 0.66-0.91), and an AUC of 0.91 (95% CI, 0.88-0.93). For the prediction of ventilation, ML had a sensitivity of 0.81 (95% CI, 0.68-0.90), a specificity of 0.78 (95% CI, 0.66-0.87), and an AUC of 0.87 (95% CI, 0.83-0.89). Meta-regression analyses indicated that algorithm, population, study design, and source of dataset influenced the pooled estimate. CONCLUSION This meta-analysis demonstrated the satisfactory performance of ML in predicting prognostic outcomes in patients with COVID-19, suggesting the potential value of ML to support clinical decision-making. However, improvements to methodology and validation are still necessary before its application in routine clinical practice.
Collapse
Affiliation(s)
- Ruiyao Chen
- Shanghai Artificial Intelligence Laboratory, Shanghai, China
| | - Jiayuan Chen
- Shanghai Artificial Intelligence Laboratory, Shanghai, China
| | - Sen Yang
- Shanghai Artificial Intelligence Laboratory, Shanghai, China
| | - Shuqing Luo
- Shanghai Artificial Intelligence Laboratory, Shanghai, China
| | - Zhongzhou Xiao
- Shanghai Artificial Intelligence Laboratory, Shanghai, China
| | - Lu Lu
- Shanghai Artificial Intelligence Laboratory, Shanghai, China
| | - Bilin Liang
- Shanghai Artificial Intelligence Laboratory, Shanghai, China
| | | | - Huwei Shi
- Shanghai Artificial Intelligence Laboratory, Shanghai, China
| | - Jie Xu
- Shanghai Artificial Intelligence Laboratory, Shanghai, China.
| |
Collapse
|
6
|
Abbasi Habashi S, Koyuncu M, Alizadehsani R. A Survey of COVID-19 Diagnosis Using Routine Blood Tests with the Aid of Artificial Intelligence Techniques. Diagnostics (Basel) 2023; 13:1749. [PMID: 37238232 PMCID: PMC10217633 DOI: 10.3390/diagnostics13101749] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Revised: 04/19/2023] [Accepted: 04/29/2023] [Indexed: 05/28/2023] Open
Abstract
Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2), causing a disease called COVID-19, is a class of acute respiratory syndrome that has considerably affected the global economy and healthcare system. This virus is diagnosed using a traditional technique known as the Reverse Transcription Polymerase Chain Reaction (RT-PCR) test. However, RT-PCR customarily outputs a lot of false-negative and incorrect results. Current works indicate that COVID-19 can also be diagnosed using imaging resolutions, including CT scans, X-rays, and blood tests. Nevertheless, X-rays and CT scans cannot always be used for patient screening because of high costs, radiation doses, and an insufficient number of devices. Therefore, there is a requirement for a less expensive and faster diagnostic model to recognize the positive and negative cases of COVID-19. Blood tests are easily performed and cost less than RT-PCR and imaging tests. Since biochemical parameters in routine blood tests vary during the COVID-19 infection, they may supply physicians with exact information about the diagnosis of COVID-19. This study reviewed some newly emerging artificial intelligence (AI)-based methods to diagnose COVID-19 using routine blood tests. We gathered information about research resources and inspected 92 articles that were carefully chosen from a variety of publishers, such as IEEE, Springer, Elsevier, and MDPI. Then, these 92 studies are classified into two tables which contain articles that use machine Learning and deep Learning models to diagnose COVID-19 while using routine blood test datasets. In these studies, for diagnosing COVID-19, Random Forest and logistic regression are the most widely used machine learning methods and the most widely used performance metrics are accuracy, sensitivity, specificity, and AUC. Finally, we conclude by discussing and analyzing these studies which use machine learning and deep learning models and routine blood test datasets for COVID-19 detection. This survey can be the starting point for a novice-/beginner-level researcher to perform on COVID-19 classification.
Collapse
Affiliation(s)
| | - Murat Koyuncu
- Department of Information Systems Engineering, Atilim University, 06830 Ankara, Turkey;
| | - Roohallah Alizadehsani
- Institute for Intelligent Systems Research and Innovation (IISRI), Deakin University, Waurn Ponds, Geelong, VIC 3216, Australia
| |
Collapse
|
7
|
Gafni-Pappas G, Khan M. Predicting daily emergency department visits using machine learning could increase accuracy. Am J Emerg Med 2023; 65:5-11. [PMID: 36574748 DOI: 10.1016/j.ajem.2022.12.019] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Accepted: 12/18/2022] [Indexed: 12/24/2022] Open
Abstract
OBJECTIVE Administrators and clinicians alike have attempted to predict emergency department visits for many years. The ability to predict or "forecast" ED visit volume can allow for more efficient resource allocation, including up-staffing or down-staffing, changing OR schedules, and predicting the need for significant resources. The goal of this study is to examine combinations of variables via machine learning to increase prediction accuracy and determine the factors that are most predictive of overall ED visits. As compared to a simple univariate time series model, we hypothesize that machine learning models will predict St. Joseph Mercy Ann Arbor's patient visit load for the emergency department (ED) with higher accuracy than a simple univariate time series model. METHODS Univariate time series models for daily ED visits, including ARIMA, Exponential Smoothing (ETS), and Facebook Inc.'s prophet algorithm were estimated as a baseline comparison. Machine learning models, including random forests and gradient boosted machines (GBM), were trained using data from 2017 to 2018. After final models were created, they were applied to the 2019 data to determine how well these models predicted actual ED patient volumes in data not utilized during the model fitting process. The accuracy of the machine learning and time series models were assessed based on out-of-sample predictive accuracy, compared using root mean squared error (RMSE). RESULTS Using root mean squared error (RMSE) to assess out-of-sample predictive accuracy of the models, the results showed that the random forest model was the most accurate at predicting daily ED visits in the 2019 test set, followed by the GBM model. These performed only slightly better than the simple exponential smoothing model predictions. The ARIMA model performed poorly in comparison. The day of the week (likely capturing differences between weekdays and weekends) was found to be the most important predictor of patient volumes. Weather-related features such as maximum temperature and SFC pressure appeared to capture some of the seasonality trends related to changes in patient volumes. CONCLUSIONS Machine learning models perform better at predicting daily patient volumes as compared to simple univariate time series models, though not by a substantial amount. Further research can help confirm these limited initial results. Gathering more training data and additional feature engineering could also be beneficial to training the models and potentially improving predictive accuracy.
Collapse
Affiliation(s)
- Gregory Gafni-Pappas
- Department of Emergency Medicine, St. Joseph Mercy Hospital, Ann Arbor, MI, USA.
| | - Mohammad Khan
- Department of Emergency Medicine, NYU Langone Medical Center, New York, NY, USA.
| |
Collapse
|
8
|
Cardiovascular and Renal Comorbidities Included into Neural Networks Predict the Outcome in COVID-19 Patients Admitted to an Intensive Care Unit: Three-Center, Cross-Validation, Age- and Sex-Matched Study. J Cardiovasc Dev Dis 2023; 10:jcdd10020039. [PMID: 36826535 PMCID: PMC9967447 DOI: 10.3390/jcdd10020039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Revised: 01/16/2023] [Accepted: 01/19/2023] [Indexed: 01/25/2023] Open
Abstract
Here, we performed a multicenter, age- and sex-matched study to compare the efficiency of various machine learning algorithms in the prediction of COVID-19 fatal outcomes and to develop sensitive, specific, and robust artificial intelligence tools for the prompt triage of patients with severe COVID-19 in the intensive care unit setting. In a challenge against other established machine learning algorithms (decision trees, random forests, extra trees, neural networks, k-nearest neighbors, and gradient boosting: XGBoost, LightGBM, and CatBoost) and multivariate logistic regression as a reference, neural networks demonstrated the highest sensitivity, sufficient specificity, and excellent robustness. Further, neural networks based on coronary artery disease/chronic heart failure, stage 3-5 chronic kidney disease, blood urea nitrogen, and C-reactive protein as the predictors exceeded 90% sensitivity and 80% specificity, reaching AUROC of 0.866 at primary cross-validation and 0.849 at secondary cross-validation on virtual samples generated by the bootstrapping procedure. These results underscore the impact of cardiovascular and renal comorbidities in the context of thrombotic complications characteristic of severe COVID-19. As aforementioned predictors can be obtained from the case histories or are inexpensive to be measured at admission to the intensive care unit, we suggest this predictor composition is useful for the triage of critically ill COVID-19 patients.
Collapse
|
9
|
Sievering AW, Wohlmuth P, Geßler N, Gunawardene MA, Herrlinger K, Bein B, Arnold D, Bergmann M, Nowak L, Gloeckner C, Koch I, Bachmann M, Herborn CU, Stang A. Comparison of machine learning methods with logistic regression analysis in creating predictive models for risk of critical in-hospital events in COVID-19 patients on hospital admission. BMC Med Inform Decis Mak 2022; 22:309. [PMID: 36437469 PMCID: PMC9702742 DOI: 10.1186/s12911-022-02057-4] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2022] [Accepted: 11/17/2022] [Indexed: 11/29/2022] Open
Abstract
BACKGROUND Machine learning (ML) algorithms have been trained to early predict critical in-hospital events from COVID-19 using patient data at admission, but little is known on how their performance compares with each other and/or with statistical logistic regression (LR). This prospective multicentre cohort study compares the performance of a LR and five ML models on the contribution of influencing predictors and predictor-to-event relationships on prediction model´s performance. METHODS We used 25 baseline variables of 490 COVID-19 patients admitted to 8 hospitals in Germany (March-November 2020) to develop and validate (75/25 random-split) 3 linear (L1 and L2 penalty, elastic net [EN]) and 2 non-linear (support vector machine [SVM] with radial kernel, random forest [RF]) ML approaches for predicting critical events defined by intensive care unit transfer, invasive ventilation and/or death (composite end-point: 181 patients). Models were compared for performance (area-under-the-receiver-operating characteristic-curve [AUC], Brier score) and predictor importance (performance-loss metrics, partial-dependence profiles). RESULTS Models performed close with a small benefit for LR (utilizing restricted cubic splines for non-linearity) and RF (AUC means: 0.763-0.731 [RF-L1]); Brier scores: 0.184-0.197 [LR-L1]). Top ranked predictor variables (consistently highest importance: C-reactive protein) were largely identical across models, except creatinine, which exhibited marginal (L1, L2, EN, SVM) or high/non-linear effects (LR, RF) on events. CONCLUSIONS Although the LR and ML models analysed showed no strong differences in performance and the most influencing predictors for COVID-19-related event prediction, our results indicate a predictive benefit from taking account for non-linear predictor-to-event relationships and effects. Future efforts should focus on leveraging data-driven ML technologies from static towards dynamic modelling solutions that continuously learn and adapt to changes in data environments during the evolving pandemic. TRIAL REGISTRATION NUMBER NCT04659187.
Collapse
Affiliation(s)
| | - Peter Wohlmuth
- Semmelweis University, Asklepios Campus Hamburg, Budapest, Hungary.,Asklepios Proresearch, Research Institute, Hamburg, Germany
| | - Nele Geßler
- Semmelweis University, Asklepios Campus Hamburg, Budapest, Hungary.,Asklepios Proresearch, Research Institute, Hamburg, Germany.,Department of Cardiology and Intensive Care Medicine, Asklepios Hospital St. Georg, Hamburg, Germany
| | - Melanie A Gunawardene
- Department of Cardiology and Intensive Care Medicine, Asklepios Hospital St. Georg, Hamburg, Germany
| | - Klaus Herrlinger
- Department of Internal Medicine, Asklepios Hospital Nord-Heidberg, Hamburg, Germany.,Asklepios Tumorzentrum, Hamburg, Germany
| | - Berthold Bein
- Department of Anesthesiology and Intensive Care Medicine, Asklepios Hospital St. Georg, Hamburg, Germany
| | - Dirk Arnold
- Asklepios Tumorzentrum, Hamburg, Germany.,Department of Hematology, Oncology, Palliative Care and Rheumatology, Asklepios Hospital Altona, Hamburg, Germany
| | - Martin Bergmann
- Department of Internal Medicine, Cardiology, and Pneumology, Asklepios Hospital Wandsbek, Hamburg, Germany
| | - Lorenz Nowak
- Department of Intensive Care and Ventilation Medicine, Asklepios Hospital München-Gauting, Gauting, Germany
| | - Christian Gloeckner
- Department of Internal Medicine, Asklepios Hospital Oberviechtach, Oberviechtach, Germany
| | - Ina Koch
- Biobank for Pulmonary Diseases, Asklepios Hospital München-Gauting, Gauting, Germany
| | - Martin Bachmann
- Department of Intensive Care and Ventilatory Medicine, Asklepios Hospital Harburg, Hamburg, Germany
| | - Christoph U Herborn
- Semmelweis University, Asklepios Campus Hamburg, Budapest, Hungary.,Asklepios Hospitals GmbH & Co. KGaA, Hamburg, Germany
| | - Axel Stang
- Semmelweis University, Asklepios Campus Hamburg, Budapest, Hungary. .,Asklepios Tumorzentrum, Hamburg, Germany. .,Department of Hematology, Oncology and Palliative Care Medicine, Asklepios Hospital Barmbek, Rübenkamp 220, 22291, Hamburg, Germany.
| |
Collapse
|
10
|
Lasker A, Obaidullah SM, Chakraborty C, Roy K. Application of Machine Learning and Deep Learning Techniques for COVID-19 Screening Using Radiological Imaging: A Comprehensive Review. SN COMPUTER SCIENCE 2022; 4:65. [PMID: 36467853 PMCID: PMC9702883 DOI: 10.1007/s42979-022-01464-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/22/2022] [Accepted: 10/18/2022] [Indexed: 11/26/2022]
Abstract
Lung, being one of the most important organs in human body, is often affected by various SARS diseases, among which COVID-19 has been found to be the most fatal disease in recent times. In fact, SARS-COVID 19 led to pandemic that spreads fast among the community causing respiratory problems. Under such situation, radiological imaging-based screening [mostly chest X-ray and computer tomography (CT) modalities] has been performed for rapid screening of the disease as it is a non-invasive approach. Due to scarcity of physician/chest specialist/expert doctors, technology-enabled disease screening techniques have been developed by several researchers with the help of artificial intelligence and machine learning (AI/ML). It can be remarkably observed that the researchers have introduced several AI/ML/DL (deep learning) algorithms for computer-assisted detection of COVID-19 using chest X-ray and CT images. In this paper, a comprehensive review has been conducted to summarize the works related to applications of AI/ML/DL for diagnostic prediction of COVID-19, mainly using X-ray and CT images. Following the PRISMA guidelines, total 265 articles have been selected out of 1715 published articles till the third quarter of 2021. Furthermore, this review summarizes and compares varieties of ML/DL techniques, various datasets, and their results using X-ray and CT imaging. A detailed discussion has been made on the novelty of the published works, along with advantages and limitations.
Collapse
Affiliation(s)
- Asifuzzaman Lasker
- Department of Computer Science & Engineering, Aliah University, Kolkata, India
| | - Sk Md Obaidullah
- Department of Computer Science & Engineering, Aliah University, Kolkata, India
| | - Chandan Chakraborty
- Department of Computer Science & Engineering, National Institute of Technical Teachers’ Training & Research Kolkata, Kolkata, India
| | - Kaushik Roy
- Department of Computer Science, West Bengal State University, Barasat, India
| |
Collapse
|
11
|
Bernau CR, Knödler M, Emonts J, Jäpel RC, Buyel JF. The use of predictive models to develop chromatography-based purification processes. Front Bioeng Biotechnol 2022; 10:1009102. [PMID: 36312533 PMCID: PMC9605695 DOI: 10.3389/fbioe.2022.1009102] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Accepted: 09/23/2022] [Indexed: 11/13/2022] Open
Abstract
Chromatography is the workhorse of biopharmaceutical downstream processing because it can selectively enrich a target product while removing impurities from complex feed streams. This is achieved by exploiting differences in molecular properties, such as size, charge and hydrophobicity (alone or in different combinations). Accordingly, many parameters must be tested during process development in order to maximize product purity and recovery, including resin and ligand types, conductivity, pH, gradient profiles, and the sequence of separation operations. The number of possible experimental conditions quickly becomes unmanageable. Although the range of suitable conditions can be narrowed based on experience, the time and cost of the work remain high even when using high-throughput laboratory automation. In contrast, chromatography modeling using inexpensive, parallelized computer hardware can provide expert knowledge, predicting conditions that achieve high purity and efficient recovery. The prediction of suitable conditions in silico reduces the number of empirical tests required and provides in-depth process understanding, which is recommended by regulatory authorities. In this article, we discuss the benefits and specific challenges of chromatography modeling. We describe the experimental characterization of chromatography devices and settings prior to modeling, such as the determination of column porosity. We also consider the challenges that must be overcome when models are set up and calibrated, including the cross-validation and verification of data-driven and hybrid (combined data-driven and mechanistic) models. This review will therefore support researchers intending to establish a chromatography modeling workflow in their laboratory.
Collapse
Affiliation(s)
- C. R. Bernau
- Fraunhofer Institute for Molecular Biology and Applied Ecology IME, Aachen, Germany
| | - M. Knödler
- Fraunhofer Institute for Molecular Biology and Applied Ecology IME, Aachen, Germany
- Institute for Molecular Biotechnology, RWTH Aachen University, Aachen, Germany
| | - J. Emonts
- Fraunhofer Institute for Molecular Biology and Applied Ecology IME, Aachen, Germany
| | - R. C. Jäpel
- Fraunhofer Institute for Molecular Biology and Applied Ecology IME, Aachen, Germany
- Institute for Molecular Biotechnology, RWTH Aachen University, Aachen, Germany
| | - J. F. Buyel
- Fraunhofer Institute for Molecular Biology and Applied Ecology IME, Aachen, Germany
- Institute for Molecular Biotechnology, RWTH Aachen University, Aachen, Germany
- University of Natural Resources and Life Sciences, Vienna (BOKU), Department of Biotechnology (DBT), Institute of Bioprocess Science and Engineering (IBSE), Vienna, Austria
- *Correspondence: J. F. Buyel,
| |
Collapse
|
12
|
Maestre-Muñiz MM, Arias Á, Lucendo AJ. Predicting In-Hospital Mortality in Severe COVID-19: A Systematic Review and External Validation of Clinical Prediction Rules. Biomedicines 2022; 10:biomedicines10102414. [PMID: 36289676 PMCID: PMC9599062 DOI: 10.3390/biomedicines10102414] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2022] [Revised: 09/02/2022] [Accepted: 09/05/2022] [Indexed: 12/03/2022] Open
Abstract
Multiple prediction models for risk of in-hospital mortality from COVID-19 have been developed, but not applied, to patient cohorts different to those from which they were derived. The MEDLINE, EMBASE, Scopus, and Web of Science (WOS) databases were searched. Risk of bias and applicability were assessed with PROBAST. Nomograms, whose variables were available in a well-defined cohort of 444 patients from our site, were externally validated. Overall, 71 studies, which derived a clinical prediction rule for mortality outcome from COVID-19, were identified. Predictive variables consisted of combinations of patients′ age, chronic conditions, dyspnea/taquipnea, radiographic chest alteration, and analytical values (LDH, CRP, lymphocytes, D-dimer); and markers of respiratory, renal, liver, and myocardial damage, which were mayor predictors in several nomograms. Twenty-five models could be externally validated. Areas under receiver operator curve (AUROC) in predicting mortality ranged from 0.71 to 1 in derivation cohorts; C-index values ranged from 0.823 to 0.970. Overall, 37/71 models provided very-good-to-outstanding test performance. Externally validated nomograms provided lower predictive performances for mortality in their respective derivation cohorts, with the AUROC being 0.654 to 0.806 (poor to acceptable performance). We can conclude that available nomograms were limited in predicting mortality when applied to different populations from which they were derived.
Collapse
Affiliation(s)
- Modesto M. Maestre-Muñiz
- Department of Internal Medicine, Hospital General de Tomelloso, 13700 Ciudad Real, Spain
- Department of Medicine and Medical Specialties, Universidad de Alcalá, 28801 Alcalá de Henares, Spain
| | - Ángel Arias
- Hospital General La Mancha Centro, Research Unit, Alcázar de San Juan, 13600 Ciudad Real, Spain
- Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd), Instituto de Salud Carlos III, 28006 Madrid, Spain
- Instituto de Investigación Sanitaria La Princesa, 28006 Madrid, Spain
- Instituto de Investigación Sanitaria de Castilla-La Mancha (IDISCAM), 13700 Tomelloso, Spain
| | - Alfredo J. Lucendo
- Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd), Instituto de Salud Carlos III, 28006 Madrid, Spain
- Instituto de Investigación Sanitaria La Princesa, 28006 Madrid, Spain
- Instituto de Investigación Sanitaria de Castilla-La Mancha (IDISCAM), 13700 Tomelloso, Spain
- Department of Gastroenterology, Hospital General de Tomelloso, 13700 Ciudad Real, Spain
- Correspondence: ; Tel.: +34-926-525-927
| |
Collapse
|
13
|
Leiner J, Pellissier V, König S, Hohenstein S, Ueberham L, Nachtigall I, Meier-Hellmann A, Kuhlen R, Hindricks G, Bollmann A. Machine learning-derived prediction of in-hospital mortality in patients with severe acute respiratory infection: analysis of claims data from the German-wide Helios hospital network. Respir Res 2022; 23:264. [PMID: 36151525 PMCID: PMC9502925 DOI: 10.1186/s12931-022-02180-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Accepted: 09/05/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Severe acute respiratory infections (SARI) are the most common infectious causes of death. Previous work regarding mortality prediction models for SARI using machine learning (ML) algorithms that can be useful for both individual risk stratification and quality of care assessment is scarce. We aimed to develop reliable models for mortality prediction in SARI patients utilizing ML algorithms and compare its performances with a classic regression analysis approach. METHODS Administrative data (dataset randomly split 75%/25% for model training/testing) from years 2016-2019 of 86 German Helios hospitals was retrospectively analyzed. Inpatient SARI cases were defined by ICD-codes J09-J22. Three ML algorithms were evaluated and its performance compared to generalized linear models (GLM) by computing receiver operating characteristic area under the curve (AUC) and area under the precision-recall curve (AUPRC). RESULTS The dataset contained 241,988 inpatient SARI cases (75 years or older: 49%; male 56.2%). In-hospital mortality was 11.6%. AUC and AUPRC in the testing dataset were 0.83 and 0.372 for GLM, 0.831 and 0.384 for random forest (RF), 0.834 and 0.382 for single layer neural network (NNET) and 0.834 and 0.389 for extreme gradient boosting (XGBoost). Statistical comparison of ROC AUCs revealed a better performance of NNET and XGBoost as compared to GLM. CONCLUSION ML algorithms for predicting in-hospital mortality were trained and tested on a large real-world administrative dataset of SARI patients and showed good discriminatory performances. Broad application of our models in clinical routine practice can contribute to patients' risk assessment and quality management.
Collapse
Affiliation(s)
- Johannes Leiner
- Department of Electrophysiology, Heart Center Leipzig at University of Leipzig, Leipzig, Germany. .,Real World Evidence and Health Technology Assessment, Helios Health Institute, Berlin, Germany.
| | - Vincent Pellissier
- Real World Evidence and Health Technology Assessment, Helios Health Institute, Berlin, Germany
| | - Sebastian König
- Department of Electrophysiology, Heart Center Leipzig at University of Leipzig, Leipzig, Germany.,Real World Evidence and Health Technology Assessment, Helios Health Institute, Berlin, Germany
| | - Sven Hohenstein
- Real World Evidence and Health Technology Assessment, Helios Health Institute, Berlin, Germany
| | - Laura Ueberham
- Clinic for Cardiology, University Hospital Leipzig, Leipzig, Germany
| | - Irit Nachtigall
- Department of Infectious Diseases and Infection Prevention, Helios Hospital Emil-von-Behring, Berlin, Germany.,Institute of Hygiene and Environmental Medicine, Charité - Universitaetsmedizin Berlin, Berlin, Germany
| | | | | | - Gerhard Hindricks
- Department of Electrophysiology, Heart Center Leipzig at University of Leipzig, Leipzig, Germany
| | - Andreas Bollmann
- Department of Electrophysiology, Heart Center Leipzig at University of Leipzig, Leipzig, Germany.,Real World Evidence and Health Technology Assessment, Helios Health Institute, Berlin, Germany
| |
Collapse
|
14
|
Guadiana-Alvarez JL, Hussain F, Morales-Menendez R, Rojas-Flores E, García-Zendejas A, Escobar CA, Ramírez-Mendoza RA, Wang J. Prognosis patients with COVID-19 using deep learning. BMC Med Inform Decis Mak 2022; 22:78. [PMID: 35346166 PMCID: PMC8959787 DOI: 10.1186/s12911-022-01820-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2021] [Accepted: 03/14/2022] [Indexed: 11/18/2022] Open
Abstract
Background The coronavirus (COVID-19) is a novel pandemic and recently we do not have enough knowledge about the virus behaviour and key performance indicators (KPIs) to assess the mortality risk forecast. However, using a lot of complex and expensive biomarkers could be impossible for many low budget hospitals. Timely identification of the risk of mortality of COVID-19 patients (RMCPs) is essential to improve hospitals' management systems and resource allocation standards. Methods For the mortality risk prediction, this research work proposes a COVID-19 mortality risk calculator based on a deep learning (DL) model and based on a dataset provided by the HM Hospitals Madrid, Spain. A pre-processing strategy for unbalanced classes and feature selection is proposed. To evaluate the proposed methods, an over-sampling Synthetic Minority TEchnique (SMOTE) and data imputation approaches are introduced which is based on the K-nearest neighbour. Results A total of 1,503 seriously ill COVID-19 patients having a median age of 70 years old are comprised in the research work, with 927 (61.7%) males and 576 (38.3%) females. A total of 48 features are considered to evaluate the proposed method, and the following results are achieved. It includes the following values i.e., area under the curve (AUC) 0.93, F2 score 0.93, recall 1.00, accuracy, 0.95, precision 0.91, specificity 0.9279 and maximum probability of correct decision (MPCD) 0.93. Conclusion The results show that the proposed method is significantly best for the mortality risk prediction of patients with COVID-19 infection. The MPCD score shows that the proposed DL outperforms on every dataset when evaluating even with an over-sampling technique. The benefits of the data imputation algorithm for unavailable biomarker data are also evaluated. Based on the results, the proposed scheme could be an appropriate tool for critically ill Covid-19 patients to assess the risk of mortality and prognosis.
Collapse
|
15
|
Deep-Learning-Based Survival Prediction of Patients in Coronary Care Units. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2022; 2021:5745304. [PMID: 34976110 PMCID: PMC8720014 DOI: 10.1155/2021/5745304] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/09/2021] [Revised: 11/03/2021] [Accepted: 12/08/2021] [Indexed: 11/17/2022]
Abstract
Background A survival prediction model based on deep learning has higher accuracy than the CPH model in predicting the survival of CCU patients, and it also has a better discrimination ability. We collected information on patients with various diseases in coronary care units (CCUs) from the Medical Information Mart for Intensive Care III (MIMIC-III) database. The purpose of this study was to use this information to construct a neural-network model based on deep learning to predict the survival probabilities of patients with conditions that are common in CCUs. Method We collected information on patients in the United States with five common diseases in CCUs from 2001 to 2012. We randomly divided the patients into a training cohort and a testing cohort at a ratio of 7 : 3 and applied a survival prediction method based on deep learning to predict their survival probability. We compared our model with the Cox proportional-hazards regression (CPH) model and used the concordance indexes (C-indexes), receiver operating characteristic (ROC) curve, and calibration plots to evaluate the predictive performance of the model. Results The 3,388 CCU patients included in the study were randomly divided into 2,371 in the training cohort and 1,017 in the testing cohort. The stepwise regression results showed that the important factors affecting patient survival were the type of disease, age, race, anion gap, glucose, neutrophils, white blood cells, potassium, creatine kinase, and blood urea nitrogen (P < 0.05). We used the training cohort to construct a deep-learning model, for which the C-index was 0.833, or about 5% higher than that for the CPH model (0.786). The C-index of the deep-learning model for the test cohort was 0.822, which was also higher than that for the CPH model (0.782). The areas under the ROC curve for the 28-day, 90-day, and 1-year survival probabilities were 0.875, 0.865, and 0.874, respectively, in the deep-learning model, respectively, and 0.830, 0.843, and 0.806 in the CPH model. These values indicate that the survival analysis model based on deep learning is better than the traditional CPH model in predicting the survival of CCU patients. Conclusion A survival prediction model based on deep learning has higher accuracy than the CPH model in predicting the survival of CCU patients, and it also has a better discrimination ability.
Collapse
|
16
|
Al-Hindawi A, Abdulaal A, Rawson TM, Alqahtani SA, Mughal N, Moore LSP. COVID-19 Prognostic Models: A Pro-con Debate for Machine Learning vs. Traditional Statistics. Front Digit Health 2021; 3:637944. [PMID: 35005694 PMCID: PMC8734592 DOI: 10.3389/fdgth.2021.637944] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2020] [Accepted: 11/15/2021] [Indexed: 01/08/2023] Open
Abstract
The SARS-CoV-2 virus, which causes the COVID-19 pandemic, has had an unprecedented impact on healthcare requiring multidisciplinary innovation and novel thinking to minimize impact and improve outcomes. Wide-ranging disciplines have collaborated including diverse clinicians (radiology, microbiology, and critical care), who are working increasingly closely with data-science. This has been leveraged through the democratization of data-science with the increasing availability of easy to access open datasets, tutorials, programming languages, and hardware which makes it significantly easier to create mathematical models. To address the COVID-19 pandemic, such data-science has enabled modeling of the impact of the virus on the population and individuals for diagnostic, prognostic, and epidemiological ends. This has led to two large systematic reviews on this topic that have highlighted the two different ways in which this feat has been attempted: one using classical statistics and the other using more novel machine learning techniques. In this review, we debate the relative strengths and weaknesses of each method toward the specific task of predicting COVID-19 outcomes.
Collapse
Affiliation(s)
- Ahmed Al-Hindawi
- Chelsea and Westminster NHS Foundation Trust, London, United Kingdom
| | - Ahmed Abdulaal
- Faculty of Medicine, Imperial College London, London, United Kingdom
| | - Timothy M. Rawson
- Health Protection Research Unit for Healthcare Associated Infections and Antimicrobial Resistance, Imperial College London, London, United Kingdom
- Centre for Antimicrobial Optimisation, Imperial College London, London, United Kingdom
| | - Saleh A. Alqahtani
- King Faisal Specialist Hospital and Research Centre, Riyadh, Saudi Arabia
- Johns Hopkins University, Baltimore, MD, United States
| | - Nabeela Mughal
- Chelsea and Westminster NHS Foundation Trust, London, United Kingdom
- Faculty of Medicine, Imperial College London, London, United Kingdom
- North West London Pathology, Imperial College Healthcare NHS Trust, London, United Kingdom
| | - Luke S. P. Moore
- Chelsea and Westminster NHS Foundation Trust, London, United Kingdom
- Faculty of Medicine, Imperial College London, London, United Kingdom
- North West London Pathology, Imperial College Healthcare NHS Trust, London, United Kingdom
| |
Collapse
|
17
|
Bottino F, Tagliente E, Pasquini L, Napoli AD, Lucignani M, Figà-Talamanca L, Napolitano A. COVID Mortality Prediction with Machine Learning Methods: A Systematic Review and Critical Appraisal. J Pers Med 2021; 11:893. [PMID: 34575670 PMCID: PMC8467935 DOI: 10.3390/jpm11090893] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2021] [Revised: 08/26/2021] [Accepted: 09/03/2021] [Indexed: 12/21/2022] Open
Abstract
More than a year has passed since the report of the first case of coronavirus disease 2019 (COVID), and increasing deaths continue to occur. Minimizing the time required for resource allocation and clinical decision making, such as triage, choice of ventilation modes and admission to the intensive care unit is important. Machine learning techniques are acquiring an increasingly sought-after role in predicting the outcome of COVID patients. Particularly, the use of baseline machine learning techniques is rapidly developing in COVID mortality prediction, since a mortality prediction model could rapidly and effectively help clinical decision-making for COVID patients at imminent risk of death. Recent studies reviewed predictive models for SARS-CoV-2 diagnosis, severity, length of hospital stay, intensive care unit admission or mechanical ventilation modes outcomes; however, systematic reviews focused on prediction of COVID mortality outcome with machine learning methods are lacking in the literature. The present review looked into the studies that implemented machine learning, including deep learning, methods in COVID mortality prediction thus trying to present the existing published literature and to provide possible explanations of the best results that the studies obtained. The study also discussed challenging aspects of current studies, providing suggestions for future developments.
Collapse
Affiliation(s)
- Francesca Bottino
- Medical Physics Department Bambino Gesù Children’s Hospital, Scientific Institute for Research, Hospitalization and Healthcare (IRCCS), 00165 Rome, Italy;
| | - Emanuela Tagliente
- Medical Physics Department Bambino Gesù Children’s Hospital, Scientific Institute for Research, Hospitalization and Healthcare (IRCCS), 00165 Rome, Italy;
| | - Luca Pasquini
- Neuroradiology Unit, NESMOS Department, Sant’Andrea Hospital, La Sapienza University, 00165 Rome, Italy; (L.P.); (A.D.N.)
- Neuroradiology Service, Radiology Department, Memorial Sloan Kettering Cancer Center, New York, NY 1275, USA
| | - Alberto Di Napoli
- Neuroradiology Unit, NESMOS Department, Sant’Andrea Hospital, La Sapienza University, 00165 Rome, Italy; (L.P.); (A.D.N.)
- Radiology Department, Castelli Romani Hospital, 00040 Ariccia (RM), Italy
| | - Martina Lucignani
- Medical Physics Department Bambino Gesù Children’s Hospital, Scientific Institute for Research, Hospitalization and Healthcare (IRCCS), 00165 Rome, Italy;
| | - Lorenzo Figà-Talamanca
- Neuroradiology Unit, Imaging Department, Bambino Gesù Children’s Hospital, Scientific Institute for Research, Hospitalization and Healthcare (IRCCS), 00165 Rome, Italy;
| | - Antonio Napolitano
- Medical Physics Department Bambino Gesù Children’s Hospital, Scientific Institute for Research, Hospitalization and Healthcare (IRCCS), 00165 Rome, Italy;
| |
Collapse
|
18
|
AlJame M, Imtiaz A, Ahmad I, Mohammed A. Deep forest model for diagnosing COVID-19 from routine blood tests. Sci Rep 2021; 11:16682. [PMID: 34404838 PMCID: PMC8371014 DOI: 10.1038/s41598-021-95957-w] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2021] [Accepted: 08/03/2021] [Indexed: 02/07/2023] Open
Abstract
The Coronavirus Disease 2019 (COVID-19) global pandemic has threatened the lives of people worldwide and posed considerable challenges. Early and accurate screening of infected people is vital for combating the disease. To help with the limited quantity of swab tests, we propose a machine learning prediction model to accurately diagnose COVID-19 from clinical and/or routine laboratory data. The model exploits a new ensemble-based method called the deep forest (DF), where multiple classifiers in multiple layers are used to encourage diversity and improve performance. The cascade level employs the layer-by-layer processing and is constructed from three different classifiers: extra trees, XGBoost, and LightGBM. The prediction model was trained and evaluated on two publicly available datasets. Experimental results show that the proposed DF model has an accuracy of 99.5%, sensitivity of 95.28%, and specificity of 99.96%. These performance metrics are comparable to other well-established machine learning techniques, and hence DF model can serve as a fast screening tool for COVID-19 patients at places where testing is scarce.
Collapse
Affiliation(s)
- Maryam AlJame
- Department of Computer Engineering, Kuwait University, Kuwait City, Kuwait.
| | | | - Imtiaz Ahmad
- Department of Computer Engineering, Kuwait University, Kuwait City, Kuwait
| | - Ameer Mohammed
- Department of Computer Engineering, Kuwait University, Kuwait City, Kuwait
| |
Collapse
|
19
|
Abdulaal A, Patel A, Al-Hindawi A, Charani E, Alqahtani SA, Davies GW, Mughal N, Moore LSP. Clinical Utility and Functionality of an Artificial Intelligence-Based App to Predict Mortality in COVID-19: Mixed Methods Analysis. JMIR Form Res 2021; 5:e27992. [PMID: 34115603 PMCID: PMC8320734 DOI: 10.2196/27992] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2021] [Revised: 03/19/2021] [Accepted: 05/31/2021] [Indexed: 12/24/2022] Open
Abstract
Background The artificial neural network (ANN) is an increasingly important tool in the context of solving complex medical classification problems. However, one of the principal challenges in leveraging artificial intelligence technology in the health care setting has been the relative inability to translate models into clinician workflow. Objective Here we demonstrate the development of a COVID-19 outcome prediction app that utilizes an ANN and assesses its usability in the clinical setting. Methods Usability assessment was conducted using the app, followed by a semistructured end-user interview. Usability was specified by effectiveness, efficiency, and satisfaction measures. These data were reported with descriptive statistics. The end-user interview data were analyzed using the thematic framework method, which allowed for the development of themes from the interview narratives. In total, 31 National Health Service physicians at a West London teaching hospital, including foundation physicians, senior house officers, registrars, and consultants, were included in this study. Results All participants were able to complete the assessment, with a mean time to complete separate patient vignettes of 59.35 (SD 10.35) seconds. The mean system usability scale score was 91.94 (SD 8.54), which corresponds to a qualitative rating of “excellent.” The clinicians found the app intuitive and easy to use, with the majority describing its predictions as a useful adjunct to their clinical practice. The main concern was related to the use of the app in isolation rather than in conjunction with other clinical parameters. However, most clinicians speculated that the app could positively reinforce or validate their clinical decision-making. Conclusions Translating artificial intelligence technologies into the clinical setting remains an important but challenging task. We demonstrate the effectiveness, efficiency, and system usability of a web-based app designed to predict the outcomes of patients with COVID-19 from an ANN.
Collapse
Affiliation(s)
- Ahmed Abdulaal
- Chelsea and Westminster NHS Foundation Trust, London, United Kingdom
| | - Aatish Patel
- Chelsea and Westminster NHS Foundation Trust, London, United Kingdom
| | - Ahmed Al-Hindawi
- Chelsea and Westminster NHS Foundation Trust, London, United Kingdom
| | - Esmita Charani
- National Institute for Health Research Health Protection Research Unit in Healthcare Associated Infections and Antimicrobial Resistance, Imperial College London, London, United Kingdom
| | - Saleh A Alqahtani
- Johns Hopkins University, Baltimore, MD, United States.,King Faisal Specialist Hospital and Research Centre, Riyadh, Saudi Arabia
| | - Gary W Davies
- Chelsea and Westminster NHS Foundation Trust, London, United Kingdom
| | - Nabeela Mughal
- Chelsea and Westminster NHS Foundation Trust, London, United Kingdom.,National Institute for Health Research Health Protection Research Unit in Healthcare Associated Infections and Antimicrobial Resistance, Imperial College London, London, United Kingdom.,North West London Pathology, Imperial College Healthcare NHS Trust, London, United Kingdom
| | - Luke Stephen Prockter Moore
- Chelsea and Westminster NHS Foundation Trust, London, United Kingdom.,National Institute for Health Research Health Protection Research Unit in Healthcare Associated Infections and Antimicrobial Resistance, Imperial College London, London, United Kingdom.,North West London Pathology, Imperial College Healthcare NHS Trust, London, United Kingdom
| |
Collapse
|
20
|
Galanter W, Rodríguez-Fernández JM, Chow K, Harford S, Kochendorfer KM, Pishgar M, Theis J, Zulueta J, Darabi H. Predicting clinical outcomes among hospitalized COVID-19 patients using both local and published models. BMC Med Inform Decis Mak 2021; 21:224. [PMID: 34303356 PMCID: PMC8302976 DOI: 10.1186/s12911-021-01576-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2021] [Accepted: 06/29/2021] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND Many models are published which predict outcomes in hospitalized COVID-19 patients. The generalizability of many is unknown. We evaluated the performance of selected models from the literature and our own models to predict outcomes in patients at our institution. METHODS We searched the literature for models predicting outcomes in inpatients with COVID-19. We produced models of mortality or criticality (mortality or ICU admission) in a development cohort. We tested external models which provided sufficient information and our models using a test cohort of our most recent patients. The performance of models was compared using the area under the receiver operator curve (AUC). RESULTS Our literature review yielded 41 papers. Of those, 8 were found to have sufficient documentation and concordance with features available in our cohort to implement in our test cohort. All models were from Chinese patients. One model predicted criticality and seven mortality. Tested against the test cohort, internal models had an AUC of 0.84 (0.74-0.94) for mortality and 0.83 (0.76-0.90) for criticality. The best external model had an AUC of 0.89 (0.82-0.96) using three variables, another an AUC of 0.84 (0.78-0.91) using ten variables. AUC's ranged from 0.68 to 0.89. On average, models tested were unable to produce predictions in 27% of patients due to missing lab data. CONCLUSION Despite differences in pandemic timeline, race, and socio-cultural healthcare context some models derived in China performed well. For healthcare organizations considering implementation of an external model, concordance between the features used in the model and features available in their own patients may be important. Analysis of both local and external models should be done to help decide on what prediction method is used to provide clinical decision support to clinicians treating COVID-19 patients as well as what lab tests should be included in order sets.
Collapse
Affiliation(s)
- William Galanter
- Departments of Medicine and Pharmacy Systems, Outcomes and Policy, University of Illinois At Chicago (UIC), Chicago, USA.
| | | | | | - Samuel Harford
- Department of Mechanical and Industrial Engineering, UIC, Chicago, USA
| | | | - Maryam Pishgar
- Department of Mechanical and Industrial Engineering, UIC, Chicago, USA
| | - Julian Theis
- Department of Mechanical and Industrial Engineering, UIC, Chicago, USA
| | | | - Houshang Darabi
- Department of Mechanical and Industrial Engineering, UIC, Chicago, USA
| |
Collapse
|
21
|
Ozdemir MA, Ozdemir GD, Guren O. Classification of COVID-19 electrocardiograms by using hexaxial feature mapping and deep learning. BMC Med Inform Decis Mak 2021; 21:170. [PMID: 34034715 PMCID: PMC8146190 DOI: 10.1186/s12911-021-01521-x] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2021] [Accepted: 05/05/2021] [Indexed: 12/17/2022] Open
Abstract
BACKGROUND Coronavirus disease 2019 (COVID-19) has become a pandemic since its first appearance in late 2019. Deaths caused by COVID-19 are still increasing day by day and early diagnosis has become crucial. Since current diagnostic methods have many disadvantages, new investigations are needed to improve the performance of diagnosis. METHODS A novel method is proposed to automatically diagnose COVID-19 by using Electrocardiogram (ECG) data with deep learning for the first time. Moreover, a new and effective method called hexaxial feature mapping is proposed to represent 12-lead ECG to 2D colorful images. Gray-Level Co-Occurrence Matrix (GLCM) method is used to extract features and generate hexaxial mapping images. These generated images are then fed into a new Convolutional Neural Network (CNN) architecture to diagnose COVID-19. RESULTS Two different classification scenarios are conducted on a publicly available paper-based ECG image dataset to reveal the diagnostic capability and performance of the proposed approach. In the first scenario, ECG data labeled as COVID-19 and No-Findings (normal) are classified to evaluate COVID-19 classification ability. According to results, the proposed approach provides encouraging COVID-19 detection performance with an accuracy of 96.20% and F1-Score of 96.30%. In the second scenario, ECG data labeled as Negative (normal, abnormal, and myocardial infarction) and Positive (COVID-19) are classified to evaluate COVID-19 diagnostic ability. The experimental results demonstrated that the proposed approach provides satisfactory COVID-19 prediction performance with an accuracy of 93.00% and F1-Score of 93.20%. Furthermore, different experimental studies are conducted to evaluate the robustness of the proposed approach. CONCLUSION Automatic detection of cardiovascular changes caused by COVID-19 can be possible with a deep learning framework through ECG data. This not only proves the presence of cardiovascular changes caused by COVID-19 but also reveals that ECG can potentially be used in the diagnosis of COVID-19. We believe the proposed study may provide a crucial decision-making system for healthcare professionals. SOURCE CODE All source codes are made publicly available at: https://github.com/mkfzdmr/COVID-19-ECG-Classification.
Collapse
Affiliation(s)
- Mehmet Akif Ozdemir
- Department of Biomedical Engineering, Faculty of Enigneering and Architecture, Izmir Katip Celebi University, 35620 Cigli, Izmir, Turkey
- Department of Biomedical Technologies, Graduate School of Natural and Applied Sciences, Izmir Katip Celebi University, 35620 Cigli, Izmir, Turkey
| | - Gizem Dilara Ozdemir
- Department of Biomedical Engineering, Faculty of Enigneering and Architecture, Izmir Katip Celebi University, 35620 Cigli, Izmir, Turkey
- Department of Biomedical Technologies, Graduate School of Natural and Applied Sciences, Izmir Katip Celebi University, 35620 Cigli, Izmir, Turkey
| | - Onan Guren
- Department of Biomedical Engineering, Faculty of Enigneering and Architecture, Izmir Katip Celebi University, 35620 Cigli, Izmir, Turkey
| |
Collapse
|
22
|
Tavolara TE, Niazi MKK, Gower AC, Ginese M, Beamer G, Gurcan MN. Deep learning predicts gene expression as an intermediate data modality to identify susceptibility patterns in Mycobacterium tuberculosis infected Diversity Outbred mice. EBioMedicine 2021; 67:103388. [PMID: 34000621 PMCID: PMC8138606 DOI: 10.1016/j.ebiom.2021.103388] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2021] [Revised: 04/22/2021] [Accepted: 04/23/2021] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND Machine learning sustains successful application to many diagnostic and prognostic problems in computational histopathology. Yet, few efforts have been made to model gene expression from histopathology. This study proposes a methodology which predicts selected gene expression values (microarray) from haematoxylin and eosin whole-slide images as an intermediate data modality to identify fulminant-like pulmonary tuberculosis ('supersusceptible') in an experimentally infected cohort of Diversity Outbred mice (n=77). METHODS Gradient-boosted trees were utilized as a novel feature selector to identify gene transcripts predictive of fulminant-like pulmonary tuberculosis. A novel attention-based multiple instance learning model for regression was used to predict selected genes' expression from whole-slide images. Gene expression predictions were shown to be sufficiently replicated to identify supersusceptible mice using gradient-boosted trees trained on ground truth gene expression data. FINDINGS The model was accurate, showing high positive correlations with ground truth gene expression on both cross-validation (n = 77, 0.63 ≤ ρ ≤ 0.84) and external testing sets (n = 33, 0.65 ≤ ρ ≤ 0.84). The sensitivity and specificity for gene expression predictions to identify supersusceptible mice (n=77) were 0.88 and 0.95, respectively, and for an external set of mice (n=33) 0.88 and 0.93, respectively. IMPLICATIONS Our methodology maps histopathology to gene expression with sufficient accuracy to predict a clinical outcome. The proposed methodology exemplifies a computational template for gene expression panels, in which relatively inexpensive and widely available tissue histopathology may be mapped to specific genes' expression to serve as a diagnostic or prognostic tool. FUNDING National Institutes of Health and American Lung Association.
Collapse
Affiliation(s)
- Thomas E Tavolara
- Center for Biomedical Informatics, Wake Forest School of Medicine, 486 Patterson Avenue, Winston-Salem, NC 27101, United States
| | - M K K Niazi
- Center for Biomedical Informatics, Wake Forest School of Medicine, 486 Patterson Avenue, Winston-Salem, NC 27101, United States.
| | - Adam C Gower
- Department of Medicine, Boston University School of Medicine, 72 E. Concord St Evans Building, Boston, MA 02118, United States
| | - Melanie Ginese
- Department of Infectious Disease and Global Health, Tufts University Cummings School of Veterinary Medicine, 200 Westboro Rd., North Grafton, MA 01536, United States
| | - Gillian Beamer
- Department of Infectious Disease and Global Health, Tufts University Cummings School of Veterinary Medicine, 200 Westboro Rd., North Grafton, MA 01536, United States
| | - Metin N Gurcan
- Center for Biomedical Informatics, Wake Forest School of Medicine, 486 Patterson Avenue, Winston-Salem, NC 27101, United States
| |
Collapse
|