1
|
Li H, Ashrafi N, Kang C, Zhao G, Chen Y, Pishgar M. A machine learning-based prediction of hospital mortality in mechanically ventilated ICU patients. PLoS One 2024; 19:e0309383. [PMID: 39231126 PMCID: PMC11373795 DOI: 10.1371/journal.pone.0309383] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2024] [Accepted: 08/10/2024] [Indexed: 09/06/2024] Open
Abstract
BACKGROUND Mechanical ventilation (MV) is vital for critically ill ICU patients but carries significant mortality risks. This study aims to develop a predictive model to estimate hospital mortality among MV patients, utilizing comprehensive health data to assist ICU physicians with early-stage alerts. METHODS We developed a Machine Learning (ML) framework to predict hospital mortality in ICU patients receiving MV. Using the MIMIC-III database, we identified 25,202 eligible patients through ICD-9 codes. We employed backward elimination and the Lasso method, selecting 32 features based on clinical insights and literature. Data preprocessing included eliminating columns with over 90% missing data and using mean imputation for the remaining missing values. To address class imbalance, we used the Synthetic Minority Over-sampling Technique (SMOTE). We evaluated several ML models, including CatBoost, XGBoost, Decision Tree, Random Forest, Support Vector Machine (SVM), K-Nearest Neighbors (KNN), and Logistic Regression, using a 70/30 train-test split. The CatBoost model was chosen for its superior performance in terms of accuracy, precision, recall, F1-score, AUROC metrics, and calibration plots. RESULTS The study involved a cohort of 25,202 patients on MV. The CatBoost model attained an AUROC of 0.862, an increase from an initial AUROC of 0.821, which was the best reported in the literature. It also demonstrated an accuracy of 0.789, an F1-score of 0.747, and better calibration, outperforming other models. These improvements are due to systematic feature selection and the robust gradient boosting architecture of CatBoost. CONCLUSION The preprocessing methodology significantly reduced the number of relevant features, simplifying computational processes, and identified critical features previously overlooked. Integrating these features and tuning the parameters, our model demonstrated strong generalization to unseen data. This highlights the potential of ML as a crucial tool in ICUs, enhancing resource allocation and providing more personalized interventions for MV patients.
Collapse
Affiliation(s)
- Hexin Li
- Department of Industrial and Systems Engineering, University of Southern California (USC), Los Angeles, CA, United States of America
| | - Negin Ashrafi
- Department of Industrial and Systems Engineering, University of Southern California (USC), Los Angeles, CA, United States of America
| | - Chris Kang
- Department of Industrial and Systems Engineering, University of Southern California (USC), Los Angeles, CA, United States of America
| | - Guanlan Zhao
- Department of Industrial and Systems Engineering, University of Southern California (USC), Los Angeles, CA, United States of America
| | - Yubing Chen
- Department of Industrial and Systems Engineering, University of Southern California (USC), Los Angeles, CA, United States of America
| | - Maryam Pishgar
- Department of Industrial and Systems Engineering, University of Southern California (USC), Los Angeles, CA, United States of America
| |
Collapse
|
2
|
Sikora A, Zhao B, Kong Y, Murray B, Shen Y. Machine learning based prediction of prolonged duration of mechanical ventilation incorporating medication data. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.09.18.23295724. [PMID: 37790491 PMCID: PMC10543219 DOI: 10.1101/2023.09.18.23295724] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/05/2023]
Abstract
Rationale Duration of mechanical ventilation is associated with adverse outcomes in critically ill patients and increased use of resources. The increasing complexity of medication regimens has been associated with increased mortality, length of stay, and fluid overload but has never been studied specifically in the setting of mechanical ventilation. Objective The purpose of this analysis was to develop prediction models for mechanical ventilation duration to test the hypothesis that incorporating medication data may improve model performance. Methods This was a retrospective cohort study of adults admitted to the ICU and undergoing mechanical ventilation for longer than 24 hours from October 2015 to October 2020. Patients were excluded if it was not their index ICU admission or if the patient was placed on comfort care in the first 24 hours of admission. Relevant patient characteristics including age, sex, body mass index, admission diagnosis, morbidities, vital signs measurements, severity of illness, medication regimen complexity as measured by the MRC-ICU, and medical treatments before intubation were collected. The primary outcome was area under the receiver operating characteristic (AUROC) of prediction models for prolonged mechanical ventilation (defined as greater than 5 days). Both logistic regression and supervised learning techniques including XGBoost, Random Forest, and Support Vector Machine were used to develop prediction models. Results The 318 patients [age 59.9 (SD 16.9), female 39.3%, medical 28.6%] had mean 24-hour MRC-ICU score of 21.3 (10.5), mean APACHE II score of 21.0 (5.4), mean SOFA score of 9.9 (3.3), and ICU mortality rate of 22.6% (n=72). The strongest performing logistic model was the base model with MRC-ICU added, with AUROC of 0.72, positive predictive value (PPV) of 0.83, and negative prediction value (NPV) of 0.92. The strongest overall model was Random Forest with an AUROC of 0.78, a PPV of 0.53, and NPV of 0.90. Feature importance analysis using support vector machine and Random Forest revealed severity of illness scores and medication related data were the most important predictors. Conclusions Medication regimen complexity is significantly associated with prolonged duration of mechanical ventilation in critically ill patients, and prediction models incorporating medication information showed modest improvement in this prediction.
Collapse
Affiliation(s)
- Andrea Sikora
- University of Georgia College of Pharmacy, Department of Clinical and Administrative Pharmacy, Augusta, GA, USA
| | - Bokai Zhao
- University of Georgia College of Public Health, Epidemiology & Biostatistics, Athens, GA, USA
| | - Yanlei Kong
- Renmin University of China, School of Statistics, Beijing, China
| | - Brian Murray
- University of North Carolina Medical Center, Department of Pharmacy, Chapel Hill, NC, USA
| | - Ye Shen
- University of Georgia College of Public Health, Epidemiology & Biostatistics, Athens, GA, USA
| |
Collapse
|
3
|
Iguina MM, Danyalian AM, Luque I, Shaikh U, Kashan SB, Morgan D, Heller D, Danckers M. Characteristics, ICU Interventions, and Clinical Outcomes of Patients With Palliative Care Triggers in a Mixed Community-Based Intensive Care Unit. J Palliat Care 2023; 38:126-134. [PMID: 36632687 DOI: 10.1177/08258597221145326] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
Objective: Integration of palliative care initiatives in the intensive care unit (ICU) benefit patients and improve outcomes. Palliative care triggers (PCTs) is a screening tool that aides in stratifying patients who would benefit most from an early palliative care approach. There is no consensus on PCT selection or best timing for implementation. We evaluated the clinical characteristics, ICU and palliative care interventions, and clinical outcomes of critically ill patients with PCT in a community-based mixed ICU. Methods: This retrospective study was conducted in a 44-bed adult, mixed ICU in a 407-bed community-based teaching hospital in Florida. Eleven PCTs were used as a screening tool during multidisciplinary rounds (MDRs). Patients were analyzed based on presence or absence of PCT as well as having met high (>2) versus low (<2) PCT. Data collected included patient demographics, ICU resource utilization and clinical outcomes. We considered a two-sided P value of less than .05 to indicate statistical significance with a 95% confidence interval. Results: Of 388 ICU patients, 189 (48.7%) met at least 1 PCT and 199 (51.3%) did not. The trigger group had higher Acute Physiology and Chronic Evaluation (APACHE) and Sequential Organ Failure Assessment (SOFA) scores within 24 h of ICU admission. The most common PCTs identified were ICU length of stay greater than 7 days or readmission to ICU, terminal prognosis and assisting family in transitioning goals of care. There were statistically significant differences in ICU resource utilization, palliative care interventions, and overall worse clinical outcomes in the trigger-detected group. Similar findings were seen in the cohort with high PCT (>2). Conclusions: Our study supports the implementation of a tailored 11-item palliative care screening tool to effectively identify ICU patients with high ICU and palliative care interventions and worse clinical outcomes.
Collapse
Affiliation(s)
- Michele M Iguina
- Department of Medicine, HCA Florida Aventura Hospital, Aventura, FL, USA
- Division of Critical Care, HCA Florida Aventura Hospital, Aventura, FL, USA
| | - Aunie M Danyalian
- Department of Medicine, HCA Florida Aventura Hospital, Aventura, FL, USA
- Division of Critical Care, HCA Florida Aventura Hospital, Aventura, FL, USA
| | - Ilko Luque
- Research Department, Graduate Medical Education, HCA East Florida Division, 23686Aventura Hospital and Medical Center, Aventura, FL, USA
| | - Umair Shaikh
- Department of Medicine, Piedmont Eastside Medical Center, Snellville, GA, USA
| | - Sanaz B Kashan
- Department of Medicine, HCA Florida Aventura Hospital, Aventura, FL, USA
| | - Dionne Morgan
- Department of Medicine, HCA Florida Aventura Hospital, Aventura, FL, USA
- Division of Critical Care, HCA Florida Aventura Hospital, Aventura, FL, USA
| | - Daniel Heller
- Department of Medicine, HCA Florida Aventura Hospital, Aventura, FL, USA
- Division of Critical Care, HCA Florida Aventura Hospital, Aventura, FL, USA
| | - Mauricio Danckers
- Department of Medicine, HCA Florida Aventura Hospital, Aventura, FL, USA
- Division of Critical Care, HCA Florida Aventura Hospital, Aventura, FL, USA
| |
Collapse
|
4
|
Sarmet M, Kabani A, Coelho L, Dos Reis SS, Zeredo JL, Mehta AK. The use of natural language processing in palliative care research: A scoping review. Palliat Med 2023; 37:275-290. [PMID: 36495082 DOI: 10.1177/02692163221141969] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
BACKGROUND Natural language processing has been increasingly used in palliative care research over the last 5 years for its versatility and accuracy. AIM To evaluate and characterize natural language processing use in palliative care research, including the most commonly used natural language processing software and computational methods, data sources, trends in natural language processing use over time, and palliative care topics addressed. DESIGN A scoping review using the framework by Arksey and O'Malley and the updated recommendations proposed by Levac et al. was conducted. SOURCES PubMed, Web of Science, Embase, Scopus, and IEEE Xplore databases were searched for palliative care studies that utilized natural language processing tools. Data on study characteristics and natural language processing instruments used were collected and relevant palliative care topics were identified. RESULTS 197 relevant references were identified. Of these, 82 were included after full-text review. Studies were published in 48 different journals from 2007 to 2022. The average sample size was 21,541 (median 435). Thirty-two different natural language processing software and 33 machine-learning methods were identified. Nine main sources for data processing and 15 main palliative care topics across the included studies were identified. The most frequent topic was mortality and prognosis prediction. We also identified a trend where natural language processing was frequently used in analyzing clinical serious illness conversations extracted from audio recordings. CONCLUSIONS We found 82 papers on palliative care using natural language processing methods for a wide-range of topics and sources of data that could expand the use of this methodology. We encourage researchers to consider incorporating this cutting-edge research methodology in future studies to improve published palliative care data.
Collapse
Affiliation(s)
- Max Sarmet
- Tertiary Referral Center of Neuromuscular Diseases, Hospital de Apoio de Brasília, Brazil.,Graduate Department of Health Science and Technology, University of Brasília, Brazil
| | - Aamna Kabani
- Johns Hopkins University, School of Medicine, USA
| | - Luis Coelho
- Center of Innovation in Engineering and Industrial Technology, Polytechnic of Porto - School of Engineering (ISEP), Portugal
| | - Sara Seabra Dos Reis
- Center of Innovation in Engineering and Industrial Technology, Polytechnic of Porto - School of Engineering (ISEP), Portugal
| | - Jorge L Zeredo
- Graduate Department of Health Science and Technology, University of Brasília, Brazil
| | - Ambereen K Mehta
- Palliative Care Program, Division of General Internal Medicine, Johns Hopkins Bayview Medical Center, Johns Hopkins University, School of Medicine, USA
| |
Collapse
|
5
|
Salih A, Galazzo IB, Cruciani F, Brusini L, Radeva P. Investigating Explainable Artificial Intelligence for MRI-based Classification of Dementia: a New Stability Criterion for Explainable Methods. 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) 2022:4003-4007. [DOI: 10.1109/icip46576.2022.9897253] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/02/2025]
Affiliation(s)
- Ahmed Salih
- University of Verona,Dept. Computer Science,Italy
| | | | | | | | - Petia Radeva
- Universitat de Barcelona, Spain and Computer Vision Center, Cerdanyola del Vallés,Dept. de Matemàtiques i Informàtica,Barcelona,Spain
| |
Collapse
|
6
|
Establishment of ICU Mortality Risk Prediction Models with Machine Learning Algorithm Using MIMIC-IV Database. Diagnostics (Basel) 2022; 12:diagnostics12051068. [PMID: 35626224 PMCID: PMC9139972 DOI: 10.3390/diagnostics12051068] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2022] [Revised: 04/21/2022] [Accepted: 04/22/2022] [Indexed: 12/10/2022] Open
Abstract
Objective: The mortality rate of critically ill patients in ICUs is relatively high. In order to evaluate patients’ mortality risk, different scoring systems are used to help clinicians assess prognosis in ICUs, such as the Acute Physiology and Chronic Health Evaluation III (APACHE III) and the Logistic Organ Dysfunction Score (LODS). In this research, we aimed to establish and compare multiple machine learning models with physiology subscores of APACHE III—namely, the Acute Physiology Score III (APS III)—and LODS scoring systems in order to obtain better performance for ICU mortality prediction. Methods: A total number of 67,748 patients from the Medical Information Database for Intensive Care (MIMIC-IV) were enrolled, including 7055 deceased patients, and the same number of surviving patients were selected by the random downsampling technique, for a total of 14,110 patients included in the study. The enrolled patients were randomly divided into a training dataset (n = 9877) and a validation dataset (n = 4233). Fivefold cross-validation and grid search procedures were used to find and evaluate the best hyperparameters in different machine learning models. Taking the subscores of LODS and the physiology subscores that are part of the APACHE III scoring systems as input variables, four machine learning methods of XGBoost, logistic regression, support vector machine, and decision tree were used to establish ICU mortality prediction models, with AUCs as metrics. AUCs, specificity, sensitivity, positive predictive value, negative predictive value, and calibration curves were used to find the best model. Results: For the prediction of mortality risk in ICU patients, the AUC of the XGBoost model was 0.918 (95%CI, 0.915–0.922), and the AUCs of logistic regression, SVM, and decision tree were 0.872 (95%CI, 0.867–0.877), 0.872 (95%CI, 0.867–0.877), and 0.852 (95%CI, 0.847–0.857), respectively. The calibration curves of logistic regression and support vector machine performed better than the other two models in the ranges 0–40% and 70%–100%, respectively, while XGBoost performed better in the range of 40–70%. Conclusions: The mortality risk of ICU patients can be better predicted by the characteristics of the Acute Physiology Score III and the Logistic Organ Dysfunction Score with XGBoost in terms of ROC curve, sensitivity, and specificity. The XGBoost model could assist clinicians in judging in-hospital outcome of critically ill patients, especially in patients with a more uncertain survival outcome.
Collapse
|
7
|
Medical AI and human dignity: Contrasting perceptions of human and artificially intelligent (AI) decision making in diagnostic and medical resource allocation contexts. COMPUTERS IN HUMAN BEHAVIOR 2022. [DOI: 10.1016/j.chb.2022.107296] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
|