Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Zhu Y, Zhang J, Wang G, Yao R, Ren C, Chen G, Jin X, Guo J, Liu S, Zheng H, Chen Y, Guo Q, Li L, Du B, Xi X, Li W, Huang H, Li Y, Yu Q. Machine Learning Prediction Models for Mechanically Ventilated Patients: Analyses of the MIMIC-III Database. Front Med (Lausanne) 2021;8:662340. [PMID: 34277655 PMCID: PMC8280779 DOI: 10.3389/fmed.2021.662340] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2021] [Accepted: 06/01/2021] [Indexed: 01/27/2023] Open

For:	Zhu Y, Zhang J, Wang G, Yao R, Ren C, Chen G, Jin X, Guo J, Liu S, Zheng H, Chen Y, Guo Q, Li L, Du B, Xi X, Li W, Huang H, Li Y, Yu Q. Machine Learning Prediction Models for Mechanically Ventilated Patients: Analyses of the MIMIC-III Database. Front Med (Lausanne) 2021;8:662340. [PMID: 34277655 PMCID: PMC8280779 DOI: 10.3389/fmed.2021.662340] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2021] [Accepted: 06/01/2021] [Indexed: 01/27/2023] Open

Number

Cited by Other Article(s)

Chun L, Wang D, He L, Li D, Fu Z, Xue S, Su X, Zhou J. Explainable machine learning model for predicting paratracheal lymph node metastasis in cN0 papillary thyroid cancer. Sci Rep 2024;14:22361. [PMID: 39333646 PMCID: PMC11436978 DOI: 10.1038/s41598-024-73837-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2024] [Accepted: 09/20/2024] [Indexed: 09/29/2024] Open

Li H, Ashrafi N, Kang C, Zhao G, Chen Y, Pishgar M. A machine learning-based prediction of hospital mortality in mechanically ventilated ICU patients. PLoS One 2024;19:e0309383. [PMID: 39231126 PMCID: PMC11373795 DOI: 10.1371/journal.pone.0309383] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2024] [Accepted: 08/10/2024] [Indexed: 09/06/2024] Open

Abstract

BACKGROUND

Mechanical ventilation (MV) is vital for critically ill ICU patients but carries significant mortality risks. This study aims to develop a predictive model to estimate hospital mortality among MV patients, utilizing comprehensive health data to assist ICU physicians with early-stage alerts.

METHODS

We developed a Machine Learning (ML) framework to predict hospital mortality in ICU patients receiving MV. Using the MIMIC-III database, we identified 25,202 eligible patients through ICD-9 codes. We employed backward elimination and the Lasso method, selecting 32 features based on clinical insights and literature. Data preprocessing included eliminating columns with over 90% missing data and using mean imputation for the remaining missing values. To address class imbalance, we used the Synthetic Minority Over-sampling Technique (SMOTE). We evaluated several ML models, including CatBoost, XGBoost, Decision Tree, Random Forest, Support Vector Machine (SVM), K-Nearest Neighbors (KNN), and Logistic Regression, using a 70/30 train-test split. The CatBoost model was chosen for its superior performance in terms of accuracy, precision, recall, F1-score, AUROC metrics, and calibration plots.

RESULTS

The study involved a cohort of 25,202 patients on MV. The CatBoost model attained an AUROC of 0.862, an increase from an initial AUROC of 0.821, which was the best reported in the literature. It also demonstrated an accuracy of 0.789, an F1-score of 0.747, and better calibration, outperforming other models. These improvements are due to systematic feature selection and the robust gradient boosting architecture of CatBoost.

CONCLUSION

The preprocessing methodology significantly reduced the number of relevant features, simplifying computational processes, and identified critical features previously overlooked. Integrating these features and tuning the parameters, our model demonstrated strong generalization to unseen data. This highlights the potential of ML as a crucial tool in ICUs, enhancing resource allocation and providing more personalized interventions for MV patients.

Collapse

Haghayegh F, Norouziazad A, Haghani E, Feygin AA, Rahimi RH, Ghavamabadi HA, Sadighbayan D, Madhoun F, Papagelis M, Felfeli T, Salahandish R. Revolutionary Point-of-Care Wearable Diagnostics for Early Disease Detection and Biomarker Discovery through Intelligent Technologies. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2024;11:e2400595. [PMID: 38958517 PMCID: PMC11423253 DOI: 10.1002/advs.202400595] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/16/2024] [Revised: 06/19/2024] [Indexed: 07/04/2024]

Affiliation(s)

Fatemeh Haghayegh Laboratory of Advanced Biotechnologies for Health Assessments (Lab‐HA)Biomedical Engineering ProgramLassonde School of EngineeringYork UniversityTorontoM3J 1P3Canada Department of Electrical Engineering and Computer Science (EECS)Lassonde School of EngineeringYork UniversityTorontoONM3J 1P3Canada
Alireza Norouziazad Laboratory of Advanced Biotechnologies for Health Assessments (Lab‐HA)Biomedical Engineering ProgramLassonde School of EngineeringYork UniversityTorontoM3J 1P3Canada Department of Electrical Engineering and Computer Science (EECS)Lassonde School of EngineeringYork UniversityTorontoONM3J 1P3Canada
Elnaz Haghani Laboratory of Advanced Biotechnologies for Health Assessments (Lab‐HA)Biomedical Engineering ProgramLassonde School of EngineeringYork UniversityTorontoM3J 1P3Canada Department of Electrical Engineering and Computer Science (EECS)Lassonde School of EngineeringYork UniversityTorontoONM3J 1P3Canada
Ariel Avraham Feygin Laboratory of Advanced Biotechnologies for Health Assessments (Lab‐HA)Biomedical Engineering ProgramLassonde School of EngineeringYork UniversityTorontoM3J 1P3Canada Department of Electrical Engineering and Computer Science (EECS)Lassonde School of EngineeringYork UniversityTorontoONM3J 1P3Canada
Reza Hamed Rahimi Laboratory of Advanced Biotechnologies for Health Assessments (Lab‐HA)Biomedical Engineering ProgramLassonde School of EngineeringYork UniversityTorontoM3J 1P3Canada Department of Electrical Engineering and Computer Science (EECS)Lassonde School of EngineeringYork UniversityTorontoONM3J 1P3Canada
Hamidreza Akbari Ghavamabadi Laboratory of Advanced Biotechnologies for Health Assessments (Lab‐HA)Biomedical Engineering ProgramLassonde School of EngineeringYork UniversityTorontoM3J 1P3Canada Department of Electrical Engineering and Computer Science (EECS)Lassonde School of EngineeringYork UniversityTorontoONM3J 1P3Canada
Deniz Sadighbayan Department of BiologyFaculty of ScienceYork UniversityTorontoONM3J 1P3Canada
Faress Madhoun Laboratory of Advanced Biotechnologies for Health Assessments (Lab‐HA)Biomedical Engineering ProgramLassonde School of EngineeringYork UniversityTorontoM3J 1P3Canada Department of Electrical Engineering and Computer Science (EECS)Lassonde School of EngineeringYork UniversityTorontoONM3J 1P3Canada
Manos Papagelis Department of Electrical Engineering and Computer Science (EECS)Lassonde School of EngineeringYork UniversityTorontoONM3J 1P3Canada
Tina Felfeli Department of Ophthalmology and Vision SciencesUniversity of TorontoOntarioM5T 3A9Canada Institute of Health PolicyManagement and EvaluationUniversity of TorontoOntarioM5T 3M6Canada
Razieh Salahandish Laboratory of Advanced Biotechnologies for Health Assessments (Lab‐HA)Biomedical Engineering ProgramLassonde School of EngineeringYork UniversityTorontoM3J 1P3Canada Department of Electrical Engineering and Computer Science (EECS)Lassonde School of EngineeringYork UniversityTorontoONM3J 1P3Canada

Collapse

Naderian S, Nikniaz Z, Farhangi MA, Nikniaz L, Sama-Soltani T, Rostami P. Predicting dyslipidemia incidence: unleashing machine learning algorithms on Lifestyle Promotion Project data. BMC Public Health 2024;24:1777. [PMID: 38961394 PMCID: PMC11223414 DOI: 10.1186/s12889-024-19261-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Accepted: 06/25/2024] [Indexed: 07/05/2024] Open

Mao B, Ling L, Pan Y, Zhang R, Zheng W, Shen Y, Lu W, Lu Y, Xu S, Wu J, Wang M, Wan S. Machine learning for the prediction of in-hospital mortality in patients with spontaneous intracerebral hemorrhage in intensive care unit. Sci Rep 2024;14:14195. [PMID: 38902304 PMCID: PMC11190185 DOI: 10.1038/s41598-024-65128-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2023] [Accepted: 06/17/2024] [Indexed: 06/22/2024] Open

Nguyen QT, Tran MP, Prabhakaran V, Liu A, Nguyen GH. Compact machine learning model for the accurate prediction of first 24-hour survival of mechanically ventilated patients. Front Med (Lausanne) 2024;11:1398565. [PMID: 38966539 PMCID: PMC11222318 DOI: 10.3389/fmed.2024.1398565] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2024] [Accepted: 06/10/2024] [Indexed: 07/06/2024] Open

Abstract

Background

The field of machine learning has been evolving and applied in medical applications. We utilised a public dataset, MIMIC-III, to develop compact models that can accurately predict the outcome of mechanically ventilated patients in the first 24 h of first-time hospital admission.

Methods

67 predictive features, grouped into 6 categories, were selected for the classification and prediction task. 4 tree-based algorithms (Decision Tree, Bagging, eXtreme Gradient Boosting and Random Forest), and 5 non-tree-based algorithms (Logistic Regression, K-Nearest Neighbour, Linear Discriminant Analysis, Support Vector Machine and Naïve Bayes), were employed to predict the outcome of 18,883 mechanically ventilated patients. 5 scenarios were crafted to mirror the target population as per existing literature. S1.1 reflected an imbalanced situation, with significantly fewer mortality cases than survival ones, and both the training and test sets played similar target class distributions. S1.2 and S2.2 featured balanced classes; however, instances from the majority class were removed from the test set and/or the training set. S1.3 and S 2.3 generated additional instances of the minority class via the Synthetic Minority Over-sampling Technique. Standard evaluation metrics were used to determine the best-performing models for each scenario. With the best performers, Autofeat, an automated feature engineering library, was used to eliminate less important features per scenario.

Results

Tree-based models generally outperformed the non-tree-based ones. Moreover, XGB consistently yielded the highest AUC score (between 0.91 and 0.97), while exhibiting relatively high Sensitivity (between 0.58 and 0.88) on 4 scenarios (1.2, 2.2, 1.3, and 2.3). After reducing a significant number of predictors, the selected calibrated ML models were still able to achieve similar AUC and MCC scores across those scenarios. The calibration curves of the XGB and BG models, both prior to and post dimension reduction in Scenario 2.2, showed better alignment to the perfect calibration line than curves produced from other algorithms.

Conclusion

This study demonstrated that dimension-reduced models can perform well and are able to retain the important features for the classification tasks. Deploying a compact machine learning model into production helps reduce costs in terms of computational resources and monitoring changes in input data over time.

Collapse

Lin W, Huang Y, Zhu J, Sun H, Su N, Pan J, Xu J, Chen L. Machine learning improves early prediction of organ failure in hyperlipidemia acute pancreatitis using clinical and abdominal CT features. Pancreatology 2024;24:350-356. [PMID: 38342660 DOI: 10.1016/j.pan.2024.02.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/30/2023] [Revised: 01/24/2024] [Accepted: 02/05/2024] [Indexed: 02/13/2024]

Lai K, Wang X, Cao C. A Continuous Non-Invasive Blood Pressure Prediction Method Based on Deep Sparse Residual U-Net Combined with Improved Squeeze and Excitation Skip Connections. SENSORS (BASEL, SWITZERLAND) 2024;24:2721. [PMID: 38732827 PMCID: PMC11086107 DOI: 10.3390/s24092721] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/10/2024] [Revised: 04/09/2024] [Accepted: 04/19/2024] [Indexed: 05/13/2024]

Abstract

Arterial blood pressure (ABP) serves as a pivotal clinical metric in cardiovascular health assessments, with the precise forecasting of continuous blood pressure assuming a critical role in both preventing and treating cardiovascular diseases. This study proposes a novel continuous non-invasive blood pressure prediction model, DSRUnet, based on deep sparse residual U-net combined with improved SE skip connections, which aim to enhance the accuracy of using photoplethysmography (PPG) signals for continuous blood pressure prediction. The model first introduces a sparse residual connection approach for path contraction and expansion, facilitating richer information fusion and feature expansion to better capture subtle variations in the original PPG signals, thereby enhancing the network's representational capacity and predictive performance and mitigating potential degradation in the network performance. Furthermore, an enhanced SE-GRU module was embedded in the skip connections to model and weight global information using an attention mechanism, capturing the temporal features of the PPG pulse signals through GRU layers to improve the quality of the transferred feature information and reduce redundant feature learning. Finally, a deep supervision mechanism was incorporated into the decoder module to guide the lower-level network to learn effective feature representations, alleviating the problem of gradient vanishing and facilitating effective training of the network. The proposed DSRUnet model was trained and tested on the publicly available UCI-BP dataset, with the average absolute errors for predicting systolic blood pressure (SBP), diastolic blood pressure (DBP), and mean blood pressure (MBP) being 3.36 ± 6.61 mmHg, 2.35 ± 4.54 mmHg, and 2.21 ± 4.36 mmHg, respectively, meeting the standards set by the Association for the Advancement of Medical Instrumentation (AAMI), and achieving Grade A according to the British Hypertension Society (BHS) Standard for SBP and DBP predictions. Through ablation experiments and comparisons with other state-of-the-art methods, the effectiveness of DSRUnet in blood pressure prediction tasks, particularly for SBP, which generally yields poor prediction results, was significantly higher. The experimental results demonstrate that the DSRUnet model can accurately utilize PPG signals for real-time continuous blood pressure prediction and obtain high-quality and high-precision blood pressure prediction waveforms. Due to its non-invasiveness, continuity, and clinical relevance, the model may have significant implications for clinical applications in hospitals and research on wearable devices in daily life.

Collapse

Nie D, Zhan Y, Xu K, Zou H, Li K, Chen L, Chen Q, Zheng W, Peng X, Yu M, Zhang S. Artificial intelligence differentiates abdominal Henoch-Schönlein purpura from acute appendicitis in children. Int J Rheum Dis 2023;26:2534-2542. [PMID: 37905746 DOI: 10.1111/1756-185x.14956] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2023] [Revised: 10/08/2023] [Accepted: 10/18/2023] [Indexed: 11/02/2023]

Abstract

OBJECTIVE

This study aims to construct an artificial intelligence (AI) model capable of effectively discriminating between abdominal Henoch-Schönlein purpura (AHSP) and acute appendicitis (AA) in pediatric patients.

METHODS

A total of 6965 participants, comprising 2201 individuals with AHSP and 4764 patients with AA, were enrolled in the study. Additionally, 53 laboratory indicators were taken into consideration. Five distinct artificial intelligence (AI) models were developed employing machine learning algorithms, namely XGBoost, AdaBoost, Gaussian Naïve Bayes (GNB), MLPClassifier (MLP), and support vector machine (SVM). The performance of these prediction models was assessed through receiver operating characteristic (ROC) curve analysis, calibration curve assessment, and decision curve analysis (DCA).

RESULTS

We identified 32 discriminative indicators (p < .05) between AHSP and AA. Five indicators, namely the lymphocyte ratio (LYMPH ratio), eosinophil ratio (EO ratio), eosinophil count (EO count), neutrophil ratio (NEUT ratio), and C-reactive protein (CRP), exhibited strong performance in distinguishing AHSP from AA (AUC ≥ 0.80). Among the various prediction models, the XGBoost model displayed superior performance evidenced by the highest AUC (XGBoost = 0.895, other models < 0.89), accuracy (XGBoost = 0.824, other models < 0.81), and Kappa value (XGBoost = 0.621, other models < 0.60) in the validation set. After optimization, the XGBoost model demonstrated remarkable diagnostic performance for AHSP and AA (AUC > 0.95). Both the calibration curve and decision curve analysis suggested the promising clinical utility and net benefits of the XGBoost model.

CONCLUSION

The AI-based machine learning model exhibits high prediction accuracy and can differentiate AHSP and AA from a data-driven perspective.

Collapse

King AJ, Tang L, Davis BS, Preum SM, Bukowski LA, Zimmerman J, Kahn JM. Machine learning-based prediction of low-value care for hospitalized patients. INTELLIGENCE-BASED MEDICINE 2023;8:100115. [PMID: 38130744 PMCID: PMC10735238 DOI: 10.1016/j.ibmed.2023.100115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/23/2023]

Abstract

Objective

Low-value care (i.e., costly health care treatments that provide little or no benefit) is an ongoing problem in United States hospitals. Traditional strategies for reducing low-value care are only moderately successful. Informed by behavioral science principles, we sought to use machine learning to inform a targeted prompting system that suggests preferred alternative treatments at the point of care but before clinicians have made a decision.

Methods

We used intravenous administration of albumin for fluid resuscitation in intensive care unit (ICU) patients as an exemplar of low-value care practice, identified using the electronic health record of a multi-hospital health system. We divided all ICU episodes into 4-h periods and defined a set of relevant clinical features at the period level. We then developed two machine learning models: a single-stage model that directly predicts if a patient will receive albumin in the next period; and a two-stage model that first predicts if any resuscitation fluid will be administered and then predicts albumin only among the patients with a high probability of fluid use.

Results

We examined 87,489 ICU episodes divided into approximately 1.5 million 4-h periods. The area under the receiver operating characteristic curve was 0.86 for both prediction models. The positive predictive value was 0.21 (95% confidence interval: 0.20, 0.23) for the single-stage model and 0.22 (0.20, 0.23) for the two-stage model. Applying either model in a targeted prompting system could prevent 10% of albumin administrations, with an attending physician receiving one prompt every 4.2 days of ICU service.

Conclusion

Prediction of low-value care is feasible and could enable a point-of-care, targeted prompting system that offers suggestions ahead of the moment of need before clinicians have already decided. A two-stage approach does not improve performance but does interject new levers for the calibration of such a system.

Collapse

Yang S, Cao L, Zhou Y, Hu C. A Retrospective Cohort Study: Predicting 90-Day Mortality for ICU Trauma Patients with a Machine Learning Algorithm Using XGBoost Using MIMIC-III Database. J Multidiscip Healthc 2023;16:2625-2640. [PMID: 37701177 PMCID: PMC10493110 DOI: 10.2147/jmdh.s416943] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Accepted: 08/29/2023] [Indexed: 09/14/2023] Open

Wang X, Xia J, Shan Y, Yang Y, Li Y, Sun H. Predictive value of the Oxford Acute Severity of Illness Score in acute stroke patients with stroke-associated pneumonia. Front Neurol 2023;14:1251944. [PMID: 37731859 PMCID: PMC10507346 DOI: 10.3389/fneur.2023.1251944] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2023] [Accepted: 08/14/2023] [Indexed: 09/22/2023] Open

Abstract

Background

Stroke-associated pneumonia (SAP) is associated with a poor prognosis and a high mortality rate in stroke patients. However, the accuracy of early prediction of SAP is insufficient, and there is a lack of effective prognostic evaluation methods. Therefore, in this study, we investigated the predictive value of the Oxford Acute Severity of Illness Score (OASIS) in SAP to provide a potential reference index for the incidence and prognosis of SAP.

Methods

We recruited a total of 280 patients with acute ischemic stroke who had been diagnosed and treated in the Zhumadian Central Hospital between January 2021 and January 2023. These patients were divided into an SAP group (86 cases) and a non-SAP group (194 cases) according to SAP diagnostic criteria by expert consensus on the diagnosis and treatment of SAP. We collated general and clinical data from all patients, including the survival of SAP patients during the follow-up period. Multivariate logistic regression was used to analyze the risk factors for SAP. Kaplan-Meier and multivariate COX regression analyses were used to investigate the relationship between OASIS and the prognosis of SAP, and a receiver operating characteristic (ROC) curve was drawn to analyze the predictive value of OASIS for SAP.

Results

Our analyses identified body temperature, C-reactive protein, procalcitonin, OASIS, and a prolonged length of intensive care unit (ICU) stay as the main risk factors for SAP (all Ps < 0.05). Advanced age and an elevated OASIS were identified as the main risk factors for death in SAP patients (all Ps < 0.05). The risk of death in patients with OASIS of 31-42 points was significantly higher than that in patients with OASIS of 12-20 points (HR = 5.588, 95% CI = 1.531-20.401, P = 0.009). ROC curve analysis further showed that OASIS had a high predictive value for morbidity and the incidence of death in SAP patients.

Conclusion

OASIS can effectively predict the onset and death of SAP patients and provides a potential reference index for early diagnosis and the prediction of prognosis in patients with SAP. Our findings should be considered in clinical practice.

Collapse

Xing L, Zhang X, Guo Y, Bai D, Xu H. XGBoost-aided prediction of lip prominence based on hard-tissue measurements and demographic characteristics in an Asian population. Am J Orthod Dentofacial Orthop 2023;164:357-367. [PMID: 36959014 DOI: 10.1016/j.ajodo.2023.01.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Revised: 01/01/2023] [Accepted: 01/01/2023] [Indexed: 03/25/2023]

Deng Y, Ma Y, Fu J, Wang X, Yu C, Lv J, Man S, Wang B, Li L. A dynamic machine learning model for prediction of NAFLD in a health checkup population: A longitudinal study. Heliyon 2023;9:e18758. [PMID: 37576311 PMCID: PMC10412833 DOI: 10.1016/j.heliyon.2023.e18758] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Revised: 07/25/2023] [Accepted: 07/26/2023] [Indexed: 08/15/2023] Open

Abstract

Background

Non-alcoholic fatty liver disease (NAFLD) is one of the most common liver diseases worldwide. Currently, most NAFLD prediction models are diagnostic models based on cross-sectional data, which failed to provide early identification or clarify causal relationships. We aimed to use time-series deep learning models with longitudinal health checkup records to predict the onset of NAFLD in the future, and update the model stepwise by incorporating new checkup records to achieve dynamic prediction.

Methods

10,493 participants with over 6 health checkup records from Beijing MJ Health Screening Center were included to conduct a retrospective cohort study, in which the constantly updated initial 5 checkup data were incorporated stepwise to predict the risk of NAFLD at and after their sixth health checkups. A total of 33 variables were considered, consisting of demographic characteristics, medical history, lifestyle, physical examinations, and laboratory tests. L1-penalized logistic regression (LR) was used for feature selection. The long short-term memory (LSTM) algorithm was introduced for model development, and five-fold cross-validation was conducted to tune and choose optimal hyperparameters. Both internal validation and external validation were conducted, using the 20% randomly divided holdout test dataset and previously unseen data from Shanghai MJ Health Screening Center, respectively, to evaluate model performance. The evaluation metrics included area under the receiver operating characteristic curve (AUROC), sensitivity, specificity, Brier score, and decision curve. Bootstrap sampling was implemented to generate 95% confidence intervals of all the metrics. Finally, the Shapley additive explanations (SHAP) algorithm was applied in the holdout test dataset for model interpretability to obtain time-specific and sample-specific contributions of each feature.

Results

Among the 10,493 participants, 1662 (15.84%) were diagnosed with NAFLD at and after their sixth health checkups. The predictive performance of the deep learning model in the internal validation dataset improved over the incorporation of the checkups, with AUROC increasing from 0.729 (95% CI: 0.698,0.760) at baseline to 0.818 (95% CI: 0.798,0.844) when consecutive 5 checkups were included. The external validation dataset, containing 1728 participants, was used to verify the results, in which AUROC increased from 0.700 (95% CI: 0.657,0.740) with only the first checkups to 0.792 (95% CI: 0.758,0.825) with all five. The results of feature significance showed that body fat percentage, alanine transaminase (ALT), and uric acid owned the greatest impact on the outcome, time-specific, individual-specific and dynamic feature contributions were also produced for model interpretability.

Conclusion

A dynamic prediction model was successfully established in our study, and the prediction capability kept improving with the renewal of the latest checkup records. In addition, we identified key features associated with the onset of NAFLD, making it possible to optimize the prevention and control strategies of the disease in the general population.

Collapse

Affiliation(s)

Yuhan Deng Chongqing Research Institute of Big Data, Peking University, Chongqing, China Meinian Institute of Health, Beijing, China
Yuan Ma School of Population Medicine and Public Health, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
Jingzhu Fu Department of Epidemiology and Biostatistics, School of Public Health, Peking University, Beijing, China Peking University Health Science Center Meinian Public Health Institute, Beijing, China Key Laboratory of Epidemiology of Major Diseases (Peking University), Ministry of Education, Beijing, China
Xiaona Wang MJ Health Screening Center, Beijing, China
Canqing Yu Department of Epidemiology and Biostatistics, School of Public Health, Peking University, Beijing, China Peking University Health Science Center Meinian Public Health Institute, Beijing, China Key Laboratory of Epidemiology of Major Diseases (Peking University), Ministry of Education, Beijing, China Peking University Center for Public Health and Epidemic Preparedness & Response, Beijing, China
Jun Lv Department of Epidemiology and Biostatistics, School of Public Health, Peking University, Beijing, China Peking University Health Science Center Meinian Public Health Institute, Beijing, China Key Laboratory of Epidemiology of Major Diseases (Peking University), Ministry of Education, Beijing, China Peking University Center for Public Health and Epidemic Preparedness & Response, Beijing, China
Sailimai Man Meinian Institute of Health, Beijing, China Department of Epidemiology and Biostatistics, School of Public Health, Peking University, Beijing, China Peking University Health Science Center Meinian Public Health Institute, Beijing, China Key Laboratory of Epidemiology of Major Diseases (Peking University), Ministry of Education, Beijing, China
Bo Wang Meinian Institute of Health, Beijing, China Peking University Health Science Center Meinian Public Health Institute, Beijing, China Peking University Center for Public Health and Epidemic Preparedness & Response, Beijing, China
Liming Li Department of Epidemiology and Biostatistics, School of Public Health, Peking University, Beijing, China Peking University Health Science Center Meinian Public Health Institute, Beijing, China Key Laboratory of Epidemiology of Major Diseases (Peking University), Ministry of Education, Beijing, China Peking University Center for Public Health and Epidemic Preparedness & Response, Beijing, China

Collapse

Qiu X, Tan X, Wang C, Chen S, Du B, Huang J. A long short-temory relation network for real-time prediction of patient-specific ventilator parameters. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2023;20:14756-14776. [PMID: 37679157 DOI: 10.3934/mbe.2023660] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/09/2023]

Fan Z, Jiang J, Xiao C, Chen Y, Xia Q, Wang J, Fang M, Wu Z, Chen F. Construction and validation of prognostic models in critically Ill patients with sepsis-associated acute kidney injury: interpretable machine learning approach. J Transl Med 2023;21:406. [PMID: 37349774 PMCID: PMC10286378 DOI: 10.1186/s12967-023-04205-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2023] [Accepted: 05/15/2023] [Indexed: 06/24/2023] Open

Abstract

BACKGROUND

Acute kidney injury (AKI) is a common complication in critically ill patients with sepsis and is often associated with a poor prognosis. We aimed to construct and validate an interpretable prognostic prediction model for patients with sepsis-associated AKI (S-AKI) using machine learning (ML) methods.

METHODS

Data on the training cohort were collected from the Medical Information Mart for Intensive Care IV database version 2.2 to build the model, and data of patients were extracted from Hangzhou First People's Hospital Affiliated to Zhejiang University School of Medicine for external validation of model. Predictors of mortality were identified using Recursive Feature Elimination (RFE). Then, random forest, extreme gradient boosting (XGBoost), multilayer perceptron classifier, support vector classifier, and logistic regression were used to establish a prognosis prediction model for 7, 14, and 28 days after intensive care unit (ICU) admission, respectively. Prediction performance was assessed using the receiver operating characteristic (ROC) curve and decision curve analysis (DCA). SHapley Additive exPlanations (SHAP) were used to interpret the ML models.

RESULTS

In total, 2599 patients with S-AKI were included in the analysis. Forty variables were selected for the model development. According to the areas under the ROC curve (AUC) and DCA results for the training cohort, XGBoost model exhibited excellent performance with F1 Score of 0.847, 0.715, 0.765 and AUC (95% CI) of 0.91 (0.90, 0.92), 0.78 (0.76, 0.80), and 0.83 (0.81, 0.85) in 7 days, 14 days and 28 days group, respectively. It also demonstrated excellent discrimination in the external validation cohort. Its AUC (95% CI) was 0.81 (0.79, 0.83), 0.75 (0.73, 0.77), 0.79 (0.77, 0.81) in 7 days, 14 days and 28 days group, respectively. SHAP-based summary plot and force plot were used to interpret the XGBoost model globally and locally.

CONCLUSIONS

ML is a reliable tool for predicting the prognosis of patients with S-AKI. SHAP methods were used to explain intrinsic information of the XGBoost model, which may prove clinically useful and help clinicians tailor precise management.

Collapse

Li X, Wu R, Zhao W, Shi R, Zhu Y, Wang Z, Pan H, Wang D. Machine learning algorithm to predict mortality in critically ill patients with sepsis-associated acute kidney injury. Sci Rep 2023;13:5223. [PMID: 36997585 PMCID: PMC10063657 DOI: 10.1038/s41598-023-32160-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2022] [Accepted: 03/23/2023] [Indexed: 04/01/2023] Open

Affiliation(s)

Xunliang Li Department of Nephrology, The Second Affiliated Hospital of Anhui Medical University, Anhui Medical University, Hefei, People's Republic of China Institute of Kidney Disease, Inflammation and Immunity Mediated Diseases, The Second Affiliated Hospital of Anhui Medical University, Anhui Medical University, Hefei, People's Republic of China
Ruijuan Wu Department of Nephrology, The Second Affiliated Hospital of Anhui Medical University, Anhui Medical University, Hefei, People's Republic of China Institute of Kidney Disease, Inflammation and Immunity Mediated Diseases, The Second Affiliated Hospital of Anhui Medical University, Anhui Medical University, Hefei, People's Republic of China
Wenman Zhao Department of Nephrology, The Second Affiliated Hospital of Anhui Medical University, Anhui Medical University, Hefei, People's Republic of China Institute of Kidney Disease, Inflammation and Immunity Mediated Diseases, The Second Affiliated Hospital of Anhui Medical University, Anhui Medical University, Hefei, People's Republic of China
Rui Shi Department of Nephrology, The Second Affiliated Hospital of Anhui Medical University, Anhui Medical University, Hefei, People's Republic of China Institute of Kidney Disease, Inflammation and Immunity Mediated Diseases, The Second Affiliated Hospital of Anhui Medical University, Anhui Medical University, Hefei, People's Republic of China
Yuyu Zhu Department of Nephrology, The Second Affiliated Hospital of Anhui Medical University, Anhui Medical University, Hefei, People's Republic of China Institute of Kidney Disease, Inflammation and Immunity Mediated Diseases, The Second Affiliated Hospital of Anhui Medical University, Anhui Medical University, Hefei, People's Republic of China
Zhijuan Wang Department of Nephrology, The Second Affiliated Hospital of Anhui Medical University, Anhui Medical University, Hefei, People's Republic of China Institute of Kidney Disease, Inflammation and Immunity Mediated Diseases, The Second Affiliated Hospital of Anhui Medical University, Anhui Medical University, Hefei, People's Republic of China
Haifeng Pan Institute of Kidney Disease, Inflammation and Immunity Mediated Diseases, The Second Affiliated Hospital of Anhui Medical University, Anhui Medical University, Hefei, People's Republic of China. Department of Epidemiology and Biostatistics, School of Public Health, Anhui Medical University, Hefei, People's Republic of China. Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei, People's Republic of China.
Deguang Wang Department of Nephrology, The Second Affiliated Hospital of Anhui Medical University, Anhui Medical University, Hefei, People's Republic of China. Institute of Kidney Disease, Inflammation and Immunity Mediated Diseases, The Second Affiliated Hospital of Anhui Medical University, Anhui Medical University, Hefei, People's Republic of China.

Collapse

Chen X, Li J, Liu G, Chen X, Huang S, Li H, Liu S, Li D, Yang H, Zheng H, Hu L, Kong L, Liu H, Bellou A, Lei L, Liang H. Identification of Distinct Clinical Phenotypes of Heterogeneous Mechanically Ventilated ICU Patients Using Cluster Analysis. J Clin Med 2023;12:jcm12041499. [PMID: 36836034 PMCID: PMC9962046 DOI: 10.3390/jcm12041499] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Revised: 02/01/2023] [Accepted: 02/09/2023] [Indexed: 02/16/2023] Open

Affiliation(s)

Xuanhui Chen Medical Big Data Center, Guangdong Provincial People’s Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou 510080, China
Jiaxin Li Department of Intensive Care Unit of Cardiac Surgery, Guangdong Cardiovascular Institute, Guangdong Provincial People’s Hospital (Guangdong Academy of Medical Sciences), Guangzhou 510080, China
Guangjian Liu Shenzhen Dymind Biotechnology Co., Ltd., Shenzhen 518000, China
Xiujuan Chen Medical Big Data Center, Guangdong Provincial People’s Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou 510080, China
Shuai Huang Medical Big Data Center, Guangdong Provincial People’s Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou 510080, China
Huixian Li Medical Big Data Center, Guangdong Provincial People’s Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou 510080, China
Siyi Liu Department of Intensive Care Unit of Cardiac Surgery, Guangdong Cardiovascular Institute, Guangdong Provincial People’s Hospital (Guangdong Academy of Medical Sciences), Guangzhou 510080, China
Dantong Li Medical Big Data Center, Guangdong Provincial People’s Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou 510080, China
Huan Yang Medical Big Data Center, Guangdong Provincial People’s Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou 510080, China
Haiqing Zheng Medical Big Data Center, Guangdong Provincial People’s Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou 510080, China
Lianting Hu Medical Big Data Center, Guangdong Provincial People’s Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou 510080, China
Lingcong Kong Medical Big Data Center, Guangdong Provincial People’s Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou 510080, China
Huazhang Liu Medical Big Data Center, Guangdong Provincial People’s Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou 510080, China
Abdelouahab Bellou Institute of Sciences in Emergency Medicine, Guangdong Provincial People’s Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou 510080, China Department of Emergency Medicine, Wayne State University School of Medicine, Detroit, MI 48201, USA Correspondence: (A.B.); (L.L.); (H.L.)
Liming Lei Department of Intensive Care Unit of Cardiac Surgery, Guangdong Cardiovascular Institute, Guangdong Provincial People’s Hospital (Guangdong Academy of Medical Sciences), Guangzhou 510080, China Correspondence: (A.B.); (L.L.); (H.L.)
Huiying Liang Medical Big Data Center, Guangdong Provincial People’s Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou 510080, China Correspondence: (A.B.); (L.L.); (H.L.)

Collapse

Zhou M, Yao T, Li J, Hui H, Fan W, Guan Y, Zhang A, Xu B. Preliminary prediction of semen quality based on modifiable lifestyle factors by using the XGBoost algorithm. Front Med (Lausanne) 2022;9:811890. [PMID: 36177329 PMCID: PMC9514383 DOI: 10.3389/fmed.2022.811890] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2021] [Accepted: 08/01/2022] [Indexed: 11/17/2022] Open

Abstract

Introduction

Semen quality has decreased gradually in recent years, and lifestyle changes are among the primary causes for this issue. Thus far, the specific lifestyle factors affecting semen quality remain to be elucidated.

Materials and methods

In this study, data on the following factors were collected from 5,109 men examined at our reproductive medicine center: 10 lifestyle factors that potentially affect semen quality (smoking status, alcohol consumption, staying up late, sleeplessness, consumption of pungent food, intensity of sports activity, sedentary lifestyle, working in hot conditions, sauna use in the last 3 months, and exposure to radioactivity); general factors including age, abstinence period, and season of semen examination; and comprehensive semen parameters [semen volume, sperm concentration, progressive and total sperm motility, sperm morphology, and DNA fragmentation index (DFI)]. Then, machine learning with the XGBoost algorithm was applied to establish a primary prediction model by using the collected data. Furthermore, the accuracy of the model was verified via multiple logistic regression following k-fold cross-validation analyses.

Results

The results indicated that for semen volume, sperm concentration, progressive and total sperm motility, and DFI, the area under the curve (AUC) values ranged from 0.648 to 0.697, while the AUC for sperm morphology was only 0.506. Among the 13 factors, smoking status was the major factor affecting semen volume, sperm concentration, and progressive and total sperm motility. Age was the most important factor affecting DFI. Logistic combined with cross-validation analysis revealed similar results. Furthermore, it showed that heavy smoking (>20 cigarettes/day) had an overall negative effect on semen volume and sperm concentration and progressive and total sperm motility (OR = 4.69, 6.97, 11.16, and 10.35, respectively), while age of >35 years was associated with increased DFI (OR = 5.47).

Conclusion

The preliminary lifestyle-based model developed for semen quality prediction by using the XGBoost algorithm showed potential for clinical application and further optimization with larger training datasets.

Collapse

Chen S, Qiu X, Tan X, Fang Z, Jin Y. A model-based hybrid soft actor-critic deep reinforcement learning algorithm for optimal ventilator settings. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2022.08.028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

Yue S, Li S, Huang X, Liu J, Hou X, Zhao Y, Niu D, Wang Y, Tan W, Wu J. Machine learning for the prediction of acute kidney injury in patients with sepsis. J Transl Med 2022;20:215. [PMID: 35562803 PMCID: PMC9101823 DOI: 10.1186/s12967-022-03364-0] [Citation(s) in RCA: 59] [Impact Index Per Article: 29.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2021] [Accepted: 03/26/2022] [Indexed: 12/15/2022] Open

Abstract

BACKGROUND

Acute kidney injury (AKI) is the most common and serious complication of sepsis, accompanied by high mortality and disease burden. The early prediction of AKI is critical for timely intervention and ultimately improves prognosis. This study aims to establish and validate predictive models based on novel machine learning (ML) algorithms for AKI in critically ill patients with sepsis.

METHODS

Data of patients with sepsis were extracted from the Medical Information Mart for Intensive Care III (MIMIC- III) database. Feature selection was performed using a Boruta algorithm. ML algorithms such as logistic regression (LR), k-nearest neighbors (KNN), support vector machine (SVM), decision tree, random forest, Extreme Gradient Boosting (XGBoost), and artificial neural network (ANN) were applied for model construction by utilizing tenfold cross-validation. The performances of these models were assessed in terms of discrimination, calibration, and clinical application. Moreover, the discrimination of ML-based models was compared with those of Sequential Organ Failure Assessment (SOFA) and the customized Simplified Acute Physiology Score (SAPS) II model.

RESULTS

A total of 3176 critically ill patients with sepsis were included for analysis, of which 2397 cases (75.5%) developed AKI during hospitalization. A total of 36 variables were selected for model construction. The models of LR, KNN, SVM, decision tree, random forest, ANN, XGBoost, SOFA and SAPS II score were established and obtained area under the receiver operating characteristic curves of 0.7365, 0.6637, 0.7353, 0.7492, 0.7787, 0.7547, 0.821, 0.6457 and 0.7015, respectively. The XGBoost model had the best predictive performance in terms of discrimination, calibration, and clinical application among all models.

CONCLUSION

The ML models can be reliable tools for predicting AKI in septic patients. The XGBoost model has the best predictive performance, which can be used to assist clinicians in identifying high-risk patients and implementing early interventions to reduce mortality.

Collapse

Affiliation(s)

Suru Yue Clinical Research Service Center, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China.,Collaborative Innovation Engineering Technology Research Center of Clinical Medical Big Data Cloud Service in Medical Consortium of West Guangdong Province, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China
Shasha Li Clinical Research Service Center, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China.,Collaborative Innovation Engineering Technology Research Center of Clinical Medical Big Data Cloud Service in Medical Consortium of West Guangdong Province, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China
Xueying Huang Clinical Research Service Center, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China.,Collaborative Innovation Engineering Technology Research Center of Clinical Medical Big Data Cloud Service in Medical Consortium of West Guangdong Province, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China
Jie Liu Clinical Research Service Center, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China.,Collaborative Innovation Engineering Technology Research Center of Clinical Medical Big Data Cloud Service in Medical Consortium of West Guangdong Province, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China
Xuefei Hou Clinical Research Service Center, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China.,Collaborative Innovation Engineering Technology Research Center of Clinical Medical Big Data Cloud Service in Medical Consortium of West Guangdong Province, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China
Yumei Zhao Clinical Research Service Center, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China
Dongdong Niu Clinical Research Service Center, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China
Yufeng Wang Clinical Research Service Center, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China.,Collaborative Innovation Engineering Technology Research Center of Clinical Medical Big Data Cloud Service in Medical Consortium of West Guangdong Province, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China
Wenkai Tan Department of Gastroenterology, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China.
Jiayuan Wu Clinical Research Service Center, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China. .,Collaborative Innovation Engineering Technology Research Center of Clinical Medical Big Data Cloud Service in Medical Consortium of West Guangdong Province, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China.

Collapse

Establishment of ICU Mortality Risk Prediction Models with Machine Learning Algorithm Using MIMIC-IV Database. Diagnostics (Basel) 2022;12:diagnostics12051068. [PMID: 35626224 PMCID: PMC9139972 DOI: 10.3390/diagnostics12051068] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2022] [Revised: 04/21/2022] [Accepted: 04/22/2022] [Indexed: 12/10/2022] Open

Abstract

Objective: The mortality rate of critically ill patients in ICUs is relatively high. In order to evaluate patients’ mortality risk, different scoring systems are used to help clinicians assess prognosis in ICUs, such as the Acute Physiology and Chronic Health Evaluation III (APACHE III) and the Logistic Organ Dysfunction Score (LODS). In this research, we aimed to establish and compare multiple machine learning models with physiology subscores of APACHE III—namely, the Acute Physiology Score III (APS III)—and LODS scoring systems in order to obtain better performance for ICU mortality prediction. Methods: A total number of 67,748 patients from the Medical Information Database for Intensive Care (MIMIC-IV) were enrolled, including 7055 deceased patients, and the same number of surviving patients were selected by the random downsampling technique, for a total of 14,110 patients included in the study. The enrolled patients were randomly divided into a training dataset (n = 9877) and a validation dataset (n = 4233). Fivefold cross-validation and grid search procedures were used to find and evaluate the best hyperparameters in different machine learning models. Taking the subscores of LODS and the physiology subscores that are part of the APACHE III scoring systems as input variables, four machine learning methods of XGBoost, logistic regression, support vector machine, and decision tree were used to establish ICU mortality prediction models, with AUCs as metrics. AUCs, specificity, sensitivity, positive predictive value, negative predictive value, and calibration curves were used to find the best model. Results: For the prediction of mortality risk in ICU patients, the AUC of the XGBoost model was 0.918 (95%CI, 0.915–0.922), and the AUCs of logistic regression, SVM, and decision tree were 0.872 (95%CI, 0.867–0.877), 0.872 (95%CI, 0.867–0.877), and 0.852 (95%CI, 0.847–0.857), respectively. The calibration curves of logistic regression and support vector machine performed better than the other two models in the ranges 0–40% and 70%–100%, respectively, while XGBoost performed better in the range of 40–70%. Conclusions: The mortality risk of ICU patients can be better predicted by the characteristics of the Acute Physiology Score III and the Logistic Organ Dysfunction Score with XGBoost in terms of ROC curve, sensitivity, and specificity. The XGBoost model could assist clinicians in judging in-hospital outcome of critically ill patients, especially in patients with a more uncertain survival outcome.

Collapse

Machine Learning and Antibiotic Management. Antibiotics (Basel) 2022;11:antibiotics11030304. [PMID: 35326768 PMCID: PMC8944459 DOI: 10.3390/antibiotics11030304] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2022] [Revised: 02/07/2022] [Accepted: 02/18/2022] [Indexed: 11/17/2022] Open

Unstructured clinical notes within the 24 hours since admission predict short, mid & long-term mortality in adult ICU patients. PLoS One 2022;17:e0262182. [PMID: 34990485 PMCID: PMC8735614 DOI: 10.1371/journal.pone.0262182] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2021] [Accepted: 12/17/2021] [Indexed: 01/04/2023] Open

Chen J, Guo C, Lu M, Ding S. Unifying Diagnosis Identification and Prediction Method Embedding the Disease Ontology Structure From Electronic Medical Records. Front Public Health 2022;9:793801. [PMID: 35127624 PMCID: PMC8811031 DOI: 10.3389/fpubh.2021.793801] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2021] [Accepted: 12/21/2021] [Indexed: 11/13/2022] Open

Abstract

OBJECTIVE

The reasonable classification of a large number of distinct diagnosis codes can clarify patient diagnostic information and help clinicians to improve their ability to assign and target treatment for primary diseases. Our objective is to identify and predict a unifying diagnosis (UD) from electronic medical records (EMRs).

METHODS

We screened 4,418 sepsis patients from a public MIMIC-III database and extracted their diagnostic information for UD identification, their demographic information, laboratory examination information, chief complaint, and history of present illness information for UD prediction. We proposed a data-driven UD identification and prediction method (UDIPM) embedding the disease ontology structure. First, we designed a set similarity measure method embedding the disease ontology structure to generate a patient similarity matrix. Second, we applied affinity propagation clustering to divide patients into different clusters, and extracted a typical diagnosis code co-occurrence pattern from each cluster. Furthermore, we identified a UD by fusing visual analysis and a conditional co-occurrence matrix. Finally, we trained five classifiers in combination with feature fusion and feature selection method to unify the diagnosis prediction.

RESULTS

The experimental results on a public electronic medical record dataset showed that the UDIPM could extracted a typical diagnosis code co-occurrence pattern effectively, identified and predicted a UD based on patients' diagnostic and admission information, and outperformed other fusion methods overall.

CONCLUSIONS

The accurate identification and prediction of the UD from a large number of distinct diagnosis codes and multi-source heterogeneous patient admission information in EMRs can provide a data-driven approach to assist better coding integration of diagnosis.

Collapse