Xing F, Luo R, Liu M, Zhou Z, Xiang Z, Duan X. A New Random Forest Algorithm-Based Prediction Model of Post-operative Mortality in Geriatric Patients With Hip Fractures.
Front Med (Lausanne) 2022;
9:829977. [PMID:
35646950 PMCID:
PMC9130605 DOI:
10.3389/fmed.2022.829977]
[Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2021] [Accepted: 03/31/2022] [Indexed: 02/05/2023] Open
Abstract
Background
Post-operative mortality risk assessment for geriatric patients with hip fractures (HF) is a challenge for clinicians. Early identification of geriatric HF patients with a high risk of post-operative death is helpful for early intervention and improving clinical prognosis. However, a single significant risk factor of post-operative death cannot accurately predict the prognosis of geriatric HF patients. Therefore, our study aims to utilize a machine learning approach, random forest algorithm, to fabricate a prediction model for post-operative death of geriatric HF patients.
Methods
This retrospective study enrolled consecutive geriatric HF patients who underwent treatment for surgery. The study cohort was divided into training and testing datasets at a 70:30 ratio. The random forest algorithm selected or excluded variables according to the feature importance. Least absolute shrinkage and selection operator (Lasso) was utilized to compare feature selection results of random forest. The confirmed variables were used to create a simplified model instead of a full model with all variables. The prediction model was then verified in the training dataset and testing dataset. Additionally, a prediction model constructed by logistic regression was used as a control to evaluate the efficiency of the new prediction model.
Results
Feature selection by random forest algorithm and Lasso regression demonstrated that seven variables, including age, time from injury to surgery, chronic obstructive pulmonary disease (COPD), albumin, hemoglobin, history of malignancy, and perioperative blood transfusion, could be used to predict the 1-year post-operative mortality. The area under the curve (AUC) of the random forest algorithm-based prediction model in training and testing datasets were 1.000, and 0.813, respectively. While the prediction tool constructed by logistic regression in training and testing datasets were 0.895, and 0.797, respectively.
Conclusions
Compared with logistic regression, the random forest algorithm-based prediction model exhibits better predictive ability for geriatric HF patients with a high risk of death within post-operative 1 year.
Collapse