1
|
Sanghvi PA, Shah AK, Hecht CJ, Karimi AH, Kamath AF. Optimal inputs for machine learning models in predicting total joint arthroplasty outcomes: a systematic review. EUROPEAN JOURNAL OF ORTHOPAEDIC SURGERY & TRAUMATOLOGY : ORTHOPEDIE TRAUMATOLOGIE 2024; 34:3809-3825. [PMID: 39212689 DOI: 10.1007/s00590-024-04076-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/20/2024] [Accepted: 08/11/2024] [Indexed: 09/04/2024]
Abstract
INTRODUCTION Machine learning (ML) models may offer a novel solution to reducing postoperative complication rates and improving post-surgical outcomes after total joint arthroplasty (TJA). However, the variety of different ML models that exist paired with the increasing number of potential inputs can make the implementation of this tool challenging. Therefore, we conducted a systematic review to assess the most optimal inputs of different ML models in predicting postoperative (1) medical outcomes, (2) orthopedic outcomes, and (3) patient-reported outcome measures (PROMs) after total joint arthroplasty. METHODS The PubMed, MEDLINE, EBSCOhost, and Google Scholar databases were utilized to identify all studies evaluating ML models predicting outcomes following TJA between January 1, 2000, and June 23, 2023 (PROSPERO study protocol registration: CRD42023437586). The mean risk of bias in non-randomized studies-of interventions score, was 13.8 ± 0.5. Our initial query yielded 656 articles, of which 25 articles aligned with our aims, examining over 20 machine learning models and 1,555,300 surgeries. The area under the curve (AUC), accuracy, inputs, and the importance of each input were reported. RESULTS Twelve studies evaluating medical complications with 13 ML models reported AUCs ranging from 0.57 to 0.997 and accuracy between 88% and 99.98%. Key predictors included age, hyper-coagulopathy, total number of diagnoses, admission month, and malnutrition. Five studies evaluating orthopedic complications with 10 ML models reported AUCs from 0.49 to 0.93 and accuracy ranging from 92 to 97%, with age, BMI, CCI, AKSS scores, and height identified as key predictors. Ten studies evaluating PROMs comprising of 12 different ML models had an AUC ranging from 0.453 to 0.97 ranked preoperative PROMs as the post-predictive input. Overall, age was the most predictive risk factor for complications post-total joint arthroplasty (TJA). CONCLUSION These studies demonstrate the predictive capabilities of these models for anticipating complications and outcomes. Furthermore, these studies also highlight ML models' ability to identify non-classical variables not commonly considered in addition to confirming variables known to be crucial. To advance the field, forthcoming research should adhere to established guidelines for model development and training, employ industry-standard input parameters, and subject their models to external validity assessments.
Collapse
Affiliation(s)
- Parshva A Sanghvi
- Department of Orthopaedic Surgery, Center for Hip Preservation, Orthopaedic and Rheumatologic Institute, Institute Cleveland Clinic Foundation, Mail code A41, 9500 Euclid Ave, Cleveland, OH, 44195, USA
| | - Aakash K Shah
- Department of Orthopaedic Surgery, Center for Hip Preservation, Orthopaedic and Rheumatologic Institute, Institute Cleveland Clinic Foundation, Mail code A41, 9500 Euclid Ave, Cleveland, OH, 44195, USA
| | - Christian J Hecht
- Department of Orthopaedic Surgery, Center for Hip Preservation, Orthopaedic and Rheumatologic Institute, Institute Cleveland Clinic Foundation, Mail code A41, 9500 Euclid Ave, Cleveland, OH, 44195, USA
| | - Amir H Karimi
- Department of Orthopaedic Surgery, Center for Hip Preservation, Orthopaedic and Rheumatologic Institute, Institute Cleveland Clinic Foundation, Mail code A41, 9500 Euclid Ave, Cleveland, OH, 44195, USA
| | - Atul F Kamath
- Department of Orthopaedic Surgery, Center for Hip Preservation, Orthopaedic and Rheumatologic Institute, Institute Cleveland Clinic Foundation, Mail code A41, 9500 Euclid Ave, Cleveland, OH, 44195, USA.
| |
Collapse
|
2
|
Chavosh Nejad M, Vestergaard Matthiesen R, Dukovska-Popovska I, Jakobsen T, Johansen J. Machine learning for predicting duration of surgery and length of stay: A literature review on joint arthroplasty. Int J Med Inform 2024; 192:105631. [PMID: 39293161 DOI: 10.1016/j.ijmedinf.2024.105631] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2024] [Revised: 08/15/2024] [Accepted: 09/13/2024] [Indexed: 09/20/2024]
Abstract
INTRODUCTION In recent years, different factors such as population aging have caused escalating demand for hip and knee arthroplasty straining already limited hospitals' resources. To address this challenge, focus is put on medical and operational efficiency improvements. This includes an increased use of machine learning (ML) to predict duration of surgery (DOS) and length of stay (LOS) for total knee and total hip arthroplasty, which can be utilized for optimizing resource allocation to satisfy medical and operational limitations. This paper explores the development and performance of ML models in predicting DOS and LOS. METHODS A systematic search of publications between 2010-2023 was conducted following PRISMA guidelines. Considering the inclusion and exclusion criteria, 28 out of 722 gathered papers from PubMed, Web of Science, and manual search were included in the study. Descriptive statistics was used to analyze the extracted data regarding data preprocessing, model development, and model performance assessment. RESULTS Most of the papers work on LOS as a binary variable. Patient's age was identified as the most frequently used and reported as important variable for predicting DOS and LOS. Investigations also illustrated that within the resulting 28 papers, more than 71% of models reached good to perfect performance based on the area under the receiver operating characteristic curve (AUC), where artificial neural networks and ensemble learning models had the biggest share among the best-performing models. CONCLUSION The utilization of ML models is increasing in the literature. The current performance level indicates that ML can potentially turn to powerful tools in predicting DOS and LOS for different purposes. Meanwhile, the literature is not matured yet in reporting real-life application. Future studies can focus on model specification and validation by considering empirical application.
Collapse
Affiliation(s)
- Mohammad Chavosh Nejad
- Department of Materials and Production, Aalborg University, Fibigerstræde 16, 2-109, Aalborg Ø 9220, Danmark.
| | | | - Iskra Dukovska-Popovska
- Department of Materials and Production, Aalborg University, Fibigerstræde 16, 2-107, Aalborg Ø 9220, Danmark.
| | - Thomas Jakobsen
- Department of Orthopaedics, Aalborg University Hospital, Hobrovej 18-22, Aalborg Universitetshospital, Aalborg Syd 9000, Danmark.
| | - John Johansen
- Department of Materials and Production, Aalborg University, Fibigerstræde 16, 2-114, Aalborg Ø 9220, Danmark.
| |
Collapse
|
3
|
Gotzler A, Glowalla C, Hinterwimmer F, Schneidmüller D, Hungerer S. [Endoprosthetic treatment of femoral neck fractures in Germany : Cumulative analysis of EPRD registry data from 2013 to 2020]. ORTHOPADIE (HEIDELBERG, GERMANY) 2024:10.1007/s00132-024-04568-6. [PMID: 39325195 DOI: 10.1007/s00132-024-04568-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 08/26/2024] [Indexed: 09/27/2024]
Abstract
BACKGROUND The German Arthroplasty Registry (EPRD) recorded nearly 52,000 femoral neck fractures treated with arthroplasty by 2020. This study aimed to identify survival rates and risk factors for hip prosthesis failure. MATERIAL AND METHODS The study included all patients with arthroplasty after hip fractures documented in the EPRD. Data were analyzed with focus on failure rate regarding implant, implantation technique, age, BMI, and comorbidities. For more complex analysis of dependencies, the machine learning algorithm (MLA) XGBoost (Extreme Gradient Boosting) was used. RESULTS The study included 51,938 patients. The failure rate was 3.7% for HEs and 5.6% for THA. The failure rate increased in male patients (p < 0.0001), those with higher BMI, young patients with a high Elixhauser Comorbidity Score (ECS) and a cementless technique. The timepoint of surgery, i.e. ,working day vs. weekend or holiday had no influence on the outcome. The feature importance (FI) generated by MLA demonstrated factors with the highest impact on failure, i.e., survival time (1029), BMI (722), and age (481). CONCLUSION For younger patients with comorbidities, a cemented implantation technique should be considered. Failure rates of arthroplasties did not differ on workdays compared to weekends or holidays. MLA are suitable to analyze registry data for complex correlations of factors.
Collapse
Affiliation(s)
- Alexander Gotzler
- Endoprothetikzentrum Murnau, BG Unfallklinik Murnau, Prof.-Küntscher-Str. 8, 82418, Murnau, Deutschland
| | - Claudio Glowalla
- Endoprothetikzentrum Murnau, BG Unfallklinik Murnau, Prof.-Küntscher-Str. 8, 82418, Murnau, Deutschland
- Klinikum Rechts der Isar der Technischen Universität München, München, Deutschland
| | - Florian Hinterwimmer
- Klinikum Rechts der Isar der Technischen Universität München, München, Deutschland
| | | | - Sven Hungerer
- Endoprothetikzentrum Murnau, BG Unfallklinik Murnau, Prof.-Küntscher-Str. 8, 82418, Murnau, Deutschland.
- BG Unfallklinik Murnau, Murnau, Deutschland.
- Institut für Biomechanik, Paracelsus Medizinischen Privatuniversität Salzburg, Salzburg, Österreich.
| |
Collapse
|
4
|
Huffman N, Pasqualini I, Khan ST, Klika AK, Deren ME, Jin Y, Kunze KN, Piuzzi NS. Enabling Personalized Medicine in Orthopaedic Surgery Through Artificial Intelligence: A Critical Analysis Review. JBJS Rev 2024; 12:01874474-202403000-00006. [PMID: 38466797 DOI: 10.2106/jbjs.rvw.23.00232] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/13/2024]
Abstract
» The application of artificial intelligence (AI) in the field of orthopaedic surgery holds potential for revolutionizing health care delivery across 3 crucial domains: (I) personalized prediction of clinical outcomes and adverse events, which may optimize patient selection, surgical planning, and enhance patient safety and outcomes; (II) diagnostic automated and semiautomated imaging analyses, which may reduce time burden and facilitate precise and timely diagnoses; and (III) forecasting of resource utilization, which may reduce health care costs and increase value for patients and institutions.» Computer vision is one of the most highly studied areas of AI within orthopaedics, with applications pertaining to fracture classification, identification of the manufacturer and model of prosthetic implants, and surveillance of prosthesis loosening and failure.» Prognostic applications of AI within orthopaedics include identifying patients who will likely benefit from a specified treatment, predicting prosthetic implant size, postoperative length of stay, discharge disposition, and surgical complications. Not only may these applications be beneficial to patients but also to institutions and payors because they may inform potential cost expenditure, improve overall hospital efficiency, and help anticipate resource utilization.» AI infrastructure development requires institutional financial commitment and a team of clinicians and data scientists with expertise in AI that can complement skill sets and knowledge. Once a team is established and a goal is determined, teams (1) obtain, curate, and label data; (2) establish a reference standard; (3) develop an AI model; (4) evaluate the performance of the AI model; (5) externally validate the model, and (6) reinforce, improve, and evaluate the model's performance until clinical implementation is possible.» Understanding the implications of AI in orthopaedics may eventually lead to wide-ranging improvements in patient care. However, AI, while holding tremendous promise, is not without methodological and ethical limitations that are essential to address. First, it is important to ensure external validity of programs before their use in a clinical setting. Investigators should maintain high quality data records and registry surveillance, exercise caution when evaluating others' reported AI applications, and increase transparency of the methodological conduct of current models to improve external validity and avoid propagating bias. By addressing these challenges and responsibly embracing the potential of AI, the medical field may eventually be able to harness its power to improve patient care and outcomes.
Collapse
Affiliation(s)
- Nickelas Huffman
- Cleveland Clinic, Department of Orthopaedic Surgery, Cleveland, Ohio
| | | | - Shujaa T Khan
- Cleveland Clinic, Department of Orthopaedic Surgery, Cleveland, Ohio
| | - Alison K Klika
- Cleveland Clinic, Department of Orthopaedic Surgery, Cleveland, Ohio
| | - Matthew E Deren
- Cleveland Clinic, Department of Orthopaedic Surgery, Cleveland, Ohio
| | - Yuxuan Jin
- Cleveland Clinic, Department of Orthopaedic Surgery, Cleveland, Ohio
| | - Kyle N Kunze
- Department of Orthopaedic Surgery, Hospital for Special Surgery, New York, New York
| | - Nicolas S Piuzzi
- Cleveland Clinic, Department of Orthopaedic Surgery, Cleveland, Ohio
- Department of Biomedical Engineering, Cleveland Clinic Foundation, Cleveland, Ohio
| |
Collapse
|
5
|
Hunter J, Soleymani F, Viktor H, Michalowski W, Poitras S, Beaulé PE. Using Unsupervised Machine Learning to Predict Quality of Life After Total Knee Arthroplasty. J Arthroplasty 2024; 39:677-682. [PMID: 37770008 DOI: 10.1016/j.arth.2023.09.027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Revised: 09/08/2023] [Accepted: 09/16/2023] [Indexed: 10/03/2023] Open
Abstract
BACKGROUND Patient-reported outcome measures (PROMs) are an important metric to assess total knee arthroplasty (TKA) patients. The purpose of this study was to use a machine learning (ML) algorithm to identify patient features that impact PROMs after TKA. METHODS Data from 636 TKA patients enrolled in our patient database between 2018 and 2022, were retrospectively reviewed. Their mean age was 68 years (range, 39 to 92), 56.7% women, and mean body mass index of 31.17 (range, 16 to 58). Patient demographics and the Functional Comorbidity Index were collected alongside Patient-Reported Outcome Measures Information System Global Health v1.2 (PROMIS GH-P) physical component scores preoperatively, at 3 months, and 1 year after TKA. An unsupervised ML algorithm (spectral clustering) was used to identify patient features impacting PROMIS GH-P scores at the various time points. RESULTS The algorithm identified 5 patient clusters that varied by demographics, comorbidities, and pain scores. Each cluster was associated with predictable trends in PROMIS GH-P scores across the time points. Notably, patients who had the worst preoperative PROMIS GH-P scores (cluster 5) had the most improvement after TKA, whereas patients who had higher global health rating preoperatively had more modest improvement (clusters 1, 2, and 3). Two out of Five patient clusters (cluster 4 and 5) showed improvement in PROMIS GH-P scores that met a minimally clinically important difference at 1-year postoperative. CONCLUSIONS The unsupervised ML algorithm identified patient clusters that had predictable changes in PROMs after TKA. It is a positive step toward providing precision medical care for each of our arthroplasty patients.
Collapse
Affiliation(s)
- Jennifer Hunter
- Division of Orthopaedics, The Ottawa Hospital, Ottawa, Ontario, Canada
| | - Farzan Soleymani
- Faculty of Engineering, University of Ottawa, Ottawa, Ontario, Canada
| | - Herna Viktor
- Faculty of Engineering, University of Ottawa, Ottawa, Ontario, Canada
| | - Wojtek Michalowski
- Telfer School of Management, University of Ottawa, Ottawa, Ontario, Canada
| | - Stéphane Poitras
- School of Rehabilitation Sciences, University of Ottawa, Ottawa, Ontario, Canada
| | - Paul E Beaulé
- Division of Orthopaedics, The Ottawa Hospital, Ottawa, Ontario, Canada
| |
Collapse
|
6
|
Lippenberger F, Ziegelmayer S, Berlet M, Feussner H, Makowski M, Neumann PA, Graf M, Kaissis G, Wilhelm D, Braren R, Reischl S. Development of an image-based Random Forest classifier for prediction of surgery duration of laparoscopic sigmoid resections. Int J Colorectal Dis 2024; 39:21. [PMID: 38273097 PMCID: PMC10811180 DOI: 10.1007/s00384-024-04593-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 01/10/2024] [Indexed: 01/27/2024]
Abstract
PURPOSE Sigmoid diverticulitis is a disease with a high socioeconomic burden, accounting for a high number of left-sided colonic resections worldwide. Modern surgical scheduling relies on accurate prediction of operation times to enhance patient care and optimize healthcare resources. This study aims to develop a predictive model for surgery duration in laparoscopic sigmoid resections, based on preoperative CT biometric and demographic patient data. METHODS This retrospective single-center cohort study included 85 patients who underwent laparoscopic sigmoid resection for diverticular disease. Potentially relevant procedure-specific anatomical parameters recommended by a surgical expert were measured in preoperative CT imaging. After random split into training and test set (75% / 25%) multiclass logistic regression was performed and a Random Forest classifier was trained on CT imaging parameters, patient age, and sex in the training cohort to predict categorized surgery duration. The models were evaluated in the test cohort using established performance metrics including receiver operating characteristics area under the curve (AUROC). RESULTS The Random Forest model achieved a good average AUROC of 0.78. It allowed a very good prediction of long (AUROC = 0.89; specificity 0.71; sensitivity 1.0) and short (AUROC = 0.81; specificity 0.77; sensitivity 0.56) procedures. It clearly outperformed the multiclass logistic regression model (AUROC: average = 0.33; short = 0.31; long = 0.22). CONCLUSION A Random Forest classifier trained on demographic and CT imaging biometric patient data could predict procedure duration outliers of laparoscopic sigmoid resections. Pending validation in a multicenter study, this approach could potentially improve procedure scheduling in visceral surgery and be scaled to other procedures.
Collapse
Affiliation(s)
- Florian Lippenberger
- Institute of Diagnostic and Interventional Radiology, School of Medicine and Health, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany
| | - Sebastian Ziegelmayer
- Institute of Diagnostic and Interventional Radiology, School of Medicine and Health, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany
| | - Maximilian Berlet
- Department of Surgery, School of Medicine, Technical University of Munich, Ismaninger Straße 22, 81675, Munich, Germany
- Research Group MITI, Klinikum rechts der Isar, Technical University Munich, Munich, Germany
| | - Hubertus Feussner
- Department of Surgery, School of Medicine, Technical University of Munich, Ismaninger Straße 22, 81675, Munich, Germany
- Research Group MITI, Klinikum rechts der Isar, Technical University Munich, Munich, Germany
| | - Marcus Makowski
- Institute of Diagnostic and Interventional Radiology, School of Medicine and Health, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany
| | - Philipp-Alexander Neumann
- Department of Surgery, School of Medicine, Technical University of Munich, Ismaninger Straße 22, 81675, Munich, Germany
| | - Markus Graf
- Institute of Diagnostic and Interventional Radiology, School of Medicine and Health, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany
| | - Georgios Kaissis
- Institute of Diagnostic and Interventional Radiology, School of Medicine and Health, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany
- Institute for Artificial Intelligence in Medicine and Healthcare, School of Medicine and Faculty of Informatics, Technical University of Munich, Munich, Germany
| | - Dirk Wilhelm
- Department of Surgery, School of Medicine, Technical University of Munich, Ismaninger Straße 22, 81675, Munich, Germany
- Research Group MITI, Klinikum rechts der Isar, Technical University Munich, Munich, Germany
| | - Rickmer Braren
- Institute of Diagnostic and Interventional Radiology, School of Medicine and Health, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany
- German Cancer Consortium (DKTK, Partner Site Munich) and German Cancer Research Center, DKFZ, Heidelberg, Germany
| | - Stefan Reischl
- Institute of Diagnostic and Interventional Radiology, School of Medicine and Health, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany.
| |
Collapse
|