1
|
Saravi B, Zink A, Ülkümen S, Couillard-Despres S, Lang G, Hassel F. Artificial intelligence-based analysis of associations between learning curve and clinical outcomes in endoscopic and microsurgical lumbar decompression surgery. EUROPEAN SPINE JOURNAL : OFFICIAL PUBLICATION OF THE EUROPEAN SPINE SOCIETY, THE EUROPEAN SPINAL DEFORMITY SOCIETY, AND THE EUROPEAN SECTION OF THE CERVICAL SPINE RESEARCH SOCIETY 2023:10.1007/s00586-023-08084-7. [PMID: 38156994 DOI: 10.1007/s00586-023-08084-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/19/2022] [Revised: 11/22/2023] [Accepted: 12/03/2023] [Indexed: 01/03/2024]
Abstract
PURPOSE A common spine surgery procedure involves decompression of the lumbar spine. The impact of the surgeon's learning curve on relevant clinical outcomes is currently not well examined in the literature. A variety of machine learning algorithms have been investigated in this study to determine how a surgeon's learning curve and other clinical parameters will influence prolonged lengths of stay (LOS), extended operating times (OT), and complications, as well as whether these clinical parameters can be reliably predicted. METHODS A retrospective monocentric cohort study of patients with lumbar spinal stenosis treated with microsurgical (MSD) and full-endoscopic (FED) decompression was conducted. The study included 206 patients with lumbar spinal stenosis who underwent FED (63; 30.6%) and MSD (118; 57.3%). Prolonged LOS and OT were defined as those exceeding the 75th percentile of the cohort. Furthermore, complications were assessed as a dependent variable. Using unsupervised learning, clusters were identified in the data, which helped distinguish between the early learning curve (ELC) and the late learning curve (LLC). From 15 algorithms, the top five algorithms that best fit the data were selected for each prediction task. We calculated the accuracy of prediction (Acc) and the area under the curve (AUC). The most significant predictors were determined using a feature importance analysis. RESULTS For the FED group, the median number of surgeries with case surgery type at the time of surgery was 72 in the ELC group and 274 in the LLC group. FED patients did not significantly differ in outcome variables (LOS, OT, complication rate) between the ELC and LLC group. The random forest model demonstrated the highest mean accuracy and AUC across all folds for each classification task. For OT, it achieved an accuracy of 76.08% and an AUC of 0.89. For LOS, the model reached an accuracy of 83.83% and an AUC of 0.91. Lastly, in predicting complications, the random forest model attained the highest accuracy of 89.90% and an AUC of 0.94. Feature importance analysis indicated that LOS, OT, and complications were more significantly affected by patient characteristics than the surgical technique (FED versus MSD) or the surgeon's learning curve. CONCLUSIONS A median of 72 cases of FED surgeries led to comparable clinical outcomes in the early learning curve phase compared to experienced surgeons. These outcomes seem to be more significantly affected by patient characteristics than the learning curve or the surgical technique. Several study variables, including the learning curve, can be used to predict whether lumbar decompression surgery will result in an increased LOS, OT, or complications. To introduce the provided prediction tools into clinics, the algorithms need to be implemented into open-source software and externally validated through large-scale randomized controlled trials.
Collapse
Affiliation(s)
- Babak Saravi
- Department of Orthopedics and Trauma Surgery, Medical Centre - Albert-Ludwigs-University of Freiburg, Faculty of Medicine, Albert-Ludwigs-University of Freiburg, Hugstetterstrasse 55, 79106, Freiburg, Germany.
- Department of Spine Surgery, Loretto Hospital, Freiburg, Germany.
- Institute of Experimental Neuroregeneration, Spinal Cord Injury and Tissue Regeneration Center Salzburg (SCI-TReCS), Paracelsus Medical University, 5020, Salzburg, Austria.
| | - Alisia Zink
- Department of Spine Surgery, Loretto Hospital, Freiburg, Germany
| | - Sara Ülkümen
- Department of Spine Surgery, Loretto Hospital, Freiburg, Germany
| | - Sebastien Couillard-Despres
- Institute of Experimental Neuroregeneration, Spinal Cord Injury and Tissue Regeneration Center Salzburg (SCI-TReCS), Paracelsus Medical University, 5020, Salzburg, Austria
- Austrian Cluster for Tissue Regeneration, Vienna, Austria
| | - Gernot Lang
- Department of Orthopedics and Trauma Surgery, Medical Centre - Albert-Ludwigs-University of Freiburg, Faculty of Medicine, Albert-Ludwigs-University of Freiburg, Hugstetterstrasse 55, 79106, Freiburg, Germany
- Department of Spine Surgery, Loretto Hospital, Freiburg, Germany
| | - Frank Hassel
- Department of Spine Surgery, Loretto Hospital, Freiburg, Germany
| |
Collapse
|
2
|
Saravi B, Zink A, Ülkümen S, Couillard-Despres S, Wollborn J, Lang G, Hassel F. Clinical and radiomics feature-based outcome analysis in lumbar disc herniation surgery. BMC Musculoskelet Disord 2023; 24:791. [PMID: 37803313 PMCID: PMC10557221 DOI: 10.1186/s12891-023-06911-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Accepted: 09/24/2023] [Indexed: 10/08/2023] Open
Abstract
BACKGROUND Low back pain is a widely prevalent symptom and the foremost cause of disability on a global scale. Although various degenerative imaging findings observed on magnetic resonance imaging (MRI) have been linked to low back pain and disc herniation, none of them can be considered pathognomonic for this condition, given the high prevalence of abnormal findings in asymptomatic individuals. Nevertheless, there is a lack of knowledge regarding whether radiomics features in MRI images combined with clinical features can be useful for prediction modeling of treatment success. The objective of this study was to explore the potential of radiomics feature analysis combined with clinical features and artificial intelligence-based techniques (machine learning/deep learning) in identifying MRI predictors for the prediction of outcomes after lumbar disc herniation surgery. METHODS We included n = 172 patients who underwent discectomy due to disc herniation with preoperative T2-weighted MRI examinations. Extracted clinical features included sex, age, alcohol and nicotine consumption, insurance type, hospital length of stay (LOS), complications, operation time, ASA score, preoperative CRP, surgical technique (microsurgical versus full-endoscopic), and information regarding the experience of the performing surgeon (years of experience with the surgical technique and the number of surgeries performed at the time of surgery). The present study employed a semiautomatic region-growing volumetric segmentation algorithm to segment herniated discs. In addition, 3D-radiomics features, which characterize phenotypic differences based on intensity, shape, and texture, were extracted from the computed magnetic resonance imaging (MRI) images. Selected features identified by feature importance analyses were utilized for both machine learning and deep learning models (n = 17 models). RESULTS The mean accuracy over all models for training and testing in the combined feature set was 93.31 ± 4.96 and 88.17 ± 2.58. The mean accuracy for training and testing in the clinical feature set was 91.28 ± 4.56 and 87.69 ± 3.62. CONCLUSIONS Our results suggest a minimal but detectable improvement in predictive tasks when radiomics features are included. However, the extent of this advantage should be considered with caution, emphasizing the potential of exploring multimodal data inputs in future predictive modeling.
Collapse
Affiliation(s)
- Babak Saravi
- Department of Orthopedics and Trauma Surgery, Faculty of Medicine, Medical Center - University of Freiburg, University of Freiburg, Freiburg, Germany.
- Department of Spine Surgery, Loretto Hospital, Freiburg, Germany.
- Institute of Experimental Neuroregeneration, Spinal Cord Injury and Tissue Regeneration Center Salzburg (SCI-TReCS), Paracelsus Medical University, Salzburg, 5020, Austria.
- Department of Anesthesiology, Perioperative and Pain Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, USA.
| | - Alisia Zink
- Department of Spine Surgery, Loretto Hospital, Freiburg, Germany
| | - Sara Ülkümen
- Department of Orthopedics and Trauma Surgery, Faculty of Medicine, Medical Center - University of Freiburg, University of Freiburg, Freiburg, Germany
| | - Sebastien Couillard-Despres
- Institute of Experimental Neuroregeneration, Spinal Cord Injury and Tissue Regeneration Center Salzburg (SCI-TReCS), Paracelsus Medical University, Salzburg, 5020, Austria
- Austrian Cluster for Tissue Regeneration, Vienna, Austria
| | - Jakob Wollborn
- Department of Anesthesiology, Perioperative and Pain Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, USA
| | - Gernot Lang
- Department of Orthopedics and Trauma Surgery, Faculty of Medicine, Medical Center - University of Freiburg, University of Freiburg, Freiburg, Germany
| | - Frank Hassel
- Department of Spine Surgery, Loretto Hospital, Freiburg, Germany
| |
Collapse
|
3
|
Saravi B, Zink A, Ülkümen S, Couillard-Despres S, Wollborn J, Lang G, Hassel F. Automated Detection and Measurement of Dural Sack Cross-Sectional Area in Lumbar Spine MRI Using Deep Learning. Bioengineering (Basel) 2023; 10:1072. [PMID: 37760174 PMCID: PMC10525778 DOI: 10.3390/bioengineering10091072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2023] [Revised: 07/21/2023] [Accepted: 09/07/2023] [Indexed: 09/29/2023] Open
Abstract
Lumbar spine magnetic resonance imaging (MRI) is a critical diagnostic tool for the assessment of various spinal pathologies, including degenerative disc disease, spinal stenosis, and spondylolisthesis. The accurate identification and quantification of the dural sack cross-sectional area are essential for the evaluation of these conditions. Current manual measurement methods are time-consuming and prone to inter-observer variability. Our study developed and validated deep learning models, specifically U-Net, Attention U-Net, and MultiResUNet, for the automated detection and measurement of the dural sack area in lumbar spine MRI, using a dataset of 515 patients with symptomatic back pain and externally validating the results based on 50 patient scans. The U-Net model achieved an accuracy of 0.9990 and 0.9987 on the initial and external validation datasets, respectively. The Attention U-Net model reported an accuracy of 0.9992 and 0.9989, while the MultiResUNet model displayed a remarkable accuracy of 0.9996 and 0.9995, respectively. All models showed promising precision, recall, and F1-score metrics, along with reduced mean absolute errors compared to the ground truth manual method. In conclusion, our study demonstrates the potential of these deep learning models for the automated detection and measurement of the dural sack cross-sectional area in lumbar spine MRI. The proposed models achieve high-performance metrics in both the initial and external validation datasets, indicating their potential utility as valuable clinical tools for the evaluation of lumbar spine pathologies. Future studies with larger sample sizes and multicenter data are warranted to validate the generalizability of the model further and to explore the potential integration of this approach into routine clinical practice.
Collapse
Affiliation(s)
- Babak Saravi
- Department of Orthopedics and Trauma Surgery, Medical Center—University of Freiburg, Faculty of Medicine, University of Freiburg, 79106 Freiburg, Germany; (S.Ü.); (G.L.)
- Department of Spine Surgery, Loretto Hospital, 79100 Freiburg, Germany; (A.Z.); (F.H.)
- Institute of Experimental Neuroregeneration, Spinal Cord Injury and Tissue Regeneration Center Salzburg (SCI-TReCS), Paracelsus Medical University, 5020 Salzburg, Austria;
- Department of Anesthesiology, Perioperative and Pain Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA 02115, USA;
| | - Alisia Zink
- Department of Spine Surgery, Loretto Hospital, 79100 Freiburg, Germany; (A.Z.); (F.H.)
| | - Sara Ülkümen
- Department of Orthopedics and Trauma Surgery, Medical Center—University of Freiburg, Faculty of Medicine, University of Freiburg, 79106 Freiburg, Germany; (S.Ü.); (G.L.)
| | - Sebastien Couillard-Despres
- Institute of Experimental Neuroregeneration, Spinal Cord Injury and Tissue Regeneration Center Salzburg (SCI-TReCS), Paracelsus Medical University, 5020 Salzburg, Austria;
- Austrian Cluster for Tissue Regeneration, 1200 Vienna, Austria
| | - Jakob Wollborn
- Department of Anesthesiology, Perioperative and Pain Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA 02115, USA;
| | - Gernot Lang
- Department of Orthopedics and Trauma Surgery, Medical Center—University of Freiburg, Faculty of Medicine, University of Freiburg, 79106 Freiburg, Germany; (S.Ü.); (G.L.)
| | - Frank Hassel
- Department of Spine Surgery, Loretto Hospital, 79100 Freiburg, Germany; (A.Z.); (F.H.)
| |
Collapse
|