1
|
Ruitenbeek HC, Oei EHG, Visser JJ, Kijowski R. Artificial intelligence in musculoskeletal imaging: realistic clinical applications in the next decade. Skeletal Radiol 2024:10.1007/s00256-024-04684-6. [PMID: 38902420 DOI: 10.1007/s00256-024-04684-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Revised: 04/06/2024] [Accepted: 04/15/2024] [Indexed: 06/22/2024]
Abstract
This article will provide a perspective review of the most extensively investigated deep learning (DL) applications for musculoskeletal disease detection that have the best potential to translate into routine clinical practice over the next decade. Deep learning methods for detecting fractures, estimating pediatric bone age, calculating bone measurements such as lower extremity alignment and Cobb angle, and grading osteoarthritis on radiographs have been shown to have high diagnostic performance with many of these applications now commercially available for use in clinical practice. Many studies have also documented the feasibility of using DL methods for detecting joint pathology and characterizing bone tumors on magnetic resonance imaging (MRI). However, musculoskeletal disease detection on MRI is difficult as it requires multi-task, multi-class detection of complex abnormalities on multiple image slices with different tissue contrasts. The generalizability of DL methods for musculoskeletal disease detection on MRI is also challenging due to fluctuations in image quality caused by the wide variety of scanners and pulse sequences used in routine MRI protocols. The diagnostic performance of current DL methods for musculoskeletal disease detection must be further evaluated in well-designed prospective studies using large image datasets acquired at different institutions with different imaging parameters and imaging hardware before they can be fully implemented in clinical practice. Future studies must also investigate the true clinical benefits of current DL methods and determine whether they could enhance quality, reduce error rates, improve workflow, and decrease radiologist fatigue and burnout with all of this weighed against the costs.
Collapse
Affiliation(s)
- Huibert C Ruitenbeek
- Department of Radiology and Nuclear Medicine, Erasmus MC, University Medical Center, P.O. Box 2040, 3000 CA, Rotterdam, The Netherlands
| | - Edwin H G Oei
- Department of Radiology and Nuclear Medicine, Erasmus MC, University Medical Center, P.O. Box 2040, 3000 CA, Rotterdam, The Netherlands
| | - Jacob J Visser
- Department of Radiology and Nuclear Medicine, Erasmus MC, University Medical Center, P.O. Box 2040, 3000 CA, Rotterdam, The Netherlands
| | - Richard Kijowski
- Department of Radiology, New York University Grossman School of Medicine, 660 First Avenue, 3rd Floor, New York, NY, 10016, USA.
| |
Collapse
|
2
|
Nowroozi A, Salehi MA, Shobeiri P, Agahi S, Momtazmanesh S, Kaviani P, Kalra MK. Artificial intelligence diagnostic accuracy in fracture detection from plain radiographs and comparing it with clinicians: a systematic review and meta-analysis. Clin Radiol 2024:S0009-9260(24)00200-9. [PMID: 38772766 DOI: 10.1016/j.crad.2024.04.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2024] [Revised: 04/09/2024] [Accepted: 04/15/2024] [Indexed: 05/23/2024]
Abstract
PURPOSE Fracture detection is one of the most commonly used and studied aspects of artificial intelligence (AI) in medicine. In this systematic review and meta-analysis, we aimed to summarize available literature and data regarding AI performance in fracture detection on plain radiographs and various factors affecting it. METHODS We systematically reviewed studies evaluating AI algorithms in detecting bone fractures in plain radiographs, combined their performance using meta-analysis (a bivariate regression approach), and compared it with that of clinicians. We also analyzed the factors potentially affecting algorithm performance using meta-regression. RESULTS Our analysis included 100 studies. In 83 studies with confusion matrices, AI algorithms showed a sensitivity of 91.43% and a specificity of 92.12% (Area under the summary receiver operator curve = 0.968). After adjustment and false discovery rate correction, tibia/fibula (excluding ankle) fractures were associated with higher (7.0%, p=0.004) AI sensitivity, while more recent publications (5.5%, p=0.003) and Xception architecture (6.6%, p<0.001) were associated with higher specificity. Clinicians and AI showed similar specificity in fracture identification, although AI leaned to higher sensitivity (7.6%, p=0.07). Radiologists, on the other hand, were more specific than AI overall and in several subgroups, and more sensitive to hip fractures before FDR correction. CONCLUSIONS Currently available AI aids could result in a significant improvement in care where radiologists are not readily available. Moreover, identifying factors affecting algorithm performance could guide AI development teams in their process of optimizing their products.
Collapse
Affiliation(s)
- A Nowroozi
- School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
| | - M A Salehi
- School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
| | - P Shobeiri
- School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
| | - S Agahi
- School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
| | - S Momtazmanesh
- School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
| | - P Kaviani
- Department of Radiology, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA
| | - M K Kalra
- Department of Radiology, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA.
| |
Collapse
|
3
|
Cote MP, Lubowitz JH. Recommended Requirements and Essential Elements for Proper Reporting of the Use of Artificial Intelligence Machine Learning Tools in Biomedical Research and Scientific Publications. Arthroscopy 2024; 40:1033-1038. [PMID: 38300189 DOI: 10.1016/j.arthro.2023.12.027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/30/2023] [Accepted: 12/30/2023] [Indexed: 02/02/2024]
Abstract
Essential elements required for proper use of artificial intelligence machine learning tools in biomedical research and scientific publications include (1) explanation justifying why a machine learning approach contributes to the purpose of the study; (2) description of the adequacy of the data (input) to produce the desired results (output); (3) details of the algorithmic (i.e., computational) approach including methods for organizing the data (preprocessing); the machine learning computational algorithm(s) assessed; on what data the models were trained; the presence of bias and efforts to mitigate these effects; and the methods for quantifying the variables (features) most influential in determining the results (e.g., Shapley values); (4) description of methods, and reporting of results, quantitating performance in terms of both model accuracy and model calibration (level of confidence in the model's predictions); (5) availability of the programming code (including a link to the code when available-ideally, the code should be available); (6) discussion of model internal validation (results applicable and sensitive to the population investigated and data on which the model was trained) and external validation (were the results investigated as to whether they are generalizable to different populations? If not, consideration of this limitation and discussion of plans for external validation, i.e., next steps). As biomedical research submissions using artificial intelligence technology increase, these requirements could facilitate purposeful use and comprehensive methodological reporting.
Collapse
|
4
|
Khoriati AA, Shahid Z, Fok M, Frank RM, Voss A, D'Hooghe P, Imam MA. Artificial intelligence and the orthopaedic surgeon: A review of the literature and potential applications for future practice: Current concepts. J ISAKOS 2024; 9:227-233. [PMID: 37949113 DOI: 10.1016/j.jisako.2023.10.015] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Revised: 10/28/2023] [Accepted: 10/30/2023] [Indexed: 11/12/2023]
Affiliation(s)
- Al-Achraf Khoriati
- Rowley Bristow Orthopaedic Centre, Ashford and St Peter's NHS Foundation Trust, Chertsey, KT106PZ, UK.
| | - Zuhaib Shahid
- Rowley Bristow Orthopaedic Centre, Ashford and St Peter's NHS Foundation Trust, Chertsey, KT106PZ, UK.
| | - Margaret Fok
- Department of Orthopaedics and Traumatology, Queen Mary Hospital, The University of Hong Kong, Pok Fu Lam Rd, High West, Hong Kong, China; Asia Pacific Orthopaedic Association, 57000, Malaysia.
| | - Rachel M Frank
- Department of Orthopaedic Surgery, Joint Preservation Program, University of Colorado School of Medicine, 12631 E 17th Ave, Mail Stop B202, Aurora, CO 80045, USA.
| | - Andreas Voss
- Sporthopaedicum Regensburg, Street, Hildegard-von-Bingen-Straße 1, 93053, Regensburg, Germany.
| | - Pieter D'Hooghe
- Aspetar Orthopedic and Sports Medicine Hospital, Aspire Zone, Sportscity Street 1, P.O. Box 29222, Doha, Qatar
| | - Mohamed A Imam
- Rowley Bristow Orthopaedic Centre, Ashford and St Peter's NHS Foundation Trust, Chertsey, KT106PZ, UK; Smart Health Centre, University of East London, University Way, London, E16 2RD, United Kingdom.
| |
Collapse
|
5
|
Ackermann J, Hoch A, Snedeker JG, Zingg PO, Esfandiari H, Fürnstahl P. Automatic 3D Postoperative Evaluation of Complex Orthopaedic Interventions. J Imaging 2023; 9:180. [PMID: 37754944 PMCID: PMC10532700 DOI: 10.3390/jimaging9090180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Revised: 08/21/2023] [Accepted: 08/27/2023] [Indexed: 09/28/2023] Open
Abstract
In clinical practice, image-based postoperative evaluation is still performed without state-of-the-art computer methods, as these are not sufficiently automated. In this study we propose a fully automatic 3D postoperative outcome quantification method for the relevant steps of orthopaedic interventions on the example of Periacetabular Osteotomy of Ganz (PAO). A typical orthopaedic intervention involves cutting bone, anatomy manipulation and repositioning as well as implant placement. Our method includes a segmentation based deep learning approach for detection and quantification of the cuts. Furthermore, anatomy repositioning was quantified through a multi-step registration method, which entailed a coarse alignment of the pre- and postoperative CT images followed by a fine fragment alignment of the repositioned anatomy. Implant (i.e., screw) position was identified by 3D Hough transform for line detection combined with fast voxel traversal based on ray tracing. The feasibility of our approach was investigated on 27 interventions and compared against manually performed 3D outcome evaluations. The results show that our method can accurately assess the quality and accuracy of the surgery. Our evaluation of the fragment repositioning showed a cumulative error for the coarse and fine alignment of 2.1 mm. Our evaluation of screw placement accuracy resulted in a distance error of 1.32 mm for screw head location and an angular deviation of 1.1° for screw axis. As a next step we will explore generalisation capabilities by applying the method to different interventions.
Collapse
Affiliation(s)
- Joëlle Ackermann
- Research in Orthopedic Computer Science, Balgrist University Hospital, University of Zurich, 8008 Zurich, Switzerland
- Laboratory for Orthopaedic Biomechanics, ETH Zurich, 8093 Zurich, Switzerland
| | - Armando Hoch
- Department of Orthopedics, Balgrist University Hospital, University of Zurich, 8008 Zurich, Switzerland
| | - Jess Gerrit Snedeker
- Laboratory for Orthopaedic Biomechanics, ETH Zurich, 8093 Zurich, Switzerland
- Department of Orthopedics, Balgrist University Hospital, University of Zurich, 8008 Zurich, Switzerland
| | - Patrick Oliver Zingg
- Department of Orthopedics, Balgrist University Hospital, University of Zurich, 8008 Zurich, Switzerland
| | - Hooman Esfandiari
- Research in Orthopedic Computer Science, Balgrist University Hospital, University of Zurich, 8008 Zurich, Switzerland
| | - Philipp Fürnstahl
- Research in Orthopedic Computer Science, Balgrist University Hospital, University of Zurich, 8008 Zurich, Switzerland
| |
Collapse
|
6
|
Gao Y, Soh NYT, Liu N, Lim G, Ting D, Cheng LTE, Wong KM, Liew C, Oh HC, Tan JR, Venkataraman N, Goh SH, Yan YY. Application of a deep learning algorithm in the detection of hip fractures. iScience 2023; 26:107350. [PMID: 37554447 PMCID: PMC10404720 DOI: 10.1016/j.isci.2023.107350] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Revised: 05/30/2023] [Accepted: 07/06/2023] [Indexed: 08/10/2023] Open
Abstract
This paper describes the development of a deep learning model for prediction of hip fractures on pelvic radiographs (X-rays). Developed using over 40,000 pelvic radiographs from a single institution, the model demonstrated high sensitivity and specificity when applied to a test set of emergency department radiographs. This study approximates the real-world application of a deep learning fracture detection model by including radiographs with sub-optimal image quality, other non-hip fractures, and metallic implants, which were excluded from prior published work. The study also explores the effect of ethnicity on model performance, as well as the accuracy of visualization algorithm for fracture localization.
Collapse
Affiliation(s)
- Yan Gao
- Health Services Research, Changi General Hospital, Singapore Health Services (SingHealth), Singapore, Singapore
| | - Nicholas Yock Teck Soh
- Department of Diagnostic Radiology, Changi General Hospital, Singapore Health Services (SingHealth), Singapore, Singapore
| | - Nan Liu
- Centre for Quantitative Medicine, Duke-NUS Medical School, Singapore, Singapore
| | - Gilbert Lim
- Centre for Quantitative Medicine, Duke-NUS Medical School, Singapore, Singapore
| | - Daniel Ting
- Singapore Health Services (SingHealth), Duke-NUS Medical School, Singapore, Singapore
| | - Lionel Tim-Ee Cheng
- Department of Diagnostic Radiology, Singapore General Hospital, Singapore Health Services (SingHealth), Singapore, Singapore
- Radiological Sciences ACP, Duke-NUS Medical School, Singapore, Singapore
| | - Kang Min Wong
- Department of Diagnostic Radiology, Changi General Hospital, Singapore Health Services (SingHealth), Singapore, Singapore
- Radiological Sciences ACP, Duke-NUS Medical School, Singapore, Singapore
| | - Charlene Liew
- Department of Diagnostic Radiology, Changi General Hospital, Singapore Health Services (SingHealth), Singapore, Singapore
- Radiological Sciences ACP, Duke-NUS Medical School, Singapore, Singapore
| | - Hong Choon Oh
- Health Services Research, Changi General Hospital, Singapore Health Services (SingHealth), Singapore, Singapore
| | - Jin Rong Tan
- Department of Diagnostic Radiology, Singapore General Hospital, Singapore Health Services (SingHealth), Singapore, Singapore
| | - Narayan Venkataraman
- Department of Medical Informatics, Changi General Hospital, Singapore Health Services (SingHealth), Singapore, Singapore
| | - Siang Hiong Goh
- Department of Emergency Medicine, Changi General Hospital, Singapore Health Services (SingHealth), Singapore, Singapore
| | - Yet Yen Yan
- Department of Diagnostic Radiology, Changi General Hospital, Singapore Health Services (SingHealth), Singapore, Singapore
- Radiological Sciences ACP, Duke-NUS Medical School, Singapore, Singapore
| |
Collapse
|
7
|
Debs P, Fayad LM. The promise and limitations of artificial intelligence in musculoskeletal imaging. FRONTIERS IN RADIOLOGY 2023; 3:1242902. [PMID: 37609456 PMCID: PMC10440743 DOI: 10.3389/fradi.2023.1242902] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/19/2023] [Accepted: 07/26/2023] [Indexed: 08/24/2023]
Abstract
With the recent developments in deep learning and the rapid growth of convolutional neural networks, artificial intelligence has shown promise as a tool that can transform several aspects of the musculoskeletal imaging cycle. Its applications can involve both interpretive and non-interpretive tasks such as the ordering of imaging, scheduling, protocoling, image acquisition, report generation and communication of findings. However, artificial intelligence tools still face a number of challenges that can hinder effective implementation into clinical practice. The purpose of this review is to explore both the successes and limitations of artificial intelligence applications throughout the muscuskeletal imaging cycle and to highlight how these applications can help enhance the service radiologists deliver to their patients, resulting in increased efficiency as well as improved patient and provider satisfaction.
Collapse
Affiliation(s)
- Patrick Debs
- The Russell H. Morgan Department of Radiology & Radiological Science, The Johns Hopkins Medical Institutions, Baltimore, MD, United States
| | - Laura M. Fayad
- The Russell H. Morgan Department of Radiology & Radiological Science, The Johns Hopkins Medical Institutions, Baltimore, MD, United States
- Department of Orthopaedic Surgery, Johns Hopkins University School of Medicine, Baltimore, MD, United States
- Department of Oncology, Johns Hopkins University School of Medicine, Baltimore, MD, United States
| |
Collapse
|
8
|
Lin ZW, Dai WL, Lai QQ, Wu H. Deep learning-based computed tomography applied to the diagnosis of rib fractures. JOURNAL OF RADIATION RESEARCH AND APPLIED SCIENCES 2023. [DOI: 10.1016/j.jrras.2023.100558] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/16/2023]
|
9
|
Lex JR, Di Michele J, Koucheki R, Pincus D, Whyne C, Ravi B. Artificial Intelligence for Hip Fracture Detection and Outcome Prediction: A Systematic Review and Meta-analysis. JAMA Netw Open 2023; 6:e233391. [PMID: 36930153 PMCID: PMC10024206 DOI: 10.1001/jamanetworkopen.2023.3391] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 03/18/2023] Open
Abstract
IMPORTANCE Artificial intelligence (AI) enables powerful models for establishment of clinical diagnostic and prognostic tools for hip fractures; however the performance and potential impact of these newly developed algorithms are currently unknown. OBJECTIVE To evaluate the performance of AI algorithms designed to diagnose hip fractures on radiographs and predict postoperative clinical outcomes following hip fracture surgery relative to current practices. DATA SOURCES A systematic review of the literature was performed using the MEDLINE, Embase, and Cochrane Library databases for all articles published from database inception to January 23, 2023. A manual reference search of included articles was also undertaken to identify any additional relevant articles. STUDY SELECTION Studies developing machine learning (ML) models for the diagnosis of hip fractures from hip or pelvic radiographs or to predict any postoperative patient outcome following hip fracture surgery were included. DATA EXTRACTION AND SYNTHESIS This study followed the Preferred Reporting Items for Systematic Reviews and Meta-analyses and was registered with PROSPERO. Eligible full-text articles were evaluated and relevant data extracted independently using a template data extraction form. For studies that predicted postoperative outcomes, the performance of traditional predictive statistical models, either multivariable logistic or linear regression, was recorded and compared with the performance of the best ML model on the same out-of-sample data set. MAIN OUTCOMES AND MEASURES Diagnostic accuracy of AI models was compared with the diagnostic accuracy of expert clinicians using odds ratios (ORs) with 95% CIs. Areas under the curve for postoperative outcome prediction between traditional statistical models (multivariable linear or logistic regression) and ML models were compared. RESULTS Of 39 studies that met all criteria and were included in this analysis, 18 (46.2%) used AI models to diagnose hip fractures on plain radiographs and 21 (53.8%) used AI models to predict patient outcomes following hip fracture surgery. A total of 39 598 plain radiographs and 714 939 hip fractures were used for training, validating, and testing ML models specific to diagnosis and postoperative outcome prediction, respectively. Mortality and length of hospital stay were the most predicted outcomes. On pooled data analysis, compared with clinicians, the OR for diagnostic error of ML models was 0.79 (95% CI, 0.48-1.31; P = .36; I2 = 60%) for hip fracture radiographs. For the ML models, the mean (SD) sensitivity was 89.3% (8.5%), specificity was 87.5% (9.9%), and F1 score was 0.90 (0.06). The mean area under the curve for mortality prediction was 0.84 with ML models compared with 0.79 for alternative controls (P = .09). CONCLUSIONS AND RELEVANCE The findings of this systematic review and meta-analysis suggest that the potential applications of AI to aid with diagnosis from hip radiographs are promising. The performance of AI in diagnosing hip fractures was comparable with that of expert radiologists and surgeons. However, current implementations of AI for outcome prediction do not seem to provide substantial benefit over traditional multivariable predictive statistics.
Collapse
Affiliation(s)
- Johnathan R. Lex
- Division of Orthopaedic Surgery, Department of Surgery, University of Toronto, Toronto, Ontario, Canada
- Institute of Biomedical Engineering, University of Toronto, Toronto, Ontario, Canada
- Orthopaedics Biomechanics Laboratory, Sunnybrook Research Institute, Toronto, Ontario, Canada
| | - Joseph Di Michele
- Division of Orthopaedic Surgery, Department of Surgery, University of Toronto, Toronto, Ontario, Canada
| | - Robert Koucheki
- Institute of Biomedical Engineering, University of Toronto, Toronto, Ontario, Canada
- Temerty Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
| | - Daniel Pincus
- Division of Orthopaedic Surgery, Department of Surgery, University of Toronto, Toronto, Ontario, Canada
- Division of Orthopaedic Surgery, Sunnybrook Health Sciences Centre, Toronto, Ontario, Canada
| | - Cari Whyne
- Orthopaedics Biomechanics Laboratory, Sunnybrook Research Institute, Toronto, Ontario, Canada
| | - Bheeshma Ravi
- Division of Orthopaedic Surgery, Department of Surgery, University of Toronto, Toronto, Ontario, Canada
- Division of Orthopaedic Surgery, Sunnybrook Health Sciences Centre, Toronto, Ontario, Canada
| |
Collapse
|
10
|
Yang L, Gao S, Li P, Shi J, Zhou F. Recognition and Segmentation of Individual Bone Fragments with a Deep Learning Approach in CT Scans of Complex Intertrochanteric Fractures: A Retrospective Study. J Digit Imaging 2022; 35:1681-1689. [PMID: 35711073 PMCID: PMC9712885 DOI: 10.1007/s10278-022-00669-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2021] [Revised: 05/04/2022] [Accepted: 06/07/2022] [Indexed: 10/18/2022] Open
Abstract
The characteristics of bone fragments are the main influencing factors for the choice of treatment in intertrochanteric fractures. This study aimed to develop a deep learning algorithm for recognizing and segmenting individual fragments in CT images of complex intertrochanteric fractures for orthopedic surgeons. This study was based on 160 hip CT scans (43,510 images) of complex fractures of three types based on the Evans-Jensen classification (40 cases of type 3 (IIA) fractures, 80 cases of type 4 (IIB)fractures, and 40 cases of type 5 (III)fractures) retrospectively. The images were randomly split into two groups to construct a training set of 120 CT scans (32,045 images) and a testing set of 40 CT scans (11,465 images). A deep learning model was built into a cascaded architecture composed by a convolutional neural network (CNN) for location of the fracture ROI and another CNN for recognition and segmentation of individual fragments within the ROI. The accuracy of object detection and dice coefficient of segmentation of individual fragments were used to evaluate model performance. The model yielded an average accuracy of 89.4% for individual fragment recognition and an average dice coefficient of 90.5% for segmentation in CT images. The results demonstrated the feasibility of recognition and segmentation of individual fragments in complex intertrochanteric fractures with a deep learning approach. Altogether, these promising results suggest the potential of our model to be applied to many clinical scenarios.
Collapse
Affiliation(s)
- Lv Yang
- Department of Orthopedics, Peking University Third Hospital, Beijing, China
| | - Shan Gao
- Department of Orthopedics, Peking University Third Hospital, Beijing, China
| | - Pengfei Li
- Department of Orthopedics, Peking University Third Hospital, Beijing, China
| | - Jiancheng Shi
- Department of Radiology, Peking University Third Hospital, Yanqing Hospital, Beijing, China
| | - Fang Zhou
- Department of Orthopedics, Peking University Third Hospital, Beijing, China.
| |
Collapse
|
11
|
Cha Y, Kim JT, Park CH, Kim JW, Lee SY, Yoo JI. Artificial intelligence and machine learning on diagnosis and classification of hip fracture: systematic review. J Orthop Surg Res 2022; 17:520. [PMID: 36456982 PMCID: PMC9714164 DOI: 10.1186/s13018-022-03408-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/21/2022] [Accepted: 11/16/2022] [Indexed: 12/05/2022] Open
Abstract
BACKGROUND In the emergency room, clinicians spend a lot of time and are exposed to mental stress. In addition, fracture classification is important for determining the surgical method and restoring the patient's mobility. Recently, with the help of computers using artificial intelligence (AI) or machine learning (ML), diagnosis and classification of hip fractures can be performed easily and quickly. The purpose of this systematic review is to search for studies that diagnose and classify for hip fracture using AI or ML, organize the results of each study, analyze the usefulness of this technology and its future use value. METHODS PubMed Central, OVID Medline, Cochrane Collaboration Library, Web of Science, EMBASE, and AHRQ databases were searched to identify relevant studies published up to June 2022 with English language restriction. The following search terms were used [All Fields] AND (", "[MeSH Terms] OR (""[All Fields] AND "bone"[All Fields]) OR "bone fractures"[All Fields] OR "fracture"[All Fields]). The following information was extracted from the included articles: authors, publication year, study period, type of image, type of fracture, number of patient or used images, fracture classification, reference diagnosis of fracture diagnosis and classification, and augments of each studies. In addition, AI name, CNN architecture type, ROI or important region labeling, data input proportion in training/validation/test, and diagnosis accuracy/AUC, classification accuracy/AUC of each studies were also extracted. RESULTS In 14 finally included studies, the accuracy of diagnosis for hip fracture by AI was 79.3-98%, and the accuracy of fracture diagnosis in AI aided humans was 90.5-97.1. The accuracy of human fracture diagnosis was 77.5-93.5. AUC of fracture diagnosis by AI was 0.905-0.99. The accuracy of fracture classification by AI was 86-98.5 and AUC was 0.873-1.0. The forest plot represented that the mean AI diagnosis accuracy was 0.92, the mean AI diagnosis AUC was 0.969, the mean AI classification accuracy was 0.914, and the mean AI classification AUC was 0.933. Among the included studies, the architecture based on the GoogLeNet architectural model or the DenseNet architectural model was the most common with three each. Among the data input proportions, the study with the lowest training rate was 57%, and the study with the highest training rate was 95%. In 14 studies, 5 studies used Grad-CAM for highlight important regions. CONCLUSION We expected that our study may be helpful in making judgments about the use of AI in the diagnosis and classification of hip fractures. It is clear that AI is a tool that can help medical staff reduce the time and effort required for hip fracture diagnosis with high accuracy. Further studies are needed to determine what effect this causes in actual clinical situations.
Collapse
Affiliation(s)
- Yonghan Cha
- grid.411061.30000 0004 0647 205XDepartment of Orthopedic Surgery, Eulji University Hospital, Daejeon, Korea
| | - Jung-Taek Kim
- grid.251916.80000 0004 0532 3933Department of Orthopedic Surgery, Ajou Medical Center, Ajou University School of Medicine, Suwon, Korea
| | - Chan-Ho Park
- Department of Orthopedic Surgery, Yonsei 100 Percent Hospital, Incheon, Korea
| | - Jin-Woo Kim
- grid.255588.70000 0004 1798 4296Department of Orthopaedic Surgery, Nowon Eulji Medical Center, Eulji University, Seoul, Korea
| | - Sang Yeob Lee
- grid.411899.c0000 0004 0624 2502Department of Biomedical Research Institute, Gyeongsang National University Hospital, Jinju, South Korea
| | - Jun-Il Yoo
- grid.411899.c0000 0004 0624 2502Department of Orthopaedic Surgery, Gyeongsang National University Hospital, 90 Chilamdong, Jinju, Gyeongnamdo 660-702 Republic of Korea
| |
Collapse
|
12
|
D'Angelo T, Caudo D, Blandino A, Albrecht MH, Vogl TJ, Gruenewald LD, Gaeta M, Yel I, Koch V, Martin SS, Lenga L, Muscogiuri G, Sironi S, Mazziotti S, Booz C. Artificial intelligence, machine learning and deep learning in musculoskeletal imaging: Current applications. JOURNAL OF CLINICAL ULTRASOUND : JCU 2022; 50:1414-1431. [PMID: 36069404 DOI: 10.1002/jcu.23321] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/15/2022] [Revised: 08/18/2022] [Accepted: 08/20/2022] [Indexed: 06/15/2023]
Abstract
Artificial intelligence is rapidly expanding in all technological fields. The medical field, and especially diagnostic imaging, has been showing the highest developmental potential. Artificial intelligence aims at human intelligence simulation through the management of complex problems. This review describes the technical background of artificial intelligence, machine learning, and deep learning. The first section illustrates the general potential of artificial intelligence applications in the context of request management, data acquisition, image reconstruction, archiving, and communication systems. In the second section, the prospective of dedicated tools for segmentation, lesion detection, automatic diagnosis, and classification of musculoskeletal disorders is discussed.
Collapse
Affiliation(s)
- Tommaso D'Angelo
- Department of Biomedical Sciences and Morphological and Functional Imaging, University Hospital Messina, Messina, Italy
- Department of Radiology and Nuclear Medicine, Rotterdam, Netherlands
| | - Danilo Caudo
- Department of Biomedical Sciences and Morphological and Functional Imaging, University Hospital Messina, Messina, Italy
- Department or Radiology, IRRCS Centro Neurolesi "Bonino Pulejo", Messina, Italy
| | - Alfredo Blandino
- Department of Biomedical Sciences and Morphological and Functional Imaging, University Hospital Messina, Messina, Italy
| | - Moritz H Albrecht
- Division of Experimental Imaging, Department of Diagnostic and Interventional Radiology, University Hospital Frankfurt, Frankfurt am Main, Germany
| | - Thomas J Vogl
- Department of Diagnostic and Interventional Radiology, University Hospital Frankfurt, Frankfurt am Main, Germany
| | - Leon D Gruenewald
- Division of Experimental Imaging, Department of Diagnostic and Interventional Radiology, University Hospital Frankfurt, Frankfurt am Main, Germany
| | - Michele Gaeta
- Department of Biomedical Sciences and Morphological and Functional Imaging, University Hospital Messina, Messina, Italy
| | - Ibrahim Yel
- Division of Experimental Imaging, Department of Diagnostic and Interventional Radiology, University Hospital Frankfurt, Frankfurt am Main, Germany
| | - Vitali Koch
- Division of Experimental Imaging, Department of Diagnostic and Interventional Radiology, University Hospital Frankfurt, Frankfurt am Main, Germany
| | - Simon S Martin
- Division of Experimental Imaging, Department of Diagnostic and Interventional Radiology, University Hospital Frankfurt, Frankfurt am Main, Germany
| | - Lukas Lenga
- Division of Experimental Imaging, Department of Diagnostic and Interventional Radiology, University Hospital Frankfurt, Frankfurt am Main, Germany
| | - Giuseppe Muscogiuri
- School of Medicine and Surgery, University of Milano-Bicocca, Milan, Italy
- Department of Radiology, IRCCS Istituto Auxologico Italiano, San Luca Hospital, Milan, Italy
| | - Sandro Sironi
- School of Medicine and Surgery, University of Milano-Bicocca, Milan, Italy
- Department of Radiology, ASST Papa Giovanni XXIII Hospital, Bergamo, Italy
| | - Silvio Mazziotti
- Department of Biomedical Sciences and Morphological and Functional Imaging, University Hospital Messina, Messina, Italy
| | - Christian Booz
- Division of Experimental Imaging, Department of Diagnostic and Interventional Radiology, University Hospital Frankfurt, Frankfurt am Main, Germany
| |
Collapse
|
13
|
Twinprai N, Boonrod A, Boonrod A, Chindaprasirt J, Sirithanaphol W, Chindaprasirt P, Twinprai P. Artificial intelligence (AI) vs. human in hip fracture detection. Heliyon 2022; 8:e11266. [PMID: 36339768 PMCID: PMC9634369 DOI: 10.1016/j.heliyon.2022.e11266] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2022] [Revised: 04/26/2022] [Accepted: 10/20/2022] [Indexed: 12/03/2022] Open
Abstract
Objective This study aimed to assess the diagnostic accuracy and sensitivity of a YOLOv4-tiny AI model for detecting and classifying hip fractures types. Materials and methods In this retrospective study, a dataset of 1000 hip and pelvic radiographs was divided into a training set consisting of 450 fracture and 450 normal images (900 images total) and a testing set consisting of 50 fracture and 50 normal images (100 images total). The training set images were each manually augmented with a bounding box drawn around each hip, and each bounding box was manually labeled either (1) normal, (2) femoral neck fracture, (3) intertrochanteric fracture, or (4) subtrochanteric fracture. Next, a deep convolutional neural network YOLOv4-tiny AI model was trained using the augmented training set images, and then model performance was evaluated with the testing set images. Human doctors then evaluated the same testing set images, and the performances of the model and doctors were compared. The testing set contained no crossover data. Results The resulting output images revealed that the AI model produced bounding boxes around each hip region and classified the fracture and normal hip regions with a sensitivity of 96.2%, specificity of 94.6%, and an accuracy of 95%. The human doctors performed with a sensitivity ranging from 69.2 to 96.2%. Compared with human doctors, the detection rate sensitivity of the model was significantly better than a general practitioner and first-year residents and equivalent to specialist doctors. Conclusions This model showed hip fracture detection sensitivity comparable to well-trained radiologists and orthopedists and classified hip fractures highly accurately.
Collapse
Affiliation(s)
- Nattaphon Twinprai
- Trauma Unit, Department of Orthopedics, Srinagarind Hospital, Khon Kaen University, Thailand
| | - Artit Boonrod
- Sport Unit, Department of Orthopedics, Srinagarind Hospital, Khon Kaen University, Thailand
| | - Arunnit Boonrod
- Neurology Unit, Department of Radiology, Srinagarind Hospital, Khon Kaen University, Thailand
| | - Jarin Chindaprasirt
- Department of Internal Medicine, Srinagarind Hospital, Khon Kaen University, Thailand
| | | | - Prinya Chindaprasirt
- Sustainable Infrastructure Research and Development Center, Department of Civil Engineering, Faculty of Engineering, Khon Kaen University, Thailand
| | - Prin Twinprai
- Musculoskeletal Unit, Department of Radiology, Srinagarind Hospital, Khon Kaen University, Thailand
- Corresponding author.
| |
Collapse
|
14
|
Hayashi D, Kompel AJ, Ventre J, Ducarouge A, Nguyen T, Regnard NE, Guermazi A. Automated detection of acute appendicular skeletal fractures in pediatric patients using deep learning. Skeletal Radiol 2022; 51:2129-2139. [PMID: 35522332 DOI: 10.1007/s00256-022-04070-0] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/16/2022] [Revised: 04/28/2022] [Accepted: 04/28/2022] [Indexed: 02/02/2023]
Abstract
OBJECTIVE We aimed to perform an external validation of an existing commercial AI software program (BoneView™) for the detection of acute appendicular fractures in pediatric patients. MATERIALS AND METHODS In our retrospective study, anonymized radiographic exams of extremities, with or without fractures, from pediatric patients (aged 2-21) were included. Three hundred exams (150 with fractures and 150 without fractures) were included, comprising 60 exams per body part (hand/wrist, elbow/upper arm, shoulder/clavicle, foot/ankle, leg/knee). The Ground Truth was defined by experienced radiologists. A deep learning algorithm interpreted the radiographs for fracture detection, and its diagnostic performance was compared against the Ground Truth, and receiver operating characteristic analysis was done. Statistical analyses included sensitivity per patient (the proportion of patients for whom all fractures were identified) and sensitivity per fracture (the proportion of fractures identified by the AI among all fractures), specificity per patient, and false-positive rate per patient. RESULTS There were 167 boys and 133 girls with a mean age of 10.8 years. For all fractures, sensitivity per patient (average [95% confidence interval]) was 91.3% [85.6, 95.3], specificity per patient was 90.0% [84.0,94.3], sensitivity per fracture was 92.5% [87.0, 96.2], and false-positive rate per patient in patients who had no fracture was 0.11. The patient-wise area under the curve was 0.93 for all fractures. AI diagnostic performance was consistently high across all anatomical locations and different types of fractures except for avulsion fractures (sensitivity per fracture 72.7% [39.0, 94.0]). CONCLUSION The BoneView™ deep learning algorithm provides high overall diagnostic performance for appendicular fracture detection in pediatric patients.
Collapse
Affiliation(s)
- Daichi Hayashi
- Department of Radiology, Boston University School of Medicine, 820 Harrison Avenue, FGH Building, 3rd Floor, Boston, MA, 02118, USA. .,Department of Radiology, Stony Brook University Renaissance School of Medicine, HSc Level 4, Room 120, Stony Brook, NY, 11794, USA.
| | - Andrew J Kompel
- Department of Radiology, Boston University School of Medicine, 820 Harrison Avenue, FGH Building, 3rd Floor, Boston, MA, 02118, USA
| | - Jeanne Ventre
- Gleamer, 117-119 Quai de Valmy, 75010, Paris, France
| | | | - Toan Nguyen
- Gleamer, 117-119 Quai de Valmy, 75010, Paris, France.,Service de Radiopédiatrie, Hôpital Armand-Trousseau, AP-HP, Médecine Sorbonne Université, 26 avenue du Docteur Arnold-Netter, 75012, Paris, France
| | - Nor-Eddine Regnard
- Gleamer, 117-119 Quai de Valmy, 75010, Paris, France.,Réseau d'Imagerie Sud Francilien, 2 avenue de Mousseau, 91000, Evry, France
| | - Ali Guermazi
- Department of Radiology, Boston University School of Medicine, 820 Harrison Avenue, FGH Building, 3rd Floor, Boston, MA, 02118, USA.,Department of Radiology, VA Boston Healthcare System, 1400 VFW Parkway, Suite 1B105, West Roxbury, MA, 02132, USA
| |
Collapse
|
15
|
Meena T, Roy S. Bone Fracture Detection Using Deep Supervised Learning from Radiological Images: A Paradigm Shift. Diagnostics (Basel) 2022; 12:diagnostics12102420. [PMID: 36292109 PMCID: PMC9600559 DOI: 10.3390/diagnostics12102420] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2022] [Revised: 10/04/2022] [Accepted: 10/05/2022] [Indexed: 01/16/2023] Open
Abstract
Bone diseases are common and can result in various musculoskeletal conditions (MC). An estimated 1.71 billion patients suffer from musculoskeletal problems worldwide. Apart from musculoskeletal fractures, femoral neck injuries, knee osteoarthritis, and fractures are very common bone diseases, and the rate is expected to double in the next 30 years. Therefore, proper and timely diagnosis and treatment of a fractured patient are crucial. Contrastingly, missed fractures are a common prognosis failure in accidents and emergencies. This causes complications and delays in patients’ treatment and care. These days, artificial intelligence (AI) and, more specifically, deep learning (DL) are receiving significant attention to assist radiologists in bone fracture detection. DL can be widely used in medical image analysis. Some studies in traumatology and orthopaedics have shown the use and potential of DL in diagnosing fractures and diseases from radiographs. In this systematic review, we provide an overview of the use of DL in bone imaging to help radiologists to detect various abnormalities, particularly fractures. We have also discussed the challenges and problems faced in the DL-based method, and the future of DL in bone imaging.
Collapse
|
16
|
Artificial Intelligence in Orthopedic Radiography Analysis: A Narrative Review. Diagnostics (Basel) 2022; 12:diagnostics12092235. [PMID: 36140636 PMCID: PMC9498096 DOI: 10.3390/diagnostics12092235] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2022] [Revised: 09/12/2022] [Accepted: 09/13/2022] [Indexed: 11/17/2022] Open
Abstract
Artificial intelligence (AI) in medicine is a rapidly growing field. In orthopedics, the clinical implementations of AI have not yet reached their full potential. Deep learning algorithms have shown promising results in computed radiographs for fracture detection, classification of OA, bone age, as well as automated measurements of the lower extremities. Studies investigating the performance of AI compared to trained human readers often show equal or better results, although human validation is indispensable at the current standards. The objective of this narrative review is to give an overview of AI in medicine and summarize the current applications of AI in orthopedic radiography imaging. Due to the different AI software and study design, it is difficult to find a clear structure in this field. To produce more homogeneous studies, open-source access to AI software codes and a consensus on study design should be aimed for.
Collapse
|
17
|
Zhang X, Yang Y, Shen YW, Zhang KR, Jiang ZK, Ma LT, Ding C, Wang BY, Meng Y, Liu H. Diagnostic accuracy and potential covariates of artificial intelligence for diagnosing orthopedic fractures: a systematic literature review and meta-analysis. Eur Radiol 2022; 32:7196-7216. [PMID: 35754091 DOI: 10.1007/s00330-022-08956-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2022] [Revised: 05/07/2022] [Accepted: 06/08/2022] [Indexed: 02/05/2023]
Abstract
OBJECTIVES To systematically quantify the diagnostic accuracy and identify potential covariates affecting the performance of artificial intelligence (AI) in diagnosing orthopedic fractures. METHODS PubMed, Embase, Web of Science, and Cochrane Library were systematically searched for studies on AI applications in diagnosing orthopedic fractures from inception to September 29, 2021. Pooled sensitivity and specificity and the area under the receiver operating characteristic curves (AUC) were obtained. This study was registered in the PROSPERO database prior to initiation (CRD 42021254618). RESULTS Thirty-nine were eligible for quantitative analysis. The overall pooled AUC, sensitivity, and specificity were 0.96 (95% CI 0.94-0.98), 90% (95% CI 87-92%), and 92% (95% CI 90-94%), respectively. In subgroup analyses, multicenter designed studies yielded higher sensitivity (92% vs. 88%) and specificity (94% vs. 91%) than single-center studies. AI demonstrated higher sensitivity with transfer learning (with vs. without: 92% vs. 87%) or data augmentation (with vs. without: 92% vs. 87%), compared to those without. Utilizing plain X-rays as input images for AI achieved results comparable to CT (AUC 0.96 vs. 0.96). Moreover, AI achieved comparable results to humans (AUC 0.97 vs. 0.97) and better results than non-expert human readers (AUC 0.98 vs. 0.96; sensitivity 95% vs. 88%). CONCLUSIONS AI demonstrated high accuracy in diagnosing orthopedic fractures from medical images. Larger-scale studies with higher design quality are needed to validate our findings. KEY POINTS • Multicenter study design, application of transfer learning, and data augmentation are closely related to improving the performance of artificial intelligence models in diagnosing orthopedic fractures. • Utilizing plain X-rays as input images for AI to diagnose fractures achieved results comparable to CT (AUC 0.96 vs. 0.96). • AI achieved comparable results to humans (AUC 0.97 vs. 0.97) but was superior to non-expert human readers (AUC 0.98 vs. 0.96, sensitivity 95% vs. 88%) in diagnosing fractures.
Collapse
Affiliation(s)
- Xiang Zhang
- Department of Orthopedics, Orthopedic Research Institute, West China Hospital, Sichuan University, No. 37 Guo Xue Rd, Chengdu, 610041, China
| | - Yi Yang
- Department of Orthopedics, Orthopedic Research Institute, West China Hospital, Sichuan University, No. 37 Guo Xue Rd, Chengdu, 610041, China
| | - Yi-Wei Shen
- Department of Orthopedics, Orthopedic Research Institute, West China Hospital, Sichuan University, No. 37 Guo Xue Rd, Chengdu, 610041, China
| | - Ke-Rui Zhang
- Department of Orthopedics, Orthopedic Research Institute, West China Hospital, Sichuan University, No. 37 Guo Xue Rd, Chengdu, 610041, China
| | - Ze-Kun Jiang
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, 610000, China
| | - Li-Tai Ma
- Department of Orthopedics, Orthopedic Research Institute, West China Hospital, Sichuan University, No. 37 Guo Xue Rd, Chengdu, 610041, China
| | - Chen Ding
- Department of Orthopedics, Orthopedic Research Institute, West China Hospital, Sichuan University, No. 37 Guo Xue Rd, Chengdu, 610041, China
| | - Bei-Yu Wang
- Department of Orthopedics, Orthopedic Research Institute, West China Hospital, Sichuan University, No. 37 Guo Xue Rd, Chengdu, 610041, China
| | - Yang Meng
- Department of Orthopedics, Orthopedic Research Institute, West China Hospital, Sichuan University, No. 37 Guo Xue Rd, Chengdu, 610041, China
| | - Hao Liu
- Department of Orthopedics, Orthopedic Research Institute, West China Hospital, Sichuan University, No. 37 Guo Xue Rd, Chengdu, 610041, China.
| |
Collapse
|
18
|
Wang Y, Li Y, Lin G, Zhang Q, Zhong J, Zhang Y, Ma K, Zheng Y, Lu G, Zhang Z. Lower-extremity fatigue fracture detection and grading based on deep learning models of radiographs. Eur Radiol 2022; 33:555-565. [PMID: 35748901 DOI: 10.1007/s00330-022-08950-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2022] [Revised: 05/18/2022] [Accepted: 06/08/2022] [Indexed: 11/04/2022]
Abstract
OBJECTIVES To identify the feasibility of deep learning-based diagnostic models for detecting and assessing lower-extremity fatigue fracture severity on plain radiographs. METHODS This retrospective study enrolled 1151 X-ray images (tibiofibula/foot: 682/469) of fatigue fractures and 2842 X-ray images (tibiofibula/foot: 2000/842) without abnormal presentations from two clinical centers. After labeling the lesions, images in a center (tibiofibula/foot: 2539/1180) were allocated at 7:1:2 for model construction, and the remaining images from another center (tibiofibula/foot: 143/131) for external validation. A ResNet-50 and a triplet branch network were adopted to construct diagnostic models for detecting and grading. The performances of detection models were evaluated with sensitivity, specificity, and area under the receiver operating characteristic curve (AUC), while grading models were evaluated with accuracy by confusion matrix. Visual estimations by radiologists were performed for comparisons with models. RESULTS For the detection model on tibiofibula, a sensitivity of 95.4%/85.5%, a specificity of 80.1%/77.0%, and an AUC of 0.965/0.877 were achieved in the internal testing/external validation set. The detection model on foot reached a sensitivity of 96.4%/90.8%, a specificity of 76.0%/66.7%, and an AUC of 0.947/0.911. The detection models showed superior performance to the junior radiologist, comparable to the intermediate or senior radiologist. The overall accuracy of the diagnostic model was 78.5%/62.9% for tibiofibula and 74.7%/61.1% for foot in the internal testing/external validation set. CONCLUSIONS The deep learning-based models could be applied to the radiological diagnosis of plain radiographs for assisting in the detection and grading of fatigue fractures on tibiofibula and foot. KEY POINTS • Fatigue fractures on radiographs are relatively difficult to detect, and apt to be misdiagnosed. • Detection and grading models based on deep learning were constructed on a large cohort of radiographs with lower-extremity fatigue fractures. • The detection model with high sensitivity would help to reduce the misdiagnosis of lower-extremity fatigue fractures.
Collapse
Affiliation(s)
- Yanping Wang
- Department of Diagnostic Radiology, Jinling Hospital, Medical School of Nanjing University, 305 East Zhongshan Rd, Nanjing, 210002, China
| | | | - Guang Lin
- Department of Diagnostic Radiology, Jinling Hospital, Medical School of Nanjing University, 305 East Zhongshan Rd, Nanjing, 210002, China
| | - Qirui Zhang
- Department of Diagnostic Radiology, Jinling Hospital, Medical School of Nanjing University, 305 East Zhongshan Rd, Nanjing, 210002, China
| | - Jing Zhong
- Department of Diagnostic Radiology, Jinling Hospital, Medical School of Nanjing University, 305 East Zhongshan Rd, Nanjing, 210002, China
| | - Yan Zhang
- Department of Radiology, Nanjing Qinhuai Medical Area, Jinling Hospital, 210002, Nanjing, China
| | - Kai Ma
- Tencent Jarvis Lab, Shenzhen, 518000, China
| | | | - Guangming Lu
- Department of Diagnostic Radiology, Jinling Hospital, Medical School of Nanjing University, 305 East Zhongshan Rd, Nanjing, 210002, China.,State Key Laboratory of Analytical Chemistry for Life Science, Nanjing University, Nanjing, 210093, China
| | - Zhiqiang Zhang
- Department of Diagnostic Radiology, Jinling Hospital, Medical School of Nanjing University, 305 East Zhongshan Rd, Nanjing, 210002, China. .,State Key Laboratory of Analytical Chemistry for Life Science, Nanjing University, Nanjing, 210093, China.
| |
Collapse
|
19
|
Kuo RYL, Harrison C, Curran TA, Jones B, Freethy A, Cussons D, Stewart M, Collins GS, Furniss D. Artificial Intelligence in Fracture Detection: A Systematic Review and Meta-Analysis. Radiology 2022; 304:50-62. [PMID: 35348381 DOI: 10.1148/radiol.211785] [Citation(s) in RCA: 60] [Impact Index Per Article: 30.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Background Patients with fractures are a common emergency presentation and may be misdiagnosed at radiologic imaging. An increasing number of studies apply artificial intelligence (AI) techniques to fracture detection as an adjunct to clinician diagnosis. Purpose To perform a systematic review and meta-analysis comparing the diagnostic performance in fracture detection between AI and clinicians in peer-reviewed publications and the gray literature (ie, articles published on preprint repositories). Materials and Methods A search of multiple electronic databases between January 2018 and July 2020 (updated June 2021) was performed that included any primary research studies that developed and/or validated AI for the purposes of fracture detection at any imaging modality and excluded studies that evaluated image segmentation algorithms. Meta-analysis with a hierarchical model to calculate pooled sensitivity and specificity was used. Risk of bias was assessed by using a modified Prediction Model Study Risk of Bias Assessment Tool, or PROBAST, checklist. Results Included for analysis were 42 studies, with 115 contingency tables extracted from 32 studies (55 061 images). Thirty-seven studies identified fractures on radiographs and five studies identified fractures on CT images. For internal validation test sets, the pooled sensitivity was 92% (95% CI: 88, 93) for AI and 91% (95% CI: 85, 95) for clinicians, and the pooled specificity was 91% (95% CI: 88, 93) for AI and 92% (95% CI: 89, 92) for clinicians. For external validation test sets, the pooled sensitivity was 91% (95% CI: 84, 95) for AI and 94% (95% CI: 90, 96) for clinicians, and the pooled specificity was 91% (95% CI: 81, 95) for AI and 94% (95% CI: 91, 95) for clinicians. There were no statistically significant differences between clinician and AI performance. There were 22 of 42 (52%) studies that were judged to have high risk of bias. Meta-regression identified multiple sources of heterogeneity in the data, including risk of bias and fracture type. Conclusion Artificial intelligence (AI) and clinicians had comparable reported diagnostic performance in fracture detection, suggesting that AI technology holds promise as a diagnostic adjunct in future clinical practice. Clinical trial registration no. CRD42020186641 © RSNA, 2022 Online supplemental material is available for this article. See also the editorial by Cohen and McInnes in this issue.
Collapse
Affiliation(s)
- Rachel Y L Kuo
- From the Nuffield Department of Orthopedics, Rheumatology and Musculoskeletal Sciences, Botnar Research Centre, Old Road Headington, Oxford OX3 7LD, UK (R.Y.L.K., C.H., M.S., G.S.C., D.F.); Department of Plastic Surgery, John Radcliffe Hospital, Oxford, UK (T.A.C., A.F.); Department of Vascular Surgery, Royal Berkshire Hospital, Reading, UK (B.J.); Department of Plastic Surgery, Stoke Mandeville Hospital, Aylesbury, Buckinghamshire UK (D.C.); and UK EQUATOR Center, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford Centre for Statistics in Medicine, Oxford UK (G.S.C.)
| | - Conrad Harrison
- From the Nuffield Department of Orthopedics, Rheumatology and Musculoskeletal Sciences, Botnar Research Centre, Old Road Headington, Oxford OX3 7LD, UK (R.Y.L.K., C.H., M.S., G.S.C., D.F.); Department of Plastic Surgery, John Radcliffe Hospital, Oxford, UK (T.A.C., A.F.); Department of Vascular Surgery, Royal Berkshire Hospital, Reading, UK (B.J.); Department of Plastic Surgery, Stoke Mandeville Hospital, Aylesbury, Buckinghamshire UK (D.C.); and UK EQUATOR Center, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford Centre for Statistics in Medicine, Oxford UK (G.S.C.)
| | - Terry-Ann Curran
- From the Nuffield Department of Orthopedics, Rheumatology and Musculoskeletal Sciences, Botnar Research Centre, Old Road Headington, Oxford OX3 7LD, UK (R.Y.L.K., C.H., M.S., G.S.C., D.F.); Department of Plastic Surgery, John Radcliffe Hospital, Oxford, UK (T.A.C., A.F.); Department of Vascular Surgery, Royal Berkshire Hospital, Reading, UK (B.J.); Department of Plastic Surgery, Stoke Mandeville Hospital, Aylesbury, Buckinghamshire UK (D.C.); and UK EQUATOR Center, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford Centre for Statistics in Medicine, Oxford UK (G.S.C.)
| | - Benjamin Jones
- From the Nuffield Department of Orthopedics, Rheumatology and Musculoskeletal Sciences, Botnar Research Centre, Old Road Headington, Oxford OX3 7LD, UK (R.Y.L.K., C.H., M.S., G.S.C., D.F.); Department of Plastic Surgery, John Radcliffe Hospital, Oxford, UK (T.A.C., A.F.); Department of Vascular Surgery, Royal Berkshire Hospital, Reading, UK (B.J.); Department of Plastic Surgery, Stoke Mandeville Hospital, Aylesbury, Buckinghamshire UK (D.C.); and UK EQUATOR Center, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford Centre for Statistics in Medicine, Oxford UK (G.S.C.)
| | - Alexander Freethy
- From the Nuffield Department of Orthopedics, Rheumatology and Musculoskeletal Sciences, Botnar Research Centre, Old Road Headington, Oxford OX3 7LD, UK (R.Y.L.K., C.H., M.S., G.S.C., D.F.); Department of Plastic Surgery, John Radcliffe Hospital, Oxford, UK (T.A.C., A.F.); Department of Vascular Surgery, Royal Berkshire Hospital, Reading, UK (B.J.); Department of Plastic Surgery, Stoke Mandeville Hospital, Aylesbury, Buckinghamshire UK (D.C.); and UK EQUATOR Center, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford Centre for Statistics in Medicine, Oxford UK (G.S.C.)
| | - David Cussons
- From the Nuffield Department of Orthopedics, Rheumatology and Musculoskeletal Sciences, Botnar Research Centre, Old Road Headington, Oxford OX3 7LD, UK (R.Y.L.K., C.H., M.S., G.S.C., D.F.); Department of Plastic Surgery, John Radcliffe Hospital, Oxford, UK (T.A.C., A.F.); Department of Vascular Surgery, Royal Berkshire Hospital, Reading, UK (B.J.); Department of Plastic Surgery, Stoke Mandeville Hospital, Aylesbury, Buckinghamshire UK (D.C.); and UK EQUATOR Center, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford Centre for Statistics in Medicine, Oxford UK (G.S.C.)
| | - Max Stewart
- From the Nuffield Department of Orthopedics, Rheumatology and Musculoskeletal Sciences, Botnar Research Centre, Old Road Headington, Oxford OX3 7LD, UK (R.Y.L.K., C.H., M.S., G.S.C., D.F.); Department of Plastic Surgery, John Radcliffe Hospital, Oxford, UK (T.A.C., A.F.); Department of Vascular Surgery, Royal Berkshire Hospital, Reading, UK (B.J.); Department of Plastic Surgery, Stoke Mandeville Hospital, Aylesbury, Buckinghamshire UK (D.C.); and UK EQUATOR Center, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford Centre for Statistics in Medicine, Oxford UK (G.S.C.)
| | - Gary S Collins
- From the Nuffield Department of Orthopedics, Rheumatology and Musculoskeletal Sciences, Botnar Research Centre, Old Road Headington, Oxford OX3 7LD, UK (R.Y.L.K., C.H., M.S., G.S.C., D.F.); Department of Plastic Surgery, John Radcliffe Hospital, Oxford, UK (T.A.C., A.F.); Department of Vascular Surgery, Royal Berkshire Hospital, Reading, UK (B.J.); Department of Plastic Surgery, Stoke Mandeville Hospital, Aylesbury, Buckinghamshire UK (D.C.); and UK EQUATOR Center, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford Centre for Statistics in Medicine, Oxford UK (G.S.C.)
| | - Dominic Furniss
- From the Nuffield Department of Orthopedics, Rheumatology and Musculoskeletal Sciences, Botnar Research Centre, Old Road Headington, Oxford OX3 7LD, UK (R.Y.L.K., C.H., M.S., G.S.C., D.F.); Department of Plastic Surgery, John Radcliffe Hospital, Oxford, UK (T.A.C., A.F.); Department of Vascular Surgery, Royal Berkshire Hospital, Reading, UK (B.J.); Department of Plastic Surgery, Stoke Mandeville Hospital, Aylesbury, Buckinghamshire UK (D.C.); and UK EQUATOR Center, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford Centre for Statistics in Medicine, Oxford UK (G.S.C.)
| |
Collapse
|
20
|
Artificial intelligence in musculoskeletal imaging: a perspective on value propositions, clinical use, and obstacles. Skeletal Radiol 2022; 51:239-243. [PMID: 33983500 DOI: 10.1007/s00256-021-03802-y] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/18/2021] [Revised: 04/25/2021] [Accepted: 04/25/2021] [Indexed: 02/08/2023]
Abstract
Artificial intelligence and deep learning (DL) offer musculoskeletal radiology exciting possibilities in multiple areas, including image reconstruction and transformation, tissue segmentation, workflow support, and disease detection. Novel DL-based image reconstruction algorithms correcting aliasing artifacts, signal loss, and noise amplification with previously unobtainable effectiveness are prime examples of how DL algorithms deliver promised value propositions in musculoskeletal radiology. The speed of DL-based tissue segmentation promises great efficiency gains that may permit the inclusion of tissue compositional-based information routinely into radiology reports. Similarly, DL algorithms give rise to a myriad of opportunities for workflow improvements, including intelligent and adaptive hanging protocols, speech recognition, report generation, scheduling, precertification, and billing. The value propositions of disease-detecting DL algorithms include reduced error rates and increased productivity. However, more studies using authentic clinical workflow settings are necessary to fully understand the value of DL algorithms for disease detection in clinical practice. Successful workflow integration and management of multiple algorithms are critical for translating the value propositions of DL algorithms into clinical practice but represent a major roadblock for which solutions are critically needed. While there is no consensus about the most sustainable business model, radiology departments will need to carefully weigh the benefits and disadvantages of each commercially available DL algorithm. Although more studies are needed to understand the value and impact of DL algorithms on clinical practice, DL technology will likely play an important role in the future of musculoskeletal imaging.
Collapse
|
21
|
Offiah AC. Current and emerging artificial intelligence applications for pediatric musculoskeletal radiology. Pediatr Radiol 2022; 52:2149-2158. [PMID: 34272573 PMCID: PMC9537230 DOI: 10.1007/s00247-021-05130-8] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/16/2021] [Revised: 04/28/2021] [Accepted: 06/10/2021] [Indexed: 12/03/2022]
Abstract
Artificial intelligence (AI) is playing an ever-increasing role in radiology (more so in the adult world than in pediatrics), to the extent that there are unfounded fears it will completely take over the role of the radiologist. In relation to musculoskeletal applications of AI in pediatric radiology, we are far from the time when AI will replace radiologists; even for the commonest application (bone age assessment), AI is more often employed in an AI-assist mode rather than an AI-replace or AI-extend mode. AI for bone age assessment has been in clinical use for more than a decade and is the area in which most research has been conducted. Most other potential indications in children (such as appendicular and vertebral fracture detection) remain largely in the research domain. This article reviews the areas in which AI is most prominent in relation to the pediatric musculoskeletal system, briefly summarizing the current literature and highlighting areas for future research. Pediatric radiologists are encouraged to participate as members of the research teams conducting pediatric radiology artificial intelligence research.
Collapse
Affiliation(s)
- Amaka C. Offiah
- grid.11835.3e0000 0004 1936 9262Department of Oncology and Metabolism, University of Sheffield, Damer Street Building, Sheffield, S10 2TH UK ,grid.419127.80000 0004 0463 9178Department of Radiology, Sheffield Children’s NHS Foundation Trust, Sheffield, UK
| |
Collapse
|
22
|
Lembersky Z, de Swarte M, Aisa J, Johnson K, Zhu X, Hespel AM. Repeatability and accuracy of a novel, quantitative radiographic method for differentiating normal canine sacroiliac joints from joints with subluxation or luxation: Pilot study. Vet Radiol Ultrasound 2021; 63:148-155. [PMID: 34870358 DOI: 10.1111/vru.13045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2021] [Revised: 08/23/2021] [Accepted: 08/30/2021] [Indexed: 11/30/2022] Open
Abstract
Canine sacroiliac (SI) joint subluxation or luxation is most commonly diagnosed based on qualitative radiographic assessments. Aims of this two-part, retrospective, diagnostic accuracy, pilot study were to develop and evaluate a novel quantitative method based on measuring the angle between a line connecting the iliac wings and parallel lines across three anatomical landmarks (cranial endplate of L7, caudal endplate of L6, cranial endplate of L6) on a single ventrodorsal radiograph. For the first part of the study, angle measurements from a single observer were compared for 20 normal canine pelvic radiographs and 20 pelvic radiographs with SI luxation or subluxation. Mean values significantly differed between datasets (P < 0.001). The angles for the normal pelves ranged from 0.6° to 1.5°, while abnormal angles ranged from 3.8° to 7.1°. For the second part of the study, a dataset of 25 normal and 25 abnormal canine pelvic radiographs was evaluated using the novel technique by three blinded readers with varying levels of expertise at two different time points. There was excellent reliability among the three readers with an intraclass correlation (ICC) value of 0.90 and an excellent agreement between day 0 and day 30 readings with an ICC value of 0.91. It was also determined that a cut-off angle of 2.0°, using the line parallel to the cranial endplate of L6, provided overall the best accuracy, sensitivity, and specificity to differentiate normal versus abnormal pelves. These findings may be helpful for clinical cases with equivocal diagnoses and for future development of automated diagnostic tools.
Collapse
Affiliation(s)
- Zachary Lembersky
- Department of Small Animal Clinical Sciences, University of Tennessee, Knoxville, Tennessee, USA
| | - Marie de Swarte
- Department of Small Animal Clinical Sciences, University of Tennessee, Knoxville, Tennessee, USA
| | - Josep Aisa
- Department of Small Animal Clinical Sciences, University of Tennessee, Knoxville, Tennessee, USA
| | - Kryssa Johnson
- Department of Small Animal Clinical Sciences, University of Tennessee, Knoxville, Tennessee, USA
| | - Xiaojuan Zhu
- Office of Information Technology, The University of Tennessee, Knoxville, Tennessee, USA
| | - Adrien-Maxence Hespel
- Department of Small Animal Clinical Sciences, University of Tennessee, Knoxville, Tennessee, USA
| |
Collapse
|
23
|
Kong SH, Shin CS. Applications of Machine Learning in Bone and Mineral Research. Endocrinol Metab (Seoul) 2021; 36:928-937. [PMID: 34674509 PMCID: PMC8566132 DOI: 10.3803/enm.2021.1111] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/20/2021] [Revised: 08/23/2021] [Accepted: 09/09/2021] [Indexed: 12/26/2022] Open
Abstract
In this unprecedented era of the overwhelming volume of medical data, machine learning can be a promising tool that may shed light on an individualized approach and a better understanding of the disease in the field of osteoporosis research, similar to that in other research fields. This review aimed to provide an overview of the latest studies using machine learning to address issues, mainly focusing on osteoporosis and fractures. Machine learning models for diagnosing and classifying osteoporosis and detecting fractures from images have shown promising performance. Fracture risk prediction is another promising field of research, and studies are being conducted using various data sources. However, these approaches may be biased due to the nature of the techniques or the quality of the data. Therefore, more studies based on the proposed guidelines are needed to improve the technical feasibility and generalizability of artificial intelligence algorithms.
Collapse
Affiliation(s)
- Sung Hye Kong
- Department of Internal Medicine, Seoul National University College of Medicine, Seoul,
Korea
- Department of Internal Medicine, Seoul National University Bundang Hospital, Seongnam,
Korea
| | - Chan Soo Shin
- Department of Internal Medicine, Seoul National University College of Medicine, Seoul,
Korea
- Department of Internal Medicine, Seoul National University Hospital, Seoul,
Korea
| |
Collapse
|
24
|
Meng XH, Wu DJ, Wang Z, Ma XL, Dong XM, Liu AE, Chen L. A fully automated rib fracture detection system on chest CT images and its impact on radiologist performance. Skeletal Radiol 2021; 50:1821-1828. [PMID: 33599801 DOI: 10.1007/s00256-021-03709-8] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/29/2020] [Revised: 01/04/2021] [Accepted: 01/04/2021] [Indexed: 02/02/2023]
Abstract
OBJECTIVE To compare rib fracture detection and classification by radiologists using CT images with and without a deep learning model. MATERIALS AND METHODS A total of 8529 chest CT images were collected from multiple hospitals for training the deep learning model. The test dataset included 300 chest CT images acquired using a single CT scanner. The rib fractures were marked in the bone window on each CT slice by experienced radiologists, and the ground truth included 861 rib fractures. We proposed a heterogeneous neural network for rib fracture detection and classification consisting of a cascaded feature pyramid network and a classification network. The deep learning-based model was evaluated based on the external testing data. The precision rate, recall rate, F1-score, and diagnostic time of two junior radiologists with and without the deep learning model were computed, and the Chi-square, one-way analysis of variance, and least significant difference tests were used to analyze the results. RESULTS The use of the deep learning model increased detection recall and classification accuracy (0.922 and 0.863) compared with the radiologists alone (0.812 vs. 0.850). The radiologists achieved a higher precision rate, recall rate, and F1-score for fracture detection when using the deep learning model, at 0.943, 0.978, and 0.960, respectively. When using the deep learning model, the radiologist's reading time was decreased from 158.3 ± 35.7 s to 42.3 ± 6.8 s. CONCLUSION Radiologists achieved the highest performance in diagnosing and classifying rib fractures on CT images when assisted by the deep learning model.
Collapse
Affiliation(s)
- Xiang Hong Meng
- Department of Radiology, Tianjin Hospital, Tianjin, 300211, China
| | - Di Jia Wu
- Shanghai United Imaging Intelligence Co.,Ltd., Shanghai, 201210, China
| | - Zhi Wang
- Department of Radiology, Tianjin Hospital, Tianjin, 300211, China
| | - Xin Long Ma
- Department of Orthopedics, Tianjin Hospital, Jiefangnan Road, Hexi District, Tianjin, 300211, China.
| | - Xiao Man Dong
- Department of Radiology, Tianjin Hospital, Tianjin, 300211, China
| | - Ai E Liu
- Shanghai United Imaging Intelligence Co.,Ltd., Shanghai, 201210, China
| | - Lei Chen
- Shanghai United Imaging Intelligence Co.,Ltd., Shanghai, 201210, China
| |
Collapse
|
25
|
Smets J, Shevroja E, Hügle T, Leslie WD, Hans D. Machine Learning Solutions for Osteoporosis-A Review. J Bone Miner Res 2021; 36:833-851. [PMID: 33751686 DOI: 10.1002/jbmr.4292] [Citation(s) in RCA: 64] [Impact Index Per Article: 21.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/28/2020] [Revised: 02/04/2021] [Accepted: 03/16/2021] [Indexed: 12/11/2022]
Abstract
Osteoporosis and its clinical consequence, bone fracture, is a multifactorial disease that has been the object of extensive research. Recent advances in machine learning (ML) have enabled the field of artificial intelligence (AI) to make impressive breakthroughs in complex data environments where human capacity to identify high-dimensional relationships is limited. The field of osteoporosis is one such domain, notwithstanding technical and clinical concerns regarding the application of ML methods. This qualitative review is intended to outline some of these concerns and to inform stakeholders interested in applying AI for improved management of osteoporosis. A systemic search in PubMed and Web of Science resulted in 89 studies for inclusion in the review. These covered one or more of four main areas in osteoporosis management: bone properties assessment (n = 13), osteoporosis classification (n = 34), fracture detection (n = 32), and risk prediction (n = 14). Reporting and methodological quality was determined by means of a 12-point checklist. In general, the studies were of moderate quality with a wide range (mode score 6, range 2 to 11). Major limitations were identified in a significant number of studies. Incomplete reporting, especially over model selection, inadequate splitting of data, and the low proportion of studies with external validation were among the most frequent problems. However, the use of images for opportunistic osteoporosis diagnosis or fracture detection emerged as a promising approach and one of the main contributions that ML could bring to the osteoporosis field. Efforts to develop ML-based models for identifying novel fracture risk factors and improving fracture prediction are additional promising lines of research. Some studies also offered insights into the potential for model-based decision-making. Finally, to avoid some of the common pitfalls, the use of standardized checklists in developing and sharing the results of ML models should be encouraged. © 2021 American Society for Bone and Mineral Research (ASBMR).
Collapse
Affiliation(s)
- Julien Smets
- Center of Bone Diseases, Bone and Joint Department, Lausanne University Hospital, Lausanne, Switzerland
| | - Enisa Shevroja
- Center of Bone Diseases, Bone and Joint Department, Lausanne University Hospital, Lausanne, Switzerland
| | - Thomas Hügle
- Department of Rheumatology, Lausanne University Hospital, Lausanne, Switzerland
| | | | - Didier Hans
- Center of Bone Diseases, Bone and Joint Department, Lausanne University Hospital, Lausanne, Switzerland
| |
Collapse
|
26
|
Tobler P, Cyriac J, Kovacs BK, Hofmann V, Sexauer R, Paciolla F, Stieltjes B, Amsler F, Hirschmann A. AI-based detection and classification of distal radius fractures using low-effort data labeling: evaluation of applicability and effect of training set size. Eur Radiol 2021; 31:6816-6824. [PMID: 33742228 PMCID: PMC8379111 DOI: 10.1007/s00330-021-07811-2] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2020] [Accepted: 02/18/2021] [Indexed: 12/12/2022]
Abstract
Objectives To evaluate the performance of a deep convolutional neural network (DCNN) in detecting and classifying distal radius fractures, metal, and cast on radiographs using labels based on radiology reports. The secondary aim was to evaluate the effect of the training set size on the algorithm’s performance. Methods A total of 15,775 frontal and lateral radiographs, corresponding radiology reports, and a ResNet18 DCNN were used. Fracture detection and classification models were developed per view and merged. Incrementally sized subsets served to evaluate effects of the training set size. Two musculoskeletal radiologists set the standard of reference on radiographs (test set A). A subset (B) was rated by three radiology residents. For a per-study-based comparison with the radiology residents, the results of the best models were merged. Statistics used were ROC and AUC, Youden’s J statistic (J), and Spearman’s correlation coefficient (ρ). Results The models’ AUC/J on (A) for metal and cast were 0.99/0.98 and 1.0/1.0. The models’ and residents’ AUC/J on (B) were similar on fracture (0.98/0.91; 0.98/0.92) and multiple fragments (0.85/0.58; 0.91/0.70). Training set size and AUC correlated on metal (ρ = 0.740), cast (ρ = 0.722), fracture (frontal ρ = 0.947, lateral ρ = 0.946), multiple fragments (frontal ρ = 0.856), and fragment displacement (frontal ρ = 0.595). Conclusions The models trained on a DCNN with report-based labels to detect distal radius fractures on radiographs are suitable to aid as a secondary reading tool; models for fracture classification are not ready for clinical use. Bigger training sets lead to better models in all categories except joint affection. Key Points • Detection of metal and cast on radiographs is excellent using AI and labels extracted from radiology reports. • Automatic detection of distal radius fractures on radiographs is feasible and the performance approximates radiology residents. • Automatic classification of the type of distal radius fracture varies in accuracy and is inferior for joint involvement and fragment displacement.
Collapse
Affiliation(s)
- Patrick Tobler
- University Hospital Basel, University of Basel, Clinic of Radiology and Nuclear Medicine, University of Basel, Petersgraben 4, 4031, Basel, Switzerland
| | - Joshy Cyriac
- University Hospital Basel, University of Basel, Clinic of Radiology and Nuclear Medicine, University of Basel, Petersgraben 4, 4031, Basel, Switzerland
| | - Balazs K Kovacs
- University Hospital Basel, University of Basel, Clinic of Radiology and Nuclear Medicine, University of Basel, Petersgraben 4, 4031, Basel, Switzerland
| | - Verena Hofmann
- University Hospital Basel, University of Basel, Clinic of Radiology and Nuclear Medicine, University of Basel, Petersgraben 4, 4031, Basel, Switzerland
| | - Raphael Sexauer
- University Hospital Basel, University of Basel, Clinic of Radiology and Nuclear Medicine, University of Basel, Petersgraben 4, 4031, Basel, Switzerland
| | - Fabiano Paciolla
- University Hospital Basel, University of Basel, Clinic of Radiology and Nuclear Medicine, University of Basel, Petersgraben 4, 4031, Basel, Switzerland
| | - Bram Stieltjes
- University Hospital Basel, University of Basel, Clinic of Radiology and Nuclear Medicine, University of Basel, Petersgraben 4, 4031, Basel, Switzerland
| | - Felix Amsler
- Amsler Consulting Basel, Gundeldingerrain 111, 4059, Basel, Switzerland
| | - Anna Hirschmann
- University Hospital Basel, University of Basel, Clinic of Radiology and Nuclear Medicine, University of Basel, Petersgraben 4, 4031, Basel, Switzerland.
| |
Collapse
|
27
|
Yang M, Zheng Y, Xie Z, Wang Z, Xiao J, Zhang J, Yuan Y. A deep learning model for diagnosing dystrophinopathies on thigh muscle MRI images. BMC Neurol 2021; 21:13. [PMID: 33430797 PMCID: PMC7798322 DOI: 10.1186/s12883-020-02036-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2020] [Accepted: 12/23/2020] [Indexed: 12/31/2022] Open
Abstract
Background Dystrophinopathies are the most common type of inherited muscular diseases. Muscle biopsy and genetic tests are effective to diagnose the disease but cost much more than primary hospitals can reach. The more available muscle MRI is promising but its diagnostic results highly depends on doctors’ experiences. This study intends to explore a way of deploying a deep learning model for muscle MRI images to diagnose dystrophinopathies. Methods This study collected 2536 T1WI images from 432 cases who had been diagnosed by genetic analysis and/or muscle biopsy, including 148 cases with dystrophinopathies and 284 cases with other diseases. The data was randomly divided into three sets: the data from 233 cases were used to train the CNN model, the data from 97 cases for the validation experiments, and the data from 102 cases for the test experiments. We also validated our models expertise at diagnosing by comparing the model’s results on the 102 cases with those of three skilled radiologists. Results The proposed model achieved 91% (95% CI: 0.88, 0.93) accuracy on the test set, higher than the best accuracy of 84% in radiologists. It also performed better than the skilled radiologists in sensitivity : sensitivities of the models and the doctors were 0.89 (95% CI: 0.85 0.93) versus 0.79 (95% CI:0.73, 0.84; p = 0.190). Conclusions The deep model achieved excellent accuracy and sensitivity in identifying cases with dystrophinopathies. The comparable performance of the model and skilled radiologists demonstrates the potential application of the model in diagnosing dystrophinopathies through MRI images.
Collapse
Affiliation(s)
- Mei Yang
- Department of Neurology, Peking University First Hospital, Beijing, China.,Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
| | - Yiming Zheng
- Department of Neurology, Peking University First Hospital, Beijing, China
| | - Zhiying Xie
- Department of Neurology, Peking University First Hospital, Beijing, China
| | - Zhaoxia Wang
- Department of Neurology, Peking University First Hospital, Beijing, China
| | - Jiangxi Xiao
- Department of Radiology, Peking University First Hospital, Beijing, China
| | - Jue Zhang
- Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
| | - Yun Yuan
- Department of Neurology, Peking University First Hospital, Beijing, China.
| |
Collapse
|
28
|
Diagnostic accuracy of deep learning in orthopaedic fractures: a systematic review and meta-analysis. Clin Radiol 2020; 75:713.e17-713.e28. [DOI: 10.1016/j.crad.2020.05.021] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2020] [Accepted: 05/20/2020] [Indexed: 02/07/2023]
|
29
|
Gorelik N, Gyftopoulos S. Applications of Artificial Intelligence in Musculoskeletal Imaging: From the Request to the Report. Can Assoc Radiol J 2020; 72:45-59. [PMID: 32809857 DOI: 10.1177/0846537120947148] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Artificial intelligence (AI) will transform every step in the imaging value chain, including interpretive and noninterpretive components. Radiologists should familiarize themselves with AI developments to become leaders in their clinical implementation. This article explores the impact of AI through the entire imaging cycle of musculoskeletal radiology, from the placement of the requisition to the generation of the report, with an added Canadian perspective. Noninterpretive tasks which may be assisted by AI include the ordering of appropriate imaging tests, automatic exam protocoling, optimized scheduling, shorter magnetic resonance imaging acquisition time, computed tomography imaging with reduced artifact and radiation dose, and new methods of generation and utilization of radiology reports. Applications of AI for image interpretation consist of the determination of bone age, body composition measurements, screening for osteoporosis, identification of fractures, evaluation of segmental spine pathology, detection and temporal monitoring of osseous metastases, diagnosis of primary bone and soft tissue tumors, and grading of osteoarthritis.
Collapse
Affiliation(s)
- Natalia Gorelik
- Department of Diagnostic Radiology, 54473McGill University Health Center, Montreal, Quebec, Canada
| | - Soterios Gyftopoulos
- Department of Radiology, 12297NYU Langone Medical Center/NYU Langone Orthopedic Center, New York, NY, USA.,Department of Orthopedic Surgery, 12297NYU Langone Medical Center/NYU Langone Orthopedic Center, New York, NY, USA
| |
Collapse
|
30
|
Adams SJ, Henderson RDE, Yi X, Babyn P. Artificial Intelligence Solutions for Analysis of X-ray Images. Can Assoc Radiol J 2020; 72:60-72. [PMID: 32757950 DOI: 10.1177/0846537120941671] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Artificial intelligence (AI) presents a key opportunity for radiologists to improve quality of care and enhance the value of radiology in patient care and population health. The potential opportunity of AI to aid in triage and interpretation of conventional radiographs (X-ray images) is particularly significant, as radiographs are the most common imaging examinations performed in most radiology departments. Substantial progress has been made in the past few years in the development of AI algorithms for analysis of chest and musculoskeletal (MSK) radiographs, with deep learning now the dominant approach for image analysis. Large public and proprietary image data sets have been compiled and have aided the development of AI algorithms for analysis of radiographs, many of which demonstrate accuracy equivalent to radiologists for specific, focused tasks. This article describes (1) the basis for the development of AI solutions for radiograph analysis, (2) current AI solutions to aid in the triage and interpretation of chest radiographs and MSK radiographs, (3) opportunities for AI to aid in noninterpretive tasks related to radiographs, and (4) considerations for radiology practices selecting AI solutions for radiograph analysis and integrating them into existing IT systems. Although comprehensive AI solutions across modalities have yet to be developed, institutions can begin to select and integrate focused solutions which increase efficiency, increase quality and patient safety, and add value for their patients.
Collapse
Affiliation(s)
- Scott J Adams
- Department of Medical Imaging, Royal University Hospital, 7235University of Saskatchewan, Saskatoon, Canada
| | - Robert D E Henderson
- Department of Medical Imaging, Royal University Hospital, 7235University of Saskatchewan, Saskatoon, Canada
| | - Xin Yi
- Department of Medical Imaging, Royal University Hospital, 7235University of Saskatchewan, Saskatoon, Canada
| | - Paul Babyn
- Department of Medical Imaging, Royal University Hospital, 7235University of Saskatchewan, Saskatoon, Canada
| |
Collapse
|