1
|
Gatineau G, Shevroja E, Vendrami C, Gonzalez-Rodriguez E, Leslie WD, Lamy O, Hans D. Development and reporting of artificial intelligence in osteoporosis management. J Bone Miner Res 2024; 39:1553-1573. [PMID: 39163489 PMCID: PMC11523092 DOI: 10.1093/jbmr/zjae131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/04/2023] [Revised: 07/17/2024] [Accepted: 08/01/2024] [Indexed: 08/22/2024]
Abstract
An abundance of medical data and enhanced computational power have led to a surge in artificial intelligence (AI) applications. Published studies involving AI in bone and osteoporosis research have increased exponentially, raising the need for transparent model development and reporting strategies. This review offers a comprehensive overview and systematic quality assessment of AI articles in osteoporosis while highlighting recent advancements. A systematic search in the PubMed database, from December 17, 2020 to February 1, 2023 was conducted to identify AI articles that relate to osteoporosis. The quality assessment of the studies relied on the systematic evaluation of 12 quality items derived from the minimum information about clinical artificial intelligence modeling checklist. The systematic search yielded 97 articles that fell into 5 areas; bone properties assessment (11 articles), osteoporosis classification (26 articles), fracture detection/classification (25 articles), risk prediction (24 articles), and bone segmentation (11 articles). The average quality score for each study area was 8.9 (range: 7-11) for bone properties assessment, 7.8 (range: 5-11) for osteoporosis classification, 8.4 (range: 7-11) for fracture detection, 7.6 (range: 4-11) for risk prediction, and 9.0 (range: 6-11) for bone segmentation. A sixth area, AI-driven clinical decision support, identified the studies from the 5 preceding areas that aimed to improve clinician efficiency, diagnostic accuracy, and patient outcomes through AI-driven models and opportunistic screening by automating or assisting with specific clinical tasks in complex scenarios. The current work highlights disparities in study quality and a lack of standardized reporting practices. Despite these limitations, a wide range of models and examination strategies have shown promising outcomes to aid in the earlier diagnosis and improve clinical decision-making. Through careful consideration of sources of bias in model performance assessment, the field can build confidence in AI-based approaches, ultimately leading to improved clinical workflows and patient outcomes.
Collapse
Affiliation(s)
- Guillaume Gatineau
- Interdisciplinary Center of Bone Diseases, Rheumatology Unit, Bone and Joint Department, Lausanne University Hospital and University of Lausanne, Av. Pierre-Decker 4, 1011 Lausanne, Switzerland
| | - Enisa Shevroja
- Interdisciplinary Center of Bone Diseases, Rheumatology Unit, Bone and Joint Department, Lausanne University Hospital and University of Lausanne, Av. Pierre-Decker 4, 1011 Lausanne, Switzerland
| | - Colin Vendrami
- Interdisciplinary Center of Bone Diseases, Rheumatology Unit, Bone and Joint Department, Lausanne University Hospital and University of Lausanne, Av. Pierre-Decker 4, 1011 Lausanne, Switzerland
| | - Elena Gonzalez-Rodriguez
- Interdisciplinary Center of Bone Diseases, Rheumatology Unit, Bone and Joint Department, Lausanne University Hospital and University of Lausanne, Av. Pierre-Decker 4, 1011 Lausanne, Switzerland
| | - William D Leslie
- Department of Medicine, University of Manitoba, Winnipeg, MB R3T 2N2, Canada
| | - Olivier Lamy
- Internal Medicine Unit, Internal Medicine Department, Lausanne University Hospital and University of Lausanne, 1005 Lausanne, Switzerland
| | - Didier Hans
- Interdisciplinary Center of Bone Diseases, Rheumatology Unit, Bone and Joint Department, Lausanne University Hospital and University of Lausanne, Av. Pierre-Decker 4, 1011 Lausanne, Switzerland
| |
Collapse
|
2
|
Ramadanov N, Lettner J, Hable R, Hakam HT, Prill R, Dimitrov D, Becker R, Schreyer AG, Salzmann M. Artificial Intelligence-Guided Assessment of Femoral Neck Fractures in Radiographs: A Systematic Review and Multilevel Meta-Analysis. Orthop Surg 2024. [PMID: 39334556 DOI: 10.1111/os.14250] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/14/2024] [Revised: 08/28/2024] [Accepted: 09/01/2024] [Indexed: 09/30/2024] Open
Abstract
Artificial Intelligence (AI) is a dynamic area of computer science that is constantly expanding its practical benefits in various fields. The aim of this study was to analyze AI-guided radiological assessment of femoral neck fractures by performing a systematic review and multilevel meta-analysis of primary studies. The study protocol was registered in the International Prospective Register of Systematic Reviews (PROSPERO) on May 21, 2024 [CRD42024541055]. The updated Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) guidelines were strictly followed. A systematic literature search of PubMed, Web of Science, Ovid (Med), and Epistemonikos databases was conducted until May 31, 2024. Critical appraisal using the Quality Assessment of Diagnostic Accuracy Studies-2 (QUADAS-2) tool showed that the overall quality of the included studies was moderate. In addition, publication bias was presented in funnel plots. A frequentist multilevel meta-analysis was performed using a random effects model with inverse variance and restricted maximum likelihood heterogeneity estimator with Hartung-Knapp adjustment. The accuracy between AI-based and human assessment of femoral neck fractures, sensitivity and specificity with 95% confidence intervals (CIs) were calculated. Study heterogeneity was assessed using the Higgins test I2 (low heterogeneity <25%, moderate heterogeneity: 25%-75%, and high heterogeneity >75%). Finally, 11 studies with a total of 21,163 radiographs were included for meta-analysis. The results of the study quality assessment using the QUADAS-2 tool are presented in Table 2. The funnel plots indicated a moderate publication bias. The AI showed excellent accuracy in assessment of femoral neck fractures (Accuracy = 0.91, 95% CI 0.83 to 0.96; I2 = 99%; p < 0.01). The AI showed good sensitivity in assessment of femoral neck fractures (Sensitivity = 0.87, 95% CI 0.77 to 0.93; I2 = 98%; p < 0.01). The AI showed excellent specificity in assessment of femoral neck fractures (Specificity = 0.91, 95% CI 0.77 to 0.97; I2 = 97%; p < 0.01). AI-guided radiological assessment of femoral neck fractures showed excellent accuracy and specificity as well as good sensitivity. The use of AI as a faster and more reliable assessment tool and as an aid in radiological routine seems justified.
Collapse
Affiliation(s)
- Nikolai Ramadanov
- Center of Orthopaedics and Traumatology, Brandenburg Medical School, University Hospital Brandenburg an der Havel, Brandenburg an der Havel, Germany
- Faculty of Health Science Brandenburg, Brandenburg Medical School Theodor Fontane, Brandenburg an der Havel, Germany
| | - Jonathan Lettner
- Center of Orthopaedics and Traumatology, Brandenburg Medical School, University Hospital Brandenburg an der Havel, Brandenburg an der Havel, Germany
- Faculty of Health Science Brandenburg, Brandenburg Medical School Theodor Fontane, Brandenburg an der Havel, Germany
| | - Robert Hable
- Faculty of Applied Computer Science, Deggendorf Institute of Technology, Deggendorf, Germany
| | - Hassan Tarek Hakam
- Center of Orthopaedics and Traumatology, Brandenburg Medical School, University Hospital Brandenburg an der Havel, Brandenburg an der Havel, Germany
- Faculty of Health Science Brandenburg, Brandenburg Medical School Theodor Fontane, Brandenburg an der Havel, Germany
| | - Robert Prill
- Center of Orthopaedics and Traumatology, Brandenburg Medical School, University Hospital Brandenburg an der Havel, Brandenburg an der Havel, Germany
- Faculty of Health Science Brandenburg, Brandenburg Medical School Theodor Fontane, Brandenburg an der Havel, Germany
| | - Dobromir Dimitrov
- Department of Surgical Propedeutics, Faculty of Medicine, Medical University of Pleven, Pleven, Bulgaria
| | - Roland Becker
- Center of Orthopaedics and Traumatology, Brandenburg Medical School, University Hospital Brandenburg an der Havel, Brandenburg an der Havel, Germany
- Faculty of Health Science Brandenburg, Brandenburg Medical School Theodor Fontane, Brandenburg an der Havel, Germany
| | - Andreas G Schreyer
- Institute for Diagnostic and Interventional Radiology, Brandenburg Medical School Theodor Fontane, Brandenburg an der Havel, Germany
| | - Mikhail Salzmann
- Center of Orthopaedics and Traumatology, Brandenburg Medical School, University Hospital Brandenburg an der Havel, Brandenburg an der Havel, Germany
- Faculty of Health Science Brandenburg, Brandenburg Medical School Theodor Fontane, Brandenburg an der Havel, Germany
| |
Collapse
|
3
|
Ruitenbeek HC, Oei EHG, Visser JJ, Kijowski R. Artificial intelligence in musculoskeletal imaging: realistic clinical applications in the next decade. Skeletal Radiol 2024; 53:1849-1868. [PMID: 38902420 DOI: 10.1007/s00256-024-04684-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Revised: 04/06/2024] [Accepted: 04/15/2024] [Indexed: 06/22/2024]
Abstract
This article will provide a perspective review of the most extensively investigated deep learning (DL) applications for musculoskeletal disease detection that have the best potential to translate into routine clinical practice over the next decade. Deep learning methods for detecting fractures, estimating pediatric bone age, calculating bone measurements such as lower extremity alignment and Cobb angle, and grading osteoarthritis on radiographs have been shown to have high diagnostic performance with many of these applications now commercially available for use in clinical practice. Many studies have also documented the feasibility of using DL methods for detecting joint pathology and characterizing bone tumors on magnetic resonance imaging (MRI). However, musculoskeletal disease detection on MRI is difficult as it requires multi-task, multi-class detection of complex abnormalities on multiple image slices with different tissue contrasts. The generalizability of DL methods for musculoskeletal disease detection on MRI is also challenging due to fluctuations in image quality caused by the wide variety of scanners and pulse sequences used in routine MRI protocols. The diagnostic performance of current DL methods for musculoskeletal disease detection must be further evaluated in well-designed prospective studies using large image datasets acquired at different institutions with different imaging parameters and imaging hardware before they can be fully implemented in clinical practice. Future studies must also investigate the true clinical benefits of current DL methods and determine whether they could enhance quality, reduce error rates, improve workflow, and decrease radiologist fatigue and burnout with all of this weighed against the costs.
Collapse
Affiliation(s)
- Huibert C Ruitenbeek
- Department of Radiology and Nuclear Medicine, Erasmus MC, University Medical Center, P.O. Box 2040, 3000 CA, Rotterdam, The Netherlands
| | - Edwin H G Oei
- Department of Radiology and Nuclear Medicine, Erasmus MC, University Medical Center, P.O. Box 2040, 3000 CA, Rotterdam, The Netherlands
| | - Jacob J Visser
- Department of Radiology and Nuclear Medicine, Erasmus MC, University Medical Center, P.O. Box 2040, 3000 CA, Rotterdam, The Netherlands
| | - Richard Kijowski
- Department of Radiology, New York University Grossman School of Medicine, 660 First Avenue, 3rd Floor, New York, NY, 10016, USA.
| |
Collapse
|
4
|
Fathi M, Eshraghi R, Behzad S, Tavasol A, Bahrami A, Tafazolimoghadam A, Bhatt V, Ghadimi D, Gholamrezanezhad A. Potential strength and weakness of artificial intelligence integration in emergency radiology: a review of diagnostic utilizations and applications in patient care optimization. Emerg Radiol 2024:10.1007/s10140-024-02278-2. [PMID: 39190230 DOI: 10.1007/s10140-024-02278-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2024] [Accepted: 08/08/2024] [Indexed: 08/28/2024]
Abstract
Artificial intelligence (AI) and its recent increasing healthcare integration has created both new opportunities and challenges in the practice of radiology and medical imaging. Recent advancements in AI technology have allowed for more workplace efficiency, higher diagnostic accuracy, and overall improvements in patient care. Limitations of AI such as data imbalances, the unclear nature of AI algorithms, and the challenges in detecting certain diseases make it difficult for its widespread adoption. This review article presents cases involving the use of AI models to diagnose intracranial hemorrhage, spinal fractures, and rib fractures, while discussing how certain factors like, type, location, size, presence of artifacts, calcification, and post-surgical changes, affect AI model performance and accuracy. While the use of artificial intelligence has the potential to improve the practice of emergency radiology, it is important to address its limitations to maximize its advantages while ensuring the safety of patients overall.
Collapse
Affiliation(s)
- Mobina Fathi
- Advanced Diagnostic and Interventional Radiology Research Center (ADIR), Tehran University of Medical Sciences, Tehran, Iran
- School of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Reza Eshraghi
- Student Research Committee, Kashan University of Medical Science, Kashan, Iran
| | | | - Arian Tavasol
- School of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Ashkan Bahrami
- Student Research Committee, Kashan University of Medical Science, Kashan, Iran
| | | | - Vivek Bhatt
- School of Medicine, University of California, Riverside, CA, USA
| | - Delaram Ghadimi
- School of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Ali Gholamrezanezhad
- Keck School of Medicine of University of Southern California, Los Angeles, CA, USA.
- Department of Radiology, Division of Emergency Radiology, Keck School of Medicine, Cedars Sinai Hospital, University of Southern California, 1500 San Pablo Street, Los Angeles, CA, 90033, USA.
| |
Collapse
|
5
|
Nowroozi A, Salehi MA, Shobeiri P, Agahi S, Momtazmanesh S, Kaviani P, Kalra MK. Artificial intelligence diagnostic accuracy in fracture detection from plain radiographs and comparing it with clinicians: a systematic review and meta-analysis. Clin Radiol 2024; 79:579-588. [PMID: 38772766 DOI: 10.1016/j.crad.2024.04.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2024] [Revised: 04/09/2024] [Accepted: 04/15/2024] [Indexed: 05/23/2024]
Abstract
PURPOSE Fracture detection is one of the most commonly used and studied aspects of artificial intelligence (AI) in medicine. In this systematic review and meta-analysis, we aimed to summarize available literature and data regarding AI performance in fracture detection on plain radiographs and various factors affecting it. METHODS We systematically reviewed studies evaluating AI algorithms in detecting bone fractures in plain radiographs, combined their performance using meta-analysis (a bivariate regression approach), and compared it with that of clinicians. We also analyzed the factors potentially affecting algorithm performance using meta-regression. RESULTS Our analysis included 100 studies. In 83 studies with confusion matrices, AI algorithms showed a sensitivity of 91.43% and a specificity of 92.12% (Area under the summary receiver operator curve = 0.968). After adjustment and false discovery rate correction, tibia/fibula (excluding ankle) fractures were associated with higher (7.0%, p=0.004) AI sensitivity, while more recent publications (5.5%, p=0.003) and Xception architecture (6.6%, p<0.001) were associated with higher specificity. Clinicians and AI showed similar specificity in fracture identification, although AI leaned to higher sensitivity (7.6%, p=0.07). Radiologists, on the other hand, were more specific than AI overall and in several subgroups, and more sensitive to hip fractures before FDR correction. CONCLUSIONS Currently available AI aids could result in a significant improvement in care where radiologists are not readily available. Moreover, identifying factors affecting algorithm performance could guide AI development teams in their process of optimizing their products.
Collapse
Affiliation(s)
- A Nowroozi
- School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
| | - M A Salehi
- School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
| | - P Shobeiri
- School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
| | - S Agahi
- School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
| | - S Momtazmanesh
- School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
| | - P Kaviani
- Department of Radiology, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA
| | - M K Kalra
- Department of Radiology, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA.
| |
Collapse
|
6
|
Zhu X, Liu D, Liu L, Guo J, Li Z, Zhao Y, Wu T, Liu K, Liu X, Pan X, Qi L, Zhang Y, Cheng L, Chen B. Fully Automatic Deep Learning Model for Spine Refracture in Patients with OVCF: A Multi-Center Study. Orthop Surg 2024; 16:2052-2065. [PMID: 38952050 PMCID: PMC11293932 DOI: 10.1111/os.14155] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/11/2024] [Revised: 06/06/2024] [Accepted: 06/09/2024] [Indexed: 07/03/2024] Open
Abstract
BACKGROUND The reaserch of artificial intelligence (AI) model for predicting spinal refracture is limited to bone mineral density, X-ray and some conventional laboratory indicators, which has its own limitations. Besides, it lacks specific indicators related to osteoporosis and imaging factors that can better reflect bone quality, such as computed tomography (CT). OBJECTIVE To construct a novel predicting model based on bone turn-over markers and CT to identify patients who were more inclined to suffer spine refracture. METHODS CT images and clinical information of 383 patients (training set = 240 cases of osteoporotic vertebral compression fractures (OVCF), validation set = 63, test set = 80) were retrospectively collected from January 2015 to October 2022 at three medical centers. The U-net model was adopted to automatically segment ROI. Three-dimensional (3D) cropping of all spine regions was used to achieve the final ROI regions including 3D_Full and 3D_RoiOnly. We used the Densenet 121-3D model to model the cropped region and simultaneously build a T-NIPT prediction model. Diagnostics of deep learning models were assessed by constructing ROC curves. We generated calibration curves to assess the calibration performance. Additionally, decision curve analysis (DCA) was used to assess the clinical utility of the predictive models. RESULTS The performance of the test model is comparable to its performance on the training set (dice coefficients of 0.798, an mIOU of 0.755, an SA of 0.767, and an OS of 0.017). Univariable and multivariable analysis indicate that T_P1NT was an independent risk factor for refracture. The performance of predicting refractures in different ROI regions showed that 3D_Full model exhibits the highest calibration performance, with a Hosmer-Lemeshow goodness-of-fit (HL) test statistic exceeding 0.05. The analysis of the training and test sets showed that the 3D_Full model, which integrates clinical and deep learning results, demonstrated superior performance with significant improvement (p-value < 0.05) compared to using clinical features independently or using only 3D_RoiOnly. CONCLUSION T_P1NT was an independent risk factor of refracture. Our 3D-FULL model showed better performance in predicting high-risk population of spine refracture than other models and junior doctors do. This model can be applicable to real-world translation due to its automatic segmentation and detection.
Collapse
Affiliation(s)
- Xuetao Zhu
- Department of Orthopaedic SurgeryQilu Hospital of Shandong University, Cheeloo College of Medicine of Shandong UniversityJinanP. R. China
| | - Dejian Liu
- Department of Orthopaedic SurgeryQilu Hospital of Shandong University, Cheeloo College of Medicine of Shandong UniversityJinanP. R. China
| | - Lian Liu
- Department of Emergency SurgeryQilu Hospital of Shandong University, Cheeloo College of Medicine of Shandong UniversityJinanP. R. China
| | - Jingxuan Guo
- Department of anesthesiologyAffiliated Hospital of Shandong University of Traditional Chinese MedicineJinanChina
| | - Zedi Li
- Department of Orthopaedic SurgeryQilu Hospital of Shandong University, Cheeloo College of Medicine of Shandong UniversityJinanP. R. China
| | - Yixiang Zhao
- Department of Orthopaedic SurgeryYantaishan HospitalYantaiChina
| | - Tianhao Wu
- Department of Hepatopancreatobiliary SurgeryGraduate School of Dalian Medical UniversityDalianChina
| | - Kaiwen Liu
- Department of Orthopaedic SurgeryQilu Hospital of Shandong University, Cheeloo College of Medicine of Shandong UniversityJinanP. R. China
| | - Xinyu Liu
- Department of Orthopaedic SurgeryQilu Hospital of Shandong University, Cheeloo College of Medicine of Shandong UniversityJinanP. R. China
| | - Xin Pan
- Department of Orthopaedic SurgeryQilu Hospital of Shandong University, Cheeloo College of Medicine of Shandong UniversityJinanP. R. China
| | - Lei Qi
- Department of Orthopaedic SurgeryQilu Hospital of Shandong University, Cheeloo College of Medicine of Shandong UniversityJinanP. R. China
| | - Yuanqiang Zhang
- Department of Orthopaedic SurgeryQilu Hospital of Shandong University, Cheeloo College of Medicine of Shandong UniversityJinanP. R. China
| | - Lei Cheng
- Department of Orthopaedic SurgeryQilu Hospital of Shandong University, Cheeloo College of Medicine of Shandong UniversityJinanP. R. China
| | - Bin Chen
- Department of Orthopaedic SurgeryQilu Hospital of Shandong University, Cheeloo College of Medicine of Shandong UniversityJinanP. R. China
| |
Collapse
|
7
|
Zeng Y, Zhang X, Wang J, Usui A, Ichiji K, Bukovsky I, Chou S, Funayama M, Homma N. Inconsistency between Human Observation and Deep Learning Models: Assessing Validity of Postmortem Computed Tomography Diagnosis of Drowning. JOURNAL OF IMAGING INFORMATICS IN MEDICINE 2024; 37:1-10. [PMID: 38336949 PMCID: PMC11169324 DOI: 10.1007/s10278-024-00974-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/28/2023] [Revised: 10/18/2023] [Accepted: 11/17/2023] [Indexed: 02/12/2024]
Abstract
Drowning diagnosis is a complicated process in the autopsy, even with the assistance of autopsy imaging and the on-site information from where the body was found. Previous studies have developed well-performed deep learning (DL) models for drowning diagnosis. However, the validity of the DL models was not assessed, raising doubts about whether the learned features accurately represented the medical findings observed by human experts. In this paper, we assessed the medical validity of DL models that had achieved high classification performance for drowning diagnosis. This retrospective study included autopsy cases aged 8-91 years who underwent postmortem computed tomography between 2012 and 2021 (153 drowning and 160 non-drowning cases). We first trained three deep learning models from a previous work and generated saliency maps that highlight important features in the input. To assess the validity of models, pixel-level annotations were created by four radiological technologists and further quantitatively compared with the saliency maps. All the three models demonstrated high classification performance with areas under the receiver operating characteristic curves of 0.94, 0.97, and 0.98, respectively. On the other hand, the assessment results revealed unexpected inconsistency between annotations and models' saliency maps. In fact, each model had, respectively, around 30%, 40%, and 80% of irrelevant areas in the saliency maps, suggesting the predictions of the DL models might be unreliable. The result alerts us in the careful assessment of DL tools, even those with high classification performance.
Collapse
Affiliation(s)
- Yuwen Zeng
- Department of Radiological Imaging and Informatics, Tohoku University Graduate School of Medicine, Sendai, Japan.
| | - Xiaoyong Zhang
- National Institute of Technology, Sendai College, Sendai, Japan
| | - Jiaoyang Wang
- Department of Intelligent Biomedical System Engineering, Graduate School of Biomedical Engineering, Tohoku University, Sendai, Japan
| | - Akihito Usui
- Department of Radiological Imaging and Informatics, Tohoku University Graduate School of Medicine, Sendai, Japan
| | - Kei Ichiji
- Department of Radiological Imaging and Informatics, Tohoku University Graduate School of Medicine, Sendai, Japan
| | - Ivo Bukovsky
- Faculty of Science, University of South Bohemia in Ceske Budejovice, Ceske Budejovice, Czech Republic
- Mechanical Engineering, Czech Technical University in Prague, Prague, Czech Republic
| | - Shuoyan Chou
- Department of Industrial Management, National Taiwan University of Science and Technology, Taipei, Taiwan
| | - Masato Funayama
- Department of Radiological Imaging and Informatics, Tohoku University Graduate School of Medicine, Sendai, Japan
| | - Noriyasu Homma
- Department of Radiological Imaging and Informatics, Tohoku University Graduate School of Medicine, Sendai, Japan
| |
Collapse
|
8
|
Park JY, Lee SH, Kim YJ, Kim KG, Lee GJ. Machine learning model based on radiomics features for AO/OTA classification of pelvic fractures on pelvic radiographs. PLoS One 2024; 19:e0304350. [PMID: 38814948 PMCID: PMC11139281 DOI: 10.1371/journal.pone.0304350] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2024] [Accepted: 05/10/2024] [Indexed: 06/01/2024] Open
Abstract
Depending on the degree of fracture, pelvic fracture can be accompanied by vascular damage, and in severe cases, it may progress to hemorrhagic shock. Pelvic radiography can quickly diagnose pelvic fractures, and the Association for Osteosynthesis Foundation and Orthopedic Trauma Association (AO/OTA) classification system is useful for evaluating pelvic fracture instability. This study aimed to develop a radiomics-based machine-learning algorithm to quickly diagnose fractures on pelvic X-ray and classify their instability. data used were pelvic anteroposterior radiographs of 990 adults over 18 years of age diagnosed with pelvic fractures, and 200 normal subjects. A total of 93 features were extracted based on radiomics:18 first-order, 24 GLCM, 16 GLRLM, 16 GLSZM, 5 NGTDM, and 14 GLDM features. To improve the performance of machine learning, the feature selection methods RFE, SFS, LASSO, and Ridge were used, and the machine learning models used LR, SVM, RF, XGB, MLP, KNN, and LGBM. Performance measurement was evaluated by area under the curve (AUC) by analyzing the receiver operating characteristic curve. The machine learning model was trained based on the selected features using four feature-selection methods. When the RFE feature selection method was used, the average AUC was higher than that of the other methods. Among them, the combination with the machine learning model SVM showed the best performance, with an average AUC of 0.75±0.06. By obtaining a feature-importance graph for the combination of RFE and SVM, it is possible to identify features with high importance. The AO/OTA classification of normal pelvic rings and pelvic fractures on pelvic AP radiographs using a radiomics-based machine learning model showed the highest AUC when using the SVM classification combination. Further research on the radiomic features of each part of the pelvic bone constituting the pelvic ring is needed.
Collapse
Affiliation(s)
- Jun Young Park
- Department of Health Sciences and Technology, Gachon Advanced Institute for Health Sciences and Technology (GAIHST), Gachon University, Incheon, Republic of Korea
| | - Seung Hwan Lee
- Department of Trauma Surgery, Gachon University Gil Medical Center, Gachon University, Incheon, Republic of Korea
- Department of Traumatology, Gachon University College of Medicine, Gachon University, Incheon, Republic of Korea
| | - Young Jae Kim
- Department of Health Sciences and Technology, Gachon Advanced Institute for Health Sciences and Technology (GAIHST), Gachon University, Incheon, Republic of Korea
- Department of Medical Devices R&D Center, Gachon University Gil Medical Center, Gachon University, Incheon, Republic of Korea
- Department of Biomedical Engineering, Pre-medical Course, College of Medicine, Gachon University, Incheon, Republic of Korea
| | - Kwang Gi Kim
- Department of Health Sciences and Technology, Gachon Advanced Institute for Health Sciences and Technology (GAIHST), Gachon University, Incheon, Republic of Korea
- Department of Medical Devices R&D Center, Gachon University Gil Medical Center, Gachon University, Incheon, Republic of Korea
- Department of Biomedical Engineering, Pre-medical Course, College of Medicine, Gachon University, Incheon, Republic of Korea
| | - Gil Jae Lee
- Department of Trauma Surgery, Gachon University Gil Medical Center, Gachon University, Incheon, Republic of Korea
- Department of Traumatology, Gachon University College of Medicine, Gachon University, Incheon, Republic of Korea
| |
Collapse
|
9
|
Yıldız Potter İ, Yeritsyan D, Mahar S, Kheir N, Vaziri A, Putman M, Rodriguez EK, Wu J, Nazarian A, Vaziri A. Proximal femur fracture detection on plain radiography via feature pyramid networks. Sci Rep 2024; 14:12046. [PMID: 38802519 PMCID: PMC11130146 DOI: 10.1038/s41598-024-63001-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Accepted: 05/23/2024] [Indexed: 05/29/2024] Open
Abstract
Hip fractures exceed 250,000 cases annually in the United States, with the worldwide incidence projected to increase by 240-310% by 2050. Hip fractures are predominantly diagnosed by radiologist review of radiographs. In this study, we developed a deep learning model by extending the VarifocalNet Feature Pyramid Network (FPN) for detection and localization of proximal femur fractures from plain radiography with clinically relevant metrics. We used a dataset of 823 hip radiographs of 150 subjects with proximal femur fractures and 362 controls to develop and evaluate the deep learning model. Our model attained 0.94 specificity and 0.95 sensitivity in fracture detection over the diverse imaging dataset. We compared the performance of our model against five benchmark FPN models, demonstrating 6-14% sensitivity and 1-9% accuracy improvement. In addition, we demonstrated that our model outperforms a state-of-the-art transformer model based on DINO network by 17% sensitivity and 5% accuracy, while taking half the time on average to process a radiograph. The developed model can aid radiologists and support on-premise integration with hospital cloud services to enable automatic, opportunistic screening for hip fractures.
Collapse
Affiliation(s)
| | - Diana Yeritsyan
- Carl J. Shapiro Department of Orthopaedic Surgery, Beth Israel Deaconess Medical Center (BIDMC) and Harvard Medical School, 330 Brookline Avenue, Stoneman 10, Boston, MA, 02215, USA
- Musculoskeletal Translational Innovation Initiative, Beth Israel Deaconess Medical Center and Harvard Medical School, 330 Brookline Avenue RN123, Boston, MA, 02215, USA
| | - Sarah Mahar
- Carl J. Shapiro Department of Orthopaedic Surgery, Beth Israel Deaconess Medical Center (BIDMC) and Harvard Medical School, 330 Brookline Avenue, Stoneman 10, Boston, MA, 02215, USA
- Musculoskeletal Translational Innovation Initiative, Beth Israel Deaconess Medical Center and Harvard Medical School, 330 Brookline Avenue RN123, Boston, MA, 02215, USA
| | - Nadim Kheir
- Carl J. Shapiro Department of Orthopaedic Surgery, Beth Israel Deaconess Medical Center (BIDMC) and Harvard Medical School, 330 Brookline Avenue, Stoneman 10, Boston, MA, 02215, USA
- Musculoskeletal Translational Innovation Initiative, Beth Israel Deaconess Medical Center and Harvard Medical School, 330 Brookline Avenue RN123, Boston, MA, 02215, USA
| | - Aidin Vaziri
- BioSensics, LLC, 57 Chapel Street, Newton, MA, 02458, USA
| | - Melissa Putman
- Division of Endocrinology, Massachusetts General Hospital and Harvard Medical School, 55 Fruit Street, Boston, MA, 02114, USA
| | - Edward K Rodriguez
- Carl J. Shapiro Department of Orthopaedic Surgery, Beth Israel Deaconess Medical Center (BIDMC) and Harvard Medical School, 330 Brookline Avenue, Stoneman 10, Boston, MA, 02215, USA
- Musculoskeletal Translational Innovation Initiative, Beth Israel Deaconess Medical Center and Harvard Medical School, 330 Brookline Avenue RN123, Boston, MA, 02215, USA
| | - Jim Wu
- Department of Radiology, Massachusetts General Brigham (MGB) and Harvard Medical School, 75 Francis Street, Boston, MA, 02215, USA
| | - Ara Nazarian
- Carl J. Shapiro Department of Orthopaedic Surgery, Beth Israel Deaconess Medical Center (BIDMC) and Harvard Medical School, 330 Brookline Avenue, Stoneman 10, Boston, MA, 02215, USA
- Musculoskeletal Translational Innovation Initiative, Beth Israel Deaconess Medical Center and Harvard Medical School, 330 Brookline Avenue RN123, Boston, MA, 02215, USA
- Department of Orthopaedic Surgery, Yerevan State University, Yerevan, Armenia
| | - Ashkan Vaziri
- BioSensics, LLC, 57 Chapel Street, Newton, MA, 02458, USA
| |
Collapse
|
10
|
Raj M, Ayub A, Pal AK, Pradhan J, Varish N, Kumar S, Varikasuvu SR. Diagnostic Accuracy of Artificial Intelligence-Based Algorithms in Automated Detection of Neck of Femur Fracture on a Plain Radiograph: A Systematic Review and Meta-analysis. Indian J Orthop 2024; 58:457-469. [PMID: 38694696 PMCID: PMC11058182 DOI: 10.1007/s43465-024-01130-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/26/2023] [Accepted: 02/27/2024] [Indexed: 05/04/2024]
Abstract
Objectives To evaluate the diagnostic accuracy of artificial intelligence-based algorithms in identifying neck of femur fracture on a plain radiograph. Design Systematic review and meta-analysis. Data sources PubMed, Web of science, Scopus, IEEE, and the Science direct databases were searched from inception to 30 July 2023. Eligibility criteria for study selection Eligible article types were descriptive, analytical, or trial studies published in the English language providing data on the utility of artificial intelligence (AI) based algorithms in the detection of the neck of the femur (NOF) fracture on plain X-ray. Main outcome measures The prespecified primary outcome was to calculate the sensitivity, specificity, accuracy, Youden index, and positive and negative likelihood ratios. Two teams of reviewers (each consisting of two members) extracted the data from available information in each study. The risk of bias was assessed using a mix of the CLAIM (the Checklist for AI in Medical Imaging) and QUADAS-2 (A Revised Tool for the Quality Assessment of Diagnostic Accuracy Studies) criteria. Results Of the 437 articles retrieved, five were eligible for inclusion, and the pooled sensitivity of AIs in diagnosing the fracture NOF was 85%, with a specificity of 87%. For all studies, the pooled Youden index (YI) was 0.73. The average positive likelihood ratio (PLR) was 19.88, whereas the negative likelihood ratio (NLR) was 0.17. The random effects model showed an overall odds of 1.16 (0.84-1.61) in the forest plot, comparing the AI system with those of human diagnosis. The overall heterogeneity of the studies was marginal (I2 = 51%). The CLAIM criteria for risk of bias assessment had an overall >70% score. Conclusion Artificial intelligence (AI)-based algorithms can be used as a diagnostic adjunct, benefiting clinicians by taking less time and effort in neck of the femur (NOF) fracture diagnosis. Study registration PROSPERO CRD42022375449. Supplementary Information The online version contains supplementary material available at 10.1007/s43465-024-01130-6.
Collapse
Affiliation(s)
- Manish Raj
- Department of Orthopaedic, All India Institute of Medical Sciences, Deoghar, Jharkhand India
| | - Arshad Ayub
- Department of Community and Family Medicine, All India Institute of Medical Sciences, Deoghar, Jharkhand India
| | - Arup Kumar Pal
- Department of Computer Science and Engineering, Indian Institute of Technology (Indian School of Mines), Dhanbad, Jharkhand India
| | - Jitesh Pradhan
- Department of Computer Science and Engineering, National Institute of Technology (NIT), Jamshedpur, Jharkhand India
| | - Naushad Varish
- Department of Computer Science and Engineering, GITAM University, Hyderabad Campus, Telangana, India
| | - Sumit Kumar
- Informatics Cluster, School of Computer Science, University of Petroleum and Energy Studies, Dehradun, Uttarakhand India
| | | |
Collapse
|
11
|
Cheng CT, Ooyang CH, Kang SC, Liao CH. Applications of Deep Learning in Trauma Radiology: A Narrative Review. Biomed J 2024:100743. [PMID: 38679199 DOI: 10.1016/j.bj.2024.100743] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Revised: 03/26/2024] [Accepted: 04/24/2024] [Indexed: 05/01/2024] Open
Abstract
Diagnostic imaging is essential in modern trauma care for initial evaluation and identifying injuries requiring intervention. Deep learning (DL) has become mainstream in medical image analysis and has shown promising efficacy for classification, segmentation, and lesion detection. This narrative review provides the fundamental concepts for developing DL algorithms in trauma imaging and presents an overview of current progress in each modality. DL has been applied to detect free fluid on Focused Assessment with Sonography for Trauma (FAST), traumatic findings on chest and pelvic X-rays, and computed tomography (CT) scans, identify intracranial hemorrhage on head CT, detect vertebral fractures, and identify injuries to organs like the spleen, liver, and lungs on abdominal and chest CT. Future directions involve expanding dataset size and diversity through federated learning, enhancing model explainability and transparency to build clinician trust, and integrating multimodal data to provide more meaningful insights into traumatic injuries. Though some commercial artificial intelligence products are Food and Drug Administration-approved for clinical use in the trauma field, adoption remains limited, highlighting the need for multi-disciplinary teams to engineer practical, real-world solutions. Overall, DL shows immense potential to improve the efficiency and accuracy of trauma imaging, but thoughtful development and validation are critical to ensure these technologies positively impact patient care.
Collapse
Affiliation(s)
- Chi-Tung Cheng
- Department of Trauma and Emergency Surgery, Chang Gung Memorial Hospital, Linkou, Chang Gung University, Taoyuan Taiwan
| | - Chun-Hsiang Ooyang
- Department of Trauma and Emergency Surgery, Chang Gung Memorial Hospital, Linkou, Chang Gung University, Taoyuan Taiwan
| | - Shih-Ching Kang
- Department of Trauma and Emergency Surgery, Chang Gung Memorial Hospital, Linkou, Chang Gung University, Taoyuan Taiwan.
| | - Chien-Hung Liao
- Department of Trauma and Emergency Surgery, Chang Gung Memorial Hospital, Linkou, Chang Gung University, Taoyuan Taiwan
| |
Collapse
|
12
|
Zhang Z, Ke C, Zhang Z, Chen Y, Weng H, Dong J, Hao M, Liu B, Zheng M, Li J, Ding S, Dong Y, Peng Z. Re-tear after arthroscopic rotator cuff repair can be predicted using deep learning algorithm. Front Artif Intell 2024; 7:1331853. [PMID: 38487743 PMCID: PMC10938848 DOI: 10.3389/frai.2024.1331853] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2023] [Accepted: 02/12/2024] [Indexed: 03/17/2024] Open
Abstract
The application of artificial intelligence technology in the medical field has become increasingly prevalent, yet there remains significant room for exploration in its deep implementation. Within the field of orthopedics, which integrates closely with AI due to its extensive data requirements, rotator cuff injuries are a commonly encountered condition in joint motion. One of the most severe complications following rotator cuff repair surgery is the recurrence of tears, which has a significant impact on both patients and healthcare professionals. To address this issue, we utilized the innovative EV-GCN algorithm to train a predictive model. We collected medical records of 1,631 patients who underwent rotator cuff repair surgery at a single center over a span of 5 years. In the end, our model successfully predicted postoperative re-tear before the surgery using 62 preoperative variables with an accuracy of 96.93%, and achieved an accuracy of 79.55% on an independent external dataset of 518 cases from other centers. This model outperforms human doctors in predicting outcomes with high accuracy. Through this methodology and research, our aim is to utilize preoperative prediction models to assist in making informed medical decisions during and after surgery, leading to improved treatment effectiveness. This research method and strategy can be applied to other medical fields, and the research findings can assist in making healthcare decisions.
Collapse
Affiliation(s)
- Zhewei Zhang
- Ningbo University affiliated Li Huili Hospital, Ningbo University, Ningbo, China
- Health Science Center, Ningbo University, Ningbo, China
| | - Chunhai Ke
- Ningbo University affiliated Li Huili Hospital, Ningbo University, Ningbo, China
| | - Zhibin Zhang
- Faculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo, China
- Key Laboratory of Mobile Network Application Technology of Zhejiang Province, Ningbo University, Ningbo, China
| | - Yujiong Chen
- Ningbo University affiliated Li Huili Hospital, Ningbo University, Ningbo, China
- Health Science Center, Ningbo University, Ningbo, China
| | - Hangbin Weng
- Ningbo University affiliated Li Huili Hospital, Ningbo University, Ningbo, China
- Health Science Center, Ningbo University, Ningbo, China
| | - Jieyang Dong
- Ningbo University affiliated Li Huili Hospital, Ningbo University, Ningbo, China
- Health Science Center, Ningbo University, Ningbo, China
| | - Mingming Hao
- Ningbo University affiliated Li Huili Hospital, Ningbo University, Ningbo, China
| | - Botao Liu
- Ningbo University affiliated Li Huili Hospital, Ningbo University, Ningbo, China
- Health Science Center, Ningbo University, Ningbo, China
| | - Minzhe Zheng
- Ningbo University affiliated Li Huili Hospital, Ningbo University, Ningbo, China
| | - Jin Li
- Ningbo University affiliated Li Huili Hospital, Ningbo University, Ningbo, China
| | - Shaohua Ding
- Ningbo University affiliated Li Huili Hospital, Ningbo University, Ningbo, China
| | - Yihong Dong
- Faculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo, China
- Key Laboratory of Mobile Network Application Technology of Zhejiang Province, Ningbo University, Ningbo, China
| | - Zhaoxiang Peng
- Ningbo University affiliated Li Huili Hospital, Ningbo University, Ningbo, China
| |
Collapse
|
13
|
Wang LX, Zhu ZH, Chen QC, Jiang WB, Wang YZ, Sun NK, Hu BS, Rui G, Wang LS. Development and validation of a deep-learning model for the detection of non-displaced femoral neck fractures with anteroposterior and lateral hip radiographs. Quant Imaging Med Surg 2024; 14:527-539. [PMID: 38223105 PMCID: PMC10784052 DOI: 10.21037/qims-23-814] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2023] [Accepted: 10/24/2023] [Indexed: 01/16/2024]
Abstract
Background Hip fractures, including femoral neck fractures, are a significant cause of morbidity and mortality in the elderly population and are typically diagnosed using plain radiography. However, diagnosing non-displaced femoral neck fractures can be challenging due to their subtle appearance on hip radiographs. Previous deep-learning models have shown low accuracy in identifying these fractures on anteroposterior (AP) radiographs; however, no studies have used lateral radiographs. This study aimed to evaluate the potential of using deep-learning with both AP and lateral hip radiographs to automatically identify non-displaced femoral neck fractures. Methods We conducted a retrospective analysis of patients with femoral neck fractures at The First Affiliated Hospital of Xiamen University. All the hip radiographs were reviewed, and cases of non-displaced femoral neck fractures were included in the study. Additionally, 439 participants with normal hip radiographs were also included in the study. A vision transformer (Vit) model was developed using 1,536 AP and lateral hip radiograph. The model's performance was compared to the performance of two groups of human observers: an expert group comprising orthopedic surgeons and radiologists, and a non-expert group, including emergency physicians and general practice doctors. We also carried out the external validation using two additional data sets to assess the generalizability of the model. Results The Vit model showed exceptional performance in detecting non-displaced femoral neck fractures on paired AP and lateral hip radiographs, achieving a binary accuracy of 95.8% [95% confidence interval (CI): 94.9%, 96.8%] and an area under the curve (AUC) of 0.988. Compared to the human observers, the model had a higher accuracy of 96.7% (95% CI: 93.9%, 99.5%) on the paired AP and lateral hip radiographs, while the accuracy of the expert group was 90.5% (95% CI: 85.7%, 95.2%). Further, the model maintained good performance during the external validation, with an AUC of 0.959 on the paired AP and lateral views. Conclusions Our Vit model showed expert-level performance in identifying non-displaced femoral neck fractures on paired AP and lateral hip radiographs. This model has the potential to enhance diagnosis accuracy and improve patient outcomes by reducing the need for additional examinations and preoperative time.
Collapse
Affiliation(s)
- Lian-Xin Wang
- Department of Orthopedics, The First Affiliated Hospital of Xiamen University, Xiamen, China
| | - Zhong-Hang Zhu
- Department of Computer Science, Xiamen University, Xiamen, China
| | - Qi-Chang Chen
- Department of Computer Science, Xiamen University, Xiamen, China
| | - Wei-Bo Jiang
- Department of Orthopedics, The Second Affiliated Hospital of Jilin University, Changchun, China
| | - Yao-Zong Wang
- Department of Orthopedics, Zhongshan Hospital of Xiamen University, Xiamen, China
| | - Nai-Kun Sun
- Department of Orthopedics, The First Affiliated Hospital of Xiamen University, Xiamen, China
| | - Bao-Shan Hu
- Department of Orthopedics, The First Affiliated Hospital of Xiamen University, Xiamen, China
| | - Gang Rui
- Department of Orthopedics, The First Affiliated Hospital of Xiamen University, Xiamen, China
| | - Lian-Sheng Wang
- Department of Computer Science, Xiamen University, Xiamen, China
| |
Collapse
|
14
|
Jung J, Dai J, Liu B, Wu Q. Artificial intelligence in fracture detection with different image modalities and data types: A systematic review and meta-analysis. PLOS DIGITAL HEALTH 2024; 3:e0000438. [PMID: 38289965 PMCID: PMC10826962 DOI: 10.1371/journal.pdig.0000438] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Accepted: 12/25/2023] [Indexed: 02/01/2024]
Abstract
Artificial Intelligence (AI), encompassing Machine Learning and Deep Learning, has increasingly been applied to fracture detection using diverse imaging modalities and data types. This systematic review and meta-analysis aimed to assess the efficacy of AI in detecting fractures through various imaging modalities and data types (image, tabular, or both) and to synthesize the existing evidence related to AI-based fracture detection. Peer-reviewed studies developing and validating AI for fracture detection were identified through searches in multiple electronic databases without time limitations. A hierarchical meta-analysis model was used to calculate pooled sensitivity and specificity. A diagnostic accuracy quality assessment was performed to evaluate bias and applicability. Of the 66 eligible studies, 54 identified fractures using imaging-related data, nine using tabular data, and three using both. Vertebral fractures were the most common outcome (n = 20), followed by hip fractures (n = 18). Hip fractures exhibited the highest pooled sensitivity (92%; 95% CI: 87-96, p< 0.01) and specificity (90%; 95% CI: 85-93, p< 0.01). Pooled sensitivity and specificity using image data (92%; 95% CI: 90-94, p< 0.01; and 91%; 95% CI: 88-93, p < 0.01) were higher than those using tabular data (81%; 95% CI: 77-85, p< 0.01; and 83%; 95% CI: 76-88, p < 0.01), respectively. Radiographs demonstrated the highest pooled sensitivity (94%; 95% CI: 90-96, p < 0.01) and specificity (92%; 95% CI: 89-94, p< 0.01). Patient selection and reference standards were major concerns in assessing diagnostic accuracy for bias and applicability. AI displays high diagnostic accuracy for various fracture outcomes, indicating potential utility in healthcare systems for fracture diagnosis. However, enhanced transparency in reporting and adherence to standardized guidelines are necessary to improve the clinical applicability of AI. Review Registration: PROSPERO (CRD42021240359).
Collapse
Affiliation(s)
- Jongyun Jung
- Department of Biomedical Informatics (Dr. Qing Wu, Jongyun Jung, and Jingyuan Dai), College of Medicine, The Ohio State University, Columbus, Ohio, United States of America
| | - Jingyuan Dai
- Department of Biomedical Informatics (Dr. Qing Wu, Jongyun Jung, and Jingyuan Dai), College of Medicine, The Ohio State University, Columbus, Ohio, United States of America
| | - Bowen Liu
- Department of Mathematics and Statistics, Division of Computing, Analytics, and Mathematics, School of Science and Engineering (Bowen Liu), University of Missouri-Kansas City, Kansas City, Missouri, United States of America
| | - Qing Wu
- Department of Biomedical Informatics (Dr. Qing Wu, Jongyun Jung, and Jingyuan Dai), College of Medicine, The Ohio State University, Columbus, Ohio, United States of America
| |
Collapse
|
15
|
Ueda Y, Morishita J. Patient Identification Based on Deep Metric Learning for Preventing Human Errors in Follow-up X-Ray Examinations. J Digit Imaging 2023; 36:1941-1953. [PMID: 37308675 PMCID: PMC10501972 DOI: 10.1007/s10278-023-00850-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2023] [Revised: 05/08/2023] [Accepted: 05/10/2023] [Indexed: 06/14/2023] Open
Abstract
Biological fingerprints extracted from clinical images can be used for patient identity verification to determine misfiled clinical images in picture archiving and communication systems. However, such methods have not been incorporated into clinical use, and their performance can degrade with variability in the clinical images. Deep learning can be used to improve the performance of these methods. A novel method is proposed to automatically identify individuals among examined patients using posteroanterior (PA) and anteroposterior (AP) chest X-ray images. The proposed method uses deep metric learning based on a deep convolutional neural network (DCNN) to overcome the extreme classification requirements for patient validation and identification. It was trained on the NIH chest X-ray dataset (ChestX-ray8) in three steps: preprocessing, DCNN feature extraction with an EfficientNetV2-S backbone, and classification with deep metric learning. The proposed method was evaluated using two public datasets and two clinical chest X-ray image datasets containing data from patients undergoing screening and hospital care. A 1280-dimensional feature extractor pretrained for 300 epochs performed the best with an area under the receiver operating characteristic curve of 0.9894, an equal error rate of 0.0269, and a top-1 accuracy of 0.839 on the PadChest dataset containing both PA and AP view positions. The findings of this study provide considerable insights into the development of automated patient identification to reduce the possibility of medical malpractice due to human errors.
Collapse
Affiliation(s)
- Yasuyuki Ueda
- Department of Medical Physics and Engineering, Area of Medical Imaging Technology and Science, Graduate School of Medicine, Division of Health Sciences, Osaka University, Osaka, Japan.
| | - Junji Morishita
- Department of Health Sciences, Faculty of Medical Sciences, Kyushu University, Fukuoka, Japan
| |
Collapse
|
16
|
Sun H, Wang X, Li Z, Liu A, Xu S, Jiang Q, Li Q, Xue Z, Gong J, Chen L, Xiao Y, Liu S. Automated Rib Fracture Detection on Chest X-Ray Using Contrastive Learning. J Digit Imaging 2023; 36:2138-2147. [PMID: 37407842 PMCID: PMC10501970 DOI: 10.1007/s10278-023-00868-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2023] [Revised: 06/05/2023] [Accepted: 06/06/2023] [Indexed: 07/07/2023] Open
Abstract
To develop a deep learning-based model for detecting rib fractures on chest X-Ray and to evaluate its performance based on a multicenter study. Chest digital radiography (DR) images from 18,631 subjects were used for the training, testing, and validation of the deep learning fracture detection model. We first built a pretrained model, a simple framework for contrastive learning of visual representations (simCLR), using contrastive learning with the training set. Then, simCLR was used as the backbone for a fully convolutional one-stage (FCOS) objective detection network to identify rib fractures from chest X-ray images. The detection performance of the network for four different types of rib fractures was evaluated using the testing set. A total of 127 images from Data-CZ and 109 images from Data-CH with the annotations for four types of rib fractures were used for evaluation. The results showed that for Data-CZ, the sensitivities of the detection model with no pretraining, pretrained ImageNet, and pretrained DR were 0.465, 0.735, and 0.822, respectively, and the average number of false positives per scan was five in all cases. For the Data-CH test set, the sensitivities of three different pretraining methods were 0.403, 0.655, and 0.748. In the identification of four fracture types, the detection model achieved the highest performance for displaced fractures, with sensitivities of 0.873 and 0.774 for the Data-CZ and Data-CH test sets, respectively, with 5 false positives per scan, followed by nondisplaced fractures, buckle fractures, and old fractures. A pretrained model can significantly improve the performance of the deep learning-based rib fracture detection based on X-ray images, which can reduce missed diagnoses and improve the diagnostic efficacy.
Collapse
Affiliation(s)
- Hongbiao Sun
- Department of Radiology, Shanghai Changzheng Hospital, Navy Medical University, No.415, Fengyang Road, Huangpu District, Shanghai, 200003, China
| | - Xiang Wang
- Department of Radiology, Shanghai Changzheng Hospital, Navy Medical University, No.415, Fengyang Road, Huangpu District, Shanghai, 200003, China
| | - Zheren Li
- Institute for Medical Imaging Technology, School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, 200030, China
- Shanghai United Imaging Intelligence Co., Ltd., No.701, Yunjin Road, Xuhui District, Shanghai, 200232, China
| | - Aie Liu
- Shanghai United Imaging Intelligence Co., Ltd., No.701, Yunjin Road, Xuhui District, Shanghai, 200232, China
| | - Shaochun Xu
- Department of Radiology, Shanghai Changzheng Hospital, Navy Medical University, No.415, Fengyang Road, Huangpu District, Shanghai, 200003, China
| | - Qinling Jiang
- Department of Radiology, Shanghai Changzheng Hospital, Navy Medical University, No.415, Fengyang Road, Huangpu District, Shanghai, 200003, China
| | - Qingchu Li
- Department of Radiology, Shanghai Changzheng Hospital, Navy Medical University, No.415, Fengyang Road, Huangpu District, Shanghai, 200003, China
| | - Zhong Xue
- Shanghai United Imaging Intelligence Co., Ltd., No.701, Yunjin Road, Xuhui District, Shanghai, 200232, China
| | - Jing Gong
- Departments of Radiology, Changhai Hospital, Navy Medical University, Shanghai, 200433, China
| | - Lei Chen
- Shanghai United Imaging Intelligence Co., Ltd., No.701, Yunjin Road, Xuhui District, Shanghai, 200232, China.
| | - Yi Xiao
- Department of Radiology, Shanghai Changzheng Hospital, Navy Medical University, No.415, Fengyang Road, Huangpu District, Shanghai, 200003, China.
| | - Shiyuan Liu
- Department of Radiology, Shanghai Changzheng Hospital, Navy Medical University, No.415, Fengyang Road, Huangpu District, Shanghai, 200003, China.
| |
Collapse
|
17
|
Gao Y, Soh NYT, Liu N, Lim G, Ting D, Cheng LTE, Wong KM, Liew C, Oh HC, Tan JR, Venkataraman N, Goh SH, Yan YY. Application of a deep learning algorithm in the detection of hip fractures. iScience 2023; 26:107350. [PMID: 37554447 PMCID: PMC10404720 DOI: 10.1016/j.isci.2023.107350] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Revised: 05/30/2023] [Accepted: 07/06/2023] [Indexed: 08/10/2023] Open
Abstract
This paper describes the development of a deep learning model for prediction of hip fractures on pelvic radiographs (X-rays). Developed using over 40,000 pelvic radiographs from a single institution, the model demonstrated high sensitivity and specificity when applied to a test set of emergency department radiographs. This study approximates the real-world application of a deep learning fracture detection model by including radiographs with sub-optimal image quality, other non-hip fractures, and metallic implants, which were excluded from prior published work. The study also explores the effect of ethnicity on model performance, as well as the accuracy of visualization algorithm for fracture localization.
Collapse
Affiliation(s)
- Yan Gao
- Health Services Research, Changi General Hospital, Singapore Health Services (SingHealth), Singapore, Singapore
| | - Nicholas Yock Teck Soh
- Department of Diagnostic Radiology, Changi General Hospital, Singapore Health Services (SingHealth), Singapore, Singapore
| | - Nan Liu
- Centre for Quantitative Medicine, Duke-NUS Medical School, Singapore, Singapore
| | - Gilbert Lim
- Centre for Quantitative Medicine, Duke-NUS Medical School, Singapore, Singapore
| | - Daniel Ting
- Singapore Health Services (SingHealth), Duke-NUS Medical School, Singapore, Singapore
| | - Lionel Tim-Ee Cheng
- Department of Diagnostic Radiology, Singapore General Hospital, Singapore Health Services (SingHealth), Singapore, Singapore
- Radiological Sciences ACP, Duke-NUS Medical School, Singapore, Singapore
| | - Kang Min Wong
- Department of Diagnostic Radiology, Changi General Hospital, Singapore Health Services (SingHealth), Singapore, Singapore
- Radiological Sciences ACP, Duke-NUS Medical School, Singapore, Singapore
| | - Charlene Liew
- Department of Diagnostic Radiology, Changi General Hospital, Singapore Health Services (SingHealth), Singapore, Singapore
- Radiological Sciences ACP, Duke-NUS Medical School, Singapore, Singapore
| | - Hong Choon Oh
- Health Services Research, Changi General Hospital, Singapore Health Services (SingHealth), Singapore, Singapore
| | - Jin Rong Tan
- Department of Diagnostic Radiology, Singapore General Hospital, Singapore Health Services (SingHealth), Singapore, Singapore
| | - Narayan Venkataraman
- Department of Medical Informatics, Changi General Hospital, Singapore Health Services (SingHealth), Singapore, Singapore
| | - Siang Hiong Goh
- Department of Emergency Medicine, Changi General Hospital, Singapore Health Services (SingHealth), Singapore, Singapore
| | - Yet Yen Yan
- Department of Diagnostic Radiology, Changi General Hospital, Singapore Health Services (SingHealth), Singapore, Singapore
- Radiological Sciences ACP, Duke-NUS Medical School, Singapore, Singapore
| |
Collapse
|
18
|
Salimi M, Parry JA, Shahrokhi R, Mosalamiaghili S. Application of artificial intelligence in trauma orthopedics: Limitation and prospects. World J Clin Cases 2023; 11:4231-4240. [PMID: 37449222 PMCID: PMC10337008 DOI: 10.12998/wjcc.v11.i18.4231] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/26/2023] [Revised: 04/23/2023] [Accepted: 05/08/2023] [Indexed: 06/26/2023] Open
Abstract
The varieties and capabilities of artificial intelligence and machine learning in orthopedic surgery are extensively expanding. One promising method is neural networks, emphasizing big data and computer-based learning systems to develop a statistical fracture-detecting model. It derives patterns and rules from outstanding amounts of data to analyze the probabilities of different outcomes using new sets of similar data. The sensitivity and specificity of machine learning in detecting fractures vary from previous studies. AI may be most promising in the diagnosis of less-obvious fractures that are more commonly missed. Future studies are necessary to develop more accurate and effective detection models that can be used clinically.
Collapse
Affiliation(s)
- Maryam Salimi
- Department of Orthopaedic Surgery, Denver Health Medical Center, Denver, CO 80215, United States
| | - Joshua A Parry
- Department of Orthopaedic Surgery, Denver Health Medical Center, Denver, CO 80215, United States
| | - Raha Shahrokhi
- Student Research Committee, Shiraz University of Medical Sciences, Shiraz 7138433608, Iran
| | | |
Collapse
|
19
|
Yoon MS, Kwon G, Oh J, Ryu J, Lim J, Kang BK, Lee J, Han DK. Effect of Contrast Level and Image Format on a Deep Learning Algorithm for the Detection of Pneumothorax with Chest Radiography. J Digit Imaging 2023; 36:1237-1247. [PMID: 36698035 PMCID: PMC10287877 DOI: 10.1007/s10278-022-00772-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Revised: 12/23/2022] [Accepted: 12/29/2022] [Indexed: 01/26/2023] Open
Abstract
Under the black-box nature in the deep learning model, it is uncertain how the change in contrast level and format affects the performance. We aimed to investigate the effect of contrast level and image format on the effectiveness of deep learning for diagnosing pneumothorax on chest radiographs. We collected 3316 images (1016 pneumothorax and 2300 normal images), and all images were set to the standard contrast level (100%) and stored in the Digital Imaging and Communication in Medicine and Joint Photographic Experts Group (JPEG) formats. Data were randomly separated into 80% of training and 20% of test sets, and the contrast of images in the test set was changed to 5 levels (50%, 75%, 100%, 125%, and 150%). We trained the model to detect pneumothorax using ResNet-50 with 100% level images and tested with 5-level images in the two formats. While comparing the overall performance between each contrast level in the two formats, the area under the receiver-operating characteristic curve (AUC) was significantly different (all p < 0.001) except between 125 and 150% in JPEG format (p = 0.382). When comparing the two formats at same contrast levels, AUC was significantly different (all p < 0.001) except 50% and 100% (p = 0.079 and p = 0.082, respectively). The contrast level and format of medical images could influence the performance of the deep learning model. It is required to train with various contrast levels and formats of image, and further image processing for improvement and maintenance of the performance.
Collapse
Affiliation(s)
- Myeong Seong Yoon
- Department of Emergency Medicine, College of Medicine, Hanyang University, 222 Wangsimni-Ro, Seongdong-Gu, Seoul, 04763, Republic of Korea
- Machine Learning Research Center for Medical Data, Hanyang University, 222 Wangsimni-Ro, Seongdong-Gu, Seoul, 04763, Republic of Korea
- Department of Radiological Science, Eulji University, 553 Sanseong-daero, Seongnam-si, Gyeonggi Do, 13135, Republic of Korea
| | - Gitaek Kwon
- Department of Computer Science, Hanyang University, 222 Wangsimni-Ro, Seongdong-Gu, Seoul, 04763, Republic of Korea
- VUNO, Inc, 479 Gangnam-daero, Seocho-gu, Seoul, 06541, Republic of Korea
| | - Jaehoon Oh
- Department of Emergency Medicine, College of Medicine, Hanyang University, 222 Wangsimni-Ro, Seongdong-Gu, Seoul, 04763, Republic of Korea.
- Machine Learning Research Center for Medical Data, Hanyang University, 222 Wangsimni-Ro, Seongdong-Gu, Seoul, 04763, Republic of Korea.
| | - Jongbin Ryu
- Department of Software and Computer Engineering, Ajou University, 206 World cup-ro, Suwon-si, Gyeonggi Do, 16499, Republic of Korea.
| | - Jongwoo Lim
- Department of Computer Science, Hanyang University, 222 Wangsimni-Ro, Seongdong-Gu, Seoul, 04763, Republic of Korea
- Machine Learning Research Center for Medical Data, Hanyang University, 222 Wangsimni-Ro, Seongdong-Gu, Seoul, 04763, Republic of Korea
| | - Bo-Kyeong Kang
- Machine Learning Research Center for Medical Data, Hanyang University, 222 Wangsimni-Ro, Seongdong-Gu, Seoul, 04763, Republic of Korea
- Department of Radiology, College of Medicine, Hanyang University, 222 Wangsimni-Ro, Seongdong-Gu, Seoul, 04763, Republic of Korea
| | - Juncheol Lee
- Department of Emergency Medicine, College of Medicine, Hanyang University, 222 Wangsimni-Ro, Seongdong-Gu, Seoul, 04763, Republic of Korea
| | - Dong-Kyoon Han
- Department of Radiological Science, Eulji University, 553 Sanseong-daero, Seongnam-si, Gyeonggi Do, 13135, Republic of Korea
| |
Collapse
|
20
|
Cao Z, Chen F, Grais EM, Yue F, Cai Y, Swanepoel DW, Zhao F. Machine Learning in Diagnosing Middle Ear Disorders Using Tympanic Membrane Images: A Meta-Analysis. Laryngoscope 2023; 133:732-741. [PMID: 35848851 DOI: 10.1002/lary.30291] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2022] [Revised: 06/18/2022] [Accepted: 06/21/2022] [Indexed: 11/12/2022]
Abstract
OBJECTIVE To systematically evaluate the development of Machine Learning (ML) models and compare their diagnostic accuracy for the classification of Middle Ear Disorders (MED) using Tympanic Membrane (TM) images. METHODS PubMed, EMBASE, CINAHL, and CENTRAL were searched up until November 30, 2021. Studies on the development of ML approaches for diagnosing MED using TM images were selected according to the inclusion criteria. PRISMA guidelines were followed with study design, analysis method, and outcomes extracted. Sensitivity, specificity, and area under the curve (AUC) were used to summarize the performance metrics of the meta-analysis. Risk of Bias was assessed using the Quality Assessment of Diagnostic Accuracy Studies-2 tool in combination with the Prediction Model Risk of Bias Assessment Tool. RESULTS Sixteen studies were included, encompassing 20254 TM images (7025 normal TM and 13229 MED). The sample size ranged from 45 to 6066 per study. The accuracy of the 25 included ML approaches ranged from 76.00% to 98.26%. Eleven studies (68.8%) were rated as having a low risk of bias, with the reference standard as the major domain of high risk of bias (37.5%). Sensitivity and specificity were 93% (95% CI, 90%-95%) and 85% (95% CI, 82%-88%), respectively. The AUC of total TM images was 94% (95% CI, 91%-96%). The greater AUC was found using otoendoscopic images than otoscopic images. CONCLUSIONS ML approaches perform robustly in distinguishing between normal ears and MED, however, it is proposed that a standardized TM image acquisition and annotation protocol should be developed. LEVEL OF EVIDENCE NA Laryngoscope, 133:732-741, 2023.
Collapse
Affiliation(s)
- Zuwei Cao
- Center for Rehabilitative Auditory Research, Guizhou Provincial People's Hospital, Guiyang City, China
| | - Feifan Chen
- Centre for Speech and Language Therapy and Hearing Science, Cardiff School of Sport and Health Sciences, Cardiff Metropolitan University, Cardiff, UK
| | - Emad M Grais
- Centre for Speech and Language Therapy and Hearing Science, Cardiff School of Sport and Health Sciences, Cardiff Metropolitan University, Cardiff, UK
| | - Fengjuan Yue
- Medical Examination Center, Guizhou Provincial People's Hospital, Guiyang City, China
| | - Yuexin Cai
- Department of Otolaryngology, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou City, China
| | - De Wet Swanepoel
- Department of Speech-Language Pathology and Audiology, University of Pretoria, Pretoria, South Africa
| | - Fei Zhao
- Centre for Speech and Language Therapy and Hearing Science, Cardiff School of Sport and Health Sciences, Cardiff Metropolitan University, Cardiff, UK
| |
Collapse
|
21
|
Lex JR, Di Michele J, Koucheki R, Pincus D, Whyne C, Ravi B. Artificial Intelligence for Hip Fracture Detection and Outcome Prediction: A Systematic Review and Meta-analysis. JAMA Netw Open 2023; 6:e233391. [PMID: 36930153 PMCID: PMC10024206 DOI: 10.1001/jamanetworkopen.2023.3391] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 03/18/2023] Open
Abstract
IMPORTANCE Artificial intelligence (AI) enables powerful models for establishment of clinical diagnostic and prognostic tools for hip fractures; however the performance and potential impact of these newly developed algorithms are currently unknown. OBJECTIVE To evaluate the performance of AI algorithms designed to diagnose hip fractures on radiographs and predict postoperative clinical outcomes following hip fracture surgery relative to current practices. DATA SOURCES A systematic review of the literature was performed using the MEDLINE, Embase, and Cochrane Library databases for all articles published from database inception to January 23, 2023. A manual reference search of included articles was also undertaken to identify any additional relevant articles. STUDY SELECTION Studies developing machine learning (ML) models for the diagnosis of hip fractures from hip or pelvic radiographs or to predict any postoperative patient outcome following hip fracture surgery were included. DATA EXTRACTION AND SYNTHESIS This study followed the Preferred Reporting Items for Systematic Reviews and Meta-analyses and was registered with PROSPERO. Eligible full-text articles were evaluated and relevant data extracted independently using a template data extraction form. For studies that predicted postoperative outcomes, the performance of traditional predictive statistical models, either multivariable logistic or linear regression, was recorded and compared with the performance of the best ML model on the same out-of-sample data set. MAIN OUTCOMES AND MEASURES Diagnostic accuracy of AI models was compared with the diagnostic accuracy of expert clinicians using odds ratios (ORs) with 95% CIs. Areas under the curve for postoperative outcome prediction between traditional statistical models (multivariable linear or logistic regression) and ML models were compared. RESULTS Of 39 studies that met all criteria and were included in this analysis, 18 (46.2%) used AI models to diagnose hip fractures on plain radiographs and 21 (53.8%) used AI models to predict patient outcomes following hip fracture surgery. A total of 39 598 plain radiographs and 714 939 hip fractures were used for training, validating, and testing ML models specific to diagnosis and postoperative outcome prediction, respectively. Mortality and length of hospital stay were the most predicted outcomes. On pooled data analysis, compared with clinicians, the OR for diagnostic error of ML models was 0.79 (95% CI, 0.48-1.31; P = .36; I2 = 60%) for hip fracture radiographs. For the ML models, the mean (SD) sensitivity was 89.3% (8.5%), specificity was 87.5% (9.9%), and F1 score was 0.90 (0.06). The mean area under the curve for mortality prediction was 0.84 with ML models compared with 0.79 for alternative controls (P = .09). CONCLUSIONS AND RELEVANCE The findings of this systematic review and meta-analysis suggest that the potential applications of AI to aid with diagnosis from hip radiographs are promising. The performance of AI in diagnosing hip fractures was comparable with that of expert radiologists and surgeons. However, current implementations of AI for outcome prediction do not seem to provide substantial benefit over traditional multivariable predictive statistics.
Collapse
Affiliation(s)
- Johnathan R. Lex
- Division of Orthopaedic Surgery, Department of Surgery, University of Toronto, Toronto, Ontario, Canada
- Institute of Biomedical Engineering, University of Toronto, Toronto, Ontario, Canada
- Orthopaedics Biomechanics Laboratory, Sunnybrook Research Institute, Toronto, Ontario, Canada
| | - Joseph Di Michele
- Division of Orthopaedic Surgery, Department of Surgery, University of Toronto, Toronto, Ontario, Canada
| | - Robert Koucheki
- Institute of Biomedical Engineering, University of Toronto, Toronto, Ontario, Canada
- Temerty Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
| | - Daniel Pincus
- Division of Orthopaedic Surgery, Department of Surgery, University of Toronto, Toronto, Ontario, Canada
- Division of Orthopaedic Surgery, Sunnybrook Health Sciences Centre, Toronto, Ontario, Canada
| | - Cari Whyne
- Orthopaedics Biomechanics Laboratory, Sunnybrook Research Institute, Toronto, Ontario, Canada
| | - Bheeshma Ravi
- Division of Orthopaedic Surgery, Department of Surgery, University of Toronto, Toronto, Ontario, Canada
- Division of Orthopaedic Surgery, Sunnybrook Health Sciences Centre, Toronto, Ontario, Canada
| |
Collapse
|
22
|
van de Kuit A, Oosterhoff JHF, Dijkstra H, Sprague S, Bzovsky S, Bhandari M, Swiontkowski M, Schemitsch EH, IJpma FFA, Poolman RW, Doornberg JN, Hendrickx LAM. Patients With Femoral Neck Fractures Are at Risk for Conversion to Arthroplasty After Internal Fixation: A Machine-learning Algorithm. Clin Orthop Relat Res 2022; 480:2350-2360. [PMID: 35767811 PMCID: PMC9653184 DOI: 10.1097/corr.0000000000002283] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/01/2022] [Accepted: 05/31/2022] [Indexed: 01/31/2023]
Abstract
BACKGROUND Femoral neck fractures are common and are frequently treated with internal fixation. A major disadvantage of internal fixation is the substantially high number of conversions to arthroplasty because of nonunion, malunion, avascular necrosis, or implant failure. A clinical prediction model identifying patients at high risk of conversion to arthroplasty may help clinicians in selecting patients who could have benefited from arthroplasty initially. QUESTION/PURPOSE What is the predictive performance of a machine-learning (ML) algorithm to predict conversion to arthroplasty within 24 months after internal fixation in patients with femoral neck fractures? METHODS We included 875 patients from the Fixation using Alternative Implants for the Treatment of Hip fractures (FAITH) trial. The FAITH trial consisted of patients with low-energy femoral neck fractures who were randomly assigned to receive a sliding hip screw or cancellous screws for internal fixation. Of these patients, 18% (155 of 875) underwent conversion to THA or hemiarthroplasty within the first 24 months. All patients were randomly divided into a training set (80%) and test set (20%). First, we identified 27 potential patient and fracture characteristics that may have been associated with our primary outcome, based on biomechanical rationale and previous studies. Then, random forest algorithms (an ML learning, decision tree-based algorithm that selects variables) identified 10 predictors of conversion: BMI, cardiac disease, Garden classification, use of cardiac medication, use of pulmonary medication, age, lung disease, osteoarthritis, sex, and the level of the fracture line. Based on these variables, five different ML algorithms were trained to identify patterns related to conversion. The predictive performance of these trained ML algorithms was assessed on the training and test sets based on the following performance measures: (1) discrimination (the model's ability to distinguish patients who had conversion from those who did not; expressed with the area under the receiver operating characteristic curve [AUC]), (2) calibration (the plotted estimated versus the observed probabilities; expressed with the calibration curve intercept and slope), and (3) the overall model performance (Brier score: a composite of discrimination and calibration). RESULTS None of the five ML algorithms performed well in predicting conversion to arthroplasty in the training set and the test set; AUCs of the algorithms in the training set ranged from 0.57 to 0.64, slopes of calibration plots ranged from 0.53 to 0.82, calibration intercepts ranged from -0.04 to 0.05, and Brier scores ranged from 0.14 to 0.15. The algorithms were further evaluated in the test set; AUCs ranged from 0.49 to 0.73, calibration slopes ranged from 0.17 to 1.29, calibration intercepts ranged from -1.28 to 0.34, and Brier scores ranged from 0.13 to 0.15. CONCLUSION The predictive performance of the trained algorithms was poor, despite the use of one of the best datasets available worldwide on this subject. If the current dataset consisted of different variables or more patients, the performance may have been better. Also, various reasons for conversion to arthroplasty were pooled in this study, but the separate prediction of underlying pathology (such as, avascular necrosis or nonunion) may be more precise. Finally, it may be possible that it is inherently difficult to predict conversion to arthroplasty based on preoperative variables alone. Therefore, future studies should aim to include more variables and to differentiate between the various reasons for arthroplasty. LEVEL OF EVIDENCE Level III, prognostic study.
Collapse
Affiliation(s)
- Anouk van de Kuit
- Department of Orthopaedic Surgery, University Medical Center Groningen, University of Groningen, Groningen, the Netherlands
| | - Jacobien H. F. Oosterhoff
- Department of Orthopaedic Surgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
- Department of Orthopaedic Surgery, Amsterdam UMC, University of Amsterdam, Amsterdam, the Netherlands
| | - Hidde Dijkstra
- Department of Trauma Surgery, University Medical Centre Groningen, University of Groningen, Groningen, the Netherlands
| | - Sheila Sprague
- Department of Health Research Methods, Evidence and Impact, McMaster University, Hamilton, ON, Canada
- Division of Orthopaedic Surgery, Department of Surgery, McMaster University, Hamilton, ON, Canada
| | - Sofia Bzovsky
- Division of Orthopaedic Surgery, Department of Surgery, McMaster University, Hamilton, ON, Canada
| | - Mohit Bhandari
- Department of Health Research Methods, Evidence and Impact, McMaster University, Hamilton, ON, Canada
- Division of Orthopaedic Surgery, Department of Surgery, McMaster University, Hamilton, ON, Canada
| | - Marc Swiontkowski
- Department of Orthopaedic Surgery, University of Minnesota, Minneapolis, MN, USA
| | | | - Frank F. A. IJpma
- Department of Trauma Surgery, University Medical Centre Groningen, University of Groningen, Groningen, the Netherlands
| | - Rudolf W. Poolman
- Department of Orthopaedic Surgery, University Medical Center Leiden, Leiden University, Leiden, the Netherlands
| | - Job N. Doornberg
- Department of Orthopaedic Surgery, University Medical Center Groningen, University of Groningen, Groningen, the Netherlands
- Department of Trauma Surgery, University Medical Centre Groningen, University of Groningen, Groningen, the Netherlands
- Department of Orthopaedic and Trauma Surgery, Flinders Medical Centre, Flinders University, Adelaide, Australia
| | | |
Collapse
|
23
|
Cha Y, Kim JT, Park CH, Kim JW, Lee SY, Yoo JI. Artificial intelligence and machine learning on diagnosis and classification of hip fracture: systematic review. J Orthop Surg Res 2022; 17:520. [PMID: 36456982 PMCID: PMC9714164 DOI: 10.1186/s13018-022-03408-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/21/2022] [Accepted: 11/16/2022] [Indexed: 12/05/2022] Open
Abstract
BACKGROUND In the emergency room, clinicians spend a lot of time and are exposed to mental stress. In addition, fracture classification is important for determining the surgical method and restoring the patient's mobility. Recently, with the help of computers using artificial intelligence (AI) or machine learning (ML), diagnosis and classification of hip fractures can be performed easily and quickly. The purpose of this systematic review is to search for studies that diagnose and classify for hip fracture using AI or ML, organize the results of each study, analyze the usefulness of this technology and its future use value. METHODS PubMed Central, OVID Medline, Cochrane Collaboration Library, Web of Science, EMBASE, and AHRQ databases were searched to identify relevant studies published up to June 2022 with English language restriction. The following search terms were used [All Fields] AND (", "[MeSH Terms] OR (""[All Fields] AND "bone"[All Fields]) OR "bone fractures"[All Fields] OR "fracture"[All Fields]). The following information was extracted from the included articles: authors, publication year, study period, type of image, type of fracture, number of patient or used images, fracture classification, reference diagnosis of fracture diagnosis and classification, and augments of each studies. In addition, AI name, CNN architecture type, ROI or important region labeling, data input proportion in training/validation/test, and diagnosis accuracy/AUC, classification accuracy/AUC of each studies were also extracted. RESULTS In 14 finally included studies, the accuracy of diagnosis for hip fracture by AI was 79.3-98%, and the accuracy of fracture diagnosis in AI aided humans was 90.5-97.1. The accuracy of human fracture diagnosis was 77.5-93.5. AUC of fracture diagnosis by AI was 0.905-0.99. The accuracy of fracture classification by AI was 86-98.5 and AUC was 0.873-1.0. The forest plot represented that the mean AI diagnosis accuracy was 0.92, the mean AI diagnosis AUC was 0.969, the mean AI classification accuracy was 0.914, and the mean AI classification AUC was 0.933. Among the included studies, the architecture based on the GoogLeNet architectural model or the DenseNet architectural model was the most common with three each. Among the data input proportions, the study with the lowest training rate was 57%, and the study with the highest training rate was 95%. In 14 studies, 5 studies used Grad-CAM for highlight important regions. CONCLUSION We expected that our study may be helpful in making judgments about the use of AI in the diagnosis and classification of hip fractures. It is clear that AI is a tool that can help medical staff reduce the time and effort required for hip fracture diagnosis with high accuracy. Further studies are needed to determine what effect this causes in actual clinical situations.
Collapse
Affiliation(s)
- Yonghan Cha
- grid.411061.30000 0004 0647 205XDepartment of Orthopedic Surgery, Eulji University Hospital, Daejeon, Korea
| | - Jung-Taek Kim
- grid.251916.80000 0004 0532 3933Department of Orthopedic Surgery, Ajou Medical Center, Ajou University School of Medicine, Suwon, Korea
| | - Chan-Ho Park
- Department of Orthopedic Surgery, Yonsei 100 Percent Hospital, Incheon, Korea
| | - Jin-Woo Kim
- grid.255588.70000 0004 1798 4296Department of Orthopaedic Surgery, Nowon Eulji Medical Center, Eulji University, Seoul, Korea
| | - Sang Yeob Lee
- grid.411899.c0000 0004 0624 2502Department of Biomedical Research Institute, Gyeongsang National University Hospital, Jinju, South Korea
| | - Jun-Il Yoo
- grid.411899.c0000 0004 0624 2502Department of Orthopaedic Surgery, Gyeongsang National University Hospital, 90 Chilamdong, Jinju, Gyeongnamdo 660-702 Republic of Korea
| |
Collapse
|
24
|
Kumar V, Patel S, Baburaj V, Vardhan A, Singh PK, Vaishya R. Current understanding on artificial intelligence and machine learning in orthopaedics - A scoping review. J Orthop 2022; 34:201-206. [PMID: 36104993 PMCID: PMC9465367 DOI: 10.1016/j.jor.2022.08.020] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/20/2022] [Revised: 08/16/2022] [Accepted: 08/17/2022] [Indexed: 11/25/2022] Open
Abstract
Background Artificial Intelligence (AI) has improved the way of looking at technological challenges. Today, we can afford to see many of the problems as just an input-output system rather than solving from the first principles. The field of Orthopaedics is not spared from this rapidly expanding technology. The recent surge in the use of AI can be attributed mainly to advancements in deep learning methodologies and computing resources. This review was conducted to draw an outline on the role of AI in orthopaedics. Methods We developed a search strategy and looked for articles on PubMed, Scopus, and EMBASE. A total of 40 articles were selected for this study, from tools for medical aid like imaging solutions, implant management, and robotic surgery to understanding scientific questions. Results A total of 40 studies have been included in this review. The role of AI in the various subspecialties such as arthroplasty, trauma, orthopaedic oncology, foot and ankle etc. have been discussed in detail. Conclusion AI has touched most of the aspects of Orthopaedics. The increase in technological literacy, data management plans, and hardware systems, amalgamated with the access to hand-held devices like mobiles, and electronic pads, augur well for the exciting times ahead in this field. We have discussed various technological breakthroughs in AI that have been able to perform in Orthopaedics, and also the limitations and the problem with the black-box approach of modern AI algorithms. We advocate for better interpretable algorithms which can help both the patients and surgeons alike.
Collapse
Affiliation(s)
- Vishal Kumar
- Department of Orthopaedics, Postgraduate Institute of Medical Education and Research, Chandigarh, 160012, India
| | - Sandeep Patel
- Department of Orthopaedics, Postgraduate Institute of Medical Education and Research, Chandigarh, 160012, India
| | - Vishnu Baburaj
- Department of Orthopaedics, Postgraduate Institute of Medical Education and Research, Chandigarh, 160012, India
| | - Aditya Vardhan
- Department of Orthopaedics, Postgraduate Institute of Medical Education and Research, Chandigarh, 160012, India
| | - Prasoon Kumar Singh
- Department of Orthopaedics, Postgraduate Institute of Medical Education and Research, Chandigarh, 160012, India
| | | |
Collapse
|
25
|
Hu X, Zhu Y, Qian Y, Huang R, Yin S, Zeng Z, Xie N, Ma B, Yu Y, Zhao Q, Wu Z, Wang J, Xu W, Ren Y, Li C, Zhu R, Cheng L. Prediction of subsequent osteoporotic vertebral compression fracture on CT radiography via deep learning. VIEW 2022. [DOI: 10.1002/viw.20220012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open
Affiliation(s)
- Xiao Hu
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Orthopaedic Department of Tongji Hospital, School of Medicine, School of Life Sciences and Technology Tongji University Shanghai China
- Division of Spine Department of Orthopaedics Tongji Hospital Tongji University School of Medicine Shanghai China
| | - Yanjing Zhu
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Orthopaedic Department of Tongji Hospital, School of Medicine, School of Life Sciences and Technology Tongji University Shanghai China
| | - Yadong Qian
- Division of Spine Department of Orthopaedics Tongji Hospital Tongji University School of Medicine Shanghai China
| | - Ruiqi Huang
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Orthopaedic Department of Tongji Hospital, School of Medicine, School of Life Sciences and Technology Tongji University Shanghai China
| | - Shuai Yin
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Orthopaedic Department of Tongji Hospital, School of Medicine, School of Life Sciences and Technology Tongji University Shanghai China
| | - Zhili Zeng
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Orthopaedic Department of Tongji Hospital, School of Medicine, School of Life Sciences and Technology Tongji University Shanghai China
- Division of Spine Department of Orthopaedics Tongji Hospital Tongji University School of Medicine Shanghai China
| | - Ning Xie
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Orthopaedic Department of Tongji Hospital, School of Medicine, School of Life Sciences and Technology Tongji University Shanghai China
- Division of Spine Department of Orthopaedics Tongji Hospital Tongji University School of Medicine Shanghai China
| | - Bin Ma
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Orthopaedic Department of Tongji Hospital, School of Medicine, School of Life Sciences and Technology Tongji University Shanghai China
- Division of Spine Department of Orthopaedics Tongji Hospital Tongji University School of Medicine Shanghai China
| | - Yan Yu
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Orthopaedic Department of Tongji Hospital, School of Medicine, School of Life Sciences and Technology Tongji University Shanghai China
- Division of Spine Department of Orthopaedics Tongji Hospital Tongji University School of Medicine Shanghai China
| | - Qing Zhao
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Orthopaedic Department of Tongji Hospital, School of Medicine, School of Life Sciences and Technology Tongji University Shanghai China
| | - Zhourui Wu
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Orthopaedic Department of Tongji Hospital, School of Medicine, School of Life Sciences and Technology Tongji University Shanghai China
- Division of Spine Department of Orthopaedics Tongji Hospital Tongji University School of Medicine Shanghai China
| | - Jianjie Wang
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Orthopaedic Department of Tongji Hospital, School of Medicine, School of Life Sciences and Technology Tongji University Shanghai China
- Division of Spine Department of Orthopaedics Tongji Hospital Tongji University School of Medicine Shanghai China
| | - Wei Xu
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Orthopaedic Department of Tongji Hospital, School of Medicine, School of Life Sciences and Technology Tongji University Shanghai China
- Division of Spine Department of Orthopaedics Tongji Hospital Tongji University School of Medicine Shanghai China
| | - Yilong Ren
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Orthopaedic Department of Tongji Hospital, School of Medicine, School of Life Sciences and Technology Tongji University Shanghai China
- Division of Spine Department of Orthopaedics Tongji Hospital Tongji University School of Medicine Shanghai China
| | - Chen Li
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Orthopaedic Department of Tongji Hospital, School of Medicine, School of Life Sciences and Technology Tongji University Shanghai China
- Division of Spine Department of Orthopaedics Tongji Hospital Tongji University School of Medicine Shanghai China
| | - Rongrong Zhu
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Orthopaedic Department of Tongji Hospital, School of Medicine, School of Life Sciences and Technology Tongji University Shanghai China
| | - Liming Cheng
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Orthopaedic Department of Tongji Hospital, School of Medicine, School of Life Sciences and Technology Tongji University Shanghai China
- Division of Spine Department of Orthopaedics Tongji Hospital Tongji University School of Medicine Shanghai China
| |
Collapse
|
26
|
Feng C, Zhou X, Wang H, He Y, Li Z, Tu C. Research hotspots and emerging trends of deep learning applications in orthopedics: A bibliometric and visualized study. Front Public Health 2022; 10:949366. [PMID: 35928480 PMCID: PMC9343683 DOI: 10.3389/fpubh.2022.949366] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2022] [Accepted: 06/27/2022] [Indexed: 11/13/2022] Open
Abstract
Background As a research hotspot, deep learning has been continuously combined with various research fields in medicine. Recently, there is a growing amount of deep learning-based researches in orthopedics. This bibliometric analysis aimed to identify the hotspots of deep learning applications in orthopedics in recent years and infer future research trends. Methods We screened global publication on deep learning applications in orthopedics by accessing the Web of Science Core Collection. The articles and reviews were collected without language and time restrictions. Citespace was applied to conduct the bibliometric analysis of the publications. Results A total of 822 articles and reviews were finally retrieved. The analysis showed that the application of deep learning in orthopedics has great prospects for development based on the annual publications. The most prolific country is the USA, followed by China. University of California San Francisco, and Skeletal Radiology are the most prolific institution and journal, respectively. LeCun Y is the most frequently cited author, and Nature has the highest impact factor in the cited journals. The current hot keywords are convolutional neural network, classification, segmentation, diagnosis, image, fracture, and osteoarthritis. The burst keywords are risk factor, identification, localization, and surgery. The timeline viewer showed two recent research directions for bone tumors and osteoporosis. Conclusion Publications on deep learning applications in orthopedics have increased in recent years, with the USA being the most prolific. The current research mainly focused on classifying, diagnosing and risk predicting in osteoarthritis and fractures from medical images. Future research directions may put emphasis on reducing intraoperative risk, predicting the occurrence of postoperative complications, screening for osteoporosis, and identification and classification of bone tumors from conventional imaging.
Collapse
Affiliation(s)
- Chengyao Feng
- The Department of Orthopaedics, The Second Xiangya Hospital of Central South University, Changsha, China
- Hunan Key Laboratory of Tumor Models and Individualized Medicine, The Second Xiangya Hospital of Central South University, Changsha, China
| | - Xiaowen Zhou
- Xiangya School of Medicine, Central South University, Changsha, China
| | - Hua Wang
- Xiangya School of Medicine, Central South University, Changsha, China
| | - Yu He
- The Department of Radiology, The Second Xiangya Hospital of Central South University, Changsha, China
| | - Zhihong Li
- The Department of Orthopaedics, The Second Xiangya Hospital of Central South University, Changsha, China
- Hunan Key Laboratory of Tumor Models and Individualized Medicine, The Second Xiangya Hospital of Central South University, Changsha, China
| | - Chao Tu
- The Department of Orthopaedics, The Second Xiangya Hospital of Central South University, Changsha, China
- Hunan Key Laboratory of Tumor Models and Individualized Medicine, The Second Xiangya Hospital of Central South University, Changsha, China
- *Correspondence: Chao Tu
| |
Collapse
|
27
|
Zhang X, Yang Y, Shen YW, Zhang KR, Jiang ZK, Ma LT, Ding C, Wang BY, Meng Y, Liu H. Diagnostic accuracy and potential covariates of artificial intelligence for diagnosing orthopedic fractures: a systematic literature review and meta-analysis. Eur Radiol 2022; 32:7196-7216. [PMID: 35754091 DOI: 10.1007/s00330-022-08956-4] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2022] [Revised: 05/07/2022] [Accepted: 06/08/2022] [Indexed: 02/05/2023]
Abstract
OBJECTIVES To systematically quantify the diagnostic accuracy and identify potential covariates affecting the performance of artificial intelligence (AI) in diagnosing orthopedic fractures. METHODS PubMed, Embase, Web of Science, and Cochrane Library were systematically searched for studies on AI applications in diagnosing orthopedic fractures from inception to September 29, 2021. Pooled sensitivity and specificity and the area under the receiver operating characteristic curves (AUC) were obtained. This study was registered in the PROSPERO database prior to initiation (CRD 42021254618). RESULTS Thirty-nine were eligible for quantitative analysis. The overall pooled AUC, sensitivity, and specificity were 0.96 (95% CI 0.94-0.98), 90% (95% CI 87-92%), and 92% (95% CI 90-94%), respectively. In subgroup analyses, multicenter designed studies yielded higher sensitivity (92% vs. 88%) and specificity (94% vs. 91%) than single-center studies. AI demonstrated higher sensitivity with transfer learning (with vs. without: 92% vs. 87%) or data augmentation (with vs. without: 92% vs. 87%), compared to those without. Utilizing plain X-rays as input images for AI achieved results comparable to CT (AUC 0.96 vs. 0.96). Moreover, AI achieved comparable results to humans (AUC 0.97 vs. 0.97) and better results than non-expert human readers (AUC 0.98 vs. 0.96; sensitivity 95% vs. 88%). CONCLUSIONS AI demonstrated high accuracy in diagnosing orthopedic fractures from medical images. Larger-scale studies with higher design quality are needed to validate our findings. KEY POINTS • Multicenter study design, application of transfer learning, and data augmentation are closely related to improving the performance of artificial intelligence models in diagnosing orthopedic fractures. • Utilizing plain X-rays as input images for AI to diagnose fractures achieved results comparable to CT (AUC 0.96 vs. 0.96). • AI achieved comparable results to humans (AUC 0.97 vs. 0.97) but was superior to non-expert human readers (AUC 0.98 vs. 0.96, sensitivity 95% vs. 88%) in diagnosing fractures.
Collapse
Affiliation(s)
- Xiang Zhang
- Department of Orthopedics, Orthopedic Research Institute, West China Hospital, Sichuan University, No. 37 Guo Xue Rd, Chengdu, 610041, China
| | - Yi Yang
- Department of Orthopedics, Orthopedic Research Institute, West China Hospital, Sichuan University, No. 37 Guo Xue Rd, Chengdu, 610041, China
| | - Yi-Wei Shen
- Department of Orthopedics, Orthopedic Research Institute, West China Hospital, Sichuan University, No. 37 Guo Xue Rd, Chengdu, 610041, China
| | - Ke-Rui Zhang
- Department of Orthopedics, Orthopedic Research Institute, West China Hospital, Sichuan University, No. 37 Guo Xue Rd, Chengdu, 610041, China
| | - Ze-Kun Jiang
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, 610000, China
| | - Li-Tai Ma
- Department of Orthopedics, Orthopedic Research Institute, West China Hospital, Sichuan University, No. 37 Guo Xue Rd, Chengdu, 610041, China
| | - Chen Ding
- Department of Orthopedics, Orthopedic Research Institute, West China Hospital, Sichuan University, No. 37 Guo Xue Rd, Chengdu, 610041, China
| | - Bei-Yu Wang
- Department of Orthopedics, Orthopedic Research Institute, West China Hospital, Sichuan University, No. 37 Guo Xue Rd, Chengdu, 610041, China
| | - Yang Meng
- Department of Orthopedics, Orthopedic Research Institute, West China Hospital, Sichuan University, No. 37 Guo Xue Rd, Chengdu, 610041, China
| | - Hao Liu
- Department of Orthopedics, Orthopedic Research Institute, West China Hospital, Sichuan University, No. 37 Guo Xue Rd, Chengdu, 610041, China.
| |
Collapse
|