1.
Pasvantis K, Protopapadakis E. Enhancing Deep Learning Model Explainability in Brain Tumor Datasets Using Post-Heuristic Approaches. J Imaging 2024; 10:232. [PMID: 39330452] [PMCID: PMC11433079] [DOI: 10.3390/jimaging10090232]
Abstract
The application of deep learning models in medical diagnosis has shown considerable efficacy in recent years. A notable limitation, however, is the inherent lack of explainability of their decision-making processes. This study addresses that constraint by enhancing the robustness of the generated interpretations. The primary focus is on refining the explanations produced by the LIME library's image explainer through post-processing mechanisms based on scenario-specific rules. Multiple experiments were conducted on publicly accessible brain tumor detection datasets. The proposed post-heuristic approach demonstrates significant advancements, yielding more robust and concrete results in the context of medical diagnosis.
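The "scenario-specific rules" described above amount to post-processing the raw region weights that an explainer returns. A minimal sketch of such a post-heuristic filter (the thresholds, superpixel ids, and function name are hypothetical illustrations, not the study's actual rules):

```python
# Hypothetical rule-based post-processing of LIME superpixel weights:
# keep only strongly positive-evidence regions, ranked by weight.
def refine_explanation(weights, min_weight=0.05, top_k=5):
    """weights: dict mapping superpixel id -> LIME weight."""
    # Rule 1: discard negative and near-zero contributions.
    positive = {sp: w for sp, w in weights.items() if w >= min_weight}
    # Rule 2: keep at most top_k regions, strongest first.
    ranked = sorted(positive.items(), key=lambda kv: kv[1], reverse=True)
    return dict(ranked[:top_k])

raw = {0: 0.40, 1: -0.20, 2: 0.10, 3: 0.02, 4: 0.25}
print(refine_explanation(raw))  # -> {0: 0.4, 4: 0.25, 2: 0.1}
```

The effect is an explanation overlay restricted to the few regions that most support the predicted class, rather than a noisy map over the whole image.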
Affiliation(s)
- Konstantinos Pasvantis
- Department of Applied Informatics, University of Macedonia, Egnatia 156, 546 36 Thessaloniki, Greece
- Eftychios Protopapadakis
- Department of Applied Informatics, University of Macedonia, Egnatia 156, 546 36 Thessaloniki, Greece
2.
Fanizzi A, Fadda F, Maddalo M, Saponaro S, Lorenzon L, Ubaldi L, Lambri N, Giuliano A, Loi E, Signoriello M, Branchini M, Belmonte G, Giannelli M, Mancosu P, Talamonti C, Iori M, Tangaro S, Avanzo M, Massafra R. Developing an ensemble machine learning study: Insights from a multi-center proof-of-concept study. PLoS One 2024; 19:e0303217. [PMID: 39255296] [PMCID: PMC11386419] [DOI: 10.1371/journal.pone.0303217]
Abstract
BACKGROUND To address numerous unmet clinical needs, several machine learning models applied to medical images and clinical data have been introduced in recent years. Even when they achieve encouraging results, they lack evolutionary progression and persist as isolated, stand-alone entities. We postulated that different algorithms proposed in the literature to address the same diagnostic task can be aggregated to enhance classification performance, and we present a proof of concept for an ensemble approach that integrates different algorithms developed for the same clinical task. METHODS The proposed approach was developed starting from a public database of radiomic features extracted from CT images of 535 patients with lung cancer. Seven algorithms were trained independently by participants in the AI4MP working group on Artificial Intelligence of the Italian Association of Physics in Medicine to discriminate metastatic from non-metastatic patients. The classification scores generated by these algorithms were then used to train an SVM classifier, and an explainable artificial intelligence approach was applied to the final model. The ensemble model was validated following an 80-20 hold-out scheme and a leave-one-out scheme on the training set. RESULTS Compared with the individual algorithms, a more accurate result was achieved: on the independent test set, the ensemble model reached an accuracy of 0.78, an F1-score of 0.57 and a log-loss of 0.49. Shapley values quantifying the contribution of each algorithm to the final classification result were calculated. This information is an added value for the end user, useful for evaluating the appropriateness of the classification result on a particular case, and it also allows a global assessment of which methodological approaches of the individual algorithms are likely to have the most impact.
CONCLUSION Our proposal is an innovative approach for integrating the different algorithms that populate the literature, and it lays the foundation for future evaluations in broader application scenarios.
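The stacking scheme described in the methods (base classifiers' scores serving as meta-features for an SVM) can be sketched as follows, using synthetic scores rather than the study's data:

```python
# Illustrative stacking sketch (not the authors' code): probability scores
# from several base algorithms become the feature vector of an SVM meta-model.
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(0)
y = rng.integers(0, 2, size=200)          # synthetic binary labels

# Simulated scores from three base algorithms: informative but noisy,
# standing in for the seven independently trained classifiers.
scores = np.column_stack([
    np.clip(y + rng.normal(0, 0.3, 200), 0, 1) for _ in range(3)
])

meta = SVC(kernel="rbf", random_state=0)
meta.fit(scores, y)                       # scores are the meta-features
print("training accuracy:", meta.score(scores, y))
```

In the real study each column would be the cross-validated output of one participant's algorithm, and Shapley values over these meta-features then attribute the ensemble's decision back to the individual algorithms.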
Affiliation(s)
- Annarita Fanizzi
- Laboratorio Biostatistica e Bioinformatica, I.R.C.C.S. Istituto Tumori 'Giovanni Paolo II', Bari, Italy
- Federico Fadda
- Laboratorio Biostatistica e Bioinformatica, I.R.C.C.S. Istituto Tumori 'Giovanni Paolo II', Bari, Italy
- Michele Maddalo
- Servizio di Fisica Sanitaria, Azienda Ospedaliero-Universitaria di Parma, Parma, Italy
- Sara Saponaro
- Fisica Sanitaria, Azienda Usl Toscana Nord Ovest, Lucca, Italy
- Leda Lorenzon
- Fisica Sanitaria, Azienda Sanitaria dell'Alto Adige, Bolzano, Italy
- Leonardo Ubaldi
- Dip. Scienze Biomediche Sperimentali e Cliniche "Mario Serio", Università degli Studi di Firenze, Viale Morgagni, Firenze
- Istituto Nazionale di Fisica Nucleare, Sez. Firenze, Via Sansone 1, Sesto Fiorentino, Firenze
- Nicola Lambri
- IRCCS Humanitas Research Hospital, Medical Physics Unit of Radiotherapy and Radiosurgery Department, via Manzoni, Rozzano, Milan, Italy
- Department of Biomedical Sciences, Humanitas University, via Rita Levi Montalcini, Pieve Emanuele, Milan, Italy
- Alessia Giuliano
- U.O.C. Fisica Sanitaria, Azienda Ospedaliero-Universitaria Pisana, Pisa, Italy
- Emiliano Loi
- SC Fisica Sanitaria, IRCCS Istituto Romagnolo per lo Studio dei Tumori (IRST) "Dino Amadori", Meldola, Italy
- Michele Signoriello
- Fisica Sanitaria, Azienda sanitaria universitaria Giuliano Isontina, Trieste, Italy
- Marco Branchini
- Fisica Sanitaria, Azienda Socio Sanitaria Territoriale della Valtellina e dell'Alto Lario, Sondrio, Italy
- Gina Belmonte
- Fisica Sanitaria, Azienda Usl Toscana Nord Ovest, Lucca, Italy
- Marco Giannelli
- U.O.C. Fisica Sanitaria, Azienda Ospedaliero-Universitaria Pisana, Pisa, Italy
- Pietro Mancosu
- IRCCS Humanitas Research Hospital, Medical Physics Unit of Radiotherapy and Radiosurgery Department, via Manzoni, Rozzano, Milan, Italy
- Cinzia Talamonti
- Dip. Scienze Biomediche Sperimentali e Cliniche "Mario Serio", Università degli Studi di Firenze, Viale Morgagni, Firenze
- Istituto Nazionale di Fisica Nucleare, Sez. Firenze, Via Sansone 1, Sesto Fiorentino, Firenze
- Mauro Iori
- Medical Physics Unit, Azienda USL-IRCCS di Reggio Emilia, Reggio Emilia, Italy
- Sabina Tangaro
- Dipartimento di Fisica Applicata, Università degli Studi di Bari Aldo Moro, Bari, Italy
- Michele Avanzo
- Centro di Riferimento Oncologico di Aviano (CRO) IRCCS, Via F. Gallini, Aviano, Italy
- Raffaella Massafra
- Laboratorio Biostatistica e Bioinformatica, I.R.C.C.S. Istituto Tumori 'Giovanni Paolo II', Bari, Italy
3.
Hoogteijling S, Schaft EV, Dirks EHM, Straumann S, Demuru M, van Eijsden P, Gebbink T, Otte WM, Huiskamp GM, van 't Klooster MA, Zijlmans M. Machine learning for (non-)epileptic tissue detection from the intraoperative electrocorticogram. Clin Neurophysiol 2024; 167:14-25. [PMID: 39265288] [DOI: 10.1016/j.clinph.2024.08.012]
Abstract
OBJECTIVE Clinical visual intraoperative electrocorticography (ioECoG) reading intends to localize epileptic tissue and improve epilepsy surgery outcome. We aimed to understand whether machine learning (ML) could complement ioECoG reading, how subgroups affected performance, and which ioECoG features were most important. METHODS We included 91 ioECoG-guided epilepsy surgery patients with Engel 1A outcome. We allocated 71 training and 20 test set patients. We trained an extra trees classifier (ETC) with 14 spectral features to classify ioECoG channels as covering resected or non-resected tissue. We compared the ETC's performance with clinical ioECoG reading and assessed whether patient subgroups affected performance. Explainable artificial intelligence (xAI) unveiled the most important ioECoG features learnt by the ETC. RESULTS The ETC outperformed clinical reading in five test set patients, was inferior in six, and both were inconclusive in nine. The ETC performed best in the tumor subgroup (area under ROC curve: 0.84 [95%CI 0.79-0.89]). xAI revealed predictors of resected (relative theta, alpha, and fast ripple power) and non-resected tissue (relative beta and gamma power). CONCLUSIONS Combinations of subtle spectral ioECoG changes, imperceptible by the human eye, can aid healthy and pathological tissue discrimination. SIGNIFICANCE ML with spectral ioECoG features can support, rather than replace, clinical ioECoG reading, particularly in tumors.
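The general recipe, relative spectral band powers per channel feeding an extra trees classifier, can be sketched on synthetic signals. The band set, sampling rate, and data below are assumptions for illustration, not the study's 14-feature pipeline:

```python
# Hedged sketch: relative band powers from ECoG-like signals -> extra trees.
import numpy as np
from sklearn.ensemble import ExtraTreesClassifier

FS = 512  # assumed sampling rate (Hz)
BANDS = {"theta": (4, 8), "alpha": (8, 13), "beta": (13, 30), "gamma": (30, 80)}

def relative_band_powers(signal, fs=FS):
    """Fraction of total spectral power falling in each canonical band."""
    freqs = np.fft.rfftfreq(len(signal), d=1 / fs)
    power = np.abs(np.fft.rfft(signal)) ** 2
    total = power.sum()
    return [power[(freqs >= lo) & (freqs < hi)].sum() / total
            for lo, hi in BANDS.values()]

rng = np.random.default_rng(1)
t = np.arange(FS) / FS
# Toy channels: "resected" dominated by theta (6 Hz), "non-resected" by beta (20 Hz).
y = np.array([1] * 30 + [0] * 30)
X = np.array([relative_band_powers(np.sin(2 * np.pi * (6 if label else 20) * t)
                                   + 0.1 * rng.normal(size=FS))
              for label in y])

clf = ExtraTreesClassifier(n_estimators=50, random_state=0).fit(X, y)
print("training accuracy:", clf.score(X, y))
```

The study's model uses 14 such spectral features per channel and labels channels by whether they covered resected tissue; the xAI step then ranks which bands drove each prediction.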
Affiliation(s)
- Sem Hoogteijling
- Department of Neurology and Neurosurgery, University Medical Center Utrecht Brain Center, University Medical Center Utrecht, Part of ERN EpiCARE, P.O. box 85500, 3508 GA Utrecht, The Netherlands; Stichting Epilepsie Instellingen Nederland (SEIN), The Netherlands; Technical Medicine, University of Twente, Enschede, The Netherlands
- Eline V Schaft
- Department of Neurology and Neurosurgery, University Medical Center Utrecht Brain Center, University Medical Center Utrecht, Part of ERN EpiCARE, P.O. box 85500, 3508 GA Utrecht, The Netherlands
- Evi H M Dirks
- Department of Neurology and Neurosurgery, University Medical Center Utrecht Brain Center, University Medical Center Utrecht, Part of ERN EpiCARE, P.O. box 85500, 3508 GA Utrecht, The Netherlands; Technical Medicine, University of Twente, Enschede, The Netherlands
- Sven Straumann
- Department of Neurology and Neurosurgery, University Medical Center Utrecht Brain Center, University Medical Center Utrecht, Part of ERN EpiCARE, P.O. box 85500, 3508 GA Utrecht, The Netherlands; Department of Anesthesiology, University Hospital Bern, Switzerland
- Matteo Demuru
- Department of Neurology and Neurosurgery, University Medical Center Utrecht Brain Center, University Medical Center Utrecht, Part of ERN EpiCARE, P.O. box 85500, 3508 GA Utrecht, The Netherlands; Stichting Epilepsie Instellingen Nederland (SEIN), The Netherlands
- Pieter van Eijsden
- Department of Neurology and Neurosurgery, University Medical Center Utrecht Brain Center, University Medical Center Utrecht, Part of ERN EpiCARE, P.O. box 85500, 3508 GA Utrecht, The Netherlands
- Tineke Gebbink
- Department of Neurology and Neurosurgery, University Medical Center Utrecht Brain Center, University Medical Center Utrecht, Part of ERN EpiCARE, P.O. box 85500, 3508 GA Utrecht, The Netherlands; Stichting Epilepsie Instellingen Nederland (SEIN), The Netherlands
- Willem M Otte
- Department of Child Neurology, University Medical Center Utrecht, and Utrecht University, Utrecht, The Netherlands
- Geertjan M Huiskamp
- Department of Neurology and Neurosurgery, University Medical Center Utrecht Brain Center, University Medical Center Utrecht, Part of ERN EpiCARE, P.O. box 85500, 3508 GA Utrecht, The Netherlands
- Maryse A van 't Klooster
- Department of Neurology and Neurosurgery, University Medical Center Utrecht Brain Center, University Medical Center Utrecht, Part of ERN EpiCARE, P.O. box 85500, 3508 GA Utrecht, The Netherlands
- Maeike Zijlmans
- Department of Neurology and Neurosurgery, University Medical Center Utrecht Brain Center, University Medical Center Utrecht, Part of ERN EpiCARE, P.O. box 85500, 3508 GA Utrecht, The Netherlands; Stichting Epilepsie Instellingen Nederland (SEIN), The Netherlands
4.
Shi M, Gong Z, Zeng P, Xiang D, Cai G, Liu H, Chen S, Liu R, Chen Z, Zhang X, Chen Z. Multi-Quantifying Maxillofacial Traits via a Demographic Parity-Based AI Model. BME Frontiers 2024; 5:0054. [PMID: 39139805] [PMCID: PMC11319927] [DOI: 10.34133/bmef.0054]
Abstract
Objective and Impact Statement: Multi-quantification of distinct, individualized maxillofacial traits, that is, quantifying multiple indices, is vital for the diagnosis, decision-making, and prognosis of maxillofacial surgery. Introduction: Because the discrete and demographically disproportionate distributions of the multiple indices restrict the generalization ability of artificial intelligence (AI)-based automatic analysis, this study presents a demographic-parity strategy for AI-based multi-quantification. Methods: For the aesthetically critical maxillary alveolar basal bone, which requires quantifying a total of 9 indices across the length and width dimensions, this study collected 4,000 cone-beam computed tomography (CBCT) sagittal images and developed a deep learning model composed of a backbone and multiple regression heads with fully shared parameters to predict these quantitative metrics. Through an audit of the primary generalization result, the sensitive attribute was identified and the dataset was subdivided to train new submodels; submodels trained on the respective subsets were then ensembled for final generalization. Results: The primary generalization result showed that the AI model underperformed in quantifying major basal bone indices, and the sex factor proved to be the sensitive attribute. The final model, an ensemble of the male and female submodels, yielded equal performance between sexes, low error, high consistency, a satisfying correlation coefficient, and highly focused attention, and it exhibited high similarity to clinicians with minor processing time. Conclusion: This work validates that the demographic-parity strategy gives the AI algorithm greater generalization ability, even for highly variable traits, which benefits appearance-concerning maxillofacial surgery.
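One plausible reading of the subgroup-ensembling step is sketched below with hypothetical interfaces: a submodel is trained per value of the sensitive attribute (here sex), and inference routes each sample to its own subgroup's submodel. The class names and constant "models" are invented stand-ins:

```python
# Conceptual sketch of a demographic-parity ensemble (hypothetical interfaces).
class SubgroupEnsemble:
    def __init__(self, submodels):
        # submodels: dict mapping sensitive-attribute value (e.g. "M"/"F")
        # to a fitted regressor exposing .predict(x).
        self.submodels = submodels

    def predict(self, x, attribute):
        # Route the sample to the submodel trained on its own subgroup,
        # so no subgroup is dominated by the other's data distribution.
        return self.submodels[attribute].predict(x)

class ConstantModel:  # stand-in for a trained regression head
    def __init__(self, value):
        self.value = value

    def predict(self, x):
        return self.value

ensemble = SubgroupEnsemble({"M": ConstantModel(10.2), "F": ConstantModel(9.8)})
print(ensemble.predict([0.1, 0.2], "F"))  # -> 9.8
```

The parity audit in the study corresponds to comparing error metrics between the "M" and "F" branches and retraining until they are equal.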
Affiliation(s)
- Zhuofan Chen
- Hospital of Stomatology, Guanghua School of Stomatology, Guangdong Provincial Key Laboratory of Stomatology, Sun Yat-sen University, Guangzhou 510055, China
- Xinchun Zhang
- Hospital of Stomatology, Guanghua School of Stomatology, Guangdong Provincial Key Laboratory of Stomatology, Sun Yat-sen University, Guangzhou 510055, China
- Zetao Chen
- Hospital of Stomatology, Guanghua School of Stomatology, Guangdong Provincial Key Laboratory of Stomatology, Sun Yat-sen University, Guangzhou 510055, China
5.
Captier N, Orlhac F, Hovhannisyan-Baghdasarian N, Luporsi M, Girard N, Buvat I. RadShap: An Explanation Tool for Highlighting the Contributions of Multiple Regions of Interest to the Prediction of Radiomic Models. J Nucl Med 2024; 65:1307-1312. [PMID: 38906555] [PMCID: PMC11294068] [DOI: 10.2967/jnumed.124.267434]
Abstract
Explaining the decisions made by a radiomic model is of significant interest, as it can provide valuable insights into the information learned by complex models and foster trust in well-performing ones, thereby facilitating their clinical adoption. Promising radiomic approaches that aggregate information from multiple regions within an image currently lack suitable explanation tools that could identify the regions that most significantly influence their decisions. Here we present a model- and modality-agnostic tool (RadShap, https://github.com/ncaptier/radshap), based on Shapley values, that explains the predictions of multiregion radiomic models by highlighting the contribution of each individual region. Methods: The explanation tool leverages Shapley values to distribute the aggregative radiomic model's output among all the regions of interest of an image, highlighting their individual contribution. RadShap was validated using a retrospective cohort of 130 patients with advanced non-small cell lung cancer undergoing first-line immunotherapy. Their baseline PET scans were used to build 1,000 synthetic tasks to evaluate the degree of alignment between the tool's explanations and our data generation process. RadShap's potential was then illustrated through 2 real case studies by aggregating information from all segmented tumors: the prediction of the progression-free survival of the non-small cell lung cancer patients and the classification of the histologic tumor subtype. Results: RadShap demonstrated strong alignment with the ground truth, with a median frequency of 94% for consistently explained predictions in the synthetic tasks. In both real-case studies, the aggregative models yielded superior performance to the single-lesion models (average [±SD] time-dependent area under the receiver operating characteristic curve was 0.66 ± 0.02 for the aggregative survival model vs. 0.55 ± 0.04 for the primary tumor survival model). 
The tool's explanations provided relevant insights into the behavior of the aggregative models, highlighting that for the classification of the histologic subtype, the aggregative model used information beyond the biopsy site to correctly classify patients who were initially misclassified by a model focusing only on the biopsied tumor. Conclusion: RadShap aligned with ground truth explanations and provided valuable insights into radiomic models' behaviors. It is implemented as a user-friendly Python package with documentation and tutorials, facilitating its smooth integration into radiomic pipelines.
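Distributing an aggregative model's output among regions of interest via Shapley values can be illustrated with an exact, brute-force computation over a handful of regions. This is illustrative only; the actual tool is the radshap package cited above, and the region names and value function here are invented:

```python
# Exact Shapley attribution over regions of interest (brute force, small n).
from itertools import combinations
from math import factorial

def shapley_values(regions, value):
    """value(frozenset_of_regions) -> model output for that subset of ROIs."""
    n = len(regions)
    phi = {}
    for r in regions:
        others = [x for x in regions if x != r]
        total = 0.0
        for k in range(n):
            for subset in combinations(others, k):
                s = frozenset(subset)
                # Classic Shapley weight for a coalition of size k.
                weight = factorial(k) * factorial(n - k - 1) / factorial(n)
                total += weight * (value(s | {r}) - value(s))
        phi[r] = total
    return phi

# For an additive model, each region's Shapley value is its own contribution.
contrib = {"tumor_1": 0.5, "tumor_2": 0.2, "node_1": -0.1}
phi = shapley_values(list(contrib), lambda s: sum(contrib[r] for r in s))
print(phi)
```

For non-additive aggregations (e.g. a max-pooled or learned combination of lesion features), the marginal terms no longer coincide with individual contributions, which is exactly the situation RadShap is built to explain.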
Affiliation(s)
- Nicolas Captier
- Laboratoire d'Imagerie Translationnelle en Oncologie, Institut Curie, INSERM U1288, PSL Research University, Orsay, France
- Fanny Orlhac
- Laboratoire d'Imagerie Translationnelle en Oncologie, Institut Curie, INSERM U1288, PSL Research University, Orsay, France
- Marie Luporsi
- Laboratoire d'Imagerie Translationnelle en Oncologie, Institut Curie, INSERM U1288, PSL Research University, Orsay, France
- Department of Nuclear Medicine, Institut Curie, Paris, France
- Nicolas Girard
- Institut du Thorax Curie-Montsouris, Institut Curie, Paris, France
- Irène Buvat
- Laboratoire d'Imagerie Translationnelle en Oncologie, Institut Curie, INSERM U1288, PSL Research University, Orsay, France
6.
D'hondt L, Kellens PJ, Torfs K, Bosmans H, Bacher K, Snoeckx A. Absolute ground truth-based validation of computer-aided nodule detection and volumetry in low-dose CT imaging. Phys Med 2024; 121:103344. [PMID: 38593627] [DOI: 10.1016/j.ejmp.2024.103344]
Abstract
PURPOSE To validate the performance of computer-aided detection (CAD) and volumetry software using an anthropomorphic phantom with a ground-truth (GT) set of 3D-printed nodules. METHODS The Kyoto Kagaku Lungman phantom, containing 3D-printed solid nodules of six diameters (4 to 9 mm) and three morphologies (smooth, lobulated, spiculated), was scanned at varying CTDIvol levels (6.04, 1.54 and 0.20 mGy). Combinations of reconstruction algorithms (iterative and deep learning image reconstruction) and kernels (soft and hard) were applied. Detection, volumetry and density results recorded by a commercially available AI-based algorithm (AVIEW LCS+) were compared with the absolute GT, which was determined through µCT scanning at 50 µm resolution. Associations between image acquisition parameters or nodule characteristics and the accuracy of nodule detection and characterization were analyzed with chi-square tests and multiple linear regression. RESULTS High detection sensitivity and precision (at least 83% and 91%, respectively) were observed across all acquisitions. Neither reconstruction algorithm nor radiation dose showed a significant association with detection, whereas nodule diameter did (p < 0.0001). Volumetric measurements for nodules > 6 mm were accurate to within 10% of the GT volume, regardless of dose and reconstruction; nodule diameter and morphology were major determinants of volumetric accuracy (p < 0.001). Density assignment was not significantly influenced by any parameter. CONCLUSIONS Our study confirms the software's accurate performance in nodule volumetry, detection and density characterization, with robustness to variations in CT imaging protocols, and suggests incorporating similar phantom setups in the quality assurance of CAD tools.
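The core of the validation logic, matching CAD detections to ground-truth nodule positions and deriving sensitivity and precision, can be sketched as follows. The matching tolerance and all coordinates are invented for illustration, not the study's data:

```python
# Hedged sketch: greedy matching of CAD detections to GT nodule centroids.
def evaluate(detections, ground_truth, tol_mm=2.0):
    """detections / ground_truth: lists of (x, y, z) centroids in mm."""
    matched_gt, tp = set(), 0
    for d in detections:
        for i, g in enumerate(ground_truth):
            dist = sum((a - b) ** 2 for a, b in zip(d, g)) ** 0.5
            if i not in matched_gt and dist <= tol_mm:
                matched_gt.add(i)  # each GT nodule may be matched once
                tp += 1
                break
    sensitivity = tp / len(ground_truth)   # fraction of GT nodules found
    precision = tp / len(detections)       # fraction of detections that are real
    return sensitivity, precision

gt = [(10, 10, 10), (40, 40, 40), (70, 20, 30)]
det = [(10.5, 10.2, 9.8), (40.1, 39.7, 40.3), (90, 90, 90)]  # last is a false positive
sens, prec = evaluate(det, gt)
print(f"sensitivity={sens:.2f} precision={prec:.2f}")  # -> sensitivity=0.67 precision=0.67
```

With a µCT-derived absolute GT, the same matched pairs also support per-nodule volume-error statistics such as the 10% criterion reported above.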
Affiliation(s)
- Louise D'hondt
- Department of Human Structure and Repair, Faculty of Medicine and Health Sciences, Ghent University, Proeftuinstraat 86, Ghent, Belgium; Faculty of Medicine, University of Antwerp, Universiteitsplein 1, Wilrijk, Belgium
- Pieter-Jan Kellens
- Department of Human Structure and Repair, Faculty of Medicine and Health Sciences, Ghent University, Proeftuinstraat 86, Ghent, Belgium
- Kwinten Torfs
- Leuven University Center of Medical Physics in Radiology, University Hospitals Leuven, Herestraat 49, Leuven, Belgium
- Hilde Bosmans
- Leuven University Center of Medical Physics in Radiology, University Hospitals Leuven, Herestraat 49, Leuven, Belgium
- Klaus Bacher
- Department of Human Structure and Repair, Faculty of Medicine and Health Sciences, Ghent University, Proeftuinstraat 86, Ghent, Belgium
- Annemiek Snoeckx
- Faculty of Medicine, University of Antwerp, Universiteitsplein 1, Wilrijk, Belgium; Department of Radiology, Antwerp University Hospital, Drie Eikenstraat 655, Edegem, Belgium
7.
Yurkovich JT, Evans SJ, Rappaport N, Boore JL, Lovejoy JC, Price ND, Hood LE. The transition from genomics to phenomics in personalized population health. Nat Rev Genet 2024; 25:286-302. [PMID: 38093095] [DOI: 10.1038/s41576-023-00674-x]
Abstract
Modern health care faces several serious challenges, including an ageing population and its inherent burden of chronic diseases, rising costs and marginal quality metrics. By assessing and optimizing the health trajectory of each individual using a data-driven personalized approach that reflects their genetics, behaviour and environment, we can start to address these challenges. This assessment includes longitudinal phenome measures, such as the blood proteome and metabolome, gut microbiome composition and function, and lifestyle and behaviour through wearables and questionnaires. Here, we review ongoing large-scale genomics and longitudinal phenomics efforts and the powerful insights they provide into wellness. We describe our vision for the transformation of the current health care from disease-oriented to data-driven, wellness-oriented and personalized population health.
Affiliation(s)
- James T Yurkovich
- Phenome Health, Seattle, WA, USA
- Center for Phenomic Health, The Buck Institute for Research on Aging, Novato, CA, USA
- Department of Bioengineering, University of Texas at Dallas, Richardson, TX, USA
- Simon J Evans
- Phenome Health, Seattle, WA, USA
- Center for Phenomic Health, The Buck Institute for Research on Aging, Novato, CA, USA
- Noa Rappaport
- Center for Phenomic Health, The Buck Institute for Research on Aging, Novato, CA, USA
- Institute for Systems Biology, Seattle, WA, USA
- Jeffrey L Boore
- Phenome Health, Seattle, WA, USA
- Center for Phenomic Health, The Buck Institute for Research on Aging, Novato, CA, USA
- Jennifer C Lovejoy
- Phenome Health, Seattle, WA, USA
- Center for Phenomic Health, The Buck Institute for Research on Aging, Novato, CA, USA
- Institute for Systems Biology, Seattle, WA, USA
- Nathan D Price
- Institute for Systems Biology, Seattle, WA, USA
- Thorne HealthTech, New York, NY, USA
- Department of Bioengineering, University of Washington, Seattle, WA, USA
- Paul G. Allen School of Computer Science & Engineering, University of Washington, Seattle, WA, USA
- Leroy E Hood
- Phenome Health, Seattle, WA, USA
- Center for Phenomic Health, The Buck Institute for Research on Aging, Novato, CA, USA
- Institute for Systems Biology, Seattle, WA, USA
- Department of Bioengineering, University of Washington, Seattle, WA, USA
- Paul G. Allen School of Computer Science & Engineering, University of Washington, Seattle, WA, USA
- Department of Immunology, University of Washington, Seattle, WA, USA
8.
Gong Z, Li X, Shi M, Cai G, Chen S, Ye Z, Gan X, Yang R, Wang R, Chen Z. Measuring the binary thickness of buccal bone of anterior maxilla in low-resolution cone-beam computed tomography via a bilinear convolutional neural network. Quant Imaging Med Surg 2023; 13:8053-8066. [PMID: 38106266] [PMCID: PMC10722026] [DOI: 10.21037/qims-23-744]
Abstract
Background The thickness of the buccal bone of the anterior maxilla is an important aesthetics-determining factor for dental implants and is classified as thick (≥1 mm) or thin (<1 mm). However, because this micro-scale structure is evaluated on low-resolution cone-beam computed tomography (CBCT), its thickness measurement is error-prone given the enormous number of patients and relatively inexperienced primary dentists. The challenges of deep learning-based analysis of the binary buccal bone thickness include the substantial real-world variance caused by pixel error, the extraction of fine-grained features, and burdensome annotation. Methods This study built a bilinear convolutional neural network (BCNN) with 2 convolutional neural network (CNN) backbones and a bilinear pooling module to predict the binary thickness of the buccal bone (thick or thin) of the anterior maxilla in an end-to-end manner. Five-fold cross-validation and model ensembling were adopted at the training and testing stages. The visualization methods of Gradient-weighted Class Activation Mapping (Grad-CAM), Guided Grad-CAM, and layer-wise relevance propagation (LRP) were used to reveal the important features on which the model focused. Performance metrics and efficacy were compared between the BCNN, dentists of different clinical experience (dental student, junior dentist, and senior dentist), and the fusion of BCNN and dentists to investigate the clinical feasibility of the BCNN. Results On a dataset of 4,000 CBCT images from 1,000 patients (aged 36.15±13.09 years), the BCNN with a visual geometry group (VGG)16 backbone achieved an accuracy of 0.870 [95% confidence interval (CI): 0.838-0.902] and an area under the receiver operating characteristic (ROC) curve (AUC) of 0.924 (95% CI: 0.896-0.948). Compared with conventional CNNs, the BCNN precisely located the buccal bone wall rather than irrelevant regions. The BCNN generally outperformed the expert-level dentists, and the dentists' diagnostic performance improved with the assistance of the BCNN. Conclusions Applying a BCNN to the quantitative analysis of binary buccal bone thickness validated the model's excellent ability to extract subtle features and achieved expert-level performance. This work signals the potential of fine-grained image recognition networks for the precise quantitative analysis of micro-scale structures.
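The bilinear pooling module at the core of a BCNN combines two backbones' feature maps via location-wise outer products, followed by signed-square-root scaling and L2 normalization. A NumPy sketch under assumed feature-map shapes (the map sizes and channel counts are illustrative, not the paper's architecture):

```python
# Sketch of bilinear pooling for a two-backbone BCNN.
import numpy as np

def bilinear_pool(feat_a, feat_b):
    """feat_a: (H*W, Ca), feat_b: (H*W, Cb) flattened feature maps."""
    # Sum of location-wise outer products -> (Ca, Cb) bilinear descriptor.
    x = feat_a.T @ feat_b
    x = x.reshape(-1)
    x = np.sign(x) * np.sqrt(np.abs(x))       # signed square-root scaling
    return x / (np.linalg.norm(x) + 1e-12)    # L2 normalization

rng = np.random.default_rng(0)
a = rng.normal(size=(49, 8))    # e.g. a 7x7 map with 8 channels from backbone A
b = rng.normal(size=(49, 16))   # 16 channels from backbone B
v = bilinear_pool(a, b)
print(v.shape)  # -> (128,)
```

The pairwise channel interactions captured by the outer product are what give bilinear models their sensitivity to fine-grained texture differences such as a thin versus thick bone wall.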
Affiliation(s)
- Zhuohong Gong
- Hospital of Stomatology, Guanghua School of Stomatology, Guangdong Provincial Key Laboratory of Stomatology, Sun Yat-sen University, Guangzhou, China
- Xiaohui Li
- School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China
- Mengru Shi
- Hospital of Stomatology, Guanghua School of Stomatology, Guangdong Provincial Key Laboratory of Stomatology, Sun Yat-sen University, Guangzhou, China
- Gengbin Cai
- Hospital of Stomatology, Guanghua School of Stomatology, Guangdong Provincial Key Laboratory of Stomatology, Sun Yat-sen University, Guangzhou, China
- Shijie Chen
- Hospital of Stomatology, Guanghua School of Stomatology, Guangdong Provincial Key Laboratory of Stomatology, Sun Yat-sen University, Guangzhou, China
- Zejun Ye
- School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China
- Xuejing Gan
- Hospital of Stomatology, Guanghua School of Stomatology, Guangdong Provincial Key Laboratory of Stomatology, Sun Yat-sen University, Guangzhou, China
- Ruihan Yang
- Hospital of Stomatology, Guanghua School of Stomatology, Guangdong Provincial Key Laboratory of Stomatology, Sun Yat-sen University, Guangzhou, China
- Ruixuan Wang
- School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China
- Zetao Chen
- Hospital of Stomatology, Guanghua School of Stomatology, Guangdong Provincial Key Laboratory of Stomatology, Sun Yat-sen University, Guangzhou, China
9.
Fragoso-Garcia M, Wilm F, Bertram CA, Merz S, Schmidt A, Donovan T, Fuchs-Baumgartinger A, Bartel A, Marzahl C, Diehl L, Puget C, Maier A, Aubreville M, Breininger K, Klopfleisch R. Automated diagnosis of 7 canine skin tumors using machine learning on H&E-stained whole slide images. Vet Pathol 2023; 60:865-875. [PMID: 37515411] [PMCID: PMC10583479] [DOI: 10.1177/03009858231189205]
Abstract
Microscopic evaluation of hematoxylin and eosin-stained slides is still the diagnostic gold standard for a variety of diseases, including neoplasms, yet intra- and interrater variability are well documented among pathologists. Computer assistance via automated image analysis has shown potential to support pathologists in improving the accuracy and reproducibility of quantitative tasks. In this proof-of-principle study, we describe a machine-learning-based algorithm for the automated diagnosis of 7 of the most common canine skin tumors: trichoblastoma, squamous cell carcinoma, peripheral nerve sheath tumor, melanoma, histiocytoma, mast cell tumor, and plasmacytoma. We selected, digitized, and annotated 350 hematoxylin and eosin-stained slides (50 per tumor type) to create a database divided into training (n = 245 whole-slide images [WSIs]), validation (n = 35 WSIs), and test (n = 70 WSIs) sets. Full annotations included the 7 tumor classes and 6 normal skin structures. The data set was used to train a convolutional neural network (CNN) for the automatic segmentation of tumor and nontumor classes; the detected tumor regions were then classified patch-wise into 1 of the 7 tumor classes. A majority-of-patches approach led to a slide-level tumor classification accuracy of 95% (133/140 WSIs), with a patch-level precision of 85%. The same 140 WSIs were provided to 6 experienced pathologists for diagnosis, who achieved a similar slide-level accuracy of 98% (137/140 correct majority votes). Our results highlight the feasibility of artificial intelligence-based methods as a support tool in diagnostic oncologic pathology, with future applications in other species and tumor types.
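The majority-of-patches rule for the slide-level diagnosis reduces to a vote count over patch predictions. A toy sketch with invented patch labels (only the class names come from the study):

```python
# Slide-level diagnosis as a majority vote over patch-level predictions.
from collections import Counter

def slide_diagnosis(patch_predictions):
    """Return the tumor class predicted for the largest number of patches."""
    return Counter(patch_predictions).most_common(1)[0][0]

patches = ["melanoma", "histiocytoma", "melanoma", "melanoma", "mast cell tumor"]
print(slide_diagnosis(patches))  # -> melanoma
```

This aggregation is what lets a classifier with 85% patch-level precision reach 95% slide-level accuracy: scattered patch errors are outvoted as long as the correct class dominates.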
Affiliation(s)
- Frauke Wilm
- Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany
- Christian Marzahl
- Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany
- Andreas Maier
- Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany
10.
Lu D, Yan Y, Jiang M, Sun S, Jiang H, Lu Y, Zhang W, Zhou X. Predictive value of radiomics-based machine learning for the disease-free survival in breast cancer: a systematic review and meta-analysis. Front Oncol 2023; 13:1173090. [PMID: 37664048] [PMCID: PMC10469000] [DOI: 10.3389/fonc.2023.1173090]
Abstract
Purpose This study summarized previously published studies on radiomics-based predictive models for identifying breast cancer-associated prognostic factors, which can support clinical decision-making and follow-up strategy. Materials and methods This study was pre-registered on PROSPERO. PubMed, Embase, Cochrane Library, and Web of Science were searched, from inception to April 23, 2022, for studies that used radiomics for prognostic prediction in breast cancer patients; the search was updated on July 18, 2023. Quality assessment was conducted using the Radiomics Quality Score (RQS), and meta-analysis was performed using R software. Results A total of 975 articles were retrieved and 13 studies were included, involving 5014 participants and 35 prognostic models. Of these models, 20 were radiomics-based and 15 were based on clinical or pathological information. The primary outcome was disease-free survival (DFS). In the included studies, features were typically screened using LASSO, and Cox regression was applied for modeling. The mean RQS was 18. The c-index of the radiomics-based models for DFS prediction was 0.763 (95% CI 0.718-0.810) in the training set and 0.702 (95% CI 0.637-0.774) in the validation set; the c-index of the combination models was 0.807 (95% CI 0.736-0.885) in the training set and 0.840 (95% CI 0.794-0.888) in the validation set. There was no significant change in the c-index for DFS at 1, 2, 3, and over 5 years of follow-up. Conclusion Radiomics-based prognostic models show strong predictive performance for the prognosis of breast cancer patients, and combination models show significantly enhanced predictive performance. Systematic review registration https://www.crd.york.ac.uk/PROSPERO/, identifier CRD42022332392.
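Pooling a discrimination metric such as the c-index across studies is commonly done by inverse-variance weighting. A minimal fixed-effect sketch with invented inputs (the review's pooled values come from its own R-based analysis, and this does not reproduce its exact method):

```python
# Hedged sketch: fixed-effect inverse-variance pooling of per-study c-indices.
import math

def pooled_estimate(estimates, std_errors):
    """Weight each study by 1/SE^2 and return the pooled value with a 95% CI."""
    weights = [1 / se ** 2 for se in std_errors]
    pooled = sum(w * e for w, e in zip(weights, estimates)) / sum(weights)
    se_pooled = math.sqrt(1 / sum(weights))
    ci = (pooled - 1.96 * se_pooled, pooled + 1.96 * se_pooled)
    return pooled, ci

c_indices = [0.76, 0.71, 0.80]   # invented per-study c-indices
ses = [0.03, 0.05, 0.04]         # invented standard errors
pooled, (lo, hi) = pooled_estimate(c_indices, ses)
print(f"pooled c-index {pooled:.3f} (95% CI {lo:.3f}-{hi:.3f})")
```

A random-effects variant would additionally estimate between-study heterogeneity before weighting, which is usually preferred when study populations differ as much as they do in radiomics reviews.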
Affiliation(s)
- Dongmei Lu
- Department of Radiology, Gansu Provincial Hospital, Lanzhou, China
- Yuke Yan
- The Second Department of General Surgery, Gansu Provincial Hospital, Lanzhou, China
- Min Jiang
- Department of Radiology, Gansu Provincial Hospital, Lanzhou, China
- Shaoqin Sun
- Department of Radiology, Gansu Provincial Hospital, Lanzhou, China
- Haifeng Jiang
- Department of Radiology, Gansu Provincial Hospital, Lanzhou, China
- Yashan Lu
- Department of Radiology, Gansu Provincial Hospital, Lanzhou, China
- Wenwen Zhang
- Department of Radiology, Gansu Provincial Hospital, Lanzhou, China
- Xing Zhou
- Department of Radiology, Gansu Provincial Hospital, Lanzhou, China