Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Jiang PT, Zhang CB, Hou Q, Cheng MM, Wei Y. LayerCAM: Exploring Hierarchical Class Activation Maps for Localization. IEEE Trans Image Process 2021;30:5875-5888. [PMID: 34156941 DOI: 10.1109/tip.2021.3089943] [Citation(s) in RCA: 61] [Impact Index Per Article: 20.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]

For:	Jiang PT, Zhang CB, Hou Q, Cheng MM, Wei Y. LayerCAM: Exploring Hierarchical Class Activation Maps for Localization. IEEE Trans Image Process 2021;30:5875-5888. [PMID: 34156941 DOI: 10.1109/tip.2021.3089943] [Citation(s) in RCA: 61] [Impact Index Per Article: 20.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]

Number

Cited by Other Article(s)

Cai L, Chen L, Huang J, Wang Y, Zhang Y. Know your orientation: A viewpoint-aware framework for polyp segmentation. Med Image Anal 2024;97:103288. [PMID: 39096844 DOI: 10.1016/j.media.2024.103288] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2023] [Revised: 07/23/2024] [Accepted: 07/24/2024] [Indexed: 08/05/2024]

Joye AS, Firlie MG, Wittberg DM, Aragie S, Nash SD, Tadesse Z, Dagnew A, Hailu D, Admassu F, Wondimteka B, Getachew H, Kabtu E, Beyecha S, Shibiru M, Getnet B, Birhanu T, Abdu S, Tekew S, Lietman TM, Keenan JD, Redd TK. Computer Vision Identification of Trachomatous Inflammation-Follicular Using Deep Learning. Cornea 2024:00003226-990000000-00692. [PMID: 39312712 DOI: 10.1097/ico.0000000000003701] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2024] [Accepted: 07/25/2024] [Indexed: 09/25/2024]

Affiliation(s)

Ashlin S Joye Casey Eye Institute, Oregon Health and Science University, Portland, OR Francis I Proctor Foundation, University of California San Francisco, San Francisco, CA
Marissa G Firlie George Washington University, School of Medicine and Health Sciences, Washington, DC
Dionna M Wittberg Francis I Proctor Foundation, University of California San Francisco, San Francisco, CA
Solomon Aragie The Carter Center Ethiopia, Addis Ababa, Ethiopia
Scott D Nash The Carter Center, Atlanta, GA; and
Zerihun Tadesse The Carter Center Ethiopia, Addis Ababa, Ethiopia
Adane Dagnew The Carter Center Ethiopia, Addis Ababa, Ethiopia
Dagnachew Hailu The Carter Center Ethiopia, Addis Ababa, Ethiopia
Fisseha Admassu Department of Ophthalmology, University of Gondar, Gondar, Ethiopia
Bilen Wondimteka Department of Ophthalmology, University of Gondar, Gondar, Ethiopia
Habib Getachew Department of Ophthalmology, University of Gondar, Gondar, Ethiopia
Endale Kabtu Department of Ophthalmology, University of Gondar, Gondar, Ethiopia
Social Beyecha Department of Ophthalmology, University of Gondar, Gondar, Ethiopia
Meskerem Shibiru Department of Ophthalmology, University of Gondar, Gondar, Ethiopia
Banchalem Getnet Department of Ophthalmology, University of Gondar, Gondar, Ethiopia
Tibebe Birhanu Department of Ophthalmology, University of Gondar, Gondar, Ethiopia
Seid Abdu Department of Ophthalmology, University of Gondar, Gondar, Ethiopia
Solomon Tekew Department of Ophthalmology, University of Gondar, Gondar, Ethiopia
Thomas M Lietman Francis I Proctor Foundation, University of California San Francisco, San Francisco, CA
Jeremy D Keenan Francis I Proctor Foundation, University of California San Francisco, San Francisco, CA
Travis K Redd Casey Eye Institute, Oregon Health and Science University, Portland, OR Francis I Proctor Foundation, University of California San Francisco, San Francisco, CA

Collapse

El Hmimdi AE, Palpanas T, Kapoula Z. Efficient diagnostic classification of diverse pathologies through contextual eye movement data analysis with a novel hybrid architecture. Sci Rep 2024;14:21461. [PMID: 39271749 PMCID: PMC11399410 DOI: 10.1038/s41598-024-68056-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2023] [Accepted: 07/19/2024] [Indexed: 09/15/2024] Open

Bankin M, Tyrykin Y, Duk M, Samsonova M, Kozlov K. Modeling Chickpea Productivity with Artificial Image Objects and Convolutional Neural Network. PLANTS (BASEL, SWITZERLAND) 2024;13:2444. [PMID: 39273927 PMCID: PMC11397516 DOI: 10.3390/plants13172444] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/17/2024] [Revised: 08/23/2024] [Accepted: 08/28/2024] [Indexed: 09/15/2024]

Zhao C, Hsiao JH, Chan AB. Gradient-Based Instance-Specific Visual Explanations for Object Specification and Object Discrimination. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2024;46:5967-5985. [PMID: 38517727 DOI: 10.1109/tpami.2024.3380604] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/24/2024]

Abstract

We propose the gradient-weighted Object Detector Activation Maps (ODAM), a visual explanation technique for interpreting the predictions of object detectors. Utilizing the gradients of detector targets flowing into the intermediate feature maps, ODAM produces heat maps that show the influence of regions on the detector's decision for each predicted attribute. Compared to previous works on classification activation maps (CAM), ODAM generates instance-specific explanations rather than class-specific ones. We show that ODAM is applicable to one-stage, two-stage, and transformer-based detectors with different types of detector backbones and heads, and produces higher-quality visual explanations than the state-of-the-art in terms of both effectiveness and efficiency. We discuss two explanation tasks for object detection: 1) object specification: what is the important region for the prediction? 2) object discrimination: which object is detected? Aiming at these two aspects, we present a detailed analysis of the visual explanations of detectors and carry out extensive experiments to validate the effectiveness of the proposed ODAM. Furthermore, we investigate user trust on the explanation maps, how well the visual explanations of object detectors agrees with human explanations, as measured through human eye gaze, and whether this agreement is related with user trust. Finally, we also propose two applications, ODAM-KD and ODAM-NMS, based on these two abilities of ODAM. ODAM-KD utilizes the object specification of ODAM to generate top-down attention for key predictions and instruct the knowledge distillation of object detection. ODAM-NMS considers the location of the model's explanation for each prediction to distinguish the duplicate detected objects. A training scheme, ODAM-Train, is proposed to improve the quality on object discrimination, and help with ODAM-NMS.

Collapse

Won H, Lee HS, Youn D, Park D, Eo T, Kim W, Hwang D. Deep Learning-Based Joint Effusion Classification in Adult Knee Radiographs: A Multi-Center Prospective Study. Diagnostics (Basel) 2024;14:1900. [PMID: 39272685 PMCID: PMC11394442 DOI: 10.3390/diagnostics14171900] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2024] [Revised: 08/09/2024] [Accepted: 08/23/2024] [Indexed: 09/15/2024] Open

Guo W, Jin S, Li Y, Jiang Y. The dynamic-static dual-branch deep neural network for urban speeding hotspot identification using street view image data. ACCIDENT; ANALYSIS AND PREVENTION 2024;203:107636. [PMID: 38776837 DOI: 10.1016/j.aap.2024.107636] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/16/2024] [Revised: 04/24/2024] [Accepted: 05/10/2024] [Indexed: 05/25/2024]

Abstract

The visual information regarding the road environment can influence drivers' perception and judgment, often resulting in frequent speeding incidents. Identifying speeding hotspots in cities can prevent potential speeding incidents, thereby improving traffic safety levels. We propose the Dual-Branch Contextual Dynamic-Static Feature Fusion Network based on static panoramic images and dynamically changing sequence data, aiming to capture global features in the macro scene of the area and dynamically changing information in the micro view for a more accurate urban speeding hotspot area identification. For the static branch, we propose the Multi-scale Contextual Feature Aggregation Network for learning global spatial contextual association information. In the dynamic branch, we construct the Multi-view Dynamic Feature Fusion Network to capture the dynamically changing features of a scene from a continuous sequence of street view images. Additionally, we designed the Dynamic-Static Feature Correlation Fusion Structure to correlate and fuse dynamic and static features. The experimental results show that the model has good performance, and the overall recognition accuracy reaches 99.4%. The ablation experiments show that the recognition effect after the fusion of dynamic and static features is better than that of static and dynamic branches. The proposed model also shows better performance than other deep learning models. In addition, we combine image processing methods and different Class Activation Mapping (CAM) methods to extract speeding frequency visual features from the model perception results. The results show that more accurate speeding frequency features can be obtained by using LayerCAM and GradCAM-Plus for static global scenes and dynamic local sequences, respectively. In the static global scene, the speeding frequency features are mainly concentrated on the buildings and green layout on both sides of the road, while in the dynamic scene, the speeding frequency features shift with the scene changes and are mainly concentrated on the dynamically changing transition areas of greenery, roads, and surrounding buildings. The code and model used for identifying hotspots of urban traffic accidents in this study are available for access: https://github.com/gwt-ZJU/DCDSFF-Net.

Collapse

Dai W, Wu T, Liu R, Wang M, Yin J, Liu J. Any region can be perceived equally and effectively on rotation pretext task using full rotation and weighted-region mixture. Neural Netw 2024;176:106350. [PMID: 38723309 DOI: 10.1016/j.neunet.2024.106350] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Revised: 01/15/2024] [Accepted: 04/28/2024] [Indexed: 06/17/2024]

Yuan H, Hong C, Jiang PT, Zhao G, Tran NTA, Xu X, Yan YY, Liu N. Clinical domain knowledge-derived template improves post hoc AI explanations in pneumothorax classification. J Biomed Inform 2024;156:104673. [PMID: 38862083 DOI: 10.1016/j.jbi.2024.104673] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2024] [Revised: 06/01/2024] [Accepted: 06/07/2024] [Indexed: 06/13/2024]

Abstract

OBJECTIVE

Pneumothorax is an acute thoracic disease caused by abnormal air collection between the lungs and chest wall. Recently, artificial intelligence (AI), especially deep learning (DL), has been increasingly employed for automating the diagnostic process of pneumothorax. To address the opaqueness often associated with DL models, explainable artificial intelligence (XAI) methods have been introduced to outline regions related to pneumothorax. However, these explanations sometimes diverge from actual lesion areas, highlighting the need for further improvement.

METHOD

We propose a template-guided approach to incorporate the clinical knowledge of pneumothorax into model explanations generated by XAI methods, thereby enhancing the quality of the explanations. Utilizing one lesion delineation created by radiologists, our approach first generates a template that represents potential areas of pneumothorax occurrence. This template is then superimposed on model explanations to filter out extraneous explanations that fall outside the template's boundaries. To validate its efficacy, we carried out a comparative analysis of three XAI methods (Saliency Map, Grad-CAM, and Integrated Gradients) with and without our template guidance when explaining two DL models (VGG-19 and ResNet-50) in two real-world datasets (SIIM-ACR and ChestX-Det).

RESULTS

The proposed approach consistently improved baseline XAI methods across twelve benchmark scenarios built on three XAI methods, two DL models, and two datasets. The average incremental percentages, calculated by the performance improvements over the baseline performance, were 97.8% in Intersection over Union (IoU) and 94.1% in Dice Similarity Coefficient (DSC) when comparing model explanations and ground-truth lesion areas. We further visualized baseline and template-guided model explanations on radiographs to showcase the performance of our approach.

CONCLUSIONS

In the context of pneumothorax diagnoses, we proposed a template-guided approach for improving model explanations. Our approach not only aligns model explanations more closely with clinical insights but also exhibits extensibility to other thoracic diseases. We anticipate that our template guidance will forge a novel approach to elucidating AI models by integrating clinical domain expertise.

Collapse

Li C, Narayanan A, Ghobakhlou A. Overlapping Shoeprint Detection by Edge Detection and Deep Learning. J Imaging 2024;10:186. [PMID: 39194975 DOI: 10.3390/jimaging10080186] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2024] [Revised: 07/04/2024] [Accepted: 07/30/2024] [Indexed: 08/29/2024] Open

Bhave S, Rodriguez V, Poterucha T, Mutasa S, Aberle D, Capaccione KM, Chen Y, Dsouza B, Dumeer S, Goldstein J, Hodes A, Leb J, Lungren M, Miller M, Monoky D, Navot B, Wattamwar K, Wattamwar A, Clerkin K, Ouyang D, Ashley E, Topkara VK, Maurer M, Einstein AJ, Uriel N, Homma S, Schwartz A, Jaramillo D, Perotte AJ, Elias P. Deep learning to detect left ventricular structural abnormalities in chest X-rays. Eur Heart J 2024;45:2002-2012. [PMID: 38503537 PMCID: PMC11156488 DOI: 10.1093/eurheartj/ehad782] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/11/2023] [Revised: 09/24/2023] [Accepted: 11/14/2023] [Indexed: 03/21/2024] Open

Affiliation(s)

Shreyas Bhave Division of Cardiology and Department of Biomedical Informatics, Columbia University Irving Medical Center, 622 West 168th Street, PH20, NewYork, NY 10032, USA
Victor Rodriguez Division of Cardiology and Department of Biomedical Informatics, Columbia University Irving Medical Center, 622 West 168th Street, PH20, NewYork, NY 10032, USA
Timothy Poterucha Seymour, Paul, and Gloria Milstein Division of Cardiology, Department of Medicine, Columbia University Irving Medical Center, NewYork-Presbyterian Hospital, 630 West 168th Street, NewYork, NY 10032, USA
Simukayi Mutasa Department of Radiology, Columbia University Irving Medical Center, NewYork, NY, USA
Dwight Aberle Department of Radiology, Columbia University Irving Medical Center, NewYork, NY, USA
Kathleen M Capaccione Department of Radiology, Columbia University Irving Medical Center, NewYork, NY, USA
Yibo Chen Inova Fairfax Hospital Imaging Center, Inova Fairfax Medical Campus, Falls Church, VA, USA
Belinda Dsouza Department of Radiology, Columbia University Irving Medical Center, NewYork, NY, USA
Shifali Dumeer Department of Radiology, Columbia University Irving Medical Center, NewYork, NY, USA
Jonathan Goldstein Department of Radiology, Columbia University Irving Medical Center, NewYork, NY, USA
Aaron Hodes Hackensack Radiology Group, Hackensack Meridian School of Medicine, Nutley, NJ, USA
Jay Leb Department of Radiology, Columbia University Irving Medical Center, NewYork, NY, USA
Matthew Lungren Department of Radiology, University of California, SanFrancisco, CA, USA
Mitchell Miller Hackensack Radiology Group, Hackensack Meridian School of Medicine, Nutley, NJ, USA
David Monoky Hackensack Radiology Group, Hackensack Meridian School of Medicine, Nutley, NJ, USA
Benjamin Navot Department of Radiology, Columbia University Irving Medical Center, NewYork, NY, USA
Kapil Wattamwar Division of Vascular and Interventional Radiology, Department of Radiology, Montefiore Medical Center, Bronx, NY, USA
Anoop Wattamwar Hackensack Radiology Group, Hackensack Meridian School of Medicine, Nutley, NJ, USA
Kevin Clerkin Seymour, Paul, and Gloria Milstein Division of Cardiology, Department of Medicine, Columbia University Irving Medical Center, NewYork-Presbyterian Hospital, 630 West 168th Street, NewYork, NY 10032, USA
David Ouyang Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, CA, USA
Euan Ashley Stanford Center for Inherited Cardiovascular Disease, Stanford University School of Medicine, Palo Alto, CA, USA
Veli K Topkara Seymour, Paul, and Gloria Milstein Division of Cardiology, Department of Medicine, Columbia University Irving Medical Center, NewYork-Presbyterian Hospital, 630 West 168th Street, NewYork, NY 10032, USA
Mathew Maurer Seymour, Paul, and Gloria Milstein Division of Cardiology, Department of Medicine, Columbia University Irving Medical Center, NewYork-Presbyterian Hospital, 630 West 168th Street, NewYork, NY 10032, USA
Andrew J Einstein Seymour, Paul, and Gloria Milstein Division of Cardiology, Department of Medicine, Columbia University Irving Medical Center, NewYork-Presbyterian Hospital, 630 West 168th Street, NewYork, NY 10032, USA Department of Radiology, Columbia University Irving Medical Center, NewYork, NY, USA
Nir Uriel Seymour, Paul, and Gloria Milstein Division of Cardiology, Department of Medicine, Columbia University Irving Medical Center, NewYork-Presbyterian Hospital, 630 West 168th Street, NewYork, NY 10032, USA
Shunichi Homma Seymour, Paul, and Gloria Milstein Division of Cardiology, Department of Medicine, Columbia University Irving Medical Center, NewYork-Presbyterian Hospital, 630 West 168th Street, NewYork, NY 10032, USA
Allan Schwartz Seymour, Paul, and Gloria Milstein Division of Cardiology, Department of Medicine, Columbia University Irving Medical Center, NewYork-Presbyterian Hospital, 630 West 168th Street, NewYork, NY 10032, USA
Diego Jaramillo Department of Radiology, Columbia University Irving Medical Center, NewYork, NY, USA
Adler J Perotte Division of Cardiology and Department of Biomedical Informatics, Columbia University Irving Medical Center, 622 West 168th Street, PH20, NewYork, NY 10032, USA
Pierre Elias Division of Cardiology and Department of Biomedical Informatics, Columbia University Irving Medical Center, 622 West 168th Street, PH20, NewYork, NY 10032, USA Seymour, Paul, and Gloria Milstein Division of Cardiology, Department of Medicine, Columbia University Irving Medical Center, NewYork-Presbyterian Hospital, 630 West 168th Street, NewYork, NY 10032, USA

Collapse

You J, Ajlouni S, Kakaletri I, Charalampaki P, Giannarou S. XRelevanceCAM: towards explainable tissue characterization with improved localisation of pathological structures in probe-based confocal laser endomicroscopy. Int J Comput Assist Radiol Surg 2024;19:1061-1073. [PMID: 38538880 PMCID: PMC11178611 DOI: 10.1007/s11548-024-03096-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2024] [Accepted: 02/29/2024] [Indexed: 06/15/2024]

Wang S, Sun M, Sun J, Wang Q, Wang G, Wang X, Meng X, Wang Z, Yu H. Advancing musculoskeletal tumor diagnosis: Automated segmentation and predictive classification using deep learning and radiomics. Comput Biol Med 2024;175:108502. [PMID: 38678943 DOI: 10.1016/j.compbiomed.2024.108502] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2024] [Revised: 03/18/2024] [Accepted: 04/21/2024] [Indexed: 05/01/2024]

Rao S, Bohle M, Schiele B. Better Understanding Differences in Attribution Methods via Systematic Evaluations. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2024;46:4090-4101. [PMID: 38215324 DOI: 10.1109/tpami.2024.3353528] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/14/2024]

Odusami M, Maskeliūnas R, Damaševičius R, Misra S. Machine learning with multimodal neuroimaging data to classify stages of Alzheimer's disease: a systematic review and meta-analysis. Cogn Neurodyn 2024;18:775-794. [PMID: 38826669 PMCID: PMC11143094 DOI: 10.1007/s11571-023-09993-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Revised: 06/23/2023] [Accepted: 07/17/2023] [Indexed: 06/04/2024] Open

Abstract

In recent years, Alzheimer's disease (AD) has been a serious threat to human health. Researchers and clinicians alike encounter a significant obstacle when trying to accurately identify and classify AD stages. Several studies have shown that multimodal neuroimaging input can assist in providing valuable insights into the structural and functional changes in the brain related to AD. Machine learning (ML) algorithms can accurately categorize AD phases by identifying patterns and linkages in multimodal neuroimaging data using powerful computational methods. This study aims to assess the contribution of ML methods to the accurate classification of the stages of AD using multimodal neuroimaging data. A systematic search is carried out in IEEE Xplore, Science Direct/Elsevier, ACM DigitalLibrary, and PubMed databases with forward snowballing performed on Google Scholar. The quantitative analysis used 47 studies. The explainable analysis was performed on the classification algorithm and fusion methods used in the selected studies. The pooled sensitivity and specificity, including diagnostic efficiency, were evaluated by conducting a meta-analysis based on a bivariate model with the hierarchical summary receiver operating characteristics (ROC) curve of multimodal neuroimaging data and ML methods in the classification of AD stages. Wilcoxon signed-rank test is further used to statistically compare the accuracy scores of the existing models. With a 95% confidence interval of 78.87-87.71%, the combined sensitivity for separating participants with mild cognitive impairment (MCI) from healthy control (NC) participants was 83.77%; for separating participants with AD from NC, it was 94.60% (90.76%, 96.89%); for separating participants with progressive MCI (pMCI) from stable MCI (sMCI), it was 80.41% (74.73%, 85.06%). With a 95% confidence interval (78.87%, 87.71%), the Pooled sensitivity for distinguishing mild cognitive impairment (MCI) from healthy control (NC) participants was 83.77%, with a 95% confidence interval (90.76%, 96.89%), the Pooled sensitivity for distinguishing AD from NC was 94.60%, likewise (MCI) from healthy control (NC) participants was 83.77% progressive MCI (pMCI) from stable MCI (sMCI) was 80.41% (74.73%, 85.06%), and early MCI (EMCI) from NC was 86.63% (82.43%, 89.95%). Pooled specificity for differentiating MCI from NC was 79.16% (70.97%, 87.71%), AD from NC was 93.49% (91.60%, 94.90%), pMCI from sMCI was 81.44% (76.32%, 85.66%), and EMCI from NC was 85.68% (81.62%, 88.96%). The Wilcoxon signed rank test showed a low P-value across all the classification tasks. Multimodal neuroimaging data with ML is a promising future in classifying the stages of AD but more research is required to increase the validity of its application in clinical practice.

Collapse

Song B, Yoshida S. Explainability of three-dimensional convolutional neural networks for functional magnetic resonance imaging of Alzheimer's disease classification based on gradient-weighted class activation mapping. PLoS One 2024;19:e0303278. [PMID: 38771733 PMCID: PMC11108152 DOI: 10.1371/journal.pone.0303278] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Accepted: 04/22/2024] [Indexed: 05/23/2024] Open

Abstract

Currently, numerous studies focus on employing fMRI-based deep neural networks to diagnose neurological disorders such as Alzheimer's Disease (AD), yet only a handful have provided results regarding explainability. We address this gap by applying several prevalent explainability methods such as gradient-weighted class activation mapping (Grad-CAM) to an fMRI-based 3D-VGG16 network for AD diagnosis to improve the model's explainability. The aim is to explore the specific Region of Interest (ROI) of brain the model primarily focuses on when making predictions, as well as whether there are differences in these ROIs between AD and normal controls (NCs). First, we utilized multiple resting-state functional activity maps including ALFF, fALFF, ReHo, and VMHC to reduce the complexity of fMRI data, which differed from many studies that utilized raw fMRI data. Compared to methods utilizing raw fMRI data, this manual feature extraction approach may potentially alleviate the model's burden. Subsequently, 3D-VGG16 were employed for AD classification, where the final fully connected layers were replaced with a Global Average Pooling (GAP) layer, aimed at mitigating overfitting while preserving spatial information within the feature maps. The model achieved a maximum of 96.4% accuracy on the test set. Finally, several 3D CAM methods were employed to interpret the models. In the explainability results of the models with relatively high accuracy, the highlighted ROIs were primarily located in the precuneus and the hippocampus for AD subjects, while the models focused on the entire brain for NC. This supports current research on ROIs involved in AD. We believe that explaining deep learning models would not only provide support for existing research on brain disorders, but also offer important referential recommendations for the study of currently unknown etiologies.

Collapse

Fan Y, Li Q, Mao H, Jiang F. Magnetoencephalography Decoding Transfer Approach: From Deep Learning Models to Intrinsically Interpretable Models. IEEE J Biomed Health Inform 2024;28:2818-2829. [PMID: 38349827 DOI: 10.1109/jbhi.2024.3365051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/15/2024]

Niu Y, Ding M, Ge M, Karlsson R, Zhang Y, Carballo A, Takeda K. R-Cut: Enhancing Explainability in Vision Transformers with Relationship Weighted Out and Cut. SENSORS (BASEL, SWITZERLAND) 2024;24:2695. [PMID: 38732800 PMCID: PMC11085337 DOI: 10.3390/s24092695] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/22/2024] [Revised: 04/20/2024] [Accepted: 04/21/2024] [Indexed: 05/13/2024]

Huang C, Jiang Y, Yang X, Wei C, Chen H, Xiong W, Lin H, Wang X, Tian T, Tan H. Enhancing Retinal Fundus Image Quality Assessment With Swin-Transformer-Based Learning Across Multiple Color-Spaces. Transl Vis Sci Technol 2024;13:8. [PMID: 38568606 PMCID: PMC10996994 DOI: 10.1167/tvst.13.4.8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Accepted: 02/18/2024] [Indexed: 04/05/2024] Open

Abstract

Purpose

The assessment of retinal image (RI) quality holds significant importance in both clinical trials and large datasets, because suboptimal images can potentially conceal early signs of diseases, thereby resulting in inaccurate medical diagnoses. This study aims to develop an automatic method for Retinal Image Quality Assessment (RIQA) that incorporates visual explanations, aiming to comprehensively evaluate the quality of retinal fundus images (RIs).

Methods

We developed an automatic RIQA system, named Swin-MCSFNet, utilizing 28,792 RIs from the EyeQ dataset, as well as 2000 images from the EyePACS dataset and an additional 1,000 images from the OIA-ODIR dataset. After preprocessing, including cropping black regions, data augmentation, and normalization, a Swin-MCSFNet classifier based on the Swin-Transformer for multiple color-space fusion was proposed to grade the quality of RIs. The generalizability of Swin-MCSFNet was validated across multiple data centers. Additionally, for enhanced interpretability, a Score-CAM-generated heatmap was applied to provide visual explanations.

Results

Experimental results reveal that the proposed Swin-MCSFNet achieves promising performance, yielding a micro-receiver operating characteristic (ROC) of 0.93 and ROC scores of 0.96, 0.81, and 0.96 for the "Good," "Usable," and "Reject" categories, respectively. These scores underscore the accuracy of RIQA based on Swin-MCSF in distinguishing among the three categories. Furthermore, heatmaps generated across different RIQA classification scores and various color spaces suggest that regions in the retinal images from multiple color spaces contribute significantly to the decision-making process of the Swin-MCSFNet classifier.

Conclusions

Our study demonstrates that the proposed Swin-MCSFNet outperforms other methods in experiments conducted on multiple datasets, as evidenced by the superior performance metrics and insightful Score-CAM heatmaps.

Translational Relevance

This study constructs a new retinal image quality evaluation system, which will contribute to the subsequent research of retinal images.

Collapse

Deng J, Heybati K, Shammas-Toma M. When vision meets reality: Exploring the clinical applicability of GPT-4 with vision. Clin Imaging 2024;108:110101. [PMID: 38341880 DOI: 10.1016/j.clinimag.2024.110101] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2023] [Revised: 01/29/2024] [Accepted: 02/01/2024] [Indexed: 02/13/2024]

Hong SJ, Hou JU, Chung MJ, Kang SH, Shim BS, Lee SL, Park DH, Choi A, Oh JY, Lee KJ, Shin E, Cho E, Park SW. Convolutional neural network model for automatic recognition and classification of pancreatic cancer cell based on analysis of lipid droplet on unlabeled sample by 3D optical diffraction tomography. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2024;246:108041. [PMID: 38325025 DOI: 10.1016/j.cmpb.2024.108041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Revised: 01/05/2024] [Accepted: 01/19/2024] [Indexed: 02/09/2024]

Abstract

INTRODUCTION

Pancreatic cancer cells generally accumulate large numbers of lipid droplets (LDs), which regulate lipid storage. To promote rapid diagnosis, an automatic pancreatic cancer cell recognition system based on a deep convolutional neural network was proposed in this study using quantitative images of LDs from stain-free cytologic samples by optical diffraction tomography.

METHODS

We retrieved 3D refractive index tomograms and reconstructed 37 optical images of one cell. From the four cell lines, the obtained fields were separated into training and test datasets with 10,397 and 3,478 images, respectively. Furthermore, we adopted several machine learning techniques based on a single image-based prediction model to improve the performance of the computer-aided diagnostic system.

RESULTS

Pancreatic cancer cells had a significantly lower total cell volume and dry mass than did normal pancreatic cells and were accompanied by greater numbers of lipid droplets (LDs). When evaluating multitask learning techniques utilizing the EfficientNet-b3 model through confusion matrices, the overall 2-category accuracy for cancer classification reached 96.7 %. Simultaneously, the overall 4-category accuracy for individual cell line classification achieved a high accuracy of 96.2 %. Furthermore, when we added the core techniques one by one, the overall performance of the proposed technique significantly improved, reaching an area under the curve (AUC) of 0.997 and an accuracy of 97.06 %. Finally, the AUC reached 0.998 through the ablation study with the score fusion technique.

DISCUSSION

Our novel training strategy has significant potential for automating and promoting rapid recognition of pancreatic cancer cells. In the near future, deep learning-embedded medical devices will substitute laborious manual cytopathologic examinations for sustainable economic potential.

Collapse

Affiliation(s)

Seok Jin Hong Department of Otolaryngology-Head and Neck Surgery, Kangbuk Samsung Hospital Sungkyunkwan University School of Medicine, Seoul, Republic of Korea
Jong-Uk Hou School of Software, Hallym University, Chuncheon, Republic of Korea
Moon Jae Chung Division of Gastroenterology, Department of Internal Medicine, Severance Hospital, Yonsei University College of Medicine, Seoul, Republic of Korea
Sung Hun Kang Department of Otolaryngology-Head and Neck Surgery, Kangbuk Samsung Hospital Sungkyunkwan University School of Medicine, Seoul, Republic of Korea
Bo-Seok Shim School of Software, Hallym University, Chuncheon, Republic of Korea
Seung-Lee Lee School of Software, Hallym University, Chuncheon, Republic of Korea
Da Hae Park Division of Gastroenterology, Department of Internal Medicine, Hallym University Dongtan Sacred Heart Hospital, Hallym University College of Medicine, 7, Keunjaebong-gil, Hwaseong-si, Gyeonggi-do 18450, Republic of Korea
Anna Choi Division of Gastroenterology, Department of Internal Medicine, Hallym University Dongtan Sacred Heart Hospital, Hallym University College of Medicine, 7, Keunjaebong-gil, Hwaseong-si, Gyeonggi-do 18450, Republic of Korea
Jae Yeon Oh Hallym University College of Medicine, Chuncheon, Republic of Korea
Kyong Joo Lee Division of Gastroenterology, Department of Internal Medicine, Hallym University Dongtan Sacred Heart Hospital, Hallym University College of Medicine, 7, Keunjaebong-gil, Hwaseong-si, Gyeonggi-do 18450, Republic of Korea
Eun Shin Department of Pathology, Hallym University Dongtan Sacred Heart Hospital, Hallym University College of Medicine, Hwaseong, Republic of Korea
Eunae Cho Division of Gastroenterology, Department of Internal Medicine, Chonnam National University Hospital, Gwangju, Republic of Korea
Se Woo Park Division of Gastroenterology, Department of Internal Medicine, Hallym University Dongtan Sacred Heart Hospital, Hallym University College of Medicine, 7, Keunjaebong-gil, Hwaseong-si, Gyeonggi-do 18450, Republic of Korea.

Collapse

Famiglini L, Campagner A, Barandas M, La Maida GA, Gallazzi E, Cabitza F. Evidence-based XAI: An empirical approach to design more effective and explainable decision support systems. Comput Biol Med 2024;170:108042. [PMID: 38308866 DOI: 10.1016/j.compbiomed.2024.108042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Revised: 12/19/2023] [Accepted: 01/26/2024] [Indexed: 02/05/2024]

Fuentes AM, Milligan K, Wiebe M, Narayan A, Lum JJ, Brolo AG, Andrews JL, Jirasek A. Stratification of tumour cell radiation response and metabolic signatures visualization with Raman spectroscopy and explainable convolutional neural network. Analyst 2024;149:1645-1657. [PMID: 38312026 DOI: 10.1039/d3an01797d] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2024]

Abstract

Reprogramming of cellular metabolism is a driving factor of tumour progression and radiation therapy resistance. Identifying biochemical signatures associated with tumour radioresistance may assist with the development of targeted treatment strategies to improve clinical outcomes. Raman spectroscopy (RS) can monitor post-irradiation biomolecular changes and signatures of radiation response in tumour cells in a label-free manner. Convolutional Neural Networks (CNN) perform feature extraction directly from data in an end-to-end learning manner, with high classification performance. Furthermore, recently developed CNN explainability techniques help visualize the critical discriminative features captured by the model. In this work, a CNN is developed to characterize tumour response to radiotherapy based on its degree of radioresistance. The model was trained to classify Raman spectra of three human tumour cell lines as radiosensitive (LNCaP) or radioresistant (MCF7, H460) over a range of treatment doses and data collection time points. Additionally, a method based on Gradient-Weighted Class Activation Mapping (Grad-CAM) was used to determine response-specific salient Raman peaks influencing the CNN predictions. The CNN effectively classified the cell spectra, with accuracy, sensitivity, specificity, and F1 score exceeding 99.8%. Grad-CAM heatmaps of H460 and MCF7 cell spectra (radioresistant) exhibited high contributions from Raman bands tentatively assigned to glycogen, amino acids, and nucleic acids. Conversely, heatmaps of LNCaP cells (radiosensitive) revealed activations at lipid and phospholipid bands. Finally, Grad-CAM variable importance scores were derived for glycogen, asparagine, and phosphatidylcholine, and we show that their trends over cell line, dose, and acquisition time agreed with previously established models. Thus, the CNN can accurately detect biomolecular differences in the Raman spectra of tumour cells of varying radiosensitivity without requiring manual feature extraction. Finally, Grad-CAM may help identify metabolic signatures associated with the observed categories, offering the potential for automated clinical tumour radiation response characterization.

Collapse

Yao Y, Yang J, Sun H, Kong H, Wang S, Xu K, Dai W, Jiang S, Bai Q, Xing S, Yuan J, Liu X, Lu F, Chen Z, Qu J, Su J. DeepGraFT: A novel semantic segmentation auxiliary ROI-based deep learning framework for effective fundus tessellation classification. Comput Biol Med 2024;169:107881. [PMID: 38159401 DOI: 10.1016/j.compbiomed.2023.107881] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Revised: 12/04/2023] [Accepted: 12/18/2023] [Indexed: 01/03/2024]

Abstract

Fundus tessellation (FT) is a prevalent clinical feature associated with myopia and has implications in the development of myopic maculopathy, which causes irreversible visual impairment. Accurate classification of FT in color fundus photo can help predict the disease progression and prognosis. However, the lack of precise detection and classification tools has created an unmet medical need, underscoring the importance of exploring the clinical utility of FT. Thus, to address this gap, we introduce an automatic FT grading system (called DeepGraFT) using classification-and-segmentation co-decision models by deep learning. ConvNeXt, utilizing transfer learning from pretrained ImageNet weights, was employed for the classification algorithm, aligning with a region of interest based on the ETDRS grading system to boost performance. A segmentation model was developed to detect FT exits, complementing the classification for improved grading accuracy. The training set of DeepGraFT was from our in-house cohort (MAGIC), and the validation sets consisted of the rest part of in-house cohort and an independent public cohort (UK Biobank). DeepGraFT demonstrated a high performance in the training stage and achieved an impressive accuracy in validation phase (in-house cohort: 86.85 %; public cohort: 81.50 %). Furthermore, our findings demonstrated that DeepGraFT surpasses machine learning-based classification models in FT classification, achieving a 5.57 % increase in accuracy. Ablation analysis revealed that the introduced modules significantly enhanced classification effectiveness and elevated accuracy from 79.85 % to 86.85 %. Further analysis using the results provided by DeepGraFT unveiled a significant negative association between FT and spherical equivalent (SE) in the UK Biobank cohort. In conclusion, DeepGraFT accentuates potential benefits of the deep learning model in automating the grading of FT and allows for potential utility as a clinical-decision support tool for predicting progression of pathological myopia.

Collapse

Affiliation(s)

Yinghao Yao Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Eye Hospital, Wenzhou Medical University, Wenzhou, 325011, Zhejiang, China; National Engineering Research Center of Ophthalmology and Optometry, Eye Hospital, Wenzhou Medical University, Wenzhou, 325027, Zhejiang, China
Jiaying Yang Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Eye Hospital, Wenzhou Medical University, Wenzhou, 325011, Zhejiang, China; National Engineering Research Center of Ophthalmology and Optometry, Eye Hospital, Wenzhou Medical University, Wenzhou, 325027, Zhejiang, China
Haojun Sun Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Eye Hospital, Wenzhou Medical University, Wenzhou, 325011, Zhejiang, China; National Engineering Research Center of Ophthalmology and Optometry, Eye Hospital, Wenzhou Medical University, Wenzhou, 325027, Zhejiang, China
Hengte Kong Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Eye Hospital, Wenzhou Medical University, Wenzhou, 325011, Zhejiang, China; National Engineering Research Center of Ophthalmology and Optometry, Eye Hospital, Wenzhou Medical University, Wenzhou, 325027, Zhejiang, China
Sheng Wang Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Eye Hospital, Wenzhou Medical University, Wenzhou, 325011, Zhejiang, China; National Engineering Research Center of Ophthalmology and Optometry, Eye Hospital, Wenzhou Medical University, Wenzhou, 325027, Zhejiang, China
Ke Xu National Engineering Research Center of Ophthalmology and Optometry, Eye Hospital, Wenzhou Medical University, Wenzhou, 325027, Zhejiang, China
Wei Dai National Engineering Research Center of Ophthalmology and Optometry, Eye Hospital, Wenzhou Medical University, Wenzhou, 325027, Zhejiang, China
Siyi Jiang Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Eye Hospital, Wenzhou Medical University, Wenzhou, 325011, Zhejiang, China; National Engineering Research Center of Ophthalmology and Optometry, Eye Hospital, Wenzhou Medical University, Wenzhou, 325027, Zhejiang, China
QingShi Bai Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Eye Hospital, Wenzhou Medical University, Wenzhou, 325011, Zhejiang, China; National Engineering Research Center of Ophthalmology and Optometry, Eye Hospital, Wenzhou Medical University, Wenzhou, 325027, Zhejiang, China
Shilai Xing Institute of PSI Genomics, Wenzhou Global Eye & Vision Innovation Center, Wenzhou, 325024, China
Jian Yuan National Engineering Research Center of Ophthalmology and Optometry, Eye Hospital, Wenzhou Medical University, Wenzhou, 325027, Zhejiang, China
Xinting Liu National Engineering Research Center of Ophthalmology and Optometry, Eye Hospital, Wenzhou Medical University, Wenzhou, 325027, Zhejiang, China; National Clinical Research Center for Ocular Diseases, Eye Hospital, Wenzhou Medical University, Wenzhou, 325027, China
Fan Lu Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Eye Hospital, Wenzhou Medical University, Wenzhou, 325011, Zhejiang, China; National Engineering Research Center of Ophthalmology and Optometry, Eye Hospital, Wenzhou Medical University, Wenzhou, 325027, Zhejiang, China; National Clinical Research Center for Ocular Diseases, Eye Hospital, Wenzhou Medical University, Wenzhou, 325027, China
Zhenhui Chen National Engineering Research Center of Ophthalmology and Optometry, Eye Hospital, Wenzhou Medical University, Wenzhou, 325027, Zhejiang, China; National Clinical Research Center for Ocular Diseases, Eye Hospital, Wenzhou Medical University, Wenzhou, 325027, China.
Jia Qu Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Eye Hospital, Wenzhou Medical University, Wenzhou, 325011, Zhejiang, China; National Engineering Research Center of Ophthalmology and Optometry, Eye Hospital, Wenzhou Medical University, Wenzhou, 325027, Zhejiang, China; National Clinical Research Center for Ocular Diseases, Eye Hospital, Wenzhou Medical University, Wenzhou, 325027, China.
Jianzhong Su Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Eye Hospital, Wenzhou Medical University, Wenzhou, 325011, Zhejiang, China; National Engineering Research Center of Ophthalmology and Optometry, Eye Hospital, Wenzhou Medical University, Wenzhou, 325027, Zhejiang, China; National Clinical Research Center for Ocular Diseases, Eye Hospital, Wenzhou Medical University, Wenzhou, 325027, China.

Collapse

Zhang J, Jia X, Zhou J, Zhang J, Hu J. Weakly Supervised Solar Panel Mapping via Uncertainty Adjusted Label Transition in Aerial Images. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2024;33:881-896. [PMID: 38064328 DOI: 10.1109/tip.2023.3336170] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/21/2024]

Wang C, He N, Zhang Y, Li Y, Huang P, Liu Y, Jin Z, Cheng Z, Liu Y, Wang Y, Zhang C, Haacke EM, Chen S, Yan F, Yang G. Enhancing Nigrosome-1 Sign Identification via Interpretable AI using True Susceptibility Weighted Imaging. J Magn Reson Imaging 2024. [PMID: 38236577 DOI: 10.1002/jmri.29245] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Revised: 01/05/2024] [Accepted: 01/08/2024] [Indexed: 01/19/2024] Open

Abstract

BACKGROUND

Nigrosome 1 (N1), the largest nigrosome region in the ventrolateral area of the substantia nigra pars compacta, is identifiable by the "N1 sign" in long echo time gradient echo MRI. The N1 sign's absence is a vital Parkinson's disease (PD) diagnostic marker. However, it is challenging to visualize and assess the N1 sign in clinical practice.

PURPOSE

To automatically detect the presence or absence of the N1 sign from true susceptibility weighted imaging by using deep-learning method.

STUDY TYPE

Prospective.

POPULATION/SUBJECTS

453 subjects, including 225 PD patients, 120 healthy controls (HCs), and 108 patients with other movement disorders, were prospectively recruited including 227 males and 226 females. They were divided into training, validation, and test cohorts of 289, 73, and 91 cases, respectively.

FIELD STRENGTH/SEQUENCE

3D gradient echo SWI sequence at 3T; 3D multiecho strategically acquired gradient echo imaging at 3T; NM-sensitive 3D gradient echo sequence with MTC pulse at 3T.

ASSESSMENT

A neuroradiologist with 5 years of experience manually delineated substantia nigra regions. Two raters with 2 and 36 years of experience assessed the N1 sign on true susceptibility weighted imaging (tSWI), QSM with high-pass filter, and magnitude data combined with MTC data. We proposed NINet, a neural model, for automatic N1 sign identification in tSWI images.

STATISTICAL TESTS

We compared the performance of NINet to the subjective reference standard using Receiver Operating Characteristic analyses, and a decision curve analysis assessed identification accuracy.

RESULTS

NINet achieved an area under the curve (AUC) of 0.87 (CI: 0.76-0.89) in N1 sign identification, surpassing other models and neuroradiologists. NINet localized the putative N1 sign within tSWI images with 67.3% accuracy.

DATA CONCLUSION

Our proposed NINet model's capability to determine the presence or absence of the N1 sign, along with its localization, holds promise for enhancing diagnostic accuracy when evaluating PD using MR images.

LEVEL OF EVIDENCE

2 TECHNICAL EFFICACY: Stage 1.

Collapse

Herr J, Stoyanova R, Mellon EA. Convolutional Neural Networks for Glioma Segmentation and Prognosis: A Systematic Review. Crit Rev Oncog 2024;29:33-65. [PMID: 38683153 DOI: 10.1615/critrevoncog.2023050852] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/01/2024]

Zhang C, Dong K, Aihara K, Chen L, Zhang S. STAMarker: determining spatial domain-specific variable genes with saliency maps in deep learning. Nucleic Acids Res 2023;51:e103. [PMID: 37811885 PMCID: PMC10639070 DOI: 10.1093/nar/gkad801] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2022] [Revised: 08/26/2023] [Accepted: 09/19/2023] [Indexed: 10/10/2023] Open

Sujatha Ravindran A, Contreras-Vidal J. An empirical comparison of deep learning explainability approaches for EEG using simulated ground truth. Sci Rep 2023;13:17709. [PMID: 37853010 PMCID: PMC10584975 DOI: 10.1038/s41598-023-43871-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2023] [Accepted: 09/29/2023] [Indexed: 10/20/2023] Open

Szczepankiewicz K, Popowicz A, Charkiewicz K, Nałęcz-Charkiewicz K, Szczepankiewicz M, Lasota S, Zawistowski P, Radlak K. Ground truth based comparison of saliency maps algorithms. Sci Rep 2023;13:16887. [PMID: 37803108 PMCID: PMC10558518 DOI: 10.1038/s41598-023-42946-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Accepted: 09/16/2023] [Indexed: 10/08/2023] Open

Zheng Y, Huang D, Hao X, Wei J, Lu H, Liu Y. UniVisNet: A Unified Visualization and Classification Network for accurate grading of gliomas from MRI. Comput Biol Med 2023;165:107332. [PMID: 37598632 DOI: 10.1016/j.compbiomed.2023.107332] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Revised: 07/30/2023] [Accepted: 08/07/2023] [Indexed: 08/22/2023]

Abstract

Accurate grading of brain tumors plays a crucial role in the diagnosis and treatment of glioma. While convolutional neural networks (CNNs) have shown promising performance in this task, their clinical applicability is still constrained by the interpretability and robustness of the models. In the conventional framework, the classification model is trained first, and then visual explanations are generated. However, this approach often leads to models that prioritize classification performance or complexity, making it difficult to achieve a precise visual explanation. Motivated by these challenges, we propose the Unified Visualization and Classification Network (UniVisNet), a novel framework that aims to improve both the classification performance and the generation of high-resolution visual explanations. UniVisNet addresses attention misalignment by introducing a subregion-based attention mechanism, which replaces traditional down-sampling operations. Additionally, multiscale feature maps are fused to achieve higher resolution, enabling the generation of detailed visual explanations. To streamline the process, we introduce the Unified Visualization and Classification head (UniVisHead), which directly generates visual explanations without the need for additional separation steps. Through extensive experiments, our proposed UniVisNet consistently outperforms strong baseline classification models and prevalent visualization methods. Notably, UniVisNet achieves remarkable results on the glioma grading task, including an AUC of 94.7%, an accuracy of 89.3%, a sensitivity of 90.4%, and a specificity of 85.3%. Moreover, UniVisNet provides visually interpretable explanations that surpass existing approaches. In conclusion, UniVisNet innovatively generates visual explanations in brain tumor grading by simultaneously improving the classification performance and generating high-resolution visual explanations. This work contributes to the clinical application of deep learning, empowering clinicians with comprehensive insights into the spatial heterogeneity of glioma.

Collapse

Baraheem SS, Nguyen TV. AI vs. AI: Can AI Detect AI-Generated Images? J Imaging 2023;9:199. [PMID: 37888306 PMCID: PMC10607823 DOI: 10.3390/jimaging9100199] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Revised: 09/13/2023] [Accepted: 09/27/2023] [Indexed: 10/28/2023] Open

Lei Y, Wang T, Roper J, Tian S, Patel P, Bradley JD, Jani AB, Liu T, Yang X. Automatic segmentation of neurovascular bundle on mri using deep learning based topological modulated network. Med Phys 2023;50:5479-5488. [PMID: 36939189 PMCID: PMC10509305 DOI: 10.1002/mp.16378] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2022] [Revised: 01/20/2023] [Accepted: 03/09/2023] [Indexed: 03/21/2023] Open

Abstract

PURPOSE

Radiation damage on neurovascular bundles (NVBs) may be the cause of sexual dysfunction after radiotherapy for prostate cancer. However, it is challenging to delineate NVBs as organ-at-risks from planning CTs during radiotherapy. Recently, the integration of MR into radiotherapy made NVBs contour delineating possible. In this study, we aim to develop an MRI-based deep learning method for automatic NVB segmentation.

METHODS

The proposed method, named topological modulated network, consists of three subnetworks, that is, a focal modulation, a hierarchical block and a topological fully convolutional network (FCN). The focal modulation is used to derive the location and bounds of left and right NVBs', namely the candidate volume-of-interests (VOIs). The hierarchical block aims to highlight the NVB boundaries information on derived feature map. The topological FCN then segments the NVBs inside the VOIs by considering the topological consistency nature of the vascular delineating. Based on the location information of candidate VOIs, the segmentations of NVBs can then be brought back to the input MRI's coordinate system.

RESULTS

A five-fold cross-validation study was performed on 60 patient cases to evaluate the performance of the proposed method. The segmented results were compared with manual contours. The Dice similarity coefficient (DSC) and 95th percentile Hausdorff distance (HD95 ) are (left NVB) 0.81 ± 0.10, 1.49 ± 0.88 mm, and (right NVB) 0.80 ± 0.15, 1.54 ± 1.22 mm, respectively.

CONCLUSION

We proposed a novel deep learning-based segmentation method for NVBs on pelvic MR images. The good segmentation agreement of our method with the manually drawn ground truth contours supports the feasibility of the proposed method, which can be potentially used to spare NVBs during proton and photon radiotherapy and thereby improve the quality of life for prostate cancer patients.

Collapse

Sun R, Wei C, Jiang Z, Huang G, Xie Y, Nie S. Weakly Supervised Breast Lesion Detection in Dynamic Contrast-Enhanced MRI. J Digit Imaging 2023;36:1553-1564. [PMID: 37253896 PMCID: PMC10406986 DOI: 10.1007/s10278-023-00846-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Revised: 05/05/2023] [Accepted: 05/08/2023] [Indexed: 06/01/2023] Open

Zheng Y, Huang D, Feng Y, Hao X, He Y, Liu Y. CSF-Glioma: A Causal Segmentation Framework for Accurate Grading and Subregion Identification of Gliomas. Bioengineering (Basel) 2023;10:887. [PMID: 37627772 PMCID: PMC10451284 DOI: 10.3390/bioengineering10080887] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Revised: 07/22/2023] [Accepted: 07/24/2023] [Indexed: 08/27/2023] Open

TCNN: A Transformer Convolutional Neural Network for artifact classification in whole slide images. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2023.104812] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/14/2023]

Yuan J, Wu F, Li Y, Li J, Huang G, Huang Q. DPDH-CapNet: A Novel Lightweight Capsule Network with Non-routing for COVID-19 Diagnosis Using X-ray Images. J Digit Imaging 2023;36:988-1000. [PMID: 36813978 PMCID: PMC9946284 DOI: 10.1007/s10278-023-00791-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Revised: 01/26/2023] [Accepted: 01/29/2023] [Indexed: 02/24/2023] Open

Zhu X, Sun J, Liu G, Shen C, Dai Z, Zhao L. Hybrid Domain Consistency Constraints-Based Deep Neural Network for Facial Expression Recognition. SENSORS (BASEL, SWITZERLAND) 2023;23:s23115201. [PMID: 37299930 DOI: 10.3390/s23115201] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Revised: 05/23/2023] [Accepted: 05/28/2023] [Indexed: 06/12/2023]

Lin SY, Chiang PL, Chen MH, Lee MY, Lin WC, Chen YS. DGA3-Net: A parameter-efficient deep learning model for ASPECTS assessment for acute ischemic stroke using non-contrast computed tomography. Neuroimage Clin 2023;38:103441. [PMID: 37224605 PMCID: PMC10225927 DOI: 10.1016/j.nicl.2023.103441] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2023] [Revised: 05/15/2023] [Accepted: 05/16/2023] [Indexed: 05/26/2023]

Abstract

Detecting the early signs of stroke using non-contrast computerized tomography (NCCT) is essential for the diagnosis of acute ischemic stroke (AIS). However, the hypoattenuation in NCCT is difficult to precisely identify, and accurate assessments of the Alberta Stroke Program Early CT Score (ASPECTS) are usually time-consuming and require experienced neuroradiologists. To this end, this study proposes DGA3-Net, a convolutional neural network (CNN)-based model for ASPECTS assessment via detecting early ischemic changes in ASPECTS regions. DGA3-Net is based on a novel parameter-efficient dihedral group CNN encoder to exploit the rotation and reflection symmetry of convolution kernels. The bounding volume of each ASPECTS region is extracted from the encoded feature, and an attention-guided slice aggregation module is used to aggregate features from all slices. An asymmetry-aware classifier is then used to predict stroke presence via comparison between ASPECTS regions from the left and right hemispheres. Pre-treatment NCCTs of suspected AIS patients were collected retrospectively, which consists of a primary dataset (n = 170) and an external validation dataset (n = 90), with expert consensus ASPECTS readings as ground truth. DGA3-Net outperformed two expert neuroradiologists in regional stroke identification (F1 = 0.69) and ASPECTS evaluation (Cohen's weighted Kappa = 0.70). Our ablation study also validated the efficacy of the proposed model design. In addition, class-relevant areas highlighted by visualization techniques corresponded highly with various well-established qualitative imaging signs, further validating the learned representation. This study demonstrates the potential of deep learning techniques for timely and accurate AIS diagnosis from NCCT, which could substantially improve the quality of treatment for AIS patients.

Collapse

Watanabe N, Miyoshi K, Jimura K, Shimane D, Keerativittayayut R, Nakahara K, Takeda M. Multimodal deep neural decoding reveals highly resolved spatiotemporal profile of visual object representation in humans. Neuroimage 2023;275:120164. [PMID: 37169115 DOI: 10.1016/j.neuroimage.2023.120164] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2022] [Revised: 05/02/2023] [Accepted: 05/09/2023] [Indexed: 05/13/2023] Open

Yang S, Xing Z, Wang H, Gao X, Dong X, Yao Y, Zhang R, Zhang X, Li S, Zhao Y, Liu Z. Classification and localization of maize leaf spot disease based on weakly supervised learning. FRONTIERS IN PLANT SCIENCE 2023;14:1128399. [PMID: 37223797 PMCID: PMC10201986 DOI: 10.3389/fpls.2023.1128399] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/20/2022] [Accepted: 04/10/2023] [Indexed: 05/25/2023]

Affiliation(s)

Shuai Yang College of Land Science and Technology, China Agricultural University, Beijing, China Key Laboratory of Remote Sensing for Agri-Hazards, Ministry of Agriculture and Rural Affairs, Beijing, China
Ziyao Xing College of Land Science and Technology, China Agricultural University, Beijing, China Key Laboratory of Remote Sensing for Agri-Hazards, Ministry of Agriculture and Rural Affairs, Beijing, China
Hengbin Wang College of Land Science and Technology, China Agricultural University, Beijing, China Key Laboratory of Remote Sensing for Agri-Hazards, Ministry of Agriculture and Rural Affairs, Beijing, China
Xiang Gao College of Land Science and Technology, China Agricultural University, Beijing, China Key Laboratory of Remote Sensing for Agri-Hazards, Ministry of Agriculture and Rural Affairs, Beijing, China
Xinrui Dong College of Land Science and Technology, China Agricultural University, Beijing, China Key Laboratory of Remote Sensing for Agri-Hazards, Ministry of Agriculture and Rural Affairs, Beijing, China
Yu Yao College of Land Science and Technology, China Agricultural University, Beijing, China Key Laboratory of Remote Sensing for Agri-Hazards, Ministry of Agriculture and Rural Affairs, Beijing, China
Runda Zhang College of Land Science and Technology, China Agricultural University, Beijing, China Key Laboratory of Remote Sensing for Agri-Hazards, Ministry of Agriculture and Rural Affairs, Beijing, China
Xiaodong Zhang College of Land Science and Technology, China Agricultural University, Beijing, China Key Laboratory of Remote Sensing for Agri-Hazards, Ministry of Agriculture and Rural Affairs, Beijing, China
Shaoming Li College of Land Science and Technology, China Agricultural University, Beijing, China Key Laboratory of Remote Sensing for Agri-Hazards, Ministry of Agriculture and Rural Affairs, Beijing, China
Yuanyuan Zhao College of Land Science and Technology, China Agricultural University, Beijing, China Key Laboratory of Remote Sensing for Agri-Hazards, Ministry of Agriculture and Rural Affairs, Beijing, China
Zhe Liu College of Land Science and Technology, China Agricultural University, Beijing, China Key Laboratory of Remote Sensing for Agri-Hazards, Ministry of Agriculture and Rural Affairs, Beijing, China

Collapse

Mao J, Qiu S, Wei W, He H. Cross-modal guiding and reweighting network for multi-modal RSVP-based target detection. Neural Netw 2023;161:65-82. [PMID: 36736001 DOI: 10.1016/j.neunet.2023.01.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Revised: 10/31/2022] [Accepted: 01/11/2023] [Indexed: 01/17/2023]

Meng Z, Zhu Y, Pang W, Tian J, Nie F, Wang K. MSMFN: An Ultrasound Based Multi-Step Modality Fusion Network for Identifying the Histologic Subtypes of Metastatic Cervical Lymphadenopathy. IEEE TRANSACTIONS ON MEDICAL IMAGING 2023;42:996-1008. [PMID: 36383594 DOI: 10.1109/tmi.2022.3222541] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]

Mukhtorov D, Rakhmonova M, Muksimova S, Cho YI. Endoscopic Image Classification Based on Explainable Deep Learning. SENSORS (BASEL, SWITZERLAND) 2023;23:3176. [PMID: 36991887 PMCID: PMC10058443 DOI: 10.3390/s23063176] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/26/2023] [Revised: 03/09/2023] [Accepted: 03/10/2023] [Indexed: 06/19/2023]

Xiang T, Liu H, Guo S, Gan Y, He W, Liao X. Towards Query Efficient Black-Box Attacks: A Universal Dual Transferability-Based Framework. ACM T INTEL SYST TEC 2023. [DOI: 10.1145/3583777] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/15/2023]

Syed S, Anderssen KE, Stormo SK, Kranz M. Weakly supervised semantic segmentation for MRI: exploring the advantages and disadvantages of class activation maps for biological image segmentation with soft boundaries. Sci Rep 2023;13:2574. [PMID: 36781947 PMCID: PMC9925800 DOI: 10.1038/s41598-023-29665-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Accepted: 02/08/2023] [Indexed: 02/15/2023] Open

TSSK-Net: Weakly supervised biomarker localization and segmentation with image-level annotation in retinal OCT images. Comput Biol Med 2023;153:106467. [PMID: 36584602 DOI: 10.1016/j.compbiomed.2022.106467] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2022] [Revised: 11/16/2022] [Accepted: 12/19/2022] [Indexed: 12/24/2022]

Abstract

The localization and segmentation of biomarkers in OCT images are critical steps in retina-related disease diagnosis. Although fully supervised deep learning models can segment pathological regions, their performance relies on labor-intensive pixel-level annotations. Compared with dense pixel-level annotation, image-level annotation can reduce the burden of manual annotation. Existing methods for image-level annotation are usually based on class activation maps (CAM). However, current methods still suffer from model collapse, training instability, and anatomical mismatch due to the considerable variation in retinal biomarkers' shape, texture, and size. This paper proposes a novel weakly supervised biomarkers localization and segmentation method, requiring only image-level annotations. The technique is a Teacher-Student network with joint Self-supervised contrastive learning and Knowledge distillation-based anomaly localization, namely TSSK-Net. Specifically, we treat retinal biomarker regions as abnormal regions distinct from normal regions. First, we propose a novel pre-training strategy based on supervised contrastive learning that encourages the model to learn the anatomical structure of normal OCT images. Second, we design a fine-tuning module and propose a novel hybrid network structure. The network includes supervised contrastive loss for feature learning and cross-entropy loss for classification learning. To further improve the performance, we propose an efficient strategy to combine these two losses to preserve the anatomical structure and enhance the encoding representation of features. Finally, we design a knowledge distillation-based anomaly segmentation method that is effectively combined with the previous model to alleviate the challenge of insufficient supervision. Experimental results on a local dataset and a public dataset demonstrated the effectiveness of our proposed method. Our proposed method can effectively reduce the annotation burden of ophthalmologists in OCT images.

Collapse

Deeply Explain CNN Via Hierarchical Decomposition. Int J Comput Vis 2023. [DOI: 10.1007/s11263-022-01746-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]

Weakly-supervised localization and classification of biomarkers in OCT images with integrated reconstruction and attention. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2022.104213] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]

Wang C, Zhang Y, Xu S, Liu Y, Xie L, Wu C, Yang Q, Chu Y, Ye Q. Research on Assistant Diagnosis of Fundus Optic Neuropathy Based on Deep Learning. Curr Eye Res 2023;48:51-59. [PMID: 36264060 DOI: 10.1080/02713683.2022.2138917] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]