1. Wang T, Dai Q, Xiong W. EScarcityS: A framework for enhancing medical image classification performance in scarcity of trainable samples scenarios. Neural Netw 2025;189:107573. [PMID: 40382989] [DOI: 10.1016/j.neunet.2025.107573]
Abstract
In the field of healthcare, the acquisition and annotation of medical images present significant challenges, resulting in a scarcity of trainable samples. This data limitation hinders the performance of deep learning models, creating bottlenecks in clinical applications. To address this issue, we construct a framework (EScarcityS) aimed at enhancing the success rate of disease diagnosis in scenarios where trainable medical images are scarce. Firstly, considering that Transformer-based deep learning networks rely on large amounts of training data, this study takes into account the unique characteristics of pathological regions. By extracting the feature representations of all particles in medical images at different granularities, a multi-granularity Transformer network (MGVit) is designed. This network leverages additional prior knowledge to assist the Transformer network during training, thereby reducing the data requirement to some extent. Next, the importance maps of particles at different granularities, generated by MGVit, are fused to construct disease probability maps corresponding to the images. Based on these maps, a disease probability map-guided diffusion generation model is designed to generate more realistic and interpretable synthetic data. Subsequently, authentic and synthetic data are mixed and used to retrain MGVit, aiming to enhance the accuracy of medical image classification when trainable medical images are scarce. Finally, we conducted detailed experiments on four real medical image datasets to validate the effectiveness of EScarcityS and its specific modules.
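
The retraining step above, pooling authentic scans with diffusion-generated ones before fine-tuning, reduces to a few lines in PyTorch. A minimal sketch, assuming hypothetical data folders and a ResNet-18 stand-in for MGVit (whose implementation is not public):

```python
import torch
import torch.nn as nn
from torch.utils.data import ConcatDataset, DataLoader
from torchvision import datasets, models, transforms

tfm = transforms.Compose([transforms.Resize((224, 224)), transforms.ToTensor()])
real = datasets.ImageFolder("data/real", transform=tfm)        # authentic images (hypothetical path)
synth = datasets.ImageFolder("data/synthetic", transform=tfm)  # diffusion-generated images (hypothetical path)
loader = DataLoader(ConcatDataset([real, synth]), batch_size=32, shuffle=True)

model = models.resnet18(num_classes=len(real.classes))  # stand-in for MGVit
opt = torch.optim.AdamW(model.parameters(), lr=1e-4)
loss_fn = nn.CrossEntropyLoss()

model.train()
for images, labels in loader:          # one retraining epoch over the mixed data
    opt.zero_grad()
    loss = loss_fn(model(images), labels)
    loss.backward()
    opt.step()
```
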
Affiliation(s)
- Tianxiang Wang
- College of Artificial Intelligence, Nanjing University of Aeronautics and Astronautics, Nanjing, 211106, China; Key Laboratory of Brain-Machine Intelligence Technology, Ministry of Education, Nanjing, 211106, China
- Qun Dai
- College of Artificial Intelligence, Nanjing University of Aeronautics and Astronautics, Nanjing, 211106, China; Key Laboratory of Brain-Machine Intelligence Technology, Ministry of Education, Nanjing, 211106, China.
- Wei Xiong
- College of Computer Science, China University of Geosciences, Wuhan, 430078, China

2. Ma Y, Al-Aroomi MA, Zheng Y, Ren W, Liu P, Wu Q, Liang Y, Jiang C. Application of Mask R-CNN for automatic recognition of teeth and caries in cone-beam computerized tomography. BMC Oral Health 2025;25:927. [PMID: 40481434] [PMCID: PMC12143100] [DOI: 10.1186/s12903-025-06293-8]
Abstract
OBJECTIVES Deep convolutional neural networks (CNNs) are advancing rapidly in medical research, demonstrating promising results in diagnosis and prediction within radiology and pathology. This study evaluates the efficacy of deep learning algorithms for detecting and diagnosing dental caries on cone-beam computed tomography (CBCT) with the Mask R-CNN architecture, while comparing various hyperparameters to enhance detection. MATERIALS AND METHODS A total of 2,128 CBCT images were divided into training, validation, and test datasets in a 7:1:1 ratio. For verification of tooth recognition, data from the validation set were randomly selected for analysis. Three groups of Mask R-CNN networks were compared: a scratch-trained baseline using randomly initialized weights (group R); a transfer learning approach with models pre-trained on COCO for object detection (group C); and a variant pre-trained on ImageNet (group I). All configurations maintained identical hyperparameter settings to ensure fair comparison. The deep learning models used ResNet-50 as the backbone network and were trained for up to 300 epochs. We assessed training loss, detection and training times, diagnostic accuracy, specificity, positive and negative predictive values, and coverage precision to compare performance across the groups. RESULTS Transfer learning significantly reduced training times compared with the non-transfer-learning approach (p < 0.05). The average detection time for group R was 0.269 ± 0.176 s, whereas groups I (0.323 ± 0.196 s) and C (0.346 ± 0.195 s) exhibited significantly longer detection times (p < 0.05). Group C, trained for 200 epochs, achieved a mean average precision (mAP) of 81.095, outperforming all other groups. The mAP for caries recognition in group R, trained for 300 epochs, was 53.328, with detection times under 0.5 s. Overall, group C demonstrated significantly higher average precision across all epochs (100, 200, and 300) (p < 0.05). CONCLUSION Neural networks pre-trained on COCO exhibit superior annotation accuracy compared with those pre-trained on ImageNet. This suggests that COCO's diverse and richly annotated images offer more relevant features for detecting dental structures and carious lesions. Furthermore, employing ResNet-50 as the backbone architecture enhances the detection of teeth and carious regions, achieving significant improvements with just 200 training epochs, potentially increasing the efficiency of clinical image interpretation.
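
The group C setup (a COCO-pretrained Mask R-CNN with a ResNet-50 FPN backbone, re-headed for the dental classes) can be sketched in torchvision as follows; the three-class label scheme (background, tooth, caries) is an illustrative assumption:

```python
import torchvision
from torchvision.models.detection.faster_rcnn import FastRCNNPredictor
from torchvision.models.detection.mask_rcnn import MaskRCNNPredictor

num_classes = 3  # background, tooth, caries (assumed label scheme)

# COCO-pretrained Mask R-CNN with a ResNet-50 FPN backbone
model = torchvision.models.detection.maskrcnn_resnet50_fpn(weights="DEFAULT")

# Re-head the box branch for the dental classes
in_feat = model.roi_heads.box_predictor.cls_score.in_features
model.roi_heads.box_predictor = FastRCNNPredictor(in_feat, num_classes)

# Re-head the mask branch as well
in_feat_mask = model.roi_heads.mask_predictor.conv5_mask.in_channels
model.roi_heads.mask_predictor = MaskRCNNPredictor(in_feat_mask, 256, num_classes)
```
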
Affiliation(s)
- Yujie Ma
- Department of Oral and Maxillofacial Surgery, Center of Stomatology, Xiangya Hospital, Central South University, Changsha, Hunan Province, 410008, China
- Maged Ali Al-Aroomi
- Department of Oral and Maxillofacial Surgery, Center of Stomatology, Xiangya Hospital, Central South University, Changsha, Hunan Province, 410008, China
- Yutian Zheng
- The College of Mechanical and Electrical Engineering, Central South University, Changsha, Hunan Province, China
- Wenjie Ren
- Department of Oral and Maxillofacial Surgery, Center of Stomatology, Xiangya Hospital, Central South University, Changsha, Hunan Province, 410008, China
- Peixuan Liu
- Department of Oral and Maxillofacial Surgery, Center of Stomatology, Xiangya Hospital, Central South University, Changsha, Hunan Province, 410008, China
- Qing Wu
- High Performance Computing Center, Central South University, Changsha, Hunan Province, China
- Ye Liang
- Department of Oral and Maxillofacial Surgery, Center of Stomatology, Xiangya Hospital, Central South University, Changsha, Hunan Province, 410008, China.
- Canhua Jiang
- Department of Oral and Maxillofacial Surgery, Center of Stomatology, Xiangya Hospital, Central South University, Changsha, Hunan Province, 410008, China.

3. Matsubara N, Teramoto A, Takei M, Kitoh Y, Kawakami S. Retaking assessment system based on the inspiratory state of chest X-ray image. Radiol Phys Technol 2025;18:384-398. [PMID: 39969765] [PMCID: PMC12103368] [DOI: 10.1007/s12194-025-00888-0]
Abstract
When taking chest X-rays, the patient is encouraged to take maximum inspiration, and the radiological technologist takes the image at the appropriate time. If the image is not taken at maximum inspiration, a retake is required. However, judgments of whether a retake is necessary vary between operators. We therefore considered that this variation could be reduced by developing a retake assessment system that uses a convolutional neural network (CNN) to evaluate whether a retake is necessary. Training the CNN requires input chest X-ray images and corresponding labels indicating whether a retake is necessary. However, a chest X-ray image alone cannot be labeled as showing sufficient inspiration (no retake needed) or insufficient inspiration (retake required). Therefore, we generated input images and labels from dynamic digital radiography (DDR) and conducted the training. Verification using 18 dynamic chest X-ray cases (5,400 images) and 48 actual chest X-ray cases (96 images) showed that the VGG16-based architecture achieved an assessment accuracy of 82.3% even on actual chest X-ray images. If the proposed method were used in hospitals, it could therefore reduce the variability in judgment between operators.
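
The core of such a retake classifier is a VGG16 backbone with its final layer swapped for a two-way output. A minimal PyTorch sketch, assuming ImageNet initialization and an assumed class coding (the paper trains on DDR-derived labels):

```python
import torch
import torch.nn as nn
from torchvision import models

# VGG16 with the 1000-way ImageNet head replaced by a two-way head:
# class 0 = acceptable inspiration, class 1 = retake required (assumed coding).
model = models.vgg16(weights="IMAGENET1K_V1")
model.classifier[6] = nn.Linear(4096, 2)

x = torch.randn(1, 3, 224, 224)   # stand-in preprocessed chest X-ray
print(model(x).shape)             # torch.Size([1, 2])
```
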
Affiliation(s)
- Naoki Matsubara
- Division of Radiology, Shinshu University Hospital, 3-1-1 Asahi, Matsumoto, Nagano, 390-8621, Japan.
- Atsushi Teramoto
- Faculty of Engineering, Meijo University, 1-501 Shiogamaguchi, Tempaku-ku, Nagoya, 468-8502, Japan
- Manabu Takei
- Division of Radiology, Shinshu University Hospital, 3-1-1 Asahi, Matsumoto, Nagano, 390-8621, Japan
- Yoshihiro Kitoh
- Division of Radiology, Shinshu University Hospital, 3-1-1 Asahi, Matsumoto, Nagano, 390-8621, Japan
- Satoshi Kawakami
- Department of Radiology, Shinshu University School of Medicine, 3-1-1 Asahi, Matsumoto, 390-8621, Japan

4. Harris CE, Liu L, Almeida L, Kassick C, Makrogiannis S. Artificial intelligence in pediatric osteopenia diagnosis: evaluating deep network classification and model interpretability using wrist X-rays. Bone Rep 2025;25:101845. [PMID: 40343188] [PMCID: PMC12059325] [DOI: 10.1016/j.bonr.2025.101845]
Abstract
Osteopenia is a bone disorder that causes low bone density and affects millions of people worldwide. Diagnosis of this condition is commonly achieved through clinical assessment of bone mineral density (BMD). State-of-the-art machine learning (ML) techniques, such as convolutional neural networks (CNNs) and transformer models, have gained increasing popularity in medicine. In this work, we employ six deep networks for osteopenia vs. healthy bone classification using X-ray imaging from the pediatric wrist dataset GRAZPEDWRI-DX. We apply two explainable AI techniques to analyze and interpret visual explanations for network decisions. Experimental results show that deep networks are able to effectively learn osteopenic and healthy bone features, achieving high classification accuracy. Among the six evaluated networks, DenseNet201 with transfer learning yielded the top classification accuracy at 95.2%. Furthermore, visual explanations of CNN decisions provide valuable insight into the black-box inner workings and present interpretable results. Our evaluation highlights the capability of deep networks to accurately differentiate between osteopenic and healthy bones in pediatric wrist X-rays. The combination of high classification accuracy and interpretable visual explanations underscores the promise of incorporating machine learning techniques into clinical workflows for the early and accurate diagnosis of osteopenia.
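
One widely used way to obtain such visual explanations is Grad-CAM; a compact sketch for a DenseNet201 backbone follows. The target layer choice (the last dense block) and the random input are illustrative assumptions, not the paper's exact configuration:

```python
import torch
import torch.nn.functional as F
from torchvision import models

model = models.densenet201(weights="DEFAULT")
model.eval()

acts, grads = {}, {}
target = model.features.denseblock4  # last dense block as CAM target (a common choice)
target.register_forward_hook(lambda m, i, o: acts.update(v=o))
target.register_full_backward_hook(lambda m, gi, go: grads.update(v=go[0]))

x = torch.randn(1, 3, 224, 224)      # stand-in preprocessed wrist X-ray
model(x)[0].max().backward()         # gradient of the top class score

w = grads["v"].mean(dim=(2, 3), keepdim=True)           # channel importance
cam = F.relu((w * acts["v"]).sum(dim=1, keepdim=True))  # weighted activations
cam = F.interpolate(cam, size=(224, 224), mode="bilinear", align_corners=False)[0, 0]
cam = (cam - cam.min()) / (cam.max() - cam.min() + 1e-8)  # saliency map in [0, 1]
```
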
Affiliation(s)
- Chelsea E. Harris
- Division of Physics, Engineering, Mathematics, and Computer Science, Delaware State University, 1200 N. Dupont Hwy., Dover, 19901, DE, USA
- Lingling Liu
- Division of Physics, Engineering, Mathematics, and Computer Science, Delaware State University, 1200 N. Dupont Hwy., Dover, 19901, DE, USA
- Luiz Almeida
- Department of Orthopaedic Surgery, Duke University, 2080 Duke University Road, Durham, 27710, NC, USA
- Carolina Kassick
- Division of Physics, Engineering, Mathematics, and Computer Science, Delaware State University, 1200 N. Dupont Hwy., Dover, 19901, DE, USA
- Sokratis Makrogiannis
- Division of Physics, Engineering, Mathematics, and Computer Science, Delaware State University, 1200 N. Dupont Hwy., Dover, 19901, DE, USA

5. Ashi L, Taurin S. Computational modeling of breast tissue mechanics and machine learning in cancer diagnostics: enhancing precision in risk prediction and therapeutic strategies. Expert Rev Anticancer Ther 2025:1-14. [PMID: 40380913] [DOI: 10.1080/14737140.2025.2508850]
Abstract
INTRODUCTION Breast cancer remains a significant global health issue. Despite advances in detection and treatment, its complexity is driven by genetic, environmental, and structural factors. Computational methods like Finite Element Modeling (FEM) have transformed our understanding of breast cancer risk and progression. AREAS COVERED This review focuses on advanced computational approaches in breast cancer research, with an emphasis on FEM's role in simulating breast tissue mechanics and enhancing precision in therapies such as radiofrequency ablation (RFA). Machine learning (ML), particularly Convolutional Neural Networks (CNNs), has revolutionized the analysis of imaging modalities such as mammography and MRI, improving diagnostic accuracy and early detection. AI applications in analyzing histopathological images have advanced tumor classification and grading, offering consistency and reducing inter-observer variability. Explainability tools like Grad-CAM, SHAP, and LIME enhance the transparency of AI-driven models, facilitating their integration into clinical workflows. EXPERT OPINION Integrating FEM and ML represents a paradigm shift in breast cancer management. FEM offers precise modeling of tissue mechanics, while ML excels in predictive analytics and image analysis. Despite challenges such as data variability and limited standardization, synergizing these approaches promises adaptive, personalized care. These computational methods have the potential to redefine diagnostics, optimize treatment, and improve patient outcomes.
Affiliation(s)
- Layal Ashi
- Department of Molecular Medicine, College of Medicine and Health Sciences, Princess Al-Jawhara Center for Molecular Medicine and Inherited Disorders, Arabian Gulf University, Manama, Kingdom of Bahrain
- Sebastien Taurin
- Department of Molecular Medicine, College of Medicine and Health Sciences, Princess Al-Jawhara Center for Molecular Medicine and Inherited Disorders, Arabian Gulf University, Manama, Kingdom of Bahrain

6. Cai L, Williamson C, Nguyen A, Wittrup E, Najarian K. Adapting segment anything model for hematoma segmentation in traumatic brain injury. Discover Imaging 2025;2:6. [PMID: 40438440] [PMCID: PMC12106135] [DOI: 10.1007/s44352-025-00011-4]
Abstract
Hematoma segmentation in traumatic brain injury (TBI) is critical for accurate diagnosis and effective treatment planning. In this study, we evaluate various automated segmentation models, including state-of-the-art architectures as benchmarks, and compare their performance with our proposed SAM-Adapter method for segmenting hematomas in brain CT scans. By incorporating the adapter into the vanilla SAM model, we address the challenge of very limited annotated datasets in medical imaging, enhancing model performance and efficiency. We also find that domain-specific pre-processing, such as contrast adjustment, reduces the need for extensive pretraining, making the model more streamlined, and that performance benefited further from optimization and hyperparameter tuning. Our results demonstrate that the SAM-Adapter model achieved strong performance and reliability in identifying hematomas, with Dice 72.34%, IoU 59.78%, 95% HD 5.57, sensitivity 75.39%, and specificity 99.73%. Inter-observer variability was assessed, revealing that the model's Dice score (67.20%) was closely aligned with the human expert agreement Dice (63.79%), suggesting its potential clinical utility. External validation on the HemSeg-200 dataset, which contains 222 scans, demonstrates the robustness of our approach across diverse cases. These advancements in automatic segmentation hold promise for improving the accuracy and efficiency of TBI diagnosis, supporting clinical decision-making, and enhancing patient outcomes. Supplementary information: the online version contains supplementary material available at 10.1007/s44352-025-00011-4.
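
The adapter idea, freezing the large pretrained encoder and training only small bottleneck modules inserted into it, can be sketched generically; the layer sizes and placement below are assumptions, not the paper's exact SAM-Adapter design:

```python
import torch
import torch.nn as nn

class Adapter(nn.Module):
    """Residual bottleneck adapter: down-project, nonlinearity, up-project.
    Inserted into a frozen encoder so that only these small modules train."""
    def __init__(self, dim: int, bottleneck: int = 64):
        super().__init__()
        self.down = nn.Linear(dim, bottleneck)
        self.act = nn.GELU()
        self.up = nn.Linear(bottleneck, dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x + self.up(self.act(self.down(x)))

tokens = torch.randn(1, 4096, 256)   # stand-in tokens from a ViT image encoder
adapted = Adapter(dim=256)(tokens)
print(adapted.shape)                 # torch.Size([1, 4096, 256])
# In practice the encoder itself is frozen, e.g.:
# for p in sam.image_encoder.parameters(): p.requires_grad = False
```
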
Affiliation(s)
- Lingrui Cai
- Department of Computational Medicine and Bioinformatics, University of Michigan, 2800 Plymouth Road, Ann Arbor, 48109 MI USA
- Craig Williamson
- Department of Neurosurgery and Neurology, University of Michigan, 1500 E. Medical Center Drive, Ann Arbor, 48109 MI USA
- Andrew Nguyen
- Department of Neurosurgery and Neurology, University of Michigan, 1500 E. Medical Center Drive, Ann Arbor, 48109 MI USA
- Emily Wittrup
- Department of Computational Medicine and Bioinformatics, University of Michigan, 2800 Plymouth Road, Ann Arbor, 48109 MI USA
- Kayvan Najarian
- Department of Computational Medicine and Bioinformatics, University of Michigan, 2800 Plymouth Road, Ann Arbor, 48109 MI USA
- Michigan Institute for Data Science, University of Michigan, 500 Church Street, Ann Arbor, 48109 MI USA
- Max Harry Weil Institute for Critical Care Research and Innovation, University of Michigan, 2800 Plymouth Road, Ann Arbor, 48109 MI USA

7. Fang W, Tang S, Yan D, Dai X, Zhang W, Xiong J. Breast cancer pathology image recognition based on convolutional neural network. PLoS One 2025;20:e0311728. [PMID: 40388398] [PMCID: PMC12088023] [DOI: 10.1371/journal.pone.0311728]
Abstract
This study presents a convolutional neural network (CNN)-based method for the classification and recognition of breast cancer pathology images. It aims to address the shortcomings of traditional pathological tissue analysis, which is time-consuming and labour-intensive and can lead to misdiagnosis or missed diagnosis. Using the idea of ensemble learning, each image is divided into four and sixteen equal parts for data augmentation. Then, using the Inception-ResNet V2 neural network model and transfer learning, features are extracted from the pathological images, and a three-layer fully connected neural network is constructed for feature classification. During recognition, the network first classifies each sub-image and then averages the sub-image results to obtain the final classification. The experiments use BreaKHis, a breast cancer pathological image classification dataset containing 7,909 images from 82 patients and covering benign and malignant lesion types. We randomly select 80% of the data as the training set and 20% as the test set, and compare against the Inception-ResNet V2, ResNet101, DenseNet169, MobileNetV3, and EfficientNetV2 models. Experimental results show that, at the four magnifications of the BreaKHis dataset, the method achieves the highest accuracy rates of 99.75%, 98.31%, 98.51%, and 96.69%, substantially higher than those of the other models.
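
The sub-image ensemble at inference, classifying each tile and then averaging the results, can be sketched as follows; resizing tiles to the backbone's input resolution is an assumed detail:

```python
import torch
import torch.nn.functional as F

@torch.no_grad()
def tiled_prediction(model, image: torch.Tensor, grid: int = 2) -> torch.Tensor:
    """Split an image (C, H, W) into grid x grid tiles, classify each tile,
    and average the per-tile class probabilities."""
    c, h, w = image.shape
    th, tw = h // grid, w // grid
    probs = []
    for i in range(grid):
        for j in range(grid):
            tile = image[:, i*th:(i+1)*th, j*tw:(j+1)*tw].unsqueeze(0)
            tile = F.interpolate(tile, size=(224, 224), mode="bilinear",
                                 align_corners=False)  # backbone input size (assumed)
            probs.append(model(tile).softmax(dim=1))
    return torch.cat(probs).mean(dim=0)  # averaged class probabilities

# Four- and sixteen-part ensembles can then be combined, e.g.:
# final = 0.5 * tiled_prediction(model, img, 2) + 0.5 * tiled_prediction(model, img, 4)
```
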
Affiliation(s)
- Weijian Fang
- Chongqing Three Gorges University, Chongqing, China
- Shuyu Tang
- School of Computer Science and Engineering, School of Three Gorges Artificial Intelligence, and Key Laboratory of Intelligent Information Processing and Control, Chongqing Three Gorges University, Chongqing, China
- Dongfang Yan
- School of Computer Science and Engineering, School of Three Gorges Artificial Intelligence, and Key Laboratory of Intelligent Information Processing and Control, Chongqing Three Gorges University, Chongqing, China
- Xiangguang Dai
- School of Computer Science and Engineering, School of Three Gorges Artificial Intelligence, and Key Laboratory of Intelligent Information Processing and Control, Chongqing Three Gorges University, Chongqing, China
- Wei Zhang
- School of Computer Science and Engineering, School of Three Gorges Artificial Intelligence, and Key Laboratory of Intelligent Information Processing and Control, Chongqing Three Gorges University, Chongqing, China
- Jiang Xiong
- School of Computer Science and Engineering, School of Three Gorges Artificial Intelligence, and Key Laboratory of Intelligent Information Processing and Control, Chongqing Three Gorges University, Chongqing, China

8. Zhang Y, Huang YA, Hu Y, Liu R, Wu J, Huang ZA, Tan KC. CausalMixNet: A mixed-attention framework for causal intervention in robust medical image diagnosis. Med Image Anal 2025;103:103581. [PMID: 40359724] [DOI: 10.1016/j.media.2025.103581]
Abstract
Confounding factors inherent in medical images can significantly impact the causal exploration capabilities of deep learning models, resulting in compromised accuracy and diminished generalization performance. In this paper, we present an innovative methodology named CausalMixNet that employs query-mixed intra-attention and key&value-mixed inter-attention to probe causal relationships between input images and labels. To mitigate unobservable confounding factors, CausalMixNet integrates the non-local reasoning module (NLRM) and the key&value-mixed inter-attention (KVMIA) to conduct a front-door adjustment strategy. Furthermore, CausalMixNet incorporates a patch-masked ranking module (PMRM) and query-mixed intra-attention (QMIA) to enhance mediator learning, thereby facilitating causal intervention. The patch-mixing mechanism applied to query/(key&value) features within QMIA and KVMIA specifically targets lesion-related feature enhancement and average causal effect inference. CausalMixNet consistently outperforms existing methods, achieving superior accuracy and F1-scores across in-domain and out-of-domain scenarios on multiple datasets, with an average improvement of 3% over the closest competitor. Demonstrating robustness against noise, gender bias, and attribute bias, CausalMixNet excels in handling unobservable confounders, maintaining stable performance even in challenging conditions.
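
The patch-mixing idea can be illustrated with a toy function that swaps a random fraction of patch tokens between two feature sequences; this is a loose sketch of the concept only, not the paper's QMIA/KVMIA design:

```python
import torch

def mix_query_patches(q: torch.Tensor, q_other: torch.Tensor, ratio: float = 0.5) -> torch.Tensor:
    """Swap a random fraction of patch tokens between two query feature
    sequences of shape (B, N, D). A toy illustration of patch mixing."""
    swap = torch.rand(q.shape[0], q.shape[1], 1) < ratio   # per-token swap mask
    return torch.where(swap, q_other, q)

q1, q2 = torch.randn(2, 196, 64), torch.randn(2, 196, 64)
mixed = mix_query_patches(q1, q2)
print(mixed.shape)  # torch.Size([2, 196, 64])
```
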
Affiliation(s)
- Yajie Zhang
- Department of Data Science and Artificial Intelligence, The Hong Kong Polytechnic University, Hong Kong Special Administrative Region of China
- Yu-An Huang
- School of Computer Science, Northwestern Polytechnical University, Xi'an, China
- Yao Hu
- Department of Data Science and Artificial Intelligence, The Hong Kong Polytechnic University, Hong Kong Special Administrative Region of China
- Rui Liu
- Department of Data Science and Artificial Intelligence, The Hong Kong Polytechnic University, Hong Kong Special Administrative Region of China
- Jibin Wu
- Department of Data Science and Artificial Intelligence, The Hong Kong Polytechnic University, Hong Kong Special Administrative Region of China; Department of Computing, The Hong Kong Polytechnic University, Hong Kong Special Administrative Region of China; Research Center on Data Sciences and Artificial Intelligence, The Hong Kong Polytechnic University, Hong Kong Special Administrative Region of China
- Zhi-An Huang
- Department of Computer Science, City University of Hong Kong (Dongguan), Dongguan, China.
- Kay Chen Tan
- Department of Data Science and Artificial Intelligence, The Hong Kong Polytechnic University, Hong Kong Special Administrative Region of China; Research Center on Data Sciences and Artificial Intelligence, The Hong Kong Polytechnic University, Hong Kong Special Administrative Region of China

9. Sasmal P, Kumar Panigrahi S, Panda SL, Bhuyan MK. Attention-guided deep framework for polyp localization and subsequent classification via polyp local and Siamese feature fusion. Med Biol Eng Comput 2025. [PMID: 40314710] [DOI: 10.1007/s11517-025-03369-z]
Abstract
Colorectal cancer (CRC) is one of the leading causes of death worldwide. This paper proposes an automated diagnostic technique to detect, localize, and classify polyps in colonoscopy video frames. The proposed model adopts the deep YOLOv4 detector and incorporates spatial and contextual information through spatial attention and channel attention blocks, respectively, for better localization of polyps. Finally, leveraging a fusion of deep and handcrafted features, the detected polyps are classified as adenoma or non-adenoma. Polyp shape and texture are essential features for discriminating polyp types. Therefore, the proposed work utilizes a pyramid histogram of oriented gradients (PHOG) and embedding features learned via a triplet Siamese architecture to extract these features. PHOG extracts local shape information from each polyp class, whereas the Siamese network extracts intra-polyp discriminating features. The individual and cross-database performances on two databases suggest the robustness of our method in polyp localization. A competitive analysis based on significant clinical parameters against current state-of-the-art methods confirms that our method can be used for automated polyp localization in both real-time and offline colonoscopic video frames. Our method provides average precisions of 0.8971 and 0.9171 and F1 scores of 0.8869 and 0.8812 on the Kvasir-SEG and SUN databases, respectively. The proposed classification framework for the detected polyps yields a classification accuracy of 96.66% on a publicly available UCI colonoscopy video dataset, with an F1 score of 96.54%, validating the potential of the proposed framework in polyp localization and classification.
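
The PHOG descriptor concatenates orientation histograms computed over successively finer spatial grids. A sketch using scikit-image's hog, with illustrative resolution and level choices:

```python
import numpy as np
from skimage.feature import hog
from skimage.transform import resize

def phog_descriptor(gray: np.ndarray, levels: int = 3, bins: int = 9) -> np.ndarray:
    """Concatenate HOG histograms over successively finer grids
    (1x1, 2x2, 4x4 cells), approximating a PHOG shape descriptor."""
    gray = resize(gray, (128, 128))
    feats = []
    for level in range(levels):
        cells = 2 ** level
        feats.append(hog(gray,
                         orientations=bins,
                         pixels_per_cell=(128 // cells, 128 // cells),
                         cells_per_block=(1, 1),
                         feature_vector=True))
    return np.concatenate(feats)  # 9 + 36 + 144 = 189 features

desc = phog_descriptor(np.random.rand(200, 180))  # stand-in polyp crop
print(desc.shape)  # (189,)
```
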
Affiliation(s)
- Pradipta Sasmal
- Department of Electrical Engineering, Indian Institute of Technology, Kharagpur, West Bengal, 721302, India.
- Susant Kumar Panigrahi
- Department of Electrical Engineering, Indian Institute of Technology, Kharagpur, West Bengal, 721302, India
- Swarna Laxmi Panda
- Department of Electronics and Communication Engineering, National Institute of Technology, Rourkela, Odisha, 769008, India
- M K Bhuyan
- Department of Electronics and Electrical Engineering, Indian Institute of Technology, Guwahati, Assam, 781039, India

10. Hosseinzadeh Taher MR, Haghighi F, Gotway MB, Liang J. Large-scale benchmarking and boosting transfer learning for medical image analysis. Med Image Anal 2025;102:103487. [PMID: 40117988] [DOI: 10.1016/j.media.2025.103487]
Abstract
Transfer learning, particularly fine-tuning models pretrained on photographic images for medical images, has proven indispensable for medical image analysis. There are numerous models with distinct architectures pretrained on various datasets using different strategies, but there is a lack of up-to-date, large-scale evaluations of their transferability to medical imaging, posing a challenge for practitioners in selecting the most appropriate pretrained models for their tasks at hand. To fill this gap, we conduct a comprehensive systematic study, focusing on (i) benchmarking numerous conventional and modern convolutional neural network (ConvNet) and vision transformer architectures across various medical tasks; (ii) investigating the impact of fine-tuning data size on the performance of ConvNets compared with vision transformers in medical imaging; (iii) examining the impact of pretraining data granularity on transfer learning performance; (iv) evaluating the transferability of a wide range of recent self-supervised methods with diverse training objectives to a variety of medical tasks across different modalities; and (v) delving into the efficacy of domain-adaptive pretraining on both photographic and medical datasets to develop high-performance models for medical tasks. Our large-scale study (∼5,000 experiments) yields impactful insights: (1) ConvNets demonstrate higher transferability than vision transformers when fine-tuning for medical tasks; (2) ConvNets prove to be more annotation-efficient than vision transformers when fine-tuning for medical tasks; (3) fine-grained representations, rather than high-level semantic features, prove pivotal for fine-grained medical tasks; (4) self-supervised models excel at learning holistic features compared with supervised models; and (5) domain-adaptive pretraining leads to performant models by harnessing knowledge acquired from ImageNet and enhancing it through readily accessible expert annotations associated with medical datasets. As open science, all codes and pretrained models are available at GitHub.com/JLiangLab/BenchmarkTransferLearning (Version 2).
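
The benchmarking setup, many ImageNet-pretrained backbones fine-tuned under one protocol, maps naturally onto the timm library. A minimal sketch; the model names and the two-class head are illustrative:

```python
import timm
import torch

# One ConvNet and one vision transformer, both ImageNet-pretrained,
# re-headed for a two-class medical task and trained identically.
convnet = timm.create_model("resnet50", pretrained=True, num_classes=2)
vit = timm.create_model("vit_base_patch16_224", pretrained=True, num_classes=2)

x = torch.randn(8, 3, 224, 224)  # stand-in batch of medical images
for name, model in [("ConvNet", convnet), ("ViT", vit)]:
    logits = model(x)
    print(name, logits.shape)    # (8, 2) each; fine-tune both with the same schedule
```
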
Affiliation(s)
- Fatemeh Haghighi
- School of Computing and Augmented Intelligence, Arizona State University, Tempe, AZ 85281, USA
- Jianming Liang
- School of Computing and Augmented Intelligence, Arizona State University, Tempe, AZ 85281, USA.

11. Xia C, Zuo M, Lin Z, Deng L, Rao Y, Chen W, Chen J, Yao W, Hu M. Multimodal Deep Learning Fusing Clinical and Radiomics Scores for Prediction of Early-Stage Lung Adenocarcinoma Lymph Node Metastasis. Acad Radiol 2025;32:2977-2989. [PMID: 39730249] [DOI: 10.1016/j.acra.2024.12.018]
Abstract
RATIONALE AND OBJECTIVES To develop and validate a multimodal deep learning (DL) model based on computed tomography (CT) images and clinical knowledge to predict lymph node metastasis (LNM) in early lung adenocarcinoma. MATERIALS AND METHODS A total of 724 pathologically confirmed early invasive lung adenocarcinoma patients were retrospectively included from two centers. Clinical and CT semantic features of the patients were collected, and 3D radiomics features were extracted from nonenhanced CT images. We propose a multimodal feature-fusion DL network based on the InceptionResNetV2 architecture, which can effectively extract and integrate image and clinical knowledge to predict LNM. RESULTS A total of 524 lung adenocarcinoma patients from Center 1 were randomly divided into training (n=418) and internal validation (n=106) sets in a 4:1 ratio, while 200 lung adenocarcinoma patients from Center 2 served as the independent test set. Among the 16 collected clinical and imaging features, 8 were selected: gender, serum carcinoembryonic antigen, cytokeratin 19 fragment antigen 21-1, neuron-specific enolase, tumor size, location, density, and centrality. From the 1,595 extracted radiomics features, six key features were identified. The CS-RS-DL fusion model achieved the highest area under the receiver operating characteristic curve in both the internal validation set (0.877) and the independent test set (0.906) compared with other models. DeLong test results for the independent test set indicated that the CS-RS-DL model significantly outperformed the clinical model (0.844), radiomics model (0.850), CS-RS model (0.872), single DL model (0.848), and CS-DL model (0.875) (all P<0.05). Additionally, the CS-RS-DL model exhibited the highest sensitivity (0.941) and average precision (0.642). CONCLUSION The knowledge derived from clinical data, radiomics, and DL is complementary in predicting LNM in lung adenocarcinoma. Integrating clinical and radiomics scores through DL can significantly improve the accuracy of lymph node status assessment.
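
The fusion of a deep image branch with clinical and radiomics scores can be sketched as late concatenation before the classifier head. The sketch below uses a ResNet-18 branch and 14 tabular inputs (8 clinical/semantic plus 6 radiomics features, per the abstract); the paper's InceptionResNetV2-based design is more elaborate:

```python
import torch
import torch.nn as nn
from torchvision import models

class FusionNet(nn.Module):
    """Concatenate a deep image embedding with tabular clinical/radiomics
    scores before the classifier head (generic late-fusion sketch)."""
    def __init__(self, n_tabular: int = 14, n_classes: int = 2):
        super().__init__()
        backbone = models.resnet18(weights="DEFAULT")
        backbone.fc = nn.Identity()          # expose the 512-d image embedding
        self.backbone = backbone
        self.head = nn.Sequential(
            nn.Linear(512 + n_tabular, 128), nn.ReLU(),
            nn.Linear(128, n_classes))

    def forward(self, image, tabular):
        z = self.backbone(image)
        return self.head(torch.cat([z, tabular], dim=1))

net = FusionNet()
img = torch.randn(2, 3, 224, 224)   # CT patches (stand-in)
tab = torch.randn(2, 14)            # clinical + radiomics scores (stand-in)
print(net(img, tab).shape)          # torch.Size([2, 2])
```
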
Affiliation(s)
- Chengcheng Xia
- School of Public Health, Jiangxi Medical College, Nanchang University, Nanchang 330006, China (C.X., L.D., W.C., M.H.); Jiangxi Provincial Key Laboratory of Disease Prevention and Public Health, Nanchang University, Nanchang 330006, China (C.X., L.D., W.C., M.H.)
- Minjing Zuo
- Department of Radiology, The Second Affiliated Hospital, Jiangxi Medical College, Nanchang University, Nanchang 330006, China (M.Z.); Intelligent Medical Imaging of Jiangxi Key Laboratory, Nanchang 330006, China (M.Z.)
- Ze Lin
- Department of Radiology, Hubei Provincial Hospital of Traditional Chinese Medicine, Wuhan 430022, China (Z.L.); Affiliated Hospital of Hubei University of Chinese Medicine, Wuhan 430022, China (Z.L.)
- Libin Deng
- School of Public Health, Jiangxi Medical College, Nanchang University, Nanchang 330006, China (C.X., L.D., W.C., M.H.); Jiangxi Provincial Key Laboratory of Disease Prevention and Public Health, Nanchang University, Nanchang 330006, China (C.X., L.D., W.C., M.H.)
- Yulian Rao
- Wanli District Center for Disease Control and Prevention of Nanchang, Nanchang 330004, China (Y.R.)
- Wenxiang Chen
- School of Public Health, Jiangxi Medical College, Nanchang University, Nanchang 330006, China (C.X., L.D., W.C., M.H.); Jiangxi Provincial Key Laboratory of Disease Prevention and Public Health, Nanchang University, Nanchang 330006, China (C.X., L.D., W.C., M.H.)
- Jinqin Chen
- Jiangxi Medical College, Nanchang University, Nanchang, China (J.C.)
- Weirong Yao
- Department of Oncology, Jiangxi Provincial People's Hospital, The First Affiliated Hospital of Nanchang Medical College, Nanchang, China (W.Y.)
- Min Hu
- School of Public Health, Jiangxi Medical College, Nanchang University, Nanchang 330006, China (C.X., L.D., W.C., M.H.); Jiangxi Provincial Key Laboratory of Disease Prevention and Public Health, Nanchang University, Nanchang 330006, China (C.X., L.D., W.C., M.H.).

12. Arnab SP, Campelo dos Santos AL, Fumagalli M, DeGiorgio M. Efficient Detection and Characterization of Targets of Natural Selection Using Transfer Learning. Mol Biol Evol 2025;42:msaf094. [PMID: 40341942] [PMCID: PMC12062966] [DOI: 10.1093/molbev/msaf094]
Abstract
Natural selection leaves detectable patterns of altered spatial diversity within genomes, and identifying affected regions is crucial for understanding species evolution. Recently, machine learning approaches applied to raw population genomic data have been developed to uncover these adaptive signatures. Convolutional neural networks (CNNs) are particularly effective for this task, as they handle large data arrays while maintaining element correlations. However, shallow CNNs may miss complex patterns due to their limited capacity, while deep CNNs can capture these patterns but require extensive data and computational power. Transfer learning addresses these challenges by utilizing a deep CNN pretrained on a large dataset as a feature extraction tool for downstream classification and evolutionary parameter prediction. This approach reduces the requirement for extensive training data generation and computation while maintaining high performance. In this study, we developed TrIdent, a tool that uses transfer learning to enhance detection of adaptive genomic regions from image representations of multilocus variation. We evaluated TrIdent across various genetic, demographic, and adaptive settings, in addition to unphased data and other confounding factors. TrIdent demonstrated improved detection of adaptive regions compared with recent methods using similar data representations. We further explored model interpretability through class activation maps and adapted TrIdent to infer selection parameters for identified adaptive candidates. Using whole-genome haplotype data from European and African populations, TrIdent effectively recapitulated known sweep candidates and identified novel cancer- and other disease-associated genes as potential sweeps.
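
TrIdent's transfer-learning recipe, an ImageNet-pretrained CNN kept frozen as a feature extractor with only a small downstream classifier trained, can be sketched as follows; the ResNet-50 choice and head sizes are assumptions:

```python
import torch
import torch.nn as nn
from torchvision import models

# Frozen pretrained CNN as a feature extractor; only the small head trains.
extractor = models.resnet50(weights="DEFAULT")
extractor.fc = nn.Identity()                 # expose 2048-d embeddings
for p in extractor.parameters():
    p.requires_grad = False
extractor.eval()

head = nn.Sequential(nn.Linear(2048, 128), nn.ReLU(), nn.Linear(128, 2))

x = torch.randn(4, 3, 224, 224)              # stand-in multilocus-variation images
with torch.no_grad():
    feats = extractor(x)
logits = head(feats)                         # sweep vs. neutral scores
print(logits.shape)                          # torch.Size([4, 2])
```
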
Affiliation(s)
- Sandipan Paul Arnab
- Department of Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, FL, USA
- Matteo Fumagalli
- School of Biological and Behavioural Sciences, Queen Mary University of London, London, UK
- The Alan Turing Institute, London, UK
- Michael DeGiorgio
- Department of Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, FL, USA

13. Liang X, Han L, Zhang X, Li X, Sun Y, Tong T, Tan T, Mann R. Singular value decomposition based under-sampling pattern optimization for MRI reconstruction. Med Phys 2025. [PMID: 40296184] [DOI: 10.1002/mp.17860]
Abstract
BACKGROUND Magnetic resonance imaging (MRI) is a crucial medical imaging technique that can determine the structural and functional status of body tissues and organs. However, the prolonged MRI acquisition time increases scanning cost and limits its use in less developed areas. PURPOSE The objective of this study is to design a lightweight, data-driven under-sampling pattern for fast MRI that balances reconstruction quality against sampling time and can also be integrated with deep learning to further improve reconstruction quality. METHODS We establish a connection between k-space and the corresponding MR image through singular value decomposition (SVD). Specifically, we apply SVD to the image to decouple it into multiple components, which are sorted by energy contribution. Then, for each component, the k-space sampling points matching its energy contribution are selected sequentially. Finally, the sampling points obtained from all components are merged to obtain a mask. This mask can be used directly as a sampler or integrated into deep learning as an initial or fixed sampling pattern. RESULTS Experiments were conducted on two public datasets, and the results demonstrate that when the mask generated by our method is used directly as the sampler, MRI reconstruction quality surpasses that of state-of-the-art heuristic samplers. In addition, when integrated into deep learning models, the models converge faster and sampler performance is significantly improved. CONCLUSIONS The proposed lightweight, data-driven sampling approach avoids time-consuming parameter tuning and the construction of complex mathematical models, achieving a balance between reconstruction quality and sampling time.
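
The mask construction can be read as follows: decompose the image by SVD, rank the components by energy, and give each component a share of the sampling budget at its strongest k-space locations. A NumPy sketch under that reading (the paper's exact selection rule may differ):

```python
import numpy as np

def svd_mask(image: np.ndarray, budget: float = 0.25) -> np.ndarray:
    """Allocate a k-space sampling budget across SVD components in
    proportion to their energy, picking each component's strongest
    k-space locations (rounding leaves the budget approximate)."""
    u, s, vt = np.linalg.svd(image, full_matrices=False)
    energy = s**2 / np.sum(s**2)
    n_total = int(budget * image.size)
    mask = np.zeros(image.shape, dtype=bool)
    for k in range(len(s)):
        n_k = int(round(energy[k] * n_total))
        if n_k == 0:
            continue
        comp = s[k] * np.outer(u[:, k], vt[k])            # rank-1 component
        mag = np.abs(np.fft.fftshift(np.fft.fft2(comp)))  # its k-space energy
        mag[mask] = -1.0                                  # skip points already chosen
        idx = np.argpartition(mag.ravel(), -n_k)[-n_k:]
        mask.ravel()[idx] = True
    return mask

mask = svd_mask(np.random.rand(128, 128))
print(mask.mean())  # fraction of k-space sampled, ~0.25
```
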
Affiliation(s)
- Xinglong Liang
- The Department of Radiology and Nuclear Medicine, Radboud University Medical Centre, Nijmegen, The Netherlands
- The Department of Radiology, The Netherlands Cancer Institute, Amsterdam, The Netherlands
- Luyi Han
- The Department of Radiology and Nuclear Medicine, Radboud University Medical Centre, Nijmegen, The Netherlands
- The Department of Radiology, The Netherlands Cancer Institute, Amsterdam, The Netherlands
- Xinlin Zhang
- The College of Physics and Information Engineering, Fuzhou University, Fuzhou, China
- Xinnian Li
- Research Center of Space Control and Inertial Technology, Harbin, China
- Yue Sun
- Faculty of Applied Sciences, Macao Polytechnic University, Macao Special Administrative Region of China, Macao, China
- Tong Tong
- The College of Physics and Information Engineering, Fuzhou University, Fuzhou, China
- Tao Tan
- The Department of Radiology, The Netherlands Cancer Institute, Amsterdam, The Netherlands
- Faculty of Applied Sciences, Macao Polytechnic University, Macao Special Administrative Region of China, Macao, China
- Ritse Mann
- The Department of Radiology and Nuclear Medicine, Radboud University Medical Centre, Nijmegen, The Netherlands
- The Department of Radiology, The Netherlands Cancer Institute, Amsterdam, The Netherlands

14. Aktar M, Tampieri D, Xiao Y, Rivaz H, Kersten-Oertel M. CASCADE-FSL: Few-shot learning for collateral evaluation in ischemic stroke. Comput Med Imaging Graph 2025;123:102550. [PMID: 40250214] [DOI: 10.1016/j.compmedimag.2025.102550]
Abstract
Assessing collateral circulation is essential in determining the best treatment for ischemic stroke patients: good collaterals open up treatment options such as thrombectomy, whereas poor collaterals can adversely affect treatment by leading to excess bleeding and, eventually, death. To reduce inter- and intra-rater variability and save time in radiologist assessments, computer-aided methods, mainly using deep neural networks, have gained popularity. The current literature demonstrates effectiveness when using balanced and extensive datasets in deep learning; however, such datasets are scarce for stroke, and the number of samples for poor collateral cases is often limited compared with those for good collaterals. We propose a novel approach called CASCADE-FSL to distinguish poor collaterals effectively. Using a small, unbalanced dataset, we employ few-shot learning with a 2D ResNet-50 backbone, designating good and intermediate cases as two normal classes and identifying poor collaterals as anomalies relative to them. Our approach achieves an overall accuracy, sensitivity, and specificity of 0.88, 0.88, and 0.89, respectively, demonstrating its effectiveness in addressing the imbalanced dataset challenge and accurately identifying poor collateral circulation cases.
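
Treating poor collaterals as anomalies against two "normal" classes can be sketched as prototype-distance scoring on ResNet-50 embeddings; this is a generic few-shot sketch, not the paper's exact training recipe:

```python
import torch
import torch.nn as nn
from torchvision import models

backbone = models.resnet50(weights="DEFAULT")
backbone.fc = nn.Identity()   # expose 2048-d embeddings
backbone.eval()

@torch.no_grad()
def anomaly_score(query, support_good, support_intermediate):
    """Distance of a query scan's embedding to the nearest 'normal'
    prototype (good or intermediate collaterals); larger scores flag
    poor collaterals as anomalies."""
    zq = backbone(query)                                   # (1, 2048)
    protos = torch.stack([backbone(support_good).mean(0),
                          backbone(support_intermediate).mean(0)])
    return torch.cdist(zq, protos).min().item()

score = anomaly_score(torch.randn(1, 3, 224, 224),
                      torch.randn(5, 3, 224, 224),   # few-shot support sets
                      torch.randn(5, 3, 224, 224))   # (stand-in tensors)
```
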
Affiliation(s)
- Mumu Aktar
- Computer Science and Software Engineering, Concordia University, 1455 De Maisonneuve Blvd, Montreal, H3G 1M8, Quebec, Canada.
- Donatella Tampieri
- Computer Science and Software Engineering, Concordia University, 1455 De Maisonneuve Blvd, Montreal, H3G 1M8, Quebec, Canada
- Yiming Xiao
- Computer Science and Software Engineering, Concordia University, 1455 De Maisonneuve Blvd, Montreal, H3G 1M8, Quebec, Canada
- Hassan Rivaz
- Computer Science and Software Engineering, Concordia University, 1455 De Maisonneuve Blvd, Montreal, H3G 1M8, Quebec, Canada
- Marta Kersten-Oertel
- Computer Science and Software Engineering, Concordia University, 1455 De Maisonneuve Blvd, Montreal, H3G 1M8, Quebec, Canada

15. Li Y, Hui L, Wang X, Zou L, Chua S. Lung nodule detection using a multi-scale convolutional neural network and global channel spatial attention mechanisms. Sci Rep 2025;15:12313. [PMID: 40210738] [PMCID: PMC11986029] [DOI: 10.1038/s41598-025-97187-w]
Abstract
Early detection of lung nodules is crucial for the prevention and treatment of lung cancer. However, current methods face challenges such as missed small nodules, variations in nodule size, and high false positive rates. To address these challenges, we propose a Global Channel Spatial Attention Mechanism (GCSAM) and, building upon it, develop a Candidate Nodule Detection Network (CNDNet) and a False Positive Reduction Network (FPRNet). CNDNet employs Res2Net as its backbone to capture multi-scale features of lung nodules, using GCSAM to fuse global contextual information, adaptively adjust feature weights, and refine processing along the spatial dimension. Additionally, we design a Hierarchical Progressive Feature Fusion (HPFF) module to effectively combine deep semantic information with shallow positional information, enabling high-sensitivity detection of nodules of varying sizes. FPRNet significantly reduces the false positive rate by accurately distinguishing true nodules from similar structures. Experimental results on the LUNA16 dataset demonstrate that our method achieves a competition performance metric (CPM) of 0.929 and a sensitivity of 0.977 at 2 false positives per scan. Compared with existing methods, the proposed method effectively reduces false positives while maintaining high sensitivity, achieving competitive results.
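
A generic channel-then-spatial attention block, in the spirit of GCSAM (the exact global-context design is not reproduced here), looks like this in PyTorch:

```python
import torch
import torch.nn as nn

class ChannelSpatialAttention(nn.Module):
    """Channel re-weighting followed by spatial re-weighting."""
    def __init__(self, channels: int, reduction: int = 16):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(channels, channels // reduction), nn.ReLU(),
            nn.Linear(channels // reduction, channels))
        self.spatial = nn.Conv2d(2, 1, kernel_size=7, padding=3)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, _, _ = x.shape
        ca = torch.sigmoid(self.mlp(x.mean(dim=(2, 3)))).view(b, c, 1, 1)
        x = x * ca                                      # channel attention
        sa = torch.sigmoid(self.spatial(torch.cat(
            [x.mean(1, keepdim=True), x.amax(1, keepdim=True)], dim=1)))
        return x * sa                                   # spatial attention

feat = torch.randn(2, 64, 32, 32)
print(ChannelSpatialAttention(64)(feat).shape)  # torch.Size([2, 64, 32, 32])
```
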
Affiliation(s)
- Yongbin Li
- Faculty of Medical Information Engineering, Zunyi Medical University, 563000, Zunyi, Guizhou, China
- Faculty of Computer Science and Information Technology, Universiti Malaysia Sarawak, 94300, Kota Samarahan, Sarawak, Malaysia
- Linhu Hui
- Faculty of Medical Information Engineering, Zunyi Medical University, 563000, Zunyi, Guizhou, China
- Xiaohua Wang
- Faculty of Medical Information Engineering, Zunyi Medical University, 563000, Zunyi, Guizhou, China
- Liping Zou
- Faculty of Medical Information Engineering, Zunyi Medical University, 563000, Zunyi, Guizhou, China
- Stephanie Chua
- Faculty of Computer Science and Information Technology, Universiti Malaysia Sarawak, 94300, Kota Samarahan, Sarawak, Malaysia.

16. Santoro-Fernandes V, Schott B, Weisman AJ, Lokre O, Cho SY, Perlman SB, Perk TG, Jeraj R. Full-Body Tumor Response Heterogeneity of Metastatic Neuroendocrine Tumor Patients Undergoing Peptide Receptor Radiopharmaceutical Therapy. J Nucl Med 2025;66:565-571. [PMID: 39947917] [DOI: 10.2967/jnumed.124.267809]
Abstract
Patients with metastatic neuroendocrine tumors (NETs) can present with hundreds of lesions, and each lesion might have a unique response pattern to peptide receptor radiopharmaceutical therapy (PRRT). This response heterogeneity has been observed but is poorly understood. In this work, we perform a quantitative analysis of longitudinal PET/CT scans to comprehensively characterize the NET response to PRRT. Methods: NET patients treated with [177Lu]Lu-DOTATATE PRRT imaged at baseline, during, and after PRRT with [68Ga]Ga-DOTATATE PET/CT were enrolled in this retrospective single-institutional study. A deep-learning model was used to identify and contour regions of nonphysiological elevated tracer uptake (lesion-regions of interest [ROIs]). An automated analysis was performed to identify, contour, and quantify the individual lesion-ROI uptake, match ROI between time points, and categorize each lesion-ROI as disappearing, decreasing (ΔSUVtotal < -30%), stable (-30% ≤ ΔSUVtotal ≤ 30%), increasing (ΔSUVtotal > 30%), or new. A patient was considered to have response heterogeneity if both new or increasing lesion-ROIs and decreasing or disappearing lesion-ROIs were present after therapy. Results: Eighteen patients who received between 2 and 7 [68Ga]Ga-DOTATATE PET/CT scans were enrolled. In total, 3,289 lesion-ROIs were contoured in the 67 scans acquired (median of 24 lesion-ROIs per image), and 1,459 lesion-ROI tracks, defined as the path that each unique lesion-ROI follows across all time points, were determined by the ROI tracking method (median of 49 tracks per patient). All patients presented with disease response heterogeneity at the first follow-up scan. All 10 patients with more than 1 follow-up scan showed nonmonotonic change in lesion-ROI uptake. Of 129 tracks containing new lesion-ROIs at the first follow-up, 80 (62%) eventually resolved on final follow-up, whereas only 12% (7/60) of the tracks with lesion-ROIs disappearing at the first follow-up scan returned on final follow-up. Conclusion: To the best of our knowledge, this is the first study to evaluate response comprehensively and quantitatively in terms of individual lesion-ROIs. Response heterogeneity was observed in 100% of the patients, which suggests that comprehensive, lesion-level, response assessment is vital for the accurate understanding of the NET response to PRRT.
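
The per-lesion response categories follow directly from the ±30% bands on ΔSUVtotal described above; a small helper makes the rule explicit (handling of zero-uptake edge cases is an assumed detail):

```python
def categorize_lesion(suv_before: float, suv_after: float, band: float = 0.30) -> str:
    """Categorize a matched lesion-ROI by its change in SUVtotal,
    using the +/-30% bands from the study."""
    if suv_before == 0:
        return "new" if suv_after > 0 else "stable"   # zero-to-zero: assumed stable
    if suv_after == 0:
        return "disappearing"
    change = (suv_after - suv_before) / suv_before
    if change < -band:
        return "decreasing"
    if change > band:
        return "increasing"
    return "stable"

print(categorize_lesion(10.0, 5.0))  # 'decreasing' (-50% change)
```
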
Affiliation(s)
- Victor Santoro-Fernandes
- Department of Medical Physics, School of Medicine and Public Health, University of Wisconsin, Madison, Wisconsin
- Brayden Schott
- Department of Medical Physics, School of Medicine and Public Health, University of Wisconsin, Madison, Wisconsin
- Steve Y Cho
- Section of Nuclear Medicine and Molecular Imaging, Department of Radiology, School of Medicine and Public Health, University of Wisconsin, Madison, Wisconsin
- Carbone Cancer Centre, University of Wisconsin, Madison, Wisconsin
- Scott B Perlman
- Section of Nuclear Medicine and Molecular Imaging, Department of Radiology, School of Medicine and Public Health, University of Wisconsin, Madison, Wisconsin
- Carbone Cancer Centre, University of Wisconsin, Madison, Wisconsin
- Robert Jeraj
- Department of Medical Physics, School of Medicine and Public Health, University of Wisconsin, Madison, Wisconsin
- Carbone Cancer Centre, University of Wisconsin, Madison, Wisconsin

17. Sammad A, Ding Z. Harnessing Multi-Omics: Integrating Radiomics and Pathomics for Predicting Microsatellite Instability in Rectal Cancer. Acad Radiol 2025;32:1946-1948. [PMID: 39955254] [DOI: 10.1016/j.acra.2025.02.015]
Affiliation(s)
- Abdul Sammad
- The Fourth School of Clinical Medicine, Zhejiang Chinese Medical University, Hangzhou, PR China (A.S., Z.D.); Department of Radiology, Hangzhou First People's Hospital, Hangzhou, PR China (A.S., Z.D.)
- Zhongxiang Ding
- The Fourth School of Clinical Medicine, Zhejiang Chinese Medical University, Hangzhou, PR China (A.S., Z.D.); Department of Radiology, Hangzhou First People's Hospital, Hangzhou, PR China (A.S., Z.D.).

18. Sekkat H, Khallouqi A, Rhazouani OE, Halimi A. Automated Detection of Hydrocephalus in Pediatric Head Computed Tomography Using VGG 16 CNN Deep Learning Architecture and Based Automated Segmentation Workflow for Ventricular Volume Estimation. J Imaging Inform Med 2025. [PMID: 40108068] [DOI: 10.1007/s10278-025-01482-x]
Abstract
Hydrocephalus, particularly congenital hydrocephalus in infants, remains underexplored in deep learning research. While deep learning has been widely applied to medical image analysis, few studies have specifically addressed automated classification of hydrocephalus. This study proposes a convolutional neural network (CNN) model based on the VGG16 architecture to detect hydrocephalus in infant head CT images. The model integrates an automated method for ventricular volume extraction, applying windowing, histogram equalization, and thresholding to segment the ventricles from surrounding brain structures. Morphological operations refine the segmentation, and contours are extracted for visualization and volume measurement. The dataset consists of 105 head CT scans, each with 60 slices covering the ventricular volume, resulting in 6,300 slices. Manual segmentation by three trained radiologists served as the reference standard. The automated method showed high correlation with manual measurements, with R2 values ranging from 0.94 to 0.99. The mean absolute percentage error (MAPE) ranged from 3.99% to 11.13%, and the relative root mean square error (RRMSE) from 4.56% to 13.74%. To improve model robustness, the dataset was preprocessed, normalized, and augmented with rotation, shifting, zooming, and flipping. The VGG16-based CNN used pre-trained convolutional layers with additional fully connected layers for classification, predicting hydrocephalus or normal labels. Performance evaluation using a multi-split strategy (15 independent splits) achieved a mean accuracy of 90.4% ± 1.2%. This study presents an automated approach for ventricular volume extraction and hydrocephalus detection, offering a promising tool for clinical and research applications with high accuracy and reduced observer bias.
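
The ventricular segmentation pipeline (windowing, histogram equalization, thresholding, morphology, contour extraction) can be sketched with OpenCV; the window bounds and the inverse Otsu threshold (CSF is dark in a brain window) are illustrative, not the paper's calibrated settings:

```python
import cv2
import numpy as np

def segment_ventricles(ct_slice_hu: np.ndarray):
    """Windowing -> histogram equalization -> Otsu threshold -> morphology
    -> contours; returns the mask, contours, and ventricular area in pixels."""
    windowed = np.clip(ct_slice_hu, 0, 80)                     # brain window (assumed 0-80 HU)
    img = cv2.normalize(windowed, None, 0, 255, cv2.NORM_MINMAX).astype(np.uint8)
    img = cv2.equalizeHist(img)
    # CSF appears dark in a brain window, hence the inverse threshold
    _, mask = cv2.threshold(img, 0, 255, cv2.THRESH_BINARY_INV + cv2.THRESH_OTSU)
    kernel = np.ones((3, 3), np.uint8)
    mask = cv2.morphologyEx(mask, cv2.MORPH_OPEN, kernel, iterations=2)
    contours, _ = cv2.findContours(mask, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
    area_px = sum(cv2.contourArea(c) for c in contours)
    return mask, contours, area_px  # volume ~ sum of slice areas x slice thickness
```
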
Affiliation(s)
- Hamza Sekkat
- Sciences and Engineering of Biomedicals, Biophysics and Health Laboratory, Higher Institute of Health Sciences, Hassan 1st University, Settat, 26000, Morocco.
- Department of Radiotherapy, International Clinic of Settat, Settat, Morocco.
- Abdellah Khallouqi
- Sciences and Engineering of Biomedicals, Biophysics and Health Laboratory, Higher Institute of Health Sciences, Hassan 1st University, Settat, 26000, Morocco
- Department of Radiology, Public Hospital of Mediouna, Mediouna, Morocco
- Department of Radiology, Private Clinic Hay Mouhamadi, Casablanca, Morocco
- Omar El Rhazouani
- Sciences and Engineering of Biomedicals, Biophysics and Health Laboratory, Higher Institute of Health Sciences, Hassan 1st University, Settat, 26000, Morocco
- Abdellah Halimi
- Sciences and Engineering of Biomedicals, Biophysics and Health Laboratory, Higher Institute of Health Sciences, Hassan 1st University, Settat, 26000, Morocco

19. Pelcat A, Le Berre A, Ben Hassen W, Debacker C, Charron S, Thirion B, Legrand L, Turc G, Oppenheim C, Benzakoun J. Generative T2*-weighted images as a substitute for true T2*-weighted images on brain MRI in patients with acute stroke. Diagn Interv Imaging 2025:S2211-5684(25)00048-8. [PMID: 40113490] [DOI: 10.1016/j.diii.2025.03.004]
Abstract
PURPOSE The purpose of this study was to validate a deep learning algorithm that generates T2*-weighted images from diffusion-weighted (DW) images and to compare its performance with that of true T2*-weighted images for hemorrhage detection on MRI in patients with acute stroke. MATERIALS AND METHODS This single-center, retrospective study included DW and T2*-weighted images obtained less than 48 hours after symptom onset in consecutive patients admitted for acute stroke. Datasets were divided into training (60%), validation (20%), and test (20%) sets, with stratification by stroke type (hemorrhagic/ischemic). A generative adversarial network was trained to produce generative T2*-weighted images from DW images. Concordance between true and generative T2*-weighted images for hemorrhage detection was independently graded by two readers into three categories (parenchymal hematoma, hemorrhagic infarct, or no hemorrhage), and discordances were resolved by consensus reading. Sensitivity, specificity, and accuracy of generative T2*-weighted images were estimated using true T2*-weighted images as the standard of reference. RESULTS A total of 1,491 MRI sets from 939 patients (487 women, 452 men) with a median age of 71 years (first quartile, 57; third quartile, 81; range: 21-101) were included. In the test set (n = 300), there were no differences between true and generative T2*-weighted images for intraobserver reproducibility (κ = 0.97 [95% CI: 0.95-0.99] vs. 0.95 [95% CI: 0.92-0.97]; P = 0.27) or interobserver reproducibility (κ = 0.93 [95% CI: 0.90-0.97] vs. 0.92 [95% CI: 0.88-0.96]; P = 0.64). After consensus reading, concordance between true and generative T2*-weighted images was excellent (κ = 0.92; 95% CI: 0.91-0.96). Generative T2*-weighted images achieved 90% sensitivity (73/81; 95% CI: 81-96), 97% specificity (213/219; 95% CI: 94-99), and 95% accuracy (286/300; 95% CI: 92-97) for the diagnosis of any cerebral hemorrhage (hemorrhagic infarct or parenchymal hematoma). CONCLUSION Generative and true T2*-weighted images did not differ in diagnostic performance for hemorrhage detection in patients with acute stroke, and generative images may be used to shorten MRI protocols.
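
The agreement statistics reported here (Cohen's κ, sensitivity, specificity) are straightforward to reproduce from per-scan gradings; a sketch with hypothetical labels:

```python
from sklearn.metrics import cohen_kappa_score, confusion_matrix

# Hypothetical per-scan gradings: 0 = no hemorrhage, 1 = hemorrhagic
# infarct, 2 = parenchymal hematoma.
true_t2 = [0, 0, 2, 1, 0, 1, 2, 0]
gen_t2  = [0, 0, 2, 1, 0, 0, 2, 0]

kappa = cohen_kappa_score(true_t2, gen_t2)   # three-category concordance

# Collapse to any-hemorrhage vs. none for sensitivity/specificity
y_true = [int(v > 0) for v in true_t2]
y_pred = [int(v > 0) for v in gen_t2]
tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
sensitivity, specificity = tp / (tp + fn), tn / (tn + fp)
print(round(kappa, 2), sensitivity, specificity)
```
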
Collapse
Affiliation(s)
- Antoine Pelcat
- Université Paris Cité, Institute of Psychiatry and Neuroscience of Paris (IPNP), INSERM U1266, IMA-BRAIN, 75014 Paris, France
| | - Alice Le Berre
- Université Paris Cité, Institute of Psychiatry and Neuroscience of Paris (IPNP), INSERM U1266, IMA-BRAIN, 75014 Paris, France; GHU Paris Psychiatrie et Neurosciences, Hôpital Sainte Anne, Department of Neuroradiology, 75014 Paris, France
| | - Wagih Ben Hassen
- Université Paris Cité, Institute of Psychiatry and Neuroscience of Paris (IPNP), INSERM U1266, IMA-BRAIN, 75014 Paris, France; GHU Paris Psychiatrie et Neurosciences, Hôpital Sainte Anne, Department of Neuroradiology, 75014 Paris, France
| | - Clement Debacker
- Université Paris Cité, Institute of Psychiatry and Neuroscience of Paris (IPNP), INSERM U1266, IMA-BRAIN, 75014 Paris, France; GHU Paris Psychiatrie et Neurosciences, Hôpital Sainte Anne, Department of Neuroradiology, 75014 Paris, France
| | - Sylvain Charron
- Université Paris Cité, Institute of Psychiatry and Neuroscience of Paris (IPNP), INSERM U1266, IMA-BRAIN, 75014 Paris, France
| | - Bertrand Thirion
- INRIA, CEA, Université Paris-Saclay, MIND Team, 91400 Palaiseau, France
| | - Laurence Legrand
- Université Paris Cité, Institute of Psychiatry and Neuroscience of Paris (IPNP), INSERM U1266, IMA-BRAIN, 75014 Paris, France; GHU Paris Psychiatrie et Neurosciences, Hôpital Sainte Anne, Department of Neuroradiology, 75014 Paris, France
| | - Guillaume Turc
- Université Paris Cité, Institute of Psychiatry and Neuroscience of Paris (IPNP), INSERM U1266, Stroke Team, 75014 Paris, France; GHU Paris Psychiatrie et Neurosciences, Hôpital Sainte Anne, Department of Neurology, 75014 Paris, France
| | - Catherine Oppenheim
- Université Paris Cité, Institute of Psychiatry and Neuroscience of Paris (IPNP), INSERM U1266, IMA-BRAIN, 75014 Paris, France; GHU Paris Psychiatrie et Neurosciences, Hôpital Sainte Anne, Department of Neuroradiology, 75014 Paris, France
| | - Joseph Benzakoun
- Université Paris Cité, Institute of Psychiatry and Neuroscience of Paris (IPNP), INSERM U1266, IMA-BRAIN, 75014 Paris, France; GHU Paris Psychiatrie et Neurosciences, Hôpital Sainte Anne, Department of Neuroradiology, 75014 Paris, France.
| |
Collapse
|
20
|
Fasihi-Shirehjini O, Babapour-Mofrad F. Effectiveness of ConvNeXt variants in diabetic feet diagnosis using plantar thermal images. QUANTITATIVE INFRARED THERMOGRAPHY JOURNAL 2025; 22:155-172. [DOI: 10.1080/17686733.2024.2310794] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Accepted: 01/23/2024] [Indexed: 10/11/2024]
|
21
|
Arnab SP, Dos Santos ALC, Fumagalli M, DeGiorgio M. Efficient detection and characterization of targets of natural selection using transfer learning. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.03.05.641710. [PMID: 40093065 PMCID: PMC11908262 DOI: 10.1101/2025.03.05.641710] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 03/19/2025]
Abstract
Natural selection leaves detectable patterns of altered spatial diversity within genomes, and identifying affected regions is crucial for understanding species evolution. Recently, machine learning approaches applied to raw population genomic data have been developed to uncover these adaptive signatures. Convolutional neural networks (CNNs) are particularly effective for this task, as they handle large data arrays while maintaining element correlations. However, shallow CNNs may miss complex patterns due to their limited capacity, while deep CNNs can capture these patterns but require extensive data and computational power. Transfer learning addresses these challenges by utilizing a deep CNN pre-trained on a large dataset as a feature extraction tool for downstream classification and evolutionary parameter prediction. This approach reduces extensive training data generation requirements and computational needs while maintaining high performance. In this study, we developed TrIdent, a tool that uses transfer learning to enhance detection of adaptive genomic regions from image representations of multilocus variation. We evaluated TrIdent across various genetic, demographic, and adaptive settings, in addition to unphased data and other confounding factors. TrIdent demonstrated improved detection of adaptive regions compared to recent methods using similar data representations. We further explored model interpretability through class activation maps and adapted TrIdent to infer selection parameters for identified adaptive candidates. Using whole-genome haplotype data from European and African populations, TrIdent effectively recapitulated known sweep candidates and identified novel cancer- and other disease-associated genes as potential sweeps.
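The core transfer-learning recipe here — a deep CNN pre-trained on a large generic dataset, frozen, and used as a feature extractor feeding a small trainable head — can be sketched briefly. The ResNet-50/ImageNet backbone and the two-layer head below are illustrative assumptions, not TrIdent's actual configuration.
```python
import torch
import torch.nn as nn
from torchvision import models

# Frozen pre-trained backbone: feature extraction only, no extensive retraining
backbone = models.resnet50(weights=models.ResNet50_Weights.IMAGENET1K_V2)
backbone.fc = nn.Identity()          # expose the 2048-dim pooled features
for p in backbone.parameters():
    p.requires_grad = False

# Small trainable head: sweep vs. neutral classification (assumed layout)
head = nn.Sequential(
    nn.Linear(2048, 256), nn.ReLU(), nn.Dropout(0.3), nn.Linear(256, 2),
)

x = torch.randn(8, 3, 224, 224)      # image representations of genomic windows
with torch.no_grad():
    feats = backbone(x)
logits = head(feats)                 # only head parameters receive gradients
```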
Collapse
Affiliation(s)
- Sandipan Paul Arnab
- Department of Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, FL, USA
| | | | - Matteo Fumagalli
- School of Biological and Behavioural Sciences, Queen Mary University of London, London, UK
- The Alan Turing Institute, London, UK
| | - Michael DeGiorgio
- Department of Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, FL, USA
| |
Collapse
|
22
|
Deebani W, Aziz L, Aziz A, Basri WS, Alawad WM, Althubiti SA. Synergistic transfer learning and adversarial networks for breast cancer diagnosis: benign vs. invasive classification. Sci Rep 2025; 15:7461. [PMID: 40032913 PMCID: PMC11876678 DOI: 10.1038/s41598-025-90288-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2024] [Accepted: 02/11/2025] [Indexed: 03/05/2025] Open
Abstract
Current breast cancer diagnosis methods often face limitations such as high cost, time consumption, and inter-observer variability. To address these challenges, this research proposes a novel deep learning framework that leverages generative adversarial networks (GANs) for data augmentation and transfer learning to enhance breast cancer classification using convolutional neural networks (CNNs). The framework uses a two-stage augmentation approach. First, a conditional Wasserstein GAN (cWGAN) generates synthetic breast cancer images based on clinical data, enhancing training stability and enabling targeted feature incorporation. Second, traditional augmentation techniques (e.g., rotation, flipping, cropping) are applied to both original and synthetic images. A multi-scale transfer learning technique is also employed, integrating three pre-trained CNNs (DenseNet-201, NasNetMobile, ResNet-101) with a multi-scale feature enrichment scheme, allowing the model to capture features at various scales. The framework was evaluated on the BreakHis dataset, achieving an accuracy of 99.2% for binary classification and 98.5% for multi-class classification, significantly outperforming existing methods. This framework offers a more efficient, cost-effective, and accurate approach for breast cancer diagnosis. Future work will focus on generalizing the framework to clinical datasets and integrating it into diagnostic workflows.
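The multi-scale transfer-learning component — pooled features from several pre-trained CNNs concatenated before a shared classifier — can be sketched as follows. Two torchvision backbones stand in here; the paper's third network, NASNetMobile, is not shipped by torchvision and is omitted, and the feature-enrichment scheme itself is not reproduced.
```python
import torch
import torch.nn as nn
from torchvision import models

densenet = models.densenet201(weights=models.DenseNet201_Weights.IMAGENET1K_V1)
resnet = models.resnet101(weights=models.ResNet101_Weights.IMAGENET1K_V2)
densenet.classifier = nn.Identity()      # 1920-dim pooled features
resnet.fc = nn.Identity()                # 2048-dim pooled features
for p in list(densenet.parameters()) + list(resnet.parameters()):
    p.requires_grad = False              # use both networks as extractors

classifier = nn.Linear(1920 + 2048, 2)   # benign vs. invasive (assumed head)

x = torch.randn(4, 3, 224, 224)          # histopathology patch batch
with torch.no_grad():
    fused = torch.cat([densenet(x), resnet(x)], dim=1)
logits = classifier(fused)
```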
Collapse
Affiliation(s)
- Wejdan Deebani
- Department of Mathematics, College of Science and Arts, King Abdul Aziz University, 21911, Rabigh, Saudi Arabia
| | - Lubna Aziz
- Department of Artificial Intelligence, FEST, Iqra University Karachi, Karachi, Pakistan.
- Faculty of Computing, Universiti Teknologi Malaysia, Johor Bahru, Johor, Malaysia.
| | - Arshad Aziz
- Department of Artificial Intelligence, FEST, Iqra University Karachi, Karachi, Pakistan
| | - Wael Sh Basri
- College of Business Administration, Management Information System, Northern Border University, Arar, Saudi Arabia
| | - Wedad M Alawad
- Department of Information Technology, College of Computer, Qassim University, Buraydah, 51452, Saudi Arabia
| | - Sara A Althubiti
- Department of Computer Science, College of Computer and Information Sciences, Majmaah University, 11952, Al-Majmaah, Saudi Arabia
| |
Collapse
|
23
|
Han K, Lou Q, Lu F. A semi-supervised domain adaptation method with scale-aware and global-local fusion for abdominal multi-organ segmentation. J Appl Clin Med Phys 2025; 26:e70008. [PMID: 39924943 PMCID: PMC11905256 DOI: 10.1002/acm2.70008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2024] [Revised: 11/02/2024] [Accepted: 11/27/2024] [Indexed: 02/11/2025] Open
Abstract
BACKGROUND Abdominal multi-organ segmentation remains a challenging task. Semi-supervised domain adaptation (SSDA) has emerged as an innovative solution. However, SSDA frameworks based on UNet struggle to capture multi-scale and global information. PURPOSE Our work aimed to propose a novel SSDA method to achieve more accurate abdominal multi-organ segmentation with limited labeled target domain data, which has a superior ability to capture the multi-scale features and integrate local and global information effectively. METHODS The proposed network is based on UNet. In the encoder part, a scale-aware with domain-specific batch normalization (SAD) module is integrated to adaptively extract multi-scale features and to get better generalization across source and target domains. In the bottleneck part, a global-local fusion (GLF) module is utilized for capturing and integrating both local and global information. They are integrated into the framework of self-ensembling mean-teacher (SE-MT) to enhance the model's capability to learn common features across source and target domains. RESULTS To validate the performance of the proposed model, we evaluated it on the public CHAOS and BTCV datasets. For CHAOS, the proposed method obtains an average DSC of 88.97% and ASD of 1.12 mm with only 20% labeled target data. For BTCV, it achieves an average DSC of 88.95% and ASD of 1.13 mm with 20% labeled target data. Compared with the state-of-the-art methods, DSC and ASD increased by at least 0.72% and 0.33 mm on CHAOS, 1.29% and 0.06 mm on BTCV, respectively. Ablation studies were also conducted to verify the contribution of each component of the model. The proposed method achieves a DSC improvement of 3.17% over the baseline with 20% labeled target data. CONCLUSION The proposed SSDA method for abdominal multi-organ segmentation has a powerful ability to extract multi-scale and more global features, significantly improving segmentation accuracy and robustness.
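The self-ensembling mean-teacher (SE-MT) component admits a compact sketch: the teacher is an exponential moving average (EMA) of the student, and a consistency loss aligns their predictions on unlabeled target-domain images. The segmentation network is abstracted to a single convolution here; the SAD and GLF modules are not reproduced.
```python
import copy
import torch
import torch.nn.functional as F

def ema_update(teacher, student, alpha=0.99):
    # Teacher weights track a slow exponential moving average of the student
    with torch.no_grad():
        for t, s in zip(teacher.parameters(), student.parameters()):
            t.mul_(alpha).add_(s, alpha=1 - alpha)

student = torch.nn.Conv2d(1, 5, 3, padding=1)   # stand-in for the UNet variant
teacher = copy.deepcopy(student)
for p in teacher.parameters():
    p.requires_grad = False

x = torch.randn(2, 1, 64, 64)                   # unlabeled target-domain batch
noisy = x + 0.1 * torch.randn_like(x)           # perturbed student input
consistency = F.mse_loss(student(noisy).softmax(1), teacher(x).softmax(1))
consistency.backward()                          # gradients flow to student only
ema_update(teacher, student)
```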
Collapse
Affiliation(s)
- Kexin Han
- School of Science, Zhejiang University of Science and Technology, Hangzhou, China
| | - Qiong Lou
- School of Science, Zhejiang University of Science and Technology, Hangzhou, China
| | - Fang Lu
- School of Science, Zhejiang University of Science and Technology, Hangzhou, China
| |
Collapse
|
24
|
Shao X, Niu R. Bridging Artificial Intelligence Models to Clinical Practice: Challenges in Lung Cancer Prediction. Radiol Artif Intell 2025; 7:e250080. [PMID: 40072120 DOI: 10.1148/ryai.250080] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/07/2025]
Affiliation(s)
- Xiaonan Shao
- Third Affiliated Hospital of Soochow University, No. 185 Juqian Street, Changzhou 213003, China
| | - Rong Niu
- Third Affiliated Hospital of Soochow University, No. 185 Juqian Street, Changzhou 213003, China
| |
Collapse
|
25
|
Giannakopoulos II, Carluccio G, Keerthivasan MB, Koerzdoerfer G, Lakshmanan K, De Moura HL, Serrallés JEC, Lattanzi R. MR electrical properties mapping using vision transformers and canny edge detectors. Magn Reson Med 2025; 93:1117-1131. [PMID: 39415436 PMCID: PMC11955224 DOI: 10.1002/mrm.30338] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2024] [Revised: 09/24/2024] [Accepted: 09/24/2024] [Indexed: 10/18/2024]
Abstract
PURPOSE We developed a 3D vision transformer-based neural network to reconstruct electrical properties (EP) from magnetic resonance measurements. THEORY AND METHODS Our network uses the magnitude of the transmit magnetic field of a birdcage coil, the associated transceive phase, and a Canny edge mask that identifies the object boundaries as inputs to compute the EP maps. We trained our network on a dataset of 10 000 synthetic tissue-mimicking phantoms and fine-tuned it on a dataset of 11 000 realistic head models. We assessed performance on in-distribution simulated data and out-of-distribution head models, with and without synthetic lesions. We further evaluated our network in experiments with an inhomogeneous phantom and a volunteer. RESULTS The conductivity and permittivity maps had an average peak normalized absolute error (PNAE) of 1.3% and 1.7% for the synthetic phantoms, respectively. For the realistic heads, the average PNAE for the conductivity and permittivity was 1.8% and 2.7%, respectively. The location of synthetic lesions was accurately identified, with reconstructed conductivity and permittivity values within 15% and 25% of the ground-truth, respectively. The conductivity and permittivity for the phantom experiment yielded 2.7% and 2.1% average PNAEs with respect to probe-measured values, respectively. The in vivo EP reconstruction truthfully preserved the subject's anatomy with average values over the entire head similar to the expected literature values. CONCLUSION We introduced a new learning-based approach for reconstructing EP from MR measurements obtained with a birdcage coil, marking an important step towards the development of clinically-usable in vivo EP reconstruction protocols.
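The input construction lends itself to a short sketch: the transmit-field magnitude, the transceive phase, and a Canny edge mask marking object boundaries are stacked as channels for the network. The array names, normalization, and Canny thresholds below are illustrative assumptions.
```python
import cv2
import numpy as np

b1_mag = np.random.rand(128, 128).astype(np.float32)   # stand-in |B1+| map
phase = np.random.rand(128, 128).astype(np.float32)    # transceive phase map

# Canny operates on 8-bit images, so rescale the magnitude map first
mag_u8 = cv2.normalize(b1_mag, None, 0, 255, cv2.NORM_MINMAX).astype(np.uint8)
edges = (cv2.Canny(mag_u8, threshold1=50, threshold2=150) / 255.0).astype(np.float32)

net_input = np.stack([b1_mag, phase, edges], axis=0)
print(net_input.shape)   # (3, 128, 128): channel stack fed to the transformer
```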
Collapse
Affiliation(s)
- Ilias I. Giannakopoulos
- The Bernard and Irene Schwartz Center for Biomedical Imaging and Center for Advanced Imaging Innovation and Research (CAIR), Department of Radiology, New York University Grossman School of Medicine, New York, New York, USA
| | | | | | | | - Karthik Lakshmanan
- The Bernard and Irene Schwartz Center for Biomedical Imaging and Center for Advanced Imaging Innovation and Research (CAIR), Department of Radiology, New York University Grossman School of Medicine, New York, New York, USA
| | - Hector L. De Moura
- The Bernard and Irene Schwartz Center for Biomedical Imaging and Center for Advanced Imaging Innovation and Research (CAIR), Department of Radiology, New York University Grossman School of Medicine, New York, New York, USA
| | - José E. Cruz Serrallés
- The Bernard and Irene Schwartz Center for Biomedical Imaging and Center for Advanced Imaging Innovation and Research (CAIR), Department of Radiology, New York University Grossman School of Medicine, New York, New York, USA
| | - Riccardo Lattanzi
- The Bernard and Irene Schwartz Center for Biomedical Imaging and Center for Advanced Imaging Innovation and Research (CAIR), Department of Radiology, New York University Grossman School of Medicine, New York, New York, USA
| |
Collapse
|
26
|
Buga R, Buzea CG, Agop M, Ochiuz L, Vasincu D, Popa O, Rusu DI, Știrban I, Eva L. Streamlit Application and Deep Learning Model for Brain Metastasis Monitoring After Gamma Knife Treatment. Biomedicines 2025; 13:423. [PMID: 40002836 PMCID: PMC11852629 DOI: 10.3390/biomedicines13020423] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2025] [Revised: 02/05/2025] [Accepted: 02/08/2025] [Indexed: 02/27/2025] Open
Abstract
Background/Objective: This study explores the use of AI-powered radiomics to classify and monitor brain metastasis progression and regression following Gamma Knife radiosurgery (GKRS) based on MRI imaging. A clinical decision support application was developed using Streamlit to provide real-time, AI-driven predictions for treatment monitoring. Methods: MRI scans from 60 patients (3194 images) were analyzed using a transfer learning-enhanced AlexNet deep learning model. Class imbalance was mitigated through dynamic class weighting and data augmentation to ensure equitable performance across all classes. Optimized preprocessing pipelines ensured dataset standardization. Model performance was evaluated using accuracy, precision, recall, F1-scores, and AUC, with 95% confidence intervals. Additionally, a comparative analysis of Gamma Knife radiosurgery (GKRS) outcomes and predictive modeling demonstrated strong correlations between tumor volume evolution and treatment response. The AI predictions and visualizations were integrated into a Streamlit-based application to ensure clinical usability and ease of access. The AI-driven approach effectively classified progression and regression patterns, reinforcing its potential for clinical integration. Results: The transfer learning model achieved flawless classification accuracy (100%; 95% CI: 100-100%) along with perfect precision, recall, and F1-scores. The AUC score of 1.0000 (95% CI: 1.0000-1.0000) indicated excellent discrimination between progression and regression cases. Compared to the baseline AlexNet model (99.53% accuracy; 95% CI: 98.90-100.00%), the TL-enhanced model resolved all misclassifications. Tumor volume analysis identified the baseline size as a key predictor of progression (Pearson r = 0.795, p < 0.0001). The training time (420.12 s) was shorter than that of ResNet-50 (443.38 s) and EfficientNet-B0 (439.87 s), while the model achieved equivalent metrics. Despite 100% accuracy, the model requires multi-center validation for generalizability. Conclusions: This study demonstrates that transfer learning with dynamic class weighting provides a highly accurate and reliable framework for monitoring brain metastases post-GKRS. The Streamlit-based AI application enhances clinical decision-making by improving diagnostic precision and reducing variability. Explainable AI techniques, such as Grad-CAM visualizations, improve interpretability and support clinical adoption. These findings emphasize the transformative potential of AI in personalized treatment strategies, extending applications to genomic profiling, survival modeling, and longitudinal follow-ups for brain metastasis management.
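The dynamic class weighting used against class imbalance follows a standard pattern: weights inversely proportional to class frequency are passed to the loss while the pretrained network's final layer is replaced. The sketch below uses the usual torchvision AlexNet head swap; the paper's exact weighting schedule may differ.
```python
import torch
import torch.nn as nn
from torchvision import models

labels = torch.tensor([0] * 900 + [1] * 100)       # imbalanced toy label set
counts = torch.bincount(labels).float()
weights = counts.sum() / (len(counts) * counts)    # inverse-frequency weights

model = models.alexnet(weights=models.AlexNet_Weights.IMAGENET1K_V1)
model.classifier[6] = nn.Linear(4096, 2)           # progression vs. regression
criterion = nn.CrossEntropyLoss(weight=weights)    # rare class counts more

x = torch.randn(4, 3, 224, 224)                    # MRI slices resized for AlexNet
loss = criterion(model(x), torch.tensor([0, 1, 0, 0]))
loss.backward()
```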
Collapse
Affiliation(s)
- Răzvan Buga
- Clinical Emergency Hospital “Prof. Dr. Nicolae Oblu” Iași, 700309 Iași, Romania; (R.B.); (I.Ș.); (L.E.)
| | - Călin Gh. Buzea
- Clinical Emergency Hospital “Prof. Dr. Nicolae Oblu” Iași, 700309 Iași, Romania; (R.B.); (I.Ș.); (L.E.)
- National Institute of Research and Development for Technical Physics, IFT Iași, 700050 Iași, Romania
| | - Maricel Agop
- Physics Department, Technical University “Gheorghe Asachi” Iași, 700050 Iași, Romania;
| | - Lăcrămioara Ochiuz
- Faculty of Medicine, University of Medicine and Pharmacy “Grigore T. Popa” Iași, 700115 Iași, Romania; (L.O.); (D.V.); (O.P.)
| | - Decebal Vasincu
- Faculty of Medicine, University of Medicine and Pharmacy “Grigore T. Popa” Iași, 700115 Iași, Romania; (L.O.); (D.V.); (O.P.)
| | - Ovidiu Popa
- Faculty of Medicine, University of Medicine and Pharmacy “Grigore T. Popa” Iași, 700115 Iași, Romania; (L.O.); (D.V.); (O.P.)
| | - Dragoș Ioan Rusu
- Faculty of Science, University “Vasile Alecsandri” of Bacău, 600115 Bacău, Romania;
| | - Ioana Știrban
- Clinical Emergency Hospital “Prof. Dr. Nicolae Oblu” Iași, 700309 Iași, Romania; (R.B.); (I.Ș.); (L.E.)
| | - Lucian Eva
- Clinical Emergency Hospital “Prof. Dr. Nicolae Oblu” Iași, 700309 Iași, Romania; (R.B.); (I.Ș.); (L.E.)
- Faculty of Medicine, Apollonia University, 700511 Iași, Romania
| |
Collapse
|
27
|
Afzal S, Rauf M, Ashraf S, Bin Md Ayob S, Ahmad Arfeen Z. CART-ANOVA-Based Transfer Learning Approach for Seven Distinct Tumor Classification Schemes with Generalization Capability. Diagnostics (Basel) 2025; 15:378. [PMID: 39941307 PMCID: PMC11816775 DOI: 10.3390/diagnostics15030378] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2024] [Revised: 12/31/2024] [Accepted: 01/22/2025] [Indexed: 02/16/2025] Open
Abstract
Background/Objectives: Deep transfer learning, leveraging convolutional neural networks (CNNs), has become a pivotal tool for brain tumor detection. However, key challenges include optimizing hyperparameter selection and enhancing the generalization capabilities of models. This study introduces a novel CART-ANOVA (Cartesian-ANOVA) hyperparameter tuning framework, which differs from traditional optimization methods by systematically integrating statistical significance testing (ANOVA) with the Cartesian product of hyperparameter values. This approach ensures robust and precise parameter tuning by evaluating the interaction effects between hyperparameters, such as batch size and learning rate, rather than relying solely on grid or random search. Additionally, it implements seven distinct classification schemes for brain tumors, aimed at improving diagnostic accuracy and robustness. Methods: The proposed framework employs a ResNet18-based knowledge transfer learning (KTL) model trained on a primary dataset, with 20% allocated for testing. Hyperparameters were optimized using CART-ANOVA analysis, and statistical validation ensured robust parameter selection. The model's generalization and robustness were evaluated on an independent second dataset. Performance metrics, including precision, accuracy, sensitivity, and F1 score, were compared against other pre-trained CNN models. Results: The framework achieved exceptional testing accuracy of 99.65% for four-class classification and 98.05% for seven-class classification on the source 1 dataset. It also maintained high generalization capabilities, achieving accuracies of 98.77% and 96.77% on the source 2 datasets for the same tasks. The incorporation of seven distinct classification schemes further enhanced variability and diagnostic capability, surpassing the performance of other pre-trained models. Conclusions: The CART-ANOVA hyperparameter tuning framework, combined with a ResNet18-based KTL approach, significantly improves brain tumor classification accuracy, robustness, and generalization. These advancements demonstrate strong potential for enhancing diagnostic precision and informing effective treatment strategies, contributing to advancements in medical imaging and AI-driven healthcare solutions.
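The CART-ANOVA idea can be illustrated in a few lines: evaluate the full Cartesian product of hyperparameter values with repeated runs, then use one-way ANOVA to test whether a factor has a statistically significant effect on accuracy. Here train_and_eval is a hypothetical stand-in for an actual training run, and the grids are illustrative.
```python
import itertools
import random
from scipy.stats import f_oneway

def train_and_eval(batch_size, lr):
    # Placeholder returning a noisy accuracy; a real run would train the model
    return 0.9 + 0.02 * random.random() - 0.01 * (lr == 1e-2)

batch_sizes = [8, 16, 32]
learning_rates = [1e-2, 1e-3, 1e-4]

results = {}
for bs, lr in itertools.product(batch_sizes, learning_rates):
    results[(bs, lr)] = [train_and_eval(bs, lr) for _ in range(5)]  # repeats

# Group accuracies by learning rate and test that factor's effect
groups = [
    [acc for (bs, lr), accs in results.items() if lr == value for acc in accs]
    for value in learning_rates
]
f_stat, p_value = f_oneway(*groups)
print(f"learning-rate effect: F = {f_stat:.2f}, p = {p_value:.4f}")
```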
Collapse
Affiliation(s)
- Shiraz Afzal
- Department of Electronic Engineering, Dawood University of Engineering and Technology, Karachi 74800, Pakistan;
| | - Muhammad Rauf
- Department of Electronic Engineering, Dawood University of Engineering and Technology, Karachi 74800, Pakistan;
| | - Shahzad Ashraf
- Department of Computer Science, DHA Suffa University, Karachi 75500, Pakistan
| | - Shahrin Bin Md Ayob
- Faculty of Electrical Engineering, Universiti Teknologi Malaysia, Johor Bahru 81310, Malaysia
| | - Zeeshan Ahmad Arfeen
- Department of Electrical Engineering, The Islamia University of Bahawalpur (IUB), Bahawalpur 63100, Pakistan
| |
Collapse
|
28
|
Rey-Barroso L, Vilaseca M, Royo S, Díaz-Doutón F, Lihacova I, Bondarenko A, Burgos-Fernández FJ. Training State-of-the-Art Deep Learning Algorithms with Visible and Extended Near-Infrared Multispectral Images of Skin Lesions for the Improvement of Skin Cancer Diagnosis. Diagnostics (Basel) 2025; 15:355. [PMID: 39941285 PMCID: PMC11817636 DOI: 10.3390/diagnostics15030355] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2024] [Revised: 01/20/2025] [Accepted: 01/21/2025] [Indexed: 02/16/2025] Open
Abstract
An estimated 60,000 people die annually from skin cancer, predominantly melanoma. The diagnosis of skin lesions primarily relies on visual inspection, but around half of lesions pose diagnostic challenges, often necessitating a biopsy. Non-invasive detection methods like Computer-Aided Diagnosis (CAD) using Deep Learning (DL) are becoming more prominent. This study focuses on the use of multispectral (MS) imaging to improve the skin lesion classification performance of DL models. We trained two convolutional neural networks (CNNs) on a dataset of MS images: a simple CNN with six two-dimensional (2D) convolutional layers, and a custom VGG-16 model with three-dimensional (3D) convolutional layers. The dataset included spectral cubes from 327 nevi, 112 melanomas, and 70 basal cell carcinomas (BCCs). We compared the performance of the CNNs trained with full spectral cubes versus only the three spectral bands closest to RGB wavelengths. The custom VGG-16 model achieved a classification accuracy of 71% with full spectral cubes and 45% with RGB-simulated images. The simple CNN achieved an accuracy of 83% with full spectral cubes and 36% with RGB-simulated images, demonstrating the added value of spectral information. These results confirm that MS imaging provides complementary information beyond traditional RGB images, contributing to improved classification performance. Although the dataset size remains a limitation, the findings indicate that MS imaging has significant potential for enhancing skin lesion diagnosis, paving the way for further advancements as larger datasets become available.
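The value of 3D convolutions here is that the spectral dimension is treated as depth, so kernels mix spatial and spectral context instead of seeing each band independently. The toy network below illustrates that layout; it is not the paper's custom VGG-16 variant.
```python
import torch
import torch.nn as nn

cube = torch.randn(2, 1, 10, 224, 224)   # (batch, channel, bands, H, W)

model = nn.Sequential(
    nn.Conv3d(1, 16, kernel_size=3, padding=1), nn.ReLU(),
    nn.MaxPool3d((1, 2, 2)),             # pool spatially, keep spectral bands
    nn.Conv3d(16, 32, kernel_size=3, padding=1), nn.ReLU(),
    nn.AdaptiveAvgPool3d(1), nn.Flatten(),
    nn.Linear(32, 3),                    # nevus vs. melanoma vs. BCC
)
logits = model(cube)                     # shape (2, 3)
```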
Collapse
Affiliation(s)
- Laura Rey-Barroso
- Centre for Sensors, Instruments and Systems Development, Universitat Politècnica de Catalunya, 08222 Terrassa, Spain; (M.V.); (S.R.); (F.D.-D.); (F.J.B.-F.)
| | - Meritxell Vilaseca
- Centre for Sensors, Instruments and Systems Development, Universitat Politècnica de Catalunya, 08222 Terrassa, Spain; (M.V.); (S.R.); (F.D.-D.); (F.J.B.-F.)
| | - Santiago Royo
- Centre for Sensors, Instruments and Systems Development, Universitat Politècnica de Catalunya, 08222 Terrassa, Spain; (M.V.); (S.R.); (F.D.-D.); (F.J.B.-F.)
| | - Fernando Díaz-Doutón
- Centre for Sensors, Instruments and Systems Development, Universitat Politècnica de Catalunya, 08222 Terrassa, Spain; (M.V.); (S.R.); (F.D.-D.); (F.J.B.-F.)
| | - Ilze Lihacova
- Institute of Atomic Physics and Spectroscopy, University of Latvia, 1004 Riga, Latvia;
| | - Andrey Bondarenko
- Faculty of Computer Science and Information Technology, Riga Technical University, 1048 Riga, Latvia;
| | - Francisco J. Burgos-Fernández
- Centre for Sensors, Instruments and Systems Development, Universitat Politècnica de Catalunya, 08222 Terrassa, Spain; (M.V.); (S.R.); (F.D.-D.); (F.J.B.-F.)
| |
Collapse
|
29
|
Zhang X, Zhao J, Zong D, Ren H, Gao C. Taming vision transformers for clinical laryngoscopy assessment. J Biomed Inform 2025; 162:104766. [PMID: 39827999 DOI: 10.1016/j.jbi.2024.104766] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2024] [Revised: 12/09/2024] [Accepted: 12/26/2024] [Indexed: 01/22/2025]
Abstract
OBJECTIVE Laryngoscopy, essential for diagnosing laryngeal cancer (LCA), faces challenges due to high inter-observer variability and the reliance on endoscopist expertise. Distinguishing precancerous from early-stage cancerous lesions is particularly challenging, even for experienced practitioners, given their similar appearances. This study aims to enhance laryngoscopic image analysis to improve early screening/detection of cancer or precancerous conditions. METHODS We propose MedFormer, a laryngeal cancer classification method based on the Vision Transformer (ViT). To address data scarcity, MedFormer employs a customized transfer learning approach that leverages the representational power of pre-trained transformers. This method enables robust out-of-domain generalization by fine-tuning a minimal set of additional parameters. RESULTS MedFormer exhibits sensitivity-specificity values of 98%-89% for identifying precancerous lesions (leukoplakia) and 89%-97% for detecting cancer, significantly surpassing CNN counterparts. Additionally, when compared to the two selected ViT-based models, MedFormer also demonstrates superior performance. It also outperforms physician visual evaluation (PVE) in certain scenarios and at least matches PVE performance in all cases. Visualizations using class activation maps (CAM) and deformable patches demonstrate MedFormer's interpretability, aiding clinicians in understanding the model's predictions. CONCLUSION We highlight the potential of vision transformers in clinical laryngoscopic assessments, presenting MedFormer as an effective method for the early detection of laryngeal cancer.
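Fine-tuning only a minimal set of added parameters on top of a frozen pre-trained ViT can be sketched as below. MedFormer's actual added modules differ; the frozen torchvision ViT and the simple linear head are assumptions standing in for them.
```python
import torch
import torch.nn as nn
from torchvision import models

vit = models.vit_b_16(weights=models.ViT_B_16_Weights.IMAGENET1K_V1)
for p in vit.parameters():
    p.requires_grad = False              # keep the pre-trained representation
vit.heads = nn.Linear(768, 3)            # normal / leukoplakia / cancer (assumed)

trainable = [p for p in vit.parameters() if p.requires_grad]
optimizer = torch.optim.AdamW(trainable, lr=1e-3)

x = torch.randn(2, 3, 224, 224)          # laryngoscopy frames
loss = nn.functional.cross_entropy(vit(x), torch.tensor([0, 2]))
loss.backward()
optimizer.step()
```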
Collapse
Affiliation(s)
- Xinzhu Zhang
- School of Computer Science and Technology, East China Normal University, North Zhongshan Road 3663, Shanghai, 200062, China
| | - Jing Zhao
- School of Computer Science and Technology, East China Normal University, North Zhongshan Road 3663, Shanghai, 200062, China.
| | - Daoming Zong
- School of Computer Science and Technology, East China Normal University, North Zhongshan Road 3663, Shanghai, 200062, China
| | - Henglei Ren
- Eye & ENT Hospital of Fudan University, Fenyang Road 83, Shanghai, 200000, China
| | - Chunli Gao
- Eye & ENT Hospital of Fudan University, Fenyang Road 83, Shanghai, 200000, China.
| |
Collapse
|
30
|
Xu S, Li W, Li Z, Zhao T, Zhang B. Facing Differences of Similarity: Intra- and Inter-Correlation Unsupervised Learning for Chest X-Ray Anomaly Detection. IEEE TRANSACTIONS ON MEDICAL IMAGING 2025; 44:801-814. [PMID: 39283780 DOI: 10.1109/tmi.2024.3461231] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/05/2025]
Abstract
Anomaly detection can significantly aid doctors in interpreting chest X-rays. The commonly used strategy involves using a pre-trained network to extract features from normal data to establish feature representations. However, when a pre-trained network is applied to more detailed X-rays, differences of similarity can limit the robustness of these feature representations. Therefore, we propose an intra- and inter-correlation learning framework for chest X-ray anomaly detection. Firstly, to better leverage the similar anatomical structure information in chest X-rays, we introduce the Anatomical-Feature Pyramid Fusion Module for feature fusion. This module aims to obtain fusion features with both local details and global contextual information. These fusion features are initialized by a trainable feature mapper and stored in a feature bank to serve as centers for learning. Furthermore, to address the Facing Differences of Similarity (FDS) problem introduced by the pre-trained network, we propose an intra- and inter-correlation learning strategy: 1) We use intra-correlation learning to establish intra-correlation between mapped features of individual images and semantic centers, thereby initially discovering lesions; 2) We employ inter-correlation learning to establish inter-correlation between mapped features of different images, further mitigating the differences of similarity introduced by the pre-trained network, and achieving effective detection results even in diverse chest disease environments. Finally, a comparison with 18 state-of-the-art methods on three datasets demonstrates the superiority and effectiveness of the proposed method across various scenarios.
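The feature-bank idea at the heart of this framework can be reduced to a nearest-center score: features of normal training images are stored as centers, and a test image far from every center is flagged. The fusion module and the full intra-/inter-correlation losses are beyond this sketch.
```python
import torch
import torch.nn.functional as F

bank = F.normalize(torch.randn(256, 128), dim=1)    # centers from normal data
test_feat = F.normalize(torch.randn(1, 128), dim=1) # mapped test-image feature

sim = test_feat @ bank.T                 # cosine similarity to every center
anomaly_score = 1 - sim.max()            # far from all centers -> likely abnormal
print(float(anomaly_score))
```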
Collapse
|
31
|
Huang GH, Lai WC, Chen TB, Hsu CC, Chen HY, Wu YC, Yeh LR. Deep Convolutional Neural Networks on Multiclass Classification of Three-Dimensional Brain Images for Parkinson's Disease Stage Prediction. JOURNAL OF IMAGING INFORMATICS IN MEDICINE 2025:10.1007/s10278-025-01402-z. [PMID: 39849204 DOI: 10.1007/s10278-025-01402-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/30/2024] [Revised: 12/11/2024] [Accepted: 01/01/2025] [Indexed: 01/25/2025]
Abstract
Parkinson's disease (PD), a degenerative disorder of the central nervous system, is commonly diagnosed using functional medical imaging techniques such as single-photon emission computed tomography (SPECT). In this study, we utilized two SPECT data sets (n = 634 and n = 202) from different hospitals to develop a model capable of accurately predicting PD stages, a multiclass classification task. We used the entire three-dimensional (3D) brain images as input and experimented with various model architectures. Initially, we treated the 3D images as sequences of two-dimensional (2D) slices and fed them sequentially into 2D convolutional neural network (CNN) models pretrained on ImageNet, averaging the outputs to obtain the final predicted stage. We also applied 3D CNN models pretrained on Kinetics-400. Additionally, we incorporated an attention mechanism to account for the varying importance of different slices in the prediction process. To further enhance model efficacy and robustness, we simultaneously trained the two data sets using weight sharing, a technique known as cotraining. Our results demonstrated that 2D models pretrained on ImageNet outperformed 3D models pretrained on Kinetics-400, and models utilizing the attention mechanism outperformed both 2D and 3D models. The cotraining technique proved effective in improving model performance when the cotraining data sets were sufficiently large.
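The slice-wise strategy with attention can be sketched compactly: each 2D slice passes through a pretrained 2D CNN, and the slice outputs are combined either by plain averaging or by learned attention weights. The backbone, feature size, and number of PD stages below are illustrative assumptions.
```python
import torch
import torch.nn as nn
from torchvision import models

backbone = models.resnet18(weights=models.ResNet18_Weights.IMAGENET1K_V1)
backbone.fc = nn.Identity()                  # 512-dim feature per slice
attn = nn.Linear(512, 1)                     # scores each slice's relevance
classifier = nn.Linear(512, 4)               # PD stages (count assumed)

volume = torch.randn(1, 40, 3, 224, 224)     # (batch, slices, C, H, W)
b, s = volume.shape[:2]
feats = backbone(volume.flatten(0, 1)).view(b, s, 512)

weights = attn(feats).softmax(dim=1)         # attention over slices
pooled = (weights * feats).sum(dim=1)        # replaces plain feats.mean(dim=1)
logits = classifier(pooled)
```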
Collapse
Affiliation(s)
- Guan-Hua Huang
- Institute of Statistics, National Yang Ming Chiao Tung University, Hsinchu, Taiwan.
| | - Wan-Chen Lai
- Institute of Statistics, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
| | - Tai-Been Chen
- Department of Radiological Technology, Faculty of Medical Technology, Teikyo University, Tokyo, Japan
- Infinity Co. Ltd, Taoyuan, Taiwan
- Der Lih Fuh Co. Ltd, Taoyuan, Taiwan
| | - Chien-Chin Hsu
- Department of Nuclear Medicine, Kaohsiung Chang Gung Memorial Hospital, Kaohsiung, Taiwan
| | - Huei-Yung Chen
- Department of Nuclear Medicine, E-Da Hospital, I-Shou University, Kaohsiung, Taiwan
| | - Yi-Chen Wu
- Department of Nuclear Medicine, E-Da Hospital, I-Shou University, Kaohsiung, Taiwan
- Department of Medical Imaging and Radiological Sciences, I-Shou University, Kaohsiung, Taiwan
| | - Li-Ren Yeh
- Department of Anesthesiology, E-Da Cancer Hospital, I-Shou University, Kaohsiung, Taiwan
| |
Collapse
|
32
|
Zhang M, Deng Y, Zhou Q, Gao J, Zhang D, Pan X. Advancing micro-nano supramolecular assembly mechanisms of natural organic matter by machine learning for unveiling environmental geochemical processes. ENVIRONMENTAL SCIENCE. PROCESSES & IMPACTS 2025; 27:24-45. [PMID: 39745028 DOI: 10.1039/d4em00662c] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/23/2025]
Abstract
The nano-self-assembly of natural organic matter (NOM) profoundly influences the occurrence and fate of NOM and pollutants in large-scale complex environments. Machine learning (ML) offers a promising and robust tool for interpreting and predicting the processes, structures and environmental effects of NOM self-assembly. This review seeks to provide a tutorial-like compilation of data source determination, algorithm selection, model construction, interpretability analyses, applications and challenges for big-data-based ML aimed at elucidating NOM self-assembly mechanisms in the environment. The results from advanced nano-submicron-scale spatial chemical analytical technologies are suggested as input data, as they provide combined information on molecular interactions and structural visualization. Existing ML algorithms need to handle multi-scale and multi-modal data, necessitating the development of new algorithmic frameworks. Interpretable supervised models are crucial owing to their strong capacity for quantifying structure-property-effect relationships and bridging the gap between simply data-driven ML and complicated NOM assembly practice. The necessity of and challenges in adopting ML to understand the geochemical behaviors and bioavailability of pollutants, as well as the elemental cycling processes that result from NOM self-assembly patterns, are then discussed and emphasized. Finally, a research framework integrating ML, experiments and theoretical simulation is proposed for comprehensively and efficiently understanding NOM self-assembly-involved environmental issues.
Collapse
Affiliation(s)
- Ming Zhang
- College of Geoinformatics, Zhejiang University of Technology, Hangzhou, 310014, P. R. China.
| | - Yihui Deng
- College of Environment, Zhejiang University of Technology, Hangzhou, 310014, P. R. China.
| | - Qianwei Zhou
- College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, P. R. China
| | - Jing Gao
- College of Environment, Zhejiang University of Technology, Hangzhou, 310014, P. R. China.
| | - Daoyong Zhang
- College of Geoinformatics, Zhejiang University of Technology, Hangzhou, 310014, P. R. China.
| | - Xiangliang Pan
- College of Environment, Zhejiang University of Technology, Hangzhou, 310014, P. R. China.
| |
Collapse
|
33
|
Qiong L, Chaofan L, Jinnan T, Liping C, Jianxiang S. Medical image segmentation based on frequency domain decomposition SVD linear attention. Sci Rep 2025; 15:2833. [PMID: 39843905 PMCID: PMC11754837 DOI: 10.1038/s41598-025-86315-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2024] [Accepted: 01/09/2025] [Indexed: 01/24/2025] Open
Abstract
Convolutional Neural Networks (CNNs) have achieved remarkable segmentation accuracy in medical image segmentation tasks. However, the Vision Transformer (ViT) model, with its capability of extracting global information, offers a significant advantage in contextual information compared to the limited receptive field of convolutional kernels in CNNs. Despite this, ViT models struggle to fully detect and extract high-frequency signals, such as textures and boundaries, in medical images. These high-frequency features are essential in medical imaging, as targets like tumors and pathological organs exhibit significant differences in texture and boundaries across different stages. Additionally, the high resolution of medical images leads to computational complexity in the self-attention mechanism of ViTs. To address these limitations, we propose a medical image segmentation network framework based on frequency domain decomposition using a Laplacian pyramid. This approach selectively computes attention features for high-frequency signals in the original image to enhance spatial structural information effectively. During attention feature computation, we introduce Singular Value Decomposition (SVD) to extract an effective representation matrix from the original image, which is then applied in the attention computation process for linear projection. This method reduces computational complexity while preserving essential features. We demonstrated the segmentation effectiveness and superiority of our model on the Abdominal Multi-Organ Segmentation dataset and the Dermatological Disease dataset; on the Synapse dataset, our model achieved a Dice score of 82.68 and an HD of 17.23 mm. Experimental results indicate that our model consistently exhibits segmentation effectiveness and improved accuracy across various datasets.
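Two ingredients of the method admit a short sketch: a Laplacian pyramid level isolating the high-frequency band (textures, boundaries), and a truncated SVD providing a low-rank projection matrix. How these feed the linear attention layer is an assumption summarized only in the comments below.
```python
import torch
import torch.nn.functional as F

def gaussian_blur(x):
    # Fixed 3x3 Gaussian kernel applied per channel
    k = torch.tensor([[1., 2., 1.], [2., 4., 2.], [1., 2., 1.]]) / 16.0
    k = k.view(1, 1, 3, 3).repeat(x.shape[1], 1, 1, 1)
    return F.conv2d(x, k, padding=1, groups=x.shape[1])

img = torch.randn(1, 1, 64, 64)      # grayscale medical image
low = gaussian_blur(img)
high = img - low                     # Laplacian level: high-frequency band

# Truncated SVD of the high-frequency map yields a low-rank basis, usable
# for the linear projections inside an attention layer
u, s, vh = torch.linalg.svd(high[0, 0], full_matrices=False)
rank = 8
basis = vh[:rank]                    # (rank, 64) representation matrix
projected = high[0, 0] @ basis.T     # (64, rank) reduced features
print(projected.shape)
```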
Collapse
Affiliation(s)
- Liu Qiong
- School of Medical Imaging, Jiangsu Medical College, Yancheng, 224005, Jiangsu, China.
| | - Li Chaofan
- Affiliated Hospital 6 of Nantong University, Yancheng Third People's Hospital, Yancheng, 224001, Jiangsu, China
| | - Teng Jinnan
- Affiliated Hospital 6 of Nantong University, Yancheng Third People's Hospital, Yancheng, 224001, Jiangsu, China
| | - Chen Liping
- Affiliated Hospital 6 of Nantong University, Yancheng Third People's Hospital, Yancheng, 224001, Jiangsu, China
| | - Song Jianxiang
- Affiliated Hospital 6 of Nantong University, Yancheng Third People's Hospital, Yancheng, 224001, Jiangsu, China.
| |
Collapse
|
34
|
Fang X, Chong CF, Wong KL, Simões M, Ng BK. Investigating the key principles in two-step heterogeneous transfer learning for early laryngeal cancer identification. Sci Rep 2025; 15:2146. [PMID: 39820368 PMCID: PMC11739633 DOI: 10.1038/s41598-024-84836-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2024] [Accepted: 12/27/2024] [Indexed: 01/19/2025] Open
Abstract
Data scarcity in medical images makes transfer learning a common approach in computer-aided diagnosis. Some disease classification tasks can rely on large homogeneous public datasets to train the transferred model, while others cannot, e.g., endoscopic laryngeal cancer image identification. Distinguished from most current works, this work pioneers a two-step heterogeneous transfer learning (THTL) framework for laryngeal cancer identification and summarizes the fundamental principles of intermediate domain selection. For heterogeneity and clear vascular representation, diabetic retinopathy images were chosen as THTL's intermediate domain. The experimental results reveal two vital principles of intermediate domain selection for future studies: 1) the size of the intermediate domain alone is not a sufficient condition for improved transfer learning performance; 2) even distinct vascular features in the intermediate domain do not guarantee improved performance in the target domain. We observe that radial vascular patterns benefit benign classification, whereas twisted and tangled patterns align more with malignant classification. Additionally, to compensate for the absence of twisted patterns in the intermediate domain, we propose the Step-Wise Fine-Tuning (SWFT) technique, guided by Layer Class Activation Map (LayerCAM) visualizations, which yields a 20.4% accuracy increase over THTL alone, higher even than fine-tuning all layers.
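Step-Wise Fine-Tuning amounts to progressively unfreezing blocks, starting from the classifier head and moving toward earlier layers, rather than fine-tuning everything at once. The backbone, stage boundaries, and schedule in this sketch are illustrative.
```python
import torch.nn as nn
from torchvision import models

model = models.resnet18(weights=models.ResNet18_Weights.IMAGENET1K_V1)
model.fc = nn.Linear(512, 2)                     # benign vs. malignant

stages = [model.fc, model.layer4, model.layer3]  # head first, then deeper
for p in model.parameters():
    p.requires_grad = False

for stage_idx, block in enumerate(stages):
    for p in block.parameters():
        p.requires_grad = True                   # unfreeze one more stage
    trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
    print(f"stage {stage_idx}: {trainable} trainable parameters")
    # ...train for a few epochs here before unfreezing the next stage...
```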
Collapse
Affiliation(s)
- Xinyi Fang
- Faculty of Applied Sciences, Macao Polytechnic University, Macao, 999078, China
- Department of Informatics Engineering, Centre for Informatics and Systems of the University of Coimbra, University of Coimbra, Coimbra, 3000, Portugal
| | - Chak Fong Chong
- Faculty of Applied Sciences, Macao Polytechnic University, Macao, 999078, China
- Department of Informatics Engineering, Centre for Informatics and Systems of the University of Coimbra, University of Coimbra, Coimbra, 3000, Portugal
| | - Kei Long Wong
- Faculty of Applied Sciences, Macao Polytechnic University, Macao, 999078, China
- Department of Computer Science and Engineering, University of Bologna, Bologna, 40100, Italy
| | - Marco Simões
- Department of Informatics Engineering, Centre for Informatics and Systems of the University of Coimbra, University of Coimbra, Coimbra, 3000, Portugal
| | - Benjamin K Ng
- Faculty of Applied Sciences, Macao Polytechnic University, Macao, 999078, China.
| |
Collapse
|
35
|
Maruyama S, Mizutani F, Watanabe H. Novel approach for quality control testing of medical displays using deep learning technology. Biomed Phys Eng Express 2025; 11:025004. [PMID: 39773861 DOI: 10.1088/2057-1976/ada6bd] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2024] [Accepted: 01/07/2025] [Indexed: 01/11/2025]
Abstract
Objectives: In digital image diagnosis using medical displays, it is crucial to rigorously manage display devices to ensure appropriate image quality and diagnostic safety. The aim of this study was to develop a model for the efficient quality control (QC) of medical displays, specifically addressing the measurement items of contrast response and maximum luminance as part of constancy testing, and to evaluate its performance. In addition, the study focused on whether these tasks could be addressed using a multitasking strategy. Methods: The model used in this study was constructed by fine-tuning a pretrained model and expanding it to a multioutput configuration that could perform both contrast response classification and maximum luminance regression. QC images displayed on a medical display were captured using a smartphone, and these images served as the input for the model. The performance was evaluated using the area under the receiver operating characteristic curve (AUC) for the classification task. For the regression task, correlation coefficients and Bland-Altman analysis were applied. We investigated the impact of different architectures and verified the performance of multi-task models against single-task models as a baseline. Results: Overall, the classification task achieved a high AUC of approximately 0.9. The correlation coefficients for the regression tasks ranged between 0.6 and 0.7 on average. Although the model tended to underestimate the maximum luminance values, the error margin was consistently within 5% for all conditions. Conclusion: These results demonstrate the feasibility of implementing an efficient QC system for medical displays and the usefulness of a multitask-based method. Thus, this study provides valuable insights into the potential to reduce the workload associated with medical-device management and into the development of QC systems for medical devices, highlighting the importance of future efforts to improve their accuracy and applicability.
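The multioutput configuration described above is a shared backbone with one classification head and one regression head, trained on a summed loss. The backbone choice, feature size, and loss weighting in this sketch are assumptions.
```python
import torch
import torch.nn as nn
import torch.nn.functional as F
from torchvision import models

backbone = models.mobilenet_v3_small(
    weights=models.MobileNet_V3_Small_Weights.IMAGENET1K_V1)
backbone.classifier = nn.Identity()      # expose 576-dim pooled features
cls_head = nn.Linear(576, 2)             # contrast response: pass / fail
reg_head = nn.Linear(576, 1)             # maximum luminance (cd/m^2)

x = torch.randn(4, 3, 224, 224)          # smartphone photos of QC test patterns
f = backbone(x)
loss = F.cross_entropy(cls_head(f), torch.tensor([0, 1, 0, 1])) + \
       F.mse_loss(reg_head(f).squeeze(1), torch.tensor([350., 420., 380., 400.]))
loss.backward()                          # both heads train through one backbone
```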
Collapse
Affiliation(s)
- Sho Maruyama
- Department of Radiological Technology, Gunma Prefectural College of Health Sciences, Maebashi, Gunma, Japan
| | - Fumiya Mizutani
- Department of Radiology, Mie University Hospital, Tsu, Mie, Japan
| | - Haruyuki Watanabe
- Department of Radiological Technology, Gunma Prefectural College of Health Sciences, Maebashi, Gunma, Japan
| |
Collapse
|
36
|
Kishor Kumar Reddy C, Kaza VS, Madana Mohana R, Alhameed M, Jeribi F, Alam S, Shuaib M. Detecting anomalies in smart wearables for hypertension: a deep learning mechanism. Front Public Health 2025; 12:1426168. [PMID: 39850864 PMCID: PMC11755415 DOI: 10.3389/fpubh.2024.1426168] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2024] [Accepted: 11/25/2024] [Indexed: 01/25/2025] Open
Abstract
Introduction The growing demand for real-time, affordable, and accessible healthcare has underscored the need for advanced technologies that can provide timely health monitoring. One such area is predicting arterial blood pressure (BP) using non-invasive methods, which is crucial for managing cardiovascular diseases. This research aims to address the limitations of current healthcare systems, particularly in remote areas, by leveraging deep learning techniques in Smart Health Monitoring (SHM). Methods This paper introduces a novel neural network architecture, ResNet-LSTM, to predict BP from physiological signals such as electrocardiogram (ECG) and photoplethysmogram (PPG). The combination of ResNet's feature extraction capabilities and LSTM's sequential data processing offers improved prediction accuracy. Comprehensive error analysis was conducted, and the model was validated using Leave-One-Out (LOO) cross-validation and an additional dataset. Results The ResNet-LSTM model showed superior performance, particularly with PPG data, achieving a mean absolute error (MAE) of 6.2 mmHg and a root mean square error (RMSE) of 8.9 mmHg for BP prediction. Despite the higher computational cost (~4,375 FLOPs), the improved accuracy and generalization across datasets demonstrate the model's robustness and suitability for continuous BP monitoring. Discussion The results confirm the potential of integrating ResNet-LSTM into SHM for accurate and non-invasive BP prediction. This approach also highlights the need for accurate anomaly detection in continuous monitoring systems, especially for wearable devices. Future work will focus on enhancing cloud-based infrastructures for real-time analysis and refining anomaly detection models to improve patient outcomes.
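The ResNet-LSTM hybrid can be sketched as convolutional residual blocks extracting local waveform features from a PPG window, followed by an LSTM over the resulting sequence and a regression head for systolic/diastolic BP. All layer sizes here are illustrative, not the paper's configuration.
```python
import torch
import torch.nn as nn

class ResBlock1d(nn.Module):
    def __init__(self, ch):
        super().__init__()
        self.conv1 = nn.Conv1d(ch, ch, 5, padding=2)
        self.conv2 = nn.Conv1d(ch, ch, 5, padding=2)
        self.act = nn.ReLU()

    def forward(self, x):                     # residual connection around convs
        return self.act(x + self.conv2(self.act(self.conv1(x))))

class ResNetLSTM(nn.Module):
    def __init__(self):
        super().__init__()
        self.stem = nn.Conv1d(1, 32, 7, stride=2, padding=3)
        self.blocks = nn.Sequential(ResBlock1d(32), ResBlock1d(32))
        self.lstm = nn.LSTM(32, 64, batch_first=True)
        self.head = nn.Linear(64, 2)          # systolic and diastolic BP

    def forward(self, ppg):                   # ppg: (batch, 1, samples)
        feats = self.blocks(self.stem(ppg))   # (batch, 32, time)
        out, _ = self.lstm(feats.transpose(1, 2))
        return self.head(out[:, -1])          # last hidden state -> BP pair

model = ResNetLSTM()
pred = model(torch.randn(8, 1, 1000))         # 8 PPG windows of 1000 samples
```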
Collapse
Affiliation(s)
| | | | - R. Madana Mohana
- Department of Artificial Intelligence and Data Science, Chaithanya Bharathi Institute of Technology, Hyderabad, Telangana, India
| | - Mohammed Alhameed
- Department of Computer Science, College of Engineering and Computer Science, Jazan University, Jazan, Saudi Arabia
| | - Fathe Jeribi
- Department of Computer Science, College of Engineering and Computer Science, Jazan University, Jazan, Saudi Arabia
| | - Shadab Alam
- Department of Computer Science, College of Engineering and Computer Science, Jazan University, Jazan, Saudi Arabia
| | - Mohammed Shuaib
- Department of Computer Science, College of Engineering and Computer Science, Jazan University, Jazan, Saudi Arabia
| |
Collapse
|
37
|
Jiang Y, Ebrahimpour L, Després P, Manem VS. A benchmark of deep learning approaches to predict lung cancer risk using national lung screening trial cohort. Sci Rep 2025; 15:1736. [PMID: 39799226 PMCID: PMC11724919 DOI: 10.1038/s41598-024-84193-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2024] [Accepted: 12/20/2024] [Indexed: 01/15/2025] Open
Abstract
Deep learning (DL) methods have demonstrated remarkable effectiveness in assisting with lung cancer risk prediction tasks using computed tomography (CT) scans. However, the lack of comprehensive comparison and validation of state-of-the-art (SOTA) models in practical settings limits their clinical application. This study aims to review and analyze current SOTA deep learning models for lung cancer risk prediction (malignant-benign classification). To evaluate the models' general performance, we selected 253 out of 467 patients from a subset of the National Lung Screening Trial (NLST) who had CT scans without contrast, which are the most commonly used, and divided them into training and test cohorts. The CT scans were preprocessed into 2D-image and 3D-volume formats according to their nodule annotations. We evaluated ten 3D and eleven 2D SOTA deep learning models, which were pretrained on large-scale general-purpose datasets (Kinetics and ImageNet) and radiological datasets (3DSeg-8, nnUnet and RadImageNet), for their lung cancer risk prediction performance. Our results showed that 3D-based deep learning models generally perform better than 2D models. On the test cohort, the best-performing 3D model achieved an AUROC of 0.86, while the best 2D model reached 0.79. The lowest AUROCs for the 3D and 2D models were 0.70 and 0.62, respectively. Furthermore, pretraining on large-scale radiological image datasets did not show the expected performance advantage over pretraining on general-purpose datasets. Both 2D and 3D deep learning models can handle lung cancer risk prediction tasks effectively, although 3D models generally have superior performance to their 2D competitors. Our findings highlight the importance of carefully selecting pretrained datasets and model architectures for lung cancer risk prediction. Overall, these results have important implications for the development and clinical integration of DL-based tools in lung cancer screening.
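One of the benchmarked configurations — a 3D CNN pretrained on Kinetics-400 and adapted to nodule malignancy prediction — can be sketched with torchvision's video models. The specific backbone, patch size, and channel handling here are assumptions, not the study's exact setup.
```python
import torch
import torch.nn as nn
from torchvision.models.video import r3d_18, R3D_18_Weights

model = r3d_18(weights=R3D_18_Weights.KINETICS400_V1)
model.fc = nn.Linear(model.fc.in_features, 2)   # benign vs. malignant

ct_patch = torch.randn(2, 1, 32, 112, 112)      # (batch, C, depth, H, W) nodule crops
x = ct_patch.repeat(1, 3, 1, 1, 1)              # replicate CT channel to match RGB stem
logits = model(x)
```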
Collapse
Affiliation(s)
- Yifan Jiang
- Centre de recherche du CHU de Québec-Université Laval, Quebec City, Canada
- Département de biologie moléculaire, de biochimie médicale et de pathologie, Université Laval, Quebec City, Canada
- Institute Intelligence and Data, Université Laval, Quebec City, Canada
| | - Leyla Ebrahimpour
- Centre de recherche du CHU de Québec-Université Laval, Quebec City, Canada
- Département de biologie moléculaire, de biochimie médicale et de pathologie, Université Laval, Quebec City, Canada
- Département de physique, de génie physique et d'optique, Université Laval, Quebec City, Canada
- Centre de recherche de l'Institut universitaire de cardiologie et de pneumologie de Québec, Quebec City, Canada
- Institute Intelligence and Data, Université Laval, Quebec City, Canada
| | - Philippe Després
- Département de physique, de génie physique et d'optique, Université Laval, Quebec City, Canada
- Centre de recherche de l'Institut universitaire de cardiologie et de pneumologie de Québec, Quebec City, Canada
- Big Data Research Center, Université Laval, Quebec City, Canada
- Institute Intelligence and Data, Université Laval, Quebec City, Canada
| | - Venkata Sk Manem
- Centre de recherche du CHU de Québec-Université Laval, Quebec City, Canada.
- Département de biologie moléculaire, de biochimie médicale et de pathologie, Université Laval, Quebec City, Canada.
- Cancer Research Center, Université Laval, Quebec City, Canada.
- Big Data Research Center, Université Laval, Quebec City, Canada.
- Institute Intelligence and Data, Université Laval, Quebec City, Canada.
| |
Collapse
|
38
|
Lee H, Cho S, Song J, Kim H, Shin Y. An Enhanced Approach Using AGS Network for Skin Cancer Classification. SENSORS (BASEL, SWITZERLAND) 2025; 25:394. [PMID: 39860766 PMCID: PMC11769443 DOI: 10.3390/s25020394] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/25/2024] [Revised: 12/19/2024] [Accepted: 01/10/2025] [Indexed: 01/27/2025]
Abstract
Skin cancer accounts for over 40% of all cancer diagnoses worldwide. However, accurately diagnosing skin cancer remains challenging for dermatologists, as multiple types of skin cancer often appear visually similar. The diagnostic accuracy of dermatologists ranges between 62% and 80%. Although AI models have shown promise in assisting with skin cancer classification in various studies, obtaining the large-scale medical image datasets required for AI model training is not straightforward. To address this limitation, this study proposes the AGS network, designed to overcome the challenges of small datasets and enhance the performance of skin cancer classifiers. The AGS network integrates three key modules: Augmentation (A), GAN (G), and Segmentation (S). It was evaluated using eight deep learning classifiers-GoogLeNet, DenseNet201, ResNet50, MobileNet V3, EfficientNet B0, ViT, EfficientNet V2, and Swin Transformers-on the HAM10000 dataset. Five model configurations were also tested to assess the contribution of each module. The results showed that all eight classifiers demonstrated consistent performance improvements with the AGS network. In particular, EfficientNet V2 + AGS achieved the most significant performance gains over the baseline model, with an increase of +0.1808 in Accuracy and +0.1674 in F1-Score. Among all configurations, ResNet50+AGS achieved the best overall performance, with an Accuracy of 95.87% and an F1-Score of 95.73%. While most previous studies focused on single augmentation methods, this study demonstrates the effectiveness of combining multiple augmentation techniques within an integrated framework. The AGS network demonstrates how integrating diverse methods can improve the performance of skin cancer classification models.
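Of the three AGS stages, the Augmentation (A) stage is the most directly sketchable: several augmentation techniques combined into one pipeline applied to each training image. The specific transforms and parameters below are assumptions; the GAN (G) and Segmentation (S) stages are separate models not reproduced here.
```python
from torchvision import transforms

augment = transforms.Compose([
    transforms.RandomHorizontalFlip(),
    transforms.RandomRotation(degrees=20),
    transforms.ColorJitter(brightness=0.2, contrast=0.2),
    transforms.RandomResizedCrop(224, scale=(0.8, 1.0)),
    transforms.ToTensor(),
])
# usage: augmented = augment(pil_image) for each HAM10000 training image
```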
Collapse
Affiliation(s)
- Hwanyoung Lee
- Department of Computer Science and Information Engineering, The Catholic University of Korea, Bucheon 14662, Republic of Korea;
| | - Seeun Cho
- Department of Artificial Intelligence, The Catholic University of Korea, Bucheon 14662, Republic of Korea; (S.C.); (J.S.)
| | - Jiyoon Song
- Department of Artificial Intelligence, The Catholic University of Korea, Bucheon 14662, Republic of Korea; (S.C.); (J.S.)
| | - Hoyoung Kim
- Department of Computer Science, Stony Brook University, Stony Brook, NY 11794, USA;
| | - Youjin Shin
- Department of Data Science, The Catholic University of Korea, Bucheon 14662, Republic of Korea
| |
|
39
|
Li C, Liao Y, Ding C, Ye Z. MDAPT: Multi-Modal Depth Adversarial Prompt Tuning to Enhance the Adversarial Robustness of Visual Language Models. SENSORS (BASEL, SWITZERLAND) 2025; 25:258. [PMID: 39797049 PMCID: PMC11723442 DOI: 10.3390/s25010258] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/03/2024] [Revised: 12/23/2024] [Accepted: 01/02/2025] [Indexed: 01/13/2025]
Abstract
Large visual language models like Contrastive Language-Image Pre-training (CLIP), despite their excellent performance, are highly vulnerable to adversarial examples. This work investigates the accuracy and robustness of visual language models (VLMs) from a novel multi-modal perspective. We propose a multi-modal fine-tuning method called Multi-modal Depth Adversarial Prompt Tuning (MDAPT), which guides the generation of visual prompts through text prompts to improve the accuracy and robustness of visual language models. In extensive experiments under a perturbation budget of ϵ = 4/255, the method yielded significant gains on three datasets: compared with traditionally hand-designed prompts, accuracy and robustness increased by an average of 17.84% and 10.85%, respectively. These improvements also hold across different attack methods. Under an efficient tuning setting, average accuracy and robustness improved by 32.16% and 21.00%, respectively, over hand-designed prompts across three different attacks.
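Robustness numbers like those above are typically measured against gradient-based attacks at a fixed L-infinity budget. The sketch below implements standard projected gradient descent (PGD) at the quoted ϵ = 4/255 with a toy stand-in classifier; it illustrates the kind of attack such methods are evaluated against, not the MDAPT prompt-tuning procedure itself.

```python
# Standard PGD attack sketch; the classifier is a toy stand-in.
import torch
import torch.nn as nn
import torch.nn.functional as F

def pgd_attack(model, x, y, eps=4/255, alpha=1/255, steps=10):
    """Craft adversarial examples inside an L-infinity ball of radius eps."""
    x_adv = (x + torch.empty_like(x).uniform_(-eps, eps)).clamp(0, 1).detach()
    for _ in range(steps):
        x_adv.requires_grad_(True)
        loss = F.cross_entropy(model(x_adv), y)
        grad = torch.autograd.grad(loss, x_adv)[0]
        x_adv = x_adv.detach() + alpha * grad.sign()           # ascend the loss
        x_adv = torch.min(torch.max(x_adv, x - eps), x + eps)  # project to ball
        x_adv = x_adv.clamp(0, 1)                              # valid pixels
    return x_adv.detach()

model = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 10))
x, y = torch.rand(8, 3, 32, 32), torch.randint(0, 10, (8,))
x_adv = pgd_attack(model, x, y)
robust_acc = (model(x_adv).argmax(1) == y).float().mean()
print(robust_acc)  # "robustness" = accuracy on adversarial inputs
```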
Affiliation(s)
- Chao Li
- School of Computer Science, Hubei University of Technology, Wuhan 430068, China; (C.L.); (Y.L.); (Z.Y.)
| | - Yonghao Liao
- School of Computer Science, Hubei University of Technology, Wuhan 430068, China; (C.L.); (Y.L.); (Z.Y.)
| | - Caichang Ding
- School of Computer and Information Science, Hubei Engineering University, Xiaogan 432000, China
| | - Zhiwei Ye
- School of Computer Science, Hubei University of Technology, Wuhan 430068, China; (C.L.); (Y.L.); (Z.Y.)
| |
|
40
|
Silva-Rodríguez J, Chakor H, Kobbi R, Dolz J, Ben Ayed I. A Foundation Language-Image Model of the Retina (FLAIR): encoding expert knowledge in text supervision. Med Image Anal 2025; 99:103357. [PMID: 39418828 DOI: 10.1016/j.media.2024.103357] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2023] [Revised: 05/06/2024] [Accepted: 09/23/2024] [Indexed: 10/19/2024]
Abstract
Foundation vision-language models are currently transforming computer vision and are on the rise in medical imaging, fueled by their very promising generalization capabilities. However, the initial attempts to transfer this new paradigm to medical imaging have shown less impressive performances than those observed in other domains, due to the significant domain shift and the complex, expert domain knowledge inherent to medical-imaging tasks. Motivated by the need for domain-expert foundation models, we present FLAIR, a pre-trained vision-language model for universal retinal fundus image understanding. To this end, we compiled 38 open-access, mostly categorical fundus imaging datasets from various sources, with up to 101 different target conditions and 288,307 images. We integrate the expert's domain knowledge in the form of descriptive textual prompts, during both pre-training and zero-shot inference, enhancing the less-informative categorical supervision of the data. This textual expert knowledge, compiled from the relevant clinical literature and community standards, describes the fine-grained features of the pathologies as well as the hierarchies and dependencies between them. We report comprehensive evaluations, which illustrate the benefit of integrating expert knowledge and the strong generalization capabilities of FLAIR under difficult scenarios with domain shifts or unseen categories. When adapted with a lightweight linear probe, FLAIR outperforms fully-trained, dataset-focused models, more so in the few-shot regimes. Interestingly, FLAIR outperforms larger-scale generalist image-language models and retina domain-specific self-supervised networks by a wide margin, which emphasizes the potential of embedding experts' domain knowledge and the limitations of generalist models in medical imaging. The pre-trained model is available at: https://github.com/jusiro/FLAIR.
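The prompt-based zero-shot mechanism FLAIR builds on can be summarized in a few lines: normalized image and text embeddings are compared by temperature-scaled cosine similarity, with descriptive expert prompts standing in for bare category names. The encoders and prompts below are random placeholders for illustration, not the released FLAIR weights (see the repository above).

```python
# CLIP-style zero-shot classification sketch with placeholder encoders.
import torch
import torch.nn as nn
import torch.nn.functional as F

image_encoder = nn.Sequential(nn.Flatten(), nn.Linear(3 * 224 * 224, 512))
text_encoder = nn.Embedding(1000, 512)  # stand-in for a real text tower

# Expert-knowledge prompts describe findings rather than bare labels.
prompts = [
    "no diabetic retinopathy",
    "mild diabetic retinopathy with microaneurysms",
    "severe diabetic retinopathy with hemorrhages",
]
prompt_ids = torch.arange(len(prompts))  # pretend tokenization: one id each

image = torch.rand(1, 3, 224, 224)
img_emb = F.normalize(image_encoder(image), dim=-1)      # (1, 512)
txt_emb = F.normalize(text_encoder(prompt_ids), dim=-1)  # (3, 512)

logits = img_emb @ txt_emb.t() / 0.07  # temperature-scaled cosine similarity
pred = logits.argmax(dim=-1).item()
print(prompts[pred])
```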
Affiliation(s)
| | | | | | - Jose Dolz
- ÉTS Montréal, Québec, Canada; Centre de Recherche du Centre Hospitalier de l'Université de Montréal (CR-CHUM), Québec, Canada
| | - Ismail Ben Ayed
- ÉTS Montréal, Québec, Canada; Centre de Recherche du Centre Hospitalier de l'Université de Montréal (CR-CHUM), Québec, Canada
| |
|
41
|
Drazinos P, Gatos I, Katsakiori PF, Tsantis S, Syrmas E, Spiliopoulos S, Karnabatidis D, Theotokas I, Zoumpoulis P, Hazle JD, Kagadis GC. Comparison of deep learning schemes in grading non-alcoholic fatty liver disease using B-mode ultrasound hepatorenal window images with liver biopsy as the gold standard. Phys Med 2025; 129:104862. [PMID: 39626614 DOI: 10.1016/j.ejmp.2024.104862] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/05/2024] [Revised: 10/11/2024] [Accepted: 11/27/2024] [Indexed: 01/07/2025] Open
Abstract
BACKGROUND/INTRODUCTION To evaluate the performance of pre-trained deep learning schemes (DLS) in hepatic steatosis (HS) grading of Non-Alcoholic Fatty Liver Disease (NAFLD) patients, using as input B-mode US images containing right kidney (RK) cortex and liver parenchyma (LP) areas indicated by an expert radiologist. METHODS A total of 112 consecutively enrolled, biopsy-validated NAFLD patients underwent a regular abdominal B-mode US examination. For each patient, a radiologist obtained a B-mode US image containing the RK cortex and LP and marked a point between the RK and LP, around which a window was automatically cropped. The cropped image dataset was augmented using up-sampling, and the augmented and non-augmented datasets were sorted by HS grade. Each dataset was split into training (70%) and testing (30%) sets and fed separately as input to the InceptionV3, MobileNetV2, ResNet50, DenseNet201, and NASNetMobile pre-trained DLS. A receiver operating characteristic (ROC) analysis of hepatorenal index (HRI) measurements made by the radiologist from the same cropped images was used for comparison with the performance of the DLS. RESULTS On the test data, the DLS reached 89.15%-93.75% accuracy when comparing HS grades S0-S1 vs. S2-S3 and 79.69%-91.21% accuracy for S0 vs. S1 vs. S2 vs. S3 with augmentation, and 80.45%-82.73% accuracy when comparing S0-S1 vs. S2-S3 and 59.54%-63.64% accuracy for S0 vs. S1 vs. S2 vs. S3 without augmentation. The performance of the radiologists' HRI measurement after ROC analysis was 82%, 91.56%, and 96.19% for thresholds of S ≥ S1, S ≥ S2, and S = S3, respectively. CONCLUSION All networks achieved high performance in HS assessment. DenseNet201 with the use of augmented data seems to be the most efficient supplementary tool for NAFLD diagnosis and grading.
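The preprocessing-plus-transfer-learning recipe described above (crop a window around the radiologist's marked point, then fine-tune a pre-trained network on the crops) can be sketched as follows. The window size, the four-grade head, and the offline weights=None choice are illustrative assumptions; in practice ImageNet weights would be loaded for transfer learning.

```python
# Sketch: crop around a marked point, then adapt DenseNet201 for 4 HS grades.
import torch
import torch.nn as nn
from torchvision import models

def crop_window(image, cx, cy, size=128):
    """Crop a size x size patch centred on (cx, cy) from a (C, H, W) tensor."""
    half = size // 2
    _, h, w = image.shape
    x0 = max(0, min(cx - half, w - size))  # clamp the window inside the frame
    y0 = max(0, min(cy - half, h - size))
    return image[:, y0:y0 + size, x0:x0 + size]

us_image = torch.rand(1, 512, 512)          # grayscale B-mode frame (toy)
patch = crop_window(us_image, cx=300, cy=260)
patch = patch.repeat(3, 1, 1).unsqueeze(0)  # 3-channel batch for DenseNet

net = models.densenet201(weights=None)      # use IMAGENET1K_V1 in practice
net.classifier = nn.Linear(net.classifier.in_features, 4)  # grades S0-S3
print(net(patch).shape)  # torch.Size([1, 4])
```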
Affiliation(s)
- Petros Drazinos
- 3DMI Research Group, Department of Medical Physics, School of Medicine, University of Patras, Rion, GR 26504, Greece; Diagnostic Echotomography SA, Kifissia, GR 14561, Greece
| | - Ilias Gatos
- 3DMI Research Group, Department of Medical Physics, School of Medicine, University of Patras, Rion, GR 26504, Greece
| | - Paraskevi F Katsakiori
- 3DMI Research Group, Department of Medical Physics, School of Medicine, University of Patras, Rion, GR 26504, Greece
| | - Stavros Tsantis
- 3DMI Research Group, Department of Medical Physics, School of Medicine, University of Patras, Rion, GR 26504, Greece
| | - Efstratios Syrmas
- 3DMI Research Group, Department of Medical Physics, School of Medicine, University of Patras, Rion, GR 26504, Greece
| | - Stavros Spiliopoulos
- Second Department of Radiology, School of Medicine, University of Athens, Athens, GR 12461, Greece
| | - Dimitris Karnabatidis
- Department of Radiology, School of Medicine, University of Patras, Patras, GR 26504, Greece
| | | | | | - John D Hazle
- Department of Imaging Physics, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - George C Kagadis
- 3DMI Research Group, Department of Medical Physics, School of Medicine, University of Patras, Rion, GR 26504, Greece; Department of Imaging Physics, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA.
| |
|
42
|
Xu J, Huang K, Zhong L, Gao Y, Sun K, Liu W, Zhou Y, Guo W, Guo Y, Zou Y, Duan Y, Lu L, Wang Y, Chen X, Zhao S. RemixFormer++: A Multi-Modal Transformer Model for Precision Skin Tumor Differential Diagnosis With Memory-Efficient Attention. IEEE TRANSACTIONS ON MEDICAL IMAGING 2025; 44:320-337. [PMID: 39120989 DOI: 10.1109/tmi.2024.3441012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/11/2024]
Abstract
Diagnosing malignant skin tumors accurately at an early stage can be challenging due to the ambiguous and even confusing visual characteristics displayed by various categories of skin tumors. To improve diagnostic precision, all available clinical data from multiple sources, particularly clinical images, dermoscopy images, and medical history, could be considered. Aligning with clinical practice, we propose a novel Transformer model, named RemixFormer++, that consists of a clinical image branch, a dermoscopy image branch, and a metadata branch. Given the unique characteristics inherent in clinical and dermoscopy images, specialized attention strategies are adopted for each type. Clinical images are processed through a top-down architecture, capturing both localized lesion details and global contextual information. Conversely, dermoscopy images undergo bottom-up processing with two-level hierarchical encoders designed to pinpoint fine-grained structural and textural features. A dedicated metadata branch seamlessly integrates non-visual information by encoding relevant patient data. Fusing the features from the three branches substantially boosts disease classification accuracy. RemixFormer++ demonstrates exceptional performance on four single-modality datasets (PAD-UFES-20, ISIC 2017/2018/2019). Compared with the previous best method on the public multi-modal Derm7pt dataset, we achieved an absolute 5.3% increase in averaged F1 and 1.2% in accuracy for the classification of five skin tumors. Furthermore, using a large-scale in-house dataset of 10,351 patients with the twelve most common skin tumors, our method obtained an overall classification accuracy of 92.6%. These promising results, on par with or better than the performance of 191 dermatologists in a comprehensive reader study, clearly indicate the potential clinical usability of our method.
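A schematic of the three-branch fusion described above: separate encoders for the clinical and dermoscopy images, a small MLP for metadata, and a classification head over the concatenated features. The trivial flatten-and-project encoders and the metadata width are placeholders, not the paper's top-down and bottom-up attention branches.

```python
# Schematic three-branch fusion; the branch encoders are simplified stand-ins.
import torch
import torch.nn as nn

class ThreeBranchFusion(nn.Module):
    def __init__(self, num_classes=12, dim=256):
        super().__init__()
        self.clinical = nn.Sequential(nn.Flatten(), nn.LazyLinear(dim))
        self.dermoscopy = nn.Sequential(nn.Flatten(), nn.LazyLinear(dim))
        self.metadata = nn.Sequential(nn.Linear(8, dim), nn.ReLU())
        self.head = nn.Linear(3 * dim, num_classes)

    def forward(self, clin_img, derm_img, meta):
        feats = torch.cat([
            self.clinical(clin_img),     # clinical photo branch
            self.dermoscopy(derm_img),   # dermoscopy branch
            self.metadata(meta),         # non-visual patient data branch
        ], dim=-1)
        return self.head(feats)          # fused features drive classification

model = ThreeBranchFusion()
logits = model(torch.rand(2, 3, 224, 224),  # clinical images
               torch.rand(2, 3, 224, 224),  # dermoscopy images
               torch.rand(2, 8))            # encoded metadata
print(logits.shape)  # torch.Size([2, 12]): twelve common skin tumours
```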
|
43
|
Wang Y, Zhang W, Liu X, Tian L, Li W, He P, Huang S, He F, Pan X. Artificial intelligence in precision medicine for lung cancer: A bibliometric analysis. Digit Health 2025; 11:20552076241300229. [PMID: 39758259 PMCID: PMC11696962 DOI: 10.1177/20552076241300229] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2023] [Accepted: 10/28/2024] [Indexed: 01/07/2025] Open
Abstract
Background The increasing body of evidence has been stimulating the application of artificial intelligence (AI) in precision medicine research for lung cancer. This trend necessitates a comprehensive overview of the growing number of publications to facilitate researchers' understanding of this field. Method The bibliometric data for the current analysis were extracted from the Web of Science Core Collection database; CiteSpace, VOSviewer, and an online website were applied to the analysis. Results After the data were filtered, the search yielded 4062 manuscripts, 92.27% of which were published from 2014 onwards. The main contributing countries were China, the United States, India, Japan, and Korea. These publications appeared mainly in the following scientific disciplines: Radiology and Nuclear Medicine, Medical Imaging, Oncology, and Computer Science. Notably, Li Weimin and Aerts Hugo J. W. L. stand out as leading authorities in this domain. In the keyword co-occurrence and co-citation cluster analysis of the publications, the knowledge base was divided into four readily interpretable clusters: screening, diagnosis, treatment, and prognosis. Conclusion This bibliometric study reveals that deep learning frameworks and AI-based radiomics are receiving attention. High-quality and standardized data have the potential to revolutionize lung cancer screening and diagnosis in the era of precision medicine. However, high-quality clinical datasets, the development of new and combined AI models, and their consistent assessment remain prerequisites before current research can be effectively applied in clinical practice.
Affiliation(s)
- Yuchai Wang
- Department of Pharmacy, Hunan University of Chinese Medicine, Changsha, Hunan Province, China
| | - Weilong Zhang
- Department of Pharmacy, Hunan University of Chinese Medicine, Changsha, Hunan Province, China
| | - Xiang Liu
- Department of Pharmacy, Hunan University of Chinese Medicine, Changsha, Hunan Province, China
| | - Li Tian
- Department of Pharmacy, Hunan University of Chinese Medicine, Changsha, Hunan Province, China
| | - Wenjiao Li
- Department of Pharmacy, Hunan University of Chinese Medicine, Changsha, Hunan Province, China
| | - Peng He
- Department of Pharmacy, Hunan University of Chinese Medicine, Changsha, Hunan Province, China
| | - Sheng Huang
- Department of Pharmacy, Hunan University of Chinese Medicine, Changsha, Hunan Province, China
- Jiuzhitang Co., Ltd, Changsha, Hunan Province, China
| | - Fuyuan He
- School of Pharmacy, Hunan University of Chinese Medicine, Changsha, Hunan Province, China
| | - Xue Pan
- School of Pharmacy, Hunan University of Chinese Medicine, Changsha, Hunan Province, China
| |
|
44
|
Pérez-Núñez JR, Rodríguez C, Vásquez-Serpa LJ, Navarro C. The Challenge of Deep Learning for the Prevention and Automatic Diagnosis of Breast Cancer: A Systematic Review. Diagnostics (Basel) 2024; 14:2896. [PMID: 39767257 PMCID: PMC11675111 DOI: 10.3390/diagnostics14242896] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2024] [Revised: 11/24/2024] [Accepted: 12/18/2024] [Indexed: 01/11/2025] Open
Abstract
OBJECTIVES This review aims to evaluate several convolutional neural network (CNN) models applied to breast cancer detection, to identify and categorize CNN variants in recent studies, and to analyze their specific strengths, limitations, and challenges. METHODS Using PRISMA methodology, this review examines studies that focus on deep learning techniques, specifically CNN, for breast cancer detection. Inclusion criteria encompassed studies from the past five years, with duplicates and those unrelated to breast cancer excluded. A total of 62 articles from the IEEE, SCOPUS, and PubMed databases were analyzed, exploring CNN architectures and their applicability in detecting this pathology. RESULTS The review found that CNN models with advanced architecture and greater depth exhibit high accuracy and sensitivity in image processing and feature extraction for breast cancer detection. CNN variants that integrate transfer learning proved particularly effective, allowing the use of pre-trained models with less training data required. However, challenges include the need for large, labeled datasets and significant computational resources. CONCLUSIONS CNNs represent a promising tool in breast cancer detection, although future research should aim to create models that are more resource-efficient and maintain accuracy while reducing data requirements, thus improving clinical applicability.
Affiliation(s)
- Jhelly-Reynaluz Pérez-Núñez
- Facultad de Ingeniería de Sistemas e Informática, Universidad Nacional Mayor de San Marcos (UNMSM), Lima 15081, Peru; (C.R.); (L.-J.V.-S.); (C.N.)
| | | | | | | |
|
45
|
Liu G, He J, Li P, Zhao Z, Zhong S. Cross-Modal self-supervised vision language pre-training with multiple objectives for medical visual question answering. J Biomed Inform 2024; 160:104748. [PMID: 39536998 DOI: 10.1016/j.jbi.2024.104748] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2024] [Revised: 09/29/2024] [Accepted: 11/03/2024] [Indexed: 11/16/2024]
Abstract
Medical Visual Question Answering (VQA) is a task that aims to provide answers to questions about medical images, utilizing both visual and textual information in the reasoning process. The absence of large-scale annotated medical VQA datasets presents a formidable obstacle to training a medical VQA model from scratch in an end-to-end manner. Existing works have used image captioning datasets in the pre-training stage and fine-tuned to downstream VQA tasks. Following the same paradigm, we use a collection of public medical image captioning datasets to pre-train multimodality models in a self-supervised setup, and fine-tune them on downstream medical VQA tasks. In this work, we propose a method featuring Cross-Modal pre-training with Multiple Objectives (CMMO), which includes masked image modeling, masked language modeling, image-text matching, and image-text contrastive learning. The proposed method is designed to associate the visual features of medical images with corresponding medical concepts in captions, to learn aligned vision and language feature representations and multi-modal interactions. The experimental results reveal that our proposed CMMO method outperforms state-of-the-art methods on three public medical VQA datasets, showing absolute improvements of 2.6%, 0.9%, and 4.0% on the VQA-RAD, PathVQA, and SLAKE datasets, respectively. We also conduct comprehensive ablation studies to validate our method, and visualize the attention maps, which show strong interpretability. The code and pre-trained weights will be released at https://github.com/pengfeiliHEU/CMMO.
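Of the four CMMO objectives, image-text contrastive learning is the one most directly responsible for aligning vision and language representations. Below is a minimal symmetric InfoNCE sketch with random embeddings standing in for encoder outputs; in the paper's setup this term would be summed with the masked image modeling, masked language modeling, and image-text matching losses.

```python
# Symmetric image-text contrastive (InfoNCE) loss sketch.
import torch
import torch.nn.functional as F

def itc_loss(img_emb, txt_emb, temperature=0.07):
    img = F.normalize(img_emb, dim=-1)
    txt = F.normalize(txt_emb, dim=-1)
    logits = img @ txt.t() / temperature  # (B, B) similarity matrix
    targets = torch.arange(img.size(0))   # diagonal entries are true pairs
    return 0.5 * (F.cross_entropy(logits, targets) +
                  F.cross_entropy(logits.t(), targets))

img_emb, txt_emb = torch.randn(16, 256), torch.randn(16, 256)
print(itc_loss(img_emb, txt_emb))  # pulls paired embeddings together
```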
Affiliation(s)
- Gang Liu
- College of Computer Science and Technology, Harbin Engineering University, Harbin, 150001, Heilongjiang, China.
| | - Jinlong He
- College of Computer Science and Technology, Harbin Engineering University, Harbin, 150001, Heilongjiang, China.
| | - Pengfei Li
- College of Computer Science and Technology, Harbin Engineering University, Harbin, 150001, Heilongjiang, China.
| | - Zixu Zhao
- College of Computer Science and Technology, Harbin Engineering University, Harbin, 150001, Heilongjiang, China.
| | - Shenjun Zhong
- Monash Biomedical Imaging, Monash University, Melbourne, 3800, Victoria, Australia.
| |
|
46
|
Jun E, Jeong S, Heo DW, Suk HI. Medical Transformer: Universal Encoder for 3-D Brain MRI Analysis. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:17779-17789. [PMID: 37738193 DOI: 10.1109/tnnls.2023.3308712] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/24/2023]
Abstract
Transfer learning has attracted considerable attention in medical image analysis because of the limited number of annotated 3-D medical datasets available for training data-driven deep learning models in the real world. We propose Medical Transformer, a novel transfer learning framework that effectively models 3-D volumetric images as a sequence of 2-D image slices. To improve the high-level representations in 3-D form by empowering spatial relations, we use a multi-view approach that leverages information from the three planes of the 3-D volume while providing parameter-efficient training. To build a source model generally applicable to various tasks, we pretrain the model using self-supervised learning (SSL), with masked encoding vector prediction as a proxy task, on a large-scale dataset of normal, healthy brain magnetic resonance imaging (MRI) scans. Our pretrained model is evaluated on three downstream tasks that are widely studied in brain MRI research: (1) brain disease diagnosis, (2) brain age prediction, and (3) brain tumor segmentation. Experimental results demonstrate that our Medical Transformer outperforms state-of-the-art (SOTA) transfer learning methods, efficiently reducing the number of parameters by up to approximately 92% for classification and regression tasks and 97% for the segmentation task, while also achieving good performance in scenarios where only a fraction of the training samples is used.
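The core modelling idea, decomposing a 3-D volume into 2-D slice sequences along the axial, coronal, and sagittal planes, can be illustrated in a few lines. The toy volume size, linear slice embedding, and single transformer layer below are assumptions for illustration, not the paper's pretrained encoder.

```python
# Multi-view slicing sketch: 3-D volume -> slice tokens from three planes.
import torch
import torch.nn as nn

volume = torch.rand(96, 96, 96)  # (D, H, W) toy brain MRI volume

views = {
    "axial":    volume,                   # slice along depth
    "coronal":  volume.permute(1, 0, 2),  # slice along height
    "sagittal": volume.permute(2, 0, 1),  # slice along width
}

embed = nn.Linear(96 * 96, 128)  # one token per 2-D slice
tokens = torch.cat(
    [embed(v.reshape(96, -1)) for v in views.values()], dim=0
)                                # (3 x 96, 128) slice-token sequence

encoder = nn.TransformerEncoderLayer(d_model=128, nhead=8, batch_first=True)
out = encoder(tokens.unsqueeze(0))
print(out.shape)  # torch.Size([1, 288, 128])
```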
|
47
|
Rundo L, Militello C. Image biomarkers and explainable AI: handcrafted features versus deep learned features. Eur Radiol Exp 2024; 8:130. [PMID: 39560820 PMCID: PMC11576747 DOI: 10.1186/s41747-024-00529-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2024] [Accepted: 10/16/2024] [Indexed: 11/20/2024] Open
Abstract
Feature extraction and selection from medical data are the basis of radiomics and image biomarker discovery for various architectures, including convolutional neural networks (CNNs). We herein describe the typical radiomics steps and the components of a CNN for both deep feature extraction and end-to-end approaches, and discuss the curse of dimensionality along with dimensionality reduction techniques. Despite the outstanding performance of deep learning (DL) approaches, the use of handcrafted features instead of deep learned features needs to be considered for each specific study. Dataset size is a key factor: large-scale datasets with low sample diversity could lead to overfitting, while limited sample sizes can yield unstable models. The dataset must be representative of all the "facets" of the clinical phenomenon/disease investigated. Access to high-performance computational resources, namely graphics processing units, is another key factor, especially for the training phase of deep architectures. The advantages of multi-institutional federated/collaborative learning are described. When large language models are used, high stability is needed to avoid catastrophic forgetting in complex domain-specific tasks. We highlight that non-DL approaches provide model explainability superior to that of DL approaches; making DL models interpretable gives rise to the need for explainable AI, including post hoc mechanisms. RELEVANCE STATEMENT: This work provides the key concepts for processing imaging features to extract reliable and robust image biomarkers. KEY POINTS: The key concepts for processing imaging features to extract reliable and robust image biomarkers are provided. The main differences between radiomics and representation learning approaches are highlighted. The advantages and disadvantages of handcrafted versus learned features are given without losing sight of the clinical purpose of artificial intelligence models.
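To make the handcrafted-versus-learned distinction concrete, the sketch below computes a few interpretable first-order statistics per region of interest alongside penultimate-layer features from an (untrained) CNN backbone, then applies PCA as one dimensionality-reduction option against the curse of dimensionality. The chosen statistics, backbone, and dimensions are illustrative, not a prescribed radiomics pipeline.

```python
# Handcrafted first-order features vs. deep CNN features, plus PCA reduction.
import torch
from torchvision import models

roi = torch.rand(100, 1, 64, 64)  # 100 region-of-interest patches (toy)

# Handcrafted: directly interpretable per-patch statistics.
flat = roi.flatten(1)
handcrafted = torch.stack(
    [flat.mean(1), flat.std(1), flat.min(1).values, flat.max(1).values],
    dim=1,
)                                          # (100, 4)

# Deep learned: high-dimensional, less interpretable activations.
backbone = models.resnet18(weights=None)   # pretrained weights in practice
backbone.fc = torch.nn.Identity()          # keep 512-d penultimate features
with torch.no_grad():
    deep = backbone(roi.repeat(1, 3, 1, 1))  # (100, 512)

# Dimensionality reduction: project onto the top principal components.
centered = deep - deep.mean(0)
_, _, v = torch.pca_lowrank(centered, q=16)
reduced = centered @ v                     # (100, 16)
print(handcrafted.shape, deep.shape, reduced.shape)
```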
Affiliation(s)
- Leonardo Rundo
- Department of Information and Electrical Engineering and Applied Mathematics (DIEM), University of Salerno, Fisciano, Salerno, Italy.
| | - Carmelo Militello
- High Performance Computing and Networking Institute (ICAR-CNR), Italian National Research Council, Palermo, Italy
| |
|
48
|
Liu K, Zhang J. Development of a Cost-Efficient and Glaucoma-Specialized OD/OC Segmentation Model for Varying Clinical Scenarios. SENSORS (BASEL, SWITZERLAND) 2024; 24:7255. [PMID: 39599032 PMCID: PMC11597940 DOI: 10.3390/s24227255] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/01/2024] [Revised: 10/31/2024] [Accepted: 11/11/2024] [Indexed: 11/29/2024]
Abstract
Most existing optic disc (OD) and optic cup (OC) segmentation models are biased toward the dominant size and the easy (normal) class, resulting in suboptimal performance on glaucoma-confirmed samples. Thus, these models are not optimal choices for assisting in tracking glaucoma progression and prognosis. Fully supervised models trained on annotated glaucoma samples can achieve superior performance, but they are restricted by the high cost of collecting and annotating glaucoma samples. Therefore, in this paper, we are dedicated to developing a glaucoma-specialized model by exploiting low-cost annotated normal fundus images, while simultaneously adapting to various common scenarios in clinical practice. We employ a contrastive learning and domain adaptation-based model that exploits shared knowledge from normal samples. To capture glaucoma-related features, we utilize a Gram matrix to encode style information and a domain adaptation strategy to encode domain information, then narrow the style and domain gaps between normal and glaucoma samples by contrastive and adversarial learning, respectively. To validate the efficacy of our proposed model, we conducted experiments on two public datasets mimicking various common scenarios. The results demonstrate the superior performance of our proposed model across multiple scenarios, showcasing its proficiency in both segmentation- and glaucoma-related metrics. In summary, our study makes a concerted effort to target confirmed glaucoma samples, mitigating the inherent bias of most existing models, and proposes an annotation-efficient strategy that exploits low-cost, normal-labeled fundus samples, avoiding the economic and labor burdens of a fully supervised strategy. Our approach also demonstrates adaptability across various scenarios, highlighting its potential utility in assisting both the monitoring of glaucoma progression and the assessment of glaucoma prognosis.
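The Gram-matrix style encoding mentioned above reduces a feature map to its channel-correlation matrix, a standard summary of image "style". The sketch below uses random stand-in features and simply measures the style gap that a contrastive loss would be trained to shrink between the normal and glaucoma domains.

```python
# Gram-matrix style statistics for two domains of feature maps.
import torch

def gram_matrix(features):
    """(B, C, H, W) feature maps -> (B, C, C) channel-correlation matrices."""
    b, c, h, w = features.shape
    f = features.reshape(b, c, h * w)
    return f @ f.transpose(1, 2) / (c * h * w)  # normalized Gram matrix

normal_feat = torch.rand(4, 64, 32, 32)    # from normal fundus images
glaucoma_feat = torch.rand(4, 64, 32, 32)  # from glaucoma samples

g_normal = gram_matrix(normal_feat)
g_glaucoma = gram_matrix(glaucoma_feat)
style_gap = (g_normal.mean(0) - g_glaucoma.mean(0)).pow(2).mean()
print(style_gap)  # the quantity contrastive training would reduce
```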
Affiliation(s)
- Kai Liu
- School of Biological Science and Medical Engineering, Beihang University, Beijing 100083, China;
- Beijing Advanced Innovation Centre for Biomedical Engineering, Beihang University, Beijing 100083, China
- Department of Computer Science, City University of Hong Kong, Hong Kong 98121, China
| | - Jicong Zhang
- School of Biological Science and Medical Engineering, Beihang University, Beijing 100083, China;
- Beijing Advanced Innovation Centre for Biomedical Engineering, Beihang University, Beijing 100083, China
- Hefei Innovation Research Institute, Beihang University, Hefei 230012, China
| |
|
49
|
Gravina M, Maddaluno M, Marrone S, Sansone M, Fusco R, Granata V, Petrillo A, Sansone C. A Physiological-Informed Generative Model for Improving Breast Lesion Classification in Small DCE-MRI Datasets. IEEE J Biomed Health Inform 2024; 28:6764-6777. [PMID: 39141452 DOI: 10.1109/jbhi.2024.3443705] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/16/2024]
Abstract
In biomedical image processing, Deep Learning (DL) is increasingly exploited in various forms and for diverse purposes. Despite unprecedented results, the huge number of parameters to learn, which necessitates a substantial number of annotated samples, remains a significant challenge. In medical domains, obtaining high-quality labelled datasets is still a challenging task. In recent years, several works have leveraged data augmentation to face this issue, mostly thanks to the introduction of generative models able to produce artificial samples having the same characteristics as the acquired ones. However, we claim that biological principles must be considered in this process, as all medical imaging techniques exploit one or more physical laws or properties directly associated with the physiological characteristics of the tissues under analysis. A notable example is Dynamic Contrast Enhanced-Magnetic Resonance Imaging (DCE-MRI), in which the kinetics of the contrast agent (CA) highlight both morphological and physiological aspects. In this paper, we introduce a novel generative approach explicitly relying on Physiologically Based Pharmacokinetic (PBPK) modelling and on an Intrinsic Deforming Autoencoder (DAE) to implement a physiologically-aware data augmentation strategy. As a case study, we consider breast DCE-MRI. In particular, we tested our proposal on two private datasets and one public dataset with different acquisition protocols, demonstrating that the proposed method significantly improves the performance of several DL-based lesion classifiers.
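As an example of the kind of physiological prior the paper builds on, the standard Tofts pharmacokinetic model expresses the tissue contrast-agent concentration as C_t(t) = K_trans * integral_0^t c_p(tau) * exp(-k_ep (t - tau)) dtau, with k_ep = K_trans / v_e. The sketch below evaluates it by discrete convolution with an illustrative biexponential arterial input; the constants and parameter values are toy choices, and this generic PBPK illustration is not the paper's generative model.

```python
# Standard Tofts model sketch: tissue CA concentration from a toy AIF.
import numpy as np

t = np.linspace(0, 6, 361)  # minutes, 1 s sampling
dt = t[1] - t[0]

# Biexponential arterial input function with illustrative constants.
cp = 3.99 * np.exp(-0.144 * t) + 4.78 * np.exp(-0.0111 * t)

def tofts(ktrans, ve, cp, t, dt):
    """C_t(t) = Ktrans * conv(cp, exp(-kep t)), with kep = Ktrans / ve."""
    kep = ktrans / ve
    kernel = np.exp(-kep * t)
    return ktrans * np.convolve(cp, kernel)[: len(t)] * dt

# Different (Ktrans, ve) pairs yield distinct enhancement curves, the kind of
# kinetic signature a physiology-aware generator can condition on.
slow_enhancing = tofts(ktrans=0.08, ve=0.30, cp=cp, t=t, dt=dt)
fast_enhancing = tofts(ktrans=0.35, ve=0.45, cp=cp, t=t, dt=dt)
print(slow_enhancing.max(), fast_enhancing.max())
```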
|
50
|
Hao J, Chen S. Language-aware multiple datasets detection pretraining for DETRs. Neural Netw 2024; 179:106506. [PMID: 38996689 DOI: 10.1016/j.neunet.2024.106506] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2024] [Revised: 05/17/2024] [Accepted: 07/02/2024] [Indexed: 07/14/2024]
Abstract
Pretraining on large-scale datasets can boost the performance of object detectors, but the annotated datasets for object detection are hard to scale up due to the high labor cost. What we possess instead are numerous isolated field-specific datasets; it is therefore appealing to jointly pretrain models across an aggregation of datasets to enhance data volume and diversity. In this paper, we propose a strong framework for utilizing Multiple datasets to pretrain DETR-like detectors, termed METR, without the need for manual label-space integration. It converts the typical multi-class classification in object detection into binary classification by introducing a pre-trained language model. Specifically, we design a category extraction module for extracting the potential categories involved in an image and assign these categories to different queries by language embeddings; each query is then responsible for predicting only a class-specific object. Besides, to suit our novel detection paradigm, we propose a Class-wise Bipartite Matching strategy that limits ground truths to matching only queries assigned to the same category. Extensive experiments demonstrate that METR achieves extraordinary results under either multi-task joint training or the pretrain-and-finetune paradigm. Notably, our pre-trained models have highly flexible transferability and improve the performance of various DETR-like detectors on the COCO val2017 benchmark. Our code is publicly available at: https://github.com/isbrycee/METR.
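The Class-wise Bipartite Matching strategy can be illustrated with Hungarian assignment in which cross-category query-to-ground-truth pairs receive a prohibitive cost, so each ground truth can only be matched to a query assigned its own category. The random costs below stand in for the usual classification-plus-box matching costs.

```python
# Toy class-wise bipartite matching via the Hungarian algorithm.
import numpy as np
from scipy.optimize import linear_sum_assignment

rng = np.random.default_rng(0)
query_cls = np.array([0, 0, 1, 1, 2])  # category assigned to each query
gt_cls = np.array([0, 1, 2])           # categories of ground-truth objects

cost = rng.random((len(query_cls), len(gt_cls)))   # stand-in matching costs
BIG = 1e6
cost[query_cls[:, None] != gt_cls[None, :]] = BIG  # forbid cross-category

rows, cols = linear_sum_assignment(cost)
matches = [(q, g) for q, g in zip(rows, cols) if cost[q, g] < BIG]
print(matches)  # each ground truth pairs with a same-category query
```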
Affiliation(s)
- Jing Hao
- VIS, Baidu Inc., Beijing, 100000, China.
| | - Song Chen
- VIS, Baidu Inc., Beijing, 100000, China.
| |
|