1
|
Mehrnia SS, Safahi Z, Mousavi A, Panahandeh F, Farmani A, Yuan R, Rahmim A, Salmanpour MR. Landscape of 2D Deep Learning Segmentation Networks Applied to CT Scan from Lung Cancer Patients: A Systematic Review. JOURNAL OF IMAGING INFORMATICS IN MEDICINE 2025:10.1007/s10278-025-01458-x. [PMID: 40038137 DOI: 10.1007/s10278-025-01458-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/07/2024] [Revised: 02/06/2025] [Accepted: 02/16/2025] [Indexed: 03/06/2025]
Abstract
BACKGROUND The increasing rates of lung cancer emphasize the need for early detection through computed tomography (CT) scans, enhanced by deep learning (DL) to improve diagnosis, treatment, and patient survival. This review examines current and prospective applications of 2D- DL networks in lung cancer CT segmentation, summarizing research, highlighting essential concepts and gaps; Methods: Following the Preferred Reporting Items for Systematic Reviews and Meta-Analysis guidelines, a systematic search of peer-reviewed studies from 01/2020 to 12/2024 on data-driven population segmentation using structured data was conducted across databases like Google Scholar, PubMed, Science Direct, IEEE (Institute of Electrical and Electronics Engineers) and ACM (Association for Computing Machinery) library. 124 studies met the inclusion criteria and were analyzed. RESULTS The LIDC-LIDR dataset was the most frequently used; The finding particularly relies on supervised learning with labeled data. The UNet model and its variants were the most frequently used models in medical image segmentation, achieving Dice Similarity Coefficients (DSC) of up to 0.9999. The reviewed studies primarily exhibit significant gaps in addressing class imbalances (67%), underuse of cross-validation (21%), and poor model stability evaluations (3%). Additionally, 88% failed to address the missing data, and generalizability concerns were only discussed in 34% of cases. CONCLUSIONS The review emphasizes the importance of Convolutional Neural Networks, particularly UNet, in lung CT analysis and advocates for a combined 2D/3D modeling approach. It also highlights the need for larger, diverse datasets and the exploration of semi-supervised and unsupervised learning to enhance automated lung cancer diagnosis and early detection.
Collapse
Affiliation(s)
- Somayeh Sadat Mehrnia
- Department of Integrative Oncology, Breast Cancer Research Center, Motamed Cancer Institute, ACECR, Tehran, Iran
- Technological Virtual Collaboration (TECVICO Corp.), Vancouver, BC, Canada
| | - Zhino Safahi
- Technological Virtual Collaboration (TECVICO Corp.), Vancouver, BC, Canada
- Department of Computer Engineering, University of Kurdistan, Sanandaj, Iran
| | - Amin Mousavi
- Technological Virtual Collaboration (TECVICO Corp.), Vancouver, BC, Canada
| | - Fatemeh Panahandeh
- Technological Virtual Collaboration (TECVICO Corp.), Vancouver, BC, Canada
| | - Arezoo Farmani
- Technological Virtual Collaboration (TECVICO Corp.), Vancouver, BC, Canada
| | - Ren Yuan
- Department of Radiology, University of British Columbia, Vancouver, BC, Canada
- BC Cancer, Vancouver Center, Vancouver, BC, Canada
| | - Arman Rahmim
- Department of Radiology, University of British Columbia, Vancouver, BC, Canada
- Department of Integrative Oncology, BC Cancer Research Institute, Vancouver, BC, V5Z 1L3, Canada
- Department of Physics & Astronomy, University of British Columbia, Vancouver, BC, Canada
| | - Mohammad R Salmanpour
- Technological Virtual Collaboration (TECVICO Corp.), Vancouver, BC, Canada.
- Department of Radiology, University of British Columbia, Vancouver, BC, Canada.
- Department of Integrative Oncology, BC Cancer Research Institute, Vancouver, BC, V5Z 1L3, Canada.
| |
Collapse
|
2
|
Qiong L, Chaofan L, Jinnan T, Liping C, Jianxiang S. Medical image segmentation based on frequency domain decomposition SVD linear attention. Sci Rep 2025; 15:2833. [PMID: 39843905 PMCID: PMC11754837 DOI: 10.1038/s41598-025-86315-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2024] [Accepted: 01/09/2025] [Indexed: 01/24/2025] Open
Abstract
Convolutional Neural Networks (CNNs) have achieved remarkable segmentation accuracy in medical image segmentation tasks. However, the Vision Transformer (ViT) model, with its capability of extracting global information, offers a significant advantage in contextual information compared to the limited receptive field of convolutional kernels in CNNs. Despite this, ViT models struggle to fully detect and extract high-frequency signals, such as textures and boundaries, in medical images. These high-frequency features are essential in medical imaging, as targets like tumors and pathological organs exhibit significant differences in texture and boundaries across different stages. Additionally, the high resolution of medical images leads to computational complexity in the self-attention mechanism of ViTs. To address these limitations, we propose a medical image segmentation network framework based on frequency domain decomposition using a Laplacian pyramid. This approach selectively computes attention features for high-frequency signals in the original image to enhance spatial structural information effectively. During attention feature computation, we introduce Singular Value Decomposition (SVD) to extract an effective representation matrix from the original image, which is then applied in the attention computation process for linear projection. This method reduces computational complexity while preserving essential features. We demonstrated the segmentation validity and superiority of our model on the Abdominal Multi-Organ Segmentation dataset and the Dermatological Disease dataset, and on the Synapse dataset our model achieved a score of 82.68 on the Dice metrics and 17.23 mm on the HD metrics. Experimental results indicate that our model consistently exhibits segmentation effectiveness and improved accuracy across various datasets.
Collapse
Affiliation(s)
- Liu Qiong
- School of Medical Imaging, Jiangsu Medical College, Yancheng, 224005, Jiangsu, China.
| | - Li Chaofan
- Affiliated Hospital 6 of Nantong University, Yancheng Third People's Hospital, Yancheng, 224001, Jiangsu, China
| | - Teng Jinnan
- Affiliated Hospital 6 of Nantong University, Yancheng Third People's Hospital, Yancheng, 224001, Jiangsu, China
| | - Chen Liping
- Affiliated Hospital 6 of Nantong University, Yancheng Third People's Hospital, Yancheng, 224001, Jiangsu, China
| | - Song Jianxiang
- Affiliated Hospital 6 of Nantong University, Yancheng Third People's Hospital, Yancheng, 224001, Jiangsu, China.
| |
Collapse
|
3
|
Kang HY, Zhang W, Li S, Wang X, Sun Y, Sun X, Li FX, Hou C, Lam SK, Zheng YP. A comprehensive benchmarking of a U-Net based model for midbrain auto-segmentation on transcranial sonography. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2024; 258:108494. [PMID: 39536407 DOI: 10.1016/j.cmpb.2024.108494] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/19/2023] [Revised: 10/30/2024] [Accepted: 11/05/2024] [Indexed: 11/16/2024]
Abstract
BACKGROUND AND OBJECTIVE Transcranial sonography-based grading of Parkinson's Disease has gained increasing attention in recent years, and it is currently used for assistive differential diagnosis in some specialized centers. To this end, accurate midbrain segmentation is considered an important initial step. However, current practice is manual, time-consuming, and bias-prone due to the subjective nature. Relevant studies in the literature are scarce and lacks comprehensive model evaluations from application perspectives. Herein, we aimed to benchmark the best-performing U-Net model for objective, stable and robust midbrain auto-segmentation using transcranial sonography images. METHODS A total of 584 patients who were suspected of Parkinson's Disease were retrospectively enrolled from Beijing Tiantan Hospital. The dataset was divided into training (n = 416), validation (n = 104), and testing (n = 64) sets. Three state-of-the-art deep-learning networks (U-Net, U-Net+++, and nnU-Net) were utilized to develop segmentation models, under 5-fold cross-validation and three randomization seeds for safeguarding model validity and stability. Model evaluation was conducted in testing set in three key aspects: (i) segmentation agreement using DICE coefficients (DICE), Intersection over Union (IoU), and Hausdorff Distance (HD); (ii) model stability using standard deviations of segmentation agreement metrics; (iii) prediction time efficiency, and (iv) model robustness against various degrees of ultrasound imaging noise produced by the salt-and-pepper noise and Gaussian noise. RESULTS The nnU-Net achieved the best segmentation agreement (averaged DICE: 0.910, IoU: 0.836, HD: 2.793-mm) and time efficiency (1.456-s). Under mild noise corruption, the nnU-Net outperformed others with averaged scores of DICE (0.904), IoU (0.827), HD (2.941 mm) in the salt-and-pepper noise (signal-to-noise ratio, SNR = 0.95), and DICE (0.906), IoU (0.830), HD (2.967 mm) in the Gaussian noise (sigma value, σ = 0.1); by contrast, intriguingly, performance of the U-Net and U-Net+++ models were remarkably degraded. Under increasing levels of simulated noise corruption (SNR decreased from 0.95 to 0.75; σ increased from 0.1 to 0.5), the nnU-Net network exhibited marginal decline in segmentation agreement meanwhile yielding decent performance as if there were absence of noise corruption. CONCLUSIONS The nnU-Net model was the best-performing midbrain segmentation model in terms of segmentation agreement, stability, time efficiency and robustness, providing the community with an objective, effective and automated alternative. Moving forward, a multi-center multi-vendor study is warranted when it comes to clinical implementation.
Collapse
Affiliation(s)
- Hong-Yu Kang
- Department of Biomedical Engineering, The Hong Kong Polytechnic University, Hong Kong Special Administrative Region of China
| | - Wei Zhang
- Department of Ultrasound, Beijing Tiantan Hospital, Capital Medical University, NO.119, South 4th Ring West Road, Fengtai District, Beijing 100070, China.
| | - Shuai Li
- Department of Biomedical Engineering, The Hong Kong Polytechnic University, Hong Kong Special Administrative Region of China
| | - Xinyi Wang
- Department of Biomedical Engineering, The Hong Kong Polytechnic University, Hong Kong Special Administrative Region of China
| | - Yu Sun
- Department of Ultrasound, Beijing Tiantan Hospital, Capital Medical University, NO.119, South 4th Ring West Road, Fengtai District, Beijing 100070, China
| | - Xin Sun
- Department of Ultrasound, Beijing Tiantan Hospital, Capital Medical University, NO.119, South 4th Ring West Road, Fengtai District, Beijing 100070, China
| | - Fang-Xian Li
- Department of Ultrasound, Beijing Tiantan Hospital, Capital Medical University, NO.119, South 4th Ring West Road, Fengtai District, Beijing 100070, China
| | - Chao Hou
- Department of Ultrasound, Beijing Tiantan Hospital, Capital Medical University, NO.119, South 4th Ring West Road, Fengtai District, Beijing 100070, China
| | - Sai-Kit Lam
- Department of Biomedical Engineering, The Hong Kong Polytechnic University, Hong Kong Special Administrative Region of China; Research Institute of Smart Ageing, The Hong Kong Polytechnic University, Hong Kong Special Administrative Region of China
| | - Yong-Ping Zheng
- Department of Biomedical Engineering, The Hong Kong Polytechnic University, Hong Kong Special Administrative Region of China; Research Institute of Smart Ageing, The Hong Kong Polytechnic University, Hong Kong Special Administrative Region of China.
| |
Collapse
|
4
|
Xu Y, Quan R, Xu W, Huang Y, Chen X, Liu F. Advances in Medical Image Segmentation: A Comprehensive Review of Traditional, Deep Learning and Hybrid Approaches. Bioengineering (Basel) 2024; 11:1034. [PMID: 39451409 PMCID: PMC11505408 DOI: 10.3390/bioengineering11101034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2024] [Revised: 10/08/2024] [Accepted: 10/11/2024] [Indexed: 10/26/2024] Open
Abstract
Medical image segmentation plays a critical role in accurate diagnosis and treatment planning, enabling precise analysis across a wide range of clinical tasks. This review begins by offering a comprehensive overview of traditional segmentation techniques, including thresholding, edge-based methods, region-based approaches, clustering, and graph-based segmentation. While these methods are computationally efficient and interpretable, they often face significant challenges when applied to complex, noisy, or variable medical images. The central focus of this review is the transformative impact of deep learning on medical image segmentation. We delve into prominent deep learning architectures such as Convolutional Neural Networks (CNNs), Fully Convolutional Networks (FCNs), U-Net, Recurrent Neural Networks (RNNs), Adversarial Networks (GANs), and Autoencoders (AEs). Each architecture is analyzed in terms of its structural foundation and specific application to medical image segmentation, illustrating how these models have enhanced segmentation accuracy across various clinical contexts. Finally, the review examines the integration of deep learning with traditional segmentation methods, addressing the limitations of both approaches. These hybrid strategies offer improved segmentation performance, particularly in challenging scenarios involving weak edges, noise, or inconsistent intensities. By synthesizing recent advancements, this review provides a detailed resource for researchers and practitioners, offering valuable insights into the current landscape and future directions of medical image segmentation.
Collapse
Affiliation(s)
- Yan Xu
- School of Electrical, Electronic and Mechanical Engineering, University of Bristol, Bristol BS8 1QU, UK; (Y.X.); (R.Q.); (W.X.)
| | - Rixiang Quan
- School of Electrical, Electronic and Mechanical Engineering, University of Bristol, Bristol BS8 1QU, UK; (Y.X.); (R.Q.); (W.X.)
| | - Weiting Xu
- School of Electrical, Electronic and Mechanical Engineering, University of Bristol, Bristol BS8 1QU, UK; (Y.X.); (R.Q.); (W.X.)
| | - Yi Huang
- Bristol Medical School, University of Bristol, Bristol BS8 1UD, UK;
| | - Xiaolong Chen
- Department of Mechanical, Materials and Manufacturing Engineering, University of Nottingham, Nottingham NG7 2RD, UK;
| | - Fengyuan Liu
- School of Electrical, Electronic and Mechanical Engineering, University of Bristol, Bristol BS8 1QU, UK; (Y.X.); (R.Q.); (W.X.)
| |
Collapse
|
5
|
Huang T, Yin H, Huang X. Improved genetic algorithm for multi-threshold optimization in digital pathology image segmentation. Sci Rep 2024; 14:22454. [PMID: 39341998 PMCID: PMC11439074 DOI: 10.1038/s41598-024-73335-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2024] [Accepted: 09/16/2024] [Indexed: 10/01/2024] Open
Abstract
This paper presents an improved genetic algorithm focused on multi-threshold optimization for image segmentation in digital pathology. By innovatively enhancing the selection mechanism and crossover operation, the limitations of traditional genetic algorithms are effectively addressed, significantly improving both segmentation accuracy and computational efficiency. Experimental results demonstrate that the improved genetic algorithm achieves the best balance between precision and recall within the threshold range of 0.02 to 0.05, and it significantly outperforms traditional methods in terms of segmentation performance. Segmentation quality is quantified using metrics such as precision, recall, and F1 score, and statistical tests confirm the superior performance of the algorithm, especially in its global search capabilities for complex optimization problems. Although the algorithm's computation time is relatively long, its notable advantages in segmentation quality, particularly in handling high-precision segmentation tasks for complex images, are highly pronounced. The experiments also show that the algorithm exhibits strong robustness and stability, maintaining reliable performance under different initial conditions. Compared to general segmentation models, this algorithm demonstrates significant advantages in specialized tasks, such as pathology image segmentation, especially in resource-constrained environments. Therefore, this improved genetic algorithm offers an efficient and precise multi-threshold optimization solution for image segmentation, providing valuable reference for practical applications.
Collapse
Affiliation(s)
- Tangsen Huang
- School of Communication Engineering, Hangzhou Dianzi University, Hangzhou, 310018, China.
- School of Information Engineering, Hunan University of Science and Engineering, Yongzhou, 425199, China.
- Lishui Institute of Hangzhou Dianzi University, Lishui, 323000, China.
| | - Haibing Yin
- School of Communication Engineering, Hangzhou Dianzi University, Hangzhou, 310018, China
| | - Xingru Huang
- School of Communication Engineering, Hangzhou Dianzi University, Hangzhou, 310018, China
- Lishui Institute of Hangzhou Dianzi University, Lishui, 323000, China
| |
Collapse
|
6
|
Siami M, Barszcz T, Wodecki J, Zimroz R. Semantic segmentation of thermal defects in belt conveyor idlers using thermal image augmentation and U-Net-based convolutional neural networks. Sci Rep 2024; 14:5748. [PMID: 38459162 PMCID: PMC10923815 DOI: 10.1038/s41598-024-55864-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2023] [Accepted: 02/28/2024] [Indexed: 03/10/2024] Open
Abstract
The belt conveyor (BC) is the main means of horizontal transportation of bulk materials at mining sites. The sudden fault in BC modules may cause unexpected stops in production lines. With the increasing number of applications of inspection mobile robots in condition monitoring (CM) of industrial infrastructure in hazardous environments, in this article we introduce an image processing pipeline for automatic segmentation of thermal defects in thermal images captured from BC idlers using a mobile robot. This study follows the fact that CM of idler temperature is an important task for preventing sudden breakdowns in BC system networks. We compared the performance of three different types of U-Net-based convolutional neural network architectures for the identification of thermal anomalies using a small number of hand-labeled thermal images. Experiments on the test data set showed that the attention residual U-Net with binary cross entropy as the loss function handled the semantic segmentation problem better than our previous research and other studied U-Net variations.
Collapse
Affiliation(s)
- Mohammad Siami
- AMC Vibro Sp. z o.o., Pilotow 2e, 31-462, Kraków, Poland.
| | - Tomasz Barszcz
- Faculty of Mechanical Engineering and Robotics, AGH University, Al. Mickiewicza 30, 30-059, Kraków, Poland
| | - Jacek Wodecki
- Faculty of Geoengineering, Mining and Geology, Wroclaw University of Science and Technology, Na Grobli 15, 50-421, Wroclaw, Poland
| | - Radoslaw Zimroz
- Faculty of Geoengineering, Mining and Geology, Wroclaw University of Science and Technology, Na Grobli 15, 50-421, Wroclaw, Poland
| |
Collapse
|
7
|
Hu F, Chen Z, Wu F. A novel difficult-to-segment samples focusing network for oral CBCT image segmentation. Sci Rep 2024; 14:5068. [PMID: 38429362 PMCID: PMC10907706 DOI: 10.1038/s41598-024-55522-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Accepted: 02/24/2024] [Indexed: 03/03/2024] Open
Abstract
Using deep learning technology to segment oral CBCT images for clinical diagnosis and treatment is one of the important research directions in the field of clinical dentistry. However, the blurred contour and the scale difference limit the segmentation accuracy of the crown edge and the root part of the current methods, making these regions become difficult-to-segment samples in the oral CBCT segmentation task. Aiming at the above problems, this work proposed a Difficult-to-Segment Focus Network (DSFNet) for segmenting oral CBCT images. The network utilizes a Feature Capturing Module (FCM) to efficiently capture local and long-range features, enhancing the feature extraction performance. Additionally, a Multi-Scale Feature Fusion Module (MFFM) is employed to merge multiscale feature information. To further improve the loss ratio for difficult-to-segment samples, a hybrid loss function is proposed, combining Focal Loss and Dice Loss. By utilizing the hybrid loss function, DSFNet achieves 91.85% Dice Similarity Coefficient (DSC) and 0.216 mm Average Symmetric Surface Distance (ASSD) performance in oral CBCT segmentation tasks. Experimental results show that the proposed method is superior to current dental CBCT image segmentation techniques and has real-world applicability.
Collapse
Affiliation(s)
- Fengjun Hu
- College of Information Science and Technology, Zhejiang Shuren University, Hangzhou, 310015, China
- Zhejiang-Netherlands Joint Laboratory for Digital Diagnosis and Treatment of Oral Diseases, Zhejiang Shuren University, Hangzhou, 310015, China
| | - Zeyu Chen
- Zhejiang-Netherlands Joint Laboratory for Digital Diagnosis and Treatment of Oral Diseases, Zhejiang Shuren University, Hangzhou, 310015, China
| | - Fan Wu
- College of Information Science and Technology, Zhejiang Shuren University, Hangzhou, 310015, China.
- Zhejiang-Netherlands Joint Laboratory for Digital Diagnosis and Treatment of Oral Diseases, Zhejiang Shuren University, Hangzhou, 310015, China.
| |
Collapse
|
8
|
Montgomery MK, Duan C, Manzuk L, Chang S, Cubias A, Brun S, Giddabasappa A, Jiang ZK. Applying deep learning to segmentation of murine lung tumors in pre-clinical micro-computed tomography. Transl Oncol 2024; 40:101833. [PMID: 38128467 PMCID: PMC10776660 DOI: 10.1016/j.tranon.2023.101833] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Revised: 11/01/2023] [Accepted: 11/14/2023] [Indexed: 12/23/2023] Open
Abstract
Lung cancer remains a leading cause of cancer-related death, but scientists have made great strides in developing new treatments recently, partly owing to the use of genetically engineered mouse models (GEMMs). GEMM tumors represent a translational model that recapitulates human disease better than implanted models because tumors develop spontaneously in the lungs. However, detection of these tumors relies on in vivo imaging tools, specifically micro-Computed Tomography (micro-CT or µCT), and image analysis can be laborious with high inter-user variability. Here we present a deep learning model trained to perform fully automated segmentation of lung tumors without the interference of other soft tissues. Trained and tested on 100 3D µCT images (18,662 slices) that were manually segmented, the model demonstrated a high correlation to manual segmentations on the testing data (r2=0.99, DSC=0.78) and on an independent dataset (n = 12 3D scans or 2328 2D slices, r2=0.97, DSC=0.73). In a comparison against manual segmentation performed by multiple analysts, the model (r2=0.98, DSC=0.78) performed within inter-reader variability (r2=0.79, DSC=0.69) and close to intra-reader variability (r2=0.99, DSC=0.82), all while completing 5+ hours of manual segmentations in 1 minute. Finally, when applied to a real-world longitudinal study (n = 55 mice), the model successfully detected tumor progression over time and the differences in tumor burden between groups induced with different virus titers, aligning well with more traditional analysis methods. In conclusion, we have developed a deep learning model which can perform fast, accurate, and fully automated segmentation of µCT scans of murine lung tumors.
Collapse
Affiliation(s)
| | - Chong Duan
- Early Clinical Development, Pfizer Inc., 1 Portland Street, Cambridge, MA 02139, United States
| | - Lisa Manzuk
- Comparative Medicine, Pfizer Inc., 10646 Science Center Drive, La Jolla, CA 92121, United States
| | - Stephanie Chang
- Comparative Medicine, Pfizer Inc., 10646 Science Center Drive, La Jolla, CA 92121, United States
| | - Aiyana Cubias
- Early Clinical Development, Pfizer Inc., 1 Portland Street, Cambridge, MA 02139, United States
| | - Sonja Brun
- Oncology Research and Development, Pfizer Inc., 10646 Science Center Drive, La Jolla, CA 92121, United States
| | - Anand Giddabasappa
- Comparative Medicine, Pfizer Inc., 10646 Science Center Drive, La Jolla, CA 92121, United States
| | - Ziyue Karen Jiang
- Comparative Medicine, Pfizer Inc., 10646 Science Center Drive, La Jolla, CA 92121, United States.
| |
Collapse
|
9
|
Doppala BP, Al Bataineh A, Vamsi B. An Efficient, Lightweight, Tiny 2D-CNN Ensemble Model to Detect Cardiomegaly in Heart CT Images. J Pers Med 2023; 13:1338. [PMID: 37763106 PMCID: PMC10532522 DOI: 10.3390/jpm13091338] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Revised: 08/26/2023] [Accepted: 08/28/2023] [Indexed: 09/29/2023] Open
Abstract
Cardiomegaly is a significant global health concern, especially in developing nations. Although advanced clinical care is available for newly diagnosed patients, many in resource-limited regions face late diagnoses and consequent increased mortality. This challenge is accentuated by a scarcity of radiography equipment and radiologists. Hence, we propose the development of a computer-aided diagnostic (CAD) system, specifically a lightweight, tiny 2D-CNN ensemble model, to facilitate early detection and, potentially, reduce mortality rates. Deep learning, with its subset of convolutional neural networks (CNN), has shown potential in visual applications, especially in medical image diagnosis. However, traditional deep CNNs often face compatibility issues with object-oriented human factor technology. Our proposed model aims to bridge this gap. Using CT scan images sourced from the Mendeley data center, our tiny 2D-CNN ensemble learning model achieved an accuracy of 96.32%, offering a promising tool for efficient and accurate cardiomegaly detection.
Collapse
Affiliation(s)
| | - Ali Al Bataineh
- Artificial Intelligence Center, Norwich University, Northfield, VT 05663, USA
| | - Bandi Vamsi
- Department of Computer Science—Artificial Intelligence & Data Science, Madanapalle Institute of Technology & Science, Madanapalle 517325, India;
| |
Collapse
|
10
|
Dhalla S, Mittal A, Gupta S, Kaur J, Harshit, Kaur H. A combination of simple and dilated convolution with attention mechanism in a feature pyramid network to segment leukocytes from blood smear images. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2022.104344] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
11
|
Modak S, Abdel-Raheem E, Rueda L. Applications of Deep Learning in Disease Diagnosis of Chest Radiographs: A Survey on Materials and Methods. BIOMEDICAL ENGINEERING ADVANCES 2023. [DOI: 10.1016/j.bea.2023.100076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
|
12
|
Jeon HJ, Lim HG, Shung KK, Lee OJ, Kim MG. Automated cell-type classification combining dilated convolutional neural networks with label-free acoustic sensing. Sci Rep 2022; 12:19873. [PMID: 36400803 PMCID: PMC9674693 DOI: 10.1038/s41598-022-22075-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2022] [Accepted: 10/10/2022] [Indexed: 11/19/2022] Open
Abstract
This study aimed to automatically classify live cells based on their cell type by analyzing the patterns of backscattered signals of cells with minimal effect on normal cell physiology and activity. Our previous studies have demonstrated that label-free acoustic sensing using high-frequency ultrasound at a high pulse repetition frequency (PRF) can capture and analyze a single object from a heterogeneous sample. However, eliminating possible errors in the manual setting and time-consuming processes when postprocessing integrated backscattering (IB) coefficients of backscattered signals is crucial. In this study, an automated cell-type classification system that combines a label-free acoustic sensing technique with deep learning-empowered artificial intelligence models is proposed. We applied an one-dimensional (1D) convolutional autoencoder to denoise the signals and conducted data augmentation based on Gaussian noise injection to enhance the robustness of the proposed classification system to noise. Subsequently, denoised backscattered signals were classified into specific cell types using convolutional neural network (CNN) models for three types of signal data representations, including 1D CNN models for waveform and frequency spectrum analysis and two-dimensional (2D) CNN models for spectrogram analysis. We evaluated the proposed system by classifying two types of cells (e.g., RBC and PNT1A) and two types of polystyrene microspheres by analyzing their backscattered signal patterns. We attempted to discover cell physical properties reflected on backscattered signals by controlling experimental variables, such as diameter and structure material. We further evaluated the effectiveness of the neural network models and efficacy of data representations by comparing their accuracy with that of baseline methods. Therefore, the proposed system can be used to classify reliably and precisely several cell types with different intrinsic physical properties for personalized cancer medicine development.
Collapse
Affiliation(s)
- Hyeon-Ju Jeon
- grid.482520.90000 0004 0578 4668Data Assimilation Group, Korea Institute of Atmospheric Prediction Systems, Seoul, 07071 Republic of Korea
| | - Hae Gyun Lim
- grid.412576.30000 0001 0719 8994Department of Biomedical Engineering, Pukyong National University, Busan, 48513 Republic of Korea
| | - K. Kirk Shung
- grid.42505.360000 0001 2156 6853Department of Biomedical Engineering, University of Southern California, Los Angeles, CA 90089 USA
| | - O-Joun Lee
- grid.411947.e0000 0004 0470 4224Department of Artificial Intelligence, The Catholic University of Korea, Bucheon, 14662 Republic of Korea
| | - Min Gon Kim
- grid.42505.360000 0001 2156 6853Department of Biomedical Engineering, University of Southern California, Los Angeles, CA 90089 USA
| |
Collapse
|
13
|
Li Z, Yang L, Shu L, Yu Z, Huang J, Li J, Chen L, Hu S, Shu T, Yu G. Research on CT Lung Segmentation Method of Preschool Children based on Traditional Image Processing and ResUnet. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2022; 2022:7321330. [PMID: 36262868 PMCID: PMC9576440 DOI: 10.1155/2022/7321330] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/11/2022] [Revised: 09/13/2022] [Accepted: 09/21/2022] [Indexed: 11/22/2022]
Abstract
Lung segmentation using computed tomography (CT) images is important for diagnosing various lung diseases. Currently, no lung segmentation method has been developed for assessing the CT images of preschool children, which may differ from those of adults due to (1) presence of artifacts caused by the shaking of children, (2) loss of a localized lung area due to a failure to hold their breath, and (3) a smaller CT chest area, compared with adults. To solve these unique problems, this study developed an automatic lung segmentation method by combining traditional imaging methods with ResUnet using the CT images of 60 children, aged 0-6 years. First, the CT images were cropped and zoomed through ecological operations to concentrate the segmentation task on the chest area. Then, a ResUnet model was used to improve the loss for lung segmentation, and case-based connected domain operations were performed to filter the segmentation results and improve segmentation accuracy. The proposed method demonstrated promising segmentation results on a test set of 12 cases, with average accuracy, Dice, precision, and recall of 0.9479, 0.9678, 0.9711, and 0.9715, respectively, which achieved the best performance relative to the other six models. This study shows that the proposed method can achieve good segmentation results in CT of preschool children, laying a good foundation for the diagnosis of children's lung diseases.
Collapse
Affiliation(s)
- Zheming Li
- Department of Data and Information, The Children's Hospital Zhejiang University School of Medicine, Hangzhou 310052, China
- Sino-Finland Joint AI Laboratory for Child Health of Zhejiang Province, Hangzhou 310052, China
- National Clinical Research Center for Child Health, Hangzhou 310052, China
- Polytechnic Institute, Zhejiang University, 866 Yuhangtang Rd, Hangzhou 310058, China
| | - Li Yang
- National Clinical Research Center for Child Health, Hangzhou 310052, China
- Department of Radiology, Children's Hospital, Zhejiang University School of Medicine, Hangzhou 310052, China
| | - Liqi Shu
- Department of Neurology, The Warren Alpert Medical School of Brown University, USA
| | - Zhuo Yu
- Huiying Medical Technology (Beijing), Beijing 100192, China
| | - Jian Huang
- Department of Data and Information, The Children's Hospital Zhejiang University School of Medicine, Hangzhou 310052, China
- Sino-Finland Joint AI Laboratory for Child Health of Zhejiang Province, Hangzhou 310052, China
- National Clinical Research Center for Child Health, Hangzhou 310052, China
| | - Jing Li
- Department of Data and Information, The Children's Hospital Zhejiang University School of Medicine, Hangzhou 310052, China
- Sino-Finland Joint AI Laboratory for Child Health of Zhejiang Province, Hangzhou 310052, China
- National Clinical Research Center for Child Health, Hangzhou 310052, China
| | - Lingdong Chen
- Department of Data and Information, The Children's Hospital Zhejiang University School of Medicine, Hangzhou 310052, China
- Sino-Finland Joint AI Laboratory for Child Health of Zhejiang Province, Hangzhou 310052, China
- National Clinical Research Center for Child Health, Hangzhou 310052, China
| | - Shasha Hu
- The Children's Hospital Zhejiang University School of Medicine, Hangzhou 310052, China
| | - Ting Shu
- National Institute of Hospital Administration, NHC, Beijing 100044, China
| | - Gang Yu
- Department of Data and Information, The Children's Hospital Zhejiang University School of Medicine, Hangzhou 310052, China
- Sino-Finland Joint AI Laboratory for Child Health of Zhejiang Province, Hangzhou 310052, China
- National Clinical Research Center for Child Health, Hangzhou 310052, China
- Polytechnic Institute, Zhejiang University, 866 Yuhangtang Rd, Hangzhou 310058, China
| |
Collapse
|
14
|
Bhattacharyya D, Thirupathi Rao N, Joshua ESN, Hu YC. A bi-directional deep learning architecture for lung nodule semantic segmentation. THE VISUAL COMPUTER 2022; 39:1-17. [PMID: 36097497 PMCID: PMC9453728 DOI: 10.1007/s00371-022-02657-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Accepted: 08/20/2022] [Indexed: 06/15/2023]
Abstract
Lung nodules are abnormal growths and lesions may exist. Both lungs may have nodules. Most lung nodules are harmless (not cancerous/malignant). Pulmonary nodules are rare in lung cancer. X-rays and CT scans identify the lung nodules. Doctors may term the growth a lung spot, coin lesion, or shadow. It is necessary to obtain properly computed tomography (CT) scans of the lungs to get an accurate diagnosis and a good estimate of the severity of lung cancer. This study aims to design and evaluate a deep learning (DL) algorithm for identifying pulmonary nodules (PNs) using the LUNA-16 dataset and examine the prevalence of PNs using DB-Net. The paper states that a new, resource-efficient deep learning architecture is called for, and it has been given the name of DB-NET. When a physician orders a CT scan, they need to employ an accurate and efficient lung nodule segmentation method because they need to detect lung cancer at an early stage. However, segmentation of lung nodules is a difficult task because of the nodules' characteristics on the CT image as well as the nodules' concealed shape, visual quality, and context. The DB-NET model architecture is presented as a resource-efficient deep learning solution for handling the challenge at hand in this paper. Furthermore, it incorporates the Mish nonlinearity function and the mask class weights to improve segmentation effectiveness. In addition to the LUNA-16 dataset, which contained 1200 lung nodules collected during the LUNA-16 test, the LUNA-16 dataset was extensively used to train and assess the proposed model. The DB-NET architecture surpasses the existing U-NET model by a dice coefficient index of 88.89%, and it also achieves a similar level of accuracy to that of human experts.
Collapse
Affiliation(s)
- Debnath Bhattacharyya
- Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation, Guntur, 522 502 India
| | - N. Thirupathi Rao
- Department of Computer Science and Engineering, Vignan’s Institute of Information Technology (A), Visakhapatnam, 530049 AP India
| | - Eali Stephen Neal Joshua
- Department of Computer Science and Engineering, Vignan’s Institute of Information Technology (A), Visakhapatnam, 530049 AP India
| | - Yu-Chen Hu
- Department of Computer Science and Information Management, Providence University, 200, Sec. 7, Taiwan Boulevard, Shalu Dist., Taichung City, 43301 Taiwan R.O.C
| |
Collapse
|
15
|
Shu X, Gu Y, Zhang X, Hu C, Cheng K. FCRB U-Net: A novel fully connected residual block U-Net for fetal cerebellum ultrasound image segmentation. Comput Biol Med 2022; 148:105693. [PMID: 35717404 DOI: 10.1016/j.compbiomed.2022.105693] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2022] [Revised: 05/15/2022] [Accepted: 05/31/2022] [Indexed: 11/29/2022]
Abstract
In this paper, we propose a novel U-Net with fully connected residual blocks (FCRB U-Net) for the fetal cerebellum Ultrasound image segmentation task. FCRB U-Net, an improved convolutional neural network (CNN) based on U-Net, replaces the double convolution operation in the original model with the fully connected residual block and embeds an effective channel attention module to enhance the extraction of valid features. Moreover, in the decoding stage, a feature reuse module is employed to form a fully connected decoder to make full use of deep features. FCRB U-Net can effectively alleviate the problem of the loss of feature information during the convolution process and improve segmentation accuracy. Experimental results demonstrate that the proposed approach is effective and promising in the field of fetal cerebellar segmentation in actual Ultrasound images. The average IoU value and mean Dice index reach 86.72% and 90.45%, respectively, which are 3.07% and 5.25% higher than that of the basic U-Net.
Collapse
Affiliation(s)
- Xin Shu
- School of Computer Science, Jiangsu University of Science and Technology, Zhenjiang, 212100, China.
| | - Yingyan Gu
- School of Computer Science, Jiangsu University of Science and Technology, Zhenjiang, 212100, China
| | - Xin Zhang
- Department of Medical Ultrasound, Affiliated Hospital of Jiangsu University, Zhenjiang, 212003, China.
| | - Chunlong Hu
- School of Computer Science, Jiangsu University of Science and Technology, Zhenjiang, 212100, China
| | - Ke Cheng
- School of Computer Science, Jiangsu University of Science and Technology, Zhenjiang, 212100, China
| |
Collapse
|
16
|
Semantic Segmentation and Edge Detection—Approach to Road Detection in Very High Resolution Satellite Images. REMOTE SENSING 2022. [DOI: 10.3390/rs14030613] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Road detection technology plays an essential role in a variety of applications, such as urban planning, map updating, traffic monitoring and automatic vehicle navigation. Recently, there has been much development in detecting roads in high-resolution (HR) satellite images based on semantic segmentation. However, the objects being segmented in such images are of small size, and not all the information in the images is equally important when making a decision. This paper proposes a novel approach to road detection based on semantic segmentation and edge detection. Our approach aims to combine these two techniques to improve road detection, and it produces sharp-pixel segmentation maps, using the segmented masks to generate road edges. In addition, some well-known architectures, such as SegNet, used multi-scale features without refinement; thus, using attention blocks in the encoder to predict fine segmentation masks resulted in finer edges. A combination of weighted cross-entropy loss and the focal Tversky loss as the loss function is also used to deal with the highly imbalanced dataset. We conducted various experiments on two datasets describing real-world datasets covering the three largest regions in Saudi Arabia and Massachusetts. The results demonstrated that the proposed method of encoding HR feature maps effectively predicts sharp segmentation masks to facilitate accurate edge detection, even against a harsh and complicated background.
Collapse
|
17
|
Herrmann P, Busana M, Cressoni M, Lotz J, Moerer O, Saager L, Meissner K, Quintel M, Gattinoni L. Using Artificial Intelligence for Automatic Segmentation of CT Lung Images in Acute Respiratory Distress Syndrome. Front Physiol 2021; 12:676118. [PMID: 34594233 PMCID: PMC8476971 DOI: 10.3389/fphys.2021.676118] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2021] [Accepted: 08/17/2021] [Indexed: 01/17/2023] Open
Abstract
Knowledge of gas volume, tissue mass and recruitability measured by the quantitative CT scan analysis (CT-qa) is important when setting the mechanical ventilation in acute respiratory distress syndrome (ARDS). Yet, the manual segmentation of the lung requires a considerable workload. Our goal was to provide an automatic, clinically applicable and reliable lung segmentation procedure. Therefore, a convolutional neural network (CNN) was used to train an artificial intelligence (AI) algorithm on 15 healthy subjects (1,302 slices), 100 ARDS patients (12,279 slices), and 20 COVID-19 (1,817 slices). Eighty percent of this populations was used for training, 20% for testing. The AI and manual segmentation at slice level were compared by intersection over union (IoU). The CT-qa variables were compared by regression and Bland Altman analysis. The AI-segmentation of a single patient required 5–10 s vs. 1–2 h of the manual. At slice level, the algorithm showed on the test set an IOU across all CT slices of 91.3 ± 10.0, 85.2 ± 13.9, and 84.7 ± 14.0%, and across all lung volumes of 96.3 ± 0.6, 88.9 ± 3.1, and 86.3 ± 6.5% for normal lungs, ARDS and COVID-19, respectively, with a U-shape in the performance: better in the lung middle region, worse at the apex and base. At patient level, on the test set, the total lung volume measured by AI and manual segmentation had a R2 of 0.99 and a bias −9.8 ml [CI: +56.0/−75.7 ml]. The recruitability measured with manual and AI-segmentation, as change in non-aerated tissue fraction had a bias of +0.3% [CI: +6.2/−5.5%] and −0.5% [CI: +2.3/−3.3%] expressed as change in well-aerated tissue fraction. The AI-powered lung segmentation provided fast and clinically reliable results. It is able to segment the lungs of seriously ill ARDS patients fully automatically.
Collapse
Affiliation(s)
- Peter Herrmann
- Department of Anesthesiology, University Medical Center Göttingen, Göttingen, Germany
| | - Mattia Busana
- Department of Anesthesiology, University Medical Center Göttingen, Göttingen, Germany
| | | | - Joachim Lotz
- Institute for Diagnostic and Interventional Radiology, University Medical Center Göttingen, Göttingen, Germany
| | - Onnen Moerer
- Department of Anesthesiology, University Medical Center Göttingen, Göttingen, Germany
| | - Leif Saager
- Department of Anesthesiology, University Medical Center Göttingen, Göttingen, Germany
| | - Konrad Meissner
- Department of Anesthesiology, University Medical Center Göttingen, Göttingen, Germany
| | - Michael Quintel
- Department of Anesthesiology, University Medical Center Göttingen, Göttingen, Germany.,Department of Anesthesiology, DONAUISAR Klinikum Deggendorf, Deggendorf, Germany
| | - Luciano Gattinoni
- Department of Anesthesiology, University Medical Center Göttingen, Göttingen, Germany
| |
Collapse
|