1
|
Liu J, Mu J, Sun H, Dai C, Ji Z, Ganchev I. DLGRAFE-Net: A double loss guided residual attention and feature enhancement network for polyp segmentation. PLoS One 2024; 19:e0308237. [PMID: 39264899 PMCID: PMC11392264 DOI: 10.1371/journal.pone.0308237] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2024] [Accepted: 07/18/2024] [Indexed: 09/14/2024] Open
Abstract
Colon polyps represent a common gastrointestinal form. In order to effectively treat and prevent complications arising from colon polyps, colon polypectomy has become a commonly used therapeutic approach. Accurately segmenting polyps from colonoscopy images can provide valuable information for early diagnosis and treatment. Due to challenges posed by illumination and contrast variations, noise and artifacts, as well as variations in polyp size and blurred boundaries in polyp images, the robustness of segmentation algorithms is a significant concern. To address these issues, this paper proposes a Double Loss Guided Residual Attention and Feature Enhancement Network (DLGRAFE-Net) for polyp segmentation. Firstly, a newly designed Semantic and Spatial Information Aggregation (SSIA) module is used to extract and fuse edge information from low-level feature graphs and semantic information from high-level feature graphs, generating local loss-guided training for the segmentation network. Secondly, newly designed Deep Supervision Feature Fusion (DSFF) modules are utilized to fuse local loss feature graphs with multi-level features from the encoder, addressing the negative impact of background imbalance caused by varying polyp sizes. Finally, Efficient Feature Extraction (EFE) decoding modules are used to extract spatial information at different scales, establishing longer-distance spatial channel dependencies to enhance the overall network performance. Extensive experiments conducted on the CVC-ClinicDB and Kvasir-SEG datasets demonstrate that the proposed network outperforms all mainstream networks and state-of-the-art networks, exhibiting superior performance and stronger generalization capabilities.
Collapse
Affiliation(s)
- Jianuo Liu
- College of Mathematics and Computer Science, Zhejiang A&F University, Hangzhou, China
- Hebei Key Laboratory of Industrial Intelligent Perception, North China University of Science and Technology, Tangshan, China
| | - Juncheng Mu
- Hebei Key Laboratory of Industrial Intelligent Perception, North China University of Science and Technology, Tangshan, China
| | - Haoran Sun
- Hebei Key Laboratory of Industrial Intelligent Perception, North China University of Science and Technology, Tangshan, China
| | - Chenxu Dai
- Hebei Key Laboratory of Industrial Intelligent Perception, North China University of Science and Technology, Tangshan, China
| | - Zhanlin Ji
- College of Mathematics and Computer Science, Zhejiang A&F University, Hangzhou, China
- Telecommunications Research Centre (TRC), University of Limerick, Limerick, Ireland
| | - Ivan Ganchev
- Telecommunications Research Centre (TRC), University of Limerick, Limerick, Ireland
- Department of Computer Systems, University of Plovdiv "Paisii Hilendarski", Plovdiv, Bulgaria
- Institute of Mathematics and Informatics-Bulgarian Academy of Sciences, Sofia, Bulgaria
| |
Collapse
|
2
|
Ji Z, Liu J, Mu J, Zhang H, Dai C, Yuan N, Ganchev I. ResDAC-Net: a novel pancreas segmentation model utilizing residual double asymmetric spatial kernels. Med Biol Eng Comput 2024; 62:2087-2100. [PMID: 38457066 PMCID: PMC11190007 DOI: 10.1007/s11517-024-03052-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2023] [Accepted: 02/13/2024] [Indexed: 03/09/2024]
Abstract
The pancreas not only is situated in a complex abdominal background but is also surrounded by other abdominal organs and adipose tissue, resulting in blurred organ boundaries. Accurate segmentation of pancreatic tissue is crucial for computer-aided diagnosis systems, as it can be used for surgical planning, navigation, and assessment of organs. In the light of this, the current paper proposes a novel Residual Double Asymmetric Convolution Network (ResDAC-Net) model. Firstly, newly designed ResDAC blocks are used to highlight pancreatic features. Secondly, the feature fusion between adjacent encoding layers fully utilizes the low-level and deep-level features extracted by the ResDAC blocks. Finally, parallel dilated convolutions are employed to increase the receptive field to capture multiscale spatial information. ResDAC-Net is highly compatible to the existing state-of-the-art models, according to three (out of four) evaluation metrics, including the two main ones used for segmentation performance evaluation (i.e., DSC and Jaccard index).
Collapse
Affiliation(s)
- Zhanlin Ji
- Department of Artificial Intelligence, North China University of Science and Technology, Tangshan, 063009, China
| | - Jianuo Liu
- Department of Artificial Intelligence, North China University of Science and Technology, Tangshan, 063009, China
| | - Juncheng Mu
- Department of Artificial Intelligence, North China University of Science and Technology, Tangshan, 063009, China
| | - Haiyang Zhang
- Department of Computing, Xi'an Jiaotong-Liverpool University, Suzhou, China
| | - Chenxu Dai
- Department of Artificial Intelligence, North China University of Science and Technology, Tangshan, 063009, China
| | - Na Yuan
- Intelligence and Information Engineering College, Tangshan University, Tangshan, 063000, China.
| | - Ivan Ganchev
- Telecommunications Research Centre (TRC), University of Limerick, Limerick, V94 T9PX, Ireland.
- Department of Computer Systems, University of Plovdiv "Paisii Hilendarski", Plovdiv, 4000, Bulgaria.
- Institute of Mathematics and Informatics-Bulgarian Academy of Sciences, Sofia, 1040, Bulgaria.
| |
Collapse
|
3
|
He K, Peng B, Yu W, Liu Y, Liu S, Cheng J, Dai Y. A Novel Mis-Seg-Focus Loss Function Based on a Two-Stage nnU-Net Framework for Accurate Brain Tissue Segmentation. Bioengineering (Basel) 2024; 11:427. [PMID: 38790294 PMCID: PMC11118222 DOI: 10.3390/bioengineering11050427] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2024] [Revised: 04/14/2024] [Accepted: 04/24/2024] [Indexed: 05/26/2024] Open
Abstract
Brain tissue segmentation plays a critical role in the diagnosis, treatment, and study of brain diseases. Accurately identifying these boundaries is essential for improving segmentation accuracy. However, distinguishing boundaries between different brain tissues can be challenging, as they often overlap. Existing deep learning methods primarily calculate the overall segmentation results without adequately addressing local regions, leading to error propagation and mis-segmentation along boundaries. In this study, we propose a novel mis-segmentation-focused loss function based on a two-stage nnU-Net framework. Our approach aims to enhance the model's ability to handle ambiguous boundaries and overlapping anatomical structures, thereby achieving more accurate brain tissue segmentation results. Specifically, the first stage targets the identification of mis-segmentation regions using a global loss function, while the second stage involves defining a mis-segmentation loss function to adaptively adjust the model, thus improving its capability to handle ambiguous boundaries and overlapping anatomical structures. Experimental evaluations on two datasets demonstrate that our proposed method outperforms existing approaches both quantitatively and qualitatively.
Collapse
Affiliation(s)
- Keyi He
- Suzhou Institute of Biomedical Engineering and Technology, Chinese Academy of Sciences, Suzhou 215163, China; (K.H.); (B.P.); (Y.L.); (S.L.)
- The School of Electrical and Electronic Engineering, Changchun University of Technology, Changchun 130012, China;
| | - Bo Peng
- Suzhou Institute of Biomedical Engineering and Technology, Chinese Academy of Sciences, Suzhou 215163, China; (K.H.); (B.P.); (Y.L.); (S.L.)
| | - Weibo Yu
- The School of Electrical and Electronic Engineering, Changchun University of Technology, Changchun 130012, China;
| | - Yan Liu
- Suzhou Institute of Biomedical Engineering and Technology, Chinese Academy of Sciences, Suzhou 215163, China; (K.H.); (B.P.); (Y.L.); (S.L.)
| | - Surui Liu
- Suzhou Institute of Biomedical Engineering and Technology, Chinese Academy of Sciences, Suzhou 215163, China; (K.H.); (B.P.); (Y.L.); (S.L.)
| | - Jian Cheng
- State Key Laboratory of Complex & Critical Software Environment, Beihang University, Beijing 100191, China
- International Innovation Institute, Beihang University, 166 Shuanghongqiao Street, Pingyao Town, Yuhang District, Hangzhou 311115, China
| | - Yakang Dai
- Suzhou Institute of Biomedical Engineering and Technology, Chinese Academy of Sciences, Suzhou 215163, China; (K.H.); (B.P.); (Y.L.); (S.L.)
| |
Collapse
|
4
|
Zhang L, Xiao X, Wen J, Li H. MDKLoss: Medicine domain knowledge loss for skin lesion recognition. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2024; 21:2671-2690. [PMID: 38454701 DOI: 10.3934/mbe.2024118] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/09/2024]
Abstract
Methods based on deep learning have shown good advantages in skin lesion recognition. However, the diversity of lesion shapes and the influence of noise disturbances such as hair, bubbles, and markers leads to large intra-class differences and small inter-class similarities, which existing methods have not yet effectively resolved. In addition, most existing methods enhance the performance of skin lesion recognition by improving deep learning models without considering the guidance of medical knowledge of skin lesions. In this paper, we innovatively construct feature associations between different lesions using medical knowledge, and design a medical domain knowledge loss function (MDKLoss) based on these associations. By expanding the gap between samples of various lesion categories, MDKLoss enhances the capacity of deep learning models to differentiate between different lesions and consequently boosts classification performance. Extensive experiments on ISIC2018 and ISIC2019 datasets show that the proposed method achieves a maximum of 91.6% and 87.6% accuracy. Furthermore, compared with existing state-of-the-art loss functions, the proposed method demonstrates its effectiveness, universality, and superiority.
Collapse
Affiliation(s)
- Li Zhang
- The Second School of Clinical Medicine, Southern Medical University, Guangzhou 510515, China
- Department of Dermatology, Guangdong Second Provincial General Hospital, Guangzhou 510317, China
- Department of Dermatology, Ningbo No. 6 Hospital, Ningbo 315040, China
| | - Xiangling Xiao
- School of Computer Science, Guangdong Polytechnic Normal University, Guangzhou 510665, China
| | - Ju Wen
- The Second School of Clinical Medicine, Southern Medical University, Guangzhou 510515, China
- Department of Dermatology, Guangdong Second Provincial General Hospital, Guangzhou 510317, China
| | - Huihui Li
- School of Computer Science, Guangdong Polytechnic Normal University, Guangzhou 510665, China
| |
Collapse
|
5
|
Lin X, Yu L, Cheng KT, Yan Z. BATFormer: Towards Boundary-Aware Lightweight Transformer for Efficient Medical Image Segmentation. IEEE J Biomed Health Inform 2023; 27:3501-3512. [PMID: 37053058 DOI: 10.1109/jbhi.2023.3266977] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/14/2023]
Abstract
OBJECTIVE Transformers, born to remedy the inadequate receptive fields of CNNs, have drawn explosive attention recently. However, the daunting computational complexity of global representation learning, together with rigid window partitioning, hinders their deployment in medical image segmentation. This work aims to address the above two issues in transformers for better medical image segmentation. METHODS We propose a boundary-aware lightweight transformer (BATFormer) that can build cross-scale global interaction with lower computational complexity and generate windows flexibly under the guidance of entropy. Specifically, to fully explore the benefits of transformers in long-range dependency establishment, a cross-scale global transformer (CGT) module is introduced to jointly utilize multiple small-scale feature maps for richer global features with lower computational complexity. Given the importance of shape modeling in medical image segmentation, a boundary-aware local transformer (BLT) module is constructed. Different from rigid window partitioning in vanilla transformers which would produce boundary distortion, BLT adopts an adaptive window partitioning scheme under the guidance of entropy for both computational complexity reduction and shape preservation. RESULTS BATFormer achieves the best performance in Dice of 92.84 %, 91.97 %, 90.26 %, and 96.30 % for the average, right ventricle, myocardium, and left ventricle respectively on the ACDC dataset and the best performance in Dice, IoU, and ACC of 90.76 %, 84.64 %, and 96.76 % respectively on the ISIC 2018 dataset. More importantly, BATFormer requires the least amount of model parameters and the lowest computational complexity compared to the state-of-the-art approaches. CONCLUSION AND SIGNIFICANCE Our results demonstrate the necessity of developing customized transformers for efficient and better medical image segmentation. We believe the design of BATFormer is inspiring and extendable to other applications/frameworks.
Collapse
|