1
|
Liu X, Tan H, Wang W, Chen Z. Deep learning based retinal vessel segmentation and hypertensive retinopathy quantification using heterogeneous features cross-attention neural network. Front Med (Lausanne) 2024; 11:1377479. [PMID: 38841586 PMCID: PMC11150614 DOI: 10.3389/fmed.2024.1377479] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2024] [Accepted: 05/09/2024] [Indexed: 06/07/2024] Open
Abstract
Retinal vessels play a pivotal role as biomarkers in the detection of retinal diseases, including hypertensive retinopathy. The manual identification of these retinal vessels is both resource-intensive and time-consuming. The fidelity of vessel segmentation in automated methods directly depends on the fundus images' quality. In instances of sub-optimal image quality, applying deep learning-based methodologies emerges as a more effective approach for precise segmentation. We propose a heterogeneous neural network combining the benefit of local semantic information extraction of convolutional neural network and long-range spatial features mining of transformer network structures. Such cross-attention network structure boosts the model's ability to tackle vessel structures in the retinal images. Experiments on four publicly available datasets demonstrate our model's superior performance on vessel segmentation and the big potential of hypertensive retinopathy quantification.
Collapse
Affiliation(s)
- Xinghui Liu
- School of Clinical Medicine, Guizhou Medical University, Guiyang, China
- Department of Cardiovascular Medicine, Guizhou Provincial People's Hospital, Guiyang, China
| | - Hongwen Tan
- Department of Cardiovascular Medicine, Guizhou Provincial People's Hospital, Guiyang, China
| | - Wu Wang
- Electrical Engineering College, Guizhou University, Guiyang, China
| | - Zhangrong Chen
- School of Clinical Medicine, Guizhou Medical University, Guiyang, China
- Department of Cardiovascular Medicine, The Affiliated Hospital of Guizhou Medical University, Guiyang, China
| |
Collapse
|
2
|
Wang W, Xiao L, Qu G, Calhoun VD, Wang YP, Sun X. Multiview hyperedge-aware hypergraph embedding learning for multisite, multiatlas fMRI based functional connectivity network analysis. Med Image Anal 2024; 94:103144. [PMID: 38518530 DOI: 10.1016/j.media.2024.103144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Revised: 03/17/2024] [Accepted: 03/18/2024] [Indexed: 03/24/2024]
Abstract
Recently, functional magnetic resonance imaging (fMRI) based functional connectivity network (FCN) analysis via graph convolutional networks (GCNs) has shown promise for automated diagnosis of brain diseases by regarding the FCNs as irregular graph-structured data. However, multiview information and site influences of the FCNs in a multisite, multiatlas fMRI scenario have been understudied. In this paper, we propose a Class-consistency and Site-independence Multiview Hyperedge-Aware HyperGraph Embedding Learning (CcSi-MHAHGEL) framework to integrate FCNs constructed on multiple brain atlases in a multisite fMRI study. Specifically, for each subject, we first model brain network as a hypergraph for every brain atlas to characterize high-order relations among multiple vertexes, and then introduce a multiview hyperedge-aware hypergraph convolutional network (HGCN) to extract a multiatlas-based FCN embedding where hyperedge weights are adaptively learned rather than employing the fixed weights precalculated in traditional HGCNs. In addition, we formulate two modules to jointly learn the multiatlas-based FCN embeddings by considering the between-subject associations across classes and sites, respectively, i.e., a class-consistency module to encourage both compactness within every class and separation between classes for promoting discrimination in the embedding space, and a site-independence module to minimize the site dependence of the embeddings for mitigating undesired site influences due to differences in scanning platforms and/or protocols at multiple sites. Finally, the multiatlas-based FCN embeddings are fed into a few fully connected layers followed by the soft-max classifier for diagnosis decision. Extensive experiments on the ABIDE demonstrate the effectiveness of our method for autism spectrum disorder (ASD) identification. Furthermore, our method is interpretable by revealing ASD-relevant brain regions that are biologically significant.
Collapse
Affiliation(s)
- Wei Wang
- MoE Key Laboratory of Brain-inspired Intelligent Perception and Cognition, University of Science and Technology of China, Hefei 230052, China
| | - Li Xiao
- MoE Key Laboratory of Brain-inspired Intelligent Perception and Cognition, University of Science and Technology of China, Hefei 230052, China; Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, Hefei 230088, China.
| | - Gang Qu
- Department of Biomedical Engineering, Tulane University, New Orleans, LA 70118, USA
| | - Vince D Calhoun
- Tri-Institutional Center for Translational Research in Neuroimaging and Data Science (TReNDS), Georgia State University, Georgia Institute of Technology, Emory University, Atlanta, GA 30030, USA
| | - Yu-Ping Wang
- Department of Biomedical Engineering, Tulane University, New Orleans, LA 70118, USA
| | - Xiaoyan Sun
- MoE Key Laboratory of Brain-inspired Intelligent Perception and Cognition, University of Science and Technology of China, Hefei 230052, China; Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, Hefei 230088, China
| |
Collapse
|
3
|
Kang N, Wang M, Pang C, Lan R, Li B, Guan J, Wang H. Cross-patch feature interactive net with edge refinement for retinal vessel segmentation. Comput Biol Med 2024; 174:108443. [PMID: 38608328 DOI: 10.1016/j.compbiomed.2024.108443] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2024] [Revised: 03/14/2024] [Accepted: 04/07/2024] [Indexed: 04/14/2024]
Abstract
Retinal vessel segmentation based on deep learning is an important auxiliary method for assisting clinical doctors in diagnosing retinal diseases. However, existing methods often produce mis-segmentation when dealing with low contrast images and thin blood vessels, which affects the continuity and integrity of the vessel skeleton. In addition, existing deep learning methods tend to lose a lot of detailed information during training, which affects the accuracy of segmentation. To address these issues, we propose a novel dual-decoder based Cross-patch Feature Interactive Net with Edge Refinement (CFI-Net) for end-to-end retinal vessel segmentation. In the encoder part, a joint refinement down-sampling method (JRDM) is proposed to compress feature information in the process of reducing image size, so as to reduce the loss of thin vessels and vessel edge information during the encoding process. In the decoder part, we adopt a dual-path model based on edge detection, and propose a Cross-patch Interactive Attention Mechanism (CIAM) in the main path to enhancing multi-scale spatial channel features and transferring cross-spatial information. Consequently, it improve the network's ability to segment complete and continuous vessel skeletons, reducing vessel segmentation fractures. Finally, the Adaptive Spatial Context Guide Method (ASCGM) is proposed to fuse the prediction results of the two decoder paths, which enhances segmentation details while removing part of the background noise. We evaluated our model on two retinal image datasets and one coronary angiography dataset, achieving outstanding performance in segmentation comprehensive assessment metrics such as AUC and CAL. The experimental results showed that the proposed CFI-Net has superior segmentation performance compared with other existing methods, especially for thin vessels and vessel edges. The code is available at https://github.com/kita0420/CFI-Net.
Collapse
Affiliation(s)
- Ning Kang
- School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin, 541004, China
| | - Maofa Wang
- School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin, 541004, China
| | - Cheng Pang
- School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin, 541004, China; Guangxi Key Laboratory of Image and Graphic Intelligent Processing, Guilin, 541004, China
| | - Rushi Lan
- School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin, 541004, China; Guangxi Key Laboratory of Image and Graphic Intelligent Processing, Guilin, 541004, China
| | - Bingbing Li
- Department of Pathology, Ganzhou Municipal Hospital, Ganzhou, 341000, China
| | - Junlin Guan
- School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin, 541004, China.
| | - Huadeng Wang
- School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin, 541004, China; Guangxi Key Laboratory of Image and Graphic Intelligent Processing, Guilin, 541004, China
| |
Collapse
|
4
|
Jiang Q, Ye H, Yang B, Cao F. Label-Decoupled Medical Image Segmentation With Spatial-Channel Graph Convolution and Dual Attention Enhancement. IEEE J Biomed Health Inform 2024; 28:2830-2841. [PMID: 38376972 DOI: 10.1109/jbhi.2024.3367756] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/22/2024]
Abstract
Deep learning-based methods have been widely used in medical image segmentation recently. However, existing works are usually difficult to simultaneously capture global long-range information from images and topological correlations among feature maps. Further, medical images often suffer from blurred target edges. Accordingly, this paper proposes a novel medical image segmentation framework named a label-decoupled network with spatial-channel graph convolution and dual attention enhancement mechanism (LADENet for short). It constructs learnable adjacency matrices and utilizes graph convolutions to effectively capture global long-range information on spatial locations and topological dependencies between different channels in an image. Then a label-decoupled strategy based on distance transformation is introduced to decouple an original segmentation label into a body label and an edge label for supervising the body branch and edge branch. Again, a dual attention enhancement mechanism, designing a body attention block in the body branch and an edge attention block in the edge branch, is built to promote the learning ability of spatial region and boundary features. Besides, a feature interactor is devised to fully consider the information interaction between the body and edge branches to improve segmentation performance. Experiments on benchmark datasets reveal the superiority of LADENet compared to state-of-the-art approaches.
Collapse
|
5
|
Ma Y, Guo Y, Cui W, Liu J, Li Y, Wang Y, Qiang Y. SG-Transunet: A segmentation-guided Transformer U-Net model for KRAS gene mutation status identification in colorectal cancer. Comput Biol Med 2024; 173:108293. [PMID: 38574528 DOI: 10.1016/j.compbiomed.2024.108293] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2023] [Revised: 02/28/2024] [Accepted: 03/12/2024] [Indexed: 04/06/2024]
Abstract
Accurately identifying the Kirsten rat sarcoma virus (KRAS) gene mutation status in colorectal cancer (CRC) patients can assist doctors in deciding whether to use specific targeted drugs for treatment. Although deep learning methods are popular, they are often affected by redundant features from non-lesion areas. Moreover, existing methods commonly extract spatial features from imaging data, which neglect important frequency domain features and may degrade the performance of KRAS gene mutation status identification. To address this deficiency, we propose a segmentation-guided Transformer U-Net (SG-Transunet) model for KRAS gene mutation status identification in CRC. Integrating the strength of convolutional neural networks (CNNs) and Transformers, SG-Transunet offers a unique approach for both lesion segmentation and KRAS mutation status identification. Specifically, for precise lesion localization, we employ an encoder-decoder to obtain segmentation results and guide the KRAS gene mutation status identification task. Subsequently, a frequency domain supplement block is designed to capture frequency domain features, integrating it with high-level spatial features extracted in the encoding path to derive advanced spatial-frequency domain features. Furthermore, we introduce a pre-trained Xception block to mitigate the risk of overfitting associated with small-scale datasets. Following this, an aggregate attention module is devised to consolidate spatial-frequency domain features with global information extracted by the Transformer at shallow and deep levels, thereby enhancing feature discriminability. Finally, we propose a mutual-constrained loss function that simultaneously constrains the segmentation mask acquisition and gene status identification process. Experimental results demonstrate the superior performance of SG-Transunet over state-of-the-art methods in discriminating KRAS gene mutation status.
Collapse
Affiliation(s)
- Yulan Ma
- Department of Automation Science and Electrical Engineering, Beihang University, Beijing, 100191, China
| | - Yuzhu Guo
- Department of Automation Science and Electrical Engineering, Beihang University, Beijing, 100191, China
| | - Weigang Cui
- School of Engineering Medicine, Beihang University, Beijing, 100191, China
| | - Jingyu Liu
- School of Medical Technology, Beijing Institute of Technology, Beijing 100081, China
| | - Yang Li
- Department of Automation Science and Electrical Engineering, Beihang University, Beijing, 100191, China.
| | - Yingsen Wang
- College of Computer Science and Technology, Taiyuan University of Technology, Taiyuan, China
| | - Yan Qiang
- School of Software, North University of China, Taiyuan, China; College of Computer Science and Technology, Taiyuan University of Technology, Taiyuan, China.
| |
Collapse
|
6
|
Huang H, Shang Z, Yu C. FRD-Net: a full-resolution dilated convolution network for retinal vessel segmentation. BIOMEDICAL OPTICS EXPRESS 2024; 15:3344-3365. [PMID: 38855685 PMCID: PMC11161363 DOI: 10.1364/boe.522482] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/28/2024] [Revised: 04/13/2024] [Accepted: 04/17/2024] [Indexed: 06/11/2024]
Abstract
Accurate and automated retinal vessel segmentation is essential for performing diagnosis and surgical planning of retinal diseases. However, conventional U-shaped networks often suffer from segmentation errors when dealing with fine and low-contrast blood vessels due to the loss of continuous resolution in the encoding stage and the inability to recover the lost information in the decoding stage. To address this issue, this paper introduces an effective full-resolution retinal vessel segmentation network, namely FRD-Net, which consists of two core components: the backbone network and the multi-scale feature fusion module (MFFM). The backbone network achieves horizontal and vertical expansion through the interaction mechanism of multi-resolution dilated convolutions while preserving the complete image resolution. In the backbone network, the effective application of dilated convolutions with varying dilation rates, coupled with the utilization of dilated residual modules for integrating multi-scale feature maps from adjacent stages, facilitates continuous learning of multi-scale features to enhance high-level contextual information. Moreover, MFFM further enhances segmentation by fusing deeper multi-scale features with the original image, facilitating edge detail recovery for accurate vessel segmentation. In tests on multiple classical datasets,compared to state-of-the-art segmentation algorithms, FRD-Net achieves superior performance and generalization with fewer model parameters.
Collapse
Affiliation(s)
- Hua Huang
- Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China
| | - Zhenhong Shang
- Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China
- Yunnan Key Laboratory of Artificial Intelligence, Kunming University of Science and Technology, Kunming 650500, China
| | - Chunhui Yu
- Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China
| |
Collapse
|
7
|
Luo J, Cui W, Xu S, Wang L, Chen H, Li Y. A Cross-Scale Transformer and Triple-View Attention Based Domain-Rectified Transfer Learning for EEG Classification in RSVP Tasks. IEEE Trans Neural Syst Rehabil Eng 2024; 32:672-683. [PMID: 38285586 DOI: 10.1109/tnsre.2024.3359191] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2024]
Abstract
Rapid serial visual presentation (RSVP)-based brain-computer interface (BCI) is a promising target detection technique by using electroencephalogram (EEG) signals. However, existing deep learning approaches seldom considered dependencies of multi-scale temporal features and discriminative multi-view spectral features simultaneously, which limits the representation learning ability of the model and undermine the EEG classification performance. In addition, recent transfer learning-based methods generally failed to obtain transferable cross-subject invariant representations and commonly ignore the individual-specific information, leading to the poor cross-subject transfer performance. In response to these limitations, we propose a cross-scale Transformer and triple-view attention based domain-rectified transfer learning (CST-TVA-DRTL) for the RSVP classification. Specially, we first develop a cross-scale Transformer (CST) to extract multi-scale temporal features and exploit the dependencies of different scales features. Then, a triple-view attention (TVA) is designed to capture spectral features from triple views of multi-channel time-frequency images. Finally, a domain-rectified transfer learning (DRTL) framework is proposed to simultaneously obtain transferable domain-invariant representations and untransferable domain-specific representations, then utilize domain-specific information to rectify domain-invariant representations to adapt to target data. Experimental results on two public RSVP datasets suggests that our CST-TVA-DRTL outperforms the state-of-the-art methods in the RSVP classification task. The source code of our model is publicly available in https://github.com/ljbuaa/CST_TVA_DRTL.
Collapse
|
8
|
Chen S, Fan J, Ding Y, Geng H, Ai D, Xiao D, Song H, Wang Y, Yang J. PEA-Net: A progressive edge information aggregation network for vessel segmentation. Comput Biol Med 2024; 169:107766. [PMID: 38150885 DOI: 10.1016/j.compbiomed.2023.107766] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Revised: 10/18/2023] [Accepted: 11/21/2023] [Indexed: 12/29/2023]
Abstract
Automatic vessel segmentation is a critical area of research in medical image analysis, as it can greatly assist doctors in accurately and efficiently diagnosing vascular diseases. However, accurately extracting the complete vessel structure from images remains a challenge due to issues such as uneven contrast and background noise. Existing methods primarily focus on segmenting individual pixels and often fail to consider vessel features and morphology. As a result, these methods often produce fragmented results and misidentify vessel-like background noise, leading to missing and outlier points in the overall segmentation. To address these issues, this paper proposes a novel approach called the progressive edge information aggregation network for vessel segmentation (PEA-Net). The proposed method consists of several key components. First, a dual-stream receptive field encoder (DRE) is introduced to preserve fine structural features and mitigate false positive predictions caused by background noise. This is achieved by combining vessel morphological features obtained from different receptive field sizes. Second, a progressive complementary fusion (PCF) module is designed to enhance fine vessel detection and improve connectivity. This module complements the decoding path by combining features from previous iterations and the DRE, incorporating nonsalient information. Additionally, segmentation-edge decoupling enhancement (SDE) modules are employed as decoders to integrate upsampling features with nonsalient information provided by the PCF. This integration enhances both edge and segmentation information. The features in the skip connection and decoding path are iteratively updated to progressively aggregate fine structure information, thereby optimizing segmentation results and reducing topological disconnections. Experimental results on multiple datasets demonstrate that the proposed PEA-Net model and strategy achieve optimal performance in both pixel-level and topology-level metrics.
Collapse
Affiliation(s)
- Sigeng Chen
- Laboratory of Beijing Engineering Research Center of Mixed Reality and Advanced Display, School of Optics and Photonics, Beijing Institute of Technology, Beijing, 100081, China
| | - Jingfan Fan
- Laboratory of Beijing Engineering Research Center of Mixed Reality and Advanced Display, School of Optics and Photonics, Beijing Institute of Technology, Beijing, 100081, China.
| | - Yang Ding
- Laboratory of Beijing Engineering Research Center of Mixed Reality and Advanced Display, School of Optics and Photonics, Beijing Institute of Technology, Beijing, 100081, China
| | - Haixiao Geng
- Laboratory of Beijing Engineering Research Center of Mixed Reality and Advanced Display, School of Optics and Photonics, Beijing Institute of Technology, Beijing, 100081, China
| | - Danni Ai
- Laboratory of Beijing Engineering Research Center of Mixed Reality and Advanced Display, School of Optics and Photonics, Beijing Institute of Technology, Beijing, 100081, China
| | - Deqiang Xiao
- Laboratory of Beijing Engineering Research Center of Mixed Reality and Advanced Display, School of Optics and Photonics, Beijing Institute of Technology, Beijing, 100081, China
| | - Hong Song
- School of Computer Science and Technology, Beijing Institute of Technology, Beijing, 100081, China
| | - Yining Wang
- Department of Radiology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, 100730, China.
| | - Jian Yang
- Laboratory of Beijing Engineering Research Center of Mixed Reality and Advanced Display, School of Optics and Photonics, Beijing Institute of Technology, Beijing, 100081, China.
| |
Collapse
|
9
|
Wang G, Zhou P, Gao H, Qin Z, Wang S, Sun J, Yu H. Coronary vessel segmentation in coronary angiography with a multi-scale U-shaped transformer incorporating boundary aggregation and topology preservation. Phys Med Biol 2024; 69:025012. [PMID: 38200403 DOI: 10.1088/1361-6560/ad0b63] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Accepted: 11/09/2023] [Indexed: 01/12/2024]
Abstract
Coronary vessel segmentation plays a pivotal role in automating the auxiliary diagnosis of coronary heart disease. The continuity and boundary accuracy of the segmented vessels directly affect the subsequent processing. Notably, during segmentation, vessels with severe stenosis can easily cause boundary errors and breakage, resulting in isolated islands. To address these issues, we propose a novel multi-scale U-shaped transformer with boundary aggregation and topology preservation (UT-BTNet) for coronary vessel segmentation in coronary angiography. Specifically, considering the characteristics of coronary vessels, we first develop the UT-BTNet for coronary vessels segmentation, which combines the advantages of a convolutional neural networks (CNN) and a transformer, and is able to effectively extract the local and global features of angiographic images. Secondly, we innovatively employ boundary loss and topological loss in two stages, in addition to the traditional losses. In the first stage, boundary loss is adopted, which has the effect of boundary aggregation. In the second stage, topological loss is applied to preserve the topology of the vessels, after the network converges. In the experiment, in addition to the two metrics of Dice and intersection over union (IoU), we specifically propose two metrics of boundary intersection over union (BIoU) and Betti error to evaluate boundary accuracy and the continuity of segmentation results. The results show that the Dice is 0.9291, the IoU is 0.8687, the BIoU is 0.5094, and the Betti error is 0.3400. Compared with the other state-of-the-art methods, UT-BTNet achieves better segmentation results, while ensuring the continuity and boundary accuracy of the vessels, indicating its potential clinical value.
Collapse
Affiliation(s)
- Guangpu Wang
- Department of Biomedical Engineering, School of Precision Instrument and Opto-Electronics Engineering, Tianjin University, People's Republic of China
| | - Peng Zhou
- Department of Biomedical Engineering, School of Precision Instrument and Opto-Electronics Engineering, Tianjin University, People's Republic of China
- Academy of Medical Engineering and Translational Medicine, Tianjin University, People's Republic of China
| | - Hui Gao
- Department of Biomedical Engineering, School of Precision Instrument and Opto-Electronics Engineering, Tianjin University, People's Republic of China
| | - Zewei Qin
- Department of Biomedical Engineering, School of Precision Instrument and Opto-Electronics Engineering, Tianjin University, People's Republic of China
| | - Shuo Wang
- Academy of Medical Engineering and Translational Medicine, Tianjin University, People's Republic of China
| | - Jinglai Sun
- Department of Biomedical Engineering, School of Precision Instrument and Opto-Electronics Engineering, Tianjin University, People's Republic of China
| | - Hui Yu
- Department of Biomedical Engineering, School of Precision Instrument and Opto-Electronics Engineering, Tianjin University, People's Republic of China
- Academy of Medical Engineering and Translational Medicine, Tianjin University, People's Republic of China
| |
Collapse
|
10
|
Liu R, Pu W, Nan H, Zou Y. Retina image segmentation using the three-path Unet model. Sci Rep 2023; 13:22579. [PMID: 38114637 PMCID: PMC10730848 DOI: 10.1038/s41598-023-50141-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Accepted: 12/15/2023] [Indexed: 12/21/2023] Open
Abstract
Unsupervised image segmentation is a technique that divides an image into distinct regions or objects without prior labeling. This approach offers flexibility and adaptability to various types of image data. Particularly for large datasets, it eliminates the need for manual labeling, thereby it presents advantages in terms of time and labor costs. However, when applied to retinal image segmentation, challenges arise due to variations in data, presence of noise, and manual threshold adjustments, which can lead to over-segmentation or under-segmentation of small blood vessel boundaries and endpoints. In order to enhance the precision and accuracy of retinal image segmentation, we propose a novel image supervised segmentation network based on three-path Unet model.Firstly, the Haar wavelet transform is employed to extract high-frequency image information, which forms the foundation for the proposed HaarNet, a Unet-inspired architecture. Next, the HaarNet is integrated with the Unet and SegNet frameworks to develop a three-path Unet model, referred to as TP-Unet. Finally, the model is further refined into TP-Unet+AE+DSL by incorporating the advantages of auto-encoding (AE) and deep supervised learning (DSL) techniques, thereby enhancing the overall performance of the system. To evaluate the effectiveness of our proposed model, we conduct experiments using the DRIVE and CHASE public datasets. On the DRIVE dataset, our recommended model achieves a Dice coefficient of 0.8291 and a sensitivity index of 0.8184. These results significantly outperform the Unet model by [Formula: see text] and [Formula: see text], respectively. Furthermore, our model demonstrates excellent performance on the CHASE dataset, with a Dice coefficient of 0.8162, a sensitivity of 0.8242, and an accuracy of 0.9664. These metrics surpass the Unet model by [Formula: see text], [Formula: see text], and [Formula: see text], respectively. Our proposed model provides more accurate and reliable results for retinal vessel segmentation, which holds significant potential for assisting doctors in their diagnosis.
Collapse
Affiliation(s)
- Ruihua Liu
- School of Artificial Intelligence, Chongqing University of Technology, Chongqing, China.
| | - Wei Pu
- School of Artificial Intelligence, Chongqing University of Technology, Chongqing, China
- Chongqing Vocational College of Transportation, Chongqing, China
| | - Haoyu Nan
- School of Artificial Intelligence, Chongqing University of Technology, Chongqing, China.
- OPT Machine Vision Tech Co., Ltd., Guangdong, China.
| | - Yangyang Zou
- School of Artificial Intelligence, Chongqing University of Technology, Chongqing, China
| |
Collapse
|
11
|
Xu S, Duan L, Zhang Y, Zhang Z, Sun T, Tian L. Graph- and transformer-guided boundary aware network for medical image segmentation. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2023; 242:107849. [PMID: 37837887 DOI: 10.1016/j.cmpb.2023.107849] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Revised: 09/29/2023] [Accepted: 10/06/2023] [Indexed: 10/16/2023]
Abstract
BACKGROUND AND OBJECTIVE Despite the considerable progress achieved by U-Net-based models, medical image segmentation remains a challenging task due to complex backgrounds, irrelevant noises, and ambiguous boundaries. In this study, we present a novel approach called U-shaped Graph- and Transformer-guided Boundary Aware Network (GTBA-Net) to tackle these challenges. METHODS GTBA-Net uses the pre-trained ResNet34 as its basic structure, and involves Global Feature Aggregation (GFA) modules for target localization, Graph-based Dynamic Feature Fusion (GDFF) modules for effective noise suppression, and Uncertainty-based Boundary Refinement (UBR) modules for accurate delineation of ambiguous boundaries. The GFA modules employ an efficient self-attention mechanism to facilitate coarse target localization amidst complex backgrounds, without introducing additional computational complexity. The GDFF modules leverage graph attention mechanism to aggregate information hidden among high- and low-level features, effectively suppressing target-irrelevant noises while preserving valuable spatial details. The UBR modules introduce an uncertainty quantification strategy and auxiliary loss to guide the model's focus towards target regions and uncertain "ridges", gradually mitigating boundary uncertainty and ultimately achieving accurate boundary delineation. RESULTS Comparative experiments on five datasets encompassing diverse modalities (including X-ray, CT, endoscopic procedures, and ultrasound) demonstrate that the proposed GTBA-Net outperforms existing methods in various challenging scenarios. Subsequent ablation studies further demonstrate the efficacy of the GFA, GDFF, and UBR modules in target localization, noise suppression, and ambiguous boundary delineation, respectively. CONCLUSIONS GTBA-Net exhibits substantial potential for extensive application in the field of medical image segmentation, particularly in scenarios involving complex backgrounds, target-irrelevant noises, or ambiguous boundaries.
Collapse
Affiliation(s)
- Shanshan Xu
- School of Computer and Information Technology, Beijing Jiaotong University, Beijing, China; Beijing Key Laboratory of Traffic Data Analysis and Mining, Beijing Jiaotong University, Beijing 100044, China
| | - Lianhong Duan
- The Second School of Clinical Medicine, Southern Medical University, Guangzhou, China; Senior Department of Orthopedics, The Fourth Medical Center of PLA General Hospital, Beijing, China
| | - Yang Zhang
- Senior Department of Orthopedics, The Fourth Medical Center of PLA General Hospital, Beijing, China
| | - Zhicheng Zhang
- Senior Department of Orthopedics, The Fourth Medical Center of PLA General Hospital, Beijing, China
| | - Tiansheng Sun
- The Second School of Clinical Medicine, Southern Medical University, Guangzhou, China; Senior Department of Orthopedics, The Fourth Medical Center of PLA General Hospital, Beijing, China.
| | - Lixia Tian
- School of Computer and Information Technology, Beijing Jiaotong University, Beijing, China.
| |
Collapse
|
12
|
Liu F, Huang W. ESDiff: a joint model for low-quality retinal image enhancement and vessel segmentation using a diffusion model. BIOMEDICAL OPTICS EXPRESS 2023; 14:6563-6578. [PMID: 38420298 PMCID: PMC10898574 DOI: 10.1364/boe.506205] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Revised: 11/01/2023] [Accepted: 11/13/2023] [Indexed: 03/02/2024]
Abstract
In clinical screening, accurate diagnosis of various diseases relies on the extraction of blood vessels from fundus images. However, clinical fundus images often suffer from uneven illumination, blur, and artifacts caused by equipment or environmental factors. In this paper, we propose a unified framework called ESDiff to address these challenges by integrating retinal image enhancement and vessel segmentation. Specifically, we introduce a novel diffusion model-based framework for image enhancement, incorporating mask refinement as an auxiliary task via a vessel mask-aware diffusion model. Furthermore, we utilize low-quality retinal fundus images and their corresponding illumination maps as inputs to the modified UNet to obtain degradation factors that effectively preserve pathological features and pertinent information. This approach enhances the intermediate results within the iterative process of the diffusion model. Extensive experiments on publicly available fundus retinal datasets (i.e. DRIVE, STARE, CHASE_DB1 and EyeQ) demonstrate the effectiveness of ESDiff compared to state-of-the-art methods.
Collapse
Affiliation(s)
- Fengting Liu
- School of Information Science and Engineering, Shandong Normal University, Jinan, 250300, China
| | - Wenhui Huang
- School of Information Science and Engineering, Shandong Normal University, Jinan, 250300, China
| |
Collapse
|
13
|
Shen N, Xu T, Huang S, Mu F, Li J. Expert-Guided Knowledge Distillation for Semi-Supervised Vessel Segmentation. IEEE J Biomed Health Inform 2023; 27:5542-5553. [PMID: 37669209 DOI: 10.1109/jbhi.2023.3312338] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/07/2023]
Abstract
In medical image analysis, blood vessel segmentation is of considerable clinical value for diagnosis and surgery. The predicaments of complex vascular structures obstruct the development of the field. Despite many algorithms have emerged to get off the tight corners, they rely excessively on careful annotations for tubular vessel extraction. A practical solution is to excavate the feature information distribution from unlabeled data. This work proposes a novel semi-supervised vessel segmentation framework, named EXP-Net, to navigate through finite annotations. Based on the training mechanism of the Mean Teacher model, we innovatively engage an expert network in EXP-Net to enhance knowledge distillation. The expert network comprises knowledge and connectivity enhancement modules, which are respectively in charge of modeling feature relationships from global and detailed perspectives. In particular, the knowledge enhancement module leverages the vision transformer to highlight the long-range dependencies among multi-level token components; the connectivity enhancement module maximizes the properties of topology and geometry by skeletonizing the vessel in a non-parametric manner. The key components are dedicated to the conditions of weak vessel connectivity and poor pixel contrast. Extensive evaluations show that our EXP-Net achieves state-of-the-art performance on subcutaneous vessel, retinal vessel, and coronary artery segmentations.
Collapse
|
14
|
Wu Q, Tang X, Li R, Liu L, Chen HL. An enhanced decision-making framework for predicting future trends of sharing economy. PLoS One 2023; 18:e0291626. [PMID: 37797038 PMCID: PMC10553323 DOI: 10.1371/journal.pone.0291626] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2023] [Accepted: 09/04/2023] [Indexed: 10/07/2023] Open
Abstract
This work aims to provide a reliable and intelligent prediction model for future trends in sharing economy. Moreover, it presents valuable insights for decision-making and policy development by relevant governmental bodies. Furthermore, the study introduces a predictive system that incorporates an enhanced Harris Hawk Optimization (HHO) algorithm and a K-Nearest Neighbor (KNN) forecasting framework. The method utilizes an improved simulated annealing mechanism and a Gaussian bare bone structure to improve the original HHO, termed SGHHO. To achieve optimal prediction performance and identify essential features, a refined simulated annealing mechanism is employed to mitigate the susceptibility of the original HHO algorithm to local optima. The algorithm employs a mechanism that boosts its global search ability by generating fresh solution sets at a specific likelihood. This mechanism dynamically adjusts the equilibrium between the exploration and exploitation phases, incorporating the Gaussian bare bone strategy. The best classification model (SGHHO-KNN) is developed to mine the key features with the improvement of both strategies. To assess the exceptional efficacy of the SGHHO algorithm, this investigation conducted a series of comparative trials employing the function set of IEEE CEC 2014. The outcomes of these experiments unequivocally demonstrate that the SGHHO algorithm outperforms the original HHO algorithm on 96.7% of the functions, substantiating its remarkable superiority. The algorithm can achieve the optimal value of the function on 67% of the tested functions and significantly outperforms other competing algorithms. In addition, the key features selected by the SGHHO-KNN model in the prediction experiment, including " Form of sharing economy in your region " and " Attitudes to the sharing economy ", are important for predicting the future trends of the sharing economy in this study. The results of the prediction demonstrate that the proposed model achieves an accuracy rate of 99.70% and a specificity rate of 99.38%. Consequently, the SGHHO-KNN model holds great potential as a reliable tool for forecasting the forthcoming trajectory of the sharing economy.
Collapse
Affiliation(s)
- Qiong Wu
- School of Marxism, Wenzhou University, Wenzhou, China
| | - Xiaoxiao Tang
- School of Marxism, Wenzhou University, Wenzhou, China
| | - Rongjie Li
- Wenzhou Business College, Wenzhou, China
| | - Lei Liu
- College of Computer Science, Sichuan University, Chengdu, Sichuan, China
| | - Hui-Ling Chen
- College of Computer Science an Artificial Intelligence, Wenzhou University, Wenzhou, China
| |
Collapse
|
15
|
Huang Y, Deng T. Multi-level spatial-temporal and attentional information deep fusion network for retinal vessel segmentation. Phys Med Biol 2023; 68:195026. [PMID: 37567227 DOI: 10.1088/1361-6560/acefa0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2023] [Accepted: 08/11/2023] [Indexed: 08/13/2023]
Abstract
Objective.Automatic segmentation of fundus vessels has the potential to enhance the judgment ability of intelligent disease diagnosis systems. Even though various methods have been proposed, it is still a demanding task to accurately segment the fundus vessels. The purpose of our study is to develop a robust and effective method to segment the vessels in human color retinal fundus images.Approach.We present a novel multi-level spatial-temporal and attentional information deep fusion network for the segmentation of retinal vessels, called MSAFNet, which enhances segmentation performance and robustness. Our method utilizes the multi-level spatial-temporal encoding module to obtain spatial-temporal information and the Self-Attention module to capture feature correlations in different levels of our network. Based on the encoder and decoder structure, we combine these features to get the final segmentation results.Main results.Through abundant experiments on four public datasets, our method achieves preferable performance compared with other SOTA retinal vessel segmentation methods. Our Accuracy and Area Under Curve achieve the highest scores of 96.96%, 96.57%, 96.48% and 98.78%, 98.54%, 98.27% on DRIVE, CHASE_DB1, and HRF datasets. Our Specificity achieves the highest score of 98.58% and 99.08% on DRIVE and STARE datasets.Significance.The experimental results demonstrate that our method has strong learning and representation capabilities and can accurately detect retinal blood vessels, thereby serving as a potential tool for assisting in diagnosis.
Collapse
Affiliation(s)
- Yi Huang
- School of Information Science and Technology, Southwest Jiaotong University, 611756, Chengdu, People's Republic of China
| | - Tao Deng
- School of Information Science and Technology, Southwest Jiaotong University, 611756, Chengdu, People's Republic of China
| |
Collapse
|
16
|
Hao S, Huang C, Heidari AA, Xu Z, Chen H, Alabdulkreem E, Elmannai H, Wang X. Multi-threshold image segmentation using an enhanced fruit fly optimization for COVID-19 X-ray images. Biomed Signal Process Control 2023; 86:105147. [PMID: 37361197 PMCID: PMC10266503 DOI: 10.1016/j.bspc.2023.105147] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2022] [Revised: 04/22/2023] [Accepted: 06/08/2023] [Indexed: 06/28/2023]
Abstract
Since the outbreak of COVID-19, it has seriously endangered the health of human beings. Computer automatic segmentation of COVID-19 X-ray images is an important means to assist doctors in rapid and accurate diagnosis. Therefore, this paper proposes a modified FOA (EEFOA) with two optimization strategies added to the original FOA, including elite natural evolution (ENE) and elite random mutation (ERM). To be specific, ENE and ERM can effectively speed up the convergence and deal with the problem of local optima, respectively. The outstanding performance of EEFOA was confirmed by experimental results comparing EEFOA with the original FOA, other FOA variants, and advanced algorithms at CEC2014. After that, EEFOA is implemented for multi-threshold image segmentation (MIS) of COVID-19 X-ray images, where a 2D histogram consisting of the original greyscale image and the non-local means image is used to represent the image information, and Rényi's entropy is used as the objective function to find the maximum value. The evaluation results of the MIS segmentation experiments show that, whether high or low threshold, EEFOA can achieve higher quality segmentation results and greater robustness than other advanced segmentation methods.
Collapse
Affiliation(s)
- Shuhui Hao
- Key Laboratory of Intelligent Informatics for Safety & Emergency of Zhejiang Province, Wenzhou University, Wenzhou 325035, China
| | - Changcheng Huang
- Key Laboratory of Intelligent Informatics for Safety & Emergency of Zhejiang Province, Wenzhou University, Wenzhou 325035, China
| | - Ali Asghar Heidari
- Key Laboratory of Intelligent Informatics for Safety & Emergency of Zhejiang Province, Wenzhou University, Wenzhou 325035, China
| | - Zhangze Xu
- Key Laboratory of Intelligent Informatics for Safety & Emergency of Zhejiang Province, Wenzhou University, Wenzhou 325035, China
| | - Huiling Chen
- Key Laboratory of Intelligent Informatics for Safety & Emergency of Zhejiang Province, Wenzhou University, Wenzhou 325035, China
| | - Eatedal Alabdulkreem
- Department of Computer Science, College of Computer and Information Science, Princess Nourah Bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia
| | - Hela Elmannai
- Department of Information Technology, College of Computer and Information Sciences, Princess Nourah Bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia
| | - Xianchuan Wang
- Information Technology Center, Wenzhou Medical University, Wenzhou 325035, China
| |
Collapse
|
17
|
Li Y, Zhang Y, Liu JY, Wang K, Zhang K, Zhang GS, Liao XF, Yang G. Global Transformer and Dual Local Attention Network via Deep-Shallow Hierarchical Feature Fusion for Retinal Vessel Segmentation. IEEE TRANSACTIONS ON CYBERNETICS 2023; 53:5826-5839. [PMID: 35984806 DOI: 10.1109/tcyb.2022.3194099] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Clinically, retinal vessel segmentation is a significant step in the diagnosis of fundus diseases. However, recent methods generally neglect the difference of semantic information between deep and shallow features, which fail to capture the global and local characterizations in fundus images simultaneously, resulting in the limited segmentation performance for fine vessels. In this article, a global transformer (GT) and dual local attention (DLA) network via deep-shallow hierarchical feature fusion (GT-DLA-dsHFF) are investigated to solve the above limitations. First, the GT is developed to integrate the global information in the retinal image, which effectively captures the long-distance dependence between pixels, alleviating the discontinuity of blood vessels in the segmentation results. Second, DLA, which is constructed using dilated convolutions with varied dilation rates, unsupervised edge detection, and squeeze-excitation block, is proposed to extract local vessel information, consolidating the edge details in the segmentation result. Finally, a novel deep-shallow hierarchical feature fusion (dsHFF) algorithm is studied to fuse the features in different scales in the deep learning framework, respectively, which can mitigate the attenuation of valid information in the process of feature fusion. We verified the GT-DLA-dsHFF on four typical fundus image datasets. The experimental results demonstrate our GT-DLA-dsHFF achieves superior performance against the current methods and detailed discussions verify the efficacy of the proposed three modules. Segmentation results of diseased images show the robustness of our proposed GT-DLA-dsHFF. Implementation codes will be available on https://github.com/YangLibuaa/GT-DLA-dsHFF.
Collapse
|
18
|
Lu H, Huang L, Xie Y, Zhou Z, Cui H, Jing S, Yang Z, Zhu D, Wang S, Bao D, Liang G, Cai Z, Chen H, He W. Prediction of fractional flow reserve with enhanced ant lion optimized support vector machine. Heliyon 2023; 9:e18832. [PMID: 37588610 PMCID: PMC10425907 DOI: 10.1016/j.heliyon.2023.e18832] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2023] [Revised: 07/13/2023] [Accepted: 07/31/2023] [Indexed: 08/18/2023] Open
Abstract
The evaluation of coronary morphology provides important guidance for the treatment of coronary heart disease (CHD). A chaotic Gaussian mutation antlion optimizer algorithm (CGALO) is proposed in the paper, and it is combined with SVM to construct a classification prediction model for Fractional flow reserve (FFR). To overcome the limitations of the original antlion optimizer (ALO) algorithm, the chaotic Gaussian mutation strategy is introduced, which leads to an improvement in its convergence speed and accuracy. To evaluate the proposed algorithm's performance, comparative experiments were conducted on 23 benchmark functions alongside 12 other cutting-edge optimization algorithms. The experimental outcomes demonstrate that the proposed algorithm achieves superior convergence accuracy and speed compared to the alternative comparison algorithms. Additionally, it is combined with SVM and FS to construct a hierarchical FFR classification model, which is utilized to make effective predictions for 84 patients at the affiliated hospital of medical school, Ningbo university. The experimental results demonstrate that the proposed model achieves an average accuracy of 92%. Moreover, it concludes that smoking history, number of lesion vessels, lesion location, diffuse lesions and ST segment changes, and other factors are the most critical indicators for FFR. Therefore, the model that has been established is a new FFR intelligent classification prediction technology that can effectively assist doctors in making corresponding decisions and evaluation plans.
Collapse
Affiliation(s)
- Haoxuan Lu
- Department of Cardiology, The First Affiliated Hospital of Ningbo University, Ningbo, Zhejiang, 315020, PR China
| | - Li Huang
- Department of Emergency, The First Affiliated Hospital of Ningbo University, Ningbo, Zhejiang, 315020, PR China
| | - Yanqing Xie
- Department of Cardiology, The First Affiliated Hospital of Ningbo University, Ningbo, Zhejiang, 315020, PR China
| | - Zhong Zhou
- Department of Cardiology, The First Affiliated Hospital of Ningbo University, Ningbo, Zhejiang, 315020, PR China
| | - Hanbin Cui
- Department of Cardiology, The First Affiliated Hospital of Ningbo University, Ningbo, Zhejiang, 315020, PR China
| | - Sheng Jing
- Department of Cardiology, The First Affiliated Hospital of Ningbo University, Ningbo, Zhejiang, 315020, PR China
| | - Zhuo Yang
- Department of Cardiology, The First Affiliated Hospital of Ningbo University, Ningbo, Zhejiang, 315020, PR China
| | - Decai Zhu
- Department of Cardiology, The First Affiliated Hospital of Ningbo University, Ningbo, Zhejiang, 315020, PR China
| | - Shiqi Wang
- Department of Cardiology, The First Affiliated Hospital of Ningbo University, Ningbo, Zhejiang, 315020, PR China
| | - Donggang Bao
- Department of Cardiology, The First Affiliated Hospital of Ningbo University, Ningbo, Zhejiang, 315020, PR China
| | - Guoxi Liang
- Department of Information Technology, Wenzhou Polytechnic, Wenzhou, 325035, China
| | - Zhennao Cai
- Department of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou, 325035, China
| | - Huiling Chen
- Department of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou, 325035, China
| | - Wenming He
- Department of Cardiology, The First Affiliated Hospital of Ningbo University, Ningbo, Zhejiang, 315020, PR China
| |
Collapse
|
19
|
Shi T, Ding X, Zhou W, Pan F, Yan Z, Bai X, Yang X. Affinity Feature Strengthening for Accurate, Complete and Robust Vessel Segmentation. IEEE J Biomed Health Inform 2023; 27:4006-4017. [PMID: 37163397 DOI: 10.1109/jbhi.2023.3274789] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/12/2023]
Abstract
Vessel segmentation is crucial in many medical image applications, such as detecting coronary stenoses, retinal vessel diseases and brain aneurysms. However, achieving high pixel-wise accuracy, complete topology structure and robustness to various contrast variations are critical and challenging, and most existing methods focus only on achieving one or two of these aspects. In this paper, we present a novel approach, the affinity feature strengthening network (AFN), which jointly models geometry and refines pixel-wise segmentation features using a contrast-insensitive, multiscale affinity approach. Specifically, we compute a multiscale affinity field for each pixel, capturing its semantic relationships with neighboring pixels in the predicted mask image. This field represents the local geometry of vessel segments of different sizes, allowing us to learn spatial- and scale-aware adaptive weights to strengthen vessel features. We evaluate our AFN on four different types of vascular datasets: X-ray angiography coronary vessel dataset (XCAD), portal vein dataset (PV), digital subtraction angiography cerebrovascular vessel dataset (DSA) and retinal vessel dataset (DRIVE). Extensive experimental results demonstrate that our AFN outperforms the state-of-the-art methods in terms of both higher accuracy and topological metrics, while also being more robust to various contrast changes.
Collapse
|
20
|
Chen J, Cai Z, Heidari AA, Chen H, He Q, Escorcia-Gutierrez J, Mansour RF. Multi-threshold image segmentation based on an improved differential evolution: Case study of thyroid papillary carcinoma. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2023.104893] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/30/2023]
|
21
|
He X, Shan W, Zhang R, Heidari AA, Chen H, Zhang Y. Improved Colony Predation Algorithm Optimized Convolutional Neural Networks for Electrocardiogram Signal Classification. Biomimetics (Basel) 2023; 8:268. [PMID: 37504156 PMCID: PMC10377160 DOI: 10.3390/biomimetics8030268] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2023] [Revised: 06/18/2023] [Accepted: 06/18/2023] [Indexed: 07/29/2023] Open
Abstract
Recently, swarm intelligence algorithms have received much attention because of their flexibility for solving complex problems in the real world. Recently, a new algorithm called the colony predation algorithm (CPA) has been proposed, taking inspiration from the predatory habits of groups in nature. However, CPA suffers from poor exploratory ability and cannot always escape solutions known as local optima. Therefore, to improve the global search capability of CPA, an improved variant (OLCPA) incorporating an orthogonal learning strategy is proposed in this paper. Then, considering the fact that the swarm intelligence algorithm can go beyond the local optimum and find the global optimum solution, a novel OLCPA-CNN model is proposed, which uses the OLCPA algorithm to tune the parameters of the convolutional neural network. To verify the performance of OLCPA, comparison experiments are designed to compare with other traditional metaheuristics and advanced algorithms on IEEE CEC 2017 benchmark functions. The experimental results show that OLCPA ranks first in performance compared to the other algorithms. Additionally, the OLCPA-CNN model achieves high accuracy rates of 97.7% and 97.8% in classifying the MIT-BIH Arrhythmia and European ST-T datasets.
Collapse
Affiliation(s)
- Xinxin He
- School of Emergency Management, Institute of Disaster Prevention, Sanhe 065201, China
| | - Weifeng Shan
- School of Emergency Management, Institute of Disaster Prevention, Sanhe 065201, China
| | - Ruilei Zhang
- School of Emergency Management, Institute of Disaster Prevention, Sanhe 065201, China
| | - Ali Asghar Heidari
- School of Surveying and Geospatial Engineering, College of Engineering, University of Tehran, Tehran 1417935840, Iran
| | - Huiling Chen
- Institute of Big Data and Information Technology, Wenzhou University, Wenzhou 325000, China
| | - Yudong Zhang
- School of Computing and Mathematical Sciences, University of Leicester, Leicester LE1 7RH, UK
| |
Collapse
|
22
|
Huang KW, Yang YR, Huang ZH, Liu YY, Lee SH. Retinal Vascular Image Segmentation Using Improved UNet Based on Residual Module. Bioengineering (Basel) 2023; 10:722. [PMID: 37370653 DOI: 10.3390/bioengineering10060722] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2023] [Revised: 06/01/2023] [Accepted: 06/12/2023] [Indexed: 06/29/2023] Open
Abstract
In recent years, deep learning technology for clinical diagnosis has progressed considerably, and the value of medical imaging continues to increase. In the past, clinicians evaluated medical images according to their individual expertise. In contrast, the application of artificial intelligence technology for automatic analysis and diagnostic assistance to support clinicians in evaluating medical information more efficiently has become an important trend. In this study, we propose a machine learning architecture designed to segment images of retinal blood vessels based on an improved U-Net neural network model. The proposed model incorporates a residual module to extract features more effectively, and includes a full-scale skip connection to combine low level details with high-level features at different scales. The results of an experimental evaluation show that the model was able to segment images of retinal vessels accurately. The proposed method also outperformed several existing models on the benchmark datasets DRIVE and ROSE, including U-Net, ResUNet, U-Net3+, ResUNet++, and CaraNet.
Collapse
Affiliation(s)
- Ko-Wei Huang
- Department of Electrical Engineering, National Kaohsiung University of Science and Technology, Kaohsiung 80778, Taiwan
| | - Yao-Ren Yang
- Department of Electrical Engineering, National Kaohsiung University of Science and Technology, Kaohsiung 80778, Taiwan
| | - Zih-Hao Huang
- Department of Electrical Engineering, National Kaohsiung University of Science and Technology, Kaohsiung 80778, Taiwan
| | - Yi-Yang Liu
- Department of Electrical Engineering, National Kaohsiung University of Science and Technology, Kaohsiung 80778, Taiwan
- Department of Urology, Kaohsiung Chang Gung Memorial Hospital, Chang Gung University College of Medicine, Kaohsiung 83301, Taiwan
| | - Shih-Hsiung Lee
- Department of Intelligent Commerce, National Kaohsiung University of Science and Technology, Kaohsiung 82444, Taiwan
| |
Collapse
|
23
|
Chen Z, Xuan P, Heidari AA, Liu L, Wu C, Chen H, Escorcia-Gutierrez J, Mansour RF. An artificial bee bare-bone hunger games search for global optimization and high-dimensional feature selection. iScience 2023; 26:106679. [PMID: 37216098 PMCID: PMC10193239 DOI: 10.1016/j.isci.2023.106679] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2023] [Revised: 03/01/2023] [Accepted: 04/12/2023] [Indexed: 05/24/2023] Open
Abstract
The domains of contemporary medicine and biology have generated substantial high-dimensional genetic data. Identifying representative genes and decreasing the dimensionality of the data can be challenging. The goal of gene selection is to minimize computing costs and enhance classification precision. Therefore, this article designs a new wrapper gene selection algorithm named artificial bee bare-bone hunger games search (ABHGS), which is the hunger games search (HGS) integrated with an artificial bee strategy and a Gaussian bare-bone structure to address this issue. To evaluate and validate the performance of our proposed method, ABHGS is compared to HGS and a single strategy embedded in HGS, six classic algorithms, and ten advanced algorithms on the CEC 2017 functions. The experimental results demonstrate that the bABHGS outperforms the original HGS. Compared to peers, it increases classification accuracy and decreases the number of selected features, indicating its actual engineering utility in spatial search and feature selection.
Collapse
Affiliation(s)
- Zhiqing Chen
- School of Intelligent Manufacturing, Wenzhou Polytechnic, Wenzhou 325035, China
| | - Ping Xuan
- Department of Computer Science, School of Engineering, Shantou University, Shantou 515063, China
| | - Ali Asghar Heidari
- Key Laboratory of Intelligent Informatics for Safety & Emergency of Zhejiang Province, Wenzhou University, Wenzhou 325035, China
| | - Lei Liu
- College of Computer Science, Sichuan University, Chengdu, Sichuan 610065, China
| | - Chengwen Wu
- Key Laboratory of Intelligent Informatics for Safety & Emergency of Zhejiang Province, Wenzhou University, Wenzhou 325035, China
| | - Huiling Chen
- Key Laboratory of Intelligent Informatics for Safety & Emergency of Zhejiang Province, Wenzhou University, Wenzhou 325035, China
| | - José Escorcia-Gutierrez
- Department of Computational Science and Electronics, Universidad de la Costa, CUC, Barranquilla 080002, Colombia
| | - Romany F. Mansour
- Department of Mathematics, Faculty of Science, New Valley University, El-Kharga 72511, Egypt
| |
Collapse
|
24
|
Qiao Z, Li L, Zhao X, Liu L, Zhang Q, Hechmi S, Atri M, Li X. An enhanced Runge Kutta boosted machine learning framework for medical diagnosis. Comput Biol Med 2023; 160:106949. [PMID: 37159961 DOI: 10.1016/j.compbiomed.2023.106949] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2022] [Revised: 03/27/2023] [Accepted: 04/15/2023] [Indexed: 05/11/2023]
Abstract
With the development and maturity of machine learning methods, medical diagnosis aided with machine learning methods has become a popular method to assist doctors in diagnosing and treating patients. However, machine learning methods are greatly affected by their hyperparameters, for instance, the kernel parameter in kernel extreme learning machine (KELM) and the learning rate in residual neural networks (ResNet). If the hyperparameters are appropriately set, the performance of the classifier can be significantly improved. To boost the performance of the machine learning methods, this paper proposes to improve the Runge Kutta optimizer (RUN) to adaptively adjust the hyperparameters of the machine learning methods for medical diagnosis purposes. Although RUN has a solid mathematical theoretical foundation, there are still some performance defects when dealing with complex optimization problems. To remedy these defects, this paper proposes a new enhanced RUN method with a grey wolf mechanism and an orthogonal learning mechanism called GORUN. The superior performance of the GORUN was validated against other well-established optimizers on IEEE CEC 2017 benchmark functions. Then, the proposed GORUN is employed to optimize the machine learning models, including the KELM and ResNet, to construct robust models for medical diagnosis. The performance of the proposed machine learning framework was validated on several medical data sets, and the experimental results have demonstrated its superiority.
Collapse
Affiliation(s)
- Zenglin Qiao
- School of Science, Beijing University of Posts and Telecommunications, Beijing, 100876, China.
| | - Lynn Li
- China Telecom Stocks Co.,Ltd., Hangzhou Branch, Hangzhou, 310000, China.
| | - Xinchao Zhao
- School of Science, Beijing University of Posts and Telecommunications, Beijing, 100876, China.
| | - Lei Liu
- College of Computer Science, Sichuan University, Chengdu, Sichuan, 610065, China.
| | - Qian Zhang
- School of Data Science and Artificial Intelligence, Wenzhou University of Technology, Wenzhou, Zhejiang, 325035, China.
| | - Shili Hechmi
- Dept. Computer Sciences, Tabuk University, Tabuk, Saudi Arabia.
| | - Mohamed Atri
- College of Computer Science, King Khalid University, Abha, Saudi Arabia.
| | - Xiaohua Li
- Library, Wenzhou University, Wenzhou, Zhejiang, 325035, China.
| |
Collapse
|
25
|
Yang S, Lou L, Wang W, Li J, Jin X, Wang S, Cai J, Kuang F, Liu L, Hadjouni M, Elmannai H, Cai C. Pneumothorax prediction using a foraging and hunting based ant colony optimizer assisted support vector machine. Comput Biol Med 2023; 161:106948. [PMID: 37207406 DOI: 10.1016/j.compbiomed.2023.106948] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2023] [Revised: 04/04/2023] [Accepted: 04/15/2023] [Indexed: 05/21/2023]
Abstract
Although PNLB is generally considered safe, it is still invasive and risky. Pneumothorax, the most common complication of lung puncture, can cause shortness of breath, chest pain, and even life-threatening. Therefore, the auxiliary diagnosis for pneumothorax is of great clinical interest. This paper proposes an ant colony optimizer with slime mould foraging behavior and collaborative hunting, called SCACO, in which slime mould foraging behavior is combined to improve the convergence accuracy and solution quality of ACOR. Then the ability of ACO to jump out of the local optimum is optimized by an adaptive collaborative hunting strategy when trapped in the local optimum. As a first step toward Pneumothorax diagnostic prediction, we suggested an SVM classifier based on bSCACO (bSCACO-SVM), which uses the proposed SCACO's binary version as the basis for its feature selection algorithms. To demonstrate the SCACO performance, we first used the slime mould foraging behavior and adaptive cooperative hunting strategy, then compared SCACO with nine basic algorithms and nine variants, respectively. Finally, we verified bSCACO-SVM on various widely used public datasets and applied it to the Pneumothorax prediction issue, showing that it has robust classification prediction capacity and can be successfully employed for tuberculous pleural effusion diagnostic prediction.
Collapse
Affiliation(s)
- Song Yang
- Department of Respiratory and Critical Care Medicine, The First Affiliated Hospital, Wenzhou Medical University, Wenzhou, China.
| | - Lejing Lou
- Department of Respiratory and Critical Care Medicine, The First Affiliated Hospital, Wenzhou Medical University, Wenzhou, China.
| | - Wangjia Wang
- Department of Respiratory and Critical Care Medicine, The First Affiliated Hospital, Wenzhou Medical University, Wenzhou, China.
| | - Jie Li
- Key Laboratory of Diagnosis and Treatment of Severe Hepato-Pancreatic Diseases of Zhejiang Province, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, 325000, China.
| | - Xiao Jin
- Department of Respiratory and Critical Care Medicine, The First Affiliated Hospital, Wenzhou Medical University, Wenzhou, China.
| | - Shijia Wang
- Department of Respiratory and Critical Care Medicine, The First Affiliated Hospital, Wenzhou Medical University, Wenzhou, China.
| | - Jihao Cai
- Wenzhou Medical University Renji College, Wenzhou, China.
| | - Fangjun Kuang
- School of Information Engineering, Wenzhou Business College, Wenzhou, 325035, China.
| | - Lei Liu
- College of Computer Science, Sichuan University, Chengdu, Sichuan, 610065, China.
| | - Myriam Hadjouni
- Department of Computer Sciences, College of Computer and Information Science, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh, 11671, Saudi Arabia.
| | - Hela Elmannai
- Department of Information Technology, College of Computer and Information Science, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh, 11671, Saudi Arabia.
| | - Chang Cai
- Department of Respiratory and Critical Care Medicine, The First Affiliated Hospital, Wenzhou Medical University, Wenzhou, China.
| |
Collapse
|
26
|
Shi M, Chen C, Liu L, Kuang F, Zhao D, Chen X. A grade-based search adaptive random slime mould optimizer for lupus nephritis image segmentation. Comput Biol Med 2023; 160:106950. [PMID: 37120988 DOI: 10.1016/j.compbiomed.2023.106950] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Revised: 04/04/2023] [Accepted: 04/15/2023] [Indexed: 05/02/2023]
Abstract
The segmentation of medical images is a crucial and demanding step in medical image processing that offers a solid foundation for subsequent extraction and analysis of medical image data. Although multi-threshold image segmentation is the most used and specialized basic image segmentation technique, it is computationally demanding and often produces subpar segmentation results, hence restricting its application. To solve this issue, this work develops a multi-strategy-driven slime mould algorithm (RWGSMA) for multi-threshold image segmentation. Specifically, the random spare strategy, the double adaptive weigh strategy, and the grade-based search strategy are used to improve the performance of SMA, resulting in an enhanced SMA version. The random spare strategy is mainly used to accelerate the convergence rate of the algorithm. To prevent SMA from falling towards the local optimum, the double adaptive weights are also applied. The grade-based search approach has also been developed to boost convergence performance. This study evaluates the efficacy of RWGSMA from many viewpoints using 30 test suites from IEEE CEC2017 to effectively demonstrate the importance of these techniques in RWGSMA. In addition, numerous typical images were used to show RWGSMA's segmentation performance. Using the multi-threshold segmentation approach with 2D Kapur's entropy as the RWGSMA fitness function, the suggested algorithm was then used to segment instances of lupus nephritis. The experimental findings demonstrate that the suggested RWGSMA beats numerous similar rivals, suggesting that it has a great deal of promise for segmenting histopathological images.
Collapse
Affiliation(s)
- Manrong Shi
- Department of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou, 325035, China.
| | - Chi Chen
- Wenzhou University of Technology, Wenzhou, 325035, China.
| | - Lei Liu
- College of Computer Science, Sichuan University, Chengdu, Sichuan, 610065, China.
| | - Fangjun Kuang
- School of Information engineering, Wenzhou Business College, Wenzhou, 325035, China.
| | - Dong Zhao
- College of Computer Science and Technology, Changchun Normal University, Changchun, Jilin, 130032, China.
| | - Xiaowei Chen
- Department of Rheumatology and Immunology, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, 325000, China.
| |
Collapse
|
27
|
Zhao X, Liu L, Heidari AA, Chen Y, Ma BJ, Chen H, Quan S. An enhanced ant colony optimizer with Cauchy-Gaussian fusion and novel movement strategy for multi-threshold COVID-19 X-ray image segmentation. Front Neuroinform 2023; 17:1126783. [PMID: 37006638 PMCID: PMC10064065 DOI: 10.3389/fninf.2023.1126783] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2022] [Accepted: 02/16/2023] [Indexed: 03/19/2023] Open
Abstract
The novel coronavirus pneumonia (COVID-19) is a respiratory disease of great concern in terms of its dissemination and severity, for which X-ray imaging-based diagnosis is one of the effective complementary diagnostic methods. It is essential to be able to separate and identify lesions from their pathology images regardless of the computer-aided diagnosis techniques. Therefore, image segmentation in the pre-processing stage of COVID-19 pathology images would be more helpful for effective analysis. In this paper, to achieve highly effective pre-processing of COVID-19 pathological images by using multi-threshold image segmentation (MIS), an enhanced version of ant colony optimization for continuous domains (MGACO) is first proposed. In MGACO, not only a new move strategy is introduced, but also the Cauchy-Gaussian fusion strategy is incorporated. It has been accelerated in terms of convergence speed and has significantly enhanced its ability to jump out of the local optimum. Furthermore, an MIS method (MGACO-MIS) based on MGACO is developed, where it applies the non-local means, 2D histogram as the basis, and employs 2D Kapur’s entropy as the fitness function. To demonstrate the performance of MGACO, we qualitatively analyze it in detail and compare it with other peers on 30 benchmark functions from IEEE CEC2014, which proves that it has a stronger capability of solving problems over the original ant colony optimization for continuous domains. To verify the segmentation effect of MGACO-MIS, we conducted a comparison experiment with eight other similar segmentation methods based on real pathology images of COVID-19 at different threshold levels. The final evaluation and analysis results fully demonstrate that the developed MGACO-MIS is sufficient to obtain high-quality segmentation results in the COVID-19 image segmentation and has stronger adaptability to different threshold levels than other methods. Therefore, it has been well-proven that MGACO is an excellent swarm intelligence optimization algorithm, and MGACO-MIS is also an excellent segmentation method.
Collapse
Affiliation(s)
- Xiuzhi Zhao
- College of Artificial Intelligence, Zhejiang Industry & Trade Vocational College, Wenzhou, Zhejiang, China
| | - Lei Liu
- College of Computer Science, Sichuan University, Chengdu, Sichuan, China
- *Correspondence: Lei Liu,
| | - Ali Asghar Heidari
- School of Surveying and Geospatial Engineering, College of Engineering, University of Tehran, Tehran, Iran
| | - Yi Chen
- Institute of Big Data and Information Technology, Wenzhou University, Wenzhou, China
| | - Benedict Jun Ma
- Department of Industrial and Manufacturing Systems Engineering, The University of Hong Kong, Hong Kong, Hong Kong SAR, China
| | - Huiling Chen
- Institute of Big Data and Information Technology, Wenzhou University, Wenzhou, China
- Huiling Chen,
| | - Shichao Quan
- Department of Big Data in Health Science, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, China
- Key Laboratory of Intelligent Treatment and Life Support for Critical Diseases of Zhejiang Province, Wenzhou, China
- Zhejiang Engineering Research Center for Hospital Emergency and Process Digitization, Wenzhou, China
- Shichao Quan,
| |
Collapse
|
28
|
Tang W, Deng H, Yin S. CPMF-Net: Multi-Feature Network Based on Collaborative Patches for Retinal Vessel Segmentation. SENSORS (BASEL, SWITZERLAND) 2022; 22:9210. [PMID: 36501911 PMCID: PMC9736046 DOI: 10.3390/s22239210] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Revised: 11/23/2022] [Accepted: 11/24/2022] [Indexed: 06/17/2023]
Abstract
As an important basis of clinical diagnosis, the morphology of retinal vessels is very useful for the early diagnosis of some eye diseases. In recent years, with the rapid development of deep learning technology, automatic segmentation methods based on it have made considerable progresses in the field of retinal blood vessel segmentation. However, due to the complexity of vessel structure and the poor quality of some images, retinal vessel segmentation, especially the segmentation of Capillaries, is still a challenging task. In this work, we propose a new retinal blood vessel segmentation method, called multi-feature segmentation, based on collaborative patches. First, we design a new collaborative patch training method which effectively compensates for the pixel information loss in the patch extraction through information transmission between collaborative patches. Additionally, the collaborative patch training strategy can simultaneously have the characteristics of low occupancy, easy structure and high accuracy. Then, we design a multi-feature network to gather a variety of information features. The hierarchical network structure, together with the integration of the adaptive coordinate attention module and the gated self-attention module, enables these rich information features to be used for segmentation. Finally, we evaluate the proposed method on two public datasets, namely DRIVE and STARE, and compare the results of our method with those of other nine advanced methods. The results show that our method outperforms other existing methods.
Collapse
|
29
|
Nijiati M, Tuersun A, Zhang Y, Yuan Q, Gong P, Abulizi A, Tuoheti A, Abulaiti A, Zou X. A symmetric prior knowledge based deep learning model for intracerebral hemorrhage lesion segmentation. Front Physiol 2022; 13:977427. [PMID: 36505076 PMCID: PMC9727183 DOI: 10.3389/fphys.2022.977427] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2022] [Accepted: 11/11/2022] [Indexed: 11/24/2022] Open
Abstract
Background: Accurate localization and classification of intracerebral hemorrhage (ICH) lesions are of great significance for the treatment and prognosis of patients with ICH. The purpose of this study is to develop a symmetric prior knowledge based deep learning model to segment ICH lesions in computed tomography (CT). Methods: A novel symmetric Transformer network (Sym-TransNet) is designed to segment ICH lesions in CT images. A cohort of 1,157 patients diagnosed with ICH is established to train (n = 857), validate (n = 100), and test (n = 200) the Sym-TransNet. A healthy cohort of 200 subjects is added, establishing a test set with balanced positive and negative cases (n = 400), to further evaluate the accuracy, sensitivity, and specificity of the diagnosis of ICH. The segmentation results are obtained after data pre-processing and Sym-TransNet. The DICE coefficient is used to evaluate the similarity between the segmentation results and the segmentation gold standard. Furthermore, some recent deep learning methods are reproduced to compare with Sym-TransNet, and statistical analysis is performed to prove the statistical significance of the proposed method. Ablation experiments are conducted to prove that each component in Sym-TransNet could effectively improve the DICE coefficient of ICH lesions. Results: For the segmentation of ICH lesions, the DICE coefficient of Sym-TransNet is 0.716 ± 0.031 in the test set which contains 200 CT images of ICH. The DICE coefficients of five subtypes of ICH, including intraparenchymal hemorrhage (IPH), intraventricular hemorrhage (IVH), extradural hemorrhage (EDH), subdural hemorrhage (SDH), and subarachnoid hemorrhage (SAH), are 0.784 ± 0.039, 0.680 ± 0.049, 0.359 ± 0.186, 0.534 ± 0.455, and 0.337 ± 0.044, respectively. Statistical results show that the proposed Sym-TransNet can significantly improve the DICE coefficient of ICH lesions in most cases. In addition, the accuracy, sensitivity, and specificity of Sym-TransNet in the diagnosis of ICH in 400 CT images are 91.25%, 98.50%, and 84.00%, respectively. Conclusion: Compared with recent mainstream deep learning methods, the proposed Sym-TransNet can segment and identify different types of lesions from CT images of ICH patients more effectively. Moreover, the Sym-TransNet can diagnose ICH more stably and efficiently, which has clinical application prospects.
Collapse
Affiliation(s)
- Mayidili Nijiati
- Department of Radiology, The First People’s Hospital of Kashi Prefecture, Kashi, China
| | - Abudouresuli Tuersun
- Department of Radiology, The First People’s Hospital of Kashi Prefecture, Kashi, China
| | | | - Qing Yuan
- Department of Radiology, The First People’s Hospital of Kashi Prefecture, Kashi, China
| | | | | | - Awanisa Tuoheti
- Department of Radiology, The First People’s Hospital of Kashi Prefecture, Kashi, China
| | - Adili Abulaiti
- Department of Radiology, The First People’s Hospital of Kashi Prefecture, Kashi, China,*Correspondence: Adili Abulaiti, ; Xiaoguang Zou,
| | - Xiaoguang Zou
- Clinical Medical Research Center, The First People’s Hospital of Kashi Prefecture, Kashi, China,*Correspondence: Adili Abulaiti, ; Xiaoguang Zou,
| |
Collapse
|
30
|
DGRUnit: Dual graph reasoning unit for brain tumor segmentation. Comput Biol Med 2022; 149:106079. [PMID: 36108413 DOI: 10.1016/j.compbiomed.2022.106079] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2022] [Revised: 08/27/2022] [Accepted: 09/03/2022] [Indexed: 11/20/2022]
Abstract
Many fully automatic segmentation models have been created to solve the difficulty of brain tumor segmentation, thanks to the rapid growth of deep learning. However, few approaches focus on the long-range relationships and contextual interdependence in multimodal Magnetic Resonance (MR) images. In this paper, we propose a novel approach for brain tumor segmentation called the dual graph reasoning unit (DGRUnit). Two parallel graph reasoning modules are included in our proposed method: a spatial reasoning module and a channel reasoning module. The spatial reasoning module models the long-range spatial dependencies between distinct regions in an image using a graph convolutional network (GCN). The channel reasoning module uses a graph attention network (GAT) to model the rich contextual interdependencies between different channels with similar semantic representations. Our experimental results clearly demonstrate the superior performance of the proposed DGRUnit. The ablation study shows the flexibility and generalizability of our model, which can be easily integrated into a wide range of neural networks and further improve them. When compared to several state-of-the-art methods, experimental results show that the proposed approach significantly improves both visual inspection and quantitative metrics for brain tumor segmentation tasks.
Collapse
|