1
Chen C, Chen Y, Li X, Ning H, Xiao R. Linear semantic transformation for semi-supervised medical image segmentation. Comput Biol Med 2024; 173:108331. [PMID: 38522252 DOI: 10.1016/j.compbiomed.2024.108331] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/29/2024] [Revised: 02/29/2024] [Accepted: 03/17/2024] [Indexed: 03/26/2024]
Abstract
Medical image segmentation is a research focus and a foundation for developing intelligent medical systems. Recently, deep learning has become a standard approach to medical image segmentation and has achieved significant success, promoting the development of reconstruction and surgical planning in disease diagnosis. However, semantic learning is often inefficient owing to the lack of supervision of feature maps, so high-quality segmentation models always rely on numerous and accurate data annotations. Learning robust semantic representations in latent spaces remains a challenge. In this paper, we propose a novel semi-supervised learning framework to learn vital attributes in medical images, which constructs a generalized representation from diverse semantics to realize medical image segmentation. We first build a self-supervised learning part that achieves context recovery by reconstructing the space and intensity of medical images, which provides semantic representation for the feature maps. Subsequently, we combine the semantic-rich feature maps and use a simple linear semantic transformation to convert them into an image segmentation. The proposed framework was tested on five medical segmentation datasets. Quantitative assessments indicate that our method achieves the highest scores on the IXI (73.78%), ScaF (47.50%), COVID-19-Seg (50.72%), PC-Seg (65.06%), and Brain-MR (72.63%) datasets. Finally, we compared our method with the latest semi-supervised learning methods and obtained DSC values of 77.15% and 75.22%, ranking first on two representative datasets. The experimental results not only show that the proposed linear semantic transformation can be applied effectively to medical image segmentation, but also demonstrate its simplicity and ease of use in pursuing robust segmentation in semi-supervised learning. Our code is available at: https://github.com/QingYunA/Linear-Semantic-Transformation-for-Semi-Supervised-Medical-Image-Segmentation.
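The abstract does not spell out the form of the linear semantic transformation, so the following is only a minimal sketch under an assumption: that feature maps are concatenated along the channel axis and mapped to class scores by one per-pixel linear layer (equivalent to a 1x1 convolution). The function name, array shapes, and random inputs are hypothetical and are not taken from the authors' repository.

```python
import numpy as np

def linear_semantic_transform(feature_maps, weights, bias):
    """Map stacked feature maps to per-pixel class labels with one linear layer.

    feature_maps: list of arrays, each of shape (C_i, H, W)
    weights:      array of shape (num_classes, sum of C_i)
    bias:         array of shape (num_classes,)
    """
    # Combine the semantic-rich feature maps along the channel axis.
    stacked = np.concatenate(feature_maps, axis=0)        # (C, H, W)
    # Apply the same linear map at every pixel (a 1x1 convolution).
    logits = np.einsum("kc,chw->khw", weights, stacked)   # (K, H, W)
    logits = logits + bias[:, None, None]
    # Pick the highest-scoring class per pixel.
    return logits.argmax(axis=0)                          # (H, W)

# Hypothetical inputs: two groups of feature maps and random weights.
rng = np.random.default_rng(0)
spatial_feats = rng.normal(size=(4, 8, 8))
intensity_feats = rng.normal(size=(3, 8, 8))
W = rng.normal(size=(2, 7))
b = np.zeros(2)
seg = linear_semantic_transform([spatial_feats, intensity_feats], W, b)
print(seg.shape)
```

In a trained model the weights would be learned from the labeled subset; here they are random and only illustrate the shapes involved.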
Affiliation(s)
- Cheng Chen
  - School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, 100083, China
- Yunqing Chen
  - School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, 100083, China
- Xiaoheng Li
  - School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, 100083, China
- Huansheng Ning
  - School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, 100083, China
- Ruoxiu Xiao
  - School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, 100083, China; Shunde Innovation School, University of Science and Technology Beijing, Foshan, 100024, China.
2
Zhou L, Wu H, Luo G, Zhou H. Deep learning-based 3D cerebrovascular segmentation workflow on bright and black blood sequences magnetic resonance angiography. Insights Imaging 2024; 15:81. [PMID: 38517610 PMCID: PMC10959883 DOI: 10.1186/s13244-024-01657-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/11/2023] [Accepted: 02/18/2024] [Indexed: 03/24/2024] Open
Abstract
BACKGROUND Cerebrovascular diseases have emerged as significant threats to human life and health. Effectively segmenting brain blood vessels has become a crucial scientific challenge. We aimed to develop a fully automated deep learning workflow that achieves accurate 3D segmentation of cerebral blood vessels by incorporating classic convolutional neural networks (CNNs) and transformer models. METHODS We used a public cerebrovascular segmentation dataset (CSD) containing 45 volumes of 1.5 T time-of-flight magnetic resonance angiography images. We collected data from another private middle cerebral artery (MCA) with lenticulostriate artery (LSA) segmentation dataset (MLD), which encompassed 3.0 T three-dimensional T1-weighted sequences of volumetric isotropic turbo spin echo acquisition MRI images of 107 patients aged 62 ± 11 years (42 females). The workflow includes data analysis, preprocessing, augmentation, model training with validation, and postprocessing techniques. Brain vessels were segmented using the U-Net, V-Net, UNETR, and SwinUNETR models. The model performances were evaluated using the dice similarity coefficient (DSC), average surface distance (ASD), precision (PRE), sensitivity (SEN), and specificity (SPE). RESULTS During 4-fold cross-validation, SwinUNETR obtained the highest DSC in each fold. On the CSD test set, SwinUNETR achieved the best DSC (0.853), PRE (0.848), SEN (0.860), and SPE (0.9996), while V-Net achieved the best ASD (0.99). On the MLD test set, SwinUNETR demonstrated good MCA segmentation performance and had the best DSC, ASD, PRE, and SPE for segmenting the LSA. CONCLUSIONS The workflow demonstrated excellent performance on different sequences of MRI images for vessels of varying sizes. This method allows doctors to visualize cerebrovascular structures. 
CRITICAL RELEVANCE STATEMENT A deep learning-based 3D cerebrovascular segmentation workflow is feasible and promising for visualizing cerebrovascular structures and monitoring cerebral small vessels, such as lenticulostriate arteries. KEY POINTS • The proposed deep learning-based workflow performs well in cerebrovascular segmentation tasks. • Among comparison models, SwinUNETR achieved the best DSC, ASD, PRE, and SPE values in lenticulostriate artery segmentation. • The proposed workflow can be used for different MR sequences, such as bright and black blood imaging.
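As a rough illustration of the overlap-based metrics named in this abstract (DSC, PRE, SEN, SPE), the sketch below computes them from binary masks using their standard confusion-matrix definitions. The function name and the toy masks are hypothetical; average surface distance is omitted because it needs surface extraction, and the code assumes neither mask is empty (otherwise a division by zero occurs).

```python
import numpy as np

def binary_segmentation_metrics(pred, truth):
    """Return DSC, precision, sensitivity and specificity for two binary masks."""
    pred = np.asarray(pred, dtype=bool)
    truth = np.asarray(truth, dtype=bool)
    tp = np.sum(pred & truth)      # vessel voxels correctly predicted
    fp = np.sum(pred & ~truth)     # background predicted as vessel
    fn = np.sum(~pred & truth)     # vessel voxels that were missed
    tn = np.sum(~pred & ~truth)    # background correctly predicted
    return {
        "DSC": 2 * tp / (2 * tp + fp + fn),
        "PRE": tp / (tp + fp),
        "SEN": tp / (tp + fn),
        "SPE": tn / (tn + fp),
    }

# Toy 1D example: 3 true vessel voxels, 3 predicted, 2 overlapping.
truth = [1, 1, 1, 0, 0, 0, 0, 0]
pred = [1, 1, 0, 1, 0, 0, 0, 0]
metrics = binary_segmentation_metrics(pred, truth)
print(metrics)
```

Because vessels occupy a small fraction of the volume, specificity is dominated by the many true-negative background voxels, which is consistent with the very high SPE value reported above.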
Affiliation(s)
- Langtao Zhou
  - School of Cyberspace Security, Guangzhou University, Guangzhou, 510006, China
  - Department of Radiology of the First Affiliated Hospital of the University of South China, Hengyang, 421001, China
- Huiting Wu
  - Department of Radiology of the First Affiliated Hospital of the University of South China, Hengyang, 421001, China
- Guanghua Luo
  - Department of Radiology of the First Affiliated Hospital of the University of South China, Hengyang, 421001, China.
- Hong Zhou
  - Department of Radiology of the First Affiliated Hospital of the University of South China, Hengyang, 421001, China.
3
Zhang C, Zhao M, Xie Y, Ding R, Ma M, Guo K, Jiang H, Xi W, Xia L. TL-MSE2-Net: Transfer learning based nested model for cerebrovascular segmentation with aneurysms. Comput Biol Med 2023; 167:107609. [PMID: 37883854 DOI: 10.1016/j.compbiomed.2023.107609] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/17/2022] [Revised: 10/11/2023] [Accepted: 10/17/2023] [Indexed: 10/28/2023]
Abstract
Cerebrovascular (i.e., cerebral vessel) segmentation is essential for diagnosing and treating brain diseases. Convolutional neural network models, such as U-Net, are commonly used for this purpose. Unfortunately, such models may not be entirely satisfactory in dealing with cerebrovascular segmentation with tumors due to the following issues: (1) a relatively small number of clinical datasets from patients obtained through different modalities such as computed tomography (CT) and magnetic resonance imaging (MRI), leading to inadequate training and lack of transferability in the modeling; (2) insufficient feature extraction caused by less attention to both convolution sizes and cerebral vessel edges. Inspired by the existence of similar features on cerebral vessels between normal subjects and patients, we propose a transfer learning strategy based on a pre-trained nested model called TL-MSE2-Net. This model uses one of the publicly available datasets for cerebrovascular segmentation with aneurysms. To address issue (1), our transfer learning strategy leverages a pre-trained model that uses a large number of datasets from normal subjects, providing a potential solution to the lack of sufficient clinical datasets. To tackle issue (2), we structure the pre-trained model based on 3D U-Net, comprising three blocks: ResMul, DeRes, and REAM. The ResMul and DeRes blocks enhance feature extraction by utilizing multiple convolution sizes to capture multiscale features, and the REAM block increases the weight of the voxels on the edges of the given 3D volume. We evaluated the proposed model on one small private clinical dataset and two publicly available datasets. The experimental results demonstrated that our MSE2-Net framework achieved an average Dice score of 70.81% and 89.08% on the two publicly available datasets, outperforming other state-of-the-art methods. Ablation studies were also conducted to validate the effectiveness of each block.
The proposed TL-MSE2-Net yielded better results than MSE2-Net on a small private clinical dataset, with increases of 5.52%, 3.37%, 6.71%, and 0.85% for the Dice score, sensitivity, Jaccard index, and precision, respectively.
Affiliation(s)
- Chaoran Zhang
  - Laboratory of Neural Computing and Intelligent Perception (NCIP), Capital Normal University, Beijing, 100048, China
- Ming Zhao
  - Department of Neurosurgery, First Medical Center, Chinese PLA General Hospital, Beijing, 100853, China
- Yixuan Xie
  - Laboratory of Neural Computing and Intelligent Perception (NCIP), Capital Normal University, Beijing, 100048, China
- Rui Ding
  - Laboratory of Neural Computing and Intelligent Perception (NCIP), Capital Normal University, Beijing, 100048, China
- Ming Ma
  - Department of Computer Science, Winona State University, Winona, MN, 55987, USA
- Kaiwen Guo
  - Laboratory of Neural Computing and Intelligent Perception (NCIP), Capital Normal University, Beijing, 100048, China
- Hongzhen Jiang
  - Department of Neurosurgery, First Medical Center, Chinese PLA General Hospital, Beijing, 100853, China
- Wei Xi
  - Department of Radiology, Fourth Medical Center, Chinese PLA General Hospital, Beijing, 100048, China
- Likun Xia
  - Laboratory of Neural Computing and Intelligent Perception (NCIP), Capital Normal University, Beijing, 100048, China.
4
Zeng X, Guo Y, Zaman A, Hassan H, Lu J, Xu J, Yang H, Miao X, Cao A, Yang Y, Chen R, Kang Y. Tubular Structure Segmentation via Multi-Scale Reverse Attention Sparse Convolution. Diagnostics (Basel) 2023; 13:2161. [PMID: 37443556 DOI: 10.3390/diagnostics13132161] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/24/2023] [Revised: 06/17/2023] [Accepted: 06/20/2023] [Indexed: 07/15/2023] Open
Abstract
Cerebrovascular and airway structures are tubular structures used for transporting blood and gases, respectively, providing essential support for the normal activities of the human body. Accurately segmenting these tubular structures is the basis of morphology research and pathological detection. Nevertheless, accurately segmenting these structures from images presents great challenges due to their complex morphological and topological characteristics. To address this challenge, this paper proposes a framework UARAI based on the U-Net multi-scale reverse attention network and sparse convolution network. The framework utilizes a multi-scale structure to effectively extract the global and deep detail features of vessels and airways. Further, it enhances the extraction ability of fine-edged features by a joint reverse attention module. In addition, the sparse convolution structure is introduced to improve the features' expression ability without increasing the model's complexity. Finally, the proposed training sample cropping strategy reduces the influence of block boundaries on the accuracy of tubular structure segmentation. The experimental findings demonstrate that the UARAI-based metrics, namely Dice and IoU, achieve impressive scores of 90.31% and 82.33% for cerebrovascular segmentation and 93.34% and 87.51% for airway segmentation, respectively. Compared to commonly employed segmentation techniques, the proposed method exhibits remarkable accuracy and robustness in delineating tubular structures such as cerebrovascular and airway structures. These results hold significant promise in facilitating medical image analysis and clinical diagnosis, offering invaluable support to healthcare professionals.
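The Dice and IoU figures quoted above are two views of the same overlap: for a single pair of masks, IoU = Dice / (2 - Dice). The short sketch below applies that identity to the reported Dice scores. The function name is hypothetical, and the identity is exact only for one pair of masks; values averaged over several cases need not satisfy it exactly, although the reported pairs do agree with it to two decimals.

```python
def dice_to_iou(dice):
    """Convert a Dice coefficient in [0, 1] to the equivalent IoU (Jaccard index)."""
    # Dice = 2|A∩B| / (|A| + |B|) and IoU = |A∩B| / (|A| + |B| - |A∩B|),
    # which gives IoU = Dice / (2 - Dice).
    return dice / (2.0 - dice)

cerebrovascular_iou = dice_to_iou(0.9031)
airway_iou = dice_to_iou(0.9334)
print(f"{cerebrovascular_iou:.4f} {airway_iou:.4f}")
```

Rounded to two decimals as percentages, these come out to 82.33% and 87.51%, matching the IoU values in the abstract.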
Affiliation(s)
- Xueqiang Zeng
  - School of Applied Technology, Shenzhen University, Shenzhen 518060, China
  - College of Health Science and Environmental Engineering, Shenzhen Technology University, Shenzhen 518118, China
- Yingwei Guo
  - College of Health Science and Environmental Engineering, Shenzhen Technology University, Shenzhen 518118, China
  - College of Medicine and Biological Information Engineering, Northeastern University, Shenyang 110169, China
- Asim Zaman
  - College of Health Science and Environmental Engineering, Shenzhen Technology University, Shenzhen 518118, China
  - School of Biomedical Engineering, Medical School, Shenzhen University, Shenzhen 518060, China
- Haseeb Hassan
  - College of Health Science and Environmental Engineering, Shenzhen Technology University, Shenzhen 518118, China
- Jiaxi Lu
  - School of Applied Technology, Shenzhen University, Shenzhen 518060, China
  - College of Health Science and Environmental Engineering, Shenzhen Technology University, Shenzhen 518118, China
- Jiaxuan Xu
  - State Key Laboratory of Respiratory Disease, National Center for Respiratory Medicine, National Clinical Research Center for Respiratory Disease, First Affiliated Hospital, Guangzhou Medical University, Guangzhou 510120, China
- Huihui Yang
  - School of Applied Technology, Shenzhen University, Shenzhen 518060, China
  - College of Health Science and Environmental Engineering, Shenzhen Technology University, Shenzhen 518118, China
- Xiaoqiang Miao
  - College of Health Science and Environmental Engineering, Shenzhen Technology University, Shenzhen 518118, China
  - College of Medicine and Biological Information Engineering, Northeastern University, Shenyang 110169, China
- Anbo Cao
  - School of Applied Technology, Shenzhen University, Shenzhen 518060, China
  - College of Health Science and Environmental Engineering, Shenzhen Technology University, Shenzhen 518118, China
- Yingjian Yang
  - College of Health Science and Environmental Engineering, Shenzhen Technology University, Shenzhen 518118, China
  - College of Medicine and Biological Information Engineering, Northeastern University, Shenyang 110169, China
- Rongchang Chen
  - Shenzhen Institute of Respiratory Diseases, Shenzhen People's Hospital, Shenzhen 518001, China
  - The Second Clinical Medical College, Jinan University, Guangzhou 518001, China
  - The First Affiliated Hospital, Southern University of Science and Technology, Shenzhen 518001, China
- Yan Kang
  - School of Applied Technology, Shenzhen University, Shenzhen 518060, China
  - College of Health Science and Environmental Engineering, Shenzhen Technology University, Shenzhen 518118, China
  - College of Medicine and Biological Information Engineering, Northeastern University, Shenyang 110169, China
  - Engineering Research Centre of Medical Imaging and Intelligent Analysis, Ministry of Education, Shenyang 110169, China
5
Chen C, Zhou K, Lu T, Ning H, Xiao R. Integration- and separation-aware adversarial model for cerebrovascular segmentation from TOF-MRA. Comput Methods Programs Biomed 2023; 233:107475. [PMID: 36931018 DOI: 10.1016/j.cmpb.2023.107475] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/17/2022] [Revised: 03/08/2023] [Accepted: 03/09/2023] [Indexed: 06/18/2023]
Abstract
PURPOSE Cerebrovascular segmentation from time-of-flight magnetic resonance angiography (TOF-MRA) is important but challenging for the simulation and measurement of cerebrovascular diseases. Recently, deep learning has promoted the rapid development of cerebrovascular segmentation. However, model optimization relies on voxel-wise or regional penalties and lacks global awareness and interpretation of texture and edges. To overcome the limitations of the existing methods, we propose a new cerebrovascular segmentation method to obtain more refined structures. METHODS In this paper, we propose a new adversarial model that achieves segmentation using a segmentation model and filters the results using a discriminator. Considering the sample imbalance in cerebrovascular imaging, we separated the TOF-MRA images and utilized high- and low-frequency images to enhance the texture and edge representation. Sharing the encoder weights of the segmentation model not only saves model parameters, but also strengthens the integration and separation correlation. Diversified discrimination enhances the robustness and regularization of the model. RESULTS The adversarial model was tested on two cerebrovascular datasets. It scored 82.26% and 73.38%, respectively, ranking first on both datasets. The results show that our method not only outperforms recent cerebrovascular segmentation models, but also surpasses common adversarial models. CONCLUSION Our adversarial model focuses on improving the ability of the model to extract texture and edges, thereby achieving awareness of the global cerebrovascular topology. Therefore, we obtained an accurate and robust cerebrovascular segmentation. This framework has potential applications in many imaging fields, particularly in applications with sample imbalance. Our code is available at https://github.com/MontaEllis/ISA-model.
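The abstract does not say how the TOF-MRA images are separated into high- and low-frequency components. One simple way to picture such a split, shown purely as an assumption, is an ideal low-pass filter in the Fourier domain, with the high-frequency image taken as the residual. The function name, the 2D input, and the `cutoff` value are hypothetical and are not taken from the ISA-model repository.

```python
import numpy as np

def split_frequencies(image, cutoff=0.1):
    """Split a 2D image into low- and high-frequency parts.

    cutoff is a radius in cycles per pixel (0 to about 0.7); frequencies
    at or below it go to the low-frequency image.
    """
    fy = np.fft.fftfreq(image.shape[0])[:, None]
    fx = np.fft.fftfreq(image.shape[1])[None, :]
    low_pass = np.sqrt(fx ** 2 + fy ** 2) <= cutoff
    # Keep only the low frequencies, then transform back to image space.
    low = np.fft.ifft2(np.fft.fft2(image) * low_pass).real
    # The residual carries edges and fine texture.
    high = image - low
    return low, high

rng = np.random.default_rng(0)
slice_2d = rng.random((32, 32))
low, high = split_frequencies(slice_2d)
print(low.shape, high.shape)
```

By construction the two parts sum back to the input, so the split loses no information; a smoother filter (for example a Gaussian) would reduce ringing at the cost of a less sharp cutoff.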
Affiliation(s)
- Cheng Chen
  - School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, China
- Kangneng Zhou
  - School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, China
- Tong Lu
  - Visual 3D Medical Science and Technology Development, Co. Ltd, Beijing 100082, China
- Huansheng Ning
  - School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, China
- Ruoxiu Xiao
  - School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, China; Shunde Innovation School, University of Science and Technology Beijing, Foshan 100024, China.
6
Sun Q, Yang J, Zhao S, Chen C, Hou Y, Yuan Y, Ma S, Huang Y. LIVE-Net: Comprehensive 3D vessel extraction framework in CT angiography. Comput Biol Med 2023; 159:106886. [PMID: 37062255 DOI: 10.1016/j.compbiomed.2023.106886] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/15/2022] [Revised: 03/04/2023] [Accepted: 04/01/2023] [Indexed: 04/18/2023]
Abstract
The extraction of vessels from computed tomography angiography (CTA) is significant in diagnosing and evaluating vascular diseases. However, due to the anatomical complexity, wide intensity distribution, and small volume proportion of vessels, vessel extraction is laborious and time-consuming, and it can easily lead to error-prone diagnostic results in clinical practice. This study proposes a novel comprehensive vessel extraction framework, called the Local Iterative-based Vessel Extraction Network (LIVE-Net), to achieve 3D vessel segmentation while tracking vessel centerlines. LIVE-Net contains dual dataflow pathways that work alternately: an iterative tracking network and a local segmentation network. The former generates fine-grained direction and radius predictions for a vascular patch by using the attention-embedded atrous pyramid network (aAPN), and the latter achieves 3D vascular lumen segmentation by constructing the multi-order self-attention U-shape network (MOSA-UNet). LIVE-Net is trained and evaluated on two datasets: the MICCAI 2008 Coronary Artery Tracking Challenge (CAT08) dataset and a head and neck CTA dataset from the clinic. Experimental results of both tracking and segmentation show that our proposed LIVE-Net exhibits superior performance compared with other state-of-the-art (SOTA) networks. In the CAT08 dataset, the tracked centerlines have an average overlap of 95.2%, overlap until first error of 91.2%, overlap with the clinically relevant vessels of 98.3%, and error distance inside of 0.21 mm. The corresponding tracking overlap metrics in the head and neck CTA dataset are 96.7%, 91.0%, and 99.8%, respectively. In addition, the results of the consistent experiment also show strong clinical correspondence.
For the segmentation of bilateral carotid and vertebral arteries, our method achieves not only better accuracy, with an average dice similarity coefficient (DSC) of 90.03%, Intersection over Union (IoU) of 81.97%, and 95% Hausdorff distance (95%HD) of 3.42 mm, but also higher efficiency, with an average time of 67.25 s, up to three times faster than some methods applied in the full field of view. Both the tracking and segmentation results prove the potential clinical utility of our network.
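For readers unfamiliar with the 95% Hausdorff distance quoted above, the sketch below shows one common way to compute it from two sets of surface points. Several variants of this metric exist, and the abstract does not say which one was used, so this is an assumption. The function name is hypothetical, and the inputs are assumed to be point coordinates already expressed in millimetres.

```python
import numpy as np

def hd95(points_a, points_b):
    """95th-percentile symmetric Hausdorff distance between two point sets.

    points_a, points_b: arrays of shape (N, D) and (M, D).
    """
    a = np.asarray(points_a, dtype=float)
    b = np.asarray(points_b, dtype=float)
    # Pairwise Euclidean distances, shape (N, M).
    dists = np.linalg.norm(a[:, None, :] - b[None, :, :], axis=-1)
    a_to_b = dists.min(axis=1)   # nearest point in B for each point of A
    b_to_a = dists.min(axis=0)   # nearest point in A for each point of B
    # Using the 95th percentile instead of the maximum damps outliers.
    return max(np.percentile(a_to_b, 95), np.percentile(b_to_a, 95))

surface_pred = [[0.0, 0.0, 0.0], [1.0, 0.0, 0.0]]
surface_true = [[0.0, 0.0, 0.0], [1.0, 0.0, 0.0]]
print(hd95(surface_pred, surface_true))
```

The dense pairwise matrix keeps the sketch short but scales as N times M in memory; for full-resolution surfaces a distance transform or a k-d tree would be the usual choice.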
Affiliation(s)
- Qi Sun
  - Key Laboratory of Intelligent Computing in Medical Image, Ministry of Education, Northeastern University, Shenyang, Liaoning, China; School of Computer Science and Engineering, Northeastern University, Shenyang, Liaoning, China
- Jinzhu Yang
  - Key Laboratory of Intelligent Computing in Medical Image, Ministry of Education, Northeastern University, Shenyang, Liaoning, China; School of Computer Science and Engineering, Northeastern University, Shenyang, Liaoning, China.
- Sizhe Zhao
  - Key Laboratory of Intelligent Computing in Medical Image, Ministry of Education, Northeastern University, Shenyang, Liaoning, China; School of Computer Science and Engineering, Northeastern University, Shenyang, Liaoning, China
- Chen Chen
  - Northeastern University, Shenyang, Liaoning, China
- Yang Hou
  - Department of Radiology, ShengJing Hospital of China Medical University, Shenyang, Liaoning, China
- Yuliang Yuan
  - Key Laboratory of Intelligent Computing in Medical Image, Ministry of Education, Northeastern University, Shenyang, Liaoning, China; School of Computer Science and Engineering, Northeastern University, Shenyang, Liaoning, China
- Shuang Ma
  - Key Laboratory of Intelligent Computing in Medical Image, Ministry of Education, Northeastern University, Shenyang, Liaoning, China; School of Computer Science and Engineering, Northeastern University, Shenyang, Liaoning, China
- Yan Huang
  - Key Laboratory of Intelligent Computing in Medical Image, Ministry of Education, Northeastern University, Shenyang, Liaoning, China; School of Computer Science and Engineering, Northeastern University, Shenyang, Liaoning, China
7
Chen C, Zhou K, Wang Z, Zhang Q, Xiao R. All answers are in the images: A review of deep learning for cerebrovascular segmentation. Comput Med Imaging Graph 2023; 107:102229. [PMID: 37043879 DOI: 10.1016/j.compmedimag.2023.102229] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/24/2022] [Revised: 03/03/2023] [Accepted: 04/03/2023] [Indexed: 04/14/2023]
Abstract
Cerebrovascular imaging is a common examination. Accurate cerebrovascular segmentation has become an important auxiliary method for the diagnosis and treatment of cerebrovascular diseases and has received extensive attention from researchers. Deep learning is a heuristic method that encourages researchers to derive answers from the images in a data-driven manner. With the continuous development of datasets and deep learning theory, it has achieved important success in cerebrovascular segmentation. A detailed survey is an important reference for researchers. To comprehensively analyze the newest cerebrovascular segmentation methods, we have organized and discussed studies centered on deep learning. This survey comprehensively reviews deep learning for cerebrovascular segmentation since 2015. It mainly covers sliding-window-based models, U-Net-based models, other CNN-based models, small-sample-based models, semi-supervised or unsupervised models, fusion-based models, Transformer-based models, and graphics-based models. We organize the structures, improvements, and important parameters of these models, and analyze development trends and quantitative assessments. Finally, we discuss the challenges and opportunities of possible research directions, hoping that our survey can provide researchers with a convenient reference.
Affiliation(s)
- Cheng Chen
  - School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, China
- Kangneng Zhou
  - School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, China
- Zhiliang Wang
  - School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, China
- Qian Zhang
  - Department of Neurosurgery, Beijing Tiantan Hospital, Capital Medical University, Beijing 100070, China; China National Clinical Research Center for Neurological Diseases, Beijing 100070, China
- Ruoxiu Xiao
  - School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, China; Shunde Innovation School, University of Science and Technology Beijing, Foshan 100024, China.
8
Chen C, Qi S, Zhou K, Lu T, Ning H, Xiao R. Pairwise attention-enhanced adversarial model for automatic bone segmentation in CT images. Phys Med Biol 2023; 68. [PMID: 36634367 DOI: 10.1088/1361-6560/acb2ab] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/31/2022] [Accepted: 01/12/2023] [Indexed: 01/14/2023]
Abstract
Objective. Bone segmentation is a critical step in screw placement navigation. Although deep learning methods have promoted the rapid development of bone segmentation, local bone separation is still challenging due to irregular shapes and similar representational features. Approach. In this paper, we propose the pairwise attention-enhanced adversarial model (Pair-SegAM) for automatic bone segmentation in computed tomography images, which consists of two parts: a segmentation model and a discriminator. Considering that the distribution of the predictions from the segmentation model contains complicated semantics, we improve the discriminator to strengthen its awareness of the target region, improving the parsing of semantic features. Pair-SegAM has a pairwise structure, which uses two calculation mechanisms to set up pairwise attention maps; we then use semantic fusion to filter unstable regions. The improved discriminator therefore provides more refined information for capturing the bone outline, effectively enhancing the segmentation model for bone segmentation. Main results. To test Pair-SegAM, we selected two bone datasets for assessment. We evaluated our method against several bone segmentation models and the latest adversarial models on both datasets. The experimental results show that our method not only exhibits superior bone segmentation performance, but also generalizes effectively. Significance. Our method provides a more efficient segmentation of specific bones and has the potential to be extended to other semantic segmentation domains.
Affiliation(s)
- Cheng Chen
  - School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, People's Republic of China
- Siyu Qi
  - School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, People's Republic of China
- Kangneng Zhou
  - School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, People's Republic of China
- Tong Lu
  - Visual 3D Medical Science and Technology Development Co. Ltd, Beijing 100082, People's Republic of China
- Huansheng Ning
  - School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, People's Republic of China
- Ruoxiu Xiao
  - School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, People's Republic of China; Shunde Innovation School, University of Science and Technology Beijing, Foshan 100024, People's Republic of China