1
|
Li X, Wang J, Wei H, Cong J, Sun H, Wang P, Wei B. MH2AFormer: An Efficient Multiscale Hierarchical Hybrid Attention With a Transformer for Bladder Wall and Tumor Segmentation. IEEE J Biomed Health Inform 2024; 28:4772-4784. [PMID: 38713566 DOI: 10.1109/jbhi.2024.3397698] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/09/2024]
Abstract
Achieving accurate bladder wall and tumor segmentation from MRI is critical for diagnosing and treating bladder cancer. However, automated segmentation remains challenging due to factors such as comparable density distributions, intricate tumor morphologies, and unclear boundaries. Considering the attributes of bladder MRI images, we propose an efficient multiscale hierarchical hybrid attention with a transformer (MH2AFormer) for bladder cancer and wall segmentation. Specifically, a multiscale hybrid attention and transformer (MHAT) module in the encoder is designed to adaptively extract and aggregate multiscale hybrid feature representations from the input image. In the decoder stage, we devise a multiscale hybrid attention (MHA) module to generate high-quality segmentation results from multiscale hybrid features. Combining these modules enhances the feature representation and guides the model to focus on tumor and wall regions, which helps to solve bladder image segmentation challenges. Moreover, MHAT utilizes the Fast Fourier Transformer with a large kernel (e.g., 224 × 224) to model global feature relationships while reducing computational complexity in the encoding stage. The model performance was evaluated on two datasets. As a result, the model achieves relatively best results regarding the intersection over union (IoU) and dice similarity coefficient (DSC) on both datasets (Dataset A: IoU = 80.26%, DSC = 88.20%; Dataset B: IoU = 89.74%, DSC = 94.48%). These advantageous outcomes substantiate the practical utility of our approach, highlighting its potential to alleviate the workload of radiologists when applied in clinical settings.
Collapse
|
2
|
Xu R, Xu C, Li Z, Zheng T, Yu W, Yang C. Boundary guidance network for medical image segmentation. Sci Rep 2024; 14:17345. [PMID: 39069513 PMCID: PMC11284230 DOI: 10.1038/s41598-024-67554-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2024] [Accepted: 07/12/2024] [Indexed: 07/30/2024] Open
Abstract
Accurate segmentation of the tumor area is crucial for the treatment and prognosis of patients with bladder cancer. Cystoscopy is the gold standard for diagnosing bladder tumors. However, The vast majority of current work uses deep learning to identify and segment tumors from CT and MRI findings, and rarely involves cystoscopy findings. Accurately segmenting bladder tumors remains a great challenge due to their diverse morphology and fuzzy boundaries. In order to solve the above problems, this paper proposes a medical image segmentation network with boundary guidance called boundary guidance network. This network combines local features extracted by CNNs and long-range dependencies between different levels inscribed by Parallel ViT, which can capture tumor features more effectively. The Boundary extracted module is designed to extract boundary features and utilize the boundary features to guide the decoding process. Foreground-background dual-channel decoding is performed by boundary integrated module. Experimental results on our proposed new cystoscopic bladder tumor dataset (BTD) show that our method can efficiently perform accurate segmentation of tumors and retain more boundary information, achieving an IoU score of 91.3%, a Hausdorff Distance of 10.43, an mAP score of 85.3%, and a F1 score of 94.8%. On BTD and three other public datasets, our model achieves the best scores compared to state-of-the-art methods, which proves the effectiveness of our model for common medical image segmentation.
Collapse
Affiliation(s)
- Rubin Xu
- School of Integrated Circuits, Anhui University, HeFei, 230601, China
- Anhui Engineering Laboratory of Agro-Ecological Big Data, HeFei, 230601, China
| | - Chao Xu
- School of Integrated Circuits, Anhui University, HeFei, 230601, China.
- Anhui Engineering Laboratory of Agro-Ecological Big Data, HeFei, 230601, China.
| | - Zhengping Li
- School of Integrated Circuits, Anhui University, HeFei, 230601, China
- Anhui Engineering Laboratory of Agro-Ecological Big Data, HeFei, 230601, China
| | - Tianyu Zheng
- School of Integrated Circuits, Anhui University, HeFei, 230601, China
- Anhui Engineering Laboratory of Agro-Ecological Big Data, HeFei, 230601, China
| | - Weidong Yu
- Department of Urology, The First Affiliated Hospital of Anhui Medical University, HeFei, 230022, China
- Institute of Urology, Anhui Medical University, HeFei, 230022, China
- Anhui Province Key Laboratory of Genitourinary Diseases, Anhui Medical University, HeFei, 230022, China
| | - Cheng Yang
- Department of Urology, The First Affiliated Hospital of Anhui Medical University, HeFei, 230022, China.
- Institute of Urology, Anhui Medical University, HeFei, 230022, China.
- Anhui Province Key Laboratory of Genitourinary Diseases, Anhui Medical University, HeFei, 230022, China.
| |
Collapse
|
3
|
Li Y, Li J, Ozcan A. Nonlinear encoding in diffractive information processing using linear optical materials. LIGHT, SCIENCE & APPLICATIONS 2024; 13:173. [PMID: 39043641 PMCID: PMC11266679 DOI: 10.1038/s41377-024-01529-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/02/2024] [Revised: 07/11/2024] [Accepted: 07/12/2024] [Indexed: 07/25/2024]
Abstract
Nonlinear encoding of optical information can be achieved using various forms of data representation. Here, we analyze the performances of different nonlinear information encoding strategies that can be employed in diffractive optical processors based on linear materials and shed light on their utility and performance gaps compared to the state-of-the-art digital deep neural networks. For a comprehensive evaluation, we used different datasets to compare the statistical inference performance of simpler-to-implement nonlinear encoding strategies that involve, e.g., phase encoding, against data repetition-based nonlinear encoding strategies. We show that data repetition within a diffractive volume (e.g., through an optical cavity or cascaded introduction of the input data) causes the loss of the universal linear transformation capability of a diffractive optical processor. Therefore, data repetition-based diffractive blocks cannot provide optical analogs to fully connected or convolutional layers commonly employed in digital neural networks. However, they can still be effectively trained for specific inference tasks and achieve enhanced accuracy, benefiting from the nonlinear encoding of the input information. Our results also reveal that phase encoding of input information without data repetition provides a simpler nonlinear encoding strategy with comparable statistical inference accuracy to data repetition-based diffractive processors. Our analyses and conclusions would be of broad interest to explore the push-pull relationship between linear material-based diffractive optical systems and nonlinear encoding strategies in visual information processors.
Collapse
Affiliation(s)
- Yuhang Li
- Electrical & Computer Engineering Department, University of California, Los Angeles, CA, 90095, USA
- Bioengineering Department, University of California, Los Angeles, CA, 90095, USA
- California NanoSystems Institute (CNSI), University of California, Los Angeles, CA, 90095, USA
| | - Jingxi Li
- Electrical & Computer Engineering Department, University of California, Los Angeles, CA, 90095, USA
- Bioengineering Department, University of California, Los Angeles, CA, 90095, USA
- California NanoSystems Institute (CNSI), University of California, Los Angeles, CA, 90095, USA
| | - Aydogan Ozcan
- Electrical & Computer Engineering Department, University of California, Los Angeles, CA, 90095, USA.
- Bioengineering Department, University of California, Los Angeles, CA, 90095, USA.
- California NanoSystems Institute (CNSI), University of California, Los Angeles, CA, 90095, USA.
| |
Collapse
|
4
|
Yue X, Huang X, Xu Z, Chen Y, Xu C. Involving logical clinical knowledge into deep neural networks to improve bladder tumor segmentation. Med Image Anal 2024; 95:103189. [PMID: 38776840 DOI: 10.1016/j.media.2024.103189] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Revised: 04/06/2024] [Accepted: 05/01/2024] [Indexed: 05/25/2024]
Abstract
Segmentation of bladder tumors from medical radiographic images is of great significance for early detection, diagnosis and prognosis evaluation of bladder cancer. Deep Convolution Neural Networks (DCNNs) have been successfully used for bladder tumor segmentation, but the segmentation based on DCNN is data-hungry for model training and ignores clinical knowledge. From the clinical view, bladder tumors originate from the mucosal surface of bladder and must rely on the bladder wall to survive and grow. This clinical knowledge of tumor location is helpful to improve the bladder tumor segmentation. To achieve this, we propose a novel bladder tumor segmentation method, which incorporates the clinical logic rules of bladder tumor and bladder wall into DCNNs to harness the tumor segmentation. Clinical logical rules provide a semantic and human-readable knowledge representation and are easy for knowledge acquisition from clinicians. In addition, incorporating logical rules of clinical knowledge helps to reduce the data dependency of the segmentation network, and enables precise segmentation results even with limited number of annotated images. Experiments on bladder MR images collected from the collaborating hospital validate the effectiveness of the proposed bladder tumor segmentation method.
Collapse
Affiliation(s)
- Xiaodong Yue
- Artificial Intelligence Institute of Shanghai University, Shanghai University, Shanghai 200444, China; School of Computer Engineering and Science, Shanghai University, Shanghai 200444, China.
| | - Xiao Huang
- School of Computer Engineering and Science, Shanghai University, Shanghai 200444, China
| | - Zhikang Xu
- School of Computer Engineering and Science, Shanghai University, Shanghai 200444, China
| | - Yufei Chen
- College of Electronics and Information Engineering, Tongji University, Shanghai 201804, China.
| | - Chuanliang Xu
- Department of Urology, Changhai hospital, Shanghai 200433, China
| |
Collapse
|
5
|
Ye Y, Luo Z, Qiu Z, Cao K, Huang B, Deng L, Zhang W, Liu G, Zou Y, Zhang J, Li J. Radiomics Prediction of Muscle Invasion in Bladder Cancer Using Semi-Automatic Lesion Segmentation of MRI Compared with Manual Segmentation. Bioengineering (Basel) 2023; 10:1355. [PMID: 38135946 PMCID: PMC10740947 DOI: 10.3390/bioengineering10121355] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2023] [Revised: 11/10/2023] [Accepted: 11/23/2023] [Indexed: 12/24/2023] Open
Abstract
Conventional radiomics analysis requires the manual segmentation of lesions, which is time-consuming and subjective. This study aimed to assess the feasibility of predicting muscle invasion in bladder cancer (BCa) with radiomics using a semi-automatic lesion segmentation method on T2-weighted images. Cases of non-muscle-invasive BCa (NMIBC) and muscle-invasive BCa (MIBC) were pathologically identified in a training cohort and in internal and external validation cohorts. For bladder tumor segmentation, a deep learning-based semi-automatic model was constructed, while manual segmentation was performed by a radiologist. Semi-automatic and manual segmentation results were respectively used in radiomics analyses to distinguish NMIBC from MIBC. An equivalence test was used to compare the models' performance. The mean Dice similarity coefficients of the semi-automatic segmentation method were 0.836 and 0.801 in the internal and external validation cohorts, respectively. The area under the receiver operating characteristic curve (AUC) were 1.00 (0.991) and 0.892 (0.894) for the semi-automated model (manual) on the internal and external validation cohort, respectively (both p < 0.05). The average total processing time for semi-automatic segmentation was significantly shorter than that for manual segmentation (35 s vs. 92 s, p < 0.001). The BCa radiomics model based on semi-automatic segmentation method had a similar diagnostic performance as that of manual segmentation, while being less time-consuming and requiring fewer manual interventions.
Collapse
Affiliation(s)
- Yaojiang Ye
- Department of Radiology, The Tenth Affiliated Hospital of Southern Medical University (Dongguan People’s Hospital), Dongguan 523059, China; (Y.Y.); (L.D.); (Y.Z.)
| | - Zixin Luo
- Medical AI Lab, School of Biomedical Engineering, Shenzhen University Medical School, Shenzhen University, Shenzhen 518060, China; (Z.L.); (Z.Q.); (K.C.); (B.H.)
| | - Zhengxuan Qiu
- Medical AI Lab, School of Biomedical Engineering, Shenzhen University Medical School, Shenzhen University, Shenzhen 518060, China; (Z.L.); (Z.Q.); (K.C.); (B.H.)
| | - Kangyang Cao
- Medical AI Lab, School of Biomedical Engineering, Shenzhen University Medical School, Shenzhen University, Shenzhen 518060, China; (Z.L.); (Z.Q.); (K.C.); (B.H.)
| | - Bingsheng Huang
- Medical AI Lab, School of Biomedical Engineering, Shenzhen University Medical School, Shenzhen University, Shenzhen 518060, China; (Z.L.); (Z.Q.); (K.C.); (B.H.)
| | - Lei Deng
- Department of Radiology, The Tenth Affiliated Hospital of Southern Medical University (Dongguan People’s Hospital), Dongguan 523059, China; (Y.Y.); (L.D.); (Y.Z.)
| | - Weijing Zhang
- Imaging Department, Sun Yat-Sen University Cancer Center, State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Guangzhou 510060, China;
| | - Guoqing Liu
- College of Physics and Optoelectronic Engineering, Shenzhen University, Shenzhen 518060, China;
| | - Yujian Zou
- Department of Radiology, The Tenth Affiliated Hospital of Southern Medical University (Dongguan People’s Hospital), Dongguan 523059, China; (Y.Y.); (L.D.); (Y.Z.)
| | - Jian Zhang
- Shenzhen University Medical School, Shenzhen University, Shenzhen 518060, China
- Shenzhen-Hong Kong Institute of Brain Science-Shenzhen Fundamental Research Institutions, Shenzhen 518060, China
| | - Jianpeng Li
- Department of Radiology, The Tenth Affiliated Hospital of Southern Medical University (Dongguan People’s Hospital), Dongguan 523059, China; (Y.Y.); (L.D.); (Y.Z.)
| |
Collapse
|
6
|
Shen T, Huang F, Zhang X. CT medical image segmentation algorithm based on deep learning technology. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2023; 20:10954-10976. [PMID: 37322967 DOI: 10.3934/mbe.2023485] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
For the problems of blurred edges, uneven background distribution, and many noise interferences in medical image segmentation, we proposed a medical image segmentation algorithm based on deep neural network technology, which adopts a similar U-Net backbone structure and includes two parts: encoding and decoding. Firstly, the images are passed through the encoder path with residual and convolutional structures for image feature information extraction. We added the attention mechanism module to the network jump connection to address the problems of redundant network channel dimensions and low spatial perception of complex lesions. Finally, the medical image segmentation results are obtained using the decoder path with residual and convolutional structures. To verify the validity of the model in this paper, we conducted the corresponding comparative experimental analysis, and the experimental results show that the DICE and IOU of the proposed model are 0.7826, 0.9683, 0.8904, 0.8069, and 0.9462, 0.9537 for DRIVE, ISIC2018 and COVID-19 CT datasets, respectively. The segmentation accuracy is effectively improved for medical images with complex shapes and adhesions between lesions and normal tissues.
Collapse
Affiliation(s)
- Tongping Shen
- School of Information Engineering, Anhui University of Chinese Medicine, Hefei, 230012, China
- Graduate School, Angeles University Foundation, Angeles 2009, Philippines
| | - Fangliang Huang
- School of Information Engineering, Anhui University of Chinese Medicine, Hefei, 230012, China
| | - Xusong Zhang
- Graduate School, Angeles University Foundation, Angeles 2009, Philippines
| |
Collapse
|