1
|
Ji Z, Mu J, Liu J, Zhang H, Dai C, Zhang X, Ganchev I. ASD-Net: a novel U-Net based asymmetric spatial-channel convolution network for precise kidney and kidney tumor image segmentation. Med Biol Eng Comput 2024; 62:1673-1687. [PMID: 38326677 PMCID: PMC11076390 DOI: 10.1007/s11517-024-03025-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Accepted: 01/15/2024] [Indexed: 02/09/2024]
Abstract
Early intervention in tumors can greatly improve human survival rates. With the development of deep learning technology, automatic image segmentation has taken a prominent role in the field of medical image analysis. Manually segmenting kidneys on CT images is a tedious task, and due to the diversity of these images and varying technical skills of professionals, segmentation results can be inconsistent. To address this problem, a novel ASD-Net network is proposed in this paper for kidney and kidney tumor segmentation tasks. First, the proposed network employs newly designed Adaptive Spatial-channel Convolution Optimization (ASCO) blocks to capture anisotropic information in the images. Then, other newly designed blocks, i.e., Dense Dilated Enhancement Convolution (DDEC) blocks, are utilized to enhance feature propagation and reuse it across the network, thereby improving its segmentation accuracy. To allow the network to segment complex and small kidney tumors more effectively, the Atrous Spatial Pyramid Pooling (ASPP) module is incorporated in its middle layer. With its generalized pyramid feature, this module enables the network to better capture and understand context information at various scales within the images. In addition to this, the concurrent spatial and channel squeeze & excitation (scSE) attention mechanism is adopted to better comprehend and manage context information in the images. Additional encoding layers are also added to the base (U-Net) and connected to the original encoding layer through skip connections. The resultant enhanced U-Net structure allows for better extraction and merging of high-level and low-level features, further boosting the network's ability to restore segmentation details. In addition, the combined Binary Cross Entropy (BCE)-Dice loss is utilized as the network's loss function. Experiments, conducted on the KiTS19 dataset, demonstrate that the proposed ASD-Net network outperforms the existing segmentation networks according to all evaluation metrics used, except for recall in the case of kidney tumor segmentation, where it takes the second place after Attention-UNet.
Collapse
Affiliation(s)
- Zhanlin Ji
- Department of Artificial Intelligence, North China University of Science and Technology, Tangshan, 063009, People's Republic of China
| | - Juncheng Mu
- Department of Artificial Intelligence, North China University of Science and Technology, Tangshan, 063009, People's Republic of China
| | - Jianuo Liu
- Department of Artificial Intelligence, North China University of Science and Technology, Tangshan, 063009, People's Republic of China
| | - Haiyang Zhang
- Department of Computing, Xi'an Jiaotong-Liverpool University, Suzhou, People's Republic of China
| | - Chenxu Dai
- Department of Artificial Intelligence, North China University of Science and Technology, Tangshan, 063009, People's Republic of China
| | - Xueji Zhang
- School of Biomedical Engineering, Shenzhen University Health Science Center, Shenzhen, Guangdong, 518060, People's Republic of China.
| | - Ivan Ganchev
- Telecommunications Research Centre (TRC), University of Limerick, Limerick, V94 T9PX, Ireland.
- Department of Computer Systems, University of Plovdiv "Paisii Hilendarski", Plovdiv, 4000, Bulgaria.
- Institute of Mathematics and Informatics, Bulgarian Academy of Sciences, Sofia, 1040, Bulgaria.
| |
Collapse
|
2
|
Rao PK, Chatterjee S, Janardhan M, Nagaraju K, Khan SB, Almusharraf A, Alharbe AI. Optimizing Inference Distribution for Efficient Kidney Tumor Segmentation Using a UNet-PWP Deep-Learning Model with XAI on CT Scan Images. Diagnostics (Basel) 2023; 13:3244. [PMID: 37892065 PMCID: PMC10606269 DOI: 10.3390/diagnostics13203244] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 10/10/2023] [Accepted: 10/10/2023] [Indexed: 10/29/2023] Open
Abstract
Kidney tumors represent a significant medical challenge, characterized by their often-asymptomatic nature and the need for early detection to facilitate timely and effective intervention. Although neural networks have shown great promise in disease prediction, their computational demands have limited their practicality in clinical settings. This study introduces a novel methodology, the UNet-PWP architecture, tailored explicitly for kidney tumor segmentation, designed to optimize resource utilization and overcome computational complexity constraints. A key novelty in our approach is the application of adaptive partitioning, which deconstructs the intricate UNet architecture into smaller submodels. This partitioning strategy reduces computational requirements and enhances the model's efficiency in processing kidney tumor images. Additionally, we augment the UNet's depth by incorporating pre-trained weights, therefore significantly boosting its capacity to handle intricate and detailed segmentation tasks. Furthermore, we employ weight-pruning techniques to eliminate redundant zero-weighted parameters, further streamlining the UNet-PWP model without compromising its performance. To rigorously assess the effectiveness of our proposed UNet-PWP model, we conducted a comparative evaluation alongside the DeepLab V3+ model, both trained on the "KiTs 19, 21, and 23" kidney tumor dataset. Our results are optimistic, with the UNet-PWP model achieving an exceptional accuracy rate of 97.01% on both the training and test datasets, surpassing the DeepLab V3+ model in performance. Furthermore, to ensure our model's results are easily understandable and explainable. We included a fusion of the attention and Grad-CAM XAI methods. This approach provides valuable insights into the decision-making process of our model and the regions of interest that affect its predictions. In the medical field, this interpretability aspect is crucial for healthcare professionals to trust and comprehend the model's reasoning.
Collapse
Affiliation(s)
- P. Kiran Rao
- Artificial Intelligence, Department of Computer Science and Engineering, Ravindra College of Engineering for Women, Kurnool 518001, India
- Department of Computer Science and Engineering, Faculty of Engineering, MS Ramaiah University of Applied Sciences, Bengaluru 560058, India;
| | - Subarna Chatterjee
- Department of Computer Science and Engineering, Faculty of Engineering, MS Ramaiah University of Applied Sciences, Bengaluru 560058, India;
| | - M. Janardhan
- Artificial Intelligence, Department of Computer Science and Engineering, G. Pullaiah College of Engineering and Technology, Kurnool 518008, India;
| | - K. Nagaraju
- Department of Computer Science and Engineering, Indian Institute of Information Technology Design and Manufacturing Kurnool, Kurnool 518008, India;
| | - Surbhi Bhatia Khan
- Department of Data Science, School of Science, Engineering and Environment, University of Salford, Salford M5 4WT, UK
- Department of Electrical and Computer Engineering, Lebanese American University, Byblos 13-5053, Lebanon
| | - Ahlam Almusharraf
- Department of Business Administration, College of Business and Administration, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia;
| | - Abdullah I. Alharbe
- Department of Computer Science, Faculty of Computing and Information Technology, King Abdulaziz University, Rabigh 21911, Saudi Arabia
| |
Collapse
|
3
|
Sun P, Mo Z, Hu F, Song X, Mo T, Yu B, Zhang Y, Chen Z. 2.5D MFFAU-Net: a convolutional neural network for kidney segmentation. BMC Med Inform Decis Mak 2023; 23:92. [PMID: 37165349 PMCID: PMC10173575 DOI: 10.1186/s12911-023-02189-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2022] [Accepted: 05/04/2023] [Indexed: 05/12/2023] Open
Abstract
BACKGROUND Kidney tumors have become increasingly prevalent among adults and are now considered one of the most common types of tumors. Accurate segmentation of kidney tumors can help physicians assess tumor complexity and aggressiveness before surgery. However, segmenting kidney tumors manually can be difficult because of their heterogeneity. METHODS This paper proposes a 2.5D MFFAU-Net (multi-level Feature Fusion Attention U-Net) to segment kidneys, tumors and cysts. First, we propose a 2.5D model for learning to combine and represent a given slice in 2D slices, thereby introducing 3D information to balance memory consumption and model complexity. Then, we propose a ResConv architecture in MFFAU-Net and use the high-level and low-level feature in the model. Finally, we use multi-level information to analyze the spatial features between slices to segment kidneys and tumors. RESULTS The 2.5D MFFAU-Net was evaluated on KiTS19 and KiTS21 kidney datasets and demonstrated an average dice score of 0.924 and 0.875, respectively, and an average Surface dice (SD) score of 0.794 in KiTS21. CONCLUSION The 2.5D MFFAU-Net model can effectively segment kidney tumors, and the results are comparable to those obtained with high-performance 3D CNN models, and have the potential to serve as a point of reference in clinical practice.
Collapse
Affiliation(s)
- Peng Sun
- School of Electronic Engineering and Automation, Guilin University of Electronic Technology, Guilin, 541004, Guangxi, China
| | - Zengnan Mo
- Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, 530021, Guangxi, China
| | - Fangrong Hu
- School of Electronic Engineering and Automation, Guilin University of Electronic Technology, Guilin, 541004, Guangxi, China
| | - Xin Song
- School of Electronic Engineering and Automation, Guilin University of Electronic Technology, Guilin, 541004, Guangxi, China
| | - Taiping Mo
- School of Electronic Engineering and Automation, Guilin University of Electronic Technology, Guilin, 541004, Guangxi, China
| | - Bonan Yu
- School of Architecture and Transportation Engineering, Guilin University of Electronic Technology, Guilin, 541004, Guangxi, China.
| | - Yewei Zhang
- Hepatopancreatobiliary Center, The Second Affiliated Hospital of Nanjing Medical University, Nanjing, China
| | - Zhencheng Chen
- School of Electronic Engineering and Automation, Guilin University of Electronic Technology, Guilin, 541004, Guangxi, China.
| |
Collapse
|