1
|
Aumente-Maestro C, Díez J, Remeseiro B. A multi-task framework for breast cancer segmentation and classification in ultrasound imaging. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2025; 260:108540. [PMID: 39647406 DOI: 10.1016/j.cmpb.2024.108540] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/24/2024] [Revised: 11/08/2024] [Accepted: 11/28/2024] [Indexed: 12/10/2024]
Abstract
BACKGROUND Ultrasound (US) is a medical imaging modality that plays a crucial role in the early detection of breast cancer. The emergence of numerous deep learning systems has offered promising avenues for the segmentation and classification of breast cancer tumors in US images. However, challenges such as the absence of data standardization, the exclusion of non-tumor images during training, and the narrow view of single-task methodologies have hindered the practical applicability of these systems, often resulting in biased outcomes. This study aims to explore the potential of multi-task systems in enhancing the detection of breast cancer lesions. METHODS To address these limitations, our research introduces an end-to-end multi-task framework designed to leverage the inherent correlations between breast cancer lesion classification and segmentation tasks. Additionally, a comprehensive analysis of a widely utilized public breast cancer ultrasound dataset named BUSI was carried out, identifying its irregularities and devising an algorithm tailored for detecting duplicated images in it. RESULTS Experiments are conducted utilizing the curated dataset to minimize potential biases in outcomes. Our multi-task framework exhibits superior performance in breast cancer respecting single-task approaches, achieving improvements close to 15% in segmentation and classification. Moreover, a comparative analysis against the state-of-the-art reveals statistically significant enhancements across both tasks. CONCLUSION The experimental findings underscore the efficacy of multi-task techniques, showcasing better generalization capabilities when considering all image types: benign, malignant, and non-tumor images. Consequently, our methodology represents an advance towards more general architectures with real clinical applications in the breast cancer field.
Collapse
Affiliation(s)
| | - Jorge Díez
- Artificial Intelligence Center, Universidad de Oviedo, Gijón, 33204, Spain
| | - Beatriz Remeseiro
- Artificial Intelligence Center, Universidad de Oviedo, Gijón, 33204, Spain.
| |
Collapse
|
2
|
Qu M, Yang J, Li H, Qi Y, Yu Q. Contour-constrained branch U-Net for accurate left ventricular segmentation in echocardiography. Med Biol Eng Comput 2025; 63:561-573. [PMID: 39417962 DOI: 10.1007/s11517-024-03201-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Accepted: 09/16/2024] [Indexed: 10/19/2024]
Abstract
Using echocardiography to assess the left ventricular function is one of the most crucial cardiac examinations in clinical diagnosis, and LV segmentation plays a particularly vital role in medical image processing as many important clinical diagnostic parameters are derived from the segmentation results, such as ejection function. However, echocardiography typically has a lower resolution and contains a significant amount of noise and motion artifacts, making it a challenge to accurate segmentation, especially in the region of the cardiac chamber boundary, which significantly restricts the accurate calculation of subsequent clinical parameters. In this paper, our goal is to achieve accurate LV segmentation through a simplified approach by introducing a branch sub-network into the decoder of the traditional U-Net. Specifically, we employed the LV contour features to supervise the branch decoding process and used a cross attention module to facilitate the interaction relationship between the branch and the original decoding process, thereby improving the segmentation performance in the region LV boundaries. In the experiments, the proposed branch U-Net (BU-Net) demonstrated superior performance on CAMUS and EchoNet-dynamic public echocardiography segmentation datasets in comparison to state-of-the-art segmentation models, without the need for complex residual connections or transformer-based architectures. Our codes are publicly available at Anonymous Github https://anonymous.4open.science/r/Anoymous_two-BFF2/ .
Collapse
Affiliation(s)
- Mingjun Qu
- Computer Science and Engineering, Northeastern University, Shenyang, China
- Key Laboratory of Intelligent Computing in Medical Image, Ministry of Education, Northeastern University, Shenyang, China
- National Frontiers Science Center for Industrial Intelligence and Systems Optimization, Shenyang, China
| | - Jinzhu Yang
- Computer Science and Engineering, Northeastern University, Shenyang, China.
- Key Laboratory of Intelligent Computing in Medical Image, Ministry of Education, Northeastern University, Shenyang, China.
- National Frontiers Science Center for Industrial Intelligence and Systems Optimization, Shenyang, China.
| | - Honghe Li
- Computer Science and Engineering, Northeastern University, Shenyang, China
- Key Laboratory of Intelligent Computing in Medical Image, Ministry of Education, Northeastern University, Shenyang, China
- National Frontiers Science Center for Industrial Intelligence and Systems Optimization, Shenyang, China
| | - Yiqiu Qi
- Computer Science and Engineering, Northeastern University, Shenyang, China
- Key Laboratory of Intelligent Computing in Medical Image, Ministry of Education, Northeastern University, Shenyang, China
- National Frontiers Science Center for Industrial Intelligence and Systems Optimization, Shenyang, China
| | - Qi Yu
- Computer Science and Engineering, Northeastern University, Shenyang, China
- Key Laboratory of Intelligent Computing in Medical Image, Ministry of Education, Northeastern University, Shenyang, China
- National Frontiers Science Center for Industrial Intelligence and Systems Optimization, Shenyang, China
| |
Collapse
|
3
|
Niri R, Zahia S, Stefanelli A, Sharma K, Probst S, Pichon S, Chanel G. Wound Segmentation with U-Net Using a Dual Attention Mechanism and Transfer Learning. JOURNAL OF IMAGING INFORMATICS IN MEDICINE 2025:10.1007/s10278-025-01386-w. [PMID: 39849203 DOI: 10.1007/s10278-025-01386-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/28/2024] [Revised: 12/18/2024] [Accepted: 12/19/2024] [Indexed: 01/25/2025]
Abstract
Accurate wound segmentation is crucial for the precise diagnosis and treatment of various skin conditions through image analysis. In this paper, we introduce a novel dual attention U-Net model designed for precise wound segmentation. Our proposed architecture integrates two widely used deep learning models, VGG16 and U-Net, incorporating dual attention mechanisms to focus on relevant regions within the wound area. Initially trained on diabetic foot ulcer images, we fine-tuned the model to acute and chronic wound images and conducted a comprehensive comparison with other state-of-the-art models. The results highlight the superior performance of our proposed dual attention model, achieving a Dice coefficient and IoU of 94.1% and 89.3%, respectively, on the test set. This underscores the robustness of our method and its capacity to generalize effectively to new data.
Collapse
Affiliation(s)
- Rania Niri
- Computer Science Department, University of Geneva, Geneva, Switzerland.
| | | | - Alessio Stefanelli
- School of Health Sciences, HES-SO Geneva University of Applied Sciences and Arts, Western Switzerland, Geneva, Switzerland
| | - Kaushal Sharma
- Computer Science Department, University of Geneva, Geneva, Switzerland
| | - Sebastian Probst
- School of Health Sciences, HES-SO Geneva University of Applied Sciences and Arts, Western Switzerland, Geneva, Switzerland
- Care Directorate, Geneva University Hospitals, Geneva, Switzerland
| | - Swann Pichon
- Institute of Industrial and IT Engineering, HEPIA, HES-SO Geneva University of Applied Sciences and Arts, Western Switzerland, Geneva, Switzerland
| | - Guillaume Chanel
- Computer Science Department, University of Geneva, Geneva, Switzerland
- Institute of Industrial and IT Engineering, HEPIA, HES-SO Geneva University of Applied Sciences and Arts, Western Switzerland, Geneva, Switzerland
| |
Collapse
|
4
|
Tinglan L, Jun Q, Guihe Q, Weili S, Wentao Z. Liver segmentation network based on detail enhancement and multi-scale feature fusion. Sci Rep 2025; 15:683. [PMID: 39753603 PMCID: PMC11699127 DOI: 10.1038/s41598-024-78917-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2024] [Accepted: 11/05/2024] [Indexed: 01/06/2025] Open
Abstract
Due to the low contrast of abdominal CT (Computer Tomography) images and the similar color and shape of the liver to other organs such as the spleen, stomach, and kidneys, liver segmentation presents significant challenges. Additionally, 2D CT images obtained from different angles (such as sagittal, coronal, and transverse planes) increase the diversity of liver morphology and the complexity of segmentation. To address these issues, this paper proposes a Detail Enhanced Convolution (DE Conv) to improve liver feature learning and thereby enhance liver segmentation performance. Furthermore, to enable the model to better learn liver features at different scales, a Multi-Scale Feature Fusion module (MSFF) is added to the skip connections in the model. The MSFF module enhances the capture of global features, thus improving the accuracy of the liver segmentation model. Through the aforementioned research, this paper proposes a liver segmentation network based on detail enhancement and multi-scale feature fusion (DEMF-Net). We conducted extensive experiments on the LiTS17 dataset, and the results demonstrate that the DEMF-Net model achieved significant improvements across various evaluation metrics. Therefore, the proposed DEMF-Net model can achieve precise liver segmentation.
Collapse
Affiliation(s)
- Lu Tinglan
- Changchun University of Science and Technology, Changchun, China
| | - Qin Jun
- Changchun University of Science and Technology, Changchun, China.
| | | | - Shi Weili
- Changchun University of Science and Technology, Changchun, China
| | - Zhang Wentao
- Zhongshan Institute of Changchun University of Science and Technology, Changchun, China
| |
Collapse
|
5
|
Gul S, Khan MS, Hossain MSA, Chowdhury MEH, Sumon MSI. A Comparative Study of Decoders for Liver and Tumor Segmentation Using a Self-ONN-Based Cascaded Framework. Diagnostics (Basel) 2024; 14:2761. [PMID: 39682669 DOI: 10.3390/diagnostics14232761] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2024] [Revised: 11/22/2024] [Accepted: 12/06/2024] [Indexed: 12/18/2024] Open
Abstract
Background/Objectives: Accurate liver and tumor detection and segmentation are crucial in diagnosis of early-stage liver malignancies. As opposed to manual interpretation, which is a difficult and time-consuming process, accurate tumor detection using a computer-aided diagnosis system can save both time and human efforts. Methods: We propose a cascaded encoder-decoder technique based on self-organized neural networks, which is a recent variant of operational neural networks (ONNs), for accurate segmentation and identification of liver tumors. The first encoder-decoder CNN segments the liver. For generating the liver region of interest, the segmented liver mask is placed over the input computed tomography (CT) image and then fed to the second Self-ONN model for tumor segmentation. For further investigation the other three distinct encoder-decoder architectures U-Net, feature pyramid networks (FPNs), and U-Net++, have also been investigated by altering the backbone at the encoders utilizing ResNet and DenseNet variants for transfer learning. Results: For the liver segmentation task, Self-ONN with a ResNet18 backbone has achieved a dice similarity coefficient score of 98.182% and an intersection over union of 97.436%. Tumor segmentation with Self-ONN with the DenseNet201 encoder resulted in an outstanding DSC of 92.836% and IoU of 91.748%. Conclusions: The suggested method is capable of precisely locating liver tumors of various sizes and shapes, including tiny infection patches that were said to be challenging to find in earlier research.
Collapse
Affiliation(s)
- Sidra Gul
- Department of Computer Systems Engineering, University of Engineering and Technology, Peshawar 25000, Pakistan
- Artificial Intelligence in Healthcare, Intelligent Information Processing Lab, National Center of Artificial Intelligence, Peshawar 25000, Pakistan
| | - Muhammad Salman Khan
- Department of Electrical Engineering, College of Engineering, Qatar University, Doha 2713, Qatar
| | - Md Sakib Abrar Hossain
- Department of Electrical Engineering, College of Engineering, Qatar University, Doha 2713, Qatar
| | - Muhammad E H Chowdhury
- Department of Electrical Engineering, College of Engineering, Qatar University, Doha 2713, Qatar
| | - Md Shaheenur Islam Sumon
- Department of Electrical Engineering, College of Engineering, Qatar University, Doha 2713, Qatar
| |
Collapse
|
6
|
Azad R, Aghdam EK, Rauland A, Jia Y, Avval AH, Bozorgpour A, Karimijafarbigloo S, Cohen JP, Adeli E, Merhof D. Medical Image Segmentation Review: The Success of U-Net. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2024; 46:10076-10095. [PMID: 39167505 DOI: 10.1109/tpami.2024.3435571] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/23/2024]
Abstract
Automatic medical image segmentation is a crucial topic in the medical domain and successively a critical counterpart in the computer-aided diagnosis paradigm. U-Net is the most widespread image segmentation architecture due to its flexibility, optimized modular design, and success in all medical image modalities. Over the years, the U-Net model has received tremendous attention from academic and industrial researchers who have extended it to address the scale and complexity created by medical tasks. These extensions are commonly related to enhancing the U-Net's backbone, bottleneck, or skip connections, or including representation learning, or combining it with a Transformer architecture, or even addressing probabilistic prediction of the segmentation map. Having a compendium of different previously proposed U-Net variants makes it easier for machine learning researchers to identify relevant research questions and understand the challenges of the biological tasks that challenge the model. In this work, we discuss the practical aspects of the U-Net model and organize each variant model into a taxonomy. Moreover, to measure the performance of these strategies in a clinical application, we propose fair evaluations of some unique and famous designs on well-known datasets. Furthermore, we provide a comprehensive implementation library with trained models. In addition, for ease of future studies, we created an online list of U-Net papers with their possible official implementation.
Collapse
|
7
|
Delmoral JC, R S Tavares JM. Semantic Segmentation of CT Liver Structures: A Systematic Review of Recent Trends and Bibliometric Analysis : Neural Network-based Methods for Liver Semantic Segmentation. J Med Syst 2024; 48:97. [PMID: 39400739 PMCID: PMC11473507 DOI: 10.1007/s10916-024-02115-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2024] [Accepted: 10/02/2024] [Indexed: 10/15/2024]
Abstract
The use of artificial intelligence (AI) in the segmentation of liver structures in medical images has become a popular research focus in the past half-decade. The performance of AI tools in screening for this task may vary widely and has been tested in the literature in various datasets. However, no scientometric report has provided a systematic overview of this scientific area. This article presents a systematic and bibliometric review of recent advances in neuronal network modeling approaches, mainly of deep learning, to outline the multiple research directions of the field in terms of algorithmic features. Therefore, a detailed systematic review of the most relevant publications addressing fully automatic semantic segmenting liver structures in Computed Tomography (CT) images in terms of algorithm modeling objective, performance benchmark, and model complexity is provided. The review suggests that fully automatic hybrid 2D and 3D networks are the top performers in the semantic segmentation of the liver. In the case of liver tumor and vasculature segmentation, fully automatic generative approaches perform best. However, the reported performance benchmark indicates that there is still much to be improved in segmenting such small structures in high-resolution abdominal CT scans.
Collapse
Affiliation(s)
- Jessica C Delmoral
- Instituto de Ciência e Inovação em Engenharia Mecânica e Engenharia Industrial, Faculdade de Engenharia, Universidade do Porto, Rua Dr. Roberto Frias, s/n, 4200-465, Porto, Portugal
| | - João Manuel R S Tavares
- Instituto de Ciência e Inovação em Engenharia Mecânica e Engenharia Industrial, Departamento de Engenharia Mecânica, Faculdade de Engenharia, Universidade do Porto, Rua Dr. Roberto Frias, s/n, 4200-465, Porto, Portugal.
| |
Collapse
|
8
|
Chen Y, Li Y, Wu B, Liu F, Deng Y, Jiang X, Lin Z, Ren K, Gao D. Lightweight Hotspot Detection Model Fusing SE and ECA Mechanisms. MICROMACHINES 2024; 15:1217. [PMID: 39459092 PMCID: PMC11509392 DOI: 10.3390/mi15101217] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/12/2024] [Revised: 09/23/2024] [Accepted: 09/24/2024] [Indexed: 10/28/2024]
Abstract
In this paper, we propose a lightweight lithography machine learning-based hotspot detection model that integrates the Squeeze-and-Excitation (SE) attention mechanism and the Efficient Channel Attention (ECA) mechanism. These mechanisms can adaptively adjust channel weights, significantly enhancing the model's ability to extract relevant features of hotspots and non-hotspots through cross-channel interaction without dimensionality reduction. Our model extracts feature vectors through seven convolutional layers and four pooling layers, followed by three fully connected layers that map to the output, thereby simplifying the CNN network structure. Experimental results on our collected layout dataset and the ICCAD 2012 layout dataset demonstrate that our model is more lightweight. By evaluating overall accuracy, recall, and runtime, the comprehensive performance of our model is shown to exceed that of ConvNeXt, Swin transformer, and ResNet 50.
Collapse
Affiliation(s)
- Yanning Chen
- Beijing Smartchip Microelectronics Technology Co., Ltd., Beijing 100192, China; (Y.C.); (B.W.); (F.L.); (Y.D.)
| | - Yanjiang Li
- College of Integrated Circuits, Zhejiang University, Hangzhou 311200, China; (Y.L.); (X.J.); (Z.L.)
| | - Bo Wu
- Beijing Smartchip Microelectronics Technology Co., Ltd., Beijing 100192, China; (Y.C.); (B.W.); (F.L.); (Y.D.)
| | - Fang Liu
- Beijing Smartchip Microelectronics Technology Co., Ltd., Beijing 100192, China; (Y.C.); (B.W.); (F.L.); (Y.D.)
| | - Yongfeng Deng
- Beijing Smartchip Microelectronics Technology Co., Ltd., Beijing 100192, China; (Y.C.); (B.W.); (F.L.); (Y.D.)
| | - Xiaolong Jiang
- College of Integrated Circuits, Zhejiang University, Hangzhou 311200, China; (Y.L.); (X.J.); (Z.L.)
| | - Zebang Lin
- College of Integrated Circuits, Zhejiang University, Hangzhou 311200, China; (Y.L.); (X.J.); (Z.L.)
| | - Kun Ren
- College of Integrated Circuits, Zhejiang University, Hangzhou 311200, China; (Y.L.); (X.J.); (Z.L.)
| | - Dawei Gao
- College of Integrated Circuits, Zhejiang University, Hangzhou 311200, China; (Y.L.); (X.J.); (Z.L.)
| |
Collapse
|
9
|
Madsen SJ, Uddin LQ, Mumford JA, Barch DM, Fair DA, Gotlib IH, Poldrack RA, Kuceyeski A, Saggar M. Predicting Task Activation Maps from Resting-State Functional Connectivity using Deep Learning. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.09.10.612309. [PMID: 39314460 PMCID: PMC11419026 DOI: 10.1101/2024.09.10.612309] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 09/25/2024]
Abstract
Recent work has shown that deep learning is a powerful tool for predicting brain activation patterns evoked through various tasks using resting state features. We replicate and improve upon this recent work to introduce two models, BrainSERF and BrainSurfGCN, that perform at least as well as the state-of-the-art while greatly reducing memory and computational footprints. Our performance analysis observed that low predictability was associated with a possible lack of task engagement derived from behavioral performance. Furthermore, a deficiency in model performance was also observed for closely matched task contrasts, likely due to high individual variability confirmed by low test-retest reliability. Overall, we successfully replicate recently developed deep learning architecture and provide scalable models for further research.
Collapse
Affiliation(s)
| | - Lucina Q. Uddin
- Department of Psychiatry, University of California, Los Angeles, USA
| | | | - Deanna M. Barch
- Department of Psychology, Washington University in St. Louis, USA
| | | | | | | | - Amy Kuceyeski
- Department of Radiology, Weill Cornell Medicine, USA
| | | |
Collapse
|
10
|
Alzubaidi L, Al-Dulaimi K, Salhi A, Alammar Z, Fadhel MA, Albahri AS, Alamoodi AH, Albahri OS, Hasan AF, Bai J, Gilliland L, Peng J, Branni M, Shuker T, Cutbush K, Santamaría J, Moreira C, Ouyang C, Duan Y, Manoufali M, Jomaa M, Gupta A, Abbosh A, Gu Y. Comprehensive review of deep learning in orthopaedics: Applications, challenges, trustworthiness, and fusion. Artif Intell Med 2024; 155:102935. [PMID: 39079201 DOI: 10.1016/j.artmed.2024.102935] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Revised: 03/18/2024] [Accepted: 07/22/2024] [Indexed: 08/24/2024]
Abstract
Deep learning (DL) in orthopaedics has gained significant attention in recent years. Previous studies have shown that DL can be applied to a wide variety of orthopaedic tasks, including fracture detection, bone tumour diagnosis, implant recognition, and evaluation of osteoarthritis severity. The utilisation of DL is expected to increase, owing to its ability to present accurate diagnoses more efficiently than traditional methods in many scenarios. This reduces the time and cost of diagnosis for patients and orthopaedic surgeons. To our knowledge, no exclusive study has comprehensively reviewed all aspects of DL currently used in orthopaedic practice. This review addresses this knowledge gap using articles from Science Direct, Scopus, IEEE Xplore, and Web of Science between 2017 and 2023. The authors begin with the motivation for using DL in orthopaedics, including its ability to enhance diagnosis and treatment planning. The review then covers various applications of DL in orthopaedics, including fracture detection, detection of supraspinatus tears using MRI, osteoarthritis, prediction of types of arthroplasty implants, bone age assessment, and detection of joint-specific soft tissue disease. We also examine the challenges for implementing DL in orthopaedics, including the scarcity of data to train DL and the lack of interpretability, as well as possible solutions to these common pitfalls. Our work highlights the requirements to achieve trustworthiness in the outcomes generated by DL, including the need for accuracy, explainability, and fairness in the DL models. We pay particular attention to fusion techniques as one of the ways to increase trustworthiness, which have also been used to address the common multimodality in orthopaedics. Finally, we have reviewed the approval requirements set forth by the US Food and Drug Administration to enable the use of DL applications. As such, we aim to have this review function as a guide for researchers to develop a reliable DL application for orthopaedic tasks from scratch for use in the market.
Collapse
Affiliation(s)
- Laith Alzubaidi
- School of Mechanical, Medical, and Process Engineering, Queensland University of Technology, Brisbane, QLD 4000, Australia; QUASR/ARC Industrial Transformation Training Centre-Joint Biomechanics, Queensland University of Technology, Brisbane, QLD 4000, Australia; Research and Development department, Akunah Med Technology Pty Ltd Co, Brisbane, QLD 4120, Australia.
| | - Khamael Al-Dulaimi
- Computer Science Department, College of Science, Al-Nahrain University, Baghdad, Baghdad 10011, Iraq; School of Electrical Engineering and Robotics, Queensland University of Technology, Brisbane, QLD 4000, Australia
| | - Asma Salhi
- QUASR/ARC Industrial Transformation Training Centre-Joint Biomechanics, Queensland University of Technology, Brisbane, QLD 4000, Australia; Research and Development department, Akunah Med Technology Pty Ltd Co, Brisbane, QLD 4120, Australia
| | - Zaenab Alammar
- School of Computer Science, Queensland University of Technology, Brisbane, QLD 4000, Australia
| | - Mohammed A Fadhel
- Research and Development department, Akunah Med Technology Pty Ltd Co, Brisbane, QLD 4120, Australia
| | - A S Albahri
- Technical College, Imam Ja'afar Al-Sadiq University, Baghdad, Iraq
| | - A H Alamoodi
- Institute of Informatics and Computing in Energy, Universiti Tenaga Nasional, Kajang 43000, Malaysia
| | - O S Albahri
- Australian Technical and Management College, Melbourne, Australia
| | - Amjad F Hasan
- Faculty of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211, USA
| | - Jinshuai Bai
- School of Mechanical, Medical, and Process Engineering, Queensland University of Technology, Brisbane, QLD 4000, Australia; QUASR/ARC Industrial Transformation Training Centre-Joint Biomechanics, Queensland University of Technology, Brisbane, QLD 4000, Australia
| | - Luke Gilliland
- QUASR/ARC Industrial Transformation Training Centre-Joint Biomechanics, Queensland University of Technology, Brisbane, QLD 4000, Australia; Research and Development department, Akunah Med Technology Pty Ltd Co, Brisbane, QLD 4120, Australia
| | - Jing Peng
- Research and Development department, Akunah Med Technology Pty Ltd Co, Brisbane, QLD 4120, Australia
| | - Marco Branni
- QUASR/ARC Industrial Transformation Training Centre-Joint Biomechanics, Queensland University of Technology, Brisbane, QLD 4000, Australia; Research and Development department, Akunah Med Technology Pty Ltd Co, Brisbane, QLD 4120, Australia
| | - Tristan Shuker
- QUASR/ARC Industrial Transformation Training Centre-Joint Biomechanics, Queensland University of Technology, Brisbane, QLD 4000, Australia; St Andrew's War Memorial Hospital, Brisbane, QLD 4000, Australia
| | - Kenneth Cutbush
- QUASR/ARC Industrial Transformation Training Centre-Joint Biomechanics, Queensland University of Technology, Brisbane, QLD 4000, Australia; St Andrew's War Memorial Hospital, Brisbane, QLD 4000, Australia
| | - Jose Santamaría
- Department of Computer Science, University of Jaén, Jaén 23071, Spain
| | - Catarina Moreira
- Data Science Institute, University of Technology Sydney, Australia
| | - Chun Ouyang
- School of Information Systems, Queensland University of Technology, Brisbane, QLD 4000, Australia
| | - Ye Duan
- School of Computing, Clemson University, Clemson, 29631, SC, USA
| | - Mohamed Manoufali
- CSIRO, Kensington, WA 6151, Australia; School of Information Technology and Electrical Engineering, The University of Queensland, Brisbane, QLD 4067, Australia
| | - Mohammad Jomaa
- QUASR/ARC Industrial Transformation Training Centre-Joint Biomechanics, Queensland University of Technology, Brisbane, QLD 4000, Australia; St Andrew's War Memorial Hospital, Brisbane, QLD 4000, Australia
| | - Ashish Gupta
- School of Mechanical, Medical, and Process Engineering, Queensland University of Technology, Brisbane, QLD 4000, Australia; QUASR/ARC Industrial Transformation Training Centre-Joint Biomechanics, Queensland University of Technology, Brisbane, QLD 4000, Australia; Research and Development department, Akunah Med Technology Pty Ltd Co, Brisbane, QLD 4120, Australia
| | - Amin Abbosh
- School of Information Technology and Electrical Engineering, The University of Queensland, Brisbane, QLD 4067, Australia
| | - Yuantong Gu
- School of Mechanical, Medical, and Process Engineering, Queensland University of Technology, Brisbane, QLD 4000, Australia; QUASR/ARC Industrial Transformation Training Centre-Joint Biomechanics, Queensland University of Technology, Brisbane, QLD 4000, Australia
| |
Collapse
|
11
|
Yang Y, Huang H, Shao Y, Chen B. DAC-Net: A light-weight U-shaped network based efficient convolution and attention for thyroid nodule segmentation. Comput Biol Med 2024; 180:108972. [PMID: 39126790 DOI: 10.1016/j.compbiomed.2024.108972] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2024] [Revised: 05/06/2024] [Accepted: 07/29/2024] [Indexed: 08/12/2024]
Abstract
Recently, there has been a focused effort to improve the efficiency of thyroid nodule segmentation algorithms. This endeavor has resulted in the development of increasingly complex modules, such as the Transformer, leading to models with a higher number of parameters and computing requirements. Sophisticated models have difficulties in being implemented in clinical medicine platforms because of limited resources. DAC-Net is a Lightweight U-shaped network created to achieve high performance in segmenting thyroid nodules. Our method consists of three main components: DWSE, which combines depthwise convolution and squeeze-excitation block to enhance feature extraction and connections between samples; ADA, which includes Split Atrous and Dual Attention to extract global and local feature information from various viewpoints; and CSSC, which involves channel- scale and spatial-scale connections. This module enables the fusing of multi-stage features at global and local levels, producing feature maps at different channel and geographical scales, delivering a streamlined integration of multi-scale information. Combining these three components in our U- shaped design allows us to achieve competitive performance while also decreasing the number of parameters and computing complexity. Several experiments were conducted on the DDTI and TN3K datasets. The experimental results demonstrate that our model outperforms state-of-the-art thyroid nodule segmentation architectures in terms of segmentation performance. Our model not only reduces the number of parameters and computing expenses by 73x and 56x, respectively, but also exceeds TransUNet in segmentation performance. The source code is accessible at https://github.com/Phil-y/DAC-Net.
Collapse
Affiliation(s)
- Yingwei Yang
- College of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou 325000, China
| | - Haiguang Huang
- College of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou 325000, China.
| | - Yingsheng Shao
- College of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou 325000, China
| | - Beilei Chen
- Department of Ultrasonic Imaging, Wenzhou Central Hospital, Wenzhou 325000, China
| |
Collapse
|
12
|
Huynh BN, Groendahl AR, Tomic O, Liland KH, Knudtsen IS, Hoebers F, van Elmpt W, Dale E, Malinen E, Futsaether CM. Deep learning with uncertainty estimation for automatic tumor segmentation in PET/CT of head and neck cancers: impact of model complexity, image processing and augmentation. Biomed Phys Eng Express 2024; 10:055038. [PMID: 39127060 DOI: 10.1088/2057-1976/ad6dcd] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2024] [Accepted: 08/09/2024] [Indexed: 08/12/2024]
Abstract
Objective.Target volumes for radiotherapy are usually contoured manually, which can be time-consuming and prone to inter- and intra-observer variability. Automatic contouring by convolutional neural networks (CNN) can be fast and consistent but may produce unrealistic contours or miss relevant structures. We evaluate approaches for increasing the quality and assessing the uncertainty of CNN-generated contours of head and neck cancers with PET/CT as input.Approach.Two patient cohorts with head and neck squamous cell carcinoma and baseline18F-fluorodeoxyglucose positron emission tomography and computed tomography images (FDG-PET/CT) were collected retrospectively from two centers. The union of manual contours of the gross primary tumor and involved nodes was used to train CNN models for generating automatic contours. The impact of image preprocessing, image augmentation, transfer learning and CNN complexity, architecture, and dimension (2D or 3D) on model performance and generalizability across centers was evaluated. A Monte Carlo dropout technique was used to quantify and visualize the uncertainty of the automatic contours.Main results. CNN models provided contours with good overlap with the manually contoured ground truth (median Dice Similarity Coefficient: 0.75-0.77), consistent with reported inter-observer variations and previous auto-contouring studies. Image augmentation and model dimension, rather than model complexity, architecture, or advanced image preprocessing, had the largest impact on model performance and cross-center generalizability. Transfer learning on a limited number of patients from a separate center increased model generalizability without decreasing model performance on the original training cohort. High model uncertainty was associated with false positive and false negative voxels as well as low Dice coefficients.Significance.High quality automatic contours can be obtained using deep learning architectures that are not overly complex. Uncertainty estimation of the predicted contours shows potential for highlighting regions of the contour requiring manual revision or flagging segmentations requiring manual inspection and intervention.
Collapse
Affiliation(s)
- Bao Ngoc Huynh
- Faculty of Science and Technology, Norwegian University of Life Sciences, Ås, Norway
| | - Aurora Rosvoll Groendahl
- Faculty of Science and Technology, Norwegian University of Life Sciences, Ås, Norway
- Section of Oncology, Vestre Viken Hospital Trust, Drammen, Norway
| | - Oliver Tomic
- Faculty of Science and Technology, Norwegian University of Life Sciences, Ås, Norway
| | - Kristian Hovde Liland
- Faculty of Science and Technology, Norwegian University of Life Sciences, Ås, Norway
| | - Ingerid Skjei Knudtsen
- Department of Circulation and Medical Imaging, Norwegian University of Science and Technology, Trondheim, Norway
| | - Frank Hoebers
- Department of Radiation Oncology (MAASTRO), GROW School for Oncology and Reproduction, Maastricht, Netherlands
| | - Wouter van Elmpt
- Department of Radiation Oncology (MAASTRO), GROW School for Oncology and Reproduction, Maastricht, Netherlands
| | - Einar Dale
- Department of Oncology, Oslo University Hospital, Oslo, Norway
| | - Eirik Malinen
- Department of Medical Physics, Oslo University Hospital, Oslo, Norway
- Department of Physics, University of Oslo, Oslo, Norway
| | | |
Collapse
|
13
|
Huang B, Li H, Fujita H, Sun X, Fang Z, Wang H, Su B. G-MBRMD: Lightweight liver segmentation model based on guided teaching with multi-head boundary reconstruction mapping distillation. Comput Biol Med 2024; 178:108733. [PMID: 38897144 DOI: 10.1016/j.compbiomed.2024.108733] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2024] [Revised: 05/03/2024] [Accepted: 06/08/2024] [Indexed: 06/21/2024]
Abstract
BACKGROUND AND OBJECTIVES Liver segmentation is pivotal for the quantitative analysis of liver cancer. Although current deep learning methods have garnered remarkable achievements for medical image segmentation, they come with high computational costs, significantly limiting their practical application in the medical field. Therefore, the development of an efficient and lightweight liver segmentation model becomes particularly important. METHODS In our paper, we propose a real-time, lightweight liver segmentation model named G-MBRMD. Specifically, we employ a Transformer-based complex model as the teacher and a convolution-based lightweight model as the student. By introducing proposed multi-head mapping and boundary reconstruction strategies during the knowledge distillation process, Our method effectively guides the student model to gradually comprehend and master the global boundary processing capabilities of the complex teacher model, significantly enhancing the student model's segmentation performance without adding any computational complexity. RESULTS On the LITS dataset, we conducted rigorous comparative and ablation experiments, four key metrics were used for evaluation, including model size, inference speed, Dice coefficient, and HD95. Compared to other methods, our proposed model achieved an average Dice coefficient of 90.14±16.78%, with only 0.6 MB memory and 0.095 s inference speed for a single image on a standard CPU. Importantly, this approach improved the average Dice coefficient of the baseline student model by 1.64% without increasing computational complexity. CONCLUSION The results demonstrate that our method successfully realizes the unification of segmentation precision and lightness, and greatly enhances its potential for widespread application in practical settings.
Collapse
Affiliation(s)
- Bo Huang
- School of Electronic and Electrical Engineering, Shanghai University of Engineering Science, Shanghai 201620, China
| | - Hongxu Li
- School of Electronic and Electrical Engineering, Shanghai University of Engineering Science, Shanghai 201620, China
| | - Hamido Fujita
- Malaysia-Japan International Institute of Technology(MJIIT), Universiti Teknologi Malaysia, 54100 Kuala Lumpur, Malaysia; Andalusian Research Institute in Data Science and Computational Intelligence(DaSCI), University of Granada, Granada, Spain; Iwate Prefectural University, Iwate, Japan.
| | - Xiaoning Sun
- School of Electronic and Electrical Engineering, Shanghai University of Engineering Science, Shanghai 201620, China
| | | | - Hailing Wang
- School of Electronic and Electrical Engineering, Shanghai University of Engineering Science, Shanghai 201620, China
| | - Bo Su
- Central Laboratory, Shanghai Pulmonary Hospital, School of Medicine, Tongji University, China
| |
Collapse
|
14
|
Ou J, Jiang L, Bai T, Zhan P, Liu R, Xiao H. ResTransUnet: An effective network combined with Transformer and U-Net for liver segmentation in CT scans. Comput Biol Med 2024; 177:108625. [PMID: 38823365 DOI: 10.1016/j.compbiomed.2024.108625] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2023] [Revised: 04/15/2024] [Accepted: 05/18/2024] [Indexed: 06/03/2024]
Abstract
Liver segmentation is a fundamental prerequisite for the diagnosis and surgical planning of hepatocellular carcinoma. Traditionally, the liver contour is drawn manually by radiologists using a slice-by-slice method. However, this process is time-consuming and error-prone, depending on the radiologist's experience. In this paper, we propose a new end-to-end automatic liver segmentation framework, named ResTransUNet, which exploits the transformer's ability to capture global context for remote interactions and spatial relationships, as well as the excellent performance of the original U-Net architecture. The main contribution of this paper lies in proposing a novel fusion network that combines Unet and Transformer architectures. In the encoding structure, a dual-path approach is utilized, where features are extracted separately using both convolutional neural networks (CNNs) and Transformer networks. Additionally, an effective feature enhancement unit is designed to transfer the global features extracted by the Transformer network to the CNN for feature enhancement. This model aims to address the drawbacks of traditional Unet-based methods, such as feature loss during encoding and poor capture of global features. Moreover, it avoids the disadvantages of pure Transformer models, which suffer from large parameter sizes and high computational complexity. The experimental results on the LiTS2017 dataset demonstrate remarkable performance for our proposed model, with Dice coefficients, volumetric overlap error (VOE), and relative volume difference (RVD) values for liver segmentation reaching 0.9535, 0.0804, and -0.0007, respectively. Furthermore, to further validate the model's generalization capability, we conducted tests on the 3Dircadb, Chaos, and Sliver07 datasets. The experimental results demonstrate that the proposed method outperforms other closely related models with higher liver segmentation accuracy. In addition, significant improvements can be achieved by applying our method when handling liver segmentation with small and discontinuous liver regions, as well as blurred liver boundaries. The code is available at the website: https://github.com/Jouiry/ResTransUNet.
Collapse
Affiliation(s)
- Jiajie Ou
- School of Artificial Intelligence, Chongqing University of Technology, Chongqing, China
| | - Linfeng Jiang
- School of Artificial Intelligence, Chongqing University of Technology, Chongqing, China; School of Computing and College of Design and Engineering, National University of Singapore, Singapore.
| | - Ting Bai
- School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm, Sweden
| | - Peidong Zhan
- School of Artificial Intelligence, Chongqing University of Technology, Chongqing, China
| | - Ruihua Liu
- School of Artificial Intelligence, Chongqing University of Technology, Chongqing, China
| | - Hanguang Xiao
- School of Artificial Intelligence, Chongqing University of Technology, Chongqing, China
| |
Collapse
|
15
|
Xuan P, Chu X, Cui H, Nakaguchi T, Wang L, Ning Z, Ning Z, Li C, Zhang T. Multi-view attribute learning and context relationship encoding enhanced segmentation of lung tumors from CT images. Comput Biol Med 2024; 177:108640. [PMID: 38833798 DOI: 10.1016/j.compbiomed.2024.108640] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2023] [Revised: 04/25/2024] [Accepted: 05/18/2024] [Indexed: 06/06/2024]
Abstract
Graph convolutional neural networks (GCN) have shown the promise in medical image segmentation due to the flexibility of representing diverse range of image regions using graph nodes and propagating knowledge via graph edges. However, existing methods did not fully exploit the various attributes of image nodes and the context relationship among their attributes. We propose a new segmentation method with multi-similarity view enhancement and node attribute context learning (MNSeg). First, multiple views were formed by measuring the similarities among the image nodes, and MNSeg has a GCN based multi-view image node attribute learning (MAL) module to integrate various node attributes learnt from multiple similarity views. Each similarity view contains the specific similarities among all the image nodes, and it was integrated with the node attributes from all the channels to form the enhanced attributes of image nodes. Second, the context relationships among the attributes of image nodes are formulated by a transformer-based context relationship encoding (CRE) strategy to propagate these relationships across all the image nodes. During the transformer-based learning, the relationships were estimated based on the self-attention on all the image nodes, and then they were encoded into the learned node features. Finally, we design an attention at attribute category level (ACA) to discriminate and fuse the learnt diverse information from MAL, CRE, and the original node attributes. ACA identifies the more informative attribute categories by adaptively learn their importance. We validate the performance of MNSeg on a public lung tumor CT dataset and an in-house non-small cell lung cancer (NSCLC) dataset collected from the hospital. The segmentation results show that MNSeg outperformed the compared segmentation methods in terms of spatial overlap and the shape similarities. The ablation studies demonstrated the effectiveness of MAL, CRE, and ACA. The generalization ability of MNSeg was proved by the consistent improved segmentation performances using different 3D segmentation backbones.
Collapse
Affiliation(s)
- Ping Xuan
- Department of Computer Science and Technology, Shantou University, Shantou, China; School of Computer Science and Technology, Heilongjiang University, Harbin, China
| | - Xiuqiang Chu
- School of Computer Science and Technology, Heilongjiang University, Harbin, China
| | - Hui Cui
- Department of Computer Science and Information Technology, La Trobe University, Melbourne, Australia
| | - Toshiya Nakaguchi
- Center for Frontier Medical Engineering, Chiba University, Chiba, Japan
| | - Linlin Wang
- Department of Radiation Oncology, Shandong First Medical University and Shandong Academy of Medical Sciences, Jinan, China
| | - Zhiyuan Ning
- School of Electrical and Information Engineering, The University of Sydney, Sydney, Australia
| | - Zhiyu Ning
- School of Electrical and Information Engineering, The University of Sydney, Sydney, Australia
| | | | - Tiangang Zhang
- School of Computer Science and Technology, Heilongjiang University, Harbin, China; School of Mathematical Science, Heilongjiang University, Harbin, China.
| |
Collapse
|
16
|
Song W, Yu H, Wu J. PLU-Net: Extraction of multiscale feature fusion. Med Phys 2024; 51:2733-2740. [PMID: 38010657 DOI: 10.1002/mp.16840] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2022] [Revised: 10/11/2023] [Accepted: 10/30/2023] [Indexed: 11/29/2023] Open
Abstract
In recent years, deep learning algorithms have achieved remarkable results in medical image segmentation. These networks with an enormous number of parameters often encounter challenges in handling image boundaries and details, which may result in suboptimal segmentation results. To solve the problem, we develop atrous spatial pyramid pooling (ASPP) and combine it with the squeeze-and-excitation block (SE block), as well as present the PS module, which employs a broader and multiscale receptive field at the network's bottom to obtain more detailed semantic information. We also propose the local guided block (LG block) and also its combination with the SE block to form the LS block, which can obtain more abundant local features in the feature map, so that more edge information can be retained in each down sampling process, thereby improving the performance of boundary segmentation. We propose PLU-Net and integrate our PS module and LS block into U-Net. We put our PLU-Net to the test on three benchmark datasets, and the results show that by fewer parameters and FLOPs, it outperforms on medical semantic segmentation tasks.
Collapse
Affiliation(s)
- Weihu Song
- School of Computer Science and Engineering, Beihang University, Beijing, China
| | - Heng Yu
- Robotics Institute, Carnegie Mellon University, Pittsburgh, Pennsylvania, USA
| | - Jianhua Wu
- College of Artificial Intelligence, Nankai University, Tianjing, China
| |
Collapse
|
17
|
Xie Z, Lu Q, Guo J, Lin W, Ge G, Tang Y, Pasini D, Wang W. Semantic segmentation for tooth cracks using improved DeepLabv3+ model. Heliyon 2024; 10:e25892. [PMID: 38380020 PMCID: PMC10877285 DOI: 10.1016/j.heliyon.2024.e25892] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2024] [Revised: 02/01/2024] [Accepted: 02/05/2024] [Indexed: 02/22/2024] Open
Abstract
Objective Accurate and prompt detection of cracked teeth plays a critical role for human oral health. The aim of this paper is to evaluate the performance of a tooth crack segmentation model (namely, FDB-DeepLabv3+) on optical microscopic images. Method The FDB-DeepLabv3+ model proposed here improves feature learning by replacing the backbone with ResNet50. Feature pyramid network (FPN) is introduced to fuse muti-level features. Densely linked atrous spatial pyramid pooling (Dense ASPP) is applied to achieve denser pixel sampling and wider receptive field. Bottleneck attention module (BAM) is embedded to enhance local feature extraction. Results Through testing on a self-made hidden cracked tooth dataset, the proposed method outperforms four classical networks (FCN, U-Net, SegNet, DeepLabv3+) on segmentation results in terms of mean pixel accuracy (MPA) and mean intersection over union (MIoU). The network achieves an increase of 11.41% in MPA and 12.14% in MIoU compared to DeepLabv3+. Ablation experiments shows that all the modifications are beneficial. Conclusion An improved network is designed for segmenting tooth surface cracks with good overall performance and robustness, which may hold significant potential in computer-aided diagnosis of cracked teeth.
Collapse
Affiliation(s)
- Zewen Xie
- School of Mechanical and Electrical Engineering, Guangzhou University, Guangzhou, 510006, China
- School of Physics and Material Science, Guangzhou University, Guangzhou, 510006, China
| | - Qilin Lu
- School of Mechanical and Electrical Engineering, Guangzhou University, Guangzhou, 510006, China
| | - Juncheng Guo
- School of Mechanical and Electrical Engineering, Guangzhou University, Guangzhou, 510006, China
| | - Weiren Lin
- School of Mechanical and Electrical Engineering, Guangzhou University, Guangzhou, 510006, China
| | - Guanghua Ge
- Department of Dentistry, Hospital of Guangdong University of Technology, Guangdong University of Technology, Guangzhou, 510006, China
| | - Yadong Tang
- School of Biomedical and Pharmaceutical Sciences, Guangdong University of Technology, Guangzhou, 510006, China
| | - Damiano Pasini
- Department of Mechanical Engineering, McGill University, 817 Sherbrooke Street West, Montreal, QC H3A 0C3, Canada
| | - Wenlong Wang
- School of Mechanical and Electrical Engineering, Guangzhou University, Guangzhou, 510006, China
- Department of Mechanical Engineering, McGill University, 817 Sherbrooke Street West, Montreal, QC H3A 0C3, Canada
| |
Collapse
|
18
|
Jiang X, Zheng H, Yuan Z, Lan K, Wu Y. HIMS-Net: Horizontal-vertical interaction and multiple side-outputs network for cyst segmentation in jaw images. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2024; 21:4036-4055. [PMID: 38549317 DOI: 10.3934/mbe.2024178] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/02/2024]
Abstract
Jaw cysts are mainly caused by abnormal tooth development, chronic oral inflammation, or jaw damage, which may lead to facial swelling, deformity, tooth loss, and other symptoms. Due to the diversity and complexity of cyst images, deep-learning algorithms still face many difficulties and challenges. In response to these problems, we present a horizontal-vertical interaction and multiple side-outputs network for cyst segmentation in jaw images. First, the horizontal-vertical interaction mechanism facilitates complex communication paths in the vertical and horizontal dimensions, and it has the ability to capture a wide range of context dependencies. Second, the feature-fused unit is introduced to adjust the network's receptive field, which enhances the ability of acquiring multi-scale context information. Third, the multiple side-outputs strategy intelligently combines feature maps to generate more accurate and detailed change maps. Finally, experiments were carried out on the self-established jaw cyst dataset and compared with different specialist physicians to evaluate its clinical usability. The research results indicate that the Matthews correlation coefficient (Mcc), Dice, and Jaccard of HIMS-Net were 93.61, 93.66 and 88.10% respectively, which may contribute to rapid and accurate diagnosis in clinical practice.
Collapse
Affiliation(s)
- Xiaoliang Jiang
- College of Mechanical Engineering, Quzhou University, Quzhou 324000, China
| | - Huixia Zheng
- Department of Stomatology, The Quzhou Affiliated Hospital of Wenzhou Medical University, Quzhou People's Hospital, Quzhou 324000, China
| | - Zhenfei Yuan
- Department of Stomatology, The Quzhou Affiliated Hospital of Wenzhou Medical University, Quzhou People's Hospital, Quzhou 324000, China
| | - Kun Lan
- College of Mechanical Engineering, Quzhou University, Quzhou 324000, China
| | - Yaoyang Wu
- Department of Computer and Information Science, University of Macau, Macau 999078, China
| |
Collapse
|
19
|
Chen T, You W, Zhang L, Ye W, Feng J, Lu J, Lv J, Tang Y, Wei D, Gui S, Jiang J, Wang Z, Wang Y, Zhao Q, Zhang Y, Qu J, Li C, Jiang Y, Zhang X, Li Y, Guan S. Automated anatomical labeling of the intracranial arteries via deep learning in computed tomography angiography. Front Physiol 2024; 14:1310357. [PMID: 38239880 PMCID: PMC10794642 DOI: 10.3389/fphys.2023.1310357] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Accepted: 11/28/2023] [Indexed: 01/22/2024] Open
Abstract
Background and purpose: Anatomical labeling of the cerebral vasculature is a crucial topic in determining the morphological nature and characterizing the vital variations of vessels, yet precise labeling of the intracranial arteries is time-consuming and challenging, given anatomical structural variability and surging imaging data. We present a U-Net-based deep learning (DL) model to automatically label detailed anatomical segments in computed tomography angiography (CTA) for the first time. The trained DL algorithm was further tested on a clinically relevant set for the localization of intracranial aneurysms (IAs). Methods: 457 examinations with varying degrees of arterial stenosis were used to train, validate, and test the model, aiming to automatically label 42 segments of the intracranial arteries [e.g., 7 segments of the internal carotid artery (ICA)]. Evaluation metrics included Dice similarity coefficient (DSC), mean surface distance (MSD), and Hausdorff distance (HD). Additionally, 96 examinations containing at least one IA were enrolled to assess the model's potential in enhancing clinicians' precision in IA localization. A total of 5 clinicians with different experience levels participated as readers in the clinical experiment and identified the precise location of IA without and with algorithm assistance, where there was a washout period of 14 days between two interpretations. The diagnostic accuracy, time, and mean interrater agreement (Fleiss' Kappa) were calculated to assess the differences in clinical performance of clinicians. Results: The proposed model exhibited notable labeling performance on 42 segments that included 7 anatomical segments of ICA, with the mean DSC of 0.88, MSD of 0.82 mm and HD of 6.59 mm. Furthermore, the model demonstrated superior labeling performance in healthy subjects compared to patients with stenosis (DSC: 0.91 vs. 0.89, p < 0.05; HD: 4.75 vs. 6.19, p < 0.05). Concurrently, clinicians with model predictions achieved significant improvements when interpreting the precise location of IA. The clinicians' mean accuracy increased by 0.04 (p = 0.003), mean time to diagnosis reduced by 9.76 s (p < 0.001), and mean interrater agreement (Fleiss' Kappa) increased by 0.07 (p = 0.029). Conclusion: Our model stands proficient for labeling intracranial arteries using the largest CTA dataset. Crucially, it demonstrates clinical utility, helping prioritize the patients with high risks and ease clinical workload.
Collapse
Affiliation(s)
- Ting Chen
- Department of Interventional Neuroradiology, Beijing Tiantan Hospital and Beijing Neurosurgical Institute, Capital Medical University, Beijing, China
- School of Biomedical Engineering, Capital Medical University, Beijing, China
| | - Wei You
- Department of Interventional Neuroradiology, Beijing Tiantan Hospital and Beijing Neurosurgical Institute, Capital Medical University, Beijing, China
- Department of Neurointerventional Engineering and Technology, Beijing Engineering Research Center (NO: BG0287), Beijing, China
| | - Liyuan Zhang
- China National Clinical Research Center for Neurological Diseases, Beijing, China
| | - Wanxing Ye
- China National Clinical Research Center for Neurological Diseases, Beijing, China
| | - Junqiang Feng
- Department of Neurosurgery, Beijing Chaoyang Hospital, Capital Medical University, Beijing, China
| | - Jing Lu
- Department of Radiology, Third Medical Center of Chinese PLA General Hospital, Beijing, China
| | - Jian Lv
- Department of Interventional Neuroradiology, Beijing Tiantan Hospital and Beijing Neurosurgical Institute, Capital Medical University, Beijing, China
| | - Yudi Tang
- Department of Interventional Neuroradiology, Beijing Tiantan Hospital and Beijing Neurosurgical Institute, Capital Medical University, Beijing, China
| | - Dachao Wei
- Department of Interventional Neuroradiology, Beijing Tiantan Hospital and Beijing Neurosurgical Institute, Capital Medical University, Beijing, China
| | - Siming Gui
- Department of Interventional Neuroradiology, Beijing Tiantan Hospital and Beijing Neurosurgical Institute, Capital Medical University, Beijing, China
| | - Jia Jiang
- Department of Interventional Neuroradiology, Beijing Tiantan Hospital and Beijing Neurosurgical Institute, Capital Medical University, Beijing, China
| | - Ziyao Wang
- Department of Interventional Neuroradiology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan, China
| | - Yanwen Wang
- China National Clinical Research Center for Neurological Diseases, Beijing, China
| | - Qi Zhao
- China National Clinical Research Center for Neurological Diseases, Beijing, China
| | - Yifan Zhang
- China National Clinical Research Center for Neurological Diseases, Beijing, China
| | - Junda Qu
- School of Biomedical Engineering, Capital Medical University, Beijing, China
- Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, Beijing, China
| | - Chunlin Li
- School of Biomedical Engineering, Capital Medical University, Beijing, China
- Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, Beijing, China
| | - Yuhua Jiang
- Department of Interventional Neuroradiology, Beijing Tiantan Hospital and Beijing Neurosurgical Institute, Capital Medical University, Beijing, China
| | - Xu Zhang
- School of Biomedical Engineering, Capital Medical University, Beijing, China
- Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, Beijing, China
| | - Youxiang Li
- Department of Interventional Neuroradiology, Beijing Tiantan Hospital and Beijing Neurosurgical Institute, Capital Medical University, Beijing, China
- School of Biomedical Engineering, Capital Medical University, Beijing, China
- Department of Neurointerventional Engineering and Technology, Beijing Engineering Research Center (NO: BG0287), Beijing, China
| | - Sheng Guan
- Department of Interventional Neuroradiology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan, China
| |
Collapse
|
20
|
Zhang Q, Cheng J, Zhou C, Jiang X, Zhang Y, Zeng J, Liu L. PDC-Net: parallel dilated convolutional network with channel attention mechanism for pituitary adenoma segmentation. Front Physiol 2023; 14:1259877. [PMID: 37711463 PMCID: PMC10498772 DOI: 10.3389/fphys.2023.1259877] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Accepted: 08/16/2023] [Indexed: 09/16/2023] Open
Abstract
Accurate segmentation of the medical image is the basis and premise of intelligent diagnosis and treatment, which has a wide range of clinical application value. However, the robustness and effectiveness of medical image segmentation algorithms remains a challenging subject due to the unbalanced categories, blurred boundaries, highly variable anatomical structures and lack of training samples. For this reason, we present a parallel dilated convolutional network (PDC-Net) to address the pituitary adenoma segmentation in magnetic resonance imaging images. Firstly, the standard convolution block in U-Net is replaced by a basic convolution operation and a parallel dilated convolutional module (PDCM), to extract the multi-level feature information of different dilations. Furthermore, the channel attention mechanism (CAM) is integrated to enhance the ability of the network to distinguish between lesions and non-lesions in pituitary adenoma. Then, we introduce residual connections at each layer of the encoder-decoder, which can solve the problem of gradient disappearance and network performance degradation caused by network deepening. Finally, we employ the dice loss to deal with the class imbalance problem in samples. By testing on the self-established patient dataset from Quzhou People's Hospital, the experiment achieves 90.92% of Sensitivity, 99.68% of Specificity, 88.45% of Dice value and 79.43% of Intersection over Union (IoU).
Collapse
Affiliation(s)
- Qile Zhang
- Department of Rehabilitation, The Quzhou Affiliated Hospital of Wenzhou Medical University, Quzhou People’s Hospital, Quzhou, China
| | - Jianzhen Cheng
- Department of Rehabilitation, Quzhou Third Hospital, Quzhou, China
| | - Chun Zhou
- Department of Rehabilitation, The Quzhou Affiliated Hospital of Wenzhou Medical University, Quzhou People’s Hospital, Quzhou, China
| | - Xiaoliang Jiang
- College of Mechanical Engineering, Quzhou University, Quzhou, China
| | - Yuanxiang Zhang
- College of Mechanical Engineering, Quzhou University, Quzhou, China
| | - Jiantao Zeng
- College of Mechanical Engineering, Quzhou University, Quzhou, China
| | - Li Liu
- Department of Thyroid and Breast Surgery, Kecheng District People’s Hospital, Quzhou, China
| |
Collapse
|
21
|
Qin C, Zheng B, Zeng J, Chen Z, Zhai Y, Genovese A, Piuri V, Scotti F. Dynamically aggregating MLPs and CNNs for skin lesion segmentation with geometry regularization. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2023; 238:107601. [PMID: 37210926 DOI: 10.1016/j.cmpb.2023.107601] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/14/2022] [Revised: 04/24/2023] [Accepted: 05/13/2023] [Indexed: 05/23/2023]
Abstract
BACKGROUND AND OBJECTIVE Melanoma is a highly malignant skin tumor. Accurate segmentation of skin lesions from dermoscopy images is pivotal for computer-aided diagnosis of melanoma. However, blurred lesion boundaries, variable lesion shapes, and other interference factors pose a challenge in this regard. METHODS This work proposes a novel framework called CFF-Net (Cross Feature Fusion Network) for supervised skin lesion segmentation. The encoder of the network includes dual branches, where the CNNs branch aims to extract rich local features while MLPs branch is used to establish both the global-spatial-dependencies and global-channel-dependencies for precise delineation of skin lesions. Besides, a feature-interaction module between two branches is designed for strengthening the feature representation by allowing dynamic exchange of spatial and channel information, so as to retain more spatial details and inhibit irrelevant noise. Moreover, an auxiliary prediction task is introduced to learn the global geometric information, highlighting the boundary of the skin lesion. RESULTS Comprehensive experiments using four publicly available skin lesion datasets (i.e., ISIC 2018, ISIC 2017, ISIC 2016, and PH2) indicated that CFF-Net outperformed the state-of-the-art models. In particular, CFF-Net greatly increased the average Jaccard Index score from 79.71% to 81.86% in ISIC 2018, from 78.03% to 80.21% in ISIC 2017, from 82.58% to 85.38% in ISIC 2016, and from 84.18% to 89.71% in PH2 compared with U-Net. Ablation studies demonstrated the effectiveness of each proposed component. Cross-validation experiments in ISIC 2018 and PH2 datasets verified the generalizability of CFF-Net under different skin lesion data distributions. Finally, comparison experiments using three public datasets demonstrated the superior performance of our model. CONCLUSION The proposed CFF-Net performed well in four public skin lesion datasets, especially for challenging cases with blurred edges of skin lesions and low contrast between skin lesions and background. CFF-Net can be employed for other segmentation tasks with better prediction and more accurate delineation of boundaries.
Collapse
Affiliation(s)
- Chuanbo Qin
- Faculty of Intelligent Manufacturing, Wuyi University, Jiangmen 529020, China
| | - Bin Zheng
- Faculty of Intelligent Manufacturing, Wuyi University, Jiangmen 529020, China
| | - Junying Zeng
- Faculty of Intelligent Manufacturing, Wuyi University, Jiangmen 529020, China.
| | - Zhuyuan Chen
- Faculty of Intelligent Manufacturing, Wuyi University, Jiangmen 529020, China
| | - Yikui Zhai
- Faculty of Intelligent Manufacturing, Wuyi University, Jiangmen 529020, China
| | - Angelo Genovese
- Departimento di Information, Università degli Studi di Milano, 20133 Milano, Italy
| | - Vincenzo Piuri
- Departimento di Information, Università degli Studi di Milano, 20133 Milano, Italy
| | - Fabio Scotti
- Departimento di Information, Università degli Studi di Milano, 20133 Milano, Italy
| |
Collapse
|
22
|
Wang H, Wang ZM, Cui XT, Li L. TDS-U-Net: Automatic liver and tumor separate segmentation of CT volumes using attention gates1. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2023. [DOI: 10.3233/jifs-221111] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/12/2023]
Abstract
Considering the heterogeneity, diffusive shape, and complex background of tumors, automatic segmentation of hepatic lesions in computed tomography (CT) images has been considered a challenging task. The performance of existing methods remains subject to segmentation uncertainties, especially in tumor boundary regions. The pixel information in these regions will be affected by both sides, thereby exposing the segmentation results to missing marks. To this end, a new network architecture named Two Direction Segmentation U-Net (TDS-U-Net) is hereby designed based on the classic Attention U-Net to tackle this problem. As the most important blocks of the Attention U-Net network, attention gates (AGs) focus on the target structures of different shapes and sizes. In the last layer of TDS-U-Net, two dichotomous convolutional networks are applied to obtain the segmentation maps of the liver and the tumor respectively. Superimposing two segmented maps to obtain the final image addresses the above problems. The entire structure has been verified on two widely accepted public CT datasets, LiTS17 and KiTS19. Compared with the state of the art, this method exhibits superior performance and excellent shape extractions with high detection sensitivity, perfectly demonstrating its effectiveness in medical image segmentation.
Collapse
Affiliation(s)
- Hua Wang
- School of Mechatronic Engineering and Automation, Shanghai University, Shanghai, P. R. China
- DewertOKIN Technology Group Co, Ltd, Jiaxing, P. R. China
- Bewatec(Zhejiang) Medical Equipment Co., Ltd. Jiaxing, P. R. China
| | - Zhi-Ming Wang
- School of Mechatronic Engineering and Automation, Shanghai University, Shanghai, P. R. China
| | - Xiu-Tao Cui
- DewertOKIN Technology Group Co, Ltd, Jiaxing, P. R. China
- Bewatec(Zhejiang) Medical Equipment Co., Ltd. Jiaxing, P. R. China
| | - Long Li
- DewertOKIN Technology Group Co, Ltd, Jiaxing, P. R. China
| |
Collapse
|
23
|
Liu J, Yan Z, Zhou C, Shao L, Han Y, Song Y. mfeeU-Net: A multi-scale feature extraction and enhancement U-Net for automatic liver segmentation from CT Images. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2023; 20:7784-7801. [PMID: 37161172 DOI: 10.3934/mbe.2023336] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
Medical image segmentation of the liver is an important prerequisite for clinical diagnosis and evaluation of liver cancer. For automatic liver segmentation from Computed Tomography (CT) images, we proposed a Multi-scale Feature Extraction and Enhancement U-Net (mfeeU-Net), incorporating Res2Net blocks, Squeeze-and-Excitation (SE) blocks, and Edge Attention (EA) blocks. The Res2Net blocks which are conducive to extracting multi-scale features of the liver were used as the backbone of the encoder, while the SE blocks were also added to the encoder to enhance channel information. The EA blocks were introduced to skip connections between the encoder and the decoder, to facilitate the detection of blurred liver edges where the intensities of nearby organs are close to the liver. The proposed mfeeU-Net was trained and evaluated using a publicly available CT dataset of LiTS2017. The average dice similarity coefficient, intersection-over-union ratio, and sensitivity of the mfeeU-Net for liver segmentation were 95.32%, 91.67%, and 95.53%, respectively, and all these metrics were better than those of U-Net, Res-U-Net, and Attention U-Net. The experimental results demonstrate that the mfeeU-Net can compete with and even outperform recently proposed convolutional neural networks and effectively overcome challenges, such as discontinuous liver regions and fuzzy liver boundaries.
Collapse
Affiliation(s)
- Jun Liu
- Department of Information Engineering, Nanchang Hangkong University, Nanchang 330063, Jiangxi, China
| | - Zhenhua Yan
- Department of Information Engineering, Nanchang Hangkong University, Nanchang 330063, Jiangxi, China
| | - Chaochao Zhou
- Department of Radiology, Feinberg School of Medicine, Northwestern University, Chicago 60611, Illinois, U.S
| | - Liren Shao
- Department of Information Engineering, Nanchang Hangkong University, Nanchang 330063, Jiangxi, China
| | - Yuanyuan Han
- Department of Information Engineering, Nanchang Hangkong University, Nanchang 330063, Jiangxi, China
| | - Yusheng Song
- Interventional Radiology, The People's Hospital of Ganzhou, Ganzhou 34100, Jiangxi, China
| |
Collapse
|
24
|
Vijay S, Guhan T, Srinivasan K, Vincent PMDR, Chang CY. MRI brain tumor segmentation using residual Spatial Pyramid Pooling-powered 3D U-Net. Front Public Health 2023; 11:1091850. [PMID: 36817919 PMCID: PMC9932049 DOI: 10.3389/fpubh.2023.1091850] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Accepted: 01/16/2023] [Indexed: 02/04/2023] Open
Abstract
Brain tumor diagnosis has been a lengthy process, and automation of a process such as brain tumor segmentation speeds up the timeline. U-Nets have been a commonly used solution for semantic segmentation, and it uses a downsampling-upsampling approach to segment tumors. U-Nets rely on residual connections to pass information during upsampling; however, an upsampling block only receives information from one downsampling block. This restricts the context and scope of an upsampling block. In this paper, we propose SPP-U-Net where the residual connections are replaced with a combination of Spatial Pyramid Pooling (SPP) and Attention blocks. Here, SPP provides information from various downsampling blocks, which will increase the scope of reconstruction while attention provides the necessary context by incorporating local characteristics with their corresponding global dependencies. Existing literature uses heavy approaches such as the usage of nested and dense skip connections and transformers. These approaches increase the training parameters within the model which therefore increase the training time and complexity of the model. The proposed approach on the other hand attains comparable results to existing literature without changing the number of trainable parameters over larger dimensions such as 160 × 192 × 192. All in all, the proposed model scores an average dice score of 0.883 and a Hausdorff distance of 7.84 on Brats 2021 cross validation.
Collapse
Affiliation(s)
- Sanchit Vijay
- School of Electronics Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu, India
| | - Thejineaswar Guhan
- School of Information Technology and Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu, India
| | - Kathiravan Srinivasan
- School of Computer Science and Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu, India
| | - P. M. Durai Raj Vincent
- School of Information Technology and Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu, India
| | - Chuan-Yu Chang
- Department of Computer Science and Information Engineering, National Yunlin University of Science and Technology, Yunlin, Taiwan
- Service Systems Technology Center, Industrial Technology Research Institute, Hsinchu, Taiwan
| |
Collapse
|
25
|
Balasubramanian PK, Lai WC, Seng GH, C K, Selvaraj J. APESTNet with Mask R-CNN for Liver Tumor Segmentation and Classification. Cancers (Basel) 2023; 15:cancers15020330. [PMID: 36672281 PMCID: PMC9857237 DOI: 10.3390/cancers15020330] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Revised: 12/21/2022] [Accepted: 12/30/2022] [Indexed: 01/06/2023] Open
Abstract
Diagnosis and treatment of hepatocellular carcinoma or metastases rely heavily on accurate segmentation and classification of liver tumours. However, due to the liver tumor's hazy borders and wide range of possible shapes, sizes, and positions, accurate and automatic tumour segmentation and classification remains a difficult challenge. With the advancement of computing, new models in artificial intelligence have evolved. Following its success in Natural language processing (NLP), the transformer paradigm has been adopted by the computer vision (CV) community of the NLP. While there are already accepted approaches to classifying the liver, especially in clinical settings, there is room for advancement in terms of their precision. This paper makes an effort to apply a novel model for segmenting and classifying liver tumours built on deep learning. In order to accomplish this, the created model follows a three-stage procedure consisting of (a) pre-processing, (b) liver segmentation, and (c) classification. In the first phase, the collected Computed Tomography (CT) images undergo three stages of pre-processing, including contrast improvement via histogram equalization and noise reduction via the median filter. Next, an enhanced mask region-based convolutional neural networks (Mask R-CNN) model is used to separate the liver from the CT abdominal image. To prevent overfitting, the segmented picture is fed onto an Enhanced Swin Transformer Network with Adversarial Propagation (APESTNet). The experimental results prove the superior performance of the proposed perfect on a wide variety of CT images, as well as its efficiency and low sensitivity to noise.
Collapse
Affiliation(s)
- Prabhu Kavin Balasubramanian
- Department of Data Science and Business System, Kattankulathur Campus, SRM Institute of Science and Technology, Chennai 603203, Tamil Nadu, India
| | - Wen-Cheng Lai
- Bachelor Program in Industrial Projects, National Yunlin University of Science and Technology, Douliu 640301, Taiwan
- Department of Electronic Engineering, National Yunlin University of Science and Technology, Douliu 640301, Taiwan
| | - Gan Hong Seng
- Department of Data Science, UMK City Campus, University Malaysia Kelantan, Pengkalan Chepa, Kelantan 16100, Malaysia
- Correspondence: (G.H.S.); (K.C.)
| | - Kavitha C
- Department of Computer Science and Engineering, Sathyabama Institute of Science and Technology, Chennai 600119, Tamil Nadu, India
- Correspondence: (G.H.S.); (K.C.)
| | - Jeeva Selvaraj
- Department of Data Science and Business System, Kattankulathur Campus, SRM Institute of Science and Technology, Chennai 603203, Tamil Nadu, India
- Department of Data Science, UMK City Campus, University Malaysia Kelantan, Pengkalan Chepa, Kelantan 16100, Malaysia
| |
Collapse
|
26
|
Huang ML, Wu YS. GCS-YOLOV4-Tiny: A lightweight group convolution network for multi-stage fruit detection. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2023; 20:241-268. [PMID: 36650764 DOI: 10.3934/mbe.2023011] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
Fruits require different planting techniques at different growth stages. Traditionally, the maturity stage of fruit is judged visually, which is time-consuming and labor-intensive. Fruits differ in size and color, and sometimes leaves or branches occult some of fruits, limiting automatic detection of growth stages in a real environment. Based on YOLOV4-Tiny, this study proposes a GCS-YOLOV4-Tiny model by (1) adding squeeze and excitation (SE) and the spatial pyramid pooling (SPP) modules to improve the accuracy of the model and (2) using the group convolution to reduce the size of the model and finally achieve faster detection speed. The proposed GCS-YOLOV4-Tiny model was executed on three public fruit datasets. Results have shown that GCS-YOLOV4-Tiny has favorable performance on mAP, Recall, F1-Score and Average IoU on Mango YOLO and Rpi-Tomato datasets. In addition, with the smallest model size of 20.70 MB, the mAP, Recall, F1-score, Precision and Average IoU of GCS-YOLOV4-Tiny achieve 93.42 ± 0.44, 91.00 ± 1.87, 90.80 ± 2.59, 90.80 ± 2.77 and 76.94 ± 1.35%, respectively, on F. margarita dataset. The detection results outperform the state-of-the-art YOLOV4-Tiny model with a 17.45% increase in mAP and a 13.80% increase in F1-score. The proposed model provides an effective and efficient performance to detect different growth stages of fruits and can be extended for different fruits and crops for object or disease detections.
Collapse
Affiliation(s)
- Mei-Ling Huang
- Department of Industrial Engineering & Management, National Chin-Yi University of Technology, Taichung, Taiwan
| | - Yi-Shan Wu
- Department of Industrial Engineering & Management, National Chin-Yi University of Technology, Taichung, Taiwan
| |
Collapse
|
27
|
Xiong M, Xu Y, Zhao Y, He S, Zhu Q, Wu Y, Hu X, Liu L. Quantitative analysis of artificial intelligence on liver cancer: A bibliometric analysis. Front Oncol 2023; 13:990306. [PMID: 36874099 PMCID: PMC9978515 DOI: 10.3389/fonc.2023.990306] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2022] [Accepted: 02/03/2023] [Indexed: 02/18/2023] Open
Abstract
Objective To provide the current research progress, hotspots, and emerging trends for AI in liver cancer, we have compiled a relative comprehensive and quantitative report on the research of liver disease using artificial intelligence by employing bibliometrics in this study. Methods In this study, the Web of Science Core Collection (WoSCC) database was used to perform systematic searches using keywords and a manual screening strategy, VOSviewer was used to analyze the degree of cooperation between countries/regions and institutions, as well as the co-occurrence of cooperation between authors and cited authors. Citespace was applied to generate a dual map to analyze the relationship of citing journals and citied journals and conduct a strong citation bursts ranking analysis of references. Online SRplot was used for in-depth keyword analysis and Microsoft Excel 2019 was used to collect the targeted variables from retrieved articles. Results 1724 papers were collected in this study, including 1547 original articles and 177 reviews. The study of AI in liver cancer mostly began from 2003 and has developed rapidly from 2017. China has the largest number of publications, and the United States has the highest H-index and total citation counts. The top three most productive institutions are the League of European Research Universities, Sun Yat Sen University, and Zhejiang University. Jasjit S. Suri and Frontiers in Oncology are the most published author and journal, respectively. Keyword analysis showed that in addition to the research on liver cancer, research on liver cirrhosis, fatty liver disease, and liver fibrosis were also common. Computed tomography was the most used diagnostic tool, followed by ultrasound and magnetic resonance imaging. The diagnosis and differential diagnosis of liver cancer are currently the most widely adopted research goals, and comprehensive analyses of multi-type data and postoperative analysis of patients with advanced liver cancer are rare. The use of convolutional neural networks is the main technical method used in studies of AI on liver cancer. Conclusion AI has undergone rapid development and has a wide application in the diagnosis and treatment of liver diseases, especially in China. Imaging is an indispensable tool in this filed. Mmulti-type data fusion analysis and development of multimodal treatment plans for liver cancer could become the major trend of future research in AI in liver cancer.
Collapse
Affiliation(s)
- Ming Xiong
- Department of Digital Medicine, School of Biomedical Engineering and Medical Imaging, Third Military Medical University (Army Medical University), Chongqing, China
| | - Yaona Xu
- Department of Digital Medicine, School of Biomedical Engineering and Medical Imaging, Third Military Medical University (Army Medical University), Chongqing, China
| | - Yang Zhao
- Department of Digital Medicine, School of Biomedical Engineering and Medical Imaging, Third Military Medical University (Army Medical University), Chongqing, China
| | - Si He
- Department of Digital Medicine, School of Biomedical Engineering and Medical Imaging, Third Military Medical University (Army Medical University), Chongqing, China
| | - Qihan Zhu
- Department of Digital Medicine, School of Biomedical Engineering and Medical Imaging, Third Military Medical University (Army Medical University), Chongqing, China
| | - Yi Wu
- Department of Digital Medicine, School of Biomedical Engineering and Medical Imaging, Third Military Medical University (Army Medical University), Chongqing, China
| | - Xiaofei Hu
- Department of Nuclear Medicine, Southwest Hospital, Third Military Medical University (Army Medical University), Chongqing, China
| | - Li Liu
- Department of Digital Medicine, School of Biomedical Engineering and Medical Imaging, Third Military Medical University (Army Medical University), Chongqing, China.,Department of Ultrasound, Southwest Hospital, Third Military Medical University (Army Medical University), Chongqing, China
| |
Collapse
|
28
|
Liu H, Fu Y, Zhang S, Liu J, Wang Y, Wang G, Fang J. GCHA-Net: Global context and hybrid attention network for automatic liver segmentation. Comput Biol Med 2023; 152:106352. [PMID: 36481761 DOI: 10.1016/j.compbiomed.2022.106352] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2022] [Revised: 11/15/2022] [Accepted: 11/23/2022] [Indexed: 11/27/2022]
Abstract
Liver segmentation is a critical step in liver cancer diagnosis and surgical planning. The U-Net's architecture is one of the most efficient deep networks for medical image segmentation. However, the continuous downsampling operators in U-Net causes the loss of spatial information. To solve these problems, we propose a global context and hybrid attention network, called GCHA-Net, to adaptive capture the structural and detailed features. To capture the global features, a global attention module (GAM) is designed to model the channel and positional dimensions of the interdependencies. To capture the local features, a feature aggregation module (FAM) is designed, where a local attention module (LAM) is proposed to capture the spatial information. LAM can make our model focus on the local liver regions and suppress irrelevant information. The experimental results on the dataset LiTS2017 show that the dice per case (DPC) value and dice global (DG) value of liver were 96.5% and 96.9%, respectively. Compared with the state-of-the-art models, our model has superior performance in liver segmentation. Meanwhile, we test the experiment results on the 3Dircadb dataset, and it shows our model can obtain the highest accuracy compared with the closely related models. From these results, it can been seen that the proposed model can effectively capture the global context information and build the correlation between different convolutional layers. The code is available at the website: https://github.com/HuaxiangLiu/GCAU-Net.
Collapse
Affiliation(s)
- Huaxiang Liu
- Institute of Intelligent Information Processing, Taizhou University, Taizhou, 318000, Zhejiang, China
| | - Youyao Fu
- Institute of Intelligent Information Processing, Taizhou University, Taizhou, 318000, Zhejiang, China
| | - Shiqing Zhang
- Institute of Intelligent Information Processing, Taizhou University, Taizhou, 318000, Zhejiang, China
| | - Jun Liu
- College of Mechanical Engineering, Quzhou University, Quzhou, 324000, Zhejiang, China
| | - Yong Wang
- School of Aeronautics and Astronautics, Sun Yat Sen University, Guangzhou, 510275, Guangdong, China
| | - Guoyu Wang
- Institute of Intelligent Information Processing, Taizhou University, Taizhou, 318000, Zhejiang, China
| | - Jiangxiong Fang
- Institute of Intelligent Information Processing, Taizhou University, Taizhou, 318000, Zhejiang, China; College of Mechanical Engineering, Quzhou University, Quzhou, 324000, Zhejiang, China.
| |
Collapse
|
29
|
Boundary Aware U-Net for Medical Image Segmentation. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING 2022. [DOI: 10.1007/s13369-022-07431-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
30
|
Xuan P, Jiang B, Cui H, Jin Q, Cheng P, Nakaguchi T, Zhang T, Li C, Ning Z, Guo M, Wang L. Convolutional bi-directional learning and spatial enhanced attentions for lung tumor segmentation. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2022; 226:107147. [PMID: 36206688 DOI: 10.1016/j.cmpb.2022.107147] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/02/2022] [Revised: 09/16/2022] [Accepted: 09/18/2022] [Indexed: 06/16/2023]
Abstract
BACKGROUND AND OBJECTIVE Accurate lung tumor segmentation from computed tomography (CT) is complex due to variations in tumor sizes, shapes, patterns and growing locations. Learning semantic and spatial relations between different feature channels, image regions and positions is critical yet challenging. METHODS We propose a new segmentation method, PRCS, by learning and integrating multi-channel contextual relations, and spatial and position dependencies across image regions. Firstly, to extract contextual relationships between different deep image feature tensor channels, we propose a new convolutional bi-directional gated recurrent unit based module for forward and backward learning. Secondly, a novel cross-channel region-level attention mechanism is proposed to discriminate the contributions of different local regions and associated features in the global learning process. Finally, spatial and position dependencies are formulated by a new position-enhanced self-attention mechanism. The new attention can measure the diverse contributions of other positions to a target position and obtain an enhanced adaptive feature vector for the target position. RESULTS Our model outperformed seven state-of-the-art segmentation methods on both public and in-house lung tumor datasets in terms of spatial overlapping and shape similarity. Ablation study results proved the effectiveness of three technical innovations and generalization ability on different 3D CNN segmentation backbones. CONCLUSION The proposed model enhanced the learning and propagation of contextual, spatial and position relations in 3D volumes, improving lung tumours' segmentation performance with large variations and indistinct boundaries. PRCS provides an effective automated approach to support precision diagnosis and treatment planning of lung cancer.
Collapse
Affiliation(s)
- Ping Xuan
- School of Computer Science and Technology, Heilongjiang University, Harbin, China; Department of Computer Science, School of Engineering, Shantou University, Shantou, China
| | - Bin Jiang
- School of Computer Science and Technology, Heilongjiang University, Harbin, China
| | - Hui Cui
- Department of Computer Science and Information Technology, La Trobe University, Melbourne, Australia
| | - Qiangguo Jin
- School of Software, Northwestern Polytechnical University, Xi' an, China
| | - Peng Cheng
- Department of Computer Science and Information Technology, La Trobe University, Melbourne, Australia
| | - Toshiya Nakaguchi
- Center for Frontier Medical Engineering, Chiba University, Chiba, Japan
| | - Tiangang Zhang
- School of Mathematical Science, Heilongjiang University, Harbin, China.
| | | | - Zhiyu Ning
- Sydney Polytechnic Institute, Sydney, Australia
| | | | - Linlin Wang
- Department of Radiation Oncology, Shandong Cancer Hospital and Institute, Shandong First Medical University and Shandong Academy of Medical Sciences, Jinan, China
| |
Collapse
|
31
|
Lv P, Wang J, Zhang X, Shi C. Deep supervision and atrous inception-based U-Net combining CRF for automatic liver segmentation from CT. Sci Rep 2022; 12:16995. [PMID: 36216965 PMCID: PMC9550798 DOI: 10.1038/s41598-022-21562-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2022] [Accepted: 09/28/2022] [Indexed: 12/29/2022] Open
Abstract
Due to low contrast and the blurred boundary between liver tissue and neighboring organs sharing similar intensity values, the problem of liver segmentation from CT images has not yet achieved satisfactory performance and remains a challenge. To alleviate these problems, we introduce deep supervision (DS) and atrous inception (AI) technologies with conditional random field (CRF) and propose three major improvements that are experimentally shown to have substantive and practical value. First, we replace the encoder's standard convolution with the residual block. Residual blocks can increase the depth of the network. Second, we provide an AI module to connect the encoder and decoder. AI allows us to obtain multi-scale features. Third, we incorporate the DS mechanism into the decoder. This helps to make full use of information of the shallow layers. In addition, we employ the Tversky loss function to balance the segmented and non-segmented regions and perform further refinement with a dense CRF. Finally, we extensively validate the proposed method on three public databases: LiTS17, 3DIRCADb, and SLiver07. Compared to the state-of-the-art methods, the proposed method achieved increased segmentation accuracy for the livers with low contrast and the fuzzy boundary between liver tissue and neighboring organs and is, therefore, more suited for automatic segmentation of these livers.
Collapse
Affiliation(s)
- Peiqing Lv
- School of Automation, Harbin University of Science and Technology, Harbin, 150080, China
| | - Jinke Wang
- Department of Software Engineering, Harbin University of Science and Technology, Rongcheng, 264300, China.
- School of Automation, Harbin University of Science and Technology, Harbin, 150080, China.
| | - Xiangyang Zhang
- School of Automation, Harbin University of Science and Technology, Harbin, 150080, China
| | - Changfa Shi
- Mobile E-business Collaborative Innovation Center of Hunan Province, Hunan University of Technology and Business, Changsha, 410205, China
| |
Collapse
|
32
|
Popescu D, Stanciulescu A, Pomohaci MD, Ichim L. Decision Support System for Liver Lesion Segmentation Based on Advanced Convolutional Neural Network Architectures. Bioengineering (Basel) 2022; 9:bioengineering9090467. [PMID: 36135013 PMCID: PMC9495456 DOI: 10.3390/bioengineering9090467] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2022] [Revised: 08/20/2022] [Accepted: 09/09/2022] [Indexed: 11/16/2022] Open
Abstract
Given its essential role in body functions, liver cancer is the third most common cause of death from cancer, despite being the sixth most common type of cancer worldwide. Following advancements in medicine and image processing, medical image segmentation methods are receiving a great deal of attention. As a novelty, the paper proposes an intelligent decision system for segmenting liver and hepatic tumors by integrating four efficient neural networks (ResNet152, ResNeXt101, DenseNet201, and InceptionV3). Images from computed tomography for training, validation, and testing were taken from the public LiTS17 database and preprocessed to better highlight liver tissue and tumors. Global segmentation is done by separately training individual classifiers and the global system of merging individual decisions. For the aforementioned application, classification neural networks have been modified for semantic segmentation. After segmentation based on the neural network system, the images were postprocessed to eliminate artifacts. The segmentation results obtained by the system were better, from the point of view of the Dice coefficient, than those obtained by the individual networks, and comparable with those reported in recent works.
Collapse
|
33
|
Czipczer V, Manno-Kovacs A. Adaptable volumetric liver segmentation model for CT images using region-based features and convolutional neural network. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2022.07.024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
|
34
|
Gul S, Khan MS, Bibi A, Khandakar A, Ayari MA, Chowdhury ME. Deep learning techniques for liver and liver tumor segmentation: A review. Comput Biol Med 2022; 147:105620. [DOI: 10.1016/j.compbiomed.2022.105620] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2021] [Revised: 02/18/2022] [Accepted: 03/19/2022] [Indexed: 12/29/2022]
|
35
|
Improved U-Net Remote Sensing Classification Algorithm Fusing Attention and Multiscale Features. REMOTE SENSING 2022. [DOI: 10.3390/rs14153591] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
The selection and representation of classification features in remote sensing image play crucial roles in image classification accuracy. To effectively improve the features classification accuracy, an improved U-Net remote sensing classification algorithm fusing attention and multiscale features is proposed in this paper, called spatial attention-atrous spatial pyramid pooling U-Net (SA-UNet). This framework connects atrous spatial pyramid pooling (ASPP) with the convolutional units of the encoder of the original U-Net in the form of residuals. The ASPP module expands the receptive field, integrates multiscale features in the network, and enhances the ability to express shallow features. Through the fusion residual module, shallow and deep features are deeply fused, and the characteristics of shallow and deep features are further used. The spatial attention mechanism is used to combine spatial with semantic information so that the decoder can recover more spatial information. In this study, the crop distribution in central Guangxi province was analyzed, and experiments were conducted based on Landsat 8 multispectral remote sensing images. The experimental results showed that the improved algorithm increases the classification accuracy, with the accuracy increasing from 93.33% to 96.25%, The segmentation accuracy of sugarcane, rice, and other land increased from 96.42%, 63.37%, and 88.43% to 98.01%, 83.21%, and 95.71%, respectively. The agricultural planting area results obtained by the proposed algorithm can be used as input data for regional ecological models, which is conducive to the development of accurate and real-time crop growth change models.
Collapse
|
36
|
Automatic liver tumor segmentation used the cascade multi-scale attention architecture method based on 3D U-Net. Int J Comput Assist Radiol Surg 2022; 17:1915-1922. [DOI: 10.1007/s11548-022-02653-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2022] [Accepted: 04/21/2022] [Indexed: 11/05/2022]
|
37
|
Shen X, Xu J, Jia H, Fan P, Dong F, Yu B, Ren S. Self-attentional microvessel segmentation via squeeze-excitation transformer Unet. Comput Med Imaging Graph 2022; 97:102055. [DOI: 10.1016/j.compmedimag.2022.102055] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2021] [Revised: 02/17/2022] [Accepted: 03/12/2022] [Indexed: 11/27/2022]
|
38
|
Bi R, Ji C, Yang Z, Qiao M, Lv P, Wang H. Residual based attention-Unet combing DAC and RMP modules for automatic liver tumor segmentation in CT. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2022; 19:4703-4718. [PMID: 35430836 DOI: 10.3934/mbe.2022219] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Purpose: Due to the complex distribution of liver tumors in the abdomen, the accuracy of liver tumor segmentation cannot meet the needs of clinical assistance yet. This paper aims to propose a new end-to-end network to improve the segmentation accuracy of liver tumors from CT. Method: We proposed a hybrid network, leveraging the residual block, the context encoder (CE), and the Attention-Unet, called ResCEAttUnet. The CE comprises a dense atrous convolution (DAC) module and a residual multi-kernel pooling (RMP) module. The DAC module ensures the network derives high-level semantic information and minimizes detailed information loss. The RMP module improves the ability of the network to extract multi-scale features. Moreover, a hybrid loss function based on cross-entropy and Tversky loss function is employed to distribute the weights of the two-loss parts through training iterations. Results: We evaluated the proposed method in LiTS17 and 3DIRCADb databases. It significantly improved the segmentation accuracy compared to state-of-the-art methods. Conclusions: Experimental results demonstrate the satisfying effects of the proposed method through both quantitative and qualitative analyses, thus proving a promising tool in liver tumor segmentation.
Collapse
Affiliation(s)
- Rongrong Bi
- Department of Software Engineering, Harbin University of Science and Technology, Rongcheng 264300, China
| | - Chunlei Ji
- Department of Software Engineering, Harbin University of Science and Technology, Rongcheng 264300, China
| | - Zhipeng Yang
- School of Automation, Harbin University of Science and Technology, Harbin 150080, China
| | - Meixia Qiao
- Department of Software Engineering, Harbin University of Science and Technology, Rongcheng 264300, China
| | - Peiqing Lv
- School of Automation, Harbin University of Science and Technology, Harbin 150080, China
| | - Haiying Wang
- School of Automation, Harbin University of Science and Technology, Harbin 150080, China
| |
Collapse
|
39
|
Improved U-Net Remote Sensing Classification Algorithm Based on Multi-Feature Fusion Perception. REMOTE SENSING 2022. [DOI: 10.3390/rs14051118] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
The selection and representation of remote sensing image classification features play crucial roles in image classification accuracy. To effectively improve the classification accuracy of features, an improved U-Net network framework based on multi-feature fusion perception is proposed in this paper. This framework adds the channel attention module (CAM-UNet) to the original U-Net framework and cascades the shallow features with the deep semantic features, replaces the classification layer in the original U-Net network with a support vector machine, and finally uses the majority voting game theory algorithm to fuse the multifeature classification results and obtain the final classification results. This study used the forest distribution in Xingbin District, Laibin City, Guangxi Zhuang Autonomous Region as the research object, which is based on Landsat 8 multispectral remote sensing images, and, by combining spectral features, spatial features, and advanced semantic features, overcame the influence of the reduction in spatial resolution that occurs with the deepening of the network on the classification results. The experimental results showed that the improved algorithm can improve classification accuracy. Before the improvement, the overall segmentation accuracy and segmentation accuracy of the forestland increased from 90.50% to 92.82% and from 95.66% to 97.16%, respectively. The forest cover results obtained by the algorithm proposed in this paper can be used as input data for regional ecological models, which is conducive to the development of accurate and real-time vegetation growth change models.
Collapse
|
40
|
Mohagheghi S, Foruzan AH. Developing an explainable deep learning boundary correction method by incorporating cascaded x-Dim models to improve segmentation defects in liver CT images. Comput Biol Med 2022; 140:105106. [PMID: 34864581 DOI: 10.1016/j.compbiomed.2021.105106] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2021] [Revised: 11/16/2021] [Accepted: 11/29/2021] [Indexed: 11/21/2022]
Abstract
Deep learning methods achieved remarkable results in medical image analysis tasks but it has not yet been widely used by medical professionals. One of the main reasons for this restricted usage is the uncertainty of the reasons that influence the decision of the model. Explainable AI methods have been developed to improve the transparency, interpretability, and explainability of the black-box AI methods. The result of an explainable segmentation method will be more trusted by experts. In this study, we designed an explainable deep correction method by incorporating cascaded 1D and 2D models to refine the output of other models and provide reliable yet accurate results. We implemented a 2-step loop with a 1D local boundary validation model in the first step, and a 2D image patch segmentation model in the second step, to refine incorrect segmented regions slice-by-slice. The proposed method improved the result of the CNN segmentation models and achieved state-of-the-art results on 3D liver segmentation with the average dice coefficient of 98.27 on the Sliver07 dataset.
Collapse
Affiliation(s)
- Saeed Mohagheghi
- Department of Biomedical Engineering, Engineering Faculty, Shahed University, Tehran, Iran.
| | - Amir Hossein Foruzan
- Department of Biomedical Engineering, Engineering Faculty, Shahed University, Tehran, Iran.
| |
Collapse
|
41
|
Araújo JDL, da Cruz LB, Diniz JOB, Ferreira JL, Silva AC, de Paiva AC, Gattass M. Liver segmentation from computed tomography images using cascade deep learning. Comput Biol Med 2022; 140:105095. [PMID: 34902610 DOI: 10.1016/j.compbiomed.2021.105095] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2021] [Revised: 11/17/2021] [Accepted: 11/27/2021] [Indexed: 12/18/2022]
Abstract
BACKGROUND Liver segmentation is a fundamental step in the treatment planning and diagnosis of liver cancer. However, manual segmentation of liver is time-consuming because of the large slice quantity and subjectiveness associated with the specialist's experience, which can lead to segmentation errors. Thus, the segmentation process can be automated using computational methods for better time efficiency and accuracy. However, automatic liver segmentation is a challenging task, as the liver can vary in shape, ill-defined borders, and lesions, which affect its appearance. We aim to propose an automatic method for liver segmentation using computed tomography (CT) images. METHODS The proposed method, based on deep convolutional neural network models and image processing techniques, comprise of four main steps: (1) image preprocessing, (2) initial segmentation, (3) reconstruction, and (4) final segmentation. RESULTS We evaluated the proposed method using 131 CT images from the LiTS image base. An average sensitivity of 95.45%, an average specificity of 99.86%, an average Dice coefficient of 95.64%, an average volumetric overlap error (VOE) of 8.28%, an average relative volume difference (RVD) of -0.41%, and an average Hausdorff distance (HD) of 26.60 mm were achieved. CONCLUSIONS This study demonstrates that liver segmentation, even when lesions are present in CT images, can be efficiently performed using a cascade approach and including a reconstruction step based on deep convolutional neural networks.
Collapse
Affiliation(s)
- José Denes Lima Araújo
- Applied Computing Group (NCA - UFMA), Federal University of Maranhão, Av. dos Portugueses, SN, Campus do Bacanga, Bacanga, 65 085-580, São Luís, MA, Brazil.
| | - Luana Batista da Cruz
- Applied Computing Group (NCA - UFMA), Federal University of Maranhão, Av. dos Portugueses, SN, Campus do Bacanga, Bacanga, 65 085-580, São Luís, MA, Brazil.
| | - João Otávio Bandeira Diniz
- Applied Computing Group (NCA - UFMA), Federal University of Maranhão, Av. dos Portugueses, SN, Campus do Bacanga, Bacanga, 65 085-580, São Luís, MA, Brazil; Federal Institute of Maranhão, BR-226, SN, Campus Grajaú, Vila Nova, 65 940-000, Grajaú, MA, Brazil.
| | - Jonnison Lima Ferreira
- Applied Computing Group (NCA - UFMA), Federal University of Maranhão, Av. dos Portugueses, SN, Campus do Bacanga, Bacanga, 65 085-580, São Luís, MA, Brazil; Federal Institute of Amazonas, Rua Santos Dumont, SN, Campus Tabatinga, Vila Verde, 69 640-000, Tabatinga, AM, Brazil.
| | - Aristófanes Corrêa Silva
- Applied Computing Group (NCA - UFMA), Federal University of Maranhão, Av. dos Portugueses, SN, Campus do Bacanga, Bacanga, 65 085-580, São Luís, MA, Brazil.
| | - Anselmo Cardoso de Paiva
- Applied Computing Group (NCA - UFMA), Federal University of Maranhão, Av. dos Portugueses, SN, Campus do Bacanga, Bacanga, 65 085-580, São Luís, MA, Brazil.
| | - Marcelo Gattass
- Pontifical Catholic University of Rio de Janeiro, R. São Vicente, 225, Gávea, 22 453-900, Rio de Janeiro, RJ, Brazil.
| |
Collapse
|