1
|
Li G, Otake Y, Soufi M, Taniguchi M, Yagi M, Ichihashi N, Uemura K, Takao M, Sugano N, Sato Y. Hybrid representation-enhanced sampling for Bayesian active learning in musculoskeletal segmentation of lower extremities. Int J Comput Assist Radiol Surg 2024; 19:2177-2186. [PMID: 38282095 DOI: 10.1007/s11548-024-03065-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2023] [Accepted: 01/10/2024] [Indexed: 01/30/2024]
Abstract
PURPOSE Manual annotations for training deep learning models in auto-segmentation are time-intensive. This study introduces a hybrid representation-enhanced sampling strategy that integrates both density and diversity criteria within an uncertainty-based Bayesian active learning (BAL) framework to reduce annotation efforts by selecting the most informative training samples. METHODS The experiments are performed on two lower extremity datasets of MRI and CT images, focusing on the segmentation of the femur, pelvis, sacrum, quadriceps femoris, hamstrings, adductors, sartorius, and iliopsoas, utilizing a U-net-based BAL framework. Our method selects uncertain samples with high density and diversity for manual revision, optimizing for maximal similarity to unlabeled instances and minimal similarity to existing training data. We assess the accuracy and efficiency using dice and a proposed metric called reduced annotation cost (RAC), respectively. We further evaluate the impact of various acquisition rules on BAL performance and design an ablation study for effectiveness estimation. RESULTS In MRI and CT datasets, our method was superior or comparable to existing ones, achieving a 0.8% dice and 1.0% RAC increase in CT (statistically significant), and a 0.8% dice and 1.1% RAC increase in MRI (not statistically significant) in volume-wise acquisition. Our ablation study indicates that combining density and diversity criteria enhances the efficiency of BAL in musculoskeletal segmentation compared to using either criterion alone. CONCLUSION Our sampling method is proven efficient in reducing annotation costs in image segmentation tasks. The combination of the proposed method and our BAL framework provides a semi-automatic way for efficient annotation of medical image datasets.
Collapse
Affiliation(s)
- Ganping Li
- Division of Information Science, Graduate School of Science and Technology, Nara Institute of Science and Technology, 8916-5 Takayama, Ikoma, Nara, 630-0192, Japan.
| | - Yoshito Otake
- Division of Information Science, Graduate School of Science and Technology, Nara Institute of Science and Technology, 8916-5 Takayama, Ikoma, Nara, 630-0192, Japan
| | - Mazen Soufi
- Division of Information Science, Graduate School of Science and Technology, Nara Institute of Science and Technology, 8916-5 Takayama, Ikoma, Nara, 630-0192, Japan
| | - Masashi Taniguchi
- Human Health Sciences, Graduate School of Medicine, Kyoto University, 53-Kawahara-cho, Shogoin, Sakyo-ku, Kyoto, 606-8507, Japan
| | - Masahide Yagi
- Human Health Sciences, Graduate School of Medicine, Kyoto University, 53-Kawahara-cho, Shogoin, Sakyo-ku, Kyoto, 606-8507, Japan
| | - Noriaki Ichihashi
- Human Health Sciences, Graduate School of Medicine, Kyoto University, 53-Kawahara-cho, Shogoin, Sakyo-ku, Kyoto, 606-8507, Japan
| | - Keisuke Uemura
- Department of Orthopedic Surgery, Osaka University Graduate School of Medicine, Osaka University, 2-2 Yamadaoka, Suita, Osaka, 565-0871, Japan
| | - Masaki Takao
- Department of Bone and Joint Surgery, School of Medicine, Ehime University, 454 Shitsugawa, Toon, Ehime, 791-0295, Japan
| | - Nobuhiko Sugano
- Department of Orthopedic Surgery, Osaka University Graduate School of Medicine, Osaka University, 2-2 Yamadaoka, Suita, Osaka, 565-0871, Japan
| | - Yoshinobu Sato
- Division of Information Science, Graduate School of Science and Technology, Nara Institute of Science and Technology, 8916-5 Takayama, Ikoma, Nara, 630-0192, Japan
| |
Collapse
|
2
|
Wang H, Jin Q, Li S, Liu S, Wang M, Song Z. A comprehensive survey on deep active learning in medical image analysis. Med Image Anal 2024; 95:103201. [PMID: 38776841 DOI: 10.1016/j.media.2024.103201] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Revised: 04/25/2024] [Accepted: 05/06/2024] [Indexed: 05/25/2024]
Abstract
Deep learning has achieved widespread success in medical image analysis, leading to an increasing demand for large-scale expert-annotated medical image datasets. Yet, the high cost of annotating medical images severely hampers the development of deep learning in this field. To reduce annotation costs, active learning aims to select the most informative samples for annotation and train high-performance models with as few labeled samples as possible. In this survey, we review the core methods of active learning, including the evaluation of informativeness and sampling strategy. For the first time, we provide a detailed summary of the integration of active learning with other label-efficient techniques, such as semi-supervised, self-supervised learning, and so on. We also summarize active learning works that are specifically tailored to medical image analysis. Additionally, we conduct a thorough comparative analysis of the performance of different AL methods in medical image analysis with experiments. In the end, we offer our perspectives on the future trends and challenges of active learning and its applications in medical image analysis. An accompanying paper list and code for the comparative analysis is available in https://github.com/LightersWang/Awesome-Active-Learning-for-Medical-Image-Analysis.
Collapse
Affiliation(s)
- Haoran Wang
- Digital Medical Research Center, School of Basic Medical Sciences, Fudan University, Shanghai 200032, China; Shanghai Key Laboratory of Medical Image Computing and Computer Assisted Intervention, Shanghai 200032, China
| | - Qiuye Jin
- Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal 23955, Saudi Arabia
| | - Shiman Li
- Digital Medical Research Center, School of Basic Medical Sciences, Fudan University, Shanghai 200032, China; Shanghai Key Laboratory of Medical Image Computing and Computer Assisted Intervention, Shanghai 200032, China
| | - Siyu Liu
- Digital Medical Research Center, School of Basic Medical Sciences, Fudan University, Shanghai 200032, China; Shanghai Key Laboratory of Medical Image Computing and Computer Assisted Intervention, Shanghai 200032, China
| | - Manning Wang
- Digital Medical Research Center, School of Basic Medical Sciences, Fudan University, Shanghai 200032, China; Shanghai Key Laboratory of Medical Image Computing and Computer Assisted Intervention, Shanghai 200032, China.
| | - Zhijian Song
- Digital Medical Research Center, School of Basic Medical Sciences, Fudan University, Shanghai 200032, China; Shanghai Key Laboratory of Medical Image Computing and Computer Assisted Intervention, Shanghai 200032, China.
| |
Collapse
|
3
|
Ma S, Mathur P, Ju Z, Lawlor A, Dong R. Model-data-driven adversarial active learning for brain tumor segmentation. Comput Biol Med 2024; 176:108585. [PMID: 38761499 DOI: 10.1016/j.compbiomed.2024.108585] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Revised: 04/28/2024] [Accepted: 05/08/2024] [Indexed: 05/20/2024]
Abstract
Active learning (AL) attempts to select informative samples in a dataset to minimize the number of required labels while maximizing the performance of the model. Current AL in segmentation tasks is limited to the expansion of popular classification-based methods including entropy, MC-dropout, etc. Meanwhile, most applications in the medical field are simply migrations that fail to consider the nature of medical images, such as high class imbalance, high domain difference, and data scarcity. In this study, we address these challenges and propose a novel AL framework for medical image segmentation task. Our approach introduces a pseudo-label-based filter addressing excessive blank patches in medical abnormalities segmentation tasks, e.g., lesions, and tumors, used before the AL selection. This filter helps reduce resource usage and allows the model to focus on selecting more informative samples. For the sample selection, we propose a novel query strategy that combines both model impact and data stability by employing adversarial attack. Furthermore, we harness the adversarial samples generated during the query process to enhance the robustness of the model. The experimental results verify our framework's effectiveness over various state-of-the-art methods. Our proposed method only needs less than 14% annotated patches in 3D brain MRI multiple sclerosis (MS) segmentation tasks and 20% for Low-Grade Glioma (LGG) tumor segmentation to achieve competitive results with full supervision. These promising outcomes not only improve performance but alleviate the time burden associated with expert annotation, thereby facilitating further advancements in the field of medical image segmentation. Our code is available at https://github.com/HelenMa9998/adversarial_active_learning.
Collapse
Affiliation(s)
- Siteng Ma
- School of Computer Science, University College Dublin, Dublin, D04 V1W8, Ireland.
| | - Prateek Mathur
- School of Computer Science, University College Dublin, Dublin, D04 V1W8, Ireland.
| | - Zheng Ju
- School of Computer Science, University College Dublin, Dublin, D04 V1W8, Ireland.
| | - Aonghus Lawlor
- School of Computer Science, University College Dublin, Dublin, D04 V1W8, Ireland.
| | - Ruihai Dong
- School of Computer Science, University College Dublin, Dublin, D04 V1W8, Ireland.
| |
Collapse
|
4
|
Alajrami E, Ng T, Jevsikov J, Naidoo P, Fernandes P, Azarmehr N, Dinmohammadi F, Shun-Shin MJ, Dadashi Serej N, Francis DP, Zolgharni M. Active learning for left ventricle segmentation in echocardiography. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2024; 248:108111. [PMID: 38479147 DOI: 10.1016/j.cmpb.2024.108111] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Revised: 02/21/2024] [Accepted: 03/01/2024] [Indexed: 04/02/2024]
Abstract
BACKGROUND AND OBJECTIVE Training deep learning models for medical image segmentation require large annotated datasets, which can be expensive and time-consuming to create. Active learning is a promising approach to reduce this burden by strategically selecting the most informative samples for segmentation. This study investigates the use of active learning for efficient left ventricle segmentation in echocardiography with sparse expert annotations. METHODS We adapt and evaluate various sampling techniques, demonstrating their effectiveness in judiciously selecting samples for segmentation. Additionally, we introduce a novel strategy, Optimised Representativeness Sampling, which combines feature-based outliers with the most representative samples to enhance annotation efficiency. RESULTS Our findings demonstrate a substantial reduction in annotation costs, achieving a remarkable 99% upper bound performance while utilising only 20% of the labelled data. This equates to a reduction of 1680 images needing annotation within our dataset. When applied to a publicly available dataset, our approach yielded a remarkable 70% reduction in required annotation efforts, representing a significant advancement compared to baseline active learning strategies, which achieved only a 50% reduction. Our experiments highlight the nuanced performance of diverse sampling strategies across datasets within the same domain. CONCLUSIONS The study provides a cost-effective approach to tackle the challenges of limited expert annotations in echocardiography. By introducing a distinct dataset, made publicly available for research purposes, our work contributes to the field's understanding of efficient annotation strategies in medical image segmentation.
Collapse
Affiliation(s)
- Eman Alajrami
- Intelligent Sensing and Vision, University of West London, London, UK.
| | - Tiffany Ng
- National Heart and Lung Institute, Imperial College London, London, UK
| | - Jevgeni Jevsikov
- Intelligent Sensing and Vision, University of West London, London, UK; National Heart and Lung Institute, Imperial College London, London, UK
| | - Preshen Naidoo
- Intelligent Sensing and Vision, University of West London, London, UK
| | | | - Neda Azarmehr
- Intelligent Sensing and Vision, University of West London, London, UK
| | | | | | | | - Darrel P Francis
- National Heart and Lung Institute, Imperial College London, London, UK
| | - Massoud Zolgharni
- Intelligent Sensing and Vision, University of West London, London, UK; National Heart and Lung Institute, Imperial College London, London, UK
| |
Collapse
|
5
|
Gaillochet M, Desrosiers C, Lombaert H. Active learning for medical image segmentation with stochastic batches. Med Image Anal 2023; 90:102958. [PMID: 37769549 DOI: 10.1016/j.media.2023.102958] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Revised: 09/01/2023] [Accepted: 09/04/2023] [Indexed: 10/03/2023]
Abstract
The performance of learning-based algorithms improves with the amount of labelled data used for training. Yet, manually annotating data is particularly difficult for medical image segmentation tasks because of the limited expert availability and intensive manual effort required. To reduce manual labelling, active learning (AL) targets the most informative samples from the unlabelled set to annotate and add to the labelled training set. On the one hand, most active learning works have focused on the classification or limited segmentation of natural images, despite active learning being highly desirable in the difficult task of medical image segmentation. On the other hand, uncertainty-based AL approaches notoriously offer sub-optimal batch-query strategies, while diversity-based methods tend to be computationally expensive. Over and above methodological hurdles, random sampling has proven an extremely difficult baseline to outperform when varying learning and sampling conditions. This work aims to take advantage of the diversity and speed offered by random sampling to improve the selection of uncertainty-based AL methods for segmenting medical images. More specifically, we propose to compute uncertainty at the level of batches instead of samples through an original use of stochastic batches (SB) during sampling in AL. Stochastic batch querying is a simple and effective add-on that can be used on top of any uncertainty-based metric. Extensive experiments on two medical image segmentation datasets show that our strategy consistently improves conventional uncertainty-based sampling methods. Our method can hence act as a strong baseline for medical image segmentation. The code is available on: https://github.com/Minimel/StochasticBatchAL.git.
Collapse
Affiliation(s)
| | | | - Hervé Lombaert
- ETS Montréal, 1100 Notre-Dame St W, Montreal H3C 1K3, QC, Canada
| |
Collapse
|
6
|
Moreira da Silva DE, Gonçalves L, Franco-Gonçalo P, Colaço B, Alves-Pimenta S, Ginja M, Ferreira M, Filipe V. Active learning for data efficient semantic segmentation of canine bones in radiographs. Front Artif Intell 2022; 5:939967. [PMCID: PMC9644053 DOI: 10.3389/frai.2022.939967] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2022] [Accepted: 10/07/2022] [Indexed: 11/16/2022] Open
Abstract
X-ray bone semantic segmentation is one crucial task in medical imaging. Due to deep learning's emergence, it was possible to build high-precision models. However, these models require a large quantity of annotated data. Furthermore, semantic segmentation requires pixel-wise labeling, thus being a highly time-consuming task. In the case of hip joints, there is still a need for increased anatomic knowledge due to the intrinsic nature of the femur and acetabulum. Active learning aims to maximize the model's performance with the least possible amount of data. In this work, we propose and compare the use of different queries, including uncertainty and diversity-based queries. Our results show that the proposed methods permit state-of-the-art performance using only 81.02% of the data, with O(1) time complexity.
Collapse
Affiliation(s)
- D. E. Moreira da Silva
- School of Science and Technology, University of Trás-os-Montes e Alto Douro (UTAD), Vila Real, Portugal
| | - Lio Gonçalves
- School of Science and Technology, University of Trás-os-Montes e Alto Douro (UTAD), Vila Real, Portugal
- INESC Technology and Science (INESC TEC), Porto, Portugal
| | - Pedro Franco-Gonçalo
- Department of Veterinary Science, UTAD, Vila Real, Portugal
- Veterinary and Animal Research Centre (CECAV), Vila Real, Portugal
| | - Bruno Colaço
- Veterinary and Animal Research Centre (CECAV), Vila Real, Portugal
- Department of Animal Science, UTAD, Vila Real, Portugal
- Centre for the Research and Technology of Agro-Environmental and Biological Sciences (CITAB), Vila Real, Portugal
| | - Sofia Alves-Pimenta
- Veterinary and Animal Research Centre (CECAV), Vila Real, Portugal
- Department of Animal Science, UTAD, Vila Real, Portugal
- Centre for the Research and Technology of Agro-Environmental and Biological Sciences (CITAB), Vila Real, Portugal
| | - Mário Ginja
- Department of Veterinary Science, UTAD, Vila Real, Portugal
- Veterinary and Animal Research Centre (CECAV), Vila Real, Portugal
- Centre for the Research and Technology of Agro-Environmental and Biological Sciences (CITAB), Vila Real, Portugal
| | | | - Vitor Filipe
- School of Science and Technology, University of Trás-os-Montes e Alto Douro (UTAD), Vila Real, Portugal
- INESC Technology and Science (INESC TEC), Porto, Portugal
- *Correspondence: Vitor Filipe
| |
Collapse
|
7
|
One-shot active learning for image segmentation via contrastive learning and diversity-based sampling. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.108278] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|