1
|
Azad R, Kazerouni A, Heidari M, Aghdam EK, Molaei A, Jia Y, Jose A, Roy R, Merhof D. Advances in medical image analysis with vision Transformers: A comprehensive review. Med Image Anal 2024; 91:103000. [PMID: 37883822 DOI: 10.1016/j.media.2023.103000] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2023] [Revised: 09/30/2023] [Accepted: 10/11/2023] [Indexed: 10/28/2023]
Abstract
The remarkable performance of the Transformer architecture in natural language processing has recently also triggered broad interest in Computer Vision. Among other merits, Transformers are witnessed as capable of learning long-range dependencies and spatial correlations, which is a clear advantage over convolutional neural networks (CNNs), which have been the de facto standard in Computer Vision problems so far. Thus, Transformers have become an integral part of modern medical image analysis. In this review, we provide an encyclopedic review of the applications of Transformers in medical imaging. Specifically, we present a systematic and thorough review of relevant recent Transformer literature for different medical image analysis tasks, including classification, segmentation, detection, registration, synthesis, and clinical report generation. For each of these applications, we investigate the novelty, strengths and weaknesses of the different proposed strategies and develop taxonomies highlighting key properties and contributions. Further, if applicable, we outline current benchmarks on different datasets. Finally, we summarize key challenges and discuss different future research directions. In addition, we have provided cited papers with their corresponding implementations in https://github.com/mindflow-institue/Awesome-Transformer.
Collapse
Affiliation(s)
- Reza Azad
- Faculty of Electrical Engineering and Information Technology, RWTH Aachen University, Aachen, Germany
| | - Amirhossein Kazerouni
- School of Electrical Engineering, Iran University of Science and Technology, Tehran, Iran
| | - Moein Heidari
- School of Electrical Engineering, Iran University of Science and Technology, Tehran, Iran
| | | | - Amirali Molaei
- School of Computer Engineering, Iran University of Science and Technology, Tehran, Iran
| | - Yiwei Jia
- Faculty of Electrical Engineering and Information Technology, RWTH Aachen University, Aachen, Germany
| | - Abin Jose
- Faculty of Electrical Engineering and Information Technology, RWTH Aachen University, Aachen, Germany
| | - Rijo Roy
- Faculty of Electrical Engineering and Information Technology, RWTH Aachen University, Aachen, Germany
| | - Dorit Merhof
- Faculty of Informatics and Data Science, University of Regensburg, Regensburg, Germany; Fraunhofer Institute for Digital Medicine MEVIS, Bremen, Germany.
| |
Collapse
|
2
|
Liu Z, Lv Q, Yang Z, Li Y, Lee CH, Shen L. Recent progress in transformer-based medical image analysis. Comput Biol Med 2023; 164:107268. [PMID: 37494821 DOI: 10.1016/j.compbiomed.2023.107268] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Revised: 05/30/2023] [Accepted: 07/16/2023] [Indexed: 07/28/2023]
Abstract
The transformer is primarily used in the field of natural language processing. Recently, it has been adopted and shows promise in the computer vision (CV) field. Medical image analysis (MIA), as a critical branch of CV, also greatly benefits from this state-of-the-art technique. In this review, we first recap the core component of the transformer, the attention mechanism, and the detailed structures of the transformer. After that, we depict the recent progress of the transformer in the field of MIA. We organize the applications in a sequence of different tasks, including classification, segmentation, captioning, registration, detection, enhancement, localization, and synthesis. The mainstream classification and segmentation tasks are further divided into eleven medical image modalities. A large number of experiments studied in this review illustrate that the transformer-based method outperforms existing methods through comparisons with multiple evaluation metrics. Finally, we discuss the open challenges and future opportunities in this field. This task-modality review with the latest contents, detailed information, and comprehensive comparison may greatly benefit the broad MIA community.
Collapse
Affiliation(s)
- Zhaoshan Liu
- Department of Mechanical Engineering, National University of Singapore, 9 Engineering Drive 1, Singapore, 117575, Singapore.
| | - Qiujie Lv
- Department of Mechanical Engineering, National University of Singapore, 9 Engineering Drive 1, Singapore, 117575, Singapore; School of Intelligent Systems Engineering, Sun Yat-sen University, No. 66, Gongchang Road, Guangming District, 518107, China.
| | - Ziduo Yang
- Department of Mechanical Engineering, National University of Singapore, 9 Engineering Drive 1, Singapore, 117575, Singapore; School of Intelligent Systems Engineering, Sun Yat-sen University, No. 66, Gongchang Road, Guangming District, 518107, China.
| | - Yifan Li
- Department of Mechanical Engineering, National University of Singapore, 9 Engineering Drive 1, Singapore, 117575, Singapore.
| | - Chau Hung Lee
- Department of Radiology, Tan Tock Seng Hospital, 11 Jalan Tan Tock Seng, Singapore, 308433, Singapore.
| | - Lei Shen
- Department of Mechanical Engineering, National University of Singapore, 9 Engineering Drive 1, Singapore, 117575, Singapore.
| |
Collapse
|
3
|
SegPC-2021: A challenge & dataset on segmentation of Multiple Myeloma plasma cells from microscopic images. Med Image Anal 2023; 83:102677. [PMID: 36403309 DOI: 10.1016/j.media.2022.102677] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2022] [Revised: 08/25/2022] [Accepted: 10/27/2022] [Indexed: 11/05/2022]
Abstract
Multiple Myeloma (MM) is an emerging ailment of global concern. Its diagnosis at the early stages is critical for recovery. Therefore, efforts are underway to produce digital pathology tools with human-level intelligence that are efficient, scalable, accessible, and cost-effective. Following the trend, a medical imaging challenge on "Segmentation of Multiple Myeloma Plasma Cells in Microscopic Images (SegPC-2021)" was organized at the IEEE International Symposium on Biomedical Imaging (ISBI), 2021, France. The challenge addressed the problem of cell segmentation in microscopic images captured from the slides prepared from the bone marrow aspirate of patients diagnosed with Multiple Myeloma. The challenge released a total of 775 images with 690 and 85 images of sizes 2040×1536 and 1920×2560 pixels, respectively, captured from two different (microscope and camera) setups. The participants had to segment the plasma cells with a separate label on each cell's nucleus and cytoplasm. This problem comprises many challenges, including a reduced color contrast between the cytoplasm and the background, and the clustering of cells with a feeble boundary separation of individual cells. To our knowledge, the SegPC-2021 challenge dataset is the largest publicly available annotated data on plasma cell segmentation in MM so far. The challenge targets a semi-automated tool to ensure the supervision of medical experts. It was conducted for a span of five months, from November 2020 to April 2021. Initially, the data was shared with 696 people from 52 teams, of which 41 teams submitted the results of their models on the evaluation portal in the validation phase. Similarly, 20 teams qualified for the last round, of which 16 teams submitted the results in the final test phase. All the top-5 teams employed DL-based approaches, and the best mIoU obtained on the final test set of 277 microscopic images was 0.9389. All these five models have been analyzed and discussed in detail. This challenge task is a step towards the target of creating an automated MM diagnostic tool.
Collapse
|
4
|
Mei L, Shen H, Yu Y, Weng Y, Li X, Zahid KR, Huang J, Wang D, Liu S, Zhou F, Lei C. High-throughput and high-accuracy diagnosis of multiple myeloma with multi-object detection. BIOMEDICAL OPTICS EXPRESS 2022; 13:6631-6644. [PMID: 36589588 PMCID: PMC9774865 DOI: 10.1364/boe.475166] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Revised: 10/16/2022] [Accepted: 11/11/2022] [Indexed: 06/17/2023]
Abstract
Multiple myeloma (MM) is a type of blood cancer where plasma cells abnormally multiply and crowd out regular blood cells in the bones. Automated analysis of bone marrow smear examination is considered promising to improve the performance and reduce the labor cost in MM diagnosis. To address the drawbacks in established methods, which mainly aim at identifying monoclonal plasma cells (monoclonal PCs) via binary classification, in this work, considering that monoclonal PCs is not the only basis in MM diagnosis, for the first we construct a multi-object detection model for MM diagnosis. The experimental results show that our model can handle the images at a throughput of 80 slides/s and identify six lineages of bone marrow cells with an average accuracy of 90.8%. This work makes a step further toward full-automatic and high-efficiency MM diagnosis.
Collapse
Affiliation(s)
- Liye Mei
- The Institute of Technological Sciences, Wuhan University, Wuhan, 430072, China
- These authors contributed equally to this work
| | - Hui Shen
- The Department of Hematology, Zhongnan Hospital of Wuhan University, 430062, China
- These authors contributed equally to this work
| | - Yalan Yu
- The Department of Hematology, Zhongnan Hospital of Wuhan University, 430062, China
- These authors contributed equally to this work
| | - Yueyun Weng
- The Institute of Technological Sciences, Wuhan University, Wuhan, 430072, China
- The Key Laboratory of Transients in Hydraulic Machinery of Ministry of Education, School of Power and Mechanical Engineering, Wuhan University, Wuhan, 430072, China
| | - Xiaoxiao Li
- The Institute of Technological Sciences, Wuhan University, Wuhan, 430072, China
| | - Kashif Rafiq Zahid
- Department of Radiation Oncology, Indiana University Melvin and Bren Simon Comprehensive Cancer Center, Indiana University School of Medicine, Indianapolis, IN 46202, USA
| | - Jin Huang
- The Institute of Technological Sciences, Wuhan University, Wuhan, 430072, China
| | - Du Wang
- The Institute of Technological Sciences, Wuhan University, Wuhan, 430072, China
| | - Sheng Liu
- The Institute of Technological Sciences, Wuhan University, Wuhan, 430072, China
- The Key Laboratory of Transients in Hydraulic Machinery of Ministry of Education, School of Power and Mechanical Engineering, Wuhan University, Wuhan, 430072, China
| | - Fuling Zhou
- The Department of Hematology, Zhongnan Hospital of Wuhan University, 430062, China
| | - Cheng Lei
- The Institute of Technological Sciences, Wuhan University, Wuhan, 430072, China
| |
Collapse
|
5
|
Detecting Malignant Leukemia Cells Using Microscopic Blood Smear Images: A Deep Learning Approach. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12136317] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Leukemia is a form of blood cancer that develops when the human body’s bone marrow contains too many white blood cells. This medical condition affects adults and is considered a prevalent form of cancer in children. Treatment for leukaemia is determined by the type and the extent to which cancer has developed across the body. It is crucial to diagnose leukaemia early in order to provide adequate care and to cure patients. Researchers have been working on advanced diagnostics systems based on Machine Learning (ML) approaches to diagnose leukaemia early. In this research, we employ deep learning (DL) based convolutional neural network (CNN) and hybridized two individual blocks of CNN named CNN-1 and CNN-2 to detect acute lymphoblastic leukaemia (ALL), acute myeloid leukaemia (AML), and multiple myeloma (MM). The proposed model detects malignant leukaemia cells using microscopic blood smear images. We construct a dataset of about 4150 images from a public directory. The main challenges were background removal, ripping out un-essential blood components of blood supplies, reduce the noise and blurriness and minimal method for image segmentation. To accomplish the pre-processing and segmentation, we transform RGB color-space into the greyscale 8-bit mode, enhancing the contrast of images using the image intensity adjustment method and adaptive histogram equalisation (AHE) method. We increase the structure and sharpness of images by multiplication of binary image with the output of enhanced images. In the next step, complement is done to get the background in black colour and nucleus of blood in white colour. Thereafter, we applied area operation and closing operation to remove background noise. Finally, we multiply the final output to source image to regenerate the images dataset in RGB colour space, and we resize dataset images to [400, 400]. After applying all methods and techniques, we have managed to get noiseless, non-blurred, sharped and segmented images of the lesion. In next step, enhanced segmented images are given as input to CNNs. Two parallel CCN models are trained, which extract deep features. The extracted features are further combined using the Canonical Correlation Analysis (CCA) fusion method to get more prominent features. We used five classification algorithms, namely, SVM, Bagging ensemble, total boosts, RUSBoost, and fine KNN, to evaluate the performance of feature extraction algorithms. Among the classification algorithms, Bagging ensemble outperformed the other algorithms by achieving the highest accuracy of 97.04%.
Collapse
|
6
|
Paing MP, Sento A, Bui TH, Pintavirooj C. Instance Segmentation of Multiple Myeloma Cells Using Deep-Wise Data Augmentation and Mask R-CNN. ENTROPY 2022; 24:e24010134. [PMID: 35052160 PMCID: PMC8774909 DOI: 10.3390/e24010134] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/21/2021] [Revised: 01/14/2022] [Accepted: 01/15/2022] [Indexed: 11/28/2022]
Abstract
Multiple myeloma is a condition of cancer in the bone marrow that can lead to dysfunction of the body and fatal expression in the patient. Manual microscopic analysis of abnormal plasma cells, also known as multiple myeloma cells, is one of the most commonly used diagnostic methods for multiple myeloma. However, as it is a manual process, it consumes too much effort and time. Besides, it has a higher chance of human errors. This paper presents a computer-aided detection and segmentation of myeloma cells from microscopic images of the bone marrow aspiration. Two major contributions are presented in this paper. First, different Mask R-CNN models using different images, including original microscopic images, contrast-enhanced images and stained cell images, are developed to perform instance segmentation of multiple myeloma cells. As a second contribution, a deep-wise augmentation, a deep learning-based data augmentation method, is applied to increase the performance of Mask R-CNN models. Based on the experimental findings, the Mask R-CNN model using contrast-enhanced images combined with the proposed deep-wise data augmentation provides a superior performance compared to other models. It achieves a mean precision of 0.9973, mean recall of 0.8631, and mean intersection over union (IOU) of 0.9062.
Collapse
Affiliation(s)
- May Phu Paing
- School of Engineering, King Mongkut’s Institute of Technology Ladkrabang, Bangkok 10520, Thailand
- Correspondence: (M.P.P.); (C.P.)
| | - Adna Sento
- Faculty of Engineering, Thai-Nichi Institute of Technology, Bangkok 10250, Thailand;
| | - Toan Huy Bui
- Course of Science and Technology, Graduate School of Science and Technology, Tokai University, Tokyo 108-8619, Japan;
| | - Chuchart Pintavirooj
- School of Engineering, King Mongkut’s Institute of Technology Ladkrabang, Bangkok 10520, Thailand
- Correspondence: (M.P.P.); (C.P.)
| |
Collapse
|
7
|
GCTI-SN: Geometry-inspired chemical and tissue invariant stain normalization of microscopic medical images. Med Image Anal 2020; 65:101788. [DOI: 10.1016/j.media.2020.101788] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2019] [Revised: 06/30/2020] [Accepted: 07/16/2020] [Indexed: 12/31/2022]
|
8
|
Roy K, Lewis CW, Chan GK, Bhattacharjee D. Automated classification of mitotic catastrophe by use of the centromere fragmentation morphology. Biochem Cell Biol 2020; 99:261-271. [PMID: 32905704 DOI: 10.1139/bcb-2020-0395] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Mitotic catastrophe is a common mode of tumor cell death. Cancer cells with a defective cell-cycle checkpoint often enter mitosis with damaged or under replicated chromosomes following genotoxic treatment. Premature condensation of the under-replicated (or damaged) chromosomes results in double-stranded DNA breaks at the centromere (centromere fragmentation). Centromere fragmentation is a morphological marker of mitotic catastrophe and is distinguished by the clustering of centromeres away from the chromosomes. We present an automated 2-step system for segmentation of cells exhibiting centromere fragmentation. The first step segments individual cells from clumps. We added two new terms, weighted local repelling term (WLRt) and weighted gradient term (WGt), in the energy functional of the traditional Chan-Vese based level set method. WLRt was used to generate a repelling force when contours of adjacent cells merged and then penalized the overlap. WGt enhances gradients between overlapping cells. The second step consists of a new algorithm, SBaN (shape-based analysis of each nucleus), which extracts features like circularity, major-axis length, minor-axis length, area, and eccentricity from each chromosome to identify cells with centromere fragmentation. The performance of SBaN algorithm for centromere fragmentation detection was statistically evaluated and the results were robust.
Collapse
Affiliation(s)
- Kaushiki Roy
- Department of Oncology, University of Alberta, Edmonton, AB T6G 1Z2, Canada.,Experimental Oncology, Cross Cancer Institute, Edmonton, AB T6G 1Z2, Canada.,Cancer Research Institute of Northern Alberta, University of Alberta, Edmonton, AB T6G 2J7, Canada.,Department of Computer Science and Engineering, Jadavpur University, 188 Raja S.C. Mallick Road, Kolkata, WB, India 700032
| | - Cody W Lewis
- Department of Oncology, University of Alberta, Edmonton, AB T6G 1Z2, Canada.,Experimental Oncology, Cross Cancer Institute, Edmonton, AB T6G 1Z2, Canada.,Cancer Research Institute of Northern Alberta, University of Alberta, Edmonton, AB T6G 2J7, Canada
| | - Gordon K Chan
- Department of Oncology, University of Alberta, Edmonton, AB T6G 1Z2, Canada.,Experimental Oncology, Cross Cancer Institute, Edmonton, AB T6G 1Z2, Canada.,Cancer Research Institute of Northern Alberta, University of Alberta, Edmonton, AB T6G 2J7, Canada
| | - Debotosh Bhattacharjee
- Department of Computer Science and Engineering, Jadavpur University, 188 Raja S.C. Mallick Road, Kolkata, WB, India 700032
| |
Collapse
|