1
|
Wang CK, Wang TW, Yang YX, Wu YT. Deep Learning for Nasopharyngeal Carcinoma Segmentation in Magnetic Resonance Imaging: A Systematic Review and Meta-Analysis. Bioengineering (Basel) 2024; 11:504. [PMID: 38790370 PMCID: PMC11118180 DOI: 10.3390/bioengineering11050504] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2024] [Revised: 05/11/2024] [Accepted: 05/15/2024] [Indexed: 05/26/2024] Open
Abstract
Nasopharyngeal carcinoma is a significant health challenge that is particularly prevalent in Southeast Asia and North Africa. MRI is the preferred diagnostic tool for NPC due to its superior soft tissue contrast. The accurate segmentation of NPC in MRI is crucial for effective treatment planning and prognosis. We conducted a search across PubMed, Embase, and Web of Science from inception up to 20 March 2024, adhering to the PRISMA 2020 guidelines. Eligibility criteria focused on studies utilizing DL for NPC segmentation in adults via MRI. Data extraction and meta-analysis were conducted to evaluate the performance of DL models, primarily measured by Dice scores. We assessed methodological quality using the CLAIM and QUADAS-2 tools, and statistical analysis was performed using random effects models. The analysis incorporated 17 studies, demonstrating a pooled Dice score of 78% for DL models (95% confidence interval: 74% to 83%), indicating a moderate to high segmentation accuracy by DL models. Significant heterogeneity and publication bias were observed among the included studies. Our findings reveal that DL models, particularly convolutional neural networks, offer moderately accurate NPC segmentation in MRI. This advancement holds the potential for enhancing NPC management, necessitating further research toward integration into clinical practice.
Collapse
Affiliation(s)
- Chih-Keng Wang
- School of Medicine, College of Medicine, National Yang-Ming Chiao Tung University, Taipei 112304, Taiwan; (C.-K.W.)
- Department of Otolaryngology-Head and Neck Surgery, Taichung Veterans General Hospital, Taichung 407219, Taiwan
| | - Ting-Wei Wang
- School of Medicine, College of Medicine, National Yang-Ming Chiao Tung University, Taipei 112304, Taiwan; (C.-K.W.)
- Institute of Biophotonics, National Yang-Ming Chiao Tung University, 155, Sec. 2, Li-Nong St. Beitou Dist., Taipei 112304, Taiwan
| | - Ya-Xuan Yang
- Department of Otolaryngology-Head and Neck Surgery, Taichung Veterans General Hospital, Taichung 407219, Taiwan
| | - Yu-Te Wu
- Institute of Biophotonics, National Yang-Ming Chiao Tung University, 155, Sec. 2, Li-Nong St. Beitou Dist., Taipei 112304, Taiwan
| |
Collapse
|
2
|
Nakagawa J, Fujima N, Hirata K, Harada T, Wakabayashi N, Takano Y, Homma A, Kano S, Minowa K, Kudo K. Diagnosis of skull-base invasion by nasopharyngeal tumors on CT with a deep-learning approach. Jpn J Radiol 2024; 42:450-459. [PMID: 38280100 PMCID: PMC11056334 DOI: 10.1007/s11604-023-01527-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2023] [Accepted: 12/24/2023] [Indexed: 01/29/2024]
Abstract
PURPOSE To develop a convolutional neural network (CNN) model to diagnose skull-base invasion by nasopharyngeal malignancies in CT images and evaluate the model's diagnostic performance. MATERIALS AND METHODS We divided 100 malignant nasopharyngeal tumor lesions into a training (n = 70) and a test (n = 30) dataset. Two head/neck radiologists reviewed CT and MRI images and determined the positive/negative skull-base invasion status of each case (training dataset: 29 invasion-positive and 41 invasion-negative; test dataset: 13 invasion-positive and 17 invasion-negative). Preprocessing involved extracting continuous slices of the nasopharynx and clivus. The preprocessed training dataset was used for transfer learning with Residual Neural Networks 50 to create a diagnostic CNN model, which was then tested on the preprocessed test dataset to determine the invasion status and model performance. Original CT images from the test dataset were reviewed by a radiologist with extensive head/neck imaging experience (senior reader: SR) and another less-experienced radiologist (junior reader: JR). Gradient-weighted class activation maps (Grad-CAMs) were created to visualize the explainability of the invasion status classification. RESULTS The CNN model's diagnostic accuracy was 0.973, significantly higher than those of the two radiologists (SR: 0.838; JR: 0.595). Receiver operating characteristic curve analysis gave an area under the curve of 0.953 for the CNN model (versus 0.832 and 0.617 for SR and JR; both p < 0.05). The Grad-CAMs suggested that the invasion-negative cases were present predominantly in bone marrow, while the invasion-positive cases exhibited osteosclerosis and nasopharyngeal masses. CONCLUSIONS This CNN technique would be useful for CT-based diagnosis of skull-base invasion by nasopharyngeal malignancies.
Collapse
Affiliation(s)
- Junichi Nakagawa
- Department of Diagnostic Imaging, Graduate School of Medicine, Hokkaido University, N15 W7, Kita-Ku, Sapporo, Hokkaido, 060-8638, Japan
- Department of Diagnostic and Interventional Radiology, Hokkaido University Hospital, N14 W5, Kita-Ku, Sapporo, Hokkaido, 060-8648, Japan
| | - Noriyuki Fujima
- Department of Diagnostic and Interventional Radiology, Hokkaido University Hospital, N14 W5, Kita-Ku, Sapporo, Hokkaido, 060-8648, Japan.
- Global Center for Biomedical Science and Engineering, Faculty of Medicine, Hokkaido University, N14 W5, Kita-Ku, Sapporo, Hokkaido, 060-8638, Japan.
| | - Kenji Hirata
- Department of Diagnostic Imaging, Graduate School of Medicine, Hokkaido University, N15 W7, Kita-Ku, Sapporo, Hokkaido, 060-8638, Japan
- Global Center for Biomedical Science and Engineering, Faculty of Medicine, Hokkaido University, N14 W5, Kita-Ku, Sapporo, Hokkaido, 060-8638, Japan
- Department of Nuclear Medicine, Hokkaido University Hospital, N14 W5, Kita-Ku, Sapporo, Hokkaido, 060-8648, Japan
- Medical AI Research and Development Center, Hokkaido University Hospital, N14 W5, Kita-Ku, Sapporo, Hokkaido, 060-8648, Japan
| | - Taisuke Harada
- Department of Diagnostic Imaging, Graduate School of Medicine, Hokkaido University, N15 W7, Kita-Ku, Sapporo, Hokkaido, 060-8638, Japan
- Department of Diagnostic and Interventional Radiology, Hokkaido University Hospital, N14 W5, Kita-Ku, Sapporo, Hokkaido, 060-8648, Japan
- Global Center for Biomedical Science and Engineering, Faculty of Medicine, Hokkaido University, N14 W5, Kita-Ku, Sapporo, Hokkaido, 060-8638, Japan
| | - Naoto Wakabayashi
- Department of Diagnostic Imaging, Graduate School of Medicine, Hokkaido University, N15 W7, Kita-Ku, Sapporo, Hokkaido, 060-8638, Japan
- Department of Diagnostic and Interventional Radiology, Hokkaido University Hospital, N14 W5, Kita-Ku, Sapporo, Hokkaido, 060-8648, Japan
| | - Yuki Takano
- Department of Diagnostic Imaging, Graduate School of Medicine, Hokkaido University, N15 W7, Kita-Ku, Sapporo, Hokkaido, 060-8638, Japan
- Department of Diagnostic and Interventional Radiology, Hokkaido University Hospital, N14 W5, Kita-Ku, Sapporo, Hokkaido, 060-8648, Japan
| | - Akihiro Homma
- Department of Otolaryngology-Head and Neck Surgery, Faculty of Medicine and Graduate School of Medicine, Hokkaido University, N15 W7, Kita Ku, Sapporo, 060-8638, Japan
| | - Satoshi Kano
- Department of Otolaryngology-Head and Neck Surgery, Faculty of Medicine and Graduate School of Medicine, Hokkaido University, N15 W7, Kita Ku, Sapporo, 060-8638, Japan
| | - Kazuyuki Minowa
- Faculty of Dental Medicine Department of Radiology, Hokkaido University, N13 W7, Kita-Ku, Sapporo, Hokkaido, 060-8586, Japan
| | - Kohsuke Kudo
- Department of Diagnostic Imaging, Graduate School of Medicine, Hokkaido University, N15 W7, Kita-Ku, Sapporo, Hokkaido, 060-8638, Japan
- Department of Diagnostic and Interventional Radiology, Hokkaido University Hospital, N14 W5, Kita-Ku, Sapporo, Hokkaido, 060-8648, Japan
- Department of Nuclear Medicine, Hokkaido University Hospital, N14 W5, Kita-Ku, Sapporo, Hokkaido, 060-8648, Japan
- Medical AI Research and Development Center, Hokkaido University Hospital, N14 W5, Kita-Ku, Sapporo, Hokkaido, 060-8648, Japan
| |
Collapse
|
3
|
Domoguen JKL, Manuel JJA, Cañal JPA, Naval PC. Automatic segmentation of nasopharyngeal carcinoma on CT images using efficient UNet‐2.5D ensemble with semi‐supervised pretext task pretraining. Front Oncol 2022; 12:980312. [DOI: 10.3389/fonc.2022.980312] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2022] [Accepted: 10/17/2022] [Indexed: 11/12/2022] Open
Abstract
Nasopharyngeal carcinoma (NPC) is primarily treated with radiation therapy. Accurate delineation of target volumes and organs at risk is important. However, manual delineation is time-consuming, variable, and subjective depending on the experience of the radiation oncologist. This work explores the use of deep learning methods to automate the segmentation of NPC primary gross tumor volume (GTVp) in planning computer tomography (CT) images. A total of sixty-three (63) patients diagnosed with NPC were included in this study. Although a number of studies applied have shown the effectiveness of deep learning methods in medical imaging, their high performance has mainly been due to the wide availability of data. In contrast, the data for NPC is scarce and inaccessible. To tackle this problem, we propose two sequential approaches. First we propose a much simpler architecture which follows the UNet design but using 2D convolutional network for 3D segmentation. We find that this specific architecture is much more effective in the segmentation of GTV in NPC. We highlight its efficacy over other more popular and modern architecture by achieving significantly higher performance. Moreover to further improve performance, we trained the model using multi-scale dataset to create an ensemble of models. However, the performance of the model is ultimately dependent on the availability of labelled data. Hence building on top of this proposed architecture, we employ the use of semi-supervised learning by proposing the use of a combined pre-text tasks. Specifically we use the combination of 3D rotation and 3D relative-patch location pre-texts tasks to pretrain the feature extractor. We use an additional 50 CT images of healthy patients which have no annotation or labels. By semi-supervised pretraining the feature extractor can be frozen after pretraining which essentially makes it much more efficient in terms of the number of parameters since only the decoder is trained. Finally it is not only efficient in terms of parameters but also data, which is shown when the pretrained model with only portion of the labelled training data was able to achieve very close performance to the model trained with the full labelled data.
Collapse
|
4
|
Chen Y, Han G, Lin T, Liu X. CAFS: An Attention-Based Co-Segmentation Semi-Supervised Method for Nasopharyngeal Carcinoma Segmentation. SENSORS (BASEL, SWITZERLAND) 2022; 22:5053. [PMID: 35808548 PMCID: PMC9269783 DOI: 10.3390/s22135053] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/28/2022] [Revised: 06/25/2022] [Accepted: 06/30/2022] [Indexed: 02/06/2023]
Abstract
Accurate segmentation of nasopharyngeal carcinoma is essential to its treatment effect. However, there are several challenges in existing deep learning-based segmentation methods. First, the acquisition of labeled data are challenging. Second, the nasopharyngeal carcinoma is similar to the surrounding tissues. Third, the shape of nasopharyngeal carcinoma is complex. These challenges make the segmentation of nasopharyngeal carcinoma difficult. This paper proposes a novel semi-supervised method named CAFS for automatic segmentation of nasopharyngeal carcinoma. CAFS addresses the above challenges through three mechanisms: the teacher-student cooperative segmentation mechanism, the attention mechanism, and the feedback mechanism. CAFS can use only a small amount of labeled nasopharyngeal carcinoma data to segment the cancer region accurately. The average DSC value of CAFS is 0.8723 on the nasopharyngeal carcinoma segmentation task. Moreover, CAFS has outperformed the state-of-the-art nasopharyngeal carcinoma segmentation methods in the comparison experiment. Among the compared state-of-the-art methods, CAFS achieved the highest values of DSC, Jaccard, and precision. In particular, the DSC value of CAFS is 7.42% higher than the highest DSC value in the state-of-the-art methods.
Collapse
Affiliation(s)
- Yitong Chen
- School of Biomedical Engineering, Sun Yat-sen University, Shenzhen 518107, China; (Y.C.); (G.H.); (T.L.)
| | - Guanghui Han
- School of Biomedical Engineering, Sun Yat-sen University, Shenzhen 518107, China; (Y.C.); (G.H.); (T.L.)
- School of Information Engineering, North China University of Water Resources and Electric Power, Zhengzhou 450046, China
| | - Tianyu Lin
- School of Biomedical Engineering, Sun Yat-sen University, Shenzhen 518107, China; (Y.C.); (G.H.); (T.L.)
| | - Xiujian Liu
- School of Biomedical Engineering, Sun Yat-sen University, Shenzhen 518107, China; (Y.C.); (G.H.); (T.L.)
| |
Collapse
|