1
|
Wang TW, Hong JS, Chiu HY, Chao HS, Chen YM, Wu YT. Standalone deep learning versus experts for diagnosis lung cancer on chest computed tomography: a systematic review. Eur Radiol 2024; 34:7397-7407. [PMID: 38777902 PMCID: PMC11519296 DOI: 10.1007/s00330-024-10804-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2024] [Revised: 03/10/2024] [Accepted: 04/01/2024] [Indexed: 05/25/2024]
Abstract
PURPOSE To compare the diagnostic performance of standalone deep learning (DL) algorithms and human experts in lung cancer detection on chest computed tomography (CT) scans. MATERIALS AND METHODS This study searched for studies on PubMed, Embase, and Web of Science from their inception until November 2023. We focused on adult lung cancer patients and compared the efficacy of DL algorithms and expert radiologists in disease diagnosis on CT scans. Quality assessment was performed using QUADAS-2, QUADAS-C, and CLAIM. Bivariate random-effects and subgroup analyses were performed for tasks (malignancy classification vs invasiveness classification), imaging modalities (CT vs low-dose CT [LDCT] vs high-resolution CT), study region, software used, and publication year. RESULTS We included 20 studies on various aspects of lung cancer diagnosis on CT scans. Quantitatively, DL algorithms exhibited superior sensitivity (82%) and specificity (75%) compared to human experts (sensitivity 81%, specificity 69%). However, the difference in specificity was statistically significant, whereas the difference in sensitivity was not statistically significant. The DL algorithms' performance varied across different imaging modalities and tasks, demonstrating the need for tailored optimization of DL algorithms. Notably, DL algorithms matched experts in sensitivity on standard CT, surpassing them in specificity, but showed higher sensitivity with lower specificity on LDCT scans. CONCLUSION DL algorithms demonstrated improved accuracy over human readers in malignancy and invasiveness classification on CT scans. However, their performance varies by imaging modality, underlining the importance of continued research to fully assess DL algorithms' diagnostic effectiveness in lung cancer. CLINICAL RELEVANCE STATEMENT DL algorithms have the potential to refine lung cancer diagnosis on CT, matching human sensitivity and surpassing in specificity. These findings call for further DL optimization across imaging modalities, aiming to advance clinical diagnostics and patient outcomes. KEY POINTS Lung cancer diagnosis by CT is challenging and can be improved with AI integration. DL shows higher accuracy in lung cancer detection on CT than human experts. Enhanced DL accuracy could lead to improved lung cancer diagnosis and outcomes.
Collapse
Affiliation(s)
- Ting-Wei Wang
- Institute of Biophotonics, National Yang-Ming Chiao Tung University, Taipei, Taiwan
- School of Medicine, National Yang-Ming Chiao Tung University, Taipei, Taiwan
| | - Jia-Sheng Hong
- Institute of Biophotonics, National Yang-Ming Chiao Tung University, Taipei, Taiwan
| | - Hwa-Yen Chiu
- Institute of Biophotonics, National Yang-Ming Chiao Tung University, Taipei, Taiwan
- School of Medicine, National Yang-Ming Chiao Tung University, Taipei, Taiwan
- Department of Chest Medicine, Taipei Veteran General Hospital, Taipei, Taiwan
| | - Heng-Sheng Chao
- Department of Chest Medicine, Taipei Veteran General Hospital, Taipei, Taiwan
| | - Yuh-Min Chen
- School of Medicine, National Yang-Ming Chiao Tung University, Taipei, Taiwan
- Department of Chest Medicine, Taipei Veteran General Hospital, Taipei, Taiwan
| | - Yu-Te Wu
- Institute of Biophotonics, National Yang-Ming Chiao Tung University, Taipei, Taiwan.
| |
Collapse
|
2
|
Paramasamy J, Mandal S, Blomjous M, Mulders T, Bos D, Aerts JGJV, Vanapalli P, Challa V, Sathyamurthy S, Devi R, Jain R, Visser JJ. Validation of a commercially available CAD-system for lung nodule detection and characterization using CT-scans. Eur Radiol 2024:10.1007/s00330-024-10969-0. [PMID: 39042303 DOI: 10.1007/s00330-024-10969-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Revised: 05/27/2024] [Accepted: 06/30/2024] [Indexed: 07/24/2024]
Abstract
OBJECTIVES This study aims to externally validate a commercially available Computer-Aided Detection (CAD)-system for the automatic detection and characterization of solid, part-solid, and ground-glass lung nodules (LN) on CT scans. METHODS This retrospective study encompasses 263 chest CT scans performed between January 2020 and December 2021 at a Dutch university hospital. All scans were read by a radiologist (R1) and compared with the initial radiology report. Conflicting scans were assessed by an adjudicating radiologist (R2). All scans were also processed by CAD. The standalone performance of CAD in terms of sensitivity and false-positive (FP)-rate for detection was calculated together with the sensitivity for characterization, including texture, calcification, speculation, and location. The R1's detection sensitivity was also assessed. RESULTS A total of 183 true nodules were identified in 121 nodule-containing scans (142 non-nodule-containing scans), of which R1 identified 165/183 (90.2%). CAD detected 149 nodules, of which 12 were not identified by R1, achieving a sensitivity of 149/183 (81.4%) with an FP-rate of 49/121 (0.405). CAD's detection sensitivity for solid, part-solid, and ground-glass LNs was 82/94 (87.2%), 42/47 (89.4%), and 25/42 (59.5%), respectively. The classification accuracy for solid, part-solid, and ground-glass LNs was 81/82 (98.8%), 16/42 (38.1%), and 18/25 (72.0%), respectively. Additionally, CAD demonstrated overall classification accuracies of 137/149 (91.9%), 123/149 (82.6%), and 141/149 (94.6%) for calcification, spiculation, and location, respectively. CONCLUSIONS Although the overall detection rate of this system slightly lags behind that of a radiologist, CAD is capable of detecting different LNs and thereby has the potential to enhance a reader's detection rate. While promising characterization performances are obtained, the tool's performance in terms of texture classification remains a subject of concern. CLINICAL RELEVANCE STATEMENT Numerous lung nodule computer-aided detection-systems are commercially available, with some of them solely being externally validated based on their detection performance on solid nodules. We encourage researchers to assess performances by incorporating all relevant characteristics, including part-solid and ground-glass nodules. KEY POINTS Few computer-aided detection (CAD) systems are externally validated for automatic detection and characterization of lung nodules. A detection sensitivity of 81.4% and an overall texture classification sensitivity of 77.2% were measured utilizing CAD. CAD has the potential to increase single reader detection rate, however, improvement in texture classification is required.
Collapse
Affiliation(s)
- Jasika Paramasamy
- Department of Radiology and Nuclear Medicine, Erasmus Medical Center, Dr. Molewaterplein 40, 3015 GD, Rotterdam, The Netherlands
| | - Souvik Mandal
- Qure.ai, Level 7, Oberoi Commerz II, Goregaon East, Mumbai, 400063, India
| | - Maurits Blomjous
- Department of Radiology and Nuclear Medicine, Erasmus Medical Center, Dr. Molewaterplein 40, 3015 GD, Rotterdam, The Netherlands
| | - Ties Mulders
- Department of Radiology and Nuclear Medicine, Erasmus Medical Center, Dr. Molewaterplein 40, 3015 GD, Rotterdam, The Netherlands
| | - Daniel Bos
- Department of Radiology and Nuclear Medicine, Erasmus Medical Center, Dr. Molewaterplein 40, 3015 GD, Rotterdam, The Netherlands
| | - Joachim G J V Aerts
- Department of Radiology and Nuclear Medicine, Erasmus Medical Center, Dr. Molewaterplein 40, 3015 GD, Rotterdam, The Netherlands
| | - Prakash Vanapalli
- Qure.ai, Level 7, Oberoi Commerz II, Goregaon East, Mumbai, 400063, India
| | - Vikash Challa
- Qure.ai, Level 7, Oberoi Commerz II, Goregaon East, Mumbai, 400063, India
| | | | - Ranjana Devi
- Qure.ai, Level 7, Oberoi Commerz II, Goregaon East, Mumbai, 400063, India
| | - Ritvik Jain
- Qure.ai, Level 7, Oberoi Commerz II, Goregaon East, Mumbai, 400063, India
| | - Jacob J Visser
- Department of Radiology and Nuclear Medicine, Erasmus Medical Center, Dr. Molewaterplein 40, 3015 GD, Rotterdam, The Netherlands.
| |
Collapse
|
3
|
Sollini M, Kirienko M, Gozzi N, Bruno A, Torrisi C, Balzarini L, Voulaz E, Alloisio M, Chiti A. The Development of an Intelligent Agent to Detect and Non-Invasively Characterize Lung Lesions on CT Scans: Ready for the "Real World"? Cancers (Basel) 2023; 15:cancers15020357. [PMID: 36672306 PMCID: PMC9856443 DOI: 10.3390/cancers15020357] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Revised: 12/04/2022] [Accepted: 01/02/2023] [Indexed: 01/06/2023] Open
Abstract
(1) Background: Once lung lesions are identified on CT scans, they must be characterized by assessing the risk of malignancy. Despite the promising performance of computer-aided systems, some limitations related to the study design and technical issues undermine these tools' efficiency; an "intelligent agent" to detect and non-invasively characterize lung lesions on CT scans is proposed. (2) Methods: Two main modules tackled the detection of lung nodules on CT scans and the diagnosis of each nodule into benign and malignant categories. Computer-aided detection (CADe) and computer aided-diagnosis (CADx) modules relied on deep learning techniques such as Retina U-Net and the convolutional neural network; (3) Results: Tests were conducted on one publicly available dataset and two local datasets featuring CT scans acquired with different devices to reveal deep learning performances in "real-world" clinical scenarios. The CADe module reached an accuracy rate of 78%, while the CADx's accuracy, specificity, and sensitivity stand at 80%, 73%, and 85.7%, respectively; (4) Conclusions: Two different deep learning techniques have been adapted for CADe and CADx purposes in both publicly available and private CT scan datasets. Experiments have shown adequate performance in both detection and diagnosis tasks. Nevertheless, some drawbacks still characterize the supervised learning paradigm employed in networks such as CNN and Retina U-Net in real-world clinical scenarios, with CT scans from different devices with different sensors' fingerprints and spatial resolution. Continuous reassessment of CADe and CADx's performance is needed during their implementation in clinical practice.
Collapse
Affiliation(s)
- Martina Sollini
- Department of Biomedical Sciences, Humanitas University, Via Rita Levi Montalcini 4, Pieve Emanuele, 20090 Milan, Italy
- IRCCS Humanitas Research Hospital, Via Manzoni 56, Rozzano, 20089 Milan, Italy
| | - Margarita Kirienko
- Fondazione IRCCS Istituto Nazionale Tumori, Via G. Venezian 1, 20133 Milan, Italy
| | - Noemi Gozzi
- Laboratory for Neuroengineering, Department of Health Sciences and Technology, Institute for Robotics and Intelligent Systems, ETH Zurich, 8092 Zurich, Switzerland
| | - Alessandro Bruno
- Department of Biomedical Sciences, Humanitas University, Via Rita Levi Montalcini 4, Pieve Emanuele, 20090 Milan, Italy
| | - Chiara Torrisi
- IRCCS Humanitas Research Hospital, Via Manzoni 56, Rozzano, 20089 Milan, Italy
| | - Luca Balzarini
- IRCCS Humanitas Research Hospital, Via Manzoni 56, Rozzano, 20089 Milan, Italy
| | - Emanuele Voulaz
- Department of Biomedical Sciences, Humanitas University, Via Rita Levi Montalcini 4, Pieve Emanuele, 20090 Milan, Italy
- IRCCS Humanitas Research Hospital, Via Manzoni 56, Rozzano, 20089 Milan, Italy
| | - Marco Alloisio
- Department of Biomedical Sciences, Humanitas University, Via Rita Levi Montalcini 4, Pieve Emanuele, 20090 Milan, Italy
- IRCCS Humanitas Research Hospital, Via Manzoni 56, Rozzano, 20089 Milan, Italy
| | - Arturo Chiti
- Department of Biomedical Sciences, Humanitas University, Via Rita Levi Montalcini 4, Pieve Emanuele, 20090 Milan, Italy
- IRCCS Humanitas Research Hospital, Via Manzoni 56, Rozzano, 20089 Milan, Italy
- Correspondence:
| |
Collapse
|
4
|
Zhang R, Guo Y, Yan Y, Liu Y, Zhu Y, Kang J, Li F, Sun X, Xing L, Xu Y. A Propensity-Matched Analysis of Survival of Clinically Diagnosed Early-Stage Lung Cancer and Biopsy-Proven Early-Stage Non-Small Cell Lung Cancer Following Stereotactic Ablative Radiotherapy. Front Oncol 2021; 11:720847. [PMID: 34504798 PMCID: PMC8421845 DOI: 10.3389/fonc.2021.720847] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2021] [Accepted: 08/05/2021] [Indexed: 12/25/2022] Open
Abstract
Purpose Stereotactic body radiotherapy (SBRT) has been increasingly regarded as a reasonable option for early-stage lung cancer patients without pretreatment pathologic results, but the efficacy and safety in a Chinese population remains unclear. The aim of this study was to compare survival outcomes and toxicities between patients with clinically diagnosed early-stage lung cancer or biopsy-proven early-stage non-small cell lung cancer and to demonstrate the rationality of this treatment. Material and Methods From May 2012 to December 2018, 56 patients with clinically diagnosed early-stage lung cancer and 60 patients with early-stage biopsy-proven were selected into non-pathological group and pathological group, respectively. Propensity score matching (PSM) was performed to reduce patient selection bias. Survival analysis with log-rank test was used to assess the differences of treatment outcomes, which included local control (LC), progression-free survival (PFS), and overall survival (OS). Results The median age was 76 (range 47–93) years, and the median follow-up time was 58.3 (range 4.3–95.1) months in the cohort without pathologic results. The median age was 74 (range 57–88) years, and the median follow-up time was 56.3 (range 2.6–94) months in the cohort with pathologic results. 45 matched-pair were analyzed. The 5-year LC, PFS, and OS rates in matched-pair patients with or without pathologic biopsy were 85.5% and 89.8%, 40.6% and 70.9%, and 63.2% and 76.1%, respectively. On Kaplan-Meier survival analysis after PSM analysis, there was no significant difference between patients with pathologic results versus patients with no pathologic results in terms of LC (P= 0.498) and OS (P=0.141). Of the matched-pair patients treated with SBRT, only 1 patient experienced grade 3 or above radiation pneumonitis. Conclusion For early-stage lung cancer patients with medically inoperable or not suitable for invasive diagnosis, SBRT may be a good local treatment.
Collapse
Affiliation(s)
- Ran Zhang
- Department of Radiation Oncology, Shanghai Pulmonary Hospital, School of Medicine, Tongji University, Shanghai, China.,Tongji University, Shanghai, China
| | - Yanling Guo
- Department of Radiation Oncology, Shanghai Pulmonary Hospital, School of Medicine, Tongji University, Shanghai, China.,Tongji University, Shanghai, China
| | - Yujie Yan
- Department of Radiation Oncology, Shanghai Pulmonary Hospital, School of Medicine, Tongji University, Shanghai, China.,Tongji University, Shanghai, China
| | - Yuanjun Liu
- First Clinical Medical School, Wenzhou Medical University, Wenzhou, China
| | - Yaoyao Zhu
- Department of Radiation Oncology, Shanghai Pulmonary Hospital, School of Medicine, Tongji University, Shanghai, China
| | - Jingjing Kang
- Department of Radiation Oncology, Shanghai Pulmonary Hospital, School of Medicine, Tongji University, Shanghai, China
| | - Fangjuan Li
- Department of Radiation Oncology, Shanghai Pulmonary Hospital, School of Medicine, Tongji University, Shanghai, China
| | - Xiaojiang Sun
- Department of Radiation Oncology, Cancer Hospital of University of Chinese Academy of Sciences (Zhejiang Cancer Hospital), Institute of Cancer and Basic Medicine (IBMC), Chinese Academy of Sciences, Hangzhou, China
| | - Ligang Xing
- Department of Radiation Oncology, Shandong Key Laboratory of Radiation Oncology, Shandong Cancer Hospital and Institute, Shandong First Medical University and Shandong Academy of Medical Sciences, Jinan, China
| | - Yaping Xu
- Department of Radiation Oncology, Shanghai Pulmonary Hospital, School of Medicine, Tongji University, Shanghai, China
| |
Collapse
|
5
|
Dipaola F, Shiffer D, Gatti M, Menè R, Solbiati M, Furlan R. Machine Learning and Syncope Management in the ED: The Future Is Coming. ACTA ACUST UNITED AC 2021; 57:medicina57040351. [PMID: 33917508 PMCID: PMC8067452 DOI: 10.3390/medicina57040351] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2021] [Revised: 03/30/2021] [Accepted: 04/02/2021] [Indexed: 11/16/2022]
Abstract
In recent years, machine learning (ML) has been promisingly applied in many fields of clinical medicine, both for diagnosis and prognosis prediction. Aims of this narrative review were to summarize the basic concepts of ML applied to clinical medicine and explore its main applications in the emergency department (ED) setting, with a particular focus on syncope management. Through an extensive literature search in PubMed and Embase, we found increasing evidence suggesting that the use of ML algorithms can improve ED triage, diagnosis, and risk stratification of many diseases. However, the lacks of external validation and reliable diagnostic standards currently limit their implementation in clinical practice. Syncope represents a challenging problem for the emergency physician both because its diagnosis is not supported by specific tests and the available prognostic tools proved to be inefficient. ML algorithms have the potential to overcome these limitations and, in the future, they could support the clinician in managing syncope patients more efficiently. However, at present only few studies have addressed this issue, albeit with encouraging results.
Collapse
Affiliation(s)
- Franca Dipaola
- Department of Biomedical Sciences, Humanitas University, Pieve Emanuele, 20090 Milan, Italy; (D.S.); (R.F.)
- Internal Medicine, Humanitas Clinical and Research Center—IRCCS, Rozzano, 20089 Milan, Italy
- Correspondence: ; Tel.: +39-0282247266
| | - Dana Shiffer
- Department of Biomedical Sciences, Humanitas University, Pieve Emanuele, 20090 Milan, Italy; (D.S.); (R.F.)
| | - Mauro Gatti
- IBM, Active Intelligence Center, 40121 Bologna, Italy;
| | - Roberto Menè
- Department of Medicine and Surgery, University of Milano-Bicocca, 20126 Milan, Italy;
| | - Monica Solbiati
- Fondazione IRCCS Ca’ Granda Ospedale Maggiore Policlinico, 20122 Milan, Italy;
- Dipartimento di Scienze Cliniche e di Comunità, Università degli Studi di Milano, 20122 Milan, Italy
| | - Raffaello Furlan
- Department of Biomedical Sciences, Humanitas University, Pieve Emanuele, 20090 Milan, Italy; (D.S.); (R.F.)
- Internal Medicine, Humanitas Clinical and Research Center—IRCCS, Rozzano, 20089 Milan, Italy
| |
Collapse
|