1
|
Ma J, Yang H, Chou Y, Yoon J, Allison T, Komandur R, McDunn J, Tasneem A, Do RK, Schwartz LH, Zhao B. Generalizability of lesion detection and segmentation when ScaleNAS is trained on a large multi-organ dataset and validated in the liver. Med Phys 2025; 52:1005-1018. [PMID: 39576046 DOI: 10.1002/mp.17504] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2024] [Revised: 09/25/2024] [Accepted: 10/05/2024] [Indexed: 02/04/2025] Open
Abstract
BACKGROUND Tumor assessment through imaging is crucial for diagnosing and treating cancer. Lesions in the liver, a common site for metastatic disease, are particularly challenging to accurately detect and segment. This labor-intensive task is subject to individual variation, which drives interest in automation using artificial intelligence (AI). PURPOSE Evaluate AI for lesion detection and lesion segmentation using CT in the context of human performance on the same task. Use internal testing to determine how an AI-developed model (ScaleNAS) trained on lesions in multiple organs performs when tested specifically on liver lesions in a dataset integrating real-world and clinical trial data. Use external testing to evaluate whether ScaleNAS's performance generalizes to publicly available colorectal liver metastases (CRLM) from The Cancer Imaging Archive (TCIA). METHODS The CUPA study dataset included patients whose CT scan of chest, abdomen, or pelvis at Columbia University between 2010-2020 indicated solid tumors (CUIMC, n = 5011) and from two clinical trials in metastatic colorectal cancer, PRIME (n = 1183) and Amgen (n = 463). Inclusion required ≥1 measurable lesion; exclusion criteria eliminated 1566 patients. Data were divided at the patient level into training (n = 3996), validation (n = 570), and testing (n = 1529) sets. To create the reference standard for training and validation, each case was annotated by one of six radiologists, randomly assigned, who marked the CUPA lesions without access to any previous annotations. For internal testing we refined the CUPA test set to contain only patients who had liver lesions (n = 525) and formed an enhanced reference standard through expert consensus reviewing prior annotations. For external testing, TCIA-CRLM (n = 197) formed the test set. The reference standard for TCIA-CRLM was formed by consensus review of the original annotation and contours by two new radiologists. Metrics for lesion detection were sensitivity and false positives. Lesion segmentation was assessed with median Dice coefficient, under-segmentation ratio (USR), and over-segmentation ratio (OSR). Subgroup analysis examined the influence of lesion size ≥ 10 mm (measurable by RECIST1.1) versus all lesions (important for early identification of disease progression). RESULTS ScaleNAS trained on all lesions achieved sensitivity of 71.4% and Dice of 70.2% for liver lesions in the CUPA internal test set (3,495 lesions) and sensitivity of 68.2% and Dice 64.2% in the TCIA-CRLM external test set (638 lesions). Human radiologists had mean sensitivity of 53.5% and Dice of 73.9% in CUPA and sensitivity of 84.1% and Dice of 88.4% in TCIA-CRLM. Performance improved for ScaleNAS and radiologists in the subgroup of lesions that excluded sub-centimeter lesions. CONCLUSIONS Our study presents the first evaluation of ScaleNAS in medical imaging, demonstrating its liver lesion detection and segmentation performance across diverse datasets. Using consensus reference standards from multiple radiologists, we addressed inter-observer variability and contributed to consistency in lesion annotation. While ScaleNAS does not surpass radiologists in performance, it offers fast and reliable results with potential utility in providing initial contours for radiologists. Future work will extend this model to lung and lymph node lesions, ultimately aiming to enhance clinical applications by generalizing detection and segmentation across tissue types.
Collapse
Affiliation(s)
- Jingchen Ma
- Department of Radiology, Columbia University Irving Medical Center, New York, New York, USA
- Department of Radiology, Memorial Sloan Kettering Cancer Center, New York, New York, USA
| | - Hao Yang
- Department of Radiology, Columbia University Irving Medical Center, New York, New York, USA
- Department of Radiology, Memorial Sloan Kettering Cancer Center, New York, New York, USA
| | - Yen Chou
- Department of Radiology, Columbia University Irving Medical Center, New York, New York, USA
- Fu Jen Catholic University Hospital, Department of Medical Imaging and Fu Jen Catholic University, School of Medicine, New Taipei City, Taiwan
| | - Jin Yoon
- Department of Radiology, Columbia University Irving Medical Center, New York, New York, USA
| | - Tavis Allison
- Department of Radiology, Columbia University Irving Medical Center, New York, New York, USA
- Department of Radiology, Memorial Sloan Kettering Cancer Center, New York, New York, USA
| | | | - Jon McDunn
- Project Data Sphere, Cary, North Carolina, USA
| | | | - Richard K Do
- Department of Radiology, Memorial Sloan Kettering Cancer Center, New York, New York, USA
| | - Lawrence H Schwartz
- Department of Radiology, Columbia University Irving Medical Center, New York, New York, USA
- Department of Radiology, Memorial Sloan Kettering Cancer Center, New York, New York, USA
| | - Binsheng Zhao
- Department of Radiology, Columbia University Irving Medical Center, New York, New York, USA
- Department of Radiology, Memorial Sloan Kettering Cancer Center, New York, New York, USA
| |
Collapse
|
2
|
Hajjo R, Sabbah DA, Bardaweel SK, Tropsha A. Identification of Tumor-Specific MRI Biomarkers Using Machine Learning (ML). Diagnostics (Basel) 2021; 11:742. [PMID: 33919342 PMCID: PMC8143297 DOI: 10.3390/diagnostics11050742] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2021] [Revised: 04/09/2021] [Accepted: 04/12/2021] [Indexed: 02/06/2023] Open
Abstract
The identification of reliable and non-invasive oncology biomarkers remains a main priority in healthcare. There are only a few biomarkers that have been approved as diagnostic for cancer. The most frequently used cancer biomarkers are derived from either biological materials or imaging data. Most cancer biomarkers suffer from a lack of high specificity. However, the latest advancements in machine learning (ML) and artificial intelligence (AI) have enabled the identification of highly predictive, disease-specific biomarkers. Such biomarkers can be used to diagnose cancer patients, to predict cancer prognosis, or even to predict treatment efficacy. Herein, we provide a summary of the current status of developing and applying Magnetic resonance imaging (MRI) biomarkers in cancer care. We focus on all aspects of MRI biomarkers, starting from MRI data collection, preprocessing and machine learning methods, and ending with summarizing the types of existing biomarkers and their clinical applications in different cancer types.
Collapse
Affiliation(s)
- Rima Hajjo
- Department of Pharmacy, Faculty of Pharmacy, Al-Zaytoonah University of Jordan, P.O. Box 130, Amman 11733, Jordan;
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, The University of North Carlina at Chapel Hill, Chapel Hill, NC 27599, USA;
- National Center for Epidemics and Communicable Disease Control, Amman 11118, Jordan
| | - Dima A. Sabbah
- Department of Pharmacy, Faculty of Pharmacy, Al-Zaytoonah University of Jordan, P.O. Box 130, Amman 11733, Jordan;
| | - Sanaa K. Bardaweel
- Department of Pharmaceutical Sciences, School of Pharmacy, University of Jordan, Amman 11942, Jordan;
| | - Alexander Tropsha
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, The University of North Carlina at Chapel Hill, Chapel Hill, NC 27599, USA;
| |
Collapse
|
3
|
deSouza NM, Achten E, Alberich-Bayarri A, Bamberg F, Boellaard R, Clément O, Fournier L, Gallagher F, Golay X, Heussel CP, Jackson EF, Manniesing R, Mayerhofer ME, Neri E, O'Connor J, Oguz KK, Persson A, Smits M, van Beek EJR, Zech CJ. Validated imaging biomarkers as decision-making tools in clinical trials and routine practice: current status and recommendations from the EIBALL* subcommittee of the European Society of Radiology (ESR). Insights Imaging 2019; 10:87. [PMID: 31468205 PMCID: PMC6715762 DOI: 10.1186/s13244-019-0764-0] [Citation(s) in RCA: 61] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2019] [Accepted: 06/28/2019] [Indexed: 12/12/2022] Open
Abstract
Observer-driven pattern recognition is the standard for interpretation of medical images. To achieve global parity in interpretation, semi-quantitative scoring systems have been developed based on observer assessments; these are widely used in scoring coronary artery disease, the arthritides and neurological conditions and for indicating the likelihood of malignancy. However, in an era of machine learning and artificial intelligence, it is increasingly desirable that we extract quantitative biomarkers from medical images that inform on disease detection, characterisation, monitoring and assessment of response to treatment. Quantitation has the potential to provide objective decision-support tools in the management pathway of patients. Despite this, the quantitative potential of imaging remains under-exploited because of variability of the measurement, lack of harmonised systems for data acquisition and analysis, and crucially, a paucity of evidence on how such quantitation potentially affects clinical decision-making and patient outcome. This article reviews the current evidence for the use of semi-quantitative and quantitative biomarkers in clinical settings at various stages of the disease pathway including diagnosis, staging and prognosis, as well as predicting and detecting treatment response. It critically appraises current practice and sets out recommendations for using imaging objectively to drive patient management decisions.
Collapse
Affiliation(s)
- Nandita M deSouza
- Cancer Research UK Imaging Centre, The Institute of Cancer Research and The Royal Marsden Hospital, Downs Road, Sutton, Surrey, SM2 5PT, UK.
| | | | | | - Fabian Bamberg
- Department of Radiology, University of Freiburg, Freiburg im Breisgau, Germany
| | | | | | | | | | | | - Claus Peter Heussel
- Universitätsklinik Heidelberg, Translational Lung Research Center (TLRC), German Center for Lung Research (DZL), University of Heidelberg, Im Neuenheimer Feld 156, 69120, Heidelberg, Germany
| | - Edward F Jackson
- University of Wisconsin School of Medicine and Public Health, Madison, WI, USA
| | - Rashindra Manniesing
- Department of Radiology and Nuclear Medicine, Radboud University Medical Center, Geert Grooteplein 10, 6525, GA, Nijmegen, The Netherlands
| | | | - Emanuele Neri
- Department of Translational Research, University of Pisa, Pisa, Italy
| | - James O'Connor
- Division of Cancer Sciences, University of Manchester, Manchester, UK
| | | | | | - Marion Smits
- Department of Radiology and Nuclear Medicine (Ne-515), Erasmus MC, PO Box 2040, 3000, CA, Rotterdam, The Netherlands
| | - Edwin J R van Beek
- Edinburgh Imaging, Queen's Medical Research Institute, Edinburgh Bioquarter, 47 Little France Crescent, Edinburgh, UK
| | - Christoph J Zech
- University Hospital Basel, Radiology and Nuclear Medicine, University of Basel, Petersgraben 4, CH-4031, Basel, Switzerland
| |
Collapse
|
4
|
Athelogou M, Kim HJ, Dima A, Obuchowski N, Peskin A, Gavrielides MA, Petrick N, Saiprasad G, Colditz Colditz D, Beaumont H, Oubel E, Tan Y, Zhao B, Kuhnigk JM, Moltz JH, Orieux G, Gillies RJ, Gu Y, Mantri N, Goldmacher G, Zhang L, Vega E, Bloom M, Jarecha R, Soza G, Tietjen C, Takeguchi T, Yamagata H, Peterson S, Masoud O, Buckler AJ. Algorithm Variability in the Estimation of Lung Nodule Volume From Phantom CT Scans: Results of the QIBA 3A Public Challenge. Acad Radiol 2016; 23:940-52. [PMID: 27215408 PMCID: PMC6237094 DOI: 10.1016/j.acra.2016.02.018] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2014] [Revised: 02/29/2016] [Accepted: 02/29/2016] [Indexed: 11/19/2022]
Abstract
RATIONALE AND OBJECTIVES Quantifying changes in lung tumor volume is important for diagnosis, therapy planning, and evaluation of response to therapy. The aim of this study was to assess the performance of multiple algorithms on a reference data set. The study was organized by the Quantitative Imaging Biomarker Alliance (QIBA). MATERIALS AND METHODS The study was organized as a public challenge. Computed tomography scans of synthetic lung tumors in an anthropomorphic phantom were acquired by the Food and Drug Administration. Tumors varied in size, shape, and radiodensity. Participants applied their own semi-automated volume estimation algorithms that either did not allow or allowed post-segmentation correction (type 1 or 2, respectively). Statistical analysis of accuracy (percent bias) and precision (repeatability and reproducibility) was conducted across algorithms, as well as across nodule characteristics, slice thickness, and algorithm type. RESULTS Eighty-four percent of volume measurements of QIBA-compliant tumors were within 15% of the true volume, ranging from 66% to 93% across algorithms, compared to 61% of volume measurements for all tumors (ranging from 37% to 84%). Algorithm type did not affect bias substantially; however, it was an important factor in measurement precision. Algorithm precision was notably better as tumor size increased, worse for irregularly shaped tumors, and on the average better for type 1 algorithms. Over all nodules meeting the QIBA Profile, precision, as measured by the repeatability coefficient, was 9.0% compared to 18.4% overall. CONCLUSION The results achieved in this study, using a heterogeneous set of measurement algorithms, support QIBA quantitative performance claims in terms of volume measurement repeatability for nodules meeting the QIBA Profile criteria.
Collapse
Affiliation(s)
| | - Hyun J Kim
- UCLA, Center for Computer Vision and Imaging Biomarkers, Dept. of Radiological Sciences David Geffen School of Medicine at UCLA Dept. of Biostatistics Fielding School of Public at UCLA, Los Angeles, USA
| | - Alden Dima
- National Institute of Standards and Technology, Gaithersburg, USA
| | - Nancy Obuchowski
- Quantitative Health Sciences/JJN3, Cleveland Clinic Foundation, Cleveland, USA
| | - Adele Peskin
- National Institute of Standards and Technology, Gaithersburg, USA
| | | | | | - Ganesh Saiprasad
- National Institute of Standards and Technology, Gaithersburg, USA
| | | | | | | | - Yongqiang Tan
- Columbia University Medical Center, Department of Radiology, New York, USA
| | - Binsheng Zhao
- Columbia University Medical Center, Department of Radiology, New York, USA
| | - Jan-Martin Kuhnigk
- Fraunhofer MEVIS, Institute for Medical Image Computing, Bremen, Germany
| | - Jan Hendrik Moltz
- Fraunhofer MEVIS, Institute for Medical Image Computing, Bremen, Germany
| | | | - Robert J Gillies
- Moffitt Cancer Center and Research Institute, Tampa, Florida, USA
| | - Yuhua Gu
- Moffitt Cancer Center and Research Institute, Tampa, Florida, USA
| | - Ninad Mantri
- ICON Medical Imaging, Warrington, Pennsylvania, USA
| | | | | | - Emilio Vega
- NYU Langone Medical Center Faculty Practice Radiology, New York, USA
| | - Michael Bloom
- NYU Langone Medical Center Faculty Practice Radiology, New York, USA
| | | | - Grzegorz Soza
- Siemens AG, Healthcare Sector, Computed Tomography, Forchheim, Germany
| | - Christian Tietjen
- Siemens AG, Healthcare Sector, Computed Tomography, Forchheim, Germany
| | | | - Hitoshi Yamagata
- Toshiba Corporation, Toshiba Medical Systems Corporation, Otawara, Japan
| | - Sam Peterson
- Vital Images, Inc. (a Toshiba Medical Systems Group), Minnesota, USA
| | - Osama Masoud
- Vital Images, Inc. (a Toshiba Medical Systems Group), Minnesota, USA
| | | |
Collapse
|