Reference Citation Analysis

For:	Metz CE, Pan X. "Proper" Binormal ROC Curves: Theory and Maximum-Likelihood Estimation. J Math Psychol 1999;43:1-33. [PMID: 10069933 DOI: 10.1006/jmps.1998.1218] [Citation(s) in RCA: 181] [Impact Index Per Article: 7.2] [Indexed: 05/23/2023]

Cited by Other Article(s)

Wayne N, Wu Q, Moore SC, Ferrari VA, Metzler SD, Guerraty MA. Multimodality assessment of the coronary microvasculature with TIMI frame count versus perfusion PET highlights coronary changes characteristic of coronary microvascular disease. Front Cardiovasc Med 2024;11:1395036. [PMID: 38966750 PMCID: PMC11222597 DOI: 10.3389/fcvm.2024.1395036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/02/2024] [Accepted: 06/07/2024] [Indexed: 07/06/2024]

Jiang Y, Iuanow E, Malik B, Klock J. A Multireader Multicase (MRMC) Receiver Operating Characteristic (ROC) Study Evaluating Noninferiority of Quantitative Transmission (QT) Ultrasound to Digital Breast Tomosynthesis (DBT) on Detection and Recall of Breast Lesions. Acad Radiol 2024;31:2248-2258. [PMID: 38290888 DOI: 10.1016/j.acra.2023.12.038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/16/2023] [Revised: 12/16/2023] [Accepted: 12/26/2023] [Indexed: 02/01/2024]

Abstract

RATIONALE AND OBJECTIVES

Quantitative transmission (QT) imaging is an emerging volumetric ultrasound modality for women too young for mammography. Because QT images tissue without the overlap seen in mammography, it can potentially improve breast mass detection and characterization and noncancer recall. We compared radiologists' interpretation of QT vs digital breast tomosynthesis (DBT) in a multireader multicase observer performance study.

MATERIALS AND METHODS

Study subjects received screening DBT and QT scans in HIPAA-compliant, institutional review board-approved prospective case-collection studies at four clinical sites. Twenty-four Mammography Quality Standards Act-qualified radiologists interpreted 177 cases (66 with cancer, atypia, or solid mass and 111 normal or with nonsolid benign abnormality), first QT and then, 2 weeks later, DBT synthesized 2D-views. Readers reported up to three findings per case and, for each finding, a recall or no-recall decision and their confidence in that decision. The study hypothesis was that the area under the receiver operating characteristic curve (AUC) of QT was noninferior to that of DBT. Sensitivity and specificity were also compared.

RESULTS

AUC of QT (0.746 ± 0.028, mean ± SD) was noninferior to DBT (0.700 ± 0.028) for AUC difference margin of -0.05 (P < .05). AUC difference was 0.046 ± 0.028 (95% CI: [-0.008, 0.101]). Sensitivity was 70.6 ± 7.2% for QT and 85.2 ± 6.4% for DBT, specificity was 60.1 ± 12.3% vs 37.2 ± 11.0%, and both differences were statistically significant. Of a total of 21 cases of cysts, readers recommended recall, on average, in 1.1 ± 1.4 cases with QT, but not with DBT, and 10.6 ± 2.2 cases with DBT, but not with QT.

CONCLUSION

QT can be a potential alternative to mammography for breast cancer screening of women too young to undergo mammography.
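The noninferiority result reported in this abstract can be checked with a little arithmetic. The sketch below is only an illustration, not the authors' MRMC analysis: it assumes the reported ± 0.028 on the AUC difference can be treated as a standard error and that a normal approximation applies. The function name `noninferiority_check` is hypothetical.

```python
from statistics import NormalDist


def noninferiority_check(diff, se, margin, level=0.95):
    """Return (lower, upper, noninferior) for a two-sided normal-approximation CI.

    Assumption: `se` is the standard error of `diff`. Noninferiority is
    declared when the lower confidence bound lies above the margin.
    """
    z = NormalDist().inv_cdf(0.5 + level / 2.0)  # about 1.96 for 95%
    lower = diff - z * se
    upper = diff + z * se
    return lower, upper, lower > margin


# Values taken from the abstract: AUC difference 0.046 +/- 0.028, margin -0.05.
lower, upper, noninferior = noninferiority_check(diff=0.046, se=0.028, margin=-0.05)
print(f"95% CI: [{lower:.3f}, {upper:.3f}]; noninferior: {noninferior}")
```

Under these assumptions the interval comes out close to the reported [-0.008, 0.101]; the small gap in the lower bound is consistent with rounding of the published figures. The lower bound sits well above the -0.05 margin, which is what the noninferiority claim requires.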

Whitney HM, Drukker K, Vieceli M, Dusen AV, de Oliveira M, Abe H, Giger ML. Role of sureness in evaluating AI/CADx: Lesion-based repeatability of machine learning classification performance on breast MRI. Med Phys 2024;51:1812-1821. [PMID: 37602841 PMCID: PMC10879454 DOI: 10.1002/mp.16673] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/27/2023] [Revised: 07/24/2023] [Accepted: 07/24/2023] [Indexed: 08/22/2023]

Abstract

BACKGROUND

Artificial intelligence/computer-aided diagnosis (AI/CADx) and its use of radiomics have shown potential in diagnosis and prognosis of breast cancer. Performance metrics such as the area under the receiver operating characteristic (ROC) curve (AUC) are frequently used as figures of merit for the evaluation of CADx. Methods for evaluating lesion-based measures of performance may enhance the assessment of AI/CADx pipelines, particularly in the situation of comparing performances by classifier.

PURPOSE

The purpose of this study was to investigate the use case of two standard classifiers to (1) compare overall classification performance of the classifiers in the task of distinguishing between benign and malignant breast lesions using radiomic features extracted from dynamic contrast-enhanced magnetic resonance (DCE-MR) images, (2) define a new repeatability metric (termed sureness), and (3) use sureness to examine if one classifier provides an advantage in AI diagnostic performance by lesion when using radiomic features.

METHODS

Images of 1052 breast lesions (201 benign, 851 cancers) had been retrospectively collected under HIPAA/IRB compliance. The lesions had been segmented automatically using a fuzzy c-means method, and thirty-two radiomic features had been extracted. Classification was investigated for the task of distinguishing malignant lesions (81% of the dataset) from benign lesions (19%). Two classifiers (linear discriminant analysis [LDA] and support vector machines [SVM]) were trained and tested within 0.632 bootstrap analyses (2000 iterations). Whole-set classification performance was evaluated at two levels: (1) the 0.632+ bias-corrected area under the ROC curve (AUC) and (2) performance metric curves, which give variability in operating sensitivity and specificity at a target operating point (95% target sensitivity). Sureness was defined as 1 minus the 95% confidence interval of the classifier output for each lesion for each classifier. Lesion-based repeatability was evaluated at two levels: (1) repeatability profiles, which represent the distribution of sureness across the decision threshold, and (2) sureness of each lesion. The latter was used to identify lesions with better sureness with one classifier over another while maintaining lesion-based performance across the bootstrap iterations.

RESULTS

In classification performance assessment, the median and 95% CI of difference in AUC between the two classifiers did not show evidence of difference (ΔAUC = -0.003 [-0.031, 0.018]). Both classifiers achieved the target sensitivity. Sureness was more consistent across the classifier output range for the SVM classifier than the LDA classifier. The SVM resulted in a net gain of 33 benign lesions and 307 cancers with higher sureness and maintained lesion-based performance. However, with the LDA there was a notable percentage of benign lesions (42%) with better sureness but lower lesion-based performance.

CONCLUSIONS

When there is no evidence for difference in performance between classifiers using AUC or other performance summary measures, a lesion-based sureness metric may provide additional insight into AI pipeline design. These findings present and emphasize the utility of lesion-based repeatability via sureness in AI/CADx as a complementary enhancement to other evaluation measures.
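The sureness metric described in the Methods can be sketched in a few lines. This is a hedged illustration only: it assumes the "95% confidence interval" means the width between the 2.5th and 97.5th percentiles of one lesion's classifier outputs across bootstrap iterations, and that outputs lie in [0, 1]. The helper names `percentile` and `sureness` and the example values are hypothetical, not taken from the paper.

```python
def percentile(sorted_vals, q):
    """Linear-interpolation percentile of an already sorted list, q in [0, 100]."""
    pos = (len(sorted_vals) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(sorted_vals) - 1)
    return sorted_vals[lo] + (sorted_vals[hi] - sorted_vals[lo]) * (pos - lo)


def sureness(outputs):
    """1 minus the width of the 95% percentile interval of one lesion's outputs.

    Assumption: `outputs` holds the classifier output for a single lesion,
    one value per bootstrap iteration in which the lesion was tested.
    """
    vals = sorted(outputs)
    width = percentile(vals, 97.5) - percentile(vals, 2.5)
    return 1.0 - width


# Made-up outputs for two lesions across 41 bootstrap iterations.
stable_lesion = [0.80 + 0.001 * i for i in range(41)]   # outputs barely move
unstable_lesion = [0.30 + 0.01 * i for i in range(41)]  # outputs vary widely

print(round(sureness(stable_lesion), 3), round(sureness(unstable_lesion), 3))
```

A lesion whose output barely changes between bootstrap iterations gets a sureness near 1, while a lesion whose output swings widely gets a lower value, which is the repeatability behavior the abstract describes.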

Pretorius PH, Liu J, Kalluri KS, Jiang Y, Leppo JA, Dahlberg ST, Kikut J, Parker MW, Keating FK, Licho R, Auer B, Lindsay C, Konik A, Yang Y, Wernick MN, King MA. Observer studies of image quality of denoising reduced-count cardiac single photon emission computed tomography myocardial perfusion imaging by three-dimensional Gaussian post-reconstruction filtering and deep learning. J Nucl Cardiol 2023;30:2427-2437. [PMID: 37221409 PMCID: PMC11401514 DOI: 10.1007/s12350-023-03295-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 12/06/2022] [Accepted: 04/25/2023] [Indexed: 05/25/2023]

Zhou W, Villa U, Anastasio MA. Ideal Observer Computation by Use of Markov-Chain Monte Carlo With Generative Adversarial Networks. IEEE Trans Med Imaging 2023;42:3715-3724. [PMID: 37578916 PMCID: PMC10769588 DOI: 10.1109/tmi.2023.3304907] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 08/16/2023]

Shenouda M, Flerlage I, Kaveti A, Giger ML, Armato SG. Assessment of a deep learning model for COVID-19 classification on chest radiographs: a comparison across image acquisition techniques and clinical factors. J Med Imaging (Bellingham) 2023;10:064504. [PMID: 38162317 PMCID: PMC10753846 DOI: 10.1117/1.jmi.10.6.064504] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/21/2023] [Revised: 11/30/2023] [Accepted: 12/06/2023] [Indexed: 01/03/2024]

Abstract

Purpose

The purpose is to assess the performance of a pre-trained deep learning model in the task of classifying between coronavirus disease (COVID)-positive and COVID-negative patients from chest radiographs (CXRs) while considering various image acquisition parameters, clinical factors, and patient demographics.

Methods

Standard and soft-tissue CXRs of 9860 patients comprised the "original dataset," which consisted of training and test sets and was used to train a DenseNet-121 architecture model to classify COVID-19 using three classification algorithms: standard, soft tissue, and a combination of both types of images via feature fusion. A larger, more current test set of 5893 patients (the "current test set") was used to assess the performance of the pretrained model. The current test set contained a larger span of dates, incorporated different variants of the virus, and included different immunization statuses. Model performance between the original and current test sets was evaluated using area under the receiver operating characteristic curve (ROC AUC) [95% CI].

Results

The model achieved AUC values of 0.67 [0.65, 0.70] for cropped standard images, 0.65 [0.63, 0.67] for cropped soft-tissue images, and 0.67 [0.65, 0.69] for both types of cropped images. These were all significantly lower than the performance of the model on the original test set. Investigations regarding matching the acquisition dates between the test sets (i.e., controlling for virus variants), immunization status, disease severity, and age and sex distributions did not fully explain the discrepancy in performance.

Conclusions

Several relevant factors were considered to determine whether differences existed in the test sets, including time period of image acquisition, vaccination status, and disease severity. The lower performance on the current test set may have occurred due to model overfitting and a lack of generalizability.
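The abstract reports ROC AUC with 95% confidence intervals but does not say how the intervals were obtained. As a generic illustration only, the sketch below computes the empirical (Mann-Whitney) AUC and a percentile-bootstrap interval. The bootstrap is an assumption swapped in for illustration, not the authors' stated method, and the scores are made-up toy values.

```python
import random


def empirical_auc(pos_scores, neg_scores):
    """Fraction of (positive, negative) pairs ranked correctly; ties count 0.5."""
    wins = 0.0
    for p in pos_scores:
        for n in neg_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos_scores) * len(neg_scores))


def bootstrap_auc_ci(pos_scores, neg_scores, n_boot=1000, level=0.95, seed=0):
    """Percentile-bootstrap CI, resampling positives and negatives separately."""
    rng = random.Random(seed)
    aucs = []
    for _ in range(n_boot):
        pos = rng.choices(pos_scores, k=len(pos_scores))
        neg = rng.choices(neg_scores, k=len(neg_scores))
        aucs.append(empirical_auc(pos, neg))
    aucs.sort()
    alpha = (1.0 - level) / 2.0
    lo = aucs[int(alpha * (n_boot - 1))]
    hi = aucs[int((1.0 - alpha) * (n_boot - 1))]
    return lo, hi


# Toy model outputs for COVID-positive and COVID-negative cases (hypothetical).
pos = [0.9, 0.8, 0.7, 0.6, 0.55, 0.4]
neg = [0.65, 0.5, 0.45, 0.3, 0.2, 0.1]
auc = empirical_auc(pos, neg)
lo, hi = bootstrap_auc_ci(pos, neg)
print(f"AUC = {auc:.3f} [{lo:.3f}, {hi:.3f}]")
```

Resampling the two classes separately keeps the class balance fixed in every replicate, which is a common choice when prevalence is set by study design.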

Granstedt JL, Zhou W, Anastasio MA. Approximating the Hotelling observer with autoencoder-learned efficient channels for binary signal detection tasks. J Med Imaging (Bellingham) 2023;10:055501. [PMID: 37767114 PMCID: PMC10520791 DOI: 10.1117/1.jmi.10.5.055501] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/19/2023] [Revised: 08/31/2023] [Accepted: 09/05/2023] [Indexed: 09/29/2023]

Abstract

Purpose

The objective assessment of image quality (IQ) has been advocated for the analysis and optimization of medical imaging systems. One method of computing such IQ metrics is through a numerical observer. The Hotelling observer (HO) is the optimal linear observer, but conventional methods for obtaining the HO can become intractable due to large image sizes or insufficient data. Channelized methods are sometimes employed in such circumstances to approximate the HO. The performance of such channelized methods varies, with different methods obtaining superior performance to others depending on the imaging conditions and detection task. A channelized HO method using an autoencoder (AE) is presented and implemented across several tasks to characterize its performance.

Approach

The process for training an AE is demonstrated to be equivalent to developing a set of channels for approximating the HO. The efficiency of the learned AE-channels is increased by modifying the conventional AE loss function to incorporate task-relevant information. Multiple binary detection tasks involving lumpy and breast phantom backgrounds across varying dataset sizes are considered to evaluate the performance of the proposed method and compare to current state-of-the-art channelized methods. Additionally, the ability of the channelized methods to generalize to images outside of the training dataset is investigated.

Results

AE-learned channels are demonstrated to have comparable performance with other state-of-the-art channel methods in the detection studies and superior performance in the generalization studies. Incorporating a cleaner estimate of the signal for the detection task is also demonstrated to significantly improve the performance of the proposed method, particularly in datasets with fewer images.

Conclusions

AEs are demonstrated to be capable of learning efficient channels for the HO. The resulting significant increase in detection performance for small dataset sizes when incorporating a signal prior holds promising implications for future assessments of imaging technologies.

Ji Y, Whitney HM, Li H, Liu P, Giger ML, Zhang X. Differences in Molecular Subtype Reference Standards Impact AI-based Breast Cancer Classification with Dynamic Contrast-enhanced MRI. Radiology 2023;307:e220984. [PMID: 36594836 PMCID: PMC10068887 DOI: 10.1148/radiol.220984] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Received: 05/02/2022] [Revised: 10/20/2022] [Accepted: 11/01/2022] [Indexed: 01/04/2023]

Abstract

Background

Breast cancer tumors can be identified as different luminal molecular subtypes depending on either immunohistochemical (IHC) staining or St Gallen criteria that include Ki-67.

Purpose

To characterize molecular subtypes and understand the impact of disagreement among IHC and St Gallen molecular subtype reference standards on artificial intelligence classification of luminal A and luminal B tumors with use of radiomic features extracted from dynamic contrast-enhanced (DCE) MRI scans.

Materials and Methods

In this retrospective study, 28 radiomic features previously extracted from DCE-MRI scans of breast tumors imaged between February 2015 and October 2017 were examined in the following groups: (a) tumors classified as luminal A by both reference standards ("agreement"), (b) tumors classified as luminal A by IHC and luminal B by St Gallen ("disagreement"), and (c) tumors classified as luminal B by both ("agreement"). Luminal A or luminal B tumor classification with use of radiomic features was conducted with use of three sets: (a) IHC molecular subtyping, (b) St Gallen molecular subtyping, and (c) agreement tumors. The Kruskal-Wallis test was followed by the Mann-Whitney U test to determine pair-wise differences of radiomic features among agreement and disagreement tumors. Fivefold cross-validation with use of stepwise feature selection and linear discriminant analysis classified tumors in each set, with performance measured with use of area under the receiver operating characteristic curve (AUC).

Results

A total of 877 breast cancer tumors from 872 women (mean age, 48 years [range, 19-75 years]) were analyzed. Six features (sphericity, irregularity, surface area to volume ratio, variance of radial gradient histogram, sum average, volume of most enhancing voxels) were different (P ≤ .001) among agreement and disagreement tumors. For agreement tumors, AUC (median, 0.74 [95% CI: 0.68, 0.80]) was higher than when using tumors subtyped by either reference standard (IHC, 0.66 [0.60, 0.71], P = .003; St Gallen, 0.62 [0.58, 0.67], P = .001).

Conclusion

Differences in reference standards can hinder artificial intelligence classification performance of luminal molecular subtypes with dynamic contrast-enhanced MRI.

© RSNA, 2023. Supplemental material is available for this article. See also the editorial by Bae in this issue.

Affiliation(s)

Yu Ji, Heather M. Whitney, Hui Li, Peifang Liu, Maryellen L. Giger, Xuening Zhang: From the Department of Radiology, The Second Hospital of Tianjin Medical University, No. 23 Pingjiang Rd, Hexi District, Tianjin, China 300211 (Y.J., X.Z.); National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin, China (Y.J., P.L.); Department of Radiology, The University of Chicago, Chicago, Ill (H.M.W., H.L., M.L.G.); and Department of Physics, Wheaton College, Wheaton, Ill (H.M.W.)

Al-Labadi L, Evans M, Liang Q. ROC Analyses Based on Measuring Evidence Using the Relative Belief Ratio. Entropy (Basel) 2022;24:1710. [PMID: 36554115 PMCID: PMC9777999 DOI: 10.3390/e24121710] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/26/2022] [Revised: 11/18/2022] [Accepted: 11/19/2022] [Indexed: 06/17/2023]

Martínez-Camblor P. The fundamental role of density functions in the binary classification problem. J Stat Comput Simul 2022. [DOI: 10.1080/00949655.2022.2051026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/18/2022]

Ghebremichael M, Michael H. Comparison of the binormal and Lehmann receiver operating characteristic curves. Commun Stat Simul Comput 2022. [DOI: 10.1080/03610918.2022.2032159] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/03/2022]

Whitney HM, Li H, Ji Y, Liu P, Giger ML. Multi-Stage Harmonization for Robust AI across Breast MR Databases. Cancers (Basel) 2021;13:4809. [PMID: 34638294 PMCID: PMC8508003 DOI: 10.3390/cancers13194809] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 09/02/2021] [Revised: 09/16/2021] [Accepted: 09/18/2021] [Indexed: 12/22/2022]

Abstract

Simple Summary

Batch harmonization of radiomic features extracted from magnetic resonance images of breast lesions from two databases was applied to an artificial intelligence/machine learning classification workflow. Training and independent test sets from the two databases, as well as the combination of them, were used in pre-harmonization and post-harmonization forms to investigate the generalizability of performance in the task of distinguishing between malignant and benign lesions. Most training and independent test scenarios were statistically equivalent, demonstrating that batch harmonization with feature selection harmonization can potentially develop generalizable classification models.

Abstract

Radiomic features extracted from medical images may demonstrate a batch effect when cases come from different sources. We investigated classification performance using training and independent test sets drawn from two sources using both pre-harmonization and post-harmonization features. In this retrospective study, a database of thirty-two radiomic features, extracted from DCE-MR images of breast lesions after fuzzy c-means segmentation, was collected. There were 944 unique lesions in Database A (208 benign lesions, 736 cancers) and 1986 unique lesions in Database B (481 benign lesions, 1505 cancers). The lesions from each database were divided by year of image acquisition into training and independent test sets, separately by database and in combination. ComBat batch harmonization was conducted on the combined training set to minimize the batch effect on eligible features by database. The empirical Bayes estimates from the feature harmonization were applied to the eligible features of the combined independent test set. The training sets (A, B, and combined) were then used in training linear discriminant analysis classifiers after stepwise feature selection. The classifiers were then run on the A, B, and combined independent test sets. Classification performance was compared using pre-harmonization features to post-harmonization features, including their corresponding feature selection, evaluated using the area under the receiver operating characteristic curve (AUC) as the figure of merit. Four out of five training and independent test scenarios demonstrated statistically equivalent classification performance when compared pre- and post-harmonization. These results demonstrate that translation of machine learning techniques with batch data harmonization can potentially yield generalizable models that maintain classification performance.

Hu Q, Whitney HM, Li H, Ji Y, Liu P, Giger ML. Improved Classification of Benign and Malignant Breast Lesions Using Deep Feature Maximum Intensity Projection MRI in Breast Cancer Diagnosis Using Dynamic Contrast-enhanced MRI. Radiol Artif Intell 2021;3:e200159. [PMID: 34235439 PMCID: PMC8231792 DOI: 10.1148/ryai.2021200159] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Received: 07/02/2020] [Revised: 02/04/2021] [Accepted: 02/09/2021] [Indexed: 04/16/2023]

Martínez-Camblor P, Pérez-Fernández S, Díaz-Coto S. The area under the generalized receiver-operating characteristic curve. Int J Biostat 2021;18:293-306. [PMID: 33761578 DOI: 10.1515/ijb-2020-0091] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 06/13/2020] [Accepted: 03/01/2021] [Indexed: 12/22/2022]

Hu Q, Drukker K, Giger ML. Role of standard and soft tissue chest radiography images in deep-learning-based early diagnosis of COVID-19. J Med Imaging (Bellingham) 2021;8:014503. [PMID: 34595245 PMCID: PMC8478672 DOI: 10.1117/1.jmi.8.s1.014503] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 05/26/2021] [Accepted: 09/13/2021] [Indexed: 12/24/2022]

Fuhrman JD, Chen J, Dong Z, Lure FYM, Luo Z, Giger ML. Cascaded deep transfer learning on thoracic CT in COVID-19 patients treated with steroids. J Med Imaging (Bellingham) 2021;8:014501. [PMID: 33415179 PMCID: PMC7773028 DOI: 10.1117/1.jmi.8.s1.014501] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 06/09/2020] [Accepted: 11/04/2020] [Indexed: 12/15/2022]

Jiang Y. Receiver Operating Characteristic (ROC) Analysis of Image Search-and-Localize Tasks. Acad Radiol 2020;27:1742-1750. [PMID: 32033862 DOI: 10.1016/j.acra.2019.12.020] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 11/20/2019] [Revised: 12/18/2019] [Accepted: 12/20/2019] [Indexed: 10/25/2022]

Zhou W, Li H, Anastasio MA. Approximating the Ideal Observer for Joint Signal Detection and Localization Tasks by use of Supervised Learning Methods. IEEE Trans Med Imaging 2020;39:3992-4000. [PMID: 32746143 PMCID: PMC7768793 DOI: 10.1109/tmi.2020.3009022] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 06/11/2023]

Abstract

Medical imaging systems are commonly assessed and optimized by use of objective measures of image quality (IQ). The Ideal Observer (IO) performance has been advocated to provide a figure-of-merit for use in assessing and optimizing imaging systems because the IO sets an upper performance limit among all observers. When joint signal detection and localization tasks are considered, the IO that employs a modified generalized likelihood ratio test maximizes observer performance as characterized by the localization receiver operating characteristic (LROC) curve. Computations of likelihood ratios are analytically intractable in the majority of cases. Therefore, sampling-based methods that employ Markov-Chain Monte Carlo (MCMC) techniques have been developed to approximate the likelihood ratios. However, the applications of MCMC methods have been limited to relatively simple object models. Supervised learning-based methods that employ convolutional neural networks have been recently developed to approximate the IO for binary signal detection tasks. In this paper, the ability of supervised learning-based methods to approximate the IO for joint signal detection and localization tasks is explored. Both background-known-exactly and background-known-statistically signal detection and localization tasks are considered. The considered object models include a lumpy object model and a clustered lumpy model, and the considered measurement noise models include Laplacian noise, Gaussian noise, and mixed Poisson-Gaussian noise. The LROC curves produced by the supervised learning-based method are compared to those produced by the MCMC approach or analytical computation when feasible. The potential utility of the proposed method for computing objective measures of IQ for optimizing imaging system performance is explored.

Jiang Y, Edwards AV, Newstead GM. Artificial Intelligence Applied to Breast MRI for Improved Diagnosis. Radiology 2020;298:38-46. [PMID: 33078996 DOI: 10.1148/radiol.2020200292] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Indexed: 01/19/2023]

Abstract

Background

Recognition of salient MRI morphologic and kinetic features of various malignant tumor subtypes and benign diseases, either visually or with artificial intelligence (AI), allows radiologists to improve diagnoses that may improve patient treatment.

Purpose

To evaluate whether the diagnostic performance of radiologists in the differentiation of cancer from noncancer at dynamic contrast material-enhanced (DCE) breast MRI is improved when using an AI system compared with conventionally available software.

Materials and Methods

In a retrospective clinical reader study, images from breast DCE MRI examinations were interpreted by 19 breast imaging radiologists from eight academic and 11 private practices. Readers interpreted each examination twice. In the "first read," they were provided with conventionally available computer-aided evaluation software, including kinetic maps. In the "second read," they were also provided with AI analytics through computer-aided diagnosis software. Reader diagnostic performance was evaluated with receiver operating characteristic (ROC) analysis, with the area under the ROC curve (AUC) as a figure of merit in the task of distinguishing between malignant and benign lesions. The primary study end point was the difference in AUC between the first-read and the second-read conditions.

Results

One hundred eleven women (mean age, 52 years ± 13 [standard deviation]) were evaluated with a total of 111 breast DCE MRI examinations (54 malignant and 57 nonmalignant lesions). The average AUC of all readers improved from 0.71 to 0.76 (P = .04) when using the AI system. The average sensitivity improved when Breast Imaging Reporting and Data System (BI-RADS) category 3 was used as the cut point (from 90% to 94%; 95% confidence interval [CI] for the change: 0.8%, 7.4%) but not when using BI-RADS category 4a (from 80% to 85%; 95% CI: -0.9%, 11%). The average specificity showed no difference when using either BI-RADS category 4a or category 3 as the cut point (52% and 52% [95% CI: -7.3%, 6.0%], and from 29% to 28% [95% CI: -6.4%, 4.3%], respectively).

Conclusion

Use of an artificial intelligence system improves radiologists' performance in the task of differentiating benign and malignant MRI breast lesions.

© RSNA, 2020. Online supplemental material is available for this article. See also the editorial by Krupinski in this issue.

Yousef WA. Prudence when assuming normality: An advice for machine learning practitioners. Pattern Recognit Lett 2020. [DOI: 10.1016/j.patrec.2020.06.026] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 11/16/2022]

Hu Q, Whitney HM, Giger ML. Radiomics methodology for breast cancer diagnosis using multiparametric magnetic resonance imaging. J Med Imaging (Bellingham) 2020;7:044502. [PMID: 32864390 PMCID: PMC7444714 DOI: 10.1117/1.jmi.7.4.044502] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Received: 04/10/2020] [Accepted: 07/29/2020] [Indexed: 12/30/2022]

A deep learning methodology for improved breast cancer diagnosis using multiparametric MRI. Sci Rep 2020;10:10536. [PMID: 32601367 PMCID: PMC7324398 DOI: 10.1038/s41598-020-67441-4] [Citation(s) in RCA: 52] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2019] [Accepted: 06/05/2020] [Indexed: 12/21/2022] Open

Blangero Y, Rabilloud M, Laurent-Puig P, Le Malicot K, Lepage C, Ecochard R, Taieb J, Subtil F. The area between ROC curves, a non-parametric method to evaluate a biomarker for patient treatment selection. Biom J 2020;62:1476-1493. [PMID: 32346912 DOI: 10.1002/bimj.201900171] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2019] [Revised: 09/26/2019] [Accepted: 01/10/2020] [Indexed: 12/19/2022]

Jang EJ, Nandram B, Ko Y, Kim DH. Small area estimation of receiver operating characteristic curves for ordinal data under stochastic ordering. Stat Med 2020;39:1514-1528. [PMID: 32017182 DOI: 10.1002/sim.8493] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2018] [Revised: 08/31/2019] [Accepted: 11/21/2019] [Indexed: 11/11/2022]

McKinney SM, Sieniek M, Godbole V, Godwin J, Antropova N, Ashrafian H, Back T, Chesus M, Corrado GS, Darzi A, Etemadi M, Garcia-Vicente F, Gilbert FJ, Halling-Brown M, Hassabis D, Jansen S, Karthikesalingam A, Kelly CJ, King D, Ledsam JR, Melnick D, Mostofi H, Peng L, Reicher JJ, Romera-Paredes B, Sidebottom R, Suleyman M, Tse D, Young KC, De Fauw J, Shetty S. International evaluation of an AI system for breast cancer screening. Nature 2020;577:89-94. [PMID: 31894144 DOI: 10.1038/s41586-019-1799-6] [Citation(s) in RCA: 1000] [Impact Index Per Article: 250.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2019] [Accepted: 11/05/2019] [Indexed: 02/07/2023]

Affiliation(s)

Scott Mayer McKinney Google Health, Palo Alto, CA, USA
Marcin Sieniek Google Health, Palo Alto, CA, USA
Varun Godbole Google Health, Palo Alto, CA, USA
Jonathan Godwin DeepMind, London, UK
Natasha Antropova DeepMind, London, UK
Hutan Ashrafian Department of Surgery and Cancer, Imperial College London, London, UK; Institute of Global Health Innovation, Imperial College London, London, UK
Trevor Back DeepMind, London, UK
Mary Chesus DeepMind, London, UK
Greg S Corrado Google Health, Palo Alto, CA, USA
Ara Darzi Department of Surgery and Cancer, Imperial College London, London, UK; Institute of Global Health Innovation, Imperial College London, London, UK; Cancer Research UK Imperial Centre, Imperial College London, London, UK
Mozziyar Etemadi Northwestern Medicine, Chicago, IL, USA
Florencia Garcia-Vicente Northwestern Medicine, Chicago, IL, USA
Fiona J Gilbert Department of Radiology, Cambridge Biomedical Research Centre, University of Cambridge, Cambridge, UK
Mark Halling-Brown Royal Surrey County Hospital, Guildford, UK
Demis Hassabis DeepMind, London, UK
Sunny Jansen Verily Life Sciences, South San Francisco, CA, USA
Alan Karthikesalingam Google Health, London, UK
Christopher J Kelly Google Health, London, UK
Dominic King Google Health, London, UK
Joseph R Ledsam DeepMind, London, UK
David Melnick Northwestern Medicine, Chicago, IL, USA
Hormuz Mostofi Google Health, Palo Alto, CA, USA
Lily Peng Google Health, Palo Alto, CA, USA
Joshua Jay Reicher Stanford Health Care and Palo Alto Veterans Affairs, Palo Alto, CA, USA
Bernardino Romera-Paredes DeepMind, London, UK
Richard Sidebottom The Royal Marsden Hospital, London, UK; Thirlestaine Breast Centre, Cheltenham, UK
Mustafa Suleyman DeepMind, London, UK
Daniel Tse Google Health, Palo Alto, CA, USA
Kenneth C Young Royal Surrey County Hospital, Guildford, UK
Jeffrey De Fauw DeepMind, London, UK
Shravya Shetty Google Health, Palo Alto, CA, USA


Whitney HM, Li H, Ji Y, Liu P, Giger ML. Harmonization of radiomic features of breast lesions across international DCE-MRI datasets. J Med Imaging (Bellingham) 2020;7:012707. [PMID: 32206682 PMCID: PMC7056633 DOI: 10.1117/1.jmi.7.1.012707] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2019] [Accepted: 02/24/2020] [Indexed: 12/12/2022] Open

Zhou W, Li H, Anastasio MA. Approximating the Ideal Observer and Hotelling Observer for Binary Signal Detection Tasks by Use of Supervised Learning Methods. IEEE Trans Med Imaging 2019;38:2456-2468. [PMID: 30990425 PMCID: PMC6858982 DOI: 10.1109/tmi.2019.2911211] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]

Abstract

It is widely accepted that the optimization of medical imaging system performance should be guided by task-based measures of image quality (IQ). Task-based measures of IQ quantify the ability of an observer to perform a specific task, such as detection or estimation of a signal (e.g., a tumor). For binary signal detection tasks, the Bayesian Ideal Observer (IO) sets an upper limit of observer performance and has been advocated for use in optimizing medical imaging systems and data-acquisition designs. Except in special cases, the determination of the IO test statistic is analytically intractable. Markov-chain Monte Carlo (MCMC) techniques can be employed to approximate the IO detection performance, but their reported applications have been limited to relatively simple object models. In cases where the IO test statistic is difficult to compute, the Hotelling Observer (HO) can be employed. To compute the HO test statistic, potentially large covariance matrices must be accurately estimated and subsequently inverted, which can present computational challenges. This paper investigates the supervised learning-based methodologies for approximating the IO and HO test statistics. Convolutional neural networks (CNNs) and single-layer neural networks (SLNNs) are employed to approximate the IO and HO test statistics, respectively. The numerical simulations were conducted for both signal-known-exactly (SKE) and signal-known-statistically (SKS) signal detection tasks. The considered background models include the lumpy object model and the clustered lumpy object model. The measurement noise models considered are Gaussian, Laplacian, and mixed Poisson-Gaussian. The performances of the supervised learning methods are assessed via receiver operating characteristic (ROC) analysis, and the results are compared to those produced by the use of traditional numerical methods or analytical calculations when feasible. 
The potential advantages of the proposed supervised learning approaches for approximating the IO and HO test statistics are discussed.


Accuracy of the Vancouver Lung Cancer Risk Prediction Model Compared With That of Radiologists. Chest 2019;156:112-119. [DOI: 10.1016/j.chest.2019.04.002] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2018] [Revised: 02/20/2019] [Accepted: 04/02/2019] [Indexed: 12/17/2022] Open

Kim SH. Assessment of solid components of borderline ovarian tumor and stage I carcinoma: added value of combined diffusion- and perfusion-weighted magnetic resonance imaging. Yeungnam Univ J Med 2019;36:231-240. [PMID: 31620638 PMCID: PMC6784647 DOI: 10.12701/yujm.2019.00234] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2019] [Revised: 06/04/2019] [Accepted: 06/06/2019] [Indexed: 01/29/2023] Open

Bayesian hierarchical model for the estimation of proper receiver operating characteristic curves using stochastic ordering. Commun Stat Appl Methods 2019. [DOI: 10.29220/csam.2019.26.2.205] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Samala RK, Hadjiiski L, Helvie MA, Richter CD, Cha KH. Breast Cancer Diagnosis in Digital Breast Tomosynthesis: Effects of Training Sample Size on Multi-Stage Transfer Learning Using Deep Neural Nets. IEEE Trans Med Imaging 2019;38:686-696. [PMID: 31622238 PMCID: PMC6812655 DOI: 10.1109/tmi.2018.2870343] [Citation(s) in RCA: 86] [Impact Index Per Article: 17.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]

Charles Edgar Metz, Ph.D. (1942–2012): pioneer in receiver operating characteristic (ROC) analysis. Radiol Phys Technol 2019;12:1-5. [DOI: 10.1007/s12194-018-0483-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]

Binormal Precision–Recall Curves for Optimal Classification of Imbalanced Data. Stat Biosci 2019. [DOI: 10.1007/s12561-019-09231-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]

Hillis SL, Mohammad BA, Brennan PC. Estimating latent reader-performance variability using the Obuchowski-Rockette method. Proc SPIE Int Soc Opt Eng 2019;10952:10952F. [PMID: 32390679 PMCID: PMC7210714 DOI: 10.1117/12.2513106] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]

Kim ST, Lee JH, Lee H, Ro YM. Visually interpretable deep network for diagnosis of breast masses on mammograms. Phys Med Biol 2018;63:235025. [PMID: 30511660 DOI: 10.1088/1361-6560/aaef0a] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]

Radiologist performance in the detection of lung cancer using CT. Clin Radiol 2018;74:67-75. [PMID: 30470412 DOI: 10.1016/j.crad.2018.10.008] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2018] [Accepted: 10/16/2018] [Indexed: 12/17/2022]

Shiraishi J, Fukuoka D, Iha R, Inada H, Tanaka R, Hara T. Verification of modified receiver-operating characteristic software using simulated rating data. Radiol Phys Technol 2018;11:406-414. [PMID: 30244314 DOI: 10.1007/s12194-018-0479-9] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2018] [Revised: 09/18/2018] [Accepted: 09/18/2018] [Indexed: 11/25/2022]

Hillis SL. Relationship between Roe and Metz simulation model for multireader diagnostic data and Obuchowski-Rockette model parameters. Stat Med 2018;37:2067-2093. [PMID: 29609206 PMCID: PMC5980727 DOI: 10.1002/sim.7616] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2016] [Revised: 10/08/2017] [Accepted: 01/02/2018] [Indexed: 11/06/2022]

Interpretation Time Using a Concurrent-Read Computer-Aided Detection System for Automated Breast Ultrasound in Breast Cancer Screening of Women With Dense Breast Tissue. AJR Am J Roentgenol 2018;211:452-461. [PMID: 29792747 DOI: 10.2214/ajr.18.19516] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

Abstract

OBJECTIVE

The purpose of this study was to compare diagnostic accuracy and interpretation time of screening automated breast ultrasound (ABUS) for women with dense breast tissue without and with use of a recently U.S. Food and Drug Administration-approved computer-aided detection (CAD) system for concurrent read.

MATERIALS AND METHODS

In a retrospective observer performance study, 18 radiologists interpreted a cancer-enriched set (i.e., cancer prevalence higher than in the original screening cohort) of 185 screening ABUS studies (52 with and 133 without breast cancer). These studies were from a large cohort of ABUS-screened patients interpreted as BI-RADS density C or D. Each reader interpreted each case twice in a counterbalanced study, once without the CAD system and once with it, separated by 4 weeks. For each case, each reader identified abnormal findings and reported BI-RADS assessment category and level of suspicion for breast cancer. Interpretation time was recorded. Level of suspicion data were compared to evaluate diagnostic accuracy by means of the Dorfman-Berbaum-Metz method of jackknife with ANOVA ROC analysis. Interpretation times were compared by ANOVA.

RESULTS

The ROC AUC was 0.848 with the CAD system, compared with 0.828 without it, for a difference of 0.020 (95% CI, -0.011 to 0.051) and was statistically noninferior to the AUC without the CAD system with respect to a margin of -0.05 (p = 0.000086). The mean interpretation time was 3 minutes 33 seconds per case without the CAD system and 2 minutes 24 seconds with it, for a difference of 1 minute 9 seconds saved (95% CI, 44-93 seconds; p = 0.000014), or a reduction in interpretation time to 67% of the time without the CAD system.

CONCLUSION

Use of the concurrent-read CAD system for interpretation of screening ABUS studies of women with dense breast tissue who do not have symptoms is expected to make interpretation significantly faster and produce noninferior diagnostic accuracy compared with interpretation without the CAD system.
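
The noninferiority decision and the time savings reported above can be checked with a few lines of arithmetic. The sketch below uses only the figures quoted in the abstract; the variable names are illustrative, and the decision rule shown (lower confidence bound above the margin) is the usual reading of a noninferiority test, assumed here rather than taken from the paper's methods.

```python
# Figures quoted in the RESULTS paragraph above.
auc_with_cad = 0.848
auc_without_cad = 0.828
ci_low, ci_high = -0.011, 0.051   # 95% CI for the AUC difference
margin = -0.05                    # noninferiority margin

# Difference in AUC (with CAD minus without CAD).
diff = round(auc_with_cad - auc_without_cad, 3)

# Assumed decision rule: noninferior if the lower CI bound exceeds the margin.
noninferior = ci_low > margin

# Interpretation times, converted to seconds.
t_without = 3 * 60 + 33   # 3 min 33 s
t_with = 2 * 60 + 24      # 2 min 24 s
saved_seconds = t_without - t_with
time_ratio = t_with / t_without

print(diff, noninferior, saved_seconds, round(time_ratio, 3))
```

The ratio computed from the whole-second times is about 0.676; the abstract reports 67%, which is presumably based on unrounded times.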


Chen W, Sahiner B, Samuelson F, Pezeshk A, Petrick N. Calibration of medical diagnostic classifier scores to the probability of disease. Stat Methods Med Res 2018;27:1394-1409. [PMID: 27507287 PMCID: PMC5548655 DOI: 10.1177/0962280216661371] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Antropova N, Abe H, Giger ML. Use of clinical MRI maximum intensity projections for improved breast lesion classification with deep convolutional neural networks. J Med Imaging (Bellingham) 2018;5:014503. [PMID: 29430478 PMCID: PMC5798576 DOI: 10.1117/1.jmi.5.1.014503] [Citation(s) in RCA: 44] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2017] [Accepted: 01/11/2018] [Indexed: 12/26/2022] Open

Samala RK, Chan HP, Hadjiiski LM, Helvie MA, Cha KH, Richter CD. Multi-task transfer learning deep convolutional neural network: application to computer-aided diagnosis of breast cancer on mammograms. Phys Med Biol 2017;62:8894-8908. [PMID: 29035873 PMCID: PMC5859950 DOI: 10.1088/1361-6560/aa93d4] [Citation(s) in RCA: 97] [Impact Index Per Article: 13.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]

Effectiveness of Bone Suppression Imaging in the Detection of Lung Nodules on Chest Radiographs. J Thorac Imaging 2017;32:398-405. [DOI: 10.1097/rti.0000000000000299] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

Antropova N, Huynh BQ, Giger ML. A deep feature fusion methodology for breast cancer diagnosis demonstrated on three imaging modality datasets. Med Phys 2017;44:5162-5171. [PMID: 28681390 DOI: 10.1002/mp.12453] [Citation(s) in RCA: 207] [Impact Index Per Article: 29.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2017] [Revised: 06/12/2017] [Accepted: 06/25/2017] [Indexed: 12/13/2022] Open

Leng S, Takahashi N, Gomez Cardona D, Kitajima K, McCollough B, Li Z, Kawashima A, Leibovich BC, McCollough CH. Subjective and objective heterogeneity scores for differentiating small renal masses using contrast-enhanced CT. Abdom Radiol (NY) 2017;42:1485-1492. [PMID: 28025654 DOI: 10.1007/s00261-016-1014-2] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]

Abstract

PURPOSE

The aim of this study was to assess the effect of denoising on objective heterogeneity scores and its diagnostic capability for the diagnosis of angiomyolipoma (AML) and renal cell carcinoma (RCC).

MATERIALS AND METHODS

A total of 158 resected renal masses ≤4 cm [98 clear cell (cc) RCCs, 36 papillary (pap)-RCCs, and 24 AMLs] from 139 patients were evaluated. A representative contrast-enhanced computed tomography (CT) image for each mass was selected by a genitourinary radiologist. The largest possible region of interest was drawn on each mass by the radiologist, from which three objective heterogeneity indices were calculated: standard deviation (SD), entropy (Ent), and uniformity (Uni). Objective heterogeneity indices were also calculated after images were processed with a denoising algorithm (non-local means) at three strengths: weak, medium, and strong. Two genitourinary radiologists also subjectively scored each mass independently using a three-point scale (1-3; with 1 the least and 3 the most heterogeneous), and the two scores were added to give the final subjective heterogeneity score of each mass. Heterogeneity scores were compared among mass types, and area under the ROC curve (AUC) was calculated.

RESULTS

For all heterogeneity indices, cc-RCC was significantly more heterogeneous than pap-RCC and AML (p < 0.001), but no significant difference was found between pap-RCC and AML (p > 0.01). For cc-RCC and pap-RCC differentiation, AUCs were 0.91, 0.81, 0.78, and 0.78 for the subjective score, SD, Ent, and Uni, respectively, using original images. The corresponding AUC values were 0.84, 0.74, 0.79, and 0.80 for differentiation of AML and cc-RCC. Noise reduction at weak setting improves AUC values by 0.03, 0.05, and 0.05 for SD, entropy, and uniformity for differentiation of cc-RCC from pap-RCC. Further increase of filtering strength did not improve AUC values. For differentiation of AML vs. cc-RCC, the AUC values stayed relatively flat using the noise reduction technique at different strengths for all three indices.

CONCLUSIONS

Both subjective and objective heterogeneity indices can differentiate cc-RCC from pap-RCC and AML. Noise reduction improved differentiation of cc-RCC from pap-RCC, but not differentiation of AML from cc-RCC.
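
As a rough illustration of the three objective indices named above, the sketch below computes SD, entropy, and uniformity from a list of ROI pixel values. It assumes the common first-order histogram definitions (entropy as the negative sum of p log2 p, uniformity as the sum of p squared); the abstract does not state the binning or the exact formulas used, so `bin_width` and the function name are hypothetical.

```python
import math
from collections import Counter

def heterogeneity_indices(pixels, bin_width=1):
    """Return (SD, entropy, uniformity) for a list of ROI pixel values.

    Assumed definitions (not confirmed by the abstract):
      SD         = sample standard deviation of the pixel values
      entropy    = -sum(p * log2(p)) over histogram bins
      uniformity = sum(p ** 2) over histogram bins
    """
    n = len(pixels)
    mean = sum(pixels) / n
    sd = math.sqrt(sum((v - mean) ** 2 for v in pixels) / (n - 1))

    # Histogram of pixel values with a fixed (hypothetical) bin width.
    counts = Counter(int(v // bin_width) for v in pixels)
    probs = [c / n for c in counts.values()]

    entropy = -sum(p * math.log2(p) for p in probs)
    uniformity = sum(p * p for p in probs)
    return sd, entropy, uniformity

# A perfectly homogeneous ROI versus one split evenly between two values.
print(heterogeneity_indices([50] * 10))
print(heterogeneity_indices([0] * 5 + [10] * 5))
```

Under these definitions a more heterogeneous ROI has a higher SD and entropy and a lower uniformity.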


Zhai X, Chakraborty DP. A bivariate contaminated binormal model for robust fitting of proper ROC curves to a pair of correlated, possibly degenerate, ROC datasets. Med Phys 2017;44:2207-2222. [PMID: 28382718 DOI: 10.1002/mp.12263] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2016] [Revised: 03/13/2017] [Accepted: 03/27/2017] [Indexed: 11/06/2022] Open

Abstract

PURPOSE

The objective was to design and implement a bivariate extension to the contaminated binormal model (CBM) to fit paired receiver operating characteristic (ROC) datasets, possibly degenerate, with proper ROC curves. Paired datasets yield two correlated ratings per case. Degenerate datasets have no interior operating points, and proper ROC curves do not inappropriately cross the chance diagonal. The existing method, developed more than three decades ago, utilizes a bivariate extension to the binormal model, implemented in CORROC2 software, which yields improper ROC curves and cannot fit degenerate datasets. CBM can fit proper ROC curves to unpaired (i.e., yielding one rating per case) and degenerate datasets, and there is a clear scientific need to extend it to handle paired datasets.

METHODS

In CBM, nondiseased cases are modeled by a probability density function (pdf) consisting of a unit-variance peak centered at zero. Diseased cases are modeled with a mixture distribution whose pdf consists of two unit-variance peaks, one centered at positive μ with integrated probability α, the mixing fraction parameter, corresponding to the fraction of diseased cases where the disease was visible to the radiologist, and one centered at zero, with integrated probability (1-α), corresponding to disease that was not visible. It is shown that: (a) for nondiseased cases the bivariate extension is a unit-variance bivariate normal distribution centered at (0,0) with a specified correlation ρ₁; (b) for diseased cases the bivariate extension is a mixture distribution with four peaks, corresponding to disease not visible in either condition, disease visible in only one condition, contributing two peaks, and disease visible in both conditions. An expression for the likelihood function is derived. A maximum likelihood estimation (MLE) algorithm, CORCBM, was implemented in the R programming language that yields parameter estimates and the covariance matrix of the parameters, and other statistics. A limited simulation validation of the method was performed.

RESULTS

CORCBM and CORROC2 were applied to two datasets containing nine readers each contributing paired interpretations. CORCBM successfully fitted the data for all readers, whereas CORROC2 failed to fit a degenerate dataset. All fits were visually reasonable. All CORCBM fits were proper, whereas all CORROC2 fits were improper. CORCBM and CORROC2 were in agreement (a) in declaring only one of the nine readers as having significantly different performances in the two modalities; (b) in estimating higher correlations for diseased cases than for nondiseased ones; and (c) in finding that the intermodality correlation estimates for nondiseased cases were consistent between the two methods. All CORCBM fits yielded higher area under curve (AUC) than the CORROC2 fits, consistent with the fact that a proper ROC model like CORCBM is based on a likelihood-ratio-equivalent decision variable, and consequently yields higher performance than the binormal model-based CORROC2. The method gave satisfactory fits to four simulated datasets.

CONCLUSIONS

CORCBM is a robust method for fitting paired ROC datasets, always yielding proper ROC curves, and able to fit degenerate datasets.
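
The univariate CBM described in the METHODS paragraph implies a simple closed form for the ROC curve, sketched below in Python. This is a minimal illustration of the single-reader model only, not of CORCBM or its likelihood. The function names are made up for this example, and the AUC expression is derived from the stated densities (a visible-disease peak at μ with weight α, an invisible-disease peak at zero with weight 1-α) rather than quoted from the paper.

```python
import math

def phi(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def cbm_operating_point(zeta, mu, alpha):
    """(FPF, TPF) at decision threshold zeta under the univariate CBM.

    Nondiseased ratings ~ N(0, 1).
    Diseased ratings ~ alpha * N(mu, 1) + (1 - alpha) * N(0, 1).
    """
    fpf = phi(-zeta)
    tpf = (1.0 - alpha) * phi(-zeta) + alpha * phi(mu - zeta)
    return fpf, tpf

def cbm_auc(mu, alpha):
    """Area under the CBM ROC curve.

    Invisible disease contributes chance performance (0.5); visible disease
    contributes the equal-variance binormal AUC, Phi(mu / sqrt(2)).
    """
    return 0.5 * (1.0 - alpha) + alpha * phi(mu / math.sqrt(2.0))

# Example with arbitrary illustrative parameters.
for zeta in (-1.0, 0.0, 1.0, 2.0):
    print(zeta, cbm_operating_point(zeta, mu=2.0, alpha=0.7))
print("AUC:", cbm_auc(mu=2.0, alpha=0.7))
```

Because Φ(μ - ζ) is never smaller than Φ(-ζ) when μ ≥ 0, TPF is never below FPF at any threshold, which is why the fitted curve cannot cross the chance diagonal.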


Yin J. Using the ROC Curve to Measure Association and Evaluate Prediction Accuracy for a Binary Outcome. Biom Biostat Int J 2017. [DOI: 10.15406/bbij.2017.05.00134] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

Cook JA. ROC curves and nonrandom data. Pattern Recognit Lett 2017. [DOI: 10.1016/j.patrec.2016.11.015] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]

Samala RK, Chan HP, Hadjiiski LM, Helvie MA. Analysis of computer-aided detection techniques and signal characteristics for clustered microcalcifications on digital mammography and digital breast tomosynthesis. Phys Med Biol 2016;61:7092-7112. [PMID: 27648708 DOI: 10.1088/0031-9155/61/19/7092] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]

Li H, Zhu Y, Burnside ES, Huang E, Drukker K, Hoadley KA, Fan C, Conzen SD, Zuley M, Net JM, Sutton E, Whitman GJ, Morris E, Perou CM, Ji Y, Giger ML. Quantitative MRI radiomics in the prediction of molecular classifications of breast cancer subtypes in the TCGA/TCIA data set. NPJ Breast Cancer 2016;2. [PMID: 27853751 PMCID: PMC5108580 DOI: 10.1038/npjbcancer.2016.12] [Citation(s) in RCA: 230] [Impact Index Per Article: 28.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open

Abstract

Using quantitative radiomics, we demonstrate that computer-extracted magnetic resonance (MR) image-based tumor phenotypes can be predictive of the molecular classification of invasive breast cancers. Radiomics analysis was performed on 91 MRIs of biopsy-proven invasive breast cancers from National Cancer Institute’s multi-institutional TCGA/TCIA. Immunohistochemistry molecular classification was performed including estrogen receptor, progesterone receptor, human epidermal growth factor receptor 2, and for 84 cases, the molecular subtype (normal-like, luminal A, luminal B, HER2-enriched, and basal-like). Computerized quantitative image analysis included: three-dimensional lesion segmentation, phenotype extraction, and leave-one-case-out cross validation involving stepwise feature selection and linear discriminant analysis. The performance of the classifier model for molecular subtyping was evaluated using receiver operating characteristic analysis. The computer-extracted tumor phenotypes were able to distinguish between molecular prognostic indicators; area under the ROC curve values of 0.89, 0.69, 0.65, and 0.67 in the tasks of distinguishing between ER+ versus ER−, PR+ versus PR−, HER2+ versus HER2−, and triple-negative versus others, respectively. Statistically significant associations between tumor phenotypes and receptor status were observed. More aggressive cancers are likely to be larger in size with more heterogeneity in their contrast enhancement. Even after controlling for tumor size, a statistically significant trend was observed within each size group (P=0.04 for lesions ⩽2 cm; P=0.02 for lesions >2 to ⩽5 cm) as with the entire data set (P-value=0.006) for the relationship between enhancement texture (entropy) and molecular subtypes (normal-like, luminal A, luminal B, HER2-enriched, basal-like). 
In conclusion, computer-extracted image phenotypes show promise for high-throughput discrimination of breast cancer subtypes and may yield a quantitative predictive signature for advancing precision medicine.
