1
|
Zeng R, Samuelson FW, Sharma D, Badal A, Christian GG, Glick SJ, Myers KJ, Badano A. Computational reader design and statistical performance evaluation of an in-silico imaging clinical trial comparing digital breast tomosynthesis with full-field digital mammography. J Med Imaging (Bellingham) 2020; 7:042802. [PMID: 32118094 PMCID: PMC7043285 DOI: 10.1117/1.jmi.7.4.042802] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2019] [Accepted: 01/07/2020] [Indexed: 01/15/2023] Open
Abstract
A recent study reported on an in-silico imaging trial that evaluated the performance of digital breast tomosynthesis (DBT) as a replacement for full-field digital mammography (FFDM) for breast cancer screening. In this in-silico trial, the whole imaging chain was simulated, including the breast phantom generation, the x-ray transport process, and computational readers for image interpretation. We focus on the design and performance characteristics of the computational reader in the above-mentioned trial. Location-known lesion (spiculated mass and clustered microcalcifications) detection tasks were used to evaluate the imaging system performance. The computational readers were designed based on the mechanism of a channelized Hotelling observer (CHO), and the reader models were selected to trend human performance. Parameters were tuned to ensure stable lesion detectability. A convolutional CHO that can adapt a round channel function to irregular lesion shapes was compared with the original CHO and was found to be suitable for detecting clustered microcalcifications but was less optimal in detecting spiculated masses. A three-dimensional CHO that operated on the multiple slices was compared with a two-dimensional (2-D) CHO that operated on three versions of 2-D slabs converted from the multiple slices and was found to be optimal in detecting lesions in DBT. Multireader multicase reader output analysis was used to analyze the performance difference between FFDM and DBT for various breast and lesion types. The results showed that DBT was more beneficial in detecting masses than detecting clustered microcalcifications compared with FFDM, consistent with the finding in a clinical imaging trial. Statistical uncertainty smaller than 0.01 standard error for the estimated performance differences was achieved with a dataset containing approximately 3000 breast phantoms. The computational reader design methodology presented provides evidence that model observers can be useful in-silico tools for supporting the performance comparison of breast imaging systems.
Collapse
Affiliation(s)
- Rongping Zeng
- Division of Imaging, Diagnostics and Software Reliability, Office of Science and Engineering Laboratories, CDRH, FDA, Silver Spring, Maryland, United States
| | - Frank W. Samuelson
- Division of Imaging, Diagnostics and Software Reliability, Office of Science and Engineering Laboratories, CDRH, FDA, Silver Spring, Maryland, United States
| | - Diksha Sharma
- Division of Imaging, Diagnostics and Software Reliability, Office of Science and Engineering Laboratories, CDRH, FDA, Silver Spring, Maryland, United States
| | - Andreu Badal
- Division of Imaging, Diagnostics and Software Reliability, Office of Science and Engineering Laboratories, CDRH, FDA, Silver Spring, Maryland, United States
| | - Graff G. Christian
- Division of Imaging, Diagnostics and Software Reliability, Office of Science and Engineering Laboratories, CDRH, FDA, Silver Spring, Maryland, United States
| | - Stephen J. Glick
- Division of Imaging, Diagnostics and Software Reliability, Office of Science and Engineering Laboratories, CDRH, FDA, Silver Spring, Maryland, United States
| | - Kyle J. Myers
- Division of Imaging, Diagnostics and Software Reliability, Office of Science and Engineering Laboratories, CDRH, FDA, Silver Spring, Maryland, United States
| | - Aldo Badano
- Division of Imaging, Diagnostics and Software Reliability, Office of Science and Engineering Laboratories, CDRH, FDA, Silver Spring, Maryland, United States
| |
Collapse
|
2
|
Different pixel pitch and maximum luminance of medical grade displays may result in different evaluations of digital radiography images. Radiol Med 2018; 123:586-592. [DOI: 10.1007/s11547-018-0891-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2018] [Accepted: 04/09/2018] [Indexed: 10/17/2022]
|
3
|
Bandos AI, Obuchowski NA. Evaluation of diagnostic accuracy in free-response detection-localization tasks using ROC tools. Stat Methods Med Res 2018; 28:1808-1825. [DOI: 10.1177/0962280218776683] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Diagnostic systems designed to detect possibly multiple lesions per patient (e.g. multiple polyps during CT colonoscopy) are often evaluated in “free-response” studies that allow for diagnostic responses unconstrained in their number and locations. Analysis of free-response studies requires extensions of the traditional receiver operating characteristic (ROC) analysis, which are termed free-response ROC (FROC) methodology. Despite substantial developments in this area, FROC tools and approaches are much more cumbersome than traditional ROC methods. Alternative approaches that use well-known ROC tools (e.g. ROI-ROC) require defining and physically delineating regions of interest (ROI) and combine FROC data within ROIs. We propose an approach that allows analyzing FROC data using conventional ROC tools without delineating the actual ROIs or reducing data. The design parameters of FROC study are used to make FROC data analyzable using ROC tools and to calibrate the corresponding FROC and ROC curves on both conceptual and numerical levels. Differences in the performance indices of the nonparametric FROC and the new approach are shown to be asymptotically negligible and typically rather small in practice. Data from a large multi-reader study of colon cancer detection are used to illustrate the new approach.
Collapse
Affiliation(s)
- Andriy I Bandos
- Department of Biostatistics, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, PA, USA
| | - Nancy A Obuchowski
- Quantitative Health Sciences, Cleveland Clinic Foundation, Cleveland, OH, USA
| |
Collapse
|
4
|
Ferranti C, Primolevo A, Cartia F, Cavatorta C, Ciniselli CM, Lualdi M, Meroni S, Pignoli E, Plebani M, Siciliano C, Verderio P, Scaperrotta G. How Does the Display Luminance Level Affect Detectability of Breast Microcalcifications and Spiculated Lesions in Digital Breast Tomosynthesis (DBT) Images? Acad Radiol 2017; 24:795-801. [PMID: 28189505 DOI: 10.1016/j.acra.2017.01.014] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2016] [Revised: 01/18/2017] [Accepted: 01/20/2017] [Indexed: 10/20/2022]
Abstract
RATIONALE AND OBJECTIVES This study evaluates the influence of the calibrated luminance level of medical displays in the detectability of microcalcifications and spiculated lesions in digital breast tomosynthesis images. MATERIALS AND METHODS Four models of medical displays with calibrated maximum and minimum luminance, respectively, ranging from 500 to 1000 cd/m2 and from 0.5 to 1.0 cd/m2, were investigated. Forty-eight studies were selected by a senior radiologist: 16 with microcalcifications, 16 with spiculated lesions, and 16 without lesions. All images were anonymized and blindly evaluated by one senior and two junior radiologists. For each study, lesion presence or absence and localization statements, interpretative difficulty level, and overall quality were reported. Cohen's kappa statistic was computed between monitors and within or between radiologists to estimate the reproducibility in correctly identifying lesions; for multireader-multicase analysis, the weighted jackknife alternative free-response receiver operating characteristic statistical tool was applied. RESULTS Intraradiologist reproducibility ranged from 0.75 to 1.00. Interreader as well as reader-truth agreement values were >0.80 and higher with the two 1000 cd/m2 luminance displays than with the lower luminance displays for each radiologist. Performances in the detectability of breast lesions were significantly greater with the 1000 cd/m2 luminance displays when compared to the display with the lowest luminance value (P value <0.001). CONCLUSIONS Our findings highlight the role of display luminance level on the accuracy of detecting breast lesions.
Collapse
|
5
|
Wen G, Markey MK, Park S. Model observer design for multi-signal detection in the presence of anatomical noise. Phys Med Biol 2017; 62:1396-1415. [DOI: 10.1088/1361-6560/aa51e9] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
6
|
Wunderlich A, Goossens B, Abbey CK. Optimal Joint Detection and Estimation That Maximizes ROC-Type Curves. IEEE TRANSACTIONS ON MEDICAL IMAGING 2016; 35:2164-73. [PMID: 27093544 PMCID: PMC5555688 DOI: 10.1109/tmi.2016.2553001] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/07/2023]
Abstract
Combined detection-estimation tasks are frequently encountered in medical imaging. Optimal methods for joint detection and estimation are of interest because they provide upper bounds on observer performance, and can potentially be utilized for imaging system optimization, evaluation of observer efficiency, and development of image formation algorithms. We present a unified Bayesian framework for decision rules that maximize receiver operating characteristic (ROC)-type summary curves, including ROC, localization ROC (LROC), estimation ROC (EROC), free-response ROC (FROC), alternative free-response ROC (AFROC), and exponentially-transformed FROC (EFROC) curves, succinctly summarizing previous results. The approach relies on an interpretation of ROC-type summary curves as plots of an expected utility versus an expected disutility (or penalty) for signal-present decisions. We propose a general utility structure that is flexible enough to encompass many ROC variants and yet sufficiently constrained to allow derivation of a linear expected utility equation that is similar to that for simple binary detection. We illustrate our theory with an example comparing decision strategies for joint detection-estimation of a known signal with unknown amplitude. In addition, building on insights from our utility framework, we propose new ROC-type summary curves and associated optimal decision rules for joint detection-estimation tasks with an unknown, potentially-multiple, number of signals in each observation.
Collapse
|
7
|
Abbey CK, Wu Y, Burnside ES, Wunderlich A, Samuelson FW, Boone JM. A Utility/Cost Analysis of Breast Cancer Risk Prediction Algorithms. PROCEEDINGS OF SPIE--THE INTERNATIONAL SOCIETY FOR OPTICAL ENGINEERING 2016; 9787:97871J. [PMID: 27335532 PMCID: PMC4913185 DOI: 10.1117/12.2217850] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Breast cancer risk prediction algorithms are used to identify subpopulations that are at increased risk for developing breast cancer. They can be based on many different sources of data such as demographics, relatives with cancer, gene expression, and various phenotypic features such as breast density. Women who are identified as high risk may undergo a more extensive (and expensive) screening process that includes MRI or ultrasound imaging in addition to the standard full-field digital mammography (FFDM) exam. Given that there are many ways that risk prediction may be accomplished, it is of interest to evaluate them in terms of expected cost, which includes the costs of diagnostic outcomes. In this work we perform an expected-cost analysis of risk prediction algorithms that is based on a published model that includes the costs associated with diagnostic outcomes (true-positive, false-positive, etc.). We assume the existence of a standard screening method and an enhanced screening method with higher scan cost, higher sensitivity, and lower specificity. We then assess expected cost of using a risk prediction algorithm to determine who gets the enhanced screening method under the strong assumption that risk and diagnostic performance are independent. We find that if risk prediction leads to a high enough positive predictive value, it will be cost-effective regardless of the size of the subpopulation. Furthermore, in terms of the hit-rate and false-alarm rate of the of the risk-prediction algorithm, iso-cost contours are lines with slope determined by properties of the available diagnostic systems for screening.
Collapse
Affiliation(s)
- Craig K Abbey
- Dept. of Psychological and Brain Sciences, UC Santa Barbara, Santa Barbara, CA. USA 93106
| | - Yirong Wu
- Department of Radiology, University of Wisconsin, Madison, WI
| | | | - Adam Wunderlich
- Division of Imaging and Applied Mathematics, OSEL, CDRH, U.S. Food and Drug Administration, Silver Spring, Maryland 20993, USA
| | - Frank W Samuelson
- Division of Imaging and Applied Mathematics, OSEL, CDRH, U.S. Food and Drug Administration, Silver Spring, Maryland 20993, USA
| | - John M Boone
- Dept of Radiology, UC Davis Medical Center, Sacramento CA. USA
| |
Collapse
|
8
|
Verdun F, Racine D, Ott J, Tapiovaara M, Toroi P, Bochud F, Veldkamp W, Schegerer A, Bouwman R, Giron IH, Marshall N, Edyvean S. Image quality in CT: From physical measurements to model observers. Phys Med 2015; 31:823-843. [DOI: 10.1016/j.ejmp.2015.08.007] [Citation(s) in RCA: 140] [Impact Index Per Article: 15.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/08/2015] [Revised: 08/04/2015] [Accepted: 08/23/2015] [Indexed: 12/18/2022] Open
|
9
|
Barrett HH, Myers KJ, Hoeschen C, Kupinski MA, Little MP. Task-based measures of image quality and their relation to radiation dose and patient risk. Phys Med Biol 2015; 60:R1-75. [PMID: 25564960 PMCID: PMC4318357 DOI: 10.1088/0031-9155/60/2/r1] [Citation(s) in RCA: 79] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
The theory of task-based assessment of image quality is reviewed in the context of imaging with ionizing radiation, and objective figures of merit (FOMs) for image quality are summarized. The variation of the FOMs with the task, the observer and especially with the mean number of photons recorded in the image is discussed. Then various standard methods for specifying radiation dose are reviewed and related to the mean number of photons in the image and hence to image quality. Current knowledge of the relation between local radiation dose and the risk of various adverse effects is summarized, and some graphical depictions of the tradeoffs between image quality and risk are introduced. Then various dose-reduction strategies are discussed in terms of their effect on task-based measures of image quality.
Collapse
Affiliation(s)
- Harrison H. Barrett
- College of Optical Sciences, University of Arizona, Tucson, AZ
- Center for Gamma-Ray Imaging, Department of Medical Imaging, University of Arizona, Tucson, AZ
| | - Kyle J. Myers
- Division of Imaging and Applied Mathematics, Office of Scientific and Engineering Laboratories, Center for Devices and Radiological Health, Food and Drug Administration, Silver Spring, MD
| | - Christoph Hoeschen
- Department of Electrical Engineering and Information Technology, Otto-von-Guericke University, Magdeburg, Germany
- Research unit Medical Radiation Physics and Diagnostics, Helmholtz Zentrum München, Oberschleissheim, Germany
| | - Matthew A. Kupinski
- College of Optical Sciences, University of Arizona, Tucson, AZ
- Center for Gamma-Ray Imaging, Department of Medical Imaging, University of Arizona, Tucson, AZ
| | - Mark P. Little
- Division of Cancer Epidemiology and Genetics, Radiation Epidemiology Branch, National Cancer Institute, Bethesda, MD
| |
Collapse
|