1
|
Ding W, Wang J, Zhou W, Zhou S, Chang C, Shi J. Joint Localization and Classification of Breast Cancer in B-Mode Ultrasound Imaging via Collaborative Learning with Elastography. IEEE J Biomed Health Inform 2022; 26:4474-4485. [PMID: 35763467 DOI: 10.1109/jbhi.2022.3186933] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/07/2022]
Abstract
Convolutional neural networks (CNNs) have been successfully applied in the computer-aided ultrasound diagnosis for breast cancer. Up to now, several CNN-based methods have been proposed. However, most of them consider tumor localization and classification as two separate steps, rather than performing them simultaneously. Besides, they suffer from the limited diagnosis information in the B-mode ultrasound (BUS) images. In this study, we develop a novel network ResNet-GAP that incorporates both localization and classification into a unified procedure. To enhance the performance of ResNet-GAP, we leverage stiffness information in the elastography ultrasound (EUS) modality by collaborative learning in the training stage. Specifically, a dual-channel ResNet-GAP is developed, one channel for BUS and the other for EUS. In each channel, multiple class activity maps (CAMs) are generated using a series of convolutional kernels of different sizes. The multi-scale consistency of the CAMs in both channels are further considered in network optimization. Experiments on 264 patients in this study show that the newly developed ResNet-GAP achieves an accuracy of 88.6%, a sensitivity of 95.3%, a specificity of 84.6%, and an AUC of 93.6% on the classification task, and a 1.0NLF of 87.9% on the localization task, which is better than some state-of-the-art approaches.
Collapse
|
2
|
Detectability of small objects in PET/computed tomography phantom images with Bayesian penalised likelihood reconstruction. Nucl Med Commun 2020; 41:666-673. [DOI: 10.1097/mnm.0000000000001204] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
3
|
Veiga D, Martins N, Ferreira M, Monteiro J. Automatic microaneurysm detection using laws texture masks and support vector machines. COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION 2017. [DOI: 10.1080/21681163.2017.1296379] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Affiliation(s)
- Diana Veiga
- Enermeter, Braga, Portugal
- Centro Algoritmi, University of Minho, Guimarães, Portugal
| | | | | | - João Monteiro
- Centro Algoritmi, University of Minho, Guimarães, Portugal
| |
Collapse
|
4
|
Lockwood P, Pittock L, Lockwood C, Jeffery C, Piper K. Intraorbital foreign body detection and localisation by radiographers: A preliminary JAFROC observer performance study. Radiography (Lond) 2016. [DOI: 10.1016/j.radi.2015.06.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
5
|
Analysing data from observer studies in medical imaging research: An introductory guide to free-response techniques. Radiography (Lond) 2014. [DOI: 10.1016/j.radi.2014.04.005] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
6
|
Pons G, Martí R, Ganau S, Sentís M, Martí J. Computerized detection of breast lesions using deformable part models in ultrasound images. ULTRASOUND IN MEDICINE & BIOLOGY 2014; 40:2252-2264. [PMID: 24912370 DOI: 10.1016/j.ultrasmedbio.2014.03.005] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/11/2013] [Revised: 01/18/2014] [Accepted: 03/06/2014] [Indexed: 06/03/2023]
Abstract
Ultrasound imaging is considered an important complementary technique for the screening of dense breasts. Detection of lesions at an early stage is a key step in which computerized lesion detection systems could play an important role in the analysis of US images. In this article, we propose adaptation of a generic object detection technique, deformable part models, to detect lesions in breast US images. The data set used in this study included 326 images, all from different patients (54 malignant lesions, 109 benign lesions and 163 healthy breasts). In terms of lesion detection, our proposal outperformed some of the most relevant approaches described in the literature; we obtained a sensitivity of 86% with 0.28 false-positive detection per image and an Az value of 0.975. In the detection of malignant lesions, our proposed approached had an Az value of 0.93 and a sensitivity of 78% at a 1.15 false-positive detections per image.
Collapse
Affiliation(s)
- Gerard Pons
- Department of Computer Architecture and Technology, University of Girona, Girona, Spain.
| | - Robert Martí
- Department of Computer Architecture and Technology, University of Girona, Girona, Spain
| | - Sergi Ganau
- Radiology Department, UDIAT-Centre Diagnòstic, Corporació Parc Taulí, Sabadell, Spain
| | - Melcior Sentís
- Radiology Department, UDIAT-Centre Diagnòstic, Corporació Parc Taulí, Sabadell, Spain
| | - Joan Martí
- Department of Computer Architecture and Technology, University of Girona, Girona, Spain
| |
Collapse
|
7
|
Helbren E, Halligan S, Phillips P, Boone D, Fanshawe TR, Taylor SA, Manning D, Gale A, Altman DG, Mallett S. Towards a framework for analysis of eye-tracking studies in the three dimensional environment: a study of visual search by experienced readers of endoluminal CT colonography. Br J Radiol 2014; 87:20130614. [PMID: 24689842 PMCID: PMC4075527 DOI: 10.1259/bjr.20130614] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2013] [Revised: 01/16/2014] [Accepted: 02/17/2014] [Indexed: 11/05/2022] Open
Abstract
OBJECTIVE Eye tracking in three dimensions is novel, but established descriptors derived from two-dimensional (2D) studies are not transferable. We aimed to develop metrics suitable for statistical comparison of eye-tracking data obtained from readers of three-dimensional (3D) "virtual" medical imaging, using CT colonography (CTC) as a typical example. METHODS Ten experienced radiologists were eye tracked while observing eight 3D endoluminal CTC videos. Subsequently, we developed metrics that described their visual search patterns based on concepts derived from 2D gaze studies. Statistical methods were developed to allow analysis of the metrics. RESULTS Eye tracking was possible for all readers. Visual dwell on the moving region of interest (ROI) was defined as pursuit of the moving object across multiple frames. Using this concept of pursuit, five categories of metrics were defined that allowed characterization of reader gaze behaviour. These were time to first pursuit, identification and assessment time, pursuit duration, ROI size and pursuit frequency. Additional subcategories allowed us to further characterize visual search between readers in the test population. CONCLUSION We propose metrics for the characterization of visual search of 3D moving medical images. These metrics can be used to compare readers' visual search patterns and provide a reproducible framework for the analysis of gaze tracking in the 3D environment. ADVANCES IN KNOWLEDGE This article describes a novel set of metrics that can be used to describe gaze behaviour when eye tracking readers during interpretation of 3D medical images. These metrics build on those established for 2D eye tracking and are applicable to increasingly common 3D medical image displays.
Collapse
Affiliation(s)
- E Helbren
- Centre for Medical Imaging, University College London, London, UK
| | | | | | | | | | | | | | | | | | | |
Collapse
|
8
|
Thompson JD, Hogg P, Manning DJ, Szczepura K, Chakraborty DP. A free-response evaluation determining value in the computed tomography attenuation correction image for revealing pulmonary incidental findings: a phantom study. Acad Radiol 2014; 21:538-45. [PMID: 24594424 DOI: 10.1016/j.acra.2014.01.003] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2013] [Revised: 01/08/2014] [Accepted: 01/08/2014] [Indexed: 11/28/2022]
Abstract
RATIONALE AND OBJECTIVES The purpose of this study was to compare lesion-detection performance when interpreting computed tomography (CT) images that are acquired for attenuation correction when performing single photon emission computed tomography/computed tomography (SPECT/CT) myocardial perfusion studies. In the United Kingdom, there is a requirement that these images be interpreted; thus, it is necessary to understand observer performance on these images. MATERIALS AND METHODS An anthropomorphic chest phantom with inserted spherical lesions of different sizes and contrasts was scanned on five different SPECT/CT systems using site-specific CT protocols for SPECT/CT myocardial perfusion imaging. Twenty-one observers (0-4 years of CT experience) searched 26 image slices (17 abnormal, containing 1-3 lesions, and 9 normal, containing no lesions) for each CT acquisition. The observers marked and rated perceived lesions under the free-response paradigm. Four analyses were conducted using jackknife alternative free-response receiver operating characteristic (JAFROC) analysis: (1) 20-pixel acceptance radius (AR) with all 21 readers, abbreviated to 20/ALL analysis, (2) 40-pixel AR with 21 readers (40/ALL), (3) 20-pixel AR with 14 readers experienced in CT (20/EXP), and (4) 20-pixel AR with 7 readers with no CT experience (20/NOT). The significance level of the test was set so as to conservatively control the overall probability of a type I error to <0.05. RESULTS The mean JAFROC figure of merit (FOM) for the five CT acquisitions for the 20/ALL study were 0.602, 0.639, 0.372, 0.475, and 0.719 with a significant difference in lesion-detection performance evident between all individual treatment pairs (P < .0001) with the exception of the 1-2 pairing, which was not significant (these differed only in milliamp seconds). System 5, which had the highest performance, had the smallest slice thickness and the largest matrix size. For the other analyses, the system orderings remained unchanged, and the significance of FOM difference findings remained identical to those for 20/ALL, with one exception: for 20/EXP analysis the 1-2 difference became significant with the higher milliamp seconds superior. Improved detection performance was associated with a smaller slice thickness, increased matrix size, and, to a lesser extent, increased tube charge. CONCLUSIONS Protocol variations for CT-based attenuation correction (AC) in SPECT/CT imaging have a measurable impact on lesion-detection performance. The results imply that z-axis resolution and matrix size had the greatest impact on lesion detection, with a weaker but detectable dependence on the product of milliamp and seconds.
Collapse
Affiliation(s)
- John D Thompson
- University of Salford, Directorate of Radiography, 6th Floor Allerton Building, Frederick Road Campus, Greater Manchester, M6 6PU, UK; University Hospitals of Morecambe Bay NHS Foundation Trust, Nuclear Medicine, Furness General Hospital, Dalton Lane, Barrow-in-Furness, Cumbria, LA14 4LF, UK.
| | - Peter Hogg
- University of Salford, Directorate of Radiography, 6th Floor Allerton Building, Frederick Road Campus, Greater Manchester, M6 6PU, UK
| | - David J Manning
- Lancaster Medical School, Faculty of Health & Medicine, Furness College, Lancaster University, Lancaster LA1 4YG, UK
| | - Katy Szczepura
- University of Salford, Directorate of Radiography, 6th Floor Allerton Building, Frederick Road Campus, Greater Manchester, M6 6PU, UK
| | - Dev P Chakraborty
- Medical Image Interpretation Assessment Laboratory, University of Pittsburgh, Presby South Tower, Room 4771, 200 Lothrop Street, Pittsburgh, PA 15213, USA
| |
Collapse
|
9
|
A brief history of free-response receiver operating characteristic paradigm data analysis. Acad Radiol 2013; 20:915-9. [PMID: 23583665 DOI: 10.1016/j.acra.2013.03.001] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2013] [Revised: 03/01/2013] [Accepted: 03/07/2013] [Indexed: 11/23/2022]
Abstract
In the receiver operating characteristic paradigm the observer assigns a single rating to each image and the location of the perceived abnormality, if any, is ignored. In the free-response receiver operating characteristic paradigm the observer is free to mark and rate as many suspicious regions as are considered clinically reportable. Credit for a correct localization is given only if a mark is sufficiently close to an actual lesion; otherwise, the observer's mark is scored as a location-level false positive. Until fairly recently there existed no accepted method for analyzing the resulting relatively unstructured data containing random numbers of mark-rating pairs per image. This report reviews the history of work in this field, which has now spanned more than five decades. It introduces terminology used to describe the paradigm, proposed measures of performance (figures of merit), ways of visualizing the data (operating characteristics), and software for analyzing free-response receiver operating characteristic studies.
Collapse
|
10
|
Phillips P, Boone D, Mallett S, Taylor SA, Altman DG, Manning D, Gale A, Halligan S. Method for Tracking Eye Gaze during Interpretation of Endoluminal 3D CT Colonography: Technical Description and Proposed Metrics for Analysis. Radiology 2013; 267:924-31. [DOI: 10.1148/radiol.12120062] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
11
|
Thompson J, Hogg P, Thompson S, Manning D, Szczepura K. ROCView: prototype software for data collection in jackknife alternative free-response receiver operating characteristic analysis. Br J Radiol 2012; 85:1320-6. [PMID: 22573294 DOI: 10.1259/bjr/99497945] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open
Abstract
ROCView has been developed as an image display and response capture (IDRC) solution to image display and consistent recording of reader responses in relation to the free-response receiver operating characteristic paradigm. A web-based solution to IDRC for observer response studies allows observations to be completed from any location, assuming that display performance and viewing conditions are consistent with the study being completed. The simplistic functionality of the software allows observations to be completed without supervision. ROCView can display images from multiple modalities, in a randomised order if required. Following registration, observers are prompted to begin their image evaluation. All data are recorded via mouse clicks, one to localise (mark) and one to score confidence (rate) using either an ordinal or continuous rating scale. Up to nine "mark-rating" pairs can be made per image. Unmarked images are given a default score of zero. Upon completion of the study, both true-positive and false-positive reports can be downloaded and adapted for analysis. ROCView has the potential to be a useful tool in the assessment of modality performance difference for a range of imaging methods.
Collapse
Affiliation(s)
- J Thompson
- Department of Nuclear Medicine, Furness General Hospital, Barrow-in-Furness, UK.
| | | | | | | | | |
Collapse
|
12
|
Haygood TM, Ryan J, Brennan PC, Li S, Marom EM, McEntee MF, Itani M, Evanoff M, Chakraborty D. On the choice of acceptance radius in free-response observer performance studies. Br J Radiol 2012; 86:42313554. [PMID: 22573302 DOI: 10.1259/bjr/42313554] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open
Abstract
OBJECTIVES Choosing an acceptance radius or proximity criterion is necessary to analyse free-response receiver operating characteristic (FROC) observer performance data. This is currently subjective, with little guidance in the literature about what is an appropriate acceptance radius. We evaluated varying acceptance radii in a nodule detection task in chest radiography and suggest guidelines for determining an acceptance radius. METHODS 80 chest radiographs were chosen, half of which contained nodules. We determined each nodule's centre. 21 radiologists read the images. We created acceptance radii bins of <5 pixels, <10 pixels, <20 pixels and onwards up to <200 and 200+ pixels. We counted lesion localisations in each bin and visually compared marks with the borders of nodules. RESULTS Most reader marks were tightly clustered around nodule centres, with tighter clustering for smaller than for larger nodules. At least 70% of readers' marks were placed within <10 pixels for small nodules, <20 pixels for medium nodules and <30 pixels for large nodules. Of 72 inspected marks that were less than 50 pixels from the centre of a nodule, only 1 fell outside the border of a nodule. CONCLUSION The acceptance radius should be based on the larger nodule sizes. For our data, an acceptance radius of 50 pixels would have captured all but 2 reader marks within the borders of a nodule, while excluding only 1 true-positive mark. The choice of an acceptance radius for FROC analysis of observer performance studies should be based on the size of larger abnormalities.
Collapse
Affiliation(s)
- T M Haygood
- The University of Texas MD Anderson Cancer Center, Houston, TX, USA.
| | | | | | | | | | | | | | | | | |
Collapse
|
13
|
Chakraborty DP, Haygood TM, Ryan J, Marom EM, Evanoff M, McEntee MF, Brennan PC. Quantifying the clinical relevance of a laboratory observer performance paradigm. Br J Radiol 2012; 85:1287-302. [PMID: 22573296 DOI: 10.1259/bjr/45866310] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open
Abstract
OBJECTIVE Laboratory observer performance measurements, receiver operating characteristic (ROC) and free-response ROC (FROC) differ from actual clinical interpretations in several respects, which could compromise their clinical relevance. The objective of this study was to develop a method for quantifying the clinical relevance of a laboratory paradigm and apply it to compare the ROC and FROC paradigms in a nodule detection task. METHODS The original prospective interpretations of 80 digital chest radiographs were classified by the truth panel as correct (C=1) or incorrect (C=0), depending on correlation with additional imaging, and the average of C was interpreted as the clinical figure of merit. FROC data were acquired for 21 radiologists and ROC data were inferred using the highest ratings. The areas under the ROC and alternative FROC curves were used as laboratory figures of merit. Bootstrap analysis was conducted to estimate conventional agreement measures between laboratory and clinical figures of merit. Also computed was a pseudovalue-based image-level correctness measure of the laboratory interpretations, whose association with C as measured by the area (rAUC) under an appropriately defined relevance ROC curve, is as a measure of the clinical relevance of a laboratory paradigm. RESULTS Low correlations (e.g. κ=0.244) and near chance level rAUC values (e.g. 0.598), attributable to differences between the clinical and laboratory paradigms, were observed. The absolute width of the confidence interval was 0.38 for the interparadigm differences of the conventional measures and 0.14 for the difference of the rAUCs. CONCLUSION The rAUC measure was consistent with the traditional measures but was more sensitive to the differences in clinical relevance. A new relevance ROC method for quantifying the clinical relevance of a laboratory paradigm is proposed.
Collapse
Affiliation(s)
- D P Chakraborty
- Department of Radiology, University of Pittsburgh, Pittsburgh, PA 15213, USA.
| | | | | | | | | | | | | |
Collapse
|
14
|
Abstract
A common task in medical imaging is assessing whether a new imaging system, or a variant of an existing one, is an improvement over an existing imaging technology. Imaging systems are generally quite complex, consisting of several components-for example, image acquisition hardware, image processing and display hardware and software, and image interpretation by radiologists- each of which can affect performance. Although it may appear odd to include the radiologist as a "component" of the imaging chain, because the radiologist's decision determines subsequent patient care, the effect of the human interpretation has to be included. Physical measurements such as modulation transfer function, signal-to-noise ratio, are useful for characterizing the nonhuman parts of the imaging chain under idealized and often unrealistic conditions, such as uniform background phantoms and target objects with sharp edges. Measuring the performance of the entire imaging chain, including the radiologist, and using real clinical images requires different methods that fall under the rubric of observer performance methods or "ROC" analysis, that involve collecting rating data on images. The purpose of this work is to review recent developments in this field, particularly with respect to the free-response method, where location information is also collected.
Collapse
Affiliation(s)
- Dev P Chakraborty
- Department of Radiology, University of Pittsburgh, Pittsburgh, PA 15213, USA.
| |
Collapse
|
15
|
Chakraborty DP. Recent developments in imaging system assessment methodology, FROC analysis and the search model. NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH. SECTION A, ACCELERATORS, SPECTROMETERS, DETECTORS AND ASSOCIATED EQUIPMENT 2011; 648 Supplement 1:S297-S301. [PMID: 21804679 PMCID: PMC3144765 DOI: 10.1016/j.nima.2010.11.042] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
A frequent problem in imaging is assessing whether a new imaging system is an improvement over an existing standard. Observer performance methods, in particular the receiver operating characteristic (ROC) paradigm, are widely used in this context. In ROC analysis lesion location information is not used and consequently scoring ambiguities can arise in tasks, such as nodule detection, involving finding localized lesions. This paper reviews progress in the free-response ROC (FROC) paradigm in which the observer marks and rates suspicious regions and the location information is used to determine whether lesions were correctly localized. Reviewed are FROC data analysis, a search-model for simulating FROC data, predictions of the model and a method for estimating the parameters. The search model parameters are physically meaningful quantities that can guide system optimization.
Collapse
|
16
|
Choi SA, Lee SS, Jung IH, Kim HA, Byun JH, Lee MG. The effect of gadoxetic acid enhancement on lesion detection and characterisation using T₂ weighted imaging and diffusion weighted imaging of the liver. Br J Radiol 2010; 85:29-36. [PMID: 21123305 DOI: 10.1259/bjr/12929687] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
OBJECTIVES To evaluate the effect of gadoxetic acid enhancement on the detection and characterisation of focal hepatic lesions on T(2) weighted and diffusion weighted (DW) images. METHODS A total of 63 consecutive patients underwent T(2) weighted and DW imaging before and after gadoxetic acid enhancement. Two blinded readers independently identified all of the focal lesions using a five-point confidence scale and characterised each lesion using a three-point scale: 1, non-solid; 2, indeterminate; and 3, solid. For both T(2) weighted and DW imaging, the accuracies for detecting focal lesions were compared using the free-response receiver operating characteristic analysis; the accuracies for lesion characterisation were compared using the McNemar test between non-enhanced and gadoxetic acid-enhanced image sets. For hepatic lesions ≥ 1 cm, the lesion-to-liver contrast-to-noise ratio (CNR) and the apparent diffusion coefficient (ADC) were compared in the non-enhanced and enhanced image sets using the generalised estimating equations. RESULTS For both T(2) weighted and DW images, the accuracies for detecting focal lesions (p ≥ 0.52) and those for lesion characterisation (p ≥ 0.63) did not differ significantly between the non-enhanced and enhanced image sets. The lesion-to-liver CNR was significantly higher on enhanced DW images than on non-enhanced DW images (p=0.02), although the difference was not significant for T(2) weighted imaging (p=0.65). The mean ADC values of lesions did not differ significantly on enhanced and non-enhanced DW imaging (p=0.75). CONCLUSION The acquisition of T(2) weighted and DW images after administration of gadoxetic acid has no significant effect on the detection or characterisation of focal hepatic lesions, although it improves the lesion-to-liver CNR on DW images.
Collapse
Affiliation(s)
- S A Choi
- Department of Radiology and Research Institute of Radiology, University of Ulsan College of Medicine, Asan Medical Center, Seoul, Republic of Korea
| | | | | | | | | | | |
Collapse
|
17
|
Abstract
The purpose of this paper is to summarise recent progress in free-response receiver operating characteristic (FROC) methodology. These are: (1) jackknife alternative FROC analysis including recent extensions and alternative methods; (2) the search-model simulator that enables validation and objective comparison of different methods of analysing the data; (3) case-based analysis that has the potential of greater clinical relevance than conventional free-response analysis; (4) a method for collectively analysing the multiple lesion types in an image (e.g. microcalcifications, masses and architectural distortions); (5) a method for sample-size estimation for FROC studies; and (6) a method for determining an objective proximity criterion, namely how close must a mark be to a true lesion in order to credit the observer for a true localisation. FROC analysis is being increasingly used to evaluate the imaging systems and understanding of recent progress should help researchers conduct better FROC studies.
Collapse
Affiliation(s)
- D P Chakraborty
- Department of Radiology, University of Pittsburgh, Pittsburgh, PA, USA.
| |
Collapse
|
18
|
Zanca F, Chakraborty DP, Marchal G, Bosmans H. Consistency of methods for analysing location-specific data. RADIATION PROTECTION DOSIMETRY 2010; 139:52-56. [PMID: 20159917 PMCID: PMC2868070 DOI: 10.1093/rpd/ncq030] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]
Abstract
Although the receiver operating characteristic (ROC) method is the acknowledged gold-standard for imaging system assessment, it ignores localisation information and differentiation between multiple abnormalities per case. As the free-response ROC (FROC) method uses localisation information and more closely resembles the clinical reporting process, it is being increasingly used. A number of methods have been proposed to analyse the data that result from an FROC study: jackknife alternative FROC (JAFROC) and a variant termed JAFROC1, initial detection and candidate analysis (IDCA) and ROC analysis via the reduction of the multiple ratings on a case to a single rating. The focus of this paper was to compare JAFROC1, IDCA and the ROC analysis methods using a clinical FROC human data set. All methods agreed on the ordering of the modalities and all yielded statistically significant differences of the figures-of-merit, i.e. p < 0.05. Both IDCA and JAFROC1 yielded much smaller p-values than ROC. The results are consistent with a recent simulation-based validation study comparing these and other methods. In conclusion, IDCA or JAFROC1 analysis of FROC human data may be superior at detecting modality differences than ROC analysis.
Collapse
Affiliation(s)
- F Zanca
- Department of Radiology, Leuven University Center of Medical Physics in Radiology, University Hospitals Leuven, 3000 Leuven, Belgium.
| | | | | | | |
Collapse
|
19
|
Gur D, Bandos AI, Klym AH, Cohen CS, Hakim CM, Hardesty LA, Ganott MA, Perrin RL, Poller WR, Shah R, Sumkin JH, Wallace LP, Rockette HE. Agreement of the order of overall performance levels under different reading paradigms. Acad Radiol 2008; 15:1567-73. [PMID: 19000873 DOI: 10.1016/j.acra.2008.07.011] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2008] [Revised: 07/15/2008] [Accepted: 07/15/2008] [Indexed: 11/27/2022]
Abstract
RATIONALE AND OBJECTIVES To investigate consistency of the orders of performance levels when interpreting mammograms under three different reading paradigms. MATERIALS AND METHODS We performed a retrospective observer study in which nine experienced radiologists rated an enriched set of mammography examinations that they personally had read in the clinic ("individualized") mixed with a set that none of them had read in the clinic ("common set"). Examinations were interpreted under three different reading paradigms: binary using screening Breast Imaging Reporting and Data System (BI-RADS), receiver-operating characteristic (ROC), and free-response ROC (FROC). The performance in discriminating between cancer and noncancer findings under each of the paradigms was summarized using Youden's index/2+0.5 (Binary), nonparameteric area under the ROC curve (AUC), and an overall FROC index (JAFROC-2). Pearson correlation coefficients were then computed to assess consistency in the ordering of observers' performance levels. Statistical significance of the computed correlation coefficients was assessed using bootstrap confidence intervals obtained by resampling sets of examination-specific observations. RESULTS All but one of the computed pair-wise correlation coefficients were larger than 0.66 and were significantly different from zero. The correlation between the overall performance measures under the Binary and ROC paradigms was the lowest (0.43) and was not significantly different from zero (95% confidence interval -0.078 to 0.733). CONCLUSION The use of different evaluation paradigms in the laboratory tends to lead to consistent ordering of the overall performance levels of observers. However, one should recognize that conceptually similar performance indexes resulting from different paradigms often measure different performance characteristics and thus disagreements are not only possible but frequently quite natural.
Collapse
Affiliation(s)
- David Gur
- Department of Radiology, University of Pittsburgh, 3362 Fifth Avenue, Pittsburgh, PA 15213-3180, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
20
|
Performance assessments of diagnostic systems under the FROC paradigm: experimental, analytical, and results interpretation issues. Acad Radiol 2008; 15:1312-5. [PMID: 18790403 DOI: 10.1016/j.acra.2008.05.006] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2008] [Revised: 05/22/2008] [Accepted: 04/29/2008] [Indexed: 11/22/2022]
Abstract
As use of free response receiver-operating characteristic (FROC) curves gains more acceptance for quantitatively assessing the performance of diagnostic systems, it is important that the experimentalist understands the possible role of this approach as one of the experimental design paradigms that are available to him or her among all other approaches as well as some of the issues associated with FROC type studies. In a number of experimental scenarios, the FROC paradigm and associated analytical tools have theoretical and practical advantages over both the binary and the ROC approaches to performance assessments of diagnostic systems, but it also has some limitations related to experimental design, data analyses, clinical relevance, and complexity in the interpretation of the results. These issues are rarely discussed and are the focus of this work.
Collapse
|
21
|
Chakraborty DP, Yoon HJ. Operating characteristics predicted by models for diagnostic tasks involving lesion localization. Med Phys 2008; 35:435-45. [PMID: 18383663 DOI: 10.1118/1.2820902] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
Abstract
In 1996 Swensson published an observer model that predicted receiver operating characteristic (ROC), localization ROC (LROC), free-response ROC (FROC) and alternative FROC (AFROC) curves, thereby achieving "unification" of different observer performance paradigms. More recently a model termed initial detection and candidate analysis (IDCA) has been proposed for fitting computer aided detection (CAD) generated FROC data, and recently a search model for human observer FROC data has been proposed. The purpose of this study was to derive IDCA and the search model based expressions for operating characteristics, and to compare the predictions to the Swensson model. For three out of four mammography CAD data sets all models yielded good fits in the high-confidence region, i.e., near the lower end of the plots. The search model and IDCA tended to better fit the data in the low-confidence region, i.e., near the upper end of the plots, particularly for FROC curves for which the Swensson model predictions departed markedly from the data. For one data set none of the models yielded satisfactory fits. A unique characteristic of search model and IDCA predicted operating characteristics is that the operating point is not allowed to move continuously to the lowest confidence limit of the corresponding Swensson model curves. This prediction is actually observed in the CAD raw data and it is the primary reason for the poor FROC fits of the Swensson model in the low-confidence region.
Collapse
Affiliation(s)
- D P Chakraborty
- Department of Radiology, University of Pittsburgh, 3520 Forbes Avenue, Parkvale Building, Room 109, Pittsburgh, Pennsylvania 15261, USA.
| | | |
Collapse
|
22
|
ROC analysis in medical imaging: a tutorial review of the literature. Radiol Phys Technol 2007; 1:2-12. [PMID: 20821157 DOI: 10.1007/s12194-007-0002-1] [Citation(s) in RCA: 77] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2007] [Accepted: 09/25/2007] [Indexed: 10/22/2022]
|
23
|
Abstract
Computer-aided detection (CAD) has been attracting extensive research interest during the last two decades. It is recognized that the full potential of CAD can only be realized by improving the performance and robustness of CAD algorithms and this requires good evaluation methodology that would permit CAD designers to optimize their algorithms. Free-response receiver operating characteristic (FROC) curves are widely used to assess CAD performance, however, evaluation rarely proceeds beyond determination of lesion localization fraction (sensitivity) at an arbitrarily selected value of nonlesion localizations (false marks) per image. This work describes a FROC curve fitting procedure that uses a recent model of visual search that serves as a framework for the free-response task. A maximum likelihood procedure for estimating the parameters of the model from free-response data and fitting CAD generated FROC curves was implemented. Procedures were implemented to estimate two figures of merit and associated statistics such as 95% confidence intervals and goodness of fit. One of the figures of merit does not require the arbitrary specification of an operating point at which to evaluate CAD performance. For comparison a related method termed initial detection and candidate analysis was also implemented that is applicable when all suspicious regions are reported. The two methods were tested on seven mammography CAD data sets and both yielded good to excellent fits. The search model approach has the advantage that it can potentially be applied to radiologist generated free-response data where not all suspicious regions are reported, only the ones that are deemed sufficiently suspicious to warrant clinical follow-up. This work represents the first practical application of the search model to an important evaluation problem in diagnostic radiology. Software based on this work is expected to benefit CAD developers working in diverse areas of medical imaging.
Collapse
Affiliation(s)
- Hong Jun Yoon
- Department of Radiology, University of Pittsburgh, Pittsburgh, PA 15261
| | - Bin Zheng
- Department of Radiology, University of Pittsburgh, Pittsburgh, PA 15261
| | - Berkman Sahiner
- Department of Radiology, University of Michigan, Ann Arbor, MI 48109
| | | |
Collapse
|