51
|
Buist DSM, Anderson ML, Smith RA, Carney PA, Miglioretti DL, Monsees BS, Sickles EA, Taplin SH, Geller BM, Yankaskas BC, Onega TL. Effect of radiologists' diagnostic work-up volume on interpretive performance. Radiology 2014; 273:351-64. [PMID: 24960110 DOI: 10.1148/radiol.14132806] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
PURPOSE To examine radiologists' screening performance in relation to the number of diagnostic work-ups performed after abnormal findings are discovered at screening mammography by the same radiologist or by different radiologists. MATERIALS AND METHODS In an institutional review board-approved HIPAA-compliant study, the authors linked 651 671 screening mammograms interpreted from 2002 to 2006 by 96 radiologists in the Breast Cancer Surveillance Consortium to cancer registries (standard of reference) to evaluate the performance of screening mammography (sensitivity, false-positive rate [ FPR false-positive rate ], and cancer detection rate [ CDR cancer detection rate ]). Logistic regression was used to assess the association between the volume of recalled screening mammograms ("own" mammograms, where the radiologist who interpreted the diagnostic image was the same radiologist who had interpreted the screening image, and "any" mammograms, where the radiologist who interpreted the diagnostic image may or may not have been the radiologist who interpreted the screening image) and screening performance and whether the association between total annual volume and performance differed according to the volume of diagnostic work-up. RESULTS Annually, 38% of radiologists performed the diagnostic work-up for 25 or fewer of their own recalled screening mammograms, 24% performed the work-up for 0-50, and 39% performed the work-up for more than 50. For the work-up of recalled screening mammograms from any radiologist, 24% of radiologists performed the work-up for 0-50 mammograms, 32% performed the work-up for 51-125, and 44% performed the work-up for more than 125. With increasing numbers of radiologist work-ups for their own recalled mammograms, the sensitivity (P = .039), FPR false-positive rate (P = .004), and CDR cancer detection rate (P < .001) of screening mammography increased, yielding a stepped increase in women recalled per cancer detected from 17.4 for 25 or fewer mammograms to 24.6 for more than 50 mammograms. Increases in work-ups for any radiologist yielded significant increases in FPR false-positive rate (P = .011) and CDR cancer detection rate (P = .001) and a nonsignificant increase in sensitivity (P = .15). Radiologists with a lower annual volume of any work-ups had consistently lower FPR false-positive rate , sensitivity, and CDR cancer detection rate at all annual interpretive volumes. CONCLUSION These findings support the hypothesis that radiologists may improve their screening performance by performing the diagnostic work-up for their own recalled screening mammograms and directly receiving feedback afforded by means of the outcomes associated with their initial decision to recall. Arranging for radiologists to work up a minimum number of their own recalled cases could improve screening performance but would need systems to facilitate this workflow.
Collapse
Affiliation(s)
- Diana S M Buist
- From the Group Health Research Institute, Group Health Cooperative, 1730 Minor Ave, Suite 1600, Seattle, WA 98101 (D.S.M.B., M.L.A., D.L.M.); Cancer Control Science Department, American Cancer Society, Atlanta, Ga (R.A.S.); Departments of Family Medicine and Public Health and Preventive Medicine, Oregon Health & Science University, Portland, Ore (P.A.C.); Department of Biostatistics, University of Washington School of Public Health, Seattle, Wash (D.L.M.); Mallinckrodt Institute of Radiology, Washington University School of Medicine, St Louis, Mo (B.S.M.); Department of Radiology, University of California, San Francisco, Calif (E.A.S.); Division of Cancer Control and Population Science, Behavioral Research Program, National Cancer Institute, Rockville, Md (S.H.T.); Department of Family Medicine, University of Vermont, College of Medicine, Burlington, Vt (B.M.G.); Department of Radiology, University of North Carolina, Chapel Hill, NC (B.C.Y.); and Department of Community and Family Medicine, Dartmouth Institute for Health Policy and Clinical Practice, Geisel School of Medicine at Dartmouth, Norris Cotton Cancer Center, Lebanon, NH (T.L.O.)
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
52
|
Zheng B, Tan M, Ramalingam P, Gur D. Association between computed tissue density asymmetry in bilateral mammograms and near-term breast cancer risk. Breast J 2014; 20:249-57. [PMID: 24673749 DOI: 10.1111/tbj.12255] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
This study investigated association between bilateral mammographic density asymmetry and near-term breast cancer risk. A data base of digital mammograms acquired from 690 women was retrospectively collected. All images were originally interpreted as negative by radiologists. During the next subsequent screening examinations (between 12 and 36 months later), 230 women were diagnosed positive for cancer, 230 were recalled for additional diagnostic workups and proved to be benign, and 230 remained negative (not recalled). We applied a computerized scheme to compute the differences of five image features between the left and right mammograms, and trained an artificial neural network (ANN) to compute a bilateral mammographic density asymmetry score. Odds ratios (ORs) were used to assess associations between the ANN-generated scores and risk of women having detectable cancers during the next screening examinations. A logistic regression method was applied to test for trend as a function of the increase in ANN-generated scores. The results were also compared with ORs computed using other existing cancer risk factors. The ORs showed an increasing risk trend with the increase in ANN-generated scores (from 1.00 to 9.07 between positive and negative case groups). The regression analysis also showed a significant increase trend in slope (p < 0.05). No significant increase trends of the ORs were found when using woman's age, subjectively rated breast density, or family history of breast cancer. This study demonstrated that the computed bilateral mammographic density asymmetry had potential to be used as a new risk factor to improve discriminatory power in predicting near-term risk of women developing breast cancer.
Collapse
Affiliation(s)
- Bin Zheng
- School of Electrical and Computer Engineering, University of Oklahoma, Norman, Oklahoma
| | | | | | | |
Collapse
|
53
|
Lee CI, Elmore JG. Increasing value by increasing volume: call for changes in US breast cancer screening practices. J Natl Cancer Inst 2014; 106:dju028. [PMID: 24598716 DOI: 10.1093/jnci/dju028] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Affiliation(s)
- Christoph I Lee
- Affiliations of authors: Department of Radiology (CIL) and Department of Medicine (JGE), University of Washington School of Medicine, Seattle WA; Department of Health Services (CIL) and Department of Epidemiology (JGE), University of Washington School of Public Health, Seattle WA
| | | |
Collapse
|
54
|
Théberge I, Chang SL, Vandal N, Daigle JM, Guertin MH, Pelletier E, Brisson J. Radiologist interpretive volume and breast cancer screening accuracy in a Canadian organized screening program. J Natl Cancer Inst 2014; 106:djt461. [PMID: 24598715 DOI: 10.1093/jnci/djt461] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
BACKGROUND To strengthen evidence on which radiologist mammography interpretive volume requirements can be based, we assessed the relation of volume to accuracy in the Quebec Breast Cancer Screening Program. METHODS Annual interpretive volume (total, screening, and diagnostic) for all 340 radiologists who interpreted 1315327 screening examinations in the period from 2000 to 2006 was obtained using provincial databases. The association of volume to sensitivity, false-positive rate, and accuracy (sensitivity/false-positive rate) was assessed by multivariable Poisson regression with robust error variance. All statistical tests were two-sided. RESULTS Radiologists consistently interpreting less than 500 mammograms annually experienced a 58% reduction in accuracy (adjusted accuracy ratio = 0.42; 95% confidence interval [CI] = 0.24 to 0.74) compared with those who consistently interpreted at least 500 mammograms annually. Moreover, accuracy increased progressively as total annual volume increased (P trend = .0005). Radiologists interpreting at least 4000 mammograms annually experienced a 32% increase in accuracy (adjusted accuracy ratio = 1.32; 95% CI = 1.13 to 1.54) compared with those interpreting 500 to 999 mammograms annually. This increase in accuracy is attributable to a reduction in false-positive rate as total volume increased (P trend = .001). Sensitivity changed little with total volume (P trend = .68). Gains in accuracy were greater up to approximately 3000 mammograms interpreted annually. CONCLUSIONS The minimum annual volume of 500 mammograms required in North America is justified; radiologist accuracy may be compromised if interpretive volume is consistently less than this requirement. Raising interpretive volume may help to reduce the frequency of false positives without loss of sensitivity. Possible gains in accuracy may be greater with increases in volume of up to approximately 3000 mammograms interpreted annually.
Collapse
Affiliation(s)
- Isabelle Théberge
- Affiliation of authors: Direction de l'analyse et de l'évaluation des systèmes de soins et services, Institut national de santé publique du Québec, Québec City, Canada (IT, SLC, NV, JMD, MHG, EP, JB); Unité de Recherche en Santé des Populations (URESP), Centre de recherche FRSQ du Centre hospitalier affilié universitaire de Québec, Québec city, Canada (JB)
| | | | | | | | | | | | | |
Collapse
|
55
|
Variations in screening outcome among pairs of screening radiologists at non-blinded double reading of screening mammograms: a population-based study. Eur Radiol 2014; 24:1097-104. [PMID: 24500086 DOI: 10.1007/s00330-014-3102-4] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2013] [Revised: 12/10/2013] [Accepted: 01/17/2014] [Indexed: 01/09/2023]
Abstract
OBJECTIVES Substantial inter-observer variability in screening mammography interpretation has been reported at single reading. However, screening results of pairs of screening radiologists have not yet been published. We determined variations in screening performances among pairs of screening radiologists at non-blinded double reading. METHODS We included pairs of screening radiologists with at least 7,500 screening examinations per pair, obtained between 1997 and 2011. During 2-year follow-up, breast imaging reports, surgical reports and pathology results were collected of all referred women and interval cancers. Referral rate, cancer detection rate, positive predictive value and sensitivity were calculated for each pair. RESULTS A total of 310,906 screening mammograms, read by 26 pairs of screening radiologists, were included for analysis. The referral rate ranged from 1.0 % (95 % CI 0.8 %-1.2 %) to 1.5 % (95 % CI 1.3 %-1.8 %), the cancer detection rate from 4.0 (95 % CI 2.8-5.2) to 6.3 (95 % CI 4.5-8.0) per 1,000 screens. The programme sensitivity and positive predictive value of referral ranged from 55.1 % (95 % CI 45.1 %-65.1 %) to 81.5 % (95 % CI 73.4 %-89.6 %) and from 28.7 % (95 % CI 20.8 %-36.6 %) to 49.5 % (95 % CI 39.7 %-59.3 %), respectively. CONCLUSION We found significant variations in screening outcomes among pairs of screening radiologists at non-blinded double reading. This stresses the importance of monitoring screening results on a local scale. KEY POINTS • Substantial inter-observer variability in screening mammography interpretation is known at single reading • Population-based study showed significant variations in outcomes among pairs of screening radiologists • Local monitoring and regular feedback are important to optimise screening outcome.
Collapse
|
56
|
Prediction of near-term breast cancer risk based on bilateral mammographic feature asymmetry. Acad Radiol 2013; 20:1542-50. [PMID: 24200481 DOI: 10.1016/j.acra.2013.08.020] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2013] [Revised: 08/28/2013] [Accepted: 08/29/2013] [Indexed: 11/23/2022]
Abstract
RATIONALE AND OBJECTIVES The objective of this study is to investigate the feasibility of predicting near-term risk of breast cancer development in women after a negative mammography screening examination. It is based on a statistical learning model that combines computerized image features related to bilateral mammographic tissue asymmetry and other clinical factors. MATERIALS AND METHODS A database of negative digital mammograms acquired from 994 women was retrospectively collected. In the next sequential screening examination (12 to 36 months later), 283 women were diagnosed positive for cancer, 349 were recalled for additional diagnostic workups and later proved to be benign, and 362 remain negative (not recalled). From an initial pool of 183 features, we applied a Sequential Forward Floating Selection feature selection method to search for effective features. Using 10 selected features, we developed and trained a support vector machine classification model to compute a cancer risk or probability score for each case. The area under the receiver operating characteristic curve and odds ratios (ORs) were used as the two performance assessment indices. RESULTS The area under the receiver operating characteristic curve = 0.725 ± 0.018 was obtained for positive and negative/benign case classification. The ORs showed an increasing risk trend with increasing model-generated risk scores (from 1.00 to 12.34, between positive and negative/benign case groups). Regression analysis of ORs also indicated a significant increase trend in slope (P = .006). CONCLUSIONS This study demonstrates that the risk scores computed by a new support vector machine model involving bilateral mammographic feature asymmetry have potential to assist the prediction of near-term risk of women for developing breast cancer.
Collapse
|
57
|
Carney PA, Bogart A, Sickles EA, Smith R, Buist DSM, Kerlikowske K, Onega T, Miglioretti DL, Rosenberg R, Yankaskas BC, Geller BM. Feasibility and acceptability of conducting a randomized clinical trial designed to improve interpretation of screening mammography. Acad Radiol 2013; 20:1389-98. [PMID: 24119351 PMCID: PMC4152937 DOI: 10.1016/j.acra.2013.08.017] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2013] [Revised: 08/20/2013] [Accepted: 08/21/2013] [Indexed: 11/25/2022]
Abstract
PURPOSE To describe recruitment, enrollment, and participation in a study of US radiologists invited to participate in a randomized controlled trial of two continuing medical education (CME) interventions designed to improve interpretation of screening mammography. METHODS We collected recruitment, consent, and intervention-completion information as part of a large study involving radiologists in California, Oregon, Washington, New Mexico, New Hampshire, North Carolina, and Vermont. Consenting radiologists were randomized to receive either a 1-day live, expert-led educational session; to receive a self-paced DVD with similar content; or to a control group (delayed intervention). The impact of the interventions was assessed using a preintervention-postintervention test set design. All activities were institutional review board approved and HIPAA compliant. RESULTS Of 403 eligible radiologists, 151 of 403 (37.5%) consented to participate in the trial and 119 of 151 (78.8%) completed the preintervention test set, leaving 119 available for randomization to one of the two intervention groups or to controls. Female radiologists were more likely than male radiologists to consent to and complete the study (P = .03). Consenting radiologists who completed all study activities were more likely to have been interpreting mammography for 10 years or less compared to radiologists who consented and did not complete all study activities or did not consent at all. The live intervention group was more likely to report their intent to change their clinical practice as a result of the intervention compared to those who received the DVD (50% versus 17.6%, P = .02). The majority of participants in both interventions groups felt the interventions were a useful way to receive CME mammography credits. CONCLUSIONS Community radiologists found interactive interventions designed to improve interpretative mammography performance acceptable and useful for clinical practice. This suggests CME credits for radiologists should, in part, be for examining practice skills.
Collapse
Affiliation(s)
- Patricia A Carney
- Departments of Family Medicine and Public Health and Preventive Medicine, Oregon Health & Science University, 3181 SW Sam Jackson Park Rd, Mail Code: FM, Portland, OR 97239.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
58
|
Chen L, Abbey CK, Nosratieh A, Lindfors KK, Boone JM. Anatomical complexity in breast parenchyma and its implications for optimal breast imaging strategies. Med Phys 2013; 39:1435-41. [PMID: 22380376 DOI: 10.1118/1.3685462] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
PURPOSE The purpose of this investigation was to assess the anatomical noise in breast images using a mathematically derived parameter β as a surrogate for detection performance, across the same patient cohort but in different imaging modalities including mammography, tomosynthesis, and breast CT. METHODS Women who were scheduled for breast biopsy were approached for participation in this IRB and HIPPA-compliant investigation. A total of 23 women had all views of each modality and represent the cohort studied in this investigation. Image data sets across all modalities were analyzed using 1000 regions of interest per image data set, and the anatomical noise power spectrum, NPS(a)(f), was computed and averaged for each breast image data set. After windowing the total noise power spectrum NPS(t)(f) to a specific frequency range corresponding to anatomical noise, the power-law slope (β) of the NPS(a)(f) was computed where NPS(a)(f) = α f(-) (β). RESULTS The value of β was determined for breast CT data sets, and they were 1.75 (0.424), 1.83 (0.352), and 1.79 (0.397), for the coronal, sagittal, and axial views, respectively. For tomosynthesis, β was 3.06 (0.361) and 3.10 (0.315) for the craniocaudal (CC) and medial lateral oblique (MLO) views, respectively. For mammography, these values were 3.17 (0.226) and 3.30 (0.236), for the CC and MLO views, respectively. The values of β for breast CT were significantly different than those for tomosynthesis and mammography (p < 0.001, all 12 comparisons). CONCLUSIONS Based on the parameter β which is thought to describe anatomical noise in breast images, breast CT was shown to have a statistically significant lower β than mammography or tomosynthesis. It has been suggested in the literature that a lower β may correspond to increased cancer detection performance; however, this has yet to be demonstrated unequivocally.
Collapse
Affiliation(s)
- Lin Chen
- Biomedical Engineering Graduate Group, University of California, Sacramento, CA 95817, USA
| | | | | | | | | |
Collapse
|
59
|
Abstract
As a result of recent successful studies on lung cancer screening, 12,000 deaths can be averted if screening is implemented. A variety of considerations make this important although it is not clear we are ready for a policy change on screening.
Collapse
|
60
|
Abstract
Although mammography is the only clinically accepted imaging modality for screening the general population to detect breast cancer, interpreting mammograms is difficult with lower sensitivity and specificity. To provide radiologists "a visual aid" in interpreting mammograms, we developed and tested an interactive system for computer-aided detection and diagnosis (CAD) of mass-like cancers. Using this system, an observer can view CAD-cued mass regions depicted on one image and then query any suspicious regions (either cued or not cued by CAD). CAD scheme automatically segments the suspicious region or accepts manually defined region and computes a set of image features. Using content-based image retrieval (CBIR) algorithm, CAD searches for a set of reference images depicting "abnormalities" similar to the queried region. Based on image retrieval results and a decision algorithm, a classification score is assigned to the queried region. In this study, a reference database with 1,800 malignant mass regions and 1,800 benign and CAD-generated false-positive regions was used. A modified CBIR algorithm with a new function of stretching the attributes in the multi-dimensional space and decision scheme was optimized using a genetic algorithm. Using a leave-one-out testing method to classify suspicious mass regions, we compared the classification performance using two CBIR algorithms with either equally weighted or optimally stretched attributes. Using the modified CBIR algorithm, the area under receiver operating characteristic curve was significantly increased from 0.865 ± 0.006 to 0.897 ± 0.005 (p < 0.001). This study demonstrated the feasibility of developing an interactive CAD system with a large reference database and achieving improved performance.
Collapse
|
61
|
Carney PA, Parikh J, Sickles EA, Feig SA, Monsees B, Bassett LW, Smith RA, Rosenberg R, Ichikawa L, Wallace J, Tran K, Miglioretti DL. Diagnostic mammography: identifying minimally acceptable interpretive performance criteria. Radiology 2013; 267:359-67. [PMID: 23297329 DOI: 10.1148/radiol.12121216] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
PURPOSE To develop criteria to identify thresholds for the minimally acceptable performance of physicians interpreting diagnostic mammography studies. MATERIALS AND METHODS In an institutional review board-approved HIPAA-compliant study, an Angoff approach was used to set criteria for identifying minimally acceptable interpretive performance for both workup after abnormal screening examinations and workup of a breast lump. Normative data from the Breast Cancer Surveillance Consortium (BCSC) was used to help the expert radiologist identify the impact of cut points. Simulations, also using data from the BCSC, were used to estimate the expected clinical impact from the recommended performance thresholds. RESULTS Final cut points for workup of abnormal screening examinations were as follows: sensitivity, less than 80%; specificity, less than 80% or greater than 95%; abnormal interpretation rate, less than 8% or greater than 25%; positive predictive value (PPV) of biopsy recommendation (PPV2), less than 15% or greater than 40%; PPV of biopsy performed (PPV3), less than 20% or greater than 45%; and cancer diagnosis rate, less than 20 per 1000 interpretations. Final cut points for workup of a breast lump were as follows: sensitivity, less than 85%; specificity, less than 83% or greater than 95%; abnormal interpretation rate, less than 10% or greater than 25%; PPV2, less than 25% or greater than 50%; PPV3, less than 30% or greater than 55%; and cancer diagnosis rate, less than 40 per 1000 interpretations. If underperforming physicians moved into the acceptable range after remedial training, the expected result would be (a) diagnosis of an additional 86 cancers per 100,000 women undergoing workup after screening examinations, with a reduction in the number of false-positive examinations by 1067 per 100,000 women undergoing this workup, and (b) diagnosis of an additional 335 cancers per 100,000 women undergoing workup of a breast lump, with a reduction in the number of false-positive examinations by 634 per 100,000 women undergoing this workup. CONCLUSION Interpreting physicians who fall outside one or more of the identified cut points should be reviewed in the context of an overall assessment of all their performance measures and their specific practice setting to determine if remedial training is indicated.
Collapse
Affiliation(s)
- Patricia A Carney
- Departments of Family Medicine and Public Health and Preventive Medicine, Oregon Health and Science University, 3181 SW Sam Jackson Park Rd, Portland, OR 97239-3098, USA.
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
62
|
Hofvind S, Geller BM, Skelly J, Vacek PM. Sensitivity and specificity of mammographic screening as practised in Vermont and Norway. Br J Radiol 2012; 85:e1226-32. [PMID: 22993383 PMCID: PMC3611728 DOI: 10.1259/bjr/15168178] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2011] [Revised: 03/13/2012] [Accepted: 04/16/2012] [Indexed: 11/05/2022] Open
Abstract
OBJECTIVE The aim of this study was to examine the sensitivity and specificity of screening mammography as performed in Vermont, USA, and Norway. METHODS Incident screening data from 1997 to 2003 for female patients aged 50-69 years from the Vermont Breast Cancer Surveillance System (116 996 subsequent screening examinations) and the Norwegian Breast Cancer Screening Program (360 872 subsequent screening examinations) were compared. Sensitivity and specificity estimates for the initial (based on screening mammogram only) and final (screening mammogram plus any further diagnostic imaging) interpretations were directly adjusted for age using 5-year age intervals for the combined Vermont and Norway population, and computed for 1 and 2 years of follow-up, which ended at the time of the next screening mammogram. RESULTS For the 1-year follow-up, sensitivities for initial assessments were 82.0%, 88.2% and 92.5% for 1-, 2- and >2-year screening intervals, respectively, in Vermont (p=0.022). For final assessments, the values were 73.6%, 83.3% and 81.2% (p=0.047), respectively. For Norway, sensitivities for initial assessments were 91.0% and 91.3% (p=0.529) for 2- and >2-year intervals, and 90.7% and 91.3%, respectively, for final assessments (p=0.630). Specificity was lower in Vermont than in Norway for each screening interval and for all screening intervals combined, for both initial (90.6% vs 97.8% for all intervals; p<0.001) and final (98.8% vs 99.5% for all intervals; p<0.001) assessments. CONCLUSION Our study showed higher sensitivity and specificity in a biennial screening programme with an independent double reading than in a predominantly annual screening program with a single reading. ADVANCES IN KNOWLEDGE This study demonstrates that higher recall rates and lower specificity are not always associated with higher sensitivity of screening mammography. Differences in the screening processes in Norway and Vermont suggest potential areas for improvement in the latter.
Collapse
Affiliation(s)
- S Hofvind
- Cancer Registry of Norway, Oslo, Norway.
| | | | | | | |
Collapse
|
63
|
Abstract
OBJECTIVE Interpretive accuracy varies among radiologists, especially in mammography. This study examines the relationship between radiologists' confidence in their assessments and their accuracy in interpreting mammograms. MATERIALS AND METHODS In this study, 119 community radiologists interpreted 109 expert-defined screening mammography examinations in test sets and rated their confidence in their assessment for each case. They also provided a global assessment of their ability to interpret mammograms. Positive predictive value (PPV) and negative predictive value (NPV) were modeled as functions of self-rated confidence on each examination using log-linear regression estimated with generalized estimating equations. Reference measures were cancer status and expert-defined need for recall. Effect modification by weekly mammography volume was examined. RESULTS Radiologists who self-reported higher global interpretive ability tended to interpret more mammograms per week (p = 0.08), were more likely to specialize (p = 0.02) and to have completed a fellowship in breast or women's imaging (p = 0.05), and had a higher PPV for cancer detection (p = 0.01). Examinations for which low-volume radiologists were "very confident" had a PPV of 2.93 times (95% CI, 2.01-4.27) higher than examinations they rated with neutral confidence. Trends of increasing NPVs with increasing confidence were significant for low-volume radiologists relative to noncancers (p = 0.01) and expert nonrecalls (p < 0.001). A trend of significantly increasing NPVs existed for high-volume radiologists relative to expert nonrecall (p = 0.02) but not relative to noncancer status (p = 0.32). CONCLUSION Confidence in mammography assessments was associated with better accuracy, especially for low-volume readers. Asking for a second opinion when confidence in an assessment is low may increase accuracy.
Collapse
|
64
|
Point: Generalism vs Subspecialization—The ACR Should Encourage Radiologists to Structure Their Practices Around a Model of Subspecialization. J Am Coll Radiol 2012; 9:535-6. [DOI: 10.1016/j.jacr.2012.04.004] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2012] [Accepted: 04/04/2012] [Indexed: 11/23/2022]
|
65
|
Zheng B, Sumkin JH, Zuley ML, Wang X, Klym AH, Gur D. Bilateral mammographic density asymmetry and breast cancer risk: a preliminary assessment. Eur J Radiol 2012; 81:3222-8. [PMID: 22579527 DOI: 10.1016/j.ejrad.2012.04.018] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2011] [Revised: 04/17/2012] [Accepted: 04/19/2012] [Indexed: 12/31/2022]
Abstract
To improve efficacy of breast cancer screening and prevention programs, it requires a risk assessment model with high discriminatory power. This study aimed to assess classification performance of using computed bilateral mammographic density asymmetry to predict risk of individual women developing breast cancer in near-term. The database includes 451 cases with multiple screening mammography examinations. The first (baseline) examinations of all case were interpreted negative. In the next sequential examinations, 187 cases developed cancer or surgically excised high-risk lesions, 155 remained negative (not-recalled), and 109 were recalled benign cases. From each of two bilateral cranio-caudal view images acquired from the baseline examination, we computed two features of average pixel value and local pixel value fluctuation. We then computed mean and difference of each feature computed from two images. When applying the computed features and other two risk factors (woman's age and subjectively rated mammographic density) to predict risk of cancer development, areas under receiver operating characteristic curves (AUC) were computed to evaluate the discriminatory/classification performance. The AUCs are 0.633±0.030, 0.535±0.036, 0.567±0.031, and 0.719±0.027 when using woman's age, subjectively rated, computed mean and asymmetry of mammographic density, to classify between two groups of cancer-verified and negative cases, respectively. When using an equal-weighted fusion method to combine woman's age and computed density asymmetry, AUC increased to 0.761±0.025 (p<0.05). The study demonstrated that bilateral mammographic density asymmetry could be a significantly stronger risk factor associated to the risk of women developing breast cancer in near-term than woman's age and assessed mean mammographic density.
Collapse
Affiliation(s)
- Bin Zheng
- Department of Radiology, University of Pittsburgh, Magee Womens Hospital, 3362 Fifth Ave, Pittsburgh, PA 15213, USA.
| | | | | | | | | | | |
Collapse
|
66
|
Facility characteristics do not explain higher false-positive rates in diagnostic mammography at facilities serving vulnerable women. Med Care 2012; 50:210-6. [PMID: 22186768 DOI: 10.1097/mlr.0b013e3182407c8a] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
BACKGROUND Facilities serving vulnerable women have higher false-positive rates for diagnostic mammography than facilities serving nonvulnerable women. False positives lead to anxiety, unnecessary biopsies, and higher costs. OBJECTIVE Examine whether availability of on-site breast ultrasound or biopsy services, academic medical center affiliation, or profit status explains differences in false-positive rates. DESIGN We examined 78,733 diagnostic mammograms performed to evaluate breast problems at Breast Cancer Surveillance Consortium facilities from 1999 to 2005. We used logistic-normal mixed effects regression to determine if adjusting for facility characteristics accounts for observed differences in false-positive rates. MEASURES Facilities were characterized as serving vulnerable women based on the proportion of mammograms performed on racial/ethnic minorities, women with lower educational attainment, limited household income, or rural residence. RESULTS Although the availability of on-site ultrasound and biopsy services was associated with greater odds of a false positive in most models [odds ratios (OR) ranging from 1.24 to 1.88; P<0.05], adjustment for these services did not attenuate the association between vulnerability and false-positive rates. Estimated ORs for the effect of vulnerability indexes on false-positive rates unadjusted for facility services were: lower educational attainment [OR 1.33; 95% confidence intervals (CI), 1.03-1.74]; racial/ethnic minority status (OR 1.33; 95% CI, 0.98-1.80); rural residence (OR 1.56; 95% CI, 1.26-1.92); limited household income (OR 1.38; 95% CI, 1.10-1.73). After adjustment, estimates remained relatively unchanged. CONCLUSIONS On-site diagnostic service availability may contribute to unnecessary biopsies, but does not explain the higher diagnostic mammography false-positive rates at facilities serving vulnerable women.
Collapse
|
67
|
Current world literature. Curr Opin Urol 2012; 22:254-62. [PMID: 22469752 DOI: 10.1097/mou.0b013e328352c3f8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
68
|
Jackson SL, Cook AJ, Miglioretti DL, Carney PA, Geller BM, Onega T, Rosenberg RD, Brenner RJ, Elmore JG. Are radiologists' goals for mammography accuracy consistent with published recommendations? Acad Radiol 2012; 19:289-95. [PMID: 22130089 DOI: 10.1016/j.acra.2011.10.013] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2011] [Revised: 10/03/2011] [Accepted: 10/07/2011] [Indexed: 10/14/2022]
Abstract
RATIONALE AND OBJECTIVES Mammography quality assurance programs have been in place for more than a decade. We studied radiologists' self-reported performance goals for accuracy in screening mammography and compared them to published recommendations. MATERIALS AND METHODS A mailed survey of radiologists at mammography registries in seven states within the Breast Cancer Surveillance Consortium (BCSC) assessed radiologists' performance goals for interpreting screening mammograms. Self-reported goals were compared to published American College of Radiology (ACR) recommended desirable ranges for recall rate, false-positive rate, positive predictive value of biopsy recommendation (PPV2), and cancer detection rate. Radiologists' goals for interpretive accuracy within desirable range were evaluated for associations with their demographic characteristics, clinical experience, and receipt of audit reports. RESULTS The survey response rate was 71% (257 of 364 radiologists). The percentage of radiologists reporting goals within desirable ranges was 79% for recall rate, 22% for false-positive rate, 39% for PPV2, and 61% for cancer detection rate. The range of reported goals was 0%-100% for false-positive rate and PPV2. Primary academic affiliation, receiving more hours of breast imaging continuing medical education, and receiving audit reports at least annually were associated with desirable PPV2 goals. Radiologists reporting desirable cancer detection rate goals were more likely to have interpreted mammograms for 10 or more years, and >1000 mammograms per year. CONCLUSION Many radiologists report goals for their accuracy when interpreting screening mammograms that fall outside of published desirable benchmarks, particularly for false-positive rate and PPV2, indicating an opportunity for education.
Collapse
|
69
|
Wang X, Li L, Xu W, Liu W, Lederman D, Zheng B. Improving the performance of computer-aided detection of subtle breast masses using an adaptive cueing method. Phys Med Biol 2012; 57:561-75. [PMID: 22218075 PMCID: PMC3310913 DOI: 10.1088/0031-9155/57/2/561] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
Current computer-aided detection (CAD) schemes for detecting mammographic masses have several limitations including high correlation with radiologists' detection and cueing most subtle masses only on one view. To increase CAD sensitivity in cueing more subtle masses that are likely missed and/or overlooked by radiologists without increasing false-positive rates, we investigated a new case-dependent cueing method by combining the original CAD-generated detection scores with a computed bilateral mammographic density asymmetry index. Using the new method, we adaptively raise the CAD-generated scores of the regions detected on 'high-risk' cases to cue more subtle mass regions and reduce the CAD scores of the regions detected on 'low-risk' cases to discard more false-positive regions. A testing dataset involving 78 positive and 338 negative cases was used to test this adaptive cueing method. Each positive case involves two sequential examinations in which the mass was detected in 'current' examination and missed in 'prior' examination but detected in a retrospective review by radiologists. Applying to this dataset, a pre-optimized CAD scheme yielded 75% case-based and 55% region-based sensitivity on 'current' examinations at a false-positive rate of 0.25 per image. CAD sensitivity was reduced to 42% (case based) and 27% (region based) on 'prior' examinations. Using the new cueing method, case-based and region-based sensitivity could maximally increase 9% and 33% on the 'prior' examinations, respectively. The percentages of the masses cued on two views also increased from 27% to 65%. The study demonstrated that using this adaptive cueing method enabled us to help CAD cue more subtle cancers without increasing the false-positive cueing rate.
Collapse
Affiliation(s)
- Xingwei Wang
- Department of Radiology, University of Pittsburgh, Pittsburgh, PA 15213, USA
| | | | | | | | | | | |
Collapse
|
70
|
Haneuse S, Buist DSM, Miglioretti DL, Anderson ML, Carney PA, Onega T, Geller BM, Kerlikowske K, Rosenberg RD, Yankaskas BC, Elmore JG, Taplin SH, Smith RA, Sickles EA. Mammographic interpretive volume and diagnostic mammogram interpretation performance in community practice. Radiology 2011; 262:69-79. [PMID: 22106351 DOI: 10.1148/radiol.11111026] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
PURPOSE To investigate the association between radiologist interpretive volume and diagnostic mammography performance in community-based settings. MATERIALS AND METHODS This study received institutional review board approval and was HIPAA compliant. A total of 117,136 diagnostic mammograms that were interpreted by 107 radiologists between 2002 and 2006 in the Breast Cancer Surveillance Consortium were included. Logistic regression analysis was used to estimate the adjusted effect on sensitivity and the rates of false-positive findings and cancer detection of four volume measures: annual diagnostic volume, screening volume, total volume, and diagnostic focus (percentage of total volume that is diagnostic). Analyses were stratified by the indication for imaging: additional imaging after screening mammography or evaluation of a breast concern or problem. RESULTS Diagnostic volume was associated with sensitivity; the odds of a true-positive finding rose until a diagnostic volume of 1000 mammograms was reached; thereafter, they either leveled off (P < .001 for additional imaging) or decreased (P = .049 for breast concerns or problems) with further volume increases. Diagnostic focus was associated with false-positive rate; the odds of a false-positive finding increased until a diagnostic focus of 20% was reached and decreased thereafter (P < .024 for additional imaging and P < .001 for breast concerns or problems with no self-reported lump). Neither total volume nor screening volume was consistently associated with diagnostic performance. CONCLUSION Interpretive volume and diagnostic performance have complex multifaceted relationships. Our results suggest that diagnostic interpretive volume is a key determinant in the development of thresholds for considering a diagnostic mammogram to be abnormal. Current volume regulations do not distinguish between screening and diagnostic mammography, and doing so would likely be challenging.
Collapse
Affiliation(s)
- Sebastien Haneuse
- Department of Biostatistics, Harvard School of Public Health, 677 Huntington Ave, Boston, MA 02115, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
71
|
Kerlikowske K, Hubbard RA, Miglioretti DL, Geller BM, Yankaskas BC, Lehman CD, Taplin SH, Sickles EA. Comparative effectiveness of digital versus film-screen mammography in community practice in the United States: a cohort study. Ann Intern Med 2011; 155:493-502. [PMID: 22007043 PMCID: PMC3726800 DOI: 10.7326/0003-4819-155-8-201110180-00005] [Citation(s) in RCA: 191] [Impact Index Per Article: 14.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
BACKGROUND Few studies have examined the comparative effectiveness of digital versus film-screen mammography in U.S. community practice. OBJECTIVE To determine whether the interpretive performance of digital and film-screen mammography differs. DESIGN Prospective cohort study. SETTING Mammography facilities in the Breast Cancer Surveillance Consortium. PARTICIPANTS 329,261 women aged 40 to 79 years underwent 869 286 mammograms (231 034 digital; 638 252 film-screen). MEASUREMENTS Invasive cancer or ductal carcinoma in situ diagnosed within 12 months of a digital or film-screen examination and calculation of mammography sensitivity, specificity, cancer detection rates, and tumor outcomes. RESULTS Overall, cancer detection rates and tumor characteristics were similar for digital and film-screen mammography, but the sensitivity and specificity of each modality varied by age, tumor characteristics, breast density, and menopausal status. Compared with film-screen mammography, the sensitivity of digital mammography was significantly higher for women aged 60 to 69 years (89.9% vs. 83.0%; P = 0.014) and those with estrogen receptor-negative cancer (78.5% vs. 65.8%; P = 0.016); borderline significantly higher for women aged 40 to 49 years (82.4% vs. 75.6%; P = 0.071), those with extremely dense breasts (83.6% vs. 68.1%; P = 0.051), and pre- or perimenopausal women (87.1% vs. 81.7%; P = 0.057); and borderline significantly lower for women aged 50 to 59 years (80.5% vs. 85.1%; P = 0.097). The specificity of digital and film-screen mammography was similar by decade of age, except for women aged 40 to 49 years (88.0% vs. 89.7%; P < 0.001). LIMITATION Statistical power for subgroup analyses was limited. CONCLUSION Overall, cancer detection with digital or film-screen mammography is similar in U.S. women aged 50 to 79 years undergoing screening mammography. Women aged 40 to 49 years are more likely to have extremely dense breasts and estrogen receptor-negative tumors; if they are offered mammography screening, they may choose to undergo digital mammography to optimize cancer detection. PRIMARY FUNDING SOURCE National Cancer Institute.
Collapse
|
72
|
Oberaigner W, Daniaux M, Geiger-Gritsch S, Knapp R, Siebert U, Buchberger W. Introduction of organised mammography screening in Tyrol: results following first year of complete rollout. BMC Public Health 2011; 11:673. [PMID: 21878122 PMCID: PMC3182932 DOI: 10.1186/1471-2458-11-673] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2011] [Accepted: 08/30/2011] [Indexed: 11/28/2022] Open
Abstract
BACKGROUND In Tyrol, Austria, the existing system of spontaneous mammography screening was switched in 2007 to an organised program by smoothly changing the established framework. This process followed most EU recommendations for organised mammography screening with the following exceptions: women aged 40-49 are part of the target population, screening is offered annually to the age group 40-59, breast ultrasound is available as an additional diagnostic tool, and double reading has not yet been implemented. After a pilot phase the program was rolled out to all of Tyrol in June 2008. The aim of this study was to analyse the performance of the organised screening system by comparing quality indices and recommended levels given in the well-established EU guidelines. METHODS Working from the results of the pilot phase, we extended the organised mammography system to all counties in Tyrol. All women living in Tyrol and covered by compulsory social insurance were invited for a mammography, in the age group 40-59 annually and in the age group 60-69 biennially. Screening mammography was offered mainly by radiologists in private practice, with further assessment performed at hospitals. Using the screening database, all well-established performance indicators were analysed and compared with accepted/desired levels as per the EU guidelines. RESULTS From June 2008 to May 2009, 120,440 women were invited. Per 1000 mammograms, 14 women were recalled for further assessment, nine underwent biopsy and four cancer cases were detected. Of invasive breast cancer cases, 32.3% and 68.4% were ≤ 10 mm and ≤ 15 mm in size, respectively, and 79.2% were node-negative. The positive predictive value for further assessment and for biopsy was 25.9% and 39.9%, respectively. Estimated two-year participation rate was 57.0%. In total, 14 interval cancer cases were detected during one year of follow-up; this is 18.4% of the background incidence rate. CONCLUSIONS In Tyrol, Austria, an organised mammography screening program was implemented in a smooth transition from an existing spontaneous screening system and was completely rolled out within a short time. The high level of performance already seen in the pilot phase was maintained after rollout, and improvements resulting from the pilot phase were affirmed after one year of complete rollout.
Collapse
Affiliation(s)
- Willi Oberaigner
- Department of Clinical Epidemiology of the Tyrolean State Hospitals Ltd., Cancer Registry of Tyrol, Innsbruck, Austria
- Institute of Public Health, Medical Decision Making and Health Technology Assessment, Department of Public Health and Health Technology Assessment, UMIT - University for Health Sciences, Medical Informatics and Technology, Hall i.T., Austria
- ONCOTYROL - Center for Personalized Cancer Medicine, Innsbruck, Austria
| | - Martin Daniaux
- Innsbruck Medical University, Department of Radiology, Innsbruck, Austria
| | - Sabine Geiger-Gritsch
- Department of Clinical Epidemiology of the Tyrolean State Hospitals Ltd., Cancer Registry of Tyrol, Innsbruck, Austria
| | - Rudolf Knapp
- Kufstein County Hospital, Department of Radiology, Kufstein, Austria
| | - Uwe Siebert
- Institute of Public Health, Medical Decision Making and Health Technology Assessment, Department of Public Health and Health Technology Assessment, UMIT - University for Health Sciences, Medical Informatics and Technology, Hall i.T., Austria
- ONCOTYROL - Center for Personalized Cancer Medicine, Innsbruck, Austria
- Center for Health Decision Science, Department of Health Policy and Management, Harvard School of Public Health, Boston, MA, USA
- Institute for Technology Assessment and Department of Radiology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
| | - Wolfgang Buchberger
- Institute of Public Health, Medical Decision Making and Health Technology Assessment, Department of Public Health and Health Technology Assessment, UMIT - University for Health Sciences, Medical Informatics and Technology, Hall i.T., Austria
- Tyrolean State Hospitals Ltd., Medical Department, Innsbruck, Austria
| |
Collapse
|