Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Dinnes J, Moss S, Melia J, Blanks R, Song F, Kleijnen J. Effectiveness and cost-effectiveness of double reading of mammograms in breast cancer screening: findings of a systematic review. Breast 2004;10:455-63. [PMID: 14965624 DOI: 10.1054/brst.2001.0350] [Citation(s) in RCA: 74] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2001] [Revised: 05/07/2001] [Accepted: 05/16/2001] [Indexed: 11/18/2022] Open

For:	Dinnes J, Moss S, Melia J, Blanks R, Song F, Kleijnen J. Effectiveness and cost-effectiveness of double reading of mammograms in breast cancer screening: findings of a systematic review. Breast 2004;10:455-63. [PMID: 14965624 DOI: 10.1054/brst.2001.0350] [Citation(s) in RCA: 74] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2001] [Revised: 05/07/2001] [Accepted: 05/16/2001] [Indexed: 11/18/2022] Open

Number

Cited by Other Article(s)

Condon JJJ, Trinh V, Hall KA, Reintals M, Holmes AS, Oakden-Rayner L, Palmer LJ. Impact of Transfer Learning Using Local Data on Performance of a Deep Learning Model for Screening Mammography. Radiol Artif Intell 2024;6:e230383. [PMID: 38717291 PMCID: PMC11294949 DOI: 10.1148/ryai.230383] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2023] [Revised: 03/25/2024] [Accepted: 04/24/2024] [Indexed: 06/21/2024]

Abstract

Purpose To investigate the issues of generalizability and replication of deep learning models by assessing performance of a screening mammography deep learning system developed at New York University (NYU) on a local Australian dataset. Materials and Methods In this retrospective study, all individuals with biopsy or surgical pathology-proven lesions and age-matched controls were identified from a South Australian public mammography screening program (January 2010 to December 2016). The primary outcome was deep learning system performance-measured with area under the receiver operating characteristic curve (AUC)-in classifying invasive breast cancer or ductal carcinoma in situ (n = 425) versus no malignancy (n = 490) or benign lesions (n = 44). The NYU system, including models without (NYU1) and with (NYU2) heatmaps, was tested in its original form, after training from scratch (without transfer learning), and after retraining with transfer learning. Results The local test set comprised 959 individuals (mean age, 62.5 years ± 8.5 [SD]; all female). The original AUCs for the NYU1 and NYU2 models were 0.83 (95% CI: 0.82, 0.84) and 0.89 (95% CI: 0.88, 0.89), respectively. When NYU1 and NYU2 were applied in their original form to the local test set, the AUCs were 0.76 (95% CI: 0.73, 0.79) and 0.84 (95% CI: 0.82, 0.87), respectively. After local training without transfer learning, the AUCs were 0.66 (95% CI: 0.62, 0.69) and 0.86 (95% CI: 0.84, 0.88). After retraining with transfer learning, the AUCs were 0.82 (95% CI: 0.80, 0.85) and 0.86 (95% CI: 0.84, 0.88). Conclusion A deep learning system developed using a U.S. dataset showed reduced performance when applied "out of the box" to an Australian dataset. Local retraining with transfer learning using available model weights improved model performance. Keywords: Screening Mammography, Convolutional Neural Network (CNN), Deep Learning Algorithms, Breast Cancer Supplemental material is available for this article. © RSNA, 2024 See also commentary by Cadrin-Chênevert in this issue.

Collapse

Affiliation(s)

James J. J. Condon From the Australian Institute for Machine Learning (J.J.J.C., V.T., L.O.R., L.J.P.) and School of Public Health (J.J.J.C., V.T., K.A.H., L.O.R., L.J.P.), University of Adelaide, N Terrace, Adelaide, South Australia 5005, Australia; and BreastScreen SA, Adelaide, South Australia, Australia (M.R., A.S.H.)
Vincent Trinh From the Australian Institute for Machine Learning (J.J.J.C., V.T., L.O.R., L.J.P.) and School of Public Health (J.J.J.C., V.T., K.A.H., L.O.R., L.J.P.), University of Adelaide, N Terrace, Adelaide, South Australia 5005, Australia; and BreastScreen SA, Adelaide, South Australia, Australia (M.R., A.S.H.)
Kelly A. Hall From the Australian Institute for Machine Learning (J.J.J.C., V.T., L.O.R., L.J.P.) and School of Public Health (J.J.J.C., V.T., K.A.H., L.O.R., L.J.P.), University of Adelaide, N Terrace, Adelaide, South Australia 5005, Australia; and BreastScreen SA, Adelaide, South Australia, Australia (M.R., A.S.H.)
Michelle Reintals From the Australian Institute for Machine Learning (J.J.J.C., V.T., L.O.R., L.J.P.) and School of Public Health (J.J.J.C., V.T., K.A.H., L.O.R., L.J.P.), University of Adelaide, N Terrace, Adelaide, South Australia 5005, Australia; and BreastScreen SA, Adelaide, South Australia, Australia (M.R., A.S.H.)
Andrew S. Holmes From the Australian Institute for Machine Learning (J.J.J.C., V.T., L.O.R., L.J.P.) and School of Public Health (J.J.J.C., V.T., K.A.H., L.O.R., L.J.P.), University of Adelaide, N Terrace, Adelaide, South Australia 5005, Australia; and BreastScreen SA, Adelaide, South Australia, Australia (M.R., A.S.H.)
Lauren Oakden-Rayner From the Australian Institute for Machine Learning (J.J.J.C., V.T., L.O.R., L.J.P.) and School of Public Health (J.J.J.C., V.T., K.A.H., L.O.R., L.J.P.), University of Adelaide, N Terrace, Adelaide, South Australia 5005, Australia; and BreastScreen SA, Adelaide, South Australia, Australia (M.R., A.S.H.)
Lyle J. Palmer From the Australian Institute for Machine Learning (J.J.J.C., V.T., L.O.R., L.J.P.) and School of Public Health (J.J.J.C., V.T., K.A.H., L.O.R., L.J.P.), University of Adelaide, N Terrace, Adelaide, South Australia 5005, Australia; and BreastScreen SA, Adelaide, South Australia, Australia (M.R., A.S.H.)

Collapse

Kim H, Choi JS, Kim K, Ko ES, Ko EY, Han BK. Effect of artificial intelligence-based computer-aided diagnosis on the screening outcomes of digital mammography: a matched cohort study. Eur Radiol 2023;33:7186-7198. [PMID: 37188881 DOI: 10.1007/s00330-023-09692-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2022] [Revised: 02/21/2023] [Accepted: 03/09/2023] [Indexed: 05/17/2023]

Abstract

OBJECTIVE

To investigate whether artificial intelligence-based computer-aided diagnosis (AI-CAD) can improve radiologists' performance when used to support radiologists' interpretation of digital mammography (DM) in breast cancer screening.

METHODS

A retrospective database search identified 3158 asymptomatic Korean women who consecutively underwent screening DM between January and December 2019 without AI-CAD support, and screening DM between February and July 2020 with image interpretation aided by AI-CAD in a tertiary referral hospital using single reading. Propensity score matching was used to match the DM with AI-CAD group in a 1:1 ratio with the DM without AI-CAD group according to age, breast density, experience level of the interpreting radiologist, and screening round. Performance measures were compared with the McNemar test and generalized estimating equations.

RESULTS

A total of 1579 women who underwent DM with AI-CAD were matched with 1579 women who underwent DM without AI-CAD. Radiologists showed higher specificity (96% [1500 of 1563] vs 91.6% [1430 of 1561]; p < 0.001) and lower abnormal interpretation rates (AIR) (4.9% [77 of 1579] vs 9.2% [145 of 1579]; p < 0.001) with AI-CAD than without. There was no significant difference in the cancer detection rate (CDR) (AI-CAD vs no AI-CAD, 8.9 vs 8.9 per 1000 examinations; p = 0.999), sensitivity (87.5% vs 77.8%; p = 0.999), and positive predictive value for biopsy (PPV3) (35.0% vs 35.0%; p = 0.999) according to AI-CAD support.

CONCLUSIONS

AI-CAD increases the specificity for radiologists without decreasing sensitivity as a supportive tool in the single reading of DM for breast cancer screening.

CLINICAL RELEVANCE STATEMENT

This study shows that AI-CAD could improve the specificity of radiologists' DM interpretation in the single reading system without decreasing sensitivity, suggesting that it can benefit patients by reducing false positive and recall rates.

KEY POINTS

• In this retrospective-matched cohort study (DM without AI-CAD vs DM with AI-CAD), radiologists showed higher specificity and lower AIR when AI-CAD was used to support decision-making in DM screening. • CDR, sensitivity, and PPV for biopsy did not differ with and without AI-CAD support.

Collapse

Smith JT. It's not about the errors, it's about the learning: How the Royal College of Radiologists has developed a Radiology Events and Learning process in the United Kingdom. J Med Imaging Radiat Oncol 2022;66:185-192. [PMID: 35243780 DOI: 10.1111/1754-9485.13355] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2021] [Accepted: 11/03/2021] [Indexed: 11/30/2022]

Deep-Learning-Based Computer-Aided Systems for Breast Cancer Imaging: A Critical Review. APPLIED SCIENCES-BASEL 2020. [DOI: 10.3390/app10228298] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]

Biennial screening mammography: How many women ask for more? Estimate of the interval mammogram rate in an organised population-based screening programme. LA RADIOLOGIA MEDICA 2020;126:200-205. [PMID: 32577906 DOI: 10.1007/s11547-020-01238-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/17/2019] [Accepted: 06/07/2020] [Indexed: 11/27/2022]

Ramadan SZ. Methods Used in Computer-Aided Diagnosis for Breast Cancer Detection Using Mammograms: A Review. JOURNAL OF HEALTHCARE ENGINEERING 2020;2020:9162464. [PMID: 32300474 PMCID: PMC7091549 DOI: 10.1155/2020/9162464] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/10/2019] [Revised: 12/25/2019] [Accepted: 02/13/2020] [Indexed: 12/28/2022]

Mandrik O, Ekwunife OI, Meheus F, Severens JL(H, Lhachimi S, Uyl‐de Groot CA, Murillo R. Systematic reviews as a "lens of evidence": Determinants of cost-effectiveness of breast cancer screening. Cancer Med 2019;8:7846-7858. [PMID: 31568702 PMCID: PMC6912065 DOI: 10.1002/cam4.2498] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2019] [Revised: 07/25/2019] [Accepted: 07/25/2019] [Indexed: 12/30/2022] Open

Mandrik O, Zielonke N, Meheus F, Severens J(H, Guha N, Herrero Acosta R, Murillo R. Systematic reviews as a 'lens of evidence': Determinants of benefits and harms of breast cancer screening. Int J Cancer 2019;145:994-1006. [PMID: 30762235 PMCID: PMC6619055 DOI: 10.1002/ijc.32211] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2018] [Revised: 01/17/2019] [Accepted: 01/29/2019] [Indexed: 12/14/2022]

Hoff SR, Myklebust TÅ, Lee CI, Hofvind S. Influence of Mammography Volume on Radiologists’ Performance: Results from BreastScreen Norway. Radiology 2019;292:289-296. [DOI: 10.1148/radiol.2019182684] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

Ventura-Alfaro CE. [Measurements errors in screening mammogram interpretation by radiologists]. ACTA ACUST UNITED AC 2019;20:518-522. [PMID: 30843990 DOI: 10.15446/rsap.v20n4.52035] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2015] [Accepted: 02/12/2018] [Indexed: 11/09/2022]

Wender RC, Brawley OW, Fedewa SA, Gansler T, Smith RA. A blueprint for cancer screening and early detection: Advancing screening's contribution to cancer control. CA Cancer J Clin 2019;69:50-79. [PMID: 30452086 DOI: 10.3322/caac.21550] [Citation(s) in RCA: 67] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open

Katzen J, Dodelzon K. A review of computer aided detection in mammography. Clin Imaging 2018;52:305-309. [PMID: 30216858 DOI: 10.1016/j.clinimag.2018.08.014] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2018] [Revised: 08/13/2018] [Accepted: 08/16/2018] [Indexed: 01/23/2023]

Gandomkar Z, Tay K, Brennan PC, Kozuch E, Mello-Thoms C. Can eye-tracking metrics be used to better pair radiologists in a mammogram reading task? Med Phys 2018;45:4844-4856. [PMID: 30168153 DOI: 10.1002/mp.13161] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2018] [Revised: 08/06/2018] [Accepted: 08/10/2018] [Indexed: 11/08/2022] Open

Abstract

PURPOSE

To propose a framework for optimal pairing of radiologists when reading mammograms based on their search patterns.

MATERIALS AND METHODS

Four experienced and four less-experienced radiologists were asked to assess 120 cases (59 with cancers) while their eye positions were tracked. Fourteen eye-tracking metrics were extracted to quantify the differences among radiologists' visual search pattern. For each radiologist and metric, less-experienced radiologists and expert readers were ranked based on the level of similarities in gaze patterns (from the most different to the most similar). Less-experienced readers and experts were also ranked based on the values of area under the receiver operating characteristic curve (AUC) after pairing (the best possible way of ranking). Using the Kendall's tau distance, rankings based on different metrics were compared with the best possible ranking. Using paired Wilcoxon signed-rank test, the AUC values when pairing in the best way were compared with pairing based on different metrics. Finally, we investigated the robustness of pairing strategies against the small sample size.

RESULTS

For ranking the experienced radiologists, results from eight metrics were as good as the best possible ranking. For the less-experienced ones, only one metric resulted in a ranking comparable to the best possible way of ranking. The AUC values of pairings based on these metrics did not differ significantly from the best pairing scenario. Compared to the pairings based on the cognitive metrics, the ranking based on AUC values varied more greatly with the sample size, suggesting that it is less robust against the small sample size compared to the cognitive metrics.

CONCLUSION

Different pairings may have different effects on performance; some are detrimental while some improve the performance of the pair. Using the suggested cognitive metrics, we can optimize the pairings even with a small dataset.

Collapse

Upadhyay N, Soneji N, Stewart V, Ralleigh G. The effect of the addition of tomosynthesis to digital mammography on reader recall rate and reader confidence in the UK prevalent screening round. Clin Radiol 2018;73:744-749. [DOI: 10.1016/j.crad.2018.03.013] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2017] [Accepted: 03/28/2018] [Indexed: 10/17/2022]

Kurvers RHJM, de Zoete A, Bachman SL, Algra PR, Ostelo R. Combining independent decisions increases diagnostic accuracy of reading lumbosacral radiographs and magnetic resonance imaging. PLoS One 2018;13:e0194128. [PMID: 29614070 PMCID: PMC5882099 DOI: 10.1371/journal.pone.0194128] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2017] [Accepted: 02/26/2018] [Indexed: 11/18/2022] Open

Arenson RL. Factors Affecting Interpretative Accuracy: How Can We Reduce Errors? Radiology 2018;287:213-214. [DOI: 10.1148/radiol.2017172781] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

Combined screening with mammography and ultrasound in a population-based screening program. Eur J Radiol 2018;101:24-29. [DOI: 10.1016/j.ejrad.2018.01.022] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2017] [Revised: 01/20/2018] [Accepted: 01/22/2018] [Indexed: 11/20/2022]

Mullen LA, Panigrahi B, Hollada J, Panigrahi B, Falomo ET, Harvey SC. Strategies for Decreasing Screening Mammography Recall Rates While Maintaining Performance Metrics. Acad Radiol 2017;24:1556-1560. [PMID: 28760363 DOI: 10.1016/j.acra.2017.06.009] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2017] [Revised: 04/14/2017] [Accepted: 06/08/2017] [Indexed: 01/09/2023]

Trainees May Add Value to Patient Care by Decreasing Addendum Utilization in Radiology Reports. AJR Am J Roentgenol 2017;209:976-981. [DOI: 10.2214/ajr.17.18339] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

Posso M, Puig T, Carles M, Rué M, Canelo-Aybar C, Bonfill X. Effectiveness and cost-effectiveness of double reading in digital mammography screening: A systematic review and meta-analysis. Eur J Radiol 2017;96:40-49. [PMID: 29103474 DOI: 10.1016/j.ejrad.2017.09.013] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2017] [Revised: 09/14/2017] [Accepted: 09/19/2017] [Indexed: 10/18/2022]

Abstract

PURPOSE

Double reading is the strategy of choice for mammogram interpretation in screening programmes. It remains, however, unknown whether double reading is still the strategy of choice in the context of digital mammography. Our aim was to determine the effectiveness and cost-effectiveness of double reading versus single reading of digital mammograms in screening programmes.

METHODS

We performed a systematic review by searching the PubMed, Embase, and Cochrane Library databases up to April 2017. We used the QUADAS-2 (Quality Assessment of Diagnostic Accuracy Studies) tool and CHEERS (Consolidated Health Economic Evaluation Reporting Standards) checklist to assess the methodological quality of the diagnostic studies and economic evaluations, respectively. A proportion's meta-analysis approach, 95% Confidence Intervals (95% CI) and test of heterogeneity (P values) were used for pooled results. Costs are expressed US$ PPP (United States Dollar purchasing power parities). The PROSPERO ID of this Systematic Review's protocol is CRD42014013804.

RESULTS

Of 1473 potentially relevant hits, four high-quality studies were included. The pooled cancer detection rate of double reading was 6.01 per 1000 screens (CI: 4.47‰-7.77‰), and it was 5.65 per 1000 screens (CI: 3.95‰-7.65‰) for single reading (P=0.76). The pooled proportion of false-positives of double reading was 47.03 per 1000 screens (CI: 39.13‰-55.62‰) and it was 40.60 per 1000 screens (CI: 38.58‰-42.67‰) for single reading (P=0.12). One study reported, for double reading, an ICER (Incremental Cost-Effectiveness Ratio) of 16,684 Euros (24,717 US$ PPP; 2015 value) per detected cancer. Single reading+CAD (computer-aided-detection) was cost-effective in Japan.

CONCLUSION

The evidence of benefit for double reading compared to single reading for digital mammography interpretation is scarce. Double reading seems to increase operational costs, have a not significantly higher false-positive rate, and a similar cancer detection rate.

Collapse

Review of the evidence on the use of arbitration or consensus within breast screening: A systematic scoping review. Radiography (Lond) 2017;23:171-176. [DOI: 10.1016/j.radi.2017.01.002] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2016] [Revised: 12/29/2016] [Accepted: 01/05/2017] [Indexed: 11/23/2022]

Jalalian A, Mashohor S, Mahmud R, Karasfi B, Saripan MIB, Ramli ARB. Foundation and methodologies in computer-aided diagnosis systems for breast cancer detection. EXCLI JOURNAL 2017;16:113-137. [PMID: 28435432 PMCID: PMC5379115 DOI: 10.17179/excli2016-701] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/12/2016] [Accepted: 01/05/2017] [Indexed: 12/15/2022]

Posso M, Carles M, Rué M, Puig T, Bonfill X. Cost-Effectiveness of Double Reading versus Single Reading of Mammograms in a Breast Cancer Screening Programme. PLoS One 2016;11:e0159806. [PMID: 27459663 PMCID: PMC4961365 DOI: 10.1371/journal.pone.0159806] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2016] [Accepted: 07/10/2016] [Indexed: 11/18/2022] Open

Abstract

OBJECTIVES

The usual practice in breast cancer screening programmes for mammogram interpretation is to perform double reading. However, little is known about its cost-effectiveness in the context of digital mammography. Our purpose was to evaluate the cost-effectiveness of double reading versus single reading of digital mammograms in a population-based breast cancer screening programme.

METHODS

Data from 28,636 screened women was used to establish a decision-tree model and to compare three strategies: 1) double reading; 2) double reading for women in their first participation and single reading for women in their subsequent participations; and 3) single reading. We calculated the incremental cost-effectiveness ratio (ICER), which was defined as the expected cost per one additionally detected cancer. We performed a deterministic sensitivity analysis to test the robustness of the ICER.

RESULTS

The detection rate of double reading (5.17‰) was similar to that of single reading (4.78‰; P = .768). The mean cost of each detected cancer was €8,912 for double reading and €8,287 for single reading. The ICER of double reading versus single reading was €16,684. The sensitivity analysis showed variations in the ICER according to the sensitivity of reading strategies. The strategy that combines double reading in first participation with single reading in subsequent participations was ruled out due to extended dominance.

CONCLUSIONS

From our results, double reading appears not to be a cost-effective strategy in the context of digital mammography. Double reading would eventually be challenged in screening programmes, as single reading might entail important net savings without significantly changing the cancer detection rate. These results are not conclusive and should be confirmed in prospective studies that investigate long-term outcomes like quality adjusted life years (QALYs).

Collapse

Elmore JG, Tosteson AN, Pepe MS, Longton GM, Nelson HD, Geller B, Carney PA, Onega T, Allison KH, Jackson SL, Weaver DL. Evaluation of 12 strategies for obtaining second opinions to improve interpretation of breast histopathology: simulation study. BMJ 2016;353:i3069. [PMID: 27334105 PMCID: PMC4916777 DOI: 10.1136/bmj.i3069] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]

Abstract

OBJECTIVE

To evaluate the potential effect of second opinions on improving the accuracy of diagnostic interpretation of breast histopathology.

DESIGN

Simulation study.

SETTING

12 different strategies for acquiring independent second opinions.

PARTICIPANTS

Interpretations of 240 breast biopsy specimens by 115 pathologists, one slide for each case, compared with reference diagnoses derived by expert consensus.

MAIN OUTCOME MEASURES

Misclassification rates for individual pathologists and for 12 simulated strategies for second opinions. Simulations compared accuracy of diagnoses from single pathologists with that of diagnoses based on pairing interpretations from first and second independent pathologists, where resolution of disagreements was by an independent third pathologist. 12 strategies were evaluated in which acquisition of second opinions depended on initial diagnoses, assessment of case difficulty or borderline characteristics, pathologists' clinical volumes, or whether a second opinion was required by policy or desired by the pathologists. The 240 cases included benign without atypia (10% non-proliferative, 20% proliferative without atypia), atypia (30%), ductal carcinoma in situ (DCIS, 30%), and invasive cancer (10%). Overall misclassification rates and agreement statistics depended on the composition of the test set, which included a higher prevalence of difficult cases than in typical practice.

RESULTS

Misclassification rates significantly decreased (P<0.001) with all second opinion strategies except for the strategy limiting second opinions only to cases of invasive cancer. The overall misclassification rate decreased from 24.7% to 18.1% when all cases received second opinions (P<0.001). Obtaining both first and second opinions from pathologists with a high volume (≥10 breast biopsy specimens weekly) resulted in the lowest misclassification rate in this test set (14.3%, 95% confidence interval 10.9% to 18.0%). Obtaining second opinions only for cases with initial interpretations of atypia, DCIS, or invasive cancer decreased the over-interpretation of benign cases without atypia from 12.9% to 6.0%. Atypia cases had the highest misclassification rate after single interpretation (52.2%), remaining at more than 34% in all second opinion scenarios.

CONCLUSION

Second opinions can statistically significantly improve diagnostic agreement for pathologists' interpretations of breast biopsy specimens; however, variability in diagnosis will not be completely eliminated, especially for breast specimens with atypia.

Collapse

Damases CN, Mello-Thoms C, McEntee MF. Inter-observer variability in mammographic density assessment using Royal Australian and New Zealand College of Radiologists (RANZCR) synoptic scales. J Med Imaging Radiat Oncol 2016;60:329-36. [PMID: 27059785 DOI: 10.1111/1754-9485.12451] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2015] [Accepted: 02/26/2016] [Indexed: 11/30/2022]

Pow RE, Mello-Thoms C, Brennan P. Evaluation of the effect of double reporting on test accuracy in screening and diagnostic imaging studies: A review of the evidence. J Med Imaging Radiat Oncol 2016;60:306-14. [DOI: 10.1111/1754-9485.12450] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2015] [Accepted: 02/26/2016] [Indexed: 12/01/2022]

Posso MC, Puig T, Quintana MJ, Solà-Roca J, Bonfill X. Double versus single reading of mammograms in a breast cancer screening programme: a cost-consequence analysis. Eur Radiol 2016;26:3262-71. [PMID: 26747264 DOI: 10.1007/s00330-015-4175-4] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2015] [Revised: 11/30/2015] [Accepted: 12/15/2015] [Indexed: 12/29/2022]

Li J, Shao Z. Mammography screening in less developed countries. SPRINGERPLUS 2015;4:615. [PMID: 26543750 PMCID: PMC4627993 DOI: 10.1186/s40064-015-1394-8] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/02/2015] [Accepted: 10/05/2015] [Indexed: 12/29/2022]

Wolf M, Krause J, Carney PA, Bogart A, Kurvers RHJM. Collective intelligence meets medical decision-making: the collective outperforms the best radiologist. PLoS One 2015;10:e0134269. [PMID: 26267331 PMCID: PMC4534443 DOI: 10.1371/journal.pone.0134269] [Citation(s) in RCA: 71] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2015] [Accepted: 07/07/2015] [Indexed: 11/24/2022] Open

Dhahbi S, Barhoumi W, Zagrouba E. Breast cancer diagnosis in digitized mammograms using curvelet moments. Comput Biol Med 2015;64:79-90. [PMID: 26151831 DOI: 10.1016/j.compbiomed.2015.06.012] [Citation(s) in RCA: 76] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2015] [Revised: 06/16/2015] [Accepted: 06/17/2015] [Indexed: 11/26/2022]

Abstract

BACKGROUND

Feature extraction is a key issue in designing a computer aided diagnosis system. Recent researches on breast cancer diagnosis have reported the effectiveness of multiscale transforms (wavelets and curvelets) for mammogram analysis and have shown the superiority of curvelet transform. However, the curse of dimensionality problem arises when using the curvelet coefficients and therefore a reduction method is required to extract a reduced set of discriminative features.

METHODS

This paper deals with this problem and proposes a feature extraction method based on curvelet transform and moment theory for mammogram description. First, we performed discrete curvelet transform and we computed the four first-order moments from curvelet coefficients distribution. Hence, two feature sets can be obtained: moments from each band and moments from each level. In this work, both sets are studied. Then, the t-test ranking technique was applied to select the best features from each set. Finally, a k-nearest neighbor classifier was used to distinguish between normal and abnormal breast tissues and to classify tumors as malignant or benign. Experiments were performed on 252 mammograms from the Mammographic Image Analysis Society (mini-MIAS) database using the leave-one-out cross validation as well as on 11553 mammograms from the Digital Database for Screening Mammography (DDSM) database using 2×5-fold cross validation.

RESULTS

Experimental results prove the effectiveness and the superiority of curvelet moments for mammogram analysis. Indeed, results on the mini-MIAS database show that curvelet moments yield an accuracy of 91.27% (resp. 81.35 %) with 10 (resp. 8) features for abnormality (resp. malignancy) detection. In addition, empirical comparisons of the proposed method against state-of-the-art curvelet-based methods on the DDSM database show that the suggested method does not only lead to a more reduced feature set, but it also statistically outperforms all the compared methods in terms of accuracy.

CONCLUSIONS

In summary, curvelet moments are an efficient and effective way to extract a reduced set of discriminative features for breast cancer diagnosis.

Collapse

Quality Management in Musculoskeletal Imaging: Form, Content, and Diagnosis of Knee MRI Reports and Effectiveness of Three Different Quality Improvement Measures. AJR Am J Roentgenol 2015;204:1069-74. [DOI: 10.2214/ajr.14.13216] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

Lauritzen PM, Hurlen P, Sandbæk G, Gulbrandsen P. Double reading rates and quality assurance practices in Norwegian hospital radiology departments: two parallel national surveys. Acta Radiol 2015;56:78-86. [PMID: 24425793 DOI: 10.1177/0284185113519988] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Abstract

BACKGROUND

Double reading as a quality assurance (QA) tool is employed extensively in Norwegian hospital radiology departments. The practice is resource consuming and regularly debated.

PURPOSE

To investigate the rates of double reading in Norwegian hospital radiology departments, to identify department characteristics associated with double reading rates, and to investigate associations between double reading and other quality improvement.

MATERIAL AND METHODS

We issued two parallel national surveys to management and to consultant radiologists, respectively. Management was defined as the chief medical officer and/or the head of the radiology department. The management survey covered staffing, perceived resource situation, double reading, guidelines, and quality improvement. The radiologist survey served to validate management responses concerning double reading. Management survey items concerning practices of quality improvement were organized into three indices reflecting different quality approaches, namely: appropriateness of investigations; personal performance feedback; and system performance feedback.

RESULTS

The response rates of the surveys were 100% (45/45) for management and 55% (266/483) for radiologists. Of all exams read by consultants, 33% were double read. The double reading rate was highest in university hospital departments (59%), intermediate in other teaching departments (30%), and lowest in non-teaching departments (11%) (P = 0.01). Among the quality indices, mean scores were highest on appropriateness index (68%), intermediate on the person index (56%), and lowest on system index (37%). There were no correlations between double reading rates and scores on any of the quality indices.

CONCLUSION

The rate of double reading in Norwegian hospital radiology is significantly correlated to department teaching status, but not to other practices of quality work.

Collapse

Geller BM, Nelson HD, Carney PA, Weaver DL, Onega T, Allison KH, Frederick PD, Tosteson ANA, Elmore JG. Second opinion in breast pathology: policy, practice and perception. J Clin Pathol 2014;67:955-60. [PMID: 25053542 DOI: 10.1136/jclinpath-2014-202290] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]

Cascio D, Fauci F, Iacomi M, Raso G, Magro R, Castrogiovanni D, Filosto G, Ienzi R, Vasile MS. Computer-aided diagnosis in digital mammography: comparison of two commercial systems. ACTA ACUST UNITED AC 2014. [DOI: 10.2217/iim.13.68] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]

MDCT in the assessment of laryngeal trauma: value of 2D multiplanar and 3D reconstructions. AJR Am J Roentgenol 2013;201:W639-47. [PMID: 24059404 DOI: 10.2214/ajr.12.9813] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]

Abstract

OBJECTIVE

The purpose of this study was to analyze fracture patterns and related effects of laryngeal trauma and to assess the value of 2D multiplanar reformation (MPR) and 3D reconstruction.

MATERIALS AND METHODS

Among 4222 consecutively registered trauma patients who underwent emergency MDCT, 38 patients had presented with laryngeal trauma. Axial, 2D MPR, 3D volume-rendered, and virtual endoscopic images were analyzed retrospectively by two blinded observers according to predefined criteria. Laryngeal fractures, soft-tissue injuries, and airway compromise were evaluated and correlated with clinical, endoscopic, surgical, and follow-up findings.

RESULTS

Fifty-nine fractures (37 thyroid, 13 cricoid, nine arytenoid) were present in 38 patients. They were isolated in 21 (55%) patients. The other 17 (45%) patients had additional injuries to the neck, face, brain, chest, or abdomen. Laryngeal fractures were bilateral in 31 (82%) patients and were associated with hyoid bone fractures in nine (24%) patients. Arytenoid luxation was present in eight cartilages. Axial imaging missed 7 of 59 (12%) laryngeal fractures, six of eight (75%) arytenoid luxations, and four of nine (44%) hyoid bone fractures. Additional 2D MPR imaging missed 5 of 59 (8%) laryngeal fractures, five of eight (62.5%) arytenoid luxations, and two of nine (22%) hyoid bone fractures, whereas 3D volume-rendered images depicted them all. Virtual endoscopy and 3D volume rendering added diagnostic accuracy with respect to the length, width, shape, and spatial orientation of fractures in 22 of 38 (58%) patients; arytenoid luxation in six of eight (75%) luxations; and the evaluation of airway narrowing in 19 of 38 (50%) patients. Three-dimensional volume rendering was not of additional value in evaluation of the cricoid cartilage.

CONCLUSION

The use of 2D MPR and 3D volume rendering with or without virtual endoscopy improved assessment of thyroid and hyoid bone fractures, arytenoid luxations, and laryngotracheal narrowing, providing helpful data for optimal management.

Collapse

Moradi M, Ganji K, Teyfouri N, Kolahdoozan F. Performance of double reading mammography in an Iranian population and its effect on patient outcome. IRANIAN JOURNAL OF RADIOLOGY 2013;10:51-5. [PMID: 24046778 PMCID: PMC3767012 DOI: 10.5812/iranjradiol.11729] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 10/29/2011] [Revised: 09/29/2012] [Accepted: 10/09/2012] [Indexed: 11/27/2022]

Abstract

Background

Considering the importance and responsibility of reporting mammography and the necessity to notice details with a high degree of precision, double reading mammography has been introduced and recommended.

Objectives

This study aimed to assess the performance of double reading of mammograms and its effect on patient outcomes.

Patients and Methods

Throughout this cross sectional study, 1284 digitized mammographic views of 642 breasts which belonged to 339 women (of which 303 were bilateral and 36 were unilateral mammographies) were enrolled. Two independent radiologists interpreted these mammograms and BI-RADS categories of both reports were compared. Discordant results were determined and assumed significant if they were in the positive (BI-RADS 0, 4, 5) versus negative (BI-RADS 1, 2, 3) groups and then significant discordant cases were followed up to determine benign versus malignant final diagnosis. The recall rate was calculated for each reader. Inter-observer agreement in breast density was determined by Kappa test.

Results

Readers had consensus on BI-RADS categories in 459 breasts (71%), but diverse categories were used for 183 breasts (29%), including 132 significant and 51 non-significant discrepancies. According to weighted Kappa test, agreement between two readers in positive or negative reports was 0.78 (95% CI=0.73-0.83) and in parenchymal density, it was 0.73 (95% CI=0.7-0.77). Most of the discrepancies were between category zero versus categories 1 and 2 (63.4%). The recall rate was 36% for the first and 44% for the second reader. Among 132 significant discordant results, one case had the final diagnosis of malignancy and the others had benign or negative diagnosis. There was 0.2% increase in cancer detection rate by double reading.

Conclusion

This study shows no significant improvement in the cancer detection rate by double reading; however, a lower recall rate could be a more helpful consequence.

Collapse

Prospective trial comparing full-field digital mammography (FFDM) versus combined FFDM and tomosynthesis in a population-based screening programme using independent double reading with arbitration. Eur Radiol 2013;23:2061-71. [PMID: 23553585 PMCID: PMC3701792 DOI: 10.1007/s00330-013-2820-3] [Citation(s) in RCA: 161] [Impact Index Per Article: 14.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2012] [Revised: 01/31/2013] [Accepted: 02/02/2013] [Indexed: 10/31/2022]

Abstract

OBJECTIVES

To compare double readings when interpreting full field digital mammography (2D) and tomosynthesis (3D) during mammographic screening.

METHODS

A prospective, Ethical Committee approved screening study is underway. During the first year 12,621 consenting women underwent both 2D and 3D imaging. Each examination was independently interpreted by four radiologists under four reading modes: Arm A-2D; Arm B-2D + CAD; Arm C-2D + 3D; Arm D-synthesised 2D + 3D. Examinations with a positive score by at least one reader were discussed at an arbitration meeting before a final management decision. Paired double reading of 2D (Arm A + B) and 2D + 3D (Arm C + D) were analysed. Performance measures were compared using generalised linear mixed models, accounting for inter-reader performance heterogeneity (P < 0.05).

RESULTS

Pre-arbitration false-positive scores were 10.3 % (1,286/12,501) and 8.5 % (1,057/12,501) for 2D and 2D + 3D, respectively (P < 0.001). Recall rates were 2.9 % (365/12,621) and 3.7 % (463/12,621), respectively (P = 0.005). Cancer detection was 7.1 (90/12,621) and 9.4 (119/12,621) per 1,000 examinations, respectively (30 % increase, P < 0.001); positive predictive values (detected cancer patients per 100 recalls) were 24.7 % and 25.5 %, respectively (P = 0.97). Using 2D + 3D, double-reading radiologists detected 27 additional invasive cancers (P < 0.001).

CONCLUSION

Double reading of 2D + 3D significantly improves the cancer detection rate in mammography screening.

KEY POINTS

• Tomosynthesis-based screening was successfully implemented in a large prospective screening trial. • Double reading of tomosynthesis-based examinations significantly reduced false-positive interpretations. • Double reading of tomosynthesis significantly increased the detection of invasive cancers.

Collapse

Couraud S, Cortot AB, Greillier L, Gounant V, Mennecier B, Girard N, Besse B, Brouchet L, Castelnau O, Frappé P, Ferretti GR, Guittet L, Khalil A, Lefebure P, Laurent F, Liebart S, Molinier O, Quoix E, Revel MP, Stach B, Souquet PJ, Thomas P, Trédaniel J, Lemarié E, Zalcman G, Barlési F, Milleron B. From randomized trials to the clinic: is it time to implement individual lung-cancer screening in clinical practice? A multidisciplinary statement from French experts on behalf of the French intergroup (IFCT) and the groupe d'Oncologie de langue francaise (GOLF). Ann Oncol 2012;24:586-97. [PMID: 23136229 PMCID: PMC3574545 DOI: 10.1093/annonc/mds476] [Citation(s) in RCA: 75] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open

Redondo A, Comas M, Macià F, Ferrer F, Murta-Nascimento C, Maristany MT, Molins E, Sala M, Castells X. Inter- and intraradiologist variability in the BI-RADS assessment and breast density categories for screening mammograms. Br J Radiol 2012;85:1465-70. [PMID: 22993385 DOI: 10.1259/bjr/21256379] [Citation(s) in RCA: 108] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open

Abstract

OBJECTIVE

The aim of this study was to evaluate reader variability in screening mammograms according to the American College of Radiology Breast Imaging Reporting and Data System (BI-RADS) assessment and breast density categories.

METHODS

A stratified random sample of 100 mammograms was selected from a population-based breast cancer screening programme in Barcelona, Spain: 13 histopathologically confirmed breast cancers and 51 with true-negative and 36 with false-positive results. 21 expert radiologists from radiological units of breast cancer screening programmes in Catalonia, Spain, reviewed the mammography images twice within a 6-month interval. The readers described each mammography using BI-RADS assessment and breast density categories. Inter- and intraradiologist agreement was assessed using percentage of concordance and the kappa (κ) statistic.

RESULTS

Fair interobserver agreement was observed for the BI-RADS assessment [κ=0.37, 95% confidence interval (CI) 0.36-0.38]. When the categories were collapsed in terms of whether additional evaluation was required (Categories III, 0, IV, V) or not (I and II), moderate agreement was found (κ=0.53, 95% CI 0.52-0.54). Intra-observer agreement for BI-RADS assessment was moderate using all categories (κ=0.53, 95% CI 0.50-0.55) and substantial on recall (κ=0.66, 95% CI 0.63-0.70). Regarding breast density, inter- and intraradiologist agreement was substantial (κ=0.73, 95% CI 0.72-0.74 and κ=0.69, 95% CI 0.68-0.70, respectively).

CONCLUSION

We observed a substantial intra-observer agreement in the BI-RADS assessment but only moderate interobserver agreement. Both inter- and intra-observer agreement in mammographic interpretation of breast density was substantial. Advances in knowledge Educational efforts should be made to decrease radiologists' variability in BI-RADS assessment interpretation in population-based breast screening programmes.

Collapse

Skaane P, Kshirsagar A, Hofvind S, Jahr G, Castellino RA. Mammography screening using independent double reading with consensus: is there a potential benefit for computer-aided detection? Acta Radiol 2012;53:241-8. [PMID: 22287148 DOI: 10.1258/ar.2011.110452] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

Abstract

BACKGROUND

Double reading improves the cancer detection rate in mammography screening. Single reading with computer-aided detection (CAD) has been considered to be an alternative to double reading. Little is known about the potential benefit of CAD in breast cancer screening with double reading.

PURPOSE

To compare prospective independent double reading of screen-film (SFM) and full-field digital (FFDM) mammography in population-based screening with retrospective standalone CAD performance on the baseline mammograms of the screen-detected cancers and subsequent cancers diagnosed during the follow-up period.

MATERIAL AND METHODS

The study had ethics committee approval. A 5-point rating scale for probability of cancer was used for 23,923 (SFM = 16,983; FFDM = 6940) screening mammograms. Of 208 evaluable cancers, 104 were screen-detected and 104 were subsequent (44 interval and 60 next screening round) cancers. Baseline mammograms of subsequent cancers were retrospectively classified in consensus without information about cancer location, histology, or CAD prompting as normal, non-specific minimal signs, significant minimal signs, and false-negatives. The baseline mammograms of the screen-detected cancers and subsequent cancers were evaluated by CAD. Significant minimal signs and false-negatives were considered 'actionable' and potentially diagnosable if correctly prompted by CAD.

RESULTS

CAD correctly marked 94% (98/104) of the baseline mammograms of the screen-detected cancers (SFM = 95% [61/64]; FFDM = 93% [37/40]), including 96% (23/24) of those with discordant interpretations. Considering only those baseline examinations of subsequent cancers prospectively interpreted as normal and retrospectively categorized as 'actionable', CAD input at baseline screening had the potential to increase the cancer detection rate from 0.43% to 0.51% (P = 0.13); and to increase cancer detection by 16% ([104 + 17]/104) and decrease interval cancers by 20% (from 44 to 35).

CONCLUSION

CAD may have the potential to increase cancer detection by up to 16%, and to reduce the number of interval cancers by up to 20% in SFM and FFDM screening programs using independent double reading with consensus review. The influence of true- and false-positive CAD marks on decision-making can, however, only be evaluated in a prospective clinical study.

Collapse

Lian K, Bharatha A, Aviv RI, Symons SP. Interpretation errors in CT angiography of the head and neck and the benefit of double reading. AJNR Am J Neuroradiol 2011;32:2132-5. [PMID: 21903915 DOI: 10.3174/ajnr.a2678] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

Husby JA, Espeland A, Kalyanpur A, Brocker C, Haldorsen IS. Double reading of radiological examinations in Norway. Acta Radiol 2011;52:516-21. [PMID: 21498308 DOI: 10.1258/ar.2011.100347] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

Sohns C, Angic B, Sossalla S, Konietschke F, Obenauer S. Computer-assisted Diagnosis in Full-field Digital Mammography-Results in Dependence of Readers Experiences. Breast J 2010;16:490-7. [DOI: 10.1111/j.1524-4741.2010.00963.x] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Shaw CM, Flanagan FL, Fenlon HM, McNicholas MM. Consensus Review of Discordant Findings Maximizes Cancer Detection Rate in Double-Reader Screening Mammography: Irish National Breast Screening Program Experience. Radiology 2009;250:354-62. [DOI: 10.1148/radiol.2502080224] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

Computer-aided detection mammography for breast cancer screening: systematic review and meta-analysis. Arch Gynecol Obstet 2008;279:881-90. [PMID: 19023581 DOI: 10.1007/s00404-008-0841-y] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2008] [Accepted: 10/30/2008] [Indexed: 10/21/2022]

Abstract

CONTEXT

Mammography is generally accepted as the best available breast cancer screening method; however, some cancers detectable on mammography images are missed. Computer-aided detection (CAD) systems for mammography are intended to reduce false negatives by marking suspicious areas of the mammograms for reviewers to consider. Although the prospect of improving the sensitivity of screening mammograms has led to the diffusion of CAD for mammography, little is known about its diagnostic accuracy.

OBJECTIVE

To assess the diagnostic performance of CAD for screening mammography in terms of sensitivity and specificity and incremental recall, biopsy, and cancer diagnosis rates.

DATA SOURCES

Published literature identified by systematic literature searches of 17 databases, including MEDLINE, EMBASE, and the Cochrane Library, searched through 25 September 2008.

STUDY SELECTION

A reviewer and an information specialist selected full-length English-language articles that enrolled asymptomatic women for routine breast cancer screening and provided data needed for our analyses using criteria established a priori. We identified 75 potentially relevant publications, of which 7 (9%) were included.

DATA EXTRACTION

Data were extracted and internal validity was assessed by a single review author, and forms were approved by the co-authors.

RESULTS

Three studies (n = 347,324) reported sensitivity and specificity, or data to calculate them, and five studies (n = 51,162) reported data to calculate incremental rates of cancer diagnoses and recall and biopsy of women who did not have breast cancer. The pooled sensitivity was 86.0% (95% CI 84.2-87.6%) and specificity was 88.2% (95% CI 88.1-88.3%). Of the 100,000 women screened, CAD yielded an additional 50 (95% CI 30-80) correct breast cancer diagnoses, 1,190 (95% CI 1,090-1,290) recalls of healthy women, and 80 (95% CI 60-100) biopsies of healthy women. A total of 96% (95% CI 93.9-97.3%) of women recalled based upon CAD and 65.1% (95% CI 52.3-76.0%) of women biopsied based upon CAD were healthy. No studies reported patient-oriented clinical outcomes.

Collapse

Taylor P, Potts HW. Computer aids and human second reading as interventions in screening mammography: Two systematic reviews to compare effects on cancer detection and recall rate. Eur J Cancer 2008;44:798-807. [PMID: 18353630 DOI: 10.1016/j.ejca.2008.02.016] [Citation(s) in RCA: 84] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2008] [Revised: 02/08/2008] [Accepted: 02/14/2008] [Indexed: 10/22/2022]

Mencattini A, Rabottino G, Salmeri M, Lojacono R, Colini E. Breast Mass Segmentation in Mammographic Images by an Effective Region Growing Algorithm. ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS 2008. [DOI: 10.1007/978-3-540-88458-3_86] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]

Doutriaux-Dumoulin I, Allioux A, Campion L, Meingan P, Molina L. Cancers détectés par le deuxième lecteur : analyse des données de la campagne de dépistage du cancer du sein en Loire-Atlantique, 2003-2005 (nouveau cahier des charges). ACTA ACUST UNITED AC 2007;88:1873-80. [DOI: 10.1016/s0221-0363(07)78365-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]

Skaane P, Hofvind S, Skjennald A. Randomized trial of screen-film versus full-field digital mammography with soft-copy reading in population-based screening program: follow-up and final results of Oslo II study. Radiology 2007;244:708-17. [PMID: 17709826 DOI: 10.1148/radiol.2443061478] [Citation(s) in RCA: 175] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

Skaane P, Kshirsagar A, Stapleton S, Young K, Castellino RA. Effect of Computer-Aided Detection on Independent Double Reading of Paired Screen-Film and Full-Field Digital Screening Mammograms. AJR Am J Roentgenol 2007;188:377-84. [PMID: 17242245 DOI: 10.2214/ajr.05.2207] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

Abstract

OBJECTIVE

The purpose of this study was to evaluate the performance and potential contribution of computer-aided detection (CAD) to independent double reading of paired screen-film and full-field digital screening mammograms.

MATERIALS AND METHODS

The cases of 3,683 women who underwent both screen-film mammography and full-field digital mammography (FFDM) with independent double reading for each technique were followed for 2 years to include cancers detected in the interval between screening rounds and cancers detected at the next screening round. Fifty-five biopsy-proven cancers were diagnosed. The baseline screening mammograms of the 55 cancers were defined as having positive findings if at least one of two independent readers scored it 2 or higher on a 5-point rating scale. The baseline mammograms of interval (n = 10) or secondround (n = 16) cancers were retrospectively classified as overlooked (n = 2), minimal sign actionable (n = 8), minimal sign nonactionable (n = 5), and normal (n = 11). The baseline mammograms of these cases of cancer were evaluated with a CAD system, and the CAD results were compared (McNemar's test for paired proportions) with the findings at prospective independent double reading of mammograms obtained with each technique.

RESULTS

For FFDM, CAD sensitivity was 95% (37/39) compared with 64% (25/39) for double reading (p = 0.006), and for screen-film mammography, CAD sensitivity was 85% (33/39) compared with 77% (30/39) for prospective double reading (p = 0.57) of radiographically visible lesions in baseline mammograms. CAD correctly marked five (13%) of 39 cancers on screen-film mammography and 14 (36%) of 39 cancers on FFDM not detected at prospective independent double reading.

CONCLUSION

CAD showed the potential to increase the cancer detection rate for FFDM and for screen-film mammography in breast cancer screening performed with independent double reading.

Collapse