Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Elmore JG, Jackson SL, Abraham L, Miglioretti DL, Carney PA, Geller BM, Yankaskas BC, Kerlikowske K, Onega T, Rosenberg RD, Sickles EA, Buist DSM. Variability in interpretive performance at screening mammography and radiologists' characteristics associated with accuracy. Radiology 2009;253:641-51. [PMID: 19864507 DOI: 10.1148/radiol.2533082308] [Citation(s) in RCA: 158] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

For:	Elmore JG, Jackson SL, Abraham L, Miglioretti DL, Carney PA, Geller BM, Yankaskas BC, Kerlikowske K, Onega T, Rosenberg RD, Sickles EA, Buist DSM. Variability in interpretive performance at screening mammography and radiologists' characteristics associated with accuracy. Radiology 2009;253:641-51. [PMID: 19864507 DOI: 10.1148/radiol.2533082308] [Citation(s) in RCA: 158] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

Number

Cited by Other Article(s)

Gommers JJJ, Abbey CK, Strand F, Taylor-Phillips S, Jenkinson DJ, Larsen M, Hofvind S, Broeders MJM, Sechopoulos I. Modeling Radiologists' Assessments to Explore Pairing Strategies for Optimized Double Reading of Screening Mammograms. Med Decis Making 2024:272989X241264572. [PMID: 39077968 DOI: 10.1177/0272989x241264572] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/31/2024]

Abstract

PURPOSE

To develop a model that simulates radiologist assessments and use it to explore whether pairing readers based on their individual performance characteristics could optimize screening performance.

METHODS

Logistic regression models were designed and used to model individual radiologist assessments. For model evaluation, model-predicted individual performance metrics and paired disagreement rates were compared against the observed data using Pearson correlation coefficients. The logistic regression models were subsequently used to simulate different screening programs with reader pairing based on individual true-positive rates (TPR) and/or false-positive rates (FPR). For this, retrospective results from breast cancer screening programs employing double reading in Sweden, England, and Norway were used. Outcomes of random pairing were compared against those composed of readers with similar and opposite TPRs/FPRs, with positive assessments defined by either reader flagging an examination as abnormal.

RESULTS

The analysis data sets consisted of 936,621 (Sweden), 435,281 (England), and 1,820,053 (Norway) examinations. There was good agreement between the model-predicted and observed radiologists' TPR and FPR (r ≥ 0.969). Model-predicted negative-case disagreement rates showed high correlations (r ≥ 0.709), whereas positive-case disagreement rates had lower correlation levels due to sparse data (r ≥ 0.532). Pairing radiologists with similar FPR characteristics (Sweden: 4.50% [95% confidence interval: 4.46%-4.54%], England: 5.51% [5.47%-5.56%], Norway: 8.03% [7.99%-8.07%]) resulted in significantly lower FPR than with random pairing (Sweden: 4.74% [4.70%-4.78%], England: 5.76% [5.71%-5.80%], Norway: 8.30% [8.26%-8.34%]), reducing examinations sent to consensus/arbitration while the TPR did not change significantly. Other pairing strategies resulted in equal or worse performance than random pairing.

CONCLUSIONS

Logistic regression models accurately predicted screening mammography assessments and helped explore different radiologist pairing strategies. Pairing readers with similar modeled FPR characteristics reduced the number of examinations unnecessarily sent to consensus/arbitration without significantly compromising the TPR.

HIGHLIGHTS

A logistic-regression model can be derived that accurately predicts individual and paired reader performance during mammography screening reading.Pairing screening mammography radiologists with similar false-positive characteristics reduced false-positive rates with no significant loss in true positives and may reduce the number of examinations unnecessarily sent to consensus/arbitration.

Collapse

Kim HJ, Choi WJ, Gwon HY, Jang SJ, Chae EY, Shin HJ, Cha JH, Kim HH. Improving mammography interpretation for both novice and experienced readers: a comparative study of two commercial artificial intelligence software. Eur Radiol 2024;34:3924-3934. [PMID: 37938383 DOI: 10.1007/s00330-023-10422-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 09/15/2023] [Accepted: 10/14/2023] [Indexed: 11/09/2023]

Abstract

OBJECTIVES

To evaluate the improvement of mammography interpretation for novice and experienced radiologists assisted by two commercial AI software.

METHODS

We compared the performance of two AI software (AI-1 and AI-2) in two experienced and two novice readers for 200 mammographic examinations (80 cancer cases). Two reading sessions were conducted within 4 weeks. The readers rated the likelihood of malignancy (range, 1-7) and the percentage probability of malignancy (range, 0-100%), with and without AI assistance. Differences in AUROC, sensitivity, and specificity were analyzed.

RESULTS

Mean AUROC increased in both novice (0.86 to 0.90 with AI-1 [p = 0.005]; 0.91 with AI-2 [p < 0.001]) and experienced readers (0.87 to 0.92 with AI-1 [p < 0.001]; 0.90 with AI-2 [p = 0.004]). Sensitivities increased from 81.3 to 88.8% with AI-1 (p = 0.027) and to 91.3% with AI-2 (p = 0.005) in novice readers, and from 81.9 to 90.6% with AI-1 (p = 0.001) and to 87.5% with AI-2 (p = 0.016) in experienced readers. Specificity did not decrease significantly in both novice (p > 0.999, both) and experienced readers (p > 0.999 with AI-1 and 0.282 with AI-2). There was no significant difference in the performance change depending on the type of AI software (p > 0.999).

CONCLUSION

Commercial AI software improved the diagnostic performance of both novice and experienced readers. The type of AI software used did not significantly impact performance changes. Further validation with a larger number of cases and readers is needed.

CLINICAL RELEVANCE STATEMENT

Commercial AI software effectively aided mammography interpretation irrespective of the experience level of human readers.

KEY POINTS

• Mammography interpretation remains challenging and is subject to a wide range of interobserver variability. • In this multi-reader study, two commercial AI software improved the sensitivity of mammography interpretation by both novice and experienced readers. The type of AI software used did not significantly impact performance changes. • Commercial AI software may effectively support mammography interpretation irrespective of the experience level of human readers.

Collapse

Cerekci E, Alis D, Denizoglu N, Camurdan O, Ege Seker M, Ozer C, Hansu MY, Tanyel T, Oksuz I, Karaarslan E. Quantitative evaluation of Saliency-Based Explainable artificial intelligence (XAI) methods in Deep Learning-Based mammogram analysis. Eur J Radiol 2024;173:111356. [PMID: 38364587 DOI: 10.1016/j.ejrad.2024.111356] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2023] [Revised: 12/10/2023] [Accepted: 02/02/2024] [Indexed: 02/18/2024]

Kim JG, Haslam B, Diab AR, Sakhare A, Grisot G, Lee H, Holt J, Lee CI, Lotter W, Sorensen AG. Impact of a Categorical AI System for Digital Breast Tomosynthesis on Breast Cancer Interpretation by Both General Radiologists and Breast Imaging Specialists. Radiol Artif Intell 2024;6:e230137. [PMID: 38323914 PMCID: PMC10982824 DOI: 10.1148/ryai.230137] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Revised: 12/26/2023] [Accepted: 01/22/2024] [Indexed: 02/08/2024]

Abstract

Purpose To evaluate performance improvements of general radiologists and breast imaging specialists when interpreting a set of diverse digital breast tomosynthesis (DBT) examinations with the aid of a custom-built categorical artificial intelligence (AI) system. Materials and Methods A fully balanced multireader, multicase reader study was conducted to compare the performance of 18 radiologists (nine general radiologists and nine breast imaging specialists) reading 240 retrospectively collected screening DBT mammograms (mean patient age, 59.8 years ± 11.3 [SD]; 100% women), acquired between August 2016 and March 2019, with and without the aid of a custom-built categorical AI system. The area under the receiver operating characteristic curve (AUC), sensitivity, and specificity across general radiologists and breast imaging specialists reading with versus without AI were assessed. Reader performance was also analyzed as a function of breast cancer characteristics and patient subgroups. Results Every radiologist demonstrated improved interpretation performance when reading with versus without AI, with an average AUC of 0.93 versus 0.87, demonstrating a difference in AUC of 0.06 (95% CI: 0.04, 0.08; P < .001). Improvement in AUC was observed for both general radiologists (difference of 0.08; P < .001) and breast imaging specialists (difference of 0.04; P < .001) and across all cancer characteristics (lesion type, lesion size, and pathology) and patient subgroups (race and ethnicity, age, and breast density) examined. Conclusion A categorical AI system helped improve overall radiologist interpretation performance of DBT screening mammograms for both general radiologists and breast imaging specialists and across various patient subgroups and breast cancer characteristics. Keywords: Computer-aided Diagnosis, Screening Mammography, Digital Breast Tomosynthesis, Breast Cancer, Screening, Convolutional Neural Network (CNN), Artificial Intelligence Supplemental material is available for this article. © RSNA, 2024.

Collapse

Affiliation(s)

Jiye G. Kim From DeepHealth, RadNet AI Solutions, 212 Elm Street, Somerville, MA 02144 (J.G.K., B.H., A.R.D., A.S., G.G., H.L., W.L., A.G.S.); Atos zData, Newark, Del (A.S.); Delaware Imaging Network, RadNet, Wilmington, Del (J.H.); Department of Radiology, University of Washington School of Medicine, Fred Hutchinson Cancer Center, Seattle, Wash (C.I.L.); Department of Health Systems & Population Health, School of Public Health, University of Washington, Seattle, Wash (C.I.L.); and Dana-Farber Cancer Institute, Harvard Medical School, Boston, Mass (W.L.)
Bryan Haslam From DeepHealth, RadNet AI Solutions, 212 Elm Street, Somerville, MA 02144 (J.G.K., B.H., A.R.D., A.S., G.G., H.L., W.L., A.G.S.); Atos zData, Newark, Del (A.S.); Delaware Imaging Network, RadNet, Wilmington, Del (J.H.); Department of Radiology, University of Washington School of Medicine, Fred Hutchinson Cancer Center, Seattle, Wash (C.I.L.); Department of Health Systems & Population Health, School of Public Health, University of Washington, Seattle, Wash (C.I.L.); and Dana-Farber Cancer Institute, Harvard Medical School, Boston, Mass (W.L.)
Abdul Rahman Diab From DeepHealth, RadNet AI Solutions, 212 Elm Street, Somerville, MA 02144 (J.G.K., B.H., A.R.D., A.S., G.G., H.L., W.L., A.G.S.); Atos zData, Newark, Del (A.S.); Delaware Imaging Network, RadNet, Wilmington, Del (J.H.); Department of Radiology, University of Washington School of Medicine, Fred Hutchinson Cancer Center, Seattle, Wash (C.I.L.); Department of Health Systems & Population Health, School of Public Health, University of Washington, Seattle, Wash (C.I.L.); and Dana-Farber Cancer Institute, Harvard Medical School, Boston, Mass (W.L.)
Ashwin Sakhare From DeepHealth, RadNet AI Solutions, 212 Elm Street, Somerville, MA 02144 (J.G.K., B.H., A.R.D., A.S., G.G., H.L., W.L., A.G.S.); Atos zData, Newark, Del (A.S.); Delaware Imaging Network, RadNet, Wilmington, Del (J.H.); Department of Radiology, University of Washington School of Medicine, Fred Hutchinson Cancer Center, Seattle, Wash (C.I.L.); Department of Health Systems & Population Health, School of Public Health, University of Washington, Seattle, Wash (C.I.L.); and Dana-Farber Cancer Institute, Harvard Medical School, Boston, Mass (W.L.)
Giorgia Grisot From DeepHealth, RadNet AI Solutions, 212 Elm Street, Somerville, MA 02144 (J.G.K., B.H., A.R.D., A.S., G.G., H.L., W.L., A.G.S.); Atos zData, Newark, Del (A.S.); Delaware Imaging Network, RadNet, Wilmington, Del (J.H.); Department of Radiology, University of Washington School of Medicine, Fred Hutchinson Cancer Center, Seattle, Wash (C.I.L.); Department of Health Systems & Population Health, School of Public Health, University of Washington, Seattle, Wash (C.I.L.); and Dana-Farber Cancer Institute, Harvard Medical School, Boston, Mass (W.L.)
Hyunkwang Lee From DeepHealth, RadNet AI Solutions, 212 Elm Street, Somerville, MA 02144 (J.G.K., B.H., A.R.D., A.S., G.G., H.L., W.L., A.G.S.); Atos zData, Newark, Del (A.S.); Delaware Imaging Network, RadNet, Wilmington, Del (J.H.); Department of Radiology, University of Washington School of Medicine, Fred Hutchinson Cancer Center, Seattle, Wash (C.I.L.); Department of Health Systems & Population Health, School of Public Health, University of Washington, Seattle, Wash (C.I.L.); and Dana-Farber Cancer Institute, Harvard Medical School, Boston, Mass (W.L.)
Jacqueline Holt From DeepHealth, RadNet AI Solutions, 212 Elm Street, Somerville, MA 02144 (J.G.K., B.H., A.R.D., A.S., G.G., H.L., W.L., A.G.S.); Atos zData, Newark, Del (A.S.); Delaware Imaging Network, RadNet, Wilmington, Del (J.H.); Department of Radiology, University of Washington School of Medicine, Fred Hutchinson Cancer Center, Seattle, Wash (C.I.L.); Department of Health Systems & Population Health, School of Public Health, University of Washington, Seattle, Wash (C.I.L.); and Dana-Farber Cancer Institute, Harvard Medical School, Boston, Mass (W.L.)
Christoph I. Lee From DeepHealth, RadNet AI Solutions, 212 Elm Street, Somerville, MA 02144 (J.G.K., B.H., A.R.D., A.S., G.G., H.L., W.L., A.G.S.); Atos zData, Newark, Del (A.S.); Delaware Imaging Network, RadNet, Wilmington, Del (J.H.); Department of Radiology, University of Washington School of Medicine, Fred Hutchinson Cancer Center, Seattle, Wash (C.I.L.); Department of Health Systems & Population Health, School of Public Health, University of Washington, Seattle, Wash (C.I.L.); and Dana-Farber Cancer Institute, Harvard Medical School, Boston, Mass (W.L.)
William Lotter
A. Gregory Sorensen

Collapse

Shafique A, Gonzalez R, Pantanowitz L, Tan PH, Machado A, Cree IA, Tizhoosh HR. A Preliminary Investigation into Search and Matching for Tumor Discrimination in World Health Organization Breast Taxonomy Using Deep Networks. Mod Pathol 2024;37:100381. [PMID: 37939901 PMCID: PMC10891482 DOI: 10.1016/j.modpat.2023.100381] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2023] [Revised: 10/26/2023] [Accepted: 10/31/2023] [Indexed: 11/10/2023]

Harrison P, Hasan R, Park K. State-of-the-Art of Breast Cancer Diagnosis in Medical Images via Convolutional Neural Networks (CNNs). JOURNAL OF HEALTHCARE INFORMATICS RESEARCH 2023;7:387-432. [PMID: 37927373 PMCID: PMC10620373 DOI: 10.1007/s41666-023-00144-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2022] [Revised: 08/14/2023] [Accepted: 08/22/2023] [Indexed: 11/07/2023]

Abstract

Early detection of breast cancer is crucial for a better prognosis. Various studies have been conducted where tumor lesions are detected and localized on images. This is a narrative review where the studies reviewed are related to five different image modalities: histopathological, mammogram, magnetic resonance imaging (MRI), ultrasound, and computed tomography (CT) images, making it different from other review studies where fewer image modalities are reviewed. The goal is to have the necessary information, such as pre-processing techniques and CNN-based diagnosis techniques for the five modalities, readily available in one place for future studies. Each modality has pros and cons, such as mammograms might give a high false positive rate for radiographically dense breasts, while ultrasounds with low soft tissue contrast result in early-stage false detection, and MRI provides a three-dimensional volumetric image, but it is expensive and cannot be used as a routine test. Various studies were manually reviewed using particular inclusion and exclusion criteria; as a result, 91 recent studies that classify and detect tumor lesions on breast cancer images from 2017 to 2022 related to the five image modalities were included. For histopathological images, the maximum accuracy achieved was around 99 % , and the maximum sensitivity achieved was 97.29 % by using DenseNet, ResNet34, and ResNet50 architecture. For mammogram images, the maximum accuracy achieved was 96.52 % using a customized CNN architecture. For MRI, the maximum accuracy achieved was 98.33 % using customized CNN architecture. For ultrasound, the maximum accuracy achieved was around 99 % by using DarkNet-53, ResNet-50, G-CNN, and VGG. For CT, the maximum sensitivity achieved was 96 % by using Xception architecture. Histopathological and ultrasound images achieved higher accuracy of around 99 % by using ResNet34, ResNet50, DarkNet-53, G-CNN, and VGG compared to other modalities for either of the following reasons: use of pre-trained architectures with pre-processing techniques, use of modified architectures with pre-processing techniques, use of two-stage CNN, and higher number of studies available for Artificial Intelligence (AI)/machine learning (ML) researchers to reference. One of the gaps we found is that only a single image modality is used for CNN-based diagnosis; in the future, a multiple image modality approach can be used to design a CNN architecture with higher accuracy.

Collapse

Yoon JH, Han K, Suh HJ, Youk JH, Lee SE, Kim EK. Artificial intelligence-based computer-assisted detection/diagnosis (AI-CAD) for screening mammography: Outcomes of AI-CAD in the mammographic interpretation workflow. Eur J Radiol Open 2023;11:100509. [PMID: 37484980 PMCID: PMC10362167 DOI: 10.1016/j.ejro.2023.100509] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2023] [Revised: 07/03/2023] [Accepted: 07/09/2023] [Indexed: 07/25/2023] Open

Becker AS, Das JP, Woo S, Perez-Johnston R, Vargas HA. Improving Radiology Oncologic Imaging Trainee Case Diversity through Automatic Examination Assignment: Retrospective Study from a Tertiary Cancer Center. Radiol Imaging Cancer 2023;5:e230035. [PMID: 37889137 DOI: 10.1148/rycan.230035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2023]

Ahn JS, Shin S, Yang SA, Park EK, Kim KH, Cho SI, Ock CY, Kim S. Artificial Intelligence in Breast Cancer Diagnosis and Personalized Medicine. J Breast Cancer 2023;26:405-435. [PMID: 37926067 PMCID: PMC10625863 DOI: 10.4048/jbc.2023.26.e45] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Revised: 09/25/2023] [Accepted: 10/06/2023] [Indexed: 11/07/2023] Open

Harris C, Okorie U, Makrogiannis S. Spatially localized sparse approximations of deep features for breast mass characterization. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2023;20:15859-15882. [PMID: 37919992 PMCID: PMC10949936 DOI: 10.3934/mbe.2023706] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/04/2023]

Rawashdeh MA, Brennan PC. Reducing ' probably benign ' assessments in normal mammograms: The role of radiologist experience. Eur J Radiol Open 2023;10:100498. [PMID: 37359179 PMCID: PMC10285087 DOI: 10.1016/j.ejro.2023.100498] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2023] [Revised: 06/07/2023] [Accepted: 06/09/2023] [Indexed: 06/28/2023] Open

Arzamasov K, Vasilev Y, Vladzymyrskyy A, Omelyanskaya O, Shulkin I, Kozikhina D, Goncharova I, Gelezhe P, Kirpichev Y, Bobrovskaya T, Andreychenko A. An International Non-Inferiority Study for the Benchmarking of AI for Routine Radiology Cases: Chest X-ray, Fluorography and Mammography. Healthcare (Basel) 2023;11:1684. [PMID: 37372802 DOI: 10.3390/healthcare11121684] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2023] [Revised: 06/01/2023] [Accepted: 06/04/2023] [Indexed: 06/29/2023] Open

Affiliation(s)

Kirill Arzamasov State Budget-Funded Health Care Institution of the City of Moscow "Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department", Petrovka Street, 24, Building 1, 127051 Moscow, Russia
Yuriy Vasilev State Budget-Funded Health Care Institution of the City of Moscow "Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department", Petrovka Street, 24, Building 1, 127051 Moscow, Russia Federal State Budgetary Institution "National Medical and Surgical Center Named after N.I. Pirogov" of the Ministry of Health of the Russian Federation, Nizhnyaya Pervomayskaya Street, 70, 105203 Moscow, Russia
Anton Vladzymyrskyy State Budget-Funded Health Care Institution of the City of Moscow "Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department", Petrovka Street, 24, Building 1, 127051 Moscow, Russia Department of Information and Internet Technologies, I.M. Sechenov First Moscow State Medical University of the Ministry of Health of the Russian Federation (Sechenov University), Trubetskaya Street, 8, Building 2, 119991 Moscow, Russia
Olga Omelyanskaya State Budget-Funded Health Care Institution of the City of Moscow "Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department", Petrovka Street, 24, Building 1, 127051 Moscow, Russia
Igor Shulkin State Budget-Funded Health Care Institution of the City of Moscow "Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department", Petrovka Street, 24, Building 1, 127051 Moscow, Russia
Darya Kozikhina State Budget-Funded Health Care Institution of the City of Moscow "Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department", Petrovka Street, 24, Building 1, 127051 Moscow, Russia
Inna Goncharova State Budget-Funded Health Care Institution of the City of Moscow "Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department", Petrovka Street, 24, Building 1, 127051 Moscow, Russia
Pavel Gelezhe State Budget-Funded Health Care Institution of the City of Moscow "Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department", Petrovka Street, 24, Building 1, 127051 Moscow, Russia
Yury Kirpichev State Budget-Funded Health Care Institution of the City of Moscow "Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department", Petrovka Street, 24, Building 1, 127051 Moscow, Russia
Tatiana Bobrovskaya State Budget-Funded Health Care Institution of the City of Moscow "Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department", Petrovka Street, 24, Building 1, 127051 Moscow, Russia
Anna Andreychenko State Budget-Funded Health Care Institution of the City of Moscow "Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department", Petrovka Street, 24, Building 1, 127051 Moscow, Russia

Collapse

Hovda T, Larsen M, Romundstad L, Sahlberg KK, Hofvind S. Breast cancer missed at screening; hindsight or mistakes? Eur J Radiol 2023;165:110913. [PMID: 37311339 DOI: 10.1016/j.ejrad.2023.110913] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2023] [Revised: 04/01/2023] [Accepted: 05/31/2023] [Indexed: 06/15/2023]

Abstract

PURPOSE

To investigate radiologists' interpretation scores of screening mammograms prior to diagnosis of screen-detected and interval breast cancers retrospectively classified as missed or true negative.

METHODS

We included data on radiologists' interpretation scores at screening prior to diagnosis for 1223 screen-detected and 1007 interval cancer cases classified as missed or true negative in an informed consensus-based review. All prior screening examinations were independently scored 1-5 by two radiologists; score 1 by both was considered concordant negative, score ≥ 2 by one radiologist discordant, and score ≥ 2 by both concordant positive. We analyzed associations between interpretation, review categories, mammographic features and histopathological findings using descriptive statistics and logistic regression.

RESULTS

Among screen-detected cancers, 31% of missed and 10% of true negative cancers had discordant or concordant positive interpretation at prior screening. The corresponding percentages for interval cancer were 21% and 8%. Age-adjusted odds ratio (OR) and 95% confidence interval (CI) for missed screen-detected cancer was 3.8 (95% CI: 2.6-5.4) after discordant and 5.5 (95% CI: 3.2-9.5) after concordant positive interpretation, using concordant negative as reference. Corresponding ORs for missed interval cancer were 3.0 (95% CI: 2.0-4.5) for discordant and 6.3 (95% CI: 2.3-17.5) for concordant positive interpretation. Asymmetry was the dominating mammographic feature at prior screening for all, except concordant positive screen-detected cancers where a mass dominated. Histopathological characteristics did not vary statistically with interpretation.

CONCLUSIONS

Most cancers were interpreted negatively at screening prior to diagnosis. Increased risk for missed screen-detected or interval cancer was observed after positive interpretation at prior screening.

Collapse

Wong DJ, Gandomkar Z, Lewis S, Reed W, Suleiman M, Siviengphanom S, Ekpo E. Do Reader Characteristics Affect Diagnostic Efficacy in Screening Mammography? A Systematic Review. Clin Breast Cancer 2023;23:e56-e67. [PMID: 36792458 DOI: 10.1016/j.clbc.2023.01.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2022] [Revised: 01/10/2023] [Accepted: 01/21/2023] [Indexed: 01/27/2023]

Abstract

To examine reader characteristics associated with diagnostic efficacy in the interpretation of screening mammograms. A systematic search of the literature was conducted using databases such as Cochrane, Scopus, Medline, Embase, Web of Science, and PubMed. Search terms were combined with "AND" or "OR" and included: "Radiologist's characteristics AND performance"; "radiologist experience AND screening mammography"; "annual volume read AND diagnostic efficacy"; "screening mammography performance OR diagnostic efficacy". Studies were included if they assessed reader performance in screening mammography interpretation, breast readers, used a reference standard to assess the performance, and were published in the English language. Twenty-eight studies were reviewed. Increasing reader's age was associated with lower false positive rates. No association was found between gender and performance. Half of the studies showed no association between years of reading mammograms and performance. Most studies showed that high reading volume was more likely to be associated with increased sensitivity, cancer detection rates (CDR), lower recall rate, and lower false positive rates. Inconsistent associations were found between fellowship training in breast imaging and reader performance. Specialization in breast imaging was associated with better CDR, sensitivity, and specificity. Limited studies were available to establish the association between performance and factors such as time spent in breast imaging (n = 2), screening focus (n = 1), formal rotation in mammography (n = 1), owner of practice (n = 1), and practice type (n = 1). No individual characteristics is associated with versatility in diagnostic efficacy, albeit reading volume and specialization in breast imaging appear to be associated with with increased sensitivity and CDR without significantly affecting other performance metrics.

Collapse

Proposal and Definition of an Intelligent Clinical Decision Support System Applied to the Screening and Early Diagnosis of Breast Cancer. Cancers (Basel) 2023;15:cancers15061711. [PMID: 36980595 PMCID: PMC10046257 DOI: 10.3390/cancers15061711] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2023] [Revised: 02/24/2023] [Accepted: 03/07/2023] [Indexed: 03/14/2023] Open

Abstract Breast cancer is the most frequently diagnosed tumor pathology on a global scale, being the leading cause of mortality in women. In light of this problem, screening programs have been implemented on the population at risk in the form of mammograms, starting in the 20th century. This has considerably reduced the associated deaths, as well as improved the prognosis of the patients who suffer from this disease. In spite of this, the evaluation of mammograms is not without certain variability and depends, to a large extent, on the experience and training of the medical team carrying out the assessment. With the aim of supporting the evaluation process of mammogram images and improving the diagnosis process, this work presents the design, development and proof of concept of a novel intelligent clinical decision support system, grounded on two predictive approaches that work concurrently. The first of them applies a series of expert systems based on fuzzy inferential engines, geared towards the treatment of the characteristics associated with the main findings present in mammograms. This allows the determination of a series of risk indicators, the Symbolic Risks, related to the risk of developing breast cancer according to the different findings. The second one implements a classification machine learning algorithm, which using data related to mammography findings as well as general patient information determines another metric, the Statistical Risk, also linked to the risk of developing breast cancer. These risk indicators are then combined, resulting in a new indicator, the Global Risk. This could then be corrected using a weighting factor according to the BI-RADS category, allocated to each patient by the medical team in charge. Thus, the Corrected Global Risk is obtained, which after interpretation can be used to establish the patient’s status as well as generate personalized recommendations. The proof of concept and software implementation of the system were carried out using a data set with 130 patients from a database from the School of Medicine and Public Health of the University of Wisconsin-Madison. The results obtained were encouraging, highlighting the potential use of the application, albeit pending intensive clinical validation in real environments. Moreover, its possible integration in hospital computer systems is expected to improve diagnostic processes as well as patient prognosis. Collapse

Gautam SK, Khan P, Natarajan G, Atri P, Aithal A, Ganti AK, Batra SK, Nasser MW, Jain M. Mucins as Potential Biomarkers for Early Detection of Cancer. Cancers (Basel) 2023;15:1640. [PMID: 36980526 PMCID: PMC10046558 DOI: 10.3390/cancers15061640] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Revised: 02/25/2023] [Accepted: 02/27/2023] [Indexed: 03/10/2023] Open

Clerkin N, Ski CF, Brennan PC, Strudwick R. Identification of factors associated with diagnostic performance variation in reporting of mammograms: A review. Radiography (Lond) 2023;29:340-346. [PMID: 36731351 DOI: 10.1016/j.radi.2023.01.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2022] [Revised: 12/13/2022] [Accepted: 01/04/2023] [Indexed: 02/01/2023]

Yapp KE, Suleiman M, Brennan P, Ekpo E. Periapical Radiography versus Cone Beam Computed Tomography in Endodontic Disease Detection: A Free-response, Factorial Study. J Endod 2023;49:419-429. [PMID: 36773745 DOI: 10.1016/j.joen.2023.02.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Revised: 01/17/2023] [Accepted: 02/01/2023] [Indexed: 02/11/2023]

Elezaby MA, Narayan A. Breast Cancer Screening Interpretation Model: An Opportunity for Optimization of Patient and Practice Outcomes. J Am Coll Radiol 2023;20:215-217. [PMID: 36503174 DOI: 10.1016/j.jacr.2022.12.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2022] [Revised: 11/27/2022] [Accepted: 12/06/2022] [Indexed: 12/13/2022]

The effect of clinical history on diagnostic performance of endodontic cone-beam CT interpretation. Clin Radiol 2023;78:e433-e441. [PMID: 36702710 DOI: 10.1016/j.crad.2022.12.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Revised: 11/21/2022] [Accepted: 12/09/2022] [Indexed: 01/12/2023]

Wang Z, Manassi M, Ren Z, Ghirardo C, Canas-Bajo T, Murai Y, Zhou M, Whitney D. Idiosyncratic biases in the perception of medical images. Front Psychol 2022;13:1049831. [PMID: 36600706 PMCID: PMC9806180 DOI: 10.3389/fpsyg.2022.1049831] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Accepted: 11/29/2022] [Indexed: 12/23/2022] Open

Abstract

Introduction

Radiologists routinely make life-altering decisions. Optimizing these decisions has been an important goal for many years and has prompted a great deal of research on the basic perceptual mechanisms that underlie radiologists' decisions. Previous studies have found that there are substantial individual differences in radiologists' diagnostic performance (e.g., sensitivity) due to experience, training, or search strategies. In addition to variations in sensitivity, however, another possibility is that radiologists might have perceptual biases-systematic misperceptions of visual stimuli. Although a great deal of research has investigated radiologist sensitivity, very little has explored the presence of perceptual biases or the individual differences in these.

Methods

Here, we test whether radiologists' have perceptual biases using controlled artificial and Generative Adversarial Networks-generated realistic medical images. In Experiment 1, observers adjusted the appearance of simulated tumors to match the previously shown targets. In Experiment 2, observers were shown with a mix of real and GAN-generated CT lesion images and they rated the realness of each image.

Results

We show that every tested individual radiologist was characterized by unique and systematic perceptual biases; these perceptual biases cannot be simply explained by attentional differences, and they can be observed in different imaging modalities and task settings, suggesting that idiosyncratic biases in medical image perception may widely exist.

Discussion

Characterizing and understanding these biases could be important for many practical settings such as training, pairing readers, and career selection for radiologists. These results may have consequential implications for many other fields as well, where individual observers are the linchpins for life-altering perceptual decisions.

Collapse

Wei T, Aviles-Rivero AI, Wang S, Huang Y, Gilbert FJ, Schönlieb CB, Chen CW. Beyond fine-tuning: Classifying high resolution mammograms using function-preserving transformations. Med Image Anal 2022;82:102618. [PMID: 36183607 DOI: 10.1016/j.media.2022.102618] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2021] [Revised: 08/03/2022] [Accepted: 09/02/2022] [Indexed: 11/15/2022]

Abstract

The task of classifying mammograms is very challenging because the lesion is usually small in the high resolution image. The current state-of-the-art approaches for medical image classification rely on using the de-facto method for convolutional neural networks-fine-tuning. However, there are fundamental differences between natural images and medical images, which based on existing evidence from the literature, limits the overall performance gain when designed with algorithmic approaches. In this paper, we propose to go beyond fine-tuning by introducing a novel framework called MorphHR, in which we highlight a new transfer learning scheme. The idea behind the proposed framework is to integrate function-preserving transformations, for any continuous non-linear activation neurons, to internally regularise the network for improving mammograms classification. The proposed solution offers two major advantages over the existing techniques. Firstly and unlike fine-tuning, the proposed approach allows for modifying not only the last few layers but also several of the first ones on a deep ConvNet. By doing this, we can design the network front to be suitable for learning domain specific features. Secondly, the proposed scheme is scalable to hardware. Therefore, one can fit high resolution images on standard GPU memory. We show that by using high resolution images, one prevents losing relevant information. We demonstrate, through numerical and visual experiments, that the proposed approach yields to a significant improvement in the classification performance over state-of-the-art techniques, and is indeed on a par with radiology experts. Moreover and for generalisation purposes, we show the effectiveness of the proposed learning scheme on another large dataset, the ChestX-ray14, surpassing current state-of-the-art techniques.

Collapse

Syed AH, Khan T. Evolution of research trends in artificial intelligence for breast cancer diagnosis and prognosis over the past two decades: A bibliometric analysis. Front Oncol 2022;12:854927. [PMID: 36267967 PMCID: PMC9578338 DOI: 10.3389/fonc.2022.854927] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Accepted: 08/30/2022] [Indexed: 01/27/2023] Open

CoroNet: Deep Neural Network-Based End-to-End Training for Breast Cancer Diagnosis. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12147080] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]

Gandomkar Z, Lewis SJ, Li T, Ekpo EU, Brennan PC. A machine learning model based on readers' characteristics to predict their performances in reading screening mammograms. Breast Cancer 2022;29:589-598. [PMID: 35122217 PMCID: PMC9226081 DOI: 10.1007/s12282-022-01335-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2021] [Accepted: 01/20/2022] [Indexed: 11/30/2022]

Hadadi I, Rae W, Clarke J, McEntee M, Ekpo E. Breast cancer detection across dense and non-dense breasts: Markers of diagnostic confidence and efficacy. Acta Radiol Open 2022;11:20584601211072279. [PMID: 35111337 PMCID: PMC8801646 DOI: 10.1177/20584601211072279] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2021] [Accepted: 12/17/2021] [Indexed: 11/17/2022] Open

Abstract

Background

The impact of radiologists’ characteristics has become a major focus of recent research. However, the markers of diagnostic efficacy and confidence in dense and non-dense breasts are poorly understood.

Purpose

This study aims to assess the relationship between radiologists’ characteristics and diagnostic performance across dense and non-dense breasts.

Materials and methods

Radiologists specialising in breast imaging (n = 128) who had 0.5–40 (13±10.6) years of experience reading mammograms were recruited. Participants independently interpreted a test set containing 60 digital mammograms (40 normal and 20 abnormal) with similarly distributed breast densities. Diagnostic performance measures were analysed via Jamovi software (version 1.6.22).

Results

In dense breasts, breast-imaging fellowship completion significantly improved specificity (p = 0.004), location sensitivity (p = 0.01) and the area under the curve (AUC) of the receiver operating characteristic (p = 0.03). Only participation in BreastScreen reading significantly improved all performance metrics: specificity (p = 0.04), sensitivity (p = 0.005), location sensitivity (p < 0.001) and AUC (p < 0.001). Reading > 100 mammograms weekly significantly improved sensitivity (p = 0.03), location sensitivity (p = 0.001), and AUC (p = 0.03).In non-dense breasts, breast fellowship completion significantly improved sensitivity (p = 0.02), location sensitivity (p = 0.04) and AUC (p = 0.002). Participation in BreastScreen reading and reading > 100 mammograms weekly significantly improved only sensitivity (p = 0.002 and p = 0.003, respectively) and location sensitivity (p < 0.001 and p < 0.001, respectively).

Conclusion

Participating in screening programs, breast fellowships and reading > 100 mammograms weekly are important indicators of the diagnostic performance of radiologists across dense and non-dense breasts. In dense breasts, optimal performance resulted from participation in a breast screening program.

Collapse

Casal-Guisande M, Comesaña-Campos A, Dutra I, Cerqueiro-Pequeño J, Bouza-Rodríguez JB. Design and Development of an Intelligent Clinical Decision Support System Applied to the Evaluation of Breast Cancer Risk. J Pers Med 2022;12:jpm12020169. [PMID: 35207657 PMCID: PMC8880667 DOI: 10.3390/jpm12020169] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2021] [Revised: 01/14/2022] [Accepted: 01/24/2022] [Indexed: 12/24/2022] Open

Hooshmand S, Reed WM, Suleiman ME, Brennan PC. A review of screening mammography: The benefits and radiation risks put into perspective. J Med Imaging Radiat Sci 2021;53:147-158. [PMID: 34969620 DOI: 10.1016/j.jmir.2021.12.002] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2021] [Revised: 12/01/2021] [Accepted: 12/01/2021] [Indexed: 12/28/2022]

Mridha MF, Hamid MA, Monowar MM, Keya AJ, Ohi AQ, Islam MR, Kim JM. A Comprehensive Survey on Deep-Learning-Based Breast Cancer Diagnosis. Cancers (Basel) 2021;13:6116. [PMID: 34885225 PMCID: PMC8656730 DOI: 10.3390/cancers13236116] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2021] [Revised: 11/25/2021] [Accepted: 12/01/2021] [Indexed: 12/11/2022] Open

Classification of Breast Cancer in Mammograms with Deep Learning Adding a Fifth Class. APPLIED SCIENCES-BASEL 2021. [DOI: 10.3390/app112311398] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]

Mori Y, Bretthauer M, Kalager M. Hopes and Hypes for Artificial Intelligence in Colorectal Cancer Screening. Gastroenterology 2021;161:774-777. [PMID: 33989659 DOI: 10.1053/j.gastro.2021.04.078] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/14/2021] [Revised: 04/20/2021] [Accepted: 04/26/2021] [Indexed: 12/13/2022]

Wang Z, Zhang L, Shu X, Lv Q, Yi Z. An End-to-End Mammogram Diagnosis: A New Multi-Instance and Multiscale Method Based on Single-Image Feature. IEEE Trans Cogn Dev Syst 2021. [DOI: 10.1109/tcds.2019.2963682] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

Zhao W, Wang R, Qi Y, Lou M, Wang Y, Yang Y, Deng X, Ma Y. BASCNet: Bilateral adaptive spatial and channel attention network for breast density classification in the mammogram. Biomed Signal Process Control 2021. [DOI: 10.1016/j.bspc.2021.103073] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Patel MM, Kapoor MM, Whitman GJ. Transitioning to Practice: Getting up to Speed in Efficiency and Accuracy. JOURNAL OF BREAST IMAGING 2021;3:607-611. [PMID: 34545352 PMCID: PMC8445236 DOI: 10.1093/jbi/wbaa100] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2020] [Indexed: 11/13/2022]

Sun Y, Ji Y. AAWS-Net: Anatomy-aware weakly-supervised learning network for breast mass segmentation. PLoS One 2021;16:e0256830. [PMID: 34460852 PMCID: PMC8405027 DOI: 10.1371/journal.pone.0256830] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2021] [Accepted: 08/16/2021] [Indexed: 11/18/2022] Open

Walker MJ, Hartman K, Majpruz V, Leung YW, Fienberg S, Rabeneck L, Chiarelli AM. The Impact of Radiologist Screening Mammogram Reading Volume on Performance in the Ontario Breast Screening Program. Can Assoc Radiol J 2021;73:362-370. [PMID: 34423685 DOI: 10.1177/08465371211031186] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open

Abstract

PURPOSE

Although some studies have shown increasing radiologists' mammography volumes improves performance, there is a lack of evidence specific to digital mammography and breast screening program performance targets. This study evaluates the relationship between digital screening volume and meeting performance targets.

METHODS

This retrospective cohort study included 493 radiologists in the Ontario Breast Screening Program who interpreted 1,762,173 screening mammograms in participants ages 50-90 between 2014 and 2016. Associations between annual screening volume and meeting performance targets for abnormal call rate, positive predictive value (PPV), invasive cancer detection rate (CDR), sensitivity, and specificity were modeled using mixed-effects multivariate logistic regression.

RESULTS

Most radiologists read 500-999 (36.7%) or 1,000-1,999 (31.0%) screens annually, and 18.5% read ≥2,000. Radiologists who read ≥2,000 annually were more likely to meet abnormal call rate (OR = 3.85; 95% CI: 1.17-12.61), PPV (OR = 5.36; 95% CI: 2.53-11.34), invasive CDR (OR = 4.14; 95% CI: 1.50-11.46), and specificity (OR = 4.07; 95% CI: 1.89-8.79) targets versus those who read 100-499 screens. Radiologists reading 1,000-1,999 screens annually were more likely to meet PPV (OR = 2.32; 95% CI: 1.22-4.40), invasive CDR (OR = 3.36; 95% CI: 1.49-7.59) and specificity (OR = 2.00; 95% CI: 1.04-3.84) targets versus those who read 100-499 screens. No significant differences were observed for sensitivity.

CONCLUSIONS

Annual reading volume requirements of 1,000 in Canada are supported as screening volume above 1,000 was strongly associated with achieving performance targets for nearly all measures. Increasing the minimum volume to 2,000 may further reduce the potential limitations of screening due to false positives, leading to improvements in overall breast screening program quality.

Collapse

Lee CS, Moy L, Hughes D, Golden D, Bhargavan-Chatfield M, Hemingway J, Geras A, Duszak R, Rosenkrantz AB. Radiologist Characteristics Associated with Interpretive Performance of Screening Mammography: A National Mammography Database (NMD) Study. Radiology 2021;300:518-528. [PMID: 34156300 DOI: 10.1148/radiol.2021204379] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

Abstract

Background Factors affecting radiologists' performance in screening mammography interpretation remain poorly understood. Purpose To identify radiologists characteristics that affect screening mammography interpretation performance. Materials and Methods This retrospective study included 1223 radiologists in the National Mammography Database (NMD) from 2008 to 2019 who could be linked to Centers for Medicare & Medicaid Services (CMS) datasets. NMD screening performance metrics were extracted. Acceptable ranges were defined as follows: recall rate (RR) between 5% and 12%; cancer detection rate (CDR) of at least 2.5 per 1000 screening examinations; positive predictive value of recall (PPV1) between 3% and 8%; positive predictive value of biopsies recommended (PPV2) between 20% and 40%; positive predictive value of biopsies performed (PPV3) between the 25th and 75th percentile of study sample; invasive CDR of at least the 25th percentile of the study sample; and percentage of ductal carcinoma in situ (DCIS) of at least the 25th percentile of the study sample. Radiologist characteristics extracted from CMS datasets included demographics, subspecialization, and clinical practice patterns. Multivariable stepwise logistic regression models were performed to identify characteristics independently associated with acceptable performance for the seven metrics. The most influential characteristics were defined as those independently associated with the majority of the metrics (at least four). Results Relative to radiologists practicing in the Northeast, those in the Midwest were more likely to achieve acceptable RR, PPV1, PPV2, and CDR (odds ratio [OR], 1.4-2.5); those practicing in the West were more likely to achieve acceptable RR, PPV2, and PPV3 (OR, 1.7-2.1) but less likely to achieve acceptable invasive CDR (OR, 0.6). Relative to general radiologists, breast imagers were more likely to achieve acceptable PPV1, invasive CDR, percentage DCIS, and CDR (OR, 1.4-4.4). Those performing diagnostic mammography were more likely to achieve acceptable PPV1, PPV2, PPV3, invasive CDR, and CDR (OR, 1.9-2.9). Those performing breast US were less likely to achieve acceptable PPV1, PPV2, percentage DCIS, and CDR (OR, 0.5-0.7). Conclusion The geographic location of the radiology practice, subspecialization in breast imaging, and performance of diagnostic mammography are associated with better screening mammography performance; performance of breast US is associated with lower performance. ^©RSNA, 2021 Online supplemental material is available for this article.

Collapse

Affiliation(s)

Cindy S Lee From the Department of Radiology, New York University Langone Health, 660 1st Ave, 3rd Floor, New York, NY 10016 (C.S.L., L.M., A.B.R.); Harvey L. Neiman Health Policy Institute, Reston, Va (D.H., J.H., R.D., A.B.R.); American College of Radiology, Reston, Va (D.G., M.B.C.); Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland (A.G.); and Department of Radiology and Imaging Sciences, Emory University, Atlanta, Ga (R.D.)
Linda Moy From the Department of Radiology, New York University Langone Health, 660 1st Ave, 3rd Floor, New York, NY 10016 (C.S.L., L.M., A.B.R.); Harvey L. Neiman Health Policy Institute, Reston, Va (D.H., J.H., R.D., A.B.R.); American College of Radiology, Reston, Va (D.G., M.B.C.); Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland (A.G.); and Department of Radiology and Imaging Sciences, Emory University, Atlanta, Ga (R.D.)
Danny Hughes From the Department of Radiology, New York University Langone Health, 660 1st Ave, 3rd Floor, New York, NY 10016 (C.S.L., L.M., A.B.R.); Harvey L. Neiman Health Policy Institute, Reston, Va (D.H., J.H., R.D., A.B.R.); American College of Radiology, Reston, Va (D.G., M.B.C.); Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland (A.G.); and Department of Radiology and Imaging Sciences, Emory University, Atlanta, Ga (R.D.)
Dan Golden From the Department of Radiology, New York University Langone Health, 660 1st Ave, 3rd Floor, New York, NY 10016 (C.S.L., L.M., A.B.R.); Harvey L. Neiman Health Policy Institute, Reston, Va (D.H., J.H., R.D., A.B.R.); American College of Radiology, Reston, Va (D.G., M.B.C.); Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland (A.G.); and Department of Radiology and Imaging Sciences, Emory University, Atlanta, Ga (R.D.)
Mythreyi Bhargavan-Chatfield From the Department of Radiology, New York University Langone Health, 660 1st Ave, 3rd Floor, New York, NY 10016 (C.S.L., L.M., A.B.R.); Harvey L. Neiman Health Policy Institute, Reston, Va (D.H., J.H., R.D., A.B.R.); American College of Radiology, Reston, Va (D.G., M.B.C.); Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland (A.G.); and Department of Radiology and Imaging Sciences, Emory University, Atlanta, Ga (R.D.)
Jennifer Hemingway From the Department of Radiology, New York University Langone Health, 660 1st Ave, 3rd Floor, New York, NY 10016 (C.S.L., L.M., A.B.R.); Harvey L. Neiman Health Policy Institute, Reston, Va (D.H., J.H., R.D., A.B.R.); American College of Radiology, Reston, Va (D.G., M.B.C.); Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland (A.G.); and Department of Radiology and Imaging Sciences, Emory University, Atlanta, Ga (R.D.)
Agnieszka Geras From the Department of Radiology, New York University Langone Health, 660 1st Ave, 3rd Floor, New York, NY 10016 (C.S.L., L.M., A.B.R.); Harvey L. Neiman Health Policy Institute, Reston, Va (D.H., J.H., R.D., A.B.R.); American College of Radiology, Reston, Va (D.G., M.B.C.); Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland (A.G.); and Department of Radiology and Imaging Sciences, Emory University, Atlanta, Ga (R.D.)
Richard Duszak From the Department of Radiology, New York University Langone Health, 660 1st Ave, 3rd Floor, New York, NY 10016 (C.S.L., L.M., A.B.R.); Harvey L. Neiman Health Policy Institute, Reston, Va (D.H., J.H., R.D., A.B.R.); American College of Radiology, Reston, Va (D.G., M.B.C.); Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland (A.G.); and Department of Radiology and Imaging Sciences, Emory University, Atlanta, Ga (R.D.)
Andrew B Rosenkrantz From the Department of Radiology, New York University Langone Health, 660 1st Ave, 3rd Floor, New York, NY 10016 (C.S.L., L.M., A.B.R.); Harvey L. Neiman Health Policy Institute, Reston, Va (D.H., J.H., R.D., A.B.R.); American College of Radiology, Reston, Va (D.G., M.B.C.); Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland (A.G.); and Department of Radiology and Imaging Sciences, Emory University, Atlanta, Ga (R.D.)

Collapse

Adlung L, Cohen Y, Mor U, Elinav E. Machine learning in clinical decision making. MED 2021;2:642-665. [DOI: 10.1016/j.medj.2021.04.006] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2021] [Revised: 03/22/2021] [Accepted: 04/06/2021] [Indexed: 12/24/2022]

Alakhras M, Al-Mousa DS, Alqadi AK, Sabaneh HA, Karasneh RM, Spuur KM. The influence of breast density and key demographics of radiographers on mammography reporting performance - a pilot study. J Med Radiat Sci 2021;69:30-36. [PMID: 34028205 PMCID: PMC8892415 DOI: 10.1002/jmrs.486] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2021] [Revised: 04/22/2021] [Accepted: 04/30/2021] [Indexed: 11/24/2022] Open

Improving radiologist's ability in identifying particular abnormal lesions on mammograms through training test set with immediate feedback. Sci Rep 2021;11:9899. [PMID: 33972611 PMCID: PMC8110801 DOI: 10.1038/s41598-021-89214-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2020] [Accepted: 04/06/2021] [Indexed: 12/24/2022] Open

Abstract

It has been shown that there are differences in diagnostic accuracy of cancer detection on mammograms, from below 50% in developing countries to over 80% in developed world. One previous study reported that radiologists from a population in Asia displayed a low mammographic cancer detection of 48% compared with over 80% in developed countries, and more importantly, that most lesions missed by these radiologists were spiculated masses or stellate lesions. The aim of this study was to explore the performance of radiologists after undertaking a training test set which had been designed to improve the capability in detecting a specific type of cancers on mammograms. Twenty-five radiologists read two sets of 60 mammograms in a standardized mammogram reading room. The first test set focused on stellate or spiculated masses. When radiologists completed the first set, the system displayed immediate feedback to the readers comparing their performances in each case with the truth of cancer cases and cancer types so that the readers could identify individual-based errors. Later radiologists were asked to read the second set of mammograms which contained different types of cancers including stellate/spiculated masses, asymmetric density, calcification, discrete mass and architectural distortion. Case sensitivity, lesion sensitivity, specificity, receiver operating characteristics (ROC) and Jackknife alternative free-response receiver operating characteristics (JAFROC) were calculated for each participant and their diagnostic accuracy was compared between two sessions. Results showed significant improvement among radiologists in case sensitivity (+ 11.4%; P < 0.05), lesion sensitivity (+ 18.7%; P < 0.01) and JAFROC (+ 11%; P < 0.01) in the second set compared with the first set. The increase in diagnostic accuracy was also recorded in the detection of stellate/spiculated mass (+ 20.6%; P < 0.05). This indicated that the performance of radiologists in detecting malignant lesions on mammograms can be improved if an appropriate training intervention is applied after the readers' weakness and strength are identified.

Collapse

A review on recent advancements in diagnosis and classification of cancers using artificial intelligence. Biomedicine (Taipei) 2021;10:5-17. [PMID: 33854922 PMCID: PMC7721470 DOI: 10.37796/2211-8039.1012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2020] [Accepted: 06/16/2020] [Indexed: 12/09/2022] Open

Cornford E, Cheung S, Press M, Kearins O, Taylor-Phillips S. Optimum screening mammography reading volumes: evidence from the NHS Breast Screening Programme. Eur Radiol 2021;31:6909-6915. [PMID: 33630161 DOI: 10.1007/s00330-021-07754-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2020] [Revised: 01/06/2021] [Accepted: 02/04/2021] [Indexed: 11/28/2022]

Abstract

OBJECTIVES

Minimum caseload standards for professionals examining breast screening mammograms vary from 480 (US) to 5000 (Europe). We measured the relationship between the number of women's mammograms examined per year and reader performance.

METHODS

We extracted routine records from the English NHS Breast Screening Programme for readers examining between 1000 and 45,000 mammograms between April 2014 and March 2017. We measured the relationship between the volume of cases read and screening performance (cancer detection rate, recall rate, positive predictive value of recall (PPV) and discrepant cancers) using linear logistic regression. We also examined the effect of reader occupational group on performance.

RESULTS

In total, 759 eligible mammography readers (445 consultant radiologists, 235 radiography advanced practitioners, 79 consultant radiographers) examined 6.1 million women's mammograms during the study period. PPV increased from 12.9 to 14.4 to 17.0% for readers examining 2000, 5000 and 10000 cases per year respectively. This was driven by decreases in recall rates from 5.8 to 5.3 to 4.5 with increasing volume read, and no change in cancer detection rate (from 7.6 to 7.6 to 7.7). There was no difference in cancer detection rate with reader occupational group. Consultant radiographers had higher recall rate and lower PPV compared to radiologists (OR 1.105, p = 0.012; OR 0.874, p = 0.002, unadjusted).

CONCLUSION

Positive predictive value of screening increases with the total volume of cases examined per reader, through decreases in numbers of cases recalled with no concurrent change in numbers of cancers detected.

KEY POINTS

• In the English Breast Screening Programme, readers who examined a larger number of cases per year had a higher positive predictive value, because they recalled fewer women for further tests but detected the same number of cancers. • Reader type did not affect cancer detection rate, but consultant radiographers had a higher recall rate and lower positive predictive value than consultant radiologists, although this was not adjusted for length of experience.

Collapse

Waite S, Scott J, Colombo D. Narrowing the Gap: Imaging Disparities in Radiology. Radiology 2021;299:27-35. [PMID: 33560191 DOI: 10.1148/radiol.2021203742] [Citation(s) in RCA: 55] [Impact Index Per Article: 18.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]

de Margerie-Mellon C, Debry JB, Dupont A, Cuvier C, Giacchetti S, Teixeira L, Espié M, de Bazelaire C. Nonpalpable breast lesions: impact of a second-opinion review at a breast unit on BI-RADS classification. Eur Radiol 2021;31:5913-5923. [PMID: 33462625 DOI: 10.1007/s00330-020-07664-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2020] [Revised: 12/10/2020] [Accepted: 12/22/2020] [Indexed: 11/30/2022]

Abstract

OBJECTIVE

To compare BI-RADS classification, management, and outcome of nonpalpable breast lesions assessed both by community practices and by a multidisciplinary tumor board (MTB) at a breast unit.

METHODS

All nonpalpable lesions that were first assigned a BI-RADS score by community practices and then reassessed by an MTB at a single breast unit from 2009 to 2017 were retrospectively reviewed. Inter-review agreement was assessed with Cohen's kappa statistic. Changes in biopsy recommendation were calculated. The percentage of additional tumor lesions detected by the MTB was obtained. The sensitivity, AUC, and cancer rates for BI-RADS category 3, 4, and 5 lesions were computed for both reviews.

RESULTS

A total of 1909 nonpalpable lesions in 1732 patients were included. For BI-RADS scores in the whole cohort, a fair agreement was found (κ = 0.40 [0.36-0.45]) between the two reviews. Agreement was higher when considering only mammography combined with ultrasound (κ = 0.53 [0.44-0.62]), masses (κ = 0.50 [0.44-0.56]), and architectural distortion (κ = 0.44 [0.11-0.78]). Changes in biopsy recommendation occurred in 589 cases (31%). Ninety of 345 additional biopsies revealed high-risk or malignant lesions. Overall, the MTB identified 27% additional high-risk and malignant lesions compared to community practices. The BI-RADS classification AUCs for detecting malignant lesions were 0.66 (0.63-0.69) for community practices and 0.76 (0.75-0.78) for the MTB (p < 0.001).

CONCLUSION

Agreement between community practices and MTB reviews for BI-RADS classification in nonpalpable lesions is only fair. MTB review improves diagnostic performances of breast imaging and patient management.

KEY POINTS

• The inter-review agreement for BI-RADS classification between community practices and the multidisciplinary board was only fair (κ = 0.40). • Disagreements resulted in changes of biopsy recommendation in 31% of the lesions. • The multidisciplinary board identified 27% additional high-risk and malignant lesions compared to community practices.

Collapse

Funaro K, Ataya D, Niell B. Understanding the Mammography Audit. Radiol Clin North Am 2020;59:41-55. [PMID: 33222999 DOI: 10.1016/j.rcl.2020.09.009] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Trieu PD, Lewis SJ, Li T, Ho K, Tapia KA, Brennan PC. Reader characteristics and mammogram features associated with breast imaging reporting scores. Br J Radiol 2020;93:20200363. [DOI: 10.1259/bjr.20200363] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open

Abstract Objectives: This study aims to explore the reading performances of radiologists in detecting cancers on mammograms using Tabar Breast Imaging Reporting and Data System (BIRADS) classification and identify factors related to breast imaging reporting scores. Methods: 117 readings of five different mammogram test sets with each set containing 20 cancer and 40 normal cases were performed by Australian radiologists. Each radiologist evaluated the mammograms using the BIRADS lexicon with category 1 - negative, category 2 - benign findings, category 3 - equivocal findings (Recall), category 4 - suspicious findings (Recall), and category 5 - highly suggestive of malignant findings (Recall). Performance metrics (true positive, false positive, true negative, and false negative) were calculated for each radiologist and the distribution of reporting categories was analyzed in reader-based and case-based groups. The association of reader characteristics and case features among categories was examined using Mann-Whitney U and Kruskal-Wallis tests. Results: 38% of cancer-containing mammograms were reported with category 3 which decreased to 32.3% with category 4 and 16.2% with category 5 while 16.6 and 10.3% of cancer cases were marked with categories 1 and 2. Female readers had less false-negative rates when using categories 1 and 2 for cancer cases than male readers (p < 0.01). A similar pattern as gender category was also found in Breast Screen readers and readers completed breast reading fellowships compared with non-Breast Screen and non-fellowship readers (p < 0.05). Radiologists with low number of cases read per week were more likely to record the cancer cases with category 4 while the ones with high number of cases were with category 3 (p < 0.01). Discrete mass and asymmetric density were the two types of abnormalities reported mostly as equivocal findings with category 3 (47–50%; p = 0.005) while spiculated mass or stellate lesions were mostly selected as highly suggestive of malignancy with category 5 (26%, p = 0.001). Conclusions: Most radiologists used category 3 when reporting cancer mammograms. Gender, working for BreastScreen, fellowship completion, and number of cases read per week were factors associated with scoring selection. Radiologists reported higher Tabar BIRADS category for specific types of abnormalities on mammograms than others. Advances in knowledge: The study identified factors associated with the decision of radiologists in assigning a BIRADS Tabar score for mammograms with abnormality. These findings will be useful for individual training programs to improve the confidence of radiologists in recognizing abnormal lesions on screening mammograms. Collapse

Salim M, Wåhlin E, Dembrower K, Azavedo E, Foukakis T, Liu Y, Smith K, Eklund M, Strand F. External Evaluation of 3 Commercial Artificial Intelligence Algorithms for Independent Assessment of Screening Mammograms. JAMA Oncol 2020;6:1581-1588. [PMID: 32852536 PMCID: PMC7453345 DOI: 10.1001/jamaoncol.2020.3321] [Citation(s) in RCA: 133] [Impact Index Per Article: 33.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2020] [Accepted: 06/02/2020] [Indexed: 12/21/2022]

Abstract

Importance

A computer algorithm that performs at or above the level of radiologists in mammography screening assessment could improve the effectiveness of breast cancer screening.

Objective

To perform an external evaluation of 3 commercially available artificial intelligence (AI) computer-aided detection algorithms as independent mammography readers and to assess the screening performance when combined with radiologists.

Design, Setting, and Participants

This retrospective case-control study was based on a double-reader population-based mammography screening cohort of women screened at an academic hospital in Stockholm, Sweden, from 2008 to 2015. The study included 8805 women aged 40 to 74 years who underwent mammography screening and who did not have implants or prior breast cancer. The study sample included 739 women who were diagnosed as having breast cancer (positive) and a random sample of 8066 healthy controls (negative for breast cancer).

Main Outcomes and Measures

Positive follow-up findings were determined by pathology-verified diagnosis at screening or within 12 months thereafter. Negative follow-up findings were determined by a 2-year cancer-free follow-up. Three AI computer-aided detection algorithms (AI-1, AI-2, and AI-3), sourced from different vendors, yielded a continuous score for the suspicion of cancer in each mammography examination. For a decision of normal or abnormal, the cut point was defined by the mean specificity of the first-reader radiologists (96.6%).

Results

The median age of study participants was 60 years (interquartile range, 50-66 years) for 739 women who received a diagnosis of breast cancer and 54 years (interquartile range, 47-63 years) for 8066 healthy controls. The cases positive for cancer comprised 618 (84%) screen detected and 121 (16%) clinically detected within 12 months of the screening examination. The area under the receiver operating curve for cancer detection was 0.956 (95% CI, 0.948-0.965) for AI-1, 0.922 (95% CI, 0.910-0.934) for AI-2, and 0.920 (95% CI, 0.909-0.931) for AI-3. At the specificity of the radiologists, the sensitivities were 81.9% for AI-1, 67.0% for AI-2, 67.4% for AI-3, 77.4% for first-reader radiologist, and 80.1% for second-reader radiologist. Combining AI-1 with first-reader radiologists achieved 88.6% sensitivity at 93.0% specificity (abnormal defined by either of the 2 making an abnormal assessment). No other examined combination of AI algorithms and radiologists surpassed this sensitivity level.

Conclusions and Relevance

To our knowledge, this study is the first independent evaluation of several AI computer-aided detection algorithms for screening mammography. The results of this study indicated that a commercially available AI computer-aided detection algorithm can assess screening mammograms with a sufficient diagnostic performance to be further evaluated as an independent reader in prospective clinical trials. Combining the first readers with the best algorithm identified more cases positive for cancer than combining the first readers with second readers.

Collapse

Trieu PDY, Puslednik L, Colley B, Brennan A, Rodriguez VC, Cook N, Dean K, Dryburgh S, Lowe H, Mahon C, McGowan S, O'Brien J, Moog W, Whale J, Wong D, Li T, Brennan PC. Interpretative characteristics and case features associated with the performances of radiologists in reading mammograms: A study from a non-screening population in Asia. Asia Pac J Clin Oncol 2020;17:139-148. [PMID: 32894814 DOI: 10.1111/ajco.13429] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2020] [Accepted: 06/20/2020] [Indexed: 11/30/2022]

Abstract

AIMS

To explore radiologist characteristics and case features associated with diagnostic performances in cancer detection on mammograms in a South East Asian population.

METHODS

Fifty-three radiologists reported 60 mammographic examinations which consisted of 40 normal and 20 cancer-containing cases at the BREAST workshops. Radiologists were asked to examine each mammogram using the BIRADS on diagnostic monitors. Differences in reader characteristics and case features between correct and incorrect decisions were assessed separately for cancer and normal cases. Univariate and multivariate logistic regressions were applied to generate odds ratios (OR) for significant factors related to correct decisions.

RESULTS

Radiologists who spent ≥10 hours/week reporting mammograms had a higher possibility of detecting cancer lesions (OR = 1.6; P = 0.01). A higher rate of accuracy in reporting negative cases was associated with female radiologists (OR = 1.4; P = 0.002), radiologists who read ≤20 mammograms per week (OR = 1.5; P < 0.0001), had completed training course (OR = 1.7; P < 0.0001) or wore eyeglasses (OR = 1.4; P = 0.01). Cancer cases with breast density >50% (OR = 2.1; P < 0.0001), having abnormal lesions ≥9 mm (OR = 1.8; P < 0.0001), or displaying calcifications, a discrete mass or nonspecific density (OR = 1.6; P < 0.0001) were recorded with a higher detection rate by radiologists than other cases. Lesions located on the right breasts (OR = 1.8; P < 0.0001) or found in the lower inner, upper outer or mixed locations (OR = 2.7; P < 0.0001) were also recorded with a better diagnostic possibility compared with other lesions.

CONCLUSION

This work identified key features related to diagnostic accuracy of breast cancer on mammograms in a nonscreening population, which is helpful for developing appropriate strategies to improve breast cancer detectability of radiologists.

Collapse

Li T, Taba ST, Khong PL, Tan TXL, Trieu PDY, Chan E, Suleiman ME, Li Y, Brennan P, Lewis S. Reading High Breast Density Mammograms: Differences in Diagnostic Performance between Radiologists from Hong Kong SAR/Guangdong Province in China and Australia. Asian Pac J Cancer Prev 2020;21:2623-2629. [PMID: 32986361 PMCID: PMC7779441 DOI: 10.31557/apjcp.2020.21.9.2623] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2020] [Indexed: 12/29/2022] Open

Salim M, Dembrower K, Eklund M, Lindholm P, Strand F. Range of Radiologist Performance in a Population-based Screening Cohort of 1 Million Digital Mammography Examinations. Radiology 2020;297:33-39. [PMID: 32720866 DOI: 10.1148/radiol.2020192212] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

Abstract

Background There is great interest in developing artificial intelligence (AI)-based computer-aided detection (CAD) systems for use in screening mammography. Comparative performance benchmarks from true screening cohorts are needed. Purpose To determine the range of human first-reader performance measures within a population-based screening cohort of 1 million screening mammograms to gauge the performance of emerging AI CAD systems. Materials and Methods This retrospective study consisted of all screening mammograms in women aged 40-74 years in Stockholm County, Sweden, who underwent screening with full-field digital mammography between 2008 and 2015. There were 110 interpreting radiologists, of whom 24 were defined as high-volume readers (ie, those who interpreted more than 5000 annual screening mammograms). A true-positive finding was defined as the presence of a pathology-confirmed cancer within 12 months. Performance benchmarks included sensitivity and specificity, examined per quartile of radiologists' performance. First-reader sensitivity was determined for each tumor subgroup, overall and by quartile of high-volume reader sensitivity. Screening outcomes were examined based on the first reader's sensitivity quartile with 10 000 screening mammograms per quartile. Linear regression models were fitted to test for a linear trend across quartiles of performance. Results A total of 418 041 women (mean age, 54 years ± 10 [standard deviation]) were included, and 1 186 045 digital mammograms were evaluated, with 972 899 assessed by high-volume readers. Overall sensitivity was 73% (95% confidence interval [CI]: 69%, 77%), and overall specificity was 96% (95% CI: 95%, 97%). The mean values per quartile of high-volume reader performance ranged from 63% to 84% for sensitivity and from 95% to 98% for specificity. The sensitivity difference was very large for basal cancers, with the least sensitive and most sensitive high-volume readers detecting 53% and 89% of cancers, respectively (P < .001). Conclusion Benchmarks showed a wide range of performance differences between high-volume readers. Sensitivity varied by tumor characteristics. © RSNA, 2020 Online supplemental material is available for this article.

Collapse

Affiliation(s)

Mattie Salim From the Departments of Pathology and Oncology (M.S., F.S.), Physiology and Pharmacology (K.D., P.L.), and Medical Epidemiology and Biostatistics (M.E.), Karolinska Institute, Stockholm, Sweden; Department of Radiology (M.S.) and Breast Radiology (F.S.), Karolinska University Hospital, Dalagatan 90, 113 43 Stockholm, Sweden; and the Department of Radiology, Capio Sankt Görans Hospital, Stockholm, Sweden (K.D.)
Karin Dembrower From the Departments of Pathology and Oncology (M.S., F.S.), Physiology and Pharmacology (K.D., P.L.), and Medical Epidemiology and Biostatistics (M.E.), Karolinska Institute, Stockholm, Sweden; Department of Radiology (M.S.) and Breast Radiology (F.S.), Karolinska University Hospital, Dalagatan 90, 113 43 Stockholm, Sweden; and the Department of Radiology, Capio Sankt Görans Hospital, Stockholm, Sweden (K.D.)
Martin Eklund From the Departments of Pathology and Oncology (M.S., F.S.), Physiology and Pharmacology (K.D., P.L.), and Medical Epidemiology and Biostatistics (M.E.), Karolinska Institute, Stockholm, Sweden; Department of Radiology (M.S.) and Breast Radiology (F.S.), Karolinska University Hospital, Dalagatan 90, 113 43 Stockholm, Sweden; and the Department of Radiology, Capio Sankt Görans Hospital, Stockholm, Sweden (K.D.)
Peter Lindholm From the Departments of Pathology and Oncology (M.S., F.S.), Physiology and Pharmacology (K.D., P.L.), and Medical Epidemiology and Biostatistics (M.E.), Karolinska Institute, Stockholm, Sweden; Department of Radiology (M.S.) and Breast Radiology (F.S.), Karolinska University Hospital, Dalagatan 90, 113 43 Stockholm, Sweden; and the Department of Radiology, Capio Sankt Görans Hospital, Stockholm, Sweden (K.D.)
Fredrik Strand From the Departments of Pathology and Oncology (M.S., F.S.), Physiology and Pharmacology (K.D., P.L.), and Medical Epidemiology and Biostatistics (M.E.), Karolinska Institute, Stockholm, Sweden; Department of Radiology (M.S.) and Breast Radiology (F.S.), Karolinska University Hospital, Dalagatan 90, 113 43 Stockholm, Sweden; and the Department of Radiology, Capio Sankt Görans Hospital, Stockholm, Sweden (K.D.)

Collapse