1
|
Gommers JJJ, Abbey CK, Strand F, Taylor-Phillips S, Jenkinson DJ, Larsen M, Hofvind S, Sechopoulos I, Broeders MJM. Optimizing the Pairs of Radiologists That Double Read Screening Mammograms. Radiology 2023; 309:e222691. [PMID: 37874241 DOI: 10.1148/radiol.222691] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
Background Despite variation in performance characteristics among radiologists, the pairing of radiologists for the double reading of screening mammograms is performed randomly. It is unknown how to optimize pairing to improve screening performance. Purpose To investigate whether radiologist performance characteristics can be used to determine the optimal set of pairs of radiologists to double read screening mammograms for improved accuracy. Materials and Methods This retrospective study was performed with reading outcomes from breast cancer screening programs in Sweden (2008-2015), England (2012-2014), and Norway (2004-2018). Cancer detection rates (CDRs) and abnormal interpretation rates (AIRs) were calculated, with AIR defined as either reader flagging an examination as abnormal. Individual readers were divided into performance categories based on their high and low CDR and AIR. The performance of individuals determined the classification of pairs. Random pair performance, for which any type of pair was equally represented, was compared with the performance of specific pairing strategies, which consisted of pairs of readers who were either opposite or similar in AIR and/or CDR. Results Based on a minimum number of examinations per reader and per pair, the final study sample consisted of 3 592 414 examinations (Sweden, n = 965 263; England, n = 837 048; Norway, n = 1 790 103). The overall AIRs and CDRs for all specific pairing strategies (Sweden AIR range, 45.5-56.9 per 1000 examinations and CDR range, 3.1-3.6 per 1000; England AIR range, 68.2-70.5 per 1000 and CDR range, 8.9-9.4 per 1000; Norway AIR range, 81.6-88.1 per 1000 and CDR range, 6.1-6.8 per 1000) were not significantly different from the random pairing strategy (Sweden AIR, 54.1 per 1000 examinations and CDR, 3.3 per 1000; England AIR, 69.3 per 1000 and CDR, 9.1 per 1000; Norway AIR, 84.1 per 1000 and CDR, 6.3 per 1000). Conclusion Pairing a set of readers based on different pairing strategies did not show a significant difference in screening performance when compared with random pairing. © RSNA, 2023.
Collapse
Affiliation(s)
- Jessie J J Gommers
- From the Department of Medical Imaging (J.J.J.G., I.S.) and Department for Health Evidence (M.J.M.B.), Radboud University Medical Center, Geert Grooteplein 10, 6525 GA, Post 766, Nijmegen, the Netherlands; Department of Psychological and Brain Sciences, University of California-Santa Barbara, Santa Barbara, Calif (C.K.A.); Department of Oncology-Pathology, Karolinska Institute, Stockholm, Sweden (F.S.); Department of Breast Radiology, Karolinska University Hospital, Stockholm, Sweden (F.S.); Warwick Medical School, University of Warwick, Coventry, United Kingdom (S.T.P., D.J.J.); Section for Breast Cancer Screening, Cancer Registry of Norway, Oslo, Norway (M.L., S.H.); Department of Health and Care Sciences, UiT The Arctic University of Norway, Tromsø, Norway (S.H.); Dutch Expert Center for Screening (LRCB), Nijmegen, the Netherlands (I.S., M.J.M.B.); and Technical Medicine Center, University of Twente, Enschede, the Netherlands (I.S.)
| | - Craig K Abbey
- From the Department of Medical Imaging (J.J.J.G., I.S.) and Department for Health Evidence (M.J.M.B.), Radboud University Medical Center, Geert Grooteplein 10, 6525 GA, Post 766, Nijmegen, the Netherlands; Department of Psychological and Brain Sciences, University of California-Santa Barbara, Santa Barbara, Calif (C.K.A.); Department of Oncology-Pathology, Karolinska Institute, Stockholm, Sweden (F.S.); Department of Breast Radiology, Karolinska University Hospital, Stockholm, Sweden (F.S.); Warwick Medical School, University of Warwick, Coventry, United Kingdom (S.T.P., D.J.J.); Section for Breast Cancer Screening, Cancer Registry of Norway, Oslo, Norway (M.L., S.H.); Department of Health and Care Sciences, UiT The Arctic University of Norway, Tromsø, Norway (S.H.); Dutch Expert Center for Screening (LRCB), Nijmegen, the Netherlands (I.S., M.J.M.B.); and Technical Medicine Center, University of Twente, Enschede, the Netherlands (I.S.)
| | - Fredrik Strand
- From the Department of Medical Imaging (J.J.J.G., I.S.) and Department for Health Evidence (M.J.M.B.), Radboud University Medical Center, Geert Grooteplein 10, 6525 GA, Post 766, Nijmegen, the Netherlands; Department of Psychological and Brain Sciences, University of California-Santa Barbara, Santa Barbara, Calif (C.K.A.); Department of Oncology-Pathology, Karolinska Institute, Stockholm, Sweden (F.S.); Department of Breast Radiology, Karolinska University Hospital, Stockholm, Sweden (F.S.); Warwick Medical School, University of Warwick, Coventry, United Kingdom (S.T.P., D.J.J.); Section for Breast Cancer Screening, Cancer Registry of Norway, Oslo, Norway (M.L., S.H.); Department of Health and Care Sciences, UiT The Arctic University of Norway, Tromsø, Norway (S.H.); Dutch Expert Center for Screening (LRCB), Nijmegen, the Netherlands (I.S., M.J.M.B.); and Technical Medicine Center, University of Twente, Enschede, the Netherlands (I.S.)
| | - Sian Taylor-Phillips
- From the Department of Medical Imaging (J.J.J.G., I.S.) and Department for Health Evidence (M.J.M.B.), Radboud University Medical Center, Geert Grooteplein 10, 6525 GA, Post 766, Nijmegen, the Netherlands; Department of Psychological and Brain Sciences, University of California-Santa Barbara, Santa Barbara, Calif (C.K.A.); Department of Oncology-Pathology, Karolinska Institute, Stockholm, Sweden (F.S.); Department of Breast Radiology, Karolinska University Hospital, Stockholm, Sweden (F.S.); Warwick Medical School, University of Warwick, Coventry, United Kingdom (S.T.P., D.J.J.); Section for Breast Cancer Screening, Cancer Registry of Norway, Oslo, Norway (M.L., S.H.); Department of Health and Care Sciences, UiT The Arctic University of Norway, Tromsø, Norway (S.H.); Dutch Expert Center for Screening (LRCB), Nijmegen, the Netherlands (I.S., M.J.M.B.); and Technical Medicine Center, University of Twente, Enschede, the Netherlands (I.S.)
| | - David J Jenkinson
- From the Department of Medical Imaging (J.J.J.G., I.S.) and Department for Health Evidence (M.J.M.B.), Radboud University Medical Center, Geert Grooteplein 10, 6525 GA, Post 766, Nijmegen, the Netherlands; Department of Psychological and Brain Sciences, University of California-Santa Barbara, Santa Barbara, Calif (C.K.A.); Department of Oncology-Pathology, Karolinska Institute, Stockholm, Sweden (F.S.); Department of Breast Radiology, Karolinska University Hospital, Stockholm, Sweden (F.S.); Warwick Medical School, University of Warwick, Coventry, United Kingdom (S.T.P., D.J.J.); Section for Breast Cancer Screening, Cancer Registry of Norway, Oslo, Norway (M.L., S.H.); Department of Health and Care Sciences, UiT The Arctic University of Norway, Tromsø, Norway (S.H.); Dutch Expert Center for Screening (LRCB), Nijmegen, the Netherlands (I.S., M.J.M.B.); and Technical Medicine Center, University of Twente, Enschede, the Netherlands (I.S.)
| | - Marthe Larsen
- From the Department of Medical Imaging (J.J.J.G., I.S.) and Department for Health Evidence (M.J.M.B.), Radboud University Medical Center, Geert Grooteplein 10, 6525 GA, Post 766, Nijmegen, the Netherlands; Department of Psychological and Brain Sciences, University of California-Santa Barbara, Santa Barbara, Calif (C.K.A.); Department of Oncology-Pathology, Karolinska Institute, Stockholm, Sweden (F.S.); Department of Breast Radiology, Karolinska University Hospital, Stockholm, Sweden (F.S.); Warwick Medical School, University of Warwick, Coventry, United Kingdom (S.T.P., D.J.J.); Section for Breast Cancer Screening, Cancer Registry of Norway, Oslo, Norway (M.L., S.H.); Department of Health and Care Sciences, UiT The Arctic University of Norway, Tromsø, Norway (S.H.); Dutch Expert Center for Screening (LRCB), Nijmegen, the Netherlands (I.S., M.J.M.B.); and Technical Medicine Center, University of Twente, Enschede, the Netherlands (I.S.)
| | - Solveig Hofvind
- From the Department of Medical Imaging (J.J.J.G., I.S.) and Department for Health Evidence (M.J.M.B.), Radboud University Medical Center, Geert Grooteplein 10, 6525 GA, Post 766, Nijmegen, the Netherlands; Department of Psychological and Brain Sciences, University of California-Santa Barbara, Santa Barbara, Calif (C.K.A.); Department of Oncology-Pathology, Karolinska Institute, Stockholm, Sweden (F.S.); Department of Breast Radiology, Karolinska University Hospital, Stockholm, Sweden (F.S.); Warwick Medical School, University of Warwick, Coventry, United Kingdom (S.T.P., D.J.J.); Section for Breast Cancer Screening, Cancer Registry of Norway, Oslo, Norway (M.L., S.H.); Department of Health and Care Sciences, UiT The Arctic University of Norway, Tromsø, Norway (S.H.); Dutch Expert Center for Screening (LRCB), Nijmegen, the Netherlands (I.S., M.J.M.B.); and Technical Medicine Center, University of Twente, Enschede, the Netherlands (I.S.)
| | - Ioannis Sechopoulos
- From the Department of Medical Imaging (J.J.J.G., I.S.) and Department for Health Evidence (M.J.M.B.), Radboud University Medical Center, Geert Grooteplein 10, 6525 GA, Post 766, Nijmegen, the Netherlands; Department of Psychological and Brain Sciences, University of California-Santa Barbara, Santa Barbara, Calif (C.K.A.); Department of Oncology-Pathology, Karolinska Institute, Stockholm, Sweden (F.S.); Department of Breast Radiology, Karolinska University Hospital, Stockholm, Sweden (F.S.); Warwick Medical School, University of Warwick, Coventry, United Kingdom (S.T.P., D.J.J.); Section for Breast Cancer Screening, Cancer Registry of Norway, Oslo, Norway (M.L., S.H.); Department of Health and Care Sciences, UiT The Arctic University of Norway, Tromsø, Norway (S.H.); Dutch Expert Center for Screening (LRCB), Nijmegen, the Netherlands (I.S., M.J.M.B.); and Technical Medicine Center, University of Twente, Enschede, the Netherlands (I.S.)
| | - Mireille J M Broeders
- From the Department of Medical Imaging (J.J.J.G., I.S.) and Department for Health Evidence (M.J.M.B.), Radboud University Medical Center, Geert Grooteplein 10, 6525 GA, Post 766, Nijmegen, the Netherlands; Department of Psychological and Brain Sciences, University of California-Santa Barbara, Santa Barbara, Calif (C.K.A.); Department of Oncology-Pathology, Karolinska Institute, Stockholm, Sweden (F.S.); Department of Breast Radiology, Karolinska University Hospital, Stockholm, Sweden (F.S.); Warwick Medical School, University of Warwick, Coventry, United Kingdom (S.T.P., D.J.J.); Section for Breast Cancer Screening, Cancer Registry of Norway, Oslo, Norway (M.L., S.H.); Department of Health and Care Sciences, UiT The Arctic University of Norway, Tromsø, Norway (S.H.); Dutch Expert Center for Screening (LRCB), Nijmegen, the Netherlands (I.S., M.J.M.B.); and Technical Medicine Center, University of Twente, Enschede, the Netherlands (I.S.)
| |
Collapse
|
2
|
Siviengphanom S, Gandomkar Z, Lewis SJ, Brennan PC. Global Radiomic Features from Mammography for Predicting Difficult-To-Interpret Normal Cases. J Digit Imaging 2023; 36:1541-1552. [PMID: 37253894 PMCID: PMC10406750 DOI: 10.1007/s10278-023-00836-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Revised: 04/05/2023] [Accepted: 04/13/2023] [Indexed: 06/01/2023] Open
Abstract
This work aimed to investigate whether global radiomic features (GRFs) from mammograms can predict difficult-to-interpret normal cases (NCs). Assessments from 537 readers interpreting 239 normal mammograms were used to categorise cases as 120 difficult-to-interpret and 119 easy-to-interpret based on cases having the highest and lowest difficulty scores, respectively. Using lattice- and squared-based approaches, 34 handcrafted GRFs per image were extracted and normalised. Three classifiers were constructed: (i) CC and (ii) MLO using the GRFs from corresponding craniocaudal and mediolateral oblique images only, based on the random forest technique for distinguishing difficult- from easy-to-interpret NCs, and (iii) CC + MLO using the median predictive scores from both CC and MLO models. Useful GRFs for the CC and MLO models were recognised using a scree test. The CC and MLO models were trained and validated using the leave-one-out-cross-validation. The models' performances were assessed by the AUC and compared using the DeLong test. A Kruskal-Wallis test was used to examine if the 34 GRFs differed between difficult- and easy-to-interpret NCs and if difficulty level based on the traditional breast density (BD) categories differed among 115 low-BD and 124 high-BD NCs. The CC + MLO model achieved higher performance (0.71 AUC) than the individual CC and MLO model alone (0.66 each), but statistically non-significant difference was found (all p > 0.05). Six GRFs were identified to be valuable in describing difficult-to-interpret NCs. Twenty features, when compared between difficult- and easy-to-interpret NCs, differed significantly (p < 0.05). No statistically significant difference was observed in difficulty between low- and high-BD NCs (p = 0.709). GRF mammographic analysis can predict difficult-to-interpret NCs.
Collapse
Affiliation(s)
- Somphone Siviengphanom
- Medical Image Optimisation and Perception Group, Discipline of Medical Imaging Science, Sydney School of Health Sciences, Faculty of Medicine and Health, the University of Sydney, Sydney, NSW, 2006, Australia.
| | - Ziba Gandomkar
- Medical Image Optimisation and Perception Group, Discipline of Medical Imaging Science, Sydney School of Health Sciences, Faculty of Medicine and Health, the University of Sydney, Sydney, NSW, 2006, Australia
| | - Sarah J Lewis
- Medical Image Optimisation and Perception Group, Discipline of Medical Imaging Science, Sydney School of Health Sciences, Faculty of Medicine and Health, the University of Sydney, Sydney, NSW, 2006, Australia
| | - Patrick C Brennan
- Medical Image Optimisation and Perception Group, Discipline of Medical Imaging Science, Sydney School of Health Sciences, Faculty of Medicine and Health, the University of Sydney, Sydney, NSW, 2006, Australia
| |
Collapse
|
3
|
Hovda T, Sagstad S, Larsen M, Chen Y, Hofvind S. Screening outcome for interpretation by the first and second reader in a population-based mammographic screening program with independent double reading. Acta Radiol 2023; 64:2371-2378. [PMID: 37246466 DOI: 10.1177/02841851231176272] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
BACKGROUND Double reading of screening mammograms is associated with a higher rate of screen-detected cancer than single reading, but different strategies exist regarding reader pairing and blinding. Knowledge about these aspects is important when considering strategies for future use of artificial intelligence in mammographic screening. PURPOSE To investigate screening outcome, histopathological tumor characteristics, and mammographic features stratified by the first and the second reader in a population based screening program for breast cancer. MATERIAL AND METHODS The study sample consisted of data from 3,499,048 screening examinations from 834,691 women performed during 1996-2018 in BreastScreen Norway. All examinations were interpreted independently by two radiologists, 272 in total. We analyzed interpretation score, recall, and cancer detection, as well as histopathological tumor characteristics and mammographic features of the cancers, stratified by the first and second readers. RESULTS For Reader 1, the rate of positive interpretations was 4.8%, recall 2.3%, and cancer detection 0.5%. The corresponding percentages for Reader 2 were 4.9%, 2.5%, and 0.5% (P < 0.05 compared with Reader 1). No statistical difference was observed for histopathological tumor characteristics or mammographic features when stratified by Readers 1 and 2. Recall and cancer detection were statistically higher and histopathological tumor characteristics less favorable for cases detected after concordant positive compared with discordant interpretations. CONCLUSION Despite reaching statistical significance, mainly due to the large study sample, we consider the differences in interpretation scores, recall, and cancer detection between the first and second readers to be clinically negligible. For practical and clinical purposes, double reading in BreastScreen Norway is independent.
Collapse
Affiliation(s)
- Tone Hovda
- Department of Radiology, Vestre Viken Hospital Trust, Drammen, Norway
| | - Silje Sagstad
- Section for breast cancer screening, Cancer Registry of Norway, Oslo, Norway
| | - Marthe Larsen
- Section for breast cancer screening, Cancer Registry of Norway, Oslo, Norway
| | - Yan Chen
- Translational Medical Sciences, School of Medicine, University of Nottingham, Nottingham, UK
| | - Solveig Hofvind
- Section for breast cancer screening, Cancer Registry of Norway, Oslo, Norway
- Department of Health and Care Sciences, Faculty of Health Sciences, The Arctic University of Norway, Tromsø, Norway
| |
Collapse
|
4
|
Hovda T, Larsen M, Romundstad L, Sahlberg KK, Hofvind S. Breast cancer missed at screening; hindsight or mistakes? Eur J Radiol 2023; 165:110913. [PMID: 37311339 DOI: 10.1016/j.ejrad.2023.110913] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2023] [Revised: 04/01/2023] [Accepted: 05/31/2023] [Indexed: 06/15/2023]
Abstract
PURPOSE To investigate radiologists' interpretation scores of screening mammograms prior to diagnosis of screen-detected and interval breast cancers retrospectively classified as missed or true negative. METHODS We included data on radiologists' interpretation scores at screening prior to diagnosis for 1223 screen-detected and 1007 interval cancer cases classified as missed or true negative in an informed consensus-based review. All prior screening examinations were independently scored 1-5 by two radiologists; score 1 by both was considered concordant negative, score ≥ 2 by one radiologist discordant, and score ≥ 2 by both concordant positive. We analyzed associations between interpretation, review categories, mammographic features and histopathological findings using descriptive statistics and logistic regression. RESULTS Among screen-detected cancers, 31% of missed and 10% of true negative cancers had discordant or concordant positive interpretation at prior screening. The corresponding percentages for interval cancer were 21% and 8%. Age-adjusted odds ratio (OR) and 95% confidence interval (CI) for missed screen-detected cancer was 3.8 (95% CI: 2.6-5.4) after discordant and 5.5 (95% CI: 3.2-9.5) after concordant positive interpretation, using concordant negative as reference. Corresponding ORs for missed interval cancer were 3.0 (95% CI: 2.0-4.5) for discordant and 6.3 (95% CI: 2.3-17.5) for concordant positive interpretation. Asymmetry was the dominating mammographic feature at prior screening for all, except concordant positive screen-detected cancers where a mass dominated. Histopathological characteristics did not vary statistically with interpretation. CONCLUSIONS Most cancers were interpreted negatively at screening prior to diagnosis. Increased risk for missed screen-detected or interval cancer was observed after positive interpretation at prior screening.
Collapse
Affiliation(s)
- Tone Hovda
- Department of Radiology, Vestre Viken Hospital Trust, PO Box 800, 3004 Drammen, Norway.
| | - Marthe Larsen
- Section for Breast Cancer Screening, Cancer Registry of Norway, Oslo, Norway
| | - Linda Romundstad
- Department of Radiology, Vestre Viken Hospital Trust, Drammen, Norway
| | - Kristine Kleivi Sahlberg
- Department of Research and Innovation, Vestre Viken Hospital Trust, Drammen, Norway; Institute for Clinical Medicine, Faculty of Medicine, University of Oslo, Oslo, Norway
| | - Solveig Hofvind
- Section for Breast Cancer Screening, Cancer Registry of Norway, Oslo, Norway; Department of Health and Care Sciences, Faculty of Health Sciences, The Arctic University of Norway, Tromsø, Norway
| |
Collapse
|
5
|
Wang Z, Manassi M, Ren Z, Ghirardo C, Canas-Bajo T, Murai Y, Zhou M, Whitney D. Idiosyncratic biases in the perception of medical images. Front Psychol 2022; 13:1049831. [PMID: 36600706 PMCID: PMC9806180 DOI: 10.3389/fpsyg.2022.1049831] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Accepted: 11/29/2022] [Indexed: 12/23/2022] Open
Abstract
Introduction Radiologists routinely make life-altering decisions. Optimizing these decisions has been an important goal for many years and has prompted a great deal of research on the basic perceptual mechanisms that underlie radiologists' decisions. Previous studies have found that there are substantial individual differences in radiologists' diagnostic performance (e.g., sensitivity) due to experience, training, or search strategies. In addition to variations in sensitivity, however, another possibility is that radiologists might have perceptual biases-systematic misperceptions of visual stimuli. Although a great deal of research has investigated radiologist sensitivity, very little has explored the presence of perceptual biases or the individual differences in these. Methods Here, we test whether radiologists' have perceptual biases using controlled artificial and Generative Adversarial Networks-generated realistic medical images. In Experiment 1, observers adjusted the appearance of simulated tumors to match the previously shown targets. In Experiment 2, observers were shown with a mix of real and GAN-generated CT lesion images and they rated the realness of each image. Results We show that every tested individual radiologist was characterized by unique and systematic perceptual biases; these perceptual biases cannot be simply explained by attentional differences, and they can be observed in different imaging modalities and task settings, suggesting that idiosyncratic biases in medical image perception may widely exist. Discussion Characterizing and understanding these biases could be important for many practical settings such as training, pairing readers, and career selection for radiologists. These results may have consequential implications for many other fields as well, where individual observers are the linchpins for life-altering perceptual decisions.
Collapse
Affiliation(s)
- Zixuan Wang
- Department of Psychology, University of California, Berkeley, Berkeley, CA, United States,*Correspondence: Zixuan Wang,
| | - Mauro Manassi
- School of Psychology, University of Aberdeen, King’s College, Aberdeen, United Kingdom
| | - Zhihang Ren
- Department of Psychology, University of California, Berkeley, Berkeley, CA, United States,Vision Science Group, University of California, Berkeley, Berkeley, CA, United States
| | - Cristina Ghirardo
- Department of Psychology, University of California, Berkeley, Berkeley, CA, United States
| | - Teresa Canas-Bajo
- Department of Psychology, University of California, Berkeley, Berkeley, CA, United States,Vision Science Group, University of California, Berkeley, Berkeley, CA, United States
| | - Yuki Murai
- Center for Information and Neural Networks, National Institute of Information and Communications Technology, Koganei, Japan
| | - Min Zhou
- Department of Pediatrics, The First People's Hospital of Shuangliu District, Chengdu, Sichuan, China
| | - David Whitney
- Department of Psychology, University of California, Berkeley, Berkeley, CA, United States,Vision Science Group, University of California, Berkeley, Berkeley, CA, United States,Helen Wills Neuroscience Institute, University of California, Berkeley, Berkeley, CA, United States
| |
Collapse
|
6
|
Gandomkar Z, Lewis SJ, Li T, Ekpo EU, Brennan PC. A machine learning model based on readers' characteristics to predict their performances in reading screening mammograms. Breast Cancer 2022; 29:589-598. [PMID: 35122217 PMCID: PMC9226081 DOI: 10.1007/s12282-022-01335-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2021] [Accepted: 01/20/2022] [Indexed: 11/30/2022]
Abstract
OBJECTIVES Proposing a machine learning model to predict readers' performances, as measured by the area under the receiver operating characteristics curve (AUC) and lesion sensitivity, using the readers' characteristics. METHODS Data were collected from 905 radiologists and breast physicians who completed at least one case-set of 60 mammographic images containing 40 normal and 20 biopsy-proven cancer cases. Nine different case-sets were available. Using a questionnaire, we collected radiologists' demographic details, such as reading volume and years of experience. These characteristics along with a case set difficulty measure were fed into two ensemble of regression trees to predict the readers' AUCs and lesion sensitivities. We calculated the Pearson correlation coefficient between the predicted values by the model and the actual AUC and lesion sensitivity. The usefulness of the model to categorize readers as low and high performers based on different criteria was also evaluated. The performances of the models were evaluated using leave-one-out cross-validation. RESULTS The Pearson correlation coefficient between the predicted AUC and actual one was 0.60 (p < 0.001). The model's performance for differentiating the reader in the first and fourth quartile based on the AUC values was 0.86 (95% CI 0.83-0.89). The model reached an AUC of 0.91 (95% CI 0.88-0.93) for distinguishing the readers in the first quartile from the fourth one based on the lesion sensitivity. CONCLUSION A machine learning model can be used to categorize readers as high- or low-performing. Such model could be useful for screening programs for designing a targeted quality assurance and optimizing the double reading practice.
Collapse
Affiliation(s)
- Ziba Gandomkar
- Image Optimisation and Perception Group (MIOPeG), Discipline of Medical Imaging Sciences, Faculty of Medicine and Health, University of Sydney, Western Ave, Camperdown, Sydney, NSW, 2006, Australia.
| | - Sarah J Lewis
- Image Optimisation and Perception Group (MIOPeG), Discipline of Medical Imaging Sciences, Faculty of Medicine and Health, University of Sydney, Western Ave, Camperdown, Sydney, NSW, 2006, Australia
| | - Tong Li
- Image Optimisation and Perception Group (MIOPeG), Discipline of Medical Imaging Sciences, Faculty of Medicine and Health, University of Sydney, Western Ave, Camperdown, Sydney, NSW, 2006, Australia
| | - Ernest U Ekpo
- Image Optimisation and Perception Group (MIOPeG), Discipline of Medical Imaging Sciences, Faculty of Medicine and Health, University of Sydney, Western Ave, Camperdown, Sydney, NSW, 2006, Australia
| | - Patrick C Brennan
- Image Optimisation and Perception Group (MIOPeG), Discipline of Medical Imaging Sciences, Faculty of Medicine and Health, University of Sydney, Western Ave, Camperdown, Sydney, NSW, 2006, Australia
| |
Collapse
|
7
|
Diagnostic Efficacy across Dense and Non-Dense Breasts during Digital Breast Tomosynthesis and Ultrasound Assessment for Recalled Women. Diagnostics (Basel) 2022; 12:diagnostics12061477. [PMID: 35741287 PMCID: PMC9222054 DOI: 10.3390/diagnostics12061477] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Revised: 06/13/2022] [Accepted: 06/14/2022] [Indexed: 11/20/2022] Open
Abstract
Background: To compare the diagnostic efficacy of digital breast tomosynthesis (DBT) and ultrasound across breast densities in women recalled for assessment. Methods: A total of 482 women recalled for assessment from January 2017 to December 2019 were selected for the study. Women met the inclusion criteria if they had undergone DBT, ultrasound and had confirmed biopsy results. We calculated sensitivity, specificity, PPV, and AUC for DBT and ultrasound. Results: In dense breasts, DBT showed significantly higher sensitivity than ultrasound (98.2% vs. 80%; p < 0.001), but lower specificity (15.4% vs. 55%; p < 0.001), PPV (61.3% vs. 71%; p = 0.04) and AUC (0.568 vs. 0.671; p = 0.001). In non-dense breasts, DBT showed significantly higher sensitivity than ultrasound (99.2% vs. 84%; p < 0.001), but no differences in specificity (22% vs. 33%; p = 0.14), PPV (69.2% vs. 68.8%; p = 0.93) or AUC (0.606 vs. 0.583; p = 0.57). Around 73% (74% dense and 71% non-dense) and 77% (81% dense and 72% non-dense) of lesions assigned a RANZCR 3 by DBT and ultrasound, respectively, were benign. Conclusion: DBT has higher sensitivity, but lower specificity and PPV than ultrasound in women with dense breasts recalled for assessment. Most lesions rated RANZCR 3 on DBT and ultrasound are benign and may benefit from short interval follow-up rather than biopsy.
Collapse
|
8
|
Hovda T, Hoff SR, Larsen M, Romundstad L, Sahlberg KK, Hofvind S. True and Missed Interval Cancer in Organized Mammographic Screening: A Retrospective Review Study of Diagnostic and Prior Screening Mammograms. Acad Radiol 2022; 29 Suppl 1:S180-S191. [PMID: 33926794 DOI: 10.1016/j.acra.2021.03.022] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2020] [Revised: 03/22/2021] [Accepted: 03/23/2021] [Indexed: 01/22/2023]
Abstract
RATIONALE AND OBJECTIVES To explore radiological aspects of interval breast cancer in a population-based screening program. MATERIALS AND METHODS We performed a consensus-based informed review of mammograms from diagnosis and prior screening from women diagnosed with interval cancer 2004-2016 in BreastScreen Norway. Cases were classified as true (no findings on prior screening mammograms), occult (no findings at screening or diagnosis), minimal signs (minor/non-specific findings) and missed (obvious findings). We analyzed mammographic findings, density, time since prior screening, and histopathological characteristics between the classification groups. RESULTS The study included 1010 interval cancer cases. Mean age at diagnosis was 61 years (SD = 6), mean time between screening and diagnosis 14 months (SD = 7). A total of 48% (479/1010) were classified as true or occult, 28% (285/1010) as minimal signs and 24% (246/1010) as missed. We observed no differences in mammographic density between the groups, except from a higher percentage of dense breasts in women with occult cancer. Among cancers classified as missed, about 1/3 were masses and 1/3 asymmetries at prior screening. True interval cancers were diagnosed later in the screening interval than the other classification categories. No differences in histopathological characteristics were observed between true, minimal signs and missed cases. CONCLUSION In an informed review, 24% of the interval cancers were classified as missed based on visibility and mammographic findings on prior screening mammograms. Three out of four true interval cancers were diagnosed in the second year of the screening interval. We observed no statistical differences in histopathological characteristics between true and missed interval cancers.
Collapse
Affiliation(s)
- Tone Hovda
- Department of Radiology, Vestre Viken Hospital Trust, PO Box 800, 3004 Drammen, Norway; Institute of Clinical Medicine, University of Oslo, PO Box 1171 Blindern, 0318 Oslo, Norway
| | - Solveig Roth Hoff
- Department of Radiology, Ålesund hospital, Møre og Romsdal Hospital Trust, Åsehaugen 5, 6017 Ålesund, Norway; NTNU, Faculty of Medicine and Health Sciences, Department of Circulation and Medical Imaging, PO Box 8905, 7491 Trondheim, Norway
| | - Marthe Larsen
- Section for breast cancer screening, Cancer Registry of Norway, PO Box 5313 Majorstuen, 0304 Oslo, Norway
| | - Linda Romundstad
- Department of Radiology, Vestre Viken Hospital Trust, PO Box 800, 3004 Drammen, Norway
| | - Kristine Kleivi Sahlberg
- Department of Research and Innovation, Vestre Viken Hospital Trust, PO Box 800, 3004 Drammen, Norway; Department of Tumor Biology, Institute for Cancer Research, Oslo University Hospital Trust, PO Box 4950, 0424 Oslo, Norway
| | - Solveig Hofvind
- Faculty of Health Science, Oslo Metropolitan University, PO Box 4 St. Olavs plass, 0130 Oslo, Norway.
| |
Collapse
|
9
|
Hooshmand S, Reed WM, Suleiman ME, Brennan PC. SCREENING MAMMOGRAPHY: DIAGNOSTIC EFFICACY-ISSUES AND CONSIDERATIONS FOR THE 2020S. RADIATION PROTECTION DOSIMETRY 2021; 197:54-62. [PMID: 34729603 DOI: 10.1093/rpd/ncab160] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/07/2021] [Revised: 10/04/2021] [Accepted: 10/08/2021] [Indexed: 06/13/2023]
Abstract
Diagnostic efficacy in medical imaging is ultimately a reflection of radiologist performance. This can be influenced by numerous factors, some of which are patient related, such as the physical size and density of the breast, and machine related, where some lesions are difficult to visualise on traditional imaging techniques. Other factors are human reader errors that occur during the diagnostic process, which relate to reader experience and their perceptual and cognitive oversights. Given the large-scale nature of breast cancer screening, even small increases in diagnostic performance equate to large numbers of women saved. It is important to identify the causes of diagnostic errors and how detection efficacy can be improved. This narrative review will therefore explore the various factors that influence mammographic performance and the potential solutions used in an attempt to ameliorate the errors made.
Collapse
Affiliation(s)
- Sahand Hooshmand
- Faculty of Medicine and Health, The Discipline of Medical Imaging Sciences, The University of Sydney, Susan Wakil Health Building (D18), Sydney, NSW 2050, Australia
| | - Warren M Reed
- Faculty of Medicine and Health, The Discipline of Medical Imaging Sciences, The University of Sydney, Susan Wakil Health Building (D18), Sydney, NSW 2050, Australia
| | - Mo'ayyad E Suleiman
- Faculty of Medicine and Health, The Discipline of Medical Imaging Sciences, The University of Sydney, Susan Wakil Health Building (D18), Sydney, NSW 2050, Australia
| | - Patrick C Brennan
- Faculty of Medicine and Health, The Discipline of Medical Imaging Sciences, The University of Sydney, Susan Wakil Health Building (D18), Sydney, NSW 2050, Australia
| |
Collapse
|
10
|
Akpan E, Kitundu J, Ekpo E. Public Health Radiography: A Scoping Review of Benefits, and Growth Opportunities for Radiographers. J Med Imaging Radiat Sci 2021; 52:615-625. [PMID: 34531164 DOI: 10.1016/j.jmir.2021.08.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2020] [Revised: 06/17/2021] [Accepted: 08/06/2021] [Indexed: 10/20/2022]
Abstract
INTRODUCTION There is growing adoption of radiographic techniques in public health to improve outcomes of chronic and communicable diseases. This review examines the applications, benefits, and implications of radiography in public health. It also examines the challenges and potential advanced practice roles for radiographers in public health radiography (PHR). METHODOLOGY Preferred Reporting Items for Systematic Reviews and Meta-Analyses - Scoping review extension (PRISMA- ScR) checklist was employed, and the search was conducted using PubMed, Medline, Web of Science, ScienceDirect, and Google Scholar to identify relevant articles that explored the concept of radiography in public health. Evidence was analysed using an inductive iterative approach. RESULTS Radiographic imaging modalities such as ultrasound, computed tomography, and plain X-ray had wide applicability in public health fields of preventive cardiology, preventive oncology, maternal health, infectious disease epidemiology, and radiographic informatics. PHR effectively reduced mortality, improved outcomes, informed lifestyle changes to mitigate the risk of impending disease. PHR also helped in monitoring disease progression and predicting treatment outcomes. However, evidence establishing a competency framework that supports PHR is scarce. CONCLUSION Radiography makes a significant contribution to public health in reducing mortality and morbidity. Therefore, developing a PHR competency framework can accentuate the contribution Radiographers make to solving public health issues.
Collapse
Affiliation(s)
- Eyo Akpan
- Grayscale International Ltd, Lagos, Nigeria.
| | - Jane Kitundu
- Vijibweni District Hospital, Kigamboni Municipal, Dar es Salaam, Tanzania
| | - Ernest Ekpo
- Image Optimisation and Perception Group, Discipline of Medical Imaging Science, School of Health Sciences, Faculty of Medicine and Health, The University of Sydney, Cumberland Campus C42
- 75 East Street, Lidcombe, NS, W
- 2141
| |
Collapse
|
11
|
Frazer HM, Qin AK, Pan H, Brotchie P. Evaluation of deep learning-based artificial intelligence techniques for breast cancer detection on mammograms: Results from a retrospective study using a BreastScreen Victoria dataset. J Med Imaging Radiat Oncol 2021; 65:529-537. [PMID: 34212526 PMCID: PMC8456839 DOI: 10.1111/1754-9485.13278] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2021] [Accepted: 06/15/2021] [Indexed: 11/28/2022]
Abstract
Introduction This study aims to evaluate deep learning (DL)‐based artificial intelligence (AI) techniques for detecting the presence of breast cancer on a digital mammogram image. Methods We evaluated several DL‐based AI techniques that employ different approaches and backbone DL models and tested the effect on performance of using different data‐processing strategies on a set of digital mammographic images with annotations of pathologically proven breast cancer. Results Our evaluation uses the area under curve (AUC) and accuracy (ACC) for performance measurement. The best evaluation result, based on 349 test cases (930 test images), was an AUC of 0.8979 [95% confidence interval (CI) 0.873, 0.923] and ACC of 0.8178 [95% CI 0.785, 0.850]. This was achieved by an AI technique that utilises a certain family of DL models, namely ResNet, as its backbone, combines the global features extracted from the whole mammogram and the local features extracted from the automatically detected cancer and non‐cancer local regions in the whole image, and leverages background cropping and text removal, contrast adjustment and more training data. Conclusion DL‐based AI techniques have shown promising results in retrospective studies for many medical image analysis applications. Our study demonstrates a significant opportunity to boost the performance of such techniques applied to breast cancer detection by exploring different types of approaches, backbone DL models and data‐processing strategies. The promising results we have obtained suggest further development of AI reading services could transform breast cancer screening in the future.
Collapse
Affiliation(s)
- Helen Ml Frazer
- St Vincent's BreastScreen, St Vincent's Hospital Melbourne, Melbourne, Victoria, Australia
| | - Alex K Qin
- Department of Computer Science and Software Engineering, Swinburne University of Technology, Melbourne, Victoria, Australia
| | - Hong Pan
- Department of Computer Science and Software Engineering, Swinburne University of Technology, Melbourne, Victoria, Australia
| | - Peter Brotchie
- Department of Medical Imaging, St Vincent's Hospital Melbourne, Melbourne, Victoria, Australia
| |
Collapse
|
12
|
Cornford E, Cheung S, Press M, Kearins O, Taylor-Phillips S. Optimum screening mammography reading volumes: evidence from the NHS Breast Screening Programme. Eur Radiol 2021; 31:6909-6915. [PMID: 33630161 DOI: 10.1007/s00330-021-07754-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2020] [Revised: 01/06/2021] [Accepted: 02/04/2021] [Indexed: 11/28/2022]
Abstract
OBJECTIVES Minimum caseload standards for professionals examining breast screening mammograms vary from 480 (US) to 5000 (Europe). We measured the relationship between the number of women's mammograms examined per year and reader performance. METHODS We extracted routine records from the English NHS Breast Screening Programme for readers examining between 1000 and 45,000 mammograms between April 2014 and March 2017. We measured the relationship between the volume of cases read and screening performance (cancer detection rate, recall rate, positive predictive value of recall (PPV) and discrepant cancers) using linear logistic regression. We also examined the effect of reader occupational group on performance. RESULTS In total, 759 eligible mammography readers (445 consultant radiologists, 235 radiography advanced practitioners, 79 consultant radiographers) examined 6.1 million women's mammograms during the study period. PPV increased from 12.9 to 14.4 to 17.0% for readers examining 2000, 5000 and 10000 cases per year respectively. This was driven by decreases in recall rates from 5.8 to 5.3 to 4.5 with increasing volume read, and no change in cancer detection rate (from 7.6 to 7.6 to 7.7). There was no difference in cancer detection rate with reader occupational group. Consultant radiographers had higher recall rate and lower PPV compared to radiologists (OR 1.105, p = 0.012; OR 0.874, p = 0.002, unadjusted). CONCLUSION Positive predictive value of screening increases with the total volume of cases examined per reader, through decreases in numbers of cases recalled with no concurrent change in numbers of cancers detected. KEY POINTS • In the English Breast Screening Programme, readers who examined a larger number of cases per year had a higher positive predictive value, because they recalled fewer women for further tests but detected the same number of cancers. • Reader type did not affect cancer detection rate, but consultant radiographers had a higher recall rate and lower positive predictive value than consultant radiologists, although this was not adjusted for length of experience.
Collapse
Affiliation(s)
- Eleanor Cornford
- Thirlestaine Breast Unit, Cobalt House, Gloucestershire Hospitals NHS Foundation Trust, Thirlestaine Road, Cheltenham, Gloucestershire, GL53 7AS, UK.
| | - Shan Cheung
- Public Health England, 5 St Philips Place, Birmingham, B3 2PW, UK
| | - Mike Press
- Screening QA Service (South) Public Health England, Birmingham, UK
| | - Olive Kearins
- National Lead Breast Screening Research & Data, Screening Division, Public Health England, Birmingham, UK
| | - Sian Taylor-Phillips
- Division of Health Sciences, Warwick Medical School, University of Warwick, Coventry, CV4 7A, UK
| |
Collapse
|
13
|
Cohen EO, Lesslie M, Weaver O, Phalak K, Tso H, Perry R, Leung JWT. Batch Reading and Interrupted Interpretation of Digital Screening Mammograms Without and With Tomosynthesis. J Am Coll Radiol 2020; 18:280-293. [PMID: 32861601 DOI: 10.1016/j.jacr.2020.07.033] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2020] [Revised: 07/23/2020] [Accepted: 07/29/2020] [Indexed: 11/19/2022]
Abstract
OBJECTIVE To compare batch reading and interrupted interpretation for modern screening mammography. METHODS We retrospectively reviewed digital mammograms without and with tomosynthesis that were originally interpreted with batch reading or interrupted interpretation between January 2015 and June 2017. The following performance metrics were compared: recall rate (per 100 examinations), cancer detection rate (per 1,000 examinations), and positive predictive values for recall and biopsy. RESULTS In all, 9,832 digital mammograms were batch read, yielding a recall rate of 9.98%, cancer detection rate of 4.27, and positive predictive values for recall and biopsy of 4.40% and 35.5%, respectively. There were 49,496 digital mammograms that were read with interrupted interpretation, yielding a recall rate of 11.3%, cancer detection rate of 4.44, and positive predictive values for recall and biopsy of 3.92% and 30.1%, respectively. Of the digital mammograms with tomosynthesis, 7,075 were batch read, yielding a recall rate of 6.98%, cancer detection rate of 5.37, and positive predictive values for recall and biopsy of 7.69% and 38.0%, respectively. Of the digital mammograms with tomosynthesis, 24,380 were read with interrupted interpretation, yielding a recall rate of 8.30%, cancer detection rate of 5.41, and positive predictive values for recall and biopsy of 6.52% and 33.3%, respectively. For both digital mammograms without and with tomosynthesis, recall rates improved with batch reading compared with interrupted interpretation (P < .001), but no significant differences were seen for other metrics. DISCUSSION Batch reading digital mammograms without and with tomosynthesis improves recall rates while maintaining cancer detection rates and positive predictive values compared with interrupted interpretation.
Collapse
Affiliation(s)
- Ethan O Cohen
- Faculty Lead of Marketing, Department of Breast Imaging, The University of Texas MD Anderson Cancer Center, Houston, Texas.
| | - Michele Lesslie
- Department of Breast Imaging, The University of Texas MD Anderson Cancer Center, Houston, Texas
| | - Olena Weaver
- Director of Bone Densitometry, Department of Breast Imaging, The University of Texas MD Anderson Cancer Center, Houston, Texas
| | - Kanchan Phalak
- Patient Safety Officer, Department of Breast Imaging, The University of Texas MD Anderson Cancer Center, Houston, Texas
| | - Hilda Tso
- Department of Breast Imaging, The University of Texas MD Anderson Cancer Center, Houston, Texas
| | - Rachel Perry
- Department of Breast Imaging, The University of Texas MD Anderson Cancer Center, Houston, Texas
| | - Jessica W T Leung
- Deputy Chair, Department of Breast Imaging, The University of Texas MD Anderson Cancer Center, Houston, Texas
| |
Collapse
|
14
|
Taylor-Phillips S, Stinton C. Double reading in breast cancer screening: considerations for policy-making. Br J Radiol 2020; 93:20190610. [PMID: 31617741 PMCID: PMC7055445 DOI: 10.1259/bjr.20190610] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2019] [Revised: 10/09/2019] [Accepted: 10/13/2019] [Indexed: 01/04/2023] Open
Abstract
In this article, we explore the evidence around the relative benefits and harms of breast cancer screening using a single radiologist to examine each female's mammograms for signs of cancer (single reading), or two radiologists (double reading). First, we briefly explore the historical evidence using film-screen mammography, before providing an in-depth description of evidence using digital mammography. We classify studies according to which exact version of double reading they use, because the evidence suggests that effectiveness of double reading is contingent on whether the two radiologists are blinded to one another's decisions, and how the decisions of the two radiologists are integrated. Finally, we explore the implications for future mammography, including using artificial intelligence as the second reader, and applications to more complex three-dimensional imaging techniques such as tomosynthesis.
Collapse
Affiliation(s)
| | - Chris Stinton
- Warwick Medical School, University of Warwick, Coventry, CV4 7AL, England
| |
Collapse
|