Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Miglioretti DL, Gard CC, Carney PA, Onega TL, Buist DSM, Sickles EA, Kerlikowske K, Rosenberg RD, Yankaskas BC, Geller BM, Elmore JG. When radiologists perform best: the learning curve in screening mammogram interpretation. Radiology 2009;253:632-40. [PMID: 19789234 PMCID: PMC2786195 DOI: 10.1148/radiol.2533090070] [Citation(s) in RCA: 93] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

For:	Miglioretti DL, Gard CC, Carney PA, Onega TL, Buist DSM, Sickles EA, Kerlikowske K, Rosenberg RD, Yankaskas BC, Geller BM, Elmore JG. When radiologists perform best: the learning curve in screening mammogram interpretation. Radiology 2009;253:632-40. [PMID: 19789234 PMCID: PMC2786195 DOI: 10.1148/radiol.2533090070] [Citation(s) in RCA: 93] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

Number

Cited by Other Article(s)

Bellmann Q, Peng Y, Genske U, Yan L, Wagner M, Jahnke P. Low-contrast lesion detection in neck CT: a multireader study comparing deep learning, iterative, and filtered back projection reconstructions using realistic phantoms. Eur Radiol Exp 2024;8:84. [PMID: 39046565 PMCID: PMC11269546 DOI: 10.1186/s41747-024-00486-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2024] [Accepted: 06/18/2024] [Indexed: 07/25/2024] Open

Abstract

BACKGROUND

Computed tomography (CT) reconstruction algorithms can improve image quality, especially deep learning reconstruction (DLR). We compared DLR, iterative reconstruction (IR), and filtered back projection (FBP) for lesion detection in neck CT.

METHODS

Nine patient-mimicking neck phantoms were examined with a 320-slice scanner at six doses: 0.5, 1, 1.6, 2.1, 3.1, and 5.2 mGy. Each of eight phantoms contained one circular lesion (diameter 1 cm; contrast -30 HU to the background) in the parapharyngeal space; one phantom had no lesions. Reconstruction was made using FBP, IR, and DLR. Thirteen readers were tasked with identifying and localizing lesions in 32 images with a lesion and 20 without lesions for each dose and reconstruction algorithm. Receiver operating characteristic (ROC) and localization ROC (LROC) analysis were performed.

RESULTS

DLR improved lesion detection with ROC area under the curve (AUC) 0.724 ± 0.023 (mean ± standard error of the mean) using DLR versus 0.696 ± 0.021 using IR (p = 0.037) and 0.671 ± 0.023 using FBP (p < 0.001). Likewise, DLR improved lesion localization, with LROC AUC 0.407 ± 0.039 versus 0.338 ± 0.041 using IR (p = 0.002) and 0.313 ± 0.044 using FBP (p < 0.001). Dose reduction to 0.5 mGy compromised lesion detection in FBP-reconstructed images compared to doses ≥ 2.1 mGy (p ≤ 0.024), while no effect was observed with DLR or IR (p ≥ 0.058).

CONCLUSION

DLR improved the detectability of lesions in neck CT imaging. Dose reduction to 0.5 mGy maintained lesion detectability when denoising reconstruction was used.

RELEVANCE STATEMENT

Deep learning enhances lesion detection in neck CT imaging compared to iterative reconstruction and filtered back projection, offering improved diagnostic performance and potential for x-ray dose reduction.

KEY POINTS

Low-contrast lesion detectability was assessed in anatomically realistic neck CT phantoms. Deep learning reconstruction (DLR) outperformed filtered back projection and iterative reconstruction. Dose has little impact on lesion detectability against anatomical background structures.

Collapse

Kim HJ, Choi WJ, Gwon HY, Jang SJ, Chae EY, Shin HJ, Cha JH, Kim HH. Improving mammography interpretation for both novice and experienced readers: a comparative study of two commercial artificial intelligence software. Eur Radiol 2024;34:3924-3934. [PMID: 37938383 DOI: 10.1007/s00330-023-10422-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 09/15/2023] [Accepted: 10/14/2023] [Indexed: 11/09/2023]

Abstract

OBJECTIVES

To evaluate the improvement of mammography interpretation for novice and experienced radiologists assisted by two commercial AI software.

METHODS

We compared the performance of two AI software (AI-1 and AI-2) in two experienced and two novice readers for 200 mammographic examinations (80 cancer cases). Two reading sessions were conducted within 4 weeks. The readers rated the likelihood of malignancy (range, 1-7) and the percentage probability of malignancy (range, 0-100%), with and without AI assistance. Differences in AUROC, sensitivity, and specificity were analyzed.

RESULTS

Mean AUROC increased in both novice (0.86 to 0.90 with AI-1 [p = 0.005]; 0.91 with AI-2 [p < 0.001]) and experienced readers (0.87 to 0.92 with AI-1 [p < 0.001]; 0.90 with AI-2 [p = 0.004]). Sensitivities increased from 81.3 to 88.8% with AI-1 (p = 0.027) and to 91.3% with AI-2 (p = 0.005) in novice readers, and from 81.9 to 90.6% with AI-1 (p = 0.001) and to 87.5% with AI-2 (p = 0.016) in experienced readers. Specificity did not decrease significantly in both novice (p > 0.999, both) and experienced readers (p > 0.999 with AI-1 and 0.282 with AI-2). There was no significant difference in the performance change depending on the type of AI software (p > 0.999).

CONCLUSION

Commercial AI software improved the diagnostic performance of both novice and experienced readers. The type of AI software used did not significantly impact performance changes. Further validation with a larger number of cases and readers is needed.

CLINICAL RELEVANCE STATEMENT

Commercial AI software effectively aided mammography interpretation irrespective of the experience level of human readers.

KEY POINTS

• Mammography interpretation remains challenging and is subject to a wide range of interobserver variability. • In this multi-reader study, two commercial AI software improved the sensitivity of mammography interpretation by both novice and experienced readers. The type of AI software used did not significantly impact performance changes. • Commercial AI software may effectively support mammography interpretation irrespective of the experience level of human readers.

Collapse

Bragg AC, Chung HL, Speer ME, Shin K, Sun J, Leung JWT. Screening chest wall ultrasound in the mastectomy patient. Clin Imaging 2024;108:110114. [PMID: 38460253 DOI: 10.1016/j.clinimag.2024.110114] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2023] [Revised: 02/23/2024] [Accepted: 02/26/2024] [Indexed: 03/11/2024]

Seth I, Lim B, Joseph K, Gracias D, Xie Y, Ross RJ, Rozen WM. Use of artificial intelligence in breast surgery: a narrative review. Gland Surg 2024;13:395-411. [PMID: 38601286 PMCID: PMC11002485 DOI: 10.21037/gs-23-414] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Accepted: 02/21/2024] [Indexed: 04/12/2024]

Abstract

Background and Objective

We have witnessed tremendous advances in artificial intelligence (AI) technologies. Breast surgery, a subspecialty of general surgery, has notably benefited from AI technologies. This review aims to evaluate how AI has been integrated into breast surgery practices, to assess its effectiveness in improving surgical outcomes and operational efficiency, and to identify potential areas for future research and application.

Methods

Two authors independently conducted a comprehensive search of PubMed, Google Scholar, EMBASE, and Cochrane CENTRAL databases from January 1, 1950, to September 4, 2023, employing keywords pertinent to AI in conjunction with breast surgery or cancer. The search focused on English language publications, where relevance was determined through meticulous screening of titles, abstracts, and full-texts, followed by an additional review of references within these articles. The review covered a range of studies illustrating the applications of AI in breast surgery encompassing lesion diagnosis to postoperative follow-up. Publications focusing specifically on breast reconstruction were excluded.

Key Content and Findings

AI models have preoperative, intraoperative, and postoperative applications in the field of breast surgery. Using breast imaging scans and patient data, AI models have been designed to predict the risk of breast cancer and determine the need for breast cancer surgery. In addition, using breast imaging scans and histopathological slides, models were used for detecting, classifying, segmenting, grading, and staging breast tumors. Preoperative applications included patient education and the display of expected aesthetic outcomes. Models were also designed to provide intraoperative assistance for precise tumor resection and margin status assessment. As well, AI was used to predict postoperative complications, survival, and cancer recurrence.

Conclusions

Extra research is required to move AI models from the experimental stage to actual implementation in healthcare. With the rapid evolution of AI, further applications are expected in the coming years including direct performance of breast surgery. Breast surgeons should be updated with the advances in AI applications in breast surgery to provide the best care for their patients.

Collapse

De Luca F, Finnbogason T, Kvist O. Specialist learning curves and clinical feasibility of introducing a new MRI grading system for skeletal maturity. BJR Open 2024;6:tzae008. [PMID: 38680899 PMCID: PMC11052657 DOI: 10.1093/bjro/tzae008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Revised: 09/05/2023] [Accepted: 04/02/2024] [Indexed: 05/01/2024] Open

Abstract

Objective

MRI is an emerging imaging modality to assess skeletal maturity. This study aimed to chart the learning curves of paediatric radiologists when using an unfamiliar MRI grading system of skeletal maturity and to assess the clinical feasibility of implementing said system.

Methods

958 healthy paediatric volunteers were prospectively included in a dual-facility study. Each subject underwent a conventional MRI scan at 1.5 T. To perform the image reading, the participants were grouped into five subsets (subsets 1-5) of equal size (n∼192) in chronological order for scan acquisition. Two paediatric radiologists (R1-2) with different levels of MRI experience, both of whom were previously unfamiliar with the study's MRI grading system, independently evaluated the subsets to assess skeletal maturity in five different growth plate locations. Congruent cases at blinded reading established the consensus reading. For discrepant cases, the consensus reading was obtained through an unblinded reading by a third paediatric radiologist (R3), also unfamiliar with the MRI grading system. Further, R1 performed a second blinded image reading for all included subjects with a memory wash-out of 180 days. Weighted Cohen kappa was used to assess interreader reliability (R1 vs consensus; R2 vs consensus) at non-cumulative and cumulative time points, as well as interreader (R1 vs R2) and intrareader (R1 vs R1) reliability at non-cumulative time points.

Results

Mean weighted Cohen kappa values for each pair of blinded readers compared to consensus reading (interreader reliability, R1-2 vs consensus) were ≥0.85, showing a strong to almost perfect interreader agreement at both non-cumulative and cumulative time points and in all growth plate locations. Weighted Cohen kappa values for interreader (R1 vs R2) and intrareader reliability (R1 vs R1) were ≥0.72 at non-cumulative time points, with values ≥0.82 at subset 5.

Conclusions

Paediatric radiologists' clinical confidence when introduced to a new MRI grading system for skeletal maturity was high from the outset of their learning curve, despite the radiologists' varying levels of work experience with MRI assessment. The MRI grading system for skeletal maturity investigated in this study is a robust clinical method when used by paediatric radiologists and can be used in clinical practice.

Advances in knowledge

Radiologists with fellowship training in paediatric radiology experienced no learning curve progress when introduced to a new MRI grading system for skeletal maturity and achieved desirable agreement from the first time point of the learning curve. The robustness of the investigated MRI grading system was not affected by the earlier different levels of MRI experience among the readers.

Collapse

Kim YS, Lee SH, Kim SY, Kim ES, Park AR, Chang JM, Park VY, Yoon JH, Kang BJ, Yun BL, Kim TH, Ko ES, Chu AJ, Kim JY, Youn I, Chae EY, Choi WJ, Kim HJ, Kang SH, Ha SM, Moon WK. Unenhanced Breast MRI With Diffusion-Weighted Imaging for Breast Cancer Detection: Effects of Training on Performance and Agreement of Subspecialty Radiologists. Korean J Radiol 2024;25:11-23. [PMID: 38184765 PMCID: PMC10788600 DOI: 10.3348/kjr.2023.0528] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2023] [Revised: 10/26/2023] [Accepted: 10/30/2023] [Indexed: 01/08/2024] Open

Abstract

OBJECTIVE

To investigate whether reader training improves the performance and agreement of radiologists in interpreting unenhanced breast magnetic resonance imaging (MRI) scans using diffusion-weighted imaging (DWI).

MATERIALS AND METHODS

A study of 96 breasts (35 cancers, 24 benign, and 37 negative) in 48 asymptomatic women was performed between June 2019 and October 2020. High-resolution DWI with b-values of 0, 800, and 1200 sec/mm² was performed using a 3.0-T system. Sixteen breast radiologists independently reviewed the DWI, apparent diffusion coefficient maps, and T1-weighted MRI scans and recorded the Breast Imaging Reporting and Data System (BI-RADS) category for each breast. After a 2-h training session and a 5-month washout period, they re-evaluated the BI-RADS categories. A BI-RADS category of 4 (lesions with at least two suspicious criteria) or 5 (more than two suspicious criteria) was considered positive. The per-breast diagnostic performance of each reader was compared between the first and second reviews. Inter-reader agreement was evaluated using a multi-rater κ analysis and intraclass correlation coefficient (ICC).

RESULTS

Before training, the mean sensitivity, specificity, and accuracy of the 16 readers were 70.7% (95% confidence interval [CI]: 59.4-79.9), 90.8% (95% CI: 85.6-94.2), and 83.5% (95% CI: 78.6-87.4), respectively. After training, significant improvements in specificity (95.2%; 95% CI: 90.8-97.5; P = 0.001) and accuracy (85.9%; 95% CI: 80.9-89.8; P = 0.01) were observed, but no difference in sensitivity (69.8%; 95% CI: 58.1-79.4; P = 0.58) was observed. Regarding inter-reader agreement, the κ values were 0.57 (95% CI: 0.52-0.63) before training and 0.68 (95% CI: 0.62-0.74) after training, with a difference of 0.11 (95% CI: 0.02-0.18; P = 0.01). The ICC was 0.73 (95% CI: 0.69-0.74) before training and 0.79 (95% CI: 0.76-0.80) after training (P = 0.002).

CONCLUSION

Brief reader training improved the performance and agreement of interpretations by breast radiologists using unenhanced MRI with DWI.

Collapse

Affiliation(s)

Yeon Soo Kim Department of Radiology, Seoul National University Hospital, Seoul, Republic of Korea Department of Radiology, Seoul National University College of Medicine, Seoul, Republic of Korea
Su Hyun Lee Department of Radiology, Seoul National University Hospital, Seoul, Republic of Korea Department of Radiology, Seoul National University College of Medicine, Seoul, Republic of Korea
Soo-Yeon Kim Department of Radiology, Seoul National University Hospital, Seoul, Republic of Korea Department of Radiology, Seoul National University College of Medicine, Seoul, Republic of Korea
Eun Sil Kim Department of Radiology, Seoul National University Hospital, Seoul, Republic of Korea Department of Radiology, Seoul National University College of Medicine, Seoul, Republic of Korea
Ah Reum Park Department of Radiology, Seoul National University Hospital, Seoul, Republic of Korea Department of Radiology, Seoul National University College of Medicine, Seoul, Republic of Korea
Jung Min Chang Department of Radiology, Seoul National University Hospital, Seoul, Republic of Korea Department of Radiology, Seoul National University College of Medicine, Seoul, Republic of Korea
Vivian Youngjean Park Department of Radiology, Severance Hospital, Research Institute of Radiological Science, Yonsei University College of Medicine, Seoul, Republic of Korea
Jung Hyun Yoon Department of Radiology, Severance Hospital, Research Institute of Radiological Science, Yonsei University College of Medicine, Seoul, Republic of Korea
Bong Joo Kang Department of Radiology, Seoul St. Mary's Hospital, College of Medicine, The Catholic University of Korea, Seoul, Republic of Korea
Bo La Yun Department of Radiology, Seoul National University College of Medicine, Seoul, Republic of Korea Department of Radiology, Seoul National University Bundang Hospital, Seoul, Republic of Korea
Tae Hee Kim Department of Radiology, Ajou University Medical Center, Suwon, Republic of Korea
Eun Sook Ko Department of Radiology and Center for Imaging Science, Samsung Medical Center, Seoul, Republic of Korea
A Jung Chu Department of Radiology, Seoul Metropolitan Government-Seoul National University Boramae Medical Center, Seoul, Republic of Korea
Jin You Kim Department of Radiology, Medical Research Institute, Pusan National University Hospital, Pusan National University School of Medicine, Busan, Republic of Korea
Inyoung Youn Department of Radiology, Kangbuk Samsung Hospital, Sungkyunkwan University School of Medicine, Seoul, Republic of Korea
Eun Young Chae Department of Radiology and Research Institute of Radiology, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Republic of Korea
Woo Jung Choi Department of Radiology and Research Institute of Radiology, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Republic of Korea
Hee Jeong Kim Department of Radiology and Research Institute of Radiology, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Republic of Korea
Soo Hee Kang Medical Research Collaborating Center, Seoul National University Hospital, Seoul, Republic of Korea
Su Min Ha Department of Radiology, Seoul National University Hospital, Seoul, Republic of Korea Department of Radiology, Seoul National University College of Medicine, Seoul, Republic of Korea Institute of Radiation Medicine, Seoul National University Medical Research Center, Seoul, Republic of Korea.
Woo Kyung Moon Department of Radiology, Seoul National University Hospital, Seoul, Republic of Korea Department of Radiology, Seoul National University College of Medicine, Seoul, Republic of Korea Institute of Radiation Medicine, Seoul National University Medical Research Center, Seoul, Republic of Korea

Collapse

Yoon JH, Han K, Suh HJ, Youk JH, Lee SE, Kim EK. Artificial intelligence-based computer-assisted detection/diagnosis (AI-CAD) for screening mammography: Outcomes of AI-CAD in the mammographic interpretation workflow. Eur J Radiol Open 2023;11:100509. [PMID: 37484980 PMCID: PMC10362167 DOI: 10.1016/j.ejro.2023.100509] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2023] [Revised: 07/03/2023] [Accepted: 07/09/2023] [Indexed: 07/25/2023] Open

Kim H, Choi JS, Kim K, Ko ES, Ko EY, Han BK. Effect of artificial intelligence-based computer-aided diagnosis on the screening outcomes of digital mammography: a matched cohort study. Eur Radiol 2023;33:7186-7198. [PMID: 37188881 DOI: 10.1007/s00330-023-09692-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2022] [Revised: 02/21/2023] [Accepted: 03/09/2023] [Indexed: 05/17/2023]

Abstract

OBJECTIVE

To investigate whether artificial intelligence-based computer-aided diagnosis (AI-CAD) can improve radiologists' performance when used to support radiologists' interpretation of digital mammography (DM) in breast cancer screening.

METHODS

A retrospective database search identified 3158 asymptomatic Korean women who consecutively underwent screening DM between January and December 2019 without AI-CAD support, and screening DM between February and July 2020 with image interpretation aided by AI-CAD in a tertiary referral hospital using single reading. Propensity score matching was used to match the DM with AI-CAD group in a 1:1 ratio with the DM without AI-CAD group according to age, breast density, experience level of the interpreting radiologist, and screening round. Performance measures were compared with the McNemar test and generalized estimating equations.

RESULTS

A total of 1579 women who underwent DM with AI-CAD were matched with 1579 women who underwent DM without AI-CAD. Radiologists showed higher specificity (96% [1500 of 1563] vs 91.6% [1430 of 1561]; p < 0.001) and lower abnormal interpretation rates (AIR) (4.9% [77 of 1579] vs 9.2% [145 of 1579]; p < 0.001) with AI-CAD than without. There was no significant difference in the cancer detection rate (CDR) (AI-CAD vs no AI-CAD, 8.9 vs 8.9 per 1000 examinations; p = 0.999), sensitivity (87.5% vs 77.8%; p = 0.999), and positive predictive value for biopsy (PPV3) (35.0% vs 35.0%; p = 0.999) according to AI-CAD support.

CONCLUSIONS

AI-CAD increases the specificity for radiologists without decreasing sensitivity as a supportive tool in the single reading of DM for breast cancer screening.

CLINICAL RELEVANCE STATEMENT

This study shows that AI-CAD could improve the specificity of radiologists' DM interpretation in the single reading system without decreasing sensitivity, suggesting that it can benefit patients by reducing false positive and recall rates.

KEY POINTS

• In this retrospective-matched cohort study (DM without AI-CAD vs DM with AI-CAD), radiologists showed higher specificity and lower AIR when AI-CAD was used to support decision-making in DM screening. • CDR, sensitivity, and PPV for biopsy did not differ with and without AI-CAD support.

Collapse

Davis SE, Ssemaganda H, Koola JD, Mao J, Westerman D, Speroff T, Govindarajulu US, Ramsay CR, Sedrakyan A, Ohno-Machado L, Resnic FS, Matheny ME. Simulating complex patient populations with hierarchical learning effects to support methods development for post-market surveillance. BMC Med Res Methodol 2023;23:89. [PMID: 37041457 PMCID: PMC10088292 DOI: 10.1186/s12874-023-01913-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2021] [Accepted: 04/04/2023] [Indexed: 04/13/2023] Open

Abstract

BACKGROUND

Validating new algorithms, such as methods to disentangle intrinsic treatment risk from risk associated with experiential learning of novel treatments, often requires knowing the ground truth for data characteristics under investigation. Since the ground truth is inaccessible in real world data, simulation studies using synthetic datasets that mimic complex clinical environments are essential. We describe and evaluate a generalizable framework for injecting hierarchical learning effects within a robust data generation process that incorporates the magnitude of intrinsic risk and accounts for known critical elements in clinical data relationships.

METHODS

We present a multi-step data generating process with customizable options and flexible modules to support a variety of simulation requirements. Synthetic patients with nonlinear and correlated features are assigned to provider and institution case series. The probability of treatment and outcome assignment are associated with patient features based on user definitions. Risk due to experiential learning by providers and/or institutions when novel treatments are introduced is injected at various speeds and magnitudes. To further reflect real-world complexity, users can request missing values and omitted variables. We illustrate an implementation of our method in a case study using MIMIC-III data for reference patient feature distributions.

RESULTS

Realized data characteristics in the simulated data reflected specified values. Apparent deviations in treatment effects and feature distributions, though not statistically significant, were most common in small datasets (n < 3000) and attributable to random noise and variability in estimating realized values in small samples. When learning effects were specified, synthetic datasets exhibited changes in the probability of an adverse outcomes as cases accrued for the treatment group impacted by learning and stable probabilities as cases accrued for the treatment group not affected by learning.

CONCLUSIONS

Our framework extends clinical data simulation techniques beyond generation of patient features to incorporate hierarchical learning effects. This enables the complex simulation studies required to develop and rigorously test algorithms developed to disentangle treatment safety signals from the effects of experiential learning. By supporting such efforts, this work can help identify training opportunities, avoid unwarranted restriction of access to medical advances, and hasten treatment improvements.

Collapse

Affiliation(s)

Sharon E Davis Department of Biomedical Informatics, Vanderbilt University Medical Center, 2525 West End Ave, Suite 1475, Nashville, TN, 37203, USA.
Henry Ssemaganda Comparative Effectiveness Research Institute, Lahey Hospital and Medical Center, 41 Mall Road, Burlington, MA, 01803, USA
Jejo D Koola UC Health Department of Biomedical Informatics, University of California San Diego, 9500 Gilman Dr. MC 0728, La Jolla, San Diego, CA, 92093-0728, USA
Jialin Mao Department of Population Health Sciences, Weill Cornell Medicine, 1300 York Avenue, New York, NY, 10065, USA
Dax Westerman Department of Biomedical Informatics, Vanderbilt University Medical Center, 2525 West End Ave, Suite 1475, Nashville, TN, 37203, USA
Theodore Speroff Departments of Medicine and Biostatistics, Vanderbilt University Medical Center, 1313 21St Avenue South, Oxford House, Room 209, Nashville, TN, 37232, USA
Usha S Govindarajulu Center for Biostatistics, Department of Population Health Science and Policy, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, Box 1077, New York, NY, 10029, USA
Craig R Ramsay Health Services Research Unit, University of Aberdeen, Health Sciences Building, Foresterhill, 3rd Floor, Aberdeen, AB25 2ZD, UK
Art Sedrakyan Department of Population Health Sciences, Weill Cornell Medicine, 1300 York Avenue, New York, NY, 10065, USA
Lucila Ohno-Machado Biomedical Informatics and Data Science, Yale School of Medicine, 100 College Street, New Haven, CT, 06510, USA
Frederic S Resnic Division of Cardiovascular Medicine and Comparative Effectiveness Research Institute, Lahey Hospital and Medical Center, Tufts University School of Medicine, 41 Burlington Mall Road, Burlington, MA, 01805, USA
Michael E Matheny Departments of Biomedical Informatics, Biostatistics, and Medicine, Vanderbilt University Medical Center, 2525 West End Ave, Suite 1475, Nashville, TN, 37203, USA Geriatric Research Education and Clinical Care Center, Tennessee Valley Healthcare System VA, 1310 24th Avenue South, Nashville, TN, 37212, USA

Collapse

Wong DJ, Gandomkar Z, Lewis S, Reed W, Suleiman M, Siviengphanom S, Ekpo E. Do Reader Characteristics Affect Diagnostic Efficacy in Screening Mammography? A Systematic Review. Clin Breast Cancer 2023;23:e56-e67. [PMID: 36792458 DOI: 10.1016/j.clbc.2023.01.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2022] [Revised: 01/10/2023] [Accepted: 01/21/2023] [Indexed: 01/27/2023]

Abstract

To examine reader characteristics associated with diagnostic efficacy in the interpretation of screening mammograms. A systematic search of the literature was conducted using databases such as Cochrane, Scopus, Medline, Embase, Web of Science, and PubMed. Search terms were combined with "AND" or "OR" and included: "Radiologist's characteristics AND performance"; "radiologist experience AND screening mammography"; "annual volume read AND diagnostic efficacy"; "screening mammography performance OR diagnostic efficacy". Studies were included if they assessed reader performance in screening mammography interpretation, breast readers, used a reference standard to assess the performance, and were published in the English language. Twenty-eight studies were reviewed. Increasing reader's age was associated with lower false positive rates. No association was found between gender and performance. Half of the studies showed no association between years of reading mammograms and performance. Most studies showed that high reading volume was more likely to be associated with increased sensitivity, cancer detection rates (CDR), lower recall rate, and lower false positive rates. Inconsistent associations were found between fellowship training in breast imaging and reader performance. Specialization in breast imaging was associated with better CDR, sensitivity, and specificity. Limited studies were available to establish the association between performance and factors such as time spent in breast imaging (n = 2), screening focus (n = 1), formal rotation in mammography (n = 1), owner of practice (n = 1), and practice type (n = 1). No individual characteristics is associated with versatility in diagnostic efficacy, albeit reading volume and specialization in breast imaging appear to be associated with with increased sensitivity and CDR without significantly affecting other performance metrics.

Collapse

Clerkin N, Ski CF, Brennan PC, Strudwick R. Identification of factors associated with diagnostic performance variation in reporting of mammograms: A review. Radiography (Lond) 2023;29:340-346. [PMID: 36731351 DOI: 10.1016/j.radi.2023.01.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2022] [Revised: 12/13/2022] [Accepted: 01/04/2023] [Indexed: 02/01/2023]

Li D, Pehrson LM, Tøttrup L, Fraccaro M, Bonnevie R, Thrane J, Sørensen PJ, Rykkje A, Andersen TT, Steglich-Arnholm H, Stærk DMR, Borgwardt L, Hansen KL, Darkner S, Carlsen JF, Nielsen MB. Inter- and Intra-Observer Agreement When Using a Diagnostic Labeling Scheme for Annotating Findings on Chest X-rays-An Early Step in the Development of a Deep Learning-Based Decision Support System. Diagnostics (Basel) 2022;12:diagnostics12123112. [PMID: 36553118 PMCID: PMC9776917 DOI: 10.3390/diagnostics12123112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Revised: 11/21/2022] [Accepted: 11/26/2022] [Indexed: 12/14/2022] Open

Affiliation(s)

Dana Li Department of Diagnostic Radiology, Copenhagen University Hospital, Rigshospitalet, 2100 Copenhagen, Denmark Department of Clinical Medicine, University of Copenhagen, 2100 Copenhagen, Denmark Correspondence:
Lea Marie Pehrson Department of Diagnostic Radiology, Copenhagen University Hospital, Rigshospitalet, 2100 Copenhagen, Denmark Department of Computer Science, University of Copenhagen, 2100 Copenhagen, Denmark
Lea Tøttrup Unumed Aps, 1055 Copenhagen, Denmark
Marco Fraccaro Unumed Aps, 1055 Copenhagen, Denmark
Rasmus Bonnevie Unumed Aps, 1055 Copenhagen, Denmark
Jakob Thrane Unumed Aps, 1055 Copenhagen, Denmark
Peter Jagd Sørensen Department of Diagnostic Radiology, Copenhagen University Hospital, Rigshospitalet, 2100 Copenhagen, Denmark Department of Clinical Medicine, University of Copenhagen, 2100 Copenhagen, Denmark
Alexander Rykkje Department of Diagnostic Radiology, Copenhagen University Hospital, Rigshospitalet, 2100 Copenhagen, Denmark Department of Clinical Medicine, University of Copenhagen, 2100 Copenhagen, Denmark
Tobias Thostrup Andersen Department of Diagnostic Radiology, Copenhagen University Hospital, Rigshospitalet, 2100 Copenhagen, Denmark
Henrik Steglich-Arnholm Department of Diagnostic Radiology, Copenhagen University Hospital, Rigshospitalet, 2100 Copenhagen, Denmark
Dorte Marianne Rohde Stærk Department of Diagnostic Radiology, Copenhagen University Hospital, Rigshospitalet, 2100 Copenhagen, Denmark
Lotte Borgwardt Department of Diagnostic Radiology, Copenhagen University Hospital, Rigshospitalet, 2100 Copenhagen, Denmark
Kristoffer Lindskov Hansen Department of Diagnostic Radiology, Copenhagen University Hospital, Rigshospitalet, 2100 Copenhagen, Denmark Department of Clinical Medicine, University of Copenhagen, 2100 Copenhagen, Denmark
Sune Darkner Department of Computer Science, University of Copenhagen, 2100 Copenhagen, Denmark
Jonathan Frederik Carlsen Department of Diagnostic Radiology, Copenhagen University Hospital, Rigshospitalet, 2100 Copenhagen, Denmark Department of Clinical Medicine, University of Copenhagen, 2100 Copenhagen, Denmark
Michael Bachmann Nielsen Department of Diagnostic Radiology, Copenhagen University Hospital, Rigshospitalet, 2100 Copenhagen, Denmark Department of Clinical Medicine, University of Copenhagen, 2100 Copenhagen, Denmark

Collapse

Liu Z, Liang K, Zhang L, Lai C, Li R, Yi L, Li R, Zhang L, Long W. Small lesion classification on abbreviated breast MRI: training can improve diagnostic performance and inter-reader agreement. Eur Radiol 2022;32:5742-5751. [PMID: 35212772 DOI: 10.1007/s00330-022-08622-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2021] [Revised: 12/25/2021] [Accepted: 01/29/2022] [Indexed: 11/04/2022]

Abstract

OBJECTIVE

To determine whether the diagnostic performance and inter-reader agreement for small lesion classification on abbreviated breast MRI (AB-MRI) can be improved by training, and can achieve the level of full diagnostic protocol MRI (FDP-MRI).

METHODS

This retrospective study enrolled 1165 breast lesions (≤ 2 cm; 409 malignant and 756 benign) from 1165 MRI examinations for reading test. Twelve radiologists were assigned into a trained group and a non-trained group. They interpreted each AB-MRI twice, which was extracted from FDP-MRI. After the first read, the trained group received a structured training for AB-MRI interpretation while the non-trained group did not. FDP-MRIs were interpreted by the trained group after the second read. BI-RADS category for each lesion was compared to the standard of reference (histopathological examination or follow-up) to calculate diagnostic accuracy. Inter-reader agreement was assessed using multirater k analysis. Diagnostic accuracy and inter-reader agreement were compared between the trained and non-trained groups, between the first and second reads, and between AB-MRI and FDP-MRI.

RESULTS

After training, the diagnostic accuracy of AB-MRI increased from 77.6 to 84.4%, and inter-reader agreement improved from 0.410 to 0.579 (both p < 0.001), which were higher than those of the non-trained group (accuracy, 84.4% vs 78.0%; weighted k, 0.579 vs 0.461; both p < 0.001). The post-training accuracy and inter-reader agreement of AB-MRI were lower than those of FDP-MRI (accuracy, 84.4% vs 92.8%; weighted k, 0.579 vs 0.602; both p < 0.001).

CONCLUSIONS

Training can improve the diagnostic performance and inter-reader agreement for small lesion classification on AB-MRI; however, it remains inferior to those of FDP-MRI.

KEY POINTS

• Training can improve the diagnostic performance for small breast lesions on AB-MRI. • Training can reduce inter-observer variation for breast lesion classification on AB-MRI, especially among junior radiologists. • The post-training diagnostic performance and inter-reader agreement of AB-MRI remained inferior to those of FDP-MRI.

Collapse

Hadadi I, Rae W, Clarke J, McEntee M, Ekpo E. Breast cancer detection across dense and non-dense breasts: Markers of diagnostic confidence and efficacy. Acta Radiol Open 2022;11:20584601211072279. [PMID: 35111337 PMCID: PMC8801646 DOI: 10.1177/20584601211072279] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2021] [Accepted: 12/17/2021] [Indexed: 11/17/2022] Open

Abstract

Background

The impact of radiologists’ characteristics has become a major focus of recent research. However, the markers of diagnostic efficacy and confidence in dense and non-dense breasts are poorly understood.

Purpose

This study aims to assess the relationship between radiologists’ characteristics and diagnostic performance across dense and non-dense breasts.

Materials and methods

Radiologists specialising in breast imaging (n = 128) who had 0.5–40 (13±10.6) years of experience reading mammograms were recruited. Participants independently interpreted a test set containing 60 digital mammograms (40 normal and 20 abnormal) with similarly distributed breast densities. Diagnostic performance measures were analysed via Jamovi software (version 1.6.22).

Results

In dense breasts, breast-imaging fellowship completion significantly improved specificity (p = 0.004), location sensitivity (p = 0.01) and the area under the curve (AUC) of the receiver operating characteristic (p = 0.03). Only participation in BreastScreen reading significantly improved all performance metrics: specificity (p = 0.04), sensitivity (p = 0.005), location sensitivity (p < 0.001) and AUC (p < 0.001). Reading > 100 mammograms weekly significantly improved sensitivity (p = 0.03), location sensitivity (p = 0.001), and AUC (p = 0.03).In non-dense breasts, breast fellowship completion significantly improved sensitivity (p = 0.02), location sensitivity (p = 0.04) and AUC (p = 0.002). Participation in BreastScreen reading and reading > 100 mammograms weekly significantly improved only sensitivity (p = 0.002 and p = 0.003, respectively) and location sensitivity (p < 0.001 and p < 0.001, respectively).

Conclusion

Participating in screening programs, breast fellowships and reading > 100 mammograms weekly are important indicators of the diagnostic performance of radiologists across dense and non-dense breasts. In dense breasts, optimal performance resulted from participation in a breast screening program.

Collapse

Empowering study of breast cancer data with application of artificial intelligence technology: promises, challenges, and use cases. Clin Exp Metastasis 2022;39:249-254. [PMID: 34697751 PMCID: PMC8967766 DOI: 10.1007/s10585-021-10125-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2021] [Accepted: 09/25/2021] [Indexed: 12/15/2022]

Perspective: in pursuit of a learning culture. Abdom Radiol (NY) 2021;46:5017-5020. [PMID: 34075467 DOI: 10.1007/s00261-021-03156-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2021] [Revised: 05/13/2021] [Accepted: 05/26/2021] [Indexed: 10/21/2022]

Patel MM, Kapoor MM, Whitman GJ. Transitioning to Practice: Getting up to Speed in Efficiency and Accuracy. JOURNAL OF BREAST IMAGING 2021;3:607-611. [PMID: 34545352 PMCID: PMC8445236 DOI: 10.1093/jbi/wbaa100] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2020] [Indexed: 11/13/2022]

Winkler N, Braden S, Al-Dulaimi R, Morgan M, Walczak C, Freer P. Perceptions Regarding Optimal Breast Imaging Education for Radiology Residents: Results of a National Survey. Curr Probl Diagn Radiol 2021;51:454-459. [PMID: 34561152 DOI: 10.1067/j.cpradiol.2021.07.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2021] [Accepted: 07/18/2021] [Indexed: 11/22/2022]

Lee CS, Moy L, Hughes D, Golden D, Bhargavan-Chatfield M, Hemingway J, Geras A, Duszak R, Rosenkrantz AB. Radiologist Characteristics Associated with Interpretive Performance of Screening Mammography: A National Mammography Database (NMD) Study. Radiology 2021;300:518-528. [PMID: 34156300 DOI: 10.1148/radiol.2021204379] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

Abstract

Background Factors affecting radiologists' performance in screening mammography interpretation remain poorly understood. Purpose To identify radiologists characteristics that affect screening mammography interpretation performance. Materials and Methods This retrospective study included 1223 radiologists in the National Mammography Database (NMD) from 2008 to 2019 who could be linked to Centers for Medicare & Medicaid Services (CMS) datasets. NMD screening performance metrics were extracted. Acceptable ranges were defined as follows: recall rate (RR) between 5% and 12%; cancer detection rate (CDR) of at least 2.5 per 1000 screening examinations; positive predictive value of recall (PPV1) between 3% and 8%; positive predictive value of biopsies recommended (PPV2) between 20% and 40%; positive predictive value of biopsies performed (PPV3) between the 25th and 75th percentile of study sample; invasive CDR of at least the 25th percentile of the study sample; and percentage of ductal carcinoma in situ (DCIS) of at least the 25th percentile of the study sample. Radiologist characteristics extracted from CMS datasets included demographics, subspecialization, and clinical practice patterns. Multivariable stepwise logistic regression models were performed to identify characteristics independently associated with acceptable performance for the seven metrics. The most influential characteristics were defined as those independently associated with the majority of the metrics (at least four). Results Relative to radiologists practicing in the Northeast, those in the Midwest were more likely to achieve acceptable RR, PPV1, PPV2, and CDR (odds ratio [OR], 1.4-2.5); those practicing in the West were more likely to achieve acceptable RR, PPV2, and PPV3 (OR, 1.7-2.1) but less likely to achieve acceptable invasive CDR (OR, 0.6). Relative to general radiologists, breast imagers were more likely to achieve acceptable PPV1, invasive CDR, percentage DCIS, and CDR (OR, 1.4-4.4). Those performing diagnostic mammography were more likely to achieve acceptable PPV1, PPV2, PPV3, invasive CDR, and CDR (OR, 1.9-2.9). Those performing breast US were less likely to achieve acceptable PPV1, PPV2, percentage DCIS, and CDR (OR, 0.5-0.7). Conclusion The geographic location of the radiology practice, subspecialization in breast imaging, and performance of diagnostic mammography are associated with better screening mammography performance; performance of breast US is associated with lower performance. ^©RSNA, 2021 Online supplemental material is available for this article.

Collapse

Affiliation(s)

Cindy S Lee From the Department of Radiology, New York University Langone Health, 660 1st Ave, 3rd Floor, New York, NY 10016 (C.S.L., L.M., A.B.R.); Harvey L. Neiman Health Policy Institute, Reston, Va (D.H., J.H., R.D., A.B.R.); American College of Radiology, Reston, Va (D.G., M.B.C.); Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland (A.G.); and Department of Radiology and Imaging Sciences, Emory University, Atlanta, Ga (R.D.)
Linda Moy From the Department of Radiology, New York University Langone Health, 660 1st Ave, 3rd Floor, New York, NY 10016 (C.S.L., L.M., A.B.R.); Harvey L. Neiman Health Policy Institute, Reston, Va (D.H., J.H., R.D., A.B.R.); American College of Radiology, Reston, Va (D.G., M.B.C.); Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland (A.G.); and Department of Radiology and Imaging Sciences, Emory University, Atlanta, Ga (R.D.)
Danny Hughes From the Department of Radiology, New York University Langone Health, 660 1st Ave, 3rd Floor, New York, NY 10016 (C.S.L., L.M., A.B.R.); Harvey L. Neiman Health Policy Institute, Reston, Va (D.H., J.H., R.D., A.B.R.); American College of Radiology, Reston, Va (D.G., M.B.C.); Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland (A.G.); and Department of Radiology and Imaging Sciences, Emory University, Atlanta, Ga (R.D.)
Dan Golden From the Department of Radiology, New York University Langone Health, 660 1st Ave, 3rd Floor, New York, NY 10016 (C.S.L., L.M., A.B.R.); Harvey L. Neiman Health Policy Institute, Reston, Va (D.H., J.H., R.D., A.B.R.); American College of Radiology, Reston, Va (D.G., M.B.C.); Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland (A.G.); and Department of Radiology and Imaging Sciences, Emory University, Atlanta, Ga (R.D.)
Mythreyi Bhargavan-Chatfield From the Department of Radiology, New York University Langone Health, 660 1st Ave, 3rd Floor, New York, NY 10016 (C.S.L., L.M., A.B.R.); Harvey L. Neiman Health Policy Institute, Reston, Va (D.H., J.H., R.D., A.B.R.); American College of Radiology, Reston, Va (D.G., M.B.C.); Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland (A.G.); and Department of Radiology and Imaging Sciences, Emory University, Atlanta, Ga (R.D.)
Jennifer Hemingway From the Department of Radiology, New York University Langone Health, 660 1st Ave, 3rd Floor, New York, NY 10016 (C.S.L., L.M., A.B.R.); Harvey L. Neiman Health Policy Institute, Reston, Va (D.H., J.H., R.D., A.B.R.); American College of Radiology, Reston, Va (D.G., M.B.C.); Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland (A.G.); and Department of Radiology and Imaging Sciences, Emory University, Atlanta, Ga (R.D.)
Agnieszka Geras From the Department of Radiology, New York University Langone Health, 660 1st Ave, 3rd Floor, New York, NY 10016 (C.S.L., L.M., A.B.R.); Harvey L. Neiman Health Policy Institute, Reston, Va (D.H., J.H., R.D., A.B.R.); American College of Radiology, Reston, Va (D.G., M.B.C.); Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland (A.G.); and Department of Radiology and Imaging Sciences, Emory University, Atlanta, Ga (R.D.)
Richard Duszak From the Department of Radiology, New York University Langone Health, 660 1st Ave, 3rd Floor, New York, NY 10016 (C.S.L., L.M., A.B.R.); Harvey L. Neiman Health Policy Institute, Reston, Va (D.H., J.H., R.D., A.B.R.); American College of Radiology, Reston, Va (D.G., M.B.C.); Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland (A.G.); and Department of Radiology and Imaging Sciences, Emory University, Atlanta, Ga (R.D.)
Andrew B Rosenkrantz From the Department of Radiology, New York University Langone Health, 660 1st Ave, 3rd Floor, New York, NY 10016 (C.S.L., L.M., A.B.R.); Harvey L. Neiman Health Policy Institute, Reston, Va (D.H., J.H., R.D., A.B.R.); American College of Radiology, Reston, Va (D.G., M.B.C.); Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland (A.G.); and Department of Radiology and Imaging Sciences, Emory University, Atlanta, Ga (R.D.)

Collapse

Aklilu S, Bain C, Bansil P, de Sanjose S, Dunstan JA, Castillo V, Tsu V, Contreras I, Balassanian R, Hayes Constant TK, Scheel JR. Evaluation of diagnostic ultrasound use in a breast cancer detection strategy in Northern Peru. PLoS One 2021;16:e0252902. [PMID: 34115775 PMCID: PMC8195385 DOI: 10.1371/journal.pone.0252902] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2021] [Accepted: 05/24/2021] [Indexed: 11/18/2022] Open

Improving radiologist's ability in identifying particular abnormal lesions on mammograms through training test set with immediate feedback. Sci Rep 2021;11:9899. [PMID: 33972611 PMCID: PMC8110801 DOI: 10.1038/s41598-021-89214-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2020] [Accepted: 04/06/2021] [Indexed: 12/24/2022] Open

Abstract

It has been shown that there are differences in diagnostic accuracy of cancer detection on mammograms, from below 50% in developing countries to over 80% in developed world. One previous study reported that radiologists from a population in Asia displayed a low mammographic cancer detection of 48% compared with over 80% in developed countries, and more importantly, that most lesions missed by these radiologists were spiculated masses or stellate lesions. The aim of this study was to explore the performance of radiologists after undertaking a training test set which had been designed to improve the capability in detecting a specific type of cancers on mammograms. Twenty-five radiologists read two sets of 60 mammograms in a standardized mammogram reading room. The first test set focused on stellate or spiculated masses. When radiologists completed the first set, the system displayed immediate feedback to the readers comparing their performances in each case with the truth of cancer cases and cancer types so that the readers could identify individual-based errors. Later radiologists were asked to read the second set of mammograms which contained different types of cancers including stellate/spiculated masses, asymmetric density, calcification, discrete mass and architectural distortion. Case sensitivity, lesion sensitivity, specificity, receiver operating characteristics (ROC) and Jackknife alternative free-response receiver operating characteristics (JAFROC) were calculated for each participant and their diagnostic accuracy was compared between two sessions. Results showed significant improvement among radiologists in case sensitivity (+ 11.4%; P < 0.05), lesion sensitivity (+ 18.7%; P < 0.01) and JAFROC (+ 11%; P < 0.01) in the second set compared with the first set. The increase in diagnostic accuracy was also recorded in the detection of stellate/spiculated mass (+ 20.6%; P < 0.05). This indicated that the performance of radiologists in detecting malignant lesions on mammograms can be improved if an appropriate training intervention is applied after the readers' weakness and strength are identified.

Collapse

Radiologists and Clinical Trials: Part 1 The Truth About Reader Disagreements. Ther Innov Regul Sci 2021;55:1111-1121. [PMID: 34228319 PMCID: PMC8259547 DOI: 10.1007/s43441-021-00316-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2020] [Accepted: 06/18/2021] [Indexed: 02/06/2023]

Dou E, Ksepka M, Dodelzon K, Shingala PY, Katzen JT. Assessing the Positive Predictive Value of Architectural Distortion Detected with Digital Breast Tomosynthesis in BI-RADS 4 Cases. JOURNAL OF BREAST IMAGING 2020;2:552-560. [PMID: 38424858 DOI: 10.1093/jbi/wbaa078] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2020] [Indexed: 03/02/2024]

Chung R, Rosenkrantz AB, Shanbhogue KP. Expert radiologist review at a hepatobiliary multidisciplinary tumor board: impact on patient management. Abdom Radiol (NY) 2020;45:3800-3808. [PMID: 32444889 DOI: 10.1007/s00261-020-02587-3] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]

Abstract

PURPOSE

To identify the frequency, source, and management impact of discrepancies between the initial radiology report and expert reinterpretation occurring in the context of a hepatobiliary multidisciplinary tumor board (MTB).

METHODS

This retrospective study included 974 consecutive patients discussed at a weekly MTB at a large tertiary care academic medical center over a 2-year period. A single radiologist with dedicated hepatobiliary imaging expertise attended all conferences to review and discuss the relevant liver imaging and rated the concordance between original and re-reads based on RADPEER scoring criteria. Impact on management was based on the conference discussion and reflected changes in follow-up imaging, recommendations for biopsy/surgery, or liver transplant eligibility.

RESULTS

Image reinterpretation was discordant with the initial report in 19.9% (194/974) of cases (59.8%, 34.5%, 5.7% RADPEER 2/3/4 discrepancies, respectively). A change in LI-RADS category occurred in 59.8% of discrepancies. Most common causes of discordance included re-classification of a lesion as benign rather than malignant (16.0%) and missed tumor recurrence (13.9%). Impact on management occurred in 99.0% of discordant cases and included loco-regional therapy instead of follow-up imaging (19.1%), follow-up imaging instead of treatment (17.5%), and avoidance of biopsy (12.4%). 11.3% received OPTN exception scores due to the revised interpretation, and 8.8% were excluded from listing for orthotopic liver transplant.

CONCLUSION

Even in a sub-specialized abdominal imaging academic practice, expert radiologist review in the MTB setting identified discordant interpretations and impacted management in a substantial fraction of patients, potentially impacting transplant allocation. The findings may impact how abdominal imaging sections best staff advanced MTBs.

Collapse

Caton MT, Wiggins WF, Pomerantz SR, Andriole KP. Lumbar MRI Reporting Efficiency for Trainees Over the Academic Year: An Opportunity for Improving Clinical Workflows in Academic Medical Centers. J Am Coll Radiol 2020;18:428-434. [PMID: 32916156 DOI: 10.1016/j.jacr.2020.08.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2020] [Revised: 08/10/2020] [Accepted: 08/12/2020] [Indexed: 11/28/2022]

Pull back the curtain: External data validation is an essential element of quality improvement benchmark reporting. J Trauma Acute Care Surg 2020;89:199-207. [PMID: 31914009 DOI: 10.1097/ta.0000000000002579] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

Abstract

BACKGROUND

Accurate and reliable data are pivotal to credible risk-adjusted modeling and hospital benchmarking. Evidence assessing the reliability and accuracy of data elements considered as variables in risk-adjustment modeling and measurement of outcomes is lacking. This deficiency holds the potential to compromise benchmarking integrity. We detail the findings of a longitudinal program to evaluate the impact of external data validation on data validity and reliability for variables utilized in benchmarking of trauma centers.

METHODS

A collaborative quality initiative-based study was conducted of 29 trauma centers from March 2010 through December 2018. Case selection criteria were applied to identify high-yield cases that were likely to challenge data abstractors. There were 127,238 total variables validated (i.e., reabstracted, compared, and reported to trauma centers). Study endpoints included data accuracy (agreement between registry data and contemporaneous documentation) and reliability (consistency of accuracy within and between hospitals). Data accuracy was assessed by mean error rate and type (under capture, inaccurate capture, or over capture). Cohen's kappa estimates were calculated to evaluate reliability.

RESULTS

There were 185,120 patients that met the collaborative inclusion criteria. There were 1,243 submissions reabstracted. The initial validation visit demonstrated the highest mean error rate at 6.2% ± 4.7%, and subsequent validation visits demonstrated a statistically significant decrease in error rate compared with the first visit (p < 0.05). The mean hospital error rate within the collaborative steadily improved over time (2010, 8.0%; 2018, 3.2%) compared with the first year (p < 0.05). Reliability of substantial or higher (kappa ≥0.61) was demonstrated in 90% of the 20 comorbid conditions considered in the benchmark risk-adjustment modeling, 39% of these variables exhibited a statistically significant (p < 0.05) interval decrease in error rate from the initial visit.

CONCLUSION

Implementation of an external data validation program is correlated with increased data accuracy and reliability. Improved data reliability both within and between trauma centers improved risk-adjustment model validity and quality improvement program feedback.

Collapse

Callen AL, Dupont SM, Price A, Laguna B, McCoy D, Do B, Talbott J, Kohli M, Narvid J. Between Always and Never: Evaluating Uncertainty in Radiology Reports Using Natural Language Processing. J Digit Imaging 2020;33:1194-1201. [PMID: 32813098 DOI: 10.1007/s10278-020-00379-1] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2019] [Revised: 06/10/2020] [Accepted: 07/23/2020] [Indexed: 02/06/2023] Open

Abstract

The ideal radiology report reduces diagnostic uncertainty, while avoiding ambiguity whenever possible. The purpose of this study was to characterize the use of uncertainty terms in radiology reports at a single institution and compare the use of these terms across imaging modalities, anatomic sections, patient characteristics, and radiologist characteristics. We hypothesized that there would be variability among radiologists and between subspecialities within radiology regarding the use of uncertainty terms and that the length of the impression of a report would be a predictor of use of uncertainty terms. Finally, we hypothesized that use of uncertainty terms would often be interpreted by human readers as "hedging." To test these hypotheses, we applied a natural language processing (NLP) algorithm to assess and count the number of uncertainty terms within radiology reports. An algorithm was created to detect usage of a published set of uncertainty terms. All 642,569 radiology report impressions from 171 reporting radiologists were collected from 2011 through 2015. For validation, two radiologists without knowledge of the software algorithm reviewed report impressions and were asked to determine whether the report was "uncertain" or "hedging." The relationship between the presence of 1 or more uncertainty terms and the human readers' assessment was compared. There were significant differences in the proportion of reports containing uncertainty terms across patient admission status and across anatomic imaging subsections. Reports with uncertainty were significantly longer than those without, although report length was not significantly different between subspecialities or modalities. There were no significant differences in rates of uncertainty when comparing the experience of the attending radiologist. When compared with reader 1 as a gold standard, accuracy was 0.91, sensitivity was 0.92, specificity was 0.9, and precision was 0.88, with an F1-score of 0.9. When compared with reader 2, accuracy was 0.84, sensitivity was 0.88, specificity was 0.82, and precision was 0.68, with an F1-score of 0.77. Substantial variability exists among radiologists and subspecialities regarding the use of uncertainty terms, and this variability cannot be explained by years of radiologist experience or differences in proportions of specific modalities. Furthermore, detection of uncertainty terms demonstrates good test characteristics for predicting human readers' assessment of uncertainty.

Collapse

Frank SM, Qi A, Ravasio D, Sasaki Y, Rosen EL, Watanabe T. Supervised Learning Occurs in Visual Perceptual Learning of Complex Natural Images. Curr Biol 2020;30:2995-3000.e3. [PMID: 32502415 DOI: 10.1016/j.cub.2020.05.050] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2020] [Revised: 04/14/2020] [Accepted: 05/14/2020] [Indexed: 01/13/2023]

Henderson LM, Bacchus L, Benefield T, Huamani Velasquez R, Rivera MP. Rates of positive lung cancer screening examinations in academic versus community practice. Transl Lung Cancer Res 2020;9:1528-1532. [PMID: 32953524 PMCID: PMC7481616 DOI: 10.21037/tlcr-19-673] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]

Brunetti N, De Giorgis S, Zawaideh J, Rossi F, Calabrese M, Tagliafico AS. Comparison between execution and reading time of 3D ABUS versus HHUS. Radiol Med 2020;125:1243-1248. [PMID: 32367322 DOI: 10.1007/s11547-020-01209-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2020] [Accepted: 04/20/2020] [Indexed: 01/09/2023]

Chung HL, Parikh JR. Telemammography: Technical Advances Improve Patient Access in Breast Care. JOURNAL OF BREAST IMAGING 2020;2:152-156. [PMID: 38424884 DOI: 10.1093/jbi/wbz088] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2019] [Indexed: 03/02/2024]

Kim HE, Kim HH, Han BK, Kim KH, Han K, Nam H, Lee EH, Kim EK. Changes in cancer detection and false-positive recall in mammography using artificial intelligence: a retrospective, multireader study. Lancet Digit Health 2020;2:e138-e148. [PMID: 33334578 DOI: 10.1016/s2589-7500(20)30003-0] [Citation(s) in RCA: 177] [Impact Index Per Article: 44.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2019] [Revised: 01/07/2020] [Accepted: 01/13/2020] [Indexed: 02/08/2023]

Abstract

BACKGROUND

Mammography is the current standard for breast cancer screening. This study aimed to develop an artificial intelligence (AI) algorithm for diagnosis of breast cancer in mammography, and explore whether it could benefit radiologists by improving accuracy of diagnosis.

METHODS

In this retrospective study, an AI algorithm was developed and validated with 170 230 mammography examinations collected from five institutions in South Korea, the USA, and the UK, including 36 468 cancer positive confirmed by biopsy, 59 544 benign confirmed by biopsy (8827 mammograms) or follow-up imaging (50 717 mammograms), and 74 218 normal. For the multicentre, observer-blinded, reader study, 320 mammograms (160 cancer positive, 64 benign, 96 normal) were independently obtained from two institutions. 14 radiologists participated as readers and assessed each mammogram in terms of likelihood of malignancy (LOM), location of malignancy, and necessity to recall the patient, first without and then with assistance of the AI algorithm. The performance of AI and radiologists was evaluated in terms of LOM-based area under the receiver operating characteristic curve (AUROC) and recall-based sensitivity and specificity.

FINDINGS

The AI standalone performance was AUROC 0·959 (95% CI 0·952-0·966) overall, and 0·970 (0·963-0·978) in the South Korea dataset, 0·953 (0·938-0·968) in the USA dataset, and 0·938 (0·918-0·958) in the UK dataset. In the reader study, the performance level of AI was 0·940 (0·915-0·965), significantly higher than that of the radiologists without AI assistance (0·810, 95% CI 0·770-0·850; p<0·0001). With the assistance of AI, radiologists' performance was improved to 0·881 (0·850-0·911; p<0·0001). AI was more sensitive to detect cancers with mass (53 [90%] vs 46 [78%] of 59 cancers detected; p=0·044) or distortion or asymmetry (18 [90%] vs ten [50%] of 20 cancers detected; p=0·023) than radiologists. AI was better in detection of T1 cancers (73 [91%] vs 59 [74%] of 80; p=0·0039) or node-negative cancers (104 [87%] vs 88 [74%] of 119; p=0·0025) than radiologists.

INTERPRETATION

The AI algorithm developed with large-scale mammography data showed better diagnostic performance in breast cancer detection compared with radiologists. The significant improvement in radiologists' performance when aided by AI supports application of AI to mammograms as a diagnostic support tool.

FUNDING

Lunit.

Collapse

Zorn C, Bauer E, Feffer ML, Moerschel E, Bierry G, Choquet P, Dillenseger JP. Building and Exploitation of Learning Curves to Train Radiographer Students in X-Ray CT Image Postprocessing. J Med Imaging Radiat Sci 2020;51:173-181. [PMID: 32057745 DOI: 10.1016/j.jmir.2019.11.135] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2019] [Revised: 11/08/2019] [Accepted: 11/19/2019] [Indexed: 12/01/2022]

Geertse TD, Paap E, van der Waal D, Duijm LEM, Pijnappel RM, Broeders MJM. Utility of Supplemental Training to Improve Radiologist Performance in Breast Cancer Screening: A Literature Review. J Am Coll Radiol 2019;16:1528-1546. [PMID: 31247156 DOI: 10.1016/j.jacr.2019.04.016] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2018] [Revised: 04/23/2019] [Accepted: 04/23/2019] [Indexed: 10/26/2022]

Messinger J, Crawford S, Roland L, Mizuguchi S. Review of Subtypes of Interval Breast Cancers With Discussion of Radiographic Findings. Curr Probl Diagn Radiol 2019;48:592-598. [DOI: 10.1067/j.cpradiol.2018.08.010] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2018] [Revised: 08/25/2018] [Accepted: 08/29/2018] [Indexed: 11/22/2022]

Wang S, Li XT, Zhang XY, Sun RJ, Qu YH, Zhu HC, Guan Z, Sun YS. MRI evaluation of extramural vascular invasion by inexperienced radiologists. Br J Radiol 2019;92:20181055. [PMID: 31596129 DOI: 10.1259/bjr.20181055] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open

Abstract

OBJECTIVE

We proposed to determine whether the performance of inexperienced radiologists in determining extramural vascular invasion (EMVI) in rectal cancer on MRI can be promoted by means of targeted training.

METHODS

230 rectal cancer patients who underwent pre-operative chemoradiotherapy were included. Pre-therapy and post-therapy MR images and pathology EMVI evaluation were available for cases. 230 cases were randomly divided into 150 training cases and 80 testing cases, including 40 testing case A and 40 testing case B. Four radiologists were included for MRI EMVI evaluation, who were divided into targeted training group and non-targeted training group. The two groups evaluated testing case A at baseline, 3 month and 6 month, evaluated testing case B at 6 month. The main outcome was agreement with expert-reference for pre-therapy and post-therapy evaluation, the other outcome was accuracy with pathology for post-therapy evaluation.

RESULTS

After 6 months of training, targeted training group showed statistically higher agreement with expert-reference than non-targeted training group for both pre-therapy and post-therapy MRI EMVI evaluation of testing case A and testing case B, all p < 0.05. Targeted training group also showed significantly higher accuracy with pathology than non-targeted training group for post-therapy evaluation of testing case A and testing case B after 6 months of training, all p < 0.05.

CONCLUSION

The diagnostic performance for MRI EMVI evaluation could be promoted by targeted training for inexperienced radiologist.

ADVANCES IN KNOWLEDGE

This study provided the first evidence that after 6 month targeted training, inexperienced radiologists demonstrated improved diagnostic performance, with a 20% increase in agreement with expert-reference for both pre-therapy and post-therapy MRI EMVI evaluation and also a 20% increase in or accuracy with pathology for post-therapy evaluation, while inexperienced radiologists could not gain obvious improvement in MRI EMVI evaluation through the same period of regular clinical practice. It indicated that targeted training may be necessary for helping inexperienced radiologist to acquire adequate experience for the MRI EMVI evaluation of rectal cancer, especially for radiologist who works in a medical unit where MRI EMVI diagnosis is uncommon.

Collapse

Rosenberg RD, Seidenwurm D. Optimizing Breast Cancer Screening Programs: Experience and Structures. Radiology 2019;292:297-298. [DOI: 10.1148/radiol.2019190924] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

Koo A, Smith JT. Does learning from mistakes have to be painful? Analysis of 5 years' experience from the Leeds radiology educational cases meetings identifies common repetitive reporting errors and suggests acknowledging and celebrating excellence (ACE) as a more positive way of teaching the same lessons. Insights Imaging 2019;10:68. [PMID: 31312978 PMCID: PMC6635510 DOI: 10.1186/s13244-019-0751-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2019] [Accepted: 05/14/2019] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

The Royal College of Radiologists (RCR) and General Medical Council (GMC) encourage learning from mistakes. But negative feedback can be a demoralising process with adverse implications for staff morale, clinical engagement, team working and perhaps even patient outcomes. We first reviewed the literature regarding positive feedback and teamworking. We wanted to see if we could reconcile our guidance to review and learn from mistakes with evidence that positive interactions had a better effect on teamworking and outcomes than negative interactions. We then aimed to review and categorise the over 600 (mainly discrepancy) cases discussed in our educational cases meeting into educational 'themes'. Finally, we explored whether we could use these educational themes to deliver the same teaching points in a more positive way.

METHODS AND RESULTS

The attendance records, programmes and educational cases from 30 consecutive bimonthly meetings between 2011 and 2017 were prospectively collated and retrospectively analysed. Six hundred and thirty-two cases were collated over the study period where 76% of the cases submitted were discrepancies, or perceived errors. Eight percent were 'good spots' where examples of good calls, excellent reporting, exemplary practice or subtle findings that were successfully reported. Eight percent were educational cases in which no mistake had been made. The remaining 7% included procedural complications or system errors.

CONCLUSION

By analysing the pattern of discrepancies in a department and delivering the teaching in a less negative way, the 'lead' of clinical errors can be turned in to the 'gold' of useful educational tools. Interrogating the whole database periodically can enable a more constructive, wider view of the meeting itself, highlight recurrent deficiencies in practice, and point to where the need for continuing medical training is greatest. Three ways in which our department have utilised this material are outlined: the use of 'good spots', arrangement of targeted teaching and production of specialist educational material. These techniques can all contribute to a more positive learning experience with the emphasis on acknowledging and celebrating excellence (ACE).

Collapse

Mercan E, Shapiro LG, Brunyé TT, Weaver DL, Elmore JG. Characterizing Diagnostic Search Patterns in Digital Breast Pathology: Scanners and Drillers. J Digit Imaging 2019;31:32-41. [PMID: 28681097 DOI: 10.1007/s10278-017-9990-5] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open

Rzyman W, Szurowska E, Adamek M. Implementation of lung cancer screening at the national level: Polish example. Transl Lung Cancer Res 2019;8:S95-S105. [PMID: 31211110 DOI: 10.21037/tlcr.2019.03.09] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]

Geel KV, Kok EM, Aldekhayel AD, Robben SGF, van Merriënboer JJG. Chest X-ray evaluation training: impact of normal and abnormal image ratio and instructional sequence. MEDICAL EDUCATION 2019;53:153-164. [PMID: 30474292 PMCID: PMC6587445 DOI: 10.1111/medu.13756] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/27/2018] [Revised: 09/07/2018] [Accepted: 09/13/2018] [Indexed: 06/09/2023]

Abstract

CONTEXT

Medical image perception training generally focuses on abnormalities, whereas normal images are more prevalent in medical practice. Furthermore, instructional sequences that let students practice prior to expert instruction (inductive) may lead to improved performance compared with methods that give students expert instruction before practice (deductive). This study investigates the effects of the proportion of normal images and practice-instruction order on learning to interpret medical images. It is hypothesised that manipulation of the proportion of normal images will lead to a sensitivity-specificity trade-off and that students in practice-first (inductive) conditons need more time per practice case but will correctly identify more test cases.

METHODS

Third-year medical students (n = 103) learned radiograph interpretation by practising cases with, respectively, 30% or 70% normal radiographs prior to expert instruction (practice-first order) or after expert instruction (instruction-first order). After training, students performed a test (60% normal) and sensitivity (% of correctly identified abnormal radiographs), specificity (% of correctly identified normal radiographs), diagnostic performance (% of correct diagnoses) and case duration were measured.

RESULTS

The conditions with 30% of normal images scored higher on sensitivity but the conditions with 70% of normal images scored higher on specificity, indicating a sensitivity and specificity trade-off. Those who participated in inductive conditions took less time per practice case but more per test case. They had similar test sensitivity, but scored lower on test specificity.

CONCLUSIONS

The proportion of normal images impacted the sensitivity-specificity trade-off. This trade-off should be an important consideration for the alignment of training with future practice. Furthermore, the deductive conditions unexpectedly scored higher on specificity when participants took less time per case. An inductive approach did not lead to higher diagnostic performance, possibly because participants might already have relevant prior knowledge. Deductive approaches are therefore advised for the training of advanced learners.

Collapse

Govindarajulu U, Bedi S, Kluger A, Resnic F. Survival analysis of hierarchical learning curves in assessment of cardiac device and procedural safety. Stat Med 2018;37:4185-4199. [PMID: 30062850 DOI: 10.1002/sim.7906] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2017] [Accepted: 06/14/2018] [Indexed: 11/07/2022]

Huppe AI, Inciardi MF, Redick M, Carroll M, Buckley J, Hill JD, Gatewood JB. Automated Breast Ultrasound Interpretation Times: A Reader Performance Study. Acad Radiol 2018;25:1577-1581. [PMID: 29661602 DOI: 10.1016/j.acra.2018.03.010] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2018] [Revised: 03/15/2018] [Accepted: 03/15/2018] [Indexed: 11/19/2022]

Demchig D, Mello-Thoms C, Lee WB, Khurelsukh K, Ramish A, Brennan PC. Mammographic detection of breast cancer in a non-screening country. Br J Radiol 2018;91:20180071. [PMID: 29987982 DOI: 10.1259/bjr.20180071] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open

Abstract

OBJECTIVE:

To compare the diagnostic accuracy between radiologists' from a country with and without breast cancer screening.

METHODS:

All participating radiologists gave informed consent. A test-set involving 60 mammographic cases (20 cancer and 40 non-cancer) were read by 11 radiologists from a non-screening (NS) country during a workshop in July 2016. 52 radiologists from a screening country read the same test-set at the Royal Australian and New Zealand College of Radiologists' meetings in July 2015. The screening radiologists were classified into two groups: those with less than or equal to 5 years of experience; those with more than 5 years of experience, and each group was compared to the group of NS radiologists. A Kruskal-Wallis test followed by post-hoc multiple comparisons test were used to compare measures of diagnostic accuracy among the reader groups.

RESULTS:

The diagnostic accuracy of the NS radiologists was significantly lower in terms of sensitivity [mean = 54.0; 95% confidence interval (CI) (40.0-67.0)], location sensitivity [mean = 26.0; 95% CI (16.0-37.0)], receive roperating characteristic area under curve [mean = 73.0; 95% CI (66.5-81.0)] and Jackknifefree-response receiver operating characteristics figure-of-merit [mean = 45.0; 95% CI (40.0-50.0)] when compared with the less and more experienced screening radiologists, whilst no difference in specificity [mean = 75.0; 95% CI (70.0- 81.0)] was found. No significant differences in all measured diagnostic accuracy were found between the two groups of screening radiologists.

CONCLUSION:

The mammographic performance of a group of radiologists from a country without screening program was suboptimal compared with radiologists from Australia.

ADVANCES IN KNOWLEDGE:

Identifying mammographic performance in developing countries is required to optimize breast cancer diagnosis.

Collapse

Seely JM, Alhassan T. Screening for breast cancer in 2018-what should we be doing today? ACTA ACUST UNITED AC 2018;25:S115-S124. [PMID: 29910654 DOI: 10.3747/co.25.3770] [Citation(s) in RCA: 112] [Impact Index Per Article: 18.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]

Assessing Resident Performance in Screening Mammography: Development of a Quantitative Algorithm. Acad Radiol 2018;25:659-664. [PMID: 29366681 DOI: 10.1016/j.acra.2017.11.006] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2017] [Revised: 11/16/2017] [Accepted: 11/21/2017] [Indexed: 11/23/2022]

Abstract

RATIONALE AND OBJECTIVES

This study aims to provide objective performance data and feedback, including examination volumes, recall rates, and concordance with faculty interpretations, for residents performing independent interpretation of screening mammography examinations.

METHOD AND MATERIALS

Residents (r) and faculty (f) interpret screening mammograms separately and identify non-callbacks (NCBs) and callbacks (CBs). Residents review all discordant results. The number of concordant interpretations (fCB-rCB and fNCB-rNCB) and discordant interpretations (fCB-rNCB and fNCB-rCB) are entered into a macro-driven spreadsheet. These macros weigh the data dependent on the perceived clinical impact of the resident's decision. Weighted outcomes are combined with volumes to generate a weighted mammography performance score. Rotation-specific goals are assigned for the weighted score, screening volumes, recall rate relative to faculty, and concordance rates. Residents receive one point for achieving each goal.

RESULTS

Between July 2013 and May 2017, 18,747 mammography examinations were reviewed by 31 residents, in 71 resident rotations, over 246 resident weeks. Mean resident recall rate was 9.9% and significantly decreased with resident level (R), R2 = 11.3% vs R3 = 9.4%, R4 = 9.2%. Mean resident-faculty discordance rate was 10% and significantly decreased from R2 = 12% to R4 = 9.6%. Weighted performance scores ranged from 1.1 to 2.0 (mean 1.6, standard deviation 0.17), but did not change with rotation experience. Residents had a mean goal achievement score of 2.6 (standard deviation 0.47).

CONCLUSIONS

This method provides residents with easily accessible case-by-case individualized screening outcome data over the longitudinal period of their residency, and provides an objective method of assessing resident screening mammography performance.

Collapse

Vásquez A, Palazuelos G, Pinzon BA, Romero J. Blended Learning in Radiology: Evaluation of a Nationwide Training Program on Breast Imaging. J Am Coll Radiol 2018;15:458-462. [PMID: 29301727 DOI: 10.1016/j.jacr.2017.11.008] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2017] [Revised: 10/24/2017] [Accepted: 11/02/2017] [Indexed: 12/21/2022]

Carter BW, Steele JR, Sun J, Wu CC. Analysis of the Completeness and Clarity of Free-Form Radiology Dictations for the Reporting of Pulmonary Embolism. J Am Coll Radiol 2017;14:1556-1559. [DOI: 10.1016/j.jacr.2017.03.009] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2017] [Revised: 03/08/2017] [Accepted: 03/09/2017] [Indexed: 01/18/2023]

Karpinski J, Ajjawi R, Moreau K. Fellowship training: a qualitative study of scope and purpose across one department of medicine. BMC MEDICAL EDUCATION 2017;17:223. [PMID: 29157228 PMCID: PMC5697383 DOI: 10.1186/s12909-017-1062-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/09/2017] [Accepted: 11/07/2017] [Indexed: 06/01/2023]

Abstract

BACKGROUND

Fellowship training follows certification in a primary specialty or subspecialty and focusses on distinct and advanced clinical and/or academic skills. This phase of medical education is growing in prevalence, but has been an "invisible phase of postgraduate training" lacking standards for education and accreditation, as well as funding. We aimed to explore fellowship programs and examine the reasons to host and participate in fellowship training, seeking to inform the future development of fellowship education.

METHODS

During the 2013-14 academic year, we conducted interviews and focus groups to examine the current status of fellowship training from the perspectives of division heads, fellowship directors and current fellows at the Department of Medicine, University of Ottawa, Canada. Descriptive statistics were used to depict the prevailing status of fellowship training. A process of data reduction, data analysis and conclusions/verifications was performed to analyse the quantitative data.

RESULTS

We interviewed 16 division heads (94%), 15 fellowship directors (63%) and 8 fellows (21%). We identified three distinct types of fellowships. Individualized fellowships focus on the career goals of the trainee and/or the recruitment goals of the division. Clinical fellowships focus on the attainment of clinical expertise over and above the competencies of residency. Research fellowships focus on research productivity. Participants identified a variety of reasons to offer fellowships: improve academic productivity; improve clinical productivity; share/develop enhanced clinical expertise; recruit future faculty members/attain an academic position; enhance the reputation of the division/department/trainee; and enhance the scholarly environment.

CONCLUSIONS

Fellowships serve a variety of purposes which benefit both individual trainees as well as the academic enterprise. Fellowships can be categorized within a distinct taxonomy: individualized; clinical; and research. Each type of fellowship may serve a variety of purposes, and each may need distinct support and resources. Further research is needed to catalogue the operational requirements for hosting and undertaking fellowship training, and establish recommendations for educational and administrative policy and processes in this new phase of postgraduate education.

Collapse

Lourenco AP, DiFlorio-Alexander RM, Slanetz PJ. Breast Density Legislation in New England: A Survey Study of Practicing Radiologists. Acad Radiol 2017;24:1265-1267. [PMID: 28495213 DOI: 10.1016/j.acra.2017.03.009] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2017] [Revised: 03/10/2017] [Accepted: 03/11/2017] [Indexed: 10/19/2022]