1
|
Akwo JD, Trieu P, Lewis S. Does the availability of prior mammograms improve radiologists' observer performance?-a scoping review. BJR Open 2023; 5:20230038. [PMID: 37942498 PMCID: PMC10630973 DOI: 10.1259/bjro.20230038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2023] [Revised: 07/31/2023] [Accepted: 08/01/2023] [Indexed: 11/10/2023] Open
Abstract
Objective The objective of this review was to examine the impact of previous mammogram availability on radiologists' performance from screening populations and experimental studies. Materials and Methods A search of the literature was conducted using five databases: MEDLINE, PubMed, Web of Science, ScienceDirect, and CINAHL as well as Google and reference lists of articles. Keywords were combined with "AND" or "OR" or "WITH" and included "prior mammograms, diagnostic performance, initial images, diagnostic efficacy, subsequent images, previous imaging, and radiologist's performance". Studies that assessed the impact of previous mammogram availability on radiologists' performance were reviewed. The Standard for Reporting Diagnostic Accuracy guidelines was used to critically appraise individual sources of evidence. Results A total of 15 articles were reviewed. The sample of mammogram cases used across these studies varied from 36 to 1,208,051. Prior mammograms did not affect sensitivity [with priors: 62-86% (mean = 73.3%); without priors: 69.4-87.4% (mean = 75.8%)] and cancer detection rate, but increased specificity [with priors: 72-96% (mean = 87.5%); without priors: 63-87% (mean = 80.5%)] and reduced false-positive rates [with priors: 3.7 to 36% (mean = 19.9%); without priors 13.3-49% (mean = 31.4%)], recall rates [with priors: 3.8-57% (mean = 26.6%); without priors: [4.9%-67.5% (mean = 37.9%)], and abnormal interpretation rate decreased by 4% with priors. Evidence for the associations between the availability of prior mammograms and positive-predictive value, area under the curve (AUC) from the receiver operating characteristic curve (ROC) and localisation ROC AUC, and positive-predictive value of recall is limited and unclear. Conclusion Availability of prior mammograms reduces recall rates, false-positive rates, abnormal interpretation rates, and increases specificity without affecting sensitivity and cancer detection rate.
Collapse
Affiliation(s)
| | - Phuong Trieu
- Medical Image Optimisation and Perception Group, Sydney School of Health Sciences, Faculty of Medicine and Health, The University of Sydney, Camperdown, Australia
| | - Sarah Lewis
- Medical Image Optimisation and Perception Group, Sydney School of Health Sciences, Faculty of Medicine and Health, The University of Sydney, Camperdown, Australia
| |
Collapse
|
2
|
Murray E, Velleman S, Preston JL, Heard R, Shibu A, McCabe P. The Reliability of Expert Diagnosis of Childhood Apraxia of Speech. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2023:1-18. [PMID: 37642523 DOI: 10.1044/2023_jslhr-22-00677] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/31/2023]
Abstract
PURPOSE The current standard for clinical diagnosis of childhood apraxia of speech (CAS) is expert clinician judgment. The psychometric properties of this standard are not well understood; however, they are important for improving clinical diagnosis. The purpose of this study is to determine the extent to which experts agree on the clinical diagnosis of CAS using two cohorts of children with mixed speech sound disorders (SSDs). METHOD Speech samples of children with SSDs were obtained from previous and ongoing research from video recordings of children aged 3-8 years (n = 36) and audio recordings of children aged 8-17 years (n = 56). A total of 23 expert, English-speaking clinicians were recruited internationally. Three of these experts rated each speech sample to provide a description of the observed features and a diagnosis. Intrarater reliability was acceptable at 85% agreement. RESULTS Interrater reliability on the presence or absence of CAS among experts was poor both as a categorical diagnosis (κ = .187, 95% confidence interval [CI] [0.089, 0.286]) and on a continuous "likelihood of CAS" scale (0-100; intraclass correlation = .183, 95% CI [.037, .347]). Reliability was similar across the video-recorded and audio-only samples. There was greater agreement on other diagnoses (such as articulation disorder) than on the diagnosis of CAS, although these too did not meet the predetermined standard. Likelihood of CAS was greater in children who presented with more American Speech-Language-Hearing Association CAS consensus features. CONCLUSIONS Different expert raters had different thresholds for applying the diagnosis of CAS. If expert clinician judgment is to be used for diagnosis of CAS or other SSDs, further standardization and calibration is needed to increase interrater reliability. Diagnosis may require operationalized checklists or reliable measures that operate along a diagnostic continuum. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.23949105.
Collapse
Affiliation(s)
- Elizabeth Murray
- The University of Sydney, New South Wales, Australia
- Remarkable Speech + Movement, Sydney, New South Wales, Australia
| | | | | | - Robert Heard
- The University of Sydney, New South Wales, Australia
| | - Akhila Shibu
- The University of Sydney, New South Wales, Australia
| | | |
Collapse
|
3
|
Clerkin N, Ski CF, Brennan PC, Strudwick R. Identification of factors associated with diagnostic performance variation in reporting of mammograms: A review. Radiography (Lond) 2023; 29:340-346. [PMID: 36731351 DOI: 10.1016/j.radi.2023.01.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2022] [Revised: 12/13/2022] [Accepted: 01/04/2023] [Indexed: 02/01/2023]
Abstract
OBJECTIVES This narrative review aims to identify what factors are linked to diagnostic performance variation for those who interpret mammograms. Identification of influential factors has potential to contribute to the optimisation of breast cancer diagnosis. PubMed, ScienceDirect and Google Scholar databases were searched using the following terms: 'Radiology', 'Radiologist', 'Radiographer', 'Radiography', 'Mammography', 'Interpret', 'read', 'observe' 'report', 'screen', 'image', 'performance' and 'characteristics.' Exclusion criteria included articles published prior to 2000 as digital mammography was introduced at this time. Non-English articles language were also excluded. 38 of 2542 studies identified were analysed. KEY FINDINGS Influencing factors included, new technology, volume of reads, experience and training, availability of prior images, social networking, fatigue and time-of-day of interpretation. Advancements in breast imaging such as digital breast tomosynthesis and volume of mammograms are primary factors that affect performance as well as tiredness, time-of-day when images are interpreted, stages of training and years of experience. Recent studies emphasised the importance of social networking and knowledge sharing if breast cancer diagnosis is to be optimised. CONCLUSION It was demonstrated that data on radiologist performance variability is widely available but there is a paucity of data on radiographers who interpret mammographic images. IMPLICATIONS FOR PRACTICE This scarcity of research needs to be addressed in order to optimise radiography-led reporting and set baseline values for diagnostic efficacy.
Collapse
Affiliation(s)
- N Clerkin
- University of Suffolk, Waterfront Building, 19 Neptune Quay, Ipswich IP4 1QJ, United Kingdom.
| | - C F Ski
- University of Suffolk, Waterfront Building, 19 Neptune Quay, Ipswich IP4 1QJ, United Kingdom
| | - P C Brennan
- University of Sydney, Cumberland Campus, 75 East St, Lidcombe, NSW, 2141, Australia
| | - R Strudwick
- University of Suffolk, Waterfront Building, 19 Neptune Quay, Ipswich IP4 1QJ, United Kingdom
| |
Collapse
|
4
|
Yapp KE, Suleiman M, Brennan P, Ekpo E. Periapical Radiography versus Cone Beam Computed Tomography in Endodontic Disease Detection: A Free-response, Factorial Study. J Endod 2023; 49:419-429. [PMID: 36773745 DOI: 10.1016/j.joen.2023.02.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Revised: 01/17/2023] [Accepted: 02/01/2023] [Indexed: 02/11/2023]
Abstract
AIM To assess and compare reader performance in interpreting digital periapical (PA) radiography and cone beam computed tomography (CBCT) in endodontic disease detection, using a free-response, factorial model. MATERIALS AND METHODS A reader performance study of 2 image test sets was undertaken using a factorial, free-response design, accounting for the independent variables: case type, case severity, reader type, and imaging modality. Twenty-two readers interpreted 60 PA and 60 CBCT images divided into 5 categories: diseased-subtle, diseased-moderate, diseased-obvious, nondiseased-subtle, and nondiseased-obvious. Lesion localization fraction, specificity, false positive (FP) marks, and the weighted alternative free-response receiver operating characteristic figure of merit were calculated. RESULTS CBCT had greater specificity than PA in the obvious nondiseased cases (P = .01) and no significant difference in the subtle nondiseased category. Weighted alternative free-response receiver operating characteristic values were higher for PA than CBCT in the subtle diseased (P = .02) and moderate diseased (P = .01) groups with no significant difference between in the obvious diseased groups. CBCT had higher mean FPs than PA (P < .05) in subtle diseased cases. Mean lesion localization fraction in the moderate diseased group was higher in PA than CBCT (P = .003). No relationships were found between clinical experience and all diagnostic performance measures, except for in the obvious diseased CBCT group, where increasing experience was associated mean FP marks (P = .04). CONCLUSIONS Reader performance in the detection of endodontic disease is better with PA radiography than CBCT. Clinical experience does not impact upon the accuracy of interpretation of both PA radiography and CBCT.
Collapse
Affiliation(s)
- Kehn E Yapp
- Medical Image Optimisation and Perception Group (MIOPeG), Discipline of Medical Imaging Science, Faculty of Medicine and Health, School of Health Sciences, The University of Sydney, Camperdown, New South Wales, Australia.
| | - Mo'ayyad Suleiman
- Medical Image Optimisation and Perception Group (MIOPeG), Discipline of Medical Imaging Science, Faculty of Medicine and Health, School of Health Sciences, The University of Sydney, Camperdown, New South Wales, Australia
| | - Patrick Brennan
- Medical Image Optimisation and Perception Group (MIOPeG), Discipline of Medical Imaging Science, Faculty of Medicine and Health, School of Health Sciences, The University of Sydney, Camperdown, New South Wales, Australia
| | - Ernest Ekpo
- Medical Image Optimisation and Perception Group (MIOPeG), Discipline of Medical Imaging Science, Faculty of Medicine and Health, School of Health Sciences, The University of Sydney, Camperdown, New South Wales, Australia
| |
Collapse
|
5
|
The effect of clinical history on diagnostic performance of endodontic cone-beam CT interpretation. Clin Radiol 2023; 78:e433-e441. [PMID: 36702710 DOI: 10.1016/j.crad.2022.12.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Revised: 11/21/2022] [Accepted: 12/09/2022] [Indexed: 01/12/2023]
Abstract
AIM To assess the effect of clinical history on the interpretation of endodontic disease in dental cone-beam computed tomography (CBCT). MATERIALS AND METHODS A reader performance study of an image test-set was undertaken using a factorial, free-response, crossover design, accounting for the independent variables: case type, case severity, reader type, and reading modality. Twenty-three readers interpreted 60 CBCT images twice over two reading sessions using a balanced design, once with access to clinical history and once without, where 30 in each session included history. Lesion localisations, specificity, false-positive marks and the weighted alternative free-response receiver operating characteristic (wAFROC1) figure of merit were calculated. RESULTS Clinical history had no significant effect on specificity and false-positive rates in non-diseased cases (p>0.05), but improved lesion localisation in subtle and obvious diseased cases (p<0.01). wAFROC1 values were higher with clinical history for subtle (0.58 versus 0.48; p<0.001) and obvious (0.77 versus 0.71; p=0.006) diseased categories. No associations were observed between clinical history and both readers' years of experience and reading volume in the non-diseased categories. Readers with fewer (p=0.03) and moderate (p=0.008) years of experience and low (p=0.002) CBCT reading volume demonstrated better lesion localisation in subtle diseased cases when clinical history was available. CONCLUSIONS Clinical history improved the interpretation of CBCT images with disease without affecting the interpretation of images without disease. Less and moderately experienced readers and low-volume readers benefitted more from availability of clinical history.
Collapse
|
6
|
Yapp KE, Brennan P, Ekpo E. The Effect of Clinical History on Diagnostic Imaging Interpretation - A Systematic Review. Acad Radiol 2022; 29:255-266. [PMID: 33183952 DOI: 10.1016/j.acra.2020.10.021] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2020] [Revised: 10/21/2020] [Accepted: 10/21/2020] [Indexed: 12/25/2022]
Abstract
RATIONALE AND OBJECTIVES To provide updated information on the effect of clinical history on diagnostic image interpretation and to provide study methodology and design recommendations for future studies assessing the effect of clinical history on diagnostic image performance. MATERIALS AND METHODS A literature search of Medline, Embase, Scopus, Web of Science, and the Cochrane Central Register of Controlled Trials (CENTRAL) databases was conducted from database inception to July 21, 2020. Studies comparing diagnostic imaging performance with and without clinical history, using observers reading images under both conditions that used an independent reference standard were included. RESULTS Twenty-two studies met the inclusion criteria, with 15 showing clinical history improved diagnostic performance. One study reported a decrease in diagnostic performance with clinical history and the remaining six studies found no significant change in performance. Two studies used the free response paradigm with both reporting clinical history increased location sensitivity, decreased specificity and had no overall change in diagnostic performance. The disease spectrum of included cases was largely unreported and a balanced reading design was not used in 19 studies. CONCLUSION Most published studies found that clinical history improved diagnostic performance. More recent studies accounting for abnormality location and multiple abnormalities showed an increase in false positives and no significant change in overall diagnostic performance with clinical history.
Collapse
Affiliation(s)
- Kehn E Yapp
- Medical Image Optimisation and Perception Group (MIOPeG), Discipline of Medical Imaging Science, School of Health Sciences, Faculty of Medicine and Health, The University of Sydney, Australia.
| | - Patrick Brennan
- Medical Image Optimisation and Perception Group (MIOPeG), Discipline of Medical Imaging Science, School of Health Sciences, Faculty of Medicine and Health, The University of Sydney, Australia
| | - Ernest Ekpo
- Medical Image Optimisation and Perception Group (MIOPeG), Discipline of Medical Imaging Science, School of Health Sciences, Faculty of Medicine and Health, The University of Sydney, Australia
| |
Collapse
|
7
|
Brancato B, Peruzzi F, Saieva C, Schiaffino S, Catarzi S, Risso GG, Cozzi A, Carriero S, Calabrese M, Montemezzi S, Zuiani C, Sardanelli F. Mammography self-evaluation online test for screening readers: an Italian Society of Medical Radiology (SIRM) initiative. Eur Radiol 2021; 32:1624-1633. [PMID: 34480624 DOI: 10.1007/s00330-021-08241-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2021] [Revised: 07/06/2021] [Accepted: 07/23/2021] [Indexed: 11/25/2022]
Abstract
OBJECTIVES To report and analyse the characteristics and performance of the first cohort of Italian radiologists completing the national mammography self-evaluation online test established by the Italian Society of Medical Radiology (SIRM). METHODS A specifically-built dataset of 132 mammograms (24 with screen-detected cancers and 108 negative cases) was preliminarily tested on 48 radiologists to define pass thresholds (62% sensitivity and 86% specificity) and subsequently made available online to SIRM members during a 13-month timeframe between 2018 and 2019. Associations between participants' characteristics, pass rates, and diagnostic accuracy were then investigated with descriptive statistics and univariate and multivariable regression analyses. RESULTS A total of 342 radiologists completed the test, 151/342 (44.2%) with success. All individual variables, except gender, showed a significant correlation with pass rates and diagnostic sensitivity, confirmed by univariate logistic regression, while only involvement in organised screening programs and number of mammograms read per year showed a positive association with specificity at univariate logistic regression. In the multivariable regression analysis, fewer variables remained significant: > 3000 mammograms read per year for success rate; female gender, public practice setting, and higher experience self-judgement for sensitivity; no variables were significantly associated with specificity. CONCLUSIONS This national self-evaluation test effectively differentiated multiple aspects of mammographic reading experience, but specific breast imaging experience was shown not to strictly guarantee good diagnostic accuracy. Due to its easy use and the validity of obtained results, this test could be extended to all Italian breast radiologists, regardless of their experience, also as a Breast Unit accreditation criterion. KEY POINTS • This self-evaluation test was found to be able to differentiate various degrees of mammographic interpretation experience. • Breast cancer screening readers should undergo a self-assessment test, since experience parameters alone do not guarantee diagnostic ability.
Collapse
Affiliation(s)
- Beniamino Brancato
- Unit of Breast Imaging, Istituto per lo Studio, la Prevenzione e la Rete Oncologica - ISPRO, Via Cosimo il Vecchio 2, 50139, Firenze, Italy.
| | - Francesca Peruzzi
- Department of Diagnostic Imaging, Azienda Ospedaliero Universitaria Pisana, Via Paradisa 2, 56124, Pisa, Italy
| | - Calogero Saieva
- Cancer Risk Factors and Lifestyle Epidemiology Unit, Molecular and Lifestyle Epidemiology Branch, Istituto per lo Studio, la Prevenzione e la Rete Oncologica - ISPRO, Via Cosimo il Vecchio 2, 50139, Firenze, Italy
| | - Simone Schiaffino
- Unit of Radiology, IRCCS Policlinico San Donato, Via Rodolfo Morandi 30, 20097, San Donato Milanese, Italy
| | - Sandra Catarzi
- Unit of Breast Imaging, Istituto per lo Studio, la Prevenzione e la Rete Oncologica - ISPRO, Via Cosimo il Vecchio 2, 50139, Firenze, Italy
| | - Gabriella Gemma Risso
- Unit of Breast Imaging, Istituto per lo Studio, la Prevenzione e la Rete Oncologica - ISPRO, Via Cosimo il Vecchio 2, 50139, Firenze, Italy
| | - Andrea Cozzi
- Department of Biomedical Sciences for Health, Università degli Studi di Milano, Via Luigi Mangiagalli 31, 20133, Milano, Italy
| | - Serena Carriero
- Postgraduate School in Radiodiagnostics, Università degli Studi di Milano, Via Festa del Perdono 7, 20122, Milano, Italy
| | - Massimo Calabrese
- Unit of Breast Imaging, IRCCS Ospedale Policlinico San Martino, Largo Rosanna Benzi 10, 16132, Genova, Italy
| | - Stefania Montemezzi
- Radiology Unit - Department of Pathology and Diagnostics, Azienda Ospedaliera Universitaria Integrata Verona, Piazzale Aristide Stefani 1, 37126, Verona, Italy
| | - Chiara Zuiani
- Department of Medical Area - Institute of Radiology, Università degli Studi di Udine, Piazzale Santa Maria della Misericordia 15, 33100, Udine, Italy
| | - Francesco Sardanelli
- Unit of Radiology, IRCCS Policlinico San Donato, Via Rodolfo Morandi 30, 20097, San Donato Milanese, Italy
- Department of Biomedical Sciences for Health, Università degli Studi di Milano, Via Luigi Mangiagalli 31, 20133, Milano, Italy
| |
Collapse
|