1
|
Nervil GG, Ternov NK, Lorentzen H, Kromann C, Ingvar Å, Nielsen K, Tolsgaard M, Vestergaard T, Hölmich LR. Teledermoscopic triage of melanoma-suspicious skin lesions is safe: A retrospective comparative diagnostic accuracy study with multiple assessors. J Telemed Telecare 2024:1357633X241286003. [PMID: 39387164 DOI: 10.1177/1357633x241286003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/12/2024]
Abstract
BACKGROUND The rising incidence of melanoma and the high number of benign lesions excised due to diagnostic uncertainty highlight the need for effective patient triage. This study assesses the safety and accuracy of teledermoscopic triage on a high-prevalence case set with pre-triaged, challenging, melanoma-suspicious lesions. METHODS Five dermatologists independently reviewed 250 retrospectively extracted patient cases. Teledermoscopy assessments were simulated for panels of 1, 2, 3 and 5 assessors using two distinct consensus strategies, Caution Protocol and Majority Vote, and the sensitivity and specificity of the patient triages were calculated. RESULTS Triage by a single teledermatologist showed a sensitivity of 92.3% and a specificity of 58.7%. Sensitivity improved with the number of assessors, particularly when using the Caution Protocol, though with a considerable drop in specificity. The Majority Vote showed a more balanced improvement in sensitivity and specificity. Safety analyses indicated that diagnostic accuracy decreased with poor image quality and increased case difficulty. DISCUSSION Expert teledermoscopic triage of melanocytic skin lesions is highly sensitive and lowers the need for unnecessary excision procedures by half while dismissing as few as 0.4% (95% confidence interval 0-0.6%) of melanomas, even when applied to a high-prevalence pre-triaged subpopulation. Implementation of safety procedures increases accuracy. Using multiple teledermatologists increases sensitivity but at the cost of specificity unless a Majority Vote consensus strategy is applied. Future teledermoscopy guidelines should encompass safety procedures and protocols for disagreement between assessors.
Collapse
Affiliation(s)
- Gustav Gede Nervil
- Department of Plastic Surgery, Copenhagen University Hospital - Herlev and Gentofte, Copenhagen, Denmark
| | - Niels Kvorning Ternov
- Department of Plastic Surgery, Copenhagen University Hospital - Herlev and Gentofte, Copenhagen, Denmark
| | - Henrik Lorentzen
- Department of Dermatology and Allergy Centre, Odense University Hospital, Odense, Denmark
| | | | - Åsa Ingvar
- Department of Dermatology Lund, Skåne University Hospital, Skåne, Sweden
- Lund University Skin Cancer Research Group, Lund University, Lund, Sweden
- Department of Clinical Sciences, Dermatology, Lund University, Lund, Sweden
| | - Kari Nielsen
- Department of Dermatology Lund, Skåne University Hospital, Skåne, Sweden
- Lund University Skin Cancer Research Group, Lund University, Lund, Sweden
- Department of Clinical Sciences, Dermatology, Lund University, Lund, Sweden
- Department of Dermatology, Helsingborg Hospital, Helsingborg, Sweden
| | - Martin Tolsgaard
- Copenhagen Academy for Medical Education and Simulation, Copenhagen University Hospital Rigshospitalet, Copenhagen, Denmark
- Department of Obstetrics, Copenhagen University Hospital Rigshospitalet, Copenhagen, Denmark
- Department of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark
| | - Tine Vestergaard
- Department of Dermatology and Allergy Centre, Odense University Hospital, Odense, Denmark
| | - Lisbet Rosenkrantz Hölmich
- Department of Plastic Surgery, Copenhagen University Hospital - Herlev and Gentofte, Copenhagen, Denmark
- Department of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
2
|
Rawath R M, Agrawal A, Kalyanpur A. Assessing the impact of trained Radiologist Assistants in a busy emergency teleradiology practice: a comprehensive evaluation. Emerg Radiol 2024; 31:677-685. [PMID: 38990429 DOI: 10.1007/s10140-024-02264-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2024] [Accepted: 07/01/2024] [Indexed: 07/12/2024]
Abstract
PURPOSE This study aims to study the feasibility and usefulness of trained Radiologist Assistants in a busy emergency teleradiology practice. METHOD This is a retrospective study over a 21-month period (January 2021 to September 2022). The study analysed archived data from 247118 peer review studies performed by Radiologist Assistants (RAs) out of a total case volume of 828526 and evaluated the rate of discrepancies, the study types commonly noted to have discrepancies, and the severity of errors. These missed findings were brought to the attention of the radiologists for approval and further decision-making. RESULTS Peer review by RAs was performed on 30% (n = 247118) of the total volume 828526 studies reported, and yielded additional findings including but not limited to fractures (218; 23%), hemorrhage,(94; 10%) pulmonary thromboembolism, (n = 104; 11%), Calculus (n = 75; 8%) lesion (n = 66; 5%), appendicitis(n = 50; 4%) and others. These were brought to the attention of the radiologist, who agreed in 97% (1279 out of 1318) of cases, and communicated the same to the referring facility, with an addended report. CONCLUSION Trained RAs can provide value to the peer review program of a busy teleradiology practice and decrease errors. This may be useful to meet the ongoing radiologist shortages.
Collapse
Affiliation(s)
- Muktha Rawath R
- Clinical Operations Department, Teleradiology Solutions, Bengaluru, India.
| | - Anjali Agrawal
- Clinical Operations Department, Teleradiology Solutions, Bengaluru, India
| | - Arjun Kalyanpur
- Clinical Operations Department, Teleradiology Solutions, Bengaluru, India
| |
Collapse
|
3
|
Brown EG, Adkins ES, Mattei P, Hoffer FA, Wootton-Gorges SL, London WB, Naranjo A, Schmidt ML, Hogarty MD, Irwin MS, Cohn SL, Park JR, Maris JM, Bagatell R, Twist CJ, Nuchtern JG, Davidoff AM, Newman EA, Lal DR. Evaluation of Image-Defined Risk Factor (IDRF) Assessment in Patients With Intermediate-risk Neuroblastoma: A Report From the Children's Oncology Group Study ANBL0531. J Pediatr Surg 2024:161896. [PMID: 39317567 DOI: 10.1016/j.jpedsurg.2024.161896] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/21/2024] [Accepted: 09/03/2024] [Indexed: 09/26/2024]
Abstract
BACKGROUND The International Neuroblastoma Risk Group (INRG) classifier utilizes a staging system based on pretreatment imaging criteria in which image-defined risk factors (IDRFs) are used to evaluate the extent of locoregional disease. Children's Oncology Group (COG) study ANBL0531 prospectively examined institutional determination of IDRF status and compared that to a standardized central review. METHODS Between 9/2009-6/2011, patients with intermediate-risk neuroblastoma were enrolled on ANBL0531 and had IDRF assessment at treating institutions. Paired COG pediatric surgeons and radiologists performed blinded central review of diagnostic imaging for the presence or absence of IDRFs. Second blinded review was performed in cases of discordance. Comparison of local and central review was performed using the Kappa coefficient to determine concordance in IDRF assessment. RESULTS 211 patients enrolled in ANBL0531 underwent IDRF assessment; 3 patients were excluded due to poor image quality. Central reviewer pairs agreed on the presence or absence of any IDRF in 170/208 (81.7%; κ = 0.48) cases. Thirteen (6.3%) cases could not be adjudicated after second blinded review. Radiologists were more likely to identify IRDFs as present than surgeons (p < 0.001). Local and central reviewers agreed on the presence or absence of any IDRF in only108/208 (51.9%; κ = 0.06) cases. CONCLUSIONS Among experienced pediatric surgeons and radiologists participating in central review, concordance was moderate, with agreement in 81.7% of cases. On comparison of local and central assessment of IDRFs, concordance was poor. These data indicate that greater standardization, education, technology, and training are needed to improve the assessment of IDRFs in children with neuroblastoma. LEVEL OF EVIDENCE Treatment Study, Level III.
Collapse
Affiliation(s)
- Erin G Brown
- Division of Pediatric Surgery, Department of Surgery, University of California Davis Children's Hospital, Sacramento, CA, USA.
| | - E Stanton Adkins
- Department of Pediatrics, University of South Carolina, Columbia, SC, USA
| | - Peter Mattei
- Division of Pediatic Surgery, Department of Surgery, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Fredric A Hoffer
- Department of Radiology, University of Washington, Seattle, WA, USA
| | - Sandra L Wootton-Gorges
- Department of Radiology, University of California Davis Children's Hospital, Sacramento, CA, USA
| | - Wendy B London
- Department of Pediatrics, Dana-Farber Cancer Institute and Boston Children's Hospital, Boston, MA, USA
| | - Arlene Naranjo
- University of Florida Children's Oncology Group Statistics and Data Center, Gainesville, FL, USA
| | - Mary L Schmidt
- Division of Pediatric Hematology and Oncology, Department of Pediatrics, University of Illinois Cancer Center, Chicago, IL, USA
| | - Michael D Hogarty
- Division of Pediatric Hematology and Oncology, Department of Pediatrics, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Meredith S Irwin
- Division of Pediatric Hematology and Oncology, Department of Pediatrics, The Hospital for Sick Children, Toronto, ON, USA
| | - Susan L Cohn
- Division of Pediatric Hematology and Oncology, Department of Pediatrics, University of Chicago, Chicago, IL, USA
| | - Julie R Park
- Division of Pediatric Hematology and Oncology, Department of Pediatrics, St. Jude Children's Research Center, Memphis, TN, USA
| | - John M Maris
- Division of Oncology, Department of Pediatrics, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Rochelle Bagatell
- Division of Pediatric Hematology and Oncology, Department of Pediatrics, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Clare J Twist
- Division of Pediatric Hematology and Oncology, Department of Pediatrics, Roswell Park Comprehensive Cancer Center, Buffalo, NY, USA
| | - Jed G Nuchtern
- Division of Pediatric Surgery, Department of Surgery, Texas Children's Hospital, Houston, TX, USA
| | - Andrew M Davidoff
- Department of Surgery, St. Jude Children's Research Hospital, Memphis, TN, USA
| | - Erika A Newman
- Division of Pediatric Surgery, Department of Surgery, CS Mott Children's Hospital, Ann Arbor, MI, USA
| | - Dave R Lal
- Division of Pediatric Surgery, Department of Surgery, Children's Wisconsin, Milwaukee, WI, USA
| |
Collapse
|
4
|
Topff L, Steltenpool S, Ranschaert ER, Ramanauskas N, Menezes R, Visser JJ, Beets-Tan RGH, Hartkamp NS. Artificial intelligence-assisted double reading of chest radiographs to detect clinically relevant missed findings: a two-centre evaluation. Eur Radiol 2024; 34:5876-5885. [PMID: 38466390 PMCID: PMC11364654 DOI: 10.1007/s00330-024-10676-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Revised: 01/21/2024] [Accepted: 02/01/2024] [Indexed: 03/13/2024]
Abstract
OBJECTIVES To evaluate an artificial intelligence (AI)-assisted double reading system for detecting clinically relevant missed findings on routinely reported chest radiographs. METHODS A retrospective study was performed in two institutions, a secondary care hospital and tertiary referral oncology centre. Commercially available AI software performed a comparative analysis of chest radiographs and radiologists' authorised reports using a deep learning and natural language processing algorithm, respectively. The AI-detected discrepant findings between images and reports were assessed for clinical relevance by an external radiologist, as part of the commercial service provided by the AI vendor. The selected missed findings were subsequently returned to the institution's radiologist for final review. RESULTS In total, 25,104 chest radiographs of 21,039 patients (mean age 61.1 years ± 16.2 [SD]; 10,436 men) were included. The AI software detected discrepancies between imaging and reports in 21.1% (5289 of 25,104). After review by the external radiologist, 0.9% (47 of 5289) of cases were deemed to contain clinically relevant missed findings. The institution's radiologists confirmed 35 of 47 missed findings (74.5%) as clinically relevant (0.1% of all cases). Missed findings consisted of lung nodules (71.4%, 25 of 35), pneumothoraces (17.1%, 6 of 35) and consolidations (11.4%, 4 of 35). CONCLUSION The AI-assisted double reading system was able to identify missed findings on chest radiographs after report authorisation. The approach required an external radiologist to review the AI-detected discrepancies. The number of clinically relevant missed findings by radiologists was very low. CLINICAL RELEVANCE STATEMENT The AI-assisted double reader workflow was shown to detect diagnostic errors and could be applied as a quality assurance tool. Although clinically relevant missed findings were rare, there is potential impact given the common use of chest radiography. KEY POINTS • A commercially available double reading system supported by artificial intelligence was evaluated to detect reporting errors in chest radiographs (n=25,104) from two institutions. • Clinically relevant missed findings were found in 0.1% of chest radiographs and consisted of unreported lung nodules, pneumothoraces and consolidations. • Applying AI software as a secondary reader after report authorisation can assist in reducing diagnostic errors without interrupting the radiologist's reading workflow. However, the number of AI-detected discrepancies was considerable and required review by a radiologist to assess their relevance.
Collapse
Affiliation(s)
- Laurens Topff
- Department of Radiology, Netherlands Cancer Institute, Amsterdam, The Netherlands.
- GROW School for Oncology and Reproduction, Maastricht University, Maastricht, The Netherlands.
| | - Sanne Steltenpool
- Department of Radiology and Nuclear Medicine, Erasmus MC, University Medical Center Rotterdam, Rotterdam, The Netherlands
- Department of Radiology, Elisabeth-TweeSteden Hospital, Tilburg, The Netherlands
| | - Erik R Ranschaert
- Department of Radiology, St. Nikolaus Hospital, Eupen, Belgium
- Ghent University, Ghent, Belgium
| | - Naglis Ramanauskas
- Oxipit UAB, Vilnius, Lithuania
- Department of Radiology, Nuclear Medicine and Medical Physics, Institute of Biomedical Sciences, Faculty of Medicine, Vilnius University, Vilnius, Lithuania
| | - Renee Menezes
- Biostatistics Centre, Department of Psychosocial Research and Epidemiology, Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Jacob J Visser
- Department of Radiology and Nuclear Medicine, Erasmus MC, University Medical Center Rotterdam, Rotterdam, The Netherlands
| | - Regina G H Beets-Tan
- Department of Radiology, Netherlands Cancer Institute, Amsterdam, The Netherlands
- GROW School for Oncology and Reproduction, Maastricht University, Maastricht, The Netherlands
| | - Nolan S Hartkamp
- Department of Radiology, Elisabeth-TweeSteden Hospital, Tilburg, The Netherlands
| |
Collapse
|
5
|
Batheja V, Javan R, Awan OA. Understanding and Embracing Error in Diagnostic Radiology. Acad Radiol 2024:S1076-6332(24)00457-4. [PMID: 39155156 DOI: 10.1016/j.acra.2024.07.023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2024] [Accepted: 07/15/2024] [Indexed: 08/20/2024]
Affiliation(s)
- Vivek Batheja
- George Washington School of Medicine and Health Sciences, Washington DC (V.B.)
| | - Ramin Javan
- George Washington University Hospital, Washington, DC (R.J.)
| | - Omer A Awan
- University of Maryland School of Medicine, 655 W Baltimore Street, Baltimore, MD 21201 (O.A.A.).
| |
Collapse
|
6
|
Combs CA, Amara S, Kline C, Ashimi Balogun O, Bowman ZS. Quantitative Approach to Quality Review of Prenatal Ultrasound Examinations: Fetal Biometry. J Clin Med 2024; 13:4860. [PMID: 39201002 PMCID: PMC11355637 DOI: 10.3390/jcm13164860] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2024] [Revised: 08/02/2024] [Accepted: 08/13/2024] [Indexed: 09/02/2024] Open
Abstract
Background/Objectives: To evaluate the quality of an ultrasound practice, both large-scale and focused audits are recommended by professional organizations, but such audits can be time-consuming, inefficient, and expensive. Our objective was to develop a time-efficient, quantitative, objective, large-scale method to evaluate fetal biometry measurements for an entire practice, combined with a process for focused image review for personnel whose measurements are outliers. Methods: Ultrasound exam data for a full year are exported from commercial ultrasound reporting software to a statistical package. Fetal biometry measurements are converted to z-scores to standardize across gestational ages. For a large-scale audit, sonographer mean z-scores are compared using analysis of variance (ANOVA) with Scheffe multiple comparisons test. A focused image review is performed on a random sample of exams for sonographers whose mean z-scores differ significantly from the practice mean. A similar large-scale audit is performed, comparing physician mean z-scores. Results: Using fetal abdominal circumference measurements as an example, significant differences between sonographer mean z-scores are readily identified by the ANOVA and Scheffe test. A method is described for the blinded image audit of sonographers with outlier mean z-scores. Examples are also given for the identification and interpretation of several types of systematic errors that are unlikely to be detectable by image review, including z-scores with large or small standard deviations and physicians with outlier mean z-scores. Conclusions: The large-scale quantitative analysis provides an overview of the biometry measurements of all the sonographers and physicians in a practice, so that image audits can be focused on those whose measurements are outliers. The analysis takes little time to perform after initial development and avoids the time, complexity, and expense of auditing providers whose measurements fall within the expected range. We encourage commercial software developers to include tools in their ultrasound reporting software to facilitate such quantitative reviews.
Collapse
Affiliation(s)
- C. Andrew Combs
- The Pediatrix Center for Research, Education, Quality & Safety, Sunrise, FL 33323, USA
- Obstetrix of California, Campbell, CA 95008, USA
| | - Sushma Amara
- Eastside Maternal-Fetal Medicine, Bellevue, WA 98004, USA
| | - Carolyn Kline
- Eastside Maternal-Fetal Medicine, Bellevue, WA 98004, USA
| | | | | |
Collapse
|
7
|
Sivan Sulaja J, Kannath SK, Kalaparti Sri Venkata Ganesh V, Thomas B. Evaluation of multiple deep neural networks for detection of intracranial dural arteriovenous fistula on susceptibility weighted angiography imaging. Neuroradiol J 2024:19714009241269491. [PMID: 39089849 DOI: 10.1177/19714009241269491] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/04/2024] Open
Abstract
BACKGROUND The natural history of intracranial dural arteriovenous fistula (DAVF) is variable and early diagnosis is crucial in order to positively impact the clinical course of aggressive DAVF. Artificial intelligence (AI) based techniques can be promising in this regard, and in this study, we used various deep neural network (DNN) architectures to determine whether DAVF could be reliably identified on susceptibility-weighted angiography images (SWAN). MATERIALS AND METHODS A total of 3965 SWAN image slices from 30 digital subtraction angiographically proven DAVF patients and 4380 SWAN image slices from 40 age-matched patients with normal MRI findings as control group were included. The images were categorized as either DAVF or normal and the data was trained using various DNN such as VGG-16, EfficientNet-B0, and ResNet-50. RESULTS Various DNN architectures showed the accuracy of 95.96% (VGG-16), 91.75% (EfficientNet-B0), and 86.23% (ResNet-50) on the SWAN image dataset. ROC analysis yielded an area under the curve of 0.796 (p < .001), best for VGG-16 model. Criterion of seven consecutive positive slices for DAVF diagnosis yielded a sensitivity of 74.68% with a specificity of 69.15%, while setting eight slices improved the sensitivity to above 80.38%, with a decrease of specificity up to 56.38%. Based on seven consecutive positive slices criteria, EfficientNet-B0 yielded a sensitivity of 73.21% with a specificity of 45.92% and ResNet-50 yielded a sensitivity of 72.39% with a specificity of 67.42%. CONCLUSION This study shows that DNN can extract discriminative features of SWAN for the classification of DAVF from normal with good accuracy, reasonably good sensitivity and specificity.
Collapse
Affiliation(s)
- Jithin Sivan Sulaja
- Department of Imaging Sciences and Interventional Radiology, Sree Chitra Tirunal Institute for Medical Sciences and Technology, Kerala, India
| | - Santhosh K Kannath
- Department of Imaging Sciences and Interventional Radiology, Sree Chitra Tirunal Institute for Medical Sciences and Technology, Kerala, India
| | | | - Bejoy Thomas
- Department of Imaging Sciences and Interventional Radiology, Sree Chitra Tirunal Institute for Medical Sciences and Technology, Kerala, India
| |
Collapse
|
8
|
Schaffer AC, Zawi T, Einbinder JS, Sato L, Sodickson AD. Assessment of Claimant, Clinical, and Financial Characteristics of Teleradiology Medical Malpractice Cases. Radiology 2024; 311:e232806. [PMID: 38563670 DOI: 10.1148/radiol.232806] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/04/2024]
Abstract
Background The increasing use of teleradiology has been accompanied by concerns relating to risk management and patient safety. Purpose To compare characteristics of teleradiology and nonteleradiology radiology malpractice cases and identify contributing factors underlying these cases. Materials and Methods In this retrospective analysis, a national database of medical malpractice cases was queried to identify cases involving telemedicine that closed between January 2010 and March 2022. Teleradiology malpractice cases were identified based on manual review of cases in which telemedicine was coded as one of the contributing factors. These cases were compared with nonteleradiology cases that closed during the same time period in which radiology had been determined to be the primary responsible clinical service. Claimant, clinical, and financial characteristics of the cases were recorded, and continuous or categorical data were compared using the Wilcoxon rank-sum test or Fisher exact test, respectively. Results This study included 135 teleradiology and 3474 radiology malpractices cases. The death of a patient occurred more frequently in teleradiology cases (48 of 135 [35.6%]) than in radiology cases (685 of 3474 [19.7%]; P < .001). Cerebrovascular disease was a more common final diagnosis in the teleradiology cases (13 of 135 [9.6%]) compared with the radiology cases (124 of 3474 [3.6%]; P = .002). Problems with communication among providers was a more frequent contributing factor in the teleradiology cases (35 of 135 [25.9%]) than in the radiology cases (439 of 3474 [12.6%]; P < .001). Teleradiology cases were more likely to close with indemnity payment (79 of 135 [58.5%]) than the radiology cases (1416 of 3474 [40.8%]; P < .001) and had a higher median indemnity payment than the radiology cases ($339 230 [IQR, $120 790-$731 615] vs $214 063 [IQR, $66 620-$585 424]; P = .01). Conclusion Compared with radiology cases, teleradiology cases had higher clinical and financial severity and were more likely to involve issues with communication. © RSNA, 2024 See also the editorial by Mezrich in this issue.
Collapse
Affiliation(s)
- Adam C Schaffer
- From the CRICO/Risk Management Foundation of the Harvard Medical Institutions, Boston, Mass (A.C.S., T.Z., J.S.E., L.S.); and Department of Medicine (A.C.S., J.S.E., L.S.) and Department of Radiology, Division of Emergency Radiology (A.D.S.), Brigham and Women's Hospital, Harvard Medical School, 75 Francis St, PBB-B-422, Boston, MA 02215
| | - Tarek Zawi
- From the CRICO/Risk Management Foundation of the Harvard Medical Institutions, Boston, Mass (A.C.S., T.Z., J.S.E., L.S.); and Department of Medicine (A.C.S., J.S.E., L.S.) and Department of Radiology, Division of Emergency Radiology (A.D.S.), Brigham and Women's Hospital, Harvard Medical School, 75 Francis St, PBB-B-422, Boston, MA 02215
| | - Jonathan S Einbinder
- From the CRICO/Risk Management Foundation of the Harvard Medical Institutions, Boston, Mass (A.C.S., T.Z., J.S.E., L.S.); and Department of Medicine (A.C.S., J.S.E., L.S.) and Department of Radiology, Division of Emergency Radiology (A.D.S.), Brigham and Women's Hospital, Harvard Medical School, 75 Francis St, PBB-B-422, Boston, MA 02215
| | - Luke Sato
- From the CRICO/Risk Management Foundation of the Harvard Medical Institutions, Boston, Mass (A.C.S., T.Z., J.S.E., L.S.); and Department of Medicine (A.C.S., J.S.E., L.S.) and Department of Radiology, Division of Emergency Radiology (A.D.S.), Brigham and Women's Hospital, Harvard Medical School, 75 Francis St, PBB-B-422, Boston, MA 02215
| | - Aaron D Sodickson
- From the CRICO/Risk Management Foundation of the Harvard Medical Institutions, Boston, Mass (A.C.S., T.Z., J.S.E., L.S.); and Department of Medicine (A.C.S., J.S.E., L.S.) and Department of Radiology, Division of Emergency Radiology (A.D.S.), Brigham and Women's Hospital, Harvard Medical School, 75 Francis St, PBB-B-422, Boston, MA 02215
| |
Collapse
|
9
|
Pierre K, Slater R, Raviprasad A, Griffin I, Talati J, Mathelier M, Sistrom C, Mancuso A, Sabat S. Enhancing Radiology Education With a Case-Based Intro to Radiology on the UF WIDI e-Learning Platform. Curr Probl Diagn Radiol 2024; 53:22-26. [PMID: 37690966 DOI: 10.1067/j.cpradiol.2023.08.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Revised: 07/02/2023] [Accepted: 08/23/2023] [Indexed: 09/12/2023]
Abstract
RATIONALE AND OBJECTIVES This study explores the implementation and efficacy of an online, interactive, case-based radiology education tool, Wisdom in Diagnostic Imaging (WIDI) Case-Based Intro to Radiology (CBIR). We hypothesize that the WIDI CBIR platform would enhance radiology teaching, foster critical thinking, and provide a comprehensive curriculum in imaging interpretation and utilization. MATERIALS AND METHODS A focus group consisting of 1 undergraduate, 7 medical students, 9 physician assistant students, and 3 PhD students participated in this study. We tested 3 different teaching methods: a didactic approach without WIDI, a proctored didactic approach using WIDI, and a flipped classroom approach using WIDI. An online survey was conducted to assess student preference and feedback on these methods and the use of WIDI in their curriculum. RESULTS Most students preferred the proctored didactic approach with WIDI. They reported that the platform complemented their curriculum and encouraged critical thinking. The modules covered adequate clinical and imaging details and enhanced their skills in imaging interpretation. Despite the limitations of a small sample size and reliance on self-reported outcomes, this study indicates that the WIDI platform could be integrated into PA and medical school curricula throughout the US, offering a standardized radiology curriculum. CONCLUSION The UF WIDI appears to be a promising tool for modernizing radiology education, improving imaging interpretation skills, and enhancing appropriate imaging selection among nonradiologist medical learners. WIDI offers case-based education in imaging use, workflow, search-pattern selection, and interpretation of common radiological findings, potentially bridging the gap in radiology education. Further research and larger studies are required to assess the long-term impact on performance and clinical practice.
Collapse
Affiliation(s)
- Kevin Pierre
- Department of Radiology, College of Medicine - University of Florida, Gainesville, FL.
| | - Roberta Slater
- Department of Radiology, College of Medicine - University of Florida, Gainesville, FL
| | - Abheek Raviprasad
- Department of Radiology, College of Medicine - University of Florida, Gainesville, FL
| | - Ian Griffin
- College of Medicine, University of Florida, Gainesville, FL
| | - Jay Talati
- College of Medicine, University of Florida, Gainesville, FL
| | | | - Christopher Sistrom
- Department of Radiology, College of Medicine - University of Florida, Gainesville, FL
| | - Anthony Mancuso
- Department of Radiology, College of Medicine - University of Florida, Gainesville, FL
| | - Shyamsunder Sabat
- Department of Radiology, College of Medicine - University of Florida, Gainesville, FL
| |
Collapse
|
10
|
Bachmann R, Gunes G, Hangaard S, Nexmann A, Lisouski P, Boesen M, Lundemann M, Baginski SG. Improving traumatic fracture detection on radiographs with artificial intelligence support: a multi-reader study. BJR Open 2024; 6:tzae011. [PMID: 38757067 PMCID: PMC11096271 DOI: 10.1093/bjro/tzae011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Revised: 10/13/2023] [Accepted: 04/21/2024] [Indexed: 05/18/2024] Open
Abstract
Objectives The aim of this study was to evaluate the diagnostic performance of nonspecialist readers with and without the use of an artificial intelligence (AI) support tool to detect traumatic fractures on radiographs of the appendicular skeleton. Methods The design was a retrospective, fully crossed multi-reader, multi-case study on a balanced dataset of patients (≥2 years of age) with an AI tool as a diagnostic intervention. Fifteen readers assessed 340 radiographic exams, with and without the AI tool in 2 different sessions and the time spent was automatically recorded. Reference standard was established by 3 consultant radiologists. Sensitivity, specificity, and false positives per patient were calculated. Results Patient-wise sensitivity increased from 72% to 80% (P < .05) and patient-wise specificity increased from 81% to 85% (P < .05) in exams aided by the AI tool compared to the unaided exams. The increase in sensitivity resulted in a relative reduction of missed fractures of 29%. The average rate of false positives per patient decreased from 0.16 to 0.14, corresponding to a relative reduction of 21%. There was no significant difference in average reading time spent per exam. The largest gain in fracture detection performance, with AI support, across all readers, was on nonobvious fractures with a significant increase in sensitivity of 11 percentage points (pp) (60%-71%). Conclusions The diagnostic performance for detection of traumatic fractures on radiographs of the appendicular skeleton improved among nonspecialist readers tested AI fracture detection support tool showed an overall reader improvement in sensitivity and specificity when supported by an AI tool. Improvement was seen in both sensitivity and specificity without negatively affecting the interpretation time. Advances in knowledge The division and analysis of obvious and nonobvious fractures are novel in AI reader comparison studies like this.
Collapse
Affiliation(s)
| | | | - Stine Hangaard
- Department of Radiology, Herlev and Gentofte, Copenhagen University Hospital, Denmark
| | | | | | - Mikael Boesen
- Department of Radiology and Radiological AI Testcenter (RAIT) Denmark, Bispebjerg and Frederiksberg, Copenhagen University Hospital, Denmark
- Department of Clinical Medicine, Faculty of Health, and Medical Sciences, University of Copenhagen, Denmark
| | | | | |
Collapse
|
11
|
Xu X, Jia Q, Yuan H, Qiu H, Dong Y, Xie W, Yao Z, Zhang J, Nie Z, Li X, Shi Y, Zou JY, Huang M, Zhuang J. A clinically applicable AI system for diagnosis of congenital heart diseases based on computed tomography images. Med Image Anal 2023; 90:102953. [PMID: 37734140 DOI: 10.1016/j.media.2023.102953] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2022] [Revised: 08/22/2023] [Accepted: 09/01/2023] [Indexed: 09/23/2023]
Abstract
Congenital heart disease (CHD) is the most common type of birth defect. Without timely detection and treatment, approximately one-third of children with CHD would die in the infant period. However, due to the complicated heart structures, early diagnosis of CHD and its types is quite challenging, even for experienced radiologists. Here, we present an artificial intelligence (AI) system that achieves a comparable performance of human experts in the critical task of classifying 17 categories of CHD types. We collected the first-large CT dataset from three different CT machines, including more than 3750 CHD patients over 14 years. Experimental results demonstrate that it can achieve diagnosis accuracy (86.03%) comparable with junior cardiovascular radiologists (86.27%) in a World Health Organization appointed research and cooperation center in China on most types of CHD, and obtains a higher sensitivity (82.91%) than junior cardiovascular radiologists (76.18%). The accuracy of the combination of our AI system (97.20%) and senior radiologists achieves comparable results to that of junior radiologists and senior radiologists (97.16%) which is the current clinical routine. Our AI system can further provide 3D visualization of hearts to senior radiologists for interpretation and flexible review, surgeons for precise intuition of heart structures, and clinicians for more precise outcome prediction. We demonstrate the potential of our model to be integrated into current clinic practice to improve the diagnosis of CHD globally, especially in regions where experienced radiologists can be scarce.
Collapse
Affiliation(s)
- Xiaowei Xu
- Guangdong Provincial Key Laboratory of South China Structural Heart Disease, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China; Guangdong Cardiovascular Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China
| | - Qianjun Jia
- Guangdong Provincial Key Laboratory of South China Structural Heart Disease, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China; Guangdong Cardiovascular Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China; Department of Catheterization Lab, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China
| | - Haiyun Yuan
- Guangdong Provincial Key Laboratory of South China Structural Heart Disease, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China; Guangdong Cardiovascular Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China; Department of Cardiovascular Surgery, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China
| | - Hailong Qiu
- Guangdong Provincial Key Laboratory of South China Structural Heart Disease, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China; Guangdong Cardiovascular Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China; Department of Cardiovascular Surgery, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China
| | - Yuhao Dong
- Guangdong Provincial Key Laboratory of South China Structural Heart Disease, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China; Guangdong Cardiovascular Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China; Department of Catheterization Lab, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China
| | - Wen Xie
- Guangdong Provincial Key Laboratory of South China Structural Heart Disease, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China; Guangdong Cardiovascular Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China; Department of Cardiovascular Surgery, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China
| | - Zeyang Yao
- Guangdong Provincial Key Laboratory of South China Structural Heart Disease, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China; Guangdong Cardiovascular Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China; Department of Cardiovascular Surgery, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China
| | - Jiawei Zhang
- Guangdong Provincial Key Laboratory of South China Structural Heart Disease, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China; Guangdong Cardiovascular Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China
| | - Zhiqaing Nie
- Guangdong Cardiovascular Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China
| | - Xiaomeng Li
- Department of Electronic and Computer Engineering, The Hong Kong University of Science and Technology, Hong Kong Special Administrative Region
| | - Yiyu Shi
- Computer Science and Engineering, University of Notre Dame, IN, 46656, USA
| | - James Y Zou
- Department of Computer Science, Stanford University, Stanford, CA, 94305, USA; Department of Electrical Engineering, Stanford University, Stanford, CA, 94305, USA.
| | - Meiping Huang
- Guangdong Provincial Key Laboratory of South China Structural Heart Disease, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China; Guangdong Cardiovascular Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China; Department of Catheterization Lab, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China.
| | - Jian Zhuang
- Guangdong Provincial Key Laboratory of South China Structural Heart Disease, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China; Guangdong Cardiovascular Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China; Department of Cardiovascular Surgery, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China.
| |
Collapse
|
12
|
Schechter MS, Kanmaniraja D, Berkenblit RG, Ye K, Shamah S, Janmey V, Yee J, Ricci ZJ. Abdominopelvic CT in COVID-19 patients with abdominal complaints including both waves and controls: reader agreement and overcalls after consensus review. Clin Imaging 2023; 104:109988. [PMID: 37845167 DOI: 10.1016/j.clinimag.2023.109988] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Revised: 08/29/2023] [Accepted: 09/18/2023] [Indexed: 10/18/2023]
Abstract
BACKGROUND Since many COVID-19 publications lack consensus reviews or controls, interpretive accuracy is unclear; abdominal processes unique or infrequent during the pandemic remain unknown. The incidence and nature of CT findings accounting for abdominal complaints in COVID patients, reader agreement and overcalling will be determined. METHODS A retrospective study was performed on COVID patients with abdominal complaints from 3/15/2020-5/31/2020 and 11/1/2020-4/15/2021 including matched controls. Reviewers blinded to initial reads interpreted abdominopelvic CT exams, with discordant cases resolved in consensus. Reader agreement was measured by Cohen's Kappa, differences between cohorts by permutation tests and factors affecting false positive/negative rates by Fisher's Exact Test and logistic regression. RESULTS 116 first wave (average age 65 years [±15.3], 63 [54%] women) and 194 second wave COVID cases (average age 64 years [±16.3], 103 [53%] women) including 116 wave 1 and 194 wave 2 prepandemic controls were included. Concordance was lower among COVID cases than controls (Cohen's Kappa of 0.58 vs. 0.82 [p ≤ 0.001]) and among wave 1 than wave 2 cases (Cohen's Kappa of 0.45 vs. 0.66 [p = 0.052]). With true positives defined as consensus between the initial reader and study reader, false positive rates were higher among COVID cases than controls (OR = 0.42, p = 0.003) and for initial than study reader (OR = 0.36, p ≤ 0.001), but lower in wave 2 than 1 (OR = 0.5, p = 0.028). CONCLUSION Greater reader disagreement occurred during COVID than prepandemic with no reader bias as both initial and study readers called more false positives among COVID cases than controls. More overcalling occurred during COVID with colitis and cystitis most common.
Collapse
Affiliation(s)
- Max S Schechter
- Albert Einstein College of Medicine, 1300 Morris Park Ave, Bronx, NY 10461, United States of America.
| | - Devaraju Kanmaniraja
- Montefiore Medical Center and Hospital, 111 East 210th St, Bronx, NY 10467, United States of America.
| | - Robert G Berkenblit
- Montefiore Medical Center and Hospital, 111 East 210th St, Bronx, NY 10467, United States of America.
| | - Kenny Ye
- Albert Einstein College of Medicine, 1300 Morris Park Ave, Bronx, NY 10461, United States of America.
| | - Steven Shamah
- Montefiore Medical Center and Hospital, 111 East 210th St, Bronx, NY 10467, United States of America.
| | - Victor Janmey
- Montefiore Medical Center and Hospital, 111 East 210th St, Bronx, NY 10467, United States of America.
| | - Judy Yee
- Albert Einstein College of Medicine, 1300 Morris Park Ave, Bronx, NY 10461, United States of America; Montefiore Medical Center and Hospital, 111 East 210th St, Bronx, NY 10467, United States of America.
| | - Zina J Ricci
- Albert Einstein College of Medicine, 1300 Morris Park Ave, Bronx, NY 10461, United States of America; Montefiore Medical Center and Hospital, 111 East 210th St, Bronx, NY 10467, United States of America.
| |
Collapse
|
13
|
Huhtanen JT, Nyman M, Sequeiros RB, Koskinen SK, Pudas TK, Kajander S, Niemi P, Löyttyniemi E, Aronen HJ, Hirvonen J. Discrepancies between Radiology Specialists and Residents in Fracture Detection from Musculoskeletal Radiographs. Diagnostics (Basel) 2023; 13:3207. [PMID: 37892028 PMCID: PMC10605667 DOI: 10.3390/diagnostics13203207] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2023] [Revised: 10/03/2023] [Accepted: 10/11/2023] [Indexed: 10/29/2023] Open
Abstract
(1) Background: The aim of this study was to compare the competence in appendicular trauma radiograph image interpretation between radiology specialists and residents. (2) Methods: In this multicenter retrospective cohort study, we collected radiology reports from radiology specialists (N = 506) and residents (N = 500) during 2018-2021. As a reference standard, we used the consensus of two subspecialty-level musculoskeletal (MSK) radiologists, who reviewed all original reports. (3) Results: A total of 1006 radiograph reports were reviewed by the two subspecialty-level MSK radiologists. Out of the 1006 radiographs, 41% were abnormal. In total, 67 radiographic findings were missed (6.7%) and 31 findings were overcalled (3.1%) in the original reports. Sensitivity, specificity, positive predictive value, and negative predictive value were 0.86, 0.92, 0.91 and 0.88 respectively. There were no statistically significant differences between radiology specialists' and residents' competence in interpretation (p = 0.44). However, radiology specialists reported more subtle cases than residents did (p = 0.04). There were no statistically significant differences between errors made in the morning, evening, or night shifts (p = 0.57). (4) Conclusions: This study found a lack of major discrepancies between radiology specialists and residents in radiograph interpretation, although there were differences between MSK regions and in subtle or obvious radiographic findings. In addition, missed findings found in this study often affected patient treatment. Finally, there are MSK regions where the sensitivity or specificity is below 90%, and these should raise concerns and highlight the need for double reading and should be taken into consideration in radiology education.
Collapse
Affiliation(s)
- Jarno T. Huhtanen
- Faculty of Health and Well-Being, Turku University of Applied Sciences, 20520 Turku, Finland
- Department of Radiology, University of Turku, 20014 Turku, Finland; (S.K.); (P.N.)
| | - Mikko Nyman
- Department of Radiology, Turku University Hospital, University of Turku, 20014 Turku, Finland; (M.N.); (R.B.S.); (H.J.A.); (J.H.)
| | - Roberto Blanco Sequeiros
- Department of Radiology, Turku University Hospital, University of Turku, 20014 Turku, Finland; (M.N.); (R.B.S.); (H.J.A.); (J.H.)
| | - Seppo K. Koskinen
- Terveystalo Inc., Jaakonkatu 3, 00100 Helsinki, Finland; (S.K.K.); (T.K.P.)
| | - Tomi K. Pudas
- Terveystalo Inc., Jaakonkatu 3, 00100 Helsinki, Finland; (S.K.K.); (T.K.P.)
| | - Sami Kajander
- Department of Radiology, University of Turku, 20014 Turku, Finland; (S.K.); (P.N.)
| | - Pekka Niemi
- Department of Radiology, University of Turku, 20014 Turku, Finland; (S.K.); (P.N.)
| | | | - Hannu J. Aronen
- Department of Radiology, Turku University Hospital, University of Turku, 20014 Turku, Finland; (M.N.); (R.B.S.); (H.J.A.); (J.H.)
| | - Jussi Hirvonen
- Department of Radiology, Turku University Hospital, University of Turku, 20014 Turku, Finland; (M.N.); (R.B.S.); (H.J.A.); (J.H.)
- Department of Radiology, Faculty of Medicine and Health Technology, Tampere University Hospital, Tampere University, 33100 Tampere, Finland
| |
Collapse
|
14
|
Beaumont H, Iannessi A. Can we predict discordant RECIST 1.1 evaluations in double read clinical trials? Front Oncol 2023; 13:1239570. [PMID: 37869080 PMCID: PMC10585359 DOI: 10.3389/fonc.2023.1239570] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Accepted: 09/05/2023] [Indexed: 10/24/2023] Open
Abstract
Background In lung clinical trials with imaging, blinded independent central review with double reads is recommended to reduce evaluation bias and the Response Evaluation Criteria In Solid Tumor (RECIST) is still widely used. We retrospectively analyzed the inter-reader discrepancies rate over time, the risk factors for discrepancies related to baseline evaluations, and the potential of machine learning to predict inter-reader discrepancies. Materials and methods We retrospectively analyzed five BICR clinical trials for patients on immunotherapy or targeted therapy for lung cancer. Double reads of 1724 patients involving 17 radiologists were performed using RECIST 1.1. We evaluated the rate of discrepancies over time according to four endpoints: progressive disease declared (PDD), date of progressive disease (DOPD), best overall response (BOR), and date of the first response (DOFR). Risk factors associated with discrepancies were analyzed, two predictive models were evaluated. Results At the end of trials, the discrepancy rates between trials were not different. On average, the discrepancy rates were 21.0%, 41.0%, 28.8%, and 48.8% for PDD, DOPD, BOR, and DOFR, respectively. Over time, the discrepancy rate was higher for DOFR than DOPD, and the rates increased as the trial progressed, even after accrual was completed. It was rare for readers to not find any disease, for less than 7% of patients, at least one reader selected non-measurable disease only (NTL). Often the readers selected some of their target lesions (TLs) and NTLs in different organs, with ranges of 36.0-57.9% and 60.5-73.5% of patients, respectively. Rarely (4-8.1%) two readers selected all their TLs in different locations. Significant risk factors were different depending on the endpoint and the trial being considered. Prediction had a poor performance but the positive predictive value was higher than 80%. The best classification was obtained with BOR. Conclusion Predicting discordance rates necessitates having knowledge of patient accrual, patient survival, and the probability of discordances over time. In lung cancer trials, although risk factors for inter-reader discrepancies are known, they are weakly significant, the ability to predict discrepancies from baseline data is limited. To boost prediction accuracy, it would be necessary to enhance baseline-derived features or create new ones, considering other risk factors and looking into optimal reader associations.
Collapse
|
15
|
Huang J, Neill L, Wittbrodt M, Melnick D, Klug M, Thompson M, Bailitz J, Loftus T, Malik S, Phull A, Weston V, Heller JA, Etemadi M. Generative Artificial Intelligence for Chest Radiograph Interpretation in the Emergency Department. JAMA Netw Open 2023; 6:e2336100. [PMID: 37796505 PMCID: PMC10556963 DOI: 10.1001/jamanetworkopen.2023.36100] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Accepted: 08/21/2023] [Indexed: 10/06/2023] Open
Abstract
Importance Multimodal generative artificial intelligence (AI) methodologies have the potential to optimize emergency department care by producing draft radiology reports from input images. Objective To evaluate the accuracy and quality of AI-generated chest radiograph interpretations in the emergency department setting. Design, Setting, and Participants This was a retrospective diagnostic study of 500 randomly sampled emergency department encounters at a tertiary care institution including chest radiographs interpreted by both a teleradiology service and on-site attending radiologist from January 2022 to January 2023. An AI interpretation was generated for each radiograph. The 3 radiograph interpretations were each rated in duplicate by 6 emergency department physicians using a 5-point Likert scale. Main Outcomes and Measures The primary outcome was any difference in Likert scores between radiologist, AI, and teleradiology reports, using a cumulative link mixed model. Secondary analyses compared the probability of each report type containing no clinically significant discrepancy with further stratification by finding presence, using a logistic mixed-effects model. Physician comments on discrepancies were recorded. Results A total of 500 ED studies were included from 500 unique patients with a mean (SD) age of 53.3 (21.6) years; 282 patients (56.4%) were female. There was a significant association of report type with ratings, with post hoc tests revealing significantly greater scores for AI (mean [SE] score, 3.22 [0.34]; P < .001) and radiologist (mean [SE] score, 3.34 [0.34]; P < .001) reports compared with teleradiology (mean [SE] score, 2.74 [0.34]) reports. AI and radiologist reports were not significantly different. On secondary analysis, there was no difference in the probability of no clinically significant discrepancy between the 3 report types. Further stratification of reports by presence of cardiomegaly, pulmonary edema, pleural effusion, infiltrate, pneumothorax, and support devices also yielded no difference in the probability of containing no clinically significant discrepancy between the report types. Conclusions and Relevance In a representative sample of emergency department chest radiographs, results suggest that the generative AI model produced reports of similar clinical accuracy and textual quality to radiologist reports while providing higher textual quality than teleradiologist reports. Implementation of the model in the clinical workflow could enable timely alerts to life-threatening pathology while aiding imaging interpretation and documentation.
Collapse
Affiliation(s)
- Jonathan Huang
- Department of Emergency Medicine, Northwestern University Feinberg School of Medicine, Chicago, Illinois
- Department of Anesthesiology, Northwestern University Feinberg School of Medicine, Chicago, Illinois
- Department of Biomedical Engineering, McCormick School of Engineering, Northwestern University, Evanston, Illinois
| | - Luke Neill
- Department of Emergency Medicine, Northwestern University Feinberg School of Medicine, Chicago, Illinois
| | - Matthew Wittbrodt
- Research & Development, Northwestern Medicine Information Services, Chicago, Illinois
| | - David Melnick
- Research & Development, Northwestern Medicine Information Services, Chicago, Illinois
| | - Matthew Klug
- Research & Development, Northwestern Medicine Information Services, Chicago, Illinois
| | - Michael Thompson
- Research & Development, Northwestern Medicine Information Services, Chicago, Illinois
| | - John Bailitz
- Department of Emergency Medicine, Northwestern University Feinberg School of Medicine, Chicago, Illinois
| | - Timothy Loftus
- Department of Emergency Medicine, Northwestern University Feinberg School of Medicine, Chicago, Illinois
| | - Sanjeev Malik
- Department of Emergency Medicine, Northwestern University Feinberg School of Medicine, Chicago, Illinois
| | - Amit Phull
- Department of Emergency Medicine, Northwestern University Feinberg School of Medicine, Chicago, Illinois
| | - Victoria Weston
- Department of Emergency Medicine, Northwestern University Feinberg School of Medicine, Chicago, Illinois
| | - J. Alex Heller
- Research & Development, Northwestern Medicine Information Services, Chicago, Illinois
| | - Mozziyar Etemadi
- Research & Development, Northwestern Medicine Information Services, Chicago, Illinois
- Department of Anesthesiology, Northwestern University Feinberg School of Medicine, Chicago, Illinois
- Department of Biomedical Engineering, McCormick School of Engineering, Northwestern University, Evanston, Illinois
| |
Collapse
|
16
|
Lamoureux C, Hanna TN, Callaway E, Bruno MA, Weber S, Sprecher D, Johnson TD. Radiologist age and diagnostic errors. Emerg Radiol 2023; 30:577-587. [PMID: 37458917 DOI: 10.1007/s10140-023-02158-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2023] [Accepted: 07/10/2023] [Indexed: 09/27/2023]
Abstract
PURPOSE Previous investigations into the causes of error by radiologists have addressed work schedule, volume, shift length, and sub-specialization. Studies regarding possible associations between radiologist errors and radiologist age and timing of residency training are lacking in the literature, to our knowledge. The aim of our study was to determine if radiologist age and residency graduation date is associated with diagnostic errors. METHODS Our retrospective analysis included 1.9 million preliminary interpretations (out of a total of 5.2 million preliminary and final interpretations) of imaging examinations by 361 radiologists in a US-based national teleradiology practice between 1/1/2019 and 1/1/2020. Quality assurance data regarding the number of radiologist errors was generated through client facility feedback to the teleradiology practice. With input from both the client radiologist and the teleradiologist, the final determination of the presence, absence, and severity of a teleradiologist error was determined by the quality assurance committee of radiologists within the teleradiology company using standardized criteria. Excluded were 3.2 million final examination interpretations and 93,963 (1.8%) of total examinations from facilities reporting less than one discrepancy in examination interpretation in 2019. Logistic regression with covariates radiologist age and residency graduation date was performed for calculation of relative risk of overall error rates and by major imaging modality. Major errors were separated from minor errors as those with a greater likelihood of affecting patient care. Logistic regression with covariates radiologist age, residency graduation date, and log total examinations interpreted was used to calculate odds of making a major error to that of making a minor error. RESULTS Mean age of the 361 radiologists was 51.1 years, with a mean residency graduation date of 2001. Mean error rate for all examinations was 0.5%. Radiologist age at any residency graduation date was positively associated with major errors (p < 0.05), with a relative risk 1.021 for each 1-year increase in age and relative risk 1.235 for each decade as well as for minor errors (p < 0.05, relative risk 1.007 for each year, relative risk 1.082 for each decade). By major imaging modality, radiologist age at any residency graduation date was positively associated with computed tomography (CT) and X-ray (XR) major and minor error, magnetic resonance imaging (MRI) major error, and ultrasound (US) minor error (p < 0.05). Radiologist age was positively associated with odds of making a major vs. minor error (p < 0.05). CONCLUSIONS The mean error rate for all radiologists was low. We observed that increasing age at any residency graduation date was associated with increasing relative risk of major and minor errors as well as increasing odds of a major vs. minor error among providers. Further study is needed to corroborate these results, determine clinical relevance, and highlight strategies to address these findings.
Collapse
Affiliation(s)
| | - Tarek N Hanna
- Division of Emergency Radiology, Department of Radiology and Imaging Sciences, Emory University School of Medicine, 550 Peachtree Rd, Atlanta, GA, 30308, USA
| | - Edward Callaway
- Virtual Radiologic, 11995 Singletree Ln #500, Eden Prairie, MN, 55344, USA
| | - Michael A Bruno
- Penn State Milton S. Hershey Medical Center, 500 University Dr, Hershey, PA, 17033, USA
| | - Scott Weber
- Virtual Radiologic, 11995 Singletree Ln #500, Eden Prairie, MN, 55344, USA
| | - Devin Sprecher
- Virtual Radiologic, 11995 Singletree Ln #500, Eden Prairie, MN, 55344, USA
| | | |
Collapse
|
17
|
Kasalak Ö, Pennings JP, den Akker JWO, Yakar D, Kwee TC. Why don't we inform patients about the risk of diagnostic errors? Eur J Radiol 2023; 165:110956. [PMID: 37418799 DOI: 10.1016/j.ejrad.2023.110956] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Revised: 06/23/2023] [Accepted: 07/02/2023] [Indexed: 07/09/2023]
Abstract
The principles of autonomy and informed consent dictate that patients who undergo a radiological examination should actually be informed about the risk of diagnostic errors. Implementing such a policy could potentially increase the quality of care. However, due to the vast number of radiological examinations that are performed in each hospital each day, financial constraints, and the risk of losing trust, patients, and income if the requirement for informed consent is not imposed by law on a state or national level, it may be challenging to inform patients about the risk of diagnostic errors. Future research is necessary to determine if and how an informed consent procedure for diagnostic errors can be implemented in clinical practice.
Collapse
Affiliation(s)
- Ömer Kasalak
- Medical Imaging Center, Department of Radiology, University Medical Center Groningen, University of Groningen, The Netherlands.
| | - Jan P Pennings
- Medical Imaging Center, Department of Radiology, University Medical Center Groningen, University of Groningen, The Netherlands
| | - Jeroen W Op den Akker
- Medical Imaging Center, Department of Radiology, University Medical Center Groningen, University of Groningen, The Netherlands
| | - Derya Yakar
- Medical Imaging Center, Department of Radiology, University Medical Center Groningen, University of Groningen, The Netherlands
| | - Thomas C Kwee
- Medical Imaging Center, Department of Radiology, University Medical Center Groningen, University of Groningen, The Netherlands
| |
Collapse
|
18
|
Pierre K, Gupta M, Raviprasad A, Sadat Razavi SM, Patel A, Peters K, Hochhegger B, Mancuso A, Forghani R. Medical imaging and multimodal artificial intelligence models for streamlining and enhancing cancer care: opportunities and challenges. Expert Rev Anticancer Ther 2023; 23:1265-1279. [PMID: 38032181 DOI: 10.1080/14737140.2023.2286001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Accepted: 11/16/2023] [Indexed: 12/01/2023]
Abstract
INTRODUCTION Artificial intelligence (AI) has the potential to transform oncologic care. There have been significant developments in AI applications in medical imaging and increasing interest in multimodal models. These are likely to enable improved oncologic care through more precise diagnosis, increasingly in a more personalized and less invasive manner. In this review, we provide an overview of the current state and challenges that clinicians, administrative personnel and policy makers need to be aware of and mitigate for the technology to reach its full potential. AREAS COVERED The article provides a brief targeted overview of AI, a high-level review of the current state and future potential AI applications in diagnostic radiology and to a lesser extent digital pathology, focusing on oncologic applications. This is followed by a discussion of emerging approaches, including multimodal models. The article concludes with a discussion of technical, regulatory challenges and infrastructure needs for AI to realize its full potential. EXPERT OPINION There is a large volume of promising research, and steadily increasing commercially available tools using AI. For the most advanced and promising precision diagnostic applications of AI to be used clinically, robust and comprehensive quality monitoring systems and informatics platforms will likely be required.
Collapse
Affiliation(s)
- Kevin Pierre
- Radiomics and Augmented Intelligence Laboratory (RAIL), Department of Radiology and the Norman Fixel Institute for Neurological Diseases, University of Florida College of Medicine, Gainesville, FL, USA
- Department of Radiology, University of Florida College of Medicine, Gainesville, FL, USA
| | - Manas Gupta
- Radiomics and Augmented Intelligence Laboratory (RAIL), Department of Radiology and the Norman Fixel Institute for Neurological Diseases, University of Florida College of Medicine, Gainesville, FL, USA
| | - Abheek Raviprasad
- Radiomics and Augmented Intelligence Laboratory (RAIL), Department of Radiology and the Norman Fixel Institute for Neurological Diseases, University of Florida College of Medicine, Gainesville, FL, USA
- University of Florida College of Medicine, Gainesville, FL, USA
| | - Seyedeh Mehrsa Sadat Razavi
- Radiomics and Augmented Intelligence Laboratory (RAIL), Department of Radiology and the Norman Fixel Institute for Neurological Diseases, University of Florida College of Medicine, Gainesville, FL, USA
- University of Florida College of Medicine, Gainesville, FL, USA
| | - Anjali Patel
- Radiomics and Augmented Intelligence Laboratory (RAIL), Department of Radiology and the Norman Fixel Institute for Neurological Diseases, University of Florida College of Medicine, Gainesville, FL, USA
- University of Florida College of Medicine, Gainesville, FL, USA
| | - Keith Peters
- Radiomics and Augmented Intelligence Laboratory (RAIL), Department of Radiology and the Norman Fixel Institute for Neurological Diseases, University of Florida College of Medicine, Gainesville, FL, USA
- Department of Radiology, University of Florida College of Medicine, Gainesville, FL, USA
| | - Bruno Hochhegger
- Radiomics and Augmented Intelligence Laboratory (RAIL), Department of Radiology and the Norman Fixel Institute for Neurological Diseases, University of Florida College of Medicine, Gainesville, FL, USA
- Department of Radiology, University of Florida College of Medicine, Gainesville, FL, USA
| | - Anthony Mancuso
- Radiomics and Augmented Intelligence Laboratory (RAIL), Department of Radiology and the Norman Fixel Institute for Neurological Diseases, University of Florida College of Medicine, Gainesville, FL, USA
- Department of Radiology, University of Florida College of Medicine, Gainesville, FL, USA
| | - Reza Forghani
- Radiomics and Augmented Intelligence Laboratory (RAIL), Department of Radiology and the Norman Fixel Institute for Neurological Diseases, University of Florida College of Medicine, Gainesville, FL, USA
- Department of Radiology, University of Florida College of Medicine, Gainesville, FL, USA
- Division of Medical Physics, University of Florida College of Medicine, Gainesville, FL, USA
- Department of Neurology, Division of Movement Disorders, University of Florida College of Medicine, Gainesville, FL, USA
| |
Collapse
|
19
|
Mračko A, Vanovčanová L, Cimrák I. Mammography Datasets for Neural Networks-Survey. J Imaging 2023; 9:jimaging9050095. [PMID: 37233314 DOI: 10.3390/jimaging9050095] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2023] [Revised: 05/02/2023] [Accepted: 05/05/2023] [Indexed: 05/27/2023] Open
Abstract
Deep neural networks have gained popularity in the field of mammography. Data play an integral role in training these models, as training algorithms requires a large amount of data to capture the general relationship between the model's input and output. Open-access databases are the most accessible source of mammography data for training neural networks. Our work focuses on conducting a comprehensive survey of mammography databases that contain images with defined abnormal areas of interest. The survey includes databases such as INbreast, the Curated Breast Imaging Subset of Digital Database for Screening Mammography (CBIS-DDSM), the OPTIMAM Medical Image Database (OMI-DB), and The Mammographic Image Analysis Society Digital Mammogram Database (MIAS). Additionally, we surveyed recent studies that have utilized these databases in conjunction with neural networks and the results they have achieved. From these databases, it is possible to obtain at least 3801 unique images with 4125 described findings from approximately 1842 patients. The number of patients with important findings can be increased to approximately 14,474, depending on the type of agreement with the OPTIMAM team. Furthermore, we provide a description of the annotation process for mammography images to enhance the understanding of the information gained from these datasets.
Collapse
Affiliation(s)
- Adam Mračko
- Faculty of Management Science and Informatics, University of Žilina, 010 26 Žilina, Slovakia
- Research Centre, University of Žilina, 010 26 Žilina, Slovakia
| | - Lucia Vanovčanová
- 2nd Radiology Department, Faculty of Medicine, Comenius University in Bratislava, 813 72 Bratislava, Slovakia
- St. Elizabeth Cancer Institute, 812 50 Bratislava, Slovakia
| | - Ivan Cimrák
- Faculty of Management Science and Informatics, University of Žilina, 010 26 Žilina, Slovakia
- Research Centre, University of Žilina, 010 26 Žilina, Slovakia
| |
Collapse
|
20
|
Walther F, Eberlein-Gonska M, Hoffmann RT, Schmitt J, Blum SFU. Measuring appropriateness of diagnostic imaging: a scoping review. Insights Imaging 2023; 14:62. [PMID: 37052758 PMCID: PMC10102275 DOI: 10.1186/s13244-023-01409-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2022] [Accepted: 03/18/2023] [Indexed: 04/14/2023] Open
Abstract
In radiology, the justification of diagnostic imaging is a key performance indicator. To date, specific recommendations on the measurement of appropriateness in diagnostic imaging are missing. To map the study literature concerning the definition, measures, methods and data used for analyses of appropriateness in research of diagnostic imaging. We conducted a scoping review in Medline, EMBASE, Scopus and the Cochrane Central Register of Controlled Trials. Two independent reviewers undertook screening and data extraction. After screening 6021 records, we included 50 studies. National guidelines (n = 22/50) or American College of Radiology Appropriateness Criteria (n = 23/50) were used to define and rate appropriateness. 22/50 studies did not provide methodological details about the appropriateness assessment. The included studies varied concerning modality, amount of reviewed examinations (88-13,941) and body regions. Computed tomography (27 studies, 27,168 examinations) was the most frequently analyzed modality, followed by magnetic resonance imaging (17 studies, 6559 examinations) and radiography (10 studies, 7095 examinations). Heterogeneous appropriateness rates throughout single studies (0-100%), modalities, and body regions (17-95%) were found. Research on pediatric and outpatient imaging was sparse. Multicentric, methodologically robust and indication-oriented studies would strengthen appropriateness research in diagnostic imaging and help to develop reliable key performance indicators.
Collapse
Affiliation(s)
- Felix Walther
- Center for Evidence-Based Healthcare, University Hospital Carl Gustav Carus and Faculty of Medicine Carl Gustav Carus, Technische Universität Dresden, Fetscherstr. 74, 01307, Dresden, Germany.
- Quality and Medical Risk Management, University Hospital Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany.
| | - Maria Eberlein-Gonska
- Quality and Medical Risk Management, University Hospital Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
| | - Ralf-Thorsten Hoffmann
- Institute and Polyclinic for Diagnostic and Interventional Radiology, University Hospital Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
| | - Jochen Schmitt
- Center for Evidence-Based Healthcare, University Hospital Carl Gustav Carus and Faculty of Medicine Carl Gustav Carus, Technische Universität Dresden, Fetscherstr. 74, 01307, Dresden, Germany
| | - Sophia F U Blum
- Quality and Medical Risk Management, University Hospital Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
- Institute and Polyclinic for Diagnostic and Interventional Radiology, University Hospital Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
| |
Collapse
|
21
|
Iannessi A, Beaumont H. Breaking down the RECIST 1.1 double read variability in lung trials: What do baseline assessments tell us? Front Oncol 2023; 13:988784. [PMID: 37007064 PMCID: PMC10060958 DOI: 10.3389/fonc.2023.988784] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2022] [Accepted: 02/03/2023] [Indexed: 03/18/2023] Open
Abstract
BackgroundIn clinical trials with imaging, Blinded Independent Central Review (BICR) with double reads ensures data blinding and reduces bias in drug evaluations. As double reads can cause discrepancies, evaluations require close monitoring which substantially increases clinical trial costs. We sought to document the variability of double reads at baseline, and variabilities across individual readers and lung trials.Material and methodsWe retrospectively analyzed data from five BICR clinical trials evaluating 1720 lung cancer patients treated with immunotherapy or targeted therapy. Fifteen radiologists were involved. The variability was analyzed using a set of 71 features derived from tumor selection, measurements, and disease location. We selected a subset of readers that evaluated ≥50 patients in ≥two trials, to compare individual reader’s selections. Finally, we evaluated inter-trial homogeneity using a subset of patients for whom both readers assessed the exact same disease locations. Significance level was 0.05. Multiple pair-wise comparisons of continuous variables and proportions were performed using one-way ANOVA and Marascuilo procedure, respectively.ResultsAcross trials, on average per patient, target lesion (TL) number ranged 1.9 to 3.0, sum of tumor diameter (SOD) 57.1 to 91.9 mm. MeanSOD=83.7 mm. In four trials, MeanSOD of double reads was significantly different. Less than 10% of patients had TLs selected in completely different organs and 43.5% had at least one selected in different organs. Discrepancies in disease locations happened mainly in lymph nodes (20.1%) and bones (12.2%). Discrepancies in measurable disease happened mainly in lung (19.6%). Between individual readers, the MeanSOD and disease selection were significantly different (p<0.001). In inter-trials comparisons, on average per patient, the number of selected TLs ranged 2.1 to 2.8, MeanSOD 61.0 to 92.4 mm. Trials were significantly different in MeanSOD (p<0.0001) and average number of selected TLs (p=0.007). The proportion of patients having one of the top diseases was significantly different only between two trials for lung. Significant differences were observed for all other disease locations (p<0.05).ConclusionsWe found significant double read variabilities at baseline, evidence of reading patterns and a means to compare trials. Clinical trial reliability is influenced by the interplay of readers, patients and trial design.
Collapse
|
22
|
Utility of dual read in the setting of prostate MRI interpretation. Abdom Radiol (NY) 2023; 48:1395-1400. [PMID: 36881131 DOI: 10.1007/s00261-023-03853-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Revised: 02/11/2023] [Accepted: 02/13/2023] [Indexed: 03/08/2023]
Abstract
PURPOSE The purpose of this study is to assess the utility of dual reader interpretation of prostate MRI in the evaluation/detection of prostate cancer, using the PI-RADS v2.1 scoring system. METHODS We performed a retrospective study to assess the utility of dual reader interpretation for prostate MRI. All MRI cases compiled for analysis were accompanied with prostate biopsy pathology reports that included Gleason scores to correlate to the MRI PI-RADS v2.1 score, tissue findings and location of pathology within the prostate gland. To assess for dual reader utility, two fellowship trained abdominal imagers (each with > 5 years of experience) provided independent and concurrent PI-RADS v2.1 scores on all included MRI examinations, which were then compared to the biopsy proven Gleason scores. RESULTS After application of inclusion criteria, 131 cases were used for analysis. The mean age of the cohort was 63.6 years. Sensitivity, specificity and positive/negative predictive values were calculated for each reader and concurrent scores. Reader 1 demonstrated 71.43% sensitivity, 85.39% specificity, 69.77% PPV and 86.36% NPV. Reader 2 demonstrated 83.33% sensitivity, 78.65% specificity, 64.81% PPV and 90.91% NPV. Concurrent reads demonstrated 78.57% sensitivity, 80.9% specificity, 66% PPV and 88.89% NPV. There was no statistically significant difference between the individual readers or concurrent reads (p = 0.79). CONCLUSION Our results highlight that dual reader interpretation in prostate MRI is not needed to detect clinically relevant tumor and that radiologists with experience and training in prostate MRI interpretation establish acceptable sensitivity and specificity marks on PI-RADS v2.1 assessment.
Collapse
|
23
|
DiPiro PJ, Licaros A, Zhao AH, Glazer DI, Healey MJ, Curley PJ, Giess CS, Khorasani R. Frequency and Clinical Utility of Alerts for Intra-Institutional Radiologist Discrepant Opinions. J Am Coll Radiol 2023; 20:431-437. [PMID: 36841320 DOI: 10.1016/j.jacr.2022.12.021] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Revised: 12/15/2022] [Accepted: 12/17/2022] [Indexed: 02/26/2023]
Abstract
OBJECTIVE Determine the rate of documented notification, via an alert, for intra-institutional discrepant radiologist opinions and addended reports and resulting clinical management changes. METHODS This institutional review board-exempt, retrospective study was performed at a large academic medical center. We defined an intra-institutional discrepant opinion as when a consultant radiologist provides a different interpretation from that formally rendered by a colleague at our institution. We implemented a discrepant opinion policy requiring closed-loop notification of the consulting radiologist's second opinion to the original radiologist, who must acknowledge this alert within 30 days. This study included all discrepant opinion alerts created December 1, 2019, to December 31, 2021, of which two radiologists and an internal medicine physician performed consensus review. Primary outcomes were degree of discrepancy and percent of discrepant opinions leading to change in clinical management. Secondary outcome was report addendum rate compared with an existing peer learning program using Fisher's exact test. RESULTS Of 114 discrepant opinion alerts among 1,888,147 reports generated during the study period (0.006%), 58 alerts were categorized as major (50.9%), 41 as moderate (36.0%), and 15 as minor discrepancies (13.1%). Clinical management change occurred in 64 of 114 cases (56.1%). Report addendum rate for discrepant opinion alerts was 4-fold higher than for peer learning alerts at our institution (66 of 315 = 21% versus 432 of 8,273 =5.2%; P < .0001). DISCUSSION Although discrepant intra-institutional radiologist second opinions were rare, they frequently led to changes in clinical management. Capturing these discrepancies by encouraging alert use may help optimize patient care and document what was communicated to the referring or consulting care team by consulting radiologists.
Collapse
Affiliation(s)
- Pamela J DiPiro
- Radiology Quality and Safety Officer, Department of Radiology, Center for Evidence-Based Imaging, Brigham and Women's Hospital, Boston, Massachusetts; Department of Radiology, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts; and Department of Imaging, Dana-Farber Cancer Institute, Harvard Medical School, Boston, Massachusetts.
| | - Andro Licaros
- Department of Radiology, Center for Evidence-Based Imaging, Brigham and Women's Hospital, Boston, Massachusetts; and Oncologic Imaging Fellow, Department of Imaging, Dana-Farber Cancer Institute, Harvard Medical School, Boston, Massachusetts
| | - Anna H Zhao
- Department of Radiology, Center for Evidence-Based Imaging, Brigham and Women's Hospital, Boston, Massachusetts; and Radiology Resident, Department of Radiology, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts
| | - Daniel I Glazer
- Department of Radiology, Center for Evidence-Based Imaging, Brigham and Women's Hospital, Boston, Massachusetts; Medical Director of CT and Director, Cross-Sectional Interventional Radiology (CSIR), Department of Radiology, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts; and Department of Imaging, Dana-Farber Cancer Institute, Harvard Medical School, Boston, Massachusetts
| | - Michael J Healey
- Department of Radiology, Center for Evidence-Based Imaging, Brigham and Women's Hospital, Boston, Massachusetts; and Associate Chief Medical Officer, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts
| | - Patrick J Curley
- Executive Director, Quality, Safety, Equity & Experience, Enterprise Radiology, Department of Radiology, Center for Evidence-Based Imaging, Brigham and Women's Hospital, Boston, Massachusetts
| | - Catherine S Giess
- Department of Radiology, Center for Evidence-Based Imaging, Brigham and Women's Hospital, Boston, Massachusetts; Deputy Chair, Department of Radiology, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts; and Department of Imaging, Dana-Farber Cancer Institute, Harvard Medical School, Boston, Massachusetts
| | - Ramin Khorasani
- Vice Chair, Radiology Quality and Safety, Mass General Brigham; Vice Chair, Department of Radiology; Distinguished Chair, Medical Informatics; Director, Center for Evidence Based Imaging; Department of Radiology, Center for Evidence-Based Imaging, Brigham and Women's Hospital, Boston, Massachusetts; Department of Radiology, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts; and Department of Imaging, Dana-Farber Cancer Institute, Harvard Medical School, Boston, Massachusetts
| |
Collapse
|
24
|
Gaube S, Suresh H, Raue M, Lermer E, Koch TK, Hudecek MFC, Ackery AD, Grover SC, Coughlin JF, Frey D, Kitamura FC, Ghassemi M, Colak E. Non-task expert physicians benefit from correct explainable AI advice when reviewing X-rays. Sci Rep 2023; 13:1383. [PMID: 36697450 PMCID: PMC9876883 DOI: 10.1038/s41598-023-28633-w] [Citation(s) in RCA: 17] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2022] [Accepted: 01/20/2023] [Indexed: 01/26/2023] Open
Abstract
Artificial intelligence (AI)-generated clinical advice is becoming more prevalent in healthcare. However, the impact of AI-generated advice on physicians' decision-making is underexplored. In this study, physicians received X-rays with correct diagnostic advice and were asked to make a diagnosis, rate the advice's quality, and judge their own confidence. We manipulated whether the advice came with or without a visual annotation on the X-rays, and whether it was labeled as coming from an AI or a human radiologist. Overall, receiving annotated advice from an AI resulted in the highest diagnostic accuracy. Physicians rated the quality of AI advice higher than human advice. We did not find a strong effect of either manipulation on participants' confidence. The magnitude of the effects varied between task experts and non-task experts, with the latter benefiting considerably from correct explainable AI advice. These findings raise important considerations for the deployment of diagnostic advice in healthcare.
Collapse
Affiliation(s)
- Susanne Gaube
- LMU Center for Leadership and People Management, Department of Psychology, LMU Munich, Munich, Germany. .,Department of Infection Prevention and Infectious Diseases, University Hospital Regensburg, Regensburg, Germany.
| | - Harini Suresh
- MIT Computer Science & Artificial Intelligence Lab, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Martina Raue
- MIT AgeLab, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Eva Lermer
- LMU Center for Leadership and People Management, Department of Psychology, LMU Munich, Munich, Germany.,Department of Business Psychology, University of Applied Sciences Augsburg, Augsburg, Germany
| | - Timo K Koch
- LMU Center for Leadership and People Management, Department of Psychology, LMU Munich, Munich, Germany.,Department of Psychology, LMU Munich, Munich, Germany
| | - Matthias F C Hudecek
- Department of Experimental Psychology, University of Regensburg, Regensburg, Germany
| | - Alun D Ackery
- Department of Emergency Medicine, St. Michael's Hospital, Unity Health Toronto, Toronto, Canada.,Division of Emergency Medicine, University of Toronto, Toronto, Canada
| | - Samir C Grover
- Li Ka Shing Knowledge Institute, St. Michael's Hospital, Unity Health Toronto, Toronto, Canada.,Division of Gastroenterology, St. Michael's Hospital, Toronto, Canada
| | - Joseph F Coughlin
- MIT AgeLab, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Dieter Frey
- LMU Center for Leadership and People Management, Department of Psychology, LMU Munich, Munich, Germany
| | - Felipe C Kitamura
- Departamento de Diagnóstico por Imagem, Universidade Federal de São Paulo, São Paulo, Brazil.,DasaInova, Dasa, São Paulo, Brazil
| | - Marzyeh Ghassemi
- Electrical Engineering and Computer Science, Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, MA, USA.,Vector Institute, Toronto, Canada
| | - Errol Colak
- Li Ka Shing Knowledge Institute, St. Michael's Hospital, Unity Health Toronto, Toronto, Canada.,Department of Medical Imaging, St. Michael's Hospital, Unity Health Toronto, Toronto, Canada.,Department of Medical Imaging, Faculty of Medicine, University of Toronto, Toronto, Canada
| |
Collapse
|
25
|
Robinson JD, Kessler R, Vrablik ME, Vrablik MC, Hippe DS, Hall MK, Mitchell SH, Linnau KF. Transfer Patient Imaging: Assessment of the Impact of Discrepancies Identified by Emergency Radiologists. J Am Coll Radiol 2022; 19:1244-1252. [PMID: 35973650 PMCID: PMC10695447 DOI: 10.1016/j.jacr.2022.05.031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2022] [Revised: 05/12/2022] [Accepted: 05/18/2022] [Indexed: 11/19/2022]
Abstract
PURPOSE Advanced imaging examinations of emergently transferred patients (ETPs) are overread to various degrees by receiving institutions. The practical clinical impact of these second opinions has not been studied in the past. The purpose of this study is to determine if emergency radiology overreads change emergency medicine decision making on ETPs in the emergency department (ED). METHODS All CT and MRI examinations on patients transferred to a level I trauma center during calendar year 2018 were routinely overread by emergency radiologists and discrepancies with the outside report electronically flagged. All discrepant reports compared with the outside interpretations were reviewed by one of four emergency medicine physicians. Comparing the original and final reports, reviewers identified changes in patient management that could be attributed to the additional information contained in the final report. Changes in patient care were categorized as affecting ED management, disposition, follow-up, or consulting services. RESULTS Over a 12-month period, 5,834 patients were accepted in transfer. Among 5,631 CT or MRI examinations with outside reports available, 669 examinations (12%) had at least one discrepancy in the corresponding outside report. In 219 examinations (33%), ED management was changed by discrepancies noted on the final report; patient disposition was affected in 84 (13%), outpatient follow-up in 54 (8%), and selection of consulting services in 411 (61%), and ED stay was extended in 544 (81%). Discrepant findings affected decision making in 613 of 669 of examinations (92%). CONCLUSION Emergency radiology overreading of transferred patients' advanced imaging examinations provided actionable additional information to emergency medicine physicians in the care of 613 of 669 (92%) examinations with discrepant findings. This added value is worth the effort to design workflows to routinely overread CT and MRI examinations of ETPs.
Collapse
Affiliation(s)
| | - Ross Kessler
- Department of Emergency Medicine, University of Washington, Seattle, Washington
| | - Michael E Vrablik
- Department of Emergency Medicine, University of Washington, Seattle, Washington
| | - Marie C Vrablik
- Department of Emergency Medicine, University of Washington, Seattle, Washington
| | - Daniel S Hippe
- Clinical Biostatistics, Fred Hutchinson Cancer Research Center, Seattle, Washington
| | - M Kennedy Hall
- Department of Emergency Medicine, University of Washington, Seattle, Washington
| | - Steven H Mitchell
- Department of Emergency Medicine, University of Washington, Seattle, Washington; Medical Director, Emergency Services, Harborview Medical Center, Seattle, Washington
| | - Ken F Linnau
- Department of Radiology, University of Washington, Seattle, Washington; Assistant Chief of Service, Department of Radiology, Harborview Medical Center, Seattle, Washington
| |
Collapse
|
26
|
Freeman A. Paediatric gonad shielding in pelvic radiography: A systematic review and meta-analysis. Radiography (Lond) 2022; 28:964-972. [PMID: 35849887 DOI: 10.1016/j.radi.2022.06.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2021] [Revised: 06/05/2022] [Accepted: 06/17/2022] [Indexed: 10/31/2022]
Abstract
INTRODUCTION The British Institute of Radiology (BIR) and American Association of Physicists in Medicine (AAPM) have recommended that gonad shielding is no longer used during pelvic X-ray examinations. The BIR guidance states that shielding may still be considered for use on males, but should not be used on females. This paper aimed to evaluate if this decision was supported by evidence from practice, by comparing the accuracy of gonad shield placement in paediatric males and females. METHODS A systematic review of databases including EMBASE, MEDLINE and PubMed was performed in February 2021. Studies were considered eligible if they provided data on the use of gonad shielding during pelvic X-ray examinations on male and female patients under the age of 18. Nine studies met the inclusion criteria and data extraction was performed. Quality appraisal was undertaken, and a meta-analysis of shielding accuracy was performed on seven studies. RESULTS The results from the meta-analysis (2187 total radiographs) demonstrated that female patients were significantly more likely (OR 1.38, 95% CI 0.88-1.87) than males to have gonad shields placed inaccurately (p value < 0.001). CONCLUSION Gonad shield placement on paediatric female patients is significantly less accurate than on males, and so the results support the AAPM and BIR guidance to stop the practice for females. Shield application may also be frequently inaccurate for males, but the review does not provide clear evidence for or against continuing the practice for males. IMPLICATIONS FOR PRACTICE Discontinuing the use of gonad shields in paediatric pelvic radiography on female patients is supported. Any continued use on male patients, or for reasons such as psychological reassurance, should be subject to enhanced training and audit to ensure benefits outweigh any risks.
Collapse
Affiliation(s)
- A Freeman
- University of Leeds School of Medicine, Worsley Building, Woodhouse, Leeds, LS2 9JT, UK
| |
Collapse
|
27
|
Grogan A, Loveday B, Michael M, Wong H, Gibbs P, Thomson B, Lee B, Ko HS. Real-world staging computed tomography scanning technique and important reporting discrepancies in pancreatic ductal adenocarcinoma. ANZ J Surg 2022; 92:1789-1796. [PMID: 35614381 PMCID: PMC9545551 DOI: 10.1111/ans.17787] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2022] [Revised: 04/14/2022] [Accepted: 04/22/2022] [Indexed: 12/24/2022]
Abstract
BACKGROUND Computed tomography (CT) is the first-line staging imaging modality for pancreatic ductal adenocarcinoma (PDAC) which determines resectability and treatment pathways. METHODS Between January 2016 and December 2019, prospectively collated data from two Australian cancer centres was extracted from the PURPLE Pancreatic Cancer registry. Real-world staging CTs and corresponding reports were blindly reviewed by a sub-specialist radiologist and compared to initial reports. RESULTS Of 131 patients assessed, 117 (89.3%) presented with symptoms, 74 (56.5%) CTs included slices ≤3 mm thickness and CT pancreas protocol was applied in 69 (52.7%) patients. Initial reports lacked synoptic reporting in 131 (100%), tumour identification in 2 (1.6%) and tumour measurement in 13 (9.9%) cases. Tumour-vascular relationship reporting was missing in 69-109 (52.7-83.2%) for regarding the key arterial and venous structures that is required to assess resectability. Initial reports had no comment on venous thrombus or venous collaterals in 80 (61.1%) and 109 (83.2%) and lacked locoregional lymphadenopathy interpretation in 13 (9.9%) cases. Complete initial staging report was present in 72 (55.0%) patients. Sub-specialist radiological review resulted in down-staging in 16 (22.2%) and up-staging in 1 (1.4%) patient. Staging discrepancies were mainly regarding metastatic disease (12, 70.6%) and tumour-vascular relationship (5, 29.4%). CONCLUSION Real-world staging imaging in PDAC patients show low proportion of dedicated CT pancreas protocol, high proportion of incomplete staging reports and no synoptic reporting. The most common discrepancy between initial and sub-specialist reporting was regarding metastases and tumour-vascular relationship assessment resulting in sub-specialist down-staging in almost every fifth case.
Collapse
Affiliation(s)
- Alexander Grogan
- Personalised Oncology DivisionThe Walter and Eliza Hall Institute of Medical ResearchMelbourneVictoriaAustralia
- Faculty of Medicine, Dentistry and Health SciencesThe University of MelbourneMelbourneVictoriaAustralia
- Department of Cancer ImagingThe Peter MacCallum Cancer CentreMelbourneVictoriaAustralia
| | - Benjamin Loveday
- Department of SurgeryMelbourne HealthMelbourneVictoriaAustralia
- Department of Surgical OncologyThe Peter MacCallum Cancer CentreMelbourneVictoriaAustralia
- Department of SurgeryUniversity of AucklandAucklandNew Zealand
| | - Michael Michael
- Department of Medical OncologyThe Peter MacCallum Cancer CentreMelbourneVictoriaAustralia
- The Sir Peter MacCallum Department of OncologyThe University of MelbourneMelbourneVictoriaAustralia
| | - Hui‐Li Wong
- Personalised Oncology DivisionThe Walter and Eliza Hall Institute of Medical ResearchMelbourneVictoriaAustralia
- Department of Medical OncologyThe Peter MacCallum Cancer CentreMelbourneVictoriaAustralia
- The Sir Peter MacCallum Department of OncologyThe University of MelbourneMelbourneVictoriaAustralia
- Department of Medical OncologyWestern HealthMelbourneVictoriaAustralia
| | - Peter Gibbs
- Personalised Oncology DivisionThe Walter and Eliza Hall Institute of Medical ResearchMelbourneVictoriaAustralia
- Faculty of Medicine, Dentistry and Health SciencesThe University of MelbourneMelbourneVictoriaAustralia
- The Sir Peter MacCallum Department of OncologyThe University of MelbourneMelbourneVictoriaAustralia
| | - Benjamin Thomson
- Department of SurgeryMelbourne HealthMelbourneVictoriaAustralia
- Department of Surgical OncologyThe Peter MacCallum Cancer CentreMelbourneVictoriaAustralia
| | - Belinda Lee
- Personalised Oncology DivisionThe Walter and Eliza Hall Institute of Medical ResearchMelbourneVictoriaAustralia
- Faculty of Medicine, Dentistry and Health SciencesThe University of MelbourneMelbourneVictoriaAustralia
- Department of Medical OncologyThe Peter MacCallum Cancer CentreMelbourneVictoriaAustralia
- Department of Medical OncologyWestern HealthMelbourneVictoriaAustralia
- Department of Medical OncologyNorthern HealthMelbourneVictoriaAustralia
| | - Hyun Soo Ko
- Personalised Oncology DivisionThe Walter and Eliza Hall Institute of Medical ResearchMelbourneVictoriaAustralia
- Department of Cancer ImagingThe Peter MacCallum Cancer CentreMelbourneVictoriaAustralia
- The Sir Peter MacCallum Department of OncologyThe University of MelbourneMelbourneVictoriaAustralia
| |
Collapse
|
28
|
Alexander R, Waite S, Bruno MA, Krupinski EA, Berlin L, Macknik S, Martinez-Conde S. Mandating Limits on Workload, Duty, and Speed in Radiology. Radiology 2022; 304:274-282. [PMID: 35699581 PMCID: PMC9340237 DOI: 10.1148/radiol.212631] [Citation(s) in RCA: 27] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
Research has not yet quantified the effects of workload or duty hours on the accuracy of radiologists. With the exception of a brief reduction in imaging studies during the 2020 peak of the COVID-19 pandemic, the workload of radiologists in the United States has seen relentless growth in recent years. One concern is that this increased demand could lead to reduced accuracy. Behavioral studies in species ranging from insects to humans have shown that decision speed is inversely correlated to decision accuracy. A potential solution is to institute workload and duty limits to optimize radiologist performance and patient safety. The concern, however, is that any prescribed mandated limits would be arbitrary and thus no more advantageous than allowing radiologists to self-regulate. Specific studies have been proposed to determine whether limits reduce error, and if so, to provide a principled basis for such limits. This could determine the precise susceptibility of individual radiologists to medical error as a function of speed during image viewing, the maximum number of studies that could be read during a work shift, and the appropriate shift duration as a function of time of day. Before principled recommendations for restrictions are made, however, it is important to understand how radiologists function both optimally and at the margins of adequate performance. This study examines the relationship between interpretation speed and error rates in radiology, the potential influence of artificial intelligence on reading speed and error rates, and the possible outcomes of imposed limits on both caseload and duty hours. This review concludes that the scientific evidence needed to make meaningful rules is lacking and notes that regulating workloads without scientific principles can be more harmful than not regulating at all.
Collapse
Affiliation(s)
- Robert Alexander
- From the Departments of Ophthalmology (R.A., S.M., S.M.C.), Radiology (S.W.), Neurology (S.M., S.M.C.), and Physiology & Pharmacology (S.M., S.M.C.), SUNY Downstate Health Sciences University, 450 Clarkson Ave, Brooklyn, NY 11203; Department of Radiology, Penn State Milton S. Hershey Medical Center, Hershey, Pa (M.A.B.); Department of Radiology and Imaging Sciences, Emory University, Atlanta, Ga (E.A.K.); and Department of Radiology, Rush University Medical College and University of Illinois, Chicago, Ill (L.B.)
| | - Stephen Waite
- From the Departments of Ophthalmology (R.A., S.M., S.M.C.), Radiology (S.W.), Neurology (S.M., S.M.C.), and Physiology & Pharmacology (S.M., S.M.C.), SUNY Downstate Health Sciences University, 450 Clarkson Ave, Brooklyn, NY 11203; Department of Radiology, Penn State Milton S. Hershey Medical Center, Hershey, Pa (M.A.B.); Department of Radiology and Imaging Sciences, Emory University, Atlanta, Ga (E.A.K.); and Department of Radiology, Rush University Medical College and University of Illinois, Chicago, Ill (L.B.)
| | - Michael A Bruno
- From the Departments of Ophthalmology (R.A., S.M., S.M.C.), Radiology (S.W.), Neurology (S.M., S.M.C.), and Physiology & Pharmacology (S.M., S.M.C.), SUNY Downstate Health Sciences University, 450 Clarkson Ave, Brooklyn, NY 11203; Department of Radiology, Penn State Milton S. Hershey Medical Center, Hershey, Pa (M.A.B.); Department of Radiology and Imaging Sciences, Emory University, Atlanta, Ga (E.A.K.); and Department of Radiology, Rush University Medical College and University of Illinois, Chicago, Ill (L.B.)
| | - Elizabeth A Krupinski
- From the Departments of Ophthalmology (R.A., S.M., S.M.C.), Radiology (S.W.), Neurology (S.M., S.M.C.), and Physiology & Pharmacology (S.M., S.M.C.), SUNY Downstate Health Sciences University, 450 Clarkson Ave, Brooklyn, NY 11203; Department of Radiology, Penn State Milton S. Hershey Medical Center, Hershey, Pa (M.A.B.); Department of Radiology and Imaging Sciences, Emory University, Atlanta, Ga (E.A.K.); and Department of Radiology, Rush University Medical College and University of Illinois, Chicago, Ill (L.B.)
| | - Leonard Berlin
- From the Departments of Ophthalmology (R.A., S.M., S.M.C.), Radiology (S.W.), Neurology (S.M., S.M.C.), and Physiology & Pharmacology (S.M., S.M.C.), SUNY Downstate Health Sciences University, 450 Clarkson Ave, Brooklyn, NY 11203; Department of Radiology, Penn State Milton S. Hershey Medical Center, Hershey, Pa (M.A.B.); Department of Radiology and Imaging Sciences, Emory University, Atlanta, Ga (E.A.K.); and Department of Radiology, Rush University Medical College and University of Illinois, Chicago, Ill (L.B.)
| | - Stephen Macknik
- From the Departments of Ophthalmology (R.A., S.M., S.M.C.), Radiology (S.W.), Neurology (S.M., S.M.C.), and Physiology & Pharmacology (S.M., S.M.C.), SUNY Downstate Health Sciences University, 450 Clarkson Ave, Brooklyn, NY 11203; Department of Radiology, Penn State Milton S. Hershey Medical Center, Hershey, Pa (M.A.B.); Department of Radiology and Imaging Sciences, Emory University, Atlanta, Ga (E.A.K.); and Department of Radiology, Rush University Medical College and University of Illinois, Chicago, Ill (L.B.)
| | - Susana Martinez-Conde
- From the Departments of Ophthalmology (R.A., S.M., S.M.C.), Radiology (S.W.), Neurology (S.M., S.M.C.), and Physiology & Pharmacology (S.M., S.M.C.), SUNY Downstate Health Sciences University, 450 Clarkson Ave, Brooklyn, NY 11203; Department of Radiology, Penn State Milton S. Hershey Medical Center, Hershey, Pa (M.A.B.); Department of Radiology and Imaging Sciences, Emory University, Atlanta, Ga (E.A.K.); and Department of Radiology, Rush University Medical College and University of Illinois, Chicago, Ill (L.B.)
| |
Collapse
|
29
|
Lee JH, Hwang EJ, Kim H, Park CM. A narrative review of deep learning applications in lung cancer research: from screening to prognostication. Transl Lung Cancer Res 2022; 11:1217-1229. [PMID: 35832457 PMCID: PMC9271435 DOI: 10.21037/tlcr-21-1012] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2021] [Accepted: 05/16/2022] [Indexed: 01/17/2023]
Abstract
Background and Objective Deep learning (DL) algorithms have been developed for various tasks, including lung nodule detection on chest radiographs or lung cancer computed tomography screening, potential candidate selection in lung cancer screening, malignancy prediction for indeterminate pulmonary nodules, lung cancer staging, treatment response prediction, prognostication, and prediction of genetic mutations in lung cancer. Furthermore, these DL algorithms have been applied in various clinical settings in order for them to be generalized in real-world clinical practice. Multiple DL algorithms have been corroborated to be on par with experts or current clinical prediction models for several specific tasks. However, no article has yet comprehensively reviewed DL algorithms dedicated to lung cancer research. This narrative review presents an overview of the literature dealing with DL techniques applied in lung cancer research and briefly summarizes the results according to the DL algorithms’ clinical use cases. Methods we performed a narrative review by searching the Embase and OVID-MEDLINE databases for articles published in English from October, 2016 until September, 2021 and reviewing the bibliographies of key references to identify important literature related to DL in lung cancer research. The background, development, results, and clinical implications of each DL algorithm are briefly discussed. Lastly, we end this review article by highlighting future directions in lung cancer research using DL techniques. Key Content and Findings DL algorithms have been introduced to show comparable or higher performance than human experts in various clinical settings. Specifically, they have been actively applied to detect lung nodules in chest radiographs or computed tomography (CT) examinations, optimize candidate selection for lung cancer screening (LCS), predict the malignancy of lung nodules, stage lung cancer, and predict treatment response, patients’ prognoses, and genetic mutations in lung cancers. Conclusions DL algorithms have corroborated their potential value for various tasks, ranging from lung cancer screening to prognostication of lung cancer patients. Future research is warranted for the clinical application of these algorithms in daily clinical practice and verification of their real-world clinical usefulness.
Collapse
Affiliation(s)
- Jong Hyuk Lee
- Department of Radiology, Seoul National University Hospital, Seoul, Korea.,Department of Radiology, Seoul National University College of Medicine, Seoul, Korea.,Institute of Radiation Medicine, Seoul National University Medical Research Center, Seoul, Korea
| | - Eui Jin Hwang
- Department of Radiology, Seoul National University Hospital, Seoul, Korea.,Department of Radiology, Seoul National University College of Medicine, Seoul, Korea.,Institute of Radiation Medicine, Seoul National University Medical Research Center, Seoul, Korea
| | - Hyungjin Kim
- Department of Radiology, Seoul National University Hospital, Seoul, Korea.,Department of Radiology, Seoul National University College of Medicine, Seoul, Korea.,Institute of Radiation Medicine, Seoul National University Medical Research Center, Seoul, Korea
| | - Chang Min Park
- Department of Radiology, Seoul National University Hospital, Seoul, Korea.,Department of Radiology, Seoul National University College of Medicine, Seoul, Korea.,Institute of Radiation Medicine, Seoul National University Medical Research Center, Seoul, Korea.,Institute of Medical and Biological Engineering, Medical Research Center, Seoul National University, Seoul, Korea
| |
Collapse
|
30
|
Abadia AF, Yacoub B, Stringer N, Snoddy M, Kocher M, Schoepf UJ, Aquino GJ, Kabakus I, Dargis D, Hoelzer P, Sperl JI, Sahbaee P, Vingiani V, Mercer M, Burt JR. Diagnostic Accuracy and Performance of Artificial Intelligence in Detecting Lung Nodules in Patients With Complex Lung Disease: A Noninferiority Study. J Thorac Imaging 2022; 37:154-161. [PMID: 34387227 DOI: 10.1097/rti.0000000000000613] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
OBJECTIVES The aim of the study is to investigate the performance of artificial intelligence (AI) convolutional neural networks (CNN) in detecting lung nodules on chest computed tomography of patients with complex lung disease, and demonstrate its noninferiority when compared against an experienced radiologist through clinically relevant assessments. METHODS A CNN prototype was used to retrospectively evaluate 103 complex lung disease cases and 40 control cases without reported nodules. Computed tomography scans were blindly evaluated by an expert thoracic radiologist; a month after initial analyses, 20 positive cases were re-evaluated with the assistance of AI. For clinically relevant applications: (1) AI was asked to classify each patient into nodules present or absent and (2) AI results were compared against standard radiology reports. Standard statistics were performed to determine detection performance. RESULTS AI was, on average, 27 seconds faster than the expert and detected 8.4% of nodules that would have been missed. AI had a sensitivity of 67.7%, similar to an accuracy reported for experienced radiologists. AI correctly classified each patient (nodules present/absent) with a sensitivity of 96.1%. When matched against radiology reports, AI performed with a sensitivity of 89.4%. Control group assessment demonstrated an overall specificity of 82.5%. When aided by AI, the expert decreased the average assessment time per case from 2:44 minutes to 35.7 seconds, while reporting an overall increase in confidence. CONCLUSION In a group of patients with complex lung disease, the sensitivity of AI is similar to an experienced radiologist and the tool helps detect previously missed nodules. AI also helps experts analyze for lung nodules faster and more confidently, a feature that is beneficial to patients and favorable to hospitals due to increased patient load and need for shorter turnaround times.
Collapse
Affiliation(s)
- Andres F Abadia
- Department of Radiology and Radiological Science, Division of Cardiovascular Imaging, Medical University of South Carolina, Charleston, SC
| | - Basel Yacoub
- Department of Radiology and Radiological Science, Division of Cardiovascular Imaging, Medical University of South Carolina, Charleston, SC
| | - Natalie Stringer
- Department of Radiology and Radiological Science, Division of Cardiovascular Imaging, Medical University of South Carolina, Charleston, SC
| | - Madalyn Snoddy
- Department of Radiology and Radiological Science, Division of Cardiovascular Imaging, Medical University of South Carolina, Charleston, SC
| | - Madison Kocher
- Department of Radiology and Radiological Science, Division of Cardiovascular Imaging, Medical University of South Carolina, Charleston, SC
| | - U Joseph Schoepf
- Department of Radiology and Radiological Science, Division of Cardiovascular Imaging, Medical University of South Carolina, Charleston, SC
| | - Gilberto J Aquino
- Department of Radiology and Radiological Science, Division of Cardiovascular Imaging, Medical University of South Carolina, Charleston, SC
| | - Ismail Kabakus
- Department of Radiology and Radiological Science, Division of Cardiovascular Imaging, Medical University of South Carolina, Charleston, SC
| | - Danielle Dargis
- Department of Radiology and Radiological Science, Division of Cardiovascular Imaging, Medical University of South Carolina, Charleston, SC
| | | | | | | | - Vincenzo Vingiani
- Department of Radiology and Radiological Science, Division of Cardiovascular Imaging, Medical University of South Carolina, Charleston, SC
- U.O.C. Radiologia, Ospedali Riuniti "Area Peninsola Sorrentina," P.O. Sorrento, Italy
| | - Megan Mercer
- Department of Radiology and Radiological Science, Division of Cardiovascular Imaging, Medical University of South Carolina, Charleston, SC
| | - Jeremy R Burt
- Department of Radiology and Radiological Science, Division of Cardiovascular Imaging, Medical University of South Carolina, Charleston, SC
| |
Collapse
|
31
|
Lim CH, Park SB, Kim HK, Choi YS, Kim J, Ahn YC, Ahn MJ, Choi JY. Clinical Value of Surveillance 18F-fluorodeoxyglucose PET/CT for Detecting Unsuspected Recurrence or Second Primary Cancer in Non-Small Cell Lung Cancer after Curative Therapy. Cancers (Basel) 2022; 14:cancers14030632. [PMID: 35158900 PMCID: PMC8833387 DOI: 10.3390/cancers14030632] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2021] [Revised: 01/25/2022] [Accepted: 01/25/2022] [Indexed: 12/19/2022] Open
Abstract
Simple Summary Non-small cell lung cancer (NSCLC) patients are at considerable risk of recurrence or second primary cancer (SPC) after curative therapy. The utility of 18F-fluorodeoxyglucose (FDG) positron emission tomography/computed tomography (PET/CT) surveillance to detect recurrent lesions in NSCLC patients without suspicion of recurrence has not been established. The aim of our retrospective study was to evaluate the diagnostic value of surveillance FDG PET/CT for detecting clinically unsuspected recurrence or SPC in patients with NSCLC after curative therapy. In a cohort of 2684 NSCLC patients after curative therapy, surveillance FDG PET/CT showed good diagnostic efficacy for detecting clinically unexpected recurrence or SPC. Furthermore, the diagnostic performance was improved in subgroups of patients with advanced stage prior to curative therapy, PET/CT scans performed within 3 years after curative-intent therapy, and curative surgery. Surveillance PET/CT can be more useful when performed soon after therapy in curative surgery recipients and those with an advanced disease stage considering its diagnostic efficacy and yield. Abstract We evaluated the diagnostic value of 18F-fluorodeoxyglucose (FDG) positron emission tomography (PET)/CT surveillance for detecting clinically unsuspected recurrence or second primary cancer (SPC) in patients with non-small cell lung cancer (NSCLC) after curative therapy. A total of 4478 surveillance FDG PET/CT scans from 2864 NSCLC patients without suspicion of recurrence after curative therapy were reviewed retrospectively. In 274 of 2864 (9.6%) patients, recurrent NSCLC or SPC was found by surveillance PET/CT during clinical follow-up. Surveillance PET/CT scans showed sensitivity of 98.9% (274/277), specificity of 98.1% (4122/4201), accuracy of 98.2% (4396/4478), positive predictive value (PPV) of 77.6% (274/353), and negative predictive value of 99.9% (4122/4125). The specificity and accuracy in the curative surgery group were significantly higher than those in the curative radiotherapy group. PPV was significantly improved in subgroups of patients with advanced stage prior to curative therapy, PET/CT scans performed within 3 years after curative-intent therapy, and curative surgery. FDG PET/CT surveillance showed good diagnostic efficacy for detecting clinically unexpected recurrence or SPC in NSCLC patients after curative therapy. It can be more useful when performed soon after therapy in curative surgery recipients and those with an advanced disease stage considering its diagnostic efficacy and yield.
Collapse
Affiliation(s)
- Chae Hong Lim
- Department of Nuclear Medicine, Soonchunhyang University Hospital Seoul, Soonchunhyang University College of Medicine, Seoul 04401, Korea; (C.H.L.); (S.B.P.)
| | - Soo Bin Park
- Department of Nuclear Medicine, Soonchunhyang University Hospital Seoul, Soonchunhyang University College of Medicine, Seoul 04401, Korea; (C.H.L.); (S.B.P.)
| | - Hong Kwan Kim
- Department of Thoracic and Cardiovascular Surgery, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul 06351, Korea; (H.K.K.); (Y.S.C.); (J.K.)
| | - Yong Soo Choi
- Department of Thoracic and Cardiovascular Surgery, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul 06351, Korea; (H.K.K.); (Y.S.C.); (J.K.)
| | - Jhingook Kim
- Department of Thoracic and Cardiovascular Surgery, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul 06351, Korea; (H.K.K.); (Y.S.C.); (J.K.)
| | - Yong Chan Ahn
- Department of Radiation Oncology, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul 06351, Korea;
| | - Myung-ju Ahn
- Division of Hematology-Oncology, Department of Medicine, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul 06351, Korea;
| | - Joon Young Choi
- Department of Nuclear Medicine, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul 06351, Korea
- Correspondence: ; Tel.: +82-2-3410-2648; Fax: +82-2-3410-2639
| |
Collapse
|
32
|
Agreement on grading of normal clivus using magnetic resonance imaging among radiologists. Eur J Radiol Open 2022; 9:100395. [PMID: 35059474 PMCID: PMC8760553 DOI: 10.1016/j.ejro.2022.100395] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2021] [Accepted: 01/06/2022] [Indexed: 11/21/2022] Open
Abstract
Purpose The present study was aimed to evaluate the agreement on grading normal clivus on MRI among radiologists. Methods A retrospective study included patients who underwent MRI brain during January 1, 2015 to October 31, 2019. Two hundred forty-four patients who had no marrow pathology on MRI were included and divided into 8 age groups by decades. Three radiologists independently reviewed the signal intensity of clivus in mid sagittal T1-weighted image. The signal intensity was classified into three grades (Grade I-III). Fleiss’ kappa coefficients (k) were calculated to assess interrater agreement. Results Of 244 patients, there were 123 (50.4%) males and 121 (49.6%) females. Age ranged from 1 to 79 years old. Clivus Grade II was more frequently reported (> 50%) by radiologists. The agreement (kappa) among all three radiologists on evaluation of clivus irrespective of the grading equals to 0.67 (95%CI: 0.60–0.74). In stratified analyses by the grade of clivus, the kappa values for Grade I to III and were 0.73, 0.62, and 0.69 respectively. Conclusion Interrater agreement of MRI evaluation of normal clivus among radiologists was good. The visual grading criteria to classify the clivus is sufficient to distinguish the marrow maturation. However, the consensus reading should be made whenever normal clivus Grade II is read.
Collapse
|
33
|
Elsholtz FHJ, Ro SR, Shnayien S, Dinkelborg P, Hamm B, Schaafs LA. Impact of double reading on NI-RADS diagnostic accuracy in reporting oral squamous cell carcinoma surveillance imaging - a single-center study. Dentomaxillofac Radiol 2022; 51:20210168. [PMID: 34233509 PMCID: PMC8693328 DOI: 10.1259/dmfr.20210168] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2021] [Revised: 05/17/2021] [Accepted: 05/31/2021] [Indexed: 01/03/2023] Open
Abstract
OBJECTIVES The Neck Imaging Reporting and Data System (NI-RADS) is an increasingly utilized risk stratification tool for imaging surveillance after treatment for head and neck cancer. This study aims to measure the impact of supervision by subspecialized radiologists on diagnostic accuracy of NI-RADS when initial reading is performed by residents. METHODS 150 CT and MRI datasets were initially read by two trained residents, and then supervised by two subspecialized radiologists. Recurrence rates by NI-RADS category were calculated, and receiver operating characteristic (ROC) curves were plotted. After dichotomization of the NI-RADS system (category 1 vs categories 2 + 3+4 and categories 1 + 2 vs 3 + 4), sensitivity, specificity, positive and negative predictive value were calculated. RESULTS 26% of the reports were modified by the supervising radiologists. Area under the curve of ROC plots values of the supervision session were higher than those of the initial reading session for both the primary site (0.89 vs 0.86) and the neck (0.94 vs 0.91), but the difference was not statistically significant. For dichotomized NI-RADS category assignments, differences between the initial reading and the supervision session were statistically significant regarding specificity and PPV for the primary site (1 + 2 vs 3 + 4 and 1 vs 2 + 3+4) or even for both sites combined (1 vs 2 + 3+4). CONCLUSION NI-RADS enables trained resident radiologists to report surveillance imaging in patients with treated oral squamous cell carcinoma with high discriminatory power. Additional supervision by a subspecialized head and neck radiologist particularly improves specificity of radiological reports.
Collapse
Affiliation(s)
- Fabian Henry Jürgen Elsholtz
- Department of Radiology, Charité – Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health, Campus Benjamin Franklin, Berlin, Germany
| | - Sa-Ra Ro
- Department of Radiology, Charité – Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health, Campus Benjamin Franklin, Berlin, Germany
| | - Seyd Shnayien
- Department of Radiology, Charité – Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health, Campus Benjamin Franklin, Berlin, Germany
| | - Patrick Dinkelborg
- Department of Oral and Maxillofacial Surgery, Charité – Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health, Campus Benjamin Franklin, Berlin, Germany
| | - Bernd Hamm
- Department of Radiology, Charité – Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health, Campus Benjamin Franklin, Berlin, Germany
| | - Lars-Arne Schaafs
- Department of Radiology, Charité – Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health, Campus Benjamin Franklin, Berlin, Germany
| |
Collapse
|
34
|
Kim Y, Park JY, Hwang EJ, Lee SM, Park CM. Applications of artificial intelligence in the thorax: a narrative review focusing on thoracic radiology. J Thorac Dis 2021; 13:6943-6962. [PMID: 35070379 PMCID: PMC8743417 DOI: 10.21037/jtd-21-1342] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2021] [Accepted: 12/14/2021] [Indexed: 12/12/2022]
Abstract
OBJECTIVE This review will focus on how AI-and, specifically, deep learning-can be applied to complement aspects of the current healthcare system. We describe how AI-based tools can augment existing clinical workflows by discussing the applications of AI to worklist prioritization and patient triage, the performance-boosting effects of AI as a second reader, and the use of AI to facilitate complex quantifications. We also introduce prominent examples of recent AI applications, such as tuberculosis screening in resource-constrained environments, the detection of lung cancer with screening CT, and the diagnosis of COVID-19. We also provide examples of prognostic predictions and new discoveries beyond existing clinical practices. BACKGROUND Artificial intelligence (AI) has shown promising performance for thoracic diseases, particularly in the field of thoracic radiology. However, it has not yet been established how AI-based image analysis systems can help physicians in clinical practice. METHODS This review included peer-reviewed research articles on AI in the thorax published in English between 2015 and 2021. CONCLUSIONS With advances in technology and appropriate preparation of physicians, AI could address various clinical problems that have not been solved due to a lack of clinical resources or technological limitations. KEYWORDS Artificial intelligence (AI); deep learning (DL); computer aided diagnosis (CAD); thoracic radiology; pulmonary medicine.
Collapse
Affiliation(s)
- Yisak Kim
- Interdisciplinary Program in Bioengineering, Graduate School, Seoul National University, Seoul, Korea
- Integrated Major in Innovative Medical Science, Seoul National University Graduate School, Seoul, Korea
| | - Ji Yoon Park
- Department of Radiology, Seoul National University College of Medicine, Seoul, Korea
| | - Eui Jin Hwang
- Department of Radiology, Seoul National University College of Medicine, Seoul, Korea
| | - Sang Min Lee
- Departments of Radiology and Research Institute of Radiology, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Korea
| | - Chang Min Park
- Interdisciplinary Program in Bioengineering, Graduate School, Seoul National University, Seoul, Korea
- Integrated Major in Innovative Medical Science, Seoul National University Graduate School, Seoul, Korea
- Department of Radiology, Seoul National University College of Medicine, Seoul, Korea
- Institute of Medical and Biological Engineering, Medical Research Center, Seoul National University, Seoul, Korea
| |
Collapse
|
35
|
Winder M, Owczarek AJ, Chudek J, Pilch-Kowalczyk J, Baron J. Are We Overdoing It? Changes in Diagnostic Imaging Workload during the Years 2010-2020 including the Impact of the SARS-CoV-2 Pandemic. Healthcare (Basel) 2021; 9:healthcare9111557. [PMID: 34828603 PMCID: PMC8621920 DOI: 10.3390/healthcare9111557] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2021] [Revised: 11/09/2021] [Accepted: 11/10/2021] [Indexed: 12/30/2022] Open
Abstract
Since the 1990s, there has been a significant increase in the number of imaging examinations as well as a related increase in the healthcare expenditure and the exposure of the population to X-rays. This study aimed to analyze the workload trends in radiology during the last decade, including the impact of COVID-19 in a single university hospital in Poland and to identify possible solutions to the challenges that radiology could face in the future. We compared the annual amount of computed tomography (CT), radiography (X-ray), and ultrasound (US) examinations performed between the years 2010 and 2020 and analyzed the changes in the number of practicing radiologists in Poland. The mean number of patients treated in our hospital was 60,727 per year. During the last decade, the number of CT and US examinations nearly doubled (from 87.4 to 155.7 and from 52.1 to 86.5 per 1000 patients in 2010 and 2020 respectively), while X-ray examinations decreased from 115.1 to 96.9 per 1000 patients. The SARS-CoV-2 pandemic did not change the workload trends as more chest examinations were performed. AI, which contributed to the COVID-19 diagnosis, could aid radiologists in the future with the growing workload by increasing the efficiency of radiology departments as well as by potentially minimizing the related costs.
Collapse
Affiliation(s)
- Mateusz Winder
- Department of Radiology and Nuclear Medicine, Medical University of Silesia, 40-055 Katowice, Poland; (J.P.-K.); (J.B.)
- Correspondence: ; Tel.: +48-32-789-47-51
| | - Aleksander Jerzy Owczarek
- Health Promotion and Obesity Management Unit, Department of Pathophysiology, Medical University of Silesia, 40-055 Katowice, Poland;
| | - Jerzy Chudek
- Department of Internal Medicine and Oncological Chemotherapy, Medical University of Silesia, 40-055 Katowice, Poland;
| | - Joanna Pilch-Kowalczyk
- Department of Radiology and Nuclear Medicine, Medical University of Silesia, 40-055 Katowice, Poland; (J.P.-K.); (J.B.)
| | - Jan Baron
- Department of Radiology and Nuclear Medicine, Medical University of Silesia, 40-055 Katowice, Poland; (J.P.-K.); (J.B.)
| |
Collapse
|
36
|
Wang X, Shen T, Yang S, Lan J, Xu Y, Wang M, Zhang J, Han X. A deep learning algorithm for automatic detection and classification of acute intracranial hemorrhages in head CT scans. Neuroimage Clin 2021; 32:102785. [PMID: 34411910 PMCID: PMC8377493 DOI: 10.1016/j.nicl.2021.102785] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2021] [Revised: 07/01/2021] [Accepted: 08/06/2021] [Indexed: 02/06/2023]
Abstract
Acute Intracranial hemorrhage (ICH) is a life-threatening disease that requires emergency medical attention, which is routinely diagnosed using non-contrast head CT imaging. The diagnostic accuracy of acute ICH on CT varies greatly among radiologists due to the difficulty of interpreting subtle findings and the time pressure associated with the ever-increasing workload. The use of artificial intelligence technology may help automate the process and assist radiologists for more prompt and better decision-making. In this work, we design a deep learning approach that mimics the interpretation process of radiologists, and combines a 2D CNN model and two sequence models to achieve accurate acute ICH detection and subtype classification. Being developed using the extensive 2019-RSNA Brain CT Hemorrhage Challenge dataset with over 25000 CT scans, our deep learning algorithm can accurately classify the acute ICH and its five subtypes with AUCs of 0.988 (ICH), 0.984 (EDH), 0.992 (IPH), 0.996 (IVH), 0.985 (SAH), and 0.983 (SDH), respectively, reaching the accuracy level of expert radiologists. Our method won 1st place among 1345 teams from 75 countries in the RSNA challenge. We have further evaluated our algorithm on two independent external validation datasets with 75 and 491 CT scans, respectively, and our method maintained high AUCs of 0.964 and 0.949 for acute ICH detection. These results have demonstrated the high performance and robust generalization ability of our proposed method, which makes it a useful second-read or triage tool that can facilitate routine clinical applications.
Collapse
Affiliation(s)
- Xiyue Wang
- College of Computer Science, Sichuan University, Chengdu 610065, China
| | - Tao Shen
- Tencent AI Lab, Shenzhen 518057, China
| | - Sen Yang
- Tencent AI Lab, Shenzhen 518057, China
| | - Jun Lan
- Winning Health Technology Group Co., Ltd, Shanghai, China
| | - Yanming Xu
- Department of Neurology, West China Hospital, Sichuan University, Chengdu, China
| | - Minghui Wang
- College of Computer Science, Sichuan University, Chengdu 610065, China
| | - Jing Zhang
- College of Biomedical Engineering, Sichuan University, Chengdu, China.
| | - Xiao Han
- Tencent AI Lab, Shenzhen 518057, China.
| |
Collapse
|
37
|
Doyle SP, Duszak R, Heilbrun ME, Saindane AM, Sadigh G. Secondary Interpretations of Diagnostic Imaging Examinations: Patient Liabilities and Out-of-Pocket Costs. J Am Coll Radiol 2021; 18:1547-1555. [PMID: 34293329 DOI: 10.1016/j.jacr.2021.07.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2021] [Revised: 07/01/2021] [Accepted: 07/02/2021] [Indexed: 11/16/2022]
Abstract
BACKGROUND Secondary interpretations of diagnostic imaging examinations are increasingly performed to improve care for complex patients. We sought to determine associated patient-billed liabilities and out-of-pocket payments and to identify patient and imaging study characteristics that correlate with higher patient bills and out-of-pocket payments. METHODS Data extracted for 7,740 secondary imaging interpretations performed across our large metropolitan health system over 25 months included total professional charges, insurance payments, patient-billed liabilities, and patient out-of-pocket payments. Multivariable linear regression analyses were performed to identify patient and imaging factors associated with higher patient bills and out-of-pocket payments. RESULTS Mean secondary interpretation professional charges, insurance payments, patient-billed liabilities, and patient out-of-pocket payments were $306.50, $108.02, $27.80, and $14.55, respectively. Patients received bills for 47.5% of services and made out-of-pocket payments for 17.1%. Patient-billed liabilities and out-of-pocket payments were higher for patients who were younger and uninsured and for secondary interpretations requested for patients seen in outpatient (versus inpatient) settings. Patient-billed liabilities and out-of-pocket payments were lower for patients who were Black (versus White) and had government-sponsored (versus commercial) insurance and for secondary interpretations performed during the second, third, or fourth (versus first) quarter of each calendar year. CONCLUSION Observed differences between patient-billed liabilities and out-of-pocket payments suggest that secondary interpretations of diagnostic imaging examinations can result in small but real patient financial burdens. Improved price transparency and enhanced patient communication about the value of secondary interpretations could reduce potential surprises when patients receive these bills.
Collapse
Affiliation(s)
- Sean P Doyle
- Department of Radiology and Imaging Sciences, Emory University School of Medicine, Atlanta, Georgia
| | - Richard Duszak
- Vice Chair for Health Policy and Practice, Department of Radiology and Imaging Sciences, Emory University School of Medicine, Atlanta, Georgia
| | - Marta E Heilbrun
- Vice Chair for Quality, Department of Radiology and Imaging Sciences, Emory University School of Medicine, Atlanta, Georgia
| | - Amit M Saindane
- Vice Chair for Clinical Affairs, Department of Radiology and Imaging Sciences, Emory University School of Medicine, Atlanta, Georgia
| | - Gelareh Sadigh
- Department of Radiology and Imaging Sciences, Emory University School of Medicine, Atlanta, Georgia.
| |
Collapse
|
38
|
Morozov SP, Gombolevskiy VA, Elizarov AB, Gusev MA, Novik VP, Prokudaylo SB, Bardin AS, Popov EV, Ledikhova NV, Chernina VY, Blokhin IA, Nikolaev AE, Reshetnikov RV, Vladzymyrskyy AV, Kulberg NS. A simplified cluster model and a tool adapted for collaborative labeling of lung cancer CT scans. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2021; 206:106111. [PMID: 33957377 DOI: 10.1016/j.cmpb.2021.106111] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/01/2020] [Accepted: 04/07/2021] [Indexed: 06/12/2023]
Abstract
BACKGROUND AND OBJECTIVE Lung cancer is the most common type of cancer with a high mortality rate. Early detection using medical imaging is critically important for the long-term survival of the patients. Computer-aided diagnosis (CAD) tools can potentially reduce the number of incorrect interpretations of medical image data by radiologists. Datasets with adequate sample size, annotation, and truth are the dominant factors in developing and training effective CAD algorithms. The objective of this study was to produce a practical approach and a tool for the creation of medical image datasets. METHODS The proposed model uses the modified maximum transverse diameter approach to mark a putative lung nodule. The modification involves the possibility to use a set of overlapping spheres of appropriate size to approximate the shape of the nodule. The algorithm embedded in the model also groups the marks made by different readers for the same lesion. We used the data of 536 randomly selected patients of Moscow outpatient clinics to create a dataset of standard-dose chest computed tomography (CT) scans utilizing the double-reading approach with arbitration. Six volunteer radiologists independently produced a report for each scan using the proposed model with the main focus on the detection of lesions with sizes ranging from 3 to 30 mm. After this, an arbitrator reviewed their marks and annotations. RESULTS The maximum transverse diameter approach outperformed the alternative methods (3D box, ellipsoid, and complete outline construction) in a study of 10,000 computer-generated tumor models of different shapes in terms of accuracy and speed of nodule shape approximation. The markup and annotation of the CTLungCa-500 dataset revealed 72 studies containing no lung nodules. The remaining 464 CT scans contained 3151 lesions marked by at least one radiologist: 56%, 14%, and 29% of the lesions were malignant, benign, and non-nodular, respectively. 2887 lesions have the target size of 3-30 mm. Only 70 nodules were uniformly identified by all the six readers. An increase in the number of independent readers providing CT scans interpretations led to an accuracy increase associated with a decrease in agreement. The dataset markup process took three working weeks. CONCLUSIONS The developed cluster model simplifies the collaborative and crowdsourced creation of image repositories and makes it time-efficient. Our proof-of-concept dataset provides a valuable source of annotated medical imaging data for training CAD algorithms aimed at early detection of lung nodules. The tool and the dataset are publicly available at https://github.com/Center-of-Diagnostics-and-Telemedicine/FAnTom.git and https://mosmed.ai/en/datasets/ct_lungcancer_500/, respectively.
Collapse
Affiliation(s)
- S P Morozov
- Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department, Petrovka str., 24, Moscow, 127051, Russia
| | - V A Gombolevskiy
- Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department, Petrovka str., 24, Moscow, 127051, Russia
| | - A B Elizarov
- Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department, Petrovka str., 24, Moscow, 127051, Russia
| | - M A Gusev
- Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department, Petrovka str., 24, Moscow, 127051, Russia; Federal State Budgetary Educational Institution of Higher Education "Moscow Polytechnic University", Tverskaya str., 11, Moscow, 125993, Russia
| | - V P Novik
- Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department, Petrovka str., 24, Moscow, 127051, Russia
| | - S B Prokudaylo
- Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department, Petrovka str., 24, Moscow, 127051, Russia
| | - A S Bardin
- Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department, Petrovka str., 24, Moscow, 127051, Russia
| | - E V Popov
- Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department, Petrovka str., 24, Moscow, 127051, Russia
| | - N V Ledikhova
- Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department, Petrovka str., 24, Moscow, 127051, Russia
| | - V Y Chernina
- Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department, Petrovka str., 24, Moscow, 127051, Russia
| | - I A Blokhin
- Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department, Petrovka str., 24, Moscow, 127051, Russia
| | - A E Nikolaev
- Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department, Petrovka str., 24, Moscow, 127051, Russia
| | - R V Reshetnikov
- Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department, Petrovka str., 24, Moscow, 127051, Russia; Institute of Molecular Medicine, Sechenov First Moscow State Medical University, Trubetskaya str. 8-2, Moscow, 119991, Russia
| | - A V Vladzymyrskyy
- Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department, Petrovka str., 24, Moscow, 127051, Russia
| | - N S Kulberg
- Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department, Petrovka str., 24, Moscow, 127051, Russia; Federal Research Center "Computer Science and Control" of Russian Academy of Sciences, Vavilova str., 44/2, Moscow, 119333, Russia.
| |
Collapse
|
39
|
Ludwig DR, Petraglia FW, Shetty AS, Yano M. Limited added value of Doppler ultrasound of the liver after recent contrast-enhanced computed tomography. Abdom Radiol (NY) 2021; 46:2567-2574. [PMID: 33479832 DOI: 10.1007/s00261-021-02950-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2020] [Revised: 12/26/2020] [Accepted: 01/02/2021] [Indexed: 11/30/2022]
Abstract
PURPOSE The aim of this study is to assess the added diagnostic value of Doppler ultrasound of the liver (DUL) performed within 3 days of contrast-enhanced CT (CECT) for the diagnosis of portal vein (PV) or hepatic vein (HV) thrombosis. METHODS Adult patients were included if they underwent DUL within three days after a CECT of the abdomen in the emergency or inpatient setting. Retrospective review of clinical data and imaging reports was performed. In patients with discrepant or positive findings on CECT and/or DUL with respect to PV or HV thrombosis, image review was performed by three fellowship-trained abdominal radiologists in consensus. RESULTS The final cohort consisted of 468 patients. Of these, 26 (5.6%) patients had equivocal findings for thrombosis on CECT, and DUL could make a confident diagnosis of positive or negative in 18 (69%) patients. Additionally, there were 2 (0.4%) patients with PV or HV thrombosis on DUL following a limited CECT, and 2 (0.4%) patients who developed interval PV thrombosis between CECT and DUL. CONCLUSION DUL after CECT added diagnostic value for PV and/or HV thrombosis in less than 5% of patients. The patency of PV and HV is often not explicitly mentioned in CECT reports at our institution, which may lead to uncertainty for the referring provider as to whether the PV and HV were adequately evaluated. Few CECT have false positive or missed or underreported findings, and a careful review of the original CECT should be performed if DUL is requested.
Collapse
Affiliation(s)
- Daniel R Ludwig
- Mallinckrodt Institute of Radiology, Washington University School of Medicine, 510 S. Kingshighway Blvd, Campus Box 8131, Saint Louis, MO, 63110, USA.
| | - Frank W Petraglia
- Mallinckrodt Institute of Radiology, Washington University School of Medicine, 510 S. Kingshighway Blvd, Campus Box 8131, Saint Louis, MO, 63110, USA
| | - Anup S Shetty
- Mallinckrodt Institute of Radiology, Washington University School of Medicine, 510 S. Kingshighway Blvd, Campus Box 8131, Saint Louis, MO, 63110, USA
| | - Motoyo Yano
- Department of Radiology, Mayo Clinic, Scottsdale, AZ, USA
| |
Collapse
|
40
|
Faraj A, Andrews M, Li W. Inter and intra-observer errors for postoperative total hip radiographic assessment using computer aided design. Acta Orthop Belg 2021. [DOI: 10.52628/87.1.09] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Plain radiographic assessment of primary total hip arthroplasty following surgery remains to be the commonest radiological assessment. The current paper, studies the accuracy and concordance between observers reviewing these radiographs.
A prospective radiographic and medical note review of ten patients who underwent total hip replacement for primary osteoarthritis, with a mean age of 69 years. Early and 6 weeks postoperative x-rays were assessed for hip profile and version profile using computer aided design (CAD) by two observers on two different occasions. The observers were Orthopaedic surgeons who perform arthroplasty of the hip. The results were analyzed statistically.
Dimensions, including Femoral offset, medial offset and ilioischial offset showed a high degree of inter- film and intra-film correlation, with inter-class correlation (ICC) over 0.8. Except of the intra-film correlation of ilioischial offset measured on the post- operative films (p=0.067) by the first rater, all the intra and inter film correlation were significantly over the benchmark of 0.6. In terms of stem alignment, cup inclination and cup version, the intra-film correlation by rater n°2 ranges from 0.574 to 0.975 and were significantly over the benchmark of 0.6, except in the case of cup inclination measured on the 6 th? week follow-up ; meanwhile the intra-film correlation by rater n°1 ranges from 0.581 to 0.819 and none were significantly over the benchmark of 0.6.
The inter-rater reliability and inter-film correlation showed a dichotomy of results among different dimensions of the measurement. Dimensions of femo- ral offset, medial offset and ilioischial offset showed a substantial degree of reliability in terms of inter-rater reliability, inter-film correlation, and intra-rater/film reliability.
Collapse
|
41
|
Abstract
The objective of this paper is to review common challenges when evaluating fractures in the setting of possible child abuse and approaches to navigate them. This paper reviews the variety of imaging modalities available for evaluating child abuse and the advantages/disadvantages of each. Additionally, the authors discuss management of equivocal fractures, including the impact of double-reading skeletal surveys. The complexity of dating the acuity of fractures in young children is discussed. Utilizing the knowledge of fracture type, fracture patterns and patient history, as well as in the setting of cardiopulmonary resuscitation, the authors provide methods for determining the likelihood of abuse.
Collapse
|
42
|
Cardobi N, Dal Palù A, Pedrini F, Beleù A, Nocini R, De Robertis R, Ruzzenente A, Salvia R, Montemezzi S, D’Onofrio M. An Overview of Artificial Intelligence Applications in Liver and Pancreatic Imaging. Cancers (Basel) 2021; 13:2162. [PMID: 33946223 PMCID: PMC8124771 DOI: 10.3390/cancers13092162] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2021] [Revised: 04/27/2021] [Accepted: 04/28/2021] [Indexed: 12/11/2022] Open
Abstract
Artificial intelligence (AI) is one of the most promising fields of research in medical imaging so far. By means of specific algorithms, it can be used to help radiologists in their routine workflow. There are several papers that describe AI approaches to solve different problems in liver and pancreatic imaging. These problems may be summarized in four different categories: segmentation, quantification, characterization and image quality improvement. Segmentation is usually the first step of successive elaborations. If done manually, it is a time-consuming process. Therefore, the semi-automatic and automatic creation of a liver or a pancreatic mask may save time for other evaluations, such as quantification of various parameters, from organs volume to their textural features. The alterations of normal liver and pancreas structure may give a clue to the presence of a diffuse or focal pathology. AI can be trained to recognize these alterations and propose a diagnosis, which may then be confirmed or not by radiologists. Finally, AI may be applied in medical image reconstruction in order to increase image quality, decrease dose administration (referring to computed tomography) and reduce scan times. In this article, we report the state of the art of AI applications in these four main categories.
Collapse
Affiliation(s)
- Nicolò Cardobi
- Radiology Unit, Department of Pathology and Diagnostics, University Hospital of Verona, Piazzale Aristide Stefani, 1, 37126 Verona, Italy; (R.D.R.); (S.M.)
| | - Alessandro Dal Palù
- Department of Mathematical, Physical and Computer Sciences, University of Parma, 43121 Parma, Italy;
| | - Federica Pedrini
- Department of Radiology, G.B. Rossi University Hospital, University of Verona, 37129 Verona, Italy; (F.P.); (A.B.); (M.D.)
| | - Alessandro Beleù
- Department of Radiology, G.B. Rossi University Hospital, University of Verona, 37129 Verona, Italy; (F.P.); (A.B.); (M.D.)
| | - Riccardo Nocini
- Otolaryngology-Head and Neck Surgery Department, University Hospital of Verona, Piazzale Aristide Stefani, 1, 37126 Verona, Italy;
| | - Riccardo De Robertis
- Radiology Unit, Department of Pathology and Diagnostics, University Hospital of Verona, Piazzale Aristide Stefani, 1, 37126 Verona, Italy; (R.D.R.); (S.M.)
| | - Andrea Ruzzenente
- Department of Surgery, General and Hepatobiliary Surgery, University Hospital G.B. Rossi, University and Hospital Trust of Verona, 37126 Verona, Italy;
| | - Roberto Salvia
- Unit of General and Pancreatic Surgery, Department of Surgery and Oncology, University of Verona Hospital Trust, 37126 Verona, Italy;
| | - Stefania Montemezzi
- Radiology Unit, Department of Pathology and Diagnostics, University Hospital of Verona, Piazzale Aristide Stefani, 1, 37126 Verona, Italy; (R.D.R.); (S.M.)
| | - Mirko D’Onofrio
- Department of Radiology, G.B. Rossi University Hospital, University of Verona, 37129 Verona, Italy; (F.P.); (A.B.); (M.D.)
| |
Collapse
|
43
|
Evans J, Sapsford M, McDonald S, Poole K, Raine T, Jadon DR. Prevalence of axial spondyloarthritis in patients with inflammatory bowel disease using cross-sectional imaging: a systematic literature review. Ther Adv Musculoskelet Dis 2021; 13:1759720X21996973. [PMID: 33786068 PMCID: PMC7958176 DOI: 10.1177/1759720x21996973] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2021] [Accepted: 02/01/2021] [Indexed: 01/06/2023] Open
Abstract
BACKGROUND Patients with inflammatory bowel disease (IBD) have an excess burden of axial spondyloarthritis (axSpA), which, if left untreated, may significantly impact on clinical outcomes. We aimed to estimate the prevalence of axSpA, including previously undiagnosed cases, in IBD patients from studies involving cross-sectional imaging and identify the IBD features potentially associated with axSpA. METHODS PubMed, Embase and Cochrane databases were searched systematically between 1990 and 2018. Article reference lists and key conference abstract lists from 2012 to 2018 were also reviewed. All abstracts were reviewed by two authors to determine eligibility for inclusion. The study inclusion criteria were (a) adults aged 18 years or above, (b) a clinical diagnosis of IBD and (c) reporting identification of sacroiliitis using cross-sectional imaging. RESULTS A total of 20 observational studies were identified: 12 used CT, 6 used MR and 2 utilised both computed tomography (CT) and magnetic resonance (MR) imaging. Sample sizes ranged from 25 to 1247 (a total of 4096 patients); 31 studies were considered to have low selection bias, 13 included two or more radiology readers, and 3 included rheumatological assessments. The prevalence of sacroiliitis, the most commonly reported axSpA feature, ranged from 2.2% to 68.0% with a pooled prevalence of 21.0% [95% confidence interval (CI) 17-26%]. Associated IBD features include increasing IBD duration, increasing age, male sex, IBD location, inflammatory back pain and peripheral arthritis. No significant difference in the prevalence of sacroiliitis between Crohn's disease and ulcerative colitis was identified. Study limitations include variability in the individual study sample sizes and patient demographics. CONCLUSION This review highlights the need for larger, well-designed studies using more sensitive imaging modalities and multivariable modelling to better estimate the prevalence of axSpA in IBD. An improved knowledge of the IBD phenotype(s) associated with axSpA and use of cross-sectional imaging intended for IBD assessment to screen for axSpA may help clinicians identify those patients most at risk.
Collapse
Affiliation(s)
- Jobie Evans
- Department of Rheumatology, Cambridge University
Hospitals NHSFT, Hills Road, Cambridge, CB2 0QQ, UK
- Department of Medicine, University of Cambridge,
Cambridge, UK
| | - Mark Sapsford
- North Shore Hospital, Waitemata District Health
Board, Auckland, New Zealand
| | - Scott McDonald
- Department of Radiology, Cambridge University
Hospitals NHSFT, Cambridge, UK
| | - Kenneth Poole
- Department of Rheumatology, Cambridge University
Hospitals NHSFT, Cambridge, UK
- Department of Medicine, University of Cambridge,
Cambridge, UK
| | - Tim Raine
- Department of Gastroenterology, Cambridge
University Hospitals NHSFT, Cambridge, UK
| | - Deepak R. Jadon
- Department of Rheumatology, Cambridge University
Hospitals NHSFT, Cambridge, UK
- Department of Medicine, University of Cambridge,
Cambridge, UK
| |
Collapse
|
44
|
Lago MA, Jonnalagadda A, Abbey CK, Barufaldi BB, Bakic PR, Maidment ADA, Leung WK, Weinstein SP, Englander BS, Eckstein MP. Under-exploration of Three-Dimensional Images Leads to Search Errors for Small Salient Targets. Curr Biol 2021; 31:1099-1106.e5. [PMID: 33472051 PMCID: PMC8048135 DOI: 10.1016/j.cub.2020.12.029] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2020] [Revised: 10/09/2020] [Accepted: 12/18/2020] [Indexed: 10/22/2022]
Abstract
Advances in 3D imaging technology are transforming how radiologists search for cancer1,2 and how security officers scrutinize baggage for dangerous objects.3 These new 3D technologies often improve search over 2D images4,5 but vastly increase the image data. Here, we investigate 3D search for targets of various sizes in filtered noise and digital breast phantoms. For a Bayesian ideal observer optimally processing the filtered noise and a convolutional neural network processing the digital breast phantoms, search with 3D image stacks increases target information and improves accuracy over search with 2D images. In contrast, 3D search by humans leads to high miss rates for small targets easily detected in 2D search, but not for larger targets more visible in the visual periphery. Analyses of human eye movements, perceptual judgments, and a computational model with a foveated visual system suggest that human errors can be explained by interaction among a target's peripheral visibility, eye movement under-exploration of the 3D images, and a perceived overestimation of the explored area. Instructing observers to extend the search reduces 75% of the small target misses without increasing false positives. Results with twelve radiologists confirm that even medical professionals reading realistic breast phantoms have high miss rates for small targets in 3D search. Thus, under-exploration represents a fundamental limitation to the efficacy with which humans search in 3D image stacks and miss targets with these prevalent image technologies.
Collapse
Affiliation(s)
- Miguel A Lago
- Department of Psychological and Brain Sciences, University of California, Santa Barbara, Santa Barbara, CA 93106, USA
| | - Aditya Jonnalagadda
- Department of Electrical and Computer Engineering, University of California, Santa Barbara, Santa Barbara, CA 93106, USA; Institute for Collaborative Biotechnologies, University of California, Santa Barbara, Santa Barbara, CA 93106, USA
| | - Craig K Abbey
- Department of Psychological and Brain Sciences, University of California, Santa Barbara, Santa Barbara, CA 93106, USA
| | - Bruno B Barufaldi
- Department of Radiology, University of Pennsylvania, 3400 Spruce Street, Philadelphia, PA 19104, USA
| | - Predrag R Bakic
- Department of Radiology, University of Pennsylvania, 3400 Spruce Street, Philadelphia, PA 19104, USA
| | - Andrew D A Maidment
- Department of Radiology, University of Pennsylvania, 3400 Spruce Street, Philadelphia, PA 19104, USA
| | - Winifred K Leung
- Ridley-Tree Cancer Center, Sansum Clinic, 540 W. Pueblo Street, Santa Barbara, CA 93105, USA
| | - Susan P Weinstein
- Department of Radiology, University of Pennsylvania, 3400 Spruce Street, Philadelphia, PA 19104, USA
| | - Brian S Englander
- Department of Radiology, University of Pennsylvania, 3400 Spruce Street, Philadelphia, PA 19104, USA
| | - Miguel P Eckstein
- Department of Psychological and Brain Sciences, University of California, Santa Barbara, Santa Barbara, CA 93106, USA; Department of Electrical and Computer Engineering, University of California, Santa Barbara, Santa Barbara, CA 93106, USA; Institute for Collaborative Biotechnologies, University of California, Santa Barbara, Santa Barbara, CA 93106, USA.
| |
Collapse
|
45
|
Buls N, Watté N, Nieboer K, Ilsen B, de Mey J. Performance of an artificial intelligence tool with real-time clinical workflow integration – Detection of intracranial hemorrhage and pulmonary embolism. Phys Med 2021; 83:154-160. [DOI: 10.1016/j.ejmp.2021.03.015] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/03/2020] [Revised: 02/23/2021] [Accepted: 03/08/2021] [Indexed: 01/11/2023] Open
|
46
|
Cabitza F, Campagner A, Sconfienza LM. Studying human-AI collaboration protocols: the case of the Kasparov's law in radiological double reading. Health Inf Sci Syst 2021; 9:8. [PMID: 33585029 PMCID: PMC7864624 DOI: 10.1007/s13755-021-00138-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2020] [Accepted: 01/13/2021] [Indexed: 12/17/2022] Open
Abstract
Purpose The integration of Artificial Intelligence into medical practices has recently been advocated for the promise to bring increased efficiency and effectiveness to these practices. Nonetheless, little research has so far been aimed at understanding the best human-AI interaction protocols in collaborative tasks, even in currently more viable settings, like independent double-reading screening tasks. Methods To this aim, we report about a retrospective case–control study, involving 12 board-certified radiologists, in the detection of knee lesions by means of Magnetic Resonance Imaging, in which we simulated the serial combination of two Deep Learning models with humans in eight double-reading protocols. Inspired by the so-called Kasparov’s Laws, we investigate whether the combination of humans and AI models could achieve better performance than AI models alone, and whether weak reader, when supported by fit-for-use interaction protocols, could out-perform stronger readers. Results We discuss two main findings: groups of humans who perform significantly worse than a state-of-the-art AI can significantly outperform it if their judgements are aggregated by majority voting (in concordance with the first part of the Kasparov’s law); small ensembles of significantly weaker readers can significantly outperform teams of stronger readers, supported by the same computational tool, when the judgments of the former ones are combined within “fit-for-use” protocols (in concordance with the second part of the Kasparov’s law). Conclusion Our study shows that good interaction protocols can guarantee improved decision performance that easily surpasses the performance of individual agents, even of realistic super-human AI systems. This finding highlights the importance of focusing on how to guarantee better co-operation within human-AI teams, so to enable safer and more human sustainable care practices.
Collapse
Affiliation(s)
- Federico Cabitza
- Università degli Studi di Milano-Bicocca, Viale Sarca 336, 20126 Milan, Italy
| | - Andrea Campagner
- Università degli Studi di Milano-Bicocca, Viale Sarca 336, 20126 Milan, Italy
| | - Luca Maria Sconfienza
- Department of Biomedical Sciences for Health, University of Milan, Milan, Italy.,IRCCS Istituto Ortopedico Galeazzi, Milan, Italy
| |
Collapse
|
47
|
Rao B, Zohrabian V, Cedeno P, Saha A, Pahade J, Davis MA. Utility of Artificial Intelligence Tool as a Prospective Radiology Peer Reviewer - Detection of Unreported Intracranial Hemorrhage. Acad Radiol 2021; 28:85-93. [PMID: 32102747 DOI: 10.1016/j.acra.2020.01.035] [Citation(s) in RCA: 45] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2019] [Revised: 01/27/2020] [Accepted: 01/29/2020] [Indexed: 01/14/2023]
Abstract
RATIONALE AND OBJECTIVES Misdiagnosis of intracranial hemorrhage (ICH) can adversely impact patient outcomes. The increasing workload on the radiologists may increase the chance of error and compromise the quality of care provided by the radiologists. MATERIALS AND METHODS We used an FDA approved artificial intelligence (AI) solution based on a convolutional neural network to assess the prevalence of ICH in scans, which were reported as negative for ICH. We retrospectively applied the AI solution to all consecutive noncontrast computed tomography (CT) head scans performed at eight imaging sites affiliated to our institution. RESULTS In the 6565 noncontrast CT head scans, which met the inclusion criteria, 5585 scans were reported to have no ICH ("negative-by-report" cases). We applied AI solution to these "negative-by-report" cases. AI solution suggested there were ICH in 28 of these scans ("negative-by-report" and "positive-by-AI solution"). After consensus review by three neuroradiologists, 16 of these scans were found to have ICH, which was not reported (missed diagnosis by radiologists), with a false-negative rate of radiologists for ICH detection at 1.6%. Most commonly missed ICH was overlying the cerebral convexity and in the parafalcine regions. CONCLUSION Our study demonstrates that an AI solution can help radiologists to diagnose ICH and thus decrease the error rate. AI solution can serve as a prospective peer review tool for non-contrast head CT scans to identify ICH and thus minimize false negatives.
Collapse
Affiliation(s)
- Balaji Rao
- Department of Radiology and Biomedical Imaging, Yale University School of Medicine, 330 Cedar St. Tompkins East TE-2, New Haven, CT 06520.
| | - Vahe Zohrabian
- Department of Radiology and Biomedical Imaging, Yale University School of Medicine, 330 Cedar St. Tompkins East TE-2, New Haven, CT 06520
| | - Paul Cedeno
- Department of Radiology and Biomedical Imaging, Yale University School of Medicine, 330 Cedar St. Tompkins East TE-2, New Haven, CT 06520
| | - Atin Saha
- Department of Radiology and Biomedical Imaging, Yale University School of Medicine, 330 Cedar St. Tompkins East TE-2, New Haven, CT 06520
| | - Jay Pahade
- Department of Radiology and Biomedical Imaging, Yale University School of Medicine, 330 Cedar St. Tompkins East TE-2, New Haven, CT 06520
| | - Melissa A Davis
- Department of Radiology and Biomedical Imaging, Yale University School of Medicine, 330 Cedar St. Tompkins East TE-2, New Haven, CT 06520
| |
Collapse
|
48
|
Chung R, Rosenkrantz AB, Shanbhogue KP. Expert radiologist review at a hepatobiliary multidisciplinary tumor board: impact on patient management. Abdom Radiol (NY) 2020; 45:3800-3808. [PMID: 32444889 DOI: 10.1007/s00261-020-02587-3] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
PURPOSE To identify the frequency, source, and management impact of discrepancies between the initial radiology report and expert reinterpretation occurring in the context of a hepatobiliary multidisciplinary tumor board (MTB). METHODS This retrospective study included 974 consecutive patients discussed at a weekly MTB at a large tertiary care academic medical center over a 2-year period. A single radiologist with dedicated hepatobiliary imaging expertise attended all conferences to review and discuss the relevant liver imaging and rated the concordance between original and re-reads based on RADPEER scoring criteria. Impact on management was based on the conference discussion and reflected changes in follow-up imaging, recommendations for biopsy/surgery, or liver transplant eligibility. RESULTS Image reinterpretation was discordant with the initial report in 19.9% (194/974) of cases (59.8%, 34.5%, 5.7% RADPEER 2/3/4 discrepancies, respectively). A change in LI-RADS category occurred in 59.8% of discrepancies. Most common causes of discordance included re-classification of a lesion as benign rather than malignant (16.0%) and missed tumor recurrence (13.9%). Impact on management occurred in 99.0% of discordant cases and included loco-regional therapy instead of follow-up imaging (19.1%), follow-up imaging instead of treatment (17.5%), and avoidance of biopsy (12.4%). 11.3% received OPTN exception scores due to the revised interpretation, and 8.8% were excluded from listing for orthotopic liver transplant. CONCLUSION Even in a sub-specialized abdominal imaging academic practice, expert radiologist review in the MTB setting identified discordant interpretations and impacted management in a substantial fraction of patients, potentially impacting transplant allocation. The findings may impact how abdominal imaging sections best staff advanced MTBs.
Collapse
Affiliation(s)
- Ryan Chung
- Department of Radiology, NYU Langone Health, 660 First Ave, New York, NY, 10016, USA
| | - Andrew B Rosenkrantz
- Department of Radiology, NYU Langone Health, 660 First Ave, New York, NY, 10016, USA
| | - Krishna P Shanbhogue
- Department of Radiology, NYU Langone Health, 660 First Ave, New York, NY, 10016, USA.
| |
Collapse
|
49
|
Harreld JH, Kaufman RA, Kang G, Maron G, Mitchell W, Thompson JW, Srinivasan A. The use of imaging to identify immunocompromised children requiring biopsy for invasive fungal rhinosinusitis. Pediatr Blood Cancer 2020; 67:e28676. [PMID: 32860662 DOI: 10.1002/pbc.28676] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/02/2020] [Revised: 08/12/2020] [Accepted: 08/13/2020] [Indexed: 01/11/2023]
Abstract
BACKGROUND AND PURPOSE Children with severe immunocompromise due to cancer therapy or hematopoietic cell transplant are at risk both for potentially lethal invasive fungal rhinosinusitis (IFRS), and for complications associated with gold-standard biopsy diagnosis. We investigated whether early imaging could reliably identify or exclude IFRS in this population, thereby reducing unnecessary biopsy. METHODS We reviewed clinical/laboratory data and cross-sectional imaging from 31 pediatric patients evaluated for suspicion of IFRS, 19 without (age 11.8 ± 5.4 years) and 12 with proven IFRS (age 11.9 ± 4.6 years). Imaging examinations were graded for mucosal thickening (Lund score), for fungal-specific signs (FSS) of bone destruction, extra-sinus inflammation, and nasal mucosal ulceration. Loss of contrast enhancement (LoCE) was assessed separately where possible. Clinical and imaging findings were compared with parametric or nonparametric tests as appropriate. Diagnostic accuracy was assessed by receiver operating characteristic (ROC) analysis. Positive (+LR) and negative likelihood ratios (-LR) and probabilities were calculated. RESULTS Ten of 12 patients with IFRS and one of 19 without IFRS had at least one FSS on early imaging (83% sensitive, 95% specific, +LR = 15.83, -LR = 0.18; P < .001). Absolute neutrophil count (ANC) ≤ 200/mm3 was 100% sensitive and 58% specific for IFRS (+LR = 2.38, -LR = 0; P = .001). Facial pain was the only discriminating symptom of IFRS (P < .001). In a symptomatic child with ANC ≤ 200/m3 , the presence of at least one FSS indicated high (79%) probability of IFRS; absence of FSS suggested low (<4%) probability. CONCLUSION In symptomatic, severely immunocompromised children, the presence or absence of fungal-specific imaging findings may effectively rule in or rule out early IFRS, potentially sparing some patients the risks associated with biopsy.
Collapse
Affiliation(s)
- Julie H Harreld
- Department of Diagnostic Imaging, St Jude Children's Research Hospital, Memphis, Tennessee
| | - Robert A Kaufman
- Department of Diagnostic Imaging, St Jude Children's Research Hospital, Memphis, Tennessee
| | - Guolian Kang
- Department of Biostatistics, St Jude Children's Research Hospital, Memphis, Tennessee
| | - Gabriela Maron
- Department of Infectious Disease, St Jude Children's Research Hospital, Memphis, Tennessee
| | - William Mitchell
- Department of Bone Marrow Transplantation and Cellular Therapy, St Jude Children's Research Hospital, Memphis, Tennessee
| | - Jerome W Thompson
- Department of Otolaryngology, University of Tennessee Health Sciences Center; Department of Surgery, St Jude Children's Research Hospital, Memphis, Tennessee
| | - Ashok Srinivasan
- Department of Bone Marrow Transplantation and Cellular Therapy, St Jude Children's Research Hospital, Memphis, Tennessee
| |
Collapse
|
50
|
Body MRI Subspecialty Reinterpretations at a Tertiary Care Center: Discrepancy Rates and Error Types. AJR Am J Roentgenol 2020; 215:1384-1388. [PMID: 33052740 DOI: 10.2214/ajr.20.22797] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
OBJECTIVE. Radiology departments in tertiary care centers are frequently asked to perform secondary interpretations of imaging studies, particularly when a patient is transferred from a community hospital. Discrepancy rates in radiology vary widely, with low rates reported for preliminary resident reports that are overread by attending radiologists (2-6%) and higher rates (up to 56%) for secondary interpretations. Abdominal and pelvic imaging and cross-sectional imaging have the highest discrepancy rates. The purpose of our study was to determine the discrepancy rate and the most common reasons for discrepancies between abdominal and pelvic MRI reports obtained from outside institutions and secondary interpretations of these reports by a fellowship-trained radiologist at a tertiary care center. MATERIALS AND METHODS. We retrospectively identified 395 secondary MRI reports from January 2015 to December 2018 that were labeled as body MRI examinations at a tertiary care center. Thirty-eight cases were excluded for various reasons, including incorrect categorization or lack of outside report. We reviewed the outside reports, compared them with the secondary interpretations, and categorized the cases as discrepancy or no discrepancy. The discrepancies were subdivided into the most likely reason for the error using previously published categories; these categories were also divided into perceptive and cognitive errors. RESULTS. Of the 357 included cases, 246 (68.9%) had at least one discrepancy. The most common reason for error was faulty reasoning (34.3%), which is a cognitive error characterized by misidentifying an abnormality. Satisfaction of search, which is a perceptive error, was the most common reason for second discrepancies (15.0%). CONCLUSION. Secondary interpretations of body MR images at a tertiary care center identify a high rate of discrepancies, with cognitive error types predominating.
Collapse
|