1
|
Jeblick K, Schachtner B, Dexl J, Mittermeier A, Stüber AT, Topalis J, Weber T, Wesp P, Sabel BO, Ricke J, Ingrisch M. ChatGPT makes medicine easy to swallow: an exploratory case study on simplified radiology reports. Eur Radiol 2023:10.1007/s00330-023-10213-1. [PMID: 37794249 DOI: 10.1007/s00330-023-10213-1] [Citation(s) in RCA: 34] [Impact Index Per Article: 34.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Revised: 05/24/2023] [Accepted: 07/07/2023] [Indexed: 10/06/2023]
Abstract
OBJECTIVES To assess the quality of simplified radiology reports generated with the large language model (LLM) ChatGPT and to discuss challenges and chances of ChatGPT-like LLMs for medical text simplification. METHODS In this exploratory case study, a radiologist created three fictitious radiology reports which we simplified by prompting ChatGPT with "Explain this medical report to a child using simple language." In a questionnaire, we tasked 15 radiologists to rate the quality of the simplified radiology reports with respect to their factual correctness, completeness, and potential harm for patients. We used Likert scale analysis and inductive free-text categorization to assess the quality of the simplified reports. RESULTS Most radiologists agreed that the simplified reports were factually correct, complete, and not potentially harmful to the patient. Nevertheless, instances of incorrect statements, missed relevant medical information, and potentially harmful passages were reported. CONCLUSION While we see a need for further adaption to the medical field, the initial insights of this study indicate a tremendous potential in using LLMs like ChatGPT to improve patient-centered care in radiology and other medical domains. CLINICAL RELEVANCE STATEMENT Patients have started to use ChatGPT to simplify and explain their medical reports, which is expected to affect patient-doctor interaction. This phenomenon raises several opportunities and challenges for clinical routine. KEY POINTS • Patients have started to use ChatGPT to simplify their medical reports, but their quality was unknown. • In a questionnaire, most participating radiologists overall asserted good quality to radiology reports simplified with ChatGPT. However, they also highlighted a notable presence of errors, potentially leading patients to draw harmful conclusions. • Large language models such as ChatGPT have vast potential to enhance patient-centered care in radiology and other medical domains. To realize this potential while minimizing harm, they need supervision by medical experts and adaption to the medical field.
Collapse
Affiliation(s)
- Katharina Jeblick
- Department of Radiology, LMU University Hospital, LMU Munich, Munich, Germany.
- Comprehensive Pneumology Center (CPC-M), Member of the German Center for Lung Research (DZL), Munich, Germany.
- Munich Center for Machine Learning (MCML), Munich, Germany.
| | - Balthasar Schachtner
- Department of Radiology, LMU University Hospital, LMU Munich, Munich, Germany
- Munich Center for Machine Learning (MCML), Munich, Germany
| | - Jakob Dexl
- Department of Radiology, LMU University Hospital, LMU Munich, Munich, Germany
- Munich Center for Machine Learning (MCML), Munich, Germany
| | - Andreas Mittermeier
- Department of Radiology, LMU University Hospital, LMU Munich, Munich, Germany
- Munich Center for Machine Learning (MCML), Munich, Germany
| | - Anna Theresa Stüber
- Department of Radiology, LMU University Hospital, LMU Munich, Munich, Germany
- Munich Center for Machine Learning (MCML), Munich, Germany
- Department of Statistics, LMU Munich, Munich, Germany
| | - Johanna Topalis
- Department of Radiology, LMU University Hospital, LMU Munich, Munich, Germany
| | - Tobias Weber
- Department of Radiology, LMU University Hospital, LMU Munich, Munich, Germany
- Munich Center for Machine Learning (MCML), Munich, Germany
- Department of Statistics, LMU Munich, Munich, Germany
| | - Philipp Wesp
- Department of Radiology, LMU University Hospital, LMU Munich, Munich, Germany
- Munich Center for Machine Learning (MCML), Munich, Germany
| | | | - Jens Ricke
- Department of Radiology, LMU University Hospital, LMU Munich, Munich, Germany
| | - Michael Ingrisch
- Department of Radiology, LMU University Hospital, LMU Munich, Munich, Germany
- Munich Center for Machine Learning (MCML), Munich, Germany
| |
Collapse
|
2
|
Aubreville M, Stathonikos N, Bertram CA, Klopfleisch R, Ter Hoeve N, Ciompi F, Wilm F, Marzahl C, Donovan TA, Maier A, Breen J, Ravikumar N, Chung Y, Park J, Nateghi R, Pourakpour F, Fick RHJ, Ben Hadj S, Jahanifar M, Shephard A, Dexl J, Wittenberg T, Kondo S, Lafarge MW, Koelzer VH, Liang J, Wang Y, Long X, Liu J, Razavi S, Khademi A, Yang S, Wang X, Erber R, Klang A, Lipnik K, Bolfa P, Dark MJ, Wasinger G, Veta M, Breininger K. Mitosis domain generalization in histopathology images - The MIDOG challenge. Med Image Anal 2023; 84:102699. [PMID: 36463832 DOI: 10.1016/j.media.2022.102699] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2022] [Revised: 10/28/2022] [Accepted: 11/17/2022] [Indexed: 11/27/2022]
Abstract
The density of mitotic figures (MF) within tumor tissue is known to be highly correlated with tumor proliferation and thus is an important marker in tumor grading. Recognition of MF by pathologists is subject to a strong inter-rater bias, limiting its prognostic value. State-of-the-art deep learning methods can support experts but have been observed to strongly deteriorate when applied in a different clinical environment. The variability caused by using different whole slide scanners has been identified as one decisive component in the underlying domain shift. The goal of the MICCAI MIDOG 2021 challenge was the creation of scanner-agnostic MF detection algorithms. The challenge used a training set of 200 cases, split across four scanning systems. As test set, an additional 100 cases split across four scanning systems, including two previously unseen scanners, were provided. In this paper, we evaluate and compare the approaches that were submitted to the challenge and identify methodological factors contributing to better performance. The winning algorithm yielded an F1 score of 0.748 (CI95: 0.704-0.781), exceeding the performance of six experts on the same task.
Collapse
Affiliation(s)
| | | | - Christof A Bertram
- Institute of Pathology, University of Veterinary Medicine, Vienna, Austria
| | - Robert Klopfleisch
- Institute of Veterinary Pathology, Freie Universität Berlin, Berlin, Germany
| | | | - Francesco Ciompi
- Computational Pathology Group, Radboud UMC, Nijmegen, The Netherlands
| | - Frauke Wilm
- Pattern Recognition Lab, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany; Department Artificial Intelligence in Biomedical Engineering, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany
| | - Christian Marzahl
- Pattern Recognition Lab, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany
| | - Taryn A Donovan
- Department of Anatomic Pathology, Schwarzman Animal Medical Center, NY, USA
| | - Andreas Maier
- Pattern Recognition Lab, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany
| | - Jack Breen
- CISTIB Center for Computational Imaging and Simulation Technologies in Biomedicine, School of Computing, University of Leeds, Leeds, UK
| | - Nishant Ravikumar
- CISTIB Center for Computational Imaging and Simulation Technologies in Biomedicine, School of Computing, University of Leeds, Leeds, UK
| | - Youjin Chung
- Korea Advanced Institute of Science and Technology, Daejeon, South Korea
| | - Jinah Park
- Korea Advanced Institute of Science and Technology, Daejeon, South Korea
| | - Ramin Nateghi
- Electrical and Electronics Engineering Department, Shiraz University of Technology, Shiraz, Iran
| | - Fattaneh Pourakpour
- Iranian Brain Mapping Biobank (IBMB), National Brain Mapping Laboratory (NBML), Tehran, Iran
| | | | | | - Mostafa Jahanifar
- Tissue Image Analytics Centre, Department of Computer Science, University of Warwick, Warwick, UK
| | - Adam Shephard
- Tissue Image Analytics Centre, Department of Computer Science, University of Warwick, Warwick, UK
| | - Jakob Dexl
- Fraunhofer-Institute for Integrated Circuits IIS, Erlangen, Germany
| | | | | | - Maxime W Lafarge
- Department of Pathology and Molecular Pathology, University Hospital and University of Zurich, Zurich, Switzerland
| | - Viktor H Koelzer
- Department of Pathology and Molecular Pathology, University Hospital and University of Zurich, Zurich, Switzerland
| | - Jingtang Liang
- School of Life Science and Technology, Xidian University, Shannxi, China
| | - Yubo Wang
- School of Life Science and Technology, Xidian University, Shannxi, China
| | - Xi Long
- Histo Pathology Diagnostic Center, Shanghai, China
| | - Jingxin Liu
- Xi'an Jiaotong-Liverpool University, Suzhou, China
| | - Salar Razavi
- Image Analysis in Medicine Lab (IAMLAB), Electrical, Computer and Biomedical Engineering, Ryerson University, Toronto, ON, Canada
| | - April Khademi
- Image Analysis in Medicine Lab (IAMLAB), Electrical, Computer and Biomedical Engineering, Ryerson University, Toronto, ON, Canada
| | - Sen Yang
- Tencent AI Lab, Shenzhen 518057, China
| | - Xiyue Wang
- College of Computer Science, Sichuan University, Chengdu 610065, China
| | - Ramona Erber
- Institute of Pathology, University Hospital Erlangen, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany
| | - Andrea Klang
- Institute of Pathology, University of Veterinary Medicine, Vienna, Austria
| | - Karoline Lipnik
- Institute of Pathology, University of Veterinary Medicine, Vienna, Austria
| | - Pompei Bolfa
- Ross University School of Veterinary Medicine, Basseterre, Saint Kitts and Nevis
| | - Michael J Dark
- College of Veterinary Medicine, University of Florida, Gainesville, FL, USA
| | - Gabriel Wasinger
- Department of Pathology, General Hospital of Vienna, Medical University of Vienna, Vienna, Austria
| | - Mitko Veta
- Medical Image Analysis Group, TU Eindhoven, Eindhoven, The Netherlands
| | - Katharina Breininger
- Department Artificial Intelligence in Biomedical Engineering, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany
| |
Collapse
|
3
|
Dexl J, Benz M, Kuritcyn P, Wittenberg T, Bruns V, Geppert C, Hartmann A, Bischl B, Goschenhofer J. Robust Colon Tissue Cartography with Semi-Supervision. Current Directions in Biomedical Engineering 2022. [DOI: 10.1515/cdbme-2022-1088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Abstract
We explore the task of tissue classification for colon cancer histology in a low label regime comparing a semi-supervised and a supervised learning strategy in a series of experiments. Further, we investigate the model robustness w.r.t. distribution shifts in the unlabeled data and domain shifts across different scanners to prove their practicality in a histology context. By utilizing unlabeled data in addition to nl = 1000 labeled tiles per class, we yield a substantial increase in accuracy from 89.9% to 91.4%.
Collapse
Affiliation(s)
- Jakob Dexl
- Fraunhofer Institute for Integrated Circuits IIS, Am Wolfsmantel 33, Erlangen , Germany
| | - Michaela Benz
- Fraunhofer Institute for Integrated Circuits IIS, Am Wolfsmantel 33, Erlangen , Germany
| | - Petr Kuritcyn
- Fraunhofer Institute for Integrated Circuits IIS, Am Wolfsmantel 33, Erlangen , Germany
| | - Thomas Wittenberg
- Fraunhofer Institute for Integrated Circuits IIS, Am Wolfsmantel 33, Erlangen , Germany
| | - Volker Bruns
- Fraunhofer Institute for Integrated Circuits IIS, Am Wolfsmantel 33, Erlangen , Germany
| | - Carol Geppert
- Institut fur Pathologie, Universitatsklinikum Erlangen; Friedrich-Alexander-Universitat (FAU), Erlangen-Nurnberg , Germany
| | - Arndt Hartmann
- Institut fur Pathologie, Universitatsklinikum Erlangen; Friedrich-Alexander-Universitat (FAU), Erlangen-Nurnberg , Germany
| | - Bernd Bischl
- Department of Statistics, Ludwig-Maximilians-Universitat, Munich , Germany
| | - Jann Goschenhofer
- Department of Statistics, Ludwig-Maximilians-Universitat, Munich , Germany
| |
Collapse
|
4
|
Wilm F, Benz M, Bruns V, Baghdadlian S, Dexl J, Hartmann D, Kuritcyn P, Weidenfeller M, Wittenberg T, Merkel S, Hartmann A, Eckstein M, Geppert CI. Fast whole-slide cartography in colon cancer histology using superpixels and CNN classification. J Med Imaging (Bellingham) 2022; 9:027501. [PMID: 35300344 PMCID: PMC8920491 DOI: 10.1117/1.jmi.9.2.027501] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Accepted: 02/17/2022] [Indexed: 11/14/2022] Open
Abstract
Purpose: Automatic outlining of different tissue types in digitized histological specimen provides a basis for follow-up analyses and can potentially guide subsequent medical decisions. The immense size of whole-slide-images (WSIs), however, poses a challenge in terms of computation time. In this regard, the analysis of nonoverlapping patches outperforms pixelwise segmentation approaches but still leaves room for optimization. Furthermore, the division into patches, regardless of the biological structures they contain, is a drawback due to the loss of local dependencies. Approach: We propose to subdivide the WSI into coherent regions prior to classification by grouping visually similar adjacent pixels into superpixels. Afterward, only a random subset of patches per superpixel is classified and patch labels are combined into a superpixel label. We propose a metric for identifying superpixels with an uncertain classification and evaluate two medical applications, namely tumor area and invasive margin estimation and tumor composition analysis. Results: The algorithm has been developed on 159 hand-annotated WSIs of colon resections and its performance is compared with an analysis without prior segmentation. The algorithm shows an average speed-up of 41% and an increase in accuracy from 93.8% to 95.7%. By assigning a rejection label to uncertain superpixels, we further increase the accuracy by 0.4%. While tumor area estimation shows high concordance to the annotated area, the analysis of tumor composition highlights limitations of our approach. Conclusion: By combining superpixel segmentation and patch classification, we designed a fast and accurate framework for whole-slide cartography that is AI-model agnostic and provides the basis for various medical endpoints.
Collapse
Affiliation(s)
- Frauke Wilm
- Fraunhofer Institute for Integrated Circuits IIS, Image Processing and Medical Engineering Department, Erlangen, Germany.,Friedrich-Alexander-University, Erlangen-Nuremberg, Department of Computer Science, Erlangen, Germany
| | - Michaela Benz
- Fraunhofer Institute for Integrated Circuits IIS, Image Processing and Medical Engineering Department, Erlangen, Germany
| | - Volker Bruns
- Fraunhofer Institute for Integrated Circuits IIS, Image Processing and Medical Engineering Department, Erlangen, Germany
| | - Serop Baghdadlian
- Fraunhofer Institute for Integrated Circuits IIS, Image Processing and Medical Engineering Department, Erlangen, Germany
| | - Jakob Dexl
- Fraunhofer Institute for Integrated Circuits IIS, Image Processing and Medical Engineering Department, Erlangen, Germany
| | - David Hartmann
- Fraunhofer Institute for Integrated Circuits IIS, Image Processing and Medical Engineering Department, Erlangen, Germany
| | - Petr Kuritcyn
- Fraunhofer Institute for Integrated Circuits IIS, Image Processing and Medical Engineering Department, Erlangen, Germany
| | - Martin Weidenfeller
- Fraunhofer Institute for Integrated Circuits IIS, Image Processing and Medical Engineering Department, Erlangen, Germany
| | - Thomas Wittenberg
- Fraunhofer Institute for Integrated Circuits IIS, Image Processing and Medical Engineering Department, Erlangen, Germany.,Friedrich-Alexander-University, Erlangen-Nuremberg, Department of Computer Science, Erlangen, Germany
| | - Susanne Merkel
- University Hospital Erlangen, Department of Surgery, FAU Erlangen-Nuremberg, Erlangen, Germany.,University Hospital Erlangen, Comprehensive Cancer Center Erlangen-EMN (CCC), FAU Erlangen-Nuremberg, Erlangen, Germany
| | - Arndt Hartmann
- University Hospital Erlangen, Comprehensive Cancer Center Erlangen-EMN (CCC), FAU Erlangen-Nuremberg, Erlangen, Germany.,University Hospital Erlangen, Institute of Pathology, FAU Erlangen-Nuremberg, Erlangen, Germany
| | - Markus Eckstein
- University Hospital Erlangen, Comprehensive Cancer Center Erlangen-EMN (CCC), FAU Erlangen-Nuremberg, Erlangen, Germany.,University Hospital Erlangen, Institute of Pathology, FAU Erlangen-Nuremberg, Erlangen, Germany
| | - Carol Immanuel Geppert
- University Hospital Erlangen, Comprehensive Cancer Center Erlangen-EMN (CCC), FAU Erlangen-Nuremberg, Erlangen, Germany.,University Hospital Erlangen, Institute of Pathology, FAU Erlangen-Nuremberg, Erlangen, Germany
| |
Collapse
|