1
|
Hosseini MS, Bejnordi BE, Trinh VQH, Chan L, Hasan D, Li X, Yang S, Kim T, Zhang H, Wu T, Chinniah K, Maghsoudlou S, Zhang R, Zhu J, Khaki S, Buin A, Chaji F, Salehi A, Nguyen BN, Samaras D, Plataniotis KN. Computational pathology: A survey review and the way forward. J Pathol Inform 2024; 15:100357. [PMID: 38420608 PMCID: PMC10900832 DOI: 10.1016/j.jpi.2023.100357] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2023] [Revised: 12/21/2023] [Accepted: 12/23/2023] [Indexed: 03/02/2024] Open
Abstract
Computational Pathology (CPath) is an interdisciplinary science that augments developments of computational approaches to analyze and model medical histopathology images. The main objective for CPath is to develop infrastructure and workflows of digital diagnostics as an assistive CAD system for clinical pathology, facilitating transformational changes in the diagnosis and treatment of cancer that are mainly address by CPath tools. With evergrowing developments in deep learning and computer vision algorithms, and the ease of the data flow from digital pathology, currently CPath is witnessing a paradigm shift. Despite the sheer volume of engineering and scientific works being introduced for cancer image analysis, there is still a considerable gap of adopting and integrating these algorithms in clinical practice. This raises a significant question regarding the direction and trends that are undertaken in CPath. In this article we provide a comprehensive review of more than 800 papers to address the challenges faced in problem design all-the-way to the application and implementation viewpoints. We have catalogued each paper into a model-card by examining the key works and challenges faced to layout the current landscape in CPath. We hope this helps the community to locate relevant works and facilitate understanding of the field's future directions. In a nutshell, we oversee the CPath developments in cycle of stages which are required to be cohesively linked together to address the challenges associated with such multidisciplinary science. We overview this cycle from different perspectives of data-centric, model-centric, and application-centric problems. We finally sketch remaining challenges and provide directions for future technical developments and clinical integration of CPath. For updated information on this survey review paper and accessing to the original model cards repository, please refer to GitHub. Updated version of this draft can also be found from arXiv.
Collapse
Affiliation(s)
- Mahdi S. Hosseini
- Department of Computer Science and Software Engineering (CSSE), Concordia Univeristy, Montreal, QC H3H 2R9, Canada
| | | | - Vincent Quoc-Huy Trinh
- Institute for Research in Immunology and Cancer of the University of Montreal, Montreal, QC H3T 1J4, Canada
| | - Lyndon Chan
- The Edward S. Rogers Sr. Department of Electrical & Computer Engineering (ECE), University of Toronto, Toronto, ON M5S 3G4, Canada
| | - Danial Hasan
- The Edward S. Rogers Sr. Department of Electrical & Computer Engineering (ECE), University of Toronto, Toronto, ON M5S 3G4, Canada
| | - Xingwen Li
- The Edward S. Rogers Sr. Department of Electrical & Computer Engineering (ECE), University of Toronto, Toronto, ON M5S 3G4, Canada
| | - Stephen Yang
- The Edward S. Rogers Sr. Department of Electrical & Computer Engineering (ECE), University of Toronto, Toronto, ON M5S 3G4, Canada
| | - Taehyo Kim
- The Edward S. Rogers Sr. Department of Electrical & Computer Engineering (ECE), University of Toronto, Toronto, ON M5S 3G4, Canada
| | - Haochen Zhang
- The Edward S. Rogers Sr. Department of Electrical & Computer Engineering (ECE), University of Toronto, Toronto, ON M5S 3G4, Canada
| | - Theodore Wu
- The Edward S. Rogers Sr. Department of Electrical & Computer Engineering (ECE), University of Toronto, Toronto, ON M5S 3G4, Canada
| | - Kajanan Chinniah
- The Edward S. Rogers Sr. Department of Electrical & Computer Engineering (ECE), University of Toronto, Toronto, ON M5S 3G4, Canada
| | - Sina Maghsoudlou
- Department of Computer Science and Software Engineering (CSSE), Concordia Univeristy, Montreal, QC H3H 2R9, Canada
| | - Ryan Zhang
- The Edward S. Rogers Sr. Department of Electrical & Computer Engineering (ECE), University of Toronto, Toronto, ON M5S 3G4, Canada
| | - Jiadai Zhu
- The Edward S. Rogers Sr. Department of Electrical & Computer Engineering (ECE), University of Toronto, Toronto, ON M5S 3G4, Canada
| | - Samir Khaki
- The Edward S. Rogers Sr. Department of Electrical & Computer Engineering (ECE), University of Toronto, Toronto, ON M5S 3G4, Canada
| | - Andrei Buin
- Huron Digitial Pathology, St. Jacobs, ON N0B 2N0, Canada
| | - Fatemeh Chaji
- Department of Computer Science and Software Engineering (CSSE), Concordia Univeristy, Montreal, QC H3H 2R9, Canada
| | - Ala Salehi
- Department of Electrical and Computer Engineering, University of New Brunswick, Fredericton, NB E3B 5A3, Canada
| | - Bich Ngoc Nguyen
- University of Montreal Hospital Center, Montreal, QC H2X 0C2, Canada
| | - Dimitris Samaras
- Department of Computer Science, Stony Brook University, Stony Brook, NY 11794, United States
| | - Konstantinos N. Plataniotis
- The Edward S. Rogers Sr. Department of Electrical & Computer Engineering (ECE), University of Toronto, Toronto, ON M5S 3G4, Canada
| |
Collapse
|
2
|
Hägele M, Eschrich J, Ruff L, Alber M, Schallenberg S, Guillot A, Roderburg C, Tacke F, Klauschen F. Leveraging weak complementary labels enhances semantic segmentation of hepatocellular carcinoma and intrahepatic cholangiocarcinoma. Sci Rep 2024; 14:24988. [PMID: 39443575 PMCID: PMC11499859 DOI: 10.1038/s41598-024-75256-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2024] [Accepted: 10/03/2024] [Indexed: 10/25/2024] Open
Abstract
In this paper we present a deep learning segmentation approach to classify and quantify the two most prevalent primary liver cancers - hepatocellular carcinoma and intrahepatic cholangiocarcinoma - from hematoxylin and eosin (H&E) stained whole slide images. While semantic segmentation of medical images typically requires costly pixel-level annotations by domain experts, there often exists additional information which is routinely obtained in clinical diagnostics but rarely utilized for model training. We propose to leverage such weak information from patient diagnoses by deriving complementary labels that indicate to which class a sample cannot belong to. To integrate these labels, we formulate a complementary loss for segmentation. Motivated by the medical application, we demonstrate for general segmentation tasks that including additional patches with solely weak complementary labels during model training can significantly improve the predictive performance and robustness of a model. On the task of diagnostic differentiation between hepatocellular carcinoma and intrahepatic cholangiocarcinoma, we achieve a balanced accuracy of 0.91 (CI 95%: 0.86-0.95) at case level for 165 hold-out patients. Furthermore, we also show that leveraging complementary labels improves the robustness of segmentation and increases performance at case level.
Collapse
Affiliation(s)
- Miriam Hägele
- Machine learning group, Technische Universität Berlin, 10623, Berlin, Germany.
- BIFOLD - Berlin Institute for the Foundations of Learning and Data, Berlin, Germany.
- Aignostics GmbH, 10555, Berlin, Germany.
| | - Johannes Eschrich
- Department of Hepatology and Gastroenterology, Charité Universitätsmedizin Berlin, 13353, Berlin, Germany.
- Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Charitéplatz 1, 10117, Berlin, Germany.
| | | | - Maximilian Alber
- Aignostics GmbH, 10555, Berlin, Germany
- Institute of Pathology, Charité Universitätsmedizin Berlin, 10117, Berlin, Germany
| | - Simon Schallenberg
- Institute of Pathology, Charité Universitätsmedizin Berlin, 10117, Berlin, Germany
| | - Adrien Guillot
- Department of Hepatology and Gastroenterology, Charité Universitätsmedizin Berlin, 13353, Berlin, Germany
| | - Christoph Roderburg
- Clinic for Gastroenterology, Hepatology and Infectious Diseases, University Hospital Düsseldorf, Medical Faculty of Heinrich Heine University Düsseldorf, Moorenstrasse 5, 40225, Düsseldorf, Germany
| | - Frank Tacke
- Department of Hepatology and Gastroenterology, Charité Universitätsmedizin Berlin, 13353, Berlin, Germany
| | - Frederick Klauschen
- BIFOLD - Berlin Institute for the Foundations of Learning and Data, Berlin, Germany.
- Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Charitéplatz 1, 10117, Berlin, Germany.
- Institute of Pathology, Charité Universitätsmedizin Berlin, 10117, Berlin, Germany.
- Institute of Pathology, Ludwig-Maximilians-Universität München, 80337, Munich, Germany.
- German Cancer Consortium, Munich Partner Site, German Cancer Research Center, 69210, Heidelberg, Germany.
| |
Collapse
|
3
|
Mostafa G, Mahmoud H, Abd El-Hafeez T, E ElAraby M. The power of deep learning in simplifying feature selection for hepatocellular carcinoma: a review. BMC Med Inform Decis Mak 2024; 24:287. [PMID: 39367397 PMCID: PMC11452940 DOI: 10.1186/s12911-024-02682-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2023] [Accepted: 09/13/2024] [Indexed: 10/06/2024] Open
Abstract
BACKGROUND Hepatocellular Carcinoma (HCC) is a highly aggressive, prevalent, and deadly type of liver cancer. With the advent of deep learning techniques, significant advancements have been made in simplifying and optimizing the feature selection process. OBJECTIVE Our scoping review presents an overview of the various deep learning models and algorithms utilized to address feature selection for HCC. The paper highlights the strengths and limitations of each approach, along with their potential applications in clinical practice. Additionally, it discusses the benefits of using deep learning to identify relevant features and their impact on the accuracy and efficiency of diagnosis, prognosis, and treatment of HCC. DESIGN The review encompasses a comprehensive analysis of the research conducted in the past few years, focusing on the methodologies, datasets, and evaluation metrics adopted by different studies. The paper aims to identify the key trends and advancements in the field, shedding light on the promising areas for future research and development. RESULTS The findings of this review indicate that deep learning techniques have shown promising results in simplifying feature selection for HCC. By leveraging large-scale datasets and advanced neural network architectures, these methods have demonstrated improved accuracy and robustness in identifying predictive features. CONCLUSIONS We analyze published studies to reveal the state-of-the-art HCC prediction and showcase how deep learning can boost accuracy and decrease false positives. But we also acknowledge the challenges that remain in translating this potential into clinical reality.
Collapse
Affiliation(s)
- Ghada Mostafa
- Computer Science Department, Faculty of Computers and Artificial Intelligence, Beni-Suef University, Beni-Suef, Egypt.
- Computer Science Unit, Deraya University, EL-Minia, Egypt.
| | - Hamdi Mahmoud
- Computer Science Department, Faculty of Computers and Artificial Intelligence, Beni-Suef National University, Beni-Suef, Egypt.
| | - Tarek Abd El-Hafeez
- Department of Computer Science, Faculty of Science, Minia University, EL-Minia, Egypt.
- Computer Science Unit, Deraya University, EL-Minia, Egypt.
| | - Mohamed E ElAraby
- Computer Science Department, Faculty of Computers and Artificial Intelligence, Beni-Suef University, Beni-Suef, Egypt.
| |
Collapse
|
4
|
Chatzipanagiotou OP, Loukas C, Vailas M, Machairas N, Kykalos S, Charalampopoulos G, Filippiadis D, Felekouras E, Schizas D. Artificial intelligence in hepatocellular carcinoma diagnosis: a comprehensive review of current literature. J Gastroenterol Hepatol 2024; 39:1994-2005. [PMID: 38923550 DOI: 10.1111/jgh.16663] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Revised: 04/26/2024] [Accepted: 06/07/2024] [Indexed: 06/28/2024]
Abstract
BACKGROUND AND AIM Hepatocellular carcinoma (HCC) diagnosis mainly relies on its pathognomonic radiological profile, obviating the need for biopsy. The project of incorporating artificial intelligence (AI) techniques in HCC aims to improve the performance of image recognition. Herein, we thoroughly analyze and evaluate proposed AI models in the field of HCC diagnosis. METHODS A comprehensive review of the literature was performed utilizing MEDLINE/PubMed and Web of Science databases with the end of search date being the 30th of September 2023. The MESH terms "Artificial Intelligence," "Liver Cancer," "Hepatocellular Carcinoma," "Machine Learning," and "Deep Learning" were searched in the title and/or abstract. All references of the obtained articles were also evaluated for any additional information. RESULTS Our search resulted in 183 studies meeting our inclusion criteria. Across all diagnostic modalities, reported area under the curve (AUC) of most developed models surpassed 0.900. A B-mode US and a contrast-enhanced US model achieved AUCs of 0.947 and 0.957, respectively. Regarding the more challenging task of HCC diagnosis, a 2021 deep learning model, trained with CT scans, classified hepatic malignant lesions with an AUC of 0.986. Finally, a MRI machine learning model developed in 2021 displayed an AUC of 0.975 when differentiating small HCCs from benign lesions, while another MRI-based model achieved HCC diagnosis with an AUC of 0.970. CONCLUSIONS AI tools may lead to significant improvement in diagnostic management of HCC. Many models fared better or comparable to experienced radiologists while proving capable of elevating radiologists' accuracy, demonstrating promising results for AI implementation in HCC-related diagnostic tasks.
Collapse
Affiliation(s)
- Odysseas P Chatzipanagiotou
- First Department of Surgery, National and Kapodistrian University of Athens, Laikon General Hospital, Athens, Greece
| | - Constantinos Loukas
- Laboratory of Medical Physics, Medical School, National and Kapodistrian University of Athens, Athens, Greece
| | - Michail Vailas
- First Department of Surgery, National and Kapodistrian University of Athens, Laikon General Hospital, Athens, Greece
| | - Nikolaos Machairas
- Second Department of Propaedeutic Surgery, National and Kapodistrian University of Athens, Laikon General Hospital, Athens, Greece
| | - Stylianos Kykalos
- Second Department of Propaedeutic Surgery, National and Kapodistrian University of Athens, Laikon General Hospital, Athens, Greece
| | - Georgios Charalampopoulos
- Second Department of Radiology, National and Kapodistrian University of Athens, Attikon University Hospital, Athens, Greece
| | - Dimitrios Filippiadis
- Second Department of Radiology, National and Kapodistrian University of Athens, Attikon University Hospital, Athens, Greece
| | - Evangellos Felekouras
- First Department of Surgery, National and Kapodistrian University of Athens, Laikon General Hospital, Athens, Greece
| | - Dimitrios Schizas
- First Department of Surgery, National and Kapodistrian University of Athens, Laikon General Hospital, Athens, Greece
| |
Collapse
|
5
|
Bhati D, Neha F, Amiruzzaman M. A Survey on Explainable Artificial Intelligence (XAI) Techniques for Visualizing Deep Learning Models in Medical Imaging. J Imaging 2024; 10:239. [PMID: 39452402 PMCID: PMC11508748 DOI: 10.3390/jimaging10100239] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2024] [Revised: 09/14/2024] [Accepted: 09/21/2024] [Indexed: 10/26/2024] Open
Abstract
The combination of medical imaging and deep learning has significantly improved diagnostic and prognostic capabilities in the healthcare domain. Nevertheless, the inherent complexity of deep learning models poses challenges in understanding their decision-making processes. Interpretability and visualization techniques have emerged as crucial tools to unravel the black-box nature of these models, providing insights into their inner workings and enhancing trust in their predictions. This survey paper comprehensively examines various interpretation and visualization techniques applied to deep learning models in medical imaging. The paper reviews methodologies, discusses their applications, and evaluates their effectiveness in enhancing the interpretability, reliability, and clinical relevance of deep learning models in medical image analysis.
Collapse
Affiliation(s)
- Deepshikha Bhati
- Department of Computer Science, Kent State University, Kent, OH 44242, USA;
| | - Fnu Neha
- Department of Computer Science, Kent State University, Kent, OH 44242, USA;
| | - Md Amiruzzaman
- Department of Computer Science, West Chester University, West Chester, PA 19383, USA;
| |
Collapse
|
6
|
Jehanzaib M, Almalioglu Y, Ozyoruk KB, Williamson DFK, Abdullah T, Basak K, Demir D, Keles GE, Zafar K, Turan M. A robust image segmentation and synthesis pipeline for histopathology. Med Image Anal 2024; 99:103344. [PMID: 39265361 DOI: 10.1016/j.media.2024.103344] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2023] [Revised: 03/10/2024] [Accepted: 09/03/2024] [Indexed: 09/14/2024]
Abstract
Significant diagnostic variability between and within observers persists in pathology, despite the fact that digital slide images provide the ability to measure and quantify features much more precisely compared to conventional methods. Automated and accurate segmentation of cancerous cell and tissue regions can streamline the diagnostic process, providing insights into the cancer progression, and helping experts decide on the most effective treatment. Here, we evaluate the performance of the proposed PathoSeg model, with an architecture comprising of a modified HRNet encoder and a UNet++ decoder integrated with a CBAM block to utilize attention mechanism for an improved segmentation capability. We demonstrate that PathoSeg outperforms the current state-of-the-art (SOTA) networks in both quantitative and qualitative assessment of instance and semantic segmentation. Notably, we leverage the use of synthetic data generated by PathopixGAN, which effectively addresses the data imbalance problem commonly encountered in histopathology datasets, further improving the performance of PathoSeg. It utilizes spatially adaptive normalization within a generative and discriminative mechanism to synthesize diverse histopathological environments dictated through semantic information passed through pixel-level annotated Ground Truth semantic masks.Besides, we contribute to the research community by providing an in-house dataset that includes semantically segmented masks for breast carcinoma tubules (BCT), micro/macrovesicular steatosis of the liver (MSL), and prostate carcinoma glands (PCG). In the first part of the dataset, we have a total of 14 whole slide images from 13 patients' liver, with fat cell segmented masks, totaling 951 masks of size 512 × 512 pixels. In the second part, it includes 17 whole slide images from 13 patients with prostate carcinoma gland segmentation masks, amounting to 30,000 masks of size 512 × 512 pixels. In the third part, the dataset contains 51 whole slides from 36 patients, with breast carcinoma tubule masks totaling 30,000 masks of size 512 × 512 pixels. To ensure transparency and encourage further research, we will make this dataset publicly available for non-commercial and academic purposes. To facilitate reproducibility and encourage further research, we will also make our code and pre-trained models publicly available at https://github.com/DeepMIALab/PathoSeg.
Collapse
Affiliation(s)
- Muhammad Jehanzaib
- Department of Computer Engineering, Bogazici University, Istanbul, Turkey; Department of Computer Science, FAST-NUCES, Lahore, Pakistan
| | - Yasin Almalioglu
- Computer Science Department, Oxford University, England, United Kingdom
| | | | - Drew F K Williamson
- Department of Pathology, Brigham and Women's Hospital, USA; Harvard Medical School, Boston, MA, USA
| | - Talha Abdullah
- Department of Computer Engineering, Bogazici University, Istanbul, Turkey; Department of Computer Science, FAST-NUCES, Lahore, Pakistan
| | - Kayhan Basak
- Saglık Bilimleri University, Kartal Dr.Lutfi Kırdar City Hospital, Department of Pathology, Istanbul, Turkey
| | - Derya Demir
- Faculty of Medicine, Department of Pathology, Ege University, Izmir, Turkey
| | | | - Kashif Zafar
- Department of Computer Science, FAST-NUCES, Lahore, Pakistan
| | - Mehmet Turan
- Department of Computer Engineering, Bogazici University, Istanbul, Turkey.
| |
Collapse
|
7
|
Chang Q, Zhou X, Mao H, Feng J, Wu X, Zhang Z, Hu Z. ALKBH5 promotes hepatocellular carcinoma cell proliferation, migration and invasion by regulating TTI1 expression. BIOMOLECULES & BIOMEDICINE 2024; 24:1216-1230. [PMID: 38501918 PMCID: PMC11379018 DOI: 10.17305/bb.2024.10247] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/07/2024] [Revised: 03/07/2024] [Accepted: 03/07/2024] [Indexed: 03/20/2024]
Abstract
The objective of this research was to investigate the potential mechanisms of AlkB homolog 5, RNA demethylase (ALKBH5) in hepatocellular carcinoma (HCC). We used The Cancer Genome Atlas (TCGA), Kruskal-Wallis method and Kaplan-Meier (KM) survival analysis to study the expression of ALKBH5 and its correlation with clinical factors in HCC. In vitro experiments verified the expression of ALKBH5 and its effect on HCC cell phenotype. We screened differentially expressed genes (DEGs) from HCC patients associated with ALKBH5. Through this screening we identified the downstream gene TTI1 which is associated with ALKBH5 and investigated its function using Gene Expression Profiling Interaction Analysis (GEPIA) along with univariate Cox proportional hazards regression analysis. Finally, we analyzed the functions of ALKBH5 and TTI1 in HCC cells. Across numerous pan-cancer types, we observed significant overexpression of ALKBH5. In vitro experiments confirmed ALKBH5 as an oncogene in HCC, with its knockdown leading to suppressed cell proliferation, migration, and invasion. Bioinformatics analyses also demonstrated a significant positive correlation between ALKBH5 and TTI1. TTI1, highly expressed in cells, showed promising prognostic ability for patients. Further experiments confirmed that suppressing TTI1 impeded cell growth and movement, with this effect partially offset by increased ALKBH5 expression. Conversely, promoting these cellular processes was observed with TTI1 overexpression, but was dampened by decreased ALKBH5 expression. In conclusion, our findings suggest that ALKBH5 may influence proliferation, migration and invasion of HCC by modulating TTI1 expression, providing a new direction for treating HCC.
Collapse
Affiliation(s)
- Qimeng Chang
- Department of Hepatobiliary-Pancreatic Surgery, Minhang Hospital, Fudan University, Shanghai, China
- Institute of Fudan-Minhang Academic Health System, Minhang Hospital, Fudan University, Shanghai, China
| | - Xiang Zhou
- Department of Hepatobiliary-Pancreatic Surgery, Minhang Hospital, Fudan University, Shanghai, China
- Institute of Fudan-Minhang Academic Health System, Minhang Hospital, Fudan University, Shanghai, China
| | - Huarong Mao
- Department of Hepatobiliary-Pancreatic Surgery, Minhang Hospital, Fudan University, Shanghai, China
- Institute of Fudan-Minhang Academic Health System, Minhang Hospital, Fudan University, Shanghai, China
| | - Jinfeng Feng
- Department of Hepatobiliary-Pancreatic Surgery, Minhang Hospital, Fudan University, Shanghai, China
- Institute of Fudan-Minhang Academic Health System, Minhang Hospital, Fudan University, Shanghai, China
| | - Xubo Wu
- Department of Hepatobiliary-Pancreatic Surgery, Minhang Hospital, Fudan University, Shanghai, China
- Institute of Fudan-Minhang Academic Health System, Minhang Hospital, Fudan University, Shanghai, China
| | - Ziping Zhang
- Department of Hepatobiliary-Pancreatic Surgery, Minhang Hospital, Fudan University, Shanghai, China
- Institute of Fudan-Minhang Academic Health System, Minhang Hospital, Fudan University, Shanghai, China
| | - Zhiqiu Hu
- Department of Hepatobiliary-Pancreatic Surgery, Minhang Hospital, Fudan University, Shanghai, China
- Institute of Fudan-Minhang Academic Health System, Minhang Hospital, Fudan University, Shanghai, China
| |
Collapse
|
8
|
Hager P, Jungmann F, Holland R, Bhagat K, Hubrecht I, Knauer M, Vielhauer J, Makowski M, Braren R, Kaissis G, Rueckert D. Evaluation and mitigation of the limitations of large language models in clinical decision-making. Nat Med 2024; 30:2613-2622. [PMID: 38965432 PMCID: PMC11405275 DOI: 10.1038/s41591-024-03097-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2024] [Accepted: 05/29/2024] [Indexed: 07/06/2024]
Abstract
Clinical decision-making is one of the most impactful parts of a physician's responsibilities and stands to benefit greatly from artificial intelligence solutions and large language models (LLMs) in particular. However, while LLMs have achieved excellent performance on medical licensing exams, these tests fail to assess many skills necessary for deployment in a realistic clinical decision-making environment, including gathering information, adhering to guidelines, and integrating into clinical workflows. Here we have created a curated dataset based on the Medical Information Mart for Intensive Care database spanning 2,400 real patient cases and four common abdominal pathologies as well as a framework to simulate a realistic clinical setting. We show that current state-of-the-art LLMs do not accurately diagnose patients across all pathologies (performing significantly worse than physicians), follow neither diagnostic nor treatment guidelines, and cannot interpret laboratory results, thus posing a serious risk to the health of patients. Furthermore, we move beyond diagnostic accuracy and demonstrate that they cannot be easily integrated into existing workflows because they often fail to follow instructions and are sensitive to both the quantity and order of information. Overall, our analysis reveals that LLMs are currently not ready for autonomous clinical decision-making while providing a dataset and framework to guide future studies.
Collapse
Affiliation(s)
- Paul Hager
- Institute for AI and Informatics, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany.
- Institute for Diagnostic and Interventional Radiology, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany.
| | - Friederike Jungmann
- Institute for AI and Informatics, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany
- Institute for Diagnostic and Interventional Radiology, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany
| | | | - Kunal Bhagat
- Department of Medicine, ChristianaCare Health System, Wilmington, DE, USA
| | - Inga Hubrecht
- Department of Medicine III, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany
| | - Manuel Knauer
- Department of Medicine III, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany
| | - Jakob Vielhauer
- Department of Medicine II, University Hospital of the Ludwig Maximilian University of Munich, Munich, Germany
| | - Marcus Makowski
- Institute for Diagnostic and Interventional Radiology, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany
| | - Rickmer Braren
- Institute for Diagnostic and Interventional Radiology, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany
| | - Georgios Kaissis
- Institute for AI and Informatics, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany
- Institute for Diagnostic and Interventional Radiology, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany
- Department of Computing, Imperial College, London, UK
- Reliable AI Group, Institute for Machine Learning in Biomedical Imaging, Helmholtz Munich, Munich, Germany
| | - Daniel Rueckert
- Institute for AI and Informatics, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany
- Department of Computing, Imperial College, London, UK
| |
Collapse
|
9
|
Salloch S, Eriksen A. What Are Humans Doing in the Loop? Co-Reasoning and Practical Judgment When Using Machine Learning-Driven Decision Aids. THE AMERICAN JOURNAL OF BIOETHICS : AJOB 2024; 24:67-78. [PMID: 38767971 DOI: 10.1080/15265161.2024.2353800] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2024]
Abstract
Within the ethical debate on Machine Learning-driven decision support systems (ML_CDSS), notions such as "human in the loop" or "meaningful human control" are often cited as being necessary for ethical legitimacy. In addition, ethical principles usually serve as the major point of reference in ethical guidance documents, stating that conflicts between principles need to be weighed and balanced against each other. Starting from a neo-Kantian viewpoint inspired by Onora O'Neill, this article makes a concrete suggestion of how to interpret the role of the "human in the loop" and to overcome the perspective of rivaling ethical principles in the evaluation of AI in health care. We argue that patients should be perceived as "fellow workers" and epistemic partners in the interpretation of ML_CDSS outputs. We further highlight that a meaningful process of integrating (rather than weighing and balancing) ethical principles is most appropriate in the evaluation of medical AI.
Collapse
|
10
|
Laurent-Bellue A, Sadraoui A, Claude L, Calderaro J, Posseme K, Vibert E, Cherqui D, Rosmorduc O, Lewin M, Pesquet JC, Guettier C. Deep Learning Classification and Quantification of Pejorative and Nonpejorative Architectures in Resected Hepatocellular Carcinoma from Digital Histopathologic Images. THE AMERICAN JOURNAL OF PATHOLOGY 2024; 194:1684-1700. [PMID: 38879083 DOI: 10.1016/j.ajpath.2024.05.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Revised: 04/17/2024] [Accepted: 05/16/2024] [Indexed: 06/27/2024]
Abstract
Liver resection is one of the best treatments for small hepatocellular carcinoma (HCC), but post-resection recurrence is frequent. Biotherapies have emerged as an efficient adjuvant treatment, making the identification of patients at high risk of recurrence critical. Microvascular invasion (mVI), poor differentiation, pejorative macrotrabecular architectures, and vessels encapsulating tumor clusters architectures are the most accurate histologic predictors of recurrence, but their evaluation is time-consuming and imperfect. Herein, a supervised deep learning-based approach with ResNet34 on 680 whole slide images (WSIs) from 107 liver resection specimens was used to build an algorithm for the identification and quantification of these pejorative architectures. This model achieved an accuracy of 0.864 at patch level and 0.823 at WSI level. To assess its robustness, it was validated on an external cohort of 29 HCCs from another hospital, with an accuracy of 0.787 at WSI level, affirming its generalization capabilities. Moreover, the largest connected areas of the pejorative architectures extracted from the model were positively correlated to the presence of mVI and the number of tumor emboli. These results suggest that the identification of pejorative architectures could be an efficient surrogate of mVI and have a strong predictive value for the risk of recurrence. This study is the first step in the construction of a composite predictive algorithm for early post-resection recurrence of HCC, including artificial intelligence-based features.
Collapse
Affiliation(s)
- Astrid Laurent-Bellue
- Department of Pathology, Bicêtre Hospital, Assistance Publique-Hôpitaux de Paris, Le Kremlin-Bicêtre, France
| | - Aymen Sadraoui
- Centre de Vision Numérique, Paris-Saclay University, Inria, CentraleSupélec, Gif-sur-Yvette, France
| | - Laura Claude
- Department of Pathology, Charles Nicolle Hospital, Rouen, France
| | - Julien Calderaro
- Department of Pathology, Henri-Mondor Hospital, Assistance Publique-Hôpitaux de Paris, Créteil, France
| | - Katia Posseme
- Department of Pathology, Bicêtre Hospital, Assistance Publique-Hôpitaux de Paris, Le Kremlin-Bicêtre, France
| | - Eric Vibert
- Centre Hépato-Biliaire, Paul-Brousse Hospital, Assistance Publique-Hôpitaux de Paris, Villejuif, France; Faculté de Médecine, Paris-Saclay University, Le Kremlin-Bicêtre, France; Unité Mixte de Recherche 1193, Paris-Saclay University, INSERM, Villejuif, France
| | - Daniel Cherqui
- Centre Hépato-Biliaire, Paul-Brousse Hospital, Assistance Publique-Hôpitaux de Paris, Villejuif, France; Faculté de Médecine, Paris-Saclay University, Le Kremlin-Bicêtre, France; Unité Mixte de Recherche 1193, Paris-Saclay University, INSERM, Villejuif, France
| | - Olivier Rosmorduc
- Centre Hépato-Biliaire, Paul-Brousse Hospital, Assistance Publique-Hôpitaux de Paris, Villejuif, France; Faculté de Médecine, Paris-Saclay University, Le Kremlin-Bicêtre, France; Unité Mixte de Recherche 1193, Paris-Saclay University, INSERM, Villejuif, France
| | - Maïté Lewin
- Centre Hépato-Biliaire, Paul-Brousse Hospital, Assistance Publique-Hôpitaux de Paris, Villejuif, France; Faculté de Médecine, Paris-Saclay University, Le Kremlin-Bicêtre, France; Unité Mixte de Recherche 1193, Paris-Saclay University, INSERM, Villejuif, France
| | - Jean-Christophe Pesquet
- Centre de Vision Numérique, Paris-Saclay University, Inria, CentraleSupélec, Gif-sur-Yvette, France
| | - Catherine Guettier
- Department of Pathology, Bicêtre Hospital, Assistance Publique-Hôpitaux de Paris, Le Kremlin-Bicêtre, France.
| |
Collapse
|
11
|
He T, Shi S, Liu Y, Zhu L, Wei Y, Zhang F, Shi H, He Y, Han A. Pathology diagnosis of intraoperative frozen thyroid lesions assisted by deep learning. BMC Cancer 2024; 24:1069. [PMID: 39210289 PMCID: PMC11363383 DOI: 10.1186/s12885-024-12849-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2024] [Accepted: 08/26/2024] [Indexed: 09/04/2024] Open
Abstract
BACKGROUND Thyroid cancer is a common thyroid malignancy. The majority of thyroid lesion needs intraoperative frozen pathology diagnosis, which provides important information for precision operation. As digital whole slide images (WSIs) develop, deep learning methods for histopathological classification of the thyroid gland (paraffin sections) have achieved outstanding results. Our current study is to clarify whether deep learning assists pathology diagnosis for intraoperative frozen thyroid lesions or not. METHODS We propose an artificial intelligence-assisted diagnostic system for frozen thyroid lesions that applies prior knowledge in tandem with a dichotomous judgment of whether the lesion is cancerous or not and a quadratic judgment of the type of cancerous lesion to categorize the frozen thyroid lesions into five categories: papillary thyroid carcinoma, medullary thyroid carcinoma, anaplastic thyroid carcinoma, follicular thyroid tumor, and non-cancerous lesion. We obtained 4409 frozen digital pathology sections (WSI) of thyroid from the First Affiliated Hospital of Sun Yat-sen University (SYSUFH) to train and test the model, and the performance was validated by a six-fold cross validation, 101 papillary microcarcinoma sections of thyroid were used to validate the system's sensitivity, and 1388 WSIs of thyroid were used for the evaluation of the external dataset. The deep learning models were compared in terms of several metrics such as accuracy, F1 score, recall, precision and AUC (Area Under Curve). RESULTS We developed the first deep learning-based frozen thyroid diagnostic classifier for histopathological WSI classification of papillary carcinoma, medullary carcinoma, follicular tumor, anaplastic carcinoma, and non-carcinoma lesion. On test slides, the system had an accuracy of 0.9459, a precision of 0.9475, and an AUC of 0.9955. In the papillary carcinoma test slides, the system was able to accurately predict even lesions as small as 2 mm in diameter. Tested with the acceleration component, the cut processing can be performed in 346.12 s and the visual inference prediction results can be obtained in 98.61 s, thus meeting the time requirements for intraoperative diagnosis. Our study employs a deep learning approach for high-precision classification of intraoperative frozen thyroid lesion distribution in the clinical setting, which has potential clinical implications for assisting pathologists and precision surgery of thyroid lesions.
Collapse
MESH Headings
- Humans
- Deep Learning
- Thyroid Neoplasms/pathology
- Thyroid Neoplasms/diagnosis
- Thyroid Neoplasms/surgery
- Frozen Sections
- Thyroid Cancer, Papillary/pathology
- Thyroid Cancer, Papillary/diagnosis
- Thyroid Cancer, Papillary/surgery
- Carcinoma, Papillary/pathology
- Carcinoma, Papillary/surgery
- Carcinoma, Papillary/diagnosis
- Adenocarcinoma, Follicular/pathology
- Adenocarcinoma, Follicular/diagnosis
- Adenocarcinoma, Follicular/surgery
- Thyroid Gland/pathology
- Thyroid Gland/surgery
- Carcinoma, Neuroendocrine/pathology
- Carcinoma, Neuroendocrine/diagnosis
- Carcinoma, Neuroendocrine/surgery
- Female
- Male
- Middle Aged
- Adult
- Intraoperative Period
- Thyroid Carcinoma, Anaplastic/pathology
- Thyroid Carcinoma, Anaplastic/diagnosis
- Thyroid Carcinoma, Anaplastic/surgery
Collapse
Affiliation(s)
- Tingting He
- Institute of Biopharmaceutical and Health Engineering, Tsinghua Shenzhen International Graduate School, Shenzhen, Guangdong, China
| | - Shanshan Shi
- Institute of Biopharmaceutical and Health Engineering, Tsinghua Shenzhen International Graduate School, Shenzhen, Guangdong, China
| | - Yiqing Liu
- Institute of Biopharmaceutical and Health Engineering, Tsinghua Shenzhen International Graduate School, Shenzhen, Guangdong, China
| | - Lianghui Zhu
- Institute of Biopharmaceutical and Health Engineering, Tsinghua Shenzhen International Graduate School, Shenzhen, Guangdong, China
| | - Yani Wei
- Department of Pathology, the First Affiliated Hospital of Sun Yat-sen University, Guangzhou, Guangdong, China
| | - Fenfen Zhang
- Department of Pathology, the First Affiliated Hospital of Sun Yat-sen University, Guangzhou, Guangdong, China
| | - Huijuan Shi
- Department of Pathology, the First Affiliated Hospital of Sun Yat-sen University, Guangzhou, Guangdong, China.
| | - Yonghong He
- Institute of Biopharmaceutical and Health Engineering, Tsinghua Shenzhen International Graduate School, Shenzhen, Guangdong, China.
| | - Anjia Han
- Department of Pathology, the First Affiliated Hospital of Sun Yat-sen University, Guangzhou, Guangdong, China.
| |
Collapse
|
12
|
Calderaro J, Žigutytė L, Truhn D, Jaffe A, Kather JN. Artificial intelligence in liver cancer - new tools for research and patient management. Nat Rev Gastroenterol Hepatol 2024; 21:585-599. [PMID: 38627537 DOI: 10.1038/s41575-024-00919-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 03/11/2024] [Indexed: 07/31/2024]
Abstract
Liver cancer has high incidence and mortality globally. Artificial intelligence (AI) has advanced rapidly, influencing cancer care. AI systems are already approved for clinical use in some tumour types (for example, colorectal cancer screening). Crucially, research demonstrates that AI can analyse histopathology, radiology and natural language in liver cancer, and can replace manual tasks and access hidden information in routinely available clinical data. However, for liver cancer, few of these applications have translated into large-scale clinical trials or clinically approved products. Here, we advocate for the incorporation of AI in all stages of liver cancer management. We present a taxonomy of AI approaches in liver cancer, highlighting areas with academic and commercial potential, and outline a policy for AI-based liver cancer management, including interdisciplinary training of researchers, clinicians and patients. The potential of AI in liver cancer is immense, but effort is required to ensure that AI can fulfil expectations.
Collapse
Affiliation(s)
- Julien Calderaro
- Département de Pathologie, Assistance Publique Hôpitaux de Paris, Groupe Hospitalier Henri Mondor, Créteil, France
- Institut Mondor de Recherche Biomédicale, MINT-HEP Mondor Integrative Hepatology, Université Paris Est Créteil, Créteil, France
| | - Laura Žigutytė
- Else Kroener Fresenius Center for Digital Health (EKFZ), Medical Faculty Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany
| | - Daniel Truhn
- Department of Diagnostic and Interventional Radiology, University Hospital RWTH Aachen, Aachen, Germany
| | - Ariel Jaffe
- Mayo Clinic, Rochester, MN, USA
- Department of Internal Medicine, Section of Digestive Diseases, Yale School of Medicine, New Haven, CT, USA
| | - Jakob Nikolas Kather
- Else Kroener Fresenius Center for Digital Health (EKFZ), Medical Faculty Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany.
- Department of Medicine I, University Hospital Dresden, Dresden, Germany.
- Medical Oncology, National Center for Tumour Diseases (NCT), University Hospital Heidelberg, Heidelberg, Germany.
| |
Collapse
|
13
|
Laohawetwanit T, Apornvirat S, Namboonlue C. Thinking like a pathologist: Morphologic approach to hepatobiliary tumors by ChatGPT. Am J Clin Pathol 2024:aqae087. [PMID: 39030695 DOI: 10.1093/ajcp/aqae087] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2024] [Accepted: 06/22/2024] [Indexed: 07/21/2024] Open
Abstract
OBJECTIVES This research aimed to evaluate the effectiveness of ChatGPT in accurately diagnosing hepatobiliary tumors using histopathologic images. METHODS The study compared the diagnostic accuracies of the GPT-4 model, providing the same set of images and 2 different input prompts. The first prompt, the morphologic approach, was designed to mimic pathologists' approach to analyzing tissue morphology. In contrast, the second prompt functioned without incorporating this morphologic analysis feature. Diagnostic accuracy and consistency were analyzed. RESULTS A total of 120 photomicrographs, composed of 60 images of each hepatobiliary tumor and nonneoplastic liver tissue, were used. The findings revealed that the morphologic approach significantly enhanced the diagnostic accuracy and consistency of the artificial intelligence (AI). This version was particularly more accurate in identifying hepatocellular carcinoma (mean accuracy: 62.0% vs 27.3%), bile duct adenoma (10.7% vs 3.3%), and cholangiocarcinoma (68.7% vs 16.0%), as well as in distinguishing nonneoplastic liver tissues (77.3% vs 37.5%) (Ps ≤ .01). It also demonstrated higher diagnostic consistency than the other model without a morphologic analysis (κ: 0.46 vs 0.27). CONCLUSIONS This research emphasizes the importance of incorporating pathologists' diagnostic approaches into AI to enhance accuracy and consistency in medical diagnostics. It mainly showcases the AI's histopathologic promise when replicating expert diagnostic processes.
Collapse
Affiliation(s)
- Thiyaphat Laohawetwanit
- Division of Pathology, Chulabhorn International College of Medicine, Thammasat University, Pathum Thani, Thailand
- Division of Pathology, Thammasat University Hospital, Pathum Thani, Thailand
| | - Sompon Apornvirat
- Division of Pathology, Chulabhorn International College of Medicine, Thammasat University, Pathum Thani, Thailand
- Division of Pathology, Thammasat University Hospital, Pathum Thani, Thailand
| | | |
Collapse
|
14
|
González-Pérez Y, Montero Delgado A, Martinez Sesmero JM. [Translated article] Introducing artificial intelligence to hospital pharmacy departments. FARMACIA HOSPITALARIA 2024; 48 Suppl 1:TS35-TS44. [PMID: 39097375 DOI: 10.1016/j.farma.2024.04.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 01/03/2024] [Accepted: 02/14/2024] [Indexed: 08/05/2024] Open
Abstract
Artificial intelligence is a broad concept that includes the study of the ability of computers to perform tasks that would normally require the intervention of human intelligence. By exploiting large volumes of healthcare data, Artificial intelligence algorithms can identify patterns and predict outcomes, which can help healthcare organizations and their professionals make better decisions and achieve better results. Machine learning, deep learning, neural networks, or natural language processing are among the most important methods, allowing systems to learn and improve from data without the need for explicit programming. Artificial intelligence has been introduced in biomedicine, accelerating processes, improving accuracy and efficiency, and improving patient care. By using Artificial intelligence algorithms and machine learning, hospital pharmacists can analyze a large volume of patient data, including medical records, laboratory results, and medication profiles, aiding them in identifying potential drug-drug interactions, assessing the safety and efficacy of medicines, and making informed recommendations. Artificial intelligence integration will improve the quality of pharmaceutical care, optimize processes, promote research, deploy open innovation, and facilitate education. Hospital pharmacists who master Artificial intelligence will play a crucial role in this transformation.
Collapse
Affiliation(s)
- Yared González-Pérez
- Servicio de Farmacia, Hospital Universitario de Canarias, San Cristóbal de La Laguna, Spain.
| | - Alfredo Montero Delgado
- Servicio de Farmacia, Hospital Nuestra Señora de la Candelaria, Santa Cruz de Tenerife, Spain
| | | |
Collapse
|
15
|
González-Pérez Y, Montero Delgado A, Martinez Sesmero JM. Approaching artificial intelligence to Hospital Pharmacy. FARMACIA HOSPITALARIA 2024; 48 Suppl 1:S35-S44. [PMID: 39097366 DOI: 10.1016/j.farma.2024.02.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 01/03/2024] [Accepted: 02/14/2024] [Indexed: 08/05/2024] Open
Abstract
Artificial intelligence (AI) is a broad concept that includes the study of the ability of computers to perform tasks that would normally require the intervention of human intelligence. By exploiting large volumes of healthcare data, artificial intelligence algorithms can identify patterns and predict outcomes, which can help healthcare organizations and their professionals make better decisions and achieve better results. Machine learning, deep learning, neural networks or natural language processing are among the most important methods, allowing systems to learn and improve from data without the need for explicit programming. AI has been introduced in biomedicine, accelerating processes, improving safety and efficiency, and improving patient care. By using AI algorithms and Machine Learning, hospital pharmacists can analyze a large volume of patient data, including medical records, laboratory results, and medication profiles, aiding them in identifying potential drug-drug interactions, assessing the safety and efficacy of medicines, and making informed recommendations. AI integration will improve the quality of pharmaceutical care, optimize processes, promote research, deploy open innovation, and facilitate education. Hospital pharmacists who master AI will play a crucial role in this transformation.
Collapse
Affiliation(s)
- Yared González-Pérez
- Servicio de Farmacia, Hospital Universitario de Canarias, San Cristóbal de La Laguna, España.
| | - Alfredo Montero Delgado
- Servicio de Farmacia, Hospital Nuestra Señora de la Candelaria, Santa Cruz de Tenerife, España
| | | |
Collapse
|
16
|
Huang Z, Yang E, Shen J, Gratzinger D, Eyerer F, Liang B, Nirschl J, Bingham D, Dussaq AM, Kunder C, Rojansky R, Gilbert A, Chang-Graham AL, Howitt BE, Liu Y, Ryan EE, Tenney TB, Zhang X, Folkins A, Fox EJ, Montine KS, Montine TJ, Zou J. A pathologist-AI collaboration framework for enhancing diagnostic accuracies and efficiencies. Nat Biomed Eng 2024:10.1038/s41551-024-01223-5. [PMID: 38898173 DOI: 10.1038/s41551-024-01223-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Accepted: 05/03/2024] [Indexed: 06/21/2024]
Abstract
In pathology, the deployment of artificial intelligence (AI) in clinical settings is constrained by limitations in data collection and in model transparency and interpretability. Here we describe a digital pathology framework, nuclei.io, that incorporates active learning and human-in-the-loop real-time feedback for the rapid creation of diverse datasets and models. We validate the effectiveness of the framework via two crossover user studies that leveraged collaboration between the AI and the pathologist, including the identification of plasma cells in endometrial biopsies and the detection of colorectal cancer metastasis in lymph nodes. In both studies, nuclei.io yielded considerable diagnostic performance improvements. Collaboration between clinicians and AI will aid digital pathology by enhancing accuracies and efficiencies.
Collapse
Affiliation(s)
- Zhi Huang
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA
- Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, CA, USA
| | - Eric Yang
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA
| | - Jeanne Shen
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA
| | - Dita Gratzinger
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA
| | - Frederick Eyerer
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA
| | - Brooke Liang
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA
| | - Jeffrey Nirschl
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA
| | - David Bingham
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA
| | - Alex M Dussaq
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA
| | - Christian Kunder
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA
| | - Rebecca Rojansky
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA
| | - Aubre Gilbert
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA
| | | | - Brooke E Howitt
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA
| | - Ying Liu
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA
| | - Emily E Ryan
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA
| | - Troy B Tenney
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA
| | - Xiaoming Zhang
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA
| | - Ann Folkins
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA
| | - Edward J Fox
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA
| | - Kathleen S Montine
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA
| | - Thomas J Montine
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA.
| | - James Zou
- Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, CA, USA.
| |
Collapse
|
17
|
Wang J, Jin Y, Jiang A, Chen W, Shan G, Gu Y, Ming Y, Li J, Yue C, Huang Z, Librach C, Lin G, Wang X, Zhao H, Sun Y, Zhang Z. Testing the generalizability and effectiveness of deep learning models among clinics: sperm detection as a pilot study. Reprod Biol Endocrinol 2024; 22:59. [PMID: 38778327 PMCID: PMC11110326 DOI: 10.1186/s12958-024-01232-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/03/2024] [Accepted: 05/14/2024] [Indexed: 05/25/2024] Open
Abstract
BACKGROUND Deep learning has been increasingly investigated for assisting clinical in vitro fertilization (IVF). The first technical step in many tasks is to visually detect and locate sperm, oocytes, and embryos in images. For clinical deployment of such deep learning models, different clinics use different image acquisition hardware and different sample preprocessing protocols, raising the concern over whether the reported accuracy of a deep learning model by one clinic could be reproduced in another clinic. Here we aim to investigate the effect of each imaging factor on the generalizability of object detection models, using sperm analysis as a pilot example. METHODS Ablation studies were performed using state-of-the-art models for detecting human sperm to quantitatively assess how model precision (false-positive detection) and recall (missed detection) were affected by imaging magnification, imaging mode, and sample preprocessing protocols. The results led to the hypothesis that the richness of image acquisition conditions in a training dataset deterministically affects model generalizability. The hypothesis was tested by first enriching the training dataset with a wide range of imaging conditions, then validated through internal blind tests on new samples and external multi-center clinical validations. RESULTS Ablation experiments revealed that removing subsets of data from the training dataset significantly reduced model precision. Removing raw sample images from the training dataset caused the largest drop in model precision, whereas removing 20x images caused the largest drop in model recall. by incorporating different imaging and sample preprocessing conditions into a rich training dataset, the model achieved an intraclass correlation coefficient (ICC) of 0.97 (95% CI: 0.94-0.99) for precision, and an ICC of 0.97 (95% CI: 0.93-0.99) for recall. Multi-center clinical validation showed no significant differences in model precision or recall across different clinics and applications. CONCLUSIONS The results validated the hypothesis that the richness of data in the training dataset is a key factor impacting model generalizability. These findings highlight the importance of diversity in a training dataset for model evaluation and suggest that future deep learning models in andrology and reproductive medicine should incorporate comprehensive feature sets for enhanced generalizability across clinics.
Collapse
Affiliation(s)
- Jiaqi Wang
- School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen, China
| | - Yufei Jin
- School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen, China
| | - Aojun Jiang
- Department of Mechanical Engineering, University of Toronto, Toronto, Canada
| | - Wenyuan Chen
- Department of Mechanical Engineering, University of Toronto, Toronto, Canada
| | - Guanqiao Shan
- Department of Mechanical Engineering, University of Toronto, Toronto, Canada
| | - Yifan Gu
- Institute of Reproductive and Stem Cell Engineering, School of Basic Medical Science, Central South University, Changsha, China
- Reproductive & Genetic Hospital of Citic-Xiangya, Changsha, China
| | - Yue Ming
- School of Medicine, The Chinese University of Hong Kong, Shenzhen, China
| | - Jichang Li
- School of Medicine, The Chinese University of Hong Kong, Shenzhen, China
| | - Chunfeng Yue
- Suzhou Boundless Medical Technology Ltd., Co., Suzhou, China
| | - Zongjie Huang
- Suzhou Boundless Medical Technology Ltd., Co., Suzhou, China
| | | | - Ge Lin
- Institute of Reproductive and Stem Cell Engineering, School of Basic Medical Science, Central South University, Changsha, China
- Reproductive & Genetic Hospital of Citic-Xiangya, Changsha, China
| | - Xibu Wang
- The 3rd Affiliated Hospital of Shenzhen University, Shenzhen, China
| | - Huan Zhao
- The 3rd Affiliated Hospital of Shenzhen University, Shenzhen, China.
| | - Yu Sun
- Department of Mechanical Engineering, University of Toronto, Toronto, Canada.
- Department of Computer Science, University of Toronto, Toronto, Canada.
- Institute of Biomedical Engineering, University of Toronto, Toronto, Canada.
- Department of Electrical and Computer Engineering, University of Toronto, Toronto, Canada.
| | - Zhuoran Zhang
- School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen, China.
| |
Collapse
|
18
|
McGenity C, Clarke EL, Jennings C, Matthews G, Cartlidge C, Freduah-Agyemang H, Stocken DD, Treanor D. Artificial intelligence in digital pathology: a systematic review and meta-analysis of diagnostic test accuracy. NPJ Digit Med 2024; 7:114. [PMID: 38704465 PMCID: PMC11069583 DOI: 10.1038/s41746-024-01106-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2023] [Accepted: 04/12/2024] [Indexed: 05/06/2024] Open
Abstract
Ensuring diagnostic performance of artificial intelligence (AI) before introduction into clinical practice is essential. Growing numbers of studies using AI for digital pathology have been reported over recent years. The aim of this work is to examine the diagnostic accuracy of AI in digital pathology images for any disease. This systematic review and meta-analysis included diagnostic accuracy studies using any type of AI applied to whole slide images (WSIs) for any disease. The reference standard was diagnosis by histopathological assessment and/or immunohistochemistry. Searches were conducted in PubMed, EMBASE and CENTRAL in June 2022. Risk of bias and concerns of applicability were assessed using the QUADAS-2 tool. Data extraction was conducted by two investigators and meta-analysis was performed using a bivariate random effects model, with additional subgroup analyses also performed. Of 2976 identified studies, 100 were included in the review and 48 in the meta-analysis. Studies were from a range of countries, including over 152,000 whole slide images (WSIs), representing many diseases. These studies reported a mean sensitivity of 96.3% (CI 94.1-97.7) and mean specificity of 93.3% (CI 90.5-95.4). There was heterogeneity in study design and 99% of studies identified for inclusion had at least one area at high or unclear risk of bias or applicability concerns. Details on selection of cases, division of model development and validation data and raw performance data were frequently ambiguous or missing. AI is reported as having high diagnostic accuracy in the reported areas but requires more rigorous evaluation of its performance.
Collapse
Affiliation(s)
- Clare McGenity
- University of Leeds, Leeds, UK.
- Leeds Teaching Hospitals NHS Trust, Leeds, UK.
| | - Emily L Clarke
- University of Leeds, Leeds, UK
- Leeds Teaching Hospitals NHS Trust, Leeds, UK
| | - Charlotte Jennings
- University of Leeds, Leeds, UK
- Leeds Teaching Hospitals NHS Trust, Leeds, UK
| | | | | | | | | | - Darren Treanor
- University of Leeds, Leeds, UK
- Leeds Teaching Hospitals NHS Trust, Leeds, UK
- Department of Clinical Pathology and Department of Clinical and Experimental Medicine, Linköping University, Linköping, Sweden
- Centre for Medical Image Science and Visualization (CMIV), Linköping University, Linköping, Sweden
| |
Collapse
|
19
|
Varghese C, Harrison EM, O'Grady G, Topol EJ. Artificial intelligence in surgery. Nat Med 2024; 30:1257-1268. [PMID: 38740998 DOI: 10.1038/s41591-024-02970-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Accepted: 04/03/2024] [Indexed: 05/16/2024]
Abstract
Artificial intelligence (AI) is rapidly emerging in healthcare, yet applications in surgery remain relatively nascent. Here we review the integration of AI in the field of surgery, centering our discussion on multifaceted improvements in surgical care in the preoperative, intraoperative and postoperative space. The emergence of foundation model architectures, wearable technologies and improving surgical data infrastructures is enabling rapid advances in AI interventions and utility. We discuss how maturing AI methods hold the potential to improve patient outcomes, facilitate surgical education and optimize surgical care. We review the current applications of deep learning approaches and outline a vision for future advances through multimodal foundation models.
Collapse
Affiliation(s)
- Chris Varghese
- Department of Surgery, University of Auckland, Auckland, New Zealand
| | - Ewen M Harrison
- Centre for Medical Informatics, Usher Institute, University of Edinburgh, Edinburgh, UK
| | - Greg O'Grady
- Department of Surgery, University of Auckland, Auckland, New Zealand
- Auckland Bioengineering Institute, University of Auckland, Auckland, New Zealand
| | - Eric J Topol
- Scripps Research Translational Institute, La Jolla, CA, USA.
| |
Collapse
|
20
|
Li Y, Liu H, Lv Q, Long J. Diagnosis model of early Pneumocystis jirovecii pneumonia based on convolutional neural network: a comparison with traditional PCR diagnostic method. BMC Pulm Med 2024; 24:205. [PMID: 38664747 PMCID: PMC11046959 DOI: 10.1186/s12890-024-02987-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Accepted: 03/29/2024] [Indexed: 04/28/2024] Open
Abstract
BACKGROUND Pneumocystis jirovecii pneumonia (PJP) is an interstitial pneumonia caused by pneumocystis jirovecii (PJ). The diagnosis of PJP primarily relies on the detection of the pathogen from lower respiratory tract specimens. However, it faces challenges such as difficulty in obtaining specimens and low detection rates. In the clinical diagnosis process, it is necessary to combine clinical symptoms, serological test results, chest Computed tomography (CT) images, molecular biology techniques, and metagenomics next-generation sequencing (mNGS) for comprehensive analysis. PURPOSE This study aims to overcome the limitations of traditional PJP diagnosis methods and develop a non-invasive, efficient, and accurate diagnostic approach for PJP. By using this method, patients can receive early diagnosis and treatment, effectively improving their prognosis. METHODS We constructed an intelligent diagnostic model for PJP based on the different Convolutional Neural Networks. Firstly, we used the Convolutional Neural Network to extract CT image features from patients. Then, we fused the CT image features with clinical information features using a feature fusion function. Finally, the fused features were input into the classification network to obtain the patient's diagnosis result. RESULTS In this study, for the diagnosis of PJP, the accuracy of the traditional PCR diagnostic method is 77.58%, while the mean accuracy of the optimal diagnostic model based on convolutional neural networks is 88.90%. CONCLUSION The accuracy of the diagnostic method proposed in this paper is 11.32% higher than that of the traditional PCR diagnostic method. The method proposed in this paper is an efficient, accurate, and non-invasive early diagnosis approach for PJP.
Collapse
Affiliation(s)
- Yingying Li
- Department of Clinical Laboratory, Zhujiang Hospital, Southern Medical University, Guangzhou, China
| | - Hailin Liu
- School of Biomedical Engineering, Southern Medical University, Guangzhou, China
- Department of Information, Zhujiang Hospital, Southern Medical University, Guangzhou, China
| | - Qingwen Lv
- Department of Information, Zhujiang Hospital, Southern Medical University, Guangzhou, China.
| | - Jun Long
- Department of Clinical Laboratory, Zhujiang Hospital, Southern Medical University, Guangzhou, China.
| |
Collapse
|
21
|
Omar M, Xu Z, Rand SB, Alexanderani MK, Salles DC, Valencia I, Schaeffer EM, Robinson BD, Lotan TL, Loda M, Marchionni L. Semi-Supervised, Attention-Based Deep Learning for Predicting TMPRSS2:ERG Fusion Status in Prostate Cancer Using Whole Slide Images. Mol Cancer Res 2024; 22:347-359. [PMID: 38284821 PMCID: PMC10985477 DOI: 10.1158/1541-7786.mcr-23-0639] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2023] [Revised: 12/26/2023] [Accepted: 01/22/2024] [Indexed: 01/30/2024]
Abstract
IMPLICATIONS Our study illuminates the potential of deep learning in effectively inferring key prostate cancer genetic alterations from the tissue morphology depicted in routinely available histology slides, offering a cost-effective method that could revolutionize diagnostic strategies in oncology.
Collapse
Affiliation(s)
- Mohamed Omar
- Department of Pathology and Laboratory Medicine, Weill Cornell Medicine, New York, New York
- Dana-Farber Cancer Institute, Boston, Massachusetts
| | - Zhuoran Xu
- Department of Pathology and Laboratory Medicine, Weill Cornell Medicine, New York, New York
- Dana-Farber Cancer Institute, Boston, Massachusetts
| | - Sophie B. Rand
- Department of Pathology and Laboratory Medicine, Weill Cornell Medicine, New York, New York
- Dana-Farber Cancer Institute, Boston, Massachusetts
| | | | - Daniela C. Salles
- Department of Pathology, Johns Hopkins University, Baltimore, Maryland
| | - Itzel Valencia
- Department of Pathology and Laboratory Medicine, Weill Cornell Medicine, New York, New York
| | | | - Brian D. Robinson
- Department of Pathology and Laboratory Medicine, Weill Cornell Medicine, New York, New York
| | - Tamara L. Lotan
- Department of Pathology, Johns Hopkins University, Baltimore, Maryland
| | - Massimo Loda
- Department of Pathology and Laboratory Medicine, Weill Cornell Medicine, New York, New York
| | - Luigi Marchionni
- Department of Pathology and Laboratory Medicine, Weill Cornell Medicine, New York, New York
| |
Collapse
|
22
|
Jongsma KR, Sand M, Milota M. Why we should not mistake accuracy of medical AI for efficiency. NPJ Digit Med 2024; 7:57. [PMID: 38438477 PMCID: PMC10912629 DOI: 10.1038/s41746-024-01047-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2023] [Accepted: 02/16/2024] [Indexed: 03/06/2024] Open
Affiliation(s)
- Karin Rolanda Jongsma
- Bioethics & Health Humanities, Julius Center, University Medical Center Utrecht, Utrecht University, PO Box 85500, 3508 CA, Utrecht, The Netherlands.
| | - Martin Sand
- TU Delft, Department of Values, Technology and Innovation, Faculty of Technology, Policy and Management, Jaffalaan 5, 2628 BX, Delft, The Netherlands
| | - Megan Milota
- Bioethics & Health Humanities, Julius Center, University Medical Center Utrecht, Utrecht University, PO Box 85500, 3508 CA, Utrecht, The Netherlands
| |
Collapse
|
23
|
Gupta P, Basu S, Arora C. Applications of artificial intelligence in biliary tract cancers. Indian J Gastroenterol 2024:10.1007/s12664-024-01518-0. [PMID: 38427281 DOI: 10.1007/s12664-024-01518-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/03/2023] [Accepted: 12/29/2023] [Indexed: 03/02/2024]
Abstract
Biliary tract cancers are malignant neoplasms arising from bile duct epithelial cells. They include cholangiocarcinomas and gallbladder cancer. Gallbladder cancer has a marked geographical preference and is one of the most common cancers in women in northern India. Biliary tract cancers are usually diagnosed at an advanced, unresectable stage. Hence, the prognosis is extremely dismal. The five-year survival rate in advanced gallbladder cancer is < 5%. Hence, early detection and radical surgery are critical to improving biliary tract cancer prognoses. Radiological imaging plays an essential role in diagnosing and managing biliary tract cancers. However, the diagnosis is challenging because the biliary tract is affected by many diseases that may have radiological appearances similar to cancer. Artificial intelligence (AI) can improve radiologists' performance in various tasks. Deep learning (DL)-based approaches are increasingly incorporated into medical imaging to improve diagnostic performance. This paper reviews the AI-based strategies in biliary tract cancers to improve the diagnosis and prognosis.
Collapse
Affiliation(s)
- Pankaj Gupta
- Department of Radiodiagnosis and Imaging, Postgraduate Institute of Medical Education and Research, Chandigarh, 160 012, India.
| | - Soumen Basu
- Department of Computer Science and Engineering, Indian Institute of Technology - Delhi, New Delhi, 110 016, India
| | - Chetan Arora
- Department of Computer Science and Engineering, Indian Institute of Technology - Delhi, New Delhi, 110 016, India
| |
Collapse
|
24
|
Huang J, Bai X, Qiu Y, He X. Application of AI on cholangiocarcinoma. Front Oncol 2024; 14:1324222. [PMID: 38347839 PMCID: PMC10859478 DOI: 10.3389/fonc.2024.1324222] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Accepted: 01/08/2024] [Indexed: 02/15/2024] Open
Abstract
Cholangiocarcinoma, classified as intrahepatic, perihilar, and extrahepatic, is considered a deadly malignancy of the hepatobiliary system. Most cases of cholangiocarcinoma are asymptomatic. Therefore, early detection of cholangiocarcinoma is significant but still challenging. The routine screening of a tumor lacks specificity and accuracy. With the application of AI, high-risk patients can be easily found by analyzing their clinical characteristics, serum biomarkers, and medical images. Moreover, AI can be used to predict the prognosis including recurrence risk and metastasis. Although they have some limitations, AI algorithms will still significantly improve many aspects of cholangiocarcinoma in the medical field with the development of computing power and technology.
Collapse
Affiliation(s)
| | | | | | - Xiaodong He
- Department of General Surgery, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing, China
| |
Collapse
|
25
|
Klauschen F, Dippel J, Keyl P, Jurmeister P, Bockmayr M, Mock A, Buchstab O, Alber M, Ruff L, Montavon G, Müller KR. Toward Explainable Artificial Intelligence for Precision Pathology. ANNUAL REVIEW OF PATHOLOGY 2024; 19:541-570. [PMID: 37871132 DOI: 10.1146/annurev-pathmechdis-051222-113147] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
The rapid development of precision medicine in recent years has started to challenge diagnostic pathology with respect to its ability to analyze histological images and increasingly large molecular profiling data in a quantitative, integrative, and standardized way. Artificial intelligence (AI) and, more precisely, deep learning technologies have recently demonstrated the potential to facilitate complex data analysis tasks, including clinical, histological, and molecular data for disease classification; tissue biomarker quantification; and clinical outcome prediction. This review provides a general introduction to AI and describes recent developments with a focus on applications in diagnostic pathology and beyond. We explain limitations including the black-box character of conventional AI and describe solutions to make machine learning decisions more transparent with so-called explainable AI. The purpose of the review is to foster a mutual understanding of both the biomedical and the AI side. To that end, in addition to providing an overview of the relevant foundations in pathology and machine learning, we present worked-through examples for a better practical understanding of what AI can achieve and how it should be done.
Collapse
Affiliation(s)
- Frederick Klauschen
- Institute of Pathology, Ludwig-Maximilians-Universität München, Munich, Germany;
- Institute of Pathology, Charité Universitätsmedizin Berlin, Berlin, Germany
- Berlin Institute for the Foundations of Learning and Data (BIFOLD), Berlin, Germany
- German Cancer Consortium, German Cancer Research Center (DKTK/DKFZ), Munich Partner Site, Munich, Germany
| | - Jonas Dippel
- Berlin Institute for the Foundations of Learning and Data (BIFOLD), Berlin, Germany
- Machine Learning Group, Department of Electrical Engineering and Computer Science, Technische Universität Berlin, Berlin, Germany;
| | - Philipp Keyl
- Institute of Pathology, Ludwig-Maximilians-Universität München, Munich, Germany;
| | - Philipp Jurmeister
- Institute of Pathology, Ludwig-Maximilians-Universität München, Munich, Germany;
- German Cancer Consortium, German Cancer Research Center (DKTK/DKFZ), Munich Partner Site, Munich, Germany
| | - Michael Bockmayr
- Institute of Pathology, Charité Universitätsmedizin Berlin, Berlin, Germany
- Department of Pediatric Hematology and Oncology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
- Research Institute Children's Cancer Center Hamburg, Hamburg, Germany
| | - Andreas Mock
- Institute of Pathology, Ludwig-Maximilians-Universität München, Munich, Germany;
- German Cancer Consortium, German Cancer Research Center (DKTK/DKFZ), Munich Partner Site, Munich, Germany
| | - Oliver Buchstab
- Institute of Pathology, Ludwig-Maximilians-Universität München, Munich, Germany;
| | - Maximilian Alber
- Institute of Pathology, Charité Universitätsmedizin Berlin, Berlin, Germany
- Aignostics, Berlin, Germany
| | | | - Grégoire Montavon
- Berlin Institute for the Foundations of Learning and Data (BIFOLD), Berlin, Germany
- Machine Learning Group, Department of Electrical Engineering and Computer Science, Technische Universität Berlin, Berlin, Germany;
- Department of Mathematics and Computer Science, Freie Universität Berlin, Berlin, Germany
| | - Klaus-Robert Müller
- Berlin Institute for the Foundations of Learning and Data (BIFOLD), Berlin, Germany
- Machine Learning Group, Department of Electrical Engineering and Computer Science, Technische Universität Berlin, Berlin, Germany;
- Department of Artificial Intelligence, Korea University, Seoul, Korea
- Max Planck Institute for Informatics, Saarbrücken, Germany
| |
Collapse
|
26
|
Moterani VC, Abbade JF, Borges VTM, Fonseca CGF, Desiderio N, Moterani Junior NJW, Gonçalves Moterani LBB. [Guidelines for clinical trial protocols for interventions involving artificial intelligence: the SPIRIT-AI extensionDiretrizes para protocolos de ensaios clínicos com intervenções que utilizam inteligência artificial: a extensão SPIRIT-AI]. Rev Panam Salud Publica 2023; 47:e149. [PMID: 38361499 PMCID: PMC10868409 DOI: 10.26633/rpsp.2023.149] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2020] [Accepted: 07/23/2020] [Indexed: 01/10/2024] Open
Abstract
The SPIRIT 2013 statement aims to improve the completeness of clinical trial protocol reporting by providing evidence-based recommendations for the minimum set of items to be addressed. This guidance has been instrumental in promoting transparent evaluation of new interventions. More recently, there has been a growing recognition that interventions involving artificial intelligence (AI) need to undergo rigorous, prospective evaluation to demonstrate their impact on health outcomes. The SPIRIT-AI (Standard Protocol Items: Recommendations for Interventional Trials-Artificial Intelligence) extension is a new reporting guideline for clinical trial protocols evaluating interventions with an AI component. It was developed in parallel with its companion statement for trial reports: CONSORT-AI (Consolidated Standards of Reporting Trials-Artificial Intelligence). Both guidelines were developed through a staged consensus process involving literature review and expert consultation to generate 26 candidate items, which were consulted upon by an international multi-stakeholder group in a two-stage Delphi survey (103 stakeholders), agreed upon in a consensus meeting (31 stakeholders) and refined through a checklist pilot (34 participants). The SPIRIT-AI extension includes 15 new items that were considered sufficiently important for clinical trial protocols of AI interventions. These new items should be routinely reported in addition to the core SPIRIT 2013 items. SPIRIT-AI recommends that investigators provide clear descriptions of the AI intervention, including instructions and skills required for use, the setting in which the AI intervention will be integrated, considerations for the handling of input and output data, the human-AI interaction and analysis of error cases. SPIRIT-AI will help promote transparency and completeness for clinical trial protocols for AI interventions. Its use will assist editors and peer reviewers, as well as the general readership, to understand, interpret and critically appraise the design and risk of bias for a planned clinical trial.
Collapse
Affiliation(s)
- Vinicius Cesar Moterani
- Universidade Estadual Paulista “Júlio de Mesquita Filho,”BotucatuBrazilUniversidade Estadual Paulista “Júlio de Mesquita Filho,” Botucatu, Brazil
| | - Joelcio Francisco Abbade
- Universidade Estadual Paulista “Júlio de Mesquita Filho,”BotucatuBrazilUniversidade Estadual Paulista “Júlio de Mesquita Filho,” Botucatu, Brazil
| | - Vera Therezinha Medeiros Borges
- Universidade Estadual Paulista “Júlio de Mesquita Filho,”BotucatuBrazilUniversidade Estadual Paulista “Júlio de Mesquita Filho,” Botucatu, Brazil
| | - Cecilia Guimarães Ferreira Fonseca
- Universidade Estadual Paulista “Júlio de Mesquita Filho,”BotucatuBrazilUniversidade Estadual Paulista “Júlio de Mesquita Filho,” Botucatu, Brazil
| | - Nathalia Desiderio
- Marilia Medical SchoolMariliaBrazilMarilia Medical School, Marilia, Brazil
| | | | | |
Collapse
|
27
|
Ke J, Liu K, Sun Y, Xue Y, Huang J, Lu Y, Dai J, Chen Y, Han X, Shen Y, Shen D. Artifact Detection and Restoration in Histology Images With Stain-Style and Structural Preservation. IEEE TRANSACTIONS ON MEDICAL IMAGING 2023; 42:3487-3500. [PMID: 37352087 DOI: 10.1109/tmi.2023.3288940] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/25/2023]
Abstract
The artifacts in histology images may encumber the accurate interpretation of medical information and cause misdiagnosis. Accordingly, prepending manual quality control of artifacts considerably decreases the degree of automation. To close this gap, we propose a methodical pre-processing framework to detect and restore artifacts, which minimizes their impact on downstream AI diagnostic tasks. First, the artifact recognition network AR-Classifier first differentiates common artifacts from normal tissues, e.g., tissue folds, marking dye, tattoo pigment, spot, and out-of-focus, and also catalogs artifact patches by their restorability. Then, the succeeding artifact restoration network AR-CycleGAN performs de-artifact processing where stain styles and tissue structures can be maximally retained. We construct a benchmark for performance evaluation, curated from both clinically collected WSIs and public datasets of colorectal and breast cancer. The functional structures are compared with state-of-the-art methods, and also comprehensively evaluated by multiple metrics across multiple tasks, including artifact classification, artifact restoration, downstream diagnostic tasks of tumor classification and nuclei segmentation. The proposed system allows full automation of deep learning based histology image analysis without human intervention. Moreover, the structure-independent characteristic enables its processing with various artifact subtypes. The source code and data in this research are available at https://github.com/yunboer/AR-classifier-and-AR-CycleGAN.
Collapse
|
28
|
Masoumian Hosseini M, Masoumian Hosseini ST, Qayumi K, Hosseinzadeh S, Sajadi Tabar SS. Smartwatches in healthcare medicine: assistance and monitoring; a scoping review. BMC Med Inform Decis Mak 2023; 23:248. [PMID: 37924029 PMCID: PMC10625201 DOI: 10.1186/s12911-023-02350-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2023] [Accepted: 10/22/2023] [Indexed: 11/06/2023] Open
Abstract
Smartwatches have become increasingly popular in recent times because of their capacity to track different health indicators, including heart rate, patterns of sleep, and physical movements. This scoping review aims to explore the utilisation of smartwatches within the healthcare sector. According to Arksey and O'Malley's methodology, an organised search was performed in PubMed/Medline, Scopus, Embase, Web of Science, ERIC and Google Scholar. In our search strategy, 761 articles were returned. The exclusion/inclusion criteria were applied. Finally, 35 articles were selected for extracting data. These included six studies on stress monitoring, six on movement disorders, three on sleep tracking, three on blood pressure, two on heart disease, six on covid pandemic, three on safety and six on validation. The use of smartwatches has been found to be effective in diagnosing the symptoms of various diseases. In particular, smartwatches have shown promise in detecting heart diseases, movement disorders, and even early signs of COVID-19. Nevertheless, it should be emphasised that there is an ongoing discussion concerning the reliability of smartwatch diagnoses within healthcare systems. Despite the potential advantages offered by utilising smartwatches for disease detection, it is imperative to approach their data interpretation with prudence. The discrepancies in detection between smartwatches and their algorithms have important implications for healthcare use. The accuracy and reliability of the algorithms used are crucial, as well as high accuracy in detecting changes in health status by the smartwatches themselves. This calls for the development of medical watches and the creation of AI-hospital assistants. These assistants will be designed to help with patient monitoring, appointment scheduling, and medication management tasks. They can educate patients and answer common questions, freeing healthcare providers to focus on more complex tasks.
Collapse
Affiliation(s)
- Mohsen Masoumian Hosseini
- Department of E-Learning in Medical Sciences, Tehran University of Medical Sciences, Tehran, Iran
- CyberPatient Research Affiliate, Interactive Health International, Department of the surgery, University of British Columbia, Vancouver, Canada
| | - Seyedeh Toktam Masoumian Hosseini
- CyberPatient Research Affiliate, Interactive Health International, Department of the surgery, University of British Columbia, Vancouver, Canada.
- Department of Nursing, School of Nursing and Midwifery, Torbat Heydariyeh University of Medical Sciences, Torbat Heydariyeh, Iran.
| | - Karim Qayumi
- Professor at Department of Surgery, University of British Columbia, Vancouver, Canada
| | - Shahriar Hosseinzadeh
- CyberPatient Research Coordinator, Interactive Health International, Department of Surgery, University of British Columbia, Vancouver, Canada
| | | |
Collapse
|
29
|
Albrecht T, Rossberg A, Albrecht JD, Nicolay JP, Straub BK, Gerber TS, Albrecht M, Brinkmann F, Charbel A, Schwab C, Schreck J, Brobeil A, Flechtenmacher C, von Winterfeld M, Köhler BC, Springfeld C, Mehrabi A, Singer S, Vogel MN, Neumann O, Stenzinger A, Schirmacher P, Weis CA, Roessler S, Kather JN, Goeppert B. Deep Learning-Enabled Diagnosis of Liver Adenocarcinoma. Gastroenterology 2023; 165:1262-1275. [PMID: 37562657 DOI: 10.1053/j.gastro.2023.07.026] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/26/2023] [Revised: 07/18/2023] [Accepted: 07/20/2023] [Indexed: 08/12/2023]
Abstract
BACKGROUND & AIMS Diagnosis of adenocarcinoma in the liver is a frequent scenario in routine pathology and has a critical impact on clinical decision making. However, rendering a correct diagnosis can be challenging, and often requires the integration of clinical, radiologic, and immunohistochemical information. We present a deep learning model (HEPNET) to distinguish intrahepatic cholangiocarcinoma from colorectal liver metastasis, as the most frequent primary and secondary forms of liver adenocarcinoma, with clinical grade accuracy using H&E-stained whole-slide images. METHODS HEPNET was trained on 714,589 image tiles from 456 patients who were randomly selected in a stratified manner from a pool of 571 patients who underwent surgical resection or biopsy at Heidelberg University Hospital. Model performance was evaluated on a hold-out internal test set comprising 115 patients and externally validated on 159 patients recruited at Mainz University Hospital. RESULTS On the hold-out internal test set, HEPNET achieved an area under the receiver operating characteristic curve of 0.994 (95% CI, 0.989-1.000) and an accuracy of 96.522% (95% CI, 94.521%-98.694%) at the patient level. Validation on the external test set yielded an area under the receiver operating characteristic curve of 0.997 (95% CI, 0.995-1.000), corresponding to an accuracy of 98.113% (95% CI, 96.907%-100.000%). HEPNET surpassed the performance of 6 pathology experts with different levels of experience in a reader study of 50 patients (P = .0005), boosted the performance of resident pathologists to the level of senior pathologists, and reduced potential downstream analyses. CONCLUSIONS We provided a ready-to-use tool with clinical grade performance that may facilitate routine pathology by rendering a definitive diagnosis and guiding ancillary testing. The incorporation of HEPNET into pathology laboratories may optimize the diagnostic workflow, complemented by test-related labor and cost savings.
Collapse
Affiliation(s)
- Thomas Albrecht
- Institute of Pathology, Heidelberg University Hospital, Heidelberg, Germany; Liver Cancer Center Heidelberg, Heidelberg, Germany.
| | - Annik Rossberg
- Institute of Pathology, Heidelberg University Hospital, Heidelberg, Germany
| | | | - Jan Peter Nicolay
- Department of Dermatology, University Medical Centre Mannheim, Mannheim, Germany
| | - Beate Katharina Straub
- Institute of Pathology, University Medicine, Johannes Gutenberg University, Mainz, Germany
| | - Tiemo Sven Gerber
- Institute of Pathology, University Medicine, Johannes Gutenberg University, Mainz, Germany
| | - Michael Albrecht
- European Center for Angioscience, Medical Faculty of Mannheim, Mannheim, Germany
| | - Fritz Brinkmann
- Institute of Pathology, Heidelberg University Hospital, Heidelberg, Germany
| | - Alphonse Charbel
- Institute of Pathology, Heidelberg University Hospital, Heidelberg, Germany
| | - Constantin Schwab
- Institute of Pathology, Heidelberg University Hospital, Heidelberg, Germany
| | - Johannes Schreck
- Institute of Pathology, Heidelberg University Hospital, Heidelberg, Germany
| | - Alexander Brobeil
- Institute of Pathology, Heidelberg University Hospital, Heidelberg, Germany
| | | | | | - Bruno Christian Köhler
- Liver Cancer Center Heidelberg, Heidelberg, Germany; Department of Medical Oncology, National Center for Tumor Diseases, Heidelberg University Hospital, Heidelberg, Germany
| | - Christoph Springfeld
- Liver Cancer Center Heidelberg, Heidelberg, Germany; Department of Medical Oncology, National Center for Tumor Diseases, Heidelberg University Hospital, Heidelberg, Germany
| | - Arianeb Mehrabi
- Liver Cancer Center Heidelberg, Heidelberg, Germany; Department of General, Visceral and Transplantation Surgery, Heidelberg University Hospital, Heidelberg, Germany
| | - Stephan Singer
- Institute of Pathology and Neuropathology, Eberhard-Karls University, Tübingen, Germany
| | - Monika Nadja Vogel
- Diagnostic and Interventional Radiology, Thoraxklinik at Heidelberg University Hospital, Heidelberg, Germany
| | - Olaf Neumann
- Institute of Pathology, Heidelberg University Hospital, Heidelberg, Germany
| | | | - Peter Schirmacher
- Institute of Pathology, Heidelberg University Hospital, Heidelberg, Germany; Liver Cancer Center Heidelberg, Heidelberg, Germany
| | - Cleo-Aron Weis
- Institute of Pathology, Heidelberg University Hospital, Heidelberg, Germany
| | - Stephanie Roessler
- Institute of Pathology, Heidelberg University Hospital, Heidelberg, Germany; Liver Cancer Center Heidelberg, Heidelberg, Germany
| | - Jakob Nikolas Kather
- Department of Medical Oncology, National Center for Tumor Diseases, Heidelberg University Hospital, Heidelberg, Germany; Else Kroener Fresenius Center for Digital Health, Medical Faculty Carl Gustav Carus, Technical University Dresden, Dresden, Germany
| | - Benjamin Goeppert
- Institute of Pathology, Heidelberg University Hospital, Heidelberg, Germany; Institute of Pathology and Neuropathology, RKH Hospital Ludwigsburg, Ludwigsburg, Germany; Institute of Tissue Medicine and Pathology, University of Bern, Bern, Switzerland
| |
Collapse
|
30
|
Feng S, Wang J, Wang L, Qiu Q, Chen D, Su H, Li X, Xiao Y, Lin C. Current Status and Analysis of Machine Learning in Hepatocellular Carcinoma. J Clin Transl Hepatol 2023; 11:1184-1191. [PMID: 37577233 PMCID: PMC10412715 DOI: 10.14218/jcth.2022.00077s] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/26/2022] [Revised: 12/11/2022] [Accepted: 02/21/2023] [Indexed: 07/03/2023] Open
Abstract
Hepatocellular carcinoma (HCC) is a common tumor. Although the diagnosis and treatment of HCC have made great progress, the overall prognosis remains poor. As the core component of artificial intelligence, machine learning (ML) has developed rapidly in the past decade. In particular, ML has become widely used in the medical field, and it has helped in the diagnosis and treatment of cancer. Different algorithms of ML have different roles in diagnosis, treatment, and prognosis. This article reviews recent research, explains the application of different ML models in HCC, and provides suggestions for follow-up research.
Collapse
Affiliation(s)
- Sijia Feng
- General Surgery, Central South University Xiangya Hospital, Changsha, Hunan, China
| | - Jianhua Wang
- General Surgery, Central South University Xiangya Hospital, Changsha, Hunan, China
| | - Liheng Wang
- General Surgery, Central South University Xiangya Hospital, Changsha, Hunan, China
| | - Qixuan Qiu
- General Surgery, Central South University Xiangya Hospital, Changsha, Hunan, China
| | - Dongdong Chen
- General Surgery, Central South University Xiangya Hospital, Changsha, Hunan, China
| | - Huo Su
- General Surgery, Central South University Xiangya Hospital, Changsha, Hunan, China
| | - Xiaoli Li
- General Surgery, Central South University Xiangya Hospital, Changsha, Hunan, China
| | - Yao Xiao
- General Surgery, Central South University Xiangya Hospital, Changsha, Hunan, China
| | - Chiayen Lin
- General Surgery, Central South University Xiangya Hospital, Changsha, Hunan, China
| |
Collapse
|
31
|
Oudin A, Maatoug R, Bourla A, Ferreri F, Bonnot O, Millet B, Schoeller F, Mouchabac S, Adrien V. Digital Phenotyping: Data-Driven Psychiatry to Redefine Mental Health. J Med Internet Res 2023; 25:e44502. [PMID: 37792430 PMCID: PMC10585447 DOI: 10.2196/44502] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Revised: 07/10/2023] [Accepted: 08/21/2023] [Indexed: 10/05/2023] Open
Abstract
The term "digital phenotype" refers to the digital footprint left by patient-environment interactions. It has potential for both research and clinical applications but challenges our conception of health care by opposing 2 distinct approaches to medicine: one centered on illness with the aim of classifying and curing disease, and the other centered on patients, their personal distress, and their lived experiences. In the context of mental health and psychiatry, the potential benefits of digital phenotyping include creating new avenues for treatment and enabling patients to take control of their own well-being. However, this comes at the cost of sacrificing the fundamental human element of psychotherapy, which is crucial to addressing patients' distress. In this viewpoint paper, we discuss the advances rendered possible by digital phenotyping and highlight the risk that this technology may pose by partially excluding health care professionals from the diagnosis and therapeutic process, thereby foregoing an essential dimension of care. We conclude by setting out concrete recommendations on how to improve current digital phenotyping technology so that it can be harnessed to redefine mental health by empowering patients without alienating them.
Collapse
Affiliation(s)
- Antoine Oudin
- Infrastructure for Clinical Research in Neurosciences, Paris Brain Institute, Sorbonne University- Institut national de la santé et de la recherche médicale - Centre national de la recherche scientifique, Paris, France
- Department of Psychiatry, Pitié-Salpêtrière Hospital, Public Hospitals of Sorbonne University, Paris, France
| | - Redwan Maatoug
- Infrastructure for Clinical Research in Neurosciences, Paris Brain Institute, Sorbonne University- Institut national de la santé et de la recherche médicale - Centre national de la recherche scientifique, Paris, France
- Department of Psychiatry, Pitié-Salpêtrière Hospital, Public Hospitals of Sorbonne University, Paris, France
| | - Alexis Bourla
- Infrastructure for Clinical Research in Neurosciences, Paris Brain Institute, Sorbonne University- Institut national de la santé et de la recherche médicale - Centre national de la recherche scientifique, Paris, France
- Department of Psychiatry, Saint-Antoine Hospital, Public Hospitals of Sorbonne University, Paris, France
- Medical Strategy and Innovation Department, Clariane, Paris, France
- NeuroStim Psychiatry Practice, Paris, France
| | - Florian Ferreri
- Infrastructure for Clinical Research in Neurosciences, Paris Brain Institute, Sorbonne University- Institut national de la santé et de la recherche médicale - Centre national de la recherche scientifique, Paris, France
- Department of Psychiatry, Saint-Antoine Hospital, Public Hospitals of Sorbonne University, Paris, France
| | - Olivier Bonnot
- Department of Child and Adolescent Psychiatry, Nantes University Hospital, Nantes, France
- Pays de la Loire Psychology Laboratory, Nantes University, Nantes, France
| | - Bruno Millet
- Infrastructure for Clinical Research in Neurosciences, Paris Brain Institute, Sorbonne University- Institut national de la santé et de la recherche médicale - Centre national de la recherche scientifique, Paris, France
- Department of Psychiatry, Pitié-Salpêtrière Hospital, Public Hospitals of Sorbonne University, Paris, France
| | - Félix Schoeller
- Institute for Advanced Consciousness Studies, Santa Monica, CA, United States
- Media Lab, Massachusetts Institute of Technology, Cambridge, MA, United States
| | - Stéphane Mouchabac
- Infrastructure for Clinical Research in Neurosciences, Paris Brain Institute, Sorbonne University- Institut national de la santé et de la recherche médicale - Centre national de la recherche scientifique, Paris, France
- Department of Psychiatry, Saint-Antoine Hospital, Public Hospitals of Sorbonne University, Paris, France
| | - Vladimir Adrien
- Infrastructure for Clinical Research in Neurosciences, Paris Brain Institute, Sorbonne University- Institut national de la santé et de la recherche médicale - Centre national de la recherche scientifique, Paris, France
- Department of Psychiatry, Saint-Antoine Hospital, Public Hospitals of Sorbonne University, Paris, France
| |
Collapse
|
32
|
Gichoya JW, Thomas K, Celi LA, Safdar N, Banerjee I, Banja JD, Seyyed-Kalantari L, Trivedi H, Purkayastha S. AI pitfalls and what not to do: mitigating bias in AI. Br J Radiol 2023; 96:20230023. [PMID: 37698583 PMCID: PMC10546443 DOI: 10.1259/bjr.20230023] [Citation(s) in RCA: 20] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2023] [Revised: 08/10/2023] [Accepted: 08/14/2023] [Indexed: 09/13/2023] Open
Abstract
Various forms of artificial intelligence (AI) applications are being deployed and used in many healthcare systems. As the use of these applications increases, we are learning the failures of these models and how they can perpetuate bias. With these new lessons, we need to prioritize bias evaluation and mitigation for radiology applications; all the while not ignoring the impact of changes in the larger enterprise AI deployment which may have downstream impact on performance of AI models. In this paper, we provide an updated review of known pitfalls causing AI bias and discuss strategies for mitigating these biases within the context of AI deployment in the larger healthcare enterprise. We describe these pitfalls by framing them in the larger AI lifecycle from problem definition, data set selection and curation, model training and deployment emphasizing that bias exists across a spectrum and is a sequela of a combination of both human and machine factors.
Collapse
Affiliation(s)
| | - Kaesha Thomas
- Department of Radiology, Emory University, Atlanta, United States
| | | | - Nabile Safdar
- Department of Radiology, Emory University, Atlanta, United States
| | - Imon Banerjee
- School of Computing, Informatics, and Decision Systems Engineering, Arizona State University, Tempe, United States
| | - John D Banja
- Emory University Center for Ethics, Emory University, Atlanta, United States
| | - Laleh Seyyed-Kalantari
- Department of Electrical Engineering and Computer Science, Lassonde School of Engineering, York University, North York, United States
| | - Hari Trivedi
- Department of Radiology, Emory University, Atlanta, United States
| | - Saptarshi Purkayastha
- School of Informatics and Computing, Indiana University Purdue University, Indianapolis, United States
| |
Collapse
|
33
|
Zhang A, Wu Z, Wu E, Wu M, Snyder MP, Zou J, Wu JC. Leveraging physiology and artificial intelligence to deliver advancements in health care. Physiol Rev 2023; 103:2423-2450. [PMID: 37104717 PMCID: PMC10390055 DOI: 10.1152/physrev.00033.2022] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2022] [Revised: 03/06/2023] [Accepted: 04/25/2023] [Indexed: 04/29/2023] Open
Abstract
Artificial intelligence in health care has experienced remarkable innovation and progress in the last decade. Significant advancements can be attributed to the utilization of artificial intelligence to transform physiology data to advance health care. In this review, we explore how past work has shaped the field and defined future challenges and directions. In particular, we focus on three areas of development. First, we give an overview of artificial intelligence, with special attention to the most relevant artificial intelligence models. We then detail how physiology data have been harnessed by artificial intelligence to advance the main areas of health care: automating existing health care tasks, increasing access to care, and augmenting health care capabilities. Finally, we discuss emerging concerns surrounding the use of individual physiology data and detail an increasingly important consideration for the field, namely the challenges of deploying artificial intelligence models to achieve meaningful clinical impact.
Collapse
Affiliation(s)
- Angela Zhang
- Stanford Cardiovascular Institute, School of Medicine, Stanford University, Stanford, California, United States
- Department of Genetics, School of Medicine, Stanford University, Stanford, California, United States
- Greenstone Biosciences, Palo Alto, California, United States
| | - Zhenqin Wu
- Department of Chemistry, Stanford University, Stanford, California, United States
| | - Eric Wu
- Department of Electrical Engineering, Stanford University, Stanford, California, United States
| | - Matthew Wu
- Greenstone Biosciences, Palo Alto, California, United States
| | - Michael P Snyder
- Department of Genetics, School of Medicine, Stanford University, Stanford, California, United States
| | - James Zou
- Department of Biomedical Informatics, School of Medicine, Stanford University, Stanford, California, United States
- Department of Computer Science, Stanford University, Stanford, California, United States
| | - Joseph C Wu
- Stanford Cardiovascular Institute, School of Medicine, Stanford University, Stanford, California, United States
- Greenstone Biosciences, Palo Alto, California, United States
- Division of Cardiovascular Medicine, Department of Medicine, Stanford University, Stanford, California, United States
- Department of Radiology, School of Medicine, Stanford University, Stanford, California, United States
| |
Collapse
|
34
|
Wang DY, Ding J, Sun AL, Liu SG, Jiang D, Li N, Yu JK. Artificial intelligence suppression as a strategy to mitigate artificial intelligence automation bias. J Am Med Inform Assoc 2023; 30:1684-1692. [PMID: 37561535 PMCID: PMC10531198 DOI: 10.1093/jamia/ocad118] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2022] [Revised: 05/30/2023] [Accepted: 06/19/2023] [Indexed: 08/11/2023] Open
Abstract
BACKGROUND Incorporating artificial intelligence (AI) into clinics brings the risk of automation bias, which potentially misleads the clinician's decision-making. The purpose of this study was to propose a potential strategy to mitigate automation bias. METHODS This was a laboratory study with a randomized cross-over design. The diagnosis of anterior cruciate ligament (ACL) rupture, a common injury, on magnetic resonance imaging (MRI) was used as an example. Forty clinicians were invited to diagnose 200 ACLs with and without AI assistance. The AI's correcting and misleading (automation bias) effects on the clinicians' decision-making processes were analyzed. An ordinal logistic regression model was employed to predict the correcting and misleading probabilities of the AI. We further proposed an AI suppression strategy that retracted AI diagnoses with a higher misleading probability and provided AI diagnoses with a higher correcting probability. RESULTS The AI significantly increased clinicians' accuracy from 87.2%±13.1% to 96.4%±1.9% (P < .001). However, the clinicians' errors in the AI-assisted round were associated with automation bias, accounting for 45.5% of the total mistakes. The automation bias was found to affect clinicians of all levels of expertise. Using a logistic regression model, we identified an AI output zone with higher probability to generate misleading diagnoses. The proposed AI suppression strategy was estimated to decrease clinicians' automation bias by 41.7%. CONCLUSION Although AI improved clinicians' diagnostic performance, automation bias was a serious problem that should be addressed in clinical practice. The proposed AI suppression strategy is a practical method for decreasing automation bias.
Collapse
Affiliation(s)
- Ding-Yu Wang
- Department of Sports Medicine, Peking University Third Hospital, Institute of Sports Medicine of Peking University, Beijing, China
- Beijing Key Laboratory of Sports Injuries, Beijing, China
- Engineering Research Center of Sports Trauma Treatment Technology and Devices, Ministry of Education, Beijing, China
| | - Jia Ding
- Beijing Yizhun Medical AI Co., Ltd, Beijing, China
| | - An-Lan Sun
- Beijing Yizhun Medical AI Co., Ltd, Beijing, China
| | - Shang-Gui Liu
- Department of Sports Medicine, Peking University Third Hospital, Institute of Sports Medicine of Peking University, Beijing, China
- Beijing Key Laboratory of Sports Injuries, Beijing, China
- Engineering Research Center of Sports Trauma Treatment Technology and Devices, Ministry of Education, Beijing, China
| | - Dong Jiang
- Department of Sports Medicine, Peking University Third Hospital, Institute of Sports Medicine of Peking University, Beijing, China
- Beijing Key Laboratory of Sports Injuries, Beijing, China
- Engineering Research Center of Sports Trauma Treatment Technology and Devices, Ministry of Education, Beijing, China
| | - Nan Li
- Research Center of Clinical Epidemiology, Peking University Third Hospital, Beijing, China
| | - Jia-Kuo Yu
- Department of Sports Medicine, Peking University Third Hospital, Institute of Sports Medicine of Peking University, Beijing, China
- Beijing Key Laboratory of Sports Injuries, Beijing, China
- Engineering Research Center of Sports Trauma Treatment Technology and Devices, Ministry of Education, Beijing, China
| |
Collapse
|
35
|
Solomon BD, Chung WK. Artificial intelligence and the impact on medical genetics. AMERICAN JOURNAL OF MEDICAL GENETICS. PART C, SEMINARS IN MEDICAL GENETICS 2023; 193:e32060. [PMID: 37565625 DOI: 10.1002/ajmg.c.32060] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/02/2023] [Revised: 07/24/2023] [Accepted: 07/29/2023] [Indexed: 08/12/2023]
Abstract
Virtually all areas of biomedicine will be increasingly affected by applications of artificial intelligence (AI). We discuss how AI may affect fields of medical genetics, including both clinicians and laboratorians. In addition to reviewing the anticipated impact, we provide recommendations for ways in which these groups may want to evolve in light of the influence of AI. We also briefly discuss how educational and training programs can play a key role in preparing the future workforce given these anticipated changes.
Collapse
Affiliation(s)
- Benjamin D Solomon
- Medical Genetics Branch, National Human Genome Research Institute, Bethesda, Maryland, USA
| | - Wendy K Chung
- Department of Pediatrics, Boston Children's Hospital, Harvard Medical School, Boston, Massachusetts, USA
| |
Collapse
|
36
|
Cooper M, Ji Z, Krishnan RG. Machine learning in computational histopathology: Challenges and opportunities. Genes Chromosomes Cancer 2023; 62:540-556. [PMID: 37314068 DOI: 10.1002/gcc.23177] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2023] [Revised: 05/18/2023] [Accepted: 05/20/2023] [Indexed: 06/15/2023] Open
Abstract
Digital histopathological images, high-resolution images of stained tissue samples, are a vital tool for clinicians to diagnose and stage cancers. The visual analysis of patient state based on these images are an important part of oncology workflow. Although pathology workflows have historically been conducted in laboratories under a microscope, the increasing digitization of histopathological images has led to their analysis on computers in the clinic. The last decade has seen the emergence of machine learning, and deep learning in particular, a powerful set of tools for the analysis of histopathological images. Machine learning models trained on large datasets of digitized histopathology slides have resulted in automated models for prediction and stratification of patient risk. In this review, we provide context for the rise of such models in computational histopathology, highlight the clinical tasks they have found success in automating, discuss the various machine learning techniques that have been applied to this domain, and underscore open problems and opportunities.
Collapse
Affiliation(s)
- Michael Cooper
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
- University Health Network, Toronto, Ontario, Canada
- Vector Institute, Toronto, Ontario, Canada
| | - Zongliang Ji
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
- Vector Institute, Toronto, Ontario, Canada
| | - Rahul G Krishnan
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
- Vector Institute, Toronto, Ontario, Canada
- Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, Ontario, Canada
| |
Collapse
|
37
|
Köteles MM, Vigdorovits A, Kumar D, Mihai IM, Jurescu A, Gheju A, Bucur A, Harich OO, Olteanu GE. Comparative Evaluation of Breast Ductal Carcinoma Grading: A Deep-Learning Model and General Pathologists' Assessment Approach. Diagnostics (Basel) 2023; 13:2326. [PMID: 37510069 PMCID: PMC10377791 DOI: 10.3390/diagnostics13142326] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Revised: 07/05/2023] [Accepted: 07/07/2023] [Indexed: 07/30/2023] Open
Abstract
Breast cancer is the most prevalent neoplasia among women, with early and accurate diagnosis critical for effective treatment. In clinical practice, however, the subjective nature of histological grading of infiltrating ductal adenocarcinoma of the breast (DAC-NOS) often leads to inconsistencies among pathologists, posing a significant challenge to achieving optimal patient outcomes. Our study aimed to address this reproducibility problem by leveraging artificial intelligence (AI). We trained a deep-learning model using a convolutional neural network-based algorithm (CNN-bA) on 100 whole slide images (WSIs) of DAC-NOS from the Cancer Genome Atlas Breast Invasive Carcinoma (TCGA-BRCA) dataset. Our model demonstrated high precision, sensitivity, and F1 score across different grading components in about 17.5 h with 19,000 iterations. However, the agreement between the model's grading and that of general pathologists varied, showing the highest agreement for the mitotic count score. These findings suggest that AI has the potential to enhance the accuracy and reproducibility of breast cancer grading, warranting further refinement and validation of this approach.
Collapse
Affiliation(s)
| | - Alon Vigdorovits
- Bihor County Clinical Emergency Hospital, Gh. Doja Street No. 65, 410169 Oradea, Romania
- Center for Research and Innovation in Personalized Medicine of Respiratory Diseases, "Victor Babes" University of Medicine and Pharmacy, Timisoara Eftimie Murgu Sq. No. 2, 300041 Timisoara, Romania
- Victor Babes Institute of Pathology-Next Generation Pathology Research Group, Splaiul Independenţei 99-101, 050096 Bucharest, Romania
| | | | - Ioana-Maria Mihai
- Department of Microscopic Morphology-Morphopatology, ANAPATMOL Research Center, "Victor Babes" University of Medicine and Pharmacy, 300041 Timisoara, Romania
| | - Aura Jurescu
- Department of Microscopic Morphology-Morphopatology, ANAPATMOL Research Center, "Victor Babes" University of Medicine and Pharmacy, 300041 Timisoara, Romania
| | - Adelina Gheju
- Emergency County Hospital Deva, Bulevardul 22 Decembrie 58, 330032 Deva, Romania
| | - Adeline Bucur
- Department of Microscopic Morphology, Discipline of Histology, "Victor Babes" University of Medicine and Pharmacy, Timisoara Eftimie Murgu Sq. No. 2, 300041 Timisoara, Romania
| | - Octavia Oana Harich
- Department of Functional Sciences, "Victor Babes" University of Medicine and Pharmacy, Timisoara Eftimie Murgu Sq. No. 2, 300041 Timisoara, Romania
| | - Gheorghe-Emilian Olteanu
- Center for Research and Innovation in Personalized Medicine of Respiratory Diseases, "Victor Babes" University of Medicine and Pharmacy, Timisoara Eftimie Murgu Sq. No. 2, 300041 Timisoara, Romania
- Faculty of Pharmacy, "Victor Babes" University of Medicine and Pharmacy, Eftimie Murgu Square No. 2, 300041 Timisoara, Romania
- Research Center for Pharmaco-Toxicological Evaluations, Faculty of Pharmacy, "Victor Babes" University of Medicine and Pharmacy, Eftimie Murgu Square No. 2, 300041 Timisoara, Romania
- Center of Expertise for Rare Lung Diseases, Clinical Hospital of Infectious Diseases and Pneumophthisiology "Dr. Victor Babes" Timisoara, Gh. Adam Street No. 13, 300310 Timisoara, Romania
| |
Collapse
|
38
|
Kriegsmann M, Kriegsmann K, Steinbuss G, Zgorzelski C, Albrecht T, Heinrich S, Farkas S, Roth W, Dang H, Hausen A, Gaida MM. Implementation of deep learning in liver pathology optimizes diagnosis of benign lesions and adenocarcinoma metastasis. Clin Transl Med 2023; 13:e1299. [PMID: 37415390 PMCID: PMC10326372 DOI: 10.1002/ctm2.1299] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Accepted: 05/28/2023] [Indexed: 07/08/2023] Open
Abstract
INTRODUCTION Differentiation of histologically similar structures in the liver, including anatomical structures, benign bile duct lesions, or common types of liver metastases, can be challenging with conventional histological tissue sections alone. Accurate histopathological classification is paramount for the diagnosis and adequate treatment of the disease. Deep learning algorithms have been proposed for objective and consistent assessment of digital histopathological images. MATERIALS AND METHODS In the present study, we trained and evaluated deep learning algorithms based on the EfficientNetV2 and ResNetRS architectures to discriminate between different histopathological classes. For the required dataset, specialized surgical pathologists annotated seven different histological classes, including different non-neoplastic anatomical structures, benign bile duct lesions, and liver metastases from colorectal and pancreatic adenocarcinoma in a large patient cohort. Annotation resulted in a total of 204.159 image patches, followed by discrimination analysis using our deep learning models. Model performance was evaluated on validation and test data using confusion matrices. RESULTS Evaluation of the test set based on tiles and cases revealed overall highly satisfactory prediction capability of our algorithm for the different histological classes, resulting in a tile accuracy of 89% (38 413/43 059) and case accuracy of 94% (198/211). Importantly, the separation of metastasis versus benign lesions was certainly confident on case level, confirming the classification model performed with high diagnostic accuracy. Moreover, the whole curated raw data set is made publically available. CONCLUSIONS Deep learning is a promising approach in surgical liver pathology supporting decision making in personalized medicine.
Collapse
Affiliation(s)
- Mark Kriegsmann
- Institute of PathologyHeidelberg UniversityHeidelbergGermany
- Pathology WiesbadenWiesbadenGermany
| | - Katharina Kriegsmann
- Department of HematologyOncology and RheumatologyHeidelberg UniversityHeidelbergGermany
- Laborarztpraxis Rhein‐Main MVZ GbRFrankfurt am MainFrankfurtGermany
| | - Georg Steinbuss
- Department of HematologyOncology and RheumatologyHeidelberg UniversityHeidelbergGermany
| | | | - Thomas Albrecht
- Institute of PathologyHeidelberg UniversityHeidelbergGermany
| | - Stefan Heinrich
- Department of SurgeryJGU‐MainzUniversity Medical Center MainzMainzGermany
| | - Stefan Farkas
- Department of SurgerySt. Josefs‐ HospitalWiesbadenGermany
| | - Wilfried Roth
- Institute of PathologyJGU‐MainzUniversity Medical Center MainzMainzGermany
| | - Hien Dang
- Department of SurgeryDepartment of Surgical ResearchThomas Jefferson UniversityPhiladelphiaPennsylvaniaUSA
| | - Anne Hausen
- Institute of PathologyJGU‐MainzUniversity Medical Center MainzMainzGermany
| | - Matthias M. Gaida
- Institute of PathologyJGU‐MainzUniversity Medical Center MainzMainzGermany
- TRONJGU‐MainzTranslational Oncology at the University Medical CenterMainzGermany
- Research Center for ImmunotherapyJGU‐MainzUniversity Medical Center MainzMainzGermany
| |
Collapse
|
39
|
Nuutinen M, Leskelä RL. Systematic review of the performance evaluation of clinicians with or without the aid of machine learning clinical decision support system. HEALTH AND TECHNOLOGY 2023; 13:1-14. [PMID: 37363342 PMCID: PMC10262137 DOI: 10.1007/s12553-023-00763-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2023] [Accepted: 06/01/2023] [Indexed: 06/28/2023]
Abstract
Background For the adoption of machine learning clinical decision support systems (ML-CDSS) it is critical to understand the performance aid of the ML-CDSS. However, it is not trivial, how the performance aid should be evaluated. To design reliable performance evaluation study, both the knowledge from the practical framework of experimental study design and the understanding of domain specific design factors are required. Objective The aim of this review study was to form a practical framework and identify key design factors for experimental design in evaluating the performance of clinicians with or without the aid of ML-CDSS. Methods The study was based on published ML-CDSS performance evaluation studies. We systematically searched articles published between January 2016 and December 2022. From the articles we collected a set of design factors. Only the articles comparing the performance of clinicians with or without the aid of ML-CDSS using experimental study methods were considered. Results The identified key design factors for the practical framework of ML-CDSS experimental study design were performance measures, user interface, ground truth data and the selection of samples and participants. In addition, we identified the importance of randomization, crossover design and training and practice rounds. Previous studies had shortcomings in the rationale and documentation of choices regarding the number of participants and the duration of the experiment. Conclusion The design factors of ML-CDSS experimental study are interdependent and all factors must be considered in individual choices. Supplementary Information The online version contains supplementary material available at 10.1007/s12553-023-00763-1.
Collapse
Affiliation(s)
- Mikko Nuutinen
- Nordic Healthcare Group, Helsinki, Finland
- Haartman Institute, University of Helsinki, Helsinki, Finland
| | | |
Collapse
|
40
|
Lee JH, Hong H, Nam G, Hwang EJ, Park CM. Effect of Human-AI Interaction on Detection of Malignant Lung Nodules on Chest Radiographs. Radiology 2023; 307:e222976. [PMID: 37367443 DOI: 10.1148/radiol.222976] [Citation(s) in RCA: 20] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/28/2023]
Abstract
Background The factors affecting radiologists' diagnostic determinations in artificial intelligence (AI)-assisted image reading remain underexplored. Purpose To assess how AI diagnostic performance and reader characteristics influence detection of malignant lung nodules during AI-assisted reading of chest radiographs. Materials and Methods This retrospective study consisted of two reading sessions from April 2021 to June 2021. Based on the first session without AI assistance, 30 readers were assigned into two groups with equivalent areas under the free-response receiver operating characteristic curve (AUFROCs). In the second session, each group reinterpreted radiographs assisted by either a high or low accuracy AI model (blinded to the fact that two different AI models were used). Reader performance for detecting lung cancer and reader susceptibility (changing the original reading following the AI suggestion) were compared. A generalized linear mixed model was used to identify the factors influencing AI-assisted detection performance, including readers' attitudes and experiences of AI and Grit score. Results Of the 120 chest radiographs assessed, 60 were obtained in patients with lung cancer (mean age, 67 years ± 12 [SD]; 32 male; 63 cancers) and 60 in controls (mean age, 67 years ± 12; 36 male). Readers included 20 thoracic radiologists (5-18 years of experience) and 10 radiology residents (2-3 years of experience). Use of the high accuracy AI model improved readers' detection performance to a greater extent than use of the low accuracy AI model (area under the receiver operating characteristic curve, 0.77 to 0.82 vs 0.75 to 0.75; AUFROC, 0.71 to 0.79 vs 0.7 to 0.72). Readers who used the high accuracy AI showed a higher susceptibility (67%, 224 of 334 cases) to changing their diagnosis based on the AI suggestions than those using the low accuracy AI (59%, 229 of 386 cases). Accurate readings at the first session, correct AI suggestions, high accuracy Al, and diagnostic difficulty were associated with accurate AI-assisted readings, but readers' characteristics were not. Conclusion An AI model with high diagnostic accuracy led to improved performance of radiologists in detecting lung cancer on chest radiographs and increased radiologists' susceptibility to AI suggestions. © RSNA, 2023 Supplemental material is available for this article.
Collapse
Affiliation(s)
- Jong Hyuk Lee
- From the Department of Radiology (J.H.L., E.J.H., C.M.P.) and Medical Research Collaborating Center (H.H.), Seoul National University Hospital, Seoul, Korea; Lunit, Seoul, Korea (G.N.); Institute of Medical and Biological Engineering and Institute of Radiation Medicine, Seoul National University Medical Research Center, Seoul, Korea (C.M.P.); and Department of Radiology, Seoul National University College of Medicine, 101 Daehak-ro, Jongno-gu, Seoul 03080, Korea (C.M.P.)
| | - Hyunsook Hong
- From the Department of Radiology (J.H.L., E.J.H., C.M.P.) and Medical Research Collaborating Center (H.H.), Seoul National University Hospital, Seoul, Korea; Lunit, Seoul, Korea (G.N.); Institute of Medical and Biological Engineering and Institute of Radiation Medicine, Seoul National University Medical Research Center, Seoul, Korea (C.M.P.); and Department of Radiology, Seoul National University College of Medicine, 101 Daehak-ro, Jongno-gu, Seoul 03080, Korea (C.M.P.)
| | - Gunhee Nam
- From the Department of Radiology (J.H.L., E.J.H., C.M.P.) and Medical Research Collaborating Center (H.H.), Seoul National University Hospital, Seoul, Korea; Lunit, Seoul, Korea (G.N.); Institute of Medical and Biological Engineering and Institute of Radiation Medicine, Seoul National University Medical Research Center, Seoul, Korea (C.M.P.); and Department of Radiology, Seoul National University College of Medicine, 101 Daehak-ro, Jongno-gu, Seoul 03080, Korea (C.M.P.)
| | - Eui Jin Hwang
- From the Department of Radiology (J.H.L., E.J.H., C.M.P.) and Medical Research Collaborating Center (H.H.), Seoul National University Hospital, Seoul, Korea; Lunit, Seoul, Korea (G.N.); Institute of Medical and Biological Engineering and Institute of Radiation Medicine, Seoul National University Medical Research Center, Seoul, Korea (C.M.P.); and Department of Radiology, Seoul National University College of Medicine, 101 Daehak-ro, Jongno-gu, Seoul 03080, Korea (C.M.P.)
| | - Chang Min Park
- From the Department of Radiology (J.H.L., E.J.H., C.M.P.) and Medical Research Collaborating Center (H.H.), Seoul National University Hospital, Seoul, Korea; Lunit, Seoul, Korea (G.N.); Institute of Medical and Biological Engineering and Institute of Radiation Medicine, Seoul National University Medical Research Center, Seoul, Korea (C.M.P.); and Department of Radiology, Seoul National University College of Medicine, 101 Daehak-ro, Jongno-gu, Seoul 03080, Korea (C.M.P.)
| |
Collapse
|
41
|
Kim M, Sekiya H, Yao G, Martin NB, Castanedes-Casey M, Dickson DW, Hwang TH, Koga S. Diagnosis of Alzheimer Disease and Tauopathies on Whole-Slide Histopathology Images Using a Weakly Supervised Deep Learning Algorithm. J Transl Med 2023; 103:100127. [PMID: 36889541 DOI: 10.1016/j.labinv.2023.100127] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2022] [Revised: 02/06/2023] [Accepted: 02/17/2023] [Indexed: 03/08/2023] Open
Abstract
Neuropathologic assessment during autopsy is the gold standard for diagnosing neurodegenerative disorders. Neurodegenerative conditions, such as Alzheimer disease (AD) neuropathological change, are a continuous process from normal aging rather than categorical; therefore, diagnosing neurodegenerative disorders is a complicated task. We aimed to develop a pipeline for diagnosing AD and other tauopathies, including corticobasal degeneration (CBD), globular glial tauopathy, Pick disease, and progressive supranuclear palsy. We used a weakly supervised deep learning-based approach called clustering-constrained-attention multiple-instance learning (CLAM) on the whole-slide images (WSIs) of patients with AD (n = 30), CBD (n = 20), globular glial tauopathy (n = 10), Pick disease (n = 20), and progressive supranuclear palsy (n = 20), as well as nontauopathy controls (n = 21). Three sections (A: motor cortex; B: cingulate gyrus and superior frontal gyrus; and C: corpus striatum) that had been immunostained for phosphorylated tau were scanned and converted to WSIs. We evaluated 3 models (classic multiple-instance learning, single-attention-branch CLAM, and multiattention-branch CLAM) using 5-fold cross-validation. Attention-based interpretation analysis was performed to identify the morphologic features contributing to the classification. Within highly attended regions, we also augmented gradient-weighted class activation mapping to the model to visualize cellular-level evidence of the model's decisions. The multiattention-branch CLAM model using section B achieved the highest area under the curve (0.970 ± 0.037) and diagnostic accuracy (0.873 ± 0.087). A heatmap showed the highest attention in the gray matter of the superior frontal gyrus in patients with AD and the white matter of the cingulate gyrus in patients with CBD. Gradient-weighted class activation mapping showed the highest attention in characteristic tau lesions for each disease (eg, numerous tau-positive threads in the white matter inclusions for CBD). Our findings support the feasibility of deep learning-based approaches for the classification of neurodegenerative disorders on WSIs. Further investigation of this method, focusing on clinicopathologic correlations, is warranted.
Collapse
Affiliation(s)
- Minji Kim
- Department of Artificial Intelligence and Informatics Research, Mayo Clinic, Jacksonville, Florida
| | - Hiroaki Sekiya
- Department of Neuroscience, Mayo Clinic, Jacksonville, Florida
| | - Gary Yao
- Department of Artificial Intelligence and Informatics Research, Mayo Clinic, Jacksonville, Florida
| | | | | | | | - Tae Hyun Hwang
- Department of Artificial Intelligence and Informatics Research, Mayo Clinic, Jacksonville, Florida
| | - Shunsuke Koga
- Department of Neuroscience, Mayo Clinic, Jacksonville, Florida.
| |
Collapse
|
42
|
Allaume P, Rabilloud N, Turlin B, Bardou-Jacquet E, Loréal O, Calderaro J, Khene ZE, Acosta O, De Crevoisier R, Rioux-Leclercq N, Pecot T, Kammerer-Jacquet SF. Artificial Intelligence-Based Opportunities in Liver Pathology-A Systematic Review. Diagnostics (Basel) 2023; 13:diagnostics13101799. [PMID: 37238283 DOI: 10.3390/diagnostics13101799] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2023] [Revised: 05/04/2023] [Accepted: 05/12/2023] [Indexed: 05/28/2023] Open
Abstract
BACKGROUND Artificial Intelligence (AI)-based Deep Neural Networks (DNNs) can handle a wide range of applications in image analysis, ranging from automated segmentation to diagnostic and prediction. As such, they have revolutionized healthcare, including in the liver pathology field. OBJECTIVE The present study aims to provide a systematic review of applications and performances provided by DNN algorithms in liver pathology throughout the Pubmed and Embase databases up to December 2022, for tumoral, metabolic and inflammatory fields. RESULTS 42 articles were selected and fully reviewed. Each article was evaluated through the Quality Assessment of Diagnostic Accuracy Studies (QUADAS-2) tool, highlighting their risks of bias. CONCLUSIONS DNN-based models are well represented in the field of liver pathology, and their applications are diverse. Most studies, however, presented at least one domain with a high risk of bias according to the QUADAS-2 tool. Hence, DNN models in liver pathology present future opportunities and persistent limitations. To our knowledge, this review is the first one solely focused on DNN-based applications in liver pathology, and to evaluate their bias through the lens of the QUADAS2 tool.
Collapse
Affiliation(s)
- Pierre Allaume
- Department of Pathology CHU de Rennes, Rennes 1 University, Pontchaillou Hospital, 2 rue Henri Le Guilloux, CEDEX 09, 35033 Rennes, France
| | - Noémie Rabilloud
- Impact TEAM, Laboratoire Traitement du Signal et de l'Image (LTSI) INSERM, Rennes 1 University, Pontchaillou Hospital, 35033 Rennes, France
| | - Bruno Turlin
- Department of Pathology CHU de Rennes, Rennes 1 University, Pontchaillou Hospital, 2 rue Henri Le Guilloux, CEDEX 09, 35033 Rennes, France
- Research Unit n°UMR1341 NuMeCan-Nutrition, Métabolismes et Cancer, Rennes 1 University, Pontchaillou Hospital, 2 rue Henri Le Guilloux, CEDEX 09, 35033 Rennes, France
| | - Edouard Bardou-Jacquet
- Research Unit n°UMR1341 NuMeCan-Nutrition, Métabolismes et Cancer, Rennes 1 University, Pontchaillou Hospital, 2 rue Henri Le Guilloux, CEDEX 09, 35033 Rennes, France
- Department of Liver Diseases CHU de Rennes, Rennes 1 University, Pontchaillou Hospital, 35033 Rennes, France
| | - Olivier Loréal
- Research Unit n°UMR1341 NuMeCan-Nutrition, Métabolismes et Cancer, Rennes 1 University, Pontchaillou Hospital, 2 rue Henri Le Guilloux, CEDEX 09, 35033 Rennes, France
| | - Julien Calderaro
- Assistance Publique-Hôpitaux de Paris, Department of Pathology Henri Mondor, 94000 Créteil, France
- INSERM U955, Team Pathophysiology and Therapy of Chronic Viral Hepatitis and Related Cancers, 94000 Créteil, France
| | - Zine-Eddine Khene
- Impact TEAM, Laboratoire Traitement du Signal et de l'Image (LTSI) INSERM, Rennes 1 University, Pontchaillou Hospital, 35033 Rennes, France
- Department of Urology, CHU de Rennes, Rennes 1 University, Pontchaillou Hospital, 2 rue Henri Le Guilloux, CEDEX 09, 35033 Rennes, France
| | - Oscar Acosta
- Impact TEAM, Laboratoire Traitement du Signal et de l'Image (LTSI) INSERM, Rennes 1 University, Pontchaillou Hospital, 35033 Rennes, France
| | - Renaud De Crevoisier
- Impact TEAM, Laboratoire Traitement du Signal et de l'Image (LTSI) INSERM, Rennes 1 University, Pontchaillou Hospital, 35033 Rennes, France
- Department of Radiotherapy, Centre Eugène Marquis, 35033 Rennes, France
| | - Nathalie Rioux-Leclercq
- Department of Pathology CHU de Rennes, Rennes 1 University, Pontchaillou Hospital, 2 rue Henri Le Guilloux, CEDEX 09, 35033 Rennes, France
| | - Thierry Pecot
- Biosit Platform UAR 3480 CNRS US18 INSERM U955, Rennes 1 University, Pontchaillou Hospital, 35033 Rennes, France
| | - Solène-Florence Kammerer-Jacquet
- Department of Pathology CHU de Rennes, Rennes 1 University, Pontchaillou Hospital, 2 rue Henri Le Guilloux, CEDEX 09, 35033 Rennes, France
- Impact TEAM, Laboratoire Traitement du Signal et de l'Image (LTSI) INSERM, Rennes 1 University, Pontchaillou Hospital, 35033 Rennes, France
| |
Collapse
|
43
|
Nazir S, Dickson DM, Akram MU. Survey of explainable artificial intelligence techniques for biomedical imaging with deep neural networks. Comput Biol Med 2023; 156:106668. [PMID: 36863192 DOI: 10.1016/j.compbiomed.2023.106668] [Citation(s) in RCA: 21] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2022] [Revised: 01/12/2023] [Accepted: 02/10/2023] [Indexed: 02/21/2023]
Abstract
Artificial Intelligence (AI) techniques of deep learning have revolutionized the disease diagnosis with their outstanding image classification performance. In spite of the outstanding results, the widespread adoption of these techniques in clinical practice is still taking place at a moderate pace. One of the major hindrance is that a trained Deep Neural Networks (DNN) model provides a prediction, but questions about why and how that prediction was made remain unanswered. This linkage is of utmost importance for the regulated healthcare domain to increase the trust in the automated diagnosis system by the practitioners, patients and other stakeholders. The application of deep learning for medical imaging has to be interpreted with caution due to the health and safety concerns similar to blame attribution in the case of an accident involving autonomous cars. The consequences of both a false positive and false negative cases are far reaching for patients' welfare and cannot be ignored. This is exacerbated by the fact that the state-of-the-art deep learning algorithms comprise of complex interconnected structures, millions of parameters, and a 'black box' nature, offering little understanding of their inner working unlike the traditional machine learning algorithms. Explainable AI (XAI) techniques help to understand model predictions which help develop trust in the system, accelerate the disease diagnosis, and meet adherence to regulatory requirements. This survey provides a comprehensive review of the promising field of XAI for biomedical imaging diagnostics. We also provide a categorization of the XAI techniques, discuss the open challenges, and provide future directions for XAI which would be of interest to clinicians, regulators and model developers.
Collapse
Affiliation(s)
- Sajid Nazir
- Department of Computing, Glasgow Caledonian University, Glasgow, UK.
| | - Diane M Dickson
- Department of Podiatry and Radiography, Research Centre for Health, Glasgow Caledonian University, Glasgow, UK
| | - Muhammad Usman Akram
- Computer and Software Engineering Department, National University of Sciences and Technology, Islamabad, Pakistan
| |
Collapse
|
44
|
Lundström C, Lindvall M. Mapping the Landscape of Care Providers' Quality Assurance Approaches for AI in Diagnostic Imaging. J Digit Imaging 2023; 36:379-387. [PMID: 36352164 PMCID: PMC10039170 DOI: 10.1007/s10278-022-00731-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2022] [Revised: 10/26/2022] [Accepted: 10/28/2022] [Indexed: 11/10/2022] Open
Abstract
The discussion on artificial intelligence (AI) solutions in diagnostic imaging has matured in recent years. The potential value of AI adoption is well established, as are the potential risks associated. Much focus has, rightfully, been on regulatory certification of AI products, with the strong incentive of being an enabling step for the commercial actors. It is, however, becoming evident that regulatory approval is not enough to ensure safe and effective AI usage in the local setting. In other words, care providers need to develop and implement quality assurance (QA) approaches for AI solutions in diagnostic imaging. The domain of AI-specific QA is still in an early development phase. We contribute to this development by describing the current landscape of QA-for-AI approaches in medical imaging, with focus on radiology and pathology. We map the potential quality threats and review the existing QA approaches in relation to those threats. We propose a practical categorization of QA approaches, based on key characteristics corresponding to means, situation, and purpose. The review highlights the heterogeneity of methods and practices relevant for this domain and points to targets for future research efforts.
Collapse
Affiliation(s)
- Claes Lundström
- Center for Medical Image Science and Visualization, Linköping University, Linköping, Sweden.
- Sectra AB, Linköping, Sweden.
| | | |
Collapse
|
45
|
Shen X, Wu J, Su J, Yao Z, Huang W, Zhang L, Jiang Y, Yu W, Li Z. Revisiting artificial intelligence diagnosis of hepatocellular carcinoma with DIKWH framework. Front Genet 2023; 14:1004481. [PMID: 37007970 PMCID: PMC10064216 DOI: 10.3389/fgene.2023.1004481] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2022] [Accepted: 02/14/2023] [Indexed: 03/09/2023] Open
Abstract
Hepatocellular carcinoma (HCC) is the most common type of liver cancer with a high morbidity and fatality rate. Traditional diagnostic methods for HCC are primarily based on clinical presentation, imaging features, and histopathology. With the rapid development of artificial intelligence (AI), which is increasingly used in the diagnosis, treatment, and prognosis prediction of HCC, an automated approach to HCC status classification is promising. AI integrates labeled clinical data, trains on new data of the same type, and performs interpretation tasks. Several studies have shown that AI techniques can help clinicians and radiologists be more efficient and reduce the misdiagnosis rate. However, the coverage of AI technologies leads to difficulty in which the type of AI technology is preferred to choose for a given problem and situation. Solving this concern, it can significantly reduce the time required to determine the required healthcare approach and provide more precise and personalized solutions for different problems. In our review of research work, we summarize existing research works, compare and classify the main results of these according to the specified data, information, knowledge, wisdom (DIKW) framework.
Collapse
Affiliation(s)
- Xiaomin Shen
- State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Center for Infectious Diseases, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, Zhejiang Provincial Key Laboratory for Drug Clinical Research and Evaluation, The First Affiliated Hospital, Zhejiang University, Hangzhou, China
| | - Jinxin Wu
- School of Computer Science, The University of Sydney, Sydney, NSW, Australia
| | - Junwei Su
- State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Center for Infectious Diseases, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, Zhejiang Provincial Key Laboratory for Drug Clinical Research and Evaluation, The First Affiliated Hospital, Zhejiang University, Hangzhou, China
| | - Zhenyu Yao
- School of Computer Science, King’s College London, London, United Kingdom
| | - Wei Huang
- Department of Gastroenterology II, The First Affiliated Hospital, Zhejiang University, Hangzhou, China
| | - Li Zhang
- State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Center for Infectious Diseases, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, Zhejiang Provincial Key Laboratory for Drug Clinical Research and Evaluation, The First Affiliated Hospital, Zhejiang University, Hangzhou, China
| | - Yiheng Jiang
- Clinical Medicine, Nanjing Medical University, Nanjing, China
| | - Wei Yu
- State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Center for Infectious Diseases, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, Zhejiang Provincial Key Laboratory for Drug Clinical Research and Evaluation, The First Affiliated Hospital, Zhejiang University, Hangzhou, China
| | - Zhao Li
- School of Computer Science, Zhejiang University, Hangzhou, China
| |
Collapse
|
46
|
Xue P, Si M, Qin D, Wei B, Seery S, Ye Z, Chen M, Wang S, Song C, Zhang B, Ding M, Zhang W, Bai A, Yan H, Dang L, Zhao Y, Rezhake R, Zhang S, Qiao Y, Qu Y, Jiang Y. Unassisted Clinicians Versus Deep Learning-Assisted Clinicians in Image-Based Cancer Diagnostics: Systematic Review With Meta-analysis. J Med Internet Res 2023; 25:e43832. [PMID: 36862499 PMCID: PMC10020907 DOI: 10.2196/43832] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2022] [Revised: 01/19/2023] [Accepted: 02/13/2023] [Indexed: 02/16/2023] Open
Abstract
BACKGROUND A number of publications have demonstrated that deep learning (DL) algorithms matched or outperformed clinicians in image-based cancer diagnostics, but these algorithms are frequently considered as opponents rather than partners. Despite the clinicians-in-the-loop DL approach having great potential, no study has systematically quantified the diagnostic accuracy of clinicians with and without the assistance of DL in image-based cancer identification. OBJECTIVE We systematically quantified the diagnostic accuracy of clinicians with and without the assistance of DL in image-based cancer identification. METHODS PubMed, Embase, IEEEXplore, and the Cochrane Library were searched for studies published between January 1, 2012, and December 7, 2021. Any type of study design was permitted that focused on comparing unassisted clinicians and DL-assisted clinicians in cancer identification using medical imaging. Studies using medical waveform-data graphics material and those investigating image segmentation rather than classification were excluded. Studies providing binary diagnostic accuracy data and contingency tables were included for further meta-analysis. Two subgroups were defined and analyzed, including cancer type and imaging modality. RESULTS In total, 9796 studies were identified, of which 48 were deemed eligible for systematic review. Twenty-five of these studies made comparisons between unassisted clinicians and DL-assisted clinicians and provided sufficient data for statistical synthesis. We found a pooled sensitivity of 83% (95% CI 80%-86%) for unassisted clinicians and 88% (95% CI 86%-90%) for DL-assisted clinicians. Pooled specificity was 86% (95% CI 83%-88%) for unassisted clinicians and 88% (95% CI 85%-90%) for DL-assisted clinicians. The pooled sensitivity and specificity values for DL-assisted clinicians were higher than for unassisted clinicians, at ratios of 1.07 (95% CI 1.05-1.09) and 1.03 (95% CI 1.02-1.05), respectively. Similar diagnostic performance by DL-assisted clinicians was also observed across the predefined subgroups. CONCLUSIONS The diagnostic performance of DL-assisted clinicians appears better than unassisted clinicians in image-based cancer identification. However, caution should be exercised, because the evidence provided in the reviewed studies does not cover all the minutiae involved in real-world clinical practice. Combining qualitative insights from clinical practice with data-science approaches may improve DL-assisted practice, although further research is required. TRIAL REGISTRATION PROSPERO CRD42021281372; https://www.crd.york.ac.uk/prospero/display_record.php?RecordID=281372.
Collapse
Affiliation(s)
- Peng Xue
- Department of Epidemiology and Biostatistics, School of Population Medicine and Public Health, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Mingyu Si
- Department of Epidemiology and Biostatistics, School of Population Medicine and Public Health, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Dongxu Qin
- Department of Epidemiology and Biostatistics, School of Population Medicine and Public Health, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Bingrui Wei
- Department of Epidemiology and Biostatistics, School of Population Medicine and Public Health, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Samuel Seery
- Faculty of Health and Medicine, Division of Health Research, Lancaster University, Lancaster, United Kingdom
| | - Zichen Ye
- Department of Epidemiology and Biostatistics, School of Population Medicine and Public Health, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Mingyang Chen
- Department of Epidemiology and Biostatistics, School of Population Medicine and Public Health, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Sumeng Wang
- Department of Cancer Epidemiology, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Cheng Song
- Department of Epidemiology and Biostatistics, School of Population Medicine and Public Health, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Bo Zhang
- Department of Epidemiology and Biostatistics, School of Population Medicine and Public Health, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Ming Ding
- Department of Epidemiology and Biostatistics, School of Population Medicine and Public Health, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Wenling Zhang
- Department of Epidemiology and Biostatistics, School of Population Medicine and Public Health, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Anying Bai
- Department of Epidemiology and Biostatistics, School of Population Medicine and Public Health, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Huijiao Yan
- Department of Epidemiology and Biostatistics, School of Population Medicine and Public Health, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Le Dang
- Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Yuqian Zhao
- Sichuan Cancer Hospital & Institute, Sichuan Cancer Center, School of Medicine, University of Electronic Science & Technology of China, Sichuan, China
| | - Remila Rezhake
- Affiliated Cancer Hospital, The 3rd Affiliated Teaching Hospital of Xinjiang Medical University, Xinjiang, China
| | - Shaokai Zhang
- Henan Cancer Hospital, Affiliated Cancer Hospital of Zhengzhou University, Henan, China
| | - Youlin Qiao
- Center for Global Health, School of Population Medicine and Public Health, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Yimin Qu
- Department of Epidemiology and Biostatistics, School of Population Medicine and Public Health, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Yu Jiang
- Department of Epidemiology and Biostatistics, School of Population Medicine and Public Health, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| |
Collapse
|
47
|
Intharah T, Wiratchawa K, Wanna Y, Junsawang P, Titapun A, Techasen A, Boonrod A, Laopaiboon V, Chamadol N, Khuntikeo N. BiTNet: Hybrid deep convolutional model for ultrasound image analysis of human biliary tract and its applications. Artif Intell Med 2023; 139:102539. [PMID: 37100509 DOI: 10.1016/j.artmed.2023.102539] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2022] [Revised: 03/15/2023] [Accepted: 03/27/2023] [Indexed: 04/03/2023]
Abstract
Certain life-threatening abnormalities, such as cholangiocarcinoma, in the human biliary tract are curable if detected at an early stage, and ultrasonography has been proven to be an effective tool for identifying them. However, the diagnosis often requires a second opinion from experienced radiologists, who are usually overwhelmed by many cases. Therefore, we propose a deep convolutional neural network model, named biliary tract network (BiTNet), developed to solve problems in the current screening system and to avoid overconfidence issues of traditional deep convolutional neural networks. Additionally, we present an ultrasound image dataset for the human biliary tract and demonstrate two artificial intelligence (AI) applications: auto-prescreening and assisting tools. The proposed model is the first AI model to automatically screen and diagnose upper-abdominal abnormalities from ultrasound images in real-world healthcare scenarios. Our experiments suggest that prediction probability has an impact on both applications, and our modifications to EfficientNet solve the overconfidence problem, thereby improving the performance of both applications and of healthcare professionals. The proposed BiTNet can reduce the workload of radiologists by 35% while keeping the false negatives to as low as 1 out of every 455 images. Our experiments involving 11 healthcare professionals with four different levels of experience reveal that BiTNet improves the diagnostic performance of participants of all levels. The mean accuracy and precision of the participants with BiTNet as an assisting tool (0.74 and 0.61, respectively) are statistically higher than those of participants without the assisting tool (0.50 and 0.46, respectively (p<0.001)). These experimental results demonstrate the high potential of BiTNet for use in clinical settings.
Collapse
Affiliation(s)
- Thanapong Intharah
- Visual Intelligence Laboratory, Department of Statistics, Faculty of Science, Khon Kaen University, Khon Kaen, Thailand.
| | - Kannika Wiratchawa
- Visual Intelligence Laboratory, Department of Statistics, Faculty of Science, Khon Kaen University, Khon Kaen, Thailand
| | - Yupaporn Wanna
- Visual Intelligence Laboratory, Department of Statistics, Faculty of Science, Khon Kaen University, Khon Kaen, Thailand
| | - Prem Junsawang
- Visual Intelligence Laboratory, Department of Statistics, Faculty of Science, Khon Kaen University, Khon Kaen, Thailand
| | - Attapol Titapun
- Department of Surgery, Faculty of Medicine, Khon Kaen University, Khon Kaen, Thailand; Cholangiocarcinoma Research Institute, Khon Kaen University, Khon Kaen, Thailand
| | - Anchalee Techasen
- Faculty of Associated Medical Sciences, Khon Kaen University, Khon Kaen, Thailand; Cholangiocarcinoma Research Institute, Khon Kaen University, Khon Kaen, Thailand
| | - Arunnit Boonrod
- Department of Radiology, Faculty of Medicine, Khon Kaen University, Khon Kaen, Thailand
| | - Vallop Laopaiboon
- Department of Radiology, Faculty of Medicine, Khon Kaen University, Khon Kaen, Thailand; Cholangiocarcinoma Research Institute, Khon Kaen University, Khon Kaen, Thailand
| | - Nittaya Chamadol
- Department of Radiology, Faculty of Medicine, Khon Kaen University, Khon Kaen, Thailand; Cholangiocarcinoma Research Institute, Khon Kaen University, Khon Kaen, Thailand
| | - Narong Khuntikeo
- Department of Surgery, Faculty of Medicine, Khon Kaen University, Khon Kaen, Thailand; Cholangiocarcinoma Research Institute, Khon Kaen University, Khon Kaen, Thailand.
| |
Collapse
|
48
|
Bakrania A, Joshi N, Zhao X, Zheng G, Bhat M. Artificial intelligence in liver cancers: Decoding the impact of machine learning models in clinical diagnosis of primary liver cancers and liver cancer metastases. Pharmacol Res 2023; 189:106706. [PMID: 36813095 DOI: 10.1016/j.phrs.2023.106706] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/14/2023] [Revised: 02/17/2023] [Accepted: 02/19/2023] [Indexed: 02/22/2023]
Abstract
Liver cancers are the fourth leading cause of cancer-related mortality worldwide. In the past decade, breakthroughs in the field of artificial intelligence (AI) have inspired development of algorithms in the cancer setting. A growing body of recent studies have evaluated machine learning (ML) and deep learning (DL) algorithms for pre-screening, diagnosis and management of liver cancer patients through diagnostic image analysis, biomarker discovery and predicting personalized clinical outcomes. Despite the promise of these early AI tools, there is a significant need to explain the 'black box' of AI and work towards deployment to enable ultimate clinical translatability. Certain emerging fields such as RNA nanomedicine for targeted liver cancer therapy may also benefit from application of AI, specifically in nano-formulation research and development given that they are still largely reliant on lengthy trial-and-error experiments. In this paper, we put forward the current landscape of AI in liver cancers along with the challenges of AI in liver cancer diagnosis and management. Finally, we have discussed the future perspectives of AI application in liver cancer and how a multidisciplinary approach using AI in nanomedicine could accelerate the transition of personalized liver cancer medicine from bench side to the clinic.
Collapse
Affiliation(s)
- Anita Bakrania
- Toronto General Hospital Research Institute, Toronto, ON, Canada; Ajmera Transplant Program, University Health Network, Toronto, ON, Canada; Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada.
| | | | - Xun Zhao
- Toronto General Hospital Research Institute, Toronto, ON, Canada; Ajmera Transplant Program, University Health Network, Toronto, ON, Canada
| | - Gang Zheng
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada; Institute of Biomedical Engineering, University of Toronto, Toronto, ON, Canada; Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
| | - Mamatha Bhat
- Toronto General Hospital Research Institute, Toronto, ON, Canada; Ajmera Transplant Program, University Health Network, Toronto, ON, Canada; Division of Gastroenterology, Department of Medicine, University Health Network and University of Toronto, Toronto, ON, Canada; Department of Medical Sciences, Toronto, ON, Canada.
| |
Collapse
|
49
|
Zhang X, Gleber‐Netto FO, Wang S, Martins‐Chaves RR, Gomez RS, Vigneswaran N, Sarkar A, William WN, Papadimitrakopoulou V, Williams M, Bell D, Palsgrove D, Bishop J, Heymach JV, Gillenwater AM, Myers JN, Ferrarotto R, Lippman SM, Pickering CR, Xiao G. Deep learning-based pathology image analysis predicts cancer progression risk in patients with oral leukoplakia. Cancer Med 2023; 12:7508-7518. [PMID: 36721313 PMCID: PMC10067069 DOI: 10.1002/cam4.5478] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2022] [Revised: 10/15/2022] [Accepted: 11/14/2022] [Indexed: 02/02/2023] Open
Abstract
BACKGROUND Oral leukoplakia (OL) is associated with an increased risk for oral cancer (OC) development. Prediction of OL cancer progression may contribute to decreased OC morbidity and mortality by favoring early intervention. Current OL progression risk assessment approaches face large interobserver variability and is weakly prognostic. We hypothesized that convolutional neural networks (CNN)-based histology image analyses could accelerate the discovery of better OC progression risk models. METHODS Our CNN-based oral mucosa risk stratification model (OMRS) was trained to classify a set of nondysplastic oral mucosa (OM) and a set of OC H&E slides. As a result, the OMRS model could identify abnormal morphological features of the oral epithelium. By applying this model to OL slides, we hypothesized that the extent of OC-like features identified in the OL epithelium would correlate with its progression risk. The OMRS model scored and categorized the OL cohort (n = 62) into high- and low-risk groups. RESULTS OL patients classified as high-risk (n = 31) were 3.98 (95% CI 1.36-11.7) times more likely to develop OC than low-risk ones (n = 31). Time-to-progression significantly differed between high- and low-risk groups (p = 0.003). The 5-year OC development probability was 21.3% for low-risk and 52.5% for high-risk patients. The predictive power of the OMRS model was sustained even after adjustment for age, OL site, and OL dysplasia grading (HR = 4.52, 1.5-13.7). CONCLUSION The ORMS model successfully identified OL patients with a high risk of OC development and can potentially benefit OC early diagnosis and prevention policies.
Collapse
Affiliation(s)
- Xinyi Zhang
- Quantitative Biomedical Research Center, Department of Population and Data SciencesUniversity of Texas Southwestern Medical CenterDallasTexasUSA
| | | | - Shidan Wang
- Quantitative Biomedical Research Center, Department of Population and Data SciencesUniversity of Texas Southwestern Medical CenterDallasTexasUSA
| | | | - Ricardo Santiago Gomez
- Department of Oral Surgery and Pathology, School of DentistryUniversidade Federal de Minas GeraisBelo HorizonteBrazil
| | - Nadarajah Vigneswaran
- Department of Diagnostic and Biomedical SciencesThe University of Texas Health Science Center at Houston School of DentistryHoustonTexasUSA
| | - Arunangshu Sarkar
- Department of Head & Neck SurgeryThe University of Texas MD Anderson Cancer CenterHoustonTexasUSA
| | - William N. William
- Department of Thoracic‐Head & Neck Medical OncologyThe University of Texas MD Anderson Cancer CenterHoustonTexasUSA
- Hospital BPA Beneficência Portuguesa de São PauloSao PaoloBrazil
| | - Vassiliki Papadimitrakopoulou
- Department of Thoracic‐Head & Neck Medical OncologyThe University of Texas MD Anderson Cancer CenterHoustonTexasUSA
- Global Product DevelopmentOncology, Pfizer, Inc.New YorkNew YorkUSA
| | - Michelle Williams
- Department of Anatomical PathologyThe University of Texas MD Anderson Cancer CenterHoustonTexasUSA
| | - Diana Bell
- Department of Anatomical PathologyThe University of Texas MD Anderson Cancer CenterHoustonTexasUSA
- Department of PathologyCity of HopeDuarteCaliforniaUSA
| | - Doreen Palsgrove
- Department of PathologyUniversity of Texas Southwestern Medical CenterDallasTexasUSA
| | - Justin Bishop
- Department of PathologyUniversity of Texas Southwestern Medical CenterDallasTexasUSA
| | - John V. Heymach
- Department of Thoracic‐Head & Neck Medical OncologyThe University of Texas MD Anderson Cancer CenterHoustonTexasUSA
| | - Ann M. Gillenwater
- Department of Head & Neck SurgeryThe University of Texas MD Anderson Cancer CenterHoustonTexasUSA
| | - Jeffrey N. Myers
- Department of Head & Neck SurgeryThe University of Texas MD Anderson Cancer CenterHoustonTexasUSA
| | - Renata Ferrarotto
- Department of Thoracic‐Head & Neck Medical OncologyThe University of Texas MD Anderson Cancer CenterHoustonTexasUSA
| | - Scott M. Lippman
- Department of Thoracic‐Head & Neck Medical OncologyThe University of Texas MD Anderson Cancer CenterHoustonTexasUSA
- Department of MedicineUniversity of California San DiegoSan DiegoCaliforniaUSA
| | - Curtis Rg Pickering
- Department of Head & Neck SurgeryThe University of Texas MD Anderson Cancer CenterHoustonTexasUSA
| | - Guanghua Xiao
- Quantitative Biomedical Research Center, Department of Population and Data SciencesUniversity of Texas Southwestern Medical CenterDallasTexasUSA
- Department of BioinformaticsUniversity of Texas Southwestern Medical CenterDallasTexasUSA
| |
Collapse
|
50
|
Wen Z, Wang S, Yang DM, Xie Y, Chen M, Bishop J, Xiao G. Deep learning in digital pathology for personalized treatment plans of cancer patients. Semin Diagn Pathol 2023; 40:109-119. [PMID: 36890029 DOI: 10.1053/j.semdp.2023.02.003] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Accepted: 02/22/2023] [Indexed: 02/27/2023]
Abstract
Over the past decade, many new cancer treatments have been developed and made available to patients. However, in most cases, these treatments only benefit a specific subgroup of patients, making the selection of treatment for a specific patient an essential but challenging task for oncologists. Although some biomarkers were found to associate with treatment response, manual assessment is time-consuming and subjective. With the rapid developments and expanded implementation of artificial intelligence (AI) in digital pathology, many biomarkers can be quantified automatically from histopathology images. This approach allows for a more efficient and objective assessment of biomarkers, aiding oncologists in formulating personalized treatment plans for cancer patients. This review presents an overview and summary of the recent studies on biomarker quantification and treatment response prediction using hematoxylin-eosin (H&E) stained pathology images. These studies have shown that an AI-based digital pathology approach can be practical and will become increasingly important in improving the selection of cancer treatments for patients.
Collapse
Affiliation(s)
- Zhuoyu Wen
- Quantitative Biomedical Research Center, Department of Population and Data Sciences, University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - Shidan Wang
- Quantitative Biomedical Research Center, Department of Population and Data Sciences, University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - Donghan M Yang
- Quantitative Biomedical Research Center, Department of Population and Data Sciences, University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - Yang Xie
- Quantitative Biomedical Research Center, Department of Population and Data Sciences, University of Texas Southwestern Medical Center, Dallas, TX, USA; Simmons Comprehensive Cancer Center, UT Southwestern Medical Center, Dallas, TX, USA; Department of Pathology, University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - Mingyi Chen
- Department of Bioinformatics, UT Southwestern Medical Center, Dallas, TX, USA
| | - Justin Bishop
- Department of Bioinformatics, UT Southwestern Medical Center, Dallas, TX, USA
| | - Guanghua Xiao
- Quantitative Biomedical Research Center, Department of Population and Data Sciences, University of Texas Southwestern Medical Center, Dallas, TX, USA; Simmons Comprehensive Cancer Center, UT Southwestern Medical Center, Dallas, TX, USA; Department of Pathology, University of Texas Southwestern Medical Center, Dallas, TX, USA.
| |
Collapse
|