51
|
Tran KA, Kondrashova O, Bradley A, Williams ED, Pearson JV, Waddell N. Deep learning in cancer diagnosis, prognosis and treatment selection. Genome Med 2021; 13:152. [PMID: 34579788 PMCID: PMC8477474 DOI: 10.1186/s13073-021-00968-x] [Citation(s) in RCA: 256] [Impact Index Per Article: 85.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2020] [Accepted: 09/12/2021] [Indexed: 12/13/2022] Open
Abstract
Deep learning is a subdiscipline of artificial intelligence that uses a machine learning technique called artificial neural networks to extract patterns and make predictions from large data sets. The increasing adoption of deep learning across healthcare domains together with the availability of highly characterised cancer datasets has accelerated research into the utility of deep learning in the analysis of the complex biology of cancer. While early results are promising, this is a rapidly evolving field with new knowledge emerging in both cancer biology and deep learning. In this review, we provide an overview of emerging deep learning techniques and how they are being applied to oncology. We focus on the deep learning applications for omics data types, including genomic, methylation and transcriptomic data, as well as histopathology-based genomic inference, and provide perspectives on how the different data types can be integrated to develop decision support tools. We provide specific examples of how deep learning may be applied in cancer diagnosis, prognosis and treatment management. We also assess the current limitations and challenges for the application of deep learning in precision oncology, including the lack of phenotypically rich data and the need for more explainable deep learning models. Finally, we conclude with a discussion of how current obstacles can be overcome to enable future clinical utilisation of deep learning.
Collapse
Affiliation(s)
- Khoa A. Tran
- Department of Genetics and Computational Biology, QIMR Berghofer Medical Research Institute, Brisbane, 4006 Australia
- School of Biomedical Sciences, Faculty of Health, Queensland University of Technology (QUT), Brisbane, 4059 Australia
| | - Olga Kondrashova
- Department of Genetics and Computational Biology, QIMR Berghofer Medical Research Institute, Brisbane, 4006 Australia
| | - Andrew Bradley
- Faculty of Engineering, Queensland University of Technology (QUT), Brisbane, 4000 Australia
| | - Elizabeth D. Williams
- School of Biomedical Sciences, Faculty of Health, Queensland University of Technology (QUT), Brisbane, 4059 Australia
- Australian Prostate Cancer Research Centre - Queensland (APCRC-Q) and Queensland Bladder Cancer Initiative (QBCI), Brisbane, 4102 Australia
| | - John V. Pearson
- Department of Genetics and Computational Biology, QIMR Berghofer Medical Research Institute, Brisbane, 4006 Australia
| | - Nicola Waddell
- Department of Genetics and Computational Biology, QIMR Berghofer Medical Research Institute, Brisbane, 4006 Australia
| |
Collapse
|
52
|
Ye Q, Zhang Q, Tian Y, Zhou T, Ge H, Wu J, Lu N, Bai X, Liang T, Li J. Method of Tumor Pathological Micronecrosis Quantification Via Deep Learning From Label Fuzzy Proportions. IEEE J Biomed Health Inform 2021; 25:3288-3299. [PMID: 33822729 DOI: 10.1109/jbhi.2021.3071276] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
The presence of necrosis is associated with tumor progression and patient outcomes in many cancers, but existing analyses rarely adopt quantitative methods because the manual quantification of histopathological features is too expensive. We aim to accurately identify necrotic regions on hematoxylin and eosin (HE)-stained slides and to calculate the ratio of necrosis with minimal annotations on the images. An adaptive method named Learning from Label Fuzzy Proportions (LLFP) was introduced to histopathological image analysis. Two datasets of liver cancer HE slides were collected to verify the feasibility of the method by training on the internal set using cross validation and performing validation on the external set, along with ensemble learning to improve performance. The models from cross validation performed relatively stably in identifying necrosis, with a Concordance Index of the Slide Necrosis Score (CISNS) of 0.9165±0.0089 in the internal test set. The integration model improved the CISNS to 0.9341 and achieved a CISNS of 0.8278 on the external set. There were significant differences in survival (p = 0.0060) between the three groups divided according to the calculated necrosis ratio. The proposed method can build an integration model good at distinguishing necrosis and capable of clinical assistance as an automatic tool to stratify patients with different risks or as a cluster tool for the quantification of histopathological features. We presented a method effective for identifying histopathological features and suggested that the extent of necrosis, especially micronecrosis, in liver cancer is related to patient outcomes.
Collapse
|
53
|
Irshaid L, Bleiberg J, Weinberger E, Garritano J, Shallis RM, Patsenker J, Lindenbaum O, Kluger Y, Katz SG, Xu ML. Histopathologic and Machine Deep Learning Criteria to Predict Lymphoma Transformation in Bone Marrow Biopsies. Arch Pathol Lab Med 2021; 146:182-193. [PMID: 34086849 DOI: 10.5858/arpa.2020-0510-oa] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/19/2021] [Indexed: 11/06/2022]
Abstract
CONTEXT.— Large-cell transformation (LCT) of indolent B-cell lymphomas, such as follicular lymphoma (FL) and chronic lymphocytic leukemia (CLL), signals a worse prognosis, at which point aggressive chemotherapy is initiated. Although LCT is relatively straightforward to diagnose in lymph nodes, a marrow biopsy is often obtained first given its ease of procedure, low cost, and low morbidity. However, consensus criteria for LCT in bone marrow have not been established. OBJECTIVE.— To study the accuracy and reproducibility of a trained convolutional neural network in identifying LCT, in light of promising machine learning tools that may introduce greater objectivity to morphologic analysis. DESIGN.— We retrospectively identified patients who had a diagnosis of FL or CLL who had undergone bone marrow biopsy for the clinical question of LCT. We scored morphologic criteria and correlated results with clinical disease progression. In addition, whole slide scans were annotated into patches to train convolutional neural networks to discriminate between small and large tumor cells and to predict the patient's probability of transformation. RESULTS.— Using morphologic examination, the proportion of large lymphoma cells (≥10% in FL and ≥30% in CLL), chromatin pattern, distinct nucleoli, and proliferation index were significantly correlated with LCT in FL and CLL. Compared to pathologist-derived estimates, machine generated quantification demonstrated better reproducibility and stronger correlation with final outcome data. CONCLUSIONS.— These histologic findings may serve as indications of LCT in bone marrow biopsies. The pathologist-augmented with machine system appeared to be the most predictive, arguing for greater efforts to validate and implement these tools to further enhance physician practice.
Collapse
Affiliation(s)
- Lina Irshaid
- From the Department of Pathology (Irshaid, Garritano, Patsenker, Kluger, Katz, Xu), Yale New Haven Hospital, Yale School of Medicine, New Haven, Connecticut
| | - Jonathan Bleiberg
- The Program of Applied Mathematics, Yale University, New Haven, Connecticut (Bleiberg, Weinberger, Lindenbaum, Kluger)
| | - Ethan Weinberger
- The Program of Applied Mathematics, Yale University, New Haven, Connecticut (Bleiberg, Weinberger, Lindenbaum, Kluger)
| | - James Garritano
- From the Department of Pathology (Irshaid, Garritano, Patsenker, Kluger, Katz, Xu), Yale New Haven Hospital, Yale School of Medicine, New Haven, Connecticut
| | - Rory M Shallis
- Department of Internal Medicine (Shallis), Yale New Haven Hospital, Yale School of Medicine, New Haven, Connecticut
| | - Jonathan Patsenker
- From the Department of Pathology (Irshaid, Garritano, Patsenker, Kluger, Katz, Xu), Yale New Haven Hospital, Yale School of Medicine, New Haven, Connecticut
| | - Ofir Lindenbaum
- The Program of Applied Mathematics, Yale University, New Haven, Connecticut (Bleiberg, Weinberger, Lindenbaum, Kluger)
| | - Yuval Kluger
- From the Department of Pathology (Irshaid, Garritano, Patsenker, Kluger, Katz, Xu), Yale New Haven Hospital, Yale School of Medicine, New Haven, Connecticut.,The Program of Applied Mathematics, Yale University, New Haven, Connecticut (Bleiberg, Weinberger, Lindenbaum, Kluger)
| | - Samuel G Katz
- From the Department of Pathology (Irshaid, Garritano, Patsenker, Kluger, Katz, Xu), Yale New Haven Hospital, Yale School of Medicine, New Haven, Connecticut
| | - Mina L Xu
- From the Department of Pathology (Irshaid, Garritano, Patsenker, Kluger, Katz, Xu), Yale New Haven Hospital, Yale School of Medicine, New Haven, Connecticut
| |
Collapse
|
54
|
Resource-frugal classification and analysis of pathology slides using image entropy. Biomed Signal Process Control 2021. [DOI: 10.1016/j.bspc.2020.102388] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
|
55
|
Yang H, Chen L, Cheng Z, Yang M, Wang J, Lin C, Wang Y, Huang L, Chen Y, Peng S, Ke Z, Li W. Deep learning-based six-type classifier for lung cancer and mimics from histopathological whole slide images: a retrospective study. BMC Med 2021; 19:80. [PMID: 33775248 PMCID: PMC8006383 DOI: 10.1186/s12916-021-01953-2] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/26/2020] [Accepted: 02/26/2021] [Indexed: 12/19/2022] Open
Abstract
BACKGROUND Targeted therapy and immunotherapy put forward higher demands for accurate lung cancer classification, as well as benign versus malignant disease discrimination. Digital whole slide images (WSIs) witnessed the transition from traditional histopathology to computational approaches, arousing a hype of deep learning methods for histopathological analysis. We aimed at exploring the potential of deep learning models in the identification of lung cancer subtypes and cancer mimics from WSIs. METHODS We initially obtained 741 WSIs from the First Affiliated Hospital of Sun Yat-sen University (SYSUFH) for the deep learning model development, optimization, and verification. Additional 318 WSIs from SYSUFH, 212 from Shenzhen People's Hospital, and 422 from The Cancer Genome Atlas were further collected for multi-centre verification. EfficientNet-B5- and ResNet-50-based deep learning methods were developed and compared using the metrics of recall, precision, F1-score, and areas under the curve (AUCs). A threshold-based tumour-first aggregation approach was proposed and implemented for the label inferencing of WSIs with complex tissue components. Four pathologists of different levels from SYSUFH reviewed all the testing slides blindly, and the diagnosing results were used for quantitative comparisons with the best performing deep learning model. RESULTS We developed the first deep learning-based six-type classifier for histopathological WSI classification of lung adenocarcinoma, lung squamous cell carcinoma, small cell lung carcinoma, pulmonary tuberculosis, organizing pneumonia, and normal lung. The EfficientNet-B5-based model outperformed ResNet-50 and was selected as the backbone in the classifier. Tested on 1067 slides from four cohorts of different medical centres, AUCs of 0.970, 0.918, 0.963, and 0.978 were achieved, respectively. The classifier achieved high consistence to the ground truth and attending pathologists with high intraclass correlation coefficients over 0.873. CONCLUSIONS Multi-cohort testing demonstrated our six-type classifier achieved consistent and comparable performance to experienced pathologists and gained advantages over other existing computational methods. The visualization of prediction heatmap improved the model interpretability intuitively. The classifier with the threshold-based tumour-first label inferencing method exhibited excellent accuracy and feasibility in classifying lung cancers and confused nonneoplastic tissues, indicating that deep learning can resolve complex multi-class tissue classification that conforms to real-world histopathological scenarios.
Collapse
Affiliation(s)
- Huan Yang
- Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
| | - Lili Chen
- Department of Pathology, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, 510080, China
| | - Zhiqiang Cheng
- Department of Pathology, Shenzhen People's Hospital, Shenzhen, 518020, China
| | - Minglei Yang
- Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
| | - Jianbo Wang
- Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
| | - Chenghao Lin
- Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
| | - Yuefeng Wang
- Department of Pathology, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, 510080, China
| | - Leilei Huang
- Department of Pathology, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, 510080, China
| | - Yangshan Chen
- Department of Pathology, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, 510080, China
| | - Sui Peng
- Center for Precision Medicine, Sun Yat-sen University, Guangzhou, 510080, China.,Molecular Diagnosis Center or Institute of Precision Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, 510080, China
| | - Zunfu Ke
- Department of Pathology, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, 510080, China. .,Center for Precision Medicine, Sun Yat-sen University, Guangzhou, 510080, China. .,Molecular Diagnosis Center or Institute of Precision Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, 510080, China.
| | - Weizhong Li
- Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China. .,Center for Precision Medicine, Sun Yat-sen University, Guangzhou, 510080, China. .,Key Laboratory of Tropical Disease Control (Ministry of Education), Sun Yat-sen University, Guangzhou, 510080, China.
| |
Collapse
|
56
|
Marostica E, Barber R, Denize T, Kohane IS, Signoretti S, Golden JA, Yu KH. Development of a Histopathology Informatics Pipeline for Classification and Prediction of Clinical Outcomes in Subtypes of Renal Cell Carcinoma. Clin Cancer Res 2021; 27:2868-2878. [PMID: 33722896 DOI: 10.1158/1078-0432.ccr-20-4119] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2020] [Revised: 01/25/2021] [Accepted: 03/10/2021] [Indexed: 12/24/2022]
Abstract
PURPOSE Histopathology evaluation is the gold standard for diagnosing clear cell (ccRCC), papillary, and chromophobe renal cell carcinoma (RCC). However, interrater variability has been reported, and the whole-slide histopathology images likely contain underutilized biological signals predictive of genomic profiles. EXPERIMENTAL DESIGN To address this knowledge gap, we obtained whole-slide histopathology images and demographic, genomic, and clinical data from The Cancer Genome Atlas, the Clinical Proteomic Tumor Analysis Consortium, and Brigham and Women's Hospital (Boston, MA) to develop computational methods for integrating data analyses. Leveraging these large and diverse datasets, we developed fully automated convolutional neural networks to diagnose renal cancers and connect quantitative pathology patterns with patients' genomic profiles and prognoses. RESULTS Our deep convolutional neural networks successfully detected malignancy (AUC in the independent validation cohort: 0.964-0.985), diagnosed RCC histologic subtypes (independent validation AUCs of the best models: 0.953-0.993), and predicted stage I ccRCC patients' survival outcomes (log-rank test P = 0.02). Our machine learning approaches further identified histopathology image features indicative of copy-number alterations (AUC > 0.7 in multiple genes in patients with ccRCC) and tumor mutation burden. CONCLUSIONS Our results suggest that convolutional neural networks can extract histologic signals predictive of patients' diagnoses, prognoses, and genomic variations of clinical importance. Our approaches can systematically identify previously unknown relations among diverse data modalities.
Collapse
Affiliation(s)
- Eliana Marostica
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts
| | - Rebecca Barber
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts.,Department of Computer Science, Princeton University, Princeton, New Jersey
| | - Thomas Denize
- Department of Pathology, Brigham and Women's Hospital, Boston, Massachusetts
| | - Isaac S Kohane
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts
| | - Sabina Signoretti
- Department of Pathology, Brigham and Women's Hospital, Boston, Massachusetts
| | - Jeffrey A Golden
- Department of Pathology, Brigham and Women's Hospital, Boston, Massachusetts.,Cedars-Sinai Medical Center, Los Angeles, California
| | - Kun-Hsing Yu
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts. .,Department of Pathology, Brigham and Women's Hospital, Boston, Massachusetts
| |
Collapse
|
57
|
Noorbakhsh J, Farahmand S, Foroughi Pour A, Namburi S, Caruana D, Rimm D, Soltanieh-Ha M, Zarringhalam K, Chuang JH. Deep learning-based cross-classifications reveal conserved spatial behaviors within tumor histological images. Nat Commun 2020; 11:6367. [PMID: 33311458 PMCID: PMC7733499 DOI: 10.1038/s41467-020-20030-5] [Citation(s) in RCA: 92] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2020] [Accepted: 11/05/2020] [Indexed: 02/07/2023] Open
Abstract
Histopathological images are a rich but incompletely explored data type for studying cancer. Manual inspection is time consuming, making it challenging to use for image data mining. Here we show that convolutional neural networks (CNNs) can be systematically applied across cancer types, enabling comparisons to reveal shared spatial behaviors. We develop CNN architectures to analyze 27,815 hematoxylin and eosin scanned images from The Cancer Genome Atlas for tumor/normal, cancer subtype, and mutation classification. Our CNNs are able to classify TCGA pathologist-annotated tumor/normal status of whole slide images (WSIs) in 19 cancer types with consistently high AUCs (0.995 ± 0.008), as well as subtypes with lower but significant accuracy (AUC 0.87 ± 0.1). Remarkably, tumor/normal CNNs trained on one tissue are effective in others (AUC 0.88 ± 0.11), with classifier relationships also recapitulating known adenocarcinoma, carcinoma, and developmental biology. Moreover, classifier comparisons reveal intra-slide spatial similarities, with an average tile-level correlation of 0.45 ± 0.16 between classifier pairs. Breast cancers, bladder cancers, and uterine cancers have spatial patterns that are particularly easy to detect, suggesting these cancers can be canonical types for image analysis. Patterns for TP53 mutations can also be detected, with WSI self- and cross-tissue AUCs ranging from 0.65-0.80. Finally, we comparatively evaluate CNNs on 170 breast and colon cancer images with pathologist-annotated nuclei, finding that both cellular and intercellular regions contribute to CNN accuracy. These results demonstrate the power of CNNs not only for histopathological classification, but also for cross-comparisons to reveal conserved spatial behaviors across tumors.
Collapse
Affiliation(s)
- Javad Noorbakhsh
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Saman Farahmand
- Computational Sciences PhD Program, University of Massachusetts-Boston, Boston, MA, USA
| | | | - Sandeep Namburi
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Dennis Caruana
- Department of Pathology, Yale University School of Medicine, New Haven, CT, USA
| | - David Rimm
- Department of Pathology, Yale University School of Medicine, New Haven, CT, USA
| | | | - Kourosh Zarringhalam
- Computational Sciences PhD Program, University of Massachusetts-Boston, Boston, MA, USA
- Department of Mathematics, University of Massachusetts-Boston, Boston, MA, USA
| | - Jeffrey H Chuang
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA.
- UCONN Health, Department of Genetics and Genome Sciences, Farmington, CT, USA.
| |
Collapse
|
58
|
Singh N, Mishra A, Sahu DK, Jain M, Shyam H, Tripathi RK, Shankar P, Kumar A, Alam N, Jaiswal R, Kumar S. Comprehensive Characterization of Stage IIIA Non-Small Cell Lung Carcinoma. Cancer Manag Res 2020; 12:11973-11988. [PMID: 33244273 PMCID: PMC7685366 DOI: 10.2147/cmar.s279974] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2020] [Accepted: 10/16/2020] [Indexed: 12/21/2022] Open
Abstract
Introduction Heterogeneity of non-small cell lung carcinoma (NSCLC) among patients is currently not well studied. Pathologic markers and staging systems have not been a precise predictor of the prognosis of an individual patient. Hence, we hypothesize to develop a transcript-based signature to categorize stage IIIA-NSCLC in lung adenocarcinoma (LUAD) and lung squamous cell carcinoma (LUSC), plus identify markers that could indicate the prognosis of the disease. Methods Human Transcriptome Array 2.0 (HTA) and NanoString nCounter® platform were used for high-throughput gene-expression profiling. Initially, we profiled stage IIIA-NSCLC through HTA and validated through NanoString. Additionally, two metastatic markers SPP1 and CDH2 were validated in 47 NSCLC stage IIIA samples through real-time PCR. Results We observed distinct gene clusters in LUAD and LUSC with down-regulation of six genes and up-regulation of 57 genes through HTA. Ninety-six transcripts were randomly selected after analyzing HTA data and validated on the NanoString platform. We found 40 differentially expressed transcripts that categorized NSCLC into LUAD and LUSC. SPP1 is significantly overexpressed (4.311±1.27 fold in LUAD and 13.41±3.82 fold in LUSC compared to control), and the CDH2 transcript was significantly overexpressed (11.53 ± 4.027-fold compared to control) only in LUSC. Discussion These markers enable us to categorize stage IIIA NSCLC into LUAD and LUSC plus these markers may be helpful to understand the pathophysiology of NSCLC. However, more data required to make these findings useful in general clinical practice.
Collapse
Affiliation(s)
- Neetu Singh
- Department of Centre for Advanced Research, King George's Medical University, Lucknow, 226003, India
| | - Archana Mishra
- Department of Surgery, King George's Medical University, Lucknow 226003, India
| | - Dinesh Kumar Sahu
- Department of Centre for Advanced Research, King George's Medical University, Lucknow, 226003, India
| | - Mayank Jain
- Department of Centre for Advanced Research, King George's Medical University, Lucknow, 226003, India
| | - Hari Shyam
- Department of Centre for Advanced Research, King George's Medical University, Lucknow, 226003, India
| | - Ratnesh Kumar Tripathi
- Department of Centre for Advanced Research, King George's Medical University, Lucknow, 226003, India
| | - Pratap Shankar
- Department of Centre for Advanced Research, King George's Medical University, Lucknow, 226003, India
| | - Anil Kumar
- Department of Centre for Advanced Research, King George's Medical University, Lucknow, 226003, India
| | - Nawazish Alam
- Department of Centre for Advanced Research, King George's Medical University, Lucknow, 226003, India
| | - Riddhi Jaiswal
- Department of Pathology, King George's Medical University, Lucknow 226003, India
| | - Shailendra Kumar
- Department of Surgery, King George's Medical University, Lucknow 226003, India
| |
Collapse
|
59
|
Sakamoto T, Furukawa T, Lami K, Pham HHN, Uegami W, Kuroda K, Kawai M, Sakanashi H, Cooper LAD, Bychkov A, Fukuoka J. A narrative review of digital pathology and artificial intelligence: focusing on lung cancer. Transl Lung Cancer Res 2020; 9:2255-2276. [PMID: 33209648 PMCID: PMC7653145 DOI: 10.21037/tlcr-20-591] [Citation(s) in RCA: 43] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
The emergence of whole slide imaging technology allows for pathology diagnosis on a computer screen. The applications of digital pathology are expanding, from supporting remote institutes suffering from a shortage of pathologists to routine use in daily diagnosis including that of lung cancer. Through practice and research large archival databases of digital pathology images have been developed that will facilitate the development of artificial intelligence (AI) methods for image analysis. Currently, several AI applications have been reported in the field of lung cancer; these include the segmentation of carcinoma foci, detection of lymph node metastasis, counting of tumor cells, and prediction of gene mutations. Although the integration of AI algorithms into clinical practice remains a significant challenge, we have implemented tumor cell count for genetic analysis, a helpful application for routine use. Our experience suggests that pathologists often overestimate the contents of tumor cells, and the use of AI-based analysis increases the accuracy and makes the tasks less tedious. However, there are several difficulties encountered in the practical use of AI in clinical diagnosis. These include the lack of sufficient annotated data for the development and validation of AI systems, the explainability of black box AI models, such as those based on deep learning that offer the most promising performance, and the difficulty in defining the ground truth data for training and validation owing to inherent ambiguity in most applications. All of these together present significant challenges in the development and clinical translation of AI methods in the practice of pathology. Additional research on these problems will help in resolving the barriers to the clinical use of AI. Helping pathologists in developing knowledge of the working and limitations of AI will benefit the use of AI in both diagnostics and research.
Collapse
Affiliation(s)
- Taro Sakamoto
- Department of Pathology, Nagasaki University Graduate School of Biomedical Sciences, Nagasaki, Japan
| | - Tomoi Furukawa
- Department of Pathology, Nagasaki University Graduate School of Biomedical Sciences, Nagasaki, Japan
| | - Kris Lami
- Department of Pathology, Nagasaki University Graduate School of Biomedical Sciences, Nagasaki, Japan
| | - Hoa Hoang Ngoc Pham
- Department of Pathology, Nagasaki University Graduate School of Biomedical Sciences, Nagasaki, Japan
| | - Wataru Uegami
- Department of Pathology, Kameda Medical Center, Kamogawa, Chiba, Japan
| | - Kishio Kuroda
- Department of Pathology, Nagasaki University Graduate School of Biomedical Sciences, Nagasaki, Japan
| | - Masataka Kawai
- Department of Pathology, Interdisciplinary Graduate School of Medicine and Engineering, University of Yamanashi, Yamanashi, Japan
| | - Hidenori Sakanashi
- Configurable Learning Mechanism Research Team, National Institute of Advanced Industrial Science and Technology, Tokyo, Japan
| | | | - Andrey Bychkov
- Department of Pathology, Kameda Medical Center, Kamogawa, Chiba, Japan
| | - Junya Fukuoka
- Department of Pathology, Nagasaki University Graduate School of Biomedical Sciences, Nagasaki, Japan.,Department of Pathology, Kameda Medical Center, Kamogawa, Chiba, Japan
| |
Collapse
|
60
|
Yu KH, Hu V, Wang F, Matulonis UA, Mutter GL, Golden JA, Kohane IS. Deciphering serous ovarian carcinoma histopathology and platinum response by convolutional neural networks. BMC Med 2020; 18:236. [PMID: 32807164 PMCID: PMC7433108 DOI: 10.1186/s12916-020-01684-w] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/06/2020] [Accepted: 06/28/2020] [Indexed: 01/02/2023] Open
Abstract
BACKGROUND Ovarian cancer causes 151,900 deaths per year worldwide. Treatment and prognosis are primarily determined by the histopathologic interpretation in combination with molecular diagnosis. However, the relationship between histopathology patterns and molecular alterations is not fully understood, and it is difficult to predict patients' chemotherapy response using the known clinical and histological variables. METHODS We analyzed the whole-slide histopathology images, RNA-Seq, and proteomics data from 587 primary serous ovarian adenocarcinoma patients and developed a systematic algorithm to integrate histopathology and functional omics findings and to predict patients' response to platinum-based chemotherapy. RESULTS Our convolutional neural networks identified the cancerous regions with areas under the receiver operating characteristic curve (AUCs) > 0.95 and classified tumor grade with AUCs > 0.80. Functional omics analysis revealed that expression levels of proteins participated in innate immune responses and catabolic pathways are associated with tumor grade. Quantitative histopathology analysis successfully stratified patients with different response to platinum-based chemotherapy (P = 0.003). CONCLUSIONS These results indicated the potential clinical utility of quantitative histopathology evaluation in tumor cell detection and chemotherapy response prediction. The developed algorithm is easily extensible to other tumor types and treatment modalities.
Collapse
Affiliation(s)
- Kun-Hsing Yu
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA. .,Department of Pathology, Brigham and Women's Hospital, Boston, MA, USA.
| | - Vincent Hu
- Department of Bioengineering, University of California San Diego, San Diego, CA, USA
| | - Feiran Wang
- Department of Electrical Engineering, Stanford University, Stanford, CA, USA
| | - Ursula A Matulonis
- Division of Gynecologic Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
| | - George L Mutter
- Department of Pathology, Brigham and Women's Hospital, Boston, MA, USA
| | - Jeffrey A Golden
- Department of Pathology, Brigham and Women's Hospital, Boston, MA, USA
| | - Isaac S Kohane
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
61
|
Yu KH, Lee TLM, Yen MH, Kou SC, Rosen B, Chiang JH, Kohane IS. Reproducible Machine Learning Methods for Lung Cancer Detection Using Computed Tomography Images: Algorithm Development and Validation. J Med Internet Res 2020; 22:e16709. [PMID: 32755895 PMCID: PMC7439139 DOI: 10.2196/16709] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2019] [Revised: 05/25/2020] [Accepted: 06/11/2020] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND Chest computed tomography (CT) is crucial for the detection of lung cancer, and many automated CT evaluation methods have been proposed. Due to the divergent software dependencies of the reported approaches, the developed methods are rarely compared or reproduced. OBJECTIVE The goal of the research was to generate reproducible machine learning modules for lung cancer detection and compare the approaches and performances of the award-winning algorithms developed in the Kaggle Data Science Bowl. METHODS We obtained the source codes of all award-winning solutions of the Kaggle Data Science Bowl Challenge, where participants developed automated CT evaluation methods to detect lung cancer (training set n=1397, public test set n=198, final test set n=506). The performance of the algorithms was evaluated by the log-loss function, and the Spearman correlation coefficient of the performance in the public and final test sets was computed. RESULTS Most solutions implemented distinct image preprocessing, segmentation, and classification modules. Variants of U-Net, VGGNet, and residual net were commonly used in nodule segmentation, and transfer learning was used in most of the classification algorithms. Substantial performance variations in the public and final test sets were observed (Spearman correlation coefficient = .39 among the top 10 teams). To ensure the reproducibility of results, we generated a Docker container for each of the top solutions. CONCLUSIONS We compared the award-winning algorithms for lung cancer detection and generated reproducible Docker images for the top solutions. Although convolutional neural networks achieved decent accuracy, there is plenty of room for improvement regarding model generalizability.
Collapse
Affiliation(s)
- Kun-Hsing Yu
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, United States.,Department of Statistics, Harvard University, Cambridge, MA, United States.,Department of Pathology, Brigham and Women's Hospital, Boston, MA, United States
| | | | - Ming-Hsuan Yen
- Graduate Program of Multimedia Systems and Intelligent Computing, National Cheng Kung University and Academia Sinica, Tainan, Taiwan.,Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan, Taiwan
| | - S C Kou
- Department of Statistics, Harvard University, Cambridge, MA, United States
| | - Bruce Rosen
- Department of Radiology, Athinoula A Martinos Center for Biomedical Imaging, Massachusetts General Hospital, Boston, MA, United States.,Division of Health Sciences and Technology, Harvard-Massachusetts Institute of Technology, Boston, MA, United States
| | - Jung-Hsien Chiang
- Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan, Taiwan
| | - Isaac S Kohane
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, United States.,Division of Health Sciences and Technology, Harvard-Massachusetts Institute of Technology, Boston, MA, United States
| |
Collapse
|
62
|
He B, Bergenstråhle L, Stenbeck L, Abid A, Andersson A, Borg Å, Maaskola J, Lundeberg J, Zou J. Integrating spatial gene expression and breast tumour morphology via deep learning. Nat Biomed Eng 2020; 4:827-834. [PMID: 32572199 DOI: 10.1038/s41551-020-0578-x] [Citation(s) in RCA: 160] [Impact Index Per Article: 40.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2019] [Accepted: 05/23/2020] [Indexed: 11/09/2022]
Abstract
Spatial transcriptomics allows for the measurement of RNA abundance at a high spatial resolution, making it possible to systematically link the morphology of cellular neighbourhoods and spatially localized gene expression. Here, we report the development of a deep learning algorithm for the prediction of local gene expression from haematoxylin-and-eosin-stained histopathology images using a new dataset of 30,612 spatially resolved gene expression data matched to histopathology images from 23 patients with breast cancer. We identified over 100 genes, including known breast cancer biomarkers of intratumoral heterogeneity and the co-localization of tumour growth and immune activation, the expression of which can be predicted from the histopathology images at a resolution of 100 µm. We also show that the algorithm generalizes well to The Cancer Genome Atlas and to other breast cancer gene expression datasets without the need for re-training. Predicting the spatially resolved transcriptome of a tissue directly from tissue images may enable image-based screening for molecular biomarkers with spatial variation.
Collapse
Affiliation(s)
- Bryan He
- Department of Computer Science, Stanford University, Stanford, CA, USA
| | | | - Linnea Stenbeck
- School of Biotechnology, KTH Royal Institute of Technology, Stockholm, Sweden
| | - Abubakar Abid
- Department of Electrical Engineering, Stanford University, Stanford, CA, USA
| | - Alma Andersson
- School of Biotechnology, KTH Royal Institute of Technology, Stockholm, Sweden
| | - Åke Borg
- Division of Oncology and Pathology, Lund University, Lund, Sweden
| | - Jonas Maaskola
- School of Biotechnology, KTH Royal Institute of Technology, Stockholm, Sweden.
| | - Joakim Lundeberg
- School of Biotechnology, KTH Royal Institute of Technology, Stockholm, Sweden.
| | - James Zou
- Department of Computer Science, Stanford University, Stanford, CA, USA. .,Department of Electrical Engineering, Stanford University, Stanford, CA, USA. .,Department of Biomedical Data Science, Stanford University, Stanford, CA, USA. .,Chan-Zuckerberg Biohub, San Francisco, CA, USA.
| |
Collapse
|