1
|
DeVoe K, Takahashi G, Tarshizi E, Sacker A. Evaluation of the precision and accuracy in the classification of breast histopathology images using the MobileNetV3 model. J Pathol Inform 2024; 15:100377. [PMID: 38706514 PMCID: PMC11066512 DOI: 10.1016/j.jpi.2024.100377] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2024] [Revised: 04/05/2024] [Accepted: 04/08/2024] [Indexed: 05/07/2024] Open
Abstract
Accurate surgical pathological assessment of breast biopsies is essential to the proper management of breast lesions. Identifying histological features, such as nuclear pleomorphism, increased mitotic activity, cellular atypia, patterns of architectural disruption, as well as invasion through basement membranes into surrounding stroma and normal structures, including invasion of vascular and lymphatic spaces, help to classify lesions as malignant. This visual assessment is repeated on numerous slides taken at various sections through the resected tumor, each at different magnifications. Computer vision models have been proposed to assist human pathologists in classification tasks such as these. Using MobileNetV3, a convolutional architecture designed to achieve high accuracy with a compact parameter footprint, we attempted to classify breast cancer images in the BreakHis_v1 breast pathology dataset to determine the performance of this model out-of-the-box. Using transfer learning to take advantage of ImageNet embeddings without special feature extraction, we were able to correctly classify histopathology images broadly as benign or malignant with 0.98 precision, 0.97 recall, and an F1 score of 0.98. The ability to classify into histological subcategories was varied, with the greatest success being with classifying ductal carcinoma (accuracy 0.95), and the lowest success being with lobular carcinoma (accuracy 0.59). Multiclass ROC assessment of performance as a multiclass classifier yielded AUC values ≥0.97 in both benign and malignant subsets. In comparison with previous efforts, using older and larger convolutional network architectures with feature extraction pre-processing, our work highlights that modern, resource-efficient architectures can classify histopathological images with accuracy that at least matches that of previous efforts, without the need for labor-intensive feature extraction protocols. Suggestions to further refine the model are discussed.
Collapse
Affiliation(s)
- Kenneth DeVoe
- Shiley-Marcos School of Engineering, Applied Artificial Intelligence MS Program, University of San Diego, 5998 Alcalá Park, San Diego, CA 92110, USA
| | - Gary Takahashi
- Shiley-Marcos School of Engineering, Applied Artificial Intelligence MS Program, University of San Diego, 5998 Alcalá Park, San Diego, CA 92110, USA
| | - Ebrahim Tarshizi
- Shiley-Marcos School of Engineering, Applied Artificial Intelligence MS Program, University of San Diego, 5998 Alcalá Park, San Diego, CA 92110, USA
| | - Allan Sacker
- Department of Pathology, Providence St. Vincent Medical Center, 9205 SW Barnes Road, Portland, OR 97225, USA
| |
Collapse
|
2
|
Gonzalez R, Saha A, Campbell CJ, Nejat P, Lokker C, Norgan AP. Seeing the random forest through the decision trees. Supporting learning health systems from histopathology with machine learning models: Challenges and opportunities. J Pathol Inform 2024; 15:100347. [PMID: 38162950 PMCID: PMC10755052 DOI: 10.1016/j.jpi.2023.100347] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Revised: 10/06/2023] [Accepted: 11/01/2023] [Indexed: 01/03/2024] Open
Abstract
This paper discusses some overlooked challenges faced when working with machine learning models for histopathology and presents a novel opportunity to support "Learning Health Systems" with them. Initially, the authors elaborate on these challenges after separating them according to their mitigation strategies: those that need innovative approaches, time, or future technological capabilities and those that require a conceptual reappraisal from a critical perspective. Then, a novel opportunity to support "Learning Health Systems" by integrating hidden information extracted by ML models from digitalized histopathology slides with other healthcare big data is presented.
Collapse
Affiliation(s)
- Ricardo Gonzalez
- DeGroote School of Business, McMaster University, Hamilton, Ontario, Canada
- Division of Computational Pathology and Artificial Intelligence, Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, United States
| | - Ashirbani Saha
- Department of Oncology, Faculty of Health Sciences, McMaster University, Hamilton, Ontario, Canada
- Escarpment Cancer Research Institute, McMaster University and Hamilton Health Sciences, Hamilton, Ontario, Canada
| | - Clinton J.V. Campbell
- William Osler Health System, Brampton, Ontario, Canada
- Department of Pathology and Molecular Medicine, Faculty of Health Sciences, McMaster University, Hamilton, Ontario, Canada
| | - Peyman Nejat
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN, United States
| | - Cynthia Lokker
- Health Information Research Unit, Department of Health Research Methods, Evidence and Impact, McMaster University, Hamilton, Ontario, Canada
| | - Andrew P. Norgan
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, United States
| |
Collapse
|
3
|
Prassas I, Clarke B, Youssef T, Phlamon J, Dimitrakopoulos L, Rofaeil A, Yousef GM. Computational pathology: an evolving concept. Clin Chem Lab Med 2024; 62:2148-2155. [PMID: 38646706 DOI: 10.1515/cclm-2023-1124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2023] [Accepted: 04/10/2024] [Indexed: 04/23/2024]
Abstract
The initial enthusiasm about computational pathology (CP) and artificial intelligence (AI) was that they will replace pathologists entirely on the way to fully automated diagnostics. It is becoming clear that currently this is not the immediate model to pursue. On top of the legal and regulatory complexities surrounding its implementation, the majority of tested machine learning (ML)-based predictive algorithms do not display the exquisite performance needed to render them unequivocal, standalone decision makers for matters with direct implications to human health. We are thus moving into a different model of "computer-assisted diagnostics", where AI is there to provide support, rather than replacing, the pathologist. Herein we focus on the practical aspects of CP, from a pathologist perspective. There is a wide range of potential applications where CP can enhance precision of pathology diagnosis, tailor prognostic and predictive information, as well as save time. There are, however, a number of potential limitations for CP that currently hinder their wider adoption in the clinical setting. We address the key necessary steps towards clinical implementation of computational pathology, discuss the significant obstacles that hinders its adoption in the clinical context and summarize some proposed solutions. We conclude that the advancement of CP in the clinic is a promising resource-intensive endeavour that requires broad and inclusive collaborations between academia, industry, and regulatory bodies.
Collapse
Affiliation(s)
- Ioannis Prassas
- Laboratory Medicine Program, 7989 University Health Network , Toronto, ON, Canada
- Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, ON, Canada
| | - Blaise Clarke
- Laboratory Medicine Program, 7989 University Health Network , Toronto, ON, Canada
- Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, ON, Canada
| | - Timothy Youssef
- Laboratory Medicine Program, 7989 University Health Network , Toronto, ON, Canada
| | - Juliana Phlamon
- Laboratory Medicine Program, 7989 University Health Network , Toronto, ON, Canada
| | | | - Andrew Rofaeil
- Laboratory Medicine Program, 7989 University Health Network , Toronto, ON, Canada
| | - George M Yousef
- Laboratory Medicine Program, 7989 University Health Network , Toronto, ON, Canada
- Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, ON, Canada
| |
Collapse
|
4
|
Chakraborty C, Bhattacharya M, Lee SS, Wen ZH, Lo YH. The changing scenario of drug discovery using AI to deep learning: Recent advancement, success stories, collaborations, and challenges. MOLECULAR THERAPY. NUCLEIC ACIDS 2024; 35:102295. [PMID: 39257717 PMCID: PMC11386122 DOI: 10.1016/j.omtn.2024.102295] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/12/2024]
Abstract
Due to the transformation of artificial intelligence (AI) tools and technologies, AI-driven drug discovery has come to the forefront. It reduces the time and expenditure. Due to these advantages, pharmaceutical industries are concentrating on AI-driven drug discovery. Several drug molecules have been discovered using AI-based techniques and tools, and several newly AI-discovered drug molecules have already entered clinical trials. In this review, we first present the data and their resources in the pharmaceutical sector for AI-driven drug discovery and illustrated some significant algorithms or techniques used for AI and ML which are used in this field. We gave an overview of the deep neural network (NN) models and compared them with artificial NNs. Then, we illustrate the recent advancement of the landscape of drug discovery using AI to deep learning, such as the identification of drug targets, prediction of their structure, estimation of drug-target interaction, estimation of drug-target binding affinity, design of de novo drug, prediction of drug toxicity, estimation of absorption, distribution, metabolism, excretion, toxicity; and estimation of drug-drug interaction. Moreover, we highlighted the success stories of AI-driven drug discovery and discussed several collaboration and the challenges in this area. The discussions in the article will enrich the pharmaceutical industry.
Collapse
Affiliation(s)
- Chiranjib Chakraborty
- Department of Biotechnology, School of Life Science and Biotechnology, Adamas University, Kolkata, West Bengal 700126, India
| | - Manojit Bhattacharya
- Department of Zoology, Fakir Mohan University, Vyasa Vihar, Balasore, Odisha 756020, India
| | - Sang-Soo Lee
- Institute for Skeletal Aging & Orthopedic Surgery, Hallym University-Chuncheon Sacred Heart Hospital, Chuncheon, Gangwon-Do 24252, Republic of Korea
| | - Zhi-Hong Wen
- Department of Marine Biotechnology and Resources, National Sun Yat-sen University, Kaohsiung 80424, Taiwan
| | - Yi-Hao Lo
- Department of Family Medicine, Zuoying Armed Forces General Hospital, Kaohsiung 813204, Taiwan
- Shu-Zen Junior College of Medicine and Management, Kaohsiung 821004, Taiwan
- Institute of Medical Science and Technology, National Sun Yat-sen University, Kaohsiung 804201, Taiwan
| |
Collapse
|
5
|
Sun G, Cai L, Yan X, Nie W, Liu X, Xu J, Zou X. A prediction model based on digital breast pathology image information. PLoS One 2024; 19:e0294923. [PMID: 38758814 PMCID: PMC11101065 DOI: 10.1371/journal.pone.0294923] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Accepted: 11/11/2023] [Indexed: 05/19/2024] Open
Abstract
BACKGROUND The workload of breast cancer pathological diagnosis is very heavy. The purpose of this study is to establish a nomogram model based on pathological images to predict the benign and malignant nature of breast diseases and to validate its predictive performance. METHODS In retrospect, a total of 2,723 H&E-stained pathological images were collected from 1,474 patients at Qingdao Central Hospital between 2019 and 2022. The dataset consisted of 509 benign tumor images (adenosis and fibroadenoma) and 2,214 malignant tumor images (infiltrating ductal carcinoma). The images were divided into a training set (1,907) and a validation set (816). Python3.7 was used to extract the values of the R channel, G channel, B channel, and one-dimensional information entropy from all images. Multivariable logistic regression was used to select variables and establish the breast tissue pathological image prediction model. RESULTS The R channel value, B channel value, and one-dimensional information entropy of the images were identified as independent predictive factors for the classification of benign and malignant pathological images (P < 0.05). The area under the curve (AUC) of the nomogram model in the training set was 0.889 (95% CI: 0.869, 0.909), and the AUC in the validation set was 0.838 (95% CI: 0.7980.877). The calibration curve results showed that the calibration curve of this nomogram model was close to the ideal curve. The decision curve results indicated that the predictive model curve had a high value for auxiliary diagnosis. CONCLUSION The nomogram model for the prediction of benign and malignant breast diseases based on pathological images demonstrates good predictive performance. This model can assist in the diagnosis of breast tissue pathological images.
Collapse
Affiliation(s)
- Guoxin Sun
- School of Clinical Medicine, Qingdao University, Qingdao, China
| | - Liying Cai
- College of Nursing and Rehabilitation, North China University of Science and Technology, Tangshan City, China
| | - Xiong Yan
- Department of Pathology, Qingdao Central Hospital, Qingdao, China
| | - Weihong Nie
- School of Clinical Medicine, Qingdao University, Qingdao, China
| | - Xin Liu
- School of Clinical Medicine, Qingdao University, Qingdao, China
| | - Jing Xu
- Department of Pathology, Qingdao Central Hospital, Qingdao, China
| | - Xiao Zou
- Department of Breast Surgery, Xiangdong Hospital Affiliated to Hunan Normal University, Hunan, China
| |
Collapse
|
6
|
Zarif S, Abdulkader H, Elaraby I, Alharbi A, Elkilani WS, Pławiak P. Using hybrid pre-trained models for breast cancer detection. PLoS One 2024; 19:e0296912. [PMID: 38252633 PMCID: PMC10802945 DOI: 10.1371/journal.pone.0296912] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Accepted: 12/21/2023] [Indexed: 01/24/2024] Open
Abstract
Breast cancer is a prevalent and life-threatening disease that affects women globally. Early detection and access to top-notch treatment are crucial in preventing fatalities from this condition. However, manual breast histopathology image analysis is time-consuming and prone to errors. This study proposed a hybrid deep learning model (CNN+EfficientNetV2B3). The proposed approach utilizes convolutional neural networks (CNNs) for the identification of positive invasive ductal carcinoma (IDC) and negative (non-IDC) tissue using whole slide images (WSIs), which use pre-trained models to classify breast cancer in images, supporting pathologists in making more accurate diagnoses. The proposed model demonstrates outstanding performance with an accuracy of 96.3%, precision of 93.4%, recall of 86.4%, F1-score of 89.7%, Matthew's correlation coefficient (MCC) of 87.6%, the Area Under the Curve (AUC) of a Receiver Operating Characteristic (ROC) curve of 97.5%, and the Area Under the Curve of the Precision-Recall Curve (AUPRC) of 96.8%, which outperforms the accuracy achieved by other models. The proposed model was also tested against MobileNet+DenseNet121, MobileNetV2+EfficientNetV2B0, and other deep learning models, proving more powerful than contemporary machine learning and deep learning approaches.
Collapse
Affiliation(s)
- Sameh Zarif
- Department of Information Technology, Faculty of Computers and Information, Menoufia University, Shebin El-kom, Menoufia, Egypt
- Artificial Intelligence Department, Faculty of Artificial Intelligence, Egyptian Russian University, Cairo, Egypt
| | - Hatem Abdulkader
- Department of Information Systems, Faculty of Computers and Information, Menoufia University, Shebin El-kom, Menoufia, Egypt
| | - Ibrahim Elaraby
- Department of Information Systems Management, Higher Institute of Qualitative Studies, Cairo, Egypt
| | - Abdullah Alharbi
- Department of Computer Science, Community College, King Saud University, Riyadh, Saudi Arabia
| | - Wail S. Elkilani
- College of Applied Computer Science, King Saud University, Riyadh, Saudi Arabia
| | - Paweł Pławiak
- Department of Computer Science, Faculty of Computer Science and Telecommunications, Cracow University of Technology, Krakow, Poland
| |
Collapse
|
7
|
Bakasa W, Viriri S. VGG16 Feature Extractor with Extreme Gradient Boost Classifier for Pancreas Cancer Prediction. J Imaging 2023; 9:138. [PMID: 37504815 PMCID: PMC10381878 DOI: 10.3390/jimaging9070138] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Revised: 06/19/2023] [Accepted: 07/04/2023] [Indexed: 07/29/2023] Open
Abstract
The prognosis of patients with pancreatic ductal adenocarcinoma (PDAC) is greatly improved by an early and accurate diagnosis. Several studies have created automated methods to forecast PDAC development utilising various medical imaging modalities. These papers give a general overview of the classification, segmentation, or grading of many cancer types utilising conventional machine learning techniques and hand-engineered characteristics, including pancreatic cancer. This study uses cutting-edge deep learning techniques to identify PDAC utilising computerised tomography (CT) medical imaging modalities. This work suggests that the hybrid model VGG16-XGBoost (VGG16-backbone feature extractor and Extreme Gradient Boosting-classifier) for PDAC images. According to studies, the proposed hybrid model performs better, obtaining an accuracy of 0.97 and a weighted F1 score of 0.97 for the dataset under study. The experimental validation of the VGG16-XGBoost model uses the Cancer Imaging Archive (TCIA) public access dataset, which has pancreas CT images. The results of this study can be extremely helpful for PDAC diagnosis from computerised tomography (CT) pancreas images, categorising them into five different tumours (T), node (N), and metastases (M) (TNM) staging system class labels, which are T0, T1, T2, T3, and T4.
Collapse
Affiliation(s)
- Wilson Bakasa
- School of Mathematics, Statistics and Computer Science, University of KwaZulu-Natal, Durban 4041, South Africa
| | - Serestina Viriri
- School of Mathematics, Statistics and Computer Science, University of KwaZulu-Natal, Durban 4041, South Africa
| |
Collapse
|
8
|
del Olmo Lianes I, Yubero P, Gómez-Luengo Á, Nogales J, Espeso DR. Technical upgrade of an open-source liquid handler to support bacterial colony screening. Front Bioeng Biotechnol 2023; 11:1202836. [PMID: 37404684 PMCID: PMC10315574 DOI: 10.3389/fbioe.2023.1202836] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2023] [Accepted: 06/07/2023] [Indexed: 07/06/2023] Open
Abstract
The optimization of genetically engineered biological constructs is a key step to deliver high-impact biotechnological applications. The use of high-throughput DNA assembly methods allows the construction of enough genotypic variants to successfully cover the target design space. This, however, entails extra workload for researchers during the screening stage of candidate variants. Despite the existence of commercial colony pickers, their high price excludes small research laboratories and budget-adjusted institutions from accessing such extensive screening capability. In this work we present COPICK, a technical solution to automatize colony picking in an open-source liquid handler Opentrons OT-2. COPICK relies on a mounted camera to capture images of regular Petri dishes and detect microbial colonies for automated screening. COPICK's software can then automatically select the best colonies according to different criteria (size, color and fluorescence) and execute a protocol to pick them for further analysis. Benchmark tests performed for E. coli and P. putida colonies delivers a raw picking performance over pickable colonies of 82% with an accuracy of 73.4% at an estimated rate of 240 colonies/h. These results validate the utility of COPICK, and highlight the importance of ongoing technical improvements in open-source laboratory equipment to support smaller research teams.
Collapse
Affiliation(s)
- Irene del Olmo Lianes
- Department of Systems Biology, Centro Nacional de Biotecnología—Consejo Superior de Investigaciones Científicas, Madrid, Spain
| | - Pablo Yubero
- Department of Systems Biology, Centro Nacional de Biotecnología—Consejo Superior de Investigaciones Científicas, Madrid, Spain
| | - Álvaro Gómez-Luengo
- Department of Systems Biology, Centro Nacional de Biotecnología—Consejo Superior de Investigaciones Científicas, Madrid, Spain
- Interdisciplinary Platform for Sustainable Plastics Towards a Circular Economy—Consejo Superior de Investigaciones Científicas, SusPlast-CSIC, Madrid, Spain
| | - Juan Nogales
- Department of Systems Biology, Centro Nacional de Biotecnología—Consejo Superior de Investigaciones Científicas, Madrid, Spain
- Interdisciplinary Platform for Sustainable Plastics Towards a Circular Economy—Consejo Superior de Investigaciones Científicas, SusPlast-CSIC, Madrid, Spain
| | - David R. Espeso
- Department of Systems Biology, Centro Nacional de Biotecnología—Consejo Superior de Investigaciones Científicas, Madrid, Spain
| |
Collapse
|
9
|
Burrai GP, Gabrieli A, Polinas M, Murgia C, Becchere MP, Demontis P, Antuofermo E. Canine Mammary Tumor Histopathological Image Classification via Computer-Aided Pathology: An Available Dataset for Imaging Analysis. Animals (Basel) 2023; 13:ani13091563. [PMID: 37174600 PMCID: PMC10177203 DOI: 10.3390/ani13091563] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Revised: 04/27/2023] [Accepted: 05/04/2023] [Indexed: 05/15/2023] Open
Abstract
Histopathology, the gold-standard technique in classifying canine mammary tumors (CMTs), is a time-consuming process, affected by high inter-observer variability. Digital (DP) and Computer-aided pathology (CAD) are emergent fields that will improve overall classification accuracy. In this study, the ability of the CAD systems to distinguish benign from malignant CMTs has been explored on a dataset-namely CMTD-of 1056 hematoxylin and eosin JPEG images from 20 benign and 24 malignant CMTs, with three different CAD systems based on the combination of a convolutional neural network (VGG16, Inception v3, EfficientNet), which acts as a feature extractor, and a classifier (support vector machines (SVM) or stochastic gradient boosting (SGB)), placed on top of the neural net. Based on a human breast cancer dataset (i.e., BreakHis) (accuracy from 0.86 to 0.91), our models were applied to the CMT dataset, showing accuracy from 0.63 to 0.85 across all architectures. The EfficientNet framework coupled with SVM resulted in the best performances with an accuracy from 0.82 to 0.85. The encouraging results obtained by the use of DP and CAD systems in CMTs provide an interesting perspective on the integration of artificial intelligence and machine learning technologies in cancer-related research.
Collapse
Affiliation(s)
- Giovanni P Burrai
- Department of Veterinary Medicine, University of Sassari, Via Vienna 2, 07100 Sassari, Italy
- Mediterranean Center for Disease Control (MCDC), University of Sassari, Via Vienna 2, 07100 Sassari, Italy
| | - Andrea Gabrieli
- Department of Veterinary Medicine, University of Sassari, Via Vienna 2, 07100 Sassari, Italy
| | - Marta Polinas
- Department of Veterinary Medicine, University of Sassari, Via Vienna 2, 07100 Sassari, Italy
| | - Claudio Murgia
- Department of Veterinary Medicine, University of Sassari, Via Vienna 2, 07100 Sassari, Italy
| | | | - Pierfranco Demontis
- Department of Chemical, Physical, Mathematical and Natural Sciences, University of Sassari, Via Vienna 2, 07100 Sassari, Italy
| | - Elisabetta Antuofermo
- Department of Veterinary Medicine, University of Sassari, Via Vienna 2, 07100 Sassari, Italy
- Mediterranean Center for Disease Control (MCDC), University of Sassari, Via Vienna 2, 07100 Sassari, Italy
| |
Collapse
|