1
|
Siddiqui AA, Tirunagari S, Zia T, Windridge D. A latent diffusion approach to visual attribution in medical imaging. Sci Rep 2025; 15:962. [PMID: 39762275 PMCID: PMC11704132 DOI: 10.1038/s41598-024-81646-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2023] [Accepted: 11/28/2024] [Indexed: 01/11/2025] Open
Abstract
Visual attribution in medical imaging seeks to make evident the diagnostically-relevant components of a medical image, in contrast to the more common detection of diseased tissue deployed in standard machine vision pipelines (which are less straightforwardly interpretable/explainable to clinicians). We here present a novel generative visual attribution technique, one that leverages latent diffusion models in combination with domain-specific large language models, in order to generate normal counterparts of abnormal images. The discrepancy between the two hence gives rise to a mapping indicating the diagnostically-relevant image components. To achieve this, we deploy image priors in conjunction with appropriate conditioning mechanisms in order to control the image generative process, including natural language text prompts acquired from medical science and applied radiology. We perform experiments and quantitatively evaluate our results on the COVID-19 Radiography Database containing labelled chest X-rays with differing pathologies via the Frechet Inception Distance (FID), Structural Similarity (SSIM) and Multi Scale Structural Similarity Metric (MS-SSIM) metrics obtained between real and generated images. The resulting system also exhibits a range of latent capabilities including zero-shot localized disease induction, which are evaluated with real examples from the cheXpert dataset.
Collapse
|
2
|
Liu Y, Fan Q, Xu C, Ning X, Wang Y, Liu Y, Xie Y, Zhang Y, Chen Y, Liu H. GDMol: Generative Double-Masking Self-Supervised Learning for Molecular Property Prediction. Mol Inform 2025; 44:e202400146. [PMID: 39444340 DOI: 10.1002/minf.202400146] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2024] [Revised: 08/19/2024] [Accepted: 09/04/2024] [Indexed: 10/25/2024]
Abstract
BACKGROUND Effective molecular feature representation is crucial for drug property prediction. Recent years have seen increased attention on graph neural networks (GNNs) that are pre-trained using self-supervised learning techniques, aiming to overcome the scarcity of labeled data in molecular property prediction. Traditional GNNs in self-supervised molecular property prediction typically perform a single masking operation on the nodes and edges of the input molecular graph, masking only local information and insufficient for thorough self-supervised training. METHOD Hence, we propose a model for molecular property prediction based on generative double-masking self-supervised learning, termed as GDMol. This integrates generative learning into the self-supervised learning framework for latent representation, and applies a second round of masking to these latent representations, enabling the model to better capture global information and semantic knowledge of the molecules for a richer, more informative representation, thereby achieving more accurate and robust molecular property prediction. RESULTS Our experiments on 5 datasets demonstrated superior performance of GDMol in predicting molecular properties across different domains. Moreover, we used the masking operation to traverse through the gradient changes of each node, the magnitude and sign of which reflect the positive and negative contribution respectively of the local structure in the molecule to the prediction outcome. This in-depth interpretative analysis not only enhances the model's interpretability, but also provides more targeted insights and direction for optimizing drug molecules. CONCLUSIONS In summary, this research offers novel insights on improving molecular property prediction tasks, and paves the way for further research on the application of generative learning and self-supervised learning in the field of chemistry.
Collapse
Affiliation(s)
- Yingxu Liu
- School of Science, China Pharmaceutical University, Nanjing, 210009, China
| | - Qing Fan
- School of Science, China Pharmaceutical University, Nanjing, 210009, China
| | - Chengcheng Xu
- School of Science, China Pharmaceutical University, Nanjing, 210009, China
| | - Xiangzhen Ning
- School of Science, China Pharmaceutical University, Nanjing, 210009, China
| | - Yu Wang
- School of Science, China Pharmaceutical University, Nanjing, 210009, China
| | - Yang Liu
- School of Science, China Pharmaceutical University, Nanjing, 210009, China
| | - Yu Xie
- School of Science, China Pharmaceutical University, Nanjing, 210009, China
| | - Yanmin Zhang
- School of Science, China Pharmaceutical University, Nanjing, 210009, China
| | - Yadong Chen
- School of Science, China Pharmaceutical University, Nanjing, 210009, China
| | - Haichun Liu
- School of Science, China Pharmaceutical University, Nanjing, 210009, China
| |
Collapse
|
3
|
Ramouz A, Adeliansedehi A, Khajeh E, März K, Michael D, Wagner M, Müller-Stich BP, Mehrabi A, Majlesara A. Introducing and Validating the Multiphasic Evidential Decision-Making Matrix (MedMax) for Clinical Management in Patients with Intrahepatic Cholangiocarcinoma. Cancers (Basel) 2024; 17:52. [PMID: 39796681 PMCID: PMC11718823 DOI: 10.3390/cancers17010052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2024] [Revised: 11/22/2024] [Accepted: 12/17/2024] [Indexed: 01/13/2025] Open
Abstract
Background: Despite the significant advancements of liver surgery in the last few decades, the survival rate of patients with liver and pancreatic cancers has improved by only 10% in 30 years. Precision medicine offers a patient-centered approach, which, when combined with machine learning, could enhance decision making and treatment outcomes in surgical management of ihCC. This study aims to develop a decision support model to optimize treatment strategies for patients with ihCC, a prevalent primary liver cancer. Methods: The decision support model, named MedMax, was developed using three data sources: studies retrieved through a systematic literature review, expert opinions from HPB surgeons, and data from ihCC patients treated at Heidelberg University Hospital. Expert opinions were collected via surveys, with factors rated on a Likert scale, while patient data were used to validate the model's accuracy. Results: The model is structured into four decision-making phases, assessing diagnosis, treatment modality, surgical approach, and prognosis. Prospectively, 44 patients with ihCC were included for internal primary validation of the model. MedMax could predict the appropriate treatment considering the resectability of the lesions in 100% of patients. Also, MedMax could predict a decent surgical approach in 77% of the patients. The model proved effective in making decisions regarding surgery and patient management, demonstrating its potential as a clinical decision support tool. Conclusions: MedMax offers a transparent, personalized approach to decision making in HPB surgery, particularly for ihCC patients. Initial results show high accuracy in treatment selection, and the model's flexibility allows for future expansion to other liver tumors and HPB surgeries. Further validation with larger patient cohorts is required to enhance its clinical utility.
Collapse
Affiliation(s)
- Ali Ramouz
- Department of General, Visceral and Transplantation Surgery, University of Heidelberg, 69120 Heidelberg, Germany
- National Center for Tumor Diseases (NCT) Heidelberg, 69120 Heidelberg, Germany
- Liver Cancer Center Heidelberg (LCCH), University of Heidelberg, 69120 Heidelberg, Germany
| | - Ali Adeliansedehi
- Department of General, Visceral and Transplantation Surgery, University of Heidelberg, 69120 Heidelberg, Germany
- National Center for Tumor Diseases (NCT) Heidelberg, 69120 Heidelberg, Germany
| | - Elias Khajeh
- Department of General, Visceral and Transplantation Surgery, University of Heidelberg, 69120 Heidelberg, Germany
- National Center for Tumor Diseases (NCT) Heidelberg, 69120 Heidelberg, Germany
- Liver Cancer Center Heidelberg (LCCH), University of Heidelberg, 69120 Heidelberg, Germany
| | - Keno März
- National Center for Tumor Diseases (NCT) Heidelberg, 69120 Heidelberg, Germany
- Division of Computer Assisted Medical Interventions (CAMI), German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany
| | - Dominik Michael
- Division of Computer Assisted Medical Interventions (CAMI), German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany
| | - Martin Wagner
- Department of General, Visceral and Transplantation Surgery, University of Heidelberg, 69120 Heidelberg, Germany
- National Center for Tumor Diseases (NCT) Heidelberg, 69120 Heidelberg, Germany
- Center for the Tactile Internet with Human in the Loop (CeTI), Technical University Dresden, 01069 Dresden, Germany
| | - Beat Peter Müller-Stich
- Department of General, Visceral and Transplantation Surgery, University of Heidelberg, 69120 Heidelberg, Germany
- National Center for Tumor Diseases (NCT) Heidelberg, 69120 Heidelberg, Germany
- Department of Surgery, Clarunis University Center for Gastrointestinal and Liver Disease, University Hospital and St. Clara Hospital Basel, 4052 Basel, Switzerland
| | - Arianeb Mehrabi
- Department of General, Visceral and Transplantation Surgery, University of Heidelberg, 69120 Heidelberg, Germany
- National Center for Tumor Diseases (NCT) Heidelberg, 69120 Heidelberg, Germany
- Liver Cancer Center Heidelberg (LCCH), University of Heidelberg, 69120 Heidelberg, Germany
| | - Ali Majlesara
- Department of General, Visceral and Transplantation Surgery, University of Heidelberg, 69120 Heidelberg, Germany
- National Center for Tumor Diseases (NCT) Heidelberg, 69120 Heidelberg, Germany
- Liver Cancer Center Heidelberg (LCCH), University of Heidelberg, 69120 Heidelberg, Germany
| |
Collapse
|
4
|
Ocampo Osorio F, Alzate-Ricaurte S, Mejia Vallecilla TE, Cruz-Suarez GA. The anesthesiologist's guide to critically assessing machine learning research: a narrative review. BMC Anesthesiol 2024; 24:452. [PMID: 39695968 DOI: 10.1186/s12871-024-02840-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2024] [Accepted: 11/28/2024] [Indexed: 12/20/2024] Open
Abstract
Artificial Intelligence (AI), especially Machine Learning (ML), has developed systems capable of performing tasks that require human intelligence. In anesthesiology and other medical fields, AI applications can improve the precision and efficiency of daily clinical practice, and can also facilitate a personalized approach to patient care, which can lead to improved outcomes and quality of care. ML has been successfully applied in various settings of daily anesthesiology practice, such as predicting acute kidney injury, optimizing anesthetic doses, and managing postoperative nausea and vomiting. The critical evaluation of ML models in healthcare is crucial to assess their validity, safety, and clinical applicability. Evaluation metrics allow an objective statistical assessment of model performance. Tools such as Shapley Values (SHAP) help interpret how individual variables contribute to model predictions. Transparency in reporting is key in maintaining trust in these technologies and to ensure their use follows ethical principles, aiming to reduce safety concerns while also benefiting patients. Understanding evaluation metrics is essential, as they provide detailed information on model performance and their ability to discriminate between individual class rates. This article offers a comprehensive framework in assessing the validity, applicability, and limitations of models, guiding responsible and effective integration of ML technologies into clinical practice. A balance between innovation, patient safety and ethical considerations must be pursued.
Collapse
Affiliation(s)
- Felipe Ocampo Osorio
- Unidad de Inteligencia Artificial, Fundación Valle del Lili, Cra 98 Num.18-49, Cali, 760032, Valle del Cauca, Colombia
- Departamento de Salud Pública y Medicina Comunitaria, Universidad Icesi, Cali, 760000, Valle del Cauca, Colombia
- Centro de Investigaciones Clínicas, Fundación Valle del Lili, Cra 98 Num.18-49, Cali, 760032, Valle del Cauca, Colombia
| | - Sergio Alzate-Ricaurte
- Unidad de Inteligencia Artificial, Fundación Valle del Lili, Cra 98 Num.18-49, Cali, 760032, Valle del Cauca, Colombia
- Centro de Investigaciones Clínicas, Fundación Valle del Lili, Cra 98 Num.18-49, Cali, 760032, Valle del Cauca, Colombia
| | | | - Gustavo Adolfo Cruz-Suarez
- Unidad de Inteligencia Artificial, Fundación Valle del Lili, Cra 98 Num.18-49, Cali, 760032, Valle del Cauca, Colombia.
- Departamento de Salud Pública y Medicina Comunitaria, Universidad Icesi, Cali, 760000, Valle del Cauca, Colombia.
- Centro de Investigaciones Clínicas, Fundación Valle del Lili, Cra 98 Num.18-49, Cali, 760032, Valle del Cauca, Colombia.
| |
Collapse
|
5
|
Leckey C, van Dyk N, Doherty C, Lawlor A, Delahunt E. Machine learning approaches to injury risk prediction in sport: a scoping review with evidence synthesis. Br J Sports Med 2024:bjsports-2024-108576. [PMID: 39613453 DOI: 10.1136/bjsports-2024-108576] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/07/2024] [Indexed: 12/01/2024]
Abstract
OBJECTIVE This study reviewed the current state of machine learning (ML) research for the prediction of sports-related injuries. It aimed to chart the various approaches used and assess their efficacy, considering factors such as data heterogeneity, model specificity and contextual factors when developing predictive models. DESIGN Scoping review. DATA SOURCES PubMed, EMBASE, SportDiscus and IEEEXplore. RESULTS In total, 1241 studies were identified, 58 full texts were screened, and 38 relevant studies were reviewed and charted. Football (soccer) was the most commonly investigated sport. Area under the curve (AUC) was the most common means of model evaluation; it was reported in 71% of studies. In 60% of studies, tree-based solutions provided the highest statistical predictive performance. Random Forest and Extreme Gradient Boosting (XGBoost) were found to provide the highest performance for injury risk prediction. Logistic regression outperformed ML methods in 4 out of 12 studies. Three studies reported model performance of AUC>0.9, yet the clinical relevance is questionable. CONCLUSIONS A variety of different ML models have been applied to the prediction of sports-related injuries. While several studies report strong predictive performance, their clinical utility can be limited, with wide prediction windows or broad definitions of injury. The efficacy of ML is hampered by small datasets and numerous methodological heterogeneities (cohort sizes, definition of injury and dependent variables), which were common across the reviewed studies.
Collapse
Affiliation(s)
- Christopher Leckey
- School of Public Health Physiotherapy and Sports Science, University College Dublin, Dublin, Ireland
- High Performance Unit, Irish Rugby Football Union, Dublin, Dublin, Ireland
| | - Nicol van Dyk
- School of Public Health Physiotherapy and Sports Science, University College Dublin, Dublin, Ireland
- Section Sports Medicine, Faculty of Health Sciences, University of Pretoria, Pretoria, Gauteng, South Africa
| | - Cailbhe Doherty
- School of Public Health Physiotherapy and Sports Science, University College Dublin, Dublin, Ireland
- Insight Centre for Data Analytics, University College Dublin, Dublin, Dublin 4, Ireland
| | - Aonghus Lawlor
- Insight Centre for Data Analytics, University College Dublin, Dublin, Dublin 4, Ireland
| | - Eamonn Delahunt
- School of Public Health Physiotherapy and Sports Science, University College Dublin, Dublin, Ireland
- Institute for Sport and Health, University College Dublin, Dublin, Ireland
| |
Collapse
|
6
|
Nguyen M, Amanian A, Wei M, Prisman E, Mendez‐Tellez PA. Predicting Tracheostomy Need on Admission to the Intensive Care Unit-A Multicenter Machine Learning Analysis. Otolaryngol Head Neck Surg 2024; 171:1736-1750. [PMID: 39077854 PMCID: PMC11605030 DOI: 10.1002/ohn.919] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Revised: 06/12/2024] [Accepted: 07/06/2024] [Indexed: 07/31/2024]
Abstract
OBJECTIVE It is difficult to predict which mechanically ventilated patients will ultimately require a tracheostomy which further predisposes them to unnecessary spontaneous breathing trials, additional time on the ventilator, increased costs, and further ventilation-related complications such as subglottic stenosis. In this study, we aimed to develop a machine learning tool to predict which patients need a tracheostomy at the onset of admission to the intensive care unit (ICU). STUDY DESIGN Retrospective Cohort Study. SETTING Multicenter Study of 335 Intensive Care Units between 2014 and 2015. METHODS The eICU Collaborative Research Database (eICU-CRD) was utilized to obtain the patient cohort. Inclusion criteria included: (1) Age >18 years and (2) ICU admission requiring mechanical ventilation. The primary outcome of interest included tracheostomy assessed via a binary classification model. Models included logistic regression (LR), random forest (RF), and Extreme Gradient Boosting (XGBoost). RESULTS Of 38,508 invasively mechanically ventilated patients, 1605 patients underwent a tracheostomy. The XGBoost, RF, and LR models had fair performances at an AUROC 0.794, 0.780, and 0.775 respectively. Limiting the XGBoost model to 20 features out of 331, a minimal reduction in performance was observed with an AUROC of 0.778. Using Shapley Additive Explanations, the top features were an admission diagnosis of pneumonia or sepsis and comorbidity of chronic respiratory failure. CONCLUSIONS Our machine learning model accurately predicts the probability that a patient will eventually require a tracheostomy upon ICU admission, and upon prospective validation, we have the potential to institute earlier interventions and reduce the complications of prolonged ventilation.
Collapse
Affiliation(s)
| | - Ameen Amanian
- Department of Surgery, Division of Otolaryngology–Head & Neck SurgeryUniversity of British ColumbiaVancouverCanada
| | - Meihan Wei
- Department of Biomedical Engineering–Whiting School of EngineeringJohns Hopkins UniversityBaltimoreUSA
| | - Eitan Prisman
- Department of Surgery, Division of Otolaryngology–Head & Neck SurgeryUniversity of British ColumbiaVancouverCanada
| | | |
Collapse
|
7
|
Florentino BR, Parmezan Bonidia R, Sanches NH, da Rocha UN, de Carvalho AC. BioPrediction-RPI: Democratizing the prediction of interaction between non-coding RNA and protein with end-to-end machine learning. Comput Struct Biotechnol J 2024; 23:2267-2276. [PMID: 38827228 PMCID: PMC11140557 DOI: 10.1016/j.csbj.2024.05.031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2024] [Revised: 05/16/2024] [Accepted: 05/16/2024] [Indexed: 06/04/2024] Open
Abstract
Machine Learning (ML) algorithms have been important tools for the extraction of useful knowledge from biological sequences, particularly in healthcare, agriculture, and the environment. However, the categorical and unstructured nature of these sequences requiring usually additional feature engineering steps, before an ML algorithm can be efficiently applied. The addition of these steps to the ML algorithm creates a processing pipeline, known as end-to-end ML. Despite the excellent results obtained by applying end-to-end ML to biotechnology problems, the performance obtained depends on the expertise of the user in the components of the pipeline. In this work, we propose an end-to-end ML-based framework called BioPrediction-RPI, which can identify implicit interactions between sequences, such as pairs of non-coding RNA and proteins, without the need for specialized expertise in end-to-end ML. This framework applies feature engineering to represent each sequence by structural and topological features. These features are divided into feature groups and used to train partial models, whose partial decisions are combined into a final decision, which, provides insights to the user by giving an interpretability report. In our experiments, the developed framework was competitive when compared with various expert-created models. We assessed BioPrediction-RPI with 12 datasets when it presented equal or better performance than all tools in 40% to 100% of cases, depending on the experiment. Finally, BioPrediction-RPI can fine-tune models based on new data and perform at the same level as ML experts, democratizing end-to-end ML and increasing its access to those working in biological sciences.
Collapse
Affiliation(s)
- Bruno Rafael Florentino
- Institute of Mathematics and Computer Sciences, University of São Paulo, São Carlos, 13566-590, São Paulo, Brazil
| | - Robson Parmezan Bonidia
- Institute of Mathematics and Computer Sciences, University of São Paulo, São Carlos, 13566-590, São Paulo, Brazil
- Department of Computer Science, Federal University of Technology-Paraná (UTFPR), Cornélio Procópio, 86300-000, Paraná, Brazil
| | - Natan Henrique Sanches
- Institute of Mathematics and Computer Sciences, University of São Paulo, São Carlos, 13566-590, São Paulo, Brazil
| | - Ulisses N. da Rocha
- Department of Environmental Microbiology, Helmholtz Centre for Environmental Research-UFZ GmbH, Leipzig, Saxony, Germany
| | - André C.P.L.F. de Carvalho
- Institute of Mathematics and Computer Sciences, University of São Paulo, São Carlos, 13566-590, São Paulo, Brazil
| |
Collapse
|
8
|
Frascarelli C, Venetis K, Marra A, Mane E, Ivanova M, Cursano G, Porta FM, Concardi A, Ceol AGM, Farina A, Criscitiello C, Curigliano G, Guerini-Rocco E, Fusco N. Deep learning algorithm on H&E whole slide images to characterize TP53 alterations frequency and spatial distribution in breast cancer. Comput Struct Biotechnol J 2024; 23:4252-4259. [PMID: 39678362 PMCID: PMC11638532 DOI: 10.1016/j.csbj.2024.11.037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2024] [Revised: 11/20/2024] [Accepted: 11/21/2024] [Indexed: 12/17/2024] Open
Abstract
The tumor suppressor TP53 is frequently mutated in hormone receptor-negative, HER2-positive breast cancer (BC), contributing to tumor aggressiveness. Traditional ancillary methods like immunohistochemistry (IHC) to assess TP53 functionality face pre- and post-analytical challenges. This proof-of-concept study employed a deep learning (DL) algorithm to predict TP53 mutational status from H&E-stained whole slide images (WSIs) of BC tissue. Using a pre-trained convolutional neural network, the model identified tumor areas and predicted TP53 mutations with a Dice coefficient score of 0.82. Predictions were validated through IHC and next-generation sequencing (NGS), confirming TP53 aberrant expression in 92 % of the tumor area, closely matching IHC findings (90 %). The DL model exhibited high accuracy in tissue quantification and TP53 status prediction, outperforming traditional methods in terms of precision and efficiency. DL-based approaches offer significant promise for enhancing biomarker testing and precision oncology by reducing intra- and inter-observer variability, but further validation is required to optimize their integration into real-world clinical workflows. This study underscores the potential of DL algorithms to predict key genetic alterations, such as TP53 mutations, in BC. DL-based histopathological analysis represents a valuable tool for improving patient management and tailoring treatment approaches based on molecular biomarker status.
Collapse
Affiliation(s)
- Chiara Frascarelli
- Division of Pathology, European Institute of Oncology IRCCS, Milan, Italy
- Department of Oncology and Hemato-Oncology, University of Milan, Milan, Italy
| | | | - Antonio Marra
- Division of New Drugs and Early Drug Development for Innovative Therapies, European Institute of Oncology IRCCS, Milan, Italy
| | - Eltjona Mane
- Division of Pathology, European Institute of Oncology IRCCS, Milan, Italy
| | - Mariia Ivanova
- Division of Pathology, European Institute of Oncology IRCCS, Milan, Italy
| | - Giulia Cursano
- Division of Pathology, European Institute of Oncology IRCCS, Milan, Italy
- Department of Oncology and Hemato-Oncology, University of Milan, Milan, Italy
| | | | - Alberto Concardi
- Division of Pathology, European Institute of Oncology IRCCS, Milan, Italy
| | - Arnaud Gerard Michel Ceol
- Department of Information and Communications Technology, European Institute of Oncology IRCCS, Milan, Italy
| | - Annarosa Farina
- Department of Information and Communications Technology, European Institute of Oncology IRCCS, Milan, Italy
| | - Carmen Criscitiello
- Department of Oncology and Hemato-Oncology, University of Milan, Milan, Italy
- Division of New Drugs and Early Drug Development for Innovative Therapies, European Institute of Oncology IRCCS, Milan, Italy
| | - Giuseppe Curigliano
- Department of Oncology and Hemato-Oncology, University of Milan, Milan, Italy
- Division of New Drugs and Early Drug Development for Innovative Therapies, European Institute of Oncology IRCCS, Milan, Italy
| | - Elena Guerini-Rocco
- Division of Pathology, European Institute of Oncology IRCCS, Milan, Italy
- Department of Oncology and Hemato-Oncology, University of Milan, Milan, Italy
| | - Nicola Fusco
- Division of Pathology, European Institute of Oncology IRCCS, Milan, Italy
- Department of Oncology and Hemato-Oncology, University of Milan, Milan, Italy
| |
Collapse
|
9
|
Wan K, Tanioka K, Shimokawa T. Survival causal rule ensemble method considering the main effect for estimating heterogeneous treatment effects. Stat Med 2024; 43:5234-5271. [PMID: 39576217 DOI: 10.1002/sim.10180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2023] [Revised: 05/20/2024] [Accepted: 07/11/2024] [Indexed: 11/24/2024]
Abstract
With an increasing focus on precision medicine in medical research, numerous studies have been conducted in recent years to clarify the relationship between treatment effects and patient characteristics. The treatment effects for patients with different characteristics are always heterogeneous, and therefore, various heterogeneous treatment effect machine learning estimation methods have been proposed owing to their flexibility and high estimation accuracy. However, most machine learning methods rely on black-box models, preventing direct interpretation of the relationship between patient characteristics and treatment effects. Moreover, most of these studies have focused on continuous or binary outcomes, although survival outcomes are also important in medical research. To address these challenges, we propose a heterogeneous treatment effect estimation method for survival data based on RuleFit, an interpretable machine learning method. Numerical simulation results confirmed that the prediction performance of the proposed method was comparable to that of existing methods. We also applied a dataset from an HIV study, the AIDS Clinical Trials Group Protocol 175 dataset, to illustrate the interpretability of the proposed method using real data. Consequently, the proposed survival causal rule ensemble method provides an interpretable model with sufficient estimation accuracy.
Collapse
Affiliation(s)
- Ke Wan
- Department of Medicine, Wakayama Medical University, Wakayama, Japan
| | - Kensuke Tanioka
- Department of Biomedical Sciences and Informatics, Doshisha University, Kyoto, Japan
| | - Toshio Shimokawa
- Department of Medicine, Wakayama Medical University, Wakayama, Japan
| |
Collapse
|
10
|
Shahabi F, Battalio SL, Pfammatter AF, Hedeker D, Spring B, Alshurafa N. A machine-learned model for predicting weight loss success using weight change features early in treatment. NPJ Digit Med 2024; 7:344. [PMID: 39613928 DOI: 10.1038/s41746-024-01299-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2024] [Accepted: 10/12/2024] [Indexed: 12/01/2024] Open
Abstract
Stepped-care obesity treatments aim to improve efficiency by early identification of non-responders and adjusting interventions but lack validated models. We trained a random forest classifier to improve the predictive utility of a clinical decision rule (>0.5 lb weight loss/week) that identifies non-responders in the first 2 weeks of a stepped-care weight loss trial (SMART). From 2009 to 2021, 1058 individuals with obesity participated in three studies: SMART, Opt-IN, and ENGAGED. The model was trained on 80% of the SMART data (224 participants), and its in-distribution generalizability was tested on the remaining 20% (remaining 57 participants). The out-of-distribution generalizability was tested on the ENGAGED and Opt-IN studies (472 participants). The model predicted weight loss at month 6 with an 84.5% AUROC and an 86.3% AUPRC. SHAP identified predictive features: weight loss at week 2, ranges/means and ranges of weight loss, slope, and age. The SMART-trained model showed generalizable performance with no substantial difference across studies.
Collapse
Grants
- F31HL162555 U.S. Department of Health & Human Services | NIH | National Heart, Lung, and Blood Institute (NHLBI)
- F31HL162555 U.S. Department of Health & Human Services | NIH | National Heart, Lung, and Blood Institute (NHLBI)
- F31HL162555 U.S. Department of Health & Human Services | NIH | National Heart, Lung, and Blood Institute (NHLBI)
- F31HL162555 U.S. Department of Health & Human Services | NIH | National Heart, Lung, and Blood Institute (NHLBI)
- F31HL162555 U.S. Department of Health & Human Services | NIH | National Heart, Lung, and Blood Institute (NHLBI)
Collapse
Affiliation(s)
- Farzad Shahabi
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA.
- Department of Computer Science, McCormick School of Engineering, Northwestern University, Evanston, IL, USA.
| | - Samuel L Battalio
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
| | | | - Donald Hedeker
- Department of Public Health Sciences, The University of Chicago, Chicago, IL, USA
| | - Bonnie Spring
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
| | - Nabil Alshurafa
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
- Department of Computer Science, McCormick School of Engineering, Northwestern University, Evanston, IL, USA
| |
Collapse
|
11
|
Nelson W, Petch J, Ranisau J, Zhao R, Balasubramanian K, Bangdiwala SI. Detecting irregularities in randomized controlled trials using machine learning. Clin Trials 2024:17407745241297947. [PMID: 39587801 DOI: 10.1177/17407745241297947] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2024]
Abstract
BACKGROUND Over the course of a clinical trial, irregularities may arise in the data. Trialists implement human-intensive, expensive central statistical monitoring procedures to identify and correct these irregularities before the results of the trial are analyzed and disseminated. Machine learning algorithms have shown promise for identifying center-level irregularities in multi-center clinical trials with minimal human intervention. We aimed to characterize the form-level data irregularities in several historical clinical trials and evaluate the ability of a machine learning-based outlier detection algorithm to identify them. METHODS Data irregularities previously identified by humans in historical clinical trials were ascertained by comparing preliminary snapshots of the trial databases to the final, locked databases. We measured the ability of a machine learning based outlier detection algorithm to identify form-level irregularities using concordance (area under the receiver operator characteristic), positive predictive value (precision), and sensitivity (recall). RESULTS We examined preliminary snapshots of seven historical clinical trials which randomized a total of 77,001 participants. We extracted a total of 1,267,484 completed entries from 358 case report forms containing irregularities from all snapshots across all trials, containing a total of 24,850 form-wide irregularities (median per-form form-level irregularity rate: 1.81%). Our proposed machine learning algorithm detects form-level irregularities with a median concordance of 0.74 (interquartile range = 0.57-0.89), slightly exceeding the performance of a previously proposed machine learning approach with a median area under the receiver operator characteristic of 0.73 (interquartile range = 0.54-0.88). CONCLUSION Data irregularities in historical clinical trials were ascertained by comparing preliminary snapshots of the trial database to the final database. These irregularities can be categorized according to their scope. Irregularities can be successfully detected by a machine learning algorithm as early or earlier than a human can, without human intervention. Such an approach may complement existing techniques for central statistical monitoring in large multi-center randomized controlled trials and possibly improve the efficiency of costly data verification processes.
Collapse
Affiliation(s)
- Walter Nelson
- Centre for Data Science and Digital Health, Hamilton Health Sciences, Hamilton, ON, Canada
- Department of Statistical Sciences, University of Toronto, Toronto, ON, Canada
| | - Jeremy Petch
- Centre for Data Science and Digital Health, Hamilton Health Sciences, Hamilton, ON, Canada
- Population Health Research Institute, Hamilton, ON, Canada
- Department of Medicine, McMaster University, Hamilton, ON, Canada
- Institute of Health Policy, Management and Evaluation, University of Toronto, Toronto, ON, Canada
| | - Jonathan Ranisau
- Centre for Data Science and Digital Health, Hamilton Health Sciences, Hamilton, ON, Canada
| | - Robin Zhao
- Population Health Research Institute, Hamilton, ON, Canada
| | | | - Shrikant I Bangdiwala
- Population Health Research Institute, Hamilton, ON, Canada
- Department of Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, ON, Canada
| |
Collapse
|
12
|
Xu H, Peng X, Peng Z, Wang R, Zhou R, Fu L. Construction and SHAP interpretability analysis of a risk prediction model for feeding intolerance in preterm newborns based on machine learning. BMC Med Inform Decis Mak 2024; 24:342. [PMID: 39558307 PMCID: PMC11572196 DOI: 10.1186/s12911-024-02751-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2024] [Accepted: 11/04/2024] [Indexed: 11/20/2024] Open
Abstract
OBJECTIVE To construct a highly accurate and interpretable feeding intolerance (FI) risk prediction model for preterm newborns based on machine learning (ML) to assist medical staff in clinical diagnosis. METHODS In this study, a sample of 350 hospitalized preterm newborns were retrospectively analysed. First, dual feature selection was conducted to identify important feature variables for model construction. Second, ML models were constructed based on the logistic regression (LR), decision tree (DT), support vector machine (SVM) and eXtreme Gradient Boosting (XGBoost) algorithms, after which random sampling and tenfold cross-validation were separately used to evaluate and compare these models and identify the optimal model. Finally, we apply the SHapley Additive exPlanation (SHAP) interpretable framework to analyse the decision-making principles of the optimal model and expound upon the important factors affecting FI in preterm newborns and their modes of action. RESULTS The accuracy of XGBoost was 87.62%, and the area under the curve (AUC) was 92.2%. After the application of tenfold cross-validation, the accuracy was 83.43%, and the AUC was 89.45%, which was significantly better than those of the other models. Analysis of the XGBoost model with the SHAP interpretable framework showed that a history of resuscitation, use of probiotics, milk opening time, interval between two stools and gestational age were the main factors affecting the occurrence of FI in preterm newborns, yielding importance scores of 0.632, 0.407, 0.313, 0.313, and 0.258, respectively. A history of resuscitation, first milk opening time ≥ 24 h and interval between stools ≥ 3 days were risk factors for FI, while the use of probiotics and gestational age ≥ 34 weeks were protective factors against FI in preterm newborns. CONCLUSIONS In practice, we should improve perinatal care and obstetrics with the aim of reducing the occurrence of hypoxia and preterm delivery. When feeding, early milk opening, the use of probiotics, the stimulation of defecation and other measures should be implemented with the aim of reducing the occurrence of FI.
Collapse
Affiliation(s)
- Hui Xu
- Department of Medical Records & Statistics, Affiliated Hospital of Xuzhou Medical University, Xuzhou, Jiangsu, China
- Department of Child and Adolescent Health, School of Public Health, Bengbu Medical University, Bengbu, Anhui, China
| | - Xingwang Peng
- Changjiang Road Community Health Service Center, Zhangmiao Street, Baoshan District, Shanghai, China
- Department of Child and Adolescent Health, School of Public Health, Bengbu Medical University, Bengbu, Anhui, China
| | - Ziyu Peng
- Department of Child and Adolescent Health, School of Public Health, Bengbu Medical University, Bengbu, Anhui, China
| | - Rui Wang
- Department of Child and Adolescent Health, School of Public Health, Bengbu Medical University, Bengbu, Anhui, China
| | - Rui Zhou
- The First Affiliated Hospital of Bengbu Medical University, Bengbu, Anhui, China.
| | - Lianguo Fu
- Department of Child and Adolescent Health, School of Public Health, Bengbu Medical University, Bengbu, Anhui, China.
| |
Collapse
|
13
|
Lin Y, Li C, Wang X, Li H. Development of a machine learning-based risk assessment model for loneliness among elderly Chinese: a cross-sectional study based on Chinese longitudinal healthy longevity survey. BMC Geriatr 2024; 24:939. [PMID: 39543473 PMCID: PMC11562678 DOI: 10.1186/s12877-024-05443-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2024] [Accepted: 10/07/2024] [Indexed: 11/17/2024] Open
Abstract
BACKGROUND Loneliness is prevalent among the elderly and has intensified due to global aging trends. It adversely affects both mental and physical health. Traditional scales for measuring loneliness may yield biased results due to varying definitions. The advancements in machine learning offer new opportunities for improving the measurement and assessment of loneliness through the development of risk assessment models. METHODS Data from the 2018 Chinese Longitudinal Healthy Longevity Survey, involving about 16,000 participants aged ≥ 65 years, were used. The study examined the relationships between loneliness and factors such as functional limitations, living conditions, environmental influences, age-related health issues, and health behaviors. Using R 4.4.1, seven assessment models were developed: logistic regression, ridge regression, support vector machines, K-nearest neighbors, decision trees, random forests, and multi-layer perceptron. Models were evaluated based on ROC curves, accuracy, precision, recall, F1 scores, and AUC. RESULTS Loneliness prevalence among elderly Chinese was 23.4%. Analysis identified 15 evaluative factors and evaluated seven models. Multi-layer perceptron stands out for its strong nonlinear mapping capability and adaptability to complex data, making it one of the most effective models for assessing loneliness risk. CONCLUSION The study found a 23.4% prevalence of loneliness among elderly individuals in China. SHAP values indicated that marital status has the strongest evaluative value across all forecasting periods. Specifically, elderly individuals who are never married, widowed, divorced, or separated are more likely to experience loneliness compared to their married counterparts.
Collapse
Affiliation(s)
- Youbei Lin
- Jinzhou Medical University, School of Nursing, Jinzhou City, Liaoning Province, 121001, China
| | - Chuang Li
- Jinzhou Medical University, School of Nursing, Jinzhou City, Liaoning Province, 121001, China
| | - Xiuli Wang
- The First Affiliated Hospital of Jinzhou Medical University, Jinzhou City, Liaoning Province, 121001, China
| | - Hongyu Li
- Jinzhou Medical University, School of Nursing, Jinzhou City, Liaoning Province, 121001, China.
| |
Collapse
|
14
|
Lu X, Qiao C, Wang H, Li Y, Wang J, Wang C, Wang Y, Qie S. Application of Isokinetic Dynamometry Data in Predicting Gait Deviation Index Using Machine Learning in Stroke Patients: A Cross-Sectional Study. SENSORS (BASEL, SWITZERLAND) 2024; 24:7258. [PMID: 39599035 PMCID: PMC11598631 DOI: 10.3390/s24227258] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/05/2024] [Revised: 11/06/2024] [Accepted: 11/12/2024] [Indexed: 11/29/2024]
Abstract
BACKGROUND Three-dimensional gait analysis, supported by advanced sensor systems, is a crucial component in the rehabilitation assessment of post-stroke hemiplegic patients. However, the sensor data generated from such analyses are often complex and challenging to interpret in clinical practice, requiring significant time and complicated procedures. The Gait Deviation Index (GDI) serves as a simplified metric for quantifying the severity of pathological gait. Although isokinetic dynamometry, utilizing sophisticated sensors, is widely employed in muscle function assessment and rehabilitation, its application in gait analysis remains underexplored. OBJECTIVE This study aims to investigate the use of sensor-acquired isokinetic muscle strength data, combined with machine learning techniques, to predict the GDI in hemiplegic patients. This study utilizes data captured from sensors embedded in the Biodex dynamometry system and the Vicon 3D motion capture system, highlighting the integration of sensor technology in clinical gait analysis. METHODS This study was a cross-sectional, observational study that included a cohort of 150 post-stroke hemiplegic patients. The sensor data included measurements such as peak torque, peak torque/body weight, maximum work of repeated actions, coefficient of variation, average power, total work, acceleration time, deceleration time, range of motion, and average peak torque for both flexor and extensor muscles on the affected side at three angular velocities (60°/s, 90°/s, and 120°/s) using the Biodex System 4 Pro. The GDI was calculated using data from a Vicon 3D motion capture system. This study employed four machine learning models-Lasso Regression, Random Forest (RF), Support Vector regression (SVR), and BP Neural Network-to model and validate the sensor data. Model performance was evaluated using mean squared error (MSE), the coefficient of determination (R2), and mean absolute error (MAE). SHapley Additive exPlanations (SHAP) analysis was used to enhance model interpretability. RESULTS The RF model outperformed others in predicting GDI, with an MSE of 16.18, an R2 of 0.89, and an MAE of 2.99. In contrast, the Lasso Regression model yielded an MSE of 22.29, an R2 of 0.85, and an MAE of 3.71. The SVR model had an MSE of 31.58, an R2 of 0.82, and an MAE of 7.68, while the BP Neural Network model exhibited the poorest performance with an MSE of 50.38, an R2 of 0.79, and an MAE of 9.59. SHAP analysis identified the maximum work of repeated actions of the extensor muscles at 60°/s and 120°/s as the most critical sensor-derived features for predicting GDI, underscoring the importance of muscle strength metrics at varying speeds in rehabilitation assessments. CONCLUSIONS This study highlights the potential of integrating advanced sensor technology with machine learning techniques in the analysis of complex clinical data. The developed GDI prediction model, based on sensor-acquired isokinetic dynamometry data, offers a novel, streamlined, and effective tool for assessing rehabilitation progress in post-stroke hemiplegic patients, with promising implications for broader clinical application.
Collapse
Affiliation(s)
- Xiaolei Lu
- Department of Rehabilitation, Beijing Rehabilitation Hospital, Capital Medical University, Beijing 100144, China; (X.L.); (H.W.); (Y.L.)
| | - Chenye Qiao
- Beijing Rehabilitation Medicine, Beijing Rehabilitation Hospital, Capital Medical University, Beijing 100144, China; (C.Q.); (J.W.)
| | - Hujun Wang
- Department of Rehabilitation, Beijing Rehabilitation Hospital, Capital Medical University, Beijing 100144, China; (X.L.); (H.W.); (Y.L.)
| | - Yingqi Li
- Department of Rehabilitation, Beijing Rehabilitation Hospital, Capital Medical University, Beijing 100144, China; (X.L.); (H.W.); (Y.L.)
| | - Jingxuan Wang
- Beijing Rehabilitation Medicine, Beijing Rehabilitation Hospital, Capital Medical University, Beijing 100144, China; (C.Q.); (J.W.)
| | - Congxiao Wang
- Department of Rehabilitation, Beijing Rehabilitation Hospital, Capital Medical University, Beijing 100144, China; (X.L.); (H.W.); (Y.L.)
| | - Yingpeng Wang
- Department of Rehabilitation, Beijing Rehabilitation Hospital, Capital Medical University, Beijing 100144, China; (X.L.); (H.W.); (Y.L.)
| | - Shuyan Qie
- Department of Rehabilitation, Beijing Rehabilitation Hospital, Capital Medical University, Beijing 100144, China; (X.L.); (H.W.); (Y.L.)
| |
Collapse
|
15
|
Ly CO, Suszko AM, Denham NC, Chakraborty P, Rahimi M, McIntosh C, Chauhan VS. Machine Learning Identifies Arrhythmogenic Features of QRS Fragmentation in Human Cardiomyopathy: Implications for Improving Risk Stratification. Heart Rhythm 2024:S1547-5271(24)03543-4. [PMID: 39515502 DOI: 10.1016/j.hrthm.2024.11.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/09/2024] [Revised: 10/31/2024] [Accepted: 11/01/2024] [Indexed: 11/16/2024]
Abstract
BACKGROUND Heterogeneous ventricular activation can provide the substrate for ventricular arrhythmias (VA), but its manifestation on the electrocardiogram (ECG) as a risk stratifier is not well-defined. OBJECTIVE To characterize the spatiotemporal features of QRS peaks that best predict VA in patients with cardiomyopathy (CM) using machine learning (ML). METHODS Prospectively enrolled CM patients with prophylactic defibrillators (n=95) underwent digital, high-resolution ECG recordings during intrinsic rhythm and ventricular pacing at 100 to 120 beats/min. Intra QRS peaks in the signal-averaged precordial leads were identified and their characteristics (amplitude, width, and timing within the QRS) were transformed into 4-bin histograms. Random forest models of these characteristics in each lead alongside clinical data were trained on 76 patients and tested on 19 patients with cross-validation to determine the features that predicted VA. RESULTS Patients were followed up for 45 (22-48) months, and 21% had VA. The individual machine learning (ML) models determined (area under the receiver operating characteristic curve [AUROC]) intrinsic QRS peak amplitude (0.88), width (0.78), and location (0.84) to all predict VA. In a combined model including all QRS peak characteristics, peaks with amplitude < 31 μV in V1, a width of 4 to 8 ms in V1, and location in the final quarter of the QRS of V1 were most predictive. Neither clinical data nor QRS peak characteristics assessed during ventricular pacing improved VA prediction when combined with intrinsic QRS peak characteristics. CONCLUSIONS Arrhythmogenic QRS fragmentation is characterized by narrow, low-amplitude peaks in the terminal QRS of lead V1. These features alone without clinical variables or ventricular pacing are sufficient to accurately risk stratify CM patients.
Collapse
Affiliation(s)
- Cathy Ong Ly
- Division of Cardiology, Peter Munk Cardiac Center, University Health Network, Toronto, Canada
| | - Adrian M Suszko
- Division of Cardiology, Peter Munk Cardiac Center, University Health Network, Toronto, Canada
| | - Nathan C Denham
- Division of Cardiology, Peter Munk Cardiac Center, University Health Network, Toronto, Canada
| | - Praloy Chakraborty
- Division of Cardiology, Peter Munk Cardiac Center, University Health Network, Toronto, Canada
| | - Mahbod Rahimi
- Division of Cardiology, Peter Munk Cardiac Center, University Health Network, Toronto, Canada
| | - Chris McIntosh
- Division of Cardiology, Peter Munk Cardiac Center, University Health Network, Toronto, Canada
| | - Vijay S Chauhan
- Division of Cardiology, Peter Munk Cardiac Center, University Health Network, Toronto, Canada.
| |
Collapse
|
16
|
Yadav A, Raj A, Yadav B. Enhancing local-scale groundwater quality predictions using advanced machine learning approaches. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024; 370:122903. [PMID: 39413632 DOI: 10.1016/j.jenvman.2024.122903] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/21/2024] [Revised: 09/16/2024] [Accepted: 10/10/2024] [Indexed: 10/18/2024]
Abstract
Assessing groundwater quality typically involves labor-intensive, time-consuming, and costly laboratory tests, making real-time monitoring impractical, especially at the local level. Groundwater quality projections at the local scale using broad spatial datasets have been inaccurate due to variations in hydrogeology, human activities, industrial operations, groundwater extraction, and waste disposal. This study aims to identify the most dependable and resilient machine learning algorithms for forecasting groundwater quality at nearby monitoring locations by utilizing simple water quality metrics that can be quickly assessed without extensive sampling and laboratory testing. The Entropy-weighted Water Quality Index (EWQI) was calculated using a large spatial and temporal dataset (2014-2021) of 977 wells with parameters including pH, total hardness (TH), calcium (Ca2⁺), magnesium (Mg2⁺), sodium (Na⁺), potassium (K⁺), sulfate (SO₄2⁻), chloride (Cl⁻), nitrate (NO₃⁻), total dissolved solids (TDS), and fluoride (F⁻). Further, similar parameters were also observed in 33 open wells at the three local monitoring sites from December 2022 to March 2023. The EWQI was predicted using a Random Forest (RF), eXtreme Gradient Boosting (XGB), and Deep Neural Network (DNN). The pH, TH, and TDS were used as input variables for EWQI predictions, as they can be easily measured using handheld probes or multi-parameters. The model performance was evaluated using R2, MAE, and RMSE. During the training stage, all three models predicted the EWQI with an R2 greater than 90%, with minimal errors when pH, TH, and TDS were input variables. To validate the models at a local scale, the EWQI was predicted at the village level (e.g., Antoli, Balapura, and Lapodiaya) using pH, TH, and TDS as input variables. The machine learning models were able to predict the EWQI very closely to the actual EWQI, with an R2 greater than 90%. It is also evident that the models could predict the EWQI using basic parameters that are easily measured, providing an overall idea of the water quality for a small area. Hence, these machine learning models could be useful for accurately representing groundwater quality, thereby avoiding the use of time-consuming and costly laboratory techniques.
Collapse
Affiliation(s)
- Abhimanyu Yadav
- Department of Water Resources Development and Management, Indian Institute of Technology Roorkee, 247667, India
| | - Abhay Raj
- Department of Water Resources Development and Management, Indian Institute of Technology Roorkee, 247667, India
| | - Basant Yadav
- Department of Water Resources Development and Management, Indian Institute of Technology Roorkee, 247667, India.
| |
Collapse
|
17
|
Riek NT, Gokhale TA, Martin-Gill C, Kraevsky-Philips K, Zègre-Hemsey JK, Saba S, Callaway CW, Akcakaya M, Al-Zaiti SS. Clinical usability of deep learning-based saliency maps for occlusion myocardial infarction identification from the prehospital 12-Lead electrocardiogram. J Electrocardiol 2024; 87:153792. [PMID: 39255653 DOI: 10.1016/j.jelectrocard.2024.153792] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2024] [Revised: 08/26/2024] [Accepted: 08/27/2024] [Indexed: 09/12/2024]
Abstract
INTRODUCTION Deep learning (DL) models offer improved performance in electrocardiogram (ECG)-based classification over rule-based methods. However, for widespread adoption by clinicians, explainability methods, like saliency maps, are essential. METHODS On a subset of 100 ECGs from patients with chest pain, we generated saliency maps using a previously validated convolutional neural network for occlusion myocardial infarction (OMI) classification. Three clinicians reviewed ECG-saliency map dyads, first assessing the likelihood of OMI from standard ECGs and then evaluating clinical relevance and helpfulness of the saliency maps, as well as their confidence in the model's predictions. Questions were answered on a Likert scale ranging from +3 (most useful/relevant) to -3 (least useful/relevant). RESULTS The adjudicated accuracy of the three clinicians matched the DL model when considering area under the receiver operating characteristics curve (AUC) and F1 score (AUC 0.855 vs. 0.872, F1 score = 0.789 vs. 0.747). On average, clinicians found saliency maps slightly clinically relevant (0.96 ± 0.92) and slightly helpful (0.66 ± 0.98) in identifying or ruling out OMI but had higher confidence in the model's predictions (1.71 ± 0.56). Clinicians noted that leads I and aVL were often emphasized, even when obvious ST changes were present in other leads. CONCLUSION In this clinical usability study, clinicians deemed saliency maps somewhat helpful in enhancing explainability of DL-based ECG models. The spatial convolutional layers across the 12 leads in these models appear to contribute to the discrepancy between ECG segments considered most relevant by clinicians and segments that drove DL model predictions.
Collapse
Affiliation(s)
- Nathan T Riek
- Department of Electrical and Computer Engineering, University of Pittsburgh, Pittsburgh, PA, USA
| | - Tanmay A Gokhale
- Division of Cardiology, University of Pittsburgh, Pittsburgh, PA, USA; University of Pittsburgh Medical Center (UPMC), Pittsburgh, PA, USA
| | - Christian Martin-Gill
- Department of Emergency Medicine, University of Pittsburgh, Pittsburgh, PA, USA; University of Pittsburgh Medical Center (UPMC), Pittsburgh, PA, USA
| | - Karina Kraevsky-Philips
- Department of Acute and Tertiary Care Nursing, University of Pittsburgh, Pittsburgh, PA, USA; University of Pittsburgh Medical Center (UPMC), Pittsburgh, PA, USA
| | | | - Samir Saba
- Division of Cardiology, University of Pittsburgh, Pittsburgh, PA, USA; University of Pittsburgh Medical Center (UPMC), Pittsburgh, PA, USA
| | - Clifton W Callaway
- Department of Emergency Medicine, University of Pittsburgh, Pittsburgh, PA, USA; University of Pittsburgh Medical Center (UPMC), Pittsburgh, PA, USA
| | - Murat Akcakaya
- Department of Electrical and Computer Engineering, University of Pittsburgh, Pittsburgh, PA, USA
| | - Salah S Al-Zaiti
- Division of Cardiology, University of Pittsburgh, Pittsburgh, PA, USA; Department of Emergency Medicine, University of Pittsburgh, Pittsburgh, PA, USA; Department of Acute and Tertiary Care Nursing, University of Pittsburgh, Pittsburgh, PA, USA.
| |
Collapse
|
18
|
Zinzuwadia AN, Mineeva O, Li C, Farukhi Z, Giulianini F, Cade B, Chen L, Karlson E, Paynter N, Mora S, Demler O. Tailoring Risk Prediction Models to Local Populations. JAMA Cardiol 2024; 9:1018-1028. [PMID: 39292486 PMCID: PMC11411452 DOI: 10.1001/jamacardio.2024.2912] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Accepted: 07/23/2024] [Indexed: 09/19/2024]
Abstract
Importance Risk estimation is an integral part of cardiovascular care. Local recalibration of guideline-recommended models could address the limitations of existing tools. Objective To provide a machine learning (ML) approach to augment the performance of the American Heart Association's Predicting Risk of Cardiovascular Disease Events (AHA-PREVENT) equations when applied to a local population while preserving clinical interpretability. Design, Setting, and Participants This cohort study used a New England-based electronic health record cohort of patients without prior atherosclerotic cardiovascular disease (ASCVD) who had the data necessary to calculate the AHA-PREVENT 10-year risk of developing ASCVD in the event period (2007-2016). Patients with prior ASCVD events, death prior to 2007, or age 79 years or older in 2007 were subsequently excluded. The final study population of 95 326 patients was split into 3 nonoverlapping subsets for training, testing, and validation. The AHA-PREVENT model was adapted to this local population using the open-source ML model (MLM) Extreme Gradient Boosting model (XGBoost) with minimal predictor variables, including age, sex, and AHA-PREVENT. The MLM was monotonically constrained to preserve known associations between risk factors and ASCVD risk. Along with sex, race and ethnicity data from the electronic health record were collected to validate the performance of ASCVD risk prediction in subgroups. Data were analyzed from August 2021 to February 2024. Main Outcomes and Measures Consistent with the AHA-PREVENT model, ASCVD events were defined as the first occurrence of either nonfatal myocardial infarction, coronary artery disease, ischemic stroke, or cardiovascular death. Cardiovascular death was coded via government registries. Discrimination, calibration, and risk reclassification were assessed using the Harrell C index, a modified Hosmer-Lemeshow goodness-of-fit test and calibration curves, and reclassification tables, respectively. Results In the test set of 38 137 patients (mean [SD] age, 64.8 [6.9] years, 22 708 [59.5]% women and 15 429 [40.5%] men; 935 [2.5%] Asian, 2153 [5.6%] Black, 1414 [3.7%] Hispanic, 31 400 [82.3%] White, and 2235 [5.9%] other, including American Indian, multiple races, unspecified, and unrecorded, consolidated owing to small numbers), MLM-PREVENT had improved calibration (modified Hosmer-Lemeshow P > .05) compared to the AHA-PREVENT model across risk categories in the overall cohort (χ23 = 2.2; P = .53 vs χ23 > 16.3; P < .001) and sex subgroups (men: χ23 = 2.1; P = .55 vs χ23 > 16.3; P < .001; women: χ23 = 6.5; P = .09 vs. χ23 > 16.3; P < .001), while also surpassing a traditional recalibration approach. MLM-PREVENT maintained or improved AHA-PREVENT's calibration in Asian, Black, and White individuals. Both MLM-PREVENT and AHA-PREVENT performed equally well in discriminating risk (approximate ΔC index, ±0.01). Using a clinically significant 7.5% risk threshold, MLM-PREVENT reclassified a total of 11.5% of patients. We visualize the recalibration through MLM-PREVENT ASCVD risk charts that highlight preserved risk associations of the original AHA-PREVENT model. Conclusions and Relevance The interpretable ML approach presented in this article enhanced the accuracy of the AHA-PREVENT model when applied to a local population while still preserving the risk associations found by the original model. This method has the potential to recalibrate other established risk tools and is implementable in electronic health record systems for improved cardiovascular risk assessment.
Collapse
Affiliation(s)
| | | | - Chunying Li
- Brigham & Women’s Hospital, Boston, Massachusetts
| | - Zareen Farukhi
- Brigham & Women’s Hospital, Boston, Massachusetts
- Massachusetts General Hospital, Boston
| | | | - Brian Cade
- Brigham & Women’s Hospital, Boston, Massachusetts
| | - Lin Chen
- Brigham & Women’s Hospital, Boston, Massachusetts
| | | | - Nina Paynter
- Brigham & Women’s Hospital, Boston, Massachusetts
| | - Samia Mora
- Brigham & Women’s Hospital, Boston, Massachusetts
| | - Olga Demler
- Brigham & Women’s Hospital, Boston, Massachusetts
- ETH Zurich, Zurich, Switzerland
| |
Collapse
|
19
|
Jaltotage B, Dwivedi G. Essentials for AI Research in Cardiology: Challenges and Mitigations. CJC Open 2024; 6:1334-1341. [PMID: 39582710 PMCID: PMC11583857 DOI: 10.1016/j.cjco.2024.07.015] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2024] [Accepted: 07/29/2024] [Indexed: 11/26/2024] Open
Abstract
Technology using artificial intelligence (AI) is flourishing; the same advancements can be seen in health care. Cardiology in particular is well placed to take advantage of AI because of the data-intensive nature of the field and the current strain on existing resources in the management of cardiovascular disease. With AI nearing the stage of routine implementation into clinical care, considerations need to be made to ensure the software is effective and safe. The benefits of AI are well established, but the challenges and ethical considerations are less well understood. As a result, there is currently a lack of consensus on what the essential components are in an AI study. In this review we aim to assess and provide greater clarity on the challenges encountered in conducting AI studies and explore potential mitigations that could facilitate the successful integration of AI in the management of cardiovascular disease.
Collapse
Affiliation(s)
- Biyanka Jaltotage
- Department of Cardiology, Fiona Stanley Hospital, Perth, Western Australia, Australia
| | - Girish Dwivedi
- Department of Cardiology, Fiona Stanley Hospital, Perth, Western Australia, Australia
- Harry Perkins Institute of Medical Research, Perth, Western Australia, Australia
- School of Medicine, The University of Western Australia, Perth, Western Australia, Australia
| |
Collapse
|
20
|
Lee J, Duperrex E, El-Battrawy I, Hohn A, Saguner AM, Duru F, Emmenegger V, Cyganek L, Hierlemann A, Ulusan H. CardioMEA: comprehensive data analysis platform for studying cardiac diseases and drug responses. Front Physiol 2024; 15:1472126. [PMID: 39539954 PMCID: PMC11557525 DOI: 10.3389/fphys.2024.1472126] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2024] [Accepted: 10/18/2024] [Indexed: 11/16/2024] Open
Abstract
Introduction In recent years, high-density microelectrode arrays (HD-MEAs) have emerged as a valuable tool in preclinical research for characterizing the electrophysiology of human induced pluripotent stem-cell-derived cardiomyocytes (iPSC-CMs). HD-MEAs enable the capturing of both extracellular and intracellular signals on a large scale, while minimizing potential damage to the cell. However, despite technological advancements of HD-MEAs, there is a lack of effective data-analysis platforms that are capable of processing and analyzing the data, particularly in the context of cardiac arrhythmias and drug testing. Methods To address this need, we introduce CardioMEA, a comprehensive data-analysis platform designed specifically for HD-MEA data that have been obtained from iPSCCMs. CardioMEA features scalable data processing pipelines and an interactive web-based dashboard for advanced visualization and analysis. In addition to its core functionalities, CardioMEA incorporates modules designed to discern crucial electrophysiological features between diseased and healthy iPSC-CMs. Notably, CardioMEA has the unique capability to analyze both extracellular and intracellular signals, thereby facilitating customized analyses for specific research tasks. Results and discussion We demonstrate the practical application of CardioMEA by analyzing electrophysiological signals from iPSC-CM cultures exposed to seven antiarrhythmic drugs. CardioMEA holds great potential as an intuitive, userfriendly platform for studying cardiac diseases and assessing drug effects.
Collapse
Affiliation(s)
- Jihyun Lee
- Department of Biosystems Science and Engineering, ETH Zürich, Basel, Switzerland
| | - Eliane Duperrex
- Department of Biosystems Science and Engineering, ETH Zürich, Basel, Switzerland
| | - Ibrahim El-Battrawy
- St. Josef Hospital, Ruhr-University Bochum, Bochum, Germany
- German Center for Cardiovascular Research, Partner Site Heidelberg-Mannheim and Göttingen, Heidelberg, Germany
- Department of Molecular and Experimental Cardiology, Institut für Forschung und Lehre (IFL), Ruhr-University Bochum, Bochum, Germany
- Institute of Physiology, Ruhr-University Bochum, Bochum, Germany
| | - Alyssa Hohn
- German Center for Cardiovascular Research, Partner Site Heidelberg-Mannheim and Göttingen, Heidelberg, Germany
| | - Ardan M. Saguner
- Department of Cardiology, Electrophysiology Division, University Heart Center Zurich, Zurich, Switzerland
- Center for Translational and Experimental Cardiology (CTEC), University of Zürich, Zurich, Switzerland
| | - Firat Duru
- Department of Cardiology, Electrophysiology Division, University Heart Center Zurich, Zurich, Switzerland
- Center for Translational and Experimental Cardiology (CTEC), University of Zürich, Zurich, Switzerland
| | - Vishalini Emmenegger
- Department of Biosystems Science and Engineering, ETH Zürich, Basel, Switzerland
| | - Lukas Cyganek
- German Center for Cardiovascular Research, Partner Site Heidelberg-Mannheim and Göttingen, Heidelberg, Germany
- Stem Cell Unit, Clinic for Cardiology and Pneumology, University Medical Center Göttingen, Göttingen, Germany
| | - Andreas Hierlemann
- Department of Biosystems Science and Engineering, ETH Zürich, Basel, Switzerland
| | - Hasan Ulusan
- Department of Biosystems Science and Engineering, ETH Zürich, Basel, Switzerland
| |
Collapse
|
21
|
Dabbah S, Mishani I, Davidov Y, Ben Ari Z. Implementation of Machine Learning Algorithms to Screen for Advanced Liver Fibrosis in Metabolic Dysfunction-Associated Steatotic Liver Disease: An In-Depth Explanatory Analysis. Digestion 2024:1-14. [PMID: 39462487 DOI: 10.1159/000542241] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/18/2024] [Accepted: 10/21/2024] [Indexed: 10/29/2024]
Abstract
INTRODUCTION This study aimed to train machine learning algorithms (MLAs) to detect advanced fibrosis (AF) in metabolic dysfunction-associated steatotic liver disease (MASLD) patients at the level of primary care setting and to explain the predictions to ensure responsible use by clinicians. METHODS Readily available features of 618 MASLD patients followed up at a tertiary center were used to train five MLAs. AF was defined as liver stiffness ≥9.3 kPa, measured via 2-dimension shear wave elastography (n = 495) or liver biopsy ≥F3 (n = 123). MLAs were compared to Fibrosis-4 index (FIB-4) and non-alcoholic fatty liver disease (NAFLD) fibrosis score (NFS) on 540 MASLD patients from the primary care setting as validation. Feature importance, partial dependence, and shapely additive explanations (SHAPs) were utilized for explanation. RESULTS Extreme gradient boosting (XGBoost) achieved an AUC = 0.91, outperforming FIB-4 (AUC = 0.78) and NFS (AUC = 0.81, both p < 0.05) with specificity = 76% versus 59% and 48% for FIB-4 ≥1.3 and NFS ≥-1.45, respectively (p < 0.05). Its sensitivity (91%) was superior to FIB-4 (79%). XGBoost confidently excluded AF (negative predictive value = 99%) with the highest positive predictive value (31%), superior to FIB-4 and NFS (all p < 0.05). The most important features were HbA1c and gamma glutamyl transpeptidase (GGT) with a steep increase in AF probability at HbA1c >6.5%. The strongest interaction was between AST and age. XGBoost, but not logistic regression, extracted informative patterns from ALT, low-density lipoprotein cholesterol, and alkaline phosphatase (p < 0.001). One-quarter of the false positives (FPs) were correctly reclassified with only one additional false negative based on the SHAP values of GGT, platelets, and ALT which were found to be associated with a FP classification. CONCLUSIONS An explainable XGBoost algorithm was demonstrated superior to FIB-4 and NFS for screening of AF in MASLD patients at the primary care setting. The algorithm also proved safe for use as clinicians can understand the predictions and flag FP classifications.
Collapse
Affiliation(s)
- Shoham Dabbah
- Liver Diseases Center, Sheba Medical Center, Ramat Gan, Israel
- Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- Division of Internal Medicine, Sheba Medical Center, Ramat Gan, Israel
| | - Itamar Mishani
- Robotics Institute, Carnegie-Mellon University, Pittsburgh, Pennsylvania, USA
| | - Yana Davidov
- Liver Diseases Center, Sheba Medical Center, Ramat Gan, Israel
- Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- Division of Internal Medicine, Sheba Medical Center, Ramat Gan, Israel
| | - Ziv Ben Ari
- Liver Diseases Center, Sheba Medical Center, Ramat Gan, Israel
- Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
| |
Collapse
|
22
|
Nwafor CN, Nwafor O, Brahma S. Enhancing transparency and fairness in automated credit decisions: an explainable novel hybrid machine learning approach. Sci Rep 2024; 14:25174. [PMID: 39448646 PMCID: PMC11502870 DOI: 10.1038/s41598-024-75026-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2024] [Accepted: 10/01/2024] [Indexed: 10/26/2024] Open
Abstract
This paper uses a generalised stacking method to introduce a novel hybrid model that combines a one-dimensional convolutional neural network 1DCNN with extreme gradient boosting XGBoost. We compared the predictive accuracies of the proposed hybrid architecture with three conventional algorithms-1DCNN, XGBoost and logistic regression (LR) using a dataset of over twenty thousand peer-to-peer (P2P) consumer credit observations. By leveraging the SHAP algorithm, the research provides a detailed analysis of feature importance, contributing to the model's predictions and offering insights into the overall and individual significance of different features. The findings demonstrate that the hybrid model outperforms the LR, XGBoost and 1DCNN models in terms of classification accuracy. Furthermore, the research addresses concern regarding fairness and bias by showing that removing potentially discriminatory features, such as age and gender, does not significantly impact the hybrid model's classification capabilities. This suggests that fair and unbiased credit scoring models can achieve high effectiveness levels without compromising accuracy. This paper makes significant contributions to academic research and practical applications in credit risk management by presenting a hybrid model that offers superior classification accuracy and promotes interpretability using the model agnostic SHAP framework.
Collapse
Affiliation(s)
- Chioma Ngozi Nwafor
- Glasgow School for Business and Society, Department of Finance, Accountancy and Risk, Glasgow Caledonia University, Glasgow, Scotland.
| | - Obumneme Nwafor
- School of Computing, Engineering and Built Environment, Glasgow Caledonian University, Glasgow, Scotland
| | - Sanjukta Brahma
- Glasgow School for Business and Society, Department of Finance, Accountancy and Risk, Glasgow Caledonia University, Glasgow, Scotland
| |
Collapse
|
23
|
Guo K, Chaudhari N, Jafar T, Chowdhury N, Bogdan P, Irimia A. Anatomic Interpretability in Neuroimage Deep Learning: Saliency Approaches for Typical Aging and Traumatic Brain Injury. RESEARCH SQUARE 2024:rs.3.rs-4960427. [PMID: 39483910 PMCID: PMC11527355 DOI: 10.21203/rs.3.rs-4960427/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/03/2024]
Abstract
The black box nature of deep neural networks (DNNs) makes researchers and clinicians hesitant to rely on their findings. Saliency maps can enhance DNN explainability by suggesting the anatomic localization of relevant brain features. This study compares seven popular attribution-based saliency approaches to assign neuroanatomic interpretability to DNNs that estimate biological brain age (BA) from magnetic resonance imaging (MRI). Cognitively normal (CN) adults ( N = 13,394 , 5,900 males; mean age: 65.82 ± 8.89 years) are included for DNN training, testing, validation, and saliency map generation to estimate BA. To study saliency robustness to the presence of anatomic deviations from normality, saliency maps are also generated for adults with mild traumatic brain injury (mTBI, N = 214 , 135 males; mean age: 55.3 ± 9.9 years). We assess saliency methods' capacities to capture known anatomic features of brain aging and compare them to a surrogate ground truth whose anatomic saliency is known a priori. Anatomic aging features are identified most reliably by the integrated gradients method, which outperforms all others through its ability to localize relevant anatomic features. Gradient Shapley additive explanations, input × gradient, and masked gradient perform less consistently but still highlight ubiquitous neuroanatomic features of aging (ventricle dilation, hippocampal atrophy, sulcal widening). Saliency methods involving gradient saliency, guided backpropagation, and guided gradient-weight class attribution mapping localize saliency outside the brain, which is undesirable. Our research suggests the relative tradeoffs of saliency methods to interpret DNN findings during BA estimation in typical aging and after mTBI.
Collapse
Affiliation(s)
- Kevin Guo
- Thomas Lord Department of Computer Science, Viterbi School of Engineering, University of Southern California
| | - Nikhil Chaudhari
- Corwin D. Denney Research Center, Department of Biomedical Engineering, Viterbi School of Engineering, University of Southern California
| | - Tamara Jafar
- Neuroscience Graduate Program, University of Southern California
| | - Nahian Chowdhury
- Neuroscience Graduate Program, University of Southern California
| | - Paul Bogdan
- Ming Hsieh Department of Electrical and Computer Engineering, Viterbi School of Engineering, University of Southern California
| | - Andrei Irimia
- Ethel Percy Andrus Gerontology Center, Leonard Davis School of Gerontology, University of Southern California
| |
Collapse
|
24
|
Wang M, Hu Z, Wang Z, Chen H, Xu X, Zheng S, Yao Y, Li J. Interpretable Clinical Decision-Making Application for Etiological Diagnosis of Ventricular Tachycardia Based on Machine Learning. Diagnostics (Basel) 2024; 14:2291. [PMID: 39451614 PMCID: PMC11506268 DOI: 10.3390/diagnostics14202291] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2024] [Revised: 10/05/2024] [Accepted: 10/14/2024] [Indexed: 10/26/2024] Open
Abstract
Background: Ventricular tachycardia (VT) can broadly be categorised into ischemic heart disease, non-ischemic structural heart disease, and idiopathic VT. There are few studies related to the application of machine learning for the etiological diagnosis of VT, and the interpretable methods are still in the exploratory stage for clinical decision-making applications. Objectives: The aim is to propose a machine learning model for the etiological diagnosis of VT. Interpretable results based on models are compared with expert knowledge, and interpretable evaluation protocols for clinical decision-making applications are developed. Methods: A total of 1305 VT patient data from 1 January 2013 to 1 September 2023 at the Arrhythmia Centre of Fuwai Hospital were included in the study. Clinical data collected during hospitalisation included demographics, medical history, vital signs, echocardiographic results, and laboratory test outcomes. Results: The XGBoost model demonstrated the best performance in VT etiological diagnosis (precision, recall, and F1 were 88.4%, 88.5%, and 88.4%, respectively). A total of four interpretable machine learning methods applicable to clinical decision-making were evaluated in terms of visualisation, clinical usability, clinical applicability, and efficiency with expert knowledge interpretation. Conclusions: The XGBoost model demonstrated superior performance in the etiological diagnosis of VT, and SHAP and decision tree interpretable methods are more favoured by clinicians for decision-making.
Collapse
Affiliation(s)
- Min Wang
- Institute of Medical Information/Library, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100020, China; (M.W.); (H.C.)
| | - Zhao Hu
- Chinese Academy of Medical Sciences & Peking Union Medical College/National Center for Cardiovascular Diseases, Fuwai Hospital, Beijing 100037, China;
| | - Ziyang Wang
- Institute of Medical Information/Library, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100020, China; (M.W.); (H.C.)
| | - Haoran Chen
- Institute of Medical Information/Library, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100020, China; (M.W.); (H.C.)
| | - Xiaowei Xu
- Institute of Medical Information/Library, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100020, China; (M.W.); (H.C.)
| | - Si Zheng
- Institute of Medical Information/Library, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100020, China; (M.W.); (H.C.)
| | - Yan Yao
- Chinese Academy of Medical Sciences & Peking Union Medical College/National Center for Cardiovascular Diseases, Fuwai Hospital, Beijing 100037, China;
| | - Jiao Li
- Institute of Medical Information/Library, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100020, China; (M.W.); (H.C.)
| |
Collapse
|
25
|
Griné M, Guerreiro C, Moscoso Costa F, Nobre Menezes M, Ladeiras-Lopes R, Ferreira D, Oliveira-Santos M. Digital health in cardiovascular medicine: An overview of key applications and clinical impact by the Portuguese Society of Cardiology Study Group on Digital Health. Rev Port Cardiol 2024:S0870-2551(24)00283-X. [PMID: 39393635 DOI: 10.1016/j.repc.2024.08.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2024] [Revised: 07/26/2024] [Accepted: 08/01/2024] [Indexed: 10/13/2024] Open
Abstract
Digital health interventions including telehealth, mobile health, artificial intelligence, big data, robotics, extended reality, computational and high-fidelity bench simulations are an integral part of the path toward precision medicine. Current applications encompass risk factor modification, chronic disease management, clinical decision support, diagnostics interpretation, preprocedural planning, evidence generation, education, and training. Despite the acknowledged potential, their development and implementation have faced several challenges and constraints, meaning few digital health tools have reached daily clinical practice. As a result, the Portuguese Society of Cardiology Study Group on Digital Health set out to outline the main digital health applications, address some of the roadblocks hampering large-scale deployment, and discuss future directions in support of cardiovascular health at large.
Collapse
Affiliation(s)
- Mafalda Griné
- Serviço de Cardiologia, Hospitais da Universidade de Coimbra, Unidade Local de Saúde de Coimbra, Coimbra, Portugal.
| | - Cláudio Guerreiro
- Serviço de Cardiologia, Centro Hospitalar de Vila Nova de Gaia, Vila Nova de Gaia, Portugal
| | | | - Miguel Nobre Menezes
- Structural and Coronary Heart Disease Unit, Cardiovascular Center of the University of Lisbon, Faculdade de Medicina, Universidade de Lisboa, Lisboa, Portugal; Serviço de Cardiologia, Departamento de Coração e Vasos, CHULN Hospital de Santa Maria, Lisboa, Portugal
| | - Ricardo Ladeiras-Lopes
- UnIC@RISE, Cardiovascular Research and Development Center, Department of Surgery and Physiology, Faculdade de Medicina, Universidade do Porto, Porto, Portugal; Hospital da Luz, Lisboa, Portugal
| | - Daniel Ferreira
- Serviço de Medicina Intensiva, Hospital da Luz, Lisboa, Portugal; Hospital da Luz Digital, Lisboa, Portugal
| | - Manuel Oliveira-Santos
- Serviço de Cardiologia, Hospitais da Universidade de Coimbra, Unidade Local de Saúde de Coimbra, Coimbra, Portugal; Faculdade de Medicina, Universidade de Coimbra, Coimbra, Portugal
| |
Collapse
|
26
|
Wang X, Borjesson T, Wetterlind J, van der Fels-Klerx HJ. Prediction of deoxynivalenol contamination in spring oats in Sweden using explainable artificial intelligence. NPJ Sci Food 2024; 8:75. [PMID: 39366957 PMCID: PMC11452490 DOI: 10.1038/s41538-024-00310-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2024] [Accepted: 09/15/2024] [Indexed: 10/06/2024] Open
Abstract
Weather conditions and agronomical factors are known to affect Fusarium spp. growth and ultimately deoxynivalenol (DON) contamination in oat. This study aimed to develop predictive models for the contamination of spring oat at harvest with DON on a regional basis in Sweden using machine-learning algorithms. Three models were developed as regional risk-assessment tools for farmers, crop collectors, and food safety inspectors, respectively. Data included: weather data from different oat growing periods, agronomical data, site-specific data, and DON contamination data from the previous year. Results showed that: (1) RF models were able to predict DON contamination at harvest with a total classification accuracy of minimal 0.72; (2) good predictions could already be made in June; (3) rainfall, relative humidity, and wind speed in different oat growing stages, followed by crop variety and elevation were the most important features for predicting DON contamination in spring oats at harvest.
Collapse
Affiliation(s)
- X Wang
- Business Economics Group, Wageningen University, Hollandseweg 1, Wageningen, the Netherlands.
- Wageningen Food Safety Research, Akkermaalsbos 2, Wageningen, the Netherlands.
| | - T Borjesson
- Agroväst Livsmedel AB, PO Box 234, Skara, Sweden
| | - J Wetterlind
- Department of Soil and Environment, Swedish, University of Agricultural Sciences, Skara, Sweden
| | - H J van der Fels-Klerx
- Business Economics Group, Wageningen University, Hollandseweg 1, Wageningen, the Netherlands
- Wageningen Food Safety Research, Akkermaalsbos 2, Wageningen, the Netherlands
| |
Collapse
|
27
|
Aramruang T, Malhotra A, Numthavaj P, Looareesuwan P, Anothaisintawee T, Dejthevaporn C, Sirirutbunkajorn N, Attia J, Thakkinstian A. Prediction models for identifying medication overuse or medication overuse headache in migraine patients: a systematic review. J Headache Pain 2024; 25:165. [PMID: 39363297 PMCID: PMC11450990 DOI: 10.1186/s10194-024-01874-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2024] [Accepted: 09/22/2024] [Indexed: 10/05/2024] Open
Abstract
BACKGROUND Migraine is a debilitating neurological disorder that presents significant management challenges, resulting in underdiagnosis and inappropriate treatments, leaving patients at risk of medication overuse (MO). MO contributes to disease progression and the development of medication overuse headache (MOH). Predicting which migraine patients are at risk of MO/MOH is crucial for effective management. Thus, this systematic review aims to review and critique available prediction models for MO/MOH in migraine patients. METHODS A systematic search was conducted using Embase, Scopus, Medline/PubMed, ACM Digital Library, and IEEE databases from inception to April 22, 2024. The risk of bias was assessed using the prediction model risk of bias assessment tool. RESULTS Out of 1,579 articles, six studies with nine models met the inclusion criteria. Three studies developed new prediction models, while the remaining validated existing scores. Most studies utilized cross-sectional and prospective data collection in specific headache settings and migraine types. The models included up to 53 predictors, with sample sizes from 17 to 1,419 participants. Traditional statistical models (logistic regression and least absolute shrinkage and selection operator regression) were used in two studies, while one utilized a machine learning (ML) technique (support vector machines). Receiver operating characteristic analysis was employed to validate existing scores. The area under the receiver operating characteristic (AUROC) for the ML model (0.83) outperformed the traditional statistical model (0.62) in internal validation. The AUROCs ranged from 0.84 to 0.85 for the validation of existing scores. Common predictors included age and gender; genetic data and questionnaire evaluations were also included. All studies demonstrated a high risk of bias in model construction and high concerns regarding applicability to participants. CONCLUSION This review identified promising results for MO/MOH prediction models in migraine patients, although the field remains limited. Future research should incorporate important risk factors, assess discrimination and calibration, and perform external validation. Further studies with robust designs, appropriate settings, high-quality and quantity data, and rigorous methodologies are necessary to advance this field.
Collapse
Affiliation(s)
- Teerapong Aramruang
- Department of Clinical Epidemiology and Biostatistics, Faculty of Medicine, Ramathibodi Hospital, Mahidol University, Bangkok, Thailand
- Department of Pharmacy, Faculty of Pharmacy, Mahidol University, Bangkok, Thailand
| | | | - Pawin Numthavaj
- Department of Clinical Epidemiology and Biostatistics, Faculty of Medicine, Ramathibodi Hospital, Mahidol University, Bangkok, Thailand.
| | - Panu Looareesuwan
- Department of Clinical Epidemiology and Biostatistics, Faculty of Medicine, Ramathibodi Hospital, Mahidol University, Bangkok, Thailand
| | - Thunyarat Anothaisintawee
- Department of Clinical Epidemiology and Biostatistics, Faculty of Medicine, Ramathibodi Hospital, Mahidol University, Bangkok, Thailand
| | - Charungthai Dejthevaporn
- Department of Medicine, Faculty of Medicine, Ramathibodi Hospital, Mahidol University, Bangkok, Thailand
| | - Nat Sirirutbunkajorn
- Department of Diagnostic and Therapeutic Radiology, Division of Radiation Oncology, Faculty of Medicine, Ramathibodi Hospital, Mahidol University, Bangkok, Thailand
| | - John Attia
- Centre for Clinical Epidemiology and Biostatistics, School of Medicine and Public Health, University of Newcastle, Newcastle, NSW, Australia
| | - Ammarin Thakkinstian
- Department of Clinical Epidemiology and Biostatistics, Faculty of Medicine, Ramathibodi Hospital, Mahidol University, Bangkok, Thailand
| |
Collapse
|
28
|
Marvasti TB, Gao Y, Murray KR, Hershman S, McIntosh C, Moayedi Y. Unlocking Tomorrow's Health Care: Expanding the Clinical Scope of Wearables by Applying Artificial Intelligence. Can J Cardiol 2024; 40:1934-1945. [PMID: 39025363 DOI: 10.1016/j.cjca.2024.07.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2024] [Revised: 07/11/2024] [Accepted: 07/12/2024] [Indexed: 07/20/2024] Open
Abstract
As an integral aspect of health care, digital technology has enabled modelling of complex relationships to detect, screen, diagnose, and predict patient outcomes. With massive data sets, artificial intelligence (AI) can have marked effects on 3 levels: for patients, clinicians, and health systems. In this review, we discuss contemporary AI-enabled wearable devices undergoing research in the field of cardiovascular medicine. These include devices such as smart watches, electrocardiogram patches, and smart textiles such as smart socks and chest sensors for diagnosis, management, and prognostication of conditions such as atrial fibrillation, heart failure, and hypertension as well as monitoring for cardiac rehabilitation. We review the evolution of machine learning algorithms used in wearable devices from random forest models to the use of convolutional neural networks and transformers. We further discuss frameworks for wearable technologies such as the V3-stage process of verification, analytical validation, and clinical validation as well as challenges of AI integration in medicine such as data veracity, validity, and security and provide a reference framework to maintain fairness and equity. Last, clinician and patient perspectives are discussed to highlight the importance of considering end-user feedback in development and regulatory processes.
Collapse
Affiliation(s)
- Tina Binesh Marvasti
- Ted Rogers Centre for Heart Research, Ajmera Transplant Centre, University of Toronto, Toronto, Ontario, Canada
| | - Yuan Gao
- Ted Rogers Centre for Heart Research, Ajmera Transplant Centre, University of Toronto, Toronto, Ontario, Canada
| | - Kevin R Murray
- Ted Rogers Centre for Heart Research, Ajmera Transplant Centre, University of Toronto, Toronto, Ontario, Canada
| | - Steve Hershman
- Ted Rogers Centre for Heart Research, Ajmera Transplant Centre, University of Toronto, Toronto, Ontario, Canada
| | - Chris McIntosh
- Ted Rogers Centre for Heart Research, Ajmera Transplant Centre, University of Toronto, Toronto, Ontario, Canada
| | - Yasbanoo Moayedi
- Ted Rogers Centre for Heart Research, Ajmera Transplant Centre, University of Toronto, Toronto, Ontario, Canada; Ajmera Transplant Centre, University of Toronto, Toronto, Ontario, Canada.
| |
Collapse
|
29
|
Guo KH, Chaudhari NN, Jafar T, Chowdhury NF, Bogdan P, Irimia A. Anatomic Interpretability in Neuroimage Deep Learning: Saliency Approaches for Typical Aging and Traumatic Brain Injury. Neuroinformatics 2024; 22:591-606. [PMID: 39503843 PMCID: PMC11579113 DOI: 10.1007/s12021-024-09694-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/15/2024] [Indexed: 11/13/2024]
Abstract
The black box nature of deep neural networks (DNNs) makes researchers and clinicians hesitant to rely on their findings. Saliency maps can enhance DNN explainability by suggesting the anatomic localization of relevant brain features. This study compares seven popular attribution-based saliency approaches to assign neuroanatomic interpretability to DNNs that estimate biological brain age (BA) from magnetic resonance imaging (MRI). Cognitively normal (CN) adults (N = 13,394, 5,900 males; mean age: 65.82 ± 8.89 years) are included for DNN training, testing, validation, and saliency map generation to estimate BA. To study saliency robustness to the presence of anatomic deviations from normality, saliency maps are also generated for adults with mild traumatic brain injury (mTBI, N = 214, 135 males; mean age: 55.3 ± 9.9 years). We assess saliency methods' capacities to capture known anatomic features of brain aging and compare them to a surrogate ground truth whose anatomic saliency is known a priori. Anatomic aging features are identified most reliably by the integrated gradients method, which outperforms all others through its ability to localize relevant anatomic features. Gradient Shapley additive explanations, input × gradient, and masked gradient perform less consistently but still highlight ubiquitous neuroanatomic features of aging (ventricle dilation, hippocampal atrophy, sulcal widening). Saliency methods involving gradient saliency, guided backpropagation, and guided gradient-weight class attribution mapping localize saliency outside the brain, which is undesirable. Our research suggests the relative tradeoffs of saliency methods to interpret DNN findings during BA estimation in typical aging and after mTBI.
Collapse
Affiliation(s)
- Kevin H Guo
- Thomas Lord Department of Computer Science, Viterbi School of Engineering, University of Southern California, Los Angeles, CA, 90089, USA
- Ethel Percy Andrus Gerontology Center, Leonard Davis School of Gerontology, University of Southern California, Los Angeles, CA, 90089, USA
| | - Nikhil N Chaudhari
- Ethel Percy Andrus Gerontology Center, Leonard Davis School of Gerontology, University of Southern California, Los Angeles, CA, 90089, USA
- Corwin D. Denney Research Center, Department of Biomedical Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, CA, 90089, USA
| | - Tamara Jafar
- Ethel Percy Andrus Gerontology Center, Leonard Davis School of Gerontology, University of Southern California, Los Angeles, CA, 90089, USA
- Neuroscience Graduate Program, University of Southern California, Los Angeles, CA, 90089, USA
| | - Nahian F Chowdhury
- Ethel Percy Andrus Gerontology Center, Leonard Davis School of Gerontology, University of Southern California, Los Angeles, CA, 90089, USA
- Neuroscience Graduate Program, University of Southern California, Los Angeles, CA, 90089, USA
| | - Paul Bogdan
- Ming Hsieh Department of Electrical and Computer Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, CA, 90089, USA
| | - Andrei Irimia
- Ethel Percy Andrus Gerontology Center, Leonard Davis School of Gerontology, University of Southern California, Los Angeles, CA, 90089, USA.
- Corwin D. Denney Research Center, Department of Biomedical Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, CA, 90089, USA.
- Department of Quantitative and Computational Biology, Dornsife College of Arts and Sciences, University of Southern California, Los Angeles, CA, 90089, USA.
- Centre for Healthy Brain Aging, Institute of Psychiatry, Psychology & Neuroscience, King's College London, 16 de Crespigny Park, London, SE5 8AF, UK.
| |
Collapse
|
30
|
Athar M. Potentials of artificial intelligence in familial hypercholesterolemia: Advances in screening, diagnosis, and risk stratification for early intervention and treatment. Int J Cardiol 2024; 412:132315. [PMID: 38972488 DOI: 10.1016/j.ijcard.2024.132315] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/13/2024] [Revised: 05/21/2024] [Accepted: 07/01/2024] [Indexed: 07/09/2024]
Abstract
Familial hypercholesterolemia (FH) poses a global health challenge due to high incidence rates and underdiagnosis, leading to increased risks of early-onset atherosclerosis and cardiovascular diseases. Early detection and treatment of FH is critical in reducing the risk of cardiovascular events and improving the long-term outcomes and quality of life for affected individuals and their families. Traditional therapeutic approaches revolve around lipid-lowering interventions, yet challenges persist, particularly in accurate and timely diagnosis. The current diagnostic landscape heavily relies on genetic testing of specific LDL-C metabolism genes, often limited to specialized centers. This constraint has led to the adoption of alternative clinical scores for FH diagnosis. However, the rapid advancements in artificial intelligence (AI) and machine learning (ML) present promising solutions to these diagnostic challenges. This review explores the intricacies of FH, highlighting the challenges that are encountered in the diagnosis and management of the disorder. The revolutionary potential of ML, particularly in large-scale population screening, is highlighted. Applications of ML in FH screening, diagnosis, and risk stratification are discussed, showcasing its ability to outperform traditional criteria. However, challenges and ethical considerations, including algorithmic stability, data quality, privacy, and consent issues, are crucial areas that require attention. The review concludes by emphasizing the significant promise of AI and ML in FH management while underscoring the need for ethical and practical vigilance to ensure responsible and effective integration into healthcare practices.
Collapse
Affiliation(s)
- Mohammad Athar
- Science and Technology Unit, Umm Al-Qura University, Makkah, Saudi Arabia; Department of Medical Genetics, Faculty of Medicine, Umm Al-Qura University, Makkah, Saudi Arabia.
| |
Collapse
|
31
|
Xu L, Li C, Gao S, Zhao L, Guan C, Shen X, Zhu Z, Guo C, Zhang L, Yang C, Bu Q, Zhou B, Xu Y. Personalized Prediction of Long-Term Renal Function Prognosis Following Nephrectomy Using Interpretable Machine Learning Algorithms: Case-Control Study. JMIR Med Inform 2024; 12:e52837. [PMID: 39303280 PMCID: PMC11452755 DOI: 10.2196/52837] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Revised: 04/08/2024] [Accepted: 07/21/2024] [Indexed: 09/22/2024] Open
Abstract
BACKGROUND Acute kidney injury (AKI) is a common adverse outcome following nephrectomy. The progression from AKI to acute kidney disease (AKD) and subsequently to chronic kidney disease (CKD) remains a concern; yet, the predictive mechanisms for these transitions are not fully understood. Interpretable machine learning (ML) models offer insights into how clinical features influence long-term renal function outcomes after nephrectomy, providing a more precise framework for identifying patients at risk and supporting improved clinical decision-making processes. OBJECTIVE This study aimed to (1) evaluate postnephrectomy rates of AKI, AKD, and CKD, analyzing long-term renal outcomes along different trajectories; (2) interpret AKD and CKD models using Shapley Additive Explanations values and Local Interpretable Model-Agnostic Explanations algorithm; and (3) develop a web-based tool for estimating AKD or CKD risk after nephrectomy. METHODS We conducted a retrospective cohort study involving patients who underwent nephrectomy between July 2012 and June 2019. Patient data were randomly split into training, validation, and test sets, maintaining a ratio of 76.5:8.5:15. Eight ML algorithms were used to construct predictive models for postoperative AKD and CKD. The performance of the best-performing models was assessed using various metrics. We used various Shapley Additive Explanations plots and Local Interpretable Model-Agnostic Explanations bar plots to interpret the model and generated directed acyclic graphs to explore the potential causal relationships between features. Additionally, we developed a web-based prediction tool using the top 10 features for AKD prediction and the top 5 features for CKD prediction. RESULTS The study cohort comprised 1559 patients. Incidence rates for AKI, AKD, and CKD were 21.7% (n=330), 15.3% (n=238), and 10.6% (n=165), respectively. Among the evaluated ML models, the Light Gradient-Boosting Machine (LightGBM) model demonstrated superior performance, with an area under the receiver operating characteristic curve of 0.97 for AKD prediction and 0.96 for CKD prediction. Performance metrics and plots highlighted the model's competence in discrimination, calibration, and clinical applicability. Operative duration, hemoglobin, blood loss, urine protein, and hematocrit were identified as the top 5 features associated with predicted AKD. Baseline estimated glomerular filtration rate, pathology, trajectories of renal function, age, and total bilirubin were the top 5 features associated with predicted CKD. Additionally, we developed a web application using the LightGBM model to estimate AKD and CKD risks. CONCLUSIONS An interpretable ML model effectively elucidated its decision-making process in identifying patients at risk of AKD and CKD following nephrectomy by enumerating critical features. The web-based calculator, found on the LightGBM model, can assist in formulating more personalized and evidence-based clinical strategies.
Collapse
Affiliation(s)
- Lingyu Xu
- Department of Nephrology, The Affiliated Hospital of Qingdao University, Qingdao, China
| | - Chenyu Li
- Department of Nephrology, The Affiliated Hospital of Qingdao University, Qingdao, China
- Medizinische Klinik und Poliklinik IV, Klinikum der Universität, Munich, Germany
| | - Shuang Gao
- Ocean University of China, Qingdao, CN, Qingdao, China
| | - Long Zhao
- Department of Nephrology, The Affiliated Hospital of Qingdao University, Qingdao, China
| | - Chen Guan
- Department of Nephrology, The Affiliated Hospital of Qingdao University, Qingdao, China
| | - Xuefei Shen
- Department of Nephrology, The Affiliated Hospital of Qingdao University, Qingdao, China
| | - Zhihui Zhu
- Center of Structural Heart Disease, Beijing Anzhen Hospital, Capital Medical University, Beijing, China
| | - Cheng Guo
- Allianz Technology, Allianz, Munich, Germany
| | - Liwei Zhang
- Institute of Diabetes and Regeneration Research, Helmholtz Diabetes Center, Helmholtz Center Munich, Neuherberg, Germany
| | - Chengyu Yang
- Department of Nephrology, The Affiliated Hospital of Qingdao University, Qingdao, China
| | - Quandong Bu
- Department of Nephrology, The Affiliated Hospital of Qingdao University, Qingdao, China
| | - Bin Zhou
- Department of Nephrology, The Affiliated Hospital of Qingdao University, Qingdao, China
| | - Yan Xu
- Department of Nephrology, The Affiliated Hospital of Qingdao University, Qingdao, China
| |
Collapse
|
32
|
Husvogt L, Yaghy A, Camacho A, Lam K, Schottenhamml J, Ploner SB, Fujimoto JG, Waheed NK, Maier A. Ensembling U-Nets for microaneurysm segmentation in optical coherence tomography angiography in patients with diabetic retinopathy. Sci Rep 2024; 14:21520. [PMID: 39277636 PMCID: PMC11401926 DOI: 10.1038/s41598-024-72375-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Accepted: 09/06/2024] [Indexed: 09/17/2024] Open
Abstract
Diabetic retinopathy is one of the leading causes of blindness around the world. This makes early diagnosis and treatment important in preventing vision loss in a large number of patients. Microaneurysms are the key hallmark of the early stage of the disease, non-proliferative diabetic retinopathy, and can be detected using OCT angiography quickly and non-invasively. Screening tools for non-proliferative diabetic retinopathy using OCT angiography thus have the potential to lead to improved outcomes in patients. We compared different configurations of ensembled U-nets to automatically segment microaneurysms from OCT angiography fundus projections. For this purpose, we created a new database to train and evaluate the U-nets, created by two expert graders in two stages of grading. We present the first U-net neural networks using ensembling for the detection of microaneurysms from OCT angiography en face images from the superficial and deep capillary plexuses in patients with non-proliferative diabetic retinopathy trained on a database labeled by two experts with repeats.
Collapse
Affiliation(s)
- Lennart Husvogt
- Pattern Recognition Lab, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91058, Erlangen , Germany.
| | - Antonio Yaghy
- New England Eye Center, Tufts School of Medicine, Boston, MA, 02111, USA
| | - Alex Camacho
- New England Eye Center, Tufts School of Medicine, Boston, MA, 02111, USA
| | - Kenneth Lam
- New England Eye Center, Tufts School of Medicine, Boston, MA, 02111, USA
| | - Julia Schottenhamml
- Pattern Recognition Lab, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91058, Erlangen , Germany
| | - Stefan B Ploner
- Pattern Recognition Lab, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91058, Erlangen , Germany
| | - James G Fujimoto
- Research Laboratory of Electronics, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
| | - Nadia K Waheed
- New England Eye Center, Tufts School of Medicine, Boston, MA, 02111, USA
| | - Andreas Maier
- Pattern Recognition Lab, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91058, Erlangen , Germany
| |
Collapse
|
33
|
Lin Y, Li C, Li H, Wang X. Can Loneliness be Predicted? Development of a Risk Prediction Model for Loneliness among Elderly Chinese: A Study Based on CLHLS. RESEARCH SQUARE 2024:rs.3.rs-4773143. [PMID: 39281880 PMCID: PMC11398568 DOI: 10.21203/rs.3.rs-4773143/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/18/2024]
Abstract
Background Loneliness is prevalent among the elderly, worsened by global aging trends. It impacts mental and physiological health. Traditional scales for measuring loneliness may be biased due to cognitive decline and varying definitions. Machine learning advancements offer potential improvements in risk prediction models. Methods Data from the 2018 Chinese Longitudinal Healthy Longevity Survey (CLHLS), involving over 16,000 participants aged ≥65 years, were used. The study examined the relationships between loneliness and factors such as cognitive function, functional limitations, living conditions, environmental influences, age-related health issues, and health behaviors. Using R 4.4.1, seven predictive models were developed: logistic regression, ridge regression, support vector machines, K-nearest neighbors, decision trees, random forests, and multi-layer perceptron. Models were evaluated based on ROC curves, accuracy, precision, recall, F1 scores, and AUC. Results Loneliness prevalence among elderly Chinese was 23.4%. Analysis identified 16 predictive factors and evaluated seven models. Logistic regression was the most effective model for predicting loneliness risk due to its economic and operational advantages. Conclusion The study found a 23.4% prevalence of loneliness among elderly individuals in China. SHAP values indicated that higher MMSE scores correlate with lower loneliness levels. Logistic regression was the superior model for predicting loneliness risk in this population.
Collapse
Affiliation(s)
- Youbei Lin
- The First Affiliated Hospital of Jinzhou Medical University
| | | | | | - Xiuli Wang
- The First Affiliated Hospital of Jinzhou Medical University
| |
Collapse
|
34
|
Cálem J, Moreira C, Jorge J. Intelligent systems in healthcare: A systematic survey of explainable user interfaces. Comput Biol Med 2024; 180:108908. [PMID: 39067152 DOI: 10.1016/j.compbiomed.2024.108908] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2024] [Revised: 07/05/2024] [Accepted: 07/15/2024] [Indexed: 07/30/2024]
Abstract
With radiology shortages affecting over half of the global population, the potential of artificial intelligence to revolutionize medical diagnosis and treatment is ever more important. However, lacking trust from medical professionals hinders the widespread adoption of AI models in health sciences. Explainable AI (XAI) aims to increase trust and understanding of black box models by identifying biases and providing transparent explanations. This is the first survey that explores explainable user interfaces (XUI) from a medical domain perspective, analysing the visualization and interaction methods employed in current medical XAI systems. We analysed 42 explainable interfaces following the PRISMA methodology, emphasizing the critical role of effectively conveying information to users as part of the explanation process. We contribute a taxonomy of interface design properties and identify five distinct clusters of research papers. Future research directions include contestability in medical decision support, counterfactual explanations for images, and leveraging Large Language Models to enhance XAI interfaces in healthcare.
Collapse
Affiliation(s)
- João Cálem
- Instituto Superior Técnico, Universidade de Lisboa, Portugal; INESC-ID, Portugal.
| | - Catarina Moreira
- Data Science Institute, University of Technology Sydney, Australia; INESC-ID, Portugal
| | - Joaquim Jorge
- Instituto Superior Técnico, Universidade de Lisboa, Portugal; INESC-ID, Portugal
| |
Collapse
|
35
|
Griot M, Hemptinne C, Vanderdonckt J, Yuksel D. Impact of high-quality, mixed-domain data on the performance of medical language models. J Am Med Inform Assoc 2024; 31:1875-1883. [PMID: 38781312 PMCID: PMC11339514 DOI: 10.1093/jamia/ocae120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Revised: 03/31/2024] [Accepted: 05/08/2024] [Indexed: 05/25/2024] Open
Abstract
OBJECTIVE To optimize the training strategy of large language models for medical applications, focusing on creating clinically relevant systems that efficiently integrate into healthcare settings, while ensuring high standards of accuracy and reliability. MATERIALS AND METHODS We curated a comprehensive collection of high-quality, domain-specific data and used it to train several models, each with different subsets of this data. These models were rigorously evaluated against standard medical benchmarks, such as the USMLE, to measure their performance. Furthermore, for a thorough effectiveness assessment, they were compared with other state-of-the-art medical models of comparable size. RESULTS The models trained with a mix of high-quality, domain-specific, and general data showed superior performance over those trained on larger, less clinically relevant datasets (P < .001). Our 7-billion-parameter model Med5 scores 60.5% on MedQA, outperforming the previous best of 49.3% from comparable models, and becomes the first of its size to achieve a passing score on the USMLE. Additionally, this model retained its proficiency in general domain tasks, comparable to state-of-the-art general domain models of similar size. DISCUSSION Our findings underscore the importance of integrating high-quality, domain-specific data in training large language models for medical purposes. The balanced approach between specialized and general data significantly enhances the model's clinical relevance and performance. CONCLUSION This study sets a new standard in medical language models, proving that a strategically trained, smaller model can outperform larger ones in clinical relevance and general proficiency, highlighting the importance of data quality and expert curation in generative artificial intelligence for healthcare applications.
Collapse
Affiliation(s)
- Maxime Griot
- Institute of NeuroScience, Université catholique de Louvain, Brussels, 1200, Belgium
- Louvain Research Institute in Management and Organizations, Université catholique de Louvain, Louvain-la-Neuve, 1348, Belgium
| | - Coralie Hemptinne
- Ophthalmology, Cliniques Universitaires Saint-Luc, Brussels, 1200, Belgium
| | - Jean Vanderdonckt
- Louvain Research Institute in Management and Organizations, Université catholique de Louvain, Louvain-la-Neuve, 1348, Belgium
| | - Demet Yuksel
- Institute of NeuroScience, Université catholique de Louvain, Brussels, 1200, Belgium
- Medical Information Department, Cliniques Universitaires Saint-Luc, Brussels, 1200, Belgium
| |
Collapse
|
36
|
Wong CYT, O'Byrne C, Taribagil P, Liu T, Antaki F, Keane PA. Comparing code-free and bespoke deep learning approaches in ophthalmology. Graefes Arch Clin Exp Ophthalmol 2024; 262:2785-2798. [PMID: 38446200 PMCID: PMC11377500 DOI: 10.1007/s00417-024-06432-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2023] [Revised: 02/13/2024] [Accepted: 02/27/2024] [Indexed: 03/07/2024] Open
Abstract
AIM Code-free deep learning (CFDL) allows clinicians without coding expertise to build high-quality artificial intelligence (AI) models without writing code. In this review, we comprehensively review the advantages that CFDL offers over bespoke expert-designed deep learning (DL). As exemplars, we use the following tasks: (1) diabetic retinopathy screening, (2) retinal multi-disease classification, (3) surgical video classification, (4) oculomics and (5) resource management. METHODS We performed a search for studies reporting CFDL applications in ophthalmology in MEDLINE (through PubMed) from inception to June 25, 2023, using the keywords 'autoML' AND 'ophthalmology'. After identifying 5 CFDL studies looking at our target tasks, we performed a subsequent search to find corresponding bespoke DL studies focused on the same tasks. Only English-written articles with full text available were included. Reviews, editorials, protocols and case reports or case series were excluded. We identified ten relevant studies for this review. RESULTS Overall, studies were optimistic towards CFDL's advantages over bespoke DL in the five ophthalmological tasks. However, much of such discussions were identified to be mono-dimensional and had wide applicability gaps. High-quality assessment of better CFDL applicability over bespoke DL warrants a context-specific, weighted assessment of clinician intent, patient acceptance and cost-effectiveness. We conclude that CFDL and bespoke DL are unique in their own assets and are irreplaceable with each other. Their benefits are differentially valued on a case-to-case basis. Future studies are warranted to perform a multidimensional analysis of both techniques and to improve limitations of suboptimal dataset quality, poor applicability implications and non-regulated study designs. CONCLUSION For clinicians without DL expertise and easy access to AI experts, CFDL allows the prototyping of novel clinical AI systems. CFDL models concert with bespoke models, depending on the task at hand. A multidimensional, weighted evaluation of the factors involved in the implementation of those models for a designated task is warranted.
Collapse
Affiliation(s)
- Carolyn Yu Tung Wong
- Institute of Ophthalmology, University College London, 11-43 Bath St, London, EC1V 9EL, UK
- Moorfields Eye Hospital NHS Foundation Trust, London, UK
- Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong SAR, China
| | - Ciara O'Byrne
- Institute of Ophthalmology, University College London, 11-43 Bath St, London, EC1V 9EL, UK
- Moorfields Eye Hospital NHS Foundation Trust, London, UK
| | - Priyal Taribagil
- Institute of Ophthalmology, University College London, 11-43 Bath St, London, EC1V 9EL, UK
- Moorfields Eye Hospital NHS Foundation Trust, London, UK
| | - Timing Liu
- Institute of Ophthalmology, University College London, 11-43 Bath St, London, EC1V 9EL, UK
- Moorfields Eye Hospital NHS Foundation Trust, London, UK
| | - Fares Antaki
- Institute of Ophthalmology, University College London, 11-43 Bath St, London, EC1V 9EL, UK
- Moorfields Eye Hospital NHS Foundation Trust, London, UK
- The CHUM School of Artificial Intelligence in Healthcare, Montreal, QC, Canada
| | - Pearse Andrew Keane
- Institute of Ophthalmology, University College London, 11-43 Bath St, London, EC1V 9EL, UK.
- Moorfields Eye Hospital NHS Foundation Trust, London, UK.
- NIHR Moorfields Biomedical Research Centre, London, UK.
| |
Collapse
|
37
|
Stahl D. New horizons in prediction modelling using machine learning in older people's healthcare research. Age Ageing 2024; 53:afae201. [PMID: 39311424 PMCID: PMC11417961 DOI: 10.1093/ageing/afae201] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Revised: 06/26/2024] [Indexed: 09/26/2024] Open
Abstract
Machine learning (ML) and prediction modelling have become increasingly influential in healthcare, providing critical insights and supporting clinical decisions, particularly in the age of big data. This paper serves as an introductory guide for health researchers and readers interested in prediction modelling and explores how these technologies support clinical decisions, particularly with big data, and covers all aspects of the development, assessment and reporting of a model using ML. The paper starts with the importance of prediction modelling for precision medicine. It outlines different types of prediction and machine learning approaches, including supervised, unsupervised and semi-supervised learning, and provides an overview of popular algorithms for various outcomes and settings. It also introduces key theoretical ML concepts. The importance of data quality, preprocessing and unbiased model performance evaluation is highlighted. Concepts of apparent, internal and external validation will be introduced along with metrics for discrimination and calibration for different types of outcomes. Additionally, the paper addresses model interpretation, fairness and implementation in clinical practice. Finally, the paper provides recommendations for reporting and identifies common pitfalls in prediction modelling and machine learning. The aim of the paper is to help readers understand and critically evaluate research papers that present ML models and to serve as a first guide for developing, assessing and implementing their own.
Collapse
Affiliation(s)
- Daniel Stahl
- Department of Biostatistics and Health Informatics, Institute of Psychiatry, Psychology & Neuroscience, King’s College London, London, UK
| |
Collapse
|
38
|
Chen Z, Liang N, Li H, Zhang H, Li H, Yan L, Hu Z, Chen Y, Zhang Y, Wang Y, Ke D, Shi N. Exploring explainable AI features in the vocal biomarkers of lung disease. Comput Biol Med 2024; 179:108844. [PMID: 38981214 DOI: 10.1016/j.compbiomed.2024.108844] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2024] [Revised: 05/15/2024] [Accepted: 06/04/2024] [Indexed: 07/11/2024]
Abstract
This review delves into the burgeoning field of explainable artificial intelligence (XAI) in the detection and analysis of lung diseases through vocal biomarkers. Lung diseases, often elusive in their early stages, pose a significant public health challenge. Recent advancements in AI have ushered in innovative methods for early detection, yet the black-box nature of many AI models limits their clinical applicability. XAI emerges as a pivotal tool, enhancing transparency and interpretability in AI-driven diagnostics. This review synthesizes current research on the application of XAI in analyzing vocal biomarkers for lung diseases, highlighting how these techniques elucidate the connections between specific vocal features and lung pathology. We critically examine the methodologies employed, the types of lung diseases studied, and the performance of various XAI models. The potential for XAI to aid in early detection, monitor disease progression, and personalize treatment strategies in pulmonary medicine is emphasized. Furthermore, this review identifies current challenges, including data heterogeneity and model generalizability, and proposes future directions for research. By offering a comprehensive analysis of explainable AI features in the context of lung disease detection, this review aims to bridge the gap between advanced computational approaches and clinical practice, paving the way for more transparent, reliable, and effective diagnostic tools.
Collapse
Affiliation(s)
- Zhao Chen
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Ning Liang
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Haoyuan Li
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Haili Zhang
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Huizhen Li
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Lijiao Yan
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Ziteng Hu
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Yaxin Chen
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Yujing Zhang
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Yanping Wang
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Dandan Ke
- Special Disease Clinic, Huaishuling Branch of Beijing Fengtai Hospital of Integrated Traditional Chinese and Western Medicine, Beijing, China.
| | - Nannan Shi
- State Key Laboratory of Cognitive Neuroscience and Learning, Beijing Normal University, Beijing, China; Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China.
| |
Collapse
|
39
|
Kapral L, Dibiasi C, Jeremic N, Bartos S, Behrens S, Bilir A, Heitzinger C, Kimberger O. Development and external validation of temporal fusion transformer models for continuous intraoperative blood pressure forecasting. EClinicalMedicine 2024; 75:102797. [PMID: 39281101 PMCID: PMC11402414 DOI: 10.1016/j.eclinm.2024.102797] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/22/2024] [Revised: 08/02/2024] [Accepted: 08/06/2024] [Indexed: 09/18/2024] Open
Abstract
Background During surgery, intraoperative hypotension is associated with postoperative morbidity and should therefore be avoided. Predicting the occurrence of hypotension in advance may allow timely interventions to prevent hypotension. Previous prediction models mostly use high-resolution waveform data, which is often not available. Methods We utilised a novel temporal fusion transformer (TFT) algorithm to predict intraoperative blood pressure trajectories 7 min in advance. We trained the model with low-resolution data (sampled every 15 s) from 73,009 patients who were undergoing general anaesthesia for non-cardiothoracic surgery between January 1, 2017, and December 30, 2020, at the General Hospital of Vienna, Austria. The data set contained information on patient demographics, vital signs, medication, and ventilation. The model was evaluated using an internal (n = 8113) and external test set (n = 5065) obtained from the openly accessible Vital Signs Database. Findings In the internal test set, the mean absolute error for predicting mean arterial blood pressure was 0.376 standard deviations-or 4 mmHg-and 0.622 standard deviations-or 7 mmHg-in the external test set. We also adapted the TFT model to binarily predict the occurrence of hypotension as defined by mean arterial blood pressure < 65 mmHg in the next one, three, five, and 7 min. Here, model discrimination was excellent, with a mean area under the receiver operating characteristic curve (AUROC) of 0.933 in the internal test set and 0.919 in the external test set. Interpretation Our TFT model is capable of accurately forecasting intraoperative arterial blood pressure using only low-resolution data showing a low prediction error. When used for binary prediction of hypotension, we obtained excellent performance. Funding No external funding.
Collapse
Affiliation(s)
- Lorenz Kapral
- Medical University of Vienna, Department of Anaesthesia, Intensive Care Medicine and Pain Medicine, Währinger Gürtel 18-20, Vienna 1090, Austria
- Ludwig Boltzmann Institute Digital Health and Patient Safety, Währinger Straße. 104/10, Vienna, 1180 Wien, Austria
- Technical University Vienna, Department of Informatics, Research Unit Machine Learning, Favoritenstraße 9/11, Vienna 1040 Wien, Austria
| | - Christoph Dibiasi
- Medical University of Vienna, Department of Anaesthesia, Intensive Care Medicine and Pain Medicine, Währinger Gürtel 18-20, Vienna 1090, Austria
- Ludwig Boltzmann Institute Digital Health and Patient Safety, Währinger Straße. 104/10, Vienna, 1180 Wien, Austria
| | - Natasa Jeremic
- Medical University of Vienna, Department of Ophthalmology and Optometry, Währinger Gürtel 18-20, Vienna 1090 Wien, Austria
| | - Stefan Bartos
- Medical University of Vienna, Department of Anaesthesia, Intensive Care Medicine and Pain Medicine, Währinger Gürtel 18-20, Vienna 1090, Austria
- Ludwig Boltzmann Institute Digital Health and Patient Safety, Währinger Straße. 104/10, Vienna, 1180 Wien, Austria
| | - Sybille Behrens
- Medical University of Vienna, Department of Anaesthesia, Intensive Care Medicine and Pain Medicine, Währinger Gürtel 18-20, Vienna 1090, Austria
- Ludwig Boltzmann Institute Digital Health and Patient Safety, Währinger Straße. 104/10, Vienna, 1180 Wien, Austria
| | - Aylin Bilir
- Medical University of Vienna, Department of Anaesthesia, Intensive Care Medicine and Pain Medicine, Währinger Gürtel 18-20, Vienna 1090, Austria
- Ludwig Boltzmann Institute Digital Health and Patient Safety, Währinger Straße. 104/10, Vienna, 1180 Wien, Austria
| | - Clemens Heitzinger
- Technical University Vienna, Department of Informatics, Research Unit Machine Learning, Favoritenstraße 9/11, Vienna 1040 Wien, Austria
| | - Oliver Kimberger
- Medical University of Vienna, Department of Anaesthesia, Intensive Care Medicine and Pain Medicine, Währinger Gürtel 18-20, Vienna 1090, Austria
- Ludwig Boltzmann Institute Digital Health and Patient Safety, Währinger Straße. 104/10, Vienna, 1180 Wien, Austria
| |
Collapse
|
40
|
Nasir Y, Kadian K, Sharma A, Dwivedi V. Interpretable machine learning for dermatological disease detection: Bridging the gap between accuracy and explainability. Comput Biol Med 2024; 179:108919. [PMID: 39047502 DOI: 10.1016/j.compbiomed.2024.108919] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2024] [Revised: 07/15/2024] [Accepted: 07/15/2024] [Indexed: 07/27/2024]
Abstract
Research on disease detection by leveraging machine learning techniques has been under significant focus. The use of machine learning techniques is important to detect critical diseases promptly and provide the appropriate treatment. Disease detection is a vital and sensitive task and while machine learning models may provide a robust solution, they can come across as complex and unintuitive. Therefore, it is important to gauge a better understanding of the predictions and trust the results. This paper takes up the crucial task of skin disease detection and introduces a hybrid machine learning model combining SVM and XGBoost for the detection task. The proposed model outperformed the existing machine learning models - Support Vector Machine (SVM), decision tree, and XGBoost with an accuracy of 99.26%. The increased accuracy is essential for detecting skin disease due to the similarity in the symptoms which make it challenging to differentiate between the different conditions. In order to foster trust and gain insights into the results we turn to the promising field of Explainable Artificial Intelligence (XAI). We explore two such frameworks for local as well as global explanations for these machine learning models namely, SHapley Additive exPlanations (SHAP) and Local Interpretable Model-agnostic Explanations (LIME).
Collapse
Affiliation(s)
- Yusra Nasir
- CSE, Indira Gandhi Delhi Technical University for Women, Kashmere Gate, New Delhi, 110006, Delhi, India.
| | - Karuna Kadian
- CSE, Indira Gandhi Delhi Technical University for Women, Kashmere Gate, New Delhi, 110006, Delhi, India.
| | - Arun Sharma
- IT, Indira Gandhi Delhi Technical University for Women, Kashmere Gate, New Delhi, 110006, Delhi, India.
| | - Vimal Dwivedi
- School of Computing, Engineering & Intelligent Systems, Ulster University, Londonderry, Northern Ireland, United Kingdom.
| |
Collapse
|
41
|
Lu W, Zhao L, Wang S, Zhang H, Jiang K, Ji J, Chen S, Wang C, Wei C, Zhou R, Wang Z, Li X, Wang F, Wei X, Hou W. Explainable and visualizable machine learning models to predict biochemical recurrence of prostate cancer. Clin Transl Oncol 2024; 26:2369-2379. [PMID: 38602643 DOI: 10.1007/s12094-024-03480-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2024] [Accepted: 03/23/2024] [Indexed: 04/12/2024]
Abstract
PURPOSE Machine learning (ML) models presented an excellent performance in the prognosis prediction. However, the black box characteristic of ML models limited the clinical applications. Here, we aimed to establish explainable and visualizable ML models to predict biochemical recurrence (BCR) of prostate cancer (PCa). MATERIALS AND METHODS A total of 647 PCa patients were retrospectively evaluated. Clinical parameters were identified using LASSO regression. Then, cohort was split into training and validation datasets with a ratio of 0.75:0.25 and BCR-related features were included in Cox regression and five ML algorithm to construct BCR prediction models. The clinical utility of each model was evaluated by concordance index (C-index) values and decision curve analyses (DCA). Besides, Shapley Additive Explanation (SHAP) values were used to explain the features in the models. RESULTS We identified 11 BCR-related features using LASSO regression, then establishing five ML-based models, including random survival forest (RSF), survival support vector machine (SSVM), survival Tree (sTree), gradient boosting decision tree (GBDT), extreme gradient boosting (XGBoost), and a Cox regression model, C-index were 0.846 (95%CI 0.796-0.894), 0.774 (95%CI 0.712-0.834), 0.757 (95%CI 0.694-0.818), 0.820 (95%CI 0.765-0.869), 0.793 (95%CI 0.735-0.852), and 0.807 (95%CI 0.753-0.858), respectively. The DCA showed that RSF model had significant advantages over all models. In interpretability of ML models, the SHAP value demonstrated the tangible contribution of each feature in RSF model. CONCLUSIONS Our score system provide reference for the identification for BCR, and the crafting of a framework for making therapeutic decisions for PCa on a personalized basis.
Collapse
Affiliation(s)
- Wenhao Lu
- Collaborative Innovation Centre of Regenerative Medicine and Medical BioResource Development and Application Co-Constructed By the Province and Ministry, Guangxi Medical University, No. 22, Shuangyong Road, Qingxiu District, Nanning City, 530021, Guangxi Zhuang Autonomous Region, People's Republic of China
- Center for Genomic and Personalized Medicine, Guangxi Key Laboratory for Genomic and Personalized Medicine, Guangxi Collaborative Innovation Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, 530021, Guangxi, People's Republic of China
- Department of Urology, the First Affiliated Hospital of Guangxi Medical University, Guangxi Medical University, Guangxi, 530021, People's Republic of China
- School of Life Sciences, Guangxi Medical University, Nanning, 530021, Guangxi, People's Republic of China
| | - Lin Zhao
- Department of Urology, Shanghai Changhai Hospital, Second Military Medical University, Shanghai, 200433, People's Republic of China
| | - Shenfan Wang
- Department of Urology, Shanghai Changhai Hospital, Second Military Medical University, Shanghai, 200433, People's Republic of China
| | - Huiyong Zhang
- Center for Genomic and Personalized Medicine, Guangxi Key Laboratory for Genomic and Personalized Medicine, Guangxi Collaborative Innovation Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, 530021, Guangxi, People's Republic of China
- School of Life Sciences, Guangxi Medical University, Nanning, 530021, Guangxi, People's Republic of China
| | - Kangxian Jiang
- Department of Urology, The Second Affiliated Hospital of Fujian Medical University, Quanzhou, 362000, People's Republic of China
| | - Jin Ji
- Department of Urology, Shanghai Changhai Hospital, Second Military Medical University, Shanghai, 200433, People's Republic of China
| | - Shaohua Chen
- Center for Genomic and Personalized Medicine, Guangxi Key Laboratory for Genomic and Personalized Medicine, Guangxi Collaborative Innovation Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, 530021, Guangxi, People's Republic of China
- Department of Urology, the First Affiliated Hospital of Guangxi Medical University, Guangxi Medical University, Guangxi, 530021, People's Republic of China
| | - Chengbang Wang
- Center for Genomic and Personalized Medicine, Guangxi Key Laboratory for Genomic and Personalized Medicine, Guangxi Collaborative Innovation Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, 530021, Guangxi, People's Republic of China
| | - Chunmeng Wei
- Center for Genomic and Personalized Medicine, Guangxi Key Laboratory for Genomic and Personalized Medicine, Guangxi Collaborative Innovation Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, 530021, Guangxi, People's Republic of China
- Department of Urology, the First Affiliated Hospital of Guangxi Medical University, Guangxi Medical University, Guangxi, 530021, People's Republic of China
- School of Life Sciences, Guangxi Medical University, Nanning, 530021, Guangxi, People's Republic of China
| | - Rongbin Zhou
- Collaborative Innovation Centre of Regenerative Medicine and Medical BioResource Development and Application Co-Constructed By the Province and Ministry, Guangxi Medical University, No. 22, Shuangyong Road, Qingxiu District, Nanning City, 530021, Guangxi Zhuang Autonomous Region, People's Republic of China
- Center for Genomic and Personalized Medicine, Guangxi Key Laboratory for Genomic and Personalized Medicine, Guangxi Collaborative Innovation Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, 530021, Guangxi, People's Republic of China
- School of Life Sciences, Guangxi Medical University, Nanning, 530021, Guangxi, People's Republic of China
| | - Zuheng Wang
- Center for Genomic and Personalized Medicine, Guangxi Key Laboratory for Genomic and Personalized Medicine, Guangxi Collaborative Innovation Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, 530021, Guangxi, People's Republic of China
- Department of Urology, the First Affiliated Hospital of Guangxi Medical University, Guangxi Medical University, Guangxi, 530021, People's Republic of China
| | - Xiao Li
- Collaborative Innovation Centre of Regenerative Medicine and Medical BioResource Development and Application Co-Constructed By the Province and Ministry, Guangxi Medical University, No. 22, Shuangyong Road, Qingxiu District, Nanning City, 530021, Guangxi Zhuang Autonomous Region, People's Republic of China
- Center for Genomic and Personalized Medicine, Guangxi Key Laboratory for Genomic and Personalized Medicine, Guangxi Collaborative Innovation Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, 530021, Guangxi, People's Republic of China
- School of Life Sciences, Guangxi Medical University, Nanning, 530021, Guangxi, People's Republic of China
| | - Fubo Wang
- Collaborative Innovation Centre of Regenerative Medicine and Medical BioResource Development and Application Co-Constructed By the Province and Ministry, Guangxi Medical University, No. 22, Shuangyong Road, Qingxiu District, Nanning City, 530021, Guangxi Zhuang Autonomous Region, People's Republic of China.
- Center for Genomic and Personalized Medicine, Guangxi Key Laboratory for Genomic and Personalized Medicine, Guangxi Collaborative Innovation Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, 530021, Guangxi, People's Republic of China.
- Department of Urology, the First Affiliated Hospital of Guangxi Medical University, Guangxi Medical University, Guangxi, 530021, People's Republic of China.
- School of Life Sciences, Guangxi Medical University, Nanning, 530021, Guangxi, People's Republic of China.
| | - Xuedong Wei
- Department of Urology, the First Affiliated Hospital of Soochow University, Suzhou, 210000, Jiangsu, People's Republic of China.
| | - Wenlei Hou
- Information Technology School of Guangxi Police College, Nanning, 530021, Guangxi, People's Republic of China.
| |
Collapse
|
42
|
Jiang H, Ye LS, Yuan XL, Luo Q, Zhou NY, Hu B. Artificial intelligence in pancreaticobiliary endoscopy: Current applications and future directions. J Dig Dis 2024; 25:564-572. [PMID: 39740251 DOI: 10.1111/1751-2980.13324] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/15/2024] [Revised: 11/13/2024] [Accepted: 12/03/2024] [Indexed: 01/02/2025]
Abstract
Pancreaticobiliary endoscopy is an essential tool for diagnosing and treating pancreaticobiliary diseases. However, it does not fully meet clinical needs, which presents challenges such as significant difficulty in operation and risks of missed diagnosis or misdiagnosis. In recent years, artificial intelligence (AI) has enhanced the diagnostic and treatment efficiency and quality of pancreaticobiliary endoscopy. Diagnosis and differential diagnosis based on endoscopic ultrasound (EUS) images, pathology of EUS-guided fine-needle aspiration or biopsy, need for endoscopic retrograde cholangiopancreatography (ERCP) and assessment of operational difficulty, postoperative complications and prediction of patient prognosis, and real-time procedure guidance. This review provides an overview of AI applications in pancreaticobiliary endoscopy and proposes future development directions in aspects such as data quality and algorithmic interpretability, aiming to provide new insights for the integration of AI technology with pancreaticobiliary endoscopy.
Collapse
Affiliation(s)
- Huan Jiang
- Department of Gastroenterology and Hepatology, West China Hospital, Sichuan University, Chengdu, Sichuan Province, China
- Digestive Endoscopy Medical Engineering Research Laboratory, West China Hospital, Sichuan University, Chengdu, Sichuan Province, China
| | - Lian Song Ye
- Department of Gastroenterology and Hepatology, West China Hospital, Sichuan University, Chengdu, Sichuan Province, China
- Digestive Endoscopy Medical Engineering Research Laboratory, West China Hospital, Sichuan University, Chengdu, Sichuan Province, China
| | - Xiang Lei Yuan
- Department of Gastroenterology and Hepatology, West China Hospital, Sichuan University, Chengdu, Sichuan Province, China
- Digestive Endoscopy Medical Engineering Research Laboratory, West China Hospital, Sichuan University, Chengdu, Sichuan Province, China
| | - Qi Luo
- Department of Gastroenterology and Hepatology, West China Hospital, Sichuan University, Chengdu, Sichuan Province, China
- Digestive Endoscopy Medical Engineering Research Laboratory, West China Hospital, Sichuan University, Chengdu, Sichuan Province, China
| | - Nuo Ya Zhou
- Department of Gastroenterology and Hepatology, West China Hospital, Sichuan University, Chengdu, Sichuan Province, China
- Digestive Endoscopy Medical Engineering Research Laboratory, West China Hospital, Sichuan University, Chengdu, Sichuan Province, China
| | - Bing Hu
- Department of Gastroenterology and Hepatology, West China Hospital, Sichuan University, Chengdu, Sichuan Province, China
- Med-X Center for Materials, Sichuan University, Chengdu, Sichuan Province, China
| |
Collapse
|
43
|
Yoo JW, Park J, Park H. The impact of AI-enabled CRM systems on organizational competitive advantage: A mixed-method approach using BERTopic and PLS-SEM. Heliyon 2024; 10:e36392. [PMID: 39253149 PMCID: PMC11382090 DOI: 10.1016/j.heliyon.2024.e36392] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2023] [Revised: 08/11/2024] [Accepted: 08/14/2024] [Indexed: 09/11/2024] Open
Abstract
The recent advances in machine learning and deep learning algorithms, along with the advent of generative AI, have led AI to become the "new normal" in organizations. This trend has extended to CRM, resulting in the development of AI-enabled CRM systems, or AI-CRM. Despite the growing adoption of AI as part of competitive strategies, many firms report minimal or no positive effect of AI on performance. This study addresses the research questions: "What are the critical features of AI-CRM systems?" and "How do these features impact organizational competitive advantage?" To explore this, we aim to identify key characteristics of AI-CRM and assess their impact on organizational performance. In Study 1, we utilize BERTopic topic modeling to extract critical features of AI-CRM from user reviews. Study 2 employs PLS-SEM to examine how these features influence organizational competitive advantage. Study 1 reveals four main characteristics of AI-CRM (general, marketing, sales, and service/support), each comprising distinct features. Study 2 shows that these characteristics differentially impact CRM capability, significantly affecting performance and competitive advantage. The findings offer valuable insights for both theory and practice regarding the effective use of AI in organizations.
Collapse
Affiliation(s)
- Joon Woo Yoo
- Dept. of Industrial Engineering, Yonsei University, Seoul, South Korea
| | - Junsung Park
- Dept. of Industrial Engineering, Yonsei University, Seoul, South Korea
| | - Heejun Park
- Dept. of Industrial Engineering, Yonsei University, Seoul, South Korea
| |
Collapse
|
44
|
Askar M, Tafavvoghi M, Småbrekke L, Bongo LA, Svendsen K. Using machine learning methods to predict all-cause somatic hospitalizations in adults: A systematic review. PLoS One 2024; 19:e0309175. [PMID: 39178283 PMCID: PMC11343463 DOI: 10.1371/journal.pone.0309175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Accepted: 08/06/2024] [Indexed: 08/25/2024] Open
Abstract
AIM In this review, we investigated how Machine Learning (ML) was utilized to predict all-cause somatic hospital admissions and readmissions in adults. METHODS We searched eight databases (PubMed, Embase, Web of Science, CINAHL, ProQuest, OpenGrey, WorldCat, and MedNar) from their inception date to October 2023, and included records that predicted all-cause somatic hospital admissions and readmissions of adults using ML methodology. We used the CHARMS checklist for data extraction, PROBAST for bias and applicability assessment, and TRIPOD for reporting quality. RESULTS We screened 7,543 studies of which 163 full-text records were read and 116 met the review inclusion criteria. Among these, 45 predicted admission, 70 predicted readmission, and one study predicted both. There was a substantial variety in the types of datasets, algorithms, features, data preprocessing steps, evaluation, and validation methods. The most used types of features were demographics, diagnoses, vital signs, and laboratory tests. Area Under the ROC curve (AUC) was the most used evaluation metric. Models trained using boosting tree-based algorithms often performed better compared to others. ML algorithms commonly outperformed traditional regression techniques. Sixteen studies used Natural language processing (NLP) of clinical notes for prediction, all studies yielded good results. The overall adherence to reporting quality was poor in the review studies. Only five percent of models were implemented in clinical practice. The most frequently inadequately addressed methodological aspects were: providing model interpretations on the individual patient level, full code availability, performing external validation, calibrating models, and handling class imbalance. CONCLUSION This review has identified considerable concerns regarding methodological issues and reporting quality in studies investigating ML to predict hospitalizations. To ensure the acceptability of these models in clinical settings, it is crucial to improve the quality of future studies.
Collapse
Affiliation(s)
- Mohsen Askar
- Faculty of Health Sciences, Department of Pharmacy, UiT-The Arctic University of Norway, Tromsø, Norway
| | - Masoud Tafavvoghi
- Faculty of Science and Technology, Department of Computer Science, UiT-The Arctic University of Norway, Tromsø, Norway
| | - Lars Småbrekke
- Faculty of Health Sciences, Department of Pharmacy, UiT-The Arctic University of Norway, Tromsø, Norway
| | - Lars Ailo Bongo
- Faculty of Science and Technology, Department of Computer Science, UiT-The Arctic University of Norway, Tromsø, Norway
| | - Kristian Svendsen
- Faculty of Health Sciences, Department of Pharmacy, UiT-The Arctic University of Norway, Tromsø, Norway
| |
Collapse
|
45
|
Liang C, Sun Q, Li J, Ji B, Wu W, Zhang F, Chen Y, Wang C. An interpretable ensemble trees method with joint analysis of static and dynamic features for myocardial infarction detection. Physiol Meas 2024; 45:085006. [PMID: 39025104 DOI: 10.1088/1361-6579/ad6529] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2024] [Accepted: 07/18/2024] [Indexed: 07/20/2024]
Abstract
Objective.In recent years, artificial intelligence-based electrocardiogram (ECG) methods have been massively applied to myocardial infarction (MI). However, the joint analysis of static and dynamic features to achieve accurate and interpretable MI detection has not been comprehensively addressed.Approach.This paper proposes a simplified ensemble tree method with a joint analysis of static and dynamic features to solve this issue for MI detection. Initially, the dynamic features are extracted by modeling the intrinsic dynamics of ECG via dynamic learning in addition to extracting classical static features. Secondly, a two-stage feature selection strategy is designed to identify a few significant features, which substitute the original variables that are employed in constructing the ensemble tree. This approach enhances the discriminative ability by selecting significant static and dynamic features. Subsequently, this paper presents an interpretable classification method named StackTree by introducing a stacked ensemble scheme to modify the ensemble tree simplification algorithm. The representative rules of the raw ensemble trees are selected as the intermediate training data that is used to retrain a decision tree with performance close to that of the source ensemble model. Using this scheme, the significant precision and interpretability of MI detection are thus comprehensively addressed.Main results.The effectiveness of our method in detecting MI is evaluated using the Physikalisch-Technische Bundesanstalt (PTB) and clinical database. The findings suggest that our algorithm outperforms the traditional methods based on a single type of feature. Additionally, it is comparable to the conventional random forest, achieving 97.1% accuracy under the inter-patient framework on the PTB database. Furthermore, feature subsets trained on PTB are validated using the clinical database, resulting in an accuracy of 84.5%. The chosen important features demonstrate that both static and dynamic information have crucial roles in MI detection. Crucially, the proposed method provides clear internal workings in an easy-to-understand visual manner.
Collapse
Affiliation(s)
- Chunmiao Liang
- School of Control Science and Engineering, Shandong University, Jinan 250061, People's Republic of China
| | - Qinghua Sun
- School of Control Science and Engineering, Shandong University, Jinan 250061, People's Republic of China
- Center for Intelligent Medical Engineering, Shandong University, Jinan 250061, People's Republic of China
| | - Jiali Li
- School of Control Science and Engineering, Shandong University, Jinan 250061, People's Republic of China
| | - Bing Ji
- School of Control Science and Engineering, Shandong University, Jinan 250061, People's Republic of China
- Center for Intelligent Medical Engineering, Shandong University, Jinan 250061, People's Republic of China
| | - Weiming Wu
- School of Control Science and Engineering, Shandong University, Jinan 250061, People's Republic of China
- Center for Intelligent Medical Engineering, Shandong University, Jinan 250061, People's Republic of China
| | - Fukai Zhang
- School of Control Science and Engineering, Shandong University, Jinan 250061, People's Republic of China
- Center for Intelligent Medical Engineering, Shandong University, Jinan 250061, People's Republic of China
| | - Yuguo Chen
- Department of Emergency, Qilu Hospital of Shandong University, Jinan 250012, People's Republic of China
| | - Cong Wang
- School of Control Science and Engineering, Shandong University, Jinan 250061, People's Republic of China
- Center for Intelligent Medical Engineering, Shandong University, Jinan 250061, People's Republic of China
| |
Collapse
|
46
|
Zhang R, Zhu H, Chen M, Sang W, Lu K, Li Z, Wang C, Zhang L, Yin FF, Yang Z. A dual-radiomics model for overall survival prediction in early-stage NSCLC patient using pre-treatment CT images. Front Oncol 2024; 14:1419621. [PMID: 39206157 PMCID: PMC11349529 DOI: 10.3389/fonc.2024.1419621] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2024] [Accepted: 07/26/2024] [Indexed: 09/04/2024] Open
Abstract
Introduction Radiation therapy (RT) is one of the primary treatment options for early-stage non-small cell lung cancer (ES-NSCLC). Therefore, accurately predicting the overall survival (OS) rate following radiotherapy is crucial for implementing personalized treatment strategies. This work aims to develop a dual-radiomics (DR) model to (1) predict 3-year OS in ES-NSCLC patients receiving RT using pre-treatment CT images, and (2) provide explanations between feature importanceand model prediction performance. Methods The publicly available TCIA Lung1 dataset with 132 ES-NSCLC patients received RT were studied: 89/43 patients in the under/over 3-year OS group. For each patient, two types of radiomic features were examined: 56 handcrafted radiomic features (HRFs) extracted within gross tumor volume, and 512 image deep features (IDFs) extracted using a pre-trained U-Net encoder. They were combined as inputs to an explainable boosting machine (EBM) model for OS prediction. The EBM's mean absolute scores for HRFs and IDFs were used as feature importance explanations. To evaluate identified feature importance, the DR model was compared with EBM using either (1) key or (2) non-key feature type only. Comparison studies with other models, including supporting vector machine (SVM) and random forest (RF), were also included. The performance was evaluated by the area under the receiver operating characteristic curve (AUCROC), accuracy, sensitivity, and specificity with a 100-fold Monte Carlo cross-validation. Results The DR model showed highestperformance in predicting 3-year OS (AUCROC=0.81 ± 0.04), and EBM scores suggested that IDFs showed significantly greater importance (normalized mean score=0.0019) than HRFs (score=0.0008). The comparison studies showed that EBM with key feature type (IDFs-only demonstrated comparable AUCROC results (0.81 ± 0.04), while EBM with non-key feature type (HRFs-only) showed limited AUCROC (0.64 ± 0.10). The results suggested that feature importance score identified by EBM is highly correlated with OS prediction performance. Both SVM and RF models were unable to explain key feature type while showing limited overall AUCROC=0.66 ± 0.07 and 0.77 ± 0.06, respectively. Accuracy, sensitivity, and specificity showed a similar trend. Discussion In conclusion, a DR model was successfully developed to predict ES-NSCLC OS based on pre-treatment CT images. The results suggested that the feature importance from DR model is highly correlated to the model prediction power.
Collapse
Affiliation(s)
- Rihui Zhang
- Medical Physics Graduate Program, Duke Kunshan University, Kunshan, Jiangsu, China
| | - Haiming Zhu
- Medical Physics Graduate Program, Duke Kunshan University, Kunshan, Jiangsu, China
| | - Minbin Chen
- Department of Radiotherapy & Oncology, The First People’s Hospital of Kunshan, Kunshan, Jiangsu, China
| | - Weiwei Sang
- Medical Physics Graduate Program, Duke Kunshan University, Kunshan, Jiangsu, China
| | - Ke Lu
- Deparment of Radiation Oncology, Duke University, Durham, NC, United States
| | - Zhen Li
- Radiation Oncology Department, Shanghai Sixth People’s Hospital, Shanghai, China
| | - Chunhao Wang
- Deparment of Radiation Oncology, Duke University, Durham, NC, United States
| | - Lei Zhang
- Medical Physics Graduate Program, Duke Kunshan University, Kunshan, Jiangsu, China
| | - Fang-Fang Yin
- Medical Physics Graduate Program, Duke Kunshan University, Kunshan, Jiangsu, China
| | - Zhenyu Yang
- Medical Physics Graduate Program, Duke Kunshan University, Kunshan, Jiangsu, China
| |
Collapse
|
47
|
Chen X, Liu M, Wang Z, Wang Y. Explainable Deep Learning-Based Feature Selection and Intrusion Detection Method on the Internet of Things. SENSORS (BASEL, SWITZERLAND) 2024; 24:5223. [PMID: 39204919 PMCID: PMC11359160 DOI: 10.3390/s24165223] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/06/2024] [Revised: 08/02/2024] [Accepted: 08/08/2024] [Indexed: 09/04/2024]
Abstract
With the rapid advancement of the Internet of Things, network security has garnered increasing attention from researchers. Applying deep learning (DL) has significantly enhanced the performance of Network Intrusion Detection Systems (NIDSs). However, due to its complexity and "black box" problem, deploying DL-based NIDS models in practical scenarios poses several challenges, including model interpretability and being lightweight. Feature selection (FS) in DL models plays a crucial role in minimizing model parameters and decreasing computational overheads while enhancing NIDS performance. Hence, selecting effective features remains a pivotal concern for NIDSs. In light of this, this paper proposes an interpretable feature selection method for encrypted traffic intrusion detection based on SHAP and causality principles. This approach utilizes the results of model interpretation for feature selection to reduce feature count while ensuring model reliability. We evaluate and validate our proposed method on two public network traffic datasets, CICIDS2017 and NSL-KDD, employing both a CNN and a random forest (RF). Experimental results demonstrate superior performance achieved by our proposed method.
Collapse
Affiliation(s)
- Xuejiao Chen
- School of Communications, Nanjing Vocational College of Information Technology, Nanjing 210023, China
| | - Minyao Liu
- School of Modern Posts, Nanjing University of Posts & Telecommunications, Nanjing 210003, China; (M.L.); (Z.W.); (Y.W.)
| | - Zixuan Wang
- School of Modern Posts, Nanjing University of Posts & Telecommunications, Nanjing 210003, China; (M.L.); (Z.W.); (Y.W.)
| | - Yun Wang
- School of Modern Posts, Nanjing University of Posts & Telecommunications, Nanjing 210003, China; (M.L.); (Z.W.); (Y.W.)
| |
Collapse
|
48
|
Khan S, Bhushan B. Machine Learning Predicts Patients With New-onset Diabetes at Risk of Pancreatic Cancer. J Clin Gastroenterol 2024; 58:681-691. [PMID: 37522752 DOI: 10.1097/mcg.0000000000001897] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Accepted: 06/22/2023] [Indexed: 08/01/2023]
Abstract
BACKGROUND New-onset diabetes represent a high-risk cohort to screen for pancreatic cancer. GOALS Develop a machine model to predict pancreatic cancer among patients with new-onset diabetes. STUDY A retrospective cohort of patients with new-onset diabetes was assembled from multiple health care networks in the United States. An XGBoost machine learning model was designed from a portion of this cohort (the training set) and tested on the remaining part of the cohort (the test set). Shapley values were used to explain the XGBoost's model features. Model performance was compared with 2 contemporary models designed to predict pancreatic cancer among patients with new-onset diabetes. RESULTS In the test set, the XGBoost model had an area under the curve of 0.80 (0.76 to 0.85) compared with 0.63 and 0.68 for other models. Using cutoffs based on the Youden index, the sensitivity of the XGBoost model was 75%, the specificity was 70%, the accuracy was 70%, the positive predictive value was 1.2%, and the negative predictive value was >99%. The XGBoost model obtained a positive predictive value of at least 2.5% with a sensitivity of 38%. The XGBoost model was the only model that detected at least 50% of patients with cancer one year after the onset of diabetes. All 3 models had similar features that predicted pancreatic cancer, including older age, weight loss, and the rapid destabilization of glucose homeostasis. CONCLUSION Machine learning models isolate a high-risk cohort from those with new-onset diabetes at risk for pancreatic cancer.
Collapse
Affiliation(s)
- Salman Khan
- Department of Medicine, West Virginia University School of Medicine, West Virginia University, Morgantown, WV
- Northeast Ohio Medical University, Rootstown, OH
| | - Bharath Bhushan
- Department of Medicine, West Virginia University School of Medicine, West Virginia University, Morgantown, WV
| |
Collapse
|
49
|
Chang TG, Cao Y, Sfreddo HJ, Dhruba SR, Lee SH, Valero C, Yoo SK, Chowell D, Morris LGT, Ruppin E. LORIS robustly predicts patient outcomes with immune checkpoint blockade therapy using common clinical, pathologic and genomic features. NATURE CANCER 2024; 5:1158-1175. [PMID: 38831056 DOI: 10.1038/s43018-024-00772-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Accepted: 04/24/2024] [Indexed: 06/05/2024]
Abstract
Despite the revolutionary impact of immune checkpoint blockade (ICB) in cancer treatment, accurately predicting patient responses remains challenging. Here, we analyzed a large dataset of 2,881 ICB-treated and 841 non-ICB-treated patients across 18 solid tumor types, encompassing a wide range of clinical, pathologic and genomic features. We developed a clinical score called LORIS (logistic regression-based immunotherapy-response score) using a six-feature logistic regression model. LORIS outperforms previous signatures in predicting ICB response and identifying responsive patients even with low tumor mutational burden or programmed cell death 1 ligand 1 expression. LORIS consistently predicts patient objective response and short-term and long-term survival across most cancer types. Moreover, LORIS showcases a near-monotonic relationship with ICB response probability and patient survival, enabling precise patient stratification. As an accurate, interpretable method using a few readily measurable features, LORIS may help improve clinical decision-making in precision medicine to maximize patient benefit. LORIS is available as an online tool at https://loris.ccr.cancer.gov/ .
Collapse
Affiliation(s)
- Tian-Gen Chang
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute (NCI), National Institutes of Health (NIH), Bethesda, MD, USA
| | - Yingying Cao
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute (NCI), National Institutes of Health (NIH), Bethesda, MD, USA
| | - Hannah J Sfreddo
- Department of Surgery and Cancer Immunogenomics Research Program, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Saugato Rahman Dhruba
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute (NCI), National Institutes of Health (NIH), Bethesda, MD, USA
| | - Se-Hoon Lee
- Department of Health Sciences and Technology, Samsung Advanced Institute of Health Science and Technology, Sungkyunkwan University, Seoul, South Korea
| | - Cristina Valero
- Department of Surgery and Cancer Immunogenomics Research Program, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Seong-Keun Yoo
- The Marc and Jennifer Lipschultz Precision Immunology Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Oncological Sciences, Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Artificial Intelligence and Human Health, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Diego Chowell
- The Marc and Jennifer Lipschultz Precision Immunology Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Oncological Sciences, Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Artificial Intelligence and Human Health, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Luc G T Morris
- Department of Surgery and Cancer Immunogenomics Research Program, Memorial Sloan Kettering Cancer Center, New York, NY, USA.
| | - Eytan Ruppin
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute (NCI), National Institutes of Health (NIH), Bethesda, MD, USA.
| |
Collapse
|
50
|
Bu ZJ, Jiang N, Li KC, Lu ZL, Zhang N, Yan SS, Chen ZL, Hao YH, Zhang YH, Xu RB, Chi HW, Chen ZY, Liu JP, Wang D, Xu F, Liu ZL. Development and Validation of an Interpretable Machine Learning Model for Early Prognosis Prediction in ICU Patients with Malignant Tumors and Hyperkalemia. Medicine (Baltimore) 2024; 103:e38747. [PMID: 39058887 PMCID: PMC11272258 DOI: 10.1097/md.0000000000038747] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/18/2024] [Accepted: 06/07/2024] [Indexed: 07/28/2024] Open
Abstract
This study aims to develop and validate a machine learning (ML) predictive model for assessing mortality in patients with malignant tumors and hyperkalemia (MTH). We extracted data on patients with MTH from the Medical Information Mart for Intensive Care-IV, version 2.2 (MIMIC-IV v2.2) database. The dataset was split into a training set (75%) and a validation set (25%). We used the Least Absolute Shrinkage and Selection Operator (LASSO) regression to identify potential predictors, which included clinical laboratory indicators and vital signs. Pearson correlation analysis tested the correlation between predictors. In-hospital death was the prediction target. The Area Under the Curve (AUC) and accuracy of the training and validation sets of 7 ML algorithms were compared, and the optimal 1 was selected to develop the model. The calibration curve was used to evaluate the prediction accuracy of the model further. SHapley Additive exPlanations (SHAP) and Local Interpretable Model-agnostic Explanations (LIME) enhanced model interpretability. 496 patients with MTH in the Intensive Care Unit (ICU) were included. After screening, 17 clinical features were included in the construction of the ML model, and the Pearson correlation coefficient was <0.8, indicating that the correlation between the clinical features was small. eXtreme Gradient Boosting (XGBoost) outperformed other algorithms, achieving perfect scores in the training set (accuracy: 1.000, AUC: 1.000) and high scores in the validation set (accuracy: 0.734, AUC: 0.733). The calibration curves indicated good predictive calibration of the model. SHAP analysis identified the top 8 predictive factors: urine output, mean heart rate, maximum urea nitrogen, minimum oxygen saturation, minimum mean blood pressure, maximum total bilirubin, mean respiratory rate, and minimum pH. In addition, SHAP and LIME performed in-depth individual case analyses. This study demonstrates the effectiveness of ML methods in predicting mortality risk in ICU patients with MTH. It highlights the importance of predictors like urine output and mean heart rate. SHAP and LIME significantly enhanced the model's interpretability.
Collapse
Affiliation(s)
- Zhi-Jun Bu
- Centre for Evidence-Based Chinese Medicine, Beijing University of Chinese Medicine, Beijing, China
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing, China
| | - Nan Jiang
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing, China
- The Third Affiliated Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Ke-Cheng Li
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing, China
- Department of Andrology, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Zhi-Lin Lu
- First Clinical College, Hubei University of Chinese Medicine, Wuhan, China
| | - Nan Zhang
- School of International Studies, University of International Business and Economics, Beijing, China
| | - Shao-Shuai Yan
- Department of Thyropathy, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Zhi-Lin Chen
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing, China
| | - Yu-Han Hao
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing, China
| | - Yu-Huan Zhang
- School of Acupuncture and Orthopedics, Hubei University of Chinese Medicine, Wuhan, China
| | - Run-Bing Xu
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing, China
- Department of Hematology and Oncology, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Han-Wei Chi
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing, China
| | - Zu-Yi Chen
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing, China
| | - Jian-Ping Liu
- Centre for Evidence-Based Chinese Medicine, Beijing University of Chinese Medicine, Beijing, China
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing, China
| | - Dan Wang
- Surgery of Thyroid Gland and Breast, Hubei Provincial Hospital of Traditional Chinese Medicine, Wuhan, China
- Hubei Shizhen Laboratory, Wuhan, China
| | - Feng Xu
- The Third Affiliated Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Zhao-Lan Liu
- Centre for Evidence-Based Chinese Medicine, Beijing University of Chinese Medicine, Beijing, China
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing, China
| |
Collapse
|