1
|
Li M, Xu P, Hu J, Tang Z, Yang G. From challenges and pitfalls to recommendations and opportunities: Implementing federated learning in healthcare. Med Image Anal 2025; 101:103497. [PMID: 39961211 DOI: 10.1016/j.media.2025.103497] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2024] [Revised: 01/18/2025] [Accepted: 02/03/2025] [Indexed: 03/05/2025]
Abstract
Federated learning holds great potential for enabling large-scale healthcare research and collaboration across multiple centers while ensuring data privacy and security are not compromised. Although numerous recent studies suggest or utilize federated learning based methods in healthcare, it remains unclear which ones have potential clinical utility. This review paper considers and analyzes the most recent studies up to May 2024 that describe federated learning based methods in healthcare. After a thorough review, we find that the vast majority are not appropriate for clinical use due to their methodological flaws and/or underlying biases which include but are not limited to privacy concerns, generalization issues, and communication costs. As a result, the effectiveness of federated learning in healthcare is significantly compromised. To overcome these challenges, we provide recommendations and promising opportunities that might be implemented to resolve these problems and improve the quality of model development in federated learning with healthcare.
Collapse
Affiliation(s)
- Ming Li
- Bioengineering Department and Imperial-X, Imperial College London, London W12 7SL, UK; National Heart and Lung Institute, Imperial College London, London SW7 2AZ, UK.
| | - Pengcheng Xu
- Athinoula A. Martinos Center for Biomedical Imaging, Massachusetts General Hospital, Boston, MA, USA; State Key Laboratory of Extreme Photonics and Instrumentation, College of Optical Science and Engineering, Zhejiang University, Hangzhou, China.
| | - Junjie Hu
- National Heart and Lung Institute, Imperial College London, London SW7 2AZ, UK.
| | - Zeyu Tang
- Bioengineering Department and Imperial-X, Imperial College London, London W12 7SL, UK; Tri-Institutional Computational Biology & Medicine Program, Weill Cornell Medicine of Cornell University, NY, USA.
| | - Guang Yang
- Bioengineering Department and Imperial-X, Imperial College London, London W12 7SL, UK; National Heart and Lung Institute, Imperial College London, London SW7 2AZ, UK; Cardiovascular Research Centre, Royal Brompton Hospital, London SW3 6NP, UK; School of Biomedical Engineering & Imaging Sciences, King's College London, London WC2R 2LS, UK.
| |
Collapse
|
2
|
Szunerits S, Boukherroub R, Kleber C, Knoll W, Yunda J, Rumipamba J, Torres G, Melinte S. Biosensors integrated within wearable devices for monitoring chronic wound status. APL Bioeng 2025; 9:010901. [PMID: 39926013 PMCID: PMC11803754 DOI: 10.1063/5.0220516] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2024] [Accepted: 01/06/2025] [Indexed: 02/11/2025] Open
Abstract
Slowly healing wounds significantly affect the life quality of patients in different ways, due to constant pain, unpleasant odor, reduced mobility up to social isolation, and personal frustration. While remote wound management has become more widely accepted since the COVID-19 pandemic, delayed treatment remains frequent and results in several wound healing related complications. As inappropriate management of notably diabetic foot ulcers is linked to a high risk of amputation, effective management of wounds in a patient-centered manner remains important to be implemented. The integration of diagnostic devices into wound bandages is under way, owing to advancements in materials science and nanofabrication strategies as well as innovation in communication technologies together with machine learning and data-driven assessment tools. Leveraging advanced analytical approaches around local pH, temperature, pressure, and wound biomarker sensing is expected to facilitate adequate wound treatment. The state-of-the-art of time-resolved monitoring of the wound status by quantifying key physiological parameters as well as wound biomarkers' concentration is presented herewith. A special focus will be given to smart bandages with on-demand delivery capabilities for improved wound management.
Collapse
Affiliation(s)
| | - Rabah Boukherroub
- Univ. Lille, CNRS, Univ. Polytechnique Hauts-de-France, UMR 8520—IEMN, F-59000 Lille, France
| | - Christoph Kleber
- Laboratory for Life Sciences and Technology (LiST), Faculty of Medicine and Dentistry, Danube Private University, 3500 Krems, Austria
| | - Wolfgang Knoll
- Laboratory for Life Sciences and Technology (LiST), Faculty of Medicine and Dentistry, Danube Private University, 3500 Krems, Austria
| | - Jhonny Yunda
- Université catholique de Louvain, ICTEAM, 1348 Louvain-la-Neuve, Belgium
| | - José Rumipamba
- Université catholique de Louvain, ICTEAM, 1348 Louvain-la-Neuve, Belgium
| | - Guido Torres
- Université catholique de Louvain, ICTEAM, 1348 Louvain-la-Neuve, Belgium
| | - Sorin Melinte
- Université catholique de Louvain, ICTEAM, 1348 Louvain-la-Neuve, Belgium
| |
Collapse
|
3
|
Wang M, Fan S, Li Y, Gao B, Xie Z, Chen H. Robust multi-modal fusion architecture for medical data with knowledge distillation. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2025; 260:108568. [PMID: 39709743 DOI: 10.1016/j.cmpb.2024.108568] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/04/2024] [Revised: 10/01/2024] [Accepted: 12/16/2024] [Indexed: 12/24/2024]
Abstract
BACKGROUND The fusion of multi-modal data has been shown to significantly enhance the performance of deep learning models, particularly on medical data. However, missing modalities are common in medical data due to patient specificity, which poses a substantial challenge to the application of these models. OBJECTIVE This study aimed to develop a novel and efficient multi-modal fusion framework for medical datasets that maintains consistent performance, even in the absence of one or more modalities. METHODS In this paper, we fused three modalities: chest X-ray radiographs, history of present illness text, and tabular data such as demographics and laboratory tests. A multi-modal fusion module based on pooled bottleneck (PB) attention was proposed in conjunction with knowledge distillation (KD) for enhancing model inference in the case of missing modalities. In addition, we introduced a gradient modulation (GM) method to deal with the unbalanced optimization in multi-modal model training. Finally, we designed comparison and ablation experiments to evaluate the fusion effect, the model robustness to missing modalities, and the contribution of each component (PB, KD, and GM). The evaluation experiments were performed on the MIMIC-IV datasets with the task of predicting in-hospital mortality risk. Model performance was assessed using the area under the receiver operating characteristic curve (AUROC) and the area under the precision-recall curve (AUPRC). RESULTS The proposed multi-modal fusion framework achieved an AUROC of 0.886 and AUPRC of 0.459, significantly surpassing the performance of baseline models. Even when one or two modalities were missing, our model consistently outperformed the reference models. Ablation of each of the three components resulted in varying degrees of performance degradation, highlighting their distinct contributions to the model's overall effectiveness. CONCLUSIONS This innovative multi-modal fusion architecture has demonstrated robustness to missing modalities, and has shown excellent performance in fusing three medical modalities for patient outcome prediction. This study provides a novel idea for addressing the challenge of missing modalities and has the potential be scaled to additional modalities.
Collapse
Affiliation(s)
- Muyu Wang
- School of Biomedical Engineering, Capital Medical University, No.10, Xitoutiao, You An Men, Fengtai District, Beijing 100069, China; Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, No.10, Xitoutiao, You An Men, Fengtai District, Beijing 100069, China
| | - Shiyu Fan
- School of Biomedical Engineering, Capital Medical University, No.10, Xitoutiao, You An Men, Fengtai District, Beijing 100069, China; Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, No.10, Xitoutiao, You An Men, Fengtai District, Beijing 100069, China
| | - Yichen Li
- School of Biomedical Engineering, Capital Medical University, No.10, Xitoutiao, You An Men, Fengtai District, Beijing 100069, China; Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, No.10, Xitoutiao, You An Men, Fengtai District, Beijing 100069, China
| | - Binyu Gao
- School of Biomedical Engineering, Capital Medical University, No.10, Xitoutiao, You An Men, Fengtai District, Beijing 100069, China; Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, No.10, Xitoutiao, You An Men, Fengtai District, Beijing 100069, China
| | - Zhongrang Xie
- School of Biomedical Engineering, Capital Medical University, No.10, Xitoutiao, You An Men, Fengtai District, Beijing 100069, China; Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, No.10, Xitoutiao, You An Men, Fengtai District, Beijing 100069, China
| | - Hui Chen
- School of Biomedical Engineering, Capital Medical University, No.10, Xitoutiao, You An Men, Fengtai District, Beijing 100069, China; Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, No.10, Xitoutiao, You An Men, Fengtai District, Beijing 100069, China.
| |
Collapse
|
4
|
Wang M, Fan S, Li Y, Xie Z, Chen H. Missing-modality enabled multi-modal fusion architecture for medical data. J Biomed Inform 2025; 164:104796. [PMID: 39988001 DOI: 10.1016/j.jbi.2025.104796] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2024] [Revised: 01/23/2025] [Accepted: 02/01/2025] [Indexed: 02/25/2025]
Abstract
BACKGROUND Fusion of multi-modal data can improve the performance of deep learning models. However, missing modalities are common in medical data due to patient specificity, which is detrimental to the performance of multi-modal models in applications. Therefore, it is critical to adapt the models to missing modalities. OBJECTIVE This study aimed to develop an effective multi-modal fusion architecture for medical data that was robust to missing modalities and further improved the performance for clinical tasks. METHODS X-ray chest radiographs for the image modality, radiology reports for the text modality, and structured value data for the tabular data modality were fused in this study. Each modality pair was fused with a Transformer-based bi-modal fusion module, and the three bi-modal fusion modules were then combined into a tri-modal fusion framework. Additionally, multivariate loss functions were introduced into the training process to improve models' robustness to missing modalities during the inference process. Finally, we designed comparison and ablation experiments to validate the effectiveness of the fusion, the robustness to missing modalities, and the enhancements from each key component. Experiments were conducted on MIMIC-IV and MIMIC-CXR datasets with the 14-label disease diagnosis and patient in-hospital mortality prediction task The area under the receiver operating characteristic curve (AUROC) and the area under the precision-recall curve (AUPRC) were used to evaluate models' performance. RESULTS Our proposed architecture showed superior predictive performance, achieving the average AUROC and AUPRC of 0.916 and 0.551 in the 14-label classification task, 0.816 and 0.392 in the mortality prediction task. while the best average AUROC and AUPRC among the comparison methods were 0.876, 0.492 in the 14-label classification task and 0.806, 0.366 in the mortality prediction task. Both metrics decreased only slightly when tested with modal-incomplete data. Different levels of enhancements were achieved through three key components. CONCLUSIONS The proposed multi-modal fusion architecture effectively fused three modalities and showed strong robustness to missing modalities. This architecture holds promise for scaling up to more modalities to enhance the clinical practicality of the model.
Collapse
Affiliation(s)
- Muyu Wang
- School of Biomedical Engineering, Capital Medical University, No.10, Xitoutiao, You An Men, Fengtai District, Beijing 100069, China; Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, No.10, Xitoutiao, You An Men, Fengtai District, Beijing 100069, China
| | - Shiyu Fan
- School of Biomedical Engineering, Capital Medical University, No.10, Xitoutiao, You An Men, Fengtai District, Beijing 100069, China; Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, No.10, Xitoutiao, You An Men, Fengtai District, Beijing 100069, China
| | - Yichen Li
- School of Biomedical Engineering, Capital Medical University, No.10, Xitoutiao, You An Men, Fengtai District, Beijing 100069, China; Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, No.10, Xitoutiao, You An Men, Fengtai District, Beijing 100069, China
| | - Zhongrang Xie
- School of Biomedical Engineering, Capital Medical University, No.10, Xitoutiao, You An Men, Fengtai District, Beijing 100069, China; Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, No.10, Xitoutiao, You An Men, Fengtai District, Beijing 100069, China
| | - Hui Chen
- School of Biomedical Engineering, Capital Medical University, No.10, Xitoutiao, You An Men, Fengtai District, Beijing 100069, China; Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, No.10, Xitoutiao, You An Men, Fengtai District, Beijing 100069, China.
| |
Collapse
|
5
|
Larsen K, He Z, de A Fernandes F, Zhang X, Zhao C, Sha Q, Mesquita CT, Paez D, Garcia EV, Zou J, Peix A, Hung GU, Zhou W. A New Method Using Deep Learning to Predict the Response to Cardiac Resynchronization Therapy. JOURNAL OF IMAGING INFORMATICS IN MEDICINE 2025:10.1007/s10278-024-01380-8. [PMID: 39979759 DOI: 10.1007/s10278-024-01380-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/03/2024] [Revised: 11/30/2024] [Accepted: 12/09/2024] [Indexed: 02/22/2025]
Abstract
Clinical parameters measured from gated single-photon emission computed tomography myocardial perfusion imaging (SPECT MPI) have value in predicting cardiac resynchronization therapy (CRT) patient outcomes, but still show limitations. The purpose of this study is to combine clinical variables, features from electrocardiogram (ECG), and parameters from assessment of cardiac function with polar maps from gated SPECT MPI through deep learning (DL) to predict CRT response. A total of 218 patients who underwent rest-gated SPECT MPI were enrolled in this study. CRT response was defined as an increase in left ventricular ejection fraction (LVEF) > 5% at a 6-month follow-up. A DL model was constructed by combining a pre-trained VGG16 model and a multilayer perceptron. Two modalities of data were input to the model: polar map images from SPECT MPI and tabular data from clinical features, ECG parameters, and SPECT-MPI-derived parameters. Gradient-weighted class activation mapping (Grad-CAM) was applied to the VGG16 model to provide explainability for the polar maps. For comparison, four machine learning (ML) models were trained using only the tabular features. Modeling was performed on 218 patients who underwent CRT implantation with a response rate of 55.5% (n = 121). The DL model demonstrated average AUC (0.83), accuracy (0.73), sensitivity (0.76), and specificity (0.69) surpassing ML models and guideline criteria. Guideline recommendations achieved accuracy (0.53), sensitivity (0.75), and specificity (0.26). The DL model trended towards improvement over the ML models, showcasing the additional predictive benefit of utilizing SPECT MPI polar maps. Incorporating additional patient data directly in the form of medical imagery can improve CRT response prediction.
Collapse
Affiliation(s)
- Kristoffer Larsen
- Department of Mathematical Sciences, Michigan Technological University, Houghton, MI, USA
| | - Zhuo He
- Department of Applied Computing, Michigan Technological University, Houghton, MI, 49931, USA
| | - Fernando de A Fernandes
- Nuclear Medicine Department, Hospital Universitario Antonio Pedro-EBSERH-UFF, Niteroi, Brazil
| | - Xinwei Zhang
- Department of Cardiology, The First Affiliated Hospital of Nanjing Medical University, Guangzhou Road 300, Nanjing, Jiangsu, 210029, China
| | - Chen Zhao
- Department of Computer Science, Kennesaw State University, Marietta, GA, USA
| | - Qiuying Sha
- Department of Mathematical Sciences, Michigan Technological University, Houghton, MI, USA
| | - Claudio T Mesquita
- Nuclear Medicine Department, Hospital Universitario Antonio Pedro-EBSERH-UFF, Niteroi, Brazil
| | - Diana Paez
- Nuclear Medicine and Diagnostic Imaging Section, Division of Human Health, Department of Nuclear Sciences and Applications, International Atomic Energy Agency, Vienna, Austria
| | - Ernest V Garcia
- Department of Radiology and Imaging Sciences, Emory University, Atlanta, GA, USA
| | - Jiangang Zou
- Department of Cardiology, The First Affiliated Hospital of Nanjing Medical University, Guangzhou Road 300, Nanjing, Jiangsu, 210029, China.
| | - Amalia Peix
- Nuclear Medicine Department, Institute of Cardiology, 17 No. 702La Habana, Vedado, CP10 400, , Cuba.
| | - Guang-Uei Hung
- Department of Nuclear Medicine, Chang Bing Show Chwan Memorial Hospital, Changhua, Taiwan
| | - Weihua Zhou
- Department of Applied Computing, Michigan Technological University, Houghton, MI, 49931, USA.
- Center for Biocomputing and Digital Health, Institute of Computing and Cybersystems, and Health Research Institute, Michigan Technological University, 1400 Townsend Drive, Houghton, MI, 49931, USA.
| |
Collapse
|
6
|
Segado M, Prosser L, Duncan AF, Johnson MJ, Kording KP. Data-Driven Early Prediction of Cerebral Palsy Using AutoML and interpretable kinematic features. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2025:2025.02.10.25322007. [PMID: 39990562 PMCID: PMC11844582 DOI: 10.1101/2025.02.10.25322007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/25/2025]
Abstract
Early identification of cerebral palsy (CP) remains a major challenge due to the reliance on expert assessments that are time-intensive and not scalable. Consequently, a range of studies have aimed at using machine learning to predict CP scores based on motion tracking,e.g. from video data. These studies generally predict clinical scores which are a proxy for CP risk. However, clinicians do not REALLY want to estimate scores, they want to estimate the patients' risk of developing clinical symptoms. Here we present a data-driven machine-learning (ML) pipeline that extracts movement features from infant video based motion tracking and estimates CP risk using AutoML. Using AutoSklearn, our framework minimizes risk of overfitting by abstracting away researcher-driver hyperparameter optimization. Trained on movement data from 3- to 4-month-old infants, our classifier predicts a highly indicative clinical score (General Movements Assessment [GMA]) with an ROC-AUC of 0.78 on a held-out test set, indicating that kinematic movement features capture clinically relevant variability. Without retraining, the same model predicts the risk of cerebral palsy outcomes at later clinical follow-ups with an ROC-AUC of 0.74, demonstrating that early motor representations generalize to long-term neurodevelopmental risk. We employ pre-registered lock-box validation to ensure rig-orous performance evaluation. This study highlights the potential of AutoML-powered movement analytics for neurodevelopmental screening, demonstrating that data-driven feature extraction from movement trajectories can provide an interpretable and scalable approach to early risk assessment. By integrating pre-trained vision transformers, AutoML-driven model selection, and rigorous validation protocols, this work advances the use of video-derived movement features for scalable, data-driven clinical assessment, demonstrating how computational methods based on readily available data like infant videos can enhance early risk detection in neurodevelopmental disorders. CCS Concepts Computing methodologies → Machine learning approaches ; Applied computing → Health informatics . ACM Reference Format Melanie Segado, Laura Prosser, Andrea F. Duncan, Michelle J. Johnson, and Konrad P. Kording.. Data-Driven Early Prediction of Cerebral Palsy Using AutoML and interpretable kinematic features. In. ACM, New York, NY, USA, 8 pages.
Collapse
|
7
|
Kayaalp ME, Prill R, Sezgin EA, Cong T, Królikowska A, Hirschmann MT. DeepSeek versus ChatGPT: Multimodal artificial intelligence revolutionizing scientific discovery. From language editing to autonomous content generation-Redefining innovation in research and practice. Knee Surg Sports Traumatol Arthrosc 2025. [PMID: 39936363 DOI: 10.1002/ksa.12628] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/04/2025] [Revised: 02/05/2025] [Accepted: 02/05/2025] [Indexed: 02/13/2025]
Affiliation(s)
- Mahmut Enes Kayaalp
- Department of Orthopaedics and Traumatology, University of Health Sciences, Istanbul Fatih Sultan Mehmet Training and Research Hospital, Istanbul, Turkey
| | - Robert Prill
- Department of Orthopaedics and Traumatology, University Hospital Brandenburg/Havel, Brandenburg Medical School Theodor Fontane, Brandenburg/Havel, Germany
| | - Erdem Aras Sezgin
- Department of Orthopaedics and Traumatology, Gazi University, Ankara, Turkey
| | - Ting Cong
- Department of Orthopaedic Surgery, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| | - Aleksandra Królikowska
- Physiotherapy Research Laboratory, University Centre of Physiotherapy and Rehabilitation, Faculty of Physiotherapy, Wroclaw Medical University, Wroclaw, Poland
| | - Michael T Hirschmann
- Department of Orthopedic Surgery and Traumatology, Kantonsspital Baselland, Bruderholz, Switzerland
- Department of Clinical Research, Research Group Michael T. Hirschmann, Regenerative Medicine & Biomechanics, University of Basel, Basel, Switzerland
| |
Collapse
|
8
|
Rijken L, Zwetsloot S, Smorenburg S, Wolterink J, Išgum I, Marquering H, van Duivenvoorde J, Ploem C, Jessen R, Catarinella F, Lee R, Bera K, Buisan J, Zhang P, Dias-Neto M, Raffort J, Lareyre F, Muller C, Koncar I, Tomic I, Živković M, Djuric T, Stankovic A, Venermo M, Tulamo R, Behrendt CA, Smit N, Schijven M, van den Born BJ, Delewi R, Jongkind V, Ayyalasomayajula V, Khee Yeung K. Developing Trustworthy Artificial Intelligence Models to Predict Vascular Disease Progression: the VASCUL-AID-RETRO Study Protocol. J Endovasc Ther 2025:15266028251313963. [PMID: 39921236 DOI: 10.1177/15266028251313963] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/10/2025]
Abstract
INTRODUCTION Abdominal aortic aneurysms (AAAs) and peripheral artery disease (PAD) are two vascular diseases with a significant risk of major adverse cardiovascular events and mortality. A challenge in current disease management is the unpredictable disease progression in individual patients. The VASCUL-AID-RETRO study aims to develop trustworthy multimodal predictive artificial intelligence (AI) models for multiple tasks including risk stratification of disease progression and cardiovascular events in patients with AAA and PAD. METHODS The VASCUL-AID-RETRO study will collect data from 5000 AAA and 6000 PAD patients across multiple European centers of the VASCUL-AID consortium using electronic health records from 2015 to 2024. This retrospectively-collected data will be enriched with additional data from existing biobanks and registries. Multimodal data, including clinical records, radiological imaging, proteomics, and genomics, will be collected to develop AI models predicting disease progression and cardiovascular risks. This will be done while integrating the international ethics guidelines and legal standards for trustworthy AI, to ensure a socially-responsible data integration and analysis. PROPOSED ANALYSES A consensus-based variable list of clinical parameters and core outcome set for both diseases will be developed through meetings with key opinion leaders. Blood, plasma, and tissue samples from existing biobanks will be analyzed for proteomic and genomic variations. AI models will be trained on segmented AAA and PAD artery geometries for estimation of hemodynamic parameters to quantify disease progression. Initially, risk prediction models will be developed for each modality separately, and subsequently, all data will be combined to be used as input to multimodal prediction models. During all processes, data security, data quality, and ethical guidelines and legal standards will be carefully considered. As a next step, the developed models will be further adjusted with prospective data and internally validated in a prospective cohort (VASCUL-AID-PRO study). CONCLUSION The VASCUL-AID-RETRO study will utilize advanced AI techniques and integrate clinical, imaging, and multi-omics data to predict AAA and PAD progression and cardiovascular events. CLINICAL TRIAL REGISTRATION The VASCUL-AID-RETRO study is registered at www.clinicaltrials.gov under the identification number NCT06206369. CLINICAL IMPACT The VASCUL-AID-RETRO study aims to improve clinical practice of vascular surgery by developing artificial intelligence-driven multimodal predictive models for patients with abdominal aortic aneurysms or peripheral artery disease, enhancing personalized medicine. By integrating comprehensive data sets including clinical, imaging, and multi-omics data, these models have the potential to provide accurate risk stratification for disease progression and cardiovascular events. An innovation lies in the extensive European data set in combination with multimodal analyses approaches, which enables the development of advanced models to facilitate better understanding of disease mechanisms and progression. For clinicians, this means that more precise, individualized treatment plans can be established, ultimately aiming to improve patient outcomes.
Collapse
Affiliation(s)
- Lotte Rijken
- Department of Surgery, Amsterdam University Medical Center, Location Vrije Universiteit, Amsterdam, The Netherlands
- Atherosclerosis and Ischemic Syndromes, Amsterdam Cardiovascular Sciences, Amsterdam University Medical Center, Amsterdam, The Netherlands
- Digital Health Amsterdam Public Health, Amsterdam University Medical Center, Amsterdam, The Netherlands
| | - Sabrina Zwetsloot
- Department of Surgery, Amsterdam University Medical Center, Location Vrije Universiteit, Amsterdam, The Netherlands
- Atherosclerosis and Ischemic Syndromes, Amsterdam Cardiovascular Sciences, Amsterdam University Medical Center, Amsterdam, The Netherlands
| | - Stefan Smorenburg
- Department of Surgery, Amsterdam University Medical Center, Location Vrije Universiteit, Amsterdam, The Netherlands
- Atherosclerosis and Ischemic Syndromes, Amsterdam Cardiovascular Sciences, Amsterdam University Medical Center, Amsterdam, The Netherlands
| | - Jelmer Wolterink
- Department of Applied Mathematics, Technical Medical Centre, University of Twente, Enschede, The Netherlands
| | - Ivana Išgum
- Department of Biomedical Engineering and Physics, Amsterdam University Medical Center, Location University of Amsterdam, Amsterdam, The Netherlands
- Department of Radiology and Nuclear Medicine, Amsterdam University Medical Center, Location University of Amsterdam, Amsterdam, The Netherlands
- Informatics Institute, Faculty of Science, University of Amsterdam, Amsterdam, The Netherlands
| | - Henk Marquering
- Department of Biomedical Engineering and Physics, Amsterdam University Medical Center, Location University of Amsterdam, Amsterdam, The Netherlands
- Department of Radiology and Nuclear Medicine, Amsterdam University Medical Center, Location University of Amsterdam, Amsterdam, The Netherlands
| | - Jan van Duivenvoorde
- Department of Surgery, Amsterdam University Medical Center, Location Vrije Universiteit, Amsterdam, The Netherlands
- Atherosclerosis and Ischemic Syndromes, Amsterdam Cardiovascular Sciences, Amsterdam University Medical Center, Amsterdam, The Netherlands
| | - Corrette Ploem
- Department of Ethics, Law and Humanities, Amsterdam University Medical Center, Location University of Amsterdam, Amsterdam, The Netherlands
| | - Roosmarie Jessen
- Department of Ethics, Law and Humanities, Amsterdam University Medical Center, Location University of Amsterdam, Amsterdam, The Netherlands
| | | | - Regent Lee
- Nuffield Department of Surgical Sciences, University of Oxford, Oxford, UK
| | - Katarzyna Bera
- Nuffield Department of Surgical Sciences, University of Oxford, Oxford, UK
| | - Jenny Buisan
- Nuffield Department of Surgical Sciences, University of Oxford, Oxford, UK
| | - Ping Zhang
- Nuffield Department of Surgical Sciences, University of Oxford, Oxford, UK
| | - Marina Dias-Neto
- Department of Angiology and Vascular Surgery, Centro Hospitalar Universitário de São João, Porto, Portugal
- UnIC@RISE, Department of Surgery and Physiology, Faculty of Medicine, University of Porto, Porto, Portugal
| | - Juliette Raffort
- Clinical Chemistry Laboratory, University Hospital of Nice, Nice, France
- Institute 3IA Côte d'Azur, Université Côte d'Azur, Nice, France
- CNRS, UMR7370, LP2M, Université Côte d'Azur, Nice, France
| | - Fabien Lareyre
- CNRS, UMR7370, LP2M, Université Côte d'Azur, Nice, France
- Department of Vascular Surgery, Hospital of Antibes Juan-les-Pins, Antibes, France
| | | | - Igor Koncar
- Faculty of Medicine, University of Belgrade, Belgrade, Serbia
- Clinic for Vascular and Endovascular Surgery, Clinical Center of Serbia, Belgrade, Serbia
| | - Ivan Tomic
- Faculty of Medicine, University of Belgrade, Belgrade, Serbia
- Clinic for Vascular and Endovascular Surgery, Clinical Center of Serbia, Belgrade, Serbia
| | - Maja Živković
- Laboratory for Radiobiology and Molecular Genetics, VINCA Institute of Nuclear Sciences-National Institute of the Republic of Serbia, University of Belgrade, Belgrade, Serbia
| | - Tamara Djuric
- Laboratory for Radiobiology and Molecular Genetics, VINCA Institute of Nuclear Sciences-National Institute of the Republic of Serbia, University of Belgrade, Belgrade, Serbia
| | - Aleksandra Stankovic
- Laboratory for Radiobiology and Molecular Genetics, VINCA Institute of Nuclear Sciences-National Institute of the Republic of Serbia, University of Belgrade, Belgrade, Serbia
| | - Maarit Venermo
- Department of Vascular Surgery, Helsinki University Hospital, Helsinki, Finland
- Department of Vascular Surgery, University of Helsinki, Helsinki, Finland
| | - Riikka Tulamo
- Department of Vascular Surgery, Helsinki University Hospital, Helsinki, Finland
- Department of Vascular Surgery, University of Helsinki, Helsinki, Finland
| | - Christian-Alexander Behrendt
- Department of Vascular and Endovascular Surgery, Asklepios Clinic Wandsbek, Asklepios Medical School, Hamburg, Germany
| | - Noeska Smit
- Department of Informatics, University of Bergen, Bergen, Norway
- Department of Radiology, Mohn Medical Imaging and Visualization Centre, Haukeland University Hospital, Bergen, Norway
| | - Marlies Schijven
- Digital Health Amsterdam Public Health, Amsterdam University Medical Center, Amsterdam, The Netherlands
- Department of Surgery, Amsterdam University Medical Center, Location University of Amsterdam, Amsterdam, The Netherlands
- Amsterdam Gastroenterology Endocrinology Metabolism, Amsterdam University Medical Center, Amsterdam, The Netherlands
| | - Bert-Jan van den Born
- Department of Public and Occupational Health, Amsterdam University Medical Center, Location University of Amsterdam, Amsterdam, The Netherlands
- Department of Vascular Medicine, Amsterdam University Medical Center, Location University of Amsterdam, Amsterdam, The Netherlands
| | - Ronak Delewi
- Department of Cardiology, Amsterdam University Medical Center, Location University of Amsterdam, Amsterdam, The Netherlands
- Amsterdam Cardiovascular Sciences, Amsterdam, The Netherlands
| | - Vincent Jongkind
- Department of Surgery, Amsterdam University Medical Center, Location Vrije Universiteit, Amsterdam, The Netherlands
- Microcirculation, Amsterdam Cardiovascular Sciences, Amsterdam, The Netherlands
| | - Venkat Ayyalasomayajula
- Department of Surgery, Amsterdam University Medical Center, Location Vrije Universiteit, Amsterdam, The Netherlands
- Atherosclerosis and Ischemic Syndromes, Amsterdam Cardiovascular Sciences, Amsterdam University Medical Center, Amsterdam, The Netherlands
| | - Kak Khee Yeung
- Department of Surgery, Amsterdam University Medical Center, Location Vrije Universiteit, Amsterdam, The Netherlands
- Atherosclerosis and Ischemic Syndromes, Amsterdam Cardiovascular Sciences, Amsterdam University Medical Center, Amsterdam, The Netherlands
| |
Collapse
|
9
|
Kim YE, Dobko M, Li H, Shao T, Periyakoil P, Tipton C, Colasacco C, Serpedin A, Elemento O, Sabuncu M, Pitman M, Sulica L, Rameau A. A Deep-Learning Model for Multi-class Audio Classification of Vocal Fold Pathologies in Office Stroboscopy. Laryngoscope 2025. [PMID: 39907244 DOI: 10.1002/lary.32036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2024] [Revised: 01/06/2025] [Accepted: 01/16/2025] [Indexed: 02/06/2025]
Abstract
OBJECTIVE To develop and validate a deep-learning classifier trained on voice data extracted from videolaryngostroboscopy recordings, differentiating between three different vocal fold (VF) states: healthy (HVF), unilateral paralysis (UVFP), and VF lesions, including benign and malignant pathologies. METHODS Patients with UVFP (n = 105), VF lesions (n = 63), and HVF (n = 41) were retrospectively identified. Voice samples were extracted from stroboscopic videos (Pentax Laryngeal Strobe Model 9400), including sustained /i/ phonation, pitch glide, and /i/ sniff task. Extracted audio files were converted into Mel-spectrograms. Voice samples were independently divided into training (80%), validation (10%), and test (10%) by patient. Pretrained ResNet18 models were trained to classify (1) HVF and pathological VF (lesions and UVFP), and (2) HVF, UVFP, and VF lesions. Both classifiers were further validated on an external dataset consisting of 12 UVFP, 13 VF lesions, and 15 HVF patients. Model performances were evaluated by accuracy and F1-score. RESULTS When evaluated on a hold-out test set, the binary classifier demonstrated stronger performance compared to the multi-class classifier (accuracy 83% vs. 40%; F1-score 0.90 vs. 0.36). When evaluated on an external dataset, the binary classifier achieved an accuracy of 63% and F1-score of 0.48, compared to 35% and 0.25 for the multi-class classifier. CONCLUSIONS Deep-learning classifiers differentiating HVF, UVFP, and VF lesions were developed using voice data from stroboscopic videos. Although healthy and pathological voice were differentiated with moderate accuracy, multi-class classification lowered model performance. The model performed poorly on an external dataset. Voice captured in stroboscopic videos may have limited diagnostic value, though further studies are needed. LEVEL OF EVIDENCE 4 Laryngoscope, 2025.
Collapse
Affiliation(s)
- Yeo E Kim
- Sean Parker Institute for the Voice, Department of Otolaryngology-Head and Neck Surgery, Weill Cornell Medicine, New York, New York, U.S.A
| | - Maria Dobko
- Sean Parker Institute for the Voice, Department of Otolaryngology-Head and Neck Surgery, Weill Cornell Medicine, New York, New York, U.S.A
| | | | - Tianlan Shao
- Sean Parker Institute for the Voice, Department of Otolaryngology-Head and Neck Surgery, Weill Cornell Medicine, New York, New York, U.S.A
| | - Preethi Periyakoil
- Sean Parker Institute for the Voice, Department of Otolaryngology-Head and Neck Surgery, Weill Cornell Medicine, New York, New York, U.S.A
| | - Courtney Tipton
- Sean Parker Institute for the Voice, Department of Otolaryngology-Head and Neck Surgery, Weill Cornell Medicine, New York, New York, U.S.A
| | - Christine Colasacco
- The Center for Voice and Swallowing, Department of Otolaryngology-Head and Neck Surgery, New York-Presbyterian Hospital, Columbia University Irvine Medical Center, New York City, New York, U.S.A
| | | | - Olivier Elemento
- Englander Institute for Precision Medicine, Weill Cornell Medicine, New York, New York, U.S.A
| | - Mert Sabuncu
- Department of Radiology, Weill Cornell Medicine, New York, New York, U.S.A
| | - Michael Pitman
- The Center for Voice and Swallowing, Department of Otolaryngology-Head and Neck Surgery, New York-Presbyterian Hospital, Columbia University Irvine Medical Center, New York City, New York, U.S.A
| | - Lucian Sulica
- Sean Parker Institute for the Voice, Department of Otolaryngology-Head and Neck Surgery, Weill Cornell Medicine, New York, New York, U.S.A
| | - Anaïs Rameau
- Sean Parker Institute for the Voice, Department of Otolaryngology-Head and Neck Surgery, Weill Cornell Medicine, New York, New York, U.S.A
| |
Collapse
|
10
|
Yalin N. Revisiting Neuroimaging Endophenotypes in the Era of Machine Learning: The Key Role of Clinical Measures in Identifying Risk for Bipolar Disorder. Biol Psychiatry 2025; 97:215-216. [PMID: 39722257 DOI: 10.1016/j.biopsych.2024.11.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/05/2024] [Revised: 11/13/2024] [Accepted: 11/14/2024] [Indexed: 12/28/2024]
Affiliation(s)
- Nefize Yalin
- Experimental Therapeutics and Pathophysiology Branch, Section on the Neurobiology and Treatment of Mood Disorders, National Institute of Mental Health, National Institutes of Health, Bethesda, Maryland.
| |
Collapse
|
11
|
Willem T, Wollek A, Cheslerean-Boghiu T, Kenney M, Buyx A. The Social Construction of Categorical Data: Mixed Methods Approach to Assessing Data Features in Publicly Available Datasets. JMIR Med Inform 2025; 13:e59452. [PMID: 39874567 DOI: 10.2196/59452] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2024] [Revised: 11/14/2024] [Accepted: 11/17/2024] [Indexed: 01/30/2025] Open
Abstract
BACKGROUND In data-sparse areas such as health care, computer scientists aim to leverage as much available information as possible to increase the accuracy of their machine learning models' outputs. As a standard, categorical data, such as patients' gender, socioeconomic status, or skin color, are used to train models in fusion with other data types, such as medical images and text-based medical information. However, the effects of including categorical data features for model training in such data-scarce areas are underexamined, particularly regarding models intended to serve individuals equitably in a diverse population. OBJECTIVE This study aimed to explore categorical data's effects on machine learning model outputs, rooted the effects in the data collection and dataset publication processes, and proposed a mixed methods approach to examining datasets' data categories before using them for machine learning training. METHODS Against the theoretical background of the social construction of categories, we suggest a mixed methods approach to assess categorical data's utility for machine learning model training. As an example, we applied our approach to a Brazilian dermatological dataset (Dermatological and Surgical Assistance Program at the Federal University of Espírito Santo [PAD-UFES] 20). We first present an exploratory, quantitative study that assesses the effects when including or excluding each of the unique categorical data features of the PAD-UFES 20 dataset for training a transformer-based model using a data fusion algorithm. We then pair our quantitative analysis with a qualitative examination of the data categories based on interviews with the dataset authors. RESULTS Our quantitative study suggests scattered effects of including categorical data for machine learning model training across predictive classes. Our qualitative analysis gives insights into how the categorical data were collected and why they were published, explaining some of the quantitative effects that we observed. Our findings highlight the social constructedness of categorical data in publicly available datasets, meaning that the data in a category heavily depend on both how these categories are defined by the dataset creators and the sociomedico context in which the data are collected. This reveals relevant limitations of using publicly available datasets in contexts different from those of the collection of their data. CONCLUSIONS We caution against using data features of publicly available datasets without reflection on the social construction and context dependency of their categorical data features, particularly in data-sparse areas. We conclude that social scientific, context-dependent analysis of available data features using both quantitative and qualitative methods is helpful in judging the utility of categorical data for the population for which a model is intended.
Collapse
Affiliation(s)
- Theresa Willem
- Institute of History and Ethics in Medicine, School of Medicine and Health, Technical University of Munich, Munich, Germany
- Helmholtz AI, Helmholtz Munich, Munich, Germany
| | - Alessandro Wollek
- Munich Institute of Biomedical Engineering, School of Computation, Information, and Technology, Technical University of Munich, Munich, Germany
| | - Theodor Cheslerean-Boghiu
- Munich Institute of Biomedical Engineering, School of Computation, Information, and Technology, Technical University of Munich, Munich, Germany
| | - Martha Kenney
- Women & Gender Studies, San Francisco State University, San Francisco, CA, United States
| | - Alena Buyx
- Institute of History and Ethics in Medicine, School of Medicine and Health, Technical University of Munich, Munich, Germany
| |
Collapse
|
12
|
Khodaee F, Zandie R, Edelman ER. Multimodal learning for mapping genotype-phenotype dynamics. NATURE COMPUTATIONAL SCIENCE 2025:10.1038/s43588-024-00765-7. [PMID: 39875699 DOI: 10.1038/s43588-024-00765-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/01/2024] [Accepted: 12/20/2024] [Indexed: 01/30/2025]
Abstract
How complex phenotypes emerge from intricate gene expression patterns is a fundamental question in biology. Integrating high-content genotyping approaches such as single-cell RNA sequencing and advanced learning methods such as language models offers an opportunity for dissecting this complex relationship. Here we present a computational integrated genetics framework designed to analyze and interpret the high-dimensional landscape of genotypes and their associated phenotypes simultaneously. We applied this approach to develop a multimodal foundation model to explore the genotype-phenotype relationship manifold for human transcriptomics at the cellular level. Analyzing this joint manifold showed a refined resolution of cellular heterogeneity, uncovered potential cross-tissue biomarkers and provided contextualized embeddings to investigate the polyfunctionality of genes shown for the von Willebrand factor (VWF) gene in endothelial cells. Overall, this study advances our understanding of the dynamic interplay between gene expression and phenotypic manifestation and demonstrates the potential of integrated genetics in uncovering new dimensions of cellular function and complexity.
Collapse
Affiliation(s)
- Farhan Khodaee
- Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, MA, USA.
| | - Rohola Zandie
- Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Elazer R Edelman
- Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, MA, USA
- Department of Medicine (Cardiovascular Medicine), Brigham and Women's Hospital, Boston, MA, USA
| |
Collapse
|
13
|
Guo X, Ma M, Zhao L, Wu J, Lin Y, Fei F, Tarimo CS, Wang S, Zhang J, Cheng X, Ye B. The association of lifestyle with cardiovascular and all-cause mortality based on machine learning: a prospective study from the NHANES. BMC Public Health 2025; 25:319. [PMID: 39856612 PMCID: PMC11763113 DOI: 10.1186/s12889-025-21339-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2024] [Accepted: 01/06/2025] [Indexed: 01/27/2025] Open
Abstract
BACKGROUND Lifestyle and cardiovascular mortality and all-cause mortality have been exhaustively explored by traditional methods, but the advantages of machine learning (ML) over traditional methods may lead to different or more precise conclusions. The aim of this study was to evaluate the effectiveness of machine learning-based lifestyle factors in predicting cardiovascular and all-cause mortality and compare the results obtained by traditional methods. METHOD A prospective cohort study was conducted using a nationally representative sample of adults aged 40 years or older, drawn from the US National Health and Nutrition Examination Survey from 2007 to 2010. The participants underwent a comprehensive in-person interview and medical laboratory examinations, and subsequently, their records were linked with the National Death Index for further analysis. Extreme gradient enhancement, random forest, support vector machine and other machine learning methods are used to build the prediction model. RESULT Within a cohort comprising 7921 participants, spanning an average follow-up duration of 9.75 years, a total of 1911 deaths, including 585 cardiovascular-related deaths, were recorded. The model predicted mortality with an area under the receiver operating characteristic curve (AUC) of 0.862 and 0.836. Stratifying participants into distinct risk groups based on ML scores proved effective. All lifestyle behaviors were associated with a reduced risk of all-cause and cardiovascular mortality. As age increases, the effects of dietary scores and sedentary time on mortality risk become more pronounced, while the influence of physical activity tends to diminish. CONCLUSION We develop a ML model based on lifestyle behaviors to predict all-cause and cardiovascular mortality. The developed model offers valuable insights for the assessment of individual lifestyle-related risks. It applies to individuals, healthcare professionals, and policymakers to make informed decisions.
Collapse
Affiliation(s)
- Xinghong Guo
- Department of Health Management of Public Health, College of Public Health, Zhengzhou University, 100 Kexue Road, Gaoxin district, Zhengzhou, 450001, Henan, China
| | - Mingze Ma
- Department of Health Management of Public Health, College of Public Health, Zhengzhou University, 100 Kexue Road, Gaoxin district, Zhengzhou, 450001, Henan, China
| | - Lipei Zhao
- Department of Health Management of Public Health, College of Public Health, Zhengzhou University, 100 Kexue Road, Gaoxin district, Zhengzhou, 450001, Henan, China
| | - Jian Wu
- Department of Health Management of Public Health, College of Public Health, Zhengzhou University, 100 Kexue Road, Gaoxin district, Zhengzhou, 450001, Henan, China
| | - Yan Lin
- Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, No.100 Science Avenue, Zhengzhou, Henan, 450001, China
| | - Fengyi Fei
- Department of Health Management of Public Health, College of Public Health, Zhengzhou University, 100 Kexue Road, Gaoxin district, Zhengzhou, 450001, Henan, China
| | - Clifford Silver Tarimo
- Department of Health Management of Public Health, College of Public Health, Zhengzhou University, 100 Kexue Road, Gaoxin district, Zhengzhou, 450001, Henan, China
| | - Saiyi Wang
- Department of Health Management of Public Health, College of Public Health, Zhengzhou University, 100 Kexue Road, Gaoxin district, Zhengzhou, 450001, Henan, China
| | - Jingyi Zhang
- Department of Health Management of Public Health, College of Public Health, Zhengzhou University, 100 Kexue Road, Gaoxin district, Zhengzhou, 450001, Henan, China
| | - Xinya Cheng
- Faculty of Arts and Social Sciences, Hong Kong Baptist University, 224 Waterloo Road, Kowloon Tong, Hong Kong
| | - Beizhu Ye
- Department of Health Management of Public Health, College of Public Health, Zhengzhou University, 100 Kexue Road, Gaoxin district, Zhengzhou, 450001, Henan, China.
| |
Collapse
|
14
|
Haider SA, Ho OA, Borna S, Gomez-Cabello CA, Pressman SM, Cole D, Sehgal A, Leibovich BC, Forte AJ. Use of Multimodal Artificial Intelligence in Surgical Instrument Recognition. Bioengineering (Basel) 2025; 12:72. [PMID: 39851346 PMCID: PMC11761865 DOI: 10.3390/bioengineering12010072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2024] [Revised: 01/11/2025] [Accepted: 01/14/2025] [Indexed: 01/26/2025] Open
Abstract
Accurate identification of surgical instruments is crucial for efficient workflows and patient safety within the operating room, particularly in preventing complications such as retained surgical instruments. Artificial Intelligence (AI) models have shown the potential to automate this process. This study evaluates the accuracy of publicly available Large Language Models (LLMs)-ChatGPT-4, ChatGPT-4o, and Gemini-and a specialized commercial mobile application, Surgical-Instrument Directory (SID 2.0), in identifying surgical instruments from images. The study utilized a dataset of 92 high-resolution images of 25 surgical instruments (retractors, forceps, scissors, and trocars) photographed from multiple angles. Model performance was evaluated using accuracy, weighted precision, recall, and F1 score. ChatGPT-4o exhibited the highest accuracy (89.1%) in categorizing instruments (e.g., scissors, forceps). SID 2.0 (77.2%) and ChatGPT-4 (76.1%) achieved comparable accuracy, while Gemini (44.6%) demonstrated lower accuracy in this task. For precise subtype identification of instrument names (like "Mayo scissors" or "Kelly forceps"), all models had low accuracy, with SID 2.0 having an accuracy of 39.1%, followed by ChatGPT-4o (33.69%). Subgroup analysis revealed ChatGPT-4 and 4o recognized trocars in all instances. Similarly, Gemini identified surgical scissors in all instances. In conclusion, publicly available LLMs can reliably identify surgical instruments at the category level, with ChatGPT-4o demonstrating an overall edge. However, precise subtype identification remains a challenge for all models. These findings highlight the potential of AI-driven solutions to enhance surgical-instrument management and underscore the need for further refinements to improve accuracy and support patient safety.
Collapse
Affiliation(s)
- Syed Ali Haider
- Division of Plastic Surgery, Mayo Clinic, Jacksonville, FL 32224, USA
| | - Olivia A. Ho
- Division of Plastic Surgery, Mayo Clinic, Jacksonville, FL 32224, USA
| | - Sahar Borna
- Division of Plastic Surgery, Mayo Clinic, Jacksonville, FL 32224, USA
| | | | | | - Dave Cole
- Center for Digital Health, Mayo Clinic, Rochester, MN 55905, USA
| | - Ajai Sehgal
- Center for Digital Health, Mayo Clinic, Rochester, MN 55905, USA
| | - Bradley C. Leibovich
- Center for Digital Health, Mayo Clinic, Rochester, MN 55905, USA
- Department of Urology, Mayo Clinic, Rochester, MN 55905, USA
| | - Antonio Jorge Forte
- Division of Plastic Surgery, Mayo Clinic, Jacksonville, FL 32224, USA
- Center for Digital Health, Mayo Clinic, Rochester, MN 55905, USA
| |
Collapse
|
15
|
Golder S, Xu D, O'Connor K, Wang Y, Batra M, Hernandez GG. Leveraging Natural Language Processing and Machine Learning Methods for Adverse Drug Event Detection in Electronic Health/Medical Records: A Scoping Review. Drug Saf 2025:10.1007/s40264-024-01505-6. [PMID: 39786481 DOI: 10.1007/s40264-024-01505-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/24/2024] [Indexed: 01/12/2025]
Abstract
BACKGROUND Natural language processing (NLP) and machine learning (ML) techniques may help harness unstructured free-text electronic health record (EHR) data to detect adverse drug events (ADEs) and thus improve pharmacovigilance. However, evidence of their real-world effectiveness remains unclear. OBJECTIVE To summarise the evidence on the effectiveness of NLP/ML in detecting ADEs from unstructured EHR data and ultimately improve pharmacovigilance in comparison to other data sources. METHODS A scoping review was conducted by searching six databases in July 2023. Studies leveraging NLP/ML to identify ADEs from EHR were included. Titles/abstracts were screened by two independent researchers as were full-text articles. Data extraction was conducted by one researcher and checked by another. A narrative synthesis summarises the research techniques, ADEs analysed, model performance and pharmacovigilance impacts. RESULTS Seven studies met the inclusion criteria covering a wide range of ADEs and medications. The utilisation of rule-based NLP, statistical models, and deep learning approaches was observed. Natural language processing/ML techniques with unstructured data improved the detection of under-reported adverse events and safety signals. However, substantial variability was noted in the techniques and evaluation methods employed across the different studies and limitations exist in integrating the findings into practice. CONCLUSIONS Natural language processing (NLP) and machine learning (ML) have promising possibilities in extracting valuable insights with regard to pharmacovigilance from unstructured EHR data. These approaches have demonstrated proficiency in identifying specific adverse events and uncovering previously unknown safety signals that would not have been apparent through structured data alone. Nevertheless, challenges such as the absence of standardised methodologies and validation criteria obstruct the widespread adoption of NLP/ML for pharmacovigilance leveraging of unstructured EHR data.
Collapse
Affiliation(s)
- Su Golder
- Department of Health Sciences, University of York, York, YO10 5DD, UK.
| | - Dongfang Xu
- Department of Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| | - Karen O'Connor
- Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Yunwen Wang
- William Allen White School of Journalism and Mass Communications, The University of Kansas, Lawrence, KS, USA
| | - Mahak Batra
- Department of Health Sciences, University of York, York, YO10 5DD, UK
| | | |
Collapse
|
16
|
Park JH, Shin YB, Jung D, Hur JW, Pack SP, Lee HJ, Lee H, Cho CH. Machine learning prediction of anxiety symptoms in social anxiety disorder: utilizing multimodal data from virtual reality sessions. Front Psychiatry 2025; 15:1504190. [PMID: 39896993 PMCID: PMC11784525 DOI: 10.3389/fpsyt.2024.1504190] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/30/2024] [Accepted: 12/09/2024] [Indexed: 02/04/2025] Open
Abstract
Introduction Machine learning (ML) is an effective tool for predicting mental states and is a key technology in digital psychiatry. This study aimed to develop ML algorithms to predict the upper tertile group of various anxiety symptoms based on multimodal data from virtual reality (VR) therapy sessions for social anxiety disorder (SAD) patients and to evaluate their predictive performance across each data type. Methods This study included 32 SAD-diagnosed individuals, and finalized a dataset of 132 samples from 25 participants. It utilized multimodal (physiological and acoustic) data from VR sessions to simulate social anxiety scenarios. This study employed extended Geneva minimalistic acoustic parameter set for acoustic feature extraction and extracted statistical attributes from time series-based physiological responses. We developed ML models that predict the upper tertile group for various anxiety symptoms in SAD using Random Forest, extreme gradient boosting (XGBoost), light gradient boosting machine (LightGBM), and categorical boosting (CatBoost) models. The best parameters were explored through grid search or random search, and the models were validated using stratified cross-validation and leave-one-out cross-validation. Results The CatBoost, using multimodal features, exhibited high performance, particularly for the Social Phobia Scale with an area under the receiver operating characteristics curve (AUROC) of 0.852. It also showed strong performance in predicting cognitive symptoms, with the highest AUROC of 0.866 for the Post-Event Rumination Scale. For generalized anxiety, the LightGBM's prediction for the State-Trait Anxiety Inventory-trait led to an AUROC of 0.819. In the same analysis, models using only physiological features had AUROCs of 0.626, 0.744, and 0.671, whereas models using only acoustic features had AUROCs of 0.788, 0.823, and 0.754. Conclusions This study showed that a ML algorithm using integrated multimodal data can predict upper tertile anxiety symptoms in patients with SAD with higher performance than acoustic or physiological data obtained during a VR session. The results of this study can be used as evidence for personalized VR sessions and to demonstrate the strength of the clinical use of multimodal data.
Collapse
Affiliation(s)
- Jin-Hyun Park
- Department of Biomedical Informatics, Korea University College of Medicine, Seoul, Republic of Korea
| | - Yu-Bin Shin
- Department of Psychiatry, Korea University College of Medicine, Seoul, Republic of Korea
| | - Dooyoung Jung
- Graduate School of Health Science and Technology, Department of Biomedical Engineering, Ulsan National Institute of Science and Technology (UNIST), Ulsan, Republic of Korea
| | - Ji-Won Hur
- School of Psychiatry, Korea University, Seoul, Republic of Korea
| | - Seung Pil Pack
- Department of Biotechnology and Bioinformatics, Korea University, Sejong, Republic of Korea
| | - Heon-Jeong Lee
- Department of Psychiatry, Korea University College of Medicine, Seoul, Republic of Korea
| | - Hwamin Lee
- Department of Biomedical Informatics, Korea University College of Medicine, Seoul, Republic of Korea
| | - Chul-Hyun Cho
- Department of Biomedical Informatics, Korea University College of Medicine, Seoul, Republic of Korea
- Department of Psychiatry, Korea University College of Medicine, Seoul, Republic of Korea
| |
Collapse
|
17
|
Abe D, Inaji M, Hase T, Suehiro E, Shiomi N, Yatsushige H, Hirota S, Hasegawa S, Karibe H, Miyata A, Kawakita K, Haji K, Aihara H, Yokobori S, Maeda T, Onuki T, Oshio K, Komoribayashi N, Suzuki M, Maehara T. A machine learning model to predict neurological deterioration after mild traumatic brain injury in older adults. Front Neurol 2025; 15:1502153. [PMID: 39830200 PMCID: PMC11739101 DOI: 10.3389/fneur.2024.1502153] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2024] [Accepted: 12/10/2024] [Indexed: 01/22/2025] Open
Abstract
Objective Neurological deterioration after mild traumatic brain injury (TBI) has been recognized as a poor prognostic factor. Early detection of neurological deterioration would allow appropriate monitoring and timely therapeutic interventions to improve patient outcomes. In this study, we developed a machine learning model to predict the occurrence of neurological deterioration after mild TBI using information obtained on admission. Methods This was a retrospective cohort study of data from the Think FAST registry, a multicenter prospective observational study of elderly TBI patients in Japan. Patients with an admission Glasgow Coma Scale (GCS) score of 12 or below or who underwent surgical treatment immediately upon admission were excluded. Neurological deterioration was defined as a decrease of 2 or more points from a GCS score of 13 or more within 24 h of hospital admission. The model predictive accuracy was judged with the area under the receiver operating characteristic curve (AUROC) and the area under the precision-recall curve (AUPRC), and the Youden index was used to determine the cutoff value. Results A total of 421 of 721 patients registered in the Think FAST registry between December 2019 and May 2021 were included in our study, among whom 25 demonstrated neurological deterioration. Among several machine learning algorithms, eXtreme Gradient Boosting (XGBoost) demonstrated the highest predictive accuracy in cross-validation, with an AUROC of 0.81 (±0.07) and an AUPRC of 0.33 (±0.08). Through SHapley Additive exPlanations (SHAP) analysis, five important features (D-dimer, fibrinogen, acute subdural hematoma thickness, cerebral contusion size, and systolic blood pressure) were identified and used to construct a better performing model (cross-validation AUROC of 0.84 and AUPRC of 0.34; testing data AUROC of 0.77 and AUPRC of 0.19). At the cutoff value from the Youden index, the model showed a sensitivity, specificity, and positive predictive value of 60, 96, and 38%, respectively. When neurosurgeons attempted to predict neurological deterioration using the same testing data, their values were 20, 94, and 19%, respectively. Conclusion In this study, our predictive model showed an acceptable performance in detecting neurological deterioration after mild TBI. Further validation through prospective studies is necessary to confirm these results.
Collapse
Affiliation(s)
- Daisu Abe
- Department of Neurosurgery, Tokyo Medical and Dental University, Bunkyo-ku, Japan
| | - Motoki Inaji
- Department of Neurosurgery, Tokyo Medical and Dental University, Bunkyo-ku, Japan
| | - Takeshi Hase
- Institute of Education, Innovative Human Resource Development Division, Tokyo Medical and Dental University, Bunkyo-ku, Japan
| | - Eiichi Suehiro
- Department of Neurosurgery, School of Medicine, International University of Health and Welfare, Narita, Japan
| | - Naoto Shiomi
- Emergency Medical Care Center, Saiseikai Shiga Hospital, Ritto, Shiga, Japan
| | - Hiroshi Yatsushige
- Department of Neurosurgery, NHO Disaster Medical Center, Tachikawa, Japan
| | - Shin Hirota
- Department of Neurosurgery, Tsuchiura Kyodo General Hospital, Tsuchiura, Ibaraki, Japan
| | - Shu Hasegawa
- Department of Neurosurgery, Kumamoto Red Cross Hospital, Kumamoto, Japan
| | - Hiroshi Karibe
- Department of Neurosurgery, Sendai City Hospital, Sendai, Miyagi, Japan
| | - Akihiro Miyata
- Department of Neurosurgery, Chiba Emergency Medical Center, Chiba, Japan
| | - Kenya Kawakita
- Emergency Medical Center, Kagawa University Hospital, Kita-gun, Kagawa, Japan
| | - Kohei Haji
- Department of Neurosurgery, Yamaguchi University School of Medicine, Ube, Yamaguchi, Japan
| | - Hideo Aihara
- Department of Neurosurgery, Hyogo Prefectural Kakogawa Medical Center, Kakogawa, Hyogo, Japan
| | - Shoji Yokobori
- Department of Emergency and Critical Care Medicine, Graduate School of Medicine, Nippon Medical School, Bunkyo-ku, Japan
| | - Takeshi Maeda
- Department of Neurological Surgery, Nihon University School of Medicine, Itabashi-ku, Japan
| | - Takahiro Onuki
- Department of Emergency Medicine, Teikyo University School of Medicine, Itabashi-ku, Japan
| | - Kotaro Oshio
- Department of Neurosurgery, St. Marianna University School of Medicine, Kawasaki, Kanagawa, Japan
| | - Nobukazu Komoribayashi
- Iwate Prefectural Advanced Critical Care and Emergency Center, Iwate Medical University, Yahaba, Iwate, Japan
| | - Michiyasu Suzuki
- Department of Neurosurgery, Yamaguchi University School of Medicine, Ube, Yamaguchi, Japan
| | - Taketoshi Maehara
- Department of Neurosurgery, Tokyo Medical and Dental University, Bunkyo-ku, Japan
| |
Collapse
|
18
|
Cho E, Yi JM, Chun J, Jang H, Yoon SH, Lee SH, Jang SH, Park DW, Kim SJ, Um SW, Lee SY, Jeong MK. Efficacy and Safety of Herbal Medicine Bojungikki-Tang in Combination with Pembrolizumab versus Pembrolizumab Monotherapy for Stage IV Non-Small Cell Lung Cancer: Study Protocol for a Randomized, Open-Label, Double-Arm, Multicenter Trial. Integr Cancer Ther 2025; 24:15347354251319339. [PMID: 39985390 PMCID: PMC11847320 DOI: 10.1177/15347354251319339] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2024] [Revised: 01/12/2025] [Accepted: 01/23/2025] [Indexed: 02/24/2025] Open
Abstract
BACKGROUND Non-small cell lung cancer (NSCLC) exhibits low survival rates. Although immune checkpoint inhibitors (ICIs) have become first-line treatment for NSCLC, their limited response to ICI monotherapy has led to exploration of combination treatments. However, the high incidence of treatment-related adverse events associated with conventional drug combinations has highlighted the need for alternative herbal therapy. Bojungikki-tang (BJIKT), a traditional herbal medicine, has been used to treat gastrointestinal disorders and enhance immune function. Our preclinical studies have demonstrated that BJIKT combined with anti-PD-1 or anti-PD-L1 antibodies exhibits significant efficacy in suppressing tumor growth by modulating the immunosuppressive tumor microenvironment. Building on these preclinical findings, this study aims to evaluate the efficacy and safety of BJIKT with pembrolizumab combination therapy compared to pembrolizumab monotherapy in advanced NSCLC patients. METHODS 70 individuals with stage IV NSCLC scheduled for first-line pembrolizumab monotherapy will be randomly assigned to intervention or control groups. The primary outcome will be progression-free survival, with secondary outcomes including disease control rate, overall survival, and quality of life assessment. Adverse events will be monitored for safety. This study will explore the synergistic mechanism of combinatorial therapy using immune profiling and multi-omics analysis, and the possibility for personalized integrative therapy based on cold-heat syndrome differentiation (SD) types in East Asian medicine. DISCUSSION This study will provide novel evidence regarding survival outcomes, quality of life, and safety profiles of combined ICI and BJIKT therapy for advanced NSCLC. The exploratory data will contribute to tailoring treatments to immune-based SD types in NSCLC patients.
Collapse
MESH Headings
- Humans
- Carcinoma, Non-Small-Cell Lung/drug therapy
- Carcinoma, Non-Small-Cell Lung/pathology
- Antibodies, Monoclonal, Humanized/adverse effects
- Antibodies, Monoclonal, Humanized/therapeutic use
- Antibodies, Monoclonal, Humanized/pharmacology
- Antibodies, Monoclonal, Humanized/administration & dosage
- Lung Neoplasms/drug therapy
- Lung Neoplasms/pathology
- Drugs, Chinese Herbal/therapeutic use
- Drugs, Chinese Herbal/administration & dosage
- Male
- Female
- Middle Aged
- Neoplasm Staging
- Immune Checkpoint Inhibitors/adverse effects
- Immune Checkpoint Inhibitors/therapeutic use
- Immune Checkpoint Inhibitors/pharmacology
- Immune Checkpoint Inhibitors/administration & dosage
- Adult
- Aged
- Antineoplastic Combined Chemotherapy Protocols/therapeutic use
- Antineoplastic Combined Chemotherapy Protocols/pharmacology
- Antineoplastic Combined Chemotherapy Protocols/adverse effects
Collapse
Affiliation(s)
- Eunbyul Cho
- Korea Institute of Oriental Medicine, Daejeon, Republic of Korea
| | - Jin-Mu Yi
- Korea Institute of Oriental Medicine, Daejeon, Republic of Korea
| | - Jaemoo Chun
- Korea Institute of Oriental Medicine, Daejeon, Republic of Korea
| | - Ho Jang
- Korea Institute of Oriental Medicine, Daejeon, Republic of Korea
| | - Seong Hoon Yoon
- Pusan National University, Pusan National University Yangsan Hospital, Yangsan, Republic of Korea
| | | | - Seung Hun Jang
- Hallym University Sacred Heart Hospital, Anyang, Republic of Korea
| | - Dong Won Park
- Hanyang University College of Medicine, Seoul, Republic of Korea
| | - Seung Joon Kim
- The Catholic University of Korea, Seoul, Republic of Korea
| | - Sang-Won Um
- Sungkyunkwan University School of Medicine, Seoul, Republic of Korea
| | - Sung Yong Lee
- Korea University Guro Hospital, Seoul, Republic of Korea
| | - Mi-Kyung Jeong
- Korea Institute of Oriental Medicine, Daejeon, Republic of Korea
| |
Collapse
|
19
|
Zhao Q, Nooner KB, Tapert SF, Adeli E, Pohl KM, Kuceyeski A, Sabuncu MR. The Transition From Homogeneous to Heterogeneous Machine Learning in Neuropsychiatric Research. BIOLOGICAL PSYCHIATRY GLOBAL OPEN SCIENCE 2025; 5:100397. [PMID: 39526023 PMCID: PMC11546160 DOI: 10.1016/j.bpsgos.2024.100397] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2024] [Revised: 09/17/2024] [Accepted: 09/18/2024] [Indexed: 11/16/2024] Open
Abstract
Despite the advantage of neuroimaging-based machine learning (ML) models as pivotal tools for investigating brain-behavior relationships in neuropsychiatric studies, these data-driven predictive approaches have yet to yield substantial, clinically actionable insights for mental health care. A notable impediment lies in the inadequate accommodation of most ML research to the natural heterogeneity within large samples. Although commonly thought of as individual-level analyses, many ML algorithms are unimodal and homogeneous and thus incapable of capturing the potentially heterogeneous relationships between biology and psychopathology. We review the current landscape of computational research targeting population heterogeneity and argue that there is a need to expand from brain subtyping and behavioral phenotyping to analyses that focus on heterogeneity at the relational level. To this end, we review and suggest several existing ML models with the capacity to discern how external environmental and sociodemographic factors moderate the brain-behavior mapping function in a data-driven fashion. These heterogeneous ML models hold promise for enhancing the discovery of individualized brain-behavior associations and advancing precision psychiatry.
Collapse
Affiliation(s)
- Qingyu Zhao
- Department of Radiology, Weill Cornell Medicine, New York, New York
| | - Kate B. Nooner
- Department of Psychology, University of North Carolina Wilmington, Wilmington, North Carolina
| | - Susan F. Tapert
- Department of Psychiatry, University of California San Diego, La Jolla, California
| | - Ehsan Adeli
- Department of Psychiatry & Behavioral Sciences, Stanford University, Stanford, California
- Department of Computer Science, Stanford University, Stanford, California
| | - Kilian M. Pohl
- Department of Psychiatry & Behavioral Sciences, Stanford University, Stanford, California
- Department of Electrical Engineering, Stanford University, Stanford, California
| | - Amy Kuceyeski
- Department of Radiology, Weill Cornell Medicine, New York, New York
| | - Mert R. Sabuncu
- Department of Radiology, Weill Cornell Medicine, New York, New York
- School of Electrical and Computer Engineering, Cornell University and Cornell Tech, New York, New York
| |
Collapse
|
20
|
Zhang Y, Sun K, Liu Y, Xie F, Guo Q, Shen D. A Modality-Flexible Framework for Alzheimer's Disease Diagnosis Following Clinical Routine. IEEE J Biomed Health Inform 2025; 29:535-546. [PMID: 39352829 DOI: 10.1109/jbhi.2024.3472011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/04/2024]
Abstract
Dementia has high incidence among the elderly, and Alzheimer's disease (AD) is the most common dementia. The procedure of AD diagnosis in clinics usually follows a standard routine consisting of different phases, from acquiring non-imaging tabular data in the screening phase to MR imaging and ultimately to PET imaging. Most of the existing AD diagnosis studies are dedicated to a specific phase using either single or multi-modal data. In this paper, we introduce a modality-flexible classification framework, which is applicable for different AD diagnosis phases following the clinical routine. Specifically, our framework consists of three branches corresponding to three diagnosis phases: 1) a tabular branch using only tabular data for screening phase, 2) an MRI branch using both MRI and tabular data for uncertain cases in screening phase, and 3) ultimately a PET branch for the challenging cases using all the modalities including PET, MRI, and tabular data. To achieve effective fusion of imaging and non-imaging modalities, we introduce an image-tabular transformer block to adaptively scale and shift the image and tabular features according to modality importance determined by the network. The proposed framework is extensively validated on four cohorts containing 6495 subjects. Experiments demonstrate that our framework achieves superior diagnostic performance than the other representative methods across various AD diagnosis tasks, and shows promising performance for all the diagnosis phases, which exhibits great potential for clinical application.
Collapse
|
21
|
Mohamud KA, Elzubair Eltahir SA, Ahmed Alhardalo HA, Albashir HB, Ali Mohamed Zain NQA, Abdelrahman Ibrahim ME, Ahmed Fadlallah EN. The Role of Machine Learning Models in Predicting Cirrhosis Mortality: A Systematic Review. Cureus 2025; 17:e78155. [PMID: 40026938 PMCID: PMC11867977 DOI: 10.7759/cureus.78155] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/28/2025] [Indexed: 03/05/2025] Open
Abstract
Liver cirrhosis affects millions of individuals worldwide and is one of the primary causes of mortality. Early mortality prediction for cirrhosis patients may increase the possibility for medical professionals to treat the illness successfully. This study assesses the ability of machine learning (ML) models to predict cirrhosis mortality. We followed Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines to search for relevant literature across four different databases. We found 379 studies of which 10 were eligible for inclusion in the current study. We analyzed 10 retrospective studies that showed that ML models outperformed conventional scores in predicting the death rate from end-stage liver disease (ESLD). Interestingly, models that used more parameters, such as patient demographics and extensive laboratory testing, exhibited higher prediction accuracy. With an area under the receiver operating characteristic (AUROC) ranging from 0.71 to 0.96, ML models showed consistently significant gains over traditional prognostic ratings. This review emphasizes how ML models might improve ESLD patient death prediction. Because machine learning models are more accurate than conventional approaches, it is important to incorporate data-driven informatics technologies into clinical settings. Additional validation and openness are required to guarantee model dependability and interpretability before ML may be used in clinical practice. The goal of future research should be to create reliable, interpretable models that may be used successfully in a variety of clinical contexts, enhancing ESLD patient treatment and results.
Collapse
Affiliation(s)
| | | | - Hind AbdAlla Ahmed Alhardalo
- Department of General Medicine, Abu Dhabi Health Services Company (SEHA) - Salma Rehabilitation Hospital, Abu Dhabi, ARE
| | - Hadel Bakhet Albashir
- Department of General Medicine, Abu Dhabi Health Services Company (SEHA) - Salma Rehabilitation Hospital, Abu Dhabi, ARE
| | | | | | | |
Collapse
|
22
|
Xu Z, Scharp D, Hobensack M, Ye J, Zou J, Ding S, Shang J, Topaz M. Machine learning-based infection diagnostic and prognostic models in post-acute care settings: a systematic review. J Am Med Inform Assoc 2025; 32:241-252. [PMID: 39530740 PMCID: PMC11648729 DOI: 10.1093/jamia/ocae278] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2024] [Revised: 10/10/2024] [Accepted: 10/21/2024] [Indexed: 11/16/2024] Open
Abstract
OBJECTIVES This study aims to (1) review machine learning (ML)-based models for early infection diagnostic and prognosis prediction in post-acute care (PAC) settings, (2) identify key risk predictors influencing infection-related outcomes, and (3) examine the quality and limitations of these models. MATERIALS AND METHODS PubMed, Web of Science, Scopus, IEEE Xplore, CINAHL, and ACM digital library were searched in February 2024. Eligible studies leveraged PAC data to develop and evaluate ML models for infection-related risks. Data extraction followed the CHARMS checklist. Quality appraisal followed the PROBAST tool. Data synthesis was guided by the socio-ecological conceptual framework. RESULTS Thirteen studies were included, mainly focusing on respiratory infections and nursing homes. Most used regression models with structured electronic health record data. Since 2020, there has been a shift toward advanced ML algorithms and multimodal data, biosensors, and clinical notes being significant sources of unstructured data. Despite these advances, there is insufficient evidence to support performance improvements over traditional models. Individual-level risk predictors, like impaired cognition, declined function, and tachycardia, were commonly used, while contextual-level predictors were barely utilized, consequently limiting model fairness. Major sources of bias included lack of external validation, inadequate model calibration, and insufficient consideration of data complexity. DISCUSSION AND CONCLUSION Despite the growth of advanced modeling approaches in infection-related models in PAC settings, evidence supporting their superiority remains limited. Future research should leverage a socio-ecological lens for predictor selection and model construction, exploring optimal data modalities and ML model usage in PAC, while ensuring rigorous methodologies and fairness considerations.
Collapse
Affiliation(s)
- Zidu Xu
- School of Nursing, Columbia University, New York, NY 10032,
United States
| | - Danielle Scharp
- School of Nursing, Columbia University, New York, NY 10032,
United States
| | - Mollie Hobensack
- Icahn School of Medicine at Mount Sinai, New York, NY 10029,
United States
| | - Jiancheng Ye
- Weill Cornell Medicine, Cornell University, New York, NY
10065, United States
| | - Jungang Zou
- Department of Biostatistics, Mailman School of Public Health, Columbia
University, New York, NY 10032, United
States
| | - Sirui Ding
- Bakar Computational Health Sciences Institute, University of
California, San Francisco, CA 94158, United
States
| | - Jingjing Shang
- School of Nursing, Columbia University, New York, NY 10032,
United States
| | - Maxim Topaz
- School of Nursing, Columbia University, New York, NY 10032,
United States
- Center for Home Care Policy & Research, VNS Health, New
York, NY 10001, United States
- Data Science Institute, Columbia University, New York, NY
10027, United States
| |
Collapse
|
23
|
Sun CL. Breaking Barriers in Cardiovascular and Cancer Care With Multimodal AI. JACC. ADVANCES 2025; 4:101433. [PMID: 39759438 PMCID: PMC11697234 DOI: 10.1016/j.jacadv.2024.101433] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/07/2025]
Affiliation(s)
- Christopher L.F. Sun
- Telfer School of Management, University of Ottawa, Ottawa, Ontario, Canada
- University of Ottawa Heart Institute, University of Ottawa, Ottawa, Ontario, Canada
| |
Collapse
|
24
|
Kumar S, Rani S, Sharma S, Min H. Multimodality Fusion Aspects of Medical Diagnosis: A Comprehensive Review. Bioengineering (Basel) 2024; 11:1233. [PMID: 39768051 PMCID: PMC11672922 DOI: 10.3390/bioengineering11121233] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2024] [Revised: 11/28/2024] [Accepted: 11/30/2024] [Indexed: 01/11/2025] Open
Abstract
Utilizing information from multiple sources is a preferred and more precise method for medical experts to confirm a diagnosis. Each source provides critical information about the disease that might otherwise be absent in other modalities. Combining information from various medical sources boosts confidence in the diagnosis process, enabling the creation of an effective treatment plan for the patient. The scarcity of medical experts to diagnose diseases motivates the development of automatic diagnoses relying on multimodal data. With the progress in artificial intelligence technology, automated diagnosis using multimodal fusion techniques is now possible. Nevertheless, the concept of multimodal medical diagnosis is still new and requires an understanding of the diverse aspects of multimodal data and its related challenges. This review article examines the various aspects of multimodal medical diagnosis to equip readers, academicians, and researchers with necessary knowledge to advance multimodal medical research. The chosen articles in the study underwent thorough screening from reputable journals and publishers to offer high-quality content to readers, who can then apply the knowledge to produce quality research. Besides, the need for multimodal information and the associated challenges are discussed with solutions. Additionally, ethical issues of using artificial intelligence in medical diagnosis is also discussed.
Collapse
Affiliation(s)
- Sachin Kumar
- Akian College of Science and Engineering, American University of Armenia, Yerevan 0019, Armenia
| | - Sita Rani
- Department of Computer Science and Engineering, Guru Nanak Dev Engineering College, Ludhiana 141006, India;
| | - Shivani Sharma
- Department of Computer Science and Engineering, Thapar Institute of Engineering and Technology, Patiala 147004, India;
| | - Hong Min
- School of Computing, Gachon University, Seongnam 13120, Republic of Korea
| |
Collapse
|
25
|
Gonzalez R, Saha A, Campbell CJ, Nejat P, Lokker C, Norgan AP. Seeing the random forest through the decision trees. Supporting learning health systems from histopathology with machine learning models: Challenges and opportunities. J Pathol Inform 2024; 15:100347. [PMID: 38162950 PMCID: PMC10755052 DOI: 10.1016/j.jpi.2023.100347] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Revised: 10/06/2023] [Accepted: 11/01/2023] [Indexed: 01/03/2024] Open
Abstract
This paper discusses some overlooked challenges faced when working with machine learning models for histopathology and presents a novel opportunity to support "Learning Health Systems" with them. Initially, the authors elaborate on these challenges after separating them according to their mitigation strategies: those that need innovative approaches, time, or future technological capabilities and those that require a conceptual reappraisal from a critical perspective. Then, a novel opportunity to support "Learning Health Systems" by integrating hidden information extracted by ML models from digitalized histopathology slides with other healthcare big data is presented.
Collapse
Affiliation(s)
- Ricardo Gonzalez
- DeGroote School of Business, McMaster University, Hamilton, Ontario, Canada
- Division of Computational Pathology and Artificial Intelligence, Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, United States
| | - Ashirbani Saha
- Department of Oncology, Faculty of Health Sciences, McMaster University, Hamilton, Ontario, Canada
- Escarpment Cancer Research Institute, McMaster University and Hamilton Health Sciences, Hamilton, Ontario, Canada
| | - Clinton J.V. Campbell
- William Osler Health System, Brampton, Ontario, Canada
- Department of Pathology and Molecular Medicine, Faculty of Health Sciences, McMaster University, Hamilton, Ontario, Canada
| | - Peyman Nejat
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN, United States
| | - Cynthia Lokker
- Health Information Research Unit, Department of Health Research Methods, Evidence and Impact, McMaster University, Hamilton, Ontario, Canada
| | - Andrew P. Norgan
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, United States
| |
Collapse
|
26
|
Ragni F, Bovo S, Zen A, Sona D, De Nadai K, Adamo GG, Pellegrini M, Nasini F, Vivarelli C, Tavolato M, Mura M, Parmeggiani F, Jurman G. Session-by-Session Prediction of Anti-Endothelial Growth Factor Injection Needs in Neovascular Age-Related Macular Degeneration Using Optical-Coherence-Tomography-Derived Features and Machine Learning. Diagnostics (Basel) 2024; 14:2609. [PMID: 39682518 DOI: 10.3390/diagnostics14232609] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2024] [Revised: 11/15/2024] [Accepted: 11/19/2024] [Indexed: 12/18/2024] Open
Abstract
BACKGROUND/OBJECTIVES Neovascular age-related macular degeneration (nAMD) is a retinal disorder leading to irreversible central vision loss. The pro-re-nata (PRN) treatment for nAMD involves frequent intravitreal injections of anti-VEGF medications, placing a burden on patients and healthcare systems. Predicting injections needs at each monitoring session could optimize treatment outcomes and reduce unnecessary interventions. METHODS To achieve these aims, machine learning (ML) models were evaluated using different combinations of clinical variables, including retinal thickness and volume, best-corrected visual acuity, and features derived from macular optical coherence tomography (OCT). A "Leave Some Subjects Out" (LSSO) nested cross-validation approach ensured robust evaluation. Moreover, the SHapley Additive exPlanations (SHAP) analysis was employed to quantify the contribution of each feature to model predictions. RESULTS Results demonstrated that models incorporating both structural and functional features achieved high classification accuracy in predicting injection necessity (AUC = 0.747 ± 0.046, MCC = 0.541 ± 0.073). Moreover, the explainability analysis identified as key predictors both subretinal and intraretinal fluid, alongside central retinal thickness. CONCLUSIONS These findings suggest that session-by-session prediction of injection needs in nAMD patients is feasible, even without processing the entire OCT image. The proposed ML framework has the potential to be integrated into routine clinical workflows, thereby optimizing nAMD therapeutic management.
Collapse
Affiliation(s)
- Flavio Ragni
- Data Science for Health Unit, Fondazione Bruno Kessler, 38123 Trento, Italy
| | - Stefano Bovo
- Data Science for Health Unit, Fondazione Bruno Kessler, 38123 Trento, Italy
| | - Andrea Zen
- Data Science for Health Unit, Fondazione Bruno Kessler, 38123 Trento, Italy
| | - Diego Sona
- Data Science for Health Unit, Fondazione Bruno Kessler, 38123 Trento, Italy
| | - Katia De Nadai
- Department of Translational Medicine and for Romagna, University of Ferrara, 44121 Ferrara, Italy
- ERN-EYE Network-Center Retinitis Pigmentosa of Veneto Region, Camposampiero Hospital, 35012 Padua, Italy
| | - Ginevra Giovanna Adamo
- Department of Translational Medicine and for Romagna, University of Ferrara, 44121 Ferrara, Italy
- Unit of Ophthalmology, Azienda Ospedaliero Universitaria di Ferrara, 44100 Ferrara, Italy
| | - Marco Pellegrini
- Department of Translational Medicine and for Romagna, University of Ferrara, 44121 Ferrara, Italy
- Unit of Ophthalmology, Azienda Ospedaliero Universitaria di Ferrara, 44100 Ferrara, Italy
| | - Francesco Nasini
- Unit of Ophthalmology, Azienda Ospedaliero Universitaria di Ferrara, 44100 Ferrara, Italy
| | - Chiara Vivarelli
- Department of Translational Medicine and for Romagna, University of Ferrara, 44121 Ferrara, Italy
| | - Marco Tavolato
- ERN-EYE Network-Center Retinitis Pigmentosa of Veneto Region, Camposampiero Hospital, 35012 Padua, Italy
- Unit of Ophthalmology, Azienda ULSS Euganea di Padova, 35131 Padova, Italy
| | - Marco Mura
- Department of Translational Medicine and for Romagna, University of Ferrara, 44121 Ferrara, Italy
- King Khaled Eye Specialist Hospital, Riyadh 12211, Saudi Arabia
| | - Francesco Parmeggiani
- Department of Translational Medicine and for Romagna, University of Ferrara, 44121 Ferrara, Italy
- ERN-EYE Network-Center Retinitis Pigmentosa of Veneto Region, Camposampiero Hospital, 35012 Padua, Italy
| | - Giuseppe Jurman
- Data Science for Health Unit, Fondazione Bruno Kessler, 38123 Trento, Italy
| |
Collapse
|
27
|
Phillips E, O'Donoghue O, Zhang Y, Tsimpos P, Mallinger LA, Chatzidakis S, Pohlmann J, Du Y, Kim I, Song J, Brush B, Smirnakis S, Ong CJ, Orfanoudaki A. HELMET: A Hybrid Machine Learning Framework for Real-Time Prediction of Edema Trajectory in Large Middle Cerebral Artery Stroke. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.11.13.24317229. [PMID: 39606388 PMCID: PMC11601687 DOI: 10.1101/2024.11.13.24317229] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/29/2024]
Abstract
Malignant cerebral edema occurs when brain swelling displaces and compresses vital midline structures within the first week of a large middle cerebral artery stroke. Early interventions such as hyperosmolar therapy or surgical decompression may reverse secondary injury but must be administered judiciously. To optimize treatment and reduce secondary damage, clinicians need strategies to frequently and quantitatively assess the trajectory of edema using updated, relevant information. However, existing risk assessment tools are limited by the absence of structured records capturing the evolution of edema and typically estimate risk at a single time point early in the admission, therefore failing to account for changes in variables over the following hours or days. To address this, we developed and validated dynamic machine learning models capable of accurately predicting the severity of midline structure displacement, an established indicator of malignant edema, in real-time. Our models can provide updated estimations as frequently as every hour, using data from structured time-varying patient records, radiographic text, and human-curated neurological characteristics. Our work resulted in two novel multi-class classification models, collectively named Hybrid Ensemble Learning Models for Edema Trajectory (HELMET), predicting the progression of midline shift over 8-hour (HELMET-8) and 24-hour windows (HELMET-24), respectively. HELMET combines transformer-based large language models with supervised ensemble learning, demonstrating the value of merging human expertise and multimodal health records in developing clinical risk scores. Both models were trained on a retrospective cohort of 15,696 observations from 623 patients hospitalized with large middle cerebral artery ischemic stroke and were externally validated using 3,713 observations from 60 patients at a separate hospital system. Our HELMET models are accurate and generalize effectively to diverse populations, achieving a cross-validated mean area under the receiver operating characteristic score of 96.6% in the derivation cohort and 92.5% in the external validation cohort. Moreover, our approach provides a framework for developing hybrid risk prediction models that integrate both human-extracted and algorithm-derived multi-modal inputs. Our work enables accurate estimation of complex, dynamic, and highly specific clinical targets, such as midline shift, in real-time, even when relevant structured information is limited in electronic health record databases.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Yili Du
- Boston University School of Public Health
| | - Ivy Kim
- Boston Medical Center, Department of Neurology
| | - Jonathan Song
- Boston University Chobanian & Avedisian School of Medicine
| | | | - Stelios Smirnakis
- Brigham & Women's Hospital, Department of Neurology
- Harvard Medical School
- Jamaica Plain Veterans Administration Hospital, Department of Neurology
| | - Charlene J Ong
- Boston Medical Center, Department of Neurology
- Boston University Chobanian & Avedisian School of Medicine
| | | |
Collapse
|
28
|
Yammouri G, Ait Lahcen A. AI-Reinforced Wearable Sensors and Intelligent Point-of-Care Tests. J Pers Med 2024; 14:1088. [PMID: 39590580 PMCID: PMC11595538 DOI: 10.3390/jpm14111088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2024] [Revised: 10/25/2024] [Accepted: 10/28/2024] [Indexed: 11/28/2024] Open
Abstract
Artificial intelligence (AI) techniques offer great potential to advance point-of-care testing (POCT) and wearable sensors for personalized medicine applications. This review explores the recent advances and the transformative potential of the use of AI in improving wearables and POCT. The integration of AI significantly contributes to empowering these tools and enables continuous monitoring, real-time analysis, and rapid diagnostics, thus enhancing patient outcomes and healthcare efficiency. Wearable sensors powered by AI models offer tremendous opportunities for precise and non-invasive tracking of physiological conditions that are essential for early disease detection and personalized treatments. AI-empowered POCT facilitates rapid, accurate diagnostics, making these medical testing kits accessible and available even in resource-limited settings. This review discusses the key advances in AI applications for data processing, sensor fusion, and multivariate analytics, highlighting case examples that exhibit their impact in different medical scenarios. In addition, the challenges associated with data privacy, regulatory approvals, and technology integrations into the existing healthcare system have been overviewed. The outlook emphasizes the urgent need for continued innovation in AI-driven health technologies to overcome these challenges and to fully achieve the potential of these techniques to revolutionize personalized medicine.
Collapse
Affiliation(s)
- Ghita Yammouri
- Chemical Analysis & Biosensors, Process Engineering and Environment Laboratory, Faculty of Science and Techniques, Hassan II University of Casablanca, Mohammedia 28806, Morocco;
| | | |
Collapse
|
29
|
Benito GV, Goldberg X, Brachowicz N, Castaño-Vinyals G, Blay N, Espinosa A, Davidhi F, Torres D, Kogevinas M, de Cid R, Petrone P. Machine learning for anxiety and depression profiling and risk assessment in the aftermath of an emergency. Artif Intell Med 2024; 157:102991. [PMID: 39383706 DOI: 10.1016/j.artmed.2024.102991] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2024] [Revised: 09/23/2024] [Accepted: 09/26/2024] [Indexed: 10/11/2024]
Abstract
BACKGROUND & OBJECTIVES Mental health disorders pose an increasing public health challenge worsened by the COVID-19 pandemic. The pandemic highlighted gaps in preparedness, emphasizing the need for early identification of at-risk groups and targeted interventions. This study aims to develop a risk assessment tool for anxiety, depression, and self-perceived stress using machine learning (ML) and explainable AI to identify key risk factors and stratify the population into meaningful risk profiles. METHODS We utilized a cohort of 9291 individuals from Northern Spain, with extensive post-COVID-19 mental health surveys. ML classification algorithms predicted depression, anxiety, and self-reported stress in three classes: healthy, mild, and severe outcomes. A novel combination of SHAP (SHapley Additive exPlanations) and UMAP (Uniform Manifold Approximation and Projection) was employed to interpret model predictions and facilitate the identification of high-risk phenotypic clusters. RESULTS The mean macro-averaged one-vs-one AUROC was 0.77 (± 0.01) for depression, 0.72 (± 0.01) for anxiety, and 0.73 (± 0.02) for self-perceived stress. Key risk factors included poor self-reported health, chronic mental health conditions, and poor social support. High-risk profiles, such as women with reduced sleep hours, were identified for self-perceived stress. Binary classification of healthy vs. at-risk classes yielded F1-Scores over 0.70. CONCLUSIONS Combining SHAP with UMAP for risk profile stratification offers valuable insights for developing effective interventions and shaping public health policies. This data-driven approach to mental health preparedness, when validated in real-world scenarios, can significantly address the mental health impact of public health crises like COVID-19.
Collapse
Affiliation(s)
- Guillermo Villanueva Benito
- Barcelona Institute for Global Health (ISGlobal), C/ del Dr. Aiguader, 88, Barcelona 08003, Catalonia, Spain; Universitat Pompeu Fabra (UPF), Spain
| | - Ximena Goldberg
- Barcelona Institute for Global Health (ISGlobal), C/ del Dr. Aiguader, 88, Barcelona 08003, Catalonia, Spain
| | - Nicolai Brachowicz
- Barcelona Institute for Global Health (ISGlobal), C/ del Dr. Aiguader, 88, Barcelona 08003, Catalonia, Spain
| | - Gemma Castaño-Vinyals
- Barcelona Institute for Global Health (ISGlobal), C/ del Dr. Aiguader, 88, Barcelona 08003, Catalonia, Spain; Universitat Pompeu Fabra (UPF), Spain; CIBER de Epidemiología y Salud Pública (CIBERESP), Spain
| | - Natalia Blay
- Genomes for Life-GCAT lab. CORE program. Germans Trias I Pujol Research Institute (IGTP), Camí de les Escoles, s/n, Badalona 08916, Catalonia, Spain
| | - Ana Espinosa
- Barcelona Institute for Global Health (ISGlobal), C/ del Dr. Aiguader, 88, Barcelona 08003, Catalonia, Spain
| | - Flavia Davidhi
- Barcelona Institute for Global Health (ISGlobal), C/ del Dr. Aiguader, 88, Barcelona 08003, Catalonia, Spain
| | - Diego Torres
- Barcelona Institute for Global Health (ISGlobal), C/ del Dr. Aiguader, 88, Barcelona 08003, Catalonia, Spain
| | - Manolis Kogevinas
- Barcelona Institute for Global Health (ISGlobal), C/ del Dr. Aiguader, 88, Barcelona 08003, Catalonia, Spain
| | - Rafael de Cid
- Genomes for Life-GCAT lab. CORE program. Germans Trias I Pujol Research Institute (IGTP), Camí de les Escoles, s/n, Badalona 08916, Catalonia, Spain
| | - Paula Petrone
- Barcelona Institute for Global Health (ISGlobal), C/ del Dr. Aiguader, 88, Barcelona 08003, Catalonia, Spain.
| |
Collapse
|
30
|
Baharoon M, Almatar H, Alduhayan R, Aldebasi T, Alahmadi B, Bokhari Y, Alawad M, Almazroa A, Aljouie A. HyMNet: A Multimodal Deep Learning System for Hypertension Prediction Using Fundus Images and Cardiometabolic Risk Factors. Bioengineering (Basel) 2024; 11:1080. [PMID: 39593740 PMCID: PMC11591283 DOI: 10.3390/bioengineering11111080] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2024] [Revised: 10/22/2024] [Accepted: 10/22/2024] [Indexed: 11/28/2024] Open
Abstract
STUDY OBJECTIVES This study aimed to develop a multimodal deep learning (MMDL) system called HyMNet, integrating fundus images and cardiometabolic factors (age and sex) to enhance hypertension (HTN) detection. METHODS HyMNet employed RETFound, a model pretrained on 1.6 million retinal images, for the fundus data, in conjunction with a fully connected neural network for age and sex. The two pathways were jointly trained by joining their feature vectors into a fusion network. The system was trained on 5016 retinal images from 1243 individuals provided by the Saudi Ministry of National Guard Health Affairs. The influence of diabetes on HTN detection was also assessed. RESULTS HyMNet surpassed the unimodal system, achieving an F1 score of 0.771 compared to 0.745 for the unimodal model. For diabetic patients, the F1 score was 0.796, while it was 0.466 for non-diabetic patients. CONCLUSIONS HyMNet exhibited superior performance relative to unimodal approaches, with an F1 score of 0.771 for HyMNet compared to 0.752 for models trained on demographic data alone, underscoring the advantages of MMDL systems in HTN detection. The findings indicate that diabetes significantly impacts HTN prediction, enhancing detection accuracy among diabetic patients. Utilizing MMDL with diverse data sources could improve clinical applicability and generalization.
Collapse
Affiliation(s)
- Mohammed Baharoon
- AI and Bioinformatics Department, King Abdullah International Medical Research Center, King Saud bin Abdulaziz University for Health Sciences (KSAU-HS), Riyadh 11481, Saudi Arabia; (M.B.); (H.A.); (R.A.); (Y.B.)
- Data Management Department, King Abdullah International Medical Research Center, King Saud bin Abdulaziz University for Health Sciences (KSAU-HS), Riyadh 11481, Saudi Arabia
| | - Hessa Almatar
- AI and Bioinformatics Department, King Abdullah International Medical Research Center, King Saud bin Abdulaziz University for Health Sciences (KSAU-HS), Riyadh 11481, Saudi Arabia; (M.B.); (H.A.); (R.A.); (Y.B.)
| | - Reema Alduhayan
- AI and Bioinformatics Department, King Abdullah International Medical Research Center, King Saud bin Abdulaziz University for Health Sciences (KSAU-HS), Riyadh 11481, Saudi Arabia; (M.B.); (H.A.); (R.A.); (Y.B.)
| | - Tariq Aldebasi
- Ophthalmology Department, King Abdulaziz Medical City, Ministry of National Guard Health Affairs, Riyadh 14611, Saudi Arabia;
| | - Badr Alahmadi
- Ophthalmology Department, Prince Mohammad bin Abdulaziz Hospital, Ministry of National Guard Health Affairs, Al Madinah 42324, Saudi Arabia;
| | - Yahya Bokhari
- AI and Bioinformatics Department, King Abdullah International Medical Research Center, King Saud bin Abdulaziz University for Health Sciences (KSAU-HS), Riyadh 11481, Saudi Arabia; (M.B.); (H.A.); (R.A.); (Y.B.)
- College of Public Health and Health Informatics, King Saud bin Abdulaziz University for Health Sciences (KSAU-HS), Riyadh 14815, Saudi Arabia
| | - Mohammed Alawad
- National Center for Artificial Intelligence (NCAI), Saudi Data and Artificial Intelligence Authority (SDAIA), Riyadh 12382, Saudi Arabia;
| | - Ahmed Almazroa
- AI and Bioinformatics Department, King Abdullah International Medical Research Center, King Saud bin Abdulaziz University for Health Sciences (KSAU-HS), Riyadh 11481, Saudi Arabia; (M.B.); (H.A.); (R.A.); (Y.B.)
| | - Abdulrhman Aljouie
- AI and Bioinformatics Department, King Abdullah International Medical Research Center, King Saud bin Abdulaziz University for Health Sciences (KSAU-HS), Riyadh 11481, Saudi Arabia; (M.B.); (H.A.); (R.A.); (Y.B.)
- Data Management Department, King Abdullah International Medical Research Center, King Saud bin Abdulaziz University for Health Sciences (KSAU-HS), Riyadh 11481, Saudi Arabia
- College of Public Health and Health Informatics, King Saud bin Abdulaziz University for Health Sciences (KSAU-HS), Riyadh 14815, Saudi Arabia
| |
Collapse
|
31
|
Natraj S, Kojovic N, Maillart T, Schaer M. Video-audio neural network ensemble for comprehensive screening of autism spectrum disorder in young children. PLoS One 2024; 19:e0308388. [PMID: 39361665 PMCID: PMC11449333 DOI: 10.1371/journal.pone.0308388] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2024] [Accepted: 07/12/2024] [Indexed: 10/05/2024] Open
Abstract
A timely diagnosis of autism is paramount to allow early therapeutic intervention in preschoolers. Deep Learning tools have been increasingly used to identify specific autistic symptoms. But they also offer opportunities for broad automated detection of autism at an early age. Here, we leverage a multi-modal approach by combining two neural networks trained on video and audio features of semi-standardized social interactions in a sample of 160 children aged 1 to 5 years old. Our ensemble model performs with an accuracy of 82.5% (F1 score: 0.816, Precision: 0.775, Recall: 0.861) for screening Autism Spectrum Disorders (ASD). Additional combinations of our model were developed to achieve higher specificity (92.5%, i.e., few false negatives) or sensitivity (90%, i.e. few false positives). Finally, we found a relationship between the neural network modalities and specific audio versus video ASD characteristics, bringing evidence that our neural network implementation was effective in taking into account different features that are currently standardized under the gold standard ASD assessment.
Collapse
Affiliation(s)
- Shreyasvi Natraj
- Psychiatry Department, Faculty of Medicine, University of Geneva, Geneva, Switzerland
| | - Nada Kojovic
- Psychiatry Department, Faculty of Medicine, University of Geneva, Geneva, Switzerland
| | - Thomas Maillart
- Psychiatry Department, Faculty of Medicine, University of Geneva, Geneva, Switzerland
- Geneva School of Economics and Management, University of Geneva, Geneva, Switzerland
| | - Marie Schaer
- Psychiatry Department, Faculty of Medicine, University of Geneva, Geneva, Switzerland
| |
Collapse
|
32
|
Nedadur R, Bhatt N, Liu T, Chu MWA, McCarthy PM, Kline A. The Emerging and Important Role of Artificial Intelligence in Cardiac Surgery. Can J Cardiol 2024; 40:1865-1879. [PMID: 39098601 DOI: 10.1016/j.cjca.2024.07.027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2024] [Revised: 07/29/2024] [Accepted: 07/29/2024] [Indexed: 08/06/2024] Open
Abstract
Artificial Intelligence (AI) has greatly affected our everyday lives and holds great promise to change the landscape of medicine. AI is particularly positioned to improve care for the increasingly complex patients undergoing cardiac surgery using the immense amount of data generated in the course of their care. When deployed, AI can be used to analyze this information at the patient's bedside more expediently and accurately, all while providing new insights. This review summarizes the current applications of AI in cardiac surgery from the vantage point of a patient's journey. Applications of AI include preoperative risk assessment, intraoperative planning, postoperative patient care, and outpatient telemonitoring, encompassing the spectrum of cardiac surgical care. Offloading of administrative processes and enhanced experience with information gathering also represent a unique and under-represented avenue for future use of AI. As clinicians, understanding the nomenclature and applications of AI is important to contextualize issues, to ensure problem-driven solutions, and for clinical benefit. Precision medicine, and thus clinically relevant AI, remains dependent on data curation and warehousing to gather insights from large multicentre repositories while treating privacy with the utmost importance. AI tasks should not be siloed but rather holistically integrated into clinical workflow to retain context and relevance. As cardiac surgeons, AI allows us to look forward to a bright future of more efficient use of our clinical expertise toward high-level decision making and technical prowess.
Collapse
Affiliation(s)
- Rashmi Nedadur
- Feinberg School of Medicine, Division of Cardiac Surgery, Northwestern University, Chicago, Illinois, USA; Center for Artificial Intelligence, Bluhm Cardiovascular Institute, Northwestern Medicine, Chicago, Illinois, USA.
| | - Nitish Bhatt
- Peter Munk Cardiac Center, Toronto General Hospital, Toronto, Ontario, Canada
| | - Tom Liu
- Feinberg School of Medicine, Division of Cardiac Surgery, Northwestern University, Chicago, Illinois, USA; Center for Artificial Intelligence, Bluhm Cardiovascular Institute, Northwestern Medicine, Chicago, Illinois, USA
| | | | - Patrick M McCarthy
- Feinberg School of Medicine, Division of Cardiac Surgery, Northwestern University, Chicago, Illinois, USA; Center for Artificial Intelligence, Bluhm Cardiovascular Institute, Northwestern Medicine, Chicago, Illinois, USA
| | - Adrienne Kline
- Feinberg School of Medicine, Division of Cardiac Surgery, Northwestern University, Chicago, Illinois, USA; Center for Artificial Intelligence, Bluhm Cardiovascular Institute, Northwestern Medicine, Chicago, Illinois, USA
| |
Collapse
|
33
|
Chinni BK, Manlhiot C. Emerging Analytical Approaches for Personalized Medicine Using Machine Learning In Pediatric and Congenital Heart Disease. Can J Cardiol 2024; 40:1880-1896. [PMID: 39097187 DOI: 10.1016/j.cjca.2024.07.026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2024] [Revised: 07/25/2024] [Accepted: 07/26/2024] [Indexed: 08/05/2024] Open
Abstract
Precision and personalized medicine, the process by which patient management is tailored to individual circumstances, are now terms that are familiar to cardiologists, despite it still being an emerging field. Although precision medicine relies most often on the underlying biology and pathophysiology of a patient's condition, personalized medicine relies on digital biomarkers generated through algorithms. Given the complexity of the underlying data, these digital biomarkers are most often generated through machine-learning algorithms. There are a number of analytic considerations regarding the creation of digital biomarkers that are discussed in this review, including data preprocessing, time dependency and gating, dimensionality reduction, and novel methods, both in the realm of supervised and unsupervised machine learning. Some of these considerations, such as sample size requirements and measurements of model performance, are particularly challenging in small and heterogeneous populations with rare outcomes such as children with congenital heart disease. Finally, we review analytic considerations for the deployment of digital biomarkers in clinical settings, including the emerging field of clinical artificial intelligence (AI) operations, computational needs for deployment, efforts to increase the explainability of AI, algorithmic drift, and the needs for distributed surveillance and federated learning. We conclude this review by discussing a recent simulation study that shows that, despite these analytic challenges and complications, the use of digital biomarkers in managing clinical care might have substantial benefits regarding individual patient outcomes.
Collapse
Affiliation(s)
- Bhargava K Chinni
- The Blalock-Taussig-Thomas Pediatric and Congenital Heart Center, Department of Pediatrics, Johns Hopkins School of Medicine, Johns Hopkins University, Baltimore, Maryland, USA
| | - Cedric Manlhiot
- The Blalock-Taussig-Thomas Pediatric and Congenital Heart Center, Department of Pediatrics, Johns Hopkins School of Medicine, Johns Hopkins University, Baltimore, Maryland, USA; Research Institute, SickKids Hospital, Department of Pediatrics, University of Toronto, Toronto, Ontario, Canada.
| |
Collapse
|
34
|
Shurrab S, Guerra-Manzanares A, E Shamout F. Multimodal masked siamese network improves chest X-ray representation learning. Sci Rep 2024; 14:22516. [PMID: 39341871 PMCID: PMC11439023 DOI: 10.1038/s41598-024-74043-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2024] [Accepted: 09/23/2024] [Indexed: 10/01/2024] Open
Abstract
Self-supervised learning methods for medical images primarily rely on the imaging modality during pretraining. Although such approaches deliver promising results, they do not take advantage of the associated patient or scan information collected within Electronic Health Records (EHR). This study aims to develop a multimodal pretraining approach for chest radiographs that considers EHR data incorporation as an additional modality that during training. We propose to incorporate EHR data during self-supervised pretraining with a Masked Siamese Network (MSN) to enhance the quality of chest radiograph representations. We investigate three types of EHR data, including demographic, scan metadata, and inpatient stay information. We evaluate the multimodal MSN on three publicly available chest X-ray datasets, MIMIC-CXR, CheXpert, and NIH-14, using two vision transformer (ViT) backbones, specifically ViT-Tiny and ViT-Small. In assessing the quality of the representations through linear evaluation, our proposed method demonstrates significant improvement compared to vanilla MSN and state-of-the-art self-supervised learning baselines. In particular, our proposed method achieves an improvement of of 2% in the Area Under the Receiver Operating Characteristic Curve (AUROC) compared to vanilla MSN and 5% to 8% compared to other baselines, including uni-modal ones. Furthermore, our findings reveal that demographic features provide the most significant performance improvement. Our work highlights the potential of EHR-enhanced self-supervised pretraining for medical imaging and opens opportunities for future research to address limitations in existing representation learning methods for other medical imaging modalities, such as neuro-, ophthalmic, and sonar imaging.
Collapse
Affiliation(s)
- Saeed Shurrab
- New York University Abu Dhabi, Computer Engineering, Abu Dhabi, 129188, UAE
| | | | - Farah E Shamout
- New York University Abu Dhabi, Computer Engineering, Abu Dhabi, 129188, UAE.
| |
Collapse
|
35
|
Sun D, Macedonia C, Chen Z, Chandrasekaran S, Najarian K, Zhou S, Cernak T, Ellingrod VL, Jagadish HV, Marini B, Pai M, Violi A, Rech JC, Wang S, Li Y, Athey B, Omenn GS. Can Machine Learning Overcome the 95% Failure Rate and Reality that Only 30% of Approved Cancer Drugs Meaningfully Extend Patient Survival? J Med Chem 2024; 67:16035-16055. [PMID: 39253942 DOI: 10.1021/acs.jmedchem.4c01684] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/11/2024]
Abstract
Despite implementing hundreds of strategies, cancer drug development suffers from a 95% failure rate over 30 years, with only 30% of approved cancer drugs extending patient survival beyond 2.5 months. Adding more criteria without eliminating nonessential ones is impractical and may fall into the "survivorship bias" trap. Machine learning (ML) models may enhance efficiency by saving time and cost. Yet, they may not improve success rate without identifying the root causes of failure. We propose a "STAR-guided ML system" (structure-tissue/cell selectivity-activity relationship) to enhance success rate and efficiency by addressing three overlooked interdependent factors: potency/specificity to the on/off-targets determining efficacy in tumors at clinical doses, on/off-target-driven tissue/cell selectivity influencing adverse effects in the normal organs at clinical doses, and optimal clinical doses balancing efficacy/safety as determined by potency/specificity and tissue/cell selectivity. STAR-guided ML models can directly predict clinical dose/efficacy/safety from five features to design/select the best drugs, enhancing success and efficiency of cancer drug development.
Collapse
Affiliation(s)
| | | | - Zhigang Chen
- LabBotics.ai, Palo Alto, California 94303, United States
| | | | | | - Simon Zhou
- Aurinia Pharmaceuticals Inc., Rockville, Maryland 20850, United States
| | | | | | | | | | | | | | | | | | - Yan Li
- Translational Medicine and Clinical Pharmacology, Bristol Myers Squibb, Summit, New Jersey 07901, United States
| | | | | |
Collapse
|
36
|
Thangaraj PM, Benson SH, Oikonomou EK, Asselbergs FW, Khera R. Cardiovascular care with digital twin technology in the era of generative artificial intelligence. Eur Heart J 2024; 45:ehae619. [PMID: 39322420 PMCID: PMC11638093 DOI: 10.1093/eurheartj/ehae619] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/14/2024] [Revised: 07/16/2024] [Accepted: 09/01/2024] [Indexed: 09/27/2024] Open
Abstract
Digital twins, which are in silico replications of an individual and its environment, have advanced clinical decision-making and prognostication in cardiovascular medicine. The technology enables personalized simulations of clinical scenarios, prediction of disease risk, and strategies for clinical trial augmentation. Current applications of cardiovascular digital twins have integrated multi-modal data into mechanistic and statistical models to build physiologically accurate cardiac replicas to enhance disease phenotyping, enrich diagnostic workflows, and optimize procedural planning. Digital twin technology is rapidly evolving in the setting of newly available data modalities and advances in generative artificial intelligence, enabling dynamic and comprehensive simulations unique to an individual. These twins fuse physiologic, environmental, and healthcare data into machine learning and generative models to build real-time patient predictions that can model interactions with the clinical environment to accelerate personalized patient care. This review summarizes digital twins in cardiovascular medicine and their potential future applications by incorporating new personalized data modalities. It examines the technical advances in deep learning and generative artificial intelligence that broaden the scope and predictive power of digital twins. Finally, it highlights the individual and societal challenges as well as ethical considerations that are essential to realizing the future vision of incorporating cardiology digital twins into personalized cardiovascular care.
Collapse
Affiliation(s)
- Phyllis M Thangaraj
- Section of Cardiology, Department of Internal Medicine, Yale School of Medicine, 789 Howard Ave., New Haven, CT, USA
| | - Sean H Benson
- Department of Cardiology, Amsterdam Cardiovascular Sciences, Amsterdam University Medical Center, University of Amsterdam, Amsterdam, Netherlands
| | - Evangelos K Oikonomou
- Section of Cardiology, Department of Internal Medicine, Yale School of Medicine, 789 Howard Ave., New Haven, CT, USA
| | - Folkert W Asselbergs
- Department of Cardiology, Amsterdam Cardiovascular Sciences, Amsterdam University Medical Center, University of Amsterdam, Amsterdam, Netherlands
- Institute of Health Informatics, University College London, London, UK
- The National Institute for Health Research University College London Hospitals Biomedical Research Center, University College London, London, UK
| | - Rohan Khera
- Section of Cardiology, Department of Internal Medicine, Yale School of Medicine, 789 Howard Ave., New Haven, CT, USA
- Section of Health Informatics, Department of Biostatistics, Yale School of Public Health, 47 College St., New Haven, CT, USA
- Department of Biomedical Informatics and Data Science, Yale School of Medicine, 100 College St. Fl 9, New Haven, CT, USA
- Center for Outcomes Research and Evaluation, Yale-New Haven Hospital, 195 Church St. Fl 6, New Haven, CT 06510, USA
| |
Collapse
|
37
|
AlSaad R, Abd-Alrazaq A, Boughorbel S, Ahmed A, Renault MA, Damseh R, Sheikh J. Multimodal Large Language Models in Health Care: Applications, Challenges, and Future Outlook. J Med Internet Res 2024; 26:e59505. [PMID: 39321458 PMCID: PMC11464944 DOI: 10.2196/59505] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2024] [Revised: 08/07/2024] [Accepted: 08/20/2024] [Indexed: 09/27/2024] Open
Abstract
In the complex and multidimensional field of medicine, multimodal data are prevalent and crucial for informed clinical decisions. Multimodal data span a broad spectrum of data types, including medical images (eg, MRI and CT scans), time-series data (eg, sensor data from wearable devices and electronic health records), audio recordings (eg, heart and respiratory sounds and patient interviews), text (eg, clinical notes and research articles), videos (eg, surgical procedures), and omics data (eg, genomics and proteomics). While advancements in large language models (LLMs) have enabled new applications for knowledge retrieval and processing in the medical field, most LLMs remain limited to processing unimodal data, typically text-based content, and often overlook the importance of integrating the diverse data modalities encountered in clinical practice. This paper aims to present a detailed, practical, and solution-oriented perspective on the use of multimodal LLMs (M-LLMs) in the medical field. Our investigation spanned M-LLM foundational principles, current and potential applications, technical and ethical challenges, and future research directions. By connecting these elements, we aimed to provide a comprehensive framework that links diverse aspects of M-LLMs, offering a unified vision for their future in health care. This approach aims to guide both future research and practical implementations of M-LLMs in health care, positioning them as a paradigm shift toward integrated, multimodal data-driven medical practice. We anticipate that this work will spark further discussion and inspire the development of innovative approaches in the next generation of medical M-LLM systems.
Collapse
Affiliation(s)
- Rawan AlSaad
- Weill Cornell Medicine-Qatar, Education City, Doha, Qatar
| | | | - Sabri Boughorbel
- Qatar Computing Research Institute, Hamad Bin Khalifa University, Doha, Qatar
| | - Arfan Ahmed
- Weill Cornell Medicine-Qatar, Education City, Doha, Qatar
| | | | - Rafat Damseh
- Department of Computer Science and Software Engineering, United Arab Emirates University, Al Ain, United Arab Emirates
| | - Javaid Sheikh
- Weill Cornell Medicine-Qatar, Education City, Doha, Qatar
| |
Collapse
|
38
|
Mess F, Blaschke S, Gebhard D, Friedrich J. Precision prevention in occupational health: a conceptual analysis and development of a unified understanding and an integrative framework. Front Public Health 2024; 12:1444521. [PMID: 39360261 PMCID: PMC11445082 DOI: 10.3389/fpubh.2024.1444521] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2024] [Accepted: 09/02/2024] [Indexed: 10/04/2024] Open
Abstract
Introduction Precision prevention implements highly precise, tailored health interventions for individuals by directly addressing personal and environmental determinants of health. However, precision prevention does not yet appear to be fully established in occupational health. There are numerous understandings and conceptual approaches, but these have not yet been systematically presented or synthesized. Therefore, this conceptual analysis aims to propose a unified understanding and develop an integrative conceptual framework for precision prevention in occupational health. Methods Firstly, to systematically present definitions and frameworks of precision prevention in occupational health, six international databases were searched for studies published between January 2010 and January 2024 that used the term precision prevention or its synonyms in the context of occupational health. Secondly, a qualitative content analysis was conducted to analyze the existing definitions and propose a unified understanding. Thirdly, based on the identified frameworks, a multi-stage exploratory development process was applied to develop and propose an integrative conceptual framework for precision prevention in occupational health. Results After screening 3,681 articles, 154 publications were reviewed, wherein 29 definitions of precision prevention and 64 different frameworks were found, which can be summarized in eight higher-order categories. The qualitative content analysis revealed seven themes and illustrated many different wordings. The proposed unified understanding of precision prevention in occupational health takes up the identified themes. It includes, among other things, a contrast to a "one-size-fits-all approach" with a risk- and resource-oriented data collection and innovative data analytics with profiling to provide and improve tailored interventions. The developed and proposed integrative conceptual framework comprises three overarching stages: (1) data generation, (2) data management lifecycle and (3) interventions (development, implementation and adaptation). Discussion Although there are already numerous studies on precision prevention in occupational health, this conceptual analysis offers, for the first time, a proposal for a unified understanding and an integrative conceptual framework. However, the proposed unified understanding and the developed integrative conceptual framework should only be seen as an initial proposal that should be critically discussed and further developed to expand and strengthen both research on precision prevention in occupational health and its practical application in the workplace.
Collapse
Affiliation(s)
- Filip Mess
- Department Health and Sport Sciences, TUM School of Medicine and Health, Technical University of Munich, Munich, Germany
| | | | | | - Julian Friedrich
- Department Health and Sport Sciences, TUM School of Medicine and Health, Technical University of Munich, Munich, Germany
| |
Collapse
|
39
|
Cirincione A, Lynch K, Bennett J, Choupan J, Varghese B, Sheikh-Bahaei N, Pandey G. Prediction of future dementia among patients with mild cognitive impairment (MCI) by integrating multimodal clinical data. Heliyon 2024; 10:e36728. [PMID: 39281465 PMCID: PMC11399681 DOI: 10.1016/j.heliyon.2024.e36728] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Revised: 08/20/2024] [Accepted: 08/21/2024] [Indexed: 09/18/2024] Open
Abstract
Efficiently and objectively analyzing the complex, diverse multimodal data collected from patients at risk for dementia can be difficult in the clinical setting, contributing to high rates of underdiagnosis or misdiagnosis of this serious disorder. Patients with mild cognitive impairment (MCI) are especially at risk of developing dementia in the future. This study evaluated the ability of multi-modal machine learning (ML) methods, especially the Ensemble Integration (EI) framework, to predict future dementia development among patients with MCI. EI is a machine learning framework designed to leverage complementarity and consensus in multimodal data, which may not be adequately captured by methods used by prior dementia-related prediction studies. We tested EI's ability to predict future dementia development among MCI patients using multimodal clinical and imaging data, such as neuroanatomical measurements from structural magnetic resonance imaging (MRI) and positron emission tomography (PET) scans, from The Alzheimer's Disease Prediction of Longitudinal Evolution (TADPOLE) challenge. For predicting future dementia development among MCI patients, on a held out test set, the EI-based model performed better (AUC = 0.81, F-measure = 0.68) than the more commonly used XGBoost (AUC = 0.68, F-measure = 0.57) and deep learning (AUC = 0.79, F-measure = 0.61) approaches. This EI-based model also suggested MRI-derived volumes of regions in the middle temporal gyrus, posterior cingulate gyrus and inferior lateral ventricle brain regions to be predictive of progression to dementia.
Collapse
Affiliation(s)
- Andrew Cirincione
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1 Gustave L. Levy Pl, New York, NY, 10029, USA
| | - Kirsten Lynch
- Laboratory of Neuro Imaging (LONI), USC Mark and Mary Stevens Neuroimaging and Informatics Institute, Keck School of Medicine, University of Southern California, 1975 Zonal Ave, Los Angeles, CA, 90033, USA
| | - Jamie Bennett
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1 Gustave L. Levy Pl, New York, NY, 10029, USA
| | - Jeiran Choupan
- Laboratory of Neuro Imaging (LONI), USC Mark and Mary Stevens Neuroimaging and Informatics Institute, Keck School of Medicine, University of Southern California, 1975 Zonal Ave, Los Angeles, CA, 90033, USA
- NeuroScope Inc., Scarsdale, NY, 10583, USA
| | - Bino Varghese
- Department of Radiology, Keck School of Medicine, University of Southern California, 1975 Zonal Ave, Los Angeles, CA, 90033, USA
| | - Nasim Sheikh-Bahaei
- Department of Radiology, Keck School of Medicine, University of Southern California, 1975 Zonal Ave, Los Angeles, CA, 90033, USA
| | - Gaurav Pandey
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1 Gustave L. Levy Pl, New York, NY, 10029, USA
| |
Collapse
|
40
|
Amini Gougeh R, Zilic Z. Systematic Review of IoT-Based Solutions for User Tracking: Towards Smarter Lifestyle, Wellness and Health Management. SENSORS (BASEL, SWITZERLAND) 2024; 24:5939. [PMID: 39338683 PMCID: PMC11435569 DOI: 10.3390/s24185939] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/07/2024] [Revised: 09/07/2024] [Accepted: 09/09/2024] [Indexed: 09/30/2024]
Abstract
The Internet of Things (IoT) base has grown to over 20 billion devices currently operational worldwide. As they greatly extend the applicability and use of biosensors, IoT developments are transformative. Recent studies show that IoT, coupled with advanced communication frameworks, such as machine-to-machine (M2M) interactions, can lead to (1) improved efficiency in data exchange, (2) accurate and timely health monitoring, and (3) enhanced user engagement and compliance through advancements in human-computer interaction. This systematic review of the 19 most relevant studies examines the potential of IoT in health and lifestyle management by conducting detailed analyses and quality assessments of each study. Findings indicate that IoT-based systems effectively monitor various health parameters using biosensors, facilitate real-time feedback, and support personalized health recommendations. Key limitations include small sample sizes, insufficient security measures, practical issues with wearable sensors, and reliance on internet connectivity in areas with poor network infrastructure. The reviewed studies demonstrated innovative applications of IoT, focusing on M2M interactions, edge devices, multimodality health monitoring, intelligent decision-making, and automated health management systems. These insights offer valuable recommendations for optimizing IoT technologies in health and wellness management.
Collapse
Affiliation(s)
- Reza Amini Gougeh
- Faculty of Engineering, Department of Electrical and Computer Engineering, McGill University, Montreal, QC H3A 0G4, Canada;
| | | |
Collapse
|
41
|
Liu W, Chen J, Wang H, Fu Z, Peijnenburg WJGM, Hong H. Perspectives on Advancing Multimodal Learning in Environmental Science and Engineering Studies. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2024. [PMID: 39226136 DOI: 10.1021/acs.est.4c03088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/05/2024]
Abstract
The environment faces increasing anthropogenic impacts, resulting in a rapid increase in environmental issues that undermine the natural capital essential for human wellbeing. These issues are complex and often influenced by various factors represented by data with different modalities. While machine learning (ML) provides data-driven tools for addressing the environmental issues, the current ML models in environmental science and engineering (ES&E) often neglect the utilization of multimodal data. With the advancement in deep learning, multimodal learning (MML) holds promise for comprehensive descriptions of the environmental issues by harnessing data from diverse modalities. This advancement has the potential to significantly elevate the accuracy and robustness of prediction models in ES&E studies, providing enhanced solutions for various environmental modeling tasks. This perspective summarizes MML methodologies and proposes potential applications of MML models in ES&E studies, including environmental quality assessment, prediction of chemical hazards, and optimization of pollution control techniques. Additionally, we discuss the challenges associated with implementing MML in ES&E and propose future research directions in this domain.
Collapse
Affiliation(s)
- Wenjia Liu
- Key Laboratory of Industrial Ecology and Environmental Engineering (Ministry of Education), Dalian Key Laboratory on Chemicals Risk Control and Pollution Prevention Technology, School of Environmental Science and Technology, Dalian University of Technology, Dalian 116024, China
| | - Jingwen Chen
- Key Laboratory of Industrial Ecology and Environmental Engineering (Ministry of Education), Dalian Key Laboratory on Chemicals Risk Control and Pollution Prevention Technology, School of Environmental Science and Technology, Dalian University of Technology, Dalian 116024, China
| | - Haobo Wang
- Key Laboratory of Industrial Ecology and Environmental Engineering (Ministry of Education), Dalian Key Laboratory on Chemicals Risk Control and Pollution Prevention Technology, School of Environmental Science and Technology, Dalian University of Technology, Dalian 116024, China
| | - Zhiqiang Fu
- Key Laboratory of Industrial Ecology and Environmental Engineering (Ministry of Education), Dalian Key Laboratory on Chemicals Risk Control and Pollution Prevention Technology, School of Environmental Science and Technology, Dalian University of Technology, Dalian 116024, China
| | - Willie J G M Peijnenburg
- Institute of Environmental Sciences (CML), Leiden University, Leiden 2300 RA, The Netherlands
- Centre for Safety of Substances and Products, National Institute of Public Health and the Environment (RIVM), Bilthoven 3720 BA, The Netherlands
| | - Huixiao Hong
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, Arkansas 72079, United States
| |
Collapse
|
42
|
Zong H, Wu R, Cha J, Feng W, Wu E, Li J, Shao A, Tao L, Li Z, Tang B, Shen B. Advancing Chinese biomedical text mining with community challenges. J Biomed Inform 2024; 157:104716. [PMID: 39197732 DOI: 10.1016/j.jbi.2024.104716] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2024] [Revised: 08/22/2024] [Accepted: 08/25/2024] [Indexed: 09/01/2024]
Abstract
OBJECTIVE This study aims to review the recent advances in community challenges for biomedical text mining in China. METHODS We collected information of evaluation tasks released in community challenges of biomedical text mining, including task description, dataset description, data source, task type and related links. A systematic summary and comparative analysis were conducted on various biomedical natural language processing tasks, such as named entity recognition, entity normalization, attribute extraction, relation extraction, event extraction, text classification, text similarity, knowledge graph construction, question answering, text generation, and large language model evaluation. RESULTS We identified 39 evaluation tasks from 6 community challenges that spanned from 2017 to 2023. Our analysis revealed the diverse range of evaluation task types and data sources in biomedical text mining. We explored the potential clinical applications of these community challenge tasks from a translational biomedical informatics perspective. We compared with their English counterparts, and discussed the contributions, limitations, lessons and guidelines of these community challenges, while highlighting future directions in the era of large language models. CONCLUSION Community challenge evaluation competitions have played a crucial role in promoting technology innovation and fostering interdisciplinary collaboration in the field of biomedical text mining. These challenges provide valuable platforms for researchers to develop state-of-the-art solutions.
Collapse
Affiliation(s)
- Hui Zong
- Joint Laboratory of Artificial Intelligence for Critical Care Medicine, Department of Critical Care Medicine and Institutes for Systems Genetics, Frontiers Science Center for Disease-related Molecular Network, West China Hospital, Sichuan University, Chengdu 610041, China
| | - Rongrong Wu
- Joint Laboratory of Artificial Intelligence for Critical Care Medicine, Department of Critical Care Medicine and Institutes for Systems Genetics, Frontiers Science Center for Disease-related Molecular Network, West China Hospital, Sichuan University, Chengdu 610041, China
| | - Jiaxue Cha
- Shanghai Key Laboratory of Signaling and Disease Research, Laboratory of Receptor-Based Bio-Medicine, Collaborative Innovation Center for Brain Science, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Weizhe Feng
- Joint Laboratory of Artificial Intelligence for Critical Care Medicine, Department of Critical Care Medicine and Institutes for Systems Genetics, Frontiers Science Center for Disease-related Molecular Network, West China Hospital, Sichuan University, Chengdu 610041, China
| | - Erman Wu
- Joint Laboratory of Artificial Intelligence for Critical Care Medicine, Department of Critical Care Medicine and Institutes for Systems Genetics, Frontiers Science Center for Disease-related Molecular Network, West China Hospital, Sichuan University, Chengdu 610041, China
| | - Jiakun Li
- Joint Laboratory of Artificial Intelligence for Critical Care Medicine, Department of Critical Care Medicine and Institutes for Systems Genetics, Frontiers Science Center for Disease-related Molecular Network, West China Hospital, Sichuan University, Chengdu 610041, China; Department of Urology, West China Hospital, Sichuan University, Chengdu 610041, China
| | - Aibin Shao
- Joint Laboratory of Artificial Intelligence for Critical Care Medicine, Department of Critical Care Medicine and Institutes for Systems Genetics, Frontiers Science Center for Disease-related Molecular Network, West China Hospital, Sichuan University, Chengdu 610041, China
| | - Liang Tao
- Faculty of Business Information, Shanghai Business School, Shanghai 201400, China
| | | | - Buzhou Tang
- Department of Computer Science, Harbin Institute of Technology, Shenzhen 518055, China
| | - Bairong Shen
- Joint Laboratory of Artificial Intelligence for Critical Care Medicine, Department of Critical Care Medicine and Institutes for Systems Genetics, Frontiers Science Center for Disease-related Molecular Network, West China Hospital, Sichuan University, Chengdu 610041, China.
| |
Collapse
|
43
|
Isavand P, Aghamiri SS, Amin R. Applications of Multimodal Artificial Intelligence in Non-Hodgkin Lymphoma B Cells. Biomedicines 2024; 12:1753. [PMID: 39200217 PMCID: PMC11351272 DOI: 10.3390/biomedicines12081753] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2024] [Revised: 07/22/2024] [Accepted: 08/01/2024] [Indexed: 09/02/2024] Open
Abstract
Given advancements in large-scale data and AI, integrating multimodal artificial intelligence into cancer research can enhance our understanding of tumor behavior by simultaneously processing diverse biomedical data types. In this review, we explore the potential of multimodal AI in comprehending B-cell non-Hodgkin lymphomas (B-NHLs). B-cell non-Hodgkin lymphomas (B-NHLs) represent a particular challenge in oncology due to tumor heterogeneity and the intricate ecosystem in which tumors develop. These complexities complicate diagnosis, prognosis, and therapy response, emphasizing the need to use sophisticated approaches to enhance personalized treatment strategies for better patient outcomes. Therefore, multimodal AI can be leveraged to synthesize critical information from available biomedical data such as clinical record, imaging, pathology and omics data, to picture the whole tumor. In this review, we first define various types of modalities, multimodal AI frameworks, and several applications in precision medicine. Then, we provide several examples of its usage in B-NHLs, for analyzing the complexity of the ecosystem, identifying immune biomarkers, optimizing therapy strategy, and its clinical applications. Lastly, we address the limitations and future directions of multimodal AI, highlighting the need to overcome these challenges for better clinical practice and application in healthcare.
Collapse
Affiliation(s)
- Pouria Isavand
- Department of Radiology, School of Medicine, Zanjan University of Medical Sciences, Zanjan 4513956184, Iran
| | | | - Rada Amin
- Department of Biochemistry, University of Nebraska, Lincoln, NE 68503, USA
| |
Collapse
|
44
|
Ferrara E. Large Language Models for Wearable Sensor-Based Human Activity Recognition, Health Monitoring, and Behavioral Modeling: A Survey of Early Trends, Datasets, and Challenges. SENSORS (BASEL, SWITZERLAND) 2024; 24:5045. [PMID: 39124092 PMCID: PMC11314694 DOI: 10.3390/s24155045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/10/2024] [Revised: 07/29/2024] [Accepted: 07/31/2024] [Indexed: 08/12/2024]
Abstract
The proliferation of wearable technology enables the generation of vast amounts of sensor data, offering significant opportunities for advancements in health monitoring, activity recognition, and personalized medicine. However, the complexity and volume of these data present substantial challenges in data modeling and analysis, which have been addressed with approaches spanning time series modeling to deep learning techniques. The latest frontier in this domain is the adoption of large language models (LLMs), such as GPT-4 and Llama, for data analysis, modeling, understanding, and human behavior monitoring through the lens of wearable sensor data. This survey explores the current trends and challenges in applying LLMs for sensor-based human activity recognition and behavior modeling. We discuss the nature of wearable sensor data, the capabilities and limitations of LLMs in modeling them, and their integration with traditional machine learning techniques. We also identify key challenges, including data quality, computational requirements, interpretability, and privacy concerns. By examining case studies and successful applications, we highlight the potential of LLMs in enhancing the analysis and interpretation of wearable sensor data. Finally, we propose future directions for research, emphasizing the need for improved preprocessing techniques, more efficient and scalable models, and interdisciplinary collaboration. This survey aims to provide a comprehensive overview of the intersection between wearable sensor data and LLMs, offering insights into the current state and future prospects of this emerging field.
Collapse
Affiliation(s)
- Emilio Ferrara
- Thomas Lord Department of Computer Science, University of Southern California, Los Angeles, CA 90007, USA;
- Information Sciences Institute, School of Advanced Computing, University of Southern California, Los Angeles, CA 90007, USA
| |
Collapse
|
45
|
Johnson R, Li MM, Noori A, Queen O, Zitnik M. Graph Artificial Intelligence in Medicine. Annu Rev Biomed Data Sci 2024; 7:345-368. [PMID: 38749465 PMCID: PMC11344018 DOI: 10.1146/annurev-biodatasci-110723-024625] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/23/2024]
Abstract
In clinical artificial intelligence (AI), graph representation learning, mainly through graph neural networks and graph transformer architectures, stands out for its capability to capture intricate relationships and structures within clinical datasets. With diverse data-from patient records to imaging-graph AI models process data holistically by viewing modalities and entities within them as nodes interconnected by their relationships. Graph AI facilitates model transfer across clinical tasks, enabling models to generalize across patient populations without additional parameters and with minimal to no retraining. However, the importance of human-centered design and model interpretability in clinical decision-making cannot be overstated. Since graph AI models capture information through localized neural transformations defined on relational datasets, they offer both an opportunity and a challenge in elucidating model rationale. Knowledge graphs can enhance interpretability by aligning model-driven insights with medical knowledge. Emerging graph AI models integrate diverse data modalities through pretraining, facilitate interactive feedback loops, and foster human-AI collaboration, paving the way toward clinically meaningful predictions.
Collapse
Affiliation(s)
- Ruth Johnson
- Berkowitz Family Living Laboratory, Harvard Medical School, Boston, Massachusetts, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA;
| | - Michelle M Li
- Bioinformatics and Integrative Genomics Program, Harvard Medical School, Boston, Massachusetts, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA;
| | - Ayush Noori
- Department of Computer Science, Harvard John A. Paulson School of Engineering and Applied Sciences, Allston, Massachusetts, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA;
| | - Owen Queen
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA;
| | - Marinka Zitnik
- Harvard Data Science Initiative, Cambridge, Massachusetts, USA
- Kempner Institute for the Study of Natural and Artificial Intelligence, Harvard University, Allston, Massachusetts, USA
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA;
| |
Collapse
|
46
|
Wang K, Margolis S, Cho JM, Wang S, Arianpour B, Jabalera A, Yin J, Hong W, Zhang Y, Zhao P, Zhu E, Reddy S, Hsiai TK. Non-Invasive Detection of Early-Stage Fatty Liver Disease via an On-Skin Impedance Sensor and Attention-Based Deep Learning. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2024; 11:e2400596. [PMID: 38887178 PMCID: PMC11336938 DOI: 10.1002/advs.202400596] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/16/2024] [Revised: 03/17/2024] [Indexed: 06/20/2024]
Abstract
Early-stage nonalcoholic fatty liver disease (NAFLD) is a silent condition, with most cases going undiagnosed, potentially progressing to liver cirrhosis and cancer. A non-invasive and cost-effective detection method for early-stage NAFLD detection is a public health priority but challenging. In this study, an adhesive, soft on-skin sensor with low electrode-skin contact impedance for early-stage NAFLD detection is fabricated. A method is developed to synthesize platinum nanoparticles and reduced graphene quantum dots onto the on-skin sensor to reduce electrode-skin contact impedance by increasing double-layer capacitance, thereby enhancing detection accuracy. Furthermore, an attention-based deep learning algorithm is introduced to differentiate impedance signals associated with early-stage NAFLD in high-fat-diet-fed low-density lipoprotein receptor knockout (Ldlr-/-) mice compared to healthy controls. The integration of an adhesive, soft on-skin sensor with low electrode-skin contact impedance and the attention-based deep learning algorithm significantly enhances the detection accuracy for early-stage NAFLD, achieving a rate above 97.5% with an area under the receiver operating characteristic curve (AUC) of 1.0. The findings present a non-invasive approach for early-stage NAFLD detection and display a strategy for improved early detection through on-skin electronics and deep learning.
Collapse
Affiliation(s)
- Kaidong Wang
- Department of MedicineDavid Geffen School of MedicineUniversity of California Los AngelesLos AngelesCA90095USA
- Department of Bioengineering, Henry Samueli School of Engineering and Applied SciencesUniversity of California Los AngelesLos AngelesCA90095USA
- Department of MedicineGreater Los Angeles Veterans Affairs (VA) Healthcare SystemLos AngelesCA90073USA
| | - Samuel Margolis
- Department of MedicineDavid Geffen School of MedicineUniversity of California Los AngelesLos AngelesCA90095USA
| | - Jae Min Cho
- Department of MedicineDavid Geffen School of MedicineUniversity of California Los AngelesLos AngelesCA90095USA
| | - Shaolei Wang
- Department of Bioengineering, Henry Samueli School of Engineering and Applied SciencesUniversity of California Los AngelesLos AngelesCA90095USA
| | - Brian Arianpour
- Department of Bioengineering, Henry Samueli School of Engineering and Applied SciencesUniversity of California Los AngelesLos AngelesCA90095USA
| | - Alejandro Jabalera
- Department of Bioengineering, Henry Samueli School of Engineering and Applied SciencesUniversity of California Los AngelesLos AngelesCA90095USA
| | - Junyi Yin
- Department of Bioengineering, Henry Samueli School of Engineering and Applied SciencesUniversity of California Los AngelesLos AngelesCA90095USA
| | - Wen Hong
- Department of Materials Science and EngineeringUniversity of California Los AngelesLos AngelesCA90095USA
| | - Yaran Zhang
- Department of Bioengineering, Henry Samueli School of Engineering and Applied SciencesUniversity of California Los AngelesLos AngelesCA90095USA
| | - Peng Zhao
- Department of MedicineDavid Geffen School of MedicineUniversity of California Los AngelesLos AngelesCA90095USA
| | - Enbo Zhu
- Department of MedicineDavid Geffen School of MedicineUniversity of California Los AngelesLos AngelesCA90095USA
- Department of Materials Science and EngineeringUniversity of California Los AngelesLos AngelesCA90095USA
| | - Srinivasa Reddy
- Department of Molecular and Medical PharmacologyUniversity of California Los AngelesLos AngelesCA90095USA
| | - Tzung K. Hsiai
- Department of MedicineDavid Geffen School of MedicineUniversity of California Los AngelesLos AngelesCA90095USA
- Department of Bioengineering, Henry Samueli School of Engineering and Applied SciencesUniversity of California Los AngelesLos AngelesCA90095USA
- Department of MedicineGreater Los Angeles Veterans Affairs (VA) Healthcare SystemLos AngelesCA90073USA
| |
Collapse
|
47
|
Li Y, El Habib Daho M, Conze PH, Zeghlache R, Le Boité H, Tadayoni R, Cochener B, Lamard M, Quellec G. A review of deep learning-based information fusion techniques for multimodal medical image classification. Comput Biol Med 2024; 177:108635. [PMID: 38796881 DOI: 10.1016/j.compbiomed.2024.108635] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Revised: 03/18/2024] [Accepted: 05/18/2024] [Indexed: 05/29/2024]
Abstract
Multimodal medical imaging plays a pivotal role in clinical diagnosis and research, as it combines information from various imaging modalities to provide a more comprehensive understanding of the underlying pathology. Recently, deep learning-based multimodal fusion techniques have emerged as powerful tools for improving medical image classification. This review offers a thorough analysis of the developments in deep learning-based multimodal fusion for medical classification tasks. We explore the complementary relationships among prevalent clinical modalities and outline three main fusion schemes for multimodal classification networks: input fusion, intermediate fusion (encompassing single-level fusion, hierarchical fusion, and attention-based fusion), and output fusion. By evaluating the performance of these fusion techniques, we provide insight into the suitability of different network architectures for various multimodal fusion scenarios and application domains. Furthermore, we delve into challenges related to network architecture selection, handling incomplete multimodal data management, and the potential limitations of multimodal fusion. Finally, we spotlight the promising future of Transformer-based multimodal fusion techniques and give recommendations for future research in this rapidly evolving field.
Collapse
Affiliation(s)
- Yihao Li
- LaTIM UMR 1101, Inserm, Brest, France; University of Western Brittany, Brest, France
| | - Mostafa El Habib Daho
- LaTIM UMR 1101, Inserm, Brest, France; University of Western Brittany, Brest, France.
| | | | - Rachid Zeghlache
- LaTIM UMR 1101, Inserm, Brest, France; University of Western Brittany, Brest, France
| | - Hugo Le Boité
- Sorbonne University, Paris, France; Ophthalmology Department, Lariboisière Hospital, AP-HP, Paris, France
| | - Ramin Tadayoni
- Ophthalmology Department, Lariboisière Hospital, AP-HP, Paris, France; Paris Cité University, Paris, France
| | - Béatrice Cochener
- LaTIM UMR 1101, Inserm, Brest, France; University of Western Brittany, Brest, France; Ophthalmology Department, CHRU Brest, Brest, France
| | - Mathieu Lamard
- LaTIM UMR 1101, Inserm, Brest, France; University of Western Brittany, Brest, France
| | | |
Collapse
|
48
|
Luo Y, Mao C, Sanchez‐Pinto LN, Ahmad FS, Naidech A, Rasmussen L, Pacheco JA, Schneider D, Mithal LB, Dresden S, Holmes K, Carson M, Shah SJ, Khan S, Clare S, Wunderink RG, Liu H, Walunas T, Cooper L, Yue F, Wehbe F, Fang D, Liebovitz DM, Markl M, Michelson KN, McColley SA, Green M, Starren J, Ackermann RT, D'Aquila RT, Adams J, Lloyd‐Jones D, Chisholm RL, Kho A. Northwestern University resource and education development initiatives to advance collaborative artificial intelligence across the learning health system. Learn Health Syst 2024; 8:e10417. [PMID: 39036530 PMCID: PMC11257059 DOI: 10.1002/lrh2.10417] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Revised: 02/22/2024] [Accepted: 02/26/2024] [Indexed: 07/23/2024] Open
Abstract
Introduction The rapid development of artificial intelligence (AI) in healthcare has exposed the unmet need for growing a multidisciplinary workforce that can collaborate effectively in the learning health systems. Maximizing the synergy among multiple teams is critical for Collaborative AI in Healthcare. Methods We have developed a series of data, tools, and educational resources for cultivating the next generation of multidisciplinary workforce for Collaborative AI in Healthcare. We built bulk-natural language processing pipelines to extract structured information from clinical notes and stored them in common data models. We developed multimodal AI/machine learning (ML) tools and tutorials to enrich the toolbox of the multidisciplinary workforce to analyze multimodal healthcare data. We have created a fertile ground to cross-pollinate clinicians and AI scientists and train the next generation of AI health workforce to collaborate effectively. Results Our work has democratized access to unstructured health information, AI/ML tools and resources for healthcare, and collaborative education resources. From 2017 to 2022, this has enabled studies in multiple clinical specialties resulting in 68 peer-reviewed publications. In 2022, our cross-discipline efforts converged and institutionalized into the Center for Collaborative AI in Healthcare. Conclusions Our Collaborative AI in Healthcare initiatives has created valuable educational and practical resources. They have enabled more clinicians, scientists, and hospital administrators to successfully apply AI methods in their daily research and practice, develop closer collaborations, and advanced the institution-level learning health system.
Collapse
Affiliation(s)
- Yuan Luo
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Division of Health and Biomedical Informatics, Department of Preventive MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Chengsheng Mao
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Division of Health and Biomedical Informatics, Department of Preventive MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Lazaro N. Sanchez‐Pinto
- Division of Health and Biomedical Informatics, Department of Preventive MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
- Division of Critical Care, Department of PediatricsNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
- Stanley Manne Children's Research InstituteAnn & Robert H. Lurie Children's Hospital of ChicagoChicagoIllinoisUSA
| | - Faraz S. Ahmad
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Division of Cardiology, Department of MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Andrew Naidech
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Division of Neurocritical Care, Department of NeurologyNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Luke Rasmussen
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Division of Health and Biomedical Informatics, Department of Preventive MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Jennifer A. Pacheco
- Center for Genetic MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Daniel Schneider
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
| | - Leena B. Mithal
- Stanley Manne Children's Research InstituteAnn & Robert H. Lurie Children's Hospital of ChicagoChicagoIllinoisUSA
- Division of Infectious Diseases, Department of PediatricsNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Scott Dresden
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Department of Emergency MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Kristi Holmes
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Division of Health and Biomedical Informatics, Department of Preventive MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
- Galter Health Sciences LibraryNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Matthew Carson
- Galter Health Sciences LibraryNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Sanjiv J. Shah
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Division of Cardiology, Department of MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Seema Khan
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Department of SurgeryNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Susan Clare
- Department of SurgeryNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Richard G. Wunderink
- Division of Critical Care, Department of PediatricsNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
- Pulmonary and Critical Care Division, Department of MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Huiping Liu
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Department of PharmacologyNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
- Division of Hematology and Oncology, Department of MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Theresa Walunas
- Division of Health and Biomedical Informatics, Department of Preventive MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
- Division of General Internal Medicine, Department of MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
- Center for Health Information PartnershipsInstitute for Public Health and Medicine, Northwestern UniversityChicagoIllinoisUSA
- Department of Microbiology‐ImmunologyNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Lee Cooper
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Division of Health and Biomedical Informatics, Department of Preventive MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
- Department of PathologyNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Feng Yue
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Department of PathologyNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
- Department of Biochemistry and Molecular GeneticsNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Firas Wehbe
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Department of SurgeryNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Deyu Fang
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Department of PathologyNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - David M. Liebovitz
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Division of Health and Biomedical Informatics, Department of Preventive MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
- Division of General Internal Medicine, Department of MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
- Center for Health Information PartnershipsInstitute for Public Health and Medicine, Northwestern UniversityChicagoIllinoisUSA
| | - Michael Markl
- Department of RadiologyNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Kelly N. Michelson
- Division of Critical Care, Department of PediatricsNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
- Stanley Manne Children's Research InstituteAnn & Robert H. Lurie Children's Hospital of ChicagoChicagoIllinoisUSA
- Center for Bioethics and Medical Humanities, Institute for Public Health and MedicineNorthwestern UniversityChicagoIllinoisUSA
| | - Susanna A. McColley
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Stanley Manne Children's Research InstituteAnn & Robert H. Lurie Children's Hospital of ChicagoChicagoIllinoisUSA
- Division of Pulmonary and Sleep Medicine, Department of PediatricsNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Marianne Green
- Division of General Internal Medicine, Department of MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Justin Starren
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Division of Health and Biomedical Informatics, Department of Preventive MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Ronald T. Ackermann
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Division of General Internal Medicine, Department of MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
- Institute for Public Health and MedicineNorthwestern UniversityChicagoIllinoisUSA
| | - Richard T. D'Aquila
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Division of Infectious Diseases, Department of MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - James Adams
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Department of Emergency MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Donald Lloyd‐Jones
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Division of Epidemiology, Department of Preventive MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Rex L. Chisholm
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Department of SurgeryNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
- Center for Health Information PartnershipsInstitute for Public Health and Medicine, Northwestern UniversityChicagoIllinoisUSA
| | - Abel Kho
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Division of Health and Biomedical Informatics, Department of Preventive MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
- Division of General Internal Medicine, Department of MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
- Center for Health Information PartnershipsInstitute for Public Health and Medicine, Northwestern UniversityChicagoIllinoisUSA
| |
Collapse
|
49
|
Schiffer-Kane K, Liu C, Callahan TJ, Ta C, Nestor JG, Weng C. Converting OMOP CDM to phenopackets: A model alignment and patient data representation evaluation. J Biomed Inform 2024; 155:104659. [PMID: 38777085 PMCID: PMC11181468 DOI: 10.1016/j.jbi.2024.104659] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Revised: 05/11/2024] [Accepted: 05/18/2024] [Indexed: 05/25/2024]
Abstract
OBJECTIVE This study aims to promote interoperability in precision medicine and translational research by aligning the Observational Medical Outcomes Partnership (OMOP) and Phenopackets data models. Phenopackets is an expert knowledge-driven schema designed to facilitate the storage and exchange of multimodal patient data, and support downstream analysis. The first goal of this paper is to explore model alignment by characterizing the common data models using a newly developed data transformation process and evaluation method. Second, using OMOP normalized clinical data, we evaluate the mapping of real-world patient data to Phenopackets. We evaluate the suitability of Phenopackets as a patient data representation for real-world clinical cases. METHODS We identified mappings between OMOP and Phenopackets and applied them to a real patient dataset to assess the transformation's success. We analyzed gaps between the models and identified key considerations for transforming data between them. Further, to improve ambiguous alignment, we incorporated Unified Medical Language System (UMLS) semantic type-based filtering to direct individual concepts to their most appropriate domain and conducted a domain-expert evaluation of the mapping's clinical utility. RESULTS The OMOP to Phenopacket transformation pipeline was executed for 1,000 Alzheimer's disease patients and successfully mapped all required entities. However, due to missing values in OMOP for required Phenopacket attributes, 10.2 % of records were lost. The use of UMLS-semantic type filtering for ambiguous alignment of individual concepts resulted in 96 % agreement with clinical thinking, increased from 68 % when mapping exclusively by domain correspondence. CONCLUSION This study presents a pipeline to transform data from OMOP to Phenopackets. We identified considerations for the transformation to ensure data quality, handling restrictions for successful Phenopacket validation and discrepant data formats. We identified unmappable Phenopacket attributes that focus on specialty use cases, such as genomics or oncology, which OMOP does not currently support. We introduce UMLS semantic type filtering to resolve ambiguous alignment to Phenopacket entities to be most appropriate for real-world interpretation. We provide a systematic approach to align OMOP and Phenopackets schemas. Our work facilitates future use of Phenopackets in clinical applications by addressing key barriers to interoperability when deriving a Phenopacket from real-world patient data.
Collapse
Affiliation(s)
- Kayla Schiffer-Kane
- Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, USA
| | - Cong Liu
- Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, USA
| | - Tiffany J Callahan
- Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, USA
| | - Casey Ta
- Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, USA
| | - Jordan G Nestor
- Department of Medicine, Division of Nephrology, Columbia University Irving Medical Center, New York, NY, USA
| | - Chunhua Weng
- Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, USA.
| |
Collapse
|
50
|
Loutati R, Ben-Yehuda A, Rosenberg S, Rottenberg Y. Multimodal Machine Learning for Prediction of 30-Day Readmission Risk in Elderly Population. Am J Med 2024; 137:617-628. [PMID: 38588939 DOI: 10.1016/j.amjmed.2024.04.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/12/2024] [Revised: 04/01/2024] [Accepted: 04/01/2024] [Indexed: 04/10/2024]
Abstract
BACKGROUND Readmission within 30 days is a prevalent issue among elderly patients, linked to unfavorable health outcomes. Our objective was to develop and validate multimodal machine learning models for predicting 30-day readmission risk in elderly patients discharged from internal medicine departments. METHODS This was a retrospective cohort study which included elderly patients aged 75 or older, who were hospitalized at the Hadassah Medical Center internal medicine departments between 2014 and 2020. Three machine learning algorithms were developed and employed to predict 30-day readmission risk. The primary measures were predictive model performance scores, specifically area under the receiver operator curve (AUROC), and average precision. RESULTS This study included 19,569 admissions. Of them, 3258 (16.65%) resulted in 30-day readmission. Our 3 proposed models demonstrated high accuracy and precision on an unseen test set, with AUROC values of 0.87, 0.89, and 0.93, respectively, and average precision values of 0.76, 0.78, and 0.81. Feature importance analysis revealed that the number of admissions in the past year, history of 30-day readmission, Charlson score, and admission length were the most influential variables. Notably, the natural language processing score, representing the probability of readmission according to a textual-based model trained on social workers' assessment letters during hospitalization, ranked among the top 10 contributing factors. CONCLUSIONS Leveraging multimodal machine learning offers a promising strategy for identifying elderly patients who are at high risk for 30-day readmission. By identifying these patients, machine learning models may facilitate the effective execution of preventive actions to reduce avoidable readmission incidents.
Collapse
Affiliation(s)
- Ranel Loutati
- Department of Military Medicine and "Tzameret", Faculty of Medicine, Hebrew University of Jerusalem; and the Medical Corps, Israel Defense Forces, Israel; Gaffin Center for Neuro-Oncology, Sharett Institute for Oncology, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Israel; The Wohl Institute for Translational Medicine, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Israel.
| | - Arie Ben-Yehuda
- Department of Internal Medicine, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Israel
| | - Shai Rosenberg
- Gaffin Center for Neuro-Oncology, Sharett Institute for Oncology, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Israel; The Wohl Institute for Translational Medicine, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Israel
| | - Yakir Rottenberg
- Sharett Institute of Oncology, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Israel
| |
Collapse
|