1
|
Zhao Q, Nooner KB, Tapert SF, Adeli E, Pohl KM, Kuceyeski A, Sabuncu MR. The Transition From Homogeneous to Heterogeneous Machine Learning in Neuropsychiatric Research. BIOLOGICAL PSYCHIATRY GLOBAL OPEN SCIENCE 2025; 5:100397. [PMID: 39526023 PMCID: PMC11546160 DOI: 10.1016/j.bpsgos.2024.100397] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2024] [Revised: 09/17/2024] [Accepted: 09/18/2024] [Indexed: 11/16/2024] Open
Abstract
Despite the advantage of neuroimaging-based machine learning (ML) models as pivotal tools for investigating brain-behavior relationships in neuropsychiatric studies, these data-driven predictive approaches have yet to yield substantial, clinically actionable insights for mental health care. A notable impediment lies in the inadequate accommodation of most ML research to the natural heterogeneity within large samples. Although commonly thought of as individual-level analyses, many ML algorithms are unimodal and homogeneous and thus incapable of capturing the potentially heterogeneous relationships between biology and psychopathology. We review the current landscape of computational research targeting population heterogeneity and argue that there is a need to expand from brain subtyping and behavioral phenotyping to analyses that focus on heterogeneity at the relational level. To this end, we review and suggest several existing ML models with the capacity to discern how external environmental and sociodemographic factors moderate the brain-behavior mapping function in a data-driven fashion. These heterogeneous ML models hold promise for enhancing the discovery of individualized brain-behavior associations and advancing precision psychiatry.
Collapse
Affiliation(s)
- Qingyu Zhao
- Department of Radiology, Weill Cornell Medicine, New York, New York
| | - Kate B. Nooner
- Department of Psychology, University of North Carolina Wilmington, Wilmington, North Carolina
| | - Susan F. Tapert
- Department of Psychiatry, University of California San Diego, La Jolla, California
| | - Ehsan Adeli
- Department of Psychiatry & Behavioral Sciences, Stanford University, Stanford, California
- Department of Computer Science, Stanford University, Stanford, California
| | - Kilian M. Pohl
- Department of Psychiatry & Behavioral Sciences, Stanford University, Stanford, California
- Department of Electrical Engineering, Stanford University, Stanford, California
| | - Amy Kuceyeski
- Department of Radiology, Weill Cornell Medicine, New York, New York
| | - Mert R. Sabuncu
- Department of Radiology, Weill Cornell Medicine, New York, New York
- School of Electrical and Computer Engineering, Cornell University and Cornell Tech, New York, New York
| |
Collapse
|
2
|
Xu Z, Scharp D, Hobensack M, Ye J, Zou J, Ding S, Shang J, Topaz M. Machine learning-based infection diagnostic and prognostic models in post-acute care settings: a systematic review. J Am Med Inform Assoc 2025; 32:241-252. [PMID: 39530740 DOI: 10.1093/jamia/ocae278] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2024] [Revised: 10/10/2024] [Accepted: 10/21/2024] [Indexed: 11/16/2024] Open
Abstract
OBJECTIVES This study aims to (1) review machine learning (ML)-based models for early infection diagnostic and prognosis prediction in post-acute care (PAC) settings, (2) identify key risk predictors influencing infection-related outcomes, and (3) examine the quality and limitations of these models. MATERIALS AND METHODS PubMed, Web of Science, Scopus, IEEE Xplore, CINAHL, and ACM digital library were searched in February 2024. Eligible studies leveraged PAC data to develop and evaluate ML models for infection-related risks. Data extraction followed the CHARMS checklist. Quality appraisal followed the PROBAST tool. Data synthesis was guided by the socio-ecological conceptual framework. RESULTS Thirteen studies were included, mainly focusing on respiratory infections and nursing homes. Most used regression models with structured electronic health record data. Since 2020, there has been a shift toward advanced ML algorithms and multimodal data, biosensors, and clinical notes being significant sources of unstructured data. Despite these advances, there is insufficient evidence to support performance improvements over traditional models. Individual-level risk predictors, like impaired cognition, declined function, and tachycardia, were commonly used, while contextual-level predictors were barely utilized, consequently limiting model fairness. Major sources of bias included lack of external validation, inadequate model calibration, and insufficient consideration of data complexity. DISCUSSION AND CONCLUSION Despite the growth of advanced modeling approaches in infection-related models in PAC settings, evidence supporting their superiority remains limited. Future research should leverage a socio-ecological lens for predictor selection and model construction, exploring optimal data modalities and ML model usage in PAC, while ensuring rigorous methodologies and fairness considerations.
Collapse
Affiliation(s)
- Zidu Xu
- School of Nursing, Columbia University, New York, NY 10032, United States
| | - Danielle Scharp
- School of Nursing, Columbia University, New York, NY 10032, United States
| | - Mollie Hobensack
- Icahn School of Medicine at Mount Sinai, New York, NY 10029, United States
| | - Jiancheng Ye
- Weill Cornell Medicine, Cornell University, New York, NY 10065, United States
| | - Jungang Zou
- Department of Biostatistics, Mailman School of Public Health, Columbia University, New York, NY 10032, United States
| | - Sirui Ding
- Bakar Computational Health Sciences Institute, University of California, San Francisco, CA 94158, United States
| | - Jingjing Shang
- School of Nursing, Columbia University, New York, NY 10032, United States
| | - Maxim Topaz
- School of Nursing, Columbia University, New York, NY 10032, United States
- Center for Home Care Policy & Research, VNS Health, New York, NY 10001, United States
- Data Science Institute, Columbia University, New York, NY 10027, United States
| |
Collapse
|
3
|
Gonzalez R, Saha A, Campbell CJ, Nejat P, Lokker C, Norgan AP. Seeing the random forest through the decision trees. Supporting learning health systems from histopathology with machine learning models: Challenges and opportunities. J Pathol Inform 2024; 15:100347. [PMID: 38162950 PMCID: PMC10755052 DOI: 10.1016/j.jpi.2023.100347] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Revised: 10/06/2023] [Accepted: 11/01/2023] [Indexed: 01/03/2024] Open
Abstract
This paper discusses some overlooked challenges faced when working with machine learning models for histopathology and presents a novel opportunity to support "Learning Health Systems" with them. Initially, the authors elaborate on these challenges after separating them according to their mitigation strategies: those that need innovative approaches, time, or future technological capabilities and those that require a conceptual reappraisal from a critical perspective. Then, a novel opportunity to support "Learning Health Systems" by integrating hidden information extracted by ML models from digitalized histopathology slides with other healthcare big data is presented.
Collapse
Affiliation(s)
- Ricardo Gonzalez
- DeGroote School of Business, McMaster University, Hamilton, Ontario, Canada
- Division of Computational Pathology and Artificial Intelligence, Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, United States
| | - Ashirbani Saha
- Department of Oncology, Faculty of Health Sciences, McMaster University, Hamilton, Ontario, Canada
- Escarpment Cancer Research Institute, McMaster University and Hamilton Health Sciences, Hamilton, Ontario, Canada
| | - Clinton J.V. Campbell
- William Osler Health System, Brampton, Ontario, Canada
- Department of Pathology and Molecular Medicine, Faculty of Health Sciences, McMaster University, Hamilton, Ontario, Canada
| | - Peyman Nejat
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN, United States
| | - Cynthia Lokker
- Health Information Research Unit, Department of Health Research Methods, Evidence and Impact, McMaster University, Hamilton, Ontario, Canada
| | - Andrew P. Norgan
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, United States
| |
Collapse
|
4
|
Ragni F, Bovo S, Zen A, Sona D, De Nadai K, Adamo GG, Pellegrini M, Nasini F, Vivarelli C, Tavolato M, Mura M, Parmeggiani F, Jurman G. Session-by-Session Prediction of Anti-Endothelial Growth Factor Injection Needs in Neovascular Age-Related Macular Degeneration Using Optical-Coherence-Tomography-Derived Features and Machine Learning. Diagnostics (Basel) 2024; 14:2609. [PMID: 39682518 DOI: 10.3390/diagnostics14232609] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2024] [Revised: 11/15/2024] [Accepted: 11/19/2024] [Indexed: 12/18/2024] Open
Abstract
BACKGROUND/OBJECTIVES Neovascular age-related macular degeneration (nAMD) is a retinal disorder leading to irreversible central vision loss. The pro-re-nata (PRN) treatment for nAMD involves frequent intravitreal injections of anti-VEGF medications, placing a burden on patients and healthcare systems. Predicting injections needs at each monitoring session could optimize treatment outcomes and reduce unnecessary interventions. METHODS To achieve these aims, machine learning (ML) models were evaluated using different combinations of clinical variables, including retinal thickness and volume, best-corrected visual acuity, and features derived from macular optical coherence tomography (OCT). A "Leave Some Subjects Out" (LSSO) nested cross-validation approach ensured robust evaluation. Moreover, the SHapley Additive exPlanations (SHAP) analysis was employed to quantify the contribution of each feature to model predictions. RESULTS Results demonstrated that models incorporating both structural and functional features achieved high classification accuracy in predicting injection necessity (AUC = 0.747 ± 0.046, MCC = 0.541 ± 0.073). Moreover, the explainability analysis identified as key predictors both subretinal and intraretinal fluid, alongside central retinal thickness. CONCLUSIONS These findings suggest that session-by-session prediction of injection needs in nAMD patients is feasible, even without processing the entire OCT image. The proposed ML framework has the potential to be integrated into routine clinical workflows, thereby optimizing nAMD therapeutic management.
Collapse
Affiliation(s)
- Flavio Ragni
- Data Science for Health Unit, Fondazione Bruno Kessler, 38123 Trento, Italy
| | - Stefano Bovo
- Data Science for Health Unit, Fondazione Bruno Kessler, 38123 Trento, Italy
| | - Andrea Zen
- Data Science for Health Unit, Fondazione Bruno Kessler, 38123 Trento, Italy
| | - Diego Sona
- Data Science for Health Unit, Fondazione Bruno Kessler, 38123 Trento, Italy
| | - Katia De Nadai
- Department of Translational Medicine and for Romagna, University of Ferrara, 44121 Ferrara, Italy
- ERN-EYE Network-Center Retinitis Pigmentosa of Veneto Region, Camposampiero Hospital, 35012 Padua, Italy
| | - Ginevra Giovanna Adamo
- Department of Translational Medicine and for Romagna, University of Ferrara, 44121 Ferrara, Italy
- Unit of Ophthalmology, Azienda Ospedaliero Universitaria di Ferrara, 44100 Ferrara, Italy
| | - Marco Pellegrini
- Department of Translational Medicine and for Romagna, University of Ferrara, 44121 Ferrara, Italy
- Unit of Ophthalmology, Azienda Ospedaliero Universitaria di Ferrara, 44100 Ferrara, Italy
| | - Francesco Nasini
- Unit of Ophthalmology, Azienda Ospedaliero Universitaria di Ferrara, 44100 Ferrara, Italy
| | - Chiara Vivarelli
- Department of Translational Medicine and for Romagna, University of Ferrara, 44121 Ferrara, Italy
| | - Marco Tavolato
- ERN-EYE Network-Center Retinitis Pigmentosa of Veneto Region, Camposampiero Hospital, 35012 Padua, Italy
- Unit of Ophthalmology, Azienda ULSS Euganea di Padova, 35131 Padova, Italy
| | - Marco Mura
- Department of Translational Medicine and for Romagna, University of Ferrara, 44121 Ferrara, Italy
- King Khaled Eye Specialist Hospital, Riyadh 12211, Saudi Arabia
| | - Francesco Parmeggiani
- Department of Translational Medicine and for Romagna, University of Ferrara, 44121 Ferrara, Italy
- ERN-EYE Network-Center Retinitis Pigmentosa of Veneto Region, Camposampiero Hospital, 35012 Padua, Italy
| | - Giuseppe Jurman
- Data Science for Health Unit, Fondazione Bruno Kessler, 38123 Trento, Italy
| |
Collapse
|
5
|
Phillips E, O'Donoghue O, Zhang Y, Tsimpos P, Mallinger LA, Chatzidakis S, Pohlmann J, Du Y, Kim I, Song J, Brush B, Smirnakis S, Ong CJ, Orfanoudaki A. HELMET: A Hybrid Machine Learning Framework for Real-Time Prediction of Edema Trajectory in Large Middle Cerebral Artery Stroke. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.11.13.24317229. [PMID: 39606388 PMCID: PMC11601687 DOI: 10.1101/2024.11.13.24317229] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/29/2024]
Abstract
Malignant cerebral edema occurs when brain swelling displaces and compresses vital midline structures within the first week of a large middle cerebral artery stroke. Early interventions such as hyperosmolar therapy or surgical decompression may reverse secondary injury but must be administered judiciously. To optimize treatment and reduce secondary damage, clinicians need strategies to frequently and quantitatively assess the trajectory of edema using updated, relevant information. However, existing risk assessment tools are limited by the absence of structured records capturing the evolution of edema and typically estimate risk at a single time point early in the admission, therefore failing to account for changes in variables over the following hours or days. To address this, we developed and validated dynamic machine learning models capable of accurately predicting the severity of midline structure displacement, an established indicator of malignant edema, in real-time. Our models can provide updated estimations as frequently as every hour, using data from structured time-varying patient records, radiographic text, and human-curated neurological characteristics. Our work resulted in two novel multi-class classification models, collectively named Hybrid Ensemble Learning Models for Edema Trajectory (HELMET), predicting the progression of midline shift over 8-hour (HELMET-8) and 24-hour windows (HELMET-24), respectively. HELMET combines transformer-based large language models with supervised ensemble learning, demonstrating the value of merging human expertise and multimodal health records in developing clinical risk scores. Both models were trained on a retrospective cohort of 15,696 observations from 623 patients hospitalized with large middle cerebral artery ischemic stroke and were externally validated using 3,713 observations from 60 patients at a separate hospital system. Our HELMET models are accurate and generalize effectively to diverse populations, achieving a cross-validated mean area under the receiver operating characteristic score of 96.6% in the derivation cohort and 92.5% in the external validation cohort. Moreover, our approach provides a framework for developing hybrid risk prediction models that integrate both human-extracted and algorithm-derived multi-modal inputs. Our work enables accurate estimation of complex, dynamic, and highly specific clinical targets, such as midline shift, in real-time, even when relevant structured information is limited in electronic health record databases.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Yili Du
- Boston University School of Public Health
| | - Ivy Kim
- Boston Medical Center, Department of Neurology
| | - Jonathan Song
- Boston University Chobanian & Avedisian School of Medicine
| | | | - Stelios Smirnakis
- Brigham & Women's Hospital, Department of Neurology
- Harvard Medical School
- Jamaica Plain Veterans Administration Hospital, Department of Neurology
| | - Charlene J Ong
- Boston Medical Center, Department of Neurology
- Boston University Chobanian & Avedisian School of Medicine
| | | |
Collapse
|
6
|
Yammouri G, Ait Lahcen A. AI-Reinforced Wearable Sensors and Intelligent Point-of-Care Tests. J Pers Med 2024; 14:1088. [PMID: 39590580 PMCID: PMC11595538 DOI: 10.3390/jpm14111088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2024] [Revised: 10/25/2024] [Accepted: 10/28/2024] [Indexed: 11/28/2024] Open
Abstract
Artificial intelligence (AI) techniques offer great potential to advance point-of-care testing (POCT) and wearable sensors for personalized medicine applications. This review explores the recent advances and the transformative potential of the use of AI in improving wearables and POCT. The integration of AI significantly contributes to empowering these tools and enables continuous monitoring, real-time analysis, and rapid diagnostics, thus enhancing patient outcomes and healthcare efficiency. Wearable sensors powered by AI models offer tremendous opportunities for precise and non-invasive tracking of physiological conditions that are essential for early disease detection and personalized treatments. AI-empowered POCT facilitates rapid, accurate diagnostics, making these medical testing kits accessible and available even in resource-limited settings. This review discusses the key advances in AI applications for data processing, sensor fusion, and multivariate analytics, highlighting case examples that exhibit their impact in different medical scenarios. In addition, the challenges associated with data privacy, regulatory approvals, and technology integrations into the existing healthcare system have been overviewed. The outlook emphasizes the urgent need for continued innovation in AI-driven health technologies to overcome these challenges and to fully achieve the potential of these techniques to revolutionize personalized medicine.
Collapse
Affiliation(s)
- Ghita Yammouri
- Chemical Analysis & Biosensors, Process Engineering and Environment Laboratory, Faculty of Science and Techniques, Hassan II University of Casablanca, Mohammedia 28806, Morocco;
| | | |
Collapse
|
7
|
Benito GV, Goldberg X, Brachowicz N, Castaño-Vinyals G, Blay N, Espinosa A, Davidhi F, Torres D, Kogevinas M, de Cid R, Petrone P. Machine learning for anxiety and depression profiling and risk assessment in the aftermath of an emergency. Artif Intell Med 2024; 157:102991. [PMID: 39383706 DOI: 10.1016/j.artmed.2024.102991] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2024] [Revised: 09/23/2024] [Accepted: 09/26/2024] [Indexed: 10/11/2024]
Abstract
BACKGROUND & OBJECTIVES Mental health disorders pose an increasing public health challenge worsened by the COVID-19 pandemic. The pandemic highlighted gaps in preparedness, emphasizing the need for early identification of at-risk groups and targeted interventions. This study aims to develop a risk assessment tool for anxiety, depression, and self-perceived stress using machine learning (ML) and explainable AI to identify key risk factors and stratify the population into meaningful risk profiles. METHODS We utilized a cohort of 9291 individuals from Northern Spain, with extensive post-COVID-19 mental health surveys. ML classification algorithms predicted depression, anxiety, and self-reported stress in three classes: healthy, mild, and severe outcomes. A novel combination of SHAP (SHapley Additive exPlanations) and UMAP (Uniform Manifold Approximation and Projection) was employed to interpret model predictions and facilitate the identification of high-risk phenotypic clusters. RESULTS The mean macro-averaged one-vs-one AUROC was 0.77 (± 0.01) for depression, 0.72 (± 0.01) for anxiety, and 0.73 (± 0.02) for self-perceived stress. Key risk factors included poor self-reported health, chronic mental health conditions, and poor social support. High-risk profiles, such as women with reduced sleep hours, were identified for self-perceived stress. Binary classification of healthy vs. at-risk classes yielded F1-Scores over 0.70. CONCLUSIONS Combining SHAP with UMAP for risk profile stratification offers valuable insights for developing effective interventions and shaping public health policies. This data-driven approach to mental health preparedness, when validated in real-world scenarios, can significantly address the mental health impact of public health crises like COVID-19.
Collapse
Affiliation(s)
- Guillermo Villanueva Benito
- Barcelona Institute for Global Health (ISGlobal), C/ del Dr. Aiguader, 88, Barcelona 08003, Catalonia, Spain; Universitat Pompeu Fabra (UPF), Spain
| | - Ximena Goldberg
- Barcelona Institute for Global Health (ISGlobal), C/ del Dr. Aiguader, 88, Barcelona 08003, Catalonia, Spain
| | - Nicolai Brachowicz
- Barcelona Institute for Global Health (ISGlobal), C/ del Dr. Aiguader, 88, Barcelona 08003, Catalonia, Spain
| | - Gemma Castaño-Vinyals
- Barcelona Institute for Global Health (ISGlobal), C/ del Dr. Aiguader, 88, Barcelona 08003, Catalonia, Spain; Universitat Pompeu Fabra (UPF), Spain; CIBER de Epidemiología y Salud Pública (CIBERESP), Spain
| | - Natalia Blay
- Genomes for Life-GCAT lab. CORE program. Germans Trias I Pujol Research Institute (IGTP), Camí de les Escoles, s/n, Badalona 08916, Catalonia, Spain
| | - Ana Espinosa
- Barcelona Institute for Global Health (ISGlobal), C/ del Dr. Aiguader, 88, Barcelona 08003, Catalonia, Spain
| | - Flavia Davidhi
- Barcelona Institute for Global Health (ISGlobal), C/ del Dr. Aiguader, 88, Barcelona 08003, Catalonia, Spain
| | - Diego Torres
- Barcelona Institute for Global Health (ISGlobal), C/ del Dr. Aiguader, 88, Barcelona 08003, Catalonia, Spain
| | - Manolis Kogevinas
- Barcelona Institute for Global Health (ISGlobal), C/ del Dr. Aiguader, 88, Barcelona 08003, Catalonia, Spain
| | - Rafael de Cid
- Genomes for Life-GCAT lab. CORE program. Germans Trias I Pujol Research Institute (IGTP), Camí de les Escoles, s/n, Badalona 08916, Catalonia, Spain
| | - Paula Petrone
- Barcelona Institute for Global Health (ISGlobal), C/ del Dr. Aiguader, 88, Barcelona 08003, Catalonia, Spain.
| |
Collapse
|
8
|
Baharoon M, Almatar H, Alduhayan R, Aldebasi T, Alahmadi B, Bokhari Y, Alawad M, Almazroa A, Aljouie A. HyMNet: A Multimodal Deep Learning System for Hypertension Prediction Using Fundus Images and Cardiometabolic Risk Factors. Bioengineering (Basel) 2024; 11:1080. [PMID: 39593740 PMCID: PMC11591283 DOI: 10.3390/bioengineering11111080] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2024] [Revised: 10/22/2024] [Accepted: 10/22/2024] [Indexed: 11/28/2024] Open
Abstract
STUDY OBJECTIVES This study aimed to develop a multimodal deep learning (MMDL) system called HyMNet, integrating fundus images and cardiometabolic factors (age and sex) to enhance hypertension (HTN) detection. METHODS HyMNet employed RETFound, a model pretrained on 1.6 million retinal images, for the fundus data, in conjunction with a fully connected neural network for age and sex. The two pathways were jointly trained by joining their feature vectors into a fusion network. The system was trained on 5016 retinal images from 1243 individuals provided by the Saudi Ministry of National Guard Health Affairs. The influence of diabetes on HTN detection was also assessed. RESULTS HyMNet surpassed the unimodal system, achieving an F1 score of 0.771 compared to 0.745 for the unimodal model. For diabetic patients, the F1 score was 0.796, while it was 0.466 for non-diabetic patients. CONCLUSIONS HyMNet exhibited superior performance relative to unimodal approaches, with an F1 score of 0.771 for HyMNet compared to 0.752 for models trained on demographic data alone, underscoring the advantages of MMDL systems in HTN detection. The findings indicate that diabetes significantly impacts HTN prediction, enhancing detection accuracy among diabetic patients. Utilizing MMDL with diverse data sources could improve clinical applicability and generalization.
Collapse
Affiliation(s)
- Mohammed Baharoon
- AI and Bioinformatics Department, King Abdullah International Medical Research Center, King Saud bin Abdulaziz University for Health Sciences (KSAU-HS), Riyadh 11481, Saudi Arabia; (M.B.); (H.A.); (R.A.); (Y.B.)
- Data Management Department, King Abdullah International Medical Research Center, King Saud bin Abdulaziz University for Health Sciences (KSAU-HS), Riyadh 11481, Saudi Arabia
| | - Hessa Almatar
- AI and Bioinformatics Department, King Abdullah International Medical Research Center, King Saud bin Abdulaziz University for Health Sciences (KSAU-HS), Riyadh 11481, Saudi Arabia; (M.B.); (H.A.); (R.A.); (Y.B.)
| | - Reema Alduhayan
- AI and Bioinformatics Department, King Abdullah International Medical Research Center, King Saud bin Abdulaziz University for Health Sciences (KSAU-HS), Riyadh 11481, Saudi Arabia; (M.B.); (H.A.); (R.A.); (Y.B.)
| | - Tariq Aldebasi
- Ophthalmology Department, King Abdulaziz Medical City, Ministry of National Guard Health Affairs, Riyadh 14611, Saudi Arabia;
| | - Badr Alahmadi
- Ophthalmology Department, Prince Mohammad bin Abdulaziz Hospital, Ministry of National Guard Health Affairs, Al Madinah 42324, Saudi Arabia;
| | - Yahya Bokhari
- AI and Bioinformatics Department, King Abdullah International Medical Research Center, King Saud bin Abdulaziz University for Health Sciences (KSAU-HS), Riyadh 11481, Saudi Arabia; (M.B.); (H.A.); (R.A.); (Y.B.)
- College of Public Health and Health Informatics, King Saud bin Abdulaziz University for Health Sciences (KSAU-HS), Riyadh 14815, Saudi Arabia
| | - Mohammed Alawad
- National Center for Artificial Intelligence (NCAI), Saudi Data and Artificial Intelligence Authority (SDAIA), Riyadh 12382, Saudi Arabia;
| | - Ahmed Almazroa
- AI and Bioinformatics Department, King Abdullah International Medical Research Center, King Saud bin Abdulaziz University for Health Sciences (KSAU-HS), Riyadh 11481, Saudi Arabia; (M.B.); (H.A.); (R.A.); (Y.B.)
| | - Abdulrhman Aljouie
- AI and Bioinformatics Department, King Abdullah International Medical Research Center, King Saud bin Abdulaziz University for Health Sciences (KSAU-HS), Riyadh 11481, Saudi Arabia; (M.B.); (H.A.); (R.A.); (Y.B.)
- Data Management Department, King Abdullah International Medical Research Center, King Saud bin Abdulaziz University for Health Sciences (KSAU-HS), Riyadh 11481, Saudi Arabia
- College of Public Health and Health Informatics, King Saud bin Abdulaziz University for Health Sciences (KSAU-HS), Riyadh 14815, Saudi Arabia
| |
Collapse
|
9
|
Natraj S, Kojovic N, Maillart T, Schaer M. Video-audio neural network ensemble for comprehensive screening of autism spectrum disorder in young children. PLoS One 2024; 19:e0308388. [PMID: 39361665 PMCID: PMC11449333 DOI: 10.1371/journal.pone.0308388] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2024] [Accepted: 07/12/2024] [Indexed: 10/05/2024] Open
Abstract
A timely diagnosis of autism is paramount to allow early therapeutic intervention in preschoolers. Deep Learning tools have been increasingly used to identify specific autistic symptoms. But they also offer opportunities for broad automated detection of autism at an early age. Here, we leverage a multi-modal approach by combining two neural networks trained on video and audio features of semi-standardized social interactions in a sample of 160 children aged 1 to 5 years old. Our ensemble model performs with an accuracy of 82.5% (F1 score: 0.816, Precision: 0.775, Recall: 0.861) for screening Autism Spectrum Disorders (ASD). Additional combinations of our model were developed to achieve higher specificity (92.5%, i.e., few false negatives) or sensitivity (90%, i.e. few false positives). Finally, we found a relationship between the neural network modalities and specific audio versus video ASD characteristics, bringing evidence that our neural network implementation was effective in taking into account different features that are currently standardized under the gold standard ASD assessment.
Collapse
Affiliation(s)
- Shreyasvi Natraj
- Psychiatry Department, Faculty of Medicine, University of Geneva, Geneva, Switzerland
| | - Nada Kojovic
- Psychiatry Department, Faculty of Medicine, University of Geneva, Geneva, Switzerland
| | - Thomas Maillart
- Psychiatry Department, Faculty of Medicine, University of Geneva, Geneva, Switzerland
- Geneva School of Economics and Management, University of Geneva, Geneva, Switzerland
| | - Marie Schaer
- Psychiatry Department, Faculty of Medicine, University of Geneva, Geneva, Switzerland
| |
Collapse
|
10
|
Nedadur R, Bhatt N, Liu T, Chu MWA, McCarthy PM, Kline A. The Emerging and Important Role of Artificial Intelligence in Cardiac Surgery. Can J Cardiol 2024; 40:1865-1879. [PMID: 39098601 DOI: 10.1016/j.cjca.2024.07.027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2024] [Revised: 07/29/2024] [Accepted: 07/29/2024] [Indexed: 08/06/2024] Open
Abstract
Artificial Intelligence (AI) has greatly affected our everyday lives and holds great promise to change the landscape of medicine. AI is particularly positioned to improve care for the increasingly complex patients undergoing cardiac surgery using the immense amount of data generated in the course of their care. When deployed, AI can be used to analyze this information at the patient's bedside more expediently and accurately, all while providing new insights. This review summarizes the current applications of AI in cardiac surgery from the vantage point of a patient's journey. Applications of AI include preoperative risk assessment, intraoperative planning, postoperative patient care, and outpatient telemonitoring, encompassing the spectrum of cardiac surgical care. Offloading of administrative processes and enhanced experience with information gathering also represent a unique and under-represented avenue for future use of AI. As clinicians, understanding the nomenclature and applications of AI is important to contextualize issues, to ensure problem-driven solutions, and for clinical benefit. Precision medicine, and thus clinically relevant AI, remains dependent on data curation and warehousing to gather insights from large multicentre repositories while treating privacy with the utmost importance. AI tasks should not be siloed but rather holistically integrated into clinical workflow to retain context and relevance. As cardiac surgeons, AI allows us to look forward to a bright future of more efficient use of our clinical expertise toward high-level decision making and technical prowess.
Collapse
Affiliation(s)
- Rashmi Nedadur
- Feinberg School of Medicine, Division of Cardiac Surgery, Northwestern University, Chicago, Illinois, USA; Center for Artificial Intelligence, Bluhm Cardiovascular Institute, Northwestern Medicine, Chicago, Illinois, USA.
| | - Nitish Bhatt
- Peter Munk Cardiac Center, Toronto General Hospital, Toronto, Ontario, Canada
| | - Tom Liu
- Feinberg School of Medicine, Division of Cardiac Surgery, Northwestern University, Chicago, Illinois, USA; Center for Artificial Intelligence, Bluhm Cardiovascular Institute, Northwestern Medicine, Chicago, Illinois, USA
| | | | - Patrick M McCarthy
- Feinberg School of Medicine, Division of Cardiac Surgery, Northwestern University, Chicago, Illinois, USA; Center for Artificial Intelligence, Bluhm Cardiovascular Institute, Northwestern Medicine, Chicago, Illinois, USA
| | - Adrienne Kline
- Feinberg School of Medicine, Division of Cardiac Surgery, Northwestern University, Chicago, Illinois, USA; Center for Artificial Intelligence, Bluhm Cardiovascular Institute, Northwestern Medicine, Chicago, Illinois, USA
| |
Collapse
|
11
|
Chinni BK, Manlhiot C. Emerging Analytical Approaches for Personalized Medicine Using Machine Learning In Pediatric and Congenital Heart Disease. Can J Cardiol 2024; 40:1880-1896. [PMID: 39097187 DOI: 10.1016/j.cjca.2024.07.026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2024] [Revised: 07/25/2024] [Accepted: 07/26/2024] [Indexed: 08/05/2024] Open
Abstract
Precision and personalized medicine, the process by which patient management is tailored to individual circumstances, are now terms that are familiar to cardiologists, despite it still being an emerging field. Although precision medicine relies most often on the underlying biology and pathophysiology of a patient's condition, personalized medicine relies on digital biomarkers generated through algorithms. Given the complexity of the underlying data, these digital biomarkers are most often generated through machine-learning algorithms. There are a number of analytic considerations regarding the creation of digital biomarkers that are discussed in this review, including data preprocessing, time dependency and gating, dimensionality reduction, and novel methods, both in the realm of supervised and unsupervised machine learning. Some of these considerations, such as sample size requirements and measurements of model performance, are particularly challenging in small and heterogeneous populations with rare outcomes such as children with congenital heart disease. Finally, we review analytic considerations for the deployment of digital biomarkers in clinical settings, including the emerging field of clinical artificial intelligence (AI) operations, computational needs for deployment, efforts to increase the explainability of AI, algorithmic drift, and the needs for distributed surveillance and federated learning. We conclude this review by discussing a recent simulation study that shows that, despite these analytic challenges and complications, the use of digital biomarkers in managing clinical care might have substantial benefits regarding individual patient outcomes.
Collapse
Affiliation(s)
- Bhargava K Chinni
- The Blalock-Taussig-Thomas Pediatric and Congenital Heart Center, Department of Pediatrics, Johns Hopkins School of Medicine, Johns Hopkins University, Baltimore, Maryland, USA
| | - Cedric Manlhiot
- The Blalock-Taussig-Thomas Pediatric and Congenital Heart Center, Department of Pediatrics, Johns Hopkins School of Medicine, Johns Hopkins University, Baltimore, Maryland, USA; Research Institute, SickKids Hospital, Department of Pediatrics, University of Toronto, Toronto, Ontario, Canada.
| |
Collapse
|
12
|
Shurrab S, Guerra-Manzanares A, E Shamout F. Multimodal masked siamese network improves chest X-ray representation learning. Sci Rep 2024; 14:22516. [PMID: 39341871 PMCID: PMC11439023 DOI: 10.1038/s41598-024-74043-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2024] [Accepted: 09/23/2024] [Indexed: 10/01/2024] Open
Abstract
Self-supervised learning methods for medical images primarily rely on the imaging modality during pretraining. Although such approaches deliver promising results, they do not take advantage of the associated patient or scan information collected within Electronic Health Records (EHR). This study aims to develop a multimodal pretraining approach for chest radiographs that considers EHR data incorporation as an additional modality that during training. We propose to incorporate EHR data during self-supervised pretraining with a Masked Siamese Network (MSN) to enhance the quality of chest radiograph representations. We investigate three types of EHR data, including demographic, scan metadata, and inpatient stay information. We evaluate the multimodal MSN on three publicly available chest X-ray datasets, MIMIC-CXR, CheXpert, and NIH-14, using two vision transformer (ViT) backbones, specifically ViT-Tiny and ViT-Small. In assessing the quality of the representations through linear evaluation, our proposed method demonstrates significant improvement compared to vanilla MSN and state-of-the-art self-supervised learning baselines. In particular, our proposed method achieves an improvement of of 2% in the Area Under the Receiver Operating Characteristic Curve (AUROC) compared to vanilla MSN and 5% to 8% compared to other baselines, including uni-modal ones. Furthermore, our findings reveal that demographic features provide the most significant performance improvement. Our work highlights the potential of EHR-enhanced self-supervised pretraining for medical imaging and opens opportunities for future research to address limitations in existing representation learning methods for other medical imaging modalities, such as neuro-, ophthalmic, and sonar imaging.
Collapse
Affiliation(s)
- Saeed Shurrab
- New York University Abu Dhabi, Computer Engineering, Abu Dhabi, 129188, UAE
| | | | - Farah E Shamout
- New York University Abu Dhabi, Computer Engineering, Abu Dhabi, 129188, UAE.
| |
Collapse
|
13
|
Sun D, Macedonia C, Chen Z, Chandrasekaran S, Najarian K, Zhou S, Cernak T, Ellingrod VL, Jagadish HV, Marini B, Pai M, Violi A, Rech JC, Wang S, Li Y, Athey B, Omenn GS. Can Machine Learning Overcome the 95% Failure Rate and Reality that Only 30% of Approved Cancer Drugs Meaningfully Extend Patient Survival? J Med Chem 2024; 67:16035-16055. [PMID: 39253942 DOI: 10.1021/acs.jmedchem.4c01684] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/11/2024]
Abstract
Despite implementing hundreds of strategies, cancer drug development suffers from a 95% failure rate over 30 years, with only 30% of approved cancer drugs extending patient survival beyond 2.5 months. Adding more criteria without eliminating nonessential ones is impractical and may fall into the "survivorship bias" trap. Machine learning (ML) models may enhance efficiency by saving time and cost. Yet, they may not improve success rate without identifying the root causes of failure. We propose a "STAR-guided ML system" (structure-tissue/cell selectivity-activity relationship) to enhance success rate and efficiency by addressing three overlooked interdependent factors: potency/specificity to the on/off-targets determining efficacy in tumors at clinical doses, on/off-target-driven tissue/cell selectivity influencing adverse effects in the normal organs at clinical doses, and optimal clinical doses balancing efficacy/safety as determined by potency/specificity and tissue/cell selectivity. STAR-guided ML models can directly predict clinical dose/efficacy/safety from five features to design/select the best drugs, enhancing success and efficiency of cancer drug development.
Collapse
Affiliation(s)
| | | | - Zhigang Chen
- LabBotics.ai, Palo Alto, California 94303, United States
| | | | | | - Simon Zhou
- Aurinia Pharmaceuticals Inc., Rockville, Maryland 20850, United States
| | | | | | | | | | | | | | | | | | - Yan Li
- Translational Medicine and Clinical Pharmacology, Bristol Myers Squibb, Summit, New Jersey 07901, United States
| | | | | |
Collapse
|
14
|
Thangaraj PM, Benson SH, Oikonomou EK, Asselbergs FW, Khera R. Cardiovascular care with digital twin technology in the era of generative artificial intelligence. Eur Heart J 2024; 45:ehae619. [PMID: 39322420 PMCID: PMC11638093 DOI: 10.1093/eurheartj/ehae619] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/14/2024] [Revised: 07/16/2024] [Accepted: 09/01/2024] [Indexed: 09/27/2024] Open
Abstract
Digital twins, which are in silico replications of an individual and its environment, have advanced clinical decision-making and prognostication in cardiovascular medicine. The technology enables personalized simulations of clinical scenarios, prediction of disease risk, and strategies for clinical trial augmentation. Current applications of cardiovascular digital twins have integrated multi-modal data into mechanistic and statistical models to build physiologically accurate cardiac replicas to enhance disease phenotyping, enrich diagnostic workflows, and optimize procedural planning. Digital twin technology is rapidly evolving in the setting of newly available data modalities and advances in generative artificial intelligence, enabling dynamic and comprehensive simulations unique to an individual. These twins fuse physiologic, environmental, and healthcare data into machine learning and generative models to build real-time patient predictions that can model interactions with the clinical environment to accelerate personalized patient care. This review summarizes digital twins in cardiovascular medicine and their potential future applications by incorporating new personalized data modalities. It examines the technical advances in deep learning and generative artificial intelligence that broaden the scope and predictive power of digital twins. Finally, it highlights the individual and societal challenges as well as ethical considerations that are essential to realizing the future vision of incorporating cardiology digital twins into personalized cardiovascular care.
Collapse
Affiliation(s)
- Phyllis M Thangaraj
- Section of Cardiology, Department of Internal Medicine, Yale School of Medicine, 789 Howard Ave., New Haven, CT, USA
| | - Sean H Benson
- Department of Cardiology, Amsterdam Cardiovascular Sciences, Amsterdam University Medical Center, University of Amsterdam, Amsterdam, Netherlands
| | - Evangelos K Oikonomou
- Section of Cardiology, Department of Internal Medicine, Yale School of Medicine, 789 Howard Ave., New Haven, CT, USA
| | - Folkert W Asselbergs
- Department of Cardiology, Amsterdam Cardiovascular Sciences, Amsterdam University Medical Center, University of Amsterdam, Amsterdam, Netherlands
- Institute of Health Informatics, University College London, London, UK
- The National Institute for Health Research University College London Hospitals Biomedical Research Center, University College London, London, UK
| | - Rohan Khera
- Section of Cardiology, Department of Internal Medicine, Yale School of Medicine, 789 Howard Ave., New Haven, CT, USA
- Section of Health Informatics, Department of Biostatistics, Yale School of Public Health, 47 College St., New Haven, CT, USA
- Department of Biomedical Informatics and Data Science, Yale School of Medicine, 100 College St. Fl 9, New Haven, CT, USA
- Center for Outcomes Research and Evaluation, Yale-New Haven Hospital, 195 Church St. Fl 6, New Haven, CT 06510, USA
| |
Collapse
|
15
|
AlSaad R, Abd-Alrazaq A, Boughorbel S, Ahmed A, Renault MA, Damseh R, Sheikh J. Multimodal Large Language Models in Health Care: Applications, Challenges, and Future Outlook. J Med Internet Res 2024; 26:e59505. [PMID: 39321458 PMCID: PMC11464944 DOI: 10.2196/59505] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2024] [Revised: 08/07/2024] [Accepted: 08/20/2024] [Indexed: 09/27/2024] Open
Abstract
In the complex and multidimensional field of medicine, multimodal data are prevalent and crucial for informed clinical decisions. Multimodal data span a broad spectrum of data types, including medical images (eg, MRI and CT scans), time-series data (eg, sensor data from wearable devices and electronic health records), audio recordings (eg, heart and respiratory sounds and patient interviews), text (eg, clinical notes and research articles), videos (eg, surgical procedures), and omics data (eg, genomics and proteomics). While advancements in large language models (LLMs) have enabled new applications for knowledge retrieval and processing in the medical field, most LLMs remain limited to processing unimodal data, typically text-based content, and often overlook the importance of integrating the diverse data modalities encountered in clinical practice. This paper aims to present a detailed, practical, and solution-oriented perspective on the use of multimodal LLMs (M-LLMs) in the medical field. Our investigation spanned M-LLM foundational principles, current and potential applications, technical and ethical challenges, and future research directions. By connecting these elements, we aimed to provide a comprehensive framework that links diverse aspects of M-LLMs, offering a unified vision for their future in health care. This approach aims to guide both future research and practical implementations of M-LLMs in health care, positioning them as a paradigm shift toward integrated, multimodal data-driven medical practice. We anticipate that this work will spark further discussion and inspire the development of innovative approaches in the next generation of medical M-LLM systems.
Collapse
Affiliation(s)
- Rawan AlSaad
- Weill Cornell Medicine-Qatar, Education City, Doha, Qatar
| | | | - Sabri Boughorbel
- Qatar Computing Research Institute, Hamad Bin Khalifa University, Doha, Qatar
| | - Arfan Ahmed
- Weill Cornell Medicine-Qatar, Education City, Doha, Qatar
| | | | - Rafat Damseh
- Department of Computer Science and Software Engineering, United Arab Emirates University, Al Ain, United Arab Emirates
| | - Javaid Sheikh
- Weill Cornell Medicine-Qatar, Education City, Doha, Qatar
| |
Collapse
|
16
|
Mess F, Blaschke S, Gebhard D, Friedrich J. Precision prevention in occupational health: a conceptual analysis and development of a unified understanding and an integrative framework. Front Public Health 2024; 12:1444521. [PMID: 39360261 PMCID: PMC11445082 DOI: 10.3389/fpubh.2024.1444521] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2024] [Accepted: 09/02/2024] [Indexed: 10/04/2024] Open
Abstract
Introduction Precision prevention implements highly precise, tailored health interventions for individuals by directly addressing personal and environmental determinants of health. However, precision prevention does not yet appear to be fully established in occupational health. There are numerous understandings and conceptual approaches, but these have not yet been systematically presented or synthesized. Therefore, this conceptual analysis aims to propose a unified understanding and develop an integrative conceptual framework for precision prevention in occupational health. Methods Firstly, to systematically present definitions and frameworks of precision prevention in occupational health, six international databases were searched for studies published between January 2010 and January 2024 that used the term precision prevention or its synonyms in the context of occupational health. Secondly, a qualitative content analysis was conducted to analyze the existing definitions and propose a unified understanding. Thirdly, based on the identified frameworks, a multi-stage exploratory development process was applied to develop and propose an integrative conceptual framework for precision prevention in occupational health. Results After screening 3,681 articles, 154 publications were reviewed, wherein 29 definitions of precision prevention and 64 different frameworks were found, which can be summarized in eight higher-order categories. The qualitative content analysis revealed seven themes and illustrated many different wordings. The proposed unified understanding of precision prevention in occupational health takes up the identified themes. It includes, among other things, a contrast to a "one-size-fits-all approach" with a risk- and resource-oriented data collection and innovative data analytics with profiling to provide and improve tailored interventions. The developed and proposed integrative conceptual framework comprises three overarching stages: (1) data generation, (2) data management lifecycle and (3) interventions (development, implementation and adaptation). Discussion Although there are already numerous studies on precision prevention in occupational health, this conceptual analysis offers, for the first time, a proposal for a unified understanding and an integrative conceptual framework. However, the proposed unified understanding and the developed integrative conceptual framework should only be seen as an initial proposal that should be critically discussed and further developed to expand and strengthen both research on precision prevention in occupational health and its practical application in the workplace.
Collapse
Affiliation(s)
- Filip Mess
- Department Health and Sport Sciences, TUM School of Medicine and Health, Technical University of Munich, Munich, Germany
| | | | | | - Julian Friedrich
- Department Health and Sport Sciences, TUM School of Medicine and Health, Technical University of Munich, Munich, Germany
| |
Collapse
|
17
|
Cirincione A, Lynch K, Bennett J, Choupan J, Varghese B, Sheikh-Bahaei N, Pandey G. Prediction of future dementia among patients with mild cognitive impairment (MCI) by integrating multimodal clinical data. Heliyon 2024; 10:e36728. [PMID: 39281465 PMCID: PMC11399681 DOI: 10.1016/j.heliyon.2024.e36728] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Revised: 08/20/2024] [Accepted: 08/21/2024] [Indexed: 09/18/2024] Open
Abstract
Efficiently and objectively analyzing the complex, diverse multimodal data collected from patients at risk for dementia can be difficult in the clinical setting, contributing to high rates of underdiagnosis or misdiagnosis of this serious disorder. Patients with mild cognitive impairment (MCI) are especially at risk of developing dementia in the future. This study evaluated the ability of multi-modal machine learning (ML) methods, especially the Ensemble Integration (EI) framework, to predict future dementia development among patients with MCI. EI is a machine learning framework designed to leverage complementarity and consensus in multimodal data, which may not be adequately captured by methods used by prior dementia-related prediction studies. We tested EI's ability to predict future dementia development among MCI patients using multimodal clinical and imaging data, such as neuroanatomical measurements from structural magnetic resonance imaging (MRI) and positron emission tomography (PET) scans, from The Alzheimer's Disease Prediction of Longitudinal Evolution (TADPOLE) challenge. For predicting future dementia development among MCI patients, on a held out test set, the EI-based model performed better (AUC = 0.81, F-measure = 0.68) than the more commonly used XGBoost (AUC = 0.68, F-measure = 0.57) and deep learning (AUC = 0.79, F-measure = 0.61) approaches. This EI-based model also suggested MRI-derived volumes of regions in the middle temporal gyrus, posterior cingulate gyrus and inferior lateral ventricle brain regions to be predictive of progression to dementia.
Collapse
Affiliation(s)
- Andrew Cirincione
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1 Gustave L. Levy Pl, New York, NY, 10029, USA
| | - Kirsten Lynch
- Laboratory of Neuro Imaging (LONI), USC Mark and Mary Stevens Neuroimaging and Informatics Institute, Keck School of Medicine, University of Southern California, 1975 Zonal Ave, Los Angeles, CA, 90033, USA
| | - Jamie Bennett
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1 Gustave L. Levy Pl, New York, NY, 10029, USA
| | - Jeiran Choupan
- Laboratory of Neuro Imaging (LONI), USC Mark and Mary Stevens Neuroimaging and Informatics Institute, Keck School of Medicine, University of Southern California, 1975 Zonal Ave, Los Angeles, CA, 90033, USA
- NeuroScope Inc., Scarsdale, NY, 10583, USA
| | - Bino Varghese
- Department of Radiology, Keck School of Medicine, University of Southern California, 1975 Zonal Ave, Los Angeles, CA, 90033, USA
| | - Nasim Sheikh-Bahaei
- Department of Radiology, Keck School of Medicine, University of Southern California, 1975 Zonal Ave, Los Angeles, CA, 90033, USA
| | - Gaurav Pandey
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1 Gustave L. Levy Pl, New York, NY, 10029, USA
| |
Collapse
|
18
|
Amini Gougeh R, Zilic Z. Systematic Review of IoT-Based Solutions for User Tracking: Towards Smarter Lifestyle, Wellness and Health Management. SENSORS (BASEL, SWITZERLAND) 2024; 24:5939. [PMID: 39338683 PMCID: PMC11435569 DOI: 10.3390/s24185939] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/07/2024] [Revised: 09/07/2024] [Accepted: 09/09/2024] [Indexed: 09/30/2024]
Abstract
The Internet of Things (IoT) base has grown to over 20 billion devices currently operational worldwide. As they greatly extend the applicability and use of biosensors, IoT developments are transformative. Recent studies show that IoT, coupled with advanced communication frameworks, such as machine-to-machine (M2M) interactions, can lead to (1) improved efficiency in data exchange, (2) accurate and timely health monitoring, and (3) enhanced user engagement and compliance through advancements in human-computer interaction. This systematic review of the 19 most relevant studies examines the potential of IoT in health and lifestyle management by conducting detailed analyses and quality assessments of each study. Findings indicate that IoT-based systems effectively monitor various health parameters using biosensors, facilitate real-time feedback, and support personalized health recommendations. Key limitations include small sample sizes, insufficient security measures, practical issues with wearable sensors, and reliance on internet connectivity in areas with poor network infrastructure. The reviewed studies demonstrated innovative applications of IoT, focusing on M2M interactions, edge devices, multimodality health monitoring, intelligent decision-making, and automated health management systems. These insights offer valuable recommendations for optimizing IoT technologies in health and wellness management.
Collapse
Affiliation(s)
- Reza Amini Gougeh
- Faculty of Engineering, Department of Electrical and Computer Engineering, McGill University, Montreal, QC H3A 0G4, Canada;
| | | |
Collapse
|
19
|
Liu W, Chen J, Wang H, Fu Z, Peijnenburg WJGM, Hong H. Perspectives on Advancing Multimodal Learning in Environmental Science and Engineering Studies. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2024. [PMID: 39226136 DOI: 10.1021/acs.est.4c03088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/05/2024]
Abstract
The environment faces increasing anthropogenic impacts, resulting in a rapid increase in environmental issues that undermine the natural capital essential for human wellbeing. These issues are complex and often influenced by various factors represented by data with different modalities. While machine learning (ML) provides data-driven tools for addressing the environmental issues, the current ML models in environmental science and engineering (ES&E) often neglect the utilization of multimodal data. With the advancement in deep learning, multimodal learning (MML) holds promise for comprehensive descriptions of the environmental issues by harnessing data from diverse modalities. This advancement has the potential to significantly elevate the accuracy and robustness of prediction models in ES&E studies, providing enhanced solutions for various environmental modeling tasks. This perspective summarizes MML methodologies and proposes potential applications of MML models in ES&E studies, including environmental quality assessment, prediction of chemical hazards, and optimization of pollution control techniques. Additionally, we discuss the challenges associated with implementing MML in ES&E and propose future research directions in this domain.
Collapse
Affiliation(s)
- Wenjia Liu
- Key Laboratory of Industrial Ecology and Environmental Engineering (Ministry of Education), Dalian Key Laboratory on Chemicals Risk Control and Pollution Prevention Technology, School of Environmental Science and Technology, Dalian University of Technology, Dalian 116024, China
| | - Jingwen Chen
- Key Laboratory of Industrial Ecology and Environmental Engineering (Ministry of Education), Dalian Key Laboratory on Chemicals Risk Control and Pollution Prevention Technology, School of Environmental Science and Technology, Dalian University of Technology, Dalian 116024, China
| | - Haobo Wang
- Key Laboratory of Industrial Ecology and Environmental Engineering (Ministry of Education), Dalian Key Laboratory on Chemicals Risk Control and Pollution Prevention Technology, School of Environmental Science and Technology, Dalian University of Technology, Dalian 116024, China
| | - Zhiqiang Fu
- Key Laboratory of Industrial Ecology and Environmental Engineering (Ministry of Education), Dalian Key Laboratory on Chemicals Risk Control and Pollution Prevention Technology, School of Environmental Science and Technology, Dalian University of Technology, Dalian 116024, China
| | - Willie J G M Peijnenburg
- Institute of Environmental Sciences (CML), Leiden University, Leiden 2300 RA, The Netherlands
- Centre for Safety of Substances and Products, National Institute of Public Health and the Environment (RIVM), Bilthoven 3720 BA, The Netherlands
| | - Huixiao Hong
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, Arkansas 72079, United States
| |
Collapse
|
20
|
Zong H, Wu R, Cha J, Feng W, Wu E, Li J, Shao A, Tao L, Li Z, Tang B, Shen B. Advancing Chinese biomedical text mining with community challenges. J Biomed Inform 2024; 157:104716. [PMID: 39197732 DOI: 10.1016/j.jbi.2024.104716] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2024] [Revised: 08/22/2024] [Accepted: 08/25/2024] [Indexed: 09/01/2024]
Abstract
OBJECTIVE This study aims to review the recent advances in community challenges for biomedical text mining in China. METHODS We collected information of evaluation tasks released in community challenges of biomedical text mining, including task description, dataset description, data source, task type and related links. A systematic summary and comparative analysis were conducted on various biomedical natural language processing tasks, such as named entity recognition, entity normalization, attribute extraction, relation extraction, event extraction, text classification, text similarity, knowledge graph construction, question answering, text generation, and large language model evaluation. RESULTS We identified 39 evaluation tasks from 6 community challenges that spanned from 2017 to 2023. Our analysis revealed the diverse range of evaluation task types and data sources in biomedical text mining. We explored the potential clinical applications of these community challenge tasks from a translational biomedical informatics perspective. We compared with their English counterparts, and discussed the contributions, limitations, lessons and guidelines of these community challenges, while highlighting future directions in the era of large language models. CONCLUSION Community challenge evaluation competitions have played a crucial role in promoting technology innovation and fostering interdisciplinary collaboration in the field of biomedical text mining. These challenges provide valuable platforms for researchers to develop state-of-the-art solutions.
Collapse
Affiliation(s)
- Hui Zong
- Joint Laboratory of Artificial Intelligence for Critical Care Medicine, Department of Critical Care Medicine and Institutes for Systems Genetics, Frontiers Science Center for Disease-related Molecular Network, West China Hospital, Sichuan University, Chengdu 610041, China
| | - Rongrong Wu
- Joint Laboratory of Artificial Intelligence for Critical Care Medicine, Department of Critical Care Medicine and Institutes for Systems Genetics, Frontiers Science Center for Disease-related Molecular Network, West China Hospital, Sichuan University, Chengdu 610041, China
| | - Jiaxue Cha
- Shanghai Key Laboratory of Signaling and Disease Research, Laboratory of Receptor-Based Bio-Medicine, Collaborative Innovation Center for Brain Science, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Weizhe Feng
- Joint Laboratory of Artificial Intelligence for Critical Care Medicine, Department of Critical Care Medicine and Institutes for Systems Genetics, Frontiers Science Center for Disease-related Molecular Network, West China Hospital, Sichuan University, Chengdu 610041, China
| | - Erman Wu
- Joint Laboratory of Artificial Intelligence for Critical Care Medicine, Department of Critical Care Medicine and Institutes for Systems Genetics, Frontiers Science Center for Disease-related Molecular Network, West China Hospital, Sichuan University, Chengdu 610041, China
| | - Jiakun Li
- Joint Laboratory of Artificial Intelligence for Critical Care Medicine, Department of Critical Care Medicine and Institutes for Systems Genetics, Frontiers Science Center for Disease-related Molecular Network, West China Hospital, Sichuan University, Chengdu 610041, China; Department of Urology, West China Hospital, Sichuan University, Chengdu 610041, China
| | - Aibin Shao
- Joint Laboratory of Artificial Intelligence for Critical Care Medicine, Department of Critical Care Medicine and Institutes for Systems Genetics, Frontiers Science Center for Disease-related Molecular Network, West China Hospital, Sichuan University, Chengdu 610041, China
| | - Liang Tao
- Faculty of Business Information, Shanghai Business School, Shanghai 201400, China
| | | | - Buzhou Tang
- Department of Computer Science, Harbin Institute of Technology, Shenzhen 518055, China
| | - Bairong Shen
- Joint Laboratory of Artificial Intelligence for Critical Care Medicine, Department of Critical Care Medicine and Institutes for Systems Genetics, Frontiers Science Center for Disease-related Molecular Network, West China Hospital, Sichuan University, Chengdu 610041, China.
| |
Collapse
|
21
|
Isavand P, Aghamiri SS, Amin R. Applications of Multimodal Artificial Intelligence in Non-Hodgkin Lymphoma B Cells. Biomedicines 2024; 12:1753. [PMID: 39200217 PMCID: PMC11351272 DOI: 10.3390/biomedicines12081753] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2024] [Revised: 07/22/2024] [Accepted: 08/01/2024] [Indexed: 09/02/2024] Open
Abstract
Given advancements in large-scale data and AI, integrating multimodal artificial intelligence into cancer research can enhance our understanding of tumor behavior by simultaneously processing diverse biomedical data types. In this review, we explore the potential of multimodal AI in comprehending B-cell non-Hodgkin lymphomas (B-NHLs). B-cell non-Hodgkin lymphomas (B-NHLs) represent a particular challenge in oncology due to tumor heterogeneity and the intricate ecosystem in which tumors develop. These complexities complicate diagnosis, prognosis, and therapy response, emphasizing the need to use sophisticated approaches to enhance personalized treatment strategies for better patient outcomes. Therefore, multimodal AI can be leveraged to synthesize critical information from available biomedical data such as clinical record, imaging, pathology and omics data, to picture the whole tumor. In this review, we first define various types of modalities, multimodal AI frameworks, and several applications in precision medicine. Then, we provide several examples of its usage in B-NHLs, for analyzing the complexity of the ecosystem, identifying immune biomarkers, optimizing therapy strategy, and its clinical applications. Lastly, we address the limitations and future directions of multimodal AI, highlighting the need to overcome these challenges for better clinical practice and application in healthcare.
Collapse
Affiliation(s)
- Pouria Isavand
- Department of Radiology, School of Medicine, Zanjan University of Medical Sciences, Zanjan 4513956184, Iran
| | | | - Rada Amin
- Department of Biochemistry, University of Nebraska, Lincoln, NE 68503, USA
| |
Collapse
|
22
|
Ferrara E. Large Language Models for Wearable Sensor-Based Human Activity Recognition, Health Monitoring, and Behavioral Modeling: A Survey of Early Trends, Datasets, and Challenges. SENSORS (BASEL, SWITZERLAND) 2024; 24:5045. [PMID: 39124092 PMCID: PMC11314694 DOI: 10.3390/s24155045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/10/2024] [Revised: 07/29/2024] [Accepted: 07/31/2024] [Indexed: 08/12/2024]
Abstract
The proliferation of wearable technology enables the generation of vast amounts of sensor data, offering significant opportunities for advancements in health monitoring, activity recognition, and personalized medicine. However, the complexity and volume of these data present substantial challenges in data modeling and analysis, which have been addressed with approaches spanning time series modeling to deep learning techniques. The latest frontier in this domain is the adoption of large language models (LLMs), such as GPT-4 and Llama, for data analysis, modeling, understanding, and human behavior monitoring through the lens of wearable sensor data. This survey explores the current trends and challenges in applying LLMs for sensor-based human activity recognition and behavior modeling. We discuss the nature of wearable sensor data, the capabilities and limitations of LLMs in modeling them, and their integration with traditional machine learning techniques. We also identify key challenges, including data quality, computational requirements, interpretability, and privacy concerns. By examining case studies and successful applications, we highlight the potential of LLMs in enhancing the analysis and interpretation of wearable sensor data. Finally, we propose future directions for research, emphasizing the need for improved preprocessing techniques, more efficient and scalable models, and interdisciplinary collaboration. This survey aims to provide a comprehensive overview of the intersection between wearable sensor data and LLMs, offering insights into the current state and future prospects of this emerging field.
Collapse
Affiliation(s)
- Emilio Ferrara
- Thomas Lord Department of Computer Science, University of Southern California, Los Angeles, CA 90007, USA;
- Information Sciences Institute, School of Advanced Computing, University of Southern California, Los Angeles, CA 90007, USA
| |
Collapse
|
23
|
Johnson R, Li MM, Noori A, Queen O, Zitnik M. Graph Artificial Intelligence in Medicine. Annu Rev Biomed Data Sci 2024; 7:345-368. [PMID: 38749465 PMCID: PMC11344018 DOI: 10.1146/annurev-biodatasci-110723-024625] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/23/2024]
Abstract
In clinical artificial intelligence (AI), graph representation learning, mainly through graph neural networks and graph transformer architectures, stands out for its capability to capture intricate relationships and structures within clinical datasets. With diverse data-from patient records to imaging-graph AI models process data holistically by viewing modalities and entities within them as nodes interconnected by their relationships. Graph AI facilitates model transfer across clinical tasks, enabling models to generalize across patient populations without additional parameters and with minimal to no retraining. However, the importance of human-centered design and model interpretability in clinical decision-making cannot be overstated. Since graph AI models capture information through localized neural transformations defined on relational datasets, they offer both an opportunity and a challenge in elucidating model rationale. Knowledge graphs can enhance interpretability by aligning model-driven insights with medical knowledge. Emerging graph AI models integrate diverse data modalities through pretraining, facilitate interactive feedback loops, and foster human-AI collaboration, paving the way toward clinically meaningful predictions.
Collapse
Affiliation(s)
- Ruth Johnson
- Berkowitz Family Living Laboratory, Harvard Medical School, Boston, Massachusetts, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA;
| | - Michelle M Li
- Bioinformatics and Integrative Genomics Program, Harvard Medical School, Boston, Massachusetts, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA;
| | - Ayush Noori
- Department of Computer Science, Harvard John A. Paulson School of Engineering and Applied Sciences, Allston, Massachusetts, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA;
| | - Owen Queen
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA;
| | - Marinka Zitnik
- Harvard Data Science Initiative, Cambridge, Massachusetts, USA
- Kempner Institute for the Study of Natural and Artificial Intelligence, Harvard University, Allston, Massachusetts, USA
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA;
| |
Collapse
|
24
|
Wang K, Margolis S, Cho JM, Wang S, Arianpour B, Jabalera A, Yin J, Hong W, Zhang Y, Zhao P, Zhu E, Reddy S, Hsiai TK. Non-Invasive Detection of Early-Stage Fatty Liver Disease via an On-Skin Impedance Sensor and Attention-Based Deep Learning. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2024; 11:e2400596. [PMID: 38887178 PMCID: PMC11336938 DOI: 10.1002/advs.202400596] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/16/2024] [Revised: 03/17/2024] [Indexed: 06/20/2024]
Abstract
Early-stage nonalcoholic fatty liver disease (NAFLD) is a silent condition, with most cases going undiagnosed, potentially progressing to liver cirrhosis and cancer. A non-invasive and cost-effective detection method for early-stage NAFLD detection is a public health priority but challenging. In this study, an adhesive, soft on-skin sensor with low electrode-skin contact impedance for early-stage NAFLD detection is fabricated. A method is developed to synthesize platinum nanoparticles and reduced graphene quantum dots onto the on-skin sensor to reduce electrode-skin contact impedance by increasing double-layer capacitance, thereby enhancing detection accuracy. Furthermore, an attention-based deep learning algorithm is introduced to differentiate impedance signals associated with early-stage NAFLD in high-fat-diet-fed low-density lipoprotein receptor knockout (Ldlr-/-) mice compared to healthy controls. The integration of an adhesive, soft on-skin sensor with low electrode-skin contact impedance and the attention-based deep learning algorithm significantly enhances the detection accuracy for early-stage NAFLD, achieving a rate above 97.5% with an area under the receiver operating characteristic curve (AUC) of 1.0. The findings present a non-invasive approach for early-stage NAFLD detection and display a strategy for improved early detection through on-skin electronics and deep learning.
Collapse
Affiliation(s)
- Kaidong Wang
- Department of MedicineDavid Geffen School of MedicineUniversity of California Los AngelesLos AngelesCA90095USA
- Department of Bioengineering, Henry Samueli School of Engineering and Applied SciencesUniversity of California Los AngelesLos AngelesCA90095USA
- Department of MedicineGreater Los Angeles Veterans Affairs (VA) Healthcare SystemLos AngelesCA90073USA
| | - Samuel Margolis
- Department of MedicineDavid Geffen School of MedicineUniversity of California Los AngelesLos AngelesCA90095USA
| | - Jae Min Cho
- Department of MedicineDavid Geffen School of MedicineUniversity of California Los AngelesLos AngelesCA90095USA
| | - Shaolei Wang
- Department of Bioengineering, Henry Samueli School of Engineering and Applied SciencesUniversity of California Los AngelesLos AngelesCA90095USA
| | - Brian Arianpour
- Department of Bioengineering, Henry Samueli School of Engineering and Applied SciencesUniversity of California Los AngelesLos AngelesCA90095USA
| | - Alejandro Jabalera
- Department of Bioengineering, Henry Samueli School of Engineering and Applied SciencesUniversity of California Los AngelesLos AngelesCA90095USA
| | - Junyi Yin
- Department of Bioengineering, Henry Samueli School of Engineering and Applied SciencesUniversity of California Los AngelesLos AngelesCA90095USA
| | - Wen Hong
- Department of Materials Science and EngineeringUniversity of California Los AngelesLos AngelesCA90095USA
| | - Yaran Zhang
- Department of Bioengineering, Henry Samueli School of Engineering and Applied SciencesUniversity of California Los AngelesLos AngelesCA90095USA
| | - Peng Zhao
- Department of MedicineDavid Geffen School of MedicineUniversity of California Los AngelesLos AngelesCA90095USA
| | - Enbo Zhu
- Department of MedicineDavid Geffen School of MedicineUniversity of California Los AngelesLos AngelesCA90095USA
- Department of Materials Science and EngineeringUniversity of California Los AngelesLos AngelesCA90095USA
| | - Srinivasa Reddy
- Department of Molecular and Medical PharmacologyUniversity of California Los AngelesLos AngelesCA90095USA
| | - Tzung K. Hsiai
- Department of MedicineDavid Geffen School of MedicineUniversity of California Los AngelesLos AngelesCA90095USA
- Department of Bioengineering, Henry Samueli School of Engineering and Applied SciencesUniversity of California Los AngelesLos AngelesCA90095USA
- Department of MedicineGreater Los Angeles Veterans Affairs (VA) Healthcare SystemLos AngelesCA90073USA
| |
Collapse
|
25
|
Li Y, El Habib Daho M, Conze PH, Zeghlache R, Le Boité H, Tadayoni R, Cochener B, Lamard M, Quellec G. A review of deep learning-based information fusion techniques for multimodal medical image classification. Comput Biol Med 2024; 177:108635. [PMID: 38796881 DOI: 10.1016/j.compbiomed.2024.108635] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Revised: 03/18/2024] [Accepted: 05/18/2024] [Indexed: 05/29/2024]
Abstract
Multimodal medical imaging plays a pivotal role in clinical diagnosis and research, as it combines information from various imaging modalities to provide a more comprehensive understanding of the underlying pathology. Recently, deep learning-based multimodal fusion techniques have emerged as powerful tools for improving medical image classification. This review offers a thorough analysis of the developments in deep learning-based multimodal fusion for medical classification tasks. We explore the complementary relationships among prevalent clinical modalities and outline three main fusion schemes for multimodal classification networks: input fusion, intermediate fusion (encompassing single-level fusion, hierarchical fusion, and attention-based fusion), and output fusion. By evaluating the performance of these fusion techniques, we provide insight into the suitability of different network architectures for various multimodal fusion scenarios and application domains. Furthermore, we delve into challenges related to network architecture selection, handling incomplete multimodal data management, and the potential limitations of multimodal fusion. Finally, we spotlight the promising future of Transformer-based multimodal fusion techniques and give recommendations for future research in this rapidly evolving field.
Collapse
Affiliation(s)
- Yihao Li
- LaTIM UMR 1101, Inserm, Brest, France; University of Western Brittany, Brest, France
| | - Mostafa El Habib Daho
- LaTIM UMR 1101, Inserm, Brest, France; University of Western Brittany, Brest, France.
| | | | - Rachid Zeghlache
- LaTIM UMR 1101, Inserm, Brest, France; University of Western Brittany, Brest, France
| | - Hugo Le Boité
- Sorbonne University, Paris, France; Ophthalmology Department, Lariboisière Hospital, AP-HP, Paris, France
| | - Ramin Tadayoni
- Ophthalmology Department, Lariboisière Hospital, AP-HP, Paris, France; Paris Cité University, Paris, France
| | - Béatrice Cochener
- LaTIM UMR 1101, Inserm, Brest, France; University of Western Brittany, Brest, France; Ophthalmology Department, CHRU Brest, Brest, France
| | - Mathieu Lamard
- LaTIM UMR 1101, Inserm, Brest, France; University of Western Brittany, Brest, France
| | | |
Collapse
|
26
|
Luo Y, Mao C, Sanchez‐Pinto LN, Ahmad FS, Naidech A, Rasmussen L, Pacheco JA, Schneider D, Mithal LB, Dresden S, Holmes K, Carson M, Shah SJ, Khan S, Clare S, Wunderink RG, Liu H, Walunas T, Cooper L, Yue F, Wehbe F, Fang D, Liebovitz DM, Markl M, Michelson KN, McColley SA, Green M, Starren J, Ackermann RT, D'Aquila RT, Adams J, Lloyd‐Jones D, Chisholm RL, Kho A. Northwestern University resource and education development initiatives to advance collaborative artificial intelligence across the learning health system. Learn Health Syst 2024; 8:e10417. [PMID: 39036530 PMCID: PMC11257059 DOI: 10.1002/lrh2.10417] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Revised: 02/22/2024] [Accepted: 02/26/2024] [Indexed: 07/23/2024] Open
Abstract
Introduction The rapid development of artificial intelligence (AI) in healthcare has exposed the unmet need for growing a multidisciplinary workforce that can collaborate effectively in the learning health systems. Maximizing the synergy among multiple teams is critical for Collaborative AI in Healthcare. Methods We have developed a series of data, tools, and educational resources for cultivating the next generation of multidisciplinary workforce for Collaborative AI in Healthcare. We built bulk-natural language processing pipelines to extract structured information from clinical notes and stored them in common data models. We developed multimodal AI/machine learning (ML) tools and tutorials to enrich the toolbox of the multidisciplinary workforce to analyze multimodal healthcare data. We have created a fertile ground to cross-pollinate clinicians and AI scientists and train the next generation of AI health workforce to collaborate effectively. Results Our work has democratized access to unstructured health information, AI/ML tools and resources for healthcare, and collaborative education resources. From 2017 to 2022, this has enabled studies in multiple clinical specialties resulting in 68 peer-reviewed publications. In 2022, our cross-discipline efforts converged and institutionalized into the Center for Collaborative AI in Healthcare. Conclusions Our Collaborative AI in Healthcare initiatives has created valuable educational and practical resources. They have enabled more clinicians, scientists, and hospital administrators to successfully apply AI methods in their daily research and practice, develop closer collaborations, and advanced the institution-level learning health system.
Collapse
Affiliation(s)
- Yuan Luo
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Division of Health and Biomedical Informatics, Department of Preventive MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Chengsheng Mao
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Division of Health and Biomedical Informatics, Department of Preventive MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Lazaro N. Sanchez‐Pinto
- Division of Health and Biomedical Informatics, Department of Preventive MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
- Division of Critical Care, Department of PediatricsNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
- Stanley Manne Children's Research InstituteAnn & Robert H. Lurie Children's Hospital of ChicagoChicagoIllinoisUSA
| | - Faraz S. Ahmad
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Division of Cardiology, Department of MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Andrew Naidech
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Division of Neurocritical Care, Department of NeurologyNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Luke Rasmussen
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Division of Health and Biomedical Informatics, Department of Preventive MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Jennifer A. Pacheco
- Center for Genetic MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Daniel Schneider
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
| | - Leena B. Mithal
- Stanley Manne Children's Research InstituteAnn & Robert H. Lurie Children's Hospital of ChicagoChicagoIllinoisUSA
- Division of Infectious Diseases, Department of PediatricsNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Scott Dresden
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Department of Emergency MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Kristi Holmes
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Division of Health and Biomedical Informatics, Department of Preventive MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
- Galter Health Sciences LibraryNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Matthew Carson
- Galter Health Sciences LibraryNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Sanjiv J. Shah
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Division of Cardiology, Department of MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Seema Khan
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Department of SurgeryNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Susan Clare
- Department of SurgeryNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Richard G. Wunderink
- Division of Critical Care, Department of PediatricsNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
- Pulmonary and Critical Care Division, Department of MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Huiping Liu
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Department of PharmacologyNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
- Division of Hematology and Oncology, Department of MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Theresa Walunas
- Division of Health and Biomedical Informatics, Department of Preventive MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
- Division of General Internal Medicine, Department of MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
- Center for Health Information PartnershipsInstitute for Public Health and Medicine, Northwestern UniversityChicagoIllinoisUSA
- Department of Microbiology‐ImmunologyNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Lee Cooper
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Division of Health and Biomedical Informatics, Department of Preventive MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
- Department of PathologyNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Feng Yue
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Department of PathologyNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
- Department of Biochemistry and Molecular GeneticsNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Firas Wehbe
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Department of SurgeryNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Deyu Fang
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Department of PathologyNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - David M. Liebovitz
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Division of Health and Biomedical Informatics, Department of Preventive MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
- Division of General Internal Medicine, Department of MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
- Center for Health Information PartnershipsInstitute for Public Health and Medicine, Northwestern UniversityChicagoIllinoisUSA
| | - Michael Markl
- Department of RadiologyNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Kelly N. Michelson
- Division of Critical Care, Department of PediatricsNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
- Stanley Manne Children's Research InstituteAnn & Robert H. Lurie Children's Hospital of ChicagoChicagoIllinoisUSA
- Center for Bioethics and Medical Humanities, Institute for Public Health and MedicineNorthwestern UniversityChicagoIllinoisUSA
| | - Susanna A. McColley
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Stanley Manne Children's Research InstituteAnn & Robert H. Lurie Children's Hospital of ChicagoChicagoIllinoisUSA
- Division of Pulmonary and Sleep Medicine, Department of PediatricsNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Marianne Green
- Division of General Internal Medicine, Department of MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Justin Starren
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Division of Health and Biomedical Informatics, Department of Preventive MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Ronald T. Ackermann
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Division of General Internal Medicine, Department of MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
- Institute for Public Health and MedicineNorthwestern UniversityChicagoIllinoisUSA
| | - Richard T. D'Aquila
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Division of Infectious Diseases, Department of MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - James Adams
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Department of Emergency MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Donald Lloyd‐Jones
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Division of Epidemiology, Department of Preventive MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
| | - Rex L. Chisholm
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Department of SurgeryNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
- Center for Health Information PartnershipsInstitute for Public Health and Medicine, Northwestern UniversityChicagoIllinoisUSA
| | - Abel Kho
- Northwestern University Clinical and Translational Sciences InstituteChicagoIllinoisUSA
- Institute for Augmented Intelligence in MedicineNorthwestern UniversityChicagoIllinoisUSA
- Division of Health and Biomedical Informatics, Department of Preventive MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
- Division of General Internal Medicine, Department of MedicineNorthwestern University Feinberg School of MedicineChicagoIllinoisUSA
- Center for Health Information PartnershipsInstitute for Public Health and Medicine, Northwestern UniversityChicagoIllinoisUSA
| |
Collapse
|
27
|
Schiffer-Kane K, Liu C, Callahan TJ, Ta C, Nestor JG, Weng C. Converting OMOP CDM to phenopackets: A model alignment and patient data representation evaluation. J Biomed Inform 2024; 155:104659. [PMID: 38777085 PMCID: PMC11181468 DOI: 10.1016/j.jbi.2024.104659] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Revised: 05/11/2024] [Accepted: 05/18/2024] [Indexed: 05/25/2024]
Abstract
OBJECTIVE This study aims to promote interoperability in precision medicine and translational research by aligning the Observational Medical Outcomes Partnership (OMOP) and Phenopackets data models. Phenopackets is an expert knowledge-driven schema designed to facilitate the storage and exchange of multimodal patient data, and support downstream analysis. The first goal of this paper is to explore model alignment by characterizing the common data models using a newly developed data transformation process and evaluation method. Second, using OMOP normalized clinical data, we evaluate the mapping of real-world patient data to Phenopackets. We evaluate the suitability of Phenopackets as a patient data representation for real-world clinical cases. METHODS We identified mappings between OMOP and Phenopackets and applied them to a real patient dataset to assess the transformation's success. We analyzed gaps between the models and identified key considerations for transforming data between them. Further, to improve ambiguous alignment, we incorporated Unified Medical Language System (UMLS) semantic type-based filtering to direct individual concepts to their most appropriate domain and conducted a domain-expert evaluation of the mapping's clinical utility. RESULTS The OMOP to Phenopacket transformation pipeline was executed for 1,000 Alzheimer's disease patients and successfully mapped all required entities. However, due to missing values in OMOP for required Phenopacket attributes, 10.2 % of records were lost. The use of UMLS-semantic type filtering for ambiguous alignment of individual concepts resulted in 96 % agreement with clinical thinking, increased from 68 % when mapping exclusively by domain correspondence. CONCLUSION This study presents a pipeline to transform data from OMOP to Phenopackets. We identified considerations for the transformation to ensure data quality, handling restrictions for successful Phenopacket validation and discrepant data formats. We identified unmappable Phenopacket attributes that focus on specialty use cases, such as genomics or oncology, which OMOP does not currently support. We introduce UMLS semantic type filtering to resolve ambiguous alignment to Phenopacket entities to be most appropriate for real-world interpretation. We provide a systematic approach to align OMOP and Phenopackets schemas. Our work facilitates future use of Phenopackets in clinical applications by addressing key barriers to interoperability when deriving a Phenopacket from real-world patient data.
Collapse
Affiliation(s)
- Kayla Schiffer-Kane
- Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, USA
| | - Cong Liu
- Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, USA
| | - Tiffany J Callahan
- Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, USA
| | - Casey Ta
- Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, USA
| | - Jordan G Nestor
- Department of Medicine, Division of Nephrology, Columbia University Irving Medical Center, New York, NY, USA
| | - Chunhua Weng
- Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, USA.
| |
Collapse
|
28
|
Loutati R, Ben-Yehuda A, Rosenberg S, Rottenberg Y. Multimodal Machine Learning for Prediction of 30-Day Readmission Risk in Elderly Population. Am J Med 2024; 137:617-628. [PMID: 38588939 DOI: 10.1016/j.amjmed.2024.04.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/12/2024] [Revised: 04/01/2024] [Accepted: 04/01/2024] [Indexed: 04/10/2024]
Abstract
BACKGROUND Readmission within 30 days is a prevalent issue among elderly patients, linked to unfavorable health outcomes. Our objective was to develop and validate multimodal machine learning models for predicting 30-day readmission risk in elderly patients discharged from internal medicine departments. METHODS This was a retrospective cohort study which included elderly patients aged 75 or older, who were hospitalized at the Hadassah Medical Center internal medicine departments between 2014 and 2020. Three machine learning algorithms were developed and employed to predict 30-day readmission risk. The primary measures were predictive model performance scores, specifically area under the receiver operator curve (AUROC), and average precision. RESULTS This study included 19,569 admissions. Of them, 3258 (16.65%) resulted in 30-day readmission. Our 3 proposed models demonstrated high accuracy and precision on an unseen test set, with AUROC values of 0.87, 0.89, and 0.93, respectively, and average precision values of 0.76, 0.78, and 0.81. Feature importance analysis revealed that the number of admissions in the past year, history of 30-day readmission, Charlson score, and admission length were the most influential variables. Notably, the natural language processing score, representing the probability of readmission according to a textual-based model trained on social workers' assessment letters during hospitalization, ranked among the top 10 contributing factors. CONCLUSIONS Leveraging multimodal machine learning offers a promising strategy for identifying elderly patients who are at high risk for 30-day readmission. By identifying these patients, machine learning models may facilitate the effective execution of preventive actions to reduce avoidable readmission incidents.
Collapse
Affiliation(s)
- Ranel Loutati
- Department of Military Medicine and "Tzameret", Faculty of Medicine, Hebrew University of Jerusalem; and the Medical Corps, Israel Defense Forces, Israel; Gaffin Center for Neuro-Oncology, Sharett Institute for Oncology, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Israel; The Wohl Institute for Translational Medicine, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Israel.
| | - Arie Ben-Yehuda
- Department of Internal Medicine, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Israel
| | - Shai Rosenberg
- Gaffin Center for Neuro-Oncology, Sharett Institute for Oncology, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Israel; The Wohl Institute for Translational Medicine, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Israel
| | - Yakir Rottenberg
- Sharett Institute of Oncology, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Israel
| |
Collapse
|
29
|
Tran XT, Do T, Pal NR, Jung TP, Lin CT. Multimodal fusion for anticipating human decision performance. Sci Rep 2024; 14:13217. [PMID: 38851836 PMCID: PMC11162455 DOI: 10.1038/s41598-024-63651-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Accepted: 05/30/2024] [Indexed: 06/10/2024] Open
Abstract
Anticipating human decisions while performing complex tasks remains a formidable challenge. This study proposes a multimodal machine-learning approach that leverages image features and electroencephalography (EEG) data to predict human response correctness in a demanding visual searching task. Notably, we extract a novel set of image features pertaining to object relationships using the Segment Anything Model (SAM), which enhances prediction accuracy compared to traditional features. Additionally, our approach effectively utilizes a combination of EEG signals and image features to streamline the feature set required for the Random Forest Classifier (RFC) while maintaining high accuracy. The findings of this research hold substantial potential for developing advanced fault alert systems, particularly in critical decision-making environments such as the medical and defence sectors.
Collapse
Affiliation(s)
- Xuan-The Tran
- GrapheneX-UTS HAI Centre, Australian AI Institute, Faculty of Engineering and Information Technology (FEIT), University of Technology Sydney (UTS), Sydney, NSW, 2007, Australia
| | - Thomas Do
- GrapheneX-UTS HAI Centre, Australian AI Institute, Faculty of Engineering and Information Technology (FEIT), University of Technology Sydney (UTS), Sydney, NSW, 2007, Australia
| | - Nikhil R Pal
- Electronics and Communication Sciences Unit, Indian Statistical Institute, Calcutta, West Bengal, 700108, India
| | - Tzyy-Ping Jung
- Institute for Neural Computation and Institute of Engineering in Medicine, University of California, San Diego (UCSD), La Jolla, CA, 92093, USA
| | - Chin-Teng Lin
- GrapheneX-UTS HAI Centre, Australian AI Institute, Faculty of Engineering and Information Technology (FEIT), University of Technology Sydney (UTS), Sydney, NSW, 2007, Australia.
| |
Collapse
|
30
|
Mirzaei A, Hiller BC, Stelzer IA, Thiele K, Tan Y, Becker M. Computational Approaches for Connecting Maternal Stress to Preterm Birth. Clin Perinatol 2024; 51:345-360. [PMID: 38705645 DOI: 10.1016/j.clp.2024.02.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/07/2024]
Abstract
Multiple studies have hinted at a complex connection between maternal stress and preterm birth (PTB). This article describes the potential of computational methods to provide new insights into this relationship. For this, we outline existing approaches for stress assessments and various data modalities available for profiling stress responses, and review studies that sought either to establish a connection between stress and PTB or to predict PTB based on stress-related factors. Finally, we summarize the challenges of computational methods, highlighting potential future research directions within this field.
Collapse
Affiliation(s)
- Amin Mirzaei
- Department of Computer Science and Electrical Engineering, Institute for Visual and Analytic Computing, Universität Rostock, Albert-Einstein-Straße 22, 18059 Rostock, Germany
| | - Bjarne C Hiller
- Department of Computer Science and Electrical Engineering, Institute for Visual and Analytic Computing, Universität Rostock, Albert-Einstein-Straße 22, 18059 Rostock, Germany
| | - Ina A Stelzer
- Department of Pathology, University of California San Diego, GPL/CMM-West, 9500 Gilman Drive, La Jolla, CA 92093, USA
| | - Kristin Thiele
- Division for Experimental Feto-Maternal Medicine, Department of Obstetrics and Fetal Medicine, University Medical Center Hamburg-Eppendorf, Center for Obstetrics and Pediatrics, Martinistrasse 52, 20246 Hamburg, Germany
| | - Yuqi Tan
- Department of Microbiology and Immunology, Stanford University School of Medicine, CSSR3220, 269 Campus Drive, Stanford, CA 94305, USA
| | - Martin Becker
- Department of Computer Science and Electrical Engineering, Institute for Visual and Analytic Computing, Universität Rostock, Albert-Einstein-Straße 22, 18059 Rostock, Germany.
| |
Collapse
|
31
|
Ganjizadeh A, Zawada SJ, Langer SG, Erickson BJ. Visualizing Clinical Data Retrieval and Curation in Multimodal Healthcare AI Research: A Technical Note on RIL-workflow. JOURNAL OF IMAGING INFORMATICS IN MEDICINE 2024; 37:1239-1247. [PMID: 38366291 PMCID: PMC11169146 DOI: 10.1007/s10278-024-00977-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Revised: 12/02/2023] [Accepted: 12/04/2023] [Indexed: 02/18/2024]
Abstract
Curating and integrating data from sources are bottlenecks to procuring robust training datasets for artificial intelligence (AI) models in healthcare. While numerous applications can process discrete types of clinical data, it is still time-consuming to integrate heterogenous data types. Therefore, there exists a need for more efficient retrieval and storage of curated patient data from dissimilar sources, such as biobanks, health records, and sensors. We describe a customizable, modular data retrieval application (RIL-workflow), which integrates clinical notes, images, and prescription data, and show its feasibility applied to research at our institution. It uses the workflow automation platform Camunda (Camunda Services GmbH, Berlin, Germany) to collect internal data from Fast Healthcare Interoperability Resources (FHIR) and Digital Imaging and Communications in Medicine (DICOM) sources. Using the web-based graphical user interface (GUI), the workflow runs tasks to completion according to visual representation, retrieving and storing results for patients meeting study inclusion criteria while segregating errors for human review. We showcase RIL-workflow with its library of ready-to-use modules, enabling researchers to specify human input or automation at fixed steps. We validated our workflow by demonstrating its capability to aggregate, curate, and handle errors related to data from multiple sources to generate a multimodal database for clinical AI research. Further, we solicited user feedback to highlight the pros and cons associated with RIL-workflow. The source code is available at github.com/magnooj/RIL-workflow.
Collapse
Affiliation(s)
- Ali Ganjizadeh
- Mayo Clinic Artificial Intelligence Laboratory, 200 1st Street SW, Rochester, MN, 55902, USA
- Mayo Clinic Department of Radiology, 200 1st Street SW, Rochester, MN, 55902, USA
| | - Stephanie J Zawada
- Mayo Clinic Artificial Intelligence Laboratory, 200 1st Street SW, Rochester, MN, 55902, USA
- Mayo Clinic College of Medicine and Science, 5777 E. Mayo Boulevard, Scottsdale, AZ, 85054, USA
| | - Steve G Langer
- Mayo Clinic Artificial Intelligence Laboratory, 200 1st Street SW, Rochester, MN, 55902, USA
- Mayo Clinic Department of Radiology, 200 1st Street SW, Rochester, MN, 55902, USA
| | - Bradley J Erickson
- Mayo Clinic Artificial Intelligence Laboratory, 200 1st Street SW, Rochester, MN, 55902, USA.
- Mayo Clinic Department of Radiology, 200 1st Street SW, Rochester, MN, 55902, USA.
- Mayo Clinic College of Medicine and Science, 5777 E. Mayo Boulevard, Scottsdale, AZ, 85054, USA.
| |
Collapse
|
32
|
Yang H, Zhu D, He S, Xu Z, Liu Z, Zhang W, Cai J. Enhancing psychiatric rehabilitation outcomes through a multimodal multitask learning model based on BERT and TabNet: An approach for personalized treatment and improved decision-making. Psychiatry Res 2024; 336:115896. [PMID: 38626625 DOI: 10.1016/j.psychres.2024.115896] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Revised: 04/03/2024] [Accepted: 04/05/2024] [Indexed: 04/18/2024]
Abstract
Evaluating the rehabilitation status of individuals with serious mental illnesses (SMI) necessitates a comprehensive analysis of multimodal data, including unstructured text records and structured diagnostic data. However, progress in the effective assessment of rehabilitation status remains limited. Our study develops a deep learning model integrating Bidirectional Encoder Representations from Transformers (BERT) and TabNet through a late fusion strategy to enhance rehabilitation prediction, including referral risk, dangerous behaviors, self-awareness, and medication adherence, in patients with SMI. BERT processes unstructured textual data, such as doctor's notes, whereas TabNet manages structured diagnostic information. The model's interpretability function serves to assist healthcare professionals in understanding the model's predictive decisions, improving patient care. Our model exhibited excellent predictive performance for all four tasks, with an accuracy exceeding 0.78 and an area under the curve of 0.70. In addition, a series of tests proved the model's robustness, fairness, and interpretability. This study combines multimodal and multitask learning strategies into a model and applies it to rehabilitation assessment tasks, offering a promising new tool that can be seamlessly integrated with the clinical workflow to support the provision of optimized patient care.
Collapse
Affiliation(s)
- Hongyi Yang
- School of Design, Shanghai Jiao Tong University, Shanghai, China
| | - Dian Zhu
- School of Design, Shanghai Jiao Tong University, Shanghai, China
| | - Siyuan He
- Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Zhiqi Xu
- School of Design, Shanghai Jiao Tong University, Shanghai, China
| | - Zhao Liu
- School of Design, Shanghai Jiao Tong University, Shanghai, China.
| | - Weibo Zhang
- Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, China; Shanghai Institute of Infectious Disease and Biosecurity, Fudan University, Shanghai, China; Mental Health Branch, China Hospital Development Institute, Shanghai Jiao Tong University, Shanghai, China.
| | - Jun Cai
- Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, China; Mental Health Branch, China Hospital Development Institute, Shanghai Jiao Tong University, Shanghai, China.
| |
Collapse
|
33
|
Mennickent D, Romero-Albornoz L, Gutiérrez-Vega S, Aguayo C, Marini F, Guzmán-Gutiérrez E, Araya J. Simple and Fast Prediction of Gestational Diabetes Mellitus Based on Machine Learning and Near-Infrared Spectra of Serum: A Proof of Concept Study at Different Stages of Pregnancy. Biomedicines 2024; 12:1142. [PMID: 38927349 PMCID: PMC11200648 DOI: 10.3390/biomedicines12061142] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2024] [Revised: 04/26/2024] [Accepted: 04/26/2024] [Indexed: 06/28/2024] Open
Abstract
Gestational diabetes mellitus (GDM) is a hyperglycemic state that is typically diagnosed by an oral glucose tolerance test (OGTT), which is unpleasant, time-consuming, has low reproducibility, and results are tardy. The machine learning (ML) predictive models that have been proposed to improve GDM diagnosis are usually based on instrumental methods that take hours to produce a result. Near-infrared (NIR) spectroscopy is a simple, fast, and low-cost analytical technique that has never been assessed for the prediction of GDM. This study aims to develop ML predictive models for GDM based on NIR spectroscopy, and to evaluate their potential as early detection or alternative screening tools according to their predictive power and duration of analysis. Serum samples from the first trimester (before GDM diagnosis) and the second trimester (at the time of GDM diagnosis) of pregnancy were analyzed by NIR spectroscopy. Four spectral ranges were considered, and 80 mathematical pretreatments were tested for each. NIR data-based models were built with single- and multi-block ML techniques. Every model was subjected to double cross-validation. The best models for first and second trimester achieved areas under the receiver operating characteristic curve of 0.5768 ± 0.0635 and 0.8836 ± 0.0259, respectively. This is the first study reporting NIR-spectroscopy-based methods for the prediction of GDM. The developed methods allow for prediction of GDM from 10 µL of serum in only 32 min. They are simple, fast, and have a great potential for application in clinical practice, especially as alternative screening tools to the OGTT for GDM diagnosis.
Collapse
Affiliation(s)
- Daniela Mennickent
- Departamento de Ciencias Básicas y Morfología, Facultad de Medicina, Universidad Católica de la Santísima Concepción, 4090541 Concepción, Chile;
- Departamento de Análisis Instrumental, Facultad de Farmacia, Universidad de Concepción, 4070386 Concepción, Chile;
| | - Lucas Romero-Albornoz
- Departamento de Análisis Instrumental, Facultad de Farmacia, Universidad de Concepción, 4070386 Concepción, Chile;
| | - Sebastián Gutiérrez-Vega
- Departamento de Bioquímica Clínica e Inmunología, Facultad de Farmacia, Universidad de Concepción, 4070386 Concepción, Chile; (S.G.-V.); (C.A.)
| | - Claudio Aguayo
- Departamento de Bioquímica Clínica e Inmunología, Facultad de Farmacia, Universidad de Concepción, 4070386 Concepción, Chile; (S.G.-V.); (C.A.)
| | - Federico Marini
- Department of Chemistry, University of Rome La Sapienza, 00185 Rome, Italy;
| | - Enrique Guzmán-Gutiérrez
- Departamento de Bioquímica Clínica e Inmunología, Facultad de Farmacia, Universidad de Concepción, 4070386 Concepción, Chile; (S.G.-V.); (C.A.)
| | - Juan Araya
- Departamento de Análisis Instrumental, Facultad de Farmacia, Universidad de Concepción, 4070386 Concepción, Chile;
| |
Collapse
|
34
|
Khodaee F, Zandie R, Edelman ER. Multimodal Learning for Mapping the Genotype-Phenotype Dynamics. RESEARCH SQUARE 2024:rs.3.rs-4355413. [PMID: 38798675 PMCID: PMC11118704 DOI: 10.21203/rs.3.rs-4355413/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2024]
Abstract
How complex phenotypes emerge from intricate gene expression patterns is a fundamental question in biology. Quantitative characterization of this relationship, however, is challenging due to the vast combinatorial possibilities and dynamic interplay between genotype and phenotype landscapes. Integrating high-content genotyping approaches such as single-cell RNA sequencing and advanced learning methods such as language models offers an opportunity for dissecting this complex relationship. Here, we present a computational integrated genetics framework designed to analyze and interpret the high-dimensional landscape of genotypes and their associated phenotypes simultaneously. We applied this approach to develop a multimodal foundation model to explore the genotype-phenotype relationship manifold for human transcriptomics at the cellular level. Analyzing this joint manifold showed a refined resolution of cellular heterogeneity, enhanced precision in phenotype annotating, and uncovered potential cross-tissue biomarkers that are undetectable through conventional gene expression analysis alone. Moreover, our results revealed that the gene networks are characterized by scale-free patterns and show context-dependent gene-gene interactions, both of which result in significant variations in the topology of the gene network, particularly evident during aging. Finally, utilizing contextualized embeddings, we investigated gene polyfunctionality which illustrates the multifaceted roles that genes play in different biological processes, and demonstrated that for VWF gene in endothelial cells. Overall, this study advances our understanding of the dynamic interplay between gene expression and phenotypic manifestation and demonstrates the potential of integrated genetics in uncovering new dimensions of cellular function and complexity.
Collapse
Affiliation(s)
- Farhan Khodaee
- Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, 02139, MA, USA
| | - Rohola Zandie
- Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, 02139, MA, USA
| | - Elazer R. Edelman
- Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, 02139, MA, USA
- Department of Medicine (Cardiovascular Medicine), Brigham and Women’s Hospital, Boston, 02115, MA, USA
| |
Collapse
|
35
|
Yao X, Ouyang S, Lian Y, Peng Q, Zhou X, Huang F, Hu X, Shi F, Xia J. PheSeq, a Bayesian deep learning model to enhance and interpret the gene-disease association studies. Genome Med 2024; 16:56. [PMID: 38627848 PMCID: PMC11020195 DOI: 10.1186/s13073-024-01330-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Accepted: 04/02/2024] [Indexed: 04/19/2024] Open
Abstract
Despite the abundance of genotype-phenotype association studies, the resulting association outcomes often lack robustness and interpretations. To address these challenges, we introduce PheSeq, a Bayesian deep learning model that enhances and interprets association studies through the integration and perception of phenotype descriptions. By implementing the PheSeq model in three case studies on Alzheimer's disease, breast cancer, and lung cancer, we identify 1024 priority genes for Alzheimer's disease and 818 and 566 genes for breast cancer and lung cancer, respectively. Benefiting from data fusion, these findings represent moderate positive rates, high recall rates, and interpretation in gene-disease association studies.
Collapse
Affiliation(s)
- Xinzhi Yao
- College of Informatics, Hubei Key Laboratory of Agricultural Bioinformatics, Huazhong Agricultural University, Wuhan, China
- Hubei Key Laboratory of Agricultural Bioinformatics, Huazhong Agricultural University, Wuhan, China
| | - Sizhuo Ouyang
- College of Informatics, Hubei Key Laboratory of Agricultural Bioinformatics, Huazhong Agricultural University, Wuhan, China
- Hubei Key Laboratory of Agricultural Bioinformatics, Huazhong Agricultural University, Wuhan, China
| | - Yulong Lian
- College of Science, Huazhong Agricultural University, Wuhan, China
| | - Qianqian Peng
- College of Informatics, Hubei Key Laboratory of Agricultural Bioinformatics, Huazhong Agricultural University, Wuhan, China
- Hubei Key Laboratory of Agricultural Bioinformatics, Huazhong Agricultural University, Wuhan, China
| | - Xionghui Zhou
- College of Informatics, Hubei Key Laboratory of Agricultural Bioinformatics, Huazhong Agricultural University, Wuhan, China
- Hubei Key Laboratory of Agricultural Bioinformatics, Huazhong Agricultural University, Wuhan, China
| | - Feier Huang
- College of Life Science and Technology, Huazhong Agricultural University, Wuhan, China
| | - Xuehai Hu
- College of Informatics, Hubei Key Laboratory of Agricultural Bioinformatics, Huazhong Agricultural University, Wuhan, China
- Hubei Key Laboratory of Agricultural Bioinformatics, Huazhong Agricultural University, Wuhan, China
| | - Feng Shi
- College of Science, Huazhong Agricultural University, Wuhan, China
| | - Jingbo Xia
- College of Informatics, Hubei Key Laboratory of Agricultural Bioinformatics, Huazhong Agricultural University, Wuhan, China.
- Hubei Key Laboratory of Agricultural Bioinformatics, Huazhong Agricultural University, Wuhan, China.
| |
Collapse
|
36
|
Muse ED, Topol EJ. Transforming the cardiometabolic disease landscape: Multimodal AI-powered approaches in prevention and management. Cell Metab 2024; 36:670-683. [PMID: 38428435 PMCID: PMC10990799 DOI: 10.1016/j.cmet.2024.02.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/26/2023] [Revised: 01/25/2024] [Accepted: 02/06/2024] [Indexed: 03/03/2024]
Abstract
The rise of artificial intelligence (AI) has revolutionized various scientific fields, particularly in medicine, where it has enabled the modeling of complex relationships from massive datasets. Initially, AI algorithms focused on improved interpretation of diagnostic studies such as chest X-rays and electrocardiograms in addition to predicting patient outcomes and future disease onset. However, AI has evolved with the introduction of transformer models, allowing analysis of the diverse, multimodal data sources existing in medicine today. Multimodal AI holds great promise in more accurate disease risk assessment and stratification as well as optimizing the key driving factors in cardiometabolic disease: blood pressure, sleep, stress, glucose control, weight, nutrition, and physical activity. In this article we outline the current state of medical AI in cardiometabolic disease, highlighting the potential of multimodal AI to augment personalized prevention and treatment strategies in cardiometabolic disease.
Collapse
Affiliation(s)
- Evan D Muse
- Scripps Research Translational Institute, Scripps Research, La Jolla, CA 92037, USA; Division of Cardiovascular Diseases, Scripps Clinic, La Jolla, CA 92037, USA
| | - Eric J Topol
- Scripps Research Translational Institute, Scripps Research, La Jolla, CA 92037, USA; Division of Cardiovascular Diseases, Scripps Clinic, La Jolla, CA 92037, USA.
| |
Collapse
|
37
|
Thirunavukarasu AJ, Mahmood S, Malem A, Foster WP, Sanghera R, Hassan R, Zhou S, Wong SW, Wong YL, Chong YJ, Shakeel A, Chang YH, Tan BKJ, Jain N, Tan TF, Rauz S, Ting DSW, Ting DSJ. Large language models approach expert-level clinical knowledge and reasoning in ophthalmology: A head-to-head cross-sectional study. PLOS DIGITAL HEALTH 2024; 3:e0000341. [PMID: 38630683 PMCID: PMC11023493 DOI: 10.1371/journal.pdig.0000341] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Accepted: 02/26/2024] [Indexed: 04/19/2024]
Abstract
Large language models (LLMs) underlie remarkable recent advanced in natural language processing, and they are beginning to be applied in clinical contexts. We aimed to evaluate the clinical potential of state-of-the-art LLMs in ophthalmology using a more robust benchmark than raw examination scores. We trialled GPT-3.5 and GPT-4 on 347 ophthalmology questions before GPT-3.5, GPT-4, PaLM 2, LLaMA, expert ophthalmologists, and doctors in training were trialled on a mock examination of 87 questions. Performance was analysed with respect to question subject and type (first order recall and higher order reasoning). Masked ophthalmologists graded the accuracy, relevance, and overall preference of GPT-3.5 and GPT-4 responses to the same questions. The performance of GPT-4 (69%) was superior to GPT-3.5 (48%), LLaMA (32%), and PaLM 2 (56%). GPT-4 compared favourably with expert ophthalmologists (median 76%, range 64-90%), ophthalmology trainees (median 59%, range 57-63%), and unspecialised junior doctors (median 43%, range 41-44%). Low agreement between LLMs and doctors reflected idiosyncratic differences in knowledge and reasoning with overall consistency across subjects and types (p>0.05). All ophthalmologists preferred GPT-4 responses over GPT-3.5 and rated the accuracy and relevance of GPT-4 as higher (p<0.05). LLMs are approaching expert-level knowledge and reasoning skills in ophthalmology. In view of the comparable or superior performance to trainee-grade ophthalmologists and unspecialised junior doctors, state-of-the-art LLMs such as GPT-4 may provide useful medical advice and assistance where access to expert ophthalmologists is limited. Clinical benchmarks provide useful assays of LLM capabilities in healthcare before clinical trials can be designed and conducted.
Collapse
Affiliation(s)
- Arun James Thirunavukarasu
- University of Cambridge School of Clinical Medicine, Cambridge, United Kingdom
- Oxford University Clinical Academic Graduate School, University of Oxford, Oxford, United Kingdom
| | - Shathar Mahmood
- University of Cambridge School of Clinical Medicine, Cambridge, United Kingdom
| | - Andrew Malem
- Eye Institute, Cleveland Clinic Abu Dhabi, Abu Dhabi Emirate, United Arab Emirates
| | - William Paul Foster
- University of Cambridge School of Clinical Medicine, Cambridge, United Kingdom
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, United Kingdom
| | - Rohan Sanghera
- University of Cambridge School of Clinical Medicine, Cambridge, United Kingdom
| | - Refaat Hassan
- University of Cambridge School of Clinical Medicine, Cambridge, United Kingdom
| | - Sean Zhou
- West Suffolk NHS Foundation Trust, Bury St Edmunds, United Kingdom
| | - Shiao Wei Wong
- Manchester Royal Eye Hospital, Manchester University NHS Foundation Trust, Manchester, United Kingdom
| | - Yee Ling Wong
- Manchester Royal Eye Hospital, Manchester University NHS Foundation Trust, Manchester, United Kingdom
| | - Yu Jeat Chong
- Birmingham and Midland Eye Centre, Sandwell and West Birmingham NHS Foundation Trust, Birmingham, United Kingdom
| | - Abdullah Shakeel
- University of Cambridge School of Clinical Medicine, Cambridge, United Kingdom
| | - Yin-Hsi Chang
- Department of Ophthalmology, Chang Gung Memorial Hospital, Linkou Medical Center, Taoyuan, Taiwan
| | | | - Nikhil Jain
- Bedfordshire Hospitals NHS Foundation Trust, Luton and Dunstable, United Kingdom
| | - Ting Fang Tan
- Singapore Eye Research Institute, Singapore National Eye Centre, Singapore, Singapore
| | - Saaeha Rauz
- Birmingham and Midland Eye Centre, Sandwell and West Birmingham NHS Foundation Trust, Birmingham, United Kingdom
- Academic Unit of Ophthalmology, Institute of Inflammation and Ageing, University of Birmingham, Birmingham, United Kingdom
| | - Daniel Shu Wei Ting
- Singapore Eye Research Institute, Singapore National Eye Centre, Singapore, Singapore
- Duke-NUS Medical School, Singapore, Singapore
- Byers Eye Institute, Stanford University, Palo Alto, California, United States of America
| | - Darren Shu Jeng Ting
- Birmingham and Midland Eye Centre, Sandwell and West Birmingham NHS Foundation Trust, Birmingham, United Kingdom
- Academic Unit of Ophthalmology, Institute of Inflammation and Ageing, University of Birmingham, Birmingham, United Kingdom
- Academic Ophthalmology, School of Medicine, University of Nottingham, Nottingham, United Kingdom
| |
Collapse
|
38
|
Zhou Y, Cosentino J, Yun T, Biradar MI, Shreibati J, Lai D, Schwantes-An TH, Luben R, McCaw Z, Engmann J, Providencia R, Schmidt AF, Munroe P, Yang H, Carroll A, Khawaja AP, McLean CY, Behsaz B, Hormozdiari F. Utilizing multimodal AI to improve genetic analyses of cardiovascular traits. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.03.19.24304547. [PMID: 38562791 PMCID: PMC10984061 DOI: 10.1101/2024.03.19.24304547] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/04/2024]
Abstract
Electronic health records, biobanks, and wearable biosensors contain multiple high-dimensional clinical data (HDCD) modalities (e.g., ECG, Photoplethysmography (PPG), and MRI) for each individual. Access to multimodal HDCD provides a unique opportunity for genetic studies of complex traits because different modalities relevant to a single physiological system (e.g., circulatory system) encode complementary and overlapping information. We propose a novel multimodal deep learning method, M-REGLE, for discovering genetic associations from a joint representation of multiple complementary HDCD modalities. We showcase the effectiveness of this model by applying it to several cardiovascular modalities. M-REGLE jointly learns a lower representation (i.e., latent factors) of multimodal HDCD using a convolutional variational autoencoder, performs genome wide association studies (GWAS) on each latent factor, then combines the results to study the genetics of the underlying system. To validate the advantages of M-REGLE and multimodal learning, we apply it to common cardiovascular modalities (PPG and ECG), and compare its results to unimodal learning methods in which representations are learned from each data modality separately, but the downstream genetic analyses are performed on the combined unimodal representations. M-REGLE identifies 19.3% more loci on the 12-lead ECG dataset, 13.0% more loci on the ECG lead I + PPG dataset, and its genetic risk score significantly outperforms the unimodal risk score at predicting cardiac phenotypes, such as atrial fibrillation (Afib), in multiple biobanks.
Collapse
Affiliation(s)
| | | | | | - Mahantesh I Biradar
- NIHR Biomedical Research Centre at Moorfields Eye Hospital & UCL Institute of Ophthalmology, London EC1V 9EL, UK
- MRC Epidemiology Unit, University of Cambridge, Cambridge CB2 0SL, UK
| | | | - Dongbing Lai
- Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN 46202, USA
| | - Tae-Hwi Schwantes-An
- Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN 46202, USA
| | - Robert Luben
- NIHR Biomedical Research Centre at Moorfields Eye Hospital & UCL Institute of Ophthalmology, London EC1V 9EL, UK
- MRC Epidemiology Unit, University of Cambridge, Cambridge CB2 0SL, UK
| | - Zachary McCaw
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Jorgen Engmann
- Center for Translational Genomics, Population Science and Experimental Medicine, Institute of Cardiovascular Science, University College London, UK
| | - Rui Providencia
- Institute of Health Informatics Research, University College London, London, UK
- Electrophysiology Department, Barts Heart Centre, St. Bartholomew's Hospital, London, UK
| | - Amand Floriaan Schmidt
- Department of Cardiology; Amsterdam University Medical Centres, Amsterdam, The Netherlands
- Institute of Cardiovascular Science; University College London, London, UK
- Division of Heart and Lungs, University Medical Center Utrecht, Utrecht, Netherlands
| | - Patricia Munroe
- William Harvey Research Institute, Barts and the London Faculty of Medicine and Dentistry, Queen Mary University of London, London, UK
| | - Howard Yang
- Google Research, San Francisco CA, 94105 USA
| | | | - Anthony P Khawaja
- NIHR Biomedical Research Centre at Moorfields Eye Hospital & UCL Institute of Ophthalmology, London EC1V 9EL, UK
- MRC Epidemiology Unit, University of Cambridge, Cambridge CB2 0SL, UK
| | | | | | | |
Collapse
|
39
|
Reddy S. Generative AI in healthcare: an implementation science informed translational path on application, integration and governance. Implement Sci 2024; 19:27. [PMID: 38491544 PMCID: PMC10941464 DOI: 10.1186/s13012-024-01357-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Accepted: 03/06/2024] [Indexed: 03/18/2024] Open
Abstract
BACKGROUND Artificial intelligence (AI), particularly generative AI, has emerged as a transformative tool in healthcare, with the potential to revolutionize clinical decision-making and improve health outcomes. Generative AI, capable of generating new data such as text and images, holds promise in enhancing patient care, revolutionizing disease diagnosis and expanding treatment options. However, the utility and impact of generative AI in healthcare remain poorly understood, with concerns around ethical and medico-legal implications, integration into healthcare service delivery and workforce utilisation. Also, there is not a clear pathway to implement and integrate generative AI in healthcare delivery. METHODS This article aims to provide a comprehensive overview of the use of generative AI in healthcare, focusing on the utility of the technology in healthcare and its translational application highlighting the need for careful planning, execution and management of expectations in adopting generative AI in clinical medicine. Key considerations include factors such as data privacy, security and the irreplaceable role of clinicians' expertise. Frameworks like the technology acceptance model (TAM) and the Non-Adoption, Abandonment, Scale-up, Spread and Sustainability (NASSS) model are considered to promote responsible integration. These frameworks allow anticipating and proactively addressing barriers to adoption, facilitating stakeholder participation and responsibly transitioning care systems to harness generative AI's potential. RESULTS Generative AI has the potential to transform healthcare through automated systems, enhanced clinical decision-making and democratization of expertise with diagnostic support tools providing timely, personalized suggestions. Generative AI applications across billing, diagnosis, treatment and research can also make healthcare delivery more efficient, equitable and effective. However, integration of generative AI necessitates meticulous change management and risk mitigation strategies. Technological capabilities alone cannot shift complex care ecosystems overnight; rather, structured adoption programs grounded in implementation science are imperative. CONCLUSIONS It is strongly argued in this article that generative AI can usher in tremendous healthcare progress, if introduced responsibly. Strategic adoption based on implementation science, incremental deployment and balanced messaging around opportunities versus limitations helps promote safe, ethical generative AI integration. Extensive real-world piloting and iteration aligned to clinical priorities should drive development. With conscientious governance centred on human wellbeing over technological novelty, generative AI can enhance accessibility, affordability and quality of care. As these models continue advancing rapidly, ongoing reassessment and transparent communication around their strengths and weaknesses remain vital to restoring trust, realizing positive potential and, most importantly, improving patient outcomes.
Collapse
Affiliation(s)
- Sandeep Reddy
- Deakin School of Medicine, Waurn Ponds, Geelong, VIC, 3215, Australia.
| |
Collapse
|
40
|
Wang Y, Yin C, Zhang P. Multimodal risk prediction with physiological signals, medical images and clinical notes. Heliyon 2024; 10:e26772. [PMID: 38455585 PMCID: PMC10918115 DOI: 10.1016/j.heliyon.2024.e26772] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Revised: 02/17/2024] [Accepted: 02/20/2024] [Indexed: 03/09/2024] Open
Abstract
The broad adoption of electronic health record (EHR) systems brings us a tremendous amount of clinical data and thus provides opportunities to conduct data-based healthcare research to solve various clinical problems in the medical domain. Machine learning and deep learning methods are widely used in the medical informatics and healthcare domain due to their power to mine insights from raw data. When adapting deep learning models for EHR data, it is essential to consider its heterogeneous nature: EHR contains patient records from various sources including medical tests (e.g. blood test, microbiology test), medical imaging, diagnosis, medications, procedures, clinical notes, etc. Those modalities together provide a holistic view of patient health status and complement each other. Therefore, combining data from multiple modalities that are intrinsically different is challenging but intuitively promising in deep learning for EHR. To assess the expectations of multimodal data, we introduce a comprehensive fusion framework designed to integrate temporal variables, medical images, and clinical notes in EHR for enhanced performance in clinical risk prediction. Early, joint, and late fusion strategies are employed to combine data from various modalities effectively. We test the model with three predictive tasks: in-hospital mortality, long length of stay, and 30-day readmission. Experimental results show that multimodal models outperform uni-modal models in the tasks involved. Additionally, by training models with different input modality combinations, we calculate the Shapley value for each modality to quantify their contribution to multimodal performance. It is shown that temporal variables tend to be more helpful than CXR images and clinical notes in the three explored predictive tasks.
Collapse
Affiliation(s)
- Yuanlong Wang
- Department of Computer Science and Engineering, The Ohio State University, Columbus, OH 43210, USA
| | - Changchang Yin
- Department of Computer Science and Engineering, The Ohio State University, Columbus, OH 43210, USA
- Department of Biomedical Informatics, The Ohio State University, Columbus, OH 43210, USA
| | - Ping Zhang
- Department of Computer Science and Engineering, The Ohio State University, Columbus, OH 43210, USA
- Department of Biomedical Informatics, The Ohio State University, Columbus, OH 43210, USA
| |
Collapse
|
41
|
Montesinos-López A, Crespo-Herrera L, Dreisigacker S, Gerard G, Vitale P, Saint Pierre C, Govindan V, Tarekegn ZT, Flores MC, Pérez-Rodríguez P, Ramos-Pulido S, Lillemo M, Li H, Montesinos-López OA, Crossa J. Deep learning methods improve genomic prediction of wheat breeding. FRONTIERS IN PLANT SCIENCE 2024; 15:1324090. [PMID: 38504889 PMCID: PMC10949530 DOI: 10.3389/fpls.2024.1324090] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Accepted: 02/19/2024] [Indexed: 03/21/2024]
Abstract
In the field of plant breeding, various machine learning models have been developed and studied to evaluate the genomic prediction (GP) accuracy of unseen phenotypes. Deep learning has shown promise. However, most studies on deep learning in plant breeding have been limited to small datasets, and only a few have explored its application in moderate-sized datasets. In this study, we aimed to address this limitation by utilizing a moderately large dataset. We examined the performance of a deep learning (DL) model and compared it with the widely used and powerful best linear unbiased prediction (GBLUP) model. The goal was to assess the GP accuracy in the context of a five-fold cross-validation strategy and when predicting complete environments using the DL model. The results revealed the DL model outperformed the GBLUP model in terms of GP accuracy for two out of the five included traits in the five-fold cross-validation strategy, with similar results in the other traits. This indicates the superiority of the DL model in predicting these specific traits. Furthermore, when predicting complete environments using the leave-one-environment-out (LOEO) approach, the DL model demonstrated competitive performance. It is worth noting that the DL model employed in this study extends a previously proposed multi-modal DL model, which had been primarily applied to image data but with small datasets. By utilizing a moderately large dataset, we were able to evaluate the performance and potential of the DL model in a context with more information and challenging scenario in plant breeding.
Collapse
Affiliation(s)
- Abelardo Montesinos-López
- Departamento de Matemáticas, Centro Universitario de Ciencias Exactas e Ingenierías (CUCEI), Universidad de Guadalajara, Guadalajara, Jalisco, Mexico
| | - Leonardo Crespo-Herrera
- International Maize and Wheat Improvement Center (CIMMYT), Texcoco, Estado. de México, Mexico
| | - Susanna Dreisigacker
- International Maize and Wheat Improvement Center (CIMMYT), Texcoco, Estado. de México, Mexico
| | - Guillermo Gerard
- International Maize and Wheat Improvement Center (CIMMYT), Texcoco, Estado. de México, Mexico
| | - Paolo Vitale
- International Maize and Wheat Improvement Center (CIMMYT), Texcoco, Estado. de México, Mexico
| | - Carolina Saint Pierre
- International Maize and Wheat Improvement Center (CIMMYT), Texcoco, Estado. de México, Mexico
| | - Velu Govindan
- International Maize and Wheat Improvement Center (CIMMYT), Texcoco, Estado. de México, Mexico
| | | | - Moisés Chavira Flores
- Instituto de Investigaciones en Matemáticas Aplicadas y Sistemas (IIMAS), Universidad Nacional Autónoma de México (UNAM), Ciudad Universitaria, Ciudad de México, Mexico
| | - Paulino Pérez-Rodríguez
- Estudios del Desarrollo Rural, Economía, Estadística y Cómputo Aplicado, Colegio de Postgraduados, Texcoco, Estado de México, Mexico
| | - Sofía Ramos-Pulido
- Departamento de Matemáticas, Centro Universitario de Ciencias Exactas e Ingenierías (CUCEI), Universidad de Guadalajara, Guadalajara, Jalisco, Mexico
| | - Morten Lillemo
- Department of Plant Science, Norwegian University of Life Science (NMBU), Ås, Norway
| | - Huihui Li
- 6State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences and CIMMYT China Office, Chinese Academy of Agricultural Sciences (CAAS), Beijing, China
| | | | - Jose Crossa
- International Maize and Wheat Improvement Center (CIMMYT), Texcoco, Estado. de México, Mexico
- Estudios del Desarrollo Rural, Economía, Estadística y Cómputo Aplicado, Colegio de Postgraduados, Texcoco, Estado de México, Mexico
| |
Collapse
|
42
|
Lin WC, Jordan BK, Scottoline B, Ostmo SR, Coyner AS, Singh P, Kalpathy-Cramer J, Erdogmus D, Chan RP, Chiang MF, Campbell JP. Oxygenation Fluctuations Associated with Severe Retinopathy of Prematurity: Insights from a Multimodal Deep Learning Approach. OPHTHALMOLOGY SCIENCE 2024; 4:100417. [PMID: 38059124 PMCID: PMC10696464 DOI: 10.1016/j.xops.2023.100417] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Revised: 09/27/2023] [Accepted: 10/18/2023] [Indexed: 12/08/2023]
Abstract
Purpose Retinopathy of prematurity (ROP) is one of the leading causes of blindness in children. Although the role of oxygen in the pathophysiology of ROP is well established, a precise understanding of the dynamic relationship between oxygen exposure ROP incidence and severity is lacking. The purpose of this study was to evaluate the correlation between time-dependent oxygen variables and the onset of ROP. Design Retrospective cohort study. Participants Two hundred thirty infants who were born at a single academic center and met the inclusion criteria were included. Infants are mainly born between January 2011 and October 2022. Methods Patient data were extracted from electronic health records (EHRs), with sufficient time-dependent oxygen data. Clinical outcomes for ROP were recorded as none/mild or moderate/severe (defined as type II or worse). Mixed-effects linear models were used to compare the 2 groups in terms of dynamic oxygen variables, such as daily average and the coefficient of variation (COV) fraction of inspired oxygen (FiO2). Support vector machine (SVM) and long-short-term memory (LSTM)-based multimodal models were trained with fivefold cross-validation to predict which infants would develop moderate/severe ROP. Gestational age (GA), birth weight, and time-dependent oxygen variables were used to develop predictive models. Main Outcome Measures Model cross-validation performance was evaluated by computing the mean area under the receiver operating characteristic (AUROC) curve, precision, recall, and F1 score. Results We found that both daily average and COV of FiO2 were associated with more severe ROP (adjusted P < 0.001). With fivefold cross-validation, the multimodal LSTM models had higher performance than the best static models (SVM using GA and 3 average FiO2 features) and SVM models trained on GA alone (mean AUROC = 0.89 ± 0.04 vs. 0.86 ± 0.05 vs. 0.83 ± 0.04). Conclusions The development of severe ROP might not only be influenced by oxygen exposure but also by its fluctuation, which provides direction for future study of pathophysiological factors associated with severe ROP development. Additionally, we demonstrated that multimodal neural networks can be a method to extract useful information from time-series data, which may be a valuable methodology for the investigation of other diseases using EHR data. Financial Disclosures Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.
Collapse
Affiliation(s)
- Wei-Chun Lin
- Department of Ophthalmology, Oregon Health & Science University, Portland, Oregon
| | - Brian K. Jordan
- Department of Neonatology, Oregon Health and Science University, Portland, Oregon
| | - Brian Scottoline
- Department of Neonatology, Oregon Health and Science University, Portland, Oregon
| | - Susan R. Ostmo
- Department of Ophthalmology, Oregon Health & Science University, Portland, Oregon
| | - Aaron S. Coyner
- Department of Ophthalmology, Oregon Health & Science University, Portland, Oregon
| | - Praveer Singh
- Department of Ophthalmology, University of Colorado (CU) School of Medicine, Denver, Colorado
| | | | - Deniz Erdogmus
- Department of Electrical and Computer Engineering, Northeastern University, Boston, Massachusetts
| | - R.V. Paul Chan
- Department of Ophthalmology and Visual Sciences, University of Illinois at Chicago, Chicago, Illinois
| | - Michael F. Chiang
- National Eye Institute, National Institutes of Health, Bethesda, Maryland
- National Library of Medicine, National Institutes of Health, Bethesda, Maryland
| | - J. Peter Campbell
- Department of Ophthalmology, Oregon Health & Science University, Portland, Oregon
| |
Collapse
|
43
|
Bilal A, Liu X, Shafiq M, Ahmed Z, Long H. NIMEQ-SACNet: A novel self-attention precision medicine model for vision-threatening diabetic retinopathy using image data. Comput Biol Med 2024; 171:108099. [PMID: 38364659 DOI: 10.1016/j.compbiomed.2024.108099] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2023] [Revised: 02/02/2024] [Accepted: 02/02/2024] [Indexed: 02/18/2024]
Abstract
In the realm of precision medicine, the potential of deep learning is progressively harnessed to facilitate intricate clinical decision-making, especially when navigating multifaceted datasets encompassing Omics, Clinical, image, device, social, and environmental dimensions. This study accentuates the criticality of image data, given its instrumental role in detecting and classifying vision-threatening diabetic retinopathy (VTDR) - a predominant global contributor to vision impairment. The timely identification of VTDR is a linchpin for efficacious interventions and the mitigation of vision loss. Addressing this, This study introduces "NIMEQ-SACNet," a novel hybrid model by the prowess of the Enhanced Quantum-Inspired Binary Grey Wolf Optimizer (EQI-BGWO) with a self-attention capsule network. The proposed approach is characterized by two pivotal advancements: firstly, the augmentation of the Binary Grey Wolf Optimization through Quantum Computing methodologies, and secondly, the deployment of the enhanced EQI-BGWO to adeptly calibrate the SACNet's parameters, culminating in a notable uplift in VTDR classification accuracy. The proposed model's ability to handle binary, 5-stage, and 7-stage VTDR classifications adroitly is noteworthy. Rigorous assessments on the fundus image dataset, underscored by metrics such as Accuracy, Sensitivity, Specificity, Precision, F1-Score, and MCC, bear testament to NIMEQ-SACNet's pre-eminence over prevailing algorithms and classification frameworks.
Collapse
Affiliation(s)
- Anas Bilal
- College of Information Science and Technology, Hainan Normal University, Haikou, 571158, China
| | - Xiaowen Liu
- College of Information Science and Technology, Hainan Normal University, Haikou, 571158, China
| | - Muhammad Shafiq
- School of Information Engineering, Qujing Normal University, Sichuan, China
| | - Zohaib Ahmed
- Department of Criminology and Forensic Sciences, Lahore Garrison University, Lahore, Pakistan
| | - Haixia Long
- College of Information Science and Technology, Hainan Normal University, Haikou, 571158, China.
| |
Collapse
|
44
|
Jiao Y, Hart R, Reading S, Zhang Y. Systematic review of automatic post-stroke gait classification systems. Gait Posture 2024; 109:259-270. [PMID: 38367457 DOI: 10.1016/j.gaitpost.2024.02.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/19/2023] [Revised: 01/11/2024] [Accepted: 02/12/2024] [Indexed: 02/19/2024]
Abstract
BACKGROUND Gait classification is a clinically helpful task performed after a stroke in order to guide rehabilitation therapy. Gait disorders are commonly identified using observational gait analysis in clinical settings, but this approach is limited due to low reliability and accuracy. Data-driven gait classification can quantify gait deviations and categorise gait patterns automatically possibly improving reliability and accuracy; however, the development and clinical utility of current data driven systems has not been reviewed previously. RESEARCH QUESTION The purpose of this systematic review is to evaluate the literature surrounding the methodology used to develop automatic gait classification systems, and their potential effectiveness in the clinical management of stroke-affected gait. METHOD The database search included PubMed, IEEE Xplore, and Scopus. Twenty-one studies were identified through inclusion and exclusion criteria from 407 available studies published between 2015 and 2022. Development methodology, classification performance, and clinical utility information were extracted for review. RESULTS AND SIGNIFICANCE Most of gait classification systems reported a classification accuracy between 80%-100%. However, collated studies presented methodological errors in machine learning (ML) model development. Further, many studies neglected model components such as clinical utility (e.g., predictions don't assist clinicians or therapists in making decisions, interpretability, and generalisability). We provided recommendations to guide development of future post-stroke automatic gait classification systems to better assist clinicians and therapists. Future automatic gait classification systems should emphasise the clinical significance and adopt a standardised development methodology of ML model.
Collapse
Affiliation(s)
- Yiran Jiao
- Department of Exercise Sciences, Faculty of Science, University of Auckland, Auckland 1023, New Zealand
| | - Rylea Hart
- Department of Exercise Sciences, Faculty of Science, University of Auckland, Auckland 1023, New Zealand
| | - Stacey Reading
- Department of Exercise Sciences, Faculty of Science, University of Auckland, Auckland 1023, New Zealand
| | - Yanxin Zhang
- Department of Exercise Sciences, Faculty of Science, University of Auckland, Auckland 1023, New Zealand.
| |
Collapse
|
45
|
van de Beld JJ, Crull D, Mikhal J, Geerdink J, Veldhuis A, Poel M, Kouwenhoven EA. Complication Prediction after Esophagectomy with Machine Learning. Diagnostics (Basel) 2024; 14:439. [PMID: 38396478 PMCID: PMC10888312 DOI: 10.3390/diagnostics14040439] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2023] [Revised: 12/28/2023] [Accepted: 12/29/2023] [Indexed: 02/25/2024] Open
Abstract
Esophageal cancer can be treated effectively with esophagectomy; however, the postoperative complication rate is high. In this paper, we study to what extent machine learning methods can predict anastomotic leakage and pneumonia up to two days in advance. We use a dataset with 417 patients who underwent esophagectomy between 2011 and 2021. The dataset contains multimodal temporal information, specifically, laboratory results, vital signs, thorax images, and preoperative patient characteristics. The best models scored mean test set AUROCs of 0.87 and 0.82 for leakage 1 and 2 days ahead, respectively. For pneumonia, this was 0.74 and 0.61 for 1 and 2 days ahead, respectively. We conclude that machine learning models can effectively predict anastomotic leakage and pneumonia after esophagectomy.
Collapse
Affiliation(s)
- Jorn-Jan van de Beld
- Faculty of EEMCS, University of Twente, 7500 AE Enschede, The Netherlands
- Hospital Group Twente (ZGT), 7609 PP Almelo, The Netherlands
| | - David Crull
- Hospital Group Twente (ZGT), 7609 PP Almelo, The Netherlands
| | - Julia Mikhal
- Hospital Group Twente (ZGT), 7609 PP Almelo, The Netherlands
- Faculty of BMS, University of Twente, 7500 AE Enschede, The Netherlands
| | - Jeroen Geerdink
- Hospital Group Twente (ZGT), 7609 PP Almelo, The Netherlands
| | - Anouk Veldhuis
- Hospital Group Twente (ZGT), 7609 PP Almelo, The Netherlands
| | - Mannes Poel
- Faculty of EEMCS, University of Twente, 7500 AE Enschede, The Netherlands
| | | |
Collapse
|
46
|
Sambarey A, Smith K, Chung C, Arora HS, Yang Z, Agarwal PP, Chandrasekaran S. Integrative analysis of multimodal patient data identifies personalized predictors of tuberculosis treatment prognosis. iScience 2024; 27:109025. [PMID: 38357663 PMCID: PMC10865408 DOI: 10.1016/j.isci.2024.109025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Revised: 12/08/2023] [Accepted: 01/22/2024] [Indexed: 02/16/2024] Open
Abstract
Tuberculosis (TB) afflicted 10.6 million people in 2021, and its global burden is increasing due to multidrug-resistant TB (MDR-TB) and extensively resistant TB (XDR-TB). Here, we analyze multi-domain information from 5,060 TB patients spanning 10 countries with high burden of MDR-TB from the NIAID TB Portals database to determine predictors of TB treatment outcome. Our analysis revealed significant associations between radiological, microbiological, therapeutic, and demographic data modalities. Our machine learning model, built with 203 features across modalities outperforms models built using each modality alone in predicting treatment outcomes, with an accuracy of 83% and area under the curve of 0.84. Notably, our analysis revealed that the drug regimens Bedaquiline-Clofazimine-Cycloserine-Levofloxacin-Linezolid and Bedaquiline-Clofazimine-Linezolid-Moxifloxacin were associated with treatment success and failure, respectively, for MDR non-XDR-TB. Drug combinations predicted to be synergistic by the INDIGO algorithm performed better than antagonistic combinations. Our prioritized set of features predictive of treatment outcomes can ultimately guide the personalized clinical management of TB.
Collapse
Affiliation(s)
- Awanti Sambarey
- Department of Biomedical Engineering, University of Michigan, Ann Arbor, MI 48109, USA
| | - Kirk Smith
- Department of Biomedical Engineering, University of Michigan, Ann Arbor, MI 48109, USA
| | - Carolina Chung
- Department of Biomedical Engineering, University of Michigan, Ann Arbor, MI 48109, USA
| | - Harkirat Singh Arora
- Department of Biomedical Engineering, University of Michigan, Ann Arbor, MI 48109, USA
| | - Zhenhua Yang
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI 48109, USA
| | - Prachi P. Agarwal
- Department of Radiology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Sriram Chandrasekaran
- Department of Biomedical Engineering, University of Michigan, Ann Arbor, MI 48109, USA
- Program in Chemical Biology, University of Michigan, Ann Arbor, MI 48109, USA
- Center for Bioinformatics and Computational Medicine, Ann Arbor, MI 48109, USA
| |
Collapse
|
47
|
Rajendran S, Pan W, Sabuncu MR, Chen Y, Zhou J, Wang F. Learning across diverse biomedical data modalities and cohorts: Challenges and opportunities for innovation. PATTERNS (NEW YORK, N.Y.) 2024; 5:100913. [PMID: 38370129 PMCID: PMC10873158 DOI: 10.1016/j.patter.2023.100913] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/20/2024]
Abstract
In healthcare, machine learning (ML) shows significant potential to augment patient care, improve population health, and streamline healthcare workflows. Realizing its full potential is, however, often hampered by concerns about data privacy, diversity in data sources, and suboptimal utilization of different data modalities. This review studies the utility of cross-cohort cross-category (C4) integration in such contexts: the process of combining information from diverse datasets distributed across distinct, secure sites. We argue that C4 approaches could pave the way for ML models that are both holistic and widely applicable. This paper provides a comprehensive overview of C4 in health care, including its present stage, potential opportunities, and associated challenges.
Collapse
Affiliation(s)
- Suraj Rajendran
- Tri-Institutional Computational Biology & Medicine Program, Cornell University, Ithaca, NY, USA
| | - Weishen Pan
- Division of Health Informatics, Department of Population Health Sciences, Weill Cornell Medicine, New York, NY, USA
| | - Mert R. Sabuncu
- School of Electrical and Computer Engineering, Cornell University, Ithaca, NY, USA
- Cornell Tech, Cornell University, New York, NY, USA
- Department of Radiology, Weill Cornell Medical School, New York, NY, USA
| | - Yong Chen
- Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, Philadelphia, PA, USA
| | - Jiayu Zhou
- Department of Computer Science and Engineering, Michigan State University, East Lansing, MI, USA
| | - Fei Wang
- Division of Health Informatics, Department of Population Health Sciences, Weill Cornell Medicine, New York, NY, USA
| |
Collapse
|
48
|
Becker M, Fehr K, Goguen S, Miliku K, Field C, Robertson B, Yonemitsu C, Bode L, Simons E, Marshall J, Dawod B, Mandhane P, Turvey SE, Moraes TJ, Subbarao P, Rodriguez N, Aghaeepour N, Azad MB. Multimodal machine learning for modeling infant head circumference, mothers' milk composition, and their shared environment. Sci Rep 2024; 14:2977. [PMID: 38316895 PMCID: PMC10844250 DOI: 10.1038/s41598-024-52323-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Accepted: 01/17/2024] [Indexed: 02/07/2024] Open
Abstract
Links between human milk (HM) and infant development are poorly understood and often focus on individual HM components. Here we apply multi-modal predictive machine learning to study HM and head circumference (a proxy for brain development) among 1022 mother-infant dyads of the CHILD Cohort. We integrated HM data (19 oligosaccharides, 28 fatty acids, 3 hormones, 28 chemokines) with maternal and infant demographic, health, dietary and home environment data. Head circumference was significantly predictable at 3 and 12 months. Two of the most associated features were HM n3-polyunsaturated fatty acid C22:6n3 (docosahexaenoic acid, DHA; p = 9.6e-05) and maternal intake of fish (p = 4.1e-03), a key dietary source of DHA with established relationships to brain function. Thus, using a systems biology approach, we identified meaningful relationships between HM and brain development, which validates our statistical approach, gives credence to the novel associations we observed, and sets the foundation for further research with additional cohorts and HM analytes.
Collapse
Affiliation(s)
- Martin Becker
- International Milk Composition (IMiC) Consortium, Winnipeg, Canada
- Stanford University, Stanford, 94305, USA
| | - Kelsey Fehr
- International Milk Composition (IMiC) Consortium, Winnipeg, Canada
- Manitoba Interdisciplinary Lactation Centre (MILC), Winnipeg, Canada
- Children's Hospital Research Institute of Manitoba, Winnipeg, Canada
- University of Manitoba, Winnipeg, R3E3P4, Canada
| | - Stephanie Goguen
- International Milk Composition (IMiC) Consortium, Winnipeg, Canada
- Manitoba Interdisciplinary Lactation Centre (MILC), Winnipeg, Canada
- Children's Hospital Research Institute of Manitoba, Winnipeg, Canada
- University of Manitoba, Winnipeg, R3E3P4, Canada
| | - Kozeta Miliku
- University of Toronto, Toronto, M5S 1A8, Canada
- McMaster University, Hamilton, M5S 1A8, Canada
| | | | | | - Chloe Yonemitsu
- University of California, San Diego, La Jolla, CA, 92093, USA
| | - Lars Bode
- International Milk Composition (IMiC) Consortium, Winnipeg, Canada
- University of California, San Diego, La Jolla, CA, 92093, USA
| | | | | | | | | | - Stuart E Turvey
- University of British Columbia and British Columbia Children's Hospital, Vancouver, V5Z4H4, Canada
| | | | - Padmaja Subbarao
- University of Toronto, Toronto, M5S 1A8, Canada
- McMaster University, Hamilton, M5S 1A8, Canada
- SickKids, Toronto, M5G 0A4, Canada
| | - Natalie Rodriguez
- International Milk Composition (IMiC) Consortium, Winnipeg, Canada
- Manitoba Interdisciplinary Lactation Centre (MILC), Winnipeg, Canada
- Children's Hospital Research Institute of Manitoba, Winnipeg, Canada
- University of Manitoba, Winnipeg, R3E3P4, Canada
| | - Nima Aghaeepour
- International Milk Composition (IMiC) Consortium, Winnipeg, Canada.
- Stanford University, Stanford, 94305, USA.
| | - Meghan B Azad
- International Milk Composition (IMiC) Consortium, Winnipeg, Canada.
- Manitoba Interdisciplinary Lactation Centre (MILC), Winnipeg, Canada.
- Children's Hospital Research Institute of Manitoba, Winnipeg, Canada.
- University of Manitoba, Winnipeg, R3E3P4, Canada.
| |
Collapse
|
49
|
Chung MK, House JS, Akhtari FS, Makris KC, Langston MA, Islam KT, Holmes P, Chadeau-Hyam M, Smirnov AI, Du X, Thessen AE, Cui Y, Zhang K, Manrai AK, Motsinger-Reif A, Patel CJ. Decoding the exposome: data science methodologies and implications in exposome-wide association studies (ExWASs). EXPOSOME 2024; 4:osae001. [PMID: 38344436 PMCID: PMC10857773 DOI: 10.1093/exposome/osae001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Revised: 10/16/2023] [Accepted: 11/20/2023] [Indexed: 03/07/2024]
Abstract
This paper explores the exposome concept and its role in elucidating the interplay between environmental exposures and human health. We introduce two key concepts critical for exposomics research. Firstly, we discuss the joint impact of genetics and environment on phenotypes, emphasizing the variance attributable to shared and nonshared environmental factors, underscoring the complexity of quantifying the exposome's influence on health outcomes. Secondly, we introduce the importance of advanced data-driven methods in large cohort studies for exposomic measurements. Here, we introduce the exposome-wide association study (ExWAS), an approach designed for systematic discovery of relationships between phenotypes and various exposures, identifying significant associations while controlling for multiple comparisons. We advocate for the standardized use of the term "exposome-wide association study, ExWAS," to facilitate clear communication and literature retrieval in this field. The paper aims to guide future health researchers in understanding and evaluating exposomic studies. Our discussion extends to emerging topics, such as FAIR Data Principles, biobanked healthcare datasets, and the functional exposome, outlining the future directions in exposomic research. This abstract provides a succinct overview of our comprehensive approach to understanding the complex dynamics of the exposome and its significant implications for human health.
Collapse
Affiliation(s)
- Ming Kei Chung
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- School of Public Health and Primary Care, The Chinese University of Hong Kong, Hong Kong, China
- Institute of Environment, Energy and Sustainability, The Chinese University of Hong Kong, Hong Kong, China
| | - John S House
- Biostatistics and Computational Biology Branch, Division of Intramural Research, National Institute of Environmental Health Sciences, Durham, NC, USA
| | - Farida S Akhtari
- Biostatistics and Computational Biology Branch, Division of Intramural Research, National Institute of Environmental Health Sciences, Durham, NC, USA
| | - Konstantinos C Makris
- Cyprus International Institute for Environmental and Public Health, School of Health Sciences, Cyprus University of Technology, Limassol, Cyprus
| | - Michael A Langston
- Department of Electrical Engineering and Computer Science, University of TN, Knoxville, TN, USA
| | - Khandaker Talat Islam
- Department of Population and Public Health Sciences, Keck School of Medicine of the University of Southern CA, Los Angeles, CA, USA
| | - Philip Holmes
- Department of Physics, Villanova University, Villanova, Philadelphia, USA
| | - Marc Chadeau-Hyam
- Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, London, UK
| | - Alex I Smirnov
- Department of Chemistry, NC State University, Raleigh, NC, USA
| | - Xiuxia Du
- Department of Bioinformatics and Genomics, College of Computing and Informatics, University of NC at Charlotte, Charlotte, NC, USA
| | - Anne E Thessen
- Department of Biomedical Informatics, University of CO Anschutz Medical Campus, Aurora, CO, USA
| | - Yuxia Cui
- Exposure, Response, and Technology Branch, Division of Extramural Research and Training, National Institute of Environmental Health Sciences, Durham, NC, USA
| | - Kai Zhang
- Department of Environmental Health Sciences, School of Public Health, University at Albany, State University of NY, Rensselaer, NY, USA
| | - Arjun K Manrai
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Alison Motsinger-Reif
- Biostatistics and Computational Biology Branch, Division of Intramural Research, National Institute of Environmental Health Sciences, Durham, NC, USA
| | - Chirag J Patel
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| |
Collapse
|
50
|
Mondello A, Dal Bo M, Toffoli G, Polano M. Machine learning in onco-pharmacogenomics: a path to precision medicine with many challenges. Front Pharmacol 2024; 14:1260276. [PMID: 38264526 PMCID: PMC10803549 DOI: 10.3389/fphar.2023.1260276] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Accepted: 12/26/2023] [Indexed: 01/25/2024] Open
Abstract
Over the past two decades, Next-Generation Sequencing (NGS) has revolutionized the approach to cancer research. Applications of NGS include the identification of tumor specific alterations that can influence tumor pathobiology and also impact diagnosis, prognosis and therapeutic options. Pharmacogenomics (PGx) studies the role of inheritance of individual genetic patterns in drug response and has taken advantage of NGS technology as it provides access to high-throughput data that can, however, be difficult to manage. Machine learning (ML) has recently been used in the life sciences to discover hidden patterns from complex NGS data and to solve various PGx problems. In this review, we provide a comprehensive overview of the NGS approaches that can be employed and the different PGx studies implicating the use of NGS data. We also provide an excursus of the ML algorithms that can exert a role as fundamental strategies in the PGx field to improve personalized medicine in cancer.
Collapse
Affiliation(s)
| | | | | | - Maurizio Polano
- Experimental and Clinical Pharmacology Unit, Centro di Riferimento Oncologico di Aviano (CRO), Istituto di Ricovero e Cura a Carattere Scientifico (IRCCS), Aviano, Italy
| |
Collapse
|