1
|
DeGroat W, Abdelhalim H, Peker E, Sheth N, Narayanan R, Zeeshan S, Liang BT, Ahmed Z. Multimodal AI/ML for discovering novel biomarkers and predicting disease using multi-omics profiles of patients with cardiovascular diseases. Sci Rep 2024; 14:26503. [PMID: 39489837 PMCID: PMC11532369 DOI: 10.1038/s41598-024-78553-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2024] [Accepted: 10/31/2024] [Indexed: 11/05/2024] Open
Abstract
Cardiovascular diseases (CVDs) are complex, multifactorial conditions that require personalized assessment and treatment. Advancements in multi-omics technologies, namely RNA sequencing and whole-genome sequencing, have provided translational researchers with a comprehensive view of the human genome. The efficient synthesis and analysis of this data through integrated approach that characterizes genetic variants alongside expression patterns linked to emerging phenotypes, can reveal novel biomarkers and enable the segmentation of patient populations based on personalized risk factors. In this study, we present a cutting-edge methodology rooted in the integration of traditional bioinformatics, classical statistics, and multimodal machine learning techniques. Our approach has the potential to uncover the intricate mechanisms underlying CVD, enabling patient-specific risk and response profiling. We sourced transcriptomic expression data and single nucleotide polymorphisms (SNPs) from both CVD patients and healthy controls. By integrating these multi-omics datasets with clinical demographic information, we generated patient-specific profiles. Utilizing a robust feature selection approach, we identified a signature of 27 transcriptomic features and SNPs that are effective predictors of CVD. Differential expression analysis, combined with minimum redundancy maximum relevance feature selection, highlighted biomarkers that explain the disease phenotype. This approach prioritizes both biological relevance and efficiency in machine learning. We employed Combination Annotation Dependent Depletion scores and allele frequencies to identify variants with pathogenic characteristics in CVD patients. Classification models trained on this signature demonstrated high-accuracy predictions for CVD. The best performing of these models was an XGBoost classifier optimized via Bayesian hyperparameter tuning, which was able to correctly classify all patients in our test dataset. Using SHapley Additive exPlanations, we created risk assessments for patients, offering further contextualization of these predictions in a clinical setting. Across the cohort, RPL36AP37 and HBA1 were scored as the most important biomarkers for predicting CVDs. A comprehensive literature review revealed that a substantial portion of the diagnostic biomarkers identified have previously been associated with CVD. The framework we propose in this study is unbiased and generalizable to other diseases and disorders.
Collapse
Affiliation(s)
- William DeGroat
- Rutgers Institute for Health, Health Care Policy and Aging Research, Rutgers, The State University of New Jersey, 112 Paterson St, New Brunswick, NJ, 08901, USA
| | - Habiba Abdelhalim
- Rutgers Institute for Health, Health Care Policy and Aging Research, Rutgers, The State University of New Jersey, 112 Paterson St, New Brunswick, NJ, 08901, USA
| | - Elizabeth Peker
- Rutgers Institute for Health, Health Care Policy and Aging Research, Rutgers, The State University of New Jersey, 112 Paterson St, New Brunswick, NJ, 08901, USA
| | - Neev Sheth
- Rutgers Institute for Health, Health Care Policy and Aging Research, Rutgers, The State University of New Jersey, 112 Paterson St, New Brunswick, NJ, 08901, USA
| | - Rishabh Narayanan
- Rutgers Institute for Health, Health Care Policy and Aging Research, Rutgers, The State University of New Jersey, 112 Paterson St, New Brunswick, NJ, 08901, USA
| | - Saman Zeeshan
- Department of Biomedical and Health Informatics, UMKC School of Medicine, 2411 Holmes Street, Kansas City, MO, 64108, USA
| | - Bruce T Liang
- Pat and Jim Calhoun Cardiology Center, UConn Health, 263 Farmington Ave, Farmington, CT, USA
- UConn School of Medicine, University of Connecticut, 263 Farmington Ave, Farmington, CT, USA
| | - Zeeshan Ahmed
- Rutgers Institute for Health, Health Care Policy and Aging Research, Rutgers, The State University of New Jersey, 112 Paterson St, New Brunswick, NJ, 08901, USA.
- UConn School of Medicine, University of Connecticut, 263 Farmington Ave, Farmington, CT, USA.
- Department of Medicine, Division of Cardiovascular Disease and Hypertension, Robert Wood Johnson Medical School, Rutgers Health, 125 Paterson St, New Brunswick, NJ, 08901, USA.
- Rutgers Institute for Health, Health Care Policy and Aging Research, Rutgers University, 112 Paterson Street, New Brunswick, NJ, 08901, USA.
| |
Collapse
|
2
|
Aberuyi N, Rahgozar S, Ghodousi ES, Ghaedi K. Drug Resistance Biomarkers and Their Clinical Applications in Childhood Acute Lymphoblastic Leukemia. Front Oncol 2020; 9:1496. [PMID: 32010613 PMCID: PMC6978753 DOI: 10.3389/fonc.2019.01496] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2019] [Accepted: 12/12/2019] [Indexed: 12/12/2022] Open
Abstract
Biomarkers are biological molecules found in body fluids or tissues, which can be considered as indications of a normal or abnormal process, or of a condition or disease. There are various types of biomarkers based on their application and molecular alterations. Treatment-sensitivity or drug resistance biomarkers include prognostic and predictive molecules with utmost importance in selecting appropriate treatment protocols and improving survival rates. Acute lymphoblastic leukemia (ALL) is the most prevalent hematological malignancy diagnosed in children with nearly 80% cure rate. Despite the favorable survival rates of childhood ALL (chALL), resistance to chemotherapeutic agents and, as a consequence, a dismal prognosis develops in a significant number of patients. Therefore, there are urgent needs to have robust, sensitive, and disease-specific molecular prognostic and predictive biomarkers, which could allow better risk classification and then better clinical results. In this article, we review the currently known drug resistance biomarkers, including somatic or germ line nucleic acids, epigenetic alterations, protein expressions and metabolic variations. Moreover, biomarkers with potential clinical applications are discussed.
Collapse
Affiliation(s)
- Narges Aberuyi
- Division of Cellular and Molecular Biology, Department of Cell and Molecular Biology & Microbiology, Faculty of Biological Sciences and Technologies, University of Isfahan, Isfahan, Iran
| | - Soheila Rahgozar
- Division of Cellular and Molecular Biology, Department of Cell and Molecular Biology & Microbiology, Faculty of Biological Sciences and Technologies, University of Isfahan, Isfahan, Iran
| | - Elaheh Sadat Ghodousi
- Division of Cellular and Molecular Biology, Department of Cell and Molecular Biology & Microbiology, Faculty of Biological Sciences and Technologies, University of Isfahan, Isfahan, Iran
| | - Kamran Ghaedi
- Division of Cellular and Molecular Biology, Department of Cell and Molecular Biology & Microbiology, Faculty of Biological Sciences and Technologies, University of Isfahan, Isfahan, Iran
| |
Collapse
|