1
|
Ahmed Z, Degroat W, Abdelhalim H, Zeeshan S, Fine D. Deciphering genomic signatures associating human dental oral craniofacial diseases with cardiovascular diseases using machine learning approaches. Clin Oral Investig 2024; 28:52. [PMID: 38163819 DOI: 10.1007/s00784-023-05406-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2023] [Accepted: 12/02/2023] [Indexed: 01/03/2024]
Abstract
OBJECTIVES Periodontal diseases are chronic, inflammatory disorders that involve the destruction of supporting tissues surrounding the teeth which leads to permanent damage and substantially heightens systemic exposure. If left untreated, dental, oral, and craniofacial diseases (DOCs), especially periodontitis, can increase an individual's risk in developing complex traits including cardiovascular diseases (CVDs). In this study, we are focused on systematically investigating causality between periodontitis with CVDs with the application of artificial intelligence (AI), machine learning (ML) algorithms, and state-of-the-art bioinformatics approaches using RNA-seq-driven gene expression data of CVD patients. MATERIALS AND METHODS In this study, we built a cohort of CVD patients, collected their blood samples, and performed RNA-seq and gene expression analysis to generate transcriptomic profiles. We proposed a nexus of AI/ML approaches for the identification of significant biomarkers, and predictive analysis. We implemented recursive feature elimination, Pearson correlation, chi-square, and analysis of variance to detect significant biomarkers, and utilized random forest and support vector machines for predictive analysis. RESULTS Our AI/ML analyses have led us to the preliminary conclusion that GAS5, GPX1, HLA-B, and SNHG6 are the potential gene markers that can be used to explain the causal relationship between periodontitis and CVDs. CONCLUSIONS CVDs are relatively common in patients with periodontal disease, and an increased risk of CVD is associated with periodontal disease independent of gender. Genetic susceptibility contributing to periodontitis and CVDs have been suggested to some extent, based on the similar degree of heritability shared between both complex diseases.
Collapse
Affiliation(s)
- Zeeshan Ahmed
- Department of Medicine/Cardiovascular Disease and Hypertension, Robert Wood Johnson Medical School, Rutgers Biomedical and Health Sciences, 125 Paterson St, New Brunswick, NJ, USA.
- Rutgers Institute for Health, Health Care Policy and Aging Research, Rutgers University, 112 Paterson St, New Brunswick, NJ, USA.
| | - William Degroat
- Rutgers Institute for Health, Health Care Policy and Aging Research, Rutgers University, 112 Paterson St, New Brunswick, NJ, USA
| | - Habiba Abdelhalim
- Rutgers Institute for Health, Health Care Policy and Aging Research, Rutgers University, 112 Paterson St, New Brunswick, NJ, USA
| | - Saman Zeeshan
- Department of Radiation Oncology, Washington University School of Medicine, 660 S. Euclid Ave, St. Louis, MO, USA
| | - Daniel Fine
- Department of Oral Biology, Rutgers School of Dental Medicine, 110 Bergen Street, Newark, NJ, USA
| |
Collapse
|
2
|
Mhatre I, Abdelhalim H, Degroat W, Ashok S, Liang BT, Ahmed Z. Functional mutation, splice, distribution, and divergence analysis of impactful genes associated with heart failure and other cardiovascular diseases. Sci Rep 2023; 13:16769. [PMID: 37798313 PMCID: PMC10556087 DOI: 10.1038/s41598-023-44127-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2023] [Accepted: 10/04/2023] [Indexed: 10/07/2023] Open
Abstract
Cardiovascular disease (CVD) is caused by a multitude of complex and largely heritable conditions. Identifying key genes and understanding their susceptibility to CVD in the human genome can assist in early diagnosis and personalized treatment of the relevant patients. Heart failure (HF) is among those CVD phenotypes that has a high rate of mortality. In this study, we investigated genes primarily associated with HF and other CVDs. Achieving the goals of this study, we built a cohort of thirty-five consented patients, and sequenced their serum-based samples. We have generated and processed whole genome sequence (WGS) data, and performed functional mutation, splice, variant distribution, and divergence analysis to understand the relationships between each mutation type and its impact. Our variant and prevalence analysis found FLNA, CST3, LGALS3, and HBA1 linked to many enrichment pathways. Functional mutation analysis uncovered ACE, MME, LGALS3, NR3C2, PIK3C2A, CALD1, TEK, and TRPV1 to be notable and potentially significant genes. We discovered intron, 5' Flank, 3' UTR, and 3' Flank mutations to be the most common among HF and other CVD genes. Missense mutations were less common among HF and other CVD genes but had more of a functional impact. We reported HBA1, FADD, NPPC, ADRB2, ADBR1, MYH6, and PLN to be consequential based on our divergence analysis.
Collapse
Affiliation(s)
- Ishani Mhatre
- Institute for Health, Health Care Policy and Aging Research, Rutgers University, 112 Paterson Street, New Brunswick, NJ, 08901, USA
| | - Habiba Abdelhalim
- Institute for Health, Health Care Policy and Aging Research, Rutgers University, 112 Paterson Street, New Brunswick, NJ, 08901, USA
| | - William Degroat
- Institute for Health, Health Care Policy and Aging Research, Rutgers University, 112 Paterson Street, New Brunswick, NJ, 08901, USA
| | - Shreya Ashok
- Institute for Health, Health Care Policy and Aging Research, Rutgers University, 112 Paterson Street, New Brunswick, NJ, 08901, USA
| | - Bruce T Liang
- Pat and Jim Calhoun Cardiology Center, UConn Health, 263 Farmington Ave, Farmington, CT, USA
- UConn School of Medicine, University of Connecticut, 263 Farmington Ave, Farmington, CT, USA
| | - Zeeshan Ahmed
- Institute for Health, Health Care Policy and Aging Research, Rutgers University, 112 Paterson Street, New Brunswick, NJ, 08901, USA.
- Department of Genetics and Genome Sciences, UConn Health, 400 Farmington Ave, Farmington, CT, USA.
- Department of Medicine/Cardiovascular Disease and Hypertension, Robert Wood Johnson Medical School, Rutgers Biomedical and Health Sciences, 125 Paterson St, New Brunswick, NJ, USA.
| |
Collapse
|
3
|
Venkat V, Abdelhalim H, DeGroat W, Zeeshan S, Ahmed Z. Investigating genes associated with heart failure, atrial fibrillation, and other cardiovascular diseases, and predicting disease using machine learning techniques for translational research and precision medicine. Genomics 2023; 115:110584. [PMID: 36813091 DOI: 10.1016/j.ygeno.2023.110584] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2022] [Revised: 02/06/2023] [Accepted: 02/11/2023] [Indexed: 02/22/2023]
Abstract
Cardiovascular disease (CVD) is the leading cause of mortality and loss of disability adjusted life years (DALYs) globally. CVDs like Heart Failure (HF) and Atrial Fibrillation (AF) are associated with physical effects on the heart muscles. As a result of the complex nature, progression, inherent genetic makeup, and heterogeneity of CVDs, personalized treatments are believed to be critical. Rightful application of artificial intelligence (AI) and machine learning (ML) approaches can lead to new insights into CVDs for providing better personalized treatments with predictive analysis and deep phenotyping. In this study we focused on implementing AI/ML techniques on RNA-seq driven gene-expression data to investigate genes associated with HF, AF, and other CVDs, and predict disease with high accuracy. The study involved generating RNA-seq data derived from the serum of consented CVD patients. Next, we processed the sequenced data using our RNA-seq pipeline and applied GVViZ for gene-disease data annotation and expression analysis. To achieve our research objectives, we developed a new Findable, Accessible, Intelligent, and Reproducible (FAIR) approach that includes a five-level biostatistical evaluation, primarily based on the Random Forest (RF) algorithm. During our AI/ML analysis, we have fitted, trained, and implemented our model to classify and distinguish high-risk CVD patients based on their age, gender, and race. With the successful execution of our model, we predicted the association of highly significant HF, AF, and other CVDs genes with demographic variables.
Collapse
Affiliation(s)
- Vignesh Venkat
- Rutgers Institute for Health, Health Care Policy and Aging Research, Rutgers University, 112 Paterson St, New Brunswick, NJ, USA
| | - Habiba Abdelhalim
- Rutgers Institute for Health, Health Care Policy and Aging Research, Rutgers University, 112 Paterson St, New Brunswick, NJ, USA
| | - William DeGroat
- Rutgers Institute for Health, Health Care Policy and Aging Research, Rutgers University, 112 Paterson St, New Brunswick, NJ, USA
| | - Saman Zeeshan
- Rutgers Cancer Institute of New Jersey, Rutgers University, 195 Little Albany St, New Brunswick, NJ, USA
| | - Zeeshan Ahmed
- Rutgers Institute for Health, Health Care Policy and Aging Research, Rutgers University, 112 Paterson St, New Brunswick, NJ, USA; Department of Medicine, Robert Wood Johnson Medical School, Rutgers Biomedical and Health Sciences, 125 Paterson St, New Brunswick, NJ, USA.
| |
Collapse
|
4
|
Berber A, Abdelhalim H, Zeeshan S, Vadapalli S, von Oehsen B, Yanamala N, Sengupta P, Ahmed Z. RNA-seq-driven expression analysis to investigate cardiovascular disease genes with associated phenotypes among atrial fibrillation patients. Clin Transl Med 2022; 12:e974. [PMID: 35875838 PMCID: PMC9309637 DOI: 10.1002/ctm2.974] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2022] [Revised: 06/22/2022] [Accepted: 06/27/2022] [Indexed: 11/28/2022] Open
Affiliation(s)
- Asude Berber
- Rutgers Institute for Health, Health Care Policy and Aging Research, Rutgers University, New Brunswick, New Jersey, USA
| | - Habiba Abdelhalim
- Rutgers Institute for Health, Health Care Policy and Aging Research, Rutgers University, New Brunswick, New Jersey, USA
| | - Saman Zeeshan
- Rutgers Cancer Institute of New Jersey, Rutgers University, New Brunswick, New Jersey, USA
| | - Sreya Vadapalli
- Rutgers Institute for Health, Health Care Policy and Aging Research, Rutgers University, New Brunswick, New Jersey, USA
| | - Barr von Oehsen
- Office of Advanced Research Computing, Rutgers, The State University of New Jersey, Computing Research and Education (CoRE) Building, Piscataway, New Jersey, USA
| | - Naveena Yanamala
- Division of Cardiovascular Disease, Robert Wood Johnson Medical School, Rutgers Biomedical and Health Sciences, New Brunswick, New Jersey, USA
| | - Partho Sengupta
- Division of Cardiovascular Disease, Robert Wood Johnson Medical School, Rutgers Biomedical and Health Sciences, New Brunswick, New Jersey, USA
| | - Zeeshan Ahmed
- Rutgers Institute for Health, Health Care Policy and Aging Research, Rutgers University, New Brunswick, New Jersey, USA.,Department of Medicine, Robert Wood Johnson Medical School, Rutgers Biomedical and Health Sciences, New Brunswick, New Jersey, USA
| |
Collapse
|
5
|
Vadapalli S, Abdelhalim H, Zeeshan S, Ahmed Z. Artificial intelligence and machine learning approaches using gene expression and variant data for personalized medicine. Brief Bioinform 2022; 23:6590150. [PMID: 35595537 DOI: 10.1093/bib/bbac191] [Citation(s) in RCA: 27] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2022] [Revised: 04/02/2022] [Accepted: 04/26/2022] [Indexed: 12/16/2022] Open
Abstract
Precision medicine uses genetic, environmental and lifestyle factors to more accurately diagnose and treat disease in specific groups of patients, and it is considered one of the most promising medical efforts of our time. The use of genetics is arguably the most data-rich and complex components of precision medicine. The grand challenge today is the successful assimilation of genetics into precision medicine that translates across different ancestries, diverse diseases and other distinct populations, which will require clever use of artificial intelligence (AI) and machine learning (ML) methods. Our goal here was to review and compare scientific objectives, methodologies, datasets, data sources, ethics and gaps of AI/ML approaches used in genomics and precision medicine. We selected high-quality literature published within the last 5 years that were indexed and available through PubMed Central. Our scope was narrowed to articles that reported application of AI/ML algorithms for statistical and predictive analyses using whole genome and/or whole exome sequencing for gene variants, and RNA-seq and microarrays for gene expression. We did not limit our search to specific diseases or data sources. Based on the scope of our review and comparative analysis criteria, we identified 32 different AI/ML approaches applied in variable genomics studies and report widely adapted AI/ML algorithms for predictive diagnostics across several diseases.
Collapse
Affiliation(s)
- Sreya Vadapalli
- Rutgers Institute for Health, Health Care Policy and Aging Research, Rutgers University, 112 Paterson St, New Brunswick, NJ, USA
| | - Habiba Abdelhalim
- Rutgers Institute for Health, Health Care Policy and Aging Research, Rutgers University, 112 Paterson St, New Brunswick, NJ, USA
| | - Saman Zeeshan
- Rutgers Cancer Institute of New Jersey, Rutgers University, 195 Little Albany St, New Brunswick, NJ, USA
| | - Zeeshan Ahmed
- Rutgers Institute for Health, Health Care Policy and Aging Research, Rutgers University, 112 Paterson St, New Brunswick, NJ, USA.,Department of Medicine, Robert Wood Johnson Medical School, Rutgers Biomedical and Health Sciences, 125 Paterson St, New Brunswick, NJ, USA
| |
Collapse
|
6
|
Multi-omics strategies for personalized and predictive medicine: past, current, and future translational opportunities. Emerg Top Life Sci 2022; 6:215-225. [PMID: 35234253 DOI: 10.1042/etls20210244] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2021] [Revised: 02/13/2022] [Accepted: 02/21/2022] [Indexed: 12/12/2022]
Abstract
Precision medicine is driven by the paradigm shift of empowering clinicians to predict the most appropriate course of action for patients with complex diseases and improve routine medical and public health practice. It promotes integrating collective and individualized clinical data with patient specific multi-omics data to develop therapeutic strategies, and knowledgebase for predictive and personalized medicine in diverse populations. This study is based on the hypothesis that understanding patient's metabolomics and genetic make-up in conjunction with clinical data will significantly lead to determining predisposition, diagnostic, prognostic and predictive biomarkers and optimal paths providing personalized care for diverse and targeted chronic, acute, and infectious diseases. This study briefs emerging significant, and recently reported multi-omics and translational approaches aimed to facilitate implementation of precision medicine. Furthermore, it discusses current grand challenges, and the future need of Findable, Accessible, Intelligent, and Reproducible (FAIR) approach to accelerate diagnostic and preventive care delivery strategies beyond traditional symptom-driven, disease-causal medical practice.
Collapse
|
7
|
Ahmed Z. Precision medicine with multi-omics strategies, deep phenotyping, and predictive analysis. PROGRESS IN MOLECULAR BIOLOGY AND TRANSLATIONAL SCIENCE 2022; 190:101-125. [DOI: 10.1016/bs.pmbts.2022.02.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|
8
|
Ahmed Z, Zeeshan S, Liang BT. RNA-seq driven expression and enrichment analysis to investigate CVD genes with associated phenotypes among high-risk heart failure patients. Hum Genomics 2021; 15:67. [PMID: 34774109 PMCID: PMC8590246 DOI: 10.1186/s40246-021-00367-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Accepted: 10/31/2021] [Indexed: 01/08/2023] Open
Abstract
Background Heart failure (HF) is one of the most common complications of cardiovascular diseases (CVDs) and among the leading causes of death in the US. Many other CVDs can lead to increased mortality as well. Investigating the genetic epidemiology and susceptibility to CVDs is a central focus of cardiology and biomedical life sciences. Several studies have explored expression of key CVD genes specially in HF, yet new targets and biomarkers for early diagnosis are still missing to support personalized treatment. Lack of gender-specific cardiac biomarker thresholds in men and women may be the reason for CVD underdiagnosis in women, and potentially increased morbidity and mortality as a result, or conversely, an overdiagnosis in men. In this context, it is important to analyze the expression and enrichment of genes with associated phenotypes and disease-causing variants among high-risk CVD populations. Methods We performed RNA sequencing focusing on key CVD genes with a great number of genetic associations to HF. Peripheral blood samples were collected from a broad age range of adult male and female CVD patients. These patients were clinically diagnosed with CVDs and CMS/HCC HF, as well as including cardiomyopathy, hypertension, obesity, diabetes, asthma, high cholesterol, hernia, chronic kidney, joint pain, dizziness and giddiness, osteopenia of multiple sites, chest pain, osteoarthritis, and other diseases. Results We report RNA-seq driven case–control study to analyze patterns of expression in genes and differentiating the pathways, which differ between healthy and diseased patients. Our in-depth gene expression and enrichment analysis of RNA-seq data from patients with mostly HF and other CVDs on differentially expressed genes and CVD annotated genes revealed 4,885 differentially expressed genes (DEGs) and regulation of 41 genes known for HF and 23 genes related to other CVDs, with 15 DEGs as significantly expressed including four genes already known (FLNA, CST3, LGALS3, and HBA1) for HF and CVDs with the enrichment of many pathways. Furthermore, gender and ethnic group specific analysis showed shared and unique genes between the genders, and among different races. Broadening the scope of the results in clinical settings, we have linked the CVD genes with ICD codes. Conclusions Many pathways were found to be enriched, and gender-specific analysis showed shared and unique genes between the genders. Additional testing of these genes may lead to the development of new clinical tools to improve diagnosis and prognosis of CVD patients. Supplementary Information The online version contains supplementary material available at 10.1186/s40246-021-00367-8.
Collapse
Affiliation(s)
- Zeeshan Ahmed
- Rutgers Institute for Health, Health Care Policy and Aging Research, Rutgers University, 112 Paterson Street, New Brunswick, NJ, 08901, USA. .,Department of Medicine, Robert Wood Johnson Medical School, Rutgers Biomedical and Health Sciences, 125 Paterson St, New Brunswick, NJ, USA. .,Department of Genetics and Genome Sciences, UConn Health, 400 Farmington Ave, Farmington, CT, USA. .,Pat and Jim Calhoun Cardiology Center, UConn School of Medicine, University of Connecticut Health Center, 263 Farmington Ave, Farmington, CT, USA.
| | - Saman Zeeshan
- Rutgers Cancer Institute of New Jersey, Rutgers University, 195 Little Albany St, New Brunswick, NJ, USA
| | - Bruce T Liang
- Pat and Jim Calhoun Cardiology Center, UConn School of Medicine, University of Connecticut Health Center, 263 Farmington Ave, Farmington, CT, USA
| |
Collapse
|
9
|
Ahmed Z. Intelligent health system for the investigation of consenting COVID-19 patients and precision medicine. Per Med 2021; 18:573-582. [PMID: 34619976 PMCID: PMC8544483 DOI: 10.2217/pme-2021-0068] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
Advancing frontiers of clinical research, we discuss the need for intelligent health systems to support a deeper investigation of COVID-19. We hypothesize that the convergence of the healthcare data and staggering developments in artificial intelligence have the potential to elevate the recovery process with diagnostic and predictive analysis to identify major causes of mortality, modifiable risk factors and actionable information that supports the early detection and prevention of COVID-19. However, current constraints include the recruitment of COVID-19 patients for research; translational integration of electronic health records and diversified public datasets; and the development of artificial intelligence systems for data-intensive computational modeling to assist clinical decision making. We propose a novel nexus of machine learning algorithms to examine COVID-19 data granularity from population studies to subgroups stratification and ensure best modeling strategies within the data continuum.
Collapse
Affiliation(s)
- Zeeshan Ahmed
- Rutgers Institute for Health, Health Care Policy & Aging Research, Rutgers University, 112 Paterson Street, New Brunswick, NJ 08901, USA.,Department of Medicine, Robert Wood Johnson Medical School, Rutgers Biomedical & Health Sciences, 125 Paterson Street, New Brunswick, NJ 08901, USA
| |
Collapse
|
10
|
Ahmed Z, Renart EG, Zeeshan S. Genomics pipelines to investigate susceptibility in whole genome and exome sequenced data for variant discovery, annotation, prediction and genotyping. PeerJ 2021; 9:e11724. [PMID: 34395068 PMCID: PMC8320519 DOI: 10.7717/peerj.11724] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Accepted: 06/14/2021] [Indexed: 12/12/2022] Open
Abstract
Over the last few decades, genomics is leading toward audacious future, and has been changing our views about conducting biomedical research, studying diseases, and understanding diversity in our society across the human species. The whole genome and exome sequencing (WGS/WES) are two of the most popular next-generation sequencing (NGS) methodologies that are currently being used to detect genetic variations of clinical significance. Investigating WGS/WES data for the variant discovery and genotyping is based on the nexus of different data analytic applications. Although several bioinformatics applications have been developed, and many of those are freely available and published. Timely finding and interpreting genetic variants are still challenging tasks among diagnostic laboratories and clinicians. In this study, we are interested in understanding, evaluating, and reporting the current state of solutions available to process the NGS data of variable lengths and types for the identification of variants, alleles, and haplotypes. Residing within the scope, we consulted high quality peer reviewed literature published in last 10 years. We were focused on the standalone and networked bioinformatics applications proposed to efficiently process WGS and WES data, and support downstream analysis for gene-variant discovery, annotation, prediction, and interpretation. We have discussed our findings in this manuscript, which include but not are limited to the set of operations, workflow, data handling, involved tools, technologies and algorithms and limitations of the assessed applications.
Collapse
Affiliation(s)
- Zeeshan Ahmed
- Institute for Health, Health Care Policy and Aging Research, Rutgers, The State University of New Jersey, New Brunswick, NJ, USA.,Department of Medicine, Robert Wood Johnson Medical School, Rutgers, The State University of New Jersey, New Brunswick, NJ, USA
| | - Eduard Gibert Renart
- Institute for Health, Health Care Policy and Aging Research, Rutgers, The State University of New Jersey, New Brunswick, NJ, USA
| | - Saman Zeeshan
- Cancer Institute of New Jersey, Rutgers, The State University of New Jersey, New Brunswick, NJ, USA
| |
Collapse
|