1
|
Zhang Z, Liu X, Zhang S, Song Z, Lu K, Yang W. A review and analysis of key biomarkers in Alzheimer's disease. Front Neurosci 2024; 18:1358998. [PMID: 38445255 PMCID: PMC10912539 DOI: 10.3389/fnins.2024.1358998] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Accepted: 02/02/2024] [Indexed: 03/07/2024] Open
Abstract
Alzheimer's disease (AD) is a progressive neurodegenerative disorder that affects over 50 million elderly individuals worldwide. Although the pathogenesis of AD is not fully understood, based on current research, researchers are able to identify potential biomarker genes and proteins that may serve as effective targets against AD. This article aims to present a comprehensive overview of recent advances in AD biomarker identification, with highlights on the use of various algorithms, the exploration of relevant biological processes, and the investigation of shared biomarkers with co-occurring diseases. Additionally, this article includes a statistical analysis of key genes reported in the research literature, and identifies the intersection with AD-related gene sets from databases such as AlzGen, GeneCard, and DisGeNet. For these gene sets, besides enrichment analysis, protein-protein interaction (PPI) networks utilized to identify central genes among the overlapping genes. Enrichment analysis, protein interaction network analysis, and tissue-specific connectedness analysis based on GTEx database performed on multiple groups of overlapping genes. Our work has laid the foundation for a better understanding of the molecular mechanisms of AD and more accurate identification of key AD markers.
Collapse
Affiliation(s)
- Zhihao Zhang
- School of Computer Science and Technology, Xinjiang University, Ürümqi, China
- College of Medical Engineering and Technology, Xinjiang Medical University, Ürümqi, China
| | - Xiangtao Liu
- College of Medical Engineering and Technology, Xinjiang Medical University, Ürümqi, China
| | - Suixia Zhang
- College of Medical Engineering and Technology, Xinjiang Medical University, Ürümqi, China
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou, China
- State Key Laboratory of Pathogenesis, Prevention, Treatment of Central Asian High Incidence Diseases, First Affiliated Hospital of Xinjiang Medical University, Ürümqi, China
| | - Zhixin Song
- College of Medical Engineering and Technology, Xinjiang Medical University, Ürümqi, China
| | - Ke Lu
- School of Computer Science and Technology, Xinjiang University, Ürümqi, China
| | - Wenzhong Yang
- School of Computer Science and Technology, Xinjiang University, Ürümqi, China
| |
Collapse
|
2
|
Geraci J, Bhargava R, Qorri B, Leonchyk P, Cook D, Cook M, Sie F, Pani L. Machine learning hypothesis-generation for patient stratification and target discovery in rare disease: our experience with Open Science in ALS. Front Comput Neurosci 2024; 17:1199736. [PMID: 38260713 PMCID: PMC10801647 DOI: 10.3389/fncom.2023.1199736] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Accepted: 11/20/2023] [Indexed: 01/24/2024] Open
Abstract
Introduction Advances in machine learning (ML) methodologies, combined with multidisciplinary collaborations across biological and physical sciences, has the potential to propel drug discovery and development. Open Science fosters this collaboration by releasing datasets and methods into the public space; however, further education and widespread acceptance and adoption of Open Science approaches are necessary to tackle the plethora of known disease states. Motivation In addition to providing much needed insights into potential therapeutic protein targets, we also aim to demonstrate that small patient datasets have the potential to provide insights that usually require many samples (>5,000). There are many such datasets available and novel advancements in ML can provide valuable insights from these patient datasets. Problem statement Using a public dataset made available by patient advocacy group AnswerALS and a multidisciplinary Open Science approach with a systems biology augmented ML technology, we aim to validate previously reported drug targets in ALS and provide novel insights about ALS subpopulations and potential drug targets using a unique combination of ML methods and graph theory. Methodology We use NetraAI to generate hypotheses about specific patient subpopulations, which were then refined and validated through a combination of ML techniques, systems biology methods, and expert input. Results We extracted 8 target classes, each comprising of several genes that shed light into ALS pathophysiology and represent new avenues for treatment. These target classes are broadly categorized as inflammation, epigenetic, heat shock, neuromuscular junction, autophagy, apoptosis, axonal transport, and excitotoxicity. These findings are not mutually exclusive, and instead represent a systematic view of ALS pathophysiology. Based on these findings, we suggest that simultaneous targeting of ALS has the potential to mitigate ALS progression, with the plausibility of maintaining and sustaining an improved quality of life (QoL) for ALS patients. Even further, we identified subpopulations based on disease onset. Conclusion In the spirit of Open Science, this work aims to bridge the knowledge gap in ALS pathophysiology to aid in diagnostic, prognostic, and therapeutic strategies and pave the way for the development of personalized treatments tailored to the individual's needs.
Collapse
Affiliation(s)
- Joseph Geraci
- NetraMark Corp, Toronto, ON, Canada
- Department of Pathology and Molecular Medicine, Queen's University, Kingston, ON, Canada
- Centre for Biotechnology and Genomic Medicine, Medical College of Georgia, Augusta University, Augusta, GA, United States
- Arthur C. Clarke Center for Human Imagination, School of Physical Sciences, University of California San Diego, San Diego, CA, United States
| | - Ravi Bhargava
- Department of Biomedical and Molecular Science, Queens University, Kingston, ON, Canada
- Science and Research, Roche Integrated Informatics, F. Hoffmann La-Roche, Toronto, ON, Canada
| | | | | | - Douglas Cook
- NetraMark Corp, Toronto, ON, Canada
- Department of Surgery, Queen's University, Kingston, ON, Canada
| | - Moses Cook
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
| | - Fanny Sie
- Science and Research, Roche Integrated Informatics, F. Hoffmann La-Roche, Toronto, ON, Canada
| | - Luca Pani
- NetraMark Corp, Toronto, ON, Canada
- Department of Psychiatry and Behavioral Sciences, Leonard M. Miller School of Medicine, University of Miami, Coral Gables, FL, United States
- Department of Biomedical, Metabolic, and Neural Sciences, University of Modena and Reggio Emilia, Modena, Italy
| |
Collapse
|
3
|
Alamro H, Thafar MA, Albaradei S, Gojobori T, Essack M, Gao X. Exploiting machine learning models to identify novel Alzheimer’s disease biomarkers and potential targets. Sci Rep 2023; 13:4979. [PMID: 36973386 PMCID: PMC10043000 DOI: 10.1038/s41598-023-30904-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Accepted: 03/03/2023] [Indexed: 03/29/2023] Open
Abstract
AbstractWe still do not have an effective treatment for Alzheimer's disease (AD) despite it being the most common cause of dementia and impaired cognitive function. Thus, research endeavors are directed toward identifying AD biomarkers and targets. In this regard, we designed a computational method that exploits multiple hub gene ranking methods and feature selection methods with machine learning and deep learning to identify biomarkers and targets. First, we used three AD gene expression datasets to identify 1/ hub genes based on six ranking algorithms (Degree, Maximum Neighborhood Component (MNC), Maximal Clique Centrality (MCC), Betweenness Centrality (BC), Closeness Centrality, and Stress Centrality), 2/ gene subsets based on two feature selection methods (LASSO and Ridge). Then, we developed machine learning and deep learning models to determine the gene subset that best distinguishes AD samples from the healthy controls. This work shows that feature selection methods achieve better prediction performances than the hub gene sets. Beyond this, the five genes identified by both feature selection methods (LASSO and Ridge algorithms) achieved an AUC = 0.979. We further show that 70% of the upregulated hub genes (among the 28 overlapping hub genes) are AD targets based on a literature review and six miRNA (hsa-mir-16-5p, hsa-mir-34a-5p, hsa-mir-1-3p, hsa-mir-26a-5p, hsa-mir-93-5p, hsa-mir-155-5p) and one transcription factor, JUN, are associated with the upregulated hub genes. Furthermore, since 2020, four of the six microRNA were also shown to be potential AD targets. To our knowledge, this is the first work showing that such a small number of genes can distinguish AD samples from healthy controls with high accuracy and that overlapping upregulated hub genes can narrow the search space for potential novel targets.
Collapse
|
4
|
Tang AS, Oskotsky T, Havaldar S, Mantyh WG, Bicak M, Solsberg CW, Woldemariam S, Zeng B, Hu Z, Oskotsky B, Dubal D, Allen IE, Glicksberg BS, Sirota M. Deep phenotyping of Alzheimer's disease leveraging electronic medical records identifies sex-specific clinical associations. Nat Commun 2022; 13:675. [PMID: 35115528 PMCID: PMC8814236 DOI: 10.1038/s41467-022-28273-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2021] [Accepted: 01/18/2022] [Indexed: 12/14/2022] Open
Abstract
Alzheimer's Disease (AD) is a neurodegenerative disorder that is still not fully understood. Sex modifies AD vulnerability, but the reasons for this are largely unknown. We utilize two independent electronic medical record (EMR) systems across 44,288 patients to perform deep clinical phenotyping and network analysis to gain insight into clinical characteristics and sex-specific clinical associations in AD. Embeddings and network representation of patient diagnoses demonstrate greater comorbidity interactions in AD in comparison to matched controls. Enrichment analysis identifies multiple known and new diagnostic, medication, and lab result associations across the whole cohort and in a sex-stratified analysis. With this data-driven method of phenotyping, we can represent AD complexity and generate hypotheses of clinical factors that can be followed-up for further diagnostic and predictive analyses, mechanistic understanding, or drug repurposing and therapeutic approaches.
Collapse
Affiliation(s)
- Alice S Tang
- Bakar Computational Health Sciences Institute, UCSF, San Francisco, CA, USA.
- Graduate Program in Bioengineering, UCSF, San Francisco, CA, USA.
- School of Medicine, UCSF, San Francisco, CA, USA.
| | - Tomiko Oskotsky
- Bakar Computational Health Sciences Institute, UCSF, San Francisco, CA, USA
- Department of Pediatrics, UCSF, San Francisco, CA, USA
| | - Shreyas Havaldar
- Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - William G Mantyh
- Department of Neurology, University of Minnesota School of Medicine, Minneapolis, MN, USA
| | - Mesude Bicak
- Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Caroline Warly Solsberg
- Pharmaceutical Sciences and Pharmacogenomics, UCSF, San Francisco, CA, USA
- Department of Neurology and Weill Institute for Neurosciences, University of California, San Francisco, San Francisco, CA, 94158, USA
- Memory and Aging Center, UCSF, San Francisco, CA, USA
| | - Sarah Woldemariam
- Bakar Computational Health Sciences Institute, UCSF, San Francisco, CA, USA
| | - Billy Zeng
- School of Medicine, UCSF, San Francisco, CA, USA
| | - Zicheng Hu
- Bakar Computational Health Sciences Institute, UCSF, San Francisco, CA, USA
| | - Boris Oskotsky
- Bakar Computational Health Sciences Institute, UCSF, San Francisco, CA, USA
| | - Dena Dubal
- Department of Neurology and Weill Institute for Neurosciences, University of California, San Francisco, San Francisco, CA, 94158, USA
| | - Isabel E Allen
- Department of Epidemiology and Biostatistics, UCSF, San Francisco, CA, USA
| | - Benjamin S Glicksberg
- Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Marina Sirota
- Bakar Computational Health Sciences Institute, UCSF, San Francisco, CA, USA.
- Department of Pediatrics, UCSF, San Francisco, CA, USA.
| |
Collapse
|
5
|
Abstract
Machine learning has become a standard tool for medical researchers attempting to model disease in various ways, including building models to predict response to medications, classifying disease subtypes, and discovering new therapies. In this preview, we review a paper that utilizes quantum computation in order to tackle a critical issue that exists with medical datasets: they are small, in that they contain few samples. The authors' work demonstrates the possibility that these quantum-based methods may provide an advantage for small datasets and thus have a real impact for medical researchers in the future.
Collapse
Affiliation(s)
- Joseph Geraci
- Department of Molecular Medicine, Queen’s University, Kingston, ON, Canada
- Centre for Biotechnology and Genomics Medicine, Medical College of Georgia, Augusta, GA, USA
- NetraMark Corp, Toronto, ON, Canada
| |
Collapse
|