1
|
Sebastian S, Roy S, Kalita J. Network-based analysis of Alzheimer's Disease genes using multi-omics network integration with graph diffusion. J Biomed Inform 2025:104797. [PMID: 39993589 DOI: 10.1016/j.jbi.2025.104797] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2024] [Revised: 01/16/2025] [Accepted: 02/10/2025] [Indexed: 02/26/2025]
Abstract
Alzheimer's Disease (AD) is a complex neurodegenerative disorder affecting millions worldwide. Despite extensive research, the mechanisms behind AD remain elusive. Many studies suggest that disease-responsible genes often act as hub genes in biological networks. However, this assumption requires further investigation in the context of AD. To examine the network characteristics of known AD genes, it is crucial to construct a highly confident network, which is challenging to achieve using a single data source. This work integrates multi-omics networks inferred from microarray, single-cell RNA sequencing, and single-nuclei RNA sequencing expression data, weighted with protein interaction and gene ontology information. We generate a high-quality integrated network by utilizing various inference methods and combining them through a graph diffusion-based integration approach. This network is then analyzed to investigate the properties of known AD-specific genes. Our findings reveal that AD genes are not always high-degree or central hub nodes in the network. Instead, these genes are distributed across different quartiles of degree centrality while maintaining significant interconnections for effective regulation. Furthermore, our study highlights that peripheral genes, often overlooked, also play crucial roles by connecting to relevant disease nodes and hub genes. These findings challenge the conventional understanding that AD-responsible genes are primarily the hub genes in the network, offering new insights into the complex regulatory mechanisms of AD and suggesting novel directions for future research.
Collapse
Affiliation(s)
- Softya Sebastian
- Network Reconstruction and Analysis (NetRA) Lab, Department of Computer Applications, Sikkim (Central) University, India; Department of Computer Science and Engineering, Vellore Institute of Technology, Chennai, India
| | - Swarup Roy
- Network Reconstruction and Analysis (NetRA) Lab, Department of Computer Applications, Sikkim (Central) University, India.
| | - Jugal Kalita
- Department of Computer Science, University of Colorado at Colorado Springs, USA
| |
Collapse
|
2
|
Bradshaw MS, Gibbs C, Martin S, Firman T, Gaskell A, Fosdick B, Layer R. Hypothesis generation for rare and undiagnosed diseases through clustering and classifying time-versioned biological ontologies. PLoS One 2024; 19:e0309205. [PMID: 39724242 DOI: 10.1371/journal.pone.0309205] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Accepted: 08/06/2024] [Indexed: 12/28/2024] Open
Abstract
Rare diseases affect 1-in-10 people in the United States and despite increased genetic testing, up to half never receive a diagnosis. Even when using advanced genome sequencing platforms to discover variants, if there is no connection between the variants found in the patient's genome and their phenotypes in the literature, then the patient will remain undiagnosed. When a direct variant-phenotype connection is not known, putting a patient's information in the larger context of phenotype relationships and protein-protein interactions may provide an opportunity to find an indirect explanation. Databases such as STRING contain millions of protein-protein interactions, and the Human Phenotype Ontology (HPO) contains the relations of thousands of phenotypes. By integrating these networks and clustering the entities within, we can potentially discover latent gene-to-phenotype connections. The historical records for STRING and HPO provide a unique opportunity to create a network time series for evaluating the cluster significance. Most excitingly, working with Children's Hospital Colorado, we have provided promising hypotheses about latent gene-to-phenotype connections for 38 patients. We also provide potential answers for 14 patients listed on MyGene2. Clusters our tool finds significant harbor 2.35 to 8.72 times as many gene-to-phenotype edges inferred from known drug interactions than clusters found to be insignificant. Our tool, BOCC, is available as a web app and command line tool.
Collapse
Affiliation(s)
- Michael S Bradshaw
- Department of Computer Science, University of Colorado Boulder, Boulder, CO, United States of America
| | - Connor Gibbs
- Department of Statistics, Colorado State University, Fort Collins, CO, United States of America
| | - Skylar Martin
- Department of Computer Science, University of Colorado Boulder, Boulder, CO, United States of America
| | - Taylor Firman
- Precision Medicine Institute, Children's Hospital Colorado, Aurora, CO, United States of America
| | - Alisa Gaskell
- Precision Medicine Institute, Children's Hospital Colorado, Aurora, CO, United States of America
| | - Bailey Fosdick
- Department of Biostatistics & Informatics, Colorado School of Public Health, Aurora, CO, United States of America
| | - Ryan Layer
- Department of Computer Science, University of Colorado Boulder, Boulder, CO, United States of America
| |
Collapse
|
3
|
Sebastian S, Roy S, Kalita J. A generic parallel framework for inferring large-scale gene regulatory networks from expression profiles: application to Alzheimer's disease network. Brief Bioinform 2023; 24:6868522. [PMID: 36534961 DOI: 10.1093/bib/bbac482] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2022] [Revised: 09/14/2022] [Accepted: 10/11/2022] [Indexed: 12/23/2022] Open
Abstract
The inference of large-scale gene regulatory networks is essential for understanding comprehensive interactions among genes. Most existing methods are limited to reconstructing networks with a few hundred nodes. Therefore, parallel computing paradigms must be leveraged to construct large networks. We propose a generic parallel framework that enables any existing method, without re-engineering, to infer large networks in parallel, guaranteeing quality output. The framework is tested on 15 inference methods (not limited to) employing in silico benchmarks and real-world large expression matrices, followed by qualitative and speedup assessment. The framework does not compromise the quality of the base serial inference method. We rank the candidate methods and use the top-performing method to infer an Alzheimer's Disease (AD) affected network from large expression profiles of a triple transgenic mouse model consisting of 45,101 genes. The resultant network is further explored to obtain hub genes that emerge functionally related to the disease. We partition the network into 41 modules and conduct pathway enrichment analysis, revealing that a good number of participating genes are collectively responsible for several brain disorders, including AD. Finally, we extract the interactions of a few known AD genes and observe that they are periphery genes connected to the network's hub genes. Availability: The R implementation of the framework is downloadable from https://github.com/Netralab/GenericParallelFramework.
Collapse
Affiliation(s)
- Softya Sebastian
- Network Reconstruction and Analysis (NetRA) Lab, Department of Computer Applications, Sikkim University, 6th Mile, Gangtok, 737102, Sikkim, India
| | - Swarup Roy
- Network Reconstruction and Analysis (NetRA) Lab, Department of Computer Applications, Sikkim University, 6th Mile, Gangtok, 737102, Sikkim, India
| | - Jugal Kalita
- Department of Computer Science, University of Colorado at Colorado Springs, CO, 80918 USA
| |
Collapse
|
4
|
Eskandarzade N, Ghorbani A, Samarfard S, Diaz J, Guzzi PH, Fariborzi N, Tahmasebi A, Izadpanah K. Network for network concept offers new insights into host- SARS-CoV-2 protein interactions and potential novel targets for developing antiviral drugs. Comput Biol Med 2022; 146:105575. [PMID: 35533462 PMCID: PMC9055686 DOI: 10.1016/j.compbiomed.2022.105575] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2021] [Revised: 04/16/2022] [Accepted: 04/27/2022] [Indexed: 01/08/2023]
Abstract
SARS-CoV-2, the causal agent of COVID-19, is primarily a pulmonary virus that can directly or indirectly infect several organs. Despite many studies carried out during the current COVID-19 pandemic, some pathological features of SARS-CoV-2 have remained unclear. It has been recently attempted to address the current knowledge gaps on the viral pathogenicity and pathological mechanisms via cellular-level tropism of SARS-CoV-2 using human proteomics, visualization of virus-host protein-protein interactions (PPIs), and enrichment analysis of experimental results. The synergistic use of models and methods that rely on graph theory has enabled the visualization and analysis of the molecular context of virus/host PPIs. We review current knowledge on the SARS-COV-2/host interactome cascade involved in the viral pathogenicity through the graph theory concept and highlight the hub proteins in the intra-viral network that create a subnet with a small number of host central proteins, leading to cell disintegration and infectivity. Then we discuss the putative principle of the "gene-for-gene and "network for network" concepts as platforms for future directions toward designing efficient anti-viral therapies.
Collapse
Affiliation(s)
- Neda Eskandarzade
- Department of Basic Sciences, School of Veterinary Medicine, Shahid Bahonar University of Kerman, Kerman, Iran
| | - Abozar Ghorbani
- Nuclear Agriculture Research School, Nuclear Science and Technology Research Institute (NSTRI), Karaj, Iran,Corresponding author
| | - Samira Samarfard
- Berrimah Veterinary Laboratory, Department of Primary Industry and Resources, Berrimah, NT, 0828, Australia
| | - Jose Diaz
- Laboratorio de Dinámica de Redes Genéticas, Centro de Investigación en Dinámica Celular, Universidad Autónoma del Estado de Morelos, Cuernavaca, Mexico
| | - Pietro H. Guzzi
- Department of Medical and Surgical Sciences, Laboratory of Bioinformatics Unit, Italy
| | - Niloofar Fariborzi
- Department of Medical Entomology and Vector Control, School of Health, Shiraz University of Medical Sciences, Shiraz, Iran
| | - Ahmad Tahmasebi
- Institute of Biotechnology, College of Agriculture, Shiraz University, Shiraz, Iran
| | | |
Collapse
|
5
|
Demenkov PS, Oshchepkova ЕА, Demenkov PS, Ivanisenko TV, Ivanisenko VA. Prioritization of biological processes based on the reconstruction and analysis of associative gene networks describing the response of plants to adverse environmental factors. Vavilovskii Zhurnal Genet Selektsii 2021; 25:580-592. [PMID: 34723066 PMCID: PMC8543060 DOI: 10.18699/vj21.065] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2020] [Revised: 06/21/2021] [Accepted: 06/21/2021] [Indexed: 11/23/2022] Open
Abstract
Methods for prioritizing or ranking candidate genes according to their importance based on specif ic criteria
via the analysis of gene networks are widely used in biomedicine to search for genes associated with diseases and to
predict biomarkers, pharmacological targets and other clinically relevant molecules. These methods have also been
used in other f ields, particularly in crop production. This is largely due to the development of technologies to solve
problems in marker-oriented and genomic selection, which requires knowledge of the molecular genetic mechanisms
underlying the formation of agriculturally valuable traits. A new direction for the study of molecular genetic mechanisms
is the prioritization of biological processes based on the analysis of associative gene networks. Associative gene
networks are heterogeneous networks whose vertices can depict both molecular genetic objects (genes, proteins, metabolites,
etc.) and the higher-level factors (biological processes, diseases, external environmental factors, etc.) related
to regulatory, physicochemical or associative interactions. Using a previously developed method, biological processes
involved in plant responses to increased cadmium content, saline stress and drought conditions were prioritized according
to their degree of connection with the gene networks in the SOLANUM TUBEROSUM knowledge base. The
prioritization results indicate that fundamental processes, such as gene expression, post-translational modif ications,
protein degradation, programmed cell death, photosynthesis, signal transmission and stress response play important
roles in the common molecular genetic mechanisms for plant response to various adverse factors. On the other hand, a
group of processes related to the development of seeds (“seeding development”) was revealed to be drought specif ic,
while processes associated with ion transport (“ion transport”) were included in the list of responses specif ic to salt
stress and processes associated with the metabolism of lipids were found to be involved specif ically in the response to
cadmium.
Collapse
Affiliation(s)
- P S Demenkov
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia Novosibirsk State University, Novosibirsk, Russia
| | - Е А Oshchepkova
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - P S Demenkov
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia Novosibirsk State University, Novosibirsk, Russia
| | - T V Ivanisenko
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - V A Ivanisenko
- Novosibirsk State University, Novosibirsk, Russiavosibirsk, Russia Kurchatov Genomic Center of ICG SB RAS, Novosibirsk, Russia
| |
Collapse
|
6
|
Das JK, Roy S, Guzzi PH. Analyzing host-viral interactome of SARS-CoV-2 for identifying vulnerable host proteins during COVID-19 pathogenesis. INFECTION, GENETICS AND EVOLUTION : JOURNAL OF MOLECULAR EPIDEMIOLOGY AND EVOLUTIONARY GENETICS IN INFECTIOUS DISEASES 2021; 93:104921. [PMID: 34004362 PMCID: PMC8123524 DOI: 10.1016/j.meegid.2021.104921] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/07/2021] [Revised: 05/04/2021] [Accepted: 05/07/2021] [Indexed: 02/07/2023]
Abstract
The development of therapeutic targets for COVID-19 relies on understanding the molecular mechanism of pathogenesis. Identifying genes or proteins involved in the infection mechanism is the key to shedding light on the complex molecular mechanisms. The combined effort of many laboratories distributed throughout the world has produced protein and genetic interactions. We integrated available results and obtained a host protein-protein interaction network composed of 1432 human proteins. Next, we performed network centrality analysis to identify critical proteins in the derived network. Finally, we performed a functional enrichment analysis of central proteins. We observed that the identified proteins are primarily associated with several crucial pathways, including cellular process, signaling transduction, neurodegenerative diseases. We focused on the proteins that are involved in human respiratory tract diseases. We highlighted many potential therapeutic targets, including RBX1, HSPA5, ITCH, RAB7A, RAB5A, RAB8A, PSMC5, CAPZB, CANX, IGF2R, and HSPA1A, which are central and also associated with multiple diseases.
Collapse
Affiliation(s)
- Jayanta Kumar Das
- Department of Pediatrics, School of Medicine, Johns Hopkins University, MD, USA
| | - Swarup Roy
- Network Reconstruction & Analysis (NetRA) Lab, Department of Computer Applications, Sikkim University, Gangtok, India,Corresponding authors
| | - Pietro Hiram Guzzi
- Department of Medical and Surgical Sciences, Magna Graecia University, Catanzaro, Italy,Corresponding authors
| |
Collapse
|