1
|
Holfeld A, Schuster D, Sesterhenn F, Gillingham AK, Stalder P, Haenseler W, Barrio-Hernandez I, Ghosh D, Vowles J, Cowley SA, Nagel L, Khanppnavar B, Serdiuk T, Beltrao P, Korkhov VM, Munro S, Riek R, de Souza N, Picotti P. Systematic identification of structure-specific protein-protein interactions. Mol Syst Biol 2024; 20:651-675. [PMID: 38702390 PMCID: PMC11148107 DOI: 10.1038/s44320-024-00037-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Revised: 04/12/2024] [Accepted: 04/15/2024] [Indexed: 05/06/2024] Open
Abstract
The physical interactome of a protein can be altered upon perturbation, modulating cell physiology and contributing to disease. Identifying interactome differences of normal and disease states of proteins could help understand disease mechanisms, but current methods do not pinpoint structure-specific PPIs and interaction interfaces proteome-wide. We used limited proteolysis-mass spectrometry (LiP-MS) to screen for structure-specific PPIs by probing for protease susceptibility changes of proteins in cellular extracts upon treatment with specific structural states of a protein. We first demonstrated that LiP-MS detects well-characterized PPIs, including antibody-target protein interactions and interactions with membrane proteins, and that it pinpoints interfaces, including epitopes. We then applied the approach to study conformation-specific interactors of the Parkinson's disease hallmark protein alpha-synuclein (aSyn). We identified known interactors of aSyn monomer and amyloid fibrils and provide a resource of novel putative conformation-specific aSyn interactors for validation in further studies. We also used our approach on GDP- and GTP-bound forms of two Rab GTPases, showing detection of differential candidate interactors of conformationally similar proteins. This approach is applicable to screen for structure-specific interactomes of any protein, including posttranslationally modified and unmodified, or metabolite-bound and unbound protein states.
Collapse
Affiliation(s)
- Aleš Holfeld
- Institute of Molecular Systems Biology, Department of Biology, ETH Zurich, Zurich, Switzerland
| | - Dina Schuster
- Institute of Molecular Systems Biology, Department of Biology, ETH Zurich, Zurich, Switzerland
- Institute of Molecular Biology and Biophysics, Department of Biology, ETH Zurich, Zurich, Switzerland
- Laboratory of Biomolecular Research, Division of Biology and Chemistry, Paul Scherrer Institute, Villigen, Switzerland
| | - Fabian Sesterhenn
- Institute of Molecular Systems Biology, Department of Biology, ETH Zurich, Zurich, Switzerland
| | | | - Patrick Stalder
- Institute of Molecular Systems Biology, Department of Biology, ETH Zurich, Zurich, Switzerland
| | - Walther Haenseler
- Institute of Molecular Systems Biology, Department of Biology, ETH Zurich, Zurich, Switzerland
- University Research Priority Program AdaBD (Adaptive Brain Circuits in Development and Learning), University of Zurich, Zurich, Switzerland
| | - Inigo Barrio-Hernandez
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
- Open Targets, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Dhiman Ghosh
- Laboratory of Physical Chemistry, ETH Zurich, Zurich, Switzerland
| | - Jane Vowles
- James and Lillian Martin Centre for Stem Cell Research, Sir William Dunn School of Pathology, University of Oxford, Oxford, UK
| | - Sally A Cowley
- James and Lillian Martin Centre for Stem Cell Research, Sir William Dunn School of Pathology, University of Oxford, Oxford, UK
| | - Luise Nagel
- Cluster of Excellence Cellular Stress Responses in Aging-associated Diseases (CECAD), University of Cologne, Cologne, Germany
| | - Basavraj Khanppnavar
- Institute of Molecular Biology and Biophysics, Department of Biology, ETH Zurich, Zurich, Switzerland
- Laboratory of Biomolecular Research, Division of Biology and Chemistry, Paul Scherrer Institute, Villigen, Switzerland
| | - Tetiana Serdiuk
- Institute of Molecular Systems Biology, Department of Biology, ETH Zurich, Zurich, Switzerland
| | - Pedro Beltrao
- Institute of Molecular Systems Biology, Department of Biology, ETH Zurich, Zurich, Switzerland
| | - Volodymyr M Korkhov
- Institute of Molecular Biology and Biophysics, Department of Biology, ETH Zurich, Zurich, Switzerland
- Laboratory of Biomolecular Research, Division of Biology and Chemistry, Paul Scherrer Institute, Villigen, Switzerland
| | - Sean Munro
- MRC Laboratory of Molecular Biology, Cambridge, UK
| | - Roland Riek
- Laboratory of Physical Chemistry, ETH Zurich, Zurich, Switzerland
| | - Natalie de Souza
- Institute of Molecular Systems Biology, Department of Biology, ETH Zurich, Zurich, Switzerland
- Department of Quantitative Biomedicine, University of Zurich, Zurich, Switzerland
| | - Paola Picotti
- Institute of Molecular Systems Biology, Department of Biology, ETH Zurich, Zurich, Switzerland.
| |
Collapse
|
2
|
Cui H, Srinivasan S, Gao Z, Korkin D. The Extent of Edgetic Perturbations in the Human Interactome Caused by Population-Specific Mutations. Biomolecules 2023; 14:40. [PMID: 38254640 PMCID: PMC11154503 DOI: 10.3390/biom14010040] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Revised: 11/30/2023] [Accepted: 12/03/2023] [Indexed: 01/24/2024] Open
Abstract
Until recently, efforts in population genetics have been focused primarily on people of European ancestry. To attenuate this bias, global population studies, such as the 1000 Genomes Project, have revealed differences in genetic variation across ethnic groups. How many of these differences can be attributed to population-specific traits? To answer this question, the mutation data must be linked with functional outcomes. A new "edgotype" concept has been proposed, which emphasizes the interaction-specific, "edgetic", perturbations caused by mutations in the interacting proteins. In this work, we performed systematic in silico edgetic profiling of ~50,000 non-synonymous SNVs (nsSNVs) from the 1000 Genomes Project by leveraging our semi-supervised learning approach SNP-IN tool on a comprehensive set of over 10,000 protein interaction complexes. We interrogated the functional roles of the variants and their impact on the human interactome and compared the results with the pathogenic variants disrupting PPIs in the same interactome. Our results demonstrated that a considerable number of nsSNVs from healthy populations could rewire the interactome. We also showed that the proteins enriched with interaction-disrupting mutations were associated with diverse functions and had implications in a broad spectrum of diseases. Further analysis indicated that distinct gene edgetic profiles among major populations could shed light on the molecular mechanisms behind the population phenotypic variances. Finally, the network analysis revealed that the disease-associated modules surprisingly harbored a higher density of interaction-disrupting mutations from healthy populations. The variation in the cumulative network damage within these modules could potentially account for the observed disparities in disease susceptibility, which are distinctly specific to certain populations. Our work demonstrates the feasibility of a large-scale in silico edgetic study, and reveals insights into the orchestrated play of population-specific mutations in the human interactome.
Collapse
Affiliation(s)
- Hongzhu Cui
- Bioinformatics and Computational Biology Program, Worcester Polytechnic Institute, Worcester, MA 01609, USA;
- Chromatography and Mass Spectrometry Division, Thermo Fisher Scientific, San Jose, CA 95134, USA
| | - Suhas Srinivasan
- Data Science Program, Worcester Polytechnic Institute, Worcester, MA 01609, USA;
- Program in Epithelial Biology, Stanford School of Medicine, Stanford, CA 94305, USA
- Center for Personal Dynamic Regulomes, Stanford School of Medicine, Stanford, CA 94305, USA
| | - Ziyang Gao
- Bioinformatics and Computational Biology Program, Worcester Polytechnic Institute, Worcester, MA 01609, USA;
| | - Dmitry Korkin
- Bioinformatics and Computational Biology Program, Worcester Polytechnic Institute, Worcester, MA 01609, USA;
- Data Science Program, Worcester Polytechnic Institute, Worcester, MA 01609, USA;
- Computer Science Department, Worcester Polytechnic Institute, Worcester, MA 01609, USA
| |
Collapse
|
3
|
Feizi A, Ray K. otargen: GraphQL-based R package for tidy data accessing and processing from Open Targets Genetics. Bioinformatics 2023; 39:btad441. [PMID: 37467069 PMCID: PMC10394122 DOI: 10.1093/bioinformatics/btad441] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Revised: 06/26/2023] [Accepted: 07/18/2023] [Indexed: 07/21/2023] Open
Abstract
MOTIVATION Open Target Genetics is a comprehensive resource portal that offers variant-centric statistical evidence, enabling the prioritization of causal variants and the identification of potential drug targets. The portal uses GraphQL technology for efficient data query and provides endpoints for programmatic access for R and Python users. However, leveraging GraphQL for data retrieval can be challenging, time-consuming, and repetitive, requiring familiarity with the GraphQL query language and processing outputs in nested JSON (JavaScript Object Notation) format into tidy data tables. Therefore, developing open-source tools are required to simplify data retrieval processes to integrate valuable genetic information into data-driven target discovery pipelines seamlessly. RESULTS otargen is an open-source R package designed to make data retrieval and analysis from the Open Target Genetics portal as simple as possible for R users. The package offers a suite of functions covering all query types, allowing streamlined data access in a tidy table format. By executing only a single line of code, the otargen users avoid the repetitive scripting of complex GraphQL queries, including the post-processing steps. In addition, otargen contains convenient plotting functions to visualize and gain insights from complex data tables returned by several key functions. AVAILABILITY AND IMPLEMENTATION otargen is available at https://amirfeizi.github.io/otargen/.
Collapse
Affiliation(s)
- Amir Feizi
- Bioinformatics Department, OMass Therapeutics, Oxford Business Park, ARC, Oxford OX4 2GX, United Kingdom
| | - Kamalika Ray
- Bioinformatics Department, OMass Therapeutics, Oxford Business Park, ARC, Oxford OX4 2GX, United Kingdom
| |
Collapse
|
4
|
Carlin DE, Larsen SJ, Sirupurapu V, Cho MH, Silverman EK, Baumbach J, Ideker T. Hierarchical association of COPD to principal genetic components of biological systems. PLoS One 2023; 18:e0286064. [PMID: 37228113 DOI: 10.1371/journal.pone.0286064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2021] [Accepted: 05/08/2023] [Indexed: 05/27/2023] Open
Abstract
Many disease-causing genetic variants converge on common biological functions and pathways. Precisely how to incorporate pathway knowledge in genetic association studies is not yet clear, however. Previous approaches employ a two-step approach, in which a regular association test is first performed to identify variants associated with the disease phenotype, followed by a test for functional enrichment within the genes implicated by those variants. Here we introduce a concise one-step approach, Hierarchical Genetic Analysis (Higana), which directly computes phenotype associations against each function in the large hierarchy of biological functions documented by the Gene Ontology. Using this approach, we identify risk genes and functions for Chronic Obstructive Pulmonary Disease (COPD), highlighting microtubule transport, muscle adaptation, and nicotine receptor signaling pathways. Microtubule transport has not been previously linked to COPD, as it integrates genetic variants spread over numerous genes. All associations validate strongly in a second COPD cohort.
Collapse
Affiliation(s)
- Daniel E Carlin
- Department of Medicine, Division of Genetics, University of California San Diego, La Jolla, CA, United States of America
| | | | - Vikram Sirupurapu
- Department of Medicine, Division of Genetics, University of California San Diego, La Jolla, CA, United States of America
| | - Michael H Cho
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA, United States of America
| | - Edwin K Silverman
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA, United States of America
| | - Jan Baumbach
- Department of Computational Systems Biology, University of Hamburg, Hamburg, Germany
| | - Trey Ideker
- Department of Medicine, Division of Genetics, University of California San Diego, La Jolla, CA, United States of America
| |
Collapse
|
5
|
Barrio-Hernandez I, Schwartzentruber J, Shrivastava A, Del-Toro N, Gonzalez A, Zhang Q, Mountjoy E, Suveges D, Ochoa D, Ghoussaini M, Bradley G, Hermjakob H, Orchard S, Dunham I, Anderson CA, Porras P, Beltrao P. Network expansion of genetic associations defines a pleiotropy map of human cell biology. Nat Genet 2023; 55:389-398. [PMID: 36823319 PMCID: PMC10011132 DOI: 10.1038/s41588-023-01327-9] [Citation(s) in RCA: 24] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2021] [Accepted: 01/30/2023] [Indexed: 02/25/2023]
Abstract
Interacting proteins tend to have similar functions, influencing the same organismal traits. Interaction networks can be used to expand the list of candidate trait-associated genes from genome-wide association studies. Here, we performed network-based expansion of trait-associated genes for 1,002 human traits showing that this recovers known disease genes or drug targets. The similarity of network expansion scores identifies groups of traits likely to share an underlying genetic and biological process. We identified 73 pleiotropic gene modules linked to multiple traits, enriched in genes involved in processes such as protein ubiquitination and RNA processing. In contrast to gene deletion studies, pleiotropy as defined here captures specifically multicellular-related processes. We show examples of modules linked to human diseases enriched in genes with known pathogenic variants that can be used to map targets of approved drugs for repurposing. Finally, we illustrate the use of network expansion scores to study genes at inflammatory bowel disease genome-wide association study loci, and implicate inflammatory bowel disease-relevant genes with strong functional and genetic support.
Collapse
Affiliation(s)
- Inigo Barrio-Hernandez
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
- Open Targets, Cambridge, UK
| | - Jeremy Schwartzentruber
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
- Open Targets, Cambridge, UK
- Wellcome Sanger Institute, Cambridge, UK
| | - Anjali Shrivastava
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
- Open Targets, Cambridge, UK
| | - Noemi Del-Toro
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
- Open Targets, Cambridge, UK
| | - Asier Gonzalez
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
- Open Targets, Cambridge, UK
| | - Qian Zhang
- Wellcome Sanger Institute, Cambridge, UK
| | - Edward Mountjoy
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
- Open Targets, Cambridge, UK
| | - Daniel Suveges
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
- Open Targets, Cambridge, UK
| | - David Ochoa
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
- Open Targets, Cambridge, UK
| | - Maya Ghoussaini
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
- Open Targets, Cambridge, UK
| | - Glyn Bradley
- Computational Biology, Genomic Sciences, GSK, Stevenage, UK
| | - Henning Hermjakob
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
- Open Targets, Cambridge, UK
| | - Sandra Orchard
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
- Open Targets, Cambridge, UK
| | - Ian Dunham
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
- Open Targets, Cambridge, UK
- Wellcome Sanger Institute, Cambridge, UK
| | - Carl A Anderson
- Open Targets, Cambridge, UK
- Wellcome Sanger Institute, Cambridge, UK
| | - Pablo Porras
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
- Open Targets, Cambridge, UK
| | - Pedro Beltrao
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK.
- Open Targets, Cambridge, UK.
- Institute of Molecular Systems Biology, ETH Zürich, Zürich, Switzerland.
| |
Collapse
|
6
|
Sarafidou T, Galliopoulou E, Apostolopoulou D, Fragkiadakis GA, Moschonas NK. Reconstruction of a Comprehensive Interactome and Experimental Data Analysis of FRA10AC1 May Provide Insights into Its Biological Role in Health and Disease. Genes (Basel) 2023; 14:genes14030568. [PMID: 36980839 PMCID: PMC10048706 DOI: 10.3390/genes14030568] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2022] [Revised: 02/14/2023] [Accepted: 02/21/2023] [Indexed: 03/03/2023] Open
Abstract
FRA10AC1, the causative gene for the manifestation of the FRA10A fragile site, encodes a well-conserved nuclear protein characterized as a non-core spliceosomal component. Pre-mRNA splicing perturbations have been linked with neurodevelopmental diseases. FRA10AC1 variants have been, recently, causally linked with severe neuropathological and growth retardation phenotypes. To further elucidate the participation of FRA10AC1 in spliceosomal multiprotein complexes and its involvement in neurological phenotypes related to splicing, we exploited protein–protein interaction experimental data and explored network information and information deduced from transcriptomics. We confirmed the direct interaction of FRA10AC1with ESS2, a non-core spliceosomal protein, mapped their interacting domains, and documented their tissue co-localization and physical interaction at the level of intracellular protein stoichiometries. Although FRA10AC1 and SF3B2, a major core spliceosomal protein, were shown to interact under in vitro conditions, the endogenous proteins failed to co-immunoprecipitate. A reconstruction of a comprehensive, strictly binary, protein–protein interaction network of FRA10AC1 revealed dense interconnectivity with many disease-associated spliceosomal components and several non-spliceosomal regulatory proteins. The topological neighborhood of FRA10AC1 depicts an interactome associated with multiple severe monogenic and multifactorial neurodevelopmental diseases mainly referring to spliceosomopathies. Our results suggest that FRA10AC1 involvement in pre-mRNA processing might be strengthened by interconnecting splicing with transcription and mRNA export, and they propose the broader role(s) of FRA10AC1 in cell pathophysiology.
Collapse
Affiliation(s)
- Theologia Sarafidou
- Department of Biochemistry and Biotechnology, University of Thessaly, Viopolis, 41500 Larissa, Greece
- Correspondence: (T.S.); (N.K.M.)
| | - Eleni Galliopoulou
- Department of Biochemistry and Biotechnology, University of Thessaly, Viopolis, 41500 Larissa, Greece
| | | | - Georgios A. Fragkiadakis
- Department of Nutrition and Dietetics Sciences, Hellenic Mediterranean University, Tripitos, 72300 Siteia, Greece
| | - Nicholas K. Moschonas
- School of Medicine, University of Patras, 26500 Patras, Greece
- Institute of Chemical Engineering Sciences, Foundation for Research and Technology Hellas (FORTH/ICE-HT), 26504 Patras, Greece
- Correspondence: (T.S.); (N.K.M.)
| |
Collapse
|
7
|
Integrative pathway and network analysis provide insights on flooding-tolerance genes in soybean. Sci Rep 2023; 13:1980. [PMID: 36737640 PMCID: PMC9898312 DOI: 10.1038/s41598-023-28593-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2022] [Accepted: 01/20/2023] [Indexed: 02/05/2023] Open
Abstract
Soybean is highly sensitive to flooding and extreme rainfall. The phenotypic variation of flooding tolerance is a complex quantitative trait controlled by many genes and their interaction with environmental factors. We previously constructed a gene-pool relevant to soybean flooding-tolerant responses from integrated multiple omics and non-omics databases, and selected 144 prioritized flooding tolerance genes (FTgenes). In this study, we proposed a comprehensive framework at the systems level, using competitive (hypergeometric test) and self-contained (sum-statistic, sum-square-statistic) pathway-based approaches to identify biologically enriched pathways through evaluating the joint effects of the FTgenes within annotated pathways. These FTgenes were significantly enriched in 36 pathways in the Gene Ontology database. These pathways were related to plant hormones, defense-related, primary metabolic process, and system development pathways, which plays key roles in soybean flooding-induced responses. We further identified nine key FTgenes from important subnetworks extracted from several gene networks of enriched pathways. The nine key FTgenes were significantly expressed in soybean root under flooding stress in a qRT-PCR analysis. We demonstrated that this systems biology framework is promising to uncover important key genes underlying the molecular mechanisms of flooding-tolerant responses in soybean. This result supplied a good foundation for gene function analysis in further work.
Collapse
|
8
|
Howlett-Prieto Q, Oommen C, Carrithers MD, Wunsch DC, Hier DB. Subtypes of relapsing-remitting multiple sclerosis identified by network analysis. Front Digit Health 2023; 4:1063264. [PMID: 36714613 PMCID: PMC9874946 DOI: 10.3389/fdgth.2022.1063264] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Accepted: 12/22/2022] [Indexed: 01/12/2023] Open
Abstract
We used network analysis to identify subtypes of relapsing-remitting multiple sclerosis subjects based on their cumulative signs and symptoms. The electronic medical records of 113 subjects with relapsing-remitting multiple sclerosis were reviewed, signs and symptoms were mapped to classes in a neuro-ontology, and classes were collapsed into sixteen superclasses by subsumption. After normalization and vectorization of the data, bipartite (subject-feature) and unipartite (subject-subject) network graphs were created using NetworkX and visualized in Gephi. Degree and weighted degree were calculated for each node. Graphs were partitioned into communities using the modularity score. Feature maps visualized differences in features by community. Network analysis of the unipartite graph yielded a higher modularity score (0.49) than the bipartite graph (0.25). The bipartite network was partitioned into five communities which were named fatigue, behavioral, hypertonia/weakness, abnormal gait/sphincter, and sensory, based on feature characteristics. The unipartite network was partitioned into five communities which were named fatigue, pain, cognitive, sensory, and gait/weakness/hypertonia based on features. Although we did not identify pure subtypes (e.g., pure motor, pure sensory, etc.) in this cohort of multiple sclerosis subjects, we demonstrated that network analysis could partition these subjects into different subtype communities. Larger datasets and additional partitioning algorithms are needed to confirm these findings and elucidate their significance. This study contributes to the literature investigating subtypes of multiple sclerosis by combining feature reduction by subsumption with network analysis.
Collapse
Affiliation(s)
- Quentin Howlett-Prieto
- Department of Neurology and Rehabilitation, University of Illinois at Chicago, Chicago, IL, United States
| | - Chelsea Oommen
- Department of Neurology and Rehabilitation, University of Illinois at Chicago, Chicago, IL, United States
| | - Michael D. Carrithers
- Department of Neurology and Rehabilitation, University of Illinois at Chicago, Chicago, IL, United States
| | - Donald C. Wunsch
- Department of Electrical and Computer Engineering, Missouri University of Science and Technology, Rolla, MO, United States
| | - Daniel B. Hier
- Department of Neurology and Rehabilitation, University of Illinois at Chicago, Chicago, IL, United States,Department of Electrical and Computer Engineering, Missouri University of Science and Technology, Rolla, MO, United States,Correspondence: Daniel B. Hier
| |
Collapse
|
9
|
Translational Bioinformatics for Human Reproductive Biology Research: Examples, Opportunities and Challenges for a Future Reproductive Medicine. Int J Mol Sci 2022; 24:ijms24010004. [PMID: 36613446 PMCID: PMC9819745 DOI: 10.3390/ijms24010004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2022] [Revised: 12/16/2022] [Accepted: 12/16/2022] [Indexed: 12/24/2022] Open
Abstract
Since 1978, with the first IVF (in vitro fertilization) baby birth in Manchester (England), more than eight million IVF babies have been born throughout the world, and many new techniques and discoveries have emerged in reproductive medicine. To summarize the modern technology and progress in reproductive medicine, all scientific papers related to reproductive medicine, especially papers related to reproductive translational medicine, were fully searched, manually curated and reviewed. Results indicated whether male reproductive medicine or female reproductive medicine all have made significant progress, and their markers have experienced the progress from karyotype analysis to single-cell omics. However, due to the lack of comprehensive databases, especially databases collecting risk exposures, disease markers and models, prevention drugs and effective treatment methods, the application of the latest precision medicine technologies and methods in reproductive medicine is limited.
Collapse
|
10
|
Barrio-Hernandez I, Beltrao P. Network analysis of genome-wide association studies for drug target prioritisation. Curr Opin Chem Biol 2022; 71:102206. [PMID: 36087372 DOI: 10.1016/j.cbpa.2022.102206] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2021] [Revised: 07/29/2022] [Accepted: 08/05/2022] [Indexed: 01/27/2023]
Abstract
Over the past decades, genome-wide association studies (GWAS) have led to a dramatic expansion of genetic variants implicated with human traits and diseases. These advances are expected to result in new drug targets but the identification of causal genes and the cell biology underlying human diseases from GWAS remains challenging. Here, we review protein interaction network-based methods to analyse GWAS data. These approaches can rank candidate drug targets at GWAS-associated loci or among interactors of disease genes without direct genetic support. These methods identify the cell biology affected in common across diseases, offering opportunities for drug repurposing, as well as be combined with expression data to identify focal tissues and cell types. Going forward, we expect that these methods will further improve from advances in the characterisation of context specific interaction networks and the joint analysis of rare and common genetic signals.
Collapse
Affiliation(s)
- Inigo Barrio-Hernandez
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Cambridge, CB10 1SD, UK; Open Targets, Wellcome Genome Campus, Cambridge, CB10 1SA, UK.
| | - Pedro Beltrao
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Cambridge, CB10 1SD, UK; Open Targets, Wellcome Genome Campus, Cambridge, CB10 1SA, UK; Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, 8093, Switzerland.
| |
Collapse
|
11
|
Dursun C, Kwitek AE, Bozdag S. PhenoGeneRanker: Gene and Phenotype Prioritization Using Multiplex Heterogeneous Networks. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:2950-2962. [PMID: 34283720 PMCID: PMC9704494 DOI: 10.1109/tcbb.2021.3098278] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Uncovering genotype-phenotype relationships is a fundamental challenge in genomics. Gene prioritization is an important step for this endeavor to make a short manageable list from a list of thousands of genes coming from high-throughput studies. Network propagation methods are promising and state of the art methods for gene prioritization based on the premise that functionally related genes tend to be close to each other in the biological networks. Recently, we introduced PhenoGeneRanker, a network-propagation algorithm for multiplex heterogeneous networks. PhenoGeneRanker allows multi-layer gene and phenotype networks. It also calculates empirical p values of gene and phenotype ranks using random stratified sampling of seeds of genes and phenotypes based on their connectivity degree in the network. In this study, we introduce the PhenoGeneRanker Bioconductor package and its application to multi-omics rat genome datasets to rank hypertension disease-related genes and strains. We showed that PhenoGeneRanker performed better to rank hypertension disease-related genes using multiplex gene networks than aggregated gene networks. We also showed that PhenoGeneRanker performed better to rank hypertension disease-related strains using multiplex phenotype network than single or aggregated phenotype networks. We performed a rigorous hyperparameter analysis and, finally showed that Gene Ontology (GO) enrichment of statistically significant top-ranked genes resulted in hypertension disease-related GO terms.
Collapse
|
12
|
Ramkumar MK, Mulani E, Jadon V, Sureshkumar V, Krishnan SG, Senthil Kumar S, Raveendran M, Singh AK, Solanke AU, Singh NK, Sevanthi AM. Identification of major candidate genes for multiple abiotic stress tolerance at seedling stage by network analysis and their validation by expression profiling in rice ( Oryza sativa L.). 3 Biotech 2022; 12:127. [PMID: 35573803 DOI: 10.1007/s13205-022-03182-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2021] [Accepted: 04/03/2022] [Indexed: 11/01/2022] Open
Abstract
A wealth of microarray and RNA-seq data for studying abiotic stress tolerance in rice exists but only limited studies have been carried out on multiple stress-tolerance responses and mechanisms. In this study, we identified 6657 abiotic stress-responsive genes pertaining to drought, salinity and heat stresses from the seedling stage microarray data of 83 samples and used them to perform unweighted network analysis and to identify key hub genes or master regulators for multiple abiotic stress tolerance. Of the total 55 modules identified from the analysis, the top 10 modules with 8-61 nodes comprised 239 genes. From these 10 modules, 10 genes common to all the three stresses were selected. Further, based on the centrality properties and highly dense interactions, we identified 7 intra-modular hub genes leading to a total of 17 potential candidate genes. Out of these 17 genes, 15 were validated by expression analysis using a panel of 4 test genotypes and a pair of standard check genotypes for each abiotic stress response. Interestingly, all the 15 genes showed upregulation under all stresses and in all the genotypes, suggesting that they could be representing some of the core abiotic stress-responsive genes. More pertinently, eight of the genes were found to be co-localized with the stress-tolerance QTL regions. Thus, in conclusion, our study not only provided an effective approach for studying abiotic stress tolerance in rice, but also identified major candidate genes which could be further validated by functional genomics for abiotic stress tolerance. Supplementary Information The online version contains supplementary material available at 10.1007/s13205-022-03182-7.
Collapse
|
13
|
Abstract
Summary Computational methods to predict protein–protein interaction (PPI) typically segregate into sequence-based ‘bottom-up’ methods that infer properties from the characteristics of the individual protein sequences, or global ‘top-down’ methods that infer properties from the pattern of already known PPIs in the species of interest. However, a way to incorporate top-down insights into sequence-based bottom-up PPI prediction methods has been elusive. We thus introduce Topsy-Turvy, a method that newly synthesizes both views in a sequence-based, multi-scale, deep-learning model for PPI prediction. While Topsy-Turvy makes predictions using only sequence data, during the training phase it takes a transfer-learning approach by incorporating patterns from both global and molecular-level views of protein interaction. In a cross-species context, we show it achieves state-of-the-art performance, offering the ability to perform genome-scale, interpretable PPI prediction for non-model organisms with no existing experimental PPI data. In species with available experimental PPI data, we further present a Topsy-Turvy hybrid (TT-Hybrid) model which integrates Topsy-Turvy with a purely network-based model for link prediction that provides information about species-specific network rewiring. TT-Hybrid makes accurate predictions for both well- and sparsely-characterized proteins, outperforming both its constituent components as well as other state-of-the-art PPI prediction methods. Furthermore, running Topsy-Turvy and TT-Hybrid screens is feasible for whole genomes, and thus these methods scale to settings where other methods (e.g. AlphaFold-Multimer) might be infeasible. The generalizability, accuracy and genome-level scalability of Topsy-Turvy and TT-Hybrid unlocks a more comprehensive map of protein interaction and organization in both model and non-model organisms. Availability and implementation https://topsyturvy.csail.mit.edu. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | | | - Samuel Sledzieski
- Computer Science and Artificial Intelligence Lab., Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Bonnie Berger
- To whom correspondence should be addressed. E-mail: or
| | - Lenore Cowen
- To whom correspondence should be addressed. E-mail: or
| |
Collapse
|
14
|
Miyoshi E, Morabito S, Swarup V. Systems biology approaches to unravel the molecular and genetic architecture of Alzheimer's disease and related tauopathies. Neurobiol Dis 2021; 160:105530. [PMID: 34634459 PMCID: PMC8616667 DOI: 10.1016/j.nbd.2021.105530] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2021] [Revised: 08/30/2021] [Accepted: 10/07/2021] [Indexed: 11/19/2022] Open
Abstract
Over the years, genetic studies have identified multiple genetic risk variants associated with neurodegenerative disorders and helped reveal new biological pathways and genes of interest. However, genetic risk variants commonly reside in non-coding regions and may regulate distant genes rather than the nearest gene, as well as a gene's interaction partners in biological networks. Systems biology and functional genomics approaches provide the framework to unravel the functional significance of genetic risk variants in disease. In this review, we summarize the genetic and transcriptomic studies of Alzheimer's disease and related tauopathies and focus on the advantages of performing systems-level analyses to interrogate the biological pathways underlying neurodegeneration. Finally, we highlight new avenues of multi-omics analysis with single-cell approaches, which provide unparalleled opportunities to systematically explore cellular heterogeneity, and present an example of how to integrate publicly available single-cell datasets. Systems-level analysis has illuminated the function of many disease risk genes, but much work remains to study tauopathies and to understand spatiotemporal gene expression changes of specific cell types.
Collapse
Affiliation(s)
- Emily Miyoshi
- Department of Neurobiology and Behavior, University of California, Irvine, CA 92697, USA; Institute for Memory Impairments and Neurological Disorders (MIND), University of California, Irvine, CA 92697, USA
| | - Samuel Morabito
- Institute for Memory Impairments and Neurological Disorders (MIND), University of California, Irvine, CA 92697, USA; Mathematical, Computational and Systems Biology (MCSB) Program, University of California, Irvine, CA 92697, USA
| | - Vivek Swarup
- Department of Neurobiology and Behavior, University of California, Irvine, CA 92697, USA; Institute for Memory Impairments and Neurological Disorders (MIND), University of California, Irvine, CA 92697, USA.
| |
Collapse
|
15
|
Yang J, Shu L, Duan H, Li H. A Visual Phenotype-Based Differential Diagnosis Process for Rare Diseases. Interdiscip Sci 2021; 14:331-348. [PMID: 34751921 DOI: 10.1007/s12539-021-00490-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2021] [Revised: 10/23/2021] [Accepted: 10/28/2021] [Indexed: 02/01/2023]
Abstract
PURPOSE Phenotype-based rapid diagnosis can make up for the time-consuming genetic sequencing diagnosis of rare diseases. However, the collected phenotypes of patients can sometimes be inaccurate or incomplete, which limits the accuracy of diagnostic results. To solve this problem, we try to design a phenotype-based differential diagnosis process for rare diseases to achieve rapid and accurate diagnosis of rare diseases. METHODS The core of the differential diagnosis of rare diseases is to optimize the phenotype information of a specific patient and the visualized comparative analysis of diseases. To recommend additional phenotypes, replace the fuzzy phenotypes and filter the unexplained phenotypes for patients, we constructed a phenotype hierarchical network and a disease-phenotype differential network and calculated the phenotype co-occurrence relationship. In addition, we designed a visual comparative analysis method to explore the correlation and difference of disease phenotypes. RESULTS The evaluation based on the published 10 rare disease cases demonstrated that after the optimization of patient phenotype information through our differential diagnosis, the target disease often got a better ranking and recommendation score than before. We have deployed this scheme on the RDmap project ( http://rdmap.nbscn.org ). CONCLUSION Compared to genetic and molecular analysis, phenotype-based diagnosis is faster, cheaper, and easier. The differential diagnosis process we designed can optimize the phenotype information of patients and better locate the target disease. It can also help to make screening decisions before genetic testing.
Collapse
Affiliation(s)
- Jian Yang
- The Children's Hospital, Zhejiang University School of Medicine, National Clinical Research Center for Child Health, Binsheng Road 3333#, Hangzhou, 310052, Zhejiang, China.,The College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou, Zhejiang, China
| | - Liqi Shu
- Rhode Island Hospital, Warren Alpert Medical School of Brown University, Rhode Island, USA
| | - Huilong Duan
- The College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou, Zhejiang, China
| | - Haomin Li
- The Children's Hospital, Zhejiang University School of Medicine, National Clinical Research Center for Child Health, Binsheng Road 3333#, Hangzhou, 310052, Zhejiang, China.
| |
Collapse
|
16
|
Ung CY, Weiskittel TM, Correia C, Kaufmann SH, Li H. Manifold medicine: A schema that expands treatment dimensionality. Drug Discov Today 2021; 27:8-16. [PMID: 34600126 PMCID: PMC8714694 DOI: 10.1016/j.drudis.2021.09.016] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Revised: 09/14/2021] [Accepted: 09/23/2021] [Indexed: 11/05/2022]
Abstract
Drug discovery currently focuses on identifying new druggable targets and drug repurposing. Here, we illustrate a third domain of drug discovery: the dimensionality of treatment regimens. We formulate a new schema called ‘Manifold Medicine’, in which disease states are described by vectorial positions on several body-wide axes. Thus, pathological states are represented by multidimensional ‘vectors’ that traverse the body-wide axes. We then delineate the manifold nature of drug action to provide a strategy for designing manifold drug cocktails by design using state-of-the-art biomedical and technological innovations. Manifold Medicine offers a roadmap for translating knowledge gained from next-generation technologies into individualized clinical practice.
Collapse
Affiliation(s)
- Choong Yong Ung
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine and Science, Rochester, MN 55905, USA
| | - Taylor M Weiskittel
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine and Science, Rochester, MN 55905, USA
| | - Cristina Correia
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine and Science, Rochester, MN 55905, USA
| | - Scott H Kaufmann
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine and Science, Rochester, MN 55905, USA
| | - Hu Li
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine and Science, Rochester, MN 55905, USA.
| |
Collapse
|
17
|
Tanaka H, Kreisberg JF, Ideker T. Genetic dissection of complex traits using hierarchical biological knowledge. PLoS Comput Biol 2021; 17:e1009373. [PMID: 34534210 PMCID: PMC8480841 DOI: 10.1371/journal.pcbi.1009373] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2020] [Revised: 09/29/2021] [Accepted: 08/23/2021] [Indexed: 11/18/2022] Open
Abstract
Despite the growing constellation of genetic loci linked to common traits, these loci have yet to account for most heritable variation, and most act through poorly understood mechanisms. Recent machine learning (ML) systems have used hierarchical biological knowledge to associate genetic mutations with phenotypic outcomes, yielding substantial predictive power and mechanistic insight. Here, we use an ontology-guided ML system to map single nucleotide variants (SNVs) focusing on 6 classic phenotypic traits in natural yeast populations. The 29 identified loci are largely novel and account for ~17% of the phenotypic variance, versus <3% for standard genetic analysis. Representative results show that sensitivity to hydroxyurea is linked to SNVs in two alternative purine biosynthesis pathways, and that sensitivity to copper arises through failure to detoxify reactive oxygen species in fatty acid metabolism. This work demonstrates a knowledge-based approach to amplifying and interpreting signals in population genetic studies. Genome-wide association studies (GWAS) have identified many important loci for common diseases and other traits. However, the loci identified by these studies are almost always many steps away from an understanding of underlying biological mechanisms. Here we develop an approach using hierarchical biological knowledge to identify genes and pathways responsible for phenotypic traits. Variants identified by the new method could explain a substantially greater fraction of heritability than previously reported. Moreover, we identified mechanistic pathways by which each causal variant affects cellular function. For example, we find that sensitivity to hydroxyurea is tied to genetic variants in two alternative purine biosynthesis pathways, and that sensitivity to copper arises through failure to detoxify reactive oxygen species in fatty acid metabolism. The new approach is a potentially transformative concept for understanding the genetic drivers of phenotypic variance, with potential applications in understanding traits in biomedicine and agriculture.
Collapse
Affiliation(s)
- Hidenori Tanaka
- Department of Medicine, University of California San Diego, La Jolla, California, United States of America
| | - Jason F. Kreisberg
- Department of Medicine, University of California San Diego, La Jolla, California, United States of America
- * E-mail: (JFK); (TI)
| | - Trey Ideker
- Department of Medicine, University of California San Diego, La Jolla, California, United States of America
- * E-mail: (JFK); (TI)
| |
Collapse
|
18
|
Emmert-Streib F. Grand Challenges for Artificial Intelligence in Molecular Medicine. FRONTIERS IN MOLECULAR MEDICINE 2021; 1:734659. [PMID: 39087080 PMCID: PMC11285658 DOI: 10.3389/fmmed.2021.734659] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/01/2021] [Accepted: 07/08/2021] [Indexed: 08/02/2024]
Affiliation(s)
- Frank Emmert-Streib
- Predictive Society and Data Analytics Lab, Faculty of Information Technolgy and Communication Sciences, Tampere University, Tampere, Finland
- Institute of Biosciences and Medical Technology, Tampere, Finland
| |
Collapse
|
19
|
Ginsberg SD, Neubert TA, Sharma S, Digwal CS, Yan P, Timbus C, Wang T, Chiosis G. Disease-specific interactome alterations via epichaperomics: the case for Alzheimer's disease. FEBS J 2021; 289:2047-2066. [PMID: 34028172 DOI: 10.1111/febs.16031] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2021] [Revised: 04/23/2021] [Accepted: 05/20/2021] [Indexed: 12/22/2022]
Abstract
The increasingly appreciated prevalence of complicated stressor-to-phenotype associations in human disease requires a greater understanding of how specific stressors affect systems or interactome properties. Many currently untreatable diseases arise due to variations in, and through a combination of, multiple stressors of genetic, epigenetic, and environmental nature. Unfortunately, how such stressors lead to a specific disease phenotype or inflict a vulnerability to some cells and tissues but not others remains largely unknown and unsatisfactorily addressed. Analysis of cell- and tissue-specific interactome networks may shed light on organization of biological systems and subsequently to disease vulnerabilities. However, deriving human interactomes across different cell and disease contexts remains a challenge. To this end, this opinion article links stressor-induced protein interactome network perturbations to the formation of pathologic scaffolds termed epichaperomes, revealing a viable and reproducible experimental solution to obtaining rigorous context-dependent interactomes. This article presents our views on how a specialized 'omics platform called epichaperomics may complement and enhance the currently available conventional approaches and aid the scientific community in defining, understanding, and ultimately controlling interactome networks of complex diseases such as Alzheimer's disease. Ultimately, this approach may aid the transition from a limited single-alteration perspective in disease to a comprehensive network-based mindset, which we posit will result in precision medicine paradigms for disease diagnosis and treatment.
Collapse
Affiliation(s)
- Stephen D Ginsberg
- Center for Dementia Research, Nathan Kline Institute, Orangeburg, NY, USA.,Departments of Psychiatry, Neuroscience & Physiology, The NYU Neuroscience Institute, New York University Grossman School of Medicine, NY, USA
| | - Thomas A Neubert
- Kimmel Center for Biology and Medicine at the Skirball Institute, NYU School of Medicine, New York, NY, USA
| | - Sahil Sharma
- Program in Chemical Biology, Sloan Kettering Institute, New York, NY, USA
| | - Chander S Digwal
- Program in Chemical Biology, Sloan Kettering Institute, New York, NY, USA
| | - Pengrong Yan
- Program in Chemical Biology, Sloan Kettering Institute, New York, NY, USA
| | - Calin Timbus
- Department of Mathematics, Technical University of Cluj-Napoca, CJ, Romania
| | - Tai Wang
- Program in Chemical Biology, Sloan Kettering Institute, New York, NY, USA
| | - Gabriela Chiosis
- Program in Chemical Biology, Sloan Kettering Institute, New York, NY, USA.,Breast Cancer Medicine Service, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| |
Collapse
|
20
|
Murray D, Petrey D, Honig B. Integrating 3D structural information into systems biology. J Biol Chem 2021; 296:100562. [PMID: 33744294 PMCID: PMC8095114 DOI: 10.1016/j.jbc.2021.100562] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2020] [Revised: 02/18/2021] [Accepted: 03/17/2021] [Indexed: 12/12/2022] Open
Abstract
Systems biology is a data-heavy field that focuses on systems-wide depictions of biological phenomena necessarily sacrificing a detailed characterization of individual components. As an example, genome-wide protein interaction networks are widely used in systems biology and continuously extended and refined as new sources of evidence become available. Despite the vast amount of information about individual protein structures and protein complexes that has accumulated in the past 50 years in the Protein Data Bank, the data, computational tools, and language of structural biology are not an integral part of systems biology. However, increasing effort has been devoted to this integration, and the related literature is reviewed here. Relationships between proteins that are detected via structural similarity offer a rich source of information not available from sequence similarity, and homology modeling can be used to leverage Protein Data Bank structures to produce 3D models for a significant fraction of many proteomes. A number of structure-informed genomic and cross-species (i.e., virus–host) interactomes will be described, and the unique information they provide will be illustrated with a number of examples. Tissue- and tumor-specific interactomes have also been developed through computational strategies that exploit patient information and through genetic interactions available from increasingly sensitive screens. Strategies to integrate structural information with these alternate data sources will be described. Finally, efforts to link protein structure space with chemical compound space offer novel sources of information in drug design, off-target identification, and the identification of targets for compounds found to be effective in phenotypic screens.
Collapse
Affiliation(s)
- Diana Murray
- Department of Systems Biology, Columbia University, New York, New York, USA
| | - Donald Petrey
- Department of Systems Biology, Columbia University, New York, New York, USA
| | - Barry Honig
- Department of Systems Biology, Department of Biochemistry and Molecular Biophysics, Department of Medicine, Zuckerman Mind Brain and Behavior Institute, Columbia University, New York, New York, USA.
| |
Collapse
|
21
|
Levi H, Elkon R, Shamir R. DOMINO: a network-based active module identification algorithm with reduced rate of false calls. Mol Syst Biol 2021; 17:e9593. [PMID: 33471440 PMCID: PMC7816759 DOI: 10.15252/msb.20209593] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2020] [Revised: 11/09/2020] [Accepted: 11/11/2020] [Indexed: 01/18/2023] Open
Abstract
Algorithms for active module identification (AMI) are central to analysis of omics data. Such algorithms receive a gene network and nodes' activity scores as input and report subnetworks that show significant over-representation of accrued activity signal ("active modules"), thus representing biological processes that presumably play key roles in the analyzed conditions. Here, we systematically evaluated six popular AMI methods on gene expression and GWAS data. We observed that GO terms enriched in modules detected on the real data were often also enriched on modules found on randomly permuted data. This indicated that AMI methods frequently report modules that are not specific to the biological context measured by the analyzed omics dataset. To tackle this bias, we designed a permutation-based method that empirically evaluates GO terms reported by AMI methods. We used the method to fashion five novel AMI performance criteria. Last, we developed DOMINO, a novel AMI algorithm, that outperformed the other six algorithms in extensive testing on GE and GWAS data. Software is available at https://github.com/Shamir-Lab.
Collapse
Affiliation(s)
- Hagai Levi
- The Blavatnik School of Computer ScienceTel Aviv UniversityTel AvivIsrael
| | - Ran Elkon
- Department of Human Molecular Genetics and BiochemistrySackler School of MedicineTel Aviv UniversityTel AvivIsrael
- Sagol School of NeuroscienceTel Aviv UniversityTel AvivIsrael
| | - Ron Shamir
- The Blavatnik School of Computer ScienceTel Aviv UniversityTel AvivIsrael
| |
Collapse
|
22
|
Yin Z, Guo B, Ma S, Sun Y, Mi Z, Zheng Z. DReSS: a method to quantitatively describe the influence of structural perturbations on state spaces of genetic regulatory networks. Brief Bioinform 2020; 22:6032613. [PMID: 33313791 DOI: 10.1093/bib/bbaa315] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2020] [Revised: 09/23/2020] [Accepted: 10/16/2020] [Indexed: 11/14/2022] Open
Abstract
Structures of genetic regulatory networks are not fixed. These structural perturbations can cause changes to the reachability of systems' state spaces. As system structures are related to genotypes and state spaces are related to phenotypes, it is important to study the relationship between structures and state spaces. However, there is still no method can quantitively describe the reachability differences of two state spaces caused by structural perturbations. Therefore, Difference in Reachability between State Spaces (DReSS) is proposed. DReSS index family can quantitively describe differences of reachability, attractor sets between two state spaces and can help find the key structure in a system, which may influence system's state space significantly. First, basic properties of DReSS including non-negativity, symmetry and subadditivity are proved. Then, typical examples are shown to explain the meaning of DReSS and the differences between DReSS and traditional graph distance. Finally, differences of DReSS distribution between real biological regulatory networks and random networks are compared. Results show most structural perturbations in biological networks tend to affect reachability inside and between attractor basins rather than to affect attractor set itself when compared with random networks, which illustrates that most genotype differences tend to influence the proportion of different phenotypes and only a few ones can create new phenotypes. DReSS can provide researchers with a new insight to study the relation between genotypes and phenotypes.
Collapse
Affiliation(s)
- Ziqiao Yin
- Shenyuan Honors College and School of Mathematical Sciences, Beihang University, and Key Laboratory of Mathematics, Informatics and Behavioral Semantics, Ministry of Education. He currently works as a visiting scholar at Yale University
| | - Binghui Guo
- Artificial Intelligence Institute, Beijing Advanced Innovation Center for Big Data and Brain Computing, LMIB, NLSDE, School of Mathematical Sciences, Beihang University, and Peng Cheng Laboratory
| | - Shuangge Ma
- Department of Biostatistics, Yale University
| | - Yifan Sun
- School of Statistics, Renmin University of China
| | - Zhilong Mi
- Key Laboratory of Mathematics, Informatics and Behavioral Semantics, Ministry of Education, and School of Mathematical Sciences from Beihang University
| | - Zhiming Zheng
- Artificial Intelligence Institute, Beijing Advanced Innovation Center for Big Data and Brain Computing, LMIB, NLSDE, School of Mathematical Sciences, Beihang University, and Peng Cheng Laboratory
| |
Collapse
|
23
|
Wamsley B, Geschwind DH. Functional genomics links genetic origins to pathophysiology in neurodegenerative and neuropsychiatric disease. Curr Opin Genet Dev 2020; 65:117-125. [PMID: 32634676 PMCID: PMC8171040 DOI: 10.1016/j.gde.2020.05.032] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2020] [Accepted: 05/24/2020] [Indexed: 12/30/2022]
Abstract
Neurodegenerative and neuropsychiatric disorders are pervasive and debilitating conditions characterized by diverse clinical syndromes and comorbidities, whose origins are as complex and heterogeneous as their associated phenotypes. Risk for these disorders involves substantial genetic liability, which has fueled large-scale genetic studies that have led to a flood of discoveries. In turn, these discoveries have exposed substantial gaps in our knowledge with regards to the complicated genetic architecture of each disorder and the substantial amount of genetic overlap among disorders, which implies some degree of shared pathophysiology underlying these clinically distinct, multifactorial disorders. Understanding the role of specific genetic variants will involve resolving the connections between molecular pathways, heterogeneous cell types, specific circuits and disease pathogenesis at the tissue and patient level. We consider the current known genetic basis of these disorders and highlight the utility of molecular systems approaches that establish the function of genetic variation in the context of specific neurobiological networks, cell-types, and life stages. Beyond expanding our knowledge of disease mechanisms, understanding these relationships provides promise for early detection and potential therapeutic interventions.
Collapse
Affiliation(s)
- Brie Wamsley
- Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA, USA
| | - Daniel H Geschwind
- Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA, USA; Program in Neurobehavioral Genetics and Center for Autism Research and Treatment Semel Institute and Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA, USA.
| |
Collapse
|
24
|
Wu R, Lin Y, Liu X, Zhan C, He H, Shi M, Jiang Z, Shen B. Phenotype-genotype network construction and characterization: a case study of cardiovascular diseases and associated non-coding RNAs. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2020; 2020:5706767. [PMID: 31942979 PMCID: PMC6964217 DOI: 10.1093/database/baz147] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/27/2019] [Revised: 11/21/2019] [Accepted: 12/09/2020] [Indexed: 02/05/2023]
Abstract
The phenotype–genotype relationship is a key for personalized and precision medicine for complex diseases. To unravel the complexity of the clinical phenotype–genotype network, we used cardiovascular diseases (CVDs) and associated non-coding RNAs (ncRNAs) (i.e. miRNAs, long ncRNAs, etc.) as the case for the study of CVDs at a systems or network level. We first integrated a database of CVDs and ncRNAs (CVDncR, http://sysbio.org.cn/cvdncr/) to construct CVD–ncRNA networks and annotate their clinical associations. To characterize the networks, we then separated the miRNAs into two groups, i.e. universal miRNAs associated with at least two types of CVDs and specific miRNAs related only to one type of CVD. Our analyses indicated two interesting patterns in these CVD–ncRNA networks. First, scale-free features were present within both CVD–miRNA and CVD–lncRNA networks; second, universal miRNAs were more likely to be CVDs biomarkers. These results were confirmed by computational functional analyses. The findings offer theoretical guidance for decoding CVD–ncRNA associations and will facilitate the screening of CVD ncRNA biomarkers. Database URL: http://sysbio.org.cn/cvdncr/
Collapse
Affiliation(s)
- Rongrong Wu
- Center for Systems Biology, Soochow University, No. 199 Renai Road, Suzhou, Jiangsu 215123, China
| | - Yuxin Lin
- Center for Systems Biology, Soochow University, No. 199 Renai Road, Suzhou, Jiangsu 215123, China
| | - Xingyun Liu
- Center for Systems Biology, Soochow University, No. 199 Renai Road, Suzhou, Jiangsu 215123, China.,Institutes for Systems Genetics, West China Hospital, Sichuan University, No. 17 Gaopeng Avenue, Ji Tai'an Center, Chengdu, Sichuan 610041, China
| | - Chaoying Zhan
- Center for Systems Biology, Soochow University, No. 199 Renai Road, Suzhou, Jiangsu 215123, China
| | - Hongxin He
- Center for Systems Biology, Soochow University, No. 199 Renai Road, Suzhou, Jiangsu 215123, China
| | - Manhong Shi
- Center for Systems Biology, Soochow University, No. 199 Renai Road, Suzhou, Jiangsu 215123, China.,College of Information and Network Engineering, Anhui Science and Technology University, No. 9 Donghua Road, Fengyang, Anhui 233100, China
| | - Zhi Jiang
- Department of Biochemistry and Molecular Biology, School of Medicine, Soochow University, No. 199 Renai Road, Suzhou, Jiangsu 215123, China
| | - Bairong Shen
- Institutes for Systems Genetics, West China Hospital, Sichuan University, No. 17 Gaopeng Avenue, Ji Tai'an Center, Chengdu, Sichuan 610041, China
| |
Collapse
|
25
|
Lenz AR, Galán-Vásquez E, Balbinot E, de Abreu FP, Souza de Oliveira N, da Rosa LO, de Avila e Silva S, Camassola M, Dillon AJP, Perez-Rueda E. Gene Regulatory Networks of Penicillium echinulatum 2HH and Penicillium oxalicum 114-2 Inferred by a Computational Biology Approach. Front Microbiol 2020; 11:588263. [PMID: 33193246 PMCID: PMC7652724 DOI: 10.3389/fmicb.2020.588263] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2020] [Accepted: 09/23/2020] [Indexed: 11/29/2022] Open
Abstract
Penicillium echinulatum 2HH and Penicillium oxalicum 114-2 are well-known cellulase fungal producers. However, few studies addressing global mechanisms for gene regulation of these two important organisms are available so far. A recent finding that the 2HH wild-type is closely related to P. oxalicum leads to a combined study of these two species. Firstly, we provide a global gene regulatory network for P. echinulatum 2HH and P. oxalicum 114-2, based on TF-TG orthology relationships, considering three related species with well-known regulatory interactions combined with TFBSs prediction. The network was then analyzed in terms of topology, identifying TFs as hubs, and modules. Based on this approach, we explore numerous identified modules, such as the expression of cellulolytic and xylanolytic systems, where XlnR plays a key role in positive regulation of the xylanolytic system. It also regulates positively the cellulolytic system by acting indirectly through the cellodextrin induction system. This remarkable finding suggests that the XlnR-dependent cellulolytic and xylanolytic regulatory systems are probably conserved in both P. echinulatum and P. oxalicum. Finally, we explore the functional congruency on the genes clustered in terms of communities, where the genes related to cellular nitrogen, compound metabolic process and macromolecule metabolic process were the most abundant. Therefore, our approach allows us to confer a degree of accuracy regarding the existence of each inferred interaction.
Collapse
Affiliation(s)
- Alexandre Rafael Lenz
- Unidad Académica Yucatán, Instituto de Investigaciones en Matemáticas Aplicadas y en Sistemas, Universidad Nacional Autónoma de Mexico, Mérida, Mexico
- Laboratório de Bioinformática e Biologia Computacional, Instituto de Biotecnologia, Universidade de Caxias do Sul, Caxias do Sul, Brazil
- Departamento de Ciências Exatas e da Terra, Universidade do Estado da Bahia, Salvador, Brazil
| | - Edgardo Galán-Vásquez
- Departamento de Ingeniería de Sistemas Computacionales y Automatización, Instituto de Investigaciones en Matemàticas Aplicadas y en Sistemas, Universidad Nacional Autónoma de Mexico, Ciudad Universitaria, Mexico
| | - Eduardo Balbinot
- Laboratório de Bioinformática e Biologia Computacional, Instituto de Biotecnologia, Universidade de Caxias do Sul, Caxias do Sul, Brazil
| | - Fernanda Pessi de Abreu
- Laboratório de Bioinformática e Biologia Computacional, Instituto de Biotecnologia, Universidade de Caxias do Sul, Caxias do Sul, Brazil
| | - Nikael Souza de Oliveira
- Laboratório de Bioinformática e Biologia Computacional, Instituto de Biotecnologia, Universidade de Caxias do Sul, Caxias do Sul, Brazil
- Laboratório de Enzimas e Biomassas, Instituto de Biotecnologia, Universidade de Caxias do Sul, Caxias do Sul, Brazil
| | - Letícia Osório da Rosa
- Laboratório de Enzimas e Biomassas, Instituto de Biotecnologia, Universidade de Caxias do Sul, Caxias do Sul, Brazil
| | - Scheila de Avila e Silva
- Laboratório de Bioinformática e Biologia Computacional, Instituto de Biotecnologia, Universidade de Caxias do Sul, Caxias do Sul, Brazil
| | - Marli Camassola
- Laboratório de Enzimas e Biomassas, Instituto de Biotecnologia, Universidade de Caxias do Sul, Caxias do Sul, Brazil
| | - Aldo José Pinheiro Dillon
- Laboratório de Enzimas e Biomassas, Instituto de Biotecnologia, Universidade de Caxias do Sul, Caxias do Sul, Brazil
| | - Ernesto Perez-Rueda
- Unidad Académica Yucatán, Instituto de Investigaciones en Matemáticas Aplicadas y en Sistemas, Universidad Nacional Autónoma de Mexico, Mérida, Mexico
- Facultad de Ciencias, Centro de Genómica y Bioinformática, Universidad Mayor, Santiago, Chile
| |
Collapse
|
26
|
A strategy to incorporate prior knowledge into correlation network cutoff selection. Nat Commun 2020; 11:5153. [PMID: 33056991 PMCID: PMC7560866 DOI: 10.1038/s41467-020-18675-3] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2018] [Accepted: 08/27/2020] [Indexed: 12/16/2022] Open
Abstract
Correlation networks are frequently used to statistically extract biological interactions between omics markers. Network edge selection is typically based on the statistical significance of the correlation coefficients. This procedure, however, is not guaranteed to capture biological mechanisms. We here propose an alternative approach for network reconstruction: a cutoff selection algorithm that maximizes the overlap of the inferred network with available prior knowledge. We first evaluate the approach on IgG glycomics data, for which the biochemical pathway is known and well-characterized. Importantly, even in the case of incomplete or incorrect prior knowledge, the optimal network is close to the true optimum. We then demonstrate the generalizability of the approach with applications to untargeted metabolomics and transcriptomics data. For the transcriptomics case, we demonstrate that the optimized network is superior to statistical networks in systematically retrieving interactions that were not included in the biological reference used for optimization.
Collapse
|
27
|
Kim YA, Sarto Basso R, Wojtowicz D, Liu AS, Hochbaum DS, Vandin F, Przytycka TM. Identifying Drug Sensitivity Subnetworks with NETPHIX. iScience 2020; 23:101619. [PMID: 33089107 PMCID: PMC7566085 DOI: 10.1016/j.isci.2020.101619] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2019] [Revised: 09/08/2020] [Accepted: 09/24/2020] [Indexed: 12/29/2022] Open
Abstract
Phenotypic heterogeneity in cancer is often caused by different patterns of genetic alterations. Understanding such phenotype-genotype relationships is fundamental for the advance of personalized medicine. We develop a computational method, named NETPHIX (NETwork-to-PHenotype association with eXclusivity) to identify subnetworks of genes whose genetic alterations are associated with drug response or other continuous cancer phenotypes. Leveraging interaction information among genes and properties of cancer mutations such as mutual exclusivity, we formulate the problem as an integer linear program and solve it optimally to obtain a subnetwork of associated genes. Applied to a large-scale drug screening dataset, NETPHIX uncovered gene modules significantly associated with drug responses. Utilizing interaction information, NETPHIX modules are functionally coherent and can thus provide important insights into drug action. In addition, we show that modules identified by NETPHIX together with their association patterns can be leveraged to suggest drug combinations.
Collapse
Affiliation(s)
- Yoo-Ah Kim
- National Center of Biotechnology Information, National Library of Medicine, NIH, Bethesda, MD 20894, USA
| | - Rebecca Sarto Basso
- Department of Industrial Engineering and Operations Research, University of California at Berkeley, Berkeley, CA 94709, USA
| | - Damian Wojtowicz
- National Center of Biotechnology Information, National Library of Medicine, NIH, Bethesda, MD 20894, USA
| | - Amanda S Liu
- Montgomery Blair High School, Silver Spring, MD 20901, USA
| | - Dorit S Hochbaum
- Department of Industrial Engineering and Operations Research, University of California at Berkeley, Berkeley, CA 94709, USA
| | - Fabio Vandin
- Department of Information Engineering, University of Padova, Padova 35131, Italy
| | - Teresa M Przytycka
- National Center of Biotechnology Information, National Library of Medicine, NIH, Bethesda, MD 20894, USA
| |
Collapse
|
28
|
Ren X, Wang S, Huang T. Decipher the connections between proteins and phenotypes. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2020; 1868:140503. [PMID: 32707349 DOI: 10.1016/j.bbapap.2020.140503] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/31/2020] [Revised: 06/30/2020] [Accepted: 07/16/2020] [Indexed: 10/23/2022]
Abstract
As the outward-most representation of life, phenotype is the fundamental basis with which humans understand life and disease. But with the advent of molecular and sequencing technique and research, a growing portion of science research focuses primarily on the molecular level of life. Our understanding in molecular variations and mechanisms can only be fully utilized when they are translated into the phenotypic level. In this study, we constructed similarity network for phenotype ontology, and then applied network analysis methods to discover phenotype/disease clusters. Then, we used machine learning models to predict protein-phenotype associations. Each protein was characterized by the functional profiles of its interaction neighbors on the protein-protein interaction network. Our methods can not only predict protein-phenotype associations, but also reveal the underlying mechanisms from protein to phenotype.
Collapse
Affiliation(s)
- Xiaohui Ren
- Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Steven Wang
- Department of Molecular Biology, Columbia University, New York, USA
| | - Tao Huang
- Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China.
| |
Collapse
|
29
|
Oertelt-Prigione S, Mariman E. The impact of sex differences on genomic research. Int J Biochem Cell Biol 2020; 124:105774. [PMID: 32470538 DOI: 10.1016/j.biocel.2020.105774] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2019] [Revised: 05/15/2020] [Accepted: 05/22/2020] [Indexed: 01/23/2023]
Abstract
Sex and gender differences affect all dimensions of human health ranging from the biological basis of disease to therapeutic access, choice and response. Genomics research has long ignored the role of sex differences as potential modulators and the concept is gaining more attention only recently. In the present review we summarize the current knowledge of the impact of sex differences on genomic and epigenomic research, the potential interaction of genomics and gender and the role of these differences in disease etiopathogenesis. Sex differences can emerge from differences in the sex chromosomes themselves, from their interaction with the genome and from the influence of hormones on genomic processes. The impact of these processes on the incidence of autoimmune and oncologic disease is well documented. The growing field of systems biology, which aims at integrating information from different networks of the human body, could also greatly benefit from this approach. In the present review we summarize the current knowledge and provide recommendations for the future performance of sex-sensitive genomics research.
Collapse
Affiliation(s)
- Sabine Oertelt-Prigione
- Department of Primary and Community Care, Radboud Institute of Health Sciences, Radboudumc, Nijmegen, The Netherlands; Institute of Legal and Forensic Medicine, Charité - Universitätsmedizin, Berlin, Germany.
| | - Edwin Mariman
- Department of Human Biology, NUTRIM School of Nutrition and Translational Research in Metabolism, Maastricht University Medical Centre+, Maastricht, The Netherlands
| |
Collapse
|
30
|
Bludau I, Aebersold R. Proteomic and interactomic insights into the molecular basis of cell functional diversity. Nat Rev Mol Cell Biol 2020; 21:327-340. [PMID: 32235894 DOI: 10.1038/s41580-020-0231-2] [Citation(s) in RCA: 134] [Impact Index Per Article: 33.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/26/2020] [Indexed: 02/06/2023]
Abstract
The ability of living systems to adapt to changing conditions originates from their capacity to change their molecular constitution. This is achieved by multiple mechanisms that modulate the quantitative composition and the diversity of the molecular inventory. Molecular diversification is particularly pronounced on the proteome level, at which multiple proteoforms derived from the same gene can in turn combinatorially form different protein complexes, thus expanding the repertoire of functional modules in the cell. The study of molecular and modular diversity and their involvement in responses to changing conditions has only recently become possible through the development of new 'omics'-based screening technologies. This Review explores our current knowledge of the mechanisms regulating functional diversification along the axis of gene expression, with a focus on the proteome and interactome. We explore the interdependence between different molecular levels and how this contributes to functional diversity. Finally, we highlight several recent techniques for studying molecular diversity, with specific focus on mass spectrometry-based analysis of the proteome and its organization into functional modules, and examine future directions for this rapidly growing field.
Collapse
Affiliation(s)
- Isabell Bludau
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland.,Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany
| | - Ruedi Aebersold
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland. .,Faculty of Science, University of Zurich, Zurich, Switzerland.
| |
Collapse
|
31
|
Halakou F, Gursoy A, Keskin O. Embedding Alternative Conformations of Proteins in Protein-Protein Interaction Networks. Methods Mol Biol 2020; 2074:113-124. [PMID: 31583634 DOI: 10.1007/978-1-4939-9873-9_9] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
While many proteins act alone, the majority of them interact with others and form molecular complexes to undertake biological functions at both cellular and systems levels. Two proteins should have complementary shapes to physically connect to each other. As proteins are dynamic and changing their conformations, it is vital to track in which conformation a specific interaction can happen. Here, we present a step-by-step guide to embedding the protein alternative conformations in each protein-protein interaction in a systems level. All external tools/websites used in each step are explained, and some notes and suggestions are provided to clear any ambiguous point.
Collapse
Affiliation(s)
- Farideh Halakou
- Computer Science and Engineering Department, Koc University, Istanbul, Turkey
| | - Attila Gursoy
- Computer Science and Engineering Department, Koc University, Istanbul, Turkey.
| | - Ozlem Keskin
- Chemical and Biological Engineering Department, Koc University, Istanbul, Turkey
| |
Collapse
|
32
|
Sonawane AR, Weiss ST, Glass K, Sharma A. Network Medicine in the Age of Biomedical Big Data. Front Genet 2019; 10:294. [PMID: 31031797 PMCID: PMC6470635 DOI: 10.3389/fgene.2019.00294] [Citation(s) in RCA: 111] [Impact Index Per Article: 22.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2018] [Accepted: 03/19/2019] [Indexed: 12/13/2022] Open
Abstract
Network medicine is an emerging area of research dealing with molecular and genetic interactions, network biomarkers of disease, and therapeutic target discovery. Large-scale biomedical data generation offers a unique opportunity to assess the effect and impact of cellular heterogeneity and environmental perturbations on the observed phenotype. Marrying the two, network medicine with biomedical data provides a framework to build meaningful models and extract impactful results at a network level. In this review, we survey existing network types and biomedical data sources. More importantly, we delve into ways in which the network medicine approach, aided by phenotype-specific biomedical data, can be gainfully applied. We provide three paradigms, mainly dealing with three major biological network archetypes: protein-protein interaction, expression-based, and gene regulatory networks. For each of these paradigms, we discuss a broad overview of philosophies under which various network methods work. We also provide a few examples in each paradigm as a test case of its successful application. Finally, we delineate several opportunities and challenges in the field of network medicine. We hope this review provides a lexicon for researchers from biological sciences and network theory to come on the same page to work on research areas that require interdisciplinary expertise. Taken together, the understanding gained from combining biomedical data with networks can be useful for characterizing disease etiologies and identifying therapeutic targets, which, in turn, will lead to better preventive medicine with translational impact on personalized healthcare.
Collapse
Affiliation(s)
- Abhijeet R. Sonawane
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Boston, MA, United States
- Department of Medicine, Harvard Medical School, Boston, MA, United States
| | - Scott T. Weiss
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Boston, MA, United States
- Department of Medicine, Harvard Medical School, Boston, MA, United States
| | - Kimberly Glass
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Boston, MA, United States
- Department of Medicine, Harvard Medical School, Boston, MA, United States
| | - Amitabh Sharma
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Boston, MA, United States
- Department of Medicine, Harvard Medical School, Boston, MA, United States
- Center for Interdisciplinary Cardiovascular Sciences, Cardiovascular Division, Brigham and Women’s Hospital, Boston, MA, United States
| |
Collapse
|
33
|
Wang W, Corominas R, Lin GN. De novo Mutations From Whole Exome Sequencing in Neurodevelopmental and Psychiatric Disorders: From Discovery to Application. Front Genet 2019; 10:258. [PMID: 31001316 PMCID: PMC6456656 DOI: 10.3389/fgene.2019.00258] [Citation(s) in RCA: 41] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2018] [Accepted: 03/08/2019] [Indexed: 12/13/2022] Open
Abstract
Neurodevelopmental and psychiatric disorders are a highly disabling and heterogeneous group of developmental and mental disorders, resulting from complex interactions of genetic and environmental risk factors. The nature of multifactorial traits and the presence of comorbidity and polygenicity in these disorders present challenges in both disease risk identification and clinical diagnoses. The genetic component has been firmly established, but the identification of all the causative variants remains elusive. The development of next-generation sequencing, especially whole exome sequencing (WES), has greatly enriched our knowledge of the precise genetic alterations of human diseases, including brain-related disorders. In particular, the extensive usage of WES in research studies has uncovered the important contribution of de novo mutations (DNMs) to these disorders. Trio and quad familial WES are a particularly useful approach to discover DNMs. Here, we review the major WES studies in neurodevelopmental and psychiatric disorders and summarize how genes hit by discovered DNMs are shared among different disorders. Next, we discuss different integrative approaches utilized to interrogate DNMs and to identify biological pathways that may disrupt brain development and shed light on our understanding of the genetic architecture underlying these disorders. Lastly, we discuss the current state of the transition from WES research to its routine clinical application. This review will assist researchers and clinicians in the interpretation of variants obtained from WES studies, and highlights the need to develop consensus analytical protocols and validated lists of genes appropriate for clinical laboratory analysis, in order to reach the growing demands.
Collapse
Affiliation(s)
- Weidi Wang
- Shanghai Mental Health Center, School of Biomedical Engineering, Shanghai Jiao Tong University School of Medicine, Shanghai, China
- Shanghai Key Laboratory of Psychotic Disorders, Shanghai Mental Health Center, Shanghai, China
- Brain Science and Technology Research Center, Shanghai Jiao Tong University, Shanghai, China
| | - Roser Corominas
- Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Universitat de Barcelona, Barcelona, Spain
- Centro de Investigación Biomédica en Red de Enfermedades Raras, Valencia, Spain
- Institut de Biomedicina de la Universitat de Barcelona, Barcelona, Spain
- Institut de Recerca Sant Joan de Déu, Esplugues de Llobregat, Barcelona, Spain
| | - Guan Ning Lin
- Shanghai Mental Health Center, School of Biomedical Engineering, Shanghai Jiao Tong University School of Medicine, Shanghai, China
- Shanghai Key Laboratory of Psychotic Disorders, Shanghai Mental Health Center, Shanghai, China
- Brain Science and Technology Research Center, Shanghai Jiao Tong University, Shanghai, China
| |
Collapse
|
34
|
The metabolic network coherence of human transcriptomes is associated with genetic variation at the cadherin 18 locus. Hum Genet 2019; 138:375-388. [PMID: 30852652 PMCID: PMC6483969 DOI: 10.1007/s00439-019-01994-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2019] [Accepted: 02/27/2019] [Indexed: 01/11/2023]
Abstract
Metabolic coherence (MC) is a network-based approach to dimensionality reduction that can be used, for example, to interpret the joint expression of genes linked to human metabolism. Computationally, the derivation of 'transcriptomic' MC involves mapping of an individual gene expression profile onto a gene-centric network derived beforehand from a metabolic network (currently Recon2), followed by the determination of the connectivity of a particular, profile-specific subnetwork. The biological significance of MC has been exemplified previously in the context of human inflammatory bowel disease, among others, but the genetic architecture of this quantitative cellular trait is still unclear. Therefore, we performed a genome-wide association study (GWAS) of MC in the 1000 Genomes/ GEUVADIS data (n = 457) and identified a solitary genome-wide significant association with single nucleotide polymorphisms (SNPs) in the intronic region of the cadherin 18 (CDH18) gene on chromosome 5 (lead SNP: rs11744487, p = 1.2 × 10- 8). Cadherin 18 is a transmembrane protein involved in human neural development and cell-to-cell signaling. Notably, genetic variation at the CDH18 locus has been associated with metabolic syndrome-related traits before. Replication of our genome-wide significant GWAS result was successful in another population study from the Netherlands (BIOS, n = 2661; lead SNP), but failed in two additional studies (KORA, Germany, n = 711; GENOA, USA, n = 411). Besides sample size issues, we surmise that these discrepant findings may be attributable to technical differences. While 1000 Genomes/GEUVADIS and BIOS gene expression profiles were generated by RNA sequencing, the KORA and GENOA data were microarray-based. In addition to providing first evidence for a link between regional genetic variation and a metabolism-related characteristic of human transcriptomes, our findings highlight the benefit of adopting a systems biology-oriented approach to molecular data analysis.
Collapse
|
35
|
Wilkinson B, Evgrafov O, Zheng D, Hartel N, Knowles JA, Graham NA, Ichida J, Coba MP. Endogenous Cell Type-Specific Disrupted in Schizophrenia 1 Interactomes Reveal Protein Networks Associated With Neurodevelopmental Disorders. Biol Psychiatry 2019; 85:305-316. [PMID: 29961565 PMCID: PMC6251761 DOI: 10.1016/j.biopsych.2018.05.009] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/27/2017] [Revised: 04/03/2018] [Accepted: 05/03/2018] [Indexed: 11/17/2022]
Abstract
BACKGROUND Disrupted in schizophrenia 1 (DISC1) has been implicated in a number of psychiatric diseases along with neurodevelopmental phenotypes such as the proliferation and differentiation of neural progenitor cells. While there has been significant effort directed toward understanding the function of DISC1 through the determination of its protein-protein interactions within an in vitro setting, endogenous interactions involving DISC1 within a cell type-specific setting relevant to neural development remain unclear. METHODS Using CRISPR/Cas9 (clustered regularly interspaced short palindromic repeats/CRISPR-associated protein 9) genome engineering technology, we inserted an endogenous 3X-FLAG tag at the C-terminus of the canonical DISC1 gene in human induced pluripotent stem cells (iPSCs). We further differentiated these cells and used affinity purification to determine protein-protein interactions involving DISC1 in iPSC-derived neural progenitor cells and astrocytes. RESULTS We were able to determine 151 novel cell type-specific proteins present in DISC1 endogenous interactomes. The DISC1 interactomes can be clustered into several subcomplexes that suggest novel DISC1 cell-specific functions. In addition, the DISC1 interactome in iPSC-derived neural progenitor cells associates in a connected network containing proteins found to harbor de novo mutations in patients affected by schizophrenia and contains a subset of novel interactions that are known to harbor syndromic mutations in neurodevelopmental disorders. CONCLUSIONS Endogenous DISC1 interactomes within iPSC-derived human neural progenitor cells and astrocytes are able to provide context to DISC1 function in a cell type-specific setting relevant to neural development and enables the integration of psychiatric disease risk factors within a set of defined molecular functions.
Collapse
Affiliation(s)
- Brent Wilkinson
- Zilkha Neurogenetic Institute, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA
| | - Oleg Evgrafov
- Department of Cell Biology, SUNY Downstate Medical Center, Brooklyn, NY 11203, USA
| | - DongQing Zheng
- Mork Family Department of Chemical Engineering and Materials Science, University of Southern California, Los Angeles, CA 90033, USA
| | - Nicolas Hartel
- Mork Family Department of Chemical Engineering and Materials Science, University of Southern California, Los Angeles, CA 90033, USA
| | - James A. Knowles
- Department of Cell Biology, SUNY Downstate Medical Center, Brooklyn, NY 11203, USA
| | - Nicholas A. Graham
- Mork Family Department of Chemical Engineering and Materials Science, University of Southern California, Los Angeles, CA 90033, USA
| | - Justin Ichida
- Zilkha Neurogenetic Institute, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA,Department of Stem Cell Biology and Regenerative Medicine, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA,Eli and Edythe Broad CIRM Center for Regenerative Medicine and Stem Cell Research at USC
| | - Marcelo P. Coba
- Zilkha Neurogenetic Institute, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA,Department of Psychiatry and Behavioral Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA,Corresponding Author: Marcelo P. Coba, Keck School of Medicine, University of Southern California, Zilkha Neurogenetic Institute, 1501 San Pablo St, Los Angeles, CA 90033, USA. Phone: 323-442-4345.
| |
Collapse
|
36
|
Islamaj Dogan R, Kim S, Chatr-Aryamontri A, Wei CH, Comeau DC, Antunes R, Matos S, Chen Q, Elangovan A, Panyam NC, Verspoor K, Liu H, Wang Y, Liu Z, Altinel B, Hüsünbeyi ZM, Özgür A, Fergadis A, Wang CK, Dai HJ, Tran T, Kavuluru R, Luo L, Steppi A, Zhang J, Qu J, Lu Z. Overview of the BioCreative VI Precision Medicine Track: mining protein interactions and mutations for precision medicine. Database (Oxford) 2019; 2019:5303240. [PMID: 30689846 PMCID: PMC6348314 DOI: 10.1093/database/bay147] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2018] [Accepted: 12/19/2018] [Indexed: 12/16/2022]
Abstract
The Precision Medicine Initiative is a multicenter effort aiming at formulating personalized treatments leveraging on individual patient data (clinical, genome sequence and functional genomic data) together with the information in large knowledge bases (KBs) that integrate genome annotation, disease association studies, electronic health records and other data types. The biomedical literature provides a rich foundation for populating these KBs, reporting genetic and molecular interactions that provide the scaffold for the cellular regulatory systems and detailing the influence of genetic variants in these interactions. The goal of BioCreative VI Precision Medicine Track was to extract this particular type of information and was organized in two tasks: (i) document triage task, focused on identifying scientific literature containing experimentally verified protein-protein interactions (PPIs) affected by genetic mutations and (ii) relation extraction task, focused on extracting the affected interactions (protein pairs). To assist system developers and task participants, a large-scale corpus of PubMed documents was manually annotated for this task. Ten teams worldwide contributed 22 distinct text-mining models for the document triage task, and six teams worldwide contributed 14 different text-mining systems for the relation extraction task. When comparing the text-mining system predictions with human annotations, for the triage task, the best F-score was 69.06%, the best precision was 62.89%, the best recall was 98.0% and the best average precision was 72.5%. For the relation extraction task, when taking homologous genes into account, the best F-score was 37.73%, the best precision was 46.5% and the best recall was 54.1%. Submitted systems explored a wide range of methods, from traditional rule-based, statistical and machine learning systems to state-of-the-art deep learning methods. Given the level of participation and the individual team results we find the precision medicine track to be successful in engaging the text-mining research community. In the meantime, the track produced a manually annotated corpus of 5509 PubMed documents developed by BioGRID curators and relevant for precision medicine. The data set is freely available to the community, and the specific interactions have been integrated into the BioGRID data set. In addition, this challenge provided the first results of automatically identifying PubMed articles that describe PPI affected by mutations, as well as extracting the affected relations from those articles. Still, much progress is needed for computer-assisted precision medicine text mining to become mainstream. Future work should focus on addressing the remaining technical challenges and incorporating the practical benefits of text-mining tools into real-world precision medicine information-related curation.
Collapse
Affiliation(s)
- Rezarta Islamaj Dogan
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Sun Kim
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | | | - Chih-Hsuan Wei
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Donald C Comeau
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Rui Antunes
- Department of Electronics, Telecommunications and Informatics (DETI)/Institute of Electronics and Informatics Engineering of Aveiro (IEETA), University of Aveiro, Aveiro, Portugal
| | - Sérgio Matos
- Department of Electronics, Telecommunications and Informatics (DETI)/Institute of Electronics and Informatics Engineering of Aveiro (IEETA), University of Aveiro, Aveiro, Portugal
| | - Qingyu Chen
- School of Computing and Information Systems, The University of Melbourne, Melbourne, VIC, Australia
| | - Aparna Elangovan
- School of Computing and Information Systems, The University of Melbourne, Melbourne, VIC, Australia
| | - Nagesh C Panyam
- School of Computing and Information Systems, The University of Melbourne, Melbourne, VIC, Australia
| | - Karin Verspoor
- School of Computing and Information Systems, The University of Melbourne, Melbourne, VIC, Australia
| | - Hongfang Liu
- Department of Health Science Research, Mayo Clinic, Rochester, MN, USA
| | - Yanshan Wang
- Department of Health Science Research, Mayo Clinic, Rochester, MN, USA
| | - Zhuang Liu
- School of Computer Science and Technology, Dalian University of Technology, Dalian, China
| | - Berna Altinel
- Department of Computer Engineering, Marmara University, Istanbul, Turkey
| | | | | | - Aris Fergadis
- School of Electrical and Computer Engineering, National Technical University of Athens, Zografou, Athens, Greece
| | - Chen-Kai Wang
- Graduate Institute of Biomedical Informatics, Taipei Medical University, Taipei, Taiwan
| | - Hong-Jie Dai
- Department of Electrical Engineering, National Kaousiung University of Science and Technology, Kaohsiung, Taiwan
| | - Tung Tran
- Department of Computer Science, University of Kentucky, Lexington, KY, USA
| | - Ramakanth Kavuluru
- Division of Biomedical Informatics, Department of Internal Medicine, University of Kentucky, Lexington, KY, USA
| | - Ling Luo
- College of Computer Science and Technology, Dalian University of Technology, Dalian, China
| | - Albert Steppi
- Department of Statistics, Florida State University, Florida, USA
| | - Jinfeng Zhang
- Department of Statistics, Florida State University, Florida, USA
| | - Jinchan Qu
- Department of Statistics, Florida State University, Florida, USA
| | - Zhiyong Lu
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| |
Collapse
|
37
|
Capriotti E, Ozturk K, Carter H. Integrating molecular networks with genetic variant interpretation for precision medicine. WILEY INTERDISCIPLINARY REVIEWS-SYSTEMS BIOLOGY AND MEDICINE 2018; 11:e1443. [PMID: 30548534 PMCID: PMC6450710 DOI: 10.1002/wsbm.1443] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/15/2018] [Revised: 10/23/2018] [Accepted: 10/30/2018] [Indexed: 02/01/2023]
Abstract
More reliable and cheaper sequencing technologies have revealed the vast mutational landscapes characteristic of many phenotypes. The analysis of such genetic variants has led to successful identification of altered proteins underlying many Mendelian disorders. Nevertheless the simple one‐variant one‐phenotype model valid for many monogenic diseases does not capture the complexity of polygenic traits and disorders. Although experimental and computational approaches have improved detection of functionally deleterious variants and important interactions between gene products, the development of comprehensive models relating genotype and phenotypes remains a challenge in the field of genomic medicine. In this context, a new view of the pathologic state as significant perturbation of the network of interactions between biomolecules is crucial for the identification of biochemical pathways associated with complex phenotypes. Seminal studies in systems biology combined the analysis of genetic variation with protein–protein interaction networks to demonstrate that even as biological systems evolve to be robust to genetic variation, their topologies create disease vulnerabilities. More recent analyses model the impact of genetic variants as changes to the “wiring” of the interactome to better capture heterogeneity in genotype–phenotype relationships. These studies lay the foundation for using networks to predict variant effects at scale using machine‐learning or algorithmic approaches. A wealth of databases and resources for the annotation of genotype–phenotype relationships have been developed to support developments in this area. This overview describes how study of the molecular interactome has generated insights linking the organization of biological systems to disease mechanism, and how this information can enable precision medicine. This article is categorized under:
Translational, Genomic, and Systems Medicine > Translational Medicine Biological Mechanisms > Cell Signaling Models of Systems Properties and Processes > Mechanistic Models Analytical and Computational Methods > Computational Methods
Collapse
Affiliation(s)
- Emidio Capriotti
- Department of Pharmacy and Biotechnology (FaBiT), University of Bologna, Bologna, Italy
| | - Kivilcim Ozturk
- Bioinformatics and Systems Biology Program, University of California, San Diego, La Jolla, California
| | - Hannah Carter
- Department of Medicine and Institute for Genomic Medicine, University of California, San Diego, La Jolla, California
| |
Collapse
|
38
|
Higareda-Almaraz JC, Karbiener M, Giroud M, Pauler FM, Gerhalter T, Herzig S, Scheideler M. Norepinephrine triggers an immediate-early regulatory network response in primary human white adipocytes. BMC Genomics 2018; 19:794. [PMID: 30390616 PMCID: PMC6215669 DOI: 10.1186/s12864-018-5173-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2018] [Accepted: 10/16/2018] [Indexed: 12/03/2022] Open
Abstract
BACKGROUND Norepinephrine (NE) signaling has a key role in white adipose tissue (WAT) functions, including lipolysis, free fatty acid liberation and, under certain conditions, conversion of white into brite (brown-in-white) adipocytes. However, acute effects of NE stimulation have not been described at the transcriptional network level. RESULTS We used RNA-seq to uncover a broad transcriptional response. The inference of protein-protein and protein-DNA interaction networks allowed us to identify a set of immediate-early genes (IEGs) with high betweenness, validating our approach and suggesting a hierarchical control of transcriptional regulation. In addition, we identified a transcriptional regulatory network with IEGs as master regulators, including HSF1 and NFIL3 as novel NE-induced IEG candidates. Moreover, a functional enrichment analysis and gene clustering into functional modules suggest a crosstalk between metabolic, signaling, and immune responses. CONCLUSIONS Altogether, our network biology approach explores for the first time the immediate-early systems level response of human adipocytes to acute sympathetic activation, thereby providing a first network basis of early cell fate programs and crosstalks between metabolic and transcriptional networks required for proper WAT function.
Collapse
Affiliation(s)
- Juan Carlos Higareda-Almaraz
- Institute for Diabetes and Cancer (IDC), Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
- Joint Heidelberg-IDC Translational Diabetes Program, Heidelberg University Hospital, Heidelberg, Germany
- Molecular Metabolic Control, Medical Faculty, Technical University, Munich, Germany
- German Center for Diabetes Research (DZD), Neuherberg, Germany
- NMR laboratory, Institute of Myology, Hopital Universitaire Pitie Salpetriere, Paris, France
| | - Michael Karbiener
- Department of Phoniatrics, ENT University Hospital, Medical University of Graz, Graz, Austria
| | - Maude Giroud
- Institute for Diabetes and Cancer (IDC), Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
- Joint Heidelberg-IDC Translational Diabetes Program, Heidelberg University Hospital, Heidelberg, Germany
- Molecular Metabolic Control, Medical Faculty, Technical University, Munich, Germany
- German Center for Diabetes Research (DZD), Neuherberg, Germany
| | - Florian M. Pauler
- CeMM, Research Center for Molecular Medicine of the Austrian Academy of Sciences, Vienna, Austria
- Present Address: Institute of Science and Technology (IST) Austria, Klosterneuburg, Austria
| | - Teresa Gerhalter
- Present Address: Institute of Science and Technology (IST) Austria, Klosterneuburg, Austria
| | - Stephan Herzig
- Institute for Diabetes and Cancer (IDC), Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
- Joint Heidelberg-IDC Translational Diabetes Program, Heidelberg University Hospital, Heidelberg, Germany
- Molecular Metabolic Control, Medical Faculty, Technical University, Munich, Germany
- German Center for Diabetes Research (DZD), Neuherberg, Germany
| | - Marcel Scheideler
- Institute for Diabetes and Cancer (IDC), Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
- Joint Heidelberg-IDC Translational Diabetes Program, Heidelberg University Hospital, Heidelberg, Germany
- Molecular Metabolic Control, Medical Faculty, Technical University, Munich, Germany
- German Center for Diabetes Research (DZD), Neuherberg, Germany
- NMR laboratory, Institute of Myology, Hopital Universitaire Pitie Salpetriere, Paris, France
| |
Collapse
|
39
|
Lacher SE, Levings DC, Freeman S, Slattery M. Identification of a functional antioxidant response element at the HIF1A locus. Redox Biol 2018; 19:401-411. [PMID: 30241031 PMCID: PMC6146589 DOI: 10.1016/j.redox.2018.08.014] [Citation(s) in RCA: 74] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2018] [Revised: 08/21/2018] [Accepted: 08/22/2018] [Indexed: 12/12/2022] Open
Abstract
Reactive oxygen species (ROS), which are a byproduct of oxidative metabolism, serve as signaling molecules in a number of physiological settings. However, if their levels are not tightly maintained, excess ROS lead to potentially cytotoxic oxidative stress. Accordingly, several transcriptional regulatory networks have evolved to include components that are highly ROS-responsive. Depending on the context, these regulatory networks can leverage ROS to respond to nutrient conditions, metabolism, or other physiological signals, or to respond to oxidative stress. However, ROS signaling is complex, so regulatory interactions between various ROS-responsive transcription factors are still being mapped out. Here we show that the transcription factor NRF2, a key regulator of the adaptive response to oxidative stress, directly regulates expression of HIF1A, which encodes HIF1α, a key transcriptional regulator of the adaptive response to hypoxia. We used an integrative genomics approach to identify HIF1A as a ROS-responsive transcript and we found an NRF2-bound antioxidant response element (ARE) approximately 30 kilobases upstream of HIF1A. This ARE sequence is deeply conserved, and we verified that it is directly bound and activated by NRF2. In addition, we found that HIF1A is upregulated in breast and bladder tumors with high NRF2 activity. Taken together, our results demonstrate that NRF2 targets a functional ARE at the HIF1A locus, and reveal a direct regulatory connection between two important oxygen responsive transcription factors.
Collapse
Affiliation(s)
- Sarah E Lacher
- Department of Biomedical Sciences, University of Minnesota Medical School, 1035 University Drive, SMed 255, Duluth, MN 55812, United States
| | - Daniel C Levings
- Department of Biomedical Sciences, University of Minnesota Medical School, 1035 University Drive, SMed 255, Duluth, MN 55812, United States
| | - Samuel Freeman
- Department of Biomedical Sciences, University of Minnesota Medical School, 1035 University Drive, SMed 255, Duluth, MN 55812, United States
| | - Matthew Slattery
- Department of Biomedical Sciences, University of Minnesota Medical School, 1035 University Drive, SMed 255, Duluth, MN 55812, United States.
| |
Collapse
|
40
|
Najafi H, Hosseini SM, Tavallaie M, Soltani BM. A Predicted Molecular Model for Development of Human Intelligence. NEUROCHEM J+ 2018. [DOI: 10.1134/s1819712418030091] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
41
|
Cui H, Zhao N, Korkin D. Multilayer View of Pathogenic SNVs in Human Interactome through In Silico Edgetic Profiling. J Mol Biol 2018; 430:2974-2992. [DOI: 10.1016/j.jmb.2018.07.012] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2018] [Revised: 06/28/2018] [Accepted: 07/09/2018] [Indexed: 12/21/2022]
|
42
|
Hutt DM, Loguercio S, Roth DM, Su AI, Balch WE. Correcting the F508del-CFTR variant by modulating eukaryotic translation initiation factor 3-mediated translation initiation. J Biol Chem 2018; 293:13477-13495. [PMID: 30006345 DOI: 10.1074/jbc.ra118.003192] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2018] [Revised: 07/05/2018] [Indexed: 12/31/2022] Open
Abstract
Inherited and somatic rare diseases result from >200,000 genetic variants leading to loss- or gain-of-toxic function, often caused by protein misfolding. Many of these misfolded variants fail to properly interact with other proteins. Understanding the link between factors mediating the transcription, translation, and protein folding of these disease-associated variants remains a major challenge in cell biology. Herein, we utilized the cystic fibrosis transmembrane conductance regulator (CFTR) protein as a model and performed a proteomics-based high-throughput screen (HTS) to identify pathways and components affecting the folding and function of the most common cystic fibrosis-associated mutation, the F508del variant of CFTR. Using a shortest-path algorithm we developed, we mapped HTS hits to the CFTR interactome to provide functional context to the targets and identified the eukaryotic translation initiation factor 3a (eIF3a) as a central hub for the biogenesis of CFTR. Of note, siRNA-mediated silencing of eIF3a reduced the polysome-to-monosome ratio in F508del-expressing cells, which, in turn, decreased the translation of CFTR variants, leading to increased CFTR stability, trafficking, and function at the cell surface. This finding suggested that eIF3a is involved in mediating the impact of genetic variations in CFTR on the folding of this protein. We posit that the number of ribosomes on a CFTR mRNA transcript is inversely correlated with the stability of the translated polypeptide. Polysome-based translation challenges the capacity of the proteostasis environment to balance message fidelity with protein folding, leading to disease. We suggest that this deficit can be corrected through control of translation initiation.
Collapse
Affiliation(s)
| | | | | | - Andrew I Su
- Integrative Structural and Computational Biology and
| | - William E Balch
- From the Departments of Molecular Medicine and .,the Skaggs Institute for Chemical Biology, The Scripps Research Institute, La Jolla, California 92037
| |
Collapse
|
43
|
Chen Z, Quan L, Huang A, Zhao Q, Yuan Y, Yuan X, Shen Q, Shang J, Ben Y, Qin FXF, Wu A. seq-ImmuCC: Cell-Centric View of Tissue Transcriptome Measuring Cellular Compositions of Immune Microenvironment From Mouse RNA-Seq Data. Front Immunol 2018; 9:1286. [PMID: 29922297 PMCID: PMC5996037 DOI: 10.3389/fimmu.2018.01286] [Citation(s) in RCA: 63] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2018] [Accepted: 05/22/2018] [Indexed: 12/02/2022] Open
Abstract
The RNA sequencing approach has been broadly used to provide gene-, pathway-, and network-centric analyses for various cell and tissue samples. However, thus far, rich cellular information carried in tissue samples has not been thoroughly characterized from RNA-Seq data. Therefore, it would expand our horizons to better understand the biological processes of the body by incorporating a cell-centric view of tissue transcriptome. Here, a computational model named seq-ImmuCC was developed to infer the relative proportions of 10 major immune cells in mouse tissues from RNA-Seq data. The performance of seq-ImmuCC was evaluated among multiple computational algorithms, transcriptional platforms, and simulated and experimental datasets. The test results showed its stable performance and superb consistency with experimental observations under different conditions. With seq-ImmuCC, we generated the comprehensive landscape of immune cell compositions in 27 normal mouse tissues and extracted the distinct signatures of immune cell proportion among various tissue types. Furthermore, we quantitatively characterized and compared 18 different types of mouse tumor tissues of distinct cell origins with their immune cell compositions, which provided a comprehensive and informative measurement for the immune microenvironment inside tumor tissues. The online server of seq-ImmuCC are freely available at http://wap-lab.org:3200/immune/.
Collapse
Affiliation(s)
- Ziyi Chen
- Center for Systems Medicine, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China.,Suzhou Institute of Systems Medicine, Suzhou, Jiangsu, China
| | - Lijun Quan
- Center for Systems Medicine, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China.,Suzhou Institute of Systems Medicine, Suzhou, Jiangsu, China
| | - Anfei Huang
- Center for Systems Medicine, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China.,Suzhou Institute of Systems Medicine, Suzhou, Jiangsu, China
| | - Qiang Zhao
- Suzhou Institute of Systems Medicine, Suzhou, Jiangsu, China.,School of Life Science and Technology, China Pharmaceutical University, Nanjing, China
| | - Yao Yuan
- Suzhou Institute of Systems Medicine, Suzhou, Jiangsu, China.,School of Pharmacy, Health Science Center, Xi'an Jiaotong University, Xi'an, China
| | - Xuye Yuan
- Center for Systems Medicine, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China.,Suzhou Institute of Systems Medicine, Suzhou, Jiangsu, China
| | - Qin Shen
- Center for Systems Medicine, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China.,Suzhou Institute of Systems Medicine, Suzhou, Jiangsu, China
| | - Jingzhe Shang
- Center for Systems Medicine, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China.,Suzhou Institute of Systems Medicine, Suzhou, Jiangsu, China
| | - Yinyin Ben
- Center for Systems Medicine, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China.,Suzhou Institute of Systems Medicine, Suzhou, Jiangsu, China
| | - F Xiao-Feng Qin
- Center for Systems Medicine, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China.,Suzhou Institute of Systems Medicine, Suzhou, Jiangsu, China
| | - Aiping Wu
- Center for Systems Medicine, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China.,Suzhou Institute of Systems Medicine, Suzhou, Jiangsu, China
| |
Collapse
|
44
|
Ranjitha Dhanasekaran A, Gardiner KJ. RPPAware: A software suite to preprocess, analyze and visualize reverse phase protein array data. J Bioinform Comput Biol 2018; 16:1850001. [PMID: 29478376 DOI: 10.1142/s0219720018500014] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Reverse Phase Protein Arrays (RPPA) is a high-throughput technology used to profile levels of protein expression. Handling the large datasets generated by RPPA can be facilitated by appropriate software tools. Here, we describe RPPAware, a free and intuitive software suite that was developed specifically for analysis and visualization of RPPA data. RPPAware is a portable tool that requires no installation and was built using Java. Many modules of the tool invoke R to utilize the statistical features. To demonstrate the utility of RPPAware, data generated from screening brain regions of a mouse model of Down syndrome with 62 antibodies were used as a case study. The ease of use and efficiency of RPPAware can accelerate data analysis to facilitate biological discovery. RPPAware 1.0 is freely available under GNU General Public License from the project website at http://downsyndrome.ucdenver.edu/iddrc/rppaware/home.htm along with a full documentation of the tool.
Collapse
Affiliation(s)
- A Ranjitha Dhanasekaran
- * Rocky Mountain Alzheimer's Disease Center, School of Medicine, University of Colorado Denver, Anschutz Medical Campus, Aurora, Colorado 80045, USA
- † Department of Neurology, School of Medicine, University of Colorado Denver, Anschutz Medical Campus, Aurora, Colorado 80045, USA
- ‡ Linda Crnic Institute for Down Syndrome, School of Medicine, University of Colorado Denver, Anschutz Medical Campus, Aurora, Colorado 80045, USA
| | - Katheleen J Gardiner
- ‡ Linda Crnic Institute for Down Syndrome, School of Medicine, University of Colorado Denver, Anschutz Medical Campus, Aurora, Colorado 80045, USA
- § Department of Pediatrics, School of Medicine, University of Colorado Denver, Anschutz Medical Campus, Aurora, Colorado 80045, USA
- ¶ Human Medical Genetics and Genomics and Neuroscience Programs, School of Medicine, University of Colorado Denver, Anschutz Medical Campus, Aurora, Colorado 80045, USA
| |
Collapse
|
45
|
Monraz Gomez LC, Kondratova M, Ravel JM, Barillot E, Zinovyev A, Kuperstein I. Application of Atlas of Cancer Signalling Network in preclinical studies. Brief Bioinform 2018; 20:701-716. [DOI: 10.1093/bib/bby031] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2017] [Revised: 03/28/2018] [Indexed: 12/27/2022] Open
Affiliation(s)
- L Cristobal Monraz Gomez
- Institut Curie, PSL Research University, F-75005 Paris, France, INSERM, U900, F-75005 Paris, France and MINES ParisTech, PSL Research University, CBIO-Centre for Computational Biology, F-75006 Paris, France
| | - Maria Kondratova
- Institut Curie, PSL Research University, F-75005 Paris, France, INSERM, U900, F-75005 Paris, France and MINES ParisTech, PSL Research University, CBIO-Centre for Computational Biology, F-75006 Paris, France
| | - Jean-Marie Ravel
- Genetic Laboratory, Nancy's Regional University Hospital, Vandœuvre-lès-Nancy and INSERM UMR 954, Lorraine University, Vandœuvre-lès-Nancy
| | - Emmanuel Barillot
- Institut Curie, PSL Research University, F-75005 Paris, France, INSERM, U900, F-75005 Paris, France and MINES ParisTech, PSL Research University, CBIO-Centre for Computational Biology, F-75006 Paris, France
| | - Andrei Zinovyev
- Institut Curie, PSL Research University, F-75005 Paris, France, INSERM, U900, F-75005 Paris, France and MINES ParisTech, PSL Research University, CBIO-Centre for Computational Biology, F-75006 Paris, France
| | - Inna Kuperstein
- Institut Curie, PSL Research University, F-75005 Paris, France, INSERM, U900, F-75005 Paris, France and MINES ParisTech, PSL Research University, CBIO-Centre for Computational Biology, F-75006 Paris, France
| |
Collapse
|
46
|
Ashraf N, Basu S, Narula K, Ghosh S, Tayal R, Gangisetty N, Biswas S, Aggarwal PR, Chakraborty N, Chakraborty S. Integrative network analyses of wilt transcriptome in chickpea reveal genotype dependent regulatory hubs in immunity and susceptibility. Sci Rep 2018; 8:6528. [PMID: 29695764 PMCID: PMC5916944 DOI: 10.1038/s41598-018-19919-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2017] [Accepted: 01/05/2018] [Indexed: 12/12/2022] Open
Abstract
Host specific resistance and non-host resistance are two plant immune responses to counter pathogen invasion. Gene network organizing principles leading to quantitative differences in resistant and susceptible host during host specific resistance are poorly understood. Vascular wilt caused by root pathogen Fusarium species is complex and governed by host specific resistance in crop plants, including chickpea. Here, we temporally profiled two contrasting chickpea genotypes in disease and immune state to better understand gene expression switches in host specific resistance. Integrative gene-regulatory network elucidated tangible insight into interaction coordinators leading to pathway determination governing distinct (disease or immune) phenotypes. Global network analysis identified five major hubs with 389 co-regulated genes. Functional enrichment revealed immunome containing three subnetworks involving CTI, PTI and ETI and wilt diseasome encompassing four subnetworks highlighting pathogen perception, penetration, colonization and disease establishment. These subnetworks likely represent key components that coordinate various biological processes favouring defence or disease. Furthermore, we identified core 76 disease/immunity related genes through subcellular analysis. Our regularized network with robust statistical assessment captured known and unexpected gene interaction, candidate novel regulators as future biomarkers and first time showed system-wide quantitative architecture corresponding to genotypic characteristics in wilt landscape.
Collapse
Affiliation(s)
- Nasheeman Ashraf
- National Institute of Plant Genome Research, Aruna Asaf Ali Marg, New Delhi, 110067, India
| | - Swaraj Basu
- National Institute of Plant Genome Research, Aruna Asaf Ali Marg, New Delhi, 110067, India
| | - Kanika Narula
- National Institute of Plant Genome Research, Aruna Asaf Ali Marg, New Delhi, 110067, India
| | - Sudip Ghosh
- National Institute of Plant Genome Research, Aruna Asaf Ali Marg, New Delhi, 110067, India
| | - Rajul Tayal
- National Institute of Plant Genome Research, Aruna Asaf Ali Marg, New Delhi, 110067, India
| | - Nagaraju Gangisetty
- National Institute of Plant Genome Research, Aruna Asaf Ali Marg, New Delhi, 110067, India
| | - Sushmita Biswas
- National Institute of Plant Genome Research, Aruna Asaf Ali Marg, New Delhi, 110067, India
| | - Pooja R Aggarwal
- National Institute of Plant Genome Research, Aruna Asaf Ali Marg, New Delhi, 110067, India
| | - Niranjan Chakraborty
- National Institute of Plant Genome Research, Aruna Asaf Ali Marg, New Delhi, 110067, India
| | - Subhra Chakraborty
- National Institute of Plant Genome Research, Aruna Asaf Ali Marg, New Delhi, 110067, India.
| |
Collapse
|
47
|
Systematic Evaluation of Molecular Networks for Discovery of Disease Genes. Cell Syst 2018; 6:484-495.e5. [PMID: 29605183 DOI: 10.1016/j.cels.2018.03.001] [Citation(s) in RCA: 173] [Impact Index Per Article: 28.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2017] [Revised: 12/19/2017] [Accepted: 02/28/2018] [Indexed: 12/27/2022]
Abstract
Gene networks are rapidly growing in size and number, raising the question of which networks are most appropriate for particular applications. Here, we evaluate 21 human genome-wide interaction networks for their ability to recover 446 disease gene sets identified through literature curation, gene expression profiling, or genome-wide association studies. While all networks have some ability to recover disease genes, we observe a wide range of performance with STRING, ConsensusPathDB, and GIANT networks having the best performance overall. A general tendency is that performance scales with network size, suggesting that new interaction discovery currently outweighs the detrimental effects of false positives. Correcting for size, we find that the DIP network provides the highest efficiency (value per interaction). Based on these results, we create a parsimonious composite network with both high efficiency and performance. This work provides a benchmark for selection of molecular networks in human disease research.
Collapse
|
48
|
Molecular Genetic Analysis of Human Endometrial Mesenchymal Stem Cells That Survived Sublethal Heat Shock. Stem Cells Int 2017; 2017:2362630. [PMID: 29375621 PMCID: PMC5742502 DOI: 10.1155/2017/2362630] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2017] [Accepted: 07/13/2017] [Indexed: 02/07/2023] Open
Abstract
High temperature is a critical environmental and personal factor. Although heat shock is a well-studied biological phenomenon, hyperthermia response of stem cells is poorly understood. Previously, we demonstrated that sublethal heat shock induced premature senescence in human endometrial mesenchymal stem cells (eMSC). This study aimed to investigate the fate of eMSC-survived sublethal heat shock (SHS) with special emphasis on their genetic stability and possible malignant transformation using methods of classic and molecular karyotyping, next-generation sequencing, and transcriptome functional analysis. G-banding revealed random chromosome breakages and aneuploidy in the SHS-treated eMSC. Molecular karyotyping found no genomic imbalance in these cells. Gene module and protein interaction network analysis of mRNA sequencing data showed that compared to untreated cells, SHS-survived progeny revealed some difference in gene expression. However, no hallmarks of cancer were found. Our data identified downregulation of oncogenic signaling, upregulation of tumor-suppressing and prosenescence signaling, induction of mismatch, and excision DNA repair. The common feature of heated eMSC is the silence of MYC, AKT1/PKB oncogenes, and hTERT telomerase. Overall, our data indicate that despite genetic instability, SHS-survived eMSC do not undergo transformation. After long-term cultivation, these cells like their unheated counterparts enter replicative senescence and die.
Collapse
|
49
|
Isik Z, Ercan ME. Integration of RNA-Seq and RPPA data for survival time prediction in cancer patients. Comput Biol Med 2017; 89:397-404. [PMID: 28869900 DOI: 10.1016/j.compbiomed.2017.08.028] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2017] [Revised: 08/20/2017] [Accepted: 08/25/2017] [Indexed: 10/19/2022]
Abstract
Integration of several types of patient data in a computational framework can accelerate the identification of more reliable biomarkers, especially for prognostic purposes. This study aims to identify biomarkers that can successfully predict the potential survival time of a cancer patient by integrating the transcriptomic (RNA-Seq), proteomic (RPPA), and protein-protein interaction (PPI) data. The proposed method -RPBioNet- employs a random walk-based algorithm that works on a PPI network to identify a limited number of protein biomarkers. Later, the method uses gene expression measurements of the selected biomarkers to train a classifier for the survival time prediction of patients. RPBioNet was applied to classify kidney renal clear cell carcinoma (KIRC), glioblastoma multiforme (GBM), and lung squamous cell carcinoma (LUSC) patients based on their survival time classes (long- or short-term). The RPBioNet method correctly identified the survival time classes of patients with between 66% and 78% average accuracy for three data sets. RPBioNet operates with only 20 to 50 biomarkers and can achieve on average 6% higher accuracy compared to the closest alternative method, which uses only RNA-Seq data in the biomarker selection. Further analysis of the most predictive biomarkers highlighted genes that are common for both cancer types, as they may be driver proteins responsible for cancer progression. The novelty of this study is the integration of a PPI network with mRNA and protein expression data to identify more accurate prognostic biomarkers that can be used for clinical purposes in the future.
Collapse
Affiliation(s)
- Zerrin Isik
- Computer Engineering Department, Dokuz Eylul Universitesi, 35160, Izmir, Turkey.
| | - Muserref Ece Ercan
- Computer Engineering Department, Dokuz Eylul Universitesi, 35160, Izmir, Turkey
| |
Collapse
|
50
|
|