1
|
Shetty KS, Jose A, Bani M, Vinod PK. Network diffusion-based approach for survival prediction and identification of biomarkers using multi-omics data of papillary renal cell carcinoma. Mol Genet Genomics 2023; 298:871-882. [PMID: 37093328 DOI: 10.1007/s00438-023-02022-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Accepted: 04/12/2023] [Indexed: 04/25/2023]
Abstract
Identification of cancer subtypes based on molecular knowledge is crucial for improving the patient diagnosis, prognosis, and treatment. In this work, we integrated copy number variations (CNVs) and transcriptomic data of Kidney Papillary Renal Cell Carcinoma (KIRP) using a network diffusion strategy to stratify cancers into clinically and biologically relevant subtypes. We constructed GeneNet, a KIRP specific gene expression network from RNA-seq data. The copy number variation data was projected onto GeneNet and propagated on the network for clustering. We identified robust subtypes that are biologically informative and significantly associated with patient survival, tumor stage and clinical subtypes of KIRP. We performed a Singular Value Decomposition (SVD) analysis of KIRP subtypes, which revealed the genes/silent players related to poor survival. A differential gene expression analysis between subtypes showed that genes related to immune, extracellular matrix organization, and genomic instability are upregulated in the poor survival group. Overall, the network-based approach revealed the molecular subtypes of KIRP and captured the relationship between gene expression and CNVs. This framework can be further expanded to integrate other omics data.
Collapse
Affiliation(s)
- Keerthi S Shetty
- Center for Computational Natural Sciences and Bioinformatics, IIIT Hyderabad, Hyderabad, 500032, India
| | - Aswin Jose
- Center for Computational Natural Sciences and Bioinformatics, IIIT Hyderabad, Hyderabad, 500032, India
| | - Mihir Bani
- Center for Computational Natural Sciences and Bioinformatics, IIIT Hyderabad, Hyderabad, 500032, India
| | - P K Vinod
- Center for Computational Natural Sciences and Bioinformatics, IIIT Hyderabad, Hyderabad, 500032, India.
| |
Collapse
|
2
|
Renaux A, Papadimitriou S, Versbraegen N, Nachtegael C, Boutry S, Nowé A, Smits G, Lenaerts T. ORVAL: a novel platform for the prediction and exploration of disease-causing oligogenic variant combinations. Nucleic Acids Res 2020; 47:W93-W98. [PMID: 31147699 PMCID: PMC6602484 DOI: 10.1093/nar/gkz437] [Citation(s) in RCA: 41] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2019] [Revised: 05/01/2019] [Accepted: 05/09/2019] [Indexed: 12/16/2022] Open
Abstract
A tremendous amount of DNA sequencing data is being produced around the world with the ambition to capture in more detail the mechanisms underlying human diseases. While numerous bioinformatics tools exist that allow the discovery of causal variants in Mendelian diseases, little to no support is provided to do the same for variant combinations, an essential task for the discovery of the causes of oligogenic diseases. ORVAL (the Oligogenic Resource for Variant AnaLysis), which is presented here, provides an answer to this problem by focusing on generating networks of candidate pathogenic variant combinations in gene pairs, as opposed to isolated variants in unique genes. This online platform integrates innovative machine learning methods for combinatorial variant pathogenicity prediction with visualization techniques, offering several interactive and exploratory tools, such as pathogenic gene and protein interaction networks, a ranking of pathogenic gene pairs, as well as visual mappings of the cellular location and pathway information. ORVAL is the first web-based exploration platform dedicated to identifying networks of candidate pathogenic variant combinations with the sole ambition to help in uncovering oligogenic causes for patients that cannot rely on the classical disease analysis tools. ORVAL is available at https://orval.ibsquare.be.
Collapse
Affiliation(s)
- Alexandre Renaux
- Interuniversity Institute of Bioinformatics in Brussels, Université Libre de Bruxelles-Vrije Universiteit Brussel, 1050 Brussels, Belgium.,Machine Learning Group, Université Libre de Bruxelles, 1050 Brussels, Belgium.,Artificial Intelligence lab, Vrije Universiteit Brussel, 1050 Brussels, Belgium
| | - Sofia Papadimitriou
- Interuniversity Institute of Bioinformatics in Brussels, Université Libre de Bruxelles-Vrije Universiteit Brussel, 1050 Brussels, Belgium.,Machine Learning Group, Université Libre de Bruxelles, 1050 Brussels, Belgium.,Artificial Intelligence lab, Vrije Universiteit Brussel, 1050 Brussels, Belgium
| | - Nassim Versbraegen
- Interuniversity Institute of Bioinformatics in Brussels, Université Libre de Bruxelles-Vrije Universiteit Brussel, 1050 Brussels, Belgium.,Machine Learning Group, Université Libre de Bruxelles, 1050 Brussels, Belgium
| | - Charlotte Nachtegael
- Interuniversity Institute of Bioinformatics in Brussels, Université Libre de Bruxelles-Vrije Universiteit Brussel, 1050 Brussels, Belgium.,Machine Learning Group, Université Libre de Bruxelles, 1050 Brussels, Belgium
| | - Simon Boutry
- Interuniversity Institute of Bioinformatics in Brussels, Université Libre de Bruxelles-Vrije Universiteit Brussel, 1050 Brussels, Belgium.,Laboratory of Human Molecular Genetics, de Duve Institute, UCLouvain, 1200 Brussels, Belgium
| | - Ann Nowé
- Interuniversity Institute of Bioinformatics in Brussels, Université Libre de Bruxelles-Vrije Universiteit Brussel, 1050 Brussels, Belgium.,Artificial Intelligence lab, Vrije Universiteit Brussel, 1050 Brussels, Belgium
| | - Guillaume Smits
- Interuniversity Institute of Bioinformatics in Brussels, Université Libre de Bruxelles-Vrije Universiteit Brussel, 1050 Brussels, Belgium.,Hôpital Universitaire des Enfants Reine Fabiola, 1020 Brussels, Belgium.,Center of Human Genetics, Hôpital Erasme, 1070 Brussels, Belgium
| | - Tom Lenaerts
- Interuniversity Institute of Bioinformatics in Brussels, Université Libre de Bruxelles-Vrije Universiteit Brussel, 1050 Brussels, Belgium.,Machine Learning Group, Université Libre de Bruxelles, 1050 Brussels, Belgium.,Artificial Intelligence lab, Vrije Universiteit Brussel, 1050 Brussels, Belgium
| |
Collapse
|
3
|
Di Nanni N, Bersanelli M, Milanesi L, Mosca E. Network Diffusion Promotes the Integrative Analysis of Multiple Omics. Front Genet 2020; 11:106. [PMID: 32180795 PMCID: PMC7057719 DOI: 10.3389/fgene.2020.00106] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2019] [Accepted: 01/29/2020] [Indexed: 02/01/2023] Open
Abstract
The development of integrative methods is one of the main challenges in bioinformatics. Network-based methods for the analysis of multiple gene-centered datasets take into account known and/or inferred relations between genes. In the last decades, the mathematical machinery of network diffusion—also referred to as network propagation—has been exploited in several network-based pipelines, thanks to its ability of amplifying association between genes that lie in network proximity. Indeed, network diffusion provides a quantitative estimation of network proximity between genes associated with one or more different data types, from simple binary vectors to real vectors. Therefore, this powerful data transformation method has also been increasingly used in integrative analyses of multiple collections of biological scores and/or one or more interaction networks. We present an overview of the state of the art of bioinformatics pipelines that use network diffusion processes for the integrative analysis of omics data. We discuss the fundamental ways in which network diffusion is exploited, open issues and potential developments in the field. Current trends suggest that network diffusion is a tool of broad utility in omics data analysis. It is reasonable to think that it will continue to be used and further refined as new data types arise (e.g. single cell datasets) and the identification of system-level patterns will be considered more and more important in omics data analysis.
Collapse
Affiliation(s)
- Noemi Di Nanni
- Institute of Biomedical Technologies, National Research Council, Milan, Italy.,Department of Industrial and Information Engineering, University of Pavia, Pavia, Italy
| | - Matteo Bersanelli
- Department of Physics and Astronomy, University of Bologna, Bologna, Italy.,National Institute of Nuclear Physics (INFN), Bologna, Italy
| | - Luciano Milanesi
- Institute of Biomedical Technologies, National Research Council, Milan, Italy
| | - Ettore Mosca
- Institute of Biomedical Technologies, National Research Council, Milan, Italy
| |
Collapse
|
4
|
Blatti C, Emad A, Berry MJ, Gatzke L, Epstein M, Lanier D, Rizal P, Ge J, Liao X, Sobh O, Lambert M, Post CS, Xiao J, Groves P, Epstein AT, Chen X, Srinivasan S, Lehnert E, Kalari KR, Wang L, Weinshilboum RM, Song JS, Jongeneel CV, Han J, Ravaioli U, Sobh N, Bushell CB, Sinha S. Knowledge-guided analysis of "omics" data using the KnowEnG cloud platform. PLoS Biol 2020; 18:e3000583. [PMID: 31971940 PMCID: PMC6977717 DOI: 10.1371/journal.pbio.3000583] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2019] [Accepted: 12/19/2019] [Indexed: 12/19/2022] Open
Abstract
We present Knowledge Engine for Genomics (KnowEnG), a free-to-use computational system for analysis of genomics data sets, designed to accelerate biomedical discovery. It includes tools for popular bioinformatics tasks such as gene prioritization, sample clustering, gene set analysis, and expression signature analysis. The system specializes in "knowledge-guided" data mining and machine learning algorithms, in which user-provided data are analyzed in light of prior information about genes, aggregated from numerous knowledge bases and encoded in a massive "Knowledge Network." KnowEnG adheres to "FAIR" principles (findable, accessible, interoperable, and reuseable): its tools are easily portable to diverse computing environments, run on the cloud for scalable and cost-effective execution, and are interoperable with other computing platforms. The analysis tools are made available through multiple access modes, including a web portal with specialized visualization modules. We demonstrate the KnowEnG system's potential value in democratization of advanced tools for the modern genomics era through several case studies that use its tools to recreate and expand upon the published analysis of cancer data sets.
Collapse
Affiliation(s)
- Charles Blatti
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Amin Emad
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
- Department of Electrical and Computer Engineering, McGill University, Montreal, Canada
| | - Matthew J. Berry
- National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Lisa Gatzke
- National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Milt Epstein
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Daniel Lanier
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Pramod Rizal
- National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Jing Ge
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Xiaoxia Liao
- National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Omar Sobh
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Mike Lambert
- National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Corey S. Post
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Jinfeng Xiao
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Peter Groves
- National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Aidan T. Epstein
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Xi Chen
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Subhashini Srinivasan
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Erik Lehnert
- Seven Bridges Genomics, Charlestown, Massachusetts, United States of America
| | - Krishna R. Kalari
- Department of Health Sciences Research, Mayo Clinic, Rochester, Minnesota, United States of America
| | - Liewei Wang
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic, Rochester, Minnesota, United States of America
| | - Richard M. Weinshilboum
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic, Rochester, Minnesota, United States of America
| | - Jun S. Song
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
- Department of Physics, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
- Cancer Center at Illinois, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - C. Victor Jongeneel
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Jiawei Han
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
- Cancer Center at Illinois, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Umberto Ravaioli
- Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Nahil Sobh
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Colleen B. Bushell
- National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Saurabh Sinha
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
- Cancer Center at Illinois, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
- * E-mail:
| |
Collapse
|
5
|
Zhu J, Seo JE, Wang S, Ashby K, Ballard R, Yu D, Ning B, Agarwal R, Borlak J, Tong W, Chen M. The Development of a Database for Herbal and Dietary Supplement Induced Liver Toxicity. Int J Mol Sci 2018; 19:E2955. [PMID: 30274144 PMCID: PMC6213387 DOI: 10.3390/ijms19102955] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2018] [Revised: 09/10/2018] [Accepted: 09/21/2018] [Indexed: 12/14/2022] Open
Abstract
The growing use of herbal dietary supplements (HDS) in the United States provides compelling evidence for risk of herbal-induced liver injury (HILI). Information on HDS products was retrieved from MedlinePlus of the U.S. National Library of Medicine and the herbal monograph of the European Medicines Agency. The hepatotoxic potential of HDS was ascertained by considering published case reports. Other relevant data were collected from governmental documents, public databases, web sources, and the literature. We collected information for 296 unique HDS products. Evidence of hepatotoxicity was reported for 67, that is 1 in 5, of these HDS products. The database revealed an apparent gender preponderance with women representing 61% of HILI cases. Culprit hepatotoxic HDS were mostly used for weight control, followed by pain and inflammation, mental stress, and mood disorders. Commonly discussed mechanistic events associated with HILI are reactive metabolites and oxidative stress, mitochondrial injury, as well as inhibition of transporters. HDS⁻drug interactions, causing both synergistic and antagonizing effects of drugs, were also reported for certain HDS. The database contains information for nearly 300 commonly used HDS products to provide a single-entry point for better comprehension of their impact on public health.
Collapse
Affiliation(s)
- Jieqiang Zhu
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA.
| | - Ji-Eun Seo
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA.
| | - Sanlong Wang
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA.
- National Center for Safety Evaluation of Drugs, National Institutes for Food and Drug Control, China's State Food and Drug Administration, Beijing 100176, China.
| | - Kristin Ashby
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA.
| | - Rodney Ballard
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA.
| | - Dianke Yu
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA.
| | - Baitang Ning
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA.
| | - Rajiv Agarwal
- Center for Drug Evaluation and Research, U.S. Food and Drug Administration, Silver Spring, MD 20993, USA.
| | - Jürgen Borlak
- Center of Pharmacology and Toxicology, Hannover Medical School, 30625 Hannover, Germany.
| | - Weida Tong
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA.
| | - Minjun Chen
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA.
| |
Collapse
|
6
|
Eason K, Nyamundanda G, Sadanandam A. polyClustR: defining communities of reconciled cancer subtypes with biological and prognostic significance. BMC Bioinformatics 2018; 19:182. [PMID: 29801433 PMCID: PMC5970540 DOI: 10.1186/s12859-018-2204-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2017] [Accepted: 05/14/2018] [Indexed: 11/30/2022] Open
Abstract
Background To ensure cancer patients are stratified towards treatments that are optimally beneficial, it is a priority to define robust molecular subtypes using clustering methods applied to high-dimensional biological data. If each of these methods produces different numbers of clusters for the same data, it is difficult to achieve an optimal solution. Here, we introduce “polyClustR”, a tool that reconciles clusters identified by different methods into subtype “communities” using a hypergeometric test or a measure of relative proportion of common samples. Results The polyClustR pipeline was initially tested using a breast cancer dataset to demonstrate how results are compatible with and add to the understanding of this well-characterised cancer. Two uveal melanoma datasets were then utilised to identify and validate novel subtype communities with significant metastasis-free prognostic differences and associations with known chromosomal aberrations. Conclusion We demonstrate the value of the polyClustR approach of applying multiple consensus clustering algorithms and systematically reconciling the results in identifying novel subtype communities of two cancer types, which nevertheless are compatible with established understanding of these diseases. An R implementation of the pipeline is available at: https://github.com/syspremed/polyClustR Electronic supplementary material The online version of this article (10.1186/s12859-018-2204-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Katherine Eason
- Division of Molecular Pathology, The Institute of Cancer Research (ICR), London, UK
| | - Gift Nyamundanda
- Division of Molecular Pathology, The Institute of Cancer Research (ICR), London, UK.,Centre for Molecular Pathology, Royal Marsden Hospital (RMH), London, UK
| | - Anguraj Sadanandam
- Division of Molecular Pathology, The Institute of Cancer Research (ICR), London, UK. .,Centre for Molecular Pathology, Royal Marsden Hospital (RMH), London, UK.
| |
Collapse
|
7
|
Hong H, Slikker W. Advancing translation of biomarkers into regulatory science. Biomark Med 2016; 9:1043-6. [PMID: 26573514 DOI: 10.2217/bmm.15.104] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Affiliation(s)
- Huixiao Hong
- Division of Bioinformatics & Biostatistics, National Center for Toxicological Research, US Food & Drug Administration, 3900 NCTR Road, Jefferson, AR 72079, USA
| | - William Slikker
- Office of the Director, National Center for Toxicological Research, US Food & Drug Administration, 3900 NCTR Road, Jefferson, AR 72079, USA
| |
Collapse
|