1
|
Choudhury P, Dasgupta S, Bhattacharyya P, Roychowdhury S, Chaudhury K. Understanding pulmonary hypertension: the need for an integrative metabolomics and transcriptomics approach. Mol Omics 2024; 20:366-389. [PMID: 38853716 DOI: 10.1039/d3mo00266g] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/11/2024]
Abstract
Pulmonary hypertension (PH), characterised by mean pulmonary arterial pressure (mPAP) >20 mm Hg at rest, is a complex pathophysiological disorder associated with multiple clinical conditions. The high prevalence of the disease along with increased mortality and morbidity makes it a global health burden. Despite major advances in understanding the disease pathophysiology, much of the underlying complex molecular mechanism remains to be elucidated. Lack of a robust diagnostic test and specific therapeutic targets also poses major challenges. This review provides a comprehensive update on the dysregulated pathways and promising candidate markers identified in PH patients using the transcriptomics and metabolomics approach. The review also highlights the need of using an integrative multi-omics approach for obtaining insight into the disease at a molecular level. The integrative multi-omics/pan-omics approach envisaged to help in bridging the gap from genotype to phenotype is outlined. Finally, the challenges commonly encountered while conducting omics-driven studies are also discussed.
Collapse
Affiliation(s)
- Priyanka Choudhury
- School of Medical Science and Technology, Indian Institute of Technology Kharagpur, Kharagpur, 721302, West Bengal, India.
| | - Sanjukta Dasgupta
- Department of Biotechnology, Brainware University, Barasat, West Bengal, India
| | | | | | - Koel Chaudhury
- School of Medical Science and Technology, Indian Institute of Technology Kharagpur, Kharagpur, 721302, West Bengal, India.
| |
Collapse
|
2
|
Düz E, Çakır T. Effect of RNA-Seq data normalization on protein interactome mapping for Alzheimer's disease. Comput Biol Chem 2024; 109:108028. [PMID: 38377697 DOI: 10.1016/j.compbiolchem.2024.108028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2023] [Revised: 02/01/2024] [Accepted: 02/04/2024] [Indexed: 02/22/2024]
Abstract
High throughput RNA sequencing brings new perspective to the elucidation of molecular mechanisms of diseases. Normalization is the first and most important step for RNA-Seq data, and it can differ based on the purpose of the analysis. Within-sample normalization methods (eg. TPM) are preferred when genes in a sample are compared with each other, and between-sample normalization methods (eg. deseq2, TMM, Voom) are used when the samples in a dataset are compared. Normalization approaches rescale the data, and, therefore, they affect the results of the analysis. Here, we selected two most commonly used Alzheimer's disease RNA-Seq datasets from ROSMAP and Mayo Clinic cohorts and mapped the differentially expressed genes on human protein interactome to discover disease-specific subnetworks. To this end, the raw count data were first processed with four different, commonly used RNA-Seq normalization methods (deseq2, TMM, Voom and TPM). Then, covariate adjustment was applied to the normalized data for gender, age of death and post-mortem interval. Each normalized dataset was separately mapped on the human protein-protein interaction network either in covariate-adjusted or non-adjusted form. Capturing known Alzheimer's disease genes and genes associated with the disease-related functional terms in the discovered subnetworks were the criteria to compare different normalization methods. Based on our results, applying covariate adjustment has a positive effect on normalization by removing the confounder effects. Covariate-adjusted TMM and covariate-adjusted deseq2 methods performed better in both transcriptome datasets.
Collapse
Affiliation(s)
- Elif Düz
- Department of Bioengineering, Gebze Technical University, Gebze, Kocaeli, 41400, Turkey
| | - Tunahan Çakır
- Department of Bioengineering, Gebze Technical University, Gebze, Kocaeli, 41400, Turkey.
| |
Collapse
|
3
|
Banerjee U, Chedere A, Padaki R, Mohan A, Sambaturu N, Singh A, Chandra N. PathTracer Comprehensively Identifies Hypoxia-Induced Dormancy Adaptations in Mycobacterium tuberculosis. J Chem Inf Model 2023; 63:6156-6167. [PMID: 37756209 DOI: 10.1021/acs.jcim.3c00845] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/29/2023]
Abstract
Mining large-scale data to discover biologically relevant information remains a challenge despite the rapid development of bioinformatics tools. Here, we have developed a new tool, PathTracer, to identify biologically relevant information flows by mining genome-wide protein-protein interaction networks following integration of gene expression data. PathTracer successfully mines interactions between genes and traces the most perturbed paths of perceived activities under the conditions of the study. We further demonstrated the utility of this tool by identifying adaptation mechanisms of hypoxia-induced dormancy in Mycobacterium tuberculosis (Mtb).
Collapse
Affiliation(s)
- Ushashi Banerjee
- Department of Biochemistry, Indian Institute of Science, Bangalore 560012, Karnataka, India
| | - Adithya Chedere
- Department of Biochemistry, Indian Institute of Science, Bangalore 560012, Karnataka, India
| | - Raksha Padaki
- Department of Biochemistry, Indian Institute of Science, Bangalore 560012, Karnataka, India
| | - Abhilash Mohan
- Department of Biochemistry, Indian Institute of Science, Bangalore 560012, Karnataka, India
| | - Narmada Sambaturu
- IISc Mathematics Initiative, Indian Institute of Science, Bangalore 560012, Karnataka, India
| | - Amit Singh
- Center for Infectious Disease Research, Indian Institute of Science, Bangalore 560012, Karnataka, India
| | - Nagasuma Chandra
- Department of Biochemistry, Indian Institute of Science, Bangalore 560012, Karnataka, India
- IISc Mathematics Initiative, Indian Institute of Science, Bangalore 560012, Karnataka, India
- BioSystems Science and Engineering, Indian Institute of Science, Bangalore 560012, Karnataka, India
| |
Collapse
|
4
|
Mogilenko DA, Sergushichev A, Artyomov MN. Systems Immunology Approaches to Metabolism. Annu Rev Immunol 2023; 41:317-342. [PMID: 37126419 DOI: 10.1146/annurev-immunol-101220-031513] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
Over the last decade, immunometabolism has emerged as a novel interdisciplinary field of research and yielded significant fundamental insights into the regulation of immune responses. Multiple classical approaches to interrogate immunometabolism, including bulk metabolic profiling and analysis of metabolic regulator expression, paved the way to appreciating the physiological complexity of immunometabolic regulation in vivo. Studying immunometabolism at the systems level raised the need to transition towards the next-generation technology for metabolic profiling and analysis. Spatially resolved metabolic imaging and computational algorithms for multi-modal data integration are new approaches to connecting metabolism and immunity. In this review, we discuss recent studies that highlight the complex physiological interplay between immune responses and metabolism and give an overview of technological developments that bear the promise of capturing this complexity most directly and comprehensively.
Collapse
Affiliation(s)
- Denis A Mogilenko
- Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, Missouri, USA; ,
- Current affiliation: Department of Medicine, Department of Pathology, Microbiology, and Immunology, and Vanderbilt Center for Immunobiology, Vanderbilt University Medical Center, Nashville, Tennessee, USA;
| | - Alexey Sergushichev
- Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, Missouri, USA; ,
- Computer Technologies Laboratory, ITMO University, Saint Petersburg, Russia
| | - Maxim N Artyomov
- Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, Missouri, USA; ,
| |
Collapse
|
5
|
Yiğit EN, Sönmez E, Yüksel İ, Aksan Kurnaz I, Çakır T. A transcriptome based approach to predict candidate drug targets and drugs for Parkinson's disease using an in vitro 6-OHDA model. Mol Omics 2023; 19:218-228. [PMID: 36723117 DOI: 10.1039/d2mo00267a] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
The most common treatment strategies for Parkinson's disease (PD) aim to slow down the neurodegeneration process or control the symptoms. In this study, using an in vitro PD model we carried out a transcriptome-based drug target prediction strategy. We identified novel drug target candidates by mapping genes upregulated in 6-OHDA-treated cells on a human protein-protein interaction network. Among the predicted targets, we show that AKR1C3 and CEBPB are promising in validating our bioinformatics approach since their known ligands, rutin and quercetin, respectively, act as neuroprotective drugs that effectively decrease cell death, and restore the expression profiles of key genes upregulated in 6-OHDA-treated cells. We also show that these two genes upregulated in our in vitro PD model are downregulated to basal levels upon drug administration. As a further validation of our methodology, we further confirm that the potential target genes identified with our bioinformatics approach are also upregulated in post-mortem transcriptome samples of PD patients from the literature. Therefore, we propose that this methodology predicts novel drug targets AKR1C3 and CEBPB, which are relevant to future clinical applications as potential drug repurposing targets for PD. Our systems-based computational approach to predict candidate drug targets can be employed in identifying novel drug targets in other diseases without a priori assumption.
Collapse
Affiliation(s)
- Esra Nur Yiğit
- Institute of Biotechnology, Gebze Technical University, 41400, Gebze, Kocaeli, Turkey.,Research Institute for Health Sciences and Technologies (SABITA), İstanbul Medipol University, İstanbul, Turkey
| | - Ekin Sönmez
- Institute of Biotechnology, Gebze Technical University, 41400, Gebze, Kocaeli, Turkey
| | - İsa Yüksel
- Department of Bioengineering, Gebze Technical University, 41400, Gebze, Kocaeli, Turkey.
| | - Işıl Aksan Kurnaz
- Institute of Biotechnology, Gebze Technical University, 41400, Gebze, Kocaeli, Turkey.,Department of Molecular Biology and Genetics, Gebze Technical University, 41400, Gebze, Kocaeli, Turkey
| | - Tunahan Çakır
- Department of Bioengineering, Gebze Technical University, 41400, Gebze, Kocaeli, Turkey.
| |
Collapse
|
6
|
Short Linear Motifs in Colorectal Cancer Interactome and Tumorigenesis. Cells 2022; 11:cells11233739. [PMID: 36496998 PMCID: PMC9737320 DOI: 10.3390/cells11233739] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2022] [Revised: 11/16/2022] [Accepted: 11/21/2022] [Indexed: 11/25/2022] Open
Abstract
Colorectal tumorigenesis is driven by alterations in genes and proteins responsible for cancer initiation, progression, and invasion. This multistage process is based on a dense network of protein-protein interactions (PPIs) that become dysregulated as a result of changes in various cell signaling effectors. PPIs in signaling and regulatory networks are known to be mediated by short linear motifs (SLiMs), which are conserved contiguous regions of 3-10 amino acids within interacting protein domains. SLiMs are the minimum sequences required for modulating cellular PPI networks. Thus, several in silico approaches have been developed to predict and analyze SLiM-mediated PPIs. In this review, we focus on emerging evidence supporting a crucial role for SLiMs in driver pathways that are disrupted in colorectal cancer (CRC) tumorigenesis and related PPI network alterations. As a result, SLiMs, along with short peptides, are attracting the interest of researchers to devise small molecules amenable to be used as novel anti-CRC targeted therapies. Overall, the characterization of SLiMs mediating crucial PPIs in CRC may foster the development of more specific combined pharmacological approaches.
Collapse
|
7
|
Winkler S, Winkler I, Figaschewski M, Tiede T, Nordheim A, Kohlbacher O. De novo identification of maximally deregulated subnetworks based on multi-omics data with DeRegNet. BMC Bioinformatics 2022; 23:139. [PMID: 35439941 PMCID: PMC9020058 DOI: 10.1186/s12859-022-04670-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2021] [Accepted: 03/29/2022] [Indexed: 12/14/2022] Open
Abstract
Background With a growing amount of (multi-)omics data being available, the extraction of knowledge from these datasets is still a difficult problem. Classical enrichment-style analyses require predefined pathways or gene sets that are tested for significant deregulation to assess whether the pathway is functionally involved in the biological process under study. De novo identification of these pathways can reduce the bias inherent in predefined pathways or gene sets. At the same time, the definition and efficient identification of these pathways de novo from large biological networks is a challenging problem. Results We present a novel algorithm, DeRegNet, for the identification of maximally deregulated subnetworks on directed graphs based on deregulation scores derived from (multi-)omics data. DeRegNet can be interpreted as maximum likelihood estimation given a certain probabilistic model for de-novo subgraph identification. We use fractional integer programming to solve the resulting combinatorial optimization problem. We can show that the approach outperforms related algorithms on simulated data with known ground truths. On a publicly available liver cancer dataset we can show that DeRegNet can identify biologically meaningful subgraphs suitable for patient stratification. DeRegNet can also be used to find explicitly multi-omics subgraphs which we demonstrate by presenting subgraphs with consistent methylation-transcription patterns. DeRegNet is freely available as open-source software. Conclusion The proposed algorithmic framework and its available implementation can serve as a valuable heuristic hypothesis generation tool contextualizing omics data within biomolecular networks.
Collapse
Affiliation(s)
- Sebastian Winkler
- Applied Bioinformatics, Department of Computer Science, University of Tuebingen, Tübingen, Germany. .,International Max Planck Research School (IMPRS) "From Molecules to Organism", Tübingen, Germany.
| | - Ivana Winkler
- International Max Planck Research School (IMPRS) "From Molecules to Organism", Tübingen, Germany.,Interfaculty Institute for Cell Biology (IFIZ), University of Tuebingen, Tübingen, Germany.,German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Mirjam Figaschewski
- Applied Bioinformatics, Department of Computer Science, University of Tuebingen, Tübingen, Germany
| | - Thorsten Tiede
- Applied Bioinformatics, Department of Computer Science, University of Tuebingen, Tübingen, Germany
| | - Alfred Nordheim
- Interfaculty Institute for Cell Biology (IFIZ), University of Tuebingen, Tübingen, Germany.,Leibniz Institute on Aging (FLI), Jena, Germany
| | - Oliver Kohlbacher
- Applied Bioinformatics, Department of Computer Science, University of Tuebingen, Tübingen, Germany.,Institute for Bioinformatics and Medical Informatics, University of Tuebingen, Tübingen, Germany.,Translational Bioinformatics, University Hospital Tuebingen, Tübingen, Germany
| |
Collapse
|
8
|
Mechteridis K, Lauber M, Baumbach J, List M. KeyPathwayMineR: De Novo Pathway Enrichment in the R Ecosystem. Front Genet 2022; 12:812853. [PMID: 35173764 PMCID: PMC8842393 DOI: 10.3389/fgene.2021.812853] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2021] [Accepted: 12/24/2021] [Indexed: 11/13/2022] Open
Abstract
De novo pathway enrichment is a systems biology approach in which OMICS data are projected onto a molecular interaction network to identify subnetworks representing condition-specific functional modules and molecular pathways. Compared to classical pathway enrichment analysis methods, de novo pathway enrichment is not limited to predefined lists of pathways from (curated) databases and thus particularly suited for discovering novel disease mechanisms. While several tools have been proposed for pathway enrichment, the integration of de novo pathway enrichment in end-to-end OMICS analysis workflows in the R programming language is currently limited to a single tool. To close this gap, we have implemented an R package KeyPathwayMineR (KPM-R). The package extends the features and usability of existing versions of KeyPathwayMiner by leveraging the power, flexibility and versatility of R and by providing various novel functionalities for performing data preparation, visualization, and comparison. In addition, thanks to its interoperability with a plethora of existing R packages in e.g., Bioconductor, CRAN, and GitHub, KPM-R allows carrying out the initial preparation of the datasets and to meaningfully interpret the extracted subnetworks. To demonstrate the package's potential, KPM-R was applied to bulk RNA-Seq data of nasopharyngeal swabs from SARS-CoV-2 infected individuals, and on single cell RNA-Seq data of aging mice tissue from the Tabula Muris Senis atlas.
Collapse
Affiliation(s)
- Konstantinos Mechteridis
- Chair of Experimental Bioinformatics, TUM School of Life Sciences, Technical University of Munich, Munich, Germany
| | - Michael Lauber
- Chair of Experimental Bioinformatics, TUM School of Life Sciences, Technical University of Munich, Munich, Germany
| | - Jan Baumbach
- Chair of Computational Systems Biology, University of Hamburg, Hamburg, Germany.,Department of Mathematics and Computer Science, Faculty of Science, University of Southern Denmark, Odense, Denmark
| | - Markus List
- Chair of Experimental Bioinformatics, TUM School of Life Sciences, Technical University of Munich, Munich, Germany
| |
Collapse
|
9
|
Alveolar Regeneration in COVID-19 Patients: A Network Perspective. Int J Mol Sci 2021; 22:ijms222011279. [PMID: 34681944 PMCID: PMC8538208 DOI: 10.3390/ijms222011279] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Revised: 10/13/2021] [Accepted: 10/14/2021] [Indexed: 12/12/2022] Open
Abstract
A viral infection involves entry and replication of viral nucleic acid in a host organism, subsequently leading to biochemical and structural alterations in the host cell. In the case of SARS-CoV-2 viral infection, over-activation of the host immune system may lead to lung damage. Albeit the regeneration and fibrotic repair processes being the two protective host responses, prolonged injury may lead to excessive fibrosis, a pathological state that can result in lung collapse. In this review, we discuss regeneration and fibrosis processes in response to SARS-CoV-2 and provide our viewpoint on the triggering of alveolar regeneration in coronavirus disease 2019 (COVID-19) patients.
Collapse
|
10
|
Maden SF, Acuner SE. Mapping Transcriptome Data to Protein-Protein Interaction Networks of Inflammatory Bowel Diseases Reveals Disease-Specific Subnetworks. Front Genet 2021; 12:688447. [PMID: 34484291 PMCID: PMC8416454 DOI: 10.3389/fgene.2021.688447] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Accepted: 07/19/2021] [Indexed: 12/31/2022] Open
Abstract
Inflammatory bowel disease (IBD) is the common name for chronic disorders associated with the inflammation of the gastrointestinal tract. IBD is triggered by environmental factors in genetically susceptible individuals and has a significant number of incidences worldwide. Crohn’s disease (CD) and ulcerative colitis (UC) are the two distinct types of IBD. While involvement in ulcerative colitis is limited to the colon, Crohn’s disease may involve the whole gastrointestinal tract. Although these two disorders differ in macroscopic inflammation patterns, they share various molecular pathogenesis, yet the diagnosis can remain unclear, and it is important to reveal their molecular signatures in the network level. Improved molecular understanding may reveal disease type-specific and even individual-specific targets. To this aim, we determine the subnetworks specific to UC and CD by mapping transcriptome data to protein–protein interaction (PPI) networks using two different approaches [KeyPathwayMiner (KPM) and stringApp] and perform the functional enrichment analysis of the resulting disease type-specific subnetworks. TP63 was identified as the hub gene in the UC-specific subnet and p63 tumor protein, being in the same family as p53 and p73, has been studied in literature for the risk associated with colorectal cancer and IBD. APP was identified as the hub gene in the CD-specific subnet, and it has an important role in the pathogenesis of Alzheimer’s disease (AD). This relation suggests that some similar genetic factors may be effective in both AD and CD. Last, in order to understand the biological meaning of these disease-specific subnets, they were functionally enriched. It is important to note that chemokines—special types of cytokines—and antibacterial response are important in UC-specific subnets, whereas cytokines and antimicrobial responses as well as cancer-related pathways are important in CD-specific subnets. Overall, these findings reveal the differences between IBD subtypes at the molecular level and can facilitate diagnosis for UC and CD as well as provide potential molecular targets that are specific to disease subtypes.
Collapse
Affiliation(s)
- Sefika Feyza Maden
- Department of Bioengineering, Istanbul Medeniyet University, Istanbul, Turkey
| | - Saliha Ece Acuner
- Department of Bioengineering, Istanbul Medeniyet University, Istanbul, Turkey
| |
Collapse
|
11
|
Köhler N, Rose TD, Falk L, Pauling JK. Investigating Global Lipidome Alterations with the Lipid Network Explorer. Metabolites 2021; 11:metabo11080488. [PMID: 34436429 PMCID: PMC8398636 DOI: 10.3390/metabo11080488] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Revised: 07/26/2021] [Accepted: 07/27/2021] [Indexed: 11/18/2022] Open
Abstract
Lipids play an important role in biological systems and have the potential to serve as biomarkers in medical applications. Advances in lipidomics allow identification of hundreds of lipid species from biological samples. However, a systems biological analysis of the lipidome, by incorporating pathway information remains challenging, leaving lipidomics behind compared to other omics disciplines. An especially uncharted territory is the integration of statistical and network-based approaches for studying global lipidome changes. Here we developed the Lipid Network Explorer (LINEX), a web-tool addressing this gap by providing a way to visualize and analyze functional lipid metabolic networks. It utilizes metabolic rules to match biochemically connected lipids on a species level and combine it with a statistical correlation and testing analysis. Researchers can customize the biochemical rules considered, to their tissue or organism specific analysis and easily share them. We demonstrate the benefits of combining network-based analyses with statistics using publicly available lipidomics data sets. LINEX facilitates a biochemical knowledge-based data analysis for lipidomics. It is availableas a web-application and as a publicly available docker container.
Collapse
|
12
|
Emanetci E, Çakır T. Network-Based Analysis of Cognitive Impairment and Memory Deficits from Transcriptome Data. J Mol Neurosci 2021; 71:2415-2428. [PMID: 33713319 DOI: 10.1007/s12031-021-01807-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2020] [Accepted: 02/01/2021] [Indexed: 12/12/2022]
Abstract
Aging is an inevitable process that negatively affects all living organisms and their vital functions. The brain is one of the most important organs in living beings and is primarily impacted by aging. The molecular mechanisms of learning, memory and cognition are altered over time, and the impairment in these mechanisms can lead to neurodegenerative diseases. Transcriptomics can be used to study these impairments to acquire more detailed information on the affected molecular mechanisms. Here we analyzed learning- and memory-related transcriptome data by mapping it on the organism-specific protein-protein interactome network. Subnetwork discovery algorithms were applied to discover highly dysregulated subnetworks, which were complemented with co-expression-based interactions. The functional analysis shows that the identified subnetworks are enriched with genes having roles in synaptic plasticity, gliogenesis, neurogenesis and cognition, which are reported to be related to memory and learning. With a detailed analysis, we show that the results from different subnetwork discovery algorithms or from different transcriptomic datasets can be successfully reconciled, leading to a memory-learning network that sheds light on the molecular mechanisms behind aging and memory-related impairments.
Collapse
Affiliation(s)
- Elif Emanetci
- Department of Bioengineering, Gebze Technical University, 41400, Gebze, Kocaeli, Turkey
| | - Tunahan Çakır
- Department of Bioengineering, Gebze Technical University, 41400, Gebze, Kocaeli, Turkey.
| |
Collapse
|
13
|
Wörheide MA, Krumsiek J, Kastenmüller G, Arnold M. Multi-omics integration in biomedical research - A metabolomics-centric review. Anal Chim Acta 2021; 1141:144-162. [PMID: 33248648 PMCID: PMC7701361 DOI: 10.1016/j.aca.2020.10.038] [Citation(s) in RCA: 101] [Impact Index Per Article: 33.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2020] [Revised: 10/09/2020] [Accepted: 10/19/2020] [Indexed: 02/07/2023]
Abstract
Recent advances in high-throughput technologies have enabled the profiling of multiple layers of a biological system, including DNA sequence data (genomics), RNA expression levels (transcriptomics), and metabolite levels (metabolomics). This has led to the generation of vast amounts of biological data that can be integrated in so-called multi-omics studies to examine the complex molecular underpinnings of health and disease. Integrative analysis of such datasets is not straightforward and is particularly complicated by the high dimensionality and heterogeneity of the data and by the lack of universal analysis protocols. Previous reviews have discussed various strategies to address the challenges of data integration, elaborating on specific aspects, such as network inference or feature selection techniques. Thereby, the main focus has been on the integration of two omics layers in their relation to a phenotype of interest. In this review we provide an overview over a typical multi-omics workflow, focusing on integration methods that have the potential to combine metabolomics data with two or more omics. We discuss multiple integration concepts including data-driven, knowledge-based, simultaneous and step-wise approaches. We highlight the application of these methods in recent multi-omics studies, including large-scale integration efforts aiming at a global depiction of the complex relationships within and between different biological layers without focusing on a particular phenotype.
Collapse
Affiliation(s)
- Maria A Wörheide
- Institute of Computational Biology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
| | - Jan Krumsiek
- Institute for Computational Biomedicine, Englander Institute for Precision Medicine, Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA
| | - Gabi Kastenmüller
- Institute of Computational Biology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany; German Center for Diabetes Research (DZD), Neuherberg, Germany
| | - Matthias Arnold
- Institute of Computational Biology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany; Department of Psychiatry and Behavioral Sciences, Duke University, Durham, NC, USA.
| |
Collapse
|
14
|
Soerensen M, Hozakowska-Roszkowska DM, Nygaard M, Larsen MJ, Schwämmle V, Christensen K, Christiansen L, Tan Q. A Genome-Wide Integrative Association Study of DNA Methylation and Gene Expression Data and Later Life Cognitive Functioning in Monozygotic Twins. Front Neurosci 2020; 14:233. [PMID: 32327964 PMCID: PMC7160301 DOI: 10.3389/fnins.2020.00233] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2019] [Accepted: 03/02/2020] [Indexed: 12/02/2022] Open
Abstract
Monozygotic twins are genetically identical but rarely phenotypically identical. Epigenetic and transcriptional variation could influence this phenotypic discordance. Investigation of intra-pair differences in molecular markers and a given phenotype in monozygotic twins controls most of the genetic contribution, enabling studies of the molecular features of the phenotype. This study aimed to identify genes associated with cognition in later life using integrated enrichment analyses of the results of blood-derived intra-pair epigenome-wide and transcriptome-wide association analyses of cognition in 452 middle-aged and old-aged monozygotic twins (56–80 years). Integrated analyses were performed with an unsupervised approach using KeyPathwayMiner, and a supervised approach using the KEGG and Reactome databases. The supervised approach identified several enriched gene sets, including “neuroactive ligand receptor interaction” (p-value = 1.62∗10-2), “Neurotrophin signaling” (p-value = 2.52∗10-3), “Alzheimer’s disease” (p-value = 1.20∗10-2), and “long-term depression” (p-value = 1.62∗10-2). The unsupervised approach resulted in a 238 gene network, including the Alzheimer’s disease gene APP (Amyloid Beta Precursor Protein) as an exception node, and several novel candidate genes. The strength of the unsupervised method is that it can reveal previously uncharacterized sub-pathways and detect interplay between biological processes, which remain undetected by the current supervised methods. In conclusion, this study identified several previously reported cognition genes and pathways and, additionally, puts forward novel candidates for further verification and validation.
Collapse
Affiliation(s)
- Mette Soerensen
- Epidemiology, Biostatistics and Biodemography, Department of Public Health, University of Southern Denmark, Odense, Denmark.,Center for Individualized Medicine in Arterial Diseases, Department of Clinical Biochemistry and Pharmacology, Odense University Hospital, Odense, Denmark.,Department of Clinical Genetics, Odense University Hospital, Odense, Denmark
| | - Dominika Marzena Hozakowska-Roszkowska
- Epidemiology, Biostatistics and Biodemography, Department of Public Health, University of Southern Denmark, Odense, Denmark.,Department of Biochemistry and Molecular Biology, University of Southern Denmark, Odense, Denmark
| | - Marianne Nygaard
- Epidemiology, Biostatistics and Biodemography, Department of Public Health, University of Southern Denmark, Odense, Denmark.,Department of Clinical Genetics, Odense University Hospital, Odense, Denmark
| | - Martin J Larsen
- Department of Clinical Genetics, Odense University Hospital, Odense, Denmark.,Human Genetics, Department of Clinical Research, University of Southern Denmark, Odense, Denmark
| | - Veit Schwämmle
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, Odense, Denmark
| | - Kaare Christensen
- Epidemiology, Biostatistics and Biodemography, Department of Public Health, University of Southern Denmark, Odense, Denmark.,Department of Clinical Genetics, Odense University Hospital, Odense, Denmark.,Department of Clinical Biochemistry and Pharmacology, Odense University Hospital, Odense, Denmark
| | - Lene Christiansen
- Epidemiology, Biostatistics and Biodemography, Department of Public Health, University of Southern Denmark, Odense, Denmark.,Department of Clinical Immunology, Copenhagen University Hospital, Rigshospitalet, Denmark
| | - Qihua Tan
- Epidemiology, Biostatistics and Biodemography, Department of Public Health, University of Southern Denmark, Odense, Denmark.,Department of Clinical Genetics, Odense University Hospital, Odense, Denmark.,Human Genetics, Department of Clinical Research, University of Southern Denmark, Odense, Denmark
| |
Collapse
|
15
|
Abstract
Biomolecular networks such as protein-protein interaction networks provide a static picture of the interplay of genes and their products, and, consequently, they fail to capture dynamic changes taking place during the development of complex diseases. KeyPathwayMiner is a software platform designed to fill this gap by integrating previous knowledge captured in molecular interaction networks with OMICS datasets (DNA microarrays, RNA sequencing, genome-wide methylation studies, etc.) to extract connected subnetworks with a high number of deregulated genes. This protocol describes how to use KeyPathwayMiner for integrated analysis of multi-omics datasets in the network analysis tool Cytoscape and in a stand-alone web application available at https://keypathwayminer.compbio.sdu.dk .
Collapse
Affiliation(s)
- Nicolas Alcaraz
- Department of Biology, The Bioinformatics Centre, University of Copenhagen, Copenhagen, Denmark
| | - Anne Hartebrodt
- TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Markus List
- TUM School of Life Sciences, Technical University of Munich, Freising, Germany.
| |
Collapse
|
16
|
Abstract
Network-based approach is rapidly emerging as a promising strategy to integrate and interpret different -omics datasets, including metabolomics. The first section of this chapter introduces the current progresses and main concepts in multi-omics integration. The second section provides an overview of the public resources available for creation of biological networks. The third section describes three common application scenarios including subnetwork identification, network-based enrichment analysis, and systems metabolomics. The section four introduces the concept of hierarchical community network analysis. The section five discusses different tools for network visualization. The chapter ends with a future perspective on multi-omics integration.
Collapse
Affiliation(s)
- Guangyan Zhou
- Institute of Parasitology, McGill University, Montreal, QC, Canada
| | - Shuzhao Li
- Department of Medicine, Emory University School of Medicine, Atlanta, GA, USA
| | - Jianguo Xia
- Institute of Parasitology, McGill University, Montreal, QC, Canada. .,Department of Animal Science, McGill University, Montreal, QC, Canada. .,Department of Microbiology and Immunology, McGill University, Montreal, QC, Canada. .,Department of Human Genetics, McGill University, Montreal, QC, Canada.
| |
Collapse
|
17
|
Elkjaer ML, Frisch T, Reynolds R, Kacprowski T, Burton M, Kruse TA, Thomassen M, Baumbach J, Illes Z. Molecular signature of different lesion types in the brain white matter of patients with progressive multiple sclerosis. Acta Neuropathol Commun 2019; 7:205. [PMID: 31829262 PMCID: PMC6907342 DOI: 10.1186/s40478-019-0855-7] [Citation(s) in RCA: 44] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2019] [Accepted: 11/25/2019] [Indexed: 12/21/2022] Open
Abstract
To identify pathogenetic markers and potential drivers of different lesion types in the white matter (WM) of patients with progressive multiple sclerosis (PMS), we sequenced RNA from 73 different WM areas. Compared to 25 WM controls, 6713 out of 18,609 genes were significantly differentially expressed in MS tissues (FDR < 0.05). A computational systems medicine analysis was performed to describe the MS lesion endophenotypes. The cellular source of specific molecules was examined by RNAscope, immunohistochemistry, and immunofluorescence. To examine common lesion specific mechanisms, we performed de novo network enrichment based on shared differentially expressed genes (DEGs), and found TGFβ-R2 as a central hub. RNAscope revealed astrocytes as the cellular source of TGFβ-R2 in remyelinating lesions. Since lesion-specific unique DEGs were more common than shared signatures, we examined lesion-specific pathways and de novo networks enriched with unique DEGs. Such network analysis indicated classic inflammatory responses in active lesions; catabolic and heat shock protein responses in inactive lesions; neuronal/axonal specific processes in chronic active lesions. In remyelinating lesions, de novo analyses identified axonal transport responses and adaptive immune markers, which was also supported by the most heterogeneous immunoglobulin gene expression. The signature of the normal-appearing white matter (NAWM) was more similar to control WM than to lesions: only 465 DEGs differentiated NAWM from controls, and 16 were unique. The upregulated marker CD26/DPP4 was expressed by microglia in the NAWM but by mononuclear cells in active lesions, which may indicate a special subset of microglia before the lesion develops, but also emphasizes that omics related to MS lesions should be interpreted in the context of different lesions types. While chronic active lesions were the most distinct from control WM based on the highest number of unique DEGs (n = 2213), remyelinating lesions had the highest gene expression levels, and the most different molecular map from chronic active lesions. This may suggest that these two lesion types represent two ends of the spectrum of lesion evolution in PMS. The profound changes in chronic active lesions, the predominance of synaptic/neural/axonal signatures coupled with minor inflammation may indicate end-stage irreversible molecular events responsible for this less treatable phase.
Collapse
|
18
|
Mapping a Transcriptome-Guided Arabidopsis SAM Interactome. Methods Mol Biol 2019. [PMID: 31797296 DOI: 10.1007/978-1-0716-0183-9_12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register]
Abstract
The advent of multi-OMICS approaches has a significant impact on the investigation of biological processes occurring in plants. RNA-SEQ, cellular proteomics, and metabolomics have added a considerable ease in studying the dynamics of stem cell niches. New cell sorting approaches coupled with the labeling of stem cell population specific marker genes are highly instrumental in enriching distinct cellular populations for various types of analysis. One more promising field of OMICS is the mapping of cellular interactomes. The plant stem cells research is barely profited from this newly emerging field of OMICS. Generation of stem cell/niche-specific interactome is a time-consuming and labor-intensive task. Here, we describe a method on how to assemble a SAM-based interactome after using the available generic Arabidopsis interactomes. To define the context of SAM in a generic interactome, we used SAM cell population transcriptome datasets. Our step-by-step protocol can easily be adopted for other stem cell niches such as RAM and lateral meristems keeping in view the availability of transcriptome datasets for cellular populations of these niches.
Collapse
|
19
|
Corwin T, Woodsmith J, Apelt F, Fontaine JF, Meierhofer D, Helmuth J, Grossmann A, Andrade-Navarro MA, Ballif BA, Stelzl U. Defining Human Tyrosine Kinase Phosphorylation Networks Using Yeast as an In Vivo Model Substrate. Cell Syst 2019; 5:128-139.e4. [PMID: 28837810 DOI: 10.1016/j.cels.2017.08.001] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2016] [Revised: 05/02/2017] [Accepted: 08/03/2017] [Indexed: 12/13/2022]
Abstract
Systematic assessment of tyrosine kinase-substrate relationships is fundamental to a better understanding of cellular signaling and its profound alterations in human diseases such as cancer. In human cells, such assessments are confounded by complex signaling networks, feedback loops, conditional activity, and intra-kinase redundancy. Here we address this challenge by exploiting the yeast proteome as an in vivo model substrate. We individually expressed 16 human non-receptor tyrosine kinases (NRTKs) in Saccharomyces cerevisiae and identified 3,279 kinase-substrate relationships involving 1,351 yeast phosphotyrosine (pY) sites. Based on the yeast data without prior information, we generated a set of linear kinase motifs and assigned ∼1,300 known human pY sites to specific NRTKs. Furthermore, experimentally defined pY sites for each individual kinase were shown to cluster within the yeast interactome network irrespective of linear motif information. We therefore applied a network inference approach to predict kinase-substrate relationships for more than 3,500 human proteins, providing a resource to advance our understanding of kinase biology.
Collapse
Affiliation(s)
- Thomas Corwin
- Otto-Warburg Laboratory, Max-Planck Institute for Molecular Genetics (MPIMG), 14195 Berlin, Germany
| | - Jonathan Woodsmith
- Otto-Warburg Laboratory, Max-Planck Institute for Molecular Genetics (MPIMG), 14195 Berlin, Germany; Institute of Pharmaceutical Sciences, University of Graz and BioTechMed-Graz, 8010 Graz, Austria
| | - Federico Apelt
- Otto-Warburg Laboratory, Max-Planck Institute for Molecular Genetics (MPIMG), 14195 Berlin, Germany
| | - Jean-Fred Fontaine
- Genomics and Computational Biology, Kernel Press UG, 55128 Mainz, Germany; Faculty of Biology, Johannes Gutenberg University Mainz and Institute of Molecular Biology, 55128 Mainz, Germany
| | - David Meierhofer
- Otto-Warburg Laboratory, Max-Planck Institute for Molecular Genetics (MPIMG), 14195 Berlin, Germany
| | - Johannes Helmuth
- Otto-Warburg Laboratory, Max-Planck Institute for Molecular Genetics (MPIMG), 14195 Berlin, Germany
| | - Arndt Grossmann
- Otto-Warburg Laboratory, Max-Planck Institute for Molecular Genetics (MPIMG), 14195 Berlin, Germany
| | - Miguel A Andrade-Navarro
- Faculty of Biology, Johannes Gutenberg University Mainz and Institute of Molecular Biology, 55128 Mainz, Germany
| | - Bryan A Ballif
- Department of Biology, University of Vermont, Burlington, VT 05405, USA
| | - Ulrich Stelzl
- Otto-Warburg Laboratory, Max-Planck Institute for Molecular Genetics (MPIMG), 14195 Berlin, Germany; Institute of Pharmaceutical Sciences, University of Graz and BioTechMed-Graz, 8010 Graz, Austria.
| |
Collapse
|
20
|
Perez‐Añorve IX, Gonzalez‐De la Rosa CH, Soto‐Reyes E, Beltran‐Anaya FO, Del Moral‐Hernandez O, Salgado‐Albarran M, Angeles‐Zaragoza O, Gonzalez‐Barrios JA, Landero‐Huerta DA, Chavez‐Saldaña M, Garcia‐Carranca A, Villegas‐Sepulveda N, Arechaga‐Ocampo E. New insights into radioresistance in breast cancer identify a dual function of miR-122 as a tumor suppressor and oncomiR. Mol Oncol 2019; 13:1249-1267. [PMID: 30938061 PMCID: PMC6487688 DOI: 10.1002/1878-0261.12483] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2018] [Revised: 02/10/2019] [Accepted: 03/13/2019] [Indexed: 12/25/2022] Open
Abstract
Radioresistance of tumor cells gives rise to local recurrence and disease progression in many patients. MicroRNAs (miRNAs) are master regulators of gene expression that control oncogenic pathways to modulate the radiotherapy response of cells. In the present study, differential expression profiling assays identified 16 deregulated miRNAs in acquired radioresistant breast cancer cells, of which miR-122 was observed to be up-regulated. Functional analysis revealed that miR-122 has a role as a tumor suppressor in parental cells by decreasing survival and promoting radiosensitivity. However, in radioresistant cells, miR-122 functions as an oncomiR by promoting survival. The transcriptomic landscape resulting from knockdown of miR-122 in radioresistant cells showed modulation of the ZNF611, ZNF304, RIPK1, HRAS, DUSP8 and TNFRSF21 genes. Moreover, miR-122 and the set of affected genes were prognostic factors in breast cancer patients treated with radiotherapy. Our data indicate that up-regulation of miR-122 promotes cell survival in acquired radioresistant breast cancer and also suggest that miR-122 differentially controls the response to radiotherapy by a dual function as a tumor suppressor an and oncomiR dependent on cell phenotype.
Collapse
Affiliation(s)
- Isidro X. Perez‐Añorve
- Posgrado en Ciencias Naturales e IngenieriaDivision de Ciencias Naturales e IngenieriaUniversidad Autonoma MetropolitanaMexico CityMexico
- Departamento de Ciencias NaturalesUniversidad Autonoma Metropolitana, Unidad CuajimalpaMexico CityMexico
| | | | - Ernesto Soto‐Reyes
- Departamento de Ciencias NaturalesUniversidad Autonoma Metropolitana, Unidad CuajimalpaMexico CityMexico
| | - Fredy O. Beltran‐Anaya
- Laboratorio de Genomica del CancerInstituto Nacional de Medicina GenomicaMexico CityMexico
| | - Oscar Del Moral‐Hernandez
- Laboratorio de Virologia y Epigenetica del CancerFacultad de Ciencias Quimico BiologicasUniversidad Autonoma de GuerreroChilpancingoMexico
| | - Marisol Salgado‐Albarran
- Departamento de Ciencias NaturalesUniversidad Autonoma Metropolitana, Unidad CuajimalpaMexico CityMexico
| | | | | | - Daniel A. Landero‐Huerta
- Posgrado en Ciencias Naturales e IngenieriaDivision de Ciencias Naturales e IngenieriaUniversidad Autonoma MetropolitanaMexico CityMexico
- Departamento de Ciencias NaturalesUniversidad Autonoma Metropolitana, Unidad CuajimalpaMexico CityMexico
- Laboratorio de Biologia de la ReproduccionInstituto Nacional de PediatríaMexico CityMexico
| | | | - Alejandro Garcia‐Carranca
- Unidad de Investigacion Biomedica en Cancer‐Laboratorio de Virus y CancerInstituto Nacional de CancerologiaMexico CityMexico
| | - Nicolas Villegas‐Sepulveda
- Departamento de Biomedicina MolecularCentro de Investigacion y de Estudios Avanzados (CINVESTAV)Mexico CityMexico
| | - Elena Arechaga‐Ocampo
- Departamento de Ciencias NaturalesUniversidad Autonoma Metropolitana, Unidad CuajimalpaMexico CityMexico
| |
Collapse
|
21
|
Elkjaer ML, Frisch T, Reynolds R, Kacprowski T, Burton M, Kruse TA, Thomassen M, Baumbach J, Illes Z. Unique RNA signature of different lesion types in the brain white matter in progressive multiple sclerosis. Acta Neuropathol Commun 2019; 7:58. [PMID: 31023379 PMCID: PMC6482546 DOI: 10.1186/s40478-019-0709-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2019] [Accepted: 03/22/2019] [Indexed: 01/18/2023] Open
Abstract
The heterogeneity of multiple sclerosis is reflected by dynamic changes of different lesion types in the brain white matter (WM). To identify potential drivers of this process, we RNA-sequenced 73 WM areas from patients with progressive MS (PMS) and 25 control WM. Lesion endophenotypes were described by a computational systems medicine analysis combined with RNAscope, immunohistochemistry, and immunofluorescence. The signature of the normal-appearing WM (NAWM) was more similar to control WM than to lesions: one of the six upregulated genes in NAWM was CD26/DPP4 expressed by microglia. Chronic active lesions that become prominent in PMS had a signature that were different from all other lesion types, and were differentiated from them by two clusters of 62 differentially expressed genes (DEGs). An upcoming MS biomarker, CHI3L1 was among the top ten upregulated genes in chronic active lesions expressed by astrocytes in the rim. TGFβ-R2 was the central hub in a remyelination-related protein interaction network, and was expressed there by astrocytes. We used de novo networks enriched by unique DEGs to determine lesion-specific pathway regulation, i.e. cellular trafficking and activation in active lesions; healing and immune responses in remyelinating lesions characterized by the most heterogeneous immunoglobulin gene expression; coagulation and ion balance in inactive lesions; and metabolic changes in chronic active lesions. Because we found inverse differential regulation of particular genes among different lesion types, our data emphasize that omics related to MS lesions should be interpreted in the context of lesion pathology. Our data indicate that the impact of molecular pathways is substantially changing as different lesions develop. This was also reflected by the high number of unique DEGs that were more common than shared signatures. A special microglia subset characterized by CD26 may play a role in early lesion development, while astrocyte-derived TGFβ-R2 and TGFβ pathways may be drivers of repair in contrast to chronic tissue damage. The highly specific mechanistic signature of chronic active lesions indicates that as these lesions develop in PMS, the molecular changes are substantially skewed: the unique mitochondrial/metabolic changes and specific downregulation of molecules involved in tissue repair may reflect a stage of exhaustion.
Collapse
|
22
|
Wiwie C, Kuznetsova I, Mostafa A, Rauch A, Haakonsson A, Barrio-Hernandez I, Blagoev B, Mandrup S, Schmidt HHHW, Pleschka S, Röttger R, Baumbach J. Time-Resolved Systems Medicine Reveals Viral Infection-Modulating Host Targets. SYSTEMS MEDICINE 2019; 2:1-9. [PMID: 31119214 PMCID: PMC6524659 DOI: 10.1089/sysm.2018.0013] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Introduction: Drug-resistant infections are becoming increasingly frequent worldwide, causing hundreds of thousands of deaths annually. This is partly due to the very limited set of protein drug targets known for human-infecting viral genomes. The eleven influenza virus proteins, for instance, exploit host cell factors for replication and suppression of the antiviral immune responses. A systems medicine approach to identify relevant and druggable host factors would dramatically expand therapeutic options. Therapeutic target identification, however, has hitherto relied on static molecular networks, whereas in reality the interactome, in particular during an infection, is subject to constant change. Methods: We developed time-course network enrichment (TiCoNE), an expert-centered approach for discovering temporal response pathways. In the first stage of TiCoNE, time-series expression data is clustered in a human-augmented manner to identify groups of biological entities with coherent temporal responses. Throughout this process, the expert can add, remove, merge, or split temporal patterns. The resulting groups can then be mapped to an interaction network to identify enriched pathways and to analyze cross-talk enrichments and depletions between groups. Finally, temporal response groups of two experiments can be intersected, to identify condition-variant response patterns that represent promising drug-target candidates. Results: We applied TiCoNE to human gene expression data for influenza A virus infection and rhino virus infection, respectively. We then identified coherent temporal response patterns and employed our cross-talk analysis to establish two potential timelines of systems-level host responses for either infection. Next, we compared the two phenotypes and unraveled condition-variant temporal groups interacting on a networks level. The highest-ranking ones we then validated via literature search and wet-lab experiments. This not only confirmed many of our candidates as previously known, but we also identified phospholipid scramblase 1 (encoded by PLSCR1) as a previously not recognized host factor that is essential for influenza A virus infection. Conclusion: With TiCoNE we developed a novel approach for conjointly analyzing molecular networks with time-series expression data and demonstrated its power by identifying temporal drug-targets. We provide proof-of-concept that not only novel targets can be identified using our approach, but also that anti-infective drug target discovery can be enhanced by investigating temporal molecular networks of the host in response to viral infection.
Collapse
Affiliation(s)
- Christian Wiwie
- Institute of Mathematics and Computer Science, University of Southern Denmark, Odense, Denmark
| | - Irina Kuznetsova
- Institute of Medical Virology, Justus Liebig University Giessen, Giessen, Germany
| | - Ahmed Mostafa
- Institute of Medical Virology, Justus Liebig University Giessen, Giessen, Germany.,Center of Scientific Excellence for Influenza Viruses, National Research Centre, Cairo, Egypt
| | - Alexander Rauch
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, Odense, Denmark
| | - Anders Haakonsson
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, Odense, Denmark
| | - Inigo Barrio-Hernandez
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, Odense, Denmark
| | - Blagoy Blagoev
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, Odense, Denmark
| | - Susanne Mandrup
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, Odense, Denmark
| | - Harald H H W Schmidt
- Department of Pharmacology and Personalized Medicine, Maastricht Centre for Systems Biology (MaCSBio), Maastricht University, Maastricht, the Netherlands
| | - Stephan Pleschka
- Institute of Medical Virology, Justus Liebig University Giessen, Giessen, Germany
| | - Richard Röttger
- Institute of Mathematics and Computer Science, University of Southern Denmark, Odense, Denmark
| | - Jan Baumbach
- Institute of Mathematics and Computer Science, University of Southern Denmark, Odense, Denmark.,Department of Experimental Bioinformatics, Technical University of Munich, Freising, Germany
| |
Collapse
|
23
|
Identification of pathways associated with chemosensitivity through network embedding. PLoS Comput Biol 2019; 15:e1006864. [PMID: 30893303 PMCID: PMC6443184 DOI: 10.1371/journal.pcbi.1006864] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2018] [Revised: 04/01/2019] [Accepted: 02/09/2019] [Indexed: 12/27/2022] Open
Abstract
Basal gene expression levels have been shown to be predictive of cellular response to cytotoxic treatments. However, such analyses do not fully reveal complex genotype- phenotype relationships, which are partly encoded in highly interconnected molecular networks. Biological pathways provide a complementary way of understanding drug response variation among individuals. In this study, we integrate chemosensitivity data from a large-scale pharmacogenomics study with basal gene expression data from the CCLE project and prior knowledge of molecular networks to identify specific pathways mediating chemical response. We first develop a computational method called PACER, which ranks pathways for enrichment in a given set of genes using a novel network embedding method. It examines a molecular network that encodes known gene-gene as well as gene-pathway relationships, and determines a vector representation of each gene and pathway in the same low-dimensional vector space. The relevance of a pathway to the given gene set is then captured by the similarity between the pathway vector and gene vectors. To apply this approach to chemosensitivity data, we identify genes whose basal expression levels in a panel of cell lines are correlated with cytotoxic response to a compound, and then rank pathways for relevance to these response-correlated genes using PACER. Extensive evaluation of this approach on benchmarks constructed from databases of compound target genes and large collections of drug response signatures demonstrates its advantages in identifying compound-pathway associations compared to existing statistical methods of pathway enrichment analysis. The associations identified by PACER can serve as testable hypotheses on chemosensitivity pathways and help further study the mechanisms of action of specific cytotoxic drugs. More broadly, PACER represents a novel technique of identifying enriched properties of any gene set of interest while also taking into account networks of known gene-gene relationships and interactions. Gene expression levels have been used to study the cellular response to drug treatments. However, analysis of gene expression without considering gene interactions cannot fully reveal complex genotype-phenotype relationships. Biological pathways reveal the interactions among genes, thus providing a complementary way of understanding the drug response variation among individuals. In this paper, we aim to identify pathways that mediate the chemical response of each drug. We used the recently generated CTRP pharmacogenomics data and CCLE basal expression data to identify these pathways. We showed that using the prior knowledge encoded in molecular networks substantially improves pathway identification. In particular, we integrate genes and pathways into a large heterogeneous network in which links are protein-protein interactions and gene-pathway affiliations. We then project this heterogeneous network onto a low-dimensional space, which enables more precise similarity measurements between pathways and drug-response-correlated genes. Extensive experiments on two benchmarks show that our method substantially improved the pathway identification performance by using the molecular networks. More importantly, our method represents a novel technique of identifying enriched properties of any gene set of interest while also taking into account networks of known gene-gene relationships and interactions.
Collapse
|
24
|
Cantone M, Küspert M, Reiprich S, Lai X, Eberhardt M, Göttle P, Beyer F, Azim K, Küry P, Wegner M, Vera J. A gene regulatory architecture that controls region-independent dynamics of oligodendrocyte differentiation. Glia 2019; 67:825-843. [PMID: 30730593 DOI: 10.1002/glia.23569] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2018] [Revised: 10/16/2018] [Accepted: 10/25/2018] [Indexed: 12/18/2022]
Abstract
Oligodendrocytes (OLs) facilitate information processing in the vertebrate central nervous system via axonal ensheathment. The structure and dynamics of the regulatory network that mediates oligodendrogenesis are poorly understood. We employed bioinformatics and meta-analysis of high-throughput datasets to reconstruct a regulatory network underpinning OL differentiation. From this network, we identified families of feedforward loops comprising the transcription factors (TFs) Olig2, Sox10, and Tcf7l2 and their targets. Among the targets, we found eight other TFs related to OL differentiation, suggesting a hierarchical architecture in which some TFs (Olig2, Sox10, and Tcf7l2) regulate via feedforward loops the expression of others (Sox2, Sox6, Sox11, Nkx2-2, Nkx6-2, Hes5, Myt1, and Myrf). Model simulations with a kinetic model reproduced the mechanisms of OL differentiation only when in the model, Sox10-mediated repression of Tcf7l2 by miR-338/miR-155 was introduced, a prediction confirmed in genetic functional experiments. Additional model simulations suggested that OLs from dorsal regions emerge through BMP/Sox9 signaling.
Collapse
Affiliation(s)
- Martina Cantone
- Laboratory of Systems Tumor Immunology, Hautklinik, Universitätsklinikum Erlangen and Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany.,Faculty of Mechanical Engineering, Specialty Division for Systems Biotechnology, Technische Universität München, Munich, Germany
| | - Melanie Küspert
- Institut für Biochemie, Emil-Fischer-Zentrum, Friedrich-Alexander-Universität Erlangen-Nürnberg, Germany
| | - Simone Reiprich
- Institut für Biochemie, Emil-Fischer-Zentrum, Friedrich-Alexander-Universität Erlangen-Nürnberg, Germany
| | - Xin Lai
- Laboratory of Systems Tumor Immunology, Hautklinik, Universitätsklinikum Erlangen and Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany
| | - Martin Eberhardt
- Laboratory of Systems Tumor Immunology, Hautklinik, Universitätsklinikum Erlangen and Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany
| | - Peter Göttle
- Neuroregeneration, Department of Neurology, Medical Faculty, Heinrich-Heine University, Düsseldorf, Germany
| | - Felix Beyer
- Neuroregeneration, Department of Neurology, Medical Faculty, Heinrich-Heine University, Düsseldorf, Germany
| | - Kasum Azim
- Neuroregeneration, Department of Neurology, Medical Faculty, Heinrich-Heine University, Düsseldorf, Germany
| | - Patrick Küry
- Neuroregeneration, Department of Neurology, Medical Faculty, Heinrich-Heine University, Düsseldorf, Germany
| | - Michael Wegner
- Institut für Biochemie, Emil-Fischer-Zentrum, Friedrich-Alexander-Universität Erlangen-Nürnberg, Germany
| | - Julio Vera
- Laboratory of Systems Tumor Immunology, Hautklinik, Universitätsklinikum Erlangen and Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany
| |
Collapse
|
25
|
Kusonmano K, Halle MK, Wik E, Hoivik EA, Krakstad C, Mauland KK, Tangen IL, Berg A, Werner HMJ, Trovik J, Øyan AM, Kalland KH, Jonassen I, Salvesen HB, Petersen K. Identification of highly connected and differentially expressed gene subnetworks in metastasizing endometrial cancer. PLoS One 2018; 13:e0206665. [PMID: 30383835 PMCID: PMC6211718 DOI: 10.1371/journal.pone.0206665] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2018] [Accepted: 10/17/2018] [Indexed: 12/22/2022] Open
Abstract
We have identified nine highly connected and differentially expressed gene subnetworks between aggressive primary tumors and metastatic lesions in endometrial carcinomas. We implemented a novel pipeline combining gene set and network approaches, which here allows integration of protein-protein interactions and gene expression data. The resulting subnetworks are significantly associated with disease progression across tumor stages from complex atypical hyperplasia, primary tumors to metastatic lesions. The nine subnetworks include genes related to metastasizing features such as epithelial-mesenchymal transition (EMT), hypoxia and cell proliferation. TCF4 and TWIST2 were found as central genes in the subnetwork related to EMT. Two of the identified subnetworks display statistically significant association to patient survival, which were further supported by an independent validation in the data from The Cancer Genome Atlas data collection. The first subnetwork contains genes related to cell proliferation and cell cycle, while the second contains genes involved in hypoxia such as HIF1A and EGLN3. Our findings provide a promising context to elucidate the biological mechanisms of metastasis, suggest potential prognostic markers and further identify therapeutic targets. The pipeline R source code is freely available, including permutation tests to assess statistical significance of the identified subnetworks.
Collapse
Affiliation(s)
- Kanthida Kusonmano
- Computational Biology Unit, Department of Informatics, University of Bergen, Bergen, Norway
- Department of Obstetrics and Gynecology, Haukeland University Hospital, Bergen, Norway
- Bioinformatics and Systems Biology Program, School of Bioresources and Technology, King Mongkut’s University of Technology Thonburi, Bangkok, Thailand
- * E-mail:
| | - Mari K. Halle
- Department of Obstetrics and Gynecology, Haukeland University Hospital, Bergen, Norway
- Centre for Cancer Biomarkers, Department of Clinical Science, University of Bergen, Bergen, Norway
| | - Elisabeth Wik
- Centre for Cancer Biomarkers, Department of Clinical Medicine, University of Bergen, Bergen, Norway
- Department of Pathology, The Gade Institute, Haukeland University Hospital, Bergen, Norway
| | - Erling A. Hoivik
- Department of Obstetrics and Gynecology, Haukeland University Hospital, Bergen, Norway
- Centre for Cancer Biomarkers, Department of Clinical Science, University of Bergen, Bergen, Norway
| | - Camilla Krakstad
- Department of Obstetrics and Gynecology, Haukeland University Hospital, Bergen, Norway
- Centre for Cancer Biomarkers, Department of Clinical Science, University of Bergen, Bergen, Norway
| | - Karen K. Mauland
- Department of Obstetrics and Gynecology, Haukeland University Hospital, Bergen, Norway
- Centre for Cancer Biomarkers, Department of Clinical Science, University of Bergen, Bergen, Norway
| | - Ingvild L. Tangen
- Department of Obstetrics and Gynecology, Haukeland University Hospital, Bergen, Norway
- Centre for Cancer Biomarkers, Department of Clinical Science, University of Bergen, Bergen, Norway
| | - Anna Berg
- Department of Obstetrics and Gynecology, Haukeland University Hospital, Bergen, Norway
- Centre for Cancer Biomarkers, Department of Clinical Science, University of Bergen, Bergen, Norway
| | - Henrica M. J. Werner
- Department of Obstetrics and Gynecology, Haukeland University Hospital, Bergen, Norway
- Centre for Cancer Biomarkers, Department of Clinical Science, University of Bergen, Bergen, Norway
| | - Jone Trovik
- Department of Obstetrics and Gynecology, Haukeland University Hospital, Bergen, Norway
- Centre for Cancer Biomarkers, Department of Clinical Science, University of Bergen, Bergen, Norway
| | - Anne M. Øyan
- Centre for Cancer Biomarkers, Department of Clinical Science, University of Bergen, Bergen, Norway
- Department of Microbiology, Haukeland University Hospital, Bergen, Norway
| | - Karl-Henning Kalland
- Centre for Cancer Biomarkers, Department of Clinical Science, University of Bergen, Bergen, Norway
- Department of Microbiology, Haukeland University Hospital, Bergen, Norway
| | - Inge Jonassen
- Computational Biology Unit, Department of Informatics, University of Bergen, Bergen, Norway
- Centre for Cancer Biomarkers, Department of Informatics, University of Bergen, Bergen, Norway
| | - Helga B. Salvesen
- Department of Obstetrics and Gynecology, Haukeland University Hospital, Bergen, Norway
- Centre for Cancer Biomarkers, Department of Clinical Science, University of Bergen, Bergen, Norway
| | - Kjell Petersen
- Computational Biology Unit, Department of Informatics, University of Bergen, Bergen, Norway
| |
Collapse
|
26
|
Conrad T, Kniemeyer O, Henkel SG, Krüger T, Mattern DJ, Valiante V, Guthke R, Jacobsen ID, Brakhage AA, Vlaic S, Linde J. Module-detection approaches for the integration of multilevel omics data highlight the comprehensive response of Aspergillus fumigatus to caspofungin. BMC SYSTEMS BIOLOGY 2018; 12:88. [PMID: 30342519 PMCID: PMC6195963 DOI: 10.1186/s12918-018-0620-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/12/2018] [Accepted: 10/08/2018] [Indexed: 12/20/2022]
Abstract
Background Omics data provide deep insights into overall biological processes of organisms. However, integration of data from different molecular levels such as transcriptomics and proteomics, still remains challenging. Analyzing lists of differentially abundant molecules from diverse molecular levels often results in a small overlap mainly due to different regulatory mechanisms, temporal scales, and/or inherent properties of measurement methods. Module-detecting algorithms identifying sets of closely related proteins from protein-protein interaction networks (PPINs) are promising approaches for a better data integration. Results Here, we made use of transcriptome, proteome and secretome data from the human pathogenic fungus Aspergillus fumigatus challenged with the antifungal drug caspofungin. Caspofungin targets the fungal cell wall which leads to a compensatory stress response. We analyzed the omics data using two different approaches: First, we applied a simple, classical approach by comparing lists of differentially expressed genes (DEGs), differentially synthesized proteins (DSyPs) and differentially secreted proteins (DSePs); second, we used a recently published module-detecting approach, ModuleDiscoverer, to identify regulatory modules from PPINs in conjunction with the experimental data. Our results demonstrate that regulatory modules show a notably higher overlap between the different molecular levels and time points than the classical approach. The additional structural information provided by regulatory modules allows for topological analyses. As a result, we detected a significant association of omics data with distinct biological processes such as regulation of kinase activity, transport mechanisms or amino acid metabolism. We also found a previously unreported increased production of the secondary metabolite fumagillin by A. fumigatus upon exposure to caspofungin. Furthermore, a topology-based analysis of potential key factors contributing to drug-caused side effects identified the highly conserved protein polyubiquitin as a central regulator. Interestingly, polyubiquitin UbiD neither belonged to the groups of DEGs, DSyPs nor DSePs but most likely strongly influenced their levels. Conclusion Module-detecting approaches support the effective integration of multilevel omics data and provide a deep insight into complex biological relationships connecting these levels. They facilitate the identification of potential key players in the organism’s stress response which cannot be detected by commonly used approaches comparing lists of differentially abundant molecules. Electronic supplementary material The online version of this article (10.1186/s12918-018-0620-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- T Conrad
- Systems Biology/Bioinformatics, Leibniz Institute for Natural Product Research and Infection Biology - Hans Knöll Institute, Jena, Germany.
| | - O Kniemeyer
- Molecular and Applied Microbiology, Leibniz Institute for Natural Product Research and Infection Biology - Hans Knöll Institute, Jena, Germany
| | | | - T Krüger
- Molecular and Applied Microbiology, Leibniz Institute for Natural Product Research and Infection Biology - Hans Knöll Institute, Jena, Germany
| | - D J Mattern
- Molecular and Applied Microbiology, Leibniz Institute for Natural Product Research and Infection Biology - Hans Knöll Institute, Jena, Germany.,Present address: PerkinElmer Inc., Rodgau, Germany
| | - V Valiante
- Biobricks of Microbial Natural Product Syntheses, Leibniz Institute for Natural Product Research and Infection Biology - Hans Knöll Institute, Jena, Germany
| | - R Guthke
- Systems Biology/Bioinformatics, Leibniz Institute for Natural Product Research and Infection Biology - Hans Knöll Institute, Jena, Germany
| | - I D Jacobsen
- Microbial Immunology, Leibniz Institute for Natural Product Research and Infection Biology - Hans Knöll Institute, Jena, Germany.,Institute for Microbiology, Friedrich Schiller University, Jena, Germany
| | - A A Brakhage
- Molecular and Applied Microbiology, Leibniz Institute for Natural Product Research and Infection Biology - Hans Knöll Institute, Jena, Germany.,Institute for Microbiology, Friedrich Schiller University, Jena, Germany
| | - S Vlaic
- Systems Biology/Bioinformatics, Leibniz Institute for Natural Product Research and Infection Biology - Hans Knöll Institute, Jena, Germany
| | - J Linde
- Research Group PiDOMICs, Leibniz Institute for Natural Product Research and Infection Biology - Hans Knöll Institute, Jena, Germany.,Institute for Bacterial Infections and Zoonoses, Federal Research Institute for Animal Health - Friedrich Loeffler Institute, Jena, Germany
| |
Collapse
|
27
|
Huang X, Lin X, Zeng J, Wang L, Yin P, Zhou L, Hu C, Yao W. A Computational Method of Defining Potential Biomarkers based on Differential Sub-Networks. Sci Rep 2017; 7:14339. [PMID: 29085035 PMCID: PMC5662748 DOI: 10.1038/s41598-017-14682-5] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2017] [Accepted: 10/16/2017] [Indexed: 01/05/2023] Open
Abstract
Analyzing omics data from a network-based perspective can facilitate biomarker discovery. To improve disease diagnosis and identify prospective information indicating the onset of complex disease, a computational method for identifying potential biomarkers based on differential sub-networks (PB-DSN) is developed. In PB-DSN, Pearson correlation coefficient (PCC) is used to measure the relationship between feature ratios and to infer potential networks. A differential sub-network is extracted to identify crucial information for discriminating different groups and indicating the emergence of complex diseases. Subsequently, PB-DSN defines potential biomarkers based on the topological analysis of these differential sub-networks. In this study, PB-DSN is applied to handle a static genomics dataset of small, round blue cell tumors and a time-series metabolomics dataset of hepatocellular carcinoma. PB-DSN is compared with support vector machine-recursive feature elimination, multivariate empirical Bayes statistics, analyzing time-series data based on dynamic networks, molecular networks based on PCC, PinnacleZ, graph-based iterative group analysis, KeyPathwayMiner and BioNet. The better performance of PB-DSN not only demonstrates its effectiveness for the identification of discriminative features that facilitate disease classification, but also shows its potential for the identification of warning signals.
Collapse
Affiliation(s)
- Xin Huang
- School of Computer Science & Technology, Dalian University of Technology, 116024, Dalian, China
| | - Xiaohui Lin
- School of Computer Science & Technology, Dalian University of Technology, 116024, Dalian, China.
| | - Jun Zeng
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian, 116023, China
| | - Lichao Wang
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian, 116023, China
| | - Peiyuan Yin
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian, 116023, China
| | - Lina Zhou
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian, 116023, China
| | - Chunxiu Hu
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian, 116023, China
| | - Weihong Yao
- School of Computer Science & Technology, Dalian University of Technology, 116024, Dalian, China
| |
Collapse
|
28
|
Papaleo E, Gromova I, Gromov P. Gaining insights into cancer biology through exploration of the cancer secretome using proteomic and bioinformatic tools. Expert Rev Proteomics 2017; 14:1021-1035. [PMID: 28967788 DOI: 10.1080/14789450.2017.1387053] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
INTRODUCTION Tumor-associated proteins released by cancer cells and by tumor stroma cells, referred as 'cancer secretome', represent a valuable resource for discovery of potential cancer biomarkers. The last decade was marked by a great increase in number of studies focused on various aspects of cancer secretome including, composition and identification of components externalized by malignant cells and by the components of tumor microenvironment. Areas covered: Here, we provide an overview of achievements in the proteomic analysis of the cancer secretome, elicited through the tumor-associated interstitial fluid recovered from malignant tissues ex vivo or the protein component of conditioned media obtained from cultured cancer cells in vitro. We summarize various bioinformatic tools and approaches and critically appraise their outcomes, focusing on problems and challenges that arise when applied for the analysis of cancer secretomic databases. Expert commentary: Recent achievements in the omics- analysis of structural and metabolic aspects of altered cancer secretome contribute greatly to the various hallmarks of cancer including the identification of clinically significant biomarkers and potential targets for therapeutic intervention.
Collapse
Affiliation(s)
- Elena Papaleo
- a Danish Cancer Society Research Center, Computational Biology Laboratory , Copenhagen , Denmark
| | - Irina Gromova
- b Danish Cancer Society Research Center, Genome Integrity Unit, Breast Cancer Biology Group , Copenhagen , Denmark
| | - Pavel Gromov
- b Danish Cancer Society Research Center, Genome Integrity Unit, Breast Cancer Biology Group , Copenhagen , Denmark
| |
Collapse
|
29
|
(Re-)programming of subtype specific cardiomyocytes. Adv Drug Deliv Rev 2017; 120:142-167. [PMID: 28916499 DOI: 10.1016/j.addr.2017.09.005] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2017] [Revised: 08/29/2017] [Accepted: 09/07/2017] [Indexed: 01/10/2023]
Abstract
Adult cardiomyocytes (CMs) possess a highly restricted intrinsic regenerative potential - a major barrier to the effective treatment of a range of chronic degenerative cardiac disorders characterized by cellular loss and/or irreversible dysfunction and which underlies the majority of deaths in developed countries. Both stem cell programming and direct cell reprogramming hold promise as novel, potentially curative approaches to address this therapeutic challenge. The advent of induced pluripotent stem cells (iPSCs) has introduced a second pluripotent stem cell source besides embryonic stem cells (ESCs), enabling even autologous cardiomyocyte production. In addition, the recent achievement of directly reprogramming somatic cells into cardiomyocytes is likely to become of great importance. In either case, different clinical scenarios will require the generation of highly pure, specific cardiac cellular-subtypes. In this review, we discuss these themes as related to the cardiovascular stem cell and programming field, including a focus on the emergent topic of pacemaker cell generation for the development of biological pacemakers and in vitro drug testing.
Collapse
|
30
|
Nikolayeva I, Guitart Pla O, Schwikowski B. Network module identification-A widespread theoretical bias and best practices. Methods 2017; 132:19-25. [PMID: 28941788 DOI: 10.1016/j.ymeth.2017.08.008] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2017] [Revised: 08/14/2017] [Accepted: 08/18/2017] [Indexed: 10/18/2022] Open
Abstract
Biological processes often manifest themselves as coordinated changes across modules, i.e., sets of interacting genes. Commonly, the high dimensionality of genome-scale data prevents the visual identification of such modules, and straightforward computational search through a set of known pathways is a limited approach. Therefore, tools for the data-driven, computational, identification of modules in gene interaction networks have become popular components of visualization and visual analytics workflows. However, many such tools are known to result in modules that are large, and therefore hard to interpret biologically. Here, we show that the empirically known tendency towards large modules can be attributed to a statistical bias present in many module identification tools, and discuss possible remedies from a mathematical perspective. In the current absence of a straightforward practical solution, we outline our view of best practices for the use of the existing tools.
Collapse
Affiliation(s)
- Iryna Nikolayeva
- Systems Biology Lab, Center of Bioinformatics, Biostatistics and Integrative Biology, Institut Pasteur, Paris, France; Functional Genetics of Infectious Diseases Unit, Department Genomes and Genetics, Institut Pasteur, Paris, France; Université Paris-Descartes, Sorbonne Paris Cité, Paris, France
| | - Oriol Guitart Pla
- Systems Biology Lab, Center of Bioinformatics, Biostatistics and Integrative Biology, Institut Pasteur, Paris, France
| | - Benno Schwikowski
- Systems Biology Lab, Center of Bioinformatics, Biostatistics and Integrative Biology, Institut Pasteur, Paris, France.
| |
Collapse
|
31
|
Ghanat Bari M, Ung CY, Zhang C, Zhu S, Li H. Machine Learning-Assisted Network Inference Approach to Identify a New Class of Genes that Coordinate the Functionality of Cancer Networks. Sci Rep 2017; 7:6993. [PMID: 28765560 PMCID: PMC5539301 DOI: 10.1038/s41598-017-07481-5] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2017] [Accepted: 06/27/2017] [Indexed: 12/25/2022] Open
Abstract
Emerging evidence indicates the existence of a new class of cancer genes that act as "signal linkers" coordinating oncogenic signals between mutated and differentially expressed genes. While frequently mutated oncogenes and differentially expressed genes, which we term Class I cancer genes, are readily detected by most analytical tools, the new class of cancer-related genes, i.e., Class II, escape detection because they are neither mutated nor differentially expressed. Given this hypothesis, we developed a Machine Learning-Assisted Network Inference (MALANI) algorithm, which assesses all genes regardless of expression or mutational status in the context of cancer etiology. We used 8807 expression arrays, corresponding to 9 cancer types, to build more than 2 × 108 Support Vector Machine (SVM) models for reconstructing a cancer network. We found that ~3% of ~19,000 not differentially expressed genes are Class II cancer gene candidates. Some Class II genes that we found, such as SLC19A1 and ATAD3B, have been recently reported to associate with cancer outcomes. To our knowledge, this is the first study that utilizes both machine learning and network biology approaches to uncover Class II cancer genes in coordinating functionality in cancer networks and will illuminate our understanding of how genes are modulated in a tissue-specific network contribute to tumorigenesis and therapy development.
Collapse
Affiliation(s)
- Mehrab Ghanat Bari
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine, Rochester, MN, 55905, USA
| | - Choong Yong Ung
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine, Rochester, MN, 55905, USA
| | - Cheng Zhang
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine, Rochester, MN, 55905, USA
| | - Shizhen Zhu
- Department of Biochemistry and Molecular Biology, Mayo Clinic College of Medicine, Rochester, MN, 55905, USA
| | - Hu Li
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine, Rochester, MN, 55905, USA.
| |
Collapse
|
32
|
Taye B, Vaz C, Tanavde V, Kuznetsov VA, Eisenhaber F, Sugrue RJ, Maurer-Stroh S. Benchmarking selected computational gene network growing tools in context of virus-host interactions. Sci Rep 2017; 7:5805. [PMID: 28724991 PMCID: PMC5517527 DOI: 10.1038/s41598-017-06020-6] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2016] [Accepted: 06/07/2017] [Indexed: 01/04/2023] Open
Abstract
Several available online tools provide network growing functions where an algorithm utilizing different data sources suggests additional genes/proteins that should connect an input gene set into functionally meaningful networks. Using the well-studied system of influenza host interactions, we compare the network growing function of two free tools GeneMANIA and STRING and the commercial IPA for their performance of recovering known influenza A virus host factors previously identified from siRNA screens. The result showed that given small (~30 genes) or medium (~150 genes) input sets all three network growing tools detect significantly more known host factors than random human genes with STRING overall performing strongest. Extending the networks with all the three tools significantly improved the detection of GO biological processes of known host factors compared to not growing networks. Interestingly, the rate of identification of true host factors using computational network growing is equal or better to doing another experimental siRNA screening study which could also be true and applied to other biological pathways/processes.
Collapse
Affiliation(s)
- Biruhalem Taye
- Bioinformatics Institute, A*STAR, 30 Biopolis Street #07-01 Matrix, Singapore, 138671, Singapore. .,School of Biological Sciences, Nanyang Technological University, 60 Nanyang Drive, Singapore, 637551, Singapore. .,Aklilu Lemma Institute of Pathobiology, Addis Ababa University, P.O.BOX 1176, Addis Ababa, Ethiopia.
| | - Candida Vaz
- Bioinformatics Institute, A*STAR, 30 Biopolis Street #07-01 Matrix, Singapore, 138671, Singapore
| | - Vivek Tanavde
- Bioinformatics Institute, A*STAR, 30 Biopolis Street #07-01 Matrix, Singapore, 138671, Singapore.,Institute of Medical Biology, A*STAR, 8A Biomedical Grove, #06-06 Immunos, Singapore, 138648, Singapore
| | - Vladimir A Kuznetsov
- Bioinformatics Institute, A*STAR, 30 Biopolis Street #07-01 Matrix, Singapore, 138671, Singapore.,School of Computer Engineering, Nanyang Technological University, 50 Nanyang Drive, Singapore, 637553, Singapore
| | - Frank Eisenhaber
- Bioinformatics Institute, A*STAR, 30 Biopolis Street #07-01 Matrix, Singapore, 138671, Singapore.,Department of Biological Sciences, National University of Singapore, 8 Medical Drive, Singapore, 117597, Singapore.,School of Computer Engineering, Nanyang Technological University, 50 Nanyang Drive, Singapore, 637553, Singapore
| | - Richard J Sugrue
- School of Biological Sciences, Nanyang Technological University, 60 Nanyang Drive, Singapore, 637551, Singapore
| | - Sebastian Maurer-Stroh
- Bioinformatics Institute, A*STAR, 30 Biopolis Street #07-01 Matrix, Singapore, 138671, Singapore.,Department of Biological Sciences, National University of Singapore, 8 Medical Drive, Singapore, 117597, Singapore.,National Public Health Laboratory, Ministry of Health, 3 Biopolis Drive, Synapse #05-14/16, Singapore, 138623, Singapore
| |
Collapse
|
33
|
Abstract
De novo pathway enrichment is a powerful approach to discover previously uncharacterized molecular mechanisms in addition to already known pathways. To achieve this, condition-specific functional modules are extracted from large interaction networks. Here, we give an overview of the state of the art and present the first framework for assessing the performance of existing methods. We identified 19 tools and selected seven representative candidates for a comparative analysis with more than 12,000 runs, spanning different biological networks, molecular profiles, and parameters. Our results show that none of the methods consistently outperforms the others. To mitigate this issue for biomedical researchers, we provide guidelines to choose the appropriate tool for a given dataset. Moreover, our framework is the first attempt for a quantitative evaluation of de novo methods, which will allow the bioinformatics community to objectively compare future tools against the state of the art. De novo pathway enrichment methods are essential to understand disease complexity. They can uncover disease-specific functional modules by integrating molecular interaction networks with expression profiles. However, how should researchers choose one method out of several? In this article, a group of scientists from Denmark and Germany presents the first attempt to quantitatively evaluate existing methods. This framework will help the biomedical community to find the appropriate tool(s) for their data. They created synthetic gold standards and simulated expression profiles to perform a systematic assessment of various tools. They observed that the choice of interaction network, parameter settings, preprocessing of expression data and statistical properties of the expression profiles influence the results to a large extent. The results reveal strengths and limitations of the individual methods and suggest using two or more tools to obtain comprehensive disease-modules.
Collapse
|
34
|
Differential Effects of Vitamins A and D on the Transcriptional Landscape of Human Monocytes during Infection. Sci Rep 2017; 7:40599. [PMID: 28094291 PMCID: PMC5240108 DOI: 10.1038/srep40599] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2016] [Accepted: 12/06/2016] [Indexed: 12/16/2022] Open
Abstract
Vitamin A and vitamin D are essential nutrients with a wide range of pleiotropic effects in humans. Beyond their well-documented roles in cellular differentiation, embryogenesis, tissue maintenance and bone/calcium homeostasis, both vitamins have attracted considerable attention due to their association with-immunological traits. Nevertheless, our knowledge of their immunomodulatory potential during infection is restricted to single gene-centric studies, which do not reflect the complexity of immune processes. In the present study, we performed a comprehensive RNA-seq-based approach to define the whole immunomodulatory role of vitamins A and D during infection. Using human monocytes as host cells, we characterized the differential role of both vitamins upon infection with three different pathogens: Aspergillus fumigatus, Candida albicans and Escherichia coli. Both vitamins showed an unexpected ability to counteract the pathogen-induced transcriptional responses. Upon infection, we identified 346 and 176 immune-relevant genes that were regulated by atRA and vitD, respectively. This immunomodulatory activity was dependent on the inflammatory stimulus, allowing us to distinguish regulatory patterns which were specific for each stimulatory setting. Moreover, we explored possible direct and indirect mechanisms of vitamin-mediated regulation of the immune response. Our findings highlight the importance of vitamin-monitoring in critically ill patients. Moreover, our results underpin the potential of atRA and vitD as therapeutic options for anti-inflammatory treatment.
Collapse
|
35
|
Gligorijević V, Malod-Dognin N, Pržulj N. Integrative methods for analyzing big data in precision medicine. Proteomics 2016; 16:741-58. [PMID: 26677817 DOI: 10.1002/pmic.201500396] [Citation(s) in RCA: 98] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2015] [Revised: 11/16/2015] [Accepted: 12/09/2015] [Indexed: 12/19/2022]
Abstract
We provide an overview of recent developments in big data analyses in the context of precision medicine and health informatics. With the advance in technologies capturing molecular and medical data, we entered the area of "Big Data" in biology and medicine. These data offer many opportunities to advance precision medicine. We outline key challenges in precision medicine and present recent advances in data integration-based methods to uncover personalized information from big data produced by various omics studies. We survey recent integrative methods for disease subtyping, biomarkers discovery, and drug repurposing, and list the tools that are available to domain scientists. Given the ever-growing nature of these big data, we highlight key issues that big data integration methods will face.
Collapse
Affiliation(s)
| | | | - Nataša Pržulj
- Department of Computing, Imperial College London, London, UK
| |
Collapse
|
36
|
Herskind C, Talbot CJ, Kerns SL, Veldwijk MR, Rosenstein BS, West CML. Radiogenomics: A systems biology approach to understanding genetic risk factors for radiotherapy toxicity? Cancer Lett 2016; 382:95-109. [PMID: 26944314 PMCID: PMC5016239 DOI: 10.1016/j.canlet.2016.02.035] [Citation(s) in RCA: 59] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2015] [Revised: 02/17/2016] [Accepted: 02/19/2016] [Indexed: 02/06/2023]
Abstract
Adverse reactions in normal tissue after radiotherapy (RT) limit the dose that can be given to tumour cells. Since 80% of individual variation in clinical response is estimated to be caused by patient-related factors, identifying these factors might allow prediction of patients with increased risk of developing severe reactions. While inactivation of cell renewal is considered a major cause of toxicity in early-reacting normal tissues, complex interactions involving multiple cell types, cytokines, and hypoxia seem important for late reactions. Here, we review 'omics' approaches such as screening of genetic polymorphisms or gene expression analysis, and assess the potential of epigenetic factors, posttranslational modification, signal transduction, and metabolism. Furthermore, functional assays have suggested possible associations with clinical risk of adverse reaction. Pathway analysis incorporating different 'omics' approaches may be more efficient in identifying critical pathways than pathway analysis based on single 'omics' data sets. Integrating these pathways with functional assays may be powerful in identifying multiple subgroups of RT patients characterised by different mechanisms. Thus 'omics' and functional approaches may synergise if they are integrated into radiogenomics 'systems biology' to facilitate the goal of individualised radiotherapy.
Collapse
Affiliation(s)
- Carsten Herskind
- Department of Radiation Oncology, Universitätsmedizin Mannheim, Medical Faculty Mannheim, Heidelberg University, Germany.
| | | | - Sarah L Kerns
- Department of Radiation Oncology, Mount Sinai School of Medicine, New York, USA; Department of Radiation Oncology, University of Rochester Medical Center, Rochester, USA
| | - Marlon R Veldwijk
- Department of Radiation Oncology, Universitätsmedizin Mannheim, Medical Faculty Mannheim, Heidelberg University, Germany
| | - Barry S Rosenstein
- Department of Radiation Oncology, Mount Sinai School of Medicine, New York, USA; Department of Radiation Oncology, New York University School of Medicine, USA; Department of Dermatology, Mount Sinai School of Medicine, New York, USA
| | - Catharine M L West
- Institute of Cancer Sciences, University of Manchester, Christie Hospital, Manchester, UK
| |
Collapse
|
37
|
Koumakis L, Kanterakis A, Kartsaki E, Chatzimina M, Zervakis M, Tsiknakis M, Vassou D, Kafetzopoulos D, Marias K, Moustakis V, Potamias G. MinePath: Mining for Phenotype Differential Sub-paths in Molecular Pathways. PLoS Comput Biol 2016; 12:e1005187. [PMID: 27832067 PMCID: PMC5104320 DOI: 10.1371/journal.pcbi.1005187] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2016] [Accepted: 10/10/2016] [Indexed: 01/04/2023] Open
Abstract
Pathway analysis methodologies couple traditional gene expression analysis with knowledge encoded in established molecular pathway networks, offering a promising approach towards the biological interpretation of phenotype differentiating genes. Early pathway analysis methodologies, named as gene set analysis (GSA), view pathways just as plain lists of genes without taking into account either the underlying pathway network topology or the involved gene regulatory relations. These approaches, even if they achieve computational efficiency and simplicity, consider pathways that involve the same genes as equivalent in terms of their gene enrichment characteristics. Most recent pathway analysis approaches take into account the underlying gene regulatory relations by examining their consistency with gene expression profiles and computing a score for each profile. Even with this approach, assessing and scoring single-relations limits the ability to reveal key gene regulation mechanisms hidden in longer pathway sub-paths. We introduce MinePath, a pathway analysis methodology that addresses and overcomes the aforementioned problems. MinePath facilitates the decomposition of pathways into their constituent sub-paths. Decomposition leads to the transformation of single-relations to complex regulation sub-paths. Regulation sub-paths are then matched with gene expression sample profiles in order to evaluate their functional status and to assess phenotype differential power. Assessment of differential power supports the identification of the most discriminant profiles. In addition, MinePath assess the significance of the pathways as a whole, ranking them by their p-values. Comparison results with state-of-the-art pathway analysis systems are indicative for the soundness and reliability of the MinePath approach. In contrast with many pathway analysis tools, MinePath is a web-based system (www.minepath.org) offering dynamic and rich pathway visualization functionality, with the unique characteristic to color regulatory relations between genes and reveal their phenotype inclination. This unique characteristic makes MinePath a valuable tool for in silico molecular biology experimentation as it serves the biomedical researchers' exploratory needs to reveal and interpret the regulatory mechanisms that underlie and putatively govern the expression of target phenotypes.
Collapse
Affiliation(s)
- Lefteris Koumakis
- Computational BioMedicine Laboratory (CBML), Institute of Computers Science (ICS), Foundation for Research and Technology-Hellas (FORTH), Heraklion, Crete, Greece
| | - Alexandros Kanterakis
- Computational BioMedicine Laboratory (CBML), Institute of Computers Science (ICS), Foundation for Research and Technology-Hellas (FORTH), Heraklion, Crete, Greece
| | - Evgenia Kartsaki
- Computational BioMedicine Laboratory (CBML), Institute of Computers Science (ICS), Foundation for Research and Technology-Hellas (FORTH), Heraklion, Crete, Greece
| | - Maria Chatzimina
- Computational BioMedicine Laboratory (CBML), Institute of Computers Science (ICS), Foundation for Research and Technology-Hellas (FORTH), Heraklion, Crete, Greece
| | - Michalis Zervakis
- School of Electrical and Computer Engineering, Technical University of Crete, Greece
| | - Manolis Tsiknakis
- Computational BioMedicine Laboratory (CBML), Institute of Computers Science (ICS), Foundation for Research and Technology-Hellas (FORTH), Heraklion, Crete, Greece
- Department of Informatics Engineering, Technological Educational Institute of Crete, Greece
| | - Despoina Vassou
- Institute of Molecular Biology & Biotechnology, FORTH, Heraklion, Crete, Greece
| | | | - Kostas Marias
- Computational BioMedicine Laboratory (CBML), Institute of Computers Science (ICS), Foundation for Research and Technology-Hellas (FORTH), Heraklion, Crete, Greece
| | - Vassilis Moustakis
- School of Production Engineering & Management, Technical University of Crete, Greece
| | - George Potamias
- Computational BioMedicine Laboratory (CBML), Institute of Computers Science (ICS), Foundation for Research and Technology-Hellas (FORTH), Heraklion, Crete, Greece
| |
Collapse
|
38
|
Alcaraz N, List M, Dissing-Hansen M, Rehmsmeier M, Tan Q, Mollenhauer J, Ditzel HJ, Baumbach J. Robust de novo pathway enrichment with KeyPathwayMiner 5. F1000Res 2016; 5:1531. [PMID: 27540470 PMCID: PMC4965696 DOI: 10.12688/f1000research.9054.1] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 06/22/2016] [Indexed: 01/26/2023] Open
Abstract
Identifying functional modules or novel active pathways, recently termed de novo pathway enrichment, is a computational systems biology challenge that has gained much attention during the last decade. Given a large biological interaction network, KeyPathwayMiner extracts connected subnetworks that are enriched for differentially active entities from a series of molecular profiles encoded as binary indicator matrices. Since interaction networks constantly evolve, an important question is how robust the extracted results are when the network is modified. We enable users to study this effect through several network perturbation techniques and over a range of perturbation degrees. In addition, users may now provide a gold-standard set to determine how enriched extracted pathways are with relevant genes compared to randomized versions of the original network.
Collapse
Affiliation(s)
- Nicolas Alcaraz
- Department of Mathematics and Computer Science, University of Southern Denmark, 5230 Odense, Denmark; Department of Cancer and Inflammation Research, Institute of Molecular Medicine, University of Southern Denmark, 5000 Odense, Denmark
| | - Markus List
- Department of Mathematics and Computer Science, University of Southern Denmark, 5230 Odense, Denmark; Department of Cancer and Inflammation Research, Institute of Molecular Medicine, University of Southern Denmark, 5000 Odense, Denmark; Lundbeckfonden Center of Excellence in Nanomedicine NanoCAN, University of Southern Denmark, 5000 Odense, Denmark; Institute of Clinical Research, University of Southern Denmark, 5000 Odense, Denmark; Max Planck Institute for Informatics, 66123 Saarbrucken, Germany
| | - Martin Dissing-Hansen
- Department of Mathematics and Computer Science, University of Southern Denmark, 5230 Odense, Denmark
| | - Marc Rehmsmeier
- Integrated Research Institute (IRI) for the Life Sciences and Department of Biology, Humboldt-Universitat zu Berlin, 10099 Berlin, Germany
| | - Qihua Tan
- Institute of Clinical Research, University of Southern Denmark, 5000 Odense, Denmark; Epidemiology, Biostatistics and Biodemography, Institute of Public Health, University of Southern Denmark, 5000 Odense, Denmark
| | - Jan Mollenhauer
- Department of Cancer and Inflammation Research, Institute of Molecular Medicine, University of Southern Denmark, 5000 Odense, Denmark; Lundbeckfonden Center of Excellence in Nanomedicine NanoCAN, University of Southern Denmark, 5000 Odense, Denmark
| | - Henrik J Ditzel
- Department of Cancer and Inflammation Research, Institute of Molecular Medicine, University of Southern Denmark, 5000 Odense, Denmark; Lundbeckfonden Center of Excellence in Nanomedicine NanoCAN, University of Southern Denmark, 5000 Odense, Denmark; Department of Oncology, Odense University Hospital, 5000 Odense, Denmark
| | - Jan Baumbach
- Department of Mathematics and Computer Science, University of Southern Denmark, 5230 Odense, Denmark; Max Planck Institute for Informatics, 66123 Saarbrucken, Germany
| |
Collapse
|
39
|
List M, Schmidt S, Christiansen H, Rehmsmeier M, Tan Q, Mollenhauer J, Baumbach J. Comprehensive analysis of high-throughput screens with HiTSeekR. Nucleic Acids Res 2016; 44:6639-48. [PMID: 27330136 PMCID: PMC5001608 DOI: 10.1093/nar/gkw554] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2016] [Accepted: 06/08/2016] [Indexed: 12/30/2022] Open
Abstract
High-throughput screening (HTS) is an indispensable tool for drug (target) discovery that currently lacks user-friendly software tools for the robust identification of putative hits from HTS experiments and for the interpretation of these findings in the context of systems biology. We developed HiTSeekR as a one-stop solution for chemical compound screens, siRNA knock-down and CRISPR/Cas9 knock-out screens, as well as microRNA inhibitor and -mimics screens. We chose three use cases that demonstrate the potential of HiTSeekR to fully exploit HTS screening data in quite heterogeneous contexts to generate novel hypotheses for follow-up experiments: (i) a genome-wide RNAi screen to uncover modulators of TNFα, (ii) a combined siRNA and miRNA mimics screen on vorinostat resistance and (iii) a small compound screen on KRAS synthetic lethality. HiTSeekR is publicly available at http://hitseekr.compbio.sdu.dk It is the first approach to close the gap between raw data processing, network enrichment and wet lab target generation for various HTS screen types.
Collapse
Affiliation(s)
- Markus List
- Lundbeckfonden Center of Excellence in Nanomedicine (NanoCAN), University of Southern Denmark, 5000 Odense, Denmark Molecular Oncology, Institute of Molecular Medicin (IMM), University of Southern Denmark, 5000 Odense, Denmark Clinical Institute (CI), University of Southern Denmark, 5000 Odense, Denmark
| | - Steffen Schmidt
- Lundbeckfonden Center of Excellence in Nanomedicine (NanoCAN), University of Southern Denmark, 5000 Odense, Denmark Molecular Oncology, Institute of Molecular Medicin (IMM), University of Southern Denmark, 5000 Odense, Denmark
| | - Helle Christiansen
- Lundbeckfonden Center of Excellence in Nanomedicine (NanoCAN), University of Southern Denmark, 5000 Odense, Denmark Molecular Oncology, Institute of Molecular Medicin (IMM), University of Southern Denmark, 5000 Odense, Denmark
| | - Marc Rehmsmeier
- Computational Biology Unit, Department of Informatics, University of Bergen, 5020 Bergen, Norway
| | - Qihua Tan
- Clinical Institute (CI), University of Southern Denmark, 5000 Odense, Denmark Epidemiology, Biostatistics and Biodemography, Institute of Public Health, University of Southern Denmark, 5000 Odense, Denmark
| | - Jan Mollenhauer
- Lundbeckfonden Center of Excellence in Nanomedicine (NanoCAN), University of Southern Denmark, 5000 Odense, Denmark Molecular Oncology, Institute of Molecular Medicin (IMM), University of Southern Denmark, 5000 Odense, Denmark
| | - Jan Baumbach
- Department of Mathematics and Computer Science (IMADA), University of Southern Denmark, 5230 Odense, Denmark Max Planck Institute for Informatics, 66123 Saarbrücken, Germany
| |
Collapse
|
40
|
List M, Alcaraz N, Dissing-Hansen M, Ditzel HJ, Mollenhauer J, Baumbach J. KeyPathwayMinerWeb: online multi-omics network enrichment. Nucleic Acids Res 2016; 44:W98-W104. [PMID: 27150809 PMCID: PMC4987922 DOI: 10.1093/nar/gkw373] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2016] [Accepted: 04/25/2016] [Indexed: 01/25/2023] Open
Abstract
We present KeyPathwayMinerWeb, the first online platform for de novo pathway enrichment analysis directly in the browser. Given a biological interaction network (e.g. protein–protein interactions) and a series of molecular profiles derived from one or multiple OMICS studies (gene expression, for instance), KeyPathwayMiner extracts connected sub-networks containing a high number of active or differentially regulated genes (proteins, metabolites) in the molecular profiles. The web interface at (http://keypathwayminer.compbio.sdu.dk) implements all core functionalities of the KeyPathwayMiner tool set such as data integration, input of background knowledge, batch runs for parameter optimization and visualization of extracted pathways. In addition to an intuitive web interface, we also implemented a RESTful API that now enables other online developers to integrate network enrichment as a web service into their own platforms.
Collapse
Affiliation(s)
- Markus List
- Department of Mathematics and Computer Science, University of Southern Denmark, 5230 Odense, Denmark Lundbeckfonden Center of Excellence in Nanomedicine NanoCAN, University of Southern Denmark, 5000 Odense, Denmark Institute of Molecular Medicine, University of Southern Denmark, 5000 Odense, Denmark Institute of Clinical Research, University of Southern Denmark, 5000 Odense, Denmark Max Planck Institute for Informatics, 66123 Saarbrücken, Germany
| | - Nicolas Alcaraz
- Department of Mathematics and Computer Science, University of Southern Denmark, 5230 Odense, Denmark Institute of Molecular Medicine, University of Southern Denmark, 5000 Odense, Denmark
| | - Martin Dissing-Hansen
- Department of Mathematics and Computer Science, University of Southern Denmark, 5230 Odense, Denmark
| | - Henrik J Ditzel
- Lundbeckfonden Center of Excellence in Nanomedicine NanoCAN, University of Southern Denmark, 5000 Odense, Denmark Institute of Molecular Medicine, University of Southern Denmark, 5000 Odense, Denmark Department of Oncology, Odense University Hospital, 5000 Odense, Denmark
| | - Jan Mollenhauer
- Lundbeckfonden Center of Excellence in Nanomedicine NanoCAN, University of Southern Denmark, 5000 Odense, Denmark Institute of Molecular Medicine, University of Southern Denmark, 5000 Odense, Denmark
| | - Jan Baumbach
- Department of Mathematics and Computer Science, University of Southern Denmark, 5230 Odense, Denmark Max Planck Institute for Informatics, 66123 Saarbrücken, Germany
| |
Collapse
|
41
|
Fonseca A, Gubitoso MD, Reis MS, de Souza SJ, Barrera J. A New Approach for Identification of Cancer-related Pathways using Protein Networks and Genomic Data. Cancer Inform 2016; 14:139-49. [PMID: 27158220 PMCID: PMC4854218 DOI: 10.4137/cin.s30800] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2015] [Revised: 02/11/2016] [Accepted: 02/17/2016] [Indexed: 11/11/2022] Open
Abstract
Cancer cells have anomalous development and proliferation due to disturbances in their control systems. The study of the behavior of cellular control system requires high-throughput dynamical data. Unfortunately, this type of data is not largely available. This fact motivates the main issue of this article: how to use static omics data and available biological knowledge to get new information about the elements of the control system in cancer cells. Two important measures to access the state of the cellular control system are the gene expression profile and the signaling pathways. This article uses a combination of these two static omics data to gain insights on the states of a cancer cell. To extract information from this kind of data, a statistical computational model was formalized and implemented. In order to exemplify the application of some aspects of the developed conceptual framework, we verified the hypothesis that different types of cancer cells have different disturbed signaling pathways. To this end, we developed a method that recovers small protein networks, called motifs, which are differentially represented in some subtypes of breast cancer. These differentially represented motifs are enriched with specific gene ontologies as well as with new putative cancer genes.
Collapse
Affiliation(s)
| | | | | | | | - Junior Barrera
- Institute of Mathematics and Statistics, USP, São Paulo, Brazil.; LETA, CeTICS, Butantan Institute, São Paulo, Brazil
| |
Collapse
|
42
|
Sergushichev AA, Loboda AA, Jha AK, Vincent EE, Driggers EM, Jones RG, Pearce EJ, Artyomov MN. GAM: a web-service for integrated transcriptional and metabolic network analysis. Nucleic Acids Res 2016; 44:W194-200. [PMID: 27098040 PMCID: PMC4987878 DOI: 10.1093/nar/gkw266] [Citation(s) in RCA: 65] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2016] [Accepted: 04/04/2016] [Indexed: 01/14/2023] Open
Abstract
Novel techniques for high-throughput steady-state metabolomic profiling yield information about changes of nearly thousands of metabolites. Such metabolomic profiles, when analyzed together with transcriptional profiles, can reveal novel insights about underlying biological processes. While a number of conceptual approaches have been developed for data integration, easily accessible tools for integrated analysis of mammalian steady-state metabolomic and transcriptional data are lacking. Here we present GAM (‘genes and metabolites’): a web-service for integrated network analysis of transcriptional and steady-state metabolomic data focused on identification of the most changing metabolic subnetworks between two conditions of interest. In the web-service, we have pre-assembled metabolic networks for humans, mice, Arabidopsis and yeast and adapted exact solvers for an optimal subgraph search to work in the context of these metabolic networks. The output is the most regulated metabolic subnetwork of size controlled by false discovery rate parameters. The subnetworks are then visualized online and also can be downloaded in Cytoscape format for subsequent processing. The web-service is available at: https://artyomovlab.wustl.edu/shiny/gam/
Collapse
Affiliation(s)
- Alexey A Sergushichev
- Computer Technologies Department, ITMO University, Saint Petersburg, 197101, Russia Department of Pathology & Immunology, Washington University in St. Louis, St.Louis, MO 63110, USA
| | - Alexander A Loboda
- Computer Technologies Department, ITMO University, Saint Petersburg, 197101, Russia
| | | | - Emma E Vincent
- Goodman Cancer Research Centre, McGill University, Montreal, QC H3A 1A3, Canada Department of Physiology, McGill University, Montreal, QC H3G 1Y6, Canada
| | | | - Russell G Jones
- Goodman Cancer Research Centre, McGill University, Montreal, QC H3A 1A3, Canada Department of Physiology, McGill University, Montreal, QC H3G 1Y6, Canada
| | - Edward J Pearce
- Department of Immunometabolism, Max Planck Institute of Immunobiology and Epigenetics, Freiburg, D-79108, Germany
| | - Maxim N Artyomov
- Computer Technologies Department, ITMO University, Saint Petersburg, 197101, Russia Department of Pathology & Immunology, Washington University in St. Louis, St.Louis, MO 63110, USA
| |
Collapse
|
43
|
Tuncbag N, Gosline SJC, Kedaigle A, Soltis AR, Gitter A, Fraenkel E. Network-Based Interpretation of Diverse High-Throughput Datasets through the Omics Integrator Software Package. PLoS Comput Biol 2016; 12:e1004879. [PMID: 27096930 PMCID: PMC4838263 DOI: 10.1371/journal.pcbi.1004879] [Citation(s) in RCA: 91] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2015] [Accepted: 03/23/2016] [Indexed: 02/07/2023] Open
Abstract
High-throughput, ‘omic’ methods provide sensitive measures of biological responses to perturbations. However, inherent biases in high-throughput assays make it difficult to interpret experiments in which more than one type of data is collected. In this work, we introduce Omics Integrator, a software package that takes a variety of ‘omic’ data as input and identifies putative underlying molecular pathways. The approach applies advanced network optimization algorithms to a network of thousands of molecular interactions to find high-confidence, interpretable subnetworks that best explain the data. These subnetworks connect changes observed in gene expression, protein abundance or other global assays to proteins that may not have been measured in the screens due to inherent bias or noise in measurement. This approach reveals unannotated molecular pathways that would not be detectable by searching pathway databases. Omics Integrator also provides an elegant framework to incorporate not only positive data, but also negative evidence. Incorporating negative evidence allows Omics Integrator to avoid unexpressed genes and avoid being biased toward highly-studied hub proteins, except when they are strongly implicated by the data. The software is comprised of two individual tools, Garnet and Forest, that can be run together or independently to allow a user to perform advanced integration of multiple types of high-throughput data as well as create condition-specific subnetworks of protein interactions that best connect the observed changes in various datasets. It is available at http://fraenkel.mit.edu/omicsintegrator and on GitHub at https://github.com/fraenkel-lab/OmicsIntegrator.
Collapse
Affiliation(s)
- Nurcan Tuncbag
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Sara J. C. Gosline
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Amanda Kedaigle
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Anthony R. Soltis
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Anthony Gitter
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Ernest Fraenkel
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- * E-mail:
| |
Collapse
|
44
|
Misra BB, van der Hooft JJJ. Updates in metabolomics tools and resources: 2014-2015. Electrophoresis 2015; 37:86-110. [DOI: 10.1002/elps.201500417] [Citation(s) in RCA: 100] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2015] [Revised: 10/04/2015] [Accepted: 10/05/2015] [Indexed: 12/12/2022]
Affiliation(s)
- Biswapriya B. Misra
- Department of Biology, Genetics Institute; University of Florida; Gainesville FL USA
| | | |
Collapse
|
45
|
Gligorijević V, Pržulj N. Methods for biological data integration: perspectives and challenges. J R Soc Interface 2015; 12:20150571. [PMID: 26490630 PMCID: PMC4685837 DOI: 10.1098/rsif.2015.0571] [Citation(s) in RCA: 157] [Impact Index Per Article: 17.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2015] [Accepted: 09/25/2015] [Indexed: 12/17/2022] Open
Abstract
Rapid technological advances have led to the production of different types of biological data and enabled construction of complex networks with various types of interactions between diverse biological entities. Standard network data analysis methods were shown to be limited in dealing with such heterogeneous networked data and consequently, new methods for integrative data analyses have been proposed. The integrative methods can collectively mine multiple types of biological data and produce more holistic, systems-level biological insights. We survey recent methods for collective mining (integration) of various types of networked biological data. We compare different state-of-the-art methods for data integration and highlight their advantages and disadvantages in addressing important biological problems. We identify the important computational challenges of these methods and provide a general guideline for which methods are suited for specific biological problems, or specific data types. Moreover, we propose that recent non-negative matrix factorization-based approaches may become the integration methodology of choice, as they are well suited and accurate in dealing with heterogeneous data and have many opportunities for further development.
Collapse
Affiliation(s)
| | - Nataša Pržulj
- Department of Computing, Imperial College London, London SW7 2AZ, UK
| |
Collapse
|
46
|
Tsoi LC, Elder JT, Abecasis GR. Graphical algorithm for integration of genetic and biological data: proof of principle using psoriasis as a model. ACTA ACUST UNITED AC 2014; 31:1243-9. [PMID: 25480373 DOI: 10.1093/bioinformatics/btu799] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2014] [Accepted: 11/26/2014] [Indexed: 01/17/2023]
Abstract
MOTIVATION Pathway analysis to reveal biological mechanisms for results from genetic association studies have great potential to better understand complex traits with major human disease impact. However, current approaches have not been optimized to maximize statistical power to identify enriched functions/pathways, especially when the genetic data derives from studies using platforms (e.g. Immunochip and Metabochip) customized to have pre-selected markers from previously identified top-rank loci. We present here a novel approach, called Minimum distance-based Enrichment Analysis for Genetic Association (MEAGA), with the potential to address both of these important concerns. RESULTS MEAGA performs enrichment analysis using graphical algorithms to identify sub-graphs among genes and measure their closeness in interaction database. It also incorporates a statistic summarizing the numbers and total distances of the sub-graphs, depicting the overlap between observed genetic signals and defined function/pathway gene-sets. MEAGA uses sampling technique to approximate empirical and multiple testing-corrected P-values. We show in simulation studies that MEAGA is more powerful compared to count-based strategies in identifying disease-associated functions/pathways, and the increase in power is influenced by the shortest distances among associated genes in the interactome. We applied MEAGA to the results of a meta-analysis of psoriasis using Immunochip datasets, and showed that associated genes are significantly enriched in immune-related functions and closer with each other in the protein-protein interaction network. AVAILABILITY AND IMPLEMENTATION http://genome.sph.umich.edu/wiki/MEAGA CONTACT: : tsoi.teen@gmail.com or goncalo@umich.edu SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Lam C Tsoi
- Department of Biostatistics, Center for Statistical Genetics, University of Michigan, Ann Arbor, MI, USA, Department of Dermatology, University of Michigan, Ann Arbor, MI, USA, and Ann Arbor Veterans Affairs Hospital, Ann Arbor, MI, USA
| | - James T Elder
- Department of Biostatistics, Center for Statistical Genetics, University of Michigan, Ann Arbor, MI, USA, Department of Dermatology, University of Michigan, Ann Arbor, MI, USA, and Ann Arbor Veterans Affairs Hospital, Ann Arbor, MI, USA Department of Biostatistics, Center for Statistical Genetics, University of Michigan, Ann Arbor, MI, USA, Department of Dermatology, University of Michigan, Ann Arbor, MI, USA, and Ann Arbor Veterans Affairs Hospital, Ann Arbor, MI, USA
| | - Goncalo R Abecasis
- Department of Biostatistics, Center for Statistical Genetics, University of Michigan, Ann Arbor, MI, USA, Department of Dermatology, University of Michigan, Ann Arbor, MI, USA, and Ann Arbor Veterans Affairs Hospital, Ann Arbor, MI, USA
| |
Collapse
|