1
Stock M, Popp N, Fiorentino J, Scialdone A. Topological benchmarking of algorithms to infer gene regulatory networks from single-cell RNA-seq data. Bioinformatics 2024; 40:btae267. [PMID: 38627250 PMCID: PMC11096270 DOI: 10.1093/bioinformatics/btae267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/16/2023] [Revised: 02/28/2024] [Accepted: 04/16/2024] [Indexed: 05/18/2024] Open
Abstract
MOTIVATION In recent years, many algorithms for inferring gene regulatory networks from single-cell transcriptomic data have been published. Several studies have evaluated their accuracy in estimating the presence of an interaction between pairs of genes. However, these benchmarking analyses do not quantify the algorithms' ability to capture structural properties of networks, which are fundamental, e.g., for studying the robustness of a gene network to external perturbations. Here, we devise a three-step benchmarking pipeline called STREAMLINE that quantifies the ability of algorithms to capture topological properties of networks and identify hubs. RESULTS To this aim, we use data simulated from different types of networks as well as experimental data from three different organisms. We apply our benchmarking pipeline to four inference algorithms and provide guidance on which algorithm should be used depending on the global network property of interest. AVAILABILITY AND IMPLEMENTATION STREAMLINE is available at https://github.com/ScialdoneLab/STREAMLINE. The data generated in this study are available at https://doi.org/10.5281/zenodo.10710444.
Affiliation(s)
- Marco Stock
- Institute of Epigenetics and Stem Cells, Helmholtz Zentrum München—German Research Center for Environmental Health, Munich 81377, Germany
- Institute of Functional Epigenetics, Helmholtz Zentrum München—German Research Center for Environmental Health, Munich 85764, Germany
- Institute of Computational Biology, Helmholtz Zentrum München—German Research Center for Environmental Health, Munich 85764, Germany
- TUM School of Life Sciences Weihenstephan, Technical University of Munich, Munich 85354, Germany
- Niclas Popp
- Institute of Epigenetics and Stem Cells, Helmholtz Zentrum München—German Research Center for Environmental Health, Munich 81377, Germany
- Institute of Functional Epigenetics, Helmholtz Zentrum München—German Research Center for Environmental Health, Munich 85764, Germany
- Institute of Computational Biology, Helmholtz Zentrum München—German Research Center for Environmental Health, Munich 85764, Germany
- Jonathan Fiorentino
- Institute of Epigenetics and Stem Cells, Helmholtz Zentrum München—German Research Center for Environmental Health, Munich 81377, Germany
- Institute of Functional Epigenetics, Helmholtz Zentrum München—German Research Center for Environmental Health, Munich 85764, Germany
- Institute of Computational Biology, Helmholtz Zentrum München—German Research Center for Environmental Health, Munich 85764, Germany
- Antonio Scialdone
- Institute of Epigenetics and Stem Cells, Helmholtz Zentrum München—German Research Center for Environmental Health, Munich 81377, Germany
- Institute of Functional Epigenetics, Helmholtz Zentrum München—German Research Center for Environmental Health, Munich 85764, Germany
- Institute of Computational Biology, Helmholtz Zentrum München—German Research Center for Environmental Health, Munich 85764, Germany
2
Truong TTT, Liu ZSJ, Panizzutti B, Kim JH, Dean OM, Berk M, Walder K. Network-based drug repurposing for schizophrenia. Neuropsychopharmacology 2024; 49:983-992. [PMID: 38321095 PMCID: PMC11039639 DOI: 10.1038/s41386-024-01805-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/08/2023] [Revised: 01/10/2024] [Accepted: 01/12/2024] [Indexed: 02/08/2024]
Abstract
Despite recent progress, the challenges in drug discovery for schizophrenia persist. However, computational drug repurposing has gained popularity as it leverages the wealth of expanding biomedical databases. Network analyses provide a comprehensive understanding of transcription factor (TF) regulatory effects through gene regulatory networks, which capture the interactions between TFs and target genes by integrating various lines of evidence. Using the PANDA algorithm, we examined the topological variances in TF-gene regulatory networks between individuals with schizophrenia and healthy controls. This algorithm incorporates binding motifs, protein interactions, and gene co-expression data. To identify these differences, we subtracted the edge weights of the healthy control network from those of the schizophrenia network. The resulting differential network was then analysed using the CLUEreg tool in the GRAND database. This tool employs differential network signatures to identify drugs that potentially target the gene signature associated with the disease. Our analysis utilised a large RNA-seq dataset comprising 532 post-mortem brain samples from the CommonMind project. We constructed co-expression gene regulatory networks for both schizophrenia cases and healthy control subjects, incorporating 15,831 genes and 413 overlapping TFs. Through drug repurposing, we identified 18 promising candidates for repurposing as potential treatments for schizophrenia. The analysis of TF-gene regulatory networks revealed that the TFs in schizophrenia predominantly regulate pathways associated with energy metabolism, immune response, cell adhesion, and thyroid hormone signalling. These pathways represent significant targets for therapeutic intervention. The identified drug repurposing candidates likely act through TF-targeted pathways. 
These promising candidates, particularly those with preclinical evidence such as rimonabant and kaempferol, warrant further investigation into their potential mechanisms of action and efficacy in alleviating the symptoms of schizophrenia.
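The subtraction step described in this abstract can be illustrated with a minimal sketch. It assumes each network is stored as a mapping from a (TF, gene) pair to an edge weight. The function names, the toy TF and gene labels, and the weights are hypothetical; none of them come from PANDA, CLUEreg, or the study's data.

```python
def differential_network(case_edges, control_edges):
    """Return case-minus-control weights for edges present in both networks."""
    shared = case_edges.keys() & control_edges.keys()
    return {edge: case_edges[edge] - control_edges[edge] for edge in shared}


def top_changed_edges(diff, k=3):
    """Rank edges by the absolute size of their weight change, largest first."""
    return sorted(diff.items(), key=lambda item: abs(item[1]), reverse=True)[:k]


# Toy (TF, gene) -> weight mappings; purely illustrative values.
case = {
    ("TF_A", "gene_1"): 1.5,
    ("TF_A", "gene_2"): 0.25,
    ("TF_B", "gene_2"): 2.0,
    ("TF_B", "gene_3"): 0.5,
}
control = {
    ("TF_A", "gene_1"): 0.5,
    ("TF_A", "gene_2"): 0.25,
    ("TF_B", "gene_2"): -1.0,
    ("TF_B", "gene_3"): 0.25,
}

diff = differential_network(case, control)
ranked = top_changed_edges(diff, k=2)
```

In this toy case the edge whose weight changes most is TF_B to gene_2. A real analysis would pass the full differential signature to a downstream tool rather than inspect a handful of edges.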
Affiliation(s)
- Trang T T Truong
- Deakin University, IMPACT, The Institute for Mental and Physical Health and Clinical Translation, School of Medicine, Geelong, Australia
- Zoe S J Liu
- Deakin University, IMPACT, The Institute for Mental and Physical Health and Clinical Translation, School of Medicine, Geelong, Australia
- Bruna Panizzutti
- Deakin University, IMPACT, The Institute for Mental and Physical Health and Clinical Translation, School of Medicine, Geelong, Australia
- Jee Hyun Kim
- Deakin University, IMPACT, The Institute for Mental and Physical Health and Clinical Translation, School of Medicine, Geelong, Australia
- Florey Institute of Neuroscience and Mental Health, Parkville, Australia
- Olivia M Dean
- Deakin University, IMPACT, The Institute for Mental and Physical Health and Clinical Translation, School of Medicine, Geelong, Australia
- Florey Institute of Neuroscience and Mental Health, Parkville, Australia
- Michael Berk
- Deakin University, IMPACT, The Institute for Mental and Physical Health and Clinical Translation, School of Medicine, Geelong, Australia
- Orygen, The National Centre of Excellence in Youth Mental Health, Centre for Youth Mental Health, The Florey Institute for Neuroscience and Mental Health and the Department of Psychiatry, University of Melbourne, Parkville, 3010, Australia
- Ken Walder
- Deakin University, IMPACT, The Institute for Mental and Physical Health and Clinical Translation, School of Medicine, Geelong, Australia.
3
Truong TTT, Liu ZSJ, Panizzutti B, Dean OM, Berk M, Kim JH, Walder K. Use of gene regulatory network analysis to repurpose drugs to treat bipolar disorder. J Affect Disord 2024; 350:230-239. [PMID: 38190860 DOI: 10.1016/j.jad.2024.01.034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/25/2023] [Revised: 12/03/2023] [Accepted: 01/03/2024] [Indexed: 01/10/2024]
Abstract
BACKGROUND Bipolar disorder (BD) presents significant challenges in drug discovery, necessitating alternative approaches. Drug repurposing, leveraging computational techniques and expanding biomedical data, holds promise for identifying novel treatment strategies. METHODS This study utilized gene regulatory networks (GRNs) to identify significant regulatory changes in BD, using network-based signatures for drug repurposing. Employing the PANDA algorithm, we investigated the variations in transcription factor-GRNs between individuals with BD and unaffected individuals, incorporating binding motifs, protein interactions, and gene co-expression data. The differences in edge weights between BD and controls were then used as differential network signatures to identify drugs potentially targeting the disease-associated gene signature, employing the CLUEreg tool in the GRAND database. RESULTS Using a large RNA-seq dataset of 216 post-mortem brain samples from the CommonMind consortium, we constructed GRNs based on co-expression for individuals with BD and unaffected controls, involving 15,271 genes and 405 TFs. Our analysis highlighted significant influences of these TFs on immune response, energy metabolism, cell signalling, and cell adhesion pathways in the disorder. By employing drug repurposing, we identified 10 promising candidates potentially repurposed as BD treatments. LIMITATIONS Non-drug-naïve transcriptomics data, bulk analysis of BD samples, potential bias of GRNs towards well-studied genes. CONCLUSIONS Further investigation into repurposing candidates, especially those with preclinical evidence supporting their efficacy, like kaempferol and pramocaine, is warranted to understand their mechanisms of action and effectiveness in treating BD. Additionally, novel targets such as PARP1 and A2b offer opportunities for future research on their relevance to the disorder.
Affiliation(s)
- Trang T T Truong
- Deakin University, IMPACT, The Institute for Mental and Physical Health and Clinical Translation, School of Medicine, Geelong, Australia
- Zoe S J Liu
- Deakin University, IMPACT, The Institute for Mental and Physical Health and Clinical Translation, School of Medicine, Geelong, Australia
- Bruna Panizzutti
- Deakin University, IMPACT, The Institute for Mental and Physical Health and Clinical Translation, School of Medicine, Geelong, Australia
- Olivia M Dean
- Deakin University, IMPACT, The Institute for Mental and Physical Health and Clinical Translation, School of Medicine, Geelong, Australia; Florey Institute of Neuroscience and Mental Health, Parkville, Australia
- Michael Berk
- Deakin University, IMPACT, The Institute for Mental and Physical Health and Clinical Translation, School of Medicine, Geelong, Australia; Florey Institute of Neuroscience and Mental Health, Parkville, Australia; Orygen, The National Centre of Excellence in Youth Mental Health, Centre for Youth Mental Health, The Florey Institute for Neuroscience and Mental Health and the Department of Psychiatry, University of Melbourne, Parkville 3010, Australia
- Jee Hyun Kim
- Deakin University, IMPACT, The Institute for Mental and Physical Health and Clinical Translation, School of Medicine, Geelong, Australia; Florey Institute of Neuroscience and Mental Health, Parkville, Australia
- Ken Walder
- Deakin University, IMPACT, The Institute for Mental and Physical Health and Clinical Translation, School of Medicine, Geelong, Australia.
4
Kumar S, Sharma N, Sopory SK, Sanan-Mishra N. miRNAs and genes as molecular regulators of rice grain morphology and yield. PLANT PHYSIOLOGY AND BIOCHEMISTRY : PPB 2024; 207:108363. [PMID: 38281341 DOI: 10.1016/j.plaphy.2024.108363] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/03/2023] [Revised: 12/07/2023] [Accepted: 01/10/2024] [Indexed: 01/30/2024]
Abstract
Rice is one of the most consumed crops worldwide and the genetic and molecular basis of its grain yield attributes is well understood. Various studies have identified different yield-related parameters in rice that are regulated by microRNAs (miRNAs). MiRNAs are endogenous small non-coding RNAs that silence gene expression during or after transcription. They control a variety of biological or genetic activities in plants including growth, development and response to stress. In this review, we have summarized the available information on the genetic control of panicle architecture and grain yield (number and morphology) in rice. The miRNA nodes that are associated with their regulation are also described while focussing on the central role of the miR156-SPL node to highlight the co-regulation of two master regulators that determine the fate of panicle development. Since abiotic stresses are known to negatively affect yield, the impact of abiotic stress-induced alterations on the levels of these miRNAs is also discussed to highlight the potential of miRNAs for regulating crop yields.
Affiliation(s)
- Sudhir Kumar
- Plant RNAi Biology Group, International Centre for Genetic Engineering and Biotechnology, New Delhi, India.
- Neha Sharma
- Plant RNAi Biology Group, International Centre for Genetic Engineering and Biotechnology, New Delhi, India.
- Sudhir K Sopory
- Plant RNAi Biology Group, International Centre for Genetic Engineering and Biotechnology, New Delhi, India.
- Neeti Sanan-Mishra
- Plant RNAi Biology Group, International Centre for Genetic Engineering and Biotechnology, New Delhi, India.
5
Brouard C, Mourad R, Vialaneix N. Should we really use graph neural networks for transcriptomic prediction? Brief Bioinform 2024; 25:bbae027. [PMID: 38349060 PMCID: PMC10939369 DOI: 10.1093/bib/bbae027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/27/2023] [Revised: 12/20/2023] [Accepted: 01/17/2024] [Indexed: 02/15/2024] Open
Abstract
The recent development of deep learning methods has undoubtedly led to great improvement in various machine learning tasks, especially in prediction tasks. This type of method has also been adapted to answer various problems in bioinformatics, including automatic genome annotation, artificial genome generation or phenotype prediction. In particular, a specific type of deep learning method, called graph neural network (GNN), has repeatedly been reported as a good candidate to predict phenotypes from gene expression because of its ability to embed information on gene regulation or co-expression through the use of a gene network. However, to date, no complete and reproducible benchmark has been performed to analyze the trade-off between cost and benefit of this approach compared to more standard (and simpler) machine learning methods. In this article, we provide such a benchmark, based on clear and comparable policies to evaluate the different methods on several datasets. Our conclusion is that GNN rarely provides a real improvement in prediction performance, especially when compared to the computation effort required by the methods. Our findings on a limited but controlled simulated dataset show that this could be explained by the limited quality or predictive power of the input biological gene network itself.
Affiliation(s)
- Céline Brouard
- Université Fédérale de Toulouse, INRAE, MIAT, 31326 Castanet-Tolosan, France
- Raphaël Mourad
- Université Fédérale de Toulouse, INRAE, MIAT, 31326 Castanet-Tolosan, France
- Université Paul Sabatier, 31062 Toulouse, France
- Nathalie Vialaneix
- Université Fédérale de Toulouse, INRAE, MIAT, 31326 Castanet-Tolosan, France
6
Eble H, Joswig M, Lamberti L, Ludington WB. Master regulators of biological systems in higher dimensions. Proc Natl Acad Sci U S A 2023; 120:e2300634120. [PMID: 38096409 PMCID: PMC10743376 DOI: 10.1073/pnas.2300634120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/12/2023] [Accepted: 10/23/2023] [Indexed: 12/18/2023] Open
Abstract
A longstanding goal of biology is to identify the key genes and species that critically impact evolution, ecology, and health. Network analysis has revealed keystone species that regulate ecosystems and master regulators that regulate cellular genetic networks. Yet these studies have focused on pairwise biological interactions, which can be affected by the context of genetic background and other species present, generating higher-order interactions. The important regulators of higher-order interactions are unstudied. To address this, we applied a high-dimensional geometry approach that quantifies epistasis in a fitness landscape to ask how individual genes and species influence the interactions in the rest of the biological network. We then generated and also reanalyzed 5-dimensional datasets (two genetic, two microbiome). We identified key genes (e.g., the rbs locus and pykF) and species (e.g., Lactobacilli) that control the interactions of many other genes and species. These higher-order master regulators can induce or suppress evolutionary and ecological diversification by controlling the topography of the fitness landscape. Thus, we provide a method and mathematical justification for exploration of biological networks in higher dimensions.
Affiliation(s)
- Holger Eble
- Chair of Discrete Mathematics/Geometry, Technical University Berlin, Berlin 10623, Germany
- Michael Joswig
- Chair of Discrete Mathematics/Geometry, Technical University Berlin, Berlin 10623, Germany
- Max Planck Institute for Mathematics in the Sciences, Leipzig 04103, Germany
- Lisa Lamberti
- Department of Biosystems Science and Engineering, Federal Institute of Technology (ETH Zürich), Basel 4058, Switzerland
- Swiss Institute of Bioinformatics, Basel 4058, Switzerland
- William B. Ludington
- Department of Biosphere Sciences and Engineering, Carnegie Institution for Science, Baltimore, MD 21218
- Department of Biology, Johns Hopkins University, Baltimore, MD 21218
7
Raharinirina NA, Peppert F, von Kleist M, Schütte C, Sunkara V. Inferring gene regulatory networks from single-cell RNA-seq temporal snapshot data requires higher-order moments. PATTERNS 2021; 2:100332. [PMID: 34553172 PMCID: PMC8441581 DOI: 10.1016/j.patter.2021.100332] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 02/03/2021] [Revised: 02/23/2021] [Accepted: 07/22/2021] [Indexed: 11/30/2022]
Abstract
Single-cell RNA sequencing (scRNA-seq) has become ubiquitous in biology. Recently, there has been a push for using scRNA-seq snapshot data to infer the underlying gene regulatory networks (GRNs) steering cellular function. To date, this aspiration remains unrealized due to technical and computational challenges. In this work we focus on the latter, which is under-represented in the literature. We took a systemic approach by subdividing the GRN inference into three fundamental components: data pre-processing, feature extraction, and inference. We observed that the regulatory signature is captured in the statistical moments of scRNA-seq data and requires computationally intensive minimization solvers to extract it. Furthermore, current data pre-processing might not conserve these statistical moments. Although our moment-based approach is a didactic tool for understanding the different compartments of GRN inference, this line of thinking, finding computationally feasible multi-dimensional statistics of data, is imperative for designing GRN inference methods.
- Single-cell RNA-seq temporal snapshot data for detecting regulation
- Challenges in data pre-processing, feature extraction, and network inference for GRNs
- Encoding of regulatory information in higher-order raw moments
- Non-linear least-squares inference for temporal scRNA-seq snapshot data
Single-cell RNA sequencing (scRNA-seq) has become ubiquitous in biology. Recently, there has been a push for using scRNA-seq snapshot data to infer the underlying gene regulatory networks (GRNs) steering cellular function. A recent benchmark of 12 GRN methods demonstrated that the algorithms struggled to predict the ground-truth GRNs and speculated that the low performance was due to the insufficient resolution in the scRNA-seq data. Rather than proposing another method, this paper focuses on how to decompose a GRN problem into three subproblems (pre-processing, feature extraction, and inference), so that the gene regulatory information is preserved in each step. Subsequently, we discuss how to best approach each of the three subproblems.
Affiliation(s)
- Felix Peppert
- Explainable A.I. for Biology, Zuse Institute Berlin, 14195 Berlin, Germany
- Max von Kleist
- MF1 Bioinformatics, Methods Development and Research Infrastructure, Robert Koch Institute, 13353 Berlin, Germany
- Christof Schütte
- Mathematics of Complex Systems, Zuse Institute Berlin, 14195 Berlin, Germany; Department of Mathematics and Computer Science, Freie Universität Berlin, 14195 Berlin, Germany
- Vikram Sunkara
- Mathematics of Complex Systems, Zuse Institute Berlin, 14195 Berlin, Germany; Explainable A.I. for Biology, Zuse Institute Berlin, 14195 Berlin, Germany
8
Mahapatra S, Bhuyan R, Das J, Swarnkar T. Integrated multiplex network based approach for hub gene identification in oral cancer. Heliyon 2021; 7:e07418. [PMID: 34258466 PMCID: PMC8258848 DOI: 10.1016/j.heliyon.2021.e07418] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/12/2020] [Revised: 01/27/2021] [Accepted: 06/23/2021] [Indexed: 02/01/2023] Open
Abstract
Background: The incidence of oral cancer (OC) is high in Asian countries, where it goes undetected at its early stage. The study of genetics, especially genetic networks, holds great promise in this endeavor. Hub genes in a genetic network are prominent in regulating the whole network structure of genes. Thus identification of such genes related to specific cancer types can help in reducing the gap in OC prognosis. Methods: Traditional study of network biology is unable to decipher the inter-dependencies within and across diverse biological networks. A multiplex network provides a powerful representation of such systems and encodes much richer information than isolated networks. In this work, we focused on the entire multiplex structure of the genetic network integrating the gene expression profile and DNA methylation profile for OC. Further, hub genes were identified by considering their connectivity in the multiplex structure and the respective protein-protein interaction (PPI) network as well. Results: 46 hub genes were inferred in our approach with a high prediction accuracy (96%), outstanding Matthews correlation coefficient (93%) and significant biological implications. Among them, genes PIK3CG, PIK3R5, MYH7, CDC20 and CCL4 were differentially expressed and predominantly enriched in molecular cascades specific to OC. Conclusions: The identified hub genes in this work carry ontological signatures specific to cancer, which may further facilitate improved understanding of the tumorigenesis process and the underlying molecular events. The results indicate the effectiveness of our integrated multiplex network approach for hub gene identification. This work opens an innovative research route for multi-omics biological data analysis.
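For readers unfamiliar with the two metrics quoted in the Results, the sketch below computes accuracy and the Matthews correlation coefficient from a binary confusion matrix. The confusion-matrix counts are invented for illustration and are not the study's data. Returning 0.0 when the denominator vanishes is a common convention that is assumed here, not something stated in the abstract.

```python
import math


def accuracy(tp, tn, fp, fn):
    """Fraction of all predictions that are correct."""
    return (tp + tn) / (tp + tn + fp + fn)


def matthews_corrcoef(tp, tn, fp, fn):
    """Matthews correlation coefficient for a 2x2 confusion matrix."""
    numerator = tp * tn - fp * fn
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Assumed convention: report 0.0 when any marginal total is zero.
    return numerator / denominator if denominator else 0.0


# Hypothetical counts: 45 true positives, 45 true negatives, 5 of each error.
acc = accuracy(tp=45, tn=45, fp=5, fn=5)
mcc = matthews_corrcoef(tp=45, tn=45, fp=5, fn=5)
```

With these toy counts the accuracy is 0.9 while the coefficient is 0.8, which shows why the two figures are usually reported side by side.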
Affiliation(s)
- S. Mahapatra
- Department of Computer Application, Siksha O Anusandhan Deemed to be University, Bhubaneswar, India
- R. Bhuyan
- Department of Oral Pathology & Microbiology, Siksha O Anusandhan Deemed to be University, Bhubaneswar, India
- J. Das
- Centre for Genomics & Biomedical Informatics, Siksha O Anusandhan Deemed to be University, Bhubaneswar, India
- T. Swarnkar
- Department of Computer Application, Siksha O Anusandhan Deemed to be University, Bhubaneswar, India
9
Internetwork connectivity of molecular networks across species of life. Sci Rep 2021; 11:1168. [PMID: 33441907 PMCID: PMC7806680 DOI: 10.1038/s41598-020-80745-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 09/18/2020] [Accepted: 12/23/2020] [Indexed: 01/29/2023] Open
Abstract
Molecular interactions are studied as independent networks in systems biology. However, molecular networks do not exist independently of each other. In a network of networks approach (called multiplex), we study the joint organization of transcriptional regulatory network (TRN) and protein-protein interaction (PPI) network. We find that TRN and PPI are non-randomly coupled across five different eukaryotic species. Gene degrees in TRN (number of downstream genes) are positively correlated with protein degrees in PPI (number of interacting protein partners). Gene-gene and protein-protein interactions in TRN and PPI, respectively, also non-randomly overlap. These design principles are conserved across the five eukaryotic species. Robustness of the TRN-PPI multiplex is dependent on this coupling. Functionally important genes and proteins, such as essential, disease-related and those interacting with pathogen proteins, are preferentially situated in important parts of the human multiplex with highly overlapping interactions. We unveil the multiplex architecture of TRN and PPI. Multiplex architecture may thus define a general framework for studying molecular networks. This approach may uncover the building blocks of the hierarchical organization of molecular interactions.
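The degree coupling described in this abstract can be sketched as follows: count each gene's downstream targets in a directed TRN, count its partners in an undirected PPI network, and correlate the two. The abstract does not say which correlation statistic was used, so Pearson's r is an assumption here, and the edge lists and gene labels are toy values.

```python
import math
from collections import Counter


def trn_out_degree(edges):
    """Number of downstream genes per regulator in a directed edge list."""
    return Counter(source for source, _target in edges)


def ppi_degree(edges):
    """Number of interaction partners per protein in an undirected edge list."""
    degree = Counter()
    for a, b in edges:
        degree[a] += 1
        degree[b] += 1
    return degree


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant degrees")
    return cov / math.sqrt(var_x * var_y)


# Toy layers sharing genes A, B and C (D appears only in the PPI layer).
trn = [("A", "B"), ("A", "C"), ("B", "C")]
ppi = [("A", "B"), ("A", "C"), ("A", "D"), ("B", "D")]

genes = ["A", "B", "C"]
out_deg = trn_out_degree(trn)
ppi_deg = ppi_degree(ppi)
r = pearson([out_deg[g] for g in genes], [ppi_deg[g] for g in genes])
```

A Counter returns 0 for genes with no edges, so gene C, which regulates nothing in the toy TRN, still contributes a degree of 0 to the comparison.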
10
Chowdhury HA, Bhattacharyya DK, Kalita JK. (Differential) Co-Expression Analysis of Gene Expression: A Survey of Best Practices. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2020; 17:1154-1173. [PMID: 30668502 DOI: 10.1109/tcbb.2019.2893170] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Indexed: 06/09/2023]
Abstract
Analysis of gene expression data is widely used in transcriptomic studies to understand functions of molecules inside a cell and interactions among molecules. Differential co-expression analysis studies diseases and phenotypic variations by finding modules of genes whose co-expression patterns vary across conditions. We review the best practices in gene expression data analysis in terms of analysis of (differential) co-expression, co-expression network, differential networking, and differential connectivity considering both microarray and RNA-seq data along with comparisons. We highlight hurdles in RNA-seq data analysis using methods developed for microarrays. We include discussion of necessary tools for gene expression analysis throughout the paper. In addition, we shed light on scRNA-seq data analysis by including preprocessing and scRNA-seq in co-expression analysis along with useful tools specific to scRNA-seq. To get insights, biological interpretation and functional profiling is included. Finally, we provide guidelines for the analyst, along with research issues and challenges which should be addressed.
11
Zaucha J, Heinzinger M, Tarnovskaya S, Rost B, Frishman D. Family-specific analysis of variant pathogenicity prediction tools. NAR Genom Bioinform 2020; 2:lqaa014. [PMID: 33575576 PMCID: PMC7671395 DOI: 10.1093/nargab/lqaa014] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Received: 12/19/2019] [Revised: 02/12/2020] [Accepted: 02/25/2020] [Indexed: 01/01/2023] Open
Abstract
Using the presently available datasets of annotated missense variants, we ran a protein family-specific benchmarking of tools for predicting the pathogenicity of single amino acid variants. We find that despite the high overall accuracy of all tested methods, each tool has its Achilles heel, i.e. protein families in which its predictions prove unreliable (expected accuracy does not exceed 51% in any method). As a proof of principle, we show that choosing the optimal tool and pathogenicity threshold at a protein family-individual level allows obtaining reliable predictions in all Pfam domains (accuracy no less than 68%). A functional analysis of the sets of protein domains annotated exclusively by neutral or pathogenic mutations indicates that specific protein functions can be associated with a high or low sensitivity to mutations, respectively. The highly sensitive sets of protein domains are involved in the regulation of transcription and DNA sequence-specific transcription factor binding, while the domains that do not result in disease when mutated are responsible for mediating immune and stress responses. These results suggest that future predictors of pathogenicity and especially variant prioritization tools may benefit from considering functional annotation.
Affiliation(s)
- Jan Zaucha
- Department of Bioinformatics, Technical University of Munich, 85354 Freising, Germany
- Michael Heinzinger
- Department of Informatics, Bioinformatics & Computational Biology-i12, Technical University of Munich, 85748 Garching, Germany
- Burkhard Rost
- Department of Informatics, Bioinformatics & Computational Biology-i12, Technical University of Munich, 85748 Garching, Germany
- Dmitrij Frishman
- Department of Bioinformatics, Technical University of Munich, 85354 Freising, Germany
12
Tan A, Huang H, Zhang P, Li S. Network-based cancer precision medicine: A new emerging paradigm. Cancer Lett 2019; 458:39-45. [DOI: 10.1016/j.canlet.2019.05.015] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Received: 02/01/2019] [Revised: 04/29/2019] [Accepted: 05/15/2019] [Indexed: 12/20/2022]
13
Guo L, Wang J. rSNPBase 3.0: an updated database of SNP-related regulatory elements, element-gene pairs and SNP-based gene regulatory networks. Nucleic Acids Res 2019; 46:D1111-D1116. [PMID: 29140525 PMCID: PMC5753256 DOI: 10.1093/nar/gkx1101] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Received: 09/15/2017] [Accepted: 10/23/2017] [Indexed: 12/14/2022] Open
Abstract
Here, we present the updated rSNPBase 3.0 database (http://rsnp3.psych.ac.cn), which provides human SNP-related regulatory elements, element-gene pairs and SNP-based regulatory networks. This database is the updated version of the SNP regulatory annotation databases rSNPBase and rVarBase. In comparison to the last two versions, there are both structural and data adjustments in rSNPBase 3.0: (i) The most significant new feature is the expansion of the analysis scope from SNP-related regulatory elements to include regulatory element–target gene pairs (E–G pairs), so that it can provide SNP-based gene regulatory networks. (ii) The web functions were modified according to the data content, and a new network search module is provided in rSNPBase 3.0 in addition to the previous regulatory SNP (rSNP) search module. The two search modules support data queries for detailed information (related elements, element-gene pairs, and other extended annotations) on specific SNPs and for SNP-related graphic networks constructed from interacting transcription factors (TFs), miRNAs and genes. (iii) The types of regulatory elements were modified and enriched. To the best of our knowledge, the updated rSNPBase 3.0 is the first data tool that supports SNP functional analysis from a regulatory network perspective; it will provide both a comprehensive understanding and concrete guidance for SNP-related regulatory studies.
Affiliation(s)
- Liyuan Guo
- CAS Key Laboratory of Mental Health, Institute of Psychology, Chinese Academy of Sciences, Beijing 100101, China; Department of Psychology, University of Chinese Academy of Sciences, Beijing 100049, China
- Jing Wang
- CAS Key Laboratory of Mental Health, Institute of Psychology, Chinese Academy of Sciences, Beijing 100101, China; Department of Psychology, University of Chinese Academy of Sciences, Beijing 100049, China
|
14
|
Halu A, Wang JG, Iwata H, Mojcher A, Abib AL, Singh SA, Aikawa M, Sharma A. Context-enriched interactome powered by proteomics helps the identification of novel regulators of macrophage activation. eLife 2018; 7:37059. [PMID: 30303482 PMCID: PMC6179386 DOI: 10.7554/elife.37059] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Received: 03/27/2018] [Accepted: 08/30/2018] [Indexed: 02/06/2023]
Abstract
The role of pro-inflammatory macrophage activation in cardiovascular disease (CVD) is a complex one amenable to network approaches. While an indispensable tool for elucidating the molecular underpinnings of complex diseases including CVD, the interactome is limited in its utility as it is not specific to any cell type, experimental condition or disease state. We introduced context-specificity to the interactome by combining it with co-abundance networks derived from unbiased proteomics measurements from activated macrophage-like cells. Each macrophage phenotype contributed to certain regions of the interactome. Using a network proximity-based prioritization method on the combined network, we predicted potential regulators of macrophage activation. Prediction performance significantly increased with the addition of co-abundance edges, and the prioritized candidates captured inflammation, immunity and CVD signatures. Integrating the novel network topology with transcriptomics and proteomics revealed top candidate drivers of inflammation. In vitro loss-of-function experiments demonstrated the regulatory role of these proteins in pro-inflammatory signaling.

When human cells or tissues are injured, the body triggers a response known as inflammation to repair the damage and protect itself from further harm. However, if the same issue keeps recurring, the tissues become inflamed for longer periods of time, which may ultimately lead to health problems. This is what could be happening in cardiovascular diseases, where long-term inflammation could damage the heart and blood vessels. Many different proteins interact with each other to control inflammation; gaining an insight into the nature of these interactions could help to pinpoint the role of each molecular actor. Researchers have used a combination of unbiased, large-scale experimental and computational approaches to develop the interactome, a map of the known interactions between all proteins in humans. However, interactions between proteins can change between cell types, or during disease. Here, Halu et al. aimed to refine the human interactome and identify new proteins involved in inflammation, especially in the context of cardiovascular disease. Cells called macrophages produce signals that trigger inflammation when they detect damage in other cells or tissues. The experiments used a technique called proteomics to measure the amounts of all the proteins in human macrophages. Combining these data with the human interactome made it possible to predict new links between proteins known to have a role in inflammation and other proteins in the interactome. Further analysis using other sets of data from macrophages helped identify two new candidate proteins – GBP1 and WARS – that may promote inflammation. Halu et al. then used a genetic approach to deactivate the genes and decrease the levels of these two proteins in macrophages, which caused the signals that encourage inflammation to drop. These findings suggest that GBP1 and WARS regulate the activity of macrophages to promote inflammation. The two proteins could therefore be used as drug targets to treat cardiovascular diseases and other disorders linked to inflammation, but further studies will be needed to precisely dissect how GBP1 and WARS work in humans.
Affiliation(s)
- Arda Halu
- Channing Division of Network Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, United States; Center for Interdisciplinary Cardiovascular Sciences, Brigham and Women's Hospital, Harvard Medical School, Boston, United States
- Jian-Guo Wang
- Center for Interdisciplinary Cardiovascular Sciences, Brigham and Women's Hospital, Harvard Medical School, Boston, United States
- Hiroshi Iwata
- Center for Interdisciplinary Cardiovascular Sciences, Brigham and Women's Hospital, Harvard Medical School, Boston, United States
- Alexander Mojcher
- Center for Interdisciplinary Cardiovascular Sciences, Brigham and Women's Hospital, Harvard Medical School, Boston, United States
- Ana Luisa Abib
- Center for Interdisciplinary Cardiovascular Sciences, Brigham and Women's Hospital, Harvard Medical School, Boston, United States
- Sasha A Singh
- Center for Interdisciplinary Cardiovascular Sciences, Brigham and Women's Hospital, Harvard Medical School, Boston, United States
- Masanori Aikawa
- Center for Interdisciplinary Cardiovascular Sciences, Brigham and Women's Hospital, Harvard Medical School, Boston, United States
- Amitabh Sharma
- Channing Division of Network Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, United States
|
15
|
Biological networks integration based on dense module identification for gene prioritization from microarray data. GENE REPORTS 2018. [DOI: 10.1016/j.genrep.2018.07.008] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Indexed: 02/02/2023]
|
16
|
De Bastiani MA, Pfaffenseller B, Klamt F. Master Regulators Connectivity Map: A Transcription Factors-Centered Approach to Drug Repositioning. Front Pharmacol 2018; 9:697. [PMID: 30034338 PMCID: PMC6043797 DOI: 10.3389/fphar.2018.00697] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Received: 03/23/2018] [Accepted: 06/08/2018] [Indexed: 01/09/2023]
Abstract
Drug discovery is a very expensive and time-consuming endeavor. Fortunately, recent omics technologies and Systems Biology approaches introduced interesting new tools to achieve this task, facilitating the repurposing of already known drugs to new therapeutic assignments using gene expression data and bioinformatics. The inherent role of transcription factors in gene expression modulation makes them strong candidates for master regulators of phenotypic transitions. However, transcription factors expression itself usually does not reflect its activity changes due to post-transcriptional modifications and other complications. In this aspect, the use of high-throughput transcriptomic data may be employed to infer transcription factors-targets interactions and assess their activity through co-expression networks, which can be further used to search for drugs capable of reverting the gene expression profile of pathological phenotypes employing the connectivity maps paradigm. Following this idea, we argue that a module-oriented connectivity map approach using transcription factors-centered networks would aid the query for new repositioning candidates. Through a brief case study, we explored this idea in bipolar disorder, retrieving known drugs used in the usual clinical scenario as well as new candidates with potential therapeutic application in this disease. Indeed, the results of the case study indicate just how promising our approach may be to drug repositioning.
Affiliation(s)
- Marco A De Bastiani
- Laboratory of Cellular Biochemistry, Department of Biochemistry, Federal University of Rio Grande do Sul, Porto Alegre, Brazil; National Institute of Science and Technology for Translational Medicine, Porto Alegre, Brazil
- Bianca Pfaffenseller
- Laboratory of Cellular Biochemistry, Department of Biochemistry, Federal University of Rio Grande do Sul, Porto Alegre, Brazil; Laboratory of Molecular Psychiatry, Clinicas Hospital of Porto Alegre, Federal University of Rio Grande do Sul, Porto Alegre, Brazil
- Fabio Klamt
- Laboratory of Cellular Biochemistry, Department of Biochemistry, Federal University of Rio Grande do Sul, Porto Alegre, Brazil; National Institute of Science and Technology for Translational Medicine, Porto Alegre, Brazil
|
17
|
Detecting phenotype-driven transitions in regulatory network structure. NPJ Syst Biol Appl 2018; 4:16. [PMID: 29707235 PMCID: PMC5908977 DOI: 10.1038/s41540-018-0052-5] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Received: 08/14/2017] [Revised: 03/29/2018] [Accepted: 04/02/2018] [Indexed: 12/05/2022]
Abstract
Complex traits and diseases like human height or cancer are often not caused by a single mutation or genetic variant, but instead arise from functional changes in the underlying molecular network. Biological networks are known to be highly modular and contain dense “communities” of genes that carry out cellular processes, but these structures change between tissues, during development, and in disease. While many methods exist for inferring networks and analyzing their topologies separately, there is a lack of robust methods for quantifying differences in network structure. Here, we describe ALPACA (ALtered Partitions Across Community Architectures), a method for comparing two genome-scale networks derived from different phenotypic states to identify condition-specific modules. In simulations, ALPACA leads to more nuanced, sensitive, and robust module discovery than currently available network comparison methods. As an application, we use ALPACA to compare transcriptional networks in three contexts: angiogenic and non-angiogenic subtypes of ovarian cancer, human fibroblasts expressing transforming viral oncogenes, and sexual dimorphism in human breast tissue. In each case, ALPACA identifies modules enriched for processes relevant to the phenotype. For example, modules specific to angiogenic ovarian tumors are enriched for genes associated with blood vessel development, and modules found in female breast tissue are enriched for genes involved in estrogen receptor and ERK signaling. The functional relevance of these new modules suggests that not only can ALPACA identify structural changes in complex networks, but also that these changes may be relevant for characterizing biological phenotypes. Cells are controlled by complex regulatory networks, and disruptions in the structure of these networks can lead to disease. Understanding disease requires that we accurately identify changes in gene regulatory network structure. 
However, cellular networks have tens of thousands of components with complex connections between them. Megha Padi from the University of Arizona and John Quackenbush from Dana-Farber Cancer Institute developed a new algorithm that is far more effective than previous methods at finding disease-associated modules in regulatory networks. Applying this to ovarian cancer, they found new regulatory processes that may lead to more targeted treatments. In human breast tissue, they found that sex-specific differences were driven by hormone signaling and differentiation pathways. Decoding how network modules promote new functions may help to better model the relationship between genotype and phenotype.
|
18
|
Jalili M, Gebhardt T, Wolkenhauer O, Salehzadeh-Yazdi A. Unveiling network-based functional features through integration of gene expression into protein networks. Biochim Biophys Acta Mol Basis Dis 2018; 1864:2349-2359. [PMID: 29466699 DOI: 10.1016/j.bbadis.2018.02.010] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Received: 11/01/2017] [Revised: 01/31/2018] [Accepted: 02/13/2018] [Indexed: 02/02/2023]
Abstract
Decoding health and disease phenotypes is one of the fundamental objectives in biomedicine. Whereas high-throughput omics approaches are available, it is evident that any single omics approach might not be adequate to capture the complexity of phenotypes. Therefore, integrated multi-omics approaches have been used to unravel genotype-phenotype relationships such as global regulatory mechanisms and complex metabolic networks in different eukaryotic organisms. Some of the progress and challenges associated with integrated omics studies have been reviewed previously in comprehensive studies. In this work, we highlight and review the progress, challenges and advantages associated with emerging approaches, integrating gene expression and protein-protein interaction networks to unravel network-based functional features. This includes identifying disease related genes, gene prioritization, clustering protein interactions, developing the modules, extract active subnetworks and static protein complexes or dynamic/temporal protein complexes. We also discuss how these approaches contribute to our understanding of the biology of complex traits and diseases. This article is part of a Special Issue entitled: Cardiac adaptations to obesity, diabetes and insulin resistance, edited by Professors Jan F.C. Glatz, Jason R.B. Dyck and Christine Des Rosiers.
Affiliation(s)
- Mahdi Jalili
- Hematology, Oncology and SCT Research Center, Tehran University of Medical Sciences, Tehran, Iran; Hematologic Malignancies Research Center, Tehran University of Medical Sciences, Tehran, Iran
- Tom Gebhardt
- Department of Systems Biology and Bioinformatics, University of Rostock, 18051 Rostock, Germany
- Olaf Wolkenhauer
- Department of Systems Biology and Bioinformatics, University of Rostock, 18051 Rostock, Germany
- Ali Salehzadeh-Yazdi
- Department of Systems Biology and Bioinformatics, University of Rostock, 18051 Rostock, Germany
|
19
|
Manem VSK, Salgado R, Aftimos P, Sotiriou C, Haibe-Kains B. Network science in clinical trials: A patient-centered approach. Semin Cancer Biol 2017; 52:135-150. [PMID: 29278737 DOI: 10.1016/j.semcancer.2017.12.006] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Received: 09/20/2017] [Revised: 12/12/2017] [Accepted: 12/13/2017] [Indexed: 02/08/2023]
Abstract
There has been a paradigm shift in translational oncology with the advent of novel molecular diagnostic tools in the clinic. However, several challenges are associated with the integration of these sophisticated tools into clinical oncology and daily practice. High-throughput profiling at the DNA, RNA and protein levels (omics) generates a massive amount of data. The analysis and interpretation of these data is non-trivial but will allow a more thorough understanding of cancer. Linear modelling of the data, as it is often used today, is likely to limit our understanding of cancer as a complex disease, and at times fails to capture a phenotype of interest. Network science and systems biology-based approaches, using machine learning and network science principles, that integrate multiple data sources, can uncover complex changes in a biological system. This approach will integrate a large number of potential biomarkers in preclinical studies to better inform therapeutic decisions and ultimately make substantial progress towards precision medicine. It will, however, require development of a new generation of clinical trials. Beyond discussing the challenges of high-throughput technologies, this review will develop a framework on how to implement a network science approach in new clinical trial designs in order to advance cancer care.
Affiliation(s)
- Venkata S K Manem
- Bioinformatics and Computational Genomics Laboratory, Princess Margaret Cancer Center, Toronto, ON, Canada; Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
- Roberto Salgado
- Breast Cancer Translational Research Laboratory, Université Libre de Bruxelles, Brussels, Belgium; Department of Pathology, GZA Hospitals Antwerp, Belgium
- Philippe Aftimos
- Medical Oncology Clinic, Institut Jules Bordet - Université Libre de Bruxelles, Brussels, Belgium
- Christos Sotiriou
- Breast Cancer Translational Research Laboratory, Université Libre de Bruxelles, Brussels, Belgium; Medical Oncology Clinic, Institut Jules Bordet - Université Libre de Bruxelles, Brussels, Belgium
- Benjamin Haibe-Kains
- Bioinformatics and Computational Genomics Laboratory, Princess Margaret Cancer Center, Toronto, ON, Canada; Department of Computer Science, University of Toronto, Toronto, ON, Canada; Ontario Institute of Cancer Research, Toronto, ON, Canada; Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
|
20
|
Telonis AG, Rigoutsos I. Race Disparities in the Contribution of miRNA Isoforms and tRNA-Derived Fragments to Triple-Negative Breast Cancer. Cancer Res 2017; 78:1140-1154. [PMID: 29229607 DOI: 10.1158/0008-5472.can-17-1947] [Citation(s) in RCA: 83] [Impact Index Per Article: 11.9] [Received: 07/07/2017] [Revised: 10/19/2017] [Accepted: 11/30/2017] [Indexed: 12/14/2022]
Abstract
Triple-negative breast cancer (TNBC) is a breast cancer subtype characterized by marked differences between White and Black/African-American women. We performed a systems-level analysis on datasets from The Cancer Genome Atlas to elucidate how the expression patterns of mRNAs are shaped by regulatory noncoding RNAs (ncRNA). Specifically, we studied isomiRs, that is, isoforms of miRNAs, and tRNA-derived fragments (tRF). In normal breast tissue, we observed a marked cohesiveness in both the ncRNA and mRNA layers and the associations between them. This cohesiveness was widely disrupted in TNBC. Many mRNAs become either differentially expressed or differentially wired between normal breast and TNBC in tandem with isomiR or tRF dysregulation. The affected pathways included energy metabolism, cell signaling, and immune responses. Within TNBC, the wiring of the affected pathways with isomiRs and tRFs differed in each race. Multiple isomiRs and tRFs arising from specific miRNA loci (e.g., miR-200c, miR-21, the miR-17/92 cluster, the miR-183/96/182 cluster) and from specific tRNA loci (e.g., the nuclear tRNAGly and tRNALeu, the mitochondrial tRNAVal and tRNAPro) were strongly associated with the observed race disparities in TNBC. We highlight the race-specific aspects of transcriptome wiring by discussing in detail the metastasis-related MAPK and the Wnt/β-catenin signaling pathways, two of the many key pathways that were found differentially wired. In conclusion, by employing a data- and knowledge-driven approach, we comprehensively analyzed the normal and cancer transcriptomes to uncover novel key contributors to the race-based disparities of TNBC.

Significance: This big data-driven study comparing normal and cancer transcriptomes uncovers RNA expression differences between Caucasian and African-American patients with triple-negative breast cancer that might help explain disparities in incidence and aggressive character. Cancer Res; 78(5); 1140-54. ©2017 AACR.
Affiliation(s)
- Aristeidis G Telonis
- Computational Medicine Center, Sidney Kimmel Medical College, Thomas Jefferson University, Philadelphia, Pennsylvania
- Isidore Rigoutsos
- Computational Medicine Center, Sidney Kimmel Medical College, Thomas Jefferson University, Philadelphia, Pennsylvania
|
21
|
Bryan K, McGivney BA, Farries G, McGettigan PA, McGivney CL, Gough KF, MacHugh DE, Katz LM, Hill EW. Equine skeletal muscle adaptations to exercise and training: evidence of differential regulation of autophagosomal and mitochondrial components. BMC Genomics 2017; 18:595. [PMID: 28793853 PMCID: PMC5551008 DOI: 10.1186/s12864-017-4007-9] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Received: 02/01/2017] [Accepted: 08/02/2017] [Indexed: 12/23/2022]
Abstract
BACKGROUND A single bout of exercise induces changes in gene expression in skeletal muscle. Regular exercise results in an adaptive response involving changes in muscle architecture and biochemistry, and is an effective way to manage and prevent common human diseases such as obesity, cardiovascular disorders and type II diabetes. However, the biomolecular mechanisms underlying such responses still need to be fully elucidated. Here we performed a transcriptome-wide analysis of skeletal muscle tissue in a large cohort of untrained Thoroughbred horses (n = 51) before and after a bout of high-intensity exercise and again after an extended period of training. We hypothesized that regular high-intensity exercise training primes the transcriptome for the demands of high-intensity exercise. RESULTS An extensive set of genes was observed to be significantly differentially regulated in response to a single bout of high-intensity exercise in the untrained cohort (3241 genes) and following multiple bouts of high-intensity exercise training over a six-month period (3405 genes). Approximately one-third of these genes (1025) and several biological processes related to energy metabolism were common to both the exercise and training responses. We then developed a novel network-based computational analysis pipeline to test the hypothesis that these transcriptional changes also influence the contextual molecular interactome and its dynamics in response to exercise and training. The contextual network analysis identified several important hub genes, including the autophagosomal-related gene GABARAPL1, and dynamic functional modules, including those enriched for mitochondrial respiratory chain complexes I and V, that were differentially regulated and had their putative interactions 're-wired' in the exercise and/or training responses. 
CONCLUSION Here we have generated for the first time, a comprehensive set of genes that are differentially expressed in Thoroughbred skeletal muscle in response to both exercise and training. These data indicate that consecutive bouts of high-intensity exercise result in a priming of the skeletal muscle transcriptome for the demands of the next exercise bout. Furthermore, this may also lead to an extensive 're-wiring' of the molecular interactome in both exercise and training and include key genes and functional modules related to autophagy and the mitochondrion.
Affiliation(s)
- Kenneth Bryan
- UCD School of Agriculture and Food Science, University College Dublin, Belfield, D04 V1W8 Ireland
- Beatrice A. McGivney
- UCD School of Agriculture and Food Science, University College Dublin, Belfield, D04 V1W8 Ireland
- Gabriella Farries
- UCD School of Agriculture and Food Science, University College Dublin, Belfield, D04 V1W8 Ireland
- Paul A. McGettigan
- UCD School of Agriculture and Food Science, University College Dublin, Belfield, D04 V1W8 Ireland
- Charlotte L. McGivney
- UCD School of Agriculture and Food Science, University College Dublin, Belfield, D04 V1W8 Ireland
- Katie F. Gough
- UCD School of Agriculture and Food Science, University College Dublin, Belfield, D04 V1W8 Ireland
- David E. MacHugh
- UCD School of Agriculture and Food Science, University College Dublin, Belfield, D04 V1W8 Ireland
- UCD Conway Institute of Biomolecular and Biomedical Research, University College Dublin, Belfield, D04 V1W8 Ireland
- Lisa M. Katz
- UCD School of Veterinary Medicine, University College Dublin, Belfield, D04 V1W8 Ireland
- Emmeline W. Hill
- UCD School of Agriculture and Food Science, University College Dublin, Belfield, D04 V1W8 Ireland
|
22
|
Sikdar S, Datta S. A novel statistical approach for identification of the master regulator transcription factor. BMC Bioinformatics 2017; 18:79. [PMID: 28148240 PMCID: PMC5288875 DOI: 10.1186/s12859-017-1499-x] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Received: 08/31/2016] [Accepted: 01/27/2017] [Indexed: 12/23/2022]
Abstract
BACKGROUND Transcription factors are known to play key roles in carcinogenesis and are therefore gaining popularity as potential therapeutic targets in drug development. A 'master regulator' transcription factor often appears to control most of the regulatory activities of the other transcription factors and the associated genes. This 'master regulator' transcription factor is at the top of the hierarchy of transcriptomic regulation. Therefore, it is important to identify and target the master regulator transcription factor for proper understanding of the associated disease process and for identifying the best therapeutic option. METHODS We present a novel two-step computational approach for identification of the master regulator transcription factor in a genome. In the first step of our method, we test whether any master regulator transcription factor exists in the system. We evaluate the concordance of two ranked lists of transcription factors using a statistical measure. If the concordance measure is statistically significant, we conclude that there is a master regulator. In the second step, our method identifies the master regulator transcription factor, if one exists. RESULTS In the simulation scenario, our method performs reasonably well in validating the existence of a master regulator when the number of subjects in each treatment group is reasonably large. In application to two real datasets, our method confirms the existence of master regulators and identifies biologically meaningful master regulators. R code for implementing our method on sample test data can be found at http://www.somnathdatta.org/software. CONCLUSION We have developed a screening method for identifying the 'master regulator' transcription factor using only gene expression data. Understanding the regulatory structure and finding the master regulator help narrow the search space for identifying biomarkers for complex diseases such as cancer. In addition to identifying the master regulator, our method provides an overview of the regulatory structure of the transcription factors which control the global gene expression profiles and consequently the cell functioning.
Affiliation(s)
- Sinjini Sikdar
- Department of Biostatistics, University of Florida, Gainesville, FL, 32611, USA
- Susmita Datta
- Department of Biostatistics, University of Florida, Gainesville, FL, 32611, USA
|
23
|
Samad AFA, Sajad M, Nazaruddin N, Fauzi IA, Murad AMA, Zainal Z, Ismail I. MicroRNA and Transcription Factor: Key Players in Plant Regulatory Network. FRONTIERS IN PLANT SCIENCE 2017; 8:565. [PMID: 28446918 PMCID: PMC5388764 DOI: 10.3389/fpls.2017.00565] [Citation(s) in RCA: 185] [Impact Index Per Article: 26.4] [Received: 01/13/2017] [Accepted: 03/29/2017] [Indexed: 05/14/2023]
Abstract
Recent achievements in plant microRNA (miRNA) research, covering a large class of small, non-coding RNAs, are very exciting. A wide array of techniques involving forward genetics, molecular cloning, bioinformatic analysis, and the latest technology, deep sequencing, have greatly advanced miRNA discovery. A tiny miRNA sequence has the ability to target single or multiple mRNA targets. Most of the miRNA targets are transcription factors (TFs), which have paramount importance in regulating plant growth and development. Various families of TFs, which regulate a range of regulatory networks, may assist plants to grow under normal and stress environmental conditions. This review focuses on the regulatory relationships between miRNAs and different families of TFs such as NF-Y, MYB, AP2, TCP, WRKY, NAC, GRF, and SPL. For instance, NF-Y plays an important role in drought tolerance and flower development, MYB is involved in signal transduction and biosynthesis of secondary metabolites, AP2 regulates floral development and nodule formation, and TCP directs leaf development and growth hormone signaling. WRKY has known roles in multiple stress tolerances, NAC regulates lateral root formation, GRF is involved in root growth and in flower and seed development, and SPL regulates the plant transition from juvenile to adult. We also studied the relation between miRNAs and TFs by consolidating the research findings from different plant species, which will help plant scientists in understanding the mechanism of action and interaction between these regulators in plant growth and development under normal and stress environmental conditions.
Affiliation(s)
- Abdul F. A. Samad
- School of Biosciences and Biotechnology, Faculty of Science and Technology, National University of Malaysia, Selangor, Malaysia
- Muhammad Sajad
- Department of Plant Breeding and Genetics, University College of Agriculture and Environmental Sciences, The Islamia University of Bahawalpur, Punjab, Pakistan
- Centre of Plant Biotechnology, Institute of Systems Biology, National University of Malaysia, Selangor, Malaysia
- Nazaruddin Nazaruddin
- School of Biosciences and Biotechnology, Faculty of Science and Technology, National University of Malaysia, Selangor, Malaysia
- Department of Chemistry, Faculty of Mathematics and Natural Sciences, Syiah Kuala University, Darussalam, Banda Aceh, Indonesia
- Izzat A. Fauzi
- School of Biosciences and Biotechnology, Faculty of Science and Technology, National University of Malaysia, Selangor, Malaysia
- Abdul M. A. Murad
- School of Biosciences and Biotechnology, Faculty of Science and Technology, National University of Malaysia, Selangor, Malaysia
- Zamri Zainal
- School of Biosciences and Biotechnology, Faculty of Science and Technology, National University of Malaysia, Selangor, Malaysia
- Centre of Plant Biotechnology, Institute of Systems Biology, National University of Malaysia, Selangor, Malaysia
- Ismanizan Ismail
- School of Biosciences and Biotechnology, Faculty of Science and Technology, National University of Malaysia, Selangor, Malaysia
- Centre of Plant Biotechnology, Institute of Systems Biology, National University of Malaysia, Selangor, Malaysia
- *Correspondence: Ismanizan Ismail,
|
24
|
Zuo Y, Cui Y, Di Poto C, Varghese RS, Yu G, Li R, Ressom HW. INDEED: Integrated differential expression and differential network analysis of omic data for biomarker discovery. Methods 2016; 111:12-20. [PMID: 27592383 DOI: 10.1016/j.ymeth.2016.08.015] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Received: 05/02/2016] [Revised: 08/25/2016] [Accepted: 08/30/2016] [Indexed: 01/03/2023]
Abstract
Differential expression (DE) analysis is commonly used to identify biomarker candidates that have significant changes in their expression levels between distinct biological groups. One drawback of DE analysis is that it only considers the changes on single biomolecule level. Recently, differential network (DN) analysis has become popular due to its capability to measure the changes on biomolecular pair level. In DN analysis, network is typically built based on correlation and biomarker candidates are selected by investigating the network topology. However, correlation tends to generate over-complicated networks and the selection of biomarker candidates purely based on network topology ignores the changes on single biomolecule level. In this paper, we propose a novel approach, INDEED, that builds sparse differential network based on partial correlation and integrates DE and DN analyses for biomarker discovery. We applied this approach on real proteomic and glycomic data generated by liquid chromatography coupled with mass spectrometry for hepatocellular carcinoma (HCC) biomarker discovery study. For each omic data, we used one dataset to select biomarker candidates, built a disease classifier and evaluated the performance of the classifier on an independent dataset. The biomarker candidates, selected by INDEED, were more reproducible across independent datasets, and led to a higher classification accuracy in predicting HCC cases and cirrhotic controls compared with those selected by separate DE and DN analyses. INDEED also identified some candidates previously reported to be relevant to HCC, such as intercellular adhesion molecule 2 (ICAM2) and c4b-binding protein alpha chain (C4BPA), which were missed by both DE and DN analyses. In addition, we applied INDEED for survival time prediction based on transcriptomic data acquired by analysis of samples from breast cancer patients. 
We selected biomarker candidates and built a regression model for survival time prediction based on a gene expression dataset and patients' survival records. We evaluated the performance of the regression model on an independent dataset. Compared with the biomarker candidates selected by DE and DN analyses, those selected through INDEED led to more accurate survival time prediction.
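The contrast this abstract draws between correlation and partial correlation can be illustrated with a small sketch. This is not the INDEED implementation; it is a toy example, under the assumption of three variables in a chain (x influences y, y influences z), showing why a correlation network would contain an x-z edge that a partial-correlation network would likely drop. The data, variable names, and helper functions are hypothetical.

```python
import math
import random


def pearson(a, b):
    """Pearson correlation of two equal-length sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)


def partial_corr(x, z, y):
    """First-order partial correlation of x and z, controlling for y."""
    r_xz, r_xy, r_yz = pearson(x, z), pearson(x, y), pearson(y, z)
    return (r_xz - r_xy * r_yz) / math.sqrt((1 - r_xy ** 2) * (1 - r_yz ** 2))


# Hypothetical toy data: a chain x -> y -> z, so x and z are linked only via y.
rng = random.Random(0)
x = [rng.gauss(0, 1) for _ in range(5000)]
y = [xi + rng.gauss(0, 1) for xi in x]
z = [yi + rng.gauss(0, 1) for yi in y]

marginal = pearson(x, z)             # indirect association, clearly non-zero
conditional = partial_corr(x, z, y)  # near zero once y is accounted for

print(f"correlation(x, z)         = {marginal:.3f}")
print(f"partial correlation | y   = {conditional:.3f}")
```

With more than three variables, partial correlations are usually obtained from the inverse covariance (precision) matrix rather than from this first-order formula; the sketch uses the three-variable case only to keep the arithmetic visible.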
Affiliation(s)
- Yiming Zuo
- Department of Electrical and Computer Engineering, Virginia Polytechnic Institute and State University, Arlington, VA 22203, USA; Department of Radiation Oncology, Stanford University, Palo Alto, CA 94304, USA; Lombardi Comprehensive Cancer Center, Georgetown University, Washington, DC 20007, USA
- Yi Cui
- Department of Radiation Oncology, Stanford University, Palo Alto, CA 94304, USA
- Cristina Di Poto
- Lombardi Comprehensive Cancer Center, Georgetown University, Washington, DC 20007, USA
- Rency S Varghese
- Lombardi Comprehensive Cancer Center, Georgetown University, Washington, DC 20007, USA
- Guoqiang Yu
- Department of Electrical and Computer Engineering, Virginia Polytechnic Institute and State University, Arlington, VA 22203, USA
- Ruijiang Li
- Department of Radiation Oncology, Stanford University, Palo Alto, CA 94304, USA
- Habtom W Ressom
- Lombardi Comprehensive Cancer Center, Georgetown University, Washington, DC 20007, USA
|