1
|
Song C, Zhang J, Liu Y, Hu Y, Feng C, Shi P, Zhang Y, Wang L, Xie Y, Zhang M, Zhao X, Cao Y, Li C, Sun H. Characterization and Validation of ceRNA-Mediated Pathway–Pathway Crosstalk Networks Across Eight Major Cardiovascular Diseases. Front Cell Dev Biol 2022; 10:762129. [PMID: 35433687 PMCID: PMC9010821 DOI: 10.3389/fcell.2022.762129] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2021] [Accepted: 03/01/2022] [Indexed: 01/08/2023] Open
Abstract
Pathway analysis is considered as an important strategy to reveal the underlying mechanisms of diseases. Pathways that are involved in crosstalk can regulate each other and co-regulate downstream biological processes. Furthermore, some genes in the pathways can function with other genes via the relationship of the competing endogenous RNA (ceRNA) mechanism, which has also been demonstrated to play key roles in cellular biology. However, the comprehensive analysis of ceRNA-mediated pathway crosstalk is lacking. Here, we constructed the landscape of the ceRNA-mediated pathway–pathway crosstalk of eight major cardiovascular diseases (CVDs) based on sequencing data from ∼2,800 samples. Some common features shared by numerous CVDs were uncovered. A fraction of the pathway–pathway crosstalk was conserved in multiple CVDs and a core pathway–pathway crosstalk network was identified, suggesting the similarity of pathway–pathway crosstalk among CVDs. Experimental evidence also demonstrated that the pathway crosstalk was functioned in CVDs. We split all hub pathways of each pathway–pathway crosstalk network into three categories, namely, common hubs, differential hubs, and specific hubs, which could highlight the common or specific biological mechanisms. Importantly, after a comparison analysis of the hub pathways of networks, ∼480 hub pathway-induced common modules were identified to exert functions in CVDs broadly. Moreover, we performed a random walk algorithm on the hub pathway-induced sub-network and identified 23 potentially novel CVD-related pathways. In summary, our study revealed the potential molecular regulatory mechanisms of ceRNA crosstalk in pathway–pathway crosstalk levels and provided a novel routine to investigate the pathway–pathway crosstalk in cardiology. All CVD pathway–pathway crosstalks are provided in http://www.licpathway.net/cepathway/index.html.
Collapse
Affiliation(s)
- Chao Song
- Department of Pharmacology, Harbin Medical University-Daqing, Daqing, China
| | - Jian Zhang
- Department of Medical Informatics, Harbin Medical University-Daqing, Daqing, China
| | - Yongsheng Liu
- Department of Pharmacology, Harbin Medical University-Daqing, Daqing, China
| | - Yinling Hu
- Department of Rehabilitation, Beijing Rehabilitation Hospital of Capital Medical University, Beijing, China
| | - Chenchen Feng
- Department of Medical Informatics, Harbin Medical University-Daqing, Daqing, China
| | - Pilong Shi
- Department of Pharmacology, Harbin Medical University-Daqing, Daqing, China
| | - Yuexin Zhang
- Department of Medical Informatics, Harbin Medical University-Daqing, Daqing, China
| | - Lixin Wang
- Department of Pharmacology, Harbin Medical University-Daqing, Daqing, China
| | - Yawen Xie
- Department of Pharmacology, Harbin Medical University-Daqing, Daqing, China
| | - Meitian Zhang
- Department of Pharmacology, Harbin Medical University-Daqing, Daqing, China
| | - Xilong Zhao
- Department of Medical Informatics, Harbin Medical University-Daqing, Daqing, China
| | - Yonggang Cao
- Department of Pharmacology, Harbin Medical University-Daqing, Daqing, China
| | - Chunquan Li
- Department of Medical Informatics, Harbin Medical University-Daqing, Daqing, China
- *Correspondence: Hongli Sun, ; Chunquan Li,
| | - Hongli Sun
- Department of Pharmacology, Harbin Medical University-Daqing, Daqing, China
- *Correspondence: Hongli Sun, ; Chunquan Li,
| |
Collapse
|
2
|
Cava C, Pini S, Taramelli D, Castiglioni I. Perturbations of pathway co-expression network identify a core network in metastatic breast cancer. Comput Biol Chem 2020; 87:107313. [PMID: 32590221 DOI: 10.1016/j.compbiolchem.2020.107313] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2019] [Revised: 04/03/2020] [Accepted: 06/18/2020] [Indexed: 12/17/2022]
Abstract
Metastases are the main cause of death in advanced breast cancer (BC) patients. Although chemotherapy and hormone therapy are current treatment strategies, drug resistance is frequent and still not completely understood. In this study, a bioinformatics analysis was performed on BC patients to explore the molecular mechanisms associated with BC metastasis. Microarray gene expression profiles of metastatic and non metastatic BC patients were downloaded from Gene Expression Omnibus (GEO) dataset. Raw data were normalized and merged using the Combat tool. Pathways enriched with differently expressed genes were identified and a pathway co-expression network was generated using Pearson's correlation. We identified from this network, which includes 17 pathways and 128 interactions, the pathways that most influence the network efficiency. Moreover, protein interaction network was investigated to identify hub genes of the pathway network. The prognostic role of the network was evaluated with a survival analysis using an independent dataset. In conclusion, the pathway co-expression network could contribute to understanding the mechanism and development of BC metastases.
Collapse
Affiliation(s)
- Claudia Cava
- Institute of Molecular Bioimaging and Physiology, National Research Council (IBFM-CNR), Via F.Cervi 93, 20090, Segrate, Milan, Italy.
| | - Simone Pini
- Institute of Molecular Bioimaging and Physiology, National Research Council (IBFM-CNR), Via F.Cervi 93, 20090, Segrate, Milan, Italy; Department of Pharmacological & Biomolecular Sciences (DiSFeB), University of Milan, Via Pascal 36, 20133, Milan, Italy
| | - Donatella Taramelli
- Department of Pharmacological & Biomolecular Sciences (DiSFeB), University of Milan, Via Pascal 36, 20133, Milan, Italy
| | - Isabella Castiglioni
- Institute of Molecular Bioimaging and Physiology, National Research Council (IBFM-CNR), Via F.Cervi 93, 20090, Segrate, Milan, Italy
| |
Collapse
|
3
|
Woldesemayat AA, Ntwasa M. Pathways and Network Based Analysis of Candidate Genes to Reveal Cross-Talk and Specificity in the Sorghum ( Sorghum bicolor (L.) Moench) Responses to Drought and It's Co-occurring Stresses. Front Genet 2018; 9:557. [PMID: 30515190 PMCID: PMC6255970 DOI: 10.3389/fgene.2018.00557] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2018] [Accepted: 10/31/2018] [Indexed: 12/01/2022] Open
Abstract
Drought alone or in combination with other stresses forms the major crop production constraint worldwide. Sorghum, one of the most important cereal crops is affected by drought alone or in combination with co-occurring stresses; notwithstanding, sorghum has evolved adaptive responses to combined stresses. Furthermore, an impressive number of sorghum genes have been investigated for drought tolerance. However, the molecular mechanism underling drought response remains poorly understood. We employed a systems biology approach to elucidate regulatory and broad functional features of these genes. Their interaction network would provide insight into understanding the molecular mechanisms of drought tolerance and underpinning signal pathways. Functional analysis was undertaken to determine significantly enriched genesets for pathways involved in drought tolerance. Analysis of distinct pathway cross-talk network was performed and drought-specific subnetwork was extracted. Investigation of various data sources such as gene expression, regulatory pathways, sorghumCyc, sorghum protein-protein interaction (PPI) and Gene Ontology (GO) revealed 14 major drought stress related hub genes (DSRhub genes). Significantly enriched genesets have shown association with various biological processes underlying drought-related responses. Key metabolic pathways were significantly enriched in the drought-related genes. Systematic analysis of pathways cross-talk and gene interaction network revealed major cross-talk pathway modules associated with drought tolerance. Further investigation of the major DSRhub genes revealed distinct regulatory genes such as ZEP, NCED, AAO, and MCSU and CYP707A1. These were involved in the regulation of ABA biosynthesis and signal transduction. Other protein families, namely, aldehyde and alcohol dehydrogenases, mitogene activated protein kinases (MAPKs), and Ribulose-1,5-biphosphate carboxylase (RuBisCO) were shown to be involved in the drought-related responses. This shows a diversity of complex functional features in sorghum to respond to various abiotic stresses. Finally, we constructed a drought-specific subnetwork, characterized by unique candidate genes that were associated with DSRhub genes. According to our knowledge, this is the first in sorghum drought investigation that introduces pathway and network-based candidate gene approach for analysis of drought tolerance. We provide novel information about pathways cross-talk and signaling networks used in further systems level analysis for understanding the molecular mechanism behind drought tolerance and can, therefore, be adapted to other model and non-model crops.
Collapse
Affiliation(s)
- Adugna Abdi Woldesemayat
- Department of Life and Consumer Sciences, College of Agriculture and Environmental Sciences, University of South Africa, Johannesburg, South Africa
| | - Monde Ntwasa
- Department of Life and Consumer Sciences, College of Agriculture and Environmental Sciences, University of South Africa, Johannesburg, South Africa
| |
Collapse
|
4
|
Pita-Juárez Y, Altschuler G, Kariotis S, Wei W, Koler K, Green C, Tanzi RE, Hide W. The Pathway Coexpression Network: Revealing pathway relationships. PLoS Comput Biol 2018; 14:e1006042. [PMID: 29554099 PMCID: PMC5875878 DOI: 10.1371/journal.pcbi.1006042] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2017] [Revised: 03/29/2018] [Accepted: 02/19/2018] [Indexed: 02/02/2023] Open
Abstract
A goal of genomics is to understand the relationships between biological processes. Pathways contribute to functional interplay within biological processes through complex but poorly understood interactions. However, limited functional references for global pathway relationships exist. Pathways from databases such as KEGG and Reactome provide discrete annotations of biological processes. Their relationships are currently either inferred from gene set enrichment within specific experiments, or by simple overlap, linking pathway annotations that have genes in common. Here, we provide a unifying interpretation of functional interaction between pathways by systematically quantifying coexpression between 1,330 canonical pathways from the Molecular Signatures Database (MSigDB) to establish the Pathway Coexpression Network (PCxN). We estimated the correlation between canonical pathways valid in a broad context using a curated collection of 3,207 microarrays from 72 normal human tissues. PCxN accounts for shared genes between annotations to estimate significant correlations between pathways with related functions rather than with similar annotations. We demonstrate that PCxN provides novel insight into mechanisms of complex diseases using an Alzheimer’s Disease (AD) case study. PCxN retrieved pathways significantly correlated with an expert curated AD gene list. These pathways have known associations with AD and were significantly enriched for genes independently associated with AD. As a further step, we show how PCxN complements the results of gene set enrichment methods by revealing relationships between enriched pathways, and by identifying additional highly correlated pathways. PCxN revealed that correlated pathways from an AD expression profiling study include functional clusters involved in cell adhesion and oxidative stress. PCxN provides expanded connections to pathways from the extracellular matrix. PCxN provides a powerful new framework for interrogation of global pathway relationships. Comprehensive exploration of PCxN can be performed at http://pcxn.org/. Genes do not function alone, but interact within pathways to carry out specific biological processes. Pathways, in turn, interact at a higher level to affect major cellular activities such as motility, growth and development. We present a pathway coexpression network (PCxN) that systematically maps and quantifies these high-level interactions and establishes a unifying reference for pathway relationships. The method uses 3,207 human microarrays from 72 normal human tissues and 1,330 of the most well established pathway annotations to describe global relationships between pathways. PCxN accounts for shared genes to estimate correlations between pathways with related functions rather than with redundant pathway definitions. PCxN can be used to discover and explore pathways correlated with a pathway of interest. We applied PCxN to identify key processes related to Alzheimer’s disease (AD), interpreting a mixed genetic association and experimental derived set of disease genes in the context of gene co-expression. We expand the known relationships between pathways identified by gene set enrichment analysis in brain tissues affected with AD. PCxN provides a high-level overview of pathway relationships. PCxN is available as a webtool at http://pcxn.org/, and as a Bioconductor package at http://bioconductor.org/packages/pcxn/.
Collapse
Affiliation(s)
- Yered Pita-Juárez
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, United States of America
| | - Gabriel Altschuler
- Sheffield Institute for Translational Neuroscience, Department of Neuroscience, University of Sheffield, Sheffield, United Kingdom
| | - Sokratis Kariotis
- Sheffield Institute for Translational Neuroscience, Department of Neuroscience, University of Sheffield, Sheffield, United Kingdom
| | - Wenbin Wei
- Sheffield Institute for Translational Neuroscience, Department of Neuroscience, University of Sheffield, Sheffield, United Kingdom
| | - Katjuša Koler
- Sheffield Institute for Translational Neuroscience, Department of Neuroscience, University of Sheffield, Sheffield, United Kingdom
| | - Claire Green
- Sheffield Institute for Translational Neuroscience, Department of Neuroscience, University of Sheffield, Sheffield, United Kingdom
| | - Rudolph E. Tanzi
- Genetics and Aging Research Unit, MassGeneral Institute for Neurodegenerative Disease, Massachusetts General Hospital and Harvard Medical School, Charlestown, Massachusetts, United States of America
| | - Winston Hide
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, United States of America
- Sheffield Institute for Translational Neuroscience, Department of Neuroscience, University of Sheffield, Sheffield, United Kingdom
- Harvard Stem Cell Institute, Cambridge, Massachusetts, United States of America
- National Institute Health Research, Sheffield Biomedical Research Centre, Sheffield, United Kingdom
- * E-mail:
| |
Collapse
|
5
|
Ensemble Methods with Voting Protocols Exhibit Superior Performance for Predicting Cancer Clinical Endpoints and Providing More Complete Coverage of Disease-Related Genes. Int J Genomics 2018; 2018:8124950. [PMID: 29546047 PMCID: PMC5818887 DOI: 10.1155/2018/8124950] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2017] [Revised: 11/06/2017] [Accepted: 11/14/2017] [Indexed: 11/17/2022] Open
Abstract
In genetic data modeling, the use of a limited number of samples for modeling and predicting, especially well below the attribute number, is difficult due to the enormous number of genes detected by a sequencing platform. In addition, many studies commonly use machine learning methods to evaluate genetic datasets to identify potential disease-related genes and drug targets, but to the best of our knowledge, the information associated with the selected gene set was not thoroughly elucidated in previous studies. To identify a relatively stable scheme for modeling limited samples in the gene datasets and reveal the information that they contain, the present study first evaluated the performance of a series of modeling approaches for predicting clinical endpoints of cancer and later integrated the results using various voting protocols. As a result, we proposed a relatively stable scheme that used a set of methods with an ensemble algorithm. Our findings indicated that the ensemble methodologies are more reliable for predicting cancer prognoses than single machine learning algorithms as well as for gene function evaluating. The ensemble methodologies provide a more complete coverage of relevant genes, which can facilitate the exploration of cancer mechanisms and the identification of potential drug targets.
Collapse
|
6
|
Padmanabhan K, Shpanskaya K, Bello G, Doraiswamy PM, Samatova NF. Toward Personalized Network Biomarkers in Alzheimer's Disease: Computing Individualized Genomic and Protein Crosstalk Maps. Front Aging Neurosci 2017; 9:315. [PMID: 29085293 PMCID: PMC5649142 DOI: 10.3389/fnagi.2017.00315] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2017] [Accepted: 09/15/2017] [Indexed: 01/12/2023] Open
Affiliation(s)
- Kanchana Padmanabhan
- Department of Computer Science, North Carolina State University, Raleigh, NC, United States.,Computer Science and Mathematics Division, Oak Ridge National Laboratory, Oak Ridge, TN, United States
| | - Katie Shpanskaya
- Stanford University School of Medicine, Stanford, CA, United States
| | - Gonzalo Bello
- Department of Computer Science, North Carolina State University, Raleigh, NC, United States
| | - P Murali Doraiswamy
- Department of Psychiatry, Duke University, Durham, NC, United States.,Duke Institute for Brain Sciences, Duke University, Durham, NC, United States
| | - Nagiza F Samatova
- Department of Computer Science, North Carolina State University, Raleigh, NC, United States.,Computer Science and Mathematics Division, Oak Ridge National Laboratory, Oak Ridge, TN, United States
| |
Collapse
|
7
|
Ballester M, Ramayo-Caldas Y, Revilla M, Corominas J, Castelló A, Estellé J, Fernández AI, Folch JM. Integration of liver gene co-expression networks and eGWAs analyses highlighted candidate regulators implicated in lipid metabolism in pigs. Sci Rep 2017; 7:46539. [PMID: 28422154 PMCID: PMC5396199 DOI: 10.1038/srep46539] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2016] [Accepted: 03/22/2017] [Indexed: 12/14/2022] Open
Abstract
In the present study, liver co-expression networks and expression Genome Wide Association Study (eGWAS) were performed to identify DNA variants and molecular pathways implicated in the functional regulatory mechanisms of meat quality traits in pigs. With this purpose, the liver mRNA expression of 44 candidates genes related with lipid metabolism was analysed in 111 Iberian x Landrace backcross animals. The eGWAS identified 92 eSNPs located in seven chromosomal regions and associated with eight genes: CROT, CYP2U1, DGAT1, EGF, FABP1, FABP5, PLA2G12A, and PPARA. Remarkably, cis-eSNPs associated with FABP1 gene expression which may be determining the C18:2(n-6)/C18:3(n-3) ratio in backfat through the multiple interaction of DNA variants and genes were identified. Furthermore, a hotspot on SSC8 associated with the gene expression of eight genes was identified and the TBCK gene was pointed out as candidate gene regulating it. Our results also suggested that the PI3K-Akt-mTOR pathway plays an important role in the control of the analysed genes highlighting nuclear receptors as the NR3C1 or PPARA. Finally, sex-dimorphism associated with hepatic lipid metabolism was identified with over-representation of female-biased genes. These results increase our knowledge of the genetic architecture underlying fat composition traits.
Collapse
Affiliation(s)
- Maria Ballester
- Departament de Ciència Animal i dels Aliments, Facultat de Veterinària, Campus UAB, Bellaterra, 08193, Barcelona, Spain
- Plant and Animal Genomics, Centre de Recerca en Agrigenòmica (Consorci CSIC-IRTA-UAB-UB), Edifici CRAG, Campus UAB, Bellaterra, 08193, Barcelona, Spain
- IRTA, Genètica i Millora Animal, Torre Marimon, 08140 Caldes de Montbui, Spain
| | - Yuliaxis Ramayo-Caldas
- Departament de Ciència Animal i dels Aliments, Facultat de Veterinària, Campus UAB, Bellaterra, 08193, Barcelona, Spain
- Plant and Animal Genomics, Centre de Recerca en Agrigenòmica (Consorci CSIC-IRTA-UAB-UB), Edifici CRAG, Campus UAB, Bellaterra, 08193, Barcelona, Spain
- Génétique Animale et Biologie Intégrative, Institut National de la Recherche Agronomique, AgroParisTech, Université Paris-Saclay, 78350, Jouy-en-Josas, France
| | - Manuel Revilla
- Departament de Ciència Animal i dels Aliments, Facultat de Veterinària, Campus UAB, Bellaterra, 08193, Barcelona, Spain
- Plant and Animal Genomics, Centre de Recerca en Agrigenòmica (Consorci CSIC-IRTA-UAB-UB), Edifici CRAG, Campus UAB, Bellaterra, 08193, Barcelona, Spain
| | - Jordi Corominas
- Departament de Ciència Animal i dels Aliments, Facultat de Veterinària, Campus UAB, Bellaterra, 08193, Barcelona, Spain
- Plant and Animal Genomics, Centre de Recerca en Agrigenòmica (Consorci CSIC-IRTA-UAB-UB), Edifici CRAG, Campus UAB, Bellaterra, 08193, Barcelona, Spain
| | - Anna Castelló
- Departament de Ciència Animal i dels Aliments, Facultat de Veterinària, Campus UAB, Bellaterra, 08193, Barcelona, Spain
- Plant and Animal Genomics, Centre de Recerca en Agrigenòmica (Consorci CSIC-IRTA-UAB-UB), Edifici CRAG, Campus UAB, Bellaterra, 08193, Barcelona, Spain
| | - Jordi Estellé
- Génétique Animale et Biologie Intégrative, Institut National de la Recherche Agronomique, AgroParisTech, Université Paris-Saclay, 78350, Jouy-en-Josas, France
| | - Ana I. Fernández
- Departamento de Mejora Genética Animal, INIA, Ctra. de la Coruña km. 7, 28040, Madrid, Spain
| | - Josep M. Folch
- Departament de Ciència Animal i dels Aliments, Facultat de Veterinària, Campus UAB, Bellaterra, 08193, Barcelona, Spain
- Plant and Animal Genomics, Centre de Recerca en Agrigenòmica (Consorci CSIC-IRTA-UAB-UB), Edifici CRAG, Campus UAB, Bellaterra, 08193, Barcelona, Spain
| |
Collapse
|
8
|
Zhang YQ, Mao X, Guo QY, Lin N, Li S. Network Pharmacology-based Approaches Capture Essence of Chinese Herbal Medicines. CHINESE HERBAL MEDICINES 2016. [DOI: 10.1016/s1674-6384(16)60018-7] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022] Open
|
9
|
Angione C, Pratanwanich N, Lió P. A Hybrid of Metabolic Flux Analysis and Bayesian Factor Modeling for Multiomic Temporal Pathway Activation. ACS Synth Biol 2015; 4:880-9. [PMID: 25856685 DOI: 10.1021/sb5003407] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
The growing availability of multiomic data provides a highly comprehensive view of cellular processes at the levels of mRNA, proteins, metabolites, and reaction fluxes. However, due to probabilistic interactions between components depending on the environment and on the time course, casual, sometimes rare interactions may cause important effects in the cellular physiology. To date, interactions at the pathway level cannot be measured directly, and methodologies to predict pathway cross-correlations from reaction fluxes are still missing. Here, we develop a multiomic approach of flux-balance analysis combined with Bayesian factor modeling with the aim of detecting pathway cross-correlations and predicting metabolic pathway activation profiles. Starting from gene expression profiles measured in various environmental conditions, we associate a flux rate profile with each condition. We then infer pathway cross-correlations and identify the degrees of pathway activation with respect to the conditions and time course using Bayesian factor modeling. We test our framework on the most recent metabolic reconstruction of Escherichia coli in both static and dynamic environments, thus predicting the functionality of particular groups of reactions and how it varies over time. In a dynamic environment, our method can be readily used to characterize the temporal progression of pathway activation in response to given stimuli.
Collapse
Affiliation(s)
- Claudio Angione
- Computer Laboratory, University of Cambridge, Cambridge CB3 0FD, United Kingdom
| | | | - Pietro Lió
- Computer Laboratory, University of Cambridge, Cambridge CB3 0FD, United Kingdom
| |
Collapse
|
10
|
Dong X, Yambartsev A, Ramsey SA, Thomas LD, Shulzhenko N, Morgun A. Reverse enGENEering of Regulatory Networks from Big Data: A Roadmap for Biologists. Bioinform Biol Insights 2015; 9:61-74. [PMID: 25983554 PMCID: PMC4415676 DOI: 10.4137/bbi.s12467] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2014] [Revised: 02/16/2015] [Accepted: 02/17/2015] [Indexed: 12/29/2022] Open
Abstract
Omics technologies enable unbiased investigation of biological systems through massively parallel sequence acquisition or molecular measurements, bringing the life sciences into the era of Big Data. A central challenge posed by such omics datasets is how to transform these data into biological knowledge, for example, how to use these data to answer questions such as: Which functional pathways are involved in cell differentiation? Which genes should we target to stop cancer? Network analysis is a powerful and general approach to solve this problem consisting of two fundamental stages, network reconstruction, and network interrogation. Here we provide an overview of network analysis including a step-by-step guide on how to perform and use this approach to investigate a biological question. In this guide, we also include the software packages that we and others employ for each of the steps of a network analysis workflow.
Collapse
Affiliation(s)
- Xiaoxi Dong
- College of Pharmacy, Oregon State University, Corvallis, OR, USA
| | - Anatoly Yambartsev
- Department of Statistics, Institute of Mathematics and Statistics, University of Sao Paulo, Sao Paulo, SP, Brazil
| | - Stephen A Ramsey
- School of Electrical Engineering and Computer Science, Department of Biomedical Sciences, Oregon State University, Corvallis, OR, USA. ; College of Veterinary Medicine, Department of Biomedical Sciences, Oregon State University, Corvallis, OR, USA
| | - Lina D Thomas
- Department of Statistics, Institute of Mathematics and Statistics, University of Sao Paulo, Sao Paulo, SP, Brazil
| | - Natalia Shulzhenko
- College of Veterinary Medicine, Department of Biomedical Sciences, Oregon State University, Corvallis, OR, USA
| | - Andrey Morgun
- College of Pharmacy, Oregon State University, Corvallis, OR, USA
| |
Collapse
|
11
|
Pathway crosstalk analysis of microarray gene expression profile in human hepatocellular carcinoma. Pathol Oncol Res 2014; 21:563-9. [PMID: 25480734 DOI: 10.1007/s12253-014-9855-x] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/06/2013] [Accepted: 10/14/2014] [Indexed: 01/03/2023]
Abstract
Liver cancer is the third most common cause of cancer death in the world. Hepatocellular carcinoma (HCC) is the main pathological types in liver cancer, which amounts to 70-85 % of primary liver cancer in the world and 90 % in China. The aim of this study was to establish a PPI network and a pathway crosstalk network to isolate important dysfunctional pathways which play an important role in the pathogenesis of HCC. System biology approach was used in this research. A PPI network was firstly built and then a dysfunctional crosstalk network of HCC related pathways was constructed. Several important significant dysfunctional crosstalk pathways were identified. Basal transcription factors (hsa03022), Glycerophospholipid metabolism (hsa00564) and Metabolism of xenobiotics by cytochrome P450 (hsa00980) were significantly interact with Pathway in cancer (hsa05200). Besides, pathway Axon guidance (hsa04360) was also dysfunctional crosstalk with Pathway in cancer (hsa05200). The crosstalks among these pathways reveal some evidence that the pathways closely cooperated and play important tasks in HCC progression. Besides, the pathway hsa04360 dysfunctional crosstalk with the hsa05200 indicates there would be a same mechanism for HCC invasion and migration.
Collapse
|
12
|
Identifying the gene signatures from gene-pathway bipartite network guarantees the robust model performance on predicting the cancer prognosis. BIOMED RESEARCH INTERNATIONAL 2014; 2014:424509. [PMID: 25126556 PMCID: PMC4122106 DOI: 10.1155/2014/424509] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/16/2014] [Revised: 06/21/2014] [Accepted: 06/24/2014] [Indexed: 01/12/2023]
Abstract
For the purpose of improving the prediction of cancer prognosis in the clinical researches, various algorithms have been developed to construct the predictive models with the gene signatures detected by DNA microarrays. Due to the heterogeneity of the clinical samples, the list of differentially expressed genes (DEGs) generated by the statistical methods or the machine learning algorithms often involves a number of false positive genes, which are not associated with the phenotypic differences between the compared clinical conditions, and subsequently impacts the reliability of the predictive models. In this study, we proposed a strategy, which combined the statistical algorithm with the gene-pathway bipartite networks, to generate the reliable lists of cancer-related DEGs and constructed the models by using support vector machine for predicting the prognosis of three types of cancers, namely, breast cancer, acute myeloma leukemia, and glioblastoma. Our results demonstrated that, combined with the gene-pathway bipartite networks, our proposed strategy can efficiently generate the reliable cancer-related DEG lists for constructing the predictive models. In addition, the model performance in the swap analysis was similar to that in the original analysis, indicating the robustness of the models in predicting the cancer outcomes.
Collapse
|
13
|
Exploring the ligand-protein networks in traditional chinese medicine: current databases, methods and applications. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2014; 827:227-57. [PMID: 25387968 PMCID: PMC7120483 DOI: 10.1007/978-94-017-9245-5_14] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
While the concept of "single component-single target" in drug discovery seems to have come to an end, "Multi-component-multi-target" is considered to be another promising way out in this field. The Traditional Chinese Medicine (TCM), which has thousands of years' clinical application among China and other Asian countries, is the pioneer of the "Multi-component-multi-target" and network pharmacology. Hundreds of different components in a TCM prescription can cure the diseases or relieve the patients by modulating the network of potential therapeutic targets. Although there is no doubt of the efficacy, it is difficult to elucidate convincing underlying mechanism of TCM due to its complex composition and unclear pharmacology. Without thorough investigation of its potential targets and side effects, TCM is not able to generate large-scale medicinal benefits, especially in the days when scientific reductionism and quantification are dominant. The use of ligand-protein networks has been gaining significant value in the history of drug discovery while its application in TCM is still in its early stage. This article firstly surveys TCM databases for virtual screening that have been greatly expanded in size and data diversity in recent years. On that basis, different screening methods and strategies for identifying active ingredients and targets of TCM are outlined based on the amount of network information available, both on sides of ligand bioactivity and the protein structures. Furthermore, applications of successful in silico target identification attempts are discussed in details along with experiments in exploring the ligand-protein networks of TCM. Finally, it will be concluded that the prospective application of ligand-protein networks can be used not only to predict protein targets of a small molecule, but also to explore the mode of action of TCM.
Collapse
|
14
|
Exploring the ligand-protein networks in traditional chinese medicine: current databases, methods, and applications. EVIDENCE-BASED COMPLEMENTARY AND ALTERNATIVE MEDICINE 2013; 2013:806072. [PMID: 23818932 PMCID: PMC3684027 DOI: 10.1155/2013/806072] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/24/2013] [Revised: 05/06/2013] [Accepted: 05/07/2013] [Indexed: 12/22/2022]
Abstract
The traditional Chinese medicine (TCM), which has thousands of years of clinical application among China and other Asian countries, is the pioneer of the “multicomponent-multitarget” and network pharmacology. Although there is no doubt of the efficacy, it is difficult to elucidate convincing underlying mechanism of TCM due to its complex composition and unclear pharmacology. The use of ligand-protein networks has been gaining significant value in the history of drug discovery while its application in TCM is still in its early stage. This paper firstly surveys TCM databases for virtual screening that have been greatly expanded in size and data diversity in recent years. On that basis, different screening methods and strategies for identifying active ingredients and targets of TCM are outlined based on the amount of network information available, both on sides of ligand bioactivity and the protein structures. Furthermore, applications of successful in silico target identification attempts are discussed in detail along with experiments in exploring the ligand-protein networks of TCM. Finally, it will be concluded that the prospective application of ligand-protein networks can be used not only to predict protein targets of a small molecule, but also to explore the mode of action of TCM.
Collapse
|