1
|
Giudice G, Chen H, Koutsandreas T, Petsalaki E. phuEGO: A Network-Based Method to Reconstruct Active Signaling Pathways From Phosphoproteomics Datasets. Mol Cell Proteomics 2024; 23:100771. [PMID: 38642805 PMCID: PMC11134849 DOI: 10.1016/j.mcpro.2024.100771] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2023] [Revised: 04/08/2024] [Accepted: 04/17/2024] [Indexed: 04/22/2024] Open
Abstract
Signaling networks are critical for virtually all cell functions. Our current knowledge of cell signaling has been summarized in signaling pathway databases, which, while useful, are highly biased toward well-studied processes, and do not capture context specific network wiring or pathway cross-talk. Mass spectrometry-based phosphoproteomics data can provide a more unbiased view of active cell signaling processes in a given context, however, it suffers from low signal-to-noise ratio and poor reproducibility across experiments. While progress in methods to extract active signaling signatures from such data has been made, there are still limitations with respect to balancing bias and interpretability. Here we present phuEGO, which combines up-to-three-layer network propagation with ego network decomposition to provide small networks comprising active functional signaling modules. PhuEGO boosts the signal-to-noise ratio from global phosphoproteomics datasets, enriches the resulting networks for functional phosphosites and allows the improved comparison and integration across datasets. We applied phuEGO to five phosphoproteomics data sets from cell lines collected upon infection with SARS CoV2. PhuEGO was better able to identify common active functions across datasets and to point to a subnetwork enriched for known COVID-19 targets. Overall, phuEGO provides a flexible tool to the community for the improved functional interpretation of global phosphoproteomics datasets.
Collapse
Affiliation(s)
- Girolamo Giudice
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Cambridgeshire, United Kingdom
| | - Haoqi Chen
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Cambridgeshire, United Kingdom
| | - Thodoris Koutsandreas
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Cambridgeshire, United Kingdom
| | - Evangelia Petsalaki
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Cambridgeshire, United Kingdom.
| |
Collapse
|
2
|
Ma T, Wang J. GraphPath: a graph attention model for molecular stratification with interpretability based on the pathway-pathway interaction network. Bioinformatics 2024; 40:btae165. [PMID: 38530778 PMCID: PMC11007237 DOI: 10.1093/bioinformatics/btae165] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2023] [Revised: 02/22/2024] [Accepted: 03/22/2024] [Indexed: 03/28/2024] Open
Abstract
MOTIVATION Studying the molecular heterogeneity of cancer is essential for achieving personalized therapy. At the same time, understanding the biological processes that drive cancer development can lead to the identification of valuable therapeutic targets. Therefore, achieving accurate and interpretable clinical predictions requires paramount attention to thoroughly characterizing patients at both the molecular and biological pathway levels. RESULTS Here, we present GraphPath, a biological knowledge-driven graph neural network with multi-head self-attention mechanism that implements the pathway-pathway interaction network. We train GraphPath to classify the cancer status of patients with prostate cancer based on their multi-omics profiling. Experiment results show that our method outperforms P-NET and other baseline methods. Besides, two external cohorts are used to validate that the model can be generalized to unseen samples with adequate predictive performance. We reduce the dimensionality of latent pathway embeddings and visualize corresponding classes to further demonstrate the optimal performance of the model. Additionally, since GraphPath's predictions are interpretable, we identify target cancer-associated pathways that significantly contribute to the model's predictions. Such a robust and interpretable model has the potential to greatly enhance our understanding of cancer's biological mechanisms and accelerate the development of targeted therapies. AVAILABILITY AND IMPLEMENTATION https://github.com/amazingma/GraphPath.
Collapse
Affiliation(s)
- Teng Ma
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 41083, Hunan, China
| | - Jianxin Wang
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 41083, Hunan, China
| |
Collapse
|
3
|
Wang S, Wu R, Lu J, Jiang Y, Huang T, Cai YD. Protein-protein interaction networks as miners of biological discovery. Proteomics 2022; 22:e2100190. [PMID: 35567424 DOI: 10.1002/pmic.202100190] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2021] [Revised: 03/28/2022] [Accepted: 04/29/2022] [Indexed: 11/12/2022]
Abstract
Protein-protein interactions (PPIs) form the basis of a myriad of biological pathways and mechanism, such as the formation of protein-complexes or the components of signaling cascades. Here, we reviewed experimental methods for identifying PPI pairs, including yeast two-hybrid, mass spectrometry, co-localization, and co-immunoprecipitation. Furthermore, a range of computational methods leveraging biochemical properties, evolution history, protein structures and more have enabled identification of additional PPIs. Given the wealth of known PPIs, we reviewed important network methods to construct and analyze networks of PPIs. These methods aid biological discovery through identifying hub genes and dynamic changes in the network, and have been thoroughly applied in various fields of biological research. Lastly, we discussed the challenges and future direction of research utilizing the power of PPI networks. This article is protected by copyright. All rights reserved.
Collapse
Affiliation(s)
- Steven Wang
- Department of Biological Sciences, Columbia University, New York, NY, USA
| | - Runxin Wu
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Jiaqi Lu
- Department of Chemistry and Biochemistry, University of Notre Dame, Notre Dame, IN, USA
| | - Yijia Jiang
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Tao Huang
- Bio-Med Big Data Center, Shanghai Institute of Nutrition and Health, Chinese Academy of Sciences, Shanghai, China
| | - Yu-Dong Cai
- School of Life Sciences, Shanghai University, Shanghai, China
| |
Collapse
|
4
|
Wang Y, Yang C, Li W, Shen Y, Deng J, Lu W, Jin J, Liu Y, Liu Q. Identification of colon tumor marker NKD1 via integrated bioinformatics analysis and experimental validation. Cancer Med 2021; 10:7383-7394. [PMID: 34547189 PMCID: PMC8525156 DOI: 10.1002/cam4.4224] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2021] [Revised: 08/07/2021] [Accepted: 08/09/2021] [Indexed: 12/15/2022] Open
Abstract
Background Colorectal cancer is an important death‐related disease in the worldwide. However, specific colon cancer tumor markers currently used for diagnosis and treatment are few. The purpose of this study is to screen the potential colon cancer markers by bioinformatics and verify the results with experiments. Methods Gene expression data were downloaded from two different databases: TCGA database and GEO datasets, which were then analyzed by two different methods (difference analysis and WGCNA method). Venn and PPI analysis obtained the potential core genes, which were then performed the GO enrichment and KEGG pathway analysis. Expressions levels of NKD1 in colon carcinoma tissues were further confirmed by immunohistochemical staining and western blot assays. Moreover, the function was measured by MTT, clone formation, and tumor transplantation experiments. Importantly, co‐immunoprecipitation, immunofluorescence, and protein stability assays were further performed to explore the underlying mechanism of NKD1 promoting cell proliferation. Results Nine potential core genes highly expressed in colon cancer samples were screened out by bioinformatics analysis. NKD1, one of the hub genes, highly expressed in the colon carcinoma tissues could enhance the proliferation of colon cancer cells. Mechanism research demonstrated that NKD1 was essential for the combination between Wnt signalosome (DVL) and β‐catenin, and that NKD1 knockout remarkably decreased the β‐catenin expression. Immunofluorescence assays further implied that NKD1 knockout significantly inhibited β‐catenin nuclear accumulation. Importantly, the stability of β‐catenin proteins was maintained by NKD1 in the colon cancer cells. Conclusion We believe that NKD1 well expressed in the colorectal carcinoma tissues can enhance the proliferation of colon cancer cells. Furthermore, the functions that NKD1 may have in colon cancer cells should be different from that NKD1 has played in the zebrafish. Thus, NKD1 could be a specific colorectal cancer marker.
Collapse
Affiliation(s)
- Yue Wang
- The Third Affiliated Hospital of Soochow University, Changzhou, Jiangsu Province, China.,Clinical Oncology Laboratory, Changzhou Tumor Hospital Affiliated to Soochow University, Changzhou, Changzhou, China.,Department of Oncology, Wujin Hospital Affiliated with Jiangsu University, Jiangsu Province, China.,Department of Oncology, The Wujin Clinical College of Xuzhou Medical University, Jiangsu Province, China
| | - Chunxia Yang
- Department of Oncology, Wujin Hospital Affiliated with Jiangsu University, Jiangsu Province, China.,Department of Oncology, The Wujin Clinical College of Xuzhou Medical University, Jiangsu Province, China
| | - Wenjing Li
- Department of Oncology, Wujin Hospital Affiliated with Jiangsu University, Jiangsu Province, China.,Department of Oncology, The Wujin Clinical College of Xuzhou Medical University, Jiangsu Province, China
| | - Ying Shen
- Department of Oncology, Wujin Hospital Affiliated with Jiangsu University, Jiangsu Province, China.,Department of Oncology, The Wujin Clinical College of Xuzhou Medical University, Jiangsu Province, China
| | - Jianzhong Deng
- Department of Oncology, Wujin Hospital Affiliated with Jiangsu University, Jiangsu Province, China.,Department of Oncology, The Wujin Clinical College of Xuzhou Medical University, Jiangsu Province, China
| | - Wenbin Lu
- Department of Oncology, Wujin Hospital Affiliated with Jiangsu University, Jiangsu Province, China.,Department of Oncology, The Wujin Clinical College of Xuzhou Medical University, Jiangsu Province, China
| | - Jianhua Jin
- Department of Oncology, Wujin Hospital Affiliated with Jiangsu University, Jiangsu Province, China.,Department of Oncology, The Wujin Clinical College of Xuzhou Medical University, Jiangsu Province, China
| | - Yongping Liu
- The Third Affiliated Hospital of Soochow University, Changzhou, Jiangsu Province, China.,Clinical Oncology Laboratory, Changzhou Tumor Hospital Affiliated to Soochow University, Changzhou, Changzhou, China
| | - Qian Liu
- Department of Oncology, Wujin Hospital Affiliated with Jiangsu University, Jiangsu Province, China.,Department of Oncology, The Wujin Clinical College of Xuzhou Medical University, Jiangsu Province, China
| |
Collapse
|
5
|
Lee S, Lim S, Lee T, Sung I, Kim S. Cancer subtype classification and modeling by pathway attention and propagation. Bioinformatics 2020; 36:3818-3824. [PMID: 32207514 DOI: 10.1093/bioinformatics/btaa203] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2019] [Revised: 01/13/2020] [Accepted: 03/19/2020] [Indexed: 01/04/2023] Open
Abstract
MOTIVATION Biological pathway is an important curated knowledge of biological processes. Thus, cancer subtype classification based on pathways will be very useful to understand differences in biological mechanisms among cancer subtypes. However, pathways include only a fraction of the entire gene set, only one-third of human genes in KEGG, and pathways are fragmented. For this reason, there are few computational methods to use pathways for cancer subtype classification. RESULTS We present an explainable deep-learning model with attention mechanism and network propagation for cancer subtype classification. Each pathway is modeled by a graph convolutional network. Then, a multi-attention-based ensemble model combines several hundreds of pathways in an explainable manner. Lastly, network propagation on pathway-gene network explains why gene expression profiles in subtypes are different. In experiments with five TCGA cancer datasets, our method achieved very good classification accuracies and, additionally, identified subtype-specific pathways and biological functions. AVAILABILITY AND IMPLEMENTATION The source code is available at http://biohealth.snu.ac.kr/software/GCN_MAE. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Sangseon Lee
- Department of Computer Science and Engineering, Institute of Engineering Research
| | | | - Taeheon Lee
- Department of Computer Science and Engineering, Institute of Engineering Research
| | - Inyoung Sung
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul 08826, Republic of Korea
| | - Sun Kim
- Department of Computer Science and Engineering, Institute of Engineering Research.,Bioinformatics Institute.,Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul 08826, Republic of Korea
| |
Collapse
|
6
|
Kim HJ, Moon JH, Chung H, Shin JS, Kim B, Kim JM, Kim JS, Yoon IH, Min BH, Kang SJ, Kim YH, Jo K, Choi J, Chae H, Lee WW, Kim S, Park CG. Bioinformatic analysis of peripheral blood RNA-sequencing sensitively detects the cause of late graft loss following overt hyperglycemia in pig-to-nonhuman primate islet xenotransplantation. Sci Rep 2019; 9:18835. [PMID: 31827198 PMCID: PMC6906328 DOI: 10.1038/s41598-019-55417-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2019] [Accepted: 11/12/2019] [Indexed: 01/19/2023] Open
Abstract
Clinical islet transplantation has recently been a promising treatment option for intractable type 1 diabetes patients. Although early graft loss has been well studied and controlled, the mechanisms of late graft loss largely remains obscure. Since long-term islet graft survival had not been achieved in islet xenotransplantation, it has been impossible to explore the mechanism of late islet graft loss. Fortunately, recent advances where consistent long-term survival (≥6 months) of adult porcine islet grafts was achieved in five independent, diabetic nonhuman primates (NHPs) enabled us to investigate on the late graft loss. Regardless of the conventional immune monitoring methods applied in the post-transplant period, the initiation of late graft loss could rarely be detected before the overt graft loss observed via uncontrolled blood glucose level. Thus, we retrospectively analyzed the gene expression profiles in 2 rhesus monkey recipients using peripheral blood RNA-sequencing (RNA-seq) data to find out the potential cause(s) of late graft loss. Bioinformatic analyses showed that highly relevant immunological pathways were activated in the animal which experienced late graft failure. Further connectivity analyses revealed that the activation of T cell signaling pathways was the most prominent, suggesting that T cell-mediated graft rejection could be the cause of the late-phase islet loss. Indeed, the porcine islets in the biopsied monkey liver samples were heavily infiltrated with CD3+ T cells. Furthermore, hypothesis test using a computational experiment reinforced our conclusion. Taken together, we suggest that bioinformatics analyses with peripheral blood RNA-seq could unveil the cause of insidious late islet graft loss.
Collapse
Affiliation(s)
- Hyun-Je Kim
- Xenotransplantation Research Center, Seoul National University College of Medicine, Seoul, 03080, Republic of Korea
- Department of Microbiology and Immunology, Seoul National University College of Medicine, Seoul, 03080, Republic of Korea
- Department of Biomedical Sciences, Seoul National University Graduate School, Seoul, 03080, Republic of Korea
- Department of Dermatology and the Laboratory of Inflammatory Skin Diseases, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
| | - Ji Hwan Moon
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, 08826, Republic of Korea
- Department of Biological Sciences, University at Buffalo, Buffalo, NY, 14260, USA
| | - Hyunwoo Chung
- Xenotransplantation Research Center, Seoul National University College of Medicine, Seoul, 03080, Republic of Korea
- Department of Microbiology and Immunology, Seoul National University College of Medicine, Seoul, 03080, Republic of Korea
- Department of Biomedical Sciences, Seoul National University Graduate School, Seoul, 03080, Republic of Korea
| | - Jun-Seop Shin
- Xenotransplantation Research Center, Seoul National University College of Medicine, Seoul, 03080, Republic of Korea
| | - Bongi Kim
- Department of Microbiology and Immunology, Seoul National University College of Medicine, Seoul, 03080, Republic of Korea
| | - Jong-Min Kim
- Xenotransplantation Research Center, Seoul National University College of Medicine, Seoul, 03080, Republic of Korea
| | - Jung-Sik Kim
- Xenotransplantation Research Center, Seoul National University College of Medicine, Seoul, 03080, Republic of Korea
| | - Il-Hee Yoon
- Xenotransplantation Research Center, Seoul National University College of Medicine, Seoul, 03080, Republic of Korea
| | - Byoung-Hoon Min
- Xenotransplantation Research Center, Seoul National University College of Medicine, Seoul, 03080, Republic of Korea
| | - Seong-Jun Kang
- Xenotransplantation Research Center, Seoul National University College of Medicine, Seoul, 03080, Republic of Korea
- Department of Microbiology and Immunology, Seoul National University College of Medicine, Seoul, 03080, Republic of Korea
- Department of Biomedical Sciences, Seoul National University Graduate School, Seoul, 03080, Republic of Korea
| | - Yong-Hee Kim
- Xenotransplantation Research Center, Seoul National University College of Medicine, Seoul, 03080, Republic of Korea
| | - Kyuri Jo
- Department of Computer Engineering, Chungbuk National University, Cheongju, 28644, Republic of Korea
| | - Joungmin Choi
- Division of Computer Science, Sookmyung Women's University, Seoul, 04310, Republic of Korea
| | - Heejoon Chae
- Division of Computer Science, Sookmyung Women's University, Seoul, 04310, Republic of Korea
| | - Won-Woo Lee
- Xenotransplantation Research Center, Seoul National University College of Medicine, Seoul, 03080, Republic of Korea
- Department of Microbiology and Immunology, Seoul National University College of Medicine, Seoul, 03080, Republic of Korea
- Department of Biomedical Sciences, Seoul National University Graduate School, Seoul, 03080, Republic of Korea
| | - Sun Kim
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, 08826, Republic of Korea.
- Bioinformatics Institute, Department of Computer Science and Engineering, Seoul National University, Seoul, 08826, Republic of Korea.
- Department of Computer Science & Engineering, Seoul National University, Seoul, 08826, Republic of Korea.
| | - Chung-Gyu Park
- Xenotransplantation Research Center, Seoul National University College of Medicine, Seoul, 03080, Republic of Korea.
- Department of Microbiology and Immunology, Seoul National University College of Medicine, Seoul, 03080, Republic of Korea.
- Department of Biomedical Sciences, Seoul National University Graduate School, Seoul, 03080, Republic of Korea.
- Cancer Research Institute, Seoul National University College of Medicine, Seoul, 03080, Republic of Korea.
- Institute of Endemic Diseases, Seoul National University College of Medicine, Seoul, 03080, Republic of Korea.
- Biomedical Research Institute, Seoul National University Hospital, Seoul, 03080, Republic of Korea.
| |
Collapse
|
7
|
Youssef I, Law J, Ritz A. Integrating protein localization with automated signaling pathway reconstruction. BMC Bioinformatics 2019; 20:505. [PMID: 31787091 PMCID: PMC6886211 DOI: 10.1186/s12859-019-3077-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
Background Understanding cellular responses via signal transduction is a core focus in systems biology. Tools to automatically reconstruct signaling pathways from protein-protein interactions (PPIs) can help biologists generate testable hypotheses about signaling. However, automatic reconstruction of signaling pathways suffers from many interactions with the same confidence score leading to many equally good candidates. Further, some reconstructions are biologically misleading due to ignoring protein localization information. Results We propose LocPL, a method to improve the automatic reconstruction of signaling pathways from PPIs by incorporating information about protein localization in the reconstructions. The method relies on a dynamic program to ensure that the proteins in a reconstruction are localized in cellular compartments that are consistent with signal transduction from the membrane to the nucleus. LocPL and existing reconstruction algorithms are applied to two PPI networks and assessed using both global and local definitions of accuracy. LocPL produces more accurate and biologically meaningful reconstructions on a versatile set of signaling pathways. Conclusion LocPL is a powerful tool to automatically reconstruct signaling pathways from PPIs that leverages cellular localization information about proteins. The underlying dynamic program and signaling model are flexible enough to study cellular signaling under different settings of signaling flow across the cellular compartments.
Collapse
Affiliation(s)
- Ibrahim Youssef
- Biomedical Engineering Department, Cairo University, Giza, 12613, Egypt.,Biology Department, Reed College, Portland, OR 97202, USA
| | - Jeffrey Law
- Genetics, Bioinformatics, and Computational Biology, Virginia Tech, Blacksburg, VA 24061, USA
| | - Anna Ritz
- Biology Department, Reed College, Portland, OR 97202, USA.
| |
Collapse
|
8
|
Bonnici V, Busato F, Aldegheri S, Akhmedov M, Cascione L, Carmena AA, Bertoni F, Bombieri N, Kwee I, Giugno R. cuRnet: an R package for graph traversing on GPU. BMC Bioinformatics 2018; 19:356. [PMID: 30367572 PMCID: PMC6191969 DOI: 10.1186/s12859-018-2310-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
BACKGROUND R has become the de-facto reference analysis environment in Bioinformatics. Plenty of tools are available as packages that extend the R functionality, and many of them target the analysis of biological networks. Several algorithms for graphs, which are the most adopted mathematical representation of networks, are well-known examples of applications that require high-performance computing, and for which classic sequential implementations are becoming inappropriate. In this context, parallel approaches targeting GPU architectures are becoming pervasive to deal with the execution time constraints. Although R packages for parallel execution on GPUs are already available, none of them provides graph algorithms. RESULTS This work presents cuRnet, a R package that provides a parallel implementation for GPUs of the breath-first search (BFS), the single-source shortest paths (SSSP), and the strongly connected components (SCC) algorithms. The package allows offloading computing intensive applications to GPU devices for massively parallel computation and to speed up the runtime up to one order of magnitude with respect to the standard sequential computations on CPU. We have tested cuRnet on a benchmark of large protein interaction networks and for the interpretation of high-throughput omics data thought network analysis. CONCLUSIONS cuRnet is a R package to speed up graph traversal and analysis through parallel computation on GPUs. We show the efficiency of cuRnet applied both to biological network analysis, which requires basic graph algorithms, and to complex existing procedures built upon such algorithms.
Collapse
Affiliation(s)
- Vincenzo Bonnici
- Department of Computer Science, University of Verona, Strada le Grazie, 15, Italy, Verona, Italy
| | - Federico Busato
- Department of Computer Science, University of Verona, Strada le Grazie, 15, Italy, Verona, Italy
| | - Stefano Aldegheri
- Department of Computer Science, University of Verona, Strada le Grazie, 15, Italy, Verona, Italy
| | - Murodzhon Akhmedov
- Institute of Oncology Research (IOR), Via Vincenzo Vela 6, Bellinzona, Switzerland
| | - Luciano Cascione
- Institute of Oncology Research (IOR), Via Vincenzo Vela 6, Bellinzona, Switzerland
| | | | - Francesco Bertoni
- Institute of Oncology Research (IOR), Via Vincenzo Vela 6, Bellinzona, Switzerland
| | - Nicola Bombieri
- Department of Computer Science, University of Verona, Strada le Grazie, 15, Italy, Verona, Italy
| | - Ivo Kwee
- Institute of Oncology Research (IOR), Via Vincenzo Vela 6, Bellinzona, Switzerland
| | - Rosalba Giugno
- Department of Computer Science, University of Verona, Strada le Grazie, 15, Italy, Verona, Italy.
| |
Collapse
|
9
|
Lee S, Park Y, Kim S. MIDAS: Mining differentially activated subpaths of KEGG pathways from multi-class RNA-seq data. Methods 2017; 124:13-24. [PMID: 28579402 DOI: 10.1016/j.ymeth.2017.05.026] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2017] [Accepted: 05/30/2017] [Indexed: 11/18/2022] Open
Abstract
Pathway based analysis of high throughput transcriptome data is a widely used approach to investigate biological mechanisms. Since a pathway consists of multiple functions, the recent approach is to determine condition specific sub-pathways or subpaths. However, there are several challenges. First, few existing methods utilize explicit gene expression information from RNA-seq. More importantly, subpath activity is usually an average of statistical scores, e.g., correlations, of edges in a candidate subpath, which fails to reflect gene expression quantity information. In addition, none of existing methods can handle multiple phenotypes. To address these technical problems, we designed and implemented an algorithm, MIDAS, that determines condition specific subpaths, each of which has different activities across multiple phenotypes. MIDAS utilizes gene expression quantity information fully and the network centrality information to determine condition specific subpaths. To test performance of our tool, we used TCGA breast cancer RNA-seq gene expression profiles with five molecular subtypes. 36 differentially activate subpaths were determined. The utility of our method, MIDAS, was demonstrated in four ways. All 36 subpaths are well supported by the literature information. Subsequently, we showed that these subpaths had a good discriminant power for five cancer subtype classification and also had a prognostic power in terms of survival analysis. Finally, in a performance comparison of MIDAS to a recent subpath prediction method, PATHOME, our method identified more subpaths and much more genes that are well supported by the literature information. AVAILABILITY http://biohealth.snu.ac.kr/software/MIDAS/.
Collapse
Affiliation(s)
- Sangseon Lee
- Department of Computer Science and Engineering, Seoul National University, Seoul, Republic of Korea
| | - Youngjune Park
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea
| | - Sun Kim
- Department of Computer Science and Engineering, Seoul National University, Seoul, Republic of Korea; Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea; Bioinformatics Institute, Seoul National University, Seoul, Republic of Korea.
| |
Collapse
|