1
|
Nguyen QH, Nguyen H, Oh EC, Nguyen T. Current approaches and outstanding challenges of functional annotation of metabolites: a comprehensive review. Brief Bioinform 2024; 25:bbae498. [PMID: 39397425 PMCID: PMC11471905 DOI: 10.1093/bib/bbae498] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2024] [Revised: 09/03/2024] [Accepted: 10/02/2024] [Indexed: 10/15/2024] Open
Abstract
Metabolite profiling is a powerful approach for the clinical diagnosis of complex diseases, ranging from cardiometabolic diseases, cancer, and cognitive disorders to respiratory pathologies and conditions that involve dysregulated metabolism. Because of the importance of systems-level interpretation, many methods have been developed to identify biologically significant pathways using metabolomics data. In this review, we first describe a complete metabolomics workflow (sample preparation, data acquisition, pre-processing, downstream analysis, etc.). We then comprehensively review 24 approaches capable of performing functional analysis, including those that combine metabolomics data with other types of data to investigate the disease-relevant changes at multiple omics layers. We discuss their availability, implementation, capability for pre-processing and quality control, supported omics types, embedded databases, pathway analysis methodologies, and integration techniques. We also provide a rating and evaluation of each software, focusing on their key technique, software accessibility, documentation, and user-friendliness. Following our guideline, life scientists can easily choose a suitable method depending on method rating, available data, input format, and method category. More importantly, we highlight outstanding challenges and potential solutions that need to be addressed by future research. To further assist users in executing the reviewed methods, we provide wrappers of the software packages at https://github.com/tinnlab/metabolite-pathway-review-docker.
Collapse
Affiliation(s)
- Quang-Huy Nguyen
- Department of Computer Science and Software Engineering, Auburn University, Auburn, AL 36849, United States
| | - Ha Nguyen
- Department of Computer Science and Software Engineering, Auburn University, Auburn, AL 36849, United States
| | - Edwin C Oh
- Department of Internal Medicine, UNLV School of Medicine, University of Nevada, Las Vegas, NV 89154, United States
| | - Tin Nguyen
- Department of Computer Science and Software Engineering, Auburn University, Auburn, AL 36849, United States
| |
Collapse
|
2
|
Xue R, Xie M, Wu Z, Wang S, Zhang Y, Han Z, Li C, Tang Q, Wang L, Li D, Wang S, Yang H, Zhao RC. Mesenchymal Stem Cell-Derived Exosomes Promote Recovery of The Facial Nerve Injury through Regulating Macrophage M1 and M2 Polarization by Targeting the P38 MAPK/NF-Κb Pathway. Aging Dis 2024; 15:851-868. [PMID: 37548941 PMCID: PMC10917525 DOI: 10.14336/ad.2023.0719-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Accepted: 07/19/2023] [Indexed: 08/08/2023] Open
Abstract
Facial nerve (FN) injury seriously affects human social viability and causes a heavy economic and social burden. Although mesenchymal stem cell-derived exosomes (MSC-Exos) promise therapeutic benefits for injury repair, there has been no evaluation of the impact of MSC-Exos administration on FN repair. Herein, we explore the function of MSC-Exos in the immunomodulation of macrophages and their effects in repairing FN injury. An ultracentrifugation technique was used to separate exosomes from the MSC supernatant. Administrating MSC-Exos to SD rats via local injection after FN injury promoted axon regeneration and myelination and alleviated local and systemic inflammation. MSC-Exos facilitated M2 polarization and reduced the M1-M2 polarization ratio. miRNA sequencing of MSC-Exos and previous literature showed that the MAPK/NF-κb pathway was a downstream target of macrophage polarization. We confirmed this hypothesis both in vivo and in vitro. Our findings show that MSC-Exos are a potential candidate for treating FN injury because they may have superior benefits for FN injury recovery and can decrease inflammation by controlling the heterogeneity of macrophages, which is regulated by the p38 MAPK/NF-κb pathway.
Collapse
Affiliation(s)
- Ruoyan Xue
- Department of Otolaryngology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China.
| | - Mengyao Xie
- Department of Otolaryngology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China.
| | - Zhiyuan Wu
- Department of Otolaryngology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China.
- Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China.
| | - Shu Wang
- Department of Otolaryngology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China.
| | - Yongli Zhang
- Department of Otolaryngology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China.
| | - Zhijin Han
- Department of Otolaryngology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China.
| | - Chen Li
- Department of Otolaryngology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China.
| | - Qi Tang
- Department of Otolaryngology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China.
| | - Liping Wang
- Institute of Basic Medical Sciences Chinese Academy of Medical Sciences, School of Basic Medicine Peking Union Medical College, Peking Union Medical College Hospital, Center of Excellence in Tissue Engineering Chinese Academy of Medical Sciences, Beijing Key Laboratory, Beijing, China.
| | - Di Li
- Institute of Basic Medical Sciences Chinese Academy of Medical Sciences, School of Basic Medicine Peking Union Medical College, Peking Union Medical College Hospital, Center of Excellence in Tissue Engineering Chinese Academy of Medical Sciences, Beijing Key Laboratory, Beijing, China.
| | - Shihua Wang
- Institute of Basic Medical Sciences Chinese Academy of Medical Sciences, School of Basic Medicine Peking Union Medical College, Peking Union Medical College Hospital, Center of Excellence in Tissue Engineering Chinese Academy of Medical Sciences, Beijing Key Laboratory, Beijing, China.
| | - Hua Yang
- Department of Otolaryngology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China.
| | - Robert Chunhua Zhao
- Institute of Basic Medical Sciences Chinese Academy of Medical Sciences, School of Basic Medicine Peking Union Medical College, Peking Union Medical College Hospital, Center of Excellence in Tissue Engineering Chinese Academy of Medical Sciences, Beijing Key Laboratory, Beijing, China.
- School of Life Sciences, Shanghai University, Shanghai, China.
| |
Collapse
|
3
|
Chang LY, Lee MZ, Wu Y, Lee WK, Ma CL, Chang JM, Chen CW, Huang TC, Lee CH, Lee JC, Tseng YY, Lin CY. Gene set correlation enrichment analysis for interpreting and annotating gene expression profiles. Nucleic Acids Res 2024; 52:e17. [PMID: 38096046 PMCID: PMC10853793 DOI: 10.1093/nar/gkad1187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2022] [Revised: 11/17/2023] [Accepted: 11/29/2023] [Indexed: 02/10/2024] Open
Abstract
Pathway analysis, including nontopology-based (non-TB) and topology-based (TB) methods, is widely used to interpret the biological phenomena underlying differences in expression data between two phenotypes. By considering dependencies and interactions between genes, TB methods usually perform better than non-TB methods in identifying pathways that include closely relevant or directly causative genes for a given phenotype. However, most TB methods may be limited by incomplete pathway data used as the reference network or by difficulties in selecting appropriate reference networks for different research topics. Here, we propose a gene set correlation enrichment analysis method, Gscore, based on an expression dataset-derived coexpression network to examine whether a differentially expressed gene (DEG) list (or each of its DEGs) is associated with a known gene set. Gscore is better able to identify target pathways in 89 human disease expression datasets than eight other state-of-the-art methods and offers insight into how disease-wide and pathway-wide associations reflect clinical outcomes. When applied to RNA-seq data from COVID-19-related cells and patient samples, Gscore provided a means for studying how DEGs are implicated in COVID-19-related pathways. In summary, Gscore offers a powerful analytical approach for annotating individual DEGs, DEG lists, and genome-wide expression profiles based on existing biological knowledge.
Collapse
Affiliation(s)
- Lan-Yun Chang
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
| | - Meng-Zhan Lee
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
| | - Yujia Wu
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
| | - Wen-Kai Lee
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
| | - Chia-Liang Ma
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
| | - Jun-Mao Chang
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
| | - Ciao-Wen Chen
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
| | - Tzu-Chun Huang
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
| | - Chia-Hwa Lee
- School of Medical Laboratory Science and Biotechnology, College of Medical Science and Technology, Taipei Medical University, New Taipei City 235, Taiwan
- Center for Intelligent Drug Systems and Smart Bio-devices (IDSB), National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
- TMU Research Center of Cancer Translational Medicine, Taipei Medical University, Taipei 110, Taiwan
- Ph.D. Program in Medical Biotechnology, College of Medical Science and Technology, Taipei Medical University, New Taipei City 235, Taiwan
| | - Jih-Chin Lee
- Department of Otolaryngology-Head and Neck Surgery, Tri-Service General Hospital, National Defense Medical Center, Taipei 110, Taiwan
| | - Yu-Yao Tseng
- Department of Food Science, Nutrition, and Nutraceutical Biotechnology, Shih Chien University, Taipei 104, Taiwan
| | - Chun-Yu Lin
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
- Center for Intelligent Drug Systems and Smart Bio-devices (IDSB), National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
- Department of Biological Science and Technology, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
- Cancer and Immunology Research Center, National Yang Ming Chiao Tung University, Taipei 112, Taiwan
- Institute of Data Science and Engineering, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
- School of Dentistry, Kaohsiung Medical University, Kaohsiung 807, Taiwan
| |
Collapse
|
4
|
Yue Z, Slominski R, Bharti S, Chen JY. PAGER Web APP: An Interactive, Online Gene Set and Network Interpretation Tool for Functional Genomics. Front Genet 2022; 13:820361. [PMID: 35495152 PMCID: PMC9039620 DOI: 10.3389/fgene.2022.820361] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Accepted: 03/17/2022] [Indexed: 12/30/2022] Open
Abstract
Functional genomics studies have helped researchers annotate differentially expressed gene lists, extract gene expression signatures, and identify biological pathways from omics profiling experiments conducted on biological samples. The current geneset, network, and pathway analysis (GNPA) web servers, e.g., DAVID, EnrichR, WebGestaltR, or PAGER, do not allow automated integrative functional genomic downstream analysis. In this study, we developed a new web-based interactive application, "PAGER Web APP", which supports online R scripting of integrative GNPA. In a case study of melanoma drug resistance, we showed that the new PAGER Web APP enabled us to discover highly relevant pathways and network modules, leading to novel biological insights. We also compared PAGER Web APP's pathway analysis results retrieved among PAGER, EnrichR, and WebGestaltR to show its advantages in integrative GNPA. The interactive online web APP is publicly accessible from the link, https://aimed-lab.shinyapps.io/PAGERwebapp/.
Collapse
Affiliation(s)
- Zongliang Yue
- Informatics Institute in the School of Medicine, The University of Alabama at Birmingham, Birmingham, AL, United States
| | - Radomir Slominski
- Informatics Institute in the School of Medicine, The University of Alabama at Birmingham, Birmingham, AL, United States
- Graduate Biomedical Sciences Program, The University of Alabama at Birmingham, Birmingham, AL, United States
| | - Samuel Bharti
- Informatics Institute in the School of Medicine, The University of Alabama at Birmingham, Birmingham, AL, United States
| | - Jake Y. Chen
- Informatics Institute in the School of Medicine, The University of Alabama at Birmingham, Birmingham, AL, United States
| |
Collapse
|
5
|
Abstract
Perturbation in the normal function of the cell signaling pathways often leads to diseases. One of the factors that help understand the mechanism of diseases is the precise identification and investigation of perturbed signaling pathways. Pathway analysis methods have been developed as their purpose is to identify perturbed signaling pathways in given conditions. Among these methods, some consider the pathways topologies in their analysis, which are referred to as topology-based methods. Most of the topology-based methods used simple graph-based models to incorporate topology in their analysis, which have some limitations. We describe a new Pathway Analysis method using Petri net (PAPet) that uses the Petri net to model the signaling pathways and then propose an algorithm to measure the perturbation on a given pathway under a given condition. Modeling with Petri net has some advantages and could overcome the shortcomings of the simple graph-based models. We illustrate the capabilities of the proposed method using sensitivity, prioritization, mean reciprocal rank, and false-positive rate metrics on 36 real datasets from various diseases. The results of comparing PAPet with five pathway analysis methods FoPA, PADOG, GSEA, CePa and SPIA show that PAPet is the best one that provides a good compromise between all metrics. In addition, the results of applying methods to gene expression profiles in normal and Pancreatic Ductal Adenocarcinoma cancer (PDAC) samples show that the PAPet method achieves the best rank among others in finding the pathways that have been previously reported for PDAC. The PAPet method is available at https://github.com/fmansoori/PAPET.
Collapse
|
6
|
Yan S, Chi X, Chang X, Tian M. Analysing the meta-interaction between pathways by gene set topological impact analysis. BMC Genomics 2020; 21:748. [PMID: 33109101 PMCID: PMC7592530 DOI: 10.1186/s12864-020-07148-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2020] [Accepted: 10/13/2020] [Indexed: 11/25/2022] Open
Abstract
BACKGROUND Pathway analysis is widely applied in transcriptome analysis. Given certain transcriptomic changes, current pathway analysis tools tend to search for the most impacted pathways, which provides insight into underlying biological mechanisms. Further refining of the enriched pathways and extracting functional modules by "crosstalk" analysis have been proposed. However, the upstream/downstream relationships between the modules, which may provide extra biological insights such as the coordination of different functional modules and the signal transduction flow have been ignored. RESULTS To quantitatively analyse the upstream/downstream relationships between functional modules, we developed a novel GEne Set Topological Impact Analysis (GESTIA), which could be used to assemble the enriched pathways and functional modules into a super-module with a topological structure. We showed the advantages of this analysis in the exploration of extra biological insight in addition to the individual enriched pathways and functional modules. CONCLUSIONS GESTIA can be applied to a broad range of pathway/module analysis result. We hope that GESTIA may help researchers to get one additional step closer to understanding the molecular mechanism from the pathway/module analysis results.
Collapse
Affiliation(s)
- Shen Yan
- College of Agronomy, Sichuan Agricultural University, Chengdu, 611130, Sichuan, China
| | - Xu Chi
- Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, 101300, China
- China National Center for Bioinformation, Chaoyang, Beijing, 101300, China
| | - Xiao Chang
- Department of Dermatology and Venereal Disease, Xuanwu Hospital, Capital Medical University, Beijing, 100053, China
| | - Mengliang Tian
- College of Agronomy, Sichuan Agricultural University, Chengdu, 611130, Sichuan, China.
| |
Collapse
|
7
|
Balomenos P, Dragomir A, Tsakalidis AK, Bezerianos A. Identification of differentially expressed subpathways via a bilevel consensus scoring of network topology and gene expression. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2020; 2020:5316-5319. [PMID: 33019184 DOI: 10.1109/embc44109.2020.9176556] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Identifying differentially expressed subpathways connected to the emergence of a disease that can be considered as candidates for pharmacological intervention, with minimal off-target effects, is a daunting task. In this direction, we present a bilevel subpathway analysis method to identify differentially expressed subpathways that are connected with an experimental condition, while taking into account potential crosstalks between subpathways which arise due to their connectivity in a combined multi-pathway network. The efficacy of the method is demonstrated on a hematopoietic stem cell aging dataset, with findings corroborated using recent literature.
Collapse
|
8
|
Bao Z, Zhu Y, Ge Q, Gu W, Dong X, Bai Y. Signaling Pathway Analysis Combined With the Strength Variations of Interactions Between Genes Under Different Conditions. IEEE ACCESS 2020; 8:138036-138045. [DOI: 10.1109/access.2020.3010796] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/06/2025]
|
9
|
Mora A. Gene set analysis methods for the functional interpretation of non-mRNA data—Genomic range and ncRNA data. Brief Bioinform 2019; 21:1495-1508. [DOI: 10.1093/bib/bbz090] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2019] [Revised: 05/30/2019] [Accepted: 06/28/2019] [Indexed: 12/31/2022] Open
Abstract
Abstract
Gene set analysis (GSA) is one of the methods of choice for analyzing the results of current omics studies; however, it has been mainly developed to analyze mRNA (microarray, RNA-Seq) data. The following review includes an update regarding general methods and resources for GSA and then emphasizes GSA methods and tools for non-mRNA omics datasets, specifically genomic range data (ChIP-Seq, SNP and methylation) and ncRNA data (miRNAs, lncRNAs and others). In the end, the state of the GSA field for non-mRNA datasets is discussed, and some current challenges and trends are highlighted, especially the use of network approaches to face complexity issues.
Collapse
Affiliation(s)
- Antonio Mora
- Joint School of Life Sciences, Guangzhou Medical University and Guangzhou Institutes of Biomedicine and Health - Chinese Academy of Sciences
| |
Collapse
|
10
|
Thibodeau A, Shin DG. TriPOINT: a software tool to prioritize important genes in pathways and their non-coding regulators. Bioinformatics 2019; 35:2686-2689. [PMID: 30566622 PMCID: PMC6662310 DOI: 10.1093/bioinformatics/bty998] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2018] [Revised: 11/05/2018] [Accepted: 12/17/2018] [Indexed: 11/14/2022] Open
Abstract
Summary Current approaches for pathway analyses focus on representing gene expression levels on graph representations of pathways and conducting pathway enrichment among differentially expressed genes. However, gene expression levels by themselves do not reflect the overall picture as non-coding factors play an important role to regulate gene expression. To incorporate these non-coding factors into pathway analyses and to systematically prioritize genes in a pathway we introduce a new software: Triangulation of Perturbation Origins and Identification of Non-Coding Targets. Triangulation of Perturbation Origins and Identification of Non-Coding Targets is a pathway analysis tool, implemented in Java that identifies the significance of a gene under a condition (e.g. a disease phenotype) by studying graph representations of pathways, analyzing upstream and downstream gene interactions and integrating non-coding regions that may be regulating gene expression levels. Availability and implementation The TriPOINT open source software is freely available at https://github.uconn.edu/ajt06004/TriPOINT under the GPL v3.0 license. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Asa Thibodeau
- Department of Computer Science & Engineering, University of Connecticut, Storrs, CT, USA
| | - Dong-Guk Shin
- Department of Computer Science & Engineering, University of Connecticut, Storrs, CT, USA
| |
Collapse
|
11
|
Nguyen T, Mitrea C, Draghici S. Network-Based Approaches for Pathway Level Analysis. ACTA ACUST UNITED AC 2019; 61:8.25.1-8.25.24. [PMID: 30040185 DOI: 10.1002/cpbi.42] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Identification of impacted pathways is an important problem because it allows us to gain insights into the underlying biology beyond the detection of differentially expressed genes. In the past decade, a plethora of methods have been developed for this purpose. The last generation of pathway analysis methods are designed to take into account various aspects of pathway topology in order to increase the accuracy of the findings. Here, we cover 34 such topology-based pathway analysis methods published in the past 13 years. We compare these methods on categories related to implementation, availability, input format, graph models, and statistical approaches used to compute pathway level statistics and statistical significance. We also discuss a number of critical challenges that need to be addressed, arising both in methodology and pathway representation, including inconsistent terminology, data format, lack of meaningful benchmarks, and, more importantly, a systematic bias that is present in most existing methods. © 2018 by John Wiley & Sons, Inc.
Collapse
Affiliation(s)
- Tin Nguyen
- Department of Computer Science and Engineering, University of Nevada, Reno, Nevada
| | - Cristina Mitrea
- Department of Computer Science, Wayne State University, Detroit, Michigan
| | - Sorin Draghici
- Department of Computer Science, Wayne State University, Detroit, Michigan.,Department of Obstetrics and Gynecology, Wayne State University, Detroit, Michigan
| |
Collapse
|
12
|
Mansoori F, Rahgozar M, Kavousi K. FoPA: identifying perturbed signaling pathways in clinical conditions using formal methods. BMC Bioinformatics 2019; 20:92. [PMID: 30808299 PMCID: PMC6390332 DOI: 10.1186/s12859-019-2635-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2018] [Accepted: 01/17/2019] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Accurate identification of perturbed signaling pathways based on differentially expressed genes between sample groups is one of the key factors in the understanding of diseases and druggable targets. Most pathway analysis methods prioritize impacted signaling pathways by incorporating pathway topology using simple graph-based models. Despite their relative success, these models are limited in describing all types of dependencies and interactions that exist in biological pathways. RESULTS In this work, we propose a new approach based on the formal modeling of signaling pathways. Signaling pathways are formally modeled, and then model checking tools are applied to find the likelihood of perturbation for each pathway in a given condition. By adopting formal methods, various complex interactions among biological parts are modeled, which can contribute to reducing the false-positive rate of the proposed approach. We have developed a tool named Formal model checking based pathway analysis (FoPA) based on this approach. FoPA is compared with three well-known pathway analysis methods: PADOG, CePa, and SPIA on the benchmark of 36 GEO datasets from various diseases by applying the target pathway technique. This validation technique eliminates the need for possibly biased human assessments of results. In the cases that, there is no apriori knowledge of all relevant pathways, simulated false inputs (permuted class labels and decoy pathways) are chosen as a set of negative controls to test the false positive rate of the methods. Finally, to further evaluate the efficiency of FoPA, it is applied to a list of autism-related genes. CONCLUSIONS The results obtained by the target pathway technique demonstrate that FoPA is able to prioritize target pathways as well as PADOG but better than CePa and SPIA. Also, the false-positive rate of finding significant pathways using FoPA is lower than other compared methods. Also, FoPA can detect more consistent relevant pathways than other methods. The results of FoPA on autism-related genes highlight the role of "Renin-angiotensin system" pathway. This pathway has been supposed to have a pivotal role in some neurodegenerative diseases, while little attention has been paid to its impact on autism development so far.
Collapse
Affiliation(s)
- Fatemeh Mansoori
- Database Research Group, Control and Intelligent Processing Center of Excellence, School of Electrical and Computer Engineering, University of Tehran, Tehran, Iran
| | - Maseud Rahgozar
- Database Research Group, Control and Intelligent Processing Center of Excellence, School of Electrical and Computer Engineering, University of Tehran, Tehran, Iran.
| | - Kaveh Kavousi
- Complex Biological Systems and Bioinformatics Lab (CBB), Bioinformatics department, University of Tehran, Tehran, Iran.
| |
Collapse
|
13
|
Ren J, Wang B, Li J. Integrating proteomic and phosphoproteomic data for pathway analysis in breast cancer. BMC SYSTEMS BIOLOGY 2018; 12:130. [PMID: 30577793 PMCID: PMC6302460 DOI: 10.1186/s12918-018-0646-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
Background As protein is the basic unit of cell function and biological pathway, shotgun proteomics, the large-scale analysis of proteins, is contributing greatly to our understanding of disease mechanisms. Proteomics study could detect the changes of both protein expression and modification. With the releases of large-scale cancer proteome studies, how to integrate acquired proteomic and phosphoproteomic data in more comprehensive pathway analysis becomes implemented, but remains challenging. Integrative pathway analysis at proteome level provides a systematic insight into the signaling network adaptations in the development of cancer. Results Here we integrated proteomic and phosphoproteomic data to perform pathway prioritization in breast cancer. We manually collected and curated breast cancer well-known related pathways from the literature as target pathways (TPs) or positive control in method evaluation. Three different strategies including Hypergeometric test based over-representation analysis, Kolmogorov-Smirnov (K-S) test based gene set analysis and topology-based pathway analysis, were applied and evaluated in integrating protein expression and phosphorylation. In comparison, we also assessed the ranking performance of the strategy using information of protein expression or protein phosphorylation individually. Target pathways were ranked more top with the data integration than using the information from proteomic or phosphoproteomic data individually. In the comparisons of pathway analysis strategies, topology-based method outperformed than the others. The subtypes of breast cancer, which consist of Luminal A, Luminal B, Basal and HER2-enriched, vary greatly in prognosis and require distinct treatment. Therefore we applied topology-based pathway analysis with integrating protein expression and phosphorylation profiles on four subtypes of breast cancer. The results showed that TPs were enriched in all subtypes but their ranks were significantly different among the subtypes. For instance, p53 pathway ranked top in the Basal-like breast cancer subtype, but not in HER2-enriched type. The rank of Focal adhesion pathway was more top in HER2- subtypes than in HER2+ subtypes. The results were consistent with some previous researches. Conclusions The results demonstrate that the network topology-based method is more powerful by integrating proteomic and phosphoproteomic in pathway analysis of proteomics study. This integrative strategy can also be used to rank the specific pathways for the disease subtypes. Electronic supplementary material The online version of this article (10.1186/s12918-018-0646-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jie Ren
- Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Bo Wang
- Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Jing Li
- Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China.
| |
Collapse
|
14
|
Feng C, Song C, Ning Z, Ai B, Wang Q, Xu Y, Li M, Bai X, Zhao J, Liu Y, Li X, Zhang J, Li C. ce-Subpathway: Identification of ceRNA-mediated subpathways via joint power of ceRNAs and pathway topologies. J Cell Mol Med 2018; 23:967-984. [PMID: 30421585 PMCID: PMC6349186 DOI: 10.1111/jcmm.13997] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2018] [Revised: 08/28/2018] [Accepted: 10/17/2018] [Indexed: 12/19/2022] Open
Abstract
Competing endogenous RNAs (ceRNAs) represent a novel mechanism of gene regulation that may mediate key subpathway regions and contribute to the altered activities of pathways. However, the classical methods used to identify pathways fail to specifically consider ceRNAs within the pathways and key regions impacted by them. We proposed a powerful strategy named ce-Subpathway for the identification of ceRNA-mediated functional subpathways. It provided an effective level of pathway analysis via integrating ceRNAs, differentially expressed (DE) genes and their key regions within the given pathways. We respectively analysed one pulmonary arterial hypertension (PAH) and one myocardial infarction (MI) data sets and demonstrated that ce-Subpathway could identify many subpathways whose corresponding entire pathways were ignored by those non-ceRNA-mediated pathway identification methods. And these pathways have been well reported to be associated with PAH/MI-related cardiovascular diseases. Further evidence showed reliability of ceRNA interactions and robustness/reproducibility of the ce-Subpathway strategy by several data sets of different cancers, including breast cancer, oesophageal cancer and colon cancer. Survival analysis was finally applied to illustrate the clinical application value of the ceRNA-mediated functional subpathways using another data sets of pancreatic cancer. Comprehensive analyses have shown the power of a joint ceRNAs/DE genes and subpathway strategy based on their topologies.
Collapse
Affiliation(s)
- Chenchen Feng
- School of Medical Informatics, Daqing Campus, Harbin Medical University, Daqing, China
| | - Chao Song
- Department of Pharmacology, Daqing Campus, Harbin Medical University, Daqing, China
| | - Ziyu Ning
- School of Medical Informatics, Daqing Campus, Harbin Medical University, Daqing, China
| | - Bo Ai
- School of Medical Informatics, Daqing Campus, Harbin Medical University, Daqing, China
| | - Qiuyu Wang
- School of Nursing, Daqing Campus, Harbin Medical University, Daqing, China
| | - Yong Xu
- The fifth Affiliated Hospital of Harbin Medical University, Daqing, China
| | - Meng Li
- School of Medical Informatics, Daqing Campus, Harbin Medical University, Daqing, China
| | - Xuefeng Bai
- School of Medical Informatics, Daqing Campus, Harbin Medical University, Daqing, China
| | - Jianmei Zhao
- School of Medical Informatics, Daqing Campus, Harbin Medical University, Daqing, China
| | - Yuejuan Liu
- School of Medical Informatics, Daqing Campus, Harbin Medical University, Daqing, China
| | - Xuecang Li
- School of Medical Informatics, Daqing Campus, Harbin Medical University, Daqing, China
| | - Jian Zhang
- School of Medical Informatics, Daqing Campus, Harbin Medical University, Daqing, China
| | - Chunquan Li
- School of Medical Informatics, Daqing Campus, Harbin Medical University, Daqing, China
| |
Collapse
|
15
|
Hanoudi S, Donato M, Draghici S. Identifying biologically relevant putative mechanisms in a given phenotype comparison. PLoS One 2017; 12:e0176950. [PMID: 28486531 PMCID: PMC5423614 DOI: 10.1371/journal.pone.0176950] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2016] [Accepted: 04/19/2017] [Indexed: 11/19/2022] Open
Abstract
A major challenge in life science research is understanding the mechanism involved in a given phenotype. The ability to identify the correct mechanisms is needed in order to understand fundamental and very important phenomena such as mechanisms of disease, immune systems responses to various challenges, and mechanisms of drug action. The current data analysis methods focus on the identification of the differentially expressed (DE) genes using their fold change and/or p-values. Major shortcomings of this approach are that: i) it does not consider the interactions between genes; ii) its results are sensitive to the selection of the threshold(s) used, and iii) the set of genes produced by this approach is not always conducive to formulating mechanistic hypotheses. Here we present a method that can construct networks of genes that can be considered putative mechanisms. The putative mechanisms constructed by this approach are not limited to the set of DE genes, but also considers all known and relevant gene-gene interactions. We analyzed three real datasets for which both the causes of the phenotype, as well as the true mechanisms were known. We show that the method identified the correct mechanisms when applied on microarray datasets from mouse. We compared the results of our method with the results of the classical approach, showing that our method produces more meaningful biological insights.
Collapse
Affiliation(s)
- Samer Hanoudi
- Department of Computer Science, Wayne State University, Detroit, MI, United States of America
| | - Michele Donato
- Department of Computer Science, Wayne State University, Detroit, MI, United States of America
| | - Sorin Draghici
- Department of Computer Science, Wayne State University, Detroit, MI, United States of America
- Department of Obstetrics and Gynecology, Detroit, MI, United States of America
| |
Collapse
|
16
|
Kao PYP, Leung KH, Chan LWC, Yip SP, Yap MKH. Pathway analysis of complex diseases for GWAS, extending to consider rare variants, multi-omics and interactions. Biochim Biophys Acta Gen Subj 2016; 1861:335-353. [PMID: 27888147 DOI: 10.1016/j.bbagen.2016.11.030] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2016] [Revised: 10/17/2016] [Accepted: 11/19/2016] [Indexed: 12/20/2022]
Abstract
BACKGROUND Genome-wide association studies (GWAS) is a major method for studying the genetics of complex diseases. Finding all sequence variants to explain fully the aetiology of a disease is difficult because of their small effect sizes. To better explain disease mechanisms, pathway analysis is used to consolidate the effects of multiple variants, and hence increase the power of the study. While pathway analysis has previously been performed within GWAS only, it can now be extended to examining rare variants, other "-omics" and interaction data. SCOPE OF REVIEW 1. Factors to consider in the choice of software for GWAS pathway analysis. 2. Examples of how pathway analysis is used to analyse rare variants, other "-omics" and interaction data. MAJOR CONCLUSIONS To choose appropriate software tools, factors for consideration include covariate compatibility, null hypothesis, one- or two-step analysis required, curation method of gene sets, size of pathways, and size of flanking regions to define gene boundaries. For rare variants, analysis performance depends on consistency between assumed and actual effect distribution of variants. Integration of other "-omics" data and interaction can better explain gene functions. GENERAL SIGNIFICANCE Pathway analysis methods will be more readily used for integration of multiple sources of data, and enable more accurate prediction of phenotypes.
Collapse
Affiliation(s)
- Patrick Y P Kao
- Centre for Myopia Research, School of Optometry, The Hong Kong Polytechnic University, Hong Kong SAR, China
| | - Kim Hung Leung
- Department of Health Technology and Informatics, The Hong Kong Polytechnic University, Hong Kong SAR, China
| | - Lawrence W C Chan
- Department of Health Technology and Informatics, The Hong Kong Polytechnic University, Hong Kong SAR, China
| | - Shea Ping Yip
- Department of Health Technology and Informatics, The Hong Kong Polytechnic University, Hong Kong SAR, China.
| | - Maurice K H Yap
- Centre for Myopia Research, School of Optometry, The Hong Kong Polytechnic University, Hong Kong SAR, China
| |
Collapse
|