Reference Citation Analysis

For: Tarca AL, Draghici S, Bhatti G, Romero R. Down-weighting overlapping genes improves gene set analysis. BMC Bioinformatics 2012;13:136. [PMID: 22713124 PMCID: PMC3443069 DOI: 10.1186/1471-2105-13-136] [Citation(s) in RCA: 96] [Impact Index Per Article: 8.0] [Received: 02/14/2012] [Accepted: 05/18/2012] [Indexed: 11/10/2022] Open

Cited by Other Article(s)

Nguyen H, Pham VD, Nguyen H, Tran B, Petereit J, Nguyen T. CCPA: cloud-based, self-learning modules for consensus pathway analysis using GO, KEGG and Reactome. Brief Bioinform 2024;25:bbae222. [PMID: 39041916 PMCID: PMC11264295 DOI: 10.1093/bib/bbae222] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/22/2023] [Revised: 03/15/2024] [Accepted: 04/25/2024] [Indexed: 07/24/2024] Open

Abstract

This manuscript describes the development of a resource module that is part of a learning platform named 'NIGMS Sandbox for Cloud-based Learning' (https://github.com/NIGMS/NIGMS-Sandbox). The module delivers learning materials on Cloud-based Consensus Pathway Analysis in an interactive format that uses appropriate cloud resources for data access and analyses. Pathway analysis is important because it allows us to gain insights into biological mechanisms underlying conditions. However, the availability of many pathway analysis methods, the requirement of coding skills, and the focus of current tools on only a few species all make it very difficult for biomedical researchers to self-learn and perform pathway analysis efficiently. Furthermore, there is a lack of tools that allow researchers to compare analysis results obtained from different experiments and different analysis methods to find consensus results. To address these challenges, we have designed a cloud-based, self-learning module that provides consensus results among established, state-of-the-art pathway analysis techniques to provide students and researchers with necessary training and example materials. The training module consists of five Jupyter Notebooks that provide complete tutorials for the following tasks: (i) process expression data, (ii) perform differential analysis, visualize and compare the results obtained from four differential analysis methods (limma, t-test, edgeR, DESeq2), (iii) process three pathway databases (GO, KEGG and Reactome), (iv) perform pathway analysis using eight methods (ORA, CAMERA, KS test, Wilcoxon test, FGSEA, GSA, SAFE and PADOG) and (v) combine results of multiple analyses. We also provide examples, source code, explanations and instructional videos for trainees to complete each Jupyter Notebook. The module supports the analysis for many model (e.g. human, mouse, fruit fly, zebrafish) and non-model species.
The module is publicly available at https://github.com/NIGMS/Consensus-Pathway-Analysis-in-the-Cloud. The overall genesis of the Sandbox is described in the editorial NIGMS Sandbox [1] at the beginning of this Supplement.
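
As a rough illustration of the simplest of the eight methods named in the abstract, over-representation analysis (ORA) is commonly computed as a one-sided hypergeometric test on the overlap between a gene set and the list of differentially expressed genes. The sketch below is not taken from the CCPA notebooks; the function name `ora_p_value` and its parameter names are assumptions made for this example.

```python
from math import comb


def ora_p_value(n_universe: int, n_set: int, n_de: int, n_overlap: int) -> float:
    """Upper-tail hypergeometric probability P(X >= n_overlap).

    n_universe: number of genes measured (background)
    n_set:      number of background genes in the gene set
    n_de:       number of differentially expressed (DE) genes
    n_overlap:  number of DE genes that fall in the gene set
    All names are hypothetical; this is a generic ORA sketch.
    """
    max_overlap = min(n_set, n_de)
    total = comb(n_universe, n_de)  # ways to draw the DE list from the background
    tail = sum(
        comb(n_set, k) * comb(n_universe - n_set, n_de - k)
        for k in range(n_overlap, max_overlap + 1)
    )
    return tail / total


# Example: 4 measured genes, 2 in the set, 2 DE genes, both DE genes in the set.
p = ora_p_value(4, 2, 2, 2)
```

Exact integer arithmetic is used here to keep the example dependency-free; a statistics library would normally be preferred for large backgrounds.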

Li X, Zan X, Liu T, Dong X, Zhang H, Li Q, Bao Z, Lin J. Integrated edge information and pathway topology for drug-disease associations. iScience 2024;27:110025. [PMID: 38974972 PMCID: PMC11226970 DOI: 10.1016/j.isci.2024.110025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/31/2024] [Revised: 04/06/2024] [Accepted: 05/15/2024] [Indexed: 07/09/2024] Open

Chambers BA, Basili D, Word L, Baker N, Middleton A, Judson RS, Shah I. Searching for LINCS to Stress: Using Text Mining to Automate Reference Chemical Curation. Chem Res Toxicol 2024;37:878-893. [PMID: 38736322 DOI: 10.1021/acs.chemrestox.3c00335] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/14/2024]

Abstract

Adaptive stress response pathways (SRPs) restore cellular homeostasis following perturbation but may activate terminal outcomes like apoptosis, autophagy, or cellular senescence if disruption exceeds critical thresholds. Because SRPs hold the key to vital cellular tipping points, they are targeted for therapeutic interventions and assessed as biomarkers of toxicity. Hence, we are developing a public database of chemicals that perturb SRPs to enable new data-driven tools to improve public health. Here, we report on the automated text-mining pipeline we used to build and curate the first version of this database. We started with 100 reference SRP chemicals gathered from published biomarker studies to bootstrap the database. Second, we used information retrieval to find co-occurrences of reference chemicals with SRP terms in PubMed abstracts and determined pairwise mutual information thresholds to filter biologically relevant relationships. Third, we applied these thresholds to find 1206 putative SRP perturbagens within thousands of substances in the Library of Integrated Network-Based Cellular Signatures (LINCS). To assign SRP activity to LINCS chemicals, domain experts had to manually review at least three publications for each of 1206 chemicals out of 181,805 total abstracts. To accomplish this efficiently, we implemented a machine learning approach to predict SRP classifications from texts to prioritize abstracts. In 5-fold cross-validation testing with a corpus derived from the 100 reference chemicals, artificial neural networks performed the best (F1-macro = 0.678) and prioritized 2479/181,805 abstracts for expert review, which resulted in 457 chemicals annotated with SRP activities. An independent analysis of enriched mechanisms of action and chemical use class supported the text-mined chemical associations (p < 0.05): heat shock inducers were linked with HSP90 and DNA damage inducers to topoisomerase inhibition. 
This database will enable novel applications of LINCS data to evaluate SRP activities and to further develop tools for biomedical information extraction from the literature.
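
The abstract does not give the exact mutual information formula used for the co-occurrence filter. As a minimal sketch, assuming the standard pointwise mutual information (PMI) computed from abstract counts, the score for a chemical and an SRP term could be calculated as follows; the function name `pmi` and its parameters are hypothetical.

```python
from math import log2


def pmi(n_both: int, n_chemical: int, n_term: int, n_total: int) -> float:
    """Pointwise mutual information (base 2) from document counts.

    n_both:     abstracts mentioning both the chemical and the SRP term
    n_chemical: abstracts mentioning the chemical
    n_term:     abstracts mentioning the SRP term
    n_total:    abstracts in the corpus
    This is the textbook PMI, assumed here for illustration only.
    """
    if n_both == 0:
        return float("-inf")  # the pair never co-occurs
    # log2( p(x, y) / (p(x) * p(y)) ), written with counts
    return log2((n_both * n_total) / (n_chemical * n_term))


# A pair that co-occurs exactly as often as chance predicts scores 0.
score = pmi(10, 100, 100, 1000)
```

A threshold on this score would then keep only pairs that co-occur more often than expected by chance.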

Candia J, Ferrucci L. Assessment of Gene Set Enrichment Analysis using curated RNA-seq-based benchmarks. PLoS One 2024;19:e0302696. [PMID: 38753612 PMCID: PMC11098418 DOI: 10.1371/journal.pone.0302696] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/31/2023] [Accepted: 04/09/2024] [Indexed: 05/18/2024] Open

Abstract

Pathway enrichment analysis is a ubiquitous computational biology method to interpret a list of genes (typically derived from the association of large-scale omics data with phenotypes of interest) in terms of higher-level, predefined gene sets that share biological function, chromosomal location, or other common features. Among many tools developed so far, Gene Set Enrichment Analysis (GSEA) stands out as one of the pioneering and most widely used methods. Although originally developed for microarray data, GSEA is nowadays extensively utilized for RNA-seq data analysis. Here, we quantitatively assessed the performance of a variety of GSEA modalities and provide guidance in the practical use of GSEA in RNA-seq experiments. We leveraged harmonized RNA-seq datasets available from The Cancer Genome Atlas (TCGA) in combination with large, curated pathway collections from the Molecular Signatures Database to obtain cancer-type-specific target pathway lists across multiple cancer types. We carried out a detailed analysis of GSEA performance using both gene-set and phenotype permutations combined with four different choices for the Kolmogorov-Smirnov enrichment statistic. Based on our benchmarks, we conclude that the classic/unweighted gene-set permutation approach offered comparable or better sensitivity-vs-specificity tradeoffs across cancer types compared with other, more complex and computationally intensive permutation methods. Finally, we analyzed other large cohorts for thyroid cancer and hepatocellular carcinoma. We utilized a new consensus metric, the Enrichment Evidence Score (EES), which showed a remarkable agreement between pathways identified in TCGA and those from other sources, despite differences in cancer etiology. This finding suggests an EES-based strategy to identify a core set of pathways that may be complemented by an expanded set of pathways for downstream exploratory analysis. 
This work fills the existing gap in current guidelines and benchmarks for the use of GSEA with RNA-seq data and provides a framework to enable detailed benchmarking of other RNA-seq-based pathway analysis tools.
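
For readers unfamiliar with the classic (unweighted) Kolmogorov-Smirnov-like statistic mentioned in the abstract, the sketch below walks down a ranked gene list, stepping up by 1/N_hit at genes in the set and down by 1/N_miss otherwise, and reports the largest deviation from zero. It is a simplified illustration rather than the benchmark code from the paper; the function name `enrichment_score` is an assumption, and permutation-based significance is omitted.

```python
from typing import Sequence, Set


def enrichment_score(ranked_genes: Sequence[str], gene_set: Set[str]) -> float:
    """Classic (unweighted) running-sum enrichment score.

    ranked_genes: genes sorted from most to least associated with the phenotype
    gene_set:     members of the pathway being tested
    Returns the signed maximum deviation of the running sum from zero.
    """
    hits = [gene in gene_set for gene in ranked_genes]
    n_hit = sum(hits)
    n_miss = len(ranked_genes) - n_hit
    if n_hit == 0 or n_miss == 0:
        raise ValueError("gene set must cover some, but not all, ranked genes")

    running = 0.0
    best = 0.0
    for is_hit in hits:
        # Step up on a hit, down on a miss; both walks end at zero overall.
        running += 1.0 / n_hit if is_hit else -1.0 / n_miss
        if abs(running) > abs(best):
            best = running
    return best


# A set concentrated at the top of the ranking gives a score of 1.0.
top_score = enrichment_score(["a", "b", "c", "d"], {"a", "b"})
```

In practice the score is compared against a null distribution built from gene-set or phenotype permutations, which is the choice the benchmark evaluates.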

Geistlinger L, Mirzayi C, Zohra F, Azhar R, Elsafoury S, Grieve C, Wokaty J, Gamboa-Tuz SD, Sengupta P, Hecht I, Ravikrishnan A, Gonçalves RS, Franzosa E, Raman K, Carey V, Dowd JB, Jones HE, Davis S, Segata N, Huttenhower C, Waldron L. BugSigDB captures patterns of differential abundance across a broad range of host-associated microbial signatures. Nat Biotechnol 2024;42:790-802. [PMID: 37697152 PMCID: PMC11098749 DOI: 10.1038/s41587-023-01872-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/24/2022] [Accepted: 06/20/2023] [Indexed: 09/13/2023]

Affiliation(s)

Ludwig Geistlinger: Center for Computational Biomedicine, Harvard Medical School, Boston, MA, USA
Chloe Mirzayi: Institute for Implementation Science in Population Health, City University of New York School of Public Health, New York, NY, USA; Department of Epidemiology and Biostatistics, City University of New York School of Public Health, New York, NY, USA
Fatima Zohra: Institute for Implementation Science in Population Health, City University of New York School of Public Health, New York, NY, USA; Department of Epidemiology and Biostatistics, City University of New York School of Public Health, New York, NY, USA
Rimsha Azhar: Institute for Implementation Science in Population Health, City University of New York School of Public Health, New York, NY, USA; Department of Epidemiology and Biostatistics, City University of New York School of Public Health, New York, NY, USA
Shaimaa Elsafoury: Institute for Implementation Science in Population Health, City University of New York School of Public Health, New York, NY, USA; Department of Epidemiology and Biostatistics, City University of New York School of Public Health, New York, NY, USA
Clare Grieve: Institute for Implementation Science in Population Health, City University of New York School of Public Health, New York, NY, USA; Department of Epidemiology and Biostatistics, City University of New York School of Public Health, New York, NY, USA
Jennifer Wokaty: Institute for Implementation Science in Population Health, City University of New York School of Public Health, New York, NY, USA; Department of Epidemiology and Biostatistics, City University of New York School of Public Health, New York, NY, USA
Samuel David Gamboa-Tuz: Institute for Implementation Science in Population Health, City University of New York School of Public Health, New York, NY, USA; Department of Epidemiology and Biostatistics, City University of New York School of Public Health, New York, NY, USA
Pratyay Sengupta: Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology (IIT) Madras, Chennai, India; Robert Bosch Centre for Data Science and Artificial Intelligence, Indian Institute of Technology (IIT) Madras, Chennai, India; Centre for Integrative Biology and Systems mEdicine (IBSE), Indian Institute of Technology (IIT) Madras, Chennai, India
Issac Hecht: WikiWorks, Boca Raton, FL, USA
Aarthi Ravikrishnan: Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore
Rafael S Gonçalves: Center for Computational Biomedicine, Harvard Medical School, Boston, MA, USA
Eric Franzosa: Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA; Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA, USA; Harvard Chan Microbiome in Public Health Center, Harvard T. H. Chan School of Public Health, Boston, MA, USA
Karthik Raman: Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology (IIT) Madras, Chennai, India; Robert Bosch Centre for Data Science and Artificial Intelligence, Indian Institute of Technology (IIT) Madras, Chennai, India; Centre for Integrative Biology and Systems mEdicine (IBSE), Indian Institute of Technology (IIT) Madras, Chennai, India
Vincent Carey: Channing Division of Network Medicine, Mass General Brigham, Harvard Medical School, Boston, MA, USA
Jennifer B Dowd: Leverhulme Centre for Demographic Science, University of Oxford, Oxford, UK
Heidi E Jones: Institute for Implementation Science in Population Health, City University of New York School of Public Health, New York, NY, USA; Department of Epidemiology and Biostatistics, City University of New York School of Public Health, New York, NY, USA
Sean Davis: Departments of Biomedical Informatics and Medicine, University of Colorado Anschutz School of Medicine, Denver, CO, USA
Nicola Segata: Department CIBIO, University of Trento, Trento, Italy; Istituto Europeo di Oncologia (IEO) IRCSS, Milan, Italy
Curtis Huttenhower: Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA; Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA, USA; Harvard Chan Microbiome in Public Health Center, Harvard T. H. Chan School of Public Health, Boston, MA, USA
Levi Waldron: Institute for Implementation Science in Population Health, City University of New York School of Public Health, New York, NY, USA; Department of Epidemiology and Biostatistics, City University of New York School of Public Health, New York, NY, USA; Department CIBIO, University of Trento, Trento, Italy

Taiwo M, Huang E, Pathak V, Bellar A, Welch N, Dasarathy J, Streem D, McClain CJ, Mitchell MC, Barton BA, Szabo G, Dasarathy S, Schaefer EA, Luther J, Day LZ, Ouyang X, Suyavaran A, Mehal WZ, Jacobs JM, Goodman RP, Rotroff DM, Nagy LE. Proteomics identifies complement protein signatures in patients with alcohol-associated hepatitis. JCI Insight 2024;9:e174127. [PMID: 38573776 PMCID: PMC11141929 DOI: 10.1172/jci.insight.174127] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/24/2023] [Accepted: 03/27/2024] [Indexed: 04/06/2024] Open

Affiliation(s)

Moyinoluwa Taiwo: Department of Inflammation and Immunity
Emily Huang: Department of Inflammation and Immunity
Vai Pathak: Department of Quantitative Health Sciences
Annette Bellar: Department of Inflammation and Immunity
Nicole Welch: Department of Inflammation and Immunity; Department of Gastroenterology and Hepatology, Cleveland Clinic, Cleveland, Ohio, USA
Jaividhya Dasarathy: Department of Family Medicine, Metro Health Medical Center, Cleveland, Ohio, USA
David Streem: Department of Psychiatry and Psychology, Cleveland Clinic Lutheran Hospital, Cleveland, Ohio, USA
Craig J. McClain: Department of Medicine, University of Louisville, Louisville, Kentucky, USA
Mack C. Mitchell: Department of Internal Medicine, University of Texas Southwestern Medical Center, Dallas, Texas, USA
Bruce A. Barton: Department of Population and Quantitative Health Sciences, University of Massachusetts Medical School, Worcester, Massachusetts, USA
Gyongyi Szabo: Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts, USA
Srinivasan Dasarathy: Department of Inflammation and Immunity; Department of Gastroenterology and Hepatology, Cleveland Clinic, Cleveland, Ohio, USA; Department of Molecular Medicine, Case Western Reserve University, Cleveland, Ohio, USA
Alcohol Hepatitis Network (AlcHepNet) Consortium: See Supplemental Acknowledgments for information on the AlcHepNet Consortium
Esperance A. Schaefer: Alcohol Liver Center, Division of Gastroenterology, Massachusetts General Hospital, Boston, Massachusetts, USA
Jay Luther: Alcohol Liver Center, Division of Gastroenterology, Massachusetts General Hospital, Boston, Massachusetts, USA
Le Z. Day: Biological Sciences Division and Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington, USA
Xinshou Ouyang: Department of Internal Medicine, Yale School of Medicine, New Haven, Connecticut, USA
Arumugam Suyavaran: Department of Internal Medicine, Yale School of Medicine, New Haven, Connecticut, USA
Wajahat Z. Mehal: Department of Internal Medicine, Yale School of Medicine, New Haven, Connecticut, USA
Jon M. Jacobs: Biological Sciences Division and Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington, USA
Russell P. Goodman: Alcohol Liver Center, Division of Gastroenterology, Massachusetts General Hospital, Boston, Massachusetts, USA; Endocrine Unit, Division of Gastroenterology, Massachusetts General Hospital, Boston, Massachusetts, USA
Daniel M. Rotroff: Department of Quantitative Health Sciences; Endocrine and Metabolism Institute and Center for Quantitative Metabolic Research, Cleveland Clinic, Cleveland, Ohio, USA
Laura E. Nagy: Department of Inflammation and Immunity; Department of Gastroenterology and Hepatology, Cleveland Clinic, Cleveland, Ohio, USA; See Supplemental Acknowledgments for information on the AlcHepNet Consortium

Baker BH, Freije S, MacDonald JW, Bammler TK, Benson C, Carroll KN, Enquobahrie DA, Karr CJ, LeWinn KZ, Zhao Q, Bush NR, Sathyanarayana S, Paquette AG. Placental transcriptomic signatures of prenatal and preconceptional maternal stress. Mol Psychiatry 2024;29:1179-1191. [PMID: 38212375 PMCID: PMC11176062 DOI: 10.1038/s41380-023-02403-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/21/2023] [Revised: 12/20/2023] [Accepted: 12/22/2023] [Indexed: 01/13/2024]

Abstract

Prenatal exposure to maternal psychological stress is associated with increased risk for adverse birth and child health outcomes. Accumulating evidence suggests that preconceptional maternal stress may also be transmitted intergenerationally to negatively impact offspring. However, understanding of mechanisms linking these exposures to offspring outcomes, particularly those related to placenta, is limited. Using RNA sequencing, we identified placental transcriptomic signatures associated with maternal prenatal stressful life events (SLEs) and childhood traumatic events (CTEs) in 1,029 mother-child pairs in two birth cohorts from Washington state and Memphis, Tennessee. We evaluated individual gene-SLE/CTE associations and performed an ensemble of gene set enrichment analyses combining across 11 popular enrichment methods. Higher number of prenatal SLEs was significantly (FDR < 0.05) associated with increased expression of ADGRG6, a placental tissue-specific gene critical in placental remodeling, and decreased expression of RAB11FIP3, an endocytosis and endocytic recycling gene, and SMYD5, a histone methyltransferase. Prenatal SLEs and maternal CTEs were associated with gene sets related to several biological pathways, including upregulation of protein processing in the endoplasmic reticulum, protein secretion, and ubiquitin-mediated proteolysis, and downregulation of ribosome, epithelial mesenchymal transition, DNA repair, MYC targets, and amino acid-related pathways. The directional associations in these pathways corroborate prior non-transcriptomic mechanistic studies of psychological stress and mental health disorders, and have previously been implicated in pregnancy complications and adverse birth outcomes. Accordingly, our findings suggest that maternal exposure to psychosocial stressors during pregnancy as well as the mother's childhood may disrupt placental function, which may ultimately contribute to adverse pregnancy, birth, and child health outcomes.

Peng C, Chen Q, Tan S, Shen X, Jiang C. Generalized reporter score-based enrichment analysis for omics data. Brief Bioinform 2024;25:bbae116. [PMID: 38546324 PMCID: PMC10976918 DOI: 10.1093/bib/bbae116] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/05/2023] [Revised: 01/25/2024] [Accepted: 03/01/2024] [Indexed: 06/15/2024] Open

Chang LY, Lee MZ, Wu Y, Lee WK, Ma CL, Chang JM, Chen CW, Huang TC, Lee CH, Lee JC, Tseng YY, Lin CY. Gene set correlation enrichment analysis for interpreting and annotating gene expression profiles. Nucleic Acids Res 2024;52:e17. [PMID: 38096046 PMCID: PMC10853793 DOI: 10.1093/nar/gkad1187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/17/2022] [Revised: 11/17/2023] [Accepted: 11/29/2023] [Indexed: 02/10/2024] Open

Affiliation(s)

Lan-Yun Chang: Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
Meng-Zhan Lee: Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
Yujia Wu: Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
Wen-Kai Lee: Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
Chia-Liang Ma: Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
Jun-Mao Chang: Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
Ciao-Wen Chen: Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
Tzu-Chun Huang: Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
Chia-Hwa Lee: School of Medical Laboratory Science and Biotechnology, College of Medical Science and Technology, Taipei Medical University, New Taipei City 235, Taiwan; Center for Intelligent Drug Systems and Smart Bio-devices (IDSB), National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan; TMU Research Center of Cancer Translational Medicine, Taipei Medical University, Taipei 110, Taiwan; Ph.D. Program in Medical Biotechnology, College of Medical Science and Technology, Taipei Medical University, New Taipei City 235, Taiwan
Jih-Chin Lee: Department of Otolaryngology-Head and Neck Surgery, Tri-Service General Hospital, National Defense Medical Center, Taipei 110, Taiwan
Yu-Yao Tseng: Department of Food Science, Nutrition, and Nutraceutical Biotechnology, Shih Chien University, Taipei 104, Taiwan
Chun-Yu Lin: Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan; Center for Intelligent Drug Systems and Smart Bio-devices (IDSB), National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan; Department of Biological Science and Technology, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan; Cancer and Immunology Research Center, National Yang Ming Chiao Tung University, Taipei 112, Taiwan; Institute of Data Science and Engineering, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan; School of Dentistry, Kaohsiung Medical University, Kaohsiung 807, Taiwan

Liu Y, Lian G, Chen T. A novel multi-omics data analysis of dose-dependent and temporal changes in regulatory pathways due to chemical perturbation: a case study on caffeine. Toxicol Mech Methods 2024;34:164-175. [PMID: 37794615 DOI: 10.1080/15376516.2023.2265462] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/19/2023] [Accepted: 09/26/2023] [Indexed: 10/06/2023]

Buzzao D, Castresana-Aguirre M, Guala D, Sonnhammer ELL. Benchmarking enrichment analysis methods with the disease pathway network. Brief Bioinform 2024;25:bbae069. [PMID: 38436561 PMCID: PMC10939300 DOI: 10.1093/bib/bbae069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/29/2023] [Revised: 01/10/2024] [Accepted: 02/03/2024] [Indexed: 03/05/2024] Open

Somers J, Fenner M, Kong G, Thirumalaisamy D, Yashar WM, Thapa K, Kinali M, Nikolova O, Babur Ö, Demir E. A framework for considering prior information in network-based approaches to omics data analysis. Proteomics 2023;23:e2200402. [PMID: 37986684 DOI: 10.1002/pmic.202200402] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/19/2023] [Revised: 09/20/2023] [Accepted: 09/21/2023] [Indexed: 11/22/2023]

Holubekova V, Loderer D, Grendar M, Mikolajcik P, Kolkova Z, Turyova E, Kudelova E, Kalman M, Marcinek J, Miklusica J, Laca L, Lasabova Z. Differential gene expression of immunity and inflammation genes in colorectal cancer using targeted RNA sequencing. Front Oncol 2023;13:1206482. [PMID: 37869102 PMCID: PMC10586664 DOI: 10.3389/fonc.2023.1206482] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/15/2023] [Accepted: 08/24/2023] [Indexed: 10/24/2023] Open

Abstract

Introduction

Colorectal cancer (CRC) is a heterogeneous disease caused by molecular changes, such as driver mutations and gene methylation, and influenced by a tumor microenvironment (TME) pervaded with immune cells that have both pro- and anti-tumor effects. Studying the interactions between the immune system (IS) and the TME is important for developing effective immunotherapeutic strategies for CRC. In our study, we focused on the analysis of expression profiles of inflammatory and immune-relevant genes to identify aberrant signaling pathways involved in carcinogenesis, the metastatic potential of tumors, and the association with Kirsten rat sarcoma virus (KRAS) gene mutation.

Methods

A total of 91 patients were enrolled in the study. Using NGS, differential gene expression analysis of 11 tumor samples and 11 matching non-tumor controls was carried out by applying a targeted RNA panel for inflammation and immunity genes containing 475 target genes. The obtained data were evaluated by the CLC Genomics Workbench and R library. The significantly differentially expressed genes (DEGs) were analyzed in Reactome GSA software, and some selected DEGs were used for real-time PCR validation.

Results

After prioritization, the most significant differences in gene expression were shown by the genes TNFRSF4, IRF7, IL6R, NR3C1, EIF2AK2, MIF, CCL5, TNFSF10, CCL20, CXCL11, RIPK2, and BLNK. Validation analyses on 91 samples showed a correlation between RNA-seq data and qPCR for TNFSF10, RIPK2, and BLNK gene expression. The top differently regulated signaling pathways between the studied groups (cancer vs. control, metastatic vs. primary CRC, and KRAS-positive vs. KRAS-negative CRC) belong to immune system, signal transduction, disease, gene expression, DNA repair, and programmed cell death.

Conclusion

The analyzed data suggest changes at multiple levels of CRC carcinogenesis, including surface receptors of epithelial or immune cells, their signal transduction pathways, programmed cell death modifications, alterations in DNA repair machinery, and cell cycle control leading to uncontrolled proliferation. This study indicates only basic molecular pathways that enabled the formation of metastatic cancer stem cells and may contribute to clarifying the function of the IS in the TME of CRC. A precise identification of signaling pathways responsible for CRC may help in the selection of personalized pharmacological treatment.

Affiliation(s)

Veronika Holubekova: Laboratory of Genomics and Prenatal Diagnostics, Biomedical Center in Martin, Jessenius Faculty of Medicine, Comenius University in Bratislava, Martin, Slovakia
Dusan Loderer: Laboratory of Genomics and Prenatal Diagnostics, Biomedical Center in Martin, Jessenius Faculty of Medicine, Comenius University in Bratislava, Martin, Slovakia
Marian Grendar: Laboratory of Bioinformatics and Biostatistics, Biomedical Center in Martin, Jessenius Faculty of Medicine, Comenius University in Bratislava, Martin, Slovakia
Peter Mikolajcik: Clinic of Surgery and Transplant Center, Jessenius Faculty of Medicine in Martin, Comenius University in Bratislava, Martin University Hospital, Martin, Slovakia
Zuzana Kolkova: Laboratory of Genomics and Prenatal Diagnostics, Biomedical Center in Martin, Jessenius Faculty of Medicine, Comenius University in Bratislava, Martin, Slovakia
Eva Turyova: Department of Molecular Biology and Genomics, Jessenius Faculty of Medicine in Martin, Comenius University in Bratislava, Martin, Slovakia
Eva Kudelova: Clinic of Surgery and Transplant Center, Jessenius Faculty of Medicine in Martin, Comenius University in Bratislava, Martin University Hospital, Martin, Slovakia
Michal Kalman: Department of Pathological Anatomy, Jessenius Faculty of Medicine, Comenius University in Bratislava, Martin University Hospital, Martin, Slovakia
Juraj Marcinek: Department of Pathological Anatomy, Jessenius Faculty of Medicine, Comenius University in Bratislava, Martin University Hospital, Martin, Slovakia
Juraj Miklusica: Clinic of Surgery and Transplant Center, Jessenius Faculty of Medicine in Martin, Comenius University in Bratislava, Martin University Hospital, Martin, Slovakia
Ludovit Laca: Clinic of Surgery and Transplant Center, Jessenius Faculty of Medicine in Martin, Comenius University in Bratislava, Martin University Hospital, Martin, Slovakia
Zora Lasabova: Department of Molecular Biology and Genomics, Jessenius Faculty of Medicine in Martin, Comenius University in Bratislava, Martin, Slovakia


Takeuchi F, Liang YQ, Shimizu-Furusawa H, Isono M, Ang MY, Mori K, Mori T, Kakazu E, Yoshio S, Kato N. Gene-regulation modules in nonalcoholic fatty liver disease revealed by single-nucleus ATAC-seq. Life Sci Alliance 2023;6:e202301988. [PMID: 37491046 PMCID: PMC10368228 DOI: 10.26508/lsa.202301988] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/13/2023] [Revised: 07/14/2023] [Accepted: 07/14/2023] [Indexed: 07/27/2023]

Affiliation(s)

Fumihiko Takeuchi Department of Gene Diagnostics and Therapeutics, Research Institute, National Center for Global Health and Medicine, Tokyo, Japan Medical Genomics Center, Research Institute, National Center for Global Health and Medicine, Tokyo, Japan Systems Genomics Laboratory, Baker Heart and Diabetes Institute, Melbourne, Australia
Yi-Qiang Liang Department of Gene Diagnostics and Therapeutics, Research Institute, National Center for Global Health and Medicine, Tokyo, Japan
Hana Shimizu-Furusawa Department of Hygiene and Public Health, School of Medicine, Teikyo University, Tokyo, Japan
Masato Isono Department of Gene Diagnostics and Therapeutics, Research Institute, National Center for Global Health and Medicine, Tokyo, Japan
Mia Yang Ang Department of Gene Diagnostics and Therapeutics, Research Institute, National Center for Global Health and Medicine, Tokyo, Japan Department of Clinical Genome Informatics, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
Kotaro Mori Medical Genomics Center, Research Institute, National Center for Global Health and Medicine, Tokyo, Japan
Taizo Mori Department of Liver Diseases, The Research Center for Hepatitis and Immunology, National Center for Global Health and Medicine, Chiba, Japan
Eiji Kakazu Department of Liver Diseases, The Research Center for Hepatitis and Immunology, National Center for Global Health and Medicine, Chiba, Japan
Sachiyo Yoshio Department of Liver Diseases, The Research Center for Hepatitis and Immunology, National Center for Global Health and Medicine, Chiba, Japan
Norihiro Kato Department of Gene Diagnostics and Therapeutics, Research Institute, National Center for Global Health and Medicine, Tokyo, Japan Medical Genomics Center, Research Institute, National Center for Global Health and Medicine, Tokyo, Japan Department of Clinical Genome Informatics, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan


Hakobyan S, Stepanyan A, Nersisyan L, Binder H, Arakelyan A. PSF toolkit: an R package for pathway curation and topology-aware analysis. Front Genet 2023;14:1264656. [PMID: 37680201 PMCID: PMC10482229 DOI: 10.3389/fgene.2023.1264656] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/21/2023] [Accepted: 08/09/2023] [Indexed: 09/09/2023]

Abstract

Most high throughput genomic data analysis pipelines currently rely on over-representation or gene set enrichment analysis (ORA/GSEA) approaches for functional analysis. In contrast, topology-based pathway analysis methods, which offer a more biologically informed perspective by incorporating interaction and topology information, have remained underutilized and inaccessible due to various limiting factors. These methods heavily rely on the quality of pathway topologies and often utilize predefined topologies from databases without assessing their correctness. To address these issues and make topology-aware pathway analysis more accessible and flexible, we introduce the PSF (Pathway Signal Flow) toolkit R package. Our toolkit integrates pathway curation and topology-based analysis, providing interactive and command-line tools that facilitate pathway importation, correction, and modification from diverse sources. This enables users to perform topology-based pathway signal flow analysis in both interactive and command-line modes. To showcase the toolkit's usability, we curated 36 KEGG signaling pathways and conducted several use-case studies, comparing our method with ORA and the topology-based signaling pathway impact analysis (SPIA) method. The results demonstrate that the algorithm can effectively identify ORA enriched pathways while providing more detailed branch-level information. Moreover, in contrast to the SPIA method, it offers the advantage of being cut-off free and less susceptible to the variability caused by selection thresholds. By combining pathway curation and topology-based analysis, the PSF toolkit enhances the quality, flexibility, and accessibility of topology-aware pathway analysis. Researchers can now easily import pathways from various sources, correct and modify them as needed, and perform detailed topology-based pathway signal flow analysis. 
In summary, our PSF toolkit offers an integrated solution that addresses the limitations of current topology-based pathway analysis methods. By providing interactive and command-line tools for pathway curation and topology-based analysis, we empower researchers to conduct comprehensive pathway analyses across a wide range of applications.


Xu S, Leng Y, Feng G, Zhang C, Chen M. A gene pathway enrichment method based on improved TF-IDF algorithm. Biochem Biophys Rep 2023;34:101421. [PMID: 36923007 PMCID: PMC10009669 DOI: 10.1016/j.bbrep.2023.101421] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/14/2022] [Revised: 12/20/2022] [Accepted: 01/03/2023] [Indexed: 03/08/2023]

Hicks EM, Seah C, Cote A, Marchese S, Brennand KJ, Nestler EJ, Girgenti MJ, Huckins LM. Integrating genetics and transcriptomics to study major depressive disorder: a conceptual framework, bioinformatic approaches, and recent findings. Transl Psychiatry 2023;13:129. [PMID: 37076454 PMCID: PMC10115809 DOI: 10.1038/s41398-023-02412-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Received: 08/09/2022] [Revised: 03/17/2023] [Accepted: 03/24/2023] [Indexed: 04/21/2023]

Affiliation(s)

Emily M Hicks Pamela Sklar Division of Psychiatric Genomics, Departments of Psychiatry and of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York, 10029, USA Nash Family Department of Neuroscience, Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, New York, 10029, USA
Carina Seah Pamela Sklar Division of Psychiatric Genomics, Departments of Psychiatry and of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York, 10029, USA Nash Family Department of Neuroscience, Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, New York, 10029, USA
Alanna Cote Pamela Sklar Division of Psychiatric Genomics, Departments of Psychiatry and of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York, 10029, USA
Shelby Marchese Pamela Sklar Division of Psychiatric Genomics, Departments of Psychiatry and of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York, 10029, USA
Kristen J Brennand Pamela Sklar Division of Psychiatric Genomics, Departments of Psychiatry and of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York, 10029, USA Nash Family Department of Neuroscience, Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, New York, 10029, USA Department of Genetics, Yale University School of Medicine, New Haven, CT, 06511, USA Department of Psychiatry, Yale University School of Medicine, New Haven, CT, 06511, USA
Eric J Nestler Nash Family Department of Neuroscience, Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, New York, 10029, USA
Matthew J Girgenti Department of Psychiatry, Yale University School of Medicine, New Haven, CT, 06511, USA.
Laura M Huckins Pamela Sklar Division of Psychiatric Genomics, Departments of Psychiatry and of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York, 10029, USA. Department of Psychiatry, Yale University School of Medicine, New Haven, CT, 06511, USA.


Angel-Velez D, Meese T, Hedia M, Fernandez-Montoro A, De Coster T, Pascottini OB, Van Nieuwerburgh F, Govaere J, Van Soom A, Pavani K, Smits K. Transcriptomics Reveal Molecular Differences in Equine Oocytes Vitrified before and after In Vitro Maturation. Int J Mol Sci 2023;24:ijms24086915. [PMID: 37108081 PMCID: PMC10138936 DOI: 10.3390/ijms24086915] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 02/28/2023] [Revised: 03/27/2023] [Accepted: 04/04/2023] [Indexed: 04/29/2023]

Affiliation(s)

Daniel Angel-Velez Department of Internal Medicine, Reproduction and Population Medicine, Faculty of Veterinary Medicine, Ghent University, Salisburylaan 133, 9820 Merelbeke, Belgium Research Group in Animal Sciences-INCA-CES, Universidad CES, Medellin 050021, Colombia
Tim Meese Laboratory for Pharmaceutical Biotechnology, Faculty of Pharmaceutical Science, Ghent University, 9000 Ghent, Belgium
Mohamed Hedia Department of Internal Medicine, Reproduction and Population Medicine, Faculty of Veterinary Medicine, Ghent University, Salisburylaan 133, 9820 Merelbeke, Belgium Department of Theriogenology, Faculty of Veterinary Medicine, Cairo University, Giza 12211, Egypt
Andrea Fernandez-Montoro Department of Internal Medicine, Reproduction and Population Medicine, Faculty of Veterinary Medicine, Ghent University, Salisburylaan 133, 9820 Merelbeke, Belgium
Tine De Coster Department of Internal Medicine, Reproduction and Population Medicine, Faculty of Veterinary Medicine, Ghent University, Salisburylaan 133, 9820 Merelbeke, Belgium
Osvaldo Bogado Pascottini Department of Internal Medicine, Reproduction and Population Medicine, Faculty of Veterinary Medicine, Ghent University, Salisburylaan 133, 9820 Merelbeke, Belgium
Filip Van Nieuwerburgh Laboratory for Pharmaceutical Biotechnology, Faculty of Pharmaceutical Science, Ghent University, 9000 Ghent, Belgium
Jan Govaere Department of Internal Medicine, Reproduction and Population Medicine, Faculty of Veterinary Medicine, Ghent University, Salisburylaan 133, 9820 Merelbeke, Belgium
Ann Van Soom Department of Internal Medicine, Reproduction and Population Medicine, Faculty of Veterinary Medicine, Ghent University, Salisburylaan 133, 9820 Merelbeke, Belgium
Krishna Pavani Department of Internal Medicine, Reproduction and Population Medicine, Faculty of Veterinary Medicine, Ghent University, Salisburylaan 133, 9820 Merelbeke, Belgium Department for Reproductive Medicine, Ghent University Hospital, Corneel Heymanslaan 10, 9000 Gent, Belgium
Katrien Smits Department of Internal Medicine, Reproduction and Population Medicine, Faculty of Veterinary Medicine, Ghent University, Salisburylaan 133, 9820 Merelbeke, Belgium


Sosa F, Uh K, Drum JN, Stoecklein KS, Davenport KM, Sofia Ortega M, Lee K, Hansen PJ. Disruption of CSF2RA in the bovine preimplantation embryo reduces development and affects embryonic gene expression in utero. REPRODUCTION AND FERTILITY 2023;4:RAF-23-0001. [PMID: 37000631 PMCID: PMC10160533 DOI: 10.1530/raf-23-0001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/03/2023] [Accepted: 03/31/2023] [Indexed: 04/01/2023]

Lu Y, Pang Z, Xia J. Comprehensive investigation of pathway enrichment methods for functional interpretation of LC-MS global metabolomics data. Brief Bioinform 2023;24:bbac553. [PMID: 36572652 PMCID: PMC9851290 DOI: 10.1093/bib/bbac553] [Citation(s) in RCA: 30] [Impact Index Per Article: 30.0] [Received: 09/23/2022] [Revised: 10/31/2022] [Accepted: 11/15/2022] [Indexed: 12/28/2022]

Ai H, Meng F, Ai Y. PathwayKO: An integrated platform for deciphering the systems-level signaling pathways. Front Immunol 2023;14:1103392. [PMID: 37033947 PMCID: PMC10080220 DOI: 10.3389/fimmu.2023.1103392] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/20/2022] [Accepted: 03/01/2023] [Indexed: 04/11/2023]

Abstract

Systems characterization of immune landscapes in health, disease and clinical intervention cases is a priority in modern medicine. High-throughput transcriptomes accumulated from gene-knockout (KO) experiments are crucial for deciphering target KO signaling pathways that are impaired by KO genes at the systems-level. There is a demand for integrative platforms. This article describes the PathwayKO platform, which has integrated state-of-the-art methods of pathway enrichment analysis, statistics analysis, and visualizing analysis to conduct cutting-edge integrative pathway analysis in a pipeline fashion and decipher target KO signaling pathways at the systems-level. We focus on describing the methodology, principles and application features of PathwayKO. First, we demonstrate that the PathwayKO platform can be utilized to comprehensively analyze real-world mouse KO transcriptomes (GSE22873 and GSE24327), which reveal systemic mechanisms underlying the innate immune responses triggered by non-infectious extensive hepatectomy (2 hours after 85% liver resection surgery) and infectious CASP-model sepsis (12 hours after CASP-model surgery). Strikingly, our results indicate that both cases hit the same core set of 21 KO MyD88-associated signaling pathways, including the Toll-like receptor signaling pathway, the NFκB signaling pathway, the MAPK signaling pathway, and the PD-L1 expression and PD-1 checkpoint pathway in cancer, alongside the pathways of bacterial, viral and parasitic infections. These findings suggest common fundamental mechanisms between these immune responses and offer informative cues that warrant future experimental validation. Such mechanisms in mice may serve as models for humans and ultimately guide formulating the research paradigms and composite strategies to reduce the high mortality rates of patients in intensive care units who have undergone successful traumatic surgical treatments. 
Second, we demonstrate that the PathwayKO platform model-based assessments can effectively evaluate the performance difference of pathway analysis methods when benchmarked with a collection of proper transcriptomes. Together, such advances in methods for deciphering biological insights at the systems-level may benefit the fields of bioinformatics, systems immunology and beyond.


Cousins H, Hall T, Guo Y, Tso L, Tzeng KTH, Cong L, Altman RB. Gene set proximity analysis: expanding gene set enrichment analysis through learned geometric embeddings, with drug-repurposing applications in COVID-19. Bioinformatics 2023;39:btac735. [PMID: 36394254 PMCID: PMC9805577 DOI: 10.1093/bioinformatics/btac735] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/04/2022] [Revised: 09/27/2022] [Accepted: 11/16/2022] [Indexed: 11/18/2022]

Maghsoudi Z, Nguyen H, Tavakkoli A, Nguyen T. A comprehensive survey of the approaches for pathway analysis using multi-omics data integration. Brief Bioinform 2022;23:6761962. [PMID: 36252928 PMCID: PMC9677478 DOI: 10.1093/bib/bbac435] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Received: 04/28/2022] [Revised: 08/26/2022] [Accepted: 09/08/2022] [Indexed: 02/07/2023]

Liu H, Yuan M, Mitra R, Zhou X, Long M, Lei W, Zhou S, Huang YE, Hou F, Eischen CM, Jiang W. CTpathway: a CrossTalk-based pathway enrichment analysis method for cancer research. Genome Med 2022;14:118. [PMID: 36229842 PMCID: PMC9563764 DOI: 10.1186/s13073-022-01119-6] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Received: 04/12/2022] [Accepted: 09/26/2022] [Indexed: 11/22/2022]

Abstract

BACKGROUND

Pathway enrichment analysis (PEA) is a common method for exploring functions of hundreds of genes and identifying disease-risk pathways. Moreover, different pathways exert their functions through crosstalk. However, existing PEA methods do not sufficiently integrate essential pathway features, including pathway crosstalk, molecular interactions, and network topologies, resulting in many risk pathways that remain uninvestigated.

METHODS

To overcome these limitations, we develop a new crosstalk-based PEA method, CTpathway, based on a global pathway crosstalk map (GPCM) with >440,000 edges by combining pathways from eight resources, transcription factor-gene regulations, and large-scale protein-protein interactions. Integrating gene differential expression and crosstalk effects in GPCM, we assign a risk score to genes in the GPCM and identify risk pathways enriched with the risk genes.

RESULTS

Analysis of >8300 expression profiles covering ten cancer tissues and blood samples indicates that CTpathway outperforms the current state-of-the-art methods in identifying risk pathways with higher accuracy, reproducibility, and speed. CTpathway recapitulates known risk pathways and exclusively identifies several previously unreported critical pathways for individual cancer types. CTpathway also outperforms other methods in identifying risk pathways across all cancer stages, including early-stage cancer with a small number of differentially expressed genes. Moreover, the robust design of CTpathway enables researchers to analyze both bulk and single-cell RNA-seq profiles to predict both cancer tissue and cell type-specific risk pathways with higher accuracy.

CONCLUSIONS

Collectively, CTpathway is a fast, accurate, and stable pathway enrichment analysis method for cancer research that can be used to identify cancer risk pathways. The CTpathway interactive web server can be accessed at http://www.jianglab.cn/CTpathway/. The stand-alone program can be accessed at https://github.com/Bioccjw/CTpathway.


Affiliation(s)

Haizhou Liu Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, No. 29, Jiangjun Avenue, Nanjing, 211106, Jiangsu Province, China
Mengqin Yuan Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, No. 29, Jiangjun Avenue, Nanjing, 211106, Jiangsu Province, China
Ramkrishna Mitra Department of Pharmacology, Physiology, and Cancer Biology, Sidney Kimmel Cancer Center, Thomas Jefferson University, 233 South 10th St., Philadelphia, PA, 19107, USA
Xu Zhou Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, No. 29, Jiangjun Avenue, Nanjing, 211106, Jiangsu Province, China
Min Long Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, No. 29, Jiangjun Avenue, Nanjing, 211106, Jiangsu Province, China
Wanyue Lei Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, No. 29, Jiangjun Avenue, Nanjing, 211106, Jiangsu Province, China
Shunheng Zhou Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, No. 29, Jiangjun Avenue, Nanjing, 211106, Jiangsu Province, China
Yu-E Huang Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, No. 29, Jiangjun Avenue, Nanjing, 211106, Jiangsu Province, China
Fei Hou Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, No. 29, Jiangjun Avenue, Nanjing, 211106, Jiangsu Province, China
Christine M Eischen Department of Pharmacology, Physiology, and Cancer Biology, Sidney Kimmel Cancer Center, Thomas Jefferson University, 233 South 10th St., Philadelphia, PA, 19107, USA.
Wei Jiang Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, No. 29, Jiangjun Avenue, Nanjing, 211106, Jiangsu Province, China.


Makrooni MA, O’Shea D, Geeleher P, Seoighe C. Random-effects meta-analysis of effect sizes as a unified framework for gene set analysis. PLoS Comput Biol 2022;18:e1010278. [PMID: 36197939 PMCID: PMC9576052 DOI: 10.1371/journal.pcbi.1010278] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/02/2022] [Revised: 10/17/2022] [Accepted: 09/18/2022] [Indexed: 11/06/2022]

Abstract

Gene set analysis (GSA) remains a common step in genome-scale studies because it can reveal insights that are not apparent from results obtained for individual genes. Many different computational tools are applied for GSA, which may be sensitive to different types of signals; however, most methods implicitly test whether there are differences in the distribution of the effect of some experimental condition between genes in gene sets of interest. We have developed a unifying framework for GSA that first fits effect size distributions, and then tests for differences in these distributions between gene sets. These differences can be in the proportions of genes that are perturbed or in the sign or size of the effects. Inspired by statistical meta-analysis, we take into account the uncertainty in effect size estimates by reducing the influence of genes with greater uncertainty on the estimation of distribution parameters. We demonstrate, using simulation and by application to real data, that this approach provides significant gains in performance over existing methods. Furthermore, the statistical tests carried out are defined in terms of effect sizes, rather than the results of prior statistical tests measuring these changes, which leads to improved interpretability and greater robustness to variation in sample sizes.

The role of gene set analysis is to identify groups of genes that are perturbed in a genomics experiment. There are many tools available for this task and they do not all test for the same types of changes. Here we propose a new way to carry out gene set analysis that involves first working out the distribution of the group effect in the gene set and then comparing this distribution to the equivalent distribution in other genes. Tests performed by existing tools for gene set analysis can be related to different comparisons in these distributions of group effects. A unified framework for gene set analysis provides for more explicit null hypotheses against which to test sets of genes for different types of responses to the experimental conditions. These results are more interpretable, because the group effect distributions can be compared visually, providing an indication of how the experimental effect differs between the gene sets.


Grassi M, Tarantino B. SEMgsa: topology-based pathway enrichment analysis with structural equation models. BMC Bioinformatics 2022;23:344. [PMID: 35978279 PMCID: PMC9385099 DOI: 10.1186/s12859-022-04884-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/25/2022] [Accepted: 08/09/2022] [Indexed: 11/25/2022]

Abstract

Background

Pathway enrichment analysis is extensively used in high-throughput experimental studies to gain insight into the functional roles of pre-defined subsets of genes, proteins and metabolites. Methods that leverage information on the topology of the underlying pathways outperform simpler methods that only consider pathway membership. Among the proposed software tools, there is a need to combine high statistical power with a user-friendly framework, which makes it difficult to choose the best method for a particular experimental environment.

Results

We propose SEMgsa, a topology-based algorithm developed within the framework of structural equation models. SEMgsa combines the SEM p values regarding node-specific group effect estimates in terms of activation or inhibition, after statistically controlling biological relations among genes within pathways. We used SEMgsa to identify biologically relevant results in a Coronavirus disease (COVID-19) RNA-seq dataset (GEO accession: GSE172114) together with a frontotemporal dementia (FTD) DNA methylation dataset (GEO accession: GSE53740) and compared its performance with some existing methods. SEMgsa is highly sensitive to the pathways designed for the specific disease, showing low p values (< 0.001) and ranking in high positions, outperforming existing software tools. Three pathway dysregulation mechanisms were used to generate simulated expression data and evaluate the performance of methods in terms of type I error followed by their statistical power. Simulation results confirm the best overall performance of SEMgsa.

Conclusions

SEMgsa is a novel yet powerful method for identifying enrichment with regard to gene expression data. It takes into account topological information and exploits pathway perturbation statistics to reveal biological information. SEMgsa is implemented in the R package SEMgraph, easily available at https://CRAN.R-project.org/package=SEMgraph.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12859-022-04884-8.


Ai H, Li B, Meng F, Ai Y. CASP-Model Sepsis Triggers Systemic Innate Immune Responses Revealed by the Systems-Level Signaling Pathways. Front Immunol 2022;13:907646. [PMID: 35774781 PMCID: PMC9238352 DOI: 10.3389/fimmu.2022.907646] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/30/2022] [Accepted: 04/28/2022] [Indexed: 12/05/2022]

Abstract

Colon ascendens stent peritonitis (CASP) surgery induces a leakage of intestinal contents which may cause polymicrobial sepsis related to post-operative failure of remote multi-organs (including kidney, liver, lung and heart) and possible death from systemic syndromes. Mechanisms underlying such phenomena remain unclear. This article aims to elucidate the mechanisms underlying the CASP-model sepsis by analyzing real-world GEO data (GSE24327_A, B and C) generated from mouse spleen 12 hours after a CASP-surgery in septic MyD88-deficient and wildtype mice, compared with untreated wildtype mice. Firstly, we identify and characterize 21 KO MyD88-associated signaling pathways, on which true key regulators (including ligands, receptors, adaptors, transducers, transcriptional factors and cytokines) are marked, which were coordinately, significantly, and differentially expressed at the systems-level, thus providing massive potential biomarkers that warrant experimental validations in the future. Secondly, we observe the full range of polymicrobial (viral, bacterial, and parasitic) sepsis triggered by the CASP-surgery by comparing the coordinated up- or down-regulations of true regulators among the experimental treatments borne by the three datasets under study. Finally, we discuss the observed phenomena of “systemic syndrome”, “cytokine storm” and “KO MyD88 attenuation”, as well as the proposed hypothesis of “spleen-mediated immune-cell infiltration”. Together, our results provide novel insights into a better understanding of innate immune responses triggered by the CASP-model sepsis in both wildtype and MyD88-deficient mice at the systems-level in a broader vision. This may serve as a model for humans and ultimately guide formulating the research paradigms and composite strategies for the early diagnosis and prevention of sepsis.


Mubeen S, Tom Kodamullil A, Hofmann-Apitius M, Domingo-Fernández D. On the influence of several factors on pathway enrichment analysis. Brief Bioinform 2022;23:bbac143. [PMID: 35453140 PMCID: PMC9116215 DOI: 10.1093/bib/bbac143] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Received: 01/17/2022] [Revised: 03/21/2022] [Accepted: 03/30/2022] [Indexed: 02/01/2023]

Ke X, Wu H, Chen YX, Guo Y, Yao S, Guo MR, Duan YY, Wang NN, Shi W, Wang C, Dong SS, Kang H, Dai Z, Yang TL. Individualized pathway activity algorithm identifies oncogenic pathways in pan-cancer analysis. EBioMedicine 2022;79:104014. [PMID: 35487057 PMCID: PMC9117264 DOI: 10.1016/j.ebiom.2022.104014] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 01/14/2022] [Revised: 04/04/2022] [Accepted: 04/05/2022] [Indexed: 02/07/2023]

Abstract

Background

Accumulating evidence has shown that dysregulation of biological pathways contributes to the initiation and progression of malignant tumours. Several methods for pathway activity measurement have been proposed, but they are restricted to making comparisons between groups or are sensitive to experimental batch effects.

Methods

We introduced a novel method for individualized pathway activity measurement (IPAM) that is based on the ranking of gene expression levels in an individual sample. Taking advantage of IPAM, we calculated the pathway activity of 318 pathways from the KEGG database in 10528 tumour/normal samples of 33 cancer types from TCGA to identify characteristic dysregulated pathways among different cancer types.

Findings

IPAM precisely quantified the level of activity of each pathway in pan-cancer analysis and exhibited better performance in cancer classification and prognosis prediction over five widely used tools. The average ROC-AUC of cancer diagnostic model using tumour-educated platelets (TEPs) reached 92.84%, suggesting the potential of our algorithm in early diagnosis of cancer. We identified several pathways significantly deregulated and associated with patient survival in a large fraction of cancer types, such as tyrosine metabolism, fatty acid degradation, cell cycle, p53 signalling pathway and DNA replication. We also confirmed the dominant role of metabolic pathways in cancer pathway dysregulation and identified the driving factors of specific pathway dysregulation, such as PPARA for branched-chain amino acid metabolism and NR1I2, NR1I3 for fatty acid metabolism.

Interpretation

Our study will provide novel clues for understanding the pathological mechanisms of cancer, ultimately paving the way for personalized medicine of cancer.

Funding

A full list of funding can be found in the Acknowledgements section.

Affiliation(s)

Xin Ke Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics and Genomics Center, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, PR China
Hao Wu Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics and Genomics Center, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, PR China
Yi-Xiao Chen National and Local Joint Engineering Research Center of Biodiagnosis and Biotherapy, The Second Affiliated Hospital, Xi'an Jiaotong University, Xi'an, Shaanxi 710004, PR China
Yan Guo Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics and Genomics Center, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, PR China
Shi Yao National and Local Joint Engineering Research Center of Biodiagnosis and Biotherapy, The Second Affiliated Hospital, Xi'an Jiaotong University, Xi'an, Shaanxi 710004, PR China
Ming-Rui Guo Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics and Genomics Center, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, PR China
Yuan-Yuan Duan Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics and Genomics Center, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, PR China
Nai-Ning Wang Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics and Genomics Center, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, PR China
Wei Shi Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics and Genomics Center, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, PR China
Chen Wang Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics and Genomics Center, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, PR China
Shan-Shan Dong Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics and Genomics Center, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, PR China
Huafeng Kang Department of Oncology, The Second Affiliated Hospital, Xi'an Jiaotong University, Xi'an, Shaanxi 710004, PR China
Zhijun Dai Department of Breast Surgery, The First Affiliated Hospital, College of Medicine, Zhejiang University, Hangzhou, PR China
Tie-Lin Yang Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics and Genomics Center, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, PR China; National and Local Joint Engineering Research Center of Biodiagnosis and Biotherapy, The Second Affiliated Hospital, Xi'an Jiaotong University, Xi'an, Shaanxi 710004, PR China.

Network- and enrichment-based inference of phenotypes and targets from large-scale disease maps. NPJ Syst Biol Appl 2022;8:13. [PMID: 35473910 PMCID: PMC9042890 DOI: 10.1038/s41540-022-00222-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Received: 10/05/2021] [Accepted: 03/22/2022] [Indexed: 01/09/2023] Open

Yue Z, Slominski R, Bharti S, Chen JY. PAGER Web APP: An Interactive, Online Gene Set and Network Interpretation Tool for Functional Genomics. Front Genet 2022;13:820361. [PMID: 35495152 PMCID: PMC9039620 DOI: 10.3389/fgene.2022.820361] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 11/22/2021] [Accepted: 03/17/2022] [Indexed: 12/30/2022] Open

Ogris C, Castresana-Aguirre M, Sonnhammer ELL. PathwAX II: Network-based pathway analysis with interactive visualization of network crosstalk. Bioinformatics 2022;38:2659-2660. [PMID: 35266519 PMCID: PMC9048662 DOI: 10.1093/bioinformatics/btac153] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/23/2021] [Revised: 02/03/2022] [Accepted: 03/09/2022] [Indexed: 11/28/2022] Open

Lycopene Supplementation to Serum-Free Maturation Medium Improves In Vitro Bovine Embryo Development and Quality and Modulates Embryonic Transcriptomic Profile. Antioxidants (Basel) 2022;11:antiox11020344. [PMID: 35204226 PMCID: PMC8868338 DOI: 10.3390/antiox11020344] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 01/03/2022] [Revised: 02/02/2022] [Accepted: 02/08/2022] [Indexed: 02/08/2023] Open

Pavel A, Serra A, Cattelani L, Federico A, Greco D. Network Analysis of Microarray Data. Methods Mol Biol 2022;2401:161-186. [PMID: 34902128 DOI: 10.1007/978-1-0716-1839-4_11] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Indexed: 06/14/2023]

Marczyk M, Macioszek A, Tobiasz J, Polanska J, Zyla J. Importance of SNP Dependency Correction and Association Integration for Gene Set Analysis in Genome-Wide Association Studies. Front Genet 2021;12:767358. [PMID: 34956320 PMCID: PMC8696167 DOI: 10.3389/fgene.2021.767358] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 08/30/2021] [Accepted: 11/10/2021] [Indexed: 11/13/2022] Open

Abstract

A typical genome-wide association study (GWAS) analyzes millions of single-nucleotide polymorphisms (SNPs), several of which are in a region of the same gene. To conduct gene set analysis (GSA), information from SNPs needs to be unified at the gene level. A widely used practice is to use only the most relevant SNP per gene; however, there are other methods of integration that could be applied here. Also, the problem of nonrandom association of alleles at two or more loci is often neglected. Here, we tested the impact of incorporation of different integrations and linkage disequilibrium (LD) correction on the performance of several GSA methods. Matched normal and breast cancer samples from The Cancer Genome Atlas database were used to evaluate the performance of six GSA algorithms: Coincident Extreme Ranks in Numerical Observations (CERNO), Gene Set Enrichment Analysis (GSEA), GSEA-SNP, improved GSEA for GWAS (i-GSEA4GWAS), Meta-Analysis Gene-set Enrichment of variaNT Associations (MAGENTA), and Over-Representation Analysis (ORA). Association of SNPs to phenotype was calculated using modified McNemar's test. Results for SNPs mapped to the same gene were integrated using Fisher and Stouffer methods and compared with the minimum p-value method. Four common measures were used to quantify the performance of all combinations of methods. Results of GSA on GWAS data were compared with those obtained from gene expression data. Comparing all evaluation metrics across different GSA algorithms, integrations, and LD correction, we highlighted CERNO and MAGENTA with Stouffer integration as the most efficient. Applying LD correction increased prioritization and specificity of enrichment outcomes for all tested algorithms. When Fisher or Stouffer were used with LD, sensitivity and reproducibility were also better. Using any integration method was beneficial in comparison with a minimum p-value method in specific combinations. The correlation between GSA results from the genomic and transcriptomic levels was highest when Stouffer integration was combined with LD correction. We thoroughly evaluated different approaches to GSA in GWAS in terms of performance to guide others to select the most effective combinations. We showed that LD correction and Stouffer integration could increase the performance of enrichment analysis and encourage the usage of these techniques.
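The three SNP-to-gene integration strategies named in this abstract (minimum p-value, Fisher, Stouffer) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names are hypothetical, the SNP p-values are assumed to be independent and strictly between 0 and 1, and no LD correction is applied.

```python
import math
from statistics import NormalDist


def min_p(pvalues):
    """Minimum p-value method: keep only the most significant SNP."""
    return min(pvalues)


def fisher_combine(pvalues):
    """Fisher's method: X = -2 * sum(ln p) follows chi-square with 2k df.

    For even degrees of freedom the chi-square survival function has the
    closed form exp(-x/2) * sum_{i<k} (x/2)^i / i!, so no SciPy is needed.
    """
    k = len(pvalues)
    half_stat = -sum(math.log(p) for p in pvalues)  # this is X / 2
    term, total = 1.0, 1.0
    for i in range(1, k):
        term *= half_stat / i
        total += term
    return math.exp(-half_stat) * total


def stouffer_combine(pvalues):
    """Stouffer's method: sum of z-scores divided by sqrt(k), unweighted."""
    nd = NormalDist()
    z = sum(nd.inv_cdf(1.0 - p) for p in pvalues) / math.sqrt(len(pvalues))
    return 1.0 - nd.cdf(z)


if __name__ == "__main__":
    # Hypothetical p-values for three SNPs mapped to the same gene.
    snp_pvalues = [0.04, 0.03, 0.20]
    print("min p   :", min_p(snp_pvalues))
    print("Fisher  :", fisher_combine(snp_pvalues))
    print("Stouffer:", stouffer_combine(snp_pvalues))
```

Unlike the minimum p-value method, both combination methods use every SNP in the gene, so several moderately small p-values can together yield a smaller gene-level p-value than any single SNP. The independence assumption is what LD between nearby SNPs violates, which is why the abstract pairs these integrations with LD correction.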

Wang G, Kitaoka T, Crawford A, Mao Q, Hesketh A, Guppy FM, Ash GI, Liu J, Gerstein MB, Pitsiladis YP. Cross-platform transcriptomic profiling of the response to recombinant human erythropoietin. Sci Rep 2021;11:21705. [PMID: 34737331 PMCID: PMC8568984 DOI: 10.1038/s41598-021-00608-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 05/09/2021] [Accepted: 10/11/2021] [Indexed: 11/08/2022] Open

Aranciaga N, Morton JD, Maes E, Gathercole JL, Berg DK. Proteomic determinants of uterine receptivity for pregnancy in early and mid-postpartum dairy cows. Biol Reprod 2021;105:1458-1473. [PMID: 34647570 DOI: 10.1093/biolre/ioab190] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 04/17/2021] [Revised: 08/03/2021] [Accepted: 10/13/2021] [Indexed: 11/14/2022] Open

Li X, Zhang B, Yu K, Bao Z, Zhang W, Bai Y. Identifying cancer specific signaling pathways based on the dysregulation between genes. Comput Biol Chem 2021;95:107586. [PMID: 34619555 DOI: 10.1016/j.compbiolchem.2021.107586] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/07/2021] [Revised: 08/10/2021] [Accepted: 09/26/2021] [Indexed: 11/26/2022]

Ramos M, Geistlinger L, Oh S, Schiffer L, Azhar R, Kodali H, de Bruijn I, Gao J, Carey VJ, Morgan M, Waldron L. Multiomic Integration of Public Oncology Databases in Bioconductor. JCO Clin Cancer Inform 2021;4:958-971. [PMID: 33119407 PMCID: PMC7608653 DOI: 10.1200/cci.19.00119] [Citation(s) in RCA: 40] [Impact Index Per Article: 13.3] [Indexed: 01/04/2023] Open

Abstract

PURPOSE

Investigations of the molecular basis for the development, progression, and treatment of cancer increasingly use complementary genomic assays to gather multiomic data, but management and analysis of such data remain complex. The cBioPortal for cancer genomics currently provides multiomic data from > 260 public studies, including The Cancer Genome Atlas (TCGA) data sets, but integration of different data types remains challenging and error prone for computational methods and tools using these resources. Recent advances in data infrastructure within the Bioconductor project enable a novel and powerful approach to creating fully integrated representations of these multiomic, pan-cancer databases.

METHODS

We provide a set of R/Bioconductor packages for working with TCGA legacy data and cBioPortal data, with special considerations for loading time; efficient representations in and out of memory; analysis platform; and an integrative framework, such as MultiAssayExperiment. Large methylation data sets are provided through out-of-memory data representation to provide responsive loading times and analysis capabilities on machines with limited memory.

RESULTS

We developed the curatedTCGAData and cBioPortalData R/Bioconductor packages to provide integrated multiomic data sets from the TCGA legacy database and the cBioPortal web application programming interface using the MultiAssayExperiment data structure. This suite of tools provides coordination of diverse experimental assays with clinicopathological data with minimal data management burden, as demonstrated through several greatly simplified multiomic and pan-cancer analyses.

CONCLUSION

These integrated representations enable analysts and tool developers to apply general statistical and plotting methods to extensive multiomic data through user-friendly commands and documented examples.

Affiliation(s)

Marcel Ramos Graduate School of Public Health and Health Policy, City University of New York, New York, NY; Institute for Implementation Science and Population Health, City University of New York, New York, NY; Roswell Park Comprehensive Cancer Center, Buffalo, NY
Ludwig Geistlinger Graduate School of Public Health and Health Policy, City University of New York, New York, NY; Institute for Implementation Science and Population Health, City University of New York, New York, NY
Sehyun Oh Graduate School of Public Health and Health Policy, City University of New York, New York, NY; Institute for Implementation Science and Population Health, City University of New York, New York, NY
Lucas Schiffer Graduate School of Public Health and Health Policy, City University of New York, New York, NY; Institute for Implementation Science and Population Health, City University of New York, New York, NY; Section of Computational Biomedicine, Boston University School of Medicine, Boston, MA
Rimsha Azhar Graduate School of Public Health and Health Policy, City University of New York, New York, NY; Institute for Implementation Science and Population Health, City University of New York, New York, NY; Department of Healthcare Policy and Research, Weill Cornell Medicine, New York, NY
Hanish Kodali Graduate School of Public Health and Health Policy, City University of New York, New York, NY; Institute for Implementation Science and Population Health, City University of New York, New York, NY
Ino de Bruijn Marie-Josée and Henry R. Kravis Center for Molecular Oncology, Memorial Sloan Kettering Cancer Center, New York, NY
Jianjiong Gao Marie-Josée and Henry R. Kravis Center for Molecular Oncology, Memorial Sloan Kettering Cancer Center, New York, NY; Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY
Vincent J Carey Channing Division of Network Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA
Martin Morgan Roswell Park Comprehensive Cancer Center, Buffalo, NY
Levi Waldron Graduate School of Public Health and Health Policy, City University of New York, New York, NY; Institute for Implementation Science and Population Health, City University of New York, New York, NY

Rondel FM, Hosseini R, Sahoo B, Knyazev S, Mandric I, Stewart F, Măndoiu II, Pasaniuc B, Porozov Y, Zelikovsky A. Pipeline for Analyzing Activity of Metabolic Pathways in Planktonic Communities Using Metatranscriptomic Data. J Comput Biol 2021;28:842-855. [PMID: 34264744 PMCID: PMC8575064 DOI: 10.1089/cmb.2021.0053] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 11/13/2022] Open

Nguyen H, Tran D, Galazka JM, Costes SV, Beheshti A, Petereit J, Draghici S, Nguyen T. CPA: a web-based platform for consensus pathway analysis and interactive visualization. Nucleic Acids Res 2021;49:W114-W124. [PMID: 34037798 PMCID: PMC8262702 DOI: 10.1093/nar/gkab421] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Received: 03/09/2021] [Revised: 04/16/2021] [Accepted: 05/05/2021] [Indexed: 01/06/2023] Open

Bu D, Luo H, Huo P, Wang Z, Zhang S, He Z, Wu Y, Zhao L, Liu J, Guo J, Fang S, Cao W, Yi L, Zhao Y, Kong L. KOBAS-i: intelligent prioritization and exploratory visualization of biological functions for gene enrichment analysis. Nucleic Acids Res 2021;49:W317-W325. [PMID: 34086934 PMCID: PMC8265193 DOI: 10.1093/nar/gkab447] [Citation(s) in RCA: 726] [Impact Index Per Article: 242.0] [Received: 03/24/2021] [Revised: 04/24/2021] [Accepted: 05/09/2021] [Indexed: 12/20/2022] Open

Xie C, Jauhari S, Mora A. Popularity and performance of bioinformatics software: the case of gene set analysis. BMC Bioinformatics 2021;22:191. [PMID: 33858350 PMCID: PMC8050894 DOI: 10.1186/s12859-021-04124-5] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Received: 06/23/2020] [Accepted: 04/08/2021] [Indexed: 11/22/2022] Open

Mansoori F, Rahgozar M, Kavousi K. A Pathway Analysis Approach Using Petri Net. IEEE J Biomed Health Inform 2021;25:874-880. [PMID: 32750945 DOI: 10.1109/jbhi.2020.3003996] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 11/06/2022]

Ietswaart R, Gyori BM, Bachman JA, Sorger PK, Churchman LS. GeneWalk identifies relevant gene functions for a biological context using network representation learning. Genome Biol 2021;22:55. [PMID: 33526072 PMCID: PMC7852222 DOI: 10.1186/s13059-021-02264-8] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Received: 07/15/2020] [Accepted: 01/05/2021] [Indexed: 12/13/2022] Open

Griss J, Viteri G, Sidiropoulos K, Nguyen V, Fabregat A, Hermjakob H. ReactomeGSA - Efficient Multi-Omics Comparative Pathway Analysis. Mol Cell Proteomics 2020;19:2115-2125. [PMID: 32907876 PMCID: PMC7710148 DOI: 10.1074/mcp.tir120.002155] [Citation(s) in RCA: 128] [Impact Index Per Article: 32.0] [Received: 06/02/2020] [Revised: 07/28/2020] [Indexed: 01/27/2023] Open

Kim W, Yoon SM, Kim S. A semi-automatic cell type annotation method for single-cell RNA sequencing dataset. Genomics Inform 2020;18:e26. [PMID: 33017870 PMCID: PMC7560448 DOI: 10.5808/gi.2020.18.3.e26] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/11/2020] [Accepted: 03/27/2020] [Indexed: 11/21/2022] Open

Bao Z, Zhang B, Li L, Ge Q, Gu W, Bai Y. Identifying disease-associated signaling pathways through a novel effector gene analysis. PeerJ 2020;8:e9695. [PMID: 32864216 PMCID: PMC7430270 DOI: 10.7717/peerj.9695] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 02/19/2020] [Accepted: 07/20/2020] [Indexed: 12/21/2022] Open

Saberian N, Shafi A, Peyvandipour A, Draghici S. MAGPEL: an autoMated pipeline for inferring vAriant-driven Gene PanEls from the full-length biomedical literature. Sci Rep 2020;10:12365. [PMID: 32703994 PMCID: PMC7378213 DOI: 10.1038/s41598-020-68649-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Received: 09/13/2019] [Accepted: 06/17/2020] [Indexed: 11/09/2022] Open

Soubeyrand S, Nikpay M, Lau P, Turner A, Hoang HD, Alain T, McPherson R. CARMAL Is a Long Non-coding RNA Locus That Regulates MFGE8 Expression. Front Genet 2020;11:631. [PMID: 32625236 PMCID: PMC7311772 DOI: 10.3389/fgene.2020.00631] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 02/10/2020] [Accepted: 05/26/2020] [Indexed: 12/27/2022] Open