Windels SFL, Malod-Dognin N, Pržulj N. Identifying cellular cancer mechanisms through pathway-driven data integration.
Bioinformatics 2022;
38:4344-4351. [PMID:
35916710 PMCID:
PMC9477533 DOI:
10.1093/bioinformatics/btac493]
[Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2022] [Revised: 06/14/2022] [Accepted: 07/30/2022] [Indexed: 12/24/2022] Open
Abstract
MOTIVATION
Cancer is a genetic disease in which accumulated mutations of driver genes induce a functional reorganization of the cell by reprogramming cellular pathways. Current approaches identify cancer pathways as those most internally perturbed by gene expression changes. However, driver genes characteristically perform hub roles between pathways. Therefore, we hypothesize that cancer pathways should be identified by changes in their pathway-pathway relationships.
RESULTS
To learn an embedding space that captures the relationships between pathways in a healthy cell, we propose pathway-driven non-negative matrix tri-factorization. In this space, we determine condition-specific (i.e. diseased and healthy) embeddings of pathways and genes. Based on these embeddings, we define our 'NMTF centrality' to measure a pathway's or gene's functional importance, and our 'moving distance', to measure the change in its functional relationships. We combine both measures to predict 15 genes and pathways involved in four major cancers, predicting 60 gene-cancer associations in total, covering 28 unique genes. To further exploit driver genes' tendency to perform hub roles, we model our network data using graphlet adjacency, which considers nodes adjacent if their interaction patterns form specific shapes (e.g. paths or triangles). We find that the predicted genes rewire pathway-pathway interactions in the immune system and provide literary evidence that many are druggable (15/28) and implicated in the associated cancers (47/60). We predict six druggable cancer-specific drug targets.
AVAILABILITY AND IMPLEMENTATION
The code and data are available at: https://gitlab.bsc.es/swindels/pathway_driven_nmtf.
SUPPLEMENTARY INFORMATION
Supplementary data are available at Bioinformatics online.
Collapse