1
|
Meng Q, Wei L, Ma K, Shi M, Lin X, Ho JWK, Li Y, Zhang X. scDecouple: decoupling cellular response from infected proportion bias in scCRISPR-seq. Brief Bioinform 2024; 25:bbae011. [PMID: 38324621 PMCID: PMC10849189 DOI: 10.1093/bib/bbae011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Revised: 12/18/2023] [Accepted: 01/05/2024] [Indexed: 02/09/2024] Open
Abstract
Single-cell clustered regularly interspaced short palindromic repeats-sequencing (scCRISPR-seq) is an emerging high-throughput CRISPR screening technology where the true cellular response to perturbation is coupled with infected proportion bias of guide RNAs (gRNAs) across different cell clusters. The mixing of these effects introduces noise into scCRISPR-seq data analysis and thus obstacles to relevant studies. We developed scDecouple to decouple true cellular response of perturbation from the influence of infected proportion bias. scDecouple first models the distribution of gene expression profiles in perturbed cells and then iteratively finds the maximum likelihood of cell cluster proportions as well as the cellular response for each gRNA. We demonstrated its performance in a series of simulation experiments. By applying scDecouple to real scCRISPR-seq data, we found that scDecouple enhances the identification of biologically perturbation-related genes. scDecouple can benefit scCRISPR-seq data analysis, especially in the case of heterogeneous samples or complex gRNA libraries.
Collapse
Affiliation(s)
- Qiuchen Meng
- MOE Key Lab of Bioinformatics & Bioinformatics Division BRNIST, Department of Automation, Tsinghua University, Beijing 100084, China
| | - Lei Wei
- MOE Key Lab of Bioinformatics & Bioinformatics Division BRNIST, Department of Automation, Tsinghua University, Beijing 100084, China
| | - Kun Ma
- School of Biomedical Sciences, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Pokfulam, Hong Kong SAR, China
- Laboratory of Data Discovery for Health Limited (D24H), Hong Kong Science Park, Hong Kong SAR, China
| | - Ming Shi
- MOE Key Lab of Bioinformatics & Bioinformatics Division BRNIST, Department of Automation, Tsinghua University, Beijing 100084, China
| | - Xinyi Lin
- School of Biomedical Sciences, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Pokfulam, Hong Kong SAR, China
- Laboratory of Data Discovery for Health Limited (D24H), Hong Kong Science Park, Hong Kong SAR, China
| | - Joshua W K Ho
- School of Biomedical Sciences, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Pokfulam, Hong Kong SAR, China
- Laboratory of Data Discovery for Health Limited (D24H), Hong Kong Science Park, Hong Kong SAR, China
| | - Yinqing Li
- MOE Key Laboratory of Bioinformatics, Tsinghua University, Beijing 100084, China
- IDG-McGovern Institute for Brain Research, Center for Synthetic and Systems Biology, School of Pharmaceutical Sciences, Tsinghua University, Beijing 100084, China
| | - Xuegong Zhang
- MOE Key Lab of Bioinformatics & Bioinformatics Division BRNIST, Department of Automation, Tsinghua University, Beijing 100084, China
- Center for Synthetic and Systems Biology, School of Life Sciences and School of Medicine, Tsinghua University, Beijing 100084, China
| |
Collapse
|
2
|
Zhang C, Hu Y, Gao L. Defining and identifying cell sub-crosstalk pairs for characterizing cell-cell communication patterns. Sci Rep 2023; 13:15746. [PMID: 37735248 PMCID: PMC10514069 DOI: 10.1038/s41598-023-42883-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Accepted: 09/15/2023] [Indexed: 09/23/2023] Open
Abstract
Current cell-cell communication analysis focuses on quantifying intercellular interactions at cell type level. In the tissue microenvironment, one type of cells could be divided into multiple cell subgroups that function differently and communicate with other cell types or subgroups via different ligand-receptor-mediated signaling pathways. Given two cell types, we define a cell sub-crosstalk pair (CSCP) as a combination of two cell subgroups with strong and similar intercellular crosstalk signals and identify CSCPs based on coupled non-negative matrix factorization. Using single-cell spatial transcriptomics data of mouse olfactory bulb and visual cortex, we find that cells of different types within CSCPs are significantly spatially closer with each other than those in the whole single-cell spatial map. To demonstrate the utility of CSCPs, we apply 13 cell-cell communication analysis methods to sampled single-cell transcriptomics datasets at CSCP level and reveal ligand-receptor interactions masked at cell type level. Furthermore, by analyzing single-cell transcriptomics data from 29 breast cancer patients with different immunotherapy responses, we find that CSCPs are useful predictive features to discriminate patients responding to anti-PD-1 therapy from non-responders. Taken together, partitioning a cell type pair into CSCPs enables fine-grained characterization of cell-cell communication in tissue and tumor microenvironments.
Collapse
Affiliation(s)
- Chenxing Zhang
- School of Computer Science and Technology, Xidian University, Xi'an, 710071, China
| | - Yuxuan Hu
- School of Computer Science and Technology, Xidian University, Xi'an, 710071, China
| | - Lin Gao
- School of Computer Science and Technology, Xidian University, Xi'an, 710071, China.
| |
Collapse
|
3
|
Pirkl M, Büch J, Devaux C, Böhm M, Sönnerborg A, Incardona F, Abecasis A, Vandamme AM, Zazzi M, Kaiser R, Lengauer T, The EuResist Network Study Group. Analysis of mutational history of multidrug-resistant genotypes with a mutagenetic tree model. J Med Virol 2023; 95:e28389. [PMID: 36484375 DOI: 10.1002/jmv.28389] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Revised: 11/24/2022] [Accepted: 12/01/2022] [Indexed: 12/14/2022]
Abstract
Human immunodeficiency virus (HIV) can develop resistance to all antiretroviral drugs. Multidrug resistance, however, is a rare event in modern HIV treatment, but can be life-threatening, particular in patients with very long therapy histories and in areas with limited access to novel drugs. To understand the evolution of multidrug resistance, we analyzed the EuResist database to uncover the accumulation of mutations over time. We hypothesize that the accumulation of resistance mutations is not acquired simultaneously and randomly across viral genotypes but rather tends to follow a predetermined order. The knowledge of this order might help to elucidate potential mechanisms of multidrug resistance. Our evolutionary model shows an almost monotonic increase of resistance with each acquired mutation, including less well-known nucleoside reverse transcriptase (RT) inhibitor-related mutations like K223Q, L228H, and Q242H. Mutations within the integrase (IN) (T97A, E138A/K G140S, Q148H, N155H) indicate high probability of multidrug resistance. Hence, these IN mutations also tend to be observed together with mutations in the protease (PR) and RT. We followed up with an analysis of the mutation-specific error rates of our model given the data. We identified several mutations with unusual rates (PR: M41L, L33F, IN: G140S). This could imply the existence of previously unknown virus variants in the viral quasispecies. In conclusion, our bioinformatics model supports the analysis and understanding of multidrug resistance.
Collapse
Affiliation(s)
- Martin Pirkl
- Institute of Virology, University Hospital Cologne, University of Cologne, Cologne, Germany
| | - Joachim Büch
- Institute of Virology, University Hospital Cologne, University of Cologne, Cologne, Germany
| | - Carole Devaux
- Department of Infection and Immunity, Luxembourg Institute of Health, Strassen, Luxembourg
| | - Michael Böhm
- Institute of Virology, University Hospital Cologne, University of Cologne, Cologne, Germany
| | - Anders Sönnerborg
- Department of Laboratory Medicine, Division of Clinical Microbiology, Karolinska Institute, Solna, Sweden
| | | | - Ana Abecasis
- Center for Global Health and Tropical Medicine, Instituto de Higiene e Medicina Tropical, Universidade Nova de Lisboa, Lisbon, Portugal
| | - Anne-Mieke Vandamme
- Center for Global Health and Tropical Medicine, Instituto de Higiene e Medicina Tropical, Universidade Nova de Lisboa, Lisbon, Portugal.,Department of Microbiology, Immunology and Transplantation, Clinical and Epidemiological Virology, Institute for the Future, Rega Institute for Medical Research, KU Leuven, Leuven, Belgium
| | - Maurizio Zazzi
- Department of Medical Biotechnologies, University of Siena, Siena, Italy
| | - Rolf Kaiser
- Institute of Virology, University Hospital Cologne, University of Cologne, Cologne, Germany
| | - Thomas Lengauer
- Institute of Virology, University Hospital Cologne, University of Cologne, Cologne, Germany
| | | |
Collapse
|
4
|
Lüönd F, Pirkl M, Hisano M, Prestigiacomo V, Kalathur RK, Beerenwinkel N, Christofori G. Hierarchy of TGFβ/SMAD, Hippo/YAP/TAZ, and Wnt/β-catenin signaling in melanoma phenotype switching. Life Sci Alliance 2021; 5:5/2/e202101010. [PMID: 34819356 PMCID: PMC8616544 DOI: 10.26508/lsa.202101010] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2021] [Revised: 11/11/2021] [Accepted: 11/12/2021] [Indexed: 12/13/2022] Open
Abstract
TGFβ, YAP/TAZ, and canonical Wnt/β-catenin signaling functionally interact in a hierarchical manner to induce the switching of melanoma cells from proliferative-to-invasive cell phenotype. In melanoma, a switch from a proliferative melanocytic to an invasive mesenchymal phenotype is based on dramatic transcriptional reprogramming which involves complex interactions between a variety of signaling pathways and their downstream transcriptional regulators. TGFβ/SMAD, Hippo/YAP/TAZ, and Wnt/β-catenin signaling pathways are major inducers of transcriptional reprogramming and converge at several levels. Here, we report that TGFβ/SMAD, YAP/TAZ, and β-catenin are all required for a proliferative-to-invasive phenotype switch. Loss and gain of function experimentation, global gene expression analysis, and computational nested effects models revealed the hierarchy between these signaling pathways and identified shared target genes. SMAD-mediated transcription at the top of the hierarchy leads to the activation of YAP/TAZ and of β-catenin, with YAP/TAZ governing an essential subprogram of TGFβ-induced phenotype switching. Wnt/β-catenin signaling is situated further downstream and exerts a dual role: it promotes the proliferative, differentiated melanoma cell phenotype and it is essential but not sufficient for SMAD or YAP/TAZ–induced phenotype switching. The results identify epistatic interactions among the signaling pathways underlying melanoma phenotype switching and highlight the priorities in targets for melanoma therapy.
Collapse
Affiliation(s)
- Fabiana Lüönd
- Department of Biomedicine, University of Basel, Basel, Switzerland
| | - Martin Pirkl
- Department of Biosystems Science and Engineering, ETH Zurich, Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Mizue Hisano
- Department of Biomedicine, University of Basel, Basel, Switzerland
| | | | - Ravi Kr Kalathur
- Department of Biomedicine, University of Basel, Basel, Switzerland
| | - Niko Beerenwinkel
- Department of Biosystems Science and Engineering, ETH Zurich, Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | | |
Collapse
|
5
|
Zhang Y, Zhu L, Wang X. NEM-Tar: A Probabilistic Graphical Model for Cancer Regulatory Network Inference and Prioritization of Potential Therapeutic Targets From Multi-Omics Data. Front Genet 2021; 12:608042. [PMID: 33968127 PMCID: PMC8100334 DOI: 10.3389/fgene.2021.608042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2020] [Accepted: 03/22/2021] [Indexed: 11/13/2022] Open
Abstract
Targeted therapy has been widely adopted as an effective treatment strategy to battle against cancer. However, cancers are not single disease entities, but comprising multiple molecularly distinct subtypes, and the heterogeneity nature prevents precise selection of patients for optimized therapy. Dissecting cancer subtype-specific signaling pathways is crucial to pinpointing dysregulated genes for the prioritization of novel therapeutic targets. Nested effects models (NEMs) are a group of graphical models that encode subset relations between observed downstream effects under perturbations to upstream signaling genes, providing a prototype for mapping the inner workings of the cell. In this study, we developed NEM-Tar, which extends the original NEMs to predict drug targets by incorporating causal information of (epi)genetic aberrations for signaling pathway inference. An information theory-based score, weighted information gain (WIG), was proposed to assess the impact of signaling genes on a specific downstream biological process of interest. Subsequently, we conducted simulation studies to compare three inference methods and found that the greedy hill-climbing algorithm demonstrated the highest accuracy and robustness to noise. Furthermore, two case studies were conducted using multi-omics data for colorectal cancer (CRC) and gastric cancer (GC) in the TCGA database. Using NEM-Tar, we inferred signaling networks driving the poor-prognosis subtypes of CRC and GC, respectively. Our model prioritized not only potential individual drug targets such as HER2, for which FDA-approved inhibitors are available but also the combinations of multiple targets potentially useful for the design of combination therapies.
Collapse
Affiliation(s)
- Yuchen Zhang
- Department of Biomedical Sciences, City University of Hong Kong, Hong Kong, China
| | - Lina Zhu
- Department of Biomedical Sciences, City University of Hong Kong, Hong Kong, China
| | - Xin Wang
- Department of Biomedical Sciences, City University of Hong Kong, Hong Kong, China.,Key Laboratory of Biochip Technology, Biotech and Health Centre, Shenzhen Research Institute, City University of Hong Kong, Shenzhen, China
| |
Collapse
|
6
|
Pirkl M, Beerenwinkel N. Inferring perturbation profiles of cancer samples. Bioinformatics 2021; 37:2441-2449. [PMID: 33617647 PMCID: PMC8388028 DOI: 10.1093/bioinformatics/btab113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2019] [Revised: 11/20/2020] [Accepted: 02/18/2021] [Indexed: 11/25/2022] Open
Abstract
Motivation Cancer is one of the most prevalent diseases in the world. Tumors arise due to important genes changing their activity, e.g. when inhibited or over-expressed. But these gene perturbations are difficult to observe directly. Molecular profiles of tumors can provide indirect evidence of gene perturbations. However, inferring perturbation profiles from molecular alterations is challenging due to error-prone molecular measurements and incomplete coverage of all possible molecular causes of gene perturbations. Results We have developed a novel mathematical method to analyze cancer driver genes and their patient-specific perturbation profiles. We combine genetic aberrations with gene expression data in a causal network derived across patients to infer unobserved perturbations. We show that our method can predict perturbations in simulations, CRISPR perturbation screens and breast cancer samples from The Cancer Genome Atlas. Availability and implementation The method is available as the R-package nempi at https://github.com/cbg-ethz/nempi and http://bioconductor.org/packages/nempi. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Martin Pirkl
- Department of Biosystems Science and Engineering, ETH Zurich, Basel, 4058, Switzerland.,Swiss Institute of Bioinformatics, Basel, 4058, Switzerland
| | - Niko Beerenwinkel
- Department of Biosystems Science and Engineering, ETH Zurich, Basel, 4058, Switzerland.,Swiss Institute of Bioinformatics, Basel, 4058, Switzerland
| |
Collapse
|
7
|
Holding AN, Cook HV, Markowetz F. Data generation and network reconstruction strategies for single cell transcriptomic profiles of CRISPR-mediated gene perturbations. BIOCHIMICA ET BIOPHYSICA ACTA. GENE REGULATORY MECHANISMS 2020; 1863:194441. [PMID: 31756390 DOI: 10.1016/j.bbagrm.2019.194441] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/07/2019] [Revised: 10/01/2019] [Accepted: 10/01/2019] [Indexed: 02/05/2023]
Abstract
Recent advances in single-cell RNA-sequencing (scRNA-seq) in combination with CRISPR/Cas9 technologies have enabled the development of methods for large-scale perturbation studies with transcriptional readouts. These methods are highly scalable and have the potential to provide a wealth of information on the biological networks that underlie cellular response. Here we discuss how to overcome several key challenges to generate and analyse data for the confident reconstruction of models of the underlying cellular network. Some challenges are generic, and apply to analysing any single-cell transcriptomic data, while others are specific to combined single-cell CRISPR/Cas9 data, in particular barcode swapping, knockdown efficiency, multiplicity of infection and potential confounding factors. We also provide a curated collection of published data sets to aid the development of analysis strategies. Finally, we discuss several network reconstruction approaches, including co-expression networks and Bayesian networks, as well as their limitations, and highlight the potential of Nested Effects Models for network reconstruction from scRNA-seq data. This article is part of a Special Issue entitled: Transcriptional Profiles and Regulatory Gene Networks edited by Dr. Dr. Federico Manuel Giorgi and Dr. Shaun Mahony.
Collapse
Affiliation(s)
- Andrew N Holding
- Department of Biology, University of York, York, UK; York Biomedical Research Institute, University of York, York, UK; CRUK Cambridge Institute, University of Cambridge, Robinson Way, Cambridge, UK; The Alan Turing Institute, 96 Euston Road, Kings Cross, London, UK
| | - Helen V Cook
- Department of Biology, University of York, York, UK
| | | |
Collapse
|
8
|
|