1
|
Wang H, Torous W, Gong B, Purdom E. Visualizing scRNA-Seq data at population scale with GloScope. Genome Biol 2024; 25:259. [PMID: 39380041 PMCID: PMC11463121 DOI: 10.1186/s13059-024-03398-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2023] [Accepted: 09/20/2024] [Indexed: 10/10/2024] Open
Abstract
Increasingly, scRNA-Seq studies explore cell populations across different samples and the effect of sample heterogeneity on organism's phenotype. However, relatively few bioinformatic methods have been developed which adequately address the variation between samples for such population-level analyses. We propose a framework for representing the entire single-cell profile of a sample, which we call a GloScope representation. We implement GloScope on scRNA-Seq datasets from study designs ranging from 12 to over 300 samples and demonstrate how GloScope allows researchers to perform essential bioinformatic tasks at the sample-level, in particular visualization and quality control assessment.
Collapse
Affiliation(s)
- Hao Wang
- Division of Biostatistics, University of California, Berkeley, CA, USA
| | - William Torous
- Department of Statistics, University of California, Berkeley, CA, USA
| | - Boying Gong
- Division of Biostatistics, University of California, Berkeley, CA, USA
| | - Elizabeth Purdom
- Department of Statistics, University of California, Berkeley, CA, USA.
- Center for Computational Biology, University of California, Berkeley, CA, USA.
| |
Collapse
|
2
|
Almet AA, Tsai YC, Watanabe M, Nie Q. Inferring pattern-driving intercellular flows from single-cell and spatial transcriptomics. Nat Methods 2024; 21:1806-1817. [PMID: 39187683 PMCID: PMC11466815 DOI: 10.1038/s41592-024-02380-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2023] [Accepted: 07/23/2024] [Indexed: 08/28/2024]
Abstract
From single-cell RNA-sequencing (scRNA-seq) and spatial transcriptomics (ST), one can extract high-dimensional gene expression patterns that can be described by intercellular communication networks or decoupled gene modules. These two descriptions of information flow are often assumed to occur independently. However, intercellular communication drives directed flows of information that are mediated by intracellular gene modules, in turn triggering outflows of other signals. Methodologies to describe such intercellular flows are lacking. We present FlowSig, a method that infers communication-driven intercellular flows from scRNA-seq or ST data using graphical causal modeling and conditional independence. We benchmark FlowSig using newly generated experimental cortical organoid data and synthetic data generated from mathematical modeling. We demonstrate FlowSig's utility by applying it to various studies, showing that FlowSig can capture stimulation-induced changes to paracrine signaling in pancreatic islets, demonstrate shifts in intercellular flows due to increasing COVID-19 severity and reconstruct morphogen-driven activator-inhibitor patterns in mouse embryogenesis.
Collapse
Affiliation(s)
- Axel A Almet
- Department of Mathematics, University of California, Irvine, Irvine, CA, USA
- NSF-Simons Center for Multiscale Cell Fate Research, University of California, Irvine, Irvine, CA, USA
| | - Yuan-Chen Tsai
- Department of Anatomy & Neurobiology, University of California, Irvine, Irvine, CA, USA
- Sue & Bill Gross Stem Cell Research Center, University of California, Irvine, Irvine, CA, USA
- School of Medicine, University of California, Irvine, Irvine, CA, USA
| | - Momoko Watanabe
- Department of Anatomy & Neurobiology, University of California, Irvine, Irvine, CA, USA
- Sue & Bill Gross Stem Cell Research Center, University of California, Irvine, Irvine, CA, USA
- School of Medicine, University of California, Irvine, Irvine, CA, USA
| | - Qing Nie
- Department of Mathematics, University of California, Irvine, Irvine, CA, USA.
- NSF-Simons Center for Multiscale Cell Fate Research, University of California, Irvine, Irvine, CA, USA.
- Department of Developmental and Cell Biology, University of California, Irvine, Irvine, CA, USA.
| |
Collapse
|
3
|
Mitchel J, Gordon MG, Perez RK, Biederstedt E, Bueno R, Ye CJ, Kharchenko PV. Coordinated, multicellular patterns of transcriptional variation that stratify patient cohorts are revealed by tensor decomposition. Nat Biotechnol 2024:10.1038/s41587-024-02411-z. [PMID: 39313646 DOI: 10.1038/s41587-024-02411-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2024] [Accepted: 08/28/2024] [Indexed: 09/25/2024]
Abstract
Tissue-level and organism-level biological processes often involve the coordinated action of multiple distinct cell types. The recent application of single-cell assays to many individuals should enable the study of how donor-level variation in one cell type is linked to that in other cell types. Here we introduce a computational approach called single-cell interpretable tensor decomposition (scITD) to identify common axes of interindividual variation by considering joint expression variation across multiple cell types. scITD combines expression matrices from each cell type into a higher-order matrix and factorizes the result using the Tucker tensor decomposition. Applying scITD to single-cell RNA-sequencing data on 115 persons with lupus and 83 persons with coronavirus disease 2019, we identify patterns of coordinated cellular activity linked to disease severity and specific phenotypes, such as lupus nephritis. scITD results also implicate specific signaling pathways likely mediating coordination between cell types. Overall, scITD offers a tool for understanding the covariation of cell states across individuals, which can yield insights into the complex processes that define and stratify disease.
Collapse
Affiliation(s)
- Jonathan Mitchel
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Program in Health Sciences and Technology, Harvard Medical School and Massachusetts Institute of Technology, Boston, MA, USA
| | - M Grace Gordon
- Institute for Human Genetics, University of California, San Francisco, San Francisco, CA, USA
- UCSF Division of Rheumatology, Department of Medicine, University of California, San Francisco, San Francisco, CA, USA
| | - Richard K Perez
- School of Medicine, University of California, San Francisco, San Francisco, CA, USA
| | - Evan Biederstedt
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Raymund Bueno
- Institute for Human Genetics, University of California, San Francisco, San Francisco, CA, USA
- UCSF Division of Rheumatology, Department of Medicine, University of California, San Francisco, San Francisco, CA, USA
| | - Chun Jimmie Ye
- Institute for Human Genetics, University of California, San Francisco, San Francisco, CA, USA.
| | - Peter V Kharchenko
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
- San Diego Institute of Science, Altos Labs, San Diego, CA, USA.
| |
Collapse
|
4
|
Dimitrov D, Schäfer PSL, Farr E, Rodriguez-Mier P, Lobentanzer S, Badia-I-Mompel P, Dugourd A, Tanevski J, Ramirez Flores RO, Saez-Rodriguez J. LIANA+ provides an all-in-one framework for cell-cell communication inference. Nat Cell Biol 2024; 26:1613-1622. [PMID: 39223377 PMCID: PMC11392821 DOI: 10.1038/s41556-024-01469-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Accepted: 07/02/2024] [Indexed: 09/04/2024]
Abstract
The growing availability of single-cell and spatially resolved transcriptomics has led to the development of many approaches to infer cell-cell communication, each capturing only a partial view of the complex landscape of intercellular signalling. Here we present LIANA+, a scalable framework built around a rich knowledge base to decode coordinated inter- and intracellular signalling events from single- and multi-condition datasets in both single-cell and spatially resolved data. By extending and unifying established methodologies, LIANA+ provides a comprehensive set of synergistic components to study cell-cell communication via diverse molecular mediators, including those measured in multi-omics data. LIANA+ is accessible at https://github.com/saezlab/liana-py with extensive vignettes ( https://liana-py.readthedocs.io/ ) and provides an all-in-one solution to intercellular communication inference.
Collapse
Affiliation(s)
- Daniel Dimitrov
- Faculty of Medicine and Heidelberg University Hospital, Institute for Computational Biomedicine, Heidelberg University, Heidelberg, Germany
| | - Philipp Sven Lars Schäfer
- Faculty of Medicine and Heidelberg University Hospital, Institute for Computational Biomedicine, Heidelberg University, Heidelberg, Germany
| | - Elias Farr
- Faculty of Medicine and Heidelberg University Hospital, Institute for Computational Biomedicine, Heidelberg University, Heidelberg, Germany
| | - Pablo Rodriguez-Mier
- Faculty of Medicine and Heidelberg University Hospital, Institute for Computational Biomedicine, Heidelberg University, Heidelberg, Germany
| | - Sebastian Lobentanzer
- Faculty of Medicine and Heidelberg University Hospital, Institute for Computational Biomedicine, Heidelberg University, Heidelberg, Germany
| | - Pau Badia-I-Mompel
- Faculty of Medicine and Heidelberg University Hospital, Institute for Computational Biomedicine, Heidelberg University, Heidelberg, Germany
- GSK, Cellzome, Heidelberg, Germany
| | - Aurelien Dugourd
- Faculty of Medicine and Heidelberg University Hospital, Institute for Computational Biomedicine, Heidelberg University, Heidelberg, Germany
| | - Jovan Tanevski
- Faculty of Medicine and Heidelberg University Hospital, Institute for Computational Biomedicine, Heidelberg University, Heidelberg, Germany
| | - Ricardo Omar Ramirez Flores
- Faculty of Medicine and Heidelberg University Hospital, Institute for Computational Biomedicine, Heidelberg University, Heidelberg, Germany
| | - Julio Saez-Rodriguez
- Faculty of Medicine and Heidelberg University Hospital, Institute for Computational Biomedicine, Heidelberg University, Heidelberg, Germany.
- European Bioinformatics Institute, European Molecular Biology Laboratory, Hinxton, UK.
| |
Collapse
|
5
|
Lapuente-Santana Ó, Sturm G, Kant J, Ausserhofer M, Zackl C, Zopoglou M, McGranahan N, Rieder D, Trajanoski Z, da Cunha Carvalho de Miranda NF, Eduati F, Finotello F. Multimodal analysis unveils tumor microenvironment heterogeneity linked to immune activity and evasion. iScience 2024; 27:110529. [PMID: 39161957 PMCID: PMC11331718 DOI: 10.1016/j.isci.2024.110529] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2024] [Revised: 06/03/2024] [Accepted: 07/13/2024] [Indexed: 08/21/2024] Open
Abstract
The cellular and molecular heterogeneity of tumors is a major obstacle to cancer immunotherapy. Here, we use a systems biology approach to derive a signature of the main sources of heterogeneity in the tumor microenvironment (TME) from lung cancer transcriptomics. We demonstrate that this signature, which we called iHet, is conserved in different cancers and associated with antitumor immunity. Through analysis of single-cell and spatial transcriptomics data, we trace back the cellular origin of the variability explaining the iHet signature. Finally, we demonstrate that iHet has predictive value for cancer immunotherapy, which can be further improved by disentangling three major determinants of anticancer immune responses: activity of immune cells, immune infiltration or exclusion, and cancer-cell foreignness. This work shows how transcriptomics data can be integrated to derive a holistic representation of the phenotypic heterogeneity of the TME and to predict its unfolding and fate during immunotherapy with immune checkpoint blockers.
Collapse
Affiliation(s)
- Óscar Lapuente-Santana
- Department of Biomedical Engineering and Institute for Complex Molecular Systems, Eindhoven University of Technology, 5612 AZ Eindhoven, the Netherlands
- Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), 28029 Madrid, Spain
| | - Gregor Sturm
- Biocenter, Institute of Bioinformatics, Medical University of Innsbruck, 6020 Innsbruck, Austria
- Boehringer Ingelheim International Pharma GmbH & Co KG, 55216 Ingelheim am Rhein, Germany
| | - Joan Kant
- Department of Biomedical Engineering and Institute for Complex Molecular Systems, Eindhoven University of Technology, 5612 AZ Eindhoven, the Netherlands
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON M5G 2M9, Canada
| | - Markus Ausserhofer
- Department of Molecular Biology, Digital Science Center (DiSC), University of Innsbruck, 6020 Innsbruck, Austria
| | - Constantin Zackl
- Department of Molecular Biology, Digital Science Center (DiSC), University of Innsbruck, 6020 Innsbruck, Austria
| | - Maria Zopoglou
- Department of Molecular Biology, Digital Science Center (DiSC), University of Innsbruck, 6020 Innsbruck, Austria
| | - Nicholas McGranahan
- Cancer Research UK Lung Cancer Centre of Excellence, University College London Cancer Institute, London WC1E 6DD, UK
- Cancer Genome Evolution Research Group, University College London Cancer Institute, London WC1E 6DD, UK
| | - Dietmar Rieder
- Biocenter, Institute of Bioinformatics, Medical University of Innsbruck, 6020 Innsbruck, Austria
| | - Zlatko Trajanoski
- Biocenter, Institute of Bioinformatics, Medical University of Innsbruck, 6020 Innsbruck, Austria
| | | | - Federica Eduati
- Department of Biomedical Engineering and Institute for Complex Molecular Systems, Eindhoven University of Technology, 5612 AZ Eindhoven, the Netherlands
| | - Francesca Finotello
- Department of Molecular Biology, Digital Science Center (DiSC), University of Innsbruck, 6020 Innsbruck, Austria
| |
Collapse
|
6
|
Guo B, Ling W, Kwon SH, Panwar P, Ghazanfar S, Martinowich K, Hicks SC. Integrating spatially-resolved transcriptomics data across tissues and individuals: challenges and opportunities. ARXIV 2024:arXiv:2408.00367v1. [PMID: 39130195 PMCID: PMC11312629] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 08/13/2024]
Abstract
Advances in spatially-resolved transcriptomics (SRT) technologies have propelled the development of new computational analysis methods to unlock biological insights. As the cost of generating these data decreases, these technologies provide an exciting opportunity to create large-scale atlases that integrate SRT data across multiple tissues, individuals, species, or phenotypes to perform population-level analyses. Here, we describe unique challenges of varying spatial resolutions in SRT data, as well as highlight the opportunities for standardized preprocessing methods along with computational algorithms amenable to atlas-scale datasets leading to improved sensitivity and reproducibility in the future.
Collapse
Affiliation(s)
- Boyi Guo
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Wodan Ling
- Division of Biostatistics, Department of Population Health Sciences, Weill Cornell Medicine, NY, USA
| | - Sang Ho Kwon
- Lieber Institute for Brain Development, Johns Hopkins Medical Campus, Baltimore, MD, USA
- Solomon H. Snyder Department of Neuroscience, Johns Hopkins School of Medicine, Baltimore, MD, USA
- Biochemistry, Cellular, and Molecular Biology Graduate Program, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Pratibha Panwar
- School of Mathematics and Statistics, The University of Sydney, NSW 2006, Australia
- Sydney Precision Data Science Centre, University of Sydney, NSW 2006, Australia
- Charles Perkins Centre, The University of Sydney, NSW 2006, Australia
| | - Shila Ghazanfar
- School of Mathematics and Statistics, The University of Sydney, NSW 2006, Australia
- Sydney Precision Data Science Centre, University of Sydney, NSW 2006, Australia
- Charles Perkins Centre, The University of Sydney, NSW 2006, Australia
| | - Keri Martinowich
- Lieber Institute for Brain Development, Johns Hopkins Medical Campus, Baltimore, MD, USA
- Solomon H. Snyder Department of Neuroscience, Johns Hopkins School of Medicine, Baltimore, MD, USA
- Department of Psychiatry and Behavioral Sciences, Johns Hopkins School of Medicine, Baltimore, MD, USA
- Johns Hopkins Kavli Neuroscience Discovery Institute, Baltimore, MD, USA
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Stephanie C. Hicks
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
- Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA
- Malone Center for Engineering in Healthcare, Johns Hopkins University, Baltimore, MD, USA
| |
Collapse
|
7
|
Armingol E, Baghdassarian HM, Lewis NE. The diversification of methods for studying cell-cell interactions and communication. Nat Rev Genet 2024; 25:381-400. [PMID: 38238518 PMCID: PMC11139546 DOI: 10.1038/s41576-023-00685-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/01/2023] [Indexed: 05/20/2024]
Abstract
No cell lives in a vacuum, and the molecular interactions between cells define most phenotypes. Transcriptomics provides rich information to infer cell-cell interactions and communication, thus accelerating the discovery of the roles of cells within their communities. Such research relies heavily on algorithms that infer which cells are interacting and the ligands and receptors involved. Specific pressures on different research niches are driving the evolution of next-generation computational tools, enabling new conceptual opportunities and technological advances. More sophisticated algorithms now account for the heterogeneity and spatial organization of cells, multiple ligand types and intracellular signalling events, and enable the use of larger and more complex datasets, including single-cell and spatial transcriptomics. Similarly, new high-throughput experimental methods are increasing the number and resolution of interactions that can be analysed simultaneously. Here, we explore recent progress in cell-cell interaction research and highlight the diversification of the next generation of tools, which have yielded a rich ecosystem of tools for different applications and are enabling invaluable discoveries.
Collapse
Affiliation(s)
- Erick Armingol
- Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, La Jolla, CA, USA.
- Department of Paediatrics, University of California, San Diego, La Jolla, CA, USA.
| | - Hratch M Baghdassarian
- Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, La Jolla, CA, USA
- Department of Paediatrics, University of California, San Diego, La Jolla, CA, USA
| | - Nathan E Lewis
- Department of Paediatrics, University of California, San Diego, La Jolla, CA, USA.
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA.
| |
Collapse
|
8
|
Ramirez Flores RO, Schäfer PSL, Küchenhoff L, Saez-Rodriguez J. Complementing Cell Taxonomies with a Multicellular Analysis of Tissues. Physiology (Bethesda) 2024; 39:0. [PMID: 38319138 DOI: 10.1152/physiol.00001.2024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2024] [Accepted: 01/31/2024] [Indexed: 02/07/2024] Open
Abstract
The application of single-cell molecular profiling coupled with spatial technologies has enabled charting of cellular heterogeneity in reference tissues and in disease. This new wave of molecular data has highlighted the expected diversity of single-cell dynamics upon shared external queues and spatial organizations. However, little is known about the relationship between single-cell heterogeneity and the emergence and maintenance of robust multicellular processes in developed tissues and its role in (patho)physiology. Here, we present emerging computational modeling strategies that use increasingly available large-scale cross-condition single-cell and spatial datasets to study multicellular organization in tissues and complement cell taxonomies. This perspective should enable us to better understand how cells within tissues collectively process information and adapt synchronized responses in disease contexts and to bridge the gap between structural changes and functions in tissues.
Collapse
Affiliation(s)
- Ricardo Omar Ramirez Flores
- Faculty of Medicine, Heidelberg University and Institute for Computational Biomedicine, Heidelberg University Hospital, Heidelberg, Germany
| | - Philipp Sven Lars Schäfer
- Faculty of Medicine, Heidelberg University and Institute for Computational Biomedicine, Heidelberg University Hospital, Heidelberg, Germany
| | - Leonie Küchenhoff
- Faculty of Medicine, Heidelberg University and Institute for Computational Biomedicine, Heidelberg University Hospital, Heidelberg, Germany
| | - Julio Saez-Rodriguez
- Faculty of Medicine, Heidelberg University and Institute for Computational Biomedicine, Heidelberg University Hospital, Heidelberg, Germany
| |
Collapse
|
9
|
Baghdassarian HM, Dimitrov D, Armingol E, Saez-Rodriguez J, Lewis NE. Combining LIANA and Tensor-cell2cell to decipher cell-cell communication across multiple samples. CELL REPORTS METHODS 2024; 4:100758. [PMID: 38631346 PMCID: PMC11046036 DOI: 10.1016/j.crmeth.2024.100758] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Revised: 12/22/2023] [Accepted: 03/22/2024] [Indexed: 04/19/2024]
Abstract
In recent years, data-driven inference of cell-cell communication has helped reveal coordinated biological processes across cell types. Here, we integrate two tools, LIANA and Tensor-cell2cell, which, when combined, can deploy multiple existing methods and resources to enable the robust and flexible identification of cell-cell communication programs across multiple samples. In this work, we show how the integration of our tools facilitates the choice of method to infer cell-cell communication and subsequently perform an unsupervised deconvolution to obtain and summarize biological insights. We explain how to perform the analysis step by step in both Python and R and provide online tutorials with detailed instructions available at https://ccc-protocols.readthedocs.io/. This workflow typically takes ∼1.5 h to complete from installation to downstream visualizations on a graphics processing unit-enabled computer for a dataset of ∼63,000 cells, 10 cell types, and 12 samples.
Collapse
Affiliation(s)
- Hratch M Baghdassarian
- Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, La Jolla, CA 92093, USA; Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
| | - Daniel Dimitrov
- Heidelberg University, Faculty of Medicine, and Heidelberg University Hospital, Institute for Computational Biomedicine, 69120 Heidelberg, Germany
| | - Erick Armingol
- Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, La Jolla, CA 92093, USA; Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
| | - Julio Saez-Rodriguez
- Heidelberg University, Faculty of Medicine, and Heidelberg University Hospital, Institute for Computational Biomedicine, 69120 Heidelberg, Germany.
| | - Nathan E Lewis
- Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA; Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA.
| |
Collapse
|
10
|
Schäfer PSL, Dimitrov D, Villablanca EJ, Saez-Rodriguez J. Integrating single-cell multi-omics and prior biological knowledge for a functional characterization of the immune system. Nat Immunol 2024; 25:405-417. [PMID: 38413722 DOI: 10.1038/s41590-024-01768-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Accepted: 01/16/2024] [Indexed: 02/29/2024]
Abstract
The immune system comprises diverse specialized cell types that cooperate to defend the host against a wide range of pathogenic threats. Recent advancements in single-cell and spatial multi-omics technologies provide rich information about the molecular state of immune cells. Here, we review how the integration of single-cell and spatial multi-omics data with prior knowledge-gathered from decades of detailed biochemical studies-allows us to obtain functional insights, focusing on gene regulatory processes and cell-cell interactions. We present diverse applications in immunology and critically assess underlying assumptions and limitations. Finally, we offer a perspective on the ongoing technological and algorithmic developments that promise to get us closer to a systemic mechanistic understanding of the immune system.
Collapse
Affiliation(s)
- Philipp Sven Lars Schäfer
- Institute for Computational Bioscience, Faculty of Medicine and Heidelberg University Hospital, Heidelberg University, Heidelberg, Germany
| | - Daniel Dimitrov
- Institute for Computational Bioscience, Faculty of Medicine and Heidelberg University Hospital, Heidelberg University, Heidelberg, Germany
| | - Eduardo J Villablanca
- Division of Immunology and Allergy, Department of Medicine Solna, Karolinska Institute and Karolinska University Hospital, Stockholm, Sweden
- Center of Molecular Medicine, Stockholm, Sweden
| | - Julio Saez-Rodriguez
- Institute for Computational Bioscience, Faculty of Medicine and Heidelberg University Hospital, Heidelberg University, Heidelberg, Germany.
| |
Collapse
|
11
|
Joodaki M, Shaigan M, Parra V, Bülow RD, Kuppe C, Hölscher DL, Cheng M, Nagai JS, Goedertier M, Bouteldja N, Tesar V, Barratt J, Roberts IS, Coppo R, Kramann R, Boor P, Costa IG. Detection of PatIent-Level distances from single cell genomics and pathomics data with Optimal Transport (PILOT). Mol Syst Biol 2024; 20:57-74. [PMID: 38177382 PMCID: PMC10883279 DOI: 10.1038/s44320-023-00003-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Revised: 11/20/2023] [Accepted: 11/24/2023] [Indexed: 01/06/2024] Open
Abstract
Although clinical applications represent the next challenge in single-cell genomics and digital pathology, we still lack computational methods to analyze single-cell or pathomics data to find sample-level trajectories or clusters associated with diseases. This remains challenging as single-cell/pathomics data are multi-scale, i.e., a sample is represented by clusters of cells/structures, and samples cannot be easily compared with each other. Here we propose PatIent Level analysis with Optimal Transport (PILOT). PILOT uses optimal transport to compute the Wasserstein distance between two individual single-cell samples. This allows us to perform unsupervised analysis at the sample level and uncover trajectories or cellular clusters associated with disease progression. We evaluate PILOT and competing approaches in single-cell genomics or pathomics studies involving various human diseases with up to 600 samples/patients and millions of cells or tissue structures. Our results demonstrate that PILOT detects disease-associated samples from large and complex single-cell or pathomics data. Moreover, PILOT provides a statistical approach to find changes in cell populations, gene expression, and tissue structures related to the trajectories or clusters supporting interpretation of predictions.
Collapse
Affiliation(s)
- Mehdi Joodaki
- Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University Medical School, Aachen, Germany
| | - Mina Shaigan
- Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University Medical School, Aachen, Germany
| | - Victor Parra
- Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University Medical School, Aachen, Germany
| | - Roman D Bülow
- Institute of Pathology, RWTH Aachen University Medical School, Aachen, Germany
| | - Christoph Kuppe
- Institute of Experimental Medicine and Systems Biology, RWTH Aachen University, Aachen, Germany
| | - David L Hölscher
- Institute of Pathology, RWTH Aachen University Medical School, Aachen, Germany
| | - Mingbo Cheng
- Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University Medical School, Aachen, Germany
| | - James S Nagai
- Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University Medical School, Aachen, Germany
| | - Michaël Goedertier
- Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University Medical School, Aachen, Germany
- Institute of Pathology, RWTH Aachen University Medical School, Aachen, Germany
| | - Nassim Bouteldja
- Institute of Pathology, RWTH Aachen University Medical School, Aachen, Germany
| | - Vladimir Tesar
- Department of Nephrology, 1st Faculty of Medicine and General University Hospital, Charles University, Prague, Czech Republic
| | - Jonathan Barratt
- John Walls Renal Unit, University Hospital of Leicester National Health Service Trust, Leicester, UK
- Department of Cardiovascular Sciences, University of Leicester, Leicester, UK
| | - Ian Sd Roberts
- Department of Cellular Pathology, Oxford University Hospitals National Health Services Foundation Trust, Oxford, UK
| | - Rosanna Coppo
- Fondazione Ricerca Molinette, Regina Margherita Children's University Hospital, Torino, Italy
| | - Rafael Kramann
- Institute of Experimental Medicine and Systems Biology, RWTH Aachen University, Aachen, Germany
- Department of Internal Medicine, Nephrology and Transplantation, Erasmus Medical Center, Rotterdam, Netherlands
| | - Peter Boor
- Institute of Pathology, RWTH Aachen University Medical School, Aachen, Germany.
| | - Ivan G Costa
- Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University Medical School, Aachen, Germany.
| |
Collapse
|
12
|
Ramirez Flores RO, Lanzer JD, Dimitrov D, Velten B, Saez-Rodriguez J. Multicellular factor analysis of single-cell data for a tissue-centric understanding of disease. eLife 2023; 12:e93161. [PMID: 37991480 PMCID: PMC10718529 DOI: 10.7554/elife.93161] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Accepted: 11/14/2023] [Indexed: 11/23/2023] Open
Abstract
Biomedical single-cell atlases describe disease at the cellular level. However, analysis of this data commonly focuses on cell-type-centric pairwise cross-condition comparisons, disregarding the multicellular nature of disease processes. Here, we propose multicellular factor analysis for the unsupervised analysis of samples from cross-condition single-cell atlases and the identification of multicellular programs associated with disease. Our strategy, which repurposes group factor analysis as implemented in multi-omics factor analysis, incorporates the variation of patient samples across cell-types or other tissue-centric features, such as cell compositions or spatial relationships, and enables the joint analysis of multiple patient cohorts, facilitating the integration of atlases. We applied our framework to a collection of acute and chronic human heart failure atlases and described multicellular processes of cardiac remodeling, independent to cellular compositions and their local organization, that were conserved in independent spatial and bulk transcriptomics datasets. In sum, our framework serves as an exploratory tool for unsupervised analysis of cross-condition single-cell atlases and allows for the integration of the measurements of patient cohorts across distinct data modalities.
Collapse
Affiliation(s)
- Ricardo Omar Ramirez Flores
- Heidelberg University, Faculty of Medicine, and Heidelberg University Hospital, Institute for Computational Biomedicine, BioQuantHeidelbergGermany
| | - Jan David Lanzer
- Heidelberg University, Faculty of Medicine, and Heidelberg University Hospital, Institute for Computational Biomedicine, BioQuantHeidelbergGermany
| | - Daniel Dimitrov
- Heidelberg University, Faculty of Medicine, and Heidelberg University Hospital, Institute for Computational Biomedicine, BioQuantHeidelbergGermany
| | - Britta Velten
- Heidelberg University, Centre for Organismal Studies, Centre for Scientific ComputingHeidelbergGermany
| | - Julio Saez-Rodriguez
- Heidelberg University, Faculty of Medicine, and Heidelberg University Hospital, Institute for Computational Biomedicine, BioQuantHeidelbergGermany
| |
Collapse
|