1
|
Das Adhikari S, Yang J, Wang J, Cui Y. Recent advances in spatially variable gene detection in spatial transcriptomics. Comput Struct Biotechnol J 2024; 23:883-891. [PMID: 38370977 PMCID: PMC10869304 DOI: 10.1016/j.csbj.2024.01.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2023] [Revised: 01/22/2024] [Accepted: 01/22/2024] [Indexed: 02/20/2024] Open
Abstract
With the emergence of advanced spatial transcriptomic technologies, there has been a surge in research papers dedicated to analyzing spatial transcriptomics data, resulting in significant contributions to our understanding of biology. The initial stage of downstream analysis of spatial transcriptomic data has centered on identifying spatially variable genes (SVGs) or genes expressed with specific spatial patterns across the tissue. SVG detection is an important task since many downstream analyses depend on these selected SVGs. Over the past few years, a plethora of new methods have been proposed for the detection of SVGs, accompanied by numerous innovative concepts and discussions. This article provides a selective review of methods and their practical implementations, offering valuable insights into the current literature in this field.
Collapse
Affiliation(s)
- Sikta Das Adhikari
- Department of Computational Mathematics, Science and Engineering, Michigan State University, East Lansing, MI 48824, USA
- Department of Statistics and Probability, Michigan State University, East Lansing, MI 48824, USA
| | - Jiaxin Yang
- Department of Computational Mathematics, Science and Engineering, Michigan State University, East Lansing, MI 48824, USA
| | - Jianrong Wang
- Department of Computational Mathematics, Science and Engineering, Michigan State University, East Lansing, MI 48824, USA
| | - Yuehua Cui
- Department of Statistics and Probability, Michigan State University, East Lansing, MI 48824, USA
| |
Collapse
|
2
|
Yan G, Hua SH, Li JJ. Categorization of 31 computational methods to detect spatially variable genes from spatially resolved transcriptomics data. ARXIV 2024:arXiv:2405.18779v2. [PMID: 38855546 PMCID: PMC11160866] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 06/11/2024]
Abstract
In the analysis of spatially resolved transcriptomics data, detecting spatially variable genes (SVGs) is crucial. Numerous computational methods exist, but varying SVG definitions and methodologies lead to incomparable results. We review 31 state-of-the-art methods, categorizing SVGs into three types: overall, cell-type-specific, and spatial-domain-marker SVGs. Our review explains the intuitions underlying these methods, summarizes their applications, and categorizes the hypothesis tests they use in the trade-off between generality and specificity for SVG detection. We discuss challenges in SVG detection and propose future directions for improvement. Our review offers insights for method developers and users, advocating for category-specific benchmarking.
Collapse
Affiliation(s)
- Guanao Yan
- Department of Statistics, University of California, Los Angeles, CA 90095-1554
| | - Shuo Harper Hua
- Department of Biomedical Data Science, Stanford University, Stanford, CA 94305
| | - Jingyi Jessica Li
- Department of Statistics, University of California, Los Angeles, CA 90095-1554
- Department of Human Genetics, University of California, Los Angeles, CA 90095-7088
- Department of Computational Medicine, University of California, Los Angeles, CA 90095-1766
- Department of Biostatistics, University of California, Los Angeles, CA 90095-1772
- Radcliffe Institute for Advanced Study, Harvard University, Cambridge, MA 02138
| |
Collapse
|
3
|
Ohn J, Seo MK, Park J, Lee D, Choi H. SpatialSPM: statistical parametric mapping for the comparison of gene expression pattern images in multiple spatial transcriptomic datasets. Nucleic Acids Res 2024; 52:e51. [PMID: 38676948 PMCID: PMC11194061 DOI: 10.1093/nar/gkae293] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2023] [Revised: 03/19/2024] [Accepted: 04/05/2024] [Indexed: 04/29/2024] Open
Abstract
Spatial transcriptomic (ST) techniques help us understand the gene expression levels in specific parts of tissues and organs, providing insights into their biological functions. Even though ST dataset provides information on the gene expression and its location for each sample, it is challenging to compare spatial gene expression patterns across tissue samples with different shapes and coordinates. Here, we propose a method, SpatialSPM, that reconstructs ST data into multi-dimensional image matrices to ensure comparability across different samples through spatial registration process. We demonstrated the applicability of this method by kidney and mouse olfactory bulb datasets as well as mouse brain ST datasets to investigate and directly compare gene expression in a specific anatomical region of interest, pixel by pixel, across various biological statuses. Beyond traditional analyses, SpatialSPM is capable of generating statistical parametric maps, including T-scores and Pearson correlation coefficients. This feature enables the identification of specific regions exhibiting differentially expressed genes across tissue samples, enhancing the depth and specificity of ST studies. Our approach provides an efficient way to analyze ST datasets and may offer detailed insights into various biological conditions.
Collapse
Affiliation(s)
| | | | | | | | - Hongyoon Choi
- Portrai, Inc., Seoul 03136, Republic of Korea
- Department of Nuclear Medicine, Seoul National University Hospital, Seoul 03080, Republic of Korea
- Department of Nuclear Medicine, Seoul National University College of Medicine, Seoul 03080, Republic of Korea
| |
Collapse
|
4
|
Lin S, Cui Y, Zhao F, Yang Z, Song J, Yao J, Zhao Y, Qian BZ, Zhao Y, Yuan Z. Complete spatially resolved gene expression is not necessary for identifying spatial domains. CELL GENOMICS 2024; 4:100565. [PMID: 38781966 PMCID: PMC11228956 DOI: 10.1016/j.xgen.2024.100565] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Revised: 02/29/2024] [Accepted: 04/30/2024] [Indexed: 05/25/2024]
Abstract
Spatially resolved transcriptomics (SRT) technologies have revolutionized the study of tissue organization. We introduce a graph convolutional network with an attention and positive emphasis mechanism, termed BINARY, relying exclusively on binarized SRT data to accurately delineate spatial domains. BINARY outperforms existing methods across various SRT data types while using significantly less input information. Our study suggests that precise gene expression quantification may not always be essential, inspiring further exploration of the broader applications of spatially resolved binarized gene expression data.
Collapse
Affiliation(s)
- Senlin Lin
- Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
| | - Yan Cui
- Institute of Science and Technology for Brain-Inspired Intelligence, MOE Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China; Center for Medical Research and Innovation, Shanghai Pudong Hospital, Fudan University Pudong Medical Center, Fudan University, Shanghai, China
| | - Fangyuan Zhao
- Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
| | - Zhidong Yang
- Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
| | - Jiangning Song
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Clayton, VIC 3800, Australia
| | | | - Yu Zhao
- AI Lab, Tencent, Shenzhen, China
| | - Bin-Zhi Qian
- Fudan University Shanghai Cancer Center, Department of Oncology, Shanghai Medical College, The Human Phenome Institute, Zhangjiang-Fudan International Innovation Center, Fudan University, Shanghai, China
| | - Yi Zhao
- Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China.
| | - Zhiyuan Yuan
- Institute of Science and Technology for Brain-Inspired Intelligence, MOE Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China; Center for Medical Research and Innovation, Shanghai Pudong Hospital, Fudan University Pudong Medical Center, Fudan University, Shanghai, China.
| |
Collapse
|
5
|
Yu S, Li WV. spVC for the detection and interpretation of spatial gene expression variation. Genome Biol 2024; 25:103. [PMID: 38641849 PMCID: PMC11027374 DOI: 10.1186/s13059-024-03245-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2023] [Accepted: 04/10/2024] [Indexed: 04/21/2024] Open
Abstract
Spatially resolved transcriptomics technologies have opened new avenues for understanding gene expression heterogeneity in spatial contexts. However, existing methods for identifying spatially variable genes often focus solely on statistical significance, limiting their ability to capture continuous expression patterns and integrate spot-level covariates. To address these challenges, we introduce spVC, a statistical method based on a generalized Poisson model. spVC seamlessly integrates constant and spatially varying effects of covariates, facilitating comprehensive exploration of gene expression variability and enhancing interpretability. Simulation and real data applications confirm spVC's accuracy in these tasks, highlighting its versatility in spatial transcriptomics analysis.
Collapse
Affiliation(s)
- Shan Yu
- Department of Statistics, Unversity of Virginia, Charlottesville, 22903, VA, USA.
| | - Wei Vivian Li
- Department of Statistics, University of California, Riverside, 92521, CA, USA.
| |
Collapse
|
6
|
Samadi Z, Askary A. Spatial motifs reveal patterns in cellular architecture of complex tissues. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.08.588586. [PMID: 38645046 PMCID: PMC11030378 DOI: 10.1101/2024.04.08.588586] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/23/2024]
Abstract
Spatial organization of cells is crucial to both proper physiological function of tissues and pathological conditions like cancer. Recent advances in spatial transcriptomics have enabled joint profiling of gene expression and spatial context of the cells. The outcome is an information rich map of the tissue where individual cells, or small regions, can be labeled based on their gene expression state. While spatial transcriptomics excels in its capacity to profile numerous genes within the same sample, most existing methods for analysis of spatial data only examine distribution of one or two labels at a time. These approaches overlook the potential for identifying higher-order associations between cell types - associations that can play a pivotal role in understanding development and function of complex tissues. In this context, we introduce a novel method for detecting motifs in spatial neighborhood graphs. Each motif represents a spatial arrangement of cell types that occurs in the tissue more frequently than expected by chance. To identify spatial motifs, we developed an algorithm for uniform sampling of paths from neighborhood graphs and combined it with a motif finding algorithm on graphs inspired by previous methods for finding motifs in DNA sequences. Using synthetic data with known ground truth, we show that our method can identify spatial motifs with high accuracy and sensitivity. Applied to spatial maps of mouse retinal bipolar cells and hypothalamic preoptic region, our method reveals previously unrecognized patterns in cell type arrangements. In some cases, cells within these spatial patterns differ in their gene expression from other cells of the same type, providing insights into the functional significance of the spatial motifs. These results suggest that our method can illuminate the substantial complexity of neural tissues, provide novel insight even in well studied models, and generate experimentally testable hypotheses.
Collapse
Affiliation(s)
- Zainalabedin Samadi
- Department of Molecular, Cell and Developmental Biology, University of California, Los Angeles, Los Angeles, 90095, CA, USA
| | - Amjad Askary
- Department of Molecular, Cell and Developmental Biology, University of California, Los Angeles, Los Angeles, 90095, CA, USA
| |
Collapse
|
7
|
Guo X, Ning J, Chen Y, Liu G, Zhao L, Fan Y, Sun S. Recent advances in differential expression analysis for single-cell RNA-seq and spatially resolved transcriptomic studies. Brief Funct Genomics 2024; 23:95-109. [PMID: 37022699 DOI: 10.1093/bfgp/elad011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2022] [Revised: 12/09/2022] [Accepted: 03/10/2023] [Indexed: 04/07/2023] Open
Abstract
Differential expression (DE) analysis is a necessary step in the analysis of single-cell RNA sequencing (scRNA-seq) and spatially resolved transcriptomics (SRT) data. Unlike traditional bulk RNA-seq, DE analysis for scRNA-seq or SRT data has unique characteristics that may contribute to the difficulty of detecting DE genes. However, the plethora of DE tools that work with various assumptions makes it difficult to choose an appropriate one. Furthermore, a comprehensive review on detecting DE genes for scRNA-seq data or SRT data from multi-condition, multi-sample experimental designs is lacking. To bridge such a gap, here, we first focus on the challenges of DE detection, then highlight potential opportunities that facilitate further progress in scRNA-seq or SRT analysis, and finally provide insights and guidance in selecting appropriate DE tools or developing new computational DE methods.
Collapse
Affiliation(s)
- Xiya Guo
- School of Public Health, Xi'an Jiaotong University, Xi'an, Shaanxi 710061, P.R. China
- Key Laboratory of Trace Elements and Endemic Diseases, Center for Single Cell Omics and Health, Xi'an Jiaotong University, Xi'an, Shaanxi 710061, P.R. China
| | - Jin Ning
- School of Public Health, Xi'an Jiaotong University, Xi'an, Shaanxi 710061, P.R. China
- Key Laboratory of Trace Elements and Endemic Diseases, Center for Single Cell Omics and Health, Xi'an Jiaotong University, Xi'an, Shaanxi 710061, P.R. China
| | - Yuanze Chen
- School of Public Health, Xi'an Jiaotong University, Xi'an, Shaanxi 710061, P.R. China
- Key Laboratory of Trace Elements and Endemic Diseases, Center for Single Cell Omics and Health, Xi'an Jiaotong University, Xi'an, Shaanxi 710061, P.R. China
| | - Guoliang Liu
- School of Public Health, Xi'an Jiaotong University, Xi'an, Shaanxi 710061, P.R. China
- Key Laboratory of Trace Elements and Endemic Diseases, Center for Single Cell Omics and Health, Xi'an Jiaotong University, Xi'an, Shaanxi 710061, P.R. China
| | - Liyan Zhao
- School of Public Health, Xi'an Jiaotong University, Xi'an, Shaanxi 710061, P.R. China
- Key Laboratory of Trace Elements and Endemic Diseases, Center for Single Cell Omics and Health, Xi'an Jiaotong University, Xi'an, Shaanxi 710061, P.R. China
| | - Yue Fan
- School of Public Health, Xi'an Jiaotong University, Xi'an, Shaanxi 710061, P.R. China
- Key Laboratory of Trace Elements and Endemic Diseases, Center for Single Cell Omics and Health, Xi'an Jiaotong University, Xi'an, Shaanxi 710061, P.R. China
| | - Shiquan Sun
- School of Public Health, Xi'an Jiaotong University, Xi'an, Shaanxi 710061, P.R. China
- Key Laboratory of Trace Elements and Endemic Diseases, Center for Single Cell Omics and Health, Xi'an Jiaotong University, Xi'an, Shaanxi 710061, P.R. China
| |
Collapse
|
8
|
Lin S, Zhao F, Wu Z, Yao J, Zhao Y, Yuan Z. Streamlining spatial omics data analysis with Pysodb. Nat Protoc 2024; 19:831-895. [PMID: 38135744 DOI: 10.1038/s41596-023-00925-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Accepted: 10/02/2023] [Indexed: 12/24/2023]
Abstract
Advances in spatial omics technologies have improved the understanding of cellular organization in tissues, leading to the generation of complex and heterogeneous data and prompting the development of specialized tools for managing, loading and visualizing spatial omics data. The Spatial Omics Database (SODB) was established to offer a unified format for data storage and interactive visualization modules. Here we detail the use of Pysodb, a Python-based tool designed to enable the efficient exploration and loading of spatial datasets from SODB within a Python environment. We present seven case studies using Pysodb, detailing the interaction with various computational methods, ensuring reproducibility of experimental data and facilitating the integration of new data and alternative applications in SODB. The approach offers a reference for method developers by outlining label and metadata availability in representative spatial data that can be loaded by Pysodb. The tool is supplemented by a website ( https://protocols-pysodb.readthedocs.io/ ) with detailed information for benchmarking analysis, and allows method developers to focus on computational models by facilitating data processing. This protocol is designed for researchers with limited experience in computational biology. Depending on the dataset complexity, the protocol typically requires ~12 h to complete.
Collapse
Affiliation(s)
- Senlin Lin
- Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Fangyuan Zhao
- Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | | | | | - Yi Zhao
- Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China.
- University of Chinese Academy of Sciences, Beijing, China.
| | - Zhiyuan Yuan
- Institute of Science and Technology for Brain-Inspired Intelligence, MOE Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China.
- Center for Medical Research and Innovation, Shanghai Pudong Hospital, Fudan University Pudong Medical Center, Fudan University, Shanghai, China.
| |
Collapse
|
9
|
Li Z, Patel ZM, Song D, Yan G, Li JJ, Pinello L. Benchmarking computational methods to identify spatially variable genes and peaks. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.02.569717. [PMID: 38076922 PMCID: PMC10705556 DOI: 10.1101/2023.12.02.569717] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/17/2023]
Abstract
Spatially resolved transcriptomics offers unprecedented insight by enabling the profiling of gene expression within the intact spatial context of cells, effectively adding a new and essential dimension to data interpretation. To efficiently detect spatial structure of interest, an essential step in analyzing such data involves identifying spatially variable genes. Despite researchers having developed several computational methods to accomplish this task, the lack of a comprehensive benchmark evaluating their performance remains a considerable gap in the field. Here, we present a systematic evaluation of 14 methods using 60 simulated datasets generated by four different simulation strategies, 12 real-world transcriptomics, and three spatial ATAC-seq datasets. We find that spatialDE2 consistently outperforms the other benchmarked methods, and Moran's I achieves competitive performance in different experimental settings. Moreover, our results reveal that more specialized algorithms are needed to identify spatially variable peaks.
Collapse
Affiliation(s)
- Zhijian Li
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Molecular Pathology Unit, Center for Cancer Research, Massachusetts General Hospital, Boston, MA, USA
- Department of Pathology, Harvard Medical School, Boston, MA, USA
| | - Zain M. Patel
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Molecular Pathology Unit, Center for Cancer Research, Massachusetts General Hospital, Boston, MA, USA
- Department of Pathology, Harvard Medical School, Boston, MA, USA
| | - Dongyuan Song
- Interdepartmental Program of Bioinformatics, University of California, Los Angeles, CA, USA
| | - Guanao Yan
- Department of Statistics and Data Science, University of California, Los Angeles, CA, USA
| | - Jingyi Jessica Li
- Department of Statistics and Data Science, University of California, Los Angeles, CA, USA
| | - Luca Pinello
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Molecular Pathology Unit, Center for Cancer Research, Massachusetts General Hospital, Boston, MA, USA
- Department of Pathology, Harvard Medical School, Boston, MA, USA
| |
Collapse
|
10
|
Adhikari SD, Yang J, Wang J, Cui Y. A SELECTIVE REVIEW OF RECENT DEVELOPMENTS IN SPATIALLY VARIABLE GENE DETECTION FOR SPATIAL TRANSCRIPTOMICS. ARXIV 2023:arXiv:2311.13801v1. [PMID: 38045476 PMCID: PMC10690303] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Subscribe] [Scholar Register] [Indexed: 12/05/2023]
Abstract
With the emergence of advanced spatial transcriptomic technologies, there has been a surge in research papers dedicated to analyzing spatial transcriptomics data, resulting in significant contributions to our understanding of biology. The initial stage of downstream analysis of spatial transcriptomic data has centered on identifying spatially variable genes (SVGs) or genes expressed with specific spatial patterns across the tissue. SVG detection is an important task since many downstream analyses depend on these selected SVGs. Over the past few years, a plethora of new methods have been proposed for the detection of SVGs, accompanied by numerous innovative concepts and discussions. This article provides a selective review of methods and their practical implementations, offering valuable insights into the current literature in this field.
Collapse
Affiliation(s)
- Sikta Das Adhikari
- Department of Statistics and Probability, Michigan State University, East Lansing, MI 48824, USA
- Department of Computational Mathematics, Science and Engineering, Michigan State University, East Lansing, MI 48824, USA
| | - Jiaxin Yang
- Department of Computational Mathematics, Science and Engineering, Michigan State University, East Lansing, MI 48824, USA
| | - Jianrong Wang
- Department of Computational Mathematics, Science and Engineering, Michigan State University, East Lansing, MI 48824, USA
| | - Yuehua Cui
- Department of Statistics and Probability, Michigan State University, East Lansing, MI 48824, USA
| |
Collapse
|
11
|
Zhou R, Yang G, Zhang Y, Wang Y. Spatial transcriptomics in development and disease. MOLECULAR BIOMEDICINE 2023; 4:32. [PMID: 37806992 PMCID: PMC10560656 DOI: 10.1186/s43556-023-00144-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Accepted: 08/29/2023] [Indexed: 10/10/2023] Open
Abstract
The proper functioning of diverse biological systems depends on the spatial organization of their cells, a critical factor for biological processes like shaping intricate tissue functions and precisely determining cell fate. Nonetheless, conventional bulk or single-cell RNA sequencing methods were incapable of simultaneously capturing both gene expression profiles and the spatial locations of cells. Hence, a multitude of spatially resolved technologies have emerged, offering a novel dimension for investigating regional gene expression, spatial domains, and interactions between cells. Spatial transcriptomics (ST) is a method that maps gene expression in tissue while preserving spatial information. It can reveal cellular heterogeneity, spatial organization and functional interactions in complex biological systems. ST can also complement and integrate with other omics methods to provide a more comprehensive and holistic view of biological systems at multiple levels of resolution. Since the advent of ST, new methods offering higher throughput and resolution have become available, holding significant potential to expedite fresh insights into comprehending biological complexity. Consequently, a rapid increase in associated research has occurred, using these technologies to unravel the spatial complexity during developmental processes or disease conditions. In this review, we summarize the recent advancement of ST in historical, technical, and application contexts. We compare different types of ST methods based on their principles and workflows, and present the bioinformatics tools for analyzing and integrating ST data with other modalities. We also highlight the applications of ST in various domains of biomedical research, especially development and diseases. Finally, we discuss the current limitations and challenges in the field, and propose the future directions of ST.
Collapse
Affiliation(s)
- Ran Zhou
- Department of Neurosurgery, State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu, 610041, China
| | - Gaoxia Yang
- Department of Neurosurgery, State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu, 610041, China
- National Clinical Research Center for Geriatrics, West China Hospital, Sichuan University, Chengdu, 610041, Sichuan, China
| | - Yan Zhang
- National Clinical Research Center for Geriatrics, West China Hospital, Sichuan University, Chengdu, 610041, Sichuan, China.
| | - Yuan Wang
- Department of Neurosurgery, State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu, 610041, China.
| |
Collapse
|
12
|
Yuan Z, Yao J. Harnessing computational spatial omics to explore the spatial biology intricacies. Semin Cancer Biol 2023; 95:25-41. [PMID: 37400044 DOI: 10.1016/j.semcancer.2023.06.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2022] [Revised: 05/09/2023] [Accepted: 06/19/2023] [Indexed: 07/05/2023]
Abstract
Spatially resolved transcriptomics (SRT) has unlocked new dimensions in our understanding of intricate tissue architectures. However, this rapidly expanding field produces a wealth of diverse and voluminous data, necessitating the evolution of sophisticated computational strategies to unravel inherent patterns. Two distinct methodologies, gene spatial pattern recognition (GSPR) and tissue spatial pattern recognition (TSPR), have emerged as vital tools in this process. GSPR methodologies are designed to identify and classify genes exhibiting noteworthy spatial patterns, while TSPR strategies aim to understand intercellular interactions and recognize tissue domains with molecular and spatial coherence. In this review, we provide a comprehensive exploration of SRT, highlighting crucial data modalities and resources that are instrumental for the development of methods and biological insights. We address the complexities and challenges posed by the use of heterogeneous data in developing GSPR and TSPR methodologies and propose an optimal workflow for both. We delve into the latest advancements in GSPR and TSPR, examining their interrelationships. Lastly, we peer into the future, envisaging the potential directions and perspectives in this dynamic field.
Collapse
Affiliation(s)
- Zhiyuan Yuan
- Center for Medical Research and Innovation, Shanghai Pudong Hospital, Fudan University Pudong Medical Center, Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China.
| | | |
Collapse
|
13
|
Heumos L, Schaar AC, Lance C, Litinetskaya A, Drost F, Zappia L, Lücken MD, Strobl DC, Henao J, Curion F, Schiller HB, Theis FJ. Best practices for single-cell analysis across modalities. Nat Rev Genet 2023; 24:550-572. [PMID: 37002403 PMCID: PMC10066026 DOI: 10.1038/s41576-023-00586-w] [Citation(s) in RCA: 156] [Impact Index Per Article: 156.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/14/2023] [Indexed: 04/03/2023]
Abstract
Recent advances in single-cell technologies have enabled high-throughput molecular profiling of cells across modalities and locations. Single-cell transcriptomics data can now be complemented by chromatin accessibility, surface protein expression, adaptive immune receptor repertoire profiling and spatial information. The increasing availability of single-cell data across modalities has motivated the development of novel computational methods to help analysts derive biological insights. As the field grows, it becomes increasingly difficult to navigate the vast landscape of tools and analysis steps. Here, we summarize independent benchmarking studies of unimodal and multimodal single-cell analysis across modalities to suggest comprehensive best-practice workflows for the most common analysis steps. Where independent benchmarks are not available, we review and contrast popular methods. Our article serves as an entry point for novices in the field of single-cell (multi-)omic analysis and guides advanced users to the most recent best practices.
Collapse
Affiliation(s)
- Lukas Heumos
- Institute of Computational Biology, Department of Computational Health, Helmholtz Munich, Munich, Germany
- Institute of Lung Health and Immunity and Comprehensive Pneumology Center, Helmholtz Munich; Member of the German Center for Lung Research (DZL), Munich, Germany
- TUM School of Life Sciences Weihenstephan, Technical University of Munich, Munich, Germany
| | - Anna C Schaar
- Institute of Computational Biology, Department of Computational Health, Helmholtz Munich, Munich, Germany
- Department of Mathematics, School of Computation, Information and Technology, Technical University of Munich, Garching, Germany
- Munich Center for Machine Learning, Technical University of Munich, Garching, Germany
| | - Christopher Lance
- Institute of Computational Biology, Department of Computational Health, Helmholtz Munich, Munich, Germany
- Department of Paediatrics, Dr von Hauner Children's Hospital, University Hospital, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Anastasia Litinetskaya
- Institute of Computational Biology, Department of Computational Health, Helmholtz Munich, Munich, Germany
- Department of Mathematics, School of Computation, Information and Technology, Technical University of Munich, Garching, Germany
| | - Felix Drost
- Institute of Computational Biology, Department of Computational Health, Helmholtz Munich, Munich, Germany
- TUM School of Life Sciences Weihenstephan, Technical University of Munich, Munich, Germany
| | - Luke Zappia
- Institute of Computational Biology, Department of Computational Health, Helmholtz Munich, Munich, Germany
- Department of Mathematics, School of Computation, Information and Technology, Technical University of Munich, Garching, Germany
| | - Malte D Lücken
- Institute of Computational Biology, Department of Computational Health, Helmholtz Munich, Munich, Germany
- Institute of Lung Health and Immunity, Helmholtz Munich, Munich, Germany
| | - Daniel C Strobl
- Institute of Computational Biology, Department of Computational Health, Helmholtz Munich, Munich, Germany
- TUM School of Life Sciences Weihenstephan, Technical University of Munich, Munich, Germany
- Institute of Clinical Chemistry and Pathobiochemistry, School of Medicine, Technical University of Munich, Munich, Germany
- TranslaTUM, Center for Translational Cancer Research, Technical University of Munich, Munich, Germany
| | - Juan Henao
- Institute of Computational Biology, Department of Computational Health, Helmholtz Munich, Munich, Germany
| | - Fabiola Curion
- Institute of Computational Biology, Department of Computational Health, Helmholtz Munich, Munich, Germany
- Department of Mathematics, School of Computation, Information and Technology, Technical University of Munich, Garching, Germany
| | - Herbert B Schiller
- Institute of Lung Health and Immunity and Comprehensive Pneumology Center, Helmholtz Munich; Member of the German Center for Lung Research (DZL), Munich, Germany
| | - Fabian J Theis
- Institute of Computational Biology, Department of Computational Health, Helmholtz Munich, Munich, Germany.
- TUM School of Life Sciences Weihenstephan, Technical University of Munich, Munich, Germany.
- Department of Mathematics, School of Computation, Information and Technology, Technical University of Munich, Garching, Germany.
- Munich Center for Machine Learning, Technical University of Munich, Garching, Germany.
| |
Collapse
|
14
|
Weber LM, Saha A, Datta A, Hansen KD, Hicks SC. nnSVG for the scalable identification of spatially variable genes using nearest-neighbor Gaussian processes. Nat Commun 2023; 14:4059. [PMID: 37429865 DOI: 10.1038/s41467-023-39748-z] [Citation(s) in RCA: 17] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2022] [Accepted: 06/23/2023] [Indexed: 07/12/2023] Open
Abstract
Feature selection to identify spatially variable genes or other biologically informative genes is a key step during analyses of spatially-resolved transcriptomics data. Here, we propose nnSVG, a scalable approach to identify spatially variable genes based on nearest-neighbor Gaussian processes. Our method (i) identifies genes that vary in expression continuously across the entire tissue or within a priori defined spatial domains, (ii) uses gene-specific estimates of length scale parameters within the Gaussian process models, and (iii) scales linearly with the number of spatial locations. We demonstrate the performance of our method using experimental data from several technological platforms and simulations. A software implementation is available at https://bioconductor.org/packages/nnSVG .
Collapse
Affiliation(s)
- Lukas M Weber
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Arkajyoti Saha
- Department of Statistics, University of Washington, Seattle, WA, USA
| | - Abhirup Datta
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Kasper D Hansen
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Stephanie C Hicks
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA.
| |
Collapse
|
15
|
Jones DC, Danaher P, Kim Y, Beechem JM, Gottardo R, Newell EW. An information theoretic approach to detecting spatially varying genes. CELL REPORTS METHODS 2023; 3:100507. [PMID: 37426750 PMCID: PMC10326450 DOI: 10.1016/j.crmeth.2023.100507] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/06/2023] [Revised: 04/03/2023] [Accepted: 05/25/2023] [Indexed: 07/11/2023]
Abstract
A key step in spatial transcriptomics is identifying genes with spatially varying expression patterns. We adopt an information theoretic perspective to this problem by equating the degree of spatial coherence with the Jensen-Shannon divergence between pairs of nearby cells and pairs of distant cells. To avoid the notoriously difficult problem of estimating information theoretic divergences, we use modern approximation techniques to implement a computationally efficient algorithm designed to scale with in situ spatial transcriptomics technologies. In addition to being highly scalable, we show that our method, which we call maximization of spatial information (Maxspin), improves accuracy across several spatial transcriptomics platforms and a variety of simulations when compared with a variety of state-of-the-art methods. To further demonstrate the method, we generated in situ spatial transcriptomics data in a renal cell carcinoma sample using the CosMx Spatial Molecular Imager and used Maxspin to reveal novel spatial patterns of tumor cell gene expression.
Collapse
Affiliation(s)
| | | | - Youngmi Kim
- NanoString Technologies, Inc., Seattle, WA, USA
| | | | - Raphael Gottardo
- Fred Hutchinson Cancer Center, Seattle, WA, USA
- Biomedical Data Science Center, Lausanne University Hospital, University of Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Ludwig Institute for Cancer Research, Lausanne Branch, Lausanne, Switzerland
| | | |
Collapse
|
16
|
Zhao Y, Wang K, Hu G. DIST: spatial transcriptomics enhancement using deep learning. Brief Bioinform 2023; 24:6991203. [PMID: 36653906 DOI: 10.1093/bib/bbad013] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2022] [Revised: 12/19/2022] [Accepted: 01/03/2023] [Indexed: 01/20/2023] Open
Abstract
Spatially resolved transcriptomics technologies enable comprehensive measurement of gene expression patterns in the context of intact tissues. However, existing technologies suffer from either low resolution or shallow sequencing depth. Here, we present DIST, a deep learning-based method that imputes the gene expression profiles on unmeasured locations and enhances the gene expression for both original measured spots and imputed spots by self-supervised learning and transfer learning. We evaluate the performance of DIST for imputation, clustering, differential expression analysis and functional enrichment analysis. The results show that DIST can impute the gene expression accurately, enhance the gene expression for low-quality data, help detect more biological meaningful differentially expressed genes and pathways, therefore allow for deeper insights into the biological processes.
Collapse
Affiliation(s)
- Yanping Zhao
- School of Statistics and Data Science, LPMC and KLMDASR, Nankai University, Tianjin, China
| | - Kui Wang
- School of Statistics and Data Science, LPMC and KLMDASR, Nankai University, Tianjin, China
| | - Gang Hu
- School of Statistics and Data Science, LPMC and KLMDASR, Nankai University, Tianjin, China
| |
Collapse
|
17
|
Yue L, Liu F, Hu J, Yang P, Wang Y, Dong J, Shu W, Huang X, Wang S. A guidebook of spatial transcriptomic technologies, data resources and analysis approaches. Comput Struct Biotechnol J 2023; 21:940-955. [PMID: 38213887 PMCID: PMC10781722 DOI: 10.1016/j.csbj.2023.01.016] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2022] [Revised: 01/13/2023] [Accepted: 01/14/2023] [Indexed: 01/18/2023] Open
Abstract
Advances in transcriptomic technologies have deepened our understanding of the cellular gene expression programs of multicellular organisms and provided a theoretical basis for disease diagnosis and therapy. However, both bulk and single-cell RNA sequencing approaches lose the spatial context of cells within the tissue microenvironment, and the development of spatial transcriptomics has made overall bias-free access to both transcriptional information and spatial information possible. Here, we elaborate development of spatial transcriptomic technologies to help researchers select the best-suited technology for their goals and integrate the vast amounts of data to facilitate data accessibility and availability. Then, we marshal various computational approaches to analyze spatial transcriptomic data for various purposes and describe the spatial multimodal omics and its potential for application in tumor tissue. Finally, we provide a detailed discussion and outlook of the spatial transcriptomic technologies, data resources and analysis approaches to guide current and future research on spatial transcriptomics.
Collapse
Affiliation(s)
- Liangchen Yue
- Beijing Institute of Microbiology and Epidemiology, Beijing 100850, China
| | - Feng Liu
- College of Medical Informatics, Chongqing Medical University, Chongqing 400016, China
| | - Jiongsong Hu
- University of South China, Hengyang, Hunan 421001, China
| | - Pin Yang
- Anhui Medical University, Hefei 230022, Anhui, China
| | - Yuxiang Wang
- Beijing Institute of Microbiology and Epidemiology, Beijing 100850, China
| | - Junguo Dong
- Beijing Institute of Microbiology and Epidemiology, Beijing 100850, China
| | - Wenjie Shu
- Beijing Institute of Microbiology and Epidemiology, Beijing 100850, China
| | - Xingxu Huang
- Zhejiang Provincial Key Laboratory of Pancreatic Disease, the First Affiliated Hospital, and Institute of Translational Medicine, Zhejiang University School of Medicine, Hangzhou 310029, China
- School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China
| | - Shengqi Wang
- Beijing Institute of Microbiology and Epidemiology, Beijing 100850, China
| |
Collapse
|
18
|
Duan H, Cheng T, Cheng H. Spatially resolved transcriptomics: advances and applications. BLOOD SCIENCE 2023; 5:1-14. [PMID: 36742187 PMCID: PMC9891446 DOI: 10.1097/bs9.0000000000000141] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2022] [Accepted: 10/19/2022] [Indexed: 11/06/2022] Open
Abstract
Spatial transcriptomics, which is capable of both measuring all gene activity in a tissue sample and mapping where this activity occurs, is vastly improving our understanding of biological processes and disease. The field has expanded rapidly in recent years, and the development of several new technologies has resulted in spatially resolved transcriptomics (SRT) becoming highly multiplexed, high-resolution, and high-throughput. Here, we summarize and compare the major methods of SRT, including imaging-based methods, sequencing-based methods, and in situ sequencing methods. We also highlight some typical applications of SRT in neuroscience, cancer biology, developmental biology, and hematology. Finally, we discuss future possibilities for improving spatially resolved transcriptomic methods and the expected applications of such methods, especially in the adult bone marrow, anticipating that new developments will unlock the full potential of spatially resolved multi-omics in both biological research and the clinic.
Collapse
Affiliation(s)
- Honglin Duan
- State Key Laboratory of Experimental Hematology, Haihe Laboratory of Cell Ecosystem, National Clinical Research Center for Blood Diseases, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin, China
| | - Tao Cheng
- State Key Laboratory of Experimental Hematology, Haihe Laboratory of Cell Ecosystem, National Clinical Research Center for Blood Diseases, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin, China
- Center for Stem Cell Medicine, Chinese Academy of Medical Sciences, Tianjin, China
- Department of Stem Cell & Regenerative Medicine, Peking Union Medical College, Tianjin, China
| | - Hui Cheng
- State Key Laboratory of Experimental Hematology, Haihe Laboratory of Cell Ecosystem, National Clinical Research Center for Blood Diseases, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin, China
- Center for Stem Cell Medicine, Chinese Academy of Medical Sciences, Tianjin, China
- Department of Stem Cell & Regenerative Medicine, Peking Union Medical College, Tianjin, China
| |
Collapse
|
19
|
Bernstein MN, Ni Z, Prasad A, Brown J, Mohanty C, Stewart R, Newton MA, Kendziorski C. SpatialCorr identifies gene sets with spatially varying correlation structure. CELL REPORTS METHODS 2022; 2:100369. [PMID: 36590683 PMCID: PMC9795364 DOI: 10.1016/j.crmeth.2022.100369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/16/2022] [Revised: 09/26/2022] [Accepted: 11/21/2022] [Indexed: 12/15/2022]
Abstract
Recent advances in spatially resolved transcriptomics technologies enable both the measurement of genome-wide gene expression profiles and their mapping to spatial locations within a tissue. A first step in spatial transcriptomics data analysis is identifying genes with expression that varies spatially, and robust statistical methods exist to address this challenge. While useful, these methods do not detect spatial changes in the coordinated expression within a group of genes. To this end, we present SpatialCorr, a method for identifying sets of genes with spatially varying correlation structure. Given a collection of gene sets pre-defined by a user, SpatialCorr tests for spatially induced differences in the correlation of each gene set within tissue regions, as well as between and among regions. An application to cutaneous squamous cell carcinoma demonstrates the power of the approach for revealing biological insights not identified using existing methods.
Collapse
Affiliation(s)
| | - Zijian Ni
- Department of Statistics, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Aman Prasad
- Department of Dermatology, University of Wisconsin-Madison, Madison, WI 53715, USA
| | - Jared Brown
- Department of Statistics, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Chitrasen Mohanty
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI 53792, USA
| | - Ron Stewart
- Morgridge Institute for Research, Madison, WI 53715, USA
| | - Michael A. Newton
- Department of Statistics, University of Wisconsin-Madison, Madison, WI 53706, USA
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI 53792, USA
| | - Christina Kendziorski
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI 53792, USA
| |
Collapse
|
20
|
Overbey EG, Das S, Cope H, Madrigal P, Andrusivova Z, Frapard S, Klotz R, Bezdan D, Gupta A, Scott RT, Park J, Chirko D, Galazka JM, Costes SV, Mason CE, Herranz R, Szewczyk NJ, Borg J, Giacomello S. Challenges and considerations for single-cell and spatially resolved transcriptomics sample collection during spaceflight. CELL REPORTS METHODS 2022; 2:100325. [PMID: 36452864 PMCID: PMC9701605 DOI: 10.1016/j.crmeth.2022.100325] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
Single-cell RNA sequencing (scRNA-seq) and spatially resolved transcriptomics (SRT) have experienced rapid development in recent years. The findings of spaceflight-based scRNA-seq and SRT investigations are likely to improve our understanding of life in space and our comprehension of gene expression in various cell systems and tissue dynamics. However, compared to their Earth-based counterparts, gene expression experiments conducted in spaceflight have not experienced the same pace of development. Out of the hundreds of spaceflight gene expression datasets available, only a few used scRNA-seq and SRT. In this perspective piece, we explore the growing importance of scRNA-seq and SRT in space biology and discuss the challenges and considerations relevant to robust experimental design to enable growth of these methods in the field.
Collapse
Affiliation(s)
- Eliah G. Overbey
- Weill Cornell Medicine, New York, NY, USA
- Institute for Computational Biomedicine, New York, NY, USA
| | - Saswati Das
- Department of Biochemistry, Atal Bihari Vajpayee Institute of Medical Sciences & Dr. Ram Manohar Lohia Hospital, New Delhi, India
| | - Henry Cope
- School of Medicine, University of Nottingham, Derby DE22 3DT, UK
| | - Pedro Madrigal
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, UK
| | - Zaneta Andrusivova
- Science for Life Laboratory, Department of Gene Technology, KTH Royal Institute of Technology, Stockholm, Sweden
| | - Solène Frapard
- Science for Life Laboratory, Department of Gene Technology, KTH Royal Institute of Technology, Stockholm, Sweden
| | - Rebecca Klotz
- KBR, Space Biosciences Division, NASA Ames Research Center, Moffett Field, CA 94035, USA
| | - Daniela Bezdan
- Institute of Medical Genetics and Applied Genomics, University of Tübingen, Tübingen 72076, Germany
- NGS Competence Center Tübingen (NCCT), University of Tübingen, Tübingen, German
- yuri GmbH, Meckenbeuren, Germany
| | | | - Ryan T. Scott
- KBR, Space Biosciences Division, NASA Ames Research Center, Moffett Field, CA 94035, USA
| | | | | | - Jonathan M. Galazka
- Space Biosciences Division, NASA Ames Research Center, Moffett Field, CA 94035, USA
| | - Sylvain V. Costes
- Space Biosciences Division, NASA Ames Research Center, Moffett Field, CA 94035, USA
| | - Christopher E. Mason
- Weill Cornell Medicine, New York, NY, USA
- Institute for Computational Biomedicine, New York, NY, USA
- The Feil Family Brain and Mind Research Institute, New York, NY, USA
- The WorldQuant Initiative for Quantitative Prediction, New York, NY, USA
| | - Raul Herranz
- Centro de Investigaciones Biológicas Margarita Salas (CSIC), Madrid 28040, Spain
| | - Nathaniel J. Szewczyk
- School of Medicine, University of Nottingham, Derby DE22 3DT, UK
- Department of Biomedical Sciences, Heritage College of Osteopathic Medicine, Ohio University, Athens, OH 45701, USA
| | - Joseph Borg
- Department of Applied Biomedical Science, Faculty of Health Sciences, University of Malta, Msida, Malta
| | - Stefania Giacomello
- Science for Life Laboratory, Department of Gene Technology, KTH Royal Institute of Technology, Stockholm, Sweden
| |
Collapse
|
21
|
Liu Q, Hsu CY, Shyr Y. Scalable and model-free detection of spatial patterns and colocalization. Genome Res 2022; 32:1736-1745. [PMID: 36223499 PMCID: PMC9528978 DOI: 10.1101/gr.276851.122] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2022] [Accepted: 08/16/2022] [Indexed: 11/24/2022]
Abstract
The expeditious growth in spatial omics technologies enables the profiling of genome-wide molecular events at molecular and single-cell resolution, highlighting a need for fast and reliable methods to characterize spatial patterns. We developed SpaGene, a model-free method to discover spatial patterns rapidly in large-scale spatial omics studies. Analyzing simulation and a variety of spatially resolved transcriptomics data showed that SpaGene is more powerful and scalable than existing methods. Spatial expression patterns identified by SpaGene reconstruct unobserved tissue structures. SpaGene also successfully discovers ligand-receptor interactions through their colocalization.
Collapse
Affiliation(s)
- Qi Liu
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, Tennessee 37232, USA
- Center for Quantitative Sciences, Vanderbilt University Medical Center, Nashville, Tennessee 37232, USA
| | - Chih-Yuan Hsu
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, Tennessee 37232, USA
- Center for Quantitative Sciences, Vanderbilt University Medical Center, Nashville, Tennessee 37232, USA
| | - Yu Shyr
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, Tennessee 37232, USA
- Center for Quantitative Sciences, Vanderbilt University Medical Center, Nashville, Tennessee 37232, USA
| |
Collapse
|
22
|
Williams CG, Lee HJ, Asatsuma T, Vento-Tormo R, Haque A. An introduction to spatial transcriptomics for biomedical research. Genome Med 2022; 14:68. [PMID: 35761361 PMCID: PMC9238181 DOI: 10.1186/s13073-022-01075-1] [Citation(s) in RCA: 206] [Impact Index Per Article: 103.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2021] [Accepted: 06/19/2022] [Indexed: 01/04/2023] Open
Abstract
Single-cell transcriptomics (scRNA-seq) has become essential for biomedical research over the past decade, particularly in developmental biology, cancer, immunology, and neuroscience. Most commercially available scRNA-seq protocols require cells to be recovered intact and viable from tissue. This has precluded many cell types from study and largely destroys the spatial context that could otherwise inform analyses of cell identity and function. An increasing number of commercially available platforms now facilitate spatially resolved, high-dimensional assessment of gene transcription, known as 'spatial transcriptomics'. Here, we introduce different classes of method, which either record the locations of hybridized mRNA molecules in tissue, image the positions of cells themselves prior to assessment, or employ spatial arrays of mRNA probes of pre-determined location. We review sizes of tissue area that can be assessed, their spatial resolution, and the number and types of genes that can be profiled. We discuss if tissue preservation influences choice of platform, and provide guidance on whether specific platforms may be better suited to discovery screens or hypothesis testing. Finally, we introduce bioinformatic methods for analysing spatial transcriptomic data, including pre-processing, integration with existing scRNA-seq data, and inference of cell-cell interactions. Spatial -omics methods are already improving our understanding of human tissues in research, diagnostic, and therapeutic settings. To build upon these recent advancements, we provide entry-level guidance for those seeking to employ spatial transcriptomics in their own biomedical research.
Collapse
Affiliation(s)
- Cameron G Williams
- Department of Microbiology and Immunology, University of Melbourne, located at the Peter Doherty Institute for Infection and Immunity, Parkville, VIC, 3000, Australia
| | - Hyun Jae Lee
- Department of Microbiology and Immunology, University of Melbourne, located at the Peter Doherty Institute for Infection and Immunity, Parkville, VIC, 3000, Australia
| | - Takahiro Asatsuma
- Department of Microbiology and Immunology, University of Melbourne, located at the Peter Doherty Institute for Infection and Immunity, Parkville, VIC, 3000, Australia
| | - Roser Vento-Tormo
- Cellular Genetics Group, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Ashraful Haque
- Department of Microbiology and Immunology, University of Melbourne, located at the Peter Doherty Institute for Infection and Immunity, Parkville, VIC, 3000, Australia.
| |
Collapse
|
23
|
Zeng Z, Li Y, Li Y, Luo Y. Statistical and machine learning methods for spatially resolved transcriptomics data analysis. Genome Biol 2022; 23:83. [PMID: 35337374 PMCID: PMC8951701 DOI: 10.1186/s13059-022-02653-7] [Citation(s) in RCA: 48] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2021] [Accepted: 03/15/2022] [Indexed: 01/28/2023] Open
Abstract
The recent advancement in spatial transcriptomics technology has enabled multiplexed profiling of cellular transcriptomes and spatial locations. As the capacity and efficiency of the experimental technologies continue to improve, there is an emerging need for the development of analytical approaches. Furthermore, with the continuous evolution of sequencing protocols, the underlying assumptions of current analytical methods need to be re-evaluated and adjusted to harness the increasing data complexity. To motivate and aid future model development, we herein review the recent development of statistical and machine learning methods in spatial transcriptomics, summarize useful resources, and highlight the challenges and opportunities ahead.
Collapse
Affiliation(s)
- Zexian Zeng
- Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, 100084, China
- Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, 100084, China
- Department of Data Sciences, Dana Farber Cancer Institute, Harvard T.H. Chan School of Public Health, Boston, MA, 02215, USA
| | - Yawei Li
- Division of Health and Biomedical Informatics, Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, 60611, USA
| | - Yiming Li
- Division of Health and Biomedical Informatics, Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, 60611, USA
| | - Yuan Luo
- Division of Health and Biomedical Informatics, Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, 60611, USA.
- Northwestern University Clinical and Translational Sciences Institute, Chicago, IL, 60611, USA.
- Institute for Augmented Intelligence in Medicine, Northwestern University, Chicago, IL, 60611, USA.
- Center for Health Information Partnerships, Northwestern University, Chicago, IL, 60611, USA.
| |
Collapse
|
24
|
Walker BL, Cang Z, Ren H, Bourgain-Chang E, Nie Q. Deciphering tissue structure and function using spatial transcriptomics. Commun Biol 2022; 5:220. [PMID: 35273328 PMCID: PMC8913632 DOI: 10.1038/s42003-022-03175-5] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Accepted: 02/16/2022] [Indexed: 01/31/2023] Open
Abstract
The rapid development of spatial transcriptomics (ST) techniques has allowed the measurement of transcriptional levels across many genes together with the spatial positions of cells. This has led to an explosion of interest in computational methods and techniques for harnessing both spatial and transcriptional information in analysis of ST datasets. The wide diversity of approaches in aim, methodology and technology for ST provides great challenges in dissecting cellular functions in spatial contexts. Here, we synthesize and review the key problems in analysis of ST data and methods that are currently applied, while also expanding on open questions and areas of future development.
Collapse
Affiliation(s)
- Benjamin L. Walker
- grid.266093.80000 0001 0668 7243The NSF-Simons Center for Multiscale Cell Fate Research, University of California Irvine, Irvine, CA USA ,grid.266093.80000 0001 0668 7243Department of Mathematics, University of California Irvine, Irvine, CA USA
| | - Zixuan Cang
- grid.266093.80000 0001 0668 7243The NSF-Simons Center for Multiscale Cell Fate Research, University of California Irvine, Irvine, CA USA ,grid.266093.80000 0001 0668 7243Department of Mathematics, University of California Irvine, Irvine, CA USA
| | - Honglei Ren
- grid.266093.80000 0001 0668 7243The NSF-Simons Center for Multiscale Cell Fate Research, University of California Irvine, Irvine, CA USA ,grid.266093.80000 0001 0668 7243Department of Mathematics, University of California Irvine, Irvine, CA USA
| | - Eric Bourgain-Chang
- grid.266093.80000 0001 0668 7243Department of Mathematics, University of California Irvine, Irvine, CA USA
| | - Qing Nie
- grid.266093.80000 0001 0668 7243The NSF-Simons Center for Multiscale Cell Fate Research, University of California Irvine, Irvine, CA USA ,grid.266093.80000 0001 0668 7243Department of Mathematics, University of California Irvine, Irvine, CA USA ,grid.266093.80000 0001 0668 7243Department of Developmental and Cell Biology, University of California Irvine, Irvine, CA USA
| |
Collapse
|
25
|
Spatial components of molecular tissue biology. Nat Biotechnol 2022; 40:308-318. [PMID: 35132261 DOI: 10.1038/s41587-021-01182-1] [Citation(s) in RCA: 99] [Impact Index Per Article: 49.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2020] [Accepted: 12/03/2021] [Indexed: 02/06/2023]
Abstract
Methods for profiling RNA and protein expression in a spatially resolved manner are rapidly evolving, making it possible to comprehensively characterize cells and tissues in health and disease. To maximize the biological insights obtained using these techniques, it is critical to both clearly articulate the key biological questions in spatial analysis of tissues and develop the requisite computational tools to address them. Developers of analytical tools need to decide on the intrinsic molecular features of each cell that need to be considered, and how cell shape and morphological features are incorporated into the analysis. Also, optimal ways to compare different tissue samples at various length scales are still being sought. Grouping these biological problems and related computational algorithms into classes across length scales, thus characterizing common issues that need to be addressed, will facilitate further progress in spatial transcriptomics and proteomics.
Collapse
|
26
|
Abstract
AbstractSpatial omics data are advancing the study of tissue organization and cellular communication at an unprecedented scale. Flexible tools are required to store, integrate and visualize the large diversity of spatial omics data. Here, we present Squidpy, a Python framework that brings together tools from omics and image analysis to enable scalable description of spatial molecular data, such as transcriptome or multivariate proteins. Squidpy provides efficient infrastructure and numerous analysis methods that allow to efficiently store, manipulate and interactively visualize spatial omics data. Squidpy is extensible and can be interfaced with a variety of already existing libraries for the scalable analysis of spatial omics data.
Collapse
|
27
|
Abstract
Spatial transcriptomics is a rapidly growing field that promises to comprehensively characterize tissue organization and architecture at the single-cell or subcellular resolution. Such information provides a solid foundation for mechanistic understanding of many biological processes in both health and disease that cannot be obtained by using traditional technologies. The development of computational methods plays important roles in extracting biological signals from raw data. Various approaches have been developed to overcome technology-specific limitations such as spatial resolution, gene coverage, sensitivity, and technical biases. Downstream analysis tools formulate spatial organization and cell-cell communications as quantifiable properties, and provide algorithms to derive such properties. Integrative pipelines further assemble multiple tools in one package, allowing biologists to conveniently analyze data from beginning to end. In this review, we summarize the state of the art of spatial transcriptomic data analysis methods and pipelines, and discuss how they operate on different technological platforms.
Collapse
Affiliation(s)
- Ruben Dries
- Department of Medicine, Boston University School of Medicine, Boston, Massachusetts 02118, USA
- Bioinformatics Graduate Program, Boston University, Boston, Massachusetts 02215, USA
- Section of Computational Biomedicine, Boston University School of Medicine, Boston, Massachusetts 02118, USA
| | - Jiaji Chen
- Department of Medicine, Boston University School of Medicine, Boston, Massachusetts 02118, USA
| | - Natalie Del Rossi
- Department of Genetics and Genomic Sciences, Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York 10029, USA
| | - Mohammed Muzamil Khan
- Department of Medicine, Boston University School of Medicine, Boston, Massachusetts 02118, USA
- Bioinformatics Graduate Program, Boston University, Boston, Massachusetts 02215, USA
- Section of Computational Biomedicine, Boston University School of Medicine, Boston, Massachusetts 02118, USA
| | - Adriana Sistig
- Department of Genetics and Genomic Sciences, Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York 10029, USA
| | - Guo-Cheng Yuan
- Department of Genetics and Genomic Sciences, Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York 10029, USA
- Precision Immunology Institute, Icahn School of Medicine at Mount Sinai, New York, New York 10029, USA
| |
Collapse
|