1
|
Liu C, Li X, Hu Q, Jia Z, Ye Q, Wang X, Zhao K, Liu L, Wang M. Decoding the blueprints of embryo development with single-cell and spatial omics. Semin Cell Dev Biol 2025; 167:22-39. [PMID: 39889540 DOI: 10.1016/j.semcdb.2025.01.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Revised: 01/18/2025] [Accepted: 01/18/2025] [Indexed: 02/03/2025]
Abstract
Embryonic development is a complex and intricately regulated process that encompasses precise control over cell differentiation, morphogenesis, and the underlying gene expression changes. Recent years have witnessed a remarkable acceleration in the development of single-cell and spatial omic technologies, enabling high-throughput profiling of transcriptomic and other multi-omic information at the individual cell level. These innovations offer fresh and multifaceted perspectives for investigating the intricate cellular and molecular mechanisms that govern embryonic development. In this review, we provide an in-depth exploration of the latest technical advancements in single-cell and spatial multi-omic methodologies and compile a systematic catalog of their applications in the field of embryonic development. We deconstruct the research strategies employed by recent studies that leverage single-cell sequencing techniques and underscore the unique advantages of spatial transcriptomics. Furthermore, we delve into both the current applications, data analysis algorithms and the untapped potential of these technologies in advancing our understanding of embryonic development. With the continuous evolution of multi-omic technologies, we anticipate their widespread adoption and profound contributions to unraveling the intricate molecular foundations underpinning embryo development in the foreseeable future.
Collapse
Affiliation(s)
- Chang Liu
- BGI Research, Hangzhou 310030, China; BGI Research, Shenzhen 518083, China; Shanxi Medical University-BGI Collaborative Center for Future Medicine, Shanxi Medical University, Taiyuan 030001, China; Shenzhen Proof-of-Concept Center of Digital Cytopathology, BGI Research, Shenzhen 518083, China
| | | | - Qinan Hu
- Shenzhen Key Laboratory of Gene Regulation and Systems Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen 518005, China; Department of Pharmacology, School of Medicine, Southern University of Science and Technology, Shenzhen 518005, China
| | - Zihan Jia
- BGI Research, Hangzhou 310030, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Qing Ye
- BGI Research, Hangzhou 310030, China; China Jiliang University, Hangzhou 310018, China
| | | | - Kaichen Zhao
- College of Biomedicine and Health, College of Life Science and Technology, Huazhong Agricultural University, Wuhan, Hubei 430070, China
| | - Longqi Liu
- BGI Research, Hangzhou 310030, China; Shanxi Medical University-BGI Collaborative Center for Future Medicine, Shanxi Medical University, Taiyuan 030001, China.
| | - Mingyue Wang
- BGI Research, Hangzhou 310030, China; Key Laboratory of Spatial Omics of Zhejiang Province, BGI Research, Hangzhou 310030, China.
| |
Collapse
|
2
|
Kumaran G, Carroll L, Muirhead N, Bottomley MJ. How Can Spatial Transcriptomic Profiling Advance Our Understanding of Skin Diseases? J Invest Dermatol 2025; 145:522-535. [PMID: 39177547 DOI: 10.1016/j.jid.2024.07.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2024] [Revised: 05/23/2024] [Accepted: 07/04/2024] [Indexed: 08/24/2024]
Abstract
Spatial transcriptomic (ST) profiling is the mapping of gene expression within cell populations with preservation of positional context and represents an exciting new approach to develop our understanding of local and regional influences upon skin biology in health and disease. With the ability to probe from a few hundred transcripts to the entire transcriptome, multiple ST approaches are now widely available. In this paper, we review the ST field and discuss its application to dermatology. Its potential to advance our understanding of skin biology in health and disease is highlighted through the illustrative examples of 3 research areas: cutaneous aging, tumorigenesis, and psoriasis.
Collapse
Affiliation(s)
- Girishkumar Kumaran
- Chinese Academy of Medical Sciences Oxford Institute, Nuffield Department of Medicine, University of Oxford, Oxford, United Kingdom
| | - Liam Carroll
- Chinese Academy of Medical Sciences Oxford Institute, Nuffield Department of Medicine, University of Oxford, Oxford, United Kingdom
| | | | - Matthew J Bottomley
- Chinese Academy of Medical Sciences Oxford Institute, Nuffield Department of Medicine, University of Oxford, Oxford, United Kingdom.
| |
Collapse
|
3
|
Millard N, Chen JH, Palshikar MG, Pelka K, Spurrell M, Price C, He J, Hacohen N, Raychaudhuri S, Korsunsky I. Batch correcting single-cell spatial transcriptomics count data with Crescendo improves visualization and detection of spatial gene patterns. Genome Biol 2025; 26:36. [PMID: 40001084 PMCID: PMC11863647 DOI: 10.1186/s13059-025-03479-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2024] [Accepted: 01/21/2025] [Indexed: 02/27/2025] Open
Abstract
Spatial transcriptomics facilitates gene expression analysis of cells in their spatial anatomical context. Batch effects hinder visualization of gene spatial patterns across samples. We present the Crescendo algorithm to correct for batch effects at the gene expression level and enable accurate visualization of gene expression patterns across multiple samples. We show Crescendo's utility and scalability across three datasets ranging from 170,000 to 7 million single cells across spatial and single-cell RNA sequencing technologies. By correcting for batch effects, Crescendo enhances spatial transcriptomics analyses to detect gene colocalization and ligand-receptor interactions and enables cross-technology information transfer.
Collapse
Affiliation(s)
- Nghia Millard
- Division of Rheumatology, Inflammation and Immunity, Brigham and Women's Hospital, Boston, MA, USA
- Division of Genetics, Brigham and Women's Hospital, Boston, MA, USA
- Center for Data Sciences, Brigham and Women's Hospital, Boston, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Jonathan H Chen
- Harvard Medical School, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Massachusetts General Hospital (MGH) Cancer Center, Harvard Medical School, Boston, MA, USA
- Department of Pathology, MGH, Boston, MA, USA
| | - Mukta G Palshikar
- Division of Genetics, Brigham and Women's Hospital, Boston, MA, USA
- Center for Data Sciences, Brigham and Women's Hospital, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| | - Karin Pelka
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Massachusetts General Hospital (MGH) Cancer Center, Harvard Medical School, Boston, MA, USA
- UCSF Institute of Genomic Immunology, Gladstone Institutes, San Francisco, CA, USA
| | - Maxwell Spurrell
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Massachusetts General Hospital (MGH) Cancer Center, Harvard Medical School, Boston, MA, USA
- Department of Pathology, MGH, Boston, MA, USA
| | | | | | - Nir Hacohen
- Department of Immunology, Harvard Medical School, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Massachusetts General Hospital (MGH) Cancer Center, Harvard Medical School, Boston, MA, USA
| | - Soumya Raychaudhuri
- Division of Rheumatology, Inflammation and Immunity, Brigham and Women's Hospital, Boston, MA, USA.
- Division of Genetics, Brigham and Women's Hospital, Boston, MA, USA.
- Center for Data Sciences, Brigham and Women's Hospital, Boston, MA, USA.
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
- Department of Medicine, Harvard Medical School, Boston, MA, USA.
- Harvard Medical School, Boston, MA, USA.
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| | - Ilya Korsunsky
- Division of Genetics, Brigham and Women's Hospital, Boston, MA, USA.
- Center for Data Sciences, Brigham and Women's Hospital, Boston, MA, USA.
- Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
4
|
Xu K, Xu Y, Wang Z, Zhou XM, Zhang L. stDyer enables spatial domain clustering with dynamic graph embedding. Genome Biol 2025; 26:34. [PMID: 39980033 PMCID: PMC11843776 DOI: 10.1186/s13059-025-03503-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2024] [Accepted: 02/12/2025] [Indexed: 02/22/2025] Open
Abstract
Spatially resolved transcriptomics (SRT) data provide critical insights into gene expression patterns within tissue contexts, necessitating effective methods for identifying spatial domains. We introduce stDyer, an end-to-end deep learning framework for spatial domain clustering in SRT data. stDyer combines Gaussian Mixture Variational AutoEncoder with graph attention networks to learn embeddings and perform clustering. Its dynamic graphs adaptively link units based on Gaussian Mixture assignments, improving clustering and producing smoother domain boundaries. stDyer's mini-batch strategy and multi-GPU support facilitate scalability to large datasets. Benchmarking against state-of-the-art tools, stDyer demonstrates superior performance in spatial domain clustering, multi-slice analysis, and large-scale dataset handling.
Collapse
Affiliation(s)
- Ke Xu
- Department of Computer Science, Hong Kong Baptist University, Hong Kong, China
| | - Yu Xu
- Department of Computer Science, Hong Kong Baptist University, Hong Kong, China
| | - Zirui Wang
- Department of Computer Science, Hong Kong Baptist University, Hong Kong, China
| | - Xin Maizie Zhou
- Department of Biomedical Engineering, Vanderbilt University, 2301 Vanderbilt Place, 37235, Nashville, Tennessee, USA.
| | - Lu Zhang
- Department of Computer Science, Hong Kong Baptist University, Hong Kong, China.
- Institute of Systems Medicine and Health Sciences, School of Chinese Medicine, Hong Kong Baptist University, Hong Kong, China.
| |
Collapse
|
5
|
Torok J, Maia PD, Anand C, Raj A. Searching for the cellular underpinnings of the selective vulnerability to tauopathic insults in Alzheimer's disease. Commun Biol 2025; 8:195. [PMID: 39920421 PMCID: PMC11806020 DOI: 10.1038/s42003-025-07575-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2024] [Accepted: 01/17/2025] [Indexed: 02/09/2025] Open
Abstract
Neurodegenerative diseases such as Alzheimer's disease exhibit pathological changes in the brain that proceed in a stereotyped and regionally specific fashion. However, the cellular underpinnings of regional vulnerability are poorly understood, in part because whole-brain maps of a comprehensive collection of cell types have been inaccessible. Here, we deployed a recent cell-type mapping pipeline, Matrix Inversion and Subset Selection (MISS), to determine the brain-wide distributions of pan-hippocampal and neocortical cells in the mouse, and then used these maps to identify general principles of cell-type-based selective vulnerability in PS19 mouse models. We found that hippocampal glutamatergic neurons as a whole were significantly positively associated with regional tau deposition, suggesting vulnerability, while cortical glutamatergic and GABAergic neurons were negatively associated. We also identified oligodendrocytes as the single-most strongly negatively associated cell type. Further, cell-type distributions were more predictive of end-time-point tau pathology than AD-risk-gene expression. Using gene ontology analysis, we found that the genes that are directly correlated to tau pathology are functionally distinct from those that constitutively embody the vulnerable cells. In short, we have elucidated cell-type correlates of tau deposition across mouse models of tauopathy, advancing our understanding of selective cellular vulnerability at a whole-brain level.
Collapse
Affiliation(s)
- Justin Torok
- University of CAlifornia, San Francisco, Department of Radiology, San Francisco, CA, 94143, USA
| | - Pedro D Maia
- University of Texas at Arlington, Department of Mathematics, Arlington, TX, 76019, USA
| | - Chaitali Anand
- University of CAlifornia, San Francisco, Institute for Neurodegenerative Diseases, San Francisco, CA, 94143, USA
| | - Ashish Raj
- University of CAlifornia, San Francisco, Department of Radiology, San Francisco, CA, 94143, USA.
| |
Collapse
|
6
|
Hu B, He R, Pang K, Wang G, Wang N, Zhu W, Sui X, Teng H, Liu T, Zhu J, Jiang Z, Zhang J, Zuo Z, Wang W, Ji P, Zhao F. High-resolution spatially resolved proteomics of complex tissues based on microfluidics and transfer learning. Cell 2025; 188:734-748.e22. [PMID: 39855194 DOI: 10.1016/j.cell.2024.12.023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2024] [Revised: 10/12/2024] [Accepted: 12/17/2024] [Indexed: 01/27/2025]
Abstract
Despite recent advances in imaging- and antibody-based methods, achieving in-depth, high-resolution protein mapping across entire tissues remains a significant challenge in spatial proteomics. Here, we present parallel-flow projection and transfer learning across omics data (PLATO), an integrated framework combining microfluidics with deep learning to enable high-resolution mapping of thousands of proteins in whole tissue sections. We validated the PLATO framework by profiling the spatial proteome of the mouse cerebellum, identifying 2,564 protein groups in a single run. We then applied PLATO to rat villus and human breast cancer samples, achieving a spatial resolution of 25 μm and uncovering proteomic dynamics associated with disease states. This approach revealed spatially distinct tumor subtypes, identified key dysregulated proteins, and provided novel insights into the complexity of the tumor microenvironment. We believe that PLATO represents a transformative platform for exploring spatial proteomic regulation and its interplay with genetic and environmental factors.
Collapse
Affiliation(s)
- Beiyu Hu
- Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; Key Laboratory of Systems Biology, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, China
| | - Ruiqiao He
- Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Kun Pang
- Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Guibin Wang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Ning Wang
- Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Wenzhuo Zhu
- Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Xin Sui
- Key Laboratory of Carcinogenesis and Translational Research, Department of Radiation Oncology, Peking University Cancer Hospital and Institute, Beijing 100142, China
| | - Huajing Teng
- Key Laboratory of Carcinogenesis and Translational Research, Department of Radiation Oncology, Peking University Cancer Hospital and Institute, Beijing 100142, China
| | - Tianxin Liu
- Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Junjie Zhu
- Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; Key Laboratory of Systems Biology, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, China
| | - Zewen Jiang
- Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Jinyang Zhang
- Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Zhenqiang Zuo
- Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Weihu Wang
- Key Laboratory of Carcinogenesis and Translational Research, Department of Radiation Oncology, Peking University Cancer Hospital and Institute, Beijing 100142, China
| | - Peifeng Ji
- Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China.
| | - Fangqing Zhao
- Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; Key Laboratory of Systems Biology, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, China; University of Chinese Academy of Sciences, Beijing 100049, China.
| |
Collapse
|
7
|
Haviv D, Remšík J, Gatie M, Snopkowski C, Takizawa M, Pereira N, Bashkin J, Jovanovich S, Nawy T, Chaligne R, Boire A, Hadjantonakis AK, Pe'er D. The covariance environment defines cellular niches for spatial inference. Nat Biotechnol 2025; 43:269-280. [PMID: 38565973 PMCID: PMC11445396 DOI: 10.1038/s41587-024-02193-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Accepted: 02/28/2024] [Indexed: 04/04/2024]
Abstract
A key challenge of analyzing data from high-resolution spatial profiling technologies is to suitably represent the features of cellular neighborhoods or niches. Here we introduce the covariance environment (COVET), a representation that leverages the gene-gene covariate structure across cells in the niche to capture the multivariate nature of cellular interactions within it. We define a principled optimal transport-based distance metric between COVET niches that scales to millions of cells. Using COVET to encode spatial context, we developed environmental variational inference (ENVI), a conditional variational autoencoder that jointly embeds spatial and single-cell RNA sequencing data into a latent space. ENVI includes two decoders: one to impute gene expression across the spatial modality and a second to project spatial information onto single-cell data. ENVI can confer spatial context to genomics data from single dissociated cells and outperforms alternatives for imputing gene expression on diverse spatial datasets.
Collapse
Affiliation(s)
- Doron Haviv
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, USA
- Tri-Institutional Training Program in Computational Biology and Medicine, Weill Cornell Medicine, New York, NY, USA
| | - Ján Remšík
- Human Oncology & Pathogenesis Program, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Mohamed Gatie
- Developmental Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Catherine Snopkowski
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Meril Takizawa
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | | | | | | | - Tal Nawy
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Ronan Chaligne
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Adrienne Boire
- Human Oncology & Pathogenesis Program, Memorial Sloan Kettering Cancer Center, New York, NY, USA
- Department of Neurology, Memorial Sloan Kettering Cancer Center, New York, NY, USA
- Brain Tumor Center, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Anna-Katerina Hadjantonakis
- Developmental Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Dana Pe'er
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, USA.
- Howard Hughes Medical Institute, New York, NY, USA.
| |
Collapse
|
8
|
Zhang M, Parker J, An L, Liu Y, Sun X. Flexible analysis of spatial transcriptomics data (FAST): a deconvolution approach. BMC Bioinformatics 2025; 26:35. [PMID: 39891065 PMCID: PMC11786350 DOI: 10.1186/s12859-025-06054-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Accepted: 01/16/2025] [Indexed: 02/03/2025] Open
Abstract
MOTIVATION Spatial transcriptomics is a state-of-art technique that allows researchers to study gene expression patterns in tissues over the spatial domain. As a result of technical limitations, the majority of spatial transcriptomics techniques provide bulk data for each sequencing spot. Consequently, in order to obtain high-resolution spatial transcriptomics data, performing deconvolution becomes essential. Most existing deconvolution methods rely on reference data (e.g., single-cell data), which may not be available in real applications. Current reference-free methods encounter limitations due to their dependence on distribution assumptions, reliance on marker genes, or the absence of leveraging histology and spatial information. Consequently, there is a critical need for the development of highly flexible, robust, and user-friendly reference-free deconvolution methods capable of unifying or leveraging case-specific information in the analysis of spatial transcriptomics data. RESULTS We propose a novel reference-free method based on regularized non-negative matrix factorization (NMF), named Flexible Analysis of Spatial Transcriptomics (FAST), that can effectively incorporate gene expression data, spatial, and histology information into a unified deconvolution framework. Compared to existing methods, FAST imposes fewer distribution assumptions, utilizes the spatial structure information of tissues, and encourages interpretable factorization results. These features enable greater flexibility and accuracy, making FAST an effective tool for deciphering the complex cell-type composition of tissues and advancing our understanding of various biological processes and diseases. Extensive simulation studies have shown that FAST outperforms other existing reference-free methods. In real data applications, FAST is able to uncover the underlying tissue structures and identify the corresponding marker genes.
Collapse
Affiliation(s)
- Meng Zhang
- Department of Mathematics, University of Arizona, 617 N. Santa Rita Ave., Tucson, AZ, 85721, USA
| | - Joel Parker
- Department of Epidemiology and Biostatistics, University of Arizona, 1295 N. Martin Ave., Tucson, AZ, 85721, USA
| | - Lingling An
- Department of Agricultural and Biosystems Engineering, University of Arizona, 1177 East Fourth Street, Tucson, AZ, 85721, USA
| | - Yiwen Liu
- Department of Epidemiology and Biostatistics, University of Arizona, 1295 N. Martin Ave., Tucson, AZ, 85721, USA.
| | - Xiaoxiao Sun
- Department of Epidemiology and Biostatistics, University of Arizona, 1295 N. Martin Ave., Tucson, AZ, 85721, USA.
| |
Collapse
|
9
|
Wang Z, Dai R, Wang M, Lei L, Zhang Z, Han K, Wang Z, Guo Q. KanCell: dissecting cellular heterogeneity in biological tissues through integrated single-cell and spatial transcriptomics. J Genet Genomics 2025:S1673-8527(24)00310-2. [PMID: 39577768 DOI: 10.1016/j.jgg.2024.11.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2024] [Revised: 11/07/2024] [Accepted: 11/10/2024] [Indexed: 11/24/2024]
Abstract
KanCell is a deep learning model based on Kolmogorov-Arnold networks (KAN) designed to enhance cellular heterogeneity analysis by integrating single-cell RNA sequencing (scRNA-seq) and spatial transcriptomics (ST) data. ST technologies provide insights into gene expression within tissue context, revealing cellular interactions and microenvironments. To fully leverage this potential, effective computational models are crucial. We evaluate KanCell on both simulated and real datasets from technologies such as STARmap, Slide-seq, Visium, and Spatial Transcriptomics. Our results demonstrate that KanCell outperforms existing methods across metrics like PCC, SSIM, COSSIM, RMSE, JSD, ARS, and ROC, with robust performance under varying cell numbers and background noise. Real-world applications on human lymph nodes, hearts, melanoma, breast cancer, dorsolateral prefrontal cortex, and mouse embryo brains confirmed its reliability. Compared with traditional approaches, KanCell effectively captures non-linear relationships and optimizes computational efficiency through KAN, providing an accurate and efficient tool for ST. By improving data accuracy and resolving cell type composition, KanCell reveals cellular heterogeneity, clarifies disease microenvironments, and identifies therapeutic targets, addressing complex biological challenges.
Collapse
Affiliation(s)
- Zhenghui Wang
- Academy of Artificial Intelligence, Beijing Institute of Petrochemical Technology, Beijing 102617, China
| | - Ruoyan Dai
- Academy of Artificial Intelligence, Beijing Institute of Petrochemical Technology, Beijing 102617, China
| | - Mengqiu Wang
- Academy of Artificial Intelligence, Beijing Institute of Petrochemical Technology, Beijing 102617, China
| | - Lixin Lei
- Academy of Artificial Intelligence, Beijing Institute of Petrochemical Technology, Beijing 102617, China
| | - Zhiwei Zhang
- Academy of Artificial Intelligence, Beijing Institute of Petrochemical Technology, Beijing 102617, China
| | - Kaitai Han
- Academy of Artificial Intelligence, Beijing Institute of Petrochemical Technology, Beijing 102617, China
| | - Zijun Wang
- Academy of Artificial Intelligence, Beijing Institute of Petrochemical Technology, Beijing 102617, China
| | - Qianjin Guo
- Academy of Artificial Intelligence, Beijing Institute of Petrochemical Technology, Beijing 102617, China.
| |
Collapse
|
10
|
Cui T, Li YY, Li BL, Zhang H, Yu TT, Zhang JN, Qian FC, Yin MX, Fang QL, Hu ZH, Yan YX, Wang QY, Li CQ, Shang DS. SpatialRef: a reference of spatial omics with known spot annotation. Nucleic Acids Res 2025; 53:D1215-D1223. [PMID: 39417483 PMCID: PMC11701618 DOI: 10.1093/nar/gkae892] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2024] [Revised: 09/19/2024] [Accepted: 09/26/2024] [Indexed: 10/19/2024] Open
Abstract
Spatial omics technologies have enabled the creation of intricate spatial maps that capture molecular features and tissue morphology, providing valuable insights into the spatial associations and functional organization of tissues. Accurate annotation of spot or domain types is essential for downstream spatial omics analyses, but this remains challenging. Therefore, this study aimed to develop a manually curated spatial omics database (SpatialRef, https://bio.liclab.net/spatialref/), to provide comprehensive and high-quality spatial omics data with known spot labels across multiple species. The current version of SpatialRef aggregates >9 million manually annotated spots across 17 Human, Mouse and Drosophila tissue types through extensive review and strict quality control, covering multiple spatial sequencing technologies and >400 spot/domain types from original studies. Furthermore, SpatialRef supports various spatial omics analyses about known spot types, including differentially expressed genes, spatially variable genes, Gene Ontology (GO)/KEGG annotation, spatial communication and spatial trajectories. With a user-friendly interface, SpatialRef facilitates querying, browsing and visualizing, thereby aiding in elucidating the functional relevance of spatial domains within the tissue and uncovering potential biological effects.
Collapse
Affiliation(s)
- Ting Cui
- The First Affiliated Hospital & National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Insititute of Biochemistry and Molecular Biology, Hengyang Medical College, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Yan-Yu Li
- The First Affiliated Hospital & National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Insititute of Biochemistry and Molecular Biology, Hengyang Medical College, University of South China, Hengyang, Hunan 421001, China
- Hunan Provincial Key Laboratory of Multi-omics And Artificial Intelligence of Cardiovascular Diseases, University of South China, Hengyang, Hunan 421001, China
- School of Computer, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Institute of Cardiovascular Disease, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Bing-Long Li
- The First Affiliated Hospital & National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Insititute of Biochemistry and Molecular Biology, Hengyang Medical College, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Han Zhang
- School of Computer, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Ting-Ting Yu
- School of Computer, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Jia-Ning Zhang
- The First Affiliated Hospital & National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Insititute of Biochemistry and Molecular Biology, Hengyang Medical College, University of South China, Hengyang, Hunan 421001, China
- Hunan Provincial Key Laboratory of Multi-omics And Artificial Intelligence of Cardiovascular Diseases, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Feng-Cui Qian
- The First Affiliated Hospital & National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Insititute of Biochemistry and Molecular Biology, Hengyang Medical College, University of South China, Hengyang, Hunan 421001, China
- Hunan Provincial Key Laboratory of Multi-omics And Artificial Intelligence of Cardiovascular Diseases, University of South China, Hengyang, Hunan 421001, China
- School of Computer, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Institute of Cardiovascular Disease, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Ming-Xue Yin
- The First Affiliated Hospital & National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Insititute of Biochemistry and Molecular Biology, Hengyang Medical College, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Qiao-Li Fang
- School of Computer, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Zi-Hao Hu
- School of Computer, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Yu-Xiang Yan
- School of Computer, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Qiu-Yu Wang
- The First Affiliated Hospital & National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Insititute of Biochemistry and Molecular Biology, Hengyang Medical College, University of South China, Hengyang, Hunan 421001, China
- Hunan Provincial Key Laboratory of Multi-omics And Artificial Intelligence of Cardiovascular Diseases, University of South China, Hengyang, Hunan 421001, China
- School of Computer, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Institute of Cardiovascular Disease, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Chun-Quan Li
- The First Affiliated Hospital & National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Insititute of Biochemistry and Molecular Biology, Hengyang Medical College, University of South China, Hengyang, Hunan 421001, China
- Hunan Provincial Key Laboratory of Multi-omics And Artificial Intelligence of Cardiovascular Diseases, University of South China, Hengyang, Hunan 421001, China
- School of Computer, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Key Laboratory of Rare Pediatric Diseases, Ministry of Education, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Institute of Cardiovascular Disease, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - De-Si Shang
- The First Affiliated Hospital & National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Insititute of Biochemistry and Molecular Biology, Hengyang Medical College, University of South China, Hengyang, Hunan 421001, China
- Hunan Provincial Key Laboratory of Multi-omics And Artificial Intelligence of Cardiovascular Diseases, University of South China, Hengyang, Hunan 421001, China
- School of Computer, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Institute of Cardiovascular Disease, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| |
Collapse
|
11
|
Sun F, Li H, Sun D, Fu S, Gu L, Shao X, Wang Q, Dong X, Duan B, Xing F, Wu J, Xiao M, Zhao F, Han JDJ, Liu Q, Fan X, Li C, Wang C, Shi T. Single-cell omics: experimental workflow, data analyses and applications. SCIENCE CHINA. LIFE SCIENCES 2025; 68:5-102. [PMID: 39060615 DOI: 10.1007/s11427-023-2561-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Accepted: 04/18/2024] [Indexed: 07/28/2024]
Abstract
Cells are the fundamental units of biological systems and exhibit unique development trajectories and molecular features. Our exploration of how the genomes orchestrate the formation and maintenance of each cell, and control the cellular phenotypes of various organismsis, is both captivating and intricate. Since the inception of the first single-cell RNA technology, technologies related to single-cell sequencing have experienced rapid advancements in recent years. These technologies have expanded horizontally to include single-cell genome, epigenome, proteome, and metabolome, while vertically, they have progressed to integrate multiple omics data and incorporate additional information such as spatial scRNA-seq and CRISPR screening. Single-cell omics represent a groundbreaking advancement in the biomedical field, offering profound insights into the understanding of complex diseases, including cancers. Here, we comprehensively summarize recent advances in single-cell omics technologies, with a specific focus on the methodology section. This overview aims to guide researchers in selecting appropriate methods for single-cell sequencing and related data analysis.
Collapse
Affiliation(s)
- Fengying Sun
- Department of Clinical Laboratory, the Affiliated Wuhu Hospital of East China Normal University (The Second People's Hospital of Wuhu City), Wuhu, 241000, China
| | - Haoyan Li
- Pharmaceutical Informatics Institute, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Dongqing Sun
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Orthopaedic Department, Tongji Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China
- Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai, 200092, China
| | - Shaliu Fu
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Orthopaedic Department, Tongji Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China
- Translational Medical Center for Stem Cell Therapy and Institute for Regenerative Medicine, Shanghai East Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China
- Research Institute of Intelligent Computing, Zhejiang Lab, Hangzhou, 311121, China
- Shanghai Research Institute for Intelligent Autonomous Systems, Shanghai, 201210, China
| | - Lei Gu
- Center for Single-cell Omics, School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
| | - Xin Shao
- Pharmaceutical Informatics Institute, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
- National Key Laboratory of Chinese Medicine Modernization, Innovation Center of Yangtze River Delta, Zhejiang University, Jiaxing, 314103, China
| | - Qinqin Wang
- Center for Single-cell Omics, School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
| | - Xin Dong
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Orthopaedic Department, Tongji Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China
- Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai, 200092, China
| | - Bin Duan
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Orthopaedic Department, Tongji Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China
- Translational Medical Center for Stem Cell Therapy and Institute for Regenerative Medicine, Shanghai East Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China
- Research Institute of Intelligent Computing, Zhejiang Lab, Hangzhou, 311121, China
- Shanghai Research Institute for Intelligent Autonomous Systems, Shanghai, 201210, China
| | - Feiyang Xing
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Orthopaedic Department, Tongji Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China
- Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai, 200092, China
| | - Jun Wu
- Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, the Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai, 200241, China
| | - Minmin Xiao
- Department of Clinical Laboratory, the Affiliated Wuhu Hospital of East China Normal University (The Second People's Hospital of Wuhu City), Wuhu, 241000, China.
| | - Fangqing Zhao
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing, 100101, China.
| | - Jing-Dong J Han
- Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Center for Quantitative Biology (CQB), Peking University, Beijing, 100871, China.
| | - Qi Liu
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Orthopaedic Department, Tongji Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China.
- Translational Medical Center for Stem Cell Therapy and Institute for Regenerative Medicine, Shanghai East Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China.
- Research Institute of Intelligent Computing, Zhejiang Lab, Hangzhou, 311121, China.
- Shanghai Research Institute for Intelligent Autonomous Systems, Shanghai, 201210, China.
| | - Xiaohui Fan
- Pharmaceutical Informatics Institute, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China.
- National Key Laboratory of Chinese Medicine Modernization, Innovation Center of Yangtze River Delta, Zhejiang University, Jiaxing, 314103, China.
- Zhejiang Key Laboratory of Precision Diagnosis and Therapy for Major Gynecological Diseases, Women's Hospital, Zhejiang University School of Medicine, Hangzhou, 310006, China.
| | - Chen Li
- Center for Single-cell Omics, School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China.
| | - Chenfei Wang
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Orthopaedic Department, Tongji Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China.
- Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai, 200092, China.
| | - Tieliu Shi
- Department of Clinical Laboratory, the Affiliated Wuhu Hospital of East China Normal University (The Second People's Hospital of Wuhu City), Wuhu, 241000, China.
- Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, the Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai, 200241, China.
- Key Laboratory of Advanced Theory and Application in Statistics and Data Science-MOE, School of Statistics, East China Normal University, Shanghai, 200062, China.
| |
Collapse
|
12
|
Yu Z, Yang Y, Chen X, Wong K, Zhang Z, Zhao Y, Li X. Accurate Spatial Heterogeneity Dissection and Gene Regulation Interpretation for Spatial Transcriptomics using Dual Graph Contrastive Learning. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2025; 12:e2410081. [PMID: 39605202 PMCID: PMC11744562 DOI: 10.1002/advs.202410081] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/22/2024] [Revised: 10/31/2024] [Indexed: 11/29/2024]
Abstract
Recent advances in spatial transcriptomics have enabled simultaneous preservation of high-throughput gene expression profiles and the spatial context, enabling high-resolution exploration of distinct regional characterization in tissue. To effectively understand the underlying biological mechanisms within tissue microenvironments, there is a requisite for methods that can accurately capture external spatial heterogeneity and interpret internal gene regulation from spatial transcriptomics data. However, current methods for region identification often lack the simultaneous characterizing of spatial structure and gene regulation, thereby limiting the ability of spatial dissection and gene interpretation. Here, stDCL is developed, a dual graph contrastive learning method to identify spatial domains and interpret gene regulation in spatial transcriptomics data. stDCL adaptively incorporates gene expression data and spatial information via a graph embedding autoencoder, thereby preserving critical information within the latent embedding representations. In addition, dual graph contrastive learning is proposed to train the model, ensuring that the latent embedding representation closely resembles the actual spatial distribution and exhibits cluster similarity. Benchmarking stDCL against other state-of-the-art clustering methods using complex cortex datasets demonstrates its superior accuracy and effectiveness in identifying spatial domains. Our analysis of the imputation matrices generated by stDCL reveals its capability to reconstruct spatial hierarchical structures and refine differential expression assessment. Furthermore, it is demonstrated that the versatility of stDCL in interpretability of gene regulation, spatial heterogeneity at high resolution, and embryonic developmental patterns. In addition, it is also showed that stDCL can successfully annotate disease-associated astrocyte subtypes in Alzheimer's disease and unravel multiple relevant pathways and regulatory mechanisms.
Collapse
Affiliation(s)
- Zhuohan Yu
- School of Artificial IntelligenceJilin UniversityJilin130012China
| | - Yuning Yang
- Terrence Donnelly Centre for Cellular and Biomolecular ResearchUniversity of TorontoTorontoONM5S 3E1Canada
| | - Xingjian Chen
- Cutaneous Biology Research Center, Massachusetts General HospitalHarvard Medical SchoolBostonMA02115USA
| | - Ka‐Chun Wong
- Department of Computer ScienceCity University of Hong KongHong KongSAR999077Hong Kong
| | - Zhaolei Zhang
- Terrence Donnelly Centre for Cellular and Biomolecular ResearchUniversity of TorontoTorontoONM5S 3E1Canada
| | - Yuming Zhao
- College of Computer and Control EngineeringNortheast Forestry UniversityHarbin150040China
| | - Xiangtao Li
- School of Artificial IntelligenceJilin UniversityJilin130012China
- Department of Computer ScienceCity University of Hong KongHong KongSAR999077Hong Kong
| |
Collapse
|
13
|
Defard T, Desrentes A, Fouillade C, Mueller F. Homebuilt Imaging-Based Spatial Transcriptomics: Tertiary Lymphoid Structures as a Case Example. Methods Mol Biol 2025; 2864:77-105. [PMID: 39527218 DOI: 10.1007/978-1-0716-4184-2_5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2024]
Abstract
Spatial transcriptomics methods provide insight into the cellular heterogeneity and spatial architecture of complex, multicellular systems. Combining molecular and spatial information provides important clues to study tissue architecture in development and disease. Here, we present a comprehensive do-it-yourself (DIY) guide to perform such experiments at reduced costs leveraging open-source approaches. This guide spans the entire life cycle of a project, from its initial definition to experimental choices, wet lab approaches, instrumentation, and analysis. As a concrete example, we focus on tertiary lymphoid structures (TLS), which we use to develop typical questions that can be addressed by these approaches.
Collapse
Affiliation(s)
- Thomas Defard
- Institut Pasteur, Université Paris Cité, Photonic Bio-Imaging, Centre de Ressources et Recherches Technologiques (UTechS-PBI, C2RT), Paris, France
- Institut Pasteur, Université Paris Cité, Imaging and Modeling Unit, Paris, France
- Centre for Computational Biology (CBIO), Mines Paris, PSL University, Paris, France
- Institut Curie, PSL University, Paris, France
- INSERM, U900, Paris, France
| | - Auxence Desrentes
- UMRS1135 Sorbonne University, Paris, France
- INSERM U1135, Paris, France
- Team "Immune Microenvironment and Immunotherapy", Centre for Immunology and Microbial Infections (CIMI), Paris, France
| | - Charles Fouillade
- Institut Curie, Inserm U1021-CNRS UMR 3347, University Paris-Saclay, PSL Research University, Centre Universitaire, Orsay, France
| | - Florian Mueller
- Institut Pasteur, Université Paris Cité, Photonic Bio-Imaging, Centre de Ressources et Recherches Technologiques (UTechS-PBI, C2RT), Paris, France.
- Institut Pasteur, Université Paris Cité, Imaging and Modeling Unit, Paris, France.
| |
Collapse
|
14
|
Zhang F, Shen Z, Huang S, Zhu Y, Yi M. SpaInGNN: Enhanced clustering and integration of spatial transcriptomics based on refined graph neural networks. Methods 2025; 233:42-51. [PMID: 39542070 DOI: 10.1016/j.ymeth.2024.11.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2024] [Revised: 10/22/2024] [Accepted: 11/07/2024] [Indexed: 11/17/2024] Open
Abstract
Recent developments in spatial transcriptomics (ST) technology have markedly enhanced the proposed capacity to comprehensively characterize gene expression patterns within tissue microenvironments while crucially preserving spatial context. However, the identification of spatial domains at the single-cell level remains a significant challenge in elucidating biological processes. To address this, SpaInGNN was developed, a sophisticated graph neural network (GNN) framework that accurately delineates spatial domains by integrating spatial location data, histological information, and gene expression profiles into low-dimensional latent embeddings. Additionally, to fully leverage spatial coordinate data, spatial integration using graph neural network (SpaInGNN) refines the graph constructed for spatial locations by incorporating both tissue image distance and Euclidean distance, following a pre-clustering of gene expression profiles. This refined graph is then embedded using a self-supervised GNN, which minimizes self-reconfiguration loss. By applying SpaInGNN to refined graphs across multiple consecutive tissue slices, this study mitigates the impact of batch effects in data analysis. The proposed method demonstrates substantial improvements in the accuracy of spatial domain recognition, providing a more faithful representation of the tissue organization in both mouse olfactory bulb and human lateral prefrontal cortex samples.
Collapse
Affiliation(s)
- Fangqin Zhang
- Shool of Mathematics and Physics, China University of Geosciences, Wuhan 430074, China
| | - Zhan Shen
- Shool of Mathematics and Physics, China University of Geosciences, Wuhan 430074, China
| | - Siyi Huang
- Shool of Mathematics and Physics, China University of Geosciences, Wuhan 430074, China
| | - Yuan Zhu
- Shool of Mathematics and Physics, China University of Geosciences, Wuhan 430074, China
| | - Ming Yi
- Shool of Mathematics and Physics, China University of Geosciences, Wuhan 430074, China.
| |
Collapse
|
15
|
Keller MS, Gold I, McCallum C, Manz T, Kharchenko PV, Gehlenborg N. Vitessce: integrative visualization of multimodal and spatially resolved single-cell data. Nat Methods 2025; 22:63-67. [PMID: 39333268 PMCID: PMC11725496 DOI: 10.1038/s41592-024-02436-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2023] [Accepted: 08/16/2024] [Indexed: 09/29/2024]
Abstract
Multiomics technologies with single-cell and spatial resolution make it possible to measure thousands of features across millions of cells. However, visual analysis of high-dimensional transcriptomic, proteomic, genome-mapped and imaging data types simultaneously remains a challenge. Here we describe Vitessce, an interactive web-based visualization framework for exploration of multimodal and spatially resolved single-cell data. We demonstrate integrative visualization of millions of data points, including cell-type annotations, gene expression quantities, spatially resolved transcripts and cell segmentations, across multiple coordinated views. The open-source software is available at http://vitessce.io .
Collapse
Affiliation(s)
- Mark S Keller
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Ilan Gold
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Chuck McCallum
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Trevor Manz
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Peter V Kharchenko
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Altos Labs, San Diego, CA, USA
| | - Nils Gehlenborg
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
16
|
Wang J, Ye F, Chai H, Jiang Y, Wang T, Ran X, Xia Q, Xu Z, Fu Y, Zhang G, Wu H, Guo G, Guo H, Ruan Y, Wang Y, Xing D, Xu X, Zhang Z. Advances and applications in single-cell and spatial genomics. SCIENCE CHINA. LIFE SCIENCES 2024:10.1007/s11427-024-2770-x. [PMID: 39792333 DOI: 10.1007/s11427-024-2770-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/20/2024] [Accepted: 10/10/2024] [Indexed: 01/12/2025]
Abstract
The applications of single-cell and spatial technologies in recent times have revolutionized the present understanding of cellular states and the cellular heterogeneity inherent in complex biological systems. These advancements offer unprecedented resolution in the examination of the functional genomics of individual cells and their spatial context within tissues. In this review, we have comprehensively discussed the historical development and recent progress in the field of single-cell and spatial genomics. We have reviewed the breakthroughs in single-cell multi-omics technologies, spatial genomics methods, and the computational strategies employed toward the analyses of single-cell atlas data. Furthermore, we have highlighted the advances made in constructing cellular atlases and their clinical applications, particularly in the context of disease. Finally, we have discussed the emerging trends, challenges, and opportunities in this rapidly evolving field.
Collapse
Affiliation(s)
- Jingjing Wang
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Fang Ye
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Haoxi Chai
- Life Sciences Institute and The Second Affiliated Hospital, Zhejiang University, Hangzhou, 310058, China
| | - Yujia Jiang
- BGI Research, Shenzhen, 518083, China
- BGI Research, Hangzhou, 310030, China
| | - Teng Wang
- Biomedical Pioneering Innovation Center (BIOPIC) and School of Life Sciences, Peking University, Beijing, 100871, China
- Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, 100871, China
| | - Xia Ran
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
- Institute of Hematology, Zhejiang University, Hangzhou, 310000, China
| | - Qimin Xia
- Biomedical Pioneering Innovation Center (BIOPIC) and School of Life Sciences, Peking University, Beijing, 100871, China
| | - Ziye Xu
- Department of Laboratory Medicine of The First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Yuting Fu
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
- Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Guodong Zhang
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
- Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Hanyu Wu
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
- Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Guoji Guo
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China.
- Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou, 310058, China.
- Zhejiang Provincial Key Lab for Tissue Engineering and Regenerative Medicine, Dr. Li Dak Sum & Yip Yio Chin Center for Stem Cell and Regenerative Medicine, Hangzhou, 310058, China.
- Institute of Hematology, Zhejiang University, Hangzhou, 310000, China.
| | - Hongshan Guo
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China.
- Institute of Hematology, Zhejiang University, Hangzhou, 310000, China.
| | - Yijun Ruan
- Life Sciences Institute and The Second Affiliated Hospital, Zhejiang University, Hangzhou, 310058, China.
| | - Yongcheng Wang
- Department of Laboratory Medicine of The First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China.
| | - Dong Xing
- Biomedical Pioneering Innovation Center (BIOPIC) and School of Life Sciences, Peking University, Beijing, 100871, China.
- Beijing Advanced Innovation Center for Genomics (ICG), Peking University, Beijing, 100871, China.
| | - Xun Xu
- BGI Research, Shenzhen, 518083, China.
- BGI Research, Hangzhou, 310030, China.
- Guangdong Provincial Key Laboratory of Genome Read and Write, BGI Research, Shenzhen, 518083, China.
| | - Zemin Zhang
- Biomedical Pioneering Innovation Center (BIOPIC) and School of Life Sciences, Peking University, Beijing, 100871, China.
| |
Collapse
|
17
|
Hong F. Programmable DNA Reactions for Advanced Fluorescence Microscopy in Bioimaging. SMALL METHODS 2024:e2401279. [PMID: 39679773 DOI: 10.1002/smtd.202401279] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/15/2024] [Revised: 11/14/2024] [Indexed: 12/17/2024]
Abstract
Biological organisms are composed of billions of molecules organized across various length scales. Direct visualization of these biomolecules in situ enables the retrieval of vast molecular information, including their location, species, and quantities, which is essential for understanding biological processes. The programmability of DNA interactions has made DNA-based reactions a major driving force in extending the limits of fluorescence microscopy, allowing for the study of biological complexity at different scales. This review article provides an overview of recent technological advancements in DNA-based fluorescence microscopy, highlighting how these innovations have expanded the technique's capabilities in terms of target multiplexity, signal amplification, super-resolution, and mechanical properties. These advanced DNA-based fluorescence microscopy techniques have been widely used to uncover new biological insights at the molecular level.
Collapse
Affiliation(s)
- Fan Hong
- Department of Chemistry, University of Florida, Gainesville, FL, 32611, USA
- J. Crayton Pruitt Family Department of Biomedical Engineering, University of Florida, Gainesville, FL, 32611, USA
| |
Collapse
|
18
|
Liu T, Fang ZY, Zhang Z, Yu Y, Li M, Yin MZ. A comprehensive overview of graph neural network-based approaches to clustering for spatial transcriptomics. Comput Struct Biotechnol J 2024; 23:106-128. [PMID: 38089467 PMCID: PMC10714345 DOI: 10.1016/j.csbj.2023.11.055] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Revised: 11/24/2023] [Accepted: 11/27/2023] [Indexed: 10/16/2024] Open
Abstract
Spatial transcriptomics technologies enable researchers to accurately quantify and localize messenger ribonucleic acid (mRNA) transcripts at a high resolution while preserving their spatial context. The identification of spatial domains, or the task of spatial clustering, plays a crucial role in investigating data on spatial transcriptomes. One promising approach for classifying spatial domains involves the use of graph neural networks (GNNs) by leveraging gene expressions, spatial locations, and histological images. This study provided a comprehensive overview of the most recent GNN-based methods of spatial clustering methods for the analysis of data on spatial transcriptomics. We extensively evaluated the performance of current methods on prevalent datasets of spatial transcriptomics by considering their accuracy of clustering, robustness, data stabilization, relevant requirements, computational efficiency, and memory use. To this end, we explored 60 clustering scenarios by extending the essential frameworks of spatial clustering for the selection of the GNNs, algorithms of downstream clustering, principal component analysis (PCA)-based reduction, and refined methods of correction. We comparatively analyzed the performance of the methods in terms of spatial clustering to identify their limitations and outline future directions of research in the field. Our survey yielded novel insights, and provided motivation for further investigating spatial transcriptomics.
Collapse
Affiliation(s)
- Teng Liu
- Clinical Research Center (CRC), Clinical Pathology Center (CPC), Cancer Early Detection and Treatment Center (CEDTC) and Translational Medicine Research Center (TMRC), Chongqing University Three Gorges Hospital, Chongqing University, Wanzhou, Chongqing, China
- Chongqing Technical Innovation Center for Quality Evaluation and Identification of Authentic Medicinal Herbs, Chongqing, China
| | - Zhao-Yu Fang
- School of Computer Science and Engineering, Central South University, Changsha 410083, China
| | - Zongbo Zhang
- Clinical Research Center (CRC), Clinical Pathology Center (CPC), Cancer Early Detection and Treatment Center (CEDTC) and Translational Medicine Research Center (TMRC), Chongqing University Three Gorges Hospital, Chongqing University, Wanzhou, Chongqing, China
| | - Yongxiang Yu
- Clinical Research Center (CRC), Clinical Pathology Center (CPC), Cancer Early Detection and Treatment Center (CEDTC) and Translational Medicine Research Center (TMRC), Chongqing University Three Gorges Hospital, Chongqing University, Wanzhou, Chongqing, China
| | - Min Li
- School of Computer Science and Engineering, Central South University, Changsha 410083, China
- Hunan Provincial Engineering Research Center of Intelligent Computing in Biology and Medicine, Central South University, Changsha 410083, China
| | - Ming-Zhu Yin
- Clinical Research Center (CRC), Clinical Pathology Center (CPC), Cancer Early Detection and Treatment Center (CEDTC) and Translational Medicine Research Center (TMRC), Chongqing University Three Gorges Hospital, Chongqing University, Wanzhou, Chongqing, China
- Chongqing Technical Innovation Center for Quality Evaluation and Identification of Authentic Medicinal Herbs, Chongqing, China
| |
Collapse
|
19
|
Nie W, Yu Y, Wang X, Wang R, Li SC. Spatially Informed Graph Structure Learning Extracts Insights from Spatial Transcriptomics. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2024; 11:e2403572. [PMID: 39382177 PMCID: PMC11615819 DOI: 10.1002/advs.202403572] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/05/2024] [Revised: 08/04/2024] [Indexed: 10/10/2024]
Abstract
Embeddings derived from cell graphs hold significant potential for exploring spatial transcriptomics (ST) datasets. Nevertheless, existing methodologies rely on a graph structure defined by spatial proximity, which inadequately represents the diversity inherent in cell-cell interactions (CCIs). This study introduces STAGUE, an innovative framework that concurrently learns a cell graph structure and a low-dimensional embedding from ST data. STAGUE employs graph structure learning to parameterize and refine a cell graph adjacency matrix, enabling the generation of learnable graph views for effective contrastive learning. The derived embeddings and cell graph improve spatial clustering accuracy and facilitate the discovery of novel CCIs. Experimental benchmarks across 86 real and simulated ST datasets show that STAGUE outperforms 15 comparison methods in clustering performance. Additionally, STAGUE delineates the heterogeneity in human breast cancer tissues, revealing the activation of epithelial-to-mesenchymal transition and PI3K/AKT signaling in specific sub-regions. Furthermore, STAGUE identifies CCIs with greater alignment to established biological knowledge than those ascertained by existing graph autoencoder-based methods. STAGUE also reveals the regulatory genes that participate in these CCIs, including those enriched in neuropeptide signaling and receptor tyrosine kinase signaling pathways, thereby providing insights into the underlying biological processes.
Collapse
Affiliation(s)
- Wan Nie
- Department of Computer ScienceCity University of Hong KongHong Kong SARChina
| | - Yingying Yu
- Department of Computer ScienceCity University of Hong KongHong Kong SARChina
| | - Xueying Wang
- Department of Computer ScienceCity University of Hong KongHong Kong SARChina
- City University of Hong Kong (Dongguan)Dongguan523000China
| | - Ruohan Wang
- Department of Computer ScienceCity University of Hong KongHong Kong SARChina
| | - Shuai Cheng Li
- Department of Computer ScienceCity University of Hong KongHong Kong SARChina
| |
Collapse
|
20
|
Kuemmerle LB, Luecken MD, Firsova AB, Barros de Andrade E Sousa L, Straßer L, Mekki II, Campi F, Heumos L, Shulman M, Beliaeva V, Hediyeh-Zadeh S, Schaar AC, Mahbubani KT, Sountoulidis A, Balassa T, Kovacs F, Horvath P, Piraud M, Ertürk A, Samakovlis C, Theis FJ. Probe set selection for targeted spatial transcriptomics. Nat Methods 2024; 21:2260-2270. [PMID: 39558096 PMCID: PMC11621025 DOI: 10.1038/s41592-024-02496-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Accepted: 09/30/2024] [Indexed: 11/20/2024]
Abstract
Targeted spatial transcriptomic methods capture the topology of cell types and states in tissues at single-cell and subcellular resolution by measuring the expression of a predefined set of genes. The selection of an optimal set of probed genes is crucial for capturing the spatial signals present in a tissue. This requires selecting the most informative, yet minimal, set of genes to profile (gene set selection) for which it is possible to build probes (probe design). However, current selections often rely on marker genes, precluding them from detecting continuous spatial signals or new states. We present Spapros, an end-to-end probe set selection pipeline that optimizes both gene set specificity for cell type identification and within-cell type expression variation to resolve spatially distinct populations while considering prior knowledge as well as probe design and expression constraints. We evaluated Spapros and show that it outperforms other selection approaches in both cell type recovery and recovering expression variation beyond cell types. Furthermore, we used Spapros to design a single-cell resolution in situ hybridization on tissues (SCRINSHOT) experiment of adult lung tissue to demonstrate how probes selected with Spapros identify cell types of interest and detect spatial variation even within cell types.
Collapse
Affiliation(s)
- Louis B Kuemmerle
- Institute of Computational Biology, Helmholtz Zentrum München, Neuherberg, Germany
- Institute for Tissue Engineering and Regenerative Medicine, Helmholtz Zentrum München, Neuherberg, Germany
- School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany
| | - Malte D Luecken
- Institute of Computational Biology, Helmholtz Zentrum München, Neuherberg, Germany
- Institute of Lung Health & Immunity, Helmholtz Munich, Member of the German Center for Lung Research (DZL), Munich, Germany
- German Center for Lung Research (DZL), Gießen, Germany
| | - Alexandra B Firsova
- SciLifeLab and Department of Molecular Biosciences, Stockholm University, Stockholm, Sweden
| | | | - Lena Straßer
- Institute of Computational Biology, Helmholtz Zentrum München, Neuherberg, Germany
| | | | - Francesco Campi
- Helmholtz AI, Helmholtz Zentrum München, Neuherberg, Germany
| | - Lukas Heumos
- Institute of Computational Biology, Helmholtz Zentrum München, Neuherberg, Germany
- School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany
- Institute of Lung Biology and Disease and Comprehensive Pneumology Center, Helmholtz Zentrum München, German Center for Lung Research (DZL), Munich, Germany
| | - Maiia Shulman
- Institute of Computational Biology, Helmholtz Zentrum München, Neuherberg, Germany
| | - Valentina Beliaeva
- Institute of Computational Biology, Helmholtz Zentrum München, Neuherberg, Germany
| | - Soroor Hediyeh-Zadeh
- Institute of Computational Biology, Helmholtz Zentrum München, Neuherberg, Germany
- School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany
| | - Anna C Schaar
- Institute of Computational Biology, Helmholtz Zentrum München, Neuherberg, Germany
- TUM School of Computation, Information and Technology, Technical University of Munich, Munich, Germany
- Munich Center for Machine Learning, Technical University of Munich, Munich, Germany
| | - Krishnaa T Mahbubani
- Department of Surgery, University of Cambridge and Cambridge NIHR Biomedical Research Centre, Cambridge, UK
| | | | - Tamás Balassa
- Synthetic and Systems Biology Unit, Biological Research Centre, Eötvös Loránd Research Network, Szeged, Hungary
| | | | - Peter Horvath
- Synthetic and Systems Biology Unit, Biological Research Centre, Eötvös Loránd Research Network, Szeged, Hungary
- Institute of AI for Health, Helmholtz Zentrum München, Neuherberg, Germany
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland
| | - Marie Piraud
- Helmholtz AI, Helmholtz Zentrum München, Neuherberg, Germany
| | - Ali Ertürk
- Institute for Tissue Engineering and Regenerative Medicine, Helmholtz Zentrum München, Neuherberg, Germany
- Institute for Stroke and Dementia Research, Klinikum der Universität München, Ludwig-Maximilians University Munich, Munich, Germany
- Munich Cluster for Systems Neurology (SyNergy), Munich, Germany
- School of Medicine, Koç University, İstanbul, Turkey
| | - Christos Samakovlis
- SciLifeLab and Department of Molecular Biosciences, Stockholm University, Stockholm, Sweden
- Cardiopulmonary Institute, Justus Liebig University, Giessen, Germany
| | - Fabian J Theis
- Institute of Computational Biology, Helmholtz Zentrum München, Neuherberg, Germany.
- School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany.
- School of Computation, Information and Technology, Technical University of Munich, Munich, Germany.
| |
Collapse
|
21
|
Liu Y, Yang C. Computational methods for alignment and integration of spatially resolved transcriptomics data. Comput Struct Biotechnol J 2024; 23:1094-1105. [PMID: 38495555 PMCID: PMC10940867 DOI: 10.1016/j.csbj.2024.03.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2024] [Revised: 03/02/2024] [Accepted: 03/04/2024] [Indexed: 03/19/2024] Open
Abstract
Most of the complex biological regulatory activities occur in three dimensions (3D). To better analyze biological processes, it is essential not only to decipher the molecular information of numerous cells but also to understand how their spatial contexts influence their behavior. With the development of spatially resolved transcriptomics (SRT) technologies, SRT datasets are being generated to simultaneously characterize gene expression and spatial arrangement information within tissues, organs or organisms. To fully leverage spatial information, the focus extends beyond individual two-dimensional (2D) slices. Two tasks known as slices alignment and data integration have been introduced to establish correlations between multiple slices, enhancing the effectiveness of downstream tasks. Currently, numerous related methods have been developed. In this review, we first elucidate the details and principles behind several representative methods. Then we report the testing results of these methods on various SRT datasets, and assess their performance in representative downstream tasks. Insights into the strengths and weaknesses of each method and the reasons behind their performance are discussed. Finally, we provide an outlook on future developments. The codes and details of experiments are now publicly available at https://github.com/YangLabHKUST/SRT_alignment_and_integration.
Collapse
Affiliation(s)
- Yuyao Liu
- Department of Automation, School of Information Science and Technology, Tsinghua University, Beijing, China
| | - Can Yang
- Department of Mathematics, The Hong Kong University of Science and Technology, Hong Kong, China
| |
Collapse
|
22
|
Zhang D, Yu N, Sun X, Li H, Zhang W, Qiao X, Zhang W, Gao R. Deciphering spatial domains from spatially resolved transcriptomics through spatially regularized deep graph networks. BMC Genomics 2024; 25:1160. [PMID: 39614161 DOI: 10.1186/s12864-024-11072-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2024] [Accepted: 11/21/2024] [Indexed: 12/01/2024] Open
Abstract
BACKGROUND Recent advancements in spatially resolved transcriptomics (SRT) have opened up unprecedented opportunities to explore gene expression patterns within spatial contexts. Deciphering spatial domains is a critical task in spatial transcriptomic data analysis, aiding in the elucidation of tissue structural heterogeneity and biological functions. However, existing spatial domain detection methods ignore the consistency of expression patterns and spatial arrangements between spots, as well as the severe gene dropout phenomenon present in SRT data, resulting in suboptimal performance in identifying tissue spatial heterogeneity. RESULTS In this paper, we introduce a novel framework, spatially regularized deep graph networks (SR-DGN), which integrates gene expression profiles with spatial information to learn spatially-consistent and informative spot representations. Specifically, SR-DGN employs graph attention networks (GAT) to adaptively aggregate gene expression information from neighboring spots, considering local expression patterns between spots. In addition, the spatial regularization constraint ensures the consistency of neighborhood relationships between physical and embedded spaces in an end-to-end manner. SR-DGN also employs cross-entropy (CE) loss to model gene expression states, effectively mitigating the impact of noisy gene dropouts. CONCLUSIONS Experimental results demonstrate that SR-DGN outperforms state-of-the-art methods in spatial domain identification across SRT data from different sequencing platforms. Moreover, SR-DGN is capable of recovering known microanatomical structures, yielding clearer low-dimensional visualizations and more accurate spatial trajectory inferences.
Collapse
Affiliation(s)
- Daoliang Zhang
- Center of Intelligent Medicine, School of Control Science and Engineering, Shandong University, Jinan, Shandong, 250061, China
| | - Na Yu
- Center of Intelligent Medicine, School of Control Science and Engineering, Shandong University, Jinan, Shandong, 250061, China
| | - Xue Sun
- Center of Intelligent Medicine, School of Control Science and Engineering, Shandong University, Jinan, Shandong, 250061, China
| | - Haoyang Li
- Center of Intelligent Medicine, School of Control Science and Engineering, Shandong University, Jinan, Shandong, 250061, China
| | - Wenjing Zhang
- Center of Intelligent Medicine, School of Control Science and Engineering, Shandong University, Jinan, Shandong, 250061, China
| | - Xu Qiao
- Center of Intelligent Medicine, School of Control Science and Engineering, Shandong University, Jinan, Shandong, 250061, China.
| | - Wei Zhang
- Center of Intelligent Medicine, School of Control Science and Engineering, Shandong University, Jinan, Shandong, 250061, China.
| | - Rui Gao
- Center of Intelligent Medicine, School of Control Science and Engineering, Shandong University, Jinan, Shandong, 250061, China.
| |
Collapse
|
23
|
Li J, Xiang S, Wei D. Deciphering progressive lesion areas in breast cancer spatial transcriptomics via TGR-NMF. Brief Bioinform 2024; 26:bbae707. [PMID: 39780487 PMCID: PMC11711100 DOI: 10.1093/bib/bbae707] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2024] [Revised: 11/04/2024] [Accepted: 12/26/2024] [Indexed: 01/11/2025] Open
Abstract
Identifying spatial domains is critical for understanding breast cancer tissue heterogeneity and providing insights into tumor progression. However, dropout events introduces computational challenges and the lack of transparency in methods such as graph neural networks limits their interpretability. This study aimed to decipher disease progression-related spatial domains in breast cancer spatial transcriptomics by developing the three graph regularized non-negative matrix factorization (TGR-NMF). A unitization strategy was proposed to mitigate the impact of dropout events on the computational process, enabling utilization of the complete gene expression count data. By integrating one gene expression neighbor topology and two spatial position neighbor topologies, TGR-NMF was developed for constructing an interpretable low-dimensional representation of spatial transcriptomic data. The progressive lesion area that can reveal the progression of breast cancer was uncovered through heterogeneity analysis. Moreover, several related pathogenic genes and signal pathways on this area were identified by using gene enrichment and cell communication analysis.
Collapse
Affiliation(s)
- Juntao Li
- School of Mathematics and Statistics, Henan Normal University, 46 Jianshe East Road, 453007 Xinxiang, China
| | - Shan Xiang
- School of Mathematics and Statistics, Henan Normal University, 46 Jianshe East Road, 453007 Xinxiang, China
| | - Dongqing Wei
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Road, 200240 Shanghai, China
| |
Collapse
|
24
|
Mo Y, Liu J, Zhang L. Deconvolution of spatial transcriptomics data via graph contrastive learning and partial least square regression. Brief Bioinform 2024; 26:bbaf052. [PMID: 39924717 PMCID: PMC11807730 DOI: 10.1093/bib/bbaf052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2024] [Revised: 12/19/2024] [Accepted: 01/24/2025] [Indexed: 02/11/2025] Open
Abstract
Deciphering the cellular abundance in spatial transcriptomics (ST) is crucial for revealing the spatial architecture of cellular heterogeneity within tissues. However, some of the current spatial sequencing technologies are in low resolutions, leading to each spot having multiple heterogeneous cells. Additionally, current spatial deconvolution methods lack the ability to utilize multi-modality information such as gene expression and chromatin accessibility from single-cell multi-omics data. In this study, we introduce a graph Contrastive Learning and Partial Least Squares regression-based method, CLPLS, to deconvolute ST data. CLPLS is a flexible method that it can be extended to integrate ST data and single-cell multi-omics data, enabling the exploration of the spatially epigenomic heterogeneity. We applied CLPLS to both simulated and real datasets coming from different platforms. Benchmark analyses with other methods on these datasets show the superior performance of CLPLS in deconvoluting spots in single cell level.
Collapse
Affiliation(s)
- Yuanyuan Mo
- School of Artificial Intelligence, School of Computer Science, Wuhan University, Wuhan 430072, China
| | - Juan Liu
- School of Artificial Intelligence, School of Computer Science, Wuhan University, Wuhan 430072, China
| | - Lihua Zhang
- School of Artificial Intelligence, School of Computer Science, Wuhan University, Wuhan 430072, China
| |
Collapse
|
25
|
Peng W, Zhang Z, Dai W, Ping Z, Fu X, Liu L, Liu L, Yu N. MVCLST: A spatial transcriptome data analysis pipeline for cell type classification based on multi-view comparative learning. Methods 2024; 232:115-128. [PMID: 39542071 DOI: 10.1016/j.ymeth.2024.11.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2024] [Revised: 11/03/2024] [Accepted: 11/04/2024] [Indexed: 11/17/2024] Open
Abstract
Recent advancements in spatial transcriptomics sequencing technologies can not only provide gene expression within individual cells or cell clusters (spots) in a tissue but also pinpoint the exact location of this expression and generate detailed images of stained tissue sections, which offers invaluable insights into cell type identification and cell function exploration. However, effectively integratingthegene expression data, spatial location information, and tissue images from spatial transcriptomics data presents a significant challenge for computational methodsin cell classification. In this work, we propose MVCLST, a multi-view comparative learningmethod to analyze spatial transcriptomicsdata for accurate cell type classification. MVCLSTconstructs two views based on gene expression profiles, cell coordinates and image features. The multi-view method we proposed can significantly enhance the effectiveness of feature extraction while avoiding the impact of erroneous information in organizing image or gene expression data. The model employs four separate encoders to capture shared and unique features within each view. To ensure consistency and facilitate information exchange between the two views, MVCLST incorporates a contrastive learning loss function. The extracted shared and private features from both views are fused using corresponding decoders. Finally, the model utilizes the Leiden algorithm to clusterthe learned featuresfor cell type identification. Additionally, we establish a framework called MVCLST-CCFS for spatial transcriptomicsdata analysis based on MVCLST and consistent clustering. Our method achieves excellent results in clustering on human dorsolateral prefrontal cortex data and the mouse brain tissue data. Italso outperforms state-of-the-art techniques in the subsequent search for highly variable genes across cell types on the mouse olfactory bulbdata.
Collapse
Affiliation(s)
- Wei Peng
- Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650050, PR China; Computer Technology Application Key Lab of Yunnan Province, Kunming University of Science and Technology, Kunming 650050, PR China.
| | - Zhihao Zhang
- Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650050, PR China
| | - Wei Dai
- Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650050, PR China; Computer Technology Application Key Lab of Yunnan Province, Kunming University of Science and Technology, Kunming 650050, PR China.
| | - Zhihao Ping
- Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650050, PR China
| | - Xiaodong Fu
- Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650050, PR China; Computer Technology Application Key Lab of Yunnan Province, Kunming University of Science and Technology, Kunming 650050, PR China.
| | - Li Liu
- Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650050, PR China; Computer Technology Application Key Lab of Yunnan Province, Kunming University of Science and Technology, Kunming 650050, PR China
| | - Lijun Liu
- Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650050, PR China; Computer Technology Application Key Lab of Yunnan Province, Kunming University of Science and Technology, Kunming 650050, PR China
| | - Ning Yu
- State University of New York, The College at Brockport, Department of Computing Sciences, 350 New Campus Drive, Brockport, NY 14422, United States.
| |
Collapse
|
26
|
Yang J, Wang L, Liu L, Zheng X. GraphPCA: a fast and interpretable dimension reduction algorithm for spatial transcriptomics data. Genome Biol 2024; 25:287. [PMID: 39511664 PMCID: PMC11545739 DOI: 10.1186/s13059-024-03429-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2024] [Accepted: 10/29/2024] [Indexed: 11/15/2024] Open
Abstract
The rapid advancement of spatial transcriptomics technologies has revolutionized our understanding of cell heterogeneity and intricate spatial structures within tissues and organs. However, the high dimensionality and noise in spatial transcriptomic data present significant challenges for downstream data analyses. Here, we develop GraphPCA, an interpretable and quasi-linear dimension reduction algorithm that leverages the strengths of graphical regularization and principal component analysis. Comprehensive evaluations on simulated and multi-resolution spatial transcriptomic datasets generated from various platforms demonstrate the capacity of GraphPCA to enhance downstream analysis tasks including spatial domain detection, denoising, and trajectory inference compared to other state-of-the-art methods.
Collapse
Affiliation(s)
- Jiyuan Yang
- Center for Single-Cell Omics, School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Lu Wang
- Center for Single-Cell Omics, School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, China
- The Guangxi Key Laboratory of Intelligent Precision Medicine, Guangxi Zhuang Autonomous Region, Nanning, China
| | - Lin Liu
- Institute of Natural Sciences, MOE-LSC, School of Mathematical Sciences, CMA-Shanghai, SJTU-Yale Joint Center for Biostatistics and Data Science, Shanghai Jiao Tong University and Shanghai Artificial Intelligence Laboratory, Shanghai, China
| | - Xiaoqi Zheng
- Center for Single-Cell Omics, School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, China.
| |
Collapse
|
27
|
Bartels T, Rowitch DH, Bayraktar OA. Generation of Mammalian Astrocyte Functional Heterogeneity. Cold Spring Harb Perspect Biol 2024; 16:a041351. [PMID: 38692833 PMCID: PMC11529848 DOI: 10.1101/cshperspect.a041351] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/03/2024]
Abstract
Mammalian astrocytes have regional roles within the brain parenchyma. Indeed, the notion that astrocytes are molecularly heterogeneous could help explain how the central nervous system (CNS) retains embryonic positional information through development into specialized regions into adulthood. A growing body of evidence supports the concept of morphological and molecular differences between astrocytes in different brain regions, which might relate to their derivation from regionally patterned radial glia and/or local neuron inductive cues. Here, we review evidence for regionally encoded functions of astrocytes to provide an integrated concept on lineage origins and heterogeneity to understand regional brain organization, as well as emerging technologies to identify and further investigate novel roles for astrocytes.
Collapse
Affiliation(s)
- Theresa Bartels
- Department of Paediatrics and Wellcome-MRC Cambridge Stem Cell Institute, University of Cambridge, Cambridge CB2 0AW, United Kingdom
| | - David H Rowitch
- Department of Paediatrics and Wellcome-MRC Cambridge Stem Cell Institute, University of Cambridge, Cambridge CB2 0AW, United Kingdom
| | - Omer Ali Bayraktar
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, United Kingdom
| |
Collapse
|
28
|
Han S, Xu Q, Du Y, Tang C, Cui H, Xia X, Zheng R, Sun Y, Shang H. Single-cell spatial transcriptomics in cardiovascular development, disease, and medicine. Genes Dis 2024; 11:101163. [PMID: 39224111 PMCID: PMC11367031 DOI: 10.1016/j.gendis.2023.101163] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Revised: 10/17/2023] [Accepted: 10/29/2023] [Indexed: 09/04/2024] Open
Abstract
Cardiovascular diseases (CVDs) impose a significant burden worldwide. Despite the elucidation of the etiology and underlying molecular mechanisms of CVDs by numerous studies and recent discovery of effective drugs, their morbidity, disability, and mortality are still high. Therefore, precise risk stratification and effective targeted therapies for CVDs are warranted. Recent improvements in single-cell RNA sequencing and spatial transcriptomics have improved our understanding of the mechanisms and cells involved in cardiovascular phylogeny and CVDs. Single-cell RNA sequencing can facilitate the study of the human heart at remarkably high resolution and cellular and molecular heterogeneity. However, this technique does not provide spatial information, which is essential for understanding homeostasis and disease. Spatial transcriptomics can elucidate intracellular interactions, transcription factor distribution, cell spatial localization, and molecular profiles of mRNA and identify cell populations causing the disease and their underlying mechanisms, including cell crosstalk. Herein, we introduce the main methods of RNA-seq and spatial transcriptomics analysis and highlight the latest advances in cardiovascular research. We conclude that single-cell RNA sequencing interprets disease progression in multiple dimensions, levels, perspectives, and dynamics by combining spatial and temporal characterization of the clinical phenome with multidisciplinary techniques such as spatial transcriptomics. This aligns with the dynamic evolution of CVDs (e.g., "angina-myocardial infarction-heart failure" in coronary artery disease). The study of pathways for disease onset and mechanisms (e.g., age, sex, comorbidities) in different patient subgroups should improve disease diagnosis and risk stratification. This can facilitate precise individualized treatment of CVDs.
Collapse
Affiliation(s)
- Songjie Han
- Key Laboratory of Chinese Internal Medicine of Ministry of Education, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing 100700, China
| | - Qianqian Xu
- Key Laboratory of Chinese Internal Medicine of Ministry of Education, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing 100700, China
| | - Yawen Du
- Key Laboratory of Chinese Internal Medicine of Ministry of Education, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing 100700, China
| | - Chuwei Tang
- Key Laboratory of Chinese Internal Medicine of Ministry of Education, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing 100700, China
| | - Herong Cui
- Key Laboratory of Chinese Internal Medicine of Ministry of Education, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing 100700, China
- School of Life Sciences, Beijing University of Chinese Medicine, Beijing 102488, China
| | - Xiaofeng Xia
- Key Laboratory of Chinese Internal Medicine of Ministry of Education, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing 100700, China
| | - Rui Zheng
- Key Laboratory of Chinese Internal Medicine of Ministry of Education, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing 100700, China
| | - Yang Sun
- Key Laboratory of Chinese Internal Medicine of Ministry of Education, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing 100700, China
| | - Hongcai Shang
- Key Laboratory of Chinese Internal Medicine of Ministry of Education, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing 100700, China
| |
Collapse
|
29
|
Xiu YH, Sun SL, Zhou BW, Wan Y, Tang H, Long HX. DGSIST: Clustering spatial transcriptome data based on deep graph structure Infomax. Methods 2024; 231:226-236. [PMID: 39413889 DOI: 10.1016/j.ymeth.2024.10.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2024] [Revised: 09/26/2024] [Accepted: 10/04/2024] [Indexed: 10/18/2024] Open
Abstract
Although spatial transcriptomics data provide valuable insights into gene expression profiles and the spatial structure of tissues, most studies rely solely on gene expression information, underutilizing the spatial data. To fully leverage the potential of spatial transcriptomics and graph neural networks, the DGSI (Deep Graph Structure Infomax) model is proposed. This innovative graph data processing model uses graph convolutional neural networks and employs an unsupervised learning approach. It maximizes the mutual information between graph-level and node-level representations, emphasizing flexible sampling and aggregation of nodes and their neighbors. This effectively captures and incorporates local information from nodes into the overall graph structure. Additionally, this paper developed the DGSIST framework, an unsupervised cell clustering method that integrates the DGSI model, SVD dimensionality reduction algorithm, and k-means++ clustering algorithm. This aims to identify cell types accurately. DGSIST fully uses spatial transcriptomics data and outperforms existing methods in accuracy. Demonstrations of DGSIST's capability across various tissue types and technological platforms have shown its effectiveness in accurately identifying spatial domains in multiple tissue sections. Compared to other spatial clustering methods, DGSIST excels in cell clustering and effectively eliminates batch effects without needing batch correction. DGSIST excels in spatial clustering analysis, spatial variation identification, and differential gene expression detection and directly applies to graph analysis tasks, such as node classification, link prediction, or graph clustering. Anticipation lies in the contribution of the DGSIST framework to a deeper understanding of the spatial organizational structures of diseases such as cancer.
Collapse
Affiliation(s)
- Yu-Han Xiu
- College of Information Science Technology, Hainan Normal University, HaiKou City 571158, China; Key Laboratory of Data Science and Smart Education, Ministry of Education, Hainan Normal University, HaiKou City 571158, China
| | - Si-Lin Sun
- College of Information Science Technology, Hainan Normal University, HaiKou City 571158, China; Key Laboratory of Data Science and Smart Education, Ministry of Education, Hainan Normal University, HaiKou City 571158, China
| | - Bing-Wei Zhou
- College of Information Science Technology, Hainan Normal University, HaiKou City 571158, China; Key Laboratory of Data Science and Smart Education, Ministry of Education, Hainan Normal University, HaiKou City 571158, China
| | - Ying Wan
- School of Basic Medical Sciences, Southwest Medical University, Luzhou 646000, China
| | - Hua Tang
- School of Basic Medical Sciences, Southwest Medical University, Luzhou 646000, China; Medical Engineering & Medical Informatics Integration and Transformational Medicine Key Laboratory of Luzhou City, Luzhou 646000, China.
| | - Hai-Xia Long
- College of Information Science Technology, Hainan Normal University, HaiKou City 571158, China; Key Laboratory of Data Science and Smart Education, Ministry of Education, Hainan Normal University, HaiKou City 571158, China.
| |
Collapse
|
30
|
Zeng Y, Song Y, Zhang C, Li H, Zhao Y, Yu W, Zhang S, Zhang H, Dai Z, Yang Y. Imputing spatial transcriptomics through gene network constructed from protein language model. Commun Biol 2024; 7:1271. [PMID: 39369061 PMCID: PMC11455941 DOI: 10.1038/s42003-024-06964-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2024] [Accepted: 09/25/2024] [Indexed: 10/07/2024] Open
Abstract
Image-based spatial transcriptomic sequencing technologies have enabled the measurement of gene expression at single-cell resolution, but with a limited number of genes. Current computational approaches attempt to overcome these limitations by imputing missing genes, but face challenges regarding prediction accuracy and identification of cell populations due to the neglect of gene-gene relationships. In this context, we present stImpute, a method to impute spatial transcriptomics according to reference scRNA-seq data based on the gene network constructed from the protein language model ESM-2. Specifically, stImpute employs an autoencoder to create gene expression embeddings for both spatial transcriptomics and scRNA-seq data, which are used to identify the nearest neighboring cells between scRNA-seq and spatial transcriptomics datasets. According to the neighbored cells, the gene expressions of spatial transcriptomics cells are imputed through a graph neural network, where nodes are genes, and edges are based on cosine similarity between the ESM-2 embeddings of the gene-encoding proteins. The gene prediction uncertainty is further measured through a deep learning model. stImpute was shown to consistently outperform state-of-the-art methods across multiple datasets concerning imputation and clustering. stImpute also demonstrates robustness in producing consistent results that are insensitive to model parameters.
Collapse
Affiliation(s)
- Yuansong Zeng
- School of Big Data and Software Engineering, Chongqing University, Chongqing, China
- School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China
- Jinfeng Laboratory, Chongqing, China
| | - Yujie Song
- School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China
| | - Chengyang Zhang
- School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China
| | - Haoxuan Li
- School of Big Data and Software Engineering, Chongqing University, Chongqing, China
| | - Yongkang Zhao
- School of Big Data and Software Engineering, Chongqing University, Chongqing, China
| | - Weijiang Yu
- School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China
| | - Shiqi Zhang
- School of Big Data and Software Engineering, Chongqing University, Chongqing, China
| | - Hongyu Zhang
- School of Big Data and Software Engineering, Chongqing University, Chongqing, China
| | - Zhiming Dai
- School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China.
| | - Yuedong Yang
- School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China.
| |
Collapse
|
31
|
Wang N, Hong W, Wu Y, Chen Z, Bai M, Wang W, Zhu J. Next-generation spatial transcriptomics: unleashing the power to gear up translational oncology. MedComm (Beijing) 2024; 5:e765. [PMID: 39376738 PMCID: PMC11456678 DOI: 10.1002/mco2.765] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2024] [Revised: 08/30/2024] [Accepted: 09/03/2024] [Indexed: 10/09/2024] Open
Abstract
The growing advances in spatial transcriptomics (ST) stand as the new frontier bringing unprecedented influences in the realm of translational oncology. This has triggered systemic experimental design, analytical scope, and depth alongside with thorough bioinformatics approaches being constantly developed in the last few years. However, harnessing the power of spatial biology and streamlining an array of ST tools to achieve designated research goals are fundamental and require real-world experiences. We present a systemic review by updating the technical scope of ST across different principal basis in a timeline manner hinting on the generally adopted ST techniques used within the community. We also review the current progress of bioinformatic tools and propose in a pipelined workflow with a toolbox available for ST data exploration. With particular interests in tumor microenvironment where ST is being broadly utilized, we summarize the up-to-date progress made via ST-based technologies by narrating studies categorized into either mechanistic elucidation or biomarker profiling (translational oncology) across multiple cancer types and their ways of deploying the research through ST. This updated review offers as a guidance with forward-looking viewpoints endorsed by many high-resolution ST tools being utilized to disentangle biological questions that may lead to clinical significance in the future.
Collapse
Affiliation(s)
- Nan Wang
- Cosmos Wisdom Biotech Co. LtdHangzhouChina
| | - Weifeng Hong
- Department of Radiation OncologyZhejiang Cancer HospitalHangzhouChina
- Hangzhou Institute of Medicine (HIM)Chinese Academy of SciencesHangzhouChina
- Zhejiang Key Laboratory of Radiation OncologyHangzhouChina
| | - Yixing Wu
- Department of Pulmonary and Critical Care MedicineZhongshan HospitalFudan UniversityShanghaiChina
| | - Zhe‐Sheng Chen
- Department of Pharmaceutical SciencesCollege of Pharmacy and Health SciencesInstitute for BiotechnologySt. John's UniversityQueensNew YorkUSA
| | - Minghua Bai
- Department of Radiation OncologyZhejiang Cancer HospitalHangzhouChina
- Hangzhou Institute of Medicine (HIM)Chinese Academy of SciencesHangzhouChina
- Zhejiang Key Laboratory of Radiation OncologyHangzhouChina
| | | | - Ji Zhu
- Department of Radiation OncologyZhejiang Cancer HospitalHangzhouChina
- Hangzhou Institute of Medicine (HIM)Chinese Academy of SciencesHangzhouChina
- Zhejiang Key Laboratory of Radiation OncologyHangzhouChina
| |
Collapse
|
32
|
Shen R, Cheng M, Wang W, Fan Q, Yan H, Wen J, Yuan Z, Yao J, Li Y, Yuan J. Graph domain adaptation-based framework for gene expression enhancement and cell type identification in large-scale spatially resolved transcriptomics. Brief Bioinform 2024; 25:bbae576. [PMID: 39508445 PMCID: PMC11541786 DOI: 10.1093/bib/bbae576] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2024] [Revised: 09/25/2024] [Accepted: 10/29/2024] [Indexed: 11/15/2024] Open
Abstract
Spatially resolved transcriptomics (SRT) technologies facilitate gene expression profiling with spatial resolution in a naïve state. Nevertheless, current SRT technologies exhibit limitations, manifesting as either low transcript detection sensitivity or restricted gene throughput. These constraints result in diminished precision and coverage in gene measurement. In response, we introduce SpaGDA, a sophisticated deep learning-based graph domain adaptation framework for both scenarios of gene expression imputation and cell type identification in spatially resolved transcriptomics data by impartially transferring knowledge from reference scRNA-seq data. Systematic benchmarking analyses across several SRT datasets generated from different technologies have demonstrated SpaGDA's superior effectiveness compared to state-of-the-art methods in both scenarios. Further applied to three SRT datasets of different biological contexts, SpaGDA not only better recovers the well-established knowledge sourced from public atlases and existing scientific literature but also yields a more informative spatial expression pattern of genes. Together, these results demonstrate that SpaGDA can be used to overcome the challenges of current SRT data and provide more accurate insights into biological processes or disease development. The SpaGDA is available in https://github.com/shenrb/SpaGDA.
Collapse
Affiliation(s)
- Rongbo Shen
- GMU-GIBH Joint School of Life Sciences, The Guangdong-Hong Kong-Macau Joint Laboratory for Cell Fate Regulation and Diseases, Guangzhou Medical University, No. 1 Xinzao Road, Xinzao Town, Panyu District, Guangzhou 510005, China
- Guangzhou National Laboratory, No. 9 XingDaoHuanBei Road, Guangzhou International Bio Island, Guangzhou 510005, Guangdong Province, China
- Tencent AI Lab, Shenzhen 518000, China
| | - Meiling Cheng
- Guangzhou National Laboratory, No. 9 XingDaoHuanBei Road, Guangzhou International Bio Island, Guangzhou 510005, Guangdong Province, China
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular Imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Luoyu Road 1037, Wuhan 430074, China
| | - Wencang Wang
- Guangzhou National Laboratory, No. 9 XingDaoHuanBei Road, Guangzhou International Bio Island, Guangzhou 510005, Guangdong Province, China
| | - Qi Fan
- Guangzhou National Laboratory, No. 9 XingDaoHuanBei Road, Guangzhou International Bio Island, Guangzhou 510005, Guangdong Province, China
| | - Huan Yan
- Guangzhou National Laboratory, No. 9 XingDaoHuanBei Road, Guangzhou International Bio Island, Guangzhou 510005, Guangdong Province, China
| | - Jiayue Wen
- Guangzhou National Laboratory, No. 9 XingDaoHuanBei Road, Guangzhou International Bio Island, Guangzhou 510005, Guangdong Province, China
| | - Zhiyuan Yuan
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Handan Road, Shanghai 200433, China
| | | | - Yixue Li
- GMU-GIBH Joint School of Life Sciences, The Guangdong-Hong Kong-Macau Joint Laboratory for Cell Fate Regulation and Diseases, Guangzhou Medical University, No. 1 Xinzao Road, Xinzao Town, Panyu District, Guangzhou 510005, China
- Guangzhou National Laboratory, No. 9 XingDaoHuanBei Road, Guangzhou International Bio Island, Guangzhou 510005, Guangdong Province, China
| | - Jiao Yuan
- GMU-GIBH Joint School of Life Sciences, The Guangdong-Hong Kong-Macau Joint Laboratory for Cell Fate Regulation and Diseases, Guangzhou Medical University, No. 1 Xinzao Road, Xinzao Town, Panyu District, Guangzhou 510005, China
- Guangzhou National Laboratory, No. 9 XingDaoHuanBei Road, Guangzhou International Bio Island, Guangzhou 510005, Guangdong Province, China
| |
Collapse
|
33
|
Coullomb A, Monsarrat P, Pancaldi V. mosna reveals different types of cellular interactions predictive of response to immunotherapies and survival in cancer. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.03.16.532947. [PMID: 36993595 PMCID: PMC10055099 DOI: 10.1101/2023.03.16.532947] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Spatially resolved omics enable the discovery of tissue organization of biological or clinical importance. Despite the existence of several methods, performing a rational analysis including multiple algorithms while integrating different conditions such as clinical data is still not trivial. To make such investigations more accessible, we developed mosna, a Python package to analyze spatial omics data with respect to clinical or biological data and to gain insight on cell interaction patterns or tissue architecture of biological relevance. mosna is compatible with all spatial omics methods, it leverages tysserand to build accurate spatial networks, and is compatible with Squidpy. It proposes an analysis pipeline, in which increasingly complex features computed at each step can be explored in integration with clinical data, either with easy-to-use descriptive statistics and data visualization, or by seamlessly training machine learning models and identifying variables with the most predictive power. mosna can take as input any dataset produced by spatial omics methods, including sub-cellular resolved transcriptomics (MERFISH, seqFISH) and proteomics (CODEX, MIBI-TOF, low-plex immuno-fluorescence), as well as spot-based spatial transcriptomics (10x Visium). Integration with experimental metadata or clinical data is adapted to binary conditions, such as biological treatments or response status of patients, and to survival data. We demonstrate the proposed analysis pipeline on two spatially resolved proteomic datasets containing either binary response to immunotherapy or survival data. mosna identifies features describing cellular composition and spatial distribution that can provide biological insight regarding factors that affect response to immunotherapies or survival.
Collapse
Affiliation(s)
- Alexis Coullomb
- CRCT, Université de Toulouse, Inserm, CNRS, Université Toulouse III-Paul Sabatier, Centre de Recherches en Cancérologie de Toulouse, Toulouse, France
- RESTORE Research Center, Université de Toulouse, INSERM 1301, CNRS 5070, EFS, ENVT, Toulouse, France
| | - Paul Monsarrat
- RESTORE Research Center, Université de Toulouse, INSERM 1301, CNRS 5070, EFS, ENVT, Toulouse, France
- Oral Medicine Department and Hospital of Toulouse - Toulouse Institute of Oral Medicine and Science, CHU de Toulouse, Toulouse, France
- Artificial and Natural Intelligence Toulouse Institute ANITI, Toulouse, France
| | - Vera Pancaldi
- CRCT, Université de Toulouse, Inserm, CNRS, Université Toulouse III-Paul Sabatier, Centre de Recherches en Cancérologie de Toulouse, Toulouse, France
| |
Collapse
|
34
|
Liu W, Wang B, Bai Y, Liang X, Xue L, Luo J. SpaGIC: graph-informed clustering in spatial transcriptomics via self-supervised contrastive learning. Brief Bioinform 2024; 25:bbae578. [PMID: 39541189 PMCID: PMC11562840 DOI: 10.1093/bib/bbae578] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2024] [Revised: 09/30/2024] [Accepted: 10/29/2024] [Indexed: 11/16/2024] Open
Abstract
Spatial transcriptomics technologies enable the generation of gene expression profiles while preserving spatial context, providing the potential for in-depth understanding of spatial-specific tissue heterogeneity. Leveraging gene and spatial data effectively is fundamental to accurately identifying spatial domains in spatial transcriptomics analysis. However, many existing methods have not yet fully exploited the local neighborhood details within spatial information. To address this issue, we introduce SpaGIC, a novel graph-based deep learning framework integrating graph convolutional networks and self-supervised contrastive learning techniques. SpaGIC learns meaningful latent embeddings of spots by maximizing both edge-wise and local neighborhood-wise mutual information of graph structures, as well as minimizing the embedding distance between spatially adjacent spots. We evaluated SpaGIC on seven spatial transcriptomics datasets across various technology platforms. The experimental results demonstrated that SpaGIC consistently outperformed existing state-of-the-art methods in several tasks, such as spatial domain identification, data denoising, visualization, and trajectory inference. Additionally, SpaGIC is capable of performing joint analyses of multiple slices, further underscoring its versatility and effectiveness in spatial transcriptomics research.
Collapse
Affiliation(s)
- Wei Liu
- College of Computer Science and Electronic Engineering, Hunan University, Changsha 410083, China
| | - Bo Wang
- College of Computer Science and Electronic Engineering, Hunan University, Changsha 410083, China
| | - Yuting Bai
- College of Computer Science and Electronic Engineering, Hunan University, Changsha 410083, China
| | - Xiao Liang
- College of Computer Science and Electronic Engineering, Hunan University, Changsha 410083, China
| | - Li Xue
- College of Computer Science and Electronic Engineering, Hunan University, Changsha 410083, China
| | - Jiawei Luo
- College of Computer Science and Electronic Engineering, Hunan University, Changsha 410083, China
| |
Collapse
|
35
|
Samaran J, Peyré G, Cantini L. scConfluence: single-cell diagonal integration with regularized Inverse Optimal Transport on weakly connected features. Nat Commun 2024; 15:7762. [PMID: 39237488 PMCID: PMC11377776 DOI: 10.1038/s41467-024-51382-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2024] [Accepted: 08/06/2024] [Indexed: 09/07/2024] Open
Abstract
The abundance of unpaired multimodal single-cell data has motivated a growing body of research into the development of diagonal integration methods. However, the state-of-the-art suffers from the loss of biological information due to feature conversion and struggles with modality-specific populations. To overcome these crucial limitations, we here introduce scConfluence, a method for single-cell diagonal integration. scConfluence combines uncoupled autoencoders on the complete set of features with regularized Inverse Optimal Transport on weakly connected features. We extensively benchmark scConfluence in several single-cell integration scenarios proving that it outperforms the state-of-the-art. We then demonstrate the biological relevance of scConfluence in three applications. We predict spatial patterns for Scgn, Synpr and Olah in scRNA-smFISH integration. We improve the classification of B cells and Monocytes in highly heterogeneous scRNA-scATAC-CyTOF integration. Finally, we reveal the joint contribution of Fezf2 and apical dendrite morphology in Intra Telencephalic neurons, based on morphological images and scRNA.
Collapse
Affiliation(s)
- Jules Samaran
- Institut Pasteur, Université Paris Cité, CNRS UMR 3738, Machine Learning for Integrative Genomics Group, Paris, France
| | - Gabriel Peyré
- CNRS and DMA de l'Ecole Normale Supérieure, CNRS, Ecole Normale Supérieure, Université PSL, Paris, France
| | - Laura Cantini
- Institut Pasteur, Université Paris Cité, CNRS UMR 3738, Machine Learning for Integrative Genomics Group, Paris, France.
| |
Collapse
|
36
|
Duan H, Zhang Q, Cui F, Zou Q, Zhang Z. MVST: Identifying spatial domains of spatial transcriptomes from multiple views using multi-view graph convolutional networks. PLoS Comput Biol 2024; 20:e1012409. [PMID: 39235988 PMCID: PMC11376559 DOI: 10.1371/journal.pcbi.1012409] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2024] [Accepted: 08/09/2024] [Indexed: 09/07/2024] Open
Abstract
Spatial transcriptome technology can parse transcriptomic data at the spatial level to detect high-throughput gene expression and preserve information regarding the spatial structure of tissues. Identifying spatial domains, that is identifying regions with similarities in gene expression and histology, is the most basic and critical aspect of spatial transcriptome data analysis. Most current methods identify spatial domains only through a single view, which may obscure certain important information and thus fail to make full use of the information embedded in spatial transcriptome data. Therefore, we propose an unsupervised clustering framework based on multiview graph convolutional networks (MVST) to achieve accurate spatial domain recognition by the learning graph embedding features of neighborhood graphs constructed from gene expression information, spatial location information, and histopathological image information through multiview graph convolutional networks. By exploring spatial transcriptomes from multiple views, MVST enables data from all parts of the spatial transcriptome to be comprehensively and fully utilized to obtain more accurate spatial expression patterns. We verified the effectiveness of MVST on real spatial transcriptome datasets, the robustness of MVST on some simulated datasets, and the reasonableness of the framework structure of MVST in ablation experiments, and from the experimental results, it is clear that MVST can achieve a more accurate spatial domain identification compared with the current more advanced methods. In conclusion, MVST is a powerful tool for spatial transcriptome research with improved spatial domain recognition.
Collapse
Affiliation(s)
- Hao Duan
- School of Computer Science and Technology, Hainan University, Haikou, China
| | - Qingchen Zhang
- School of Computer Science and Technology, Hainan University, Haikou, China
| | - Feifei Cui
- School of Computer Science and Technology, Hainan University, Haikou, China
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, China
| | - Zilong Zhang
- School of Computer Science and Technology, Hainan University, Haikou, China
| |
Collapse
|
37
|
Andersson A, Behanova A, Avenel C, Windhager J, Malmberg F, Wählby C. Points2Regions: Fast, interactive clustering of imaging-based spatial transcriptomics data. Cytometry A 2024; 105:677-687. [PMID: 38958502 DOI: 10.1002/cyto.a.24884] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2024] [Revised: 05/30/2024] [Accepted: 06/13/2024] [Indexed: 07/04/2024]
Abstract
Imaging-based spatial transcriptomics techniques generate data in the form of spatial points belonging to different mRNA classes. A crucial part of analyzing the data involves the identification of regions with similar composition of mRNA classes. These biologically interesting regions can manifest at different spatial scales. For example, the composition of mRNA classes on a cellular scale corresponds to cell types, whereas compositions on a millimeter scale correspond to tissue-level structures. Traditional techniques for identifying such regions often rely on complementary data, such as pre-segmented cells, or lengthy optimization. This limits their applicability to tasks on a particular scale, restricting their capabilities in exploratory analysis. This article introduces "Points2Regions," a computational tool for identifying regions with similar mRNA compositions. The tool's novelty lies in its rapid feature extraction by rasterizing points (representing mRNAs) onto a pyramidal grid and its efficient clustering using a combination of hierarchical and k -means clustering. This enables fast and efficient region discovery across multiple scales without relying on additional data, making it a valuable resource for exploratory analysis. Points2Regions has demonstrated performance similar to state-of-the-art methods on two simulated datasets, without relying on segmented cells, while being several times faster. Experiments on real-world datasets show that regions identified by Points2Regions are similar to those identified in other studies, confirming that Points2Regions can be used to extract biologically relevant regions. The tool is shared as a Python package integrated into TissUUmaps and a Napari plugin, offering interactive clustering and visualization, significantly enhancing user experience in data exploration.
Collapse
Affiliation(s)
- Axel Andersson
- Department of IT and SciLifeLab BioImage Informatics Facility, Uppsala University, Uppsala, Sweden
| | - Andrea Behanova
- Department of IT and SciLifeLab BioImage Informatics Facility, Uppsala University, Uppsala, Sweden
| | - Christophe Avenel
- Department of IT and SciLifeLab BioImage Informatics Facility, Uppsala University, Uppsala, Sweden
| | - Jonas Windhager
- Department of IT and SciLifeLab BioImage Informatics Facility, Uppsala University, Uppsala, Sweden
| | - Filip Malmberg
- Department of IT and SciLifeLab BioImage Informatics Facility, Uppsala University, Uppsala, Sweden
| | - Carolina Wählby
- Department of IT and SciLifeLab BioImage Informatics Facility, Uppsala University, Uppsala, Sweden
| |
Collapse
|
38
|
Liu L, Chen A, Li Y, Mulder J, Heyn H, Xu X. Spatiotemporal omics for biology and medicine. Cell 2024; 187:4488-4519. [PMID: 39178830 DOI: 10.1016/j.cell.2024.07.040] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2024] [Revised: 07/05/2024] [Accepted: 07/23/2024] [Indexed: 08/26/2024]
Abstract
The completion of the Human Genome Project has provided a foundational blueprint for understanding human life. Nonetheless, understanding the intricate mechanisms through which our genetic blueprint is involved in disease or orchestrates development across temporal and spatial dimensions remains a profound scientific challenge. Recent breakthroughs in cellular omics technologies have paved new pathways for understanding the regulation of genomic elements and the relationship between gene expression, cellular functions, and cell fate determination. The advent of spatial omics technologies, encompassing both imaging and sequencing-based methodologies, has enabled a comprehensive understanding of biological processes from a cellular ecosystem perspective. This review offers an updated overview of how spatial omics has advanced our understanding of the translation of genetic information into cellular heterogeneity and tissue structural organization and their dynamic changes over time. It emphasizes the discovery of various biological phenomena, related to organ functionality, embryogenesis, species evolution, and the pathogenesis of diseases.
Collapse
Affiliation(s)
| | - Ao Chen
- BGI Research, Shenzhen 518083, China
| | | | - Jan Mulder
- Department of Neuroscience, Karolinska Institute, Stockholm, Sweden
| | - Holger Heyn
- Centro Nacional de Análisis Genómico (CNAG), Barcelona, Spain
| | - Xun Xu
- BGI Research, Hangzhou 310030, China; BGI Research, Shenzhen 518083, China.
| |
Collapse
|
39
|
Poovathingal S, Davie K, Borm LE, Vandepoel R, Poulvellarie N, Verfaillie A, Corthout N, Aerts S. Nova-ST: Nano-patterned ultra-dense platform for spatial transcriptomics. CELL REPORTS METHODS 2024; 4:100831. [PMID: 39111312 PMCID: PMC11384075 DOI: 10.1016/j.crmeth.2024.100831] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/09/2024] [Revised: 06/07/2024] [Accepted: 07/11/2024] [Indexed: 08/22/2024]
Abstract
Spatial transcriptomics workflows using barcoded capture arrays are commonly used for resolving gene expression in tissues. However, existing techniques are either limited by capture array density or are cost prohibitive for large-scale atlasing. We present Nova-ST, a dense nano-patterned spatial transcriptomics technique derived from randomly barcoded Illumina sequencing flow cells. Nova-ST enables customized, low-cost, flexible, and high-resolution spatial profiling of large tissue sections. Benchmarking on mouse brain sections demonstrates significantly higher sensitivity compared to existing methods at a reduced cost.
Collapse
Affiliation(s)
- Suresh Poovathingal
- VIB Center for AI & Computational Biology (VIB.AI), 3000 Leuven, Belgium; VIB-KU Leuven Center for Brain & Disease Research, CBD Technologies, Single Cell & Microfluidics Expertise Unit, 3000 Leuven, Belgium; Aligning Science Across Parkinson's (ASAP) Collaborative Research Network, Chevy Chase, MD 20815, USA.
| | - Kristofer Davie
- VIB Center for AI & Computational Biology (VIB.AI), 3000 Leuven, Belgium; VIB-KU Leuven Center for Brain & Disease Research, CBD Technologies, Single Cell Bioinformatics Expertise Unit, 3000 Leuven, Belgium
| | - Lars E Borm
- VIB Center for AI & Computational Biology (VIB.AI), 3000 Leuven, Belgium; VIB-KU Leuven Center for Brain & Disease Research, 3000 Leuven, Belgium; Department of Human Genetics, KU Leuven, 3000 Leuven, Belgium
| | - Roel Vandepoel
- VIB Center for AI & Computational Biology (VIB.AI), 3000 Leuven, Belgium; VIB-KU Leuven Center for Brain & Disease Research, 3000 Leuven, Belgium; Department of Human Genetics, KU Leuven, 3000 Leuven, Belgium
| | - Nicolas Poulvellarie
- VIB-KU Leuven Center for Brain & Disease Research, CBD Technologies, Neuroengineering Expertise Unit, 3000 Leuven, Belgium
| | | | - Nikky Corthout
- Bio Imaging Core, VIB-KU Leuven Center for Brain & Disease Research, 3000 Leuven, Belgium
| | - Stein Aerts
- VIB Center for AI & Computational Biology (VIB.AI), 3000 Leuven, Belgium; VIB-KU Leuven Center for Brain & Disease Research, 3000 Leuven, Belgium; Aligning Science Across Parkinson's (ASAP) Collaborative Research Network, Chevy Chase, MD 20815, USA; Department of Human Genetics, KU Leuven, 3000 Leuven, Belgium.
| |
Collapse
|
40
|
Yang W, Wang P, Xu S, Wang T, Luo M, Cai Y, Xu C, Xue G, Que J, Ding Q, Jin X, Yang Y, Pang F, Pang B, Lin Y, Nie H, Xu Z, Ji Y, Jiang Q. Deciphering cell-cell communication at single-cell resolution for spatial transcriptomics with subgraph-based graph attention network. Nat Commun 2024; 15:7101. [PMID: 39155292 PMCID: PMC11330978 DOI: 10.1038/s41467-024-51329-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2024] [Accepted: 08/01/2024] [Indexed: 08/20/2024] Open
Abstract
The inference of cell-cell communication (CCC) is crucial for a better understanding of complex cellular dynamics and regulatory mechanisms in biological systems. However, accurately inferring spatial CCCs at single-cell resolution remains a significant challenge. To address this issue, we present a versatile method, called DeepTalk, to infer spatial CCC at single-cell resolution by integrating single-cell RNA sequencing (scRNA-seq) data and spatial transcriptomics (ST) data. DeepTalk utilizes graph attention network (GAT) to integrate scRNA-seq and ST data, which enables accurate cell-type identification for single-cell ST data and deconvolution for spot-based ST data. Then, DeepTalk can capture the connections among cells at multiple levels using subgraph-based GAT, and further achieve spatially resolved CCC inference at single-cell resolution. DeepTalk achieves excellent performance in discovering meaningful spatial CCCs on multiple cross-platform datasets, which demonstrates its superior ability to dissect cellular behavior within intricate biological processes.
Collapse
Affiliation(s)
- Wenyi Yang
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, China
| | - Pingping Wang
- School of Interdisciplinary Medicine and Engineering, Harbin Medical University, Harbin, China
| | - Shouping Xu
- Department of Breast Cancer, Harbin Medical University Cancer Hospital, Harbin, China
| | - Tao Wang
- School of Computer Science, Northwestern Polytechnical University, Xi'an, China
| | - Meng Luo
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, China
| | - Yideng Cai
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, China
| | - Chang Xu
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, China
| | - Guangfu Xue
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, China
| | - Jinhao Que
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, China
| | - Qian Ding
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, China
| | - Xiyun Jin
- School of Interdisciplinary Medicine and Engineering, Harbin Medical University, Harbin, China
| | - Yuexin Yang
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, China
| | - Fenglan Pang
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, China
| | - Boran Pang
- Center for Difficult and Complicated Abdominal Surgery, Shanghai Tenth People's Hospital, Tongji University School of Medicine, Shanghai, China
| | - Yi Lin
- School of Interdisciplinary Medicine and Engineering, Harbin Medical University, Harbin, China
| | - Huan Nie
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, China
| | - Zhaochun Xu
- School of Interdisciplinary Medicine and Engineering, Harbin Medical University, Harbin, China.
| | - Yong Ji
- State Key Laboratory of Frigid Zone Cardiovascular Diseases (SKLFZCD), Harbin Medical University, Harbin, China.
| | - Qinghua Jiang
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, China.
- School of Interdisciplinary Medicine and Engineering, Harbin Medical University, Harbin, China.
| |
Collapse
|
41
|
Kumar R, Kolloli A, Subbian S, Kaushal D, Shi L, Tyagi S. Imaging the Architecture of Granulomas Induced by Mycobacterium tuberculosis Infection with Single-molecule Fluorescence In Situ Hybridization. JOURNAL OF IMMUNOLOGY (BALTIMORE, MD. : 1950) 2024; 213:526-537. [PMID: 38912840 PMCID: PMC11407750 DOI: 10.4049/jimmunol.2300068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/25/2023] [Accepted: 05/30/2024] [Indexed: 06/25/2024]
Abstract
Granulomas are an important hallmark of Mycobacterium tuberculosis infection. They are organized and dynamic structures created when immune cells assemble around the sites of infection in the lungs that locally restrict M. tuberculosis growth and the host's inflammatory responses. The cellular architecture of granulomas is traditionally studied by immunofluorescence labeling of surface markers on the host cells. However, very few Abs are available for model animals used in tuberculosis research, such as nonhuman primates and rabbits, and secreted immunological markers such as cytokines cannot be imaged in situ using Abs. Furthermore, traditional phenotypic surface markers do not provide sufficient resolution for the detection of the many subtypes and differentiation states of immune cells. Using single-molecule fluorescence in situ hybridization (smFISH) and its derivatives, amplified smFISH and iterative smFISH, we developed a platform for imaging mRNAs encoding immune markers in rabbit and macaque tuberculosis granulomas. Multiplexed imaging for several mRNA and protein markers was followed by quantitative measurement of the expression of these markers in single cells. An analysis of the combinatorial expressions of these markers allowed us to classify the cells into several subtypes, and to chart their densities within granulomas. For one mRNA target, hypoxia-inducible factor-1α, we imaged its mRNA and protein in the same cells, demonstrating the specificity of the probes. This method paves the way for defining granular differentiation states and cell subtypes from transcriptomic data, identifying key mRNA markers for these cell subtypes, and then locating the cells in the spatial context of granulomas.
Collapse
Affiliation(s)
- Ranjeet Kumar
- Public Health Research Institute, New Jersey Medical School, Rutgers University, Newark, New Jersey
| | - Afsal Kolloli
- Public Health Research Institute, New Jersey Medical School, Rutgers University, Newark, New Jersey
| | - Selvakumar Subbian
- Public Health Research Institute, New Jersey Medical School, Rutgers University, Newark, New Jersey
- Department of Medicine, New Jersey Medical School, Rutgers University, Newark, New Jersey
| | - Deepak Kaushal
- Southwest National Primate Research Center, Texas Biomedical Research Institute, San Antonio, Texas
| | - Lanbo Shi
- Public Health Research Institute, New Jersey Medical School, Rutgers University, Newark, New Jersey
- Department of Medicine, New Jersey Medical School, Rutgers University, Newark, New Jersey
| | - Sanjay Tyagi
- Public Health Research Institute, New Jersey Medical School, Rutgers University, Newark, New Jersey
- Department of Medicine, New Jersey Medical School, Rutgers University, Newark, New Jersey
| |
Collapse
|
42
|
Lee H, Langseth CM, Salas SM, Sariyar S, Metousis A, Rueda-Alaña E, Bekiari C, Lundberg E, Garcı A-Moreno F, Grillo M, Nilsson M. Open-source, high-throughput targeted in situ transcriptomics for developmental and tissue biology. Development 2024; 151:dev202448. [PMID: 39099456 PMCID: PMC11385644 DOI: 10.1242/dev.202448] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Accepted: 07/19/2024] [Indexed: 08/06/2024]
Abstract
Multiplexed spatial profiling of mRNAs has recently gained traction as a tool to explore the cellular diversity and the architecture of tissues. We propose a sensitive, open-source, simple and flexible method for the generation of in situ expression maps of hundreds of genes. We use direct ligation of padlock probes on mRNAs, coupled with rolling circle amplification and hybridization-based in situ combinatorial barcoding, to achieve high detection efficiency, high-throughput and large multiplexing. We validate the method across a number of species and show its use in combination with orthogonal methods such as antibody staining, highlighting its potential value for developmental and tissue biology studies. Finally, we provide an end-to-end computational workflow that covers the steps of probe design, image processing, data extraction, cell segmentation, clustering and annotation of cell types. By enabling easier access to high-throughput spatially resolved transcriptomics, we hope to encourage a diversity of applications and the exploration of a wide range of biological questions.
Collapse
Affiliation(s)
- Hower Lee
- Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, 171 65 Solna, Sweden
| | | | - Sergio Marco Salas
- Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, 171 65 Solna, Sweden
| | - Sanem Sariyar
- Science for Life Laboratory, Department of Protein Science, KTH - Royal Institute of Technology, 17165Stockholm, Sweden
| | - Andreas Metousis
- Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, 171 65 Solna, Sweden
| | - Eneritz Rueda-Alaña
- Achucarro Basque Center for Neuroscience, Scientific Park of the University of the Basque Country (UPV/EHU), 48940 Leioa, Spain
- Department of Neuroscience, Faculty of Medicine and Odontology, UPV/EHU, Barrio Sarriena s/n, 48940 Leioa, Bizkaia, Spain
| | - Christina Bekiari
- Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, 171 65 Solna, Sweden
| | - Emma Lundberg
- Science for Life Laboratory, Department of Protein Science, KTH - Royal Institute of Technology, 17165Stockholm, Sweden
- Department of Bioengineering, Stanford University, Stanford, CA 94305, USA
- Department of Pathology, Stanford University, Stanford, CA 94305, USA
| | - Fernando Garcı A-Moreno
- Achucarro Basque Center for Neuroscience, Scientific Park of the University of the Basque Country (UPV/EHU), 48940 Leioa, Spain
- Department of Neuroscience, Faculty of Medicine and Odontology, UPV/EHU, Barrio Sarriena s/n, 48940 Leioa, Bizkaia, Spain
- IKERBASQUE Foundation, María Díaz de Haro 3, 6th Floor, 48013 BilbaoSpain
| | - Marco Grillo
- Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, 171 65 Solna, Sweden
| | - Mats Nilsson
- Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, 171 65 Solna, Sweden
| |
Collapse
|
43
|
Li Y, Luo Y. STdGCN: spatial transcriptomic cell-type deconvolution using graph convolutional networks. Genome Biol 2024; 25:206. [PMID: 39103939 DOI: 10.1186/s13059-024-03353-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2023] [Accepted: 07/26/2024] [Indexed: 08/07/2024] Open
Abstract
Spatially resolved transcriptomics integrates high-throughput transcriptome measurements with preserved spatial cellular organization information. However, many technologies cannot reach single-cell resolution. We present STdGCN, a graph model leveraging single-cell RNA sequencing (scRNA-seq) as reference for cell-type deconvolution in spatial transcriptomic (ST) data. STdGCN incorporates expression profiles from scRNA-seq and spatial localization from ST data for deconvolution. Extensive benchmarking on multiple datasets demonstrates that STdGCN outperforms 17 state-of-the-art models. In a human breast cancer Visium dataset, STdGCN delineates stroma, lymphocytes, and cancer cells, aiding tumor microenvironment analysis. In human heart ST data, STdGCN identifies changes in endothelial-cardiomyocyte communications during tissue development.
Collapse
Affiliation(s)
- Yawei Li
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA
- Center for Collaborative AI in Healthcare, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA
| | - Yuan Luo
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA.
- Center for Collaborative AI in Healthcare, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA.
| |
Collapse
|
44
|
Yao D, Binan L, Bezney J, Simonton B, Freedman J, Frangieh CJ, Dey K, Geiger-Schuller K, Eraslan B, Gusev A, Regev A, Cleary B. Scalable genetic screening for regulatory circuits using compressed Perturb-seq. Nat Biotechnol 2024; 42:1282-1295. [PMID: 37872410 PMCID: PMC11035494 DOI: 10.1038/s41587-023-01964-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2023] [Accepted: 08/22/2023] [Indexed: 10/25/2023]
Abstract
Pooled CRISPR screens with single-cell RNA sequencing readout (Perturb-seq) have emerged as a key technique in functional genomics, but they are limited in scale by cost and combinatorial complexity. In this study, we modified the design of Perturb-seq by incorporating algorithms applied to random, low-dimensional observations. Compressed Perturb-seq measures multiple random perturbations per cell or multiple cells per droplet and computationally decompresses these measurements by leveraging the sparse structure of regulatory circuits. Applied to 598 genes in the immune response to bacterial lipopolysaccharide, compressed Perturb-seq achieves the same accuracy as conventional Perturb-seq with an order of magnitude cost reduction and greater power to learn genetic interactions. We identified known and novel regulators of immune responses and uncovered evolutionarily constrained genes with downstream targets enriched for immune disease heritability, including many missed by existing genome-wide association studies. Our framework enables new scales of interrogation for a foundational method in functional genomics.
Collapse
Affiliation(s)
- Douglas Yao
- Program in Systems, Synthetic, and Quantitative Biology, Harvard University, Cambridge, MA, USA
| | - Loic Binan
- Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Jon Bezney
- Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA
| | - Brooke Simonton
- Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Jahanara Freedman
- Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Chris J Frangieh
- Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Kushal Dey
- Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | | | | | - Alexander Gusev
- Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
- Division of Genetics, Brigham and Women's Hospital, Boston, MA, USA
| | - Aviv Regev
- Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, USA
- Genentech, South San Francisco, CA, USA
| | - Brian Cleary
- Faculty of Computing and Data Sciences, Boston University, Boston, MA, USA.
- Department of Biology, Boston University, Boston, MA, USA.
- Department of Biomedical Engineering, Boston University, Boston, MA, USA.
- Program in Bioinformatics, Boston University, Boston, MA, USA.
- Biological Design Center, Boston University, Boston, MA, USA.
| |
Collapse
|
45
|
Zhang M, Zhang W, Ma X. ST-SCSR: identifying spatial domains in spatial transcriptomics data via structure correlation and self-representation. Brief Bioinform 2024; 25:bbae437. [PMID: 39228303 PMCID: PMC11372132 DOI: 10.1093/bib/bbae437] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2024] [Revised: 07/31/2024] [Accepted: 08/20/2024] [Indexed: 09/05/2024] Open
Abstract
Recent advances in spatial transcriptomics (ST) enable measurements of transcriptome within intact biological tissues by preserving spatial information, offering biologists unprecedented opportunities to comprehensively understand tissue micro-environment, where spatial domains are basic units of tissues. Although great efforts are devoted to this issue, they still have many shortcomings, such as ignoring local information and relations of spatial domains, requiring alternatives to solve these problems. Here, a novel algorithm for spatial domain identification in Spatial Transcriptomics data with Structure Correlation and Self-Representation (ST-SCSR), which integrates local information, global information, and similarity of spatial domains. Specifically, ST-SCSR utilzes matrix tri-factorization to simultaneously decompose expression profiles and spatial network of spots, where expressional and spatial features of spots are fused via the shared factor matrix that interpreted as similarity of spatial domains. Furthermore, ST-SCSR learns affinity graph of spots by manipulating expressional and spatial features, where local preservation and sparse constraints are employed, thereby enhancing the quality of graph. The experimental results demonstrate that ST-SCSR not only outperforms state-of-the-art algorithms in terms of accuracy, but also identifies many potential interesting patterns.
Collapse
Affiliation(s)
- Min Zhang
- School of Computer Science and Technology, Xidian University, No. 2 South Taibai Road, 710071 Xi'an Shaanxi, China
- Key Laboratory of Smart Human-Computer Interaction and Wearable Technology of Shaanxi Province, Xidian University, No. 2 South Taibai Road, 710071 Xi'an Shaanxi, China
| | - Wensheng Zhang
- School of Computer Science and Cyber Engineering, GuangZhou University, No. 230 Wai Huan Xi Road,Guangzhou Higher Education Mega Center, 510006 Guangzhou Guangdong, China
| | - Xiaoke Ma
- School of Computer Science and Technology, Xidian University, No. 2 South Taibai Road, 710071 Xi'an Shaanxi, China
- Key Laboratory of Smart Human-Computer Interaction and Wearable Technology of Shaanxi Province, Xidian University, No. 2 South Taibai Road, 710071 Xi'an Shaanxi, China
| |
Collapse
|
46
|
Mo J, Bae J, Saqib J, Hwang D, Jin Y, Park B, Park J, Kim J. Current computational methods for spatial transcriptomics in cancer biology. Adv Cancer Res 2024; 163:71-106. [PMID: 39271268 DOI: 10.1016/bs.acr.2024.06.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/15/2024]
Abstract
Cells in multicellular organisms constitute a self-organizing society by interacting with their neighbors. Cancer originates from malfunction of cellular behavior in the context of such a self-organizing system. The identities or characteristics of individual tumor cells can be represented by the hallmark of gene expression or transcriptome, which can be addressed using single-cell dissociation followed by RNA sequencing. However, the dissociation process of single cells results in losing the cellular address in tissue or neighbor information of each tumor cell, which is critical to understanding the malfunctioning cellular behavior in the microenvironment. Spatial transcriptomics technology enables measuring the transcriptome which is tagged by the address within a tissue. However, to understand cellular behavior in a self-organizing society, we need to apply mathematical or statistical methods. Here, we provide a review on current computational methods for spatial transcriptomics in cancer biology.
Collapse
Affiliation(s)
- Jaewoo Mo
- School of Systems Biomedical Science, Soongsil University, Dongjak-Gu, Seoul, Republic of Korea
| | - Junseong Bae
- Interdisciplinary Program of Genomic Data Science, Pusan National University, Yangsan, Republic of Korea; Graduate School of Medical AI, Pusan National University, Yangsan, Republic of Korea
| | - Jahanzeb Saqib
- School of Systems Biomedical Science, Soongsil University, Dongjak-Gu, Seoul, Republic of Korea
| | - Dohyun Hwang
- Department of Information Convergence Engineering, Pusan National University, Yangsan, Republic of Korea
| | - Yunjung Jin
- School of Systems Biomedical Science, Soongsil University, Dongjak-Gu, Seoul, Republic of Korea
| | - Beomsu Park
- School of Systems Biomedical Science, Soongsil University, Dongjak-Gu, Seoul, Republic of Korea
| | - Jeongbin Park
- Interdisciplinary Program of Genomic Data Science, Pusan National University, Yangsan, Republic of Korea; Department of Information Convergence Engineering, Pusan National University, Yangsan, Republic of Korea; School of Biomedical Convergence Engineering, Pusan National University, Yangsan, Republic of Korea.
| | - Junil Kim
- School of Systems Biomedical Science, Soongsil University, Dongjak-Gu, Seoul, Republic of Korea.
| |
Collapse
|
47
|
Defard T, Laporte H, Ayan M, Soulier J, Curras-Alonso S, Weber C, Massip F, Londoño-Vallejo JA, Fouillade C, Mueller F, Walter T. A point cloud segmentation framework for image-based spatial transcriptomics. Commun Biol 2024; 7:823. [PMID: 38971915 PMCID: PMC11227573 DOI: 10.1038/s42003-024-06480-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Accepted: 06/20/2024] [Indexed: 07/08/2024] Open
Abstract
Recent progress in image-based spatial RNA profiling enables to spatially resolve tens to hundreds of distinct RNA species with high spatial resolution. It presents new avenues for comprehending tissue organization. In this context, the ability to assign detected RNA transcripts to individual cells is crucial for downstream analyses, such as in-situ cell type calling. Yet, accurate cell segmentation can be challenging in tissue data, in particular in the absence of a high-quality membrane marker. To address this issue, we introduce ComSeg, a segmentation algorithm that operates directly on single RNA positions and that does not come with implicit or explicit priors on cell shape. ComSeg is applicable in complex tissues with arbitrary cell shapes. Through comprehensive evaluations on simulated and experimental datasets, we show that ComSeg outperforms existing state-of-the-art methods for in-situ single-cell RNA profiling and in-situ cell type calling. ComSeg is available as a documented and open source pip package at https://github.com/fish-quant/ComSeg .
Collapse
Affiliation(s)
- Thomas Defard
- Centre for Computational Biology (CBIO), Mines Paris, PSL University, 75006, Paris, France
- Institut Curie, PSL University, 75005, Paris, France
- INSERM, U900, 75005, Paris, France
- Institut Pasteur, Université Paris Cité, Imaging and Modeling Unit, F-75015, Paris, France
- Institut Pasteur, Université Paris Cité, Photonic Bio-Imaging, Centre de Ressources et Recherches Technologiques (UTechS-PBI, C2RT), F-75015, Paris, France
| | - Hugo Laporte
- Institut Curie, Inserm U1021-CNRS UMR 3347, University Paris-Saclay, PSL Research University, Centre Universitaire, Orsay, Cedex, France
- Institute of Cell Biology (Cancer Research), University Hospital Essen, Essen, Germany
| | - Mallick Ayan
- Institut Curie, Inserm U1021-CNRS UMR 3347, University Paris-Saclay, PSL Research University, Centre Universitaire, Orsay, Cedex, France
| | - Juliette Soulier
- Institut Curie, Inserm U1021-CNRS UMR 3347, University Paris-Saclay, PSL Research University, Centre Universitaire, Orsay, Cedex, France
| | - Sandra Curras-Alonso
- Institut Curie, Inserm U1021-CNRS UMR 3347, University Paris-Saclay, PSL Research University, Centre Universitaire, Orsay, Cedex, France
| | - Christian Weber
- Institut Pasteur, Université Paris Cité, Imaging and Modeling Unit, F-75015, Paris, France
- Institut Pasteur, Université Paris Cité, Photonic Bio-Imaging, Centre de Ressources et Recherches Technologiques (UTechS-PBI, C2RT), F-75015, Paris, France
| | - Florian Massip
- Centre for Computational Biology (CBIO), Mines Paris, PSL University, 75006, Paris, France
- Institut Curie, PSL University, 75005, Paris, France
- INSERM, U900, 75005, Paris, France
| | - José-Arturo Londoño-Vallejo
- Institut Curie, Inserm U1021-CNRS UMR 3347, University Paris-Saclay, PSL Research University, Centre Universitaire, Orsay, Cedex, France
| | - Charles Fouillade
- Institut Curie, Inserm U1021-CNRS UMR 3347, University Paris-Saclay, PSL Research University, Centre Universitaire, Orsay, Cedex, France
| | - Florian Mueller
- Institut Pasteur, Université Paris Cité, Imaging and Modeling Unit, F-75015, Paris, France.
- Institut Pasteur, Université Paris Cité, Photonic Bio-Imaging, Centre de Ressources et Recherches Technologiques (UTechS-PBI, C2RT), F-75015, Paris, France.
| | - Thomas Walter
- Centre for Computational Biology (CBIO), Mines Paris, PSL University, 75006, Paris, France.
- Institut Curie, PSL University, 75005, Paris, France.
- INSERM, U900, 75005, Paris, France.
| |
Collapse
|
48
|
Liu Y, Chen J, Lin C, Ke R. Multiplexed in situ RNA imaging by combFISH. Anal Bioanal Chem 2024; 416:3765-3774. [PMID: 38775954 DOI: 10.1007/s00216-024-05327-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2024] [Revised: 04/28/2024] [Accepted: 04/30/2024] [Indexed: 06/18/2024]
Abstract
Multiplexed in situ RNA imaging offers new opportunities for gene expression profiling by providing high-throughput spatial information. In this work, we present a cyclic combinatorial fluorescent in situ hybridization (combFISH) assay to achieve multiplexed detection of RNA in cell cultures and tissues. Specifically, multiplexing is achieved through cyclic interrogation of barcode sequences on the rolling circle amplicons generated from the padlock probe assay by using sets of combinatorial detection probes. Theoretically, combFISH can detect 64 genes in three hybridization cycles by combinatorial barcoding using 12 fluorescently labeled detection probes. Our method eliminates sequencing-by-ligation (SBL) chemistry in the in situ sequencing protocol and directly uses RNA as targets for ligation, making it more straightforward. We showed that our method works in fresh-frozen and formalin-fixed paraffin-embedded tissue sections. With its straightforward protocols, we expect our method to be adopted by the scientific community and extended to clinical settings.
Collapse
Affiliation(s)
- Yanxiu Liu
- School of Medicine, Huaqiao University, Xiamen, 361021, Fujian, China
- School of Biomedical Sciences, Huaqiao University, Xiamen, 361021, Fujian, China
| | - Jiayu Chen
- School of Medicine, Huaqiao University, Xiamen, 361021, Fujian, China
- School of Biomedical Sciences, Huaqiao University, Xiamen, 361021, Fujian, China
| | - Chen Lin
- School of Medicine, Huaqiao University, Xiamen, 361021, Fujian, China.
- School of Biomedical Sciences, Huaqiao University, Xiamen, 361021, Fujian, China.
| | - Rongqin Ke
- School of Medicine, Huaqiao University, Xiamen, 361021, Fujian, China.
- School of Biomedical Sciences, Huaqiao University, Xiamen, 361021, Fujian, China.
| |
Collapse
|
49
|
Sun ED, Ma R, Zou J. SPRITE: improving spatial gene expression imputation with gene and cell networks. Bioinformatics 2024; 40:i521-i528. [PMID: 38940132 PMCID: PMC11211834 DOI: 10.1093/bioinformatics/btae253] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/29/2024] Open
Abstract
MOTIVATION Spatially resolved single-cell transcriptomics have provided unprecedented insights into gene expression in situ, particularly in the context of cell interactions or organization of tissues. However, current technologies for profiling spatial gene expression at single-cell resolution are generally limited to the measurement of a small number of genes. To address this limitation, several algorithms have been developed to impute or predict the expression of additional genes that were not present in the measured gene panel. Current algorithms do not leverage the rich spatial and gene relational information in spatial transcriptomics. To improve spatial gene expression predictions, we introduce Spatial Propagation and Reinforcement of Imputed Transcript Expression (SPRITE) as a meta-algorithm that processes predictions obtained from existing methods by propagating information across gene correlation networks and spatial neighborhood graphs. RESULTS SPRITE improves spatial gene expression predictions across multiple spatial transcriptomics datasets. Furthermore, SPRITE predicted spatial gene expression leads to improved clustering, visualization, and classification of cells. SPRITE can be used in spatial transcriptomics data analysis to improve inferences based on predicted gene expression. AVAILABILITY AND IMPLEMENTATION The SPRITE software package is available at https://github.com/sunericd/SPRITE. Code for generating experiments and analyses in the manuscript is available at https://github.com/sunericd/sprite-figures-and-analyses.
Collapse
Affiliation(s)
- Eric D Sun
- Department of Biomedical Data Science, Stanford University, Stanford, CA 94305, United States
| | - Rong Ma
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, United States
| | - James Zou
- Department of Biomedical Data Science, Stanford University, Stanford, CA 94305, United States
| |
Collapse
|
50
|
Jin Y, Zuo Y, Li G, Liu W, Pan Y, Fan T, Fu X, Yao X, Peng Y. Advances in spatial transcriptomics and its applications in cancer research. Mol Cancer 2024; 23:129. [PMID: 38902727 PMCID: PMC11188176 DOI: 10.1186/s12943-024-02040-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2024] [Accepted: 06/10/2024] [Indexed: 06/22/2024] Open
Abstract
Malignant tumors have increasing morbidity and high mortality, and their occurrence and development is a complicate process. The development of sequencing technologies enabled us to gain a better understanding of the underlying genetic and molecular mechanisms in tumors. In recent years, the spatial transcriptomics sequencing technologies have been developed rapidly and allow the quantification and illustration of gene expression in the spatial context of tissues. Compared with the traditional transcriptomics technologies, spatial transcriptomics technologies not only detect gene expression levels in cells, but also inform the spatial location of genes within tissues, cell composition of biological tissues, and interaction between cells. Here we summarize the development of spatial transcriptomics technologies, spatial transcriptomics tools and its application in cancer research. We also discuss the limitations and challenges of current spatial transcriptomics approaches, as well as future development and prospects.
Collapse
Affiliation(s)
- Yang Jin
- Laboratory of Molecular Oncology, Frontiers Science Center for Disease-related Molecular Network, State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu, 610041, China
| | - Yuanli Zuo
- Laboratory of Molecular Oncology, Frontiers Science Center for Disease-related Molecular Network, State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu, 610041, China
| | - Gang Li
- Department of Thoracic Surgery, The Public Health Clinical Center of Chengdu, Chengdu, 610061, China
| | - Wenrong Liu
- Laboratory of Molecular Oncology, Frontiers Science Center for Disease-related Molecular Network, State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu, 610041, China
| | - Yitong Pan
- Laboratory of Molecular Oncology, Frontiers Science Center for Disease-related Molecular Network, State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu, 610041, China
| | - Ting Fan
- Laboratory of Molecular Oncology, Frontiers Science Center for Disease-related Molecular Network, State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu, 610041, China
| | - Xin Fu
- Laboratory of Molecular Oncology, Frontiers Science Center for Disease-related Molecular Network, State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu, 610041, China
| | - Xiaojun Yao
- Department of Thoracic Surgery, The Public Health Clinical Center of Chengdu, Chengdu, 610061, China.
| | - Yong Peng
- Laboratory of Molecular Oncology, Frontiers Science Center for Disease-related Molecular Network, State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu, 610041, China.
- Frontier Medical Center, Tianfu Jincheng Laboratory, Chengdu, 610212, China.
| |
Collapse
|