1
|
Raina M, Cheng H, Ferreira RM, Stansfield T, Modak C, Cheng YH, Suryadevara HNSK, Xu D, Eadon MT, Ma Q, Wang J. Relation equivariant graph neural networks to explore the mosaic-like tissue architecture of kidney diseases on spatially resolved transcriptomics. Bioinformatics 2025; 41:btaf303. [PMID: 40358510 PMCID: PMC12165735 DOI: 10.1093/bioinformatics/btaf303] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2024] [Revised: 04/23/2025] [Accepted: 05/12/2025] [Indexed: 05/15/2025] Open
Abstract
MOTIVATION Chronic kidney disease (CKD) and acute kidney injury (AKI) are prominent public health concerns affecting more than 15% of the global population. The ongoing development of spatially resolved transcriptomics (SRT) technologies presents a promising approach for discovering the spatial distribution patterns of gene expression within diseased tissues. However, existing computational tools are predominantly calibrated and designed on the ribbon-like structure of the brain cortex, presenting considerable computational obstacles in discerning highly heterogeneous mosaic-like tissue architectures in the kidney. Consequently, timely and cost-effective acquisition of annotation and interpretation in the kidney remains a challenge in exploring the cellular and morphological changes within renal tubules and their interstitial niches. RESULTS We present an empowered graph deep learning framework, REGNN (Relation Equivariant Graph Neural Networks), designed for SRT data analyses on heterogeneous tissue structures. To increase expressive power in the SRT lattice using graph modeling, REGNN integrates equivariance to handle n-dimensional symmetries of the spatial area, while additionally leveraging Positional Encoding to strengthen relative spatial relations of the nodes uniformly distributed in the lattice. Given the limited availability of well-labeled spatial data, this framework implements both graph autoencoder and graph self-supervised learning strategies. On heterogeneous samples from different kidney conditions, REGNN outperforms existing computational tools in identifying tissue architectures within the 10× Visium platform. This framework offers a powerful graph deep learning tool for investigating tissues within highly heterogeneous expression patterns and paves the way to pinpoint underlying pathological mechanisms that contribute to the progression of complex diseases. AVAILABILITY AND IMPLEMENTATION REGNN is publicly available at https://github.com/Mraina99/REGNN.
Collapse
Affiliation(s)
- Mauminah Raina
- Department of Biomedical Engineering and Informatics, Indiana University Indianapolis, Indianapolis, IN 46202, United States
| | - Hao Cheng
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH 43210, United States
| | | | - Treyden Stansfield
- Department of Biomedical Engineering and Informatics, Indiana University Indianapolis, Indianapolis, IN 46202, United States
| | - Chandrima Modak
- Department of Biomedical Engineering and Informatics, Indiana University Indianapolis, Indianapolis, IN 46202, United States
| | - Ying-Hua Cheng
- Department of Medicine, Indiana University, Indianapolis, IN 46202, United States
| | - Hari Naga Sai Kiran Suryadevara
- Department of Biomedical Engineering and Informatics, Indiana University Indianapolis, Indianapolis, IN 46202, United States
| | - Dong Xu
- Department of Electrical Engineering and Computer Science, and Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO 65211, United States
| | - Michael T Eadon
- Department of Medicine, Indiana University, Indianapolis, IN 46202, United States
| | - Qin Ma
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH 43210, United States
| | - Juexin Wang
- Department of Biomedical Engineering and Informatics, Indiana University Indianapolis, Indianapolis, IN 46202, United States
| |
Collapse
|
2
|
Cui Y, Cui Y, Ding Y, Nakai K, Wei L, Le Y, Ye X, Sakurai T. OmniClust: A versatile clustering toolkit for single-cell and spatial transcriptomics data. Methods 2025; 238:84-94. [PMID: 40057293 DOI: 10.1016/j.ymeth.2025.03.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2025] [Revised: 02/24/2025] [Accepted: 03/05/2025] [Indexed: 03/22/2025] Open
Abstract
In recent years, RNA transcriptome sequencing technology has been continuously evolving, ranging from single-cell transcriptomics to spatial transcriptomics. Although these technologies are all based on RNA sequencing, each sequencing technology has its own unique characteristics, and there is an urgent need to develop an algorithmic toolkit that integrates both sequencing techniques. To address this, we have developed OmniClust, a toolkit based on single-cell RNA sequencing (scRNA-seq) and spatial transcriptomics data. OmniClust employs deep learning algorithms for feature learning and clustering of spatial transcriptomics data, while utilizing machine learning algorithms for clustering scRNA-seq data. OmniClust was tested on 12 spatial transcriptomics benchmark datasets, demonstrating high clustering accuracy across multiple clustering evaluation metrics. It was also evaluated on four scRNA-seq benchmark datasets, achieving high clustering accuracy based on various clustering evaluation metrics. Furthermore, we applied OmniClust to downstream analyses of spatial transcriptomics and single-cell RNA breast cancer data, showcasing its potential to uncover and interpret the biological significance of cancer transcriptome data. In summary, OmniClust is a clustering tool designed for both single-cell transcriptomics and spatial transcriptomics data, demonstrating outstanding performance.
Collapse
Affiliation(s)
- Yaxuan Cui
- Department of Computer Science, University of Tsukuba, Tsukuba 3058577, Japan
| | - Yang Cui
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, University of Tokyo, Tokyo, Japan
| | - Yi Ding
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, University of Tokyo, Tokyo, Japan
| | - Kenta Nakai
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, University of Tokyo, Tokyo, Japan; Human Genome Center, The Institute of Medical Science, The University of Tokyo, Tokyo 108-8639, Japan
| | - Leyi Wei
- Centre for Artificial Intelligence Driven Drug Discovery, Faculty of Applied Science, Macao Polytechnic University, Macao SAR, China
| | - Yuyin Le
- Department of Radiation Oncology Fuzhou Pulmonary Hospital of Fujian Province , Teaching Hospital of Fujian Medical University, China.
| | - Xiucai Ye
- Department of Computer Science, University of Tsukuba, Tsukuba 3058577, Japan.
| | - Tetsuya Sakurai
- Department of Computer Science, University of Tsukuba, Tsukuba 3058577, Japan
| |
Collapse
|
3
|
Zhu B, Gao S, Chen S, Wang Y, Yeung J, Bai Y, Huang AY, Yeo YY, Liao G, Mao S, Jiang ZG, Rodig SJ, Wong KC, Shalek AK, Nolan GP, Jiang S, Ma Z. CellLENS enables cross-domain information fusion for enhanced cell population delineation in single-cell spatial omics data. Nat Immunol 2025:10.1038/s41590-025-02163-1. [PMID: 40404817 DOI: 10.1038/s41590-025-02163-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2024] [Accepted: 04/14/2025] [Indexed: 05/24/2025]
Abstract
Delineating cell populations is crucial for understanding immune function in health and disease. Spatial omics technologies offer insights by capturing three complementary domains: single-cell molecular biomarker expression, cellular spatial relationships and tissue architecture. However, current computational methods often fail to fully integrate these multidimensional data, particularly for immune cell populations and intrinsic functional states. We introduce Cell Local Environment and Neighborhood Scan (CellLENS), a self-supervised computational method that learns cellular representations by fusing information across three spatial omics domains (expression, neighborhood and image). CellLENS markedly enhances de novo discovery of biologically relevant immune cell populations at fine granularity by integrating individual cells' molecular profiles with their neighborhood context and tissue localization. By applying CellLENS to diverse spatial proteomic and transcriptomic datasets across multiple tissue types and disease settings, we uncover unique immune cell populations functionally stratified according to their spatial contexts. Our work demonstrates the power of multi-domain data integration in spatial omics to reveal insights into immune cell heterogeneity and tissue-specific functions.
Collapse
Affiliation(s)
- Bokai Zhu
- Ragon Institute of MGH, MIT, and Harvard, Cambridge, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Sheng Gao
- Department of Statistics and Data Science, The Wharton School, University of Pennsylvania, Philadelphia, PA, USA
| | - Shuxiao Chen
- Department of Statistics and Data Science, The Wharton School, University of Pennsylvania, Philadelphia, PA, USA
| | - Yuchen Wang
- Center for Virology and Vaccine Research, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, USA
- Department of Computer Science, City University of Hong Kong, Hong Kong, People's Republic of China
| | - Jason Yeung
- Center for Virology and Vaccine Research, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, USA
| | - Yunhao Bai
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Amy Y Huang
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Massachusetts Institute of Technology, Cambridge, MA, USA
- Dana-Farber Cancer Institute, Boston, MA, USA
| | - Yao Yu Yeo
- Center for Virology and Vaccine Research, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, USA
| | - Guanrui Liao
- Center for Virology and Vaccine Research, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, USA
- Center of Hepato-Pancreato-Biliary Surgery, The First Affiliated Hospital of Sun Yat-sen University, Guangzhou, People's Republic of China
| | - Shulin Mao
- Center for Virology and Vaccine Research, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, USA
- Division of Genetics and Genomics, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA
- Program in Biological and Biomedical Sciences, Harvard Medical School, Boston, MA, USA
| | - Zhenghui G Jiang
- Division of Gastroenterology/Liver Center, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, USA
| | - Scott J Rodig
- Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Ka-Chun Wong
- Department of Computer Science, City University of Hong Kong, Hong Kong, People's Republic of China
| | - Alex K Shalek
- Ragon Institute of MGH, MIT, and Harvard, Cambridge, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, MA, USA
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA
- Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Garry P Nolan
- Department of Pathology, Stanford University, Stanford, CA, USA.
| | - Sizun Jiang
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Center for Virology and Vaccine Research, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, USA.
- Dana-Farber Cancer Institute, Boston, MA, USA.
- Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.
| | - Zongming Ma
- Department of Statistics and Data Science, Yale University, New Haven, CT, USA.
| |
Collapse
|
4
|
Chen M, Cheng R, He J, Chen J, Zhang J. SMOPCA: spatially aware dimension reduction integrating multi-omics improves the efficiency of spatial domain detection. Genome Biol 2025; 26:135. [PMID: 40399936 PMCID: PMC12096709 DOI: 10.1186/s13059-025-03576-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2024] [Accepted: 04/12/2025] [Indexed: 05/23/2025] Open
Abstract
Technological advances have enabled us to profile multiple omics layers with spatial information, significantly enhancing spatial domain detection and advancing a variety of biomedical research fields. Despite these advancements, there is a notable lack of effective methods for modeling spatial multi-omics data. We introduce SMOPCA, a Spatial Multi-Omics Principal Component Analysis method designed to perform joint dimension reduction on multimodal data while preserving spatial dependencies. Extensive experiments reveal that SMOPCA outperforms existing single-modal and multimodal dimension reduction and clustering methods, across both single-cell and spatial multi-omics datasets derived from diverse technologies and tissue structures.
Collapse
Affiliation(s)
- Mo Chen
- National Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, Jiangsu, China
- School of Artificial Intelligence, Nanjing University, Nanjing, Jiangsu, China
| | - Ruihua Cheng
- Big Data Statistics Research Center, Tianjin University of Finance and Economics, Tianjin, China
| | - Jianuo He
- National Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, Jiangsu, China
- School of Artificial Intelligence, Nanjing University, Nanjing, Jiangsu, China
| | - Jun Chen
- Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN, USA.
| | - Jie Zhang
- National Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, Jiangsu, China.
- School of Artificial Intelligence, Nanjing University, Nanjing, Jiangsu, China.
| |
Collapse
|
5
|
Wang H, Cheng P, Wang J, Lv H, Han J, Hou Z, Xu R, Chen W. Advances in spatial transcriptomics and its application in the musculoskeletal system. Bone Res 2025; 13:54. [PMID: 40379648 DOI: 10.1038/s41413-025-00429-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2024] [Revised: 03/10/2025] [Accepted: 03/17/2025] [Indexed: 05/19/2025] Open
Abstract
While bulk RNA sequencing and single-cell RNA sequencing have shed light on cellular heterogeneity and potential molecular mechanisms in the musculoskeletal system in both physiological and various pathological states, the spatial localization of cells and molecules and intercellular interactions within the tissue context require further elucidation. Spatial transcriptomics has revolutionized biological research by simultaneously capturing gene expression profiles and in situ spatial information of tissues, gradually finding applications in musculoskeletal research. This review provides a summary of recent advances in spatial transcriptomics and its application to the musculoskeletal system. The classification and characteristics of data acquisition techniques in spatial transcriptomics are briefly outlined, with an emphasis on widely-adopted representative technologies and the latest technological breakthroughs, accompanied by a concise workflow for incorporating spatial transcriptomics into musculoskeletal system research. The role of spatial transcriptomics in revealing physiological mechanisms of the musculoskeletal system, particularly during developmental processes, is thoroughly summarized. Furthermore, recent discoveries and achievements of this emerging omics tool in addressing inflammatory, traumatic, degenerative, and tumorous diseases of the musculoskeletal system are compiled. Finally, challenges and potential future directions for spatial transcriptomics, both as a field and in its applications in the musculoskeletal system, are discussed.
Collapse
Affiliation(s)
- Haoyu Wang
- Department of Orthopedic Surgery, Hebei Medical University Third Hospital, Shijiazhuang, Hebei, China
- Key Laboratory of Biomechanics of Hebei Province, Shijiazhuang, Hebei, China
- NHC Key Laboratory of Intelligent Orthopedic Equipment, Shijiazhuang, Hebei, China
| | - Peng Cheng
- Department of Orthopedics, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China
| | - Juan Wang
- Department of Orthopedic Surgery, Hebei Medical University Third Hospital, Shijiazhuang, Hebei, China
- Key Laboratory of Biomechanics of Hebei Province, Shijiazhuang, Hebei, China
- NHC Key Laboratory of Intelligent Orthopedic Equipment, Shijiazhuang, Hebei, China
| | - Hongzhi Lv
- Department of Orthopedic Surgery, Hebei Medical University Third Hospital, Shijiazhuang, Hebei, China
- Key Laboratory of Biomechanics of Hebei Province, Shijiazhuang, Hebei, China
- NHC Key Laboratory of Intelligent Orthopedic Equipment, Shijiazhuang, Hebei, China
| | - Jie Han
- State Key Laboratory of Cellular Stress Biology, Cancer Research Center, School of Medicine, Faculty of Medicine and Life Sciences, Xiamen University, Xiamen, Fujian, China
| | - Zhiyong Hou
- Department of Orthopedic Surgery, Hebei Medical University Third Hospital, Shijiazhuang, Hebei, China
- Key Laboratory of Biomechanics of Hebei Province, Shijiazhuang, Hebei, China
- NHC Key Laboratory of Intelligent Orthopedic Equipment, Shijiazhuang, Hebei, China
| | - Ren Xu
- The First Affiliated Hospital of Xiamen University-ICMRS Collaborating Center for Skeletal Stem Cells, State Key Laboratory of Cellular Stress Biology, School of Medicine, Faculty of Medicine and Life Sciences, Xiamen University, Xiamen, Fujian, China.
| | - Wei Chen
- Department of Orthopedic Surgery, Hebei Medical University Third Hospital, Shijiazhuang, Hebei, China.
- Key Laboratory of Biomechanics of Hebei Province, Shijiazhuang, Hebei, China.
- NHC Key Laboratory of Intelligent Orthopedic Equipment, Shijiazhuang, Hebei, China.
| |
Collapse
|
6
|
Chelebian E, Avenel C, Wählby C. Combining spatial transcriptomics with tissue morphology. Nat Commun 2025; 16:4452. [PMID: 40360467 PMCID: PMC12075478 DOI: 10.1038/s41467-025-58989-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2024] [Accepted: 04/04/2025] [Indexed: 05/15/2025] Open
Abstract
Spatial transcriptomics has transformed our understanding of tissue architecture by preserving the spatial context of gene expression patterns. Simultaneously, advances in imaging AI have enabled extraction of morphological features describing the tissue. This review introduces a framework for categorizing methods that combine spatial transcriptomics with tissue morphology, focusing on either translating or integrating morphological features into spatial transcriptomics. Translation involves using morphology to predict gene expression, creating super-resolution maps or inferring genetic information from H&E-stained samples. Integration enriches spatial transcriptomics by identifying morphological features that complement gene expression. We also explore learning strategies and future directions for this emerging field.
Collapse
Affiliation(s)
- Eduard Chelebian
- Department of Information Technology and SciLifeLab, Uppsala University, Uppsala, Sweden
| | - Christophe Avenel
- Department of Information Technology and SciLifeLab, Uppsala University, Uppsala, Sweden
| | - Carolina Wählby
- Department of Information Technology and SciLifeLab, Uppsala University, Uppsala, Sweden.
| |
Collapse
|
7
|
Wang W, Zheng S, Shin SC, Chávez-Fuentes JC, Yuan GC. ONTraC characterizes spatially continuous variations of tissue microenvironment through niche trajectory analysis. Genome Biol 2025; 26:117. [PMID: 40340854 PMCID: PMC12060293 DOI: 10.1186/s13059-025-03588-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2024] [Accepted: 04/25/2025] [Indexed: 05/10/2025] Open
Abstract
Recent technological advances enable mapping of tissue spatial organization at single-cell resolution, but methods for analyzing spatially continuous microenvironments are still lacking. We introduce ONTraC, a graph neural network-based framework for constructing spatial trajectories at niche-level. Through benchmarking analyses using multiple simulated and real datasets, we show that ONTraC outperforms existing methods. ONTraC captures both normal anatomical structures and disease-associated tissue microenvironment changes. In addition, it identifies tissue microenvironment-dependent shifts in gene expression, regulatory network, and cell-cell interaction patterns. Taken together, ONTraC provides a useful framework for characterizing the structural and functional organization of tissue microenvironments.
Collapse
Affiliation(s)
- Wen Wang
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
| | - Shiwei Zheng
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
| | - Sujung Crystal Shin
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
| | | | - Guo-Cheng Yuan
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA.
| |
Collapse
|
8
|
Zhang W, Zhang Z, Yang H, Zhang T, Jiang S, Qiao N, Deng Z, Pan X, Shen HB, Yu DJ, Wang S. m2ST: dual multi-scale graph clustering for spatially resolved transcriptomics. Bioinformatics 2025; 41:btaf221. [PMID: 40272889 PMCID: PMC12085222 DOI: 10.1093/bioinformatics/btaf221] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2025] [Revised: 03/20/2025] [Accepted: 04/22/2025] [Indexed: 04/26/2025] Open
Abstract
MOTIVATION Spatial clustering is a key analytical technique for exploring spatial transcriptomics data. Recent graph neural network-based methods have shown promise in spatial clustering but face notable challenges. One significant issue is that analyzing the functions and complex mechanisms of organisms from a single scale is difficult and most methods focus exclusively on the single-scale representation of transcriptomic data, potentially limiting the discriminative power of extracted features for spatial domain clustering. Furthermore, classical clustering algorithms are often applied directly to latent representation, making it a worthwhile endeavor to explore a tailored clustering method to further improve the accuracy of spatial domain annotation. RESULTS To address these limitations, we propose m2ST, a novel dual multi-scale graph clustering method. m2ST first uses a multi-scale masked graph autoencoder to extract representations across different scales from spatial transcriptomic data. To effectively compress and distill meaningful knowledge embedded in the data, m2ST introduces a random masking mechanism for node features and uses a scaled cosine error as the loss function. Additionally, we introduce a tailored multi-scale clustering framework that integrates scale-common and scale-specific information exploration into the clustering process, achieving more robust annotation performance. Shannon entropy is finally utilized to dynamically adjust the importance of different scales. Extensive experiments on multiple spatial transcriptomic datasets demonstrate the superior performance of m2ST compared to existing methods. AVAILABILITY AND IMPLEMENTATION https://github.com/BBKing49/m2ST.
Collapse
Affiliation(s)
- Wei Zhang
- The School of Artificial Intelligence and Computer Science, Nantong University, Nantong, 226019, China
- The School of Artificial Intelligence and Computer Science, Jiangnan University, Wuxi, 214122, China
| | - Ziqi Zhang
- The School of Artificial Intelligence and Computer Science, Jiangnan University, Wuxi, 214122, China
| | - Hailong Yang
- The School of Artificial Intelligence and Computer Science, Jiangnan University, Wuxi, 214122, China
| | - Te Zhang
- The Lab for Uncertainty in Data and Decision Making (LUCID), School of Computer Science, University of Nottingham, Nottingham, NG81BB, United Kingdom
| | - Shu Jiang
- The School of Artificial Intelligence and Computer Science, Nantong University, Nantong, 226019, China
| | - Ning Qiao
- The School of Artificial Intelligence and Computer Science, Jiangnan University, Wuxi, 214122, China
| | - Zhaohong Deng
- The School of Artificial Intelligence and Computer Science, Jiangnan University, Wuxi, 214122, China
| | - Xiaoyong Pan
- Department of Automation, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Hong-Bin Shen
- Department of Automation, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Dong-Jun Yu
- School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, 210094, China
| | - Shitong Wang
- The School of Artificial Intelligence and Computer Science, Jiangnan University, Wuxi, 214122, China
| |
Collapse
|
9
|
Tang S, Liu S, Buchman AS, Bennett DA, De Jager PL, Yang J, Hu J. Integrating spatial transcriptomics and snRNA-seq data enhances differential gene expression analysis results of AD-related phenotypes. HGG ADVANCES 2025; 6:100447. [PMID: 40329537 DOI: 10.1016/j.xhgg.2025.100447] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2025] [Revised: 04/29/2025] [Accepted: 04/29/2025] [Indexed: 05/08/2025] Open
Abstract
Spatial transcriptomics (ST) data provide spatially informed gene expression profiles. However, power is limited for spatially informed differential gene expression (DGE) of complex diseases such as Alzheimer disease (AD), due to small sample sizes of ST data. Conversely, single-nucleus RNA sequencing (snRNA-seq) data offer larger sample sizes for cell-type-specific (CTS) analyses but lack spatial information. Here, we integrated ST and snRNA-seq data to enhance the power of spatially informed CTS DGE analysis of AD-related phenotypes. We first utilized the CeLEry tool to infer six cortical layers of ∼1.5 million cells in the snRNA-seq data that were profiled from the dorsolateral prefrontal cortex (DLPFC) tissue of 436 postmortem brains. Then, we conducted cortical layer- and cell-type-specific (LCS) and CTS DGE analyses based on the linear mixed model, for β-amyloid, tangle density, and cognitive decline. We identified 138 LCS significant genes with false discovery rate (FDR) q <0.05, including 103 for β-amyloid, 24 for tangle density, and 25 for cognitive decline. The majority of these LCS significant genes, including known AD risk genes such as APOE, KCNIP3, and CTSD, cannot be detected by CTS analyses. We also identified 2 genes shared across all 3 phenotypes and 10 shared between 2 phenotypes. Gene set enrichment analyses with the LCS DGE results of microglia in cortical layer 6 of β-amyloid identified 12 significant AD-related pathways. In conclusion, incorporating spatial information with snRNA-seq data enhanced the power of spatially informed DGE analyses. These identified LCS significant genes not only help illustrate the pathogenesis of AD but they also provide potential targets for developing therapeutics of AD.
Collapse
Affiliation(s)
- Shizhen Tang
- Center for Computational and Quantitative Genetics, Department of Human Genetics, Emory University School of Medicine, Atlanta, GA 30322, USA; Department of Biostatistics and Bioinformatics, Emory University School of Public Health, Atlanta, GA 30322, USA
| | - Shihan Liu
- Center for Computational and Quantitative Genetics, Department of Human Genetics, Emory University School of Medicine, Atlanta, GA 30322, USA; Department of Biostatistics and Bioinformatics, Emory University School of Public Health, Atlanta, GA 30322, USA
| | - Aron S Buchman
- Rush Alzheimer's Disease Center, Rush University Medical Center, Chicago, IL 60612, USA
| | - David A Bennett
- Rush Alzheimer's Disease Center, Rush University Medical Center, Chicago, IL 60612, USA
| | - Philip L De Jager
- Center for Translational and Computational Neuroimmunology, Department of Neurology and Taub Institute for Research on Alzheimer's Disease and the Aging Brain, Columbia University Irving Medical Center, New York, NY 10032, USA
| | - Jingjing Yang
- Center for Computational and Quantitative Genetics, Department of Human Genetics, Emory University School of Medicine, Atlanta, GA 30322, USA.
| | - Jian Hu
- Center for Computational and Quantitative Genetics, Department of Human Genetics, Emory University School of Medicine, Atlanta, GA 30322, USA.
| |
Collapse
|
10
|
Zuo C, Zhu J, Zou J, Chen L. Unravelling tumour spatiotemporal heterogeneity using spatial multimodal data. Clin Transl Med 2025; 15:e70331. [PMID: 40341789 PMCID: PMC12059211 DOI: 10.1002/ctm2.70331] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2024] [Revised: 04/07/2025] [Accepted: 04/24/2025] [Indexed: 05/11/2025] Open
Abstract
Analysing the genome, epigenome, transcriptome, proteome, and metabolome within the spatial context of cells has transformed our understanding of tumour spatiotemporal heterogeneity. Advances in spatial multi-omics technologies now reveal complex molecular interactions shaping cellular behaviour and tissue dynamics. This review highlights key technologies and computational methods that have advanced spatial domain identification and their pseudo-relations, as well as inference of intra- and inter-cellular molecular networks that drive disease progression. We also discuss strategies to address major challenges, including data sparsity, high-dimensionality, scalability, and heterogeneity. Furthermore, we outline how spatial multi-omics enables novel insights into disease mechanisms, advancing precision medicine and informing targeted therapies. KEY POINTS: Advancements in spatial multi-omics facilitate our understanding of tumour spatiotemporal heterogeneity. AI-driven multimodal models uncover complex molecular interactions that underlie cellular behaviours and tissue dynamics. Combining multi-omics technologies and AI-enabled bioinformatics tools helps predict critical disease stages, such as pre-cancer, advancing precision medicine, and informing targeted therapeutic strategies.
Collapse
Affiliation(s)
- Chunman Zuo
- School of Life SciencesSun Yat‐sen UniversityGuangzhouChina
| | - Junchao Zhu
- Key Laboratory of Systems Biology, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell ScienceChinese Academy of SciencesShanghaiChina
| | - Jiawei Zou
- Key Laboratory of Systems Biology, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell ScienceChinese Academy of SciencesShanghaiChina
| | - Luonan Chen
- Key Laboratory of Systems Biology, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell ScienceChinese Academy of SciencesShanghaiChina
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced StudyUniversity of Chinese Academy of SciencesChinese Academy of SciencesHangzhouChina
- West China Biomedical Big Data Center, Med‐X Center for InformaticsWest China HospitalSichuan UniversityChengduChina
- School of Mathematical Sciences and School of AIShanghai Jiao Tong UniversityShanghaiChina
| |
Collapse
|
11
|
Luo B, Teng F, Tang G, Cen W, Liu X, Chen J, Qu C, Liu X, Liu X, Jiang W, Huang H, Feng Y, Zhang X, Jian M, Li M, Xi F, Li G, Liao S, Chen A, Yu W, Xu X, Zhang J. StereoMM: a graph fusion model for integrating spatial transcriptomic data and pathological images. Brief Bioinform 2025; 26:bbaf210. [PMID: 40407386 PMCID: PMC12100622 DOI: 10.1093/bib/bbaf210] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2024] [Revised: 03/27/2025] [Accepted: 04/10/2025] [Indexed: 05/26/2025] Open
Abstract
Spatial omics technologies, generating high-throughput and multimodal data, have necessitated the development of advanced data integration methods to facilitate comprehensive biological and clinical treatment discoveries. Based on the cross-attention concept, we developed an AI learning based toolchain called StereoMM, a graph based fusion model that can incorporate omics data such as gene expression, histological images, and spatial location. StereoMM uses an attention module for omics data interaction and a graph autoencoder to integrate spatial positions and omics data in a self-supervised manner. Applying StereoMM across various cancer types and platforms has demonstrated its robust capability. StereoMM outperforms competitors in identifying spatial regions reflecting tumour progression and shows promise in classifying colorectal cancer patients into deficient mismatch repair and proficient mismatch repair groups. The comprehensive inter-modal integration and efficiency of StereoMM enable researchers to construct spatial views of integrated multimodal features efficiently, advancing thorough tissue and patient characterization.
Collapse
Affiliation(s)
- Bingying Luo
- BGI Research, Chongqing, No. 313, Jinyue road, Jiulongpo District, Chongqing 401329, China
- BGI Research, Shenzhen, No. 9, Yunhua Road, Yantian District, Shenzhen 518083, China
| | - Fei Teng
- BGI Research, Chongqing, No. 313, Jinyue road, Jiulongpo District, Chongqing 401329, China
- BGI Research, Shenzhen, No. 9, Yunhua Road, Yantian District, Shenzhen 518083, China
| | - Guo Tang
- BGI Research, Chongqing, No. 313, Jinyue road, Jiulongpo District, Chongqing 401329, China
| | - Weixuan Cen
- BGI Research, Shenzhen, No. 9, Yunhua Road, Yantian District, Shenzhen 518083, China
| | - Xing Liu
- BGI Research, Chongqing, No. 313, Jinyue road, Jiulongpo District, Chongqing 401329, China
- BGI Research, Shenzhen, No. 9, Yunhua Road, Yantian District, Shenzhen 518083, China
| | - Jinmiao Chen
- Center for Computational Biology and Program in Cancer and Stem Cell Biology, Duke-NUS Medical School, 8 College Road, Singapore 169857, Singapore
| | - Chi Qu
- BGI Research, Chongqing, No. 313, Jinyue road, Jiulongpo District, Chongqing 401329, China
| | - Xuanzhu Liu
- BGI Research, Chongqing, No. 313, Jinyue road, Jiulongpo District, Chongqing 401329, China
- BGI Research, Shenzhen, No. 9, Yunhua Road, Yantian District, Shenzhen 518083, China
| | - Xin Liu
- BGI Research, Chongqing, No. 313, Jinyue road, Jiulongpo District, Chongqing 401329, China
| | - Wenyan Jiang
- BGI Research, Chongqing, No. 313, Jinyue road, Jiulongpo District, Chongqing 401329, China
| | - Huaqiang Huang
- BGI Research, Chongqing, No. 313, Jinyue road, Jiulongpo District, Chongqing 401329, China
| | - Yu Feng
- State Key Laboratory of Genome and Multi-omics Technologies, BGI Research, No. 9, Yunhua Road, Yantian District, Shenzhen 518083, China
- BGI Collaborative Center for Future Medicine, Shanxi Medical University, No. 1258, Xinjiannan Road, Yingze District, Taiyuan 030001, China
| | - Xue Zhang
- BGI Research, Chongqing, No. 313, Jinyue road, Jiulongpo District, Chongqing 401329, China
| | - Min Jian
- BGI Research, Shenzhen, No. 9, Yunhua Road, Yantian District, Shenzhen 518083, China
| | - Mei Li
- BGI Research, Shenzhen, No. 9, Yunhua Road, Yantian District, Shenzhen 518083, China
| | - Feng Xi
- BGI Research, Chongqing, No. 313, Jinyue road, Jiulongpo District, Chongqing 401329, China
| | - Guibo Li
- BGI Research, Chongqing, No. 313, Jinyue road, Jiulongpo District, Chongqing 401329, China
- BGI Research, Shenzhen, No. 9, Yunhua Road, Yantian District, Shenzhen 518083, China
| | - Sha Liao
- BGI Research, Shenzhen, No. 9, Yunhua Road, Yantian District, Shenzhen 518083, China
| | - Ao Chen
- BGI Research, Chongqing, No. 313, Jinyue road, Jiulongpo District, Chongqing 401329, China
- BGI Research, Shenzhen, No. 9, Yunhua Road, Yantian District, Shenzhen 518083, China
| | - Weimiao Yu
- School of Biological Science, Nanyang Technological University, 60 Nanyang Drive, Singapore 637551, Singapore
| | - Xun Xu
- BGI Research, Chongqing, No. 313, Jinyue road, Jiulongpo District, Chongqing 401329, China
- BGI Research, Shenzhen, No. 9, Yunhua Road, Yantian District, Shenzhen 518083, China
- BGI Research, Hangzhou, No. 203, Zhenzhong Road, Xihu District, Hangzhou 310030, China
| | - Jiajun Zhang
- BGI Research, Chongqing, No. 313, Jinyue road, Jiulongpo District, Chongqing 401329, China
- BGI Research, Shenzhen, No. 9, Yunhua Road, Yantian District, Shenzhen 518083, China
| |
Collapse
|
12
|
Wang W, Shin SC, Chávez-Fuentes JC, Yuan GC. A Robust Kernel-Based Workflow for Niche Trajectory Analysis. SMALL METHODS 2025; 9:e2401199. [PMID: 40411819 PMCID: PMC12103655 DOI: 10.1002/smtd.202401199] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 08/01/2024] [Revised: 12/19/2024] [Indexed: 05/26/2025]
Abstract
Niche trajectory analysis is a promising framework for modeling spatially continuous variations of the tissue microenvironment. However, the existing approach is limited by its requirement of cell-type annotation as a necessary input, which can lead to unwanted technical variations. To overcome this limitation, a new kernel-based strategy is presented that models the structural composition of a niche as a continuous function in the gene expression space, thereby obviating the need for cell-type annotation. Further integration with cell-type deconvolution analysis extends its application to datasets with any spatial resolution. Applying this strategy to real datasets indicates enhanced performance in robustness and accuracy and provides new insights into injury or disease-associated tissue microenvironment changes. As such, a useful tool for spatial transcriptomics data analysis is provided.
Collapse
Affiliation(s)
- Wen Wang
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
| | - Sujung Crystal Shin
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
| | | | - Guo-Cheng Yuan
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
- Tisch Cancer Institute, Black Family Stem Cell Institute, Precision Immunology Institute, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
| |
Collapse
|
13
|
Guo CK, Xia CR, Peng G, Cao ZJ, Gao G. Learning Phenotype Associated Signature in Spatial Transcriptomics with PASSAGE. SMALL METHODS 2025; 9:e2401451. [PMID: 39905872 DOI: 10.1002/smtd.202401451] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/06/2024] [Revised: 12/31/2024] [Indexed: 02/06/2025]
Abstract
Spatially resolved transcriptomics (SRT) is poised to advance the understanding of cellular organization within complex tissues under various physiological and pathological conditions at unprecedented resolution. Despite the development of numerous computational tools that facilitate the automatic identification of statistically significant intra-/inter-slice patterns (like spatial domains), these methods typically operate in an unsupervised manner, without leveraging sample characteristics like physiological/pathological states. Here PASSAGE (Phenotype Associated Spatial Signature Analysis with Graph-based Embedding), a rationally-designed deep learning framework is presented for characterizing phenotype-associated signatures across multiple heterogeneous spatial slices effectively. In addition to its outstanding performance in systematic benchmarks, PASSAGE's unique capability in calling sophisticated signatures has been demonstrated in multiple real-world cases. The full package of PASSAGE is available at https://github.com/gao-lab/PASSAGE.
Collapse
Affiliation(s)
- Chen-Kai Guo
- Center for Cell Lineage and Development, CAS Key Laboratory of Regenerative Biology, Guangdong Provincial Key Laboratory of Stem Cell and Regenerative Medicine, GIBH-HKU Guangdong-Hong Kong Stem Cell and Regenerative Medicine Research Centre, University of Chinese Academy of Sciences, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, 510530, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Chen-Rui Xia
- State Key Laboratory of Gene Function and Modulation Research, School of Life Sciences, Biomedical Pioneering Innovative Center (BIOPIC) and Beijing Advanced Innovation Center for Genomics (ICG), Center for Bioinformatics (CBI), Peking University, Beijing, 100871, China
- Changping Laboratory, Beijing, 102206, China
| | - Guangdun Peng
- Center for Cell Lineage and Development, CAS Key Laboratory of Regenerative Biology, Guangdong Provincial Key Laboratory of Stem Cell and Regenerative Medicine, GIBH-HKU Guangdong-Hong Kong Stem Cell and Regenerative Medicine Research Centre, University of Chinese Academy of Sciences, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, 510530, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Zhi-Jie Cao
- State Key Laboratory of Gene Function and Modulation Research, School of Life Sciences, Biomedical Pioneering Innovative Center (BIOPIC) and Beijing Advanced Innovation Center for Genomics (ICG), Center for Bioinformatics (CBI), Peking University, Beijing, 100871, China
- Changping Laboratory, Beijing, 102206, China
| | - Ge Gao
- State Key Laboratory of Gene Function and Modulation Research, School of Life Sciences, Biomedical Pioneering Innovative Center (BIOPIC) and Beijing Advanced Innovation Center for Genomics (ICG), Center for Bioinformatics (CBI), Peking University, Beijing, 100871, China
- Changping Laboratory, Beijing, 102206, China
| |
Collapse
|
14
|
Wang J, Ye F, Chai H, Jiang Y, Wang T, Ran X, Xia Q, Xu Z, Fu Y, Zhang G, Wu H, Guo G, Guo H, Ruan Y, Wang Y, Xing D, Xu X, Zhang Z. Advances and applications in single-cell and spatial genomics. SCIENCE CHINA. LIFE SCIENCES 2025; 68:1226-1282. [PMID: 39792333 DOI: 10.1007/s11427-024-2770-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/20/2024] [Accepted: 10/10/2024] [Indexed: 01/12/2025]
Abstract
The applications of single-cell and spatial technologies in recent times have revolutionized the present understanding of cellular states and the cellular heterogeneity inherent in complex biological systems. These advancements offer unprecedented resolution in the examination of the functional genomics of individual cells and their spatial context within tissues. In this review, we have comprehensively discussed the historical development and recent progress in the field of single-cell and spatial genomics. We have reviewed the breakthroughs in single-cell multi-omics technologies, spatial genomics methods, and the computational strategies employed toward the analyses of single-cell atlas data. Furthermore, we have highlighted the advances made in constructing cellular atlases and their clinical applications, particularly in the context of disease. Finally, we have discussed the emerging trends, challenges, and opportunities in this rapidly evolving field.
Collapse
Affiliation(s)
- Jingjing Wang
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Fang Ye
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Haoxi Chai
- Life Sciences Institute and The Second Affiliated Hospital, Zhejiang University, Hangzhou, 310058, China
| | - Yujia Jiang
- BGI Research, Shenzhen, 518083, China
- BGI Research, Hangzhou, 310030, China
| | - Teng Wang
- Biomedical Pioneering Innovation Center (BIOPIC) and School of Life Sciences, Peking University, Beijing, 100871, China
- Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, 100871, China
| | - Xia Ran
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
- Institute of Hematology, Zhejiang University, Hangzhou, 310000, China
| | - Qimin Xia
- Biomedical Pioneering Innovation Center (BIOPIC) and School of Life Sciences, Peking University, Beijing, 100871, China
| | - Ziye Xu
- Department of Laboratory Medicine of The First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Yuting Fu
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
- Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Guodong Zhang
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
- Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Hanyu Wu
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
- Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Guoji Guo
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China.
- Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou, 310058, China.
- Zhejiang Provincial Key Lab for Tissue Engineering and Regenerative Medicine, Dr. Li Dak Sum & Yip Yio Chin Center for Stem Cell and Regenerative Medicine, Hangzhou, 310058, China.
- Institute of Hematology, Zhejiang University, Hangzhou, 310000, China.
| | - Hongshan Guo
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China.
- Institute of Hematology, Zhejiang University, Hangzhou, 310000, China.
| | - Yijun Ruan
- Life Sciences Institute and The Second Affiliated Hospital, Zhejiang University, Hangzhou, 310058, China.
| | - Yongcheng Wang
- Department of Laboratory Medicine of The First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China.
| | - Dong Xing
- Biomedical Pioneering Innovation Center (BIOPIC) and School of Life Sciences, Peking University, Beijing, 100871, China.
- Beijing Advanced Innovation Center for Genomics (ICG), Peking University, Beijing, 100871, China.
| | - Xun Xu
- BGI Research, Shenzhen, 518083, China.
- BGI Research, Hangzhou, 310030, China.
- Guangdong Provincial Key Laboratory of Genome Read and Write, BGI Research, Shenzhen, 518083, China.
| | - Zemin Zhang
- Biomedical Pioneering Innovation Center (BIOPIC) and School of Life Sciences, Peking University, Beijing, 100871, China.
| |
Collapse
|
15
|
Wang Q, Zhu H, Deng L, Xu S, Xie W, Li M, Wang R, Tie L, Zhan L, Yu G. Spatial Transcriptomics: Biotechnologies, Computational Tools, and Neuroscience Applications. SMALL METHODS 2025; 9:e2401107. [PMID: 39760243 DOI: 10.1002/smtd.202401107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/19/2024] [Revised: 12/22/2024] [Indexed: 01/07/2025]
Abstract
Spatial transcriptomics (ST) represents a revolutionary approach in molecular biology, providing unprecedented insights into the spatial organization of gene expression within tissues. This review aims to elucidate advancements in ST technologies, their computational tools, and their pivotal applications in neuroscience. It is begun with a historical overview, tracing the evolution from early image-based techniques to contemporary sequence-based methods. Subsequently, the computational methods essential for ST data analysis, including preprocessing, cell type annotation, spatial clustering, detection of spatially variable genes, cell-cell interaction analysis, and 3D multi-slices integration are discussed. The central focus of this review is the application of ST in neuroscience, where it has significantly contributed to understanding the brain's complexity. Through ST, researchers advance brain atlas projects, gain insights into brain development, and explore neuroimmune dysfunctions, particularly in brain tumors. Additionally, ST enhances understanding of neuronal vulnerability in neurodegenerative diseases like Alzheimer's and neuropsychiatric disorders such as schizophrenia. In conclusion, while ST has already profoundly impacted neuroscience, challenges remain issues such as enhancing sequencing technologies and developing robust computational tools. This review underscores the transformative potential of ST in neuroscience, paving the way for new therapeutic insights and advancements in brain research.
Collapse
Affiliation(s)
- Qianwen Wang
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou, 510515, China
| | - Hongyuan Zhu
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou, 510515, China
| | - Lin Deng
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou, 510515, China
| | - Shuangbin Xu
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou, 510515, China
| | - Wenqin Xie
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou, 510515, China
| | - Ming Li
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou, 510515, China
| | - Rui Wang
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou, 510515, China
| | - Liang Tie
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou, 510515, China
| | - Li Zhan
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou, 510515, China
| | - Guangchuang Yu
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou, 510515, China
| |
Collapse
|
16
|
Song L, Chen W, Hou J, Guo M, Yang J. Spatially resolved mapping of cells associated with human complex traits. Nature 2025; 641:932-941. [PMID: 40108460 PMCID: PMC12095064 DOI: 10.1038/s41586-025-08757-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2024] [Accepted: 02/07/2025] [Indexed: 03/22/2025]
Abstract
Depicting spatial distributions of disease-relevant cells is crucial for understanding disease pathology1,2. Here we present genetically informed spatial mapping of cells for complex traits (gsMap), a method that integrates spatial transcriptomics data with summary statistics from genome-wide association studies to map cells to human complex traits, including diseases, in a spatially resolved manner. Using embryonic spatial transcriptomics datasets covering 25 organs, we benchmarked gsMap through simulation and by corroborating known trait-associated cells or regions in various organs. Applying gsMap to brain spatial transcriptomics data, we reveal that the spatial distribution of glutamatergic neurons associated with schizophrenia more closely resembles that for cognitive traits than that for mood traits such as depression. The schizophrenia-associated glutamatergic neurons were distributed near the dorsal hippocampus, with upregulated expression of calcium signalling and regulation genes, whereas depression-associated glutamatergic neurons were distributed near the deep medial prefrontal cortex, with upregulated expression of neuroplasticity and psychiatric drug target genes. Our study provides a method for spatially resolved mapping of trait-associated cells and demonstrates the gain of biological insights (such as the spatial distribution of trait-relevant cells and related signature genes) through these maps.
Collapse
Affiliation(s)
- Liyang Song
- School of Life Sciences, Westlake University, Hangzhou, China
- Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, China
| | - Wenhao Chen
- School of Life Sciences, Westlake University, Hangzhou, China
- Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, China
| | - Junren Hou
- School of Life Sciences, Westlake University, Hangzhou, China
- Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, China
| | - Minmin Guo
- School of Life Sciences, Westlake University, Hangzhou, China
- Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, China
| | - Jian Yang
- School of Life Sciences, Westlake University, Hangzhou, China.
- Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, China.
| |
Collapse
|
17
|
Wang L, Bai X, Zhang C, Shi Q, Chen L. Spatially Aware Domain Adaptation Enables Cell Type Deconvolution from Multi-Modal Spatially Resolved Transcriptomics. SMALL METHODS 2025; 9:e2401163. [PMID: 39623794 DOI: 10.1002/smtd.202401163] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/28/2024] [Revised: 11/16/2024] [Indexed: 05/26/2025]
Abstract
Spatially Resolved Transcriptomics (SRT) offers unprecedented opportunities to elucidate the cellular arrangements within tissues. Nevertheless, the absence of deconvolution methods that simultaneously model multi-modal features has impeded progress in understanding cellular heterogeneity in spatial contexts. To address this issue, SpaDA is developed, a novel spatially aware domain adaptation method that integrates multi-modal data (i.e., transcriptomics, histological images, and spatial locations) from SRT to accurately estimate the spatial distribution of cell types. SpaDA utilizes a self-expressive variational autoencoder, coupled with deep spatial distribution alignment, to learn and align spatial and graph representations from spatial multi-modal SRT data and single-cell RNA sequencing (scRNA-seq) data. This strategy facilitates the transfer of cell type annotation information across these two similarity graphs, thereby enhancing the prediction accuracy of cell type composition. The results demonstrate that SpaDA surpasses existing methods in cell type deconvolution and the identification of cell types and spatial domains across diverse platforms. Moreover, SpaDA excels in identifying spatially colocalized cell types and key marker genes in regions of low-quality measurements, exemplified by high-resolution mouse cerebellum SRT data. In conclusion, SpaDA offers a powerful and flexible framework for the analysis of multi-modal SRT datasets, advancing the understanding of complex biological systems.
Collapse
Affiliation(s)
- Lequn Wang
- Key Laboratory of Systems Biology, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell Science, Chinese Academy of Sciences, Shanghai, 200031, China
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, 310024, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Xiaosheng Bai
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, 310024, China
| | - Chuanchao Zhang
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, 310024, China
| | - Qianqian Shi
- Hubei Engineering Technology Research Center of Agricultural Big Data, Huazhong Agricultural University, Wuhan, 430070, China
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, China
| | - Luonan Chen
- Key Laboratory of Systems Biology, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell Science, Chinese Academy of Sciences, Shanghai, 200031, China
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, 310024, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| |
Collapse
|
18
|
Ren S, Liao X, Liu F, Li J, Gao X, Yu B. Exploring the Latent Information in Spatial Transcriptomics Data via Multi-View Graph Convolutional Network Based on Implicit Contrastive Learning. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2025:e2413545. [PMID: 40304359 DOI: 10.1002/advs.202413545] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/23/2024] [Revised: 01/09/2025] [Indexed: 05/02/2025]
Abstract
Latest developments in spatial transcriptomics enable thoroughly profiling of gene expression while preserving tissue microenvironment. Connecting gene expression with spatial arrangement is key for precise spatial domain identification, enhancing the comprehension of tissue microenvironments and biological processes. However, accurately analyzing spatial domains with similar gene expression and histological features is still challenging. This study introduces STMIGCL, a novel framework that leverages a multi-view graph convolutional network and implicit contrastive learning. First, it creates neighbor graphs from gene expression and spatial coordinates, and then combines these with gene expression through multi-view learning to learn low-dimensional representations. To further refine the obtained low-dimensional representations, a graph contrastive learning method with contrastive enhancement in the latent space is employed, aiming to better capture critical information in the data and improve the accuracy and discriminative power of the embeddings. Finally, an attention mechanism is used to adaptively integrate different views, capturing the importance of spots in various views to obtain the final spot representation. Experimental data confirms that STMIGCL significantly enhances spatial domain recognition precision and outperforms all baseline methods in tasks such as trajectory inference and Spatially Variable Genes (SVGs) recognition.
Collapse
Affiliation(s)
- Sheng Ren
- School of Data Science, Qingdao University of Science and Technology, Qingdao, 266061, China
| | - Xingyu Liao
- School of Computer Science, Northwestern Polytechnical University, Xi'an, 710072, China
| | - Farong Liu
- College of Mathematics and Physics, Qingdao University of Science and Technology, Qingdao, 266061, China
| | - Jie Li
- School of Data Science, Qingdao University of Science and Technology, Qingdao, 266061, China
| | - Xin Gao
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia
| | - Bin Yu
- School of Data Science, Qingdao University of Science and Technology, Qingdao, 266061, China
- School of Artificial Intelligence and Data Science, University of Science and Technology of China, Hefei, 230026, China
| |
Collapse
|
19
|
Salim A, Bhuva DD, Chen C, Tan CW, Yang P, Davis MJ, Yang JYH. SpaNorm: spatially-aware normalization for spatial transcriptomics data. Genome Biol 2025; 26:109. [PMID: 40301877 PMCID: PMC12039303 DOI: 10.1186/s13059-025-03565-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2024] [Accepted: 03/31/2025] [Indexed: 05/01/2025] Open
Abstract
Normalization of spatial transcriptomics data is challenging due to spatial association between region-specific library size and biology. We develop SpaNorm, the first spatially-aware normalization method that concurrently models library size effects and the underlying biology, segregates these effects, and thereby removes library size effects without removing biological information. Using 27 tissue samples from 6 datasets spanning 4 technological platforms, SpaNorm outperforms commonly used single-cell normalization approaches while retaining spatial domain information and detecting spatially variable genes. SpaNorm is versatile and works equally well for multicellular and subcellular spatial transcriptomics data with relatively robust performance under different segmentation methods.
Collapse
Affiliation(s)
- Agus Salim
- Melbourne School of Population and Global Health, The University of Melbourne, Melbourne, 3010, VIC, Australia.
- Bioinformatics Division, Walter and Eliza Hall Institute of Medical Research, Parkville, 3052, VIC, Australia.
- School of Mathematics and Statistics, The University of Melbourne, Melbourne, 3010, VIC, Australia.
- Baker Heart and Diabetes Institute, Melbourne, 3004, VIC, Australia.
| | - Dharmesh D Bhuva
- Bioinformatics Division, Walter and Eliza Hall Institute of Medical Research, Parkville, 3052, VIC, Australia.
- South Australian Immunogenomics Cancer Institute, Faculty of Health and Medical Sciences, The University of Adelaide, Adelaide, 5005, SA, Australia.
- Precision Cancer Medicine, South Australian Health and Medical Research Institute (SAHMRI), Adelaide, 5000, SA, Australia.
- Frazer Institute, Faculty of Medicine, The University of Queensland, Woolloongabba, 4102, QLD, Australia.
| | - Carissa Chen
- School of Medical Sciences, Faculty of Medicine and Health, The University of Sydney, Sydney, 2006, NSW, Australia
- Computational Systems Biology Unit, Children'S Medical Research Institute, Westmead, 2145, NSW, Australia
- Sydney Precision Data Science Centre, The University of Sydney, Sydney, 2006, NSW, Australia
| | - Chin Wee Tan
- Bioinformatics Division, Walter and Eliza Hall Institute of Medical Research, Parkville, 3052, VIC, Australia
- Frazer Institute, Faculty of Medicine, The University of Queensland, Woolloongabba, 4102, QLD, Australia
- Department of Medical Biology, Faculty of Medicine, Dentistry and Health Sciences, The University of Melbourne, Melbourne, 3010, VIC, Australia
| | - Pengyi Yang
- Computational Systems Biology Unit, Children'S Medical Research Institute, Westmead, 2145, NSW, Australia
- Sydney Precision Data Science Centre, The University of Sydney, Sydney, 2006, NSW, Australia
- School of Mathematics and Statistics, The University of Sydney, Sydney, 2006, NSW, Australia
- Charles Perkins Centre, The University of Sydney, Sydney, 2006, NSW, Australia
| | - Melissa J Davis
- Bioinformatics Division, Walter and Eliza Hall Institute of Medical Research, Parkville, 3052, VIC, Australia
- School of Biomedicine, Faculty of Health and Medical Sciences, The University of Adelaide, Adelaide, 5005, SA, Australia
- Isomorphic Labs, London, UK
| | - Jean Y H Yang
- Sydney Precision Data Science Centre, The University of Sydney, Sydney, 2006, NSW, Australia
- School of Mathematics and Statistics, The University of Sydney, Sydney, 2006, NSW, Australia
- Charles Perkins Centre, The University of Sydney, Sydney, 2006, NSW, Australia
| |
Collapse
|
20
|
Gong Y, Yuan X, Jiao Q, Yu Z. Unveiling fine-scale spatial structures and amplifying gene expression signals in ultra-large ST slices with HERGAST. Nat Commun 2025; 16:3977. [PMID: 40295488 PMCID: PMC12037780 DOI: 10.1038/s41467-025-59139-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2024] [Accepted: 04/08/2025] [Indexed: 04/30/2025] Open
Abstract
We propose HERGAST, a system for spatial structure identification and signal amplification in ultra-large-scale and ultra-high-resolution spatial transcriptomics data. To handle ultra-large spatial transcriptomics (ST) data, we consider the divide and conquer strategy and devise a Divide-Iterate-Conquer framework especially for spatial transcriptomics data analysis, which can also be adopted by other computational methods for extending to ultra-large-scale ST data analysis. To tackle the potential over-smoothing problem arising from data splitting, we construct a heterogeneous graph network to incorporate both local and global spatial relationships. In simulations, HERGAST consistently outperforms other methods across all settings with more than a 10% increase in average adjusted rand index (ARI). In real-world datasets, HERGAST's high-precision spatial clustering identifies SPP1+ macrophages intermingled within colorectal tumors, while the enhanced gene expression signals reveal unique spatial expression patterns of key genes in breast cancer.
Collapse
Affiliation(s)
- Yuqiao Gong
- Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Xin Yuan
- SJTU-Yale Joint Center for Biostatistics and Data Science Organization, Shanghai Jiao Tong University, Shanghai, China
- Institute of Translational Medicine, Shanghai Jiao Tong University, Shanghai, China
- National Center for Translational Medicine, Shanghai Jiao Tong University, Shanghai, China
| | - Qiong Jiao
- Department of Pathology, Shanghai Sixth People's Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, Shanghai, China.
| | - Zhangsheng Yu
- Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China.
- SJTU-Yale Joint Center for Biostatistics and Data Science Organization, Shanghai Jiao Tong University, Shanghai, China.
- Institute of Translational Medicine, Shanghai Jiao Tong University, Shanghai, China.
- Clinical Research Institute, Shanghai Jiao Tong University School of Medicine, Shanghai, China.
| |
Collapse
|
21
|
Srinivasan G, Le MK, Azher Z, Liu X, Vaickus L, Kaur H, Kolling F, Palisoul S, Perreard L, Lau KS, Yao K, Levy J. Histology-Based Virtual RNA Inference Identifies Pathways Associated with Metastasis Risk in Colorectal Cancer. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2025:2025.04.22.25326170. [PMID: 40313260 PMCID: PMC12045403 DOI: 10.1101/2025.04.22.25326170] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 05/03/2025]
Abstract
Colorectal cancer (CRC) remains a major health concern, with over 150,000 new diagnoses and more than 50,000 deaths annually in the United States, underscoring an urgent need for improved screening, prognostication, disease management, and therapeutic approaches. The tumor microenvironment (TME)-comprising cancerous and immune cells interacting within the tumor's spatial architecture-plays a critical role in disease progression and treatment outcomes, reinforcing its importance as a prognostic marker for metastasis and recurrence risk. However, traditional methods for TME characterization, such as bulk transcriptomics and multiplex protein assays, lack sufficient spatial resolution. Although spatial transcriptomics (ST) allows for the high-resolution mapping of whole transcriptomes at near-cellular resolution, current ST technologies (e.g., Visium, Xenium) are limited by high costs, low throughput, and issues with reproducibility, preventing their widespread application in large-scale molecular epidemiology studies. In this study, we refined and implemented Virtual RNA Inference (VRI) to derive ST-level molecular information directly from hematoxylin and eosin (H&E)-stained tissue images. Our VRI models were trained on the largest matched CRC ST dataset to date, comprising 45 patients and more than 300,000 Visium spots from primary tumors. Using state-of-the-art architectures (UNI, ResNet-50, ViT, and VMamba), we achieved a median Spearman's correlation coefficient of 0.546 between predicted and measured spot-level expression. As validation, VRI-derived gene signatures linked to specific tissue regions (tumor, interface, submucosa, stroma, serosa, muscularis, inflammation) showed strong concordance with signatures generated via direct ST, and VRI performed accurately in estimating cell-type proportions spatially from H&E slides. In an expanded CRC cohort controlling for tumor invasiveness and clinical factors, we further identified VRI-derived gene signatures significantly associated with key prognostic outcomes, including metastasis status. Although certain tumor-related pathways are not fully captured by histology alone, our findings highlight the ability of VRI to infer a wide range of "histology-associated" biological pathways at near-cellular resolution without requiring ST profiling. Future efforts will extend this framework to expand TME phenotyping from standard H&E tissue images, with the potential to accelerate translational CRC research at scale.
Collapse
Affiliation(s)
- Gokul Srinivasan
- Departments of Pathology and Laboratory Medicine and Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, CA 90048
| | - Minh-Khang Le
- Departments of Pathology and Laboratory Medicine and Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, CA 90048
| | - Zarif Azher
- Departments of Pathology and Laboratory Medicine and Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, CA 90048
- California Institute of Technology, Pasadena, CA, 91125
| | - Xiaoying Liu
- Department of Pathology and Laboratory Medicine, Dartmouth-Hitchcock Medical Center and Geisel School of Medicine at Dartmouth, Lebanon, NH 03766
| | - Louis Vaickus
- Department of Pathology and Laboratory Medicine, Dartmouth-Hitchcock Medical Center and Geisel School of Medicine at Dartmouth, Lebanon, NH 03766
| | - Harsimran Kaur
- Center for Computational Systems Biology, Department of Cell and Developmental Biology, Chemical and Physical Biology Program, Vanderbilt University School of Medicine, Nashville TN 37232
| | | | - Scott Palisoul
- Department of Pathology and Laboratory Medicine, Dartmouth-Hitchcock Medical Center and Geisel School of Medicine at Dartmouth, Lebanon, NH 03766
| | | | - Ken S. Lau
- Center for Computational Systems Biology, Department of Cell and Developmental Biology, Chemical and Physical Biology Program, Vanderbilt University School of Medicine, Nashville TN 37232
| | - Keluo Yao
- Departments of Pathology and Laboratory Medicine and Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, CA 90048
| | - Joshua Levy
- Departments of Pathology and Laboratory Medicine and Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, CA 90048
- Department of Pathology and Laboratory Medicine, Dartmouth-Hitchcock Medical Center and Geisel School of Medicine at Dartmouth, Lebanon, NH 03766
| |
Collapse
|
22
|
Lee Y, Lee M, Shin Y, Kim K, Kim T. Spatial Omics in Clinical Research: A Comprehensive Review of Technologies and Guidelines for Applications. Int J Mol Sci 2025; 26:3949. [PMID: 40362187 PMCID: PMC12071594 DOI: 10.3390/ijms26093949] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2025] [Revised: 04/17/2025] [Accepted: 04/17/2025] [Indexed: 05/15/2025] Open
Abstract
Spatial omics integrates molecular profiling with spatial tissue context, enabling high-resolution analysis of gene expression, protein interactions, and epigenetic modifications. This approach provides critical insights into disease mechanisms and therapeutic responses, with applications in cancer, neurology, and immunology. Spatial omics technologies, including spatial transcriptomics, proteomics, and epigenomics, facilitate the study of cellular heterogeneity, tissue organization, and cell-cell interactions within their native environments. Despite challenges in data complexity and integration, advancements in multi-omics pipelines and computational tools are enhancing data accuracy and biological interpretation. This review provides a comprehensive overview of key spatial omics technologies, their analytical methods, validation strategies, and clinical applications. By integrating spatially resolved molecular data with traditional omics, spatial omics is transforming precision medicine, biomarker discovery, and personalized therapy. Future research should focus on improving standardization, reproducibility, and multimodal data integration to fully realize the potential of spatial omics in clinical and translational research.
Collapse
Affiliation(s)
- Yoonji Lee
- College of Medicine, The Catholic University of Korea, 222 Banpo-daero, Seocho-gu, Seoul 06591, Republic of Korea; (Y.L.); (M.L.); (Y.S.)
| | - Mingyu Lee
- College of Medicine, The Catholic University of Korea, 222 Banpo-daero, Seocho-gu, Seoul 06591, Republic of Korea; (Y.L.); (M.L.); (Y.S.)
| | - Yoojin Shin
- College of Medicine, The Catholic University of Korea, 222 Banpo-daero, Seocho-gu, Seoul 06591, Republic of Korea; (Y.L.); (M.L.); (Y.S.)
| | - Kyuri Kim
- College of Medicine, Ewha Womans University, 25 Magokdong-ro 2-gil, Gangseo-gu, Seoul 07804, Republic of Korea;
| | - Taejung Kim
- Department of Hospital Pathology, Yeouido St. Mary’s Hospital, College of Medicine, The Catholic University of Korea, 10, 63-ro, Yeongdeungpo-gu, Seoul 07345, Republic of Korea
| |
Collapse
|
23
|
Fang S, Xu M, Cao L, Liu X, Bezulj M, Tan L, Yuan Z, Li Y, Xia T, Guo L, Kovacevic V, Hui J, Guo L, Liu C, Cheng M, Lin L, Wen Z, Josic B, Milicevic N, Qiu P, Lu Q, Li Y, Wang L, Hu L, Zhang C, Kang Q, Chen F, Deng Z, Li J, Li M, Li S, Zhao Y, Fan G, Zhang Y, Chen A, Li Y, Xu X. Stereopy: modeling comparative and spatiotemporal cellular heterogeneity via multi-sample spatial transcriptomics. Nat Commun 2025; 16:3741. [PMID: 40258830 PMCID: PMC12012134 DOI: 10.1038/s41467-025-58079-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Accepted: 03/05/2025] [Indexed: 04/23/2025] Open
Abstract
Understanding complex biological systems requires tracing cellular dynamic changes across conditions, time, and space. However, integrating multi-sample data in a unified way to explore cellular heterogeneity remains challenging. Here, we present Stereopy, a flexible framework for modeling and dissecting comparative and spatiotemporal patterns in multi-sample spatial transcriptomics with interactive data visualization. To optimize this framework, we devise a universal container, a scope controller, and an integrative transformer tailored for multi-sample multimodal data storage, management, and processing. Stereopy showcases three representative applications: investigating specific cell communities and genes responsible for pathological changes, detecting spatiotemporal gene patterns by considering spatial and temporal features, and inferring three-dimensional niche-based cell-gene interaction network that bridges intercellular communications and intracellular regulations. Stereopy serves as both a comprehensive bioinformatics toolbox and an extensible framework that empowers researchers with enhanced data interpretation abilities and new perspectives for mining multi-sample spatial transcriptomics data.
Collapse
Affiliation(s)
| | - Mengyang Xu
- BGI Research, Shenzhen, China
- BGI Research, Qingdao, China
| | - Lei Cao
- BGI Research, Beijing, China
- BGI Research, Shenzhen, China
| | | | | | | | - Zhiyuan Yuan
- Institute of Science and Technology for Brain-Inspired Intelligence, MOE Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, Fudan University, Shanghai, China
| | - Yao Li
- BGI Research, Qingdao, China
| | - Tianyi Xia
- BGI Research, Beijing, China
- BGI Research, Shenzhen, China
| | | | | | | | - Lidong Guo
- BGI Research, Qingdao, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
| | | | - Mengnan Cheng
- BGI Research, Shenzhen, China
- BGI Research, Hangzhou, China
| | | | | | | | | | | | - Qin Lu
- BGI Research, Shenzhen, China
| | | | | | - Luni Hu
- BGI Research, Beijing, China
- BGI Research, Shenzhen, China
| | | | | | | | | | - Junhua Li
- BGI Research, Shenzhen, China
- Shenzhen Key Laboratory of Unknown Pathogen Identification, BGI Research, Shenzhen, China
- BGI Research, Riga, Latvia
| | - Mei Li
- BGI Research, Shenzhen, China
| | | | - Yi Zhao
- Beijing Key Laboratory of Mobile Computing and Pervasive Device, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China.
| | - Guangyi Fan
- BGI Research, Shenzhen, China.
- BGI Research, Qingdao, China.
| | - Yong Zhang
- BGI Research, Shenzhen, China.
- BGI Research, Wuhan, China.
- Guangdong Bigdata Engineering Technology Research Center for Life Sciences, BGI research, Shenzhen, China.
| | - Ao Chen
- BGI Research, Shenzhen, China.
| | - Yuxiang Li
- BGI Research, Shenzhen, China.
- BGI Research, Wuhan, China.
- Guangdong Bigdata Engineering Technology Research Center for Life Sciences, BGI research, Shenzhen, China.
| | - Xun Xu
- BGI Research, Wuhan, China.
| |
Collapse
|
24
|
Liu F, Ren S, Li J, Lv H, Jiang F, Bin Yu. SGTB: A graph representation learning model combining transformer and BERT for optimizing gene expression analysis in spatial transcriptomics data. Comput Biol Chem 2025; 118:108482. [PMID: 40306096 DOI: 10.1016/j.compbiolchem.2025.108482] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2025] [Revised: 04/05/2025] [Accepted: 04/17/2025] [Indexed: 05/02/2025]
Abstract
In recent years, spatial transcriptomics (ST) has emerged as an innovative technology that enables the simultaneous acquisition of gene expression information and its spatial distribution at the single-cell or regional level, providing deeper insights into cellular interactions and tissue organization, this technology provides a more holistic view of tissue organization and intercellular dynamics. However, existing methods still face certain limitations in data representation capabilities, making it challenging to fully capture complex spatial dependencies and global features. To address this, this paper proposes an innovative spatial multi-scale graph convolutional network (SGTB) based on large language models, integrating graph convolutional networks (GCN), Transformer, and BERT language models to optimize the representation of spatial transcriptomics data. The Graph Convolutional Network (GCN) employs a multi-layer architecture to extract features from gene expression matrices. Through iterative aggregation of neighborhood information, it captures spatial dependencies among cells and gene co-expression patterns, thereby constructing hierarchical cell embeddings. Subsequently, the model integrates an attention mechanism to assign weights to critical features and leverages Transformer layers to model global relationships, refining the ability of learned representations to reflect variations in spatial patterns. Finally, the model incorporates the BERT language model, mapping cell embeddings into textual inputs to exploit its deep semantic representation capabilities for high-dimensional feature extraction. These features are then fused with the embeddings generated by the Transformer, further optimizing feature learning for spatial transcriptomics data. This approach holds significant application value in improving the accuracy of tasks such as cell type classification and gene regulatory network construction, providing a novel computational framework for deep mining of spatial multi-scale biological data.
Collapse
Affiliation(s)
- Farong Liu
- College of Mathematics and Physics, Qingdao University of Science and Technology, Qingdao 266061, China; School of Data Science, Qingdao University of Science and Technology, Qingdao 266061, China
| | - Sheng Ren
- School of Data Science, Qingdao University of Science and Technology, Qingdao 266061, China; Artificial Intelligence and Biomedical Big Data Research Center, Qingdao University of Science and Technology, Qingdao 266061, China
| | - Jie Li
- School of Data Science, Qingdao University of Science and Technology, Qingdao 266061, China; Artificial Intelligence and Biomedical Big Data Research Center, Qingdao University of Science and Technology, Qingdao 266061, China
| | - Haoyang Lv
- School of Data Science, Qingdao University of Science and Technology, Qingdao 266061, China; Artificial Intelligence and Biomedical Big Data Research Center, Qingdao University of Science and Technology, Qingdao 266061, China
| | - Fenghui Jiang
- Editorial Office of Journal of Qingdao University of Science and Technology (Natural Science Edition), Qingdao University of Science and Technology, Qingdao 266061, China.
| | - Bin Yu
- School of Data Science, Qingdao University of Science and Technology, Qingdao 266061, China; Artificial Intelligence and Biomedical Big Data Research Center, Qingdao University of Science and Technology, Qingdao 266061, China.
| |
Collapse
|
25
|
Paniagua K, Jin YF, Chen Y, Gao SJ, Huang Y, Flores M. Dissection of tumoral niches using spatial transcriptomics and deep learning. iScience 2025; 28:112214. [PMID: 40230519 PMCID: PMC11994907 DOI: 10.1016/j.isci.2025.112214] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2024] [Revised: 10/05/2024] [Accepted: 03/10/2025] [Indexed: 04/16/2025] Open
Abstract
This study introduces TG-ME, an innovative computational framework that integrates transformer with graph variational autoencoder (GraphVAE) models for dissection of tumoral niches using spatial transcriptomics data and morphological images. TG-ME effectively identifies and characterizes niches in bench datasets and a high resolution NSCLC dataset. The pipeline consists in different stages that include normalization, spatial information integration, morphological feature extraction, gene expression quantification, single cell expression characterization, and tumor niche characterization. For this, TG-ME leverages advanced deep learning techniques that achieve robust clustering and profiling of niches across cancer stages. TG-ME can potentially provide insights into the spatial organization of tumor microenvironments (TME), highlighting specific niche compositions and their molecular changes along cancer progression. TG-ME is a promising tool for guiding personalized treatment strategies by uncovering microenvironmental signatures associated with disease prognosis and therapeutic outcomes.
Collapse
Affiliation(s)
- Karla Paniagua
- Department of Electrical and Computer Engineering, KLESSE School of Engineering and Integrated Design, University of Texas at San Antonio, San Antonio, TX 78249, USA
| | - Yu-Fang Jin
- Department of Electrical and Computer Engineering, KLESSE School of Engineering and Integrated Design, University of Texas at San Antonio, San Antonio, TX 78249, USA
| | - Yidong Chen
- Greehey Children Cancer Research Institute, The University of Texas Health Science Center at San Antonio, San Antonio, TX 78229, USA
- Department of Population Health Science, The University of Texas Health Science Center at San Antonio, San Antonio, TX 78229, USA
| | - Shou-Jiang Gao
- Cancer Virology Program, UPMC Hillman Cancer Center, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
- Department of Microbiology and Molecular Genetics, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
| | - Yufei Huang
- Cancer Virology Program, UPMC Hillman Cancer Center, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
- Department of Medicine, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
- Department of Electrical and Computer Engineering, Swanson School of Engineering, University of Pittsburgh, Pittsburgh, PA, USA
| | - Mario Flores
- Department of Electrical and Computer Engineering, KLESSE School of Engineering and Integrated Design, University of Texas at San Antonio, San Antonio, TX 78249, USA
| |
Collapse
|
26
|
Morrissey ZD, Kumar P, Phan TX, Maienschein-Cline M, Leow A, Lazarov O. Neurogenesis drives hippocampal formation-wide spatial transcription alterations in health and Alzheimer's disease. FRONTIERS IN DEMENTIA 2025; 4:1546433. [PMID: 40309339 PMCID: PMC12041076 DOI: 10.3389/frdem.2025.1546433] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/16/2024] [Accepted: 03/31/2025] [Indexed: 05/02/2025]
Abstract
The mechanism by which neurogenesis regulates the profile of neurons and glia in the hippocampal formation is not known. Further, the effect of neurogenesis on neuronal vulnerability characterizing the entorhinal cortex in Alzheimer's disease (AD) is unknown. Here, we used in situ sequencing to investigate the spatial transcription profile of neurons and glia in the hippocampal circuitry in wild-type mice and in familial AD (FAD) mice expressing varying levels of neurogenesis. This approach revealed that in addition to the dentate gyrus, neurogenesis modulates the cellular profile in the entorhinal cortex and CA regions of the hippocampus. Notably, enhancing neurogenesis in FAD mice led to partial restoration of neuronal and cellular profile in these brain areas, resembling the profile of their wild-type counterparts. This approach provides a platform for the examination of the cellular dynamics in the hippocampal formation in health and in AD.
Collapse
Affiliation(s)
- Zachery D. Morrissey
- Graduate Program in Neuroscience, University of Illinois Chicago, Chicago, IL, United States
- Department of Psychiatry, University of Illinois Chicago, Chicago, IL, United States
- Department of Anatomy and Cell Biology, University of Illinois Chicago, Chicago, IL, United States
| | - Pavan Kumar
- Department of Anatomy and Cell Biology, University of Illinois Chicago, Chicago, IL, United States
| | - Trongha X. Phan
- Department of Anatomy and Cell Biology, University of Illinois Chicago, Chicago, IL, United States
| | | | - Alex Leow
- Department of Psychiatry, University of Illinois Chicago, Chicago, IL, United States
- Department of Biomedical Engineering, University of Illinois Chicago, Chicago, IL, United States
| | - Orly Lazarov
- Department of Anatomy and Cell Biology, University of Illinois Chicago, Chicago, IL, United States
| |
Collapse
|
27
|
Kang L, Zhang Q, Qian F, Liang J, Wu X. Benchmarking computational methods for detecting spatial domains and domain-specific spatially variable genes from spatial transcriptomics data. Nucleic Acids Res 2025; 53:gkaf303. [PMID: 40240000 PMCID: PMC12000868 DOI: 10.1093/nar/gkaf303] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2024] [Revised: 03/21/2025] [Accepted: 04/03/2025] [Indexed: 04/18/2025] Open
Abstract
Advances in spatially resolved transcriptomics (SRT) have led to the emergence of numerous computational methods for identifying spatial domains and spatially variable genes (SVGs); however, a comprehensive assessment of existing methods is lacking. We comprehensively benchmarked 19 methods for detecting spatial domains and domain-specific SVGs from SRT data, using 30 real-world datasets covering six SRT technologies and 27 synthetic datasets. We first evaluated the performance of these methods on spatial domain identification in terms of accuracy, stability, generalizability, and scalability. Results reveal that there is no single method that works best for all datasets, and the optimal method depends on the data, especially the SRT platform. Further, we proposed a quantitative strategy to evaluate domain-specific SVG recognition results and assessed the impact of spatial domains on SVG detection. We found that SVG detection based on spatial domains identified by different GNN methods have high accuracy but low concordance. Generally, the more accurate the recognized spatial domains, the higher the number and accuracy of domain-specific SVGs detected. Moreover, integrating spatial clustering results from different methods can lead to more robust and better clustering and SVG results. Practical guidelines were provided for choosing appropriate methods for spatial domain and domain-specific SVG identification.
Collapse
Affiliation(s)
- Liping Kang
- Department of Hematology, Children's Hospital of Soochow University, Suzhou 215000, China
- Cancer Institute, Suzhou Medical College, Soochow University, Suzhou 215000, China
| | - Qinglong Zhang
- Cancer Institute, Suzhou Medical College, Soochow University, Suzhou 215000, China
| | | | - Junyao Liang
- Cancer Institute, Suzhou Medical College, Soochow University, Suzhou 215000, China
| | - Xiaohui Wu
- Department of Hematology, Children's Hospital of Soochow University, Suzhou 215000, China
- Cancer Institute, Suzhou Medical College, Soochow University, Suzhou 215000, China
- MOE Key Laboratory of Geriatric Diseases and Immunology, Suzhou Medical College, Soochow University, Suzhou 215000, China
- Jiangsu Key Laboratory of Infection and Immunity, Soochow University, Suzhou 215000, China
- Pediatric Hematology & Oncology Key Laboratory of Higher Education Institutions in Jiangsu Province, Suzhou 215000, China
| |
Collapse
|
28
|
Bao X, Bai X, Liu X, Shi Q, Zhang C. Spatially informed graph transformers for spatially resolved transcriptomics. Commun Biol 2025; 8:574. [PMID: 40188303 PMCID: PMC11972348 DOI: 10.1038/s42003-025-08015-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2024] [Accepted: 03/28/2025] [Indexed: 04/07/2025] Open
Abstract
Spatially resolved transcriptomics (SRT) has emerged as a powerful technique for mapping gene expression landscapes within spatial contexts. However, significant challenges persist in effectively integrating gene expression with spatial information to elucidate the heterogeneity of biological tissues. Here, we present a Spatially informed Graph Transformers framework, SpaGT, which leverages both node and edge channels to model spatially aware graph representation for denoising gene expression and identifying spatial domains. Unlike conventional graph neural networks, which rely on static, localized convolutional aggregation, SpaGT employs a structure-reinforced self-attention mechanism that iteratively evolves topological structural information and transcriptional signal representation. By replacing graph convolution with global self-attention, SpaGT enables the integration of both global and spatially localized information, thereby improving the detection of fine-grained spatial domains. We demonstrate that SpaGT achieves superior performance in identifying spatial domains and denoising gene expression data across diverse platforms and species. Furthermore, SpaGT facilitates the discovery of spatially variable genes with significant prognostic potential in cancer tissues. These findings establish SpaGT as a powerful tool for unraveling the complexities of biological tissues.
Collapse
Affiliation(s)
- Xinyu Bao
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, China
| | - Xiaosheng Bai
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, China
| | - Xiaoping Liu
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, China.
| | - Qianqian Shi
- Hubei Engineering Technology Research Center of Agricultural Big Data, Huazhong Agricultural University, Wuhan, China.
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, China.
| | - Chuanchao Zhang
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, China.
| |
Collapse
|
29
|
Leng J, Yu J, Wu LY, Chen H. Flexible integration of spatial and expression information for precise spot embedding via ZINB-based graph-enhanced autoencoder. Commun Biol 2025; 8:556. [PMID: 40186054 PMCID: PMC11971412 DOI: 10.1038/s42003-025-07965-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2024] [Accepted: 03/19/2025] [Indexed: 04/07/2025] Open
Abstract
Domain identification is a critical problem in spatially resolved transcriptomics data analysis, which aims to identify distinct spatial domains within a tissue that maintain both spatial continuity and expression consistency. The degree of coupling between expression data and spatial information in different datasets often varies significantly. Some regions have intact and clear boundaries, while others exhibit blurred boundaries with high intra-domain expression similarity. However, most domain identification methods do not adequately integrate expression and spatial information to flexibly identify different types of domains. To address these issues, we introduce Spot2vector, a computational framework that leverages a graph-enhanced autoencoder integrating zero-inflated negative binomial distribution modeling, combining both graph convolutional networks and graph attention networks to extract the latent embeddings of spots. Spot2vector encodes and integrates spatial and expression information, enabling effective identification of domains with diverse spatial patterns across spatially resolved transcriptomics data generated by different platforms. The decoders enable us to decipher the distribution and generation mechanisms of data while improving expression quality through denoising. Extensive validation and analyses demonstrate that Spot2vector excels in enhancing domain identification accuracy, effectively reducing data dimensionality, improving expression recovery and denoising, and precisely capturing spatial gene expression patterns.
Collapse
Affiliation(s)
| | - Jiating Yu
- School of Mathematics and Statistics, Nanjing University of Information Science & Technology, Nanjing, 210044, China
| | - Ling-Yun Wu
- IAM, MADIS, NCMIS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, 100190, China
- School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China
| | | |
Collapse
|
30
|
Yuan C, Patel K, Shi H, Wang HLV, Wang F, Li R, Li Y, Corces VG, Shi H, Das S, Yu J, Jin P, Yao B, Hu J. mcDETECT: Decoding 3D Spatial Synaptic Transcriptomes with Subcellular-Resolution Spatial Transcriptomics. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.03.27.645744. [PMID: 40236251 PMCID: PMC11996425 DOI: 10.1101/2025.03.27.645744] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/17/2025]
Abstract
Spatial transcriptomics (ST) has shown great potential for unraveling the molecular mechanisms of neurodegenerative diseases. However, most existing analyses of ST data focus on bulk or single-cell resolution, overlooking subcellular compartments such as synapses, which are fundamental structures of the brain's neural network. Here we present mcDETECT, a novel framework that integrates machine learning algorithms and in situ ST (iST) with targeted gene panels to study synapses. mcDETECT identifies individual synapses based on the aggregation of synaptic mRNAs in three-dimensional (3D) space, allowing for the construction of single-synapse spatial transcriptome profiles. By benchmarking the synapse density measured by volume electron microscopy and genetic labeling, we demonstrate that mcDETECT can faithfully and accurately recover the spatial location of single synapses using iST data from multiple platforms, including Xenium, Xenium 5K, MERSCOPE, and CosMx. Based on the subsequent transcriptome profiling, we further stratify total synapses into various subtypes and explore their pathogenic dysregulation associated with Alzheimer's disease (AD) progression, which provides potential targets for synapse-specific therapies in AD progression.
Collapse
|
31
|
Min W, Fang D, Chen J, Zhang S. SpaMask: Dual masking graph autoencoder with contrastive learning for spatial transcriptomics. PLoS Comput Biol 2025; 21:e1012881. [PMID: 40179332 PMCID: PMC11968113 DOI: 10.1371/journal.pcbi.1012881] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2024] [Accepted: 02/17/2025] [Indexed: 04/05/2025] Open
Abstract
Understanding the spatial locations of cell within tissues is crucial for unraveling the organization of cellular diversity. Recent advancements in spatial resolved transcriptomics (SRT) have enabled the analysis of gene expression while preserving the spatial context within tissues. Spatial domain characterization is a critical first step in SRT data analysis, providing the foundation for subsequent analyses and insights into biological implications. Graph neural networks (GNNs) have emerged as a common tool for addressing this challenge due to the structural nature of SRT data. However, current graph-based deep learning approaches often overlook the instability caused by the high sparsity of SRT data. Masking mechanisms, as an effective self-supervised learning strategy, can enhance the robustness of these models. To this end, we propose SpaMask, dual masking graph autoencoder with contrastive learning for SRT analysis. Unlike previous GNNs, SpaMask masks a portion of spot nodes and spot-to-spot edges to enhance its performance and robustness. SpaMask combines Masked Graph Autoencoders (MGAE) and Masked Graph Contrastive Learning (MGCL) modules, with MGAE using node masking to leverage spatial neighbors for improved clustering accuracy, while MGCL applies edge masking to create a contrastive loss framework that tightens embeddings of adjacent nodes based on spatial proximity and feature similarity. We conducted a comprehensive evaluation of SpaMask on eight datasets from five different platforms. Compared to existing methods, SpaMask achieves superior clustering accuracy and effective batch correction.
Collapse
Affiliation(s)
- Wenwen Min
- School of Information Science and Engineering, Yunnan University, Kunming, Yunnan, China
| | - Donghai Fang
- School of Information Science and Engineering, Yunnan University, Kunming, Yunnan, China
| | - Jinyu Chen
- School of Mathematics, Statistics and Mechanics, Beijing University of Technology, Beijing, China
| | - Shihua Zhang
- NCMIS, CEMS, RCSDS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China
- School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing, China
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, Zhejiang, China
| |
Collapse
|
32
|
Birk S, Bonafonte-Pardàs I, Feriz AM, Boxall A, Agirre E, Memi F, Maguza A, Yadav A, Armingol E, Fan R, Castelo-Branco G, Theis FJ, Bayraktar OA, Talavera-López C, Lotfollahi M. Quantitative characterization of cell niches in spatially resolved omics data. Nat Genet 2025; 57:897-909. [PMID: 40102688 PMCID: PMC11985353 DOI: 10.1038/s41588-025-02120-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2024] [Accepted: 02/05/2025] [Indexed: 03/20/2025]
Abstract
Spatial omics enable the characterization of colocalized cell communities that coordinate specific functions within tissues. These communities, or niches, are shaped by interactions between neighboring cells, yet existing computational methods rarely leverage such interactions for their identification and characterization. To address this gap, here we introduce NicheCompass, a graph deep-learning method that models cellular communication to learn interpretable cell embeddings that encode signaling events, enabling the identification of niches and their underlying processes. Unlike existing methods, NicheCompass quantitatively characterizes niches based on communication pathways and consistently outperforms alternatives. We show its versatility by mapping tissue architecture during mouse embryonic development and delineating tumor niches in human cancers, including a spatial reference mapping application. Finally, we extend its capabilities to spatial multi-omics, demonstrate cross-technology integration with datasets from different sequencing platforms and construct a whole mouse brain spatial atlas comprising 8.4 million cells, highlighting NicheCompass' scalability. Overall, NicheCompass provides a scalable framework for identifying and analyzing niches through signaling events.
Collapse
Affiliation(s)
- Sebastian Birk
- Institute of AI for Health, Helmholtz Center Munich-German Research Center for Environmental Health, Neuherberg, Germany
- School of Computation, Information and Technology, Technical University of Munich, Munich, Germany
- Würzburg Institute of Systems Immunology (WüSI), University of Würzburg, Würzburg, Germany
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
| | - Irene Bonafonte-Pardàs
- Institute of Computational Biology, Helmholtz Center Munich-German Research Center for Environmental Health, Neuherberg, Germany
- Biomedical Center (BMC), Physiological Chemistry, Faculty of Medicine, Ludwig Maximilian University of Munich, Planegg-Martinsried, Germany
| | | | - Adam Boxall
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
| | - Eneritz Agirre
- Laboratory of Molecular Neurobiology, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, Sweden
| | - Fani Memi
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
| | - Anna Maguza
- Würzburg Institute of Systems Immunology (WüSI), University of Würzburg, Würzburg, Germany
- Faculty of Medicine, University of Würzburg, Würzburg, Germany
| | - Anamika Yadav
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
| | - Erick Armingol
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
| | - Rong Fan
- Department of Biomedical Engineering, Yale University, New Haven, CT, USA
- Yale Stem Cell Center and Yale Cancer Center, Yale University School of Medicine, New Haven, CT, USA
- Department of Pathology, Yale University School of Medicine, New Haven, CT, USA
- Human and Translational Immunology Program, Yale University School of Medicine, New Haven, CT, USA
| | - Gonçalo Castelo-Branco
- Laboratory of Molecular Neurobiology, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, Sweden
- Ming Wai Lau Centre for Reparative Medicine, Stockholm Node, Karolinska Institutet, Stockholm, Sweden
| | - Fabian J Theis
- School of Computation, Information and Technology, Technical University of Munich, Munich, Germany
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
- Institute of Computational Biology, Helmholtz Center Munich-German Research Center for Environmental Health, Neuherberg, Germany
- School of Life Sciences Weihenstephan, Technical University of Munich, Munich, Germany
| | | | - Carlos Talavera-López
- Würzburg Institute of Systems Immunology (WüSI), University of Würzburg, Würzburg, Germany.
- Faculty of Medicine, University of Würzburg, Würzburg, Germany.
| | - Mohammad Lotfollahi
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK.
- Institute of Computational Biology, Helmholtz Center Munich-German Research Center for Environmental Health, Neuherberg, Germany.
| |
Collapse
|
33
|
Marco Salas S, Kuemmerle LB, Mattsson-Langseth C, Tismeyer S, Avenel C, Hu T, Rehman H, Grillo M, Czarnewski P, Helgadottir S, Tiklova K, Andersson A, Rafati N, Chatzinikolaou M, Theis FJ, Luecken MD, Wählby C, Ishaque N, Nilsson M. Optimizing Xenium In Situ data utility by quality assessment and best-practice analysis workflows. Nat Methods 2025; 22:813-823. [PMID: 40082609 PMCID: PMC11978515 DOI: 10.1038/s41592-025-02617-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2023] [Accepted: 02/04/2025] [Indexed: 03/16/2025]
Abstract
The Xenium In Situ platform is a new spatial transcriptomics product commercialized by 10x Genomics, capable of mapping hundreds of genes in situ at subcellular resolution. Given the multitude of commercially available spatial transcriptomics technologies, recommendations in choice of platform and analysis guidelines are increasingly important. Herein, we explore 25 Xenium datasets generated from multiple tissues and species, comparing scalability, resolution, data quality, capacities and limitations with eight other spatially resolved transcriptomics technologies and commercial platforms. In addition, we benchmark the performance of multiple open-source computational tools, when applied to Xenium datasets, in tasks including preprocessing, cell segmentation, selection of spatially variable features and domain identification. This study serves as an independent analysis of the performance of Xenium, and provides best practices and recommendations for analysis of such datasets.
Collapse
Affiliation(s)
- Sergio Marco Salas
- Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden.
- Institute of Computational Biology, Computational Health Center, Helmholtz Munich, Munich, Germany.
| | - Louis B Kuemmerle
- Institute of Computational Biology, Computational Health Center, Helmholtz Munich, Munich, Germany
- Institute for Tissue Engineering and Regenerative Medicine (iTERM), Helmholtz Zentrum München, Munich, Germany
| | | | - Sebastian Tismeyer
- Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Digital Health Center, Berlin, Germany
| | - Christophe Avenel
- Department of Information Technology and SciLifeLab BioImage Informatics Facility, Uppsala University, Uppsala, Sweden
| | - Taobo Hu
- Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden
| | - Habib Rehman
- Institute of Computational Biology, Computational Health Center, Helmholtz Munich, Munich, Germany
- Department of Bioengineering, University of Illinois at Urbana-Champaign, Champaign, IL, USA
| | - Marco Grillo
- Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden
| | - Paulo Czarnewski
- Science for Life Laboratory, Department of Gene Technology, KTH Royal Institute of Technology, Stockholm, Sweden
- Science for Life Laboratory, Department of Biochemistry and Biophysics, National Bioinformatics Infrastructure Sweden, Stockholm University, Stockholm, Sweden
| | - Saga Helgadottir
- Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden
| | - Katarina Tiklova
- Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden
| | - Axel Andersson
- Department of Information Technology and SciLifeLab BioImage Informatics Facility, Uppsala University, Uppsala, Sweden
| | - Nima Rafati
- National Bioinformatics Infrastructure Sweden, Uppsala University, SciLifeLab, Department of Medical Biochemistry and Microbiology, Uppsala, Sweden
| | - Maria Chatzinikolaou
- Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden
| | - Fabian J Theis
- Institute of Computational Biology, Computational Health Center, Helmholtz Munich, Munich, Germany
| | - Malte D Luecken
- Institute of Computational Biology, Computational Health Center, Helmholtz Munich, Munich, Germany
- Institute of Lung Health & Immunity, Helmholtz Munich; Member of the German Center for Lung Research (DZL), Munich, Germany
| | - Carolina Wählby
- Department of Information Technology and SciLifeLab BioImage Informatics Facility, Uppsala University, Uppsala, Sweden
| | - Naveed Ishaque
- Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Digital Health Center, Berlin, Germany
| | - Mats Nilsson
- Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden.
| |
Collapse
|
34
|
Tu W, Zhang L. Integrating multiple spatial transcriptomics data using community-enhanced graph contrastive learning. PLoS Comput Biol 2025; 21:e1012948. [PMID: 40179111 PMCID: PMC11990772 DOI: 10.1371/journal.pcbi.1012948] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2024] [Revised: 04/11/2025] [Accepted: 03/10/2025] [Indexed: 04/05/2025] Open
Abstract
Due to the rapid development of spatial sequencing technologies, large amounts of spatial transcriptomic datasets have been generated across various technological platforms or different biological conditions (e.g., control vs. treatment). Spatial transcriptomics data coming from different platforms usually has different resolutions. Moreover, current methods do not consider the heterogeneity of spatial structures within and across slices when modeling spatial transcriptomics data with graph-based methods. In this study, we propose a community-enhanced graph contrastive learning-based method named Tacos to integrate multiple spatial transcriptomics data. We applied Tacos to several real datasets coming from different platforms under different scenarios. Systematic benchmark analyses demonstrate Tacos's superior performance in integrating different slices. Furthermore, Tacos can accurately denoise the spatially resolved transcriptomics data.
Collapse
Affiliation(s)
- Wenqian Tu
- School of Artificial Intelligence, School of Computer Science, Wuhan University, Wuhan, China
| | - Lihua Zhang
- School of Artificial Intelligence, School of Computer Science, Wuhan University, Wuhan, China
| |
Collapse
|
35
|
Chen X, Ran Q, Tang J, Chen Z, Huang S, Shi X, Xi R. Benchmarking algorithms for spatially variable gene identification in spatial transcriptomics. Bioinformatics 2025; 41:btaf131. [PMID: 40139667 PMCID: PMC12036962 DOI: 10.1093/bioinformatics/btaf131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2024] [Revised: 03/15/2025] [Accepted: 03/27/2025] [Indexed: 03/29/2025] Open
Abstract
MOTIVATION The rapid development of spatial transcriptomics has underscored the importance of identifying spatially variable genes. As a fundamental task in spatial transcriptomic data analysis, spatially variable gene identification has been extensively studied. However, the lack of comprehensive benchmark makes it difficult to validate the effectiveness of various algorithms scattered across a large number of studies with real-world datasets. RESULTS In response, this article proposes a benchmark framework to evaluate algorithms for identifying spatially variable genes through the analysis of 30 synthesized and 74 real-world datasets, aiming to identify the best algorithms and their corresponding application scenarios. This framework can assist medical and life scientists in selecting suitable algorithms for their research, while also aid bioinformatics scientists in developing more powerful and efficient computational methods in spatial transcriptomic research. AVAILABILITY AND IMPLEMENTATION The source code of this benchmarking framework is available at both Github (https://github.com/XiDsLab/svg-benchmark) and Zenodo (https://doi.org/10.5281/zenodo.15031083). In addition, all real and synthetic datasets considered in this study are also publicly available at Zenodo (https://doi.org/10.5281/zenodo.7227771).
Collapse
Affiliation(s)
- Xuanwei Chen
- School of Mathematical Sciences, Peking University, Beijing 100871, China
| | - Qinghua Ran
- Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
| | - Junjie Tang
- Center for Statistical Science, Peking University, Beijing 100871, China
| | - Zihao Chen
- School of Mathematical Sciences, Peking University, Beijing 100871, China
| | - Siyuan Huang
- Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
| | - Xingjie Shi
- KLATASDS-MOE, Academy of Statistics and Interdisciplinary Sciences, School of Statistics, East China Normal University, Shanghai 200062, China
| | - Ruibin Xi
- School of Mathematical Sciences, Peking University, Beijing 100871, China
- Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
- Center for Statistical Science, Peking University, Beijing 100871, China
| |
Collapse
|
36
|
Cai J, Wu S, Cheng H, Zhong W, Yuan GC, Ma P. Protocol to boost the robustness and accuracy of spatial transcriptomics algorithms using ensemble techniques. STAR Protoc 2025; 6:103608. [PMID: 39879360 PMCID: PMC11803146 DOI: 10.1016/j.xpro.2025.103608] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2024] [Revised: 11/14/2024] [Accepted: 01/08/2025] [Indexed: 01/31/2025] Open
Abstract
Spatial transcriptomics enhances our understanding of cellular organization by mapping gene expression data to precise tissue locations. Here, we present a protocol for using weighted ensemble method for spatial transcriptomics (WEST), which uses ensemble techniques to boost the robustness and accuracy of existing algorithms. We describe steps for preprocessing data, obtaining embeddings from individual algorithms, and ensemble integrating all embeddings as a similarity matrix. We then detail procedures for using the similarity matrix to identify spatial domains and obtain new embeddings. For complete details on the use and execution of this protocol, please refer to Cai et al.1.
Collapse
Affiliation(s)
- Jiazhang Cai
- Department of Statistics, University of Georgia, 310 Herty Drive, Athens, GA 30602, USA
| | - Shushan Wu
- Department of Statistics, University of Georgia, 310 Herty Drive, Athens, GA 30602, USA
| | - Huimin Cheng
- Department of Biostatistics, Boston University, 801 Massachusetts Avenue Crosstown Center, Boston, MA 02118, USA
| | - Wenxuan Zhong
- Department of Statistics, University of Georgia, 310 Herty Drive, Athens, GA 30602, USA.
| | - Guo-Cheng Yuan
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1468 Madison Avenue, New York, NY 10029, USA.
| | - Ping Ma
- Department of Statistics, University of Georgia, 310 Herty Drive, Athens, GA 30602, USA.
| |
Collapse
|
37
|
Zhao PA, Li R, Adewunmi T, Garber J, Gustafson C, Kim J, Malone J, Savage A, Skene P, Li XJ. SPARROW reveals microenvironment-zone-specific cell states in healthy and diseased tissues. Cell Syst 2025; 16:101235. [PMID: 40112778 DOI: 10.1016/j.cels.2025.101235] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2024] [Revised: 10/23/2024] [Accepted: 02/19/2025] [Indexed: 03/22/2025]
Abstract
Spatially resolved transcriptomics technologies have advanced our understanding of cellular characteristics within tissue contexts. However, current analytical tools often treat cell-type inference and cellular neighborhood identification as separate and hard clustering processes, limiting comparability across scales and samples. SPARROW addresses these challenges by jointly learning latent embeddings and soft clusterings of cell types and cellular organization. It outperformed state-of-the-art methods in cell-type inference and microenvironment zone delineation and uncovered zone-specific cell states in human and mouse tissues that competing methods missed. By integrating spatially resolved transcriptomics and single-cell RNA sequencing (scRNA-seq) data in a shared latent space, SPARROW achieves single-cell spatial resolution and whole-transcriptome coverage, enabling the discovery of both established and unknown microenvironment zone-specific ligand-receptor interactions in the human tonsil. Overall, SPARROW is a computational framework that provides a comprehensive characterization of tissue features across scales, samples, and conditions.
Collapse
Affiliation(s)
- Peiyao A Zhao
- Allen Institute for Immunology, Seattle, WA 98109, USA.
| | - Ruoxin Li
- Allen Institute for Immunology, Seattle, WA 98109, USA
| | - Temi Adewunmi
- Allen Institute for Immunology, Seattle, WA 98109, USA
| | | | | | - June Kim
- Allen Institute for Immunology, Seattle, WA 98109, USA
| | | | - Adam Savage
- Allen Institute for Immunology, Seattle, WA 98109, USA
| | - Peter Skene
- Allen Institute for Immunology, Seattle, WA 98109, USA
| | - Xiao-Jun Li
- Allen Institute for Immunology, Seattle, WA 98109, USA.
| |
Collapse
|
38
|
Wang Y, Liu Z, Ma X. MuCST: restoring and integrating heterogeneous morphology images and spatial transcriptomics data with contrastive learning. Genome Med 2025; 17:21. [PMID: 40082941 PMCID: PMC11907906 DOI: 10.1186/s13073-025-01449-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2024] [Accepted: 03/07/2025] [Indexed: 03/16/2025] Open
Abstract
Spatially resolved transcriptomics (SRT) simultaneously measure spatial location, histology images, and transcriptional profiles of cells or regions in undissociated tissues. Integrative analysis of multi-modal SRT data holds immense potential for understanding biological mechanisms. Here, we present a flexible multi-modal contrastive learning for the integration of SRT data (MuCST), which joins denoising, heterogeneity elimination, and compatible feature learning. MuCST accurately identifies spatial domains and is applicable to diverse datasets platforms. Overall, MuCST provides an alternative for integrative analysis of multi-modal SRT data ( https://github.com/xkmaxidian/MuCST ).
Collapse
Affiliation(s)
- Yu Wang
- School of Computer Science and Technology, Xidian University, No.2 South Taibai Road, Xi'an, 710071, Shaanxi, China
- Key Laboratory of Smart Human-Computer Interaction and Wearable Technology of Shaanxi Province, Xidian University, No.2 South Taibai Road, Xi'an, 710071, Shaanxi, China
| | - Zaiyi Liu
- Department of Radiology, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, 106 Zhongshan Er Road, Guangzhou, 510080, Guangdong, China
- Guangdong Provincial Key Laboratory of Artificial Intelligence in Medical Image Analysis and Application, 106 Zhongshan Er Road, Guangzhou, 510080, Guangdong, China
| | - Xiaoke Ma
- School of Computer Science and Technology, Xidian University, No.2 South Taibai Road, Xi'an, 710071, Shaanxi, China.
- Key Laboratory of Smart Human-Computer Interaction and Wearable Technology of Shaanxi Province, Xidian University, No.2 South Taibai Road, Xi'an, 710071, Shaanxi, China.
| |
Collapse
|
39
|
Fang Z, Krusen K, Priest H, Wang M, Kim S, Sriram A, Yellanki A, Singh A, Horwitz E, Coskun AF. Graph-Based 3-Dimensional Spatial Gene Neighborhood Networks of Single Cells in Gels and Tissues. BME FRONTIERS 2025; 6:0110. [PMID: 40084126 PMCID: PMC11906096 DOI: 10.34133/bmef.0110] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2024] [Revised: 02/12/2025] [Accepted: 02/13/2025] [Indexed: 03/16/2025] Open
Abstract
Objective: We developed 3-dimensional spatially resolved gene neighborhood network embedding (3D-spaGNN-E) to find subcellular gene proximity relationships and identify key subcellular motifs in cell-cell communication (CCC). Impact Statement: The pipeline combines 3D imaging-based spatial transcriptomics and graph-based deep learning to identify subcellular motifs. Introduction: Advancements in imaging and experimental technology allow the study of 3D spatially resolved transcriptomics and capture better spatial context than approximating the samples as 2D. However, the third spatial dimension increases the data complexity and requires new analyses. Methods: 3D-spaGNN-E detects single transcripts in 3D cell culture samples and identifies subcellular gene proximity relationships. Then, a graph autoencoder projects the gene proximity relationships into a latent space. We then applied explainability analysis to identify subcellular CCC motifs. Results: We first applied the pipeline to mesenchymal stem cells (MSCs) cultured in hydrogel. After clustering the cells based on the RNA count, we identified cells belonging to the same cluster as homotypic and those belonging to different clusters as heterotypic. We identified changes in local gene proximity near the border between homotypic and heterotypic cells. When applying the pipeline to the MSC-peripheral blood mononuclear cell (PBMC) coculture system, we identified CD4+ and CD8+ T cells. Local gene proximity and autoencoder embedding changes can distinguish strong and weak suppression of different immune cells. Lastly, we compared astrocyte-neuron CCC in mouse hypothalamus and cortex by analyzing 3D multiplexed-error-robust fluorescence in situ hybridization (MERFISH) data and identified regional gene proximity differences. Conclusion: 3D-spaGNN-E distinguished distinct CCCs in cell culture and tissue by examining subcellular motifs.
Collapse
Affiliation(s)
- Zhou Fang
- Wallace H. Coulter Department of Biomedical Engineering,
Georgia Institute of Technology and Emory University, Atlanta, GA, USA
- Machine Learning Graduate Program,
Georgia Institute of Technology, Atlanta, GA, USA
| | - Kelsey Krusen
- Wallace H. Coulter Department of Biomedical Engineering,
Georgia Institute of Technology and Emory University, Atlanta, GA, USA
| | - Hannah Priest
- Wallace H. Coulter Department of Biomedical Engineering,
Georgia Institute of Technology and Emory University, Atlanta, GA, USA
| | - Mingshuang Wang
- Wallace H. Coulter Department of Biomedical Engineering,
Georgia Institute of Technology and Emory University, Atlanta, GA, USA
| | - Sungwoong Kim
- Wallace H. Coulter Department of Biomedical Engineering,
Georgia Institute of Technology and Emory University, Atlanta, GA, USA
| | - Anirudh Sriram
- Wallace H. Coulter Department of Biomedical Engineering,
Georgia Institute of Technology and Emory University, Atlanta, GA, USA
| | - Ashritha Yellanki
- Wallace H. Coulter Department of Biomedical Engineering,
Georgia Institute of Technology and Emory University, Atlanta, GA, USA
| | - Ankur Singh
- Wallace H. Coulter Department of Biomedical Engineering,
Georgia Institute of Technology and Emory University, Atlanta, GA, USA
- Woodruff School of Mechanical Engineering,
Georgia Institute of Technology, Atlanta, GA, USA
- Interdisciplinary Bioengineering Graduate Program,
Georgia Institute of Technology, Atlanta, GA, USA
- Parker H. Petit Institute for Bioengineering and Bioscience, GeorgiaInstitute of Technology, Atlanta, GA 30332, USA
| | - Edwin Horwitz
- Department of Pediatrics,
Emory University School of Medicine, Atlanta, GA 30322, USA
| | - Ahmet F. Coskun
- Wallace H. Coulter Department of Biomedical Engineering,
Georgia Institute of Technology and Emory University, Atlanta, GA, USA
- Interdisciplinary Bioengineering Graduate Program,
Georgia Institute of Technology, Atlanta, GA, USA
- Parker H. Petit Institute for Bioengineering and Bioscience, GeorgiaInstitute of Technology, Atlanta, GA 30332, USA
| |
Collapse
|
40
|
Liu M, Hernandez MO, Castven D, Lee HP, Wu W, Wang L, Forgues M, Hernandez JM, Marquardt JU, Ma L. Tumor cell villages define the co-dependency of tumor and microenvironment in liver cancer. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.03.07.642107. [PMID: 40161587 PMCID: PMC11952337 DOI: 10.1101/2025.03.07.642107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 04/02/2025]
Abstract
Spatial cellular context is crucial in shaping intratumor heterogeneity. However, understanding how each tumor establishes its unique spatial landscape and what factors drive the landscape for tumor fitness remains significantly challenging. Here, we analyzed over 2 million cells from 50 tumor biospecimens using spatial single-cell imaging and single-cell RNA sequencing. We developed a deep learning-based strategy to spatially map tumor cell states and the architecture surrounding them, which we referred to as Spatial Dynamics Network (SDN). We found that different tumor cell states may be organized into distinct clusters, or 'villages', each supported by unique SDNs. Notably, tumor cell villages exhibited village-specific molecular co-dependencies between tumor cells and their microenvironment and were associated with patient outcomes. Perturbation of molecular co-dependencies via random spatial shuffling of the microenvironment resulted in destabilization of the corresponding villages. This study provides new insights into understanding tumor spatial landscape and its impact on tumor aggressiveness.
Collapse
Affiliation(s)
- Meng Liu
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, Bethesda, Maryland 20892, USA
| | - Maria O. Hernandez
- Spatial Imaging Technology Resource, Center for Cancer Research, National Cancer Institute, Bethesda, Maryland, USA
| | - Darko Castven
- Department of Medicine I, University Medical Center, Lübeck, Germany
| | - Hsin-Pei Lee
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, Bethesda, Maryland 20892, USA
| | - Wenqi Wu
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, Bethesda, Maryland 20892, USA
| | - Limin Wang
- Laboratory of Human Carcinogenesis, Center for Cancer Research, National Cancer Institute, Bethesda, MD 20892, USA
| | - Marshonna Forgues
- Laboratory of Human Carcinogenesis, Center for Cancer Research, National Cancer Institute, Bethesda, MD 20892, USA
| | - Jonathan M. Hernandez
- Surgical Oncology Program, Center for Cancer Research, National Cancer Institute, Bethesda, Maryland 20892, USA
| | - Jens U. Marquardt
- Department of Medicine I, University Medical Center, Lübeck, Germany
| | - Lichun Ma
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, Bethesda, Maryland 20892, USA
- Liver Cancer Program, Center for Cancer Research, National Cancer Institute, Bethesda, Maryland 20892, USA
| |
Collapse
|
41
|
Hackenberg M, Brunn N, Vogel T, Binder H. Infusing structural assumptions into dimensionality reduction for single-cell RNA sequencing data to identify small gene sets. Commun Biol 2025; 8:414. [PMID: 40069486 PMCID: PMC11897155 DOI: 10.1038/s42003-025-07872-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2024] [Accepted: 03/03/2025] [Indexed: 03/15/2025] Open
Abstract
Dimensionality reduction greatly facilitates the exploration of cellular heterogeneity in single-cell RNA sequencing data. While most of such approaches are data-driven, it can be useful to incorporate biologically plausible assumptions about the underlying structure or the experimental design. We propose the boosting autoencoder (BAE) approach, which combines the advantages of unsupervised deep learning for dimensionality reduction and boosting for formalizing assumptions. Specifically, our approach selects small sets of genes that explain latent dimensions. As illustrative applications, we explore the diversity of neural cell identities and temporal patterns of embryonic development.
Collapse
Grants
- Deutsche Forschungsgemeinschaft (DFG, German Research Foundation): Project-ID 322977937, GRK 2344
- Deutsche Forschungsgemeinschaft (DFG, German Research Foundation): Project-ID 322977937, GRK 2344 ; Deutsche Forschungsgemeinschaft (DFG, German Research Foundation): Project-ID 499552394, SFB 1597
- Deutsche Forschungsgemeinschaft (DFG, German Research Foundation): Project-ID 322977937, GRK 2344; Deutsche Forschungsgemeinschaft (DFG, German Research Foundation): Project-ID 499552394, SFB 1597
Collapse
Affiliation(s)
- Maren Hackenberg
- Institute of Medical Biometry and Statistics (IMBI), Faculty of Medicine and Medical Center, University of Freiburg, Freiburg, Germany.
- Freiburg Center for Data Analysis, Modeling and AI, University of Freiburg, Freiburg, Germany.
| | - Niklas Brunn
- Institute of Medical Biometry and Statistics (IMBI), Faculty of Medicine and Medical Center, University of Freiburg, Freiburg, Germany.
- Freiburg Center for Data Analysis, Modeling and AI, University of Freiburg, Freiburg, Germany.
| | - Tanja Vogel
- Institute of Anatomy and Cell Biology, Department Molecular Embryology, Faculty of Medicine, University of Freiburg, Freiburg, Germany
| | - Harald Binder
- Institute of Medical Biometry and Statistics (IMBI), Faculty of Medicine and Medical Center, University of Freiburg, Freiburg, Germany
- Freiburg Center for Data Analysis, Modeling and AI, University of Freiburg, Freiburg, Germany
- Centre for Integrative Biological Signaling Studies (CIBSS), University of Freiburg, Freiburg, Germany
| |
Collapse
|
42
|
Jing SY, Wang HQ, Lin P, Yuan J, Tang ZX, Li H. Quantifying and interpreting biologically meaningful spatial signatures within tumor microenvironments. NPJ Precis Oncol 2025; 9:68. [PMID: 40069556 PMCID: PMC11897387 DOI: 10.1038/s41698-025-00857-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2024] [Accepted: 02/25/2025] [Indexed: 03/15/2025] Open
Abstract
The tumor microenvironment (TME) plays a crucial role in orchestrating tumor cell behavior and cancer progression. Recent advances in spatial profiling technologies have uncovered novel spatial signatures, including univariate distribution patterns, bivariate spatial relationships, and higher-order structures. These signatures have the potential to revolutionize tumor mechanism and treatment. In this review, we summarize the current state of spatial signature research, highlighting computational methods to uncover spatially relevant biological significance. We discuss the impact of these advances on fundamental cancer biology and translational research, address current challenges and future research directions.
Collapse
Affiliation(s)
- Si-Yu Jing
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, People's Republic of China
| | - He-Qi Wang
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, People's Republic of China
| | - Ping Lin
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, People's Republic of China
| | - Jiao Yuan
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, People's Republic of China
| | - Zhi-Xuan Tang
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, People's Republic of China
| | - Hong Li
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, People's Republic of China.
| |
Collapse
|
43
|
Li S, Hua H, Chen S. Graph neural networks for single-cell omics data: a review of approaches and applications. Brief Bioinform 2025; 26:bbaf109. [PMID: 40091193 PMCID: PMC11911123 DOI: 10.1093/bib/bbaf109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2024] [Revised: 02/09/2025] [Accepted: 02/25/2025] [Indexed: 03/19/2025] Open
Abstract
Rapid advancement of sequencing technologies now allows for the utilization of precise signals at single-cell resolution in various omics studies. However, the massive volume, ultra-high dimensionality, and high sparsity nature of single-cell data have introduced substantial difficulties to traditional computational methods. The intricate non-Euclidean networks of intracellular and intercellular signaling molecules within single-cell datasets, coupled with the complex, multimodal structures arising from multi-omics joint analysis, pose significant challenges to conventional deep learning operations reliant on Euclidean geometries. Graph neural networks (GNNs) have extended deep learning to non-Euclidean data, allowing cells and their features in single-cell datasets to be modeled as nodes within a graph structure. GNNs have been successfully applied across a broad range of tasks in single-cell data analysis. In this survey, we systematically review 107 successful applications of GNNs and their six variants in various single-cell omics tasks. We begin by outlining the fundamental principles of GNNs and their six variants, followed by a systematic review of GNN-based models applied in single-cell epigenomics, transcriptomics, spatial transcriptomics, proteomics, and multi-omics. In each section dedicated to a specific omics type, we have summarized the publicly available single-cell datasets commonly utilized in the articles reviewed in that section, totaling 77 datasets. Finally, we summarize the potential shortcomings of current research and explore directions for future studies. We anticipate that this review will serve as a guiding resource for researchers to deepen the application of GNNs in single-cell omics.
Collapse
Affiliation(s)
- Sijie Li
- School of Mathematical Sciences and The Key Laboratory of Pure Mathematics and Combinatorics, Ministry of Education (LPMC), Nankai University, No. 94 Weijin Road, Nankai District, Tianjin 300071, China
| | - Heyang Hua
- School of Mathematical Sciences and The Key Laboratory of Pure Mathematics and Combinatorics, Ministry of Education (LPMC), Nankai University, No. 94 Weijin Road, Nankai District, Tianjin 300071, China
| | - Shengquan Chen
- School of Mathematical Sciences and The Key Laboratory of Pure Mathematics and Combinatorics, Ministry of Education (LPMC), Nankai University, No. 94 Weijin Road, Nankai District, Tianjin 300071, China
| |
Collapse
|
44
|
Pang Y, Wang C, Zhang YZ, Wang Z, Imoto S, Lee TY. STForte: tissue context-specific encoding and consistency-aware spatial imputation for spatially resolved transcriptomics. Brief Bioinform 2025; 26:bbaf174. [PMID: 40254832 PMCID: PMC12009714 DOI: 10.1093/bib/bbaf174] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2025] [Revised: 03/06/2025] [Accepted: 03/17/2025] [Indexed: 04/22/2025] Open
Abstract
Encoding spatially resolved transcriptomics (SRT) data serves to identify the biological semantics of RNA expression within the tissue while preserving spatial characteristics. Depending on the analytical scenario, one may focus on different contextual structures of tissues. For instance, anatomical regions reveal consistent patterns by focusing on spatial homogeneity, while elucidating complex tumor micro-environments requires more expression heterogeneity. However, current spatial encoding methods lack consideration of the tissue context. Meanwhile, most developed SRT technologies are still limited in providing exact patterns of intact tissues due to limitations such as low resolution or missed measurements. Here, we propose STForte, a novel pairwise graph autoencoder-based approach with cross-reconstruction and adversarial distribution matching, to model the spatial homogeneity and expression heterogeneity of SRT data. STForte extracts interpretable latent encodings, enabling downstream analysis by accurately portraying various tissue contexts. Moreover, STForte allows spatial imputation using only spatial consistency to restore the biological patterns of unobserved locations or low-quality cells, thereby providing fine-grained views to enhance the SRT analysis. Extensive evaluations of datasets under different scenarios and SRT platforms demonstrate that STForte is a scalable and versatile tool for providing enhanced insights into spatial data analysis.
Collapse
Affiliation(s)
- Yuxuan Pang
- Division of Health Medical Intelligence, Human Genome Center, The Institute of Medical Science, The University of Tokyo, 4-6-1, Shirokanedai, Minato-ku, Tokyo, 108-8639, Japan
| | - Chunxuan Wang
- School of Data Science, The Chinese University of Hong Kong, Shenzhen (CUHK-Shenzhen), 2001 Longxiang Road, Longgang, Shenzhen, 518172, China
| | - Yao-zhong Zhang
- Division of Health Medical Intelligence, Human Genome Center, The Institute of Medical Science, The University of Tokyo, 4-6-1, Shirokanedai, Minato-ku, Tokyo, 108-8639, Japan
| | - Zhuo Wang
- Warshel Institute for Computational Biology, School of Medicine, The Chinese University of Hong Kong, Shenzhen (CUHK-Shenzhen), 2001 Longxiang Road, Longgang, Shenzhen, 518172, China
- School of Medicine, The Chinese University of Hong Kong, Shenzhen (CUHK-Shenzhen), 2001 Longxiang Road, Longgang, Shenzhen, 518172, China
| | - Seiya Imoto
- Division of Health Medical Intelligence, Human Genome Center, The Institute of Medical Science, The University of Tokyo, 4-6-1, Shirokanedai, Minato-ku, Tokyo, 108-8639, Japan
| | - Tzong-Yi Lee
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, No. 75 Bo-Ai Street, Hsinchu 300, Taiwan
| |
Collapse
|
45
|
Zhu P, Shu H, Wang Y, Wang X, Zhao Y, Hu J, Peng J, Shang X, Tian Z, Chen J, Wang T. MAEST: accurately spatial domain detection in spatial transcriptomics with graph masked autoencoder. Brief Bioinform 2025; 26:bbaf086. [PMID: 40052440 PMCID: PMC11886571 DOI: 10.1093/bib/bbaf086] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2024] [Revised: 01/27/2025] [Accepted: 02/17/2025] [Indexed: 03/10/2025] Open
Abstract
Spatial transcriptomics (ST) technology provides gene expression profiles with spatial context, offering critical insights into cellular interactions and tissue architecture. A core task in ST is spatial domain identification, which involves detecting coherent regions with similar spatial expression patterns. However, existing methods often fail to fully exploit spatial information, leading to limited representational capacity and suboptimal clustering accuracy. Here, we introduce MAEST, a novel graph neural network model designed to address these limitations in ST data. MAEST leverages graph masked autoencoders to denoise and refine representations while incorporating graph contrastive learning to prevent feature collapse and enhance model robustness. By integrating one-hop and multi-hop representations, MAEST effectively captures both local and global spatial relationships, improving clustering precision. Extensive experiments across diverse datasets, including the human brain, mouse hippocampus, olfactory bulb, brain, and embryo, demonstrate that MAEST outperforms seven state-of-the-art methods in spatial domain identification. Furthermore, MAEST showcases its ability to integrate multi-slice data, identifying joint domains across horizontal tissue sections with high accuracy. These results highlight MAEST's versatility and effectiveness in unraveling the spatial organization of complex tissues. The source code of MAEST can be obtained at https://github.com/clearlove2333/MAEST.
Collapse
Affiliation(s)
- Pengfei Zhu
- School of Computer Science, Northwestern Polytechnical University, 1 Dongxiang Road, Xi’an 710072, China
- Key Laboratory of Big Data Storage and Management, Ministry of Industry and Information Technology, Northwestern Polytechnical University, 1 Dongxiang Road, Xi’an 710072, China
| | - Han Shu
- School of Computer Science, Northwestern Polytechnical University, 1 Dongxiang Road, Xi’an 710072, China
- Key Laboratory of Big Data Storage and Management, Ministry of Industry and Information Technology, Northwestern Polytechnical University, 1 Dongxiang Road, Xi’an 710072, China
| | - Yongtian Wang
- School of Computer Science, Northwestern Polytechnical University, 1 Dongxiang Road, Xi’an 710072, China
- Key Laboratory of Big Data Storage and Management, Ministry of Industry and Information Technology, Northwestern Polytechnical University, 1 Dongxiang Road, Xi’an 710072, China
| | - Xiaofeng Wang
- General Surgery Department, The Affiliated Hospital of Northwest University: Xi’an No 3 Hospital, Xi’an 710018, China
| | - Yuan Zhao
- School of Computer Science, Northwestern Polytechnical University, 1 Dongxiang Road, Xi’an 710072, China
- Key Laboratory of Big Data Storage and Management, Ministry of Industry and Information Technology, Northwestern Polytechnical University, 1 Dongxiang Road, Xi’an 710072, China
| | - Jialu Hu
- School of Computer Science, Northwestern Polytechnical University, 1 Dongxiang Road, Xi’an 710072, China
- Key Laboratory of Big Data Storage and Management, Ministry of Industry and Information Technology, Northwestern Polytechnical University, 1 Dongxiang Road, Xi’an 710072, China
| | - Jiajie Peng
- School of Computer Science, Northwestern Polytechnical University, 1 Dongxiang Road, Xi’an 710072, China
- Key Laboratory of Big Data Storage and Management, Ministry of Industry and Information Technology, Northwestern Polytechnical University, 1 Dongxiang Road, Xi’an 710072, China
| | - Xuequn Shang
- School of Computer Science, Northwestern Polytechnical University, 1 Dongxiang Road, Xi’an 710072, China
- Key Laboratory of Big Data Storage and Management, Ministry of Industry and Information Technology, Northwestern Polytechnical University, 1 Dongxiang Road, Xi’an 710072, China
| | - Zhen Tian
- School of Computer Science and Artificial Intelligence, Zhengzhou University, No. 100 Science Avenue, Zhengzhou 450001, China
| | - Jing Chen
- School of Computer Science and Engineering, Xi’an University of Technology, No. 5 South Jinhua Road, Xi’an 710048, China
| | - Tao Wang
- School of Computer Science, Northwestern Polytechnical University, 1 Dongxiang Road, Xi’an 710072, China
- Key Laboratory of Big Data Storage and Management, Ministry of Industry and Information Technology, Northwestern Polytechnical University, 1 Dongxiang Road, Xi’an 710072, China
| |
Collapse
|
46
|
Li Y, Hu Q, Han S, Wang-Sattler R, Du W. Multi-Manifolds fusing hyperbolic graph network balanced by pareto optimization for identifying spatial domains of spatial transcriptomics. Brief Bioinform 2025; 26:bbaf162. [PMID: 40220278 PMCID: PMC11992958 DOI: 10.1093/bib/bbaf162] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2024] [Revised: 02/28/2025] [Accepted: 03/19/2025] [Indexed: 04/14/2025] Open
Abstract
Identifying spatial domains for spatial transcriptomics is crucial for achieving comprehensive insights into the pathogenesis of gene expression. Increasingly, computational methods based on graph neural networks are being developed for spatial transcriptomics. However, previous methods have solely focused on the Euclidean manifold. To effectively exploit and explore the informative and deeper topological structures of inherent manifolds, we presented a Multi-Manifolds fusing hyperbolic graph network, balanced by Pareto optimization, for identifying spatial domains in Spatial Transcriptomics (MManiST). First, we developed multi-manifolds encoders for distinct manifolds using the hyperbolic neural network. Features from different manifolds were then combined using an attention mechanism, with multiple reconstruction losses balanced by Pareto optimization. Extensive experiments on commonly used benchmark datasets show that our method consistently outperforms seven state-of-the-art methods. Additionally, we investigated the validity of each component and the impact of fusion methods in ablation experiments.
Collapse
Affiliation(s)
- Ying Li
- Key Laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, Qianjin Street 2699, Changchun 130012, Jilin, China
| | - Qifeng Hu
- Key Laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, Qianjin Street 2699, Changchun 130012, Jilin, China
| | - Siyu Han
- TUM School of Medicine, Technical University of Munich, Ismaninger Straße 22, D-81675 Munich, Bavaria, Germany
| | - Rui Wang-Sattler
- Institute of Translational Genomics, Helmholtz Zentrum Munchen, Ingolstadter Landstraße 1, D-85764 Neuherberg, Bavaria, Germany
| | - Wei Du
- Key Laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, Qianjin Street 2699, Changchun 130012, Jilin, China
| |
Collapse
|
47
|
Xu X, Su J, Zhu R, Li K, Zhao X, Fan J, Mao F. From morphology to single-cell molecules: high-resolution 3D histology in biomedicine. Mol Cancer 2025; 24:63. [PMID: 40033282 PMCID: PMC11874780 DOI: 10.1186/s12943-025-02240-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2024] [Accepted: 01/18/2025] [Indexed: 03/05/2025] Open
Abstract
High-resolution three-dimensional (3D) tissue analysis has emerged as a transformative innovation in the life sciences, providing detailed insights into the spatial organization and molecular composition of biological tissues. This review begins by tracing the historical milestones that have shaped the development of high-resolution 3D histology, highlighting key breakthroughs that have facilitated the advancement of current technologies. We then systematically categorize the various families of high-resolution 3D histology techniques, discussing their core principles, capabilities, and inherent limitations. These 3D histology techniques include microscopy imaging, tomographic approaches, single-cell and spatial omics, computational methods and 3D tissue reconstruction (e.g. 3D cultures and spheroids). Additionally, we explore a wide range of applications for single-cell 3D histology, demonstrating how single-cell and spatial technologies are being utilized in the fields such as oncology, cardiology, neuroscience, immunology, developmental biology and regenerative medicine. Despite the remarkable progress made in recent years, the field still faces significant challenges, including high barriers to entry, issues with data robustness, ambiguous best practices for experimental design, and a lack of standardization across methodologies. This review offers a thorough analysis of these challenges and presents recommendations to surmount them, with the overarching goal of nurturing ongoing innovation and broader integration of cellular 3D tissue analysis in both biology research and clinical practice.
Collapse
Affiliation(s)
- Xintian Xu
- Institute of Medical Innovation and Research, Peking University Third Hospital, Beijing, China
- Cancer Center, Peking University Third Hospital, Beijing, China
- Department of Biochemistry and Molecular Biology, Beijing, Key Laboratory of Protein Posttranslational Modifications and Cell Function, School of Basic Medical Sciences, Peking University Health Science Center, Beijing, China
| | - Jimeng Su
- Institute of Medical Innovation and Research, Peking University Third Hospital, Beijing, China
- Cancer Center, Peking University Third Hospital, Beijing, China
- College of Animal Science and Technology, Yangzhou University, Yangzhou, Jiangsu, China
| | - Rongyi Zhu
- Department of Biochemistry and Molecular Biology, Beijing, Key Laboratory of Protein Posttranslational Modifications and Cell Function, School of Basic Medical Sciences, Peking University Health Science Center, Beijing, China
| | - Kailong Li
- Department of Biochemistry and Molecular Biology, Beijing, Key Laboratory of Protein Posttranslational Modifications and Cell Function, School of Basic Medical Sciences, Peking University Health Science Center, Beijing, China
| | - Xiaolu Zhao
- State Key Laboratory of Female Fertility Promotion, Center for Reproductive Medicine, Department of Obstetrics and GynecologyNational Clinical Research Center for Obstetrics and Gynecology (Peking University Third Hospital)Key Laboratory of Assisted Reproduction (Peking University), Ministry of EducationBeijing Key Laboratory of Reproductive Endocrinology and Assisted Reproductive Technology, Peking University Third Hospital, Beijing, China.
| | - Jibiao Fan
- College of Animal Science and Technology, Yangzhou University, Yangzhou, Jiangsu, China.
| | - Fengbiao Mao
- Institute of Medical Innovation and Research, Peking University Third Hospital, Beijing, China.
- Cancer Center, Peking University Third Hospital, Beijing, China.
- Beijing Key Laboratory for Interdisciplinary Research in Gastrointestinal Oncology (BLGO), Beijing, China.
| |
Collapse
|
48
|
Ospina OE, Manjarres-Betancur R, Gonzalez-Calderon G, Soupir AC, Smalley I, Tsai KY, Markowitz J, Khaled ML, Vallebuona E, Berglund AE, Eschrich SA, Yu X, Fridley BL. spatialGE Is a User-Friendly Web Application That Facilitates Spatial Transcriptomics Data Analysis. Cancer Res 2025; 85:848-858. [PMID: 39636739 PMCID: PMC11873723 DOI: 10.1158/0008-5472.can-24-2346] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2024] [Revised: 10/21/2024] [Accepted: 11/27/2024] [Indexed: 12/07/2024]
Abstract
Spatial transcriptomics (ST) is a powerful tool for understanding tissue biology and disease mechanisms. However, the advanced data analysis and programming skills required can hinder researchers from realizing the full potential of ST. To address this, we developed spatialGE, a web application that simplifies the analysis of ST data. The application spatialGE provided a user-friendly interface that guides users without programming expertise through various analysis pipelines, including quality control, normalization, domain detection, phenotyping, and multiple spatial analyses. It also enabled comparative analysis among samples and supported various ST technologies. The utility of spatialGE was demonstrated through its application in studying the tumor microenvironment of two data sets: 10× Visium samples from a cohort of melanoma metastasis and NanoString CosMx fields of vision from a cohort of Merkel cell carcinoma samples. These results support the ability of spatialGE to identify spatial gene expression patterns that provide valuable insights into the tumor microenvironment and highlight its utility in democratizing ST data analysis for the wider scientific community. Significance: The spatialGE web application enables user-friendly exploratory analysis of spatial transcriptomics data by using a point-and-click interface to guide users from data input to discovery of spatial patterns, facilitating hypothesis generation.
Collapse
Affiliation(s)
- Oscar E. Ospina
- Department of Biostatistics and Bioinformatics, Moffitt Cancer Center, Tampa, Florida
| | | | | | - Alex C. Soupir
- Department of Biostatistics and Bioinformatics, Moffitt Cancer Center, Tampa, Florida
| | - Inna Smalley
- Department of Metabolism and Physiology, Moffitt Cancer Center, Tampa, Florida
| | - Kenneth Y. Tsai
- Department of Pathology, Moffitt Cancer Center, Tampa, Florida
| | - Joseph Markowitz
- Department of Cutaneous Oncology, Moffitt Cancer Center, Tampa, Florida
| | - Mariam L. Khaled
- Department of Metabolism and Physiology, Moffitt Cancer Center, Tampa, Florida
- Department of Biochemistry, Faculty of Pharmacy, Cairo University, Cairo, Egypt
| | - Ethan Vallebuona
- Department of Metabolism and Physiology, Moffitt Cancer Center, Tampa, Florida
| | - Anders E. Berglund
- Department of Biostatistics and Bioinformatics, Moffitt Cancer Center, Tampa, Florida
| | - Steven A. Eschrich
- Department of Biostatistics and Bioinformatics, Moffitt Cancer Center, Tampa, Florida
| | - Xiaoqing Yu
- Department of Biostatistics and Bioinformatics, Moffitt Cancer Center, Tampa, Florida
| | - Brooke L. Fridley
- Division of Health Services and Outcomes Research, Children's Mercy, Kansas City, Missouri
| |
Collapse
|
49
|
Wang R, Qian Y, Guo X, Song F, Xiong Z, Cai S, Bian X, Wong MH, Cao Q, Cheng L, Lu G, Leung KS. STModule: identifying tissue modules to uncover spatial components and characteristics of transcriptomic landscapes. Genome Med 2025; 17:18. [PMID: 40033360 PMCID: PMC11874447 DOI: 10.1186/s13073-025-01441-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2024] [Accepted: 02/17/2025] [Indexed: 03/05/2025] Open
Abstract
Here we present STModule, a Bayesian method developed to identify tissue modules from spatially resolved transcriptomics that reveal spatial components and essential characteristics of tissues. STModule uncovers diverse expression signals in transcriptomic landscapes such as cancer, intraepithelial neoplasia, immune infiltration, outcome-related molecular features and various cell types, which facilitate downstream analysis and provide insights into tumor microenvironments, disease mechanisms, treatment development, and histological organization of tissues. STModule captures a broader spectrum of biological signals compared to other methods and detects novel spatial components. The tissue modules characterized by gene sets demonstrate greater robustness and transferability across different biopsies. STModule: https://github.com/rwang-z/STModule.git .
Collapse
Affiliation(s)
- Ran Wang
- CUHK-SDU Joint Laboratory on Reproductive Genetics, School of Biomedical Sciences, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong, 999077, China
- Center for Neuromusculoskeletal Restorative Medicine, Hong Kong Science Park, Shatin, New Territories, Hong Kong, 999077, China
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong, 999077, China
| | - Yan Qian
- Department of Gastrointestinal Surgery Center, the First Affiliated Hospital, Sun Yat-Sen University, Guangzhou, 519082, China
| | - Xiaojing Guo
- Health Data Science Center, Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Shenzhen, 518020, China
| | - Fangda Song
- School of Data Science, The Chinese University of Hong Kong (Shenzhen), Shenzhen, 518172, China
| | - Zhiqiang Xiong
- CUHK-SDU Joint Laboratory on Reproductive Genetics, School of Biomedical Sciences, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong, 999077, China
| | - Shirong Cai
- Department of Gastrointestinal Surgery Center, the First Affiliated Hospital, Sun Yat-Sen University, Guangzhou, 519082, China
| | - Xiuwu Bian
- Jinfeng Laboratory, Chongqing, 401329, China
| | - Man Hon Wong
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong, 999077, China
| | - Qin Cao
- School of Biomedical Sciences, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong, 999077, China.
- Shenzhen Research Institute, the Chinese University of Hong Kong, Shenzhen, 518172, China.
| | - Lixin Cheng
- Health Data Science Center, Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Shenzhen, 518020, China.
| | - Gang Lu
- CUHK-SDU Joint Laboratory on Reproductive Genetics, School of Biomedical Sciences, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong, 999077, China.
- Center for Neuromusculoskeletal Restorative Medicine, Hong Kong Science Park, Shatin, New Territories, Hong Kong, 999077, China.
- Jinfeng Laboratory, Chongqing, 401329, China.
- Shenzhen Research Institute, the Chinese University of Hong Kong, Shenzhen, 518172, China.
| | - Kwong Sak Leung
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong, 999077, China.
- Jinfeng Laboratory, Chongqing, 401329, China.
- Department of Applied Data Science, Hong Kong Shue Yan University, North Point, Hong Kong Island, Hong Kong, 999077, China.
| |
Collapse
|
50
|
Lin S, Nguyen LL, McMellen A, Leibowitz MS, Davidson N, Spinosa D, Bitler BG. Leveraging Multi-omics to Disentangle the Complexity of Ovarian Cancer. Mol Diagn Ther 2025; 29:145-151. [PMID: 39557776 DOI: 10.1007/s40291-024-00757-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/29/2024] [Indexed: 11/20/2024]
Abstract
To better understand ovarian cancer lethality and treatment resistance, sophisticated computational approaches are required that address the complexity of the tumor microenvironment, genomic heterogeneity, and tumor evolution. The ovarian cancer tumor ecosystem consists of multiple tumors and cell types that support disease growth and progression. Over the last two decades, there has been a revolution in -omic methodologies to broadly define components and essential processes within the tumor microenvironment, including transcriptomics, metabolomics, proteomics, genome sequencing, and single-cell analyses. While most of these technologies comprehensively characterize a single biological process, there is a need to understand the biological and clinical impact of integrating multiple -omics platforms. Overall, multi-omics is an intriguing analytic framework that can better approximate biological complexity; however, data aggregation and integration pipelines are not yet sufficient to reliably glean insights that affect clinical outcomes.
Collapse
Affiliation(s)
- Shijuan Lin
- Divisions of Reproductive Sciences, Department of Obstetrics and Gynecology, University of Colorado Denver, Anschutz Medical Campus, 12700 East 19th Avenue, MS 8613, Aurora, CO, 80045, USA
| | - Lily L Nguyen
- Divisions of Reproductive Sciences, Department of Obstetrics and Gynecology, University of Colorado Denver, Anschutz Medical Campus, 12700 East 19th Avenue, MS 8613, Aurora, CO, 80045, USA
| | - Alexandra McMellen
- Center for Cancer and Blood Disorders, Children's Hospital Colorado, Aurora, CO, USA
| | - Michael S Leibowitz
- Center for Cancer and Blood Disorders, Children's Hospital Colorado, Aurora, CO, USA
| | - Natalie Davidson
- Divisions of Reproductive Sciences, Department of Obstetrics and Gynecology, University of Colorado Denver, Anschutz Medical Campus, 12700 East 19th Avenue, MS 8613, Aurora, CO, 80045, USA
| | - Daniel Spinosa
- Gynecologic Oncology, Department of Obstetrics and Gynecology, University of Colorado Denver, Anschutz Medical Campus, Aurora, CO, USA
| | - Benjamin G Bitler
- Divisions of Reproductive Sciences, Department of Obstetrics and Gynecology, University of Colorado Denver, Anschutz Medical Campus, 12700 East 19th Avenue, MS 8613, Aurora, CO, 80045, USA.
| |
Collapse
|