1
|
Manchel A, Gee M, Vadigepalli R. From sampling to simulating: Single-cell multiomics in systems pathophysiological modeling. iScience 2024; 27:111322. [PMID: 39628578 PMCID: PMC11612781 DOI: 10.1016/j.isci.2024.111322] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/06/2024] Open
Abstract
As single-cell omics data sampling and acquisition methods have accumulated at an unprecedented rate, various data analysis pipelines have been developed for the inference of cell types, cell states and their distribution, state transitions, state trajectories, and state interactions. This presents a new opportunity in which single-cell omics data can be utilized to generate high-resolution, high-fidelity computational models. In this review, we discuss how single-cell omics data can be used to build computational models to simulate biological systems at various scales. We propose that single-cell data can be integrated with physiological information to generate organ-specific models, which can then be assembled to generate multi-organ systems pathophysiological models. Finally, we discuss how generic multi-organ models can be brought to the patient-specific level thus permitting their use in the clinical setting.
Collapse
Affiliation(s)
- Alexandra Manchel
- Daniel Baugh Institute of Functional Genomics/Computational Biology, Department of Pathology and Genomic Medicine, Thomas Jefferson University, Philadelphia, PA, USA
| | - Michelle Gee
- Daniel Baugh Institute of Functional Genomics/Computational Biology, Department of Pathology and Genomic Medicine, Thomas Jefferson University, Philadelphia, PA, USA
- Department of Chemical and Biomolecular Engineering, University of Delaware, Newark, DE, USA
| | - Rajanikanth Vadigepalli
- Daniel Baugh Institute of Functional Genomics/Computational Biology, Department of Pathology and Genomic Medicine, Thomas Jefferson University, Philadelphia, PA, USA
| |
Collapse
|
2
|
Ding Q, Yang W, Xue G, Liu H, Cai Y, Que J, Jin X, Luo M, Pang F, Yang Y, Lin Y, Liu Y, Sun H, Tan R, Wang P, Xu Z, Jiang Q. Dimension reduction, cell clustering, and cell-cell communication inference for single-cell transcriptomics with DcjComm. Genome Biol 2024; 25:241. [PMID: 39252099 PMCID: PMC11382422 DOI: 10.1186/s13059-024-03385-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2024] [Accepted: 08/30/2024] [Indexed: 09/11/2024] Open
Abstract
Advances in single-cell transcriptomics provide an unprecedented opportunity to explore complex biological processes. However, computational methods for analyzing single-cell transcriptomics still have room for improvement especially in dimension reduction, cell clustering, and cell-cell communication inference. Herein, we propose a versatile method, named DcjComm, for comprehensive analysis of single-cell transcriptomics. DcjComm detects functional modules to explore expression patterns and performs dimension reduction and clustering to discover cellular identities by the non-negative matrix factorization-based joint learning model. DcjComm then infers cell-cell communication by integrating ligand-receptor pairs, transcription factors, and target genes. DcjComm demonstrates superior performance compared to state-of-the-art methods.
Collapse
Affiliation(s)
- Qian Ding
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China
| | - Wenyi Yang
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China
| | - Guangfu Xue
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China
| | - Hongxin Liu
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China
| | - Yideng Cai
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China
| | - Jinhao Que
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China
| | - Xiyun Jin
- School of Interdisciplinary Medicine and Engineering, Harbin Medical University, Harbin, 150076, China
| | - Meng Luo
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China
| | - Fenglan Pang
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China
| | - Yuexin Yang
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China
| | - Yi Lin
- School of Interdisciplinary Medicine and Engineering, Harbin Medical University, Harbin, 150076, China
| | - Yusong Liu
- School of Interdisciplinary Medicine and Engineering, Harbin Medical University, Harbin, 150076, China
| | - Haoxiu Sun
- School of Interdisciplinary Medicine and Engineering, Harbin Medical University, Harbin, 150076, China
| | - Renjie Tan
- School of Interdisciplinary Medicine and Engineering, Harbin Medical University, Harbin, 150076, China
| | - Pingping Wang
- School of Interdisciplinary Medicine and Engineering, Harbin Medical University, Harbin, 150076, China.
| | - Zhaochun Xu
- School of Interdisciplinary Medicine and Engineering, Harbin Medical University, Harbin, 150076, China.
| | - Qinghua Jiang
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China.
- School of Interdisciplinary Medicine and Engineering, Harbin Medical University, Harbin, 150076, China.
- State Key Laboratory of Frigid Zone Cardiovascular Diseases (SKLFZCD), Harbin Medical University, Harbin, 150076, China.
| |
Collapse
|
3
|
Ali M, Kuijs M, Hediyeh-zadeh S, Treis T, Hrovatin K, Palla G, Schaar AC, Theis FJ. GraphCompass: spatial metrics for differential analyses of cell organization across conditions. Bioinformatics 2024; 40:i548-i557. [PMID: 38940138 PMCID: PMC11256915 DOI: 10.1093/bioinformatics/btae242] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/29/2024] Open
Abstract
SUMMARY Spatial omics technologies are increasingly leveraged to characterize how disease disrupts tissue organization and cellular niches. While multiple methods to analyze spatial variation within a sample have been published, statistical and computational approaches to compare cell spatial organization across samples or conditions are mostly lacking. We present GraphCompass, a comprehensive set of omics-adapted graph analysis methods to quantitatively evaluate and compare the spatial arrangement of cells in samples representing diverse biological conditions. GraphCompass builds upon the Squidpy spatial omics toolbox and encompasses various statistical approaches to perform cross-condition analyses at the level of individual cell types, niches, and samples. Additionally, GraphCompass provides custom visualization functions that enable effective communication of results. We demonstrate how GraphCompass can be used to address key biological questions, such as how cellular organization and tissue architecture differ across various disease states and which spatial patterns correlate with a given pathological condition. GraphCompass can be applied to various popular omics techniques, including, but not limited to, spatial proteomics (e.g. MIBI-TOF), spot-based transcriptomics (e.g. 10× Genomics Visium), and single-cell resolved transcriptomics (e.g. Stereo-seq). In this work, we showcase the capabilities of GraphCompass through its application to three different studies that may also serve as benchmark datasets for further method development. With its easy-to-use implementation, extensive documentation, and comprehensive tutorials, GraphCompass is accessible to biologists with varying levels of computational expertise. By facilitating comparative analyses of cell spatial organization, GraphCompass promises to be a valuable asset in advancing our understanding of tissue function in health and disease. .
Collapse
Affiliation(s)
- Mayar Ali
- Institute of Computational Biology, Helmholtz Munich, Neuherberg, 85764, Germany
- Institute for Tissue Engineering and Regenerative Medicine, Helmholtz Munich, Neuherberg, 85764, Germany
- Graduate School of Systemic Neurosciences, Ludwig Maximilian University of Munich, Planegg-Martinsried, 82152, Germany
| | - Merel Kuijs
- Institute of Computational Biology, Helmholtz Munich, Neuherberg, 85764, Germany
- Department of Mathematics, TUM School of Computation, Information and Technology, Technical University of Munich, Munich, 80333, Germany
| | - Soroor Hediyeh-zadeh
- Institute of Computational Biology, Helmholtz Munich, Neuherberg, 85764, Germany
- TUM School of Life Sciences, Technical University of Munich, Freising, 85354, Germany
| | - Tim Treis
- Institute of Computational Biology, Helmholtz Munich, Neuherberg, 85764, Germany
- TUM School of Life Sciences, Technical University of Munich, Freising, 85354, Germany
| | - Karin Hrovatin
- Institute of Computational Biology, Helmholtz Munich, Neuherberg, 85764, Germany
- TUM School of Life Sciences, Technical University of Munich, Freising, 85354, Germany
| | - Giovanni Palla
- Institute of Computational Biology, Helmholtz Munich, Neuherberg, 85764, Germany
- TUM School of Life Sciences, Technical University of Munich, Freising, 85354, Germany
| | - Anna C Schaar
- Institute of Computational Biology, Helmholtz Munich, Neuherberg, 85764, Germany
- Department of Mathematics, TUM School of Computation, Information and Technology, Technical University of Munich, Munich, 80333, Germany
- Munich Center for Machine Learning, Technical University of Munich, Munich, 80333, Germany
| | - Fabian J Theis
- Institute of Computational Biology, Helmholtz Munich, Neuherberg, 85764, Germany
- Department of Mathematics, TUM School of Computation, Information and Technology, Technical University of Munich, Munich, 80333, Germany
- TUM School of Life Sciences, Technical University of Munich, Freising, 85354, Germany
| |
Collapse
|
4
|
Zuo C, Xia J, Chen L. Dissecting tumor microenvironment from spatially resolved transcriptomics data by heterogeneous graph learning. Nat Commun 2024; 15:5057. [PMID: 38871687 DOI: 10.1038/s41467-024-49171-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2023] [Accepted: 05/22/2024] [Indexed: 06/15/2024] Open
Abstract
Spatially resolved transcriptomics (SRT) has enabled precise dissection of tumor-microenvironment (TME) by analyzing its intracellular molecular networks and intercellular cell-cell communication (CCC). However, lacking computational exploration of complicated relations between cells, genes, and histological regions, severely limits the ability to interpret the complex structure of TME. Here, we introduce stKeep, a heterogeneous graph (HG) learning method that integrates multimodality and gene-gene interactions, in unraveling TME from SRT data. stKeep leverages HG to learn both cell-modules and gene-modules by incorporating features of diverse nodes including genes, cells, and histological regions, allows for identifying finer cell-states within TME and cell-state-specific gene-gene relations, respectively. Furthermore, stKeep employs HG to infer CCC for each cell, while ensuring that learned CCC patterns are comparable across different cell-states through contrastive learning. In various cancer samples, stKeep outperforms other tools in dissecting TME such as detecting bi-potent basal populations, neoplastic myoepithelial cells, and metastatic cells distributed within the tumor or leading-edge regions. Notably, stKeep identifies key transcription factors, ligands, and receptors relevant to disease progression, which are further validated by the functional and survival analysis of independent clinical data, thereby highlighting its clinical prognostic and immunotherapy applications.
Collapse
Affiliation(s)
- Chunman Zuo
- Institute of Artificial Intelligence, Shanghai Engineering Research Center of Industrial Big Data and Intelligent System, Donghua University, Shanghai, 201620, China.
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, 130022, China.
| | - Junjie Xia
- Institute of Artificial Intelligence, Shanghai Engineering Research Center of Industrial Big Data and Intelligent System, Donghua University, Shanghai, 201620, China
- Department of Applied Mathematics, Donghua University, Shanghai, 201620, China
| | - Luonan Chen
- Key Laboratory of Systems Biology, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell Science, Chinese Academy of Sciences, Shanghai, 200031, China.
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Hangzhou, 310024, China.
- West China Biomedical Big Data Center, Med-X center for informatics, West China Hospital, Sichuan University, Chengdu, 610041, China.
| |
Collapse
|
5
|
Lin S, Cui Y, Zhao F, Yang Z, Song J, Yao J, Zhao Y, Qian BZ, Zhao Y, Yuan Z. Complete spatially resolved gene expression is not necessary for identifying spatial domains. CELL GENOMICS 2024; 4:100565. [PMID: 38781966 PMCID: PMC11228956 DOI: 10.1016/j.xgen.2024.100565] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Revised: 02/29/2024] [Accepted: 04/30/2024] [Indexed: 05/25/2024]
Abstract
Spatially resolved transcriptomics (SRT) technologies have revolutionized the study of tissue organization. We introduce a graph convolutional network with an attention and positive emphasis mechanism, termed BINARY, relying exclusively on binarized SRT data to accurately delineate spatial domains. BINARY outperforms existing methods across various SRT data types while using significantly less input information. Our study suggests that precise gene expression quantification may not always be essential, inspiring further exploration of the broader applications of spatially resolved binarized gene expression data.
Collapse
Affiliation(s)
- Senlin Lin
- Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
| | - Yan Cui
- Institute of Science and Technology for Brain-Inspired Intelligence, MOE Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China; Center for Medical Research and Innovation, Shanghai Pudong Hospital, Fudan University Pudong Medical Center, Fudan University, Shanghai, China
| | - Fangyuan Zhao
- Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
| | - Zhidong Yang
- Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
| | - Jiangning Song
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Clayton, VIC 3800, Australia
| | | | - Yu Zhao
- AI Lab, Tencent, Shenzhen, China
| | - Bin-Zhi Qian
- Fudan University Shanghai Cancer Center, Department of Oncology, Shanghai Medical College, The Human Phenome Institute, Zhangjiang-Fudan International Innovation Center, Fudan University, Shanghai, China
| | - Yi Zhao
- Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China.
| | - Zhiyuan Yuan
- Institute of Science and Technology for Brain-Inspired Intelligence, MOE Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China; Center for Medical Research and Innovation, Shanghai Pudong Hospital, Fudan University Pudong Medical Center, Fudan University, Shanghai, China.
| |
Collapse
|
6
|
Ma Y, Liu L, Zhao Y, Hang B, Zhang Y. HyperGCN: an effective deep representation learning framework for the integrative analysis of spatial transcriptomics data. BMC Genomics 2024; 25:566. [PMID: 38840049 PMCID: PMC11155133 DOI: 10.1186/s12864-024-10469-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2024] [Accepted: 05/29/2024] [Indexed: 06/07/2024] Open
Abstract
BACKGROUND Advances of spatial transcriptomics technologies enabled simultaneously profiling gene expression and spatial locations of cells from the same tissue. Computational tools and approaches for integration of transcriptomics data and spatial context information are urgently needed to comprehensively explore the underlying structure patterns. In this manuscript, we propose HyperGCN for the integrative analysis of gene expression and spatial information profiled from the same tissue. HyperGCN enables data visualization and clustering, and facilitates downstream analysis, including domain segmentation, the characterization of marker genes for the specific domain structure and GO enrichment analysis. RESULTS Extensive experiments are implemented on four real datasets from different tissues (including human dorsolateral prefrontal cortex, human positive breast tumors, mouse brain, mouse olfactory bulb tissue and Zabrafish melanoma) and technologies (including 10X visium, osmFISH, seqFISH+, 10X Xenium and Stereo-seq) with different spatial resolutions. The results show that HyperGCN achieves superior clustering performance and produces good domain segmentation effects while identifies biologically meaningful spatial expression patterns. This study provides a flexible framework to analyze spatial transcriptomics data with high geometric complexity. CONCLUSIONS HyperGCN is an unsupervised method based on hypergraph induced graph convolutional network, where it assumes that there existed disjoint tissues with high geometric complexity, and models the semantic relationship of cells through hypergraph, which better tackles the high-order interactions of cells and levels of noise in spatial transcriptomics data.
Collapse
Affiliation(s)
- Yuanyuan Ma
- School of Computer Engineering, Hubei University of Arts and Science, Xiangyang, China.
- Hubei Key Laboratory of Power System Design and Test for Electrical Vehicle, Hubei University of Arts and Science, Xiangyang, China.
| | - Lifang Liu
- School of Physics and Electronic Engineering, Hubei University of Arts and Science, Xiangyang, China
| | - Yongbiao Zhao
- School of Computer Engineering, Hubei University of Arts and Science, Xiangyang, China
- School of Computer, Central China Normal University, Wuhan, China
| | - Bo Hang
- School of Computer Engineering, Hubei University of Arts and Science, Xiangyang, China
| | - Yanduo Zhang
- School of Computer Engineering, Hubei University of Arts and Science, Xiangyang, China
| |
Collapse
|
7
|
Shi L, Jia W, Zhang R, Fan Z, Bian W, Mo H. High-throughput analysis of hazards in novel food based on the density functional theory and multimodal deep learning. Food Chem 2024; 442:138468. [PMID: 38266417 DOI: 10.1016/j.foodchem.2024.138468] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2023] [Revised: 12/30/2023] [Accepted: 01/15/2024] [Indexed: 01/26/2024]
Abstract
The emergence of cultured meat presents the potential for personalized food additive manufacturing, offering a solution to address future food resource scarcity. Processing raw materials and products in synthetic food products poses challenges in identifying hazards, impacting the entire industrial chain during the industry's further evolution. It is crucial to examine the correlation of biological information at different levels and to reveal the temporal dynamics jointly. Proposed active prevention method includes four aspects: (i) Investigating the molecular-level mechanism underlying the binding and dissociation of hazards with proteins represents a novel approach to mitigate matrix effect. (ii) Identifying distinct fragments is a pivotal advancement toward developing a novel screening strategy for hazards throughout the food chain. (iii) Designing an artificial intelligence model-based approach to acquire multi-dimensional histology data also holds significant potential for various applications. (iv) Integrating multimodal data is a practical approach to enhance evaluation and feedback control accuracy.
Collapse
Affiliation(s)
- Lin Shi
- School of Food and Biological Engineering, Shaanxi University of Science and Technology, Xi'an 710021, China
| | - Wei Jia
- School of Food and Biological Engineering, Shaanxi University of Science and Technology, Xi'an 710021, China; Shaanxi Testing Institute of Product Quality Supervision, Xi'an, Shaanxi 710048, China; Shaanxi Research Institute of Agricultural Products Processing Technology, Xi'an 710021, China; Shaanxi Sky Pet Biotechnology Co., Ltd, Xi'an 710075, China.
| | - Rong Zhang
- School of Food and Biological Engineering, Shaanxi University of Science and Technology, Xi'an 710021, China
| | - Zibian Fan
- School of Food and Biological Engineering, Shaanxi University of Science and Technology, Xi'an 710021, China
| | - Wenwen Bian
- Shaanxi Testing Institute of Product Quality Supervision, Xi'an, Shaanxi 710048, China
| | - Haizhen Mo
- School of Food and Biological Engineering, Shaanxi University of Science and Technology, Xi'an 710021, China; Shaanxi Research Institute of Agricultural Products Processing Technology, Xi'an 710021, China.
| |
Collapse
|
8
|
Zhang L, Liang S, Wan L. A multi-view graph contrastive learning framework for deciphering spatially resolved transcriptomics data. Brief Bioinform 2024; 25:bbae255. [PMID: 38801701 PMCID: PMC11129769 DOI: 10.1093/bib/bbae255] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Revised: 01/27/2024] [Accepted: 05/14/2024] [Indexed: 05/29/2024] Open
Abstract
Spatially resolved transcriptomics data are being used in a revolutionary way to decipher the spatial pattern of gene expression and the spatial architecture of cell types. Much work has been done to exploit the genomic spatial architectures of cells. Such work is based on the common assumption that gene expression profiles of spatially adjacent spots are more similar than those of more distant spots. However, related work might not consider the nonlocal spatial co-expression dependency, which can better characterize the tissue architectures. Therefore, we propose MuCoST, a Multi-view graph Contrastive learning framework for deciphering complex Spatially resolved Transcriptomic architectures with dual scale structural dependency. To achieve this, we employ spot dependency augmentation by fusing gene expression correlation and spatial location proximity, thereby enabling MuCoST to model both nonlocal spatial co-expression dependency and spatially adjacent dependency. We benchmark MuCoST on four datasets, and we compare it with other state-of-the-art spatial domain identification methods. We demonstrate that MuCoST achieves the highest accuracy on spatial domain identification from various datasets. In particular, MuCoST accurately deciphers subtle biological textures and elaborates the variation of spatially functional patterns.
Collapse
Affiliation(s)
- Lei Zhang
- Department of Control Science and Engineering, Tongji University, No. 4800 Cao’an Road, 201804, Shanghai, China
- Shanghai Research Institute for Intelligent Autonomous Systems, Tongji University, Lane 55, Chuanhe Road, 201210, Shanghai, China
| | - Shu Liang
- Department of Control Science and Engineering, Tongji University, No. 4800 Cao’an Road, 201804, Shanghai, China
- Shanghai Research Institute for Intelligent Autonomous Systems, Tongji University, Lane 55, Chuanhe Road, 201210, Shanghai, China
| | - Lin Wan
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, No. 55 Zhongguancun East Road, 100190, Beijing, China
- School of Mathematical Sciences, University of Chinese Academy of Sciences, 19A Yuquan Road, 100049, Beijing, China
| |
Collapse
|
9
|
Yuan Z, Zhao F, Lin S, Zhao Y, Yao J, Cui Y, Zhang XY, Zhao Y. Benchmarking spatial clustering methods with spatially resolved transcriptomics data. Nat Methods 2024; 21:712-722. [PMID: 38491270 DOI: 10.1038/s41592-024-02215-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Accepted: 02/16/2024] [Indexed: 03/18/2024]
Abstract
Spatial clustering, which shares an analogy with single-cell clustering, has expanded the scope of tissue physiology studies from cell-centroid to structure-centroid with spatially resolved transcriptomics (SRT) data. Computational methods have undergone remarkable development in recent years, but a comprehensive benchmark study is still lacking. Here we present a benchmark study of 13 computational methods on 34 SRT data (7 datasets). The performance was evaluated on the basis of accuracy, spatial continuity, marker genes detection, scalability, and robustness. We found existing methods were complementary in terms of their performance and functionality, and we provide guidance for selecting appropriate methods for given scenarios. On testing additional 22 challenging datasets, we identified challenges in identifying noncontinuous spatial domains and limitations of existing methods, highlighting their inadequacies in handling recent large-scale tasks. Furthermore, with 145 simulated data, we examined the robustness of these methods against four different factors, and assessed the impact of pre- and postprocessing approaches. Our study offers a comprehensive evaluation of existing spatial clustering methods with SRT data, paving the way for future advancements in this rapidly evolving field.
Collapse
Affiliation(s)
- Zhiyuan Yuan
- Center for Medical Research and Innovation, Shanghai Pudong Hospital, Fudan University Pudong Medical Center, Fudan University, Shanghai, China.
- Institute of Science and Technology for Brain-Inspired Intelligence; MOE Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence; MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China.
| | - Fangyuan Zhao
- Research Center for Ubiquitous Computing Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Senlin Lin
- Research Center for Ubiquitous Computing Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Yu Zhao
- Tencent AI Lab, Shenzhen, China
| | | | - Yan Cui
- Center for Medical Research and Innovation, Shanghai Pudong Hospital, Fudan University Pudong Medical Center, Fudan University, Shanghai, China
- Institute of Science and Technology for Brain-Inspired Intelligence; MOE Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence; MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Kyoto, Japan
| | - Xiao-Yong Zhang
- Center for Medical Research and Innovation, Shanghai Pudong Hospital, Fudan University Pudong Medical Center, Fudan University, Shanghai, China
| | - Yi Zhao
- Research Center for Ubiquitous Computing Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China.
- University of Chinese Academy of Sciences, Beijing, China.
| |
Collapse
|
10
|
Zhu J, Dai H, Chen L. Revealing cell-cell communication pathways with their spatially coupled gene programs. Brief Bioinform 2024; 25:bbae202. [PMID: 38706319 PMCID: PMC11070651 DOI: 10.1093/bib/bbae202] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Revised: 03/14/2024] [Accepted: 04/05/2024] [Indexed: 05/07/2024] Open
Abstract
Inference of cell-cell communication (CCC) provides valuable information in understanding the mechanisms of many important life processes. With the rise of spatial transcriptomics in recent years, many methods have emerged to predict CCCs using spatial information of cells. However, most existing methods only describe CCCs based on ligand-receptor interactions, but lack the exploration of their upstream/downstream pathways. In this paper, we proposed a new method to infer CCCs, called Intercellular Gene Association Network (IGAN). Specifically, it is for the first time that we can estimate the gene associations/network between two specific single spatially adjacent cells. By using the IGAN method, we can not only infer CCCs in an accurate manner, but also explore the upstream/downstream pathways of ligands/receptors from the network perspective, which are actually exhibited as a new panoramic cell-interaction-pathway graph, and thus provide extensive information for the regulatory mechanisms behind CCCs. In addition, IGAN can measure the CCC activity at single cell/spot resolution, and help to discover the CCC spatial heterogeneity. Interestingly, we found that CCC patterns from IGAN are highly consistent with the spatial microenvironment patterns for each cell type, which further indicated the accuracy of our method. Analyses on several public datasets validated the advantages of IGAN.
Collapse
Affiliation(s)
- Junchao Zhu
- Key Laboratory of Systems Biology, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell Science, Chinese Academy of Sciences, Cell building, No. 320 Yueyang Road, Xuhui District, Shanghai 200031, China
| | - Hao Dai
- Key Laboratory of Systems Biology, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell Science, Chinese Academy of Sciences, Cell building, No. 320 Yueyang Road, Xuhui District, Shanghai 200031, China
| | - Luonan Chen
- Key Laboratory of Systems Biology, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell Science, Chinese Academy of Sciences, Cell building, No. 320 Yueyang Road, Xuhui District, Shanghai 200031, China
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Chinese Academy of Sciences, No. 1, Xiangshan Zhinong, Xihu District, Hangzhou 310024, China
| |
Collapse
|
11
|
Lin S, Zhao F, Wu Z, Yao J, Zhao Y, Yuan Z. Streamlining spatial omics data analysis with Pysodb. Nat Protoc 2024; 19:831-895. [PMID: 38135744 DOI: 10.1038/s41596-023-00925-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Accepted: 10/02/2023] [Indexed: 12/24/2023]
Abstract
Advances in spatial omics technologies have improved the understanding of cellular organization in tissues, leading to the generation of complex and heterogeneous data and prompting the development of specialized tools for managing, loading and visualizing spatial omics data. The Spatial Omics Database (SODB) was established to offer a unified format for data storage and interactive visualization modules. Here we detail the use of Pysodb, a Python-based tool designed to enable the efficient exploration and loading of spatial datasets from SODB within a Python environment. We present seven case studies using Pysodb, detailing the interaction with various computational methods, ensuring reproducibility of experimental data and facilitating the integration of new data and alternative applications in SODB. The approach offers a reference for method developers by outlining label and metadata availability in representative spatial data that can be loaded by Pysodb. The tool is supplemented by a website ( https://protocols-pysodb.readthedocs.io/ ) with detailed information for benchmarking analysis, and allows method developers to focus on computational models by facilitating data processing. This protocol is designed for researchers with limited experience in computational biology. Depending on the dataset complexity, the protocol typically requires ~12 h to complete.
Collapse
Affiliation(s)
- Senlin Lin
- Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Fangyuan Zhao
- Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | | | | | - Yi Zhao
- Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China.
- University of Chinese Academy of Sciences, Beijing, China.
| | - Zhiyuan Yuan
- Institute of Science and Technology for Brain-Inspired Intelligence, MOE Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China.
- Center for Medical Research and Innovation, Shanghai Pudong Hospital, Fudan University Pudong Medical Center, Fudan University, Shanghai, China.
| |
Collapse
|
12
|
Yuan Z. MENDER: fast and scalable tissue structure identification in spatial omics data. Nat Commun 2024; 15:207. [PMID: 38182575 PMCID: PMC10770058 DOI: 10.1038/s41467-023-44367-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Accepted: 12/11/2023] [Indexed: 01/07/2024] Open
Abstract
Tissue structure identification is a crucial task in spatial omics data analysis, for which increasingly complex models, such as Graph Neural Networks and Bayesian networks, are employed. However, whether increased model complexity can effectively lead to improved performance is a notable question in the field. Inspired by the consistent observation of cellular neighborhood structures across various spatial technologies, we propose Multi-range cEll coNtext DEciphereR (MENDER), for tissue structure identification. Applied on datasets of 3 brain regions and a whole-brain atlas, MENDER, with biology-driven design, offers substantial improvements over modern complex models while automatically aligning labels across slices, despite using much less running time than the second-fastest. MENDER's identification power allows the uncovering of previously overlooked spatial domains that exhibit strong associations with brain aging. MENDER's scalability makes it freely appliable on a million-level brain spatial atlas. MENDER's discriminative power enables the differentiation of breast cancer patient subtypes obscured by single-cell analysis.
Collapse
Affiliation(s)
- Zhiyuan Yuan
- Institute of Science and Technology for Brain-Inspired Intelligence, MOE Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, MOE Frontiers Center for Brain Science, Center for Medical Research and Innovation, Shanghai Pudong Hospital, Fudan University Pudong Medical Center, Fudan University, Shanghai, 200433, China.
| |
Collapse
|
13
|
Varrone M, Tavernari D, Santamaria-Martínez A, Walsh LA, Ciriello G. CellCharter reveals spatial cell niches associated with tissue remodeling and cell plasticity. Nat Genet 2024; 56:74-84. [PMID: 38066188 DOI: 10.1038/s41588-023-01588-4] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2023] [Accepted: 10/23/2023] [Indexed: 12/20/2023]
Abstract
Tissues are organized in cellular niches, the composition and interactions of which can be investigated using spatial omics technologies. However, systematic analyses of tissue composition are challenged by the scale and diversity of the data. Here we present CellCharter, an algorithmic framework to identify, characterize, and compare cellular niches in spatially resolved datasets. CellCharter outperformed existing approaches and effectively identified cellular niches across datasets generated using different technologies, and comprising hundreds of samples and millions of cells. In multiple human lung cancer cohorts, CellCharter uncovered a cellular niche composed of tumor-associated neutrophil and cancer cells expressing markers of hypoxia and cell migration. This cancer cell state was spatially segregated from more proliferative tumor cell clusters and was associated with tumor-associated neutrophil infiltration and poor prognosis in independent patient cohorts. Overall, CellCharter enables systematic analyses across data types and technologies to decode the link between spatial tissue architectures and cell plasticity.
Collapse
Affiliation(s)
- Marco Varrone
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
- Swiss Cancer Center Léman, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Daniele Tavernari
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
- Swiss Cancer Center Léman, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Swiss Institute for Experimental Cancer Research, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Albert Santamaria-Martínez
- Swiss Cancer Center Léman, Lausanne, Switzerland
- Swiss Institute for Experimental Cancer Research, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Logan A Walsh
- Rosalind and Morris Goodman Cancer Institute, McGill University, Montreal, Quebec, Canada
- Department of Human Genetics, McGill University, Montreal, Quebec, Canada
| | - Giovanni Ciriello
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland.
- Swiss Cancer Center Léman, Lausanne, Switzerland.
- Swiss Institute of Bioinformatics, Lausanne, Switzerland.
| |
Collapse
|
14
|
Eliason J, Rao A. Investigating Ecological Interactions in the Tumor Microenvironment using Joint Species Distribution Models for Point Patterns. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.14.567108. [PMID: 38014073 PMCID: PMC10680696 DOI: 10.1101/2023.11.14.567108] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
The tumor microenvironment (TME) is a complex and dynamic ecosystem that involves interactions between different cell types, such as cancer cells, immune cells, and stromal cells. These interactions can promote or inhibit tumor growth and affect response to therapy. Multitype Gibbs point process (MGPP) models are statistical models used to study the spatial distribution and interaction of different types of objects, such as the distribution of cell types in a tissue sample. Such models are potentially useful for investigating the spatial relationships between different cell types in the tumor microenvironment, but so far studies of the TME using cell-resolution imaging have been largely limited to spatial descriptive statistics. However, MGPP models have many advantages over descriptive statistics, such as uncertainty quantification, incorporation of multiple covariates and the ability to make predictions. In this paper, we describe and apply a previously developed MGPP method, the saturated pairwise interaction Gibbs point process model , to a publicly available multiplexed imaging dataset obtained from colorectal cancer patients. Importantly, we show how these methods can be used as joint species distribution models (JSDMs) to precisely frame and answer many relevant questions related to the ecology of the tumor microenvironment.
Collapse
|
15
|
Yuan Z, Yao J. Harnessing computational spatial omics to explore the spatial biology intricacies. Semin Cancer Biol 2023; 95:25-41. [PMID: 37400044 DOI: 10.1016/j.semcancer.2023.06.006] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2022] [Revised: 05/09/2023] [Accepted: 06/19/2023] [Indexed: 07/05/2023]
Abstract
Spatially resolved transcriptomics (SRT) has unlocked new dimensions in our understanding of intricate tissue architectures. However, this rapidly expanding field produces a wealth of diverse and voluminous data, necessitating the evolution of sophisticated computational strategies to unravel inherent patterns. Two distinct methodologies, gene spatial pattern recognition (GSPR) and tissue spatial pattern recognition (TSPR), have emerged as vital tools in this process. GSPR methodologies are designed to identify and classify genes exhibiting noteworthy spatial patterns, while TSPR strategies aim to understand intercellular interactions and recognize tissue domains with molecular and spatial coherence. In this review, we provide a comprehensive exploration of SRT, highlighting crucial data modalities and resources that are instrumental for the development of methods and biological insights. We address the complexities and challenges posed by the use of heterogeneous data in developing GSPR and TSPR methodologies and propose an optimal workflow for both. We delve into the latest advancements in GSPR and TSPR, examining their interrelationships. Lastly, we peer into the future, envisaging the potential directions and perspectives in this dynamic field.
Collapse
Affiliation(s)
- Zhiyuan Yuan
- Center for Medical Research and Innovation, Shanghai Pudong Hospital, Fudan University Pudong Medical Center, Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China.
| | | |
Collapse
|
16
|
徐 晨, 王 寅, 魏 东, 李 文, 钱 晔, 潘 新, 雷 大. [Advances of spatial omics in the individualized diagnosis and treatment of head and neck cancer]. LIN CHUANG ER BI YAN HOU TOU JING WAI KE ZA ZHI = JOURNAL OF CLINICAL OTORHINOLARYNGOLOGY, HEAD, AND NECK SURGERY 2023; 37:729-733;739. [PMID: 37830120 PMCID: PMC10722126 DOI: 10.13201/j.issn.2096-7993.2023.09.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Subscribe] [Scholar Register] [Received: 06/20/2023] [Indexed: 10/14/2023]
Abstract
Spatialomics is another research hotspot of biotechnology after single-cell sequencing technology, which can make up for the defect that single-cell sequencing technology can not obtain cell spatial distribution information. Spatialomics mainly studies the relative position of cells in tissue samples to reveal the effect of cell spatial distribution on diseases. In recent years, spatialomics has made new progress in the pathogenesis, target exploration, drug development and many other aspects of head and neck tumors. This paper summarizes the latest progress of spatialomics in the diagnosis and treatment of head and neck cancer.
Collapse
Affiliation(s)
- 晨阳 徐
- 山东大学齐鲁医院耳鼻咽喉科,国家卫生健康委员会耳鼻喉科学重点实验室(山东大学)(济南,250012)Department of Otorhinolaryngology, Qilu Hospital of Shandong University, National Health Commission Key Laboratory of Otorhinolaryngology[Shandong University], Jinan, 250012, China
| | - 寅 王
- 山东大学齐鲁医院耳鼻咽喉科,国家卫生健康委员会耳鼻喉科学重点实验室(山东大学)(济南,250012)Department of Otorhinolaryngology, Qilu Hospital of Shandong University, National Health Commission Key Laboratory of Otorhinolaryngology[Shandong University], Jinan, 250012, China
| | - 东敏 魏
- 山东大学齐鲁医院耳鼻咽喉科,国家卫生健康委员会耳鼻喉科学重点实验室(山东大学)(济南,250012)Department of Otorhinolaryngology, Qilu Hospital of Shandong University, National Health Commission Key Laboratory of Otorhinolaryngology[Shandong University], Jinan, 250012, China
| | - 文明 李
- 山东大学齐鲁医院耳鼻咽喉科,国家卫生健康委员会耳鼻喉科学重点实验室(山东大学)(济南,250012)Department of Otorhinolaryngology, Qilu Hospital of Shandong University, National Health Commission Key Laboratory of Otorhinolaryngology[Shandong University], Jinan, 250012, China
| | - 晔 钱
- 山东大学齐鲁医院耳鼻咽喉科,国家卫生健康委员会耳鼻喉科学重点实验室(山东大学)(济南,250012)Department of Otorhinolaryngology, Qilu Hospital of Shandong University, National Health Commission Key Laboratory of Otorhinolaryngology[Shandong University], Jinan, 250012, China
| | - 新良 潘
- 山东大学齐鲁医院耳鼻咽喉科,国家卫生健康委员会耳鼻喉科学重点实验室(山东大学)(济南,250012)Department of Otorhinolaryngology, Qilu Hospital of Shandong University, National Health Commission Key Laboratory of Otorhinolaryngology[Shandong University], Jinan, 250012, China
| | - 大鹏 雷
- 山东大学齐鲁医院耳鼻咽喉科,国家卫生健康委员会耳鼻喉科学重点实验室(山东大学)(济南,250012)Department of Otorhinolaryngology, Qilu Hospital of Shandong University, National Health Commission Key Laboratory of Otorhinolaryngology[Shandong University], Jinan, 250012, China
| |
Collapse
|
17
|
Hong Y, Song K, Zhang Z, Deng Y, Zhang X, Zhao J, Jiang J, Zhang Q, Guo C, Peng C. The spatiotemporal dynamics of spatially variable genes in developing mouse brain revealed by a novel computational scheme. Cell Death Discov 2023; 9:264. [PMID: 37500639 PMCID: PMC10374563 DOI: 10.1038/s41420-023-01569-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2023] [Revised: 07/13/2023] [Accepted: 07/19/2023] [Indexed: 07/29/2023] Open
Abstract
To understand how brain regions form and work, it is important to explore the spatially variable genes (SVGs) enriched in specific brain regions during development. Spatial transcriptomics techniques provide opportunity to select SVGs in the high-throughput way. However, previous methods neglected the ranking order and combinatorial effect of SVGs, making them difficult to automatically select the high-priority SVGs from spatial transcriptomics data. Here, we proposed a novel computational pipeline, called SVGbit, to rank the individual and combinatorial SVGs for marker selection in various brain regions, which was tested in different kinds of public datasets for both human and mouse brains. We then generated the spatial transcriptomics and immunohistochemistry data from mouse brain at critical embryonic and neonatal stages. The results show that our ranking and clustering scheme captures the key SVGs which coincide with known anatomic regions in the developing mouse brain. More importantly, SVGbit can facilitate the identification of multiple gene combination sets in different brain regions. We identified three dynamical sub-regions which can be segregated by the staining of Sox2 and Calb2 in thalamus, and we also found that Nr4a2 expression gradually segregates the neocortex and hippocampus during the development. In summary, our work not only reveals the spatiotemporal dynamics of individual and combinatorial SVGs in developing mouse brain, but also provides a novel computational pipeline to facilitate the selection of marker genes from spatial transcriptomics data.
Collapse
Affiliation(s)
- Yingzhou Hong
- Center for Life Sciences, School of Life Sciences, Yunnan University, Kunming, 650500, China
| | - Kai Song
- Center for Life Sciences, School of Life Sciences, Yunnan University, Kunming, 650500, China
| | - Zongbo Zhang
- Center for Life Sciences, School of Life Sciences, Yunnan University, Kunming, 650500, China
| | - Yuxia Deng
- Center for Life Sciences, School of Life Sciences, Yunnan University, Kunming, 650500, China
| | - Xue Zhang
- Center for Life Sciences, School of Life Sciences, Yunnan University, Kunming, 650500, China
| | - Jinqian Zhao
- Center for Life Sciences, School of Life Sciences, Yunnan University, Kunming, 650500, China
| | - Jun Jiang
- Center for Life Sciences, School of Life Sciences, Yunnan University, Kunming, 650500, China
| | - Qing Zhang
- Center for Life Sciences, School of Life Sciences, Yunnan University, Kunming, 650500, China
| | - Chunming Guo
- Center for Life Sciences, School of Life Sciences, Yunnan University, Kunming, 650500, China.
| | - Cheng Peng
- Center for Life Sciences, School of Life Sciences, Yunnan University, Kunming, 650500, China.
| |
Collapse
|