1
|
Wang C, Acosta D, McNutt M, Bian J, Ma A, Fu H, Ma Q. A single-cell and spatial RNA-seq database for Alzheimer's disease (ssREAD). Nat Commun 2024; 15:4710. [PMID: 38844475 PMCID: PMC11156951 DOI: 10.1038/s41467-024-49133-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2023] [Accepted: 05/21/2024] [Indexed: 06/09/2024] Open
Abstract
Alzheimer's Disease (AD) pathology has been increasingly explored through single-cell and single-nucleus RNA-sequencing (scRNA-seq & snRNA-seq) and spatial transcriptomics (ST). However, the surge in data demands a comprehensive, user-friendly repository. Addressing this, we introduce a single-cell and spatial RNA-seq database for Alzheimer's disease (ssREAD). It offers a broader spectrum of AD-related datasets, an optimized analytical pipeline, and improved usability. The database encompasses 1,053 samples (277 integrated datasets) from 67 AD-related scRNA-seq & snRNA-seq studies, totaling 7,332,202 cells. Additionally, it archives 381 ST datasets from 18 human and mouse brain studies. Each dataset is annotated with details such as species, gender, brain region, disease/control status, age, and AD Braak stages. ssREAD also provides an analysis suite for cell clustering, identification of differentially expressed and spatially variable genes, cell-type-specific marker genes and regulons, and spot deconvolution for integrative analysis. ssREAD is freely available at https://bmblx.bmi.osumc.edu/ssread/ .
Collapse
Affiliation(s)
- Cankun Wang
- Department of Biomedical Informatics, The Ohio State University, Columbus, OH, 43210, USA
| | - Diana Acosta
- Department of Neuroscience, The Ohio State University, Columbus, OH, 43210, USA
| | - Megan McNutt
- Department of Biomedical Informatics, The Ohio State University, Columbus, OH, 43210, USA
| | - Jiang Bian
- Department of Health Outcomes & Biomedical Informatics, University of Florida, Gainesville, FL, 32606, USA
| | - Anjun Ma
- Department of Biomedical Informatics, The Ohio State University, Columbus, OH, 43210, USA
| | - Hongjun Fu
- Department of Neuroscience, The Ohio State University, Columbus, OH, 43210, USA.
- Chronic Brain Injury Program, The Ohio State University, Columbus, OH, 43210, USA.
| | - Qin Ma
- Department of Biomedical Informatics, The Ohio State University, Columbus, OH, 43210, USA.
| |
Collapse
|
2
|
Valihrach L, Zucha D, Abaffy P, Kubista M. A practical guide to spatial transcriptomics. Mol Aspects Med 2024; 97:101276. [PMID: 38776574 DOI: 10.1016/j.mam.2024.101276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Revised: 04/30/2024] [Accepted: 05/10/2024] [Indexed: 05/25/2024]
Abstract
Spatial transcriptomics is revolutionizing modern biology, offering researchers an unprecedented ability to unravel intricate gene expression patterns within tissues. From pioneering techniques to newly commercialized platforms, the field of spatial transcriptomics has evolved rapidly, ushering in a new era of understanding across various disciplines, from developmental biology to disease research. This dynamic expansion is reflected in the rapidly growing number of technologies and data analysis techniques developed and introduced. However, the expanding landscape presents a considerable challenge for researchers, especially newcomers to the field, as staying informed about these advancements becomes increasingly complex. To address this challenge, we have prepared an updated review with a particular focus on technologies that have reached commercialization and are, therefore, accessible to a broad spectrum of potential new users. In this review, we present the fundamental principles of spatial transcriptomic methods, discuss the challenges in data analysis, provide insights into experimental considerations, offer information about available resources for spatial transcriptomics, and conclude with a guide for method selection and a forward-looking perspective. Our aim is to serve as a guiding resource for both experienced users and newcomers navigating the complex realm of spatial transcriptomics in this era of rapid development. We intend to equip researchers with the necessary knowledge to make informed decisions and contribute to the cutting-edge research that spatial transcriptomics offers.
Collapse
Affiliation(s)
- Lukas Valihrach
- Laboratory of Gene Expression, Institute of Biotechnology of the Czech Academy of Sciences, Vestec, Czech Republic; Department of Cellular Neurophysiology, Institute of Experimental Medicine of the Czech Academy of Sciences, Prague, Czech Republic.
| | - Daniel Zucha
- Laboratory of Gene Expression, Institute of Biotechnology of the Czech Academy of Sciences, Vestec, Czech Republic; Department of Informatics and Chemistry, Faculty of Chemical Technology, University of Chemistry and Technology, Prague, Czech Republic
| | - Pavel Abaffy
- Laboratory of Gene Expression, Institute of Biotechnology of the Czech Academy of Sciences, Vestec, Czech Republic
| | - Mikael Kubista
- Laboratory of Gene Expression, Institute of Biotechnology of the Czech Academy of Sciences, Vestec, Czech Republic.
| |
Collapse
|
3
|
Swain AK, Pandit V, Sharma J, Yadav P. SpatialPrompt: spatially aware scalable and accurate tool for spot deconvolution and domain identification in spatial transcriptomics. Commun Biol 2024; 7:639. [PMID: 38796505 PMCID: PMC11127982 DOI: 10.1038/s42003-024-06349-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2024] [Accepted: 05/17/2024] [Indexed: 05/28/2024] Open
Abstract
Efficiently mapping of cell types in situ remains a major challenge in spatial transcriptomics. Most spot deconvolution tools ignore spatial coordinate information and perform extremely slow on large datasets. Here, we introduce SpatialPrompt, a spatially aware and scalable tool for spot deconvolution and domain identification. SpatialPrompt integrates gene expression, spatial location, and single-cell RNA sequencing (scRNA-seq) dataset as reference to accurately infer cell-type proportions of spatial spots. SpatialPrompt uses non-negative ridge regression and graph neural network to efficiently capture local microenvironment information. Our extensive benchmarking analysis on Visium, Slide-seq, and MERFISH datasets demonstrated superior performance of SpatialPrompt over 15 existing tools. On mouse hippocampus dataset, SpatialPrompt achieves spot deconvolution and domain identification within 2 minutes for 50,000 spots. Overall, domain identification using SpatialPrompt was 44 to 150 times faster than existing methods. We build a database housing 40 plus curated scRNA-seq datasets for seamless integration with SpatialPrompt for spot deconvolution.
Collapse
Affiliation(s)
- Asish Kumar Swain
- Department of Bioscience & Bioengineering, Indian Institute of Technology, Jodhpur, Rajasthan, 342030, India
| | - Vrushali Pandit
- Department of Bioscience & Bioengineering, Indian Institute of Technology, Jodhpur, Rajasthan, 342030, India
| | - Jyoti Sharma
- Department of Bioscience & Bioengineering, Indian Institute of Technology, Jodhpur, Rajasthan, 342030, India
| | - Pankaj Yadav
- Department of Bioscience & Bioengineering, Indian Institute of Technology, Jodhpur, Rajasthan, 342030, India.
- School of Artificial Intelligence and Data Science, Indian Institute of Technology, Jodhpur, Rajasthan, 342030, India.
| |
Collapse
|
4
|
Holze H, Talarmain L, Fennell KA, Lam EY, Dawson MA, Vassiliadis D. Analysis of synthetic cellular barcodes in the genome and transcriptome with BARtab and bartools. CELL REPORTS METHODS 2024; 4:100763. [PMID: 38670101 PMCID: PMC11133760 DOI: 10.1016/j.crmeth.2024.100763] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Revised: 02/25/2024] [Accepted: 04/03/2024] [Indexed: 04/28/2024]
Abstract
Cellular barcoding is a lineage-tracing methodology that couples heritable synthetic barcodes to high-throughput sequencing, enabling the accurate tracing of cell lineages across a range of biological contexts. Recent studies have extended these methods by incorporating lineage information into single-cell or spatial transcriptomics readouts. Leveraging the rich biological information within these datasets requires dedicated computational tools for dataset pre-processing and analysis. Here, we present BARtab, a portable and scalable Nextflow pipeline, and bartools, an open-source R package, designed to provide an integrated end-to-end cellular barcoding analysis toolkit. BARtab and bartools contain methods to simplify the extraction, quality control, analysis, and visualization of lineage barcodes from population-level, single-cell, and spatial transcriptomics experiments. We showcase the utility of our integrated BARtab and bartools workflow via the analysis of exemplar bulk, single-cell, and spatial transcriptomics experiments containing cellular barcoding information.
Collapse
Affiliation(s)
- Henrietta Holze
- Peter MacCallum Cancer Centre, Melbourne, VIC 3000, Australia; Sir Peter MacCallum Department of Oncology, The University of Melbourne, Melbourne, VIC 3000, Australia
| | - Laure Talarmain
- Peter MacCallum Cancer Centre, Melbourne, VIC 3000, Australia; Sir Peter MacCallum Department of Oncology, The University of Melbourne, Melbourne, VIC 3000, Australia
| | - Katie A Fennell
- Peter MacCallum Cancer Centre, Melbourne, VIC 3000, Australia; Sir Peter MacCallum Department of Oncology, The University of Melbourne, Melbourne, VIC 3000, Australia
| | - Enid Y Lam
- Peter MacCallum Cancer Centre, Melbourne, VIC 3000, Australia; Sir Peter MacCallum Department of Oncology, The University of Melbourne, Melbourne, VIC 3000, Australia
| | - Mark A Dawson
- Peter MacCallum Cancer Centre, Melbourne, VIC 3000, Australia; Sir Peter MacCallum Department of Oncology, The University of Melbourne, Melbourne, VIC 3000, Australia; The University of Melbourne Centre for Cancer Research, The University of Melbourne, Melbourne, VIC 3000, Australia.
| | - Dane Vassiliadis
- Peter MacCallum Cancer Centre, Melbourne, VIC 3000, Australia; Sir Peter MacCallum Department of Oncology, The University of Melbourne, Melbourne, VIC 3000, Australia.
| |
Collapse
|
5
|
Cao J, Li C, Cui Z, Deng S, Lei T, Liu W, Yang H, Chen P. Spatial Transcriptomics: A Powerful Tool in Disease Understanding and Drug Discovery. Theranostics 2024; 14:2946-2968. [PMID: 38773973 PMCID: PMC11103497 DOI: 10.7150/thno.95908] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Accepted: 04/25/2024] [Indexed: 05/24/2024] Open
Abstract
Recent advancements in modern science have provided robust tools for drug discovery. The rapid development of transcriptome sequencing technologies has given rise to single-cell transcriptomics and single-nucleus transcriptomics, increasing the accuracy of sequencing and accelerating the drug discovery process. With the evolution of single-cell transcriptomics, spatial transcriptomics (ST) technology has emerged as a derivative approach. Spatial transcriptomics has emerged as a hot topic in the field of omics research in recent years; it not only provides information on gene expression levels but also offers spatial information on gene expression. This technology has shown tremendous potential in research on disease understanding and drug discovery. In this article, we introduce the analytical strategies of spatial transcriptomics and review its applications in novel target discovery and drug mechanism unravelling. Moreover, we discuss the current challenges and issues in this research field that need to be addressed. In conclusion, spatial transcriptomics offers a new perspective for drug discovery.
Collapse
Affiliation(s)
- Junxian Cao
- Beijing Key Laboratory of Traditional Chinese Medicine Basic Research on Prevention and Treatment for Major Diseases, Experimental Research Center, China Academy of Chinese Medical Sciences, Beijing 100700, China
- Analysis of Complex Effects of Proprietary Chinese Medicine, Hunan Provincial Key Laboratory, Yongzhou City, Hunan Province, China
| | - Caifeng Li
- Beijing Key Laboratory of Traditional Chinese Medicine Basic Research on Prevention and Treatment for Major Diseases, Experimental Research Center, China Academy of Chinese Medical Sciences, Beijing 100700, China
| | - Zhao Cui
- Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing 100700, China
| | - Shiwen Deng
- Beijing Key Laboratory of Traditional Chinese Medicine Basic Research on Prevention and Treatment for Major Diseases, Experimental Research Center, China Academy of Chinese Medical Sciences, Beijing 100700, China
- Analysis of Complex Effects of Proprietary Chinese Medicine, Hunan Provincial Key Laboratory, Yongzhou City, Hunan Province, China
| | - Tong Lei
- Institute of Basic Theory for Chinese Medicine, China Academy of Chinese Medical Sciences, Beijing 100700, China
| | - Wei Liu
- Beijing Key Laboratory of Traditional Chinese Medicine Basic Research on Prevention and Treatment for Major Diseases, Experimental Research Center, China Academy of Chinese Medical Sciences, Beijing 100700, China
| | - Hongjun Yang
- Beijing Key Laboratory of Traditional Chinese Medicine Basic Research on Prevention and Treatment for Major Diseases, Experimental Research Center, China Academy of Chinese Medical Sciences, Beijing 100700, China
- Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing 100700, China
- Analysis of Complex Effects of Proprietary Chinese Medicine, Hunan Provincial Key Laboratory, Yongzhou City, Hunan Province, China
| | - Peng Chen
- Beijing Key Laboratory of Traditional Chinese Medicine Basic Research on Prevention and Treatment for Major Diseases, Experimental Research Center, China Academy of Chinese Medical Sciences, Beijing 100700, China
- Analysis of Complex Effects of Proprietary Chinese Medicine, Hunan Provincial Key Laboratory, Yongzhou City, Hunan Province, China
| |
Collapse
|
6
|
Yin R, Chen R, Xia K, Xu X. A single-cell transcriptome atlas reveals the trajectory of early cell fate transition during callus induction in Arabidopsis. PLANT COMMUNICATIONS 2024:100941. [PMID: 38720464 DOI: 10.1016/j.xplc.2024.100941] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/18/2023] [Revised: 04/16/2024] [Accepted: 05/06/2024] [Indexed: 06/16/2024]
Abstract
The acquisition of pluripotent callus from somatic cells plays an important role in plant development studies and crop genetic improvement. This developmental process incorporates a series of cell fate transitions and reprogramming. However, our understanding of cell heterogeneity and mechanisms of cell fate transition during callus induction remains quite limited. Here, we report a time-series single-cell transcriptome experiment on Arabidopsis root explants that were induced in callus induction medium for 0, 1, and 4 days, and the construction of a detailed single-cell transcriptional atlas of the callus induction process. We identify the cell types responsible for initiating the early callus: lateral root primordium-initiating (LRPI)-like cells and quiescent center (QC)-like cells. LRPI-like cells are derived from xylem pole pericycle cells and are similar to lateral root primordia. We delineate the developmental trajectory of the dedifferentiation of LRPI-like cells into QC-like cells. QC-like cells are undifferentiated pluripotent acquired cells that appear in the early stages of callus formation and play a critical role in later callus development and organ regeneration. We also identify the transcription factors that regulate QC-like cells and the gene expression signatures that are related to cell fate decisions. Overall, our cell-lineage transcriptome atlas for callus induction provides a distinct perspective on cell fate transitions during callus formation, significantly improving our understanding of callus formation.
Collapse
Affiliation(s)
- Ruilian Yin
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 10049, China; BGI Research, Beijing 102601, China
| | - Ruiying Chen
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 10049, China; BGI Research, Beijing 102601, China
| | - Keke Xia
- BGI Research, Beijing 102601, China.
| | - Xun Xu
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 10049, China; BGI Research, Beijing 102601, China; Guangdong Provincial Key Laboratory of Genome Read and Write, BGI-Shenzhen, Shenzhen 518120, Guangdong, China.
| |
Collapse
|
7
|
Wu B, Shentu X, Nan H, Guo P, Hao S, Xu J, Shangguan S, Cui L, Cen J, Deng Q, Wu Y, Liu C, Song Y, Lin X, Wang Z, Yuan Y, Ma W, Li R, Li Y, Qian Q, Du W, Lai T, Yang T, Liu C, Ma X, Chen A, Xu X, Lai Y, Liu L, Esteban MA, Hui L. A spatiotemporal atlas of cholestatic injury and repair in mice. Nat Genet 2024; 56:938-952. [PMID: 38627596 DOI: 10.1038/s41588-024-01687-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2023] [Accepted: 02/09/2024] [Indexed: 05/09/2024]
Abstract
Cholestatic liver injuries, characterized by regional damage around the bile ductular region, lack curative therapies and cause considerable mortality. Here we generated a high-definition spatiotemporal atlas of gene expression during cholestatic injury and repair in mice by integrating spatial enhanced resolution omics sequencing and single-cell transcriptomics. Spatiotemporal analyses revealed a key role of cholangiocyte-driven signaling correlating with the periportal damage-repair response. Cholangiocytes express genes related to recruitment and differentiation of lipid-associated macrophages, which generate feedback signals enhancing ductular reaction. Moreover, cholangiocytes express high TGFβ in association with the conversion of liver progenitor-like cells into cholangiocytes during injury and the dampened proliferation of periportal hepatocytes during recovery. Notably, Atoh8 restricts hepatocyte proliferation during 3,5-diethoxycarbonyl-1,4-dihydro-collidin damage and is quickly downregulated after injury withdrawal, allowing hepatocytes to respond to growth signals. Our findings lay a keystone for in-depth studies of cellular dynamics and molecular mechanisms of cholestatic injuries, which may further develop into therapies for cholangiopathies.
Collapse
Affiliation(s)
- Baihua Wu
- State Key Laboratory of Cell Biology, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Xinyi Shentu
- School of Life Science and Technology, ShanghaiTech University, Shanghai, China
| | - Haitao Nan
- State Key Laboratory of Cell Biology, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
| | | | - Shijie Hao
- BGI Research, Hangzhou, China
- BGI Research, Shenzhen, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Jiangshan Xu
- BGI Research, Hangzhou, China
- BGI Research, Shenzhen, China
| | - Shuncheng Shangguan
- Joint School of Life Sciences, Guangzhou Institutes of Biomedicine and Health and Guangzhou Medical University, Guangzhou, China
- Laboratory of Integrative Biology, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China
- BGI Research, Shenzhen, China
| | - Lei Cui
- State Key Laboratory of Cell Biology, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Jin Cen
- State Key Laboratory of Cell Biology, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Qiuting Deng
- BGI Research, Hangzhou, China
- BGI Research, Shenzhen, China
| | - Yan Wu
- BGI Research, Hangzhou, China
- BGI Research, Shenzhen, China
| | - Chang Liu
- BGI Research, Hangzhou, China
- BGI Research, Shenzhen, China
| | - Yumo Song
- BGI Research, Hangzhou, China
- BGI Research, Shenzhen, China
| | - Xiumei Lin
- BGI Research, Hangzhou, China
- BGI Research, Shenzhen, China
| | | | - Yue Yuan
- BGI Research, Hangzhou, China
- BGI Research, Shenzhen, China
| | - Wen Ma
- BGI Research, Hangzhou, China
- BGI Research, Shenzhen, China
| | - Ronghai Li
- BGI Research, Hangzhou, China
- BGI Research, Shenzhen, China
| | - Yikang Li
- Division of Gastroenterology and Hepatology, Key Laboratory of Gastroenterology and Hepatology, Ministry of Health, State Key Laboratory for Oncogenes and Related Genes, NHC Key Laboratory of Digestive Diseases, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai Institute of Digestive Disease, Shanghai, China
| | - Qiwei Qian
- Division of Gastroenterology and Hepatology, Key Laboratory of Gastroenterology and Hepatology, Ministry of Health, State Key Laboratory for Oncogenes and Related Genes, NHC Key Laboratory of Digestive Diseases, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai Institute of Digestive Disease, Shanghai, China
| | - Wensi Du
- China National GeneBank, BGI Research, Shenzhen, China
| | - Tingting Lai
- China National GeneBank, BGI Research, Shenzhen, China
| | - Tao Yang
- China National GeneBank, BGI Research, Shenzhen, China
| | - Chuanyu Liu
- BGI Research, Hangzhou, China
- BGI Research, Shenzhen, China
- Shanxi Medical University-BGI Collaborative Center for Future Medicine, Shanxi Medical University, Taiyuan, China
| | - Xiong Ma
- Division of Gastroenterology and Hepatology, Key Laboratory of Gastroenterology and Hepatology, Ministry of Health, State Key Laboratory for Oncogenes and Related Genes, NHC Key Laboratory of Digestive Diseases, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai Institute of Digestive Disease, Shanghai, China
| | - Ao Chen
- BGI Research, Shenzhen, China
| | - Xun Xu
- BGI Research, Shenzhen, China
- Shanxi Medical University-BGI Collaborative Center for Future Medicine, Shanxi Medical University, Taiyuan, China
| | - Yiwei Lai
- BGI Research, Hangzhou, China.
- BGI Research, Shenzhen, China.
- Shanxi Medical University-BGI Collaborative Center for Future Medicine, Shanxi Medical University, Taiyuan, China.
| | - Longqi Liu
- BGI Research, Hangzhou, China.
- BGI Research, Shenzhen, China.
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China.
- China National GeneBank, BGI Research, Shenzhen, China.
- Shanxi Medical University-BGI Collaborative Center for Future Medicine, Shanxi Medical University, Taiyuan, China.
| | - Miguel A Esteban
- BGI Research, Hangzhou, China.
- BGI Research, Shenzhen, China.
- Laboratory of Integrative Biology, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China.
- The Fifth Affiliated Hospital of Guangzhou Medical University-BGI Research Center for Integrative Biology, The Fifth Affiliated Hospital of Guangzhou Medical University, Guangzhou, China.
| | - Lijian Hui
- State Key Laboratory of Cell Biology, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China.
- School of Life Science and Technology, ShanghaiTech University, Shanghai, China.
| |
Collapse
|
8
|
Yuan Z, Zhao F, Lin S, Zhao Y, Yao J, Cui Y, Zhang XY, Zhao Y. Benchmarking spatial clustering methods with spatially resolved transcriptomics data. Nat Methods 2024; 21:712-722. [PMID: 38491270 DOI: 10.1038/s41592-024-02215-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Accepted: 02/16/2024] [Indexed: 03/18/2024]
Abstract
Spatial clustering, which shares an analogy with single-cell clustering, has expanded the scope of tissue physiology studies from cell-centroid to structure-centroid with spatially resolved transcriptomics (SRT) data. Computational methods have undergone remarkable development in recent years, but a comprehensive benchmark study is still lacking. Here we present a benchmark study of 13 computational methods on 34 SRT data (7 datasets). The performance was evaluated on the basis of accuracy, spatial continuity, marker genes detection, scalability, and robustness. We found existing methods were complementary in terms of their performance and functionality, and we provide guidance for selecting appropriate methods for given scenarios. On testing additional 22 challenging datasets, we identified challenges in identifying noncontinuous spatial domains and limitations of existing methods, highlighting their inadequacies in handling recent large-scale tasks. Furthermore, with 145 simulated data, we examined the robustness of these methods against four different factors, and assessed the impact of pre- and postprocessing approaches. Our study offers a comprehensive evaluation of existing spatial clustering methods with SRT data, paving the way for future advancements in this rapidly evolving field.
Collapse
Affiliation(s)
- Zhiyuan Yuan
- Center for Medical Research and Innovation, Shanghai Pudong Hospital, Fudan University Pudong Medical Center, Fudan University, Shanghai, China.
- Institute of Science and Technology for Brain-Inspired Intelligence; MOE Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence; MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China.
| | - Fangyuan Zhao
- Research Center for Ubiquitous Computing Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Senlin Lin
- Research Center for Ubiquitous Computing Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Yu Zhao
- Tencent AI Lab, Shenzhen, China
| | | | - Yan Cui
- Center for Medical Research and Innovation, Shanghai Pudong Hospital, Fudan University Pudong Medical Center, Fudan University, Shanghai, China
- Institute of Science and Technology for Brain-Inspired Intelligence; MOE Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence; MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Kyoto, Japan
| | - Xiao-Yong Zhang
- Center for Medical Research and Innovation, Shanghai Pudong Hospital, Fudan University Pudong Medical Center, Fudan University, Shanghai, China
| | - Yi Zhao
- Research Center for Ubiquitous Computing Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China.
- University of Chinese Academy of Sciences, Beijing, China.
| |
Collapse
|
9
|
Zhou W, Yang T, Zeng L, Chen J, Wang Y, Guo X, You L, Liu Y, Du W, Yang F, Hua C, Cai J, van Hintum T, Liu H, Gu Y, Wei X, Wei T. LettuceDB: an integrated multi-omics database for cultivated lettuce. Database (Oxford) 2024; 2024:baae018. [PMID: 38557635 PMCID: PMC10984620 DOI: 10.1093/database/baae018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2023] [Revised: 02/01/2024] [Accepted: 02/29/2024] [Indexed: 04/04/2024]
Abstract
Crop genomics has advanced rapidly during the past decade, which generated a great abundance of omics data from multi-omics studies. How to utilize the accumulating data becomes a critical and urgent demand in crop science. As an attempt to integrate multi-omics data, we developed a database, LettuceDB (https://db.cngb.org/lettuce/), aiming to assemble multidimensional data for cultivated and wild lettuce germplasm. The database includes genome, variome, phenome, microbiome and spatial transcriptome. By integrating user-friendly bioinformatics tools, LettuceDB will serve as a one-stop platform for lettuce research and breeding in the future. Database URL: https://db.cngb.org/lettuce/.
Collapse
Affiliation(s)
- Wenhui Zhou
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
- BGI Research, Wuhan 430074, China
- State Key Laboratory of Agricultural Genomics, Key Laboratory of Genomics, Ministry of Agriculture, BGI Research, Shenzhen 518083, China
| | - Tao Yang
- China National GeneBank, BGI Research, Shenzhen 518120, China
- Guangdong Genomics Data Center, BGl research, Shenzhen 518120, China
| | - Liucui Zeng
- BGI Research, Wuhan 430074, China
- South China Agricultural University, Guangzhou 510642, China
| | - Jing Chen
- China National GeneBank, BGI Research, Shenzhen 518120, China
| | - Yayu Wang
- State Key Laboratory of Agricultural Genomics, Key Laboratory of Genomics, Ministry of Agriculture, BGI Research, Shenzhen 518083, China
| | - Xing Guo
- BGI Research, Wuhan 430074, China
- State Key Laboratory of Agricultural Genomics, Key Laboratory of Genomics, Ministry of Agriculture, BGI Research, Shenzhen 518083, China
| | - Lijin You
- China National GeneBank, BGI Research, Shenzhen 518120, China
| | - Yiqun Liu
- China National GeneBank, BGI Research, Shenzhen 518120, China
| | - Wensi Du
- China National GeneBank, BGI Research, Shenzhen 518120, China
| | - Fan Yang
- China National GeneBank, BGI Research, Shenzhen 518120, China
| | - Cong Hua
- BGI Research, Wuhan 430074, China
- China National GeneBank, BGI Research, Shenzhen 518120, China
| | - Jia Cai
- BGI Research, Wuhan 430074, China
- China National GeneBank, BGI Research, Shenzhen 518120, China
| | - Theo van Hintum
- Centre for Genetic Resources, the Netherlands, P.O. Box 16, Wageningen 6700 AA, The Netherlands
| | - Huan Liu
- State Key Laboratory of Agricultural Genomics, Key Laboratory of Genomics, Ministry of Agriculture, BGI Research, Shenzhen 518083, China
| | - Ying Gu
- BGI Research, Wuhan 430074, China
- State Key Laboratory of Agricultural Genomics, Key Laboratory of Genomics, Ministry of Agriculture, BGI Research, Shenzhen 518083, China
| | - Xiaofeng Wei
- China National GeneBank, BGI Research, Shenzhen 518120, China
- Guangdong Genomics Data Center, BGl research, Shenzhen 518120, China
| | - Tong Wei
- BGI Research, Wuhan 430074, China
- State Key Laboratory of Agricultural Genomics, Key Laboratory of Genomics, Ministry of Agriculture, BGI Research, Shenzhen 518083, China
| |
Collapse
|
10
|
Marconato L, Palla G, Yamauchi KA, Virshup I, Heidari E, Treis T, Vierdag WM, Toth M, Stockhaus S, Shrestha RB, Rombaut B, Pollaris L, Lehner L, Vöhringer H, Kats I, Saeys Y, Saka SK, Huber W, Gerstung M, Moore J, Theis FJ, Stegle O. SpatialData: an open and universal data framework for spatial omics. Nat Methods 2024:10.1038/s41592-024-02212-x. [PMID: 38509327 DOI: 10.1038/s41592-024-02212-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Accepted: 02/14/2024] [Indexed: 03/22/2024]
Abstract
Spatially resolved omics technologies are transforming our understanding of biological tissues. However, the handling of uni- and multimodal spatial omics datasets remains a challenge owing to large data volumes, heterogeneity of data types and the lack of flexible, spatially aware data structures. Here we introduce SpatialData, a framework that establishes a unified and extensible multiplatform file-format, lazy representation of larger-than-memory data, transformations and alignment to common coordinate systems. SpatialData facilitates spatial annotations and cross-modal aggregation and analysis, the utility of which is illustrated in the context of multiple vignettes, including integrative analysis on a multimodal Xenium and Visium breast cancer study.
Collapse
Affiliation(s)
- Luca Marconato
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
- Division of Computational Genomics and System Genetics, German Cancer Research Center, Heidelberg, Germany
- Collaboration for joint PhD degree between EMBL and Heidelberg University, Faculty of Biosciences, Heidelberg, Germany
| | - Giovanni Palla
- Institute of Computational Biology, Helmholtz, Center Munich, Munich, Germany
- TUM School of Life Sciences Weihenstephan, Technical University of Munich, Munich, Germany
| | - Kevin A Yamauchi
- Department of Biosystems, Science and Engineering, ETH Zürich, Basel, Switzerland
- Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Isaac Virshup
- Institute of Computational Biology, Helmholtz, Center Munich, Munich, Germany
| | - Elyas Heidari
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
- Division of Computational Genomics and System Genetics, German Cancer Research Center, Heidelberg, Germany
- Division of Artificial Intelligence in Oncology, German Cancer Research Center, Heidelberg, Germany
| | - Tim Treis
- Division of Computational Genomics and System Genetics, German Cancer Research Center, Heidelberg, Germany
- Institute of Computational Biology, Helmholtz, Center Munich, Munich, Germany
| | | | - Marcella Toth
- Institute of Computational Biology, Helmholtz, Center Munich, Munich, Germany
| | - Sonja Stockhaus
- Institute of Computational Biology, Helmholtz, Center Munich, Munich, Germany
- TUM School of Computation, Information and Technology, Technical University of Munich, Munich, Germany
| | - Rahul B Shrestha
- Institute of Computational Biology, Helmholtz, Center Munich, Munich, Germany
| | - Benjamin Rombaut
- Data Mining and Modeling for Biomedicine, VIB Center for Inflammation Research, Ghent, Belgium
- Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium
- VIB Center for AI and Computational Biology, Ghent, Belgium
| | - Lotte Pollaris
- Data Mining and Modeling for Biomedicine, VIB Center for Inflammation Research, Ghent, Belgium
- Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium
- VIB Center for AI and Computational Biology, Ghent, Belgium
| | - Laurens Lehner
- Institute of Computational Biology, Helmholtz, Center Munich, Munich, Germany
- TUM School of Computation, Information and Technology, Technical University of Munich, Munich, Germany
| | - Harald Vöhringer
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
- Molecular Medicine Partnership Unit, Heidelberg, Germany
- Department of Medicine V, Hematology, Oncology, and Rheumatology, University of Heidelberg, Heidelberg, Germany
| | - Ilia Kats
- Division of Computational Genomics and System Genetics, German Cancer Research Center, Heidelberg, Germany
| | - Yvan Saeys
- Data Mining and Modeling for Biomedicine, VIB Center for Inflammation Research, Ghent, Belgium
- Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium
- VIB Center for AI and Computational Biology, Ghent, Belgium
| | - Sinem K Saka
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
| | - Wolfgang Huber
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
| | - Moritz Gerstung
- Division of Artificial Intelligence in Oncology, German Cancer Research Center, Heidelberg, Germany
| | - Josh Moore
- German BioImaging - Gesellschaft für Mikroskopie und Bildanalyse e.V, Konstanz, Germany.
- Open Microscopy Environment Consortium, Munich, Germany.
| | - Fabian J Theis
- Institute of Computational Biology, Helmholtz, Center Munich, Munich, Germany.
- TUM School of Life Sciences Weihenstephan, Technical University of Munich, Munich, Germany.
- Department of Mathematics, Technical University of Munich, Munich, Germany.
- Cellular Genetics Programme, Wellcome Sanger Institute, Cambridge, UK.
| | - Oliver Stegle
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany.
- Division of Computational Genomics and System Genetics, German Cancer Research Center, Heidelberg, Germany.
- Cellular Genetics Programme, Wellcome Sanger Institute, Cambridge, UK.
| |
Collapse
|
11
|
Lin S, Zhao F, Wu Z, Yao J, Zhao Y, Yuan Z. Streamlining spatial omics data analysis with Pysodb. Nat Protoc 2024; 19:831-895. [PMID: 38135744 DOI: 10.1038/s41596-023-00925-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Accepted: 10/02/2023] [Indexed: 12/24/2023]
Abstract
Advances in spatial omics technologies have improved the understanding of cellular organization in tissues, leading to the generation of complex and heterogeneous data and prompting the development of specialized tools for managing, loading and visualizing spatial omics data. The Spatial Omics Database (SODB) was established to offer a unified format for data storage and interactive visualization modules. Here we detail the use of Pysodb, a Python-based tool designed to enable the efficient exploration and loading of spatial datasets from SODB within a Python environment. We present seven case studies using Pysodb, detailing the interaction with various computational methods, ensuring reproducibility of experimental data and facilitating the integration of new data and alternative applications in SODB. The approach offers a reference for method developers by outlining label and metadata availability in representative spatial data that can be loaded by Pysodb. The tool is supplemented by a website ( https://protocols-pysodb.readthedocs.io/ ) with detailed information for benchmarking analysis, and allows method developers to focus on computational models by facilitating data processing. This protocol is designed for researchers with limited experience in computational biology. Depending on the dataset complexity, the protocol typically requires ~12 h to complete.
Collapse
Affiliation(s)
- Senlin Lin
- Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Fangyuan Zhao
- Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | | | | | - Yi Zhao
- Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China.
- University of Chinese Academy of Sciences, Beijing, China.
| | - Zhiyuan Yuan
- Institute of Science and Technology for Brain-Inspired Intelligence, MOE Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China.
- Center for Medical Research and Innovation, Shanghai Pudong Hospital, Fudan University Pudong Medical Center, Fudan University, Shanghai, China.
| |
Collapse
|
12
|
Rigden DJ, Fernández XM. The 2024 Nucleic Acids Research database issue and the online molecular biology database collection. Nucleic Acids Res 2024; 52:D1-D9. [PMID: 38035367 PMCID: PMC10767945 DOI: 10.1093/nar/gkad1173] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Accepted: 11/23/2023] [Indexed: 12/02/2023] Open
Abstract
The 2024 Nucleic Acids Research database issue contains 180 papers from across biology and neighbouring disciplines. There are 90 papers reporting on new databases and 83 updates from resources previously published in the Issue. Updates from databases most recently published elsewhere account for a further seven. Nucleic acid databases include the new NAKB for structural information and updates from Genbank, ENA, GEO, Tarbase and JASPAR. The Issue's Breakthrough Article concerns NMPFamsDB for novel prokaryotic protein families and the AlphaFold Protein Structure Database has an important update. Metabolism is covered by updates from Reactome, Wikipathways and Metabolights. Microbes are covered by RefSeq, UNITE, SPIRE and P10K; viruses by ViralZone and PhageScope. Medically-oriented databases include the familiar COSMIC, Drugbank and TTD. Genomics-related resources include Ensembl, UCSC Genome Browser and Monarch. New arrivals cover plant imaging (OPIA and PlantPAD) and crop plants (SoyMD, TCOD and CropGS-Hub). The entire Database Issue is freely available online on the Nucleic Acids Research website (https://academic.oup.com/nar). Over the last year the NAR online Molecular Biology Database Collection has been updated, reviewing 1060 entries, adding 97 new resources and eliminating 388 discontinued URLs bringing the current total to 1959 databases. It is available at http://www.oxfordjournals.org/nar/database/c/.
Collapse
Affiliation(s)
- Daniel J Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Crown Street, Liverpool L69 7ZB, UK
| | | |
Collapse
|
13
|
Lv T, Zhang Y, Li M, Kang Q, Fang S, Zhang Y, Brix S, Xu X. EAGS: efficient and adaptive Gaussian smoothing applied to high-resolved spatial transcriptomics. Gigascience 2024; 13:giad097. [PMID: 38373746 PMCID: PMC10939424 DOI: 10.1093/gigascience/giad097] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Revised: 09/12/2023] [Accepted: 10/13/2023] [Indexed: 02/21/2024] Open
Abstract
BACKGROUND The emergence of high-resolved spatial transcriptomics (ST) has facilitated the research of novel methods to investigate biological development, organism growth, and other complex biological processes. However, high-resolved and whole transcriptomics ST datasets require customized imputation methods to improve the signal-to-noise ratio and the data quality. FINDINGS We propose an efficient and adaptive Gaussian smoothing (EAGS) imputation method for high-resolved ST. The adaptive 2-factor smoothing of EAGS creates patterns based on the spatial and expression information of the cells, creates adaptive weights for the smoothing of cells in the same pattern, and then utilizes the weights to restore the gene expression profiles. We assessed the performance and efficiency of EAGS using simulated and high-resolved ST datasets of mouse brain and olfactory bulb. CONCLUSIONS Compared with other competitive methods, EAGS shows higher clustering accuracy, better biological interpretations, and significantly reduced computational consumption.
Collapse
Affiliation(s)
- Tongxuan Lv
- BGI Research, Shenzhen 518083, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | | | - Mei Li
- BGI Research, Shenzhen 518083, China
- Department of Biotechnology and Biomedicine, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark
| | | | - Shuangsang Fang
- BGI Research, Shenzhen 518083, China
- BGI Research, Beijing 102601, China
| | | | | | - Xun Xu
- BGI Research, Shenzhen 518083, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
14
|
Chen JG, Chávez-Fuentes JC, O'Brien M, Xu J, Ruiz E, Wang W, Amin I, Sarfraz I, Guckhool P, Sistig A, Yuan GC, Dries R. Giotto Suite: a multi-scale and technology-agnostic spatial multi-omics analysis ecosystem. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.26.568752. [PMID: 38077085 PMCID: PMC10705291 DOI: 10.1101/2023.11.26.568752] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/24/2023]
Abstract
Emerging spatial omics technologies continue to advance the molecular mapping of tissue architecture and the investigation of gene regulation and cellular crosstalk, which in turn provide new mechanistic insights into a wide range of biological processes and diseases. Such technologies provide an increasingly large amount of information content at multiple spatial scales. However, representing and harmonizing diverse spatial datasets efficiently, including combining multiple modalities or spatial scales in a scalable and flexible manner, remains a substantial challenge. Here, we present Giotto Suite, a suite of open-source software packages that underlies a fully modular and integrated spatial data analysis toolbox. At its core, Giotto Suite is centered around an innovative and technology-agnostic data framework embedded in the R software environment, which allows the representation and integration of virtually any type of spatial omics data at any spatial resolution. In addition, Giotto Suite provides both scalable and extensible end-to-end solutions for data analysis, integration, and visualization. Giotto Suite integrates molecular, morphology, spatial, and annotated feature information to create a responsive and flexible workflow for multi-scale, multi-omic data analyses, as demonstrated here by applications to several state-of-the-art spatial technologies. Furthermore, Giotto Suite builds upon interoperable interfaces and data structures that bridge the established fields of genomics and spatial data science, thereby enabling independent developers to create custom-engineered pipelines. As such, Giotto Suite creates an immersive ecosystem for spatial multi-omic data analysis.
Collapse
Affiliation(s)
- Jiaji George Chen
- Section of Hematology and Medical Oncology, Boston University School of Medicine and Boston Medical Center, Boston, MA 02118, USA
| | | | - Matthew O'Brien
- Section of Hematology and Medical Oncology, Boston University School of Medicine and Boston Medical Center, Boston, MA 02118, USA
| | - Junxiang Xu
- Section of Hematology and Medical Oncology, Boston University School of Medicine and Boston Medical Center, Boston, MA 02118, USA
| | - Edward Ruiz
- Section of Hematology and Medical Oncology, Boston University School of Medicine and Boston Medical Center, Boston, MA 02118, USA
| | - Wen Wang
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York, USA
| | - Iqra Amin
- Section of Hematology and Medical Oncology, Boston University School of Medicine and Boston Medical Center, Boston, MA 02118, USA
| | - Irzam Sarfraz
- Section of Hematology and Medical Oncology, Boston University School of Medicine and Boston Medical Center, Boston, MA 02118, USA
| | - Pratishtha Guckhool
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York, USA
| | - Adriana Sistig
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York, USA
| | - Guo-Cheng Yuan
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York, USA
| | - Ruben Dries
- Section of Hematology and Medical Oncology, Boston University School of Medicine and Boston Medical Center, Boston, MA 02118, USA
- Division of Computational Biomedicine, Boston University School of Medicine, Boston, MA 02118, USA
| |
Collapse
|