51
|
Park SY, Ter-Saakyan S, Faraci G, Lee HY. Immune cell identifier and classifier (ImmunIC) for single cell transcriptomic readouts. Sci Rep 2023; 13:12093. [PMID: 37495649 PMCID: PMC10372073 DOI: 10.1038/s41598-023-39282-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Accepted: 07/22/2023] [Indexed: 07/28/2023] Open
Abstract
Single cell RNA sequencing has a central role in immune profiling, identifying specific immune cells as disease markers and suggesting therapeutic target genes of immune cells. Immune cell-type annotation from single cell transcriptomics is in high demand for dissecting complex immune signatures from multicellular blood and organ samples. However, accurate cell type assignment from single-cell RNA sequencing data alone is complicated by a high level of gene expression heterogeneity. Many computational methods have been developed to respond to this challenge, but immune cell annotation accuracy is not highly desirable. We present ImmunIC, a simple and robust tool for immune cell identification and classification by combining marker genes with a machine learning method. With over two million immune cells and half-million non-immune cells from 66 single cell RNA sequencing studies, ImmunIC shows 98% accuracy in the identification of immune cells. ImmunIC outperforms existing immune cell classifiers, categorizing into ten immune cell types with 92% accuracy. We determine peripheral blood mononuclear cell compositions of severe COVID-19 cases and healthy controls using previously published single cell transcriptomic data, permitting the identification of immune cell-type specific differential pathways. Our publicly available tool can maximize the utility of single cell RNA profiling by functioning as a stand-alone bioinformatic cell sorter, advancing cell-type specific immune profiling for the discovery of disease-specific immune signatures and therapeutic targets.
Collapse
Affiliation(s)
- Sung Yong Park
- Department of Molecular Microbiology and Immunology, Keck School of Medicine, University of Southern California, Los Angeles, USA
| | - Sonia Ter-Saakyan
- Department of Molecular Microbiology and Immunology, Keck School of Medicine, University of Southern California, Los Angeles, USA
| | - Gina Faraci
- Department of Molecular Microbiology and Immunology, Keck School of Medicine, University of Southern California, Los Angeles, USA
| | - Ha Youn Lee
- Department of Molecular Microbiology and Immunology, Keck School of Medicine, University of Southern California, Los Angeles, USA.
| |
Collapse
|
52
|
Ceglia N, Sethna Z, Freeman SS, Uhlitz F, Bojilova V, Rusk N, Burman B, Chow A, Salehi S, Kabeer F, Aparicio S, Greenbaum BD, Shah SP, McPherson A. Identification of transcriptional programs using dense vector representations defined by mutual information with GeneVector. Nat Commun 2023; 14:4400. [PMID: 37474509 PMCID: PMC10359421 DOI: 10.1038/s41467-023-39985-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2022] [Accepted: 07/04/2023] [Indexed: 07/22/2023] Open
Abstract
Deciphering individual cell phenotypes from cell-specific transcriptional processes requires high dimensional single cell RNA sequencing. However, current dimensionality reduction methods aggregate sparse gene information across cells, without directly measuring the relationships that exist between genes. By performing dimensionality reduction with respect to gene co-expression, low-dimensional features can model these gene-specific relationships and leverage shared signal to overcome sparsity. We describe GeneVector, a scalable framework for dimensionality reduction implemented as a vector space model using mutual information between gene expression. Unlike other methods, including principal component analysis and variational autoencoders, GeneVector uses latent space arithmetic in a lower dimensional gene embedding to identify transcriptional programs and classify cell types. In this work, we show in four single cell RNA-seq datasets that GeneVector was able to capture phenotype-specific pathways, perform batch effect correction, interactively annotate cell types, and identify pathway variation with treatment over time.
Collapse
Affiliation(s)
- Nicholas Ceglia
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA.
| | - Zachary Sethna
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
- Immuno-Oncology Service, Human Oncology and Pathogenesis Program, Memorial Sloan Kettering Cancer Center, New York, NY, USA
- Hepatopancreatobiliary Service, Department of Surgery, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Samuel S Freeman
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Florian Uhlitz
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Viktoria Bojilova
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Nicole Rusk
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Bharat Burman
- Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Andrew Chow
- Department of Medicine, Thoracic Oncology Service, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Sohrab Salehi
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Farhia Kabeer
- Department of Molecular Oncology, British Columbia Cancer Research Centre, Vancouver, British Columbia, Canada
- Department of Pathology and Laboratory Medicine, University of British Columbia, Vancouver, British Columbia, Canada
| | - Samuel Aparicio
- Department of Molecular Oncology, British Columbia Cancer Research Centre, Vancouver, British Columbia, Canada
- Department of Pathology and Laboratory Medicine, University of British Columbia, Vancouver, British Columbia, Canada
| | - Benjamin D Greenbaum
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
- Physiology, Biophysics & Systems Biology, Weill Cornell Medicine, Weill Cornell Medical College, New York, NY, USA
| | - Sohrab P Shah
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Andrew McPherson
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| |
Collapse
|
53
|
Lee E, Chern K, Nissen M, Wang X, Huang C, Gandhi AK, Bouchard-Côté A, Weng AP, Roth A. SpatialSort: a Bayesian model for clustering and cell population annotation of spatial proteomics data. Bioinformatics 2023; 39:i131-i139. [PMID: 37387130 PMCID: PMC10311307 DOI: 10.1093/bioinformatics/btad242] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/01/2023] Open
Abstract
MOTIVATION Recent advances in spatial proteomics technologies have enabled the profiling of dozens of proteins in thousands of single cells in situ. This has created the opportunity to move beyond quantifying the composition of cell types in tissue, and instead probe the spatial relationships between cells. However, most current methods for clustering data from these assays only consider the expression values of cells and ignore the spatial context. Furthermore, existing approaches do not account for prior information about the expected cell populations in a sample. RESULTS To address these shortcomings, we developed SpatialSort, a spatially aware Bayesian clustering approach that allows for the incorporation of prior biological knowledge. Our method is able to account for the affinities of cells of different types to neighbour in space, and by incorporating prior information about expected cell populations, it is able to simultaneously improve clustering accuracy and perform automated annotation of clusters. Using synthetic and real data, we show that by using spatial and prior information SpatialSort improves clustering accuracy. We also demonstrate how SpatialSort can perform label transfer between spatial and nonspatial modalities through the analysis of a real world diffuse large B-cell lymphoma dataset. AVAILABILITY AND IMPLEMENTATION Source code is available on Github at: https://github.com/Roth-Lab/SpatialSort.
Collapse
Affiliation(s)
- Eric Lee
- Department of Molecular Oncology, BC Cancer Agency, 675 West 10th Avenue, Vancouver, BC V5Z1L3, Canada
- Graduate Bioinformatics Training Program, University of British Columbia, 100-570 West 7th Avenue, Vancouver, BC V5T4S6, Canada
| | - Kevin Chern
- Department of Statistics, University of British Columbia, 2207 Main Mall, Vancouver, BC V6T1Z4, Canada
| | - Michael Nissen
- Terry Fox Laboratory, British Columbia Cancer Research Centre, 675 West 10th Avenue, Vancouver, BC V5Z1L3, Canada
| | - Xuehai Wang
- Terry Fox Laboratory, British Columbia Cancer Research Centre, 675 West 10th Avenue, Vancouver, BC V5Z1L3, Canada
| | - IMAXT Consortium
- CRUK IMAXT Grand Challenge Consortium, Li Ka Shing Centre, Robinson Way, Cambridge CB20RE, United Kingdom
| | - Chris Huang
- Translational Medicine Hematology, Bristol Myers Squibb, 86 Morris Ave, Summit, NJ 07901, United States
| | - Anita K Gandhi
- Translational Medicine Hematology, Bristol Myers Squibb, 86 Morris Ave, Summit, NJ 07901, United States
| | - Alexandre Bouchard-Côté
- Department of Statistics, University of British Columbia, 2207 Main Mall, Vancouver, BC V6T1Z4, Canada
| | - Andrew P Weng
- Terry Fox Laboratory, British Columbia Cancer Research Centre, 675 West 10th Avenue, Vancouver, BC V5Z1L3, Canada
- Department of Pathology and Laboratory Medicine, University of British Columbia, 2211 Wesbrook Mall, Vancouver, BC V6T1Z7, Canada
| | - Andrew Roth
- Department of Molecular Oncology, BC Cancer Agency, 675 West 10th Avenue, Vancouver, BC V5Z1L3, Canada
- Department of Pathology and Laboratory Medicine, University of British Columbia, 2211 Wesbrook Mall, Vancouver, BC V6T1Z7, Canada
- Department of Computer Science, University of British Columbia, 2366 Main Mall, Vancouver, BC V6T1Z4, Canada
| |
Collapse
|
54
|
Zha W, Li C, Wu Y, Chen J, Li S, Sun M, Wu B, Shi S, Liu K, Xu H, Li P, Liu K, Yang G, Chen Z, Xu D, Zhou L, You A. Single-Cell RNA sequencing of leaf sheath cells reveals the mechanism of rice resistance to brown planthopper ( Nilaparvata lugens). FRONTIERS IN PLANT SCIENCE 2023; 14:1200014. [PMID: 37404541 PMCID: PMC10316026 DOI: 10.3389/fpls.2023.1200014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Accepted: 04/26/2023] [Indexed: 07/06/2023]
Abstract
The brown planthopper (BPH) (Nilaparvata lugens) sucks rice sap causing leaves to turn yellow and wither, often leading to reduced or zero yields. Rice co-evolved to resist damage by BPH. However, the molecular mechanisms, including the cells and tissues, involved in the resistance are still rarely reported. Single-cell sequencing technology allows us to analyze different cell types involved in BPH resistance. Here, using single-cell sequencing technology, we compared the response offered by the leaf sheaths of the susceptible (TN1) and resistant (YHY15) rice varieties to BPH (48 hours after infestation). We found that the 14,699 and 16,237 cells (identified via transcriptomics) in TN1 and YHY15 could be annotated using cell-specific marker genes into nine cell-type clusters. The two rice varieties showed significant differences in cell types (such as mestome sheath cells, guard cells, mesophyll cells, xylem cells, bulliform cells, and phloem cells) in the rice resistance mechanism to BPH. Further analysis revealed that although mesophyll, xylem, and phloem cells are involved in the BPH resistance response, the molecular mechanism used by each cell type is different. Mesophyll cell may regulate the expression of genes related to vanillin, capsaicin, and ROS production, phloem cell may regulate the cell wall extension related genes, and xylem cell may be involved in BPH resistance response by controlling the expression of chitin and pectin related genes. Thus, rice resistance to BPH is a complicated process involving multiple insect resistance factors. The results presented here will significantly promote the investigation of the molecular mechanisms underlying the resistance of rice to insects and accelerate the breeding of insect-resistant rice varieties.
Collapse
Affiliation(s)
- Wenjun Zha
- Key Laboratory of Crop Molecular Breeding, Ministry of Agriculture and Rural Affairs, Hubei Key Laboratory of Food Crop Germplasm and Genetic Improvement, Food Crops Institute, Hubei Academy of Agricultural Sciences, Wuhan, China
- Hubei Hongshan Laboratory, Wuhan, China
| | - Changyan Li
- Key Laboratory of Crop Molecular Breeding, Ministry of Agriculture and Rural Affairs, Hubei Key Laboratory of Food Crop Germplasm and Genetic Improvement, Food Crops Institute, Hubei Academy of Agricultural Sciences, Wuhan, China
| | - Yan Wu
- Key Laboratory of Crop Molecular Breeding, Ministry of Agriculture and Rural Affairs, Hubei Key Laboratory of Food Crop Germplasm and Genetic Improvement, Food Crops Institute, Hubei Academy of Agricultural Sciences, Wuhan, China
| | - Junxiao Chen
- Key Laboratory of Crop Molecular Breeding, Ministry of Agriculture and Rural Affairs, Hubei Key Laboratory of Food Crop Germplasm and Genetic Improvement, Food Crops Institute, Hubei Academy of Agricultural Sciences, Wuhan, China
| | - Sanhe Li
- Key Laboratory of Crop Molecular Breeding, Ministry of Agriculture and Rural Affairs, Hubei Key Laboratory of Food Crop Germplasm and Genetic Improvement, Food Crops Institute, Hubei Academy of Agricultural Sciences, Wuhan, China
| | - Minshan Sun
- Henan Assist Research Biotechnology Co., Ltd., Zhengzhou, China
| | - Bian Wu
- Key Laboratory of Crop Molecular Breeding, Ministry of Agriculture and Rural Affairs, Hubei Key Laboratory of Food Crop Germplasm and Genetic Improvement, Food Crops Institute, Hubei Academy of Agricultural Sciences, Wuhan, China
| | - Shaojie Shi
- Key Laboratory of Crop Molecular Breeding, Ministry of Agriculture and Rural Affairs, Hubei Key Laboratory of Food Crop Germplasm and Genetic Improvement, Food Crops Institute, Hubei Academy of Agricultural Sciences, Wuhan, China
| | - Kai Liu
- Key Laboratory of Crop Molecular Breeding, Ministry of Agriculture and Rural Affairs, Hubei Key Laboratory of Food Crop Germplasm and Genetic Improvement, Food Crops Institute, Hubei Academy of Agricultural Sciences, Wuhan, China
| | - Huashan Xu
- Key Laboratory of Crop Molecular Breeding, Ministry of Agriculture and Rural Affairs, Hubei Key Laboratory of Food Crop Germplasm and Genetic Improvement, Food Crops Institute, Hubei Academy of Agricultural Sciences, Wuhan, China
| | - Peide Li
- Key Laboratory of Crop Molecular Breeding, Ministry of Agriculture and Rural Affairs, Hubei Key Laboratory of Food Crop Germplasm and Genetic Improvement, Food Crops Institute, Hubei Academy of Agricultural Sciences, Wuhan, China
| | - Kai Liu
- Key Laboratory of Crop Molecular Breeding, Ministry of Agriculture and Rural Affairs, Hubei Key Laboratory of Food Crop Germplasm and Genetic Improvement, Food Crops Institute, Hubei Academy of Agricultural Sciences, Wuhan, China
| | - Guocai Yang
- Key Laboratory of Crop Molecular Breeding, Ministry of Agriculture and Rural Affairs, Hubei Key Laboratory of Food Crop Germplasm and Genetic Improvement, Food Crops Institute, Hubei Academy of Agricultural Sciences, Wuhan, China
| | - Zhijun Chen
- Key Laboratory of Crop Molecular Breeding, Ministry of Agriculture and Rural Affairs, Hubei Key Laboratory of Food Crop Germplasm and Genetic Improvement, Food Crops Institute, Hubei Academy of Agricultural Sciences, Wuhan, China
| | - Deze Xu
- Key Laboratory of Crop Molecular Breeding, Ministry of Agriculture and Rural Affairs, Hubei Key Laboratory of Food Crop Germplasm and Genetic Improvement, Food Crops Institute, Hubei Academy of Agricultural Sciences, Wuhan, China
- Hubei Hongshan Laboratory, Wuhan, China
| | - Lei Zhou
- Key Laboratory of Crop Molecular Breeding, Ministry of Agriculture and Rural Affairs, Hubei Key Laboratory of Food Crop Germplasm and Genetic Improvement, Food Crops Institute, Hubei Academy of Agricultural Sciences, Wuhan, China
- Hubei Hongshan Laboratory, Wuhan, China
| | - Aiqing You
- Key Laboratory of Crop Molecular Breeding, Ministry of Agriculture and Rural Affairs, Hubei Key Laboratory of Food Crop Germplasm and Genetic Improvement, Food Crops Institute, Hubei Academy of Agricultural Sciences, Wuhan, China
- Hubei Hongshan Laboratory, Wuhan, China
| |
Collapse
|
55
|
Liu N, Jiang C, Yao X, Fang M, Qiao X, Zhu L, Yang Z, Gao X, Ji Y, Niu C, Cheng C, Qu K, Lin J. Single-cell landscape of primary central nervous system diffuse large B-cell lymphoma. Cell Discov 2023; 9:55. [PMID: 37308475 DOI: 10.1038/s41421-023-00559-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Accepted: 04/29/2023] [Indexed: 06/14/2023] Open
Abstract
Understanding tumor heterogeneity and immune infiltrates within the tumor-immune microenvironment (TIME) is essential for the innovation of immunotherapies. Here, combining single-cell transcriptomics and chromatin accessibility sequencing, we profile the intratumor heterogeneity of malignant cells and immune properties of the TIME in primary central nervous system diffuse large B-cell lymphoma (PCNS DLBCL) patients. We demonstrate diverse malignant programs related to tumor-promoting pathways, cell cycle and B-cell immune response. By integrating data from independent systemic DLBCL and follicular lymphoma cohorts, we reveal a prosurvival program with aberrantly elevated RNA splicing activity that is uniquely associated with PCNS DLBCL. Moreover, a plasmablast-like program that recurs across PCNS/activated B-cell DLBCL predicts a worse prognosis. In addition, clonally expanded CD8 T cells in PCNS DLBCL undergo a transition from a pre-exhaustion-like state to exhaustion, and exhibit higher exhaustion signature scores than systemic DLBCL. Thus, our study sheds light on potential reasons for the poor prognosis of PCNS DLBCL patients, which will facilitate the development of targeted therapy.
Collapse
Affiliation(s)
- Nianping Liu
- Department of Neurosurgery, The First Affiliated Hospital of USTC, School of Basic Medical Sciences, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui, China
| | - Chen Jiang
- Department of Neurosurgery, The First Affiliated Hospital of USTC, School of Basic Medical Sciences, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui, China
- Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, Hefei, Anhui, China
| | - Xinfeng Yao
- Department of Neurosurgery, The First Affiliated Hospital of USTC, School of Basic Medical Sciences, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui, China
| | - Minghao Fang
- Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui, China
| | - Xiaolong Qiao
- Anhui University of Science and Technology, Huainan, Anhui, China
| | - Lin Zhu
- Department of Neurosurgery, The First Affiliated Hospital of USTC, School of Basic Medical Sciences, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui, China
| | - Zongcheng Yang
- Department of Stomatology, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui, China
| | - Xuyuan Gao
- Department of Neurosurgery, The First Affiliated Hospital of USTC, School of Basic Medical Sciences, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui, China
| | - Ying Ji
- Department of Neurosurgery, The First Affiliated Hospital of USTC, School of Basic Medical Sciences, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui, China
| | - Chaoshi Niu
- Department of Neurosurgery, The First Affiliated Hospital of USTC, School of Basic Medical Sciences, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui, China
| | - Chuandong Cheng
- Department of Neurosurgery, The First Affiliated Hospital of USTC, School of Basic Medical Sciences, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui, China.
| | - Kun Qu
- Department of Neurosurgery, The First Affiliated Hospital of USTC, School of Basic Medical Sciences, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui, China.
- Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, Hefei, Anhui, China.
- CAS Center for Excellence in Molecular Cell Sciences, The CAS Key Laboratory of Innate Immunity and Chronic Disease, University of Science and Technology of China, Hefei, Anhui, China.
| | - Jun Lin
- Department of Neurosurgery, The First Affiliated Hospital of USTC, School of Basic Medical Sciences, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui, China.
- Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, Hefei, Anhui, China.
- CAS Center for Excellence in Molecular Cell Sciences, The CAS Key Laboratory of Innate Immunity and Chronic Disease, University of Science and Technology of China, Hefei, Anhui, China.
| |
Collapse
|
56
|
Sant P, Rippe K, Mallm JP. Approaches for single-cell RNA sequencing across tissues and cell types. Transcription 2023; 14:127-145. [PMID: 37062951 PMCID: PMC10807473 DOI: 10.1080/21541264.2023.2200721] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Accepted: 03/30/2023] [Indexed: 04/18/2023] Open
Abstract
Single-cell sequencing of RNA (scRNA-seq) has advanced our understanding of cellular heterogeneity and signaling in developmental biology and disease. A large number of complementary assays have been developed to profile transcriptomes of individual cells, also in combination with other readouts, such as chromatin accessibility or antibody-based analysis of protein surface markers. As scRNA-seq technologies are advancing fast, it is challenging to establish robust workflows and up-to-date protocols that are best suited to address the large range of research questions. Here, we review scRNA-seq techniques from mRNA end-counting to total RNA in relation to their specific features and outline the necessary sample preparation steps and quality control measures. Based on our experience in dealing with the continuously growing portfolio from the perspective of a central single-cell facility, we aim to provide guidance on how workflows can be best automatized and share our experience in coping with the continuous expansion of scRNA-seq techniques.
Collapse
Affiliation(s)
- Pooja Sant
- Single-cell Open Lab, German Cancer Research Center (DKFZ) and Bioquant, Heidelberg, Germany
| | - Karsten Rippe
- Division Chromatin Networks, German Cancer Research Center (DKFZ) and Bioquant, Heidelberg, Germany
| | - Jan-Philipp Mallm
- Single-cell Open Lab, German Cancer Research Center (DKFZ) and Bioquant, Heidelberg, Germany
| |
Collapse
|
57
|
Nikolic A, Maule F, Bobyn A, Ellestad K, Paik S, Marhon SA, Mehdipour P, Lun X, Chen HM, Mallard C, Hay AJ, Johnston MJ, Gafuik CJ, Zemp FJ, Shen Y, Ninkovic N, Osz K, Labit E, Berger ND, Brownsey DK, Kelly JJ, Biernaskie J, Dirks PB, Derksen DJ, Jones SJM, Senger DL, Chan JA, Mahoney DJ, De Carvalho DD, Gallo M. macroH2A2 antagonizes epigenetic programs of stemness in glioblastoma. Nat Commun 2023; 14:3062. [PMID: 37244935 PMCID: PMC10224928 DOI: 10.1038/s41467-023-38919-2] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2021] [Accepted: 05/22/2023] [Indexed: 05/29/2023] Open
Abstract
Self-renewal is a crucial property of glioblastoma cells that is enabled by the choreographed functions of chromatin regulators and transcription factors. Identifying targetable epigenetic mechanisms of self-renewal could therefore represent an important step toward developing effective treatments for this universally lethal cancer. Here we uncover an epigenetic axis of self-renewal mediated by the histone variant macroH2A2. With omics and functional assays deploying patient-derived in vitro and in vivo models, we show that macroH2A2 shapes chromatin accessibility at enhancer elements to antagonize transcriptional programs of self-renewal. macroH2A2 also sensitizes cells to small molecule-mediated cell death via activation of a viral mimicry response. Consistent with these results, our analyses of clinical cohorts indicate that high transcriptional levels of this histone variant are associated with better prognosis of high-grade glioma patients. Our results reveal a targetable epigenetic mechanism of self-renewal controlled by macroH2A2 and suggest additional treatment approaches for glioblastoma patients.
Collapse
Affiliation(s)
- Ana Nikolic
- Arnie Charbonneau Cancer Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Alberta Children's Hospital Research Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Department of Biochemistry and Molecular Biology, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| | - Francesca Maule
- Arnie Charbonneau Cancer Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Alberta Children's Hospital Research Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| | - Anna Bobyn
- Arnie Charbonneau Cancer Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Alberta Children's Hospital Research Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Department of Biological Sciences, Faculty of Science, University of Calgary, Calgary, AB, Canada
| | - Katrina Ellestad
- Arnie Charbonneau Cancer Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Alberta Children's Hospital Research Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| | - Seungil Paik
- Arnie Charbonneau Cancer Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Alberta Children's Hospital Research Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| | | | - Parinaz Mehdipour
- Princess Margaret Cancer Centre, Toronto, ON, Canada
- Ludwig Institute for Cancer Research, University of Oxford, Oxford, UK
| | - Xueqing Lun
- Arnie Charbonneau Cancer Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Alberta Children's Hospital Research Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| | - Huey-Miin Chen
- Arnie Charbonneau Cancer Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Alberta Children's Hospital Research Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| | - Claire Mallard
- Arnie Charbonneau Cancer Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Alberta Children's Hospital Research Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| | - Alexander J Hay
- Arnie Charbonneau Cancer Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Alberta Children's Hospital Research Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| | - Michael J Johnston
- Arnie Charbonneau Cancer Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Alberta Children's Hospital Research Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| | - Christopher J Gafuik
- Arnie Charbonneau Cancer Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Alberta Children's Hospital Research Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| | - Franz J Zemp
- Arnie Charbonneau Cancer Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Alberta Children's Hospital Research Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| | - Yaoqing Shen
- Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, BC, Canada
- Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada
| | - Nicoletta Ninkovic
- Arnie Charbonneau Cancer Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Alberta Children's Hospital Research Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| | - Katalin Osz
- Arnie Charbonneau Cancer Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Alberta Children's Hospital Research Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Department of Oncology, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| | - Elodie Labit
- Alberta Children's Hospital Research Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Department of Compararive Biology and Experimental Medicine, Faculty of Veterinary Medicine, and Hotchkiss Brain Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| | - N Daniel Berger
- Arnie Charbonneau Cancer Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Alberta Children's Hospital Research Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Department of Biochemistry and Molecular Biology, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| | - Duncan K Brownsey
- Arnie Charbonneau Cancer Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Alberta Children's Hospital Research Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Department of Chemistry, Faculty of Science, University of Calgary, Calgary, AB, Canada
| | - John J Kelly
- Arnie Charbonneau Cancer Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Department of Clinical Neurosciences, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| | - Jeff Biernaskie
- Alberta Children's Hospital Research Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Department of Compararive Biology and Experimental Medicine, Faculty of Veterinary Medicine, and Hotchkiss Brain Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| | - Peter B Dirks
- Program in Developmental and Stem Cell Biology, Hospital for Sick Children, Toronto, ON, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| | - Darren J Derksen
- Arnie Charbonneau Cancer Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Alberta Children's Hospital Research Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Department of Chemistry, Faculty of Science, University of Calgary, Calgary, AB, Canada
| | - Steven J M Jones
- Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, BC, Canada
- Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada
| | - Donna L Senger
- Arnie Charbonneau Cancer Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Alberta Children's Hospital Research Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Department of Oncology, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| | - Jennifer A Chan
- Arnie Charbonneau Cancer Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Alberta Children's Hospital Research Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Department of Oncology, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| | - Douglas J Mahoney
- Arnie Charbonneau Cancer Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Alberta Children's Hospital Research Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Department of Microbiology, Immunology and Infectious Diseases, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| | - Daniel D De Carvalho
- Princess Margaret Cancer Centre, Toronto, ON, Canada
- Department of Medical Biophysics, Faculty of Science, University of Toronto, Toronto, ON, Canada
| | - Marco Gallo
- Arnie Charbonneau Cancer Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada.
- Alberta Children's Hospital Research Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada.
- Department of Biochemistry and Molecular Biology, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada.
| |
Collapse
|
58
|
Liu H, Li H, Sharma A, Huang W, Pan D, Gu Y, Lin L, Sun X, Liu H. scAnno: a deconvolution strategy-based automatic cell type annotation tool for single-cell RNA-sequencing data sets. Brief Bioinform 2023; 24:bbad179. [PMID: 37183449 DOI: 10.1093/bib/bbad179] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Revised: 03/29/2023] [Accepted: 04/19/2023] [Indexed: 05/16/2023] Open
Abstract
Undoubtedly, single-cell RNA sequencing (scRNA-seq) has changed the research landscape by providing insights into heterogeneous, complex and rare cell populations. Given that more such data sets will become available in the near future, their accurate assessment with compatible and robust models for cell type annotation is a prerequisite. Considering this, herein, we developed scAnno (scRNA-seq data annotation), an automated annotation tool for scRNA-seq data sets primarily based on the single-cell cluster levels, using a joint deconvolution strategy and logistic regression. We explicitly constructed a reference profile for human (30 cell types and 50 human tissues) and a reference profile for mouse (26 cell types and 50 mouse tissues) to support this novel methodology (scAnno). scAnno offers a possibility to obtain genes with high expression and specificity in a given cell type as cell type-specific genes (marker genes) by combining co-expression genes with seed genes as a core. Of importance, scAnno can accurately identify cell type-specific genes based on cell type reference expression profiles without any prior information. Particularly, in the peripheral blood mononuclear cell data set, the marker genes identified by scAnno showed cell type-specific expression, and the majority of marker genes matched exactly with those included in the CellMarker database. Besides validating the flexibility and interpretability of scAnno in identifying marker genes, we also proved its superiority in cell type annotation over other cell type annotation tools (SingleR, scPred, CHETAH and scmap-cluster) through internal validation of data sets (average annotation accuracy: 99.05%) and cross-platform data sets (average annotation accuracy: 95.56%). Taken together, we established the first novel methodology that utilizes a deconvolution strategy for automated cell typing and is capable of being a significant application in broader scRNA-seq analysis. scAnno is available at https://github.com/liuhong-jia/scAnno.
Collapse
Affiliation(s)
- Hongjia Liu
- State Key Laboratory of Digital Medical Engineering, School of Biological Science & Medical Engineering, Southeast University, Nanjing, 210096, China
| | - Huamei Li
- Department of General Surgery, Nanjing Drum Tower Hospital, the Affiliated Hospital of Nanjing University Medical School, Nanjing, 210008, PR China
| | - Amit Sharma
- Department of Neurosurgery, University Hospital Bonn, Bonn, Germany
| | | | - Duo Pan
- State Key Laboratory of Digital Medical Engineering, School of Biological Science & Medical Engineering, Southeast University, Nanjing, 210096, China
| | - Yu Gu
- State Key Laboratory of Digital Medical Engineering, School of Biological Science & Medical Engineering, Southeast University, Nanjing, 210096, China
| | - Lu Lin
- State Key Laboratory of Digital Medical Engineering, School of Biological Science & Medical Engineering, Southeast University, Nanjing, 210096, China
| | - Xiao Sun
- State Key Laboratory of Digital Medical Engineering, School of Biological Science & Medical Engineering, Southeast University, Nanjing, 210096, China
| | - Hongde Liu
- State Key Laboratory of Digital Medical Engineering, School of Biological Science & Medical Engineering, Southeast University, Nanjing, 210096, China
| |
Collapse
|
59
|
Liu Y, Wei G, Li C, Shen LC, Gasser RB, Song J, Chen D, Yu DJ. TripletCell: a deep metric learning framework for accurate annotation of cell types at the single-cell level. Brief Bioinform 2023; 24:bbad132. [PMID: 37080771 PMCID: PMC10199768 DOI: 10.1093/bib/bbad132] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2022] [Revised: 02/02/2023] [Accepted: 03/14/2023] [Indexed: 04/22/2023] Open
Abstract
Single-cell RNA sequencing (scRNA-seq) has significantly accelerated the experimental characterization of distinct cell lineages and types in complex tissues and organisms. Cell-type annotation is of great importance in most of the scRNA-seq analysis pipelines. However, manual cell-type annotation heavily relies on the quality of scRNA-seq data and marker genes, and therefore can be laborious and time-consuming. Furthermore, the heterogeneity of scRNA-seq datasets poses another challenge for accurate cell-type annotation, such as the batch effect induced by different scRNA-seq protocols and samples. To overcome these limitations, here we propose a novel pipeline, termed TripletCell, for cross-species, cross-protocol and cross-sample cell-type annotation. We developed a cell embedding and dimension-reduction module for the feature extraction (FE) in TripletCell, namely TripletCell-FE, to leverage the deep metric learning-based algorithm for the relationships between the reference gene expression matrix and the query cells. Our experimental studies on 21 datasets (covering nine scRNA-seq protocols, two species and three tissues) demonstrate that TripletCell outperformed state-of-the-art approaches for cell-type annotation. More importantly, regardless of protocols or species, TripletCell can deliver outstanding and robust performance in annotating different types of cells. TripletCell is freely available at https://github.com/liuyan3056/TripletCell. We believe that TripletCell is a reliable computational tool for accurately annotating various cell types using scRNA-seq data and will be instrumental in assisting the generation of novel biological hypotheses in cell biology.
Collapse
Affiliation(s)
- Yan Liu
- School of Computer Science and Engineering, Nanjing University of Science and Technology, 200 Xiaolingwei, Nanjing 210094, China
| | - Guo Wei
- School of Life Sciences, Nanjing University, Nanjing 210023, China
| | - Chen Li
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, Victoria 3800, Australia
| | - Long-Chen Shen
- School of Computer Science and Engineering, Nanjing University of Science and Technology, 200 Xiaolingwei, Nanjing 210094, China
| | - Robin B Gasser
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia
| | - Jiangning Song
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, Victoria 3800, Australia
- Monash Data Futures Institute, Monash University, Melbourne, Victoria 3800, Australia
| | - Dijun Chen
- School of Life Sciences, Nanjing University, Nanjing 210023, China
| | - Dong-Jun Yu
- School of Computer Science and Engineering, Nanjing University of Science and Technology, 200 Xiaolingwei, Nanjing 210094, China
| |
Collapse
|
60
|
Geras A, Darvish Shafighi S, Domżał K, Filipiuk I, Rączkowska A, Szymczak P, Toosi H, Kaczmarek L, Koperski Ł, Lagergren J, Nowis D, Szczurek E. Celloscope: a probabilistic model for marker-gene-driven cell type deconvolution in spatial transcriptomics data. Genome Biol 2023; 24:120. [PMID: 37198601 PMCID: PMC10190053 DOI: 10.1186/s13059-023-02951-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Accepted: 04/21/2023] [Indexed: 05/19/2023] Open
Abstract
Spatial transcriptomics maps gene expression across tissues, posing the challenge of determining the spatial arrangement of different cell types. However, spatial transcriptomics spots contain multiple cells. Therefore, the observed signal comes from mixtures of cells of different types. Here, we propose an innovative probabilistic model, Celloscope, that utilizes established prior knowledge on marker genes for cell type deconvolution from spatial transcriptomics data. Celloscope outperforms other methods on simulated data, successfully indicates known brain structures and spatially distinguishes between inhibitory and excitatory neuron types based in mouse brain tissue, and dissects large heterogeneity of immune infiltrate composition in prostate gland tissue.
Collapse
Affiliation(s)
- Agnieszka Geras
- Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland
- Faculty of Mathematics, Informatics, and Mechanics, University of Warsaw, Warsaw, Poland
| | - Shadi Darvish Shafighi
- Faculty of Mathematics, Informatics, and Mechanics, University of Warsaw, Warsaw, Poland
- Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative - UMR, Paris, France
| | - Kacper Domżał
- Faculty of Mathematics, Informatics, and Mechanics, University of Warsaw, Warsaw, Poland
| | - Igor Filipiuk
- Faculty of Mathematics, Informatics, and Mechanics, University of Warsaw, Warsaw, Poland
| | - Alicja Rączkowska
- Faculty of Mathematics, Informatics, and Mechanics, University of Warsaw, Warsaw, Poland
| | - Paulina Szymczak
- Faculty of Mathematics, Informatics, and Mechanics, University of Warsaw, Warsaw, Poland
| | - Hosein Toosi
- KTH Royal Institute of Technology, Stockholm, Sweden
| | - Leszek Kaczmarek
- BRAINCITY, Nencki Institute of Experimental Biology of the Polish Academy of Sciences, Warsaw, Poland
| | - Łukasz Koperski
- Department of Pathology, Medical University of Warsaw, Warsaw, Poland
| | | | - Dominika Nowis
- Laboratory of Experimental Medicine, Medical University of Warsaw, Warsaw, Poland
| | - Ewa Szczurek
- Faculty of Mathematics, Informatics, and Mechanics, University of Warsaw, Warsaw, Poland.
| |
Collapse
|
61
|
Teefy BB, Lemus AJ, Adler A, Xu A, Bhala R, Hsu K, Benayoun BA. Widespread sex-dimorphism across single-cell transcriptomes of adult African turquoise killifish tissues. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.05.539616. [PMID: 37214847 PMCID: PMC10197525 DOI: 10.1101/2023.05.05.539616] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
The African turquoise killifish (Nothobranchius furzeri), the shortest-lived vertebrate that can be bred in captivity, is an emerging model organism to study vertebrate aging. Here we describe the first multi-tissue, single-cell gene expression atlas of female and male turquoise killifish tissues comprising immune and metabolic cells from the blood, kidney, liver, and spleen. We were able to annotate 22 distinct cell types, define associated marker genes, and infer differentiation trajectories. Using this dataset, we found pervasive sex-dimorphic gene expression across cell types, especially in the liver. Sex-dimorphic genes tended to be involved in processes related to lipid metabolism, and indeed, we observed clear differences in lipid storage in female vs. male turquoise killifish livers. Importantly, we use machine-learning to predict sex using single-cell gene expression in our atlas and identify potential transcriptional markers for molecular sex identity in this species. As proof-of-principle, we show that our atlas can be used to deconvolute existing liver bulk RNA-seq data in this species to obtain accurate estimates of cell type proportions across biological conditions. We believe that this single-cell atlas can be a resource to the community that could notably be leveraged to identify cell type-specific genes for cell type-specific expression in transgenic animals.
Collapse
Affiliation(s)
- Bryan B. Teefy
- Leonard Davis School of Gerontology, University of Southern California, Los Angeles, CA 90089, USA
| | - Aaron J.J. Lemus
- Leonard Davis School of Gerontology, University of Southern California, Los Angeles, CA 90089, USA
- Molecular and Computational Biology Department, USC Dornsife College of Letters, Arts and Sciences, Los Angeles, CA 90089, USA
| | - Ari Adler
- Leonard Davis School of Gerontology, University of Southern California, Los Angeles, CA 90089, USA
| | - Alan Xu
- Leonard Davis School of Gerontology, University of Southern California, Los Angeles, CA 90089, USA
- Quantitative & Computational Biology Department, USC Dornsife College of Letters, Arts and Sciences, Los Angeles, CA 90089, USA
| | - Rajyk Bhala
- Leonard Davis School of Gerontology, University of Southern California, Los Angeles, CA 90089, USA
| | - Katelyn Hsu
- Leonard Davis School of Gerontology, University of Southern California, Los Angeles, CA 90089, USA
- Molecular and Computational Biology Department, USC Dornsife College of Letters, Arts and Sciences, Los Angeles, CA 90089, USA
| | - Bérénice A. Benayoun
- Leonard Davis School of Gerontology, University of Southern California, Los Angeles, CA 90089, USA
- Molecular and Computational Biology Department, USC Dornsife College of Letters, Arts and Sciences, Los Angeles, CA 90089, USA
- Biochemistry and Molecular Medicine Department, USC Keck School of Medicine, Los Angeles, CA 90089, USA
- USC Norris Comprehensive Cancer Center, Epigenetics and Gene Regulation, Los Angeles, CA 90089, USA
- USC Stem Cell Initiative, Los Angeles, CA 90089, USA
| |
Collapse
|
62
|
Yang L, Ng YE, Sun H, Li Y, Chini LCS, LeBrasseur NK, Chen J, Zhang X. Single-cell Mayo Map ( scMayoMap ): an easy-to-use tool for cell type annotation in single-cell RNA-sequencing data analysis. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.03.538463. [PMID: 37205463 PMCID: PMC10187171 DOI: 10.1101/2023.05.03.538463] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
Single-cell RNA-sequencing (scRNA-seq) has become a widely used tool for both basic and translational biomedical research. In scRNA-seq data analysis, cell type annotation is an essential but challenging step. In the past few years, several annotation tools have been developed. These methods require either labeled training/reference datasets, which are not always available, or a list of predefined cell subset markers, which are subject to biases. Thus, a user-friendly and precise annotation tool is still critically needed. We curated a comprehensive cell marker database named scMayoMapDatabase and developed a companion R package scMayoMap , an easy-to-use single cell annotation tool, to provide fast and accurate cell type annotation. The effectiveness of scMayoMap was demonstrated in 48 independent scRNA-seq datasets across different platforms and tissues. scMayoMap performs better than the currently available annotation tools on all the datasets tested. Additionally, the scMayoMapDatabase can be integrated with other tools and further improve their performance. scMayoMap and scMayoMapDatabase will help investigators to define the cell types in their scRNA-seq data in a streamlined and user-friendly way.
Collapse
|
63
|
Pearce H, Croft W, Nicol SM, Margielewska-Davies S, Powell R, Cornall R, Davis SJ, Marcon F, Pugh MR, Fennell É, Powell-Brett S, Mahon BS, Brown RM, Middleton G, Roberts K, Moss P. Tissue-Resident Memory T Cells in Pancreatic Ductal Adenocarcinoma Coexpress PD-1 and TIGIT and Functional Inhibition Is Reversible by Dual Antibody Blockade. Cancer Immunol Res 2023; 11:435-449. [PMID: 36689623 PMCID: PMC10068448 DOI: 10.1158/2326-6066.cir-22-0121] [Citation(s) in RCA: 19] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2022] [Revised: 08/02/2022] [Accepted: 01/19/2023] [Indexed: 01/24/2023]
Abstract
Pancreatic ductal adenocarcinoma (PDAC) has a poor clinical outlook. Responses to immune checkpoint blockade are suboptimal and a much more detailed understanding of the tumor immune microenvironment is needed if this situation is to be improved. Here, we characterized tumor-infiltrating T-cell populations in patients with PDAC using cytometry by time of flight (CyTOF) and single-cell RNA sequencing. T cells were the predominant immune cell subset observed within tumors. Over 30% of CD4+ T cells expressed a CCR6+CD161+ Th17 phenotype and 17% displayed an activated regulatory T-cell profile. Large populations of CD8+ tissue-resident memory (TRM) T cells were also present and expressed high levels of programmed cell death protein 1 (PD-1) and TIGIT. A population of putative tumor-reactive CD103+CD39+ T cells was also observed within the CD8+ tumor-infiltrating lymphocytes population. The expression of PD-1 ligands was limited largely to hemopoietic cells whilst TIGIT ligands were expressed widely within the tumor microenvironment. Programmed death-ligand 1 and CD155 were expressed within the T-cell area of ectopic lymphoid structures and colocalized with PD-1+TIGIT+ CD8+ T cells. Combinatorial anti-PD-1 and TIGIT blockade enhanced IFNγ secretion and proliferation of T cells in the presence of PD-1 and TIGIT ligands. As such, we showed that the PDAC microenvironment is characterized by the presence of substantial populations of TRM cells with an exhausted PD-1+TIGIT+ phenotype where dual checkpoint receptor blockade represents a promising avenue for future immunotherapy.
Collapse
Affiliation(s)
- Hayden Pearce
- Institute of Immunology and Immunotherapy, College of Medical and Dental Sciences, University of Birmingham, Birmingham, United Kingdom
| | - Wayne Croft
- Institute of Immunology and Immunotherapy, College of Medical and Dental Sciences, University of Birmingham, Birmingham, United Kingdom
- Centre for Computational Biology, University of Birmingham, Birmingham, United Kingdom
| | - Samantha M. Nicol
- Institute of Immunology and Immunotherapy, College of Medical and Dental Sciences, University of Birmingham, Birmingham, United Kingdom
| | - Sandra Margielewska-Davies
- Institute of Immunology and Immunotherapy, College of Medical and Dental Sciences, University of Birmingham, Birmingham, United Kingdom
| | - Richard Powell
- Institute of Immunology and Immunotherapy, College of Medical and Dental Sciences, University of Birmingham, Birmingham, United Kingdom
| | - Richard Cornall
- Nuffield Department of Medicine and Medical Research Council Human Immunology Unit, University of Oxford, Oxford, United Kingdom
| | - Simon J. Davis
- Radcliffe Department of Medicine and Medical Research Council Human Immunology Unit, University of Oxford, Oxford, United Kingdom
| | - Francesca Marcon
- Institute of Immunology and Immunotherapy, College of Medical and Dental Sciences, University of Birmingham, Birmingham, United Kingdom
| | - Matthew R. Pugh
- Institute of Immunology and Immunotherapy, College of Medical and Dental Sciences, University of Birmingham, Birmingham, United Kingdom
| | - Éanna Fennell
- Health Research Institute, Bernal Institute and School of Medicine, University of Limerick, Limerick, Ireland
| | - Sarah Powell-Brett
- Institute of Immunology and Immunotherapy, College of Medical and Dental Sciences, University of Birmingham, Birmingham, United Kingdom
- University Hospitals Birmingham NHS Foundation Trust, Queen Elizabeth Hospital Birmingham, Birmingham, United Kingdom
| | - Brinder S. Mahon
- University Hospitals Birmingham NHS Foundation Trust, Queen Elizabeth Hospital Birmingham, Birmingham, United Kingdom
| | - Rachel M. Brown
- University Hospitals Birmingham NHS Foundation Trust, Queen Elizabeth Hospital Birmingham, Birmingham, United Kingdom
| | - Gary Middleton
- Institute of Immunology and Immunotherapy, College of Medical and Dental Sciences, University of Birmingham, Birmingham, United Kingdom
- University Hospitals Birmingham NHS Foundation Trust, Queen Elizabeth Hospital Birmingham, Birmingham, United Kingdom
| | - Keith Roberts
- University Hospitals Birmingham NHS Foundation Trust, Queen Elizabeth Hospital Birmingham, Birmingham, United Kingdom
| | - Paul Moss
- Institute of Immunology and Immunotherapy, College of Medical and Dental Sciences, University of Birmingham, Birmingham, United Kingdom
- University Hospitals Birmingham NHS Foundation Trust, Queen Elizabeth Hospital Birmingham, Birmingham, United Kingdom
| |
Collapse
|
64
|
Ma W, Lu J, Wu H. Cellcano: supervised cell type identification for single cell ATAC-seq data. Nat Commun 2023; 14:1864. [PMID: 37012226 PMCID: PMC10070275 DOI: 10.1038/s41467-023-37439-3] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Accepted: 03/15/2023] [Indexed: 04/05/2023] Open
Abstract
Computational cell type identification is a fundamental step in single-cell omics data analysis. Supervised celltyping methods have gained increasing popularity in single-cell RNA-seq data because of the superior performance and the availability of high-quality reference datasets. Recent technological advances in profiling chromatin accessibility at single-cell resolution (scATAC-seq) have brought new insights to the understanding of epigenetic heterogeneity. With continuous accumulation of scATAC-seq datasets, supervised celltyping method specifically designed for scATAC-seq is in urgent need. Here we develop Cellcano, a computational method based on a two-round supervised learning algorithm to identify cell types from scATAC-seq data. The method alleviates the distributional shift between reference and target data and improves the prediction performance. After systematically benchmarking Cellcano on 50 well-designed celltyping tasks from various datasets, we show that Cellcano is accurate, robust, and computationally efficient. Cellcano is well-documented and freely available at https://marvinquiet.github.io/Cellcano/ .
Collapse
Affiliation(s)
- Wenjing Ma
- Department of Computer Science, Emory University, 400 Dowman Drive, Atlanta, GA, 30322, USA
| | - Jiaying Lu
- Department of Computer Science, Emory University, 400 Dowman Drive, Atlanta, GA, 30322, USA
| | - Hao Wu
- Faculty of Computer Science and Control Engineering, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, 1068 Xueyuan Avenue, Shenzhen University Town, Shenzhen, 518055, P. R. China.
- Department of Biostatistics and Bioinformatics, Rollins School of Public Health, Emory University, 1518 Clifton Road NE, Atlanta, GA, 30322, USA.
| |
Collapse
|
65
|
Pei G, Yan F, Simon LM, Dai Y, Jia P, Zhao Z. deCS: A Tool for Systematic Cell Type Annotations of Single-cell RNA Sequencing Data among Human Tissues. GENOMICS, PROTEOMICS & BIOINFORMATICS 2023; 21:370-384. [PMID: 35470070 PMCID: PMC10626171 DOI: 10.1016/j.gpb.2022.04.001] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/19/2021] [Revised: 03/25/2022] [Accepted: 04/07/2022] [Indexed: 02/02/2023]
Abstract
Single-cell RNA sequencing (scRNA-seq) is revolutionizing the study of complex and dynamic cellular mechanisms. However, cell type annotation remains a main challenge as it largely relies on a priori knowledge and manual curation, which is cumbersome and subjective. The increasing number of scRNA-seq datasets, as well as numerous published genetic studies, has motivated us to build a comprehensive human cell type reference atlas.Here, we present decoding Cell type Specificity (deCS), an automatic cell type annotation method augmented by a comprehensive collection of human cell type expression profiles and marker genes. We used deCS to annotate scRNA-seq data from various tissue types and systematically evaluated the annotation accuracy under different conditions, including reference panels, sequencing depth, and feature selection strategies. Our results demonstrate that expanding the references is critical for improving annotation accuracy. Compared to many existing state-of-the-art annotation tools, deCS significantly reduced computation time and increased accuracy. deCS can be integrated into the standard scRNA-seq analytical pipeline to enhance cell type annotation. Finally, we demonstrated the broad utility of deCS to identify trait-cell type associations in 51 human complex traits, providing deep insights into the cellular mechanisms underlying disease pathogenesis. All documents for deCS, including source code, user manual, demo data, and tutorials, are freely available at https://github.com/bsml320/deCS.
Collapse
Affiliation(s)
- Guangsheng Pei
- Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| | - Fangfang Yan
- Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| | - Lukas M Simon
- Therapeutic Innovation Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Yulin Dai
- Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| | - Peilin Jia
- Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX 77030, USA.
| | - Zhongming Zhao
- Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX 77030, USA; Human Genetics Center, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX 77030, USA; MD Anderson Cancer Center UTHealth Graduate School of Biomedical Sciences, Houston, TX 77030, USA; Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203, USA.
| |
Collapse
|
66
|
Jiao L, Wang G, Dai H, Li X, Wang S, Song T. scTransSort: Transformers for Intelligent Annotation of Cell Types by Gene Embeddings. Biomolecules 2023; 13:biom13040611. [PMID: 37189359 DOI: 10.3390/biom13040611] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2022] [Revised: 03/05/2023] [Accepted: 03/10/2023] [Indexed: 03/31/2023] Open
Abstract
Single-cell transcriptomics is rapidly advancing our understanding of the composition of complex tissues and biological cells, and single-cell RNA sequencing (scRNA-seq) holds great potential for identifying and characterizing the cell composition of complex tissues. Cell type identification by analyzing scRNA-seq data is mostly limited by time-consuming and irreproducible manual annotation. As scRNA-seq technology scales to thousands of cells per experiment, the exponential increase in the number of cell samples makes manual annotation more difficult. On the other hand, the sparsity of gene transcriptome data remains a major challenge. This paper applied the idea of the transformer to single-cell classification tasks based on scRNA-seq data. We propose scTransSort, a cell-type annotation method pretrained with single-cell transcriptomics data. The scTransSort incorporates a method of representing genes as gene expression embedding blocks to reduce the sparsity of data used for cell type identification and reduce the computational complexity. The feature of scTransSort is that its implementation of intelligent information extraction for unordered data, automatically extracting valid features of cell types without the need for manually labeled features and additional references. In experiments on cells from 35 human and 26 mouse tissues, scTransSort successfully elucidated its high accuracy and high performance for cell type identification, and demonstrated its own high robustness and generalization ability.
Collapse
|
67
|
Lee J, Kim M, Kang K, Yang CS, Yoon S. Hierarchical cell-type identifier accurately distinguishes immune-cell subtypes enabling precise profiling of tissue microenvironment with single-cell RNA-sequencing. Brief Bioinform 2023; 24:bbad006. [PMID: 36681937 PMCID: PMC10025442 DOI: 10.1093/bib/bbad006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2022] [Revised: 12/22/2022] [Accepted: 01/02/2023] [Indexed: 01/23/2023] Open
Abstract
Single-cell RNA-seq enabled in-depth study on tissue micro-environment and immune-profiling, where a crucial step is to annotate cell identity. Immune cells play key roles in many diseases, whereas their activities are hard to track due to their diverse and highly variable nature. Existing cell-type identifiers had limited performance for this purpose. We present HiCAT, a hierarchical, marker-based cell-type identifier utilising gene set analysis for statistical scoring for given markers. It features successive identification of major-type, minor-type and subsets utilising subset markers structured in a three-level taxonomy tree. Comparison with manual annotation and pairwise match test showed HiCAT outperforms others in major- and minor-type identification. For subsets, we qualitatively evaluated the marker expression profile demonstrating that HiCAT provide the clearest immune-cell landscape. HiCAT was also used for immune-cell profiling in ulcerative colitis and discovered distinct features of the disease in macrophage and T-cell subsets that could not be identified previously.
Collapse
Affiliation(s)
- Joongho Lee
- Dept. of Computer Science, College of SW Convergence, Dankook University, Yongin-si, Korea, 16890
| | - Minsoo Kim
- Dept. of Computer Science, College of SW Convergence, Dankook University, Yongin-si, Korea, 16890
| | - Keunsoo Kang
- Dept. of Microbiology, College of Natural Sciences, Dankook University, Cheonan-si, Korea, 31116
| | - Chul-Su Yang
- Dept. of Molecular and Life Science, Center for Bionano Intelligence Education and Research, Hanyang University, Ansan, Korea, 15588
| | - Seokhyun Yoon
- Dept. of Electronics & Electrical Eng., College of Engineering, Dankook University, Yongin-si Korea, 16890
| |
Collapse
|
68
|
Ximerakis M, Holton KM, Giadone RM, Ozek C, Saxena M, Santiago S, Adiconis X, Dionne D, Nguyen L, Shah KM, Goldstein JM, Gasperini C, Gampierakis IA, Lipnick SL, Simmons SK, Buchanan SM, Wagers AJ, Regev A, Levin JZ, Rubin LL. Heterochronic parabiosis reprograms the mouse brain transcriptome by shifting aging signatures in multiple cell types. NATURE AGING 2023; 3:327-345. [PMID: 37118429 PMCID: PMC10154248 DOI: 10.1038/s43587-023-00373-6] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/19/2022] [Accepted: 01/30/2023] [Indexed: 04/30/2023]
Abstract
Aging is a complex process involving transcriptomic changes associated with deterioration across multiple tissues and organs, including the brain. Recent studies using heterochronic parabiosis have shown that various aspects of aging-associated decline are modifiable or even reversible. To better understand how this occurs, we performed single-cell transcriptomic profiling of young and old mouse brains after parabiosis. For each cell type, we cataloged alterations in gene expression, molecular pathways, transcriptional networks, ligand-receptor interactions and senescence status. Our analyses identified gene signatures, demonstrating that heterochronic parabiosis regulates several hallmarks of aging in a cell-type-specific manner. Brain endothelial cells were found to be especially malleable to this intervention, exhibiting dynamic transcriptional changes that affect vascular structure and function. These findings suggest new strategies for slowing deterioration and driving regeneration in the aging brain through approaches that do not rely on disease-specific mechanisms or actions of individual circulating factors.
Collapse
Affiliation(s)
- Methodios Ximerakis
- Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA, USA.
- Harvard Stem Cell Institute, Cambridge, MA, USA.
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| | - Kristina M Holton
- Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA, USA
- Harvard Stem Cell Institute, Cambridge, MA, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Richard M Giadone
- Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA, USA
- Harvard Stem Cell Institute, Cambridge, MA, USA
| | - Ceren Ozek
- Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA, USA
- Harvard Stem Cell Institute, Cambridge, MA, USA
| | - Monika Saxena
- Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA, USA
- Harvard Stem Cell Institute, Cambridge, MA, USA
| | - Samara Santiago
- Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA, USA
- Harvard Stem Cell Institute, Cambridge, MA, USA
| | - Xian Adiconis
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Klarman Cell Observatory, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Danielle Dionne
- Klarman Cell Observatory, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Lan Nguyen
- Klarman Cell Observatory, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Kavya M Shah
- Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA, USA
- Harvard Stem Cell Institute, Cambridge, MA, USA
| | - Jill M Goldstein
- Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA, USA
- Harvard Stem Cell Institute, Cambridge, MA, USA
| | - Caterina Gasperini
- Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA, USA
- Harvard Stem Cell Institute, Cambridge, MA, USA
| | - Ioannis A Gampierakis
- Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA, USA
- Harvard Stem Cell Institute, Cambridge, MA, USA
| | - Scott L Lipnick
- Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA, USA
- Harvard Stem Cell Institute, Cambridge, MA, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Sean K Simmons
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Klarman Cell Observatory, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Sean M Buchanan
- Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA, USA
- Harvard Stem Cell Institute, Cambridge, MA, USA
| | - Amy J Wagers
- Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA, USA
- Harvard Stem Cell Institute, Cambridge, MA, USA
- Joslin Diabetes Center, Boston, MA, USA
- Paul F. Glenn Center for the Biology of Aging, Harvard Medical School, Boston, MA, USA
| | - Aviv Regev
- Klarman Cell Observatory, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Howard Hughes Medical Institute, Koch Institute of Integrative Cancer Research, Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Joshua Z Levin
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Klarman Cell Observatory, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Lee L Rubin
- Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA, USA.
- Harvard Stem Cell Institute, Cambridge, MA, USA.
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| |
Collapse
|
69
|
Xie YR, Chari VK, Castro DC, Grant R, Rubakhin SS, Sweedler JV. Data-Driven and Machine Learning-Based Framework for Image-Guided Single-Cell Mass Spectrometry. J Proteome Res 2023; 22:491-500. [PMID: 36695570 PMCID: PMC9901547 DOI: 10.1021/acs.jproteome.2c00714] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
Improved throughput of analysis and lowered limits of detection have allowed single-cell chemical analysis to go beyond the detection of a few molecules in such volume-limited samples, enabling researchers to characterize different functional states of individual cells. Image-guided single-cell mass spectrometry leverages optical and fluorescence microscopy in the high-throughput analysis of cellular and subcellular targets. In this work, we propose DATSIGMA (DAta-driven Tools for Single-cell analysis using Image-Guided MAss spectrometry), a workflow based on data-driven and machine learning approaches for feature extraction and enhanced interpretability of complex single-cell mass spectrometry data. Here, we implemented our toolset with user-friendly programs and tested it on multiple experimental data sets that cover a wide range of biological applications, including classifying various brain cell types. Because it is open-source, it offers a high level of customization and can be easily adapted to other types of single-cell mass spectrometry data.
Collapse
Affiliation(s)
- Yuxuan Richard Xie
- Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, United States,Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, United States
| | - Varsha K. Chari
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, United States
| | - Daniel C. Castro
- Department of Molecular and Integrative Physiology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, United States,Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, United States
| | - Romans Grant
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, United States,Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, United States
| | - Stanislav S. Rubakhin
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, United States,Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, United States
| | - Jonathan V. Sweedler
- Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, United States,Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, United States,Department of Molecular and Integrative Physiology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, United States,Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, United States,Mailing Address: Department of Chemistry, University of Illinois, 71 RAL, Box 63-5, 600 South Mathews Avenue, Urbana, Illinois 61801, United States; Phone: (217) 244-7359;
| |
Collapse
|
70
|
Liu X, Shen Q, Zhang S. Cross-species cell-type assignment from single-cell RNA-seq data by a heterogeneous graph neural network. Genome Res 2023; 33:96-111. [PMID: 36526433 PMCID: PMC9977153 DOI: 10.1101/gr.276868.122] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Accepted: 12/09/2022] [Indexed: 12/23/2022]
Abstract
Cross-species comparative analyses of single-cell RNA sequencing (scRNA-seq) data allow us to explore, at single-cell resolution, the origins of the cellular diversity and evolutionary mechanisms that shape cellular form and function. Cell-type assignment is a crucial step to achieve that. However, the poorly annotated genome and limited known biomarkers hinder us from assigning cell identities for nonmodel species. Here, we design a heterogeneous graph neural network model, CAME, to learn aligned and interpretable cell and gene embeddings for cross-species cell-type assignment and gene module extraction from scRNA-seq data. CAME achieves significant improvements in cell-type characterization across distant species owing to the utilization of non-one-to-one homologous gene mapping ignored by early methods. Our large-scale benchmarking study shows that CAME significantly outperforms five classical methods in terms of cell-type assignment and model robustness to insufficiency and inconsistency of sequencing depths. CAME can transfer the major cell types and interneuron subtypes of human brains to mouse and discover shared cell-type-specific functions in homologous gene modules. CAME can align the trajectories of human and macaque spermatogenesis and reveal their conservative expression dynamics. In short, CAME can make accurate cross-species cell-type assignments even for nonmodel species and uncover shared and divergent characteristics between two species from scRNA-seq data.
Collapse
Affiliation(s)
- Xingyan Liu
- NCMIS, CEMS, RCSDS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China;,School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Qunlun Shen
- NCMIS, CEMS, RCSDS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China;,School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Shihua Zhang
- NCMIS, CEMS, RCSDS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China;,School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China;,Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming 650223, China;,Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou 310024, China
| |
Collapse
|
71
|
Huang Y, Chang H, Chen X, Meng J, Han M, Huang T, Yuan L, Zhang G. A cell marker-based clustering strategy (cmCluster) for precise cell type identification of scRNA-seq data. QUANTITATIVE BIOLOGY 2023. [DOI: 10.15302/j-qb-022-0311] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/11/2023]
|
72
|
Su M, Pan T, Chen QZ, Zhou WW, Gong Y, Xu G, Yan HY, Li S, Shi QZ, Zhang Y, He X, Jiang CJ, Fan SC, Li X, Cairns MJ, Wang X, Li YS. Data analysis guidelines for single-cell RNA-seq in biomedical studies and clinical applications. Mil Med Res 2022; 9:68. [PMID: 36461064 PMCID: PMC9716519 DOI: 10.1186/s40779-022-00434-8] [Citation(s) in RCA: 30] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Accepted: 11/18/2022] [Indexed: 12/03/2022] Open
Abstract
The application of single-cell RNA sequencing (scRNA-seq) in biomedical research has advanced our understanding of the pathogenesis of disease and provided valuable insights into new diagnostic and therapeutic strategies. With the expansion of capacity for high-throughput scRNA-seq, including clinical samples, the analysis of these huge volumes of data has become a daunting prospect for researchers entering this field. Here, we review the workflow for typical scRNA-seq data analysis, covering raw data processing and quality control, basic data analysis applicable for almost all scRNA-seq data sets, and advanced data analysis that should be tailored to specific scientific questions. While summarizing the current methods for each analysis step, we also provide an online repository of software and wrapped-up scripts to support the implementation. Recommendations and caveats are pointed out for some specific analysis tasks and approaches. We hope this resource will be helpful to researchers engaging with scRNA-seq, in particular for emerging clinical applications.
Collapse
Affiliation(s)
- Min Su
- State Key Laboratory of Reproductive Medicine, Nanjing Medical University, Nanjing, 211166 China
| | - Tao Pan
- College of Biomedical Information and Engineering, the First Affiliated Hospital of Hainan Medical University, Hainan Medical University, Haikou, 571199 Hainan China
| | - Qiu-Zhen Chen
- State Key Laboratory of Reproductive Medicine, Nanjing Medical University, Nanjing, 211166 China
| | - Wei-Wei Zhou
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081 Heilongjiang China
| | - Yi Gong
- State Key Laboratory of Reproductive Medicine, Nanjing Medical University, Nanjing, 211166 China
- Department of Immunology, Nanjing Medical University, Nanjing, 211166 China
| | - Gang Xu
- College of Biomedical Information and Engineering, the First Affiliated Hospital of Hainan Medical University, Hainan Medical University, Haikou, 571199 Hainan China
| | - Huan-Yu Yan
- State Key Laboratory of Reproductive Medicine, Nanjing Medical University, Nanjing, 211166 China
| | - Si Li
- College of Biomedical Information and Engineering, the First Affiliated Hospital of Hainan Medical University, Hainan Medical University, Haikou, 571199 Hainan China
| | - Qiao-Zhen Shi
- State Key Laboratory of Reproductive Medicine, Nanjing Medical University, Nanjing, 211166 China
| | - Ya Zhang
- College of Biomedical Information and Engineering, the First Affiliated Hospital of Hainan Medical University, Hainan Medical University, Haikou, 571199 Hainan China
| | - Xiao He
- Department of Laboratory Medicine, Women and Children’s Hospital of Chongqing Medical University, Chongqing, 401174 China
| | | | - Shi-Cai Fan
- Shenzhen Institute for Advanced Study, University of Electronic Science and Technology of China, Shenzhen, 518110 Guangdong China
| | - Xia Li
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081 Heilongjiang China
| | - Murray J. Cairns
- School of Biomedical Sciences and Pharmacy, Faculty of Health and Medicine, the University of Newcastle, University Drive, Callaghan, NSW 2308 Australia
- Precision Medicine Research Program, Hunter Medical Research Institute, New Lambton Heights, NSW 2305 Australia
| | - Xi Wang
- State Key Laboratory of Reproductive Medicine, Nanjing Medical University, Nanjing, 211166 China
| | - Yong-Sheng Li
- College of Biomedical Information and Engineering, the First Affiliated Hospital of Hainan Medical University, Hainan Medical University, Haikou, 571199 Hainan China
| |
Collapse
|
73
|
Karlinsey K, Qu L, Matz AJ, Zhou B. A novel strategy to dissect multifaceted macrophage function in human diseases. J Leukoc Biol 2022; 112:1535-1542. [PMID: 35726704 DOI: 10.1002/jlb.6mr0522-685r] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2022] [Revised: 05/13/2022] [Accepted: 06/03/2022] [Indexed: 01/11/2023] Open
Abstract
Macrophages are widely distributed immune cells that play central roles in a variety of physiologic and pathologic processes, including obesity and cardiovascular disease (CVD). They are highly plastic cells that execute diverse functions according to a combination of signaling and environmental cues. While macrophages have traditionally been understood to polarize to either proinflammatory M1-like or anti-inflammatory M2-like states, evidence has shown that they exist in a spectrum of states between those 2 phenotypic extremes. In obesity-related disease, M1-like macrophages exacerbate inflammation and promote insulin resistance, while M2-like macrophages reduce inflammation, promoting insulin sensitivity. However, polarization markers are expressed inconsistently in adipose tissue macrophages, and they additionally exhibit phenotypes differing from the M1/M2 paradigm. In atherosclerotic CVD, activated plaque macrophages can also exist in a range of proinflammatory or anti-inflammatory states. Some of these macrophages scavenge lipids, developing into heterogeneous foam cell populations. To better characterize the many actions of macrophages in human disease, we have designed a novel set of computational tools: MacSpectrum and AtheroSpectrum. These tools provide information on the inflammatory polarization status, differentiation, and foaming of macrophages in both human and mouse samples, allowing for better characterization of macrophage subpopulations based on their function. Using these tools, we identified disease-relevant cell states in obesity and CVD, including the novel concept that macrophage-derived foam cell formation can follow homeostatic noninflammatory or pathogenic inflammatory foaming programs.
Collapse
Affiliation(s)
- Keaton Karlinsey
- Department of Immunology, School of Medicine, University of Connecticut, Farmington, Connecticut, USA
| | - Lili Qu
- Department of Immunology, School of Medicine, University of Connecticut, Farmington, Connecticut, USA
| | - Alyssa J Matz
- Department of Immunology, School of Medicine, University of Connecticut, Farmington, Connecticut, USA
| | - Beiyan Zhou
- Department of Immunology, School of Medicine, University of Connecticut, Farmington, Connecticut, USA
- Institute for Systems Genomics, University of Connecticut, Storrs, Connecticut, USA
| |
Collapse
|
74
|
Vázquez-García I, Uhlitz F, Ceglia N, Lim JLP, Wu M, Mohibullah N, Niyazov J, Ruiz AEB, Boehm KM, Bojilova V, Fong CJ, Funnell T, Grewal D, Havasov E, Leung S, Pasha A, Patel DM, Pourmaleki M, Rusk N, Shi H, Vanguri R, Williams MJ, Zhang AW, Broach V, Chi DS, Da Cruz Paula A, Gardner GJ, Kim SH, Lennon M, Long Roche K, Sonoda Y, Zivanovic O, Kundra R, Viale A, Derakhshan FN, Geneslaw L, Issa Bhaloo S, Maroldi A, Nunez R, Pareja F, Stylianou A, Vahdatinia M, Bykov Y, Grisham RN, Liu YL, Lakhman Y, Nikolovski I, Kelly D, Gao J, Schietinger A, Hollmann TJ, Bakhoum SF, Soslow RA, Ellenson LH, Abu-Rustum NR, Aghajanian C, Friedman CF, McPherson A, Weigelt B, Zamarin D, Shah SP. Ovarian cancer mutational processes drive site-specific immune evasion. Nature 2022; 612:778-786. [PMID: 36517593 PMCID: PMC9771812 DOI: 10.1038/s41586-022-05496-1] [Citation(s) in RCA: 65] [Impact Index Per Article: 32.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2021] [Accepted: 10/28/2022] [Indexed: 12/15/2022]
Abstract
High-grade serous ovarian cancer (HGSOC) is an archetypal cancer of genomic instability1-4 patterned by distinct mutational processes5,6, tumour heterogeneity7-9 and intraperitoneal spread7,8,10. Immunotherapies have had limited efficacy in HGSOC11-13, highlighting an unmet need to assess how mutational processes and the anatomical sites of tumour foci determine the immunological states of the tumour microenvironment. Here we carried out an integrative analysis of whole-genome sequencing, single-cell RNA sequencing, digital histopathology and multiplexed immunofluorescence of 160 tumour sites from 42 treatment-naive patients with HGSOC. Homologous recombination-deficient HRD-Dup (BRCA1 mutant-like) and HRD-Del (BRCA2 mutant-like) tumours harboured inflammatory signalling and ongoing immunoediting, reflected in loss of HLA diversity and tumour infiltration with highly differentiated dysfunctional CD8+ T cells. By contrast, foldback-inversion-bearing tumours exhibited elevated immunosuppressive TGFβ signalling and immune exclusion, with predominantly naive/stem-like and memory T cells. Phenotypic state associations were specific to anatomical sites, highlighting compositional, topological and functional differences between adnexal tumours and distal peritoneal foci. Our findings implicate anatomical sites and mutational processes as determinants of evolutionary phenotypic divergence and immune resistance mechanisms in HGSOC. Our study provides a multi-omic cellular phenotype data substrate from which to develop and interpret future personalized immunotherapeutic approaches and early detection research.
Collapse
Affiliation(s)
- Ignacio Vázquez-García
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Florian Uhlitz
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Nicholas Ceglia
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Jamie L P Lim
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Michelle Wu
- Department of Surgery, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Neeman Mohibullah
- Integrated Genomics Operation, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Juliana Niyazov
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Arvin Eric B Ruiz
- Department of Pathology and Laboratory Medicine, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Kevin M Boehm
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Viktoria Bojilova
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Christopher J Fong
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Tyler Funnell
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Diljot Grewal
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Eliyahu Havasov
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Samantha Leung
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Arfath Pasha
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Druv M Patel
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Maryam Pourmaleki
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Nicole Rusk
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Hongyu Shi
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Rami Vanguri
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Marc J Williams
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Allen W Zhang
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Vance Broach
- Department of Surgery, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Dennis S Chi
- Department of Surgery, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Arnaud Da Cruz Paula
- Department of Surgery, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Ginger J Gardner
- Department of Surgery, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Sarah H Kim
- Department of Surgery, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Matthew Lennon
- Department of Surgery, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Kara Long Roche
- Department of Surgery, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Yukio Sonoda
- Department of Surgery, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Oliver Zivanovic
- Department of Surgery, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Ritika Kundra
- Marie-Josée and Henry R. Kravis Center for Molecular Oncology, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Agnes Viale
- Marie-Josée and Henry R. Kravis Center for Molecular Oncology, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Fatemeh N Derakhshan
- Department of Pathology and Laboratory Medicine, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Luke Geneslaw
- Department of Pathology and Laboratory Medicine, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Shirin Issa Bhaloo
- Department of Pathology and Laboratory Medicine, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Ana Maroldi
- Department of Pathology and Laboratory Medicine, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Rahelly Nunez
- Department of Pathology and Laboratory Medicine, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Fresia Pareja
- Department of Pathology and Laboratory Medicine, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Anthe Stylianou
- Department of Pathology and Laboratory Medicine, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Mahsa Vahdatinia
- Department of Pathology and Laboratory Medicine, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Yonina Bykov
- Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Rachel N Grisham
- Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, NY, USA
- Weill Cornell Medical Center, New York, NY, USA
| | - Ying L Liu
- Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, NY, USA
- Weill Cornell Medical Center, New York, NY, USA
| | - Yulia Lakhman
- Department of Radiology, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Ines Nikolovski
- Department of Radiology, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Daniel Kelly
- Department of Information Systems, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Jianjiong Gao
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
- Marie-Josée and Henry R. Kravis Center for Molecular Oncology, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Andrea Schietinger
- Weill Cornell Medical Center, New York, NY, USA
- Immunology Program, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Travis J Hollmann
- Department of Pathology and Laboratory Medicine, Memorial Sloan Kettering Cancer Center, New York, NY, USA
- Bristol Myers Squibb, Princeton, NJ, USA
| | - Samuel F Bakhoum
- Human Oncology and Pathogenesis Program, Memorial Sloan Kettering Cancer Center, New York, NY, USA
- Department of Radiation Oncology, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Robert A Soslow
- Department of Pathology and Laboratory Medicine, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Lora H Ellenson
- Department of Pathology and Laboratory Medicine, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Nadeem R Abu-Rustum
- Department of Surgery, Memorial Sloan Kettering Cancer Center, New York, NY, USA
- Weill Cornell Medical Center, New York, NY, USA
| | - Carol Aghajanian
- Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Claire F Friedman
- Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, NY, USA
- Weill Cornell Medical Center, New York, NY, USA
| | - Andrew McPherson
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Britta Weigelt
- Department of Pathology and Laboratory Medicine, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Dmitriy Zamarin
- Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, NY, USA.
- Weill Cornell Medical Center, New York, NY, USA.
| | - Sohrab P Shah
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA.
| |
Collapse
|
75
|
Chen X, Chen Y, Chen X, Wei P, Lin Y, Wu Z, Lin Z, Kang D, Ding C. Single-cell RNA sequencing reveals intra-tumoral heterogeneity of glioblastoma and a pro-tumor subset of tumor-associated macrophages characterized by EZH2 overexpression. Biochim Biophys Acta Mol Basis Dis 2022; 1868:166534. [PMID: 36057370 DOI: 10.1016/j.bbadis.2022.166534] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2022] [Revised: 08/09/2022] [Accepted: 08/23/2022] [Indexed: 10/14/2022]
Abstract
BACKGROUND Glioblastoma (GBM) is a highly heterogeneous disease with poor clinical outcome. AIM To comprehensively dissect molecular landscape of GBM and heterogeneous distribution and potential role of Enhancer of zeste homolog 2 (EZH2) in tumor microenvironment (TME). METHODS Single-cell RNA sequencing (scRNA-seq) analysis was performed in GBM samples from 8 patients. Deconvolution analysis, immunofluorescence (IF) microscopy, reverse-transcription quantitative polymerase chain reaction (RT-qPCR), colony formation experiments, and Cell Counting Kit-8 (CCK-8) assays were performed to confirmed the potential role of EZH2 in TME cells. RESULTS Malignant cells exhibited remarkable heterogeneity in abnormal metabolic patterns. A mesenchymal-2-like (MES2-like) GBM subcluster with glial-immune dual feature was firstly discovered, which were associated with highly activated hallmark pathways, immune evasion associated transcription factor (IRF8), and poor survival. The oncogene, EZH2, was heterogeneously expressed in malignant cells and immune cells consistent with proliferative genes, cell-cycle transcription factors, and similar activated hallmark pathways. In a tumor-associated macrophages (TAMs) subset (macrophage.3), EZH2 was highly expressed with similar changes of transcriptomic dynamics with cell-cycle genes and macrophages M2-phetotype genes. In addition, the subset tightly interacted with malignant cells. Deconvolution analysis showed increased abundance of the subset in GBM compared to low-grade glioma (LGG) and significant association with worse prognosis. Functional verification experiments confirmed the pro-tumor role of TAMs with EZH2 overexpression in GBM. CONCLUSIONS Our study illustrated a MES2-like GBM subcluster characterized by glial-immune dual feature and highlighted the pro-tumor role of a TAMs subset characterized by EZH2 overexpression.
Collapse
Affiliation(s)
- Xiaoyong Chen
- Department of Neurosurgery, Neurosurgery Research Institute, The First Affiliated Hospital, Fujian Medical University, Fuzhou, Fujian, China
| | - Yue Chen
- Department of Neurosurgery, Neurosurgery Research Institute, The First Affiliated Hospital, Fujian Medical University, Fuzhou, Fujian, China
| | - Xiangrong Chen
- Department of Neurosurgery, the Second Affiliated Hospital of Fujian Medical University, Quanzhou, China
| | - Penghui Wei
- Department of Neurosurgery, Neurosurgery Research Institute, The First Affiliated Hospital, Fujian Medical University, Fuzhou, Fujian, China
| | - Yuanxiang Lin
- Department of Neurosurgery, Neurosurgery Research Institute, The First Affiliated Hospital, Fujian Medical University, Fuzhou, Fujian, China
| | - Zanyi Wu
- Department of Neurosurgery, Neurosurgery Research Institute, The First Affiliated Hospital, Fujian Medical University, Fuzhou, Fujian, China
| | - Zhangya Lin
- Department of Neurosurgery, Neurosurgery Research Institute, The First Affiliated Hospital, Fujian Medical University, Fuzhou, Fujian, China.
| | - Dezhi Kang
- Department of Neurosurgery, Neurosurgery Research Institute, The First Affiliated Hospital, Fujian Medical University, Fuzhou, Fujian, China.
| | - Chenyu Ding
- Department of Neurosurgery, Neurosurgery Research Institute, The First Affiliated Hospital, Fujian Medical University, Fuzhou, Fujian, China.
| |
Collapse
|
76
|
PEAR1 regulates expansion of activated fibroblasts and deposition of extracellular matrix in pulmonary fibrosis. Nat Commun 2022; 13:7114. [PMID: 36402779 PMCID: PMC9675736 DOI: 10.1038/s41467-022-34870-w] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Accepted: 11/09/2022] [Indexed: 11/21/2022] Open
Abstract
Pulmonary fibrosis is a chronic interstitial lung disease that causes irreversible and progressive lung scarring and respiratory failure. Activation of fibroblasts plays a central role in the progression of pulmonary fibrosis. Here we show that platelet endothelial aggregation receptor 1 (PEAR1) in fibroblasts may serve as a target for pulmonary fibrosis therapy. Pear1 deficiency in aged mice spontaneously causes alveolar collagens accumulation. Mesenchyme-specific Pear1 deficiency aggravates bleomycin-induced pulmonary fibrosis, confirming that PEAR1 potentially modulates pulmonary fibrosis progression via regulation of mesenchymal cell function. Moreover, single cell and bulk tissue RNA-seq analysis of pulmonary fibroblast reveals the expansion of Activated-fibroblast cluster and enrichment of marker genes in extracellular matrix development in Pear1-/- fibrotic lungs. We further show that PEAR1 associates with Protein Phosphatase 1 to suppress fibrotic factors-induced intracellular signalling and fibroblast activation. Intratracheal aerosolization of monoclonal antibodies activating PEAR1 greatly ameliorates pulmonary fibrosis in both WT and Pear1-humanized mice, significantly improving their survival rate.
Collapse
|
77
|
Wen L, Li G, Huang T, Geng W, Pei H, Yang J, Zhu M, Zhang P, Hou R, Tian G, Su W, Chen J, Zhang D, Zhu P, Zhang W, Zhang X, Zhang N, Zhao Y, Cao X, Peng G, Ren X, Jiang N, Tian C, Chen ZJ. Single-cell technologies: From research to application. Innovation (N Y) 2022; 3:100342. [PMID: 36353677 PMCID: PMC9637996 DOI: 10.1016/j.xinn.2022.100342] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2022] [Accepted: 10/13/2022] [Indexed: 11/09/2022] Open
Abstract
In recent years, more and more single-cell technologies have been developed. A vast amount of single-cell omics data has been generated by large projects, such as the Human Cell Atlas, the Mouse Cell Atlas, the Mouse RNA Atlas, the Mouse ATAC Atlas, and the Plant Cell Atlas. Based on these single-cell big data, thousands of bioinformatics algorithms for quality control, clustering, cell-type annotation, developmental inference, cell-cell transition, cell-cell interaction, and spatial analysis are developed. With powerful experimental single-cell technology and state-of-the-art big data analysis methods based on artificial intelligence, the molecular landscape at the single-cell level can be revealed. With spatial transcriptomics and single-cell multi-omics, even the spatial dynamic multi-level regulatory mechanisms can be deciphered. Such single-cell technologies have many successful applications in oncology, assisted reproduction, embryonic development, and plant breeding. We not only review the experimental and bioinformatics methods for single-cell research, but also discuss their applications in various fields and forecast the future directions for single-cell technologies. We believe that spatial transcriptomics and single-cell multi-omics will become the next booming business for mechanism research and commercial industry.
Collapse
Affiliation(s)
- Lu Wen
- Biomedical Pioneering Innovation Centre (BIOPIC), Peking University, Beijing 100871, China
| | - Guoqiang Li
- Biomedical Pioneering Innovation Centre (BIOPIC), Peking University, Beijing 100871, China
| | - Tao Huang
- Bio-Med Big Data Center, CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, Chinese Academy of Sciences, Shanghai 200031, China
| | - Wei Geng
- School of Chemical Engineering and Technology, Sun Yat-Sen University, Zhuhai 519082, China
| | - Hao Pei
- Mozhuo Biotech (Zhejiang) Co., Ltd., Tongxiang, Jiaxing 314500, China
| | | | - Miao Zhu
- Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou 510530, China
| | - Pengfei Zhang
- Department of Medical Oncology, Zhongshan Hospital, Fudan University, Shanghai 200032, China
| | - Rui Hou
- Geneis (Beijing) Co., Ltd., Beijing 100102, China
| | - Geng Tian
- Geneis (Beijing) Co., Ltd., Beijing 100102, China
| | - Wentao Su
- School of Food Science and Technology, Dalian Polytechnic University, Dalian 116034, China
| | - Jian Chen
- State Key Laboratory of Transducer Technology, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, China
| | - Dake Zhang
- Key Laboratory of Biomechanics and Mechanobiology, Ministry of Education, Beijing Advanced Innovation Center for Biomedical Engineering, School of Engineering Medicine, Beihang University, Beijing 100083, China
| | - Pingan Zhu
- Department of Mechanical Engineering, City University of Hong Kong, Hong Kong 999077, China
| | - Wei Zhang
- Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences, Chongqing 400714, China
| | - Xiuxin Zhang
- Center of Peony, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Key Laboratory of Biology and Genetic Improvement of Flower Crops (North China), Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Ministry of Agriculture and Rural Affairs, Beijing 100081, China
| | - Ning Zhang
- Department of Hepatic Surgery, Fudan University Shanghai Cancer Center, Shanghai 200032, China
| | - Yunlong Zhao
- Advanced Technology Institute, University of Surrey, Guildford, Surrey, GU2 7XH, UK
- National Physical Laboratory, Teddington, Middlesex TW11 0LW, UK
| | - Xin Cao
- Institute of Clinical Science, Zhongshan Hospital, Fudan University, Shanghai 200032, China
| | - Guangdun Peng
- Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou 510530, China
| | - Xianwen Ren
- Biomedical Pioneering Innovation Centre (BIOPIC), Peking University, Beijing 100871, China
| | - Nan Jiang
- West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu 610041, China
- Jinfeng Laboratory, Chongqing 401329, China
| | - Caihuan Tian
- Center of Peony, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Key Laboratory of Biology and Genetic Improvement of Flower Crops (North China), Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Ministry of Agriculture and Rural Affairs, Beijing 100081, China
| | - Zi-Jiang Chen
- Center for Reproductive Medicine, Shandong University, Jinan, Shandong, 250012, China
| |
Collapse
|
78
|
de Couto G, Mesquita T, Wu X, Rajewski A, Huang F, Akhmerov A, Na N, Wu D, Wang Y, Li L, Tran M, Kilfoil P, Cingolani E, Marbán E. Cell therapy attenuates endothelial dysfunction in hypertensive rats with heart failure and preserved ejection fraction. Am J Physiol Heart Circ Physiol 2022; 323:H892-H903. [PMID: 36083797 PMCID: PMC9602891 DOI: 10.1152/ajpheart.00287.2022] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/09/2022] [Revised: 08/24/2022] [Accepted: 09/06/2022] [Indexed: 12/14/2022]
Abstract
Heart failure with preserved ejection fraction (HFpEF) is defined by increased left ventricular (LV) stiffness, impaired vascular compliance, and fibrosis. Although systemic inflammation, driven by comorbidities, has been proposed to play a key role, the precise pathogenesis remains elusive. To test the hypothesis that inflammation drives endothelial dysfunction in HFpEF, we used cardiosphere-derived cells (CDCs), which reduce inflammation and fibrosis, improving function, structure, and survival in HFpEF rats. Dahl salt-sensitive rats fed a high-salt diet developed HFpEF, as manifested by diastolic dysfunction, systemic inflammation, and accelerated mortality. Rats were randomly allocated to receive intracoronary infusion of CDCs or vehicle. Two weeks later, inflammation, oxidative stress, and endothelial function were analyzed. Single-cell RNA sequencing of heart tissue was used to assay transcriptomic changes. CDCs improved endothelial-dependent vasodilation while reducing oxidative stress and restoring endothelial nitric oxide synthase (eNOS) expression. RNA sequencing revealed CDC-induced attenuation of pathways underlying endothelial cell leukocyte binding and innate immunity. Exposure of endothelial cells to CDC-secreted extracellular vesicles in vitro reduced VCAM-1 protein expression and attenuated monocyte adhesion and transmigration. Cell therapy with CDCs corrects diastolic dysfunction, reduces oxidative stress, and restores vascular reactivity. These findings lend credence to the hypothesis that inflammatory changes of the vascular endothelium are important, if not central, to HFpEF pathogenesis.NEW & NOTEWORTHY We tested the concept that inflammation of endothelial cells is a major pathogenic factor in HFpEF. CDCs are heart-derived cell products with verified anti-inflammatory therapeutic properties. Infusion of CDCs reduced oxidative stress, restored eNOS abundance, lowered monocyte levels, and rescued the expression of multiple disease-associated genes, thereby restoring vascular reactivity. The salutary effects of CDCs support the hypothesis that inflammation of endothelial cells is a proximate driver of HFpEF.
Collapse
Affiliation(s)
- Geoffrey de Couto
- Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, California
| | - Thassio Mesquita
- Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, California
| | - Xiaokang Wu
- Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, California
| | - Alex Rajewski
- Applied Genomics, Computation and Translational Core, Cedars-Sinai Cancer, Cedars-Sinai Medical Center, Los Angeles, California
| | - Feng Huang
- Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, California
| | | | - Na Na
- Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, California
| | - Di Wu
- Applied Genomics, Computation and Translational Core, Cedars-Sinai Cancer, Cedars-Sinai Medical Center, Los Angeles, California
| | - Yizhou Wang
- Applied Genomics, Computation and Translational Core, Cedars-Sinai Cancer, Cedars-Sinai Medical Center, Los Angeles, California
| | - Liang Li
- Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, California
| | - My Tran
- Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, California
| | - Peter Kilfoil
- Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, California
| | - Eugenio Cingolani
- Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, California
| | - Eduardo Marbán
- Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, California
| |
Collapse
|
79
|
Mikolajewicz N, Gacesa R, Aguilera-Uribe M, Brown KR, Moffat J, Han H. Multi-level cellular and functional annotation of single-cell transcriptomes using scPipeline. Commun Biol 2022; 5:1142. [PMID: 36307536 PMCID: PMC9616830 DOI: 10.1038/s42003-022-04093-2] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2022] [Accepted: 10/11/2022] [Indexed: 11/08/2022] Open
Abstract
Single-cell RNA-sequencing (scRNA-seq) offers functional insight into complex biology, allowing for the interrogation of cellular populations and gene expression programs at single-cell resolution. Here, we introduce scPipeline, a single-cell data analysis toolbox that builds on existing methods and offers modular workflows for multi-level cellular annotation and user-friendly analysis reports. Advances to scRNA-seq annotation include: (i) co-dependency index (CDI)-based differential expression, (ii) cluster resolution optimization using a marker-specificity criterion, (iii) marker-based cell-type annotation with Miko scoring, and (iv) gene program discovery using scale-free shared nearest neighbor network (SSN) analysis. Both unsupervised and supervised procedures were validated using a diverse collection of scRNA-seq datasets and illustrative examples of cellular transcriptomic annotation of developmental and immunological scRNA-seq atlases are provided herein. Overall, scPipeline offers a flexible computational framework for in-depth scRNA-seq analysis.
Collapse
Affiliation(s)
- Nicholas Mikolajewicz
- Donnelly Centre, University of Toronto, Toronto, ON, Canada
- Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, ON, Canada
| | - Rafael Gacesa
- Donnelly Centre, University of Toronto, Toronto, ON, Canada
| | - Magali Aguilera-Uribe
- Donnelly Centre, University of Toronto, Toronto, ON, Canada
- Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, ON, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| | - Kevin R Brown
- Donnelly Centre, University of Toronto, Toronto, ON, Canada
- Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, ON, Canada
| | - Jason Moffat
- Donnelly Centre, University of Toronto, Toronto, ON, Canada.
- Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, ON, Canada.
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada.
- Institute for Biomedical Engineering, University of Toronto, Toronto, ON, Canada.
| | - Hong Han
- Donnelly Centre, University of Toronto, Toronto, ON, Canada.
- Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, ON, Canada.
| |
Collapse
|
80
|
Chen Y, Zhang S. Automatic Cell Type Annotation Using Marker Genes for Single-Cell RNA Sequencing Data. Biomolecules 2022; 12:biom12101539. [PMID: 36291748 PMCID: PMC9599378 DOI: 10.3390/biom12101539] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2022] [Revised: 10/01/2022] [Accepted: 10/11/2022] [Indexed: 11/16/2022] Open
Abstract
Recent advancement in single-cell RNA sequencing (scRNA-seq) technology is gaining more and more attention. Cell type annotation plays an essential role in scRNA-seq data analysis. Several computational methods have been proposed for automatic annotation. Traditional cell type annotation is to first cluster the cells using unsupervised learning methods based on the gene expression profiles, then to label the clusters using the aggregated cluster-level expression profiles and the marker genes’ information. Such procedure relies heavily on the clustering results. As the purity of clusters cannot be guaranteed, false detection of cluster features may lead to wrong annotations. In this paper, we improve this procedure and propose an Automatic Cell type Annotation Method (ACAM). ACAM delineates a clear framework to conduct automatic cell annotation through representative cluster identification, representative cluster annotation using marker genes, and the remaining cells’ classification. Experiments on seven real datasets show the better performance of ACAM compared to six well-known cell type annotation methods.
Collapse
Affiliation(s)
- Yu Chen
- School of Mathematical Sciences, Fudan University, Shanghai 200433, China
| | - Shuqin Zhang
- School of Mathematical Sciences, Fudan University, Shanghai 200433, China
- Key Laboratory of Mathematics for Nonlinear Science (Ministry of Education), Fudan University, Shanghai 200433, China
- Shanghai Key Laboratory for Contemporary Applied Mathematics, Fudan University, Shanghai 200433, China
- Correspondence:
| |
Collapse
|
81
|
Srivastava A, Bencomo T, Das I, Lee CS. Unravelling the landscape of skin cancer through single-cell transcriptomics. Transl Oncol 2022; 27:101557. [PMID: 36257209 PMCID: PMC9576539 DOI: 10.1016/j.tranon.2022.101557] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Revised: 09/12/2022] [Accepted: 09/15/2022] [Indexed: 11/15/2022] Open
Abstract
The human skin is a complex organ that forms the first line of defense against pathogens and external injury. It is composed of a wide variety of cells that work together to maintain homeostasis and prevent disease, such as skin cancer. The exponentially rising incidence of skin malignancies poses a growing public health challenge, particularly when the disease course is complicated by metastasis and therapeutic resistance. Recent advances in single-cell transcriptomics have provided a high-resolution view of gene expression heterogeneity that can be applied to skin cancers to define cell types and states, understand disease evolution, and develop new therapeutic concepts. This approach has been particularly valuable in characterizing the contribution of immune cells in skin cancer, an area of great clinical importance given the increasing use of immunotherapy in this setting. In this review, we highlight recent skin cancer studies utilizing bulk RNA sequencing, introduce various single-cell transcriptomics approaches, and summarize key findings obtained by applying single-cell transcriptomics to skin cancer.
Collapse
Affiliation(s)
- Ankit Srivastava
- Stanford Program in Epithelial Biology, Stanford University, Stanford, CA 94305 United States of America,Department of Microbiology, Tumor and Cell Biology, Science for Life Laboratory, Karolinska Institute, Stockholm 17177, Sweden
| | - Tomas Bencomo
- Stanford Program in Epithelial Biology, Stanford University, Stanford, CA 94305 United States of America
| | - Ishani Das
- Division of Oncology, School of Medicine, Stanford University, Stanford, CA 94305 United States of America
| | - Carolyn S. Lee
- Stanford Program in Epithelial Biology, Stanford University, Stanford, CA 94305 United States of America,Stanford Cancer Institute, Stanford University, Stanford, CA 94305 United States of America,Veterans Affairs Palo Alto Healthcare System, Palo Alto, CA 94304 United States of America,Corresponding author at: 269 Campus Drive, Room 2160, Stanford, CA 94305.
| |
Collapse
|
82
|
Grabski IN, Irizarry RA. A probabilistic gene expression barcode for annotation of cell types from single-cell RNA-seq data. Biostatistics 2022; 23:1150-1164. [PMID: 35770795 PMCID: PMC9802389 DOI: 10.1093/biostatistics/kxac021] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2021] [Revised: 05/10/2022] [Accepted: 05/22/2022] [Indexed: 01/07/2023] Open
Abstract
Single-cell RNA sequencing (scRNA-seq) quantifies gene expression for individual cells in a sample, which allows distinct cell-type populations to be identified and characterized. An important step in many scRNA-seq analysis pipelines is the annotation of cells into known cell types. While this can be achieved using experimental techniques, such as fluorescence-activated cell sorting, these approaches are impractical for large numbers of cells. This motivates the development of data-driven cell-type annotation methods. We find limitations with current approaches due to the reliance on known marker genes or from overfitting because of systematic differences, or batch effects, between studies. Here, we present a statistical approach that leverages public data sets to combine information across thousands of genes, uses a latent variable model to define cell-type-specific barcodes and account for batch effect variation, and probabilistically annotates cell-type identity from a reference of known cell types. The barcoding approach also provides a new way to discover marker genes. Using a range of data sets, including those generated to represent imperfect real-world reference data, we demonstrate that our approach substantially outperforms current reference-based methods, particularly when predicting across studies.
Collapse
Affiliation(s)
- Isabella N Grabski
- Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Rafael A Irizarry
- Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA, USA and Department of Data Science, Dana-Farber Cancer Institute, Boston, MA, USA
| |
Collapse
|
83
|
Li D, Ding J, Bar-Joseph Z. Unsupervised cell functional annotation for single-cell RNA-seq. Genome Res 2022; 32:1765-1775. [PMID: 35764397 PMCID: PMC9528981 DOI: 10.1101/gr.276609.122] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2022] [Accepted: 06/10/2022] [Indexed: 11/25/2022]
Abstract
One of the first steps in the analysis of single-cell RNA sequencing (scRNA-seq) data is the assignment of cell types. Although a number of supervised methods have been developed for this, in most cases such assignment is performed by first clustering cells in low-dimensional space and then assigning cell types to different clusters. To overcome noise and to improve cell type assignments, we developed UNIFAN, a neural network method that simultaneously clusters and annotates cells using known gene sets. UNIFAN combines both low-dimensional representation for all genes and cell-specific gene set activity scores to determine the clustering. We applied UNIFAN to human and mouse scRNA-seq data sets from several different organs. We show, by using knowledge about gene sets, that UNIFAN greatly outperforms prior methods developed for clustering scRNA-seq data. The gene sets assigned by UNIFAN to different clusters provide strong evidence for the cell type that is represented by this cluster, making annotations easier.
Collapse
Affiliation(s)
- Dongshunyi Li
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, USA
| | - Jun Ding
- Meakins-Christie Laboratories, Department of Medicine, McGill University Health Centre, Montreal, Quebec, H4A 3J1, Canada
| | - Ziv Bar-Joseph
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, USA
- Machine Learning Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, USA
| |
Collapse
|
84
|
Li PH, Kong XY, He YZ, Liu Y, Peng X, Li ZH, Xu H, Luo H, Park J. Recent developments in application of single-cell RNA sequencing in the tumour immune microenvironment and cancer therapy. Mil Med Res 2022; 9:52. [PMID: 36154923 PMCID: PMC9511789 DOI: 10.1186/s40779-022-00414-y] [Citation(s) in RCA: 31] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/03/2022] [Accepted: 08/20/2022] [Indexed: 11/10/2022] Open
Abstract
The advent of single-cell RNA sequencing (scRNA-seq) has provided insight into the tumour immune microenvironment (TIME). This review focuses on the application of scRNA-seq in investigation of the TIME. Over time, scRNA-seq methods have evolved, and components of the TIME have been deciphered with high resolution. In this review, we first introduced the principle of scRNA-seq and compared different sequencing approaches. Novel cell types in the TIME, a continuous transitional state, and mutual intercommunication among TIME components present potential targets for prognosis prediction and treatment in cancer. Thus, we concluded novel cell clusters of cancer-associated fibroblasts (CAFs), T cells, tumour-associated macrophages (TAMs) and dendritic cells (DCs) discovered after the application of scRNA-seq in TIME. We also proposed the development of TAMs and exhausted T cells, as well as the possible targets to interrupt the process. In addition, the therapeutic interventions based on cellular interactions in TIME were also summarized. For decades, quantification of the TIME components has been adopted in clinical practice to predict patient survival and response to therapy and is expected to play an important role in the precise treatment of cancer. Summarizing the current findings, we believe that advances in technology and wide application of single-cell analysis can lead to the discovery of novel perspectives on cancer therapy, which can subsequently be implemented in the clinic. Finally, we propose some future directions in the field of TIME studies that can be aided by scRNA-seq technology.
Collapse
Affiliation(s)
- Pei-Heng Li
- Department of Thyroid and Parathyroid Surgery, Laboratory of Thyroid and Parathyroid Disease, Frontiers Science Centre for Disease-Related Molecular Network, Department of General Surgery, West China Hospital, Sichuan University, Chengdu, 610044, China
| | - Xiang-Yu Kong
- Department of Thyroid and Parathyroid Surgery, Laboratory of Thyroid and Parathyroid Disease, Frontiers Science Centre for Disease-Related Molecular Network, Department of General Surgery, West China Hospital, Sichuan University, Chengdu, 610044, China
| | - Ya-Zhou He
- Department of Oncology, West China School of Public Health and West China Fourth Hospital, Sichuan University, Chengdu, 610044, China
| | - Yi Liu
- Department of Rheumatology and Immunology, Rare Diseases Centre, West China Hospital, Sichuan University, Chengdu, 610044, China
| | - Xi Peng
- College of Computer Science, Sichuan University, Chengdu, 610065, China
| | - Zhi-Hui Li
- Department of Thyroid and Parathyroid Surgery, Laboratory of Thyroid and Parathyroid Disease, Frontiers Science Centre for Disease-Related Molecular Network, Department of General Surgery, West China Hospital, Sichuan University, Chengdu, 610044, China
| | - Heng Xu
- State Key Laboratory of Biotherapy and Cancer Centre, West China Hospital, Sichuan University and Collaborative Innovation Centre, Chengdu, 610044, China
| | - Han Luo
- Department of Thyroid and Parathyroid Surgery, Laboratory of Thyroid and Parathyroid Disease, Frontiers Science Centre for Disease-Related Molecular Network, Department of General Surgery, West China Hospital, Sichuan University, Chengdu, 610044, China.
| | - Jihwan Park
- School of Life Sciences, Gwangju Institute of Science and Technology (GIST), Gwangju, 61005, Republic of Korea.
| |
Collapse
|
85
|
Li Z, Wang Y, Ganan-Gomez I, Colla S, Do KA. A machine learning-based method for automatically identifying novel cells in annotating single-cell RNA-seq data. Bioinformatics 2022; 38:4885-4892. [PMID: 36083008 PMCID: PMC9801963 DOI: 10.1093/bioinformatics/btac617] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Revised: 09/06/2022] [Accepted: 09/08/2022] [Indexed: 01/07/2023] Open
Abstract
MOTIVATION Single-cell RNA sequencing (scRNA-seq) has been widely used to decompose complex tissues into functionally distinct cell types. The first and usually the most important step of scRNA-seq data analysis is to accurately annotate the cell labels. In recent years, many supervised annotation methods have been developed and shown to be more convenient and accurate than unsupervised cell clustering. One challenge faced by all the supervised annotation methods is the identification of the novel cell type, which is defined as the cell type that is not present in the training data, only exists in the testing data. Existing methods usually label the cells simply based on the correlation coefficients or confidence scores, which sometimes results in an excessive number of unlabeled cells. RESULTS We developed a straightforward yet effective method combining autoencoder with iterative feature selection to automatically identify novel cells from scRNA-seq data. Our method trains an autoencoder with the labeled training data and applies the autoencoder to the testing data to obtain reconstruction errors. By iteratively selecting features that demonstrate a bi-modal pattern and reclustering the cells using the selected feature, our method can accurately identify novel cells that are not present in the training data. We further combined this approach with a support vector machine to provide a complete solution for annotating the full range of cell types. Extensive numerical experiments using five real scRNA-seq datasets demonstrated favorable performance of the proposed method over existing methods serving similar purposes. AVAILABILITY AND IMPLEMENTATION Our R software package CAMLU is publicly available through the Zenodo repository (https://doi.org/10.5281/zenodo.7054422) or GitHub repository (https://github.com/ziyili20/CAMLU). SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Ziyi Li
- To whom correspondence should be addressed. or
| | - Yizhuo Wang
- Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | - Irene Ganan-Gomez
- Department of Leukemia, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | - Simona Colla
- Department of Leukemia, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | - Kim-Anh Do
- To whom correspondence should be addressed. or
| |
Collapse
|
86
|
Chen XY, Chen Y, Fang WH, Wu ZY, Wang DL, Xu YW, Yu LH, Lin YX, Kang DZ, Ding CY. Integrative and comparative single-cell analysis reveals transcriptomic difference between human tumefactive demyelinating lesion and glioma. Commun Biol 2022; 5:941. [PMID: 36085357 PMCID: PMC9463163 DOI: 10.1038/s42003-022-03900-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2022] [Accepted: 08/25/2022] [Indexed: 11/09/2022] Open
Abstract
Tumefactive demyelinating lesion (TDL) is an immune-mediated disease which can be misdiagnosed as glioma. At present, there is no study comparing difference between the two disorders at the cellular level. Here, we perform integrative and comparative single-cell RNA sequencing (ScRNA-seq) transcriptomic analysis on TDL and glioma lesions. At single-cell resolution, TDL is comprised primarily of immune cells, which is completely different from glioma. The integrated analysis reveals a TDL-specific microglial subset involving in B cell activation and proliferation. Comparative analysis highlights remyelination function of glial cells and demyelination function of T cells in TDL. Subclustering and pseudotime trajectory analysis of T cells in TDL reveal their heterogeneity and diverse functions involving in TDL pathogenesis and recovery process. Our study identifies substantial differences between TDL and glioma at single-cell resolution. The observed heterogeneity and potentially diverse functions of cells in TDL may be critical in disease progression. Integrative and comparative single-cell analysis reveals transcriptomic difference between human tumefactive demyelinating lesion and glioma.
Collapse
|
87
|
Loftus TJ, Shickel B, Balch JA, Tighe PJ, Abbott KL, Fazzone B, Anderson EM, Rozowsky J, Ozrazgat-Baslanti T, Ren Y, Berceli SA, Hogan WR, Efron PA, Moorman JR, Rashidi P, Upchurch GR, Bihorac A. Phenotype clustering in health care: A narrative review for clinicians. Front Artif Intell 2022; 5:842306. [PMID: 36034597 PMCID: PMC9411746 DOI: 10.3389/frai.2022.842306] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2021] [Accepted: 07/26/2022] [Indexed: 01/03/2023] Open
Abstract
Human pathophysiology is occasionally too complex for unaided hypothetical-deductive reasoning and the isolated application of additive or linear statistical methods. Clustering algorithms use input data patterns and distributions to form groups of similar patients or diseases that share distinct properties. Although clinicians frequently perform tasks that may be enhanced by clustering, few receive formal training and clinician-centered literature in clustering is sparse. To add value to clinical care and research, optimal clustering practices require a thorough understanding of how to process and optimize data, select features, weigh strengths and weaknesses of different clustering methods, select the optimal clustering method, and apply clustering methods to solve problems. These concepts and our suggestions for implementing them are described in this narrative review of published literature. All clustering methods share the weakness of finding potential clusters even when natural clusters do not exist, underscoring the importance of applying data-driven techniques as well as clinical and statistical expertise to clustering analyses. When applied properly, patient and disease phenotype clustering can reveal obscured associations that can help clinicians understand disease pathophysiology, predict treatment response, and identify patients for clinical trial enrollment.
Collapse
Affiliation(s)
- Tyler J. Loftus
- Department of Surgery, University of Florida Health, Gainesville, FL, United States
- Precision and Intelligent Systems in Medicine (PrismaP), University of Florida, Gainesville, FL, United States
- Intelligent Critical Care Center, University of Florida, Gainesville, FL, United States
| | - Benjamin Shickel
- Intelligent Critical Care Center, University of Florida, Gainesville, FL, United States
- Department of Medicine, University of Florida Health, Gainesville, FL, United States
| | - Jeremy A. Balch
- Department of Surgery, University of Florida Health, Gainesville, FL, United States
| | - Patrick J. Tighe
- Departments of Anesthesiology, Orthopedics, and Information Systems/Operations Management, University of Florida Health, Gainesville, FL, United States
| | - Kenneth L. Abbott
- Department of Surgery, University of Florida Health, Gainesville, FL, United States
| | - Brian Fazzone
- Department of Surgery, University of Florida Health, Gainesville, FL, United States
| | - Erik M. Anderson
- Department of Surgery, University of Florida Health, Gainesville, FL, United States
| | - Jared Rozowsky
- Department of Surgery, University of Florida Health, Gainesville, FL, United States
| | - Tezcan Ozrazgat-Baslanti
- Precision and Intelligent Systems in Medicine (PrismaP), University of Florida, Gainesville, FL, United States
- Intelligent Critical Care Center, University of Florida, Gainesville, FL, United States
- Department of Medicine, University of Florida Health, Gainesville, FL, United States
| | - Yuanfang Ren
- Precision and Intelligent Systems in Medicine (PrismaP), University of Florida, Gainesville, FL, United States
- Intelligent Critical Care Center, University of Florida, Gainesville, FL, United States
- Department of Medicine, University of Florida Health, Gainesville, FL, United States
| | - Scott A. Berceli
- Department of Surgery, University of Florida Health, Gainesville, FL, United States
| | - William R. Hogan
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, FL, United States
| | - Philip A. Efron
- Department of Surgery, University of Florida Health, Gainesville, FL, United States
| | - J. Randall Moorman
- Department of Medicine, University of Virginia, Charlottesville, VA, United States
| | - Parisa Rashidi
- Precision and Intelligent Systems in Medicine (PrismaP), University of Florida, Gainesville, FL, United States
- Intelligent Critical Care Center, University of Florida, Gainesville, FL, United States
- Departments of Biomedical Engineering, Computer and Information Science and Engineering, and Electrical and Computer Engineering, University of Florida, Gainesville, FL, United States
| | - Gilbert R. Upchurch
- Department of Surgery, University of Florida Health, Gainesville, FL, United States
| | - Azra Bihorac
- Precision and Intelligent Systems in Medicine (PrismaP), University of Florida, Gainesville, FL, United States
- Intelligent Critical Care Center, University of Florida, Gainesville, FL, United States
- Department of Medicine, University of Florida Health, Gainesville, FL, United States
| |
Collapse
|
88
|
Zubair A, Chapple RH, Natarajan S, Wright WC, Pan M, Lee HM, Tillman H, Easton J, Geeleher P. Cell type identification in spatial transcriptomics data can be improved by leveraging cell-type-informative paired tissue images using a Bayesian probabilistic model. Nucleic Acids Res 2022; 50:e80. [PMID: 35536287 PMCID: PMC9371936 DOI: 10.1093/nar/gkac320] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2021] [Revised: 04/13/2022] [Accepted: 04/21/2022] [Indexed: 11/12/2022] Open
Abstract
Spatial transcriptomics technologies have recently emerged as a powerful tool for measuring spatially resolved gene expression directly in tissues sections, revealing cell types and their dysfunction in unprecedented detail. However, spatial transcriptomics technologies are limited in their ability to separate transcriptionally similar cell types and can suffer further difficulties identifying cell types in slide regions where transcript capture is low. Here, we describe a conceptually novel methodology that can computationally integrate spatial transcriptomics data with cell-type-informative paired tissue images, obtained from, for example, the reverse side of the same tissue section, to improve inferences of tissue cell type composition in spatial transcriptomics data. The underlying statistical approach is generalizable to any spatial transcriptomics protocol where informative paired tissue images can be obtained. We demonstrate a use case leveraging cell-type-specific immunofluorescence markers obtained on mouse brain tissue sections and a use case for leveraging the output of AI annotated H&E tissue images, which we used to markedly improve the identification of clinically relevant immune cell infiltration in breast cancer tissue. Thus, combining spatial transcriptomics data with paired tissue images has the potential to improve the identification of cell types and hence to improve the applications of spatial transcriptomics that rely on accurate cell type identification.
Collapse
Affiliation(s)
- Asif Zubair
- Department of Computational Biology, St. Jude Children's Research Hospital, Memphis, TN 38105, USA
| | - Richard H Chapple
- Department of Computational Biology, St. Jude Children's Research Hospital, Memphis, TN 38105, USA
| | - Sivaraman Natarajan
- Department of Computational Biology, St. Jude Children's Research Hospital, Memphis, TN 38105, USA
| | - William C Wright
- Department of Computational Biology, St. Jude Children's Research Hospital, Memphis, TN 38105, USA
| | - Min Pan
- Department of Computational Biology, St. Jude Children's Research Hospital, Memphis, TN 38105, USA
| | - Hyeong-Min Lee
- Department of Computational Biology, St. Jude Children's Research Hospital, Memphis, TN 38105, USA
| | - Heather Tillman
- Department of Pathology, St. Jude Children's Research Hospital, Memphis, TN 38105, USA
| | - John Easton
- Department of Computational Biology, St. Jude Children's Research Hospital, Memphis, TN 38105, USA
| | - Paul Geeleher
- Department of Computational Biology, St. Jude Children's Research Hospital, Memphis, TN 38105, USA
| |
Collapse
|
89
|
Iida K, Kondo J, Wibisana JN, Inoue M, Okada M. ASURAT: functional annotation-driven unsupervised clustering of single-cell transcriptomes. Bioinformatics 2022; 38:4330-4336. [PMID: 35924984 PMCID: PMC9477531 DOI: 10.1093/bioinformatics/btac541] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2021] [Revised: 07/04/2022] [Accepted: 08/01/2022] [Indexed: 12/24/2022] Open
Abstract
MOTIVATION Single-cell RNA sequencing (scRNA-seq) analysis reveals heterogeneity and dynamic cell transitions. However, conventional gene-based analyses require intensive manual curation to interpret biological implications of computational results. Hence, a theory for efficiently annotating individual cells remains warranted. RESULTS We present ASURAT, a computational tool for simultaneously performing unsupervised clustering and functional annotation of disease, cell type, biological process and signaling pathway activity for single-cell transcriptomic data, using a correlation graph decomposition for genes in database-derived functional terms. We validated the usability and clustering performance of ASURAT using scRNA-seq datasets for human peripheral blood mononuclear cells, which required fewer manual curations than existing methods. Moreover, we applied ASURAT to scRNA-seq and spatial transcriptome datasets for human small cell lung cancer and pancreatic ductal adenocarcinoma, respectively, identifying previously overlooked subpopulations and differentially expressed genes. ASURAT is a powerful tool for dissecting cell subpopulations and improving biological interpretability of complex and noisy transcriptomic data. AVAILABILITY AND IMPLEMENTATION ASURAT is published on Bioconductor (https://doi.org/10.18129/B9.bioc.ASURAT). The codes for analyzing data in this article are available at Github (https://github.com/keita-iida/ASURATBI) and figshare (https://doi.org/10.6084/m9.figshare.19200254.v4). SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Keita Iida
- To whom correspondence should be addressed.
| | - Jumpei Kondo
- Division of Health Sciences, Osaka University Graduate School of Medicine, Suita, Osaka 565-0871, Japan,Department of Clinical Bio-Resource Research and Development, Graduate School of Medicine Kyoto University, Kyoto 606-8501, Japan
| | | | - Masahiro Inoue
- Department of Clinical Bio-Resource Research and Development, Graduate School of Medicine Kyoto University, Kyoto 606-8501, Japan
| | - Mariko Okada
- Institute for Protein Research, Osaka University, Suita, Osaka 565-0871, Japan,Center for Drug Design and Research, National Institutes of Biomedical Innovation, Health and Nutrition, Ibaraki, Osaka 567-0085, Japan
| |
Collapse
|
90
|
Wei Y, Qin Q, Yan C, Hayes MN, Garcia SP, Xi H, Do D, Jin AH, Eng TC, McCarthy KM, Adhikari A, Onozato ML, Spentzos D, Neilsen GP, Iafrate AJ, Wexler LH, Pyle AD, Suvà ML, Dela Cruz F, Pinello L, Langenau DM. Single-cell analysis and functional characterization uncover the stem cell hierarchies and developmental origins of rhabdomyosarcoma. NATURE CANCER 2022; 3:961-975. [PMID: 35982179 PMCID: PMC10430812 DOI: 10.1038/s43018-022-00414-w] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/08/2021] [Accepted: 06/24/2022] [Indexed: 04/29/2023]
Abstract
Rhabdomyosarcoma (RMS) is a common childhood cancer that shares features with developing skeletal muscle. Yet, the conservation of cellular hierarchy with human muscle development and the identification of molecularly defined tumor-propagating cells has not been reported. Using single-cell RNA-sequencing, DNA-barcode cell fate mapping and functional stem cell assays, we uncovered shared tumor cell hierarchies in RMS and human muscle development. We also identified common developmental stages at which tumor cells become arrested. Fusion-negative RMS cells resemble early myogenic cells found in embryonic and fetal development, while fusion-positive RMS cells express a highly specific gene program found in muscle cells transiting from embryonic to fetal development at 7-7.75 weeks of age. Fusion-positive RMS cells also have neural pathway-enriched states, suggesting less-rigid adherence to muscle-lineage hierarchies. Finally, we identified a molecularly defined tumor-propagating subpopulation in fusion-negative RMS that shares remarkable similarity to bi-potent, muscle mesenchyme progenitors that can make both muscle and osteogenic cells.
Collapse
Affiliation(s)
- Yun Wei
- Molecular Pathology Unit, Massachusetts General Research Institute, Charlestown, MA, USA
- Massachusetts General Hospital Cancer Center, Harvard Medical School, Charlestown, MA, USA
- Center for Regenerative Medicine, Massachusetts General Hospital, Boston, MA, USA
- Harvard Stem Cell Institute, Cambridge, MA, USA
| | - Qian Qin
- Molecular Pathology Unit, Massachusetts General Research Institute, Charlestown, MA, USA
- Massachusetts General Hospital Cancer Center, Harvard Medical School, Charlestown, MA, USA
- Center for Regenerative Medicine, Massachusetts General Hospital, Boston, MA, USA
- Harvard Stem Cell Institute, Cambridge, MA, USA
| | - Chuan Yan
- Molecular Pathology Unit, Massachusetts General Research Institute, Charlestown, MA, USA
- Massachusetts General Hospital Cancer Center, Harvard Medical School, Charlestown, MA, USA
- Center for Regenerative Medicine, Massachusetts General Hospital, Boston, MA, USA
- Harvard Stem Cell Institute, Cambridge, MA, USA
| | - Madeline N Hayes
- Molecular Pathology Unit, Massachusetts General Research Institute, Charlestown, MA, USA
- Massachusetts General Hospital Cancer Center, Harvard Medical School, Charlestown, MA, USA
- Center for Regenerative Medicine, Massachusetts General Hospital, Boston, MA, USA
- Harvard Stem Cell Institute, Cambridge, MA, USA
| | - Sara P Garcia
- Molecular Pathology Unit, Massachusetts General Research Institute, Charlestown, MA, USA
| | - Haibin Xi
- Department of Microbiology, Immunology, and Molecular Genetics, University of California Los Angeles, Los Angeles, CA, USA
- Eli and Edythe Broad Center of Regenerative Medicine and Stem Cell Research, University of California Los Angeles, Los Angeles, CA, USA
| | - Daniel Do
- Molecular Pathology Unit, Massachusetts General Research Institute, Charlestown, MA, USA
- Massachusetts General Hospital Cancer Center, Harvard Medical School, Charlestown, MA, USA
- Center for Regenerative Medicine, Massachusetts General Hospital, Boston, MA, USA
- Harvard Stem Cell Institute, Cambridge, MA, USA
| | - Alexander H Jin
- Molecular Pathology Unit, Massachusetts General Research Institute, Charlestown, MA, USA
- Massachusetts General Hospital Cancer Center, Harvard Medical School, Charlestown, MA, USA
- Center for Regenerative Medicine, Massachusetts General Hospital, Boston, MA, USA
- Harvard Stem Cell Institute, Cambridge, MA, USA
| | - Tiffany C Eng
- Molecular Pathology Unit, Massachusetts General Research Institute, Charlestown, MA, USA
- Massachusetts General Hospital Cancer Center, Harvard Medical School, Charlestown, MA, USA
- Center for Regenerative Medicine, Massachusetts General Hospital, Boston, MA, USA
- Harvard Stem Cell Institute, Cambridge, MA, USA
| | - Karin M McCarthy
- Molecular Pathology Unit, Massachusetts General Research Institute, Charlestown, MA, USA
- Massachusetts General Hospital Cancer Center, Harvard Medical School, Charlestown, MA, USA
- Center for Regenerative Medicine, Massachusetts General Hospital, Boston, MA, USA
- Harvard Stem Cell Institute, Cambridge, MA, USA
| | - Abhinav Adhikari
- Molecular Pathology Unit, Massachusetts General Research Institute, Charlestown, MA, USA
- Massachusetts General Hospital Cancer Center, Harvard Medical School, Charlestown, MA, USA
- Center for Regenerative Medicine, Massachusetts General Hospital, Boston, MA, USA
- Harvard Stem Cell Institute, Cambridge, MA, USA
| | - Maristela L Onozato
- Molecular Pathology Unit, Massachusetts General Research Institute, Charlestown, MA, USA
- Massachusetts General Hospital Cancer Center, Harvard Medical School, Charlestown, MA, USA
| | - Dimitrios Spentzos
- Center for Sarcoma and Connective Tissue Oncology, Department of Orthopedic Surgery, Massachusetts General Hospital, Boston, MA, USA
| | - Gunnlaugur P Neilsen
- Center for Sarcoma and Connective Tissue Oncology, Department of Orthopedic Surgery, Massachusetts General Hospital, Boston, MA, USA
| | - A John Iafrate
- Molecular Pathology Unit, Massachusetts General Research Institute, Charlestown, MA, USA
- Massachusetts General Hospital Cancer Center, Harvard Medical School, Charlestown, MA, USA
| | - Leonard H Wexler
- Department of Pediatrics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - April D Pyle
- Department of Microbiology, Immunology, and Molecular Genetics, University of California Los Angeles, Los Angeles, CA, USA
- Eli and Edythe Broad Center of Regenerative Medicine and Stem Cell Research, University of California Los Angeles, Los Angeles, CA, USA
| | - Mario L Suvà
- Molecular Pathology Unit, Massachusetts General Research Institute, Charlestown, MA, USA
- Massachusetts General Hospital Cancer Center, Harvard Medical School, Charlestown, MA, USA
- Harvard Stem Cell Institute, Cambridge, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Filemon Dela Cruz
- Department of Pediatrics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Luca Pinello
- Molecular Pathology Unit, Massachusetts General Research Institute, Charlestown, MA, USA.
- Massachusetts General Hospital Cancer Center, Harvard Medical School, Charlestown, MA, USA.
| | - David M Langenau
- Molecular Pathology Unit, Massachusetts General Research Institute, Charlestown, MA, USA.
- Massachusetts General Hospital Cancer Center, Harvard Medical School, Charlestown, MA, USA.
- Center for Regenerative Medicine, Massachusetts General Hospital, Boston, MA, USA.
- Harvard Stem Cell Institute, Cambridge, MA, USA.
| |
Collapse
|
91
|
scWizard: a web-based automated tool for classifying and annotating single cells and downstream analysis of single-cell RNA-seq data in cancers. Comput Struct Biotechnol J 2022; 20:4902-4909. [PMID: 36147672 PMCID: PMC9474308 DOI: 10.1016/j.csbj.2022.08.028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2022] [Revised: 07/27/2022] [Accepted: 08/12/2022] [Indexed: 11/22/2022] Open
Abstract
scWizard provides comprehensive analysis pipeline for integration strategies of cancer scRNA-seq data. scWizard enables classification of 47 cell subtypes within the TME based on hierarchical model by deep neural network. scWizard gives a higher accuracy for annotation cell subtypes within the TME compared with five methods. scWizard packages is a point-and-click tool helping for researchers without proficient programming skills.
The emerging number of single-cell RNA-seq (scRNA-Seq) datasets allows the characterization of cell types across various cancer types. However, there is still lack of effective tools to integrate the various analysis of single-cells, especially for making fine annotation on subtype cells within the tumor microenvironment (TME). We developed scWizard, a point-and-click tool packaging automated process including our developed cell annotation method based on deep neural network learning and 11 downstream analyses methods. scWizard used 113,976 cells across 13 cancer types as a built-in reference dataset for training the hierarchical model enabling to automatedly classify and annotate 7 major cell types and 47 cell subtypes in the TME. scWizard provides a built-in pre-training set for user’s flexible choice, and gives a higher accuracy for annotation subtypes of tumor-derived T-lymphocytes/natural killer cells (T/NK) and myeloid cells from different cancer types compared with the existing five methods. scWizard has good robustness in three independent cancer datasets, with an accuracy of 0.98 in annotating major cell types, 0.85 in annotating myeloid cell subtypes and 0.79 in annotating T/NK cell subtypes, indicting the wide applicability of scWizard in different cell types of cancers. Finally, the automatic analysis and visualization function of scWizard are presented by using the intrahepatic cholangiocarcinoma (ICC) scRNA-Seq dataset as a case. scWizard focuses on decoding TME and covers various analysis flows for cancer scRNA-Seq study, and provides an easy-to-use tool and a user-friendly interface for researchers widely, to further accelerate the biological discovery of cancer research.
Collapse
|
92
|
Wang R, Peng G, Tam PPL, Jing N. Integration of computational analysis and spatial transcriptomics in single-cell study. GENOMICS, PROTEOMICS & BIOINFORMATICS 2022:S1672-0229(22)00084-5. [PMID: 35901961 PMCID: PMC10372908 DOI: 10.1016/j.gpb.2022.06.006] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/04/2020] [Revised: 06/08/2022] [Accepted: 06/19/2022] [Indexed: 04/08/2023]
Abstract
Recent advances of single-cell transcriptomics technologies and allied computational methodologies have revolutionized molecular cell biology. Meanwhile, pioneering explorations in spatial transcriptomics have opened avenues to address fundamental biological questions in health and diseases. Here, we review the technical attributes of single-cell RNA sequencing and spatial transcriptomics, and the core concepts of computational data analysis. We further highlight the challenges in the application of data integration methodologies and the interpretation of the biological context of the findings.
Collapse
Affiliation(s)
- Ran Wang
- State Key Laboratory of Cell Biology, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai 200031, China
| | - Guangdun Peng
- CAS Key Laboratory of Regenerative Biology, Guangdong Provincial Key Laboratory of Stem Cell and Regenerative Medicine, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou 510530, China; Institute for Stem Cell and Regeneration, Chinese Academy of Sciences, Beijing 100101, China
| | - Patrick P L Tam
- Embryology Research Unit, Children's Medical Research Institute, University of Sydney, Sydney, NSW 2145, Australia; School of Medical Sciences, Faculty of Medicine and Health, University of Sydney, Sydney, NSW 2145, Australia
| | - Naihe Jing
- State Key Laboratory of Cell Biology, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai 200031, China; Guangzhou Laboratory, Guangzhou 510005, China; CAS Key Laboratory of Regenerative Biology, Guangdong Provincial Key Laboratory of Stem Cell and Regenerative Medicine, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou 510530, China; Institute for Stem Cell and Regeneration, Chinese Academy of Sciences, Beijing 100101, China.
| |
Collapse
|
93
|
He J, Lin L, Chen J. Practical bioinformatics pipelines for single-cell RNA-seq data analysis. BIOPHYSICS REPORTS 2022; 8:158-169. [PMID: 37288243 PMCID: PMC10189648 DOI: 10.52601/bpr.2022.210041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2021] [Accepted: 03/01/2022] [Indexed: 11/05/2022] Open
Abstract
Single-cell RNA sequencing (scRNA-seq) is a revolutionary tool to explore cells. With an increasing number of scRNA-seq data analysis tools that have been developed, it is challenging for users to choose and compare their performance. Here, we present an overview of the workflow for computational analysis of scRNA-seq data. We detail the steps of a typical scRNA-seq analysis, including experimental design, pre-processing and quality control, feature selection, dimensionality reduction, cell clustering and annotation, and downstream analysis including batch correction, trajectory inference and cell-cell communication. We provide guidelines according to our best practice. This review will be helpful for the experimentalists interested in analyzing their data, and will aid the users seeking to update their analysis pipelines.
Collapse
Affiliation(s)
- Jiangping He
- Center for Cell Lineage and Atlas (CCLA), Bioland Laboratory (Guangzhou Regenerative Medicine and Health Guangdong Laboratory), Guangzhou 510320, China
| | - Lihui Lin
- Key Laboratory of Regenerative Biology of the Chinese Academy of Sciences and Guangdong Provincial Key Laboratory of Stem Cell and Regenerative Medicine, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou 510530, China
| | - Jiekai Chen
- Center for Cell Lineage and Atlas (CCLA), Bioland Laboratory (Guangzhou Regenerative Medicine and Health Guangdong Laboratory), Guangzhou 510320, China
- Key Laboratory of Regenerative Biology of the Chinese Academy of Sciences and Guangdong Provincial Key Laboratory of Stem Cell and Regenerative Medicine, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou 510530, China
| |
Collapse
|
94
|
Peng L, Wang F, Wang Z, Tan J, Huang L, Tian X, Liu G, Zhou L. Cell-cell communication inference and analysis in the tumour microenvironments from single-cell transcriptomics: data resources and computational strategies. Brief Bioinform 2022; 23:6618236. [PMID: 35753695 DOI: 10.1093/bib/bbac234] [Citation(s) in RCA: 45] [Impact Index Per Article: 22.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Revised: 05/15/2022] [Accepted: 05/19/2022] [Indexed: 12/12/2022] Open
Abstract
Carcinomas are complex ecosystems composed of cancer, stromal and immune cells. Communication between these cells and their microenvironments induces cancer progression and causes therapy resistance. In order to improve the treatment of cancers, it is essential to quantify crosstalk between and within various cell types in a tumour microenvironment. Focusing on the coordinated expression patterns of ligands and cognate receptors, cell-cell communication can be inferred through ligand-receptor interactions (LRIs). In this manuscript, we carry out the following work: (i) introduce pipeline for ligand-receptor-mediated intercellular communication estimation from single-cell transcriptomics and list a few available LRI-related databases and visualization tools; (ii) demonstrate seven classical intercellular communication scoring strategies, highlight four types of representative intercellular communication inference methods, including network-based approaches, machine learning-based approaches, spatial information-based approaches and other approaches; (iii) summarize the evaluation and validation avenues for intercellular communication inference and analyze the advantages and limitations for the above four types of cell-cell communication methods; (iv) comment several major challenges while provide further research directions for intercellular communication analysis in the tumour microenvironments. We anticipate that this work helps to better understand intercellular crosstalk and to further develop powerful cell-cell communication estimation tools for tumor-targeted therapy.
Collapse
Affiliation(s)
- Lihong Peng
- School of Computer Science, Hunan University of Technology, 412007, Hunan, China.,College of Life Sciences and Chemistry, Hunan University of Technology, 412007, Hunan, China
| | - Feixiang Wang
- School of Computer Science, Hunan University of Technology, 412007, Hunan, China
| | - Zhao Wang
- School of Computer Science, Hunan University of Technology, 412007, Hunan, China
| | - Jingwei Tan
- School of Computer Science, Hunan University of Technology, 412007, Hunan, China
| | - Li Huang
- Academy of Arts and Design, Tsinghua University, 10084, Beijing, China.,The Future Laboratory, Tsinghua University, 10084, Beijing, China
| | - Xiongfei Tian
- School of Computer Science, Hunan University of Technology, 412007, Hunan, China
| | - Guangyi Liu
- School of Computer Science, Hunan University of Technology, 412007, Hunan, China
| | - Liqian Zhou
- School of Computer Science, Hunan University of Technology, 412007, Hunan, China
| |
Collapse
|
95
|
Abstract
Single-cell RNA sequencing has led to unprecedented levels of data complexity. Although several computational platforms are available, performing data analyses for multiple datasets remains a significant challenge. Here, we provide a comprehensive analytical protocol to interrogate multiple datasets on SingCellaR, an analysis package in R. This tool can be applied to general single-cell transcriptome analyses. We demonstrate steps for data analyses and visualization using bespoke pipelines, in conjunction with existing analysis tools to study human hematopoietic stem and progenitor cells. For complete details on the use and execution of this protocol, please refer to Roy et al. (2021).
Collapse
Affiliation(s)
- Guanlin Wang
- MRC Molecular Haematology Unit, MRC WIMM, University of Oxford, Oxford OX3 9DS, UK
- Centre for Computational Biology, Medical Research Council Weatherall Institute of Molecular Medicine (MRC WIMM), University of Oxford, Oxford OX3 9DS, UK
| | - Wei Xiong Wen
- MRC Molecular Haematology Unit, MRC WIMM, University of Oxford, Oxford OX3 9DS, UK
- Centre for Computational Biology, Medical Research Council Weatherall Institute of Molecular Medicine (MRC WIMM), University of Oxford, Oxford OX3 9DS, UK
| | - Adam J. Mead
- MRC Molecular Haematology Unit, MRC WIMM, University of Oxford, Oxford OX3 9DS, UK
- National Institute for Health Research (NIHR) Oxford Biomedical Research Centre, Oxford OX4 2PG, UK
| | - Anindita Roy
- MRC Molecular Haematology Unit, MRC WIMM, University of Oxford, Oxford OX3 9DS, UK
- Department of Paediatrics, Children’s Hospital, John Radcliffe Hospital, and MRC WIMM, University of Oxford, Oxford OX3 9DS, UK
- National Institute for Health Research (NIHR) Oxford Biomedical Research Centre, Oxford OX4 2PG, UK
| | - Bethan Psaila
- MRC Molecular Haematology Unit, MRC WIMM, University of Oxford, Oxford OX3 9DS, UK
- National Institute for Health Research (NIHR) Oxford Biomedical Research Centre, Oxford OX4 2PG, UK
| | - Supat Thongjuea
- MRC Molecular Haematology Unit, MRC WIMM, University of Oxford, Oxford OX3 9DS, UK
- Centre for Computational Biology, Medical Research Council Weatherall Institute of Molecular Medicine (MRC WIMM), University of Oxford, Oxford OX3 9DS, UK
- National Institute for Health Research (NIHR) Oxford Biomedical Research Centre, Oxford OX4 2PG, UK
| |
Collapse
|
96
|
Wang HY, Zhao JP, Zheng CH, Su YS. scCNC: A method based on Capsule Network for Clustering scRNA-seq Data. Bioinformatics 2022; 38:3703-3709. [PMID: 35699473 DOI: 10.1093/bioinformatics/btac393] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2021] [Revised: 05/28/2022] [Accepted: 06/11/2022] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION A large number of studies have shown that clustering is a crucial step in scRNA-seq analysis. Most existing methods are based on unsupervised learning without the prior exploitation of any domain knowledge, which does not utilize available gold-standard labels. When confronted by the high dimensionality and general dropout events of scRNA-seq data, purely unsupervised clustering methods may not produce biologically interpretable clusters, which complicates cell type assignment. RESULTS In this paper, we propose a semi-supervised clustering method based on a capsule network named scCNC, that integrates domain knowledge into the clustering step. Significantly, we also propose a Semi-supervised Greedy Iterative Training (SGIT) method used to train the whole network. Experiments on some real scRNA-seq datasets show that scCNC can significantly improve clustering performance and facilitate downstream analyses. AVAILABILITY The source code of scCNC is freely available at https://github.com/WHY-17/scCNC. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Hai-Yun Wang
- College of Mathematics and System Sciences, Xinjiang University, Urumqi, China
| | - Jian-Ping Zhao
- College of Mathematics and System Sciences, Xinjiang University, Urumqi, China.,Institute of Mathematics and Physics, Xinjiang University, Urumqi, China
| | - Chun-Hou Zheng
- College of Mathematics and System Sciences, Xinjiang University, Urumqi, China.,School of Artificial Intelligence, Anhui University, Hefei, China
| | - Yan-Sen Su
- School of Artificial Intelligence, Anhui University, Hefei, China
| |
Collapse
|
97
|
Dohmen J, Baranovskii A, Ronen J, Uyar B, Franke V, Akalin A. Identifying tumor cells at the single-cell level using machine learning. Genome Biol 2022; 23:123. [PMID: 35637521 PMCID: PMC9150321 DOI: 10.1186/s13059-022-02683-1] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2021] [Accepted: 05/06/2022] [Indexed: 12/15/2022] Open
Abstract
Tumors are complex tissues of cancerous cells surrounded by a heterogeneous cellular microenvironment with which they interact. Single-cell sequencing enables molecular characterization of single cells within the tumor. However, cell annotation-the assignment of cell type or cell state to each sequenced cell-is a challenge, especially identifying tumor cells within single-cell or spatial sequencing experiments. Here, we propose ikarus, a machine learning pipeline aimed at distinguishing tumor cells from normal cells at the single-cell level. We test ikarus on multiple single-cell datasets, showing that it achieves high sensitivity and specificity in multiple experimental contexts.
Collapse
Affiliation(s)
- Jan Dohmen
- Bioinformatics and Omics Data Science Platform, Berlin Institute For Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Hannoversche Str.28, 10115, Berlin, Germany
| | - Artem Baranovskii
- Non-coding RNAs and Mechanisms of Cytoplasmic Gene Regulation Lab, Berlin Institute for Medical Systems Biology, Hannoversche Str. 28, 10115, Berlin, Germany
- Free University Berlin, Kaiserswerther Str. 16-18, 14195, Berlin, Germany
| | - Jonathan Ronen
- Bioinformatics and Omics Data Science Platform, Berlin Institute For Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Hannoversche Str.28, 10115, Berlin, Germany
| | - Bora Uyar
- Bioinformatics and Omics Data Science Platform, Berlin Institute For Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Hannoversche Str.28, 10115, Berlin, Germany
| | - Vedran Franke
- Bioinformatics and Omics Data Science Platform, Berlin Institute For Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Hannoversche Str.28, 10115, Berlin, Germany.
| | - Altuna Akalin
- Bioinformatics and Omics Data Science Platform, Berlin Institute For Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Hannoversche Str.28, 10115, Berlin, Germany.
| |
Collapse
|
98
|
PRRX1 is a master transcription factor of stromal fibroblasts for myofibroblastic lineage progression. Nat Commun 2022; 13:2793. [PMID: 35589735 PMCID: PMC9120014 DOI: 10.1038/s41467-022-30484-4] [Citation(s) in RCA: 35] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2021] [Accepted: 05/04/2022] [Indexed: 12/01/2022] Open
Abstract
Although stromal fibroblasts play a critical role in cancer progression, their identities remain unclear as they exhibit high heterogeneity and plasticity. Here, a master transcription factor (mTF) constructing core-regulatory circuitry, PRRX1, which determines the fibroblast lineage with a myofibroblastic phenotype, is identified for the fibroblast subgroup. PRRX1 orchestrates the functional drift of fibroblasts into myofibroblastic phenotype via TGF-β signaling by remodeling a super-enhancer landscape. Such reprogrammed fibroblasts have myofibroblastic functions resulting in markedly enhanced tumorigenicity and aggressiveness of cancer. PRRX1 expression in cancer-associated fibroblast (CAF) has an unfavorable prognosis in multiple cancer types. Fibroblast-specific PRRX1 depletion induces long-term and sustained complete remission of chemotherapy-resistant cancer in genetically engineered mice models. This study reveals CAF subpopulations based on super-enhancer profiles including PRRX1. Therefore, mTFs, including PRRX1, provide another opportunity for establishing a hierarchical classification system of fibroblasts and cancer treatment by targeting fibroblasts. Cancer associated fibroblasts are an important and highly heterogeneous component of the tumor microenvironment. Here the authors identify PRRX1 as a master transcription factor determining a fibroblast lineage with myofibroblastic phenotype, associated with unfavourable prognosis in several cancer types.
Collapse
|
99
|
Dong J, Zhang Y, Wang F. scSemiAE: a deep model with semi-supervised learning for single-cell transcriptomics. BMC Bioinformatics 2022; 23:161. [PMID: 35513780 PMCID: PMC9069784 DOI: 10.1186/s12859-022-04703-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2021] [Accepted: 04/28/2022] [Indexed: 11/30/2022] Open
Abstract
Background With the development of modern sequencing technology, hundreds of thousands of single-cell RNA-sequencing (scRNA-seq) profiles allow to explore the heterogeneity in the cell level, but it faces the challenges of high dimensions and high sparsity. Dimensionality reduction is essential for downstream analysis, such as clustering to identify cell subpopulations. Usually, dimensionality reduction follows unsupervised approach. Results In this paper, we introduce a semi-supervised dimensionality reduction method named scSemiAE, which is based on an autoencoder model. It transfers the information contained in available datasets with cell subpopulation labels to guide the search of better low-dimensional representations, which can ease further analysis. Conclusions Experiments on five public datasets show that, scSemiAE outperforms both unsupervised and semi-supervised baselines whether the transferred information embodied in the number of labeled cells and labeled cell subpopulations is much or less. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-022-04703-0.
Collapse
Affiliation(s)
- Jiayi Dong
- Shanghai Key Lab of Intelligent Information Processing, Fudan University, Shanghai, China.,School of Computer Science and Technology, Fudan University, Shanghai, China
| | - Yin Zhang
- Shanghai Key Lab of Intelligent Information Processing, Fudan University, Shanghai, China.,School of Computer Science and Technology, Fudan University, Shanghai, China
| | - Fei Wang
- Shanghai Key Lab of Intelligent Information Processing, Fudan University, Shanghai, China. .,School of Computer Science and Technology, Fudan University, Shanghai, China.
| |
Collapse
|
100
|
Abondio P, De Intinis C, da Silva Gonçalves Vianez Júnior JL, Pace L. SINGLE CELL MULTIOMIC APPROACHES TO DISENTANGLE T CELL HETEROGENEITY. Immunol Lett 2022; 246:37-51. [DOI: 10.1016/j.imlet.2022.04.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Revised: 04/16/2022] [Accepted: 04/26/2022] [Indexed: 11/29/2022]
|