1
|
Jin X, Hu Z, Yin J, Shan G, Zhao M, Liao Z, Liang J, Bi G, Cheng Y, Xi J, Chen Z, Lin M. Dissection of the cell communication interactions in lung adenocarcinoma identified a prognostic model with immunotherapy efficacy assessment and a potential therapeutic candidate gene ITGB1. Heliyon 2024; 10:e36599. [PMID: 39263115 PMCID: PMC11388764 DOI: 10.1016/j.heliyon.2024.e36599] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2024] [Revised: 08/17/2024] [Accepted: 08/19/2024] [Indexed: 09/13/2024] Open
Abstract
Background The tumor microenvironment (TME) in lung adenocarcinoma (LUAD) influences tumor progression and immunosuppressive phenotypes through cell communication. We aimed to decipher cellular communication and molecular patterns in LUAD. Methods We analyzed scRNA-seq data from LUAD patients in multiple cohorts, revealing complex cell communication networks within the TME. Using cell chat analysis and COSmap technology, we inferred LUAD's spatial organization. Employing the NMF algorithm and survival screening, we identified a cell communication interactions (CCIs) model and validated it across various datasets. Results We uncovered intricate cell communication interactions within the TME, identifying three LUAD patient subtypes with distinct prognosis, clinical characteristics, mutation status, expression patterns, and immune infiltration. Our CCI model exhibited robust performance in prognosis and immunotherapy response prediction. Several potential therapeutic targets and agents for high CCI score patients with immunosuppressive profiles were identified. Machine learning algorithms pinpointed the novel candidate gene ITGB1 and validated its role in LUAD tumor phenotype in vitro. Conclusion Our study elucidates molecular patterns and cell communication interactions in LUAD as effective biomarkers and predictors of immunotherapy response. Targeting cell communication interactions offers novel avenues for LUAD immunotherapy and prognostic evaluations, with ITGB1 emerging as a promising therapeutic target.
Collapse
Affiliation(s)
- Xing Jin
- Department of Thoracic Surgery, Zhongshan Hospital, Fudan University, Shanghai, China
| | - Zhengyang Hu
- Department of Thoracic Surgery, Zhongshan Hospital, Fudan University, Shanghai, China
| | - Jiacheng Yin
- Department of Thoracic Surgery, Zhongshan Hospital, Fudan University, Shanghai, China
| | - Guangyao Shan
- Department of Thoracic Surgery, Zhongshan Hospital, Fudan University, Shanghai, China
| | - Mengnan Zhao
- Department of Thoracic Surgery, Zhongshan Hospital, Fudan University, Shanghai, China
| | - Zhenyu Liao
- Shanghai Cancer Center, Fudan University, Shanghai, China
| | - Jiaqi Liang
- Department of Thoracic Surgery, Zhongshan Hospital, Fudan University, Shanghai, China
| | - Guoshu Bi
- Department of Thoracic Surgery, Zhongshan Hospital, Fudan University, Shanghai, China
| | - Ye Cheng
- Institutes of Biomedical Sciences and Children's Hospital, Fudan University, Shanghai, China
| | - Junjie Xi
- Department of Thoracic Surgery, Zhongshan Hospital, Fudan University, Shanghai, China
| | - Zhencong Chen
- Department of Thoracic Surgery, Zhongshan Hospital, Fudan University, Shanghai, China
| | - Miao Lin
- Department of Thoracic Surgery, Zhongshan Hospital, Fudan University, Shanghai, China
| |
Collapse
|
2
|
Cao X, Huang YA, You ZH, Shang X, Hu L, Hu PW, Huang ZA. scPriorGraph: constructing biosemantic cell-cell graphs with prior gene set selection for cell type identification from scRNA-seq data. Genome Biol 2024; 25:207. [PMID: 39103856 DOI: 10.1186/s13059-024-03357-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2023] [Accepted: 07/29/2024] [Indexed: 08/07/2024] Open
Abstract
Cell type identification is an indispensable analytical step in single-cell data analyses. To address the high noise stemming from gene expression data, existing computational methods often overlook the biologically meaningful relationships between genes, opting to reduce all genes to a unified data space. We assume that such relationships can aid in characterizing cell type features and improving cell type recognition accuracy. To this end, we introduce scPriorGraph, a dual-channel graph neural network that integrates multi-level gene biosemantics. Experimental results demonstrate that scPriorGraph effectively aggregates feature values of similar cells using high-quality graphs, achieving state-of-the-art performance in cell type identification.
Collapse
Affiliation(s)
- Xiyue Cao
- School of Computer Science, Northwestern Polytechnical University, Xi'an, China
| | - Yu-An Huang
- School of Computer Science, Northwestern Polytechnical University, Xi'an, China.
| | - Zhu-Hong You
- School of Computer Science, Northwestern Polytechnical University, Xi'an, China.
| | - Xuequn Shang
- School of Computer Science, Northwestern Polytechnical University, Xi'an, China
| | - Lun Hu
- Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Ürümqi, China
| | - Peng-Wei Hu
- Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Ürümqi, China
| | - Zhi-An Huang
- Research Office, City University of Hong Kong (Dongguan), Dongguan, 523000, China
| |
Collapse
|
3
|
Zheng X, Dozmorov MG, Espinoza L, Bowes MM, Bastacky S, Sawalha AH. Inducible deletion of Ezh2 in CD4+ T cells inhibits kidney T cell infiltration and prevents interstitial nephritis in MRL/lpr lupus-prone mice. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.04.583401. [PMID: 38496595 PMCID: PMC10942296 DOI: 10.1101/2024.03.04.583401] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/19/2024]
Abstract
Systemic lupus erythematosus is a remitting relapsing autoimmune disease characterized by autoantibody production and multi-organ involvement. T cell epigenetic dysregulation plays an important role in the pathogenesis of lupus. We have previously demonstrated upregulation of the key epigenetic regulator EZH2 in CD4+ T cells isolated from lupus patients. To further investigate the role of EZH2 in the pathogenesis of lupus, we generated a tamoxifen-inducible CD4+ T cell Ezh2 conditional knockout mouse on the MRL/lpr lupus-prone background. We demonstrate that Ezh2 deletion abrogates lupus-like disease and prevents T cell differentiation. Single-cell analysis suggests impaired T cell function and activation of programed cell death pathways in EZH2-deficient mice. Ezh2 deletion in CD4+ T cells restricts TCR clonal repertoire and prevents kidney-infiltrating effector CD4+ T cell expansion and tubulointerstitial nephritis, which has been linked to end-stage renal disease in patients with lupus nephritis.
Collapse
|
4
|
Liu D, Fu Y, Wang X, Wang X, Fang X, Zhou Y, Wang R, Zhang P, Jiang M, Jia D, Wang J, Chen H, Guo G, Han X. Characterization of human pluripotent stem cell differentiation by single-cell dual-omics analyses. Stem Cell Reports 2023; 18:2464-2481. [PMID: 37995704 PMCID: PMC10724075 DOI: 10.1016/j.stemcr.2023.10.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Revised: 10/24/2023] [Accepted: 10/26/2023] [Indexed: 11/25/2023] Open
Abstract
In vivo differentiation of human pluripotent stem cells (hPSCs) has unique advantages, such as multilineage differentiation, angiogenesis, and close cell-cell interactions. To systematically investigate multilineage differentiation mechanisms of hPSCs, we constructed the in vivo hPSC differentiation landscape containing 239,670 cells using teratoma models. We identified 43 cell types, inferred 18 cell differentiation trajectories, and characterized common and specific gene regulation patterns during hPSC differentiation at both transcriptional and epigenetic levels. Additionally, we developed the developmental single-cell Basic Local Alignment Search Tool (dscBLAST), an R-based cell identification tool, to simplify the identification processes of developmental cells. Using dscBLAST, we aligned cells in multiple differentiation models to normally developing cells to further understand their differentiation states. Overall, our study offers new insights into stem cell differentiation and human embryonic development; dscBLAST shows favorable cell identification performance, providing a powerful identification tool for developmental cells.
Collapse
Affiliation(s)
- Daiyuan Liu
- Center for Stem Cell and Regenerative Medicine, and Bone Marrow Transplantation Center of the First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang 310058, China
| | - Yuting Fu
- Center for Stem Cell and Regenerative Medicine, and Bone Marrow Transplantation Center of the First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang 310058, China
| | - Xinru Wang
- Center for Stem Cell and Regenerative Medicine, and Bone Marrow Transplantation Center of the First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang 310058, China
| | - Xueyi Wang
- Center for Stem Cell and Regenerative Medicine, and Bone Marrow Transplantation Center of the First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang 310058, China
| | - Xing Fang
- Center for Stem Cell and Regenerative Medicine, and Bone Marrow Transplantation Center of the First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang 310058, China; Zhejiang Provincial Key Laboratory for Tissue Engineering and Regenerative Medicine, Dr. Li Dak Sum & Yip Yio Chin Center for Stem Cell and Regenerative Medicine, Hangzhou, Zhejiang 310058, China
| | - Yincong Zhou
- College of Life Sciences, Zhejiang University, Hangzhou 310058, China
| | - Renying Wang
- Center for Stem Cell and Regenerative Medicine, and Bone Marrow Transplantation Center of the First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang 310058, China
| | - Peijing Zhang
- Center for Stem Cell and Regenerative Medicine, and Bone Marrow Transplantation Center of the First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang 310058, China; Zhejiang Provincial Key Laboratory for Tissue Engineering and Regenerative Medicine, Dr. Li Dak Sum & Yip Yio Chin Center for Stem Cell and Regenerative Medicine, Hangzhou, Zhejiang 310058, China
| | - Mengmeng Jiang
- Liangzhu Laboratory, Zhejiang University, 1369 West Wenyi Road, Hangzhou 311121, China
| | - Danmei Jia
- Center for Stem Cell and Regenerative Medicine, and Bone Marrow Transplantation Center of the First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang 310058, China
| | - Jingjing Wang
- Liangzhu Laboratory, Zhejiang University, 1369 West Wenyi Road, Hangzhou 311121, China
| | - Haide Chen
- Center for Stem Cell and Regenerative Medicine, and Bone Marrow Transplantation Center of the First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang 310058, China; Liangzhu Laboratory, Zhejiang University, 1369 West Wenyi Road, Hangzhou 311121, China; M20 Genomics, Hangzhou, China
| | - Guoji Guo
- Center for Stem Cell and Regenerative Medicine, and Bone Marrow Transplantation Center of the First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang 310058, China; Zhejiang Provincial Key Laboratory for Tissue Engineering and Regenerative Medicine, Dr. Li Dak Sum & Yip Yio Chin Center for Stem Cell and Regenerative Medicine, Hangzhou, Zhejiang 310058, China; Liangzhu Laboratory, Zhejiang University, 1369 West Wenyi Road, Hangzhou 311121, China
| | - Xiaoping Han
- Center for Stem Cell and Regenerative Medicine, and Bone Marrow Transplantation Center of the First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang 310058, China; Zhejiang Provincial Key Laboratory for Tissue Engineering and Regenerative Medicine, Dr. Li Dak Sum & Yip Yio Chin Center for Stem Cell and Regenerative Medicine, Hangzhou, Zhejiang 310058, China.
| |
Collapse
|
5
|
Zhao M, Lu T, Bi G, Hu Z, Liang J, Bian Y, Feng M, Zhan C. PLK1 regulating chemoradiotherapy sensitivity of esophageal squamous cell carcinoma through pentose phosphate pathway/ferroptosis. Biomed Pharmacother 2023; 168:115711. [PMID: 37879213 DOI: 10.1016/j.biopha.2023.115711] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Revised: 10/08/2023] [Accepted: 10/11/2023] [Indexed: 10/27/2023] Open
Abstract
BACKGROUND Esophageal squamous cell carcinoma (ESCC) is the most common pathological type of esophageal cancer in China, accounting for more than 90 %. Most patients were diagnosed with advanced-stage ESCC, for whom new adjuvant therapy is recommended. Therefore, it is urgent to explore new therapeutic targets for ESCC. Ferroptosis, a newly discovered iron-dependent programmed cell death, has been shown to play an important role in carcinogenesis by many studies. This study explored the effect of Polo like kinase 1 (PLK1) on chemoradiotherapy sensitivity of ESCC through ferroptosis METHODS: In this study, we knocked out the expression of PLK1 (PLK1-KO) in ESCC cell lines (KYSE150 and ECA109) with CRISPR/CAS9. The effects of PLK1-knock out on G6PD, the rate-limiting enzyme of pentose phosphate pathway (PPP), and downstream NADPH and GSH were explored. The lipid peroxidation was observed by flow cytometry, and the changes in mitochondria were observed by transmission electron microscopy. Next, through the CCK-8 assay and clone formation assay, the sensitivity to cobalt 60 rays, paclitaxel, and cisplatin were assessed after PLK1-knock out, and the nude mouse tumorigenesis experiment further verified it. The regulation of transcription factor YY1 on PLK1 was evaluated by dual luciferase reporter assay. The expression and correlation of PLK1 and YY1, and their impact on prognosis were analyzed in more than 300 ESCC cases from the GEO database and our center. Finally, the above results were further proved by single-cell sequencing. RESULTS After PLK1 knockout, the expression of G6PD dimer and the level of NADPH and GSH in KYSE150 and ECA109 cells significantly decreased. Accordingly, lipid peroxidation increased, mitochondria became smaller, membrane density increased, and ferroptosis was more likely to occur. However, with the stimulation of exogenous GSH (10 mM), there was no significant difference in lipid peroxidation and ferroptosis between the PLK1-KO group and the control group. After ionizing radiation, the PLK1-KO group had higher lipid peroxidation ratio, more cell death, and was more sensitive to radiation, while exogenous GSH (10 mM) could eliminate this difference. Similar results could also be observed when receiving paclitaxel combined with cisplatin and chemoradiotherapy. The expression of PLK1, G6PD dimer, and the level of NADPH and GSH in KYSE150, ECA109, and 293 T cells stably transfected with YY1-shRNAs significantly decreased, and the cells were more sensitive to radiotherapy and chemotherapy. ESCC patients from the GEO database and our center, YY1 and PLK1 expression were significantly positively-correlated, and the survival of patients with high expression of PLK1 was significantly shorter. Further analysis of single-cell sequencing specimens of ESCC in our center confirmed the above results. CONCLUSION In ESCC, down-regulation of PLK1 can inhibit PPP, and reduce the level of NADPH and GSH, thereby promoting ferroptosis and improving their sensitivity to radiotherapy and chemotherapy. Transcription factor YY1 has a positive regulatory effect on PLK1, and their expressions were positively correlated. PLK1 may be a target for predicting and enhancing the chemoradiotherapy sensitivity of ESCC.
Collapse
Affiliation(s)
- Mengnan Zhao
- Department of Thoracic Surgery, Zhongshan Hospital, Fudan University, Shanghai 200032, China
| | - Tao Lu
- Department of Thoracic Surgery, Zhongshan Hospital, Fudan University, Shanghai 200032, China; Shanxi Province Cancer Hospital/Shanxi Hospital Affiliated to Cancer Hospital, Chinese Academy of Medical Sciences/Cancer Hospital Affiliated to Shanxi Medical University
| | - Guoshu Bi
- Department of Thoracic Surgery, Zhongshan Hospital, Fudan University, Shanghai 200032, China
| | - Zhengyang Hu
- Department of Thoracic Surgery, Zhongshan Hospital, Fudan University, Shanghai 200032, China
| | - Jiaqi Liang
- Department of Thoracic Surgery, Zhongshan Hospital, Fudan University, Shanghai 200032, China
| | - Yunyi Bian
- Department of Thoracic Surgery, Zhongshan Hospital, Fudan University, Shanghai 200032, China
| | - Mingxiang Feng
- Department of Thoracic Surgery, Zhongshan Hospital, Fudan University, Shanghai 200032, China.
| | - Cheng Zhan
- Department of Thoracic Surgery, Zhongshan Hospital, Fudan University, Shanghai 200032, China.
| |
Collapse
|
6
|
Liao Z, Cheng Y, Zhang H, Jin X, Sun H, Wang Y, Yan J. A novel prognostic signature and immune microenvironment characteristics associated with disulfidptosis in papillary thyroid carcinoma based on single-cell RNA sequencing. Front Cell Dev Biol 2023; 11:1308352. [PMID: 38033866 PMCID: PMC10682199 DOI: 10.3389/fcell.2023.1308352] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Accepted: 11/01/2023] [Indexed: 12/02/2023] Open
Abstract
Background: Disulfidptosis is a newly discovered form of regulated cell death. The research on disulfidptosis and tumor progression remains unclear. Our research aims to explore the relationship between disulfidptosis-related genes (DRGs) and the clinical outcomes of papillary thyroid carcinoma (PTC), and its interaction on the tumor microenvironment. Methods: The single-cell RNA seq data of PTC was collected from GEO dataset GSE191288. We illustrated the expression patterns of disulfidptosis-related genes in different cellular components in thyroid cancer. LASSO analyses were performed to construct a disulfidptosis associated risk model in TCGA-THCA database. GO and KEGG analyses were used for functional analyses. CIBERSORT and ESTIMATE algorithm helped with the immune infiltration estimation. qRT‒PCR and flow cytometry was performed to validate the hub gene expression and immune infiltration in clinical samples. Results: We clustered PTC scRNA seq data into 8 annotated cell types. With further DRGs based scoring analyses, we found endothelial cells exhibited the most relationship with disulfidptosis. A 4-gene risk model was established based on the expression pattern of DRGs related endothelial cell subset. The risk model showed good independent prognostic value in both training and validation dataset. Functional enrichment and genomic feature analysis exhibited the significant correlation between tumor immune infiltration and the signature. The results of flow cytometry and immune infiltration estimation showed the higher risk scores was related to immuno-suppressive tumor microenvironment in PTC. Conclusion: Our study exhibited the role of disulfidptosis based signature in the regulation of tumor immune microenvironment and the survival of PTC patients. A 4-gene prognostic signature (including SNAI1, STC1, PKHD1L1 and ANKRD37) was built on the basis of disulfidptosis related endothelial cells. The significance of clinical outcome and immune infiltration pattern was validated robustly.
Collapse
Affiliation(s)
- Zhenyu Liao
- Department of General Surgery, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Ye Cheng
- Institutes of Biomedical Sciences and Children’s Hospital, Fudan University, Shanghai, China
| | - Huiru Zhang
- Shanghai Cancer Centre, Fudan University, Shanghai, China
| | - Xing Jin
- Department of Thoracic Surgery, Zhongshan Hospital, Fudan University, Shanghai, China
| | - Hanxing Sun
- Department of General Surgery, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Yue Wang
- Department of General Surgery, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Jiqi Yan
- Department of General Surgery, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| |
Collapse
|
7
|
CD36 + cancer-associated fibroblasts provide immunosuppressive microenvironment for hepatocellular carcinoma via secretion of macrophage migration inhibitory factor. Cell Discov 2023; 9:25. [PMID: 36878933 PMCID: PMC9988869 DOI: 10.1038/s41421-023-00529-z] [Citation(s) in RCA: 91] [Impact Index Per Article: 91.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Accepted: 02/12/2023] [Indexed: 03/08/2023] Open
Abstract
Hepatocellular carcinoma (HCC) is an immunotherapy-resistant malignancy characterized by high cellular heterogeneity. The diversity of cell types and the interplay between tumor and non-tumor cells remain to be clarified. Single cell RNA sequencing of human and mouse HCC tumors revealed heterogeneity of cancer-associated fibroblast (CAF). Cross-species analysis determined the prominent CD36+ CAFs exhibited high-level lipid metabolism and expression of macrophage migration inhibitory factor (MIF). Lineage-tracing assays showed CD36+CAFs were derived from hepatic stellate cells. Furthermore, CD36 mediated oxidized LDL uptake-dependent MIF expression via lipid peroxidation/p38/CEBPs axis in CD36+ CAFs, which recruited CD33+myeloid-derived suppressor cells (MDSCs) in MIF- and CD74-dependent manner. Co-implantation of CD36+ CAFs with HCC cells promotes HCC progression in vivo. Finally, CD36 inhibitor synergizes with anti-PD-1 immunotherapy by restoring antitumor T-cell responses in HCC. Our work underscores the importance of elucidating the function of specific CAF subset in understanding the interplay between the tumor microenvironment and immune system.
Collapse
|
8
|
Chen J, Xu H, Tao W, Chen Z, Zhao Y, Han JDJ. Transformer for one stop interpretable cell type annotation. Nat Commun 2023; 14:223. [PMID: 36641532 PMCID: PMC9840170 DOI: 10.1038/s41467-023-35923-4] [Citation(s) in RCA: 34] [Impact Index Per Article: 34.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Accepted: 01/09/2023] [Indexed: 01/15/2023] Open
Abstract
Consistent annotation transfer from reference dataset to query dataset is fundamental to the development and reproducibility of single-cell research. Compared with traditional annotation methods, deep learning based methods are faster and more automated. A series of useful single cell analysis tools based on autoencoder architecture have been developed but these struggle to strike a balance between depth and interpretability. Here, we present TOSICA, a multi-head self-attention deep learning model based on Transformer that enables interpretable cell type annotation using biologically understandable entities, such as pathways or regulons. We show that TOSICA achieves fast and accurate one-stop annotation and batch-insensitive integration while providing biologically interpretable insights for understanding cellular behavior during development and disease progressions. We demonstrate TOSICA's advantages by applying it to scRNA-seq data of tumor-infiltrating immune cells, and CD14+ monocytes in COVID-19 to reveal rare cell types, heterogeneity and dynamic trajectories associated with disease progression and severity.
Collapse
Affiliation(s)
- Jiawei Chen
- Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Center for Quantitative Biology (CQB), Peking University, Beijing, 100871, China
| | - Hao Xu
- Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Center for Quantitative Biology (CQB), Peking University, Beijing, 100871, China
| | - Wanyu Tao
- Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Center for Quantitative Biology (CQB), Peking University, Beijing, 100871, China
| | - Zhaoxiong Chen
- Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Center for Quantitative Biology (CQB), Peking University, Beijing, 100871, China
| | - Yuxuan Zhao
- Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Center for Quantitative Biology (CQB), Peking University, Beijing, 100871, China
| | - Jing-Dong J Han
- Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Center for Quantitative Biology (CQB), Peking University, Beijing, 100871, China.
| |
Collapse
|
9
|
Umu SU, Rapp Vander-Elst K, Karlsen VT, Chouliara M, Bækkevold ES, Jahnsen FL, Domanska D. Cellsnake: a user-friendly tool for single-cell RNA sequencing analysis. Gigascience 2022; 12:giad091. [PMID: 37889009 PMCID: PMC10603768 DOI: 10.1093/gigascience/giad091] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Revised: 08/25/2023] [Accepted: 10/05/2023] [Indexed: 10/28/2023] Open
Abstract
BACKGROUND Single-cell RNA sequencing (scRNA-seq) provides high-resolution transcriptome data to understand the heterogeneity of cell populations at the single-cell level. The analysis of scRNA-seq data requires the utilization of numerous computational tools. However, nonexpert users usually experience installation issues, a lack of critical functionality or batch analysis modes, and the steep learning curves of existing pipelines. RESULTS We have developed cellsnake, a comprehensive, reproducible, and accessible single-cell data analysis workflow, to overcome these problems. Cellsnake offers advanced features for standard users and facilitates downstream analyses in both R and Python environments. It is also designed for easy integration into existing workflows, allowing for rapid analyses of multiple samples. CONCLUSION As an open-source tool, cellsnake is accessible through Bioconda, PyPi, Docker, and GitHub, making it a cost-effective and user-friendly option for researchers. By using cellsnake, researchers can streamline the analysis of scRNA-seq data and gain insights into the complex biology of single cells.
Collapse
Affiliation(s)
- Sinan U Umu
- Department of Pathology, Institute of Clinical Medicine, University of Oslo, Oslo 0372, Norway
| | | | - Victoria T Karlsen
- Department of Pathology, Oslo University Hospital-Rikshospitalet, Oslo 0372, Norway
| | - Manto Chouliara
- Department of Pathology, Oslo University Hospital-Rikshospitalet, Oslo 0372, Norway
| | - Espen Sønderaal Bækkevold
- Department of Pathology, Oslo University Hospital-Rikshospitalet, Oslo 0372, Norway
- Institute of Oral Biology, University of Oslo, Oslo 0372, Norway
| | - Frode Lars Jahnsen
- Department of Pathology, Institute of Clinical Medicine, University of Oslo, Oslo 0372, Norway
- Department of Pathology, Oslo University Hospital-Rikshospitalet, Oslo 0372, Norway
| | - Diana Domanska
- Department of Pathology, Oslo University Hospital-Rikshospitalet, Oslo 0372, Norway
- Department of Microbiology, University of Oslo, Rikshospitalet, Oslo 0372, Norway
| |
Collapse
|
10
|
Jiang S, Qian Q, Zhu T, Zong W, Shang Y, Jin T, Zhang Y, Chen M, Wu Z, Chu Y, Zhang R, Luo S, Jing W, Zou D, Bao Y, Xiao J, Zhang Z. Cell Taxonomy: a curated repository of cell types with multifaceted characterization. Nucleic Acids Res 2022; 51:D853-D860. [PMID: 36161321 PMCID: PMC9825571 DOI: 10.1093/nar/gkac816] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Accepted: 09/24/2022] [Indexed: 01/12/2023] Open
Abstract
Single-cell studies have delineated cellular diversity and uncovered increasing numbers of previously uncharacterized cell types in complex tissues. Thus, synthesizing growing knowledge of cellular characteristics is critical for dissecting cellular heterogeneity, developmental processes and tumorigenesis at single-cell resolution. Here, we present Cell Taxonomy (https://ngdc.cncb.ac.cn/celltaxonomy), a comprehensive and curated repository of cell types and associated cell markers encompassing a wide range of species, tissues and conditions. Combined with literature curation and data integration, the current version of Cell Taxonomy establishes a well-structured taxonomy for 3,143 cell types and houses a comprehensive collection of 26,613 associated cell markers in 257 conditions and 387 tissues across 34 species. Based on 4,299 publications and single-cell transcriptomic profiles of ∼3.5 million cells, Cell Taxonomy features multifaceted characterization for cell types and cell markers, involving quality assessment of cell markers and cell clusters, cross-species comparison, cell composition of tissues and cellular similarity based on markers. Taken together, Cell Taxonomy represents a fundamentally useful reference to systematically and accurately characterize cell types and thus lays an important foundation for deeply understanding and exploring cellular biology in diverse species.
Collapse
Affiliation(s)
| | | | | | - Wenting Zong
- National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China,China National Center for Bioinformation, Beijing 100101, China,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yunfei Shang
- National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China,China National Center for Bioinformation, Beijing 100101, China,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Tong Jin
- National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China,China National Center for Bioinformation, Beijing 100101, China,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yuansheng Zhang
- National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China,China National Center for Bioinformation, Beijing 100101, China,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Ming Chen
- National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China,China National Center for Bioinformation, Beijing 100101, China,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Zishan Wu
- National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China,China National Center for Bioinformation, Beijing 100101, China,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yuan Chu
- National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China,China National Center for Bioinformation, Beijing 100101, China,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Rongqin Zhang
- National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China,China National Center for Bioinformation, Beijing 100101, China,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Sicheng Luo
- National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China,China National Center for Bioinformation, Beijing 100101, China,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Wei Jing
- National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China,China National Center for Bioinformation, Beijing 100101, China,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Dong Zou
- National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China,China National Center for Bioinformation, Beijing 100101, China
| | - Yiming Bao
- National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China,China National Center for Bioinformation, Beijing 100101, China,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jingfa Xiao
- Correspondence may also be addressed to Jingfa Xiao.
| | - Zhang Zhang
- To whom correspondence should be addressed. Tel: +86 10 84097261; Fax: +86 10 84097720;
| |
Collapse
|
11
|
A Novel Prognostic Signature Revealed the Interaction of Immune Cells in Tumor Microenvironment Based on Single-Cell RNA Sequencing for Lung Adenocarcinoma. J Immunol Res 2022; 2022:6555810. [PMID: 35812244 PMCID: PMC9270162 DOI: 10.1155/2022/6555810] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Revised: 06/12/2022] [Accepted: 06/15/2022] [Indexed: 11/18/2022] Open
Abstract
Background The tumor immune microenvironment (TIME) played an important role in immunotherapy prognosis and treatment response. Immune cells constitute a large part of the tumor microenvironment and regulate tumor progression. Our research is dedicated to studying the infiltrating immune cell in lung adenocarcinoma (LUAD) and seeking potential targets. Methods The scRNA-seq data were collected from our FDZSH and two public datasets. The code for cell-type mapping algorithms was downloaded from the CIBERSORTx portal. The bioinformatics data of LUAD patients could be approached from The Cancer Genome Atlas (TCGA) portal. Weighted gene coexpression network analysis (WGCNA) and least absolute shrinkage and selection operator (LASSO) analyses were performed to construct a risk model. TIMER2 and TIDE helped with the immune infiltration estimation, while PROGENy helped the cancer-related pathways' enrichment analysis. GSE31210 dataset and IMVigor ICB therapy cohort validated our findings as the external validation datasets. Results We clustered the scRNA-seq dataset (integrating our FDZSH datasets and other public datasets) into 23 subpopulations. After curated cell annotation, we implemented Cibersort and WGCNA analysis to anchor the brown module and natural killer cell cluster1 due to the most relationship with tumor trait. The overlap of the brown module gene, natural killer cell signature, and DEGs of tumor and adjacent normal samples was screened by LASSO Cox regression. The obtained 5-gene risk model showed an excellent prognostic performance in the validation dataset. Furthermore, there was a correlation between risk score and tumor-infiltrating immune cells and tumor genomics abnormity. Patients with higher risk scores had a significantly lower immunotherapy response rate. Conclusion Our observations implied that immune cells played a pivotal role in TIME and established a 5-gene signature (including IDH2, ADRB2, SFTPC, CCDC69, and CCND2) on the basement of nature killer markers targeted by WGCNA analysis. The significance of clinical outcome and immunotherapy response prediction was validated robustly.
Collapse
|
12
|
Wang Z, Zhang H, Zhai Y, Li F, Shi X, Ying M. Single-Cell Profiling Reveals Heterogeneity of Primary and Lymph Node Metastatic Tumors and Immune Cell Populations and Discovers Important Prognostic Significance of CCDC43 in Oral Squamous Cell Carcinoma. Front Immunol 2022; 13:843322. [PMID: 35401551 PMCID: PMC8986980 DOI: 10.3389/fimmu.2022.843322] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2021] [Accepted: 02/28/2022] [Indexed: 12/24/2022] Open
Abstract
Although substantial progress has been made in biological research and clinical treatment in recent years, the clinical prognosis of oral squamous cell carcinoma (OSCC) is still not satisfactory. Tumor immune microenvironment (TIME) is a potential target, which plays an essential role in the response of anti-tumor immunity and immunotherapy. In this study, we used scRNA-seq data, revealing the heterogeneity of TIME between metastatic and primary site. We found that in the metastatic site, the content of cytotoxic T cells and classical activated macrophages (M1 macrophages) increases significantly, while alternately activated macrophages (M2 macrophages) and inflammatory cancer-associated fibroblasts (iCAFs) decrease, which may be due to the increased immunogenicity of OSCC cells in the metastatic site and the changes in some signal pathways. We also found that iCAFs may recruit alternately activated macrophages (M2 macrophages) by secreting CXCL12. Then, we described a regulatory network for communication between various TIME cells centered on OSCC cells, which can help to clarify the possible mechanism of lymph node metastasis in OSCC cells. By performing pseudotime trajectory analysis, we found that the expression CCDC43 is upregulated in more advanced OSCC cells and is an independent prognostic factor for poor living conditions. Other than this, the high expression of CCDC43 may impair the antitumor immunity of the human body and promote the metastasis of OSCC cells. Our research provides a profound insight into the immunological study of OSCC and an essential resource for future drug discovery.
Collapse
Affiliation(s)
- Zhenyu Wang
- Department of Molecular Biology and Biochemistry, Basic Medical College of Nanchang University, Nanchang, China
- Medical College of Nanchang University, Nanchang, China
| | - Hongbo Zhang
- Medical College of Nanchang University, Nanchang, China
| | - Yanan Zhai
- Medical College of Nanchang University, Nanchang, China
| | - Fengtong Li
- Medical College of Nanchang University, Nanchang, China
| | - Xueying Shi
- Medical College of Nanchang University, Nanchang, China
| | - Muying Ying
- Department of Molecular Biology and Biochemistry, Basic Medical College of Nanchang University, Nanchang, China
- *Correspondence: Muying Ying,
| |
Collapse
|
13
|
Wilson SB, Howden SE, Vanslambrouck JM, Dorison A, Alquicira-Hernandez J, Powell JE, Little MH. DevKidCC allows for robust classification and direct comparisons of kidney organoid datasets. Genome Med 2022. [PMID: 35189942 DOI: 10.1101/2021.01.20.427346] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/09/2023] Open
Abstract
BACKGROUND While single-cell transcriptional profiling has greatly increased our capacity to interrogate biology, accurate cell classification within and between datasets is a key challenge. This is particularly so in pluripotent stem cell-derived organoids which represent a model of a developmental system. Here, clustering algorithms and selected marker genes can fail to accurately classify cellular identity while variation in analyses makes it difficult to meaningfully compare datasets. Kidney organoids provide a valuable resource to understand kidney development and disease. However, direct comparison of relative cellular composition between protocols has proved challenging. Hence, an unbiased approach for classifying cell identity is required. METHODS The R package, scPred, was trained on multiple single cell RNA-seq datasets of human fetal kidney. A hierarchical model classified cellular subtypes into nephron, stroma and ureteric epithelial elements. This model, provided in the R package DevKidCC ( github.com/KidneyRegeneration/DevKidCC ), was then used to predict relative cell identity within published kidney organoid datasets generated using distinct cell lines and differentiation protocols, interrogating the impact of such variations. The package contains custom functions for the display of differential gene expression within cellular subtypes. RESULTS DevKidCC was used to directly compare between distinct kidney organoid protocols, identifying differences in relative proportions of cell types at all hierarchical levels of the model and highlighting variations in stromal and unassigned cell types, nephron progenitor prevalence and relative maturation of individual epithelial segments. Of note, DevKidCC was able to distinguish distal nephron from ureteric epithelium, cell types with overlapping profiles that have previously confounded analyses. When applied to a variation in protocol via the addition of retinoic acid, DevKidCC identified a consequential depletion of nephron progenitors. CONCLUSIONS The application of DevKidCC to kidney organoids reproducibly classifies component cellular identity within distinct single-cell datasets. The application of the tool is summarised in an interactive Shiny application, as are examples of the utility of in-built functions for data presentation. This tool will enable the consistent and rapid comparison of kidney organoid protocols, driving improvements in patterning to kidney endpoints and validating new approaches.
Collapse
Affiliation(s)
- Sean B Wilson
- Murdoch Children's Research Institute, Flemington Rd, Parkville, Victoria, Australia
- Department of Paediatrics, The University of Melbourne, Victoria, Parkville, Australia
| | - Sara E Howden
- Murdoch Children's Research Institute, Flemington Rd, Parkville, Victoria, Australia
- Department of Paediatrics, The University of Melbourne, Victoria, Parkville, Australia
| | | | - Aude Dorison
- Murdoch Children's Research Institute, Flemington Rd, Parkville, Victoria, Australia
| | - Jose Alquicira-Hernandez
- Garvan-Weizmann Centre for Cellular Genomics, The Kinghorn Cancer Centre, Darlinghurst, New South Wales, Australia
| | - Joseph E Powell
- Garvan-Weizmann Centre for Cellular Genomics, The Kinghorn Cancer Centre, Darlinghurst, New South Wales, Australia
- UNSW Cellular Genomics Futures Institute, University of New South Wales, Sydney, New South Wales, Australia
| | - Melissa H Little
- Murdoch Children's Research Institute, Flemington Rd, Parkville, Victoria, Australia.
- Department of Paediatrics, The University of Melbourne, Victoria, Parkville, Australia.
- Department of Anatomy and Neuroscience, The University of Melbourne, Victoria, Parkville, Australia.
- Novo Nordisk Foundation Centre for Stem Cell Medicine, Copenhagen, Denmark.
| |
Collapse
|
14
|
Wilson SB, Howden SE, Vanslambrouck JM, Dorison A, Alquicira-Hernandez J, Powell JE, Little MH. DevKidCC allows for robust classification and direct comparisons of kidney organoid datasets. Genome Med 2022; 14:19. [PMID: 35189942 PMCID: PMC8862535 DOI: 10.1186/s13073-022-01023-z] [Citation(s) in RCA: 22] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2021] [Accepted: 02/08/2022] [Indexed: 12/20/2022] Open
Abstract
Background While single-cell transcriptional profiling has greatly increased our capacity to interrogate biology, accurate cell classification within and between datasets is a key challenge. This is particularly so in pluripotent stem cell-derived organoids which represent a model of a developmental system. Here, clustering algorithms and selected marker genes can fail to accurately classify cellular identity while variation in analyses makes it difficult to meaningfully compare datasets. Kidney organoids provide a valuable resource to understand kidney development and disease. However, direct comparison of relative cellular composition between protocols has proved challenging. Hence, an unbiased approach for classifying cell identity is required. Methods The R package, scPred, was trained on multiple single cell RNA-seq datasets of human fetal kidney. A hierarchical model classified cellular subtypes into nephron, stroma and ureteric epithelial elements. This model, provided in the R package DevKidCC (github.com/KidneyRegeneration/DevKidCC), was then used to predict relative cell identity within published kidney organoid datasets generated using distinct cell lines and differentiation protocols, interrogating the impact of such variations. The package contains custom functions for the display of differential gene expression within cellular subtypes. Results DevKidCC was used to directly compare between distinct kidney organoid protocols, identifying differences in relative proportions of cell types at all hierarchical levels of the model and highlighting variations in stromal and unassigned cell types, nephron progenitor prevalence and relative maturation of individual epithelial segments. Of note, DevKidCC was able to distinguish distal nephron from ureteric epithelium, cell types with overlapping profiles that have previously confounded analyses. When applied to a variation in protocol via the addition of retinoic acid, DevKidCC identified a consequential depletion of nephron progenitors. Conclusions The application of DevKidCC to kidney organoids reproducibly classifies component cellular identity within distinct single-cell datasets. The application of the tool is summarised in an interactive Shiny application, as are examples of the utility of in-built functions for data presentation. This tool will enable the consistent and rapid comparison of kidney organoid protocols, driving improvements in patterning to kidney endpoints and validating new approaches. Supplementary Information The online version contains supplementary material available at 10.1186/s13073-022-01023-z.
Collapse
Affiliation(s)
- Sean B Wilson
- Murdoch Children's Research Institute, Flemington Rd, Parkville, Victoria, Australia.,Department of Paediatrics, The University of Melbourne, Victoria, Parkville, Australia
| | - Sara E Howden
- Murdoch Children's Research Institute, Flemington Rd, Parkville, Victoria, Australia.,Department of Paediatrics, The University of Melbourne, Victoria, Parkville, Australia
| | | | - Aude Dorison
- Murdoch Children's Research Institute, Flemington Rd, Parkville, Victoria, Australia
| | - Jose Alquicira-Hernandez
- Garvan-Weizmann Centre for Cellular Genomics, The Kinghorn Cancer Centre, Darlinghurst, New South Wales, Australia
| | - Joseph E Powell
- Garvan-Weizmann Centre for Cellular Genomics, The Kinghorn Cancer Centre, Darlinghurst, New South Wales, Australia.,UNSW Cellular Genomics Futures Institute, University of New South Wales, Sydney, New South Wales, Australia
| | - Melissa H Little
- Murdoch Children's Research Institute, Flemington Rd, Parkville, Victoria, Australia. .,Department of Paediatrics, The University of Melbourne, Victoria, Parkville, Australia. .,Department of Anatomy and Neuroscience, The University of Melbourne, Victoria, Parkville, Australia. .,Novo Nordisk Foundation Centre for Stem Cell Medicine, Copenhagen, Denmark.
| |
Collapse
|
15
|
Xie B, Jiang Q, Mora A, Li X. Automatic cell type identification methods for single-cell RNA sequencing. Comput Struct Biotechnol J 2021; 19:5874-5887. [PMID: 34815832 PMCID: PMC8572862 DOI: 10.1016/j.csbj.2021.10.027] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2021] [Revised: 09/23/2021] [Accepted: 10/18/2021] [Indexed: 11/24/2022] Open
Abstract
Single-cell RNA sequencing (scRNA-seq) has become a powerful tool for scientists of many research disciplines due to its ability to elucidate the heterogeneous and complex cell-type compositions of different tissues and cell populations. Traditional cell-type identification methods for scRNA-seq data analysis are time-consuming and knowledge-dependent for manual annotation. By contrast, automatic cell-type identification methods may have the advantages of being fast, accurate, and more user friendly. Here, we discuss and evaluate thirty-two published automatic methods for scRNA-seq data analysis in terms of their prediction accuracy, F1-score, unlabeling rate and running time. We highlight the advantages and disadvantages of these methods and provide recommendations of method choice depending on the available information. The challenges and future applications of these automatic methods are further discussed. In addition, we provide a free scRNA-seq data analysis package encompassing the discussed automatic methods to help the easy usage of them in real-world applications.
Collapse
Affiliation(s)
- Bingbing Xie
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-Sen University, Guangzhou 510060, Guangdong, China
| | - Qin Jiang
- Affiliated Eye Hospital of Nanjing Medical University, Nanjing, China
| | - Antonio Mora
- Joint School of Life Sciences, Guangzhou Medical University and Guangzhou Institutes of Biomedicine and Health (Chinese Academy of Sciences), Xinzao, Panyu District, Guangzhou 511436, Guangdong, China
| | - Xuri Li
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-Sen University, Guangzhou 510060, Guangdong, China
| |
Collapse
|
16
|
Shen S, Sun Y, Matsumoto M, Shim WJ, Sinniah E, Wilson SB, Werner T, Wu Z, Bradford ST, Hudson J, Little MH, Powell J, Nguyen Q, Palpant NJ. Integrating single-cell genomics pipelines to discover mechanisms of stem cell differentiation. Trends Mol Med 2021; 27:1135-1158. [PMID: 34657800 DOI: 10.1016/j.molmed.2021.09.006] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2021] [Revised: 09/19/2021] [Accepted: 09/22/2021] [Indexed: 12/12/2022]
Abstract
Pluripotent stem cells underpin a growing sector that leverages their differentiation potential for research, industry, and clinical applications. This review evaluates the landscape of methods in single-cell transcriptomics that are enabling accelerated discovery in stem cell science. We focus on strategies for scaling stem cell differentiation through multiplexed single-cell analyses, for evaluating molecular regulation of cell differentiation using new analysis algorithms, and methods for integration and projection analysis to classify and benchmark stem cell derivatives against in vivo cell types. By discussing the available methods, comparing their strengths, and illustrating strategies for developing integrated analysis pipelines, we provide user considerations to inform their implementation and interpretation.
Collapse
Affiliation(s)
- Sophie Shen
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Australia
| | - Yuliangzi Sun
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Australia
| | - Maika Matsumoto
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Australia
| | - Woo Jun Shim
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Australia
| | - Enakshi Sinniah
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Australia
| | - Sean B Wilson
- Murdoch Children's Research Institute, Melbourne, Australia
| | - Tessa Werner
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Australia
| | - Zhixuan Wu
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Australia
| | | | - James Hudson
- QIMR Berghofer Medical Research Institute, Brisbane, Australia
| | - Melissa H Little
- Murdoch Children's Research Institute, Melbourne, Australia; Department of Pediatrics, Faculty of Medicine, Dentistry and Health Sciences, University of Melbourne, Melbourne, Australia
| | - Joseph Powell
- Garvan-Weizmann Centre for Cellular Genomics, Garvan Institute of Medical Research, Sydney, Australia; UNSW Cellular Genomics Futures Institute, UNSW, Sydney, Australia
| | - Quan Nguyen
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Australia
| | - Nathan J Palpant
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Australia.
| |
Collapse
|
17
|
Huang Y, Zhang P. Evaluation of machine learning approaches for cell-type identification from single-cell transcriptomics data. Brief Bioinform 2021; 22:6145135. [PMID: 33611343 DOI: 10.1093/bib/bbab035] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2020] [Revised: 01/22/2021] [Accepted: 01/22/2021] [Indexed: 11/14/2022] Open
Abstract
Single-cell transcriptomics is rapidly advancing our understanding of the cellular composition of complex tissues and organisms. A major limitation in most analysis pipelines is the reliance on manual annotations to determine cell identities, which is time-consuming, irreproducible and sometimes lack canonical markers for certain cell types. There is a growing realization of the potential of machine learning models as a supervised classification approach that can significantly aid decision-making processes for cell-type identification. In this work, we performed a comprehensive and impartial evaluation of 10 machine learning models that automatically assign cell phenotypes. The performance of classification methods is estimated by using 20 publicly accessible single-cell RNA sequencing datasets with different sizes, technologies, species and levels of complexity. The performance of each model for within dataset (intra-dataset) and across datasets (inter-dataset) experiments based on the classification accuracy and computation time are both evaluated. Besides, the sensitivity to the number of input features, different annotation levels and dataset complexity was also been estimated. Results showed that most classifiers perform well on a variety of datasets with decreased accuracy for complex datasets, while the Linear Support Vector Machine (linear-SVM) and Logistic Regression classifier models have the best overall performance with remarkably fast computation time. Our work provides a guideline for researchers to select and apply suitable machine learning-based classification models in their analysis workflows and sheds some light on the potential direction of future improvement on automated cell phenotype classification tools based on the single-cell sequencing data.
Collapse
Affiliation(s)
- Yixuan Huang
- George Washington University School of Business, Washington, DC, USA
| | - Peng Zhang
- Division of Immunotherapy and the Director of Bioinformatics Core at the Institute of Human Virology, University of Maryland School of Medicine, MD, USA
| |
Collapse
|
18
|
Ghannoum S, Leoncio Netto W, Fantini D, Ragan-Kelley B, Parizadeh A, Jonasson E, Ståhlberg A, Farhan H, Köhn-Luque A. DIscBIO: A User-Friendly Pipeline for Biomarker Discovery in Single-Cell Transcriptomics. Int J Mol Sci 2021; 22:ijms22031399. [PMID: 33573289 PMCID: PMC7866810 DOI: 10.3390/ijms22031399] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2020] [Revised: 01/08/2021] [Accepted: 01/28/2021] [Indexed: 02/08/2023] Open
Abstract
The growing attention toward the benefits of single-cell RNA sequencing (scRNA-seq) is leading to a myriad of computational packages for the analysis of different aspects of scRNA-seq data. For researchers without advanced programing skills, it is very challenging to combine several packages in order to perform the desired analysis in a simple and reproducible way. Here we present DIscBIO, an open-source, multi-algorithmic pipeline for easy, efficient and reproducible analysis of cellular sub-populations at the transcriptomic level. The pipeline integrates multiple scRNA-seq packages and allows biomarker discovery with decision trees and gene enrichment analysis in a network context using single-cell sequencing read counts through clustering and differential analysis. DIscBIO is freely available as an R package. It can be run either in command-line mode or through a user-friendly computational pipeline using Jupyter notebooks. We showcase all pipeline features using two scRNA-seq datasets. The first dataset consists of circulating tumor cells from patients with breast cancer. The second one is a cell cycle regulation dataset in myxoid liposarcoma. All analyses are available as notebooks that integrate in a sequential narrative R code with explanatory text and output data and images. R users can use the notebooks to understand the different steps of the pipeline and will guide them to explore their scRNA-seq data. We also provide a cloud version using Binder that allows the execution of the pipeline without the need of downloading R, Jupyter or any of the packages used by the pipeline. The cloud version can serve as a tutorial for training purposes, especially for those that are not R users or have limited programing skills. However, in order to do meaningful scRNA-seq analyses, all users will need to understand the implemented methods and their possible options and limitations.
Collapse
Affiliation(s)
- Salim Ghannoum
- Department of Molecular Medicine, Institute of Basic Medical Sciences, University of Oslo, 0372 Oslo, Norway; (A.P.); (H.F.)
- Correspondence: (S.G.); (A.K.-L.); Tel.: +46-76-5770129 (S.G.)
| | - Waldir Leoncio Netto
- Oslo Centre for Biostatistics and Epidemiology, Faculty of Medicine, University of Oslo, 0372 Oslo, Norway;
| | - Damiano Fantini
- Department of Urology, Northwestern University, Chicago, IL 60611, USA;
| | | | - Amirabbas Parizadeh
- Department of Molecular Medicine, Institute of Basic Medical Sciences, University of Oslo, 0372 Oslo, Norway; (A.P.); (H.F.)
| | - Emma Jonasson
- Sahlgrenska Center for Cancer Research, Department of Laboratory Medicine, Institute of Biomedicine, Sahlgrenska Academy at University of Gothenburg, SE-41390 Gothenburg, Sweden; (E.J.); (A.S.)
| | - Anders Ståhlberg
- Sahlgrenska Center for Cancer Research, Department of Laboratory Medicine, Institute of Biomedicine, Sahlgrenska Academy at University of Gothenburg, SE-41390 Gothenburg, Sweden; (E.J.); (A.S.)
- Wallenberg Centre for Molecular and Translational Medicine, University of Gothenburg, SE-41390 Gothenburg, Sweden
- Department of Clinical Genetics and Genomics, Sahlgrenska University Hospital, SE-41390 Gothenburg, Sweden
| | - Hesso Farhan
- Department of Molecular Medicine, Institute of Basic Medical Sciences, University of Oslo, 0372 Oslo, Norway; (A.P.); (H.F.)
| | - Alvaro Köhn-Luque
- Oslo Centre for Biostatistics and Epidemiology, Faculty of Medicine, University of Oslo, 0372 Oslo, Norway;
- Correspondence: (S.G.); (A.K.-L.); Tel.: +46-76-5770129 (S.G.)
| |
Collapse
|