1
|
Chang CJ, Hsu CY, Liu Q, Shyr Y. VICTOR: Validation and inspection of cell type annotation through optimal regression. Comput Struct Biotechnol J 2024; 23:3270-3280. [PMID: 39296808 PMCID: PMC11408377 DOI: 10.1016/j.csbj.2024.08.028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2024] [Revised: 08/30/2024] [Accepted: 08/31/2024] [Indexed: 09/21/2024] Open
Abstract
Single-cell RNA sequencing provides unprecedent opportunities to explore the heterogeneity and dynamics inherent in cellular biology. An essential step in the data analysis involves the automatic annotation of cells. Despite development of numerous tools for automated cell annotation, assessing the reliability of predicted annotations remains challenging, particularly for rare and unknown cell types. Here, we introduce VICTOR: Validation and inspection of cell type annotation through optimal regression. VICTOR aims to gauge the confidence of cell annotations by an elastic-net regularized regression with optimal thresholds. We demonstrated that VICTOR performed well in identifying inaccurate annotations, surpassing existing methods in diagnostic ability across various single-cell datasets, including within-platform, cross-platform, cross-studies, and cross-omics settings.
Collapse
Affiliation(s)
- Chia-Jung Chang
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN 37203, USA
- Center for Quantitative Sciences, Vanderbilt University Medical Center, Nashville, TN 37203, USA
- Department of Biomedical Engineering, National Cheng Kung University, Tainan, Taiwan
| | - Chih-Yuan Hsu
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN 37203, USA
- Center for Quantitative Sciences, Vanderbilt University Medical Center, Nashville, TN 37203, USA
| | - Qi Liu
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN 37203, USA
- Center for Quantitative Sciences, Vanderbilt University Medical Center, Nashville, TN 37203, USA
| | - Yu Shyr
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN 37203, USA
- Center for Quantitative Sciences, Vanderbilt University Medical Center, Nashville, TN 37203, USA
| |
Collapse
|
2
|
Thomas RA, Fiorini MR, Amiri S, Fon EA, Farhan SMK. ScRNAbox: empowering single-cell RNA sequencing on high performance computing systems. BMC Bioinformatics 2024; 25:319. [PMID: 39354372 PMCID: PMC11443813 DOI: 10.1186/s12859-024-05935-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Accepted: 09/17/2024] [Indexed: 10/03/2024] Open
Abstract
BACKGROUND Single-cell RNA sequencing (scRNAseq) offers powerful insights, but the surge in sample sizes demands more computational power than local workstations can provide. Consequently, high-performance computing (HPC) systems have become imperative. Existing web apps designed to analyze scRNAseq data lack scalability and integration capabilities, while analysis packages demand coding expertise, hindering accessibility. RESULTS In response, we introduce scRNAbox, an innovative scRNAseq analysis pipeline meticulously crafted for HPC systems. This end-to-end solution, executed via the SLURM workload manager, efficiently processes raw data from standard and Hashtag samples. It incorporates quality control filtering, sample integration, clustering, cluster annotation tools, and facilitates cell type-specific differential gene expression analysis between two groups. We demonstrate the application of scRNAbox by analyzing two publicly available datasets. CONCLUSION ScRNAbox is a comprehensive end-to-end pipeline designed to streamline the processing and analysis of scRNAseq data. By responding to the pressing demand for a user-friendly, HPC solution, scRNAbox bridges the gap between the growing computational demands of scRNAseq analysis and the coding expertise required to meet them.
Collapse
Affiliation(s)
- Rhalena A Thomas
- Department of Neurology and Neurosurgery, Montreal Neurological Institute-Hospital, McGill University, Montreal, QC, H3A 2B4, Canada.
- The Neuro Early Drug Discovery Unit, Montreal Neurological Institute-Hospital, McGill University, Montreal, QC, H3A 2B4, Canada.
| | - Michael R Fiorini
- Department of Human Genetics, Montreal Neurological Institute-Hospital, McGill University, Montreal, QC, H3A 2B4, Canada
| | - Saeid Amiri
- Department of Neurology and Neurosurgery, Montreal Neurological Institute-Hospital, McGill University, Montreal, QC, H3A 2B4, Canada
| | - Edward A Fon
- Department of Neurology and Neurosurgery, Montreal Neurological Institute-Hospital, McGill University, Montreal, QC, H3A 2B4, Canada
- The Neuro Early Drug Discovery Unit, Montreal Neurological Institute-Hospital, McGill University, Montreal, QC, H3A 2B4, Canada
| | - Sali M K Farhan
- Department of Neurology and Neurosurgery, Montreal Neurological Institute-Hospital, McGill University, Montreal, QC, H3A 2B4, Canada.
- Department of Human Genetics, Montreal Neurological Institute-Hospital, McGill University, Montreal, QC, H3A 2B4, Canada.
| |
Collapse
|
3
|
Pan L, Mou T, Huang Y, Hong W, Yu M, Li X. Ursa: A Comprehensive Multiomics Toolbox for High-Throughput Single-Cell Analysis. Mol Biol Evol 2023; 40:msad267. [PMID: 38091963 PMCID: PMC10752348 DOI: 10.1093/molbev/msad267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2023] [Revised: 09/08/2023] [Accepted: 11/03/2023] [Indexed: 12/28/2023] Open
Abstract
The burgeoning amount of single-cell data has been accompanied by revolutionary changes to computational methods to map, quantify, and analyze the outputs of these cutting-edge technologies. Many are still unable to reap the benefits of these advancements due to the lack of bioinformatics expertise. To address this issue, we present Ursa, an automated single-cell multiomics R package containing 6 automated single-cell omics and spatial transcriptomics workflows. Ursa allows scientists to carry out post-quantification single or multiomics analyses in genomics, transcriptomics, epigenetics, proteomics, and immunomics at the single-cell level. It serves as a 1-stop analytic solution by providing users with outcomes to quality control assessments, multidimensional analyses such as dimension reduction and clustering, and extended analyses such as pseudotime trajectory and gene-set enrichment analyses. Ursa aims bridge the gap between those with bioinformatics expertise and those without by providing an easy-to-use bioinformatics package for scientists in hoping to accelerate their research potential. Ursa is freely available at https://github.com/singlecellomics/ursa.
Collapse
Affiliation(s)
- Lu Pan
- Institute of Environmental Medicine, Karolinska Institutet, Solna 171 65, Sweden
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Solna 171 65, Sweden
| | - Tian Mou
- School of Biomedical Engineering, Shenzhen University, Shenzhen, Guangdong 518060, China
| | - Yue Huang
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Solna 171 65, Sweden
| | - Weifeng Hong
- Department of Radiation Oncology, Zhongshan Hospital, Fudan University, Shanghai 200032, China
| | - Min Yu
- Department of General Surgery, Guangdong Provincial People’s Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, Guangdong 510515, China
| | - Xuexin Li
- Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Solna 171 65, Sweden
- Department of General Surgery, The Fourth Affiliated Hospital, China Medical University, Shenyang 110032, China
| |
Collapse
|
4
|
Tzaferis C, Karatzas E, Baltoumas FA, Pavlopoulos GA, Kollias G, Konstantopoulos D. SCALA: A complete solution for multimodal analysis of single-cell Next Generation Sequencing data. Comput Struct Biotechnol J 2023; 21:5382-5393. [PMID: 38022693 PMCID: PMC10651449 DOI: 10.1016/j.csbj.2023.10.032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2023] [Revised: 10/16/2023] [Accepted: 10/16/2023] [Indexed: 12/01/2023] Open
Abstract
Analysis and interpretation of high-throughput transcriptional and chromatin accessibility data at single-cell (sc) resolution are still open challenges in the biomedical field. The existence of countless bioinformatics tools, for the different analytical steps, increases the complexity of data interpretation and the difficulty to derive biological insights. In this article, we present SCALA, a bioinformatics tool for analysis and visualization of single-cell RNA sequencing (scRNA-seq) and Assay for Transposase-Accessible Chromatin using sequencing (scATAC-seq) datasets, enabling either independent or integrative analysis of the two modalities. SCALA combines standard types of analysis by integrating multiple software packages varying from quality control to the identification of distinct cell populations and cell states. Additional analysis options enable functional enrichment, cellular trajectory inference, ligand-receptor analysis, and regulatory network reconstruction. SCALA is fully parameterizable, presenting data in tabular format and producing publication-ready visualizations. The different available analysis modules can aid biomedical researchers in exploring, analyzing, and visualizing their data without any prior experience in coding. We demonstrate the functionality of SCALA through two use-cases related to TNF-driven arthritic mice, handling both scRNA-seq and scATAC-seq datasets. SCALA is developed in R, Shiny and JavaScript and is mainly available as a standalone version, while an online service of more limited capacity can be found at http://scala.pavlopouloslab.info or https://scala.fleming.gr.
Collapse
Affiliation(s)
- Christos Tzaferis
- Institute for Bioinnovation, Biomedical Sciences Research Center "Alexander Fleming", Vari, Greece
| | - Evangelos Karatzas
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center "Alexander Fleming", Vari, Greece
| | - Fotis A. Baltoumas
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center "Alexander Fleming", Vari, Greece
| | - Georgios A. Pavlopoulos
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center "Alexander Fleming", Vari, Greece
- Research Institute of New Biotechnologies and Precision Medicine, National and Kapodistrian University of Athens, Greece
| | - George Kollias
- Institute for Bioinnovation, Biomedical Sciences Research Center "Alexander Fleming", Vari, Greece
- Research Institute of New Biotechnologies and Precision Medicine, National and Kapodistrian University of Athens, Greece
- Department of Physiology, Medical School, National and Kapodistrian University of Athens, Greece
| | - Dimitris Konstantopoulos
- Institute for Bioinnovation, Biomedical Sciences Research Center "Alexander Fleming", Vari, Greece
| |
Collapse
|
5
|
Christensen E, Luo P, Turinsky A, Husić M, Mahalanabis A, Naidas A, Diaz-Mejia JJ, Brudno M, Pugh T, Ramani A, Shooshtari P. Evaluation of single-cell RNAseq labelling algorithms using cancer datasets. Brief Bioinform 2022; 24:6965910. [PMID: 36585784 PMCID: PMC9851326 DOI: 10.1093/bib/bbac561] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2022] [Revised: 09/19/2022] [Accepted: 11/01/2022] [Indexed: 01/01/2023] Open
Abstract
Single-cell RNA sequencing (scRNA-seq) clustering and labelling methods are used to determine precise cellular composition of tissue samples. Automated labelling methods rely on either unsupervised, cluster-based approaches or supervised, cell-based approaches to identify cell types. The high complexity of cancer poses a unique challenge, as tumor microenvironments are often composed of diverse cell subpopulations with unique functional effects that may lead to disease progression, metastasis and treatment resistance. Here, we assess 17 cell-based and 9 cluster-based scRNA-seq labelling algorithms using 8 cancer datasets, providing a comprehensive large-scale assessment of such methods in a cancer-specific context. Using several performance metrics, we show that cell-based methods generally achieved higher performance and were faster compared to cluster-based methods. Cluster-based methods more successfully labelled non-malignant cell types, likely because of a lack of gene signatures for relevant malignant cell subpopulations. Larger cell numbers present in some cell types in training data positively impacted prediction scores for cell-based methods. Finally, we examined which methods performed favorably when trained and tested on separate patient cohorts in scenarios similar to clinical applications, and which were able to accurately label particularly small or under-represented cell populations in the given datasets. We conclude that scPred and SVM show the best overall performances with cancer-specific data and provide further suggestions for algorithm selection. Our analysis pipeline for assessing the performance of cell type labelling algorithms is available in https://github.com/shooshtarilab/scRNAseq-Automated-Cell-Type-Labelling.
Collapse
Affiliation(s)
| | | | - Andrei Turinsky
- Centre for Computational Medicine, The Hospital for Sick Children, Toronto, ON, Canada
| | - Mia Husić
- Centre for Computational Medicine, The Hospital for Sick Children, Toronto, ON, Canada
| | - Alaina Mahalanabis
- Centre for Computational Medicine, The Hospital for Sick Children, Toronto, ON, Canada
| | - Alaine Naidas
- Children’s Health Research Institute, Lawson Research Institute, London, ON, Canada
- Department of Pathology and Lab Medicine, University of Western Ontario, London, ON, Canada
| | | | - Michael Brudno
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | - Trevor Pugh
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada
- Ontario Institute for Cancer Research, Toronto, ON, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
| | - Arun Ramani
- Centre for Computational Medicine, The Hospital for Sick Children, Toronto, ON, Canada
| | - Parisa Shooshtari
- Corresponding author: Parisa Shooshtari, Department of Pathology and Lab Medicine, University of Western Ontario, London, ON, Canada. Tel.: +1 (519) 685-8500 x55427. E-mail:
| |
Collapse
|
6
|
Basu A, Sarkar A, Bandyopadhyay S, Maulik U. In silico strategies to identify protein-protein interaction modulator in cell-to-cell transmission of SARS CoV2. Transbound Emerg Dis 2022; 69:3896-3905. [PMID: 36379049 DOI: 10.1111/tbed.14760] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Revised: 07/08/2022] [Accepted: 09/15/2022] [Indexed: 11/16/2022]
Abstract
RNA sequence data from SARS CoV2 patients helps to construct a gene network related to this disease. A detailed analysis of the human host response to SARS CoV2 with expression profiling by high-throughput sequencing has been accomplished with primary human lung epithelial cell lines. Using this data, the clustered gene annotation and gene network construction are performed with the help of the String database. Among the four clusters identified, only 1 with 44 genes could be annotated. Interestingly, this corresponded to basal cells with p = 1.37e - 05, which is relevant for respiratory tract infection. Functional enrichment analysis of genes present in the gene network has been completed using the String database and the Network Analyst tool. Among three types of cell-cell communication, only the anchoring junction between the basal cell membrane and the basal lamina in the host cell is involved in the virus transmission. In this junction point, a hemidesmosome structure plays a vital role in virus spread from one cell to basal lamina in the respiratory tract. In this protein complex structure, different integrin protein molecules of the host cell are used to promote the spread of virus infection into the extracellular matrix. So, small molecular blockers of different anchoring junction proteins, such as integrin alpha 3, integrin beta 1, can provide efficient protection against this deadly viral disease. ORF8 from SARS CoV2 virus can interact with both integrin proteins of human host. By using molecular docking technique, a ternary complex of these three proteins is modelled. Several oligopeptides are predicted as modulators for this ternary complex. In silico analysis of these modulators is very important to develop novel therapeutics for the treatment of SARS CoV2.
Collapse
Affiliation(s)
- Anamika Basu
- Department of Biochemistry, Gurudas College, Kolkata, India
| | - Anasua Sarkar
- Computer Science and Engineering Department, Jadavpur University, Kolkata, India
| | | | - Ujjwal Maulik
- Computer Science and Engineering Department, Jadavpur University, Kolkata, India
| |
Collapse
|
7
|
Christensen E, Naidas A, Chen D, Husic M, Shooshtari P. TMExplorer: A tumour microenvironment single-cell RNAseq database and search tool. PLoS One 2022; 17:e0272302. [PMID: 36084081 PMCID: PMC9462821 DOI: 10.1371/journal.pone.0272302] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2022] [Accepted: 07/17/2022] [Indexed: 02/01/2023] Open
Abstract
MOTIVATION The tumour microenvironment (TME) contains various cells including stromal fibroblasts, immune and malignant cells, and its composition can be elucidated using single-cell RNA sequencing (scRNA-seq). scRNA-seq datasets from several cancer types are available, yet we lack a comprehensive database to collect and present related TME data in an easily accessible format. RESULTS We therefore built a TME scRNA-seq database, and created the R package TMExplorer to facilitate investigation of the TME. TMExplorer provides an interface to easily access all available datasets and their metadata. The users can search for datasets using a thorough range of characteristics. The TMExplorer allows for examination of the TME using scRNA-seq in a way that is streamlined and allows for easy integration into already existing scRNA-seq analysis pipelines.
Collapse
Affiliation(s)
- Erik Christensen
- Department of Computer Science, University of Western Ontario, London, ON, Canada
- Children Health Research Institute, Victoria Research Labs, London, ON, Canada
| | - Alaine Naidas
- Children Health Research Institute, Victoria Research Labs, London, ON, Canada
- Department of Pathology and Lab Medicine, University of Western Ontario, London, ON, Canada
| | - David Chen
- Children Health Research Institute, Victoria Research Labs, London, ON, Canada
- Department of Pathology and Lab Medicine, University of Western Ontario, London, ON, Canada
| | - Mia Husic
- Genetics and Genome Biology, SickKids Research Institute, Toronto, ON, Canada
| | - Parisa Shooshtari
- Department of Computer Science, University of Western Ontario, London, ON, Canada
- Children Health Research Institute, Victoria Research Labs, London, ON, Canada
- Department of Pathology and Lab Medicine, University of Western Ontario, London, ON, Canada
- Lawson Health Research Institute, London, ON, Canada
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| |
Collapse
|
8
|
Lewsey MG, Yi C, Berkowitz O, Ayora F, Bernado M, Whelan J. scCloudMine: A cloud-based app for visualization, comparison, and exploration of single-cell transcriptomic data. PLANT COMMUNICATIONS 2022; 3:100302. [PMID: 35605202 PMCID: PMC9284053 DOI: 10.1016/j.xplc.2022.100302] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/27/2021] [Revised: 12/13/2021] [Accepted: 01/20/2022] [Indexed: 06/12/2023]
Abstract
scCloudMine is a cloud-based application for visualization, comparison, and exploration of single-cell transcriptome data. It does not require an on-site, high-power computing server, installation, or associated expertise and expense. Users upload their own or publicly available scRNA-seq datasets after pre-processing for visualization using a web browser. The data can be viewed in two color modes-Cluster, representing cell identity, and Values, showing levels of expression-and data can be queried using keywords or gene identification number(s). Using the app to compare studies, we determined that some genes frequently used as cell-type markers are in fact study specific. The apparent cell-specific expression of PHO1;H3 differed between GFP-tagging and scRNA-seq studies. Some phosphate transporter genes were induced by protoplasting, but they retained cell specificity, suggesting that cell-specific responses to stress (i.e., protoplasting) can occur. Examination of the cell specificity of hormone response genes revealed that 132 hormone-responsive genes display restricted expression and that the jasmonate response gene TIFY8 is expressed in endodermal cells, in contrast to previous reports. It also appears that JAZ repressors have cell-type-specific functions. These features identified using scCloudMine highlight the need for resources to enable biological researchers to compare their datasets of interest under a variety of parameters. scCloudMine enables researchers to form new hypotheses and perform comparative studies and allows for the easy re-use of data from this emerging technology by a wide variety of users who may not have access or funding for high-performance on-site computing and support.
Collapse
Affiliation(s)
- Mathew G Lewsey
- La Trobe Institute for Agriculture and Food, La Trobe University, AgriBio Building, Bundoora, VIC 3086, Australia; Australian Research Council Research Hub for Medicinal Agriculture, La Trobe University, AgriBio Building, Bundoora, VIC 3086, Australia
| | - Changyu Yi
- La Trobe Institute for Agriculture and Food, La Trobe University, AgriBio Building, Bundoora, VIC 3086, Australia
| | - Oliver Berkowitz
- La Trobe Institute for Agriculture and Food, La Trobe University, AgriBio Building, Bundoora, VIC 3086, Australia; Australian Research Council Research Hub for Medicinal Agriculture, La Trobe University, AgriBio Building, Bundoora, VIC 3086, Australia
| | - Felipe Ayora
- BizData, Level 9/278, Collins Street, Melbourne, VIC 3000, Australia; Research and Advanced Computing, BizData, Level 31, 2-6, Gilmer Terrace, Wellington, 6011, New Zealand.
| | - Maurice Bernado
- BizData, Level 9/278, Collins Street, Melbourne, VIC 3000, Australia
| | - James Whelan
- La Trobe Institute for Agriculture and Food, La Trobe University, AgriBio Building, Bundoora, VIC 3086, Australia; Australian Research Council Research Hub for Medicinal Agriculture, La Trobe University, AgriBio Building, Bundoora, VIC 3086, Australia; Department of Animal, Plant and Soil Sciences, School of Life Science, La Trobe University, Bundoora, VIC 3086, Australia.
| |
Collapse
|
9
|
Dworkin LA, Clausen H, Joshi HJ. Applying transcriptomics to studyglycosylation at the cell type level. iScience 2022; 25:104419. [PMID: 35663018 PMCID: PMC9156939 DOI: 10.1016/j.isci.2022.104419] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2021] [Revised: 03/30/2022] [Accepted: 05/12/2022] [Indexed: 11/22/2022] Open
Abstract
The complex multi-step process of glycosylation occurs in a single cell, yet current analytics generally cannot measure the output (the glycome) of a single cell. Here, we addressed this discordance by investigating how single cell RNA-seq data can be used to characterize the state of the glycosylation machinery and metabolic network in a single cell. The metabolic network involves 214 glycosylation and modification enzymes outlined in our previously built atlas of cellular glycosylation pathways. We studied differential mRNA regulation of enzymes at the organ and single cell level, finding that most of the general protein and lipid oligosaccharide scaffolds are produced by enzymes exhibiting limited transcriptional regulation among cells. We predict key enzymes within different glycosylation pathways to be highly transcriptionally regulated as regulatable hotspots of the cellular glycome. We designed the Glycopacity software that enables investigators to extract and interpret glycosylation information from transcriptome data and define hotspots of regulation. RNA-seq can provide information on the glycosylation metabolic network state It is possible to readout glycosylation capacity from single cell RNA-seq data Genes regulating the biosynthesis of common glycan scaffolds show little regulation Key enzymes in the glycosylation network are predicted to be regulatable hotspots
Collapse
Affiliation(s)
- Leo Alexander Dworkin
- Copenhagen Center for Glycomics, Department of Cellular and Molecular Medicine, Faculty of Health Sciences, University of Copenhagen, Blegdamsvej 3, 2200 Copenhagen N, Denmark
| | - Henrik Clausen
- Copenhagen Center for Glycomics, Department of Cellular and Molecular Medicine, Faculty of Health Sciences, University of Copenhagen, Blegdamsvej 3, 2200 Copenhagen N, Denmark
| | - Hiren Jitendra Joshi
- Copenhagen Center for Glycomics, Department of Cellular and Molecular Medicine, Faculty of Health Sciences, University of Copenhagen, Blegdamsvej 3, 2200 Copenhagen N, Denmark
- Corresponding author
| |
Collapse
|
10
|
Jiang A, Lehnert K, You L, Snell RG. ICARUS, an interactive web server for single cell RNA-seq analysis. Nucleic Acids Res 2022; 50:W427-W433. [PMID: 35536286 PMCID: PMC9252722 DOI: 10.1093/nar/gkac322] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2022] [Revised: 04/14/2022] [Accepted: 04/21/2022] [Indexed: 01/01/2023] Open
Abstract
Here we present ICARUS, a web server to enable users without experience in R to undertake single cell RNA-seq analysis. The focal point of ICARUS is its intuitive tutorial-style user interface, designed to guide logical navigation through the multitude of pre-processing, analysis and visualization steps. ICARUS is easily accessible through a dedicated web server (https://launch.icarus-scrnaseq.cloud.edu.au/) and avoids installation of software on the user's computer. Notable features include the facility to apply quality control thresholds and adjust dimensionality reduction and cell clustering parameters. Data is visualized through 2D/3D UMAP and t-SNE plots and may be curated to remove potential confounders such as cell cycle heterogeneity. ICARUS offers flexible differential expression analysis with user-defined cell groups and gene set enrichment analysis to identify likely affected biological pathways. Eleven organisms including human, dog, mouse, rat, zebrafish, fruit fly, nematode, yeast, cattle, chicken and pig are currently supported. Visualization of multimodal data including those generated by CITE-seq and the 10X Genomics Multiome kit is included. ICARUS incorporates a function to save the current state of analysis avoiding computationally intensive steps during repeat analysis. The complete analysis of a typical single cell RNA-seq dataset by inexperienced users may be achieved in 1-2 h.
Collapse
Affiliation(s)
- Andrew Jiang
- Applied Translational Genetics Group, School of Biological Sciences, The University of Auckland, Auckland, New Zealand
| | - Klaus Lehnert
- Applied Translational Genetics Group, School of Biological Sciences, The University of Auckland, Auckland, New Zealand
| | - Linya You
- Department of Human Anatomy & Histoembryology, School of Basic Medical Sciences, Fudan University, Shanghai, China
| | - Russell G Snell
- Applied Translational Genetics Group, School of Biological Sciences, The University of Auckland, Auckland, New Zealand
| |
Collapse
|
11
|
Domanskyi S, Hakansson A, Meng M, Pham BK, Graff Zivin JS, Piermarocchi C, Paternostro G, Ferrara N. Naturally occurring combinations of receptors from single cell transcriptomics in endothelial cells. Sci Rep 2022; 12:5807. [PMID: 35388065 PMCID: PMC8987085 DOI: 10.1038/s41598-022-09616-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2021] [Accepted: 03/25/2022] [Indexed: 11/18/2022] Open
Abstract
VEGF inhibitor drugs are part of standard care in oncology and ophthalmology, but not all patients respond to them. Combinations of drugs are likely to be needed for more effective therapies of angiogenesis-related diseases. In this paper we describe naturally occurring combinations of receptors in endothelial cells that might help to understand how cells communicate and to identify targets for drug combinations. We also develop and share a new software tool called DECNEO to identify them. Single-cell gene expression data are used to identify a set of co-expressed endothelial cell receptors, conserved among species (mice and humans) and enriched, within a network, of connections to up-regulated genes. This set includes several receptors previously shown to play a role in angiogenesis. Multiple statistical tests from large datasets, including an independent validation set, support the reproducibility, evolutionary conservation and role in angiogenesis of these naturally occurring combinations of receptors. We also show tissue-specific combinations and, in the case of choroid endothelial cells, consistency with both well-established and recent experimental findings, presented in a separate paper. The results and methods presented here advance the understanding of signaling to endothelial cells. The methods are generally applicable to the decoding of intercellular combinations of signals.
Collapse
Affiliation(s)
- Sergii Domanskyi
- Department of Physics and Astronomy, Michigan State University, East Lansing, MI, 48824, USA
| | - Alex Hakansson
- Sanford Burnham Prebys Medical Discovery Institute, La Jolla, CA, 92037, USA
| | - Michelle Meng
- Sanford Burnham Prebys Medical Discovery Institute, La Jolla, CA, 92037, USA
| | - Benjamin K Pham
- Sanford Burnham Prebys Medical Discovery Institute, La Jolla, CA, 92037, USA
| | - Joshua S Graff Zivin
- School of Global Policy and Strategy and Department of Economics, University of California, San Diego, 9500 Gilman Drive, MC 0519, La Jolla, CA, 92093, USA
| | - Carlo Piermarocchi
- Department of Physics and Astronomy, Michigan State University, East Lansing, MI, 48824, USA.
| | | | - Napoleone Ferrara
- Moores Cancer Center, University of California San Diego, La Jolla, CA, 92093, USA.
| |
Collapse
|
12
|
Long H, Reeves R, Simon MM. Mouse genomic and cellular annotations. Mamm Genome 2022; 33:19-30. [PMID: 35124726 PMCID: PMC8913471 DOI: 10.1007/s00335-021-09936-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2021] [Accepted: 11/22/2021] [Indexed: 11/28/2022]
Abstract
AbstractMice have emerged as one of the most popular and valuable model organisms in the research of human biology. This is due to their genetic and physiological similarity to humans, short generation times, availability of genetically homologous inbred strains, and relatively easy laboratory maintenance. Therefore, following the release of the initial human reference genome, the generation of the mouse reference genome was prioritised and represented an important scientific resource for the mouse genetics community. In 2002, the Mouse Genome Sequencing Consortium published an initial draft of the mouse reference genome which contained ~ 96% of the euchromatic genome of female C57BL/6 J mice. Almost two decades on from the publication of the initial draft, sequencing efforts have continued to increase the completeness and accuracy of the C57BL/6 J reference genome alongside advances in genome annotation. Additionally new sequencing technologies have provided a wealth of data that has added to the repertoire of annotations associated with traditional genomic annotations. Including but not limited to advances in regulatory elements, the 3D genome and individual cellular states. In this review we focus on the reference genome C57BL/6 J and summarise the different aspects of genomic and cellular annotations, as well as their relevance to mouse genetic research. We denote a genomic annotation as a functional unit of the genome. Cellular annotations are annotations of cell type or state, defined by the transcriptomic expression profile of a cell. Due to the wide-ranging number and diversity of annotations describing the mouse genome, we focus on gene, repeat and regulatory element annotation as well as two relatively new technologies; 3D genome architecture and single-cell sequencing outlining their utility in genetic research and their current challenges.
Collapse
Affiliation(s)
- Helen Long
- MRC Harwell Institute, Mammalian Genetics Unit, Harwell Campus, Oxfordshire, OX11 0RD, UK
- Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - Richard Reeves
- MRC Harwell Institute, Mammalian Genetics Unit, Harwell Campus, Oxfordshire, OX11 0RD, UK
| | - Michelle M Simon
- MRC Harwell Institute, Mammalian Genetics Unit, Harwell Campus, Oxfordshire, OX11 0RD, UK.
| |
Collapse
|
13
|
Li J, Sheng Q, Shyr Y, Liu Q. scMRMA: single cell multiresolution marker-based annotation. Nucleic Acids Res 2022; 50:e7. [PMID: 34648021 PMCID: PMC8789072 DOI: 10.1093/nar/gkab931] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2021] [Revised: 09/10/2021] [Accepted: 09/28/2021] [Indexed: 01/22/2023] Open
Abstract
Single-cell RNA sequencing has become a powerful tool for identifying and characterizing cellular heterogeneity. One essential step to understanding cellular heterogeneity is determining cell identities. The widely used strategy predicts identities by projecting cells or cell clusters unidirectionally against a reference to find the best match. Here, we develop a bidirectional method, scMRMA, where a hierarchical reference guides iterative clustering and deep annotation with enhanced resolutions. Taking full advantage of the reference, scMRMA greatly improves the annotation accuracy. scMRMA achieved better performance than existing methods in four benchmark datasets and successfully revealed the expansion of CD8 T cell populations in squamous cell carcinoma after anti-PD-1 treatment.
Collapse
Affiliation(s)
- Jia Li
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN 37203, USA
- Center for Quantitative Sciences, Vanderbilt University Medical Center, Nashville, TN 37232, USA
| | - Quanhu Sheng
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN 37203, USA
- Center for Quantitative Sciences, Vanderbilt University Medical Center, Nashville, TN 37232, USA
| | - Yu Shyr
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN 37203, USA
- Center for Quantitative Sciences, Vanderbilt University Medical Center, Nashville, TN 37232, USA
| | - Qi Liu
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN 37203, USA
- Center for Quantitative Sciences, Vanderbilt University Medical Center, Nashville, TN 37232, USA
| |
Collapse
|
14
|
Pereira WJ, Almeida FM, Conde D, Balmant KM, Triozzi PM, Schmidt HW, Dervinis C, Pappas GJ, Kirst M. Asc-Seurat: analytical single-cell Seurat-based web application. BMC Bioinformatics 2021; 22:556. [PMID: 34794383 PMCID: PMC8600690 DOI: 10.1186/s12859-021-04472-2] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2021] [Accepted: 11/10/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Single-cell RNA sequencing (scRNA-seq) has revolutionized the study of transcriptomes, arising as a powerful tool for discovering and characterizing cell types and their developmental trajectories. However, scRNA-seq analysis is complex, requiring a continuous, iterative process to refine the data and uncover relevant biological information. A diversity of tools has been developed to address the multiple aspects of scRNA-seq data analysis. However, an easy-to-use web application capable of conducting all critical steps of scRNA-seq data analysis is still lacking. We present Asc-Seurat, a feature-rich workbench, providing an user-friendly and easy-to-install web application encapsulating tools for an all-encompassing and fluid scRNA-seq data analysis. Asc-Seurat implements functions from the Seurat package for quality control, clustering, and genes differential expression. In addition, Asc-Seurat provides a pseudotime module containing dozens of models for the trajectory inference and a functional annotation module that allows recovering gene annotation and detecting gene ontology enriched terms. We showcase Asc-Seurat's capabilities by analyzing a peripheral blood mononuclear cell dataset. CONCLUSIONS Asc-Seurat is a comprehensive workbench providing an accessible graphical interface for scRNA-seq analysis by biologists. Asc-Seurat significantly reduces the time and effort required to analyze and interpret the information in scRNA-seq datasets.
Collapse
Affiliation(s)
- W J Pereira
- School of Forest, Fisheries, and Geomatics Sciences, University of Florida, Gainesville, FL, 32611, USA.
| | - F M Almeida
- Department of Cell Biology, Institute of Biological Sciences, University of Brasília, Brasília, DF, 70910-900, Brazil
| | - D Conde
- School of Forest, Fisheries, and Geomatics Sciences, University of Florida, Gainesville, FL, 32611, USA
| | - K M Balmant
- School of Forest, Fisheries, and Geomatics Sciences, University of Florida, Gainesville, FL, 32611, USA
| | - P M Triozzi
- School of Forest, Fisheries, and Geomatics Sciences, University of Florida, Gainesville, FL, 32611, USA
| | - H W Schmidt
- School of Forest, Fisheries, and Geomatics Sciences, University of Florida, Gainesville, FL, 32611, USA
| | - C Dervinis
- School of Forest, Fisheries, and Geomatics Sciences, University of Florida, Gainesville, FL, 32611, USA
| | - G J Pappas
- Department of Cell Biology, Institute of Biological Sciences, University of Brasília, Brasília, DF, 70910-900, Brazil
| | - M Kirst
- School of Forest, Fisheries, and Geomatics Sciences, University of Florida, Gainesville, FL, 32611, USA
- Genetics Institute, University of Florida, Gainesville, FL, 32611, USA
| |
Collapse
|
15
|
Karimi MR, Karimi AH, Abolmaali S, Sadeghi M, Schmitz U. Prospects and challenges of cancer systems medicine: from genes to disease networks. Brief Bioinform 2021; 23:6361045. [PMID: 34471925 PMCID: PMC8769701 DOI: 10.1093/bib/bbab343] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2021] [Revised: 08/02/2021] [Accepted: 08/03/2021] [Indexed: 12/20/2022] Open
Abstract
It is becoming evident that holistic perspectives toward cancer are crucial in deciphering the overwhelming complexity of tumors. Single-layer analysis of genome-wide data has greatly contributed to our understanding of cellular systems and their perturbations. However, fundamental gaps in our knowledge persist and hamper the design of effective interventions. It is becoming more apparent than ever, that cancer should not only be viewed as a disease of the genome but as a disease of the cellular system. Integrative multilayer approaches are emerging as vigorous assets in our endeavors to achieve systemic views on cancer biology. Herein, we provide a comprehensive review of the approaches, methods and technologies that can serve to achieve systemic perspectives of cancer. We start with genome-wide single-layer approaches of omics analyses of cellular systems and move on to multilayer integrative approaches in which in-depth descriptions of proteogenomics and network-based data analysis are provided. Proteogenomics is a remarkable example of how the integration of multiple levels of information can reduce our blind spots and increase the accuracy and reliability of our interpretations and network-based data analysis is a major approach for data interpretation and a robust scaffold for data integration and modeling. Overall, this review aims to increase cross-field awareness of the approaches and challenges regarding the omics-based study of cancer and to facilitate the necessary shift toward holistic approaches.
Collapse
Affiliation(s)
| | | | | | - Mehdi Sadeghi
- Department of Cell & Molecular Biology, Semnan University, Semnan, Iran
| | - Ulf Schmitz
- Department of Molecular & Cell Biology, James Cook University, Townsville, QLD 4811, Australia
| |
Collapse
|
16
|
Liu Y, Zhou X, Wang X. Targeting the tumor microenvironment in B-cell lymphoma: challenges and opportunities. J Hematol Oncol 2021; 14:125. [PMID: 34404434 PMCID: PMC8369706 DOI: 10.1186/s13045-021-01134-x] [Citation(s) in RCA: 50] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2021] [Accepted: 08/03/2021] [Indexed: 02/08/2023] Open
Abstract
B-cell lymphoma is a group of hematological malignancies with high clinical and biological heterogeneity. The pathogenesis of B-cell lymphoma involves a complex interaction between tumor cells and the tumor microenvironment (TME), which is composed of stromal cells and extracellular matrix. Although the roles of the TME have not been fully elucidated, accumulating evidence implies that TME is closely relevant to the origination, invasion and metastasis of B-cell lymphoma. Explorations of the TME provide distinctive insights for cancer therapy. Here, we epitomize the recent advances of TME in B-cell lymphoma and discuss its function in tumor progression and immune escape. In addition, the potential clinical value of targeting TME in B-cell lymphoma is highlighted, which is expected to pave the way for novel therapeutic strategies.
Collapse
Affiliation(s)
- Yingyue Liu
- Department of Hematology, Shandong Provincial Hospital, Cheeloo College of Medicine, Shandong University, No. 324, Jingwu Road, Jinan, 250021, Shandong, China
| | - Xiangxiang Zhou
- Department of Hematology, Shandong Provincial Hospital, Cheeloo College of Medicine, Shandong University, No. 324, Jingwu Road, Jinan, 250021, Shandong, China.
- Department of Hematology, Shandong Provincial Hospital Affiliated to Shandong First Medical University, Jinan, 250021, Shandong, China.
- School of Medicine, Shandong University, Jinan, 250012, Shandong, China.
- Shandong Provincial Engineering Research Center of Lymphoma, Jinan, 250021, Shandong, China.
- Branch of National Clinical Research Center for Hematologic Diseases, Jinan, 250021, Shandong, China.
- National Clinical Research Center for Hematologic Diseases, The First Affiliated Hospital of Soochow University, Suzhou, 251006, China.
| | - Xin Wang
- Department of Hematology, Shandong Provincial Hospital, Cheeloo College of Medicine, Shandong University, No. 324, Jingwu Road, Jinan, 250021, Shandong, China.
- Department of Hematology, Shandong Provincial Hospital Affiliated to Shandong First Medical University, Jinan, 250021, Shandong, China.
- School of Medicine, Shandong University, Jinan, 250012, Shandong, China.
- Shandong Provincial Engineering Research Center of Lymphoma, Jinan, 250021, Shandong, China.
- Branch of National Clinical Research Center for Hematologic Diseases, Jinan, 250021, Shandong, China.
- National Clinical Research Center for Hematologic Diseases, The First Affiliated Hospital of Soochow University, Suzhou, 251006, China.
| |
Collapse
|
17
|
Kwon H, Mohammed M, Franzén O, Ankarklev J, Smith RC. Single-cell analysis of mosquito hemocytes identifies signatures of immune cell subtypes and cell differentiation. eLife 2021; 10:66192. [PMID: 34318744 PMCID: PMC8376254 DOI: 10.7554/elife.66192] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2021] [Accepted: 07/27/2021] [Indexed: 12/16/2022] Open
Abstract
Mosquito immune cells, known as hemocytes, are integral to cellular and humoral responses that limit pathogen survival and mediate immune priming. However, without reliable cell markers and genetic tools, studies of mosquito immune cells have been limited to morphological observations, leaving several aspects of their biology uncharacterized. Here, we use single-cell RNA sequencing (scRNA-seq) to characterize mosquito immune cells, demonstrating an increased complexity to previously defined prohemocyte, oenocytoid, and granulocyte subtypes. Through functional assays relying on phagocytosis, phagocyte depletion, and RNA-FISH experiments, we define markers to accurately distinguish immune cell subtypes and provide evidence for immune cell maturation and differentiation. In addition, gene-silencing experiments demonstrate the importance of lozenge in defining the mosquito oenocytoid cell fate. Together, our scRNA-seq analysis provides an important foundation for future studies of mosquito immune cell biology and a valuable resource for comparative invertebrate immunology.
Collapse
Affiliation(s)
- Hyeogsun Kwon
- Department of Entomology, Iowa State University, Ames, United States
| | - Mubasher Mohammed
- Department of Molecular Biosciences, The Wenner-Gren Institute, Stockholm University, Stockholm, Sweden
| | - Oscar Franzén
- Integrated Cardio Metabolic Centre, Department of Medicine, Karolinska Institutet, Novum, Huddinge, Sweden
| | - Johan Ankarklev
- Department of Molecular Biosciences, The Wenner-Gren Institute, Stockholm University, Stockholm, Sweden.,Microbial Single Cell Genomics facility, SciLifeLab, Biomedical Center (BMC) Uppsala University, Uppsala, Sweden
| | - Ryan C Smith
- Department of Entomology, Iowa State University, Ames, United States
| |
Collapse
|
18
|
Osorio D, Kuijjer ML, Cai JJ. rPanglaoDB: an R package to download and merge labeled single-cell RNA-seq data from the PanglaoDB database. Bioinformatics 2021; 38:580-582. [PMID: 34320637 PMCID: PMC8723139 DOI: 10.1093/bioinformatics/btab549] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2021] [Revised: 07/14/2021] [Accepted: 07/26/2021] [Indexed: 02/03/2023] Open
Abstract
MOTIVATION Characterizing cells with rare molecular phenotypes is one of the promises of high throughput single-cell RNA sequencing (scRNA-seq) techniques. However, collecting enough cells with the desired molecular phenotype in a single experiment is challenging, requiring several samples preprocessing steps to filter and collect the desired cells experimentally before sequencing. Data integration of multiple public single-cell experiments stands as a solution for this problem, allowing the collection of enough cells exhibiting the desired molecular signatures. By increasing the sample size of the desired cell type, this approach enables a robust cell type transcriptome characterization. RESULTS Here, we introduce rPanglaoDB, an R package to download and merge the uniformly processed and annotated scRNA-seq data provided by the PanglaoDB database. To show the potential of rPanglaoDB for collecting rare cell types by integrating multiple public datasets, we present a biological application collecting and characterizing a set of 157 fibrocytes. Fibrocytes are a rare monocyte-derived cell type, that exhibits both the inflammatory features of macrophages and the tissue remodeling properties of fibroblasts. This constitutes the first fibrocytes' unbiased transcriptome profile report. We compared the transcriptomic profile of the fibrocytes against the fibroblasts collected from the same tissue samples and confirm their associated relationship with healing processes in tissue damage and infection through the activation of the prostaglandin biosynthesis and regulation pathway. AVAILABILITY AND IMPLEMENTATION rPanglaoDB is implemented as an R package available through the CRAN repositories https://CRAN.R-project.org/package=rPanglaoDB.
Collapse
Affiliation(s)
| | - Marieke L Kuijjer
- Centre for Molecular Medicine Norway (NCMM), University of Oslo, 0349 Oslo, Norway,Department of Pathology, Leiden University Medical Center, Leiden, The Netherlands
| | - James J Cai
- Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, TX 77843, USA,Department of Electrical & Computer Engineering, Texas A&M University, College Station, TX 77843, USA,Interdisciplinary Program of Genetics, Texas A&M University, College Station, TX 77843, USA,Center for Statistical Bioinformatics, Texas A&M University, College Station, TX 77843, USA
| |
Collapse
|
19
|
Gautam V, Mittal A, Kalra S, Mohanty SK, Gupta K, Rani K, Naidu S, Mishra T, Sengupta D, Ahuja G. EcTracker: Tracking and elucidating ectopic expression leveraging large-scale scRNA-seq studies. Brief Bioinform 2021; 22:6309926. [PMID: 34184038 DOI: 10.1093/bib/bbab237] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2021] [Revised: 05/31/2021] [Accepted: 06/01/2021] [Indexed: 11/13/2022] Open
Abstract
Dramatic genomic alterations, either inducible or in a pathological state, dismantle the core regulatory networks, leading to the activation of normally silent genes. Despite possessing immense therapeutic potential, accurate detection of these transcripts is an ever-challenging task, as it requires prior knowledge of the physiological gene expression levels. Here, we introduce EcTracker, an R-/Shiny-based single-cell data analysis web server that bestows a plethora of functionalities that collectively enable the quantitative and qualitative assessments of bona fide cell types or tissue-specific transcripts and, conversely, the ectopically expressed genes in the single-cell ribonucleic acid sequencing datasets. Moreover, it also allows regulon analysis to identify the key transcriptional factors regulating the user-selected gene signatures. To demonstrate the EcTracker functionality, we reanalyzed the CRISPR interference (CRISPRi) dataset of the human embryonic stem cells differentiated into endoderm lineage and identified the prominent enrichment of a specific gene signature in the SMAD2 knockout cells whose identity was ambiguous in the original study. The key distinguishing features of EcTracker lie within its processing speed, availability of multiple add-on modules, interactive graphical user interface and comprehensiveness. In summary, EcTracker provides an easy-to-perform, integrative and end-to-end single-cell data analysis platform that allows decoding of cellular identities, identification of ectopically expressed genes and their regulatory networks, and therefore, collectively imparts a novel dimension for analyzing single-cell datasets.
Collapse
Affiliation(s)
- Vishakha Gautam
- Indraprastha Institute of Information Technology, Delhi, India
| | - Aayushi Mittal
- Indraprastha Institute of Information Technology, Delhi, India
| | - Siddhant Kalra
- Indraprastha Institute of Information Technology, Delhi, India
| | | | - Krishan Gupta
- Indraprastha Institute of Information Technology, Delhi, India
| | - Komal Rani
- Indraprastha Institute of Information Technology, Delhi, India
| | - Srivatsava Naidu
- Department of Biomedical Engineering, Indian Institute of Technology Ropar, India
| | | | - Debarka Sengupta
- Department of Computational Biology and Department of Computer Science at the Indraprastha Institute of Information Technology, India
| | - Gaurav Ahuja
- Department of Computational Biology at the Indraprastha Institute of Information Technology-Delhi (IIIT-Delhi), India
| |
Collapse
|
20
|
Clarke ZA, Andrews TS, Atif J, Pouyabahar D, Innes BT, MacParland SA, Bader GD. Tutorial: guidelines for annotating single-cell transcriptomic maps using automated and manual methods. Nat Protoc 2021; 16:2749-2764. [PMID: 34031612 DOI: 10.1038/s41596-021-00534-0] [Citation(s) in RCA: 77] [Impact Index Per Article: 25.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2020] [Accepted: 03/12/2021] [Indexed: 11/09/2022]
Abstract
Single-cell transcriptomics can profile thousands of cells in a single experiment and identify novel cell types, states and dynamics in a wide variety of tissues and organisms. Standard experimental protocols and analysis workflows have been developed to create single-cell transcriptomic maps from tissues. This tutorial focuses on how to interpret these data to identify cell types, states and other biologically relevant patterns with the objective of creating an annotated map of cells. We recommend a three-step workflow including automatic cell annotation (wherever possible), manual cell annotation and verification. Frequently encountered challenges are discussed, as well as strategies to address them. Guiding principles and specific recommendations for software tools and resources that can be used for each step are covered, and an R notebook is included to help run the recommended workflow. Basic familiarity with computer software is assumed, and basic knowledge of programming (e.g., in the R language) is recommended.
Collapse
Affiliation(s)
- Zoe A Clarke
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada.,The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
| | - Tallulah S Andrews
- The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada.,Ajmera Transplant Centre, Toronto General Hospital Research Institute, Toronto, Ontario, Canada.,Department of Immunology, University of Toronto, Toronto, Ontario, Canada
| | - Jawairia Atif
- Ajmera Transplant Centre, Toronto General Hospital Research Institute, Toronto, Ontario, Canada.,Department of Immunology, University of Toronto, Toronto, Ontario, Canada
| | - Delaram Pouyabahar
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada.,The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
| | - Brendan T Innes
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada.,The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
| | - Sonya A MacParland
- Ajmera Transplant Centre, Toronto General Hospital Research Institute, Toronto, Ontario, Canada. .,Department of Immunology, University of Toronto, Toronto, Ontario, Canada. .,Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, Ontario, Canada.
| | - Gary D Bader
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada. .,The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada. .,Department of Computer Science, University of Toronto, Toronto, Ontario, Canada. .,Lunenfeld-Tanenbaum Research Institute, Toronto, Ontario, Canada.
| |
Collapse
|
21
|
Marín-Sedeño E, de Morentin XM, Pérez-Pomares JM, Gómez-Cabrero D, Ruiz-Villalba A. Understanding the Adult Mammalian Heart at Single-Cell RNA-Seq Resolution. Front Cell Dev Biol 2021; 9:645276. [PMID: 34055776 PMCID: PMC8149764 DOI: 10.3389/fcell.2021.645276] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2020] [Accepted: 04/09/2021] [Indexed: 12/24/2022] Open
Abstract
During the last decade, extensive efforts have been made to comprehend cardiac cell genetic and functional diversity. Such knowledge allows for the definition of the cardiac cellular interactome as a reasonable strategy to increase our understanding of the normal and pathologic heart. Previous experimental approaches including cell lineage tracing, flow cytometry, and bulk RNA-Seq have often tackled the analysis of cardiac cell diversity as based on the assumption that cell types can be identified by the expression of a single gene. More recently, however, the emergence of single-cell RNA-Seq technology has led us to explore the diversity of individual cells, enabling the cardiovascular research community to redefine cardiac cell subpopulations and identify relevant ones, and even novel cell types, through their cell-specific transcriptomic signatures in an unbiased manner. These findings are changing our understanding of cell composition and in consequence the identification of potential therapeutic targets for different cardiac diseases. In this review, we provide an overview of the continuously changing cardiac cellular landscape, traveling from the pre-single-cell RNA-Seq times to the single cell-RNA-Seq revolution, and discuss the utilities and limitations of this technology.
Collapse
Affiliation(s)
- Ernesto Marín-Sedeño
- Department of Animal Biology, Faculty of Sciences, Instituto Malagueño de Biomedicina, University of Málaga, Málaga, Spain
- BIONAND, Centro Andaluz de Nanomedicina y Biotecnología, Junta de Andalucía, Universidad de Málaga, Málaga, Spain
| | - Xabier Martínez de Morentin
- Traslational Bioinformatics Unit, Navarrabiomed, Complejo Hospitalario de Navarra, Instituto de Investigación Sanitaria de Navarra (IdiSNA), Universidad Pública de Navarra, Pamplona, Spain
| | - Jose M. Pérez-Pomares
- Department of Animal Biology, Faculty of Sciences, Instituto Malagueño de Biomedicina, University of Málaga, Málaga, Spain
- BIONAND, Centro Andaluz de Nanomedicina y Biotecnología, Junta de Andalucía, Universidad de Málaga, Málaga, Spain
| | - David Gómez-Cabrero
- Traslational Bioinformatics Unit, Navarrabiomed, Complejo Hospitalario de Navarra, Instituto de Investigación Sanitaria de Navarra (IdiSNA), Universidad Pública de Navarra, Pamplona, Spain
- Centre of Host-Microbiome Interactions, Faculty of Dentistry, Oral & Craniofacial Sciences, King’s College London, London, United Kingdom
- Biological and Environmental Sciences and Engineering Division, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Adrián Ruiz-Villalba
- Department of Animal Biology, Faculty of Sciences, Instituto Malagueño de Biomedicina, University of Málaga, Málaga, Spain
- BIONAND, Centro Andaluz de Nanomedicina y Biotecnología, Junta de Andalucía, Universidad de Málaga, Málaga, Spain
| |
Collapse
|
22
|
Hoek A, Maibach K, Özmen E, Vazquez-Armendariz AI, Mengel JP, Hain T, Herold S, Goesmann A. WASP: a versatile, web-accessible single cell RNA-Seq processing platform. BMC Genomics 2021; 22:195. [PMID: 33736596 PMCID: PMC7977290 DOI: 10.1186/s12864-021-07469-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2020] [Accepted: 02/23/2021] [Indexed: 11/16/2022] Open
Abstract
Background The technology of single cell RNA sequencing (scRNA-seq) has gained massively in popularity as it allows unprecedented insights into cellular heterogeneity as well as identification and characterization of (sub-)cellular populations. Furthermore, scRNA-seq is almost ubiquitously applicable in medical and biological research. However, these new opportunities are accompanied by additional challenges for researchers regarding data analysis, as advanced technical expertise is required in using bioinformatic software. Results Here we present WASP, a software for the processing of Drop-Seq-based scRNA-Seq data. Our software facilitates the initial processing of raw reads generated with the ddSEQ or 10x protocol and generates demultiplexed gene expression matrices including quality metrics. The processing pipeline is realized as a Snakemake workflow, while an R Shiny application is provided for interactive result visualization. WASP supports comprehensive analysis of gene expression matrices, including detection of differentially expressed genes, clustering of cellular populations and interactive graphical visualization of the results. The R Shiny application can be used with gene expression matrices generated by the WASP pipeline, as well as with externally provided data from other sources. Conclusions With WASP we provide an intuitive and easy-to-use tool to process and explore scRNA-seq data. To the best of our knowledge, it is currently the only freely available software package that combines pre- and post-processing of ddSEQ- and 10x-based data. Due to its modular design, it is possible to use any gene expression matrix with WASP’s post-processing R Shiny application. To simplify usage, WASP is provided as a Docker container. Alternatively, pre-processing can be accomplished via Conda, and a standalone version for Windows is available for post-processing, requiring only a web browser. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-021-07469-6.
Collapse
Affiliation(s)
- Andreas Hoek
- Bioinformatics and Systems Biology, Justus Liebig University Giessen, 35392, Giessen, Germany.
| | - Katharina Maibach
- Bioinformatics and Systems Biology, Justus Liebig University Giessen, 35392, Giessen, Germany.,Algorithmic Bioinformatics, Justus Liebig University Giessen, 35392, Giessen, Germany
| | - Ebru Özmen
- Bioinformatics and Systems Biology, Justus Liebig University Giessen, 35392, Giessen, Germany
| | - Ana Ivonne Vazquez-Armendariz
- Department of Internal Medicine II, and Cardio-Pulmonary Institute (CPI), Universities of Giessen and Marburg Lung Center (UGMLC), Member of the German Center for Lung Research (DZL) and The Institute of Lung Health (ILH), 35392, Giessen, Germany
| | - Jan Philipp Mengel
- Institute of Medical Microbiology, Justus Liebig University Giessen, 35392, Giessen, Germany
| | - Torsten Hain
- Institute of Medical Microbiology, Justus Liebig University Giessen, 35392, Giessen, Germany.,Center for Infection Research (DZIF), Justus-Liebig-University Giessen, Partner Site Giessen-Marburg-Langen, 35392, Giessen, Germany
| | - Susanne Herold
- Department of Internal Medicine II, and Cardio-Pulmonary Institute (CPI), Universities of Giessen and Marburg Lung Center (UGMLC), Member of the German Center for Lung Research (DZL) and The Institute of Lung Health (ILH), 35392, Giessen, Germany
| | - Alexander Goesmann
- Bioinformatics and Systems Biology, Justus Liebig University Giessen, 35392, Giessen, Germany.,Center for Infection Research (DZIF), Justus-Liebig-University Giessen, Partner Site Giessen-Marburg-Langen, 35392, Giessen, Germany
| |
Collapse
|
23
|
Bernstein MN, Ni Z, Collins M, Burkard ME, Kendziorski C, Stewart R. CHARTS: a web application for characterizing and comparing tumor subpopulations in publicly available single-cell RNA-seq data sets. BMC Bioinformatics 2021; 22:83. [PMID: 33622236 PMCID: PMC7903756 DOI: 10.1186/s12859-021-04021-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2020] [Accepted: 02/11/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Single-cell RNA-seq (scRNA-seq) enables the profiling of genome-wide gene expression at the single-cell level and in so doing facilitates insight into and information about cellular heterogeneity within a tissue. This is especially important in cancer, where tumor and tumor microenvironment heterogeneity directly impact development, maintenance, and progression of disease. While publicly available scRNA-seq cancer data sets offer unprecedented opportunity to better understand the mechanisms underlying tumor progression, metastasis, drug resistance, and immune evasion, much of the available information has been underutilized, in part, due to the lack of tools available for aggregating and analysing these data. RESULTS We present CHARacterizing Tumor Subpopulations (CHARTS), a web application for exploring publicly available scRNA-seq cancer data sets in the NCBI's Gene Expression Omnibus. More specifically, CHARTS enables the exploration of individual gene expression, cell type, malignancy-status, differentially expressed genes, and gene set enrichment results in subpopulations of cells across tumors and data sets. Along with the web application, we also make available the backend computational pipeline that was used to produce the analyses that are available for exploration in the web application. CONCLUSION CHARTS is an easy to use, comprehensive platform for exploring single-cell subpopulations within tumors across the ever-growing collection of public scRNA-seq cancer data sets. CHARTS is freely available at charts.morgridge.org.
Collapse
Affiliation(s)
| | - Zijian Ni
- Department of Statistics, University of Wisconsin - Madison, Madison, WI, 53706, USA
| | | | - Mark E Burkard
- Department of Medicine, Hematology/Oncology, University of Wisconsin - Madison, Madison, WI, 53705, USA
- University of Wisconsin Carbone Cancer Center, Madison, WI, 53705, USA
| | - Christina Kendziorski
- Department of Biostatistics and Medical Informatics, University of Wisconsin - Madison, Madison, WI, 53792, USA.
| | - Ron Stewart
- Morgridge Institute for Research, Madison, WI, 53715, USA.
| |
Collapse
|
24
|
Luo G, Gao Q, Zhang S, Yan B. Probing infectious disease by single-cell RNA sequencing: Progresses and perspectives. Comput Struct Biotechnol J 2020; 18:2962-2971. [PMID: 33106757 PMCID: PMC7577221 DOI: 10.1016/j.csbj.2020.10.016] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2020] [Revised: 10/12/2020] [Accepted: 10/13/2020] [Indexed: 02/07/2023] Open
Abstract
The increasing application of single-cell RNA sequencing (scRNA-seq) technology in life science and biomedical research has significantly increased our understanding of the cellular heterogeneities in immunology, oncology and developmental biology. This review will summarize the development of various scRNA-seq technologies; primarily discussing the application of scRNA-seq on infectious diseases, and exploring the current development, challenges, and potential applications of scRNA-seq technology in the future.
Collapse
Key Words
- 3C, Chromosome Conformation Capture
- ACE2, Angiotensin-Converting Enzyme 2
- ARDS, acute respiratory distress syndrome
- ATAC-seq, Assay for Transposase-Accessible Chromatin using sequencing
- BCR, B cell receptor
- CEL-seq, Cell Expression by Linear amplification and Sequencing
- CLU, clusterin
- COVID-19, corona virus disease 2019
- CRISPR, Clustered Regularly Interspaced Short Palindromic Repeats
- CytoSeq, gene expression cytometry
- DENV, dengue virus
- FACS, fluorescence-activated cell sorting
- GNLY, granulysin
- GO analysis, Gene Ontology analysis
- HIV, Human Immunodeficiency Virus
- IAV, Influenza A virus
- IGHV/HD/HJ/HC, Immune globulin heavy V/D/J/C/ region
- IGLV/LJ/LC, Immune globulin light V/J/C/ region
- ILC, Innate Lymphoid Cell
- Infectious diseases
- LIGER, Linked Inference of Genomics Experimental Relationships
- MAGIC, Markov Affinity-based Graph Imputation of Cells
- MARS-seq, Massively parallel single-cell RNA sequencing
- MATCHER, Manifold Alignment To CHaracterize Experimental Relationships
- MCMV, mouse cytomegalovirus
- MERFISH, Multiplexed, Error Robust Fluorescent In Situ Hybridization
- MLV, Moloney Murine Leukemia Virus
- MOFA, Multi-Omics Factor Analysis
- MOI, multiplicity of infection
- PBMCs, peripheral blood mononuclear cells
- PLAC8, placenta-associated 8
- SARS-CoV-2, severe acute respiratory syndrome coronavirus 2
- SAVER, Single-cell Analysis Via Expression Recovery
- SPLit-seq, split pool ligation-based tranome sequencing
- STARTRAC, Single T-cell Analysis by RNA sequencing and TCR TRACking
- STRT-seq, Single-cell Tagged Reverse Transcription sequencing
- Single-cell RNA sequencing
- TCR, T cell receptor
- TSLP, thymic stromal lymphopoietin
- UMAP, Uniform Manifold Approximation and Projection
- UMI, Unique Molecular Identifier
- mcSCRB-seq, molecular crowding single-cell RNA barcoding and sequencing
- pDCs, plasmacytoid dendritic cells
- scRNA-seq, single cell RNA sequencing technology
- sci-RNA-seq, single-cell combinatorial indexing RNA sequencing
- seqFISH, sequential Fluorescent In Situ Hybridization
- smart-seq, switching mechanism at 5′ end of the RNA transcript sequencing
- t-SNE, t-Distributed stochastic neighbor embedding
Collapse
Affiliation(s)
- Geyang Luo
- Shanghai Public Health Clinical Center, Fudan University, Shanghai, China
- Shanghai Public Health Clinical Center and Key Laboratory of Medical Molecular Virology (MOE/NHC/CAMS), Shanghai Medical College and School of Basic Medical Sciences, Fudan University, Shanghai, China
| | - Qian Gao
- Shanghai Public Health Clinical Center and Key Laboratory of Medical Molecular Virology (MOE/NHC/CAMS), Shanghai Medical College and School of Basic Medical Sciences, Fudan University, Shanghai, China
| | - Shuye Zhang
- Shanghai Public Health Clinical Center, Fudan University, Shanghai, China
| | - Bo Yan
- Shanghai Public Health Clinical Center, Fudan University, Shanghai, China
| |
Collapse
|