1
|
Chen S, Rivaud P, Park JH, Tsou T, Charles E, Haliburton JR, Pichiorri F, Thomson M. Dissecting heterogeneous cell populations across drug and disease conditions with PopAlign. Proc Natl Acad Sci U S A 2020; 117:28784-28794. [PMID: 33127759 PMCID: PMC7682438 DOI: 10.1073/pnas.2005990117] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Single-cell measurement techniques can now probe gene expression in heterogeneous cell populations from the human body across a range of environmental and physiological conditions. However, new mathematical and computational methods are required to represent and analyze gene-expression changes that occur in complex mixtures of single cells as they respond to signals, drugs, or disease states. Here, we introduce a mathematical modeling platform, PopAlign, that automatically identifies subpopulations of cells within a heterogeneous mixture and tracks gene-expression and cell-abundance changes across subpopulations by constructing and comparing probabilistic models. Probabilistic models provide a low-error, compressed representation of single-cell data that enables efficient large-scale computations. We apply PopAlign to analyze the impact of 40 different immunomodulatory compounds on a heterogeneous population of donor-derived human immune cells as well as patient-specific disease signatures in multiple myeloma. PopAlign scales to comparisons involving tens to hundreds of samples, enabling large-scale studies of natural and engineered cell populations as they respond to drugs, signals, or physiological change.
Collapse
Affiliation(s)
- Sisi Chen
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125;
- Beckman Center for Single-Cell Profiling and Engineering, California Institute of Technology, Pasadena, CA 91125
| | - Paul Rivaud
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125
- Beckman Center for Single-Cell Profiling and Engineering, California Institute of Technology, Pasadena, CA 91125
| | - Jong H Park
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125
- Beckman Center for Single-Cell Profiling and Engineering, California Institute of Technology, Pasadena, CA 91125
| | - Tiffany Tsou
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125
- Beckman Center for Single-Cell Profiling and Engineering, California Institute of Technology, Pasadena, CA 91125
| | - Emeric Charles
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA 94720
| | | | - Flavia Pichiorri
- Department of Hematologic Malignancies Translational Science, City of Hope, Monrovia, CA 91016
| | - Matt Thomson
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125;
- Beckman Center for Single-Cell Profiling and Engineering, California Institute of Technology, Pasadena, CA 91125
| |
Collapse
|
2
|
Cook DP, Vanderhyden BC. Ovarian cancer and the evolution of subtype classifications using transcriptional profiling†. Biol Reprod 2020; 101:645-658. [PMID: 31187121 DOI: 10.1093/biolre/ioz099] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2019] [Revised: 05/23/2019] [Accepted: 06/09/2019] [Indexed: 02/06/2023] Open
Abstract
Ovarian cancer is a complex disease with multiple subtypes, each having distinct histopathologies and variable responses to treatment. This review highlights the technological milestones and the studies that have applied them to change our definitions of ovarian cancer. Over the past 50 years, technologies such as microarrays and next-generation sequencing have led to the discovery of molecular alterations that define each of the ovarian cancer subtypes and has enabled further subclassification of the most common subtype, high-grade serous ovarian cancer (HGSOC). Improvements in mutational profiling have provided valuable insight, such as the ubiquity of TP53 mutations in HGSOC tumors. However, the information derived from these technological advances has also revealed the immense heterogeneity of this disease, from variation between patients to compositional differences within single masses. In looking forward, the emerging technologies for single-cell and spatially resolved transcriptomics will allow us to better understand the cellular composition and structure of tumors and how these contribute to the molecular subtypes. Attempts to incorporate the complexities ovarian cancer has resulted in increasing sophistication of model systems, and the increased precision in molecular profiling of ovarian cancers has already led to the introduction of inhibitors of poly (ADP-ribose) polymerases as a new class of treatments for ovarian cancer with DNA repair deficiencies. Future endeavors to define increasingly accurate classification strategies for ovarian cancer subtypes will allow for confident prediction of disease progression and provide important insight into potentially targetable molecular mechanisms specific to each subtype.
Collapse
Affiliation(s)
- David P Cook
- Cancer Therapeutics Program, Ottawa Hospital Research Institute, Ottawa, Ontario, Canada.,Department of Cellular and Molecular Medicine, University of Ottawa, Ottawa, Ontario, Canada
| | - Barbara C Vanderhyden
- Cancer Therapeutics Program, Ottawa Hospital Research Institute, Ottawa, Ontario, Canada.,Department of Cellular and Molecular Medicine, University of Ottawa, Ottawa, Ontario, Canada.,Department of Obstetrics and Gynecology, University of Ottawa, Ottawa, Ontario, Canada
| |
Collapse
|
3
|
Nuclei multiplexing with barcoded antibodies for single-nucleus genomics. Nat Commun 2019; 10:2907. [PMID: 31266958 PMCID: PMC6606589 DOI: 10.1038/s41467-019-10756-2] [Citation(s) in RCA: 95] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2018] [Accepted: 05/22/2019] [Indexed: 12/30/2022] Open
Abstract
Single-nucleus RNA-seq (snRNA-seq) enables the interrogation of cellular states in complex tissues that are challenging to dissociate or are frozen, and opens the way to human genetics studies, clinical trials, and precise cell atlases of large organs. However, such applications are currently limited by batch effects, processing, and costs. Here, we present an approach for multiplexing snRNA-seq, using sample-barcoded antibodies to uniquely label nuclei from distinct samples. Comparing human brain cortex samples profiled with or without hashing antibodies, we demonstrate that nucleus hashing does not significantly alter recovered profiles. We develop DemuxEM, a computational tool that detects inter-sample multiplets and assigns singlets to their sample of origin, and validate its accuracy using sex-specific gene expression, species-mixing and natural genetic variation. Our approach will facilitate tissue atlases of isogenic model organisms or from multiple biopsies or longitudinal samples of one donor, and large-scale perturbation screens. Single-nucleus RNA-seq enables interrogation of complex tissues but is limited due to batch effects and processing costs. Here the authors use barcoded antibodies against the nuclear pore complex to label nuclei from distinct samples, and develop a computational tool to assign the sample of origin.
Collapse
|
4
|
Gaublomme JT, Li B, McCabe C, Knecht A, Yang Y, Drokhlyansky E, Van Wittenberghe N, Waldman J, Dionne D, Nguyen L, De Jager PL, Yeung B, Zhao X, Habib N, Rozenblatt-Rosen O, Regev A. Nuclei multiplexing with barcoded antibodies for single-nucleus genomics. Nat Commun 2019; 10:2907. [PMID: 31266958 DOI: 10.1101/476036] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2018] [Accepted: 05/22/2019] [Indexed: 05/24/2023] Open
Abstract
Single-nucleus RNA-seq (snRNA-seq) enables the interrogation of cellular states in complex tissues that are challenging to dissociate or are frozen, and opens the way to human genetics studies, clinical trials, and precise cell atlases of large organs. However, such applications are currently limited by batch effects, processing, and costs. Here, we present an approach for multiplexing snRNA-seq, using sample-barcoded antibodies to uniquely label nuclei from distinct samples. Comparing human brain cortex samples profiled with or without hashing antibodies, we demonstrate that nucleus hashing does not significantly alter recovered profiles. We develop DemuxEM, a computational tool that detects inter-sample multiplets and assigns singlets to their sample of origin, and validate its accuracy using sex-specific gene expression, species-mixing and natural genetic variation. Our approach will facilitate tissue atlases of isogenic model organisms or from multiple biopsies or longitudinal samples of one donor, and large-scale perturbation screens.
Collapse
Affiliation(s)
- Jellert T Gaublomme
- Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, 02142, USA.
- Department of Biological Sciences, Columbia University, New York, NY, 10027, USA.
| | - Bo Li
- Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, 02142, USA
- Center for Immunology and Inflammatory Diseases, Division of Rheumatology, Allergy, and Immunology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, 02129, USA
| | - Cristin McCabe
- Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, 02142, USA
| | - Abigail Knecht
- Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, 02142, USA
| | - Yiming Yang
- Center for Immunology and Inflammatory Diseases, Division of Rheumatology, Allergy, and Immunology Massachusetts General Hospital and Harvard Medical School, Boston, MA, 02129, USA
| | - Eugene Drokhlyansky
- Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, 02142, USA
| | | | - Julia Waldman
- Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, 02142, USA
| | - Danielle Dionne
- Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, 02142, USA
| | - Lan Nguyen
- Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, 02142, USA
| | - Philip L De Jager
- Center for Translational & Computational Neuroimmunology, Columbia University Medical Center, New York, NY, 10019, USA
| | | | | | - Naomi Habib
- Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, 02142, USA
- Edmond and Lily Safra Center for Brain Sciences, Hebrew University of Jerusalem, Jerusalem, 9190401, Israel
| | - Orit Rozenblatt-Rosen
- Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, 02142, USA.
| | - Aviv Regev
- Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, 02142, USA.
- Howard Hughes Medical Institute, Koch Institute of Integrative Cancer Research, Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA.
| |
Collapse
|
5
|
Lanz TV, Pröbstel AK, Mildenberger I, Platten M, Schirmer L. Single-Cell High-Throughput Technologies in Cerebrospinal Fluid Research and Diagnostics. Front Immunol 2019; 10:1302. [PMID: 31244848 PMCID: PMC6579921 DOI: 10.3389/fimmu.2019.01302] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2019] [Accepted: 05/22/2019] [Indexed: 01/08/2023] Open
Abstract
High-throughput single-cell technologies have recently emerged as essential tools in biomedical research with great potential for clinical pathology when studying liquid and solid biopsies. We provide an update on current single-cell methods in cerebrospinal fluid research and diagnostics, focusing on high-throughput cell-type specific proteomic and genomic technologies. Proteomic methods comprising flow cytometry and mass cytometry as well as genomic approaches including immune cell repertoire and single-cell transcriptomic studies are critically reviewed and future directions discussed.
Collapse
Affiliation(s)
- Tobias V. Lanz
- Department of Neurology, Medical Faculty Mannheim, University of Heidelberg, Mannheim, Germany
- Division of Immunology and Rheumatology, Department of Medicine, Stanford University School of Medicine, Stanford, CA, United States
| | - Anne-Katrin Pröbstel
- Department of Neurology, Weill Institute for Neurosciences, University of California, San Francisco, San Francisco, CA, United States
- Departments of Medicine and Biomedicine, Neurologic Clinic and Policlinic, University Hospital Basel, University of Basel, Basel, Switzerland
| | - Iris Mildenberger
- Department of Neurology, Medical Faculty Mannheim, University of Heidelberg, Mannheim, Germany
| | - Michael Platten
- Department of Neurology, Medical Faculty Mannheim, University of Heidelberg, Mannheim, Germany
- DKTK Clinical Cooperation Unit Neuroimmunology and Brain Tumor Immunology, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Lucas Schirmer
- Department of Neurology, Medical Faculty Mannheim, University of Heidelberg, Mannheim, Germany
| |
Collapse
|
6
|
Guo C, Kong W, Kamimoto K, Rivera-Gonzalez GC, Yang X, Kirita Y, Morris SA. CellTag Indexing: genetic barcode-based sample multiplexing for single-cell genomics. Genome Biol 2019. [PMID: 31072405 DOI: 10.1101/335547] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/08/2023] Open
Abstract
High-throughput single-cell assays increasingly require special consideration in experimental design, sample multiplexing, batch effect removal, and data interpretation. Here, we describe a lentiviral barcode-based multiplexing approach, CellTag Indexing, which uses predefined genetic barcodes that are heritable, enabling cell populations to be tagged, pooled, and tracked over time in the same experimental replicate. We demonstrate the utility of CellTag Indexing by sequencing transcriptomes using a variety of cell types, including long-term tracking of cell engraftment and differentiation in vivo. Together, this presents CellTag Indexing as a broadly applicable genetic multiplexing tool that is complementary with existing single-cell technologies.
Collapse
Affiliation(s)
- Chuner Guo
- Department of Developmental Biology, Washington University School of Medicine in St Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
- Department of Genetics, Washington University School of Medicine in St. Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
- Center of Regenerative Medicine, Washington University School of Medicine in St. Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
| | - Wenjun Kong
- Department of Developmental Biology, Washington University School of Medicine in St Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
- Department of Genetics, Washington University School of Medicine in St. Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
- Center of Regenerative Medicine, Washington University School of Medicine in St. Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
| | - Kenji Kamimoto
- Department of Developmental Biology, Washington University School of Medicine in St Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
- Department of Genetics, Washington University School of Medicine in St. Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
- Center of Regenerative Medicine, Washington University School of Medicine in St. Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
| | - Guillermo C Rivera-Gonzalez
- Department of Developmental Biology, Washington University School of Medicine in St Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
- Department of Genetics, Washington University School of Medicine in St. Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
- Center of Regenerative Medicine, Washington University School of Medicine in St. Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
| | - Xue Yang
- Department of Developmental Biology, Washington University School of Medicine in St Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
- Department of Genetics, Washington University School of Medicine in St. Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
- Center of Regenerative Medicine, Washington University School of Medicine in St. Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
| | - Yuhei Kirita
- Department of Genetics, Washington University School of Medicine in St. Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
- Division of Nephrology, Department of Medicine, Washington University School of Medicine in St. Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
| | - Samantha A Morris
- Department of Developmental Biology, Washington University School of Medicine in St Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA.
- Department of Genetics, Washington University School of Medicine in St. Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA.
- Center of Regenerative Medicine, Washington University School of Medicine in St. Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA.
| |
Collapse
|
7
|
Guo C, Kong W, Kamimoto K, Rivera-Gonzalez GC, Yang X, Kirita Y, Morris SA. CellTag Indexing: genetic barcode-based sample multiplexing for single-cell genomics. Genome Biol 2019; 20:90. [PMID: 31072405 PMCID: PMC6509836 DOI: 10.1186/s13059-019-1699-y] [Citation(s) in RCA: 47] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2018] [Accepted: 04/23/2019] [Indexed: 12/15/2022] Open
Abstract
High-throughput single-cell assays increasingly require special consideration in experimental design, sample multiplexing, batch effect removal, and data interpretation. Here, we describe a lentiviral barcode-based multiplexing approach, CellTag Indexing, which uses predefined genetic barcodes that are heritable, enabling cell populations to be tagged, pooled, and tracked over time in the same experimental replicate. We demonstrate the utility of CellTag Indexing by sequencing transcriptomes using a variety of cell types, including long-term tracking of cell engraftment and differentiation in vivo. Together, this presents CellTag Indexing as a broadly applicable genetic multiplexing tool that is complementary with existing single-cell technologies.
Collapse
Affiliation(s)
- Chuner Guo
- Department of Developmental Biology, Washington University School of Medicine in St Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
- Department of Genetics, Washington University School of Medicine in St. Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
- Center of Regenerative Medicine, Washington University School of Medicine in St. Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
| | - Wenjun Kong
- Department of Developmental Biology, Washington University School of Medicine in St Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
- Department of Genetics, Washington University School of Medicine in St. Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
- Center of Regenerative Medicine, Washington University School of Medicine in St. Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
| | - Kenji Kamimoto
- Department of Developmental Biology, Washington University School of Medicine in St Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
- Department of Genetics, Washington University School of Medicine in St. Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
- Center of Regenerative Medicine, Washington University School of Medicine in St. Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
| | - Guillermo C Rivera-Gonzalez
- Department of Developmental Biology, Washington University School of Medicine in St Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
- Department of Genetics, Washington University School of Medicine in St. Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
- Center of Regenerative Medicine, Washington University School of Medicine in St. Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
| | - Xue Yang
- Department of Developmental Biology, Washington University School of Medicine in St Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
- Department of Genetics, Washington University School of Medicine in St. Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
- Center of Regenerative Medicine, Washington University School of Medicine in St. Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
| | - Yuhei Kirita
- Department of Genetics, Washington University School of Medicine in St. Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
- Division of Nephrology, Department of Medicine, Washington University School of Medicine in St. Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA
| | - Samantha A Morris
- Department of Developmental Biology, Washington University School of Medicine in St Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA.
- Department of Genetics, Washington University School of Medicine in St. Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA.
- Center of Regenerative Medicine, Washington University School of Medicine in St. Louis, 660 S. Euclid Avenue, Campus Box 8103, St. Louis, MO, 63110, USA.
| |
Collapse
|
8
|
Wolock SL, Lopez R, Klein AM. Scrublet: Computational Identification of Cell Doublets in Single-Cell Transcriptomic Data. Cell Syst 2019. [PMID: 30954476 DOI: 10.1101/357368v1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/24/2023]
Abstract
Single-cell RNA-sequencing has become a widely used, powerful approach for studying cell populations. However, these methods often generate multiplet artifacts, where two or more cells receive the same barcode, resulting in a hybrid transcriptome. In most experiments, multiplets account for several percent of transcriptomes and can confound downstream data analysis. Here, we present Single-Cell Remover of Doublets (Scrublet), a framework for predicting the impact of multiplets in a given analysis and identifying problematic multiplets. Scrublet avoids the need for expert knowledge or cell clustering by simulating multiplets from the data and building a nearest neighbor classifier. To demonstrate the utility of this approach, we test Scrublet on several datasets that include independent knowledge of cell multiplets. Scrublet is freely available for download at github.com/AllonKleinLab/scrublet.
Collapse
Affiliation(s)
- Samuel L Wolock
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA
| | - Romain Lopez
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA; Centre de Mathématiques Appliquées, École polytechnique, Palaiseau 91120, France
| | - Allon M Klein
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA.
| |
Collapse
|
9
|
Wolock SL, Lopez R, Klein AM. Scrublet: Computational Identification of Cell Doublets in Single-Cell Transcriptomic Data. Cell Syst 2019; 8:281-291.e9. [PMID: 30954476 DOI: 10.1101/357368] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2018] [Revised: 09/28/2018] [Accepted: 11/28/2018] [Indexed: 05/24/2023]
Abstract
Single-cell RNA-sequencing has become a widely used, powerful approach for studying cell populations. However, these methods often generate multiplet artifacts, where two or more cells receive the same barcode, resulting in a hybrid transcriptome. In most experiments, multiplets account for several percent of transcriptomes and can confound downstream data analysis. Here, we present Single-Cell Remover of Doublets (Scrublet), a framework for predicting the impact of multiplets in a given analysis and identifying problematic multiplets. Scrublet avoids the need for expert knowledge or cell clustering by simulating multiplets from the data and building a nearest neighbor classifier. To demonstrate the utility of this approach, we test Scrublet on several datasets that include independent knowledge of cell multiplets. Scrublet is freely available for download at github.com/AllonKleinLab/scrublet.
Collapse
Affiliation(s)
- Samuel L Wolock
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA
| | - Romain Lopez
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA; Centre de Mathématiques Appliquées, École polytechnique, Palaiseau 91120, France
| | - Allon M Klein
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA.
| |
Collapse
|
10
|
McGinnis CS, Murrow LM, Gartner ZJ. DoubletFinder: Doublet Detection in Single-Cell RNA Sequencing Data Using Artificial Nearest Neighbors. Cell Syst 2019; 8:329-337.e4. [PMID: 30954475 DOI: 10.1101/352484] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2018] [Revised: 11/15/2018] [Accepted: 03/06/2019] [Indexed: 05/24/2023]
Abstract
Single-cell RNA sequencing (scRNA-seq) data are commonly affected by technical artifacts known as "doublets," which limit cell throughput and lead to spurious biological conclusions. Here, we present a computational doublet detection tool-DoubletFinder-that identifies doublets using only gene expression data. DoubletFinder predicts doublets according to each real cell's proximity in gene expression space to artificial doublets created by averaging the transcriptional profile of randomly chosen cell pairs. We first use scRNA-seq datasets where the identity of doublets is known to show that DoubletFinder identifies doublets formed from transcriptionally distinct cells. When these doublets are removed, the identification of differentially expressed genes is enhanced. Second, we provide a method for estimating DoubletFinder input parameters, allowing its application across scRNA-seq datasets with diverse distributions of cell types. Lastly, we present "best practices" for DoubletFinder applications and illustrate that DoubletFinder is insensitive to an experimentally validated kidney cell type with "hybrid" expression features.
Collapse
Affiliation(s)
- Christopher S McGinnis
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA, USA
| | - Lyndsay M Murrow
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA, USA
| | - Zev J Gartner
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA, USA; Chan Zuckerbeg Biohub, University of California, San Francisco, San Francisco, CA, USA; Center for Cellular Construction, University of California, San Francisco, San Francisco, CA, USA.
| |
Collapse
|
11
|
Stoeckius M, Zheng S, Houck-Loomis B, Hao S, Yeung BZ, Mauck WM, Smibert P, Satija R. Cell Hashing with barcoded antibodies enables multiplexing and doublet detection for single cell genomics. Genome Biol 2018; 19:224. [PMID: 30567574 PMCID: PMC6300015 DOI: 10.1186/s13059-018-1603-1] [Citation(s) in RCA: 579] [Impact Index Per Article: 96.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2018] [Accepted: 12/04/2018] [Indexed: 12/11/2022] Open
Abstract
Despite rapid developments in single cell sequencing, sample-specific batch effects, detection of cell multiplets, and experimental costs remain outstanding challenges. Here, we introduce Cell Hashing, where oligo-tagged antibodies against ubiquitously expressed surface proteins uniquely label cells from distinct samples, which can be subsequently pooled. By sequencing these tags alongside the cellular transcriptome, we can assign each cell to its original sample, robustly identify cross-sample multiplets, and "super-load" commercial droplet-based systems for significant cost reduction. We validate our approach using a complementary genetic approach and demonstrate how hashing can generalize the benefits of single cell multiplexing to diverse samples and experimental designs.
Collapse
Affiliation(s)
- Marlon Stoeckius
- Technology Innovation Lab, New York Genome Center, New York, NY, USA
| | - Shiwei Zheng
- NYU Center for Genomics and Systems Biology, New York Genome Center, New York, NY, USA
| | | | - Stephanie Hao
- Technology Innovation Lab, New York Genome Center, New York, NY, USA
| | | | - William M Mauck
- NYU Center for Genomics and Systems Biology, New York Genome Center, New York, NY, USA
| | - Peter Smibert
- Technology Innovation Lab, New York Genome Center, New York, NY, USA.
| | - Rahul Satija
- NYU Center for Genomics and Systems Biology, New York Genome Center, New York, NY, USA.
| |
Collapse
|