Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Srivastava A, Malik L, Smith T, Sudbery I, Patro R. Alevin efficiently estimates accurate gene abundances from dscRNA-seq data. Genome Biol 2019;20:65. [PMID: 30917859 PMCID: PMC6437997 DOI: 10.1186/s13059-019-1670-y] [Citation(s) in RCA: 122] [Impact Index Per Article: 24.4] [Received: 11/28/2018] [Accepted: 03/05/2019] [Indexed: 12/15/2022] Open

Number

Cited by Other Article(s)

101

Tsagiopoulou M, Maniou MC, Pechlivanis N, Togkousidis A, Kotrová M, Hutzenlaub T, Kappas I, Chatzidimitriou A, Psomopoulos F. UMIc: A Preprocessing Method for UMI Deduplication and Reads Correction. Front Genet 2021;12:660366. [PMID: 34122513 PMCID: PMC8193862 DOI: 10.3389/fgene.2021.660366] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Received: 01/29/2021] [Accepted: 04/08/2021] [Indexed: 11/17/2022] Open

102

Gilis J, Vitting-Seerup K, Van den Berge K, Clement L. satuRn: Scalable analysis of differential transcript usage for bulk and single-cell RNA-sequencing applications. F1000Res 2021;10:374. [PMID: 36762203 PMCID: PMC9892655 DOI: 10.12688/f1000research.51749.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 07/26/2022] [Indexed: 11/20/2022] Open

103

Wilson GW, Derouet M, Darling GE, Yeung JC. scSNV: accurate dscRNA-seq SNV co-expression analysis using duplicate tag collapsing. Genome Biol 2021;22:144. [PMID: 33962667 PMCID: PMC8103760 DOI: 10.1186/s13059-021-02364-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 07/08/2020] [Accepted: 04/23/2021] [Indexed: 12/21/2022] Open

104

Cell-level metadata are indispensable for documenting single-cell sequencing datasets. PLoS Biol 2021;19:e3001077. [PMID: 33945522 PMCID: PMC8121533 DOI: 10.1371/journal.pbio.3001077] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Revised: 05/14/2021] [Indexed: 11/19/2022] Open

105

Gillen AE, Goering R, Taliaferro JM. Quantifying alternative polyadenylation in RNAseq data with LABRAT. Methods Enzymol 2021;655:245-263. [PMID: 34183124 DOI: 10.1016/bs.mie.2021.03.018] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 12/17/2022]

106

Zheng H, Rao AM, Dermadi D, Toh J, Murphy Jones L, Donato M, Liu Y, Su Y, Dai CL, Kornilov SA, Karagiannis M, Marantos T, Hasin-Brumshtein Y, He YD, Giamarellos-Bourboulis EJ, Heath JR, Khatri P. Multi-cohort analysis of host immune response identifies conserved protective and detrimental modules associated with severity across viruses. Immunity 2021;54:753-768.e5. [PMID: 33765435 PMCID: PMC7988739 DOI: 10.1016/j.immuni.2021.03.002] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Received: 09/23/2020] [Revised: 12/03/2020] [Accepted: 03/01/2021] [Indexed: 02/08/2023]

Affiliation(s)

Hong Zheng Institute for Immunity, Transplantation and Infection, School of Medicine, Stanford University, CA 94305, USA; Center for Biomedical Informatics Research, Department of Medicine, School of Medicine, Stanford University, CA 94305, USA
Aditya M Rao Institute for Immunity, Transplantation and Infection, School of Medicine, Stanford University, CA 94305, USA; Immunology program, Stanford University, CA 94305, USA
Denis Dermadi Institute for Immunity, Transplantation and Infection, School of Medicine, Stanford University, CA 94305, USA; Center for Biomedical Informatics Research, Department of Medicine, School of Medicine, Stanford University, CA 94305, USA
Jiaying Toh Institute for Immunity, Transplantation and Infection, School of Medicine, Stanford University, CA 94305, USA; Immunology program, Stanford University, CA 94305, USA
Lara Murphy Jones Institute for Immunity, Transplantation and Infection, School of Medicine, Stanford University, CA 94305, USA; Center for Biomedical Informatics Research, Department of Medicine, School of Medicine, Stanford University, CA 94305, USA; Division of Critical Care Medicine, Department of Pediatrics, School of Medicine, Stanford University, CA 94305, USA
Michele Donato Institute for Immunity, Transplantation and Infection, School of Medicine, Stanford University, CA 94305, USA; Center for Biomedical Informatics Research, Department of Medicine, School of Medicine, Stanford University, CA 94305, USA
Yiran Liu Institute for Immunity, Transplantation and Infection, School of Medicine, Stanford University, CA 94305, USA; Cancer Biology program, Stanford University, CA 94305, USA
Yapeng Su Institute for Systems Biology, Seattle, WA, USA
Cheng L Dai Institute for Systems Biology, Seattle, WA, USA
Sergey A Kornilov Institute for Systems Biology, Seattle, WA, USA
Minas Karagiannis 4th Department of Internal Medicine, National and Kapodistrian University of Athens, Medical School, 124 62 Athens, Greece
Theodoros Marantos 4th Department of Internal Medicine, National and Kapodistrian University of Athens, Medical School, 124 62 Athens, Greece
Yehudit Hasin-Brumshtein Inflammatix, Inc. Burlingame, CA, USA
Yudong D He Inflammatix, Inc. Burlingame, CA, USA
Evangelos J Giamarellos-Bourboulis 4th Department of Internal Medicine, National and Kapodistrian University of Athens, Medical School, 124 62 Athens, Greece
James R Heath Institute for Systems Biology, Seattle, WA, USA; Department of Bioengineering, University of Washington, Seattle, WA 98195
Purvesh Khatri Institute for Immunity, Transplantation and Infection, School of Medicine, Stanford University, CA 94305, USA; Center for Biomedical Informatics Research, Department of Medicine, School of Medicine, Stanford University, CA 94305, USA.

107

Modular, efficient and constant-memory single-cell RNA-seq preprocessing. Nat Biotechnol 2021;39:813-818. [PMID: 33795888 DOI: 10.1038/s41587-021-00870-2] [Citation(s) in RCA: 177] [Impact Index Per Article: 59.0] [Received: 08/07/2019] [Accepted: 02/09/2021] [Indexed: 11/08/2022]

108

Prokop JW, Bupp CP, Frisch A, Bilinovich SM, Campbell DB, Vogt D, Schultz CR, Uhl KL, VanSickle E, Rajasekaran S, Bachmann AS. Emerging Role of ODC1 in Neurodevelopmental Disorders and Brain Development. Genes (Basel) 2021;12:genes12040470. [PMID: 33806076 PMCID: PMC8064465 DOI: 10.3390/genes12040470] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Received: 02/21/2021] [Revised: 03/15/2021] [Accepted: 03/22/2021] [Indexed: 01/18/2023] Open

Affiliation(s)

Jeremy W. Prokop Department of Pediatrics and Human Development, Michigan State University, Grand Rapids, MI 49503, USA; Department of Pharmacology and Toxicology, Michigan State University, East Lansing, MI 48824, USA; Center for Research in Autism, Intellectual, and Other Neurodevelopmental Disabilities, Michigan State University, East Lansing, MI 48824, USA (corresponding author)
Caleb P. Bupp Department of Pediatrics and Human Development, Michigan State University, Grand Rapids, MI 49503, USA; Spectrum Health Medical Genetics, Grand Rapids, MI 49503, USA
Austin Frisch Department of Pediatrics and Human Development, Michigan State University, Grand Rapids, MI 49503, USA
Stephanie M. Bilinovich Department of Pediatrics and Human Development, Michigan State University, Grand Rapids, MI 49503, USA
Daniel B. Campbell Department of Pediatrics and Human Development, Michigan State University, Grand Rapids, MI 49503, USA; Center for Research in Autism, Intellectual, and Other Neurodevelopmental Disabilities, Michigan State University, East Lansing, MI 48824, USA; Neuroscience Program, Michigan State University, East Lansing, MI 48824, USA
Daniel Vogt Department of Pediatrics and Human Development, Michigan State University, Grand Rapids, MI 49503, USA; Center for Research in Autism, Intellectual, and Other Neurodevelopmental Disabilities, Michigan State University, East Lansing, MI 48824, USA; Neuroscience Program, Michigan State University, East Lansing, MI 48824, USA
Chad R. Schultz Department of Pediatrics and Human Development, Michigan State University, Grand Rapids, MI 49503, USA
Katie L. Uhl Department of Pediatrics and Human Development, Michigan State University, Grand Rapids, MI 49503, USA
Elizabeth VanSickle Spectrum Health Medical Genetics, Grand Rapids, MI 49503, USA
Surender Rajasekaran Department of Pediatrics and Human Development, Michigan State University, Grand Rapids, MI 49503, USA; Pediatric Intensive Care Unit, Helen DeVos Children’s Hospital, Grand Rapids, MI 49503, USA; Office of Research, Spectrum Health, Grand Rapids, MI 49503, USA
André S. Bachmann Department of Pediatrics and Human Development, Michigan State University, Grand Rapids, MI 49503, USA (corresponding author)

109

Bilinovich SM, Uhl KL, Lewis K, Soehnlen X, Williams M, Vogt D, Prokop JW, Campbell DB. Integrated RNA Sequencing Reveals Epigenetic Impacts of Diesel Particulate Matter Exposure in Human Cerebral Organoids. Dev Neurosci 2021;42:195-207. [PMID: 33657557 DOI: 10.1159/000513536] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 09/08/2020] [Accepted: 12/02/2020] [Indexed: 12/25/2022] Open

Affiliation(s)

Stephanie M Bilinovich Department of Pediatrics & Human Development, Michigan State University, Grand Rapids, Michigan, USA
Katie L Uhl Department of Pediatrics & Human Development, Michigan State University, Grand Rapids, Michigan, USA
Kristy Lewis Department of Pediatrics & Human Development, Michigan State University, Grand Rapids, Michigan, USA
Xavier Soehnlen Department of Pediatrics & Human Development, Michigan State University, Grand Rapids, Michigan, USA
Michael Williams Department of Pediatrics & Human Development, Michigan State University, Grand Rapids, Michigan, USA; Center for Research in Autism, Intellectual, and other Neurodevelopmental Disabilities, Michigan State University, East Lansing, Michigan, USA; Neuroscience Program, Michigan State University, East Lansing, Michigan, USA
Daniel Vogt Department of Pediatrics & Human Development, Michigan State University, Grand Rapids, Michigan, USA; Center for Research in Autism, Intellectual, and other Neurodevelopmental Disabilities, Michigan State University, East Lansing, Michigan, USA; Neuroscience Program, Michigan State University, East Lansing, Michigan, USA
Jeremy W Prokop Department of Pediatrics & Human Development, Michigan State University, Grand Rapids, Michigan, USA; Center for Research in Autism, Intellectual, and other Neurodevelopmental Disabilities, Michigan State University, East Lansing, Michigan, USA; Department of Pharmacology and Toxicology, Michigan State University, East Lansing, Michigan, USA
Daniel B Campbell Department of Pediatrics & Human Development, Michigan State University, Grand Rapids, Michigan, USA; Center for Research in Autism, Intellectual, and other Neurodevelopmental Disabilities, Michigan State University, East Lansing, Michigan, USA; Neuroscience Program, Michigan State University, East Lansing, Michigan, USA

110

Cribbs AP, Filippakopoulos P, Philpott M, Wells G, Penn H, Oerum H, Valge-Archer V, Feldmann M, Oppermann U. Dissecting the Role of BET Bromodomain Proteins BRD2 and BRD4 in Human NK Cell Function. Front Immunol 2021;12:626255. [PMID: 33717143 PMCID: PMC7953504 DOI: 10.3389/fimmu.2021.626255] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 11/05/2020] [Accepted: 01/13/2021] [Indexed: 12/19/2022] Open

111

Mukherjee K, Xue L, Planutis A, Gnanapragasam MN, Chess A, Bieker JJ. EKLF/KLF1 expression defines a unique macrophage subset during mouse erythropoiesis. eLife 2021;10:61070. [PMID: 33570494 PMCID: PMC7932694 DOI: 10.7554/elife.61070] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 07/14/2020] [Accepted: 02/10/2021] [Indexed: 12/17/2022] Open

112

Wang YXR, Li L, Li JJ, Huang H. Network Modeling in Biology: Statistical Methods for Gene and Brain Networks. Stat Sci 2021;36:89-108. [PMID: 34305304 PMCID: PMC8296984 DOI: 10.1214/20-sts792] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Indexed: 12/14/2022]

113

Van Buren S, Sarkar H, Srivastava A, Rashid NU, Patro R, Love MI. Compression of quantification uncertainty for scRNA-seq counts. Bioinformatics 2021;37:1699-1707. [PMID: 33471073 PMCID: PMC8289386 DOI: 10.1093/bioinformatics/btab001] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 07/08/2020] [Revised: 11/16/2020] [Accepted: 01/04/2021] [Indexed: 11/13/2022] Open

Abstract

Motivation

Quantification estimates of gene expression from single-cell RNA-seq (scRNA-seq) data have inherent uncertainty due to reads that map to multiple genes. Many existing scRNA-seq quantification pipelines ignore multi-mapping reads and therefore underestimate expected read counts for many genes. alevin accounts for multi-mapping reads and allows for the generation of ‘inferential replicates’, which reflect quantification uncertainty. Previous methods have shown improved performance when incorporating these replicates into statistical analyses, but storage and use of these replicates increases computation time and memory requirements.

Results

We demonstrate that storing only the mean and variance from a set of inferential replicates (‘compression’) is sufficient to capture gene-level quantification uncertainty, while reducing disk storage to as low as 9% of original storage, and memory usage when loading data to as low as 6%. Using these values, we generate ‘pseudo-inferential’ replicates from a negative binomial distribution and propose a general procedure for incorporating these replicates into a proposed statistical testing framework. When applying this procedure to trajectory-based differential expression analyses, we show false positives are reduced by more than a third for genes with high levels of quantification uncertainty. We additionally extend the Swish method to incorporate pseudo-inferential replicates and demonstrate improvements in computation time and memory usage without any loss in performance. Lastly, we show that discarding multi-mapping reads can result in significant underestimation of counts for functionally important genes in a real dataset.
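
As a rough illustration of the compression idea described above (this is not the fishpond implementation), the following Python sketch reduces a set of inferential replicates to a per-entry mean and variance, then draws pseudo-inferential replicates from a negative binomial matched to those two moments. The function names, the array layout (replicates on the last axis), and the Poisson fallback for entries whose variance does not exceed the mean are all assumptions made for this example.

```python
import numpy as np


def compress_replicates(inf_reps):
    """Reduce inferential replicates to their mean and variance.

    inf_reps: array whose last axis indexes replicates (hypothetical layout,
    e.g. genes x cells x replicates).
    """
    inf_reps = np.asarray(inf_reps, dtype=float)
    mean = inf_reps.mean(axis=-1)
    var = inf_reps.var(axis=-1, ddof=1)  # sample variance across replicates
    return mean, var


def pseudo_replicates(mean, var, n_reps, rng):
    """Draw pseudo-replicates from a moment-matched negative binomial.

    NumPy parameterises the negative binomial by (n, p) with
    mean = n(1-p)/p and variance = n(1-p)/p^2, so p = mean/var and
    n = mean^2 / (var - mean). This only works when var > mean > 0;
    other entries fall back to a Poisson draw (an assumption of this sketch).
    """
    mean = np.asarray(mean, dtype=float)
    var = np.asarray(var, dtype=float)
    over = (var > mean) & (mean > 0)  # entries where the NB is well defined

    # Use harmless placeholder parameters where the NB is not applicable.
    n = np.where(over, mean**2 / np.where(over, var - mean, 1.0), 1.0)
    p = np.where(over, mean / np.where(over, var, 1.0), 0.5)

    shape = (n_reps,) + mean.shape
    nb_draws = rng.negative_binomial(n, p, size=shape)
    pois_draws = rng.poisson(mean, size=shape)
    return np.where(over, nb_draws, pois_draws)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Toy data: 4 genes x 3 cells x 20 replicates of overdispersed counts.
    reps = rng.negative_binomial(5, 0.3, size=(4, 3, 20))
    mean, var = compress_replicates(reps)
    pseudo = pseudo_replicates(mean, var, n_reps=20, rng=rng)
    print(pseudo.shape)  # replicate axis comes first in this sketch
```

Only `mean` and `var` need to be stored; pseudo-replicates can be regenerated on demand, which is where the storage and memory savings reported in the abstract would come from.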

Availability and implementation

makeInfReps and splitSwish are implemented in the R/Bioconductor fishpond package available at https://bioconductor.org/packages/fishpond. Analyses and simulated datasets can be found in the paper’s GitHub repo at https://github.com/skvanburen/scUncertaintyPaperCode.

Supplementary information

Supplementary data are available at Bioinformatics online.

114

Acosta J, Ssozi D, van Galen P. Single-Cell RNA Sequencing to Disentangle the Blood System. Arterioscler Thromb Vasc Biol 2021;41:1012-1018. [PMID: 33441024 PMCID: PMC7901535 DOI: 10.1161/atvbaha.120.314654] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Indexed: 12/13/2022]

115

Soneson C, Srivastava A, Patro R, Stadler MB. Preprocessing choices affect RNA velocity results for droplet scRNA-seq data. PLoS Comput Biol 2021;17:e1008585. [PMID: 33428615 PMCID: PMC7822509 DOI: 10.1371/journal.pcbi.1008585] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Received: 04/06/2020] [Revised: 01/22/2021] [Accepted: 11/30/2020] [Indexed: 12/25/2022] Open

116

Decoding myofibroblast origins in human kidney fibrosis. Nature 2021;589:281-286. [PMID: 33176333 PMCID: PMC7611626 DOI: 10.1038/s41586-020-2941-1] [Citation(s) in RCA: 385] [Impact Index Per Article: 128.3] [Received: 01/30/2020] [Accepted: 10/19/2020] [Indexed: 01/29/2023]

117

Oh Y, Yang S, Liu X, Jana S, Izaddoustdar F, Gao X, Debi R, Kim DK, Kim KH, Yang P, Kassiri Z, Lakin R, Backx PH. Transcriptomic Bioinformatic Analyses of Atria Uncover Involvement of Pathways Related to Strain and Post-translational Modification of Collagen in Increased Atrial Fibrillation Vulnerability in Intensely Exercised Mice. Front Physiol 2020;11:605671. [PMID: 33424629 PMCID: PMC7793719 DOI: 10.3389/fphys.2020.605671] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Received: 09/12/2020] [Accepted: 11/26/2020] [Indexed: 02/06/2023] Open

Abstract

Atrial fibrillation (AF) is the most common supraventricular tachyarrhythmia and is typically associated with cardiovascular disease (CVD) and poor cardiovascular health. Paradoxically, endurance athletes are also at risk for AF. While it is well-established that persistent AF is associated with atrial fibrosis, hypertrophy and inflammation, intensely exercised mice showed similar adverse atrial changes and increased AF vulnerability, which required tumor necrosis factor (TNF) signaling, even though ventricular structure and function improved. To identify some of the molecular factors underlying the chamber-specific and TNF-dependent atrial changes induced by exercise, we performed transcriptome analyses of hearts from wild-type and TNF-knockout mice following 2 days, 2 weeks, or 6 weeks of exercise. Consistent with the central role of atrial stretch arising from elevated venous pressure in AF promotion, all 3 time points were associated with differential regulation of genes in atria linked to mechanosensing (focal adhesion kinase, integrins and cell-cell communications), extracellular matrix (ECM) and TNF pathways, with TNF appearing to play a permissive, rather than causal, role in gene changes. Importantly, mechanosensing/ECM genes, along with tubulin- and hypertrophy-related genes, were enriched only after 2 days of exercise and were downregulated at 2 and 6 weeks, suggesting that early reactive strain-dependent remodeling with exercise yields to compensatory adjustments. Moreover, at the later time points, there was also downregulation of both collagen genes and genes involved in collagen turnover, a pattern mirroring aging-related fibrosis. By comparison, twofold fewer genes were differentially regulated in ventricles vs. atria, independently of TNF. Our findings reveal that exercise promotes TNF-dependent atrial transcriptome remodeling of ECM/mechanosensing pathways, consistent with increased preload and atrial stretch seen with exercise. We propose that similar preload-dependent mechanisms are responsible for atrial changes and AF in both CVD patients and athletes.

118

Zhang Z, Cui F, Wang C, Zhao L, Zou Q. Goals and approaches for each processing step for single-cell RNA sequencing data. Brief Bioinform 2020;22:6034054. [PMID: 33316046 DOI: 10.1093/bib/bbaa314] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Received: 08/15/2020] [Revised: 10/10/2020] [Accepted: 10/16/2020] [Indexed: 12/12/2022] Open

119

Tekman M, Batut B, Ostrovsky A, Antoniewski C, Clements D, Ramirez F, Etherington GJ, Hotz HR, Scholtalbers J, Manning JR, Bellenger L, Doyle MA, Heydarian M, Huang N, Soranzo N, Moreno P, Mautner S, Papatheodorou I, Nekrutenko A, Taylor J, Blankenberg D, Backofen R, Grüning B. A single-cell RNA-sequencing training and analysis suite using the Galaxy framework. Gigascience 2020;9:5931798. [PMID: 33079170 PMCID: PMC7574357 DOI: 10.1093/gigascience/giaa102] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Received: 06/22/2020] [Revised: 08/30/2020] [Indexed: 11/25/2022] Open

Abstract

Background

The vast ecosystem of single-cell RNA-sequencing tools has until recently been plagued by an excess of diverging analysis strategies, inconsistent file formats, and compatibility issues between different software suites. The uptake of 10x Genomics datasets has begun to calm this diversity, and the bioinformatics community leans once more towards the large computing requirements and the statistically driven methods needed to process and understand these ever-growing datasets.

Results

Here we outline several Galaxy workflows and learning resources for single-cell RNA-sequencing, with the aim of providing a comprehensive analysis environment paired with a thorough user learning experience that bridges the knowledge gap between the computational methods and the underlying cell biology. The Galaxy reproducible bioinformatics framework provides tools, workflows, and trainings that not only enable users to perform 1-click 10x preprocessing but also empower them to demultiplex raw sequencing from custom tagged and full-length sequencing protocols. The downstream analysis supports a range of high-quality interoperable suites separated into common stages of analysis: inspection, filtering, normalization, confounder removal, and clustering. The teaching resources cover concepts from computer science to cell biology. Access to all resources is provided at the singlecell.usegalaxy.eu portal.

Conclusions

The reproducible and training-oriented Galaxy framework provides a sustainable high-performance computing environment for users to run flexible analyses on both 10x and alternative platforms. The tutorials from the Galaxy Training Network along with the frequent training workshops hosted by the Galaxy community provide a means for users to learn, publish, and teach single-cell RNA-sequencing analysis.

Affiliation(s)

Mehmet Tekman Department of Bioinformatics, University of Freiburg, Georges-Köhler-Allee 106, 79110 Freiburg, Germany
Bérénice Batut Department of Bioinformatics, University of Freiburg, Georges-Köhler-Allee 106, 79110 Freiburg, Germany
Alexander Ostrovsky Department of Biology, Johns Hopkins University, Mudd Hall 144, 3400 N. Charles Street, Baltimore, MD 21218, USA
Christophe Antoniewski ARTbio, Sorbonne Université, CNRS FR 3631, Inserm US 037, Paris, France; Institut de Biologie Paris Seine, 9 Quai Saint-Bernard Université Pierre et Marie Curie, Campus Jussieu, Bâtiments A-B-C, 75005 Paris, France
Dave Clements Department of Biology, Johns Hopkins University, Mudd Hall 144, 3400 N. Charles Street, Baltimore, MD 21218, USA
Fidel Ramirez Boehringer Ingelheim International GmbH, Binger Strasse 173, 55216 Ingelheim am Rhein, Biberach, Germany
Graham J Etherington Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
Hans-Rudolf Hotz Friedrich Miescher Institute for Biomedical Research, Maulbeerstrasse 66, 4058 Basel, Switzerland; SIB Swiss Institute of Bioinformatics, Maulbeerstrasse 66, 4058 Basel, Switzerland
Jelle Scholtalbers European Molecular Biology Laboratory, Genome Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
Jonathan R Manning European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK
Lea Bellenger ARTbio, Sorbonne Université, CNRS FR 3631, Inserm US 037, Paris, France
Maria A Doyle Research Computing Facility, Peter MacCallum Cancer Centre, Melbourne, 305 Grattan Street, Victoria 3000, Australia; Sir Peter MacCallum Department of Oncology, The University of Melbourne, Victoria 3010, Australia
Mohammad Heydarian Department of Biology, Johns Hopkins University, Mudd Hall 144, 3400 N. Charles Street, Baltimore, MD 21218, USA
Ni Huang European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK; Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SA, UK
Nicola Soranzo Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
Pablo Moreno European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK
Stefan Mautner Department of Bioinformatics, University of Freiburg, Georges-Köhler-Allee 106, 79110 Freiburg, Germany
Irene Papatheodorou European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK
Anton Nekrutenko Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA 16802, USA
James Taylor Department of Biology, Johns Hopkins University, Mudd Hall 144, 3400 N. Charles Street, Baltimore, MD 21218, USA
Daniel Blankenberg Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, 9500 Euclid Avenue, NB21 Cleveland, OH 44195, USA
Rolf Backofen Department of Bioinformatics, University of Freiburg, Georges-Köhler-Allee 106, 79110 Freiburg, Germany
Björn Grüning Department of Bioinformatics, University of Freiburg, Georges-Köhler-Allee 106, 79110 Freiburg, Germany

120

Li B, Gould J, Yang Y, Sarkizova S, Tabaka M, Ashenberg O, Rosen Y, Slyper M, Kowalczyk MS, Villani AC, Tickle T, Hacohen N, Rozenblatt-Rosen O, Regev A. Cumulus provides cloud-based data analysis for large-scale single-cell and single-nucleus RNA-seq. Nat Methods 2020;17:793-798. [PMID: 32719530 PMCID: PMC7437817 DOI: 10.1038/s41592-020-0905-x] [Citation(s) in RCA: 104] [Impact Index Per Article: 26.0] [Received: 10/29/2019] [Accepted: 06/18/2020] [Indexed: 11/10/2022]

Affiliation(s)

Bo Li Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, USA. Division of Rheumatology, Allergy, and Immunology, Center for Immunology and Inflammatory Diseases, Massachusetts General Hospital, Boston, MA, USA. Department of Medicine, Harvard Medical School, Boston, MA, USA.
Joshua Gould Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, USA
Yiming Yang Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, USA; Division of Rheumatology, Allergy, and Immunology, Center for Immunology and Inflammatory Diseases, Massachusetts General Hospital, Boston, MA, USA
Siranush Sarkizova Broad Institute of Harvard and MIT, Cambridge, MA, USA; Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
Marcin Tabaka Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, USA
Orr Ashenberg Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, USA
Yanay Rosen Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, USA
Michal Slyper Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, USA
Monika S Kowalczyk Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, USA
Alexandra-Chloé Villani Division of Rheumatology, Allergy, and Immunology, Center for Immunology and Inflammatory Diseases, Massachusetts General Hospital, Boston, MA, USA; Department of Medicine, Harvard Medical School, Boston, MA, USA; Broad Institute of Harvard and MIT, Cambridge, MA, USA; Center for Cancer Research, Massachusetts General Hospital, Boston, MA, USA
Timothy Tickle Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, USA
Nir Hacohen Department of Medicine, Harvard Medical School, Boston, MA, USA; Broad Institute of Harvard and MIT, Cambridge, MA, USA; Center for Cancer Research, Massachusetts General Hospital, Boston, MA, USA
Orit Rozenblatt-Rosen Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, USA.
Aviv Regev Klarman Cell Observatory, Broad Institute of Harvard and MIT, Cambridge, MA, USA. Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA, USA. Koch Institute of Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA, USA. Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA.

121

Hie B, Peters J, Nyquist SK, Shalek AK, Berger B, Bryson BD. Computational Methods for Single-Cell RNA Sequencing. Annu Rev Biomed Data Sci 2020. [DOI: 10.1146/annurev-biodatasci-012220-100601] [Citation(s) in RCA: 46] [Impact Index Per Article: 11.5] [Indexed: 11/09/2022]

122

Niebler S, Müller A, Hankeln T, Schmidt B. RainDrop: Rapid activation matrix computation for droplet-based single-cell RNA-seq reads. BMC Bioinformatics 2020;21:274. [PMID: 32611394 PMCID: PMC7329424 DOI: 10.1186/s12859-020-03593-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Received: 01/16/2020] [Accepted: 06/09/2020] [Indexed: 12/19/2022] Open

123

Srivastava A, Malik L, Sarkar H, Patro R. A Bayesian framework for inter-cellular information sharing improves dscRNA-seq quantification. Bioinformatics 2020;36:i292-i299. [PMID: 32657394 PMCID: PMC7355277 DOI: 10.1093/bioinformatics/btaa450] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Indexed: 11/30/2022] Open

124

Qian H, Kang X, Hu J, Zhang D, Liang Z, Meng F, Zhang X, Xue Y, Maimon R, Dowdy SF, Devaraj NK, Zhou Z, Mobley WC, Cleveland DW, Fu XD. Reversing a model of Parkinson's disease with in situ converted nigral neurons. Nature 2020;582:550-556. [PMID: 32581380 PMCID: PMC7521455 DOI: 10.1038/s41586-020-2388-4] [Citation(s) in RCA: 291] [Impact Index Per Article: 72.8] [Received: 11/12/2018] [Accepted: 05/13/2020] [Indexed: 12/21/2022]

Affiliation(s)

Hao Qian Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA, USA
Xinjiang Kang State Key Laboratory of Membrane Biology and Peking-Tsinghua Center for Life Sciences, Institute of Molecular Medicine, Peking University, Beijing, China; MOE Key Lab of Medical Electrophysiology, ICR, Southwest Medical University, Luzhou, China
Jing Hu Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA, USA; Sichuan Provincial Key Laboratory for Human Disease Gene Study, Sichuan Provincial People's Hospital, University of Electronic Science and Technology of China, Chengdu, China
Dongyang Zhang Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, CA, USA
Zhengyu Liang Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA, USA
Fan Meng Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA, USA
Xuan Zhang Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA, USA
Yuanchao Xue Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA, USA; Key Laboratory of RNA Biology, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
Roy Maimon Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA, USA; Ludwig Institute for Cancer Research, University of California, San Diego, La Jolla, CA, USA
Steven F Dowdy Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA, USA
Neal K Devaraj Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, CA, USA
Zhuan Zhou State Key Laboratory of Membrane Biology and Peking-Tsinghua Center for Life Sciences, Institute of Molecular Medicine, Peking University, Beijing, China
William C Mobley Department of Neurosciences and Center for Neural Circuits and Behavior, University of California, San Diego, La Jolla, CA, USA
Don W Cleveland Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA, USA; Ludwig Institute for Cancer Research, University of California, San Diego, La Jolla, CA, USA
Xiang-Dong Fu Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA, USA; Institute of Genomic Medicine, University of California, San Diego, La Jolla, CA, USA

125

Van de Sande B, Flerin C, Davie K, De Waegeneer M, Hulselmans G, Aibar S, Seurinck R, Saelens W, Cannoodt R, Rouchon Q, Verbeiren T, De Maeyer D, Reumers J, Saeys Y, Aerts S. A scalable SCENIC workflow for single-cell gene regulatory network analysis. Nat Protoc 2020;15:2247-2276. [PMID: 32561888 DOI: 10.1038/s41596-020-0336-2] [Citation(s) in RCA: 518] [Impact Index Per Article: 129.5] [Received: 09/25/2019] [Accepted: 04/17/2020] [Indexed: 11/09/2022]

Affiliation(s)

Bram Van de Sande VIB Center for Brain & Disease Research, KU Leuven, Leuven, Belgium; Department of Human Genetics, KU Leuven, Leuven, Belgium
Christopher Flerin VIB Center for Brain & Disease Research, KU Leuven, Leuven, Belgium; Department of Human Genetics, KU Leuven, Leuven, Belgium
Kristofer Davie VIB Center for Brain & Disease Research, KU Leuven, Leuven, Belgium
Maxime De Waegeneer VIB Center for Brain & Disease Research, KU Leuven, Leuven, Belgium; Department of Human Genetics, KU Leuven, Leuven, Belgium
Gert Hulselmans VIB Center for Brain & Disease Research, KU Leuven, Leuven, Belgium; Department of Human Genetics, KU Leuven, Leuven, Belgium
Sara Aibar VIB Center for Brain & Disease Research, KU Leuven, Leuven, Belgium; Department of Human Genetics, KU Leuven, Leuven, Belgium
Ruth Seurinck Data Mining and Modelling for Biomedicine, VIB Center for Inflammation Research, Ghent, Belgium; Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium
Wouter Saelens Data Mining and Modelling for Biomedicine, VIB Center for Inflammation Research, Ghent, Belgium; Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium
Robrecht Cannoodt Data Mining and Modelling for Biomedicine, VIB Center for Inflammation Research, Ghent, Belgium; Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium; Center for Medical Genetics, Ghent University Hospital, Ghent, Belgium
Quentin Rouchon Data Mining and Modelling for Biomedicine, VIB Center for Inflammation Research, Ghent, Belgium; Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium
Toni Verbeiren Janssen Pharmaceutica, Beerse, Belgium; Data Intuitive, Ghent, Belgium
Dries De Maeyer Janssen Pharmaceutica, Beerse, Belgium
Joke Reumers Janssen Pharmaceutica, Beerse, Belgium
Yvan Saeys Data Mining and Modelling for Biomedicine, VIB Center for Inflammation Research, Ghent, Belgium; Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium
Stein Aerts VIB Center for Brain & Disease Research, KU Leuven, Leuven, Belgium; Department of Human Genetics, KU Leuven, Leuven, Belgium

126

Giansanti V, Tang M, Cittaro D. Fast analysis of scATAC-seq data using a predefined set of genomic regions. F1000Res 2020;9:199. [PMID: 32595951 PMCID: PMC7308914 DOI: 10.12688/f1000research.22731.2] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Accepted: 05/20/2020] [Indexed: 11/20/2022] Open

127

Love MI, Soneson C, Hickey PF, Johnson LK, Pierce NT, Shepherd L, Morgan M, Patro R. Tximeta: Reference sequence checksums for provenance identification in RNA-seq. PLoS Comput Biol 2020;16:e1007664. [PMID: 32097405 PMCID: PMC7059966 DOI: 10.1371/journal.pcbi.1007664] [Citation(s) in RCA: 154] [Impact Index Per Article: 38.5] [Received: 09/27/2019] [Revised: 03/06/2020] [Accepted: 01/18/2020] [Indexed: 11/19/2022] Open

128

Amezquita RA, Lun ATL, Becht E, Carey VJ, Carpp LN, Geistlinger L, Marini F, Rue-Albrecht K, Risso D, Soneson C, Waldron L, Pagès H, Smith ML, Huber W, Morgan M, Gottardo R, Hicks SC. Orchestrating single-cell analysis with Bioconductor. Nat Methods 2020;17:137-145. [PMID: 31792435 PMCID: PMC7358058 DOI: 10.1038/s41592-019-0654-x] [Citation(s) in RCA: 410] [Impact Index Per Article: 102.5] [Received: 03/26/2019] [Revised: 09/13/2019] [Accepted: 10/14/2019] [Indexed: 12/24/2022]

Affiliation(s)

Robert A Amezquita Fred Hutchinson Cancer Research Center, Seattle, WA, USA
Aaron T L Lun Cancer Research UK Cambridge Institute, University of Cambridge, Cambridge, UK; Bioinformatics and Computational Biology, Genentech Inc., San Francisco, CA, USA
Etienne Becht Fred Hutchinson Cancer Research Center, Seattle, WA, USA
Vince J Carey Channing Division of Network Medicine, Brigham And Women's Hospital, Boston, MA, USA
Lindsay N Carpp Fred Hutchinson Cancer Research Center, Seattle, WA, USA
Ludwig Geistlinger Graduate School of Public Health and Health Policy, City University of New York, New York, NY, USA; Institute for Implementation Science in Population Health, City University of New York, New York, NY, USA
Federico Marini Center for Thrombosis and Hemostasis, Mainz, Germany; Institute of Medical Biostatistics, Epidemiology and Informatics, Mainz, Germany
Kevin Rue-Albrecht Kennedy Institute of Rheumatology, University of Oxford, Oxford, UK
Davide Risso Department of Statistical Sciences, University of Padua, Padua, Italy; Division of Biostatistics and Epidemiology, Department of Healthcare Policy and Research, Weill Cornell Medicine, New York, NY, USA
Charlotte Soneson Friedrich Miescher Institute for Biomedical Research, Basel, Switzerland; SIB Swiss Institute of Bioinformatics, Basel, Switzerland
Levi Waldron Graduate School of Public Health and Health Policy, City University of New York, New York, NY, USA; Institute for Implementation Science in Population Health, City University of New York, New York, NY, USA
Hervé Pagès Fred Hutchinson Cancer Research Center, Seattle, WA, USA
Mike L Smith European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
Wolfgang Huber European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
Martin Morgan Biostatistics and Bioinformatics, Roswell Park Comprehensive Cancer Center, Buffalo, NY, USA
Raphael Gottardo Fred Hutchinson Cancer Research Center, Seattle, WA, USA
Stephanie C Hicks Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA

129

Papatheodorou I, Moreno P, Manning J, Fuentes AMP, George N, Fexova S, Fonseca NA, Füllgrabe A, Green M, Huang N, Huerta L, Iqbal H, Jianu M, Mohammed S, Zhao L, Jarnuczak AF, Jupp S, Marioni J, Meyer K, Petryszak R, Prada Medina CA, Talavera-López C, Teichmann S, Vizcaino JA, Brazma A. Expression Atlas update: from tissues to single cells. Nucleic Acids Res 2020;48:D77-D83. [PMID: 31665515 PMCID: PMC7145605 DOI: 10.1093/nar/gkz947] [Citation(s) in RCA: 202] [Impact Index Per Article: 50.5] [Received: 09/15/2019] [Revised: 10/07/2019] [Accepted: 10/16/2019] [Indexed: 12/16/2022] Open

Affiliation(s)

Irene Papatheodorou European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Hinxton, UK
Pablo Moreno European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Hinxton, UK
Jonathan Manning European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Hinxton, UK
Alfonso Muñoz-Pomer Fuentes European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Hinxton, UK
Nancy George European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Hinxton, UK
Silvie Fexova European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Hinxton, UK
Nuno A Fonseca European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Hinxton, UK
Anja Füllgrabe European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Hinxton, UK
Matthew Green European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Hinxton, UK
Ni Huang European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Hinxton, UK; Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK
Laura Huerta European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Hinxton, UK
Haider Iqbal European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Hinxton, UK
Monica Jianu European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Hinxton, UK
Suhaib Mohammed European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Hinxton, UK
Lingyun Zhao European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Hinxton, UK
Andrew F Jarnuczak European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Hinxton, UK
Simon Jupp European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Hinxton, UK
John Marioni European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Hinxton, UK; Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK; Cancer Research UK Cambridge Institute, University of Cambridge, Cambridge, UK
Kerstin Meyer Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK
Robert Petryszak European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Hinxton, UK
Cesar Augusto Prada Medina European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Hinxton, UK
Carlos Talavera-López Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK
Sarah Teichmann Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK
Juan Antonio Vizcaino European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Hinxton, UK
Alvis Brazma European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Hinxton, UK

130

Liu D. Algorithms for efficiently collapsing reads with Unique Molecular Identifiers. PeerJ 2019;7:e8275. [PMID: 31871845 PMCID: PMC6921982 DOI: 10.7717/peerj.8275] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Received: 07/23/2019] [Accepted: 11/22/2019] [Indexed: 11/20/2022] Open

131

Zhu A, Srivastava A, Ibrahim JG, Patro R, Love MI. Nonparametric expression analysis using inferential replicate counts. Nucleic Acids Res 2019;47:e105. [PMID: 31372651 PMCID: PMC6765120 DOI: 10.1093/nar/gkz622] [Citation(s) in RCA: 47] [Impact Index Per Article: 9.4] [Received: 05/01/2019] [Revised: 06/11/2019] [Accepted: 07/11/2019] [Indexed: 11/13/2022] Open

132

Sarkar H, Srivastava A, Patro R. Minnow: a principled framework for rapid simulation of dscRNA-seq data at the read level. Bioinformatics 2019;35:i136-i144. [PMID: 31510649 PMCID: PMC6612833 DOI: 10.1093/bioinformatics/btz351] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Indexed: 11/18/2022] Open

Abstract

SUMMARY

With the advancement of high-throughput single-cell RNA-sequencing protocols, there has been a rapid increase in the tools available to perform an array of analyses on the gene expression data that result from such studies. For example, there exist methods for pseudo-time series analysis, differential cell usage, cell-type detection, RNA-velocity in single cells, etc. Most analysis pipelines validate their results using known marker genes (which are not widely available for all types of analysis) and by using simulated data from gene-count-level simulators. Typically, the impact of using different read-alignment or unique molecular identifier (UMI) deduplication methods has not been widely explored. Assessments based on simulation tend to start from a simulated count matrix, ignoring the effect that different approaches for resolving UMI counts from the raw read data may produce. Here, we present minnow, a comprehensive sequence-level droplet-based single-cell RNA-sequencing (dscRNA-seq) experiment simulation framework. Minnow accounts for important sequence-level characteristics of experimental scRNA-seq datasets and models effects such as polymerase chain reaction amplification, cellular barcode (CB) and UMI selection, sequence fragmentation and sequencing. It also closely matches the gene-level ambiguity characteristics that are observed in real scRNA-seq experiments. Using minnow, we explore the performance of some common processing pipelines to produce gene-by-cell count matrices from droplet-based scRNA-seq data, demonstrate the effect that realistic levels of gene-level sequence ambiguity can have on accurate quantification and show a typical use-case of minnow in assessing the output generated by different quantification pipelines on the simulated experiment.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.
