Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Wu SH, Schwartz RS, Winter DJ, Conrad DF, Cartwright RA. Estimating error models for whole genome sequencing using mixtures of Dirichlet-multinomial distributions. Bioinformatics 2017;33:2322-2329. [PMID: 28334373 PMCID: PMC5860108 DOI: 10.1093/bioinformatics/btx133] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2016] [Revised: 01/22/2017] [Accepted: 03/07/2017] [Indexed: 12/30/2022] Open

For:	Wu SH, Schwartz RS, Winter DJ, Conrad DF, Cartwright RA. Estimating error models for whole genome sequencing using mixtures of Dirichlet-multinomial distributions. Bioinformatics 2017;33:2322-2329. [PMID: 28334373 PMCID: PMC5860108 DOI: 10.1093/bioinformatics/btx133] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2016] [Revised: 01/22/2017] [Accepted: 03/07/2017] [Indexed: 12/30/2022] Open

Number

Cited by Other Article(s)

Mangiola S, Roth-Schulze AJ, Trussart M, Zozaya-Valdés E, Ma M, Gao Z, Rubin AF, Speed TP, Shim H, Papenfuss AT. sccomp: Robust differential composition and variability analysis for single-cell data. Proc Natl Acad Sci U S A 2023;120:e2203828120. [PMID: 37549298 PMCID: PMC10438834 DOI: 10.1073/pnas.2203828120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2022] [Accepted: 05/18/2023] [Indexed: 08/09/2023] Open

Affiliation(s)

Stefano Mangiola Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC3052, Australia Department of Medical Biology, University of Melbourne, Parkville, VIC3052, Australia
Alexandra J. Roth-Schulze Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC3052, Australia Department of Medical Biology, University of Melbourne, Parkville, VIC3052, Australia
Marie Trussart Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC3052, Australia
Enrique Zozaya-Valdés Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC3052, Australia Department of Medical Biology, University of Melbourne, Parkville, VIC3052, Australia
Mengyao Ma Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC3052, Australia
Zijie Gao Melbourne Integrative Genomics, University of Melbourne, Parkville, VIC3052, Australia School of Mathematics and Statistics, University of Melbourne, Parkville, VIC3052, Australia
Alan F. Rubin Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC3052, Australia Department of Medical Biology, University of Melbourne, Parkville, VIC3052, Australia
Terence P. Speed Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC3052, Australia
Heejung Shim Melbourne Integrative Genomics, University of Melbourne, Parkville, VIC3052, Australia School of Mathematics and Statistics, University of Melbourne, Parkville, VIC3052, Australia
Anthony T. Papenfuss Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC3052, Australia Department of Medical Biology, University of Melbourne, Parkville, VIC3052, Australia

Collapse

Becker D, Champredon D, Chato C, Gugan G, Poon A. SUP: a probabilistic framework to propagate genome sequence uncertainty, with applications. NAR Genom Bioinform 2023;5:lqad038. [PMID: 37101658 PMCID: PMC10124968 DOI: 10.1093/nargab/lqad038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2022] [Revised: 02/15/2023] [Accepted: 04/06/2023] [Indexed: 04/28/2023] Open

Dang Z, Yang J, Wang L, Tao Q, Zhang F, Zhang Y, Luo Z. Sampling Variation of RAD-Seq Data from Diploid and Tetraploid Potato (Solanum tuberosum L.). PLANTS 2021;10:plants10020319. [PMID: 33562246 PMCID: PMC7915145 DOI: 10.3390/plants10020319] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/27/2020] [Revised: 01/24/2021] [Accepted: 02/02/2021] [Indexed: 12/02/2022]

Winter DJ, Wu SH, Howell AA, Azevedo RBR, Zufall RA, Cartwright RA. accuMUlate: a mutation caller designed for mutation accumulation experiments. Bioinformatics 2019;34:2659-2660. [PMID: 29566129 DOI: 10.1093/bioinformatics/bty165] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2017] [Accepted: 03/15/2018] [Indexed: 11/13/2022] Open

Günther T, Nettelblad C. The presence and impact of reference bias on population genomic studies of prehistoric human populations. PLoS Genet 2019;15:e1008302. [PMID: 31348818 PMCID: PMC6685638 DOI: 10.1371/journal.pgen.1008302] [Citation(s) in RCA: 103] [Impact Index Per Article: 20.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2019] [Revised: 08/07/2019] [Accepted: 07/10/2019] [Indexed: 11/18/2022] Open

Abstract

Haploid high quality reference genomes are an important resource in genomic research projects. A consequence is that DNA fragments carrying the reference allele will be more likely to map successfully, or receive higher quality scores. This reference bias can have effects on downstream population genomic analysis when heterozygous sites are falsely considered homozygous for the reference allele. In palaeogenomic studies of human populations, mapping against the human reference genome is used to identify endogenous human sequences. Ancient DNA studies usually operate with low sequencing coverages and fragmentation of DNA molecules causes a large proportion of the sequenced fragments to be shorter than 50 bp-reducing the amount of accepted mismatches, and increasing the probability of multiple matching sites in the genome. These ancient DNA specific properties are potentially exacerbating the impact of reference bias on downstream analyses, especially since most studies of ancient human populations use pseudo-haploid data, i.e. they randomly sample only one sequencing read per site. We show that reference bias is pervasive in published ancient DNA sequence data of prehistoric humans with some differences between individual genomic regions. We illustrate that the strength of reference bias is negatively correlated with fragment length. Most genomic regions we investigated show little to no mapping bias but even a small proportion of sites with bias can impact analyses of those particular loci or slightly skew genome-wide estimates. Therefore, reference bias has the potential to cause minor but significant differences in the results of downstream analyses such as population allele sharing, heterozygosity estimates and estimates of archaic ancestry. These spurious results highlight how important it is to be aware of these technical artifacts and that we need strategies to mitigate the effect. Therefore, we suggest some post-mapping filtering strategies to resolve reference bias which help to reduce its impact substantially.

Collapse

Wong TKF, Ranjard L, Lin Y, Rodrigo AG. HaploJuice : accurate haplotype assembly from a pool of sequences with known relative concentrations. BMC Bioinformatics 2018;19:389. [PMID: 30348075 PMCID: PMC6198429 DOI: 10.1186/s12859-018-2424-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2018] [Accepted: 10/09/2018] [Indexed: 11/10/2022] Open

Spooner W, McLaren W, Slidel T, Finch DK, Butler R, Campbell J, Eghobamien L, Rider D, Kiefer CM, Robinson MJ, Hardman C, Cunningham F, Vaughan T, Flicek P, Huntington CC. Haplosaurus computes protein haplotypes for use in precision drug design. Nat Commun 2018;9:4128. [PMID: 30297836 PMCID: PMC6175845 DOI: 10.1038/s41467-018-06542-1] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2017] [Accepted: 09/07/2018] [Indexed: 01/08/2023] Open

Ranjard L, Wong TKF, Rodrigo AG. Reassembling haplotypes in a mixture of pooled amplicons when the relative concentrations are known: A proof-of-concept study on the efficient design of next-generation sequencing strategies. PLoS One 2018;13:e0195090. [PMID: 29621260 PMCID: PMC5886459 DOI: 10.1371/journal.pone.0195090] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2017] [Accepted: 03/18/2018] [Indexed: 12/02/2022] Open