Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For:	Clerget-Darpoux F, Elston RC. Will formal genetics become dispensable? Hum Hered 2013;76:47-52. [PMID: 24107572 DOI: 10.1159/000354571] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open

Number

Cited by Other Article(s)

Elston RC. An Accidental Genetic Epidemiologist. Annu Rev Genomics Hum Genet 2020;21:15-36. [DOI: 10.1146/annurev-genom-103119-125052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Amorim A, Pinto N. Big data in forensic genetics. Forensic Sci Int Genet 2018;37:102-105. [PMID: 30142461 DOI: 10.1016/j.fsigen.2018.08.001] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2018] [Revised: 07/23/2018] [Accepted: 08/01/2018] [Indexed: 12/16/2022]

Fang H, Wu Y, Narzisi G, O'Rawe JA, Barrón LTJ, Rosenbaum J, Ronemus M, Iossifov I, Schatz MC, Lyon GJ. Reducing INDEL calling errors in whole genome and exome sequencing data. Genome Med 2014;6:89. [PMID: 25426171 PMCID: PMC4240813 DOI: 10.1186/s13073-014-0089-z] [Citation(s) in RCA: 120] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2014] [Accepted: 10/16/2014] [Indexed: 12/30/2022] Open

Abstract

Background

INDELs, especially those disrupting protein-coding regions of the genome, have been strongly associated with human diseases. However, there are still many errors with INDEL variant calling, driven by library preparation, sequencing biases, and algorithm artifacts.

Methods

We characterized whole genome sequencing (WGS), whole exome sequencing (WES), and PCR-free sequencing data from the same samples to investigate the sources of INDEL errors. We also developed a classification scheme based on the coverage and composition to rank high and low quality INDEL calls. We performed a large-scale validation experiment on 600 loci, and find high-quality INDELs to have a substantially lower error rate than low-quality INDELs (7% vs. 51%).

Results

Simulation and experimental data show that assembly based callers are significantly more sensitive and robust for detecting large INDELs (>5 bp) than alignment based callers, consistent with published data. The concordance of INDEL detection between WGS and WES is low (53%), and WGS data uniquely identifies 10.8-fold more high-quality INDELs. The validation rate for WGS-specific INDELs is also much higher than that for WES-specific INDELs (84% vs. 57%), and WES misses many large INDELs. In addition, the concordance for INDEL detection between standard WGS and PCR-free sequencing is 71%, and standard WGS data uniquely identifies 6.3-fold more low-quality INDELs. Furthermore, accurate detection with Scalpel of heterozygous INDELs requires 1.2-fold higher coverage than that for homozygous INDELs. Lastly, homopolymer A/T INDELs are a major source of low-quality INDEL calls, and they are highly enriched in the WES data.

Conclusions

Overall, we show that accuracy of INDEL detection with WGS is much greater than WES even in the targeted region. We calculated that 60X WGS depth of coverage from the HiSeq platform is needed to recover 95% of INDELs detected by Scalpel. While this is higher than current sequencing practice, the deeper coverage may save total project costs because of the greater accuracy and sensitivity. Finally, we investigate sources of INDEL errors (for example, capture deficiency, PCR amplification, homopolymers) with various data that will serve as a guideline to effectively reduce INDEL errors in genome sequencing.

Electronic supplementary material

The online version of this article (doi:10.1186/s13073-014-0089-z) contains supplementary material, which is available to authorized users.

Collapse

Affiliation(s)

Han Fang Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring Harbor, NY USA ; Stony Brook University, 100 Nicolls Rd, Stony Brook, NY USA ; Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring Harbor, NY USA
Yiyang Wu Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring Harbor, NY USA ; Stony Brook University, 100 Nicolls Rd, Stony Brook, NY USA
Giuseppe Narzisi Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring Harbor, NY USA ; New York Genome Center, New York, NY USA
Jason A O'Rawe Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring Harbor, NY USA ; Stony Brook University, 100 Nicolls Rd, Stony Brook, NY USA
Laura T Jimenez Barrón Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring Harbor, NY USA ; Centro de Ciencias Genomicas, Universidad Nacional Autonoma de Mexico, Cuernavaca, Morelos Mexico
Julie Rosenbaum Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring Harbor, NY USA
Michael Ronemus Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring Harbor, NY USA
Ivan Iossifov Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring Harbor, NY USA
Michael C Schatz Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring Harbor, NY USA
Gholson J Lyon Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring Harbor, NY USA ; Stony Brook University, 100 Nicolls Rd, Stony Brook, NY USA

Collapse

Saint Pierre A, Genin E. How important are rare variants in common disease? Brief Funct Genomics 2014;13:353-61. [DOI: 10.1093/bfgp/elu025] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open

Schrodi SJ, Mukherjee S, Shan Y, Tromp G, Sninsky JJ, Callear AP, Carter TC, Ye Z, Haines JL, Brilliant MH, Crane PK, Smelser DT, Elston RC, Weeks DE. Genetic-based prediction of disease traits: prediction is very difficult, especially about the future. Front Genet 2014;5:162. [PMID: 24917882 PMCID: PMC4040440 DOI: 10.3389/fgene.2014.00162] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2014] [Accepted: 05/15/2014] [Indexed: 01/08/2023] Open