Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Wu MC, Lee S, Cai T, Li Y, Boehnke M, Lin X. Rare-variant association testing for sequencing data with the sequence kernel association test. Am J Hum Genet 2011;89:82-93. [PMID: 21737059 DOI: 10.1016/j.ajhg.2011.05.029] [Citation(s) in RCA: 1685] [Impact Index Per Article: 129.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2011] [Revised: 05/27/2011] [Accepted: 05/30/2011] [Indexed: 01/18/2023] Open

Number

Cited by Other Article(s)

1651

Beaudoin M, Lo KS, N'Diaye A, Rivas MA, Dubé MP, Laplante N, Phillips MS, Rioux JD, Tardif JC, Lettre G. Pooled DNA resequencing of 68 myocardial infarction candidate genes in French Canadians. Circ Cardiovasc Genet 2012;5:547-54. [PMID: 22923420 DOI: 10.1161/circgenetics.112.963165] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

1652

Maity A, Sullivan PF, Tzeng JY. Multivariate phenotype association analysis by marker-set kernel machine regression. Genet Epidemiol 2012;36:686-95. [PMID: 22899176 DOI: 10.1002/gepi.21663] [Citation(s) in RCA: 68] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2012] [Revised: 05/23/2012] [Accepted: 06/18/2012] [Indexed: 11/06/2022]

1653

Lee S, Emond MJ, Bamshad MJ, Barnes KC, Rieder MJ, Nickerson DA, Christiani D, Wurfel M, Lin X. Optimal unified approach for rare-variant association testing with application to small-sample case-control whole-exome sequencing studies. Am J Hum Genet 2012;91:224-37. [PMID: 22863193 DOI: 10.1016/j.ajhg.2012.06.007] [Citation(s) in RCA: 712] [Impact Index Per Article: 59.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2012] [Revised: 05/22/2012] [Accepted: 06/12/2012] [Indexed: 12/23/2022] Open

1654

Epstein M, Duncan R, Jiang Y, Conneely K, Allen A, Satten G. A permutation procedure to correct for confounders in case-control studies, including tests of rare variation. Am J Hum Genet 2012;91:215-23. [PMID: 22818855 DOI: 10.1016/j.ajhg.2012.06.004] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2012] [Revised: 05/03/2012] [Accepted: 06/05/2012] [Indexed: 01/30/2023] Open

1655

Xu C, Ladouceur M, Dastani Z, Richards JB, Ciampi A, Greenwood CMT. Multiple regression methods show great potential for rare variant association tests. PLoS One 2012;7:e41694. [PMID: 22916111 PMCID: PMC3420665 DOI: 10.1371/journal.pone.0041694] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2012] [Accepted: 06/25/2012] [Indexed: 01/08/2023] Open

1656

Cheung YH, Wang G, Leal SM, Wang S. A fast and noise-resilient approach to detect rare-variant associations with deep sequencing data for complex disorders. Genet Epidemiol 2012;36:675-85. [PMID: 22865616 DOI: 10.1002/gepi.21662] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2012] [Accepted: 06/14/2012] [Indexed: 11/11/2022]

1657

Kuk AY, Li X, Xu J. A fast collapsed data method for estimating haplotype frequencies from pooled genotype data with applications to the study of rare variants. Stat Med 2012;32:1343-60. [DOI: 10.1002/sim.5540] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2012] [Accepted: 06/11/2012] [Indexed: 12/31/2022]

1658

Chen G, Yuan A, Zhou Y, Bentley AR, Zhou J, Chen W, Shriner D, Adeyemo A, Rotimi CN. Simultaneous Analysis of Common and Rare Variants in Complex Traits: Application to SNPs (SCARVAsnp). Bioinform Biol Insights 2012;6:177-85. [PMID: 22904618 PMCID: PMC3418150 DOI: 10.4137/bbi.s9966] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open

1659

Smoothed functional principal component analysis for testing association of the entire allelic spectrum of genetic variation. Eur J Hum Genet 2012;21:217-24. [PMID: 22781089 DOI: 10.1038/ejhg.2012.141] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open

1660

Sha Q, Wang S, Zhang S. Adaptive clustering and adaptive weighting methods to detect disease associated rare variants. Eur J Hum Genet 2012;21:332-7. [PMID: 22781093 DOI: 10.1038/ejhg.2012.143] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open

1661

Chang D, Keinan A. Predicting signatures of "synthetic associations" and "natural associations" from empirical patterns of human genetic variation. PLoS Comput Biol 2012;8:e1002600. [PMID: 22792059 PMCID: PMC3390358 DOI: 10.1371/journal.pcbi.1002600] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2011] [Accepted: 05/23/2012] [Indexed: 11/18/2022] Open

Abstract

Genome-wide association studies (GWAS) have in recent years discovered thousands of associated markers for hundreds of phenotypes. However, associated loci often only explain a relatively small fraction of heritability and the link between association and causality has yet to be uncovered for most loci. Rare causal variants have been suggested as one scenario that may partially explain these shortcomings. Specifically, Dickson et al. recently reported simulations of rare causal variants that lead to association signals of common, tag single nucleotide polymorphisms, dubbed "synthetic associations". However, an open question is what practical implications synthetic associations have for GWAS. Here, we explore the signatures exhibited by such "synthetic associations" and their implications based on patterns of genetic variation observed in human populations, thus accounting for human evolutionary history, a force disregarded in previous simulation studies. This is made possible by human population genetic data from HapMap 3 consisting of both resequencing and array-based genotyping data for the same set of individuals from multiple populations. We report that synthetic associations tend to be further away from the underlying risk alleles compared to "natural associations" (i.e. associations due to underlying common causal variants), but to a much lesser extent than previously predicted, with both the age and the effect size of the risk allele playing a part in this phenomenon. We find that while a synthetic association has a lower probability of capturing causal variants within its linkage disequilibrium block, sequencing around the associated variant need not extend substantially to have a high probability of capturing at least one causal variant. We also show that the minor allele frequency of synthetic associations is lower than that of natural associations for most, but not all, loci that we explored. Finally, we find the variance in associated allele frequency to be a potential indicator of synthetic associations.

1662

Lin WY, Tiwari HK, Gao G, Zhang K, Arcaroli JJ, Abraham E, Liu N. Similarity-based multimarker association tests for continuous traits. Ann Hum Genet 2012;76:246-60. [PMID: 22497480 DOI: 10.1111/j.1469-1809.2012.00706.x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

1663

Single Nucleotide Polymorphism (SNP) Detection and Genotype Calling from Massively Parallel Sequencing (MPS) Data. STATISTICS IN BIOSCIENCES 2012;5:3-25. [PMID: 24489615 DOI: 10.1007/s12561-012-9067-4] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]

1664

Witte JS. Rare genetic variants and treatment response: sample size and analysis issues. Stat Med 2012;31:3041-50. [PMID: 22736504 DOI: 10.1002/sim.5428] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2011] [Accepted: 03/15/2012] [Indexed: 11/06/2022]

1665

Sha Q, Wang X, Wang X, Zhang S. Detecting association of rare and common variants by testing an optimally weighted combination of variants. Genet Epidemiol 2012;36:561-71. [PMID: 22714994 DOI: 10.1002/gepi.21649] [Citation(s) in RCA: 59] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2012] [Revised: 04/13/2012] [Accepted: 05/09/2012] [Indexed: 11/07/2022]

1666

Ionita-Laza I, Makarov V, Buxbaum JD. Scan-statistic approach identifies clusters of rare disease variants in LRP2, a gene linked and associated with autism spectrum disorders, in three datasets. Am J Hum Genet 2012;90:1002-13. [PMID: 22578327 DOI: 10.1016/j.ajhg.2012.04.010] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2012] [Revised: 02/27/2012] [Accepted: 04/19/2012] [Indexed: 01/20/2023] Open

1667

Kang G, Lin D, Hakonarson H, Chen J. Two-stage extreme phenotype sequencing design for discovering and testing common and rare genetic variants: efficiency and power. Hum Hered 2012;73:139-47. [PMID: 22678112 DOI: 10.1159/000337300] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2011] [Accepted: 02/10/2012] [Indexed: 01/10/2023] Open

Abstract

Next-generation sequencing technology provides an unprecedented opportunity to identify rare susceptibility variants. It is not yet financially feasible to perform whole-genome sequencing on a large number of subjects, and a two-stage design has been advocated as a practical option. In stage I, variants are discovered by sequencing the whole genomes of a small number of carefully selected individuals. In stage II, the discovered variants of a large number of individuals are genotyped to assess associations. Individuals with extreme phenotypes are typically selected in stage I. Using simulated data for unrelated individuals, we explore two important aspects of this two-stage design: the efficiency of discovering common and rare single-nucleotide polymorphisms (SNPs) in stage I and the impact of incomplete SNP discovery in stage I on the power of testing associations in stage II. We applied a sum test and a sum of squared score test for gene-based association analyses evaluating the power of the two-stage design. We obtained the following results from extensive simulation studies and analysis of the GAW17 dataset. When individuals with trait values more extreme than the 99.7-99th quantile were included in stage I, the two-stage design could achieve the same power as, or even higher power than, the one-stage design if the rare causal variants had large effect sizes. In such a design, fewer than half of the total SNPs including more than half of the causal SNPs were discovered, which included nearly all SNPs with minor allele frequencies (MAFs) ≥5%, more than half of the SNPs with MAFs between 1% and 5%, and fewer than half of the SNPs with MAFs <1%. Although a one-stage design may be preferable to identify multiple rare variants having small to moderate effect sizes, our observations support using the two-stage design as a cost-effective option for next-generation sequencing studies.

1668

Fang S, Sha Q, Zhang S. Two adaptive weighting methods to test for rare variant associations in family-based designs. Genet Epidemiol 2012;36:499-507. [PMID: 22674630 DOI: 10.1002/gepi.21646] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2011] [Revised: 04/26/2012] [Accepted: 04/26/2012] [Indexed: 11/06/2022]

1669

Köttgen A, Yang Q, Shimmin LC, Tin A, Schaeffer C, Coresh J, Liu X, Rampoldi L, Hwang SJ, Boerwinkle E, Hixson JE, Kao WHL, Fox CS. Association of estimated glomerular filtration rate and urinary uromodulin concentrations with rare variants identified by UMOD gene region sequencing. PLoS One 2012;7:e38311. [PMID: 22693617 PMCID: PMC3365030 DOI: 10.1371/journal.pone.0038311] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2012] [Accepted: 05/08/2012] [Indexed: 11/19/2022] Open

Abstract

BACKGROUND

Recent genome-wide association studies (GWAS) have identified common variants in the UMOD region associated with kidney function and disease in the general population. To identify novel rare variants as well as common variants that may account for this GWAS signal, the exons and 4 kb upstream region of UMOD were sequenced.

METHODOLOGY/PRINCIPAL FINDINGS

Individuals (n = 485) were selected based on presence of the GWAS risk haplotype and chronic kidney disease (CKD) in the ARIC Study and on the extremes of the UMOD gene product, uromodulin, in urine (Tamm Horsfall protein, THP) in the Framingham Heart Study (FHS). Targeted sequencing was conducted using capillary-based Sanger sequencing (3730 DNA Analyzer). Variants were tested for association with THP concentrations and estimated glomerular filtration rate (eGFR), and identified non-synonymous coding variants were genotyped in up to 22,546 follow-up samples. Twenty-four and 63 variants were identified in the 285 ARIC and 200 FHS participants, respectively. In both studies combined, there were 33 common and 54 rare (MAF<0.05) variants. Five non-synonymous rare variants were identified in FHS; borderline enrichment of rare variants was found in the extremes of THP (SKAT p-value = 0.08). Only V458L was associated with THP in the FHS general-population validation sample (p = 9×10^-3, n = 2,522), but did not show direction-consistent and significant association with eGFR in both the ARIC (n = 14,635) and FHS (n = 7,520) validation samples. Pooling all non-synonymous rare variants except V458L together showed non-significant associations with THP and eGFR in the FHS validation sample. Functional studies of V458L revealed no alterations in protein trafficking.

CONCLUSIONS/SIGNIFICANCE

Multiple novel rare variants in the UMOD region were identified, but none were consistently associated with eGFR in two independent study samples. Only V458L had modest association with THP levels in the general population and thus could not account for the observed GWAS signal.

Affiliation(s)

Anna Köttgen Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, United States of America Renal Division, Freiburg University Clinic, Freiburg, Germany
Qiong Yang Department of Biostatistics, Boston University School of Public Health, Boston, Massachusetts, United States of America
Lawrence C. Shimmin Human Genetics Center, Division of Epidemiology and Disease Control, UT-Houston School of Public Health, Houston, Texas, United States of America
Adrienne Tin Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, United States of America
Céline Schaeffer Dulbecco Telethon Institute and Division of Genetics and Cell Biology, San Raffaele Scientific Institute, Milan, Italy
Josef Coresh Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, United States of America Welch Center for Prevention, Epidemiology and Clinical Research, Johns Hopkins Medical Institutions, Baltimore, Maryland, United States of America
Xuan Liu Department of Biostatistics, Boston University School of Public Health, Boston, Massachusetts, United States of America
Luca Rampoldi Dulbecco Telethon Institute and Division of Genetics and Cell Biology, San Raffaele Scientific Institute, Milan, Italy
Shih-Jen Hwang NHLBI's Framingham Heart Study and the Center for Population Studies, Framingham, Massachusetts, United States of America
Eric Boerwinkle Human Genetics Center, Division of Epidemiology and Disease Control, UT-Houston School of Public Health, Houston, Texas, United States of America
James E. Hixson Human Genetics Center, Division of Epidemiology and Disease Control, UT-Houston School of Public Health, Houston, Texas, United States of America
W. H. Linda Kao Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, United States of America Welch Center for Prevention, Epidemiology and Clinical Research, Johns Hopkins Medical Institutions, Baltimore, Maryland, United States of America
Caroline S. Fox NHLBI's Framingham Heart Study and the Center for Population Studies, Framingham, Massachusetts, United States of America Division of Endocrinology, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts, United States of America

1670

Exome sequencing and the genetic basis of complex traits. Nat Genet 2012;44:623-30. [PMID: 22641211 DOI: 10.1038/ng.2303] [Citation(s) in RCA: 287] [Impact Index Per Article: 23.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]

1671

Li H. U-statistics in genetic association studies. Hum Genet 2012;131:1395-401. [PMID: 22610525 DOI: 10.1007/s00439-012-1178-y] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2012] [Accepted: 05/07/2012] [Indexed: 11/25/2022]

1672

Zhang F, Chen Y, Liu C, Lu T, Yan H, Ruan Y, Yue W, Wang L, Zhang D. Systematic association analysis of microRNA machinery genes with schizophrenia informs further study. Neurosci Lett 2012;520:47-50. [PMID: 22595464 DOI: 10.1016/j.neulet.2012.05.028] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2012] [Revised: 04/23/2012] [Accepted: 05/05/2012] [Indexed: 10/28/2022]

1673

Statistical Challenges in Sequence-Based Association Studies with Population- and Family-Based Designs. STATISTICS IN BIOSCIENCES 2012. [DOI: 10.1007/s12561-012-9062-9] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]

1674

Liu DJ, Leal SM. SEQCHIP: a powerful method to integrate sequence and genotype data for the detection of rare variant associations. Bioinformatics 2012;28:1745-51. [PMID: 22556370 DOI: 10.1093/bioinformatics/bts263] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]

1675

Current World Literature. Curr Opin Cardiol 2012;27:318-26. [DOI: 10.1097/hco.0b013e328352dfaf] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]

1676

Chen GK, Chen G, Wei P, DeStefano AL. Incorporating biological information into association studies of sequencing data. Genet Epidemiol 2012;35 Suppl 1:S29-34. [PMID: 22128055 DOI: 10.1002/gepi.20646] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

1677

Liu DJ, Leal SM. A unified framework for detecting rare variant quantitative trait associations in pedigree and unrelated individuals via sequence data. Hum Hered 2012;73:105-22. [PMID: 22555759 DOI: 10.1159/000336293] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2011] [Accepted: 01/07/2012] [Indexed: 11/19/2022] Open

1678

Bacanu SA. On optimal gene-based analysis of genome scans. Genet Epidemiol 2012;36:333-9. [PMID: 22508187 DOI: 10.1002/gepi.21625] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2011] [Revised: 12/22/2011] [Accepted: 01/30/2012] [Indexed: 11/06/2022]

1679

Neale BM, Kou Y, Liu L, Ma'ayan A, Samocha KE, Sabo A, Lin CF, Stevens C, Wang LS, Makarov V, Polak P, Yoon S, Maguire J, Crawford EL, Campbell NG, Geller ET, Valladares O, Shafer C, Liu H, Zhao T, Cai G, Lihm J, Dannenfelser R, Jabado O, Peralta Z, Nagaswamy U, Muzny D, Reid JG, Newsham I, Wu Y, Lewis L, Han Y, Voight BF, Lim E, Rossin E, Kirby A, Flannick J, Fromer M, Shakir K, Fennell T, Garimella K, Banks E, Poplin R, Gabriel S, DePristo M, Wimbish JR, Boone BE, Levy SE, Betancur C, Sunyaev S, Boerwinkle E, Buxbaum JD, Cook EH, Devlin B, Gibbs RA, Roeder K, Schellenberg GD, Sutcliffe JS, Daly MJ. Patterns and rates of exonic de novo mutations in autism spectrum disorders. Nature 2012;485:242-5. [PMID: 22495311 PMCID: PMC3613847 DOI: 10.1038/nature11011] [Citation(s) in RCA: 1278] [Impact Index Per Article: 106.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2011] [Accepted: 03/06/2012] [Indexed: 01/21/2023]

Affiliation(s)

Benjamin M. Neale Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, 02114 Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, 7 Cambridge Center, Cambridge, Massachusetts, 02142
Yan Kou Pharmacology and Systems Therapeutics, Mount Sinai School of Medicine, New York, New York, 10029 Seaver Autism Center for Research and Treatment, Mount Sinai School of Medicine, New York, New York, 10029
Li Liu Department of Statistics, Carnegie Mellon University, Pittsburgh, Pennsylvania, 15232
Avi Ma'ayan Pharmacology and Systems Therapeutics, Mount Sinai School of Medicine, New York, New York, 10029
Kaitlin E. Samocha Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, 02114 Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, 7 Cambridge Center, Cambridge, Massachusetts, 02142
Aniko Sabo Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, 77030
Chiao-Feng Lin Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, 19104
Christine Stevens Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, 7 Cambridge Center, Cambridge, Massachusetts, 02142
Li-San Wang Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, 19104
Vladimir Makarov Seaver Autism Center for Research and Treatment, Mount Sinai School of Medicine, New York, New York, 10029 Department of Psychiatry, Mount Sinai School of Medicine, New York, New York, 10029
Paz Polak Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, 7 Cambridge Center, Cambridge, Massachusetts, 02142 Division of Genetics, Department of Medicine Brigham & Women's Hospital and Harvard Medical School, Boston, Massachusetts, 02115
Seungtai Yoon Seaver Autism Center for Research and Treatment, Mount Sinai School of Medicine, New York, New York, 10029 Department of Psychiatry, Mount Sinai School of Medicine, New York, New York, 10029
Jared Maguire Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, 7 Cambridge Center, Cambridge, Massachusetts, 02142
Emily L. Crawford Vanderbilt Brain Institute, Departments of Molecular Physiology & Biophysics and Psychiatry, Vanderbilt University, Nashville, Tennessee, 37232
Nicholas G. Campbell Vanderbilt Brain Institute, Departments of Molecular Physiology & Biophysics and Psychiatry, Vanderbilt University, Nashville, Tennessee, 37232
Evan T. Geller Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, 19104
Otto Valladares Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, 19104
Chad Shafer Department of Statistics, Carnegie Mellon University, Pittsburgh, Pennsylvania, 15232
Han Liu Biostatistics Department and Computer Science Department, Johns Hopkins University, Baltimore, Maryland, 21205
Tuo Zhao Biostatistics Department and Computer Science Department, Johns Hopkins University, Baltimore, Maryland, 21205
Guiqing Cai Seaver Autism Center for Research and Treatment, Mount Sinai School of Medicine, New York, New York, 10029 Department of Psychiatry, Mount Sinai School of Medicine, New York, New York, 10029
Jayon Lihm Seaver Autism Center for Research and Treatment, Mount Sinai School of Medicine, New York, New York, 10029 Department of Psychiatry, Mount Sinai School of Medicine, New York, New York, 10029
Ruth Dannenfelser Pharmacology and Systems Therapeutics, Mount Sinai School of Medicine, New York, New York, 10029
Omar Jabado Genetics and Genomic Sciences, Mount Sinai School of Medicine, New York, New York, 10029
Zuleyma Peralta Genetics and Genomic Sciences, Mount Sinai School of Medicine, New York, New York, 10029
Uma Nagaswamy Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, 77030
Donna Muzny Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, 77030
Jeffrey G. Reid Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, 77030
Irene Newsham Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, 77030
Yuanqing Wu Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, 77030
Lora Lewis Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, 77030
Yi Han Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, 77030
Benjamin F. Voight Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, 7 Cambridge Center, Cambridge, Massachusetts, 02142 Department of Pharmacology, University of Pennsylvania, Perelman School of Medicine, Philadelphia, Pennsylvania 19104
Elaine Lim Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, 02114 Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, 7 Cambridge Center, Cambridge, Massachusetts, 02142
Elizabeth Rossin Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, 02114 Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, 7 Cambridge Center, Cambridge, Massachusetts, 02142
Andrew Kirby Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, 02114 Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, 7 Cambridge Center, Cambridge, Massachusetts, 02142
Jason Flannick Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, 7 Cambridge Center, Cambridge, Massachusetts, 02142
Menachem Fromer Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, 02114 Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, 7 Cambridge Center, Cambridge, Massachusetts, 02142
Khalid Shakir Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, 7 Cambridge Center, Cambridge, Massachusetts, 02142
Tim Fennell Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, 7 Cambridge Center, Cambridge, Massachusetts, 02142
Kiran Garimella Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, 7 Cambridge Center, Cambridge, Massachusetts, 02142
Eric Banks Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, 7 Cambridge Center, Cambridge, Massachusetts, 02142
Ryan Poplin Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, 7 Cambridge Center, Cambridge, Massachusetts, 02142
Stacey Gabriel Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, 7 Cambridge Center, Cambridge, Massachusetts, 02142
Mark DePristo Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, 7 Cambridge Center, Cambridge, Massachusetts, 02142
Jack R. Wimbish HudsonAlpha Institute for Biotechnology, Huntsville, Alabama, 35806
Braden E. Boone HudsonAlpha Institute for Biotechnology, Huntsville, Alabama, 35806
Shawn E. Levy HudsonAlpha Institute for Biotechnology, Huntsville, Alabama, 35806
Catalina Betancur INSERM U952 and CNRS UMR 7224 and UPMC Univ Paris 06, 75005 Paris, France
Shamil Sunyaev Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, 7 Cambridge Center, Cambridge, Massachusetts, 02142 Division of Genetics, Department of Medicine Brigham & Women's Hospital and Harvard Medical School, Boston, Massachusetts, 02115
Eric Boerwinkle Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, 77030 Human Genetics Center, University of Texas Health Science Center at Houston, Houston, Texas, 77030
Joseph D. Buxbaum Seaver Autism Center for Research and Treatment, Mount Sinai School of Medicine, New York, New York, 10029 Department of Psychiatry, Mount Sinai School of Medicine, New York, New York, 10029 Genetics and Genomic Sciences, Mount Sinai School of Medicine, New York, New York, 10029 Friedman Brain Institute, Mount Sinai School of Medicine, New York, New York, 10029
Edwin H. Cook Department of Psychiatry, University of Illinois at Chicago, Chicago, Illinois, 60608
Bernie Devlin Department of Psychiatry, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania, 15213
Richard A. Gibbs Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, 77030
Kathryn Roeder Department of Statistics, Carnegie Mellon University, Pittsburgh, Pennsylvania, 15232
Gerard D. Schellenberg Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, 19104
James S. Sutcliffe Vanderbilt Brain Institute, Departments of Molecular Physiology & Biophysics and Psychiatry, Vanderbilt University, Nashville, Tennessee, 37232
Mark J. Daly Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, 02114 Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, 7 Cambridge Center, Cambridge, Massachusetts, 02142

1680

Joint rare variant association test of the average and individual effects for sequencing studies. PLoS One 2012;7:e32485. [PMID: 22468164 PMCID: PMC3309869 DOI: 10.1371/journal.pone.0032485] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2011] [Accepted: 01/30/2012] [Indexed: 11/19/2022] Open

1681

Kinnamon DD, Hershberger RE, Martin ER. Reconsidering association testing methods using single-variant test statistics as alternatives to pooling tests for sequence data with rare variants. PLoS One 2012;7:e30238. [PMID: 22363423 PMCID: PMC3281828 DOI: 10.1371/journal.pone.0030238] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2011] [Accepted: 12/16/2011] [Indexed: 12/14/2022] Open

Abstract

Association tests that pool minor alleles into a measure of burden at a locus have been proposed for case-control studies using sequence data containing rare variants. However, such pooling tests are not robust to the inclusion of neutral and protective variants, which can mask the association signal from risk variants. Early studies proposing pooling tests dismissed methods for locus-wide inference using nonnegative single-variant test statistics based on unrealistic comparisons. However, such methods are robust to the inclusion of neutral and protective variants and therefore may be more useful than previously appreciated. In fact, some recently proposed methods derived within different frameworks are equivalent to performing inference on weighted sums of squared single-variant score statistics. In this study, we compared two existing methods for locus-wide inference using nonnegative single-variant test statistics to two widely cited pooling tests under more realistic conditions. We established analytic results for a simple model with one rare risk and one rare neutral variant, which demonstrated that pooling tests were less powerful than even Bonferroni-corrected single-variant tests in most realistic situations. We also performed simulations using variants with realistic minor allele frequency and linkage disequilibrium spectra, disease models with multiple rare risk variants and extensive neutral variation, and varying rates of missing genotypes. In all scenarios considered, existing methods using nonnegative single-variant test statistics had power comparable to or greater than two widely cited pooling tests. Moreover, in disease models with only rare risk variants, an existing method based on the maximum single-variant Cochran-Armitage trend chi-square statistic in the locus had power comparable to or greater than another existing method closely related to some recently proposed methods. We conclude that efficient locus-wide inference using single-variant test statistics should be reconsidered as a useful framework for devising powerful association tests in sequence data with rare variants.

1682

Ladouceur M, Dastani Z, Aulchenko YS, Greenwood CMT, Richards JB. The empirical power of rare variant association methods: results from Sanger sequencing in 1,998 individuals. PLoS Genet 2012;8:e1002496. [PMID: 22319458 PMCID: PMC3271058 DOI: 10.1371/journal.pgen.1002496] [Citation(s) in RCA: 89] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2011] [Accepted: 12/08/2011] [Indexed: 01/09/2023] Open

1683

Tomlinson I. Colorectal cancer genetics: from candidate genes to GWAS and back again. Mutagenesis 2012;27:141-2. [DOI: 10.1093/mutage/ger072] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open

1684

Detecting rare variant associations by identity-by-descent mapping in case-control studies. Genetics 2012;190:1521-31. [PMID: 22267498 PMCID: PMC3316661 DOI: 10.1534/genetics.111.136937] [Citation(s) in RCA: 76] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open

1685

Daye ZJ, Li H, Wei Z. A powerful test for multiple rare variants association studies that incorporates sequencing qualities. Nucleic Acids Res 2012;40:e60. [PMID: 22262732 PMCID: PMC3340416 DOI: 10.1093/nar/gks024] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open

1686

Xing G, Lin CY, Wooding SP, Xing C. Blindly using Wald's test can miss rare disease-causal variants in case-control association studies. Ann Hum Genet 2012;76:168-77. [PMID: 22256951 DOI: 10.1111/j.1469-1809.2011.00700.x] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]

1687

Pongpanich M, Neely ML, Tzeng JY. On the Aggregation of Multimarker Information for Marker-Set and Sequencing Data Analysis: Genotype Collapsing vs. Similarity Collapsing. Front Genet 2012;2:110. [PMID: 22303404 PMCID: PMC3266618 DOI: 10.3389/fgene.2011.00110] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2011] [Accepted: 12/25/2011] [Indexed: 12/12/2022] Open

Abstract

Methods that collapse information across genetic markers when searching for association signals are gaining momentum in the literature. Although originally developed to achieve a better balance between retaining information and controlling degrees of freedom when performing multimarker association analysis, these methods have recently been proven to be a powerful tool for identifying rare variants that contribute to complex phenotypes. The information among markers can be collapsed at the genotype level, which focuses on the mean of genetic information, or the similarity level, which focuses on the variance of genetic information. The aim of this work is to understand the strengths and weaknesses of these two collapsing strategies. Our results show that neither collapsing strategy outperforms the other across all simulated scenarios. Two factors that dominate the performance of these strategies are the signal-to-noise ratio and the underlying genetic architecture of the causal variants. Genotype collapsing is more sensitive to the marker set being contaminated by noise loci than similarity collapsing. In addition, genotype collapsing performs best when the genetic architecture of the causal variants is not complex (e.g., causal loci with similar effects and similar frequencies). Similarity collapsing is more robust as the complexity of the genetic architecture increases and outperforms genotype collapsing when the genetic architecture of the marker set becomes more sophisticated (e.g., causal loci with various effect sizes or frequencies and potential non-linear or interactive effects). Because the underlying genetic architecture is not known a priori, we also considered a two-stage analysis that combines the two top-performing methods from different collapsing strategies. We find that it is reasonably robust across all simulated scenarios.

1688

Ionita-Laza I, Makarov V, Yoon S, Raby B, Buxbaum J, Nicolae DL, Lin X. Finding disease variants in Mendelian disorders by using sequence data: methods and applications. Am J Hum Genet 2011;89:701-12. [PMID: 22137099 DOI: 10.1016/j.ajhg.2011.11.003] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2011] [Revised: 09/19/2011] [Accepted: 11/03/2011] [Indexed: 12/11/2022] Open

1689

Khetarpal SA, Edmondson AC, Raghavan A, Neeli H, Jin W, Badellino KO, Demissie S, Manning AK, DerOhannessian SL, Wolfe ML, Cupples LA, Li M, Kathiresan S, Rader DJ. Mining the LIPG allelic spectrum reveals the contribution of rare and common regulatory variants to HDL cholesterol. PLoS Genet 2011;7:e1002393. [PMID: 22174694 PMCID: PMC3234219 DOI: 10.1371/journal.pgen.1002393] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2011] [Accepted: 10/07/2011] [Indexed: 11/18/2022] Open

Abstract

Genome-wide association studies (GWAS) have successfully identified loci associated with quantitative traits, such as blood lipids. Deep resequencing studies are being utilized to catalogue the allelic spectrum at GWAS loci. The goal of these studies is to identify causative variants and missing heritability, including heritability due to low frequency and rare alleles with large phenotypic impact. Whereas rare variant efforts have primarily focused on nonsynonymous coding variants, we hypothesized that noncoding variants in these loci are also functionally important. Using the HDL-C gene LIPG as an example, we explored the effect of regulatory variants identified through resequencing of subjects at HDL-C extremes on gene expression, protein levels, and phenotype. Resequencing a portion of the LIPG promoter and 5' UTR in human subjects with extreme HDL-C, we identified several rare variants in individuals from both extremes. Luciferase reporter assays were used to measure the effect of these rare variants on LIPG expression. Variants conferring opposing effects on gene expression were enriched in opposite extremes of the phenotypic distribution. Minor alleles of a common regulatory haplotype and noncoding GWAS SNPs were associated with reduced plasma levels of the LIPG gene product endothelial lipase (EL), consistent with its role in HDL-C catabolism. Additionally, we found that a common nonfunctional coding variant associated with HDL-C (rs2000813) is in linkage disequilibrium with a 5' UTR variant (rs34474737) that decreases LIPG promoter activity. We attribute the gene regulatory role of rs34474737 to the observed association of the coding variant with plasma EL levels and HDL-C. Taken together, the findings show that both rare and common noncoding regulatory variants are important contributors to the allelic spectrum in complex trait loci.

Affiliation(s)

Sumeet A. Khetarpal Institute for Translational Medicine and Therapeutics, Institute for Diabetes, Obesity, and Metabolism, and Cardiovascular Institute, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania, United States of America
Andrew C. Edmondson Institute for Translational Medicine and Therapeutics, Institute for Diabetes, Obesity, and Metabolism, and Cardiovascular Institute, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania, United States of America
Avanthi Raghavan Institute for Translational Medicine and Therapeutics, Institute for Diabetes, Obesity, and Metabolism, and Cardiovascular Institute, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania, United States of America
Hemanth Neeli Section of Hospital Medicine, Temple University Hospital, Philadelphia, Pennsylvania, United States of America
Weijun Jin Department of Cell Biology, State University of New York Downstate Medical Center, Brooklyn, New York, United States of America
Karen O. Badellino University of Pennsylvania School of Nursing, Philadelphia, Pennsylvania, United States of America
Serkalem Demissie Department of Biostatistics, Boston University School of Public Health, Boston, Massachusetts, United States of America Framingham Heart Study, National Heart, Lung, and Blood Institute, Framingham, Massachusetts, United States of America
Alisa K. Manning Department of Biostatistics, Boston University School of Public Health, Boston, Massachusetts, United States of America
Stephanie L. DerOhannessian Institute for Translational Medicine and Therapeutics, Institute for Diabetes, Obesity, and Metabolism, and Cardiovascular Institute, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania, United States of America
Megan L. Wolfe Institute for Translational Medicine and Therapeutics, Institute for Diabetes, Obesity, and Metabolism, and Cardiovascular Institute, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania, United States of America
L. Adrienne Cupples Department of Biostatistics, Boston University School of Public Health, Boston, Massachusetts, United States of America Framingham Heart Study, National Heart, Lung, and Blood Institute, Framingham, Massachusetts, United States of America
Mingyao Li Department of Biostatistics and Epidemiology, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania, United States of America
Sekar Kathiresan Cardiovascular Research Center and Center for Human Genetic Research, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, United States of America Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
Daniel J. Rader Institute for Translational Medicine and Therapeutics, Institute for Diabetes, Obesity, and Metabolism, and Cardiovascular Institute, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania, United States of America

1690

Udpa N, Zhou D, Haddad GG, Bafna V. Tests of selection in pooled case-control data: an empirical study. Front Genet 2011;2:83. [PMID: 22303377 PMCID: PMC3268381 DOI: 10.3389/fgene.2011.00083] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2011] [Accepted: 11/01/2011] [Indexed: 11/13/2022] Open

Abstract

For smaller organisms with faster breeding cycles, artificial selection can be used to create sub-populations with different phenotypic traits. Genetic tests can be employed to identify the causal markers for the phenotypes, as a precursor to engineering strains with a combination of traits. Traditional approaches involve analyzing crosses of inbred strains to test for co-segregation with genetic markers. Here we take advantage of cheaper next generation sequencing techniques to identify genetic signatures of adaptation to the selection constraints. Obtaining individual sequencing data is often unrealistic due to cost and sample issues, so we focus on pooled genomic data. We explore a series of statistical tests for selection using pooled case (under selection) and control populations. The tests generally capture skews in the scaled frequency spectrum of alleles in a region, which are indicative of a selective sweep. Extensive simulations are used to show that these approaches work well for a wide range of population divergence times and strong selective pressures. Control vs control simulations are used to determine an empirical False Positive Rate, and regions under selection are determined using a 1% FPR level. We show that pooling does not have a significant impact on statistical power. The tests are also robust to reasonable variations in several different parameters, including window size, base-calling error rate, and sequencing coverage. We then demonstrate the viability (and the challenges) of one of these methods in two independent Drosophila populations (Drosophila melanogaster) bred under selection for hypoxia and accelerated development, respectively. Testing for extreme hypoxia tolerance showed clear signals of selection, pointing to loci that are important for hypoxia adaptation. Overall, we outline a strategy for finding regions under selection using pooled sequences, then devise optimal tests for that strategy. The approaches show promise for detecting selection, even several generations after fixation of the beneficial allele has occurred.

1691

Powers S, Gopalakrishnan S, Tintle N. Assessing the impact of non-differential genotyping errors on rare variant tests of association. Hum Hered 2011;72:153-60. [PMID: 22004945 DOI: 10.1159/000332222] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2011] [Accepted: 08/24/2011] [Indexed: 11/19/2022] Open

1692

A general framework for detecting disease associations with rare variants in sequencing studies. Am J Hum Genet 2011;89:354-67. [PMID: 21885029 DOI: 10.1016/j.ajhg.2011.07.015] [Citation(s) in RCA: 209] [Impact Index Per Article: 16.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2011] [Revised: 07/21/2011] [Accepted: 07/26/2011] [Indexed: 12/19/2022] Open