Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Tang ZZ, Lin DY. MASS: meta-analysis of score statistics for sequencing studies. Bioinformatics 2013;29:1803-5. [PMID: 23698861 PMCID: PMC3702254 DOI: 10.1093/bioinformatics/btt280] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open

For:	Tang ZZ, Lin DY. MASS: meta-analysis of score statistics for sequencing studies. Bioinformatics 2013;29:1803-5. [PMID: 23698861 PMCID: PMC3702254 DOI: 10.1093/bioinformatics/btt280] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open

Number

Cited by Other Article(s)

Jurgens SJ, Wang X, Choi SH, Weng LC, Koyama S, Pirruccello JP, Nguyen T, Smadbeck P, Jang D, Chaffin M, Walsh R, Roselli C, Elliott AL, Wijdeveld LFJM, Biddinger KJ, Kany S, Rämö JT, Natarajan P, Aragam KG, Flannick J, Burtt NP, Bezzina CR, Lubitz SA, Lunetta KL, Ellinor PT. Rare coding variant analysis for human diseases across biobanks and ancestries. Nat Genet 2024;56:1811-1820. [PMID: 39210047 DOI: 10.1038/s41588-024-01894-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2023] [Accepted: 08/01/2024] [Indexed: 09/04/2024]

Affiliation(s)

Sean J Jurgens Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, MA, USA Department of Experimental Cardiology, Heart Center, Amsterdam Cardiovascular Sciences, Heart Failure and Arrhythmias, Amsterdam UMC location University of Amsterdam, Amsterdam, The Netherlands Cardiovascular Research Center, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
Xin Wang Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, MA, USA Cardiovascular Research Center, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
Seung Hoan Choi Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, MA, USA Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA
Lu-Chen Weng Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, MA, USA Cardiovascular Research Center, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
Satoshi Koyama Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, MA, USA Cardiovascular Research Center, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
James P Pirruccello Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, MA, USA Division of Cardiology, University of California, San Francisco, CA, USA
Trang Nguyen Metabolism Program, The Broad Institute of Harvard and MIT, Cambridge, MA, USA
Patrick Smadbeck Metabolism Program, The Broad Institute of Harvard and MIT, Cambridge, MA, USA
Dongkeun Jang Metabolism Program, The Broad Institute of Harvard and MIT, Cambridge, MA, USA Program in Medical and Population Genetics, The Broad Institute of Harvard and MIT, Cambridge, MA, USA
Mark Chaffin Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
Roddy Walsh Department of Experimental Cardiology, Heart Center, Amsterdam Cardiovascular Sciences, Heart Failure and Arrhythmias, Amsterdam UMC location University of Amsterdam, Amsterdam, The Netherlands
Carolina Roselli Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
Amanda L Elliott Program in Medical and Population Genetics, The Broad Institute of Harvard and MIT, Cambridge, MA, USA Department of Psychiatry and Center for Genomic Medicine, Psychiatric and Neurodevelopmental Genetics Unit, Massachusetts General Hospital,Harvard Medical School, Boston, MA, USA Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA Institute for Molecular Medicine Finland (FIMM), Helsinki Institute of Life Science (HiLIFE), University of Helsinki, Helsinki, Finland
Leonoor F J M Wijdeveld Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, MA, USA Department of Physiology, Amsterdam UMC location VU, Amsterdam, The Netherlands
Kiran J Biddinger Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
Shinwan Kany Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, MA, USA Department of Cardiology, University Heart and Vascular Center Hamburg-Eppendorf, Hamburg, Germany German Center for Cardiovascular Research (DZHK), Partner Site Hamburg/Kiel/Lübeck, Hamburg, Germany
Joel T Rämö Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, MA, USA Cardiovascular Research Center, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA Institute for Molecular Medicine Finland (FIMM), Helsinki Institute of Life Science (HiLIFE), University of Helsinki, Helsinki, Finland
Pradeep Natarajan Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, MA, USA Cardiovascular Research Center, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA Center for Genomic Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
Krishna G Aragam Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, MA, USA Cardiovascular Research Center, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
Jason Flannick Metabolism Program, The Broad Institute of Harvard and MIT, Cambridge, MA, USA Division of Genetics and Genomics, Boston Children's Hospital, Boston, MA, USA Department of Pediatrics, Harvard Medical School, Boston, MA, USA
Noël P Burtt Metabolism Program, The Broad Institute of Harvard and MIT, Cambridge, MA, USA Program in Medical and Population Genetics, The Broad Institute of Harvard and MIT, Cambridge, MA, USA
Connie R Bezzina Department of Experimental Cardiology, Heart Center, Amsterdam Cardiovascular Sciences, Heart Failure and Arrhythmias, Amsterdam UMC location University of Amsterdam, Amsterdam, The Netherlands
Steven A Lubitz Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, MA, USA Cardiovascular Research Center, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA Demoulas Center for Cardiac Arrhythmias, Massachusetts General Hospital, Boston, MA, USA
Kathryn L Lunetta Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA NHLBI and Boston University's Framingham Heart Study, Framingham, MA, USA
Patrick T Ellinor Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, MA, USA. Cardiovascular Research Center, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA. Demoulas Center for Cardiac Arrhythmias, Massachusetts General Hospital, Boston, MA, USA.

Collapse

Rajabli F, Kunkle BW. Strategies in Aggregation Tests for Rare Variants. Curr Protoc 2023;3:e931. [PMID: 37988228 DOI: 10.1002/cpz1.931] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2023]

Quick C, Wen X, Abecasis G, Boehnke M, Kang HM. Integrating comprehensive functional annotations to boost power and accuracy in gene-based association analysis. PLoS Genet 2020;16:e1009060. [PMID: 33320851 PMCID: PMC7737906 DOI: 10.1371/journal.pgen.1009060] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2019] [Accepted: 08/18/2020] [Indexed: 11/19/2022] Open

Svishcheva GR, Belonogova NM, Zorkoltseva IV, Kirichenko AV, Axenovich TI. Gene-based association tests using GWAS summary statistics. Bioinformatics 2020;35:3701-3708. [PMID: 30860568 DOI: 10.1093/bioinformatics/btz172] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2018] [Revised: 02/12/2019] [Accepted: 03/11/2019] [Indexed: 01/09/2023] Open

Statistical Method Based on Bayes-Type Empirical Score Test for Assessing Genetic Association with Multilocus Genotype Data. Int J Genomics 2020;2020:4708152. [PMID: 32455126 PMCID: PMC7229558 DOI: 10.1155/2020/4708152] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2019] [Accepted: 04/21/2020] [Indexed: 12/20/2022] Open

Khalique F, Khan SA, Butt WH, Matloob I. An Integrated Approach for Spatio-Temporal Cholera Disease Hotspot Relation Mining for Public Health Management in Punjab, Pakistan. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2020;17:ijerph17113763. [PMID: 32466471 PMCID: PMC7312960 DOI: 10.3390/ijerph17113763] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/24/2020] [Revised: 05/14/2020] [Accepted: 05/18/2020] [Indexed: 12/13/2022]

Yang T, Kim J, Wu C, Ma Y, Wei P, Pan W. An adaptive test for meta-analysis of rare variant association studies. Genet Epidemiol 2020;44:104-116. [PMID: 31830326 PMCID: PMC6980317 DOI: 10.1002/gepi.22273] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2019] [Revised: 11/12/2019] [Accepted: 11/25/2019] [Indexed: 01/02/2023]

Povysil G, Petrovski S, Hostyk J, Aggarwal V, Allen AS, Goldstein DB. Rare-variant collapsing analyses for complex traits: guidelines and applications. Nat Rev Genet 2019;20:747-759. [PMID: 31605095 DOI: 10.1038/s41576-019-0177-4] [Citation(s) in RCA: 117] [Impact Index Per Article: 23.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/06/2019] [Indexed: 12/11/2022]

Wang L, Lee S, Qiao D, Cho MH, Silverman EK, Lange C, Won S. metaFARVAT: An Efficient Tool for Meta-Analysis of Family-Based, Case-Control, and Population-Based Rare Variant Association Studies. Front Genet 2019;10:572. [PMID: 31275357 PMCID: PMC6593391 DOI: 10.3389/fgene.2019.00572] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2018] [Accepted: 05/31/2019] [Indexed: 11/13/2022] Open

Weissenkampen JD, Jiang Y, Eckert S, Jiang B, Li B, Liu DJ. Methods for the Analysis and Interpretation for Rare Variants Associated with Complex Traits. CURRENT PROTOCOLS IN HUMAN GENETICS 2019;101:e83. [PMID: 30849219 PMCID: PMC6455968 DOI: 10.1002/cphg.83] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]

Wong KY, Zeng D, Lin DY. Robust Score Tests With Missing Data in Genomics Studies. J Am Stat Assoc 2019;114:1778-1786. [PMID: 31920211 DOI: 10.1080/01621459.2018.1514304] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]

Chien LC, Chiu YF. General retrospective mega-analysis framework for rare variant association tests. Genet Epidemiol 2018;42:621-635. [PMID: 30188589 DOI: 10.1002/gepi.22147] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2017] [Revised: 06/05/2018] [Accepted: 06/05/2018] [Indexed: 11/09/2022]

Jiang Y, Chen S, McGuire D, Chen F, Liu M, Iacono WG, Hewitt JK, Hokanson JE, Krauter K, Laakso M, Li KW, Lutz SM, McGue M, Pandit A, Zajac GJM, Boehnke M, Abecasis GR, Vrieze SI, Zhan X, Jiang B, Liu DJ. Proper conditional analysis in the presence of missing data: Application to large scale meta-analysis of tobacco use phenotypes. PLoS Genet 2018;14:e1007452. [PMID: 30016313 PMCID: PMC6063450 DOI: 10.1371/journal.pgen.1007452] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2017] [Revised: 07/27/2018] [Accepted: 05/25/2018] [Indexed: 11/19/2022] Open

Abstract

Meta-analysis of genetic association studies increases sample size and the power for mapping complex traits. Existing methods are mostly developed for datasets without missing values, i.e. the summary association statistics are measured for all variants in contributing studies. In practice, genotype imputation is not always effective. This may be the case when targeted genotyping/sequencing assays are used or when the un-typed genetic variant is rare. Therefore, contributed summary statistics often contain missing values. Existing methods for imputing missing summary association statistics and using imputed values in meta-analysis, approximate conditional analysis, or simple strategies such as complete case analysis all have theoretical limitations. Applying these approaches can bias genetic effect estimates and lead to seriously inflated type-I or type-II errors in conditional analysis, which is a critical tool for identifying independently associated variants. To address this challenge and complement imputation methods, we developed a method to combine summary statistics across participating studies and consistently estimate joint effects, even when the contributed summary statistics contain large amounts of missing values. Based on this estimator, we proposed a score statistic called PCBS (partial correlation based score statistic) for conditional analysis of single-variant and gene-level associations. Through extensive analysis of simulated and real data, we showed that the new method produces well-calibrated type-I errors and is substantially more powerful than existing approaches. We applied the proposed approach to one of the largest meta-analyses to date for the cigarettes-per-day phenotype. Using the new method, we identified multiple novel independently associated variants at known loci for tobacco use, which were otherwise missed by alternative methods. Together, the phenotypic variance explained by these variants was 1.1%, improving that of previously reported associations by 71%. These findings illustrate the extent of locus allelic heterogeneity and can help pinpoint causal variants.

It is of great interest to estimate the joint effects of multiple variants from large scale meta-analyses, in order to fine-map causal variants and understand the genetic architecture for complex traits. The summary association statistics from participating studies in a meta-analysis often contain missing values at some variant sites, as the imputation methods may not work well and the variants with low imputation quality will be filtered out. Missingness is especially likely when the underlying genetic variant is rare or the participating studies use targeted genotyping array that is not suitable for imputation. Existing methods for conditional meta-analysis do not properly handle missing data, and can incorrectly estimate correlations between score statistics. As a result, they can produce highly inflated type-I errors for conditional analysis, which will result in overestimated phenotypic variance explained and incorrect identification of causal variants. We systematically evaluated this bias and proposed a novel partial correlation based score statistic. The new statistic has valid type-I errors for conditional analysis and much higher power than the existing methods, even when the contributed summary statistics contain a large fraction of missing values. We expect this method to be highly useful in the sequencing age for complex trait genetics.

Collapse

Affiliation(s)

Yu Jiang Department of Public Health Sciences, Penn State College of Medicine, Hershey, Pennsylvania, United States of America
Sai Chen Center of Statistical Genetics, Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, United States of America
Daniel McGuire Department of Public Health Sciences, Penn State College of Medicine, Hershey, Pennsylvania, United States of America
Fang Chen Department of Public Health Sciences, Penn State College of Medicine, Hershey, Pennsylvania, United States of America
Mengzhen Liu Department of Psychology, University of Minnesota, Minneapolis, Minnesota, United States of America
William G. Iacono Department of Psychology, University of Minnesota, Minneapolis, Minnesota, United States of America
John K. Hewitt Institute for Behavioral Genetics, University of Colorado Boulder, Boulder, Colorado, United States of America
John E. Hokanson Department of Epidemiology, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, Colorado, United States of America
Kenneth Krauter Institute for Behavioral Genetics, University of Colorado Boulder, Boulder, Colorado, United States of America
Markku Laakso Institute of Clinical Medicine, Internal Medicine, University of Eastern Finland and Kuopio University Hospital, Kuopio, Finland
Kevin W. Li Center of Statistical Genetics, Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, United States of America
Sharon M. Lutz Department of Biostatistics and Informatics, University of Colorado, Anschutz Medical Campus, Aurora, Colorado, United States of America
Matthew McGue Department of Psychology, University of Minnesota, Minneapolis, Minnesota, United States of America
Anita Pandit Center of Statistical Genetics, Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, United States of America
Gregory J. M. Zajac Center of Statistical Genetics, Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, United States of America
Michael Boehnke Center of Statistical Genetics, Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, United States of America
Goncalo R. Abecasis Center of Statistical Genetics, Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, United States of America
Scott I. Vrieze Department of Psychology, University of Minnesota, Minneapolis, Minnesota, United States of America
Xiaowei Zhan Department of Clinical Science, Quantitative Biomedical Research Center, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America
Bibo Jiang Department of Public Health Sciences, Penn State College of Medicine, Hershey, Pennsylvania, United States of America * E-mail: (DJL); (BJ)
Dajiang J. Liu Department of Public Health Sciences, Penn State College of Medicine, Hershey, Pennsylvania, United States of America * E-mail: (DJL); (BJ)

Collapse

Yang J, Chen S, Abecasis G. Improved score statistics for meta-analysis in single-variant and gene-level association studies. Genet Epidemiol 2018;42:333-343. [PMID: 29696691 DOI: 10.1002/gepi.22123] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2017] [Revised: 03/04/2018] [Accepted: 03/16/2018] [Indexed: 01/09/2023]

Michailidou K. Meta-Analysis of Common and Rare Variants. Methods Mol Biol 2018;1793:73-88. [PMID: 29876892 DOI: 10.1007/978-1-4939-7868-7_6] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]

Lu W, Wang X, Zhan X, Gazdar A. Meta-analysis approaches to combine multiple gene set enrichment studies. Stat Med 2017;37:659-672. [PMID: 29052247 DOI: 10.1002/sim.7540] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2016] [Revised: 07/02/2017] [Accepted: 09/29/2017] [Indexed: 11/09/2022]

Tang ZZ, Bunn P, Tao R, Liu Z, Lin DY. PreMeta: a tool to facilitate meta-analysis of rare-variant associations. BMC Genomics 2017;18:160. [PMID: 28196472 PMCID: PMC5310051 DOI: 10.1186/s12864-017-3573-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2016] [Accepted: 02/09/2017] [Indexed: 11/10/2022] Open

Discovery of rare variants for complex phenotypes. Hum Genet 2016;135:625-34. [PMID: 27221085 DOI: 10.1007/s00439-016-1679-1] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2016] [Accepted: 04/28/2016] [Indexed: 12/27/2022]

Zhan X, Liu DJ. SEQMINER: An R-Package to Facilitate the Functional Interpretation of Sequence-Based Associations. Genet Epidemiol 2015;39:619-23. [PMID: 26394715 PMCID: PMC4794281 DOI: 10.1002/gepi.21918] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2015] [Revised: 07/01/2015] [Accepted: 07/17/2015] [Indexed: 11/23/2022]

Abstract

Next‐generation sequencing has enabled the study of a comprehensive catalogue of genetic variants for their impact on various complex diseases. Numerous consortia studies of complex traits have publically released their summary association statistics, which have become an invaluable resource for learning the underlying biology, understanding the genetic architecture, and guiding clinical translations. There is great interest in the field in developing novel statistical methods for analyzing and interpreting results from these genotype‐phenotype association studies. One popular platform for method development and data analysis is R. In order to enable these analyses in R, it is necessary to develop packages that can efficiently query files of summary association statistics, explore the linkage disequilibrium structure between variants, and integrate various bioinformatics databases. The complexity and scale of sequence datasets and databases pose significant computational challenges for method developers. To address these challenges and facilitate method development, we developed the R package SEQMINER for annotating and querying files of sequence variants (e.g., VCF/BCF files) and summary association statistics (e.g., METAL/RAREMETAL files), and for integrating bioinformatics databases. SEQMINER provides an infrastructure where novel methods can be distributed and applied to analyzing sequence datasets in practice. We illustrate the performance of SEQMINER using datasets from the 1000 Genomes Project. We show that SEQMINER is highly efficient and easy to use. It will greatly accelerate the process of applying statistical innovations to analyze and interpret sequence‐based associations. The R package, its source code and documentations are available from http://cran.r‐project.org/web/packages/seqminer and http://seqminer.genomic.codes/.

Collapse

Tao R, Zeng D, Franceschini N, North KE, Boerwinkle E, Lin DY. Analysis of Sequence Data Under Multivariate Trait-Dependent Sampling. J Am Stat Assoc 2015;110:560-572. [PMID: 26366025 DOI: 10.1080/01621459.2015.1008099] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]

Meta-analysis for Discovering Rare-Variant Associations: Statistical Methods and Software Programs. Am J Hum Genet 2015;97:35-53. [PMID: 26094574 DOI: 10.1016/j.ajhg.2015.05.001] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2015] [Accepted: 05/01/2015] [Indexed: 01/01/2023] Open

Genetic variation in uncontrolled childhood asthma despite ICS treatment. THE PHARMACOGENOMICS JOURNAL 2015;16:158-63. [PMID: 25963336 DOI: 10.1038/tpj.2015.36] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Received: 11/06/2014] [Revised: 03/02/2015] [Accepted: 03/26/2015] [Indexed: 11/08/2022]

Wang Q, Lu Q, Zhao H. A review of study designs and statistical methods for genomic epidemiology studies using next generation sequencing. Front Genet 2015;6:149. [PMID: 25941534 PMCID: PMC4403555 DOI: 10.3389/fgene.2015.00149] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2015] [Accepted: 03/30/2015] [Indexed: 12/22/2022] Open

Feng S, Pistis G, Zhang H, Zawistowski M, Mulas A, Zoledziewska M, Holmen OL, Busonero F, Sanna S, Hveem K, Willer C, Cucca F, Liu DJ, Abecasis GR. Methods for association analysis and meta-analysis of rare variants in families. Genet Epidemiol 2015;39:227-38. [PMID: 25740221 DOI: 10.1002/gepi.21892] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2014] [Revised: 01/03/2015] [Accepted: 01/26/2015] [Indexed: 11/09/2022]

Vaitsiakhovich T, Drichel D, Herold C, Lacour A, Becker T. METAINTER: meta-analysis of multiple regression models in genome-wide association studies. ACTA ACUST UNITED AC 2014;31:151-7. [PMID: 25252781 DOI: 10.1093/bioinformatics/btu629] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]

Abstract

MOTIVATION

Meta-analysis of summary statistics is an essential approach to guarantee the success of genome-wide association studies (GWAS). Application of the fixed or random effects model to single-marker association tests is a standard practice. More complex methods of meta-analysis involving multiple parameters have not been used frequently, a gap that could be explained by the lack of a respective meta-analysis pipeline. Meta-analysis based on combining p-values can be applied to any association test. However, to be powerful, meta-analysis methods for high-dimensional models should incorporate additional information such as study-specific properties of parameter estimates, their effect directions, standard errors and covariance structure.

RESULTS

We modified 'method for the synthesis of linear regression slopes' recently proposed in the educational sciences to the case of multiple logistic regression, and implemented it in a meta-analysis tool called METAINTER. The software handles models with an arbitrary number of parameters, and can directly be applied to analyze the results of single-SNP tests, global haplotype tests, tests for and under gene-gene or gene-environment interaction. Via simulations for two-single nucleotide polymorphisms (SNP) models we have shown that the proposed meta-analysis method has correct type I error rate. Moreover, power estimates come close to that of the joint analysis of the entire sample. We conducted a real data analysis of six GWAS of type 2 diabetes, available from dbGaP (http://www.ncbi.nlm.nih.gov/gap). For each study, a genome-wide interaction analysis of all SNP pairs was performed by logistic regression tests. The results were then meta-analyzed with METAINTER.

AVAILABILITY

The software is freely available and distributed under the conditions specified on http://metainter.meb.uni-bonn.de.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Collapse

Lee S, Abecasis G, Boehnke M, Lin X. Rare-variant association analysis: study designs and statistical tests. Am J Hum Genet 2014;95:5-23. [PMID: 24995866 DOI: 10.1016/j.ajhg.2014.06.009] [Citation(s) in RCA: 658] [Impact Index Per Article: 65.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2014] [Indexed: 12/30/2022] Open

Feng S, Liu D, Zhan X, Wing MK, Abecasis GR. RAREMETAL: fast and powerful meta-analysis for rare variants. Bioinformatics 2014;30:2828-9. [PMID: 24894501 PMCID: PMC4173011 DOI: 10.1093/bioinformatics/btu367] [Citation(s) in RCA: 91] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open

Tang ZZ, Lin DY. Meta-analysis of sequencing studies with heterogeneous genetic associations. Genet Epidemiol 2014;38:389-401. [PMID: 24799183 DOI: 10.1002/gepi.21798] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2013] [Revised: 02/05/2014] [Accepted: 02/06/2014] [Indexed: 01/06/2023]

Liu DJ, Peloso GM, Zhan X, Holmen OL, Zawistowski M, Feng S, Nikpay M, Auer PL, Goel A, Zhang H, Peters U, Farrall M, Orho-Melander M, Kooperberg C, McPherson R, Watkins H, Willer CJ, Hveem K, Melander O, Kathiresan S, Abecasis GR. Meta-analysis of gene-level tests for rare variant association. Nat Genet 2014;46:200-4. [PMID: 24336170 PMCID: PMC3939031 DOI: 10.1038/ng.2852] [Citation(s) in RCA: 144] [Impact Index Per Article: 14.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2013] [Accepted: 11/20/2013] [Indexed: 12/14/2022]

Affiliation(s)

Dajiang J. Liu Center for Statistical Genetics, Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI 48109
Gina M. Peloso Broad Institute of Harvard and MIT, Cambridge, MA Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA
Xiaowei Zhan Center for Statistical Genetics, Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI 48109
Oddgeir L. Holmen Department of Public Health and General Practice, Norwegian University of Science and Technology, Trondheim 7489, Norway St. Olav Hospital, Trondheim University Hospital, Trondheim, Norway
Matthew Zawistowski Center for Statistical Genetics, Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI 48109
Shuang Feng Center for Statistical Genetics, Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI 48109
Majid Nikpay University of Ottawa Heart Institute, Ottawa, Ontario, Canada
Paul L. Auer Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle WA 98109, USA School of Public Health, University of Wisconsin-Milwaukee
Anuj Goel Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, United Kingdom Department of Cardiovascular Medicine, University of Oxford, Oxford, UK
He Zhang Division of Cardiology, Department of Internal Medicine, University of Michigan Medical School, Ann Arbor, MI 48109 Department of Human Genetics, University of Michigan Medical School, Ann Arbor, MI 48109
Ulrike Peters Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle WA 98109, USA Department of Epidemiology, University of Washington School of Public Health, Seattle, WA
Martin Farrall Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, United Kingdom Department of Cardiovascular Medicine, University of Oxford, Oxford, UK
Marju Orho-Melander Department of Cardiovascular Medicine, University of Oxford, Oxford, UK Department of Clinical Sciences, Lund University, Malmö, Sweden
Charles Kooperberg Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle WA 98109, USA Department of Biostatistics, University of Washington School of Public Health, Seattle, WA
Ruth McPherson University of Ottawa Heart Institute, Ottawa, Ontario, Canada
Hugh Watkins Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, United Kingdom Department of Cardiovascular Medicine, University of Oxford, Oxford, UK
Cristen J. Willer Division of Cardiology, Department of Internal Medicine, University of Michigan Medical School, Ann Arbor, MI 48109 Department of Human Genetics, University of Michigan Medical School, Ann Arbor, MI 48109
Kristian Hveem Department of Public Health and General Practice, Norwegian University of Science and Technology, Trondheim 7489, Norway Levanger Hospital, Levanger, Norway
Olle Melander Department of Cardiovascular Medicine, University of Oxford, Oxford, UK Department of Clinical Sciences, Lund University, Malmö, Sweden
Sekar Kathiresan Broad Institute of Harvard and MIT, Cambridge, MA Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA Harvard Medical School, Cambridge, MA
Gonçalo R. Abecasis Center for Statistical Genetics, Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI 48109

Collapse

Panoutsopoulou K, Tachmazidou I, Zeggini E. In search of low-frequency and rare variants affecting complex traits. Hum Mol Genet 2013;22:R16-21. [PMID: 23922232 PMCID: PMC3782074 DOI: 10.1093/hmg/ddt376] [Citation(s) in RCA: 64] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open