1501
|
Raule N, Sevini F, Li S, Barbieri A, Tallaro F, Lomartire L, Vianello D, Montesanto A, Moilanen JS, Bezrukov V, Blanché H, Hervonen A, Christensen K, Deiana L, Gonos ES, Kirkwood TBL, Kristensen P, Leon A, Pelicci PG, Poulain M, Rea IM, Remacle J, Robine JM, Schreiber S, Sikora E, Eline Slagboom P, Spazzafumo L, Antonietta Stazi M, Toussaint O, Vaupel JW, Rose G, Majamaa K, Perola M, Johnson TE, Bolund L, Yang H, Passarino G, Franceschi C. The co-occurrence of mtDNA mutations on different oxidative phosphorylation subunits, not detected by haplogroup analysis, affects human longevity and is population specific. Aging Cell 2014; 13:401-7. [PMID: 24341918 PMCID: PMC4326891 DOI: 10.1111/acel.12186] [Citation(s) in RCA: 73] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/14/2013] [Indexed: 01/01/2023] Open
Abstract
To re-examine the correlation between mtDNA variability and longevity, we examined mtDNAs from samples obtained from over 2200 ultranonagenarians (and an equal number of controls) collected within the framework of the GEHA EU project. The samples were categorized by high-resolution classification, while about 1300 mtDNA molecules (650 ultranonagenarians and an equal number of controls) were completely sequenced. Sequences, unlike standard haplogroup analysis, made possible to evaluate for the first time the cumulative effects of specific, concomitant mtDNA mutations, including those that per se have a low, or very low, impact. In particular, the analysis of the mutations occurring in different OXPHOS complex showed a complex scenario with a different mutation burden in 90+ subjects with respect to controls. These findings suggested that mutations in subunits of the OXPHOS complex I had a beneficial effect on longevity, while the simultaneous presence of mutations in complex I and III (which also occurs in J subhaplogroups involved in LHON) and in complex I and V seemed to be detrimental, likely explaining previous contradictory results. On the whole, our study, which goes beyond haplogroup analysis, suggests that mitochondrial DNA variation does affect human longevity, but its effect is heavily influenced by the interaction between mutations concomitantly occurring on different mtDNA genes.
Collapse
Affiliation(s)
- Nicola Raule
- BioPhysics and Biocomplexity and Department of Experimental Pathology; C.I. G. Interdepartmental Centre L. Galvani for Integrated Studies on Bioinformatics; University of Bologna; Bologna 40126 Italy
| | - Federica Sevini
- BioPhysics and Biocomplexity and Department of Experimental Pathology; C.I. G. Interdepartmental Centre L. Galvani for Integrated Studies on Bioinformatics; University of Bologna; Bologna 40126 Italy
| | | | - Annalaura Barbieri
- BioPhysics and Biocomplexity and Department of Experimental Pathology; C.I. G. Interdepartmental Centre L. Galvani for Integrated Studies on Bioinformatics; University of Bologna; Bologna 40126 Italy
| | - Federica Tallaro
- Department of Cell Biology; University of Calabria; Rende 87036 Italy
| | - Laura Lomartire
- BioPhysics and Biocomplexity and Department of Experimental Pathology; C.I. G. Interdepartmental Centre L. Galvani for Integrated Studies on Bioinformatics; University of Bologna; Bologna 40126 Italy
| | - Dario Vianello
- BioPhysics and Biocomplexity and Department of Experimental Pathology; C.I. G. Interdepartmental Centre L. Galvani for Integrated Studies on Bioinformatics; University of Bologna; Bologna 40126 Italy
| | | | - Jukka S. Moilanen
- Institute of Clinical Medicine; University of Oulu; Oulu University Hospital and MRC Oulu; Oulu 90014 Finland
| | | | - Hélène Blanché
- Centre Polymorphisme Humaine; Fondation Jean Dausset; Paris 75010 France
| | | | - Kaare Christensen
- Institute of Public Health; University of Southern Denmark; Odense 5230 Denmark
| | | | | | - Tom B. L. Kirkwood
- School of Clinical Medical Sciences; Gerontology “Henry Wellcome”; University of Newcastle upon Tyne; Newcastle upon Tyne NE1 3BZ UK
| | | | - Alberta Leon
- Research & Innovation Soc.Coop. a r.l.; Padova 35127 Italy
| | | | - Michel Poulain
- Research Centre of Demographic Management for Public Administrations; UCL-GéDAP; Louvain-la-Neuve 1348 Belgium
| | - Irene M. Rea
- The Queen's University Belfast; Belfast BT7 1NN UK
| | - Josè Remacle
- Eppendorf Array Technologies; SA-EAT Research and Development; Namur 5000 Belgium
| | - Jean Marie Robine
- University of Montpellier; Val d'Aurelle Cancer Research Center; Montpellier 34090 France
| | - Stefan Schreiber
- Kiel Center for Functional Genomics; University Hospital Schleswig Holstein; Kiel 24105 Germany
| | - Ewa Sikora
- Nencki Institute of Experimental Biology; Polish Academy of Sciences; Warsaw 00-679 Poland
| | | | - Liana Spazzafumo
- INRCA-Italian National Research Centre on Aging; Ancona 60127 Italy
| | | | | | - James W. Vaupel
- Max Planck Institute for Demographic Research; Rostock 18057 Germany
| | - Giuseppina Rose
- Department of Cell Biology; University of Calabria; Rende 87036 Italy
| | - Kari Majamaa
- Institute of Clinical Medicine; University of Oulu; Oulu University Hospital and MRC Oulu; Oulu 90014 Finland
| | - Markus Perola
- National Public Health Institute; Helsinki 00260 Finland
| | - Thomas E. Johnson
- Institute for Behavioral Genetics; University of Colorado Boulder; Boulder CO 80309 USA
| | | | | | | | - Claudio Franceschi
- BioPhysics and Biocomplexity and Department of Experimental Pathology; C.I. G. Interdepartmental Centre L. Galvani for Integrated Studies on Bioinformatics; University of Bologna; Bologna 40126 Italy
| |
Collapse
|
1502
|
Bis JC, White CC, Franceschini N, Brody J, Zhang X, Muzny D, Santibanez J, Gibbs R, Liu X, Lin H, Boerwinkle E, Psaty BM, North KE, Cupples LA, O’Donnell CJ. Sequencing of 2 subclinical atherosclerosis candidate regions in 3669 individuals: Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium Targeted Sequencing Study. CIRCULATION. CARDIOVASCULAR GENETICS 2014; 7:359-64. [PMID: 24951662 PMCID: PMC4112104 DOI: 10.1161/circgenetics.113.000116] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
BACKGROUND Atherosclerosis, the precursor to coronary heart disease and stroke, is characterized by an accumulation of fatty cells in the arterial intimal-medial layers. Common carotid intima media thickness (cIMT) and plaque are subclinical atherosclerosis measures that predict cardiovascular disease events. Previously, genome-wide association studies demonstrated evidence for association with cIMT (SLC17A4) and plaque (PIK3CG). METHODS AND RESULTS We sequenced 120 kb around SLC17A4 (6p22.2) and 251 kb around PIK3CG (7q22.3) among 3669 European ancestry participants from the Atherosclerosis Risk in Communities (ARIC) study, Cardiovascular Health Study (CHS), and Framingham Heart Study (FHS) in Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium. Primary analyses focused on 438 common variants (minor allele frequency ≥1%), which were independently meta-analyzed. A 3' untranslated region CCDC71L variant (rs2286149), upstream from PIK3CG, was the most significant finding in cIMT (P=0.00033) and plaque (P=0.0004) analyses. A SLC17A4 intronic variant was also associated with cIMT (P=0.008). Both were in low linkage disequilibrium with the genome-wide association study single nucleotide polymorphisms. Gene-based tests including T1 count and sequence kernel association test for rare variants (minor allele frequency <1%) did not yield statistically significant associations. However, we observed nominal associations for rare variants in CCDC71L and SLC17A3 with cIMT and of the entire 7q22 region with plaque (P=0.05). CONCLUSIONS Common and rare variants in PIK3CG and SLC17A4 regions demonstrated modest association with subclinical atherosclerosis traits. Although not conclusive, these findings may help to understand the genetic architecture of regions previously implicated by genome-wide association studies and identify variants within these regions for further investigation in larger samples.
Collapse
Affiliation(s)
- Joshua C. Bis
- Cardiovascular Health Research Unit & Department of Medicine, University of Washington, Seattle, WA
| | - Charles C. White
- National Heart, Lung and Blood Institute’s Framingham Heart Study, Framingham
- Department of Biostatistics, Boston University School of Public Health, Boston, MA
| | - Nora Franceschini
- Department of Epidemiology, University of North Carolina Chapel Hill, Chapel Hill, NC
| | - Jennifer Brody
- Cardiovascular Health Research Unit & Department of Medicine, University of Washington, Seattle, WA
| | - Xiaoling Zhang
- National Heart, Lung and Blood Institute’s Framingham Heart Study, Framingham
| | - Donna Muzny
- Human Genome Sequencing Center, Baylor College of Medicine, University of Texas Health Science Center at Houston, Houston, TX
| | - Jireh Santibanez
- Human Genome Sequencing Center, Baylor College of Medicine, University of Texas Health Science Center at Houston, Houston, TX
| | - Richard Gibbs
- Human Genome Sequencing Center, Baylor College of Medicine, University of Texas Health Science Center at Houston, Houston, TX
| | - Xiaoming Liu
- Human Genetics Center, University of Texas Health Science Center at Houston, Houston, TX
| | - Honghuang Lin
- National Heart, Lung and Blood Institute’s Framingham Heart Study, Framingham
- Department of Medicine, Boston University School of Medicine, Boston, MA
| | - Eric Boerwinkle
- Human Genome Sequencing Center, Baylor College of Medicine, University of Texas Health Science Center at Houston, Houston, TX
- Human Genetics Center, University of Texas Health Science Center at Houston, Houston, TX
| | - Bruce M. Psaty
- Cardiovascular Health Research Unit & Department of Medicine, University of Washington, Seattle, WA
- Department of Epidemiology, University of Washington, Seattle, WA
- Department of Health Services, University of Washington, Seattle, WA
- Group Health Research Institute, Group Health, Seattle, WA
| | - Kari E. North
- Department of Epidemiology, University of North Carolina Chapel Hill, Chapel Hill, NC
- Carolina Center for Genome Sciences, University of North Carolina Chapel Hill, Chapel Hill, NC
| | - L. Adrienne Cupples
- National Heart, Lung and Blood Institute’s Framingham Heart Study, Framingham
- Department of Biostatistics, Boston University School of Public Health, Boston, MA
| | - Christopher J. O’Donnell
- National Heart, Lung and Blood Institute’s Framingham Heart Study, Framingham
- Division of Intramural Research, National Heart, Lung and Blood Institute, Bethesda, MD
- Cardiology Division, Department of Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, MA
| |
Collapse
|
1503
|
Magnani JW, Brody JA, Prins BP, Arking DE, Lin H, Yin X, Liu CT, Morrison AC, Zhang F, Spector TD, Alonso A, Bis JC, Heckbert SR, Lumley T, Sitlani CM, Cupples LA, Lubitz SA, Soliman EZ, Pulit SL, Newton-Cheh C, O'Donnell CJ, Ellinor PT, Benjamin EJ, Muzny DM, Gibbs RA, Santibanez J, Taylor HA, Rotter JI, Lange LA, Psaty BM, Jackson R, Rich SS, Boerwinkle E, Jamshidi Y, Sotoodehnia N. Sequencing of SCN5A identifies rare and common variants associated with cardiac conduction: Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium. CIRCULATION. CARDIOVASCULAR GENETICS 2014; 7:365-73. [PMID: 24951663 PMCID: PMC4177904 DOI: 10.1161/circgenetics.113.000098] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
BACKGROUND The cardiac sodium channel SCN5A regulates atrioventricular and ventricular conduction. Genetic variants in this gene are associated with PR and QRS intervals. We sought to characterize further the contribution of rare and common coding variation in SCN5A to cardiac conduction. METHODS AND RESULTS In Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium Targeted Sequencing Study, we performed targeted exonic sequencing of SCN5A (n=3699, European ancestry individuals) and identified 4 common (minor allele frequency >1%) and 157 rare variants. Common and rare SCN5A coding variants were examined for association with PR and QRS intervals through meta-analysis of European ancestry participants from CHARGE, National Heart, Lung, and Blood Institute's Exome Sequencing Project (n=607), and the UK10K (n=1275) and by examining Exome Sequencing Project African ancestry participants (n=972). Rare coding SCN5A variants in aggregate were associated with PR interval in European and African ancestry participants (P=1.3×10(-3)). Three common variants were associated with PR and QRS interval duration among European ancestry participants and one among African ancestry participants. These included 2 well-known missense variants: rs1805124 (H558R) was associated with PR and QRS shortening in European ancestry participants (P=6.25×10(-4) and P=5.2×10(-3), respectively) and rs7626962 (S1102Y) was associated with PR shortening in those of African ancestry (P=2.82×10(-3)). Among European ancestry participants, 2 novel synonymous variants, rs1805126 and rs6599230, were associated with cardiac conduction. Our top signal, rs1805126 was associated with PR and QRS lengthening (P=3.35×10(-7) and P=2.69×10(-4), respectively) and rs6599230 was associated with PR shortening (P=2.67×10(-5)). CONCLUSIONS By sequencing SCN5A, we identified novel common and rare coding variants associated with cardiac conduction.
Collapse
Affiliation(s)
- Jared W. Magnani
- NHLBI and Boston University’s Framingham Heart Study,
Framingham, MA
- Section of Cardiovascular Medicine, Boston University School of
Medicine, Boston, MA
| | - Jennifer A. Brody
- Cardiovascular Health Research Unit, Department of Medicine,
University of Washington, Seattle, WA
| | - Bram P. Prins
- Human Genetics Research Centre, St George’s University of
London, London, United Kingdom
| | - Dan E. Arking
- McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins
University School of Medicine, Baltimore, MD
| | - Honghuang Lin
- NHLBI and Boston University’s Framingham Heart Study,
Framingham, MA
- Section of Computational Biomedicine, Boston University School of
Medicine, Boston, MA
| | - Xiaoyan Yin
- NHLBI and Boston University’s Framingham Heart Study,
Framingham, MA
- Department of Biostatistics, Boston University School of Public
Health, Boston, MA
| | - Ching-Ti Liu
- NHLBI and Boston University’s Framingham Heart Study,
Framingham, MA
- Department of Biostatistics, Boston University School of Public
Health, Boston, MA
| | - Alanna C. Morrison
- Human Genetics Center, University of Texas Health Science Center,
Houston, TX
| | - Feng Zhang
- Human Genetics Research Centre, St George’s University of
London, London, United Kingdom
- Department of Twin Research and Genetic Epidemiology Unit, St
Thomas’ Campus, King’s College London, St Thomas’ Hospital, London,
United Kingdom
| | - Tim D. Spector
- Department of Twin Research and Genetic Epidemiology Unit, St
Thomas’ Campus, King’s College London, St Thomas’ Hospital, London,
United Kingdom
| | - Alvaro Alonso
- Division of Epidemiology and Community Health, University of
Minnesota, Minneapolis, MN
| | - Joshua C. Bis
- Cardiovascular Health Research Unit, Department of Medicine,
University of Washington, Seattle, WA
| | - Susan R. Heckbert
- Cardiovascular Health Research Unit, Department of Medicine,
University of Washington, Seattle, WA
- Department of Epidemiology, University of Washington, Seattle,
WA
| | - Thomas Lumley
- Department of Statistics, University of Auckland, Auckland, New
Zealand
| | - Colleen M. Sitlani
- Cardiovascular Health Research Unit, Department of Medicine,
University of Washington, Seattle, WA
| | - L. Adrienne Cupples
- NHLBI and Boston University’s Framingham Heart Study,
Framingham, MA
- Department of Biostatistics, Boston University School of Public
Health, Boston, MA
| | - Steven A. Lubitz
- Cardiovascular Research Center, Massachusetts General Hospital,
Charlestown, MA
- Cardiology Division, Department of Medicine, Massachusetts
General Hospital, Harvard Medical School, Boston, MA
| | - Elsayed Z. Soliman
- Epidemiological Cardiology Research Center (EPICARE), Wake
Forest University School of Medicine, Winston Salem, NC
| | - Sara L. Pulit
- Cardiovascular Research Center, Massachusetts General Hospital,
Charlestown, MA
- Broad Institute, Cambridge, MA
| | - Christopher Newton-Cheh
- NHLBI and Boston University’s Framingham Heart Study,
Framingham, MA
- Cardiovascular Research Center, Massachusetts General Hospital,
Charlestown, MA
- Broad Institute, Cambridge, MA
| | - Christopher J. O'Donnell
- NHLBI and Boston University’s Framingham Heart Study,
Framingham, MA
- Cardiology Division, Department of Medicine, Massachusetts
General Hospital, Harvard Medical School, Boston, MA
| | - Patrick T. Ellinor
- Cardiovascular Research Center, Massachusetts General Hospital,
Charlestown, MA
- Cardiology Division, Department of Medicine, Massachusetts
General Hospital, Harvard Medical School, Boston, MA
| | - Emelia J. Benjamin
- NHLBI and Boston University’s Framingham Heart Study,
Framingham, MA
- Section of Cardiovascular Medicine, Boston University School of
Medicine, Boston, MA
- Boston University Schools of Medicine and Public Health,
Boston, MA
| | - Donna M. Muzny
- Human Genome Sequencing Center, Baylor College of Medicine,
Houston, TX
| | - Richard A. Gibbs
- Human Genome Sequencing Center, Baylor College of Medicine,
Houston, TX
| | - Jireh Santibanez
- Human Genome Sequencing Center, Baylor College of Medicine,
Houston, TX
| | | | - Jerome I. Rotter
- Medical Genetics Institute, Cedars-Sinai Medical Center, Los
Angeles, CA
| | - Leslie A. Lange
- Department of Genetics, University of North Carolina, Chapel
Hill, NC
| | - Bruce M. Psaty
- Cardiovascular Health Research Unit, Department of Medicine,
University of Washington, Seattle, WA
- Department of Epidemiology, University of Washington, Seattle,
WA
- Group Health Research Institute, Group Health Cooperative,
Seattle, WA
- Department of Health Services, University of Washington,
Seattle, WA
| | - Rebecca Jackson
- Department of Medicine, Wexner Medical Center, Ohio State
University, Columbus, OH
| | - Stephen S. Rich
- Center for Public Health Genomics, University of Virginia,
Charlottesville, VA
| | - Eric Boerwinkle
- Human Genetics Center, University of Texas Health Science Center,
Houston, TX
- Human Genome Sequencing Center, Baylor College of Medicine,
Houston, TX
| | - Yalda Jamshidi
- Human Genetics Research Centre, St George’s University of
London, London, United Kingdom
| | - Nona Sotoodehnia
- Cardiovascular Health Research Unit, Department of Medicine,
University of Washington, Seattle, WA
- Division of Cardiology, University of Washington, Seattle,
WA
| | | |
Collapse
|
1504
|
Lohmueller KE. The impact of population demography and selection on the genetic architecture of complex traits. PLoS Genet 2014; 10:e1004379. [PMID: 24875776 PMCID: PMC4038606 DOI: 10.1371/journal.pgen.1004379] [Citation(s) in RCA: 98] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2013] [Accepted: 03/28/2014] [Indexed: 02/06/2023] Open
Abstract
Population genetic studies have found evidence for dramatic population growth in recent human history. It is unclear how this recent population growth, combined with the effects of negative natural selection, has affected patterns of deleterious variation, as well as the number, frequency, and effect sizes of mutations that contribute risk to complex traits. Because researchers are performing exome sequencing studies aimed at uncovering the role of low-frequency variants in the risk of complex traits, this topic is of critical importance. Here I use simulations under population genetic models where a proportion of the heritability of the trait is accounted for by mutations in a subset of the exome. I show that recent population growth increases the proportion of nonsynonymous variants segregating in the population, but does not affect the genetic load relative to a population that did not expand. Under a model where a mutation's effect on a trait is correlated with its effect on fitness, rare variants explain a greater portion of the additive genetic variance of the trait in a population that has recently expanded than in a population that did not recently expand. Further, when using a single-marker test, for a given false-positive rate and sample size, recent population growth decreases the expected number of significant associations with the trait relative to the number detected in a population that did not expand. However, in a model where there is no correlation between a mutation's effect on fitness and the effect on the trait, common variants account for much of the additive genetic variance, regardless of demography. Moreover, here demography does not affect the number of significant associations detected. These findings suggest recent population history may be an important factor influencing the power of association tests and in accounting for the missing heritability of certain complex traits.
Collapse
Affiliation(s)
- Kirk E Lohmueller
- Department of Ecology and Evolutionary Biology, Interdepartmental Program in Bioinformatics, University of California, Los Angeles, California, United States of America
| |
Collapse
|
1505
|
King CR, Nicolae DL. GWAS to Sequencing: Divergence in Study Design and Analysis. Genes (Basel) 2014; 5:460-76. [PMID: 24879455 PMCID: PMC4094943 DOI: 10.3390/genes5020460] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2013] [Revised: 05/13/2014] [Accepted: 05/15/2014] [Indexed: 12/03/2022] Open
Abstract
The success of genome-wide association studies (GWAS) in uncovering genetic risk factors for complex traits has generated great promise for the complete data generated by sequencing. The bumpy transition from GWAS to whole-exome or whole-genome association studies (WGAS) based on sequencing investigations has highlighted important differences in analysis and interpretation. We show how the loss in power due to the allele frequency spectrum targeted by sequencing is difficult to compensate for with realistic effect sizes and point to study designs that may help. We discuss several issues in interpreting the results, including a special case of the winner's curse. Extrapolation and prediction using rare SNPs is complex, because of the selective ascertainment of SNPs in case-control studies and the low amount of information at each SNP, and naive procedures are biased under the alternative. We also discuss the challenges in tuning gene-based tests and accounting for multiple testing when genes have very different sets of SNPs. The examples we emphasize in this paper highlight the difficult road we must travel for a two-letter switch.
Collapse
Affiliation(s)
| | - Dan L Nicolae
- Departments of Medicine, Statistics, and Human Genetics, University of Chicago, Chicago,IL 60637, USA.
| |
Collapse
|
1506
|
Wang M, Lin S. FamLBL: detecting rare haplotype disease association based on common SNPs using case-parent triads. ACTA ACUST UNITED AC 2014; 30:2611-8. [PMID: 24849576 DOI: 10.1093/bioinformatics/btu347] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
MOTIVATION In recent years, there has been an increasing interest in using common single-nucleotide polymorphisms (SNPs) amassed in genome-wide association studies to investigate rare haplotype effects on complex diseases. Evidence has suggested that rare haplotypes may tag rare causal single-nucleotide variants, making SNP-based rare haplotype analysis not only cost effective, but also more valuable for detecting causal variants. Although a number of methods for detecting rare haplotype association have been proposed in recent years, they are population based and thus susceptible to population stratification. RESULTS We propose family-triad-based logistic Bayesian Lasso (famLBL) for estimating effects of haplotypes on complex diseases using SNP data. By choosing appropriate prior distribution, effect sizes of unassociated haplotypes can be shrunk toward zero, allowing for more precise estimation of associated haplotypes, especially those that are rare, thereby achieving greater detection power. We evaluate famLBL using simulation to gauge its type I error and power. Compared with its population counterpart, LBL, highlights famLBL's robustness property in the presence of population substructure. Further investigation by comparing famLBL with Family-Based Association Test (FBAT) reveals its advantage for detecting rare haplotype association. AVAILABILITY AND IMPLEMENTATION famLBL is implemented as an R-package available at http://www.stat.osu.edu/∼statgen/SOFTWARE/LBL/.
Collapse
Affiliation(s)
- Meng Wang
- Department of Statistics, The Ohio State University, Columbus, OH 43210, USA
| | - Shili Lin
- Department of Statistics, The Ohio State University, Columbus, OH 43210, USA
| |
Collapse
|
1507
|
Yan Q, Tiwari HK, Yi N, Lin WY, Gao G, Lou XY, Cui X, Liu N. Kernel-machine testing coupled with a rank-truncation method for genetic pathway analysis. Genet Epidemiol 2014; 38:447-56. [PMID: 24849109 DOI: 10.1002/gepi.21813] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2013] [Revised: 04/09/2014] [Accepted: 04/10/2014] [Indexed: 01/09/2023]
Abstract
Traditional genome-wide association studies (GWASs) usually focus on single-marker analysis, which only accesses marginal effects. Pathway analysis, on the other hand, considers biological pathway gene marker hierarchical structure and therefore provides additional insights into the genetic architecture underlining complex diseases. Recently, a number of methods for pathway analysis have been proposed to assess the significance of a biological pathway from a collection of single-nucleotide polymorphisms. In this study, we propose a novel approach for pathway analysis that assesses the effects of genes using the sequence kernel association test and the effects of pathways using an extended adaptive rank truncated product statistic. It has been increasingly recognized that complex diseases are caused by both common and rare variants. We propose a new weighting scheme for genetic variants across the whole allelic frequency spectrum to be analyzed together without any form of frequency cutoff for defining rare variants. The proposed approach is flexible. It is applicable to both binary and continuous traits, and incorporating covariates is easy. Furthermore, it can be readily applied to GWAS data, exome-sequencing data, and deep resequencing data. We evaluate the new approach on data simulated under comprehensive scenarios and show that it has the highest power in most of the scenarios while maintaining the correct type I error rate. We also apply our proposed methodology to data from a study of the association between bipolar disorder and candidate pathways from Wellcome Trust Case Control Consortium (WTCCC) to show its utility.
Collapse
Affiliation(s)
- Qi Yan
- Department of Biostatistics, University of Alabama at Birmingham, Birmingham, Alabama, United States of America
| | | | | | | | | | | | | | | |
Collapse
|
1508
|
Fan R, Wang Y, Mills JL, Wilson AF, Bailey-Wilson JE, Xiong M. Functional linear models for association analysis of quantitative traits. Genet Epidemiol 2014; 37:726-42. [PMID: 24130119 DOI: 10.1002/gepi.21757] [Citation(s) in RCA: 50] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2013] [Revised: 07/15/2013] [Accepted: 08/14/2013] [Indexed: 12/19/2022]
Abstract
Functional linear models are developed in this paper for testing associations between quantitative traits and genetic variants, which can be rare variants or common variants or the combination of the two. By treating multiple genetic variants of an individual in a human population as a realization of a stochastic process, the genome of an individual in a chromosome region is a continuum of sequence data rather than discrete observations. The genome of an individual is viewed as a stochastic function that contains both linkage and linkage disequilibrium (LD) information of the genetic markers. By using techniques of functional data analysis, both fixed and mixed effect functional linear models are built to test the association between quantitative traits and genetic variants adjusting for covariates. After extensive simulation analysis, it is shown that the F-distributed tests of the proposed fixed effect functional linear models have higher power than that of sequence kernel association test (SKAT) and its optimal unified test (SKAT-O) for three scenarios in most cases: (1) the causal variants are all rare, (2) the causal variants are both rare and common, and (3) the causal variants are common. The superior performance of the fixed effect functional linear models is most likely due to its optimal utilization of both genetic linkage and LD information of multiple genetic variants in a genome and similarity among different individuals, while SKAT and SKAT-O only model the similarities and pairwise LD but do not model linkage and higher order LD information sufficiently. In addition, the proposed fixed effect models generate accurate type I error rates in simulation studies. We also show that the functional kernel score tests of the proposed mixed effect functional linear models are preferable in candidate gene analysis and small sample problems. The methods are applied to analyze three biochemical traits in data from the Trinity Students Study.
Collapse
Affiliation(s)
- Ruzong Fan
- Biostatistics and Bioinformatics Branch, Division of Intramural Population Health Research, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Rockville, Maryland, United States of America
| | | | | | | | | | | |
Collapse
|
1509
|
Abstract
This article focuses on conducting global testing for association between a binary trait and a set of rare variants (RVs), although its application can be much broader to other types of traits, common variants (CVs), and gene set or pathway analysis. We show that many of the existing tests have deteriorating performance in the presence of many nonassociated RVs: their power can dramatically drop as the proportion of nonassociated RVs in the group to be tested increases. We propose a class of so-called sum of powered score (SPU) tests, each of which is based on the score vector from a general regression model and hence can deal with different types of traits and adjust for covariates, e.g., principal components accounting for population stratification. The SPU tests generalize the sum test, a representative burden test based on pooling or collapsing genotypes of RVs, and a sum of squared score (SSU) test that is closely related to several other powerful variance component tests; a previous study (Basu and Pan 2011) has demonstrated good performance of one, but not both, of the Sum and SSU tests in many situations. The SPU tests are versatile in the sense that one of them is often powerful, although its identity varies with the unknown true association parameters. We propose an adaptive SPU (aSPU) test to approximate the most powerful SPU test for a given scenario, consequently maintaining high power and being highly adaptive across various scenarios. We conducted extensive simulations to show superior performance of the aSPU test over several state-of-the-art association tests in the presence of many nonassociated RVs. Finally we applied the SPU and aSPU tests to the GAW17 mini-exome sequence data to compare its practical performance with some existing tests, demonstrating their potential usefulness.
Collapse
|
1510
|
Vrieze SI, Feng S, Miller MB, Hicks BM, Pankratz N, Abecasis GR, Iacono WG, McGue M. Rare nonsynonymous exonic variants in addiction and behavioral disinhibition. Biol Psychiatry 2014; 75:783-9. [PMID: 24094508 PMCID: PMC3975816 DOI: 10.1016/j.biopsych.2013.08.027] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/17/2013] [Revised: 08/02/2013] [Accepted: 08/26/2013] [Indexed: 10/26/2022]
Abstract
BACKGROUND Substance use is heritable, but few common genetic variants have been associated with these behaviors. Rare nonsynonymous exonic variants can now be efficiently genotyped, allowing exome-wide association tests. We identified and tested 111,592 nonsynonymous exonic variants for association with behavioral disinhibition and the use/misuse of nicotine, alcohol, and illicit drugs. METHODS Comprehensive genotyping of exonic variation combined with single-variant and gene-based tests of association was conducted in 7181 individuals; 172 candidate addiction genes were evaluated in greater detail. We also evaluated the aggregate effects of nonsynonymous variants on these phenotypes using Genome-wide Complex Trait Analysis. RESULTS No variant or gene was significantly associated with any phenotype. No association was found for any of the 172 candidate genes, even at reduced significance thresholds. All nonsynonymous variants jointly accounted for 35% of the heritability in illicit drug use and, when combined with common variants from a genome-wide array, accounted for 84% of the heritability. CONCLUSIONS Rare nonsynonymous variants may be important in etiology of illicit drug use, but detection of individual variants will require very large samples.
Collapse
Affiliation(s)
- Scott I Vrieze
- Center for Statistical Genetics (SIV, SF, GRA), Department of Biostatistics, University of Michigan, Ann Arbor, Michigan.
| | - Shuang Feng
- Center for Statistical Genetics (SIV, SF, GRA), Department of Biostatistics, University of Michigan, Ann Arbor, Michigan
| | - Michael B Miller
- Department of Psychology (MBM, WGI, MM), University of Minnesota, Minneapolis, Minnesota
| | - Brian M Hicks
- Department of Psychiatry (BMH), University of Michigan, Ann Arbor, Michigan
| | - Nathan Pankratz
- Department of Laboratory Medicine and Pathology (NP), University of Minnesota, Minneapolis, Minnesota
| | - Gonçalo R Abecasis
- Center for Statistical Genetics (SIV, SF, GRA), Department of Biostatistics, University of Michigan, Ann Arbor, Michigan
| | - William G Iacono
- Department of Psychology (MBM, WGI, MM), University of Minnesota, Minneapolis, Minnesota
| | - Matt McGue
- Department of Psychology (MBM, WGI, MM), University of Minnesota, Minneapolis, Minnesota
| |
Collapse
|
1511
|
Logsdon BA, Dai JY, Auer PL, Johnsen JM, Ganesh SK, Smith NL, Wilson JG, Tracy RP, Lange LA, Jiao S, Rich SS, Lettre G, Carlson CS, Jackson RD, O'Donnell CJ, Wurfel MM, Nickerson DA, Tang H, Reiner AP, Kooperberg C. A variational Bayes discrete mixture test for rare variant association. Genet Epidemiol 2014; 38:21-30. [PMID: 24482836 DOI: 10.1002/gepi.21772] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
Recently, many statistical methods have been proposed to test for associations between rare genetic variants and complex traits. Most of these methods test for association by aggregating genetic variations within a predefined region, such as a gene. Although there is evidence that "aggregate" tests are more powerful than the single marker test, these tests generally ignore neutral variants and therefore are unable to identify specific variants driving the association with phenotype. We propose a novel aggregate rare-variant test that explicitly models a fraction of variants as neutral, tests associations at the gene-level, and infers the rare-variants driving the association. Simulations show that in the practical scenario where there are many variants within a given region of the genome with only a fraction causal our approach has greater power compared to other popular tests such as the Sequence Kernel Association Test (SKAT), the Weighted Sum Statistic (WSS), and the collapsing method of Morris and Zeggini (MZ). Our algorithm leverages a fast variational Bayes approximate inference methodology to scale to exome-wide analyses, a significant computational advantage over exact inference model selection methodologies. To demonstrate the efficacy of our methodology we test for associations between von Willebrand Factor (VWF) levels and VWF missense rare-variants imputed from the National Heart, Lung, and Blood Institute's Exome Sequencing project into 2,487 African Americans within the VWF gene. Our method suggests that a relatively small fraction (~10%) of the imputed rare missense variants within VWF are strongly associated with lower VWF levels in African Americans.
Collapse
|
1512
|
Zakharov S, Wang X, Liu J, Teo YY. Improving power for robust trans-ethnic meta-analysis of rare and low-frequency variants with a partitioning approach. Eur J Hum Genet 2014; 23:238-44. [PMID: 24801758 DOI: 10.1038/ejhg.2014.78] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2013] [Revised: 02/20/2014] [Accepted: 04/04/2014] [Indexed: 01/06/2023] Open
Abstract
While genome-wide association studies have discovered numerous bona fide variants that are associated with common diseases and complex traits; these variants tend to be common in the population and explain only a small proportion of the phenotype variance. The search for the missing heritability has thus switched to rare and low-frequency variants, defined as <5% in the population, but which are expected to have a bigger impact on phenotypic outcomes. The rarer nature of these variants coupled with the curse of testing multiple variants across the genome meant that large sample sizes will still be required despite the assumption of bigger effect sizes. Combining data from multiple studies in a meta-analysis will continue to be the natural approach in boosting sample sizes. However, the population genetics of rare variants suggests that allelic and effect size heterogeneity across populations of different ancestries is likely to pose a greater challenge to trans-ethnic meta-analysis of rare variants than to similar analyses of common variants. Here, we introduce a novel method to perform trans-ethnic meta-analysis of rare and low-frequency variants. The approach is centered on partitioning the studies into distinct clusters using local inference of genomic similarity between population groups, with the aim to minimize both the number of clusters and between-study heterogeneity in each cluster. Through a series of simulations, we show that our approach either performs similarly to or outperforms conventional and recently introduced meta-analysis strategies, particularly in the presence of allelic heterogeneity.
Collapse
Affiliation(s)
- Sergii Zakharov
- 1] Saw Swee Hock School of Public Health, National University of Singapore, Singapore, Singapore [2] Genome Institute of Singapore, Agency for Science, Technology and Research, Singapore, Singapore
| | - Xu Wang
- Saw Swee Hock School of Public Health, National University of Singapore, Singapore, Singapore
| | - Jianjun Liu
- Genome Institute of Singapore, Agency for Science, Technology and Research, Singapore, Singapore
| | - Yik-Ying Teo
- 1] Saw Swee Hock School of Public Health, National University of Singapore, Singapore, Singapore [2] Genome Institute of Singapore, Agency for Science, Technology and Research, Singapore, Singapore [3] Department of Statistics and Applied Probability, National University of Singapore, Singapore, Singapore [4] NUS Graduate School for Integrative Science and Engineering, National University of Singapore, Singapore, Singapore [5] Life Sciences Institute, National University of Singapore, Singapore, Singapore
| |
Collapse
|
1513
|
Zhang F, Boerwinkle E, Xiong M. Epistasis analysis for quantitative traits by functional regression model. Genome Res 2014; 24:989-98. [PMID: 24803592 PMCID: PMC4032862 DOI: 10.1101/gr.161760.113] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
The critical barrier in interaction analysis for rare variants is that most traditional statistical methods for testing interactions were originally designed for testing the interaction between common variants and are difficult to apply to rare variants because of their prohibitive computational time and poor ability. The great challenges for successful detection of interactions with next-generation sequencing (NGS) data are (1) lack of methods for interaction analysis with rare variants, (2) severe multiple testing, and (3) time-consuming computations. To meet these challenges, we shift the paradigm of interaction analysis between two loci to interaction analysis between two sets of loci or genomic regions and collectively test interactions between all possible pairs of SNPs within two genomic regions. In other words, we take a genome region as a basic unit of interaction analysis and use high-dimensional data reduction and functional data analysis techniques to develop a novel functional regression model to collectively test interactions between all possible pairs of single nucleotide polymorphisms (SNPs) within two genome regions. By intensive simulations, we demonstrate that the functional regression models for interaction analysis of the quantitative trait have the correct type 1 error rates and a much better ability to detect interactions than the current pairwise interaction analysis. The proposed method was applied to exome sequence data from the NHLBI’s Exome Sequencing Project (ESP) and CHARGE-S study. We discovered 27 pairs of genes showing significant interactions after applying the Bonferroni correction (P-values < 4.58 × 10−10) in the ESP, and 11 were replicated in the CHARGE-S study.
Collapse
Affiliation(s)
- Futao Zhang
- Institute of Bioinformatics, Zhejiang University, Hangzhou, Zhejiang 310058, China; Human Genetics Center, Division of Biostatistics, The University of Texas School of Public Health, Houston, Texas 77030, USA
| | - Eric Boerwinkle
- Human Genetics Center, Division of Biostatistics, The University of Texas School of Public Health, Houston, Texas 77030, USA
| | - Momiao Xiong
- Human Genetics Center, Division of Biostatistics, The University of Texas School of Public Health, Houston, Texas 77030, USA
| |
Collapse
|
1514
|
Tang ZZ, Lin DY. Meta-analysis of sequencing studies with heterogeneous genetic associations. Genet Epidemiol 2014; 38:389-401. [PMID: 24799183 DOI: 10.1002/gepi.21798] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2013] [Revised: 02/05/2014] [Accepted: 02/06/2014] [Indexed: 01/06/2023]
Abstract
Recent advances in sequencing technologies have made it possible to explore the influence of rare variants on complex diseases and traits. Meta-analysis is essential to this exploration because large sample sizes are required to detect rare variants. Several methods are available to conduct meta-analysis for rare variants under fixed-effects models, which assume that the genetic effects are the same across all studies. In practice, genetic associations are likely to be heterogeneous among studies because of differences in population composition, environmental factors, phenotype and genotype measurements, or analysis method. We propose random-effects models which allow the genetic effects to vary among studies and develop the corresponding meta-analysis methods for gene-level association tests. Our methods take score statistics, rather than individual participant data, as input and thus can accommodate any study designs and any phenotypes. We produce the random-effects versions of all commonly used gene-level association tests, including burden, variable threshold, and variance-component tests. We demonstrate through extensive simulation studies that our random-effects tests are substantially more powerful than the fixed-effects tests in the presence of moderate and high between-study heterogeneity and achieve similar power to the latter when the heterogeneity is low. The usefulness of the proposed methods is further illustrated with data from National Heart, Lung, and Blood Institute Exome Sequencing Project (NHLBI ESP). The relevant software is freely available.
Collapse
Affiliation(s)
- Zheng-Zheng Tang
- Department of Biostatistics, University of North Carolina, Chapel Hill, North Carolina, United States of America
| | | |
Collapse
|
1515
|
Sergouniotis PI, Chakarova C, Murphy C, Becker M, Lenassi E, Arno G, Lek M, MacArthur DG, Bhattacharya SS, Moore AT, Holder GE, Robson AG, Wolfrum U, Webster AR, Plagnol V. Biallelic variants in TTLL5, encoding a tubulin glutamylase, cause retinal dystrophy. Am J Hum Genet 2014; 94:760-9. [PMID: 24791901 PMCID: PMC4067560 DOI: 10.1016/j.ajhg.2014.04.003] [Citation(s) in RCA: 64] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2013] [Accepted: 04/02/2014] [Indexed: 12/30/2022] Open
Abstract
In a subset of inherited retinal degenerations (including cone, cone-rod, and macular dystrophies), cone photoreceptors are more severely affected than rods; ABCA4 mutations are the most common cause of this heterogeneous class of disorders. To identify retinal-disease-associated genes, we performed exome sequencing in 28 individuals with "cone-first" retinal disease and clinical features atypical for ABCA4 retinopathy. We then conducted a gene-based case-control association study with an internal exome data set as the control group. TTLL5, encoding a tubulin glutamylase, was highlighted as the most likely disease-associated gene; 2 of 28 affected subjects harbored presumed loss-of-function variants: c.[1586_1589delAGAG];[1586_1589delAGAG], p.[Glu529Valfs(∗)2];[Glu529Valfs(∗)2], and c.[401delT(;)3354G>A], p.[Leu134Argfs(∗)45(;)Trp1118(∗)]. We then inspected previously collected exome sequence data from individuals with related phenotypes and found two siblings with homozygous nonsense variant c.1627G>T (p.Glu543(∗)) in TTLL5. Subsequently, we tested a panel of 55 probands with retinal dystrophy for TTLL5 mutations; one proband had a homozygous missense change (c.1627G>A [p.Glu543Lys]). The retinal phenotype was highly similar in three of four families; the sibling pair had a more severe, early-onset disease. In human and murine retinae, TTLL5 localized to the centrioles at the base of the connecting cilium. TTLL5 has been previously reported to be essential for the correct function of sperm flagella in mice and play a role in polyglutamylation of primary cilia in vitro. Notably, genes involved in the polyglutamylation and deglutamylation of tubulin have been associated with photoreceptor degeneration in mice. The electrophysiological and fundus autofluorescence imaging presented here should facilitate the molecular diagnosis in further families.
Collapse
Affiliation(s)
| | | | | | - Mirjana Becker
- Institute of Zoology, Focus Program Translational Neurosciences, Johannes Gutenberg University of Mainz, Mainz 55099, Germany
| | - Eva Lenassi
- UCL Institute of Ophthalmology, London EC1V 9EL, UK; Moorfields Eye Hospital, London EC1V 2PD, UK
| | - Gavin Arno
- UCL Institute of Ophthalmology, London EC1V 9EL, UK
| | - Monkol Lek
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA; Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA
| | - Daniel G MacArthur
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA; Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA
| | | | - Anthony T Moore
- UCL Institute of Ophthalmology, London EC1V 9EL, UK; Moorfields Eye Hospital, London EC1V 2PD, UK
| | - Graham E Holder
- UCL Institute of Ophthalmology, London EC1V 9EL, UK; Moorfields Eye Hospital, London EC1V 2PD, UK
| | - Anthony G Robson
- UCL Institute of Ophthalmology, London EC1V 9EL, UK; Moorfields Eye Hospital, London EC1V 2PD, UK
| | - Uwe Wolfrum
- Institute of Zoology, Focus Program Translational Neurosciences, Johannes Gutenberg University of Mainz, Mainz 55099, Germany
| | - Andrew R Webster
- UCL Institute of Ophthalmology, London EC1V 9EL, UK; Moorfields Eye Hospital, London EC1V 2PD, UK.
| | | |
Collapse
|
1516
|
Kinnamon DD, Martin ER. Valid Monte Carlo permutation tests for genetic case-control studies with missing genotypes. Genet Epidemiol 2014; 38:325-44. [PMID: 24723341 PMCID: PMC6391735 DOI: 10.1002/gepi.21805] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2013] [Revised: 12/30/2013] [Accepted: 02/28/2014] [Indexed: 02/04/2023]
Abstract
Monte Carlo permutation tests can be formally constructed by choosing a set of permutations of individual indices and a real-valued test statistic measuring the association between genotypes and affection status. In this paper, we develop a rigorous theoretical framework for verifying the validity of these tests when there are missing genotypes. We begin by specifying a nonparametric probability model for the observed genotype data in a genetic case-control study with unrelated subjects. Under this model and some minimal assumptions about the test statistic, we establish that the resulting Monte Carlo permutation test is exact level α if (1) the chosen set of permutations of individual indices is a group under composition and (2) the distribution of the observed genotype score matrix under the null hypothesis does not change if the assignment of individuals to rows is shuffled according to an arbitrary permutation in this set. We apply these conditions to show that frequently used Monte Carlo permutation tests based on the set of all permutations of individual indices are guaranteed to be exact level α only for missing data processes satisfying a rather restrictive additional assumption. However, if the missing data process depends on covariates that are all identified and recorded, we also show that Monte Carlo permutation tests based on the set of permutations within strata of individuals with identical covariate values are exact level α. Our theoretical results are verified and supplemented by simulations for a variety of missing data processes and test statistics.
Collapse
Affiliation(s)
- Daniel D. Kinnamon
- Division of Human Genetics, Department of Internal Medicine, The
Ohio State University Wexner Medical Center, Columbus, OH, USA
- Dr. John T. Macdonald Foundation Department of Human Genetics,
University of Miami Miller School of Medicine, Miami, FL, USA
| | - Eden R. Martin
- Dr. John T. Macdonald Foundation Department of Human Genetics,
University of Miami Miller School of Medicine, Miami, FL, USA
| |
Collapse
|
1517
|
Simulation of Finnish population history, guided by empirical genetic data, to assess power of rare-variant tests in Finland. Am J Hum Genet 2014; 94:710-20. [PMID: 24768551 DOI: 10.1016/j.ajhg.2014.03.019] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2013] [Accepted: 03/27/2014] [Indexed: 12/18/2022] Open
Abstract
Finnish samples have been extensively utilized in studying single-gene disorders, where the founder effect has clearly aided in discovery, and more recently in genome-wide association studies of complex traits, where the founder effect has had less obvious impacts. As the field starts to explore rare variants' contribution to polygenic traits, it is of great importance to characterize and confirm the Finnish founder effect in sequencing data and to assess its implications for rare-variant association studies. Here, we employ forward simulation, guided by empirical deep resequencing data, to model the genetic architecture of quantitative polygenic traits in both the general European and the Finnish populations simultaneously. We demonstrate that power of rare-variant association tests is higher in the Finnish population, especially when variants' phenotypic effects are tightly coupled with fitness effects and therefore reflect a greater contribution of rarer variants. SKAT-O, variable-threshold tests, and single-variant tests are more powerful than other rare-variant methods in the Finnish population across a range of genetic models. We also compare the relative power and efficiency of exome array genotyping to those of high-coverage exome sequencing. At a fixed cost, less expensive genotyping strategies have far greater power than sequencing; in a fixed number of samples, however, genotyping arrays miss a substantial portion of genetic signals detected in sequencing, even in the Finnish founder population. As genetic studies probe sequence variation at greater depth in more diverse populations, our simulation approach provides a framework for evaluating various study designs for gene discovery.
Collapse
|
1518
|
Wain LV, Sayers I, Soler Artigas M, Portelli MA, Zeggini E, Obeidat M, Sin DD, Bossé Y, Nickle D, Brandsma CA, Malarstig A, Vangjeli C, Jelinsky SA, John S, Kilty I, McKeever T, Shrine NRG, Cook JP, Patel S, Spector TD, Hollox EJ, Hall IP, Tobin MD. Whole exome re-sequencing implicates CCDC38 and cilia structure and function in resistance to smoking related airflow obstruction. PLoS Genet 2014; 10:e1004314. [PMID: 24786987 PMCID: PMC4006731 DOI: 10.1371/journal.pgen.1004314] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2013] [Accepted: 03/06/2014] [Indexed: 11/19/2022] Open
Abstract
Chronic obstructive pulmonary disease (COPD) is a leading cause of global morbidity and mortality and, whilst smoking remains the single most important risk factor, COPD risk is heritable. Of 26 independent genomic regions showing association with lung function in genome-wide association studies, eleven have been reported to show association with airflow obstruction. Although the main risk factor for COPD is smoking, some individuals are observed to have a high forced expired volume in 1 second (FEV1) despite many years of heavy smoking. We hypothesised that these "resistant smokers" may harbour variants which protect against lung function decline caused by smoking and provide insight into the genetic determinants of lung health. We undertook whole exome re-sequencing of 100 heavy smokers who had healthy lung function given their age, sex, height and smoking history and applied three complementary approaches to explore the genetic architecture of smoking resistance. Firstly, we identified novel functional variants in the "resistant smokers" and looked for enrichment of these novel variants within biological pathways. Secondly, we undertook association testing of all exonic variants individually with two independent control sets. Thirdly, we undertook gene-based association testing of all exonic variants. Our strongest signal of association with smoking resistance for a non-synonymous SNP was for rs10859974 (P = 2.34 × 10(-4)) in CCDC38, a gene which has previously been reported to show association with FEV1/FVC, and we demonstrate moderate expression of CCDC38 in bronchial epithelial cells. We identified an enrichment of novel putatively functional variants in genes related to cilia structure and function in resistant smokers. Ciliary function abnormalities are known to be associated with both smoking and reduced mucociliary clearance in patients with COPD. We suggest that genetic influences on the development or function of cilia in the bronchial epithelium may affect growth of cilia or the extent of damage caused by tobacco smoke.
Collapse
Affiliation(s)
- Louise V. Wain
- University of Leicester, Department of Health Sciences, Leicester, United Kingdom
- * E-mail:
| | - Ian Sayers
- Division of Respiratory Medicine, University of Nottingham, Queen's Medical Centre, Nottingham, United Kingdom
| | - María Soler Artigas
- University of Leicester, Department of Health Sciences, Leicester, United Kingdom
| | - Michael A. Portelli
- Division of Respiratory Medicine, University of Nottingham, Queen's Medical Centre, Nottingham, United Kingdom
| | | | - Ma'en Obeidat
- University of British Columbia Centre for Heart Lung Innovation, St. Paul's Hospital, Vancouver, Canada
| | - Don D. Sin
- University of British Columbia Centre for Heart Lung Innovation, St. Paul's Hospital, Vancouver, Canada
| | - Yohan Bossé
- Institut universitaire de cardiologie et de pneumologie de Québec, Department of Molecular Medicine, Laval University, Québec, Canada
| | - David Nickle
- Merck Research Laboratories, Boston, Massachusetts, United States of America
- Merck, Rahway, New Jersey, United States of America
| | - Corry-Anke Brandsma
- University of Groningen, University Medical Center Groningen, Department of Pathology and Medical Biology, GRIAC Research Institute, Groningen, The Netherlands
| | | | | | - Scott A. Jelinsky
- Pfizer Worldwide R&D, Cambridge, Massachusetts, United States of America
| | - Sally John
- Pfizer Worldwide R&D, Cambridge, Massachusetts, United States of America
| | - Iain Kilty
- Pfizer Worldwide R&D, Cambridge, Massachusetts, United States of America
| | - Tricia McKeever
- School of Community Health Sciences, University of Nottingham, Nottingham, United Kingdom
| | - Nick R. G. Shrine
- University of Leicester, Department of Health Sciences, Leicester, United Kingdom
| | - James P. Cook
- University of Leicester, Department of Health Sciences, Leicester, United Kingdom
| | - Shrina Patel
- Department of Twin Research and Genetic Epidemiology, King's College London, London, United Kingdom
| | - Tim D. Spector
- Department of Twin Research and Genetic Epidemiology, King's College London, London, United Kingdom
| | - Edward J. Hollox
- University of Leicester, Department of Genetics, Leicester, United Kingdom
| | - Ian P. Hall
- Division of Respiratory Medicine, University of Nottingham, Queen's Medical Centre, Nottingham, United Kingdom
| | - Martin D. Tobin
- University of Leicester, Department of Health Sciences, Leicester, United Kingdom
- National Institute for Health Research (NIHR) Leicester Respiratory Biomedical Research Unit, Glenfield Hospital, Leicester, United Kingdom
| |
Collapse
|
1519
|
Billings LK, Jablonski KA, Ackerman RJ, Taylor A, Fanelli RR, McAteer JB, Guiducci C, Delahanty LM, Dabelea D, Kahn SE, Franks PW, Hanson RL, Maruthur NM, Shuldiner AR, Mayer-Davis EJ, Knowler WC, Florez JC. The influence of rare genetic variation in SLC30A8 on diabetes incidence and β-cell function. J Clin Endocrinol Metab 2014; 99:E926-30. [PMID: 24471563 PMCID: PMC4010688 DOI: 10.1210/jc.2013-2378] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
CONTEXT/OBJECTIVE The variant rs13266634 in SLC30A8, encoding a β-cell-specific zinc transporter, is associated with type 2 diabetes. We aimed to identify other variants in SLC30A8 that increase diabetes risk and impair β-cell function, and test whether zinc intake modifies this risk. DESIGN/OUTCOME: We sequenced exons in SLC30A8 in 380 Diabetes Prevention Program (DPP) participants and identified 44 novel variants, which were genotyped in 3445 DPP participants and tested for association with diabetes incidence and measures of insulin secretion and processing. We examined individual common variants and used gene burden tests to test 39 rare variants in aggregate. RESULTS We detected a near-nominal association between a rare-variant genotype risk score and diabetes risk. Five common variants were associated with the oral disposition index. Various methods aggregating rare variants demonstrated associations with changes in oral disposition index and insulinogenic index during year 1 of follow-up. We did not find a clear interaction of zinc intake with genotype on diabetes incidence. CONCLUSIONS Individual common and an aggregate of rare genetic variation in SLC30A8 are associated with measures of β-cell function in the DPP. Exploring rare variation may complement ongoing efforts to uncover the genetic influences that underlie complex diseases.
Collapse
Affiliation(s)
- Liana K Billings
- Center for Human Genetic Research (L.K.B., R.J.A., A.T., R.R.F., J.B.M., J.C.F.) and Diabetes Research Center (Diabetes Unit) (L.K.B., L.M.D., J.C.F.), Department of Medicine, Massachusetts General Hospital, Boston, Massachusetts 02114; Department of Medicine (L.K.B., L.M.D., J.C.F.), Harvard Medical School, and Department of Nutrition (P.W.F.), Harvard School of Public Health, Boston, Massachusetts 02115; Department of Medicine (L.K.B.), NorthShore University HealthSystem, Evanston, Illinois 60201; University of Chicago (L.K.B.), Pritzker School of Medicine, Chicago, Illinois 60637; The Biostatistics Center (K.A.J.), George Washington University, Rockville, Maryland 20852; Program in Medical and Population Genetics (A.T., J.B.M., C.G., J.C.F.), Broad Institute, Cambridge, Massachusetts 02142; Department of Epidemiology (D.D.), Colorado School of Public Health, University of Colorado, Denver, Colorado 80045; Division of Metabolism, Endocrinology, and Nutrition (S.E.K.), VA Puget Sound Health Care System and University of Washington, Seattle, Washington 98108; Department of Clinical Sciences (P.W.F.), Genetic and Molecular Epidemiology Unit, Lund University, SE-200 41 Malmö, Sweden; Diabetes Epidemiology and Clinical Research Section (R.L.H., W.C.K.), National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Phoenix, Arizona 85014; Department of Medicine (N.M.M.), Division of General Internal Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205; Department of Medicine (A.R.S.), Division of Endocrinology, Diabetes, and Nutrition, University of Maryland School of Medicine, Baltimore, Maryland 21201; and Department of Nutrition (E.J.M.-D.), University of North Carolina, Gillings School of Global Public Health, Chapel Hill, North Carolina 27599
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
1520
|
Derkach A, Lawless JF, Sun L. Pooled Association Tests for Rare Genetic Variants: A Review and Some New Results. Stat Sci 2014. [DOI: 10.1214/13-sts456] [Citation(s) in RCA: 51] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
1521
|
Wang GT, Peng B, Leal SM. Variant association tools for quality control and analysis of large-scale sequence and genotyping array data. Am J Hum Genet 2014; 94:770-83. [PMID: 24791902 PMCID: PMC4067555 DOI: 10.1016/j.ajhg.2014.04.004] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2014] [Accepted: 04/03/2014] [Indexed: 12/14/2022] Open
Abstract
Currently there is great interest in detecting associations between complex traits and rare variants. In this report, we describe Variant Association Tools (VAT) and the VAT pipeline, which implements best practices for rare-variant association studies. Highlights of VAT include variant-site and call-level quality control (QC), summary statistics, phenotype- and genotype-based sample selection, variant annotation, selection of variants for association analysis, and a collection of rare-variant association methods for analyzing qualitative and quantitative traits. The association testing framework for VAT is regression based, which readily allows for flexible construction of association models with multiple covariates and weighting themes based on allele frequencies or predicted functionality. Additionally, pathway analyses, conditional analyses, and analyses of gene-gene and gene-environment interactions can be performed. VAT is capable of rapidly scanning through data by using multi-process computation, adaptive permutation, and simultaneously conducting association analysis via multiple methods. Results are available in text or graphic file formats and additionally can be output to relational databases for further annotation and filtering. An interface to R language also facilitates user implementation of novel association methods. The VAT's data QC and association-analysis pipeline can be applied to sequence, imputed, and genotyping array, e.g., "exome chip," data, providing a reliable and reproducible computational environment in which to analyze small- to large-scale studies with data from the latest genotyping and sequencing technologies. Application of the VAT pipeline is demonstrated through analysis of data from the 1000 Genomes project.
Collapse
Affiliation(s)
- Gao T Wang
- Center for Statistical Genetics, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Bo Peng
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Suzanne M Leal
- Center for Statistical Genetics, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA.
| |
Collapse
|
1522
|
Wang GT, Li B, Santos-Cortez RPL, Peng B, Leal SM. Power analysis and sample size estimation for sequence-based association studies. Bioinformatics 2014; 30:2377-8. [PMID: 24778108 DOI: 10.1093/bioinformatics/btu296] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Open
Abstract
MOTIVATION Statistical methods have been developed to test for complex trait rare variant (RV) associations, in which variants are aggregated across a region, which is typically a gene. Power analysis and sample size estimation for sequence-based RV association studies are challenging because of the necessity to realistically model the underlying allelic architecture of complex diseases within a suitable analytical framework to assess the performance of a variety of RV association methods in an unbiased manner. SUMMARY We developed SEQPower, a software package to perform statistical power analysis for sequence-based association data under a variety of genetic variant and disease phenotype models. It aids epidemiologists in determining the best study design, sample size and statistical tests for sequence-based association studies. It also provides biostatisticians with a platform to fairly compare RV association methods and to validate and assess novel association tests. AVAILABILITY AND IMPLEMENTATION The SEQPower program, source code, multi-platform executables, documentation, list of association tests, examples and tutorials are available at http://bioinformatics.org/spower.
Collapse
Affiliation(s)
- Gao T Wang
- Center for Statistical Genetics, Department of Molecular and Human Genetics, Baylor College of Medicine and Department of Bioinformatics and Computational Biology, The University of Texas, M D Anderson Cancer Center, Houston, TX 77030, USA
| | - Biao Li
- Center for Statistical Genetics, Department of Molecular and Human Genetics, Baylor College of Medicine and Department of Bioinformatics and Computational Biology, The University of Texas, M D Anderson Cancer Center, Houston, TX 77030, USA
| | - Regie P Lyn Santos-Cortez
- Center for Statistical Genetics, Department of Molecular and Human Genetics, Baylor College of Medicine and Department of Bioinformatics and Computational Biology, The University of Texas, M D Anderson Cancer Center, Houston, TX 77030, USA
| | - Bo Peng
- Center for Statistical Genetics, Department of Molecular and Human Genetics, Baylor College of Medicine and Department of Bioinformatics and Computational Biology, The University of Texas, M D Anderson Cancer Center, Houston, TX 77030, USA
| | - Suzanne M Leal
- Center for Statistical Genetics, Department of Molecular and Human Genetics, Baylor College of Medicine and Department of Bioinformatics and Computational Biology, The University of Texas, M D Anderson Cancer Center, Houston, TX 77030, USA
| |
Collapse
|
1523
|
The three genetics (nuclear DNA, mitochondrial DNA, and gut microbiome) of longevity in humans considered as metaorganisms. BIOMED RESEARCH INTERNATIONAL 2014; 2014:560340. [PMID: 24868529 PMCID: PMC4017728 DOI: 10.1155/2014/560340] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/03/2014] [Accepted: 03/25/2014] [Indexed: 02/03/2023]
Abstract
Usually the genetics of human longevity is restricted to the nuclear genome (nDNA). However it is well known that the nDNA interacts with a physically and functionally separated genome, the mitochondrial DNA (mtDNA) that, even if limited in length and number of genes encoded, plays a major role in the ageing process. The complex interplay between nDNA/mtDNA and the environment is most likely involved in phenomena such as ageing and longevity. To this scenario we have to add another level of complexity represented by the microbiota, that is, the whole set of bacteria present in the different part of our body with their whole set of genes. In particular, several studies investigated the role of gut microbiota (GM) modifications in ageing and longevity and an age-related GM signature was found. In this view, human being must be considered as “metaorganism” and a more holistic approach is necessary to grasp the complex dynamics of the interaction between the environment and nDNA-mtDNA-GM of the host during ageing. In this review, the relationship between the three genetics and human longevity is addressed to point out that a comprehensive view will allow the researchers to properly address the complex interactions that occur during human lifespan.
Collapse
|
1524
|
Sun H, Wang S. A power set-based statistical selection procedure to locate susceptible rare variants associated with complex traits with sequencing data. Bioinformatics 2014; 30:2317-23. [PMID: 24755303 DOI: 10.1093/bioinformatics/btu207] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
MOTIVATION Existing association methods for rare variants from sequencing data have focused on aggregating variants in a gene or a genetic region because of the fact that analysing individual rare variants is underpowered. However, these existing rare variant detection methods are not able to identify which rare variants in a gene or a genetic region of all variants are associated with the complex diseases or traits. Once phenotypic associations of a gene or a genetic region are identified, the natural next step in the association study with sequencing data is to locate the susceptible rare variants within the gene or the genetic region. RESULTS In this article, we propose a power set-based statistical selection procedure that is able to identify the locations of the potentially susceptible rare variants within a disease-related gene or a genetic region. The selection performance of the proposed selection procedure was evaluated through simulation studies, where we demonstrated the feasibility and superior power over several comparable existing methods. In particular, the proposed method is able to handle the mixed effects when both risk and protective variants are present in a gene or a genetic region. The proposed selection procedure was also applied to the sequence data on the ANGPTL gene family from the Dallas Heart Study to identify potentially susceptible rare variants within the trait-related genes. AVAILABILITY AND IMPLEMENTATION An R package 'rvsel' can be downloaded from http://www.columbia.edu/∼sw2206/ and http://statsun.pusan.ac.kr.
Collapse
Affiliation(s)
- Hokeun Sun
- Department of Statistics, Pusan National University, Pusan 609-735, Korea and Department of Biostatistics, Mailman School of Public Health, Columbia University, New York, NY 10032, USA
| | - Shuang Wang
- Department of Statistics, Pusan National University, Pusan 609-735, Korea and Department of Biostatistics, Mailman School of Public Health, Columbia University, New York, NY 10032, USA
| |
Collapse
|
1525
|
Lin WY. Association testing of clustered rare causal variants in case-control studies. PLoS One 2014; 9:e94337. [PMID: 24736372 PMCID: PMC3988195 DOI: 10.1371/journal.pone.0094337] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2014] [Accepted: 03/12/2014] [Indexed: 11/18/2022] Open
Abstract
Biological evidence suggests that multiple causal variants in a gene may cluster physically. Variants within the same protein functional domain or gene regulatory element would locate in close proximity on the DNA sequence. However, spatial information of variants is usually not used in current rare variant association analyses. We here propose a clustering method (abbreviated as "CLUSTER"), which is extended from the adaptive combination of P-values. Our method combines the association signals of variants that are more likely to be causal. Furthermore, the statistic incorporates the spatial information of variants. With extensive simulations, we show that our method outperforms several commonly-used methods in many scenarios. To demonstrate its use in real data analyses, we also apply this CLUSTER test to the Dallas Heart Study data. CLUSTER is among the best methods when the effects of causal variants are all in the same direction. As variants located in close proximity are more likely to have similar impact on disease risk, CLUSTER is recommended for association testing of clustered rare causal variants in case-control studies.
Collapse
Affiliation(s)
- Wan-Yu Lin
- Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan
| |
Collapse
|
1526
|
Derkach A, Chiang T, Gong J, Addis L, Dobbins S, Tomlinson I, Houlston R, Pal DK, Strug LJ. Association analysis using next-generation sequence data from publicly available control groups: the robust variance score statistic. ACTA ACUST UNITED AC 2014; 30:2179-88. [PMID: 24733292 DOI: 10.1093/bioinformatics/btu196] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
MOTIVATION Sufficiently powered case-control studies with next-generation sequence (NGS) data remain prohibitively expensive for many investigators. If feasible, a more efficient strategy would be to include publicly available sequenced controls. However, these studies can be confounded by differences in sequencing platform; alignment, single nucleotide polymorphism and variant calling algorithms; read depth; and selection thresholds. Assuming one can match cases and controls on the basis of ethnicity and other potential confounding factors, and one has access to the aligned reads in both groups, we investigate the effect of systematic differences in read depth and selection threshold when comparing allele frequencies between cases and controls. We propose a novel likelihood-based method, the robust variance score (RVS), that substitutes genotype calls by their expected values given observed sequence data. RESULTS We show theoretically that the RVS eliminates read depth bias in the estimation of minor allele frequency. We also demonstrate that, using simulated and real NGS data, the RVS method controls Type I error and has comparable power to the 'gold standard' analysis with the true underlying genotypes for both common and rare variants. AVAILABILITY AND IMPLEMENTATION An RVS R script and instructions can be found at strug.research.sickkids.ca, and at https://github.com/strug-lab/RVS. CONTACT lisa.strug@utoronto.ca SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Andriy Derkach
- Department of Statistical Science, University of Toronto, Toronto, ON, Canada, Program in Child Health Evaluative Sciences, the Hospital for Sick Children Research Institute, Toronto, ON, Canada, Department of Clinical Neuroscience, Institute of Psychiatry, King's College London, London, Division of Genetics and Epidemiology, Institute of Cancer Research, Sutton, Surrey, Molecular and Population Genetics and NIHR Comprehensive Biomedical Research Centre, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK, Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
| | - Theodore Chiang
- Department of Statistical Science, University of Toronto, Toronto, ON, Canada, Program in Child Health Evaluative Sciences, the Hospital for Sick Children Research Institute, Toronto, ON, Canada, Department of Clinical Neuroscience, Institute of Psychiatry, King's College London, London, Division of Genetics and Epidemiology, Institute of Cancer Research, Sutton, Surrey, Molecular and Population Genetics and NIHR Comprehensive Biomedical Research Centre, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK, Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
| | - Jiafen Gong
- Department of Statistical Science, University of Toronto, Toronto, ON, Canada, Program in Child Health Evaluative Sciences, the Hospital for Sick Children Research Institute, Toronto, ON, Canada, Department of Clinical Neuroscience, Institute of Psychiatry, King's College London, London, Division of Genetics and Epidemiology, Institute of Cancer Research, Sutton, Surrey, Molecular and Population Genetics and NIHR Comprehensive Biomedical Research Centre, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK, Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
| | - Laura Addis
- Department of Statistical Science, University of Toronto, Toronto, ON, Canada, Program in Child Health Evaluative Sciences, the Hospital for Sick Children Research Institute, Toronto, ON, Canada, Department of Clinical Neuroscience, Institute of Psychiatry, King's College London, London, Division of Genetics and Epidemiology, Institute of Cancer Research, Sutton, Surrey, Molecular and Population Genetics and NIHR Comprehensive Biomedical Research Centre, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK, Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
| | - Sara Dobbins
- Department of Statistical Science, University of Toronto, Toronto, ON, Canada, Program in Child Health Evaluative Sciences, the Hospital for Sick Children Research Institute, Toronto, ON, Canada, Department of Clinical Neuroscience, Institute of Psychiatry, King's College London, London, Division of Genetics and Epidemiology, Institute of Cancer Research, Sutton, Surrey, Molecular and Population Genetics and NIHR Comprehensive Biomedical Research Centre, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK, Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
| | - Ian Tomlinson
- Department of Statistical Science, University of Toronto, Toronto, ON, Canada, Program in Child Health Evaluative Sciences, the Hospital for Sick Children Research Institute, Toronto, ON, Canada, Department of Clinical Neuroscience, Institute of Psychiatry, King's College London, London, Division of Genetics and Epidemiology, Institute of Cancer Research, Sutton, Surrey, Molecular and Population Genetics and NIHR Comprehensive Biomedical Research Centre, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK, Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
| | - Richard Houlston
- Department of Statistical Science, University of Toronto, Toronto, ON, Canada, Program in Child Health Evaluative Sciences, the Hospital for Sick Children Research Institute, Toronto, ON, Canada, Department of Clinical Neuroscience, Institute of Psychiatry, King's College London, London, Division of Genetics and Epidemiology, Institute of Cancer Research, Sutton, Surrey, Molecular and Population Genetics and NIHR Comprehensive Biomedical Research Centre, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK, Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
| | - Deb K Pal
- Department of Statistical Science, University of Toronto, Toronto, ON, Canada, Program in Child Health Evaluative Sciences, the Hospital for Sick Children Research Institute, Toronto, ON, Canada, Department of Clinical Neuroscience, Institute of Psychiatry, King's College London, London, Division of Genetics and Epidemiology, Institute of Cancer Research, Sutton, Surrey, Molecular and Population Genetics and NIHR Comprehensive Biomedical Research Centre, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK, Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
| | - Lisa J Strug
- Department of Statistical Science, University of Toronto, Toronto, ON, Canada, Program in Child Health Evaluative Sciences, the Hospital for Sick Children Research Institute, Toronto, ON, Canada, Department of Clinical Neuroscience, Institute of Psychiatry, King's College London, London, Division of Genetics and Epidemiology, Institute of Cancer Research, Sutton, Surrey, Molecular and Population Genetics and NIHR Comprehensive Biomedical Research Centre, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK, Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, CanadaDepartment of Statistical Science, University of Toronto, Toronto, ON, Canada, Program in Child Health Evaluative Sciences, the Hospital for Sick Children Research Institute, Toronto, ON, Canada, Department of Clinical Neuroscience, Institute of Psychiatry, King's College London, London, Division of Genetics and Epidemiology, Institute of Cancer Research, Sutton, Surrey, Molecular and Population Genetics and NIHR Comprehensive Biomedical Research Centre, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK, Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
| |
Collapse
|
1527
|
He L, Sillanpää MJ, Ripatti S, Pitkäniemi J. Bayesian Latent Variable Collapsing Model for Detecting Rare Variant Interaction Effect in Twin Study. Genet Epidemiol 2014; 38:310-24. [DOI: 10.1002/gepi.21804] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2013] [Revised: 02/28/2014] [Accepted: 02/28/2014] [Indexed: 12/12/2022]
Affiliation(s)
- Liang He
- Department of Public Health; Hjelt Institute; University of Helsinki; Finland
| | - Mikko J. Sillanpää
- Department of Mathematical Sciences; University of Oulu; Oulu Finland
- Department of Biology and Biocenter Oulu; University of Oulu; Oulu Finland
| | - Samuli Ripatti
- Department of Public Health; Hjelt Institute; University of Helsinki; Finland
- Institute for Molecular Medicine Finland FIMM; University of Helsinki; Finland
- Human Genetics; Wellcome Trust Sanger Institute; United Kingdom
| | - Janne Pitkäniemi
- Department of Public Health; Hjelt Institute; University of Helsinki; Finland
- Finnish Cancer Registry; Institute for Statistical and Epidemiological Cancer Research; Helsinki Finland
| |
Collapse
|
1528
|
Kim DS, Crosslin DR, Auer PL, Suzuki SM, Marsillach J, Burt AA, Gordon AS, Meschia JF, Nalls MA, Worrall BB, Longstreth WT, Gottesman RF, Furlong CE, Peters U, Rich SS, Nickerson DA, Jarvik GP. Rare coding variation in paraoxonase-1 is associated with ischemic stroke in the NHLBI Exome Sequencing Project. J Lipid Res 2014; 55:1173-8. [PMID: 24711634 DOI: 10.1194/jlr.p049247] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2014] [Indexed: 11/20/2022] Open
Abstract
HDL-associated paraoxonase-1 (PON1) is an enzyme whose activity is associated with cerebrovascular disease. Common PON1 genetic variants have not been consistently associated with cerebrovascular disease. Rare coding variation that likely alters PON1 enzyme function may be more strongly associated with stroke. The National Heart, Lung, and Blood Institute Exome Sequencing Project sequenced the coding regions (exomes) of the genome for heart, lung, and blood-related phenotypes (including ischemic stroke). In this sample of 4,204 unrelated participants, 496 had verified, noncardioembolic ischemic stroke. After filtering, 28 nonsynonymous PON1 variants were identified. Analysis with the sequence kernel association test, adjusted for covariates, identified significant associations between PON1 variants and ischemic stroke (P = 3.01 × 10(-3)). Stratified analyses demonstrated a stronger association of PON1 variants with ischemic stroke in African ancestry (AA) participants (P = 5.03 × 10(-3)). Ethnic differences in the association between PON1 variants with stroke could be due to the effects of PON1Val109Ile (overall P = 7.88 × 10(-3); AA P = 6.52 × 10(-4)), found at higher frequency in AA participants (1.16% vs. 0.02%) and whose protein is less stable than the common allele. In summary, rare genetic variation in PON1 was associated with ischemic stroke, with stronger associations identified in those of AA. Increased focus on PON1 enzyme function and its role in cerebrovascular disease is warranted.
Collapse
Affiliation(s)
- Daniel Seung Kim
- Division of Medical Genetics, Department of Medicine University of Washington School of Medicine, Seattle, WA Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA
| | - David R Crosslin
- Division of Medical Genetics, Department of Medicine University of Washington School of Medicine, Seattle, WA Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA
| | - Paul L Auer
- Division of Medical Genetics, Department of Medicine University of Washington School of Medicine, Seattle, WA Zilber School of Public Health, University of Wisconsin-Milwaukee, Milwaukee, WI
| | - Stephanie M Suzuki
- Division of Medical Genetics, Department of Medicine University of Washington School of Medicine, Seattle, WA
| | - Judit Marsillach
- Division of Medical Genetics, Department of Medicine University of Washington School of Medicine, Seattle, WA Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA
| | - Amber A Burt
- Division of Medical Genetics, Department of Medicine University of Washington School of Medicine, Seattle, WA
| | - Adam S Gordon
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA
| | | | - Mike A Nalls
- Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD
| | - Bradford B Worrall
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA Department of Neurology, University of Virginia, Charlottesville, VA Department of Public Health Sciences, University of Virginia, Charlottesville, VA
| | - W T Longstreth
- Department of Neurology University of Washington, Seattle, WA Department of Epidemiology, University of Washington, Seattle, WA
| | - Rebecca F Gottesman
- Cerebrovascular Division, Department of Neurology, Johns Hopkins University School of Medicine, Baltimore, MD
| | - Clement E Furlong
- Division of Medical Genetics, Department of Medicine University of Washington School of Medicine, Seattle, WA Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA
| | - Ulrike Peters
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA Department of Epidemiology, University of Washington, Seattle, WA
| | - Stephen S Rich
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA
| | - Deborah A Nickerson
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA
| | | | | |
Collapse
|
1529
|
Al Chawa T, Ludwig KU, Fier H, Pötzsch B, Reich RH, Schmidt G, Braumann B, Daratsianos N, Böhmer AC, Schuencke H, Alblas M, Fricker N, Hoffmann P, Knapp M, Lange C, Nöthen MM, Mangold E. Nonsyndromic cleft lip with or without cleft palate: Increased burden of rare variants within Gremlin-1, a component of the bone morphogenetic protein 4 pathway. ACTA ACUST UNITED AC 2014; 100:493-8. [PMID: 24706492 DOI: 10.1002/bdra.23244] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2013] [Revised: 02/20/2014] [Accepted: 03/10/2014] [Indexed: 11/11/2022]
Abstract
BACKGROUND The genes Gremlin-1 (GREM1) and Noggin (NOG) are components of the bone morphogenetic protein 4 pathway, which has been implicated in craniofacial development. Both genes map to recently identified susceptibility loci (chromosomal region 15q13, 17q22) for nonsyndromic cleft lip with or without cleft palate (nsCL/P). The aim of the present study was to determine whether rare variants in either gene are implicated in nsCL/P etiology. METHODS The complete coding regions, untranslated regions, and splice sites of GREM1 and NOG were sequenced in 96 nsCL/P patients and 96 controls of Central European ethnicity. Three burden and four nonburden tests were performed. Statistically significant results were followed up in a second case-control sample (n = 96, respectively). For rare variants observed in cases, segregation analyses were performed. RESULTS In NOG, four rare sequence variants (minor allele frequency < 1%) were identified. Here, burden and nonburden analyses generated nonsignificant results. In GREM1, 33 variants were identified, 15 of which were rare. Of these, five were novel. Significant p-values were generated in three nonburden analyses. Segregation analyses revealed incomplete penetrance for all variants investigated. CONCLUSION Our study did not provide support for NOG being the causal gene at 17q22. However, the observation of a significant excess of rare variants in GREM1 supports the hypothesis that this is the causal gene at chr. 15q13. Because no single causal variant was identified, future sequencing analyses of GREM1 should involve larger samples and the investigation of regulatory elements.
Collapse
Affiliation(s)
- Taofik Al Chawa
- Institute of Human Genetics, University of Bonn, Bonn, Germany; Klinikverbund St. Antonius und St. Josef, Wuppertal, Germany; Department of Genomics, Life and Brain Center, University of Bonn, Bonn, Germany
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
1530
|
Sevini F, Giuliani C, Vianello D, Giampieri E, Santoro A, Biondi F, Garagnani P, Passarino G, Luiselli D, Capri M, Franceschi C, Salvioli S. mtDNA mutations in human aging and longevity: controversies and new perspectives opened by high-throughput technologies. Exp Gerontol 2014; 56:234-44. [PMID: 24709341 DOI: 10.1016/j.exger.2014.03.022] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2013] [Revised: 03/14/2014] [Accepted: 03/26/2014] [Indexed: 12/21/2022]
Abstract
The last 30 years of research greatly contributed to shed light on the role of mitochondrial DNA (mtDNA) variability in aging, although contrasting results have been reported, mainly due to bias regarding the population size and stratification, and to the use of analysis methods (haplogroup classification) that resulted to be not sufficiently adequate to grasp the complexity of the phenomenon. A 5-years European study (the GEHA EU project) collected and analyzed data on mtDNA variability on an unprecedented number of long-living subjects (enriched for longevity genes) and a comparable number of controls (matched for gender and ethnicity) in Europe. This very large study allowed a reappraisal of the role of both the inherited and the somatic mtDNA variability in aging, as an association with longevity emerged only when mtDNA variants in OXPHOS complexes co-occurred. Moreover, the availability of data from both nuclear and mitochondrial genomes on a large number of subjects paves the way for an evaluation at a very large scale of the epistatic interactions at a higher level of complexity. This scenario is expected to be even more clarified in the next future with the use of next generation sequencing (NGS) techniques, which are becoming applicable to evaluate mtDNA variability and, then, new mathematical/bioinformatic analysis methods are urgently needed. Recent advances of association studies on age-related diseases and mtDNA variability will also be discussed in this review, taking into account the bias hidden by population stratification. Finally, very recent findings in terms of mtDNA heteroplasmy (i.e. the coexistence of wild type and mutated copies of mtDNA) and aging as well as mitochondrial epigenetic mechanisms will also be discussed.
Collapse
Affiliation(s)
- Federica Sevini
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, via S. Giacomo 12, 40126 Bologna, Italy; C.I.G. Interdepartmental Centre L. Galvani for Integrated Studies on Bioinformatics, Biophysics and Biocomplexity, University of Bologna, via S. Giacomo 12, 40126, Bologna, Italy.
| | - Cristina Giuliani
- Department of Biological, Geological and Environmental Sciences, Laboratory of Anthropology, University of Bologna, Via Selmi 3, 40126 Bologna, Italy; Department of Biological, Geological and Environmental Sciences, Centre for Genome Biology, University of Bologna, Via Selmi 3, 40126 Bologna, Italy
| | - Dario Vianello
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, via S. Giacomo 12, 40126 Bologna, Italy
| | - Enrico Giampieri
- Department of Physics and Astronomy, Viale Berti Pichat 6/2, 40126 Bologna, Italy
| | - Aurelia Santoro
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, via S. Giacomo 12, 40126 Bologna, Italy
| | - Fiammetta Biondi
- C.I.G. Interdepartmental Centre L. Galvani for Integrated Studies on Bioinformatics, Biophysics and Biocomplexity, University of Bologna, via S. Giacomo 12, 40126, Bologna, Italy
| | - Paolo Garagnani
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, via S. Giacomo 12, 40126 Bologna, Italy; C.I.G. Interdepartmental Centre L. Galvani for Integrated Studies on Bioinformatics, Biophysics and Biocomplexity, University of Bologna, via S. Giacomo 12, 40126, Bologna, Italy
| | - Giuseppe Passarino
- Department of Biology, Ecology and Earth Science, University of Calabria, 87036 Rende, Italy
| | - Donata Luiselli
- Department of Biological, Geological and Environmental Sciences, Laboratory of Anthropology, University of Bologna, Via Selmi 3, 40126 Bologna, Italy; Department of Biological, Geological and Environmental Sciences, Centre for Genome Biology, University of Bologna, Via Selmi 3, 40126 Bologna, Italy
| | - Miriam Capri
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, via S. Giacomo 12, 40126 Bologna, Italy; C.I.G. Interdepartmental Centre L. Galvani for Integrated Studies on Bioinformatics, Biophysics and Biocomplexity, University of Bologna, via S. Giacomo 12, 40126, Bologna, Italy
| | - Claudio Franceschi
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, via S. Giacomo 12, 40126 Bologna, Italy; C.I.G. Interdepartmental Centre L. Galvani for Integrated Studies on Bioinformatics, Biophysics and Biocomplexity, University of Bologna, via S. Giacomo 12, 40126, Bologna, Italy; IRCCS, Institute of Neurological Sciences of Bologna, Ospedale Bellaria, Via Altura 3, 40139 Bologna, Italy; CNR, Institute of Organic Synthesis and Photoreactivity (ISOF), Via P. Gobetti 101, 40129 Bologna, Italy
| | - Stefano Salvioli
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, via S. Giacomo 12, 40126 Bologna, Italy; C.I.G. Interdepartmental Centre L. Galvani for Integrated Studies on Bioinformatics, Biophysics and Biocomplexity, University of Bologna, via S. Giacomo 12, 40126, Bologna, Italy
| |
Collapse
|
1531
|
Cook K, Benitez A, Fu C, Tintle N. Evaluating the impact of genotype errors on rare variant tests of association. Front Genet 2014; 5:62. [PMID: 24744770 PMCID: PMC3978329 DOI: 10.3389/fgene.2014.00062] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2013] [Accepted: 03/11/2014] [Indexed: 01/23/2023] Open
Abstract
The new class of rare variant tests has usually been evaluated assuming perfect genotype information. In reality, rare variant genotypes may be incorrect, and so rare variant tests should be robust to imperfect data. Errors and uncertainty in SNP genotyping are already known to dramatically impact statistical power for single marker tests on common variants and, in some cases, inflate the type I error rate. Recent results show that uncertainty in genotype calls derived from sequencing reads are dependent on several factors, including read depth, calling algorithm, number of alleles present in the sample, and the frequency at which an allele segregates in the population. We have recently proposed a general framework for the evaluation and investigation of rare variant tests of association, classifying most rare variant tests into one of two broad categories (length or joint tests). We use this framework to relate factors affecting genotype uncertainty to the power and type I error rate of rare variant tests. We find that non-differential genotype errors (an error process that occurs independent of phenotype) decrease power, with larger decreases for extremely rare variants, and for the common homozygote to heterozygote error. Differential genotype errors (an error process that is associated with phenotype status), lead to inflated type I error rates which are more likely to occur at sites with more common homozygote to heterozygote errors than vice versa. Finally, our work suggests that certain rare variant tests and study designs may be more robust to the inclusion of genotype errors. Further work is needed to directly integrate genotype calling algorithm decisions, study costs and test statistic choices to provide comprehensive design and analysis advice which appropriately accounts for the impact of genotype errors.
Collapse
Affiliation(s)
- Kaitlyn Cook
- Department of Mathematics, Carleton College Northfield, MN, USA
| | - Alejandra Benitez
- Department of Applied Mathematics, Brown University Providence, RI, USA
| | - Casey Fu
- Department of Mathematics, Massachusetts Institute of Technology Boston, MA, USA
| | - Nathan Tintle
- Department of Mathematics, Statistics and Computer Science, Dordt College Sioux Center, IA, USA
| |
Collapse
|
1532
|
Li M, He Z, Zhang M, Zhan X, Wei C, Elston RC, Lu Q. A generalized genetic random field method for the genetic association analysis of sequencing data. Genet Epidemiol 2014; 38:242-53. [PMID: 24482034 PMCID: PMC5241166 DOI: 10.1002/gepi.21790] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2013] [Revised: 11/28/2013] [Accepted: 12/21/2013] [Indexed: 01/23/2023]
Abstract
With the advance of high-throughput sequencing technologies, it has become feasible to investigate the influence of the entire spectrum of sequencing variations on complex human diseases. Although association studies utilizing the new sequencing technologies hold great promise to unravel novel genetic variants, especially rare genetic variants that contribute to human diseases, the statistical analysis of high-dimensional sequencing data remains a challenge. Advanced analytical methods are in great need to facilitate high-dimensional sequencing data analyses. In this article, we propose a generalized genetic random field (GGRF) method for association analyses of sequencing data. Like other similarity-based methods (e.g., SIMreg and SKAT), the new method has the advantages of avoiding the need to specify thresholds for rare variants and allowing for testing multiple variants acting in different directions and magnitude of effects. The method is built on the generalized estimating equation framework and thus accommodates a variety of disease phenotypes (e.g., quantitative and binary phenotypes). Moreover, it has a nice asymptotic property, and can be applied to small-scale sequencing data without need for small-sample adjustment. Through simulations, we demonstrate that the proposed GGRF attains an improved or comparable power over a commonly used method, SKAT, under various disease scenarios, especially when rare variants play a significant role in disease etiology. We further illustrate GGRF with an application to a real dataset from the Dallas Heart Study. By using GGRF, we were able to detect the association of two candidate genes, ANGPTL3 and ANGPTL4, with serum triglyceride.
Collapse
Affiliation(s)
- Ming Li
- Division of Biostatistics, Department of Pediatrics, University of Arkansas for Medical Sciences, Little Rock, Arkansas, United States of America
| | - Zihuai He
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Min Zhang
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Xiaowei Zhan
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Changshuai Wei
- Department of Epidemiology and Biostatics, Michigan State University, East Lansing, Michigan, United States of America
| | - Robert C. Elston
- Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, Ohio, United States of America
| | - Qing Lu
- Department of Epidemiology and Biostatics, Michigan State University, East Lansing, Michigan, United States of America
| |
Collapse
|
1533
|
Zeng P, Zhao Y, Zhang L, Huang S, Chen F. Rare variants detection with kernel machine learning based on likelihood ratio test. PLoS One 2014; 9:e93355. [PMID: 24675868 PMCID: PMC3968153 DOI: 10.1371/journal.pone.0093355] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2013] [Accepted: 03/03/2014] [Indexed: 11/18/2022] Open
Abstract
This paper mainly utilizes likelihood-based tests to detect rare variants associated with a continuous phenotype under the framework of kernel machine learning. Both the likelihood ratio test (LRT) and the restricted likelihood ratio test (ReLRT) are investigated. The relationship between the kernel machine learning and the mixed effects model is discussed. By using the eigenvalue representation of LRT and ReLRT, their exact finite sample distributions are obtained in a simulation manner. Numerical studies are performed to evaluate the performance of the proposed approaches under the contexts of standard mixed effects model and kernel machine learning. The results have shown that the LRT and ReLRT can control the type I error correctly at the given α level. The LRT and ReLRT consistently outperform the SKAT, regardless of the sample size and the proportion of the negative causal rare variants, and suffer from fewer power reductions compared to the SKAT when both positive and negative effects of rare variants are present. The LRT and ReLRT performed under the context of kernel machine learning have slightly higher powers than those performed under the context of standard mixed effects model. We use the Genetic Analysis Workshop 17 exome sequencing SNP data as an illustrative example. Some interesting results are observed from the analysis. Finally, we give the discussion.
Collapse
Affiliation(s)
- Ping Zeng
- Department of Epidemiology and Biostatistics, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu, China
- Department of Epidemiology and Biostatistics, School of Public Health, Xuzhou Medical College, Xuzhou, Jiangsu, China
| | - Yang Zhao
- Department of Epidemiology and Biostatistics, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu, China
| | - Liwei Zhang
- Department of Epidemiology and Biostatistics, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu, China
| | - Shuiping Huang
- Department of Epidemiology and Biostatistics, School of Public Health, Xuzhou Medical College, Xuzhou, Jiangsu, China
| | - Feng Chen
- Department of Epidemiology and Biostatistics, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu, China
- * E-mail:
| |
Collapse
|
1534
|
Test of rare variant association based on affected sib-pairs. Eur J Hum Genet 2014; 23:229-37. [PMID: 24667785 DOI: 10.1038/ejhg.2014.43] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2013] [Revised: 11/06/2013] [Accepted: 12/30/2013] [Indexed: 11/08/2022] Open
Abstract
With the development of sequencing techniques, there is increasing interest to detect associations between rare variants and complex traits. Quite a few statistical methods to detect associations between rare variants and complex traits have been developed for unrelated individuals. Statistical methods for detecting rare variant associations under family-based designs have not received as much attention as methods for unrelated individuals. Recent studies show that rare disease variants will be enriched in family data and thus family-based designs may improve power to detect rare variant associations. In this article, we propose a novel test to test association between the optimally weighted combination of variants and trait of interests for affected sib-pairs. The optimal weights are analytically derived and can be calculated from sampled genotypes and phenotypes. Based on the optimal weights, the proposed method is robust to the directions of the effects of causal variants and is less affected by neutral variants than existing methods are. Our simulation results show that, in all the cases, the proposed method is substantially more powerful than existing methods based on unrelated individuals and existing methods based on affected sib-pairs.
Collapse
|
1535
|
Gao F, Ballantyne C, Ma L, Virani SS, Keinan A, Brautbar A. Rare LPL gene variants attenuate triglyceride reduction and HDL cholesterol increase in response to fenofibric acid therapy in individuals with mixed dyslipidemia. Atherosclerosis 2014; 234:249-53. [PMID: 24704626 DOI: 10.1016/j.atherosclerosis.2014.03.010] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/12/2013] [Revised: 03/10/2014] [Accepted: 03/11/2014] [Indexed: 12/25/2022]
Abstract
OBJECTIVE Individuals with mixed dyslipidemia have elevated triglycerides (TG), low high-density lipoprotein cholesterol (HDL-C), and increased risk for coronary disease. Fibrate therapy is commonly used to lower TG and increase HDL-C. Common genetic variants are known to affect the response to fibrate therapy. We sought to identify rare genetic variants (frequency ≤ 1%) in genes involved in TG and HDL-C metabolism that affect the response to fenofibric acid (FA) therapy. METHODS Four genes with a major role in HDL-C and TG metabolism APOA1, APOC2, APOC-III and LPL were sequenced in 2385 participants with mixed dyslipidemia in a randomized, double-blind, active-controlled study comparing therapy with FA alone, in combination with statins, or statin alone. Rare variants collapsing or SKAT methods were used for the analysis. RESULTS Synonymous rare variants in the LPL gene were significantly associated with absolute HDL-C change (P = 9 × 10(-4)) and TG percent change (P = 6.76 × 10(-4)) in those treated with FA only. Participants with these rare variants had a 2 mg/dL increase in HDL-C and 39 mg/dL decrease in TG as compared to 6.2 mg/dL increase in HDL-C and 100 mg/dL decrease in TG in those without these variants. Rare variants in the APOC-III gene were associated with a modest 3 mg/dL less reduction in APOB (P = 8.72 × 10(-4)) in those receiving FA and statin. CONCLUSION In individuals with mixed dyslipidemia rare synonymous variants within LPL gene were associated with attenuated response to FA therapy while APOCIII rare variants were associated with a modest effect on APOB response to FA-statin therapy. These results should be replicated in a similar clinical trial for further confirmation.
Collapse
Affiliation(s)
- Feng Gao
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, NY, USA
| | - Christie Ballantyne
- Section of Cardiovascular Research, Baylor College of Medicine, Houston, TX, USA; Center for Cardiovascular Disease Prevention, Methodist DeBakey Heart and Vascular Center, Houston, TX, USA
| | - Li Ma
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, NY, USA; Department of Animal and Avian Sciences, University of Maryland, College Park, MD, USA
| | - Salim S Virani
- Section of Cardiovascular Research, Baylor College of Medicine, Houston, TX, USA; Center for Cardiovascular Disease Prevention, Methodist DeBakey Heart and Vascular Center, Houston, TX, USA; Michael E. DeBakey Veterans Affairs Medical Center Health Services Research and Development Center for Innovations, Houston, TX, USA
| | - Alon Keinan
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, NY, USA.
| | - Ariel Brautbar
- Section of Cardiovascular Research, Baylor College of Medicine, Houston, TX, USA; Center for Cardiovascular Disease Prevention, Methodist DeBakey Heart and Vascular Center, Houston, TX, USA; Center for Human Genetics, Marshfield Clinic Research Foundation, Marshfield, WI, USA.
| |
Collapse
|
1536
|
Ortega VE, Meyers DA. Pharmacogenetics: implications of race and ethnicity on defining genetic profiles for personalized medicine. J Allergy Clin Immunol 2014; 133:16-26. [PMID: 24369795 DOI: 10.1016/j.jaci.2013.10.040] [Citation(s) in RCA: 139] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2013] [Revised: 10/22/2013] [Accepted: 10/23/2013] [Indexed: 01/06/2023]
Abstract
Pharmacogenetics is being used to develop personalized therapies specific to subjects from different ethnic or racial groups. To date, pharmacogenetic studies have been primarily performed in trial cohorts consisting of non-Hispanic white subjects of European descent. A "bottleneck" or collapse of genetic diversity associated with the first human colonization of Europe during the Upper Paleolithic period, followed by the recent mixing of African, European, and Native American ancestries, has resulted in different ethnic groups with varying degrees of genetic diversity. Differences in genetic ancestry might introduce genetic variation, which has the potential to alter the therapeutic efficacy of commonly used asthma therapies, such as β2-adrenergic receptor agonists (β-agonists). Pharmacogenetic studies of admixed ethnic groups have been limited to small candidate gene association studies, of which the best example is the gene coding for the receptor target of β-agonist therapy, the β2-adrenergic receptor (ADRB2). Large consortium-based sequencing studies are using next-generation whole-genome sequencing to provide a diverse genome map of different admixed populations, which can be used for future pharmacogenetic studies. These studies will include candidate gene studies, genome-wide association studies, and whole-genome admixture-based approaches that account for ancestral genetic structure, complex haplotypes, gene-gene interactions, and rare variants to detect and replicate novel pharmacogenetic loci.
Collapse
Affiliation(s)
- Victor E Ortega
- Center for Genomics and Personalized Medicine, Wake Forest School of Medicine, Winston-Salem, NC
| | - Deborah A Meyers
- Center for Genomics and Personalized Medicine, Wake Forest School of Medicine, Winston-Salem, NC.
| |
Collapse
|
1537
|
He Z, Zhang M, Zhan X, Lu Q. Modeling and testing for joint association using a genetic random field model. Biometrics 2014; 70:471-9. [PMID: 24628067 DOI: 10.1111/biom.12160] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2013] [Revised: 01/01/2014] [Accepted: 02/01/2014] [Indexed: 12/30/2022]
Abstract
Substantial progress has been made in identifying single genetic variants predisposing to common complex diseases. Nonetheless, the genetic etiology of human diseases remains largely unknown. Human complex diseases are likely influenced by the joint effect of a large number of genetic variants instead of a single variant. The joint analysis of multiple genetic variants considering linkage disequilibrium (LD) and potential interactions can further enhance the discovery process, leading to the identification of new disease-susceptibility genetic variants. Motivated by development in spatial statistics, we propose a new statistical model based on the random field theory, referred to as a genetic random field model (GenRF), for joint association analysis with the consideration of possible gene-gene interactions and LD. Using a pseudo-likelihood approach, a GenRF test for the joint association of multiple genetic variants is developed, which has the following advantages: (1) accommodating complex interactions for improved performance; (2) natural dimension reduction; (3) boosting power in the presence of LD; and (4) computationally efficient. Simulation studies are conducted under various scenarios. The development has been focused on quantitative traits and robustness of the GenRF test to other traits, for example, binary traits, is also discussed. Compared with a commonly adopted kernel machine approach, SKAT, as well as other more standard methods, GenRF shows overall comparable performance and better performance in the presence of complex interactions. The method is further illustrated by an application to the Dallas Heart Study.
Collapse
Affiliation(s)
- Zihuai He
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, U.S.A
| | - Min Zhang
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, U.S.A
| | - Xiaowei Zhan
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, U.S.A
| | - Qing Lu
- Department of Epidemiology and Biostatistics, Michigan State University, East Lansing, Michigan, U.S.A
| |
Collapse
|
1538
|
|
1539
|
Lee IH, Lee K, Hsing M, Choe Y, Park JH, Kim SH, Bohn JM, Neu MB, Hwang KB, Green RC, Kohane IS, Kong SW. Prioritizing disease-linked variants, genes, and pathways with an interactive whole-genome analysis pipeline. Hum Mutat 2014; 35:537-47. [PMID: 24478219 DOI: 10.1002/humu.22520] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2013] [Accepted: 01/23/2014] [Indexed: 01/02/2023]
Abstract
Whole-genome sequencing (WGS) studies are uncovering disease-associated variants in both rare and nonrare diseases. Utilizing the next-generation sequencing for WGS requires a series of computational methods for alignment, variant detection, and annotation, and the accuracy and reproducibility of annotation results are essential for clinical implementation. However, annotating WGS with up to date genomic information is still challenging for biomedical researchers. Here, we present one of the fastest and highly scalable annotation, filtering, and analysis pipeline-gNOME-to prioritize phenotype-associated variants while minimizing false-positive findings. Intuitive graphical user interface of gNOME facilitates the selection of phenotype-associated variants, and the result summaries are provided at variant, gene, and genome levels. Moreover, the enrichment results of specific variants, genes, and gene sets between two groups or compared with population scale WGS datasets that is already integrated in the pipeline can help the interpretation. We found a small number of discordant results between annotation software tools in part due to different reporting strategies for the variants with complex impacts. Using two published whole-exome datasets of uveal melanoma and bladder cancer, we demonstrated gNOME's accuracy of variant annotation and the enrichment of loss-of-function variants in known cancer pathways. gNOME Web server and source codes are freely available to the academic community (http://gnome.tchlab.org).
Collapse
Affiliation(s)
- In-Hee Lee
- Children's Hospital Informatics Program at the Harvard-MIT Division of Health Sciences and Technology, Department of Medicine, Boston Children's Hospital, Boston, Massachusetts, 02115
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
1540
|
Lin H, Sinner MF, Brody JA, Arking DE, Lunetta KL, Rienstra M, Lubitz SA, Magnani JW, Sotoodehnia N, McKnight B, McManus DD, Boerwinkle E, Psaty BM, Rotter JI, Bis JC, Gibbs RA, Muzny D, Kovar CL, Morrison AC, Gupta M, Folsom AR, Kääb S, Heckbert SR, Alonso A, Ellinor PT, Benjamin EJ. Targeted sequencing in candidate genes for atrial fibrillation: the Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Targeted Sequencing Study. Heart Rhythm 2014; 11:452-7. [PMID: 24239840 PMCID: PMC3943920 DOI: 10.1016/j.hrthm.2013.11.012] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/30/2013] [Indexed: 12/22/2022]
Abstract
BACKGROUND Genome-wide association studies (GWAS) have identified common genetic variants that predispose to atrial fibrillation (AF). It is unclear whether rare and low-frequency variants in genes implicated by such GWAS confer additional risk of AF. OBJECTIVE To study the association of genetic variants with AF at GWAS top loci. METHODS In the Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Targeted Sequencing Study, we selected and sequenced 77 target gene regions from GWAS loci of complex diseases or traits, including 4 genes hypothesized to be related to AF (PRRX1, CAV1, CAV2, and ZFHX3). Sequencing was performed in participants with (n = 948) and without (n = 3330) AF from the Atherosclerosis Risk in Communities Study, the Cardiovascular Health Study, the Framingham Heart Study, and the Massachusetts General Hospital. RESULTS One common variant (rs11265611; P = 1.70 × 10(-6)) intronic to IL6R (interleukin-6 receptor gene) was significantly associated with AF after Bonferroni correction (odds ratio 0.70; 95% confidence interval 0.58-0.85). The variant was not genotyped or imputed by prior GWAS, but it is in linkage disequilibrium (r(2) = .69) with the single-nucleotide polymorphism, with the strongest association with AF so far at this locus (rs4845625). In the rare variant joint analysis, damaging variants within the PRRX1 region showed significant association with AF after Bonferroni correction (P = .01). CONCLUSIONS We identified 1 common single-nucleotide polymorphism and 1 gene region that were significantly associated with AF. Future sequencing efforts with larger sample sizes and more comprehensive genome coverage are anticipated to identify additional AF-related variants.
Collapse
Affiliation(s)
- Honghuang Lin
- Department of Medicine, Boston University School of Medicine, Boston, Massachusetts; The NHLBI's Framingham Heart Study, Framingham, Massachusetts.
| | - Moritz F Sinner
- The NHLBI's Framingham Heart Study, Framingham, Massachusetts; Cardiovascular Research Center, Massachusetts General Hospital, Charlestown, Massachusetts; Department of Medicine I, University Hospital Munich, Campus Grosshadern, Ludwig-Maximilians-University, Munich, Germany
| | - Jennifer A Brody
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, Washington
| | - Dan E Arking
- McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland
| | - Kathryn L Lunetta
- The NHLBI's Framingham Heart Study, Framingham, Massachusetts; Department of Biostatistics, Boston University School of Public Health, Boston, Massachusetts
| | - Michiel Rienstra
- Cardiovascular Research Center, Massachusetts General Hospital, Charlestown, Massachusetts; Department of Cardiology, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
| | - Steven A Lubitz
- Cardiovascular Research Center, Massachusetts General Hospital, Charlestown, Massachusetts; Cardiac Arrhythmia Service, Massachusetts General Hospital, Boston, Massachusetts
| | - Jared W Magnani
- Department of Medicine, Boston University School of Medicine, Boston, Massachusetts; The NHLBI's Framingham Heart Study, Framingham, Massachusetts
| | - Nona Sotoodehnia
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, Washington; Division of Cardiology, University of Washington, Seattle, Washington
| | - Barbara McKnight
- Department of Biostatistics, University of Washington, Seattle, Washington
| | - David D McManus
- Department of Medicine, University of Massachusetts Medical School, Worcester, Massachusetts
| | - Eric Boerwinkle
- Human Genetics Center, University of Texas Health Science Center at Houston, Houston, Texas
| | - Bruce M Psaty
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, Washington; Group Health Research Institute, Group Health Cooperative, Seattle, Washington; Department of Epidemiology, University of Washington, Seattle, Washington; Department of Health Services, University of Washington, Seattle, Washington
| | - Jerome I Rotter
- Institute for Translational Genomics and Population Sciences, Los Angeles BioMedical Research Institute at Harbor-UCLA Medical Center, Torrance, California
| | - Joshua C Bis
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, Washington
| | - Richard A Gibbs
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas
| | - Donna Muzny
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas
| | - Christie L Kovar
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas
| | - Alanna C Morrison
- Human Genetics Center, University of Texas Health Science Center at Houston, Houston, Texas
| | - Mayetri Gupta
- Department of Biostatistics, Boston University School of Public Health, Boston, Massachusetts
| | - Aaron R Folsom
- Division of Epidemiology and Community Health, School of Public Health, University of Minnesota, Minneapolis, Minnesota
| | - Stefan Kääb
- Department of Medicine I, University Hospital Munich, Campus Grosshadern, Ludwig-Maximilians-University, Munich, Germany; Munich Heart Alliance, Munich, Germany
| | - Susan R Heckbert
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, Washington; Group Health Research Institute, Group Health Cooperative, Seattle, Washington; Department of Epidemiology, University of Washington, Seattle, Washington
| | - Alvaro Alonso
- Division of Epidemiology and Community Health, School of Public Health, University of Minnesota, Minneapolis, Minnesota
| | - Patrick T Ellinor
- Cardiovascular Research Center, Massachusetts General Hospital, Charlestown, Massachusetts; Cardiac Arrhythmia Service, Massachusetts General Hospital, Boston, Massachusetts; Center for Human Genetic Research, Massachusetts General Hospital, Boston, Massachusetts; Harvard Medical School, Boston, Massachusetts
| | - Emelia J Benjamin
- Department of Medicine, Boston University School of Medicine, Boston, Massachusetts; The NHLBI's Framingham Heart Study, Framingham, Massachusetts
| |
Collapse
|
1541
|
Preston MD, Dudbridge F. Utilising family-based designs for detecting rare variant disease associations. Ann Hum Genet 2014; 78:129-40. [PMID: 24571231 PMCID: PMC4292528 DOI: 10.1111/ahg.12051] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2013] [Accepted: 11/17/2013] [Indexed: 01/04/2023]
Abstract
Rare genetic variants are thought to be important components in the causality of many diseases but discovering these associations is challenging. We demonstrate how best to use family-based designs to improve the power to detect rare variant disease associations. We show that using genetic data from enriched families (those pedigrees with greater than one affected member) increases the power and sensitivity of existing case-control rare variant tests. However, we show that transmission- (or within-family-) based tests do not benefit from this enrichment. This means that, in studies where a limited amount of genotyping is available, choosing a single case from each of many pedigrees has greater power than selecting multiple cases from fewer pedigrees. Finally, we show how a pseudo-case-control design allows a greater range of statistical tests to be applied to family data.
Collapse
Affiliation(s)
- Mark D Preston
- London School of Hygiene and Tropical MedicineKeppel Street, London, WC1E 7HT, UK
| | - Frank Dudbridge
- London School of Hygiene and Tropical MedicineKeppel Street, London, WC1E 7HT, UK
| |
Collapse
|
1542
|
Bonomo JA, Palmer ND, Hicks PJ, Lea JP, Okusa MD, Langefeld CD, Bowden DW, Freedman BI. Complement factor H gene associations with end-stage kidney disease in African Americans. Nephrol Dial Transplant 2014; 29:1409-14. [PMID: 24586071 DOI: 10.1093/ndt/gfu036] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND Mutations in the complement factor H gene (CFH) region associate with renal-limited mesangial proliferative forms of glomerulonephritis including IgA nephropathy (IgAN), dense deposit disease (DDD) and C3 glomerulonephritis (C3GN). Lack of kidney biopsies could lead to under diagnosis of CFH-associated end-stage kidney disease (ESKD) in African Americans (AAs), with incorrect attribution to other causes. A prior genome-wide association study in AAs with non-diabetic ESKD implicated an intronic CFH single nucleotide polymorphism (SNP). METHODS Thirteen CFH SNPs (8 exonic, 2 synonymous, 2 3'UTR, and the previously associated intronic variant rs379489) were tested for association with common forms of non-diabetic and type 2 diabetes-associated (T2D) ESKD in 3770 AAs (1705 with non-diabetic ESKD, 1305 with T2D-ESKD, 760 controls). Most cases lacked kidney biopsies; those with known IgAN, DDD or C3GN were excluded. RESULTS Adjusting for age, gender, ancestry and apolipoprotein L1 gene risk variants, single SNP analyses detected 6 CFH SNPs (5 exonic and the intronic variant) as significantly associated with non-diabetic ESKD (P = 0.002-0.01), three of these SNPs were also associated with T2D-ESKD. Weighted CFH locus-wide Sequence Kernel Association Testing (SKAT) in non-diabetic ESKD (P = 0.00053) and T2D-ESKD (P = 0.047) confirmed significant evidence of association. CONCLUSIONS CFH was associated with commonly reported etiologies of ESKD in the AA population. These results suggest that a subset of cases with ESKD clinically ascribed to the effects of hypertension or glomerulosclerosis actually have CFH-related forms of mesangial proliferative glomerulonephritis. Genetic testing may prove useful to identify the causes of renal-limited kidney disease in patients with ESKD who lack renal biopsies.
Collapse
Affiliation(s)
- Jason A Bonomo
- Department of Molecular Medicine and Translational Science, Wake Forest School of Medicine, Winston-Salem, NC, USA Center for Genomics and Personalized Medicine Research, Wake Forest School of Medicine, Winston-Salem, NC, USA
| | - Nicholette D Palmer
- Center for Genomics and Personalized Medicine Research, Wake Forest School of Medicine, Winston-Salem, NC, USA Department of Biochemistry, Wake Forest School of Medicine, Winston-Salem, NC, USA
| | - Pamela J Hicks
- Center for Genomics and Personalized Medicine Research, Wake Forest School of Medicine, Winston-Salem, NC, USA
| | - Janice P Lea
- Division of Renal Medicine, Department of Medicine, Emory School of Medicine, Atlanta, GA, USA
| | - Mark D Okusa
- Division of Nephrology, Department of Medicine, University of Virginia School of Medicine, Charlottesville, VA, USA
| | - Carl D Langefeld
- Center for Genomics and Personalized Medicine Research, Wake Forest School of Medicine, Winston-Salem, NC, USA Department of Biostatistical Sciences, Wake Forest School of Medicine, Winston-Salem, NC, USA
| | - Donald W Bowden
- Center for Genomics and Personalized Medicine Research, Wake Forest School of Medicine, Winston-Salem, NC, USA Department of Biochemistry, Wake Forest School of Medicine, Winston-Salem, NC, USA
| | - Barry I Freedman
- Center for Genomics and Personalized Medicine Research, Wake Forest School of Medicine, Winston-Salem, NC, USA Section on Nephrology, Department of Internal Medicine, Wake Forest School of Medicine, Winston-Salem, NC, USA
| |
Collapse
|
1543
|
Weeke P, Mosley JD, Hanna D, Delaney JT, Shaffer C, Wells QS, Van Driest S, Karnes JH, Ingram C, Guo Y, Shyr Y, Norris K, Kannankeril PJ, Ramirez AH, Smith JD, Mardis ER, Nickerson D, George AL, Roden DM. Exome sequencing implicates an increased burden of rare potassium channel variants in the risk of drug-induced long QT interval syndrome. J Am Coll Cardiol 2014; 63:1430-7. [PMID: 24561134 DOI: 10.1016/j.jacc.2014.01.031] [Citation(s) in RCA: 60] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/09/2013] [Revised: 12/09/2013] [Accepted: 01/07/2014] [Indexed: 10/25/2022]
Abstract
OBJECTIVES The aim of this study was to test the hypothesis that rare variants are associated with drug-induced long QT interval syndrome (diLQTS) and torsades de pointes. BACKGROUND diLQTS is associated with the potentially fatal arrhythmia torsades de pointes. The contribution of rare genetic variants to the underlying genetic framework predisposing to diLQTS has not been systematically examined. METHODS We performed whole-exome sequencing on 65 diLQTS patients and 148 drug-exposed control subjects of European descent. We used rare variant analyses (variable threshold and sequence kernel association test) and gene-set analyses to identify genes enriched with rare amino acid coding (AAC) variants associated with diLQTS. Significant associations were reanalyzed by comparing diLQTS patients with 515 ethnically matched control subjects from the National Heart, Lung, and Blood Grand Opportunity Exome Sequencing Project. RESULTS Rare variants in 7 genes were enriched in the diLQTS patients according to the sequence kernel association test or variable threshold compared with drug-exposed controls (p < 0.001). Of these, we replicated the diLQTS associations for KCNE1 and ACN9 using 515 Exome Sequencing Project control subjects (p < 0.05). A total of 37% of the diLQTS patients also had 1 or more rare AAC variants compared with 21% of control subjects (p = 0.009), in a pre-defined set of 7 congenital long QT interval syndrome (cLQTS) genes encoding potassium channels or channel modulators (KCNE1, KCNE2, KCNH2, KCNJ2, KCNJ5, KCNQ1, AKAP9). CONCLUSIONS By combining whole-exome sequencing with aggregated rare variant analyses, we implicate rare variants in KCNE1 and ACN9 as risk factors for diLQTS. Moreover, diLQTS patients were more burdened by rare AAC variants in cLQTS genes encoding potassium channel modulators, supporting the idea that multiple rare variants, notably across cLQTS genes, predispose to diLQTS.
Collapse
Affiliation(s)
- Peter Weeke
- Department of Medicine, Vanderbilt University, Nashville, Tennessee; Department of Cardiology, Copenhagen University Hospital, Gentofte, Denmark
| | | | - David Hanna
- Department of Genome Sciences, University of Washington, Seattle, Washington
| | | | | | - Quinn S Wells
- Department of Medicine, Vanderbilt University, Nashville, Tennessee
| | - Sara Van Driest
- Department of Pediatrics, Vanderbilt University School of Medicine, Nashville, Tennessee
| | - Jason H Karnes
- Department of Medicine, Vanderbilt University, Nashville, Tennessee
| | - Christie Ingram
- Department of Medicine, Vanderbilt University, Nashville, Tennessee
| | - Yan Guo
- Vanderbilt Technologies for Advanced Genomics Analysis and Research Design, Nashville, Tennessee
| | - Yu Shyr
- Vanderbilt Technologies for Advanced Genomics Analysis and Research Design, Nashville, Tennessee
| | - Kris Norris
- Department of Medicine, Vanderbilt University, Nashville, Tennessee
| | - Prince J Kannankeril
- Department of Pediatrics, Vanderbilt University School of Medicine, Nashville, Tennessee
| | - Andrea H Ramirez
- Department of Medicine, Vanderbilt University, Nashville, Tennessee
| | - Joshua D Smith
- Department of Genome Sciences, University of Washington, Seattle, Washington
| | - Elaine R Mardis
- The Genome Institute, Washington University, St. Louis, Missouri
| | - Deborah Nickerson
- Department of Genome Sciences, University of Washington, Seattle, Washington
| | - Alfred L George
- Departments of Medicine and Pharmacology, Vanderbilt University, Nashville, Tennessee
| | - Dan M Roden
- Departments of Medicine and Pharmacology, Vanderbilt University, Nashville, Tennessee.
| |
Collapse
|
1544
|
Yan S, Li Y. BETASEQ: a powerful novel method to control type-I error inflation in partially sequenced data for rare variant association testing. ACTA ACUST UNITED AC 2014; 30:480-7. [PMID: 24336643 DOI: 10.1093/bioinformatics/btt719] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
SUMMARY Despite its great capability to detect rare variant associations, next-generation sequencing is still prohibitively expensive when applied to large samples. In case-control studies, it is thus appealing to sequence only a subset of cases to discover variants and genotype the identified variants in controls and the remaining cases under the reasonable assumption that causal variants are usually enriched among cases. However, this approach leads to inflated type-I error if analyzed naively for rare variant association. Several methods have been proposed in recent literature to control type-I error at the cost of either excluding some sequenced cases or correcting the genotypes of discovered rare variants. All of these approaches thus suffer from certain extent of information loss and thus are underpowered. We propose a novel method (BETASEQ), which corrects inflation of type-I error by supplementing pseudo-variants while keeps the original sequence and genotype data intact. Extensive simulations and real data analysis demonstrate that, in most practical situations, BETASEQ leads to higher testing powers than existing approaches with guaranteed (controlled or conservative) type-I error. AVAILABILITY AND IMPLEMENTATION BETASEQ and associated R files, including documentation, examples, are available at http://www.unc.edu/~yunmli/betaseq
Collapse
Affiliation(s)
- Song Yan
- Department of Biostatistics, University of North Carolina, 3101 McGavran-Greenberg Hall, Chapel Hill, NC 27599, USA, Department of Genetics, University of North Carolina, Chapel Hill, NC 27599, USA and Department of Computer Science, University of North Carolina, Chapel Hill, NC 27599, USA
| | | |
Collapse
|
1545
|
Xu C, Tachmazidou I, Walter K, Ciampi A, Zeggini E, Greenwood CMT. Estimating genome-wide significance for whole-genome sequencing studies. Genet Epidemiol 2014; 38:281-90. [PMID: 24676807 PMCID: PMC4489336 DOI: 10.1002/gepi.21797] [Citation(s) in RCA: 62] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2013] [Revised: 11/20/2013] [Accepted: 01/07/2014] [Indexed: 01/20/2023]
Abstract
Although a standard genome-wide significance level has been accepted for the testing of association between common genetic variants and disease, the era of whole-genome sequencing (WGS) requires a new threshold. The allele frequency spectrum of sequence-identified variants is very different from common variants, and the identified rare genetic variation is usually jointly analyzed in a series of genomic windows or regions. In nearby or overlapping windows, these test statistics will be correlated, and the degree of correlation is likely to depend on the choice of window size, overlap, and the test statistic. Furthermore, multiple analyses may be performed using different windows or test statistics. Here we propose an empirical approach for estimating genome-wide significance thresholds for data arising from WGS studies, and we demonstrate that the empirical threshold can be efficiently estimated by extrapolating from calculations performed on a small genomic region. Because analysis of WGS may need to be repeated with different choices of test statistics or windows, this prediction approach makes it computationally feasible to estimate genome-wide significance thresholds for different analysis choices. Based on UK10K whole-genome sequence data, we derive genome-wide significance thresholds ranging between 2.5 × 10−8 and 8 × 10−8 for our analytic choices in window-based testing, and thresholds of 0.6 × 10−8–1.5 × 10−8 for a combined analytic strategy of testing common variants using single-SNP tests together with rare variants analyzed with our sliding-window test strategy.
Collapse
Affiliation(s)
- ChangJiang Xu
- Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montreal, Canada; Lady Davis Institute for Medical Research, Jewish General Hospital, Montreal, Canada
| | | | | | | | | | | | | |
Collapse
|
1546
|
Purcell SM, Moran JL, Fromer M, Ruderfer D, Solovieff N, Roussos P, O’Dushlaine C, Chambert K, Bergen SE, Kähler A, Duncan L, Stahl E, Genovese G, Fernández E, Collins MO, Komiyama NH, Choudhary JS, Magnusson PKE, Banks E, Shakir K, Garimella K, Fennell T, de Pristo M, Grant SG, Haggarty S, Gabriel S, Scolnick EM, Lander ES, Hultman C, Sullivan PF, McCarroll SA, Sklar P. A polygenic burden of rare disruptive mutations in schizophrenia. Nature 2014; 506:185-90. [PMID: 24463508 PMCID: PMC4136494 DOI: 10.1038/nature12975] [Citation(s) in RCA: 1023] [Impact Index Per Article: 93.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2013] [Accepted: 12/24/2013] [Indexed: 12/11/2022]
Abstract
Schizophrenia is a common disease with a complex aetiology, probably involving multiple and heterogeneous genetic factors. Here, by analysing the exome sequences of 2,536 schizophrenia cases and 2,543 controls, we demonstrate a polygenic burden primarily arising from rare (less than 1 in 10,000), disruptive mutations distributed across many genes. Particularly enriched gene sets include the voltage-gated calcium ion channel and the signalling complex formed by the activity-regulated cytoskeleton-associated scaffold protein (ARC) of the postsynaptic density, sets previously implicated by genome-wide association and copy-number variation studies. Similar to reports in autism, targets of the fragile X mental retardation protein (FMRP, product of FMR1) are enriched for case mutations. No individual gene-based test achieves significance after correction for multiple testing and we do not detect any alleles of moderately low frequency (approximately 0.5 to 1 per cent) and moderately large effect. Taken together, these data suggest that population-based exome sequencing can discover risk alleles and complements established gene-mapping paradigms in neuropsychiatric disease.
Collapse
Affiliation(s)
- Shaun M. Purcell
- Stanley Center for Psychiatric Research, Broad Institute of MIT & Harvard, Cambridge, MA, 02142, USA
- Division of Psychiatric Genomics in the Department of Psychiatry, and Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
- Analytic and Translational Genetics Unit, Psychiatric and Neurodevelopmental Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
- Medical & Population Genetics Program, Broad Institute of MIT & Harvard, Cambridge, MA, 02142, USA
| | - Jennifer L. Moran
- Stanley Center for Psychiatric Research, Broad Institute of MIT & Harvard, Cambridge, MA, 02142, USA
| | - Menachem Fromer
- Stanley Center for Psychiatric Research, Broad Institute of MIT & Harvard, Cambridge, MA, 02142, USA
- Division of Psychiatric Genomics in the Department of Psychiatry, and Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
- Analytic and Translational Genetics Unit, Psychiatric and Neurodevelopmental Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
| | - Douglas Ruderfer
- Division of Psychiatric Genomics in the Department of Psychiatry, and Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
| | - Nadia Solovieff
- Analytic and Translational Genetics Unit, Psychiatric and Neurodevelopmental Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
| | - Panos Roussos
- Division of Psychiatric Genomics in the Department of Psychiatry, and Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
| | - Colm O’Dushlaine
- Stanley Center for Psychiatric Research, Broad Institute of MIT & Harvard, Cambridge, MA, 02142, USA
| | - Kimberly Chambert
- Stanley Center for Psychiatric Research, Broad Institute of MIT & Harvard, Cambridge, MA, 02142, USA
| | - Sarah E. Bergen
- Stanley Center for Psychiatric Research, Broad Institute of MIT & Harvard, Cambridge, MA, 02142, USA
- Department of Medical Epidemiology and Biostatisics, Karolinska Institutet, Stockholm, SE-171 77, Sweden
| | - Anna Kähler
- Department of Medical Epidemiology and Biostatisics, Karolinska Institutet, Stockholm, SE-171 77, Sweden
| | - Laramie Duncan
- Analytic and Translational Genetics Unit, Psychiatric and Neurodevelopmental Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
| | - Eli Stahl
- Division of Psychiatric Genomics in the Department of Psychiatry, and Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
| | - Giulio Genovese
- Stanley Center for Psychiatric Research, Broad Institute of MIT & Harvard, Cambridge, MA, 02142, USA
| | - Esperanza Fernández
- Center for Human Genetics, KU Leuven, 3000 Leuven, Belgium; VIB Center for Biology of Disease, 3000 Leuven, Belgium
| | - Mark O Collins
- Proteomic Mass Spectrometry, The Wellcome Trust Sanger Institute, Cambridge, UK
| | - Noboru H. Komiyama
- Proteomic Mass Spectrometry, The Wellcome Trust Sanger Institute, Cambridge, UK
| | - Jyoti S. Choudhary
- Proteomic Mass Spectrometry, The Wellcome Trust Sanger Institute, Cambridge, UK
| | - Patrik K. E. Magnusson
- Department of Medical Epidemiology and Biostatisics, Karolinska Institutet, Stockholm, SE-171 77, Sweden
| | - Eric Banks
- Medical & Population Genetics Program, Broad Institute of MIT & Harvard, Cambridge, MA, 02142, USA
| | - Khalid Shakir
- Medical & Population Genetics Program, Broad Institute of MIT & Harvard, Cambridge, MA, 02142, USA
| | - Kiran Garimella
- Medical & Population Genetics Program, Broad Institute of MIT & Harvard, Cambridge, MA, 02142, USA
| | - Tim Fennell
- Medical & Population Genetics Program, Broad Institute of MIT & Harvard, Cambridge, MA, 02142, USA
| | - Mark de Pristo
- Medical & Population Genetics Program, Broad Institute of MIT & Harvard, Cambridge, MA, 02142, USA
| | - Seth G.N. Grant
- Genes to Cognition Programme, Centre for Clinical Brain Sciences and Centre for Neuroregeneration, The University of Edinburgh, Edinburgh, UK
| | - Stephen Haggarty
- Stanley Center for Psychiatric Research, Broad Institute of MIT & Harvard, Cambridge, MA, 02142, USA
- Analytic and Translational Genetics Unit, Psychiatric and Neurodevelopmental Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
- Department of Neurology, Harvard Medical School, Massachusetts General Hospital, Boston, MA, 02114, USA
| | - Stacey Gabriel
- Medical & Population Genetics Program, Broad Institute of MIT & Harvard, Cambridge, MA, 02142, USA
| | - Edward M. Scolnick
- Stanley Center for Psychiatric Research, Broad Institute of MIT & Harvard, Cambridge, MA, 02142, USA
| | - Eric S. Lander
- Medical & Population Genetics Program, Broad Institute of MIT & Harvard, Cambridge, MA, 02142, USA
| | - Christina Hultman
- Department of Medical Epidemiology and Biostatisics, Karolinska Institutet, Stockholm, SE-171 77, Sweden
| | - Patrick F. Sullivan
- Department of Genetics, University of North Carolina, CB# 7264, Chapel Hill, NC, 27599-7264, USA
| | - Steven A. McCarroll
- Stanley Center for Psychiatric Research, Broad Institute of MIT & Harvard, Cambridge, MA, 02142, USA
- Medical & Population Genetics Program, Broad Institute of MIT & Harvard, Cambridge, MA, 02142, USA
- Department of Genetics, Harvard Medical School, Boston, MA, 02115, USA
| | - Pamela Sklar
- Division of Psychiatric Genomics in the Department of Psychiatry, and Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
| |
Collapse
|
1547
|
Okada Y, Diogo D, Greenberg JD, Mouassess F, Achkar WAL, Fulton RS, Denny JC, Gupta N, Mirel D, Gabriel S, Li G, Kremer JM, Pappas DA, Carroll RJ, Eyler AE, Trynka G, Stahl EA, Cui J, Saxena R, Coenen MJH, Guchelaar HJ, Huizinga TWJ, Dieudé P, Mariette X, Barton A, Canhão H, Fonseca JE, de Vries N, Tak PP, Moreland LW, Bridges SL, Miceli-Richard C, Choi HK, Kamatani Y, Galan P, Lathrop M, Raj T, De Jager PL, Raychaudhuri S, Worthington J, Padyukov L, Klareskog L, Siminovitch KA, Gregersen PK, Mardis ER, Arayssi T, Kazkaz LA, Plenge RM. Integration of sequence data from a Consanguineous family with genetic data from an outbred population identifies PLB1 as a candidate rheumatoid arthritis risk gene. PLoS One 2014; 9:e87645. [PMID: 24520335 PMCID: PMC3919745 DOI: 10.1371/journal.pone.0087645] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2013] [Accepted: 12/19/2013] [Indexed: 12/30/2022] Open
Abstract
Integrating genetic data from families with highly penetrant forms of disease together with genetic data from outbred populations represents a promising strategy to uncover the complete frequency spectrum of risk alleles for complex traits such as rheumatoid arthritis (RA). Here, we demonstrate that rare, low-frequency and common alleles at one gene locus, phospholipase B1 (PLB1), might contribute to risk of RA in a 4-generation consanguineous pedigree (Middle Eastern ancestry) and also in unrelated individuals from the general population (European ancestry). Through identity-by-descent (IBD) mapping and whole-exome sequencing, we identified a non-synonymous c.2263G>C (p.G755R) mutation at the PLB1 gene on 2q23, which significantly co-segregated with RA in family members with a dominant mode of inheritance (P = 0.009). We further evaluated PLB1 variants and risk of RA using a GWAS meta-analysis of 8,875 RA cases and 29,367 controls of European ancestry. We identified significant contributions of two independent non-coding variants near PLB1 with risk of RA (rs116018341 [MAF = 0.042] and rs116541814 [MAF = 0.021], combined P = 3.2×10−6). Finally, we performed deep exon sequencing of PLB1 in 1,088 RA cases and 1,088 controls (European ancestry), and identified suggestive dispersion of rare protein-coding variant frequencies between cases and controls (P = 0.049 for C-alpha test and P = 0.055 for SKAT). Together, these data suggest that PLB1 is a candidate risk gene for RA. Future studies to characterize the full spectrum of genetic risk in the PLB1 genetic locus are warranted.
Collapse
Affiliation(s)
- Yukinori Okada
- Division of Rheumatology, Immunology, and Allergy, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- Division of Genetics, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts, United States of America
- Department of Human Genetics and Disease Diversity, Tokyo Medical and Dental University Graduate School of Medical and Dental Sciences, Tokyo, Japan
- Laboratory for Statistical Analysis, Center for Integrative Medical Sciences, RIKEN, Yokohama, Japan
| | - Dorothee Diogo
- Division of Rheumatology, Immunology, and Allergy, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- Division of Genetics, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts, United States of America
| | - Jeffrey D. Greenberg
- New York University Hospital for Joint Diseases, New York, New York, United States of America
| | - Faten Mouassess
- Molecular Biology and Biotechnology Department, Human Genetics Division, Damascus, Syria
| | - Walid A. L. Achkar
- Molecular Biology and Biotechnology Department, Human Genetics Division, Damascus, Syria
| | - Robert S. Fulton
- The Genome Institute, Washington University School of Medicine, St. Louis, Missouri, United States of America
| | - Joshua C. Denny
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
| | - Namrata Gupta
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts, United States of America
| | - Daniel Mirel
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts, United States of America
| | - Stacy Gabriel
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts, United States of America
| | - Gang Li
- Division of Rheumatology, Immunology, and Allergy, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Joel M. Kremer
- Department of Medicine, Albany Medical Center and The Center for Rheumatology, Albany, New York, United States of America
| | - Dimitrios A. Pappas
- Division of Rheumatology, Department of Medicine, New York, Presbyterian Hospital, College of Physicians and Surgeons, Columbia University, New York, New York, United States of America
| | - Robert J. Carroll
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
| | - Anne E. Eyler
- Department of Medicine, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
| | - Gosia Trynka
- Division of Rheumatology, Immunology, and Allergy, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- Division of Genetics, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts, United States of America
| | - Eli A. Stahl
- The Department of Psychiatry at Mount Sinai School of Medicine, New York, New York, United States of America
| | - Jing Cui
- Division of Rheumatology, Immunology, and Allergy, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Richa Saxena
- Center for Human Genetics Research, Massachusetts General Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Marieke J. H. Coenen
- Department of Human Genetics, Radboud University Medical Centre, Nijmegen, The Netherlands
| | - Henk-Jan Guchelaar
- Department of Clinical Pharmacy and Toxicology, Leiden University Medical Center, Leiden, The Netherlands
| | - Tom W. J. Huizinga
- Department of Rheumatology, Leiden University Medical Centre, Leiden, The Netherlands
| | - Philippe Dieudé
- Service de Rhumatologie et INSERM U699 Hôpital Bichat Claude Bernard, Assistance Publique des Hôpitaux de Paris, Paris, France
- Université Paris 7-Diderot, Paris, France
| | - Xavier Mariette
- Institut National de la Santé et de la Recherche Médicale (INSERM) U1012, Université Paris-Sud, Rhumatologie, Hôpitaux Universitaires Paris-Sud, Assistance Publique-Hôpitaux de Paris (AP-HP), Le Kremlin Bicêtre, France
| | - Anne Barton
- Arthritis Research UK Epidemiology Unit, Centre for Musculoskeletal Research, University of Manchester, Manchester Academic Health Science Centre, Manchester, United Kingdom
| | - Helena Canhão
- Rheumatology Research Unit, Instituto de Medicina Molecular, Faculdade de Medicina da Universidade de Lisboa, Lisbon, Portugal
- Rheumatology Department, Santa Maria Hospital–CHLN, Lisbon, Portugal
| | - João E. Fonseca
- Rheumatology Research Unit, Instituto de Medicina Molecular, Faculdade de Medicina da Universidade de Lisboa, Lisbon, Portugal
- Rheumatology Department, Santa Maria Hospital–CHLN, Lisbon, Portugal
| | - Niek de Vries
- Department of Clinical Immunology and Rheumatology & Department of Genome Analysis, Academic Medical Center/University of Amsterdam, Amsterdam, The Netherlands
| | - Paul P. Tak
- Department of Clinical Immunology and Rheumatology, Academic Medical Center/University of Amsterdam, Amsterdam, The Netherlands
- GlaxoSmithKline, Stevenage, United Kingdom
| | - Larry W. Moreland
- Division of Rheumatology and Clinical Immunology, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - S. Louis Bridges
- Division of Clinical Immunology and Rheumatology, Department of Medicine, University of Alabama at Birmingham, Birmingham, Alabama, United States of America
| | - Corinne Miceli-Richard
- Institut National de la Santé et de la Recherche Médicale (INSERM) U1012, Université Paris-Sud, Rhumatologie, Hôpitaux Universitaires Paris-Sud, Assistance Publique-Hôpitaux de Paris (AP-HP), Le Kremlin Bicêtre, France
| | - Hyon K. Choi
- Channing Laboratory, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- Section of Rheumatology, Boston University School of Medicine, Boston, Massachusetts, United States of America
- Clinical Epidemiology Research and Training Unit, Boston University School of Medicine, Boston, Massachusetts, United States of America
| | - Yoichiro Kamatani
- Laboratory for Statistical Analysis, Center for Integrative Medical Sciences, RIKEN, Yokohama, Japan
- Centre d'Etude du Polymorphisme Humain (CEPH), Paris, France
| | - Pilar Galan
- Université Paris 13 Sorbonne Paris Cité, UREN (Nutritional Epidemiology Research Unit), Inserm (U557), Inra (U1125), Cnam, Bobigny, France
| | - Mark Lathrop
- McGill University and Génome Québec Innovation Centre, Montréal, Canada
| | - Towfique Raj
- Division of Genetics, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts, United States of America
- Program in Translational NeuroPsychiatric Genomics, Institute for the Neurosciences, Department of Neurology, Brigham and Women's Hospital, Boston, Massachusetts, United States of America
| | - Philip L. De Jager
- Division of Genetics, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts, United States of America
- Program in Translational NeuroPsychiatric Genomics, Institute for the Neurosciences, Department of Neurology, Brigham and Women's Hospital, Boston, Massachusetts, United States of America
| | - Soumya Raychaudhuri
- Division of Rheumatology, Immunology, and Allergy, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- Division of Genetics, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts, United States of America
- NIHR Manchester Musculoskeletal Biomedical, Research Unit, Central Manchester NHS Foundation Trust, Manchester Academic Health Sciences Centre, Manchester, United Kingdom
| | - Jane Worthington
- Arthritis Research UK Epidemiology Unit, Centre for Musculoskeletal Research, University of Manchester, Manchester Academic Health Science Centre, Manchester, United Kingdom
- National Institute for Health Research, Manchester Musculoskeletal Biomedical Research Unit, Central Manchester University Hospitals National Health Service Foundation Trust, Manchester Academic Health Sciences Centre, Manchester, United Kingdom
| | - Leonid Padyukov
- Rheumatology Unit, Department of Medicine (Solna), Karolinska Institutet, Stockholm, Sweden
| | - Lars Klareskog
- Rheumatology Unit, Department of Medicine (Solna), Karolinska Institutet, Stockholm, Sweden
| | - Katherine A. Siminovitch
- Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Toronto, Canada
- Toronto General Research Institute, Toronto, Canada
- Department of Medicine, University of Toronto, Toronto, Canada
| | - Peter K. Gregersen
- The Feinstein Institute for Medical Research, North Shore–Long Island Jewish Health System, Manhasset, New York, United States of America
| | - Elaine R. Mardis
- The Genome Institute, Washington University School of Medicine, St. Louis, Missouri, United States of America
| | - Thurayya Arayssi
- Weill Cornell Medical College-Qatar, Education City, Doha, Qatar
| | - Layla A. Kazkaz
- Tishreen Hospital, Damascus, Syria
- Syrian Association for Rheumatology, Damascus, Syria
| | - Robert M. Plenge
- Division of Rheumatology, Immunology, and Allergy, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- Division of Genetics, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts, United States of America
- * E-mail:
| |
Collapse
|
1548
|
Peloso G, Auer P, Bis J, Voorman A, Morrison A, Stitziel N, Brody J, Khetarpal S, Crosby J, Fornage M, Isaacs A, Jakobsdottir J, Feitosa M, Davies G, Huffman J, Manichaikul A, Davis B, Lohman K, Joon A, Smith A, Grove M, Zanoni P, Redon V, Demissie S, Lawson K, Peters U, Carlson C, Jackson R, Ryckman K, Mackey R, Robinson J, Siscovick D, Schreiner P, Mychaleckyj J, Pankow J, Hofman A, Uitterlinden A, Harris T, Taylor K, Stafford J, Reynolds L, Marioni R, Dehghan A, Franco O, Patel A, Lu Y, Hindy G, Gottesman O, Bottinger E, Melander O, Orho-Melander M, Loos R, Duga S, Merlini P, Farrall M, Goel A, Asselta R, Girelli D, Martinelli N, Shah S, Kraus W, Li M, Rader D, Reilly M, McPherson R, Watkins H, Ardissino D, Zhang Q, Wang J, Tsai M, Taylor H, Correa A, Griswold M, Lange L, Starr J, Rudan I, Eiriksdottir G, Launer L, Ordovas J, Levy D, Chen YD, Reiner A, Hayward C, Polasek O, Deary I, Borecki I, Liu Y, Gudnason V, Wilson J, van Duijn C, Kooperberg C, Rich S, Psaty B, Rotter J, O’Donnell C, Rice K, Boerwinkle E, Kathiresan S, Cupples L, Cupples LA. Association of low-frequency and rare coding-sequence variants with blood lipids and coronary heart disease in 56,000 whites and blacks. Am J Hum Genet 2014; 94:223-32. [PMID: 24507774 DOI: 10.1016/j.ajhg.2014.01.009] [Citation(s) in RCA: 263] [Impact Index Per Article: 23.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2013] [Accepted: 01/09/2014] [Indexed: 10/25/2022] Open
Abstract
Low-frequency coding DNA sequence variants in the proprotein convertase subtilisin/kexin type 9 gene (PCSK9) lower plasma low-density lipoprotein cholesterol (LDL-C), protect against risk of coronary heart disease (CHD), and have prompted the development of a new class of therapeutics. It is uncertain whether the PCSK9 example represents a paradigm or an isolated exception. We used the "Exome Array" to genotype >200,000 low-frequency and rare coding sequence variants across the genome in 56,538 individuals (42,208 European ancestry [EA] and 14,330 African ancestry [AA]) and tested these variants for association with LDL-C, high-density lipoprotein cholesterol (HDL-C), and triglycerides. Although we did not identify new genes associated with LDL-C, we did identify four low-frequency (frequencies between 0.1% and 2%) variants (ANGPTL8 rs145464906 [c.361C>T; p.Gln121*], PAFAH1B2 rs186808413 [c.482C>T; p.Ser161Leu], COL18A1 rs114139997 [c.331G>A; p.Gly111Arg], and PCSK7 rs142953140 [c.1511G>A; p.Arg504His]) with large effects on HDL-C and/or triglycerides. None of these four variants was associated with risk for CHD, suggesting that examples of low-frequency coding variants with robust effects on both lipids and CHD will be limited.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | - L Adrienne Cupples
- Department of Biostatistics, Boston University School of Public Health, Boston, MA 02118, USA; National Heart, Lung, and Blood Institute (NHLBI) Framingham Heart Study, Framingham, MA 01702, USA.
| |
Collapse
|
1549
|
Sha Q, Zhang S. A novel test for testing the optimally weighted combination of rare and common variants based on data of parents and affected children. Genet Epidemiol 2014; 38:135-43. [PMID: 24382753 PMCID: PMC4162402 DOI: 10.1002/gepi.21787] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2013] [Revised: 10/28/2013] [Accepted: 12/02/2013] [Indexed: 11/10/2022]
Abstract
With the development of sequencing technologies, the direct testing of rare variant associations has become possible. Many statistical methods for detecting associations between rare variants and complex diseases have recently been developed, most of which are population-based methods for unrelated individuals. A limitation of population-based methods is that spurious associations can occur when there is a population structure. For rare variants, this problem can be more serious, because the spectrum of rare variation can be very different in diverse populations, as well as the current nonexistence of methods to control for population stratification in population-based rare variant associations. A solution to the problem of population stratification is to use family-based association tests, which use family members to control for population stratification. In this article, we propose a novel test for Testing the Optimally Weighted combination of variants based on data of Parents and Affected Children (TOW-PAC). TOW-PAC is a family-based association test that tests the combined effect of rare and common variants in a genomic region, and is robust to the directions of the effects of causal variants. Simulation studies confirm that, for rare variant associations, family-based association tests are robust to population stratification although population-based association tests can be seriously confounded by population stratification. The results of power comparisons show that the power of TOW-PAC increases with an increase of the number of affected children in each family and TOW-PAC based on multiple affected children per family is more powerful than TOW based on unrelated individuals.
Collapse
Affiliation(s)
- Qiuying Sha
- Department of Mathematical Sciences, Michigan Technological University, Houghton, Michigan, United States of America
| | | |
Collapse
|
1550
|
Liu DJ, Peloso GM, Zhan X, Holmen OL, Zawistowski M, Feng S, Nikpay M, Auer PL, Goel A, Zhang H, Peters U, Farrall M, Orho-Melander M, Kooperberg C, McPherson R, Watkins H, Willer CJ, Hveem K, Melander O, Kathiresan S, Abecasis GR. Meta-analysis of gene-level tests for rare variant association. Nat Genet 2014; 46:200-4. [PMID: 24336170 PMCID: PMC3939031 DOI: 10.1038/ng.2852] [Citation(s) in RCA: 144] [Impact Index Per Article: 13.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2013] [Accepted: 11/20/2013] [Indexed: 12/14/2022]
Abstract
The majority of reported complex disease associations for common genetic variants have been identified through meta-analysis, a powerful approach that enables the use of large sample sizes while protecting against common artifacts due to population structure and repeated small-sample analyses sharing individual-level data. As the focus of genetic association studies shifts to rare variants, genes and other functional units are becoming the focus of analysis. Here we propose and evaluate new approaches for performing meta-analysis of rare variant association tests, including burden tests, weighted burden tests, variable-threshold tests and tests that allow variants with opposite effects to be grouped together. We show that our approach retains useful features from single-variant meta-analysis approaches and demonstrate its use in a study of blood lipid levels in ∼18,500 individuals genotyped with exome arrays.
Collapse
Affiliation(s)
- Dajiang J. Liu
- Center for Statistical Genetics, Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI 48109
| | - Gina M. Peloso
- Broad Institute of Harvard and MIT, Cambridge, MA
- Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA
- Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA
| | - Xiaowei Zhan
- Center for Statistical Genetics, Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI 48109
| | - Oddgeir L. Holmen
- Department of Public Health and General Practice, Norwegian University of Science and Technology, Trondheim 7489, Norway
- St. Olav Hospital, Trondheim University Hospital, Trondheim, Norway
| | - Matthew Zawistowski
- Center for Statistical Genetics, Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI 48109
| | - Shuang Feng
- Center for Statistical Genetics, Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI 48109
| | - Majid Nikpay
- University of Ottawa Heart Institute, Ottawa, Ontario, Canada
| | - Paul L. Auer
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle WA 98109, USA
- School of Public Health, University of Wisconsin-Milwaukee
| | - Anuj Goel
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, United Kingdom
- Department of Cardiovascular Medicine, University of Oxford, Oxford, UK
| | - He Zhang
- Division of Cardiology, Department of Internal Medicine, University of Michigan Medical School, Ann Arbor, MI 48109
- Department of Human Genetics, University of Michigan Medical School, Ann Arbor, MI 48109
| | - Ulrike Peters
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle WA 98109, USA
- Department of Epidemiology, University of Washington School of Public Health, Seattle, WA
| | - Martin Farrall
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, United Kingdom
- Department of Cardiovascular Medicine, University of Oxford, Oxford, UK
| | - Marju Orho-Melander
- Department of Cardiovascular Medicine, University of Oxford, Oxford, UK
- Department of Clinical Sciences, Lund University, Malmö, Sweden
| | - Charles Kooperberg
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle WA 98109, USA
- Department of Biostatistics, University of Washington School of Public Health, Seattle, WA
| | - Ruth McPherson
- University of Ottawa Heart Institute, Ottawa, Ontario, Canada
| | - Hugh Watkins
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, United Kingdom
- Department of Cardiovascular Medicine, University of Oxford, Oxford, UK
| | - Cristen J. Willer
- Division of Cardiology, Department of Internal Medicine, University of Michigan Medical School, Ann Arbor, MI 48109
- Department of Human Genetics, University of Michigan Medical School, Ann Arbor, MI 48109
| | - Kristian Hveem
- Department of Public Health and General Practice, Norwegian University of Science and Technology, Trondheim 7489, Norway
- Levanger Hospital, Levanger, Norway
| | - Olle Melander
- Department of Cardiovascular Medicine, University of Oxford, Oxford, UK
- Department of Clinical Sciences, Lund University, Malmö, Sweden
| | - Sekar Kathiresan
- Broad Institute of Harvard and MIT, Cambridge, MA
- Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA
- Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA
- Harvard Medical School, Cambridge, MA
| | - Gonçalo R. Abecasis
- Center for Statistical Genetics, Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI 48109
| |
Collapse
|