51
|
Cochran JN, Geier EG, Bonham LW, Newberry JS, Amaral MD, Thompson ML, Lasseigne BN, Karydas AM, Roberson ED, Cooper GM, Rabinovici GD, Miller BL, Myers RM, Yokoyama JS. Non-coding and Loss-of-Function Coding Variants in TET2 are Associated with Multiple Neurodegenerative Diseases. Am J Hum Genet 2020; 106:632-645. [PMID: 32330418 PMCID: PMC7212268 DOI: 10.1016/j.ajhg.2020.03.010] [Citation(s) in RCA: 46] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2019] [Accepted: 03/20/2020] [Indexed: 12/13/2022] Open
Abstract
We conducted genome sequencing to search for rare variation contributing to early-onset Alzheimer's disease (EOAD) and frontotemporal dementia (FTD). Discovery analysis was conducted on 435 cases and 671 controls of European ancestry. Burden testing for rare variation associated with disease was conducted using filters based on variant rarity (less than one in 10,000 or private), computational prediction of deleteriousness (CADD) (10 or 15 thresholds), and molecular function (protein loss-of-function [LoF] only, coding alteration only, or coding plus non-coding variants in experimentally predicted regulatory regions). Replication analysis was conducted on 16,434 independent cases and 15,587 independent controls. Rare variants in TET2 were enriched in the discovery combined EOAD and FTD cohort (p = 4.6 × 10-8, genome-wide corrected p = 0.0026). Most of these variants were canonical LoF or non-coding in predicted regulatory regions. This enrichment replicated across several cohorts of Alzheimer's disease (AD) and FTD (replication only p = 0.0029). The combined analysis odds ratio was 2.3 (95% confidence interval [CI] 1.6-3.4) for AD and FTD. The odds ratio for qualifying non-coding variants considered independently from coding variants was 3.7 (95% CI 1.7-9.4). For LoF variants, the combined odds ratio (for AD, FTD, and amyotrophic lateral sclerosis, which shares clinicopathological overlap with FTD) was 3.1 (95% CI 1.9-5.2). TET2 catalyzes DNA demethylation. Given well-defined changes in DNA methylation that occur during aging, rare variation in TET2 may confer risk for neurodegeneration by altering the homeostasis of key aging-related processes. Additionally, our study emphasizes the relevance of non-coding variation in genetic studies of complex disease.
Collapse
Affiliation(s)
- J Nicholas Cochran
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, United States
| | - Ethan G Geier
- Memory and Aging Center, Department of Neurology, University of California, San Francisco, San Francisco, CA 94158, United States
| | - Luke W Bonham
- Memory and Aging Center, Department of Neurology, University of California, San Francisco, San Francisco, CA 94158, United States
| | - J Scott Newberry
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, United States
| | - Michelle D Amaral
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, United States
| | - Michelle L Thompson
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, United States
| | - Brittany N Lasseigne
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, United States; Department of Cell, Developmental and Integrative Biology, University of Alabama at Birmingham, Birmingham, AL 35294, United States
| | - Anna M Karydas
- Memory and Aging Center, Department of Neurology, University of California, San Francisco, San Francisco, CA 94158, United States
| | - Erik D Roberson
- Center for Neurodegeneration and Experimental Therapeutics, Alzheimer's Disease Center, Departments of Neurology and Neurobiology, University of Alabama at Birmingham, Birmingham, AL 35294, United States
| | - Gregory M Cooper
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, United States
| | - Gil D Rabinovici
- Memory and Aging Center, Department of Neurology, University of California, San Francisco, San Francisco, CA 94158, United States; Department of Radiology and Biomedical Imaging, University of California, San Francisco, San Francisco, CA 94158, United States
| | - Bruce L Miller
- Memory and Aging Center, Department of Neurology, University of California, San Francisco, San Francisco, CA 94158, United States
| | - Richard M Myers
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, United States
| | - Jennifer S Yokoyama
- Memory and Aging Center, Department of Neurology, University of California, San Francisco, San Francisco, CA 94158, United States; Department of Radiology and Biomedical Imaging, University of California, San Francisco, San Francisco, CA 94158, United States.
| |
Collapse
|
52
|
Comprehensive functional annotation of susceptibility variants associated with asthma. Hum Genet 2020; 139:1037-1053. [PMID: 32240371 DOI: 10.1007/s00439-020-02151-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2019] [Accepted: 03/18/2020] [Indexed: 01/02/2023]
Abstract
Genome-wide association studies (GWAS) have identified hundreds of primarily non-coding disease-susceptibility variants that further need functional interpretation to prioritize and discriminate the disease-relevant variants. We present a comprehensive genome-wide non-coding variant prioritization scheme followed by validation using Pyrosequencing and TaqMan assays in asthma. We implemented a composite Functional Annotation Score (cFAS) to investigate over 32,000 variants consisting of 1525 GWAS-lead asthma-susceptibility variants and their LD proxies (r2 ≥ 0.80). Functional annotation pipeline in cFAS revealed 274 variants with significant score at 1% false discovery rate. This study implicates a novel locus 4p16 (SLC26A1) with eQTL variant (rs11936407) and known loci in 17q12-21 and 5q22 which encode ORM1-like protein 3 (ORMDL3, rs406527, and rs12936231) and thymic stromal lymphopoietin (TSLP, rs3806932 and rs10073816) epithelial gene, respectively. Follow-up validation analysis through pyrosequencing of CpG sites in and nearby rs4065275 and rs11936407 showed genotype-dependent hypomethylation on asthma cases compared with healthy controls. Prioritized variants are enriched for asthma-specific histone modification associated with active chromatin (H3K4me1 and H3K27ac) in T cells, B cells, lung, and immune-related interferon gamma signaling pathways. Our findings, together with those from prior studies, suggest that SNPs can affect asthma by regulating enhancer activity, and our comprehensive bioinformatics and functional analysis could lead to biological insights into asthma pathogenesis.Graphic abstract.
Collapse
|
53
|
Li B, Lu Q, Zhao H. An evaluation of noncoding genome annotation tools through enrichment analysis of 15 genome-wide association studies. Brief Bioinform 2020; 20:995-1003. [PMID: 29106447 DOI: 10.1093/bib/bbx131] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2017] [Revised: 09/02/2017] [Indexed: 01/08/2023] Open
Abstract
Functionally annotating genetic variations is an essential yet challenging topic in human genetics research. As large consortia including ENCODE and Roadmap Epigenomics Project continue to generate high-throughput transcriptomic and epigenomic data, many computational frameworks have been developed to integrate these experimental data to predict functionality of genetic variations in both protein-coding and noncoding regions. Here, we compare a number of recently developed annotation frameworks for noncoding regions through enrichment analysis on genome-wide association studies (GWASs). We also compare several different strategies to quantify enrichment using GWAS summary statistics. Our analyses highlight the importance of jointly modeling context-specific annotations with genome-wide data in providing statistically powerful and biologically interpretable enrichment for complex disease associations. Our findings provide insights into when and how computational genome annotations may benefit future complex disease studies on the genome-wide scale.
Collapse
Affiliation(s)
- Boyang Li
- Department of Biostatistics, Yale School of Public Health
| | - Qiongshi Lu
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison
| | - Hongyu Zhao
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA
| |
Collapse
|
54
|
Sims R, Hill M, Williams J. The multiplex model of the genetics of Alzheimer's disease. Nat Neurosci 2020; 23:311-322. [PMID: 32112059 DOI: 10.1038/s41593-020-0599-5] [Citation(s) in RCA: 249] [Impact Index Per Article: 62.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2018] [Accepted: 01/24/2020] [Indexed: 12/25/2022]
Abstract
Genes play a strong role in Alzheimer's disease (AD), with late-onset AD showing heritability of 58-79% and early-onset AD showing over 90%. Genetic association provides a robust platform to build our understanding of the etiology of this complex disease. Over 50 loci are now implicated for AD, suggesting that AD is a disease of multiple components, as supported by pathway analyses (immunity, endocytosis, cholesterol transport, ubiquitination, amyloid-β and tau processing). Over 50% of late-onset AD heritability has been captured, allowing researchers to calculate the accumulation of AD genetic risk through polygenic risk scores. A polygenic risk score predicts disease with up to 90% accuracy and is an exciting tool in our research armory that could allow selection of those with high polygenic risk scores for clinical trials and precision medicine. It could also allow cellular modelling of the combined risk. Here we propose the multiplex model as a new perspective from which to understand AD. The multiplex model reflects the combination of some, or all, of these model components (genetic and environmental), in a tissue-specific manner, to trigger or sustain a disease cascade, which ultimately results in the cell and synaptic loss observed in AD.
Collapse
Affiliation(s)
- Rebecca Sims
- Division of Psychological Medicine and Clinical Neurosciences, School of Medicine, Cardiff University, Cardiff, UK
| | - Matthew Hill
- Division of Psychological Medicine and Clinical Neurosciences, School of Medicine, Cardiff University, Cardiff, UK
- UK Dementia Research Institute, School of Medicine, Cardiff University, Cardiff, UK
| | - Julie Williams
- Division of Psychological Medicine and Clinical Neurosciences, School of Medicine, Cardiff University, Cardiff, UK.
- UK Dementia Research Institute, School of Medicine, Cardiff University, Cardiff, UK.
| |
Collapse
|
55
|
Enhancing α-secretase Processing for Alzheimer's Disease-A View on SFRP1. Brain Sci 2020; 10:brainsci10020122. [PMID: 32098349 PMCID: PMC7071437 DOI: 10.3390/brainsci10020122] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2020] [Revised: 02/19/2020] [Accepted: 02/20/2020] [Indexed: 12/24/2022] Open
Abstract
Amyloid β (Aβ) peptides generated via sequential β- and γ-secretase processing of the amyloid precursor protein (APP) are major etiopathological agents of Alzheimer's disease (AD). However, an initial APP cleavage by an α-secretase, such as the a disintegrin and metalloproteinase domain-containing protein ADAM10, precludes β-secretase cleavage and leads to APP processing that does not produce Aβ. The latter appears to underlie the disease symptom-attenuating effects of a multitude of experimental therapeutics in AD animal models. Recent work has indicated that an endogenous inhibitor of ADAM10, secreted-frizzled-related protein 1 (SFRP1), is elevated in human AD brains and associated with amyloid plaques in mouse AD models. Importantly, genetic or functional attenuation of SFRP1 lowered Aβ accumulation and improved AD-related histopathological and neurological traits. Given SFRP1's well-known activity in attenuating Wnt signaling, which is also commonly impaired in AD, SFRP1 appears to be a promising therapeutic target for AD. This idea, however, needs to be addressed with care because of cancer enhancement potentials resulting from a systemic loss of SFRP1 activity, as well as an upregulation of ADAM10 activity. In this focused review, I shall discuss α-secretase-effected APP processing in AD with a focus on SFRP1, and explore the contrasting perspectives arising from the recent findings.
Collapse
|
56
|
Wang C, Zhang S. Large-scale determination and characterization of cell type-specific regulatory elements in the human genome. J Mol Cell Biol 2019; 9:463-476. [PMID: 29281093 DOI: 10.1093/jmcb/mjx058] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2017] [Accepted: 12/19/2017] [Indexed: 01/05/2023] Open
Abstract
Histone modifications have been widely elucidated to play vital roles in gene regulation and cell identity. The Roadmap Epigenomics Consortium generated a reference catalog of several key histone modifications across >100s of human cell types and tissues. Decoding these epigenomes into functional regulatory elements is a challenging task in computational biology. To this end, we adopted a differential chromatin modification analysis framework to comprehensively determine and characterize cell type-specific regulatory elements (CSREs) and their histone modification codes in the human epigenomes of five histone modifications across 127 tissues or cell types. The CSREs show significant relevance with cell type-specific biological functions and diseases and cell identity. Clustering of CSREs with their specificity signals reveals distinct histone codes, demonstrating the diversity of functional roles of CSREs within the same cell or tissue. Last but not least, dynamics of CSREs from close cell types or tissues can give a detailed view of developmental processes such as normal tissue development and cancer occurrence.
Collapse
Affiliation(s)
- Can Wang
- NCMIS, CEMS, RCSDS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China.,School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Shihua Zhang
- NCMIS, CEMS, RCSDS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China.,School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
57
|
Fang L, Zhou Y, Liu S, Jiang J, Bickhart DM, Null DJ, Li B, Schroeder SG, Rosen BD, Cole JB, Van Tassell CP, Ma L, Liu GE. Comparative analyses of sperm DNA methylomes among human, mouse and cattle provide insights into epigenomic evolution and complex traits. Epigenetics 2019; 14:260-276. [PMID: 30810461 PMCID: PMC6557555 DOI: 10.1080/15592294.2019.1582217] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
Sperm DNA methylation is crucial for fertility and viability of offspring but epigenome evolution in mammals is largely understudied. By comparing sperm DNA methylomes and large-scale genome-wide association study (GWAS) signals between human and cattle, we aimed to examine the DNA methylome evolution and its associations with complex phenotypes in mammals. Our analysis revealed that genes with conserved non-methylated promoters (e.g., ANKS1A and WNT7A) among human and cattle were involved in common system and embryo development, and enriched for GWAS signals of body conformation traits in both species, while genes with conserved hypermethylated promoters (e.g., TCAP and CD80) were engaged in immune responses and highlighted by immune-related traits. On the other hand, genes with human-specific hypomethylated promoters (e.g., FOXP2 and HYDIN) were engaged in neuron system development and enriched for GWAS signals of brain-related traits, while genes with cattle-specific hypomethylated promoters (e.g., LDHB and DGAT2) mainly participated in lipid storage and metabolism. We validated our findings using sperm-retained nucleosome, preimplantation transcriptome, and adult tissue transcriptome data, as well as sequence evolutionary features, including motif binding sites, mutation rates, recombination rates and evolution signatures. In conclusion, our results demonstrate important roles of epigenome evolution in shaping the genetic architecture underlying complex phenotypes, hence enhance signal prioritization in GWAS and provide valuable information for human neurological disorders and livestock genetic improvement.
Collapse
Affiliation(s)
- Lingzhao Fang
- a Animal Genomics and Improvement Laboratory, BARC , Agricultural Research Service, USDA , Beltsville , MD , USA.,b Department of Animal and Avian Sciences , University of Maryland , College Park , MD , USA
| | - Yang Zhou
- c Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction, Education Ministry of China , Huazhong Agricultural University , Wuhan , Hubei , China
| | - Shuli Liu
- a Animal Genomics and Improvement Laboratory, BARC , Agricultural Research Service, USDA , Beltsville , MD , USA.,d Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture & National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology , China Agricultural University , Beijing , China
| | - Jicai Jiang
- b Department of Animal and Avian Sciences , University of Maryland , College Park , MD , USA
| | - Derek M Bickhart
- e Dairy Forage Research Center , Agricultural Research Service, USDA , Madison , WI , USA
| | - Daniel J Null
- a Animal Genomics and Improvement Laboratory, BARC , Agricultural Research Service, USDA , Beltsville , MD , USA
| | - Bingjie Li
- a Animal Genomics and Improvement Laboratory, BARC , Agricultural Research Service, USDA , Beltsville , MD , USA
| | - Steven G Schroeder
- a Animal Genomics and Improvement Laboratory, BARC , Agricultural Research Service, USDA , Beltsville , MD , USA
| | - Benjamin D Rosen
- a Animal Genomics and Improvement Laboratory, BARC , Agricultural Research Service, USDA , Beltsville , MD , USA
| | - John B Cole
- a Animal Genomics and Improvement Laboratory, BARC , Agricultural Research Service, USDA , Beltsville , MD , USA
| | - Curtis P Van Tassell
- a Animal Genomics and Improvement Laboratory, BARC , Agricultural Research Service, USDA , Beltsville , MD , USA
| | - Li Ma
- b Department of Animal and Avian Sciences , University of Maryland , College Park , MD , USA
| | - George E Liu
- a Animal Genomics and Improvement Laboratory, BARC , Agricultural Research Service, USDA , Beltsville , MD , USA
| |
Collapse
|
58
|
Sakornsakolpat P, Prokopenko D, Lamontagne M, Reeve NF, Guyatt AL, Jackson VE, Shrine N, Qiao D, Bartz TM, Kim DK, Lee MK, Latourelle JC, Li X, Morrow JD, Obeidat M, Wyss AB, Bakke P, Barr RG, Beaty TH, Belinsky SA, Brusselle GG, Crapo JD, de Jong K, DeMeo DL, Fingerlin TE, Gharib SA, Gulsvik A, Hall IP, Hokanson JE, Kim WJ, Lomas DA, London SJ, Meyers DA, O'Connor GT, Rennard SI, Schwartz DA, Sliwinski P, Sparrow D, Strachan DP, Tal-Singer R, Tesfaigzi Y, Vestbo J, Vonk JM, Yim JJ, Zhou X, Bossé Y, Manichaikul A, Lahousse L, Silverman EK, Boezen HM, Wain LV, Tobin MD, Hobbs BD, Cho MH. Genetic landscape of chronic obstructive pulmonary disease identifies heterogeneous cell-type and phenotype associations. Nat Genet 2019; 51:494-505. [PMID: 30804561 PMCID: PMC6546635 DOI: 10.1038/s41588-018-0342-2] [Citation(s) in RCA: 215] [Impact Index Per Article: 43.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2018] [Accepted: 12/20/2018] [Indexed: 11/09/2022]
Abstract
Chronic obstructive pulmonary disease (COPD) is the leading cause of respiratory mortality worldwide. Genetic risk loci provide new insights into disease pathogenesis. We performed a genome-wide association study in 35,735 cases and 222,076 controls from the UK Biobank and additional studies from the International COPD Genetics Consortium. We identified 82 loci associated with P < 5 × 10-8; 47 of these were previously described in association with either COPD or population-based measures of lung function. Of the remaining 35 new loci, 13 were associated with lung function in 79,055 individuals from the SpiroMeta consortium. Using gene expression and regulation data, we identified functional enrichment of COPD risk loci in lung tissue, smooth muscle, and several lung cell types. We found 14 COPD loci shared with either asthma or pulmonary fibrosis. COPD genetic risk loci clustered into groups based on associations with quantitative imaging features and comorbidities. Our analyses provide further support for the genetic susceptibility and heterogeneity of COPD.
Collapse
Affiliation(s)
- Phuwanat Sakornsakolpat
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA, USA
- Department of Medicine, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand
| | - Dmitry Prokopenko
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA, USA
- Genetics and Aging Research Unit, Department of Neurology, Massachusetts General Hospital, Boston, MA, USA
| | - Maxime Lamontagne
- Institut Universitaire de Cardiologie et de Pneumologie de Québec, Québec, Quebec, Canada
| | - Nicola F Reeve
- Genetic Epidemiology Group, Department of Health Sciences, University of Leicester, Leicester, UK
| | - Anna L Guyatt
- Genetic Epidemiology Group, Department of Health Sciences, University of Leicester, Leicester, UK
| | - Victoria E Jackson
- Genetic Epidemiology Group, Department of Health Sciences, University of Leicester, Leicester, UK
| | - Nick Shrine
- Genetic Epidemiology Group, Department of Health Sciences, University of Leicester, Leicester, UK
| | - Dandi Qiao
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA, USA
| | - Traci M Bartz
- Cardiovascular Health Research Unit, University of Washington, Seattle, WA, USA
- Department of Medicine, University of Washington, Seattle, WA, USA
- Department of Biostatistics, University of Washington, Seattle, WA, USA
| | - Deog Kyeom Kim
- Seoul National University College of Medicine, SMG-SNU Boramae Medical Center, Seoul, South Korea
| | - Mi Kyeong Lee
- Epidemiology Branch, National Institute of Environmental Health Sciences, National Institutes of Health, Department of Health and Human Services, Raleigh, NC, USA
| | - Jeanne C Latourelle
- Department of Neurology, Boston University School of Medicine, Boston, MA, USA
| | - Xingnan Li
- Department of Medicine, University of Arizona, Tucson, AZ, USA
| | - Jarrett D Morrow
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA, USA
| | - Ma'en Obeidat
- University of British Columbia Center for Heart Lung Innovation, St. Paul's Hospital, Vancouver, British Columbia, Canada
| | - Annah B Wyss
- Epidemiology Branch, National Institute of Environmental Health Sciences, National Institutes of Health, Department of Health and Human Services, Raleigh, NC, USA
| | - Per Bakke
- Department of Clinical Science, University of Bergen, Bergen, Norway
| | - R Graham Barr
- Department of Medicine, College of Physicians and Surgeons and Department of Epidemiology, Mailman School of Public Health, Columbia University, New York, NY, USA
| | - Terri H Beaty
- Department of Epidemiology, Johns Hopkins University Bloomberg School of Public Health, Baltimore, MD, USA
| | | | - Guy G Brusselle
- Department of Epidemiology, Erasmus Medical Center, Rotterdam, the Netherlands
- Department of Respiratory Medicine, Ghent University Hospital, Ghent, Belgium
- Department of Respiratory Medicine, Erasmus Medical Center, Rotterdam, the Netherlands
| | - James D Crapo
- Division of Pulmonary and Critical Care Medicine, Department of Medicine, National Jewish Health, Denver, CO, USA
| | - Kim de Jong
- University of Groningen, University Medical Center Groningen, Department of Epidemiology, Groningen, the Netherlands
- University of Groningen, University Medical Center Groningen, Groningen Research Institute for Asthma and COPD (GRIAC), Groningen, the Netherlands
| | - Dawn L DeMeo
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA, USA
- Division of Pulmonary and Critical Care Medicine, Brigham and Women's Hospital, Boston, MA, USA
| | - Tasha E Fingerlin
- Center for Genes, Environment and Health, National Jewish Health, Denver, CO, USA
- Department of Biostatistics and Informatics, University of Colorado Denver, Aurora, CO, USA
| | - Sina A Gharib
- Computational Medicine Core, Center for Lung Biology, UW Medicine Sleep Center, Department of Medicine, University of Washington, Seattle, WA, USA
| | - Amund Gulsvik
- Department of Clinical Science, University of Bergen, Bergen, Norway
| | - Ian P Hall
- Division of Respiratory Medicine, Queen's Medical Centre, University of Nottingham, Nottingham, UK
- National Institute for Health Research Nottingham Biomedical Research Centre, Nottingham, UK
| | - John E Hokanson
- Department of Epidemiology, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Woo Jin Kim
- Department of Internal Medicine and Environmental Health Center, School of Medicine, Kangwon National University, Chuncheon, South Korea
| | - David A Lomas
- UCL Respiratory, University College London, London, UK
| | - Stephanie J London
- Epidemiology Branch, National Institute of Environmental Health Sciences, National Institutes of Health, Department of Health and Human Services, Raleigh, NC, USA
| | | | - George T O'Connor
- National Heart, Lung, and Blood Institute's Framingham Heart Study, Framingham, MA, USA
- Pulmonary Center, Department of Medicine, Boston University School of Medicine, Boston, MA, USA
| | - Stephen I Rennard
- Pulmonary, Critical Care, Sleep and Allergy Division, Department of Internal Medicine, University of Nebraska Medical Center, Omaha, NE, USA
- Clinical Discovery Unit, AstraZeneca, Cambridge, UK
| | - David A Schwartz
- Department of Medicine, School of Medicine, University of Colorado Denver, Aurora, CO, USA
- Department of Immunology, School of Medicine, University of Colorado Denver, Aurora, CO, USA
| | - Pawel Sliwinski
- 2nd Department of Respiratory Medicine, Institute of Tuberculosis and Lung Diseases, Warsaw, Poland
| | - David Sparrow
- VA Boston Healthcare System and Department of Medicine, Boston University School of Medicine, Boston, MA, USA
| | - David P Strachan
- Population Health Research Institute, St. George's University of London, London, UK
| | | | | | - Jørgen Vestbo
- School of Biological Sciences, University of Manchester, Manchester, UK
| | - Judith M Vonk
- University of Groningen, University Medical Center Groningen, Department of Epidemiology, Groningen, the Netherlands
- University of Groningen, University Medical Center Groningen, Groningen Research Institute for Asthma and COPD (GRIAC), Groningen, the Netherlands
| | - Jae-Joon Yim
- Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, Seoul National University College of Medicine, Seoul, South Korea
| | - Xiaobo Zhou
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA, USA
| | - Yohan Bossé
- Institut Universitaire de Cardiologie et de Pneumologie de Québec, Québec, Quebec, Canada
- Department of Molecular Medicine, Laval University, Québec, Québec, Canada
| | - Ani Manichaikul
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
- Department of Public Health Sciences, University of Virginia, Charlottesville, VA, USA
| | - Lies Lahousse
- Department of Epidemiology, Erasmus Medical Center, Rotterdam, the Netherlands
- Department of Bioanalysis, Ghent University, Ghent, Belgium
| | - Edwin K Silverman
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA, USA
- Division of Pulmonary and Critical Care Medicine, Brigham and Women's Hospital, Boston, MA, USA
| | - H Marike Boezen
- University of Groningen, University Medical Center Groningen, Department of Epidemiology, Groningen, the Netherlands
- University of Groningen, University Medical Center Groningen, Groningen Research Institute for Asthma and COPD (GRIAC), Groningen, the Netherlands
| | - Louise V Wain
- Genetic Epidemiology Group, Department of Health Sciences, University of Leicester, Leicester, UK
- National Institute for Health Research Leicester Respiratory Biomedical Research Centre, Glenfield Hospital, Leicester, UK
| | - Martin D Tobin
- Genetic Epidemiology Group, Department of Health Sciences, University of Leicester, Leicester, UK
- National Institute for Health Research Leicester Respiratory Biomedical Research Centre, Glenfield Hospital, Leicester, UK
| | - Brian D Hobbs
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA, USA
- Division of Pulmonary and Critical Care Medicine, Brigham and Women's Hospital, Boston, MA, USA
| | - Michael H Cho
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA, USA.
- Division of Pulmonary and Critical Care Medicine, Brigham and Women's Hospital, Boston, MA, USA.
| |
Collapse
|
59
|
Hu Y, Li M, Lu Q, Weng H, Wang J, Zekavat SM, Yu Z, Li B, Gu J, Muchnik S, Shi Y, Kunkle BW, Mukherjee S, Natarajan P, Naj A, Kuzma A, Zhao Y, Crane PK, Lu H, Zhao H. A statistical framework for cross-tissue transcriptome-wide association analysis. Nat Genet 2019; 51:568-576. [PMID: 30804563 PMCID: PMC6788740 DOI: 10.1038/s41588-019-0345-7] [Citation(s) in RCA: 197] [Impact Index Per Article: 39.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2018] [Accepted: 01/09/2019] [Indexed: 12/12/2022]
Abstract
Transcriptome-wide association analysis is a powerful approach to studying the genetic architecture of complex traits. A key component of this approach is to build a model to impute gene expression levels from genotypes by using samples with matched genotypes and gene expression data in a given tissue. However, it is challenging to develop robust and accurate imputation models with a limited sample size for any single tissue. Here, we first introduce a multi-task learning method to jointly impute gene expression in 44 human tissues. Compared with single-tissue methods, our approach achieved an average of 39% improvement in imputation accuracy and generated effective imputation models for an average of 120% more genes. We describe a summary-statistic-based testing framework that combines multiple single-tissue associations into a powerful metric to quantify the overall gene-trait association. We applied our method, called UTMOST (unified test for molecular signatures), to multiple genome-wide-association results and demonstrate its advantages over single-tissue strategies.
Collapse
Affiliation(s)
- Yiming Hu
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA
| | - Mo Li
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA
| | - Qiongshi Lu
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, USA
| | - Haoyi Weng
- Division of Biostatistics, The Jockey Club School of Public Health and Primary Care, The Chinese University of Hong Kong, Shatin, NT, Hong Kong
| | - Jiawei Wang
- Program of Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA
| | - Seyedeh M Zekavat
- Yale School of Medicine, New Haven, CT, USA
- Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Zhaolong Yu
- Program of Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA
| | - Boyang Li
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA
| | - Jianlei Gu
- SJTU-Yale Joint Center for Biostatistics, Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiaotong University, Shanghai, China
| | - Sydney Muchnik
- Department of Genetics, Yale School of Medicine, New Haven, CT, USA
| | - Yu Shi
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA
| | - Brian W Kunkle
- John P. Hussman Institute for Human Genomics, University of Miami Miller School of Medicine, Miami, FL, USA
| | | | - Pradeep Natarajan
- Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Adam Naj
- Center for Clinical Epidemiology and Biostatistic, and the Department of Biostatistics, Epidemiology, and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Amanda Kuzma
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Yi Zhao
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Paul K Crane
- Department of Medicine, University of Washington, Seattle, WA, USA
| | - Hui Lu
- SJTU-Yale Joint Center for Biostatistics, Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiaotong University, Shanghai, China
| | - Hongyu Zhao
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA.
- Program of Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA.
- SJTU-Yale Joint Center for Biostatistics, Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiaotong University, Shanghai, China.
- Department of Genetics, Yale School of Medicine, New Haven, CT, USA.
| |
Collapse
|
60
|
Li YI, Wong G, Humphrey J, Raj T. Prioritizing Parkinson's disease genes using population-scale transcriptomic data. Nat Commun 2019; 10:994. [PMID: 30824768 PMCID: PMC6397174 DOI: 10.1038/s41467-019-08912-9] [Citation(s) in RCA: 96] [Impact Index Per Article: 19.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2018] [Accepted: 02/05/2019] [Indexed: 12/23/2022] Open
Abstract
Genome-wide association studies (GWAS) have identified over 41 susceptibility loci associated with Parkinson's Disease (PD) but identifying putative causal genes and the underlying mechanisms remains challenging. Here, we leverage large-scale transcriptomic datasets to prioritize genes that are likely to affect PD by using a transcriptome-wide association study (TWAS) approach. Using this approach, we identify 66 gene associations whose predicted expression or splicing levels in dorsolateral prefrontal cortex (DLFPC) and peripheral monocytes are significantly associated with PD risk. We uncover many novel genes associated with PD but also novel mechanisms for known associations such as MAPT, for which we find that variation in exon 3 splicing explains the common genetic association. Genes identified in our analyses belong to the same or related pathways including lysosomal and innate immune function. Overall, our study provides a strong foundation for further mechanistic studies that will elucidate the molecular drivers of PD.
Collapse
Affiliation(s)
- Yang I Li
- Section of Genetic Medicine, Department of Medicine, and Department of Human Genetics, University of Chicago, Chicago, 60637, IL, USA
| | - Garrett Wong
- Departments of Neuroscience, and Genetics and Genomic Sciences, Ronald M. Loeb Center for Alzheimer's disease, Icahn School of Medicine at Mount Sinai, New York, 10029, NY, USA
| | - Jack Humphrey
- UCL Genetics Institute, Gower Street, London, WC1E 6BT, UK
- Department of Neurodegenerative Disease, UCL Institute of Neurology, London, WC1E 6BT, UK
| | - Towfique Raj
- Departments of Neuroscience, and Genetics and Genomic Sciences, Ronald M. Loeb Center for Alzheimer's disease, Icahn School of Medicine at Mount Sinai, New York, 10029, NY, USA.
| |
Collapse
|
61
|
Kichaev G, Bhatia G, Loh PR, Gazal S, Burch K, Freund MK, Schoech A, Pasaniuc B, Price AL. Leveraging Polygenic Functional Enrichment to Improve GWAS Power. Am J Hum Genet 2019; 104:65-75. [PMID: 30595370 DOI: 10.1101/222265] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2018] [Accepted: 11/14/2018] [Indexed: 05/28/2023] Open
Abstract
Functional genomics data has the potential to increase GWAS power by identifying SNPs that have a higher prior probability of association. Here, we introduce a method that leverages polygenic functional enrichment to incorporate coding, conserved, regulatory, and LD-related genomic annotations into association analyses. We show via simulations with real genotypes that the method, functionally informed novel discovery of risk loci (FINDOR), correctly controls the false-positive rate at null loci and attains a 9%-38% increase in the number of independent associations detected at causal loci, depending on trait polygenicity and sample size. We applied FINDOR to 27 independent complex traits and diseases from the interim UK Biobank release (average N = 130K). Averaged across traits, we attained a 13% increase in genome-wide significant loci detected (including a 20% increase for disease traits) compared to unweighted raw p values that do not use functional data. We replicated the additional loci in independent UK Biobank and non-UK Biobank data, yielding a highly statistically significant replication slope (0.66-0.69) in each case. Finally, we applied FINDOR to the full UK Biobank release (average N = 416K), attaining smaller relative improvements (consistent with simulations) but larger absolute improvements, detecting an additional 583 GWAS loci. In conclusion, leveraging functional enrichment using our method robustly increases GWAS power.
Collapse
Affiliation(s)
- Gleb Kichaev
- Interdepartamental Program in Bioinformatics, University of California, Los Angeles, CA 90095, USA.
| | - Gaurav Bhatia
- Department of Epidemiology. Harvard T. H. Chan School of Public Health, Boston, MA 02115, USA
| | - Po-Ru Loh
- Department of Epidemiology. Harvard T. H. Chan School of Public Health, Boston, MA 02115, USA; Program in Medical and Population Genetics, Broad Institute, Cambridge, MA 02142, USA
| | - Steven Gazal
- Department of Epidemiology. Harvard T. H. Chan School of Public Health, Boston, MA 02115, USA; Program in Medical and Population Genetics, Broad Institute, Cambridge, MA 02142, USA
| | - Kathryn Burch
- Interdepartamental Program in Bioinformatics, University of California, Los Angeles, CA 90095, USA
| | - Malika K Freund
- Department of Human Genetics, University of California, Los Angeles, CA 90095, USA
| | - Armin Schoech
- Department of Epidemiology. Harvard T. H. Chan School of Public Health, Boston, MA 02115, USA; Program in Medical and Population Genetics, Broad Institute, Cambridge, MA 02142, USA
| | - Bogdan Pasaniuc
- Interdepartamental Program in Bioinformatics, University of California, Los Angeles, CA 90095, USA; Department of Human Genetics, University of California, Los Angeles, CA 90095, USA; Department Pathology and Laboratory Medicine, University of California, Los Angeles, CA 90095, USA
| | - Alkes L Price
- Department of Epidemiology. Harvard T. H. Chan School of Public Health, Boston, MA 02115, USA; Program in Medical and Population Genetics, Broad Institute, Cambridge, MA 02142, USA; Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA 02115, USA.
| |
Collapse
|
62
|
Kichaev G, Bhatia G, Loh PR, Gazal S, Burch K, Freund MK, Schoech A, Pasaniuc B, Price AL. Leveraging Polygenic Functional Enrichment to Improve GWAS Power. Am J Hum Genet 2019; 104:65-75. [PMID: 30595370 PMCID: PMC6323418 DOI: 10.1016/j.ajhg.2018.11.008] [Citation(s) in RCA: 564] [Impact Index Per Article: 112.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2018] [Accepted: 11/14/2018] [Indexed: 12/24/2022] Open
Abstract
Functional genomics data has the potential to increase GWAS power by identifying SNPs that have a higher prior probability of association. Here, we introduce a method that leverages polygenic functional enrichment to incorporate coding, conserved, regulatory, and LD-related genomic annotations into association analyses. We show via simulations with real genotypes that the method, functionally informed novel discovery of risk loci (FINDOR), correctly controls the false-positive rate at null loci and attains a 9%-38% increase in the number of independent associations detected at causal loci, depending on trait polygenicity and sample size. We applied FINDOR to 27 independent complex traits and diseases from the interim UK Biobank release (average N = 130K). Averaged across traits, we attained a 13% increase in genome-wide significant loci detected (including a 20% increase for disease traits) compared to unweighted raw p values that do not use functional data. We replicated the additional loci in independent UK Biobank and non-UK Biobank data, yielding a highly statistically significant replication slope (0.66-0.69) in each case. Finally, we applied FINDOR to the full UK Biobank release (average N = 416K), attaining smaller relative improvements (consistent with simulations) but larger absolute improvements, detecting an additional 583 GWAS loci. In conclusion, leveraging functional enrichment using our method robustly increases GWAS power.
Collapse
Affiliation(s)
- Gleb Kichaev
- Interdepartamental Program in Bioinformatics, University of California, Los Angeles, CA 90095, USA.
| | - Gaurav Bhatia
- Department of Epidemiology. Harvard T. H. Chan School of Public Health, Boston, MA 02115, USA
| | - Po-Ru Loh
- Department of Epidemiology. Harvard T. H. Chan School of Public Health, Boston, MA 02115, USA; Program in Medical and Population Genetics, Broad Institute, Cambridge, MA 02142, USA
| | - Steven Gazal
- Department of Epidemiology. Harvard T. H. Chan School of Public Health, Boston, MA 02115, USA; Program in Medical and Population Genetics, Broad Institute, Cambridge, MA 02142, USA
| | - Kathryn Burch
- Interdepartamental Program in Bioinformatics, University of California, Los Angeles, CA 90095, USA
| | - Malika K Freund
- Department of Human Genetics, University of California, Los Angeles, CA 90095, USA
| | - Armin Schoech
- Department of Epidemiology. Harvard T. H. Chan School of Public Health, Boston, MA 02115, USA; Program in Medical and Population Genetics, Broad Institute, Cambridge, MA 02142, USA
| | - Bogdan Pasaniuc
- Interdepartamental Program in Bioinformatics, University of California, Los Angeles, CA 90095, USA; Department of Human Genetics, University of California, Los Angeles, CA 90095, USA; Department Pathology and Laboratory Medicine, University of California, Los Angeles, CA 90095, USA
| | - Alkes L Price
- Department of Epidemiology. Harvard T. H. Chan School of Public Health, Boston, MA 02115, USA; Program in Medical and Population Genetics, Broad Institute, Cambridge, MA 02142, USA; Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA 02115, USA.
| |
Collapse
|
63
|
Amlie-Wolf A, Tang M, Mlynarski EE, Kuksa PP, Valladares O, Katanic Z, Tsuang D, Brown CD, Schellenberg GD, Wang LS. INFERNO: inferring the molecular mechanisms of noncoding genetic variants. Nucleic Acids Res 2018; 46:8740-8753. [PMID: 30113658 PMCID: PMC6158604 DOI: 10.1093/nar/gky686] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2018] [Revised: 07/11/2018] [Accepted: 07/18/2018] [Indexed: 01/02/2023] Open
Abstract
The majority of variants identified by genome-wide association studies (GWAS) reside in the noncoding genome, affecting regulatory elements including transcriptional enhancers. However, characterizing their effects requires the integration of GWAS results with context-specific regulatory activity and linkage disequilibrium annotations to identify causal variants underlying noncoding association signals and the regulatory elements, tissue contexts, and target genes they affect. We propose INFERNO, a novel method which integrates hundreds of functional genomics datasets spanning enhancer activity, transcription factor binding sites, and expression quantitative trait loci with GWAS summary statistics. INFERNO includes novel statistical methods to quantify empirical enrichments of tissue-specific enhancer overlap and to identify co-regulatory networks of dysregulated long noncoding RNAs (lncRNAs). We applied INFERNO to two large GWAS studies. For schizophrenia (36,989 cases, 113,075 controls), INFERNO identified putatively causal variants affecting brain enhancers for known schizophrenia-related genes. For inflammatory bowel disease (IBD) (12,882 cases, 21,770 controls), INFERNO found enrichments of immune and digestive enhancers and lncRNAs involved in regulation of the adaptive immune response. In summary, INFERNO comprehensively infers the molecular mechanisms of causal noncoding variants, providing a sensitive hypothesis generation method for post-GWAS analysis. The software is available as an open source pipeline and a web server.
Collapse
Affiliation(s)
- Alexandre Amlie-Wolf
- Genomics and Computational Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
- Penn Neurodegeneration Genomics Center, Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Mitchell Tang
- Penn Neurodegeneration Genomics Center, Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Elisabeth E Mlynarski
- Penn Neurodegeneration Genomics Center, Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Pavel P Kuksa
- Penn Neurodegeneration Genomics Center, Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Otto Valladares
- Penn Neurodegeneration Genomics Center, Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Zivadin Katanic
- Penn Neurodegeneration Genomics Center, Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Debby Tsuang
- VA Puget Sound Health Care System, Department of Psychiatry and Behavioral Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - Christopher D Brown
- Genomics and Computational Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
- Penn Neurodegeneration Genomics Center, Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
- Department of Genetics. Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Gerard D Schellenberg
- Genomics and Computational Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
- Penn Neurodegeneration Genomics Center, Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
- Department of Genetics. Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Li-San Wang
- Genomics and Computational Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
- Penn Neurodegeneration Genomics Center, Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
- Department of Genetics. Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| |
Collapse
|
64
|
Natarajan P, Peloso GM, Zekavat SM, Montasser M, Ganna A, Chaffin M, Khera AV, Zhou W, Bloom JM, Engreitz JM, Ernst J, O'Connell JR, Ruotsalainen SE, Alver M, Manichaikul A, Johnson WC, Perry JA, Poterba T, Seed C, Surakka IL, Esko T, Ripatti S, Salomaa V, Correa A, Vasan RS, Kellis M, Neale BM, Lander ES, Abecasis G, Mitchell B, Rich SS, Wilson JG, Cupples LA, Rotter JI, Willer CJ, Kathiresan S. Deep-coverage whole genome sequences and blood lipids among 16,324 individuals. Nat Commun 2018; 9:3391. [PMID: 30140000 PMCID: PMC6107638 DOI: 10.1038/s41467-018-05747-8] [Citation(s) in RCA: 119] [Impact Index Per Article: 19.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2018] [Accepted: 06/22/2018] [Indexed: 12/20/2022] Open
Abstract
Large-scale deep-coverage whole-genome sequencing (WGS) is now feasible and offers potential advantages for locus discovery. We perform WGS in 16,324 participants from four ancestries at mean depth >29X and analyze genotypes with four quantitative traits—plasma total cholesterol, low-density lipoprotein cholesterol (LDL-C), high-density lipoprotein cholesterol, and triglycerides. Common variant association yields known loci except for few variants previously poorly imputed. Rare coding variant association yields known Mendelian dyslipidemia genes but rare non-coding variant association detects no signals. A high 2M-SNP LDL-C polygenic score (top 5th percentile) confers similar effect size to a monogenic mutation (~30 mg/dl higher for each); however, among those with severe hypercholesterolemia, 23% have a high polygenic score and only 2% carry a monogenic mutation. At these sample sizes and for these phenotypes, the incremental value of WGS for discovery is limited but WGS permits simultaneous assessment of monogenic and polygenic models to severe hypercholesterolemia. Common genetic variants associated with plasma lipids have been extensively studied for a better understanding of common diseases. Here, the authors use whole-genome sequencing of 16,324 individuals to analyze rare variant associations and to determine their monogenic and polygenic contribution to lipid traits.
Collapse
Affiliation(s)
- Pradeep Natarajan
- Center for Genomic Medicine and Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, 02114, USA.,Department of Medicine, Harvard Medical School, Boston, MA, 02115, USA.,Broad Institute of Harvard & MIT, Cambridge, MA, 02142, USA
| | - Gina M Peloso
- Department of Biostatistics, Boston University School of Public Health, Boston, MA, 02118, USA
| | - Seyedeh Maryam Zekavat
- Broad Institute of Harvard & MIT, Cambridge, MA, 02142, USA.,Yale School of Medicine, New Haven, CT, 06510, USA.,Department of Computational Biology & Bioinformatics, Yale University, New Haven, CT, 06520, USA
| | - May Montasser
- School of Medicine, University of Maryland, Baltimore, MD, 21201, USA
| | - Andrea Ganna
- Broad Institute of Harvard & MIT, Cambridge, MA, 02142, USA.,Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
| | - Mark Chaffin
- Broad Institute of Harvard & MIT, Cambridge, MA, 02142, USA
| | - Amit V Khera
- Center for Genomic Medicine and Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, 02114, USA.,Department of Medicine, Harvard Medical School, Boston, MA, 02115, USA.,Broad Institute of Harvard & MIT, Cambridge, MA, 02142, USA
| | - Wei Zhou
- Department of Computational Medicine and Bioinformatics, School of Public Health, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Jonathan M Bloom
- Broad Institute of Harvard & MIT, Cambridge, MA, 02142, USA.,Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
| | - Jesse M Engreitz
- Broad Institute of Harvard & MIT, Cambridge, MA, 02142, USA.,Society of Fellows, Harvard University, Cambridge, MA, 02138, USA
| | - Jason Ernst
- Department of Biological Chemistry, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | | | | | - Maris Alver
- Estonian Genome Center, University of Tartu, Tartu, 51010, Estonia
| | - Ani Manichaikul
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, 22908, USA
| | - W Craig Johnson
- Department of Biostatistics, University of Washington, Seattle, WA, 98195, USA
| | - James A Perry
- School of Medicine, University of Maryland, Baltimore, MD, 21201, USA
| | - Timothy Poterba
- Broad Institute of Harvard & MIT, Cambridge, MA, 02142, USA.,Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
| | - Cotton Seed
- Broad Institute of Harvard & MIT, Cambridge, MA, 02142, USA.,Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
| | - Ida L Surakka
- Institute for Molecular Medicine Finland, Helsinki, 00290, Finland
| | - Tonu Esko
- Estonian Genome Center, University of Tartu, Tartu, 51010, Estonia
| | - Samuli Ripatti
- Institute for Molecular Medicine Finland, Helsinki, 00290, Finland
| | - Veikko Salomaa
- Institute for Molecular Medicine Finland, Helsinki, 00290, Finland
| | - Adolfo Correa
- Department of Medicine, University of Mississippi Medical Center, Jackson, MS, 39216, USA
| | - Ramachandran S Vasan
- Sections of Preventive Medicine and Epidemiology and Cardiology, Department of Medicine, Boston University School of Medicine, Boston, MA, 02118, USA.,Department of Epidemiology, Boston University School of Public Health, Boston, MA, 02118, USA.,Framingham Heart Study, Framingham, MA, 01702, USA
| | - Manolis Kellis
- Broad Institute of Harvard & MIT, Cambridge, MA, 02142, USA.,Computer Science and Artificial Intelligence Lab (CSAIL), Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
| | - Benjamin M Neale
- Center for Genomic Medicine and Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, 02114, USA.,Department of Medicine, Harvard Medical School, Boston, MA, 02115, USA.,Broad Institute of Harvard & MIT, Cambridge, MA, 02142, USA.,Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
| | - Eric S Lander
- Broad Institute of Harvard & MIT, Cambridge, MA, 02142, USA
| | - Goncalo Abecasis
- Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Braxton Mitchell
- School of Medicine, University of Maryland, Baltimore, MD, 21201, USA
| | - Stephen S Rich
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, 22908, USA
| | - James G Wilson
- Department of Medicine, University of Mississippi Medical Center, Jackson, MS, 39216, USA.,Department of Physiology and Biophysics, University of Mississippi Medical Center, Jackson, MS, 39216, USA
| | - L Adrienne Cupples
- Department of Biostatistics, Boston University School of Public Health, Boston, MA, 02118, USA.,Framingham Heart Study, Framingham, MA, 01702, USA
| | - Jerome I Rotter
- Institute for Translational Genomics and Population Sciences, LABioMed and Departments of Pediatrics and Medicine, Harbor-UCLA Medical Center, Torrance, CA, 90502, USA
| | - Cristen J Willer
- Departments of Human Genetics, Internal Medicine, and Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Sekar Kathiresan
- Center for Genomic Medicine and Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, 02114, USA. .,Department of Medicine, Harvard Medical School, Boston, MA, 02115, USA. .,Broad Institute of Harvard & MIT, Cambridge, MA, 02142, USA.
| | | |
Collapse
|
65
|
Gallagher MD, Chen-Plotkin AS. The Post-GWAS Era: From Association to Function. Am J Hum Genet 2018; 102:717-730. [PMID: 29727686 DOI: 10.1016/j.ajhg.2018.04.002] [Citation(s) in RCA: 479] [Impact Index Per Article: 79.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2017] [Accepted: 04/04/2018] [Indexed: 12/13/2022] Open
Abstract
During the past 12 years, genome-wide association studies (GWASs) have uncovered thousands of genetic variants that influence risk for complex human traits and diseases. Yet functional studies aimed at delineating the causal genetic variants and biological mechanisms underlying the observed statistical associations with disease risk have lagged. In this review, we highlight key advances in the field of functional genomics that may facilitate the derivation of biological meaning post-GWAS. We highlight the evidence suggesting that causal variants underlying disease risk often function through regulatory effects on the expression of target genes and that these expression effects might be modest and cell-type specific. We moreover discuss specific studies as proof-of-principle examples for current statistical, bioinformatic, and empirical bench-based approaches to downstream elucidation of GWAS-identified disease risk loci.
Collapse
|
66
|
Ming J, Dai M, Cai M, Wan X, Liu J, Yang C. LSMM: a statistical approach to integrating functional annotations with genome-wide association studies. Bioinformatics 2018; 34:2788-2796. [DOI: 10.1093/bioinformatics/bty187] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2017] [Accepted: 03/27/2018] [Indexed: 01/27/2023] Open
Affiliation(s)
- Jingsi Ming
- Department of Mathematics, Hong Kong Baptist University, Hong Kong
| | - Mingwei Dai
- School of Mathematics and Statistics, Xi’an Jiaotong University, Xi’an, China
- Department of Mathematics, The Hong Kong University of Science and Technology, Hong Kong
| | - Mingxuan Cai
- Department of Mathematics, Hong Kong Baptist University, Hong Kong
| | - Xiang Wan
- Shenzhen Research Institute of Big Data, Shenzhen, China
| | - Jin Liu
- Centre for Quantitative Medicine, Duke-NUS Medical School, Singapore
| | - Can Yang
- Department of Mathematics, The Hong Kong University of Science and Technology, Hong Kong
| |
Collapse
|
67
|
Hao X, Zeng P, Zhang S, Zhou X. Identifying and exploiting trait-relevant tissues with multiple functional annotations in genome-wide association studies. PLoS Genet 2018; 14:e1007186. [PMID: 29377896 PMCID: PMC5805369 DOI: 10.1371/journal.pgen.1007186] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2017] [Revised: 02/08/2018] [Accepted: 01/04/2018] [Indexed: 12/18/2022] Open
Abstract
Genome-wide association studies (GWASs) have identified many disease associated loci, the majority of which have unknown biological functions. Understanding the mechanism underlying trait associations requires identifying trait-relevant tissues and investigating associations in a trait-specific fashion. Here, we extend the widely used linear mixed model to incorporate multiple SNP functional annotations from omics studies with GWAS summary statistics to facilitate the identification of trait-relevant tissues, with which to further construct powerful association tests. Specifically, we rely on a generalized estimating equation based algorithm for parameter inference, a mixture modeling framework for trait-tissue relevance classification, and a weighted sequence kernel association test constructed based on the identified trait-relevant tissues for powerful association analysis. We refer to our analytic procedure as the Scalable Multiple Annotation integration for trait-Relevant Tissue identification and usage (SMART). With extensive simulations, we show how our method can make use of multiple complementary annotations to improve the accuracy for identifying trait-relevant tissues. In addition, our procedure allows us to make use of the inferred trait-relevant tissues, for the first time, to construct more powerful SNP set tests. We apply our method for an in-depth analysis of 43 traits from 28 GWASs using tissue-specific annotations in 105 tissues derived from ENCODE and Roadmap. Our results reveal new trait-tissue relevance, pinpoint important annotations that are informative of trait-tissue relationship, and illustrate how we can use the inferred trait-relevant tissues to construct more powerful association tests in the Wellcome trust case control consortium study.
Collapse
Affiliation(s)
- Xingjie Hao
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Huazhong Agricultural University, Wuhan, Hubei, China
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, United States of America
- Center for Statistical Genetics, University of Michigan, Ann Arbor, MI, United States of America
| | - Ping Zeng
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, United States of America
- Center for Statistical Genetics, University of Michigan, Ann Arbor, MI, United States of America
| | - Shujun Zhang
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Huazhong Agricultural University, Wuhan, Hubei, China
| | - Xiang Zhou
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, United States of America
- Center for Statistical Genetics, University of Michigan, Ann Arbor, MI, United States of America
| |
Collapse
|
68
|
Lu Q, Li B, Ou D, Erlendsdottir M, Powles RL, Jiang T, Hu Y, Chang D, Jin C, Dai W, He Q, Liu Z, Mukherjee S, Crane PK, Zhao H. A Powerful Approach to Estimating Annotation-Stratified Genetic Covariance via GWAS Summary Statistics. Am J Hum Genet 2017; 101:939-964. [PMID: 29220677 PMCID: PMC5812911 DOI: 10.1016/j.ajhg.2017.11.001] [Citation(s) in RCA: 111] [Impact Index Per Article: 15.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2017] [Accepted: 10/25/2017] [Indexed: 02/08/2023] Open
Abstract
Despite the success of large-scale genome-wide association studies (GWASs) on complex traits, our understanding of their genetic architecture is far from complete. Jointly modeling multiple traits' genetic profiles has provided insights into the shared genetic basis of many complex traits. However, large-scale inference sets a high bar for both statistical power and biological interpretability. Here we introduce a principled framework to estimate annotation-stratified genetic covariance between traits using GWAS summary statistics. Through theoretical and numerical analyses, we demonstrate that our method provides accurate covariance estimates, thereby enabling researchers to dissect both the shared and distinct genetic architecture across traits to better understand their etiologies. Among 50 complex traits with publicly accessible GWAS summary statistics (Ntotal≈ 4.5 million), we identified more than 170 pairs with statistically significant genetic covariance. In particular, we found strong genetic covariance between late-onset Alzheimer disease (LOAD) and amyotrophic lateral sclerosis (ALS), two major neurodegenerative diseases, in single-nucleotide polymorphisms (SNPs) with high minor allele frequencies and in SNPs located in the predicted functional genome. Joint analysis of LOAD, ALS, and other traits highlights LOAD's correlation with cognitive traits and hints at an autoimmune component for ALS.
Collapse
Affiliation(s)
- Qiongshi Lu
- Department of Biostatistics, Yale School of Public Health, New Haven, CT 06510, USA
| | - Boyang Li
- Department of Biostatistics, Yale School of Public Health, New Haven, CT 06510, USA
| | - Derek Ou
- Yale School of Medicine, New Haven, CT 06510, USA
| | | | - Ryan L Powles
- Program of Computational Biology and Bioinformatics, Yale University, New Haven, CT 06510, USA
| | | | - Yiming Hu
- Department of Biostatistics, Yale School of Public Health, New Haven, CT 06510, USA
| | - David Chang
- Program of Computational Biology and Bioinformatics, Yale University, New Haven, CT 06510, USA
| | | | - Wei Dai
- Department of Biostatistics, Yale School of Public Health, New Haven, CT 06510, USA
| | - Qidu He
- Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Zefeng Liu
- Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Shubhabrata Mukherjee
- Division of General Internal Medicine, Department of Medicine, University of Washington, Seattle, WA 98195, USA
| | - Paul K Crane
- Division of General Internal Medicine, Department of Medicine, University of Washington, Seattle, WA 98195, USA
| | - Hongyu Zhao
- Department of Biostatistics, Yale School of Public Health, New Haven, CT 06510, USA; Program of Computational Biology and Bioinformatics, Yale University, New Haven, CT 06510, USA; VA Cooperative Studies Program Coordinating Center, West Haven, CT 06516, USA.
| |
Collapse
|