601
|
Meisner A, Kundu P, Zhang YD, Lan LV, Kim S, Ghandwani D, Pal Choudhury P, Berndt SI, Freedman ND, Garcia-Closas M, Chatterjee N. Combined Utility of 25 Disease and Risk Factor Polygenic Risk Scores for Stratifying Risk of All-Cause Mortality. Am J Hum Genet 2020; 107:418-431. [PMID: 32758451 DOI: 10.1016/j.ajhg.2020.07.002] [Citation(s) in RCA: 54] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2020] [Accepted: 07/01/2020] [Indexed: 12/13/2022] Open
Abstract
While genome-wide association studies have identified susceptibility variants for numerous traits, their combined utility for predicting broad measures of health, such as mortality, remains poorly understood. We used data from the UK Biobank to combine polygenic risk scores (PRS) for 13 diseases and 12 mortality risk factors into sex-specific composite PRS (cPRS). These cPRS were moderately associated with all-cause mortality in independent data within the UK Biobank: the estimated hazard ratios per standard deviation were 1.10 (95% confidence interval: 1.05, 1.16) and 1.15 (1.10, 1.19) for women and men, respectively. Differences in life expectancy between the top and bottom 5% of the cPRS were estimated to be 4.79 (1.76, 7.81) years and 6.75 (4.16, 9.35) years for women and men, respectively. These associations were substantially attenuated after adjusting for non-genetic mortality risk factors measured at study entry (i.e., middle age for most participants). The cPRS may be useful in counseling younger individuals at higher genetic risk of mortality on modification of non-genetic factors.
Collapse
Affiliation(s)
- Allison Meisner
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21205, USA
| | - Prosenjit Kundu
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21205, USA
| | - Yan Dora Zhang
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21205, USA; Department of Statistics, University of Hong Kong, 999077, Hong Kong
| | - Lauren V Lan
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21205, USA
| | - Sungwon Kim
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21205, USA
| | - Disha Ghandwani
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21205, USA; Indian Statistical Institute, Kolkata, West Bengal 700108, India
| | - Parichoy Pal Choudhury
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD 20892, USA
| | - Sonja I Berndt
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD 20892, USA
| | - Neal D Freedman
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD 20892, USA
| | - Montserrat Garcia-Closas
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD 20892, USA
| | - Nilanjan Chatterjee
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21205, USA; Department of Oncology, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA.
| |
Collapse
|
602
|
Thomas M, Sakoda LC, Hoffmeister M, Rosenthal EA, Lee JK, van Duijnhoven FJB, Platz EA, Wu AH, Dampier CH, de la Chapelle A, Wolk A, Joshi AD, Burnett-Hartman A, Gsur A, Lindblom A, Castells A, Win AK, Namjou B, Van Guelpen B, Tangen CM, He Q, Li CI, Schafmayer C, Joshu CE, Ulrich CM, Bishop DT, Buchanan DD, Schaid D, Drew DA, Muller DC, Duggan D, Crosslin DR, Albanes D, Giovannucci EL, Larson E, Qu F, Mentch F, Giles GG, Hakonarson H, Hampel H, Stanaway IB, Figueiredo JC, Huyghe JR, Minnier J, Chang-Claude J, Hampe J, Harley JB, Visvanathan K, Curtis KR, Offit K, Li L, Le Marchand L, Vodickova L, Gunter MJ, Jenkins MA, Slattery ML, Lemire M, Woods MO, Song M, Murphy N, Lindor NM, Dikilitas O, Pharoah PDP, Campbell PT, Newcomb PA, Milne RL, MacInnis RJ, Castellví-Bel S, Ogino S, Berndt SI, Bézieau S, Thibodeau SN, Gallinger SJ, Zaidi SH, Harrison TA, Keku TO, Hudson TJ, Vymetalkova V, Moreno V, Martín V, Arndt V, Wei WQ, Chung W, Su YR, Hayes RB, White E, Vodicka P, Casey G, Gruber SB, Schoen RE, Chan AT, Potter JD, Brenner H, Jarvik GP, Corley DA, Peters U, Hsu L. Genome-wide Modeling of Polygenic Risk Score in Colorectal Cancer Risk. Am J Hum Genet 2020; 107:432-444. [PMID: 32758450 PMCID: PMC7477007 DOI: 10.1016/j.ajhg.2020.07.006] [Citation(s) in RCA: 132] [Impact Index Per Article: 26.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2019] [Accepted: 07/13/2020] [Indexed: 02/08/2023] Open
Abstract
Accurate colorectal cancer (CRC) risk prediction models are critical for identifying individuals at low and high risk of developing CRC, as they can then be offered targeted screening and interventions to address their risks of developing disease (if they are in a high-risk group) and avoid unnecessary screening and interventions (if they are in a low-risk group). As it is likely that thousands of genetic variants contribute to CRC risk, it is clinically important to investigate whether these genetic variants can be used jointly for CRC risk prediction. In this paper, we derived and compared different approaches to generating predictive polygenic risk scores (PRS) from genome-wide association studies (GWASs) including 55,105 CRC-affected case subjects and 65,079 control subjects of European ancestry. We built the PRS in three ways, using (1) 140 previously identified and validated CRC loci; (2) SNP selection based on linkage disequilibrium (LD) clumping followed by machine-learning approaches; and (3) LDpred, a Bayesian approach for genome-wide risk prediction. We tested the PRS in an independent cohort of 101,987 individuals with 1,699 CRC-affected case subjects. The discriminatory accuracy, calculated by the age- and sex-adjusted area under the receiver operating characteristics curve (AUC), was highest for the LDpred-derived PRS (AUC = 0.654) including nearly 1.2 M genetic variants (the proportion of causal genetic variants for CRC assumed to be 0.003), whereas the PRS of the 140 known variants identified from GWASs had the lowest AUC (AUC = 0.629). Based on the LDpred-derived PRS, we are able to identify 30% of individuals without a family history as having risk for CRC similar to those with a family history of CRC, whereas the PRS based on known GWAS variants identified only top 10% as having a similar relative risk. About 90% of these individuals have no family history and would have been considered average risk under current screening guidelines, but might benefit from earlier screening. The developed PRS offers a way for risk-stratified CRC screening and other targeted interventions.
Collapse
Affiliation(s)
- Minta Thomas
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Lori C Sakoda
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA; Division of Research, Kaiser Permanente Northern California, Oakland, CA 94612, USA
| | - Michael Hoffmeister
- Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), Heidelberg 69120, Germany
| | - Elisabeth A Rosenthal
- Department of Medicine (Medical Genetics), University of Washington Medical Center, Seattle, WA 98195, USA
| | - Jeffrey K Lee
- Division of Research, Kaiser Permanente Northern California, Oakland, CA 94612, USA
| | - Franzel J B van Duijnhoven
- Division of Human Nutrition and Health, Wageningen University & Research, Wageningen 176700, the Netherlands
| | - Elizabeth A Platz
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, and the Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins, Baltimore, MD 21287, USA
| | - Anna H Wu
- University of Southern California, Preventative Medicine, Los Angeles, CA 90089, USA
| | - Christopher H Dampier
- Department of Surgery, University of Virginia Health System, Charlottesville, VA 22903, USA
| | - Albert de la Chapelle
- Department of Cancer Biology and Genetics and the Comprehensive Cancer Center, The Ohio State University, Columbus, OH 43210, USA
| | - Alicja Wolk
- Institute of Environmental Medicine, Karolinska Institutet, Stockholm 17177, Sweden
| | - Amit D Joshi
- Clinical and Translational Epidemiology Unit, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA; Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA
| | | | - Andrea Gsur
- Institute of Cancer Research, Department of Medicine I, Medical University Vienna, Vienna 1090, Austria
| | - Annika Lindblom
- Department of Clinical Genetics, Karolinska University Hospital, Stockholm 17177, Sweden; Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm 17177, Sweden
| | - Antoni Castells
- Gastroenterology Department, Hospital Clínic, Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBEREHD), University of Barcelona, Barcelona 08007, Spain
| | - Aung Ko Win
- Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Melbourne, VIC 3000, Australia
| | - Bahram Namjou
- Center for Autoimmune Genomics and Etiology (CAGE), Cincinnati Children's Hospital Medical Center, Cincinnati, OH 45229, USA; University of Cincinnati College of Medicine, Cincinnati, OH 45229, USA; Cincinnati VA Medical Center, Cincinnati, OH 45229, USA
| | - Bethany Van Guelpen
- Department of Radiation Sciences, Oncology Unit, Umeå University, Umeå 90187, Sweden; Wallenberg Centre for Molecular Medicine, Umeå University, Umeå 90187, Sweden
| | - Catherine M Tangen
- SWOG Statistical Center, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Qianchuan He
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Christopher I Li
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Clemens Schafmayer
- Department of General Surgery, University Hospital Rostock, Rostock 18051, Germany
| | - Corinne E Joshu
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, and the Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins, Baltimore, MD 21287, USA
| | - Cornelia M Ulrich
- Huntsman Cancer Institute and Department of Population Health Sciences, University of Utah, Salt Lake City, UT 84112, USA
| | - D Timothy Bishop
- Leeds Institute of Cancer and Pathology, University of Leeds, Leeds LS2 9JT, UK
| | - Daniel D Buchanan
- University of Melbourne Centre for Cancer Research, Victorian Comprehensive Cancer Centre, Parkville, VIC 3010, Australia; Colorectal Oncogenomics Group, Department of Clinical Pathology, The University of Melbourne, Parkville, VIC 3010, Australia; Genomic Medicine and Family Cancer Clinic, Royal Melbourne Hospital, Parkville, VIC 3010, Australia
| | - Daniel Schaid
- Department of Health Sciences Research, Mayo Clinic, Rochester, MN 55905, USA
| | - David A Drew
- Clinical and Translational Epidemiology Unit, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA
| | - David C Muller
- School of Public Health, Imperial College London, London SW7 2AZ, UK
| | - David Duggan
- Translational Genomics Research Institute - An Affiliate of City of Hope, Phoenix, AZ 85003, USA
| | - David R Crosslin
- Department of Bioinformatics and Medical Education, University of Washington Medical Center, Seattle, WA 98195, USA
| | - Demetrius Albanes
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Edward L Giovannucci
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA; Channing Division of Network Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA; Department of Nutrition, Harvard T.H. Chan School of Public Health, Harvard University, Boston, MA 02108, USA
| | - Eric Larson
- Kaiser Permanente Washington Research Institute, Seattle, WA 98101, USA
| | - Flora Qu
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Frank Mentch
- Center for Applied Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Graham G Giles
- Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Melbourne, VIC 3000, Australia; Cancer Epidemiology Division, Cancer Council Victoria, 615 St Kilda Road, Melbourne, VIC 3004, Australia; Precision Medicine, School of Clinical Sciences at Monash Health, Monash University, Clayton, VIC 3168, Australia
| | - Hakon Hakonarson
- Center for Applied Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Heather Hampel
- Division of Human Genetics, Department of Internal Medicine, The Ohio State University Comprehensive Cancer Center, Columbus, OH 43210, USA
| | - Ian B Stanaway
- Department of Medicine (Medical Genetics), University of Washington Medical Center, Seattle, WA 98195, USA
| | - Jane C Figueiredo
- Department of Medicine, Samuel Oschin Comprehensive Cancer Institute, Cedars-Sinai Medical Center, Los Angeles, CA 90048, USA; Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA
| | - Jeroen R Huyghe
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Jessica Minnier
- School of Public Health, Oregon Health & Science University, Portland, OR 97239, USA
| | - Jenny Chang-Claude
- Division of Cancer Epidemiology, German Cancer Research Center (DKFZ), Heidelberg, 69120 Germany; University Medical Centre Hamburg-Eppendorf, University Cancer Centre Hamburg (UCCH), Hamburg 20246, Germany
| | - Jochen Hampe
- Department of Medicine I, University Hospital Dresden, Technische Universität Dresden (TU Dresden), Dresden 01062, Germany
| | - John B Harley
- Center for Autoimmune Genomics and Etiology (CAGE), Cincinnati Children's Hospital Medical Center, Cincinnati, OH 45229, USA; University of Cincinnati College of Medicine, Cincinnati, OH 45229, USA; Cincinnati VA Medical Center, Cincinnati, OH 45229, USA
| | - Kala Visvanathan
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, and the Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins, Baltimore, MD 21287, USA
| | - Keith R Curtis
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Kenneth Offit
- Clinical Genetics Service, Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, NY 10021, USA; Department of Medicine, Weill Cornell Medical College, NY 10065, USA
| | - Li Li
- Department of Family Medicine, University of Virginia, Charlottesville, VA 22903, USA
| | | | - Ludmila Vodickova
- Department of Molecular Biology of Cancer, Institute of Experimental Medicine of the Czech Academy of Sciences, 142 20 Prague 4, Czech Republic; Institute of Biology and Medical Genetics, First Faculty of Medicine, Charles University, 128 00 Prague, Czech Republic; Faculty of Medicine and Biomedical Center in Pilsen, Charles University, 323 00 Pilsen, Czech Republic
| | - Marc J Gunter
- Nutrition and Metabolism Section, International Agency for Research on Cancer, World Health Organization, Lyon 69372, France
| | - Mark A Jenkins
- Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Melbourne, VIC 3000, Australia
| | - Martha L Slattery
- Department of Internal Medicine, University of Utah, Salt Lake City, UT 84132, USA
| | - Mathieu Lemire
- PanCuRx Translational Research Initiative, Ontario, Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| | - Michael O Woods
- Memorial University of Newfoundland, Discipline of Genetics, St. John's, NL A1B 3R7, Canada
| | - Mingyang Song
- Clinical and Translational Epidemiology Unit, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA; Division of Gastroenterology, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA; Broad Institute of Harvard and MIT, Cambridge, MA 02141, USA; Department of Nutrition, Harvard T.H. Chan School of Public Health, Harvard University, Boston, MA 02115, USA
| | - Neil Murphy
- Nutrition and Metabolism Section, International Agency for Research on Cancer, World Health Organization, Lyon 69372, France
| | - Noralane M Lindor
- Department of Health Science Research, Mayo Clinic, Scottsdale, AZ 85260, USA
| | - Ozan Dikilitas
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN 55905, USA
| | - Paul D P Pharoah
- Department of Public Health and Primary Care, University of Cambridge, Cambridge CB2 0SR, UK
| | - Peter T Campbell
- Behavioral and Epidemiology Research Group, American Cancer Society, Atlanta, GA 30303, USA
| | - Polly A Newcomb
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA; School of Public Health, University of Washington, Seattle, WA 98195, USA
| | - Roger L Milne
- Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Melbourne, VIC 3000, Australia; Cancer Epidemiology Division, Cancer Council Victoria, 615 St Kilda Road, Melbourne, VIC 3004, Australia; Precision Medicine, School of Clinical Sciences at Monash Health, Monash University, Clayton, VIC 3168, Australia
| | - Robert J MacInnis
- Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Melbourne, VIC 3000, Australia; Cancer Epidemiology Division, Cancer Council Victoria, 615 St Kilda Road, Melbourne, VIC 3004, Australia
| | - Sergi Castellví-Bel
- Gastroenterology Department, Hospital Clínic, Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBEREHD), University of Barcelona, Barcelona 08007, Spain
| | - Shuji Ogino
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA; Broad Institute of Harvard and MIT, Cambridge, MA 02141, USA; Program in MPE Molecular Pathological Epidemiology, Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115, USA; Department of Oncologic Pathology, Dana-Farber Cancer Institute, Boston, MA 02215, USA
| | - Sonja I Berndt
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Stéphane Bézieau
- Service de Génétique Médicale, Centre Hospitalier Universitaire (CHU) Nantes, Nantes 44093, France
| | - Stephen N Thibodeau
- Division of Laboratory Genetics, Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN 85054, USA
| | - Steven J Gallinger
- Lunenfeld Tanenbaum Research Institute, Mount Sinai Hospital, University of Toronto, Toronto, ON M5G1X5, Canada
| | - Syed H Zaidi
- Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| | - Tabitha A Harrison
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Temitope O Keku
- Center for Gastrointestinal Biology and Disease, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Thomas J Hudson
- Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| | - Veronika Vymetalkova
- Department of Molecular Biology of Cancer, Institute of Experimental Medicine of the Czech Academy of Sciences, 142 20 Prague 4, Czech Republic; Institute of Biology and Medical Genetics, First Faculty of Medicine, Charles University, 128 00 Prague, Czech Republic; Faculty of Medicine and Biomedical Center in Pilsen, Charles University, 323 00 Pilsen, Czech Republic
| | - Victor Moreno
- Oncology Data Analytics Program, Catalan Institute of Oncology, L'Hospitalet de Llobregat, Barcelona 08908, Spain; CIBER Epidemiología y Salud Pública (CIBERESP), Madrid 28029, Spain; Department of Clinical Sciences, Faculty of Medicine, University of Barcelona, Barcelona 08907, Spain; ONCOBEL Program, Bellvitge Biomedical Research Institute (IDIBELL), L'Hospitalet de Llobregat, Barcelona 08908, Spain
| | - Vicente Martín
- CIBER Epidemiología y Salud Pública (CIBERESP), Madrid 28029, Spain; Biomedicine Institute (IBIOMED), University of León, León 24071, Spain
| | - Volker Arndt
- Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), Heidelberg 69120, Germany
| | - Wei-Qi Wei
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37232, USA
| | - Wendy Chung
- Office of Research & Development, Department of Veterans Affairs, Washington, DC 20420, USA; Departments of Pediatrics and Medicine, Columbia University Medical Center, New York, NY 10032, USA
| | - Yu-Ru Su
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Richard B Hayes
- Division of Epidemiology, Department of Population Health, New York University School of Medicine, New York, NY 10016, USA
| | - Emily White
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA; Department of Epidemiology, University of Washington, Seattle, WA 98195, USA
| | - Pavel Vodicka
- Department of Molecular Biology of Cancer, Institute of Experimental Medicine of the Czech Academy of Sciences, 142 20 Prague 4, Czech Republic; Institute of Biology and Medical Genetics, First Faculty of Medicine, Charles University, 128 00 Prague, Czech Republic; Faculty of Medicine and Biomedical Center in Pilsen, Charles University, 323 00 Pilsen, Czech Republic
| | - Graham Casey
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA 22903, USA
| | - Stephen B Gruber
- Department of Preventive Medicine, USC Norris Comprehensive Cancer Center, Keck School of Medicine, University of Southern California, Los Angeles, CA 90089, USA
| | - Robert E Schoen
- Department of Medicine and Epidemiology, University of Pittsburgh Medical Center, Pittsburgh, PA 15219, USA
| | - Andrew T Chan
- Clinical and Translational Epidemiology Unit, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA; Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA; Channing Division of Network Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA; Division of Gastroenterology, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA; Broad Institute of Harvard and MIT, Cambridge, MA 02141, USA; Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Harvard University, Boston, MA 02115, USA
| | - John D Potter
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA; Centre for Public Health Research, Massey University, Wellington 6140, New Zealand
| | - Hermann Brenner
- Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), Heidelberg 69120, Germany; Division of Preventive Oncology, German Cancer Research Center (DKFZ) and National Center for Tumor Diseases (NCT), Heidelberg 69120, Germany; German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Heidelberg 69120, Germany
| | - Gail P Jarvik
- Department of Medicine (Medical Genetics), University of Washington Medical Center, Seattle, WA 98195, USA; Genome Sciences, University of Washington Medical Center, Seattle, WA 98195, USA
| | - Douglas A Corley
- Division of Research, Kaiser Permanente Northern California, Oakland, CA 94612, USA
| | - Ulrike Peters
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA; Department of Epidemiology, University of Washington, Seattle, WA 98195, USA.
| | - Li Hsu
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA; Department of Biostatistics, University of Washington, Seattle, WA 98195, USA.
| |
Collapse
|
603
|
Deutsch AR, Selya AS. Stability in effects of different smoking-related polygenic risk scores over age and smoking phenotypes. Drug Alcohol Depend 2020; 214:108154. [PMID: 32645681 PMCID: PMC7423706 DOI: 10.1016/j.drugalcdep.2020.108154] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/27/2020] [Revised: 06/17/2020] [Accepted: 06/24/2020] [Indexed: 10/23/2022]
Abstract
PURPOSE Polygenic risk scores (PRSs) for smoking behavior largely fail to consider the demonstrated developmental change in genetic influence over age and stage of smoking behaviors. Additionally, few studies have examined how stage-specific smoking PRSs (e.g. for initiation vs. smoking heaviness) generalize to other stages of risk. The current study examines the stability of PRS effects over age, and how specifically calibrated PRSs associate with other smoking phenotypes. METHODS 7228 participants were from the National Longitudinal Study of Adolescent to Adult Health, who had calculated PRSs for two smoking phenotypes, Centers for Disease Control and Prevention (CDC) smoking initiation status, and cigarettes per day (CPD). Four time-varying effects models estimated associations between both PRSs and four smoking phenotypes (CDC status, cigarettes/day on smoking days, any past-30 day smoking, and past-30 day daily smoking) over adolescence and young adulthood. FINDINGS The time-varying effects models demonstrated that both PRSs significantly associated with all four phenotypes age. PRS effects were similar, in both odds ratios and the overlap of 95 % confidence intervals. There were increases in PRS associations with quantity of smoking over age, and a decrease in PRS effects over age for the CDC smoking status phenotype over early to late adolescence. CONCLUSIONS Smoking PRSs can be robust predictors of smoking behavior over age. However, the lack of differentiation between specific PRSs and multiple smoking phenotypes, as well as the added contribution of both PRSs to explaining genetic variance, indicates a need to reconceptualize phenotypic measurement used to calibrate smoking PRSs.
Collapse
Affiliation(s)
- Arielle R. Deutsch
- Sanford Research, Behavioral Sciences,University of South Dakota School of Medicine, Pediatrics
| | - Arielle S. Selya
- Sanford Research, Behavioral Sciences,University of South Dakota School of Medicine, Pediatrics
| |
Collapse
|
604
|
Domingue BW, Fletcher J. Separating Measured Genetic and Environmental Effects: Evidence Linking Parental Genotype and Adopted Child Outcomes. Behav Genet 2020; 50:301-309. [PMID: 32350631 PMCID: PMC7442617 DOI: 10.1007/s10519-020-10000-4] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2019] [Accepted: 04/24/2020] [Indexed: 12/14/2022]
Abstract
There has been widespread adoption of genome wide summary scores (polygenic scores) as tools for studying the importance of genetics and associated life course mechanisms across a range of demographic and socioeconomic outcomes. However, an often unacknowledged issue with these studies is that parental genetics impact both child environments and child genetics, leaving the effects of polygenic scores difficult to interpret. This paper uses multi-generational data containing polygenic scores for parents (n = 7193) and educational outcomes for adopted (n = 855) and biological (n = 20,939) children, many raised in the same families, which allows us to separate the influence of parental polygenic scores on children outcomes between environmental (adopted children) and environmental and genetic (biological children) effects. Our results complement recent work on "genetic nurture" by showing associations of parental polygenic scores with adopted children's schooling, providing additional evidence that polygenic scores combine genetic and environmental influences and that research designs are needed to separate these estimated impacts.
Collapse
Affiliation(s)
| | - Jason Fletcher
- La Follette School of Public Affairs, Department of Sociology, and Center for Demography of Health and Aging, University of Wisconsin-Madison, Madison, USA
| |
Collapse
|
605
|
Garman L, Montgomery CG, Rivera NV. Recent advances in sarcoidosis genomics: epigenetics, gene expression, and gene by environment (G × E) interaction studies. Curr Opin Pulm Med 2020; 26:544-553. [PMID: 32701681 PMCID: PMC7735660 DOI: 10.1097/mcp.0000000000000719] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
PURPOSE OF REVIEW We aim to review the most recent findings in genomics of sarcoidosis and highlight the gaps in the field. RECENT FINDINGS Original explorations of sarcoidosis subphenotypes, including cases associated with the World Trade Center and ocular sarcoidosis, have identified novel risk loci. Innovative gene--environment interaction studies utilizing modern analytical techniques have discovered risk loci associated with smoking and insecticide exposure. The application of whole-exome sequencing has identified genetic variants associated with persistent sarcoidosis and rare functional variations. A single epigenomics study has provided background knowledge of DNA methylation mechanisms in comparison with gene expression data. The application of machine-learning techniques has suggested new drug repositioning for the treatment of sarcoidosis. Several gene expression studies have identified prominent inflammatory pathways enriched in the affected tissue. SUMMARY Certainly, sarcoidosis research has recently advanced in the exploration of disease subphenotypes, utilizing novel analytical techniques, and including measures of clinical variation. Nevertheless, large-scale and diverse cohorts investigated with advanced sequencing methods, such as whole-genome and single-cell RNA sequencing, epigenomics, and meta-analysis coupled with cutting-edge analytic approaches, when employed, will broaden and translate genomics findings into clinical applications, and ultimately open venues for personalized medicine.
Collapse
Affiliation(s)
- Lori Garman
- Department of Genes and Human Disease, Oklahoma Medical Research Foundation, Oklahoma City, OK
| | - Courtney G. Montgomery
- Department of Genes and Human Disease, Oklahoma Medical Research Foundation, Oklahoma City, OK
| | - Natalia V. Rivera
- Department of Medicine Solna, Karolinska Institutet, Karolinska University Hospital, Stockholm, Sweden
- Rheumatology Division, Department of Medicine Solna, Karolinska Institutet, Karolinska University Hospital, Stockholm, Sweden
- Center of Molecular Medicine (CMM), Karolinska Institutet, Stockholm, Sweden
| |
Collapse
|
606
|
Domingue BW, Fletcher J. Separating Measured Genetic and Environmental Effects: Evidence Linking Parental Genotype and Adopted Child Outcomes. Behav Genet 2020; 50:301-309. [PMID: 32350631 DOI: 10.1101/698464] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2019] [Accepted: 04/24/2020] [Indexed: 05/22/2023]
Abstract
There has been widespread adoption of genome wide summary scores (polygenic scores) as tools for studying the importance of genetics and associated life course mechanisms across a range of demographic and socioeconomic outcomes. However, an often unacknowledged issue with these studies is that parental genetics impact both child environments and child genetics, leaving the effects of polygenic scores difficult to interpret. This paper uses multi-generational data containing polygenic scores for parents (n = 7193) and educational outcomes for adopted (n = 855) and biological (n = 20,939) children, many raised in the same families, which allows us to separate the influence of parental polygenic scores on children outcomes between environmental (adopted children) and environmental and genetic (biological children) effects. Our results complement recent work on "genetic nurture" by showing associations of parental polygenic scores with adopted children's schooling, providing additional evidence that polygenic scores combine genetic and environmental influences and that research designs are needed to separate these estimated impacts.
Collapse
Affiliation(s)
| | - Jason Fletcher
- La Follette School of Public Affairs, Department of Sociology, and Center for Demography of Health and Aging, University of Wisconsin-Madison, Madison, USA
| |
Collapse
|
607
|
Choi SW, Mak TSH, O'Reilly PF. Tutorial: a guide to performing polygenic risk score analyses. Nat Protoc 2020; 15:2759-2772. [PMID: 32709988 PMCID: PMC7612115 DOI: 10.1038/s41596-020-0353-1] [Citation(s) in RCA: 996] [Impact Index Per Article: 199.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2018] [Accepted: 05/05/2020] [Indexed: 02/08/2023]
Abstract
A polygenic score (PGS) or polygenic risk score (PRS) is an estimate of an individual's genetic liability to a trait or disease, calculated according to their genotype profile and relevant genome-wide association study (GWAS) data. While present PRSs typically explain only a small fraction of trait variance, their correlation with the single largest contributor to phenotypic variation-genetic liability-has led to the routine application of PRSs across biomedical research. Among a range of applications, PRSs are exploited to assess shared etiology between phenotypes, to evaluate the clinical utility of genetic data for complex disease and as part of experimental studies in which, for example, experiments are performed that compare outcomes (e.g., gene expression and cellular response to treatment) between individuals with low and high PRS values. As GWAS sample sizes increase and PRSs become more powerful, PRSs are set to play a key role in research and stratified medicine. However, despite the importance and growing application of PRSs, there are limited guidelines for performing PRS analyses, which can lead to inconsistency between studies and misinterpretation of results. Here, we provide detailed guidelines for performing and interpreting PRS analyses. We outline standard quality control steps, discuss different methods for the calculation of PRSs, provide an introductory online tutorial, highlight common misconceptions relating to PRS results, offer recommendations for best practice and discuss future challenges.
Collapse
Affiliation(s)
- Shing Wan Choi
- MRC Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK
- Department of Genetics and Genomic Sciences, Icahn School of Medicine, Mount Sinai, New York, NY, USA
| | | | - Paul F O'Reilly
- MRC Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK.
- Department of Genetics and Genomic Sciences, Icahn School of Medicine, Mount Sinai, New York, NY, USA.
| |
Collapse
|
608
|
Establishment and Analysis of a Combined Diagnostic Model of Polycystic Ovary Syndrome with Random Forest and Artificial Neural Network. BIOMED RESEARCH INTERNATIONAL 2020; 2020:2613091. [PMID: 32884937 PMCID: PMC7455828 DOI: 10.1155/2020/2613091] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Revised: 07/27/2020] [Accepted: 08/03/2020] [Indexed: 12/14/2022]
Abstract
Polycystic ovary syndrome (PCOS) is one of the most common metabolic and reproductive endocrinopathies. However, few studies have tried to develop a diagnostic model based on gene biomarkers. In this study, we applied a computational method by combining two machine learning algorithms, including random forest (RF) and artificial neural network (ANN), to identify gene biomarkers and construct diagnostic model. We collected gene expression data from Gene Expression Omnibus (GEO) database containing 76 PCOS samples and 57 normal samples; five datasets were utilized, including one dataset for screening differentially expressed genes (DEGs), two training datasets, and two validation datasets. Firstly, based on RF, 12 key genes in 264 DEGs were identified to be vital for classification of PCOS and normal samples. Moreover, the weights of these key genes were calculated using ANN with microarray and RNA-seq training dataset, respectively. Furthermore, the diagnostic models for two types of datasets were developed and named neuralPCOS. Finally, two validation datasets were used to test and compare the performance of neuralPCOS with other two set of marker genes by area under curve (AUC). Our model achieved an AUC of 0.7273 in microarray dataset, and 0.6488 in RNA-seq dataset. To conclude, we uncovered gene biomarkers and developed a novel diagnostic model of PCOS, which would be helpful for diagnosis.
Collapse
|
609
|
Machine learning for effectively avoiding overfitting is a crucial strategy for the genetic prediction of polygenic psychiatric phenotypes. Transl Psychiatry 2020; 10:294. [PMID: 32826857 PMCID: PMC7442807 DOI: 10.1038/s41398-020-00957-5] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/11/2019] [Revised: 06/24/2020] [Accepted: 07/22/2020] [Indexed: 11/08/2022] Open
Abstract
The accuracy of previous genetic studies in predicting polygenic psychiatric phenotypes has been limited mainly due to the limited power in distinguishing truly susceptible variants from null variants and the resulting overfitting. A novel prediction algorithm, Smooth-Threshold Multivariate Genetic Prediction (STMGP), was applied to improve the genome-based prediction of psychiatric phenotypes by decreasing overfitting through selecting variants and building a penalized regression model. Prediction models were trained using a cohort of 3685 subjects in Miyagi prefecture and validated with an independently recruited cohort of 3048 subjects in Iwate prefecture in Japan. Genotyping was performed using HumanOmniExpressExome BeadChip Arrays. We used the target phenotype of depressive symptoms and simulated phenotypes with varying complexity and various effect-size distributions of risk alleles. The prediction accuracy and the degree of overfitting of STMGP were compared with those of state-of-the-art models (polygenic risk scores, genomic best linear-unbiased prediction, summary-data-based best linear-unbiased prediction, BayesR, and ridge regression). In the prediction of depressive symptoms, compared with the other models, STMGP showed the highest prediction accuracy with the lowest degree of overfitting, although there was no significant difference in prediction accuracy. Simulation studies suggested that STMGP has a better prediction accuracy for moderately polygenic phenotypes. Our investigations suggest the potential usefulness of STMGP for predicting polygenic psychiatric conditions while avoiding overfitting.
Collapse
|
610
|
Wang M, Menon R, Mishra S, Patel AP, Chaffin M, Tanneeru D, Deshmukh M, Mathew O, Apte S, Devanboo CS, Sundaram S, Lakshmipathy P, Murugan S, Sharma KK, Rajendran K, Santhosh S, Thachathodiyl R, Ahamed H, Balegadde AV, Alexander T, Swaminathan K, Gupta R, Mullasari AS, Sigamani A, Kanchi M, Peterson AS, Butterworth AS, Danesh J, Di Angelantonio E, Naheed A, Inouye M, Chowdhury R, Vedam RL, Kathiresan S, Gupta R, Khera AV. Validation of a Genome-Wide Polygenic Score for Coronary Artery Disease in South Asians. J Am Coll Cardiol 2020; 76:703-714. [PMID: 32762905 PMCID: PMC7592606 DOI: 10.1016/j.jacc.2020.06.024] [Citation(s) in RCA: 71] [Impact Index Per Article: 14.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/17/2020] [Revised: 06/05/2020] [Accepted: 06/08/2020] [Indexed: 12/26/2022]
Abstract
BACKGROUND Genome-wide polygenic scores (GPS) integrate information from many common DNA variants into a single number. Because rates of coronary artery disease (CAD) are substantially higher among South Asians, a GPS to identify high-risk individuals may be particularly useful in this population. OBJECTIVES This analysis used summary statistics from a prior genome-wide association study to derive a new GPSCAD for South Asians. METHODS This GPSCAD was validated in 7,244 South Asian UK Biobank participants and tested in 491 individuals from a case-control study in Bangladesh. Next, a static ancestry and GPSCAD reference distribution was built using whole-genome sequencing from 1,522 Indian individuals, and a framework was tested for projecting individuals onto this static ancestry and GPSCAD reference distribution using 1,800 CAD cases and 1,163 control subjects newly recruited in India. RESULTS The GPSCAD, containing 6,630,150 common DNA variants, had an odds ratio (OR) per SD of 1.58 in South Asian UK Biobank participants and 1.60 in the Bangladeshi study (p < 0.001 for each). Next, individuals of the Indian case-control study were projected onto static reference distributions, observing an OR/SD of 1.66 (p < 0.001). Compared with the middle quintile, risk for CAD was most pronounced for those in the top 5% of the GPSCAD distribution-ORs of 4.16, 2.46, and 3.22 in the South Asian UK Biobank, Bangladeshi, and Indian studies, respectively (p < 0.05 for each). CONCLUSIONS The new GPSCAD has been developed and tested using 3 distinct South Asian studies, and provides a generalizable framework for ancestry-specific GPS assessment.
Collapse
Affiliation(s)
- Minxian Wang
- Cardiovascular Disease Initiative, Broad Institute of MIT and Harvard, Cambridge, Massachusetts
| | | | | | - Aniruddh P Patel
- Cardiovascular Disease Initiative, Broad Institute of MIT and Harvard, Cambridge, Massachusetts; Center for Genomic Medicine, Massachusetts General Hospital, Boston, Massachusetts; Cardiology Division, Department of Medicine, Massachusetts General Hospital, Boston, Massachusetts
| | - Mark Chaffin
- Cardiovascular Disease Initiative, Broad Institute of MIT and Harvard, Cambridge, Massachusetts
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | - Thomas Alexander
- Kovai Medical Center and Hospital Research Foundation, Coimbatore, India
| | | | | | | | | | | | | | - Adam S Butterworth
- British Heart Foundation Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care, University of Cambridge, Cambridge, United Kingdom; National Institute for Health Research Blood and Transplant Research Unit in Donor Health and Genomics, University of Cambridge, Cambridge, United Kingdom
| | - John Danesh
- British Heart Foundation Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care, University of Cambridge, Cambridge, United Kingdom; National Institute for Health Research Blood and Transplant Research Unit in Donor Health and Genomics, University of Cambridge, Cambridge, United Kingdom; British Heart Foundation Centre of Research Excellence, University of Cambridge, Cambridge, United Kingdom; National Institute for Health Research Cambridge Biomedical Research Centre, University of Cambridge and Cambridge University Hospitals, Cambridge, United Kingdom; Health Data Research UK Cambridge, Wellcome Genome Campus and University of Cambridge, Cambridge, United Kingdom; Department of Human Genetics, Wellcome Sanger Institute, Hinxton, United Kingdom
| | - Emanuele Di Angelantonio
- British Heart Foundation Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care, University of Cambridge, Cambridge, United Kingdom; National Institute for Health Research Blood and Transplant Research Unit in Donor Health and Genomics, University of Cambridge, Cambridge, United Kingdom
| | - Aliya Naheed
- International Centre for Diarrhoeal Disease Research, Dhaka, Bangladesh
| | - Michael Inouye
- Cambridge Baker Systems Genomics Initiative, Melbourne, Victoria, Australia, and Cambridge, United Kingdom; Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia; Department of Public Health and Primary Care, University of Cambridge, Cambridge, United Kingdom; Department of Clinical Pathology and School of BioSciences, University of Melbourne, Parkville, Victoria, Australia; The Alan Turing Institute, London, United Kingdom
| | - Rajiv Chowdhury
- British Heart Foundation Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care, University of Cambridge, Cambridge, United Kingdom; Centre for Non-Communicable Disease Research, Dhaka, Bangladesh
| | | | - Sekar Kathiresan
- Cardiology Division, Department of Medicine, Massachusetts General Hospital, Boston, Massachusetts; Verve Therapeutics, Cambridge, Massachusetts
| | | | - Amit V Khera
- Cardiovascular Disease Initiative, Broad Institute of MIT and Harvard, Cambridge, Massachusetts; Center for Genomic Medicine, Massachusetts General Hospital, Boston, Massachusetts; Cardiology Division, Department of Medicine, Massachusetts General Hospital, Boston, Massachusetts.
| |
Collapse
|
611
|
Baker E, Escott-Price V. Polygenic Risk Scores in Alzheimer's Disease: Current Applications and Future Directions. Front Digit Health 2020; 2:14. [PMID: 34713027 PMCID: PMC8521998 DOI: 10.3389/fdgth.2020.00014] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2020] [Accepted: 07/07/2020] [Indexed: 12/23/2022] Open
Abstract
Genome-wide association studies have identified nearly 40 genome-wide significant single nucleotide polymorphisms (SNPs) which are associated with Alzheimer's Disease (AD). Due to the polygenicity of AD, polygenic risk scores (PRS) have shown high potential for AD risk prediction. PRSs have been shown to successfully discriminate between AD cases and controls achieving a prediction accuracy of up to 84% based on area under the receiver operating curve. The prediction accuracy in AD is higher compared with other complex genetic disorders. PRS can be restricted to SNPs which reside in biologically relevant gene-sets; the predictive value of these gene-sets in the general population is not as high as genome-wide PRS, but they may play an important role to identify mechanisms of disease development and inform biological experiments. Multiple methods are available to derive PRSs, such as selecting SNPs based on statistical evidence of association with the disease or using prior evidence for SNP selection. All methods have advantages, but PRS produced using different methodologies are often not comparable, and results should be interpreted with care. Similarly, this is true when PRS is based on different background populations. With the exponential growth in development of digital electronic devices it is easy to calculate an individual's disease risk using public databases. A major limitation for the utility of PRSs is that the risk score is sample and method dependent. Therefore, replicability and interpretability of PRS is an important issue. PRS can be used to determine the probability of developing disease which incorporates information about disease risk in the general population or in a specific AD risk group. It is essential to consult with genetic counselors to ensure genetic risk is communicated appropriately.
Collapse
Affiliation(s)
- Emily Baker
- UK Dementia Research Institute at Cardiff University, School of Medicine, Cardiff University, Cardiff, United Kingdom
| | - Valentina Escott-Price
- UK Dementia Research Institute at Cardiff University, School of Medicine, Cardiff University, Cardiff, United Kingdom.,MRC Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neurosciences, School of Medicine, Cardiff University, Cardiff, United Kingdom
| |
Collapse
|
612
|
Lieb W, Vasan RS. An update on genetic risk scores for coronary artery disease: are they useful for predicting disease risk and guiding clinical decisions? Expert Rev Cardiovasc Ther 2020; 18:443-447. [PMID: 32672491 DOI: 10.1080/14779072.2020.1797489] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Affiliation(s)
- Wolfgang Lieb
- Institute of Epidemiology, Kiel University , Kiel, Germany
| | - Ramachandran S Vasan
- The Framingham Heart Study , Framingham, MA, USA.,Sections of Preventive Medicine and Epidemiology, and Cardiovascular Medicine, Department of Medicine, Boston University School of Medicine , Boston, MA, USA.,Department of Epidemiology, Boston University School of Public Health
| |
Collapse
|
613
|
Sibling validation of polygenic risk scores and complex trait prediction. Sci Rep 2020; 10:13190. [PMID: 32764582 PMCID: PMC7411027 DOI: 10.1038/s41598-020-69927-7] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2020] [Accepted: 07/17/2020] [Indexed: 12/30/2022] Open
Abstract
We test 26 polygenic predictors using tens of thousands of genetic siblings from the UK Biobank (UKB), for whom we have SNP genotypes, health status, and phenotype information in late adulthood. Siblings have typically experienced similar environments during childhood, and exhibit negligible population stratification relative to each other. Therefore, the ability to predict differences in disease risk or complex trait values between siblings is a strong test of genomic prediction in humans. We compare validation results obtained using non-sibling subjects to those obtained among siblings and find that typically most of the predictive power persists in between-sibling designs. In the case of disease risk we test the extent to which higher polygenic risk score (PRS) identifies the affected sibling, and also compute Relative Risk Reduction as a function of risk score threshold. For quantitative traits we examine between-sibling differences in trait values as a function of predicted differences, and compare to performance in non-sibling pairs. Example results: Given 1 sibling with normal-range PRS score (< 84 percentile, < + 1 SD) and 1 sibling with high PRS score (top few percentiles, i.e. > + 2 SD), the predictors identify the affected sibling about 70–90% of the time across a variety of disease conditions, including Breast Cancer, Heart Attack, Diabetes, etc. 55–65% of the time the higher PRS sibling is the case. For quantitative traits such as height, the predictor correctly identifies the taller sibling roughly 80 percent of the time when the (male) height difference is 2 inches or more.
Collapse
|
614
|
Zhu Z, Hasegawa K, Ma B, Fujiogi M, Camargo CA, Liang L. Association of asthma and its genetic predisposition with the risk of severe COVID-19. J Allergy Clin Immunol 2020; 146:327-329.e4. [PMID: 32522462 PMCID: PMC7423602 DOI: 10.1016/j.jaci.2020.06.001] [Citation(s) in RCA: 163] [Impact Index Per Article: 32.6] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2020] [Revised: 05/29/2020] [Accepted: 06/03/2020] [Indexed: 12/13/2022]
Affiliation(s)
- Zhaozhong Zhu
- Program in Genetic Epidemiology and Statistical Genetics, Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, Mass; Department of Emergency Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, Mass; Department of Environmental Health, Harvard T.H. Chan School of Public Health, Boston, Mass
| | - Kohei Hasegawa
- Department of Emergency Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, Mass
| | - Baoshan Ma
- College of Information Science and Technology, Dalian Maritime University, Dalian, Liaoning, China
| | - Michimasa Fujiogi
- Department of Emergency Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, Mass
| | - Carlos A Camargo
- Department of Emergency Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, Mass
| | - Liming Liang
- Program in Genetic Epidemiology and Statistical Genetics, Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, Mass; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Mass.
| |
Collapse
|
615
|
Wang Y, Guo J, Ni G, Yang J, Visscher PM, Yengo L. Theoretical and empirical quantification of the accuracy of polygenic scores in ancestry divergent populations. Nat Commun 2020; 11:3865. [PMID: 32737319 PMCID: PMC7395791 DOI: 10.1038/s41467-020-17719-y] [Citation(s) in RCA: 123] [Impact Index Per Article: 24.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2019] [Accepted: 07/10/2020] [Indexed: 02/03/2023] Open
Abstract
Polygenic scores (PGS) have been widely used to predict disease risk using variants identified from genome-wide association studies (GWAS). To date, most GWAS have been conducted in populations of European ancestry, which limits the use of GWAS-derived PGS in non-European ancestry populations. Here, we derive a theoretical model of the relative accuracy (RA) of PGS across ancestries. We show through extensive simulations that the RA of PGS based on genome-wide significant SNPs can be predicted accurately from modelling linkage disequilibrium (LD), minor allele frequencies (MAF), cross-population correlations of causal SNP effects and heritability. We find that LD and MAF differences between ancestries can explain between 70 and 80% of the loss of RA of European-based PGS in African ancestry for traits like body mass index and type 2 diabetes. Our results suggest that causal variants underlying common genetic variation identified in European ancestry GWAS are mostly shared across continents.
Collapse
Affiliation(s)
- Ying Wang
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Jing Guo
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Guiyan Ni
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Jian Yang
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD 4072, Australia
- Institute for Advanced Research, Wenzhou Medical University, Wenzhou, Zhejiang, 325027, China
| | - Peter M Visscher
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Loic Yengo
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD 4072, Australia.
| |
Collapse
|
616
|
Coombes BJ, Ploner A, Bergen SE, Biernacka JM. A principal component approach to improve association testing with polygenic risk scores. Genet Epidemiol 2020; 44:676-686. [PMID: 32691445 DOI: 10.1002/gepi.22339] [Citation(s) in RCA: 61] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2020] [Revised: 05/13/2020] [Accepted: 07/10/2020] [Indexed: 12/16/2022]
Abstract
Polygenic risk scores (PRSs) have become an increasingly popular approach for demonstrating polygenic influences on complex traits and for establishing common polygenic signals between different traits. PRSs are typically constructed using pruning and thresholding (P+T), but the best choice of parameters is uncertain; thus multiple settings are used and the best is chosen. Optimization can lead to inflated Type I error. Permutation procedures can correct this, but they can be computationally intensive. Alternatively, a single parameter setting can be chosen a priori for the PRS, but choosing suboptimal settings results in loss of power. We propose computing PRSs under a range of parameter settings, performing principal component analysis (PCA) on the resulting set of PRSs, and using the first PRS-PC in association tests. The first PC reweights the variants included in the PRS to achieve maximum variation over all PRS settings used. Using simulations and a real data application to study PRS association with bipolar disorder and psychosis in bipolar disorder, we compare the performance of the proposed PRS-PCA approach with a permutation test and an a priori selected p-value threshold. The PRS-PCA approach is simple to implement, outperforms the other strategies in most scenarios, and provides an unbiased estimate of prediction performance.
Collapse
Affiliation(s)
- Brandon J Coombes
- Department of Health Sciences Research, Mayo Clinic, Rochester, Minnesota
| | - Alexander Ploner
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
| | - Sarah E Bergen
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
| | - Joanna M Biernacka
- Department of Health Sciences Research, Mayo Clinic, Rochester, Minnesota.,Department of Psychiatry and Psychology, Mayo Clinic, Rochester, Minnesota
| |
Collapse
|
617
|
Yong SY, Raben TG, Lello L, Hsu SDH. Genetic architecture of complex traits and disease risk predictors. Sci Rep 2020; 10:12055. [PMID: 32694572 PMCID: PMC7374622 DOI: 10.1038/s41598-020-68881-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2020] [Accepted: 06/30/2020] [Indexed: 01/30/2023] Open
Abstract
Genomic prediction of complex human traits (e.g., height, cognitive ability, bone density) and disease risks (e.g., breast cancer, diabetes, heart disease, atrial fibrillation) has advanced considerably in recent years. Using data from the UK Biobank, predictors have been constructed using penalized algorithms that favor sparsity: i.e., which use as few genetic variants as possible. We analyze the specific genetic variants (SNPs) utilized in these predictors, which can vary from dozens to as many as thirty thousand. We find that the fraction of SNPs in or near genic regions varies widely by phenotype. For the majority of disease conditions studied, a large amount of the variance is accounted for by SNPs outside of coding regions. The state of these SNPs cannot be determined from exome-sequencing data. This suggests that exome data alone will miss much of the heritability for these traits-i.e., existing PRS cannot be computed from exome data alone. We also study the fraction of SNPs and of variance that is in common between pairs of predictors. The DNA regions used in disease risk predictors so far constructed seem to be largely disjoint (with a few interesting exceptions), suggesting that individual genetic disease risks are largely uncorrelated. It seems possible in theory for an individual to be a low-risk outlier in all conditions simultaneously.
Collapse
Affiliation(s)
- Soke Yuen Yong
- Department of Physics and Astronomy, Michigan State University, East Lansing, USA.
| | - Timothy G Raben
- Department of Physics and Astronomy, Michigan State University, East Lansing, USA
| | - Louis Lello
- Department of Physics and Astronomy, Michigan State University, East Lansing, USA.,Genomic Prediction, North Brunswick, NJ, USA
| | - Stephen D H Hsu
- Department of Physics and Astronomy, Michigan State University, East Lansing, USA.,Genomic Prediction, North Brunswick, NJ, USA
| |
Collapse
|
618
|
Human essential hypertension: no significant association of polygenic risk scores with antihypertensive drug responses. Sci Rep 2020; 10:11940. [PMID: 32686723 PMCID: PMC7371738 DOI: 10.1038/s41598-020-68878-3] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2020] [Accepted: 06/30/2020] [Indexed: 11/08/2022] Open
Abstract
Polygenic risk scores (PRSs) for essential hypertension, calculated from > 900 genomic loci, were recently found to explain a significant fraction of hypertension heritability and complications. To investigate whether variation of hypertension PRS also captures variation of antihypertensive drug responsiveness, we calculated two different PRSs for both systolic and diastolic blood pressure: one based on the top 793 independent hypertension-associated single nucleotide polymorphisms and another based on over 1 million genome-wide variants. Using our pharmacogenomic GENRES study comprising four different antihypertensive monotherapies (n ~ 200 for all drugs), we identified a weak, but (after Bonferroni correction) statistically nonsignificant association of higher genome-wide PRSs with weaker response to a diuretic. In addition, we noticed a correlation between high genome-wide PRS and electrocardiographic left ventricular hypertrophy. Finally, using data of the Finnish arm of the LIFE study (n = 346), we found that PRSs for systolic blood pressure were slightly higher in patients with drug-resistant hypertension than in those with drug-controlled hypertension (p = 0.03, not significant after Bonferroni correction). In conclusion, our results indicate that patients with elevated hypertension PRSs may be predisposed to difficult-to-control hypertension and complications thereof. No general association between a high PRS and less efficient drug responsiveness was noticed.
Collapse
|
619
|
Abstract
Since the initial success of genome-wide association studies (GWAS) in 2005, tens of thousands of genetic variants have been identified for hundreds of human diseases and traits. In a GWAS, genotype information at up to millions of genetic markers is collected from up to hundreds of thousands of individuals, together with their phenotype information. Several scientific goals can be accomplished through the analysis of GWAS data, including the identification of variants, genes, and pathways associated with diseases and traits of interest; the inference of the genetic architecture of these traits; and the development of genetic risk prediction models. In this review, we provide an overview of the statistical challenges in achieving these goals and recent progress in statistical methodology to address these challenges.
Collapse
Affiliation(s)
- Ning Sun
- Department of Biostatistics, Yale School of Public Health, New Haven, Connecticut 06520, USA
| | - Hongyu Zhao
- Department of Biostatistics, Yale School of Public Health, New Haven, Connecticut 06520, USA
| |
Collapse
|
620
|
Zhu Z, Hasegawa K, Camargo CA, Liang L. Investigating asthma heterogeneity through shared and distinct genetics: Insights from genome-wide cross-trait analysis. J Allergy Clin Immunol 2020; 147:796-807. [PMID: 32693092 PMCID: PMC7368660 DOI: 10.1016/j.jaci.2020.07.004] [Citation(s) in RCA: 59] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2020] [Revised: 07/03/2020] [Accepted: 07/09/2020] [Indexed: 12/17/2022]
Abstract
Asthma is a heterogeneous respiratory disease reflecting distinct pathobiologic mechanisms. These mechanisms are based, at least partly, on different genetic factors shared by many other conditions, such as allergic diseases and obesity. Investigating the shared genetic effects enables better understanding of the mechanisms of phenotypic correlations and is less subject to confounding by environmental factors. The increasing availability of large-scale genome-wide association study (GWAS) for asthma has enabled researchers to examine the genetic contributions to the epidemiologic associations between asthma subtypes and those between coexisting diseases and/or traits and asthma. Studies have found not only shared but also distinct genetic components between asthma subtypes, indicating that the heterogeneity is related to distinct genetics. This review summarizes a recently compiled analytic approach-genome-wide cross-trait analysis-to determine shared and distinct genetic architecture. The genome-wide cross-trait analysis features in several analytic aspects: genetic correlation, cross-trait meta-analysis, Mendelian randomization, polygenic risk score, and functional analysis. In this article, we discuss in detail the scientific goals that can be achieved by these analyses, their advantages, and their limitations. We also make recommendations for future directions: (1) ethnicity-specific asthma GWASs and (2) application of cross-trait methods to multiomics data to dissect the heritability found in GWASs. Finally, these analytic approaches are also applicable to complex and heterogeneous traits beyond asthma.
Collapse
Affiliation(s)
- Zhaozhong Zhu
- Program in Genetic Epidemiology and Statistical Genetics, Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, Mass; Department of Emergency Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, Mass; Department of Environmental Health, Harvard T.H. Chan School of Public Health, Boston, Mass
| | - Kohei Hasegawa
- Department of Emergency Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, Mass
| | - Carlos A Camargo
- Program in Genetic Epidemiology and Statistical Genetics, Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, Mass; Department of Emergency Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, Mass
| | - Liming Liang
- Program in Genetic Epidemiology and Statistical Genetics, Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, Mass; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Mass.
| |
Collapse
|
621
|
Wang G, Sarkar A, Carbonetto P, Stephens M. A simple new approach to variable selection in regression, with application to genetic fine mapping. J R Stat Soc Series B Stat Methodol 2020; 82:1273-1300. [DOI: 10.1111/rssb.12388] [Citation(s) in RCA: 491] [Impact Index Per Article: 98.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
|
622
|
Lees JA, Mai TT, Galardini M, Wheeler NE, Horsfield ST, Parkhill J, Corander J. Improved Prediction of Bacterial Genotype-Phenotype Associations Using Interpretable Pangenome-Spanning Regressions. mBio 2020; 11:e01344-20. [PMID: 32636251 PMCID: PMC7343994 DOI: 10.1128/mbio.01344-20] [Citation(s) in RCA: 50] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2020] [Accepted: 06/05/2020] [Indexed: 12/19/2022] Open
Abstract
Discovery of genetic variants underlying bacterial phenotypes and the prediction of phenotypes such as antibiotic resistance are fundamental tasks in bacterial genomics. Genome-wide association study (GWAS) methods have been applied to study these relations, but the plastic nature of bacterial genomes and the clonal structure of bacterial populations creates challenges. We introduce an alignment-free method which finds sets of loci associated with bacterial phenotypes, quantifies the total effect of genetics on the phenotype, and allows accurate phenotype prediction, all within a single computationally scalable joint modeling framework. Genetic variants covering the entire pangenome are compactly represented by extended DNA sequence words known as unitigs, and model fitting is achieved using elastic net penalization, an extension of standard multiple regression. Using an extensive set of state-of-the-art bacterial population genomic data sets, we demonstrate that our approach performs accurate phenotype prediction, comparable to popular machine learning methods, while retaining both interpretability and computational efficiency. Compared to those of previous approaches, which test each genotype-phenotype association separately for each variant and apply a significance threshold, the variants selected by our joint modeling approach overlap substantially.IMPORTANCE Being able to identify the genetic variants responsible for specific bacterial phenotypes has been the goal of bacterial genetics since its inception and is fundamental to our current level of understanding of bacteria. This identification has been based primarily on painstaking experimentation, but the availability of large data sets of whole genomes with associated phenotype metadata promises to revolutionize this approach, not least for important clinical phenotypes that are not amenable to laboratory analysis. These models of phenotype-genotype association can in the future be used for rapid prediction of clinically important phenotypes such as antibiotic resistance and virulence by rapid-turnaround or point-of-care tests. However, despite much effort being put into adapting genome-wide association study (GWAS) approaches to cope with bacterium-specific problems, such as strong population structure and horizontal gene exchange, current approaches are not yet optimal. We describe a method that advances methodology for both association and generation of portable prediction models.
Collapse
Affiliation(s)
- John A Lees
- MRC Centre for Global Infectious Disease Analysis, Department of Infectious Disease Epidemiology, Imperial College London, London, United Kingdom
| | - T Tien Mai
- Oslo Centre for Biostatistics and Epidemiology, Department of Biostatistics, University of Oslo, Oslo, Norway
| | - Marco Galardini
- Biological Design Center, Boston University, Boston, Massachusetts, USA
| | - Nicole E Wheeler
- Centre for Genomic Pathogen Surveillance, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| | - Samuel T Horsfield
- MRC Centre for Global Infectious Disease Analysis, Department of Infectious Disease Epidemiology, Imperial College London, London, United Kingdom
| | - Julian Parkhill
- Department of Veterinary Medicine, University of Cambridge, Cambridge, United Kingdom
| | - Jukka Corander
- Oslo Centre for Biostatistics and Epidemiology, Department of Biostatistics, University of Oslo, Oslo, Norway
- Centre for Genomic Pathogen Surveillance, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
- Helsinki Institute of Information Technology, Department of Mathematics and Statistics, University of Helsinki, Helsinki, Finland
| |
Collapse
|
623
|
Yanes T, McInerney-Leo AM, Law MH, Cummings S. The emerging field of polygenic risk scores and perspective for use in clinical care. Hum Mol Genet 2020; 29:R165-R176. [DOI: 10.1093/hmg/ddaa136] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2020] [Revised: 06/30/2020] [Accepted: 07/01/2020] [Indexed: 02/06/2023] Open
Abstract
Abstract
Genetic testing is used widely for diagnostic, carrier and predictive testing in monogenic diseases. Until recently, there were no genetic testing options available for multifactorial complex diseases like heart disease, diabetes and cancer. Genome-wide association studies (GWAS) have been invaluable in identifying single-nucleotide polymorphisms (SNPs) associated with increased or decreased risk for hundreds of complex disorders. For a given disease, SNPs can be combined to generate a cumulative estimation of risk known as a polygenic risk score (PRS). After years of research, PRSs are increasingly used in clinical settings. In this article, we will review the literature on how both genome-wide and restricted PRSs are developed and the relative merit of each. The validation and evaluation of PRSs will also be discussed, including the recognition that PRS validity is intrinsically linked to the methodological and analytical approach of the foundation GWAS together with the ethnic characteristics of that cohort. Specifically, population differences may affect imputation accuracy, risk magnitude and direction. Even as PRSs are being introduced into clinical practice, there is a push to combine them with clinical and demographic risk factors to develop a holistic disease risk. The existing evidence regarding the clinical utility of PRSs is considered across four different domains: informing population screening programs, guiding therapeutic interventions, refining risk for families at high risk, and facilitating diagnosis and predicting prognostic outcomes. The evidence for clinical utility in relation to five well-studied disorders is summarized. The potential ethical, legal and social implications are also highlighted.
Collapse
Affiliation(s)
- Tatiane Yanes
- Dermatology Research Centre, The University of Queensland Diamantina Institute, The University of Queensland, Brisbane, QLD 4102, Australia
| | - Aideen M McInerney-Leo
- Dermatology Research Centre, The University of Queensland Diamantina Institute, The University of Queensland, Brisbane, QLD 4102, Australia
| | - Matthew H Law
- Statistical Genetics Lab, QIMR Berghofer Medical Research Institute, Herston QLD 4006, Australia
- Faculty of Health, School of Biomedical Sciences, and Institute of Health and Biomedical Innovation, Queensland University of Technology, Kelvin Grove QLD 4059, Australia
| | | |
Collapse
|
624
|
Chun S, Imakaev M, Hui D, Patsopoulos NA, Neale BM, Kathiresan S, Stitziel NO, Sunyaev SR. Non-parametric Polygenic Risk Prediction via Partitioned GWAS Summary Statistics. Am J Hum Genet 2020; 107:46-59. [PMID: 32470373 PMCID: PMC7332650 DOI: 10.1016/j.ajhg.2020.05.004] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2019] [Accepted: 05/01/2020] [Indexed: 02/07/2023] Open
Abstract
In complex trait genetics, the ability to predict phenotype from genotype is the ultimate measure of our understanding of genetic architecture underlying the heritability of a trait. A complete understanding of the genetic basis of a trait should allow for predictive methods with accuracies approaching the trait's heritability. The highly polygenic nature of quantitative traits and most common phenotypes has motivated the development of statistical strategies focused on combining myriad individually non-significant genetic effects. Now that predictive accuracies are improving, there is a growing interest in the practical utility of such methods for predicting risk of common diseases responsive to early therapeutic intervention. However, existing methods require individual-level genotypes or depend on accurately specifying the genetic architecture underlying each disease to be predicted. Here, we propose a polygenic risk prediction method that does not require explicitly modeling any underlying genetic architecture. We start with summary statistics in the form of SNP effect sizes from a large GWAS cohort. We then remove the correlation structure across summary statistics arising due to linkage disequilibrium and apply a piecewise linear interpolation on conditional mean effects. In both simulated and real datasets, this new non-parametric shrinkage (NPS) method can reliably allow for linkage disequilibrium in summary statistics of 5 million dense genome-wide markers and consistently improves prediction accuracy. We show that NPS improves the identification of groups at high risk for breast cancer, type 2 diabetes, inflammatory bowel disease, and coronary heart disease, all of which have available early intervention or prevention treatments.
Collapse
Affiliation(s)
- Sung Chun
- Division of Genetics, Brigham and Women's Hospital, Boston, MA 02115, USA; Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, USA; Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA; Altius Institute for Biomedical Sciences, Seattle, WA 98121, USA
| | - Maxim Imakaev
- Division of Genetics, Brigham and Women's Hospital, Boston, MA 02115, USA; Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, USA; Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA; Altius Institute for Biomedical Sciences, Seattle, WA 98121, USA
| | - Daniel Hui
- Division of Genetics, Brigham and Women's Hospital, Boston, MA 02115, USA; Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA; Systems Biology and Computer Science Program, Ann Romney Center for Neurological Diseases, Department of Neurology, Brigham & Women's Hospital, Boston, MA 02115, USA
| | - Nikolaos A Patsopoulos
- Division of Genetics, Brigham and Women's Hospital, Boston, MA 02115, USA; Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA; Systems Biology and Computer Science Program, Ann Romney Center for Neurological Diseases, Department of Neurology, Brigham & Women's Hospital, Boston, MA 02115, USA
| | - Benjamin M Neale
- Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA; Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA; Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Sekar Kathiresan
- Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA; Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA 02114, USA; Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Nathan O Stitziel
- Cardiovascular Division, Department of Medicine, Washington University School of Medicine, Saint Louis, MO 63110, USA; Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63110, USA; McDonnell Genome Institute, Washington University School of Medicine, Saint Louis, MO 63110, USA.
| | - Shamil R Sunyaev
- Division of Genetics, Brigham and Women's Hospital, Boston, MA 02115, USA; Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, USA; Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA; Altius Institute for Biomedical Sciences, Seattle, WA 98121, USA.
| |
Collapse
|
625
|
Fryett JJ, Morris AP, Cordell HJ. Investigation of prediction accuracy and the impact of sample size, ancestry, and tissue in transcriptome-wide association studies. Genet Epidemiol 2020; 44:425-441. [PMID: 32190932 PMCID: PMC8641384 DOI: 10.1002/gepi.22290] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2019] [Revised: 02/05/2020] [Accepted: 03/06/2020] [Indexed: 01/14/2023]
Abstract
In transcriptome-wide association studies (TWAS), gene expression values are predicted using genotype data and tested for association with a phenotype. The power of this approach to detect associations relies, at least in part, on the accuracy of the prediction. Here we compare the prediction accuracy of six different methods-LASSO, Ridge regression, Elastic net, Best Linear Unbiased Predictor, Bayesian Sparse Linear Mixed Model, and Random Forests-by performing cross-validation using data from the Geuvadis Project. We also examine prediction accuracy (a) at different sample sizes, (b) when ancestry of the prediction model training and testing populations is different, and (c) when the tissue used to train the model is different from the tissue to be predicted. We find that, for most genes, the expression cannot be accurately predicted, but in general sparse statistical models tend to outperform polygenic models at prediction. Average prediction accuracy is reduced when the model training set size is reduced or when predicting across ancestries and is marginally reduced when predicting across tissues. We conclude that using sparse statistical models and the development of large reference panels across multiple ethnicities and tissues will lead to better prediction of gene expression, and thus may improve TWAS power.
Collapse
Affiliation(s)
- James J. Fryett
- Population Health Sciences Institute, Faculty of Medical SciencesNewcastle UniversityNewcastle upon TyneUK
| | - Andrew P. Morris
- Division of Musculoskeletal and Dermatological SciencesUniversity of ManchesterManchesterUK
| | - Heather J. Cordell
- Population Health Sciences Institute, Faculty of Medical SciencesNewcastle UniversityNewcastle upon TyneUK
| |
Collapse
|
626
|
Ajnakina O, Cadar D, Steptoe A. Interplay between Socioeconomic Markers and Polygenic Predisposition on Timing of Dementia Diagnosis. J Am Geriatr Soc 2020; 68:1529-1536. [PMID: 32187654 PMCID: PMC7363562 DOI: 10.1111/jgs.16406] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2019] [Revised: 01/08/2020] [Accepted: 02/14/2020] [Indexed: 12/15/2022]
Abstract
OBJECTIVES Identifying the interplay between socioeconomic markers (education and financial resources) and polygenetic predisposition influencing the time of dementia and the diagnosis of clinical Alzheimer's disease (AD) dementia is of central relevance for preventive strategies. DESIGN Prospective cohort design. SETTING The English Longitudinal Study of Aging is a household survey data set of a representative sample. PARTICIPANTS A total of 7,039 individuals aged 50 years and older participated in the study. Of these, 320 (4.6%) were diagnosed with dementia over the 10-year follow-up. MEASUREMENTS Polygenic score (PGS) for Alzheimer's disease (AD-PGS) was calculated using summary statistics from the International Genomics of Alzheimer's Project. An accelerated failure time survival model was used to investigate interactions between AD-PGS and socioeconomic markers on the timing of dementia and clinical AD dementia diagnosis. RESULTS A one standard deviation increase in AD-PGS was associated with an accelerated time to dementia diagnosis by 4.8 months. The presence of the apolipoprotein E gene (APOE-ε4) was associated with an earlier dementia onset by approximately 24.9 months, whereas intermediate and low levels of wealth were associated with an accelerated time to dementia diagnosis by 12.0 months and 18.7 months, respectively. A multiplicative interaction between AD-PGS and years of completed schooling in decelerating the time to clinical AD dementia by 3.0 months suggests educational attainment may serve as a protective mechanism against AD diagnosis among older people with a higher polygenic risk. Interaction between AD-PGS and lower wealth accelerated the time to clinical AD dementia diagnosis by 21.1 to 24.1 months. CONCLUSION Socioeconomic markers influence the time to dementia and clinical AD dementia diagnosis, particularly in those with a higher polygenic predisposition. J Am Geriatr Soc 68:1529-1536, 2020.
Collapse
Affiliation(s)
- Olesya Ajnakina
- Department of Behavioural Science and HealthInstitute of Epidemiology and Health Care, University College LondonLondonUK
- Department of Biostatistics & Health InformaticsInstitute of Psychiatry, Psychology and Neuroscience, Kingʼs College LondonLondonUK
| | - Dorina Cadar
- Department of Behavioural Science and HealthInstitute of Epidemiology and Health Care, University College LondonLondonUK
| | - Andrew Steptoe
- Department of Behavioural Science and HealthInstitute of Epidemiology and Health Care, University College LondonLondonUK
| |
Collapse
|
627
|
Wells PM, Adebayo AS, Bowyer RCE, Freidin MB, Finckh A, Strowig T, Lesker TR, Alpizar-Rodriguez D, Gilbert B, Kirkham B, Cope AP, Steves CJ, Williams FMK. Associations between gut microbiota and genetic risk for rheumatoid arthritis in the absence of disease: a cross-sectional study. THE LANCET. RHEUMATOLOGY 2020; 2:e418-e427. [PMID: 33345197 PMCID: PMC7729822 DOI: 10.1016/s2665-9913(20)30064-3] [Citation(s) in RCA: 113] [Impact Index Per Article: 22.6] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
BACKGROUND Rheumatoid arthritis is a chronic inflammatory autoimmune disease that is associated with reduced life expectancy. The disease is heritable and an extensive repertoire of genetic variants have been identified. The gut microbiota might represent an environmental risk factor for rheumatoid arthritis. We aimed to assess whether known rheumatoid arthritis risk alleles were associated with the gut microbiota in a large population who do not have rheumatoid arthritis. METHODS In this cross-sectional study done in the UK and Switzerland, we used genotyping and microbiota data from previous studies of the TwinsUK cohort, excluding participants who had ever had a diagnosis of rheumatoid arthritis, as well as their unaffected co-twins. We used blood samples for genotyping and stool samples for the assessment of the gut microbiota. We generated a polygenic risk score (PRS) for rheumatoid arthritis in 1650 TwinsUK participants without the disease, based on 233 GWAS-identified single nucleotide polymorphisms associated with rheumatoid arthritis. We validated the PRS using logistic regression against rheumatoid arthritis diagnosis in 2686 UK Biobank individuals with a confirmed diagnosis of rheumatoid arthritis. Amplicon sequence variants (ASVs) were generated from 16S rRNA gene sequencing of stool samples and assessed for association with the PRS for rheumatoid arthritis. We validated the findings in an independent sample comprised of first-degree relatives of patients with rheumatoid arthritis from the SCREEN-RA cohort. Differential abundance of ASVs present in more than 5% of samples, grouped by ASV taxon annotation, against the rheumatoid arthritis PRS as a continuous variable was assessed using fixed-effects covariates. To account for multiple testing, the false discovery rate calculation was applied to all p values to generate q values, with a significance threshold of 0·05 determined a priori. FINDINGS We found that presence of Prevotella spp were positively associated with the rheumatoid arthritis PRS in TwinsUK participants (q<1 × 10-7). This finding was validated in SCREEN-RA participants (n=133) carrying established shared epitope risk alleles (q=0·0011). We also found an association between Prevotella spp and presence of preclinical rheumatoid arthritis phases (q=0·021). INTERPRETATION Prevotella spp in the gut microbiota are associated with the rheumatoid arthritis genotype in the absence of rheumatoid arthritis, including in individuals at high risk of developing rheumatoid arthritis. Our findings suggest that host genotype is associated with microbiota profile before disease onset. FUNDING Versus Arthritis.
Collapse
Affiliation(s)
- Philippa M Wells
- Department of Twin Research and Genetic Epidemiology, King's College London, London, UK
| | - Adewale S Adebayo
- Department of Twin Research and Genetic Epidemiology, King's College London, London, UK
| | - Ruth C E Bowyer
- Department of Twin Research and Genetic Epidemiology, King's College London, London, UK
| | - Maxim B Freidin
- Department of Twin Research and Genetic Epidemiology, King's College London, London, UK
| | - Axel Finckh
- Division of Rheumatology, Geneva University Hospital, Geneva, Switzerland
| | - Till Strowig
- Department of Microbial Immune Regulation, Helmholtz Centre for Infection Research, Braunschweig, Germany; Hannover Medical School, Hannover, Germany
| | - Till Robin Lesker
- Department of Microbial Immune Regulation, Helmholtz Centre for Infection Research, Braunschweig, Germany
| | | | - Benoit Gilbert
- Division of Rheumatology, Geneva University Hospital, Geneva, Switzerland
| | - Bruce Kirkham
- Department of Rheumatology, Guy's and St Thomas' NHS Trust, London, UK
| | - Andrew P Cope
- Centre for Rheumatic Diseases, King's College London, London, UK; Centre for Inflammation Biology and Cancer Immunology, King's College London, London, UK
| | - Claire J Steves
- Department of Twin Research and Genetic Epidemiology, King's College London, London, UK; Department of Ageing and Health, St Thomas' Hospital, London, UK
| | - Frances M K Williams
- Department of Twin Research and Genetic Epidemiology, King's College London, London, UK.
| |
Collapse
|
628
|
Tong G, Guo G. The life-course association of birth-weight genes with self-rated health. BIODEMOGRAPHY AND SOCIAL BIOLOGY 2020; 65:268-286. [PMID: 32727274 PMCID: PMC8607814 DOI: 10.1080/19485565.2020.1765733] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
This study examines the impact of genes associated with normal-range birth weight (2500-4500 grams) on self-rated health in mid-to-late life course. Fifty-eight previously identified genetic variants that explain the variation in the normal-range birth weight were used to construct a genetic measure of birth weight for the non-Hispanic white sample from the Health and Retirement Study. Our results show that the genetic tendency toward higher birth weight predicts better self-rated health in mid-to-late life course net of various demographic, socioeconomic, and health behavioral factors. We also examine the heterogeneous effects of birth-weight genes across birth cohorts and age groups. Moreover, to clarify the paradox that higher birth weight can predict both better self-rated health and higher BMI, we show the positive association between birth weight genes and BMI can only hold within the normal-range BMI (18 ≤ BMI < 30). Overall, these findings suggest the genetic factors underlying the normal-range birth weight can have life-courseimpacts on health.
Collapse
Affiliation(s)
- Guangyu Tong
- Yale Center for Analytical Sciences and Department of Biostatistics, Yale University , New Haven, Connecticut, USA
| | - Guang Guo
- Department of Sociology, University of North Carolina at Chapel Hill , Chapel Hill, United States
| |
Collapse
|
629
|
Akingbuwa WA, Hammerschlag AR, Jami ES, Allegrini AG, Karhunen V, Sallis H, Ask H, Askeland RB, Baselmans B, Diemer E, Hagenbeek FA, Havdahl A, Hottenga JJ, Mbarek H, Rivadeneira F, Tesli M, van Beijsterveldt C, Breen G, Lewis CM, Thapar A, Boomsma DI, Kuja-Halkola R, Reichborn-Kjennerud T, Magnus P, Rimfeld K, Ystrom EIVIND, Jarvelin MR, Lichtenstein P, Lundstrom S, Munafò MR, Plomin R, Tiemeier H, Nivard MG, Bartels M, Middeldorp CM, and the Bipolar Disorder and Major Depressive Disorder Working Groups of the Psychiatric Genomics Consortium. Genetic Associations Between Childhood Psychopathology and Adult Depression and Associated Traits in 42 998 Individuals: A Meta-analysis. JAMA Psychiatry 2020; 77:715-728. [PMID: 32293669 PMCID: PMC7160753 DOI: 10.1001/jamapsychiatry.2020.0527] [Citation(s) in RCA: 44] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/03/2019] [Accepted: 02/17/2020] [Indexed: 12/14/2022]
Abstract
Importance Adult mood disorders are often preceded by behavioral and emotional problems in childhood. It is yet unclear what explains the associations between childhood psychopathology and adult traits. Objective To investigate whether genetic risk for adult mood disorders and associated traits is associated with childhood disorders. Design, Setting, and Participants This meta-analysis examined data from 7 ongoing longitudinal birth and childhood cohorts from the UK, the Netherlands, Sweden, Norway, and Finland. Starting points of data collection ranged from July 1985 to April 2002. Participants were repeatedly assessed for childhood psychopathology from ages 6 to 17 years. Data analysis occurred from September 2017 to May 2019. Exposures Individual polygenic scores (PGS) were constructed in children based on genome-wide association studies of adult major depression, bipolar disorder, subjective well-being, neuroticism, insomnia, educational attainment, and body mass index (BMI). Main Outcomes and Measures Regression meta-analyses were used to test associations between PGS and attention-deficit/hyperactivity disorder (ADHD) symptoms and internalizing and social problems measured repeatedly across childhood and adolescence and whether these associations depended on childhood phenotype, age, and rater. Results The sample included 42 998 participants aged 6 to 17 years. Male participants varied from 43.0% (1040 of 2417 participants) to 53.1% (2434 of 4583 participants) by age and across all cohorts. The PGS of adult major depression, neuroticism, BMI, and insomnia were positively associated with childhood psychopathology (β estimate range, 0.023-0.042 [95% CI, 0.017-0.049]), while associations with PGS of subjective well-being and educational attainment were negative (β, -0.026 to -0.046 [95% CI, -0.020 to -0.057]). There was no moderation of age, type of childhood phenotype, or rater with the associations. The exceptions were stronger associations between educational attainment PGS and ADHD compared with internalizing problems (Δβ, 0.0561 [Δ95% CI, 0.0318-0.0804]; ΔSE, 0.0124) and social problems (Δβ, 0.0528 [Δ95% CI, 0.0282-0.0775]; ΔSE, 0.0126), and between BMI PGS and ADHD and social problems (Δβ, -0.0001 [Δ95% CI, -0.0102 to 0.0100]; ΔSE, 0.0052), compared with internalizing problems (Δβ, -0.0310 [Δ95% CI, -0.0456 to -0.0164]; ΔSE, 0.0074). Furthermore, the association between educational attainment PGS and ADHD increased with age (Δβ, -0.0032 [Δ 95% CI, -0.0048 to -0.0017]; ΔSE, 0.0008). Conclusions and Relevance Results from this study suggest the existence of a set of genetic factors influencing a range of traits across the life span with stable associations present throughout childhood. Knowledge of underlying mechanisms may affect treatment and long-term outcomes of individuals with psychopathology.
Collapse
Affiliation(s)
- Wonuola A. Akingbuwa
- Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
- Amsterdam Public Health Research Institute, Amsterdam, the Netherlands
| | - Anke R. Hammerschlag
- Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
- Amsterdam Public Health Research Institute, Amsterdam, the Netherlands
- Child Health Research Centre, The University of Queensland, Brisbane, Queensland, Australia
| | - Eshim S. Jami
- Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
- Amsterdam Public Health Research Institute, Amsterdam, the Netherlands
| | - Andrea G. Allegrini
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, United Kingdom
| | - Ville Karhunen
- Department of Epidemiology and Biostatistics, Imperial College London, London, United Kingdom
| | - Hannah Sallis
- University of Bristol School of Psychological Science, Bristol, United Kingdom
- MRC Integrative Epidemiology Unit, University of Bristol, Bristol, United Kingdom
- Centre for Academic Mental Health, Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, United Kingdom
| | - Helga Ask
- Department of Mental Disorders, Norwegian Institute of Public Health, Oslo, Norway
| | - Ragna B. Askeland
- Department of Mental Disorders, Norwegian Institute of Public Health, Oslo, Norway
| | - Bart Baselmans
- Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
| | - Elizabeth Diemer
- Child and Adolescent Psychiatry, Erasmus University Medical Center, Rotterdam, the Netherlands
| | - Fiona A. Hagenbeek
- Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
- Amsterdam Public Health Research Institute, Amsterdam, the Netherlands
| | - Alexandra Havdahl
- MRC Integrative Epidemiology Unit, University of Bristol, Bristol, United Kingdom
- Nic Waals Institute, Lovisenberg Diaconal Hospital, Oslo, Norway
- Department of Mental Disorders, Norwegian Institute of Public Health, Oslo, Norway
| | - Jouke-Jan Hottenga
- Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
| | - Hamdi Mbarek
- Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
- Qatar Genome Programme, Qatar Foundation, Doha, Qatar
| | - Fernando Rivadeneira
- The Generation R Study Group, Erasmus MC, University Medical Center Rotterdam, Rotterdam, the Netherlands
- Erasmus MC, Department of Epidemiology, University Medical Center Rotterdam, Rotterdam, the Netherlands
- Erasmus MC, Department of Internal Medicine, University Medical Center Rotterdam, Rotterdam, the Netherlands
| | - Martin Tesli
- Department of Mental Disorders, Norwegian Institute of Public Health, Oslo, Norway
| | | | - Gerome Breen
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, United Kingdom
- National Institute of Health Research Biomedical Research Centre, South London and Maudsley National Health Services Foundation Trust, London, London, United Kingdom
| | - Cathryn M. Lewis
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, United Kingdom
| | - Anita Thapar
- Medical Research Council Centre for Neuropsychiatric Genetics and Genomics, Cardiff University, Cardiff, United Kingdom
| | - Dorret I. Boomsma
- Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
- Amsterdam Public Health Research Institute, Amsterdam, the Netherlands
| | - Ralf Kuja-Halkola
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
| | | | - Per Magnus
- Centre for Fertility and Health, Norwegian Institute of Public Health, Oslo, Norway
| | - Kaili Rimfeld
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, United Kingdom
| | - EIVIND Ystrom
- Department of Mental Disorders, Norwegian Institute of Public Health, Oslo, Norway
- PROMENTA Research Center, Department of Psychology, University of Oslo, Oslo, Norway
| | - Marjo-Riitta Jarvelin
- Department of Epidemiology and Biostatistics, Imperial College London, London, United Kingdom
- Medical Research Council–Public Health England Centre for Environment and Health, Imperial College London, London, United Kingdom
- Center for Life Course Health Research, University of Oulu, Oulu, Finland
- Medical Research Center Oulu, Oulu, Finland
- Institute of Biomedicine and Biocenter of Oulu, Oulu, Finland
- Department of Life Sciences, Brunel University London College of Health and Life Sciences, London, United Kingdom
| | - Paul Lichtenstein
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
| | - Sebastian Lundstrom
- Centre for Ethics Law and Mental Health, Gillberg Neuropsychiatry Centre, University of Gothenburg, Gothenburg, Sweden
| | - Marcus R. Munafò
- University of Bristol School of Psychological Science, Bristol, United Kingdom
- MRC Integrative Epidemiology Unit, University of Bristol, Bristol, United Kingdom
- National Institute of Health Research Biomedical Research Centre, University Hospitals Bristol National Health Services Foundation Trust, University of Bristol, Bristol, United Kingdom
| | - Robert Plomin
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, United Kingdom
| | - Henning Tiemeier
- Child and Adolescent Psychiatry, Erasmus University Medical Center, Rotterdam, the Netherlands
- Department of Social and Behavioral Science, Harvard T. H. Chan School of Medicine, Boston, Massachusetts
| | - Michel G. Nivard
- Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
| | - Meike Bartels
- Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
- Amsterdam Public Health Research Institute, Amsterdam, the Netherlands
| | - Christel M. Middeldorp
- Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
- Child Health Research Centre, The University of Queensland, Brisbane, Queensland, Australia
- Child and Youth Mental Health Service, Children's Health Queensland Hospital and Health Services, Brisbane, Queensland, Australia
| | | |
Collapse
|
630
|
de Zeeuw EL, Hottenga JJ, Ouwens KG, Dolan CV, Ehli EA, Davies GE, Boomsma DI, van Bergen E. Intergenerational Transmission of Education and ADHD: Effects of Parental Genotypes. Behav Genet 2020; 50:221-232. [PMID: 32026073 PMCID: PMC7355279 DOI: 10.1007/s10519-020-09992-w] [Citation(s) in RCA: 43] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2019] [Accepted: 01/08/2020] [Indexed: 11/30/2022]
Abstract
It remains a challenge to determine whether children resemble their parents due to nature, nurture, or a mixture of both. Here we used a design that exploits the distinction between transmitted and non-transmitted alleles in genetic transmission from parent to offspring. Two separate polygenic scores (PGS) were calculated on the basis of the transmitted and non-transmitted alleles. The effect of the non-transmitted PGS is necessarily mediated by parental phenotypes, insofar as they contribute to the rearing environment of the offspring (genetic nurturing). We calculated transmitted and non-transmitted PGSs associated with adult educational attainment (EA) and PGSs associated with childhood ADHD in a general population sample of trios, i.e. child or adult offspring and their parents (N = 1120-2518). We tested if the EA and ADHD (non-)transmitted PGSs were associated with childhood academic achievement and ADHD in offspring. Based on the earlier findings for shared environment, we hypothesized to find genetic nurturing for academic achievement, but not for ADHD. In adults, both transmitted (R2 = 7.6%) and non-transmitted (R2 = 1.7%) EA PGSs were associated with offspring EA, evidencing genetic nurturing. In children around age 12, academic achievement was associated with the transmitted EA PGSs (R2 = 5.7%), but we found no support for genetic nurturing (R2 ~ 0.1%). The ADHD PGSs were not significantly associated with academic achievement (R2 ~ 0.6%). ADHD symptoms in children were only associated with transmitted EA PGSs and ADHD PGSs (R2 = 1-2%). Based on these results, we conclude that the associations between parent characteristics and offspring outcomes in childhood are mainly to be attributable to the effects of genes that are shared by parents and children.
Collapse
Affiliation(s)
- Eveline L de Zeeuw
- Department of Biological Psychology, Vrije Universiteit, Van der Boechorststraat 7-9, 1081 BT, Amsterdam, The Netherlands.
- Amsterdam Public Health Research Institute, VUmc, Amsterdam, The Netherlands.
| | - Jouke-Jan Hottenga
- Department of Biological Psychology, Vrije Universiteit, Van der Boechorststraat 7-9, 1081 BT, Amsterdam, The Netherlands
| | - Klaasjan G Ouwens
- Department of Biological Psychology, Vrije Universiteit, Van der Boechorststraat 7-9, 1081 BT, Amsterdam, The Netherlands
| | - Conor V Dolan
- Department of Biological Psychology, Vrije Universiteit, Van der Boechorststraat 7-9, 1081 BT, Amsterdam, The Netherlands
| | - Erik A Ehli
- Avera Institute for Human Genetics, Avera McKennan Hospital & University Health Center, Sioux Falls, SD, USA
| | - Gareth E Davies
- Avera Institute for Human Genetics, Avera McKennan Hospital & University Health Center, Sioux Falls, SD, USA
| | - Dorret I Boomsma
- Department of Biological Psychology, Vrije Universiteit, Van der Boechorststraat 7-9, 1081 BT, Amsterdam, The Netherlands
- Amsterdam Public Health Research Institute, VUmc, Amsterdam, The Netherlands
| | - Elsje van Bergen
- Department of Biological Psychology, Vrije Universiteit, Van der Boechorststraat 7-9, 1081 BT, Amsterdam, The Netherlands
- Amsterdam Public Health Research Institute, VUmc, Amsterdam, The Netherlands
| |
Collapse
|
631
|
McGue M, Willoughby EA, Rustichini A, Johnson W, Iacono WG, Lee JJ. The Contribution of Cognitive and Noncognitive Skills to Intergenerational Social Mobility. Psychol Sci 2020; 31:835-847. [PMID: 32603210 DOI: 10.1177/0956797620924677] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023] Open
Abstract
We investigated intergenerational educational and occupational mobility in a sample of 2,594 adult offspring and 2,530 of their parents. Participants completed assessments of general cognitive ability and five noncognitive factors related to social achievement; 88% were also genotyped, allowing computation of educational-attainment polygenic scores. Most offspring were socially mobile. Offspring who scored at least 1 standard deviation higher than their parents on both cognitive and noncognitive measures rarely moved down and frequently moved up. Polygenic scores were also associated with social mobility. Inheritance of a favorable subset of parent alleles was associated with moving up, and inheritance of an unfavorable subset was associated with moving down. Parents' education did not moderate the association of offspring's skill with mobility, suggesting that low-skilled offspring from advantaged homes were not protected from downward mobility. These data suggest that cognitive and noncognitive skills as well as genetic factors contribute to the reordering of social standing that takes place across generations.
Collapse
Affiliation(s)
- Matt McGue
- Department of Psychology, University of Minnesota
| | | | | | - Wendy Johnson
- Department of Psychology, The University of Edinburgh
- Centre for Cognitive Ageing and Cognitive Epidemiology, The University of Edinburgh
| | | | - James J Lee
- Department of Psychology, University of Minnesota
| |
Collapse
|
632
|
Folkersen L, Pain O, Ingason A, Werge T, Lewis CM, Austin J. Impute.me: An Open-Source, Non-profit Tool for Using Data From Direct-to-Consumer Genetic Testing to Calculate and Interpret Polygenic Risk Scores. Front Genet 2020; 11:578. [PMID: 32714365 PMCID: PMC7340159 DOI: 10.3389/fgene.2020.00578] [Citation(s) in RCA: 39] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2019] [Accepted: 05/11/2020] [Indexed: 01/07/2023] Open
Abstract
To date, interpretation of genomic information has focused on single variants conferring disease risk, but most disorders of major public concern have a polygenic architecture. Polygenic risk scores (PRSs) give a single measure of disease liability by summarizing disease risk across hundreds of thousands of genetic variants. They can be calculated in any genome-wide genotype data-source, using a prediction model based on genome-wide summary statistics from external studies. As genome-wide association studies increase in power, the predictive ability for disease risk will also increase. Although PRSs are unlikely ever to be fully diagnostic, they may give valuable medical information for risk stratification, prognosis, or treatment response prediction. Public engagement is therefore becoming important on the potential use and acceptability of PRSs. However, the current public perception of genetics is that it provides "yes/no" answers about the presence/absence of a condition, or the potential for developing a condition, which in not the case for common, complex disorders with polygenic architecture. Meanwhile, unregulated third-party applications are being developed to satisfy consumer demand for information on the impact of lower-risk variants on common diseases that are highly polygenic. Often, applications report results from single-nucleotide polymorphisms (SNPs) and disregard effect size, which is highly inappropriate for common, complex disorders where everybody carries risk variants. Tools are therefore needed to communicate our understanding of genetic vulnerability as a continuous trait, where a genetic liability confers risk for disease. Impute.me is one such tool, whose focus is on education and information on common, complex disorders with polygenetic architecture. Its research-focused open-source website allows users to upload consumer genetics data to obtain PRSs, with results reported on a population-level normal distribution. Diseases can only be browsed by International Classification of Diseases, 10th Revision (ICD-10) chapter-location or alphabetically, thus prompting the user to consider genetic risk scores in a medical context of relevance to the individual. Here, we present an overview of the implementation of the impute.me site, along with analysis of typical usage patterns, which may advance public perception of genomic risk and precision medicine.
Collapse
Affiliation(s)
- Lasse Folkersen
- Institute of Biological Psychiatry, Mental Health Centre Sankt Hans, Copenhagen, Denmark
| | - Oliver Pain
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology & Neuroscience, King’s College London, London, United Kingdom
| | - Andrés Ingason
- Institute of Biological Psychiatry, Mental Health Centre Sankt Hans, Copenhagen, Denmark
| | - Thomas Werge
- Institute of Biological Psychiatry, Mental Health Centre Sankt Hans, Copenhagen, Denmark
| | - Cathryn M. Lewis
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology & Neuroscience, King’s College London, London, United Kingdom
- Department of Medical & Molecular Genetics, Faculty of Life Sciences & Medicine, King’s College London, London, United Kingdom
| | - Jehannine Austin
- Department of Psychiatry, University of British Columbia, Vancouver, BC, Canada
- Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada
| |
Collapse
|
633
|
Song L, Liu A, Shi J. SummaryAUC: a tool for evaluating the performance of polygenic risk prediction models in validation datasets with only summary level statistics. Bioinformatics 2020; 35:4038-4044. [PMID: 30911754 DOI: 10.1093/bioinformatics/btz176] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2018] [Revised: 02/08/2019] [Accepted: 03/22/2019] [Indexed: 12/21/2022] Open
Abstract
MOTIVATION Polygenic risk score (PRS) methods based on genome-wide association studies (GWAS) have a potential for predicting the risk of developing complex diseases and are expected to become more accurate with larger training datasets and innovative statistical methods. The area under the ROC curve (AUC) is often used to evaluate the performance of PRSs, which requires individual genotypic and phenotypic data in an independent GWAS validation dataset. We are motivated to develop methods for approximating AUC of PRSs based on the summary level data of the validation dataset, which will greatly facilitate the development of PRS models for complex diseases. RESULTS We develop statistical methods and an R package SummaryAUC for approximating the AUC and its variance of a PRS when only the summary level data of the validation dataset are available. SummaryAUC can be applied to PRSs with SNPs either genotyped or imputed in the validation dataset. We examined the performance of SummaryAUC using a large-scale GWAS of schizophrenia. SummaryAUC provides accurate approximations to AUCs and their variances. The bias of AUC is typically <0.5% in most analyses. SummaryAUC cannot be applied to PRSs that use all SNPs in the genome because it is computationally prohibitive. AVAILABILITY AND IMPLEMENTATION https://github.com/lsncibb/SummaryAUC. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Lei Song
- Biostatistics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA.,Cancer Genomics Research Laboratory, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Aiyi Liu
- Biostatistics and Bioinformatics Branch, Division of Intramural Population Health Research, National Institute of Child Health and Human Development, Bethesda, MD, USA
| | - Jianxin Shi
- Biostatistics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | | |
Collapse
|
634
|
Chande AT, Rishishwar L, Conley AB, Valderrama-Aguirre A, Medina-Rivas MA, Jordan IK. Ancestry effects on type 2 diabetes genetic risk inference in Hispanic/Latino populations. BMC MEDICAL GENETICS 2020; 21:132. [PMID: 32580712 PMCID: PMC7315475 DOI: 10.1186/s12881-020-01068-0] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/04/2020] [Accepted: 06/10/2020] [Indexed: 12/18/2022]
Abstract
Background Hispanic/Latino (HL) populations bear a disproportionately high burden of type 2 diabetes (T2D). The ability to predict T2D genetic risk using polygenic risk scores (PRS) offers great promise for improved screening and prevention. However, there are a number of complications related to the accurate inference of genetic risk across HL populations with distinct ancestry profiles. We investigated how ancestry affects the inference of T2D genetic risk using PRS in diverse HL populations from Colombia and the United States (US). In Colombia, we compared T2D genetic risk for the Mestizo population of Antioquia to the Afro-Colombian population of Chocó, and in the US, we compared European-American versus Mexican-American populations. Methods Whole genome sequences and genotypes from the 1000 Genomes Project and the ChocoGen Research Project were used for genetic ancestry inference and for T2D polygenic risk score (PRS) calculation. Continental ancestry fractions for HL genomes were inferred via comparison with African, European, and Native American reference genomes, and PRS were calculated using T2D risk variants taken from multiple genome-wide association studies (GWAS) conducted on cohorts with diverse ancestries. A correction for ancestry bias in T2D risk inference based on the frequencies of ancestral versus derived alleles was developed and applied to PRS calculations in the HL populations studied here. Results T2D genetic risk in Colombian and US HL populations is positively correlated with African and Native American ancestry and negatively correlated with European ancestry. The Afro-Colombian population of Chocó has higher predicted T2D risk than Antioquia, and the Mexican-American population has higher predicted risk than the European-American population. The inferred relative risk of T2D is robust to differences in the ancestry of the GWAS cohorts used for variant discovery. For trans-ethnic GWAS, population-specific variants and variants with same direction effects across populations yield consistent results. Nevertheless, the control for bias in T2D risk prediction confirms that explicit consideration of genetic ancestry can yield more reliable cross-population genetic risk inferences. Conclusions T2D associations that replicate across populations provide for more reliable risk inference, and modeling population-specific frequencies of ancestral and derived risk alleles can help control for biases in PRS estimation.
Collapse
Affiliation(s)
- Aroon T Chande
- School of Biological Sciences, Georgia Institute of Technology, 950 Atlantic Drive, Atlanta, GA, 30332, USA.,IHRC-Georgia Tech Applied Bioinformatics Laboratory, Atlanta, GA, USA.,PanAmerican Bioinformatics Institute, Cali, Valle del Cauca, Colombia
| | - Lavanya Rishishwar
- IHRC-Georgia Tech Applied Bioinformatics Laboratory, Atlanta, GA, USA.,PanAmerican Bioinformatics Institute, Cali, Valle del Cauca, Colombia
| | - Andrew B Conley
- IHRC-Georgia Tech Applied Bioinformatics Laboratory, Atlanta, GA, USA.,PanAmerican Bioinformatics Institute, Cali, Valle del Cauca, Colombia
| | - Augusto Valderrama-Aguirre
- School of Biological Sciences, Georgia Institute of Technology, 950 Atlantic Drive, Atlanta, GA, 30332, USA.,PanAmerican Bioinformatics Institute, Cali, Valle del Cauca, Colombia.,Biomedical Research Institute (COL0082529), Cali, Colombia.,Universidad Santiago de Cali, Cali, Colombia
| | - Miguel A Medina-Rivas
- PanAmerican Bioinformatics Institute, Cali, Valle del Cauca, Colombia.,Centro de Investigación en Biodiversidad y Hábitat, Universidad Tecnológica del Chocó, Quibdó, Chocó, Colombia
| | - I King Jordan
- School of Biological Sciences, Georgia Institute of Technology, 950 Atlantic Drive, Atlanta, GA, 30332, USA. .,IHRC-Georgia Tech Applied Bioinformatics Laboratory, Atlanta, GA, USA. .,PanAmerican Bioinformatics Institute, Cali, Valle del Cauca, Colombia.
| |
Collapse
|
635
|
Xie T, Wang B, Nolte IM, van der Most PJ, Oldehinkel AJ, Hartman CA, Snieder H. Genetic Risk Scores for Complex Disease Traits in Youth. CIRCULATION-GENOMIC AND PRECISION MEDICINE 2020; 13:e002775. [PMID: 32527150 PMCID: PMC7439939 DOI: 10.1161/circgen.119.002775] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
BACKGROUND For most disease-related traits the magnitude of the contribution of genetic factors in adolescents remains unclear. METHODS Twenty continuous traits related to anthropometry, cardiovascular and renal function, metabolism, and inflammation were selected from the ongoing prospective Tracking Adolescents' Individual Lives Survey cohort in the Netherlands with measurements of up to 5 waves from age 11 to 22 years (n=1354, 47.6% males) and all traits available at the third wave (mean age [SD]=16.22 [0.66]). For each trait, unweighted and weighted genetic risk scores (GRSs) were generated based on significantly associated single nucleotide polymorphisms identified from literature. The variance explained by the GRSs in adolescents were estimated by linear regression after adjustment for covariates. RESULTS Except for ALT (alanine transaminase), all GRSs were significantly associated with their traits. The trait variance explained by the GRSs was highest for lipoprotein[a] (39.59%) and varied between 0.09% (ALT) and 18.49% (LDL [low-density lipoprotein]) for the other traits. For most traits, the variances explained in adolescents were comparable with or slightly smaller than those in adults. Significant increases of trait levels (except ALT) and increased risks for overweight/obesity (odds ratio, 6.41 [95% CI, 2.95-15.56]) and hypertension (odds ratio, 2.86 [95% CI, 1.39-6.17]) were found in individuals in the top GRS decile compared with those at the bottom decile. CONCLUSIONS Variances explained by adult-based GRSs for disease-related traits in adolescents, although still relatively modest, were comparable with or slightly smaller than in adults offering promise for improved risk prediction at early ages.
Collapse
Affiliation(s)
- Tian Xie
- Department of Epidemiology (T.X., B.W., I.M.N., P.J.v.d.M., H.S.), University of Groningen, University Medical Center Groningen, the Netherlands
| | - Bin Wang
- Department of Epidemiology (T.X., B.W., I.M.N., P.J.v.d.M., H.S.), University of Groningen, University Medical Center Groningen, the Netherlands
| | - Ilja M Nolte
- Department of Epidemiology (T.X., B.W., I.M.N., P.J.v.d.M., H.S.), University of Groningen, University Medical Center Groningen, the Netherlands
| | - Peter J van der Most
- Department of Epidemiology (T.X., B.W., I.M.N., P.J.v.d.M., H.S.), University of Groningen, University Medical Center Groningen, the Netherlands
| | - Albertine J Oldehinkel
- Department of Psychiatry (A.J.O., C.A.H.), University of Groningen, University Medical Center Groningen, the Netherlands
| | - Catharina A Hartman
- Department of Psychiatry (A.J.O., C.A.H.), University of Groningen, University Medical Center Groningen, the Netherlands
| | - Harold Snieder
- Department of Epidemiology (T.X., B.W., I.M.N., P.J.v.d.M., H.S.), University of Groningen, University Medical Center Groningen, the Netherlands
| |
Collapse
|
636
|
Vink JM, Veul L, Abdellaoui A, Hottenga JJ, Boomsma DI, Verweij KJH. Illicit drug use and the genetic overlap with Cannabis use. Drug Alcohol Depend 2020; 213:108102. [PMID: 32585418 DOI: 10.1016/j.drugalcdep.2020.108102] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/04/2019] [Revised: 05/01/2020] [Accepted: 05/26/2020] [Indexed: 11/16/2022]
Abstract
BACKGROUND The use of illicit substances is correlated, meaning that individuals who use one illicit substance are more likely to also use another illicit substance. This association could (partly) be explained by overlapping genetic factors. Genetic overlap may indicate a common underlying genetic predisposition, or can be the result of a causal association. METHODS Polygenic scores for lifetime cannabis use were generated in a sample of Dutch participants (N = 8348). We tested the association of a PGS for cannabis use with ecstasy, stimulants and a broad category of illicit drug use. To explore the nature of the relationship: (1) these analyses were repeated separately in cannabis users and non-users and (2) monozogytic twin pairs discordant for cannabis use were compared on their drug use. RESULTS The lifetime prevalence was 24.8 % for cannabis, 6.2 % for ecstasy, 6.5 % for stimulants and 7.1 % for any illicit drug use. Significant, positive associations were found between PGS for cannabis use with ecstasy use, stimulants and any illicit drug use. These associations seemed to be stronger in cannabis users compared to non-users for both ecstasy and stimulant use, but only in people born after 1968 and not significant after correction for multiple testing. The discordant twin pair analyses suggested that cannabis use could play a causal role in drug use. CONCLUSIONS The genetic liability underlying cannabis use significantly explained variability in ecstasy, stimulant and any illicit drug use. Further research should further explore the underlying mechanism to understand the nature of the association.
Collapse
Affiliation(s)
- Jacqueline M Vink
- Behavioural Science Institute, Radboud University, Montessorilaan 3, 6525 HR, Nijmegen, the Netherlands.
| | - Laura Veul
- Amsterdam UMC, location AMC, University of Amsterdam, Meibergdreef 9, 1105 AZ Amsterdam, the Netherlands
| | - Abdel Abdellaoui
- Amsterdam UMC, location AMC, University of Amsterdam, Meibergdreef 9, 1105 AZ Amsterdam, the Netherlands
| | - Jouke-Jan Hottenga
- Netherlands Twin Register, Department of Biological Psychology, Vrije Universiteit, De Boelelaan 1105, 1081 HV, Amsterdam, the Netherlands
| | - Dorret I Boomsma
- Netherlands Twin Register, Department of Biological Psychology, Vrije Universiteit, De Boelelaan 1105, 1081 HV, Amsterdam, the Netherlands
| | - Karin J H Verweij
- Amsterdam UMC, location AMC, University of Amsterdam, Meibergdreef 9, 1105 AZ Amsterdam, the Netherlands
| |
Collapse
|
637
|
Shared polygenic risk for ADHD, executive dysfunction and other psychiatric disorders. Transl Psychiatry 2020; 10:182. [PMID: 32518222 PMCID: PMC7283259 DOI: 10.1038/s41398-020-00872-9] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/13/2020] [Revised: 05/15/2020] [Accepted: 05/26/2020] [Indexed: 12/14/2022] Open
Abstract
Many psychiatric disorders are associated with impaired executive functioning (EF). The associated EF component varies by psychiatric disorders, and this variation might be due to genetic liability. We explored the genetic association between five psychiatric disorders and EF in clinically-recruited attention deficit hyperactivity disorder (ADHD) children using polygenic risk score (PRS) methodology. Genome-wide association study (GWAS) summary data for ADHD, major depressive disorder (MDD), schizophrenia (SZ), bipolar disorder (BIP) and autism were used to calculate the PRSs. EF was evaluated by the Stroop test for inhibitory control, the trail-making test for cognitive flexibility, and the digital span test for working memory in a Chinese ADHD cohort (n = 1147). Exploratory factor analysis of the three measures identified one principal component for EF (EF-PC). Linear regression models were used to analyze the association between each PRS and the EF measures. The role of EF measures in mediating the effects of the PRSs on ADHD symptoms was also analyzed. The result showed the PRSs for MDD, ADHD and BIP were all significantly associated with the EF-PC. For each EF component, the association results were different for the PRSs of the five psychiatric disorders: the PRSs for ADHD and MDD were associated with inhibitory control (adjusted P = 0.0183 and 0.0313, respectively), the PRS for BIP was associated with working memory (adjusted P = 0.0416), and the PRS for SZ was associated with cognitive flexibility (adjusted P = 0.0335). All three EF measures were significantly correlated with ADHD symptoms. In mediation analyses, the ADHD and MDD PRSs, which were associated with inhibitory control, had significant indirect effects on ADHD symptoms through the mediation of inhibitory control. These findings indicate that the polygenic risks for several psychiatric disorders influence specific executive dysfunction in children with ADHD. The results helped to clarify the relationship between risk genes of each mental disorder and the intermediate cognitive domain, which may further help elucidate the risk genes and motivate efforts to develop EF measures as a diagnostic marker and future treatment target.
Collapse
|
638
|
Aragam KG, Dobbyn A, Judy R, Chaffin M, Chaudhary K, Hindy G, Cagan A, Finneran P, Weng LC, Loos RJF, Nadkarni G, Cho JH, Kember RL, Baras A, Reid J, Overton J, Philippakis A, Ellinor PT, Weiss ST, Rader DJ, Lubitz SA, Smoller JW, Karlson EW, Khera AV, Kathiresan S, Do R, Damrauer SM, Natarajan P. Limitations of Contemporary Guidelines for Managing Patients at High Genetic Risk of Coronary Artery Disease. J Am Coll Cardiol 2020; 75:2769-2780. [PMID: 32498804 PMCID: PMC7346975 DOI: 10.1016/j.jacc.2020.04.027] [Citation(s) in RCA: 92] [Impact Index Per Article: 18.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/15/2020] [Revised: 03/31/2020] [Accepted: 04/07/2020] [Indexed: 11/30/2022]
Abstract
BACKGROUND Polygenic risk scores (PRS) for coronary artery disease (CAD) identify high-risk individuals more likely to benefit from primary prevention statin therapy. Whether polygenic CAD risk is captured by conventional paradigms for assessing clinical cardiovascular risk remains unclear. OBJECTIVES This study sought to intersect polygenic risk with guideline-based recommendations and management patterns for CAD primary prevention. METHODS A genome-wide CAD PRS was applied to 47,108 individuals across 3 U.S. health care systems. The authors then assessed whether primary prevention patients at high polygenic risk might be distinguished on the basis of greater guideline-recommended statin eligibility and higher rates of statin therapy. RESULTS Of 47,108 study participants, the mean age was 60 years, and 11,020 (23.4%) had CAD. The CAD PRS strongly associated with prevalent CAD (odds ratio: 1.4 per SD increase in PRS; p < 0.0001). High polygenic risk (top 20% of PRS) conferred 1.9-fold odds of developing CAD (p < 0.0001). However, among primary prevention patients (n = 33,251), high polygenic risk did not correspond with increased recommendations for statin therapy per the American College of Cardiology/American Heart Association (46.2% for those with high PRS vs. 46.8% for all others, p = 0.54) or U.S. Preventive Services Task Force (43.7% vs. 43.7%, p = 0.99) or higher rates of statin prescriptions (25.0% vs. 23.8%, p = 0.04). An additional 4.1% of primary prevention patients may be recommended for statin therapy if high CAD PRS were considered a guideline-based risk-enhancing factor. CONCLUSIONS Current paradigms for primary cardiovascular prevention incompletely capture a polygenic susceptibility to CAD. An opportunity may exist to improve CAD prevention efforts by integrating both genetic and clinical risk.
Collapse
Affiliation(s)
- Krishna G Aragam
- Center for Genomic Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, Massachusetts; Program in Medical and Population Genetics, Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, Massachusetts; Cardiovascular Research Center, Massachusetts General Hospital, Harvard Medical School, Boston, Massachusetts
| | - Amanda Dobbyn
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York; Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York
| | - Renae Judy
- Department of Surgery, Perlman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania
| | - Mark Chaffin
- Program in Medical and Population Genetics, Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, Massachusetts
| | - Kumardeep Chaudhary
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York; Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York
| | - George Hindy
- Program in Medical and Population Genetics, Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, Massachusetts
| | - Andrew Cagan
- Research Computing, Partners HealthCare, Charlestown, Massachusetts
| | - Phoebe Finneran
- Cardiovascular Research Center, Massachusetts General Hospital, Harvard Medical School, Boston, Massachusetts
| | - Lu-Chen Weng
- Program in Medical and Population Genetics, Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, Massachusetts; Cardiovascular Research Center, Massachusetts General Hospital, Harvard Medical School, Boston, Massachusetts
| | - Ruth J F Loos
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York; The Mindich Child Health and Development Institute, Icahn School of Medicine at Mount Sinai, New York, New York
| | - Girish Nadkarni
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York; Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York
| | - Judy H Cho
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York; Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York; Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, New York
| | - Rachel L Kember
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania
| | - Aris Baras
- Regeneron Genetics Center, Tarrytown, New York
| | | | | | - Anthony Philippakis
- Program in Medical and Population Genetics, Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, Massachusetts
| | - Patrick T Ellinor
- Program in Medical and Population Genetics, Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, Massachusetts; Cardiovascular Research Center, Massachusetts General Hospital, Harvard Medical School, Boston, Massachusetts
| | - Scott T Weiss
- Department of Medicine, Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, Massachusetts
| | - Daniel J Rader
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania
| | - Steven A Lubitz
- Program in Medical and Population Genetics, Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, Massachusetts; Cardiovascular Research Center, Massachusetts General Hospital, Harvard Medical School, Boston, Massachusetts
| | - Jordan W Smoller
- Center for Genomic Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, Massachusetts; Department of Psychiatry, Massachusetts General Hospital, Boston, Massachusetts; Stanley Center for Psychiatric Research, Broad Institute, Boston, Massachusetts
| | - Elizabeth W Karlson
- Department of Medicine, Division of Rheumatology, Immunology and Allergy, Brigham and Women's Hospital, Boston, Massachusetts
| | - Amit V Khera
- Center for Genomic Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, Massachusetts; Program in Medical and Population Genetics, Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, Massachusetts
| | - Sekar Kathiresan
- Center for Genomic Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, Massachusetts; Program in Medical and Population Genetics, Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, Massachusetts
| | - Ron Do
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York; Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York.
| | - Scott M Damrauer
- Department of Surgery, Perlman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania; Corporal Michael Crescenz VA Medical Center, Philadelphia, Pennsylvania
| | - Pradeep Natarajan
- Center for Genomic Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, Massachusetts; Program in Medical and Population Genetics, Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, Massachusetts; Cardiovascular Research Center, Massachusetts General Hospital, Harvard Medical School, Boston, Massachusetts.
| |
Collapse
|
639
|
Ishigaki K, Akiyama M, Kanai M, Takahashi A, Kawakami E, Sugishita H, Sakaue S, Matoba N, Low SK, Okada Y, Terao C, Amariuta T, Gazal S, Kochi Y, Horikoshi M, Suzuki K, Ito K, Koyama S, Ozaki K, Niida S, Sakata Y, Sakata Y, Kohno T, Shiraishi K, Momozawa Y, Hirata M, Matsuda K, Ikeda M, Iwata N, Ikegawa S, Kou I, Tanaka T, Nakagawa H, Suzuki A, Hirota T, Tamari M, Chayama K, Miki D, Mori M, Nagayama S, Daigo Y, Miki Y, Katagiri T, Ogawa O, Obara W, Ito H, Yoshida T, Imoto I, Takahashi T, Tanikawa C, Suzuki T, Sinozaki N, Minami S, Yamaguchi H, Asai S, Takahashi Y, Yamaji K, Takahashi K, Fujioka T, Takata R, Yanai H, Masumoto A, Koretsune Y, Kutsumi H, Higashiyama M, Murayama S, Minegishi N, Suzuki K, Tanno K, Shimizu A, Yamaji T, Iwasaki M, Sawada N, Uemura H, Tanaka K, Naito M, Sasaki M, Wakai K, Tsugane S, Yamamoto M, Yamamoto K, Murakami Y, Nakamura Y, Raychaudhuri S, Inazawa J, Yamauchi T, Kadowaki T, Kubo M, Kamatani Y. Large-scale genome-wide association study in a Japanese population identifies novel susceptibility loci across different diseases. Nat Genet 2020; 52:669-679. [PMID: 32514122 DOI: 10.1038/s41588-020-0640-3] [Citation(s) in RCA: 339] [Impact Index Per Article: 67.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2020] [Accepted: 05/01/2020] [Indexed: 12/20/2022]
Abstract
The overwhelming majority of participants in current genetic studies are of European ancestry. To elucidate disease biology in the East Asian population, we conducted a genome-wide association study (GWAS) with 212,453 Japanese individuals across 42 diseases. We detected 320 independent signals in 276 loci for 27 diseases, with 25 novel loci (P < 9.58 × 10-9). East Asian-specific missense variants were identified as candidate causal variants for three novel loci, and we successfully replicated two of them by analyzing independent Japanese cohorts; p.R220W of ATG16L2 (associated with coronary artery disease) and p.V326A of POT1 (associated with lung cancer). We further investigated enrichment of heritability within 2,868 annotations of genome-wide transcription factor occupancy, and identified 378 significant enrichments across nine diseases (false discovery rate < 0.05) (for example, NKX3-1 for prostate cancer). This large-scale GWAS in a Japanese population provides insights into the etiology of complex diseases and highlights the importance of performing GWAS in non-European populations.
Collapse
Affiliation(s)
- Kazuyoshi Ishigaki
- Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan.,Center for Data Sciences, Harvard Medical School, Boston, MA, USA.,Divisions of Genetics and Rheumatology, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.,Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Masato Akiyama
- Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan.,Department of Ophthalmology, Graduate School of Medical Sciences, Kyushu University, Fukuoka, Japan
| | - Masahiro Kanai
- Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan.,Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Atsushi Takahashi
- Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan.,Department of Genomic Medicine, Research Institute, National Cerebral and Cardiovascular Center, Osaka, Japan
| | - Eiryo Kawakami
- Medical Sciences Innovation Hub Program (MIH), RIKEN, Yokohama, Japan.,Laboratory for Developmental Genetics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan.,Artificial Intelligence Medicine, Graduate School of Medicine, Chiba University, Chiba, Japan
| | - Hiroki Sugishita
- Laboratory for Developmental Genetics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Saori Sakaue
- Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan.,Department of Statistical Genetics, Osaka University Graduate School of Medicine, Osaka, Japan.,Department of Allergy and Rheumatology, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
| | - Nana Matoba
- Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan.,Department of Genetics and UNC Neuroscience Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Siew-Kee Low
- Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan.,Cancer Precision Medicine Center, Japanese Foundation for Cancer Research, Tokyo, Japan
| | - Yukinori Okada
- Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan.,Department of Statistical Genetics, Osaka University Graduate School of Medicine, Osaka, Japan.,Laboratory of Statistical Immunology, WPI Immunology Frontier Research Center, Osaka University, Osaka, Japan.,Integrated Frontier Research for Medical Science Division, Institute for Open and Transdisciplinary Research Initiatives, Osaka University, Osaka, Japan
| | - Chikashi Terao
- Laboratory for Statistical and Translational Genetics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Tiffany Amariuta
- Center for Data Sciences, Harvard Medical School, Boston, MA, USA.,Divisions of Genetics and Rheumatology, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.,Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.,Graduate School of Arts and Sciences, Harvard University, Cambridge, MA, USA
| | - Steven Gazal
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Yuta Kochi
- Laboratory for Autoimmune Diseases, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan.,Department of Genomic Function and Diversity, Medical Research Institute, Tokyo Medical and Dental University, Tokyo, Japan
| | - Momoko Horikoshi
- Laboratory for Genomics of Diabetes and Metabolism, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Ken Suzuki
- Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan.,Department of Statistical Genetics, Osaka University Graduate School of Medicine, Osaka, Japan.,Laboratory for Genomics of Diabetes and Metabolism, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan.,Department of Diabetes and Metabolic Diseases, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
| | - Kaoru Ito
- Laboratory for Cardiovascular Genomics and Informatics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Satoshi Koyama
- Laboratory for Cardiovascular Genomics and Informatics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Kouichi Ozaki
- Medical Genome Center, National Center for Geriatrics and Gerontology, Obu, Japan
| | - Shumpei Niida
- Medical Genome Center, National Center for Geriatrics and Gerontology, Obu, Japan
| | - Yasushi Sakata
- Department of Cardiovascular Medicine, Osaka University Graduate School of Medicine, Osaka, Japan
| | - Yasuhiko Sakata
- Department of Cardiovascular Medicine, Tohoku University Graduate School of Medicine, Tohoku, Japan
| | - Takashi Kohno
- Division of Genome Biology, National Cancer Center Research Institute, Tokyo, Japan
| | - Kouya Shiraishi
- Division of Genome Biology, National Cancer Center Research Institute, Tokyo, Japan
| | - Yukihide Momozawa
- Laboratory for Genotyping Development, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Makoto Hirata
- Institute of Medical Science, The University of Tokyo, Tokyo, Japan
| | - Koichi Matsuda
- Graduate School of Frontier Sciences, The University of Tokyo, Tokyo, Japan
| | - Masashi Ikeda
- Department of Psychiatry, Fujita Health University School of Medicine, Aichi, Japan
| | - Nakao Iwata
- Department of Psychiatry, Fujita Health University School of Medicine, Aichi, Japan
| | - Shiro Ikegawa
- Laboratory for Bone and Joint Diseases, RIKEN Center for Integrative Medical Sciences, Tokyo, Japan
| | - Ikuyo Kou
- Laboratory for Bone and Joint Diseases, RIKEN Center for Integrative Medical Sciences, Tokyo, Japan
| | - Toshihiro Tanaka
- Laboratory for Cardiovascular Diseases, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan.,Department of Human Genetics and Disease Diversity, Graduate School of Medical and Dental Sciences, Tokyo Medical and Dental University, Tokyo, Japan
| | - Hidewaki Nakagawa
- Laboratory for Genome Sequencing Analysis, RIKEN Center for Integrative Medical Sciences, Tokyo, Japan
| | - Akari Suzuki
- Laboratory for Autoimmune Diseases, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Tomomitsu Hirota
- Laboratory for Respiratory and Allergic Diseases, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Mayumi Tamari
- Laboratory for Respiratory and Allergic Diseases, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Kazuaki Chayama
- Department of Gastroenterology and Metabolism, Graduate School of Biomedical and Health Sciences, Hiroshima University, Hiroshima, Japan
| | - Daiki Miki
- Department of Gastroenterology and Metabolism, Graduate School of Biomedical and Health Sciences, Hiroshima University, Hiroshima, Japan
| | - Masaki Mori
- Department of Surgery and Sciences, Graduate School of Medicine, Kyushu University, Fukuoka, Japan
| | - Satoshi Nagayama
- Department of Gastroenterological Surgery, The Cancer Institute Hospital of the Japanese Foundation for Cancer Research, Tokyo, Japan
| | - Yataro Daigo
- Department of Medical Oncology and Cancer Center, and Center for Advanced Medicine against Cancer, Shiga University of Medical Science, Shiga, Japan.,Center for Antibody and Vaccine Therapy, Research Hospital, Institute of Medical Science, The University of Tokyo, Tokyo, Japan
| | - Yoshio Miki
- Department of Genetic Diagnosis, The Cancer Institute, Japanese Foundation for Cancer Research, Tokyo, Japan
| | - Toyomasa Katagiri
- Division of Genome Medicine, Institute for Genome Research, Tokushima University, Tokushima, Japan
| | - Osamu Ogawa
- Department of Urology, Kyoto University Graduate School of Medicine, Kyoto, Japan
| | - Wataru Obara
- Department of Urology, Iwate Medical University School of Medicine, Iwate, Japan
| | - Hidemi Ito
- Division of Cancer Information and Control, Aichi Cancer Center Research Institute, Nagoya, Japan.,Division of Descriptive Cancer Epidemiology, Nagoya University Graduate School of Medicine, Nagoya, Japan
| | - Teruhiko Yoshida
- Division of Genetics, National Cancer Center Research Institute, Tokyo, Japan
| | - Issei Imoto
- Division of Molecular Genetics, Aichi Cancer Center Research Institute, Nagoya, Japan.,Risk Assessment Center, Aichi Caner Center Hospital, Nagoya, Japan.,Division of Cancer Genetics, Nagoya University Graduate School of Medicine, Nagoya, Japan
| | | | - Chizu Tanikawa
- Laboratory of Genome Technology, Human Genome Center, Institute of Medical Science, The University of Tokyo, Tokyo, Japan
| | | | | | - Shiro Minami
- Department of Bioregulation, Nippon Medical School, Kawasaki, Japan
| | | | - Satoshi Asai
- Division of Pharmacology, Department of Biomedical Science, Nihon University School of Medicine, Tokyo, Japan.,Division of Genomic Epidemiology and Clinical Trials, Clinical Trials Research Center, Nihon University School of Medicine, Tokyo, Japan
| | - Yasuo Takahashi
- Division of Genomic Epidemiology and Clinical Trials, Clinical Trials Research Center, Nihon University School of Medicine, Tokyo, Japan
| | - Ken Yamaji
- Department of Internal Medicine and Rheumatology, Juntendo University Graduate School of Medicine, Tokyo, Japan
| | - Kazuhisa Takahashi
- Department of Respiratory Medicine, Juntendo University Graduate School of Medicine, Tokyo, Japan
| | - Tomoaki Fujioka
- Department of Urology, Iwate Medical University School of Medicine, Iwate, Japan
| | - Ryo Takata
- Department of Urology, Iwate Medical University School of Medicine, Iwate, Japan
| | - Hideki Yanai
- Fukujuji Hospital, Japan Anti-Tuberculosis Association, Tokyo, Japan
| | | | | | - Hiromu Kutsumi
- Center for Clinical Research and Advanced Medicine, Shiga University of Medical Science, Shiga, Japan
| | - Masahiko Higashiyama
- Department of General Thoracic Surgery, Osaka International Cancer Institute, Osaka, Japan
| | - Shigeo Murayama
- Department of Neurology and Neuropathology (the Brain Bank for Aging Research), Tokyo Metropolitan Geriatric Hospital and Institute of Gerontology, Tokyo, Japan
| | - Naoko Minegishi
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
| | - Kichiya Suzuki
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
| | - Kozo Tanno
- Iwate Tohoku Medical Megabank Organization, Iwate Medical University, Iwate, Japan
| | - Atsushi Shimizu
- Iwate Tohoku Medical Megabank Organization, Iwate Medical University, Iwate, Japan
| | - Taiki Yamaji
- Division of Epidemiology, Center for Public Health Sciences, National Cancer Center, Tokyo, Japan
| | - Motoki Iwasaki
- Division of Epidemiology, Center for Public Health Sciences, National Cancer Center, Tokyo, Japan
| | - Norie Sawada
- Division of Epidemiology, Center for Public Health Sciences, National Cancer Center, Tokyo, Japan
| | - Hirokazu Uemura
- Department of Preventive Medicine, Institute of Biomedical Sciences, Tokushima University Graduate School, Tokushima, Japan.,College of Nursing Art and Science, University of Hyogo, Akashi, Japan
| | - Keitaro Tanaka
- Department of Preventive Medicine, Saga University Faculty of Medicine, Saga, Japan
| | - Mariko Naito
- Department of Preventive Medicine, Nagoya University Graduate School of Medicine, Nagoya, Japan.,Department of Oral Epidemiology, Graduate School of Biomedical and Health Sciences, Hiroshima University, Hiroshima, Japan
| | - Makoto Sasaki
- Iwate Tohoku Medical Megabank Organization, Iwate Medical University, Iwate, Japan
| | - Kenji Wakai
- Department of Preventive Medicine, Nagoya University Graduate School of Medicine, Nagoya, Japan
| | - Shoichiro Tsugane
- Center for Public Health Sciences, National Cancer Center, Tokyo, Japan
| | - Masayuki Yamamoto
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
| | - Kazuhiko Yamamoto
- Laboratory for Autoimmune Diseases, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Yoshinori Murakami
- Division of Molecular Pathology, Institute of Medical Science, The University of Tokyo, Tokyo, Japan
| | - Yusuke Nakamura
- Human Genome Center, Institute of Medical Science, The University of Tokyo, Tokyo, Japan
| | - Soumya Raychaudhuri
- Center for Data Sciences, Harvard Medical School, Boston, MA, USA. .,Divisions of Genetics and Rheumatology, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA. .,Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA. .,Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA. .,Centre for Genetics and Genomics Versus Arthritis, Centre for Musculoskeletal Research, Manchester Academic Health Science Centre, The University of Manchester, Manchester, UK.
| | - Johji Inazawa
- Department of Molecular Cytogenetics, Medical Research Institute, Tokyo Medical and Dental University, Tokyo, Japan. .,Bioresource Research Center, Tokyo Medical and Dental University, Tokyo, Japan.
| | - Toshimasa Yamauchi
- Department of Diabetes and Metabolic Diseases, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan.
| | - Takashi Kadowaki
- Department of Diabetes and Metabolic Diseases, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan.
| | - Michiaki Kubo
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan.
| | - Yoichiro Kamatani
- Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan. .,Laboratory of Complex Trait Genomics, Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Tokyo, Japan.
| |
Collapse
|
640
|
Shi H, Burch KS, Johnson R, Freund MK, Kichaev G, Mancuso N, Manuel AM, Dong N, Pasaniuc B. Localizing Components of Shared Transethnic Genetic Architecture of Complex Traits from GWAS Summary Data. Am J Hum Genet 2020; 106:805-817. [PMID: 32442408 PMCID: PMC7273527 DOI: 10.1016/j.ajhg.2020.04.012] [Citation(s) in RCA: 58] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2019] [Accepted: 04/20/2020] [Indexed: 12/19/2022] Open
Abstract
Despite strong transethnic genetic correlations reported in the literature for many complex traits, the non-transferability of polygenic risk scores across populations suggests the presence of population-specific components of genetic architecture. We propose an approach that models GWAS summary data for one trait in two populations to estimate genome-wide proportions of population-specific/shared causal SNPs. In simulations across various genetic architectures, we show that our approach yields approximately unbiased estimates with in-sample LD and slight upward-bias with out-of-sample LD. We analyze nine complex traits in individuals of East Asian and European ancestry, restricting to common SNPs (MAF > 5%), and find that most common causal SNPs are shared by both populations. Using the genome-wide estimates as priors in an empirical Bayes framework, we perform fine-mapping and observe that high-posterior SNPs (for both the population-specific and shared causal configurations) have highly correlated effects in East Asians and Europeans. In population-specific GWAS risk regions, we observe a 2.8× enrichment of shared high-posterior SNPs, suggesting that population-specific GWAS risk regions harbor shared causal SNPs that are undetected in the other GWASs due to differences in LD, allele frequencies, and/or sample size. Finally, we report enrichments of shared high-posterior SNPs in 53 tissue-specific functional categories and find evidence that SNP-heritability enrichments are driven largely by many low-effect common SNPs.
Collapse
Affiliation(s)
- Huwenbo Shi
- Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA 90095, USA; Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.
| | - Kathryn S Burch
- Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA 90095, USA.
| | - Ruth Johnson
- Department of Computer Science, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Malika K Freund
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Gleb Kichaev
- Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Nicholas Mancuso
- Center for Genetic Epidemiology, Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA
| | - Astrid M Manuel
- Department of Biological Sciences, Florida International University, Miami, FL 33199, USA
| | - Natalie Dong
- Department of Biomedical Engineering, Boston University, Boston, MA 02215, USA
| | - Bogdan Pasaniuc
- Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA 90095, USA; Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA; Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA; Department of Computational Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA
| |
Collapse
|
641
|
Lamri A, Mao S, Desai D, Gupta M, Paré G, Anand SS. Fine-tuning of Genome-Wide Polygenic Risk Scores and Prediction of Gestational Diabetes in South Asian Women. Sci Rep 2020; 10:8941. [PMID: 32488059 PMCID: PMC7265287 DOI: 10.1038/s41598-020-65360-y] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2019] [Accepted: 04/23/2020] [Indexed: 12/11/2022] Open
Abstract
Gestational diabetes Mellitus (GDM) affects 1 in 7 births and is associated with numerous adverse health outcomes for both mother and child. GDM is suspected to share a large common genetic background with type 2 diabetes (T2D). The aim of our study was to characterize different GDM polygenic risk scores (PRSs) and test their association with GDM using data from the South Asian Birth Cohort (START). PRSs were derived for 832 South Asian women from START using the pruning and thresholding (P + T), LDpred, and GraBLD methods. Weights were derived from a multi-ethnic and a white Caucasian study of the DIAGRAM consortium. GDM status was defined using South Asian-specific glucose values in response to an oral glucose tolerance test. Association with GDM was tested using logistic regression. Results were replicated in South Asian women from the UK Biobank (UKB) study. The top ranking P + T, LDpred and GraBLD PRSs were all based on DIAGRAM's multi-ethnic study. The best PRS was highly associated with GDM in START (AUC = 0.62, OR = 1.60 [95% CI = 1.44-1.69]), and in South Asian women from UKB (AUC = 0.65, OR = 1.69 [95% CI = 1.28-2.24]). Our results highlight the importance of combining genome-wide genotypes and summary statistics from large multi-ethnic studies to optimize PRSs in South Asians.
Collapse
Affiliation(s)
- Amel Lamri
- Department of Medicine, McMaster University Hamilton, Ontario, Canada
- Population Health Research Institute (PHRI), Hamilton, Ontario, Canada
| | - Shihong Mao
- Population Health Research Institute (PHRI), Hamilton, Ontario, Canada
| | - Dipika Desai
- Population Health Research Institute (PHRI), Hamilton, Ontario, Canada
| | - Milan Gupta
- Department of Medicine, McMaster University Hamilton, Ontario, Canada
- Canadian Collaborative Research Network (CCRN), Brampton, ON, Canada
| | - Guillaume Paré
- Population Health Research Institute (PHRI), Hamilton, Ontario, Canada
- Department of Pathology and Molecular Medicine, McMaster University, Hamilton, Ontario, Canada
| | - Sonia S Anand
- Department of Medicine, McMaster University Hamilton, Ontario, Canada.
- Population Health Research Institute (PHRI), Hamilton, Ontario, Canada.
- Department of Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, Ontario, Canada.
| |
Collapse
|
642
|
A positively selected FBN1 missense variant reduces height in Peruvian individuals. Nature 2020; 582:234-239. [PMID: 32499652 PMCID: PMC7410362 DOI: 10.1038/s41586-020-2302-0] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2019] [Accepted: 03/10/2020] [Indexed: 01/21/2023]
Abstract
On average, Peruvian individuals are among the shortest in the world1. Here we show that Native American ancestry is associated with reduced height in an ethnically diverse group of Peruvian individuals, and identify a population-specific, missense variant in the FBN1 gene (E1297G) that is significantly associated with lower height. Each copy of the minor allele (frequency of 4.7%) reduces height by 2.2 cm (4.4 cm in homozygous individuals). To our knowledge, this is the largest effect size known for a common height-associated variant. FBN1 encodes the extracellular matrix protein fibrillin 1, which is a major structural component of microfibrils. We observed less densely packed fibrillin-1-rich microfibrils with irregular edges in the skin of individuals who were homozygous for G1297 compared with individuals who were homozygous for E1297. Moreover, we show that the E1297G locus is under positive selection in non-African populations, and that the E1297 variant shows subtle evidence of positive selection specifically within the Peruvian population. This variant is also significantly more frequent in coastal Peruvian populations than in populations from the Andes or the Amazon, which suggests that short stature might be the result of adaptation to factors that are associated with the coastal environment in Peru.
Collapse
|
643
|
Papageorge NW, Thom K. Genes, Education, and Labor Market Outcomes: Evidence from the Health and Retirement Study. JOURNAL OF THE EUROPEAN ECONOMIC ASSOCIATION 2020; 18:1351-1399. [PMID: 32587483 PMCID: PMC7297142 DOI: 10.1093/jeea/jvz072] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
Recent advances have led to the discovery of specific genetic variants that predict educational attainment. We study how these variants, summarized as a linear index-known as a polygenic score-are associated with human capital accumulation and labor market outcomes in the Health and Retirement Study (HRS). We present two main sets of results. First, we find evidence that the genetic factors measured by this score interact strongly with childhood socioeconomic status in determining educational outcomes. In particular, although the polygenic score predicts higher rates of college graduation on average, this relationship is substantially stronger for individuals who grew up in households with higher socioeconomic status relative to those who grew up in poorer households. Second, the polygenic score predicts labor earnings even after adjusting for completed education, with larger returns in more recent decades. These patterns suggest that the genetic traits that promote education might allow workers to better accommodate ongoing skill biased technological change. Consistent with this interpretation, we find a positive association between the polygenic score and nonroutine analytic tasks that have benefited from the introduction of new technologies. Nonetheless, the college premium remains a dominant determinant of earnings differences at all levels of the polygenic score. Given the role of childhood SES in predicting college attainment, this raises concerns about wasted potential arising from limited household resources.
Collapse
|
644
|
Dale PS, von Stumm S, Selzam S, Hayiou-Thomas ME. Does the Inclusion of a Genome-Wide Polygenic Score Improve Early Risk Prediction for Later Language and Literacy Delay? JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:1467-1478. [PMID: 32343923 PMCID: PMC7842126 DOI: 10.1044/2020_jslhr-19-00161] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/22/2019] [Revised: 10/25/2019] [Accepted: 01/27/2020] [Indexed: 06/11/2023]
Abstract
Purpose The ability to identify children early in development who are at substantial risk for language/literacy difficulties would have great benefit both for the children and for the educational and therapeutic institutions that serve them. Information that is relatively easily available prior to the age of 3 years, such as late talking, family history of language/literacy difficulties, and socioeconomic status, have some but very limited predictive power. Here, we examine whether the inclusion of a DNA-based genome-wide polygenic score that has been shown to capture children's genetic propensity for educational attainment (EA3) adds enough prediction to yield a clinically useful score. Method Data are longitudinal scores of 1,420 children from the Twins Early Development Study, who were assessed at ages 2 and 3 years on language and nonverbal ability and at 12 years of age on oral language, word decoding, and reading comprehension. Five risk factors were examined: expressive vocabulary, nonverbal ability (these two from parent report), family history, mothers' education, and EA3. Analyses were conducted both for continuous and categorically defined measures of risk and outcome. Results Language and literacy abilities at 12 years of age were significantly but modestly predicted by the risk factors, with a small but significant added prediction from EA3. Indices of diagnostic validity for poor outcomes, such as sensitivity and area under the curve statistics, were poor in all cases. Conclusions We conclude that, at present, clinically useful prediction from toddlerhood remains an unattained goal. Supplemental Material https://doi.org/10.23641/asha.12170331.
Collapse
Affiliation(s)
- Philip S. Dale
- Department of Speech & Hearing Sciences, University of New Mexico, Albuquerque
| | | | | | | |
Collapse
|
645
|
Lewis CM, Vassos E. Polygenic risk scores: from research tools to clinical instruments. Genome Med 2020; 12:44. [PMID: 32423490 PMCID: PMC7236300 DOI: 10.1186/s13073-020-00742-5] [Citation(s) in RCA: 755] [Impact Index Per Article: 151.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2019] [Accepted: 05/01/2020] [Indexed: 12/19/2022] Open
Abstract
Genome-wide association studies have shown unequivocally that common complex disorders have a polygenic genetic architecture and have enabled researchers to identify genetic variants associated with diseases. These variants can be combined into a polygenic risk score that captures part of an individual's susceptibility to diseases. Polygenic risk scores have been widely applied in research studies, confirming the association between the scores and disease status, but their clinical utility has yet to be established. Polygenic risk scores may be used to estimate an individual's lifetime genetic risk of disease, but the current discriminative ability is low in the general population. Clinical implementation of polygenic risk score (PRS) may be useful in cohorts where there is a higher prior probability of disease, for example, in early stages of diseases to assist in diagnosis or to inform treatment choices. Important considerations are the weaker evidence base in application to non-European ancestry and the challenges in translating an individual's PRS from a percentile of a normal distribution to a lifetime disease risk. In this review, we consider how PRS may be informative at different points in the disease trajectory giving examples of progress in the field and discussing obstacles that need to be addressed before clinical implementation.
Collapse
Affiliation(s)
- Cathryn M Lewis
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology & Neuroscience, King's College London, de Crespigny Park, London, SE5 8AF, UK.
- Department of Medical and Molecular Genetics, Faculty of Life Sciences and Medicine, King's College London, London, UK.
| | - Evangelos Vassos
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology & Neuroscience, King's College London, de Crespigny Park, London, SE5 8AF, UK
| |
Collapse
|
646
|
Engemann K, Pedersen CB, Agerbo E, Arge L, Børglum AD, Erikstrup C, Hertel O, Hougaard DM, McGrath JJ, Mors O, Mortensen PB, Nordentoft M, Sabel CE, Sigsgaard T, Tsirogiannis C, Vilhjálmsson BJ, Werge T, Svenning JC, Horsdal HT. Association Between Childhood Green Space, Genetic Liability, and the Incidence of Schizophrenia. Schizophr Bull 2020; 46:1629-1637. [PMID: 32415773 PMCID: PMC8496913 DOI: 10.1093/schbul/sbaa058] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Childhood exposure to green space has previously been associated with lower risk of developing schizophrenia later in life. It is unclear whether this association is mediated by genetic liability or whether the 2 risk factors work additively. Here, we investigate possible gene-environment associations with the hazard ratio (HR) of schizophrenia by combining (1) an estimate of childhood exposure to residential-level green space based on the normalized difference vegetation index (NDVI) from Landsat satellite images, with (2) genetic liability estimates based on polygenic risk scores for 19 746 genotyped individuals from the Danish iPSYCH sample. We used information from the Danish registers of health, residential address, and socioeconomic status to adjust HR estimates for established confounders, ie, parents' socioeconomic status, and family history of mental illness. The adjusted HRs show that growing up surrounded by the highest compared to the lowest decile of NDVI was associated with a 0.52-fold (95% confidence interval [CI]: 0.40 to 0.66) lower schizophrenia risk, and children with the highest polygenic risk score had a 1.24-fold (95% CI: 1.18 to 1.30) higher schizophrenia risk. We found that NDVI explained 1.45% (95% CI: 1.07 to 1.90) of the variance on the liability scale, while polygenic risk score for schizophrenia explained 1.01% (95% CI: 0.77 to 1.46). Together they explained 2.40% (95% CI: 1.99 to 3.07) with no indication of a gene-environment interaction (P = .29). Our results suggest that risk of schizophrenia is associated additively with green space exposure and genetic liability, and provide no support for an environment-gene interaction between NDVI and schizophrenia.
Collapse
Affiliation(s)
- Kristine Engemann
- Section for Ecoinformatics and Biodiversity, Department of Bioscience, Aarhus University, 8000 Aarhus C, Denmark,Center for Biodiversity Dynamics in a Changing World (BIOCHANGE), Aarhus University, 8000 Aarhus C, Denmark,Centre for Integrated Register-Based Research, CIRRAU, Aarhus University, 8210 Aarhus V, Denmark,Big Data Centre for Environment and Health, Aarhus University, 8210 Aarhus V, Denmark,To whom correspondence should be addressed; Section for Ecoinformatics and Biodiversity, Department of Bioscience, Aarhus University, Ny Munkegade 114, 8000 Aarhus C, Denmark; tel: +45-87154316; fax: +45-87154323; e-mail:
| | - Carsten Bøcker Pedersen
- Centre for Integrated Register-Based Research, CIRRAU, Aarhus University, 8210 Aarhus V, Denmark,Big Data Centre for Environment and Health, Aarhus University, 8210 Aarhus V, Denmark,National Centre for Register-Based Research, Aarhus BSS, Department of Economics and Business Economics, Aarhus University, 8210 Aarhus V, Denmark,The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus University, 8210 Aarhus V, Denmark
| | - Esben Agerbo
- Centre for Integrated Register-Based Research, CIRRAU, Aarhus University, 8210 Aarhus V, Denmark,National Centre for Register-Based Research, Aarhus BSS, Department of Economics and Business Economics, Aarhus University, 8210 Aarhus V, Denmark,The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus University, 8210 Aarhus V, Denmark
| | - Lars Arge
- Center for Massive Data Algorithmics, MADALGO, and Department of Computer Science, Aarhus University, 8200 Aarhus N, Denmark
| | - Anders Dupont Børglum
- The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus University, 8210 Aarhus V, Denmark,Department of Biomedicine and Center for Integrative Sequencing, iSEQ, Aarhus University, 8000 Aarhus C, Denmark,Center for Genomics and Personalized Medicine, Aarhus University and Aarhus University Hospital, 8000 Aarhus C, Denmark
| | - Christian Erikstrup
- Big Data Centre for Environment and Health, Aarhus University, 8210 Aarhus V, Denmark,Department of Clinical Medicine, Aarhus University, 8200 Aarhus N, Denmark
| | - Ole Hertel
- Big Data Centre for Environment and Health, Aarhus University, 8210 Aarhus V, Denmark,Department of Environmental Science, Aarhus University, 4000 Roskilde, Denmark
| | - David Michael Hougaard
- The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus University, 8210 Aarhus V, Denmark,Department for Congenital Disorders, Statens Serum Institut, 2300 Copenhagen S, Denmark
| | - John J McGrath
- National Centre for Register-Based Research, Aarhus BSS, Department of Economics and Business Economics, Aarhus University, 8210 Aarhus V, Denmark,Queensland Brain Institute, The University of Queensland, St Lucia, QLD, Australia,Queensland Centre for Mental Health Research, The Park Centre for Mental Health, Wacol, QLD, Australia
| | - Ole Mors
- The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus University, 8210 Aarhus V, Denmark,Psychosis Research Unit, Aarhus University Hospital Psychiatry, 8200 Aarhus N, Denmark
| | - Preben Bo Mortensen
- Centre for Integrated Register-Based Research, CIRRAU, Aarhus University, 8210 Aarhus V, Denmark,National Centre for Register-Based Research, Aarhus BSS, Department of Economics and Business Economics, Aarhus University, 8210 Aarhus V, Denmark,The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus University, 8210 Aarhus V, Denmark
| | - Merete Nordentoft
- The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus University, 8210 Aarhus V, Denmark,Mental Health Centre Copenhagen, Capital Region of Denmark, Copenhagen University Hospital, 2100 Copenhagen Ø, Denmark
| | - Clive Eric Sabel
- Big Data Centre for Environment and Health, Aarhus University, 8210 Aarhus V, Denmark,Department of Environmental Science, Aarhus University, 4000 Roskilde, Denmark
| | - Torben Sigsgaard
- Big Data Centre for Environment and Health, Aarhus University, 8210 Aarhus V, Denmark,Department of Public Health, Aarhus University, 8000 Aarhus C, Denmark
| | - Constantinos Tsirogiannis
- Center for Massive Data Algorithmics, MADALGO, and Department of Computer Science, Aarhus University, 8200 Aarhus N, Denmark
| | - Bjarni Jóhann Vilhjálmsson
- National Centre for Register-Based Research, Aarhus BSS, Department of Economics and Business Economics, Aarhus University, 8210 Aarhus V, Denmark,The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus University, 8210 Aarhus V, Denmark
| | - Thomas Werge
- The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus University, 8210 Aarhus V, Denmark,Department of Clinical Medicine, University of Copenhagen, 2200 Copenhagen N, Denmark,Institute of Biological Psychiatry, Copenhagen Mental Health Services, 4000 Roskilde, Denmark
| | - Jens-Christian Svenning
- Section for Ecoinformatics and Biodiversity, Department of Bioscience, Aarhus University, 8000 Aarhus C, Denmark,Center for Biodiversity Dynamics in a Changing World (BIOCHANGE), Aarhus University, 8000 Aarhus C, Denmark
| | - Henriette Thisted Horsdal
- Big Data Centre for Environment and Health, Aarhus University, 8210 Aarhus V, Denmark,National Centre for Register-Based Research, Aarhus BSS, Department of Economics and Business Economics, Aarhus University, 8210 Aarhus V, Denmark
| |
Collapse
|
647
|
Cano-Gamez E, Trynka G. From GWAS to Function: Using Functional Genomics to Identify the Mechanisms Underlying Complex Diseases. Front Genet 2020; 11:424. [PMID: 32477401 PMCID: PMC7237642 DOI: 10.3389/fgene.2020.00424] [Citation(s) in RCA: 341] [Impact Index Per Article: 68.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2019] [Accepted: 04/06/2020] [Indexed: 12/19/2022] Open
Abstract
Genome-wide association studies (GWAS) have successfully mapped thousands of loci associated with complex traits. These associations could reveal the molecular mechanisms altered in common complex diseases and result in the identification of novel drug targets. However, GWAS have also left a number of outstanding questions. In particular, the majority of disease-associated loci lie in non-coding regions of the genome and, even though they are thought to play a role in gene expression regulation, it is unclear which genes they regulate and in which cell types or physiological contexts this regulation occurs. This has hindered the translation of GWAS findings into clinical interventions. In this review we summarize how these challenges have been addressed over the last decade, with a particular focus on the integration of GWAS results with functional genomics datasets. Firstly, we investigate how the tissues and cell types involved in diseases can be identified using methods that test for enrichment of GWAS variants in genomic annotations. Secondly, we explore how to find the genes regulated by GWAS loci using methods that test for colocalization of GWAS signals with molecular phenotypes such as quantitative trait loci (QTLs). Finally, we highlight potential future research avenues such as integrating GWAS results with single-cell sequencing read-outs, designing functionally informed polygenic risk scores (PRS), and validating disease associated genes using genetic engineering. These tools will be crucial to identify new drug targets for common complex diseases.
Collapse
Affiliation(s)
- Eddie Cano-Gamez
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| | - Gosia Trynka
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
- Open Targets, Wellcome Genome Campus, Cambridge, United Kingdom
| |
Collapse
|
648
|
Shen X, Howard DM, Adams MJ, Hill WD, Clarke TK, Deary IJ, Whalley HC, McIntosh AM. A phenome-wide association and Mendelian Randomisation study of polygenic risk for depression in UK Biobank. Nat Commun 2020; 11:2301. [PMID: 32385265 PMCID: PMC7210889 DOI: 10.1038/s41467-020-16022-0] [Citation(s) in RCA: 75] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2019] [Accepted: 04/02/2020] [Indexed: 12/15/2022] Open
Abstract
Depression is a leading cause of worldwide disability but there remains considerable uncertainty regarding its neural and behavioural associations. Here, using non-overlapping Psychiatric Genomics Consortium (PGC) datasets as a reference, we estimate polygenic risk scores for depression (depression-PRS) in a discovery (N = 10,674) and replication (N = 11,214) imaging sample from UK Biobank. We report 77 traits that are significantly associated with depression-PRS, in both discovery and replication analyses. Mendelian Randomisation analysis supports a potential causal effect of liability to depression on brain white matter microstructure (β: 0.125 to 0.868, pFDR < 0.043). Several behavioural traits are also associated with depression-PRS (β: 0.014 to 0.180, pFDR: 0.049 to 1.28 × 10−14) and we find a significant and positive interaction between depression-PRS and adverse environmental exposures on mental health outcomes. This study reveals replicable associations between depression-PRS and white matter microstructure. Our results indicate that white matter microstructure differences may be a causal consequence of liability to depression. Depression is correlated with many brain-related traits. Here, Shen et al. perform phenome-wide association studies of a depression polygenic risk score (PRS) and find associations with 51 behavioural and 26 neuroimaging traits which are further followed up on using Mendelian randomization and mediation analyses.
Collapse
Affiliation(s)
- Xueyi Shen
- Division of Psychiatry, University of Edinburgh, Edinburgh, UK
| | - David M Howard
- Division of Psychiatry, University of Edinburgh, Edinburgh, UK.,Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, UK
| | - Mark J Adams
- Division of Psychiatry, University of Edinburgh, Edinburgh, UK
| | - W David Hill
- Centre for Cognitive Ageing and Cognitive Epidemiology, University of Edinburgh, Edinburgh, UK.,Department of Psychology, University of Edinburgh, Edinburgh, UK
| | - Toni-Kim Clarke
- Division of Psychiatry, University of Edinburgh, Edinburgh, UK
| | | | - Ian J Deary
- Centre for Cognitive Ageing and Cognitive Epidemiology, University of Edinburgh, Edinburgh, UK.,Department of Psychology, University of Edinburgh, Edinburgh, UK
| | | | - Andrew M McIntosh
- Division of Psychiatry, University of Edinburgh, Edinburgh, UK. .,Centre for Cognitive Ageing and Cognitive Epidemiology, University of Edinburgh, Edinburgh, UK. .,Department of Psychology, University of Edinburgh, Edinburgh, UK.
| |
Collapse
|
649
|
Dikilitas O, Schaid DJ, Kosel ML, Carroll RJ, Chute CG, Denny JA, Fedotov A, Feng Q, Hakonarson H, Jarvik GP, Lee MTM, Pacheco JA, Rowley R, Sleiman PM, Stein CM, Sturm AC, Wei WQ, Wiesner GL, Williams MS, Zhang Y, Manolio TA, Kullo IJ. Predictive Utility of Polygenic Risk Scores for Coronary Heart Disease in Three Major Racial and Ethnic Groups. Am J Hum Genet 2020; 106:707-716. [PMID: 32386537 DOI: 10.1016/j.ajhg.2020.04.002] [Citation(s) in RCA: 99] [Impact Index Per Article: 19.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2019] [Accepted: 03/31/2020] [Indexed: 12/28/2022] Open
Abstract
Because polygenic risk scores (PRSs) for coronary heart disease (CHD) are derived from mainly European ancestry (EA) cohorts, their validity in African ancestry (AA) and Hispanic ethnicity (HE) individuals is unclear. We investigated associations of "restricted" and genome-wide PRSs with CHD in three major racial and ethnic groups in the U.S. The eMERGE cohort (mean age 48 ± 14 years, 58% female) included 45,645 EA, 7,597 AA, and 2,493 HE individuals. We assessed two restricted PRSs (PRSTikkanen and PRSTada; 28 and 50 variants, respectively) and two genome-wide PRSs (PRSmetaGRS and PRSLDPred; 1.7 M and 6.6 M variants, respectively) derived from EA cohorts. Over a median follow-up of 11.1 years, 2,652 incident CHD events occurred. Hazard and odds ratios for the association of PRSs with CHD were similar in EA and HE cohorts but lower in AA cohorts. Genome-wide PRSs were more strongly associated with CHD than restricted PRSs were. PRSmetaGRS, the best performing PRS, was associated with CHD in all three cohorts; hazard ratios (95% CI) per 1 SD increase were 1.53 (1.46-1.60), 1.53 (1.23-1.90), and 1.27 (1.13-1.43) for incident CHD in EA, HE, and AA individuals, respectively. The hazard ratios were comparable in the EA and HE cohorts (pinteraction = 0.77) but were significantly attenuated in AA individuals (pinteraction= 2.9 × 10-3). These results highlight the potential clinical utility of PRSs for CHD as well as the need to assemble diverse cohorts to generate ancestry- and ethnicity PRSs.
Collapse
Affiliation(s)
- Ozan Dikilitas
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN 55905, USA
| | - Daniel J Schaid
- Department of Health Sciences Research, Mayo Clinic, Rochester, MN 55905, USA
| | - Matthew L Kosel
- Department of Health Sciences Research, Mayo Clinic, Rochester, MN 55905, USA
| | - Robert J Carroll
- Department of Biomedical Informatics, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37212, USA
| | - Christopher G Chute
- Schools of Medicine, Public Health, and Nursing, Johns Hopkins University, Baltimore, MD 21205, USA
| | - Joshua A Denny
- Department of Biomedical Informatics, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37212, USA
| | - Alex Fedotov
- Irving Institute for Clinical and Translational Research, Columbia University Medical Center, New York, NY 10032, USA
| | - QiPing Feng
- Division of Clinical Pharmacology, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37212, USA
| | - Hakon Hakonarson
- Center for Applied Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Gail P Jarvik
- Department of Medicine, University of Washington, Seattle, WA 98195, USA
| | | | - Jennifer A Pacheco
- Center for Genetic Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, USA
| | - Robb Rowley
- National Human Genome Research Institute, Bethesda, MD 20892, USA
| | - Patrick M Sleiman
- Center for Applied Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - C Michael Stein
- Division of Clinical Pharmacology, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37212, USA
| | | | - Wei-Qi Wei
- Department of Biomedical Informatics, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37212, USA
| | - Georgia L Wiesner
- Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37212, USA
| | | | | | - Teri A Manolio
- National Human Genome Research Institute, Bethesda, MD 20892, USA
| | - Iftikhar J Kullo
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN 55905, USA.
| |
Collapse
|
650
|
Yang S, Zhou X. Accurate and Scalable Construction of Polygenic Scores in Large Biobank Data Sets. Am J Hum Genet 2020; 106:679-693. [PMID: 32330416 PMCID: PMC7212266 DOI: 10.1016/j.ajhg.2020.03.013] [Citation(s) in RCA: 78] [Impact Index Per Article: 15.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2019] [Accepted: 03/30/2020] [Indexed: 01/24/2023] Open
Abstract
Accurate construction of polygenic scores (PGS) can enable early diagnosis of diseases and facilitate the development of personalized medicine. Accurate PGS construction requires prediction models that are both adaptive to different genetic architectures and scalable to biobank scale datasets with millions of individuals and tens of millions of genetic variants. Here, we develop such a method called Deterministic Bayesian Sparse Linear Mixed Model (DBSLMM). DBSLMM relies on a flexible modeling assumption on the effect size distribution to achieve robust and accurate prediction performance across a range of genetic architectures. DBSLMM also relies on a simple deterministic search algorithm to yield an approximate analytic estimation solution using summary statistics only. The deterministic search algorithm, when paired with further algebraic innovations, results in substantial computational savings. With simulations, we show that DBSLMM achieves scalable and accurate prediction performance across a range of realistic genetic architectures. We then apply DBSLMM to analyze 25 traits in UK Biobank. For these traits, compared to existing approaches, DBSLMM achieves an average of 2.03%-101.09% accuracy gain in internal cross-validations. In external validations on two separate datasets, including one from BioBank Japan, DBSLMM achieves an average of 14.74%-522.74% accuracy gain. In these real data applications, DBSLMM is 1.03-28.11 times faster and uses only 7.4%-24.8% of physical memory as compared to other multiple regression-based PGS methods. Overall, DBSLMM represents an accurate and scalable method for constructing PGS in biobank scale datasets.
Collapse
Affiliation(s)
- Sheng Yang
- Department of Biostatistics, Nanjing Medical University, Nanjing, Jiangsu 211166, China; Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA; Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Xiang Zhou
- Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA; Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109, USA.
| |
Collapse
|