51
|
Zhang Y, Lu Q, Ye Y, Huang K, Liu W, Wu Y, Zhong X, Li B, Yu Z, Travers BG, Werling DM, Li JJ, Zhao H. SUPERGNOVA: local genetic correlation analysis reveals heterogeneous etiologic sharing of complex traits. Genome Biol 2021; 22:262. [PMID: 34493297 PMCID: PMC8422619 DOI: 10.1186/s13059-021-02478-w] [Citation(s) in RCA: 71] [Impact Index Per Article: 17.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2020] [Accepted: 08/23/2021] [Indexed: 01/09/2023] Open
Abstract
Local genetic correlation quantifies the genetic similarity of complex traits in specific genomic regions. However, accurate estimation of local genetic correlation remains challenging, due to linkage disequilibrium in local genomic regions and sample overlap across studies. We introduce SUPERGNOVA, a statistical framework to estimate local genetic correlations using summary statistics from genome-wide association studies. We demonstrate that SUPERGNOVA outperforms existing methods through simulations and analyses of 30 complex traits. In particular, we show that the positive yet paradoxical genetic correlation between autism spectrum disorder and cognitive performance could be explained by two etiologically distinct genetic signatures with bidirectional local genetic correlations.
Collapse
Affiliation(s)
- Yiliang Zhang
- Department of Biostatistics, Yale School of Public Health, 60 College Street, New Haven, CT, 06520, USA
| | - Qiongshi Lu
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, 53706, USA
- Department of Statistics, University of Wisconsin-Madison, Madison, WI, 53706, USA
- Center for Demography of Health and Aging, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Yixuan Ye
- Program of Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06510, USA
| | - Kunling Huang
- Department of Statistics, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Wei Liu
- Program of Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06510, USA
| | - Yuchang Wu
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Xiaoyuan Zhong
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Boyang Li
- Department of Biostatistics, Yale School of Public Health, 60 College Street, New Haven, CT, 06520, USA
| | - Zhaolong Yu
- Program of Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06510, USA
| | - Brittany G Travers
- Occupational Therapy Program in the Department of Kinesiology, University of Wisconsin-Madison, Madison, WI, 53706, USA
- Waisman Center, University of Wisconsin-Madison, Madison, WI, 53705, USA
| | - Donna M Werling
- Waisman Center, University of Wisconsin-Madison, Madison, WI, 53705, USA
- Laboratory of Genetics, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - James J Li
- Waisman Center, University of Wisconsin-Madison, Madison, WI, 53705, USA
- Department of Psychology, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Hongyu Zhao
- Department of Biostatistics, Yale School of Public Health, 60 College Street, New Haven, CT, 06520, USA.
- Program of Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06510, USA.
- Department of Genetics, Yale School of Medicine, New Haven, CT, 06510, USA.
| |
Collapse
|
52
|
Zhang Y, Cheng Y, Jiang W, Ye Y, Lu Q, Zhao H. Comparison of methods for estimating genetic correlation between complex traits using GWAS summary statistics. Brief Bioinform 2021; 22:bbaa442. [PMID: 33497438 PMCID: PMC8425307 DOI: 10.1093/bib/bbaa442] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2020] [Revised: 12/12/2020] [Accepted: 12/30/2020] [Indexed: 01/03/2023] Open
Abstract
Genetic correlation is the correlation of phenotypic effects by genetic variants across the genome on two phenotypes. It is an informative metric to quantify the overall genetic similarity between complex traits, which provides insights into their polygenic genetic architecture. Several methods have been proposed to estimate genetic correlation based on data collected from genome-wide association studies (GWAS). Due to the easy access of GWAS summary statistics and computational efficiency, methods only requiring GWAS summary statistics as input have become more popular than methods utilizing individual-level genotype data. Here, we present a benchmark study for different summary-statistics-based genetic correlation estimation methods through simulation and real data applications. We focus on two major technical challenges in estimating genetic correlation: marker dependency caused by linkage disequilibrium (LD) and sample overlap between different studies. To assess the performance of different methods in the presence of these two challenges, we first conducted comprehensive simulations with diverse LD patterns and sample overlaps. Then we applied these methods to real GWAS summary statistics for a wide spectrum of complex traits. Based on these experiments, we conclude that methods relying on accurate LD estimation are less robust in real data applications due to the imprecision of LD obtained from reference panels. Our findings offer guidance on how to choose appropriate methods for genetic correlation estimation in post-GWAS analysis.
Collapse
Affiliation(s)
- Yiliang Zhang
- Department of Biostatistics, Yale School of Public Health, New Haven, CT 06510, USA
| | - Youshu Cheng
- Department of Biostatistics, Yale School of Public Health, New Haven, CT 06510, USA
| | - Wei Jiang
- Department of Biostatistics, Yale School of Public Health, New Haven, CT 06510, USA
| | - Yixuan Ye
- Program of Computational Biology and Bioinformatics, Yale University, New Haven, CT 06510, USA
| | - Qiongshi Lu
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI 53706, USA
- Department of Statistics, University of Wisconsin-Madison, Madison, WI 53706, USA
- Center for Demography of Health and Aging, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Hongyu Zhao
- Department of Biostatistics, Yale School of Public Health, New Haven, CT 06510, USA
- Program of Computational Biology and Bioinformatics, Yale University, New Haven, CT 06510, USA
- Department of Genetics, Yale School of Medicine, New Haven, CT 06510, USA
| |
Collapse
|
53
|
Jansen AG, Jansen PR, Savage JE, Kraft J, Skarabis N, Polderman TJC, Dieleman GC. The predictive capacity of psychiatric and psychological polygenic risk scores for distinguishing cases in a child and adolescent psychiatric sample from controls. J Child Psychol Psychiatry 2021; 62:1079-1089. [PMID: 33825194 PMCID: PMC8453516 DOI: 10.1111/jcpp.13370] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/17/2020] [Revised: 10/30/2020] [Accepted: 11/18/2020] [Indexed: 12/15/2022]
Abstract
BACKGROUND Psychiatric traits are heritable, highly comorbid and genetically correlated, suggesting that genetic effects that are shared across disorders are at play. The aim of the present study is to quantify the predictive capacity of common genetic variation of a variety of traits, as captured by their PRS, to predict case-control status in a child and adolescent psychiatric sample including controls to reveal which traits contribute to the shared genetic risk across disorders. METHOD Polygenic risk scores (PRS) of 14 traits were used as predictor phenotypes to predict case-control status in a clinical sample. Clinical cases (N = 1,402), age 1-21, diagnostic categories: Autism spectrum disorders (N = 492), Attention-deficit/ hyperactivity disorders (N = 471), Anxiety (N = 293), disruptive behaviors (N = 101), eating disorders (N = 97), OCD (N = 43), Tic disorder (N = 50), Disorder of infancy, childhood or adolescence NOS (N = 65), depression (N = 64), motor, learning and communication disorders (N = 59), Anorexia Nervosa (N = 48), somatoform disorders (N = 47), Trauma/stress (N = 39) and controls (N = 1,448, age 17-84) of European ancestry. First, these 14 PRS were tested in univariate regression analyses. The traits that significantly predicted case-control status were included in a multivariable regression model to investigate the gain in explained variance when leveraging the genetic effects of multiple traits simultaneously. RESULTS In the univariate analyses, we observed significant associations between clinical status and the PRS of educational attainment (EA), smoking initiation (SI), intelligence, neuroticism, alcohol dependence, ADHD, major depression and anti-social behavior. EA (p-value: 3.53E-20, explained variance: 3.99%, OR: 0.66), and SI (p-value: 4.77E-10, explained variance: 1.91%, OR: 1.33) were the most predictive traits. In the multivariable analysis with these eight significant traits, EA and SI, remained significant predictors. The explained variance of the PRS in the model with these eight traits combined was 5.9%. CONCLUSION Our study provides more insights into the genetic signal that is shared between childhood and adolescent psychiatric disorders. As such, our findings might guide future studies on psychiatric comorbidity and offer insights into shared etiology between psychiatric disorders. The increase in explained variance when leveraging the genetic signal of different predictor traits supports a multivariable approach to optimize precision accuracy for general psychopathology.
Collapse
Affiliation(s)
- Arija G. Jansen
- Department of Complex Trait Genetics, Center for Neurogenomics and Cognitive ResearchVrije Universiteit AmsterdamAmsterdamThe Netherlands
- Department of Child and Adolescent Psychiatry/PsychologyErasmus University Medical CenterRotterdamThe Netherlands
| | - Philip R. Jansen
- Department of Complex Trait Genetics, Center for Neurogenomics and Cognitive ResearchVrije Universiteit AmsterdamAmsterdamThe Netherlands
- Department of Child and Adolescent Psychiatry/PsychologyErasmus University Medical CenterRotterdamThe Netherlands
- Department of Clinical GeneticsAmsterdam UMC, Vrije Universiteit Medical CenterAmsterdamThe Netherlands
| | - Jeanne E. Savage
- Department of Complex Trait Genetics, Center for Neurogenomics and Cognitive ResearchVrije Universiteit AmsterdamAmsterdamThe Netherlands
| | - Julia Kraft
- Department of Psychiatry and PsychotherapyCharité, Universitätsmedizin BerlinBerlinGermany
- Berlin School of Mind and BrainHumboldt University of BerlinBerlinGermany
| | - Nora Skarabis
- Department of Psychiatry and PsychotherapyCharité, Universitätsmedizin BerlinBerlinGermany
| | - Tinca J. C. Polderman
- Department of Complex Trait Genetics, Center for Neurogenomics and Cognitive ResearchVrije Universiteit AmsterdamAmsterdamThe Netherlands
- Child and Adolescent Psychiatry and Psychosocial Care, Amsterdam UMCVrije Universiteit Amsterdam, Amsterdam Public HealthAmsterdamThe Netherlands
| | - Gwen C. Dieleman
- Department of Child and Adolescent Psychiatry/PsychologyErasmus University Medical CenterRotterdamThe Netherlands
| |
Collapse
|
54
|
Shan N, Xie Y, Song S, Jiang W, Wang Z, Hou L. A novel transcriptional risk score for risk prediction of complex human diseases. Genet Epidemiol 2021; 45:811-820. [PMID: 34245595 DOI: 10.1002/gepi.22424] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2020] [Revised: 06/08/2021] [Accepted: 06/24/2021] [Indexed: 11/06/2022]
Abstract
Recently polygenetic risk score (PRS) has been successfully used in the risk prediction of complex human diseases. Many studies incorporated internal information, such as effect size distribution, or external information, such as linkage disequilibrium, functional annotation, and pleiotropy among multiple diseases, to optimize the performance of PRS. To leverage on multiomics datasets, we developed a novel flexible transcriptional risk score (TRS), in which messenger RNA expression levels were imputed and weighted for risk prediction. In simulation studies, we demonstrated that single-tissue TRS has greater prediction power than LDpred, especially when there is a large effect of gene expression on the phenotype. Multitissue TRS improves prediction accuracy when there are multiple tissues with independent contributions to disease risk. We applied our method to complex traits, including Crohn's disease, type 2 diabetes, and so on. The single-tissue TRS method outperformed LDpred and AnnoPred across the tested traits. The performance of multitissue TRS is trait-dependent. Moreover, our method can easily incorporate information from epigenomic and proteomic data upon the availability of reference datasets.
Collapse
Affiliation(s)
- Nayang Shan
- Center for Statistical Science, Department of Industrial Engineering, Tsinghua University, Beijing, China
| | - Yuhan Xie
- Department of Biostatistics, Yale School of Public Health, New Haven, Connecticut, USA
| | - Shuang Song
- Center for Statistical Science, Department of Industrial Engineering, Tsinghua University, Beijing, China
| | - Wei Jiang
- Department of Biostatistics, Yale School of Public Health, New Haven, Connecticut, USA
| | - Zuoheng Wang
- Department of Biostatistics, Yale School of Public Health, New Haven, Connecticut, USA
| | - Lin Hou
- Center for Statistical Science, Department of Industrial Engineering, Tsinghua University, Beijing, China.,MOE Key Laboratory of Bioinformatics, School of Life Sciences, Tsinghua University, Beijing, China
| |
Collapse
|
55
|
Moldovan A, Waldman YY, Brandes N, Linial M. Body Mass Index and Birth Weight Improve Polygenic Risk Score for Type 2 Diabetes. J Pers Med 2021; 11:582. [PMID: 34205563 PMCID: PMC8233887 DOI: 10.3390/jpm11060582] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2021] [Revised: 06/10/2021] [Accepted: 06/17/2021] [Indexed: 12/25/2022] Open
Abstract
One of the major challenges in the post-genomic era is elucidating the genetic basis of human diseases. In recent years, studies have shown that polygenic risk scores (PRS), based on aggregated information from millions of variants across the human genome, can estimate individual risk for common diseases. In practice, the current medical practice still predominantly relies on physiological and clinical indicators to assess personal disease risk. For example, caregivers mark individuals with high body mass index (BMI) as having an increased risk to develop type 2 diabetes (T2D). An important question is whether combining PRS with clinical metrics can increase the power of disease prediction in particular from early life. In this work we examined this question, focusing on T2D. We present here a sex-specific integrated approach that combines PRS with additional measurements and age to define a new risk score. We show that such approach combining adult BMI and PRS achieves considerably better prediction than each of the measures on unrelated Caucasians in the UK Biobank (UKB, n = 290,584). Likewise, integrating PRS with self-reports on birth weight (n = 172,239) and comparative body size at age ten (n = 287,203) also substantially enhance prediction as compared to each of its components. While the integration of PRS with BMI achieved better results as compared to the other measurements, the latter are early-life measurements that can be integrated already at childhood, to allow preemptive intervention for those at high risk to develop T2D. Our integrated approach can be easily generalized to other diseases, with the relevant early-life measurements.
Collapse
Affiliation(s)
- Avigail Moldovan
- Department of Biological Chemistry, Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem 91904, Israel;
| | | | - Nadav Brandes
- The Rachel and Selim Benin School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem 91904, Israel;
| | - Michal Linial
- Department of Biological Chemistry, Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem 91904, Israel;
| |
Collapse
|
56
|
Abstract
As genome-wide association studies have continued to identify loci associated with complex traits, the implications of and necessity for proper use of these findings, including prediction of disease risk, have become apparent. Many complex diseases have numerous associated loci with detectable effects implicating risk for or protection from disease. A common contemporary approach to using this information for disease prediction is through the application of genetic risk scores. These scores estimate an individual's liability for a specific outcome by aggregating the effects of associated loci into a single measure as described in the previous version of this article. Although genetic risk scores have traditionally included variants that meet criteria for genome-wide significance, an extension known as the polygenic risk score has been developed to include the effects of more variants across the entire genome. Here, we describe common methods and software packages for calculating and interpreting polygenic risk scores. In this revised version of the article, we detail information that is needed to perform a polygenic risk score analysis, considerations for planning the analysis and interpreting results, as well as discussion of the limitations based on the choices made. We also provide simulated sample data and a walkthrough for four different polygenic risk score software. © 2021 Wiley Periodicals LLC.
Collapse
Affiliation(s)
- Michael D Osterman
- Cleveland Institute for Computational Biology, Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, Ohio
| | - Tyler G Kinzy
- Cleveland Institute for Computational Biology, Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, Ohio
| | - Jessica N Cooke Bailey
- Cleveland Institute for Computational Biology, Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, Ohio
| |
Collapse
|
57
|
Albiñana C, Grove J, McGrath JJ, Agerbo E, Wray NR, Bulik CM, Nordentoft M, Hougaard DM, Werge T, Børglum AD, Mortensen PB, Privé F, Vilhjálmsson BJ. Leveraging both individual-level genetic data and GWAS summary statistics increases polygenic prediction. Am J Hum Genet 2021; 108:1001-1011. [PMID: 33964208 PMCID: PMC8206385 DOI: 10.1016/j.ajhg.2021.04.014] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2021] [Accepted: 04/20/2021] [Indexed: 12/12/2022] Open
Abstract
The accuracy of polygenic risk scores (PRSs) to predict complex diseases increases with the training sample size. PRSs are generally derived based on summary statistics from large meta-analyses of multiple genome-wide association studies (GWASs). However, it is now common for researchers to have access to large individual-level data as well, such as the UK Biobank data. To the best of our knowledge, it has not yet been explored how best to combine both types of data (summary statistics and individual-level data) to optimize polygenic prediction. The most widely used approach to combine data is the meta-analysis of GWAS summary statistics (meta-GWAS), but we show that it does not always provide the most accurate PRS. Through simulations and using 12 real case-control and quantitative traits from both iPSYCH and UK Biobank along with external GWAS summary statistics, we compare meta-GWAS with two alternative data-combining approaches, stacked clumping and thresholding (SCT) and meta-PRS. We find that, when large individual-level data are available, the linear combination of PRSs (meta-PRS) is both a simple alternative to meta-GWAS and often more accurate.
Collapse
Affiliation(s)
- Clara Albiñana
- The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, 8210 Aarhus V, Denmark; National Centre for Register-Based Research, Aarhus University, 8210 Aarhus V, Denmark.
| | - Jakob Grove
- The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, 8210 Aarhus V, Denmark; Department of Biomedicine and Center for Integrative Sequencing, iSEQ, Aarhus University, 8000 Aarhus C, Denmark; Center for Genomics and Personalized Medicine, CGPM, Aarhus University, 8000 Aarhus C, Denmark; Bioinformatics Research Centre, Aarhus University, 8000 Aarhus C, Denmark
| | - John J McGrath
- National Centre for Register-Based Research, Aarhus University, 8210 Aarhus V, Denmark; Queensland Centre for Mental Health Research, The Park Centre for Mental Health, Brisbane, QLD 4076, Australia; Queensland Brain Institute, University of Queensland, Brisbane, QLD 4072, Australia
| | - Esben Agerbo
- The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, 8210 Aarhus V, Denmark; National Centre for Register-Based Research, Aarhus University, 8210 Aarhus V, Denmark
| | - Naomi R Wray
- Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD 4072, Australia; Queensland Brain Institute, University of Queensland, Brisbane, QLD 4072, Australia
| | - Cynthia M Bulik
- Department of Psychiatry, University of North Carolina at Chapel Hill, Chapel Hill, NC 27514, USA; Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, 171 77 Stockholm, Sweden; Department of Nutrition, University of North Carolina at Chapel Hill, Chapel Hill, NC 27514, USA
| | - Merete Nordentoft
- The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, 8210 Aarhus V, Denmark; Copenhagen University Hospital, Mental Health Centre Copenhagen Mental Health Services in the Capital Region of Denmark, 2100 Copenhagen Ø, Denmark; Department of Clinical Medicine, University of Copenhagen, 2200 Copenhagen N, Denmark
| | - David M Hougaard
- The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, 8210 Aarhus V, Denmark; Center for Neonatal Screening, Department for Congenital Disorders, Statens Serum Institut, 2300 Copenhagen S, Denmark
| | - Thomas Werge
- The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, 8210 Aarhus V, Denmark; Institute of Biological Psychiatry, MHC Sct. Hans, Mental Health Services Copenhagen, 4000 Roskilde, Denmark; Department of Clinical Medicine, University of Copenhagen, 2200 Copenhagen N, Denmark; Lundbeck Foundation GeoGenetics Centre, GLOBE Institute, University of Copenhagen, 1350 Copenhagen K, Denmark
| | - Anders D Børglum
- The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, 8210 Aarhus V, Denmark; Department of Biomedicine and Center for Integrative Sequencing, iSEQ, Aarhus University, 8000 Aarhus C, Denmark; Center for Genomics and Personalized Medicine, CGPM, Aarhus University, 8000 Aarhus C, Denmark
| | - Preben Bo Mortensen
- The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, 8210 Aarhus V, Denmark; National Centre for Register-Based Research, Aarhus University, 8210 Aarhus V, Denmark
| | - Florian Privé
- The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, 8210 Aarhus V, Denmark; National Centre for Register-Based Research, Aarhus University, 8210 Aarhus V, Denmark
| | - Bjarni J Vilhjálmsson
- The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, 8210 Aarhus V, Denmark; National Centre for Register-Based Research, Aarhus University, 8210 Aarhus V, Denmark; Bioinformatics Research Centre, Aarhus University, 8000 Aarhus C, Denmark.
| |
Collapse
|
58
|
Tang H, He Z. Advances and challenges in quantitative delineation of the genetic architecture of complex traits. QUANTITATIVE BIOLOGY 2021; 9:168-184. [PMID: 35492964 PMCID: PMC9053444 DOI: 10.15302/j-qb-021-0249] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Background Genome-wide association studies (GWAS) have been widely adopted in studies of human complex traits and diseases. Results This review surveys areas of active research: quantifying and partitioning trait heritability, fine mapping functional variants and integrative analysis, genetic risk prediction of phenotypes, and the analysis of sequencing studies that have identified millions of rare variants. Current challenges and opportunities are highlighted. Conclusion GWAS have fundamentally transformed the field of human complex trait genetics. Novel statistical and computational methods have expanded the scope of GWAS and have provided valuable insights on the genetic architecture underlying complex phenotypes.
Collapse
Affiliation(s)
- Hua Tang
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | - Zihuai He
- Department of Neurology and Neurological Sciences, Stanford University, Stanford, CA 94305, USA
- Quantitative Sciences Unit, Department of Medicine, Stanford University, Stanford, CA 94305, USA
| |
Collapse
|
59
|
Neumann JT, Riaz M, Bakshi A, Polekhina G, Thao LTP, Nelson MR, Woods RL, Abraham G, Inouye M, Reid CM, Tonkin AM, Williamson JD, Donnan GA, Brodtmann A, Cloud GC, McNeil JJ, Lacaze P. Predictive Performance of a Polygenic Risk Score for Incident Ischemic Stroke in a Healthy Older Population. Stroke 2021; 52:2882-2891. [PMID: 34039031 DOI: 10.1161/strokeaha.120.033670] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
[Figure: see text].
Collapse
Affiliation(s)
- Johannes T Neumann
- Department of Epidemiology and Preventive Medicine, School of Public Health and Preventive Medicine (J.T.N., M.R., A. Bakshi, G.P., L.T.P.T., M.R.N., R.L.W., C.M.R., A.M.T., J.J.M., P.L.), Monash University, Melbourne, Australia.,Department of Cardiology, University Heart and Vascular Centre, Hamburg, Germany (J.T.N.).,German Centre for Cardiovascular Research, Partner Site Hamburg/Kiel/Lübeck, Germany (J.T.N.)
| | - Moeen Riaz
- Department of Epidemiology and Preventive Medicine, School of Public Health and Preventive Medicine (J.T.N., M.R., A. Bakshi, G.P., L.T.P.T., M.R.N., R.L.W., C.M.R., A.M.T., J.J.M., P.L.), Monash University, Melbourne, Australia
| | - Andrew Bakshi
- Department of Epidemiology and Preventive Medicine, School of Public Health and Preventive Medicine (J.T.N., M.R., A. Bakshi, G.P., L.T.P.T., M.R.N., R.L.W., C.M.R., A.M.T., J.J.M., P.L.), Monash University, Melbourne, Australia
| | - Galina Polekhina
- Department of Epidemiology and Preventive Medicine, School of Public Health and Preventive Medicine (J.T.N., M.R., A. Bakshi, G.P., L.T.P.T., M.R.N., R.L.W., C.M.R., A.M.T., J.J.M., P.L.), Monash University, Melbourne, Australia
| | - Le T P Thao
- Department of Epidemiology and Preventive Medicine, School of Public Health and Preventive Medicine (J.T.N., M.R., A. Bakshi, G.P., L.T.P.T., M.R.N., R.L.W., C.M.R., A.M.T., J.J.M., P.L.), Monash University, Melbourne, Australia
| | - Mark R Nelson
- Department of Epidemiology and Preventive Medicine, School of Public Health and Preventive Medicine (J.T.N., M.R., A. Bakshi, G.P., L.T.P.T., M.R.N., R.L.W., C.M.R., A.M.T., J.J.M., P.L.), Monash University, Melbourne, Australia.,Menzies Institute for Medical Research, University of Tasmania, Hobart (M.R.N.)
| | - Robyn L Woods
- Department of Epidemiology and Preventive Medicine, School of Public Health and Preventive Medicine (J.T.N., M.R., A. Bakshi, G.P., L.T.P.T., M.R.N., R.L.W., C.M.R., A.M.T., J.J.M., P.L.), Monash University, Melbourne, Australia
| | - Gad Abraham
- Cambridge Baker Systems Genomics Initiative, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia (G.A., M.I.)
| | - Michael Inouye
- Cambridge Baker Systems Genomics Initiative, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia (G.A., M.I.).,Department of Public Health and Primary Care, University of Cambridge, United Kingdom (M.I.)
| | - Christopher M Reid
- Department of Epidemiology and Preventive Medicine, School of Public Health and Preventive Medicine (J.T.N., M.R., A. Bakshi, G.P., L.T.P.T., M.R.N., R.L.W., C.M.R., A.M.T., J.J.M., P.L.), Monash University, Melbourne, Australia.,School of Public Health, Curtin University, Perth, Western Australia, Australia (C.M.R.)
| | - Andrew M Tonkin
- Department of Epidemiology and Preventive Medicine, School of Public Health and Preventive Medicine (J.T.N., M.R., A. Bakshi, G.P., L.T.P.T., M.R.N., R.L.W., C.M.R., A.M.T., J.J.M., P.L.), Monash University, Melbourne, Australia
| | - Jeff D Williamson
- Department of Internal Medicine, Sticht Center on Aging and Alzheimer's Prevention, Section on Gerontology and Geriatric Medicine, Wake Forest School of Medicine, Winston-Salem, NC (J.D.W.)
| | - Geoffrey A Donnan
- Melbourne Brain Centre, Royal Melbourne Hospital (G.A.D., A. Brodtmann), University of Melbourne, Australia
| | - Amy Brodtmann
- Melbourne Brain Centre, Royal Melbourne Hospital (G.A.D., A. Brodtmann), University of Melbourne, Australia.,The Florey Institute of Neuroscience and Mental Health (A. Brodtmann), University of Melbourne, Australia
| | - Geoffrey C Cloud
- Department of Neuroscience, Central Clinical School (G.C.C.), Monash University, Melbourne, Australia.,Department of Neurology, Alfred Hospital, Melbourne, Australia (G.C.C.)
| | - John J McNeil
- Department of Epidemiology and Preventive Medicine, School of Public Health and Preventive Medicine (J.T.N., M.R., A. Bakshi, G.P., L.T.P.T., M.R.N., R.L.W., C.M.R., A.M.T., J.J.M., P.L.), Monash University, Melbourne, Australia
| | - Paul Lacaze
- Department of Epidemiology and Preventive Medicine, School of Public Health and Preventive Medicine (J.T.N., M.R., A. Bakshi, G.P., L.T.P.T., M.R.N., R.L.W., C.M.R., A.M.T., J.J.M., P.L.), Monash University, Melbourne, Australia
| |
Collapse
|
60
|
Risner VA, Benca-Bachman CE, Bertin L, Smith AK, Kaprio J, McGeary JE, Chesler E, Knopik V, Friedman N, Palmer RHC. Multi-polygenic Analysis of Nicotine Dependence in Individuals of European Ancestry. Nicotine Tob Res 2021; 23:2102-2109. [PMID: 34008017 DOI: 10.1093/ntr/ntab105] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2020] [Accepted: 05/14/2021] [Indexed: 11/12/2022]
Abstract
INTRODUCTION Heritability estimates of nicotine dependence (ND) range from 40-70%, but discovery GWAS of ND are underpowered and have limited predictive utility. In this work, we leverage genetically correlated traits and diseases to increase the accuracy of polygenic risk prediction. METHODS We employed a multi-trait model using summary statistic-based best linear unbiased predictors (SBLUP) of genetic correlates of DSM-IV diagnosis of ND in 6,394 individuals of European Ancestry (prevalence = 45.3%, %female = 46.8%, µage = 40.08 [s.d. = 10.43]) and 3,061 individuals from a nationally-representative sample with Fagerström Test for Nicotine Dependence symptom count (FTND; 51.32% female, mean age = 28.9 [s.d. = 1.70]). Polygenic predictors were derived from GWASs known to be phenotypically and genetically correlated with ND (i.e., Cigarettes per Day (CPD), the Alcohol Use Disorders Identification Test (AUDIT-Consumption and AUDIT-Problems), Neuroticism, Depression, Schizophrenia, Educational Attainment, Body Mass Index (BMI), and Self-Perceived Risk-Taking); including Height as a negative control. Analyses controlled for age, gender, study site, and the first 10 ancestral principal components. RESULTS The multi-trait model accounted for 3.6% of the total trait variance in DSM-IV ND. Educational Attainment (β=-0.125; 95% confidence interval (CI): [-0.149,-0.101]), CPD (0.071 [0.047,0.095]), and Self-Perceived Risk-Taking (0.051 [0.026,0.075]) were the most robust predictors. PGS effects on FTND were limited. CONCLUSIONS Risk for ND is not only polygenic, but also pleiotropic. Polygenic effects on ND that are accessible by these traits are limited in size and act additively to explain risk.
Collapse
Affiliation(s)
- Victoria A Risner
- Behavioral Genetics of Addiction Laboratory, Department of Psychology at Emory University, Atlanta GA
| | - Chelsie E Benca-Bachman
- Behavioral Genetics of Addiction Laboratory, Department of Psychology at Emory University, Atlanta GA
| | - Lauren Bertin
- Behavioral Genetics of Addiction Laboratory, Department of Psychology at Emory University, Atlanta GA
| | - Alicia K Smith
- Smith Lab, Department of Gynecology and Obstetrics, Emory University School of Medicine, Atlanta GA
| | - Jaakko Kaprio
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki Finland.,Department of Public Health, University of Helsinki, Helsinki Finland
| | - John E McGeary
- Department of Psychiatry & Human Behavior, Brown University, Providence RI.,The Genomic Laboratory, Providence VA Medical Center, Providence RI
| | | | - Valerie Knopik
- Department of Human Development and Family Studies, College of Health and Human Sciences, Purdue University, West Lafayette IN
| | - Naomi Friedman
- Department of Psychology, University of Colorado at Boulder, Boulder, CO
| | - Rohan H C Palmer
- Behavioral Genetics of Addiction Laboratory, Department of Psychology at Emory University, Atlanta GA
| |
Collapse
|
61
|
Porcu E, Sjaarda J, Lepik K, Carmeli C, Darrous L, Sulc J, Mounier N, Kutalik Z. Causal Inference Methods to Integrate Omics and Complex Traits. Cold Spring Harb Perspect Med 2021; 11:a040493. [PMID: 32816877 PMCID: PMC8091955 DOI: 10.1101/cshperspect.a040493] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Major biotechnological advances have facilitated a tremendous boost to the collection of (gen-/transcript-/prote-/methyl-/metabol-)omics data in very large sample sizes worldwide. Coordinated efforts have yielded a deluge of studies associating diseases with genetic markers (genome-wide association studies) or with molecular phenotypes. Whereas omics-disease associations have led to biologically meaningful and coherent mechanisms, the identified (non-germline) disease biomarkers may simply be correlates or consequences of the explored diseases. To move beyond this realm, Mendelian randomization provides a principled framework to integrate information on omics- and disease-associated genetic variants to pinpoint molecular traits causally driving disease development. In this review, we show the latest advances in this field, flag up key challenges for the future, and propose potential solutions.
Collapse
Affiliation(s)
- Eleonora Porcu
- Center for Integrative Genomics, University of Lausanne, Lausanne 1015, Switzerland
- Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland
- University Center for Primary Care and Public Health, University of Lausanne, Lausanne 1010, Switzerland
| | - Jennifer Sjaarda
- Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland
- University Center for Primary Care and Public Health, University of Lausanne, Lausanne 1010, Switzerland
| | - Kaido Lepik
- University Center for Primary Care and Public Health, University of Lausanne, Lausanne 1010, Switzerland
- Institute of Computer Science, University of Tartu, Tartu 50409, Estonia
| | - Cristian Carmeli
- University Center for Primary Care and Public Health, University of Lausanne, Lausanne 1010, Switzerland
| | - Liza Darrous
- Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland
- University Center for Primary Care and Public Health, University of Lausanne, Lausanne 1010, Switzerland
| | - Jonathan Sulc
- Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland
- University Center for Primary Care and Public Health, University of Lausanne, Lausanne 1010, Switzerland
| | - Ninon Mounier
- Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland
- University Center for Primary Care and Public Health, University of Lausanne, Lausanne 1010, Switzerland
| | - Zoltán Kutalik
- Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland
- University Center for Primary Care and Public Health, University of Lausanne, Lausanne 1010, Switzerland
- Genetics of Complex Traits, University of Exeter Medical School, University of Exeter, Exeter EX2 5AX, United Kingdom
| |
Collapse
|
62
|
Warrier V, Baron-Cohen S. Childhood trauma, life-time self-harm, and suicidal behaviour and ideation are associated with polygenic scores for autism. Mol Psychiatry 2021; 26:1670-1684. [PMID: 31659270 PMCID: PMC8159746 DOI: 10.1038/s41380-019-0550-x] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/28/2019] [Revised: 07/26/2019] [Accepted: 08/19/2019] [Indexed: 12/28/2022]
Abstract
Autistic individuals experience significantly elevated rates of childhood trauma, self-harm and suicidal behaviour and ideation (SSBI). Is this purely the result of negative environmental experiences, or does this interact with genetic predisposition? In this study we investigated if a genetic predisposition for autism is associated with childhood trauma using polygenic scores (PGS) and genetic correlations in the UK Biobank (105,222 < N < 105,638), and tested potential mediators and moderators of the association between autism, childhood trauma and SSBI. Autism PGS were significantly associated with childhood trauma (max R2 = 0.096%, P < 2 × 10-16), self-harm ideation (max R2 = 0.108%, P < 2 × 10-16), and self-harm (max R2 = 0.13%, P < 2 × 10-16). Supporting this, we identified significant genetic correlations between autism and childhood trauma (rg = 0.36 ± 0.05, P = 8.13 × 10-11), self-harm ideation (rg = 0.49 ± 0.05, P = 4.17 × 10-21) and self-harm (rg = 0.48 ± 0.05, P = 4.58 × 10-21), and an over-transmission of PGS for the two SSBI phenotypes from parents to autistic probands. Male sex negatively moderated the effect of autism PGS on childhood trauma (β = -0.023 ± 0.005, P = 6.74 × 10-5). Further, childhood trauma positively moderated the effect of autism PGS on self-harm score (β = 8.37 × 10-3 ± 2.76 × 10-3, P = 2.42 × 10-3) and self-harm ideation (β = 7.47 × 10-3 ± 2.76 × 10-3, P = 6.71 × 10-3). Finally, depressive symptoms, quality and frequency of social interactions, and educational attainment were significant mediators of the effect of autism PGS on SSBI, with the proportion of effect mediated ranging from 0.23 (95% CI: 0.09-0.32) for depression to 0.008 (95% CI: 0.004-0.01) for educational attainment. Our findings identify that a genetic predisposition for autism is associated with adverse life-time outcomes, which represent complex gene-environment interactions, and prioritizes potential mediators and moderators of this shared biology. It is important to identify sources of trauma for autistic individuals in order to reduce their occurrence and impact.
Collapse
Affiliation(s)
- Varun Warrier
- Department of Psychiatry, Autism Research Centre, University of Cambridge, Cambridge, UK.
| | - Simon Baron-Cohen
- Department of Psychiatry, Autism Research Centre, University of Cambridge, Cambridge, UK.
| |
Collapse
|
63
|
Pain O, Glanville KP, Hagenaars SP, Selzam S, Fürtjes AE, Gaspar HA, Coleman JRI, Rimfeld K, Breen G, Plomin R, Folkersen L, Lewis CM. Evaluation of polygenic prediction methodology within a reference-standardized framework. PLoS Genet 2021; 17:e1009021. [PMID: 33945532 PMCID: PMC8121285 DOI: 10.1371/journal.pgen.1009021] [Citation(s) in RCA: 90] [Impact Index Per Article: 22.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2020] [Revised: 05/14/2021] [Accepted: 03/28/2021] [Indexed: 12/16/2022] Open
Abstract
The predictive utility of polygenic scores is increasing, and many polygenic scoring methods are available, but it is unclear which method performs best. This study evaluates the predictive utility of polygenic scoring methods within a reference-standardized framework, which uses a common set of variants and reference-based estimates of linkage disequilibrium and allele frequencies to construct scores. Eight polygenic score methods were tested: p-value thresholding and clumping (pT+clump), SBLUP, lassosum, LDpred1, LDpred2, PRScs, DBSLMM and SBayesR, evaluating their performance to predict outcomes in UK Biobank and the Twins Early Development Study (TEDS). Strategies to identify optimal p-value thresholds and shrinkage parameters were compared, including 10-fold cross validation, pseudovalidation and infinitesimal models (with no validation sample), and multi-polygenic score elastic net models. LDpred2, lassosum and PRScs performed strongly using 10-fold cross-validation to identify the most predictive p-value threshold or shrinkage parameter, giving a relative improvement of 16-18% over pT+clump in the correlation between observed and predicted outcome values. Using pseudovalidation, the best methods were PRScs, DBSLMM and SBayesR. PRScs pseudovalidation was only 3% worse than the best polygenic score identified by 10-fold cross validation. Elastic net models containing polygenic scores based on a range of parameters consistently improved prediction over any single polygenic score. Within a reference-standardized framework, the best polygenic prediction was achieved using LDpred2, lassosum and PRScs, modeling multiple polygenic scores derived using multiple parameters. This study will help researchers performing polygenic score studies to select the most powerful and predictive analysis methods.
Collapse
Affiliation(s)
- Oliver Pain
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, United Kingdom
- NIHR Maudsley Biomedical Research Centre, South London and Maudsley NHS Trust, London, United Kingdom
| | - Kylie P. Glanville
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, United Kingdom
| | - Saskia P. Hagenaars
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, United Kingdom
| | - Saskia Selzam
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, United Kingdom
| | - Anna E. Fürtjes
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, United Kingdom
| | - Héléna A. Gaspar
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, United Kingdom
| | - Jonathan R. I. Coleman
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, United Kingdom
| | - Kaili Rimfeld
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, United Kingdom
| | - Gerome Breen
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, United Kingdom
- NIHR Maudsley Biomedical Research Centre, South London and Maudsley NHS Trust, London, United Kingdom
| | - Robert Plomin
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, United Kingdom
| | - Lasse Folkersen
- Institute of Biological Psychiatry, Sankt Hans Hospital, Copenhagen, Denmark
| | - Cathryn M. Lewis
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, United Kingdom
- NIHR Maudsley Biomedical Research Centre, South London and Maudsley NHS Trust, London, United Kingdom
- Department of Medical and Molecular Genetics, Faculty of Life Sciences and Medicine, King’s College London, London, United Kingdom
| |
Collapse
|
64
|
Restuadi R, Garton FC, Benyamin B, Lin T, Williams KL, Vinkhuyzen A, van Rheenen W, Zhu Z, Laing NG, Mather KA, Sachdev PS, Ngo ST, Steyn FJ, Wallace L, Henders AK, Visscher PM, Needham M, Mathers S, Nicholson G, Rowe DB, Henderson RD, McCombe PA, Pamphlett R, Blair IP, Wray NR, McRae AF. Polygenic risk score analysis for amyotrophic lateral sclerosis leveraging cognitive performance, educational attainment and schizophrenia. Eur J Hum Genet 2021; 30:532-539. [PMID: 33907316 PMCID: PMC9090723 DOI: 10.1038/s41431-021-00885-y] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2019] [Revised: 03/24/2021] [Accepted: 04/01/2021] [Indexed: 12/12/2022] Open
Abstract
Amyotrophic Lateral Sclerosis (ALS) is recognised to be a complex neurodegenerative disease involving both genetic and non-genetic risk factors. The underlying causes and risk factors for the majority of cases remain unknown; however, ever-larger genetic data studies and methodologies promise an enhanced understanding. Recent analyses using published summary statistics from the largest ALS genome-wide association study (GWAS) (20,806 ALS cases and 59,804 healthy controls) identified that schizophrenia (SCZ), cognitive performance (CP) and educational attainment (EA) related traits were genetically correlated with ALS. To provide additional evidence for these correlations, we built single and multi-trait genetic predictors using GWAS summary statistics for ALS and these traits, (SCZ, CP, EA) in an independent Australian cohort (846 ALS cases and 665 healthy controls). We compared methods for generating the risk predictors and found that the combination of traits improved the prediction (Nagelkerke-R2) of the case-control logistic regression. The combination of ALS, SCZ, CP, and EA, using the SBayesR predictor method gave the highest prediction (Nagelkerke-R2) of 0.027 (P value = 4.6 × 10-8), with the odds-ratio for estimated disease risk between the highest and lowest deciles of individuals being 3.15 (95% CI 1.96-5.05). These results support the genetic correlation between ALS, SCZ, CP and EA providing a better understanding of the complexity of ALS.
Collapse
Affiliation(s)
- Restuadi Restuadi
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia
| | - Fleur C Garton
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia
| | - Beben Benyamin
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia.,Australian Centre for Precision Health, University of South Australia Cancer Research Institute, School of Health Sciences, University of South Australia, Adelaide, SA, Australia
| | - Tian Lin
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia
| | - Kelly L Williams
- Centre for MND Research, Faculty of Medicine and Health Sciences, Macquarie University, Sydney, NSW 2109, Australia
| | - Anna Vinkhuyzen
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia
| | | | - Zhihong Zhu
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia
| | - Nigel G Laing
- Centre for Medical Research, University of Western Australia, Nedlands, WA, Australia.,Harry Perkins Institute of Medical Research, Nedlands, WA, Australia
| | - Karen A Mather
- Centre for Healthy Brain Ageing, School of Psychiatry, University of New South Wales, Sydney, NSW, Australia.,Neuroscience Research Australia Institute, Randwick, NSW, Australia
| | - Perminder S Sachdev
- Centre for Healthy Brain Ageing, School of Psychiatry, University of New South Wales, Sydney, NSW, Australia.,Neuropsychiatric Institute, The Prince of Wales Hospital, UNSW, Randwick, NSW, Australia
| | - Shyuan T Ngo
- Queensland Brain Institute, The University of Queensland, Brisbane, QLD, Australia.,The Australian Institute for Bioengineering and Nanotechnology, The University of Queensland, Brisbane, QLD, Australia.,Centre for Clinical Research, The University of Queensland, Brisbane, QLD, Australia
| | - Frederik J Steyn
- Centre for Clinical Research, The University of Queensland, Brisbane, QLD, Australia.,School of Biomedical Sciences, The University of Queensland, Brisbane, QLD, Australia
| | - Leanne Wallace
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia
| | - Anjali K Henders
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia
| | - Peter M Visscher
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia.,Queensland Brain Institute, The University of Queensland, Brisbane, QLD, Australia
| | - Merrilee Needham
- Fiona Stanley Hospital, Perth, WA, Australia.,Notre Dame University, Fremantle, WA, Australia.,Institute for Immunology and Infectious Diseases, Murdoch University, Perth, WA, Australia
| | - Susan Mathers
- Calvary Health Care Bethlehem, Parkdale, VIC, Australia
| | - Garth Nicholson
- Centre for MND Research, Faculty of Medicine and Health Sciences, Macquarie University, Sydney, NSW 2109, Australia.,ANZAC Research Institute, Concord Repatriation General Hospital, Sydney, NSW, Australia
| | - Dominic B Rowe
- Centre for MND Research, Faculty of Medicine and Health Sciences, Macquarie University, Sydney, NSW 2109, Australia
| | - Robert D Henderson
- Queensland Brain Institute, The University of Queensland, Brisbane, QLD, Australia.,The Australian Institute for Bioengineering and Nanotechnology, The University of Queensland, Brisbane, QLD, Australia.,Department of Neurology, Royal Brisbane and Women's Hospital, Brisbane, QLD, Australia
| | - Pamela A McCombe
- Centre for Clinical Research, The University of Queensland, Brisbane, QLD, Australia.,Department of Neurology, Royal Brisbane and Women's Hospital, Brisbane, QLD, Australia
| | - Roger Pamphlett
- Discipline of Pathology and Department of Neuropathology, Brain and Mind Centre, The University of Sydney, Sydney, NSW, Australia
| | - Ian P Blair
- Centre for MND Research, Faculty of Medicine and Health Sciences, Macquarie University, Sydney, NSW 2109, Australia
| | - Naomi R Wray
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia.,Queensland Brain Institute, The University of Queensland, Brisbane, QLD, Australia
| | - Allan F McRae
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia.
| |
Collapse
|
65
|
Cai M, Xiao J, Zhang S, Wan X, Zhao H, Chen G, Yang C. A unified framework for cross-population trait prediction by leveraging the genetic correlation of polygenic traits. Am J Hum Genet 2021; 108:632-655. [PMID: 33770506 PMCID: PMC8059341 DOI: 10.1016/j.ajhg.2021.03.002] [Citation(s) in RCA: 60] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2020] [Accepted: 03/01/2021] [Indexed: 12/29/2022] Open
Abstract
The development of polygenic risk scores (PRSs) has proved useful to stratify the general European population into different risk groups. However, PRSs are less accurate in non-European populations due to genetic differences across different populations. To improve the prediction accuracy in non-European populations, we propose a cross-population analysis framework for PRS construction with both individual-level (XPA) and summary-level (XPASS) GWAS data. By leveraging trans-ancestry genetic correlation, our methods can borrow information from the Biobank-scale European population data to improve risk prediction in the non-European populations. Our framework can also incorporate population-specific effects to further improve construction of PRS. With innovations in data structure and algorithm design, our methods provide a substantial saving in computational time and memory usage. Through comprehensive simulation studies, we show that our framework provides accurate, efficient, and robust PRS construction across a range of genetic architectures. In a Chinese cohort, our methods achieved 7.3%-198.0% accuracy gain for height and 19.5%-313.3% accuracy gain for body mass index (BMI) in terms of predictive R2 compared to existing PRS approaches. We also show that XPA and XPASS can achieve substantial improvement for construction of height PRSs in the African population, suggesting the generality of our framework across global populations.
Collapse
Affiliation(s)
- Mingxuan Cai
- Guangzhou HKUST Fok Ying Tung Research Institute, Guangzhou 511458, China; Department of Mathematics, The Hong Kong University of Science and Technology, Hong Kong SAR, China
| | - Jiashun Xiao
- Guangzhou HKUST Fok Ying Tung Research Institute, Guangzhou 511458, China; Department of Mathematics, The Hong Kong University of Science and Technology, Hong Kong SAR, China
| | - Shunkang Zhang
- Guangzhou HKUST Fok Ying Tung Research Institute, Guangzhou 511458, China; Department of Mathematics, The Hong Kong University of Science and Technology, Hong Kong SAR, China
| | - Xiang Wan
- Shenzhen Research Institute of Big Data, Shenzhen 518172, China
| | - Hongyu Zhao
- SJTU-Yale Joint Center for Biostatistics and Data Science, Shanghai Jiao Tong University, Shanghai 201111, China; Department of Biostatistics, Yale School of Public Health, New Haven, CT 06510, USA
| | - Gang Chen
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, China.
| | - Can Yang
- Guangzhou HKUST Fok Ying Tung Research Institute, Guangzhou 511458, China; Department of Mathematics, The Hong Kong University of Science and Technology, Hong Kong SAR, China.
| |
Collapse
|
66
|
Facal F, Flórez G, Blanco V, Rodríguez J, Pereiro C, Fernández JM, Fariñas E, Estévez V, Gómez-Trigo J, Gurriarán X, Sáiz P, Vázquez FL, Arrojo M, Costas J. Genetic predisposition to alcohol dependence: The combined role of polygenic risk to general psychopathology and to high alcohol consumption. Drug Alcohol Depend 2021; 221:108556. [PMID: 33561667 DOI: 10.1016/j.drugalcdep.2021.108556] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/07/2020] [Revised: 11/25/2020] [Accepted: 01/04/2021] [Indexed: 11/18/2022]
Abstract
BACKGROUND High alcohol consumption and alcohol dependence are only partly genetically correlated and they differ considerably in their correlations with other traits. The existence of genetic correlation among alcohol dependence and psychiatric disorders may be attributed to the presence of a general psychopathology factor, the p factor. This study investigates the relationship of polygenic risk to general psychopathology and to high alcohol consumption on alcohol dependence. METHODS Participants were 524 alcohol-dependent patients and 729 controls. Polygenic risk scores (PRS) were computed for alcohol consumption (drinks per week) and nine psychiatric disorders. Principal component analysis (PCA) applied to the psychiatric PRS was used to calculate the first principal component as a proxy of the polygenic p factor. RESULTS Both the polygenic p factor and the drinks per week PRS were associated with alcohol dependence in our sample. Both variables are only weakly correlated, contributing additively to the risk for alcohol dependence. Sensitivity analyses showed that the polygenic p factor was also associated with alcohol dependence in the subset of patients without any psychiatric or substance use comorbidity. CONCLUSIONS Polygenic risk for alcohol dependence can be split at least into two components, involved in general psychopathology and high alcohol consumption. The first component of PCA based on PRS for different psychiatric disorders allows estimation of the contribution of the polygenic p factor to alcohol dependence. The pleiotropic effects of genetic variants across psychiatric disorders are mainly manifested as alcohol dependence in some patients.
Collapse
Affiliation(s)
- Fernando Facal
- Instituto de Investigación Sanitaria (IDIS) de Santiago de Compostela, Complexo Hospitalario Universitario de Santiago de Compostela (CHUS), Servizo Galego de Saúde (SERGAS), Santiago de Compostela, Galicia, Spain; Servizo de Psiquiatría, Complexo Hospitalario Universitario de Santiago de Compostela, Servizo Galego de Saúde (SERGAS), Santiago de Compostela, Galicia, Spain; Universidade de Santiago de Compostela (USC), Galicia, Spain
| | - Gerardo Flórez
- Unidade de Conductas Adictivas, Servizo de Psiquiatría, Complexo Hospitalario Universitario de Ourense (CHUO), Servizo Galego de Saúde (SERGAS), Ourense, Galicia, Spain
| | - Vanessa Blanco
- Department of Evolutionary and Educational Psychology, Universidade de Santiago de Compostela, Santiago de Compostela, Galicia, Spain
| | - Julio Rodríguez
- Instituto de Investigación Sanitaria (IDIS) de Santiago de Compostela, Complexo Hospitalario Universitario de Santiago de Compostela (CHUS), Servizo Galego de Saúde (SERGAS), Santiago de Compostela, Galicia, Spain
| | - César Pereiro
- Unidade Asistencial de Drogodependencias (ACLAD), A Coruña, Galicia, Spain
| | | | - Emilio Fariñas
- Unidade Municipal de Atención a Drogodependientes (UMAD), Santiago de Compostela, Galicia, Spain
| | - Valentín Estévez
- Unidade de Conductas Adictivas, Servizo de Psiquiatría, Complexo Hospitalario Universitario de Ourense (CHUO), Servizo Galego de Saúde (SERGAS), Ourense, Galicia, Spain
| | - Jesús Gómez-Trigo
- Servizo de Psiquiatría, Complexo Hospitalario Universitario de Santiago de Compostela, Servizo Galego de Saúde (SERGAS), Santiago de Compostela, Galicia, Spain; Unidade de Conductas Adictivas, Servizo de Psiquiatría, Complexo Hospitalario Universitario de Ourense (CHUO), Servizo Galego de Saúde (SERGAS), Ourense, Galicia, Spain
| | - Xaquín Gurriarán
- Instituto de Investigación Sanitaria (IDIS) de Santiago de Compostela, Complexo Hospitalario Universitario de Santiago de Compostela (CHUS), Servizo Galego de Saúde (SERGAS), Santiago de Compostela, Galicia, Spain
| | - Pilar Sáiz
- Department of Psychiatry, Universidad of Oviedo, Centro de Investigación Biomédica en Red de Salud Mental (CIBERSAM), Instituto de Investigación Sanitaria del Principado de Asturias (ISPA), Servicio de Salud del Principado de Asturias (SESPA), Oviedo, Spain
| | - Fernando Lino Vázquez
- Department of Clinical Psychology and Psychobiology, Universidade de Santiago de Compostela, Santiago de Compostela, Galicia, Spain
| | - Manuel Arrojo
- Instituto de Investigación Sanitaria (IDIS) de Santiago de Compostela, Complexo Hospitalario Universitario de Santiago de Compostela (CHUS), Servizo Galego de Saúde (SERGAS), Santiago de Compostela, Galicia, Spain; Servizo de Psiquiatría, Complexo Hospitalario Universitario de Santiago de Compostela, Servizo Galego de Saúde (SERGAS), Santiago de Compostela, Galicia, Spain
| | - Javier Costas
- Instituto de Investigación Sanitaria (IDIS) de Santiago de Compostela, Complexo Hospitalario Universitario de Santiago de Compostela (CHUS), Servizo Galego de Saúde (SERGAS), Santiago de Compostela, Galicia, Spain.
| |
Collapse
|
67
|
Nabais MF, Laws SM, Lin T, Vallerga CL, Armstrong NJ, Blair IP, Kwok JB, Mather KA, Mellick GD, Sachdev PS, Wallace L, Henders AK, Zwamborn RAJ, Hop PJ, Lunnon K, Pishva E, Roubroeks JAY, Soininen H, Tsolaki M, Mecocci P, Lovestone S, Kłoszewska I, Vellas B, Furlong S, Garton FC, Henderson RD, Mathers S, McCombe PA, Needham M, Ngo ST, Nicholson G, Pamphlett R, Rowe DB, Steyn FJ, Williams KL, Anderson TJ, Bentley SR, Dalrymple-Alford J, Fowder J, Gratten J, Halliday G, Hickie IB, Kennedy M, Lewis SJG, Montgomery GW, Pearson J, Pitcher TL, Silburn P, Zhang F, Visscher PM, Yang J, Stevenson AJ, Hillary RF, Marioni RE, Harris SE, Deary IJ, Jones AR, Shatunov A, Iacoangeli A, van Rheenen W, van den Berg LH, Shaw PJ, Shaw CE, Morrison KE, Al-Chalabi A, Veldink JH, Hannon E, Mill J, Wray NR, McRae AF. Meta-analysis of genome-wide DNA methylation identifies shared associations across neurodegenerative disorders. Genome Biol 2021; 22:90. [PMID: 33771206 PMCID: PMC8004462 DOI: 10.1186/s13059-021-02275-5] [Citation(s) in RCA: 44] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2020] [Accepted: 01/20/2021] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND People with neurodegenerative disorders show diverse clinical syndromes, genetic heterogeneity, and distinct brain pathological changes, but studies report overlap between these features. DNA methylation (DNAm) provides a way to explore this overlap and heterogeneity as it is determined by the combined effects of genetic variation and the environment. In this study, we aim to identify shared blood DNAm differences between controls and people with Alzheimer's disease, amyotrophic lateral sclerosis, and Parkinson's disease. RESULTS We use a mixed-linear model method (MOMENT) that accounts for the effect of (un)known confounders, to test for the association of each DNAm site with each disorder. While only three probes are found to be genome-wide significant in each MOMENT association analysis of amyotrophic lateral sclerosis and Parkinson's disease (and none with Alzheimer's disease), a fixed-effects meta-analysis of the three disorders results in 12 genome-wide significant differentially methylated positions. Predicted immune cell-type proportions are disrupted across all neurodegenerative disorders. Protein inflammatory markers are correlated with profile sum-scores derived from disease-associated immune cell-type proportions in a healthy aging cohort. In contrast, they are not correlated with MOMENT DNAm-derived profile sum-scores, calculated using effect sizes of the 12 differentially methylated positions as weights. CONCLUSIONS We identify shared differentially methylated positions in whole blood between neurodegenerative disorders that point to shared pathogenic mechanisms. These shared differentially methylated positions may reflect causes or consequences of disease, but they are unlikely to reflect cell-type proportion differences.
Collapse
Grants
- U24 AG021886 NIA NIH HHS
- U01 AG016976 NIA NIH HHS
- Department of Health
- U01 AG024904 NIA NIH HHS
- 108890/Z/15/Z Wellcome Trust
- 503480 Medical Research Council
- TURNER/OCT15/972-797 Motor Neurone Disease Association
- U01 AG032984 NIA NIH HHS
- 082604/2/07/Z Wellcome Trust
- R01 AG033193 NIA NIH HHS
- R01 HL105756 NHLBI NIH HHS
- MR/R024804/1 Medical Research Council
- National Health and Medical Research Council
- Motor Neurone Disease Research Institute of Australia Ice Bucket Challenge
- Medical Research Council (UK)
- Economic and Social Research Council
- National Institute for Health Research (NIHR)
- the European Community’s Health Seventh Framework Programme
- Horizon 2020 Programme
- MND Association and the Wellcome Trust.
- European Research Council (ERC)
- EU Joint Programme – Neurodegenerative Disease Research ()
- EU Joint Programme - Neurodegenerative Disease Research (JPND)
- Australian Research Council
- Mater Foundation
- ForeFront - NHMRC
- Australian National Health and Medical Research Council
- University of Otago Research Grant, together with financial support from the Jim and Mary Carney Charitable Trust
- Commonwealth Scientific Industrial and research Organization (CSIRO), Edith Cowan University (ECU), Mental Health Research institute (MHRI), National Ageing Research Institute (NARI), Austin Health, CogState Ltd
- National Health and Medical Research Council and the Dementia Collaborative Research Centres program (DCRC2), as well as funding from the Science and Industry Endowment Fund (SIEF) and the Cooperative Research Centre (CRC) for Mental Health – funded throug
- EU Joint Programme - Neurodegenerative Disease Research (JPND), co-funded through the Australian National Health and Medical Research (NHMRC) Council, Motor Neurone Disease Research Institute of Australia Ice Bucket Challenge,
- EU Joint Programme - Neurodegenerative Disease Research (JPND), United Kingdom Medical Research Council, Economic and Social Research Council, Motor Neuro Disease Association (GB), National Institute for Health Research (NIHR) Biomedical Research Centre at
- EU Joint Programme - Neurodegenerative Disease Research (JPND), European Research Council (ERC) under the European Union's Horizon 2020 research and innovation program, PPP Allowance made available by Health~Holland, Top Sector Life Sciences & Health, Unit
- National Health and Medical Research Council, Australian Research Council, Mater Foundation,
- Australian National Health and Medical Research Council (
- University of Otago Research Grant, Jim and Mary Carney Charitable Trust
- Commonwealth Scientific Industrial and research Organization (CSIRO), Edith Cowan University (ECU), Mental Health Research institute (MHRI), National Ageing Research Institute (NARI), Austin Health, CogState Ltd., National Health and Medical Research Counc
- EFPIA companies and SMEs as part of InnoMed (Innovative Medicines in Europe), an Integrated Project funded by the European Union of the Sixth Framework program
Collapse
Affiliation(s)
- Marta F Nabais
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, 4072, Australia
- University of Exeter Medical School, RILD Building, RD&E Hospital Wonford, Barrack Road, Exeter, EX2 5DW, UK
| | - Simon M Laws
- School of Medical and Health Sciences, Edith Cowan University, 270 Joondalup Dr, Joondalup, WA, 6027, Australia
| | - Tian Lin
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, 4072, Australia
| | - Costanza L Vallerga
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, 4072, Australia
- Department of Internal Medicine, Erasmus MC, University Medical Center, 3015GD, Rotterdam, The Netherlands
| | | | - Ian P Blair
- Australian Centre for Precision Health, University of South Australia Cancer Research Institute, School of Health Sciences, University of South Australia, Adelaide, SA, 5001, Australia
| | - John B Kwok
- Brain and Mind Centre, Sydney Medical School, The University of Sydney, Sydney, Australia
| | - Karen A Mather
- Centre for Healthy Brain Ageing, School of Psychiatry, University of New South Wales, Sydney, NSW, 2031, Australia
- Neuroscience Research Australia Institute, Randwick, NSW, 2031, Australia
| | - George D Mellick
- Griffith Institute for Drug Discovery (GRIDD), Griffith University, Brisbane, Australia
| | - Perminder S Sachdev
- Centre for Healthy Brain Ageing, School of Psychiatry, University of New South Wales, Sydney, NSW, 2031, Australia
- Neuropsychiatric Institute, The Prince of Wales Hospital, UNSW, Randwick, NSW, 2031, Australia
| | - Leanne Wallace
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, 4072, Australia
| | - Anjali K Henders
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, 4072, Australia
| | - Ramona A J Zwamborn
- Department of Neurology, University Medical Center Utrecht Brain Center, Utrecht University, Utrecht, The Netherlands
| | - Paul J Hop
- Department of Neurology, University Medical Center Utrecht Brain Center, Utrecht University, Utrecht, The Netherlands
| | - Katie Lunnon
- University of Exeter Medical School, RILD Building, RD&E Hospital Wonford, Barrack Road, Exeter, EX2 5DW, UK
| | - Ehsan Pishva
- University of Exeter Medical School, RILD Building, RD&E Hospital Wonford, Barrack Road, Exeter, EX2 5DW, UK
| | - Janou A Y Roubroeks
- University of Exeter Medical School, RILD Building, RD&E Hospital Wonford, Barrack Road, Exeter, EX2 5DW, UK
| | - Hilkka Soininen
- Institute of Clinical Medicine, Neurology, University of Eastern Finland, Kuopio, Finland
| | - Magda Tsolaki
- 1st Department of Neurology, Memory and Dementia Unit, Aristotle University of Thessaloniki, Thessaloniki, Greece
| | - Patrizia Mecocci
- Department of Medicine, Institute of Gerontology and Geriatrics, University of Perugia, Perugia, Italy
| | - Simon Lovestone
- Department of Psychiatry, Warneford Hospital, University of Oxford, Oxford, UK
| | | | - Bruno Vellas
- INSERM U 558, University of Toulouse, Toulouse, France
| | - Sarah Furlong
- Centre for Motor Neuron Disease Research, Macquarie University, Sydney, NSW, 2109, Australia
| | - Fleur C Garton
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, 4072, Australia
| | - Robert D Henderson
- Queensland Brain Institute, The University of Queensland, Brisbane, QLD, 4072, Australia
- Centre for Clinical Research, The University of Queensland, Brisbane, QLD, 4019, Australia
- Department of Neurology, Royal Brisbane and Women's Hospital, Brisbane, QLD, 4029, Australia
| | - Susan Mathers
- Calvary Health Care Bethlehem, Parkdale, VIC, 3195, Australia
| | - Pamela A McCombe
- Centre for Clinical Research, The University of Queensland, Brisbane, QLD, 4019, Australia
- Department of Neurology, Royal Brisbane and Women's Hospital, Brisbane, QLD, 4029, Australia
| | - Merrilee Needham
- Fiona Stanley Hospital, Perth, WA, 6150, Australia
- Notre Dame University, Fremantle, WA, 6160, Australia
- Institute for Immunology and Infectious Diseases, Murdoch University, Perth, WA, 6150, Australia
| | - Shyuan T Ngo
- Queensland Brain Institute, The University of Queensland, Brisbane, QLD, 4072, Australia
- Centre for Clinical Research, The University of Queensland, Brisbane, QLD, 4019, Australia
- The Australian Institute for Bioengineering and Nanotechnology, The University of Queensland, Brisbane, QLD, 4072, Australia
| | - Garth Nicholson
- ANZAC Research Institute, Concord Repatriation General Hospital, Sydney, NSW, 2139, Australia
| | - Roger Pamphlett
- Discipline of Pathology and Department of Neuropathology, Brain and Mind Centre, The University of Sydney, Sydney, NSW, 2050, Australia
| | - Dominic B Rowe
- Centre for Motor Neuron Disease Research, Macquarie University, Sydney, NSW, 2109, Australia
| | - Frederik J Steyn
- Department of Neurology, Royal Brisbane and Women's Hospital, Brisbane, QLD, 4029, Australia
- School of Biomedical Sciences, The University of Queensland, Brisbane, QLD, 4072, Australia
| | - Kelly L Williams
- Centre for Motor Neuron Disease Research, Macquarie University, Sydney, NSW, 2109, Australia
| | - Tim J Anderson
- New Zealand Brain Research Institute, Christchurch, New Zealand
- Department of Medicine, University of Otago, Christchurch, New Zealand
| | - Steven R Bentley
- Eskitis Institute for Drug Discovery, Griffith University, Brisbane, Australia
| | - John Dalrymple-Alford
- New Zealand Brain Research Institute, Christchurch, New Zealand
- School of Psychology, Speech and Hearing, University of Canterbury, Christchurch, New Zealand
| | - Javed Fowder
- Griffith Institute for Drug Discovery (GRIDD), Griffith University, Brisbane, Australia
| | - Jacob Gratten
- Mater Research, Translational Research Institute, Brisbane, Australia
- Mater Research Institute, The University of Queensland, Brisbane, Australia
| | - Glenda Halliday
- Brain and Mind Research Centre, Sydney Medical School, The University of Sydney, Sydney, Australia
| | - Ian B Hickie
- Brain and Mind Research Centre, Sydney Medical School, The University of Sydney, Sydney, Australia
| | - Martin Kennedy
- Department of Pathology and Biomedical Science, University of Otago, Christchurch, New Zealand
| | - Simon J G Lewis
- Brain and Mind Research Centre, Sydney Medical School, The University of Sydney, Sydney, Australia
| | - Grant W Montgomery
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, 4072, Australia
| | - John Pearson
- Department of Pathology, University of Otago, Christchurch, New Zealand
| | - Toni L Pitcher
- New Zealand Brain Research Institute, Christchurch, New Zealand
- Department of Medicine, University of Otago, Christchurch, New Zealand
| | - Peter Silburn
- Queensland Brain Institute, The University of Queensland, Brisbane, QLD, 4072, Australia
| | - Futao Zhang
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, 4072, Australia
| | - Peter M Visscher
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, 4072, Australia
| | - Jian Yang
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, 4072, Australia
- School of Life Sciences, Westlake University, Hangzhou, Zhejiang, China
- Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, Zhejiang, China
| | - Anna J Stevenson
- Centre for Genomic and Experimental Medicine, Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, EH4 2XU, UK
| | - Robert F Hillary
- Centre for Genomic and Experimental Medicine, Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, EH4 2XU, UK
| | - Riccardo E Marioni
- Centre for Genomic and Experimental Medicine, Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, EH4 2XU, UK
| | - Sarah E Harris
- Department of Psychology, Lothian Birth Cohorts group, University of Edinburgh, 7 George Square, Edinburgh, EH8 9JZ, UK
| | - Ian J Deary
- Department of Psychology, Lothian Birth Cohorts group, University of Edinburgh, 7 George Square, Edinburgh, EH8 9JZ, UK
| | - Ashley R Jones
- Department of Basic and Clinical Neuroscience, King's College London, Institute of Psychiatry, Psychology and Neuroscience, London, SE5 9RX, UK
| | - Aleksey Shatunov
- Department of Basic and Clinical Neuroscience, King's College London, Institute of Psychiatry, Psychology and Neuroscience, London, SE5 9RX, UK
| | - Alfredo Iacoangeli
- Department of Basic and Clinical Neuroscience, King's College London, Institute of Psychiatry, Psychology and Neuroscience, London, SE5 9RX, UK
| | - Wouter van Rheenen
- Department of Neurology, University Medical Center Utrecht Brain Center, Utrecht University, Utrecht, The Netherlands
| | - Leonard H van den Berg
- Department of Neurology, University Medical Center Utrecht Brain Center, Utrecht University, Utrecht, The Netherlands
| | | | - Cristopher E Shaw
- Department of Basic and Clinical Neuroscience, King's College London, Institute of Psychiatry, Psychology and Neuroscience, London, SE5 9RX, UK
| | | | - Ammar Al-Chalabi
- Department of Basic and Clinical Neuroscience, King's College London, Institute of Psychiatry, Psychology and Neuroscience, London, SE5 9RX, UK
- King's College Hospital, London, SE5 9RS, UK
| | - Jan H Veldink
- Department of Neurology, University Medical Center Utrecht Brain Center, Utrecht University, Utrecht, The Netherlands
| | - Eilis Hannon
- University of Exeter Medical School, RILD Building, RD&E Hospital Wonford, Barrack Road, Exeter, EX2 5DW, UK
| | - Jonathan Mill
- University of Exeter Medical School, RILD Building, RD&E Hospital Wonford, Barrack Road, Exeter, EX2 5DW, UK
- Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, SE5 8AF, UK
| | - Naomi R Wray
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, 4072, Australia
- Queensland Brain Institute, The University of Queensland, Brisbane, QLD, 4072, Australia
| | - Allan F McRae
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, 4072, Australia.
| |
Collapse
|
68
|
Selle ML, Steinsland I, Lindgren F, Brajkovic V, Cubric-Curik V, Gorjanc G. Hierarchical Modelling of Haplotype Effects on a Phylogeny. Front Genet 2021; 11:531218. [PMID: 33519886 PMCID: PMC7844322 DOI: 10.3389/fgene.2020.531218] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2020] [Accepted: 12/15/2020] [Indexed: 11/13/2022] Open
Abstract
We introduce a hierarchical model to estimate haplotype effects based on phylogenetic relationships between haplotypes and their association with observed phenotypes. In a population there are many, but not all possible, distinct haplotypes and few observations per haplotype. Further, haplotype frequencies tend to vary substantially. Such data structure challenge estimation of haplotype effects. However, haplotypes often differ only due to few mutations, and leveraging similarities can improve the estimation of effects. We build on extensive literature and develop an autoregressive model of order one that models haplotype effects by leveraging phylogenetic relationships described with a directed acyclic graph. The phylogenetic relationships can be either in a form of a tree or a network, and we refer to the model as the haplotype network model. The model can be included as a component in a phenotype model to estimate associations between haplotypes and phenotypes. Our key contribution is that we obtain a sparse model, and by using hierarchical autoregression, the flow of information between similar haplotypes is estimated from the data. A simulation study shows that the hierarchical model can improve estimates of haplotype effects compared to an independent haplotype model, especially with few observations for a specific haplotype. We also compared it to a mutation model and observed comparable performance, though the haplotype model has the potential to capture background specific effects. We demonstrate the model with a study of mitochondrial haplotype effects on milk yield in cattle. We provide R code to fit the model with the INLA package.
Collapse
Affiliation(s)
- Maria Lie Selle
- Department of Mathematical Sciences, Norwegian University of Science and Technology (NTNU), Trondheim, Norway
| | - Ingelin Steinsland
- Department of Mathematical Sciences, Norwegian University of Science and Technology (NTNU), Trondheim, Norway
| | - Finn Lindgren
- School of Mathematics, University of Edinburgh, Edinburgh, United Kingdom
| | - Vladimir Brajkovic
- Department of Animal Science, Faculty of Agriculture, University of Zagreb, Zagreb, Croatia
| | - Vlatka Cubric-Curik
- Department of Animal Science, Faculty of Agriculture, University of Zagreb, Zagreb, Croatia
| | - Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Edinburgh, United Kingdom
| |
Collapse
|
69
|
Liang Z, Tan C, Prakapenka D, Ma L, Da Y. Haplotype Analysis of Genomic Prediction Using Structural and Functional Genomic Information for Seven Human Phenotypes. Front Genet 2020; 11:588907. [PMID: 33324447 PMCID: PMC7726221 DOI: 10.3389/fgene.2020.588907] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2020] [Accepted: 10/28/2020] [Indexed: 11/19/2022] Open
Abstract
Genomic prediction using multi-allelic haplotype models improved the prediction accuracy for all seven human phenotypes, the normality transformed high density lipoproteins, low density lipoproteins, total cholesterol, triglycerides, weight, and the original height and body mass index without normality transformation. Eight SNP sets with 40,941-380,705 SNPs were evaluated. The increase in prediction accuracy due to haplotypes was 1.86-8.12%. Haplotypes using fixed chromosome distances had the best prediction accuracy for four phenotypes, fixed number of SNPs for two phenotypes, and gene-based haplotypes for high density lipoproteins and height (tied for best). Haplotypes of coding genes were more accurate than haplotypes of all autosome genes that included both coding and noncoding genes for triglycerides and weight, and nearly the same as haplotypes of all autosome genes for the other phenotypes. Haplotypes of noncoding genes (mostly lncRNAs) only improved the prediction accuracy over the SNP models for high density lipoproteins, total cholesterol, and height. ChIP-seq haplotypes had better prediction accuracy than gene-based haplotypes for total cholesterol, body mass index and low density lipoproteins. The accuracy of ChIP-seq haplotypes was most striking for low density lipoproteins, where all four haplotype models with ChIP-seq haplotypes had similarly high prediction accuracy over the best prediction model with gene-based haplotypes. Haplotype epistasis was shown to be the reason for the increased accuracy due to haplotypes. Low density lipoproteins had the largest haplotype epistasis heritability that explained 14.70% of the phenotypic variance and was 31.27% of the SNP additive heritability, and the largest increase in prediction accuracy relative to the best SNP model (8.12%). Relative to the SNP additive heritability of the same regions, noncoding genes had the highest haplotype epistasis heritability, followed by coding genes and ChIP-seq for the seven phenotypes. SNP and haplotype heritability profiles showed that the integration of SNP and haplotype additive values compensated the weakness of haplotypes in estimating SNP heritabilities for four phenotypes, whereas models with haplotype additive values fully accounted for SNP additive values for three phenotypes. These results showed that haplotype analysis can be a method to utilize functional and structural genomic information to improve the accuracy of genomic prediction.
Collapse
Affiliation(s)
- Zuoxiang Liang
- Department of Animal Science, University of Minnesota, Saint Paul, MN, United States
| | - Cheng Tan
- Department of Animal Science, University of Minnesota, Saint Paul, MN, United States
- National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, China
| | - Dzianis Prakapenka
- Department of Animal Science, University of Minnesota, Saint Paul, MN, United States
| | - Li Ma
- Department of Animal and Avian Sciences, University of Maryland, College Park, MD, United States
| | - Yang Da
- Department of Animal Science, University of Minnesota, Saint Paul, MN, United States
| |
Collapse
|
70
|
Waldman ID, Poore HE, Luningham JM, Yang J. Testing structural models of psychopathology at the genomic level. World Psychiatry 2020; 19:350-359. [PMID: 32931100 PMCID: PMC7491626 DOI: 10.1002/wps.20772] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Genome-wide association studies (GWAS) have revealed hundreds of genetic loci associated with the vulnerability to major psychiatric disorders, and post-GWAS analyses have shown substantial genetic correlations among these disorders. This evidence supports the existence of a higher-order structure of psychopathology at both the genetic and phenotypic levels. Despite recent efforts by collaborative consortia such as the Hierarchical Taxonomy of Psychopathology (HiTOP), this structure remains unclear. In this study, we tested multiple alternative structural models of psychopathology at the genomic level, using the genetic correlations among fourteen psychiatric disorders and related psychological traits estimated from GWAS summary statistics. The best-fitting model included four correlated higher-order factors - externalizing, internalizing, thought problems, and neurodevelopmental disorders - which showed distinct patterns of genetic correlations with external validity variables and accounted for substantial genetic variance in their constituent disorders. A bifactor model including a general factor of psychopathology as well as the four specific factors fit worse than the above model. Several model modifications were tested to explore the placement of some disorders - such as bipolar disorder, obsessive-compulsive disorder, and eating disorders - within the broader psychopathology structure. The best-fitting model indicated that eating disorders and obsessive-compulsive disorder, on the one hand, and bipolar disorder and schizophrenia, on the other, load together on the same thought problems factor. These findings provide support for several of the HiTOP higher-order dimensions and suggest a similar structure of psychopathology at the genomic and phenotypic levels.
Collapse
Affiliation(s)
| | | | - Justin M. Luningham
- Department of Population Health SciencesGeorgia State UniversityAtlantaGAUSA
| | - Jingjing Yang
- Department of ‐Human GeneticsEmory University School of MedicineAtlantaGAUSA
| |
Collapse
|
71
|
Choi SW, Mak TSH, O'Reilly PF. Tutorial: a guide to performing polygenic risk score analyses. Nat Protoc 2020; 15:2759-2772. [PMID: 32709988 PMCID: PMC7612115 DOI: 10.1038/s41596-020-0353-1] [Citation(s) in RCA: 879] [Impact Index Per Article: 175.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2018] [Accepted: 05/05/2020] [Indexed: 02/08/2023]
Abstract
A polygenic score (PGS) or polygenic risk score (PRS) is an estimate of an individual's genetic liability to a trait or disease, calculated according to their genotype profile and relevant genome-wide association study (GWAS) data. While present PRSs typically explain only a small fraction of trait variance, their correlation with the single largest contributor to phenotypic variation-genetic liability-has led to the routine application of PRSs across biomedical research. Among a range of applications, PRSs are exploited to assess shared etiology between phenotypes, to evaluate the clinical utility of genetic data for complex disease and as part of experimental studies in which, for example, experiments are performed that compare outcomes (e.g., gene expression and cellular response to treatment) between individuals with low and high PRS values. As GWAS sample sizes increase and PRSs become more powerful, PRSs are set to play a key role in research and stratified medicine. However, despite the importance and growing application of PRSs, there are limited guidelines for performing PRS analyses, which can lead to inconsistency between studies and misinterpretation of results. Here, we provide detailed guidelines for performing and interpreting PRS analyses. We outline standard quality control steps, discuss different methods for the calculation of PRSs, provide an introductory online tutorial, highlight common misconceptions relating to PRS results, offer recommendations for best practice and discuss future challenges.
Collapse
Affiliation(s)
- Shing Wan Choi
- MRC Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK
- Department of Genetics and Genomic Sciences, Icahn School of Medicine, Mount Sinai, New York, NY, USA
| | | | - Paul F O'Reilly
- MRC Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK.
- Department of Genetics and Genomic Sciences, Icahn School of Medicine, Mount Sinai, New York, NY, USA.
| |
Collapse
|
72
|
Polygenic risk score as clinical utility in psychiatry: a clinical viewpoint. J Hum Genet 2020; 66:53-60. [PMID: 32770057 DOI: 10.1038/s10038-020-0814-y] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2020] [Revised: 06/21/2020] [Accepted: 07/19/2020] [Indexed: 01/06/2023]
Abstract
Genome-wide association studies (GWASs) have detected many susceptible variants for common diseases, including psychiatric disorders. However, because of the small effect size of each variant, clinical utility that aims for risk prediction and/or diagnostic assistance based on the individual "variants" is difficult to use. Therefore, to improve the statistical power, polygenic risk score (PRS) has been established and applied in the GWAS as a robust analytic tool. Although PRS has potential predictive ability, because of its current "insufficient" discriminative power at the individual level for clinical use, it remains limited solely in the research area, specifically in the psychiatric field. For a better understanding of the PRS, in this review, we (1) introduce the clinical features of psychiatric disorders, (2) summarize the recent GWAS/PRS findings in the psychiatric disorders, (3) evaluate the problems of PRS, and (4) propose its possible utility to apply PRS into the psychiatric clinical setting.
Collapse
|
73
|
Abstract
Since the initial success of genome-wide association studies (GWAS) in 2005, tens of thousands of genetic variants have been identified for hundreds of human diseases and traits. In a GWAS, genotype information at up to millions of genetic markers is collected from up to hundreds of thousands of individuals, together with their phenotype information. Several scientific goals can be accomplished through the analysis of GWAS data, including the identification of variants, genes, and pathways associated with diseases and traits of interest; the inference of the genetic architecture of these traits; and the development of genetic risk prediction models. In this review, we provide an overview of the statistical challenges in achieving these goals and recent progress in statistical methodology to address these challenges.
Collapse
Affiliation(s)
- Ning Sun
- Department of Biostatistics, Yale School of Public Health, New Haven, Connecticut 06520, USA
| | - Hongyu Zhao
- Department of Biostatistics, Yale School of Public Health, New Haven, Connecticut 06520, USA
| |
Collapse
|
74
|
Efficient polygenic risk scores for biobank scale data by exploiting phenotypes from inferred relatives. Nat Commun 2020; 11:3074. [PMID: 32555176 PMCID: PMC7299943 DOI: 10.1038/s41467-020-16829-x] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2020] [Accepted: 05/25/2020] [Indexed: 01/06/2023] Open
Abstract
Polygenic risk scores are emerging as a potentially powerful tool to predict future phenotypes of target individuals, typically using unrelated individuals, thereby devaluing information from relatives. Here, for 50 traits from the UK Biobank data, we show that a design of 5,000 individuals with first-degree relatives of target individuals can achieve a prediction accuracy similar to that of around 220,000 unrelated individuals (mean prediction accuracy = 0.26 vs. 0.24, mean fold-change = 1.06 (95% CI: 0.99-1.13), P-value = 0.08), despite a 44-fold difference in sample size. For lifestyle traits, the prediction accuracy with 5,000 individuals including first-degree relatives of target individuals is significantly higher than that with 220,000 unrelated individuals (mean prediction accuracy = 0.22 vs. 0.16, mean fold-change = 1.40 (1.17-1.62), P-value = 0.025). Our findings suggest that polygenic prediction integrating family information may help to accelerate precision health and clinical intervention. Genetic data from large cohorts of unrelated individuals can be used to create polygenic risk scores, which could be used to predict individual risk of developing a specific disease. Here the authors show that smaller cohorts of related individuals can provide similarly powerful predictive ability.
Collapse
|
75
|
Nguyen TH, Dobbyn A, Brown RC, Riley BP, Buxbaum JD, Pinto D, Purcell SM, Sullivan PF, He X, Stahl EA. mTADA is a framework for identifying risk genes from de novo mutations in multiple traits. Nat Commun 2020; 11:2929. [PMID: 32522981 PMCID: PMC7287090 DOI: 10.1038/s41467-020-16487-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2018] [Accepted: 05/06/2020] [Indexed: 11/12/2022] Open
Abstract
Joint analysis of multiple traits can result in the identification of associations not found through the analysis of each trait in isolation. Studies of neuropsychiatric disorders and congenital heart disease (CHD) which use de novo mutations (DNMs) from parent-offspring trios have reported multiple putatively causal genes. However, a joint analysis method designed to integrate DNMs from multiple studies has yet to be implemented. We here introduce multiple-trait TADA (mTADA) which jointly analyzes two traits using DNMs from non-overlapping family samples. We first demonstrate that mTADA is able to leverage genetic overlaps to increase the statistical power of risk-gene identification. We then apply mTADA to large datasets of >13,000 trios for five neuropsychiatric disorders and CHD. We report additional risk genes for schizophrenia, epileptic encephalopathies and CHD. We outline some shared and specific biological information of intellectual disability and CHD by conducting systems biology analyses of genes prioritized by mTADA.
Collapse
Affiliation(s)
- Tan-Hoang Nguyen
- Division of Psychiatric Genomics, Department of Genetics and Genomic Sciences, Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
- Virginia Institute for Psychiatric and Behavioral Genetics, Department of Psychiatry, Virginia Commonwealth University, Richmond, VA, USA.
| | - Amanda Dobbyn
- Division of Psychiatric Genomics, Department of Genetics and Genomic Sciences, Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Ruth C Brown
- Virginia Institute for Psychiatric and Behavioral Genetics, Department of Psychiatry, Virginia Commonwealth University, Richmond, VA, USA
| | - Brien P Riley
- Virginia Institute for Psychiatric and Behavioral Genetics, Department of Psychiatry, Virginia Commonwealth University, Richmond, VA, USA
| | - Joseph D Buxbaum
- Seaver Autism Center, Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Dalila Pinto
- Seaver Autism Center, Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- The Mindich Child Health & Development Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Shaun M Purcell
- Sleep Center, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Patrick F Sullivan
- Departments of Genetics and Psychiatry, University of North Carolina, Chapel Hill, NC, USA
| | - Xin He
- Department of Human Genetics, University of Chicago, Chicago, IL, USA.
- Grossman Institute for Neuroscience, Quantitative Biology and Human Behavior, University of Chicago, Chicago, IL, USA.
| | - Eli A Stahl
- Division of Psychiatric Genomics, Department of Genetics and Genomic Sciences, Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| |
Collapse
|
76
|
Shafquat A, Crystal RG, Mezey JG. Identifying novel associations in GWAS by hierarchical Bayesian latent variable detection of differentially misclassified phenotypes. BMC Bioinformatics 2020; 21:178. [PMID: 32381021 PMCID: PMC7204256 DOI: 10.1186/s12859-020-3387-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2019] [Accepted: 01/24/2020] [Indexed: 12/22/2022] Open
Abstract
BACKGROUND Heterogeneity in the definition and measurement of complex diseases in Genome-Wide Association Studies (GWAS) may lead to misdiagnoses and misclassification errors that can significantly impact discovery of disease loci. While well appreciated, almost all analyses of GWAS data consider reported disease phenotype values as is without accounting for potential misclassification. RESULTS Here, we introduce Phenotype Latent variable Extraction of disease misdiagnosis (PheLEx), a GWAS analysis framework that learns and corrects misclassified phenotypes using structured genotype associations within a dataset. PheLEx consists of a hierarchical Bayesian latent variable model, where inference of differential misclassification is accomplished using filtered genotypes while implementing a full mixed model to account for population structure and genetic relatedness in study populations. Through simulations, we show that the PheLEx framework dramatically improves recovery of the correct disease state when considering realistic allele effect sizes compared to existing methodologies designed for Bayesian recovery of disease phenotypes. We also demonstrate the potential of PheLEx for extracting new potential loci from existing GWAS data by analyzing bipolar disorder and epilepsy phenotypes available from the UK Biobank. From the PheLEx analysis of these data, we identified new candidate disease loci not previously reported for these datasets that have value for supplemental hypothesis generation. CONCLUSION PheLEx shows promise in reanalyzing GWAS datasets to provide supplemental candidate loci that are ignored by traditional GWAS analysis methodologies.
Collapse
Affiliation(s)
- Afrah Shafquat
- Department of Computational Biology, Cornell University, Ithaca, NY USA
| | - Ronald G. Crystal
- Department of Genetic Medicine, Weill Cornell Medicine, New York, NY USA
- Department of Medicine, Weill Cornell Medicine, New York, NY USA
| | - Jason G. Mezey
- Department of Computational Biology, Cornell University, Ithaca, NY USA
- Department of Genetic Medicine, Weill Cornell Medicine, New York, NY USA
| |
Collapse
|
77
|
Merikangas KR, Merikangas AK. Harnessing Progress in Psychiatric Genetics to Advance Population Mental Health. Am J Public Health 2020; 109:S171-S175. [PMID: 31242010 DOI: 10.2105/ajph.2019.304948] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Advances in genomics and neuroscience have ushered in unprecedented opportunities to increase our understanding of the biological underpinnings of mental disorders, yet there has been limited progress in translating knowledge on genetic risk factors to reduce the burden of these conditions in the population. We describe the challenges and opportunities afforded by the growth of large-scale population health databases, progress in genomics, and collaborative efforts in epidemiology and neuroscience to develop informed population-wide interventions for mental disorders. Future progress is likely to benefit from the following efforts: expansion of large collaborative studies of mental disorders to include more systematically ascertained multiethnic samples from biobanks and registries, harmonization of phenotypic characterization in registry and population samples to extend clinical diagnosis to transdiagnostic concepts, systematic investigation of the influences of both specific and nonspecific environmental factors that may combine with genetic susceptibility to confer increased risk of specific mental disorders, and implementation of study designs that can inform gene-environment interactions. Such data can ultimately be combined to develop comprehensive models of risks of, interventions for, and outcomes of mental disorders. With its focus on phenotypic characterization, sampling, study designs, and analytic methods, epidemiology will be central to progress in translating genomics to public health.
Collapse
Affiliation(s)
- Kathleen Ries Merikangas
- Kathleen Ries Merikangas is with the Genetic Epidemiology Research Branch, Division of Intramural Research Program, National Institute of Mental Health, Bethesda, MD. Alison K. Merikangas is with the Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, PA
| | - Alison K Merikangas
- Kathleen Ries Merikangas is with the Genetic Epidemiology Research Branch, Division of Intramural Research Program, National Institute of Mental Health, Bethesda, MD. Alison K. Merikangas is with the Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, PA
| |
Collapse
|
78
|
Beesley LJ, Salvatore M, Fritsche LG, Pandit A, Rao A, Brummett C, Willer CJ, Lisabeth LD, Mukherjee B. The emerging landscape of health research based on biobanks linked to electronic health records: Existing resources, statistical challenges, and potential opportunities. Stat Med 2020; 39:773-800. [PMID: 31859414 PMCID: PMC7983809 DOI: 10.1002/sim.8445] [Citation(s) in RCA: 52] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2018] [Revised: 09/10/2019] [Accepted: 11/16/2019] [Indexed: 01/03/2023]
Abstract
Biobanks linked to electronic health records provide rich resources for health-related research. With improvements in administrative and informatics infrastructure, the availability and utility of data from biobanks have dramatically increased. In this paper, we first aim to characterize the current landscape of available biobanks and to describe specific biobanks, including their place of origin, size, and data types. The development and accessibility of large-scale biorepositories provide the opportunity to accelerate agnostic searches, expedite discoveries, and conduct hypothesis-generating studies of disease-treatment, disease-exposure, and disease-gene associations. Rather than designing and implementing a single study focused on a few targeted hypotheses, researchers can potentially use biobanks' existing resources to answer an expanded selection of exploratory questions as quickly as they can analyze them. However, there are many obvious and subtle challenges with the design and analysis of biobank-based studies. Our second aim is to discuss statistical issues related to biobank research such as study design, sampling strategy, phenotype identification, and missing data. We focus our discussion on biobanks that are linked to electronic health records. Some of the analytic issues are illustrated using data from the Michigan Genomics Initiative and UK Biobank, two biobanks with two different recruitment mechanisms. We summarize the current body of literature for addressing these challenges and discuss some standing open problems. This work complements and extends recent reviews about biobank-based research and serves as a resource catalog with analytical and practical guidance for statisticians, epidemiologists, and other medical researchers pursuing research using biobanks.
Collapse
Affiliation(s)
| | | | | | - Anita Pandit
- University of Michigan, Department of Biostatistics
| | - Arvind Rao
- University of Michigan, Department of Computational Medicine and Bioinformatics
| | - Chad Brummett
- University of Michigan, Department of Anesthesiology
| | - Cristen J. Willer
- University of Michigan, Department of Computational Medicine and Bioinformatics
| | | | | |
Collapse
|
79
|
Song S, Jiang W, Hou L, Zhao H. Leveraging effect size distributions to improve polygenic risk scores derived from summary statistics of genome-wide association studies. PLoS Comput Biol 2020; 16:e1007565. [PMID: 32045423 PMCID: PMC7039528 DOI: 10.1371/journal.pcbi.1007565] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2019] [Revised: 02/24/2020] [Accepted: 11/25/2019] [Indexed: 12/29/2022] Open
Abstract
Genetic risk prediction is an important problem in human genetics, and accurate prediction can facilitate disease prevention and treatment. Calculating polygenic risk score (PRS) has become widely used due to its simplicity and effectiveness, where only summary statistics from genome-wide association studies are needed in the standard method. Recently, several methods have been proposed to improve standard PRS by utilizing external information, such as linkage disequilibrium and functional annotations. In this paper, we introduce EB-PRS, a novel method that leverages information for effect sizes across all the markers to improve prediction accuracy. Compared to most existing genetic risk prediction methods, our method does not need to tune parameters nor external information. Real data applications on six diseases, including asthma, breast cancer, celiac disease, Crohn's disease, Parkinson's disease and type 2 diabetes show that EB-PRS achieved 307.1%, 42.8%, 25.5%, 3.1%, 74.3% and 49.6% relative improvements in terms of predictive r2 over standard PRS method with optimally tuned parameters. Besides, compared to LDpred that makes use of LD information, EB-PRS also achieved 37.9%, 33.6%, 8.6%, 36.2%, 40.6% and 10.8% relative improvements. We note that our method is not the first method leveraging effect size distributions. Here we first justify our method by presenting theoretical optimal property over existing methods in this class of methods, and substantiate our theoretical result with extensive simulation results. The R-package EBPRS that implements our method is available on CRAN.
Collapse
Affiliation(s)
- Shuang Song
- Center for Statistical Science, Tsinghua University, Beijing, China
- Department of Industrial Engineering, Tsinghua University, Beijing, China
| | - Wei Jiang
- Department of Biostatistics, School of Public Health, Yale University, New Haven, Connecticut, United States of America
| | - Lin Hou
- Center for Statistical Science, Tsinghua University, Beijing, China
- Department of Industrial Engineering, Tsinghua University, Beijing, China
| | - Hongyu Zhao
- Department of Biostatistics, School of Public Health, Yale University, New Haven, Connecticut, United States of America
| |
Collapse
|
80
|
White IMS, Hill WG. Effect of heterogeneity in recombination rate on variation in realised relationship. Heredity (Edinb) 2020; 124:28-36. [PMID: 31222091 PMCID: PMC6906284 DOI: 10.1038/s41437-019-0241-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2019] [Revised: 05/21/2019] [Accepted: 05/27/2019] [Indexed: 01/06/2023] Open
Abstract
Individuals of a specified pedigree relationship vary in the proportion of the genome they share identical by descent, i.e. in their realised or actual relationship. Predictions of the variance in realised relationship have previously been based solely on the proportion of the map length shared, which requires the implicit assumption that both recombination rate and genetic information are uniformly distributed along the genome. This ignores the possible existence of recombination hotspots, and fails to distinguish between coding and non-coding sequences. In this paper, we therefore quantify the effects of heterogeneity in recombination rate at broad and fine-scale levels on the variation in realised relationship. Variance is usually greater on a chromosome with a non-uniform recombination rate than on a chromosome with the same map length and uniform recombination rate, especially if recombination rates are higher towards chromosome ends. Reductions in variance can also be obtained, however, and the overall pattern of change is quite complex. In general, local (fine-scale) variation in recombination rate, e.g. hotspots, has a small influence on the variance in realised relationship. Differences in rates across longer regions and between chromosome ends can increase or decrease the variance in a realised relationship, depending on the genomic architecture.
Collapse
Affiliation(s)
- Ian M S White
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Charlotte Auerbach Road, Edinburgh, EH9 3FL, UK.
| | - William G Hill
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Charlotte Auerbach Road, Edinburgh, EH9 3FL, UK
| |
Collapse
|
81
|
Allegrini AG, Cheesman R, Rimfeld K, Selzam S, Pingault J, Eley TC, Plomin R. The p factor: genetic analyses support a general dimension of psychopathology in childhood and adolescence. J Child Psychol Psychiatry 2020; 61:30-39. [PMID: 31541466 PMCID: PMC6906245 DOI: 10.1111/jcpp.13113] [Citation(s) in RCA: 107] [Impact Index Per Article: 21.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 07/09/2019] [Indexed: 12/12/2022]
Abstract
BACKGROUND Diverse behaviour problems in childhood correlate phenotypically, suggesting a general dimension of psychopathology that has been called the p factor. The shared genetic architecture between childhood psychopathology traits also supports a genetic p. This study systematically investigates the manifestation of this common dimension across self-, parent- and teacher-rated measures in childhood and adolescence. METHODS The sample included 7,026 twin pairs from the Twins Early Development Study (TEDS). First, we employed multivariate twin models to estimate common genetic and environmental influences on p based on diverse measures of behaviour problems rated by children, parents and teachers at ages 7, 9, 12 and 16 (depressive traits, emotional problems, peer problems, autism traits, hyperactivity, antisocial behaviour, conduct problems and psychopathic tendencies). Second, to assess the stability of genetic and environmental influences on p across time, we conducted longitudinal twin modelling of the first phenotypic principal components of childhood psychopathological measures across each of the four ages. Third, we created a genetic p factor in 7,026 unrelated genotyped individuals based on eight polygenic scores for psychiatric disorders to estimate how a general polygenic predisposition to mostly adult psychiatric disorders relates to childhood p. RESULTS Behaviour problems were consistently correlated phenotypically and genetically across ages and raters. The p factor is substantially heritable (50%-60%) and manifests consistently across diverse ages and raters. However, residual variation in the common factor models indicates unique contributions as well. Genetic correlations of p components across childhood and adolescence suggest stability over time (49%-78%). A polygenic general psychopathology factor derived from studies of psychiatric disorders consistently predicted a general phenotypic p factor across development (0.3%-0.9%). CONCLUSIONS Diverse forms of psychopathology generally load on a common p factor, which is highly heritable. There are substantial genetic influences on the stability of p across childhood. Our analyses indicate genetic overlap between general risk for psychiatric disorders in adulthood and p in childhood, even as young as age 7. The p factor has far-reaching implications for genomic research and, eventually, for diagnosis and treatment of behaviour problems.
Collapse
Affiliation(s)
- Andrea G. Allegrini
- Social, Genetic and Developmental Psychiatry CentreInstitute of Psychiatry, Psychology and NeuroscienceKing's College LondonLondonUK
| | - Rosa Cheesman
- Social, Genetic and Developmental Psychiatry CentreInstitute of Psychiatry, Psychology and NeuroscienceKing's College LondonLondonUK
| | - Kaili Rimfeld
- Social, Genetic and Developmental Psychiatry CentreInstitute of Psychiatry, Psychology and NeuroscienceKing's College LondonLondonUK
| | - Saskia Selzam
- Social, Genetic and Developmental Psychiatry CentreInstitute of Psychiatry, Psychology and NeuroscienceKing's College LondonLondonUK
| | - Jean‐Baptiste Pingault
- Social, Genetic and Developmental Psychiatry CentreInstitute of Psychiatry, Psychology and NeuroscienceKing's College LondonLondonUK
- Division of Psychology and Language SciencesUniversity College LondonLondonUK
| | - Thalia C. Eley
- Social, Genetic and Developmental Psychiatry CentreInstitute of Psychiatry, Psychology and NeuroscienceKing's College LondonLondonUK
| | - Robert Plomin
- Social, Genetic and Developmental Psychiatry CentreInstitute of Psychiatry, Psychology and NeuroscienceKing's College LondonLondonUK
| |
Collapse
|
82
|
Abraham G, Malik R, Yonova-Doing E, Salim A, Wang T, Danesh J, Butterworth AS, Howson JMM, Inouye M, Dichgans M. Genomic risk score offers predictive performance comparable to clinical risk factors for ischaemic stroke. Nat Commun 2019; 10:5819. [PMID: 31862893 PMCID: PMC6925280 DOI: 10.1038/s41467-019-13848-1] [Citation(s) in RCA: 110] [Impact Index Per Article: 18.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2019] [Accepted: 11/28/2019] [Indexed: 01/17/2023] Open
Abstract
Recent genome-wide association studies in stroke have enabled the generation of genomic risk scores (GRS) but their predictive power has been modest compared to established stroke risk factors. Here, using a meta-scoring approach, we develop a metaGRS for ischaemic stroke (IS) and analyse this score in the UK Biobank (n = 395,393; 3075 IS events by age 75). The metaGRS hazard ratio for IS (1.26, 95% CI 1.22-1.31 per metaGRS standard deviation) doubles that of a previous GRS, identifying a subset of individuals at monogenic levels of risk: the top 0.25% of metaGRS have three-fold risk of IS. The metaGRS is similarly or more predictive compared to several risk factors, such as family history, blood pressure, body mass index, and smoking. We estimate the reductions needed in modifiable risk factors for individuals with different levels of genomic risk and suggest that, for individuals with high metaGRS, achieving risk factor levels recommended by current guidelines may be insufficient to mitigate risk.
Collapse
Affiliation(s)
- Gad Abraham
- Cambridge Baker Systems Genomics Initiative, Baker Heart and Diabetes Institute, Melbourne, VIC, Australia.
- Cambridge Baker Systems Genomics Initiative, Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK.
- Department of Clinical Pathology, University of Melbourne, Parkville, VIC, Australia.
| | - Rainer Malik
- Institute for Stroke and Dementia Research, University Hospital, Ludwig-Maximilians-Universität LMU, Munich, Germany
| | - Ekaterina Yonova-Doing
- British Heart Foundation Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK
| | - Agus Salim
- Baker Heart and Diabetes Institute, Melbourne, VIC, Australia
- Department of Mathematics and Statistics, La Trobe University, Melbourne, VIC, Australia
| | - Tingting Wang
- Cambridge Baker Systems Genomics Initiative, Baker Heart and Diabetes Institute, Melbourne, VIC, Australia
| | - John Danesh
- British Heart Foundation Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK
- British Heart Foundation Centre of Research Excellence, University of Cambridge, Cambridge, UK
- National Institute for Health Research Blood and Transplant Research Unit in Donor Health and Genomics, University of Cambridge, Cambridge, UK
- National Institute for Health Research Cambridge Biomedical Research Centre, University of Cambridge and Cambridge University Hospitals, Cambridge, UK
- Health Data Research UK Cambridge, Wellcome Genome Campus and University of Cambridge, Cambridge, UK
- Department of Human Genetics, Wellcome Sanger Institute, Hinxton, UK
| | - Adam S Butterworth
- British Heart Foundation Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK
- British Heart Foundation Centre of Research Excellence, University of Cambridge, Cambridge, UK
- National Institute for Health Research Blood and Transplant Research Unit in Donor Health and Genomics, University of Cambridge, Cambridge, UK
- National Institute for Health Research Cambridge Biomedical Research Centre, University of Cambridge and Cambridge University Hospitals, Cambridge, UK
- Health Data Research UK Cambridge, Wellcome Genome Campus and University of Cambridge, Cambridge, UK
- Department of Human Genetics, Wellcome Sanger Institute, Hinxton, UK
| | - Joanna M M Howson
- British Heart Foundation Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK
- National Institute for Health Research Cambridge Biomedical Research Centre, University of Cambridge and Cambridge University Hospitals, Cambridge, UK
| | - Michael Inouye
- Cambridge Baker Systems Genomics Initiative, Baker Heart and Diabetes Institute, Melbourne, VIC, Australia.
- Cambridge Baker Systems Genomics Initiative, Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK.
- Department of Clinical Pathology, University of Melbourne, Parkville, VIC, Australia.
- British Heart Foundation Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK.
- Health Data Research UK Cambridge, Wellcome Genome Campus and University of Cambridge, Cambridge, UK.
- The Alan Turing Institute, London, UK.
| | - Martin Dichgans
- Institute for Stroke and Dementia Research, University Hospital, Ludwig-Maximilians-Universität LMU, Munich, Germany.
- German Center for Neurodegenerative Diseases (DZNE), Munich, Germany.
- Munich Cluster for Systems Neurology (SyNergy), Munich, Germany.
| |
Collapse
|
83
|
Flügge F, Figge L, Duhm-Harbeck P, Kammler R, Habermann JK. How clinical biobanks can support precision medicine: from standardized preprocessing to treatment guidance. EXPERT REVIEW OF PRECISION MEDICINE AND DRUG DEVELOPMENT 2019. [DOI: 10.1080/23808993.2019.1690395] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Affiliation(s)
- Friedemann Flügge
- Interdisciplinary Center for Biobanking-Lübeck, University of Lübeck, Lübeck, Germany
| | - Lena Figge
- Interdisciplinary Center for Biobanking-Lübeck, University of Lübeck, Lübeck, Germany
| | | | - Rosita Kammler
- Translational Research Coordination for International Breast Cancer Study Group and European Thoracic Oncology Platform, Bern, Switzerland
- European, Middle Eastern and African Society for Biopreservation and Biobanking, Brussels, Belgium
| | - Jens K. Habermann
- Interdisciplinary Center for Biobanking-Lübeck, University of Lübeck, Lübeck, Germany
- European, Middle Eastern and African Society for Biopreservation and Biobanking, Brussels, Belgium
- Section for Translational Surgical Oncology and Biobanking, Department of Surgery, University of Lübeck and University Hospital Schleswig-Holstein (UKSH), Lübeck, Germany
| |
Collapse
|
84
|
Ni G, Amare AT, Zhou X, Mills N, Gratten J, Lee SH. The genetic relationship between female reproductive traits and six psychiatric disorders. Sci Rep 2019; 9:12041. [PMID: 31427629 PMCID: PMC6700195 DOI: 10.1038/s41598-019-48403-x] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2019] [Accepted: 07/31/2019] [Indexed: 12/31/2022] Open
Abstract
Female reproductive behaviours have important implications for evolutionary fitness and health of offspring. Here we used the second release of UK Biobank data (N = 220,685) to evaluate the association between five female reproductive traits and polygenic risk scores (PRS) projected from genome-wide association study summary statistics of six psychiatric disorders (N = 429,178). We found that the PRS of attention-deficit/hyperactivity disorder (ADHD) were strongly associated with age at first birth (AFB) (genetic correlation of -0.68 ± 0.03), age at first sexual intercourse (AFS) (-0.56 ± 0.03), number of live births (NLB) (0.36 ± 0.04) and age at menopause (-0.27 ± 0.04). There were also robustly significant associations between the PRS of eating disorder (ED) and AFB (0.35 ± 0.06), ED and AFS (0.19 ± 0.06), major depressive disorder (MDD) and AFB (-0.27 ± 0.07), MDD and AFS (-0.27 ± 0.03) and schizophrenia and AFS (-0.10 ± 0.03). These associations were mostly explained by pleiotropic effects and there was little evidence of causal relationships. Our findings can potentially help improve reproductive health in women, hence better child outcomes. Our findings also lend partial support to the evolutionary hypothesis that causal mutations underlying psychiatric disorders have positive effects on reproductive success.
Collapse
Affiliation(s)
- Guiyan Ni
- Institute for Molecular Bioscience, University of Queensland, Brisbane, Queensland, 4072, Australia
- Australian Centre for Precision Health, University of South Australia Cancer Research Institute, University of South Australia, Adelaide, SA, 5000, Australia
- School of Environmental and Rural Science, University of New England, Armidale, NSW, 2351, Australia
| | - Azmeraw T Amare
- South Australian Academic Health Science and Translation centre, South Australian Health and Medical Research Institute (SAHMRI), Adelaide, South Australia, Australia
- Discipline of Psychiatry, School of Medicine, University of Adelaide, Adelaide, SA, Australia
| | - Xuan Zhou
- Australian Centre for Precision Health, University of South Australia Cancer Research Institute, University of South Australia, Adelaide, SA, 5000, Australia
| | - Natalie Mills
- Discipline of Psychiatry, School of Medicine, University of Adelaide, Adelaide, SA, Australia
| | - Jacob Gratten
- Institute for Molecular Bioscience, University of Queensland, Brisbane, Queensland, 4072, Australia
- Mater Research Institute, University of Queensland, Brisbane, Queensland, 4072, Australia
| | - S Hong Lee
- Australian Centre for Precision Health, University of South Australia Cancer Research Institute, University of South Australia, Adelaide, SA, 5000, Australia.
| |
Collapse
|
85
|
Lambert SA, Abraham G, Inouye M. Towards clinical utility of polygenic risk scores. Hum Mol Genet 2019; 28:R133-R142. [DOI: 10.1093/hmg/ddz187] [Citation(s) in RCA: 249] [Impact Index Per Article: 41.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2019] [Revised: 07/11/2019] [Accepted: 07/24/2019] [Indexed: 02/06/2023] Open
Abstract
Abstract
Prediction of disease risk is an essential part of preventative medicine, often guiding clinical management. Risk prediction typically includes risk factors such as age, sex, family history of disease and lifestyle (e.g. smoking status); however, in recent years, there has been increasing interest to include genomic information into risk models. Polygenic risk scores (PRS) aggregate the effects of many genetic variants across the human genome into a single score and have recently been shown to have predictive value for multiple common diseases. In this review, we summarize the potential use cases for seven common diseases (breast cancer, prostate cancer, coronary artery disease, obesity, type 1 diabetes, type 2 diabetes and Alzheimer’s disease) where PRS has or could have clinical utility. PRS analysis for these diseases frequently revolved around (i) risk prediction performance of a PRS alone and in combination with other non-genetic risk factors, (ii) estimation of lifetime risk trajectories, (iii) the independent information of PRS and family history of disease or monogenic mutations and (iv) estimation of the value of adding a PRS to specific clinical risk prediction scenarios. We summarize open questions regarding PRS usability, ancestry bias and transferability, emphasizing the need for the next wave of studies to focus on the implementation and health-economic value of PRS testing. In conclusion, it is becoming clear that PRS have value in disease risk prediction and there are multiple areas where this may have clinical utility.
Collapse
Affiliation(s)
- Samuel A Lambert
- Cambridge Baker Systems Genomics Initiative, Department of Public Health and Primary Care, University of Cambridge, Cambridge CB1 8RN, UK
- Cambridge Baker Systems Genomics Initiative, Baker Heart and Diabetes Institute, Melbourne, VIC 3004, Australia
- MRC/BHF Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care, University of Cambridge, Cambridge CB1 8RN, UK
- Cambridge Substantive Site, Health Data Research UK, Wellcome Genome Campus, Hinxton, UK
| | - Gad Abraham
- Cambridge Baker Systems Genomics Initiative, Department of Public Health and Primary Care, University of Cambridge, Cambridge CB1 8RN, UK
- Cambridge Baker Systems Genomics Initiative, Baker Heart and Diabetes Institute, Melbourne, VIC 3004, Australia
- Department of Clinical Pathology, University of Melbourne, Parkville, VIC 3010, Australia
| | - Michael Inouye
- Cambridge Baker Systems Genomics Initiative, Department of Public Health and Primary Care, University of Cambridge, Cambridge CB1 8RN, UK
- Cambridge Baker Systems Genomics Initiative, Baker Heart and Diabetes Institute, Melbourne, VIC 3004, Australia
- MRC/BHF Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care, University of Cambridge, Cambridge CB1 8RN, UK
- Cambridge Substantive Site, Health Data Research UK, Wellcome Genome Campus, Hinxton, UK
- Department of Clinical Pathology, University of Melbourne, Parkville, VIC 3010, Australia
| |
Collapse
|
86
|
Psychiatric Polygenic Risk Scores as Predictor for Attention Deficit/Hyperactivity Disorder and Autism Spectrum Disorder in a Clinical Child and Adolescent Sample. Behav Genet 2019; 50:203-212. [PMID: 31346826 PMCID: PMC7355275 DOI: 10.1007/s10519-019-09965-8] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2019] [Accepted: 07/10/2019] [Indexed: 12/31/2022]
Abstract
Neurodevelopmental disorders such as attention deficit/hyperactivity disorder (ADHD) and autism spectrum disorder (ASD) are highly heritable and influenced by many single nucleotide polymorphisms (SNPs). SNPs can be used to calculate individual polygenic risk scores (PRS) for a disorder. We aim to explore the association between the PRS for ADHD, ASD and for Schizophrenia (SCZ), and ADHD and ASD diagnoses in a clinical child and adolescent population. Based on the most recent genome wide association studies of ADHD, ASD and SCZ, PRS of each disorder were calculated for individuals of a clinical child and adolescent target sample (N = 688) and for adult controls (N = 943). We tested with logistic regression analyses for an association with (1) a single diagnosis of ADHD (N = 280), (2) a single diagnosis of ASD (N = 295), and (3) combining the two diagnoses, thus subjects with either ASD, ADHD or both (N = 688). Our results showed a significant association of the ADHD PRS with ADHD status (OR 1.6, P = 1.39 × 10−07) and with the combined ADHD/ASD status (OR 1.36, P = 1.211 × 10−05), but not with ASD status (OR 1.14, P = 1). No associations for the ASD and SCZ PRS were observed. In sum, the PRS of ADHD is significantly associated with the combined ADHD/ASD status. Yet, this association is primarily driven by ADHD status, suggesting disorder specific genetic effects of the ADHD PRS.
Collapse
|
87
|
Martin AR, Daly MJ, Robinson EB, Hyman SE, Neale BM. Predicting Polygenic Risk of Psychiatric Disorders. Biol Psychiatry 2019; 86:97-109. [PMID: 30737014 PMCID: PMC6599546 DOI: 10.1016/j.biopsych.2018.12.015] [Citation(s) in RCA: 142] [Impact Index Per Article: 23.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/09/2018] [Revised: 11/18/2018] [Accepted: 12/08/2018] [Indexed: 12/27/2022]
Abstract
Genetics provides two major opportunities for understanding human disease-as a transformative line of etiological inquiry and as a biomarker for heritable diseases. In psychiatry, biomarkers are very much needed for both research and treatment, given the heterogenous populations identified by current phenomenologically based diagnostic systems. To date, however, useful and valid biomarkers have been scant owing to the inaccessibility and complexity of human brain tissue and consequent lack of insight into disease mechanisms. Genetic biomarkers are therefore especially promising for psychiatric disorders. Genome-wide association studies of common diseases have matured over the last decade, generating the knowledge base for increasingly informative individual-level genetic risk prediction. In this review, we discuss fundamental concepts involved in computing genetic risk with current methods, strengths and weaknesses of various approaches, assessments of utility, and applications to various psychiatric disorders and related traits. Although genetic risk prediction has become increasingly straightforward to apply and common in published studies, there are important pitfalls to avoid. At present, the clinical utility of genetic risk prediction is still low; however, there is significant promise for future clinical applications as the ancestral diversity and sample sizes of genome-wide association studies increase. We discuss emerging data and methods aimed at improving the value of genetic risk prediction for disentangling disease mechanisms and stratifying subjects for epidemiological and clinical studies. For all applications, it is absolutely critical that polygenic risk prediction is applied with appropriate methodology and control for confounding to avoid repeating some mistakes of the candidate gene era.
Collapse
Affiliation(s)
- Alicia R Martin
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts; Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, Massachusetts; Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, Massachusetts.
| | - Mark J Daly
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts; Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, Massachusetts; Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, Massachusetts
| | - Elise B Robinson
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts; Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, Massachusetts; Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, Massachusetts; Department of Epidemiology, Harvard T. H. Chan School of Public Health, Boston, Massachusetts
| | - Steven E Hyman
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, Massachusetts; Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, Massachusetts
| | - Benjamin M Neale
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts; Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, Massachusetts; Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, Massachusetts
| |
Collapse
|
88
|
Misganaw B, Guffanti G, Lori A, Abu-Amara D, Flory JD, Mueller S, Yehuda R, Jett M, Marmar CR, Ressler KJ, Doyle FJ. Polygenic risk associated with post-traumatic stress disorder onset and severity. Transl Psychiatry 2019; 9:165. [PMID: 31175274 PMCID: PMC6555815 DOI: 10.1038/s41398-019-0497-3] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/11/2019] [Accepted: 05/07/2019] [Indexed: 01/14/2023] Open
Abstract
Post-traumatic stress disorder (PTSD) is a psychiatric illness with a highly polygenic architecture without large effect-size common single-nucleotide polymorphisms (SNPs). Thus, to capture a substantial portion of the genetic contribution, effects from many variants need to be aggregated. We investigated various aspects of one such approach that has been successfully applied to many traits, polygenic risk score (PRS) for PTSD. Theoretical analyses indicate the potential prediction ability of PRS. We used the latest summary statistics from the largest published genome-wide association study (GWAS) conducted by Psychiatric Genomics Consortium for PTSD (PGC-PTSD). We found that the PRS constructed for a cohort comprising veterans of recent wars (n = 244) explains a considerable proportion of PTSD onset (Nagelkerke R2 = 4.68%, P = 0.003) and severity (R2 = 4.35%, P = 0.0008) variances. However, the performance on an African ancestry sub-cohort was minimal. A PRS constructed with schizophrenia GWAS also explained a significant fraction of PTSD diagnosis variance (Nagelkerke R2 = 2.96%, P = 0.0175), confirming previously reported genetic correlation between the two psychiatric ailments. Overall, these findings demonstrate the important role polygenic analyses of PTSD will play in risk prediction models as well as in elucidating the biology of the disorder.
Collapse
Affiliation(s)
- Burook Misganaw
- Harvard John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA, USA
| | - Guia Guffanti
- Department of Psychiatry, Harvard Medical School and McLean Hospital, Belmont, MA, USA
| | - Adriana Lori
- Department of Psychiatry and Behavioral Sciences, Emory University School of Medicine, Atlanta, GA, USA
| | - Duna Abu-Amara
- Steven and Alexandra Cohen Veterans Center for the Study of Posttraumatic Stress and Traumatic Brain Injury; and Department of Psychiatry, NYU School of Medicine, New York, NY, USA
| | - Janine D Flory
- Department of Psychiatry, James J. Peters Veterans Affairs Medical Center, Bronx, NY, USA
- The Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Susanne Mueller
- Center for Imaging of Neurodegenerative Diseases, San Francisco Veterans Affairs Medical Center, San Francisco, CA, USA
- Department of Radiology and Biomedical Imaging, University of California, San Francisco, San Francisco, CA, USA
| | - Rachel Yehuda
- Department of Psychiatry, James J. Peters Veterans Affairs Medical Center, Bronx, NY, USA
- The Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Marti Jett
- Integrative Systems Biology, United States Army Medical Research and Material Command, United States Army Center for Environmental Health Research, Frederick, MD, USA
| | - Charles R Marmar
- Steven and Alexandra Cohen Veterans Center for the Study of Posttraumatic Stress and Traumatic Brain Injury; and Department of Psychiatry, NYU School of Medicine, New York, NY, USA
| | - Kerry J Ressler
- Department of Psychiatry, Harvard Medical School and McLean Hospital, Belmont, MA, USA
| | - Francis J Doyle
- Harvard John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA, USA.
| |
Collapse
|
89
|
Allegrini AG, Selzam S, Rimfeld K, von Stumm S, Pingault JB, Plomin R. Genomic prediction of cognitive traits in childhood and adolescence. Mol Psychiatry 2019; 24:819-827. [PMID: 30971729 PMCID: PMC6986352 DOI: 10.1038/s41380-019-0394-4] [Citation(s) in RCA: 77] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/14/2018] [Revised: 02/12/2019] [Accepted: 02/14/2019] [Indexed: 12/21/2022]
Abstract
Recent advances in genomics are producing powerful DNA predictors of complex traits, especially cognitive abilities. Here, we leveraged summary statistics from the most recent genome-wide association studies of intelligence and educational attainment, with highly genetically correlated traits, to build prediction models of general cognitive ability and educational achievement. To this end, we compared the performances of multi-trait genomic and polygenic scoring methods. In a representative UK sample of 7,026 children at ages 12 and 16, we show that we can now predict up to 11% of the variance in intelligence and 16% in educational achievement. We also show that predictive power increases from age 12 to age 16 and that genomic predictions do not differ for girls and boys. We found that multi-trait genomic methods were effective in boosting predictive power. Prediction accuracy varied across polygenic score approaches, however results were similar for different multi-trait and polygenic score methods. We discuss general caveats of multi-trait methods and polygenic score prediction, and conclude that polygenic scores for educational attainment and intelligence are currently the most powerful predictors in the behavioural sciences.
Collapse
Affiliation(s)
- A G Allegrini
- Social, Genetic and Developmental Psychiatry Centre, King's College London, London, UK.
| | - S Selzam
- Social, Genetic and Developmental Psychiatry Centre, King's College London, London, UK
| | - K Rimfeld
- Social, Genetic and Developmental Psychiatry Centre, King's College London, London, UK
| | - S von Stumm
- Department of Education, University of York, Heslington, York, UK
| | - J B Pingault
- Clinical Educational and Health Psychology, Division of Psychology and Language Sciences, University College London, London, UK
| | - R Plomin
- Social, Genetic and Developmental Psychiatry Centre, King's College London, London, UK
| |
Collapse
|
90
|
Grotzinger AD, Rhemtulla M, de Vlaming R, Ritchie SJ, Mallard TT, Hill WD, Ip HF, Marioni RE, McIntosh AM, Deary IJ, Koellinger PD, Harden KP, Nivard MG, Tucker-Drob EM. Genomic structural equation modelling provides insights into the multivariate genetic architecture of complex traits. Nat Hum Behav 2019; 3:513-525. [PMID: 30962613 PMCID: PMC6520146 DOI: 10.1038/s41562-019-0566-x] [Citation(s) in RCA: 430] [Impact Index Per Article: 71.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2018] [Accepted: 02/22/2019] [Indexed: 12/18/2022]
Abstract
Genetic correlations estimated from genome-wide association studies (GWASs) reveal pervasive pleiotropy across a wide variety of phenotypes. We introduce genomic structural equation modelling (genomic SEM): a multivariate method for analysing the joint genetic architecture of complex traits. Genomic SEM synthesizes genetic correlations and single-nucleotide polymorphism heritabilities inferred from GWAS summary statistics of individual traits from samples with varying and unknown degrees of overlap. Genomic SEM can be used to model multivariate genetic associations among phenotypes, identify variants with effects on general dimensions of cross-trait liability, calculate more predictive polygenic scores and identify loci that cause divergence between traits. We demonstrate several applications of genomic SEM, including a joint analysis of summary statistics from five psychiatric traits. We identify 27 independent single-nucleotide polymorphisms not previously identified in the contributing univariate GWASs. Polygenic scores from genomic SEM consistently outperform those from univariate GWASs. Genomic SEM is flexible and open ended, and allows for continuous innovation in multivariate genetic analysis.
Collapse
Affiliation(s)
| | - Mijke Rhemtulla
- Department of Psychology, University of California, Davis, Davis, CA, USA
| | - Ronald de Vlaming
- Department of Economics, Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
- Erasmus University Rotterdam Institute for Behavior and Biology, Rotterdam, The Netherlands
| | - Stuart J Ritchie
- Centre for Cognitive Ageing and Cognitive Epidemiology, University of Edinburgh, Edinburgh, UK
- Department of Psychology, University of Edinburgh, Edinburgh, UK
| | - Travis T Mallard
- Department of Psychology, University of Texas at Austin, Austin, TX, USA
| | - W David Hill
- Centre for Cognitive Ageing and Cognitive Epidemiology, University of Edinburgh, Edinburgh, UK
- Department of Psychology, University of Edinburgh, Edinburgh, UK
| | - Hill F Ip
- Department of Biological Psychology, Vrije Universiteit University Amsterdam, Amsterdam, The Netherlands
| | - Riccardo E Marioni
- Centre for Cognitive Ageing and Cognitive Epidemiology, University of Edinburgh, Edinburgh, UK
- Centre for Genomic and Experimental Medicine, Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, UK
| | - Andrew M McIntosh
- Centre for Cognitive Ageing and Cognitive Epidemiology, University of Edinburgh, Edinburgh, UK
- Division of Psychiatry, University of Edinburgh, Edinburgh, UK
| | - Ian J Deary
- Centre for Cognitive Ageing and Cognitive Epidemiology, University of Edinburgh, Edinburgh, UK
- Department of Psychology, University of Edinburgh, Edinburgh, UK
| | - Philipp D Koellinger
- Department of Economics, Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
- Erasmus University Rotterdam Institute for Behavior and Biology, Rotterdam, The Netherlands
| | - K Paige Harden
- Department of Psychology, University of Texas at Austin, Austin, TX, USA
- Population Research Center, University of Texas at Austin, Austin, TX, USA
| | - Michel G Nivard
- Department of Biological Psychology, Vrije Universiteit University Amsterdam, Amsterdam, The Netherlands
| | - Elliot M Tucker-Drob
- Department of Psychology, University of Texas at Austin, Austin, TX, USA
- Population Research Center, University of Texas at Austin, Austin, TX, USA
| |
Collapse
|
91
|
Abstract
PURPOSE OF REVIEW With improved next-generation sequencing technology, open-access genetic databases and increased awareness of complex trait genetics, we are entering a new era of risk assessment in which genetic-based risk scores (GRSs) will play a clinical role. We review the concepts underlying polygenic models of disease susceptibility and challenges in clinical implementation. RECENT FINDINGS Polygenic risk scores are currently used in genetic research on dyslipidemias and cardiovascular disease (CVD). Although the underlying principles for constructing polygenic scores for lipids are established, the lack of consensus on which score to use is indicated by the large number - about 50 - that have been published. Recently, large-scale polygenic scores for CVD appear to afford superior risk prediction compared to small-scale scores. Despite the potential benefits of GRSs, certain biases towards ethnicity and sex need to be worked through. SUMMARY We are on the verge of clinical application of GRSs to provide incremental information on dyslipidemia and CVD risk above and beyond traditional clinical variables. Additional work is required to develop a consensus of how such scores will be constructed and measured in a validated manner, as well as clinical indications for their use.
Collapse
Affiliation(s)
- Jacqueline S Dron
- Department of Biochemistry, Schulich School of Medicine and Dentistry, Western University
- Robarts Research Institute, Schulich School of Medicine and Dentistry, Western University
| | - Robert A Hegele
- Department of Biochemistry, Schulich School of Medicine and Dentistry, Western University
- Robarts Research Institute, Schulich School of Medicine and Dentistry, Western University
- Department of Medicine, Schulich School of Medicine and Dentistry, Western University, London, Ontario, Canada
| |
Collapse
|
92
|
Wray NR, Kemper KE, Hayes BJ, Goddard ME, Visscher PM. Complex Trait Prediction from Genome Data: Contrasting EBV in Livestock to PRS in Humans: Genomic Prediction. Genetics 2019; 211:1131-1141. [PMID: 30967442 PMCID: PMC6456317 DOI: 10.1534/genetics.119.301859] [Citation(s) in RCA: 67] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2018] [Accepted: 01/20/2019] [Indexed: 12/20/2022] Open
Abstract
In this Review, we focus on the similarity of the concepts underlying prediction of estimated breeding values (EBVs) in livestock and polygenic risk scores (PRS) in humans. Our research spans both fields and so we recognize factors that are very obvious for those in one field, but less so for those in the other. Differences in family size between species is the wedge that drives the different viewpoints and approaches. Large family size achievable in nonhuman species accompanied by selection generates a smaller effective population size, increased linkage disequilibrium and a higher average genetic relationship between individuals within a population. In human genetic analyses, we select individuals unrelated in the classical sense (coefficient of relationship <0.05) to estimate heritability captured by common SNPs. In livestock data, all animals within a breed are to some extent "related," and so it is not possible to select unrelated individuals and retain a data set of sufficient size to analyze. These differences directly or indirectly impact the way data analyses are undertaken. In livestock, genetic segregation variance exposed through samplings of parental genomes within families is directly observable and taken for granted. In humans, this genomic variation is under-recognized for its contribution to variation in polygenic risk of common disease, in both those with and without family history of disease. We explore the equation that predicts the expected proportion of variance explained using PRS, and quantify how GWAS sample size is the key factor for maximizing accuracy of prediction in both humans and livestock. Last, we bring together the concepts discussed to address some frequently asked questions.
Collapse
Affiliation(s)
- Naomi R Wray
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland 4067, Australia
- Queensland Brain Institute, The University of Queensland, Brisbane, Queensland 4067, Australia
| | - Kathryn E Kemper
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland 4067, Australia
| | - Benjamin J Hayes
- Queensland Alliance for Agriculture and Food Innovation, Centre for Animal Science, The University of Queensland, St Lucia, Queensland 4072, Australia
| | - Michael E Goddard
- AgriBio, Centre for AgriBioscience, Department of Economic Development, Jobs, Transport and Resources, Bundoora, Victoria, Australia
- Faculty of Land and Food Resources, University of Melbourne, Parkville, Victoria, Australia
| | - Peter M Visscher
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland 4067, Australia
- Queensland Brain Institute, The University of Queensland, Brisbane, Queensland 4067, Australia
| |
Collapse
|
93
|
Ormel J, Hartman CA, Snieder H. The genetics of depression: successful genome-wide association studies introduce new challenges. Transl Psychiatry 2019; 9:114. [PMID: 30877272 PMCID: PMC6420566 DOI: 10.1038/s41398-019-0450-5] [Citation(s) in RCA: 67] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/29/2019] [Accepted: 02/13/2019] [Indexed: 12/15/2022] Open
Abstract
The recent successful genome-wide association studies (GWASs) for depression have yielded more than 80 replicated loci and brought back the excitement that had evaporated during the years of negative GWAS findings. The identified loci provide anchors to explore their relevance for depression, but this comes with new challenges. Using the watershed model of genotype-phenotype relationships as a conceptual aid and recent genetic findings on other complex phenotypes, we discuss why it took so long and identify seven future challenges. The biggest challenge involves the identification of causal mechanisms since GWAS associations merely flag genomic regions without a direct link to underlying biological function. Furthermore, the genetic association with the index phenotype may also be part of a more extensive causal pathway (e.g., from variant to comorbid condition) or be due to indirect influences via intermediate traits located in the causal pathways to the final outcome. This challenge is highly relevant for depression because even its narrow definition of major depressive disorder captures a heterogeneous set of phenotypes which are often measured by even more broadly defined operational definitions consisting of a few questions (minimal phenotyping). Here, Mendelian randomization and future discovery of additional genetic variants for depression and related phenotypes will be of great help. In addition, reduction of phenotypic heterogeneity may also be worthwhile. Other challenges include detecting rare variants, determining the genetic architecture of depression, closing the "heritability gap", and realizing the potential for personalized treatment. Along the way, we identify pertinent open questions that, when addressed, will advance the field.
Collapse
Affiliation(s)
- Johan Ormel
- Departments of Epidemiology and Psychiatry, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands.
| | - Catharina A Hartman
- Departments of Epidemiology and Psychiatry, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
| | - Harold Snieder
- Departments of Epidemiology and Psychiatry, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
| |
Collapse
|
94
|
Abstract
PURPOSE OF REVIEW Anxiety disorders are among the most common mental disorders with a lifetime prevalence of over 20%. Clinically, anxiety is not thought of as a homogenous disorder, but is subclassified in generalized, panic, and phobic anxiety disorder. Anxiety disorders are moderately heritable. This review will explore recent genetic and epigenetic approaches to anxiety disorders explaining differential susceptibility risk. RECENT FINDINGS A substantial portion of the variance in susceptibility risk can be explained by differential inherited and acquired genetic and epigenetic risk. Available data suggest that anxiety disorders are highly complex and polygenic. Despite the substantial progress in genetic research over the last decade, only few risk loci for anxiety disorders have been identified so far. This review will cover recent findings from large-scale genome-wide association studies as well as newer epigenome-wide studies. Progress in this area will likely require analysis of much larger sample sizes than have been reported to date. We discuss prospects for clinical translation of genetic findings and future directions for research.
Collapse
|
95
|
Chung W, Chen J, Turman C, Lindstrom S, Zhu Z, Loh PR, Kraft P, Liang L. Efficient cross-trait penalized regression increases prediction accuracy in large cohorts using secondary phenotypes. Nat Commun 2019; 10:569. [PMID: 30718517 PMCID: PMC6361917 DOI: 10.1038/s41467-019-08535-0] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2016] [Accepted: 01/17/2019] [Indexed: 01/15/2023] Open
Abstract
We introduce cross-trait penalized regression (CTPR), a powerful and practical approach for multi-trait polygenic risk prediction in large cohorts. Specifically, we propose a novel cross-trait penalty function with the Lasso and the minimax concave penalty (MCP) to incorporate the shared genetic effects across multiple traits for large-sample GWAS data. Our approach extracts information from the secondary traits that is beneficial for predicting the primary trait based on individual-level genotypes and/or summary statistics. Our novel implementation of a parallel computing algorithm makes it feasible to apply our method to biobank-scale GWAS data. We illustrate our method using large-scale GWAS data (~1M SNPs) from the UK Biobank (N = 456,837). We show that our multi-trait method outperforms the recently proposed multi-trait analysis of GWAS (MTAG) for predictive performance. The prediction accuracy for height by the aid of BMI improves from R2 = 35.8% (MTAG) to 42.5% (MCP + CTPR) or 42.8% (Lasso + CTPR) with UK Biobank data. Information of genetic architectures of complex traits can be leveraged for predicting phenotypes. Here, the authors develop CTPR (Cross-Trait Penalized Regression), a method for multi-trait polygenic risk prediction using individual-level genotypes and/or summary statistics from large cohorts.
Collapse
Affiliation(s)
- Wonil Chung
- Program in Genetic Epidemiology and Statistical Genetics, Harvard T.H. Chan School of Public Health, Boston, MA, 02115, USA.,Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, 02115, USA
| | - Jun Chen
- Division of Biomedical Statistics and Informatics and Center for Individualized Medicine, Mayo Clinic, Rochester, MN, 55905, USA
| | - Constance Turman
- Program in Genetic Epidemiology and Statistical Genetics, Harvard T.H. Chan School of Public Health, Boston, MA, 02115, USA.,Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, 02115, USA
| | - Sara Lindstrom
- Department of Epidemiology, University of Washington, Seattle, WA, 98195, USA
| | - Zhaozhong Zhu
- Program in Genetic Epidemiology and Statistical Genetics, Harvard T.H. Chan School of Public Health, Boston, MA, 02115, USA.,Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, 02115, USA.,Department of Environmental Health, Harvard T.H. Chan School of Public Health, Boston, MA, 02115, USA
| | - Po-Ru Loh
- Program in Genetic Epidemiology and Statistical Genetics, Harvard T.H. Chan School of Public Health, Boston, MA, 02115, USA.,Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, 02115, USA.,Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, 02142, USA
| | - Peter Kraft
- Program in Genetic Epidemiology and Statistical Genetics, Harvard T.H. Chan School of Public Health, Boston, MA, 02115, USA.,Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, 02115, USA.,Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, 02115, USA
| | - Liming Liang
- Program in Genetic Epidemiology and Statistical Genetics, Harvard T.H. Chan School of Public Health, Boston, MA, 02115, USA. .,Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, 02115, USA. .,Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, 02115, USA.
| |
Collapse
|
96
|
Liu HJ, Yan J. Crop genome-wide association study: a harvest of biological relevance. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2019; 97:8-18. [PMID: 30368955 DOI: 10.1111/tpj.14139] [Citation(s) in RCA: 119] [Impact Index Per Article: 19.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/23/2018] [Revised: 10/13/2018] [Accepted: 10/22/2018] [Indexed: 05/20/2023]
Abstract
With the advent of rapid genotyping and next-generation sequencing technologies, genome-wide association study (GWAS) has become a routine strategy for decoding genotype-phenotype associations in many species. More than 1000 such studies over the last decade have revealed substantial genotype-phenotype associations in crops and provided unparalleled opportunities to probe functional genomics. Beyond the many 'hits' obtained, this review summarizes recent efforts to increase our understanding of the genetic architecture of complex traits by focusing on non-main effects including epistasis, pleiotropy, and phenotypic plasticity. We also discuss how these achievements and the remaining gaps in our knowledge will guide future studies. Synthetic association is highlighted as leading to false causality, which is prevalent but largely underestimated. Furthermore, validation evidence is appealing for future GWAS, especially in the context of emerging genome-editing technologies.
Collapse
Affiliation(s)
- Hai-Jun Liu
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, China
| | - Jianbing Yan
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, China
| |
Collapse
|
97
|
New approaches in psychiatric drug development. Eur Neuropsychopharmacol 2018; 28:983-993. [PMID: 30056086 DOI: 10.1016/j.euroneuro.2018.06.006] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/17/2018] [Revised: 06/18/2018] [Accepted: 06/25/2018] [Indexed: 02/03/2023]
Abstract
Numerous novel neuroscience-based drug targets have been identified in recent years. However, it remains unclear how these targets relate to the expression of symptoms in central nervous system (CNS) disorders in general and psychiatric disorders in particular. To discuss this issue, a New Frontiers Meetings of European College of Neuropsychopharmacology (ECNP) was organized to address the challenges in translational neuroscience research that are impeding the effective development of new treatments. The main aim of this meeting was to discuss scientific insights, concepts and methodologies in order to improve drug development for psychiatric disorders. The meeting was designed to bring together stakeholders from academia, pharmaceutical industry, and regulatory agencies. Here we provide a synopsis of the proceedings from the meeting entitled 'New approaches to psychiatric drug development'. New views on psychiatric drug development were presented to address the challenges and pitfalls as identified by the different stakeholders. The general conclusion of the meeting was that drug discovery could be stimulated by designing new classification and sensitive assessment tools for psychiatric disorders, which bear closer relationships to neuropharmacological and neuroscientific developments. This is in line with the vision of precision psychiatry in which patients are clustered, not merely on symptoms, but primarily on biological phenotypes that represent pathophysiological relevant and 'drugable' processes. To achieve these goals, a closer collaboration between all stakeholders in early stages of development is essential to define the research criteria together and to reach consensus on new quantitative biological methodologies and etiology-directed treatments.
Collapse
|
98
|
Kim H, Kim SY, Joly Y. South Korea: in the midst of a privacy reform centered on data sharing. Hum Genet 2018; 137:627-635. [PMID: 30121900 PMCID: PMC6132641 DOI: 10.1007/s00439-018-1920-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2018] [Accepted: 07/28/2018] [Indexed: 12/23/2022]
Abstract
With rapid developments in genomic and digital technologies, genomic data sharing has become a key issue for the achievement of precision medicine in South Korea. The legal and administrative framework for data sharing and protection in this country is currently under intense scrutiny from national and international stakeholders. Policymakers are assessing the relevance of specific restrictions in national laws and guidelines for better alignment with international approaches. This manuscript will consider key issues in international genome data sharing in South Korea, including consent, privacy, security measures, compatible adequacy and oversight, and map out an approach to genomic data sharing that recognizes the importance of patient engagement and responsible use of data in South Korea.
Collapse
Affiliation(s)
- Hannah Kim
- Asian Institute for Bioethics and Health Law, Yonsei University, Seoul, South Korea
| | - So Yoon Kim
- Asian Institute for Bioethics and Health Law, Yonsei University, Seoul, South Korea
| | - Yann Joly
- Centre of Genomics and Policy, McGill University, Montreal, Canada.
| |
Collapse
|
99
|
Wu S, Li JS, Mai J, Chang MW. Three-Dimensional Electrohydrodynamic Printing and Spinning of Flexible Composite Structures for Oral Multidrug Forms. ACS APPLIED MATERIALS & INTERFACES 2018; 10:24876-24885. [PMID: 29953813 DOI: 10.1021/acsami.8b08880] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
A simple method to rapidly customize and to also mass produce oral dosage forms is arguably a current bottleneck in the development of modern personalized medicine. Specifically, delayed-release mechanisms with well-controlled dosage profiles for combinations of traditional Chinese herbal extracts and Western medications are not well established. Herein, we demonstrate a novel multidrug-loaded membrane sandwich with structures infused with ibuprofen (IBU) and Ganoderma lucidum polysaccharide (GLP) using three-dimensional electrohydrodynamic printing and electrospinning techniques. The resulting flexible membrane consists of microscaled, multilayered cellulose acetate (CA) membranes loaded with IBU in the shape of either concentric squares or circles, as the top and bottom layers of a sandwich structure. In between the CA-IBU layers are randomly electrospun polyvinyl pyrrolidone (PVP) layers loaded with GLP. The complete fibrous membrane sandwich can be folded and embedded into a 0-size capsule to achieve oral compliance. Simulated in vitro testing of gastric and intestinal fluids demonstrated a triphasic release profile. There was an immediate release of GLP after gastric juices dissolved the capsule shell and the PVP, followed by the short-term release of 60% of the IBU within an hour afterward, and the remaining IBU was released in a sustained manner following a Fickian diffusion profile. In summary, this multidrug (both hydrophilic and/or hydrophobic) oral system with precision-designed structures should enable personalized therapeutic dosing.
Collapse
Affiliation(s)
| | | | - John Mai
- Alfred E. Mann Institute for Biomedical Engineering at the University of Southern California , Los Angeles 90007 , California , United States
| | | |
Collapse
|
100
|
Genomic Prediction Using Individual-Level Data and Summary Statistics from Multiple Populations. Genetics 2018; 210:53-69. [PMID: 30021793 DOI: 10.1534/genetics.118.301109] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2018] [Accepted: 07/16/2018] [Indexed: 01/27/2023] Open
Abstract
This study presents a method for genomic prediction that uses individual-level data and summary statistics from multiple populations. Genome-wide markers are nowadays widely used to predict complex traits, and genomic prediction using multi-population data are an appealing approach to achieve higher prediction accuracies. However, sharing of individual-level data across populations is not always possible. We present a method that enables integration of summary statistics from separate analyses with the available individual-level data. The data can either consist of individuals with single or multiple (weighted) phenotype records per individual. We developed a method based on a hypothetical joint analysis model and absorption of population-specific information. We show that population-specific information is fully captured by estimated allele substitution effects and the accuracy of those estimates, i.e., the summary statistics. The method gives identical result as the joint analysis of all individual-level data when complete summary statistics are available. We provide a series of easy-to-use approximations that can be used when complete summary statistics are not available or impractical to share. Simulations show that approximations enable integration of different sources of information across a wide range of settings, yielding accurate predictions. The method can be readily extended to multiple-traits. In summary, the developed method enables integration of genome-wide data in the individual-level or summary statistics from multiple populations to obtain more accurate estimates of allele substitution effects and genomic predictions.
Collapse
|