1
|
Jayasinghe D, Eshetie S, Beckmann K, Benyamin B, Lee SH. Advancements and limitations in polygenic risk score methods for genomic prediction: a scoping review. Hum Genet 2024:10.1007/s00439-024-02716-8. [PMID: 39542907 DOI: 10.1007/s00439-024-02716-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2024] [Accepted: 10/31/2024] [Indexed: 11/17/2024]
Abstract
This scoping review aims to identify and evaluate the landscape of Polygenic Risk Score (PRS)-based methods for genomic prediction from 2013 to 2023, highlighting their advancements, key concepts, and existing gaps in knowledge, research, and technology. Over the past decade, various PRS-based methods have emerged, each employing different statistical frameworks aimed at enhancing prediction accuracy, processing speed and memory efficiency. Despite notable advancements, challenges persist, including unrealistic assumptions regarding sample sizes and the polygenicity of traits necessary for accurate predictions, as well as limitations in exploring hyper-parameter spaces and considering environmental interactions. We included studies focusing on PRS-based methods for risk prediction that underwent methodological evaluations using valid approaches and released computational tools/software. Additionally, we restricted our selection to studies involving human participants that were published in English language. This review followed the standard protocol recommended by Joanna Briggs Institute Reviewer's Manual, systematically searching Ovid MEDLINE, Ovid Embase, Scopus and Web of Science databases. Additionally, searches included grey literature sources like pre-print servers such as bioRxiv, and articles recommended by experts to ensure comprehensive and diverse coverage of relevant records. This study identified 34 studies detailing 37 genomic prediction methods, the majority of which rely on linkage disequilibrium (LD) information and necessitate hyper-parameter tuning. Nine methods integrate functional/gene annotation, while 12 are suitable for cross-ancestry genomic prediction, with only one considering gene-environment (GxE) interaction. While some methods require individual-level data, most leverage summary statistics, offering flexibility. Despite progress, challenges remain. These include computational complexity and the need for large sample sizes for high prediction accuracy. Furthermore, recent methods exhibit varying effectiveness across traits, with absolute accuracies often falling short of clinical utility. Transferability across ancestries varies, influenced by trait heritability and diversity of training data, while handling admixed populations remains challenging. Additionally, the absence of standard error measurements for individual PRSs, crucial in clinical settings, underscores a critical gap. Another issue is the lack of customizable graphical visualization tools among current software packages. While genomic prediction methods have advanced significantly, there is still room for improvement. Addressing current challenges and embracing future research directions will lead to the development of more universally applicable, robust, and clinically relevant genomic prediction tools.
Collapse
Affiliation(s)
- Dovini Jayasinghe
- Australian Centre for Precision Health, University of South Australia, Adelaide, SA, 5000, Australia.
- UniSA Allied Health and Human Performance, University of South Australia, Adelaide, SA, 5000, Australia.
- South Australian Health and Medical Research Institute (SAHMRI), University of South Australia, Adelaide, SA, 5000, Australia.
| | - Setegn Eshetie
- Australian Centre for Precision Health, University of South Australia, Adelaide, SA, 5000, Australia
- UniSA Allied Health and Human Performance, University of South Australia, Adelaide, SA, 5000, Australia
- South Australian Health and Medical Research Institute (SAHMRI), University of South Australia, Adelaide, SA, 5000, Australia
- College of Medicine and Health Sciences, University of Gondar, Gondar, Ethiopia
| | - Kerri Beckmann
- UniSA Allied Health and Human Performance, University of South Australia, Adelaide, SA, 5000, Australia
| | - Beben Benyamin
- Australian Centre for Precision Health, University of South Australia, Adelaide, SA, 5000, Australia
- UniSA Allied Health and Human Performance, University of South Australia, Adelaide, SA, 5000, Australia
- South Australian Health and Medical Research Institute (SAHMRI), University of South Australia, Adelaide, SA, 5000, Australia
| | - S Hong Lee
- Australian Centre for Precision Health, University of South Australia, Adelaide, SA, 5000, Australia
- UniSA Allied Health and Human Performance, University of South Australia, Adelaide, SA, 5000, Australia
- South Australian Health and Medical Research Institute (SAHMRI), University of South Australia, Adelaide, SA, 5000, Australia
| |
Collapse
|
2
|
Patel Y, Shin J, Sliz E, Tang A, Mishra A, Xia R, Hofer E, Rajula HSR, Wang R, Beyer F, Horn K, Riedl M, Yu J, Völzke H, Bülow R, Völker U, Frenzel S, Wittfeld K, Van der Auwera S, Mosley TH, Bouteloup V, Lambert JC, Chêne G, Dufouil C, Tzourio C, Mangin JF, Gottesman RF, Fornage M, Schmidt R, Yang Q, Witte V, Scholz M, Loeffler M, Roshchupkin GV, Ikram MA, Grabe HJ, Seshadri S, Debette S, Paus T, Pausova Z. Genetic risk factors underlying white matter hyperintensities and cortical atrophy. Nat Commun 2024; 15:9517. [PMID: 39496600 PMCID: PMC11535513 DOI: 10.1038/s41467-024-53689-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2024] [Accepted: 10/18/2024] [Indexed: 11/06/2024] Open
Abstract
White matter hyperintensities index structural abnormalities in the cerebral white matter, including axonal damage. The latter may promote atrophy of the cerebral cortex, a key feature of dementia. Here, we report a study of 51,065 individuals from 10 cohorts demonstrating that higher white matter hyperintensity volume associates with lower cortical thickness. The meta-GWAS of white matter hyperintensities-associated cortical 'atrophy' identifies 20 genome-wide significant loci, and enrichment in genes specific to vascular cell types, astrocytes, and oligodendrocytes. White matter hyperintensities-associated cortical 'atrophy' showed positive genetic correlations with vascular-risk traits and plasma biomarkers of neurodegeneration, and negative genetic correlations with cognitive functioning. 15 of the 20 loci regulated the expression of 54 genes in the cerebral cortex that, together with their co-expressed genes, were enriched in biological processes of axonal cytoskeleton and intracellular transport. The white matter hyperintensities-cortical thickness associations were most pronounced in cortical regions with higher expression of genes specific to excitatory neurons with long-range axons traversing through the white matter. The meta-GWAS-based polygenic risk score predicts vascular and all-cause dementia in an independent sample of 500,348 individuals. Thus, the genetics of white matter hyperintensities-related cortical atrophy involves vascular and neuronal processes and increases dementia risk.
Collapse
Affiliation(s)
- Yash Patel
- The Hospital for Sick Children, Toronto, Ontario, Canada
- Departments of Physiology and Nutritional Sciences, University of Toronto, Toronto, Ontario, Canada
| | - Jean Shin
- The Hospital for Sick Children, Toronto, Ontario, Canada
- Departments of Physiology and Nutritional Sciences, University of Toronto, Toronto, Ontario, Canada
| | - Eeva Sliz
- Research Unit of Population Health, Faculty of Medicine, University of Oulu, Oulu, Finland
| | - Ariana Tang
- The Hospital for Sick Children, Toronto, Ontario, Canada
- Departments of Physiology and Nutritional Sciences, University of Toronto, Toronto, Ontario, Canada
| | - Aniket Mishra
- University of Bordeaux, INSERM, Bordeaux Population Health research center, UMR1219, Bordeaux, France
| | - Rui Xia
- The Brown Foundation Institute of Molecular Medicine, McGovern Medical School, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Edith Hofer
- Institut für Medizinische Informatik, Statistik und Dokumentation, Graz, Austria
- Division of Neurogeriatrics, Department of Neurology, Medical University of Graz, Graz, Austria
| | - Hema Sekhar Reddy Rajula
- University of Bordeaux, INSERM, Bordeaux Population Health research center, UMR1219, Bordeaux, France
| | - Ruiqi Wang
- Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA
| | - Frauke Beyer
- University of Bordeaux, INSERM, Bordeaux Population Health research center, UMR1219, Bordeaux, France
- Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany
| | - Katrin Horn
- Institute for Medical Informatics, Statistics and Epidemiology; Leipzig University, Leipzig, Germany
| | - Max Riedl
- Institute for Medical Informatics, Statistics and Epidemiology; Leipzig University, Leipzig, Germany
| | - Jing Yu
- Department of Epidemiology, Erasmus MC University Medical Center Rotterdam, Rotterdam, The Netherlands
- Department of Radiology and Nuclear Medicine, Erasmus MC University Medical Center Rotterdam, Rotterdam, The Netherlands
| | - Henry Völzke
- Institute for Community Medicine, University Medicine Greifswald, Greifswald, Germany
| | - Robin Bülow
- Institute of Diagnostic Radiology and Neuroradiology, University Medicine Greifswald, Greifswald, Germany
| | - Uwe Völker
- Interfaculty Institute of Genetics and Functional Genomics, University Medicine Greifswald, Greifswald, Germany
| | - Stefan Frenzel
- Department of Psychiatry and Psychotherapy, University Medicine Greifswald, Greifswald, Germany
| | - Katharina Wittfeld
- Department of Psychiatry and Psychotherapy, University Medicine Greifswald, Greifswald, Germany
| | - Sandra Van der Auwera
- Department of Psychiatry and Psychotherapy, University Medicine Greifswald, Greifswald, Germany
- German Centre for Neurodegenerative Diseases (DZNE), Site Rostock/Greifswald, Greifswald, Germany
| | - Thomas H Mosley
- The MIND Center, The University of Mississippi Medical Center, Jackson, MS, USA
| | - Vincent Bouteloup
- University of Bordeaux, INSERM, Bordeaux Population Health research center, UMR1219, Bordeaux, France
- CHU Bordeaux, CIC 1401 EC, Pôle Santé Publique, Bordeaux, France
| | - Jean-Charles Lambert
- U1167-RID-AGE facteurs de risque et déterminants moléculaires des maladies liées au vieillissement, INSERM, CHU Lille, Institut Pasteur de Lille, University of Lille, Lille, France
| | - Geneviève Chêne
- University of Bordeaux, INSERM, Bordeaux Population Health research center, UMR1219, Bordeaux, France
- Department of Public Health, CHU de Bordeaux, Bordeaux, France
| | - Carole Dufouil
- University of Bordeaux, INSERM, Bordeaux Population Health research center, UMR1219, Bordeaux, France
| | - Christophe Tzourio
- University of Bordeaux, INSERM, Bordeaux Population Health research center, UMR1219, Bordeaux, France
- Department of Public Health, CHU de Bordeaux, Bordeaux, France
| | | | - Rebecca F Gottesman
- National Institute of Neurological Disorders and Stroke Intramural Research Program, Bethesda, Maryland, USA
| | - Myriam Fornage
- The Brown Foundation Institute of Molecular Medicine, McGovern Medical School, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Reinhold Schmidt
- Division of Neurogeriatrics, Department of Neurology, Medical University of Graz, Graz, Austria
| | - Qiong Yang
- Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA
| | - Veronica Witte
- Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany
| | - Markus Scholz
- Institute for Medical Informatics, Statistics and Epidemiology; Leipzig University, Leipzig, Germany
| | - Markus Loeffler
- Institute for Medical Informatics, Statistics and Epidemiology; Leipzig University, Leipzig, Germany
- Leipzig Research Centre for Civilization Diseases; Leipzig University, Leipzig, Germany
| | - Gennady V Roshchupkin
- Department of Epidemiology, Erasmus MC University Medical Center Rotterdam, Rotterdam, The Netherlands
- Department of Radiology and Nuclear Medicine, Erasmus MC University Medical Center Rotterdam, Rotterdam, The Netherlands
| | - M Arfan Ikram
- Department of Epidemiology, Erasmus MC University Medical Center Rotterdam, Rotterdam, The Netherlands
| | - Hans J Grabe
- Interfaculty Institute of Genetics and Functional Genomics, University Medicine Greifswald, Greifswald, Germany
- Department of Psychiatry and Psychotherapy, University Medicine Greifswald, Greifswald, Germany
| | | | - Stephanie Debette
- University of Bordeaux, INSERM, Bordeaux Population Health research center, UMR1219, Bordeaux, France
- Bordeaux University Hospital, Department of Neurology, Institute for Neurodegenerative Diseases, Bordeaux, France
| | - Tomas Paus
- Centre hospitalier universitaire Sainte-Justine, University of Montreal, Montreal, Canada.
- Departments of Psychiatry and Neuroscience, Faculty of Medicine, University of Montreal, Montreal, Canada.
- Department of Psychiatry, McGill University, Montreal, Canada.
- ECOGENE-21, Chicoutimi, Canada.
| | - Zdenka Pausova
- The Hospital for Sick Children, Toronto, Ontario, Canada.
- Departments of Physiology and Nutritional Sciences, University of Toronto, Toronto, Ontario, Canada.
- Centre hospitalier universitaire Sainte-Justine, University of Montreal, Montreal, Canada.
- ECOGENE-21, Chicoutimi, Canada.
- Department of Pediatrics, Faculty of Medicine, University of Montreal, Montreal, Canada.
| |
Collapse
|
3
|
Gunn S, Wang X, Posner DC, Cho K, Huffman JE, Gaziano M, Wilson PW, Sun YV, Peloso G, Lunetta KL. Comparison of methods for building polygenic scores for diverse populations. HGG ADVANCES 2024; 6:100355. [PMID: 39323095 PMCID: PMC11532986 DOI: 10.1016/j.xhgg.2024.100355] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2024] [Revised: 09/22/2024] [Accepted: 09/22/2024] [Indexed: 09/27/2024] Open
Abstract
Polygenic scores (PGSs) are a promising tool for estimating individual-level genetic risk of disease based on the results of genome-wide association studies (GWASs). However, their promise has yet to be fully realized because most currently available PGSs were built with genetic data from predominantly European-ancestry populations, and PGS performance declines when scores are applied to target populations different from the populations from which they were derived. Thus, there is a great need to improve PGS performance in currently under-studied populations. In this work we leverage data from two large and diverse cohorts the Million Veterans Program (MVP) and All of Us (AoU), providing us the unique opportunity to compare methods for building PGSs for multi-ancestry populations across multiple traits. We build PGSs for five continuous traits and five binary traits using both multi-ancestry and single-ancestry approaches with popular Bayesian PGS methods and both MVP META GWAS results and population-specific GWAS results from the respective African, European, and Hispanic MVP populations. We evaluate these scores in three AoU populations genetically similar to the respective African, Admixed American, and European 1000 Genomes Project superpopulations. Using correlation-based tests, we make formal comparisons of the PGS performance across the multiple AoU populations. We conclude that approaches that combine GWAS data from multiple populations produce PGSs that perform better than approaches that utilize smaller single-population GWAS results matched to the target population, and specifically that multi-ancestry scores built with PRS-CSx outperform the other approaches in the three AoU populations.
Collapse
Affiliation(s)
- Sophia Gunn
- Biostatistics, Boston University School of Public Health, Boston, MA, USA; VA Boston Healthcare System, Boston, MA, USA.
| | - Xin Wang
- Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, USA; Cardiovascular Disease Initiative, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Daniel C Posner
- Massachusetts Veterans Epidemiology Research and Information Center (MAVERIC) , Boston, MA, USA
| | - Kelly Cho
- Department of Medicine, Harvard Medical School, Boston, MA, USA; MVP Boston Coordinating Center, VA Boston Healthcare System, Boston, MA, USA; Department of Medicine, Division of Aging, Brigham and Women's Hospital, Boston, MA 02115, USA
| | - Jennifer E Huffman
- Massachusetts Veterans Epidemiology Research and Information Center (MAVERIC) , Boston, MA, USA; Department of Medicine, Harvard Medical School, Boston, MA, USA; Palo Alto Veterans Institute for Research (PAVIR), Palo Alto Health Care System, Palo Alto, CA, USA
| | - Michael Gaziano
- Department of Medicine, Harvard Medical School, Boston, MA, USA; MVP Boston Coordinating Center, VA Boston Healthcare System, Boston, MA, USA; Department of Medicine, Division of Aging, Brigham and Women's Hospital, Boston, MA 02115, USA
| | - Peter W Wilson
- VA Atlanta Healthcare System, Decatur, GA, USA; Division of Cardiology, Department of Medicine, Emory University School of Medicine, Atlanta, GA, USA; Department of Epidemiology, Rollins School of Public Health, Emory University, Atlanta, GA, USA
| | - Yan V Sun
- VA Atlanta Healthcare System, Decatur, GA, USA; Department of Epidemiology, Rollins School of Public Health, Emory University, Atlanta, GA, USA
| | - Gina Peloso
- Biostatistics, Boston University School of Public Health, Boston, MA, USA; VA Boston Healthcare System, Boston, MA, USA
| | - Kathryn L Lunetta
- Biostatistics, Boston University School of Public Health, Boston, MA, USA
| |
Collapse
|
4
|
Ellis CA, Oliver KL, Harris RV, Ottman R, Scheffer IE, Mefford HC, Epstein MP, Berkovic SF, Bahlo M. Inflation of polygenic risk scores caused by sample overlap and relatedness: Examples of a major risk of bias. Am J Hum Genet 2024; 111:1805-1809. [PMID: 39168121 PMCID: PMC11393675 DOI: 10.1016/j.ajhg.2024.07.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2024] [Revised: 07/18/2024] [Accepted: 07/19/2024] [Indexed: 08/23/2024] Open
Abstract
Polygenic risk scores (PRSs) are an important tool for understanding the role of common genetic variants in human disease. Standard best practices recommend that PRSs be analyzed in cohorts that are independent of the genome-wide association study (GWAS) used to derive the scores without sample overlap or relatedness between the two cohorts. However, identifying sample overlap and relatedness can be challenging in an era of GWASs performed by large biobanks and international research consortia. Although most genomics researchers are aware of best practices and theoretical concerns about sample overlap and relatedness between GWAS and PRS cohorts, the prevailing assumption is that the risk of bias is small for very large GWASs. Here, we present two real-world examples demonstrating that sample overlap and relatedness is not a minor or theoretical concern but an important potential source of bias in PRS studies. Using a recently developed statistical adjustment tool, we found that excluding overlapping and related samples was equal to or more powerful than adjusting for overlap bias. Our goal is to make genomics researchers aware of the magnitude of risk of bias from sample overlap and relatedness and to highlight the need for mitigation tools, including independent validation cohorts in PRS studies, continued development of statistical adjustment methods, and tools for researchers to test their cohorts for overlap and relatedness with GWAS cohorts without sharing individual-level data.
Collapse
Affiliation(s)
- Colin A Ellis
- Department of Neurology, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA
| | - Karen L Oliver
- Epilepsy Research Centre, Department of Medicine, University of Melbourne, Austin Health, Heidelberg, VIC 3084, Australia; Population Health and Immunity Division, the Walter and Eliza Hall Institute of Medical Research, Parkville, VIC 3052, Australia; Department of Medical Biology, University of Melbourne, Melbourne, VIC 3052, Australia
| | - Rebekah V Harris
- Epilepsy Research Centre, Department of Medicine, University of Melbourne, Austin Health, Heidelberg, VIC 3084, Australia
| | - Ruth Ottman
- Departments of Neurology and Epidemiology, and the Gertrude H. Sergievsky Center, Columbia University Irving Medical Center, New York, NY 10032, USA; Division of Translational Epidemiology and Mental Health Equity, New York State Psychiatric Institute, New York, NY 10032, USA
| | - Ingrid E Scheffer
- Epilepsy Research Centre, Department of Medicine, University of Melbourne, Austin Health, Heidelberg, VIC 3084, Australia; Department of Paediatrics, Royal Children's Hospital, University of Melbourne, Parkville, VIC 3052, Australia; The Florey Institute and Murdoch Children's Research Institute, Parkville, VIC 3052, Australia
| | - Heather C Mefford
- Center for Pediatric Neurological Disease Research, St. Jude Children's Research Hospital, Memphis, TN 38105, USA
| | - Michael P Epstein
- Department of Human Genetics, Emory University School of Medicine, Atlanta, GA 30307, USA
| | - Samuel F Berkovic
- Epilepsy Research Centre, Department of Medicine, University of Melbourne, Austin Health, Heidelberg, VIC 3084, Australia
| | - Melanie Bahlo
- Population Health and Immunity Division, the Walter and Eliza Hall Institute of Medical Research, Parkville, VIC 3052, Australia; Department of Medical Biology, University of Melbourne, Melbourne, VIC 3052, Australia.
| |
Collapse
|
5
|
Jo J, Ha N, Ji Y, Do A, Seo JH, Oh B, Choi S, Choe EK, Lee W, Son JW, Won S. Genetic determinants of obesity in Korean populations: exploring genome-wide associations and polygenic risk scores. Brief Bioinform 2024; 25:bbae389. [PMID: 39207728 PMCID: PMC11359806 DOI: 10.1093/bib/bbae389] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2024] [Revised: 06/24/2024] [Indexed: 09/04/2024] Open
Abstract
East Asian populations exhibit a genetic predisposition to obesity, yet comprehensive research on these traits is limited. We conducted a genome-wide association study (GWAS) with 93,673 Korean subjects to uncover novel genetic loci linked to obesity, examining metrics such as body mass index, waist circumference, body fat ratio, and abdominal fat ratio. Participants were categorized into non-obese, metabolically healthy obese (MHO), and metabolically unhealthy obese (MUO) groups. Using advanced computational methods, we developed a multifaceted polygenic risk scores (PRS) model to predict obesity. Our GWAS identified significant genetic effects with distinct sizes and directions within the MHO and MUO groups compared with the non-obese group. Gene-based and gene-set analyses, along with cluster analysis, revealed heterogeneous patterns of significant genes on chromosomes 3 (MUO group) and 11 (MHO group). In analyses targeting genetic predisposition differences based on metabolic health, odds ratios of high PRS compared with medium PRS showed significant differences between non-obese and MUO, and non-obese and MHO. Similar patterns were seen for low PRS compared with medium PRS. These findings were supported by the estimated genetic correlation (0.89 from bivariate GREML). Regional analyses highlighted significant local genetic correlations on chromosome 11, while single variant approaches suggested widespread pleiotropic effects, especially on chromosome 11. In conclusion, our study identifies specific genetic loci and risks associated with obesity in the Korean population, emphasizing the heterogeneous genetic factors contributing to MHO and MUO.
Collapse
Affiliation(s)
- Jinyeon Jo
- Department of Public Health Sciences, Graduate school of Public Health, Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul, 08826, South Korea
| | - Nayoung Ha
- Department of Public Health Sciences, Graduate school of Public Health, Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul, 08826, South Korea
| | - Yunmi Ji
- Interdisciplinary Program in Bioinformatics, College of Natural Sciences, Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul, 08826, South Korea
| | - Ahra Do
- Interdisciplinary Program in Bioinformatics, College of Natural Sciences, Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul, 08826, South Korea
| | - Je Hyun Seo
- Veterans Health Service Medical Center, Veterans Medical Research Institute, 53, Jinhwangdo-ro 61-gil, Gangdong-gu, Seoul, 05368, South Korea
| | - Bumjo Oh
- Department of Family Medicine, Seoul Metropolitan Government Seoul National University Boramae Medical Center, 20, Boramae-ro 5-gil, Dongjak-gu, Seoul, 07061, South Korea
| | - Sungkyoung Choi
- Department of Applied Mathematics, Hanyang University (ERICA), 55, Hanyang-deahak-ro, Sangnok-gu, Ansan, Gyeonggi-do, 15588, South Korea
| | - Eun Kyung Choe
- Division of Colorectal Surgery, Department of Surgery, Seoul National University College of Medicine, 103 Daehak-ro, Jongno-gu, Seoul, 03080, South Korea
- Healthcare Research Institute, Seoul National University Hospital Healthcare System Gangnam Center, 39FL, 152, Teheran-ro, Gangnam-gu, Seoul, 06236, South Korea
| | - Woojoo Lee
- Department of Public Health Sciences, Graduate school of Public Health, Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul, 08826, South Korea
- Institute of Health and Environment, Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul, 08826, South Korea
| | - Jang Won Son
- Division of Endocrinology, Department of Internal Medicine, Bucheon St. Mary's hospital, The Catholic University of Korea, 327, Sosa-ro, Bucheon-si, Gyeonggi-do, Bucheon, 14647, South Korea
| | - Sungho Won
- Department of Public Health Sciences, Graduate school of Public Health, Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul, 08826, South Korea
- Interdisciplinary Program in Bioinformatics, College of Natural Sciences, Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul, 08826, South Korea
- Institute of Health and Environment, Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul, 08826, South Korea
- RexSoft Corps, Seoul National University Administration Building, 1 Gwanak-ro, Gwanak-gu, Seoul, 08826, South Korea
| |
Collapse
|
6
|
Leonenko G, Bauermeister S, Ghanti D, Stevenson‐Hoare J, Simmonds E, Brookes K, Morgan K, Chaturvedi N, Elliott P, Thomas A, Wareham N, Gallacher J, Escott‐Price V. Dementias Platform UK: Bringing genetics into life. Alzheimers Dement 2024; 20:3281-3289. [PMID: 38506636 PMCID: PMC11095482 DOI: 10.1002/alz.13782] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2023] [Revised: 12/21/2023] [Accepted: 01/29/2024] [Indexed: 03/21/2024]
Abstract
INTRODUCTION The Dementias Platform UK (DPUK) Data Portal is a data repository bringing together a wide range of cohorts. Neurodegenerative dementias are a group of diseases with highly heterogeneous pathology and an overlapping genetic component that is poorly understood. The DPUK collection of independent cohorts can facilitate research in neurodegeneration by combining their genetic and phenotypic data. METHODS For genetic data processing, pipelines were generated to perform quality control analysis, genetic imputation, and polygenic risk score (PRS) derivation with six genome-wide association studies of neurodegenerative diseases. Pipelines were applied to five cohorts. DISCUSSION The data processing pipelines, research-ready imputed genetic data, and PRS scores are now available on the DPUK platform and can be accessed upon request though the DPUK application process. Harmonizing genome-wide data for multiple datasets increases scientific opportunity and allows the wider research community to access and process data at scale and pace.
Collapse
Affiliation(s)
| | - Sarah Bauermeister
- Dementias Platform UKDepartment of PsychiatryUniversity of OxfordOxfordUK
| | - Dipanwita Ghanti
- Dementias Platform UKDepartment of PsychiatryUniversity of OxfordOxfordUK
| | - Joshua Stevenson‐Hoare
- Division of Psychological Medicine and Clinical NeurosciencesSchool of MedicineCardiff UniversityCardiffUK
| | | | - Keeley Brookes
- BiosciencesSchool of Science and TechnologyNottingham Trent UniversityNottinghamUK
| | | | | | - Paul Elliott
- MRC Centre for Environment and HealthSchool of Public HealthImperial College LondonLondonUK
- UK Dementia Research InstituteImperial College LondonLondonUK
| | - Alan Thomas
- Translational and Clinical Research InstituteNewcastle UniversityNewcastleUK
| | | | - John Gallacher
- Dementias Platform UKDepartment of PsychiatryUniversity of OxfordOxfordUK
| | - Valentina Escott‐Price
- Dementia Research InstituteCardiff UniversityCardiffUK
- Division of Psychological Medicine and Clinical NeurosciencesSchool of MedicineCardiff UniversityCardiffUK
| |
Collapse
|
7
|
Lori A, Pearce BD, Katrinli S, Carter S, Gillespie CF, Bradley B, Wingo AP, Jovanovic T, Michopoulos V, Duncan E, Hinrichs RC, Smith A, Ressler KJ. Genetic risk for hospitalization of African American patients with severe mental illness reveals HLA loci. Front Psychiatry 2024; 15:1140376. [PMID: 38469033 PMCID: PMC10925622 DOI: 10.3389/fpsyt.2024.1140376] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/06/2023] [Accepted: 02/07/2024] [Indexed: 03/13/2024] Open
Abstract
Background Mood disorders such as major depressive and bipolar disorders, along with posttraumatic stress disorder (PTSD), schizophrenia (SCZ), and other psychotic disorders, constitute serious mental illnesses (SMI) and often lead to inpatient psychiatric care for adults. Risk factors associated with increased hospitalization rate in SMI (H-SMI) are largely unknown but likely involve a combination of genetic, environmental, and socio-behavioral factors. We performed a genome-wide association study in an African American cohort to identify possible genes associated with hospitalization due to SMI (H-SMI). Methods Patients hospitalized for psychiatric disorders (H-SMI; n=690) were compared with demographically matched controls (n=4467). Quality control and imputation of genome-wide data were performed following the Psychiatric Genetic Consortium (PGC)-PTSD guidelines. Imputation of the Human Leukocyte Antigen (HLA) locus was performed using the HIBAG package. Results Genome-wide association analysis revealed a genome-wide significant association at 6p22.1 locus in the ubiquitin D (UBD/FAT10) gene (rs362514, p=9.43x10-9) and around the HLA locus. Heritability of H-SMI (14.6%) was comparable to other psychiatric disorders (4% to 45%). We observed a nominally significant association with 2 HLA alleles: HLA-A*23:01 (OR=1.04, p=2.3x10-3) and HLA-C*06:02 (OR=1.04, p=1.5x10-3). Two other genes (VSP13D and TSPAN9), possibly associated with immune response, were found to be associated with H-SMI using gene-based analyses. Conclusion We observed a strong association between H-SMI and a locus that has been consistently and strongly associated with SCZ in multiple studies (6p21.32-p22.1), possibly indicating an involvement of the immune system and the immune response in the development of severe transdiagnostic SMI.
Collapse
Affiliation(s)
- Adriana Lori
- Department of Psychiatry and Behavioral Sciences, Emory University, Atlanta, GA, United States
- Department of Population Science, American Cancer Society, Atlanta, GA, United States
| | - Brad D. Pearce
- Department of Epidemiology, Rollins School of Public Health, Atlanta, GA, United States
| | - Seyma Katrinli
- Department of Gynecology and Obstetrics, Emory University, Atlanta, GA, United States
| | - Sierra Carter
- Department of Psychology, Georgia State University, Atlanta, GA, United States
| | - Charles F. Gillespie
- Department of Psychiatry and Behavioral Sciences, Emory University, Atlanta, GA, United States
| | - Bekh Bradley
- Department of Psychiatry and Behavioral Sciences, Emory University, Atlanta, GA, United States
| | - Aliza P. Wingo
- Department of Psychiatry and Behavioral Sciences, Emory University, Atlanta, GA, United States
- Mental Health Service Line, Department of Veterans Affairs Health Care System, Decatur, GA, United States
| | - Tanja Jovanovic
- Department of Psychiatry and Behavioral Neuroscience, Wayne State University, Detroit, MI, United States
| | - Vasiliki Michopoulos
- Department of Psychiatry and Behavioral Sciences, Emory University, Atlanta, GA, United States
| | - Erica Duncan
- Department of Psychiatry and Behavioral Sciences, Emory University, Atlanta, GA, United States
- Mental Health Service Line, Department of Veterans Affairs Health Care System, Decatur, GA, United States
| | - Rebecca C. Hinrichs
- Department of Psychiatry and Behavioral Sciences, Emory University, Atlanta, GA, United States
| | - Alicia Smith
- Department of Psychiatry and Behavioral Sciences, Emory University, Atlanta, GA, United States
- Department of Gynecology and Obstetrics, Emory University, Atlanta, GA, United States
| | - Kerry J. Ressler
- Department of Psychiatry and Behavioral Sciences, Emory University, Atlanta, GA, United States
- Department of Psychiatry, Harvard Medical School and McLean Hospital, Belmont, MA, United States
| |
Collapse
|
8
|
Rajabli F. Addressing Overlapping Sample Challenges in Genome-Wide Association Studies: Meta-Reductive Approach. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.08.570867. [PMID: 38168201 PMCID: PMC10760010 DOI: 10.1101/2023.12.08.570867] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/05/2024]
Abstract
Polygenic risk scores (PRS) are instrumental in genetics, offering insights into an individual level genetic risk to a range of diseases based on accumulated genetic variations. These scores rely on Genome-Wide Association Studies (GWAS). However, precision in PRS is often challenged by the requirement of extensive sample sizes and the potential for overlapping datasets that can inflate PRS calculations. In this study, we present a novel methodology, Meta-Reductive Approach (MRA), that was derived algebraically to adjust GWAS results, aiming to neutralize the influence of select cohorts. Our approach recalibrates summary statistics using algebraic derivations. Validating our technique with datasets from Alzheimer's disease studies, we showed perfect correlation between summary statistics of proposed approach and "leave-one-out" strategy. This innovative method offers a promising avenue for enhancing the accuracy of PRS, especially when derived from meta-analyzed GWAS data.
Collapse
Affiliation(s)
- Farid Rajabli
- John P. Hussman Institute for Human Genomics, University of Miami Miller School of Medicine, Miami, FL, USA
- Dr. John T Macdonald Foundation Department of Human Genetics, University of Miami Miller School of Medicine, Miami, FL, USA
| |
Collapse
|
9
|
Park DK, Chen M, Kim S, Joo YY, Loving RK, Kim HS, Cha J, Yoo S, Kim JH. Overestimated prediction using polygenic prediction derived from summary statistics. BMC Genom Data 2023; 24:52. [PMID: 37710206 PMCID: PMC10500750 DOI: 10.1186/s12863-023-01151-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2023] [Accepted: 08/16/2023] [Indexed: 09/16/2023] Open
Abstract
BACKGROUND When polygenic risk score (PRS) is derived from summary statistics, independence between discovery and test sets cannot be monitored. We compared two types of PRS studies derived from raw genetic data (denoted as rPRS) and the summary statistics for IGAP (sPRS). RESULTS Two variables with the high heritability in UK Biobank, hypertension, and height, are used to derive an exemplary scale effect of PRS. sPRS without APOE is derived from International Genomics of Alzheimer's Project (IGAP), which records ΔAUC and ΔR2 of 0.051 ± 0.013 and 0.063 ± 0.015 for Alzheimer's Disease Sequencing Project (ADSP) and 0.060 and 0.086 for Accelerating Medicine Partnership - Alzheimer's Disease (AMP-AD). On UK Biobank, rPRS performances for hypertension assuming a similar size of discovery and test sets are 0.0036 ± 0.0027 (ΔAUC) and 0.0032 ± 0.0028 (ΔR2). For height, ΔR2 is 0.029 ± 0.0037. CONCLUSION Considering the high heritability of hypertension and height of UK Biobank and sample size of UK Biobank, sPRS results from AD databases are inflated. Independence between discovery and test sets is a well-known basic requirement for PRS studies. However, a lot of PRS studies cannot follow such requirements because of impossible direct comparisons when using summary statistics. Thus, for sPRS, potential duplications should be carefully considered within the same ethnic group.
Collapse
Affiliation(s)
- David Keetae Park
- Department of Biomedical Engineering, Columbia University, New York, USA
| | - Mingshen Chen
- Department of Applied Mathematics & Statistics, Stony Brook University, New York, USA
| | - Seungsoo Kim
- Department of Obstetrics and Gynecology, Columbia University Irving Medical Center, New York, NY, USA
| | - Yoonjung Yoonie Joo
- Samsung Advanced Institute for Health Sciences & Technology (SAHIST), Sungkyunkwan University, Samsung Medical Center, Seoul, South Korea
| | - Rebekah K Loving
- Department of Biology, California Institute of Technology, Pasadena, USA
| | - Hyoung Seop Kim
- Department of Physical Medicine and Rehabilitation, Dementia Center, National Health Insurance Service Ilsan Hospital, Goyang, South Korea
| | - Jiook Cha
- Department of Psychology, Brain and Cognitive Sciences, AI Institute, Seoul National University, Seoul, South Korea
| | - Shinjae Yoo
- Computational Science Initiative, Brookhaven National Lab. Computer Science and Math, Building 725, Room 2-189, Upton, NY, 11973, USA.
| | - Jong Hun Kim
- Department of Neurology, Dementia Center, National Health Insurance Service Ilsan Hospital, 100 Ilsan-ro Ilsandong-gu, Goyang, Gyeonggi-Do, 10444, South Korea.
| |
Collapse
|