1
|
Zhu L, Yan S, Cao X, Zhang S, Sha Q. Integrating External Controls by Regression Calibration for Genome-Wide Association Study. Genes (Basel) 2024; 15:67. [PMID: 38254957 PMCID: PMC10815702 DOI: 10.3390/genes15010067] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Revised: 12/30/2023] [Accepted: 01/01/2024] [Indexed: 01/24/2024] Open
Abstract
Genome-wide association studies (GWAS) have successfully revealed many disease-associated genetic variants. For a case-control study, the adequate power of an association test can be achieved with a large sample size, although genotyping large samples is expensive. A cost-effective strategy to boost power is to integrate external control samples with publicly available genotyped data. However, the naive integration of external controls may inflate the type I error rates if ignoring the systematic differences (batch effect) between studies, such as the differences in sequencing platforms, genotype-calling procedures, population stratification, and so forth. To account for the batch effect, we propose an approach by integrating External Controls into the Association Test by Regression Calibration (iECAT-RC) in case-control association studies. Extensive simulation studies show that iECAT-RC not only can control type I error rates but also can boost statistical power in all models. We also apply iECAT-RC to the UK Biobank data for M72 Fibroblastic disorders by considering genotype calling as the batch effect. Four SNPs associated with fibroblastic disorders have been detected by iECAT-RC and the other two comparison methods, iECAT-Score and Internal. However, our method has a higher probability of identifying these significant SNPs in the scenario of an unbalanced case-control association study.
Collapse
Affiliation(s)
| | | | | | | | - Qiuying Sha
- Department of Mathematical Sciences, Michigan Technological University, Houghton, MI 49931, USA; (L.Z.); (S.Y.); (X.C.); (S.Z.)
| |
Collapse
|
2
|
Rajabli F, Kunkle BW. Strategies in Aggregation Tests for Rare Variants. Curr Protoc 2023; 3:e931. [PMID: 37988228 DOI: 10.1002/cpz1.931] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2023]
Abstract
Genome-wide association studies (GWAS) successfully identified numerous common variants involved in complex diseases, but only limited heritability was explained by these findings. Advances in high-throughput sequencing technology made it possible to assess the contribution of rare variants in common diseases. However, study of rare variants introduces challenges due to low frequency of rare variants. Well-established common variant methods were underpowered to identify the rare variants in GWAS. To address this challenge, several new methods have been developed to examine the role of rare variants in complex diseases. These approaches are based on testing the aggregate effect of multiple rare variants in a predefined genetic region. Provided here is an overview of statistical approaches and the protocols explaining step-by-step analysis of aggregations tests with the hands-on experience using R scripts in four categories: burden tests, adaptive burden tests, variance-component tests, and combined tests. Also explained are the concepts of rare variants, permutation tests, kernel methods, and genetic variant annotation. At the end we discuss relevant topics of bioinformatics tools for annotation, family-based design of rare-variant analysis, population stratification adjustment, and meta-analysis. © 2023 The Authors. Current Protocols published by Wiley Periodicals LLC.
Collapse
Affiliation(s)
- Farid Rajabli
- Dr. John T. Macdonald Foundation Department of Human Genetics, John P. Hussman Institute for Human Genomics, University of Miami Miller School of Medicine, Miami, Florida, USA
| | - Brian W Kunkle
- Dr. John T. Macdonald Foundation Department of Human Genetics, John P. Hussman Institute for Human Genomics, University of Miami Miller School of Medicine, Miami, Florida, USA
| |
Collapse
|
3
|
Liang X, Sun H. Weighted Selection Probability to Prioritize Susceptible Rare Variants in Multi-Phenotype Association Studies with Application to a Soybean Genetic Data Set. J Comput Biol 2023; 30:1075-1088. [PMID: 37871292 DOI: 10.1089/cmb.2022.0487] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2023] Open
Abstract
Rare variant association studies with multiple traits or diseases have drawn a lot of attention since association signals of rare variants can be boosted if more than one phenotype outcome is associated with the same rare variants. Most of the existing statistical methods to identify rare variants associated with multiple phenotypes are based on a group test, where a pre-specified genetic region is tested one at a time. However, these methods are not designed to locate susceptible rare variants within the genetic region. In this article, we propose new statistical methods to prioritize rare variants within a genetic region when a group test for the genetic region identifies a statistical association with multiple phenotypes. It computes the weighted selection probability (WSP) of individual rare variants and ranks them from largest to smallest according to their WSP. In simulation studies, we demonstrated that the proposed method outperforms other statistical methods in terms of true positive selection, when multiple phenotypes are correlated with each other. We also applied it to our soybean single nucleotide polymorphism (SNP) data with 13 highly correlated amino acids, where we identified some potentially susceptible rare variants in chromosome 19.
Collapse
Affiliation(s)
- Xianglong Liang
- Department of Statistic, Pusan National University, Busan, Korea
| | - Hokeun Sun
- Department of Statistic, Pusan National University, Busan, Korea
| |
Collapse
|
4
|
Boutry S, Helaers R, Lenaerts T, Vikkula M. Rare variant association on unrelated individuals in case-control studies using aggregation tests: existing methods and current limitations. Brief Bioinform 2023; 24:bbad412. [PMID: 37974506 DOI: 10.1093/bib/bbad412] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Revised: 10/14/2023] [Accepted: 10/28/2023] [Indexed: 11/19/2023] Open
Abstract
Over the past years, progress made in next-generation sequencing technologies and bioinformatics have sparked a surge in association studies. Especially, genome-wide association studies (GWASs) have demonstrated their effectiveness in identifying disease associations with common genetic variants. Yet, rare variants can contribute to additional disease risk or trait heterogeneity. Because GWASs are underpowered for detecting association with such variants, numerous statistical methods have been recently proposed. Aggregation tests collapse multiple rare variants within a genetic region (e.g. gene, gene set, genomic loci) to test for association. An increasing number of studies using such methods successfully identified trait-associated rare variants and led to a better understanding of the underlying disease mechanism. In this review, we compare existing aggregation tests, their statistical features and scope of application, splitting them into the five classical classes: burden, adaptive burden, variance-component, omnibus and other. Finally, we describe some limitations of current aggregation tests, highlighting potential direction for further investigations.
Collapse
Affiliation(s)
- Simon Boutry
- Human Molecular Genetics, de Duve Institute, University of Louvain, Avenue Hippocrate 74 (+5) bte B1.74.06, 1200 Brussels, Belgium
- Interuniversity Institute of Bioinformatics in Brussels, Université Libre de Bruxelles-Vrije Universiteit Brussels, 1050 Brussels, Belgium
| | - Raphaël Helaers
- Human Molecular Genetics, de Duve Institute, University of Louvain, Avenue Hippocrate 74 (+5) bte B1.74.06, 1200 Brussels, Belgium
| | - Tom Lenaerts
- Interuniversity Institute of Bioinformatics in Brussels, Université Libre de Bruxelles-Vrije Universiteit Brussels, 1050 Brussels, Belgium
- Machine Learning Group, Université Libre de Bruxelles, 1050 Brussels, Belgium
- Artificial Intelligence laboratory, Vrije Universiteit Brussel, 1050 Brussels, Belgium
| | - Miikka Vikkula
- Human Molecular Genetics, de Duve Institute, University of Louvain, Avenue Hippocrate 74 (+5) bte B1.74.06, 1200 Brussels, Belgium
- WELBIO department, WEL Research Institute, avenue Pasteur, 6, 1300 Wavre, Belgium
| |
Collapse
|
5
|
Boutry S, Helaers R, Lenaerts T, Vikkula M. Excalibur: A new ensemble method based on an optimal combination of aggregation tests for rare-variant association testing for sequencing data. PLoS Comput Biol 2023; 19:e1011488. [PMID: 37708232 PMCID: PMC10522036 DOI: 10.1371/journal.pcbi.1011488] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Revised: 09/26/2023] [Accepted: 09/04/2023] [Indexed: 09/16/2023] Open
Abstract
The development of high-throughput next-generation sequencing technologies and large-scale genetic association studies produced numerous advances in the biostatistics field. Various aggregation tests, i.e. statistical methods that analyze associations of a trait with multiple markers within a genomic region, have produced a variety of novel discoveries. Notwithstanding their usefulness, there is no single test that fits all needs, each suffering from specific drawbacks. Selecting the right aggregation test, while considering an unknown underlying genetic model of the disease, remains an important challenge. Here we propose a new ensemble method, called Excalibur, based on an optimal combination of 36 aggregation tests created after an in-depth study of the limitations of each test and their impact on the quality of result. Our findings demonstrate the ability of our method to control type I error and illustrate that it offers the best average power across all scenarios. The proposed method allows for novel advances in Whole Exome/Genome sequencing association studies, able to handle a wide range of association models, providing researchers with an optimal aggregation analysis for the genetic regions of interest.
Collapse
Affiliation(s)
- Simon Boutry
- Human Molecular Genetics, de Duve Institute, University of Louvain, Brussels, Belgium
- Interuniversity Institute of Bioinformatics in Brussels, Université Libre de Bruxelles-Vrije Universiteit Brussels, Brussels, Belgium
| | - Raphaël Helaers
- Human Molecular Genetics, de Duve Institute, University of Louvain, Brussels, Belgium
| | - Tom Lenaerts
- Interuniversity Institute of Bioinformatics in Brussels, Université Libre de Bruxelles-Vrije Universiteit Brussels, Brussels, Belgium
- Machine Learning Group, Université Libre de Bruxelles, Brussels, Belgium
- Artificial Intelligence laboratory, Vrije Universiteit Brussel, Brussels, Belgium
| | - Miikka Vikkula
- Human Molecular Genetics, de Duve Institute, University of Louvain, Brussels, Belgium
- WELBIO department, WEL Research Institute, Wavre, Belgium
| |
Collapse
|
6
|
Chen F, Wang X, Jang SK, Quach BC, Weissenkampen JD, Khunsriraksakul C, Yang L, Sauteraud R, Albert CM, Allred NDD, Arnett DK, Ashley-Koch AE, Barnes KC, Barr RG, Becker DM, Bielak LF, Bis JC, Blangero J, Boorgula MP, Chasman DI, Chavan S, Chen YDI, Chuang LM, Correa A, Curran JE, David SP, Fuentes LDL, Deka R, Duggirala R, Faul JD, Garrett ME, Gharib SA, Guo X, Hall ME, Hawley NL, He J, Hobbs BD, Hokanson JE, Hsiung CA, Hwang SJ, Hyde TM, Irvin MR, Jaffe AE, Johnson EO, Kaplan R, Kardia SLR, Kaufman JD, Kelly TN, Kleinman JE, Kooperberg C, Lee IT, Levy D, Lutz SM, Manichaikul AW, Martin LW, Marx O, McGarvey ST, Minster RL, Moll M, Moussa KA, Naseri T, North KE, Oelsner EC, Peralta JM, Peyser PA, Psaty BM, Rafaels N, Raffield LM, Reupena MS, Rich SS, Rotter JI, Schwartz DA, Shadyab AH, Sheu WHH, Sims M, Smith JA, Sun X, Taylor KD, Telen MJ, Watson H, Weeks DE, Weir DR, Yanek LR, Young KA, Young KL, Zhao W, Hancock DB, Jiang B, Vrieze S, Liu DJ. Multi-ancestry transcriptome-wide association analyses yield insights into tobacco use biology and drug repurposing. Nat Genet 2023; 55:291-300. [PMID: 36702996 PMCID: PMC9925385 DOI: 10.1038/s41588-022-01282-x] [Citation(s) in RCA: 15] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2021] [Accepted: 12/08/2022] [Indexed: 01/27/2023]
Abstract
Most transcriptome-wide association studies (TWASs) so far focus on European ancestry and lack diversity. To overcome this limitation, we aggregated genome-wide association study (GWAS) summary statistics, whole-genome sequences and expression quantitative trait locus (eQTL) data from diverse ancestries. We developed a new approach, TESLA (multi-ancestry integrative study using an optimal linear combination of association statistics), to integrate an eQTL dataset with a multi-ancestry GWAS. By exploiting shared phenotypic effects between ancestries and accommodating potential effect heterogeneities, TESLA improves power over other TWAS methods. When applied to tobacco use phenotypes, TESLA identified 273 new genes, up to 55% more compared with alternative TWAS methods. These hits and subsequent fine mapping using TESLA point to target genes with biological relevance. In silico drug-repurposing analyses highlight several drugs with known efficacy, including dextromethorphan and galantamine, and new drugs such as muscle relaxants that may be repurposed for treating nicotine addiction.
Collapse
Affiliation(s)
- Fang Chen
- Department of Public Health Sciences, Penn State College of Medicine, Hershey, PA, USA
| | - Xingyan Wang
- Department of Public Health Sciences, Penn State College of Medicine, Hershey, PA, USA
| | - Seon-Kyeong Jang
- Department of Psychology, University of Minnesota, Minneapolis, MN, USA
| | | | - J Dylan Weissenkampen
- Department of Genetics, University of Pennsylvania, Philadelphia, PA, USA
- Department of Psychology, Penn State College of Medicine, Hershey, PA, USA
| | | | - Lina Yang
- Department of Public Health Sciences, Penn State College of Medicine, Hershey, PA, USA
| | - Renan Sauteraud
- Department of Public Health Sciences, Penn State College of Medicine, Hershey, PA, USA
| | - Christine M Albert
- Department of Cardiology, Cedars-Sinai Medical Center, Los Angeles, CA, USA
- Division of Preventive Medicine, Brigham and Women's Hospital, Boston, MA, USA
| | | | - Donna K Arnett
- College of Public Health, University of Kentucky, Lexington, KY, USA
| | - Allison E Ashley-Koch
- Duke Molecular Physiology Institute, Duke University Medical Center, Durham, NC, USA
- Department of Medicine, Duke University Medical Center, Durham, NC, USA
- Duke Comprehensive Sickle Cell Center, Duke University Medical Center, Durham, NC, USA
| | - Kathleen C Barnes
- Colorado Center for Personalized Medicine, University of Colorado Anschutz Medical Center, Aurora, CO, USA
| | - R Graham Barr
- Department of Medicine, Columbia University Medical Center, New York, NY, USA
| | - Diane M Becker
- Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Lawrence F Bielak
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
| | - Joshua C Bis
- Department of Medicine, Cardiovascular Health Research Unit, University of Washington, Seattle, WA, USA
| | - John Blangero
- Department of Human Genetics, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX, USA
- South Texas Diabetes and Obesity Institute, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX, USA
| | - Meher Preethi Boorgula
- Colorado Center for Personalized Medicine, University of Colorado Anschutz Medical Center, Aurora, CO, USA
| | - Daniel I Chasman
- Division of Preventive Medicine, Brigham and Women's Hospital, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| | - Sameer Chavan
- Colorado Center for Personalized Medicine, University of Colorado Anschutz Medical Center, Aurora, CO, USA
| | - Yii-Der I Chen
- Department of Pediatrics, Institute for Translational Genomics and Population Sciences, Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
| | - Lee-Ming Chuang
- Department of Internal Medicine, National Taiwan University Hospital, Taipei, Taiwan
| | - Adolfo Correa
- Department of Medicine, Jackson Heart Study, University of Mississippi Medical Center, Jackson, MS, USA
| | - Joanne E Curran
- Department of Human Genetics, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX, USA
- South Texas Diabetes and Obesity Institute, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX, USA
| | - Sean P David
- University of Chicago, Chicago, IL, USA
- NorthShore University Health System, Evanston, IL, USA
| | - Lisa de Las Fuentes
- Department of Medicine, Division of Biostatistics and Cardiovascular Division, Washington University School of Medicine, St. Louis, MO, USA
| | - Ranjan Deka
- Department of Environmental and Public Health Sciences, College of Medicine, University of Cincinnati, Cincinnati, OH, USA
| | - Ravindranath Duggirala
- Department of Human Genetics, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX, USA
- South Texas Diabetes and Obesity Institute, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX, USA
| | - Jessica D Faul
- Institute for Social Research, Survey Research Center, University of Michigan, Ann Arbor, MI, USA
| | - Melanie E Garrett
- Duke Molecular Physiology Institute, Duke University Medical Center, Durham, NC, USA
- Department of Medicine, Duke University Medical Center, Durham, NC, USA
| | - Sina A Gharib
- Department of Medicine, Cardiovascular Health Research Unit, University of Washington, Seattle, WA, USA
- Computational Medicine Core at Center for Lung Biology, Division of Pulmonary, Critical Care and Sleep Medicine, University of Washington, Seattle, WA, USA
| | - Xiuqing Guo
- Department of Pediatrics, Institute for Translational Genomics and Population Sciences, Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
| | - Michael E Hall
- Department of Medicine, University of Mississippi Medical Center, Jackson, MS, USA
| | - Nicola L Hawley
- Department of Epidemiology (Chronic Disease), School of Public Health, Yale University, New Haven, CT, USA
| | - Jiang He
- Department of Epidemiology, Tulane University School of Public Health and Tropical Medicine, New Orleans, LA, USA
| | - Brian D Hobbs
- Harvard Medical School, Boston, MA, USA
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA, USA
- Division of Pulmonary and Critical Care Medicine, Brigham and Women's Hospital, Boston, MA, USA
| | - John E Hokanson
- Department of Epidemiology, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Chao A Hsiung
- Institute of Population Health Sciences, National Health Research Institutes, Zhunan, Taiwan
| | - Shih-Jen Hwang
- The Population Sciences Branch, Division of Intramural Research, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, USA
- The Framingham Heart Study, Framingham, MA, USA
| | - Thomas M Hyde
- Lieber Institute for Brain Development, Baltimore, MD, USA
- Department of Psychiatry and Behavioral Sciences, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Neurology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Marguerite R Irvin
- Department of Epidemiology, University of Alabama at Birmingham, Birmingham, AL, USA
| | - Andrew E Jaffe
- Lieber Institute for Brain Development, Baltimore, MD, USA
- Department of Psychiatry and Behavioral Sciences, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Mental Health and Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
- Department of Human Genetics and Department of Neuroscience, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | | | - Robert Kaplan
- Department of Epidemiology and Population Health, Albert Einstein College of Medicine, The Bronx, NY, USA
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Sharon L R Kardia
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
| | - Joel D Kaufman
- Departments of Environmental & Occupational Health Sciences, Medicine, and Epidemiology, University of Washington Seattle, Seattle, WA, USA
| | - Tanika N Kelly
- Department of Epidemiology, Tulane University School of Public Health and Tropical Medicine, New Orleans, LA, USA
| | - Joel E Kleinman
- Lieber Institute for Brain Development, Baltimore, MD, USA
- Department of Psychiatry and Behavioral Sciences, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | | | - I-Te Lee
- Department of Internal Medicine, Division of Endocrinology and Metabolism, Taichung Veterans General Hospital, Taichung, Taiwan
| | - Daniel Levy
- The Population Sciences Branch, Division of Intramural Research, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, USA
| | - Sharon M Lutz
- Department of Population Medicine, Harvard Pilgrim Health Care, Boston, MA, USA
| | - Ani W Manichaikul
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
| | - Lisa W Martin
- Division of Cardiology, George Washington University School of Medicine and Health Sciences, Washington, DC, USA
| | - Olivia Marx
- Department of Biomedical Sciences, Penn State College of Medicine, Hershey, PA, USA
| | - Stephen T McGarvey
- Department of Epidemiology, International Health Institute, Brown University School of Public Health, Providence, RI, USA
| | - Ryan L Minster
- Department of Human Genetics and Department of Biostatistics, University of Pittsburgh, Pittsburgh, PA, USA
| | - Matthew Moll
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA, USA
- Division of Pulmonary and Critical Care Medicine, Brigham and Women's Hospital, Boston, MA, USA
| | - Karine A Moussa
- Penn State Huck Institutes of Life Sciences, Penn State College of Medicine, University Park, PA, USA
| | - Take Naseri
- Ministry of Health, Government of Samoa, Apia, Samoa
| | - Kari E North
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Elizabeth C Oelsner
- Department of Medicine, Columbia University Medical Center, New York, NY, USA
| | - Juan M Peralta
- Department of Human Genetics, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX, USA
- South Texas Diabetes and Obesity Institute, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX, USA
| | - Patricia A Peyser
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
| | - Bruce M Psaty
- Department of Medicine, Cardiovascular Health Research Unit, University of Washington, Seattle, WA, USA
- Department of Epidemiology, University of Washington, Seattle, WA, USA
- Department of Health Systems and Population Health, University of Washington, Seattle, WA, USA
| | - Nicholas Rafaels
- Colorado Center for Personalized Medicine, University of Colorado Anschutz Medical Center, Aurora, CO, USA
| | - Laura M Raffield
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | | | - Stephen S Rich
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
| | - Jerome I Rotter
- Department of Pediatrics, Institute for Translational Genomics and Population Sciences, Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
| | | | - Aladdin H Shadyab
- Herbert Wertheim School of Public Health and Human Longevity Science, University of California San Diego, La Jolla, CA, USA
| | | | - Mario Sims
- Department of Medicine, University of Mississippi Medical Center, Jackson, MS, USA
| | - Jennifer A Smith
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
- Institute for Social Research, Survey Research Center, University of Michigan, Ann Arbor, MI, USA
| | - Xiao Sun
- Department of Epidemiology, Tulane University School of Public Health and Tropical Medicine, New Orleans, LA, USA
| | - Kent D Taylor
- Department of Pediatrics, Institute for Translational Genomics and Population Sciences, Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
| | - Marilyn J Telen
- Department of Medicine, Duke University Medical Center, Durham, NC, USA
| | - Harold Watson
- Faculty of Medical Sciences, University of the West Indies, Cave Hill Campus, Barbados
| | - Daniel E Weeks
- Department of Human Genetics and Department of Biostatistics, University of Pittsburgh, Pittsburgh, PA, USA
| | - David R Weir
- Institute for Social Research, Survey Research Center, University of Michigan, Ann Arbor, MI, USA
| | - Lisa R Yanek
- Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Kendra A Young
- Department of Epidemiology, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Kristin L Young
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Wei Zhao
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
- Institute for Social Research, Survey Research Center, University of Michigan, Ann Arbor, MI, USA
| | | | - Bibo Jiang
- Department of Public Health Sciences, Penn State College of Medicine, Hershey, PA, USA.
| | - Scott Vrieze
- Department of Psychology, University of Minnesota, Minneapolis, MN, USA.
| | - Dajiang J Liu
- Department of Public Health Sciences, Penn State College of Medicine, Hershey, PA, USA.
| |
Collapse
|
7
|
Defo J, Awany D, Ramesar R. From SNP to pathway-based GWAS meta-analysis: do current meta-analysis approaches resolve power and replication in genetic association studies? Brief Bioinform 2023; 24:6972298. [PMID: 36611240 DOI: 10.1093/bib/bbac600] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Revised: 11/30/2022] [Accepted: 12/06/2022] [Indexed: 01/09/2023] Open
Abstract
Genome-wide association studies (GWAS) have benefited greatly from enhanced high-throughput technology in recent decades. GWAS meta-analysis has become increasingly popular to highlight the genetic architecture of complex traits, informing about the replicability and variability of effect estimations across human ancestries. A wealth of GWAS meta-analysis methodologies have been developed depending on the input data and the outcome information of interest. We present a survey of current approaches from SNP to pathway-based meta-analysis by acknowledging the range of resources and methodologies in the field, and we provide a comprehensive review of different categories of Genome-Wide Meta-analysis methods employed. These methods highlight different levels at which GWAS meta-analysis may be done, including Single Nucleotide Polymorphisms, Genes and Pathways, for which we describe their framework outline. We also discuss the strengths and pitfalls of each approach and make suggestions regarding each of them.
Collapse
Affiliation(s)
- Joel Defo
- Division of Human Genetics, Department of Pathology, Faculty of Health Sciences, Institute of Infectious Disease and Molecular Medicine, University of Cape Town, 7925, Observatory, South Africa.,South African Medical Research Council Genomic and Personalized Medicine Research Unit
| | - Denis Awany
- South African Tuberculosis Vaccine Initiative (SATVI), University of Cape Town, 7925, South Africa
| | - Raj Ramesar
- Division of Human Genetics, Department of Pathology, Faculty of Health Sciences, Institute of Infectious Disease and Molecular Medicine, University of Cape Town, 7925, Observatory, South Africa.,South African Medical Research Council Genomic and Personalized Medicine Research Unit
| |
Collapse
|
8
|
Saunders GRB, Wang X, Chen F, Jang SK, Liu M, Wang C, Gao S, Jiang Y, Khunsriraksakul C, Otto JM, Addison C, Akiyama M, Albert CM, Aliev F, Alonso A, Arnett DK, Ashley-Koch AE, Ashrani AA, Barnes KC, Barr RG, Bartz TM, Becker DM, Bielak LF, Benjamin EJ, Bis JC, Bjornsdottir G, Blangero J, Bleecker ER, Boardman JD, Boerwinkle E, Boomsma DI, Boorgula MP, Bowden DW, Brody JA, Cade BE, Chasman DI, Chavan S, Chen YDI, Chen Z, Cheng I, Cho MH, Choquet H, Cole JW, Cornelis MC, Cucca F, Curran JE, de Andrade M, Dick DM, Docherty AR, Duggirala R, Eaton CB, Ehringer MA, Esko T, Faul JD, Fernandes Silva L, Fiorillo E, Fornage M, Freedman BI, Gabrielsen ME, Garrett ME, Gharib SA, Gieger C, Gillespie N, Glahn DC, Gordon SD, Gu CC, Gu D, Gudbjartsson DF, Guo X, Haessler J, Hall ME, Haller T, Harris KM, He J, Herd P, Hewitt JK, Hickie I, Hidalgo B, Hokanson JE, Hopfer C, Hottenga J, Hou L, Huang H, Hung YJ, Hunter DJ, Hveem K, Hwang SJ, Hwu CM, Iacono W, Irvin MR, Jee YH, Johnson EO, Joo YY, Jorgenson E, Justice AE, Kamatani Y, Kaplan RC, Kaprio J, Kardia SLR, Keller MC, Kelly TN, Kooperberg C, Korhonen T, Kraft P, Krauter K, Kuusisto J, Laakso M, Lasky-Su J, Lee WJ, Lee JJ, Levy D, Li L, Li K, Li Y, Lin K, Lind PA, Liu C, Lloyd-Jones DM, Lutz SM, Ma J, Mägi R, Manichaikul A, Martin NG, Mathur R, Matoba N, McArdle PF, McGue M, McQueen MB, Medland SE, Metspalu A, Meyers DA, Millwood IY, Mitchell BD, Mohlke KL, Moll M, Montasser ME, Morrison AC, Mulas A, Nielsen JB, North KE, Oelsner EC, Okada Y, Orrù V, Palmer ND, Palviainen T, Pandit A, Park SL, Peters U, Peters A, Peyser PA, Polderman TJC, Rafaels N, Redline S, Reed RM, Reiner AP, Rice JP, Rich SS, Richmond NE, Roan C, Rotter JI, Rueschman MN, Runarsdottir V, Saccone NL, Schwartz DA, Shadyab AH, Shi J, Shringarpure SS, Sicinski K, Skogholt AH, Smith JA, Smith NL, Sotoodehnia N, Stallings MC, Stefansson H, Stefansson K, Stitzel JA, Sun X, Syed M, Tal-Singer R, Taylor AE, Taylor KD, Telen MJ, Thai KK, Tiwari H, Turman C, Tyrfingsson T, Wall TL, Walters RG, Weir DR, Weiss ST, White WB, Whitfield JB, Wiggins KL, Willemsen G, Willer CJ, Winsvold BS, Xu H, Yanek LR, Yin J, Young KL, Young KA, Yu B, Zhao W, Zhou W, Zöllner S, Zuccolo L, Batini C, Bergen AW, Bierut LJ, David SP, Gagliano Taliun SA, Hancock DB, Jiang B, Munafò MR, Thorgeirsson TE, Liu DJ, Vrieze S. Genetic diversity fuels gene discovery for tobacco and alcohol use. Nature 2022; 612:720-724. [PMID: 36477530 PMCID: PMC9771818 DOI: 10.1038/s41586-022-05477-4] [Citation(s) in RCA: 116] [Impact Index Per Article: 58.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2022] [Accepted: 10/25/2022] [Indexed: 12/12/2022]
Abstract
Tobacco and alcohol use are heritable behaviours associated with 15% and 5.3% of worldwide deaths, respectively, due largely to broad increased risk for disease and injury1-4. These substances are used across the globe, yet genome-wide association studies have focused largely on individuals of European ancestries5. Here we leveraged global genetic diversity across 3.4 million individuals from four major clines of global ancestry (approximately 21% non-European) to power the discovery and fine-mapping of genomic loci associated with tobacco and alcohol use, to inform function of these loci via ancestry-aware transcriptome-wide association studies, and to evaluate the genetic architecture and predictive power of polygenic risk within and across populations. We found that increases in sample size and genetic diversity improved locus identification and fine-mapping resolution, and that a large majority of the 3,823 associated variants (from 2,143 loci) showed consistent effect sizes across ancestry dimensions. However, polygenic risk scores developed in one ancestry performed poorly in others, highlighting the continued need to increase sample sizes of diverse ancestries to realize any potential benefit of polygenic prediction.
Collapse
Affiliation(s)
| | - Xingyan Wang
- Department of Public Health Sciences, Penn State College of Medicine, Hershey, PA, USA
| | - Fang Chen
- Department of Public Health Sciences, Penn State College of Medicine, Hershey, PA, USA
| | - Seon-Kyeong Jang
- Department of Psychology, University of Minnesota, Minneapolis, MN, USA
| | - Mengzhen Liu
- Department of Psychology, University of Minnesota, Minneapolis, MN, USA
| | - Chen Wang
- Department of Public Health Sciences, Penn State College of Medicine, Hershey, PA, USA
| | - Shuang Gao
- Department of Public Health Sciences, Penn State College of Medicine, Hershey, PA, USA
| | - Yu Jiang
- Department of Epidemiology & Population Health at Stanford University, Stanford, CA, USA
| | | | - Jacqueline M Otto
- Department of Psychology, University of Minnesota, Minneapolis, MN, USA
| | - Clifton Addison
- Jackson Heart Study (JHS) Graduate Training and Education Center (GTEC), Department of Epidemiology and Biostatistics, School of Public Health, Jackson State University, Jackson, MS, USA
| | - Masato Akiyama
- Laboratory for Statistical and Translational Genetics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
- Department of Ocular Pathology and Imaging Science, Kyushu University Graduate School of Medical Sciences, Fukuoka, Japan
| | - Christine M Albert
- Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, CA, USA
- Division of Preventive Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | - Fazil Aliev
- Department of Psychiatry, Rutgers Robert Wood Johnson Medical School, New Brunswick, NJ, USA
| | - Alvaro Alonso
- Department of Epidemiology, Rollins School of Public Health, Emory University, Atlanta, GA, USA
| | - Donna K Arnett
- Dean's Office and Department of Epidemiology, College of Public Health, University of Kentucky, Lexington, KY, USA
| | - Allison E Ashley-Koch
- Department of Medicine and Duke Comprehensive Sickle Cell Center, Duke University School of Medicine, Durham, NC, USA
- Duke Molecular Physiology Institute, Duke University School of Medicine, Durham, NC, USA
| | - Aneel A Ashrani
- Division of Hematology, Department of Medicine, Mayo Clinic College of Medicine and Science, Rochester, MN, USA
| | - Kathleen C Barnes
- Division of Biomedical Informatics & Personalized Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
- Tempus, Chicago, IL, USA
| | - R Graham Barr
- Department of Medicine, Columbia University Medical Center, New York, NY, USA
| | - Traci M Bartz
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
- Department of Biostatistics, University of Washington, Seattle, WA, USA
| | - Diane M Becker
- Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Lawrence F Bielak
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
| | - Emelia J Benjamin
- Department of Medicine, Boston Medical Center, Boston University School of Medicine, Boston, MA, USA
- Department of Epidemiology, Boston University School of Public Health, Boston, MA, USA
| | - Joshua C Bis
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
| | | | - John Blangero
- Department of Human Genetics and South Texas Diabetes and Obesity Institute, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX, USA
| | | | - Jason D Boardman
- Institute of Behavioral Science, University of Colorado Boulder, Boulder, CO, USA
| | - Eric Boerwinkle
- Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Dorret I Boomsma
- Netherlands Twin Register, Dept Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
| | - Meher Preethi Boorgula
- Division of Biomedical Informatics & Personalized Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Donald W Bowden
- Department of Biochemistry, Wake Forest School of Medicine, Winston-Salem, NC, USA
| | - Jennifer A Brody
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
| | - Brian E Cade
- Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA, USA
- Division of Sleep Medicine, Harvard Medical School, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute, Cambridge, MA, USA
| | - Daniel I Chasman
- Division of Preventive Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | - Sameer Chavan
- Division of Biomedical Informatics & Personalized Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Yii-Der Ida Chen
- Institute for Translational Genomics and Population Sciences, Department of Pediatrics, Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
| | - Zhengming Chen
- Clinical Trial Service Unit and Epidemiological Studies Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
- MRC Population Health Research Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Iona Cheng
- Department of Epidemiology & Biostatistics, University of California, San Francisco, CA, USA
- UCSF Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, CA, USA
| | - Michael H Cho
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA
- Division of Pulmonary and Critical Care Medicine, Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA
| | - Hélène Choquet
- Kaiser Permanente Northern California (KPNC), Division of Research, Oakland, CA, USA
| | - John W Cole
- Department of Neurology, Baltimore Veterans Affairs Medical Center, Baltimore, MD, USA
- Division of Vascular Neurology, Department of Neurology, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Marilyn C Cornelis
- Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | | | - Joanne E Curran
- Department of Human Genetics and South Texas Diabetes and Obesity Institute, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX, USA
| | - Mariza de Andrade
- Division of Clinical Trials and Biostatistics, Department of Quantitative Health Sciences, Mayo Clinic College of Medicine and Science, Rochester, MN, USA
| | - Danielle M Dick
- Department of Psychiatry, Rutgers Robert Wood Johnson Medical School, New Brunswick, NJ, USA
| | - Anna R Docherty
- Department of Psychiatry, University of Utah School of Medicine, Salt Lake City, UT, USA
- Virginia Institute for Psychiatric and Behavioral Genetics, Virginia Commonwealth University, Virginia, USA
- Huntsman Mental Health Institute, Salt Lake City, UT, USA
| | - Ravindranath Duggirala
- Department of Human Genetics and South Texas Diabetes and Obesity Institute, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX, USA
| | - Charles B Eaton
- Department of Family Medicine, Brown University, Providence, RI, USA
| | - Marissa A Ehringer
- Institute for Behavioral Genetics, University of Colorado Boulder, Boulder, CO, USA
- Department of Integrative Physiology, University of Colorado Boulder, Boulder, CO, USA
| | - Tõnu Esko
- Institute of Genomics, University of Tartu, Tartu, Estonia
| | - Jessica D Faul
- Survey Research Center, Institute for Social Research, University of Michigan, Ann Arbor, MI, USA
| | - Lilian Fernandes Silva
- Institute of Clinical Medicine, Internal Medicine, University of Eastern Finland, Kuopio, Finland
| | - Edoardo Fiorillo
- Istituto di Ricerca Genetica e Biomedica, Consiglio Nazionale delle Ricerche (CNR), Monserrato, Italy
| | - Myriam Fornage
- Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
- Brown Foundation Institute of Molecular Medicine, McGovern Medical School, University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Barry I Freedman
- Department of Internal Medicine-Section on Nephrology, Wake Forest School of Medicine, Winston-Salem, NC, USA
| | - Maiken E Gabrielsen
- K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, NTNU, Norwegian University of Science and Technology, Trondheim, Norway
| | - Melanie E Garrett
- Department of Medicine and Duke Comprehensive Sickle Cell Center, Duke University School of Medicine, Durham, NC, USA
- Duke Molecular Physiology Institute, Duke University School of Medicine, Durham, NC, USA
| | - Sina A Gharib
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
- Division of Pulmonary, Critical Care, and Sleep Medicine, Department of Medicine, University of Washington, Seattle, WA, USA
- Center for Lung Biology, Department of Medicine, University of Washington, Seattle, WA, USA
| | - Christian Gieger
- Research Unit Molecular Epidemiology, Institute of Epidemiology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
| | - Nathan Gillespie
- Virginia Institute for Psychiatric and Behavioral Genetics, Virginia Commonwealth University, Virginia, USA
| | - David C Glahn
- Department of Psychiatry & Behavioral Sciences, Boston Children's Hospital & Harvard Medical School, Boston, MA, USA
| | - Scott D Gordon
- Genetic Epidemiology, QIMR Berghofer Medical Research Institute, Brisbane, Australia
| | - Charles C Gu
- Division of Biostatistics, Washington University School of Medicine, St. Louis, MO, USA
| | - Dongfeng Gu
- Department of Epidemiology and Key Laboratory of Cardiovascular Epidemiology, Fuwai Hospital, National Center for Cardiovascular Diseases, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Daniel F Gudbjartsson
- deCODE Genetics/Amgen, Inc., Reykjavik, Iceland
- School of Engineering and Natural Sciences, University of Iceland, Reykjavik, Iceland
| | - Xiuqing Guo
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
| | - Jeffrey Haessler
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Michael E Hall
- Department of Medicine, University of Mississippi Medical Center, Jackson, MS, USA
| | - Toomas Haller
- Institute of Genomics, University of Tartu, Tartu, Estonia
| | - Kathleen Mullan Harris
- Department of Sociology and the Carolina Population Center, University of North Carolina, Chapel Hill, NC, USA
| | - Jiang He
- Department of Epidemiology, Tulane University, New Orleans, LA, USA
- Translational Sciences Institute, Tulane University, New Orleans, LA, USA
| | - Pamela Herd
- McCourt School of Public Policy, Georgetown University, Washington, DC, USA
| | - John K Hewitt
- Institute for Behavioral Genetics, University of Colorado Boulder, Boulder, CO, USA
- Department Of Psychology and Neuroscience, University of Colorado Boulder, Boulder, CO, USA
| | - Ian Hickie
- Youth Mental Health & Technology Team, Brain and Mind Centre, University of Sydney, Sydney, Australia
| | - Bertha Hidalgo
- Department of Epidemiology, School of Public Health, University of Alabama at Birmingham, Birmingham, AL, USA
| | - John E Hokanson
- Department of Epidemiology, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Christian Hopfer
- Department of Psychiatry, University of Colorado Anschutz Medical Center, Denver, CO, USA
| | - JoukeJan Hottenga
- Netherlands Twin Register, Dept Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
| | - Lifang Hou
- Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Hongyan Huang
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Program in Genetic Epidemiology and Statistical Genetics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Yi-Jen Hung
- Institute of Preventive Medicine, National Defense Medical Center, New Taipei City, Taiwan
| | - David J Hunter
- Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Kristian Hveem
- K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, NTNU, Norwegian University of Science and Technology, Trondheim, Norway
- HUNT Research Center, Department of Public Health and Nursing, Faculty of Medicine and Health Sciences, Norwegian University of Science and Technology (NTNU), Trondheim, Norway
- Department of Research, Innovation and Education, St. Olavs Hospital, Trondheim University Hospital, Trondheim, Norway
| | - Shih-Jen Hwang
- Population Sciences Branch, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, USA
| | - Chii-Min Hwu
- Section of Endocrinology and Metabolism, Department of Medicine, Taipei Veterans General Hospital, Taipei, Taiwan
| | - William Iacono
- Department of Psychology, University of Minnesota, Minneapolis, MN, USA
| | - Marguerite R Irvin
- Department of Epidemiology, School of Public Health, University of Alabama at Birmingham, Birmingham, AL, USA
| | - Yon Ho Jee
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Eric O Johnson
- GenOmics, Bioinformatics, and Translational Research Center, RTI International, Research Triangle Park, NC, USA
- Fellow Program, RTI International, Research Triangle Park, NC, USA
| | - Yoonjung Y Joo
- Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
- Institute of Data Science, Korea University, Seoul, South Korea
| | | | - Anne E Justice
- Department of Population Health Sciences, Geisinger, Danville, PA, USA
- Department of Epidemiology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Yoichiro Kamatani
- Laboratory for Statistical and Translational Genetics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
- Laboratory of Complex Trait Genomics, Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Tokyo, Japan
| | - Robert C Kaplan
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
- Department of Epidemiology and Population Health, Albert Einstein College of Medicine, Bronx, NY, USA
| | - Jaakko Kaprio
- Institute for Molecular Medicine Finland - FIMM, University of Helsinki, Helsinki, Finland
| | - Sharon L R Kardia
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
| | - Matthew C Keller
- Institute for Behavioral Genetics, University of Colorado Boulder, Boulder, CO, USA
- Department Of Psychology and Neuroscience, University of Colorado Boulder, Boulder, CO, USA
| | - Tanika N Kelly
- Department of Epidemiology, Tulane University, New Orleans, LA, USA
- Translational Sciences Institute, Tulane University, New Orleans, LA, USA
| | - Charles Kooperberg
- Department of Biostatistics, University of Washington, Seattle, WA, USA
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Tellervo Korhonen
- Institute for Molecular Medicine Finland - FIMM, University of Helsinki, Helsinki, Finland
| | - Peter Kraft
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Program in Genetic Epidemiology and Statistical Genetics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Kenneth Krauter
- Department of Molecular, Cellular and Developmental Biology, University of Colorado, Boulder, CO, USA
| | - Johanna Kuusisto
- Institute of Clinical Medicine, Internal Medicine, University of Eastern Finland and Kuopio University Hospital, Kuopio, Finland
- Center for Medicine and Clinical Research, Kuopio University Hospital, Kuopio, Finland
| | - Markku Laakso
- Institute of Clinical Medicine, Internal Medicine, University of Eastern Finland and Kuopio University Hospital, Kuopio, Finland
| | - Jessica Lasky-Su
- Brigham and Women's Hospital, Department of Medicine, Channing Division of Network Medicine, Boston, MA, USA
| | - Wen-Jane Lee
- Department of Medical Research, Taichung Veterans General Hospital, Taichung City, Taiwan
| | - James J Lee
- Department of Psychology, University of Minnesota, Minneapolis, MN, USA
| | - Daniel Levy
- Population Sciences Branch, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, USA
| | - Liming Li
- Department of Epidemiology and Biostatistics, School of Public Health, Peking University Health Science Center, Beijing, China
| | - Kevin Li
- Center for Statistical Genetics, Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA
| | - Yuqing Li
- Department of Epidemiology & Biostatistics, University of California, San Francisco, CA, USA
| | - Kuang Lin
- Clinical Trial Service Unit and Epidemiological Studies Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Penelope A Lind
- Psychiatric Genetics, QIMR Berghofer Medical Research Institute, Brisbane, Australia
- School of Biomedical Sciences, Faculty of Medicine, University of Queensland, Brisbane, Australia
- School of Biomedical Sciences, Queensland University of Technology, Brisbane, Australia
| | - Chunyu Liu
- Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA
| | - Donald M Lloyd-Jones
- Departments of Preventive Medicine, Medicine, and Pediatrics, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Sharon M Lutz
- Department of Population Medicine, Harvard Pilgrim Health Care Institute, Boston, MA, USA
- Department of Biostatics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Jiantao Ma
- Population Sciences Branch, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, USA
- Division of Nutrition Epidemiology and Data Science, Friedman School of Nutrition Science and Policy, Tufts University, Boston, MA, USA
| | - Reedik Mägi
- Department of Integrative Physiology, University of Colorado Boulder, Boulder, CO, USA
| | - Ani Manichaikul
- Center for Public Health Genomics, Department of Public Health Sciences, University of Virginia School of Medicine, Charlottesville, VA, USA
| | - Nicholas G Martin
- Genetic Epidemiology, QIMR Berghofer Medical Research Institute, Brisbane, Australia
| | - Ravi Mathur
- GenOmics, Bioinformatics, and Translational Research Center, RTI International, Research Triangle Park, NC, USA
| | - Nana Matoba
- Laboratory for Statistical and Translational Genetics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
- Department of Genetics, UNC Neuroscience Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Patrick F McArdle
- Division of Endocrinology, Diabetes and Nutrition, Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Matt McGue
- Department of Psychology, University of Minnesota, Minneapolis, MN, USA
| | - Matthew B McQueen
- Department of Integrative Physiology, University of Colorado, Boulder, CO, USA
| | - Sarah E Medland
- Psychiatric Genetics, QIMR Berghofer Medical Research Institute, Brisbane, Australia
| | | | | | - Iona Y Millwood
- Clinical Trial Service Unit and Epidemiological Studies Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
- MRC Population Health Research Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Braxton D Mitchell
- Division of Endocrinology, Diabetes and Nutrition, Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
- Geriatrics Research and Education Clinical Center, Baltimore Veterans Administration Medical Center, Baltimore, MD, USA
| | - Karen L Mohlke
- Department of Genetics, University of North Carolina, Chapel Hill, NC, USA
| | - Matthew Moll
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA
- Division of Pulmonary and Critical Care Medicine, Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA
| | - May E Montasser
- Division of Endocrinology, Diabetes and Nutrition, Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Alanna C Morrison
- Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Antonella Mulas
- Istituto di Ricerca Genetica e Biomedica, Consiglio Nazionale delle Ricerche (CNR), Monserrato, Italy
| | - Jonas B Nielsen
- K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, NTNU, Norwegian University of Science and Technology, Trondheim, Norway
- Department of Internal Medicine, Division of Cardiovascular Medicine, University of Michigan, Ann Arbor, MI, USA
| | - Kari E North
- Department of Epidemiology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Elizabeth C Oelsner
- Department of Medicine, Columbia University Medical Center, New York, NY, USA
| | - Yukinori Okada
- Laboratory for Systems Genetics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
- Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita, Japan
- Laboratory of Statistical Immunology, Immunology Frontier Research Center (WPI-IFReC), Osaka University, Suita, Japan
- Department of Genome Informatics, Graduate School of Medicine, the University of Tokyo, Tokyo, Japan
| | - Valeria Orrù
- Istituto di Ricerca Genetica e Biomedica, Consiglio Nazionale delle Ricerche (CNR), Monserrato, Italy
| | - Nicholette D Palmer
- Department of Biochemistry, Wake Forest School of Medicine, Winston-Salem, NC, USA
| | - Teemu Palviainen
- Institute for Molecular Medicine Finland - FIMM, University of Helsinki, Helsinki, Finland
| | - Anita Pandit
- Center for Statistical Genetics, Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA
| | - S Lani Park
- Population Sciences of the Pacific Program, University of Hawaii Cancer Center, Honolulu, HI, USA
| | - Ulrike Peters
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
- Department of Epidemiology, University of Washington, Seattle, WA, USA
| | - Annette Peters
- Institute of Epidemiology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
- Institute for Medical Information Processing, Biometry and Epidemiology, Ludwig Maximilians University Munich, Munich, Germany
- German Centre for Cardiovascular Research, DZHK, Partner Site Munich, Munich, Germany
| | - Patricia A Peyser
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
| | - Tinca J C Polderman
- Department of Clinical Developmental Psychology, Vrije Universiteit, Amsterdam, The Netherlands
- Department of Child and Adolescent Psychiatry, Amsterdam UMC, Amsterdam, The Netherlands
| | - Nicholas Rafaels
- Division of Biomedical Informatics & Personalized Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Susan Redline
- Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA, USA
- Division of Sleep Medicine, Harvard Medical School, Boston, MA, USA
- Division of Pulmonary, Critical Care, and Sleep Medicine, Beth Israel Deaconess Medical Center, Boston, MA, USA
| | - Robert M Reed
- Division of Pulmonary and Critical Care Medicine, Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Alex P Reiner
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
- Department of Epidemiology, University of Washington, Seattle, WA, USA
| | - John P Rice
- Department of Psychiatry, Washington University School of Medicine, St. Louis, MO, USA
| | - Stephen S Rich
- Center for Public Health Genomics, Department of Public Health Sciences, University of Virginia School of Medicine, Charlottesville, VA, USA
| | - Nicole E Richmond
- Department of Epidemiology, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Carol Roan
- Center for Demography of Health and Aging, University of Wisconsin-Madison, Madison, WI, USA
| | - Jerome I Rotter
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
| | - Michael N Rueschman
- Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA, USA
| | | | - Nancy L Saccone
- Division of Biostatistics, Washington University School of Medicine, St. Louis, MO, USA
- Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA
| | - David A Schwartz
- Division of Pulmonary Sciences and Critical Care Medicine; Department of Medicine and Immunology, University of Colorado, Aurora, CO, USA
| | - Aladdin H Shadyab
- Herbert Wertheim School of Public Health and Human Longevity Science, University of California, San Diego, La Jolla, CA, USA
| | | | | | - Kamil Sicinski
- Center for Demography of Health and Aging, University of Wisconsin-Madison, Madison, WI, USA
| | - Anne Heidi Skogholt
- K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, NTNU, Norwegian University of Science and Technology, Trondheim, Norway
| | - Jennifer A Smith
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
- Survey Research Center, Institute for Social Research, University of Michigan, Ann Arbor, MI, USA
| | - Nicholas L Smith
- Department of Epidemiology, University of Washington, Seattle, WA, USA
- Kaiser Permanente Washington Health Research Institute, Kaiser Permanente Washington, Seattle, WA, USA
- Seattle Epidemiologic Research and Information Center, Department of Veterans Affairs Office of Research and Development, Seattle, WA, USA
| | - Nona Sotoodehnia
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
- Division of Cardiology, Department of Medicine, University of Washington, Seattle, WA, USA
| | - Michael C Stallings
- Institute for Behavioral Genetics, University of Colorado Boulder, Boulder, CO, USA
- Department Of Psychology and Neuroscience, University of Colorado Boulder, Boulder, CO, USA
| | | | - Kari Stefansson
- deCODE Genetics/Amgen, Inc., Reykjavik, Iceland
- Faculty of Medicine, University of Iceland, Reykjavik, Iceland
| | - Jerry A Stitzel
- Institute for Behavioral Genetics, University of Colorado Boulder, Boulder, CO, USA
| | - Xiao Sun
- Department of Epidemiology, Tulane University, New Orleans, LA, USA
| | - Moin Syed
- Department of Psychology, University of Minnesota, Minneapolis, MN, USA
| | | | - Amy E Taylor
- MRC Integrative Epidemiology Unit, Population Health Sciences, University of Bristol, Bristol, UK
- National Institute for Health Research Biomedical Research Centre at the University Hospitals Bristol NHS Foundation Trust and the University of Bristol, Bristol, UK
- Department of Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| | - Kent D Taylor
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
| | - Marilyn J Telen
- Department of Medicine and Duke Comprehensive Sickle Cell Center, Duke University School of Medicine, Durham, NC, USA
| | - Khanh K Thai
- Kaiser Permanente Northern California (KPNC), Division of Research, Oakland, CA, USA
| | - Hemant Tiwari
- Department of Biostatistics, School of Public Health, University of Alabama at Birmingham, Birmingham, AL, USA
| | - Constance Turman
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Program in Genetic Epidemiology and Statistical Genetics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | | | - Tamara L Wall
- Department of Psychiatry, University of California San Diego, San Diego, CA, USA
| | - Robin G Walters
- Clinical Trial Service Unit and Epidemiological Studies Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
- MRC Population Health Research Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - David R Weir
- Survey Research Center, Institute for Social Research, University of Michigan, Ann Arbor, MI, USA
| | - Scott T Weiss
- Brigham and Women's Hospital, Department of Medicine, Channing Division of Network Medicine, Boston, MA, USA
| | - Wendy B White
- Jackson Heart Study Undergraduate Training and Education Center, Tougaloo College, Tougaloo, MS, USA
| | - John B Whitfield
- Genetic Epidemiology, QIMR Berghofer Medical Research Institute, Brisbane, Australia
| | - Kerri L Wiggins
- Department of Medicine, University of Washington, Seattle, WA, USA
| | - Gonneke Willemsen
- Netherlands Twin Register, Dept Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
| | - Cristen J Willer
- Department of Internal Medicine, Division of Cardiology, University of Michigan, Ann Arbor, MI, USA
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Bendik S Winsvold
- K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, NTNU, Norwegian University of Science and Technology, Trondheim, Norway
- Department of Research and Innovation, Division of Clinical Neuroscience, Oslo University Hospital, Oslo, Norway
- Department of Neurology, Oslo University Hospital, Oslo, Norway
| | - Huichun Xu
- Division of Endocrinology, Diabetes and Nutrition, Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Lisa R Yanek
- Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Jie Yin
- Kaiser Permanente Northern California (KPNC), Division of Research, Oakland, CA, USA
| | - Kristin L Young
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Kendra A Young
- Department of Epidemiology, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Bing Yu
- Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Wei Zhao
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
| | - Wei Zhou
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Sebastian Zöllner
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA
- Department of Psychiatry, University of Michigan, Ann Arbor, MI, USA
| | - Luisa Zuccolo
- MRC Integrative Epidemiology Unit, Population Health Sciences, University of Bristol, Bristol, UK
- Department of Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
- Health Data Science Centre, Fondazione Human Technopole, Milan, Italy
| | - Chiara Batini
- Department of Population Health Sciences, University of Leicester, Leicester, UK
| | - Andrew W Bergen
- Oregon Research Institute, Springfield, OR, USA
- BioRealm, LLC, Walnut, CA, USA
| | - Laura J Bierut
- Department of Psychiatry, Washington University School of Medicine, St. Louis, MO, USA
| | - Sean P David
- Outcomes Research Network & Department of Family Medicine, NorthShore University HealthSystem, Evanston, IL, USA
- Department of Family Medicine, University of Chicago, Chicago, IL, USA
| | - Sarah A Gagliano Taliun
- Department of Medicine, Université de Montréal, Montréal, Québec, Canada
- Department of Neurosciences, Université de Montréal, Montréal, Québec, Canada
- Research Centre, Montréal Heart Institute, Montréal, Québec, Canada
| | - Dana B Hancock
- GenOmics, Bioinformatics, and Translational Research Center, RTI International, Research Triangle Park, NC, USA
| | - Bibo Jiang
- Department of Public Health Sciences, Penn State College of Medicine, Hershey, PA, USA
| | - Marcus R Munafò
- MRC Integrative Epidemiology Unit, Population Health Sciences, University of Bristol, Bristol, UK
- National Institute for Health Research Biomedical Research Centre at the University Hospitals Bristol NHS Foundation Trust and the University of Bristol, Bristol, UK
- School of Psychological Science, University of Bristol, Bristol, UK
| | | | - Dajiang J Liu
- Department of Public Health Sciences, Penn State College of Medicine, Hershey, PA, USA.
| | - Scott Vrieze
- Department of Psychology, University of Minnesota, Minneapolis, MN, USA.
| |
Collapse
|
9
|
Aborageh M, Krawitz P, Fröhlich H. Genetics in parkinson's disease: From better disease understanding to machine learning based precision medicine. FRONTIERS IN MOLECULAR MEDICINE 2022; 2:933383. [PMID: 39086979 PMCID: PMC11285583 DOI: 10.3389/fmmed.2022.933383] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/30/2022] [Accepted: 08/30/2022] [Indexed: 08/02/2024]
Abstract
Parkinson's Disease (PD) is a neurodegenerative disorder with highly heterogeneous phenotypes. Accordingly, it has been challenging to robustly identify genetic factors associated with disease risk, prognosis and therapy response via genome-wide association studies (GWAS). In this review we first provide an overview of existing statistical methods to detect associations between genetic variants and the disease phenotypes in existing PD GWAS. Secondly, we discuss the potential of machine learning approaches to better quantify disease phenotypes and to move beyond disease understanding towards a better-personalized treatment of the disease.
Collapse
Affiliation(s)
- Mohamed Aborageh
- Bonn-Aachen International Center for Information Technology (B-IT), Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn, Germany
| | - Peter Krawitz
- Institute for Genomic Statistics and Bioinformatics, University Hospital Bonn, Bonn, Germany
| | - Holger Fröhlich
- Bonn-Aachen International Center for Information Technology (B-IT), Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn, Germany
- Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Sankt Augustin, Germany
| |
Collapse
|
10
|
Wang R, Lin DY, Jiang Y. EPIC: Inferring relevant cell types for complex traits by integrating genome-wide association studies and single-cell RNA sequencing. PLoS Genet 2022; 18:e1010251. [PMID: 35709291 PMCID: PMC9242467 DOI: 10.1371/journal.pgen.1010251] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2021] [Revised: 06/29/2022] [Accepted: 05/12/2022] [Indexed: 11/18/2022] Open
Abstract
More than a decade of genome-wide association studies (GWASs) have identified genetic risk variants that are significantly associated with complex traits. Emerging evidence suggests that the function of trait-associated variants likely acts in a tissue- or cell-type-specific fashion. Yet, it remains challenging to prioritize trait-relevant tissues or cell types to elucidate disease etiology. Here, we present EPIC (cEll tyPe enrIChment), a statistical framework that relates large-scale GWAS summary statistics to cell-type-specific gene expression measurements from single-cell RNA sequencing (scRNA-seq). We derive powerful gene-level test statistics for common and rare variants, separately and jointly, and adopt generalized least squares to prioritize trait-relevant cell types while accounting for the correlation structures both within and between genes. Using enrichment of loci associated with four lipid traits in the liver and enrichment of loci associated with three neurological disorders in the brain as ground truths, we show that EPIC outperforms existing methods. We apply our framework to multiple scRNA-seq datasets from different platforms and identify cell types underlying type 2 diabetes and schizophrenia. The enrichment is replicated using independent GWAS and scRNA-seq datasets and further validated using PubMed search and existing bulk case-control testing results.
Collapse
Affiliation(s)
- Rujin Wang
- Department of Biostatistics, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, North Carolina, United States of America
| | - Dan-Yu Lin
- Department of Biostatistics, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, North Carolina, United States of America
- Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, North Carolina, United States of America
- * E-mail: (D-YL); (YJ)
| | - Yuchao Jiang
- Department of Biostatistics, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, North Carolina, United States of America
- Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, North Carolina, United States of America
- Department of Genetics, School of Medicine, University of North Carolina, Chapel Hill, North Carolina, United States of America
- * E-mail: (D-YL); (YJ)
| |
Collapse
|
11
|
Lee JY, Shen PS, Cheng KF. A robust association test with multiple genetic variants and covariates. Stat Appl Genet Mol Biol 2022; 21:sagmb-2021-0029. [DOI: 10.1515/sagmb-2021-0029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2021] [Accepted: 05/20/2022] [Indexed: 11/15/2022]
Abstract
Abstract
Due to the advancement of genome sequencing techniques, a great stride has been made in exome sequencing such that the association study between disease and genetic variants has become feasible. Some powerful and well-known association tests have been proposed to test the association between a group of genes and the disease of interest. However, some challenges still remain, in particular, many factors can affect the performance of testing power, e.g., the sample size, the number of causal and non-causal variants, and direction of the effect of causal variants. Recently, a powerful test, called T
REM
, is derived based on a random effects model. T
REM
has the advantages of being less sensitive to the inclusion of non-causal rare variants or low effect common variants or the presence of missing genotypes. However, the testing power of T
REM
can be low when a portion of causal variants has effects in opposite directions. To improve the drawback of T
REM
, we propose a novel test, called T
ROB
, which keeps the advantages of T
REM
and is more robust than T
REM
in terms of having adequate power in the case of variants with opposite directions of effect. Simulation results show that T
ROB
has a stable type I error rate and outperforms T
REM
when the proportion of risk variants decreases to a certain level and its advantage over T
REM
increases as the proportion decreases. Furthermore, T
ROB
outperforms several other competing tests in most scenarios. The proposed methodology is illustrated using the Shanghai Breast Cancer Study.
Collapse
Affiliation(s)
- Jen-Yu Lee
- Department of Statistics , Feng Chia University , Taichung , Taiwan, ROC
| | - Pao-Sheng Shen
- Department of Statistics , Tunghai University , Taichung , Taiwan, ROC
| | - Kuang-Fu Cheng
- Biostatistics Center , Taipei Medical University , Taipei , Taiwan, ROC
- Department of Business Administration , Asia University , Taichung , Taiwan, ROC
| |
Collapse
|
12
|
Li B, Jin B, Capra JA, Bush WS. Integration of Protein Structure and Population-Scale DNA Sequence Data for Disease Gene Discovery and Variant Interpretation. Annu Rev Biomed Data Sci 2022; 5:141-161. [PMID: 35508071 DOI: 10.1146/annurev-biodatasci-122220-112147] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The experimental and computational techniques for capturing information about protein structures and genetic variation within the human genome have advanced dramatically in the past 20 years, generating extensive new data resources. In this review, we discuss these advances, along with new approaches for determining the impact a genetic variant has on protein function. We focus on the potential of new methods that integrate human genetic variation into protein structures to discover relationships to disease, including the discovery of mutational hotspots in cancer-related proteins, the localization of protein-altering variants within protein regions for common complex diseases, and the assessment of variants of unknown significance for Mendelian traits. We expect that approaches that integrate these data sources will play increasingly important roles in disease gene discovery and variant interpretation. Expected final online publication date for the Annual Review of Biomedical Data Science, Volume 5 is August 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
Collapse
Affiliation(s)
- Bian Li
- Department of Biological Sciences and Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, USA
| | - Bowen Jin
- Graduate Program in Systems Biology and Bioinformatics, Department of Nutrition, School of Medicine, Case Western Reserve University, Cleveland, Ohio, USA
| | - John A Capra
- Bakar Computational Health Sciences Institute and Department of Epidemiology and Biostatistics, University of California, San Francisco, California, USA;
| | - William S Bush
- Cleveland Institute for Computational Biology, Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, Ohio, USA;
| |
Collapse
|
13
|
Misawa K. Genotype Value Decomposition: Simple Methods for the Computation of Kernel Statistics. ADVANCED GENETICS (HOBOKEN, N.J.) 2022; 3:2100066. [PMID: 36620199 PMCID: PMC9744480 DOI: 10.1002/ggn2.202100066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/01/2021] [Indexed: 01/11/2023]
Abstract
Recent advances in sequencing technologies enable genome-wide analyses for thousands of individuals. The sequential kernel association test (SKAT) is a widely used method to test for associations between a phenotype and a set of rare variants. As the sample size of human genetics studies increases, the computational time required to calculate a kernel is becoming more and more problematic. In this study, a new method to obtain kernel statistics without calculating a kernel matrix is proposed. A simple method for the computation of two kernel statistics, namely, a kernel statistic based on a genetic relationship matrix (GRM) and one based on an identity by state (IBS) matrix, are proposed. By using this method, calculation of the kernel statistics can be conducted using vector calculation without matrix calculation. The proposed method enables one to conduct SKAT for large samples of human genetics.
Collapse
Affiliation(s)
- Kazuharu Misawa
- Department of Human GeneticsYokohama City University Graduate School of Medicine3‐9 Fukuura, Kanazawa‐kuYokohama236‐0004Japan
| |
Collapse
|
14
|
Li S, Li S, Su S, Zhang H, Shen J, Wen Y. Gene Region Association Analysis of Longitudinal Quantitative Traits Based on a Function-On-Function Regression Model. Front Genet 2022; 13:781740. [PMID: 35265102 PMCID: PMC8899465 DOI: 10.3389/fgene.2022.781740] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2021] [Accepted: 01/04/2022] [Indexed: 11/13/2022] Open
Abstract
In the process of growth and development in life, gene expressions that control quantitative traits will turn on or off with time. Studies of longitudinal traits are of great significance in revealing the genetic mechanism of biological development. With the development of ultra-high-density sequencing technology, the associated analysis has tremendous challenges to statistical methods. In this paper, a longitudinal functional data association test (LFDAT) method is proposed based on the function-on-function regression model. LFDAT can simultaneously treat phenotypic traits and marker information as continuum variables and analyze the association of longitudinal quantitative traits and gene regions. Simulation studies showed that: 1) LFDAT performs well for both linkage equilibrium simulation and linkage disequilibrium simulation, 2) LFDAT has better performance for gene regions (include common variants, low-frequency variants, rare variants and mixture), and 3) LFDAT can accurately identify gene switching in the growth and development stage. The longitudinal data of the Oryza sativa projected shoot area is analyzed by LFDAT. It showed that there is the advantage of quick calculations. Further, an association analysis was conducted between longitudinal traits and gene regions by integrating the micro effects of multiple related variants and using the information of the entire gene region. LFDAT provides a feasible method for studying the formation and expression of longitudinal traits.
Collapse
Affiliation(s)
- Shijing Li
- College of Computer and Information Science, Fujian Agriculture and Forestry University, Fuzhou, China.,> Institute of Statistics and Application, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Shiqin Li
- School of Life Science and Technology, ShanghaiTech University, Shanghai, China
| | - Shaoqiang Su
- College of Computer and Information Science, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Hui Zhang
- College of Computer and Information Science, Fujian Agriculture and Forestry University, Fuzhou, China.,> Institute of Statistics and Application, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Jiayu Shen
- College of Computer and Information Science, Fujian Agriculture and Forestry University, Fuzhou, China.,> Institute of Statistics and Application, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Yongxian Wen
- College of Computer and Information Science, Fujian Agriculture and Forestry University, Fuzhou, China.,> Institute of Statistics and Application, Fujian Agriculture and Forestry University, Fuzhou, China
| |
Collapse
|
15
|
Clarelli F, Barizzone N, Mangano E, Zuccalà M, Basagni C, Anand S, Sorosina M, Mascia E, Santoro S, Guerini FR, Virgilio E, Gallo A, Pizzino A, Comi C, Martinelli V, Comi G, De Bellis G, Leone M, Filippi M, Esposito F, Bordoni R, Martinelli Boneschi F, D'Alfonso S. Contribution of Rare and Low-Frequency Variants to Multiple Sclerosis Susceptibility in the Italian Continental Population. Front Genet 2022; 12:800262. [PMID: 35047017 PMCID: PMC8762330 DOI: 10.3389/fgene.2021.800262] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2021] [Accepted: 11/17/2021] [Indexed: 12/15/2022] Open
Abstract
Genome-wide association studies identified over 200 risk loci for multiple sclerosis (MS) focusing on common variants, which account for about 50% of disease heritability. The goal of this study was to investigate whether low-frequency and rare functional variants, located in MS-established associated loci, may contribute to disease risk in a relatively homogeneous population, testing their cumulative effect (burden) with gene-wise tests. We sequenced 98 genes in 588 Italian patients with MS and 408 matched healthy controls (HCs). Variants were selected using different filtering criteria based on allelic frequency and in silico functional impacts. Genes showing a significant burden (n = 17) were sequenced in an independent cohort of 504 MS and 504 HC. The highest signal in both cohorts was observed for the disruptive variants (stop-gain, stop-loss, or splicing variants) located in EFCAB13, a gene coding for a protein of an unknown function (p < 10-4). Among these variants, the minor allele of a stop-gain variant showed a significantly higher frequency in MS versus HC in both sequenced cohorts (p = 0.0093 and p = 0.025), confirmed by a meta-analysis on a third independent cohort of 1298 MS and 1430 HC (p = 0.001) assayed with an SNP array. Real-time PCR on 14 heterozygous individuals for this variant did not evidence the presence of the stop-gain allele, suggesting a transcript degradation by non-sense mediated decay, supported by the evidence that the carriers of the stop-gain variant had a lower expression of this gene (p = 0.0184). In conclusion, we identified a novel low-frequency functional variant associated with MS susceptibility, suggesting the possible role of rare/low-frequency variants in MS as reported for other complex diseases.
Collapse
Affiliation(s)
- Ferdinando Clarelli
- Laboratory of Human Genetics of Neurological Disorders, Division of Neuroscience, IRCCS San Raffaele Scientific Institute, Milan, Italy
| | - Nadia Barizzone
- Department of Health Sciences, UPO, University of Eastern Piedmont, and CAAD (Center for Translational Research on Autoimmune and Allergic Disease), Novara, Italy
| | - Eleonora Mangano
- Institute for Biomedical Technologies, National Research Council of Italy, Segrate, Italy
| | - Miriam Zuccalà
- Department of Health Sciences, UPO, University of Eastern Piedmont, and CAAD (Center for Translational Research on Autoimmune and Allergic Disease), Novara, Italy
| | - Chiara Basagni
- Department of Health Sciences, UPO, University of Eastern Piedmont, and CAAD (Center for Translational Research on Autoimmune and Allergic Disease), Novara, Italy
| | - Santosh Anand
- Department of Informatics, Systems and Communications (DISCo), University of Milano-Bicocca, Milan, Italy
| | - Melissa Sorosina
- Laboratory of Human Genetics of Neurological Disorders, Division of Neuroscience, IRCCS San Raffaele Scientific Institute, Milan, Italy
| | - Elisabetta Mascia
- Laboratory of Human Genetics of Neurological Disorders, Division of Neuroscience, IRCCS San Raffaele Scientific Institute, Milan, Italy
| | - Silvia Santoro
- Laboratory of Human Genetics of Neurological Disorders, Division of Neuroscience, IRCCS San Raffaele Scientific Institute, Milan, Italy
| | | | | | | | - Eleonora Virgilio
- Department of Translational Medicine, Section of Neurology and IRCAD, UNIUPO, Novara, Italy
| | - Antonio Gallo
- MS Center, I Division of Neurology, Department of Advanced Medical and Surgical Sciences (DAMSS), University of Campania "Luigi Vanvitelli", Naples, Italy
| | - Alessandro Pizzino
- Department of Health Sciences, UPO, University of Eastern Piedmont, and CAAD (Center for Translational Research on Autoimmune and Allergic Disease), Novara, Italy
| | - Cristoforo Comi
- Department of Translational Medicine, Section of Neurology and IRCAD, UNIUPO, Novara, Italy
| | - Vittorio Martinelli
- Neurology Unit and Neurorehabilitation Unit, IRCCS San Raffaele Scientific Institute, Milan, Italy
| | | | - Gianluca De Bellis
- Institute for Biomedical Technologies, National Research Council of Italy, Segrate, Italy
| | - Maurizio Leone
- Neurology Unit, Fondazione IRCCS Casa Sollievo Della Sofferenza, San Giovanni Rotondo, Italy
| | - Massimo Filippi
- Neurology Unit and Neurorehabilitation Unit, IRCCS San Raffaele Scientific Institute, Milan, Italy.,Vita-Salute San Raffaele University, Milan, Italy.,Neuroimaging Research Unit, Division of Neuroscience, IRCCS San Raffaele Scientific Institute, Milan, Italy.,Neurophysiology Service, IRCCS San Raffaele Scientific Institute, Milan, Italy
| | - Federica Esposito
- Laboratory of Human Genetics of Neurological Disorders, Division of Neuroscience, IRCCS San Raffaele Scientific Institute, Milan, Italy.,Neurology Unit and Neurorehabilitation Unit, IRCCS San Raffaele Scientific Institute, Milan, Italy
| | - Roberta Bordoni
- Institute for Biomedical Technologies, National Research Council of Italy, Segrate, Italy
| | - Filippo Martinelli Boneschi
- Department of Pathophysiology and Transplantation (DEPT), Dino Ferrari Centre, Neuroscience Section, University of Milan, Milan, Italy.,Neurology Unit, MS Centre, Foundation IRCCS Ca' Granda Ospedale Maggiore Policlinico, Milan, Italy
| | - Sandra D'Alfonso
- Department of Health Sciences, UPO, University of Eastern Piedmont, and CAAD (Center for Translational Research on Autoimmune and Allergic Disease), Novara, Italy
| |
Collapse
|
16
|
Mascia E, Clarelli F, Zauli A, Guaschino C, Sorosina M, Barizzone N, Basagni C, Santoro S, Ferrè L, Bonfiglio S, Biancolini D, Pozzato M, Guerini FR, Protti A, Liguori M, Moiola L, Vecchio D, Bresolin N, Comi G, Filippi M, Esposito F, D'Alfonso S, Martinelli-Boneschi F. Burden of rare coding variants in an Italian cohort of familial multiple sclerosis. J Neuroimmunol 2022; 362:577760. [PMID: 34922125 DOI: 10.1016/j.jneuroim.2021.577760] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2021] [Revised: 10/18/2021] [Accepted: 10/31/2021] [Indexed: 12/30/2022]
Abstract
BACKGROUND Multiple Sclerosis (MS) is a chronic inflammatory and neurodegenerative demyelinating disease of the central nervous system. It is a complex and heterogeneous disease caused by a combination of genetic and environmental factors, and it can cluster in families. OBJECTIVE to evaluate at gene-level the aggregate contribution of predicted damaging low-frequency and rare variants to MS risk in multiplex families. METHODS We performed whole exome sequencing (WES) in 28 multiplex MS families with at least 3 MS cases (81 affected and 42 unaffected relatives) and 38 unrelated healthy controls. A gene-based burden test was then performed, focusing on two sets of candidate genes: i) literature-driven selection and ii) data-driven selection. RESULTS We identified 11 genes enriched with predicted damaging low-frequency and rare variants in MS compared to healthy individuals. Among them, UBR2 and DST were the two genes with the strongest enrichment (p = 5 × 10-4 and 3 × 10-4, respectively); interestingly enough the association signal in UBR2 is driven by rs62414610, which was present in 25% of analysed families. CONCLUSION Despite limitations, this is one of the first studies evaluating the aggregate contribution of predicted damaging low-frequency and rare variants in MS families using WES data. A replication effort in independent cohorts is warranted to validate our findings and to evaluate the role of identified genes in MS pathogenesis.
Collapse
Affiliation(s)
- E Mascia
- Laboratory of Human Genetics of Neurological Disorders, Institute of Experimental Neurology (INSPE), Division of Neuroscience, IRCCS San Raffaele Scientific Institute, Via Olgettina 58, 20132 Milan, Italy
| | - F Clarelli
- Laboratory of Human Genetics of Neurological Disorders, Institute of Experimental Neurology (INSPE), Division of Neuroscience, IRCCS San Raffaele Scientific Institute, Via Olgettina 58, 20132 Milan, Italy
| | - A Zauli
- Laboratory of Human Genetics of Neurological Disorders, Institute of Experimental Neurology (INSPE), Division of Neuroscience, IRCCS San Raffaele Scientific Institute, Via Olgettina 58, 20132 Milan, Italy
| | - C Guaschino
- Laboratory of Human Genetics of Neurological Disorders, Institute of Experimental Neurology (INSPE), Division of Neuroscience, IRCCS San Raffaele Scientific Institute, Via Olgettina 58, 20132 Milan, Italy; Department of Neurology, Sant'Antonio Abate Hospital, Gallarate, Italy
| | - M Sorosina
- Laboratory of Human Genetics of Neurological Disorders, Institute of Experimental Neurology (INSPE), Division of Neuroscience, IRCCS San Raffaele Scientific Institute, Via Olgettina 58, 20132 Milan, Italy
| | - N Barizzone
- Department of Health Sciences, Center on Autoimmune and Allergic Diseases (CAAD), UPO, University of Eastern Piedmont, A. Avogadro, 28100 Novara, Italy
| | - C Basagni
- Department of Health Sciences, Center on Autoimmune and Allergic Diseases (CAAD), UPO, University of Eastern Piedmont, A. Avogadro, 28100 Novara, Italy
| | - S Santoro
- Laboratory of Human Genetics of Neurological Disorders, Institute of Experimental Neurology (INSPE), Division of Neuroscience, IRCCS San Raffaele Scientific Institute, Via Olgettina 58, 20132 Milan, Italy
| | - L Ferrè
- Laboratory of Human Genetics of Neurological Disorders, Institute of Experimental Neurology (INSPE), Division of Neuroscience, IRCCS San Raffaele Scientific Institute, Via Olgettina 58, 20132 Milan, Italy; Neurology Unit, IRCCS San Raffaele Scientific Institute, Via Olgettina 48, 20132 Milan, Italy
| | - S Bonfiglio
- Center for Omics Sciences, IRCCS San Raffaele Scientific Institute, Via Olgettina 58, 20132 Milan, Italy
| | - D Biancolini
- Center for Omics Sciences, IRCCS San Raffaele Scientific Institute, Via Olgettina 58, 20132 Milan, Italy
| | - M Pozzato
- Neurology Unit and MS Centre, Foundation IRCCS Ca' Granda Ospedale Maggiore Policlinico, Via Francesco Sforza 35, 20122 Milan, Italy
| | - F R Guerini
- IRCCS Fondazione Don Carlo Gnocchi, ONLUS, Milan, Italy
| | - A Protti
- Ospedale Niguarda, Department of Neurology, Milan, Italy
| | - M Liguori
- National Research Council, Institute of Biomedical Technologies, Bari Unit, 70126 Bari, Italy
| | - L Moiola
- Neurology Unit, IRCCS San Raffaele Scientific Institute, Via Olgettina 48, 20132 Milan, Italy
| | - D Vecchio
- SCDU Neurology, AOU Maggiore della Carità, 28100 Novara, Italy
| | - N Bresolin
- Department of Pathophysiology and Transplantation (DEPT), Dino Ferrari Centre, Neuroscience Section, University of Milan, Via Francesco Sforza 35, 20122 Milan, Italy
| | - G Comi
- Institute of Experimental Neurology, Division of Neuroscience, IRCCS San Raffaele Scientific Institute, Via Olgettina 58, 20132 Milan, Italy
| | - M Filippi
- Neurology Unit, IRCCS San Raffaele Scientific Institute, Via Olgettina 48, 20132 Milan, Italy; Vita-Salute San Raffaele University, Via Olgettina 58, 20132 Milan, Italy; Neuroimaging Research Unit, Division of Neuroscience, IRCCS San Raffaele Scientific Institute, Via Olgettina 48, 20132 Milan, Italy; Neurophysiology Unit, IRCCS San Raffaele Scientific Institute, San Raffaele Scientific Institute, Via Olgettina 48, 20132 Milan, Italy
| | - F Esposito
- Laboratory of Human Genetics of Neurological Disorders, Institute of Experimental Neurology (INSPE), Division of Neuroscience, IRCCS San Raffaele Scientific Institute, Via Olgettina 58, 20132 Milan, Italy; Neurology Unit, IRCCS San Raffaele Scientific Institute, Via Olgettina 48, 20132 Milan, Italy
| | - S D'Alfonso
- Department of Health Sciences, Center on Autoimmune and Allergic Diseases (CAAD), UPO, University of Eastern Piedmont, A. Avogadro, 28100 Novara, Italy
| | - F Martinelli-Boneschi
- Neurology Unit and MS Centre, Foundation IRCCS Ca' Granda Ospedale Maggiore Policlinico, Via Francesco Sforza 35, 20122 Milan, Italy; Department of Pathophysiology and Transplantation (DEPT), Dino Ferrari Centre, Neuroscience Section, University of Milan, Via Francesco Sforza 35, 20122 Milan, Italy.
| |
Collapse
|
17
|
Zheng R, Du M, Ge Y, Gao F, Xin J, Lv Q, Qin C, Zhu Y, Gu C, Wang M, Zhu Q, Guo Z, Ben S, Chu H, Ye D, Zhang Z, Wang M. Identification of low-frequency variants of UGT1A3 associated with bladder cancer risk by next-generation sequencing. Oncogene 2021; 40:2382-2394. [PMID: 33658628 DOI: 10.1038/s41388-021-01672-1] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2020] [Revised: 01/11/2021] [Accepted: 01/20/2021] [Indexed: 12/31/2022]
Abstract
Although genome-wide association studies (GWASs) have successfully revealed many common risk variants for bladder cancer, the heritability is still largely unexplained. We hypothesized that low-frequency variants involved in bladder cancer risk could reveal the unexplained heritability. Next-generation sequencing of 113 patients and 118 controls was conducted on 81 genes/regions of known bladder cancer GWAS loci. A two-stage validation comprising 3,350 cases and 4,005 controls was performed to evaluate the effects of low-frequency variants on bladder cancer risk. Biological experiments and techniques, including electrophoretic mobility shift assays, CRISPR/Cas9, RNA-Seq, and bioinformatics approaches, were performed to assess the potential functions of low-frequency variants. The low-frequency variant rs28898617 was located in the first exon of UGT1A3 and was significantly associated with increased bladder cancer risk (odds ratio = 1.50, P = 3.10 × 10-6). Intriguingly, rs28898617 was only observed in the Asian population, but monomorphism was observed in the European population. The risk-associated G allele of rs28898617 increased UGT1A3 expression, facilitated UGT1A3 transcriptional activity, and enhanced the binding activity. In addition, UGT1A3 deletion significantly inhibited the proliferation, invasion, and migration of bladder cancer cells and xenograft tumor growth. Mechanistically, UGT1A3 induced LAMC2 expression by binding CBP and promoting histone acetylation, which remarkably promoted the progression of bladder cancer. This is the first targeted sequencing study to reveal that the novel low-frequency variant rs28898617 and its associated gene UGT1A3 are involved in bladder cancer development, providing new insights into the genetic architecture of bladder cancer.
Collapse
Affiliation(s)
- Rui Zheng
- Department of Environmental Genomics, Jiangsu Key Laboratory of Cancer Biomarkers, Prevention and Treatment, Collaborative Innovation Center for Cancer Personalized Medicine, Nanjing Medical University, Nanjing, China.,Department of Genetic Toxicology, The Key Laboratory of Modern Toxicology of Ministry of Education, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, China
| | - Mulong Du
- Department of Environmental Genomics, Jiangsu Key Laboratory of Cancer Biomarkers, Prevention and Treatment, Collaborative Innovation Center for Cancer Personalized Medicine, Nanjing Medical University, Nanjing, China.,Department of Biostatistics, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, China
| | - Yuqiu Ge
- Department of Environmental Genomics, Jiangsu Key Laboratory of Cancer Biomarkers, Prevention and Treatment, Collaborative Innovation Center for Cancer Personalized Medicine, Nanjing Medical University, Nanjing, China.,Department of Genetic Toxicology, The Key Laboratory of Modern Toxicology of Ministry of Education, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, China
| | - Fang Gao
- Department of Genetic Toxicology, The Key Laboratory of Modern Toxicology of Ministry of Education, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, China.,Key Laboratory of Environmental Medicine Engineering, Ministry of Education of China, School of Public Health, Southeast University, Nanjing, China
| | - Junyi Xin
- Department of Environmental Genomics, Jiangsu Key Laboratory of Cancer Biomarkers, Prevention and Treatment, Collaborative Innovation Center for Cancer Personalized Medicine, Nanjing Medical University, Nanjing, China.,Department of Genetic Toxicology, The Key Laboratory of Modern Toxicology of Ministry of Education, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, China
| | - Qiang Lv
- Department of Urology, The First Affiliated Hospital of Nanjing Medical University, Nanjing, China
| | - Chao Qin
- Department of Urology, The First Affiliated Hospital of Nanjing Medical University, Nanjing, China
| | - Yao Zhu
- Department of Urology, Fudan University Shanghai Cancer Center, Shanghai, China
| | - Chengyuan Gu
- Department of Urology, Fudan University Shanghai Cancer Center, Shanghai, China
| | - Mengyun Wang
- Department of Urology, Fudan University Shanghai Cancer Center, Shanghai, China
| | - Qiuyuan Zhu
- Department of Environmental Genomics, Jiangsu Key Laboratory of Cancer Biomarkers, Prevention and Treatment, Collaborative Innovation Center for Cancer Personalized Medicine, Nanjing Medical University, Nanjing, China.,Department of Genetic Toxicology, The Key Laboratory of Modern Toxicology of Ministry of Education, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, China
| | - Zheng Guo
- Department of Environmental Genomics, Jiangsu Key Laboratory of Cancer Biomarkers, Prevention and Treatment, Collaborative Innovation Center for Cancer Personalized Medicine, Nanjing Medical University, Nanjing, China.,Department of Genetic Toxicology, The Key Laboratory of Modern Toxicology of Ministry of Education, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, China
| | - Shuai Ben
- Department of Environmental Genomics, Jiangsu Key Laboratory of Cancer Biomarkers, Prevention and Treatment, Collaborative Innovation Center for Cancer Personalized Medicine, Nanjing Medical University, Nanjing, China.,Department of Genetic Toxicology, The Key Laboratory of Modern Toxicology of Ministry of Education, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, China
| | - Haiyan Chu
- Department of Environmental Genomics, Jiangsu Key Laboratory of Cancer Biomarkers, Prevention and Treatment, Collaborative Innovation Center for Cancer Personalized Medicine, Nanjing Medical University, Nanjing, China.,Department of Genetic Toxicology, The Key Laboratory of Modern Toxicology of Ministry of Education, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, China
| | - Dingwei Ye
- Department of Urology, Fudan University Shanghai Cancer Center, Shanghai, China
| | - Zhengdong Zhang
- Department of Environmental Genomics, Jiangsu Key Laboratory of Cancer Biomarkers, Prevention and Treatment, Collaborative Innovation Center for Cancer Personalized Medicine, Nanjing Medical University, Nanjing, China. .,Department of Genetic Toxicology, The Key Laboratory of Modern Toxicology of Ministry of Education, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, China.
| | - Meilin Wang
- Department of Environmental Genomics, Jiangsu Key Laboratory of Cancer Biomarkers, Prevention and Treatment, Collaborative Innovation Center for Cancer Personalized Medicine, Nanjing Medical University, Nanjing, China. .,Department of Genetic Toxicology, The Key Laboratory of Modern Toxicology of Ministry of Education, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, China. .,The Affiliated Suzhou Hospital of Nanjing Medical University, Suzhou Municipal Hospital, Gusu School, Nanjing Medical University, Nanjing, China. .,Jiangsu Cancer Hospital, Jiangsu Institute of Cancer Research, The Affiliated Cancer Hospital of Nanjing Medical University, Nanjing, China.
| |
Collapse
|
18
|
Han C, Yu T, Qin W, Liao X, Huang J, Liu Z, Yu L, Liu X, Chen Z, Yang C, Wang X, Mo S, Zhu G, Su H, Li J, Qin X, Gui Y, Mo Z, Li L, Peng T. Genome-wide association study of the TP53 R249S mutation in hepatocellular carcinoma with aflatoxin B1 exposure and infection with hepatitis B virus. J Gastrointest Oncol 2020; 11:1333-1349. [PMID: 33457005 PMCID: PMC7807280 DOI: 10.21037/jgo-20-510] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/10/2020] [Accepted: 12/16/2020] [Indexed: 12/24/2022] Open
Abstract
BACKGROUND Exposure to dietary aflatoxin B1 (AFB1) induces DNA damage and mutation in the TP53 gene at codon 249, known as the TP53 R249S mutation, and is a major risk factor for hepatocellular carcinoma (HCC). AFB1 and the hepatitis B virus (HBV) together exert synergistic effects that promote carcinogenesis and TP53 R249S mutation in HCC. METHODS A genome-wide association study (GWAS) of whole genome exons was conducted using 485 HCC patients with chronic HBV infection. This was followed by an independent replication study conducted using 270 patients with chronic HBV infection. Immunohistochemistry was used to evaluate TP53 expression in all samples. This showed a correlation between codon 249 mutations and TP53 expression. Susceptibility variants for the TP53 R249S mutation in HCC were identified based on both the GWAS and replication study. The associations between identified variants and the expression levels of their located genes were analyzed in 20 paired independent samples. RESULTS The likelihood of positive TP53 expression was found to be higher in HCC patients with the R249S mutation both in the GWAS (P<0.001) and the replication study (P=0.006). The combined analyses showed that the TP53 R249S mutation was significantly associated with three single nucleotide polymorphisms (SNPs): ADAMTS18 rs9930984 (adjusted P=4.84×10-6), WDR49 rs75218075 (adjusted P=7.36×10-5), and SLC8A3 rs8022091 (adjusted P=0.042). The TP53 R249S mutation was found to be highly associated with the TT genotypes of rs9930984 (additive model, P=0.01; dominant model, P=6.43×10-5) and rs75218075 (additive model, P=0.002; dominant model, P=2.16×10-4). Additionally, ADAMTS18 mRNA expression was significantly higher in HCC tissue compared with its expression in paired non-tumor tissue (P=0.041), and patients carrying the TT genotype at rs9930984 showed lower ADAMTS18 expression in non-tumor tissue compared with patients carrying the GT genotype (P=0.0028). WDR49 expression was markedly lower in HCC tissue compared with paired non-tumor tissue (P=0.0011). CONCLUSIONS TP53 expression is significantly associated with the R249S mutation in HCC. Our collective results suggest that rs9930984, rs75218075, and rs8022091 are associated with R249S mutation susceptibility in HCC patients exposed to AFB1 and HBV infection.
Collapse
Affiliation(s)
- Chuangye Han
- Department of Hepatobiliary Surgery, the First Affiliated Hospital of Guangxi Medical University, Nanning, China
- School of Basic Medical Sciences, Guangxi Medical University, Nanning, China
| | - Tingdong Yu
- Department of Hepatobiliary Surgery, the First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Wei Qin
- Department of Hepatobiliary Surgery, the First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Xiwen Liao
- Department of Hepatobiliary Surgery, the First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Jianlu Huang
- Department of Hepatobiliary Surgery, the First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Zhengtao Liu
- Division of Hepatobiliary and Pancreatic Surgery, Department of Surgery, the First Affiliated Hospital of Zhejiang University School of Medicine, Hangzhou, China
| | - Long Yu
- Department of Hepatobiliary and Pancreatic Surgery, the First Affiliated Hospital of Zhengzhou University, Zhengzhou, China
| | - Xiaoguang Liu
- Department of Hepatobiliary Surgery, Affiliated Hospital of Guangdong Medical University, Zhanjiang, China
| | - Zhiwei Chen
- Department of General Surgery, Northern Jiangsu People’s Hospital, Yangzhou, China
| | - Chengkun Yang
- Department of Hepatobiliary Surgery, the First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Xiangkun Wang
- Department of Hepatobiliary Surgery, the First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Shutian Mo
- Department of Hepatobiliary Surgery, the First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Guangzhi Zhu
- Department of Hepatobiliary Surgery, the First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Hao Su
- Department of Hepatobiliary Surgery, the First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Jiaquan Li
- Medical Scientific Research Center, Guangxi Medical University, Nanning, China
| | - Xue Qin
- Department of Clinical Laboratory, the First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Ying Gui
- Department of Clinical Laboratory, the First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Zengnan Mo
- Center for Genomics and Personalized Medicine, Guangxi Medical University, Nanning, China
| | - Lequn Li
- Department of Hepatobiliary Surgery, Affiliated Tumor Hospital of Guangxi Medical University, Nanning, China
| | - Tao Peng
- Department of Hepatobiliary Surgery, the First Affiliated Hospital of Guangxi Medical University, Nanning, China
| |
Collapse
|
19
|
Grassano M, Calvo A, Moglia C, Brunetti M, Barberis M, Sbaiz L, Canosa A, Manera U, Vasta R, Corrado L, D'Alfonso S, Mazzini L, Scholz SW, Dalgard C, Ding J, Gibbs RJ, Chia R, Traynor BJ, Chiò A. Mutational Analysis of Known ALS Genes in an Italian Population-Based Cohort. Neurology 2020; 96:e600-e609. [PMID: 33208543 DOI: 10.1212/wnl.0000000000011209] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2020] [Accepted: 09/21/2020] [Indexed: 11/15/2022] Open
Abstract
OBJECTIVE To assess the burden of rare genetic variants and to estimate the contribution of known amyotrophic lateral sclerosis (ALS) genes in an Italian population-based cohort, we performed whole genome sequencing in 959 patients with ALS and 677 matched healthy controls. METHODS We performed genome sequencing in a population-based cohort (Piemonte and Valle d'Aosta Registry for ALS [PARALS]). A panel of 40 ALS genes was analyzed to identify potential disease-causing genetic variants and to evaluate the gene-wide burden of rare variants among our population. RESULTS A total of 959 patients with ALS were compared with 677 healthy controls from the same geographical area. Gene-wide association tests demonstrated a strong association with SOD1, whose rare variants are the second most common cause of disease after C9orf72 expansion. A lower signal was observed for TARDBP, proving that its effect on our cohort is driven by a few known causal variants. We detected rare variants in other known ALS genes that did not surpass statistical significance in gene-wise tests, thus highlighting that their contribution to disease risk in our cohort is limited. CONCLUSIONS We identified potential disease-causing variants in 11.9% of our patients. We identified the genes most frequently involved in our cohort and confirmed the contribution of rare variants in disease risk. Our results provide further insight into the pathologic mechanism of the disease and demonstrate the importance of genome-wide sequencing as a diagnostic tool.
Collapse
Affiliation(s)
- Maurizio Grassano
- From "Rita Levi Montalcini" Department of Neuroscience (M.G., A. Calvo, C.M., A. Canosa, U.M., R.V., A. Chiò), University of Turin, Italy; Biocomputational Group (J.D., R.J.G.) and Neuromuscular Diseases Research Section (M.G., R.C., B.J.T.), Laboratory of Neurogenetics, National Institute on Aging, NIH, Porter Neuroscience Research Center, Bethesda, MD; Laboratory of Genetics, Department of Clinical Pathology (M. Brunetti, M. Barberis, L.S.), Azienda Ospedaliero Universitaria Città della Salute e della Scienza di Torino, Turin; Department of Health Sciences Interdisciplinary Research Center of Autoimmune Diseases (L.C., S.D.), "Amedeo Avogadro" University of Eastern Piedmont; ALS Center (L.M.), Department of Neurology, Azienda Ospedaliera Universitaria Maggiore della Carità, Novara, Italy; Neurodegenerative Diseases Research Unit, Laboratory of Neurogenetics (S.W.S.), National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda; Department of Neurology (S.W.S., B.J.T.), Johns Hopkins University Medical Center; Department of Anatomy, Physiology & Genetics (C.D.), and The American Genome Center, Collaborative Health Initiative Research Program (C.D.), Uniformed Services University of the Health Sciences, Bethesda, MD; and Institute of Cognitive Sciences and Technologies (A. Chiò), National Council of Research, Rome, Italy.
| | - Andrea Calvo
- From "Rita Levi Montalcini" Department of Neuroscience (M.G., A. Calvo, C.M., A. Canosa, U.M., R.V., A. Chiò), University of Turin, Italy; Biocomputational Group (J.D., R.J.G.) and Neuromuscular Diseases Research Section (M.G., R.C., B.J.T.), Laboratory of Neurogenetics, National Institute on Aging, NIH, Porter Neuroscience Research Center, Bethesda, MD; Laboratory of Genetics, Department of Clinical Pathology (M. Brunetti, M. Barberis, L.S.), Azienda Ospedaliero Universitaria Città della Salute e della Scienza di Torino, Turin; Department of Health Sciences Interdisciplinary Research Center of Autoimmune Diseases (L.C., S.D.), "Amedeo Avogadro" University of Eastern Piedmont; ALS Center (L.M.), Department of Neurology, Azienda Ospedaliera Universitaria Maggiore della Carità, Novara, Italy; Neurodegenerative Diseases Research Unit, Laboratory of Neurogenetics (S.W.S.), National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda; Department of Neurology (S.W.S., B.J.T.), Johns Hopkins University Medical Center; Department of Anatomy, Physiology & Genetics (C.D.), and The American Genome Center, Collaborative Health Initiative Research Program (C.D.), Uniformed Services University of the Health Sciences, Bethesda, MD; and Institute of Cognitive Sciences and Technologies (A. Chiò), National Council of Research, Rome, Italy
| | - Cristina Moglia
- From "Rita Levi Montalcini" Department of Neuroscience (M.G., A. Calvo, C.M., A. Canosa, U.M., R.V., A. Chiò), University of Turin, Italy; Biocomputational Group (J.D., R.J.G.) and Neuromuscular Diseases Research Section (M.G., R.C., B.J.T.), Laboratory of Neurogenetics, National Institute on Aging, NIH, Porter Neuroscience Research Center, Bethesda, MD; Laboratory of Genetics, Department of Clinical Pathology (M. Brunetti, M. Barberis, L.S.), Azienda Ospedaliero Universitaria Città della Salute e della Scienza di Torino, Turin; Department of Health Sciences Interdisciplinary Research Center of Autoimmune Diseases (L.C., S.D.), "Amedeo Avogadro" University of Eastern Piedmont; ALS Center (L.M.), Department of Neurology, Azienda Ospedaliera Universitaria Maggiore della Carità, Novara, Italy; Neurodegenerative Diseases Research Unit, Laboratory of Neurogenetics (S.W.S.), National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda; Department of Neurology (S.W.S., B.J.T.), Johns Hopkins University Medical Center; Department of Anatomy, Physiology & Genetics (C.D.), and The American Genome Center, Collaborative Health Initiative Research Program (C.D.), Uniformed Services University of the Health Sciences, Bethesda, MD; and Institute of Cognitive Sciences and Technologies (A. Chiò), National Council of Research, Rome, Italy
| | - Maura Brunetti
- From "Rita Levi Montalcini" Department of Neuroscience (M.G., A. Calvo, C.M., A. Canosa, U.M., R.V., A. Chiò), University of Turin, Italy; Biocomputational Group (J.D., R.J.G.) and Neuromuscular Diseases Research Section (M.G., R.C., B.J.T.), Laboratory of Neurogenetics, National Institute on Aging, NIH, Porter Neuroscience Research Center, Bethesda, MD; Laboratory of Genetics, Department of Clinical Pathology (M. Brunetti, M. Barberis, L.S.), Azienda Ospedaliero Universitaria Città della Salute e della Scienza di Torino, Turin; Department of Health Sciences Interdisciplinary Research Center of Autoimmune Diseases (L.C., S.D.), "Amedeo Avogadro" University of Eastern Piedmont; ALS Center (L.M.), Department of Neurology, Azienda Ospedaliera Universitaria Maggiore della Carità, Novara, Italy; Neurodegenerative Diseases Research Unit, Laboratory of Neurogenetics (S.W.S.), National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda; Department of Neurology (S.W.S., B.J.T.), Johns Hopkins University Medical Center; Department of Anatomy, Physiology & Genetics (C.D.), and The American Genome Center, Collaborative Health Initiative Research Program (C.D.), Uniformed Services University of the Health Sciences, Bethesda, MD; and Institute of Cognitive Sciences and Technologies (A. Chiò), National Council of Research, Rome, Italy
| | - Marco Barberis
- From "Rita Levi Montalcini" Department of Neuroscience (M.G., A. Calvo, C.M., A. Canosa, U.M., R.V., A. Chiò), University of Turin, Italy; Biocomputational Group (J.D., R.J.G.) and Neuromuscular Diseases Research Section (M.G., R.C., B.J.T.), Laboratory of Neurogenetics, National Institute on Aging, NIH, Porter Neuroscience Research Center, Bethesda, MD; Laboratory of Genetics, Department of Clinical Pathology (M. Brunetti, M. Barberis, L.S.), Azienda Ospedaliero Universitaria Città della Salute e della Scienza di Torino, Turin; Department of Health Sciences Interdisciplinary Research Center of Autoimmune Diseases (L.C., S.D.), "Amedeo Avogadro" University of Eastern Piedmont; ALS Center (L.M.), Department of Neurology, Azienda Ospedaliera Universitaria Maggiore della Carità, Novara, Italy; Neurodegenerative Diseases Research Unit, Laboratory of Neurogenetics (S.W.S.), National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda; Department of Neurology (S.W.S., B.J.T.), Johns Hopkins University Medical Center; Department of Anatomy, Physiology & Genetics (C.D.), and The American Genome Center, Collaborative Health Initiative Research Program (C.D.), Uniformed Services University of the Health Sciences, Bethesda, MD; and Institute of Cognitive Sciences and Technologies (A. Chiò), National Council of Research, Rome, Italy
| | - Luca Sbaiz
- From "Rita Levi Montalcini" Department of Neuroscience (M.G., A. Calvo, C.M., A. Canosa, U.M., R.V., A. Chiò), University of Turin, Italy; Biocomputational Group (J.D., R.J.G.) and Neuromuscular Diseases Research Section (M.G., R.C., B.J.T.), Laboratory of Neurogenetics, National Institute on Aging, NIH, Porter Neuroscience Research Center, Bethesda, MD; Laboratory of Genetics, Department of Clinical Pathology (M. Brunetti, M. Barberis, L.S.), Azienda Ospedaliero Universitaria Città della Salute e della Scienza di Torino, Turin; Department of Health Sciences Interdisciplinary Research Center of Autoimmune Diseases (L.C., S.D.), "Amedeo Avogadro" University of Eastern Piedmont; ALS Center (L.M.), Department of Neurology, Azienda Ospedaliera Universitaria Maggiore della Carità, Novara, Italy; Neurodegenerative Diseases Research Unit, Laboratory of Neurogenetics (S.W.S.), National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda; Department of Neurology (S.W.S., B.J.T.), Johns Hopkins University Medical Center; Department of Anatomy, Physiology & Genetics (C.D.), and The American Genome Center, Collaborative Health Initiative Research Program (C.D.), Uniformed Services University of the Health Sciences, Bethesda, MD; and Institute of Cognitive Sciences and Technologies (A. Chiò), National Council of Research, Rome, Italy
| | - Antonio Canosa
- From "Rita Levi Montalcini" Department of Neuroscience (M.G., A. Calvo, C.M., A. Canosa, U.M., R.V., A. Chiò), University of Turin, Italy; Biocomputational Group (J.D., R.J.G.) and Neuromuscular Diseases Research Section (M.G., R.C., B.J.T.), Laboratory of Neurogenetics, National Institute on Aging, NIH, Porter Neuroscience Research Center, Bethesda, MD; Laboratory of Genetics, Department of Clinical Pathology (M. Brunetti, M. Barberis, L.S.), Azienda Ospedaliero Universitaria Città della Salute e della Scienza di Torino, Turin; Department of Health Sciences Interdisciplinary Research Center of Autoimmune Diseases (L.C., S.D.), "Amedeo Avogadro" University of Eastern Piedmont; ALS Center (L.M.), Department of Neurology, Azienda Ospedaliera Universitaria Maggiore della Carità, Novara, Italy; Neurodegenerative Diseases Research Unit, Laboratory of Neurogenetics (S.W.S.), National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda; Department of Neurology (S.W.S., B.J.T.), Johns Hopkins University Medical Center; Department of Anatomy, Physiology & Genetics (C.D.), and The American Genome Center, Collaborative Health Initiative Research Program (C.D.), Uniformed Services University of the Health Sciences, Bethesda, MD; and Institute of Cognitive Sciences and Technologies (A. Chiò), National Council of Research, Rome, Italy
| | - Umberto Manera
- From "Rita Levi Montalcini" Department of Neuroscience (M.G., A. Calvo, C.M., A. Canosa, U.M., R.V., A. Chiò), University of Turin, Italy; Biocomputational Group (J.D., R.J.G.) and Neuromuscular Diseases Research Section (M.G., R.C., B.J.T.), Laboratory of Neurogenetics, National Institute on Aging, NIH, Porter Neuroscience Research Center, Bethesda, MD; Laboratory of Genetics, Department of Clinical Pathology (M. Brunetti, M. Barberis, L.S.), Azienda Ospedaliero Universitaria Città della Salute e della Scienza di Torino, Turin; Department of Health Sciences Interdisciplinary Research Center of Autoimmune Diseases (L.C., S.D.), "Amedeo Avogadro" University of Eastern Piedmont; ALS Center (L.M.), Department of Neurology, Azienda Ospedaliera Universitaria Maggiore della Carità, Novara, Italy; Neurodegenerative Diseases Research Unit, Laboratory of Neurogenetics (S.W.S.), National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda; Department of Neurology (S.W.S., B.J.T.), Johns Hopkins University Medical Center; Department of Anatomy, Physiology & Genetics (C.D.), and The American Genome Center, Collaborative Health Initiative Research Program (C.D.), Uniformed Services University of the Health Sciences, Bethesda, MD; and Institute of Cognitive Sciences and Technologies (A. Chiò), National Council of Research, Rome, Italy
| | - Rosario Vasta
- From "Rita Levi Montalcini" Department of Neuroscience (M.G., A. Calvo, C.M., A. Canosa, U.M., R.V., A. Chiò), University of Turin, Italy; Biocomputational Group (J.D., R.J.G.) and Neuromuscular Diseases Research Section (M.G., R.C., B.J.T.), Laboratory of Neurogenetics, National Institute on Aging, NIH, Porter Neuroscience Research Center, Bethesda, MD; Laboratory of Genetics, Department of Clinical Pathology (M. Brunetti, M. Barberis, L.S.), Azienda Ospedaliero Universitaria Città della Salute e della Scienza di Torino, Turin; Department of Health Sciences Interdisciplinary Research Center of Autoimmune Diseases (L.C., S.D.), "Amedeo Avogadro" University of Eastern Piedmont; ALS Center (L.M.), Department of Neurology, Azienda Ospedaliera Universitaria Maggiore della Carità, Novara, Italy; Neurodegenerative Diseases Research Unit, Laboratory of Neurogenetics (S.W.S.), National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda; Department of Neurology (S.W.S., B.J.T.), Johns Hopkins University Medical Center; Department of Anatomy, Physiology & Genetics (C.D.), and The American Genome Center, Collaborative Health Initiative Research Program (C.D.), Uniformed Services University of the Health Sciences, Bethesda, MD; and Institute of Cognitive Sciences and Technologies (A. Chiò), National Council of Research, Rome, Italy
| | - Lucia Corrado
- From "Rita Levi Montalcini" Department of Neuroscience (M.G., A. Calvo, C.M., A. Canosa, U.M., R.V., A. Chiò), University of Turin, Italy; Biocomputational Group (J.D., R.J.G.) and Neuromuscular Diseases Research Section (M.G., R.C., B.J.T.), Laboratory of Neurogenetics, National Institute on Aging, NIH, Porter Neuroscience Research Center, Bethesda, MD; Laboratory of Genetics, Department of Clinical Pathology (M. Brunetti, M. Barberis, L.S.), Azienda Ospedaliero Universitaria Città della Salute e della Scienza di Torino, Turin; Department of Health Sciences Interdisciplinary Research Center of Autoimmune Diseases (L.C., S.D.), "Amedeo Avogadro" University of Eastern Piedmont; ALS Center (L.M.), Department of Neurology, Azienda Ospedaliera Universitaria Maggiore della Carità, Novara, Italy; Neurodegenerative Diseases Research Unit, Laboratory of Neurogenetics (S.W.S.), National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda; Department of Neurology (S.W.S., B.J.T.), Johns Hopkins University Medical Center; Department of Anatomy, Physiology & Genetics (C.D.), and The American Genome Center, Collaborative Health Initiative Research Program (C.D.), Uniformed Services University of the Health Sciences, Bethesda, MD; and Institute of Cognitive Sciences and Technologies (A. Chiò), National Council of Research, Rome, Italy
| | - Sandra D'Alfonso
- From "Rita Levi Montalcini" Department of Neuroscience (M.G., A. Calvo, C.M., A. Canosa, U.M., R.V., A. Chiò), University of Turin, Italy; Biocomputational Group (J.D., R.J.G.) and Neuromuscular Diseases Research Section (M.G., R.C., B.J.T.), Laboratory of Neurogenetics, National Institute on Aging, NIH, Porter Neuroscience Research Center, Bethesda, MD; Laboratory of Genetics, Department of Clinical Pathology (M. Brunetti, M. Barberis, L.S.), Azienda Ospedaliero Universitaria Città della Salute e della Scienza di Torino, Turin; Department of Health Sciences Interdisciplinary Research Center of Autoimmune Diseases (L.C., S.D.), "Amedeo Avogadro" University of Eastern Piedmont; ALS Center (L.M.), Department of Neurology, Azienda Ospedaliera Universitaria Maggiore della Carità, Novara, Italy; Neurodegenerative Diseases Research Unit, Laboratory of Neurogenetics (S.W.S.), National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda; Department of Neurology (S.W.S., B.J.T.), Johns Hopkins University Medical Center; Department of Anatomy, Physiology & Genetics (C.D.), and The American Genome Center, Collaborative Health Initiative Research Program (C.D.), Uniformed Services University of the Health Sciences, Bethesda, MD; and Institute of Cognitive Sciences and Technologies (A. Chiò), National Council of Research, Rome, Italy
| | - Letizia Mazzini
- From "Rita Levi Montalcini" Department of Neuroscience (M.G., A. Calvo, C.M., A. Canosa, U.M., R.V., A. Chiò), University of Turin, Italy; Biocomputational Group (J.D., R.J.G.) and Neuromuscular Diseases Research Section (M.G., R.C., B.J.T.), Laboratory of Neurogenetics, National Institute on Aging, NIH, Porter Neuroscience Research Center, Bethesda, MD; Laboratory of Genetics, Department of Clinical Pathology (M. Brunetti, M. Barberis, L.S.), Azienda Ospedaliero Universitaria Città della Salute e della Scienza di Torino, Turin; Department of Health Sciences Interdisciplinary Research Center of Autoimmune Diseases (L.C., S.D.), "Amedeo Avogadro" University of Eastern Piedmont; ALS Center (L.M.), Department of Neurology, Azienda Ospedaliera Universitaria Maggiore della Carità, Novara, Italy; Neurodegenerative Diseases Research Unit, Laboratory of Neurogenetics (S.W.S.), National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda; Department of Neurology (S.W.S., B.J.T.), Johns Hopkins University Medical Center; Department of Anatomy, Physiology & Genetics (C.D.), and The American Genome Center, Collaborative Health Initiative Research Program (C.D.), Uniformed Services University of the Health Sciences, Bethesda, MD; and Institute of Cognitive Sciences and Technologies (A. Chiò), National Council of Research, Rome, Italy
| | - Sonja W Scholz
- From "Rita Levi Montalcini" Department of Neuroscience (M.G., A. Calvo, C.M., A. Canosa, U.M., R.V., A. Chiò), University of Turin, Italy; Biocomputational Group (J.D., R.J.G.) and Neuromuscular Diseases Research Section (M.G., R.C., B.J.T.), Laboratory of Neurogenetics, National Institute on Aging, NIH, Porter Neuroscience Research Center, Bethesda, MD; Laboratory of Genetics, Department of Clinical Pathology (M. Brunetti, M. Barberis, L.S.), Azienda Ospedaliero Universitaria Città della Salute e della Scienza di Torino, Turin; Department of Health Sciences Interdisciplinary Research Center of Autoimmune Diseases (L.C., S.D.), "Amedeo Avogadro" University of Eastern Piedmont; ALS Center (L.M.), Department of Neurology, Azienda Ospedaliera Universitaria Maggiore della Carità, Novara, Italy; Neurodegenerative Diseases Research Unit, Laboratory of Neurogenetics (S.W.S.), National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda; Department of Neurology (S.W.S., B.J.T.), Johns Hopkins University Medical Center; Department of Anatomy, Physiology & Genetics (C.D.), and The American Genome Center, Collaborative Health Initiative Research Program (C.D.), Uniformed Services University of the Health Sciences, Bethesda, MD; and Institute of Cognitive Sciences and Technologies (A. Chiò), National Council of Research, Rome, Italy
| | - Clifton Dalgard
- From "Rita Levi Montalcini" Department of Neuroscience (M.G., A. Calvo, C.M., A. Canosa, U.M., R.V., A. Chiò), University of Turin, Italy; Biocomputational Group (J.D., R.J.G.) and Neuromuscular Diseases Research Section (M.G., R.C., B.J.T.), Laboratory of Neurogenetics, National Institute on Aging, NIH, Porter Neuroscience Research Center, Bethesda, MD; Laboratory of Genetics, Department of Clinical Pathology (M. Brunetti, M. Barberis, L.S.), Azienda Ospedaliero Universitaria Città della Salute e della Scienza di Torino, Turin; Department of Health Sciences Interdisciplinary Research Center of Autoimmune Diseases (L.C., S.D.), "Amedeo Avogadro" University of Eastern Piedmont; ALS Center (L.M.), Department of Neurology, Azienda Ospedaliera Universitaria Maggiore della Carità, Novara, Italy; Neurodegenerative Diseases Research Unit, Laboratory of Neurogenetics (S.W.S.), National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda; Department of Neurology (S.W.S., B.J.T.), Johns Hopkins University Medical Center; Department of Anatomy, Physiology & Genetics (C.D.), and The American Genome Center, Collaborative Health Initiative Research Program (C.D.), Uniformed Services University of the Health Sciences, Bethesda, MD; and Institute of Cognitive Sciences and Technologies (A. Chiò), National Council of Research, Rome, Italy
| | - Jinhui Ding
- From "Rita Levi Montalcini" Department of Neuroscience (M.G., A. Calvo, C.M., A. Canosa, U.M., R.V., A. Chiò), University of Turin, Italy; Biocomputational Group (J.D., R.J.G.) and Neuromuscular Diseases Research Section (M.G., R.C., B.J.T.), Laboratory of Neurogenetics, National Institute on Aging, NIH, Porter Neuroscience Research Center, Bethesda, MD; Laboratory of Genetics, Department of Clinical Pathology (M. Brunetti, M. Barberis, L.S.), Azienda Ospedaliero Universitaria Città della Salute e della Scienza di Torino, Turin; Department of Health Sciences Interdisciplinary Research Center of Autoimmune Diseases (L.C., S.D.), "Amedeo Avogadro" University of Eastern Piedmont; ALS Center (L.M.), Department of Neurology, Azienda Ospedaliera Universitaria Maggiore della Carità, Novara, Italy; Neurodegenerative Diseases Research Unit, Laboratory of Neurogenetics (S.W.S.), National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda; Department of Neurology (S.W.S., B.J.T.), Johns Hopkins University Medical Center; Department of Anatomy, Physiology & Genetics (C.D.), and The American Genome Center, Collaborative Health Initiative Research Program (C.D.), Uniformed Services University of the Health Sciences, Bethesda, MD; and Institute of Cognitive Sciences and Technologies (A. Chiò), National Council of Research, Rome, Italy
| | - Raphael J Gibbs
- From "Rita Levi Montalcini" Department of Neuroscience (M.G., A. Calvo, C.M., A. Canosa, U.M., R.V., A. Chiò), University of Turin, Italy; Biocomputational Group (J.D., R.J.G.) and Neuromuscular Diseases Research Section (M.G., R.C., B.J.T.), Laboratory of Neurogenetics, National Institute on Aging, NIH, Porter Neuroscience Research Center, Bethesda, MD; Laboratory of Genetics, Department of Clinical Pathology (M. Brunetti, M. Barberis, L.S.), Azienda Ospedaliero Universitaria Città della Salute e della Scienza di Torino, Turin; Department of Health Sciences Interdisciplinary Research Center of Autoimmune Diseases (L.C., S.D.), "Amedeo Avogadro" University of Eastern Piedmont; ALS Center (L.M.), Department of Neurology, Azienda Ospedaliera Universitaria Maggiore della Carità, Novara, Italy; Neurodegenerative Diseases Research Unit, Laboratory of Neurogenetics (S.W.S.), National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda; Department of Neurology (S.W.S., B.J.T.), Johns Hopkins University Medical Center; Department of Anatomy, Physiology & Genetics (C.D.), and The American Genome Center, Collaborative Health Initiative Research Program (C.D.), Uniformed Services University of the Health Sciences, Bethesda, MD; and Institute of Cognitive Sciences and Technologies (A. Chiò), National Council of Research, Rome, Italy
| | - Ruth Chia
- From "Rita Levi Montalcini" Department of Neuroscience (M.G., A. Calvo, C.M., A. Canosa, U.M., R.V., A. Chiò), University of Turin, Italy; Biocomputational Group (J.D., R.J.G.) and Neuromuscular Diseases Research Section (M.G., R.C., B.J.T.), Laboratory of Neurogenetics, National Institute on Aging, NIH, Porter Neuroscience Research Center, Bethesda, MD; Laboratory of Genetics, Department of Clinical Pathology (M. Brunetti, M. Barberis, L.S.), Azienda Ospedaliero Universitaria Città della Salute e della Scienza di Torino, Turin; Department of Health Sciences Interdisciplinary Research Center of Autoimmune Diseases (L.C., S.D.), "Amedeo Avogadro" University of Eastern Piedmont; ALS Center (L.M.), Department of Neurology, Azienda Ospedaliera Universitaria Maggiore della Carità, Novara, Italy; Neurodegenerative Diseases Research Unit, Laboratory of Neurogenetics (S.W.S.), National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda; Department of Neurology (S.W.S., B.J.T.), Johns Hopkins University Medical Center; Department of Anatomy, Physiology & Genetics (C.D.), and The American Genome Center, Collaborative Health Initiative Research Program (C.D.), Uniformed Services University of the Health Sciences, Bethesda, MD; and Institute of Cognitive Sciences and Technologies (A. Chiò), National Council of Research, Rome, Italy
| | - Bryan J Traynor
- From "Rita Levi Montalcini" Department of Neuroscience (M.G., A. Calvo, C.M., A. Canosa, U.M., R.V., A. Chiò), University of Turin, Italy; Biocomputational Group (J.D., R.J.G.) and Neuromuscular Diseases Research Section (M.G., R.C., B.J.T.), Laboratory of Neurogenetics, National Institute on Aging, NIH, Porter Neuroscience Research Center, Bethesda, MD; Laboratory of Genetics, Department of Clinical Pathology (M. Brunetti, M. Barberis, L.S.), Azienda Ospedaliero Universitaria Città della Salute e della Scienza di Torino, Turin; Department of Health Sciences Interdisciplinary Research Center of Autoimmune Diseases (L.C., S.D.), "Amedeo Avogadro" University of Eastern Piedmont; ALS Center (L.M.), Department of Neurology, Azienda Ospedaliera Universitaria Maggiore della Carità, Novara, Italy; Neurodegenerative Diseases Research Unit, Laboratory of Neurogenetics (S.W.S.), National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda; Department of Neurology (S.W.S., B.J.T.), Johns Hopkins University Medical Center; Department of Anatomy, Physiology & Genetics (C.D.), and The American Genome Center, Collaborative Health Initiative Research Program (C.D.), Uniformed Services University of the Health Sciences, Bethesda, MD; and Institute of Cognitive Sciences and Technologies (A. Chiò), National Council of Research, Rome, Italy
| | - Adriano Chiò
- From "Rita Levi Montalcini" Department of Neuroscience (M.G., A. Calvo, C.M., A. Canosa, U.M., R.V., A. Chiò), University of Turin, Italy; Biocomputational Group (J.D., R.J.G.) and Neuromuscular Diseases Research Section (M.G., R.C., B.J.T.), Laboratory of Neurogenetics, National Institute on Aging, NIH, Porter Neuroscience Research Center, Bethesda, MD; Laboratory of Genetics, Department of Clinical Pathology (M. Brunetti, M. Barberis, L.S.), Azienda Ospedaliero Universitaria Città della Salute e della Scienza di Torino, Turin; Department of Health Sciences Interdisciplinary Research Center of Autoimmune Diseases (L.C., S.D.), "Amedeo Avogadro" University of Eastern Piedmont; ALS Center (L.M.), Department of Neurology, Azienda Ospedaliera Universitaria Maggiore della Carità, Novara, Italy; Neurodegenerative Diseases Research Unit, Laboratory of Neurogenetics (S.W.S.), National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda; Department of Neurology (S.W.S., B.J.T.), Johns Hopkins University Medical Center; Department of Anatomy, Physiology & Genetics (C.D.), and The American Genome Center, Collaborative Health Initiative Research Program (C.D.), Uniformed Services University of the Health Sciences, Bethesda, MD; and Institute of Cognitive Sciences and Technologies (A. Chiò), National Council of Research, Rome, Italy
| | | |
Collapse
|
20
|
Li Z, Liu Y, Lin X. Simultaneous Detection of Signal Regions Using Quadratic Scan Statistics With Applications to Whole Genome Association Studies. J Am Stat Assoc 2020; 117:823-834. [PMID: 35845434 PMCID: PMC9285665 DOI: 10.1080/01621459.2020.1822849] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2019] [Revised: 06/18/2020] [Accepted: 08/25/2020] [Indexed: 01/03/2023]
Abstract
We consider in this paper detection of signal regions associated with disease outcomes in whole genome association studies. Gene- or region-based methods have become increasingly popular in whole genome association analysis as a complementary approach to traditional individual variant analysis. However, these methods test for the association between an outcome and the genetic variants in a pre-specified region, e.g., a gene. In view of massive intergenic regions in whole genome sequencing (WGS) studies, we propose a computationally efficient quadratic scan (Q-SCAN) statistic based method to detect the existence and the locations of signal regions by scanning the genome continuously. The proposed method accounts for the correlation (linkage disequilibrium) among genetic variants, and allows for signal regions to have both causal and neutral variants, and the effects of signal variants to be in different directions. We study the asymptotic properties of the proposed Q-SCAN statistics. We derive an empirical threshold that controls for the family-wise error rate, and show that under regularity conditions the proposed method consistently selects the true signal regions. We perform simulation studies to evaluate the finite sample performance of the proposed method. Our simulation results show that the proposed procedure outperforms the existing methods, especially when signal regions have causal variants whose effects are in different directions, or are contaminated with neutral variants. We illustrate Q-SCAN by analyzing the WGS data from the Atherosclerosis Risk in Communities study.
Collapse
Affiliation(s)
- Zilin Li
- Harvard University T H Chan School of Public Health, Biostatistics, 655 Huntington Avenue, Boston, 02115 United States
| | - Yaowu Liu
- Southwestern University of Finance and Economics School of Statistics, Chengdu, 610074 China
| | - Xihong Lin
- Harvard University T H Chan School of Public Health, Biostatistics, 655 Huntington Avenue, Boston, 02115 United States
| |
Collapse
|
21
|
Tang ZZ, Sliwoski GR, Chen G, Jin B, Bush WS, Li B, Capra JA. PSCAN: Spatial scan tests guided by protein structures improve complex disease gene discovery and signal variant detection. Genome Biol 2020; 21:217. [PMID: 32847609 PMCID: PMC7448521 DOI: 10.1186/s13059-020-02121-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2019] [Accepted: 07/27/2020] [Indexed: 12/25/2022] Open
Abstract
Germline disease-causing variants are generally more spatially clustered in protein 3-dimensional structures than benign variants. Motivated by this tendency, we develop a fast and powerful protein-structure-based scan (PSCAN) approach for evaluating gene-level associations with complex disease and detecting signal variants. We validate PSCAN's performance on synthetic data and two real data sets for lipid traits and Alzheimer's disease. Our results demonstrate that PSCAN performs competitively with existing gene-level tests while increasing power and identifying more specific signal variant sets. Furthermore, PSCAN enables generation of hypotheses about the molecular basis for the associations in the context of protein structures and functional domains.
Collapse
Affiliation(s)
- Zheng-Zheng Tang
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, 53715 WI USA
- Wisconsin Institute for Discovery, Madison, 53715 WI USA
| | - Gregory R. Sliwoski
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, 37232 TN USA
| | - Guanhua Chen
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, 53715 WI USA
| | - Bowen Jin
- Department for Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, 44106 OH USA
| | - William S. Bush
- Department for Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, 44106 OH USA
- Institute for Computational Biology, Case Western Reserve University, Cleveland, 44106 OH USA
| | - Bingshan Li
- Department of Molecular Physiology & Biophysics, Vanderbilt University Medical Center, Nashville, 37232 TN USA
| | - John A. Capra
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, 37232 TN USA
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, 37232 TN USA
- Departments of Biological Sciences and Computer Science, Vanderbilt University, Nashville, 37232 TN USA
- Center for Structural Biology, Vanderbilt University, Nashville, 37232 TN USA
| |
Collapse
|
22
|
Luo L, Shen J, Zhang H, Chhibber A, Mehrotra DV, Tang ZZ. Multi-trait analysis of rare-variant association summary statistics using MTAR. Nat Commun 2020; 11:2850. [PMID: 32503972 PMCID: PMC7275056 DOI: 10.1038/s41467-020-16591-0] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2019] [Accepted: 05/09/2020] [Indexed: 12/13/2022] Open
Abstract
Integrating association evidence across multiple traits can improve the power of gene discovery and reveal pleiotropy. Most multi-trait analysis methods focus on individual common variants in genome-wide association studies. Here, we introduce multi-trait analysis of rare-variant associations (MTAR), a framework for joint analysis of association summary statistics between multiple rare variants and different traits. MTAR achieves substantial power gain by leveraging the genome-wide genetic correlation measure to inform the degree of gene-level effect heterogeneity across traits. We apply MTAR to rare-variant summary statistics for three lipid traits in the Global Lipids Genetics Consortium. 99 genome-wide significant genes were identified in the single-trait-based tests, and MTAR increases this to 139. Among the 11 novel lipid-associated genes discovered by MTAR, 7 are replicated in an independent UK Biobank GWAS analysis. Our study demonstrates that MTAR is substantially more powerful than single-trait-based tests and highlights the value of MTAR for novel gene discovery.
Collapse
Affiliation(s)
- Lan Luo
- Department of Statistics, University of Wisconsin-Madison, Madison, Wisconsin, 53706, USA
| | - Judong Shen
- Biostatistics and Research Decision Sciences, Merck & Co., Inc., Rahway, New Jersey, 07065, USA
| | - Hong Zhang
- Biostatistics and Research Decision Sciences, Merck & Co., Inc., Rahway, New Jersey, 07065, USA
| | - Aparna Chhibber
- Genetics and Pharmacogenomics, Merck & Co., Inc., West Point, Pennsylvania, 19446, USA
| | - Devan V Mehrotra
- Biostatistics and Research Decision Sciences, Merck & Co., Inc., North Wales, Pennsylvania, 19454, USA
| | - Zheng-Zheng Tang
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, Wisconsin, 53715, USA.
- Wisconsin Institute for Discovery, Madison, Wisconsin, 53715, USA.
| |
Collapse
|
23
|
Jiang Y, Chen S, Wang X, Liu M, Iacono WG, Hewitt JK, Hokanson JE, Krauter K, Laakso M, Li KW, Lutz SM, McGue M, Pandit A, Zajac GJ, Boehnke M, Abecasis GR, Vrieze SI, Jiang B, Zhan X, Liu DJ. Association Analysis and Meta-Analysis of Multi-Allelic Variants for Large-Scale Sequence Data. Genes (Basel) 2020; 11:genes11050586. [PMID: 32466134 PMCID: PMC7288273 DOI: 10.3390/genes11050586] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Revised: 05/19/2020] [Accepted: 05/21/2020] [Indexed: 11/16/2022] Open
Abstract
There is great interest in understanding the impact of rare variants in human diseases using large sequence datasets. In deep sequence datasets of >10,000 samples, ~10% of the variant sites are observed to be multi-allelic. Many of the multi-allelic variants have been shown to be functional and disease-relevant. Proper analysis of multi-allelic variants is critical to the success of a sequencing study, but existing methods do not properly handle multi-allelic variants and can produce highly misleading association results. We discuss practical issues and methods to encode multi-allelic sites, conduct single-variant and gene-level association analyses, and perform meta-analysis for multi-allelic variants. We evaluated these methods through extensive simulations and the study of a large meta-analysis of ~18,000 samples on the cigarettes-per-day phenotype. We showed that our joint modeling approach provided an unbiased estimate of genetic effects, greatly improved the power of single-variant association tests among methods that can properly estimate allele effects, and enhanced gene-level tests over existing approaches. Software packages implementing these methods are available online.
Collapse
Affiliation(s)
- Yu Jiang
- Department of Public Health Sciences, Penn State College of Medicine, Hershey, PA 17033, USA; (Y.J.); (X.W.); (D.J.L.)
| | - Sai Chen
- Illumina Inc., 5200 Illuminay Way, San Diego, CA 92122, USA;
| | - Xingyan Wang
- Department of Public Health Sciences, Penn State College of Medicine, Hershey, PA 17033, USA; (Y.J.); (X.W.); (D.J.L.)
| | - Mengzhen Liu
- Department of Psychology, University of Minnesota, Minneapolis, MN 55454, USA; (M.L.); (M.M.); (S.I.V.)
| | - William G. Iacono
- Department of Psychiatry, University of Minnesota, Minneapolis, MN 55454, USA;
| | - John K. Hewitt
- Institute for Behavioral Genetics, University of Colorado Boulder, Aurora, CO 80045, USA; (J.K.H.); (K.K.)
| | - John E. Hokanson
- Department of Epidemiology, School of Public Health, University of Colorado Denver, Aurora, CO 80045, USA;
| | - Kenneth Krauter
- Institute for Behavioral Genetics, University of Colorado Boulder, Aurora, CO 80045, USA; (J.K.H.); (K.K.)
| | - Markku Laakso
- Department of Medicine, University of Eastern Finland and Kuopio University Hospital, 70211 Kuopio, Finland;
| | - Kevin W. Li
- Center of Statistical Genetics, Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA; (K.W.L.); (A.P.); (G.J.M.Z.); (M.B.); (G.R.A.)
| | - Sharon M. Lutz
- Department of Biostatistics and Informatics, University of Colorado, Anschutz Medical Campus, Aurora, CO 80045, USA;
| | - Matthew McGue
- Department of Psychology, University of Minnesota, Minneapolis, MN 55454, USA; (M.L.); (M.M.); (S.I.V.)
| | - Anita Pandit
- Center of Statistical Genetics, Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA; (K.W.L.); (A.P.); (G.J.M.Z.); (M.B.); (G.R.A.)
| | - Gregory J.M. Zajac
- Center of Statistical Genetics, Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA; (K.W.L.); (A.P.); (G.J.M.Z.); (M.B.); (G.R.A.)
| | - Michael Boehnke
- Center of Statistical Genetics, Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA; (K.W.L.); (A.P.); (G.J.M.Z.); (M.B.); (G.R.A.)
| | - Goncalo R. Abecasis
- Center of Statistical Genetics, Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA; (K.W.L.); (A.P.); (G.J.M.Z.); (M.B.); (G.R.A.)
| | - Scott I. Vrieze
- Department of Psychology, University of Minnesota, Minneapolis, MN 55454, USA; (M.L.); (M.M.); (S.I.V.)
| | - Bibo Jiang
- Department of Public Health Sciences, Penn State College of Medicine, Hershey, PA 17033, USA; (Y.J.); (X.W.); (D.J.L.)
- Correspondence: (B.J.); (X.Z.)
| | - Xiaowei Zhan
- Department of Clinical Science, Quantitative Biomedical Research Center, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
- Correspondence: (B.J.); (X.Z.)
| | - Dajiang J. Liu
- Department of Public Health Sciences, Penn State College of Medicine, Hershey, PA 17033, USA; (Y.J.); (X.W.); (D.J.L.)
| |
Collapse
|
24
|
Fu X, Zhang X, Jiang T, Huang Y, Cheng P, Tang D, Gao J, Du J. Association Between Lifelong Premature Ejaculation and Polymorphism of Tryptophan Hydroxylase 2 Gene in the Han Population. Sex Med 2020; 8:223-229. [PMID: 32169437 PMCID: PMC7261684 DOI: 10.1016/j.esxm.2020.02.004] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2019] [Revised: 02/05/2020] [Accepted: 02/09/2020] [Indexed: 01/02/2023] Open
Abstract
Introduction Premature ejaculation (PE) is widely regarded as one of the most common sexual dysfunctions in men. The neurobiogenesis of PE is complex and involves the serotoninergic (5-HT) system. Aim In this study, we investigated whether polymorphisms in the tryptophan hydroxylase 2 (TPH2) gene were associated with lifelong PE (LPE). Methods A total of 121 men diagnosed with LPE were recruited from our outpatient clinics and 94 healthy controls from the health examination center. Intravaginal ejaculation latency time (IELT) was measured using a stopwatch. The PE diagnostic tool (PEDT) data were collected at the same time. All subjects with LPE and healthy controls were genotyped for polymorphisms in the TPH2 gene. Allele and genotype frequencies of single-nucleotide polymorphisms (SNPs) were compared between the patients and controls. Main Outcome Measure The main outcome measures are IELT and PEDT to diagnose LPE. The association of LPE with TPH2 gene polymorphisms in these areas was investigated. Results The IELT, PEDT scores, and education levels in the LPE group were significantly different from those in the control group. Statistically significant differences were found in the SNPs of SNV019 and rs4290270. The frequencies of the G allele and G/A genotype of SNV019 were significantly higher in the patients with LPE than in the controls (P = .045 and .037, respectively). The A allele and A/A genotype of rs4290270 were more frequent in the patients with LPE than in the controls (P = .037 and .049, respectively). In the dominant model of inheritance, the SNV019 polymorphism in the patients with LPE was significantly different from that in the controls (odds ratio [95% confidence interval] = 2.936 [1.066–8.084], P = .037). In men with LPE, there was no statistically significant association between genotype and median IELT. Conclusion The SNPs SNV019 and rs4290270 of the TPH2 gene seemed to be associated with LPE in the Han population. Men with the A allele of SNV019 or the T allele of rs4290270 may be less likely to suffer from LPE. Fu X, Zhang X, Jiang T, et al. Association Between Lifelong Premature Ejaculation and Polymorphism of Tryptophan Hydroxylase 2 Gene in the Han Population. Sex Med 2020;8:223–229.
Collapse
Affiliation(s)
- Xu Fu
- Department of Urology, The First Affiliated Hospital of Anhui Medical University, Hefei, Anhui, China
| | - Xiansheng Zhang
- Department of Urology, The First Affiliated Hospital of Anhui Medical University, Hefei, Anhui, China.
| | - Tao Jiang
- Department of Urology, The First Affiliated Hospital of Anhui Medical University, Hefei, Anhui, China
| | - Yuanyuan Huang
- Department of Urology, The Fourth Affiliated Hospital of Anhui Medical University, Hefei, Anhui, China
| | - Peng Cheng
- Department of Urology, The Second Affiliated Hospital of Wannan Medical College, Wuhu, Anhui, China
| | - Dongdong Tang
- Department of Obstetrics and Gynecology, Reproductive Medicine Center, The First Affiliated Hospital of Anhui Medical University, Hefei, Anhui, China
| | - Jingjing Gao
- Department of Urology, The First Affiliated Hospital of Anhui Medical University, Hefei, Anhui, China
| | - JunHua Du
- Department of Urology, The First Affiliated Hospital of Anhui Medical University, Hefei, Anhui, China
| |
Collapse
|
25
|
van 't Hof FNG, Lai D, van Setten J, Bots ML, Vaartjes I, Broderick J, Woo D, Foroud T, Rinkel GJE, de Bakker PIW, Ruigrok YM. Exome-chip association analysis of intracranial aneurysms. Neurology 2019; 94:e481-e488. [PMID: 31732565 DOI: 10.1212/wnl.0000000000008665] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2019] [Accepted: 08/01/2019] [Indexed: 11/15/2022] Open
Abstract
OBJECTIVE To investigate to what extent low-frequency genetic variants (with minor allele frequencies <5%) affect the risk of intracranial aneurysms (IAs). METHODS One thousand fifty-six patients with IA and 2,097 population-based controls from the Netherlands were genotyped with the Illumina HumanExome BeadChip. After quality control (QC) of samples and single nucleotide variants (SNVs), we conducted a single variant analysis using the Fisher exact test. We also performed the variable threshold (VT) test and the sequence kernel association test (SKAT) at different minor allele count (MAC) thresholds of >5 and >0 to test the hypothesis that multiple variants within the same gene are associated with IA risk. Significant results were tested in a replication cohort of 425 patients with IA and 311 controls, and results of the 2 cohorts were combined in a meta-analysis. RESULTS After QC, 995 patients with IA and 2,080 controls remained for further analysis. The single variant analysis comprising 46,534 SNVs did not identify significant loci at the genome-wide level. The gene-based tests showed a statistically significant association for fibulin 2 (FBLN2) (best p = 1 × 10-6 for the VT test, MAC >5). Associations were not statistically significant in the independent but smaller replication cohort (p > 0.57) but became slightly stronger in a meta-analysis of the 2 cohorts (best p = 4.8 × 10-7 for the SKAT, MAC ≥1). CONCLUSION Gene-based tests indicated an association for FBLN2, a gene encoding an extracellular matrix protein implicated in vascular wall remodeling, but independent validation in larger cohorts is warranted. We did not identify any significant associations for single low-frequency genetic variants.
Collapse
Affiliation(s)
- Femke N G van 't Hof
- From the Department of Neurology and Neurosurgery (F.N.G.v.H., G.J.E.R., Y.M.R.), Brain Center Rudolf Magnus, Department of Cardiology (J.v.S.), Department of Medical Genetics (P.I.W.d.B.), Centre for Molecular Medicine, and Department of Epidemiology (M.L.B., I.V., P.I.W.d.B.), Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, the Netherlands; Department of Medical and Molecular Genetics (D.L., T.F.), Indiana University School of Medicine, Indianapolis; and Department of Neurology and Rehabilitation Medicine (J.B., D.W.), University of Cincinnati School of Medicine, OH.
| | - Dongbing Lai
- From the Department of Neurology and Neurosurgery (F.N.G.v.H., G.J.E.R., Y.M.R.), Brain Center Rudolf Magnus, Department of Cardiology (J.v.S.), Department of Medical Genetics (P.I.W.d.B.), Centre for Molecular Medicine, and Department of Epidemiology (M.L.B., I.V., P.I.W.d.B.), Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, the Netherlands; Department of Medical and Molecular Genetics (D.L., T.F.), Indiana University School of Medicine, Indianapolis; and Department of Neurology and Rehabilitation Medicine (J.B., D.W.), University of Cincinnati School of Medicine, OH
| | - Jessica van Setten
- From the Department of Neurology and Neurosurgery (F.N.G.v.H., G.J.E.R., Y.M.R.), Brain Center Rudolf Magnus, Department of Cardiology (J.v.S.), Department of Medical Genetics (P.I.W.d.B.), Centre for Molecular Medicine, and Department of Epidemiology (M.L.B., I.V., P.I.W.d.B.), Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, the Netherlands; Department of Medical and Molecular Genetics (D.L., T.F.), Indiana University School of Medicine, Indianapolis; and Department of Neurology and Rehabilitation Medicine (J.B., D.W.), University of Cincinnati School of Medicine, OH
| | - Michiel L Bots
- From the Department of Neurology and Neurosurgery (F.N.G.v.H., G.J.E.R., Y.M.R.), Brain Center Rudolf Magnus, Department of Cardiology (J.v.S.), Department of Medical Genetics (P.I.W.d.B.), Centre for Molecular Medicine, and Department of Epidemiology (M.L.B., I.V., P.I.W.d.B.), Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, the Netherlands; Department of Medical and Molecular Genetics (D.L., T.F.), Indiana University School of Medicine, Indianapolis; and Department of Neurology and Rehabilitation Medicine (J.B., D.W.), University of Cincinnati School of Medicine, OH
| | - Ilonca Vaartjes
- From the Department of Neurology and Neurosurgery (F.N.G.v.H., G.J.E.R., Y.M.R.), Brain Center Rudolf Magnus, Department of Cardiology (J.v.S.), Department of Medical Genetics (P.I.W.d.B.), Centre for Molecular Medicine, and Department of Epidemiology (M.L.B., I.V., P.I.W.d.B.), Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, the Netherlands; Department of Medical and Molecular Genetics (D.L., T.F.), Indiana University School of Medicine, Indianapolis; and Department of Neurology and Rehabilitation Medicine (J.B., D.W.), University of Cincinnati School of Medicine, OH
| | - Joseph Broderick
- From the Department of Neurology and Neurosurgery (F.N.G.v.H., G.J.E.R., Y.M.R.), Brain Center Rudolf Magnus, Department of Cardiology (J.v.S.), Department of Medical Genetics (P.I.W.d.B.), Centre for Molecular Medicine, and Department of Epidemiology (M.L.B., I.V., P.I.W.d.B.), Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, the Netherlands; Department of Medical and Molecular Genetics (D.L., T.F.), Indiana University School of Medicine, Indianapolis; and Department of Neurology and Rehabilitation Medicine (J.B., D.W.), University of Cincinnati School of Medicine, OH
| | - Daniel Woo
- From the Department of Neurology and Neurosurgery (F.N.G.v.H., G.J.E.R., Y.M.R.), Brain Center Rudolf Magnus, Department of Cardiology (J.v.S.), Department of Medical Genetics (P.I.W.d.B.), Centre for Molecular Medicine, and Department of Epidemiology (M.L.B., I.V., P.I.W.d.B.), Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, the Netherlands; Department of Medical and Molecular Genetics (D.L., T.F.), Indiana University School of Medicine, Indianapolis; and Department of Neurology and Rehabilitation Medicine (J.B., D.W.), University of Cincinnati School of Medicine, OH
| | - Tatiana Foroud
- From the Department of Neurology and Neurosurgery (F.N.G.v.H., G.J.E.R., Y.M.R.), Brain Center Rudolf Magnus, Department of Cardiology (J.v.S.), Department of Medical Genetics (P.I.W.d.B.), Centre for Molecular Medicine, and Department of Epidemiology (M.L.B., I.V., P.I.W.d.B.), Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, the Netherlands; Department of Medical and Molecular Genetics (D.L., T.F.), Indiana University School of Medicine, Indianapolis; and Department of Neurology and Rehabilitation Medicine (J.B., D.W.), University of Cincinnati School of Medicine, OH
| | - Gabriel J E Rinkel
- From the Department of Neurology and Neurosurgery (F.N.G.v.H., G.J.E.R., Y.M.R.), Brain Center Rudolf Magnus, Department of Cardiology (J.v.S.), Department of Medical Genetics (P.I.W.d.B.), Centre for Molecular Medicine, and Department of Epidemiology (M.L.B., I.V., P.I.W.d.B.), Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, the Netherlands; Department of Medical and Molecular Genetics (D.L., T.F.), Indiana University School of Medicine, Indianapolis; and Department of Neurology and Rehabilitation Medicine (J.B., D.W.), University of Cincinnati School of Medicine, OH
| | - Paul I W de Bakker
- From the Department of Neurology and Neurosurgery (F.N.G.v.H., G.J.E.R., Y.M.R.), Brain Center Rudolf Magnus, Department of Cardiology (J.v.S.), Department of Medical Genetics (P.I.W.d.B.), Centre for Molecular Medicine, and Department of Epidemiology (M.L.B., I.V., P.I.W.d.B.), Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, the Netherlands; Department of Medical and Molecular Genetics (D.L., T.F.), Indiana University School of Medicine, Indianapolis; and Department of Neurology and Rehabilitation Medicine (J.B., D.W.), University of Cincinnati School of Medicine, OH
| | - Ynte M Ruigrok
- From the Department of Neurology and Neurosurgery (F.N.G.v.H., G.J.E.R., Y.M.R.), Brain Center Rudolf Magnus, Department of Cardiology (J.v.S.), Department of Medical Genetics (P.I.W.d.B.), Centre for Molecular Medicine, and Department of Epidemiology (M.L.B., I.V., P.I.W.d.B.), Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, the Netherlands; Department of Medical and Molecular Genetics (D.L., T.F.), Indiana University School of Medicine, Indianapolis; and Department of Neurology and Rehabilitation Medicine (J.B., D.W.), University of Cincinnati School of Medicine, OH
| |
Collapse
|
26
|
An B, Gao X, Chang T, Xia J, Wang X, Miao J, Xu L, Zhang L, Chen Y, Li J, Xu S, Gao H. Genome-wide association studies using binned genotypes. Heredity (Edinb) 2019; 124:288-298. [PMID: 31641238 DOI: 10.1038/s41437-019-0279-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2019] [Revised: 09/25/2019] [Accepted: 09/26/2019] [Indexed: 01/23/2023] Open
Abstract
Linear mixed models (LMM) that tests trait association one marker at a time have been the most popular methods for genome-wide association studies. However, this approach has potential pitfalls: over conservativeness after Bonferroni correction, ignorance of linkage disequilibrium (LD) between neighboring markers, and power reduction due to overfitting SNP effects. So, multiple locus models that can simultaneously estimate and test all markers in the genome are more appropriate. Based on the multiple locus models, we proposed a bin model that combines markers into bins based on their LD relationships. A bin is treated as a new synthetic marker and we detect the associations between bins and traits. Since the number of bins can be substantially smaller than the number of markers, a penalized multiple regression method can be adopted by fitting all bins to a single model. We developed an innovative method to bin the neighboring markers and used the least absolute shrinkage and selection operator (LASSO) method. We compared BIN-Lasso with SNP-Lasso and Q + K-LMM in a simulation experiment, and showed that the new method is more powerful with less Type I error than the other two methods. We also applied the bin model to a Chinese Simmental beef cattle population for bone weight association study. The new method identified more significant associations than the classical LMM. The bin model is a new dimension reduction technique that takes advantage of biological information (i.e., LD). The new method will be a significant breakthrough in associative genomics in the big data era.
Collapse
Affiliation(s)
- Bingxing An
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Xue Gao
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Tianpeng Chang
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Jiangwei Xia
- Institute of Basic Medical Science, Westlake Institute for Advanced Study, Hangzhou, China
| | - Xiaoqiao Wang
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Jian Miao
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Lingyang Xu
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Lupei Zhang
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Yan Chen
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Junya Li
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Shizhong Xu
- Department of Botany and Plant Sciences, University of California, Riverside, CA, USA
| | - Huijiang Gao
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China.
| |
Collapse
|
27
|
Tang ZZ, Chen G. Zero-inflated generalized Dirichlet multinomial regression model for microbiome compositional data analysis. Biostatistics 2019; 20:698-713. [PMID: 29939212 PMCID: PMC7410344 DOI: 10.1093/biostatistics/kxy025] [Citation(s) in RCA: 38] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2017] [Revised: 04/26/2018] [Accepted: 05/06/2018] [Indexed: 12/19/2022] Open
Abstract
There is heightened interest in using high-throughput sequencing technologies to quantify abundances of microbial taxa and linking the abundance to human diseases and traits. Proper modeling of multivariate taxon counts is essential to the power of detecting this association. Existing models are limited in handling excessive zero observations in taxon counts and in flexibly accommodating complex correlation structures and dispersion patterns among taxa. In this article, we develop a new probability distribution, zero-inflated generalized Dirichlet multinomial (ZIGDM), that overcomes these limitations in modeling multivariate taxon counts. Based on this distribution, we propose a ZIGDM regression model to link microbial abundances to covariates (e.g. disease status) and develop a fast expectation-maximization algorithm to efficiently estimate parameters in the model. The derived tests enable us to reveal rich patterns of variation in microbial compositions including differential mean and dispersion. The advantages of the proposed methods are demonstrated through simulation studies and an analysis of a gut microbiome dataset.
Collapse
Affiliation(s)
- Zheng-Zheng Tang
- Department of Biostatistics and Medical Informatics, University of
Wisconsin-Madison, Madison, WI, USA and Wisconsin Institute for
Discovery, Madison, WI, USA
| | - Guanhua Chen
- Department of Biostatistics and Medical Informatics, University of
Wisconsin-Madison, Madison, WI, USA
| |
Collapse
|
28
|
Ma Y, Jun GR, Zhang X, Chung J, Naj AC, Chen Y, Bellenguez C, Hamilton-Nelson K, Martin ER, Kunkle BW, Bis JC, Debette S, DeStefano AL, Fornage M, Nicolas G, van Duijn C, Bennett DA, De Jager PL, Mayeux R, Haines JL, Pericak-Vance MA, Seshadri S, Lambert JC, Schellenberg GD, Lunetta KL, Farrer LA. Analysis of Whole-Exome Sequencing Data for Alzheimer Disease Stratified by APOE Genotype. JAMA Neurol 2019; 76:1099-1108. [PMID: 31180460 PMCID: PMC6563544 DOI: 10.1001/jamaneurol.2019.1456] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2018] [Accepted: 03/22/2019] [Indexed: 12/13/2022]
Abstract
IMPORTANCE Previous genome-wide association studies of common variants identified associations for Alzheimer disease (AD) loci evident only among individuals with particular APOE alleles. OBJECTIVE To identify APOE genotype-dependent associations with infrequent and rare variants using whole-exome sequencing. DESIGN, SETTING, AND PARTICIPANTS The discovery stage included 10 441 non-Hispanic white participants in the Alzheimer Disease Sequencing Project. Replication was sought in 2 independent, whole-exome sequencing data sets (1766 patients with AD, 2906 without AD [controls]) and a chip-based genotype imputation data set (8728 patients with AD, 9808 controls). Bioinformatics and functional analyses were conducted using clinical, cognitive, neuropathologic, whole-exome sequencing, and gene expression data obtained from a longitudinal cohort sample including 402 patients with AD and 647 controls. Data were analyzed between March 2017 and September 2018. MAIN OUTCOMES AND MEASURES Score, Firth, and sequence kernel association tests were used to test the association of AD risk with individual variants and genes in subgroups of APOE ε4 carriers and noncarriers. Results with P ≤ 1 × 10-5 were further evaluated in the replication data sets and combined by meta-analysis. RESULTS Among 3145 patients with AD and 4213 controls lacking ε4 (mean [SD] age, 83.4 [7.6] years; 4363 [59.3.%] women), novel genome-wide significant associations were obtained in the discovery sample with rs536940594 in AC099552 (odds ratio [OR], 88.0; 95% CI, 9.08-852.0; P = 2.22 × 10-7) and rs138412600 in GPAA1 (OR, 1.78; 95% CI, 1.44-2.2; meta-P = 7.81 × 10-8). GPAA1 was also associated with expression in the brain of GPAA1 (β = -0.08; P = .03) and its repressive transcription factor, FOXG1 (β = 0.13; P = .003), and global cognition function (β = -0.53; P = .009). Significant gene-wide associations (threshold P ≤ 6.35 × 10-7) were observed for OR8G5 (P = 4.67 × 10-7), IGHV3-7 (P = 9.75 × 10-16), and SLC24A3 (P = 2.67 × 10-12) in 2377 patients with AD and 706 controls with ε4 (mean [SD] age, 75.2 [9.6] years; 1668 [54.1%] women). CONCLUSIONS AND RELEVANCE The study identified multiple possible novel associations for AD with individual and aggregated rare variants in groups of individuals with and without APOE ε4 alleles that reinforce known and suggest additional pathways leading to AD.
Collapse
Affiliation(s)
- Yiyi Ma
- Department of Medicine (Biomedical Genetics), Boston University Schools of Medicine and Public Health, Boston, Massachusetts
- Center for Translational & Computational Neuroimmunology, Multiple Sclerosis Clinical Care and Research Center, Division of Neuroimmunology, Columbia University Medical Center, New York, New York
- Department of Neurology, Columbia University Medical Center, New York, New York
| | - Gyungah R. Jun
- Department of Medicine (Biomedical Genetics), Boston University Schools of Medicine and Public Health, Boston, Massachusetts
- Department of Biostatistics, Boston University Schools of Medicine and Public Health, Boston, Massachusetts
| | - Xiaoling Zhang
- Department of Medicine (Biomedical Genetics), Boston University Schools of Medicine and Public Health, Boston, Massachusetts
| | - Jaeyoon Chung
- Department of Medicine (Biomedical Genetics), Boston University Schools of Medicine and Public Health, Boston, Massachusetts
| | - Adam C. Naj
- Perelman School of Medicine, University of Pennsylvania, Philadelphia
| | - Yuning Chen
- Department of Biostatistics, Boston University Schools of Medicine and Public Health, Boston, Massachusetts
| | - Celine Bellenguez
- Universite de Lille, INSERM UMR1167, Institute Pasteur de Lille, Lille, France
| | - Kara Hamilton-Nelson
- John P. Hussman Institute for Human Genomics, Miller School of Medicine, University of Miami, Miami, Florida
| | - Eden R. Martin
- John P. Hussman Institute for Human Genomics, Miller School of Medicine, University of Miami, Miami, Florida
| | - Brian W. Kunkle
- John P. Hussman Institute for Human Genomics, Miller School of Medicine, University of Miami, Miami, Florida
| | - Joshua C. Bis
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle
| | - Stéphanie Debette
- Bordeaux Population Health Research Center, UMR1219, University Bordeaux, Inserm, Bordeaux, France
- Department of Neurology, Bordeaux University Hospital, Bordeaux, France
| | - Anita L. DeStefano
- Department of Biostatistics, Boston University Schools of Medicine and Public Health, Boston, Massachusetts
- Department of Neurology, Boston University Schools of Medicine and Public Health, Boston, Massachusetts
| | - Myriam Fornage
- School of Public Health, University of Texas Health Science Center at Houston, Houston
| | - Gaël Nicolas
- UNIROUEN, Inserm U1245, Normandie University, Rouen, France
- Department of Genetics, Rouen University Hospital, Rouen, France
- Normandy Centre for Genomic and Personalized Medicine, Centre National de Référence pour les Malades Alzheimer Jeunes, Rouen, France
| | - Cornelia van Duijn
- Department of Epidemiology, Erasmus University Medical Center, Rotterdam, the Netherlands
| | - David A. Bennett
- Rush Alzheimer’s Disease Center, Rush University Medical Center, Chicago, Illinois
| | - Philip L. De Jager
- Center for Translational & Computational Neuroimmunology, Multiple Sclerosis Clinical Care and Research Center, Division of Neuroimmunology, Columbia University Medical Center, New York, New York
- Department of Neurology, Columbia University Medical Center, New York, New York
- Program in Translational NeuroPsychiatric Genomics, Institute for the Neurosciences, Departments of Neurology and Psychiatry, Brigham and Women's Hospital, Boston, Massachusetts
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts
| | - Richard Mayeux
- Department of Neurology, Columbia University Medical Center, New York, New York
| | - Jonathan L Haines
- Institute for Computational Biology, Department of Population & Quantitative Health Sciences, Case Western Reserve University, Cleveland, Ohio
| | - Margaret A. Pericak-Vance
- John P. Hussman Institute for Human Genomics, Miller School of Medicine, University of Miami, Miami, Florida
| | - Sudha Seshadri
- Department of Neurology, Boston University Schools of Medicine and Public Health, Boston, Massachusetts
- National Heart, Lung, and Blood Institute’s Framingham Heart Study, Framingham, Massachusetts
- Glenn Biggs Institute for Alzheimer’s and Neurodegenerative Diseases, University of Texas Health Sciences Center, San Antonio
| | | | | | - Kathryn L. Lunetta
- Department of Biostatistics, Boston University Schools of Medicine and Public Health, Boston, Massachusetts
| | - Lindsay A. Farrer
- Department of Medicine (Biomedical Genetics), Boston University Schools of Medicine and Public Health, Boston, Massachusetts
- Department of Biostatistics, Boston University Schools of Medicine and Public Health, Boston, Massachusetts
- Department of Neurology, Boston University Schools of Medicine and Public Health, Boston, Massachusetts
- Department of Ophthalmology, Boston University Schools of Medicine and Public Health, Boston, Massachusetts
- Department of Epidemiology, Boston University Schools of Medicine and Public Health, Boston, Massachusetts
| |
Collapse
|
29
|
Li Z, Li X, Liu Y, Shen J, Chen H, Zhou H, Morrison AC, Boerwinkle E, Lin X. Dynamic Scan Procedure for Detecting Rare-Variant Association Regions in Whole-Genome Sequencing Studies. Am J Hum Genet 2019; 104:802-814. [PMID: 30982610 PMCID: PMC6507043 DOI: 10.1016/j.ajhg.2019.03.002] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2018] [Accepted: 03/01/2019] [Indexed: 11/19/2022] Open
Abstract
Whole-genome sequencing (WGS) studies are being widely conducted in order to identify rare variants associated with human diseases and disease-related traits. Classical single-marker association analyses for rare variants have limited power, and variant-set-based analyses are commonly used by researchers for analyzing rare variants. However, existing variant-set-based approaches need to pre-specify genetic regions for analysis; hence, they are not directly applicable to WGS data because of the large number of intergenic and intron regions that consist of a massive number of non-coding variants. The commonly used sliding-window method requires the pre-specification of fixed window sizes, which are often unknown as a priori, are difficult to specify in practice, and are subject to limitations given that the sizes of genetic-association regions are likely to vary across the genome and phenotypes. We propose a computationally efficient and dynamic scan-statistic method (Scan the Genome [SCANG]) for analyzing WGS data; this method flexibly detects the sizes and the locations of rare-variant association regions without the need to specify a prior, fixed window size. The proposed method controls for the genome-wise type I error rate and accounts for the linkage disequilibrium among genetic variants. It allows the detected sizes of rare-variant association regions to vary across the genome. Through extensive simulated studies that consider a wide variety of scenarios, we show that SCANG substantially outperforms several alternative methods for detecting rare-variant-associations while controlling for the genome-wise type I error rates. We illustrate SCANG by analyzing the WGS lipids data from the Atherosclerosis Risk in Communities (ARIC) study.
Collapse
Affiliation(s)
- Zilin Li
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA
| | - Xihao Li
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA
| | - Yaowu Liu
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA
| | - Jincheng Shen
- Department of Population Health Sciences, University of Utah, Salt Lake City, UT 84108, USA
| | - Han Chen
- Human Genetics Center, Department of Epidemiology, Human Genetics and Environmental Sciences, School of Public Health, the University of Texas Health Science Center at Houston, Houston, TX 77030, USA; Center for Precision Health, School of Public Health and School of Biomedical Informatics, the University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| | - Hufeng Zhou
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA
| | - Alanna C Morrison
- Human Genetics Center, Department of Epidemiology, Human Genetics and Environmental Sciences, School of Public Health, the University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| | - Eric Boerwinkle
- Human Genetics Center, Department of Epidemiology, Human Genetics and Environmental Sciences, School of Public Health, the University of Texas Health Science Center at Houston, Houston, TX 77030, USA; Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Xihong Lin
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA; Department of Statistics, Harvard University, Cambridge, MA 02138, USA.
| |
Collapse
|
30
|
Qin H, Zhao J, Zhu X. Identifying Rare Variant Associations in Admixed Populations. Sci Rep 2019; 9:5458. [PMID: 30931973 PMCID: PMC6443736 DOI: 10.1038/s41598-019-41845-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2018] [Accepted: 03/12/2019] [Indexed: 12/27/2022] Open
Abstract
An admixed population and its ancestral populations bear different burdens of a complex disease. The ancestral populations may have different haplotypes of deleterious alleles and thus ancestry-gene interaction can influence disease risk in the admixed population. Among admixed individuals, deleterious haplotypes and their ancestries are dependent and can provide non-redundant association information. Herein we propose a local ancestry boosted sum test (LABST) for identifying chromosomal blocks that harbor rare variants but have no ancestry switches. For such a stable ancestral block, our LABST exploits ancestry-gene interaction and the number of rare alleles therein. Under the null of no genetic association, the test statistic asymptotically follows a chi-square distribution with one degree of freedom (1-df). Our LABST properly controlled type I error rates under extensive simulations, suggesting that the asymptotic approximation was accurate for the null distribution of the test statistic. In terms of power for identifying rare variant associations, our LABST uniformly outperformed several famed methods under four important modes of disease genetics over a large range of relative risks. In conclusion, exploiting ancestry-gene interaction can boost statistical power for rare variant association mapping in admixed populations.
Collapse
Affiliation(s)
- Huaizhen Qin
- Department of Epidemiology, College of Public Health and Health Professions and College of Medicine, University of Florida, Gainesville, FL, 32611, USA
- Department of Global Biostatistics and Data Science, Tulane University School of Public Health and Tropical Medicine, 1440 Canal Street, New Orleans, LA, 70112, USA
| | - Jinying Zhao
- Department of Epidemiology, College of Public Health and Health Professions and College of Medicine, University of Florida, Gainesville, FL, 32611, USA
| | - Xiaofeng Zhu
- Department of Population and Quantitative Health Sciences, Case Western Reserve University School of Medicine, 10900 Euclid Avenue, Cleveland, Ohio, 44106, USA.
| |
Collapse
|
31
|
Weissenkampen JD, Jiang Y, Eckert S, Jiang B, Li B, Liu DJ. Methods for the Analysis and Interpretation for Rare Variants Associated with Complex Traits. CURRENT PROTOCOLS IN HUMAN GENETICS 2019; 101:e83. [PMID: 30849219 PMCID: PMC6455968 DOI: 10.1002/cphg.83] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
With the advent of Next Generation Sequencing (NGS) technologies, whole genome and whole exome DNA sequencing has become affordable for routine genetic studies. Coupled with improved genotyping arrays and genotype imputation methodologies, it is increasingly feasible to obtain rare genetic variant information in large datasets. Such datasets allow researchers to gain a more complete understanding of the genetic architecture of complex traits caused by rare variants. State-of-the-art statistical methods for the statistical genetics analysis of sequence-based association, including efficient algorithms for association analysis in biobank-scale datasets, gene-association tests, meta-analysis, fine mapping methods that integrate functional genomic dataset, and phenome-wide association studies (PheWAS), are reviewed here. These methods are expected to be highly useful for next generation statistical genetics analysis in the era of precision medicine. © 2019 by John Wiley & Sons, Inc.
Collapse
Affiliation(s)
| | - Yu Jiang
- Department of Public Health Sciences, Penn State College of Medicine, Hershey PA
| | - Scott Eckert
- Department of Public Health Sciences, Penn State College of Medicine, Hershey PA
| | - Bibo Jiang
- Department of Public Health Sciences, Penn State College of Medicine, Hershey PA
| | - Bingshan Li
- Department of Molecular Physiology and Biophysics, Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN
| | - Dajiang J. Liu
- Department of Public Health Sciences, Penn State College of Medicine, Hershey PA
| |
Collapse
|
32
|
Multivariate association test for rare variants controlling for cryptic and family relatedness. CAN J STAT 2019. [DOI: 10.1002/cjs.11475] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
33
|
Chen Z, Wang K. Gene-based sequential burden association test. Stat Med 2019; 38:2353-2363. [PMID: 30706509 DOI: 10.1002/sim.8111] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2018] [Revised: 11/29/2018] [Accepted: 01/10/2019] [Indexed: 11/10/2022]
Abstract
Detecting the association between a set of variants and a phenotype of interest is the first and important step in genetic and genomic studies. Although it attracted a large amount of attention in the scientific community and several related statistical approaches have been proposed in the literature, powerful and robust statistical tests are still highly desired and yet to be developed in this area. In this paper, we propose a powerful and robust association test, which combines information from each individual single-nucleotide polymorphisms based on sequential independent burden tests. We compare the proposed approach with some popular tests through a comprehensive simulation study and real data application. Our results show that, in general, the new test is more powerful; the gain in detecting power can be substantial in many situations, compared to other methods.
Collapse
Affiliation(s)
- Zhongxue Chen
- Department of Epidemiology and Biostatistics, School of Public Health, Indiana University Bloomington, Bloomington, Indiana
| | - Kai Wang
- Department of Biostatistics, College of Public Health, University of Iowa, Iowa City, Iowa
| |
Collapse
|
34
|
Lin DY. A simple and accurate method to determine genomewide significance for association tests in sequencing studies. Genet Epidemiol 2019; 43:365-372. [PMID: 30623491 DOI: 10.1002/gepi.22183] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2018] [Revised: 11/30/2018] [Accepted: 12/04/2018] [Indexed: 11/05/2022]
Abstract
Whole-exome sequencing (WES) and whole-genome sequencing (WGS) studies are underway to investigate the impact of genetic variants on complex diseases and traits. It is customary to perform single-variant association tests for common variants and region-based association tests for rare variants. The latter may target variants with similar or opposite effects, interrogate variants with different frequencies or different functional annotations, and examine a variety of regions. The large number of tests that are performed necessitates adjustment for multiple testing. The conventional Bonferroni correction is overly conservative as the test statistics are correlated. To address this challenge, we propose a simple and accurate method based on parametric bootstrap to assess genomewide significance. We show that the correlations of the test statistics are determined primarily by the genotypes, such that the same significance threshold can be used in different studies that share a common sequencing platform. We demonstrate the usefulness of the proposed method with WES data from the National Heart, Lung, and Blood Institute Exome Sequencing Project and WGS data from the 1000 Genomes Project. We recommend the p value of 5 × 1 0 - 9 as the genomewide significance threshold for testing all common and low-frequency variants (MAFs ≥ 0.1%) in the human genome.
Collapse
Affiliation(s)
- Dan-Yu Lin
- Department of Biostatistics, University of North Carolina, Chapel Hill, North Carolina
| |
Collapse
|
35
|
Project MinE: study design and pilot analyses of a large-scale whole-genome sequencing study in amyotrophic lateral sclerosis. Eur J Hum Genet 2018; 26:1537-1546. [PMID: 29955173 PMCID: PMC6138692 DOI: 10.1038/s41431-018-0177-4] [Citation(s) in RCA: 101] [Impact Index Per Article: 16.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2017] [Revised: 04/10/2018] [Accepted: 04/26/2018] [Indexed: 11/16/2022] Open
Abstract
The most recent genome-wide association study in amyotrophic lateral sclerosis (ALS) demonstrates a disproportionate contribution from low-frequency variants to genetic susceptibility to disease. We have therefore begun Project MinE, an international collaboration that seeks to analyze whole-genome sequence data of at least 15 000 ALS patients and 7500 controls. Here, we report on the design of Project MinE and pilot analyses of successfully sequenced 1169 ALS patients and 608 controls drawn from the Netherlands. As has become characteristic of sequencing studies, we find an abundance of rare genetic variation (minor allele frequency < 0.1%), the vast majority of which is absent in public datasets. Principal component analysis reveals local geographical clustering of these variants within The Netherlands. We use the whole-genome sequence data to explore the implications of poor geographical matching of cases and controls in a sequence-based disease study and to investigate how ancestry-matched, externally sequenced controls can induce false positive associations. Also, we have publicly released genome-wide minor allele counts in cases and controls, as well as results from genic burden tests.
Collapse
|
36
|
Combined linkage and association analysis identifies rare and low frequency variants for blood pressure at 1q31. Eur J Hum Genet 2018; 27:269-277. [PMID: 30262922 DOI: 10.1038/s41431-018-0277-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2017] [Revised: 07/12/2018] [Accepted: 08/28/2018] [Indexed: 12/24/2022] Open
Abstract
High blood pressure (BP) is a major risk factor for cardiovascular disease (CVD) and is more prevalent in African Americans as compared to other US groups. Although large, population-based genome-wide association studies (GWAS) have identified over 300 common polymorphisms modulating inter-individual BP variation, largely in European ancestry subjects, most of them do not localize to regions previously identified through family-based linkage studies. This discrepancy has remained unexplained despite the statistical power differences between current GWAS and prior linkage studies. To address this issue, we performed genome-wide linkage analysis of BP traits in African-American families from the Family Blood Pressure Program (FBPP) and genotyped on the Illumina Human Exome BeadChip v1.1. We identified a genomic region on chromosome 1q31 with LOD score 3.8 for pulse pressure (PP), a region we previously implicated in DBP studies of European ancestry families. Although no reported GWAS variants map to this region, combined linkage and association analysis of PP identified 81 rare and low frequency exonic variants accounting for the linkage evidence. Replication analysis in eight independent African ancestry cohorts (N = 16,968) supports this specific association with PP (P = 0.0509). Additional association and network analyses identified multiple potential candidate genes in this region expressed in multiple tissues and with a strong biological support for a role in BP. In conclusion, multiple genes and rare variants on 1q31 contribute to PP variation. Beyond producing new insights into PP, we demonstrate how family-based linkage and association studies can implicate specific rare and low frequency variants for complex traits.
Collapse
|
37
|
Wu X, Guan T, Liu DJ, León Novelo LG, Bandyopadhyay D. ADAPTIVE-WEIGHT BURDEN TEST FOR ASSOCIATIONS BETWEEN QUANTITATIVE TRAITS AND GENOTYPE DATA WITH COMPLEX CORRELATIONS. Ann Appl Stat 2018; 12:1558-1582. [PMID: 30214655 DOI: 10.1214/17-aoas1121] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
High-throughput sequencing has often been used to screen samples from pedigrees or with population structure, producing genotype data with complex correlations rendered from both familial relation and linkage disequilibrium. With such data, it is critical to account for these genotypic correlations when assessing the contribution of variants by gene or pathway. Recognizing the limitations of existing association testing methods, we propose Adaptive-weight Burden Test (ABT), a retrospective, mixed-model test for genetic association of quantitative traits on genotype data with complex correlations. This method makes full use of genotypic correlations across both samples and variants, and adopts "data-driven" weights to improve power. We derive the ABT statistic and its explicit distribution under the null hypothesis, and demonstrate through simulation studies that it is generally more powerful than the fixed-weight burden test and family-based SKAT in various scenarios, controlling for the type I error rate. Further investigation reveals the connection of ABT with kernel tests, as well as the adaptability of its weights to the direction of genetic effects. The application of ABT is illustrated by a whole genome analysis of genes with common and rare variants associated with fasting glucose from the NHLBI "Grand Opportunity" Exome Sequencing Project.
Collapse
Affiliation(s)
- Xiaowei Wu
- Department of Statistics, Virginia Tech, 250 Drillfield Drive, MC0439, Blacksburg, VA 24061, USA
| | - Ting Guan
- Department of Statistics, Virginia Tech, 250 Drillfield Drive, MC0439, Blacksburg, VA 24061, USA
| | - Dajiang J Liu
- Department of Public Health Sciences, Hershey Institute of Personalized Medicine, Pennsylvania State University College of Medicine, Hershey, PA 17033, USA
| | - Luis G León Novelo
- Department of Biostatistics, School of Public Health, University of Texas Health Science Center, Houston, TX 77030, USA
| | | |
Collapse
|
38
|
Chien LC, Chiu YF. General retrospective mega-analysis framework for rare variant association tests. Genet Epidemiol 2018; 42:621-635. [PMID: 30188589 DOI: 10.1002/gepi.22147] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2017] [Revised: 06/05/2018] [Accepted: 06/05/2018] [Indexed: 11/09/2022]
Abstract
Here, we describe a retrospective mega-analysis framework for gene- or region-based multimarker rare variant association tests. Our proposed mega-analysis association tests allow investigators to combine longitudinal and cross-sectional family- and/or population-based studies. This framework can be applied to a continuous, categorical, or survival trait. In addition to autosomal variants, the tests can be applied to conduct mega-analyses on X-chromosome variants. Tests were built on study-specific region- or gene-level quasiscore statistics and, therefore, do not require estimates of effects of individual rare variants. We used the generalized estimating equation approach to account for complex multiple correlation structures between family members, repeated measurements, and genetic markers. While accounting for multilevel correlations and heterogeneity across studies, the test statistics were computationally efficient and feasible for large-scale sequencing studies. The retrospective aspect of association tests helps alleviate bias due to phenotype-related sampling and type I errors due to misspecification of phenotypic distribution. We evaluated our developed mega-analysis methods through comprehensive simulations with varying sample sizes, covariates, population stratification structures, and study designs across multiple studies. To illustrate application of the proposed framework, we conducted a mega-association analysis combining a longitudinal family study and a cross-sectional case-control study from Genetic Analysis Workshop 19.
Collapse
Affiliation(s)
- Li-Chu Chien
- Center for Fundamental Science, Kaohsiung Medical University, Kaohsiung, Taiwan, ROC
| | - Yen-Feng Chiu
- Institute of Population Health Sciences, National Health Research Institutes, Zhunan, Taiwan, ROC
| |
Collapse
|
39
|
Vandekar SN, Reiss PT, Shinohara RT. Interpretable High-Dimensional Inference Via Score Projection with an Application in Neuroimaging. J Am Stat Assoc 2018; 114:820-830. [PMID: 31548755 DOI: 10.1080/01621459.2018.1448826] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
In the fields of neuroimaging and genetics, a key goal is testing the association of a single outcome with a very high-dimensional imaging or genetic variable. Often, summary measures of the high-dimensional variable are created to sequentially test and localize the association with the outcome. In some cases, the associations between the outcome and summary measures are significant, but subsequent tests used to localize differences are underpowered and do not identify regions associated with the outcome. Here, we propose a generalization of Rao's score test based on projecting the score statistic onto a linear subspace of a high-dimensional parameter space. The approach provides a way to localize signal in the high-dimensional space by projecting the scores to the subspace where the score test was performed. This allows for inference in the high-dimensional space to be performed on the same degrees of freedom as the score test, effectively reducing the number of comparisons. Simulation results demonstrate the test has competitive power relative to others commonly used. We illustrate the method by analyzing a subset of the Alzheimer's Disease Neuroimaging Initiative dataset. Results suggest cortical thinning of the frontal and temporal lobes may be a useful biological marker of Alzheimer's disease risk.
Collapse
Affiliation(s)
- Simon N Vandekar
- Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, Philadelphia, PA 19104
| | - Philip T Reiss
- Department of Statistics, University of Haifa, Haifa, Israel
| | - Russell T Shinohara
- Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, Philadelphia, PA 19104
| |
Collapse
|
40
|
Su YR, Di C, Bien S, Huang L, Dong X, Abecasis G, Berndt S, Bezieau S, Brenner H, Caan B, Casey G, Chang-Claude J, Chanock S, Chen S, Connolly C, Curtis K, Figueiredo J, Gala M, Gallinger S, Harrison T, Hoffmeister M, Hopper J, Huyghe JR, Jenkins M, Joshi A, Le Marchand L, Newcomb P, Nickerson D, Potter J, Schoen R, Slattery M, White E, Zanke B, Peters U, Hsu L. A Mixed-Effects Model for Powerful Association Tests in Integrative Functional Genomics. Am J Hum Genet 2018; 102:904-919. [PMID: 29727690 PMCID: PMC5986723 DOI: 10.1016/j.ajhg.2018.03.019] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2017] [Accepted: 03/15/2018] [Indexed: 01/05/2023] Open
Abstract
Genome-wide association studies (GWASs) have successfully identified thousands of genetic variants for many complex diseases; however, these variants explain only a small fraction of the heritability. Recently, genetic association studies that leverage external transcriptome data have received much attention and shown promise for discovering novel variants. One such approach, PrediXcan, is to use predicted gene expression through genetic regulation. However, there are limitations in this approach. The predicted gene expression may be biased, resulting from regularized regression applied to moderately sample-sized reference studies. Further, some variants can individually influence disease risk through alternative functional mechanisms besides expression. Thus, testing only the association of predicted gene expression as proposed in PrediXcan will potentially lose power. To tackle these challenges, we consider a unified mixed effects model that formulates the association of intermediate phenotypes such as imputed gene expression through fixed effects, while allowing residual effects of individual variants to be random. We consider a set-based score testing framework, MiST (mixed effects score test), and propose two data-driven combination approaches to jointly test for the fixed and random effects. We establish the asymptotic distributions, which enable rapid calculation of p values for genome-wide analyses, and provide p values for fixed and random effects separately to enhance interpretability over GWASs. Extensive simulations demonstrate that our approaches are more powerful than existing ones. We apply our approach to a large-scale GWAS of colorectal cancer and identify two genes, POU5F1B and ATF1, which would have otherwise been missed by PrediXcan, after adjusting for all known loci.
Collapse
Affiliation(s)
- Yu-Ru Su
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA.
| | - Chongzhi Di
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Stephanie Bien
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Licai Huang
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Xinyuan Dong
- Department of Biostatistics, University of Washington, Seattle, WA 98195, USA
| | - Goncalo Abecasis
- Biostatistics, School of Public Health, University of Michigan, Ann Arbor, MI 48109, USA
| | - Sonja Berndt
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD 20850, USA
| | - Stephane Bezieau
- Service de Génétique Médicale Centre Hospitalier Universitaire (CHU) Nantes, Nantes 44093, France
| | - Hermann Brenner
- Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), Heidelberg 69120, Germany
| | - Bette Caan
- Division of Research, Kaiser Permanente Medical Care Program of Northern California, Oakland, CA 94612, USA
| | - Graham Casey
- Public Health Sciences Division, University of Virginia, Charlottesville, VA 22908, USA
| | - Jenny Chang-Claude
- Division of Cancer Epidemiology, German Cancer Research Center, Heidelberg 69009, Germany
| | - Stephen Chanock
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD 20850, USA
| | - Sai Chen
- Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Charles Connolly
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Keith Curtis
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Jane Figueiredo
- Samuel Oschin Comprehensive Cancer Institute, Cedars-Sinai Medical Center, Los Angeles, CA 90048, USA
| | - Manish Gala
- Division of Gastroenterology, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Steven Gallinger
- Department of Surgery, Mount Sinai Hospital, Toronto, ON M5G 1X5, Canada
| | - Tabitha Harrison
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Michael Hoffmeister
- Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), Heidelberg 69120, Germany
| | - John Hopper
- Melborne School of Population Health, The University of Melborne, Carlton, VIC 3010, Australia
| | - Jeroen R Huyghe
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Mark Jenkins
- Melborne School of Population Health, The University of Melborne, Carlton, VIC 3010, Australia
| | - Amit Joshi
- Department of Epidemiology, Harvard School of Public Health, Boston, MA 02115, USA
| | - Loic Le Marchand
- Epidemiology Program, University of Hawaii Cancer Center, Honolulu, HI 96813, USA
| | - Polly Newcomb
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA; Department of Epidemiology, University of Washington School of Public Health, Seattle, WA 98109, USA
| | | | - John Potter
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA; Department of Epidemiology, University of Washington School of Public Health, Seattle, WA 98109, USA
| | - Robert Schoen
- Department of Medicine and Epidemiology, University of Pittsburgh Medical Center, Pittsburgh, PA 15213, USA
| | - Martha Slattery
- Department of Internal Medicine, University of Utah Health Sciences Center, Salt Lake City, UT 84132, USA
| | - Emily White
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA; Department of Epidemiology, University of Washington School of Public Health, Seattle, WA 98109, USA
| | - Brent Zanke
- Division of Hematology, Faculty of Medicine, The University of Ottawa, Ottawa, ON K1Y 4E9, USA
| | - Ulrike Peters
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA; Department of Epidemiology, University of Washington School of Public Health, Seattle, WA 98109, USA
| | - Li Hsu
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA; Department of Biostatistics, University of Washington, Seattle, WA 98195, USA.
| |
Collapse
|
41
|
Genetic variation in 117 myelination-related genes in schizophrenia: Replication of association to lipid biosynthesis genes. Sci Rep 2018; 8:6915. [PMID: 29720671 PMCID: PMC5931982 DOI: 10.1038/s41598-018-25280-4] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2017] [Accepted: 04/10/2018] [Indexed: 01/18/2023] Open
Abstract
Schizophrenia is a serious psychotic disorder with high heritability. Several common genetic variants, rare copy number variants and ultra-rare gene-disrupting mutations have been linked to disease susceptibility, but there is still a large gap between the estimated and explained heritability. Since several studies have indicated brain myelination abnormalities in schizophrenia, we aimed to examine whether variants in myelination-related genes could be associated with risk for schizophrenia. We established a set of 117 myelination genes by database searches and manual curation. We used a combination of GWAS (SCZ_N = 35,476; CTRL_N = 46,839), exome chip (SCZ_N = 269; CTRL_N = 336) and exome sequencing data (SCZ_N = 2,527; CTRL_N = 2,536) from schizophrenia cases and healthy controls to examine common and rare variants. We found that a subset of lipid-related genes was nominally associated with schizophrenia (p = 0.037), but this signal did not survive multiple testing correction (FWER = 0.16) and was mainly driven by the SREBF1 and SREBF2 genes that have already been linked to schizophrenia. Further analysis demonstrated that the lowest nominal p-values were p = 0.0018 for a single common variant (rs8539) and p = 0.012 for burden of rare variants (LRP1 gene), but none of them survived multiple testing correction. Our findings suggest that variation in myelination-related genes is not a major risk factor for schizophrenia.
Collapse
|
42
|
Derkach A, Zhang H, Chatterjee N. Power Analysis for Genetic Association Test (PAGEANT) provides insights to challenges for rare variant association studies. Bioinformatics 2018; 34:1506-1513. [PMID: 29194474 PMCID: PMC5925788 DOI: 10.1093/bioinformatics/btx770] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2017] [Revised: 10/02/2017] [Accepted: 11/27/2017] [Indexed: 12/18/2022] Open
Abstract
Motivation Genome-wide association studies are now shifting focus from analysis of common to rare variants. As power for association testing for individual rare variants may often be low, various aggregate level association tests have been proposed to detect genetic loci. Typically, power calculations for such tests require specification of large number of parameters, including effect sizes and allele frequencies of individual variants, making them difficult to use in practice. We propose to approximate power to a varying degree of accuracy using a smaller number of key parameters, including the total genetic variance explained by multiple variants within a locus. Results We perform extensive simulation studies to assess the accuracy of the proposed approximations in realistic settings. Using these simplified power calculations, we develop an analytic framework to obtain bounds on genetic architecture of an underlying trait given results from genome-wide association studies with rare variants. Finally, we provide insights into the required quality of annotation/functional information for identification of likely causal variants to make meaningful improvement in power. Availability and implementation A shiny application that allows a variety of Power Analysis of GEnetic AssociatioN Tests (PAGEANT), in R is made publicly available at https://andrewhaoyu.shinyapps.io/PAGEANT/. Contact nilanjan@jhu.edu. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Andriy Derkach
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA
| | - Haoyu Zhang
- Department of Biostatistics, Bloomberg School of Public Health, School of Medicine, Johns Hopkins University, Baltimore, MD, USA
| | - Nilanjan Chatterjee
- Department of Biostatistics, Bloomberg School of Public Health, School of Medicine, Johns Hopkins University, Baltimore, MD, USA
- Department of Oncology, School of Medicine, Johns Hopkins University, Baltimore, MD, USA
| |
Collapse
|
43
|
Reexamining Dis/Similarity-Based Tests for Rare-Variant Association with Case-Control Samples. Genetics 2018; 209:105-113. [PMID: 29545466 DOI: 10.1534/genetics.118.300769] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2018] [Accepted: 03/02/2018] [Indexed: 11/18/2022] Open
Abstract
A properly designed distance-based measure can capture informative genetic differences among individuals with different phenotypes and can be used to detect variants responsible for the phenotypes. To detect associated variants, various tests have been designed to contrast genetic dissimilarity or similarity scores of certain subject groups in different ways, among which the most widely used strategy is to quantify the difference between the within-group genetic dissimilarity/similarity (i.e., case-case and control-control similarities) and the between-group dissimilarity/similarity (i.e., case-control similarities). While it has been noted that for common variants, the within-group and the between-group measures should all be included; in this work, we show that for rare variants, comparison based on the two within-group measures can more effectively quantify the genetic difference between cases and controls. The between-group measure tends to overlap with one of the two within-group measures for rare variants, although such overlap is not present for common variants. Consequently, a dissimilarity or similarity test that includes the between-group information tends to attenuate the association signals and leads to power loss. Based on these findings, we propose a dissimilarity test that compares the degree of SNP dissimilarity within cases to that within controls to better characterize the difference between two disease phenotypes. We provide the statistical properties, asymptotic distribution, and computation details for a small sample size of the proposed test. We use simulated and real sequence data to assess the performance of the proposed test, comparing it with other rare-variant methods including those similarity-based tests that use both within-group and between-group information. As similarity-based approaches serve as one of the dominating approaches in rare-variant analysis, our results provide some insight for the effective detection of rare variants.
Collapse
|
44
|
Song M, Wheeler W, Caporaso NE, Landi MT, Chatterjee N. Using imputed genotype data in the joint score tests for genetic association and gene-environment interactions in case-control studies. Genet Epidemiol 2018; 42:146-155. [PMID: 29178451 PMCID: PMC5811375 DOI: 10.1002/gepi.22093] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2017] [Revised: 09/15/2017] [Accepted: 09/27/2017] [Indexed: 01/26/2023]
Abstract
Genome-wide association studies (GWAS) are now routinely imputed for untyped single nucleotide polymorphisms (SNPs) based on various powerful statistical algorithms for imputation trained on reference datasets. The use of predicted allele counts for imputed SNPs as the dosage variable is known to produce valid score test for genetic association. In this paper, we investigate how to best handle imputed SNPs in various modern complex tests for genetic associations incorporating gene-environment interactions. We focus on case-control association studies where inference for an underlying logistic regression model can be performed using alternative methods that rely on varying degree on an assumption of gene-environment independence in the underlying population. As increasingly large-scale GWAS are being performed through consortia effort where it is preferable to share only summary-level information across studies, we also describe simple mechanisms for implementing score tests based on standard meta-analysis of "one-step" maximum-likelihood estimates across studies. Applications of the methods in simulation studies and a dataset from GWAS of lung cancer illustrate ability of the proposed methods to maintain type-I error rates for the underlying testing procedures. For analysis of imputed SNPs, similar to typed SNPs, the retrospective methods can lead to considerable efficiency gain for modeling of gene-environment interactions under the assumption of gene-environment independence. Methods are made available for public use through CGEN R software package.
Collapse
Affiliation(s)
- Minsun Song
- Department of Statistiscs, Sookmyung Women’s University, Seoul, Korea
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Department of Health and Human Services, Rockville, Maryland, United States of America
| | - William Wheeler
- Information Management Services, Inc., Rockville, Maryland, United States of America
| | - Neil E. Caporaso
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Department of Health and Human Services, Rockville, Maryland, United States of America
| | - Maria Teresa Landi
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Department of Health and Human Services, Rockville, Maryland, United States of America
| | - Nilanjan Chatterjee
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Department of Health and Human Services, Rockville, Maryland, United States of America
- Department of Biostatistics, Bloomberg School of Public Health, Johns Hopkins University, Baltimore, Maryland, United States of America
- Department of Oncology, School of Medicine, Johns Hopkins University, Baltimore, Maryland, United States of America
| |
Collapse
|
45
|
Li Y, Xiang Y, Xu C, Shen H, Deng H. Rare variant association analysis in case-parents studies by allowing for missing parental genotypes. BMC Genet 2018; 19:7. [PMID: 29334894 PMCID: PMC5769338 DOI: 10.1186/s12863-018-0597-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2017] [Accepted: 01/04/2018] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The development of next-generation sequencing technologies has facilitated the identification of rare variants. Family-based design is commonly used to effectively control for population admixture and substructure, which is more prominent for rare variants. Case-parents studies, as typical strategies in family-based design, are widely used in rare variant-disease association analysis. Current methods in case-parents studies are based on complete case-parents data; however, parental genotypes may be missing in case-parents trios, and removing these data may lead to a loss in statistical power. The present study focuses on testing for rare variant-disease association in case-parents study by allowing for missing parental genotypes. RESULTS In this report, we extended the collapsing method for rare variant association analysis in case-parents studies to allow for missing parental genotypes, and investigated the performance of two methods by using the difference of genotypes between affected offspring and their corresponding "complements" in case-parent trios and TDT framework. Using simulations, we showed that, compared with the methods just only using complete case-parents data, the proposed strategy allowing for missing parental genotypes, or even adding unrelated affected individuals, can greatly improve the statistical power and meanwhile is not affected by population stratification. CONCLUSIONS We conclude that adding case-parents data with missing parental genotypes to complete case-parents data set can greatly improve the power of our strategy for rare variant-disease association.
Collapse
Affiliation(s)
- Yumei Li
- School of Mathematics and Computational Science, Huaihua University, Huaihua, Hunan, 418008, People's Republic of China. .,Center for Bioinformatics and Genomics, Department of Global Biostatistics and Data Science, Tulane University, New Orleans, LA, 70112, USA.
| | - Yang Xiang
- School of Mathematics and Computational Science, Huaihua University, Huaihua, Hunan, 418008, People's Republic of China
| | - Chao Xu
- Center for Bioinformatics and Genomics, Department of Global Biostatistics and Data Science, Tulane University, New Orleans, LA, 70112, USA
| | - Hui Shen
- Center for Bioinformatics and Genomics, Department of Global Biostatistics and Data Science, Tulane University, New Orleans, LA, 70112, USA
| | - Hongwen Deng
- Center for Bioinformatics and Genomics, Department of Global Biostatistics and Data Science, Tulane University, New Orleans, LA, 70112, USA. .,Center for Bioinformatics and Genomics, School of Public Health and Tropical Medicine, Tulane University, New Orleans, LA, 70112, USA.
| |
Collapse
|
46
|
Abstract
Meta-analysis is a statistical technique that is widely used for improving the power to detect associations, by synthesizing data from independent studies, and is extensively used in the genomic analyses of complex traits. Estimates from different studies are combined and the results effectively provide the power of a much larger study. Meta-analysis also has the potential of discovering heterogeneity in the effects among the different studies. This chapter provides an overview of the methods used for meta-analysis of common and rare single variants and also for gene/region-based analyses; common variants are mainly identified via genome-wide association studies (GWAS) and rare variants through various types of sequencing experiments.
Collapse
Affiliation(s)
- Kyriaki Michailidou
- Department of Electron Microscopy/Molecular Pathology, The Cyprus Institute of Neurology and Genetics, Nicosia, Cyprus.
| |
Collapse
|
47
|
Abstract
While genome-wide association studies have been very successful in identifying associations of common genetic variants with many different traits, the rarer frequency spectrum of the genome has not yet been comprehensively explored. Technological developments increasingly lift restrictions to access rare genetic variation. Dense reference panels enable improved genotype imputation for rarer variants in studies using DNA microarrays. Moreover, the decreasing cost of next generation sequencing makes whole exome and genome sequencing increasingly affordable for large samples. Large-scale efforts based on sequencing, such as ExAC, 100,000 Genomes, and TopMed, are likely to significantly advance this field.The main challenge in evaluating complex trait associations of rare variants is statistical power. The choice of population should be considered carefully because allele frequencies and linkage disequilibrium structure differ between populations. Genetically isolated populations can have favorable genomic characteristics for the study of rare variants.One strategy to increase power is to assess the combined effect of multiple rare variants within a region, known as aggregate testing. A range of methods have been developed for this. Model performance depends on the genetic architecture of the region of interest.
Collapse
Affiliation(s)
- Karoline Kuchenbaecker
- Wellcome Trust Sanger Institute, Cambridge, UK. .,University College London, London, UK.
| | - Emil Vincent Rosenbaum Appel
- Novo Nordisk Foundation Center for Basic Metabolic Research, Section for Metabolic Genetics, Faculty of Health Sciences, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
48
|
Ortega VE, Celedón JC. The Advent of High-Throughput Sequencing Studies of Chronic Obstructive Pulmonary Disease. Am J Respir Crit Care Med 2017; 193:1323-4. [PMID: 27304235 DOI: 10.1164/rccm.201601-0074ed] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Affiliation(s)
- Victor E Ortega
- 1 Center for Genomics and Personalized Medicine Wake Forest School of Medicine Winston-Salem, North Carolina
| | - Juan C Celedón
- 2 Children's Hospital of Pittsburgh of UPMC University of Pittsburgh Pittsburgh, Pennsylvania
| |
Collapse
|
49
|
A Powerful Variant-Set Association Test Based on Chi-Square Distribution. Genetics 2017; 207:903-910. [PMID: 28912342 DOI: 10.1534/genetics.117.300287] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2017] [Accepted: 09/10/2017] [Indexed: 01/19/2023] Open
Abstract
Detecting the association between a set of variants and a given phenotype has attracted a large amount of attention in the scientific community, although it is a difficult task. Recently, several related statistical approaches have been proposed in the literature; powerful statistical tests are still highly desired and yet to be developed in this area. In this paper, we propose a powerful test that combines information from each individual single nucleotide polymorphism (SNP) based on principal component analysis without relying on the eigenvalues associated with the principal components. We compare the proposed approach with some popular tests through a simulation study and real data applications. Our results show that, in general, the new test is more powerful than its competitors considered in this study; the gain in detecting power can be substantial in many situations.
Collapse
|
50
|
A gene-based test of association through an orthogonal decomposition of genotype scores. Hum Genet 2017; 136:1385-1394. [PMID: 28864915 DOI: 10.1007/s00439-017-1839-y] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2017] [Accepted: 08/26/2017] [Indexed: 10/18/2022]
Abstract
The burden test and the sequence kernel association test (SKAT) are two popular methods for detecting association with rare variants. Treated as two different sources of association information, they are adaptively combined to form an optimal SKAT (SKAT-O) method for optimal power. We show that the burden test is part of rather than independent of the SKAT. We introduce a new test statistic that is the sum of the burden statistic and a statistic asymptotically independent of the burden statistic. The performance of this new test statistic is demonstrated through extensive simulation studies and applications to a Genetic Analysis Workshop 17 data set and the Ocular Hypertension Treatment Study data.
Collapse
|