1
|
Church JA, Grigorenko EL, Fletcher JM. The Role of Neural and Genetic Processes in Learning to Read and Specific Reading Disabilities: Implications for Instruction. READING RESEARCH QUARTERLY 2023; 58:203-219. [PMID: 37456924 PMCID: PMC10348696 DOI: 10.1002/rrq.439] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/20/2020] [Accepted: 06/29/2021] [Indexed: 07/18/2023]
Abstract
To learn to read, the brain must repurpose neural systems for oral language and visual processing to mediate written language. We begin with a description of computational models for how alphabetic written language is processed. Next, we explain the roles of a dorsal sublexical system in the brain that relates print and speech, a ventral lexical system that develops the visual expertise for rapid orthographic processing at the word level, and the role of cognitive control networks that regulate attentional processes as children read. We then use studies of children, adult illiterates learning to read, and studies of poor readers involved in intervention, to demonstrate the plasticity of these neural networks in development and in relation to instruction. We provide a brief overview of the rapid increase in the field's understanding and technology for assessing genetic influence on reading. Family studies of twins have shown that reading skills are heritable, and molecular genetic studies have identified numerous regions of the genome that may harbor candidate genes for the heritability of reading. In selected families, reading impairment has been associated with major genetic effects, despite individual gene contributions across the broader population that appear to be small. Neural and genetic studies do not prescribe how children should be taught to read, but these studies have underscored the critical role of early intervention and ongoing support. These studies also have highlighted how structured instruction that facilitates access to the sublexical components of words is a critical part of training the brain to read.
Collapse
Affiliation(s)
| | - Elena L Grigorenko
- University of Houston, Texas, USA; Baylor College of Medicine, Houston, Texas, USA; and St. Petersburg State University, Russia
| | | |
Collapse
|
2
|
Liang J, Wang H, Cade BE, Kurniansyah N, He KY, Lee J, Sands SA, A. Brody J, Chen H, Gottlieb DJ, Evans DS, Guo X, Gharib SA, Hale L, Hillman DR, Lutsey PL, Mukherjee S, Ochs-Balcom HM, Palmer LJ, Purcell S, Saxena R, Patel SR, Stone KL, Tranah GJ, Boerwinkle E, Lin X, Liu Y, Psaty BM, Vasan RS, Manichaikul A, Rich SS, Rotter JI, Sofer T, Redline S, Zhu X. Targeted Genome Sequencing Identifies Multiple Rare Variants in Caveolin-1 Associated with Obstructive Sleep Apnea. Am J Respir Crit Care Med 2022; 206:1271-1280. [PMID: 35822943 PMCID: PMC9746833 DOI: 10.1164/rccm.202203-0618oc] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Accepted: 07/06/2022] [Indexed: 01/04/2023] Open
Abstract
Rationale: Obstructive sleep apnea (OSA) is a common disorder associated with increased risk for cardiovascular disease, diabetes, and premature mortality. There is strong clinical and epidemiologic evidence supporting the importance of genetic factors influencing OSA but limited data implicating specific genes. Objectives: To search for rare variants contributing to OSA severity. Methods: Leveraging high-depth genomic sequencing data from the NHLBI Trans-Omics for Precision Medicine (TOPMed) program and imputed genotype data from multiple population-based studies, we performed linkage analysis in the CFS (Cleveland Family Study), followed by multistage gene-based association analyses in independent cohorts for apnea-hypopnea index (AHI) in a total of 7,708 individuals of European ancestry. Measurements and Main Results: Linkage analysis in the CFS identified a suggestive linkage peak on chromosome 7q31 (LOD = 2.31). Gene-based analysis identified 21 noncoding rare variants in CAV1 (Caveolin-1) associated with lower AHI after accounting for multiple comparisons (P = 7.4 × 10-8). These noncoding variants together significantly contributed to the linkage evidence (P < 10-3). Follow-up analysis revealed significant associations between these variants and increased CAV1 expression, and increased CAV1 expression in peripheral monocytes was associated with lower AHI (P = 0.024) and higher minimum overnight oxygen saturation (P = 0.007). Conclusions: Rare variants in CAV1, a membrane-scaffolding protein essential in multiple cellular and metabolic functions, are associated with higher CAV1 gene expression and lower OSA severity, suggesting a novel target for modulating OSA severity.
Collapse
Affiliation(s)
- Jingjing Liang
- Department of Population and Quantitative Health Sciences, School of Medicine, Case Western Reserve University, Cleveland, Ohio
| | - Heming Wang
- Division of Sleep and Circadian Disorders, Brigham and Women’s Hospital, Boston, Massachusetts
- Division of Sleep Medicine, Harvard Medical School, and
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts
| | - Brian E. Cade
- Division of Sleep and Circadian Disorders, Brigham and Women’s Hospital, Boston, Massachusetts
- Division of Sleep Medicine, Harvard Medical School, and
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts
| | - Nuzulul Kurniansyah
- Division of Sleep and Circadian Disorders, Brigham and Women’s Hospital, Boston, Massachusetts
| | - Karen Y. He
- Department of Population and Quantitative Health Sciences, School of Medicine, Case Western Reserve University, Cleveland, Ohio
| | - Jiwon Lee
- Division of Sleep and Circadian Disorders, Brigham and Women’s Hospital, Boston, Massachusetts
| | - Scott A. Sands
- Division of Sleep and Circadian Disorders, Brigham and Women’s Hospital, Boston, Massachusetts
- Division of Sleep Medicine, Harvard Medical School, and
| | | | - Han Chen
- Human Genetics Center, Department of Epidemiology, Human Genetics and Environmental Sciences, School of Public Health, and
- Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, Texas
| | - Daniel J. Gottlieb
- Division of Sleep and Circadian Disorders, Brigham and Women’s Hospital, Boston, Massachusetts
- Division of Sleep Medicine, Harvard Medical School, and
- VA Boston Healthcare System, Boston, Massachusetts
| | - Daniel S. Evans
- California Pacific Medical Center Research Institute, San Francisco, California
| | - Xiuqing Guo
- Institute for Translational Genomics and Population Sciences and
- Department of Pediatrics, Los Angeles Biomedical Research Institute at Harbor-UCLA Medical Center, Torrance, California
| | - Sina A. Gharib
- Computational Medicine Core, Center for Lung Biology, University of Washington Medicine Sleep Center, Department of Medicine
| | - Lauren Hale
- Family, Population, and Preventive Medicine, Program in Public Health, Renaissance School of Medicine, Stony Brook University, Stony Brook, New York
| | - David R. Hillman
- Department of Pulmonary Physiology and Sleep Medicine, Sir Charles Gairdner Hospital, Perth, Western Australia, Australia
| | - Pamela L. Lutsey
- Division of Epidemiology and Community Health, School of Public Health, University of Minnesota, Minneapolis, Minnesota
| | - Sutapa Mukherjee
- Sleep Health Service, Respiratory and Sleep Service, Southern Adelaide Local Health Network, Adelaide, South Australia, Australia
- Adelaide Institute for Sleep Health, Flinders Health and Medical Research Institute, College Medicine and Public Health, Flinders University, Adelaide, South Australia, Australia
| | - Heather M. Ochs-Balcom
- Department of Epidemiology and Environmental Health, School of Public Health and Health Professions, University at Buffalo, Buffalo, New York
| | - Lyle J. Palmer
- School of Public Health, University of Adelaide, Adelaide, South Australia, Australia
| | - Shaun Purcell
- Division of Sleep and Circadian Disorders, Brigham and Women’s Hospital, Boston, Massachusetts
- Division of Sleep Medicine, Harvard Medical School, and
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts
| | - Richa Saxena
- Division of Sleep and Circadian Disorders, Brigham and Women’s Hospital, Boston, Massachusetts
- Division of Sleep Medicine, Harvard Medical School, and
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts
- Center for Genomic Medicine and
- Department of Anesthesia, Pain and Critical Care Medicine, Massachusetts General Hospital, Boston, Massachusetts
| | - Sanjay R. Patel
- Division of Pulmonary, Allergy, and Critical Care Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania
| | - Katie L. Stone
- California Pacific Medical Center Research Institute, San Francisco, California
| | - Gregory J. Tranah
- California Pacific Medical Center Research Institute, San Francisco, California
| | - Eric Boerwinkle
- Cardiovascular Health Research Unit, Department of Medicine
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas
| | - Xihong Lin
- Department of Biostatistics, T.H. Chan School of Public Health, Harvard University, Boston, Massachusetts
| | - Yongmei Liu
- Department of Medicine, Division of Cardiology, Duke Molecular Physiology Institute, Duke University Medical Center, Durham, North Carolina
| | - Bruce M. Psaty
- Cardiovascular Health Research Unit, Department of Medicine
- Department of Epidemiology, and
- Department of Health Services and Population Health, University of Washington, Seattle, Washington
| | - Ramachandran S. Vasan
- Framingham Heart Study, Framingham, Massachusetts
- Section of Preventive Medicine and Epidemiology and
- Section of Cardiology, Department of Medicine, School of Medicine, and
- Department of Epidemiology, School of Public Health, Boston University, Boston, Massachusetts; and
| | - Ani Manichaikul
- Center for Public Health Genomics and
- Biostatistics Section, Department of Public Health Sciences, University of Virginia, Charlottesville, Virginia
| | | | - Jerome I. Rotter
- California Pacific Medical Center Research Institute, San Francisco, California
- Institute for Translational Genomics and Population Sciences and
| | - Tamar Sofer
- Division of Sleep and Circadian Disorders, Brigham and Women’s Hospital, Boston, Massachusetts
- Division of Sleep Medicine, Harvard Medical School, and
- Department of Biostatistics, T.H. Chan School of Public Health, Harvard University, Boston, Massachusetts
| | - Susan Redline
- Division of Sleep and Circadian Disorders, Brigham and Women’s Hospital, Boston, Massachusetts
- Division of Sleep Medicine, Harvard Medical School, and
| | - Xiaofeng Zhu
- Department of Population and Quantitative Health Sciences, School of Medicine, Case Western Reserve University, Cleveland, Ohio
| | - TOPMed Sleep Working Group
- Department of Population and Quantitative Health Sciences, School of Medicine, Case Western Reserve University, Cleveland, Ohio
- Division of Sleep and Circadian Disorders, Brigham and Women’s Hospital, Boston, Massachusetts
- Division of Sleep Medicine, Harvard Medical School, and
- Department of Biostatistics, T.H. Chan School of Public Health, Harvard University, Boston, Massachusetts
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts
- Cardiovascular Health Research Unit, Department of Medicine
- Computational Medicine Core, Center for Lung Biology, University of Washington Medicine Sleep Center, Department of Medicine
- Department of Epidemiology, and
- Department of Health Services and Population Health, University of Washington, Seattle, Washington
- Human Genetics Center, Department of Epidemiology, Human Genetics and Environmental Sciences, School of Public Health, and
- Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, Texas
- VA Boston Healthcare System, Boston, Massachusetts
- California Pacific Medical Center Research Institute, San Francisco, California
- Institute for Translational Genomics and Population Sciences and
- Department of Pediatrics, Los Angeles Biomedical Research Institute at Harbor-UCLA Medical Center, Torrance, California
- Family, Population, and Preventive Medicine, Program in Public Health, Renaissance School of Medicine, Stony Brook University, Stony Brook, New York
- Department of Pulmonary Physiology and Sleep Medicine, Sir Charles Gairdner Hospital, Perth, Western Australia, Australia
- Division of Epidemiology and Community Health, School of Public Health, University of Minnesota, Minneapolis, Minnesota
- Sleep Health Service, Respiratory and Sleep Service, Southern Adelaide Local Health Network, Adelaide, South Australia, Australia
- Adelaide Institute for Sleep Health, Flinders Health and Medical Research Institute, College Medicine and Public Health, Flinders University, Adelaide, South Australia, Australia
- Department of Epidemiology and Environmental Health, School of Public Health and Health Professions, University at Buffalo, Buffalo, New York
- School of Public Health, University of Adelaide, Adelaide, South Australia, Australia
- Center for Genomic Medicine and
- Department of Anesthesia, Pain and Critical Care Medicine, Massachusetts General Hospital, Boston, Massachusetts
- Division of Pulmonary, Allergy, and Critical Care Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas
- Department of Medicine, Division of Cardiology, Duke Molecular Physiology Institute, Duke University Medical Center, Durham, North Carolina
- Framingham Heart Study, Framingham, Massachusetts
- Section of Preventive Medicine and Epidemiology and
- Section of Cardiology, Department of Medicine, School of Medicine, and
- Department of Epidemiology, School of Public Health, Boston University, Boston, Massachusetts; and
- Center for Public Health Genomics and
- Biostatistics Section, Department of Public Health Sciences, University of Virginia, Charlottesville, Virginia
| |
Collapse
|
3
|
Yang J, Han Y, Lee JH, Yoo HJ. Association of the MACROD2 rs6110695 A>G polymorphism with an increasing WBC count in a Korean population. Immun Inflamm Dis 2022; 10:e669. [PMID: 35759225 PMCID: PMC9233196 DOI: 10.1002/iid3.669] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2022] [Revised: 05/19/2022] [Accepted: 06/04/2022] [Indexed: 11/27/2022] Open
Abstract
INTRODUCTION We aimed to find a novel candidate gene related to the white blood cell (WBC) count in a Korean population. Since WBC count has been reported to have a relation to the risk of chronic diseases according to previous literature, WBC level prediction can be helpful for managing future risk of chronic disease development. In this aspect, a gene newly found in the present study is expected to be utilized as a tool for judging an individual's WBC level. METHODS Based on the 153 study participants' genotype data produced by the Korean Chip. The mono-adenosine diphosphate ribosylhydrolase 2 (MACROD2) rs6110695 A>G polymorphism had a significant strong association with WBC count, thus, the MACROD2 gene emerged as a novel candidate gene for WBC count. To verify the effects of the single-nucleotide polymorphisms on WBC count, the participants were grouped according to the rs6110695 AA and AG genotypes. RESULTS WBC to apolipoprotein A-I ratio, WBC count, granulocyte to lymphocyte ratio, monocyte to platelet ratio, and interferon-γ level were significantly higher in the AG genotype group than in the AA genotype group. Through the receiver operating characteristic curve analysis, the rs6110695 AA and AG genotypes were discriminated by the optimal WBC count cutoff value of 5.450. As expected, the results in the participants having a WBC count over 5.450 were similar to the AG genotype group. CONCLUSIONS We revealed that the MACROD2 rs6110695 AG genotype has an association with increasing WBC count. Since, as previous literature described, WBC count is one of the main risk factors for chronic diseases, WBC count measurement in individuals with the rs6110695 AG genotype that was found in the present study may help manage future chronic disease risk.
Collapse
Affiliation(s)
- Jihye Yang
- Department of Food and NutritionNational Leading Research Laboratory of Clinical Nutrigenetics/Nutrigenomics, College of Human EcologyYonsei UniversitySeoulRepublic of Korea
| | - Youngmin Han
- Department of Food and NutritionNational Leading Research Laboratory of Clinical Nutrigenetics/Nutrigenomics, College of Human EcologyYonsei UniversitySeoulRepublic of Korea
| | - Jong Ho Lee
- Department of Food and NutritionNational Leading Research Laboratory of Clinical Nutrigenetics/Nutrigenomics, College of Human EcologyYonsei UniversitySeoulRepublic of Korea
- Research Center for Silver Science, Institute of Symbiotic Life‐TECHYonsei UniversitySeoulRepublic of Korea
| | - Hye Jin Yoo
- Department of Food and NutritionNational Leading Research Laboratory of Clinical Nutrigenetics/Nutrigenomics, College of Human EcologyYonsei UniversitySeoulRepublic of Korea
- Research Center for Silver Science, Institute of Symbiotic Life‐TECHYonsei UniversitySeoulRepublic of Korea
| |
Collapse
|
4
|
He KY, Kelly TN, Wang H, Liang J, Zhu L, Cade BE, Assimes TL, Becker LC, Beitelshees AL, Bielak LF, Bress AP, Brody JA, Chang YPC, Chang YC, de Vries PS, Duggirala R, Fox ER, Franceschini N, Furniss AL, Gao Y, Guo X, Haessler J, Hung YJ, Hwang SJ, Irvin MR, Kalyani RR, Liu CT, Liu C, Martin LW, Montasser ME, Muntner PM, Mwasongwe S, Naseri T, Palmas W, Reupena MS, Rice KM, Sheu WHH, Shimbo D, Smith JA, Snively BM, Yanek LR, Zhao W, Blangero J, Boerwinkle E, Chen YDI, Correa A, Cupples LA, Curran JE, Fornage M, He J, Hou L, Kaplan RC, Kardia SLR, Kenny EE, Kooperberg C, Lloyd-Jones D, Loos RJF, Mathias RA, McGarvey ST, Mitchell BD, North KE, Peyser PA, Psaty BM, Raffield LM, Rao DC, Redline S, Reiner AP, Rich SS, Rotter JI, Taylor KD, Tracy R, Vasan RS, Morrison AC, Levy D, Chakravarti A, Arnett DK, Zhu X. Rare coding variants in RCN3 are associated with blood pressure. BMC Genomics 2022; 23:148. [PMID: 35183128 PMCID: PMC8858539 DOI: 10.1186/s12864-022-08356-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Accepted: 02/01/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND While large genome-wide association studies have identified nearly one thousand loci associated with variation in blood pressure, rare variant identification is still a challenge. In family-based cohorts, genome-wide linkage scans have been successful in identifying rare genetic variants for blood pressure. This study aims to identify low frequency and rare genetic variants within previously reported linkage regions on chromosomes 1 and 19 in African American families from the Trans-Omics for Precision Medicine (TOPMed) program. Genetic association analyses weighted by linkage evidence were completed with whole genome sequencing data within and across TOPMed ancestral groups consisting of 60,388 individuals of European, African, East Asian, Hispanic, and Samoan ancestries. RESULTS Associations of low frequency and rare variants in RCN3 and multiple other genes were observed for blood pressure traits in TOPMed samples. The association of low frequency and rare coding variants in RCN3 was further replicated in UK Biobank samples (N = 403,522), and reached genome-wide significance for diastolic blood pressure (p = 2.01 × 10- 7). CONCLUSIONS Low frequency and rare variants in RCN3 contributes blood pressure variation. This study demonstrates that focusing association analyses in linkage regions greatly reduces multiple-testing burden and improves power to identify novel rare variants associated with blood pressure traits.
Collapse
Affiliation(s)
- Karen Y He
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Wolstein Research Building, 2103 Cornell Road, Cleveland, OH, 44106, USA
| | - Tanika N Kelly
- Department of Epidemiology, Tulane University School of Public Health and Tropical Medicine, New Orleans, LA, USA
| | - Heming Wang
- Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute, Cambridge, MA, USA
- Division of Sleep Medicine, Harvard Medical School, Boston, MA, USA
| | - Jingjing Liang
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Wolstein Research Building, 2103 Cornell Road, Cleveland, OH, 44106, USA
| | - Luke Zhu
- Center for Human Genetics & Genomics, New York University Grossman School of Medicine, New York, NY, USA
| | - Brian E Cade
- Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute, Cambridge, MA, USA
- Division of Sleep Medicine, Harvard Medical School, Boston, MA, USA
| | - Themistocles L Assimes
- Department of Medicine (Division of Cardiovascular Medicine), Stanford University, Palo Alto, CA, USA
| | - Lewis C Becker
- GeneSTAR Research Program, Department of Medicine, Divisions of Cardiology and General Internal Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Amber L Beitelshees
- Division of Endocrinology, Diabetes, and Nutrition, Program for Personalized and Genomic Medicine, Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Lawrence F Bielak
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
| | - Adam P Bress
- Department of Population Health Sciences, University of Utah School of Medicine, Salt Lake City, UT, USA
| | - Jennifer A Brody
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
| | - Yen-Pei Christy Chang
- Division of Endocrinology, Diabetes, and Nutrition, Program for Personalized and Genomic Medicine, Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Yi-Cheng Chang
- Graduate Institute of Medical Genomics and Proteomics, National Taiwan University, Taipei City, Taiwan
- Institute of Biomedical Sciences, Academia Sinica, Taipei City, Taiwan
- Department of Internal Medicine, National Taiwan University Hospital, Taipei, Taiwan
| | - Paul S de Vries
- Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Ravindranath Duggirala
- Department of Human Genetics and South Texas Diabetes and Obesity Institute, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX, USA
| | - Ervin R Fox
- Division of Cardiovascular Diseases, Department of Medicine, University of Mississippi Medical Center, Jackson, MS, USA
| | - Nora Franceschini
- Department of Epidemiology, UNC Gillings School of Global Public Health, Chapel Hill, NC, USA
| | - Anna L Furniss
- Jackson Heart Study, University of Mississippi Medical Center, Jackson, MS, USA
| | - Yan Gao
- Department of Physiology and Biophysics, University of Mississippi Medical Center, Jackson, MS, USA
| | - Xiuqing Guo
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
| | - Jeffrey Haessler
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Yi-Jen Hung
- Institute of Preventive Medicine, National Defense Medical Center, New Taipei City, Taiwan
| | - Shih-Jen Hwang
- Department of Biostatistics, School of Public Health, Boston University, Boston, MA, USA
- Framingham Heart Study, National Heart, Lung, and Blood Institute's Framingham Heart Study, Framingham, MA, USA
| | - Marguerite Ryan Irvin
- Department of Epidemiology, University of Alabama at Birmingham, Birmingham, AB, USA
| | - Rita R Kalyani
- GeneSTAR Research Program, Department of Medicine, Division of Endocrinology, Diabetes and Metabolism, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Ching-Ti Liu
- Department of Biostatistics, School of Public Health, Boston University, Boston, MA, USA
- Framingham Heart Study, National Heart, Lung, and Blood Institute's Framingham Heart Study, Framingham, MA, USA
| | - Chunyu Liu
- Department of Biostatistics, School of Public Health, Boston University, Boston, MA, USA
- Framingham Heart Study, National Heart, Lung, and Blood Institute's Framingham Heart Study, Framingham, MA, USA
| | - Lisa Warsinger Martin
- Division of Cardiology, Department of Medicine, George Washington University, Washington, DC, USA
| | - May E Montasser
- Division of Endocrinology, Diabetes, and Nutrition, Program for Personalized and Genomic Medicine, Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Paul M Muntner
- Department of Epidemiology, University of Alabama at Birmingham, Birmingham, AB, USA
| | | | - Take Naseri
- Ministry of Health, Government of Samoa, Apia, Samoa
| | - Walter Palmas
- Division of Cardiology, Columbia University Irving Medical Center, New York, NY, USA
| | | | - Kenneth M Rice
- Department of Biostatistics, School of Public Health, University of Washington, Seattle, WA, USA
| | - Wayne H-H Sheu
- Division of Endocrinology and Metabolism, Department of Internal Medicine, Taichung Veterans General Hospital, Taichung City, Taiwan
| | - Daichi Shimbo
- Division of Cardiology, Columbia University Irving Medical Center, New York, NY, USA
| | - Jennifer A Smith
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
- Survey Research Center, Institute for Social Research, University of Michigan, Ann Arbor, MI, USA
| | - Beverly M Snively
- Department of Biostatistics and Data Science, Wake Forest School of Medicine, Winston-Salem, NC, USA
| | - Lisa R Yanek
- GeneSTAR Research Program, Department of Medicine, Division of General Internal Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Wei Zhao
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
| | - John Blangero
- Department of Human Genetics and South Texas Diabetes and Obesity Institute, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX, USA
| | - Eric Boerwinkle
- Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
| | - Yii-Der Ida Chen
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
- Division of Genomic Outcomes, Department of Pediatrics, Harbor-UCLA Medical Center Professor of Pediatrics, UCLA, Torrance, CA, USA
| | - Adolfo Correa
- Jackson Heart Study, University of Mississippi Medical Center, Jackson, MS, USA
| | - L Adrienne Cupples
- Department of Biostatistics, School of Public Health, Boston University, Boston, MA, USA
- Framingham Heart Study, National Heart, Lung, and Blood Institute's Framingham Heart Study, Framingham, MA, USA
| | - Joanne E Curran
- Department of Human Genetics and South Texas Diabetes and Obesity Institute, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX, USA
| | - Myriam Fornage
- Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
- Brown Foundation Institute of Molecular Medicine, McGovern Medical School, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Jiang He
- Department of Epidemiology, Tulane University School of Public Health and Tropical Medicine, New Orleans, LA, USA
| | - Lifang Hou
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University Chicago, Evanston, IL, USA
| | - Robert C Kaplan
- Department of Epidemiology and Population Health, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY, USA
| | - Sharon L R Kardia
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
| | - Eimear E Kenny
- Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Charles Kooperberg
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Donald Lloyd-Jones
- Department of Epidemiology and Population Health, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY, USA
| | - Ruth J F Loos
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Rasika A Mathias
- GeneSTAR Research Program, Department of Medicine, Divisions of Allergy and Clinical Immunology and General Internal Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Stephen T McGarvey
- International Health Institute and Department of Epidemiology, School of Public Health, Brown University, Providence, RI, USA
- Department of Anthropology, Brown University, Providence, RI, USA
| | - Braxton D Mitchell
- Division of Endocrinology, Diabetes, and Nutrition, Program for Personalized and Genomic Medicine, Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
- Geriatrics Research and Education Clinical Center, Veterans Affairs Medical Center, Baltimore, MD, USA
| | - Kari E North
- Department of Epidemiology, UNC Gillings School of Global Public Health, Chapel Hill, NC, USA
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Patricia A Peyser
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
| | - Bruce M Psaty
- Cardiovascular Health Research Unit, Departments of Medicine, Epidemiology, and Health Systems and Population Health, University of Washington, Seattle, WA, USA
| | - Laura M Raffield
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - D C Rao
- Division of Biostatistics, Washington University School of Medicine, St. Louis, MO, USA
| | - Susan Redline
- Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA, USA
- Division of Sleep Medicine, Harvard Medical School, Boston, MA, USA
| | - Alex P Reiner
- Department of Epidemiology, University of Washington, Seattle, WA, USA
| | - Stephen S Rich
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
| | - Jerome I Rotter
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
| | - Kent D Taylor
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
| | - Russell Tracy
- Department of Pathology & Laboratory Medicine, Larner College of Medicine, University of Vermont, Burlington, VT, USA
- Department of Biochemistry, University of Vermont, Burlington, VT, USA
| | - Ramachandran S Vasan
- Framingham Heart Study, National Heart, Lung, and Blood Institute's Framingham Heart Study, Framingham, MA, USA
- Department of Medicine, School of Medicine, Boston University, Boston, MA, USA
| | - Alanna C Morrison
- Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Daniel Levy
- Framingham Heart Study, National Heart, Lung, and Blood Institute's Framingham Heart Study, Framingham, MA, USA
- Population Sciences Branch, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, USA
| | - Aravinda Chakravarti
- Center for Human Genetics & Genomics, New York University Grossman School of Medicine, New York, NY, USA
| | - Donna K Arnett
- University of Kentucky College of Public Health, Lexington, KY, USA
| | - Xiaofeng Zhu
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Wolstein Research Building, 2103 Cornell Road, Cleveland, OH, 44106, USA.
| |
Collapse
|
5
|
Cheng CF, Hsieh AR, Liang WM, Chen CC, Chen CH, Wu JY, Lin TH, Liao CC, Huang SM, Huang YC, Ban B, Lin YJ, Tsai FJ. Genome-Wide and Candidate Gene Association Analyses Identify a 14-SNP Combination for Hypertension in Patients With Type 2 Diabetes. Am J Hypertens 2021; 34:651-661. [PMID: 33276381 DOI: 10.1093/ajh/hpaa203] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2020] [Revised: 10/19/2020] [Accepted: 12/02/2020] [Indexed: 01/11/2023] Open
Abstract
BACKGROUND High blood pressure is common and comorbid with type 2 diabetes (T2D). Almost 50% of patients with T2D have high blood pressure. Patients with both conditions of hypertension (HTN) and T2D are at risk for cardiovascular diseases and mortality. The study aim was to investigate genetic risk factors for HTN in T2D patients. METHODS This study included 999 T2D (cohort 1) patients for the first genome scan stage and 922 T2D (cohort 2) patients for the replication stage. Here, we investigated the genetic susceptibility and cumulative weighted genetic risk score for HTN in T2D patients of Han Chinese descent in Taiwan. RESULTS Thirty novel genetic single nucleotide polymorphisms (SNPs) were associated with HTN in T2D after adjusting for age and body mass index (P value <1 × 10-4). Eight blood pressure-related and/or HTN-related genetic SNPs were associated with HTN in T2D after adjusting for age and body mass index (P value <0.05). Linkage disequilibrium and cumulative weighted genetic risk score analyses showed that 14 of the 38 SNPs were associated with risk of HTN in a dose-dependent manner in T2D (Cochran-Armitage trend test: P value <0.0001). The 14-SNP cumulative weighted genetic risk score was also associated with increased regression tendency of systolic blood pressure in T2D (SBP = 122.05 + 0.8 × weighted genetic risk score; P value = 0.0001). CONCLUSIONS A cumulative weighted genetic risk score composed of 14 SNPs is important for HTN, increased tendency of systolic blood pressure, and may contribute to HTN risk in T2D in Taiwan.
Collapse
Affiliation(s)
- Chi-Fung Cheng
- Graduate Institute of Biostatistics, School of Public Health, China Medical University, Taichung, Taiwan
- Genetic Center, Department of Medical Research, China Medical University Hospital, Taichung, Taiwan
| | - Ai-Ru Hsieh
- Department of Statistics, Tamkang University, New Taipei City, Taiwan
| | - Wen-Miin Liang
- Graduate Institute of Biostatistics, School of Public Health, China Medical University, Taichung, Taiwan
| | - Ching-Chu Chen
- Division of Endocrinology and Metabolism, Department of Medicine, China Medical University Hospital, Taichung, Taiwan
| | - Chien-Hsiun Chen
- Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan
- School of Chinese Medicine, China Medical University, Taichung, Taiwan
| | - Jer-Yuarn Wu
- Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan
- School of Chinese Medicine, China Medical University, Taichung, Taiwan
| | - Ting-Hsu Lin
- Genetic Center, Department of Medical Research, China Medical University Hospital, Taichung, Taiwan
| | - Chiu-Chu Liao
- Genetic Center, Department of Medical Research, China Medical University Hospital, Taichung, Taiwan
| | - Shao-Mei Huang
- Genetic Center, Department of Medical Research, China Medical University Hospital, Taichung, Taiwan
| | - Yu-Chuen Huang
- Genetic Center, Department of Medical Research, China Medical University Hospital, Taichung, Taiwan
- School of Chinese Medicine, China Medical University, Taichung, Taiwan
| | - Bo Ban
- Chinese Research Center for Behavior Medicine in Growth and Development, Jining, Shandong, China
| | - Ying-Ju Lin
- Genetic Center, Department of Medical Research, China Medical University Hospital, Taichung, Taiwan
- School of Chinese Medicine, China Medical University, Taichung, Taiwan
| | - Fuu-Jen Tsai
- Genetic Center, Department of Medical Research, China Medical University Hospital, Taichung, Taiwan
- School of Chinese Medicine, China Medical University, Taichung, Taiwan
- Department of Biotechnology and Bioinformatics, Asia University, Taichung, Taiwan
| |
Collapse
|
6
|
Kim HR, Jin HS, Eom YB. Association of MACROD2 gene variants with obesity and physical activity in a Korean population. Mol Genet Genomic Med 2021; 9:e1635. [PMID: 33624934 PMCID: PMC8123725 DOI: 10.1002/mgg3.1635] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2020] [Revised: 12/18/2020] [Accepted: 02/10/2021] [Indexed: 01/22/2023] Open
Abstract
Background Obesity is a serious and common complex disease caused by the influence of genetic and environmental factors. Therefore, we aimed to evaluate the effect of genetic variants on obesity and the possibility of preventing obesity through physical activity using association analysis. Methods This study analyzed the association between obesity and variants in the MACROD2 gene in the Korean association resource (KARE) cohort using logistic regression analysis. Linear regression analysis was performed for obesity‐related phenotype traits including body mass index (BMI), body fat percentage (BFP), abdominal fat percentage (AbFP), and the waist‐to‐hip ratio (WHR). The level of physical activity was analyzed by dividing the participants into two groups according to the cutoff of one hour or more of daily intense activity. Results As a result, rs6079275 in the MACROD2 gene had the highest significance in obesity and phenotypic characteristics. Minor allele carriers (CC, CG) of rs6079275 decreased the obesity risk (OR = 0.57, 95% CI = 0.40–0.82, p = 2.34 × 10−3) and showed a tendency to decrease the risk of BMI (β = −0.312, p = 8.99 × 10−4), BFP (β = −0.482, p = 4.19 × 10−3) and AbFP (β = −0.0051, p = 5.96 × 10−4). In addition, the participants with the minor allele (C) of rs6079275 had a reduced obesity risk with high physical activity (OR = 0.23, 95% CI: 0.14–0.93, p = 0.036). Conclusions This study demonstrated that variants in the MACROD2 gene were correlated with obesity, phenotypic traits, and physical activity in the Korean population. Therefore, we suggest the possibility of preventing obesity by identifying this genetic variation and the interactive effect of lifestyle in Koreans.
Collapse
Affiliation(s)
- Hye-Rim Kim
- Department of Medical Sciences, Graduate School, Soonchunhyang University, Asan, Republic of Korea
| | - Hyun-Seok Jin
- Department of Biomedical Laboratory Science, College of Life and Health Sciences, Hoseo University, Asan, Chungnam, Republic of Korea
| | - Yong-Bin Eom
- Department of Medical Sciences, Graduate School, Soonchunhyang University, Asan, Republic of Korea.,Department of Biomedical Laboratory Science, College of Medical Sciences, Soonchunhyang University, Asan, Republic of Korea
| |
Collapse
|
7
|
Elston RC. An Accidental Genetic Epidemiologist. Annu Rev Genomics Hum Genet 2020; 21:15-36. [DOI: 10.1146/annurev-genom-103119-125052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
I briefly describe my early life and how, through a series of serendipitous events, I became a genetic epidemiologist. I discuss how the Elston–Stewart algorithm was discovered and its contribution to segregation, linkage, and association analysis. New linkage findings and paternity testing resulted from having a genotyping lab. The different meanings of interaction—statistical and biological—are clarified. The computer package S.A.G.E. (Statistical Analysis for Genetic Epidemiology), based on extensive method development over two decades, was conceived in 1986, flourished for 20 years, and is now freely available for use and further development. Finally, I describe methods to estimate and test hypotheses about familial correlations, and point out that the liability model often used to estimate disease heritability estimates the heritability of that liability, rather than of the disease itself, and so can be highly dependent on the assumed distribution of that liability.
Collapse
Affiliation(s)
- Robert C. Elston
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, Ohio 44106, USA
| |
Collapse
|
8
|
Zhou X, Wang M, Lin S. Detecting rare haplotypes associated with complex diseases using both population and family data: Combined logistic Bayesian Lasso. Stat Methods Med Res 2020; 29:3340-3350. [DOI: 10.1177/0962280220927728] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Haplotype-based association methods have been developed to understand the genetic architecture of complex diseases. Compared to single-variant-based methods, haplotype methods are thought to be more biologically relevant, since there are typically multiple non-independent genetic variants involved in complex diseases, and the use of haplotypes implicitly accounts for non-independence caused by linkage disequilibrium. In recent years, with the focus moving from common to rare variants, haplotype-based methods have also evolved accordingly to uncover the roles of rare haplotypes. One particular approach is regularization-based, with the use of Bayesian least absolute shrinkage and selection operator (Lasso) as an example. This type of methods has been developed for either case-control population data (the logistic Bayesian Lasso (LBL)) or family data (family-triad-based logistic Bayesian Lasso (famLBL)). In some situations, both family data and case-control data are available; therefore, it would be a waste of resources if only one of them could be analyzed. To make full usage of available data to increase power, we propose a unified approach that can combine both case-control and family data (combined logistic Bayesian Lasso (cLBL)). Through simulations, we characterized the performance of cLBL and showed the advantage of cLBL over existing methods. We further applied cLBL to the Framingham Heart Study data to demonstrate its utility in real data applications.
Collapse
Affiliation(s)
- Xiaofei Zhou
- Department of Statistics, The Ohio State University, Columbus, OH, USA
| | - Meng Wang
- Battelle Center for Mathematical Medicine, Nationwide Children’s Hospital, Columbus, OH, USA
| | - Shili Lin
- Department of Statistics, The Ohio State University, Columbus, OH, USA
| |
Collapse
|
9
|
Meng J, Zhu W, Li C, Jon K. A novel association test for rare variants based on algebraic statistics. J Theor Biol 2020; 493:110228. [PMID: 32135159 DOI: 10.1016/j.jtbi.2020.110228] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2019] [Revised: 01/22/2020] [Accepted: 02/29/2020] [Indexed: 01/26/2023]
Abstract
With the rapid growth of next-generation sequencing technology, more and more rare variants are available in the human genome. In recent years, the point of study has already changed direction to rare variants in genome-wide association studies (GWAS). Although a variety of approaches have been proposed to test associations between rare variants and phenotypes of interest, it is far from the end of this problem, and it is worth exploring new statistical methods based on special features of rare variants. As we all know, the most direct way is to evaluate the association in a two-way contingency table if the phenotype is a discrete variable. The numbers of observations are very close or equal to 0s for most of cells in the contingency table due to the extremely low mutation rates of rare variants. In this paper, we propose a novel association test for rare variants based on a generalization of Fisher's exact test, and the p-value of this exact test can be computed under the multivariate hypergeometric distribution in the framework of algebraic statistics. Simulation results show that our proposed method outperforms the existing methods, despite there is heterogeneity among causal variants. We also successfully apply our method into the genetic association study of coronary artery disease and hypertension from the Wellcome Trust Case Control Consortium.
Collapse
Affiliation(s)
- Jingbo Meng
- Key Laboratory for Applied Statistics of MOE, School of Mathematics and Statistics, Northeast Normal University, Changchun 130024, China
| | - Wensheng Zhu
- Key Laboratory for Applied Statistics of MOE, School of Mathematics and Statistics, Northeast Normal University, Changchun 130024, China.
| | - Canhui Li
- Key Laboratory for Applied Statistics of MOE, School of Mathematics and Statistics, Northeast Normal University, Changchun 130024, China
| | - Kyongson Jon
- Key Laboratory for Applied Statistics of MOE, School of Mathematics and Statistics, Northeast Normal University, Changchun 130024, China; Faculty of Mathematics, Kim Il Sung University, Pyongyang, 999093, Democratic People's Republic of Korea
| |
Collapse
|
10
|
Do AN, Zhao W, Baldridge AS, Raffield LM, Wiggins KL, Shah SJ, Aslibekyan S, Tiwari HK, Limdi N, Zhi D, Sitlani CM, Taylor KD, Psaty BM, Sotoodehnia N, Brody JA, Rasmussen‐Torvik LJ, Lloyd‐Jones D, Lange LA, Wilson JG, Smith JA, Kardia SLR, Mosley TH, Vasan RS, Arnett DK, Irvin MR. Genome-wide meta-analysis of SNP and antihypertensive medication interactions on left ventricular traits in African Americans. Mol Genet Genomic Med 2019; 7:e00788. [PMID: 31407531 PMCID: PMC6785453 DOI: 10.1002/mgg3.788] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2018] [Revised: 02/14/2019] [Accepted: 04/22/2019] [Indexed: 01/19/2023] Open
Abstract
BACKGROUND Left ventricular (LV) hypertrophy affects up to 43% of African Americans (AAs). Antihypertensive treatment reduces LV mass (LVM). However, interindividual variation in LV traits in response to antihypertensive treatments exists. We hypothesized that genetic variants may modify the association of antihypertensive treatment class with LV traits measured by echocardiography. METHODS We evaluated the main effects of the three most common antihypertensive treatments for AAs as well as the single nucleotide polymorphism (SNP)-by-drug interaction on LVM and relative wall thickness (RWT) in 2,068 participants across five community-based cohorts. Treatments included thiazide diuretics (TDs), angiotensin converting enzyme inhibitors (ACE-Is), and dihydropyridine calcium channel blockers (dCCBs) and were compared in a pairwise manner. We performed fixed effects inverse variance weighted meta-analyses of main effects of drugs and 2.5 million SNP-by-drug interaction estimates. RESULTS We observed that dCCBs versus TDs were associated with higher LVM after adjusting for covariates (p = 0.001). We report three SNPs at a single locus on chromosome 20 that modified the association between RWT and treatment when comparing dCCBs to ACE-Is with consistent effects across cohorts (smallest p = 4.7 × 10-8 , minor allele frequency range 0.09-0.12). This locus has been linked to LV hypertrophy in a previous study. A marginally significant locus in BICD1 (rs326641) was validated in an external population. CONCLUSIONS Our study identified one locus having genome-wide significant SNP-by-drug interaction effect on RWT among dCCB users in comparison to ACE-I users. Upon additional validation in future studies, our findings can enhance the precision of medical approaches in hypertension treatment.
Collapse
Affiliation(s)
- Anh N. Do
- Department of EpidemiologyUniversity of Alabama at BirminghamBirminghamAlabamaUSA
| | - Wei Zhao
- Department of EpidemiologyUniversity of MichiganAnn ArborMichiganUSA
| | | | - Laura M. Raffield
- Department of GeneticsUniversity of North CarolinaChapel HillNorth CarolinaUSA
| | - Kerri L. Wiggins
- Cardiovascular Health Research Unit, Department of MedicineUniversity of WashingtonSeattleWashingtonUSA
| | - Sanjiv J. Shah
- Feinberg School of MedicineNorthwestern UniversityChicagoIllinoisUSA
| | - Stella Aslibekyan
- Department of EpidemiologyUniversity of Alabama at BirminghamBirminghamAlabamaUSA
| | - Hemant K. Tiwari
- Department of BiostatisticsUniversity of Alabama at BirminghamBirminghamAlabamaUSA
| | - Nita Limdi
- Department of NeurologyUniversity of Alabama at BirminghamBirminghamAlabamaUSA
| | - Degui Zhi
- School of Biomedical InformaticsUniversity of Texas Health Sciences Center at HoustonHoustonTexasUSA
| | - Colleen M. Sitlani
- Cardiovascular Health Research Unit, Department of MedicineUniversity of WashingtonSeattleWashingtonUSA
| | - Kent D. Taylor
- Department of Pediatrics, The Institute for Translational Genomics and Population SciencesLABioMed at Harbor‐UCLA Medical CenterSeattleWashingtonUSA
| | - Bruce M. Psaty
- Cardiovascular Health Research Unit, Departments of Medicine, Epidemiology and Health ServicesUniversity of WashingtonSeattleWashingtonUSA
- Kaiser Permanente Washington Health Research InstituteSeattleWashingtonUSA
| | - Nona Sotoodehnia
- Cardiovascular Health Research Unit, Division of Cardiology, Departments of Medicine and EpidemiologyUniversity of WashingtonSeattleWashingtonUSA
| | - Jennifer A. Brody
- Cardiovascular Health Research Unit, Department of MedicineUniversity of WashingtonSeattleWashingtonUSA
| | - Laura J. Rasmussen‐Torvik
- Department of Preventive Medicine Feinberg School of MedicineNorthwestern UniversityChicagoIllinoisUSA
| | | | - Leslie A. Lange
- Department of MedicineUniversity of Colorado DenverAuroraColoradoUSA
| | - James G. Wilson
- Department of Physiology and BiophysicsUniversity of Mississippi Medical CenterJacksonMississippiUSA
| | - Jennifer A. Smith
- Department of EpidemiologyUniversity of MichiganAnn ArborMichiganUSA
| | | | - Thomas H. Mosley
- Department of MedicineUniversity of Mississippi Medical CenterJacksonMississippiUSA
| | - Ramachandran S. Vasan
- Departments of Medicine and Preventive MedicineBoston University School of MedicineBostonMassachusettsUSA
| | - Donna K. Arnett
- College of Public HealthUniversity of KentuckyLexingtonKentuckyUSA
| | - Marguerite R. Irvin
- Department of EpidemiologyUniversity of Alabama at BirminghamBirminghamAlabamaUSA
| |
Collapse
|
11
|
Zhang J, Wu B, Sha Q, Zhang S, Wang X. A general statistic to test an optimally weighted combination of common and/or rare variants. Genet Epidemiol 2019; 43:966-979. [PMID: 31498476 DOI: 10.1002/gepi.22255] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2019] [Revised: 06/17/2019] [Accepted: 07/30/2019] [Indexed: 11/10/2022]
Abstract
Both genome-wide association study and next-generation sequencing data analyses are widely employed to identify disease susceptible common and/or rare genetic variants. Rare variants generally have large effects though they are hard to detect due to their low frequencies. Currently, many existing statistical methods for rare variants association studies employ a weighted combination scheme, which usually puts subjective weights or suboptimal weights based on some adhoc assumptions (e.g., ignoring dependence between rare variants). In this study, we analytically derived optimal weights for both common and rare variants and proposed a general and novel approach to test association between an optimally weighted combination of variants (G-TOW) in a gene or pathway for a continuous or dichotomous trait while easily adjusting for covariates. Results of the simulation studies show that G-TOW has properly controlled type I error rates and it is the most powerful test among the methods we compared when testing effects of either both rare and common variants or rare variants only. We also illustrate the effectiveness of G-TOW using the Genetic Analysis Workshop 17 (GAW17) data. Additionally, we applied G-TOW and other competitive methods to test disease-associated genes in real data of schizophrenia. The G-TOW has successfully verified genes FYN and VPS39 which are associated with schizophrenia reported in existing publications. Both of these genes are missed by the weighted sum statistic and the sequence kernel association test. Simulation study and real data analysis indicate that G-TOW is a powerful test.
Collapse
Affiliation(s)
- Jianjun Zhang
- Department of Mathematics, University of North Texas, Denton, Texas
| | - Baolin Wu
- Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota
| | - Qiuying Sha
- Department of Mathematical Sciences, Michigan Technological University, Houghton, Michigan
| | - Shuanglin Zhang
- Department of Mathematical Sciences, Michigan Technological University, Houghton, Michigan
| | - Xuexia Wang
- Department of Mathematics, University of North Texas, Denton, Texas
| |
Collapse
|
12
|
Chen G, Yuan A, Cai T, Li CM, Bentley AR, Zhou J, N Shriner D, A Adeyemo A, N Rotimi C. Measuring gene-gene interaction using Kullback-Leibler divergence. Ann Hum Genet 2019; 83:405-417. [PMID: 31206606 DOI: 10.1111/ahg.12324] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2018] [Revised: 03/30/2019] [Accepted: 04/12/2019] [Indexed: 12/29/2022]
Abstract
Genome-wide association studies (GWAS) are used to investigate genetic variants contributing to complex traits. Despite discovering many loci, a large proportion of "missing" heritability remains unexplained. Gene-gene interactions may help explain some of this gap. Traditionally, gene-gene interactions have been evaluated using parametric statistical methods such as linear and logistic regression, with multifactor dimensionality reduction (MDR) used to address sparseness of data in high dimensions. We propose a method for the analysis of gene-gene interactions across independent single-nucleotide polymorphisms (SNPs) in two genes. Typical methods for this problem use statistics based on an asymptotic chi-squared mixture distribution, which is not easy to use. Here, we propose a Kullback-Leibler-type statistic, which follows an asymptotic, positive, normal distribution under the null hypothesis of no relationship between SNPs in the two genes, and normally distributed under the alternative hypothesis. The performance of the proposed method is evaluated by simulation studies, which show promising results. The method is also used to analyze real data and identifies gene-gene interactions among RAB3A, MADD, and PTPRN on type 2 diabetes (T2D) status.
Collapse
Affiliation(s)
- Guanjie Chen
- Center for Research on Genomics and Global Health, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland
| | - Ao Yuan
- Department of Biostatistics, Bioinformatics and Biomathematics, Georgetown University, Washington, DC
| | - Tao Cai
- Experimental Medicine Section, Laboratory of Sensory Biology, NIDCR, NIH, Bethesda, Maryland
| | - Chuan-Ming Li
- Division of Scientific Program, National Institute of Deafness and Other Communication Disorders, Rockville, Maryland, 20892
| | - Amy R Bentley
- Center for Research on Genomics and Global Health, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland
| | - Jie Zhou
- Center for Research on Genomics and Global Health, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland
| | - Daniel N Shriner
- Center for Research on Genomics and Global Health, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland
| | - Adebowale A Adeyemo
- Center for Research on Genomics and Global Health, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland
| | - Charles N Rotimi
- Center for Research on Genomics and Global Health, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland
| |
Collapse
|
13
|
Jones RM, Melton PE, Pinese M, Rea AJ, Ingley E, Ballinger ML, Wood DJ, Thomas DM, Moses EK. Identification of novel sarcoma risk genes using a two-stage genome wide DNA sequencing strategy in cancer cluster families and population case and control cohorts. BMC MEDICAL GENETICS 2019; 20:69. [PMID: 31053105 PMCID: PMC6499942 DOI: 10.1186/s12881-019-0808-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/31/2018] [Accepted: 04/16/2019] [Indexed: 12/26/2022]
Abstract
Background Although familial clustering of cancers is relatively common, only a small proportion of familial cancer risk can be explained by known cancer predisposition genes. Methods In this study we employed a two-stage approach to identify candidate sarcoma risk genes. First, we conducted whole exome sequencing in three multigenerational cancer families ascertained through a sarcoma proband (n = 19) in order to prioritize candidate genes for validation in an independent case-control cohort of sarcoma patients using family-based association and segregation analysis. The second stage employed a burden analysis of rare variants within prioritized candidate genes identified from stage one in 560 sarcoma cases and 1144 healthy ageing controls, for which whole genome sequence was available. Results Variants from eight genes were identified in stage one. Following gene-based burden testing and after correction for multiple testing, two of these genes, ABCB5 and C16orf96, were determined to show statistically significant association with cancer. The ABCB5 gene was found to have a higher burden of putative regulatory variants (OR = 4.9, p-value = 0.007, q-value = 0.04) based on allele counts in sarcoma cases compared to controls. C16orf96, was found to have a significantly lower burden (OR = 0.58, p-value = 0.0004, q-value = 0.003) of regulatory variants in controls compared to sarcoma cases. Conclusions Based on these genetic association data we propose that ABCB5 and C16orf96 are novel candidate risk genes for sarcoma. Although neither of these two genes have been previously associated with sarcoma, ABCB5 has been shown to share clinical drug resistance associations with melanoma and leukaemia and C16orf96 shares regulatory elements with genes that are involved with TNF-alpha mediated apoptosis in a p53/TP53-dependent manner. Future genetic studies in other family and population cohorts will be required for further validation of these novel findings. Electronic supplementary material The online version of this article (10.1186/s12881-019-0808-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Rachel M Jones
- The Curtin UWA Centre for Genetic Origins of Health and Disease, Faculty of Health Sciences, Curtin University and Faculty of Health and Medical Sciences, M409 The University of Western Australia, 35 Stirling Hwy, Crawley, 6009, Western Australia.,Medical School, Faculty of Health and Medical Sciences, University of Western Australia, Crawley, Australia
| | - Phillip E Melton
- The Curtin UWA Centre for Genetic Origins of Health and Disease, Faculty of Health Sciences, Curtin University and Faculty of Health and Medical Sciences, M409 The University of Western Australia, 35 Stirling Hwy, Crawley, 6009, Western Australia.,School of Pharmacy and Biomedical Sciences, Faculty of Health Sciences, Curtin University, Bentley, Western Australia
| | - Mark Pinese
- Cancer Division, Garvan Institute of Medical Research, Darlinghurst, NSW, Australia
| | - Alexander J Rea
- The Curtin UWA Centre for Genetic Origins of Health and Disease, Faculty of Health Sciences, Curtin University and Faculty of Health and Medical Sciences, M409 The University of Western Australia, 35 Stirling Hwy, Crawley, 6009, Western Australia
| | - Evan Ingley
- School of Veterinary and Life Sciences, Murdoch University, Murdoch, Australia.,Harry Perkins Institute of Medical Research, Murdoch, Western Australia.,The Centre for Medical Research, The University of Western Australia, Crawley, Australia
| | - Mandy L Ballinger
- Cancer Division, Garvan Institute of Medical Research, Darlinghurst, NSW, Australia
| | | | - David J Wood
- Medical School, Faculty of Health and Medical Sciences, University of Western Australia, Crawley, Australia
| | - David M Thomas
- Cancer Division, Garvan Institute of Medical Research, Darlinghurst, NSW, Australia
| | - Eric K Moses
- The Curtin UWA Centre for Genetic Origins of Health and Disease, Faculty of Health Sciences, Curtin University and Faculty of Health and Medical Sciences, M409 The University of Western Australia, 35 Stirling Hwy, Crawley, 6009, Western Australia. .,School of Pharmacy and Biomedical Sciences, Faculty of Health Sciences, Curtin University, Bentley, Western Australia. .,School of Biomedical Sciences, Faculty of Health and Medical Sciences, The University of Western Australia, Crawley, Australia.
| |
Collapse
|
14
|
Ko B, Jin HS. MACROD2 Polymorphisms Are Associated with Hypertension in Korean Population. KOREAN JOURNAL OF CLINICAL LABORATORY SCIENCE 2019. [DOI: 10.15324/kjcls.2019.51.1.57] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Affiliation(s)
- Bokyung Ko
- Department of Biomedical Laboratory Science, College of Life and Health Sciences, Hoseo University, Asan, Korea
| | - Hyun-Seok Jin
- Department of Biomedical Laboratory Science, College of Life and Health Sciences, Hoseo University, Asan, Korea
| |
Collapse
|
15
|
Datta AS, Lin S, Biswas S. A Family-Based Rare Haplotype Association Method for Quantitative Traits. Hum Hered 2019; 83:175-195. [PMID: 30799419 DOI: 10.1159/000493543] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2018] [Accepted: 09/07/2018] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND The variants identified in genome-wide association studies account for only a small fraction of disease heritability. A key to this "missing heritability" is believed to be rare variants. Specifically, we focus on rare haplotype variant (rHTV). The existing methods for detecting rHTV are mostly population-based, and as such, are susceptible to population stratification and admixture, leading to an inflated false-positive rate. Family-based methods are more robust in this respect. METHODS We propose a method for detecting rHTVs associated with quantitative traits called family-based quantitative Bayesian LASSO (famQBL). FamQBL can analyze any type of pedigree and is based on a mixed model framework. We regularize the haplotype effects using Bayesian LASSO and estimate the posterior distributions using Markov chain Monte Carlo methods. RESULTS We conduct simulation studies, including analyses of Genetic Analysis Workshop 18 simulated data, to study the properties of famQBL and compare with a standard family-based haplotype association test implemented in FBAT (family-based association test) software. We find famQBL to be more powerful than FBAT with well-controlled false-positive rates. We also apply famQBL to the Framingham Heart Study data and detect an rHTV associated with diastolic blood pressure. CONCLUSION FamQBL can help uncover rHTVs associated with quantitative traits.
Collapse
Affiliation(s)
- Ananda S Datta
- Department of Mathematical Sciences, University of Texas at Dallas, Richardson, Texas, USA
| | - Shili Lin
- Department of Statistics, The Ohio State University, Columbus, Ohio, USA
| | - Swati Biswas
- Department of Mathematical Sciences, University of Texas at Dallas, Richardson, Texas, USA,
| |
Collapse
|
16
|
He KY, Li X, Kelly TN, Liang J, Cade BE, Assimes TL, Becker LC, Beitelshees AL, Bress AP, Chang YPC, Chen YDI, de Vries PS, Fox ER, Franceschini N, Furniss A, Gao Y, Guo X, Haessler J, Hwang SJ, Irvin MR, Kalyani RR, Liu CT, Liu C, Martin LW, Montasser ME, Muntner PM, Mwasongwe S, Palmas W, Reiner AP, Shimbo D, Smith JA, Snively BM, Yanek LR, Boerwinkle E, Correa A, Cupples LA, He J, Kardia SLR, Kooperberg C, Mathias RA, Mitchell BD, Psaty BM, Vasan RS, Rao DC, Rich SS, Rotter JI, Wilson JG, Chakravarti A, Morrison AC, Levy D, Arnett DK, Redline S, Zhu X. Leveraging linkage evidence to identify low-frequency and rare variants on 16p13 associated with blood pressure using TOPMed whole genome sequencing data. Hum Genet 2019; 138:199-210. [PMID: 30671673 PMCID: PMC6404531 DOI: 10.1007/s00439-019-01975-0] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2018] [Accepted: 01/10/2019] [Indexed: 01/05/2023]
Abstract
In this study, we investigated low-frequency and rare variants associated with blood pressure (BP) by focusing on a linkage region on chromosome 16p13. We used whole genome sequencing (WGS) data obtained through the NHLBI Trans-Omics for Precision Medicine (TOPMed) program on 395 Cleveland Family Study (CFS) European Americans (CFS-EA). By analyzing functional coding variants and non-coding rare variants with CADD score > 10 residing within the chromosomal region in families with linkage evidence, we observed 25 genes with nominal statistical evidence (burden or SKAT p < 0.05). One of the genes is RBFOX1, an evolutionarily conserved RNA-binding protein that regulates tissue-specific alternative splicing that we previously reported to be associated with BP using exome array data in CFS. After follow-up analysis of the 25 genes in ten independent TOPMed studies with individuals of European, African, and East Asian ancestry, and Hispanics (N = 29,988), we identified variants in SLX4 (p = 2.19 × 10-4) to be significantly associated with BP traits when accounting for multiple testing. We also replicated the associations previously reported for RBFOX1 (p = 0.007). Follow-up analysis with GTEx eQTL data shows SLX4 variants are associated with gene expression in coronary artery, multiple brain tissues, and right atrial appendage of the heart. Our study demonstrates that linkage analysis of family data can provide an efficient approach for detecting rare variants associated with complex traits in WGS data.
Collapse
Affiliation(s)
- Karen Y He
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, OH, 44106, USA
| | - Xiaoyin Li
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, OH, 44106, USA
| | - Tanika N Kelly
- Department of Epidemiology, Tulane University School of Public Health and Tropical Medicine, New Orleans, LA, 70112, USA
| | - Jingjing Liang
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, OH, 44106, USA
| | - Brian E Cade
- Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA, 02115, USA
- Division of Sleep Medicine, Harvard Medical School, Boston, MA, 02115, USA
| | | | - Lewis C Becker
- GeneSTAR Research Program, Divisions of Cardiology and General Internal Medicine, Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, 21205, USA
| | - Amber L Beitelshees
- Program for Personalized and Genomic Medicine, Division of Endocrinology Diabetes and Nutrition, Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, 21201, USA
| | - Adam P Bress
- Department of Population Health Sciences, University of Utah School of Medicine, Salt Lake City, UT, 84132, USA
| | - Yen-Pei Christy Chang
- Program for Personalized and Genomic Medicine, Division of Endocrinology Diabetes and Nutrition, Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, 21201, USA
| | - Yii-Der Ida Chen
- Departments of Pediatrics and Medicine, Institute for Translational Genomics and Population Sciences, LABioMed at Harbor-UCLA Medical Center, Torrance, CA, 90502, USA
| | - Paul S de Vries
- Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, Human Genetics Center, The University of Texas Health Science Center at Houston, Houston, TX, 77030, USA
| | - Ervin R Fox
- Division of Cardiovascular Diseases, Department of Medicine, University of Mississippi Medical Center, Jackson, MS, 39216, USA
| | - Nora Franceschini
- Department of Epidemiology, UNC Gillings School of Global Public Health, Chapel Hill, NC, 27599, USA
| | - Anna Furniss
- Jackson Heart Study, University of Mississippi Medical Center, Jackson, MS, 39216, USA
| | - Yan Gao
- Department of Physiology and Biophysics, University of Mississippi Medical Center, Jackson, MS, 39216, USA
| | - Xiuqing Guo
- Departments of Pediatrics and Medicine, Institute for Translational Genomics and Population Sciences, LABioMed at Harbor-UCLA Medical Center, Torrance, CA, 90502, USA
| | - Jeffrey Haessler
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, 98109, USA
| | - Shih-Jen Hwang
- Boston University's and National Heart, Lung, and Blood Institute's Framingham Heart Study, Framingham, MA, 01702, USA
| | - Marguerite Ryan Irvin
- Department of Epidemiology, University of Alabama at Birmingham, Birmingham, AB, 35294, USA
| | - Rita R Kalyani
- GeneSTAR Research Program, Division of Endocrinology, Diabetes and Metabolism, Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, 21205, USA
| | - Ching-Ti Liu
- Boston University's and National Heart, Lung, and Blood Institute's Framingham Heart Study, Framingham, MA, 01702, USA
| | - Chunyu Liu
- Boston University's and National Heart, Lung, and Blood Institute's Framingham Heart Study, Framingham, MA, 01702, USA
| | - Lisa Warsinger Martin
- Division of Cardiology, Department of Medicine, George Washington University, Washington, DC, 20052, USA
| | - May E Montasser
- Program for Personalized and Genomic Medicine, Division of Endocrinology Diabetes and Nutrition, Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, 21201, USA
| | - Paul M Muntner
- Department of Epidemiology, University of Alabama at Birmingham, Birmingham, AB, 35294, USA
| | | | - Walter Palmas
- Division of General Medicine, Columbia University Medical Center, New York, NY, 10032, USA
| | - Alex P Reiner
- Department of Epidemiology, University of Washington, Seattle, WA, 98195, USA
| | - Daichi Shimbo
- Division of General Medicine, Columbia University Medical Center, New York, NY, 10032, USA
| | - Jennifer A Smith
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Beverly M Snively
- Department of Biostatistical Sciences, Wake Forest University School of Medicine, Winston-Salem, NC, 27101, USA
| | - Lisa R Yanek
- GeneSTAR Research Program, Division of General Internal Medicine, Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, 21205, USA
| | - Eric Boerwinkle
- Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, Human Genetics Center, The University of Texas Health Science Center at Houston, Houston, TX, 77030, USA
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Adolfo Correa
- Jackson Heart Study, University of Mississippi Medical Center, Jackson, MS, 39216, USA
| | - L Adrienne Cupples
- Boston University's and National Heart, Lung, and Blood Institute's Framingham Heart Study, Framingham, MA, 01702, USA
| | - Jiang He
- Department of Epidemiology, Tulane University School of Public Health and Tropical Medicine, New Orleans, LA, 70112, USA
| | - Sharon L R Kardia
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Charles Kooperberg
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, 98109, USA
| | - Rasika A Mathias
- GeneSTAR Research Program, Divisions of Allergy and Clinical Immunology and General Internal Medicine, Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, 21205, USA
| | - Braxton D Mitchell
- Program for Personalized and Genomic Medicine, Division of Endocrinology Diabetes and Nutrition, Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, 21201, USA
- Geriatrics Research and Education Clinical Center, Veterans Affairs Medical Center, Baltimore, MD, 21201, USA
| | - Bruce M Psaty
- Cardiovascular Health Research Unit, Departments of Medicine, Epidemiology, and Health Services, University of Washington, Seattle, WA, 98195, USA
- Kaiser Permanente Washington Health Research Institute, Seattle, WA, 98101, USA
| | - Ramachandran S Vasan
- Boston University's and National Heart, Lung, and Blood Institute's Framingham Heart Study, Framingham, MA, 01702, USA
| | - D C Rao
- Division of Biostatistics, Washington University School of Medicine, St. Louis, MO, 63110, USA
| | - Stephen S Rich
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, 22908, USA
| | - Jerome I Rotter
- Departments of Pediatrics and Medicine, Institute for Translational Genomics and Population Sciences, LABioMed at Harbor-UCLA Medical Center, Torrance, CA, 90502, USA
| | - James G Wilson
- Department of Physiology and Biophysics, University of Mississippi Medical Center, Jackson, MS, 39216, USA
| | - Aravinda Chakravarti
- Department of Medicine, Center for Human Genetics and Genomics, New York University Langone Health, New York, NY, 10016, USA
| | - Alanna C Morrison
- Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, Human Genetics Center, The University of Texas Health Science Center at Houston, Houston, TX, 77030, USA
| | - Daniel Levy
- Boston University's and National Heart, Lung, and Blood Institute's Framingham Heart Study, Framingham, MA, 01702, USA
- Population Sciences Branch, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Donna K Arnett
- University of Kentucky College of Public Health, Lexington, KY, 40508, USA
| | - Susan Redline
- Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA, 02115, USA
- Division of Sleep Medicine, Harvard Medical School, Boston, MA, 02115, USA
- Division of Pulmonary, Critical Care, and Sleep Medicine, Beth Israel Deaconess Medical Center, Boston, MA, 02215, USA
| | - Xiaofeng Zhu
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, OH, 44106, USA.
| |
Collapse
|
17
|
Guo Y, Zhou Y. A modified association test for rare and common variants based on affected sib-pair design. J Theor Biol 2019; 467:1-6. [PMID: 30707975 DOI: 10.1016/j.jtbi.2019.01.014] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2018] [Accepted: 01/08/2019] [Indexed: 11/18/2022]
Abstract
Current genome-wide association analysis has identified a great number of rare and common variants associated with common complex traits, however, more effective approaches for detecting associations between rare and common variants with common diseases are still demanded. Approaches for detecting rare variant association analysis will compromise the power when detecting the effects of rare and common variants simultaneously. In this paper, we extend an existing method of testing for rare variant association based on affected sib pairs (TOW-sib) and propose a variable weight test for rare and common variants association based on affected sib pairs (abbreviated as VW-TOWsib). The VW-TOWsib can be used to achieve the purpose of detecting the association of rare and common variants with complex diseases. Simulation results in various scenarios show that our proposed method is more powerful than existing methods for detecting effects of rare and common variants. At the same time, the VW-TOWsib also performs well as a method for rare variant association analysis.
Collapse
Affiliation(s)
- Yixing Guo
- Department of Statistics, School of Mathematical Sciences, Heilongjiang University and Heilongjiang Provincial Key Laboratory of the Theory and Computation of Complex Systems, Harbin 150080, China
| | - Ying Zhou
- Department of Statistics, School of Mathematical Sciences, Heilongjiang University and Heilongjiang Provincial Key Laboratory of the Theory and Computation of Complex Systems, Harbin 150080, China.
| |
Collapse
|
18
|
Combined linkage and association analysis identifies rare and low frequency variants for blood pressure at 1q31. Eur J Hum Genet 2018; 27:269-277. [PMID: 30262922 DOI: 10.1038/s41431-018-0277-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2017] [Revised: 07/12/2018] [Accepted: 08/28/2018] [Indexed: 12/24/2022] Open
Abstract
High blood pressure (BP) is a major risk factor for cardiovascular disease (CVD) and is more prevalent in African Americans as compared to other US groups. Although large, population-based genome-wide association studies (GWAS) have identified over 300 common polymorphisms modulating inter-individual BP variation, largely in European ancestry subjects, most of them do not localize to regions previously identified through family-based linkage studies. This discrepancy has remained unexplained despite the statistical power differences between current GWAS and prior linkage studies. To address this issue, we performed genome-wide linkage analysis of BP traits in African-American families from the Family Blood Pressure Program (FBPP) and genotyped on the Illumina Human Exome BeadChip v1.1. We identified a genomic region on chromosome 1q31 with LOD score 3.8 for pulse pressure (PP), a region we previously implicated in DBP studies of European ancestry families. Although no reported GWAS variants map to this region, combined linkage and association analysis of PP identified 81 rare and low frequency exonic variants accounting for the linkage evidence. Replication analysis in eight independent African ancestry cohorts (N = 16,968) supports this specific association with PP (P = 0.0509). Additional association and network analyses identified multiple potential candidate genes in this region expressed in multiple tissues and with a strong biological support for a role in BP. In conclusion, multiple genes and rare variants on 1q31 contribute to PP variation. Beyond producing new insights into PP, we demonstrate how family-based linkage and association studies can implicate specific rare and low frequency variants for complex traits.
Collapse
|
19
|
Hsieh AR, Chen DP, Chattopadhyay AS, Li YJ, Chang CC, Fann CSJ. A non-threshold region-specific method for detecting rare variants in complex diseases. PLoS One 2017; 12:e0188566. [PMID: 29190701 PMCID: PMC5708778 DOI: 10.1371/journal.pone.0188566] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2017] [Accepted: 11/09/2017] [Indexed: 11/23/2022] Open
Abstract
A region-specific method, NTR (non-threshold rare) variant detection method, was developed—it does not use the threshold for defining rare variants and accounts for directions of effects. NTR also considers linkage disequilibrium within the region and accommodates common and rare variants simultaneously. NTR weighs variants according to minor allele frequency and odds ratio to combine the effects of common and rare variants on disease occurrence into a single score and provides a test statistic to assess the significance of the score. In the simulations, under different effect sizes, the power of NTR increased as the effect size increased, and the type I error of our method was controlled well. Moreover, NTR was compared with several other existing methods, including the combined multivariate and collapsing method (CMC), weighted sum statistic method (WSS), sequence kernel association test (SKAT), and its modification, SKAT-O. NTR yields comparable or better power in simulations, especially when the effects of linkage disequilibrium between variants were at least moderate. In an analysis of diabetic nephropathy data, NTR detected more confirmed disease-related genes than the other aforementioned methods. NTR can thus be used as a complementary tool to help in dissecting the etiology of complex diseases.
Collapse
Affiliation(s)
- Ai-Ru Hsieh
- Graduate Institute of Biostatistics, China Medical University, Taichung, Taiwan
| | - Dao-Peng Chen
- Institute of Biomedical Sciences, Academia Sinica, Nankang, Taipei, Taiwan
| | | | - Ying-Ju Li
- Institute of Biomedical Sciences, Academia Sinica, Nankang, Taipei, Taiwan
| | - Chien-Ching Chang
- Institute of Biomedical Sciences, Academia Sinica, Nankang, Taipei, Taiwan
| | - Cathy S. J. Fann
- Institute of Biomedical Sciences, Academia Sinica, Nankang, Taipei, Taiwan
- * E-mail:
| |
Collapse
|
20
|
Ali F, Zhang J. Mixture model-based association analysis with case-control data in genome wide association studies. Stat Appl Genet Mol Biol 2017; 16:173-187. [PMID: 28723613 DOI: 10.1515/sagmb-2016-0022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Multilocus haplotype analysis of candidate variants with genome wide association studies (GWAS) data may provide evidence of association with disease, even when the individual loci themselves do not. Unfortunately, when a large number of candidate variants are investigated, identifying risk haplotypes can be very difficult. To meet the challenge, a number of approaches have been put forward in recent years. However, most of them are not directly linked to the disease-penetrances of haplotypes and thus may not be efficient. To fill this gap, we propose a mixture model-based approach for detecting risk haplotypes. Under the mixture model, haplotypes are clustered directly according to their estimated disease penetrances. A theoretical justification of the above model is provided. Furthermore, we introduce a hypothesis test for haplotype inheritance patterns which underpin this model. The performance of the proposed approach is evaluated by simulations and real data analysis. The results show that the proposed approach outperforms an existing multiple testing method.
Collapse
|
21
|
Wang H, Cade BE, Chen H, Gleason KJ, Saxena R, Feng T, Larkin EK, Vasan RS, Lin H, Patel SR, Tracy RP, Liu Y, Gottlieb DJ, Below JE, Hanis CL, Petty LE, Sunyaev SR, Frazier-Wood AC, Rotter JI, Post W, Lin X, Redline S, Zhu X. Variants in angiopoietin-2 (ANGPT2) contribute to variation in nocturnal oxyhaemoglobin saturation level. Hum Mol Genet 2017; 25:5244-5253. [PMID: 27798093 DOI: 10.1093/hmg/ddw324] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2016] [Accepted: 09/19/2016] [Indexed: 12/30/2022] Open
Abstract
Genetic determinants of sleep-disordered breathing (SDB), a common set of disorders that contribute to significant cardiovascular and neuropsychiatric morbidity, are not clear. Overnight nocturnal oxygen saturation (SaO2) is a clinically relevant and easily measured indicator of SDB severity but its genetic contribution has never been studied. Our recent study suggests nocturnal SaO2 is heritable. We performed linkage analysis, association analysis and haplotype analysis of average nocturnal oxyhaemoglobin saturation in participants in the Cleveland Family Study (CFS), followed by gene-based association and additional tests in four independent samples. Linkage analysis identified a peak (LOD = 4.29) on chromosome 8p23. Follow-up association analysis identified two haplotypes in angiopoietin-2 (ANGPT2) that significantly contributed to the variation of SaO2 (P = 8 × 10-5) and accounted for a portion of the linkage evidence. Gene-based association analysis replicated the association of ANGPT2 and nocturnal SaO2. A rare missense SNP rs200291021 in ANGPT2 was associated with serum angiopoietin-2 level (P = 1.29 × 10-4), which was associated with SaO2 (P = 0.002). Our study provides the first evidence for the association of ANGPT2, a gene previously implicated in acute lung injury syndromes, with nocturnal SaO2, suggesting that this gene has a broad range of effects on gas exchange, including influencing oxygenation during sleep.
Collapse
Affiliation(s)
- Heming Wang
- Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH, USA
| | - Brian E Cade
- Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA, USA.,Division of Sleep Medicine, Harvard Medical School, Boston, MA, USA
| | - Han Chen
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Kevin J Gleason
- Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA, USA.,Department of Public Health Sciences, University of Chicago, Chicago, IL, USA
| | - Richa Saxena
- Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA, USA.,Division of Sleep Medicine, Harvard Medical School, Boston, MA, USA.,Center for Human Genetic Research and Department of Anesthesia, Pain, and Critical Care Medicine, Massachusetts General Hospital, Boston, MA, USA.,Program in Medical and Population Genetics, Broad Institute, Cambridge, MA, USA
| | - Tao Feng
- Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH, USA
| | - Emma K Larkin
- Department of Medicine, Division of Allergy, Pulmonary and Critical Care, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Ramachandran S Vasan
- Preventive Medicine & Epidemiology, Boston University School of Medicine, Boston, MA, USA.,Framingham Heart Study, Framingham, MA
| | - Honghuang Lin
- Section of Computational Biomedicine, Department of Medicine, Boston University School of Medicine, Boston, MA, USA
| | - Sanjay R Patel
- Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA, USA.,Division of Sleep Medicine, Harvard Medical School, Boston, MA, USA.,Division of Pulmonary, Critical Care, and Sleep Medicine, Beth Israel Deaconess Medical Center, Boston, MA, USA
| | - Russell P Tracy
- Department of Pathology & Laboratory Medicine, University of Vermont, Burlington, VT, USA
| | - Yongmei Liu
- Epidemiology and Prevention Center for Genomics and Personalized Medicine Research, Wake Forest School of Medicine, Winston-Salem, NC, USA
| | - Daniel J Gottlieb
- Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA, USA.,Division of Sleep Medicine, Harvard Medical School, Boston, MA, USA.,Sleep Disorders Center, VA Boston Healthcare System, Boston, MA, USA
| | - Jennifer E Below
- Human Genetics Center, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Craig L Hanis
- Human Genetics Center, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Lauren E Petty
- Human Genetics Center, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Shamil R Sunyaev
- Program in Medical and Population Genetics, Broad Institute, Cambridge, MA, USA.,Division of Genetics, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | | | - Jerome I Rotter
- Institute for Translational Genomics and Population Sciences, Los Angeles BioMedical Research Institute at Harbor-UCLA Medical Center, Torrance, CA, USA
| | - Wendy Post
- Division of Cardiology, Johns Hopkins University, Baltimore, MD, USA
| | - Xihong Lin
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Susan Redline
- Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA, USA.,Division of Sleep Medicine, Harvard Medical School, Boston, MA, USA.,Division of Pulmonary, Critical Care, and Sleep Medicine, Beth Israel Deaconess Medical Center, Boston, MA, USA
| | - Xiaofeng Zhu
- Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH, USA
| |
Collapse
|
22
|
He KY, Wang H, Cade BE, Nandakumar P, Giri A, Ware EB, Haessler J, Liang J, Smith JA, Franceschini N, Le TH, Kooperberg C, Edwards TL, Kardia SLR, Lin X, Chakravarti A, Redline S, Zhu X. Rare variants in fox-1 homolog A (RBFOX1) are associated with lower blood pressure. PLoS Genet 2017; 13:e1006678. [PMID: 28346479 PMCID: PMC5386302 DOI: 10.1371/journal.pgen.1006678] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2016] [Revised: 04/10/2017] [Accepted: 03/09/2017] [Indexed: 12/23/2022] Open
Abstract
Many large genome-wide association studies (GWAS) have identified common blood pressure (BP) variants. However, most of the identified BP variants do not overlap with the linkage evidence observed from family studies. We thus hypothesize that multiple rare variants contribute to the observed linkage evidence. We performed linkage analysis using 517 individuals in 130 European families from the Cleveland Family Study (CFS) who have been genotyped on the Illumina OmniExpress Exome array. The largest linkage peak was observed on chromosome 16p13 (MLOD = 2.81) for systolic blood pressure (SBP). Follow-up conditional linkage and association analyses in the linkage region identified multiple rare, coding variants in RBFOX1 associated with reduced SBP. In a 17-member CFS family, carriers of the missense variant rs149974858 are normotensive despite being obese (average BMI = 60 kg/m2). Gene-based association test of rare variants using SKAT-O showed significant association with SBP (p-value = 0.00403) and DBP (p-value = 0.0258) in the CFS participants and the association was replicated in large independent replication studies (N = 57,234, p-value = 0.013 for SBP, 0.0023 for PP). RBFOX1 is expressed in brain tissues, the atrial appendage and left ventricle in the heart, and in skeletal muscle tissues, organs/tissues which are potentially related to blood pressure. Our study showed that associations of rare variants could be efficiently detected using family information.
Collapse
Affiliation(s)
- Karen Y. He
- Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, Ohio, United States of America
| | - Heming Wang
- Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, Ohio, United States of America
| | - Brian E. Cade
- Division of Sleep and Circadian Disorders, Brigham and Women’s Hospital, Boston, Massachusetts, United States of America
- Division of Sleep Medicine, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Priyanka Nandakumar
- McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America
| | - Ayush Giri
- Division of Epidemiology, Department of Medicine, Institute for Medicine and Public Health, Vanderbilt Genetics Institute, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Erin B. Ware
- Biosocial Methods Collaborative, Institute for Social Research, University of Michigan, Ann Arbor, Michigan, United States of America
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Jeffrey Haessler
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
| | - Jingjing Liang
- Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, Ohio, United States of America
| | - Jennifer A. Smith
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Nora Franceschini
- Department of Epidemiology, UNC Gillings School of Global Public Health, Chapel Hill, North Carolina, United States of America
| | - Thu H. Le
- Department of Medicine, Division of Nephrology, University of Virginia, Charlottesville, Virginia, United States of America
| | - Charles Kooperberg
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
| | - Todd L. Edwards
- Division of Epidemiology, Department of Medicine, Institute for Medicine and Public Health, Vanderbilt Genetics Institute, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Sharon L. R. Kardia
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Xihong Lin
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America
| | - Aravinda Chakravarti
- McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America
| | - Susan Redline
- Division of Sleep and Circadian Disorders, Brigham and Women’s Hospital, Boston, Massachusetts, United States of America
- Division of Sleep Medicine, Harvard Medical School, Boston, Massachusetts, United States of America
- Division of Pulmonary, Critical Care, and Sleep Medicine, Beth Israel Deaconess Medical Center, Boston, Massachusetts, United States of America
| | - Xiaofeng Zhu
- Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, Ohio, United States of America
- * E-mail:
| |
Collapse
|
23
|
Yang X, Wang S, Zhang S, Sha Q. Detecting association of rare and common variants based on cross-validation prediction error. Genet Epidemiol 2017; 41:233-243. [PMID: 28176359 DOI: 10.1002/gepi.22034] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2016] [Revised: 11/22/2016] [Accepted: 11/26/2016] [Indexed: 12/13/2022]
Abstract
Despite the extensive discovery of disease-associated common variants, much of the genetic contribution to complex traits remains unexplained. Rare variants may explain additional disease risk or trait variability. Although sequencing technology provides a supreme opportunity to investigate the roles of rare variants in complex diseases, detection of these variants in sequencing-based association studies presents substantial challenges. In this article, we propose novel statistical tests to test the association between rare and common variants in a genomic region and a complex trait of interest based on cross-validation prediction error (PE). We first propose a PE method based on Ridge regression. Based on PE, we also propose another two tests PE-WS and PE-TOW by testing a weighted combination of variants with two different weighting schemes. PE-WS is the PE version of the test based on the weighted sum statistic (WS) and PE-TOW is the PE version of the test based on the optimally weighted combination of variants (TOW). Using extensive simulation studies, we are able to show that (1) PE-TOW and PE-WS are consistently more powerful than TOW and WS, respectively, and (2) PE is the most powerful test when causal variants contain both common and rare variants.
Collapse
Affiliation(s)
- Xinlan Yang
- Department of Mathematical Sciences, Michigan Technological University, Houghton, MI, USA
| | | | - Shuanglin Zhang
- Department of Mathematical Sciences, Michigan Technological University, Houghton, MI, USA
| | - Qiuying Sha
- Department of Mathematical Sciences, Michigan Technological University, Houghton, MI, USA
| |
Collapse
|
24
|
Abstract
Population of ethnic mixtures can be useful in genetic studies. Admixture mapping, or mapping by admixture linkage disequilibrium (MALD), is specially developed for admixed populations and can supplement traditional genome-wide association analyses in the search for genetic variants underlying complex traits. Admixture mapping tests the association between a trait and locus-specific ancestries. The locus-specific ancestries are in linkage disequilibrium (LD), which is generated by an admixture process between genetically distinct ancestral populations. Because of the highly correlated-locus specific ancestries, admixture mapping performs many fewer independent tests across the genome than current genome-wide association analysis. Therefore, admixture mapping can be more powerful because it reduces the penalty due to multiple tests. In this chapter, we introduce the theory behind admixture mapping and explain how to conduct the analysis in practice.
Collapse
|
25
|
Zhu H, Wang Z, Wang X, Sha Q. A novel statistical method for rare-variant association studies in general pedigrees. BMC Proc 2016; 10:193-196. [PMID: 27980635 PMCID: PMC5133499 DOI: 10.1186/s12919-016-0029-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Both population-based and family-based designs are commonly used in genetic association studies to identify rare variants that underlie complex diseases. For any type of study design, the statistical power will be improved if rare variants can be enriched in the samples. Family-based designs, with ascertainment based on phenotype, may enrich the sample for causal rare variants and thus can be more powerful than population-based designs. Therefore, it is important to develop family-based statistical methods that can account for ascertainment. In this paper, we develop a novel statistical method for rare-variant association studies in general pedigrees for quantitative traits. This method uses a retrospective view that treats the traits as fixed and the genotypes as random, which allows us to account for complex and undefined ascertainment of families. We then apply the newly developed method to the Genetic Analysis Workshop 19 data set and compare the power of the new method with two other methods for general pedigrees. The results show that the newly proposed method increases power in most of the cases we consider, more than the other two methods.
Collapse
Affiliation(s)
- Huanhuan Zhu
- Department of Mathematical Sciences, Michigan Technological University, 1400 Townsend Drive, Houghton, MI 49931 USA
| | - Zhenchuan Wang
- Department of Mathematical Sciences, Michigan Technological University, 1400 Townsend Drive, Houghton, MI 49931 USA
| | - Xuexia Wang
- Department of Mathematics, University of North Texas, 1155 Union Circle #311430, Denton, TX 76203-5017 USA
| | - Qiuying Sha
- Department of Mathematical Sciences, Michigan Technological University, 1400 Townsend Drive, Houghton, MI 49931 USA
| |
Collapse
|
26
|
Liang J, Cade BE, Wang H, Chen H, Gleason KJ, Larkin EK, Saxena R, Lin X, Redline S, Zhu X. Comparison of Heritability Estimation and Linkage Analysis for Multiple Traits Using Principal Component Analyses. Genet Epidemiol 2016; 40:222-32. [PMID: 27027516 DOI: 10.1002/gepi.21957] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2015] [Revised: 11/30/2015] [Accepted: 12/14/2015] [Indexed: 12/16/2022]
Abstract
A disease trait often can be characterized by multiple phenotypic measurements that can provide complementary information on disease etiology, physiology, or clinical manifestations. Given that multiple phenotypes may be correlated and reflect common underlying genetic mechanisms, the use of multivariate analysis of multiple traits may improve statistical power to detect genes and variants underlying complex traits. The literature, however, has been unclear as to the optimal approach for analyzing multiple correlated traits. In this study, heritability and linkage analysis was performed for six obstructive sleep apnea hypopnea syndrome (OSAHS) related phenotypes, as well as principal components of the phenotypes and principal components of the heritability (PCHs) using the data from Cleveland Family Study, which include both African and European American families. Our study demonstrates that principal components generally result in higher heritability and linkage evidence than individual traits. Furthermore, the PCHs can be transferred across populations, strongly suggesting that these PCHs reflect traits with common underlying genetic mechanisms for OSAHS across populations. Thus, PCHs can provide useful traits for using data on multiple phenotypes and for genetic studies of trans-ethnic populations.
Collapse
Affiliation(s)
- Jingjing Liang
- Department of Epidemiology and Biostatistics, School of Medicine, Case Western Reserve University, Cleveland, Ohio, United States of America
| | - Brian E Cade
- Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, Massachusetts, United States of America.,Division of Sleep Medicine, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Heming Wang
- Department of Epidemiology and Biostatistics, School of Medicine, Case Western Reserve University, Cleveland, Ohio, United States of America
| | - Han Chen
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America
| | - Kevin J Gleason
- Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, Massachusetts, United States of America.,Division of Sleep Medicine, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Emma K Larkin
- Department of Medicine, Division of Allergy, Pulmonary and Critical Care, Vanderbilt University Medical Center, Nashville, Tennessee, United States of America
| | - Richa Saxena
- Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, Massachusetts, United States of America.,Division of Sleep Medicine, Harvard Medical School, Boston, Massachusetts, United States of America.,Center for Human Genetic Research and Department of Anesthesia, Pain, and Critical Care Medicine, Massachusetts General Hospital, Boston, Massachusetts, United States of America.,Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts, United States of America
| | - Xihong Lin
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America
| | - Susan Redline
- Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, Massachusetts, United States of America.,Division of Sleep Medicine, Harvard Medical School, Boston, Massachusetts, United States of America.,Division of Pulmonary, Critical Care, and Sleep Medicine, Beth Israel Deaconess Medical Center, Boston, Massachusetts, United States of America
| | - Xiaofeng Zhu
- Department of Epidemiology and Biostatistics, School of Medicine, Case Western Reserve University, Cleveland, Ohio, United States of America
| |
Collapse
|
27
|
Abstract
CONTEXT Polycystic ovary syndrome (PCOS) is a common complex genetic disease. It is characterized by hyperandrogenism, gonadotropin secretory changes, polycystic ovarian morphology, and insulin resistance. The etiology of PCOS remains unknown, but modern genetic approaches, such as genome-wide association studies (GWAS), Mendelian randomization, and next-generation sequencing, promise to identify the pathways that are primarily disrupted. EVIDENCE ACQUISITION The literature on PCOS, including the author's research, is discussed. EVIDENCE SYNTHESIS Recent genetic analyses are reviewed. CONCLUSIONS Considerable progress has been made mapping PCOS susceptibility genes. GWAS have implicated gonadotropin secretion and action as important primary defects in disease pathogenesis in European and Han Chinese PCOS cohorts, respectively. European women with the National Institutes of Health and Rotterdam phenotypes as well as those with self-reported PCOS have some gene regions in common, such as chromosome 11p14.1 region containing the FSH B polypeptide (FSHB) gene, suggesting shared genetic susceptibility. Several chromosomal signals are significant in both Han Chinese and European PCOS cohorts, suggesting that the susceptibility genes in these regions are evolutionarily conserved. In addition, GWAS have suggested that DENND1A, epidermal growth factor signaling, and DNA repair pathways play a role in PCOS pathogenesis. Only a small amount of the heritability of PCOS is accounted for by the common susceptibility variants mapped so far. Future studies should clarify the contribution of rare genetic variants and epigenetic factors to the PCOS phenotype. Furthermore, Mendelian randomization can be used to clarify causal relationships, and phenome-wide association studies can provide insight into health risks associated with PCOS susceptibility variants.
Collapse
Affiliation(s)
- Andrea Dunaif
- Division of Endocrinology, Metabolism, and Molecular Medicine, Department of Medicine, Feinberg School of Medicine, Northwestern University, Chicago, Illinois 60611
| |
Collapse
|
28
|
Chang LC, Li B, Fang Z, Vrieze S, McGue M, Iacono WG, Tseng GC, Chen W. A computational method for genotype calling in family-based sequencing data. BMC Bioinformatics 2016; 17:37. [PMID: 26772743 PMCID: PMC4715317 DOI: 10.1186/s12859-016-0880-5] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2015] [Accepted: 01/06/2016] [Indexed: 12/12/2022] Open
Abstract
Background As sequencing technologies can help researchers detect common and rare variants across the human genome in many individuals, it is known that jointly calling genotypes across multiple individuals based on linkage disequilibrium (LD) can facilitate the analysis of low to modest coverage sequence data. However, genotype-calling methods for family-based sequence data, particularly for complex families beyond parent-offspring trios, are still lacking. Results In this study, first, we proposed an algorithm that considers both linkage disequilibrium (LD) patterns and familial transmission in nuclear and multi-generational families while retaining the computational efficiency. Second, we extended our method to incorporate external reference panels to analyze family-based sequence data with a small sample size. In simulation studies, we show that modeling multiple offspring can dramatically increase genotype calling accuracy and reduce phasing and Mendelian errors, especially at low to modest coverage. In addition, we show that using external panels can greatly facilitate genotype calling of sequencing data with a small number of individuals. We applied our method to a whole genome sequencing study of 1339 individuals at ~10X coverage from the Minnesota Center for Twin and Family Research. Conclusions The aggregated results show that our methods significantly outperform existing ones that ignore family constraints or LD information. We anticipate that our method will be useful for many ongoing family-based sequencing projects. We have implemented our methods efficiently in a C++ program FamLDCaller, which is available from http://www.pitt.edu/~wec47/famldcaller.html. Electronic supplementary material The online version of this article (doi:10.1186/s12859-016-0880-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Lun-Ching Chang
- Division of Cancer Treatment and Diagnosis, National Cancer Institute, Bethesda, MD, 20892, USA.
| | - Bingshan Li
- Department of Molecular Physiology & Biophysics, Vanderbilt University Medical Center, Nashville, TN, 37232, USA.
| | - Zhou Fang
- Department of Biostatistics, University of Pittsburgh, Pittsburgh, PA, 15261, USA.
| | - Scott Vrieze
- Department of Psychology & Neuroscience, Institute for Behavioral Genetics, University of Colorado, Boulder, CO, 80309, USA.
| | - Matt McGue
- Department of Psychology, University of Minnesota, Minneapolis, MN, 55455, USA.
| | - William G Iacono
- Department of Psychology, University of Minnesota, Minneapolis, MN, 55455, USA.
| | - George C Tseng
- Department of Biostatistics, University of Pittsburgh, Pittsburgh, PA, 15261, USA.
| | - Wei Chen
- Department of Biostatistics, University of Pittsburgh, Pittsburgh, PA, 15261, USA. .,Division of Pulmonary Medicine, Allergy and Immunology, Children's Hospital of Pittsburgh of UPMC, Pittsburgh, PA, 15224, USA.
| |
Collapse
|
29
|
Galesloot TE, Janss LL, Burgess S, Kiemeney LALM, den Heijer M, de Graaf J, Holewijn S, Benyamin B, Whitfield JB, Swinkels DW, Vermeulen SH. Iron and hepcidin as risk factors in atherosclerosis: what do the genes say? BMC Genet 2015; 16:79. [PMID: 26159428 PMCID: PMC4498499 DOI: 10.1186/s12863-015-0246-4] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2015] [Accepted: 06/30/2015] [Indexed: 01/05/2023] Open
Abstract
Background Previous reports suggested a role for iron and hepcidin in atherosclerosis. Here, we evaluated the causality of these associations from a genetic perspective via (i) a Mendelian randomization (MR) approach, (ii) study of association of atherosclerosis-related single nucleotide polymorphisms (SNPs) with iron and hepcidin, and (iii) estimation of genomic correlations between hepcidin, iron and atherosclerosis. Results Analyses were performed in a general population sample. Iron parameters (serum iron, serum ferritin, total iron-binding capacity and transferrin saturation), serum hepcidin and genome-wide SNP data were available for N = 1,819; non-invasive measurements of atherosclerosis (NIMA), i.e., presence of plaque, intima media thickness and ankle-brachial index (ABI), for N = 549. For the MR, we used 12 iron-related SNPs that were previously identified in a genome-wide association meta-analysis on iron status, and assessed associations of individual SNPs and quartiles of a multi-SNP score with NIMA. Quartile 4 versus quartile 1 of the multi-SNP score showed directionally consistent associations with the hypothesized direction of effect for all NIMA in women, indicating that increased body iron status is a risk factor for atherosclerosis in women. We observed no single SNP associations that fit the hypothesized directions of effect between iron and NIMA, except for rs651007, associated with decreased ferritin concentration and decreased atherosclerosis risk. Two of six NIMA-related SNPs showed association with the ratio hepcidin/ferritin, suggesting that an increased hepcidin/ferritin ratio increases atherosclerosis risk. Genomic correlations were close to zero, except for hepcidin and ferritin with ABI at rest [−0.27 (SE 0.34) and −0.22 (SE 0.35), respectively] and ABI after exercise [−0.29 (SE 0.34) and −0.30 (0.35), respectively]. The negative sign indicates an increased atherosclerosis risk with increased hepcidin and ferritin concentrations. Conclusions Our results suggest a potential causal role for hepcidin and ferritin in atherosclerosis, and may indicate that iron status is causally related to atherosclerosis in women. Electronic supplementary material The online version of this article (doi:10.1186/s12863-015-0246-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Tessel E Galesloot
- Radboud university medical center, Radboud Institute for Health Sciences, Nijmegen, The Netherlands.
| | - Luc L Janss
- Department of Molecular Biology and Genetics, Aarhus University, Aarhus, Denmark.
| | - Stephen Burgess
- Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK.
| | - Lambertus A L M Kiemeney
- Radboud university medical center, Radboud Institute for Health Sciences, Nijmegen, The Netherlands.
| | - Martin den Heijer
- Department of Internal Medicine, VU Medical Centre, Amsterdam, The Netherlands.
| | - Jacqueline de Graaf
- Department of General Internal Medicine, Division of Vascular Medicine, Radboud university medical center, Nijmegen, The Netherlands.
| | - Suzanne Holewijn
- Department of General Internal Medicine, Division of Vascular Medicine, Radboud university medical center, Nijmegen, The Netherlands. .,Research Vascular Center Rijnstate, Arnhem, The Netherlands.
| | - Beben Benyamin
- The University of Queensland, Queensland Brain Institute, St Lucia, Queensland, 4072, Australia. .,QIMR Berghofer Medical Research Institute, Brisbane, Queensland, 4029, Australia.
| | - John B Whitfield
- QIMR Berghofer Medical Research Institute, Brisbane, Queensland, 4029, Australia.
| | - Dorine W Swinkels
- Radboud university medical center, Radboud Institute for Health Sciences, Nijmegen, The Netherlands.
| | - Sita H Vermeulen
- Radboud university medical center, Radboud Institute for Health Sciences, Nijmegen, The Netherlands.
| |
Collapse
|
30
|
Wang X, Zhang S, Li Y, Li M, Sha Q. A powerful approach to test an optimally weighted combination of rare variants in admixed populations. Genet Epidemiol 2015; 39:294-305. [PMID: 25758547 DOI: 10.1002/gepi.21894] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2014] [Revised: 01/09/2015] [Accepted: 01/26/2015] [Indexed: 11/09/2022]
Abstract
Population stratification has long been recognized as an issue in genetic association studies because unrecognized population stratification can lead to both false-positive and false-negative findings and can obscure true association signals if not appropriately corrected. This issue can be even worse in rare variant association analyses because rare variants often demonstrate stronger and potentially different patterns of stratification than common variants. To correct for population stratification in genetic association studies, we proposed a novel method to Test the effect of an Optimally Weighted combination of variants in Admixed populations (TOWA) in which the analytically derived optimal weights can be calculated from existing phenotype and genotype data. TOWA up weights rare variants and those variants that have strong associations with the phenotype. Additionally, it can adjust for the direction of the association, and allows for local ancestry difference among study subjects. Extensive simulations show that the type I error rate of TOWA is under control in the presence of population stratification and it is more powerful than existing methods. We have also applied TOWA to a real sequencing data. Our simulation studies as well as real data analysis results indicate that TOWA is a useful tool for rare variant association analyses in admixed populations.
Collapse
Affiliation(s)
- Xuexia Wang
- Joseph J. Zilber School of Public Health, University of Wisconsin-Milwaukee, Milwaukee, Wisconsin, United States of America
| | | | | | | | | |
Collapse
|
31
|
Kullback-Leibler divergence for detection of rare haplotype common disease association. Eur J Hum Genet 2015; 23:1558-65. [PMID: 25735482 DOI: 10.1038/ejhg.2015.25] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2014] [Revised: 11/16/2014] [Accepted: 01/28/2015] [Indexed: 12/12/2022] Open
Abstract
Rare haplotypes may tag rare causal variants of common diseases; hence, detection of such rare haplotypes may also contribute to our understanding of complex disease etiology. Because rare haplotypes frequently result from common single-nucleotide polymorphisms (SNPs), focusing on rare haplotypes is much more economical compared with using rare single-nucleotide variants (SNVs) from sequencing, as SNPs are available and 'free' from already amassed genome-wide studies. Further, associated haplotypes may shed light on the underlying disease causal mechanism, a feat unmatched by SNV-based collapsing methods. In recent years, data mining approaches have been adapted to detect rare haplotype association. However, as they rely on an assumed underlying disease model and require the specification of a null haplotype, results can be erroneous if such assumptions are violated. In this paper, we present a haplotype association method based on Kullback-Leibler divergence (hapKL) for case-control samples. The idea is to compare haplotype frequencies for the cases versus the controls by computing symmetrical divergence measures. An important property of such measures is that both the frequencies and logarithms of the frequencies contribute in parallel, thus balancing the contributions from rare and common, and accommodating both deleterious and protective, haplotypes. A simulation study under various scenarios shows that hapKL has well-controlled type I error rates and good power compared with existing data mining methods. Application of hapKL to age-related macular degeneration (AMD) shows a strong association of the complement factor H (CFH) gene with AMD, identifying several individual rare haplotypes with strong signals.
Collapse
|
32
|
Wang M, Lin S. Detecting associations of rare variants with common diseases: collapsing or haplotyping? Brief Bioinform 2015; 16:759-68. [PMID: 25596401 DOI: 10.1093/bib/bbu050] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2014] [Indexed: 01/11/2023] Open
Abstract
In recent years, a myriad of new statistical methods have been proposed for detecting associations of rare single-nucleotide variants (SNVs) with common diseases. These methods can be generally classified as 'collapsing' or 'haplotyping' based. The former is the predominant class, composed of most of the rare variant association methods proposed to date. However, recent works have suggested that haplotyping-based methods may offer advantages and can even be more powerful than collapsing methods in certain situations. In this article, we review and compare collapsing- versus haplotyping-based methods/software in terms of both power and type I error. For collapsing methods, we consider three approaches: Combined Multivariate and Collapsing, Sequence Kernel Association Test and Family-Based Association Test (FBAT): the first two are population based and are among the most popular; the last test is family based, a modification from the popular FBAT to accommodate rare SNVs. For haplotyping-based methods, we include Logistic Bayesian Lasso (LBL) for population data and family-based LBL (famLBL) for family (trio) data. These two methods are selected, as they can be used to test association for specific rare and common haplotypes. Our results show that haplotype methods can be more powerful than collapsing methods if there are interacting SNVs leading to larger haplotype effects. Even if only common SNVs are genotyped, haplotype methods can still detect specific rare haplotypes that tag rare causal SNVs. As expected, family-based methods are robust, whereas population-based methods are susceptible, to population substructure. However, the population-based haplotype approach appears to have smaller inflation of type I error than its collapsing counterparts.
Collapse
|
33
|
Wei C, Li M, He Z, Vsevolozhskaya O, Schaid DJ, Lu Q. A weighted U-statistic for genetic association analyses of sequencing data. Genet Epidemiol 2014; 38:699-708. [PMID: 25331574 DOI: 10.1002/gepi.21864] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2014] [Revised: 08/15/2014] [Accepted: 09/05/2014] [Indexed: 12/13/2022]
Abstract
With advancements in next-generation sequencing technology, a massive amount of sequencing data is generated, which offers a great opportunity to comprehensively investigate the role of rare variants in the genetic etiology of complex diseases. Nevertheless, the high-dimensional sequencing data poses a great challenge for statistical analysis. The association analyses based on traditional statistical methods suffer substantial power loss because of the low frequency of genetic variants and the extremely high dimensionality of the data. We developed a Weighted U Sequencing test, referred to as WU-SEQ, for the high-dimensional association analysis of sequencing data. Based on a nonparametric U-statistic, WU-SEQ makes no assumption of the underlying disease model and phenotype distribution, and can be applied to a variety of phenotypes. Through simulation studies and an empirical study, we showed that WU-SEQ outperformed a commonly used sequence kernel association test (SKAT) method when the underlying assumptions were violated (e.g., the phenotype followed a heavy-tailed distribution). Even when the assumptions were satisfied, WU-SEQ still attained comparable performance to SKAT. Finally, we applied WU-SEQ to sequencing data from the Dallas Heart Study (DHS), and detected an association between ANGPTL 4 and very low density lipoprotein cholesterol.
Collapse
Affiliation(s)
- Changshuai Wei
- Department of Epidemiology and Biostatistics, Michigan State University, East Lansing, Michigan, United States of America; Department of Biostatistics and Epidemiology, University of North Texas Health Science Center, Fort Worth, Texas, United States of America
| | | | | | | | | | | |
Collapse
|
34
|
Milane A, Abdallah J, Kanbar R, Khazen G, Ghassibe-Sabbagh M, Salloum AK, Youhanna S, Saad A, El Bayeh H, Chammas E, Platt DE, Hager J, Gauguier D, Zalloua P, Abchee A. Association of hypertension with coronary artery disease onset in the Lebanese population. SPRINGERPLUS 2014; 3:533. [PMID: 25279324 PMCID: PMC4176843 DOI: 10.1186/2193-1801-3-533] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/13/2014] [Accepted: 09/09/2014] [Indexed: 12/22/2022]
Abstract
The onset of coronary artery disease (CAD) is influenced by cardiovascular risk factors that often occur in clusters and may build on one another. The objective of this study is to examine the relationship between hypertension and CAD age of onset in the Lebanese population. This retrospective analysis was performed on data extracted from Lebanese patients (n = 3,753). Logistic regression examined the association of hypertension with the age at CAD diagnosis after controlling for other traditional risk factors. The effect of antihypertensive drugs and lifestyle changes on the onset of CAD was also investigated. Results showed that hypertension is associated with late onset CAD (OR=0.656, 95% CI=0.504-0.853, p=0.001). Use of antihypertensive drugs showed a similar association with delayed CAD onset. When comparing age of onset in CAD patients with traditional risk factors such as hypertension, diabetes, hyperlipidemia, obesity, smoking and family history of CAD, the age of onset was significantly higher for patients with hypertension compared to those with any of the other risk factors studied (p < 0.001). In conclusion, hypertension and its treatment are associated with late coronary atherosclerotic manifestations in Lebanese population. This observation is currently under investigation to clarify its genetic and/or environmental mechanisms.
Collapse
Affiliation(s)
- Aline Milane
- Lebanese American University, School of Pharmacy, Byblos 36, Lebanon
| | - Jad Abdallah
- Lebanese American University, School of Pharmacy, Byblos 36, Lebanon
| | - Roy Kanbar
- Lebanese American University, School of Pharmacy, Byblos 36, Lebanon
| | - Georges Khazen
- School of Arts and Sciences, Lebanese American University, Byblos 36, Lebanon
| | | | | | - Sonia Youhanna
- School of Medicine, Lebanese American University, Beirut, 1102 2801 Lebanon
| | - Aline Saad
- Lebanese American University, School of Pharmacy, Byblos 36, Lebanon
| | - Hamid El Bayeh
- School of Medicine, Lebanese American University, Beirut, 1102 2801 Lebanon
| | - Elie Chammas
- School of Medicine, Lebanese American University, Beirut, 1102 2801 Lebanon
| | - Daniel E Platt
- Bioinformatics and Pattern Discovery, IBM T. J. Watson Research Centre, Yorktown Hgts, NY 10598 USA
| | - Jörg Hager
- CEA-Genomics Institute, Centre National de Génotypage, Evry, 91057 France
| | - Dominique Gauguier
- The Wellcome Trust Centre for Human Genetics, University of Oxford, Roosevelt Drive, Headington, Oxford, OX3 7BN UK ; INSERM UMRS872, Centre de Recherche des Cordeliers, 15 Rue de l'école de Médecine, Paris, 75006 France
| | - Pierre Zalloua
- School of Medicine, Lebanese American University, Beirut, 1102 2801 Lebanon ; Harvard School of Public Health, Boston, MA 02215 USA
| | - Antoine Abchee
- Department of Internal Medicine, American University of Beirut, Beirut, Lebanon
| | | |
Collapse
|
35
|
Zhang Q, Wang L, Koboldt D, Boreki IB, Province MA. Adjusting family relatedness in data-driven burden test of rare variants. Genet Epidemiol 2014; 38:722-7. [PMID: 25169066 DOI: 10.1002/gepi.21848] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2014] [Revised: 07/01/2014] [Accepted: 07/16/2014] [Indexed: 11/08/2022]
Abstract
Family data represent a rich resource for detecting association between rare variants (RVs) and human traits. However, most RV association analysis methods developed in recent years are data-driven burden tests which can adaptively learn weights from data but require permutation to evaluate significance, thus are not readily applicable to family data, because random permutation will destroy family structure. Direct application of these methods to family data may result in a significant inflation of false positives. To overcome this issue, we have developed a generalized, weighted sum mixed model (WSMM), and corresponding computational techniques that can incorporate family information into data-driven burden tests, and allow adaptive and efficient permutation test in family data. Using simulated and real datasets, we demonstrate that the WSMM method can be used to appropriately adjust for genetic relatedness among family members and has a good control for the inflation of false positives. We compare WSMM with a nondata-driven, family-based Sequence Kernel Association Test (famSKAT), showing that WSMM has significantly higher power in some cases. WSMM provides a generalized, flexible framework for adapting different data-driven burden tests to analyze data with any family structures, and it can be extended to binary and time-to-onset traits, with or without covariates.
Collapse
Affiliation(s)
- Qunyuan Zhang
- Division of Statistical Genomics, Washington University School of Medicine, St. Louis, Missouri, United States of America
| | | | | | | | | |
Collapse
|
36
|
Guo W, Shugart YY. The power comparison of the haplotype-based collapsing tests and the variant-based collapsing tests for detecting rare variants in pedigrees. BMC Genomics 2014; 15:632. [PMID: 25070353 PMCID: PMC4131059 DOI: 10.1186/1471-2164-15-632] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2013] [Accepted: 07/18/2014] [Indexed: 11/20/2022] Open
Abstract
Background Both common and rare genetic variants have been shown to contribute to the etiology of complex diseases. Recent genome-wide association studies (GWAS) have successfully investigated how common variants contribute to the genetic factors associated with common human diseases. However, understanding the impact of rare variants, which are abundant in the human population (one in every 17 bases), remains challenging. A number of statistical tests have been developed to analyze collapsed rare variants identified by association tests. Here, we propose a haplotype-based approach. This work inspired by an existing statistical framework of the pedigree disequilibrium test (PDT), which uses genetic data to assess the effects of variants in general pedigrees. We aim to compare the performance between the haplotype-based approach and the rare variant-based approach for detecting rare causal variants in pedigrees. Results Extensive simulations in the sequencing setting were carried out to evaluate and compare the haplotype-based approach with the rare variant methods that drew on a more conventional collapsing strategy. As assessed through a variety of scenarios, the haplotype-based pedigree tests had enhanced statistical power compared with the rare variants based pedigree tests when the disease of interest was mainly caused by rare haplotypes (with multiple rare alleles), and vice versa when disease was caused by rare variants acting independently. For most of other situations when disease was caused both by haplotypes with multiple rare alleles and by rare variants with similar effects, these two approaches provided similar power in testing for association. Conclusions The haplotype-based approach was designed to assess the role of rare and potentially causal haplotypes. The proposed rare variants-based pedigree tests were designed to assess the role of rare and potentially causal variants. This study clearly documented the situations under which either method performs better than the other. All tests have been implemented in a software, which was submitted to the Comprehensive R Archive Network (CRAN) for general use as a computer program named rvHPDT.
Collapse
Affiliation(s)
| | - Yin Yao Shugart
- Division of Intramural Division Program, National Institute of Mental Health, National Institute of Health, 35 Convent Drive, Bethesda, MD 20892, USA.
| |
Collapse
|
37
|
Abstract
The cost of next-generation sequencing is now approaching that of the first generation of genome-wide single-nucleotide genotyping panels, but this is still out of reach for large-scale epidemiologic studies with tens of thousands of subjects. Furthermore, the anticipated yield of millions of rare variants poses serious challenges for distinguishing causal from noncausal variants for disease. We explore the merits of using family-based designs for sequencing substudies to identify novel variants and prioritize them for their likelihood of causality. While the sharing of variants within families means that family-based designs may be less efficient for discovery than sequencing of a comparable number of unrelated individuals, the ability to exploit cosegregation of variants with disease within families helps distinguish causal from noncausal ones. We introduce a score test criterion for prioritizing discovered variants in terms of their likelihood of being functional. We compare the relative statistical efficiency of 2-stage versus1-stage family-based designs by application to the Genetic Analysis Workshop 18 simulated sequence data.
Collapse
Affiliation(s)
- Zhao Yang
- Department of Preventive Medicine, University of Southern California, Los Angeles, CA 90089-9234, USA
| | - Duncan C Thomas
- Department of Preventive Medicine, University of Southern California, Los Angeles, CA 90089-9234, USA
| |
Collapse
|
38
|
Feng T, Zhu X. Whole genome sequencing data from pedigrees suggests linkage disequilibrium among rare variants created by population admixture. BMC Proc 2014; 8:S44. [PMID: 25519326 PMCID: PMC4143626 DOI: 10.1186/1753-6561-8-s1-s44] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Next-generation sequencing technologies have been designed to discover rare and de novo variants and are an important tool for identifying rare disease variants. Many statistical methods have been developed to test, using next-generation sequencing data, for rare variants that are associated with a trait. However, many of these methods make assumptions that rare variants are in linkage equilibrium in a gene. In this report, we studied whether transmitted or untransmitted haplotypes carry an excess of rare variants using the whole genome sequencing data of 15 large Mexican American pedigrees provided by the Genetic Analysis Workshop 18. We observed that an excess of rare variants are carried on either transmitted or nontransmitted haplotypes from parents to offspring. Further analyses suggest that such nonrandom associations among rare variants can be attributed to population admixture and single-nucleotide variant calling errors. Our results have significant implications for rare variant association studies, especially those conducted in admixed populations.
Collapse
Affiliation(s)
- Tao Feng
- Department of Epidemiology and Biostatistics, Case Western Reserve University, Wolstein Research Building, 2103 Cornell Road, Cleveland, OH 44106, USA
| | - Xiaofeng Zhu
- Department of Epidemiology and Biostatistics, Case Western Reserve University, Wolstein Research Building, 2103 Cornell Road, Cleveland, OH 44106, USA
| |
Collapse
|
39
|
Wang H, Zhu X. De novo mutations discovered in 8 Mexican American families through whole genome sequencing. BMC Proc 2014; 8:S24. [PMID: 25519376 PMCID: PMC4143763 DOI: 10.1186/1753-6561-8-s1-s24] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
De novo mutations enrich the sequence diversity and carry the clue of evolutional selection. Recent studies suggest the de novo mutations could be one of the risk factors for complex diseases. We conducted a survey of de novo mutations using the whole genome sequence data but only available on the odd autosomes of Mexican American families provided by Genetic Analysis Workshop 18. We extracted 8 three-generation families who have sequencing data available from 20 large pedigrees. By comparing the known single nucleotide variants (SNVs) in dbSNP129 and the de novo variants transmitted in the Mexican American families, we were able to estimate a de novo mutation rate of 1.64(±0.42) × 10(-8) per position per haploid genome. This result is consistent with the estimates in literature that required many extensive validation efforts, such as genotyping and further resequencing. Our analysis suggests the importance of using family samples for studying rare variants.
Collapse
Affiliation(s)
- Heming Wang
- Department of Epidemiology and Biostatistics, Case Western Reserve University, 10900 Euclid Ave, Cleveland, OH 44106-4945, USA
| | - Xiaofeng Zhu
- Department of Epidemiology and Biostatistics, Case Western Reserve University, 10900 Euclid Ave, Cleveland, OH 44106-4945, USA
| |
Collapse
|
40
|
Mathias RA. Introduction to genetics and genomics in asthma: genetics of asthma. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2014; 795:125-55. [PMID: 24162907 DOI: 10.1007/978-1-4614-8603-9_9] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
While asthma is a heterogeneous disease, a strong genetic basis has been firmly established. Rather than being a single disease entity, asthma consists of related, overlapping syndromes [Barnes (Proc Am Thor Soc 8:143-148, 2011)] including three general domains: variable airway obstruction, airway hyper-responsiveness, and airway inflammation with a considerable proportion, but not all, of asthma being IgE-mediated further adding to its heterogeneity. This chapter reviews the approaches to the elucidation of genetics of asthma from the early evidence of familial clustering to the current state of knowledge with genome-wide approaches. The conclusion is that research efforts have led to a tremendous repository of genetic determinants of asthma, most of which fall into the above phenotypic domains of the syndrome. We now look to future integrative approaches of genetics, genomics (Chap. 10), and epigenetics (Chap. 11) to better understand the causal mechanism through which, these genetic loci act in manifesting asthma.
Collapse
Affiliation(s)
- Rasika Ann Mathias
- Division of Allergy and Clinical Immunology, Johns Hopkins University School of Medicine, 5501 Hopkins Bayview Circle, 3B.79, Baltimore, MD, 21224, USA,
| |
Collapse
|
41
|
Wang M, Lin S. FamLBL: detecting rare haplotype disease association based on common SNPs using case-parent triads. ACTA ACUST UNITED AC 2014; 30:2611-8. [PMID: 24849576 DOI: 10.1093/bioinformatics/btu347] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
MOTIVATION In recent years, there has been an increasing interest in using common single-nucleotide polymorphisms (SNPs) amassed in genome-wide association studies to investigate rare haplotype effects on complex diseases. Evidence has suggested that rare haplotypes may tag rare causal single-nucleotide variants, making SNP-based rare haplotype analysis not only cost effective, but also more valuable for detecting causal variants. Although a number of methods for detecting rare haplotype association have been proposed in recent years, they are population based and thus susceptible to population stratification. RESULTS We propose family-triad-based logistic Bayesian Lasso (famLBL) for estimating effects of haplotypes on complex diseases using SNP data. By choosing appropriate prior distribution, effect sizes of unassociated haplotypes can be shrunk toward zero, allowing for more precise estimation of associated haplotypes, especially those that are rare, thereby achieving greater detection power. We evaluate famLBL using simulation to gauge its type I error and power. Compared with its population counterpart, LBL, highlights famLBL's robustness property in the presence of population substructure. Further investigation by comparing famLBL with Family-Based Association Test (FBAT) reveals its advantage for detecting rare haplotype association. AVAILABILITY AND IMPLEMENTATION famLBL is implemented as an R-package available at http://www.stat.osu.edu/∼statgen/SOFTWARE/LBL/.
Collapse
Affiliation(s)
- Meng Wang
- Department of Statistics, The Ohio State University, Columbus, OH 43210, USA
| | - Shili Lin
- Department of Statistics, The Ohio State University, Columbus, OH 43210, USA
| |
Collapse
|
42
|
Abstract
This article focuses on conducting global testing for association between a binary trait and a set of rare variants (RVs), although its application can be much broader to other types of traits, common variants (CVs), and gene set or pathway analysis. We show that many of the existing tests have deteriorating performance in the presence of many nonassociated RVs: their power can dramatically drop as the proportion of nonassociated RVs in the group to be tested increases. We propose a class of so-called sum of powered score (SPU) tests, each of which is based on the score vector from a general regression model and hence can deal with different types of traits and adjust for covariates, e.g., principal components accounting for population stratification. The SPU tests generalize the sum test, a representative burden test based on pooling or collapsing genotypes of RVs, and a sum of squared score (SSU) test that is closely related to several other powerful variance component tests; a previous study (Basu and Pan 2011) has demonstrated good performance of one, but not both, of the Sum and SSU tests in many situations. The SPU tests are versatile in the sense that one of them is often powerful, although its identity varies with the unknown true association parameters. We propose an adaptive SPU (aSPU) test to approximate the most powerful SPU test for a given scenario, consequently maintaining high power and being highly adaptive across various scenarios. We conducted extensive simulations to show superior performance of the aSPU test over several state-of-the-art association tests in the presence of many nonassociated RVs. Finally we applied the SPU and aSPU tests to the GAW17 mini-exome sequence data to compare its practical performance with some existing tests, demonstrating their potential usefulness.
Collapse
|
43
|
Test of rare variant association based on affected sib-pairs. Eur J Hum Genet 2014; 23:229-37. [PMID: 24667785 DOI: 10.1038/ejhg.2014.43] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2013] [Revised: 11/06/2013] [Accepted: 12/30/2013] [Indexed: 11/08/2022] Open
Abstract
With the development of sequencing techniques, there is increasing interest to detect associations between rare variants and complex traits. Quite a few statistical methods to detect associations between rare variants and complex traits have been developed for unrelated individuals. Statistical methods for detecting rare variant associations under family-based designs have not received as much attention as methods for unrelated individuals. Recent studies show that rare disease variants will be enriched in family data and thus family-based designs may improve power to detect rare variant associations. In this article, we propose a novel test to test association between the optimally weighted combination of variants and trait of interests for affected sib-pairs. The optimal weights are analytically derived and can be calculated from sampled genotypes and phenotypes. Based on the optimal weights, the proposed method is robust to the directions of the effects of causal variants and is less affected by neutral variants than existing methods are. Our simulation results show that, in all the cases, the proposed method is substantially more powerful than existing methods based on unrelated individuals and existing methods based on affected sib-pairs.
Collapse
|
44
|
Sha Q, Zhang S. A novel test for testing the optimally weighted combination of rare and common variants based on data of parents and affected children. Genet Epidemiol 2014; 38:135-43. [PMID: 24382753 PMCID: PMC4162402 DOI: 10.1002/gepi.21787] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2013] [Revised: 10/28/2013] [Accepted: 12/02/2013] [Indexed: 11/10/2022]
Abstract
With the development of sequencing technologies, the direct testing of rare variant associations has become possible. Many statistical methods for detecting associations between rare variants and complex diseases have recently been developed, most of which are population-based methods for unrelated individuals. A limitation of population-based methods is that spurious associations can occur when there is a population structure. For rare variants, this problem can be more serious, because the spectrum of rare variation can be very different in diverse populations, as well as the current nonexistence of methods to control for population stratification in population-based rare variant associations. A solution to the problem of population stratification is to use family-based association tests, which use family members to control for population stratification. In this article, we propose a novel test for Testing the Optimally Weighted combination of variants based on data of Parents and Affected Children (TOW-PAC). TOW-PAC is a family-based association test that tests the combined effect of rare and common variants in a genomic region, and is robust to the directions of the effects of causal variants. Simulation studies confirm that, for rare variant associations, family-based association tests are robust to population stratification although population-based association tests can be seriously confounded by population stratification. The results of power comparisons show that the power of TOW-PAC increases with an increase of the number of affected children in each family and TOW-PAC based on multiple affected children per family is more powerful than TOW based on unrelated individuals.
Collapse
Affiliation(s)
- Qiuying Sha
- Department of Mathematical Sciences, Michigan Technological University, Houghton, Michigan, United States of America
| | | |
Collapse
|
45
|
Li B, Liu DJ, Leal SM. Identifying rare variants associated with complex traits via sequencing. ACTA ACUST UNITED AC 2014; Chapter 1:Unit 1.26. [PMID: 23853079 DOI: 10.1002/0471142905.hg0126s78] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Although genome-wide association studies have been successful in detecting associations with common variants, there is currently an increasing interest in identifying low-frequency and rare variants associated with complex traits. Next-generation sequencing technologies make it feasible to survey the full spectrum of genetic variation in coding regions or the entire genome. The association analysis for rare variants is challenging, and traditional methods are ineffective, however, due to the low frequency of rare variants, coupled with allelic heterogeneity. Recently a battery of new statistical methods has been proposed for identifying rare variants associated with complex traits. These methods test for associations by aggregating multiple rare variants across a gene or a genomic region or among a group of variants in the genome. In this unit, we describe key concepts for rare variant association for complex traits, survey some of the recent methods, discuss their statistical power under various scenarios, and provide practical guidance on analyzing next-generation sequencing data for identifying rare variants associated with complex traits.
Collapse
Affiliation(s)
- Bingshan Li
- Department of Molecular Physiology and Biophysics, Center for Human Genetics Research, Vanderbilt University, Nashville, Tennessee, USA
| | | | | |
Collapse
|
46
|
Turkmen AS, Lin S. Blocking approach for identification of rare variants in family-based association studies. PLoS One 2014; 9:e86126. [PMID: 24465912 PMCID: PMC3900483 DOI: 10.1371/journal.pone.0086126] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2013] [Accepted: 12/09/2013] [Indexed: 01/14/2023] Open
Abstract
With the advent of next-generation sequencing technology, rare variant association analysis is increasingly being conducted to identify genetic variants associated with complex traits. In recent years, significant effort has been devoted to develop powerful statistical methods to test such associations for population-based designs. However, there has been relatively little development for family-based designs although family data have been shown to be more powerful to detect rare variants. This study introduces a blocking approach that extends two popular family-based common variant association tests to rare variants association studies. Several options are considered to partition a genomic region (gene) into "independent" blocks by which information from SNVs is aggregated within a block and an overall test statistic for the entire genomic region is calculated by combining information across these blocks. The proposed methodology allows different variants to have different directions (risk or protective) and specification of minor allele frequency threshold is not needed. We carried out a simulation to verify the validity of the method by showing that type I error is well under control when the underlying null hypothesis and the assumption of independence across blocks are satisfied. Further, data from the Genetic Analysis Workshop [Formula: see text] are utilized to illustrate the feasibility and performance of the proposed methodology in a realistic setting.
Collapse
Affiliation(s)
- Asuman S Turkmen
- Statistics Department, The Ohio State University, Columbus, Ohio, United States of America ; Statistics Department, The Ohio State University, Newark, Ohio, United States of America
| | - Shili Lin
- Statistics Department, The Ohio State University, Columbus, Ohio, United States of America
| |
Collapse
|
47
|
Won S, Kim Y, Lange C. On rare-variant analysis in population-based designs: decomposing the likelihood to two informative components. Hum Hered 2014; 76:76-85. [PMID: 24434864 DOI: 10.1159/000357643] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2012] [Accepted: 11/29/2013] [Indexed: 11/19/2022] Open
Abstract
Various analytical approaches have been suggested for the characterization of rare variants. One main approach is to collapse the genetic information of rare variants in a region and to construct an overall test statistic. Here, we proposed a new approach based on collapsed genotype scores. By utilizing the information of the association signal that is ignored in collapsing methods, i.e. the configuration of rare alleles, we constructed a more powerful test and compared it with existing rare-variant approaches. With extensive simulation studies, we showed that our method performs better than existing approaches, and we applied our method to a sequencing study of nonsyndromic cleft lip illustrating the practical advantages of the proposed method.
Collapse
Affiliation(s)
- Sungho Won
- Department of Applied Statistics, Chung-Ang University, Seoul, Korea
| | | | | |
Collapse
|
48
|
Xu Y, Gong W, Peng J, Wang H, Huang J, Ding H, Wang DW. Functional analysis LRP6 novel mutations in patients with coronary artery disease. PLoS One 2014; 9:e84345. [PMID: 24427284 PMCID: PMC3888387 DOI: 10.1371/journal.pone.0084345] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2013] [Accepted: 11/14/2013] [Indexed: 02/05/2023] Open
Abstract
Background Genetic architecture of coronary artery disease (CAD) is still to be defined. Since low density lipoprotein receptor-related protein 6 (LRP6) gene play critical roles in Wnt signal transduction which are important for vascular development and endodermis specification, we therefore resequenced it to search for mutations in CAD patients. Methods We systemically sequenced all the exons and promoter region of LRP6 gene in a sample of 380 early onset CAD patients and 380 control subjects in Chinese. Results In total, we identified 5 patient-specific mutations including K82N (two patients), S488Y (one patient), P1066T (two patients), P1206H (two patients) and I1264V (one patient) All these mutations located at the extracellular domain of LRP6 gene. In vitro functional analysis of patient-specific mutations demonstrated that these mutations resulted in a significant reduction in both protein level transporting to cell membrane and downstream Wnt signal activity. Furthermore, we found that LRP6 novel mutations attenuated proliferation and migration of human umbilical vein endothelial cells (HUVECs) when compared with wild type (WT) LRP6. Conclusion Our results demonstrated that these loss-of-function variants might contribute to disease liability in a subset of CAD and defects in Wnt signal activation might be important contributing factors for the onset of CAD.
Collapse
Affiliation(s)
- Yujun Xu
- The Institute of Hypertension and Department of Internal Medicine, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
- Intensive Care Unit, West China Hospital, Sichuan University, Chengdu, China
| | - Wei Gong
- The Institute of Hypertension and Department of Internal Medicine, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Jia Peng
- The Institute of Hypertension and Department of Internal Medicine, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
- Echocardiography Laboratory, Sichuan Provincial Hospital, Chengdu, China
| | - Haoran Wang
- The Institute of Hypertension and Department of Internal Medicine, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Jin Huang
- The Institute of Hypertension and Department of Internal Medicine, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Hu Ding
- The Institute of Hypertension and Department of Internal Medicine, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
- Genetic Diagnosis Center, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
- * E-mail: (DWW); (HD)
| | - Dao Wen Wang
- The Institute of Hypertension and Department of Internal Medicine, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
- Genetic Diagnosis Center, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
- * E-mail: (DWW); (HD)
| |
Collapse
|
49
|
Thomas DC, Yang Z, Yang F. Two-phase and family-based designs for next-generation sequencing studies. Front Genet 2013; 4:276. [PMID: 24379824 PMCID: PMC3861783 DOI: 10.3389/fgene.2013.00276] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2013] [Accepted: 11/19/2013] [Indexed: 12/21/2022] Open
Abstract
The cost of next-generation sequencing is now approaching that of early GWAS panels, but is still out of reach for large epidemiologic studies and the millions of rare variants expected poses challenges for distinguishing causal from non-causal variants. We review two types of designs for sequencing studies: two-phase designs for targeted follow-up of genomewide association studies using unrelated individuals; and family-based designs exploiting co-segregation for prioritizing variants and genes. Two-phase designs subsample subjects for sequencing from a larger case-control study jointly on the basis of their disease and carrier status; the discovered variants are then tested for association in the parent study. The analysis combines the full sequence data from the substudy with the more limited SNP data from the main study. We discuss various methods for selecting this subset of variants and describe the expected yield of true positive associations in the context of an on-going study of second breast cancers following radiotherapy. While the sharing of variants within families means that family-based designs are less efficient for discovery than sequencing unrelated individuals, the ability to exploit co-segregation of variants with disease within families helps distinguish causal from non-causal ones. Furthermore, by enriching for family history, the yield of causal variants can be improved and use of identity-by-descent information improves imputation of genotypes for other family members. We compare the relative efficiency of these designs with those using unrelated individuals for discovering and prioritizing variants or genes for testing association in larger studies. While associations can be tested with single variants, power is low for rare ones. Recent generalizations of burden or kernel tests for gene-level associations to family-based data are appealing. These approaches are illustrated in the context of a family-based study of colorectal cancer.
Collapse
Affiliation(s)
- Duncan C Thomas
- Department of Preventive Medicine, University of Southern California Los Angeles, CA, USA
| | - Zhao Yang
- Department of Preventive Medicine, University of Southern California Los Angeles, CA, USA
| | - Fan Yang
- Department of Preventive Medicine, University of Southern California Los Angeles, CA, USA
| |
Collapse
|
50
|
Wang X, Lee S, Zhu X, Redline S, Lin X. GEE-based SNP set association test for continuous and discrete traits in family-based association studies. Genet Epidemiol 2013; 37:778-86. [PMID: 24166731 PMCID: PMC4007511 DOI: 10.1002/gepi.21763] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2013] [Revised: 08/17/2013] [Accepted: 09/10/2013] [Indexed: 12/17/2022]
Abstract
Family-based genetic association studies of related individuals provide opportunities to detect genetic variants that complement studies of unrelated individuals. Most statistical methods for family association studies for common variants are single marker based, which test one SNP a time. In this paper, we consider testing the effect of an SNP set, e.g., SNPs in a gene, in family studies, for both continuous and discrete traits. Specifically, we propose a generalized estimating equations (GEEs) based kernel association test, a variance component based testing method, to test for the association between a phenotype and multiple variants in an SNP set jointly using family samples. The proposed approach allows for both continuous and discrete traits, where the correlation among family members is taken into account through the use of an empirical covariance estimator. We derive the theoretical distribution of the proposed statistic under the null and develop analytical methods to calculate the P-values. We also propose an efficient resampling method for correcting for small sample size bias in family studies. The proposed method allows for easily incorporating covariates and SNP-SNP interactions. Simulation studies show that the proposed method properly controls for type I error rates under both random and ascertained sampling schemes in family studies. We demonstrate through simulation studies that our approach has superior performance for association mapping compared to the single marker based minimum P-value GEE test for an SNP-set effect over a range of scenarios. We illustrate the application of the proposed method using data from the Cleveland Family GWAS Study.
Collapse
Affiliation(s)
- Xuefeng Wang
- Department of Biostatistics, Harvard School of Public Health, Boston, MA, USA 02115
| | - Seunggeun Lee
- Department of Biostatistics, Harvard School of Public Health, Boston, MA, USA 02115
| | - Xiaofeng Zhu
- Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH, USA 44106
| | - Susan Redline
- Department of Medicine, Brigham and Women’s Hospital and Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA
| | - Xihong Lin
- Department of Biostatistics, Harvard School of Public Health, Boston, MA, USA 02115
| |
Collapse
|