1
|
Pillalamarri V, Shi W, Say C, Yang S, Lane J, Guallar E, Pankratz N, Arking DE. Whole-exome sequencing in 415,422 individuals identifies rare variants associated with mitochondrial DNA copy number. HGG ADVANCES 2023; 4:100147. [PMID: 36311265 PMCID: PMC9615038 DOI: 10.1016/j.xhgg.2022.100147] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2022] [Accepted: 09/23/2022] [Indexed: 10/14/2022] Open
Abstract
Inter-individual variation in the number of copies of the mitochondrial genome, called mitochondrial DNA copy number (mtDNA-CN), reflects mitochondrial function and has been associated with various aging-related diseases. We examined 415,422 exomes of self-reported White ancestry individuals from the UK Biobank and tested the impact of rare variants, at the level of single variants and through aggregate variant-set tests, on mtDNA-CN. A survey across nine variant sets tested enrichment of putatively causal variants and identified 14 genes at experiment-wide significance and three genes at marginal significance. These included associations at known mtDNA depletion syndrome genes (mtDNA helicase TWNK, p = 1.1 × 10-30; mitochondrial transcription factor TFAM, p = 4.3 × 10-15; mtDNA maintenance exonuclease MGME1, p = 2.0 × 10-6) and the V617F dominant gain-of-function mutation in the tyrosine kinase JAK2 (p = 2.7 × 10-17), associated with myeloproliferative disease. Novel genes included the ATP-dependent protease CLPX (p = 8.4 × 10-9), involved in mitochondrial proteome quality, and the mitochondrial adenylate kinase AK2 (p = 4.7 × 10-8), involved in hematopoiesis. The most significant association was a missense variant in SAMHD1 (p = 4.2 × 10-28), found on a rare, 1.2-Mb shared ancestral haplotype on chromosome 20. SAMHD1 encodes a cytoplasmic host restriction factor involved in viral defense response and the mitochondrial nucleotide salvage pathway, and is associated with Aicardi-Goutières syndrome 5, a childhood encephalopathy and chronic inflammatory response disorder. Rare variants were enriched in Mendelian mtDNA depletion syndrome loci, and these variants implicated core processes in mtDNA replication, nucleoid structure formation, and maintenance. These data indicate that strong-effect mutations from the nuclear genome contribute to the genetic architecture of mtDNA-CN.
Collapse
Affiliation(s)
- Vamsee Pillalamarri
- McKusick-Nathans Institute, Department of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
- Predoctoral Program in Human Genetics, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
- Maryland Genetics Epidemiology and Medicine Training Program, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21205, USA
| | - Wen Shi
- McKusick-Nathans Institute, Department of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | - Conrad Say
- McKusick-Nathans Institute, Department of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | - Stephanie Yang
- McKusick-Nathans Institute, Department of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
- Vertex Pharmaceuticals, Inc., Boston, MA 02210, USA
| | - John Lane
- Department of Laboratory Medicine and Pathology, University of Minnesota Medical School, Minneapolis, MN 55455, USA
| | - Eliseo Guallar
- Departments of Epidemiology and Medicine and Welch Center for Prevention, Epidemiology and Clinical Research, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21205, USA
| | - Nathan Pankratz
- Department of Laboratory Medicine and Pathology, University of Minnesota Medical School, Minneapolis, MN 55455, USA
| | - Dan E. Arking
- McKusick-Nathans Institute, Department of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| |
Collapse
|
2
|
Parrish RL, Gibson GC, Epstein MP, Yang J. TIGAR-V2: Efficient TWAS tool with nonparametric Bayesian eQTL weights of 49 tissue types from GTEx V8. HGG ADVANCES 2022; 3:100068. [PMID: 35047855 PMCID: PMC8756507 DOI: 10.1016/j.xhgg.2021.100068] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2021] [Accepted: 11/01/2021] [Indexed: 01/12/2023] Open
Abstract
Standard transcriptome-wide association study (TWAS) methods first train gene expression prediction models using reference transcriptomic data and then test the association between the predicted genetically regulated gene expression and phenotype of interest. Most existing TWAS tools require cumbersome preparation of genotype input files and extra coding to enable parallel computation. To improve the efficiency of TWAS tools, we developed Transcriptome-Integrated Genetic Association Resource V2 (TIGAR-V2), which directly reads Variant Call Format (VCF) files, enables parallel computation, and reduces up to 90% of computation cost (mainly due to loading genotype data) compared to the original version. TIGAR-V2 can train gene expression imputation models using either nonparametric Bayesian Dirichlet process regression (DPR) or Elastic-Net (as used by PrediXcan), perform TWASs using either individual-level or summary-level genome-wide association study (GWAS) data, and implement both burden and variance-component statistics for gene-based association tests. We trained gene expression prediction models by DPR for 49 tissues using Genotype-Tissue Expression (GTEx) V8 by TIGAR-V2 and illustrated the usefulness of these Bayesian cis-expression quantitative trait locus (eQTL) weights through TWASs of breast and ovarian cancer utilizing public GWAS summary statistics. We identified 88 and 37 risk genes, respectively, for breast and ovarian cancer, most of which are either known or near previously identified GWAS (∼95%) or TWAS (∼40%) risk genes and three novel independent TWAS risk genes with known functions in carcinogenesis. These findings suggest that TWASs can provide biological insight into the transcriptional regulation of complex diseases. The TIGAR-V2 tool, trained Bayesian cis-eQTL weights, and linkage disequilibrium (LD) information from GTEx V8 are publicly available, providing a useful resource for mapping risk genes of complex diseases.
Collapse
Affiliation(s)
- Randy L. Parrish
- Center for Computational and Quantitative Genetics, Department of Human Genetics, Emory University School of Medicine, Atlanta, GA 30322, USA
| | - Greg C. Gibson
- School of Biology, Georgia Institute of Technology, Atlanta, GA 30332, USA
| | - Michael P. Epstein
- Center for Computational and Quantitative Genetics, Department of Human Genetics, Emory University School of Medicine, Atlanta, GA 30322, USA
| | - Jingjing Yang
- Center for Computational and Quantitative Genetics, Department of Human Genetics, Emory University School of Medicine, Atlanta, GA 30322, USA
| |
Collapse
|
3
|
Pang H, Xia Y, Luo S, Huang G, Li X, Xie Z, Zhou Z. Emerging roles of rare and low-frequency genetic variants in type 1 diabetes mellitus. J Med Genet 2021; 58:289-296. [PMID: 33753534 PMCID: PMC8086251 DOI: 10.1136/jmedgenet-2020-107350] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2020] [Revised: 01/06/2021] [Accepted: 01/10/2021] [Indexed: 12/12/2022]
Abstract
Type 1 diabetes mellitus (T1DM) is defined as an autoimmune disorder and has enormous complexity and heterogeneity. Although its precise pathogenic mechanisms are obscure, this disease is widely acknowledged to be precipitated by environmental factors in individuals with genetic susceptibility. To date, the known susceptibility loci, which have mostly been identified by genome-wide association studies, can explain 80%–85% of the heritability of T1DM. Researchers believe that at least a part of its missing genetic component is caused by undetected rare and low-frequency variants. Most common variants have only small to modest effect sizes, which increases the difficulty of dissecting their functions and restricts their potential clinical application. Intriguingly, many studies have indicated that rare and low-frequency variants have larger effect sizes and play more significant roles in susceptibility to common diseases, including T1DM, than common variants do. Therefore, better recognition of rare and low-frequency variants is beneficial for revealing the genetic architecture of T1DM and for providing new and potent therapeutic targets for this disease. Here, we will discuss existing challenges as well as the great significance of this field and review current knowledge of the contributions of rare and low-frequency variants to T1DM.
Collapse
Affiliation(s)
- Haipeng Pang
- National Clinical Research Center for Metabolic Diseases, Key Laboratory of Diabetes Immunology (Central South University), Ministry of Education, and Department of Metabolism and Endocrinology, The Second Xiangya Hospital of Central South University, Changsha, Hunan, China
| | - Ying Xia
- National Clinical Research Center for Metabolic Diseases, Key Laboratory of Diabetes Immunology (Central South University), Ministry of Education, and Department of Metabolism and Endocrinology, The Second Xiangya Hospital of Central South University, Changsha, Hunan, China
| | - Shuoming Luo
- National Clinical Research Center for Metabolic Diseases, Key Laboratory of Diabetes Immunology (Central South University), Ministry of Education, and Department of Metabolism and Endocrinology, The Second Xiangya Hospital of Central South University, Changsha, Hunan, China
| | - Gan Huang
- National Clinical Research Center for Metabolic Diseases, Key Laboratory of Diabetes Immunology (Central South University), Ministry of Education, and Department of Metabolism and Endocrinology, The Second Xiangya Hospital of Central South University, Changsha, Hunan, China
| | - Xia Li
- National Clinical Research Center for Metabolic Diseases, Key Laboratory of Diabetes Immunology (Central South University), Ministry of Education, and Department of Metabolism and Endocrinology, The Second Xiangya Hospital of Central South University, Changsha, Hunan, China
| | - Zhiguo Xie
- National Clinical Research Center for Metabolic Diseases, Key Laboratory of Diabetes Immunology (Central South University), Ministry of Education, and Department of Metabolism and Endocrinology, The Second Xiangya Hospital of Central South University, Changsha, Hunan, China
| | - Zhiguang Zhou
- National Clinical Research Center for Metabolic Diseases, Key Laboratory of Diabetes Immunology (Central South University), Ministry of Education, and Department of Metabolism and Endocrinology, The Second Xiangya Hospital of Central South University, Changsha, Hunan, China
| |
Collapse
|
4
|
Fonseca DJ, Morel A, Llinás-Caballero K, Bolívar-Salazar D, Laissue P. Whole-Exome Sequencing in Patients Affected by Stevens-Johnson Syndrome and Toxic Epidermal Necrolysis Reveals New Variants Potentially Contributing to the Phenotype. PHARMACOGENOMICS & PERSONALIZED MEDICINE 2021; 14:287-299. [PMID: 33688237 PMCID: PMC7935440 DOI: 10.2147/pgpm.s289869] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/01/2020] [Accepted: 12/26/2020] [Indexed: 12/17/2022]
Abstract
Background Adverse drug reactions (ADRs) are frequent occurring events that can essentially be defined as harmful or unpleasant symptoms secondary to the use of a medicinal product. ADRs involve a wide spectrum of clinical manifestations ranging from minor itching and rash to life-threatening reactions. Stevens-Johnson syndrome (SJS) and toxic epidermal necrolysis (TEN) are rare ADRs. SJS-TEN may be considered a polygenic pathology due to additive/epistatic effects caused by sequence variants in numerous genes. Next-generation sequencing (NGS) represents a potentially interesting exploration tool in such scenario as it facilitates the simultaneous analysis of large genomic regions and genes at affordable cost. Methods The present study has involved using whole-exome sequencing (WES) for the first time on SJS-TEN patients. It involved robust and innovative multistep bioinformatics analysis focusing on 313 candidate genes potentially participating in the disease’s aetiology, specific drugs’ metabolism and gene regulation. Results We identified combinations of frequently occurring and rare variants that may contribute to the disease’s pathogenesis. Depending on the specific drug being taken, different variants (and alleles) in NAT2, CYP2D8, CYP2B6, ABCC2, UGT2B7 and TCF3 were identified as coherent candidates representing potential future markers for SJS-TEN. Conclusion The present study proposed and has described (for the first time) a large-scale genomic analysis of patients affected by SJS-TEN. The genes and variants identified represent relevant candidates potentially participating in the disease’s pathogenesis. Corroborating that proposed by others, we found that complex combinations of frequently occurring and rare variants participating in particular drug metabolism molecular cascades could be associated with the phenotype. TCF3 TF may be considered a coherent candidate for SJS-TEN that should be analysed in new cohorts of patients having ADRs.
Collapse
Affiliation(s)
- Dora Janeth Fonseca
- Center for Research in Genetics and Genomics-CIGGUR, GENIUROS Research Group, School of Medicine and Health Sciences, Universidad Del Rosario, Bogotá, Colombia
| | - Adrien Morel
- Center for Research in Genetics and Genomics-CIGGUR, GENIUROS Research Group, School of Medicine and Health Sciences, Universidad Del Rosario, Bogotá, Colombia
| | - Kevin Llinás-Caballero
- Center for Research in Genetics and Genomics-CIGGUR, GENIUROS Research Group, School of Medicine and Health Sciences, Universidad Del Rosario, Bogotá, Colombia
| | - David Bolívar-Salazar
- Center for Research in Genetics and Genomics-CIGGUR, GENIUROS Research Group, School of Medicine and Health Sciences, Universidad Del Rosario, Bogotá, Colombia
| | - Paul Laissue
- Center for Research in Genetics and Genomics-CIGGUR, GENIUROS Research Group, School of Medicine and Health Sciences, Universidad Del Rosario, Bogotá, Colombia.,BIOPAS Laboratoires, Orphan Diseases Unit, BIOPAS GROUP, Bogotá, Colombia
| |
Collapse
|
5
|
El Hayek L, Tuncay IO, Nijem N, Russell J, Ludwig S, Kaur K, Li X, Anderton P, Tang M, Gerard A, Heinze A, Zacher P, Alsaif HS, Rad A, Hassanpour K, Abbaszadegan MR, Washington C, DuPont BR, Louie RJ, Couse M, Faden M, Rogers RC, Abou Jamra R, Elias ER, Maroofian R, Houlden H, Lehman A, Beutler B, Chahrour MH. KDM5A mutations identified in autism spectrum disorder using forward genetics. eLife 2020; 9:56883. [PMID: 33350388 PMCID: PMC7755391 DOI: 10.7554/elife.56883] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2020] [Accepted: 12/06/2020] [Indexed: 12/29/2022] Open
Abstract
Autism spectrum disorder (ASD) is a constellation of neurodevelopmental disorders with high phenotypic and genetic heterogeneity, complicating the discovery of causative genes. Through a forward genetics approach selecting for defective vocalization in mice, we identified Kdm5a as a candidate ASD gene. To validate our discovery, we generated a Kdm5a knockout mouse model (Kdm5a-/-) and confirmed that inactivating Kdm5a disrupts vocalization. In addition, Kdm5a-/- mice displayed repetitive behaviors, sociability deficits, cognitive dysfunction, and abnormal dendritic morphogenesis. Loss of KDM5A also resulted in dysregulation of the hippocampal transcriptome. To determine if KDM5A mutations cause ASD in humans, we screened whole exome sequencing and microarray data from a clinical cohort. We identified pathogenic KDM5A variants in nine patients with ASD and lack of speech. Our findings illustrate the power and efficacy of forward genetics in identifying ASD genes and highlight the importance of KDM5A in normal brain development and function.
Collapse
Affiliation(s)
- Lauretta El Hayek
- Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, United States
| | - Islam Oguz Tuncay
- Department of Neuroscience, University of Texas Southwestern Medical Center, Dallas, United States
| | - Nadine Nijem
- Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, United States
| | - Jamie Russell
- Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, United States
| | - Sara Ludwig
- Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, United States
| | - Kiran Kaur
- Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, United States
| | - Xiaohong Li
- Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, United States
| | - Priscilla Anderton
- Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, United States
| | - Miao Tang
- Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, United States
| | - Amanda Gerard
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, United States.,Texas Children's Hospital, Houston, United States
| | - Anja Heinze
- Institute of Human Genetics, University of Leipzig Medical Center, Leipzig, Germany
| | - Pia Zacher
- Institute of Human Genetics, University of Leipzig Medical Center, Leipzig, Germany.,The Saxon Epilepsy Center Kleinwachau, Radeberg, Germany
| | - Hessa S Alsaif
- Department of Genetics, King Faisal Specialist Hospital and Research Centre, Riyadh, Saudi Arabia
| | - Aboulfazl Rad
- Cellular and Molecular Research Center, Sabzevar University of Medical Sciences, Sabzevar, Islamic Republic of Iran
| | - Kazem Hassanpour
- Non-Communicable Diseases Research Center, Sabzevar University of Medical Sciences, Sabzevar, Islamic Republic of Iran
| | - Mohammad Reza Abbaszadegan
- Pardis Clinical and Genetics Laboratory, Mashhad, Islamic Republic of Iran.,Division of Human Genetics, Avicenna Research Institute, Mashhad University of Medical Sciences, Mashhad, Islamic Republic of Iran
| | | | | | | | -
- Department of Medical Genetics, University of British Columbia, British Columbia Children's and Women's Hospital Research Institute, Vancouver, Canada
| | - Madeline Couse
- Department of Medical Genetics, University of British Columbia, British Columbia Children's and Women's Hospital Research Institute, Vancouver, Canada
| | - Maha Faden
- Department of Genetics, King Saud Medical City, Riyadh, Saudi Arabia
| | | | - Rami Abou Jamra
- Institute of Human Genetics, University of Leipzig Medical Center, Leipzig, Germany
| | - Ellen R Elias
- Department of Pediatrics and Genetics, University of Colorado School of Medicine, Aurora, United States
| | - Reza Maroofian
- Department of Neuromuscular Diseases, University College London, Queen Square Institute of Neurology, London, United Kingdom
| | - Henry Houlden
- Department of Neuromuscular Diseases, University College London, Queen Square Institute of Neurology, London, United Kingdom
| | - Anna Lehman
- Department of Medical Genetics, University of British Columbia, British Columbia Children's and Women's Hospital Research Institute, Vancouver, Canada
| | - Bruce Beutler
- Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, United States
| | - Maria H Chahrour
- Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, United States.,Department of Neuroscience, University of Texas Southwestern Medical Center, Dallas, United States.,Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, United States.,Department of Psychiatry, University of Texas Southwestern Medical Center, Dallas, United States.,Peter O'Donnell Jr. Brain Institute, University of Texas Southwestern Medical Center, Dallas, United States
| |
Collapse
|
6
|
Kang JTL, Rosenberg NA. Mathematical Properties of Linkage Disequilibrium Statistics Defined by Normalization of the Coefficient D = pAB - pApB. Hum Hered 2020; 84:127-143. [PMID: 32045910 DOI: 10.1159/000504171] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2019] [Accepted: 10/10/2019] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND Many statistics for measuring linkage disequilibrium (LD) take the form of a normalization of the LD coefficient D. Different normalizations produce statistics with different ranges, interpretations, and arguments favoring their use. METHODS Here, to compare the mathematical properties of these normalizations, we consider 5 of these normalized statistics, describing their upper bounds, the mean values of their maxima over the set of possible allele frequency pairs, and the size of the allele frequency regions accessible given specified values of the statistics. RESULTS We produce detailed characterizations of these properties for the statistics d and ρ, analogous to computations previously performed for r2. We examine the relationships among the statistics, uncovering conditions under which some of them have close connections. CONCLUSION The results contribute insight into LD measurement, particularly the understanding of differences in the features of different LD measures when computed on the same data.
Collapse
Affiliation(s)
- Jonathan T L Kang
- Department of Biology, Stanford University, Stanford, California, USA,
| | - Noah A Rosenberg
- Department of Biology, Stanford University, Stanford, California, USA
| |
Collapse
|
7
|
Lorenzo-Salazar JM, Ma SF, Jou J, Hou PC, Guillen-Guio B, Allen RJ, Jenkins RG, Wain LV, Oldham JM, Noth I, Flores C. Novel idiopathic pulmonary fibrosis susceptibility variants revealed by deep sequencing. ERJ Open Res 2019; 5:00071-2019. [PMID: 31205927 PMCID: PMC6556557 DOI: 10.1183/23120541.00071-2019] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2019] [Accepted: 04/07/2019] [Indexed: 12/13/2022] Open
Abstract
Background Specific common and rare single nucleotide variants (SNVs) increase the likelihood of developing sporadic idiopathic pulmonary fibrosis (IPF). We performed target-enriched sequencing on three loci previously identified by a genome-wide association study to gain a deeper understanding of the full spectrum of IPF genetic risk and performed a two-stage case–control association study. Methods A total of 1.7 Mb of DNA from 181 IPF patients was deep sequenced (>100×) across 11p15.5, 14q21.3 and 17q21.31 loci. Comparisons were performed against 501 unrelated controls and replication studies were assessed in 3968 subjects. Results 36 SNVs were associated with IPF susceptibility in the discovery stage (p<5.0×10−8). After meta-analysis, the strongest association corresponded to rs35705950 (p=9.27×10−57) located upstream from the mucin 5B gene (MUC5B). Additionally, a novel association was found for two co-inherited low-frequency SNVs (<5%) in MUC5AC, predicting a missense amino acid change in mucin 5AC (lowest p=2.27×10−22). Conditional and haplotype analyses in 11p15.5 supported the existence of an additional contribution of MUC5AC variants to IPF risk. Conclusions This study reinforces the significant IPF associations of these loci and implicates MUC5AC as another key player in IPF susceptibility. Deep sequencing of genome-wide association study hits identified novel low-frequency variants associated with IPF susceptibility.http://bit.ly/2IF4AT8
Collapse
Affiliation(s)
- Jose M Lorenzo-Salazar
- Genomics Division, Instituto Tecnológico y de Energías Renovables (ITER), Santa Cruz de Tenerife, Spain.,These authors contributed equally to this work
| | - Shwu-Fan Ma
- Division of Pulmonary and Critical Care Medicine, University of Virginia, Charlottesville, VA, USA.,These authors contributed equally to this work
| | - Jonathan Jou
- College of Medicine, University of Illinois, Chicago, IL, USA.,These authors contributed equally to this work
| | - Pei-Chi Hou
- Division of Pulmonary and Critical Care Medicine, University of Virginia, Charlottesville, VA, USA
| | - Beatriz Guillen-Guio
- Research Unit, Hospital Universitario N.S. de Candelaria, Universidad de La Laguna, Santa Cruz de Tenerife, Spain
| | - Richard J Allen
- Dept of Health Sciences, University of Leicester, Leicester, UK
| | - R Gisli Jenkins
- NIHR Biomedical Research Centre, Respiratory Research Unit, University of Nottingham, Nottingham, UK
| | - Louise V Wain
- Dept of Health Sciences, University of Leicester, Leicester, UK.,National Institute for Health Research, Leicester Respiratory Biomedical Research Centre, Glenfield Hospital, Leicester, UK
| | - Justin M Oldham
- Pulmonary and Critical Care Medicine, University of California at Davis, Sacramento, CA, USA
| | - Imre Noth
- Division of Pulmonary and Critical Care Medicine, University of Virginia, Charlottesville, VA, USA
| | - Carlos Flores
- Genomics Division, Instituto Tecnológico y de Energías Renovables (ITER), Santa Cruz de Tenerife, Spain.,Research Unit, Hospital Universitario N.S. de Candelaria, Universidad de La Laguna, Santa Cruz de Tenerife, Spain.,CIBER de Enfermedades Respiratorias, Instituto de Salud Carlos III, Madrid, Spain
| |
Collapse
|
8
|
Randesi M, van den Brink W, Levran O, Blanken P, van Ree JM, Ott J, Kreek MJ. VMAT2 gene ( SLC18A2) variants associated with a greater risk for developing opioid dependence. Pharmacogenomics 2019; 20:331-341. [PMID: 30983500 DOI: 10.2217/pgs-2018-0137] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Aim: To determine if selected serotonergic and noradrenergic gene variants are associated with heroin addiction. Subjects & methods: A total of 126 variants in 19 genes in subjects with Dutch European ancestry from The Netherlands. Subjects included 281 opioid-dependent volunteers in methadone maintenance or heroin-assisted treatment, 163 opioid-exposed but not opioid-dependent volunteers who have been using illicit opioids but never became opioid-dependent and 153 healthy controls. Results: Nominal associations were indicated for 20 variants in six genes including an experiment-wise significant association from the combined effect of three SLC18A2 SNPs (rs363332, rs363334 and rs363338) with heroin dependence (pfinal = 0.047). Conclusion: Further studies are warranted to confirm and elucidate the role of these variants in the vulnerability to opioid addiction.
Collapse
Affiliation(s)
- Matthew Randesi
- Laboratory of the Biology of Addictive Diseases, The Rockefeller University, 1230 York Avenue, New York, NY 10065, USA
| | - Wim van den Brink
- Department of Psychiatry, Amsterdam Institute for Addiction Research, Academic Medical Center, University of Amsterdam, PO Box 22660, 1100 DD Amsterdam, The Netherlands
| | - Orna Levran
- Laboratory of the Biology of Addictive Diseases, The Rockefeller University, 1230 York Avenue, New York, NY 10065, USA
| | - Peter Blanken
- Department of Psychiatry, Amsterdam Institute for Addiction Research, Academic Medical Center, University of Amsterdam, PO Box 22660, 1100 DD Amsterdam, The Netherlands.,Parnassia Addiction Research Centre (PARC, Brijder Addiction Treatment), PO Box 53002, 2505 AA The Hague, The Netherlands
| | - Jan M van Ree
- Rudolf Magnus Brain Center, University Medical Center Utrecht, Utrecht University, Heidelberglaan 100, 3584 CX Utrecht, The Netherlands
| | - Jurg Ott
- Laboratory of Statistical Genetics, The Rockefeller University, 1230 York Avenue, New York, NY 10065, USA
| | - Mary Jeanne Kreek
- Laboratory of the Biology of Addictive Diseases, The Rockefeller University, 1230 York Avenue, New York, NY 10065, USA
| |
Collapse
|
9
|
Weissenkampen JD, Jiang Y, Eckert S, Jiang B, Li B, Liu DJ. Methods for the Analysis and Interpretation for Rare Variants Associated with Complex Traits. CURRENT PROTOCOLS IN HUMAN GENETICS 2019; 101:e83. [PMID: 30849219 PMCID: PMC6455968 DOI: 10.1002/cphg.83] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
With the advent of Next Generation Sequencing (NGS) technologies, whole genome and whole exome DNA sequencing has become affordable for routine genetic studies. Coupled with improved genotyping arrays and genotype imputation methodologies, it is increasingly feasible to obtain rare genetic variant information in large datasets. Such datasets allow researchers to gain a more complete understanding of the genetic architecture of complex traits caused by rare variants. State-of-the-art statistical methods for the statistical genetics analysis of sequence-based association, including efficient algorithms for association analysis in biobank-scale datasets, gene-association tests, meta-analysis, fine mapping methods that integrate functional genomic dataset, and phenome-wide association studies (PheWAS), are reviewed here. These methods are expected to be highly useful for next generation statistical genetics analysis in the era of precision medicine. © 2019 by John Wiley & Sons, Inc.
Collapse
Affiliation(s)
| | - Yu Jiang
- Department of Public Health Sciences, Penn State College of Medicine, Hershey PA
| | - Scott Eckert
- Department of Public Health Sciences, Penn State College of Medicine, Hershey PA
| | - Bibo Jiang
- Department of Public Health Sciences, Penn State College of Medicine, Hershey PA
| | - Bingshan Li
- Department of Molecular Physiology and Biophysics, Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN
| | - Dajiang J. Liu
- Department of Public Health Sciences, Penn State College of Medicine, Hershey PA
| |
Collapse
|
10
|
Association of FMO3 Variants with Blood Pressure in the Atherosclerosis Risk in Communities Study. Int J Hypertens 2019; 2019:2137629. [PMID: 30906589 PMCID: PMC6397986 DOI: 10.1155/2019/2137629] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2018] [Revised: 01/23/2019] [Accepted: 02/03/2019] [Indexed: 12/03/2022] Open
Abstract
Flavin containing monooxygenase 3 [FMO3] encodes dimethylaniline monooxygenase [N-oxide-forming] 3, which breaks down nitrogen-containing compounds, and has been implicated in blood pressure regulation. Studies have reported conflicting results of the association of a common nonsynonymous variant, E158K (rs2266782), with hypertension. We examined the associations of E158K, along with rare and low frequency exonic variants (minor allele frequency [MAF]<5%) in FMO3 with hypertension, systolic blood pressure (SBP), and diastolic blood pressure (DBP). We included 7,350 European Americans and 2,814 African Americans in the Atherosclerosis Risk in Communities (ARIC) study with exome sequencing of FMO3. The association of FMO3 variants with SBP and DBP was tested using single variant and gene-based tests followed by the replication or interrogation of significant variants in ancestry-specific cohorts based on Bonferroni corrected thresholds. E158K had significant association with higher SBP in African Americans in ARIC (p=0.03), and two low frequency variants had significant association with higher SBP in African Americans (rs200985584, MAF 0.1%, p=0.0003) and European Americans (rs75904274, MAF 1.7%, p=0.006). These associations were not significant with additional samples: E158K in a meta-analysis of SBP of African ancestry (N=30,841, p=0.43) that included ARIC participants and the two low frequency variants in an independent ancestry-specific exome sequencing study of blood pressure (rs200985584, p=0.94; rs75904274, p=0.81). Our study does not support the association of E158K and low frequency variants in FMO3 with blood pressure and demonstrates the importance of replication in genetic studies.
Collapse
|
11
|
Exome-wide rare variant analyses of two bone mineral density phenotypes: the challenges of analyzing rare genetic variation. Sci Rep 2018; 8:220. [PMID: 29317680 PMCID: PMC5760616 DOI: 10.1038/s41598-017-18385-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2017] [Accepted: 12/11/2017] [Indexed: 11/08/2022] Open
Abstract
Performance of a recently developed test for association between multivariate phenotypes and sets of genetic variants (MURAT) is demonstrated using measures of bone mineral density (BMD). By combining individual-level whole genome sequenced data from the UK10K study, and imputed genome-wide genetic data on individuals from the Study of Osteoporotic Fractures (SOF) and the Osteoporotic Fractures in Men Study (MrOS), a data set of 8810 individuals was assembled; tests of association were performed between autosomal gene-sets of genetic variants and BMD measured at lumbar spine and femoral neck. Distributions of p-values obtained from analyses of a single BMD phenotype are compared to those from the multivariate tests, across several region definitions and variant weightings. There is evidence of increased power with the multivariate test, although no new loci for BMD were identified. Among 17 genes highlighted either because there were significant p-values in region-based association tests or because they were in well-known BMD genes, 4 windows in 2 genes as well as 6 single SNPs in one of these genes showed association at genome-wide significant thresholds with the multivariate phenotype test but not with the single-phenotype test, Sequence Kernel Association Test (SKAT).
Collapse
|
12
|
Hansen RD, Christensen AF, Olesen J. Family studies to find rare high risk variants in migraine. J Headache Pain 2017; 18:32. [PMID: 28255817 PMCID: PMC5334193 DOI: 10.1186/s10194-017-0729-y] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2016] [Accepted: 01/27/2017] [Indexed: 12/17/2022] Open
Abstract
INTRODUCTION Migraine has long been known as a common complex disease caused by genetic and environmental factors. The pathophysiology and the specific genetic susceptibility are poorly understood. Common variants only explain a small part of the heritability of migraine. It is thought that rare genetic variants with bigger effect size may be involved in the disease. Since migraine has a tendency to cluster in families, a family approach might be the way to find these variants. This is also indicated by identification of migraine-associated loci in classical linkage-analyses in migraine families. A single migraine study using a candidate-gene approach was performed in 2010 identifying a rare mutation in the TRESK potassium channel segregating in a large family with migraine with aura, but this finding has later become questioned. The technologies of next-generation sequencing (NGS) now provides an affordable tool to investigate the genetic variation in the entire exome or genome. The family-based study design using NGS is described in this paper. We also review family studies using NGS that have been successful in finding rare variants in other common complex diseases in order to argue the promising application of a family approach to migraine. METHOD PubMed was searched to find studies that looked for rare genetic variants in common complex diseases through a family-based design using NGS, excluding studies looking for de-novo mutations, or using a candidate-gene approach and studies on cancer. All issues from Nature Genetics and PLOS genetics 2014, 2015 and 2016 (UTAI June) were screened for relevant papers. Reference lists from included and other relevant papers were also searched. For the description of the family-based study design using NGS an in-house protocol was used. RESULTS Thirty-two successful studies, which covered 16 different common complex diseases, were included in this paper. We also found a single migraine study. Twenty-three studies found one or a few family specific variants (less than five), while other studies found several possible variants. Not all of them were genome wide significant. Four studies performed follow-up analyses in unrelated cases and controls and calculated odds ratios that supported an association between detected variants and risk of disease. Studies of 11 diseases identified rare variants that segregated fully or to a large degree with the disease in the pedigrees. CONCLUSION It is possible to find rare high risk variants for common complex diseases through a family-based approach. One study using a family approach and NGS to find rare variants in migraine has already been published but with strong limitations. More studies are under way.
Collapse
Affiliation(s)
- Rikke Dyhr Hansen
- Danish Headache Center, Department of Neurology, Rigshospitalet Glostrup, University of Copenhagen, Glostrup, DK-2600 Denmark
| | - Anne Francke Christensen
- Danish Headache Center, Department of Neurology, Rigshospitalet Glostrup, University of Copenhagen, Glostrup, DK-2600 Denmark
| | - Jes Olesen
- Danish Headache Center, Department of Neurology, Rigshospitalet Glostrup, University of Copenhagen, Glostrup, DK-2600 Denmark
| |
Collapse
|
13
|
Persyn E, Karakachoff M, Le Scouarnec S, Le Clézio C, Campion D, Consortium FE, Schott JJ, Redon R, Bellanger L, Dina C. DoEstRare: A statistical test to identify local enrichments in rare genomic variants associated with disease. PLoS One 2017; 12:e0179364. [PMID: 28742119 PMCID: PMC5524342 DOI: 10.1371/journal.pone.0179364] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2017] [Accepted: 05/29/2017] [Indexed: 01/01/2023] Open
Abstract
Next-generation sequencing technologies made it possible to assay the effect of rare variants on complex diseases. As an extension of the "common disease-common variant" paradigm, rare variant studies are necessary to get a more complete insight into the genetic architecture of human traits. Association studies of these rare variations show new challenges in terms of statistical analysis. Due to their low frequency, rare variants must be tested by groups. This approach is then hindered by the fact that an unknown proportion of the variants could be neutral. The risk level of a rare variation may be determined by its impact but also by its position in the protein sequence. More generally, the molecular mechanisms underlying the disease architecture may involve specific protein domains or inter-genic regulatory regions. While a large variety of methods are optimizing functionality weights for each single marker, few evaluate variant position differences between cases and controls. Here, we propose a test called DoEstRare, which aims to simultaneously detect clusters of disease risk variants and global allele frequency differences in genomic regions. This test estimates, for cases and controls, variant position densities in the genetic region by a kernel method, weighted by a function of allele frequencies. We compared DoEstRare with previously published strategies through simulation studies as well as re-analysis of real datasets. Based on simulation under various scenarios, DoEstRare was the sole to consistently show highest performance, in terms of type I error and power both when variants were clustered or not. DoEstRare was also applied to Brugada syndrome and early-onset Alzheimer's disease data and provided complementary results to other existing tests. DoEstRare, by integrating variant position information, gives new opportunities to explain disease susceptibility. DoEstRare is implemented in a user-friendly R package.
Collapse
Affiliation(s)
- Elodie Persyn
- INSERM, CNRS, UNIV Nantes, l’institut du thorax, Nantes, France
| | - Matilde Karakachoff
- INSERM, CNRS, UNIV Nantes, l’institut du thorax, Nantes, France
- CHU Nantes, l’institut du thorax, Nantes, France
| | | | - Camille Le Clézio
- Inserm U1079, Rouen University, Normandy Center for Genomic Medicine and Personalized Medicine, Normandy University, Rouen, France
| | - Dominique Campion
- Inserm U1079, Rouen University, Normandy Center for Genomic Medicine and Personalized Medicine, Normandy University, Rouen, France
| | | | - Jean-Jacques Schott
- INSERM, CNRS, UNIV Nantes, l’institut du thorax, Nantes, France
- CHU Nantes, l’institut du thorax, Nantes, France
| | - Richard Redon
- INSERM, CNRS, UNIV Nantes, l’institut du thorax, Nantes, France
- CHU Nantes, l’institut du thorax, Nantes, France
| | - Lise Bellanger
- Laboratoire de Mathématiques Jean Leray, UMR CNRS 6629, Nantes, France
- * E-mail: (LB); (CD)
| | - Christian Dina
- INSERM, CNRS, UNIV Nantes, l’institut du thorax, Nantes, France
- CHU Nantes, l’institut du thorax, Nantes, France
- * E-mail: (LB); (CD)
| |
Collapse
|
14
|
Ye BD, McGovern DP. Genetic variation in IBD: progress, clues to pathogenesis and possible clinical utility. Expert Rev Clin Immunol 2016; 12:1091-107. [PMID: 27156530 PMCID: PMC5083126 DOI: 10.1080/1744666x.2016.1184972] [Citation(s) in RCA: 59] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Epidemiological and clinical studies have suggested that the pathogenesis of inflammatory bowel disease (IBD) is strongly influenced by genetic predisposition. Beyond the limitations of linkage analysis, multiple genome-wide association studies, their meta-analyses, and targeted genotyping array techniques have broadened our understanding of the genetic architecture of IBD. Currently, over 200 single nucleotide polymorphisms are known to be associated with susceptibility to IBD and through functional analysis of genes and loci, a substantial proportion of pathophysiologic mechanisms have been revealed. However, because only a modest fraction of predicted heritability can be explained by known genes/loci, additional strategies are needed including the identification of rare variants with large effect sizes to help explain the missing heritability. Considerable progress is also being made on applying outcomes of genetic research in diagnostics, classification, prognostics, and the development of new therapeutics of IBD.
Collapse
Affiliation(s)
- Byong Duk Ye
- Department of Gastroenterology and Inflammatory Bowel Disease Center, University of Ulsan College of Medicine, Asan Medical Center, Seoul, Republic of Korea
- F. Widjaja Foundation Inflammatory Bowel and Immunobiology Research Institute, Medical Genetics Research Institute, Cedars-Sinai Medical Center, Los Angeles, California, USA
| | - Dermot P.B. McGovern
- F. Widjaja Foundation Inflammatory Bowel and Immunobiology Research Institute, Medical Genetics Research Institute, Cedars-Sinai Medical Center, Los Angeles, California, USA
| |
Collapse
|
15
|
Daneshjou R, Cavallari LH, Weeke PE, Karczewski KJ, Drozda K, Perera MA, Johnson JA, Klein TE, Bustamante CD, Roden DM, Shaffer C, Denny JC, Zehnder JL, Altman RB. Population-specific single-nucleotide polymorphism confers increased risk of venous thromboembolism in African Americans. Mol Genet Genomic Med 2016; 4:513-20. [PMID: 27652279 PMCID: PMC5023936 DOI: 10.1002/mgg3.226] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2016] [Revised: 04/28/2016] [Accepted: 05/03/2016] [Indexed: 01/03/2023] Open
Abstract
INTRODUCTION African Americans have a higher incidence of venous thromboembolism (VTE) than European descent individuals. However, the typical genetic risk factors in populations of European descent are nearly absent in African Americans, and population-specific genetic factors influencing the higher VTE rate are not well characterized. METHODS We performed a candidate gene analysis on an exome-sequenced African American family with recurrent VTE and identified a variant in Protein S (PROS1) V510M (rs138925964). We assessed the population impact of PROS1 V510M using a multicenter African American cohort of 306 cases with VTE compared to 370 controls. Additionally, we compared our case cohort to a background population cohort of 2203 African Americans in the NHLBI GO Exome Sequencing Project (ESP). RESULTS In the African American family with recurrent VTE, we found prior laboratories for our cases indicating low free Protein S levels, providing functional support for PROS1 V510M as the causative mutation. Additionally, this variant was significantly enriched in the VTE cases of our multicenter case-control study (Fisher's Exact Test, P = 0.0041, OR = 4.62, 95% CI: 1.51-15.20; allele frequencies - cases: 2.45%, controls: 0.54%). Similarly, PROS1 V510M was also enriched in our VTE case cohort compared to African Americans in the ESP cohort (Fisher's Exact Test, P = 0.010, OR = 2.28, 95% CI: 1.26-4.10). CONCLUSIONS We found a variant, PROS1 V510M, in an African American family with VTE and clinical laboratory abnormalities in Protein S. Additionally, we found that this variant conferred increased risk of VTE in a case-control study of African Americans. In the ESP cohort, the variant is nearly absent in ESP European descent subjects (n = 3, allele frequency: 0.03%). Additionally, in 1000 Genomes Phase 3 data, the variant only appears in African descent populations. Thus, PROS1 V510M is a population-specific genetic risk factor for VTE in African Americans.
Collapse
Affiliation(s)
- Roxana Daneshjou
- Department of Genetics Stanford University School of Medicine Stanford California 94305
| | - Larisa H Cavallari
- Department of Pharmacotherapy and Translational Research University of Florida Gainesville Florida 32610
| | - Peter E Weeke
- Department of MedicineVanderbilt UniversityNashvilleTennessee37201; The Department of CardiologyCopenhagen University HospitalGentofteDenmark
| | - Konrad J Karczewski
- Department of GeneticsStanford University School of MedicineStanfordCalifornia94305; Biomedical Informatics Training ProgramStanford University School of MedicineStanfordCalifornia 94305
| | - Katarzyna Drozda
- Department of Pharmacy Practice University of Illinois at Chicago Chicago Illinois 60612
| | - Minoli A Perera
- Department of Medicine University of Chicago Chicago Illinois 60637
| | - Julie A Johnson
- Department of Pharmacotherapy and Translational Research University of Florida Gainesville Florida 32610
| | - Teri E Klein
- Department of Genetics Stanford University School of Medicine Stanford California 94305
| | - Carlos D Bustamante
- Department of Genetics Stanford University School of Medicine Stanford California 94305
| | - Dan M Roden
- Department of MedicineVanderbilt UniversityNashvilleTennessee37201; Department of PharmacologyVanderbilt UniversityNashvilleTennessee37201; Department of BiomedicalInformatics Vanderbilt UniversityNashvilleTennessee37201
| | - Christian Shaffer
- Department of Medicine Vanderbilt University Nashville Tennessee 37201
| | - Joshua C Denny
- Department of Medicine Vanderbilt University Nashville Tennessee 37201
| | - James L Zehnder
- Department of Medicine Stanford University School of Medicine Stanford California 94305
| | - Russ B Altman
- Department of GeneticsStanford University School of MedicineStanfordCalifornia94305; Department of BioengineeringStanford University School of MedicineStanfordCalifornia94305
| |
Collapse
|
16
|
Raszek MM, Guan LL, Plastow GS. Use of Genomic Tools to Improve Cattle Health in the Context of Infectious Diseases. Front Genet 2016; 7:30. [PMID: 27014337 PMCID: PMC4780072 DOI: 10.3389/fgene.2016.00030] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2015] [Accepted: 02/18/2016] [Indexed: 12/15/2022] Open
Abstract
Although infectious diseases impose a heavy economic burden on the cattle industry, the etiology of many disorders that affect livestock is not fully elucidated, and effective countermeasures are often lacking. The main tools available until now have been vaccines, antibiotics and antiparasitic drugs. Although these have been very successful in some cases, the appearance of parasite and microbial resistance to these treatments is a cause of concern. Next-generation sequencing provides important opportunities to tackle problems associated with pathogenic illnesses. This review describes the rapid gains achieved to track disease progression, identify the pathogens involved, and map pathogen interactions with the host. Use of novel genomic tools subsequently aids in treatment development, as well as successful creation of breeding programs aimed toward less susceptible livestock. These may be important tools for mitigating the long term effects of combating infection and helping reduce the reliance on antibiotic treatment.
Collapse
Affiliation(s)
- Mikolaj M Raszek
- Livestock Gentec, Department of Agricultural, Food and Nutritional Science, University of Alberta Edmonton, AB, Canada
| | - Le L Guan
- Livestock Gentec, Department of Agricultural, Food and Nutritional Science, University of Alberta Edmonton, AB, Canada
| | - Graham S Plastow
- Livestock Gentec, Department of Agricultural, Food and Nutritional Science, University of Alberta Edmonton, AB, Canada
| |
Collapse
|
17
|
Shtir C, Aldahmesh MA, Al-Dahmash S, Abboud E, Alkuraya H, Abouammoh MA, Nowailaty SR, Al-Thubaiti G, Naim EA, ALYounes B, Binhumaid FS, ALOtaibi AB, Altamimi AS, Alamer FH, Hashem M, Abouelhoda M, Monies D, Alkuraya FS. Exome-based case–control association study using extreme phenotype design reveals novel candidates with protective effect in diabetic retinopathy. Hum Genet 2015; 135:193-200. [DOI: 10.1007/s00439-015-1624-8] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2015] [Accepted: 12/09/2015] [Indexed: 01/16/2023]
|
18
|
Tada H, Kawashiri MA, Konno T, Yamagishi M, Hayashi K. Common and Rare Variant Association Study for Plasma Lipids and Coronary Artery Disease. J Atheroscler Thromb 2015; 23:241-56. [PMID: 26347050 DOI: 10.5551/jat.31393] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
Abstract
Blood lipid levels are highly heritable and modifiable risk factors for coronary artery disease (CAD), and are the leading cause of death worldwide. These facts have motivated human genetic association studies that have the substantial potential to define the risk factors that are causal and to identify pathways and therapeutic targets for lipids and CAD.The success of the HapMap project that provided an extensive catalog of human genetic variations and the development of microarray based genotyping chips (typically containing variations with allele frequencies > 5%) facilitated common variant association study (CVAS; formerly termed genome-wide association study, GWAS) identifying disease-associated variants in a genome-wide manner. To date, 157 loci associated with blood lipids and 46 loci with CAD have been successfully identified, accounting for approximately 12%-14% of heritability for lipids and 10% of heritability for CAD. However, there is yet a major challenge termed "missing heritability problem," namely the observation that loci detected by CVAS explain only a small fraction of the inferred genetic variations. To explain such missing portions, focuses in genetic association studies have shifted from common to rare variants. However, it is challenging to apply rare variant association study (RVAS) in an unbiased manner because such variants typically lack the sufficient number to be identified statistically.In this review, we provide a current understanding of the genetic architecture mostly derived from CVAS, and several updates on the progress and limitations of RVAS for lipids and CAD.
Collapse
Affiliation(s)
- Hayato Tada
- Division of Cardiovascular Medicine, Kanazawa University Graduate School of Medicine
| | | | | | | | | |
Collapse
|
19
|
Choosing Subsamples for Sequencing Studies by Minimizing the Average Distance to the Closest Leaf. Genetics 2015; 201:499-511. [PMID: 26307072 PMCID: PMC4596665 DOI: 10.1534/genetics.115.176909] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2015] [Accepted: 08/14/2015] [Indexed: 12/29/2022] Open
Abstract
Imputation of genotypes in a study sample can make use of sequenced or densely genotyped external reference panels consisting of individuals that are not from the study sample. It also can employ internal reference panels, incorporating a subset of individuals from the study sample itself. Internal panels offer an advantage over external panels because they can reduce imputation errors arising from genetic dissimilarity between a population of interest and a second, distinct population from which the external reference panel has been constructed. As the cost of next-generation sequencing decreases, internal reference panel selection is becoming increasingly feasible. However, it is not clear how best to select individuals to include in such panels. We introduce a new method for selecting an internal reference panel—minimizing the average distance to the closest leaf (ADCL)—and compare its performance relative to an earlier algorithm: maximizing phylogenetic diversity (PD). Employing both simulated data and sequences from the 1000 Genomes Project, we show that ADCL provides a significant improvement in imputation accuracy, especially for imputation of sites with low-frequency alleles. This improvement in imputation accuracy is robust to changes in reference panel size, marker density, and length of the imputation target region.
Collapse
|
20
|
Chen R, Wei Q, Zhan X, Zhong X, Sutcliffe JS, Cox NJ, Cook EH, Li C, Chen W, Li B. A haplotype-based framework for group-wise transmission/disequilibrium tests for rare variant association analysis. ACTA ACUST UNITED AC 2015; 31:1452-9. [PMID: 25568282 DOI: 10.1093/bioinformatics/btu860] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2014] [Accepted: 12/23/2014] [Indexed: 12/30/2022]
Abstract
MOTIVATION A major focus of current sequencing studies for human genetics is to identify rare variants associated with complex diseases. Aside from reduced power of detecting associated rare variants, controlling for population stratification is particularly challenging for rare variants. Transmission/disequilibrium tests (TDT) based on family designs are robust to population stratification and admixture, and therefore provide an effective approach to rare variant association studies to eliminate spurious associations. To increase power of rare variant association analysis, gene-based collapsing methods become standard approaches for analyzing rare variants. Existing methods that extend this strategy to rare variants in families usually combine TDT statistics at individual variants and therefore lack the flexibility of incorporating other genetic models. RESULTS In this study, we describe a haplotype-based framework for group-wise TDT (gTDT) that is flexible to encompass a variety of genetic models such as additive, dominant and compound heterozygous (CH) (i.e. recessive) models as well as other complex interactions. Unlike existing methods, gTDT constructs haplotypes by transmission when possible and inherently takes into account the linkage disequilibrium among variants. Through extensive simulations we showed that type I error was correctly controlled for rare variants under all models investigated, and this remained true in the presence of population stratification. Under a variety of genetic models, gTDT showed increased power compared with the single marker TDT. Application of gTDT to an autism exome sequencing data of 118 trios identified potentially interesting candidate genes with CH rare variants. AVAILABILITY AND IMPLEMENTATION We implemented gTDT in C++ and the source code and the detailed usage are available on the authors' website (https://medschool.vanderbilt.edu/cgg). CONTACT bingshan.li@vanderbilt.edu or wei.chen@chp.edu SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Rui Chen
- Department of Molecular Physiology and Biophysics, Vanderbilt University, TN, 37221, USA, Quantitative Biomedical Research Center, University of Texas Southwestern Medical Center, Dallas, TX, USA, Center for Quantitative Sciences, Vanderbilt University, TN, 37221, USA, Department of Medicine, University of Chicago, Chicago, IL, USA, Department of Psychiatry, University of Illinois at Chicago, Chicago, IL, USA, Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH, USA and Department of Pediatrics, University of Pittsburgh, Pittsburgh, PA, USA
| | - Qiang Wei
- Department of Molecular Physiology and Biophysics, Vanderbilt University, TN, 37221, USA, Quantitative Biomedical Research Center, University of Texas Southwestern Medical Center, Dallas, TX, USA, Center for Quantitative Sciences, Vanderbilt University, TN, 37221, USA, Department of Medicine, University of Chicago, Chicago, IL, USA, Department of Psychiatry, University of Illinois at Chicago, Chicago, IL, USA, Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH, USA and Department of Pediatrics, University of Pittsburgh, Pittsburgh, PA, USA
| | - Xiaowei Zhan
- Department of Molecular Physiology and Biophysics, Vanderbilt University, TN, 37221, USA, Quantitative Biomedical Research Center, University of Texas Southwestern Medical Center, Dallas, TX, USA, Center for Quantitative Sciences, Vanderbilt University, TN, 37221, USA, Department of Medicine, University of Chicago, Chicago, IL, USA, Department of Psychiatry, University of Illinois at Chicago, Chicago, IL, USA, Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH, USA and Department of Pediatrics, University of Pittsburgh, Pittsburgh, PA, USA
| | - Xue Zhong
- Department of Molecular Physiology and Biophysics, Vanderbilt University, TN, 37221, USA, Quantitative Biomedical Research Center, University of Texas Southwestern Medical Center, Dallas, TX, USA, Center for Quantitative Sciences, Vanderbilt University, TN, 37221, USA, Department of Medicine, University of Chicago, Chicago, IL, USA, Department of Psychiatry, University of Illinois at Chicago, Chicago, IL, USA, Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH, USA and Department of Pediatrics, University of Pittsburgh, Pittsburgh, PA, USA
| | - James S Sutcliffe
- Department of Molecular Physiology and Biophysics, Vanderbilt University, TN, 37221, USA, Quantitative Biomedical Research Center, University of Texas Southwestern Medical Center, Dallas, TX, USA, Center for Quantitative Sciences, Vanderbilt University, TN, 37221, USA, Department of Medicine, University of Chicago, Chicago, IL, USA, Department of Psychiatry, University of Illinois at Chicago, Chicago, IL, USA, Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH, USA and Department of Pediatrics, University of Pittsburgh, Pittsburgh, PA, USA
| | - Nancy J Cox
- Department of Molecular Physiology and Biophysics, Vanderbilt University, TN, 37221, USA, Quantitative Biomedical Research Center, University of Texas Southwestern Medical Center, Dallas, TX, USA, Center for Quantitative Sciences, Vanderbilt University, TN, 37221, USA, Department of Medicine, University of Chicago, Chicago, IL, USA, Department of Psychiatry, University of Illinois at Chicago, Chicago, IL, USA, Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH, USA and Department of Pediatrics, University of Pittsburgh, Pittsburgh, PA, USA
| | - Edwin H Cook
- Department of Molecular Physiology and Biophysics, Vanderbilt University, TN, 37221, USA, Quantitative Biomedical Research Center, University of Texas Southwestern Medical Center, Dallas, TX, USA, Center for Quantitative Sciences, Vanderbilt University, TN, 37221, USA, Department of Medicine, University of Chicago, Chicago, IL, USA, Department of Psychiatry, University of Illinois at Chicago, Chicago, IL, USA, Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH, USA and Department of Pediatrics, University of Pittsburgh, Pittsburgh, PA, USA
| | - Chun Li
- Department of Molecular Physiology and Biophysics, Vanderbilt University, TN, 37221, USA, Quantitative Biomedical Research Center, University of Texas Southwestern Medical Center, Dallas, TX, USA, Center for Quantitative Sciences, Vanderbilt University, TN, 37221, USA, Department of Medicine, University of Chicago, Chicago, IL, USA, Department of Psychiatry, University of Illinois at Chicago, Chicago, IL, USA, Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH, USA and Department of Pediatrics, University of Pittsburgh, Pittsburgh, PA, USA
| | - Wei Chen
- Department of Molecular Physiology and Biophysics, Vanderbilt University, TN, 37221, USA, Quantitative Biomedical Research Center, University of Texas Southwestern Medical Center, Dallas, TX, USA, Center for Quantitative Sciences, Vanderbilt University, TN, 37221, USA, Department of Medicine, University of Chicago, Chicago, IL, USA, Department of Psychiatry, University of Illinois at Chicago, Chicago, IL, USA, Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH, USA and Department of Pediatrics, University of Pittsburgh, Pittsburgh, PA, USA Department of Molecular Physiology and Biophysics, Vanderbilt University, TN, 37221, USA, Quantitative Biomedical Research Center, University of Texas Southwestern Medical Center, Dallas, TX, USA, Center for Quantitative Sciences, Vanderbilt University, TN, 37221, USA, Department of Medicine, University of Chicago, Chicago, IL, USA, Department of Psychiatry, University of Illinois at Chicago, Chicago, IL, USA, Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH, USA and Department of Pediatrics, University of Pittsburgh, Pittsburgh, PA, USA
| | - Bingshan Li
- Department of Molecular Physiology and Biophysics, Vanderbilt University, TN, 37221, USA, Quantitative Biomedical Research Center, University of Texas Southwestern Medical Center, Dallas, TX, USA, Center for Quantitative Sciences, Vanderbilt University, TN, 37221, USA, Department of Medicine, University of Chicago, Chicago, IL, USA, Department of Psychiatry, University of Illinois at Chicago, Chicago, IL, USA, Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH, USA and Department of Pediatrics, University of Pittsburgh, Pittsburgh, PA, USA
| |
Collapse
|
21
|
Neale BM, Sklar P. Genetic analysis of schizophrenia and bipolar disorder reveals polygenicity but also suggests new directions for molecular interrogation. Curr Opin Neurobiol 2014; 30:131-8. [PMID: 25544106 DOI: 10.1016/j.conb.2014.12.001] [Citation(s) in RCA: 54] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2014] [Revised: 11/25/2014] [Accepted: 12/01/2014] [Indexed: 12/12/2022]
Abstract
Over the last few years, genetics research has made significant strides in identifying many risk factors for schizophrenia and bipolar disorder. These risk factors include inherited common single nucleotide polymorphisms, copy number variants, and rare single nucleotide variants, as well as rare de novo variants. For all variants, the common theme has been that of polygenicity, meaning that many small genetic risk factors influence risk in the population and that no gene or variant on its own has been shown to be fully deterministic of schizophrenia or bipolar. When taken together, biological themes that have emerged including the importance of synaptic function and calcium signaling. This has implications for our understanding of the biological underpinnings of these diseases.
Collapse
Affiliation(s)
- Benjamin M Neale
- Analytical and Translational Genetics Unit and Psychiatric and Neurodevelopmental Genetics Unit, Department of Medicine, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA; Stanley Center for Psychiatric Research and Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, 02142, USA.
| | - Pamela Sklar
- Division of Psychiatric Genomics, Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA.
| |
Collapse
|
22
|
Cheeseman IH, McDew-White M, Phyo AP, Sriprawat K, Nosten F, Anderson TJC. Pooled sequencing and rare variant association tests for identifying the determinants of emerging drug resistance in malaria parasites. Mol Biol Evol 2014; 32:1080-90. [PMID: 25534029 PMCID: PMC4379400 DOI: 10.1093/molbev/msu397] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
We explored the potential of pooled sequencing to swiftly and economically identify selective sweeps due to emerging artemisinin (ART) resistance in a South-East Asian malaria parasite population. ART resistance is defined by slow parasite clearance from the blood of ART-treated patients and mutations in the kelch gene (chr. 13) have been strongly implicated to play a role. We constructed triplicate pools of 70 slow-clearing (resistant) and 70 fast-clearing (sensitive) infections collected from the Thai–Myanmar border and sequenced these to high (∼150-fold) read depth. Allele frequency estimates from pools showed almost perfect correlation (Lin’s concordance = 0.98) with allele frequencies at 93 single nucleotide polymorphisms measured directly from individual infections, giving us confidence in the accuracy of this approach. By mapping genome-wide divergence (FST) between pools of drug-resistant and drug-sensitive parasites, we identified two large (>150 kb) regions (on chrs. 13 and 14) and 17 smaller candidate genome regions. To identify individual genes within these genome regions, we resequenced an additional 38 parasite genomes (16 slow and 22 fast-clearing) and performed rare variant association tests. These confirmed kelch as a major molecular marker for ART resistance (P = 6.03 × 10−6). This two-tier approach is powerful because pooled sequencing rapidly narrows down genome regions of interest, while targeted rare variant association testing within these regions can pinpoint the genetic basis of resistance. We show that our approach is robust to recurrent mutation and the generation of soft selective sweeps, which are predicted to be common in pathogen populations with large effective population sizes, and may confound more traditional gene mapping approaches.
Collapse
Affiliation(s)
| | | | - Aung Pyae Phyo
- Shoklo Malaria Research Unit, Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, Mae Sot, Thailand
| | - Kanlaya Sriprawat
- Shoklo Malaria Research Unit, Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, Mae Sot, Thailand
| | - François Nosten
- Shoklo Malaria Research Unit, Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, Mae Sot, Thailand Centre for Tropical Medicine, Nuffield Department of Medicine, University of Oxford, Oxford, United Kingdom
| | | |
Collapse
|
23
|
Effective filtering strategies to improve data quality from population-based whole exome sequencing studies. BMC Bioinformatics 2014; 15:125. [PMID: 24884706 PMCID: PMC4098776 DOI: 10.1186/1471-2105-15-125] [Citation(s) in RCA: 88] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2013] [Accepted: 04/16/2014] [Indexed: 12/12/2022] Open
Abstract
Background Genotypes generated in next generation sequencing studies contain errors which can significantly impact the power to detect signals in common and rare variant association tests. These genotyping errors are not explicitly filtered by the standard GATK Variant Quality Score Recalibration (VQSR) tool and thus remain a source of errors in whole exome sequencing (WES) projects that follow GATK’s recommended best practices. Therefore, additional data filtering methods are required to effectively remove these errors before performing association analyses with complex phenotypes. Here we empirically derive thresholds for genotype and variant filters that, when used in conjunction with the VQSR tool, achieve higher data quality than when using VQSR alone. Results The detailed filtering strategies improve the concordance of sequenced genotypes with array genotypes from 99.33% to 99.77%; improve the percent of discordant genotypes removed from 10.5% to 69.5%; and improve the Ti/Tv ratio from 2.63 to 2.75. We also demonstrate that managing batch effects by separating samples based on different target capture and sequencing chemistry protocols results in a final data set containing 40.9% more high-quality variants. In addition, imputation is an important component of WES studies and is used to estimate common variant genotypes to generate additional markers for association analyses. As such, we demonstrate filtering methods for imputed data that improve genotype concordance from 79.3% to 99.8% while removing 99.5% of discordant genotypes. Conclusions The described filtering methods are advantageous for large population-based WES studies designed to identify common and rare variation associated with complex diseases. Compared to data processed through standard practices, these strategies result in substantially higher quality data for common and rare association analyses.
Collapse
|
24
|
Cardinale CJ, Kelsen JR, Baldassano RN, Hakonarson H. Impact of exome sequencing in inflammatory bowel disease. World J Gastroenterol 2013; 19:6721-9. [PMID: 24187447 PMCID: PMC3812471 DOI: 10.3748/wjg.v19.i40.6721] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/11/2013] [Revised: 09/11/2013] [Accepted: 09/16/2013] [Indexed: 02/06/2023] Open
Abstract
Approaches to understanding the genetic contribution to inflammatory bowel disease (IBD) have continuously evolved from family- and population-based epidemiology, to linkage analysis, and most recently, to genome-wide association studies (GWAS). The next stage in this evolution seems to be the sequencing of the exome, that is, the regions of the human genome which encode proteins. The GWAS approach has been very fruitful in identifying at least 163 loci as being associated with IBD, and now, exome sequencing promises to take our genetic understanding to the next level. In this review we will discuss the possible contributions that can be made by an exome sequencing approach both at the individual patient level to aid with disease diagnosis and future therapies, as well as in advancing knowledge of the pathogenesis of IBD.
Collapse
|