1
|
Tanigawa Y, Kellis M. Hypometric genetics: Improved power in genetic discovery by incorporating quality control flags. Am J Hum Genet 2024; 111:2478-2493. [PMID: 39442521 PMCID: PMC11568753 DOI: 10.1016/j.ajhg.2024.09.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2024] [Revised: 09/26/2024] [Accepted: 09/27/2024] [Indexed: 10/25/2024] Open
Abstract
Balancing the tradeoff between quantity and quality of phenotypic data is critical in omics studies. Measurements below the limit of quantification (BLQ) are often tagged in quality control fields, but these flags are currently underutilized in human genetics studies. Extreme phenotype sampling is advantageous for mapping rare variant effects. We hypothesize that genetic drivers, along with environmental and technical factors, contribute to the presence of BLQ flags. Here, we introduce "hypometric genetics" (hMG) analysis and uncover a genetic basis for BLQ flags, indicating an additional source of genetic signal for genetic discovery, especially from phenotypic extremes. Applying our hMG approach to n = 227,469 UK Biobank individuals with metabolomic profiles, we reveal more than 5% heritability for BLQ flags and report biologically relevant associations, for example, at APOC3, APOA5, and PDE3B loci. For common variants, polygenic scores trained only for BLQ flags predict the corresponding quantitative traits with 91% accuracy, validating the genetic basis. For rare coding variant associations, we find an asymmetric 65.4% higher enrichment of metabolite-lowering associations for BLQ flags, highlighting the impact of putative loss-of-function variants with large effects on phenotypic extremes. Joint analysis of binarized BLQ flags and the corresponding quantitative metabolite measurements improves power in Bayesian rare variant aggregation tests, resulting in an average of 181% more prioritized genes. Our approach is broadly applicable to omics profiling. Overall, our results underscore the benefit of integrating quality control flags and quantitative measurements and highlight the advantage of joint analysis of population-based samples and phenotypic extremes in human genetics studies.
Collapse
Affiliation(s)
- Yosuke Tanigawa
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA; Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| | - Manolis Kellis
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA; Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| |
Collapse
|
2
|
Venkataraman GR, DeBoever C, Tanigawa Y, Aguirre M, Ioannidis AG, Mostafavi H, Spencer CCA, Poterba T, Bustamante CD, Daly MJ, Pirinen M, Rivas MA. Bayesian model comparison for rare-variant association studies. Am J Hum Genet 2021; 108:2354-2367. [PMID: 34822764 PMCID: PMC8715195 DOI: 10.1016/j.ajhg.2021.11.005] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2021] [Accepted: 11/02/2021] [Indexed: 12/12/2022] Open
Abstract
Whole-genome sequencing studies applied to large populations or biobanks with extensive phenotyping raise new analytic challenges. The need to consider many variants at a locus or group of genes simultaneously and the potential to study many correlated phenotypes with shared genetic architecture provide opportunities for discovery not addressed by the traditional one variant, one phenotype association study. Here, we introduce a Bayesian model comparison approach called MRP (multiple rare variants and phenotypes) for rare-variant association studies that considers correlation, scale, and direction of genetic effects across a group of genetic variants, phenotypes, and studies, requiring only summary statistic data. We apply our method to exome sequencing data (n = 184,698) across 2,019 traits from the UK Biobank, aggregating signals in genes. MRP demonstrates an ability to recover signals such as associations between PCSK9 and LDL cholesterol levels. We additionally find MRP effective in conducting meta-analyses in exome data. Non-biomarker findings include associations between MC1R and red hair color and skin color, IL17RA and monocyte count, and IQGAP2 and mean platelet volume. Finally, we apply MRP in a multi-phenotype setting; after clustering the 35 biomarker phenotypes based on genetic correlation estimates, we find that joint analysis of these phenotypes results in substantial power gains for gene-trait associations, such as in TNFRSF13B in one of the clusters containing diabetes- and lipid-related traits. Overall, we show that the MRP model comparison approach improves upon useful features from widely used meta-analysis approaches for rare-variant association analyses and prioritizes protective modifiers of disease risk.
Collapse
Affiliation(s)
| | - Christopher DeBoever
- Department of Biomedical Data Science, Stanford University, Stanford, CA 94305, USA
| | - Yosuke Tanigawa
- Department of Biomedical Data Science, Stanford University, Stanford, CA 94305, USA
| | - Matthew Aguirre
- Department of Biomedical Data Science, Stanford University, Stanford, CA 94305, USA
| | | | | | | | - Timothy Poterba
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Carlos D Bustamante
- Department of Biomedical Data Science, Stanford University, Stanford, CA 94305, USA; Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | - Mark J Daly
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Matti Pirinen
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki 00014, Finland; Department of Public Health, University of Helsinki, Helsinki 00014, Finland; Department of Mathematics and Statistics, University of Helsinki, Helsinki 00014, Finland.
| | - Manuel A Rivas
- Department of Biomedical Data Science, Stanford University, Stanford, CA 94305, USA.
| |
Collapse
|
3
|
De Vilder EYG, Martin L, Lefthériotis G, Coucke P, Van Nieuwerburgh F, Vanakker OM. Rare Modifier Variants Alter the Severity of Cardiovascular Disease in Pseudoxanthoma Elasticum: Identification of Novel Candidate Modifier Genes and Disease Pathways Through Mixture of Effects Analysis. Front Cell Dev Biol 2021; 9:612581. [PMID: 34169069 PMCID: PMC8218811 DOI: 10.3389/fcell.2021.612581] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2020] [Accepted: 05/11/2021] [Indexed: 12/30/2022] Open
Abstract
Introduction: Pseudoxanthoma elasticum (PXE), an ectopic mineralization disorder caused by pathogenic ABCC6 variants, is characterized by skin, ocular and cardiovascular (CV) symptoms. Due to striking phenotypic variability without genotype-phenotype correlations, modifier genes are thought to play a role in disease variability. In this study, we evaluated the collective modifying effect of rare variants on the cardiovascular phenotype of PXE. Materials and Methods: Mixed effects of rare variants were assessed by Whole Exome Sequencing in 11 PXE patients with an extreme CV phenotype (mild/severe). Statistical analysis (SKAT-O and C-alpha testing) was performed to identify new modifier genes for the CV PXE phenotype and enrichment analysis for genes significantly associated with the severe cohort was used to evaluate pathway and gene ontology features. Results Respectively 16 (SKAT-O) and 74 (C-alpha) genes were significantly associated to the severe cohort. Top significant genes could be stratified in 3 groups–calcium homeostasis, association with vascular disease and induction of apoptosis. Comparative analysis of both analyses led to prioritization of four genes (NLRP1, SELE, TRPV1, and CSF1R), all signaling through IL-1B. Conclusion This study explored for the first time the cumulative effect of rare variants on the severity of cardiovascular disease in PXE, leading to a panel of novel candidate modifier genes and disease pathways. Though further validation is essential, this panel may aid in risk stratification and genetic counseling of PXE patients and will help to gain new insights in the PXE pathophysiology.
Collapse
Affiliation(s)
- Eva Y G De Vilder
- Center for Medical Genetics, Ghent University Hospital, Ghent, Belgium.,The Research Foundation - Flanders, Ghent, Belgium.,Department of Ophthalmology, Ghent University Hospital, Ghent, Belgium
| | - Ludovic Martin
- Department of Dermatology, Angers University Hospital, Angers, France
| | - Georges Lefthériotis
- Department of Vascular Physiology and Sports Medicine, Angers University, Angers, France
| | - Paul Coucke
- Center for Medical Genetics, Ghent University Hospital, Ghent, Belgium.,Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| | - Filip Van Nieuwerburgh
- Department of Pharmaceutics, Laboratory of Pharmaceutical Biotechnology, Ghent University, Ghent, Belgium
| | - Olivier M Vanakker
- Center for Medical Genetics, Ghent University Hospital, Ghent, Belgium.,Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| |
Collapse
|
4
|
Gordon D, Londono D, Patel P, Kim W, Finch SJ, Heiman GA. An Analytic Solution to the Computation of Power and Sample Size for Genetic Association Studies under a Pleiotropic Mode of Inheritance. Hum Hered 2017; 81:194-209. [PMID: 28315880 DOI: 10.1159/000457135] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2016] [Accepted: 01/20/2017] [Indexed: 01/14/2023] Open
Abstract
Our motivation here is to calculate the power of 3 statistical tests used when there are genetic traits that operate under a pleiotropic mode of inheritance and when qualitative phenotypes are defined by use of thresholds for the multiple quantitative phenotypes. Specifically, we formulate a multivariate function that provides the probability that an individual has a vector of specific quantitative trait values conditional on having a risk locus genotype, and we apply thresholds to define qualitative phenotypes (affected, unaffected) and compute penetrances and conditional genotype frequencies based on the multivariate function. We extend the analytic power and minimum-sample-size-necessary (MSSN) formulas for 2 categorical data-based tests (genotype, linear trend test [LTT]) of genetic association to the pleiotropic model. We further compare the MSSN of the genotype test and the LTT with that of a multivariate ANOVA (Pillai). We approximate the MSSN for statistics by linear models using a factorial design and ANOVA. With ANOVA decomposition, we determine which factors most significantly change the power/MSSN for all statistics. Finally, we determine which test statistics have the smallest MSSN. In this work, MSSN calculations are for 2 traits (bivariate distributions) only (for illustrative purposes). We note that the calculations may be extended to address any number of traits. Our key findings are that the genotype test usually has lower MSSN requirements than the LTT. More inclusive thresholds (top/bottom 25% vs. top/bottom 10%) have higher sample size requirements. The Pillai test has a much larger MSSN than both the genotype test and the LTT, as a result of sample selection. With these formulas, researchers can specify how many subjects they must collect to localize genes for pleiotropic phenotypes.
Collapse
Affiliation(s)
- Derek Gordon
- Department of Genetics, The State University of New Jersey, Piscataway, NJ, USA
| | | | | | | | | | | |
Collapse
|
5
|
Marini S, Limongelli I, Rizzo E, Malovini A, Errichiello E, Vetro A, Da T, Zuffardi O, Bellazzi R. A Data Fusion Approach to Enhance Association Study in Epilepsy. PLoS One 2016; 11:e0164940. [PMID: 27984588 PMCID: PMC5161322 DOI: 10.1371/journal.pone.0164940] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2016] [Accepted: 10/04/2016] [Indexed: 11/25/2022] Open
Abstract
Among the scientific challenges posed by complex diseases with a strong genetic component, two stand out. One is unveiling the role of rare and common genetic variants; the other is the design of classification models to improve clinical diagnosis and predictive models for prognosis and personalized therapies. In this paper, we present a data fusion framework merging gene, domain, pathway and protein-protein interaction data related to a next generation sequencing epilepsy gene panel. Our method allows integrating association information from multiple genomic sources and aims at highlighting the set of common and rare variants that are capable to trigger the occurrence of a complex disease. When compared to other approaches, our method shows better performances in classifying patients affected by epilepsy.
Collapse
Affiliation(s)
- Simone Marini
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Kyoto, Japan
- Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Pavia, Italy
- * E-mail: ,
| | - Ivan Limongelli
- Genomic Core Center, IRCCS Fondazione San Matteo, Pavia, Italy
- enGenome S.r.l., Via Ferrata 5, Pavia, Italy
- Centre for Health Technologies, University of Pavia, Pavia, Italy
| | - Ettore Rizzo
- enGenome S.r.l., Via Ferrata 5, Pavia, Italy
- Centre for Health Technologies, University of Pavia, Pavia, Italy
| | | | | | - Annalisa Vetro
- Genomic Core Center, IRCCS Fondazione San Matteo, Pavia, Italy
- Department of Molecular Medicine, University of Pavia, Pavia, Italy
| | - Tan Da
- Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Pavia, Italy
| | - Orsetta Zuffardi
- Genomic Core Center, IRCCS Fondazione San Matteo, Pavia, Italy
- Department of Molecular Medicine, University of Pavia, Pavia, Italy
| | - Riccardo Bellazzi
- Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Pavia, Italy
- Centre for Health Technologies, University of Pavia, Pavia, Italy
- IRCCS Fondazione S. Maugeri, Pavia, Italy
| |
Collapse
|
6
|
Tayo BO, Tong L, Cooper RS. Association of polymorphisms in the aldosterone-regulated sodium reabsorption pathway with blood pressure among Hispanics. BMC Proc 2016; 10:343-348. [PMID: 27980660 PMCID: PMC5133472 DOI: 10.1186/s12919-016-0054-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
BACKGROUND Whereas genome-wide association study (GWAS) has proven to be an important tool for discovery of variants influencing many human diseases and traits, unfortunately its performance has not been much of all-around success for some complex conditions, for example, hypertension. Because some of the existing effective pharmacotherapeutic agents act by targeting known biological pathways, pathway-based analytical approaches could lead to more success in discovery of disease-associated variants. The objective of the present study was to identify functional variants associated with blood pressure in the aldosterone-regulated sodium reabsorption pathway using the simulated and real blood pressure phenotypes provided for Genetic Analysis Workshop 19. METHODS The present analysis included 1942 samples with exome sequencing data and for whom blood pressure phenotypes were available. Because only odd-numbered autosomes were available, we restricted analysis to 127 quality-controlled single-nucleotide polymorphisms from the aldosterone-regulated sodium reabsorption pathway. We performed pathway-based association analysis using appropriate regression models for single variant, haplotype and epistasis association analyses. To account for multiple comparisons, statistical significance was empirically derived by permutation procedure and Bonferroni correction. RESULTS The topmost pathway-based association signals were observed in PRKCA gene for diastolic blood pressure (DBP), systolic blood pressure (SBP), and mean arterial pressure (MAP) in both real and simulated data. The associations remained significant (P <0.05) after multiple testing corrections for the number of genes. Similarly, the pathway-based 2-locus epistasis analysis indicated significant interactions between INSR and PRKCG for SBP and MAP; INS and PIK3R2 for DBP; PIK3CD and ATP1B2 for hypertension in the real data set. We also observed significant within-gene interactions in INSR for SBP, DBP, and hypertension in the simulated data set. CONCLUSION The findings from this study show that pathway-based analytical approach targeting known biological pathways can be useful in identification of disease-associated variants that are otherwise undetectable by GWAS. The approach takes advantage of the assumption of nonindependence of variants within and across pathway genes which leads to reduced penalty of multiple testing and thus less-stringent statistical significance threshold.
Collapse
Affiliation(s)
- Bamidele O. Tayo
- Department of Public Health Sciences, Stritch School of Medicine, Loyola University Chicago, Maywood, IL 60153 USA
| | - Liping Tong
- Department of Public Health Sciences, Stritch School of Medicine, Loyola University Chicago, Maywood, IL 60153 USA
| | - Richard S. Cooper
- Department of Public Health Sciences, Stritch School of Medicine, Loyola University Chicago, Maywood, IL 60153 USA
| |
Collapse
|
7
|
AP-SKAT: highly-efficient genome-wide rare variant association test. BMC Genomics 2016; 17:745. [PMID: 27654840 PMCID: PMC5031335 DOI: 10.1186/s12864-016-3094-3] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2016] [Accepted: 09/15/2016] [Indexed: 01/04/2023] Open
Abstract
BACKGROUND Genome-wide association studies have revealed associations between single-nucleotide polymorphisms (SNPs) and phenotypes such as disease symptoms and drug tolerance. To address the small sample size for rare variants, association studies tend to group gene or pathway level variants and evaluate the effect on the set of variants. One of such strategies, known as the sequential kernel association test (SKAT), is a widely used collapsing method. However, the reported p-values from SKAT tend to be biased because the asymptotic property of the statistic is used to calculate the p-value. Although this bias can be corrected by applying permutation procedures for the test statistics, the computational cost of obtaining p-values with high resolution is prohibitive. RESULTS To address this problem, we devise an adaptive SKAT procedure termed AP-SKAT that efficiently classifies significant SNP sets and ranks them according to the permuted p-values. Our procedure adaptively stops the permutation test when the significance level is outside some confidence interval of the estimated p-value for a binomial distribution. To evaluate the performance, we first compare the power and sample size calculation and the type I error rates estimate of SKAT, SKAT-O, and the proposed procedure using genotype data in the SKAT R package and from 1000 Genome Project. Through computational experiments using whole genome sequencing and SNP array data, we show that our proposed procedure is highly efficient and has comparable accuracy to the standard procedure. CONCLUSIONS For several types of genetic data, the developed procedure could achieve competitive power and sample size under small and large sample size conditions with controlling considerable type I error rates, and estimate p-values of significant SNP sets that are consistent with those estimated by the standard permutation test within a realistic time. This demonstrates that the procedure is sufficiently powerful for recent whole genome sequencing and SNP array data with increasing numbers of phenotypes. Additionally, this procedure can be used in other association tests by employing alternative methods to calculate the statistics.
Collapse
|
8
|
Moore CCB, Basile AO, Wallace JR, Frase AT, Ritchie MD. A biologically informed method for detecting rare variant associations. BioData Min 2016; 9:27. [PMID: 27582876 PMCID: PMC5006419 DOI: 10.1186/s13040-016-0107-3] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2016] [Accepted: 06/18/2016] [Indexed: 11/29/2022] Open
Abstract
BACKGROUND BioBin is a bioinformatics software package developed to automate the process of binning rare variants into groups for statistical association analysis using a biological knowledge-driven framework. BioBin collapses variants into biological features such as genes, pathways, evolutionary conserved regions (ECRs), protein families, regulatory regions, and others based on user-designated parameters. BioBin provides the infrastructure to create complex and interesting hypotheses in an automated fashion thereby circumventing the necessity for advanced and time consuming scripting. PURPOSE OF THE STUDY In this manuscript, we describe the software package for BioBin, along with type I error and power simulations to demonstrate the strengths and various customizable features and analysis options of this variant binning tool. RESULTS Simulation testing highlights the utility of BioBin as a fast, comprehensive and expandable tool for the biologically-inspired binning and analysis of low-frequency variants in sequence data. CONCLUSIONS AND POTENTIAL IMPLICATIONS The BioBin software package has the capability to transform and streamline the analysis pipelines for researchers analyzing rare variants. This automated bioinformatics tool minimizes the manual effort of creating genomic regions for binning such that time can be spent on the much more interesting task of statistical analyses. This software package is open source and freely available from http://ritchielab.com/software/biobin-download.
Collapse
Affiliation(s)
| | - Anna Okula Basile
- Department of Biochemistry and Molecular Biology, Center for Systems Genomics, The Pennsylvania State University, University Park, PA 16802 USA
| | - John Robert Wallace
- Biomedical and Translational Informatics, Geisinger Health System, Danville, PA 17821 USA
| | - Alex Thomas Frase
- Biomedical and Translational Informatics, Geisinger Health System, Danville, PA 17821 USA
| | - Marylyn DeRiggi Ritchie
- Department of Biochemistry and Molecular Biology, Center for Systems Genomics, The Pennsylvania State University, University Park, PA 16802 USA
- Biomedical and Translational Informatics, Geisinger Health System, Danville, PA 17821 USA
| |
Collapse
|
9
|
Priest JR, Osoegawa K, Mohammed N, Nanda V, Kundu R, Schultz K, Lammer EJ, Girirajan S, Scheetz T, Waggott D, Haddad F, Reddy S, Bernstein D, Burns T, Steimle JD, Yang XH, Moskowitz IP, Hurles M, Lifton RP, Nickerson D, Bamshad M, Eichler EE, Mital S, Sheffield V, Quertermous T, Gelb BD, Portman M, Ashley EA. De Novo and Rare Variants at Multiple Loci Support the Oligogenic Origins of Atrioventricular Septal Heart Defects. PLoS Genet 2016; 12:e1005963. [PMID: 27058611 PMCID: PMC4825975 DOI: 10.1371/journal.pgen.1005963] [Citation(s) in RCA: 72] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2015] [Accepted: 03/07/2016] [Indexed: 12/15/2022] Open
Abstract
Congenital heart disease (CHD) has a complex genetic etiology, and recent studies suggest that high penetrance de novo mutations may account for only a small fraction of disease. In a multi-institutional cohort surveyed by exome sequencing, combining analysis of 987 individuals (discovery cohort of 59 affected trios and 59 control trios, and a replication cohort of 100 affected singletons and 533 unaffected singletons) we observe variation at novel and known loci related to a specific cardiac malformation the atrioventricular septal defect (AVSD). In a primary analysis, by combining developmental coexpression networks with inheritance modeling, we identify a de novo mutation in the DNA binding domain of NR1D2 (p.R175W). We show that p.R175W changes the transcriptional activity of Nr1d2 using an in vitro transactivation model in HUVEC cells. Finally, we demonstrate previously unrecognized cardiovascular malformations in the Nr1d2tm1-Dgen knockout mouse. In secondary analyses we map genetic variation to protein-interaction networks suggesting a role for two collagen genes in AVSD, which we corroborate by burden testing in a second replication cohort of 100 AVSDs and 533 controls (p = 8.37e-08). Finally, we apply a rare-disease inheritance model to identify variation in genes previously associated with CHD (ZFPM2, NSD1, NOTCH1, VCAN, and MYH6), cardiac malformations in mouse models (ADAM17, CHRD, IFT140, PTPRJ, RYR1 and ATE1), and hypomorphic alleles of genes causing syndromic CHD (EHMT1, SRCAP, BBS2, NOTCH2, and KMT2D) in 14 of 59 trios, greatly exceeding variation in control trios without CHD (p = 9.60e-06). In total, 32% of trios carried at least one putatively disease-associated variant across 19 loci,suggesting that inherited and de novo variation across a heterogeneous group of loci may contribute to disease risk.
Collapse
Affiliation(s)
- James R. Priest
- Division of Pediatric Cardiology, Stanford University School of Medicine, Stanford University, Stanford, California, United States of America
- Cardiovascular Institute, Stanford University School of Medicine, Stanford University, Stanford, California, United States of America
| | - Kazutoyo Osoegawa
- Department of Pathology, Stanford University School of Medicine, Stanford University, Stanford, California, United States of America
| | - Nebil Mohammed
- University of California San Francisco Benioff Children’s Hospital Oakland, University of California San Francisco, San Francisco, California, United States of America
| | - Vivek Nanda
- Department of Vascular Surgery, Stanford University School of Medicine, Stanford University, Stanford, California, United States of America
| | - Ramendra Kundu
- Division of Cardiovascular Medicine, Stanford University School of Medicine, Stanford University, Stanford, California, United States of America
| | - Kathleen Schultz
- University of California San Francisco Benioff Children’s Hospital Oakland, University of California San Francisco, San Francisco, California, United States of America
| | - Edward J. Lammer
- University of California San Francisco Benioff Children’s Hospital Oakland, University of California San Francisco, San Francisco, California, United States of America
| | - Santhosh Girirajan
- Departments of Biochemistry, Molecular Biology, and Anthropology, Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Todd Scheetz
- College of Engineering, University of Iowa, Iowa City, Iowa, United States of America
| | - Daryl Waggott
- Division of Cardiovascular Medicine, Stanford University School of Medicine, Stanford University, Stanford, California, United States of America
| | - Francois Haddad
- Cardiovascular Institute, Stanford University School of Medicine, Stanford University, Stanford, California, United States of America
- Division of Cardiovascular Medicine, Stanford University School of Medicine, Stanford University, Stanford, California, United States of America
| | - Sushma Reddy
- Division of Pediatric Cardiology, Stanford University School of Medicine, Stanford University, Stanford, California, United States of America
- Cardiovascular Institute, Stanford University School of Medicine, Stanford University, Stanford, California, United States of America
| | - Daniel Bernstein
- Division of Pediatric Cardiology, Stanford University School of Medicine, Stanford University, Stanford, California, United States of America
- Cardiovascular Institute, Stanford University School of Medicine, Stanford University, Stanford, California, United States of America
| | - Trudy Burns
- College of Public Health, University of Iowa, Iowa City, Iowa, United States of America
| | - Jeffrey D. Steimle
- Department of Pathology, University of Chicago, Chicago, Illinois, United States of America
| | - Xinan H. Yang
- Department of Pathology, University of Chicago, Chicago, Illinois, United States of America
| | - Ivan P. Moskowitz
- Department of Pathology, University of Chicago, Chicago, Illinois, United States of America
| | - Matthew Hurles
- Wellcome Trust Sanger Institute, Hinxton, Cambridge, United Kingdom
| | - Richard P. Lifton
- Department of Genetics, Yale University, New Haven, Connecticut, United States of America
- Howard Hughes Medical Institute, Chevy Chase, Maryland, United States of America
| | - Debbie Nickerson
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
| | - Michael Bamshad
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
- Department of Pediatrics, University of Washington, Seattle, Washington, United States of America
| | - Evan E. Eichler
- Howard Hughes Medical Institute, Chevy Chase, Maryland, United States of America
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
| | - Seema Mital
- Department of Pediatrics, University of Toronto, Toronto, Ontario, Canada
| | - Val Sheffield
- Howard Hughes Medical Institute, Chevy Chase, Maryland, United States of America
- Division of Medical Genetics, University of Iowa Carver College of Medicine, Iowa City, Iowa, United States of America
| | - Thomas Quertermous
- Cardiovascular Institute, Stanford University School of Medicine, Stanford University, Stanford, California, United States of America
- Division of Cardiovascular Medicine, Stanford University School of Medicine, Stanford University, Stanford, California, United States of America
| | - Bruce D. Gelb
- Mindich Child Health and Development Institute, Icahn School of Medicine at Mt. Sinai, New York, New York, United States of America
| | - Michael Portman
- Department of Pediatrics, University of Washington, Seattle, Washington, United States of America
| | - Euan A. Ashley
- Cardiovascular Institute, Stanford University School of Medicine, Stanford University, Stanford, California, United States of America
- Division of Cardiovascular Medicine, Stanford University School of Medicine, Stanford University, Stanford, California, United States of America
| |
Collapse
|
10
|
Basile AO, Wallace JR, Peissig P, McCarty CA, Brilliant M, Ritchie MD. KNOWLEDGE DRIVEN BINNING AND PHEWAS ANALYSIS IN MARSHFIELD PERSONALIZED MEDICINE RESEARCH PROJECT USING BIOBIN. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2016; 21:249-260. [PMID: 26776191 PMCID: PMC4824557] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Next-generation sequencing technology has presented an opportunity for rare variant discovery and association of these variants with disease. To address the challenges of rare variant analysis, multiple statistical methods have been developed for combining rare variants to increase statistical power for detecting associations. BioBin is an automated tool that expands on collapsing/binning methods by performing multi-level variant aggregation with a flexible, biologically informed binning strategy using an internal biorepository, the Library of Knowledge (LOKI). The databases within LOKI provide variant details, regional annotations and pathway interactions which can be used to generate bins of biologically-related variants, thereby increasing the power of any subsequent statistical test. In this study, we expand the framework of BioBin to incorporate statistical tests, including a dispersion-based test, SKAT, thereby providing the option of performing a unified collapsing and statistical rare variant analysis in one tool. Extensive simulation studies performed on gene-coding regions showed a Bin-KAT analysis to have greater power than BioBin-regression in all simulated conditions, including variants influencing the phenotype in the same direction, a scenario where burden tests often retain greater power. The use of Madsen- Browning variant weighting increased power in the burden analysis to that equitable with Bin-KAT; but overall Bin-KAT retained equivalent or higher power under all conditions. Bin-KAT was applied to a study of 82 pharmacogenes sequenced in the Marshfield Personalized Medicine Research Project (PMRP). We looked for association of these genes with 9 different phenotypes extracted from the electronic health record. This study demonstrates that Bin-KAT is a powerful tool for the identification of genes harboring low frequency variants for complex phenotypes.
Collapse
Affiliation(s)
- Anna O Basile
- Department of Biochemistry, Microbiology and Molecular Biology, The Pennsylvania State University, University Park, PA, USA
| | | | | | | | | | | |
Collapse
|
11
|
Seldin MF. The genetics of human autoimmune disease: A perspective on progress in the field and future directions. J Autoimmun 2015; 64:1-12. [PMID: 26343334 PMCID: PMC4628839 DOI: 10.1016/j.jaut.2015.08.015] [Citation(s) in RCA: 72] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2015] [Accepted: 08/23/2015] [Indexed: 12/18/2022]
Abstract
Progress in defining the genetics of autoimmune disease has been dramatically enhanced by large scale genetic studies. Genome-wide approaches, examining hundreds or for some diseases thousands of cases and controls, have been implemented using high throughput genotyping and appropriate algorithms to provide a wealth of data over the last decade. These studies have identified hundreds of non-HLA loci as well as further defining HLA variations that predispose to different autoimmune diseases. These studies to identify genetic risk loci are also complemented by progress in gene expression studies including definition of expression quantitative trait loci (eQTL), various alterations in chromatin structure including histone marks, DNase I sensitivity, repressed chromatin regions as well as transcript factor binding sites. Integration of this information can partially explain why particular variations can alter proclivity to autoimmune phenotypes. Despite our incomplete knowledge base with only partial definition of hereditary factors and possible functional connections, this progress has and will continue to facilitate a better understanding of critical pathways and critical changes in immunoregulation. Advances in defining and understanding functional variants potentially can lead to both novel therapeutics and personalized medicine in which therapeutic approaches are chosen based on particular molecular phenotypes and genomic alterations.
Collapse
Affiliation(s)
- Michael F Seldin
- Department of Biochemistry and Molecular Medicine, University of California, Davis, Tupper Hall Room 4453, Davis, CA 95616, USA; Division of Rheumatology and Allergy, Department of Medicine, University of California, Davis, Tupper Hall Room 4453, Davis, CA 95616, USA.
| |
Collapse
|
12
|
Liu G, Liu Y, Jiang Q, Jiang Y, Feng R, Zhang L, Chen Z, Li K, Liu J. Convergent Genetic and Expression Datasets Highlight TREM2 in Parkinson’s Disease Susceptibility. Mol Neurobiol 2015; 53:4931-8. [DOI: 10.1007/s12035-015-9416-7] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2015] [Accepted: 09/01/2015] [Indexed: 10/23/2022]
|
13
|
Gross JB, Meyer B, Perkins M. The rise of Astyanax cavefish. Dev Dyn 2015; 244:1031-1038. [PMID: 25601346 PMCID: PMC4508244 DOI: 10.1002/dvdy.24253] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2014] [Revised: 01/08/2015] [Accepted: 01/10/2015] [Indexed: 01/01/2023] Open
Abstract
Numerous animals have invaded subterranean caverns and evolved remarkably similar features. These features include loss of vision and pigmentation, and gains in nonvisual sensation. This broad convergence echoes smaller-scale convergence, in which members of the same species repeatedly evolve the same cave-associated phenotypes. The blind Mexican tetra of the Sierra de El Abra region of northeastern Mexico has a complex origin, having recurrently colonized subterranean environments through numerous invasions of surface-dwelling fish. These colonizations likely occurred ∼1-5 MYa. Despite evidence of historical and contemporary gene flow between cave and surface forms, the cave-associated phenotype appears to remain quite stable in nature. This model system has provided insight to the mechanisms of phenotypic regression, the genetic basis for constructive trait evolution, and the origin of behavioral novelties. Here, we document the rise of this model system from its discovery by a Mexican surveyor in 1936, to a powerful system for cave biology and contemporary genetic research. The recently sequenced genome provides exciting opportunities for future research, and will help resolve several long-standing biological problems. Developmental Dynamics 244:1031-1038, 2015. © 2015 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Joshua B Gross
- University of Cincinnati, Department of Biological Sciences, Cincinnati Ohio
| | - Bradley Meyer
- University of Cincinnati, Department of Biological Sciences, Cincinnati Ohio
| | - Molly Perkins
- University of Cincinnati, Department of Biological Sciences, Cincinnati Ohio
| |
Collapse
|