51
|
Rediscovering the value of families for psychiatric genetics research. Mol Psychiatry 2019; 24:523-535. [PMID: 29955165 PMCID: PMC7028329 DOI: 10.1038/s41380-018-0073-x] [Citation(s) in RCA: 38] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/21/2017] [Revised: 01/11/2018] [Accepted: 03/26/2018] [Indexed: 01/09/2023]
Abstract
As it is likely that both common and rare genetic variation are important for complex disease risk, studies that examine the full range of the allelic frequency distribution should be utilized to dissect the genetic influences on mental illness. The rate limiting factor for inferring an association between a variant and a phenotype is inevitably the total number of copies of the minor allele captured in the studied sample. For rare variation, with minor allele frequencies of 0.5% or less, very large samples of unrelated individuals are necessary to unambiguously associate a locus with an illness. Unfortunately, such large samples are often cost prohibitive. However, by using alternative analytic strategies and studying related individuals, particularly those from large multiplex families, it is possible to reduce the required sample size while maintaining statistical power. We contend that using whole genome sequence (WGS) in extended pedigrees provides a cost-effective strategy for psychiatric gene mapping that complements common variant approaches and WGS in unrelated individuals. This was our impetus for forming the "Pedigree-Based Whole Genome Sequencing of Affective and Psychotic Disorders" consortium. In this review, we provide a rationale for the use of WGS with pedigrees in modern psychiatric genetics research. We begin with a focused review of the current literature, followed by a short history of family-based research in psychiatry. Next, we describe several advantages of pedigrees for WGS research, including power estimates, methods for studying the environment, and endophenotypes. We conclude with a brief description of our consortium and its goals.
Collapse
|
52
|
Reimand J, Isserlin R, Voisin V, Kucera M, Tannus-Lopes C, Rostamianfar A, Wadi L, Meyer M, Wong J, Xu C, Merico D, Bader GD. Pathway enrichment analysis and visualization of omics data using g:Profiler, GSEA, Cytoscape and EnrichmentMap. Nat Protoc 2019; 14:482-517. [PMID: 30664679 PMCID: PMC6607905 DOI: 10.1038/s41596-018-0103-9] [Citation(s) in RCA: 1051] [Impact Index Per Article: 175.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Pathway enrichment analysis helps researchers gain mechanistic insight into gene lists generated from genome-scale (omics) experiments. This method identifies biological pathways that are enriched in a gene list more than would be expected by chance. We explain the procedures of pathway enrichment analysis and present a practical step-by-step guide to help interpret gene lists resulting from RNA-seq and genome-sequencing experiments. The protocol comprises three major steps: definition of a gene list from omics data, determination of statistically enriched pathways, and visualization and interpretation of the results. We describe how to use this protocol with published examples of differentially expressed genes and mutated cancer genes; however, the principles can be applied to diverse types of omics data. The protocol describes innovative visualization techniques, provides comprehensive background and troubleshooting guidelines, and uses freely available and frequently updated software, including g:Profiler, Gene Set Enrichment Analysis (GSEA), Cytoscape and EnrichmentMap. The complete protocol can be performed in ~4.5 h and is designed for use by biologists with no prior bioinformatics training.
Collapse
Affiliation(s)
- Jüri Reimand
- Computational Biology Program, Ontario Institute for Cancer Research, Toronto, ON, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
| | - Ruth Isserlin
- The Donnelly Centre, University of Toronto, Toronto, ON, Canada
| | | | - Mike Kucera
- The Donnelly Centre, University of Toronto, Toronto, ON, Canada
| | | | | | - Lina Wadi
- Computational Biology Program, Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Mona Meyer
- Computational Biology Program, Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Jeff Wong
- The Donnelly Centre, University of Toronto, Toronto, ON, Canada
| | - Changjiang Xu
- The Donnelly Centre, University of Toronto, Toronto, ON, Canada
| | - Daniele Merico
- Deep Genomics Inc., Toronto, ON, Canada
- The Centre for Applied Genomics (TCAG), The Hospital for Sick Children, Toronto, ON, Canada
| | - Gary D Bader
- The Donnelly Centre, University of Toronto, Toronto, ON, Canada.
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada.
- Department of Computer Science, University of Toronto, Toronto, ON, Canada.
| |
Collapse
|
53
|
Guo Y, Zhou Y. A modified association test for rare and common variants based on affected sib-pair design. J Theor Biol 2019; 467:1-6. [PMID: 30707975 DOI: 10.1016/j.jtbi.2019.01.014] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2018] [Accepted: 01/08/2019] [Indexed: 11/18/2022]
Abstract
Current genome-wide association analysis has identified a great number of rare and common variants associated with common complex traits, however, more effective approaches for detecting associations between rare and common variants with common diseases are still demanded. Approaches for detecting rare variant association analysis will compromise the power when detecting the effects of rare and common variants simultaneously. In this paper, we extend an existing method of testing for rare variant association based on affected sib pairs (TOW-sib) and propose a variable weight test for rare and common variants association based on affected sib pairs (abbreviated as VW-TOWsib). The VW-TOWsib can be used to achieve the purpose of detecting the association of rare and common variants with complex diseases. Simulation results in various scenarios show that our proposed method is more powerful than existing methods for detecting effects of rare and common variants. At the same time, the VW-TOWsib also performs well as a method for rare variant association analysis.
Collapse
Affiliation(s)
- Yixing Guo
- Department of Statistics, School of Mathematical Sciences, Heilongjiang University and Heilongjiang Provincial Key Laboratory of the Theory and Computation of Complex Systems, Harbin 150080, China
| | - Ying Zhou
- Department of Statistics, School of Mathematical Sciences, Heilongjiang University and Heilongjiang Provincial Key Laboratory of the Theory and Computation of Complex Systems, Harbin 150080, China.
| |
Collapse
|
54
|
Zhang X, Basile AO, Pendergrass SA, Ritchie MD. Real world scenarios in rare variant association analysis: the impact of imbalance and sample size on the power in silico. BMC Bioinformatics 2019; 20:46. [PMID: 30669967 PMCID: PMC6343276 DOI: 10.1186/s12859-018-2591-6] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2018] [Accepted: 12/26/2018] [Indexed: 11/11/2022] Open
Abstract
Background The development of sequencing techniques and statistical methods provides great opportunities for identifying the impact of rare genetic variation on complex traits. However, there is a lack of knowledge on the impact of sample size, case numbers, the balance of cases vs controls for both burden and dispersion based rare variant association methods. For example, Phenome-Wide Association Studies may have a wide range of case and control sample sizes across hundreds of diagnoses and traits, and with the application of statistical methods to rare variants, it is important to understand the strengths and limitations of the analyses. Results We conducted a large-scale simulation of randomly selected low-frequency protein-coding regions using twelve different balanced samples with an equal number of cases and controls as well as twenty-one unbalanced sample scenarios. We further explored statistical performance of different minor allele frequency thresholds and a range of genetic effect sizes. Our simulation results demonstrate that using an unbalanced study design has an overall higher type I error rate for both burden and dispersion tests compared with a balanced study design. Regression has an overall higher type I error with balanced cases and controls, while SKAT has higher type I error for unbalanced case-control scenarios. We also found that both type I error and power were driven by the number of cases in addition to the case to control ratio under large control group scenarios. Based on our power simulations, we observed that a SKAT analysis with case numbers larger than 200 for unbalanced case-control models yielded over 90% power with relatively well controlled type I error. To achieve similar power in regression, over 500 cases are needed. Moreover, SKAT showed higher power to detect associations in unbalanced case-control scenarios than regression. Conclusions Our results provide important insights into rare variant association study designs by providing a landscape of type I error and statistical power for a wide range of sample sizes. These results can serve as a benchmark for making decisions about study design for rare variant analyses. Electronic supplementary material The online version of this article (10.1186/s12859-018-2591-6) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Xinyuan Zhang
- Genomics and Computational Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Anna O Basile
- Department of Biomedical Informatics, Columbia University, New York, NY, USA
| | - Sarah A Pendergrass
- Biomedical and Translational Informatics Institute, Geisinger, Danville, PA, USA
| | - Marylyn D Ritchie
- Genomics and Computational Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA. .,Department of Genetics, University of Pennsylvania, Perelman School of Medicine, Philadelphia, PA, USA.
| |
Collapse
|
55
|
Saad M, Wijsman EM. Association score testing for rare variants and binary traits in family data with shared controls. Brief Bioinform 2019; 20:245-253. [PMID: 28968627 DOI: 10.1093/bib/bbx107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2017] [Indexed: 11/12/2022] Open
Abstract
Genome-wide association studies have been an important approach used to localize trait loci, with primary focus on common variants. The multiple rare variant-common disease hypothesis may explain the missing heritability remaining after accounting for identified common variants. Advances of sequencing technologies with their decreasing costs, coupled with methodological advances in the context of association studies in large samples, now make the study of rare variants at a genome-wide scale feasible. The resurgence of family-based association designs because of their advantage in studying rare variants has also stimulated more methods development, mainly based on linear mixed models (LMMs). Other tests such as score tests can have advantages over the LMMs, but to date have mainly been proposed for single-marker association tests. In this article, we extend several score tests (χcorrected2, WQLS, and SKAT) to the multiple variant association framework. We evaluate and compare their statistical performances relative with the LMM. Moreover, we show that three tests can be cast as the difference between marker allele frequencies (AFs) estimated in each of the group of affected and unaffected subjects. We show that these tests are flexible, as they can be based on related, unrelated or both related and unrelated subjects. They also make feasible an increasingly common design that only sequences a subset of affected subjects (related or unrelated) and uses for comparison publicly available AFs estimated in a group of healthy subjects. Finally, we show the great impact of linkage disequilibrium on the performance of all these tests.
Collapse
Affiliation(s)
- Mohamad Saad
- Department of Biostatistics, University of Washington, Seattle, USA.,Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, USA.,Qatar Computing Research Institute, Hamad Bin Khalifa University, Doha, Qatar
| | - Ellen M Wijsman
- Department of Biostatistics, University of Washington, Seattle, USA.,Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, USA
| |
Collapse
|
56
|
Asgarbeik S, Mohammad Amoli M, Enayati S, Bandarian F, Nasli-Esfahani E, Forouzanfar K, Razi F, Angaji SA. The Role of ERRFI1+808T/G Polymorphism in Diabetic Nephropathy. INTERNATIONAL JOURNAL OF MOLECULAR AND CELLULAR MEDICINE 2019; 8:49-55. [PMID: 32351909 PMCID: PMC7175607 DOI: 10.22088/ijmcm.bums.8.2.49] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Subscribe] [Scholar Register] [Received: 03/13/2019] [Accepted: 06/06/2019] [Indexed: 11/17/2022]
Abstract
Nephropathy is a common diabetes complication. ERRFI1 gene which participates in various cellular pathways has been proposed as a candidate gene in diabetic nephropathy. This study aimed to investigate the role of +808T/G polymorphism (rs377349) in ERRFI1 gene in diabetic nephropathy. In this case-control study, patients including diabetes with nephropathy (DN=104), type 2 diabetes without nephropathy (DM=100), and healthy controls (HC=106) were included. DNA was extracted from blood, and genotyping of the +808T/G polymorphism was carried out using PCR-RFLP technique. The differences for genotype and allele frequencies for +808T/G polymorphism in ERRFI1 gene between DN vs. HC and DN+DM vs. HC were significant (P<0.05) while no significant difference between DN and DM was observed. The allele frequencies were significantly different in DN vs. HC and DN+DM vs. HC in males but not in females. G allele of +808T/G polymorphism in ERRFI1 gene has no significant role in development and progression of diabetic nephropathy in diabetes patients while it is a risk allele for developing diabetes in Iranian population.
Collapse
Affiliation(s)
- Saeedeh Asgarbeik
- Department of Cellular and Molecular Biology, Faculty of Biological Sciences, Kharazmi University, Tehran, Iran
| | - Mahsa Mohammad Amoli
- Metabolic Disorders Research Center, Endocrinology and Metabolism Molecular-Cellular Sciences Institute, Tehran University of Medical Sciences, Tehran, Iran
| | - Samaneh Enayati
- Endocrinology and Metabolism Research Center, Endocrinology and Metabolism Clinical Sciences Institute, Tehran University of Medical Sciences, Tehran, Iran
| | - Fatemeh Bandarian
- Personalized Medicine Research Center, Endocrinology and Metabolism Clinical Sciences Institute, Tehran University of Medical Sciences, Tehran, Iran
| | - Ensieh Nasli-Esfahani
- Diabetes Research Center, Endocrinology and Metabolism Clinical Sciences Institute, Tehran University of Medical Sciences, Tehran, Iran
| | - Katayoon Forouzanfar
- Elderly Health Research Center, Endocrinology and Metabolism Population Sciences Institute, Tehran University of Medical Sciences, Tehran, Iran
| | - Farideh Razi
- Diabetes Research Center, Endocrinology and Metabolism Clinical Sciences Institute, Tehran University of Medical Sciences, Tehran, Iran
| | - Seyed Abdolhamid Angaji
- Department of Cellular and Molecular Biology, Faculty of Biological Sciences, Kharazmi University, Tehran, Iran
| |
Collapse
|
57
|
|
58
|
Abstract
Genome-wide association studies (GWAS) can identify genetic variants responsible for naturally occurring and quantitative phenotypic variation. Association studies therefore provide a powerful complement to approaches that rely on de novo mutations for characterizing gene function. Although bacteria should be amenable to GWAS, few GWAS have been conducted on bacteria, and the extent to which nonindependence among genomic variants (e.g., linkage disequilibrium [LD]) and the genetic architecture of phenotypic traits will affect GWAS performance is unclear. We apply association analyses to identify candidate genes underlying variation in 20 biochemical, growth, and symbiotic phenotypes among 153 strains of Ensifer meliloti For 11 traits, we find genotype-phenotype associations that are stronger than expected by chance, with the candidates in relatively small linkage groups, indicating that LD does not preclude resolving association candidates to relatively small genomic regions. The significant candidates show an enrichment for nucleotide polymorphisms (SNPs) over gene presence-absence variation (PAV), and for five traits, candidates are enriched in large linkage groups, a possible signature of epistasis. Many of the variants most strongly associated with symbiosis phenotypes were in genes previously identified as being involved in nitrogen fixation or nodulation. For other traits, apparently strong associations were not stronger than the range of associations detected in permuted data. In sum, our data show that GWAS in bacteria may be a powerful tool for characterizing genetic architecture and identifying genes responsible for phenotypic variation. However, careful evaluation of candidates is necessary to avoid false signals of association.IMPORTANCE Genome-wide association analyses are a powerful approach for identifying gene function. These analyses are becoming commonplace in studies of humans, domesticated animals, and crop plants but have rarely been conducted in bacteria. We applied association analyses to 20 traits measured in Ensifer meliloti, an agriculturally and ecologically important bacterium because it fixes nitrogen when in symbiosis with leguminous plants. We identified candidate alleles and gene presence-absence variants underlying variation in symbiosis traits, antibiotic resistance, and use of various carbon sources; some of these candidates are in genes previously known to affect these traits whereas others were in genes that have not been well characterized. Our results point to the potential power of association analyses in bacteria, but also to the need to carefully evaluate the potential for false associations.
Collapse
|
59
|
Lee JY, Kim J, Kim SW, Park SK, Ahn SH, Lee MH, Suh YJ, Noh DY, Son BH, Cho YU, Lee SB, Lee JW, Hopper JL, Sung J. BRCA1/2-negative, high-risk breast cancers (BRCAX) for Asian women: genetic susceptibility loci and their potential impacts. Sci Rep 2018; 8:15263. [PMID: 30323354 PMCID: PMC6189145 DOI: 10.1038/s41598-018-31859-8] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2018] [Accepted: 08/22/2018] [Indexed: 02/06/2023] Open
Abstract
"BRCAX" refers breast cancers occurring in women with a family history predictive of being a BRCA1/2 mutation carrier, but BRCA1/2 genetic screening has failed to find causal mutations. In this study, we report the findings of the genetic architecture of BRCAX with novel and redefined candidate loci and their potential impacts on preventive strategy. We performed a genome-wide association study involving 1,469 BRCAX cases from the Korean Hereditary Breast Cancer study, and high-risk breast cancer cases (1,482 Asians and 9,902 Europeans) from the Breast Cancer Association Consortium. We also evaluated the previously reported susceptibility loci for their roles in the high-risk breast cancers. We have identified three novel loci (PDE7B, UBL3, and a new independent marker in CDKN2B-AS1) associated with BRCAX, and replicated previously reported SNPs (24 of 92) and moderate/high-penetrance (seven of 23) genes for Korean BRCAX. For the novel candidate loci, evidence supported their roles in regulatory function. We estimated that the common low-penetrance loci might explain a substantial part of high-risk breast cancer (39.4% for Koreans and 24.0% for Europeans). Our study findings suggest that common genetic markers with lower penetrance constitute a part of susceptibility to high-risk breast cancers, with potential implications for a more comprehensive genetic screening test.
Collapse
Affiliation(s)
- Joo-Yeon Lee
- Department of Health Science, Graduate School of Public Health, Seoul National University, Seoul, Republic of Korea
| | - Jisun Kim
- Department of Surgery, University of Ulsan College of Medicine, Asan Medical Center, Seoul, Republic of Korea
| | - Sung-Won Kim
- Department of Surgery, Daerim St. Mary's Hospital, Seoul, Republic of Korea
| | - Sue K Park
- Department of Preventive Medicine, Seoul National University College of Medicine, Seoul, Republic of Korea
- Department of Biomedical Science, Seoul National University College of Medicine, Seoul, Republic of Korea
- Cancer Research Institute, Seoul National University College of Medicine, Seoul, Republic of Korea
| | - Sei Hyun Ahn
- Department of Surgery, University of Ulsan College of Medicine, Asan Medical Center, Seoul, Republic of Korea
| | - Min Hyuk Lee
- Department of Surgery, Soonchunhyang University Seoul Hospital, Seoul, Republic of Korea
| | - Young Jin Suh
- Department of Surgery, St. Vincent's Hospital, The Catholic University of Korea School of Medicine, Seoul, Republic of Korea
| | - Dong-Young Noh
- Cancer Research Institute, Seoul National University College of Medicine, Seoul, Republic of Korea
- Department of Surgery, Seoul National University College of Medicine, Seoul, Republic of Korea
| | - Byung Ho Son
- Department of Surgery, University of Ulsan College of Medicine, Asan Medical Center, Seoul, Republic of Korea
| | - Young Up Cho
- Department of Surgery, Yonsei University College of Medicine, Yonsei Cancer Center, Seoul, Republic of Korea
| | - Sae Byul Lee
- Department of Surgery, University of Ulsan College of Medicine, Asan Medical Center, Seoul, Republic of Korea
| | - Jong Won Lee
- Department of Surgery, University of Ulsan College of Medicine, Asan Medical Center, Seoul, Republic of Korea.
| | - John L Hopper
- Department of Health Science, Graduate School of Public Health, Seoul National University, Seoul, Republic of Korea.
- Centre for Epidemiology and Biostatistics, University of Melbourne, Carlton, Victoria, Australia.
| | - Joohon Sung
- Department of Health Science, Graduate School of Public Health, Seoul National University, Seoul, Republic of Korea.
- Institute of Health & Environment, Seoul National University, Seoul, Republic of Korea.
- Bio-MAX Institute, Seoul National University, Seoul, Republic of Korea.
| |
Collapse
|
60
|
Zhu B, Mirabello L, Chatterjee N. A subregion-based burden test for simultaneous identification of susceptibility loci and subregions within. Genet Epidemiol 2018; 42:673-683. [PMID: 29931698 PMCID: PMC6185783 DOI: 10.1002/gepi.22134] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2018] [Revised: 04/14/2018] [Accepted: 05/04/2018] [Indexed: 01/08/2023]
Abstract
In rare variant association studies, aggregating rare and/or low frequency variants, may increase statistical power for detection of the underlying susceptibility gene or region. However, it is unclear which variants, or class of them, in a gene contribute most to the association. We proposed a subregion-based burden test (REBET) to simultaneously select susceptibility genes and identify important underlying subregions. The subregions are predefined by shared common biologic characteristics, such as the protein domain or functional impact. Based on a subset-based approach considering local correlations between combinations of test statistics of subregions, REBET is able to properly control the type I error rate while adjusting for multiple comparisons in a computationally efficient manner. Simulation studies show that REBET can achieve power competitive to alternative methods when rare variants cluster within subregions. In two case studies, REBET is able to identify known disease susceptibility genes, and more importantly pinpoint the unreported most susceptible subregions, which represent protein domains essential for gene function. R package REBET is available at https://dceg.cancer.gov/tools/analysis/rebet.
Collapse
Affiliation(s)
- Bin Zhu
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institute of Health, Bethesda, MD 20892, USA
| | - Lisa Mirabello
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institute of Health, Bethesda, MD 20892, USA
| | - Nilanjan Chatterjee
- Department of Biostatistics, Bloomberg School of Public Health, Johns Hopkins University, Baltimore, MD 21205, USA
- Department of Oncology, School of Medicine, Johns Hopkins University, Baltimore, MD 21205, USA
| |
Collapse
|
61
|
Wang X, Boekstegers F, Brinster R. Methods and results from the genome-wide association group at GAW20. BMC Genet 2018; 19:79. [PMID: 30255814 PMCID: PMC6157187 DOI: 10.1186/s12863-018-0649-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
BACKGROUND This paper summarizes the contributions from the Genome-wide Association Study group (GWAS group) of the GAW20. The GWAS group contributions focused on topics such as association tests, phenotype imputation, and application of empirical kinships. The goals of the GWAS group contributions were varied. A real or a simulated data set based on the Genetics of Lipid Lowering Drugs and Diet Network (GOLDN) study was employed by different methods. Different outcomes and covariates were considered, and quality control procedures varied throughout the contributions. RESULTS The consideration of heritability and family structure played a major role in some contributions. The inclusion of family information and adaptive weights based on data were found to improve power in genome-wide association studies. It was proven that gene-level approaches are more powerful than single-marker analysis. Other contributions focused on the comparison between pedigree-based kinship and empirical kinship matrices, and investigated similar results in heritability estimation, association mapping, and genomic prediction. A new approach for linkage mapping of triglyceride levels was able to identify a novel linkage signal. CONCLUSIONS This summary paper reports on promising statistical approaches and findings of the members of the GWAS group applied on real and simulated data which encompass the current topics of epigenetic and pharmacogenomics.
Collapse
Affiliation(s)
- Xuexia Wang
- University of North Texas, GAB 459, 1155 Union Circle #311430, Denton, TX 76203 USA
| | - Felix Boekstegers
- Institute of Medical Biometry and Informatics, University of Heidelberg, Im Neuenheimer Feld 130.3, 69120 Heidelberg, Germany
| | - Regina Brinster
- Institute of Medical Biometry and Informatics, University of Heidelberg, Im Neuenheimer Feld 130.3, 69120 Heidelberg, Germany
| |
Collapse
|
62
|
Butkiewicz M, Blue EE, Leung YY, Jian X, Marcora E, Renton AE, Kuzma A, Wang LS, Koboldt DC, Haines JL, Bush WS. Functional annotation of genomic variants in studies of late-onset Alzheimer's disease. Bioinformatics 2018; 34:2724-2731. [PMID: 29590295 PMCID: PMC6084586 DOI: 10.1093/bioinformatics/bty177] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2017] [Revised: 03/17/2018] [Accepted: 03/23/2018] [Indexed: 01/01/2023] Open
Abstract
Motivation Annotation of genomic variants is an increasingly important and complex part of the analysis of sequence-based genomic analyses. Computational predictions of variant function are routinely incorporated into gene-based analyses of rare-variants, though to date most studies use limited information for assessing variant function that is often agnostic of the disease being studied. Results In this work, we outline an annotation process motivated by the Alzheimer's Disease Sequencing Project, illustrate the impact of including tissue-specific transcript sets and sources of gene regulatory information and assess the potential impact of changing genomic builds on the annotation process. While these factors only impact a small proportion of total variant annotations (∼5%), they influence the potential analysis of a large fraction of genes (∼25%). Availability and implementation Individual variant annotations are available via the NIAGADS GenomicsDB, at https://www.niagads.org/genomics/ tools-and-software/databases/genomics-database. Annotations are also available for bulk download at https://www.niagads.org/datasets. Annotation processing software is available at http://www.icompbio.net/resources/software-and-downloads/. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Mariusz Butkiewicz
- Department of Population and Quantitative Health Sciences, Institute for Computational Biology, Case Western Reserve University, Cleveland, OH, USA
| | - Elizabeth E Blue
- Division of Medical Genetics, University of Washington, Seattle, WA, USA
| | - Yuk Yee Leung
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Xueqiu Jian
- Division of Epidemiology, Human Genetics and Environmental Sciences, University of Texas Health Science Center, Houston, TX, USA
| | - Edoardo Marcora
- Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Alan E Renton
- Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Amanda Kuzma
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Li-San Wang
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | | | - Jonathan L Haines
- Department of Population and Quantitative Health Sciences, Institute for Computational Biology, Case Western Reserve University, Cleveland, OH, USA
| | - William S Bush
- Department of Population and Quantitative Health Sciences, Institute for Computational Biology, Case Western Reserve University, Cleveland, OH, USA
| |
Collapse
|
63
|
Vélez JI, Lopera F, Creagh PK, Piñeros LB, Das D, Cervantes-Henríquez ML, Acosta-López JE, Isaza-Ruget MA, Espinosa LG, Easteal S, Quintero GA, Silva CT, Mastronardi CA, Arcos-Burgos M. Targeting Neuroplasticity, Cardiovascular, and Cognitive-Associated Genomic Variants in Familial Alzheimer's Disease. Mol Neurobiol 2018; 56:3235-3243. [PMID: 30112632 PMCID: PMC6476862 DOI: 10.1007/s12035-018-1298-z] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2018] [Accepted: 08/02/2018] [Indexed: 11/24/2022]
Abstract
The identification of novel genetic variants contributing to the widespread in the age of onset (AOO) of Alzheimer’s disease (AD) could aid in the prognosis and/or development of new therapeutic strategies focused on early interventions. We recruited 78 individuals with AD from the Paisa genetic isolate in Antioquia, Colombia. These individuals belong to the world largest multigenerational and extended pedigree segregating AD as a consequence of a dominant fully penetrant mutation in the PSEN1 gene and exhibit an AOO ranging from the early 1930s to the late 1970s. To shed light on the genetic underpinning that could explain the large spread of the age of onset (AOO) of AD, 64 single nucleotide polymorphisms (SNP) associated with neuroanatomical, cardiovascular, and cognitive measures in AD were genotyped. Standard quality control and filtering procedures were applied, and single- and multi-locus linear mixed-effects models were used to identify AOO-associated SNPs. A full two-locus interaction model was fitted to define how identified SNPs interact to modulate AOO. We identified two key epistatic interactions between the APOE*E2 allele and SNPs ASTN2-rs7852878 and SNTG1-rs16914781 that delay AOO by up to ~ 8 years (95% CI 3.2–12.7, P = 1.83 × 10−3) and ~ 7.6 years (95% CI 3.3–11.8, P = 8.69 × 10−4), respectively, and validated our previous finding indicating that APOE*E2 delays AOO of AD in PSEN1 E280 mutation carriers. This new evidence involving APOE*E2 as an AOO delayer could be used for developing precision medicine approaches and predictive genomics models to potentially determine AOO in individuals genetically predisposed to AD.
Collapse
Affiliation(s)
- Jorge I. Vélez
- Genomics and Predictive Medicine Group, Department of Genome Sciences, John Curtin School of Medical Research, The Australian National University, Canberra, ACT 2600 Australia
- Universidad del Norte, Barranquilla, Colombia
| | - Francisco Lopera
- Neuroscience Research Group, University of Antioquia, Medellín, Colombia
| | - Penelope K. Creagh
- Genomics and Predictive Medicine Group, Department of Genome Sciences, John Curtin School of Medical Research, The Australian National University, Canberra, ACT 2600 Australia
| | - Laura B. Piñeros
- GENIUROS, Center for Research in Genetics and Genomics, Institute of Translational Medicine, School of Medicine and Health Sciences, Universidad del Rosario, Bogotá, Colombia
| | - Debjani Das
- Genome Diversity and Health Group, Department of Genome Sciences, John Curtin School of Medical Research, The Australian National University, ACT, Canberra, 2600 Australia
| | - Martha L. Cervantes-Henríquez
- Universidad del Norte, Barranquilla, Colombia
- Grupo de Neurociencias del Caribe, Universidad Simón Bolívar, Barranquilla, Colombia
| | - Johan E. Acosta-López
- Grupo de Neurociencias del Caribe, Universidad Simón Bolívar, Barranquilla, Colombia
| | | | - Lady G. Espinosa
- INPAC Research Group, Fundación Universitaria Sanitas, Bogotá, Colombia
| | - Simon Easteal
- Genome Diversity and Health Group, Department of Genome Sciences, John Curtin School of Medical Research, The Australian National University, ACT, Canberra, 2600 Australia
| | - Gustavo A. Quintero
- Studies in Translational Microbiology and Emerging Diseases (MICROS) Research Group, School of Medicine and Health Sciences, Universidad del Rosario, Bogotá, Colombia
| | - Claudia Tamar Silva
- GENIUROS, Center for Research in Genetics and Genomics, Institute of Translational Medicine, School of Medicine and Health Sciences, Universidad del Rosario, Bogotá, Colombia
| | - Claudio A. Mastronardi
- Genomics and Predictive Medicine Group, Department of Genome Sciences, John Curtin School of Medical Research, The Australian National University, Canberra, ACT 2600 Australia
- Neuroscience Group (NeUROS), Institute of Translational Medicine, School of Medicine and Health Sciences, Universidad del Rosario, Bogotá, Colombia
| | - Mauricio Arcos-Burgos
- Genomics and Predictive Medicine Group, Department of Genome Sciences, John Curtin School of Medical Research, The Australian National University, Canberra, ACT 2600 Australia
- GENIUROS, Center for Research in Genetics and Genomics, Institute of Translational Medicine, School of Medicine and Health Sciences, Universidad del Rosario, Bogotá, Colombia
| |
Collapse
|
64
|
Fernández MV, Budde J, Del-Aguila JL, Ibañez L, Deming Y, Harari O, Norton J, Morris JC, Goate AM, Cruchaga C. Evaluation of Gene-Based Family-Based Methods to Detect Novel Genes Associated With Familial Late Onset Alzheimer Disease. Front Neurosci 2018; 12:209. [PMID: 29670507 PMCID: PMC5893779 DOI: 10.3389/fnins.2018.00209] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2017] [Accepted: 03/15/2018] [Indexed: 12/22/2022] Open
Abstract
Gene-based tests to study the combined effect of rare variants on a particular phenotype have been widely developed for case-control studies, but their evolution and adaptation for family-based studies, especially studies of complex incomplete families, has been slower. In this study, we have performed a practical examination of all the latest gene-based methods available for family-based study designs using both simulated and real datasets. We examined the performance of several collapsing, variance-component, and transmission disequilibrium tests across eight different software packages and 22 models utilizing a cohort of 285 families (N = 1,235) with late-onset Alzheimer disease (LOAD). After a thorough examination of each of these tests, we propose a methodological approach to identify, with high confidence, genes associated with the tested phenotype and we provide recommendations to select the best software and model for family-based gene-based analyses. Additionally, in our dataset, we identified PTK2B, a GWAS candidate gene for sporadic AD, along with six novel genes (CHRD, CLCN2, HDLBP, CPAMD8, NLRP9, and MAS1L) as candidate genes for familial LOAD.
Collapse
Affiliation(s)
- Maria V. Fernández
- Department of Psychiatry, Washington University School of Medicine, St. Louis, MO, United States
- Hope Center for Neurological Disorders, Washington University School of Medicine, St. Louis, MO, United States
| | - John Budde
- Department of Psychiatry, Washington University School of Medicine, St. Louis, MO, United States
- Hope Center for Neurological Disorders, Washington University School of Medicine, St. Louis, MO, United States
| | - Jorge L. Del-Aguila
- Department of Psychiatry, Washington University School of Medicine, St. Louis, MO, United States
- Hope Center for Neurological Disorders, Washington University School of Medicine, St. Louis, MO, United States
| | - Laura Ibañez
- Department of Psychiatry, Washington University School of Medicine, St. Louis, MO, United States
- Hope Center for Neurological Disorders, Washington University School of Medicine, St. Louis, MO, United States
| | - Yuetiva Deming
- Department of Psychiatry, Washington University School of Medicine, St. Louis, MO, United States
- Hope Center for Neurological Disorders, Washington University School of Medicine, St. Louis, MO, United States
| | - Oscar Harari
- Department of Psychiatry, Washington University School of Medicine, St. Louis, MO, United States
- Hope Center for Neurological Disorders, Washington University School of Medicine, St. Louis, MO, United States
| | - Joanne Norton
- Department of Psychiatry, Washington University School of Medicine, St. Louis, MO, United States
- Hope Center for Neurological Disorders, Washington University School of Medicine, St. Louis, MO, United States
| | - John C. Morris
- Hope Center for Neurological Disorders, Washington University School of Medicine, St. Louis, MO, United States
- Knight Alzheimer's Disease Research Center, Washington University School of Medicine, St. Louis, MO, United States
| | - Alison M. Goate
- Department of Neuroscience, Ronald M. Loeb Center for Alzheimer's Disease, Icahn School of Medicine at Mount Sinai, New York, NY, United States
| | | | | | - Carlos Cruchaga
- Department of Psychiatry, Washington University School of Medicine, St. Louis, MO, United States
- Hope Center for Neurological Disorders, Washington University School of Medicine, St. Louis, MO, United States
| |
Collapse
|
65
|
Chen L, Wang Y, Zhou Y. Association analysis of multiple traits by an approach of combining P values. J Genet 2018; 97:79-85. [PMID: 29666327] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Increasing evidence shows that one variant can affect multiple traits, which is a widespread phenomenon in complex diseases. Joint analysis of multiple traits can increase statistical power of association analysis and uncover the underlying genetic mechanism. Although there are many statistical methods to analyse multiple traits, most of these methods are usually suitable for detecting common variants associated with multiple traits. However, because of low minor allele frequency of rare variant, these methods are not optimal for rare variant association analysis. In this paper, we extend an adaptive combination of P values method (termed ADA) for single trait to test association between multiple traits and rare variants in the given region. For a given region, we use reverse regression model to test each rare variant associated with multiple traits and obtain the P value of single-variant test. Further, we take the weighted combination of these P values as the test statistic. Extensive simulation studies show that our approach is more powerful than several other comparison methods in most cases and is robust to the inclusion of a high proportion of neutral variants and the different directions of effects of causal variants.
Collapse
Affiliation(s)
- Lili Chen
- Department of Mathematics, School of Sciences, Harbin Institute of Technology, Harbin 150001, People's Republic of China.
| | | | | |
Collapse
|
66
|
Russo A, Di Gaetano C, Cugliari G, Matullo G. Advances in the Genetics of Hypertension: The Effect of Rare Variants. Int J Mol Sci 2018; 19:E688. [PMID: 29495593 PMCID: PMC5877549 DOI: 10.3390/ijms19030688] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2018] [Revised: 02/19/2018] [Accepted: 02/26/2018] [Indexed: 12/22/2022] Open
Abstract
Worldwide, hypertension still represents a serious health burden with nine million people dying as a consequence of hypertension-related complications. Essential hypertension is a complex trait supported by multifactorial genetic inheritance together with environmental factors. The heritability of blood pressure (BP) is estimated to be 30-50%. A great effort was made to find genetic variants affecting BP levels through Genome-Wide Association Studies (GWAS). This approach relies on the "common disease-common variant" hypothesis and led to the identification of multiple genetic variants which explain, in aggregate, only 2-3% of the genetic variance of hypertension. Part of the missing genetic information could be caused by variants too rare to be detected by GWAS. The use of exome chips and Next-Generation Sequencing facilitated the discovery of causative variants. Here, we report the advances in the detection of novel rare variants, genes, and/or pathways through the most promising approaches, and the recent statistical tests that have emerged to handle rare variants. We also discuss the need to further support rare novel variants with replication studies within larger consortia and with deeper functional studies to better understand how new genes might improve patient care and the stratification of the response to antihypertensive treatments.
Collapse
Affiliation(s)
- Alessia Russo
- Department of Medical Sciences, University of Turin, 10126 Turin, Italy.
- Italian Institute for Genomic Medicine (IIGM, Formerly HuGeF), 10126 Turin, Italy.
| | - Cornelia Di Gaetano
- Department of Medical Sciences, University of Turin, 10126 Turin, Italy.
- Italian Institute for Genomic Medicine (IIGM, Formerly HuGeF), 10126 Turin, Italy.
| | - Giovanni Cugliari
- Department of Medical Sciences, University of Turin, 10126 Turin, Italy.
- Italian Institute for Genomic Medicine (IIGM, Formerly HuGeF), 10126 Turin, Italy.
| | - Giuseppe Matullo
- Department of Medical Sciences, University of Turin, 10126 Turin, Italy.
- Italian Institute for Genomic Medicine (IIGM, Formerly HuGeF), 10126 Turin, Italy.
| |
Collapse
|
67
|
Chen L, Wang Y, Zhou Y. Association analysis of multiple traits by an approach of combining
$$P$$
P
values. J Genet 2018. [DOI: 10.1007/s12041-018-0885-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
|
68
|
Timpson NJ, Greenwood CMT, Soranzo N, Lawson DJ, Richards JB. Genetic architecture: the shape of the genetic contribution to human traits and disease. Nat Rev Genet 2018; 19:110-124. [PMID: 29225335 DOI: 10.1038/nrg.2017.101] [Citation(s) in RCA: 240] [Impact Index Per Article: 34.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Genetic architecture describes the characteristics of genetic variation that are responsible for heritable phenotypic variability. It depends on the number of genetic variants affecting a trait, their frequencies in the population, the magnitude of their effects and their interactions with each other and the environment. Defining the genetic architecture of a complex trait or disease is central to the scientific and clinical goals of human genetics, which are to understand disease aetiology and aid in disease screening, diagnosis, prognosis and therapy. Recent technological advances have enabled genome-wide association studies and emerging next-generation sequencing studies to begin to decipher the nature of the heritable contribution to traits and disease. Here, we describe the types of genetic architecture that have been observed, how architecture can be measured and why an improved understanding of genetic architecture is central to future advances in the field.
Collapse
Affiliation(s)
- Nicholas J Timpson
- MRC Integrative Epidemiology Unit, School of Social and Community Medicine, University of Bristol, Oakfield House, Oakfield Grove, Clifton, Bristol BS8 2BN, UK
| | - Celia M T Greenwood
- Lady Davis Institute for Medical Research, Jewish General Hospital, McGill University, 3755 Cote Ste Catherine, Montréal, Québec H3T 1E2, Canada.,Department of Oncology, McGill University, 3755 Cote Ste Catherine, Montréal, Québec H3T 1E2, Canada.,Departments of Human Genetics and Epidemiology, Biostatistics and Occupational Health, McGill University, 3755 Cote Ste Catherine, Montréal, Québec H3T 1E2, Canada
| | - Nicole Soranzo
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1HH, UK.,Department of Haematology, University of Cambridge, Long Road, Cambridge CB2 0PT, UK
| | - Daniel J Lawson
- MRC Integrative Epidemiology Unit, School of Social and Community Medicine, University of Bristol, Oakfield House, Oakfield Grove, Clifton, Bristol BS8 2BN, UK
| | - J Brent Richards
- Departments of Human Genetics and Epidemiology, Biostatistics and Occupational Health, McGill University, 3755 Cote Ste Catherine, Montréal, Québec H3T 1E2, Canada.,Department of Medicine, Lady Davis Institute for Medical Research, Jewish General Hospital, McGill University, 3755 Cote Ste Catherine, Montréal, Québec H3T 1E2, Canada.,Department of Twin Research & Genetic Epidemiology, King's College London, St Thomas' Campus, Lambeth Palace Road, London SE1 7EH, UK
| |
Collapse
|
69
|
Kirichenko AV, Zorkoltseva IV, Belonogova NM, Axenovich TI. Use of Genotypes of Common Variants for Genome-Wide Regional Association Analysis. RUSS J GENET+ 2018. [DOI: 10.1134/s1022795418010076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
70
|
Association analysis of rare and common variants with multiple traits based on variable reduction method. Genet Res (Camb) 2018; 100:e2. [PMID: 29386084 DOI: 10.1017/s0016672317000052] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open
Abstract
Pleiotropy, the effect of one variant on multiple traits, is widespread in complex diseases. Joint analysis of multiple traits can improve statistical power to detect genetic variants and uncover the underlying genetic mechanism. Currently, a large number of existing methods target one common variant or only rare variants. Increasing evidence shows that complex diseases are caused by common and rare variants. Here we propose a region-based method to test both rare and common variant associated multiple traits based on variable reduction method (abbreviated as MULVR). However, in the presence of noise traits, the MULVR method may lose power, so we propose the MULVR-O method, which jointly analyses the optimal number of traits associated with genetic variants by the MULVR method, to guard against the effect of noise traits. Extensive simulation studies show that our proposed method (MULVR-O) is applied to not only multiple quantitative traits but also qualitative traits, and is more powerful than several other comparison methods in most scenarios. An application to the two genes (SHBG and CHRM3) and two phenotypes (systolic blood pressure and diastolic blood pressure) from the GAW19 dataset illustrates that our proposed methods (MULVR and MULVR-O) are feasible and efficient as a region-based method.
Collapse
|
71
|
Pihlstrøm L, Wiethoff S, Houlden H. Genetics of neurodegenerative diseases: an overview. HANDBOOK OF CLINICAL NEUROLOGY 2018; 145:309-323. [PMID: 28987179 DOI: 10.1016/b978-0-12-802395-2.00022-5] [Citation(s) in RCA: 56] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Genetic factors are central to the etiology of neurodegeneration, both as monogenic causes of heritable disease and as modifiers of susceptibility to complex, sporadic disorders. Over the last two decades, the identification of disease genes and risk loci has led to some of the greatest advances in medicine and invaluable insights into pathogenic mechanisms and disease pathways. Large-scale research efforts, novel study designs, and advances in methodology are rapidly expanding our understanding of the genome and the genetic architecture of neurodegenerative disease. Here, we review major developments in the field to date, highlighting overarching historic trends and general insights. Monogenic neurodegenerative diseases are discussed from the perspectives of both rare Mendelian forms of common disorders, such as Alzheimer disease and Parkinson disease, and heterogeneous heritable conditions, including ataxias and spastic paraplegias. Next, we summarize the experiences from investigations of complex neurodegenerative disorders, including genomewide association studies. In the final section, we reflect upon the limitations of current findings and outline important future directions. Genetics plays an essential role in translational research, ultimately aiming to develop novel disease-modifying therapies for neurodegenerative disorders. We anticipate that individual genetic profiling will also be increasingly relevant in a clinical context, with implications for patient care in line with the proposed ideal of personalized medicine.
Collapse
Affiliation(s)
- Lasse Pihlstrøm
- Institute of Clinical Medicine, University of Oslo, Oslo, Norway; UCL Institute of Neurology, National Hospital for Neurology and Neurosurgery, London, United Kingdom
| | - Sarah Wiethoff
- UCL Institute of Neurology, National Hospital for Neurology and Neurosurgery, London, United Kingdom; Center for Neurology and Hertie Institute for Clinical Brain Research, Eberhard-Karls-University, Tübingen, Germany
| | - Henry Houlden
- UCL Institute of Neurology, National Hospital for Neurology and Neurosurgery, London, United Kingdom.
| |
Collapse
|
72
|
Gu X, Gu D, He J, Rao DC, Hixson JE, Chen J, Li J, Huang J, Wu X, Rice TK, Shimmin LC, Kelly TN. Resequencing Epithelial Sodium Channel Genes Identifies Rare Variants Associated With Blood Pressure Salt-Sensitivity: The GenSalt Study. Am J Hypertens 2018; 31:205-211. [PMID: 29036630 PMCID: PMC5861537 DOI: 10.1093/ajh/hpx169] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2017] [Revised: 09/04/2017] [Accepted: 09/18/2017] [Indexed: 12/31/2022] Open
Abstract
BACKGROUND A resequencing study of renal epithelial sodium channel (ENaC) genes was conducted to identify rare variants associated with blood pressure (BP) salt-sensitivity. METHODS The Genetic Epidemiology Network of Salt-Sensitivity (GenSalt) study was conducted among 1,906 participants who underwent a 7-day low-sodium followed by a 7-day high-sodium feeding-study. The 300 most salt-sensitive and 300 most salt-resistant GenSalt participants were selected for the resequencing study. Three ENaC genes (SCNN1A, SCNN1B, and SCNN1G) were resequenced using capillary-based sequencing methods. Traditional burden tests were utilized to examine association between rare variants and BP salt-sensitivity. Associations of low-frequency and common variants were tested using single-marker analyses. RESULTS Carriers of SCNN1A rare variants had a 0.52 [95% confidence interval (CI): 0.32-0.85] decreased odds of BP salt-sensitivity compared with noncarriers. Neither SCNN1B nor SCNN1G associated with salt-sensitivity of BP in rare variant analyses (P = 0.65 and 0.48, respectively). In single-marker analyses, 3 independent common variants in SCNN1A, rs11614164, rs4764586, and rs3741914, associated with salt-sensitivity after Bonferroni correction (P = 4.4 × 10-4, 1.1 × 10-8, and 1.3 × 10-3). Each copy of the minor allele of rs4764586 was associated with a 1.36-fold (95% CI: 1.23-1.52) increased odds of salt-sensitivity, whereas each copy of the minor allele of rs11614164 and rs3741914 was associated with 0.68-fold (95% CI: 0.55-0.84) and 0.69-fold (95% CI: 0.54-0.86) decreased odds of salt-sensitivity, respectively. CONCLUSIONS This study demonstrated for the first time a relationship between rare variants in the ENaC pathway and BP salt-sensitivity. Future replication and functional studies are needed to confirm the findings in this study. CLINICAL TRIAL REGISTRY Trial Number NCT00721721.
Collapse
Affiliation(s)
- Xiaoying Gu
- Department of Epidemiology, Fuwai Hospital, National Center for Cardiovascular Diseases, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
- Department of Epidemiology, Tulane University School of Public Health and Tropical Medicine and School of Medicine, New Orleans, Louisiana, USA
| | - Dongfeng Gu
- Department of Epidemiology, Fuwai Hospital, National Center for Cardiovascular Diseases, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Jiang He
- Department of Epidemiology, Tulane University School of Public Health and Tropical Medicine and School of Medicine, New Orleans, Louisiana, USA
| | - Dabeeru C Rao
- Division of Biostatistics, Washington University School of Medicine, St. Louis, Missouri, USA
| | - James E Hixson
- Department of Epidemiology, Human Genetics and Environmental Sciences, University of Texas School of Public Health, Houston, Texas, USA
| | - Jichun Chen
- Department of Epidemiology, Fuwai Hospital, National Center for Cardiovascular Diseases, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Jianxin Li
- Department of Epidemiology, Fuwai Hospital, National Center for Cardiovascular Diseases, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Jianfeng Huang
- Department of Epidemiology, Fuwai Hospital, National Center for Cardiovascular Diseases, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Xigui Wu
- Department of Epidemiology, Fuwai Hospital, National Center for Cardiovascular Diseases, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Treva K Rice
- Division of Biostatistics, Washington University School of Medicine, St. Louis, Missouri, USA
| | - Lawrence C Shimmin
- Department of Epidemiology, Human Genetics and Environmental Sciences, University of Texas School of Public Health, Houston, Texas, USA
| | - Tanika N Kelly
- Department of Epidemiology, Tulane University School of Public Health and Tropical Medicine and School of Medicine, New Orleans, Louisiana, USA
| |
Collapse
|
73
|
Abstract
While genome-wide association studies have been very successful in identifying associations of common genetic variants with many different traits, the rarer frequency spectrum of the genome has not yet been comprehensively explored. Technological developments increasingly lift restrictions to access rare genetic variation. Dense reference panels enable improved genotype imputation for rarer variants in studies using DNA microarrays. Moreover, the decreasing cost of next generation sequencing makes whole exome and genome sequencing increasingly affordable for large samples. Large-scale efforts based on sequencing, such as ExAC, 100,000 Genomes, and TopMed, are likely to significantly advance this field.The main challenge in evaluating complex trait associations of rare variants is statistical power. The choice of population should be considered carefully because allele frequencies and linkage disequilibrium structure differ between populations. Genetically isolated populations can have favorable genomic characteristics for the study of rare variants.One strategy to increase power is to assess the combined effect of multiple rare variants within a region, known as aggregate testing. A range of methods have been developed for this. Model performance depends on the genetic architecture of the region of interest.
Collapse
Affiliation(s)
- Karoline Kuchenbaecker
- Wellcome Trust Sanger Institute, Cambridge, UK. .,University College London, London, UK.
| | - Emil Vincent Rosenbaum Appel
- Novo Nordisk Foundation Center for Basic Metabolic Research, Section for Metabolic Genetics, Faculty of Health Sciences, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
74
|
Lee JY, Moon S, Kim YK, Lee SH, Lee BS, Park MY, Park JE, Jang Y, Han BG. Genome-based exome sequencing analysis identifies GYG1, DIS3L and DDRGK1 are associated with myocardial infarction in Koreans. J Genet 2017; 96:1041-1046. [DOI: 10.1007/s12041-017-0854-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
|
75
|
England J, Drouin S, Beaulieu P, St-Onge P, Krajinovic M, Laverdière C, Levy E, Marcil V, Sinnett D. Genomic determinants of long-term cardiometabolic complications in childhood acute lymphoblastic leukemia survivors. BMC Cancer 2017; 17:751. [PMID: 29126409 PMCID: PMC5681795 DOI: 10.1186/s12885-017-3722-6] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2017] [Accepted: 10/30/2017] [Indexed: 12/21/2022] Open
Abstract
BACKGROUND While cure rates for childhood acute lymphoblastic leukemia (cALL) now exceed 80%, over 60% of survivors will face treatment-related long-term sequelae, including cardiometabolic complications such as obesity, insulin resistance, dyslipidemia and hypertension. Although genetic susceptibility contributes to the development of these problems, there are very few studies that have so far addressed this issue in a cALL survivorship context. METHODS In this study, we aimed at evaluating the associations between common and rare genetic variants and long-term cardiometabolic complications in survivors of cALL. We examined the cardiometabolic profile and performed whole-exome sequencing in 209 cALL survivors from the PETALE cohort. Variants associated with cardiometabolic outcomes were identified using PLINK (common) or SKAT (common and rare) and a logistic regression was used to evaluate their impact in multivariate models. RESULTS Our results showed that rare and common variants in the BAD and FCRL3 genes were associated (p<0.05) with an extreme cardiometabolic phenotype (3 or more cardiometabolic risk factors). Common variants in OGFOD3 and APOB as well as rare and common BAD variants were significantly (p<0.05) associated with dyslipidemia. Common BAD and SERPINA6 variants were associated (p<0.05) with obesity and insulin resistance, respectively. CONCLUSIONS In summary, we identified genetic susceptibility loci as contributing factors to the development of late treatment-related cardiometabolic complications in cALL survivors. These biomarkers could be used as early detection strategies to identify susceptible individuals and implement appropriate measures and follow-up to prevent the development of risk factors in this high-risk population.
Collapse
Affiliation(s)
- Jade England
- Research Centre, Sainte-Justine University Health Center, 3175 chemin de la Côte-Sainte-Catherine, Montreal, Quebec, H3T 1C5 Canada
| | - Simon Drouin
- Research Centre, Sainte-Justine University Health Center, 3175 chemin de la Côte-Sainte-Catherine, Montreal, Quebec, H3T 1C5 Canada
| | - Patrick Beaulieu
- Research Centre, Sainte-Justine University Health Center, 3175 chemin de la Côte-Sainte-Catherine, Montreal, Quebec, H3T 1C5 Canada
| | - Pascal St-Onge
- Research Centre, Sainte-Justine University Health Center, 3175 chemin de la Côte-Sainte-Catherine, Montreal, Quebec, H3T 1C5 Canada
| | - Maja Krajinovic
- Research Centre, Sainte-Justine University Health Center, 3175 chemin de la Côte-Sainte-Catherine, Montreal, Quebec, H3T 1C5 Canada
| | - Caroline Laverdière
- Research Centre, Sainte-Justine University Health Center, 3175 chemin de la Côte-Sainte-Catherine, Montreal, Quebec, H3T 1C5 Canada
- Departments of Pediatrics, Université de Montréal, Montreal, Quebec, H3T 1C5 Canada
| | - Emile Levy
- Research Centre, Sainte-Justine University Health Center, 3175 chemin de la Côte-Sainte-Catherine, Montreal, Quebec, H3T 1C5 Canada
- Departments of Nutrition, Université de Montréal, Montreal, Quebec, H3T 1C5 Canada
| | - Valérie Marcil
- Research Centre, Sainte-Justine University Health Center, 3175 chemin de la Côte-Sainte-Catherine, Montreal, Quebec, H3T 1C5 Canada
- Departments of Nutrition, Université de Montréal, Montreal, Quebec, H3T 1C5 Canada
| | - Daniel Sinnett
- Research Centre, Sainte-Justine University Health Center, 3175 chemin de la Côte-Sainte-Catherine, Montreal, Quebec, H3T 1C5 Canada
- Departments of Pediatrics, Université de Montréal, Montreal, Quebec, H3T 1C5 Canada
| |
Collapse
|
76
|
SweGen: a whole-genome data resource of genetic variability in a cross-section of the Swedish population. Eur J Hum Genet 2017; 25:1253-1260. [PMID: 28832569 PMCID: PMC5765326 DOI: 10.1038/ejhg.2017.130] [Citation(s) in RCA: 131] [Impact Index Per Article: 16.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2017] [Revised: 06/09/2017] [Accepted: 07/18/2017] [Indexed: 12/14/2022] Open
Abstract
Here we describe the SweGen data set, a comprehensive map of genetic variation in the Swedish population. These data represent a basic resource for clinical genetics laboratories as well as for sequencing-based association studies by providing information on genetic variant frequencies in a cohort that is well matched to national patient cohorts. To select samples for this study, we first examined the genetic structure of the Swedish population using high-density SNP-array data from a nation-wide cohort of over 10 000 Swedish-born individuals included in the Swedish Twin Registry. A total of 1000 individuals, reflecting a cross-section of the population and capturing the main genetic structure, were selected for whole-genome sequencing. Analysis pipelines were developed for automated alignment, variant calling and quality control of the sequencing data. This resulted in a genome-wide collection of aggregated variant frequencies in the Swedish population that we have made available to the scientific community through the website https://swefreq.nbis.se. A total of 29.2 million single-nucleotide variants and 3.8 million indels were detected in the 1000 samples, with 9.9 million of these variants not present in current databases. Each sample contributed with an average of 7199 individual-specific variants. In addition, an average of 8645 larger structural variants (SVs) were detected per individual, and we demonstrate that the population frequencies of these SVs can be used for efficient filtering analyses. Finally, our results show that the genetic diversity within Sweden is substantial compared with the diversity among continental European populations, underscoring the relevance of establishing a local reference data set.
Collapse
|
77
|
He H, Lei L, Chen E, Xu X, Wang L, Pan J, Yang F, Wang M, Dong J, Yang J. The screening of the functional microRNA binding site SNPs in sporadic colorectal cancer genes. Cancer Biol Ther 2017; 18:407-413. [PMID: 28494187 DOI: 10.1080/15384047.2017.1323584] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Sporadic colorectal cancer (sCRC) is one of the most commonly diagnosed cancers worldwide, but few genetic markers have been identified and used for its early detection. MicroRNAs are diverse cellular regulators in cancer pathogenesis that bind to the 3'-untranslated region (3'-UTR) of their target mRNAs, and variants within the miRNA target sites on sCRC-related genes may influence its pathogenesis. To investigate this possibility, we used a bioinformatical method to screen SNPs for putative changes in miRNA recognition sites within the 3'-UTR of sCRC-related genes. The rs11466537 single nucleotide polymorphism was predicted to modify the regulation of hsa-miR-1193 on the Transforming Growth Factor β Receptor II (TGFBR2) gene. Additionally, luciferase reporter assays indicated that hsa-miR-1193 bound the T allele more strongly than the A allele of rs11466537 (with A being the less frequent variant), and real time-polymerase chain reaction and western blot analysis showed that TGFBR2 is significantly repressed by hsa-miR-1193. Furthermore, overexpression of hsa-miR-1193 promoted HT-29 cell proliferation, while the loss of hsa-miR-1193 inhibited the process. Finally, the rs11466537 genotyping result revealed that the frequency of A allele carriers was 1.5% in the control blood samples, but 0 in the sCRC patients' normal colon tissue samples. Our results demonstrated that hsa-miR-1193 may be involved in sCRC tumourigenesis at least in part by suppression of TGFBR2, and the A allele of rs11466537 disturbed the regulation of hsa-miR-1193 on TGFBR2.
Collapse
Affiliation(s)
- Hongjuan He
- a College of Life Science, Institute of Preventive Genomic Medicine, Northwest University , Xi'an , Shaanxi , China
| | - Lei Lei
- a College of Life Science, Institute of Preventive Genomic Medicine, Northwest University , Xi'an , Shaanxi , China
| | - Erfei Chen
- a College of Life Science, Institute of Preventive Genomic Medicine, Northwest University , Xi'an , Shaanxi , China
| | - Xiaona Xu
- a College of Life Science, Institute of Preventive Genomic Medicine, Northwest University , Xi'an , Shaanxi , China
| | - Lili Wang
- a College of Life Science, Institute of Preventive Genomic Medicine, Northwest University , Xi'an , Shaanxi , China
| | - Junqiang Pan
- a College of Life Science, Institute of Preventive Genomic Medicine, Northwest University , Xi'an , Shaanxi , China
| | - Fangfang Yang
- a College of Life Science, Institute of Preventive Genomic Medicine, Northwest University , Xi'an , Shaanxi , China
| | - Min Wang
- a College of Life Science, Institute of Preventive Genomic Medicine, Northwest University , Xi'an , Shaanxi , China
| | - Jing Dong
- a College of Life Science, Institute of Preventive Genomic Medicine, Northwest University , Xi'an , Shaanxi , China
| | - Jin Yang
- a College of Life Science, Institute of Preventive Genomic Medicine, Northwest University , Xi'an , Shaanxi , China
| |
Collapse
|
78
|
Marvel SW, Rotroff DM, Wagner MJ, Buse JB, Havener TM, McLeod HL, Motsinger-Reif AA. Common and rare genetic markers of lipid variation in subjects with type 2 diabetes from the ACCORD clinical trial. PeerJ 2017; 5:e3187. [PMID: 28480134 PMCID: PMC5417062 DOI: 10.7717/peerj.3187] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2016] [Accepted: 03/15/2017] [Indexed: 12/17/2022] Open
Abstract
BACKGROUND Individuals with type 2 diabetes are at an increased risk of cardiovascular disease. Alterations in circulating lipid levels, total cholesterol (TC), low-density lipoprotein (LDL), high-density lipoprotein (HDL), and triglycerides (TG) are heritable risk factors for cardiovascular disease. Here we conduct a genome-wide association study (GWAS) of common and rare variants to investigate associations with baseline lipid levels in 7,844 individuals with type 2 diabetes from the ACCORD clinical trial. METHODS DNA extracted from stored blood samples from ACCORD participants were genotyped using the Affymetrix Axiom Biobank 1 Genotyping Array. After quality control and genotype imputation, association of common genetic variants (CV), defined as minor allele frequency (MAF) ≥ 3%, with baseline levels of TC, LDL, HDL, and TG was tested using a linear model. Rare variant (RV) associations (MAF < 3%) were conducted using a suite of methods that collapse multiple RV within individual genes. RESULTS Many statistically significant CV (p < 1 × 10-8) replicate findings in large meta-analyses in non-diabetic subjects. RV analyses also confirmed findings in other studies, whereas significant RV associations with CNOT2, HPN-AS1, and SIRPD appear to be novel (q < 0.1). DISCUSSION Here we present findings for the largest GWAS of lipid levels in people with type 2 diabetes to date. We identified 17 statistically significant (p < 1 × 10-8) associations of CV with lipid levels in 11 genes or chromosomal regions, all of which were previously identified in meta-analyses of mostly non-diabetic cohorts. We also identified 13 associations in 11 genes based on RV, several of which represent novel findings.
Collapse
Affiliation(s)
- Skylar W Marvel
- Bioinformatics Research Center, North Carolina State University, Raleigh, NC, United States of America
| | - Daniel M Rotroff
- Bioinformatics Research Center, North Carolina State University, Raleigh, NC, United States of America.,Department of Statistics, North Carolina State University, Raleigh, NC, United States of America
| | - Michael J Wagner
- Center for Pharmacogenomics and Individualized Therapy, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States of America
| | - John B Buse
- Division of Endocrinology, University of North Carolina School of Medicine, Chapel Hill, NC, United States of America
| | - Tammy M Havener
- Center for Pharmacogenomics and Individualized Therapy, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States of America
| | | | - Alison A Motsinger-Reif
- Bioinformatics Research Center, North Carolina State University, Raleigh, NC, United States of America.,Department of Statistics, North Carolina State University, Raleigh, NC, United States of America
| | | |
Collapse
|
79
|
Kelly TN, Li C, Hixson JE, Gu D, Rao DC, Huang J, Rice TK, Chen J, Cao J, Li J, Anderson CE, He J. Resequencing Study Identifies Rare Renin-Angiotensin-Aldosterone System Variants Associated With Blood Pressure Salt-Sensitivity: The GenSalt Study. Am J Hypertens 2017; 30:495-501. [PMID: 28199472 PMCID: PMC5861585 DOI: 10.1093/ajh/hpx004] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2016] [Revised: 01/02/2017] [Accepted: 01/30/2017] [Indexed: 12/17/2022] Open
Abstract
BACKGROUND The role of rare variants in blood pressure (BP) salt-sensitivity is unknown. We conducted a resequencing study of the renin-angiotensin-aldosterone system (RAAS) to identify rare variants associated with BP salt-sensitivity among participants of the Genetic Epidemiology Network of Salt-Sensitivity (GenSalt) study. METHODS The GenSalt study was conducted among 1,906 participants who underwent a 7-day low-sodium (51.3 mmol sodium/day) followed by a 7-day high-sodium feeding study (307.8 mmol sodium/day). The 300 most salt-sensitive and 300 most salt-resistant GenSalt participants were selected for the resequencing study. Seven RAAS genes were resequenced using capillary-based sequencing methods. Rare variants were tested for association with BP salt-sensitivity using traditional burden tests. Single-marker analyses were employed to test associations of low-frequency and common variants. RESULTS Aggregate rare variant analysis revealed an association of the RAAS pathway with BP salt-sensitivity. Carriers of rare RAAS variants had a 1.55-fold [95% confidence interval (CI): 1.15, 2.10] higher odds of salt-sensitivity compared to noncarriers (P = 0.004), a finding which was significant after Bonferroni correction. A nominal association of the APLN gene with salt-sensitivity was also identified, with rare APLN variants conferring a 2.22-fold (95% CI: 1.05, 6.58) higher odds of salt-sensitivity (P = 0.03). Single-marker analyses did not identify variant-BP salt-sensitivity associations after Bonferroni adjustment. A nominal association of a low-frequency, missense RENBP variant was identified. Each minor allele of rs78377269 conferred a 2.21-fold (95% CI: 1.10, 4.42) increased odds of salt-sensitivity (P = 0.03). CONCLUSIONS This study presents of the first evidence of a contribution of rare RAAS variants to BP salt-sensitivity. Clinical Trial RegistryTrial Number: NCT00721721.
Collapse
Affiliation(s)
- Tanika N Kelly
- Department of Epidemiology, Tulane University School of Public Health and Tropical Medicine, New Orleans, Louisiana, USA
| | - Changwei Li
- Department of Epidemiology and Biostatistics, College of Public Health, University of Georgia at Athens, Athens, Georgia, USA
| | - James E Hixson
- Department of Epidemiology, Human Genetics and Environmental Sciences, University of Texas School of Public Health, Houston, Texas, USA
| | - Dongfeng Gu
- Cardiovascular Institute and Fuwai Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, and Chinese National Center for Cardiovascular Disease Control and Research, Beijing, China
| | - Dabeeru C Rao
- Division of Biostatistics, Washington University School of Medicine, St. Louis, Missouri, USA
| | - Jianfeng Huang
- Cardiovascular Institute and Fuwai Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, and Chinese National Center for Cardiovascular Disease Control and Research, Beijing, China
| | - Treva K Rice
- Division of Biostatistics, Washington University School of Medicine, St. Louis, Missouri, USA
| | - Jichun Chen
- Cardiovascular Institute and Fuwai Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, and Chinese National Center for Cardiovascular Disease Control and Research, Beijing, China
| | - Jie Cao
- Cardiovascular Institute and Fuwai Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, and Chinese National Center for Cardiovascular Disease Control and Research, Beijing, China
| | - Jianxin Li
- Cardiovascular Institute and Fuwai Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, and Chinese National Center for Cardiovascular Disease Control and Research, Beijing, China
| | - Christopher E Anderson
- Department of Epidemiology, Tulane University School of Public Health and Tropical Medicine, New Orleans, Louisiana, USA
| | - Jiang He
- Department of Epidemiology, Tulane University School of Public Health and Tropical Medicine, New Orleans, Louisiana, USA
- Department of Epidemiology, School of Medicine, New Orleans, Louisiana, USA
| |
Collapse
|
80
|
Liu M, Moon S, Wang L, Kim S, Kim YJ, Hwang MY, Kim YJ, Elston RC, Kim BJ, Won S. On the association analysis of CNV data: a fast and robust family-based association method. BMC Bioinformatics 2017; 18:217. [PMID: 28420343 PMCID: PMC5395793 DOI: 10.1186/s12859-017-1622-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2016] [Accepted: 03/31/2017] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Copy number variation (CNV) is known to play an important role in the genetics of complex diseases and several methods have been proposed to detect association of CNV with phenotypes of interest. Statistical methods for CNV association analysis can be categorized into two different strategies. First, the copy number is estimated by maximum likelihood and association of the expected copy number with the phenotype is tested. Second, the observed probe intensity measurements can be directly used to detect association of CNV with the phenotypes of interest. RESULTS For each strategy we provide a statistic that can be applied to extended families. The computational efficiency of the proposed methods enables genome-wide association analysis and we show with simulation studies that the proposed methods outperform other existing approaches. In particular, we found that the first strategy is always more efficient than the second strategy no matter whether copy numbers for each individual are well identified or not. With the proposed methods, we performed genome-wide CNV association analyses of hematological trait, hematocrit, on 521 Korean family samples. CONCLUSIONS We found that statistical analysis with the expected copy number is more powerful than the statistic with the probe intensity measurements regardless of the accuracy of the estimation of copy numbers.
Collapse
Affiliation(s)
- Meiling Liu
- Department of Applied Statistics, Chung-Ang University, Seoul, 156-756, South Korea.,Department of Bioinformatics and Computational Biology, Iowa State University, Ames, IA, 50011, USA
| | - Sanghoon Moon
- Division of Structural and Functional Genomics, Center for Genome Science, National Institute of Health, Cheongju-si, Chungcheongbuk-do, 363-951, South Korea
| | - Longfei Wang
- Interdisciplinary Program of Bioinformatics, Seoul National University, Seoul, 151-742, South Korea
| | - Sulgi Kim
- Naver Labs, 235 Pangyoyeok-ro, Bundang-gu, Seongnam-si, Gyeonggi-do, 13494, South Korea
| | - Yeon-Jung Kim
- Division of Structural and Functional Genomics, Center for Genome Science, National Institute of Health, Cheongju-si, Chungcheongbuk-do, 363-951, South Korea
| | - Mi Yeong Hwang
- Division of Structural and Functional Genomics, Center for Genome Science, National Institute of Health, Cheongju-si, Chungcheongbuk-do, 363-951, South Korea
| | - Young Jin Kim
- Division of Structural and Functional Genomics, Center for Genome Science, National Institute of Health, Cheongju-si, Chungcheongbuk-do, 363-951, South Korea
| | - Robert C Elston
- Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH, 44106, USA
| | - Bong-Jo Kim
- Division of Structural and Functional Genomics, Center for Genome Science, National Institute of Health, Cheongju-si, Chungcheongbuk-do, 363-951, South Korea.
| | - Sungho Won
- Interdisciplinary Program of Bioinformatics, Seoul National University, Seoul, 151-742, South Korea. .,Department of Public Health Science, Seoul National University, Seoul, 151-742, South Korea. .,Institute of Health and Environment, Seoul National University, Seoul, 151-742, South Korea.
| |
Collapse
|
81
|
Longitudinal data analysis for rare variants detection with penalized quadratic inference function. Sci Rep 2017; 7:650. [PMID: 28381821 PMCID: PMC5429681 DOI: 10.1038/s41598-017-00712-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2016] [Accepted: 03/08/2017] [Indexed: 11/08/2022] Open
Abstract
Longitudinal genetic data provide more information regarding genetic effects over time compared with cross-sectional data. Coupled with next-generation sequencing technologies, it becomes reality to identify important genes containing both rare and common variants in a longitudinal design. In this work, we adopted a weighted sum statistic (WSS) to collapse multiple variants in a gene region to form a gene score. When multiple genes in a pathway were considered together, a penalized longitudinal model under the quadratic inference function (QIF) framework was applied for efficient gene selection. We evaluated the estimation accuracy and model selection performance under different model settings, then applied the method to a real dataset from the Genetic Analysis Workshop 18 (GAW18). Compared with the unpenalized QIF method, the penalized QIF (pQIF) method achieved better estimation accuracy and higher selection efficiency. The pQIF remained optimal even when the working correlation structure was mis-specified. The real data analysis identified one important gene, angiotensin II receptor type 1 (AGTR1), in the Ca2+/AT-IIR/α-AR signaling pathway. The estimated effect implied that AGTR1 may have a protective effect for hypertension. Our pQIF method provides a general tool for longitudinal sequencing studies involving large numbers of genetic variants.
Collapse
|
82
|
Zhou T, Souzeau E, Sharma S, Landers J, Mills R, Goldberg I, Healey PR, Graham S, Hewitt AW, Mackey DA, Galanopoulos A, Casson RJ, Ruddle JB, Ellis J, Leo P, Brown MA, MacGregor S, Lynn DJ, Burdon KP, Craig JE. Whole exome sequencing implicates eye development, the unfolded protein response and plasma membrane homeostasis in primary open-angle glaucoma. PLoS One 2017; 12:e0172427. [PMID: 28264060 PMCID: PMC5338784 DOI: 10.1371/journal.pone.0172427] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2016] [Accepted: 02/03/2017] [Indexed: 11/18/2022] Open
Abstract
PURPOSE To identify biological processes associated with POAG and its subtypes, high-tension (HTG) and normal-tension glaucoma (NTG), by analyzing rare potentially damaging genetic variants. METHODS A total of 122 and 65 unrelated HTG and NTG participants, respectively, with early onset advanced POAG, 103 non-glaucoma controls and 993 unscreened ethnicity-matched controls were included in this study. Study participants without myocilin disease-causing variants and non-glaucoma controls were subjected to whole exome sequencing on an Illumina HiSeq2000. Exomes of participants were sequenced on an Illumina HiSeq2000. Qualifying variants were rare in the general population (MAF < 0.001) and potentially functionally damaging (nonsense, frameshift, splice or predicted pathogenic using SIFT or Polyphen2 software). Genes showing enrichment of qualifying variants in cases were selected for pathway and network analysis using InnateDB. RESULTS POAG cases showed enrichment of rare variants in camera-type eye development genes (p = 1.40×10-7, corrected p = 3.28×10-4). Implicated eye development genes were related to neuronal or retinal development. HTG cases were significantly enriched for key regulators in the unfolded protein response (UPR) (p = 7.72×10-5, corrected p = 0.013). The UPR is known to be involved in myocilin-related glaucoma; our results suggest the UPR has a role in non-myocilin causes of HTG. NTG cases showed enrichment in ion channel transport processes (p = 1.05×10-4, corrected p = 0.027) including calcium, chloride and phospholipid transporters involved in plasma membrane homeostasis. Network analysis also revealed enrichment of the MHC Class I antigen presentation pathway in HTG, and the EGFR1 and cell-cycle pathways in both HTG and NTG. CONCLUSION This study suggests that mutations in eye development genes are enriched in POAG. HTG can result from aberrant responses to protein misfolding which may be amenable to molecular chaperone therapy. NTG is associated with impaired plasma membrane homeostasis increasing susceptibility to apoptosis.
Collapse
Affiliation(s)
- Tiger Zhou
- Flinders University, Department of Ophthalmology, Bedford Park, South Australia, Australia
- * E-mail:
| | - Emmanuelle Souzeau
- Flinders University, Department of Ophthalmology, Bedford Park, South Australia, Australia
| | - Shiwani Sharma
- Flinders University, Department of Ophthalmology, Bedford Park, South Australia, Australia
| | - John Landers
- Flinders University, Department of Ophthalmology, Bedford Park, South Australia, Australia
| | - Richard Mills
- Flinders University, Department of Ophthalmology, Bedford Park, South Australia, Australia
| | - Ivan Goldberg
- University of Sydney Discipline of Ophthalmology, Sydney, Australia
- Glaucoma Unit, Sydney Eye Hospital, Sydney, Australia
| | - Paul R. Healey
- University of Sydney Discipline of Ophthalmology, Sydney, Australia
- Centre for Vision Research, Westmead Institute for Medical Research, University of Sydney, Sydney, Australia
| | - Stuart Graham
- University of Sydney Discipline of Ophthalmology, Sydney, Australia
| | - Alex W. Hewitt
- University of Tasmania Menzies Institute for Medical Research, Hobart, Australia
| | - David A. Mackey
- University of Western Australia Centre for Ophthalmology and Visual Science, Lions Eye Institute, Perth, Australia
| | - Anna Galanopoulos
- University of Adelaide, Discipline of Ophthalmology & Visual Sciences, Adelaide, Australia
| | - Robert J. Casson
- University of Adelaide, Discipline of Ophthalmology & Visual Sciences, Adelaide, Australia
| | - Jonathan B. Ruddle
- Centre for Eye Research Australia, Royal Victorian Eye and Ear Hospital, Melbourne, Australia
| | - Jonathan Ellis
- University of Queensland Diamantina Institute, Translational Research Institute, Princess Alexandra Hospital, Woolloongabba, Australia
| | - Paul Leo
- University of Queensland Diamantina Institute, Translational Research Institute, Princess Alexandra Hospital, Woolloongabba, Australia
| | - Matthew A. Brown
- University of Queensland Diamantina Institute, Translational Research Institute, Princess Alexandra Hospital, Woolloongabba, Australia
| | - Stuart MacGregor
- Statistical Genetics, QIMR Berghofer Medical Research Institute, Royal Brisbane Hospital, Brisbane, Australia
| | - David J. Lynn
- EMBL Australia Group, Infection & Immunity Theme, South Australian Medical and Health Research Institute, Adelaide, Australia
- Flinders University, School of Medicine, Adelaide, Australia
| | - Kathryn P. Burdon
- Flinders University, Department of Ophthalmology, Bedford Park, South Australia, Australia
- University of Tasmania Menzies Institute for Medical Research, Hobart, Australia
| | - Jamie E. Craig
- Flinders University, Department of Ophthalmology, Bedford Park, South Australia, Australia
| |
Collapse
|
83
|
Petersen BS, Fredrich B, Hoeppner MP, Ellinghaus D, Franke A. Opportunities and challenges of whole-genome and -exome sequencing. BMC Genet 2017; 18:14. [PMID: 28193154 PMCID: PMC5307692 DOI: 10.1186/s12863-017-0479-5] [Citation(s) in RCA: 127] [Impact Index Per Article: 15.9] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2016] [Accepted: 01/26/2017] [Indexed: 01/08/2023] Open
Abstract
Recent advances in the development of sequencing technologies provide researchers with unprecedented possibilities for genetic analyses. In this review, we will discuss the history of genetic studies and the progress driven by next-generation sequencing (NGS), using complex inflammatory bowel diseases as an example. We focus on the opportunities, but also challenges that researchers are facing when working with NGS data to unravel the genetic causes underlying diseases.
Collapse
Affiliation(s)
| | - Broder Fredrich
- Institute of Clinical Molecular Biology, Kiel University, Kiel, Germany
| | - Marc P Hoeppner
- Institute of Clinical Molecular Biology, Kiel University, Kiel, Germany
| | - David Ellinghaus
- Institute of Clinical Molecular Biology, Kiel University, Kiel, Germany
| | - Andre Franke
- Institute of Clinical Molecular Biology, Kiel University, Kiel, Germany.
| |
Collapse
|
84
|
Detecting disease association with rare variants in case-parents studies. J Hum Genet 2017; 62:549-552. [DOI: 10.1038/jhg.2017.1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2016] [Revised: 12/27/2016] [Accepted: 12/27/2016] [Indexed: 11/09/2022]
|
85
|
Lu ZH, Khondker Z, Ibrahim JG, Wang Y, Zhu H. Bayesian longitudinal low-rank regression models for imaging genetic data from longitudinal studies. Neuroimage 2017; 149:305-322. [PMID: 28143775 DOI: 10.1016/j.neuroimage.2017.01.052] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2016] [Revised: 12/27/2016] [Accepted: 01/22/2017] [Indexed: 12/29/2022] Open
Abstract
To perform a joint analysis of multivariate neuroimaging phenotypes and candidate genetic markers obtained from longitudinal studies, we develop a Bayesian longitudinal low-rank regression (L2R2) model. The L2R2 model integrates three key methodologies: a low-rank matrix for approximating the high-dimensional regression coefficient matrices corresponding to the genetic main effects and their interactions with time, penalized splines for characterizing the overall time effect, and a sparse factor analysis model coupled with random effects for capturing within-subject spatio-temporal correlations of longitudinal phenotypes. Posterior computation proceeds via an efficient Markov chain Monte Carlo algorithm. Simulations show that the L2R2 model outperforms several other competing methods. We apply the L2R2 model to investigate the effect of single nucleotide polymorphisms (SNPs) on the top 10 and top 40 previously reported Alzheimer disease-associated genes. We also identify associations between the interactions of these SNPs with patient age and the tissue volumes of 93 regions of interest from patients' brain images obtained from the Alzheimer's Disease Neuroimaging Initiative.
Collapse
Affiliation(s)
- Zhao-Hua Lu
- Department of Biostatistics, St. Jude Children's Research Hospital, Memphis, TN, USA
| | - Zakaria Khondker
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Joseph G Ibrahim
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Yue Wang
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Hongtu Zhu
- Department of Biostatistics, University of Texas MD Anderson Cancer Center, Houston, TX, USA.
| | | |
Collapse
|
86
|
Meta-analysis of quantitative pleiotropic traits for next-generation sequencing with multivariate functional linear models. Eur J Hum Genet 2016; 25:350-359. [PMID: 28000696 DOI: 10.1038/ejhg.2016.170] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2016] [Revised: 07/26/2016] [Accepted: 09/27/2016] [Indexed: 11/09/2022] Open
Abstract
To analyze next-generation sequencing data, multivariate functional linear models are developed for a meta-analysis of multiple studies to connect genetic variant data to multiple quantitative traits adjusting for covariates. The goal is to take the advantage of both meta-analysis and pleiotropic analysis in order to improve power and to carry out a unified association analysis of multiple studies and multiple traits of complex disorders. Three types of approximate F -distributions based on Pillai-Bartlett trace, Hotelling-Lawley trace, and Wilks's Lambda are introduced to test for association between multiple quantitative traits and multiple genetic variants. Simulation analysis is performed to evaluate false-positive rates and power of the proposed tests. The proposed methods are applied to analyze lipid traits in eight European cohorts. It is shown that it is more advantageous to perform multivariate analysis than univariate analysis in general, and it is more advantageous to perform meta-analysis of multiple studies instead of analyzing the individual studies separately. The proposed models require individual observations. The value of the current paper can be seen at least for two reasons: (a) the proposed methods can be applied to studies that have individual genotype data; (b) the proposed methods can be used as a criterion for future work that uses summary statistics to build test statistics to meta-analyze the data.
Collapse
|
87
|
Lee JY, Chen CY, Cheng KF. A model-free test for detecting disease association signals with multiple genetic variants and covariates. Stat Methods Med Res 2016; 27:2596-2609. [PMID: 30103661 DOI: 10.1177/0962280216683224] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Discoveries and analyses of genetic variants at a gene or exome based on high-throughput sequencing technology are increasingly feasible. Although many well-known association tests have already been proposed in literature for testing whether a group of variants in a target region is associated with a disease of interest, however, the analytic challenges still remain profound. The power performance of these tests generally depends on the sample size, numbers of causal and neutral variants, variant frequency, effect size, and direction. Some of these factors are not easily controllable in practical applications. Further complications arise from missing genotype, population stratification or misspecification of the working model. Previous studies showed that many model-based tests might create false positive results or decrease power when there was population stratification effect or missing genotype and simple imputation was used. Here, we demonstrate by simulations that type I errors of the well-known model-based tests are often inflated as well, even the working model deviates slightly from the true model. We propose a model-free test and show this test to be almost uniformly most powerful among the competing tests under very general simulation conditions with covariates. This test does not require genotype data to be complete and hence difficult imputation can be avoided. We also discuss how to adjust for the effect of population stratification based on principal components, and use a Shanghai Breast Cancer Study to demonstrate application of the new test.
Collapse
Affiliation(s)
- J Y Lee
- 1 Statistics Department, Feng Chia University, Taiwan
| | - Chiu-Ying Chen
- 2 Department of Public Health, China Medical University, Taiwan
| | - K F Cheng
- 3 College of Management and Big Data Center, Asia University, Taichung, Taiwan.,4 Biostatistics Center, Taipei Medical University, Taipei, Taiwan
| |
Collapse
|
88
|
Abstract
With the advance of sequencing technologies, it has become a routine practice to test for association between a quantitative trait and a set of rare variants (RVs). While a number of RV association tests have been proposed, there is a dearth of studies on the robustness of RV association testing for nonnormal distributed traits, e.g., due to skewness, which is ubiquitous in cohort studies. By extensive simulations, we demonstrate that commonly used RV tests, including sequence kernel association test (SKAT) and optimal unified SKAT (SKAT-O), are not robust to heavy-tailed or right-skewed trait distributions with inflated type I error rates; in contrast, the adaptive sum of powered score (aSPU) test is much more robust. Here we further propose a robust version of the aSPU test, called aSPUr. We conduct extensive simulations to evaluate the power of the tests, finding that for a larger number of RVs, aSPU is often more powerful than SKAT and SKAT-O, owing to its high data-adaptivity. We also compare different tests by conducting association analysis of triglyceride levels using the NHLBI ESP whole-exome sequencing data. The QQ plots for SKAT and SKAT-O were severely inflated (λ = 1.89 and 1.78, respectively), while those for aSPU and aSPUr behaved normally. Due to its relatively high robustness to outliers and high power of the aSPU test, we recommend its use complementary to SKAT and SKAT-O. If there is evidence of inflated type I error rate from the aSPU test, we would recommend the use of the more robust, but less powerful, aSPUr test.
Collapse
|
89
|
Sirovich L. A new structural approach to genomic discovery of disease: example of adult-onset diabetes. BIOLOGICAL CYBERNETICS 2016; 110:383-391. [PMID: 27443641 DOI: 10.1007/s00422-016-0692-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/11/2015] [Accepted: 07/12/2016] [Indexed: 06/06/2023]
Abstract
This paper reports on an investigation of disease discovery from genomic data, by methods which depart substantially from customary practices found in the investigation of genome-wide association studies. Such data in general are composed of the genomic content from two contrasting phenotypes, e.g., disease versus control populations, and the analysis proceeds under the hypothesis that populational dissimilarities might reveal disease risk alleles. The proposed suite of new methods is in part based on information theory (Shannon in Bell Syst Tech J 27:379-423, 1948a; Bell Syst Tech J 27:623-656, 1948b; Jaynes in Phys Rev 106:620-630, 1957), and strong evidence will be given of the effectiveness of this new approach. The methodology extends naturally and successfully to predicting genomic disposition to disease arising from large collections of weakly contributing genomic loci. Evidence will be advanced that the example of adult-onset diabetes ("type 2 diabetes") is such a candidate disease, and in this case, probably for the first time, it can be demonstrated that disease prediction is possible. Another novel element of this study is the search and identification of potential beneficial genomic loci that may counter a disease. The generality of the methodology suggests that it might extend to other diseases.
Collapse
Affiliation(s)
- Lawrence Sirovich
- Center for Studies in Physics and Biology, Rockefeller University, New York, NY, 10065, USA.
| |
Collapse
|
90
|
Tobias JH, Gregson CL. Genetic Studies of Endophenotypes From Spine CT Scans Provide Novel Insights Into the Contribution of Mechanosensory Pathways to Vertebral Fractures and Spinal Curvature. J Bone Miner Res 2016; 31:2073-2076. [PMID: 27859714 DOI: 10.1002/jbmr.3032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/03/2016] [Revised: 10/24/2016] [Accepted: 11/06/2016] [Indexed: 11/06/2022]
Affiliation(s)
- Jon H Tobias
- Musculoskeletal Research Unit, School of Clinical Sciences, University of Bristol, Bristol, UK
| | - Celia L Gregson
- Musculoskeletal Research Unit, School of Clinical Sciences, University of Bristol, Bristol, UK
| |
Collapse
|
91
|
|
92
|
Kim YJ, Lee J, Kim BJ, Park T. PreCimp: Pre-collapsing imputation approach increases imputation accuracy of rare variants in terms of collapsed variables. Genet Epidemiol 2016; 41:41-50. [PMID: 27859580 DOI: 10.1002/gepi.22020] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2015] [Revised: 08/17/2016] [Accepted: 09/21/2016] [Indexed: 12/22/2022]
Abstract
Imputation is widely used for obtaining information about rare variants. However, one issue concerning imputation is the low accuracy of imputed rare variants as the inaccurate imputed rare variants may distort the results of region-based association tests. Therefore, we developed a pre-collapsing imputation method (PreCimp) to improve the accuracy of imputation by using collapsed variables. Briefly, collapsed variables are generated using rare variants in the reference panel, and a new reference panel is constructed by inserting pre-collapsed variables into the original reference panel. Following imputation analysis provides the imputed genotypes of the collapsed variables. We demonstrated the performance of PreCimp on 5,349 genotyped samples using a Korean population specific reference panel including 848 samples of exome sequencing, Affymetrix 5.0, and exome chip. PreCimp outperformed a traditional post-collapsing method that collapses imputed variants after single rare variant imputation analysis. Compared with the results of post-collapsing method, PreCimp approach was shown to relatively increase imputation accuracy about 3.4-6.3% when dosage r2 is between 0.6 and 0.8, 10.9-16.1% when dosage r2 is between 0.4 and 0.6, and 21.4 ∼ 129.4% when dosage r2 is below 0.4.
Collapse
Affiliation(s)
- Young Jin Kim
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Korea.,Division of Structural and Functional Genomics, Center for Genome Science, Korean National Institute of Health, Osong, Chungchungbuk-do, Korea
| | - Juyoung Lee
- Division of Structural and Functional Genomics, Center for Genome Science, Korean National Institute of Health, Osong, Chungchungbuk-do, Korea
| | - Bong-Jo Kim
- Division of Structural and Functional Genomics, Center for Genome Science, Korean National Institute of Health, Osong, Chungchungbuk-do, Korea
| | | | - Taesung Park
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Korea.,Department of Statistics, Seoul National University, Seoul, Korea
| |
Collapse
|
93
|
Svishcheva GR, Belonogova NM, Axenovich TI. Functional linear models for region-based association analysis. RUSS J GENET+ 2016. [DOI: 10.1134/s1022795416100124] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
94
|
Darst BF, Engelman CD. Transmission and decorrelation methods for detecting rare variants using sequencing data from related individuals. BMC Proc 2016; 10:203-207. [PMID: 27980637 PMCID: PMC5133523 DOI: 10.1186/s12919-016-0031-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
BACKGROUND Advances in whole genome sequencing have enabled the investigation of rare variants, which could explain some of the missing heritability that genome-wide association studies are unable to detect. Most methods to detect associations with rare variants are developed for unrelated individuals; however, several methods exist that utilize family studies and could have better power to detect such associations. METHODS Using whole genome sequencing data and simulated phenotypes provided by the organizers of the Genetic Analysis Workshop 19 (GAW19), we compared family-based methods that test for associations between rare and common variants with a quantitative trait. This was done using 2 fairly novel methods: family-based association test for rare variants (FBAT-RV), which is a transmission-based method that utilizes the transmission of genetic information from parent to offspring; and Minimum p value Optimized Nuisance parameter Score Test Extended to Relatives (MONSTER), which is a decorrelation method that instead attempts to adjust for relatedness using a regression-based method. We also considered family-based association test linear combination (FBAT-LC) and FBAT-Min P, which are slightly older methods that do not allow for the weighting of rare or common variants, but contrast some of the limitations of FBAT-RV. RESULTS MONSTER had much higher overall power than FBAT-RV and FBAT-Min P. Interestingly, FBAT-LC had similar overall power as MONSTER. MONSTER had the highest power for a gene accounting for a larger percent of the phenotypic variance, whereas MONSTER and FBAT-LC both had the highest power for a gene accounting for moderate variance. FBAT-LC had the highest power for a gene accounting for the least variance. CONCLUSIONS Based on the simulated data from GAW19, MONSTER and FBAT-LC were the most powerful of the methods assessed. However, there are limitations to each of these methods that should be carefully considered when conducting an analysis of rare variants in related individuals. This emphasizes the need for methods that can incorporate the advantages of each of these methods into 1 family-based association test for rare variants.
Collapse
Affiliation(s)
- Burcu F. Darst
- University of Wisconsin, Madison, WI USA
- Department of Population Health Sciences, University of Wisconsin School of Medicine and Public Health, Madison, WI USA
| | - Corinne D. Engelman
- University of Wisconsin, Madison, WI USA
- Department of Population Health Sciences, University of Wisconsin School of Medicine and Public Health, Madison, WI USA
| |
Collapse
|
95
|
Cogni R, Cao C, Day JP, Bridson C, Jiggins FM. The genetic architecture of resistance to virus infection in Drosophila. Mol Ecol 2016; 25:5228-5241. [PMID: 27460507 PMCID: PMC5082504 DOI: 10.1111/mec.13769] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2015] [Revised: 07/03/2016] [Accepted: 07/05/2016] [Indexed: 12/18/2022]
Abstract
Variation in susceptibility to infection has a substantial genetic component in natural populations, and it has been argued that selection by pathogens may result in it having a simpler genetic architecture than many other quantitative traits. This is important as models of host-pathogen co-evolution typically assume resistance is controlled by a small number of genes. Using the Drosophila melanogaster multiparent advanced intercross, we investigated the genetic architecture of resistance to two naturally occurring viruses, the sigma virus and DCV (Drosophila C virus). We found extensive genetic variation in resistance to both viruses. For DCV resistance, this variation is largely caused by two major-effect loci. Sigma virus resistance involves more genes - we mapped five loci, and together these explained less than half the genetic variance. Nonetheless, several of these had a large effect on resistance. Models of co-evolution typically assume strong epistatic interactions between polymorphisms controlling resistance, but we were only able to detect one locus that altered the effect of the main effect loci we had mapped. Most of the loci we mapped were probably at an intermediate frequency in natural populations. Overall, our results are consistent with major-effect genes commonly affecting susceptibility to infectious diseases, with DCV resistance being a near-Mendelian trait.
Collapse
Affiliation(s)
- Rodrigo Cogni
- Department of Genetics, University of Cambridge, Cambridge, CB2 3EH, UK.
- Department of Ecology, University of São Paulo, São Paulo, 05508-900, Brazil.
| | - Chuan Cao
- Department of Genetics, University of Cambridge, Cambridge, CB2 3EH, UK
| | - Jonathan P Day
- Department of Genetics, University of Cambridge, Cambridge, CB2 3EH, UK
| | - Calum Bridson
- Department of Genetics, University of Cambridge, Cambridge, CB2 3EH, UK
| | - Francis M Jiggins
- Department of Genetics, University of Cambridge, Cambridge, CB2 3EH, UK
| |
Collapse
|
96
|
Jiang W, Xue JH, Yu W. What is the probability of replicating a statistically significant association in genome-wide association studies? Brief Bioinform 2016; 18:928-939. [DOI: 10.1093/bib/bbw091] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2016] [Indexed: 11/14/2022] Open
|
97
|
Johnston I, Hancock T, Mamitsuka H, Carvalho L. Gene-proximity models for genome-wide association studies. Ann Appl Stat 2016. [DOI: 10.1214/16-aoas907] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
98
|
Lee J, Kim YJ, Lee J, Kim BJ, Lee S, Park T. Gene-set association tests for next-generation sequencing data. Bioinformatics 2016; 32:i611-i619. [PMID: 27587681 PMCID: PMC5013913 DOI: 10.1093/bioinformatics/btw429] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
MOTIVATION Recently, many methods have been developed for conducting rare-variant association studies for sequencing data. These methods have primarily been based on gene-level associations but have not been proven to be as effective as expected. Gene-set-level tests have shown great advantages over gene-level tests in terms of power and robustness, because complex diseases are often caused by multiple genes that comprise of biological gene sets. RESULTS Here, we propose several novel gene-set tests that employ rapid and efficient dimensionality reduction. The performance of these tests was investigated using extensive simulations and application to 1058 whole-exome sequences from a Korean population. We identified some known pathways and novel pathways whose rare or common variants are associated with elevated liver enzymes and replicated the results in an independent cohort. AVAILABILITY AND IMPLEMENTATION Source R code for our algorithm is freely available at http://statgen.snu.ac.kr/software/QTest CONTACT tspark@stats.snu.ac.kr SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Jaehoon Lee
- Department of Statistics, Seoul National University, Seoul 151-742, Korea
| | - Young Jin Kim
- Division of Structural and Functional Genomics, Korean National Institute of Health, Osong, Chungchungbuk-Do 363-951, Korea
| | - Juyoung Lee
- Division of Structural and Functional Genomics, Korean National Institute of Health, Osong, Chungchungbuk-Do 363-951, Korea
| | - Bong-Jo Kim
- Division of Structural and Functional Genomics, Korean National Institute of Health, Osong, Chungchungbuk-Do 363-951, Korea
| | - Seungyeoun Lee
- Department of Mathematics and Statistics, Sejong University, Seoul 143-747, Korea
| | - Taesung Park
- Department of Statistics, Seoul National University, Seoul 151-742, Korea
| |
Collapse
|
99
|
Larson NB, McDonnell S, Albright LC, Teerlink C, Stanford J, Ostrander EA, Isaacs WB, Xu J, Cooney KA, Lange E, Schleutker J, Carpten JD, Powell I, Bailey-Wilson J, Cussenot O, Cancel-Tassin G, Giles G, MacInnis R, Maier C, Whittemore AS, Hsieh CL, Wiklund F, Catolona WJ, Foulkes W, Mandal D, Eeles R, Kote-Jarai Z, Ackerman MJ, Olson TM, Klein CJ, Thibodeau SN, Schaid DJ. Post hoc Analysis for Detecting Individual Rare Variant Risk Associations Using Probit Regression Bayesian Variable Selection Methods in Case-Control Sequencing Studies. Genet Epidemiol 2016; 40:461-9. [PMID: 27312771 PMCID: PMC5063501 DOI: 10.1002/gepi.21983] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2015] [Revised: 04/22/2016] [Accepted: 04/27/2016] [Indexed: 12/27/2022]
Abstract
Rare variants (RVs) have been shown to be significant contributors to complex disease risk. By definition, these variants have very low minor allele frequencies and traditional single-marker methods for statistical analysis are underpowered for typical sequencing study sample sizes. Multimarker burden-type approaches attempt to identify aggregation of RVs across case-control status by analyzing relatively small partitions of the genome, such as genes. However, it is generally the case that the aggregative measure would be a mixture of causal and neutral variants, and these omnibus tests do not directly provide any indication of which RVs may be driving a given association. Recently, Bayesian variable selection approaches have been proposed to identify RV associations from a large set of RVs under consideration. Although these approaches have been shown to be powerful at detecting associations at the RV level, there are often computational limitations on the total quantity of RVs under consideration and compromises are necessary for large-scale application. Here, we propose a computationally efficient alternative formulation of this method using a probit regression approach specifically capable of simultaneously analyzing hundreds to thousands of RVs. We evaluate our approach to detect causal variation on simulated data and examine sensitivity and specificity in instances of high RV dimensionality as well as apply it to pathway-level RV analysis results from a prostate cancer (PC) risk case-control sequencing study. Finally, we discuss potential extensions and future directions of this work.
Collapse
Affiliation(s)
- Nicholas B. Larson
- Division of Biomedical Statistics and Informatics, Department of Health Sciences Research, Mayo Clinic, Rochester, MN
| | - Shannon McDonnell
- Division of Biomedical Statistics and Informatics, Department of Health Sciences Research, Mayo Clinic, Rochester, MN
| | - Lisa Cannon Albright
- Dept. Internal Medicine, University of Utah School of Medicine, Salt Lake City, UT
| | - Craig Teerlink
- Dept. Internal Medicine, University of Utah School of Medicine, Salt Lake City, UT
| | | | | | | | - Jianfeng Xu
- NorthShore University Health System Research Institute, Chicago, IL
| | - Kathleen A. Cooney
- Depts. of Internal Medicine and Urology, University of Michigan Medical School, Ann Arbor, MI
| | - Ethan Lange
- Dept. of Genetics, University of North Carolina, Chapel Hill, NC
| | - Johanna Schleutker
- Dept. of Medical Biochemistry and Genetics, Institute of Biomedicine, University of Turku, Finland
| | - John D. Carpten
- Integrated Cancer Genomics Division, The Translational Genomics Research Institute, Phoenix, AZ
| | | | - Joan Bailey-Wilson
- Statistical Genetics Section, National Human Genome Research Institute, Bethesda, MD
| | | | | | - Graham Giles
- Cancer Epidemiology Centre, Cancer Council Victoria, and Centre for Epidemiology and Biostatistics, School of Population and Global Health, University of Melbourne, Melbourne, Australia
| | - Robert MacInnis
- Cancer Epidemiology Centre, Cancer Council Victoria, and Centre for Epidemiology and Biostatistics, School of Population and Global Health, University of Melbourne, Melbourne, Australia
| | | | | | - Chih-Lin Hsieh
- Dept. of Urology, University of Southern California, Los Angeles, CA
| | - Fredrik Wiklund
- Dept. of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
| | | | - William Foulkes
- Depts. Of Oncology and Human Genetics, Montreal General Hospital, Montreal QC, Canada
| | - Diptasri Mandal
- Dept. of Genetics, LSU Health Sciences Center, New Orleans, LA
| | - Rosalind Eeles
- Genetics and Epidemiology, Institute of Cancer Research, Sutton Surrey, UK
| | - Zsofia Kote-Jarai
- Genetics and Epidemiology, Institute of Cancer Research, Sutton Surrey, UK
| | | | - Timothy M. Olson
- Dept. of Pediatric and Adolescent Medicine, Mayo Clinic, Rochester, MN
| | | | | | - Daniel J. Schaid
- Division of Biomedical Statistics and Informatics, Department of Health Sciences Research, Mayo Clinic, Rochester, MN
| |
Collapse
|
100
|
POSTULA MAREK, JANICKI PIOTRKAZIMIERZ, ROSIAK MAREK, EYILETEN CEREN, ZAREMBA MAŁGORZATA, KAPLON-CIESLICKA AGNIESZKA, SUGINO SHIGEKAZU, KOSIOR DARIUSZARTUR, OPOLSKI GRZEGORZ, FILIPIAK KRZYSZTOFJERZY, MIROWSKA-GUZEL DAGMARA. Targeted deep resequencing of ALOX5 and ALOX5AP in patients with diabetes and association of rare variants with leukotriene pathways. Exp Ther Med 2016; 12:415-421. [PMID: 27347071 PMCID: PMC4906979 DOI: 10.3892/etm.2016.3334] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2015] [Accepted: 02/11/2016] [Indexed: 02/07/2023] Open
Abstract
The aim of the present study was to investigate a possible association between the accumulation of rare coding variants in the genes for arachidonate 5-lipoxygenase (ALOX5) and ALOX5-activating protein (ALOX5AP), and corresponding production of leukotrienes (LTs) in patients with type 2 diabetes mellitus (T2DM) receiving acetylsalicylic therapy. Twenty exons and corresponding introns of the selected genes were resequenced in 303 DNA samples from patients with T2DM using pooled polymerase chain reaction amplification and next-generation sequencing, using an Illumina HiSeq 2000 sequencing system. The observed non-synonymous variants were further confirmed by individual genotyping of DNA samples comprising of all individuals from the original discovery pools. The association between the investigated phenotypes was based on LTB4 and LTE4 concentrations, and the accumulation of rare missense variants (genetic burden) in investigated genes was evaluated using statistical collapsing tests. A total of 10 exonic variants were identified for each resequenced gene, including 5 missense and 5 synonymous variants. The rare missense variants did not exhibit statistically significant differences in the accumulation pattern between the patients with low and high LTs concentrations. As the present study only included patients with T2DM, it is unclear whether the absence of observed association between the accumulation of rare missense variants in investigated genes and LT production is associated with diabetic populations only or may also be applied to other populations.
Collapse
Affiliation(s)
- MAREK POSTULA
- Department of Experimental and Clinical Pharmacology, Medical University of Warsaw, Center for Preclinical Research and Technology CEPT, Warsaw 02-097, Poland
- Perioperative Genomics Laboratory, Penn State University, College of Medicine, Hershey, PA 17033, USA
| | - PIOTR KAZIMIERZ JANICKI
- Perioperative Genomics Laboratory, Penn State University, College of Medicine, Hershey, PA 17033, USA
| | - MAREK ROSIAK
- Department of Experimental and Clinical Pharmacology, Medical University of Warsaw, Center for Preclinical Research and Technology CEPT, Warsaw 02-097, Poland
- Department of Cardiology and Hypertension, Central Clinical Hospital, The Ministry of the Interior, Warsaw 02-507, Poland
| | - CEREN EYILETEN
- Department of Experimental and Clinical Pharmacology, Medical University of Warsaw, Center for Preclinical Research and Technology CEPT, Warsaw 02-097, Poland
| | - MAŁGORZATA ZAREMBA
- Department of Experimental and Clinical Pharmacology, Medical University of Warsaw, Center for Preclinical Research and Technology CEPT, Warsaw 02-097, Poland
| | | | - SHIGEKAZU SUGINO
- Perioperative Genomics Laboratory, Penn State University, College of Medicine, Hershey, PA 17033, USA
| | - DARIUSZ ARTUR KOSIOR
- Department of Cardiology and Hypertension, Central Clinical Hospital, The Ministry of the Interior, Warsaw 02-507, Poland
- Department of Applied Physiology, Mossakowski Medical Research Centre, Polish Academy of Sciences, Warsaw 02-106, Poland
| | - GRZEGORZ OPOLSKI
- Department of Cardiology, Medical University of Warsaw, Warsaw 02-091, Poland
| | | | - DAGMARA MIROWSKA-GUZEL
- Department of Experimental and Clinical Pharmacology, Medical University of Warsaw, Center for Preclinical Research and Technology CEPT, Warsaw 02-097, Poland
| |
Collapse
|