1
|
Kim Y, Chi YY, Zou F. An efficient integrative resampling method for gene-trait association analysis. Genet Epidemiol 2019; 44:197-207. [PMID: 31820489 DOI: 10.1002/gepi.22271] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2019] [Revised: 10/27/2019] [Accepted: 11/25/2019] [Indexed: 11/07/2022]
Abstract
Genetic association studies are popular for identifying genetic variants, such as single nucleotide polymorphisms (SNPs), that are associated with complex traits. Statistical tests are commonly performed one SNP at a time with an assumed mode of inheritance such as recessive, additive, or dominant genetic model. Such analysis can result in inadequate power when the employed model deviates from the underlying true genetic model. We propose an integrative association test procedure under a generalized linear model framework to flexibly model the data from the above three common genetic models and beyond. A computationally efficient resampling procedure is adopted to estimate the null distribution of the proposed test statistic. Simulation results show that our methods maintain the Type I error rate irrespective of the existence of confounding covariates and achieve adequate power compared to the methods with the true genetic model. The new methods are applied to two genetic studies on the resistance of severe malaria and sarcoidosis.
Collapse
Affiliation(s)
- Yeonil Kim
- Early Development Statistics, Merck & Co., Inc., Rahway, New Jersey
| | - Yueh-Yun Chi
- Department of Biostatistics, University of Florida, Gainesville, Florida
| | - Fei Zou
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina
| |
Collapse
|
2
|
Lai YC, Kao CF, Lu ML, Chen HC, Chen PY, Chen CH, Shen WW, Wu JY, Lu RB, Kuo PH. Investigation of associations between NR1D1, RORA and RORB genes and bipolar disorder. PLoS One 2015; 10:e0121245. [PMID: 25789810 PMCID: PMC4366256 DOI: 10.1371/journal.pone.0121245] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2014] [Accepted: 02/12/2015] [Indexed: 11/18/2022] Open
Abstract
Several genes that are involved in the regulation of circadian rhythms are implicated in the susceptibility to bipolar disorder (BD). The current study aimed to investigate the relationships between genetic variants in NR1D1 RORA, and RORB genes and BD in the Han Chinese population. We conducted a case-control genetic association study with two samples of BD patients and healthy controls. Sample I consisted of 280 BD patients and 200 controls. Sample II consisted of 448 BD patients and 1770 healthy controls. 27 single nucleotide polymorphisms in the NR1D1, RORA, and RORB genes were genotyped using GoldenGate VeraCode assays in sample I, and 492 markers in the three genes were genotyped using Affymetrix Genome-Wide CHB Array in sample II. Single marker and gene-based association analyses were performed using PLINK. A combined p-value for the joining effects of all markers within a gene was calculated using the rank truncated product method. Multifactor dimensionality reduction (MDR) method was also applied to test gene-gene interactions in sample I. All markers were in Hardy-Weinberg equilibrium (P>0.001). In sample I, the associations with BD were observed for rs4774388 in RORA (OR = 1.53, empirical p-value, P = 0.024), and rs1327836 in RORB (OR = 1.75, P = 0.003). In Sample II, there were 45 SNPs showed associations with BD, and the most significant marker in RORA was rs11639084 (OR = 0.69, P = 0.002), and in RORB was rs17611535 (OR = 3.15, P = 0.027). A combined p-value of 1.6×10−6, 0.7, and 1.0 was obtained for RORA, RORB and NR1D1, respectively, indicting a strong association for RORA with the risk of developing BD. A four way interaction was found among markers in NR1D1, RORA, and RORB with the testing accuracy 53.25% and a cross-validation consistency of 8 out of 10. In sample II, 45 markers had empirical p-values less than 0.05. The most significant markers in RORA and RORB genes were rs11639084 (OR = 0.69, P = 0.002), and rs17611535 (OR = 3.15, P = 0.027), respectively. Gene-based association was significant for RORA gene (P = 0.0007). Our results support for the involvement of RORs genes in the risk of developing BD. Investigation of the functional properties of genes in the circadian pathway may further enhance our understanding about the pathogenesis of bipolar illness.
Collapse
Affiliation(s)
- Yin-Chieh Lai
- Department of Public Health & Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan
| | - Chung-Feng Kao
- Department of Public Health & Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan
| | - Mong-Liang Lu
- Department of Psychiatry, Taipei Medical University-Wan Fang Medical Center, Taipei, Taiwan
| | - Hsi-Chung Chen
- Department of Psychiatry & Center of Sleep Disorders, National Taiwan University Hospital, Taipei, Taiwan
| | - Po-Yu Chen
- Department of Psychiatry, Taipei City Psychiatric Center, Taipei City Hospital, Taipei, Taiwan
| | - Chien-Hsiun Chen
- Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan
| | - Winston W Shen
- Department of Psychiatry, Taipei Medical University-Wan Fang Medical Center, Taipei, Taiwan
| | - Jer-Yuarn Wu
- Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan
| | - Ru-Band Lu
- Department of Psychiatry, National Cheng Kung University Hospital, Tainan, Taiwan
| | - Po-Hsiu Kuo
- Department of Public Health & Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan; Research Center for Genes, Environment and Human Health, National Taiwan University, Taipei, Taiwan
| |
Collapse
|
3
|
Loley C, König IR, Hothorn L, Ziegler A. A unifying framework for robust association testing, estimation, and genetic model selection using the generalized linear model. Eur J Hum Genet 2013; 21:1442-8. [PMID: 23572026 DOI: 10.1038/ejhg.2013.62] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2012] [Revised: 02/08/2013] [Accepted: 03/07/2013] [Indexed: 11/09/2022] Open
Abstract
The analysis of genome-wide genetic association studies generally starts with univariate statistical tests of each single-nucleotide polymorphism. The standard approach is the Cochran-Armitage trend test or its logistic regression equivalent although this approach can lose considerable power if the underlying genetic model is not additive. An alternative is the MAX test, which is robust against the three basic modes of inheritance. Here, the asymptotic distribution of the MAX test is derived using the generalized linear model together with the Delta method and multiple contrasts. The approach is applicable to binary, quantitative, and survival traits. It may be used for unrelated individuals, family-based studies, and matched pairs. The approach provides point and interval effect estimates and allows selecting the most plausible genetic model using the minimum P-value. R code is provided. A Monte-Carlo simulation study shows that the asymptotic MAX test framework meets type I error levels well, has good power, and good model selection properties for minor allele frequencies ≥0.3. Pearson's χ(2)-test is superior for lower minor allele frequencies with low frequencies for the rare homozygous genotype. In these cases, the model selection procedure should be used with caution. The use of the MAX test is illustrated by reanalyzing findings from seven genome-wide association studies including case-control, matched pairs, and quantitative trait data.
Collapse
Affiliation(s)
- Christina Loley
- 1] Institut für Medizinische Biometrie und Statistik, Universität zu Lübeck, Universitätsklinikum Schleswig-Holstein, Campus Lübeck, Lübeck, Germany [2] Medizinische Klinik II, Universitätsklinikum Schleswig-Holstein, Campus Lübeck, Lübeck, Germany
| | | | | | | |
Collapse
|
4
|
Joo J, Kwak M, Chen Z, Zheng G. Efficiency robust statistics for genetic linkage and association studies under genetic model uncertainty. Stat Med 2010; 29:158-80. [PMID: 19918942 DOI: 10.1002/sim.3759] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
When testing genetic linkage and association, test statistics that follow a normal or Chi-square distributions are often used. These statistics are usually derived under a specific mode of inheritance (genetic model). Common genetic models include, but not limited to, the recessive, additive, multiplicative, and dominant models. For many diseases, their underlying genetic models are often unknown. Instead, a family of scientifically plausible genetic models may be available, which includes the four commonly used models. Hence, the optimal test is not available. Employing a single test statistic which is optimal for one model may suffer from substantial loss of power when the model is misspecified. In this situation efficient robust tests are useful. In this tutorial, we first review several commonly used robust statistics, including maximum efficiency robust tests, maximal tests, and constrained likelihood ratio tests for three common designs in genetic studies: (i) linkage analysis using affected sib-pairs, (ii) association studies using parents-offspring trios, and (iii) case-control association studies (unmatched and matched). Codes in the R statistical language for applying these robust statistics to test for linkage and association are presented with examples. We also provide some comparisons of the performance of the various robust tests via simulation studies. Guidelines for applications are also given for each study design. Finally, applications of robust tests to genome-wide association studies and meta-analysis are discussed.
Collapse
Affiliation(s)
- Jungnam Joo
- Office of Biostatistics Research, National Heart, Lung and Blood Institute, Bethesda, MD 20892, USA
| | | | | | | |
Collapse
|
6
|
Wang K, Sheffield VC. A constrained-likelihood approach to marker-trait association studies. Am J Hum Genet 2005; 77:768-80. [PMID: 16252237 PMCID: PMC1271386 DOI: 10.1086/497434] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2005] [Accepted: 08/24/2005] [Indexed: 11/03/2022] Open
Abstract
Marker-trait association analysis is an important statistical tool for detecting DNA variants responsible for genetic traits. In such analyses, an analysis model of the mean genetic effects of the genotypes is often specified. For instance, the effect of the disease allele on the trait is often specified to be dominant, recessive, additive, or multiplicative. Although this model-based approach is powerful when the analysis model is correctly specified, it has been found to have low power sometimes when the specified model is incorrect. We introduce an approach that does not require the specification of a particular genetic model. This approach is built upon a constrained maximum likelihood in which the mean genetic effect of the heterozygous genotype is required to not exceed those of the two homozygous genotypes. The asymptotic distribution of the likelihood-ratio statistic is derived for two special cases. A simulation study suggests that this new approach has power comparable to that of the model-based method when the analysis model is correctly specified. This approach uses one marker at a time (i.e., it is a single-marker analysis). However, given the latest findings that powerful inferential procedures for haplotype analyses can be constructed from single-marker analyses, we expect this approach to be useful for haplotype analyses.
Collapse
Affiliation(s)
- Kai Wang
- Program in Public Health Genetics, College of Public Health, University of Iowa, Iowa City, IA, 52242, USA.
| | | |
Collapse
|
7
|
Zheng G, Chen Z. Comparison of maximum statistics for hypothesis testing when a nuisance parameter is present only under the alternative. Biometrics 2005; 61:254-8. [PMID: 15737101 DOI: 10.1111/j.0006-341x.2005.030531.x] [Citation(s) in RCA: 128] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
In many practical problems, a hypothesis testing involves a nuisance parameter which appears only under the alternative hypothesis. Davies (1977, Biometrika 64, 247-254) proposed the maximum of the score statistics over the whole range of the nuisance parameter as a test statistic for this type of hypothesis testing. Freidlin, Podgor, and Gastwirth (1999, Biometrics 55, 883-886) studied two other simpler maximum test statistics, the maximum of the score statistics at two extreme points of the nuisance parameter, and the maximum of the score statistics at three points of the nuisance parameter including the two extreme points. In this article, we compare the powers of these three maximum-type statistics in the context of three genetic problems.
Collapse
Affiliation(s)
- Gang Zheng
- Office of Biostatistics Research, DECA, National Heart, Lung and Blood Institute, 6701 Rockledge Drive, MSC 7938, Bethesda, Maryland 20892, USA.
| | | |
Collapse
|
8
|
Zheng G, Chen Z. Robust Tests for Candidate-Gene Association Using Case-Parents Trios with a Multi-Allelic Marker. Biom J 2004. [DOI: 10.1002/bimj.200410057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|