1
|
Lou XY, Hou TT, Liu SY, Xu HM, Lin F, Tang X, MacLeod SL, Cleves MA, Hobbs CA. Innovative approach to identify multigenomic and environmental interactions associated with birth defects in family-based hybrid designs. Genet Epidemiol 2021; 45:171-189. [PMID: 32996630 PMCID: PMC8495752 DOI: 10.1002/gepi.22363] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2020] [Revised: 09/08/2020] [Accepted: 09/11/2020] [Indexed: 11/09/2022]
Abstract
Genes, including those with transgenerational effects, work in concert with behavioral, environmental, and social factors via complex biological networks to determine human health. Understanding complex relationships between causal factors underlying human health is an essential step towards deciphering biological mechanisms. We propose a new analytical framework to investigate the interactions between maternal and offspring genetic variants or their surrogate single nucleotide polymorphisms (SNPs) and environmental factors using family-based hybrid study design. The proposed approach can analyze diverse genetic and environmental factors and accommodate samples from a variety of family units, including case/control-parental triads, and case/control-parental dyads, while minimizing potential bias introduced by population admixture. Comprehensive simulations demonstrated that our innovative approach outperformed the log-linear approach, the best available method for case-control family data. The proposed approach had greater statistical power and was capable to unbiasedly estimate the maternal and child genetic effects and the effects of environmental factors, while controlling the Type I error rate against population stratification. Using our newly developed approach, we analyzed the associations between maternal and fetal SNPs and obstructive and conotruncal heart defects, with adjustment for demographic and lifestyle factors and dietary supplements. Fourteen and 11 fetal SNPs were associated with obstructive and conotruncal heart defects, respectively. Twenty-seven and 17 maternal SNPs were associated with obstructive and conotruncal heart defects, respectively. In addition, maternal body mass index was a significant risk factor for obstructive defects. The proposed approach is a powerful tool for interrogating the etiological mechanism underlying complex traits.
Collapse
Affiliation(s)
- Xiang-Yang Lou
- Department of Biostatistics, College of Public Health and Health Professions and College of Medicine, University of Florida, Gainesville, Florida, USA
| | - Ting-Ting Hou
- Department of Biostatistics, College of Public Health and Health Professions and College of Medicine, University of Florida, Gainesville, Florida, USA
- Institute of Bioinformatics and Institute of Crop Science, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, China
| | - Shou-Ye Liu
- Institute of Bioinformatics and Institute of Crop Science, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, China
| | - Hai-Ming Xu
- Institute of Bioinformatics and Institute of Crop Science, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, China
| | - Feng Lin
- Institute of Bioinformatics and Institute of Crop Science, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, China
| | - Xinyu Tang
- The US Food and Drug Administration, Silver Spring, Maryland, USA
| | | | - Mario A. Cleves
- Department of Pediatrics, Morsani College of Medicine, Health Informatics Institute, University of South Florida, Tampa, Florida, USA
| | - Charlotte A. Hobbs
- Rady Children’s Institute for Genomic Medicine, San Diego, California, USA
| |
Collapse
|
2
|
Huang LO, Infante-Rivard C, Labbe A. Analysis of Case-Parent Trios Using a Loglinear Model with Adjustment for Transmission Ratio Distortion. Front Genet 2016; 7:155. [PMID: 27630667 PMCID: PMC5005337 DOI: 10.3389/fgene.2016.00155] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2016] [Accepted: 08/16/2016] [Indexed: 01/16/2023] Open
Abstract
Transmission of the two parental alleles to offspring deviating from the Mendelian ratio is termed Transmission Ratio Distortion (TRD), occurs throughout gametic and embryonic development. TRD has been well-studied in animals, but remains largely unknown in humans. The Transmission Disequilibrium Test (TDT) was first proposed to test for association and linkage in case-trios (affected offspring and parents); adjusting for TRD using control-trios was recommended. However, the TDT does not provide risk parameter estimates for different genetic models. A loglinear model was later proposed to provide child and maternal relative risk (RR) estimates of disease, assuming Mendelian transmission. Results from our simulation study showed that case-trios RR estimates using this model are biased in the presence of TRD; power and Type 1 error are compromised. We propose an extended loglinear model adjusting for TRD. Under this extended model, RR estimates, power and Type 1 error are correctly restored. We applied this model to an intrauterine growth restriction dataset, and showed consistent results with a previous approach that adjusted for TRD using control-trios. Our findings suggested the need to adjust for TRD in avoiding spurious results. Documenting TRD in the population is therefore essential for the correct interpretation of genetic association studies.
Collapse
Affiliation(s)
- Lam O. Huang
- Department of Epidemiology, Biostatistics and Occupational Health, McGill UniversityMontréal, QC, Canada
| | - Claire Infante-Rivard
- Department of Epidemiology, Biostatistics and Occupational Health, McGill UniversityMontréal, QC, Canada
| | - Aurélie Labbe
- Department of Epidemiology, Biostatistics and Occupational Health, McGill UniversityMontréal, QC, Canada
- Department of Psychiatry, McGill UniversityMontréal, QC, Canada
- Douglas Mental Health University InstituteMontréal, QC, Canada
| |
Collapse
|
3
|
Li S, Chen J, Guo J, Jing BY, Tsang SY, Xue H. Likelihood Ratio Test for Multi-Sample Mixture Model and Its Application to Genetic Imprinting. J Am Stat Assoc 2015. [DOI: 10.1080/01621459.2014.939272] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
4
|
Abstract
Genomic imprinting is a genetic phenomenon in which certain alleles are differentially expressed in a parent-of-origin-specific manner, and plays an important role in the study of complex traits. For a diallelic marker locus in human, the parentalasymmetry tests Q-PAT(c) with any constant c were developed to detect parent-of-origin effects for quantitative traits. However, these methods can only be applied to deal with nuclear families and thus are not suitable for extended pedigrees. In this study, by making no assumption about the distribution of the quantitative trait, we first propose the pedigree parentalasymmetry tests Q-PPAT(c) with any constant c for quantitative traits to test for parent-of-origin effects based on nuclear families with complete information from general pedigree data, in the presence of association between marker alleles under study and quantitative traits. When there are any genotypes missing in pedigrees, we utilize Monte Carlo (MC) sampling and estimation and develop the Q-MCPPAT(c) statistics to test for parent-of-origin effects. Various simulation studies are conducted to assess the performance of the proposed methods, for different sample sizes, genotype missing rates, degrees of imprinting effects and population models. Simulation results show that the proposed methods control the size well under the null hypothesis of no parent-of-origin effects and Q-PPAT(c) are robust to population stratification. In addition, the power comparison demonstrates that Q-PPAT(c) and Q-MCPPAT(c) for pedigree data are much more powerful than Q-PAT(c) only using two-generation nuclear families selected from extended pedigrees.
Collapse
|
5
|
A powerful association test for qualitative traits incorporating imprinting effects using general pedigree data. J Hum Genet 2014; 60:77-83. [PMID: 25518739 DOI: 10.1038/jhg.2014.109] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2014] [Revised: 11/19/2014] [Accepted: 11/24/2014] [Indexed: 11/08/2022]
Abstract
For qualitative traits and diallelic marker loci, the pedigree disequilibrium test (PDT) based on general pedigrees and its extension (Monte Carlo PDT (MCPDT)) for dealing with missing genotypes are simple and powerful tests for association. There is an increasing interest of incorporating imprinting into association analysis. However, PDT and MCPDT do not take account of the information on imprinting effects in the analysis, which may reduce their test powers when the effects are present. On the other hand, the transmission disequilibrium test with imprinting (TDTI*) combines imprinting into the mapping of association variants. However, TDTI* only accommodates two-generation nuclear families and thus is not suitable for extended pedigrees. In this article, we first extend PDT to incorporate imprinting and propose PDTI for complete pedigrees (no missing genotypes). To fully utilize pedigrees with missing genotypes, we further develop the Monte Carlo PDTI (MCPDTI) statistic based on Monte Carlo sampling and estimation. Both PDTI and MCPDTI are derived in a two-stage framework. Simulation study shows that PDTI and MCPDTI control the size well under the null hypothesis of no association and are more powerful than PDT and TDTI* (based on a sample of nuclear families randomly selecting from pedigrees) when imprinting effects exist.
Collapse
|
6
|
Powerful tests for association on quantitative trait loci incorporating imprinting effects. J Hum Genet 2013; 58:384-90. [PMID: 23552672 DOI: 10.1038/jhg.2013.22] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
Genomic imprinting is an important epigenetic factor in complex traits study, and there has recently been considerable interest in association study for quantitative traits by incorporating imprinting. However, these methods need the assumptions of Hardy-Weinberg equilibrium or only use information from families with one child. In this paper, by taking imprinting into account and making no assumption about the distribution of the quantitative traits, we propose two novel classes of Q-C-TDTI(c) and Q-C-MAX(c) family-based association tests for quantitative traits. The tests flexibly accommodate family data with missing parental genotype and with multiple siblings. Q-C-TDTI(c) is derived from a two-stage analysis, where in the first stage Q-C-PAT(c) is applied to test for imprinting effects and in the second stage we select the most appropriate statistic among three transmission disequilibrium tests for association according to the finding from Q-C-PAT(c). Another proposed Q-C-MAX(c) approach takes the maximum of the three statistics. Compared with the existing alternative methods, the simulation results demonstrate that the two proposed tests are robust to population stratification and have better performance for testing association under various scenarios. Further, the powerful and versatile Q-C-TDTI(c) test is applied to analyze Framingham Heart Study data.
Collapse
|
7
|
Xia F, Zhou JY, Fung WK. A powerful approach for association analysis incorporating imprinting effects. ACTA ACUST UNITED AC 2011; 27:2571-7. [PMID: 21798962 DOI: 10.1093/bioinformatics/btr443] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
MOTIVATION For a diallelic marker locus, the transmission disequilibrium test (TDT) is a simple and powerful design for genetic studies. The TDT was originally proposed for use in families with both parents available (complete nuclear families) and has further been extended to 1-TDT for use in families with only one of the parents available (incomplete nuclear families). Currently, the increasing interest of the influence of parental imprinting on heritability indicates the importance of incorporating imprinting effects into the mapping of association variants. RESULTS In this article, we extend the TDT-type statistics to incorporate imprinting effects and develop a series of new test statistics in a general two-stage framework for association studies. Our test statistics enjoy the nature of family-based designs that need no assumption of Hardy-Weinberg equilibrium. Also, the proposed methods accommodate complete and incomplete nuclear families with one or more affected children. In the simulation study, we verify the validity of the proposed test statistics under various scenarios, and compare the powers of the proposed statistics with some existing test statistics. It is shown that our methods greatly improve the power for detecting association in the presence of imprinting effects. We further demonstrate the advantage of our methods by the application of the proposed test statistics to a rheumatoid arthritis dataset. CONTACT wingfung@hku.hk SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Fan Xia
- Department of Statistics and Actuarial Science, The University of Hong Kong, Hong Kong
| | | | | |
Collapse
|
8
|
Hu YQ, Zhou JY. Inferring Haplotype/Disease Association by Joint Use of Case-Parents Trios and Case-Parent Pairs. Ann Hum Genet 2010; 74:263-74. [PMID: 20529016 DOI: 10.1111/j.1469-1809.2010.00563.x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Yue-Qing Hu
- Institute of Biostatistics, School of Life Sciences, Fudan University, Shanghai 200433, China.
| | | |
Collapse
|
9
|
Cui Y, Li G, Li S, Wu R. Designs for linkage analysis and association studies of complex diseases. Methods Mol Biol 2010; 620:219-242. [PMID: 20652506 DOI: 10.1007/978-1-60761-580-4_6] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]
Abstract
Genetic linkage analysis has been a traditional means for identifying regions of the genome with large genetic effects that contribute to a disease. Following linkage analysis, association studies are widely pursued to fine-tune regions with significant linkage signals. For complex diseases which often involve function of multi-genetic variants each with small or moderate effect, linkage analysis has little power compared to association studies. In this chapter, we give a brief review of design issues related to linkage analysis and association studies with human genetic data. We introduce methods commonly used for linkage and association studies and compared the relative merits of the family-based and population-based association studies. Compared to candidate gene studies, a genomewide blind searching of disease variant is proving to be a more powerful approach. We briefly review the commonly used two-stage designs in genome-wide association studies. As more and more biological evidences indicate the role of genomic imprinting in disease, identifying imprinted genes becomes critically important. Design and analysis in genetic mapping imprinted genes are introduced in this chapter. Recent efforts in integrating gene expression analysis and genetic mapping, termed expression quantitative trait loci (eQTLs) mapping or genetical genomics analysis, offer new prospect in elucidating the genetic architecture of gene expression. Designs in genetical genomics analysis are also covered in this chapter.
Collapse
Affiliation(s)
- Yuehua Cui
- Department of Statistics and Probability, Michigan State University, East Lansing, MI, USA
| | | | | | | |
Collapse
|
10
|
Tiwari HK, Barnholtz-Sloan J, Wineinger N, Padilla MA, Vaughan LK, Allison DB. Review and evaluation of methods correcting for population stratification with a focus on underlying statistical principles. Hum Hered 2008; 66:67-86. [PMID: 18382087 PMCID: PMC2803696 DOI: 10.1159/000119107] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
When two or more populations have been separated by geographic or cultural boundaries for many generations, drift, spontaneous mutations, differential selection pressures and other factors may lead to allele frequency differences among populations. If these 'parental' populations subsequently come together and begin inter-mating, disequilibrium among linked markers may span a greater genetic distance than it typically does among populations under panmixia [see glossary]. This extended disequilibrium can make association studies highly effective and more economical than disequilibrium mapping in panmictic populations since less marker loci are needed to detect regions of the genome that harbor phenotype-influencing loci. However, under some circumstances, this process of intermating (as well as other processes) can produce disequilibrium between pairs of unlinked loci and thus create the possibility of confounding or spurious associations due to this population stratification. Accordingly, researchers are advised to employ valid statistical tests for linkage disequilibrium mapping allowing conduct of genetic association studies that control for such confounding. Many recent papers have addressed this need. We provide a comprehensive review of advances made in recent years in correcting for population stratification and then evaluate and synthesize these methods based on statistical principles such as (1) randomization, (2) conditioning on sufficient statistics, and (3) identifying whether the method is based on testing the genotype-phenotype covariance (conditional upon familial information) and/or testing departures of the marginal distribution from the expected genotypic frequencies.
Collapse
Affiliation(s)
- Hemant K Tiwari
- Department of Biostatistics, Section on Statistical Genetics, University of Alabama at Birmingham, Birmingham, AL 35294, USA.
| | | | | | | | | | | |
Collapse
|
11
|
|