1
|
de Andrade M, Warwick Daw E, Kraja AT, Fisher V, Wang L, Hu K, Li J, Romanescu R, Veenstra J, Sun R, Weng H, Zhou W. The challenge of detecting genotype-by-methylation interaction: GAW20. BMC Genet 2018; 19:81. [PMID: 30255819 PMCID: PMC6157121 DOI: 10.1186/s12863-018-0650-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
BACKGROUND GAW20 working group 5 brought together researchers who contributed 7 papers with the aim of evaluating methods to detect genetic by epigenetic interactions. GAW20 distributed real data from the Genetics of Lipid Lowering Drugs and Diet Network (GOLDN) study, including single-nucleotide polymorphism (SNP) markers, methylation (cytosine-phosphate-guanine [CpG]) markers, and phenotype information on up to 995 individuals. In addition, a simulated data set based on the real data was provided. RESULTS The 7 contributed papers analyzed these data sets with a number of different statistical methods, including generalized linear mixed models, mediation analysis, machine learning, W-test, and sparsity-inducing regularized regression. These methods generally appeared to perform well. Several papers confirmed a number of causative SNPs in either the large number of simulation sets or the real data on chromosome 11. Findings were also reported for different SNPs, CpG sites, and SNP-CpG site interaction pairs. CONCLUSIONS In the simulation (200 replications), power appeared generally good for large interaction effects, but smaller effects will require larger studies or consortium collaboration for realizing a sufficient power.
Collapse
Affiliation(s)
- Mariza de Andrade
- Division of Biomedical Statistics and Informatics, Department of Health Sciences Research, Mayo Clinic, 200 First St. SW, Rochester, MN 55905 USA
| | - E. Warwick Daw
- Division of Statistical Genomics, Department of Genetics, Center for Genome Sciences and Systems Biology, Washington University School of Medicine, 660 Euclid Ave, Saint Louis, MO 63110 USA
| | - Aldi T. Kraja
- Division of Statistical Genomics, Department of Genetics, Center for Genome Sciences and Systems Biology, Washington University School of Medicine, 660 Euclid Ave, Saint Louis, MO 63110 USA
| | - Virginia Fisher
- Department of Biostatistics, Boston University School of Public Health, Boston, 715 Albany St, Boston, MA 02118 USA
| | - Lan Wang
- Department of Biostatistics, Boston University School of Public Health, Boston, 715 Albany St, Boston, MA 02118 USA
| | - Ke Hu
- Department of Electrical Engineering and Computer Science, Case Western Reserve University, 10900 Euclid Ave, Cleveland, OH 44106 USA
| | - Jing Li
- Department of Electrical Engineering and Computer Science, Case Western Reserve University, 10900 Euclid Ave, Cleveland, OH 44106 USA
| | - Razvan Romanescu
- Lunenfeld-Tanenbaum Research Institute, Sinai Health System, University of Toronto, 600 University Ave, Toronto, ON M5G 1X5 Canada
| | - Jenna Veenstra
- Department of Biology, Dordt College, 498 4th Ave. NE, Sioux Center, IA 51250 USA
- Department of Mathematics and Statistics, Dordt College, 498 4th Ave. NE, Sioux Center, IA 51250 USA
| | - Rui Sun
- Division of Biostatistics, Centre for Clinical Research and Biostatistics, JC School of Public Health and Primary Care, the Chinese University of Hong Kong, Shatin, N.T, Hong Kong, SAR China
- CUHK Shenzhen Research Institute, Shenzhen, China
| | - Haoyi Weng
- Division of Biostatistics, Centre for Clinical Research and Biostatistics, JC School of Public Health and Primary Care, the Chinese University of Hong Kong, Shatin, N.T, Hong Kong, SAR China
- CUHK Shenzhen Research Institute, Shenzhen, China
| | - Wenda Zhou
- Department of Statistics, Columbia University, 1255 Amsterdam Avenue, New York, NY 10027 USA
| |
Collapse
|
2
|
Auerbach J, Howey R, Jiang L, Justice A, Li L, Oualkacha K, Sayols-Baixeras S, Aslibekyan SW. Causal modeling in a multi-omic setting: insights from GAW20. BMC Genet 2018; 19:74. [PMID: 30255779 PMCID: PMC6157026 DOI: 10.1186/s12863-018-0645-4] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
BACKGROUND Increasingly available multilayered omics data on large populations has opened exciting analytic opportunities and posed unique challenges to robust estimation of causal effects in the setting of complex disease phenotypes. The GAW20 Causal Modeling Working Group has applied complementary approaches (eg, Mendelian randomization, structural equations modeling, Bayesian networks) to discover novel causal effects of genomic and epigenomic variation on lipid phenotypes, as well as to validate prior findings from observational studies. RESULTS Two Mendelian randomization studies have applied novel approaches to instrumental variable selection in methylation data, identifying bidirectional causal effects of CPT1A and triglycerides, as well as of RNMT and C6orf42, on high-density lipoprotein cholesterol response to fenofibrate. The CPT1A finding also emerged in a Bayesian network study. The Mendelian randomization studies have implemented both existing and novel steps to account for pleiotropic effects, which were independently detected in the GAW20 data via a structural equation modeling approach. Two studies estimated indirect effects of genomic variation (via DNA methylation and/or correlated phenotypes) on lipid outcomes of interest. Finally, a novel weighted R2 measure was proposed to complement other causal inference efforts by controlling for the influence of outlying observations. CONCLUSIONS The GAW20 contributions illustrate the diversity of possible approaches to causal inference in the multi-omic context, highlighting the promises and assumptions of each method and the benefits of integrating both across methods and across omics layers for the most robust and comprehensive insights into disease processes.
Collapse
Affiliation(s)
- Jonathan Auerbach
- Department of Statistics, Columbia University, 1255 Amsterdam Ave, New York, NY 10027 USA
| | - Richard Howey
- Institute of Genetic Medicine, Newcastle University, Central Parkway, Newcastle-upon-Tyne, NE1 3BZ UK
| | - Lai Jiang
- Department of Epidemiology, Biostatistics and Occupational Health, McGill University, 1020 Pine Avenue West, Montréal, Quebec, H3A 1A2 Canada
| | - Anne Justice
- Biomedical and Translational Informatics, Geisinger Health, 100 North Academy Ave, Danville, PA 17822 USA
| | - Liming Li
- State Key Laboratory of Genetic Engineering, Institute of Biostatistics, School of Life Sciences, Fudan University, 2005 Songhu Road, Shanghai, 200438 China
| | - Karim Oualkacha
- Département de Mathématiques, Université du Québec à Montréal, 2920 Chemin de la Tour, Montréal, Quebec, H3T 1 J4 Canada
| | - Sergi Sayols-Baixeras
- Cardiovascular Epidemiology and Genetics Research Group, IMIM (Hospital del Mar Medical Research Institute); Universitat Pompeu Fabra; CIBER Cardiovascular Diseases (CIBERCV), 08003 Barcelona, Catalonia Spain
| | - Stella W. Aslibekyan
- Department of Epidemiology, University of Alabama at Birmingham, 1665 University Blvd, RPHB 230J, Birmingham, AL 35294 USA
| |
Collapse
|