1
|
Di Scipio M, Khan M, Mao S, Chong M, Judge C, Pathan N, Perrot N, Nelson W, Lali R, Di S, Morton R, Petch J, Paré G. A versatile, fast and unbiased method for estimation of gene-by-environment interaction effects on biobank-scale datasets. Nat Commun 2023; 14:5196. [PMID: 37626057 PMCID: PMC10457310 DOI: 10.1038/s41467-023-40913-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2021] [Accepted: 08/16/2023] [Indexed: 08/27/2023] Open
Abstract
Identification of gene-by-environment interactions (GxE) is crucial to understand the interplay of environmental effects on complex traits. However, current methods evaluating GxE on biobank-scale datasets have limitations. We introduce MonsterLM, a multiple linear regression method that does not rely on model specification and provides unbiased estimates of variance explained by GxE. We demonstrate robustness of MonsterLM through comprehensive genome-wide simulations using real genetic data from 325,989 individuals. We estimate GxE using waist-to-hip-ratio, smoking, and exercise as the environmental variables on 13 outcomes (N = 297,529-325,989) in the UK Biobank. GxE variance is significant for 8 environment-outcome pairs, ranging from 0.009 - 0.071. The majority of GxE variance involves SNPs without strong marginal or interaction associations. We observe modest improvements in polygenic score prediction when incorporating GxE. Our results imply a significant contribution of GxE to complex trait variance and we show MonsterLM to be well-purposed to handle this with biobank-scale data.
Collapse
Affiliation(s)
- Matteo Di Scipio
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, ON, Canada
- Department of Medicine, Faculty of Health Sciences, McMaster University, Hamilton, ON, Canada
| | - Mohammad Khan
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, ON, Canada
- Department of Medicine, Faculty of Health Sciences, McMaster University, Hamilton, ON, Canada
| | - Shihong Mao
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, ON, Canada
| | - Michael Chong
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, ON, Canada
- Thrombosis and Atherosclerosis Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton, ON, Canada
- Department of Pathology and Molecular Medicine, McMaster University, Michael G. DeGroote School of Medicine, Hamilton, ON, Canada
| | - Conor Judge
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, ON, Canada
| | - Nazia Pathan
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, ON, Canada
- Department of Medicine, Faculty of Health Sciences, McMaster University, Hamilton, ON, Canada
| | - Nicolas Perrot
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, ON, Canada
| | - Walter Nelson
- Centre for Data Science and Digital Health, Hamilton Health Sciences, Hamilton, ON, Canada
- Department of Statistical Sciences, University of Toronto, Toronto, ON, Canada
| | - Ricky Lali
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, ON, Canada
- Department of Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, ON, Canada
| | - Shuang Di
- Centre for Data Science and Digital Health, Hamilton Health Sciences, Hamilton, ON, Canada
- Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
| | - Robert Morton
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, ON, Canada
- Department of Pathology and Molecular Medicine, McMaster University, Michael G. DeGroote School of Medicine, Hamilton, ON, Canada
| | - Jeremy Petch
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, ON, Canada
- Department of Medicine, Faculty of Health Sciences, McMaster University, Hamilton, ON, Canada
- Centre for Data Science and Digital Health, Hamilton Health Sciences, Hamilton, ON, Canada
- Institute of Health Policy, Management and Evaluation, University of Toronto, Toronto, ON, Canada
| | - Guillaume Paré
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, ON, Canada.
- Thrombosis and Atherosclerosis Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton, ON, Canada.
- Department of Pathology and Molecular Medicine, McMaster University, Michael G. DeGroote School of Medicine, Hamilton, ON, Canada.
- Department of Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, ON, Canada.
| |
Collapse
|
2
|
Dimou N, Kim AE, Flanagan O, Murphy N, Diez-Obrero V, Shcherbina A, Aglago EK, Bouras E, Campbell PT, Casey G, Gallinger S, Gruber SB, Jenkins MA, Lin Y, Moreno V, Ruiz-Narvaez E, Stern MC, Tian Y, Tsilidis KK, Arndt V, Barry EL, Baurley JW, Berndt SI, Bézieau S, Bien SA, Bishop DT, Brenner H, Budiarto A, Carreras-Torres R, Cenggoro TW, Chan AT, Chang-Claude J, Chanock SJ, Chen X, Conti DV, Dampier CH, Devall M, Drew DA, Figueiredo JC, Giles GG, Gsur A, Harrison TA, Hidaka A, Hoffmeister M, Huyghe JR, Jordahl K, Kawaguchi E, Keku TO, Larsson SC, Le Marchand L, Lewinger JP, Li L, Mahesworo B, Morrison J, Newcomb PA, Newton CC, Obon-Santacana M, Ose J, Pai RK, Palmer JR, Papadimitriou N, Pardamean B, Peoples AR, Pharoah PDP, Platz EA, Potter JD, Rennert G, Scacheri PC, Schoen RE, Su YR, Tangen CM, Thibodeau SN, Thomas DC, Ulrich CM, Um CY, van Duijnhoven FJB, Visvanathan K, Vodicka P, Vodickova L, White E, Wolk A, Woods MO, Qu C, Kundaje A, Hsu L, Gauderman WJ, Gunter MJ, Peters U. Probing the diabetes and colorectal cancer relationship using gene - environment interaction analyses. Br J Cancer 2023; 129:511-520. [PMID: 37365285 PMCID: PMC10403521 DOI: 10.1038/s41416-023-02312-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2022] [Revised: 05/10/2023] [Accepted: 06/06/2023] [Indexed: 06/28/2023] Open
Abstract
BACKGROUND Diabetes is an established risk factor for colorectal cancer. However, the mechanisms underlying this relationship still require investigation and it is not known if the association is modified by genetic variants. To address these questions, we undertook a genome-wide gene-environment interaction analysis. METHODS We used data from 3 genetic consortia (CCFR, CORECT, GECCO; 31,318 colorectal cancer cases/41,499 controls) and undertook genome-wide gene-environment interaction analyses with colorectal cancer risk, including interaction tests of genetics(G)xdiabetes (1-degree of freedom; d.f.) and joint testing of Gxdiabetes, G-colorectal cancer association (2-d.f. joint test) and G-diabetes correlation (3-d.f. joint test). RESULTS Based on the joint tests, we found that the association of diabetes with colorectal cancer risk is modified by loci on chromosomes 8q24.11 (rs3802177, SLC30A8 - ORAA: 1.62, 95% CI: 1.34-1.96; ORAG: 1.41, 95% CI: 1.30-1.54; ORGG: 1.22, 95% CI: 1.13-1.31; p-value3-d.f.: 5.46 × 10-11) and 13q14.13 (rs9526201, LRCH1 - ORGG: 2.11, 95% CI: 1.56-2.83; ORGA: 1.52, 95% CI: 1.38-1.68; ORAA: 1.13, 95% CI: 1.06-1.21; p-value2-d.f.: 7.84 × 10-09). DISCUSSION These results suggest that variation in genes related to insulin signaling (SLC30A8) and immune function (LRCH1) may modify the association of diabetes with colorectal cancer risk and provide novel insights into the biology underlying the diabetes and colorectal cancer relationship.
Collapse
Affiliation(s)
- Niki Dimou
- Nutrition and Metabolism Branch, International Agency for Research on Cancer, Lyon, France.
| | - Andre E Kim
- Division of Biostatistics, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Orlagh Flanagan
- Nutrition and Metabolism Branch, International Agency for Research on Cancer, Lyon, France
| | - Neil Murphy
- Nutrition and Metabolism Branch, International Agency for Research on Cancer, Lyon, France
| | - Virginia Diez-Obrero
- Unit of Biomarkers and Susceptibility, Oncology Data Analytics Program, Catalan Institute of Oncology, Barcelona, 08908, Spain
- Colorectal Cancer Group, ONCOBELL Program, Bellvitge Biomedical Research Institute, Barcelona, 08908, Spain
- Consortium for Biomedical Research in Epidemiology and Public Health, Barcelona, 08908, Spain
- Department of Clinical Sciences, Faculty of Medicine, University of Barcelona, Barcelona, 08908, Spain
| | - Anna Shcherbina
- Department of Genetics, Stanford University, Stanford, CA, USA
- Department of Computer Science, Stanford University, Stanford, CA, USA
| | - Elom K Aglago
- School of Public Health, Imperial College London, London, United Kingdom
| | - Emmanouil Bouras
- Department of Hygiene and Epidemiology, University of Ioannina School of Medicine, Ioannina, Greece
| | - Peter T Campbell
- Department of Epidemiology and Population Health, Albert Einstein College of Medicine, Bronx, NY, USA
| | - Graham Casey
- Department of Public Health Sciences, Center for Public Health Genomics, Charlottesville, VA, USA
| | - Steven Gallinger
- Lunenfeld Tanenbaum Research Institute, Mount Sinai Hospital, University of Toronto, Toronto, ON, Canada
| | - Stephen B Gruber
- Center for Precision Medicine, Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, CA, USA
| | - Mark A Jenkins
- Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Melbourne, Victoria, Australia
| | - Yi Lin
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, WA, USA
| | - Victor Moreno
- Department of Clinical Sciences, Faculty of Medicine, University of Barcelona, Barcelona, 08908, Spain
- Oncology Data Analytics Program, Catalan Institute of Oncology-IDIBELL, L'Hospitalet de Llobregat, Barcelona, Spain
- CIBER Epidemiología y Salud Pública (CIBERESP), Madrid, Spain
- ONCOBEL Program, Bellvitge Biomedical Research Institute (IDIBELL), L'Hospitalet de Llobregat, Barcelona, Spain
| | - Edward Ruiz-Narvaez
- Department of Nutritional Sciences, University of Michigan School of Public Health, Ann Arbor, MI, USA
| | - Mariana C Stern
- Department of Population and Public Health Sciences & USC Norris Comprehensive Cancer Center, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Yu Tian
- Division of Cancer Epidemiology, German Cancer Research Center (DKFZ), Heidelberg, Germany
- School of Public Health, Capital Medical University, Beijing, China
| | - Kostas K Tsilidis
- School of Public Health, Imperial College London, London, United Kingdom
- Department of Hygiene and Epidemiology, University of Ioannina School of Medicine, Ioannina, Greece
| | - Volker Arndt
- Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Elizabeth L Barry
- Department of Epidemiology, Geisel School of Medicine at Dartmouth, Hanover, NH, USA
| | - James W Baurley
- Bioinformatics and Data Science Research Center, Bina Nusantara University, Jakarta, Indonesia
- BioRealm LLC, Walnut, CA, USA
| | - Sonja I Berndt
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Stéphane Bézieau
- Nantes Université, CHU Nantes, Service de Génétique médicale, F-44000, Nantes, France
| | - Stephanie A Bien
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, WA, USA
| | - D Timothy Bishop
- Leeds Institute of Cancer and Pathology, University of Leeds, Leeds, UK
| | - Hermann Brenner
- Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), Heidelberg, Germany
- Division of Preventive Oncology, German Cancer Research Center (DKFZ) and National Center for Tumor Diseases (NCT), Heidelberg, Germany
- German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Arif Budiarto
- Bioinformatics and Data Science Research Center, Bina Nusantara University, Jakarta, Indonesia
- Computer Science Department, School of Computer Science, Bina Nusantara University, Jakarta, Indonesia
| | - Robert Carreras-Torres
- Colorectal Cancer Group, ONCOBELL Program, Bellvitge Biomedical Research Institute (IDIBELL), L'Hospitalet de Llobregat, 8908, Barcelona, Spain
| | - Tjeng Wawan Cenggoro
- Bioinformatics and Data Science Research Center, Bina Nusantara University, Jakarta, Indonesia
| | - Andrew T Chan
- Division of Gastroenterology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
- Channing Division of Network Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Clinical and Translational Epidemiology Unit, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Harvard University, Boston, MA, USA
- Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Harvard University, Boston, MA, USA
| | - Jenny Chang-Claude
- Division of Cancer Epidemiology, German Cancer Research Center (DKFZ), Heidelberg, Germany
- University Medical Centre Hamburg-Eppendorf, University Cancer Centre Hamburg (UCCH), Hamburg, Germany
| | - Stephen J Chanock
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Xuechen Chen
- Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), Heidelberg, Germany
- Medical Faculty Heidelberg, Heidelberg University, Heidelberg, Germany
| | - David V Conti
- Division of Biostatistics, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Christopher H Dampier
- Department of Public Health Sciences, Center for Public Health Genomics, Charlottesville, VA, USA
- Department of General Surgery, University of Virginia School of Medicine, Charlottesville, VA, USA
| | - Matthew Devall
- Department of Family Medicine, University of Virginia, Charlottesville, VA, USA
| | - David A Drew
- Clinical & Translational Epidemiology Unit, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
| | - Jane C Figueiredo
- Department of Medicine, Samuel Oschin Comprehensive Cancer Institute, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| | - Graham G Giles
- Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Melbourne, Victoria, Australia
- Cancer Epidemiology Division, Cancer Council Victoria, Melbourne, VIC, Australia
- Precision Medicine, School of Clinical Sciences at Monash Health, Monash University, Clayton, VIC, Australia
| | - Andrea Gsur
- Center for Cancer Research, Medical University of Vienna, Vienna, Austria
| | - Tabitha A Harrison
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, WA, USA
| | - Akihisa Hidaka
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, WA, USA
| | - Michael Hoffmeister
- Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Jeroen R Huyghe
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, WA, USA
| | - Kristina Jordahl
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, WA, USA
| | - Eric Kawaguchi
- Division of Biostatistics, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Temitope O Keku
- Center for Gastrointestinal Biology and Disease, University of North Carolina, Chapel Hill, NC, USA
| | - Susanna C Larsson
- Institute of Environmental Medicine, Karolinska Institutet, Stockholm, Sweden
| | | | - Juan Pablo Lewinger
- Division of Biostatistics, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Li Li
- Department of Family Medicine, University of Virginia, Charlottesville, VA, USA
| | - Bharuno Mahesworo
- Bioinformatics and Data Science Research Center, Bina Nusantara University, Jakarta, Indonesia
| | - John Morrison
- Division of Biostatistics, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Polly A Newcomb
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, WA, USA
- School of Public Health, University of Washington, Seattle, WA, USA
| | - Christina C Newton
- Department of Population Science, American Cancer Society, Atlanta, GA, USA
| | - Mireia Obon-Santacana
- Unit of Nutrition, Environment and Cancer, Cancer Epidemiology Research Program, Catalan Institute of Oncology (ICO-IDIBELL), Avda Gran Via Barcelona 199-203, 08908L'Hospitalet de Llobregat, Barcelona, Spain
| | - Jennifer Ose
- Huntsman Cancer Institute, University of Utah, Salt Lake City, UT, USA
- Department of Population Health Sciences, University of Utah, Salt Lake City, UH, USA
| | - Rish K Pai
- Department of Laboratory Medicine and Pathology, Mayo Clinic Arizona, Scottsdale, AZ, USA
| | - Julie R Palmer
- Slone Epidemiology Center at Boston University, Boston, MA, USA
| | - Nikos Papadimitriou
- Nutrition and Metabolism Branch, International Agency for Research on Cancer, Lyon, France
| | - Bens Pardamean
- Bioinformatics and Data Science Research Center, Bina Nusantara University, Jakarta, Indonesia
| | - Anita R Peoples
- Huntsman Cancer Institute, University of Utah, Salt Lake City, UT, USA
- Department of Population Health Sciences, University of Utah, Salt Lake City, UH, USA
| | - Paul D P Pharoah
- Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK
| | - Elizabeth A Platz
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - John D Potter
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, WA, USA
- Department of Epidemiology, School of Public Health, University of Washington, Seattle, WA, USA
- Research Centre for Hauora and Health, Massey University, Wellington, New Zealand
| | - Gad Rennert
- Department of Community Medicine and Epidemiology, Lady Davis Carmel Medical Center, Haifa, Israel
- Ruth and Bruce Rappaport Faculty of Medicine, Technion-Israel Institute of Technology, Haifa, Israel
- Clalit National Cancer Control Center, Haifa, Israel
| | - Peter C Scacheri
- Department of Genetics and Genome Sciences, Case Western Reserve University, Cleveland, OH, USA
| | - Robert E Schoen
- Department of Medicine and Epidemiology, University of Pittsburgh Medical Center, Pittsburgh, PA, USA
| | - Yu-Ru Su
- Biostatistics Division, Kaiser Permanente Washington Health Research Institute, Seattle, WA, USA
| | - Catherine M Tangen
- SWOG Statistical Center, Fred Hutchinson Cancer Center, Seattle, WA, USA
| | - Stephen N Thibodeau
- Division of Laboratory Genetics, Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, USA
| | - Duncan C Thomas
- Department of Population and Public Health Sciences & USC Norris Comprehensive Cancer Center, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Cornelia M Ulrich
- Huntsman Cancer Institute, University of Utah, Salt Lake City, UT, USA
- Department of Population Health Sciences, University of Utah, Salt Lake City, UH, USA
| | - Caroline Y Um
- Department of Population Science, American Cancer Society, Atlanta, GA, USA
| | | | - Kala Visvanathan
- Department of Epidemiology, School of Public Health, University of Washington, Seattle, WA, USA
| | - Pavel Vodicka
- Department of Molecular Biology of Cancer, Institute of Experimental Medicine of the Czech Academy of Sciences, Prague, Czech Republic
- Institute of Biology and Medical Genetics, First Faculty of Medicine, Charles University, Prague, Czech Republic
- Faculty of Medicine and Biomedical Center in Pilsen, Charles University, Pilsen, Czech Republic
| | - Ludmila Vodickova
- Department of Molecular Biology of Cancer, Institute of Experimental Medicine of the Czech Academy of Sciences, Prague, Czech Republic
- Institute of Biology and Medical Genetics, First Faculty of Medicine, Charles University, Prague, Czech Republic
- Faculty of Medicine and Biomedical Center in Pilsen, Charles University, Pilsen, Czech Republic
| | - Emily White
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, WA, USA
- Department of Epidemiology, School of Public Health, University of Washington, Seattle, WA, USA
| | - Alicja Wolk
- Institute of Environmental Medicine, Karolinska Institutet, Stockholm, Sweden
| | - Michael O Woods
- Memorial University of Newfoundland, Discipline of Genetics, St. John's, NL, Canada
| | - Conghui Qu
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, WA, USA
| | - Anshul Kundaje
- Department of Genetics, Stanford University, Stanford, CA, USA
- Department of Computer Science, Stanford University, Stanford, CA, USA
| | - Li Hsu
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, WA, USA
- Department of Biostatistics, University of Washington, Seattle, WA, USA
| | - W James Gauderman
- Division of Biostatistics, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Marc J Gunter
- Nutrition and Metabolism Branch, International Agency for Research on Cancer, Lyon, France
- School of Public Health, Imperial College London, London, United Kingdom
| | - Ulrike Peters
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, WA, USA
- Department of Epidemiology, School of Public Health, University of Washington, Seattle, WA, USA
| |
Collapse
|
3
|
Zhou Z, Ku HC, Manning SE, Zhang M, Xing C. A Varying Coefficient Model to Jointly Test Genetic and Gene-Environment Interaction Effects. Behav Genet 2023; 53:374-382. [PMID: 36622576 PMCID: PMC10277225 DOI: 10.1007/s10519-022-10131-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2021] [Accepted: 12/18/2022] [Indexed: 01/10/2023]
Abstract
Most human traits are influenced by the interplay between genetic and environmental factors. Many statistical methods have been proposed to screen for gene-environment interaction (GxE) in the post genome-wide association study era. However, most of the existing methods assume a linear interaction between genetic and environmental factors toward phenotypic variations, which diminishes statistical power in the case of nonlinear GxE. In this paper, we present a flexible statistical procedure to detect GxE regardless of whether the underlying relationship is linear or not. By modeling the joint genetic and GxE effects as a varying-coefficient function of the environmental factor, the proposed model is able to capture dynamic trajectories of GxE. We employ a likelihood ratio test with a fast Monte Carlo algorithm for hypothesis testing. Simulations were conducted to evaluate validity and power of the proposed model in various settings. Real data analysis was performed to illustrate its power, in particular, in the case of nonlinear GxE.
Collapse
Affiliation(s)
- Zhengyang Zhou
- Department of Biostatistics and Epidemiology, University of North Texas Health Science Center, Fort Worth, TX, USA.
| | - Hung-Chih Ku
- Department of Mathematical Sciences, DePaul University, Chicago, IL, USA
| | - Sydney E Manning
- Department of Pharmacotherapy, University of North Texas Health Science Center, Fort Worth, TX, USA
| | - Ming Zhang
- Department of Statistical Science, Southern Methodist University, Dallas, TX, USA
| | - Chao Xing
- McDermott Center for Human Growth and Development and Department of Bioinformatics, University of Texas Southwestern Medical Center, Dallas, TX, USA.
| |
Collapse
|
4
|
Martins J, Yusupov N, Binder EB, Brückl TM, Czamara D. Early adversity as the prototype gene × environment interaction in mental disorders? Pharmacol Biochem Behav 2022; 215:173371. [PMID: 35271857 DOI: 10.1016/j.pbb.2022.173371] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/07/2021] [Revised: 02/03/2022] [Accepted: 02/28/2022] [Indexed: 10/18/2022]
Abstract
Childhood adversity (CA) as a significant stressor has consistently been associated with the development of mental disorders. The interaction between CA and genetic variants has been proposed to play a substantial role in disease etiology. In this review, we focus on the gene by environment (GxE) paradigm, its background and interpretation and stress the necessity of its implementation in psychiatric research. Further, we discuss the findings supporting GxCA interactions, ranging from candidate gene studies to polygenic and genome-wide approaches, their strengths and limitations. To illustrate potential underlying epigenetic mechanisms by which GxE effects are translated, we focus on results from FKBP5 × CA studies and discuss how molecular evidence can supplement previous GxE findings. In conclusion, while GxE studies constitute a valuable line of investigation, more harmonized GxE studies in large, deep-phenotyped, longitudinal cohorts, and across different developmental stages are necessary to further substantiate and understand reported GxE findings.
Collapse
Affiliation(s)
- Jade Martins
- Department of Translational Research in Psychiatry, Max Planck Institute of Psychiatry, Munich 80804, Germany.
| | - Natan Yusupov
- Department of Translational Research in Psychiatry, Max Planck Institute of Psychiatry, Munich 80804, Germany; International Max Planck Research School for Translational Psychiatry (IMPRS-TP), Munich, Germany
| | - Elisabeth B Binder
- Department of Translational Research in Psychiatry, Max Planck Institute of Psychiatry, Munich 80804, Germany; Department of Psychiatry and Behavioral Sciences, Emory University School of Medicine, Atlanta, GA 30329, USA
| | - Tanja M Brückl
- Department of Translational Research in Psychiatry, Max Planck Institute of Psychiatry, Munich 80804, Germany
| | - Darina Czamara
- Department of Translational Research in Psychiatry, Max Planck Institute of Psychiatry, Munich 80804, Germany
| |
Collapse
|
5
|
Majumdar A, Burch KS, Haldar T, Sankararaman S, Pasaniuc B, Gauderman WJ, Witte JS. A two-step approach to testing overall effect of gene-environment interaction for multiple phenotypes. Bioinformatics 2021; 36:5640-5648. [PMID: 33453114 DOI: 10.1093/bioinformatics/btaa1083] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2020] [Revised: 12/09/2020] [Accepted: 12/17/2020] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION While gene-environment (GxE) interactions contribute importantly to many different phenotypes, detecting such interactions requires well-powered studies and has proven difficult. To address this, we combine two approaches to improve GxE power: simultaneously evaluating multiple phenotypes and using a two-step analysis approach. Previous work shows that the power to identify a main genetic effect can be improved by simultaneously analyzing multiple related phenotypes. For a univariate phenotype, two-step methods produce higher power for detecting a GxE interaction compared to single step analysis. Therefore, we propose a two-step approach to test for an overall GxE effect for multiple phenotypes. RESULTS Using simulations we demonstrate that, when more than one phenotype has GxE effect (i.e., GxE pleiotropy), our approach offers substantial gain in power (18%-43%) to detect an aggregate-level GxE effect for a multivariate phenotype compared to an analogous two-step method to identify GxE effect for a univariate phenotype. We applied the proposed approach to simultaneously analyze three lipids, LDL, HDL and Triglyceride with the frequency of alcohol consumption as environmental factor in the UK Biobank. The method identified two loci with an overall GxE effect on the vector of lipids, one of which was missed by the competing approaches. AVAILABILITY We provide an R package MPGE implementing the proposed approach which is available from CRAN: https://cran.r-project.org/web/packages/MPGE/index.html.
Collapse
Affiliation(s)
- Arunabha Majumdar
- Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA.,Department of Epidemiology and Biostatistics, University of California, San Francisco, CA, USA
| | - Kathryn S Burch
- Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA, USA
| | | | - Sriram Sankararaman
- Department of Computer Science, University of California, Los Angeles, Los Angeles, CA, USA
| | - Bogdan Pasaniuc
- Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA.,Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA, USA
| | - W James Gauderman
- Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - John S Witte
- Department of Epidemiology and Biostatistics, University of California, San Francisco, CA, USA
| |
Collapse
|
6
|
Campbell PT, Lin Y, Bien SA, Figueiredo JC, Harrison TA, Guinter MA, Berndt SI, Brenner H, Chan AT, Chang-Claude J, Gallinger SJ, Gapstur SM, Giles GG, Giovannucci E, Gruber SB, Gunter M, Hoffmeister M, Jacobs EJ, Jenkins MA, Le Marchand L, Li L, McLaughlin JR, Murphy N, Milne RL, Newcomb PA, Newton C, Ogino S, Potter JD, Rennert G, Rennert HS, Robinson J, Sakoda LC, Slattery ML, Song Y, White E, Woods MO, Casey G, Hsu L, Peters U. Association of Body Mass Index With Colorectal Cancer Risk by Genome-Wide Variants. J Natl Cancer Inst 2021; 113:38-47. [PMID: 32324875 PMCID: PMC7781451 DOI: 10.1093/jnci/djaa058] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2019] [Revised: 02/27/2020] [Accepted: 04/17/2020] [Indexed: 01/16/2023] Open
Abstract
BACKGROUND Body mass index (BMI) is a complex phenotype that may interact with genetic variants to influence colorectal cancer risk. METHODS We tested multiplicative statistical interactions between BMI (per 5 kg/m2) and approximately 2.7 million single nucleotide polymorphisms with colorectal cancer risk among 14 059 colorectal cancer case (53.2% women) and 14 416 control (53.8% women) participants. All analyses were stratified by sex a priori. Statistical methods included 2-step (ie, Cocktail method) and single-step (ie, case-control logistic regression and a joint 2-degree of freedom test) procedures. All statistical tests were two-sided. RESULTS Each 5 kg/m2 increase in BMI was associated with higher risks of colorectal cancer, less so for women (odds ratio [OR] = 1.14, 95% confidence intervals [CI] = 1.11 to 1.18; P = 9.75 × 10-17) than for men (OR = 1.26, 95% CI = 1.20 to 1.32; P = 2.13 × 10-24). The 2-step Cocktail method identified an interaction for women, but not men, between BMI and a SMAD7 intronic variant at 18q21.1 (rs4939827; Pobserved = .0009; Pthreshold = .005). A joint 2-degree of freedom test was consistent with this finding for women (joint P = 2.43 × 10-10). Each 5 kg/m2 increase in BMI was more strongly associated with colorectal cancer risk for women with the rs4939827-CC genotype (OR = 1.24, 95% CI = 1.16 to 1.32; P = 2.60 × 10-10) than for women with the CT (OR = 1.14, 95% CI = 1.09 to 1.19; P = 1.04 × 10-8) or TT (OR = 1.07, 95% CI = 1.01 to 1.14; P = .02) genotypes. CONCLUSION These results provide novel insights on a potential mechanism through which a SMAD7 variant, previously identified as a susceptibility locus for colorectal cancer, and BMI may influence colorectal cancer risk for women.
Collapse
Affiliation(s)
- Peter T Campbell
- Behavioral and Epidemiology Research Group, American Cancer Society, Atlanta, GA, USA
| | - Yi Lin
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Stephanie A Bien
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Jane C Figueiredo
- Department of Medicine, Samuel Oschin Comprehensive Cancer Institute, Cedars-Sinai Medical Center, Los Angeles, CA, USA
- Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Tabitha A Harrison
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Mark A Guinter
- Behavioral and Epidemiology Research Group, American Cancer Society, Atlanta, GA, USA
| | - Sonja I Berndt
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Hermann Brenner
- Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), Heidelberg, Germany
- Division of Preventive Oncology, German Cancer Research Center (DKFZ), and National Center for Tumor Diseases (NCT), Heidelberg, Germany
- German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Andrew T Chan
- Division of Gastroenterology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
- Channing Division of Network Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, USA
- Clinical and Translational Epidemiology Unit, Massachusetts General Hospital, and Harvard Medical School, Boston, MA, USA
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Harvard University, Boston, MA, USA
- Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Harvard University, Boston, MA, USA
| | - Jenny Chang-Claude
- Division of Cancer Epidemiology, German Cancer Research Center (DKFZ), Heidelberg, Germany
- University Medical Centre Hamburg-Eppendorf, University Cancer Centre Hamburg (UCCH), Hamburg, Germany
| | - Steven J Gallinger
- Lunenfeld Tanenbaum Research Institute, Mount Sinai Hospital, University of Toronto, Toronto, Ontario, Canada
| | - Susan M Gapstur
- Behavioral and Epidemiology Research Group, American Cancer Society, Atlanta, GA, USA
| | - Graham G Giles
- Cancer Epidemiology Division, Cancer Council Victoria, Melbourne, Victoria, Australia
- Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Melbourne, Victoria, Australia
- Precision Medicine, School of Clinical Sciences at Monash Health, Monash University, Clayton, Victoria, Australia
| | - Edward Giovannucci
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Harvard University, Boston, MA, USA
- Department of Nutrition, Harvard T.H. Chan School of Public Health, Harvard University, Boston, MA, USA
| | - Stephen B Gruber
- Center for Precision Medicine and Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, CA, USA
| | - Marc Gunter
- Section of Nutrition and Metabolism, International Agency for Research on Cancer, Lyon, France
| | - Michael Hoffmeister
- Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Eric J Jacobs
- Behavioral and Epidemiology Research Group, American Cancer Society, Atlanta, GA, USA
| | - Mark A Jenkins
- Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Melbourne, Victoria, Australia
| | | | - Li Li
- Department of Family Medicine and Cancer Center, University of Virginia, Charlottesville, VA, USA
| | - John R McLaughlin
- Dalla Lana School of Public Health, University of Toronto, Toronto, Ontario, Canada
| | - Neil Murphy
- Section of Nutrition and Metabolism, International Agency for Research on Cancer, Lyon, France
| | - Roger L Milne
- Cancer Epidemiology Division, Cancer Council Victoria, Melbourne, Victoria, Australia
- Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Melbourne, Victoria, Australia
- Precision Medicine, School of Clinical Sciences at Monash Health, Monash University, Clayton, Victoria, Australia
| | - Polly A Newcomb
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Christina Newton
- Behavioral and Epidemiology Research Group, American Cancer Society, Atlanta, GA, USA
| | - Shuji Ogino
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Harvard University, Boston, MA, USA
- Program in MPE Molecular Pathological Epidemiology, Department of Pathology, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, USA
- Department of Pathology, Brigham & Women’s Hospital, and Harvard Medical School, Boston, MA, USA
| | - John D Potter
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Gad Rennert
- Department of Community Medicine and Epidemiology, Lady Davis Carmel Medical Center, Haifa, Israel
- Ruth and Bruce Rappaport Faculty of Medicine, Technion-Israel Institute of Technology, Haifa, Israel
- Clalit National Cancer Control Center, Haifa, Israel
| | - Hedy S Rennert
- Department of Community Medicine and Epidemiology, Lady Davis Carmel Medical Center, Haifa, Israel
- Ruth and Bruce Rappaport Faculty of Medicine, Technion-Israel Institute of Technology, Haifa, Israel
- Clalit National Cancer Control Center, Haifa, Israel
| | - Jennifer Robinson
- Department of Epidemiology, College of Public Health, University of Iowa, Iowa City, IA, USA
| | - Lori C Sakoda
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
- Division of Research, Kaiser Permanente Northern California, Oakland, CA, USA
| | - Martha L Slattery
- Department of Internal Medicine, University of Utah, Salt Lake City, UT, USA
| | - Yiqing Song
- Department of Epidemiology, Richard M. Fairbanks School of Public Health, Indiana University, Indianapolis, IN, USA
| | - Emily White
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
- Department of Epidemiology, University of Washington School of Public Health, Seattle, WA, USA
| | - Michael O Woods
- Discipline of Genetics, Memorial University of Newfoundland, St. John’s, Newfoundland, Canada
| | - Graham Casey
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
| | - Li Hsu
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Ulrike Peters
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
- Department of Epidemiology, University of Washington School of Public Health, Seattle, WA, USA
| |
Collapse
|
7
|
Zhang S, Xue Y, Zhang Q, Ma C, Wu M, Ma S. Identification of gene-environment interactions with marginal penalization. Genet Epidemiol 2019; 44:159-196. [PMID: 31724772 DOI: 10.1002/gepi.22270] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2019] [Revised: 10/05/2019] [Accepted: 10/25/2019] [Indexed: 12/29/2022]
Abstract
Gene-environment (G-E) interaction analysis has been extensively conducted for complex diseases. In marginal analysis, the common practice is to conduct likelihood-based (and other "standard") estimation with each marginal model, and then select significant G-E interactions and main effects based on p values and multiple comparisons adjustment. One limitation of this approach is that the identification results often do not respect the "main effects, interactions" hierarchy, which has been stressed in recent G-E interaction analyses. There is some recent effort tackling this problem, however, with very complex formulations. Another limitation of the common practice is that it may not perform well when regularization is needed, for example, because of "non-normal" distributions. In this article, we propose a marginal penalization approach which adopts a novel penalty to directly tackle the aforementioned problems. The proposed approach has a framework more coherent with that of the recently developed joint analysis methods and an intuitive formulation, and can be effectively realized. In simulation, it outperforms the popular significance-based analysis and simple penalization-based alternatives. Promising findings are made in the analysis of a single-nucleotide polymorphism and a gene expression data.
Collapse
Affiliation(s)
- Sanguo Zhang
- School of Mathematics Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Yuan Xue
- School of Mathematics Sciences, University of Chinese Academy of Sciences, Beijing, China.,Department of Biostatistics, Yale University, New Haven, Connecticut
| | - Qingzhao Zhang
- Department of Statistics, School of Economics, Xiamen University, Xiamen, China
| | - Chenjin Ma
- Department of Biostatistics, Yale University, New Haven, Connecticut.,School of Statistics, Renmin University, Beijing, China
| | - Mengyun Wu
- School of Statistics and Management, Shanghai University of Finance and Economics, Shanghai, China
| | - Shuangge Ma
- Department of Biostatistics, Yale University, New Haven, Connecticut
| |
Collapse
|
8
|
Yu Y, Xia L, Lee S, Zhou X, Stringham HM, Boehnke M, Mukherjee B. Subset-Based Analysis Using Gene-Environment Interactions for Discovery of Genetic Associations across Multiple Studies or Phenotypes. Hum Hered 2019; 83:283-314. [PMID: 31132756 DOI: 10.1159/000496867] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2018] [Accepted: 01/04/2019] [Indexed: 01/11/2023] Open
Abstract
OBJECTIVES Classical methods for combining summary data from genome-wide association studies only use marginal genetic effects, and power can be compromised in the presence of heterogeneity. We aim to enhance the discovery of novel associated loci in the presence of heterogeneity of genetic effects in subgroups defined by an environmental factor. METHODS We present a pvalue-assisted subset testing for associations (pASTA) framework that generalizes the previously proposed association analysis based on subsets (ASSET) method by incorporating gene-environment (G-E) interactions into the testing procedure. We conduct simulation studies and provide two data examples. RESULTS Simulation studies show that our proposal is more powerful than methods based on marginal associations in the presence of G-E interactions and maintains comparable power even in their absence. Both data examples demonstrate that our method can increase power to detect overall genetic associations and identify novel studies/phenotypes that contribute to the association. CONCLUSIONS Our proposed method can be a useful screening tool to identify candidate single nucleotide polymorphisms that are potentially associated with the trait(s) of interest for further validation. It also allows researchers to determine the most probable subset of traits that exhibit genetic associations in addition to the enhancement of power.
Collapse
Affiliation(s)
- Youfei Yu
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA
| | - Lu Xia
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA
| | - Seunggeun Lee
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.,Center for Statistical Genetics, University of Michigan, Ann Arbor, Michigan, USA
| | - Xiang Zhou
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.,Center for Statistical Genetics, University of Michigan, Ann Arbor, Michigan, USA
| | - Heather M Stringham
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.,Center for Statistical Genetics, University of Michigan, Ann Arbor, Michigan, USA
| | - Michael Boehnke
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.,Center for Statistical Genetics, University of Michigan, Ann Arbor, Michigan, USA
| | - Bhramar Mukherjee
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA, .,Center for Statistical Genetics, University of Michigan, Ann Arbor, Michigan, USA,
| |
Collapse
|
9
|
Zhao N, Zhang H, Clark JJ, Maity A, Wu MC. Composite kernel machine regression based on likelihood ratio test for joint testing of genetic and gene–environment interaction effect. Biometrics 2019; 75:625-637. [DOI: 10.1111/biom.13003] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2018] [Accepted: 10/09/2018] [Indexed: 12/17/2022]
Affiliation(s)
- Ni Zhao
- Department of BiostatisticsJohns Hopkins UniversityBaltimore, Maryland
| | - Haoyu Zhang
- Department of BiostatisticsJohns Hopkins UniversityBaltimore, Maryland
| | - Jennifer J. Clark
- Department of BiostatisticsUniversity of North Carolina at Chapel HillChapel Hill, North Carolina
| | - Arnab Maity
- Department of StatisticsNorth Carolina State UniversityRaleigh, North Carolina
| | - Michael C. Wu
- Public Health Sciences Division,Fred Hutchinson Cancer Research CenterSeattle, Washington
| |
Collapse
|
10
|
Moss LC, Gauderman WJ, Lewinger JP, Conti DV. Using Bayes model averaging to leverage both gene main effects and G × E interactions to identify genomic regions in genome-wide association studies. Genet Epidemiol 2018; 43:150-165. [PMID: 30456811 DOI: 10.1002/gepi.22171] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2018] [Revised: 08/30/2018] [Indexed: 11/08/2022]
Abstract
Genome-wide association studies typically search for marginal associations between a single-nucleotide polymorphism (SNP) and a disease trait while gene-environment (G × E) interactions remain generally unexplored. More powerful methods beyond the simple case-control (CC) approach leverage either marginal effects or CC ascertainment to increase power. However, these potential gains depend on assumptions whose aptness is often unclear a priori. Here, we review G × E methods and use simulations to highlight performance as a function of main and interaction effects and the association of the two factors in the source population. Substantial variation in performance between methods leads to uncertainty as to which approach is most appropriate for any given analysis. We present a framework that (a) balances the robustness of a CC approach with the power of the case-only (CO) approach; (b) incorporates main SNP effects; (c) allows for incorporation of prior information; and (d) allows the data to determine the most appropriate model. Our framework is based on Bayes model averaging, which provides a principled statistical method for incorporating model uncertainty. We average over inclusion of parameters corresponding to the main and G × E interaction effects and the G-E association in controls. The resulting method exploits the joint evidence for main and interaction effects while gaining power from a CO equivalent analysis. Through simulations, we demonstrate that our approach detects SNPs within a wide range of scenarios with increased power over current methods. We illustrate the approach on a gene-environment scan in the USC Children's Health Study.
Collapse
Affiliation(s)
- Lilit C Moss
- Division of Biostatistics, Department of Preventive Medicine, University of Southern California, Los Angeles, California
| | - William J Gauderman
- Division of Biostatistics, Department of Preventive Medicine, University of Southern California, Los Angeles, California
| | - Juan Pablo Lewinger
- Division of Biostatistics, Department of Preventive Medicine, University of Southern California, Los Angeles, California
| | - David V Conti
- Division of Biostatistics, Department of Preventive Medicine, University of Southern California, Los Angeles, California
| |
Collapse
|
11
|
Alarcon F, Nuel G. Detecting latent exposure in genome-wide association studies using a breakpoint model for logistic regression. Stat Methods Med Res 2018; 28:1781-1792. [PMID: 29921158 DOI: 10.1177/0962280218776385] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Detecting gene-environment (G × E) interactions in the context of genome-wide association studies (GWAS) is a challenging problem since standard methods generally present a lack of power. An additional difficulty arises from the fact that the causal exposure is seldom observed and only a proxy of this exposure is observed. This leads to an additional drop in terms of power and it explains the failure of standard methods in detecting interactions, even very strong ones. In this article, we consider the latent exposure as a source of heterogeneity and we propose a new powerful method, named "Breakpoint Model for Logistic Regression" (BMLR), based on a breakpoint model, in order to detect G × E interactions when causal exposure is unobserved. First, the BMLR method is compared to the ordered-subset analysis for case-control method, which has been developed for the same purpose, through simulations. This highlights the ability of BMLR to detect the heterogeneity, and therefore, to detect interaction with latent exposure. Finally, the BMLR method is compared to standard methods, such as Plink, to perform a GWAS on a published realistic benchmark.
Collapse
Affiliation(s)
- Flora Alarcon
- 1 Laboratoire MAP5, Université Paris Descartes and CNRS, Sorbonne Paris Cité, Paris, France
| | - Gregory Nuel
- 2 Institute of Mathematics (INSMI), National Center for French Research (CNRS), Paris, France.,3 Stochastic and Biology Group, LPSM (CNRS 8001), Sorbonne Université, Paris, France
| |
Collapse
|
12
|
McAllister K, Mechanic LE, Amos C, Aschard H, Blair IA, Chatterjee N, Conti D, Gauderman WJ, Hsu L, Hutter CM, Jankowska MM, Kerr J, Kraft P, Montgomery SB, Mukherjee B, Papanicolaou GJ, Patel CJ, Ritchie MD, Ritz BR, Thomas DC, Wei P, Witte JS. Current Challenges and New Opportunities for Gene-Environment Interaction Studies of Complex Diseases. Am J Epidemiol 2017; 186:753-761. [PMID: 28978193 PMCID: PMC5860428 DOI: 10.1093/aje/kwx227] [Citation(s) in RCA: 116] [Impact Index Per Article: 16.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2016] [Revised: 03/14/2017] [Accepted: 03/16/2017] [Indexed: 12/25/2022] Open
Abstract
Recently, many new approaches, study designs, and statistical and analytical methods have emerged for studying gene-environment interactions (G×Es) in large-scale studies of human populations. There are opportunities in this field, particularly with respect to the incorporation of -omics and next-generation sequencing data and continual improvement in measures of environmental exposures implicated in complex disease outcomes. In a workshop called "Current Challenges and New Opportunities for Gene-Environment Interaction Studies of Complex Diseases," held October 17-18, 2014, by the National Institute of Environmental Health Sciences and the National Cancer Institute in conjunction with the annual American Society of Human Genetics meeting, participants explored new approaches and tools that have been developed in recent years for G×E discovery. This paper highlights current and critical issues and themes in G×E research that need additional consideration, including the improved data analytical methods, environmental exposure assessment, and incorporation of functional data and annotations.
Collapse
Affiliation(s)
| | - Leah E. Mechanic
- Correspondence to Dr. Leah E. Mechanic, Genomic Epidemiology Branch, Epidemiology and Genomics Research Program, Division of Cancer Control and Population Sciences, National Cancer Institute, 9609 Medical Center Drive, Room 4E104, MSC 9763, Bethesda, MD 20892 (e-mail: )
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
13
|
Gauderman WJ, Mukherjee B, Aschard H, Hsu L, Lewinger JP, Patel CJ, Witte JS, Amos C, Tai CG, Conti D, Torgerson DG, Lee S, Chatterjee N. Update on the State of the Science for Analytical Methods for Gene-Environment Interactions. Am J Epidemiol 2017; 186:762-770. [PMID: 28978192 PMCID: PMC5859988 DOI: 10.1093/aje/kwx228] [Citation(s) in RCA: 74] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2016] [Revised: 04/24/2017] [Accepted: 04/25/2017] [Indexed: 12/14/2022] Open
Abstract
The analysis of gene-environment interaction (G×E) may hold the key for further understanding the etiology of many complex traits. The current availability of high-volume genetic data, the wide range in types of environmental data that can be measured, and the formation of consortiums of multiple studies provide new opportunities to identify G×E but also new analytical challenges. In this article, we summarize several statistical approaches that can be used to test for G×E in a genome-wide association study. These include traditional models of G×E in a case-control or quantitative trait study as well as alternative approaches that can provide substantially greater power. The latest methods for analyzing G×E with gene sets and with data in a consortium setting are summarized, as are issues that arise due to the complexity of environmental data. We provide some speculation on why detecting G×E in a genome-wide association study has thus far been difficult. We conclude with a description of software programs that can be used to implement most of the methods described in the paper.
Collapse
Affiliation(s)
- W. James Gauderman
- Correspondence to Dr. W. James Gauderman, Division of Biostatistics, Department of Preventive Medicine, Keck School of Medicine, University of Southern California, 2001 North Soto Street, 202-K, Los Angeles, CA 90032 (e-mail: )
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
14
|
The importance of gene-environment interactions in human obesity. Clin Sci (Lond) 2017; 130:1571-97. [PMID: 27503943 DOI: 10.1042/cs20160221] [Citation(s) in RCA: 113] [Impact Index Per Article: 16.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2016] [Accepted: 05/23/2016] [Indexed: 12/16/2022]
Abstract
The worldwide obesity epidemic has been mainly attributed to lifestyle changes. However, who becomes obese in an obesity-prone environment is largely determined by genetic factors. In the last 20 years, important progress has been made in the elucidation of the genetic architecture of obesity. In parallel with successful gene identifications, the number of gene-environment interaction (GEI) studies has grown rapidly. This paper reviews the growing body of evidence supporting gene-environment interactions in the field of obesity. Heritability, monogenic and polygenic obesity studies provide converging evidence that obesity-predisposing genes interact with a variety of environmental, lifestyle and treatment exposures. However, some skepticism remains regarding the validity of these studies based on several issues, which include statistical modelling, confounding, low replication rate, underpowered analyses, biological assumptions and measurement precision. What follows in this review includes (1) an introduction to the study of GEI, (2) the evidence of GEI in the field of obesity, (3) an outline of the biological mechanisms that may explain these interaction effects, (4) methodological challenges associated with GEI studies and potential solutions, and (5) future directions of GEI research. Thus far, this growing body of evidence has provided a deeper understanding of GEI influencing obesity and may have tremendous applications in the emerging field of personalized medicine and individualized lifestyle recommendations.
Collapse
|
15
|
Sun Z, Mukherjee B, Estes JP, Vokonas PS, Park SK. Exposure enriched outcome dependent designs for longitudinal studies of gene-environment interaction. Stat Med 2017; 36:2947-2960. [PMID: 28497531 DOI: 10.1002/sim.7332] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2016] [Revised: 03/08/2017] [Accepted: 04/20/2017] [Indexed: 12/15/2022]
Abstract
Joint effects of genetic and environmental factors have been increasingly recognized in the development of many complex human diseases. Despite the popularity of case-control and case-only designs, longitudinal cohort studies that can capture time-varying outcome and exposure information have long been recommended for gene-environment (G × E) interactions. To date, literature on sampling designs for longitudinal studies of G × E interaction is quite limited. We therefore consider designs that can prioritize a subsample of the existing cohort for retrospective genotyping on the basis of currently available outcome, exposure, and covariate data. In this work, we propose stratified sampling based on summaries of individual exposures and outcome trajectories and develop a full conditional likelihood approach for estimation that adjusts for the biased sample. We compare the performance of our proposed design and analysis with combinations of different sampling designs and estimation approaches via simulation. We observe that the full conditional likelihood provides improved estimates for the G × E interaction and joint exposure effects over uncorrected complete-case analysis, and the exposure enriched outcome trajectory dependent design outperforms other designs in terms of estimation efficiency and power for detection of the G × E interaction. We also illustrate our design and analysis using data from the Normative Aging Study, an ongoing longitudinal cohort study initiated by the Veterans Administration in 1963. Copyright © 2017 John Wiley & Sons, Ltd.
Collapse
Affiliation(s)
- Zhichao Sun
- Department of Biostatistics, University of Michigan, Ann Arbor, 48109, MI, U.S.A
| | - Bhramar Mukherjee
- Department of Biostatistics, University of Michigan, Ann Arbor, 48109, MI, U.S.A.,Department of Epidemiology, University of Michigan, Ann Arbor, 48109, MI, U.S.A
| | - Jason P Estes
- Department of Biostatistics, University of Michigan, Ann Arbor, 48109, MI, U.S.A
| | - Pantel S Vokonas
- Veterans Affairs Normative Aging Study, VA Boston Healthcare System, Department of Medicine, Boston University School of Medicine, Boston, 02118, MA, U.S.A
| | - Sung Kyun Park
- Department of Epidemiology, University of Michigan, Ann Arbor, 48109, MI, U.S.A.,Department of Environmental Health Sciences, University of Michigan, Ann Arbor, 48109, MI, U.S.A
| |
Collapse
|
16
|
Rava M, Ahmed I, Kogevinas M, Le Moual N, Bouzigon E, Curjuric I, Dizier MH, Dumas O, Gonzalez JR, Imboden M, Mehta AJ, Tubert-Bitter P, Zock JP, Jarvis D, Probst-Hensch NM, Demenais F, Nadif R. Genes Interacting with Occupational Exposures to Low Molecular Weight Agents and Irritants on Adult-Onset Asthma in Three European Studies. ENVIRONMENTAL HEALTH PERSPECTIVES 2017; 125:207-214. [PMID: 27504716 PMCID: PMC5289825 DOI: 10.1289/ehp376] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/25/2016] [Revised: 04/18/2016] [Accepted: 06/13/2016] [Indexed: 06/06/2023]
Abstract
BACKGROUND The biological mechanisms by which cleaning products and disinfectants-an emerging risk factor-affect respiratory health remain incompletely evaluated. Studying genes by environment interactions (G × E) may help identify new genes related to adult-onset asthma. OBJECTIVES We identified interactions between genetic polymorphisms of a large set of genes involved in the response to oxidative stress and occupational exposures to low molecular weight (LMW) agents or irritants on adult-onset asthma. METHODS Our data came from three large European cohorts: Epidemiological Family-based Study of the Genetics and Environment of Asthma (EGEA), Swiss Cohort Study on Air Pollution and Lung and Heart Disease in Adults (SAPALDIA), and European Community Respiratory Health Survey in Adults (ECRHS). A candidate pathway-based strategy identified 163 genes involved in the response to oxidative stress and potentially related to exposures to LMW agents/irritants. Occupational exposures were evaluated using an asthma job-exposure matrix and job-specific questionnaires for cleaners and healthcare workers. Logistic regression models were used to detect G × E interactions, adjusted for age, sex, and population ancestry, in 2,599 adults (mean age, 47 years; 60% women, 36% exposed, 18% asthmatics). p-Values were corrected for multiple comparisons. RESULTS Ever exposure to LMW agents/irritants was associated with current adult-onset asthma [OR = 1.28 (95% CI: 1.04, 1.58)]. Eight single nucleotide polymorphism (SNP) by exposure interactions at five loci were found at p < 0.005: PLA2G4A (rs932476, chromosome 1), near PLA2R1 (rs2667026, chromosome 2), near RELA (rs931127, rs7949980, chromosome 11), PRKD1 (rs1958980, rs11847351, rs1958987, chromosome 14), and PRKCA (rs6504453, chromosome 17). Results were consistent across the three studies and after accounting for smoking. CONCLUSIONS Using a pathway-based selection process, we identified novel genes potentially involved in adult asthma by interaction with occupational exposure. These genes play a role in the NF-κB pathway, which is involved in inflammation. Citation: Rava M, Ahmed I, Kogevinas M, Le Moual N, Bouzigon E, Curjuric I, Dizier MH, Dumas O, Gonzalez JR, Imboden M, Mehta AJ, Tubert-Bitter P, Zock JP, Jarvis D, Probst-Hensch NM, Demenais F, Nadif R. 2017. Genes interacting with occupational exposures to low molecular weight agents and irritants on adult-onset asthma in three European studies. Environ Health Perspect 125:207-214; http://dx.doi.org/10.1289/EHP376.
Collapse
Affiliation(s)
- Marta Rava
- Inserm, U1168, VIMA: Aging and Chronic Diseases, Epidemiological and Public Health Approaches, Villejuif, France
- Spanish National Cancer Research Centre (CNIO), Genetic and Molecular Epidemiology Group, Human Cancer Genetics Program, Madrid, Spain
| | - Ismail Ahmed
- Inserm UMR 1181 [Biostatistics, Biomathematics, Pharmacoepidemiology and Infectious Diseases (B2PHI)], Villejuif, France
- Institut Pasteur, UMR 1181, B2PHI, Paris, France
- Univ Versailles St.-Quentin-en-Yvelines, UMR 1181, B2PHI, Montigny le Bretonneux, France
| | - Manolis Kogevinas
- ISGlobal, Centre for Research in Environmental Epidemiology (CREAL), Barcelona, Spain
- CIBER Epidemiología y Salud Pública, Madrid, Spain
| | - Nicole Le Moual
- Inserm, U1168, VIMA: Aging and Chronic Diseases, Epidemiological and Public Health Approaches, Villejuif, France
- Univ Versailles St.-Quentin-en-Yvelines, UMR-S 1168, Montigny le Bretonneux, France
| | - Emmanuelle Bouzigon
- Inserm, UMR-946, Genetic Variation and Human Diseases Unit, Paris, France
- Univ Paris Diderot, Sorbonne Paris Cité, Institut Universitaire d’Hématologie, Paris, France
| | - Ivan Curjuric
- Department of Epidemiology and Public Health, Swiss Tropical and Public Health Institute, Basel, Switzerland
- University of Basel, Switzerland
| | - Marie-Hélène Dizier
- Inserm, UMR-946, Genetic Variation and Human Diseases Unit, Paris, France
- Univ Paris Diderot, Sorbonne Paris Cité, Institut Universitaire d’Hématologie, Paris, France
| | - Orianne Dumas
- Inserm, U1168, VIMA: Aging and Chronic Diseases, Epidemiological and Public Health Approaches, Villejuif, France
- Univ Versailles St.-Quentin-en-Yvelines, UMR-S 1168, Montigny le Bretonneux, France
| | - Juan R. Gonzalez
- ISGlobal, Centre for Research in Environmental Epidemiology (CREAL), Barcelona, Spain
- CIBER Epidemiología y Salud Pública, Madrid, Spain
| | - Medea Imboden
- Department of Epidemiology and Public Health, Swiss Tropical and Public Health Institute, Basel, Switzerland
- University of Basel, Switzerland
| | - Amar J. Mehta
- Department of Epidemiology and Public Health, Swiss Tropical and Public Health Institute, Basel, Switzerland
- University of Basel, Switzerland
- Department of Environmental Health, Harvard School of Public Health, Boston, Massachusetts, USA
| | - Pascale Tubert-Bitter
- Inserm UMR 1181 [Biostatistics, Biomathematics, Pharmacoepidemiology and Infectious Diseases (B2PHI)], Villejuif, France
- Institut Pasteur, UMR 1181, B2PHI, Paris, France
- Univ Versailles St.-Quentin-en-Yvelines, UMR 1181, B2PHI, Montigny le Bretonneux, France
| | - Jan-Paul Zock
- ISGlobal, Centre for Research in Environmental Epidemiology (CREAL), Barcelona, Spain
- CIBER Epidemiología y Salud Pública, Madrid, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Deborah Jarvis
- Respiratory Epidemiology and Public Health, Imperial College, London, United Kingdom
- MRC-HPA (Medical Research Council and Health Protection Agency) Centre for Environment and Health, London, United Kingdom
| | - Nicole M. Probst-Hensch
- Department of Epidemiology and Public Health, Swiss Tropical and Public Health Institute, Basel, Switzerland
- University of Basel, Switzerland
| | - Florence Demenais
- Inserm, UMR-946, Genetic Variation and Human Diseases Unit, Paris, France
- Univ Paris Diderot, Sorbonne Paris Cité, Institut Universitaire d’Hématologie, Paris, France
| | - Rachel Nadif
- Inserm, U1168, VIMA: Aging and Chronic Diseases, Epidemiological and Public Health Approaches, Villejuif, France
- Univ Versailles St.-Quentin-en-Yvelines, UMR-S 1168, Montigny le Bretonneux, France
| |
Collapse
|
17
|
Methodical Considerations. HUMAN VACCINES 2017. [DOI: 10.1016/b978-0-12-802302-0.00006-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
18
|
Boonstra PS, Mukherjee B, Gruber SB, Ahn J, Schmit SL, Chatterjee N. Tests for Gene-Environment Interactions and Joint Effects With Exposure Misclassification. Am J Epidemiol 2016; 183:237-47. [PMID: 26755675 DOI: 10.1093/aje/kwv198] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2014] [Accepted: 07/15/2015] [Indexed: 12/12/2022] Open
Abstract
The number of methods for genome-wide testing of gene-environment (G-E) interactions continues to increase, with the aim of discovering new genetic risk factors and obtaining insight into the disease-gene-environment relationship. The relative performance of these methods, assessed on the basis of family-wise type I error rate and power, depends on underlying disease-gene-environment associations, estimates of which may be biased in the presence of exposure misclassification. This simulation study expands on a previously published simulation study of methods for detecting G-E interactions by evaluating the impact of exposure misclassification. We consider 7 single-step and modular screening methods for identifying G-E interaction at a genome-wide level and 7 joint tests for genetic association and G-E interaction, for which the goal is to discover new genetic susceptibility loci by leveraging G-E interaction when present. In terms of statistical power, modular methods that screen on the basis of the marginal disease-gene relationship are more robust to exposure misclassification. Joint tests that include main/marginal effects of a gene display a similar robustness, which confirms results from earlier studies. Our results offer an increased understanding of the strengths and limitations of methods for genome-wide searches for G-E interaction and joint tests in the presence of exposure misclassification.
Collapse
|
19
|
Dai JY, Zhang XC, Wang CY, Kooperberg C. Augmented case-only designs for randomized clinical trials with failure time endpoints. Biometrics 2015; 72:30-8. [PMID: 26347982 DOI: 10.1111/biom.12392] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2014] [Revised: 07/01/2015] [Accepted: 07/01/2015] [Indexed: 02/05/2023]
Abstract
Under suitable assumptions and by exploiting the independence between inherited genetic susceptibility and treatment assignment, the case-only design yields efficient estimates for subgroup treatment effects and gene-treatment interaction in a Cox model. However it cannot provide estimates of the genetic main effect and baseline hazards, that are necessary to compute the absolute disease risk. For two-arm, placebo-controlled trials with rare failure time endpoints, we consider augmenting the case-only design with random samples of controls from both arms, as in the classical case-cohort sampling scheme, or with a random sample of controls from the active treatment arm only. The latter design is motivated by vaccine trials for cost-effective use of resources and specimens so that host genetics and vaccine-induced immune responses can be studied simultaneously in a bigger set of participants. We show that these designs can identify all parameters in a Cox model and that the efficient case-only estimator can be incorporated in a two-step plug-in procedure. Results in simulations and a data example suggest that incorporating case-only estimators in the classical case-cohort design improves the precision of all estimated parameters; sampling controls only in the active treatment arm attains a similar level of efficiency.
Collapse
Affiliation(s)
- James Y Dai
- Fred Hutchinson Cancer Research Center and University of Washington, Seattle, Washington
| | - Xinyi Cindy Zhang
- Fred Hutchinson Cancer Research Center and University of Washington, Seattle, Washington
| | - Ching-Yun Wang
- Fred Hutchinson Cancer Research Center and University of Washington, Seattle, Washington
| | - Charles Kooperberg
- Fred Hutchinson Cancer Research Center and University of Washington, Seattle, Washington
| |
Collapse
|
20
|
Zhang FT, Zhu ZH, Tong XR, Zhu ZX, Qi T, Zhu J. Mixed Linear Model Approaches of Association Mapping for Complex Traits Based on Omics Variants. Sci Rep 2015. [PMID: 26223539 PMCID: PMC5155518 DOI: 10.1038/srep10298] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
Precise prediction for genetic architecture of complex traits is impeded by the limited understanding on genetic effects of complex traits, especially on gene-by-gene (GxG) and gene-by-environment (GxE) interaction. In the past decades, an explosion of high throughput technologies enables omics studies at multiple levels (such as genomics, transcriptomics, proteomics, and metabolomics). The analyses of large omics data, especially two-loci interaction analysis, are very time intensive. Integrating the diverse omics data and environmental effects in the analyses also remain challenges. We proposed mixed linear model approaches using GPU (Graphic Processing Unit) computation to simultaneously dissect various genetic effects. Analyses can be performed for estimating genetic main effects, GxG epistasis effects, and GxE environment interaction effects on large-scale omics data for complex traits, and for estimating heritability of specific genetic effects. Both mouse data analyses and Monte Carlo simulations demonstrated that genetic effects and environment interaction effects could be unbiasedly estimated with high statistical power by using the proposed approaches.
Collapse
Affiliation(s)
- Fu-Tao Zhang
- Institute of Bioinformatics, Zhejiang University, Hangzhou, China
| | - Zhi-Hong Zhu
- Institute of Bioinformatics, Zhejiang University, Hangzhou, China
| | - Xiao-Ran Tong
- Institute of Bioinformatics, Zhejiang University, Hangzhou, China
| | - Zhi-Xiang Zhu
- Institute of Bioinformatics, Zhejiang University, Hangzhou, China
| | - Ting Qi
- Institute of Bioinformatics, Zhejiang University, Hangzhou, China
| | - Jun Zhu
- Institute of Bioinformatics, Zhejiang University, Hangzhou, China
| |
Collapse
|
21
|
Bureau A, Croteau J, Couture C, Vohl MC, Bouchard C, Pérusse L. Estimating genetic effect sizes under joint disease-endophenotype models in presence of gene-environment interactions. Front Genet 2015; 6:248. [PMID: 26284107 PMCID: PMC4516976 DOI: 10.3389/fgene.2015.00248] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2015] [Accepted: 07/08/2015] [Indexed: 12/18/2022] Open
Abstract
Effects of genetic variants on the risk of complex diseases estimated from association studies are typically small. Nonetheless, variants may have important effects in presence of specific levels of environmental exposures, and when a trait related to the disease (endophenotype) is either normal or impaired. We propose polytomous and transition models to represent the relationship between disease, endophenotype, genotype and environmental exposure in family studies. Model coefficients were estimated using generalized estimating equations and were used to derive gene-environment interaction effects and genotype effects at specific levels of exposure. In a simulation study, estimates of the effect of a genetic variant were substantially higher when both an endophenotype and an environmental exposure modifying the variant effect were taken into account, particularly under transition models, compared to the alternative of ignoring the endophenotype. Illustration of the proposed modeling with the metabolic syndrome, abdominal obesity, physical activity and polymorphisms in the NOX3 gene in the Quebec Family Study revealed that the positive association of the A allele of rs1375713 with the metabolic syndrome at high levels of physical activity was only detectable in subjects without abdominal obesity, illustrating the importance of taking into account the abdominal obesity endophenotype in this analysis.
Collapse
Affiliation(s)
- Alexandre Bureau
- Laboratoire de Biostatistique et Psychiatrie Génétique, Centre de Recherche de l'Institut Universitaire en Santé Mentale de Québec Québec, QC, Canada ; Département de Médecine Sociale et Préventive, Université Laval Québec, QC, Canada
| | - Jordie Croteau
- Laboratoire de Biostatistique et Psychiatrie Génétique, Centre de Recherche de l'Institut Universitaire en Santé Mentale de Québec Québec, QC, Canada
| | | | - Marie-Claude Vohl
- Institut sur la Nutrition et les Aliments Fonctionnels, Université Laval Québec, QC, Canada ; École de Nutrition, Université Laval Québec, QC, Canada
| | - Claude Bouchard
- Human Genomics Laboratory, Pennington Biomedical Research Center Baton Rouge, LA, USA
| | - Louis Pérusse
- Département de Kinésiologie, Université Laval Québec, QC, Canada ; Institut sur la Nutrition et les Aliments Fonctionnels, Université Laval Québec, QC, Canada
| |
Collapse
|
22
|
Stenzel SL, Ahn J, Boonstra PS, Gruber SB, Mukherjee B. The impact of exposure-biased sampling designs on detection of gene-environment interactions in case-control studies with potential exposure misclassification. Eur J Epidemiol 2015; 30:413-23. [PMID: 24894824 PMCID: PMC4256150 DOI: 10.1007/s10654-014-9908-1] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2013] [Accepted: 04/25/2014] [Indexed: 10/25/2022]
Abstract
With limited funding and biological specimen availability, choosing an optimal sampling design to maximize power for detecting gene-by-environment (G-E) interactions is critical. Exposure-enriched sampling is often used to select subjects with rare exposures for genotyping to enhance power for tests of G-E effects. However, exposure misclassification (MC) combined with biased sampling can affect characteristics of tests for G-E interaction and joint tests for marginal association and G-E interaction. Here, we characterize the impact of exposure-biased sampling under conditions of perfect exposure information and exposure MC on properties of several methods for conducting inference. We assess the Type I error, power, bias, and mean squared error properties of case-only, case-control, and empirical Bayes methods for testing/estimating G-E interaction and a joint test for marginal G (or E) effect and G-E interaction across three biased sampling schemes. Properties are evaluated via empirical simulation studies. With perfect exposure information, exposure-enriched sampling schemes enhance power as compared to random selection of subjects irrespective of exposure prevalence but yield bias in estimation of the G-E interaction and marginal E parameters. Exposure MC modifies the relative performance of sampling designs when compared to the case of perfect exposure information. Those conducting G-E interaction studies should be aware of exposure MC properties and the prevalence of exposure when choosing an ideal sampling scheme and method for characterizing G-E interactions and joint effects.
Collapse
Affiliation(s)
- Stephanie L Stenzel
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA,
| | | | | | | | | |
Collapse
|
23
|
Assessing gene-environment interactions for common and rare variants with binary traits using gene-trait similarity regression. Genetics 2015; 199:695-710. [PMID: 25585620 DOI: 10.1534/genetics.114.171686] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Accounting for gene-environment (G×E) interactions in complex trait association studies can facilitate our understanding of genetic heterogeneity under different environmental exposures, improve the ability to discover susceptible genes that exhibit little marginal effect, provide insight into the biological mechanisms of complex diseases, help to identify high-risk subgroups in the population, and uncover hidden heritability. However, significant G×E interactions can be difficult to find. The sample sizes required for sufficient power to detect association are much larger than those needed for genetic main effects, and interactions are sensitive to misspecification of the main-effects model. These issues are exacerbated when working with binary phenotypes and rare variants, which bear less information on association. In this work, we present a similarity-based regression method for evaluating G×E interactions for rare variants with binary traits. The proposed model aggregates the genetic and G×E information across markers, using genetic similarity, thus increasing the ability to detect G×E signals. The model has a random effects interpretation, which leads to robustness against main-effect misspecifications when evaluating G×E interactions. We construct score tests to examine G×E interactions and a computationally efficient EM algorithm to estimate the nuisance variance components. Using simulations and data applications, we show that the proposed method is a flexible and powerful tool to study the G×E effect in common or rare variant studies with binary traits.
Collapse
|
24
|
Abstract
A framework is presented that allows an investigator to estimate the portion of the effect of one exposure that is attributable to an interaction with a second exposure. We show that when the 2 exposures are statistically independent in distribution, the total effect of one exposure can be decomposed into a conditional effect of that exposure when the second is absent and also a component due to interaction. The decomposition applies on difference or ratio scales. We discuss how the components can be estimated using standard regression models, and how these components can be used to evaluate the proportion of the total effect of the primary exposure attributable to the interaction with the second exposure. In the setting in which one of the exposures affects the other, so that the 2 are no longer statistically independent in distribution, alternative decompositions are discussed. The various decompositions are illustrated with an example in genetic epidemiology. If it is not possible to intervene on the primary exposure of interest, the methods described in this article can help investigators to identify other variables that, if intervened upon, would eliminate the largest proportion of the effect of the primary exposure.
Collapse
Affiliation(s)
- Tyler J VanderWeele
- From the Departments of Epidemiology and Biostatistics, Harvard School of Public Health, Boston, MA
| | | |
Collapse
|
25
|
Ueki M. On the choice of degrees of freedom for testing gene-gene interactions. Stat Med 2014; 33:4934-48. [DOI: 10.1002/sim.6264] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2013] [Revised: 06/13/2014] [Accepted: 06/20/2014] [Indexed: 12/24/2022]
Affiliation(s)
- Masao Ueki
- Tohoku Medical Megabank Organization; Tohoku University, Graduate School of Medicine; 2-1 Seiryo-machi, Aoba-ku Sendai 980-8573 Japan
| |
Collapse
|
26
|
Lunceford JK, Cheng J, Wong P, Mehrotra DV. Ancestry Adjustments in Genome-Wide Association Studies of Randomized Clinical Trials. Stat Biopharm Res 2014. [DOI: 10.1080/19466315.2013.873000] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
27
|
Goodman M, Mandel JS, DeSesso JM, Scialli AR. Atrazine and pregnancy outcomes: a systematic review of epidemiologic evidence. ACTA ACUST UNITED AC 2014; 101:215-36. [PMID: 24797711 PMCID: PMC4265844 DOI: 10.1002/bdrb.21101] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2013] [Accepted: 01/17/2014] [Indexed: 01/04/2023]
Abstract
Atrazine (ATR) is a commonly used agricultural herbicide that has been the subject of epidemiologic studies assessing its relation to reproductive health problems. This review evaluates both the consistency and the quality of epidemiologic evidence testing the hypothesis that ATR exposure, at usually encountered levels, is a risk factor for birth defects, small for gestational age birth weight, prematurity, miscarriages, and problems of fetal growth and development. We followed the current methodological guidelines for systematic reviews by using two independent researchers to identify, retrieve, and evaluate the relevant epidemiologic literature on the relation of ATR to various adverse outcomes of birth and pregnancy. Each eligible paper was summarized with respect to its methods and results with particular attention to study design and exposure assessment, which have been cited as the main areas of weakness in ATR research. As a quantitative meta-analysis was not feasible, the study results were categorized qualitatively as positive, null, or mixed. The literature on ATR and pregnancy-related health outcomes is growing rapidly, but the quality of the data is poor with most papers using aggregate rather than individual-level information. Without good quality data, the results are difficult to assess; however, it is worth noting that none of the outcome categories demonstrated consistent positive associations across studies. Considering the poor quality of the data and the lack of robust findings across studies, conclusions about a causal link between ATR and adverse pregnancy outcomes are not warranted.
Collapse
|
28
|
Huang YT. Integrative modeling of multiple genomic data from different types of genetic association studies. Biostatistics 2014; 15:587-602. [PMID: 24705142 DOI: 10.1093/biostatistics/kxu014] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
Genome-wide association studies (GWASs) and expression-/methylation-quantitative trait loci (eQTL/mQTL) studies constitute popular approaches for investigating the association of single nucleotide polymorphisms (SNPs) with disease and expression/methylation, respectively. Here, we propose to integrate QTL studies to more powerfully test the SNP effect on disease in GWASs when they are conducted among different subjects. We propose a model for the joint effect of SNPs, methylation, and gene expression on disease risk and obtain the marginal model for SNPs by integrating out methylation and expression. We characterize all possible causal relations among SNPs, methylation, and expression and study the corresponding null hypotheses of no SNP effect in terms of the regression coefficients in the joint model. We develop a score test for variance components of regression coefficients to evaluate the genetic effect. We further propose an omnibus test to accommodate different models. We illustrate the utility of the proposed method in an asthma GWAS study, a brain tumor study, and numerical simulations.
Collapse
Affiliation(s)
- Yen-Tsung Huang
- Department of Epidemiology, Brown University, Providence, RI 02912, USA
| |
Collapse
|
29
|
Schoeps A, Rudolph A, Seibold P, Dunning AM, Milne RL, Bojesen SE, Swerdlow A, Andrulis I, Brenner H, Behrens S, Orr N, Jones M, Ashworth A, Li J, Cramp H, Connley D, Czene K, Darabi H, Chanock SJ, Lissowska J, Figueroa JD, Knight J, Glendon G, Mulligan AM, Dumont M, Severi G, Baglietto L, Olson J, Vachon C, Purrington K, Moisse M, Neven P, Wildiers H, Spurdle A, Kosma VM, Kataja V, Hartikainen JM, Hamann U, Ko YD, Dieffenbach AK, Arndt V, Stegmaier C, Malats N, Arias Perez J, Benítez J, Flyger H, Nordestgaard BG, Truong T, Cordina-Duverger E, Menegaux F, Silva IDS, Fletcher O, Johnson N, Häberle L, Beckmann MW, Ekici AB, Braaf L, Atsma F, van den Broek AJ, Makalic E, Schmidt DF, Southey MC, Cox A, Simard J, Giles GG, Lambrechts D, Mannermaa A, Brauch H, Guénel P, Peto J, Fasching PA, Hopper J, Flesch-Janys D, Couch F, Chenevix-Trench G, Pharoah PDP, Garcia-Closas M, Schmidt MK, Hall P, Easton DF, Chang-Claude J. Identification of new genetic susceptibility loci for breast cancer through consideration of gene-environment interactions. Genet Epidemiol 2014; 38:84-93. [PMID: 24248812 PMCID: PMC3995140 DOI: 10.1002/gepi.21771] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2013] [Revised: 10/15/2013] [Accepted: 10/15/2013] [Indexed: 11/06/2022]
Abstract
Genes that alter disease risk only in combination with certain environmental exposures may not be detected in genetic association analysis. By using methods accounting for gene-environment (G × E) interaction, we aimed to identify novel genetic loci associated with breast cancer risk. Up to 34,475 cases and 34,786 controls of European ancestry from up to 23 studies in the Breast Cancer Association Consortium were included. Overall, 71,527 single nucleotide polymorphisms (SNPs), enriched for association with breast cancer, were tested for interaction with 10 environmental risk factors using three recently proposed hybrid methods and a joint test of association and interaction. Analyses were adjusted for age, study, population stratification, and confounding factors as applicable. Three SNPs in two independent loci showed statistically significant association: SNPs rs10483028 and rs2242714 in perfect linkage disequilibrium on chromosome 21 and rs12197388 in ARID1B on chromosome 6. While rs12197388 was identified using the joint test with parity and with age at menarche (P-values = 3 × 10(-07)), the variants on chromosome 21 q22.12, which showed interaction with adult body mass index (BMI) in 8,891 postmenopausal women, were identified by all methods applied. SNP rs10483028 was associated with breast cancer in women with a BMI below 25 kg/m(2) (OR = 1.26, 95% CI 1.15-1.38) but not in women with a BMI of 30 kg/m(2) or higher (OR = 0.89, 95% CI 0.72-1.11, P for interaction = 3.2 × 10(-05)). Our findings confirm comparable power of the recent methods for detecting G × E interaction and the utility of using G × E interaction analyses to identify new susceptibility loci.
Collapse
Affiliation(s)
- Anja Schoeps
- Department of Cancer Epidemiology, German Cancer Research
Center (DKFZ), Heidelberg, Germany
- Institute of Public Health, University of Heidelberg,
Heidelberg, Germany
| | - Anja Rudolph
- Department of Cancer Epidemiology, German Cancer Research
Center (DKFZ), Heidelberg, Germany
| | - Petra Seibold
- Department of Cancer Epidemiology, German Cancer Research
Center (DKFZ), Heidelberg, Germany
| | - Alison M. Dunning
- Department of Oncology, University of Cambridge, Cambridge,
United Kingdom
| | - Roger L. Milne
- Genetic and Molecular Epidemiology Group, Spanish National
Cancer Research Centre (CNIO), Madrid, Spain
| | - Stig E. Bojesen
- Department of Clinical Biochemistry, Herlev Hospital,
Copenhagen University Hospital, Herlev, Denmark
- Copenhagen General Population Study, Herlev Hospital,
Copenhagen University Hospital, Herlev, Denmark
| | - Anthony Swerdlow
- Department of Genetics and Epidemiology, Institute of
Cancer Research, Sutton, United Kingdom
| | - Irene Andrulis
- Department of Molecular Genetics, Lunenfeld-Tanenbaum
Research Institute of Mount Sinai Hospital, Toronto, Canada
| | - Hermann Brenner
- Division of Clinical Epidemiology and Aging Research,
German Cancer Research Center (DKFZ), Heidelberg, Germany
- German Cancer Consortium, Heidelberg, Germany
| | - Sabine Behrens
- Department of Cancer Epidemiology, German Cancer Research
Center (DKFZ), Heidelberg, Germany
| | - Nicholas Orr
- Department of Breast Cancer Research, Institute of Cancer
Research, London, United Kingdom
| | - Michael Jones
- Copenhagen General Population Study, Herlev Hospital,
Copenhagen University Hospital, Herlev, Denmark
| | - Alan Ashworth
- Department of Human Genetics, Genome Institute of
Singapore, Singapore, Singapore
| | - Jingmei Li
- Department of Human Genetics, Genome Institute of
Singapore, Singapore, Singapore
| | - Helen Cramp
- Department of Oncology, Institute for Cancer Studies,
University of Sheffield, Sheffield, United Kingdom
| | - Dan Connley
- Department of Oncology, Institute for Cancer Studies,
University of Sheffield, Sheffield, United Kingdom
| | - Kamila Czene
- Department of Medical Epidemiology and Biostatistics,
Karolinska Institutet, Stockholm, Sweden
| | - Hatef Darabi
- Department of Medical Epidemiology and Biostatistics,
Karolinska Institutet, Stockholm, Sweden
| | - Stephen J. Chanock
- Division of Cancer Epidemiology and Genetics, National
Cancer Institute, Rockville, Maryland, United States of America
| | - Jolanta Lissowska
- Department of Cancer Epidemiology and Prevention, M.
Sklodowska-Curie Memorial Cancer Center and Institute of Oncology, Warsaw,
Poland
| | - Jonine D. Figueroa
- Division of Cancer Epidemiology and Genetics, National
Cancer Institute, Rockville, Maryland, United States of America
| | - Julia Knight
- Lunenfeld-Tanenbaum Research Institute of Mount Sinai
Hospital, Toronto, Canada
- Prosserman Centre for Health Research, Toronto,
Canada
| | - Gord Glendon
- Prosserman Centre for Health Research, Toronto,
Canada
| | - Anna M. Mulligan
- Laboratory Medicine Program, University Health Network,
Toronto, Canada
| | - Martine Dumont
- Cancer Genomics Laboratory, Centre Hospitalier
Universitaire de Québec Research Center, Laval University, Québec,
Canada
- Department of Molecular Medicine, Faculty of Medicine,
Quebec, Canada
| | - Gianluca Severi
- Cancer Epidemiology Centre, Cancer Council Victoria,
Melbourne, Australia
| | - Laura Baglietto
- Cancer Epidemiology Centre, Cancer Council Victoria,
Melbourne, Australia
- Centre for Molecular, Environmental, Genetic, and
Analytic Epidemiology, University of Melbourne, Melbourne, Australia
| | - Janet Olson
- Department of Health Sciences Research, Mayo Clinic,
Minnesota, United States of America
| | - Celine Vachon
- Department of Health Sciences Research, Mayo Clinic,
Minnesota, United States of America
| | - Kristen Purrington
- Department of Health Sciences Research, Mayo Clinic,
Minnesota, United States of America
| | - Matthieu Moisse
- Vesalius Research Center (VRC), VIB, Flanders,
Belgium
- Laboratory of Translational Genetics, Department of
Oncology, University of Leuven, Leuven, Belgium
| | - Patrick Neven
- Department of Multidisciplinary Breast Cancer, University
Hospital Gasthuisberg, Leuven, Belgium
| | - Hans Wildiers
- Department of Multidisciplinary Breast Cancer, University
Hospital Gasthuisberg, Leuven, Belgium
| | - Amanda Spurdle
- Department of Molecular Cancer Epidemiology, Queensland
Institute of Medical Research, Brisbane Australia
| | | | - Vesa Kataja
- Pathology Department, University of Kuopio, Kuopio,
Finland
| | | | - Ute Hamann
- Division of Molecular Genetics of Breast Cancer, German
Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Yon-Dschun Ko
- Department of Internal Medicine, Evangelische Kliniken
Bonn GmbH, Johanniter Krankenhaus, Bonn, Germany
| | - Aida K. Dieffenbach
- Division of Clinical Epidemiology and Aging Research,
German Cancer Research Center (DKFZ), Heidelberg, Germany
- German Cancer Consortium, Heidelberg, Germany
| | - Volker Arndt
- Division of Clinical Epidemiology and Aging Research,
German Cancer Research Center (DKFZ), Heidelberg, Germany
| | | | - Núria Malats
- Genetic and Molecular Epidemiology Group, Spanish National
Cancer Research Centre (CNIO), Madrid, Spain
| | - JoséI. Arias Perez
- Servicio de Cirugía General y Especialidades,
Hospital Monte Naranco, Oviedo, Spain
| | - Javier Benítez
- Human Genetics Group, Spanish National Cancer Reserach
Centre (CNIO), Madrid, Spain
| | - Henrik Flyger
- Department of Breast Surgery, Herlev Hospital, Copenhagen
University Hospital, Herlev, Denmark
| | - Børge G. Nordestgaard
- Department of Clinical Biochemistry, Herlev Hospital,
Copenhagen University Hospital, Herlev, Denmark
- Copenhagen General Population Study, Herlev Hospital,
Copenhagen University Hospital, Herlev, Denmark
| | - Théresè Truong
- Unité Mixte de Recherche Scientifique (UMRS) 1018,
University Paris-Sud, Villejuif, France
- INSERM (National Institute of Health and Medical
Research), CESP (Center for Research in Epidemiology and Population Health), U1018,
Environmental Epidemiology of Cancer, Villejuif, France
| | - Emilie Cordina-Duverger
- Unité Mixte de Recherche Scientifique (UMRS) 1018,
University Paris-Sud, Villejuif, France
- INSERM (National Institute of Health and Medical
Research), CESP (Center for Research in Epidemiology and Population Health), U1018,
Environmental Epidemiology of Cancer, Villejuif, France
| | - Florence Menegaux
- Unité Mixte de Recherche Scientifique (UMRS) 1018,
University Paris-Sud, Villejuif, France
- INSERM (National Institute of Health and Medical
Research), CESP (Center for Research in Epidemiology and Population Health), U1018,
Environmental Epidemiology of Cancer, Villejuif, France
| | - Isabel dos Santos Silva
- Department of Non-Communicable Disease Epidemiology,
London School of Hygiene and Tropical Medicine, London, United Kingdom
| | - Olivia Fletcher
- Breakthrough Breast Cancer Research Centre, Institute of
Cancer Research, London, United Kingdom
| | - Nichola Johnson
- Breakthrough Breast Cancer Research Centre, Institute of
Cancer Research, London, United Kingdom
| | - Lothar Häberle
- Department of Gynecology and Obstetrics, University
Hospital Erlangen, Friedrich-Alexander University Erlangen-Nuremberg, Erlangen,
Germany
| | - Matthias W. Beckmann
- Department of Gynecology and Obstetrics, University
Hospital Erlangen, Friedrich-Alexander University Erlangen-Nuremberg, Erlangen,
Germany
| | - Arif B. Ekici
- Institute of Human Genetics, Friedrich-Alexander
University Erlangen-Nuremberg, Erlangen, Germany
| | - Linde Braaf
- Division of Molecular Pathology, Netherlands Cancer
Institute, Amsterdam, The Netherlands
- Division of Psychosocial Research and Epidemiology,
Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Femke Atsma
- Department of Donor Studies, Sanquin Nijmegen, Nijmegen,
The Netherlands
| | - Alexandra J. van den Broek
- Division of Psychosocial Research and Epidemiology,
Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Enes Makalic
- Department of Analytic Epidemiology, University of
Melbourne, Melbourne, Australia
| | - Daniel F. Schmidt
- Centre for Molecular, Environmental, Genetic, and
Analytic Epidemiology, University of Melbourne, Melbourne, Australia
| | | | - Angela Cox
- Department of Oncology, Institute for Cancer Studies,
University of Sheffield, Sheffield, United Kingdom
| | - Jacques Simard
- Cancer Genomics Laboratory, Centre Hospitalier
Universitaire de Québec Research Center, Laval University, Québec,
Canada
- Department of Molecular Medicine, Faculty of Medicine,
Quebec, Canada
| | - Graham G. Giles
- Cancer Epidemiology Centre, Cancer Council Victoria,
Melbourne, Australia
- Centre for Molecular, Environmental, Genetic, and
Analytic Epidemiology, University of Melbourne, Melbourne, Australia
| | - Diether Lambrechts
- Vesalius Research Center (VRC), VIB, Flanders,
Belgium
- Laboratory of Translational Genetics, Department of
Oncology, University of Leuven, Leuven, Belgium
| | - Arto Mannermaa
- Department of Pathology and Forensic Medicine, Kuopio
University Hospital, University of Kuopio, Kuopio, Finland
| | - Hiltrud Brauch
- Dr. Margarete Fischer-Bosch Institute of Clinical
Pharamcology, Stuttgart, Germany
| | - Pascal Guénel
- Unité Mixte de Recherche Scientifique (UMRS) 1018,
University Paris-Sud, Villejuif, France
- INSERM (National Institute of Health and Medical
Research), CESP (Center for Research in Epidemiology and Population Health), U1018,
Environmental Epidemiology of Cancer, Villejuif, France
| | - Julian Peto
- Department of Non-Communicable Disease Epidemiology,
London School of Hygiene and Tropical Medicine, London, United Kingdom
| | - Peter A. Fasching
- Department of Gynecology and Obstetrics, University
Hospital Erlangen, Friedrich-Alexander University Erlangen-Nuremberg, Erlangen,
Germany
- Department of Medicine, David Geffen School of Medicine,
University of California, Los Angeles, United States
| | - John Hopper
- Centre for Molecular, Environmental, Genetic, and
Analytic Epidemiology, University of Melbourne, Victoria, Australia
| | - Dieter Flesch-Janys
- Department of Cancer Epidemiology/Clinical Cancer
Registry, University Clinic Hamburg-Eppendorf, Hamburg, Germany
- Institute for Medical Biometrics and Epidemiology,
University Clinic Hamburg-Eppendorf, Hamburg, Germany
| | - Fergus Couch
- Department of Experimental Pathology, Mayo Clinic,
Rochester, Minnesota, United States of America
| | - Georgia Chenevix-Trench
- Department of Molecular Cancer Epidemiology, Queensland
Institute of Medical Research, Brisbane Australia
| | - Paul D. P. Pharoah
- Department of Oncology and Public Health and Primary
Care, University of Cambridge, Cambridge, United Kingdom
| | - Montserrat Garcia-Closas
- Division of Genetics and Epidemiology, Breakthrough
Breast Cancer Research Centre, Institute of Cancer Research, London, United
Kingdom
| | - Marjanka K. Schmidt
- Division of Molecular Pathology, Netherlands Cancer
Institute, Amsterdam, The Netherlands
- Division of Psychosocial Research and Epidemiology,
Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Per Hall
- Department of Medical Epidemiology and Biostatistics,
Karolinska Institutet, Stockholm, Sweden
| | - Douglas F. Easton
- Department of Public Health and Primary Care, University
of Cambridge, Cambridge, United Kingdom
| | - Jenny Chang-Claude
- Correspondence to: Jenny Chang-Claude, Department
of Cancer Epidemiology, German Cancer Research Center (DKFZ), Im Neuenheimer
Feld 581, 69120 Heidelberg, Germany.
| |
Collapse
|
30
|
Abstract
AbstractIn this tutorial, we provide a broad introduction to the topic of interaction between the effects of exposures. We discuss interaction on both additive and multiplicative scales using risks, and we discuss their relation to statistical models (e.g. linear, log-linear, and logistic models). We discuss and evaluate arguments that have been made for using additive or multiplicative scales to assess interaction. We further discuss approaches to presenting interaction analyses, different mechanistic forms of interaction, when interaction is robust to unmeasured confounding, interaction for continuous outcomes, qualitative or “crossover” interactions, methods for attributing effects to interactions, case-only estimators of interaction, and power and sample size calculations for additive and multiplicative interaction.
Collapse
|
31
|
Hutter CM, Mechanic LE, Chatterjee N, Kraft P, Gillanders EM. Gene-environment interactions in cancer epidemiology: a National Cancer Institute Think Tank report. Genet Epidemiol 2013; 37:643-57. [PMID: 24123198 PMCID: PMC4143122 DOI: 10.1002/gepi.21756] [Citation(s) in RCA: 83] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2013] [Revised: 08/06/2013] [Accepted: 08/14/2013] [Indexed: 01/04/2023]
Abstract
Cancer risk is determined by a complex interplay of genetic and environmental factors. Genome-wide association studies (GWAS) have identified hundreds of common (minor allele frequency [MAF] > 0.05) and less common (0.01 < MAF < 0.05) genetic variants associated with cancer. The marginal effects of most of these variants have been small (odds ratios: 1.1-1.4). There remain unanswered questions on how best to incorporate the joint effects of genes and environment, including gene-environment (G × E) interactions, into epidemiologic studies of cancer. To help address these questions, and to better inform research priorities and allocation of resources, the National Cancer Institute sponsored a "Gene-Environment Think Tank" on January 10-11, 2012. The objective of the Think Tank was to facilitate discussions on (1) the state of the science, (2) the goals of G × E interaction studies in cancer epidemiology, and (3) opportunities for developing novel study designs and analysis tools. This report summarizes the Think Tank discussion, with a focus on contemporary approaches to the analysis of G × E interactions. Selecting the appropriate methods requires first identifying the relevant scientific question and rationale, with an important distinction made between analyses aiming to characterize the joint effects of putative or established genetic and environmental factors and analyses aiming to discover novel risk factors or novel interaction effects. Other discussion items include measurement error, statistical power, significance, and replication. Additional designs, exposure assessments, and analytical approaches need to be considered as we move from the current small number of success stories to a fuller understanding of the interplay of genetic and environmental factors.
Collapse
Affiliation(s)
- Carolyn M Hutter
- Epidemiology and Genomics Research Program, Division of Cancer Control and Population Sciences, National Cancer Institute, National Institutes of Health, Bethesda, Maryland, United States of America
| | | | | | | | | |
Collapse
|
32
|
Winham SJ, Biernacka JM. Gene-environment interactions in genome-wide association studies: current approaches and new directions. J Child Psychol Psychiatry 2013; 54:1120-34. [PMID: 23808649 PMCID: PMC3829379 DOI: 10.1111/jcpp.12114] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 05/03/2013] [Indexed: 01/20/2023]
Abstract
BACKGROUND Complex psychiatric traits have long been thought to be the result of a combination of genetic and environmental factors, and gene-environment interactions are thought to play a crucial role in behavioral phenotypes and the susceptibility and progression of psychiatric disorders. Candidate gene studies to investigate hypothesized gene-environment interactions are now fairly common in human genetic research, and with the shift toward genome-wide association studies, genome-wide gene-environment interaction studies are beginning to emerge. METHODS We summarize the basic ideas behind gene-environment interaction, and provide an overview of possible study designs and traditional analysis methods in the context of genome-wide analysis. We then discuss novel approaches beyond the traditional strategy of analyzing the interaction between the environmental factor and each polymorphism individually. RESULTS Two-step filtering approaches that reduce the number of polymorphisms tested for interactions can substantially increase the power of genome-wide gene-environment studies. New analytical methods including data-mining approaches, and gene-level and pathway-level analyses, also have the capacity to improve our understanding of how complex genetic and environmental factors interact to influence psychologic and psychiatric traits. Such methods, however, have not yet been utilized much in behavioral and mental health research. CONCLUSIONS Although methods to investigate gene-environment interactions are available, there is a need for further development and extension of these methods to identify gene-environment interactions in the context of genome-wide association studies. These novel approaches need to be applied in studies of psychology and psychiatry.
Collapse
Affiliation(s)
- Stacey J Winham
- Division of Biomedical Statistics and Informatics, Department of Health Sciences Research, Mayo Clinic, Rochester MN 55905
| | - Joanna M. Biernacka
- Division of Biomedical Statistics and Informatics, Department of Health Sciences Research, Mayo Clinic, Rochester MN 55905,Department of Psychiatry and Psychology, Mayo Clinic, Rochester MN 55905
| |
Collapse
|
33
|
Vanderweele TJ, Ko YA, Mukherjee B. Environmental confounding in gene-environment interaction studies. Am J Epidemiol 2013; 178:144-52. [PMID: 23821317 DOI: 10.1093/aje/kws439] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
We show that, in the presence of uncontrolled environmental confounding, joint tests for the presence of a main genetic effect and gene-environment interaction will be biased if the genetic and environmental factors are correlated, even if there is no effect of either the genetic factor or the environmental factor on the disease. When environmental confounding is ignored, such tests will in fact reject the joint null of no genetic effect with a probability that tends to 1 as the sample size increases. This problem with the joint test vanishes under gene-environment independence, but it still persists if estimating the gene-environment interaction parameter itself is of interest. Uncontrolled environmental confounding will bias estimates of gene-environment interaction parameters even under gene-environment independence, but it will not do so if the unmeasured confounding variable itself does not interact with the genetic factor. Under gene-environment independence, if the interaction parameter without controlling for the environmental confounder is nonzero, then there is gene-environment interaction either between the genetic factor and the environmental factor of interest or between the genetic factor and the unmeasured environmental confounder. We evaluate several recently proposed joint tests in a simulation study and discuss the implications of these results for the conduct of gene-environment interaction studies.
Collapse
|
34
|
Dai JY, Li SS, Gilbert PB. Case-only method for cause-specific hazards models with application to assessing differential vaccine efficacy by viral and host genetics. Biostatistics 2013; 15:196-203. [PMID: 23813283 DOI: 10.1093/biostatistics/kxt018] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Cause-specific proportional hazards models are commonly used for analyzing competing risks data in clinical studies. Motivated by the objective to assess differential vaccine protection against distinct pathogen types in randomized preventive vaccine efficacy trials, we present an alternative case-only method to standard maximum partial likelihood estimation that applies to a rare failure event, e.g. acquisition of HIV infection. A logistic regression model is fit to the counts of cause-specific events (infecting pathogen type) within study arms, with an offset adjusting for the randomization ratio. This formulation of cause-specific hazard ratio estimation permits immediate incorporation of host-genetic factors to be assessed as effect modifiers, an important area of vaccine research for identifying immune correlates of protection, thus inheriting the estimation efficiency, and cost benefits of the case-only estimator commonly used for assessing gene-treatment interactions. The method is used to reassess HIV genotype-specific vaccine efficacy in the RV144 trial, providing nearly identical results to standard Cox methods, and to assess if and how this vaccine efficacy depends on Fc-γ receptor genes.
Collapse
Affiliation(s)
- James Y Dai
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | | | | |
Collapse
|
35
|
Kauffmann F, Demenais F. Gene-environment interactions in asthma and allergic diseases: challenges and perspectives. J Allergy Clin Immunol 2013. [PMID: 23195523 DOI: 10.1016/j.jaci.2012.10.038] [Citation(s) in RCA: 69] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
The concept of gene-environment (GxE) interactions has dramatically evolved in the last century and has now become a central theme in studies that assess the causes of human disease. Despite the numerous efforts to discover genes associated in asthma and allergy through various approaches, including the recent genome-wide association studies, investigation of GxE interactions has been mainly limited to candidate genes, candidate environmental exposures, or both. This review discusses the various strategies from hypothesis-driven strategies to the full agnostic search of GxE interactions with an illustration from recently published articles. Challenges raised by each piece of the puzzle (ie, phenotype, environment, gene, and analysis of GxE interaction) are put forward, and tentative solutions are proposed. New perspectives to integrate various types of data generated by new sequencing technologies and to progress toward a systems biology approach of disease are outlined. The future of a molecular network-based approach of disease to which GxE interactions are related requires space for innovative and multidisciplinary research. Assembling the various parts of a puzzle in a complex system could well occur in a way that might not necessarily follow the rules of logic.
Collapse
Affiliation(s)
- Francine Kauffmann
- INSERM, CESP Centre for research in Epidemiology and Population Health, U1018, Respiratory and Environmental Epidemiology Team, Villejuif, France
| | | |
Collapse
|
36
|
Dai JY, Kooperberg C, Leblanc M, Prentice RL. Two-stage testing procedures with independent filtering for genome-wide gene-environment interaction. Biometrika 2012; 99:929-944. [PMID: 23843674 DOI: 10.1093/biomet/ass044] [Citation(s) in RCA: 76] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Several two-stage multiple testing procedures have been proposed to detect gene-environment interaction in genome-wide association studies. In this article, we elucidate general conditions that are required for validity and power of these procedures, and we propose extensions of two-stage procedures using the case-only estimator of gene-treatment interaction in randomized clinical trials. We develop a unified estimating equation approach to proving asymptotic independence between a filtering statistic and an interaction test statistic in a range of situations, including marginal association and interaction in a generalized linear model with a canonical link. We assess the performance of various two-stage procedures in simulations and in genetic studies from Women's Health Initiative clinical trials.
Collapse
Affiliation(s)
- James Y Dai
- Public Health Science Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, U.S.A. ,
| | | | | | | |
Collapse
|
37
|
Huang Y, Ballinger DG, Stokowski R, Beilharz E, Robinson JG, Liu S, Robinson RD, Henderson VW, Rossouw JE, Prentice RL. Exploring the interaction between SNP genotype and postmenopausal hormone therapy effects on stroke risk. Genome Med 2012; 4:57. [PMID: 22794791 PMCID: PMC3580413 DOI: 10.1186/gm358] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2012] [Revised: 06/27/2012] [Accepted: 07/13/2012] [Indexed: 12/20/2022] Open
Abstract
BACKGROUND Genome-wide association studies have identified several genomic regions that are associated with stroke risk, but these provide an explanation for only a small fraction of familial stroke aggregation. Genotype by environment interactions may contribute further to such an explanation. The Women's Health Initiative (WHI) clinical trial found increased stroke risk with postmenopausal hormone therapy (HT) and provides an efficient setting for evaluating genotype-HT interaction on stroke risk. METHODS We examined HT by genotype interactions for 392 SNPs selected from candidate gene studies, and 2,371 SNPs associated with changes in blood protein concentrations after hormone therapy, in analyses that included 2,045 postmenopausal women who developed stroke during WHI clinical trial and observational study follow-up and one-to-one matched controls. A two-stage procedure was implemented where SNPs passing the first stage screening based on marginal association with stroke risk were tested in the second stage for interaction with HT using case-only analysis. RESULTS The two-stage procedure identified two SNPs, rs2154299 and rs12194855, in the coagulation factor XIII subunit A (F13A1) region and two SNPs, rs630431 and rs560892, in the proprotein convertase subtilisin kexin 9 (PCSK9) region, with an estimated false discovery rate <0.05 based on interaction tests. Further analyses showed significant stroke risk interaction between these F13A1 SNPs and estrogen plus progestin (E+P) treatment for ischemic stroke and for ischemic and hemorrhagic stroke combined, and suggested interactions between PCSK9 SNPs with either E+P or estrogen-alone treatment. CONCLUSIONS Genotype by environment interaction information may help to define genomic regions relevant to stroke risk. Two-stage analysis among postmenopausal women generates novel hypotheses concerning the F13A1 and PCSK9 genomic regions and the effects of hormonal exposures on postmenopausal stroke risk for subsequent independent validation.
Collapse
Affiliation(s)
- Ying Huang
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave N, Seattle, WA 98109, USA
| | - Dennis G Ballinger
- Complete Genomics, Inc, 2071 Stierlin Court, Mountain View, CA 94043, USA
| | - Renee Stokowski
- Aria Diagnostics, 5945 Optical Court, San Jose, CA 95138, USA
| | - Erica Beilharz
- Complete Genomics, Inc, 2071 Stierlin Court, Mountain View, CA 94043, USA
| | - Jennifer G Robinson
- Departments of Epidemiology and Medicine, University of Iowa, 200 Medicine Admin. Bldg, Iowa City, IA 52242, USA
| | - Simin Liu
- School of Public Health, University of California at Los Angeles, 650 Charles E Young Dr. South, Los Angeles, CA 90095, USA
| | - Randal D Robinson
- School of Medicine, UT Heath Science Center, 7703 Floyd Curl Drive, San Antonio, TX 78229, USA
| | - Victor W Henderson
- Health Research & Policy (Epidemiology) and Neurology & Neurological Sciences, Stanford School of Medicine, 450 Serra Mall, Stanford, CA 94305, USA
| | - Jacques E Rossouw
- Women's Health Initiative Project Office, National Heart, Lung, and Blood Institute, 6701 Rockledge Drive, Bethesda, MD 20892, USA
| | - Ross L Prentice
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave N, Seattle, WA 98109, USA
| |
Collapse
|