1
|
Herrera-Luis E, Benke K, Volk H, Ladd-Acosta C, Wojcik GL. Gene-environment interactions in human health. Nat Rev Genet 2024; 25:768-784. [PMID: 38806721 DOI: 10.1038/s41576-024-00731-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/03/2024] [Indexed: 05/30/2024]
Abstract
Gene-environment interactions (G × E), the interplay of genetic variation with environmental factors, have a pivotal impact on human complex traits and diseases. Statistically, G × E can be assessed by determining the deviation from expectation of predictive models based solely on the phenotypic effects of genetics or environmental exposures. Despite the unprecedented, widespread and diverse use of G × E analytical frameworks, heterogeneity in their application and reporting hinders their applicability in public health. In this Review, we discuss study design considerations as well as G × E analytical frameworks to assess polygenic liability dependent on the environment, to identify specific genetic variants exhibiting G × E, and to characterize environmental context for these dynamics. We conclude with recommendations to address the most common challenges and pitfalls in the conceptualization, methodology and reporting of G × E studies, as well as future directions.
Collapse
Affiliation(s)
- Esther Herrera-Luis
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Kelly Benke
- Department of Mental Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Heather Volk
- Department of Mental Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Christine Ladd-Acosta
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Genevieve L Wojcik
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA.
| |
Collapse
|
2
|
Yang S, Cai H, Ye K. Phenome-wide association study of population-differentiating genetic variants around gene ACSL1. Evol Med Public Health 2024; 12:178-190. [PMID: 39386187 PMCID: PMC11462608 DOI: 10.1093/emph/eoae024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2024] [Revised: 09/08/2024] [Indexed: 10/12/2024] Open
Abstract
Background and objectives Demographic dynamics and natural selection during human evolution shaped the present-day patterns of genetic variations, and geographically varying genetic factors contribute to different disease prevalences across human populations. This study aims to evaluate the presence of positive selection on the gene encoding long-chain fatty acyl-CoA synthetase 1 (ACSL1) and the phenotypic impacts of population-differentiating genetic variants around this gene. Methodology Three types of statistical tests for positive selection, based on site frequency spectrum, extended haplotype homozygosity and population differentiation, were applied to the whole-genome sequencing data from the 1000 Genomes Project. A phenome-wide association study of ACSL1 was performed with published genome-wide association studies (GWAS) and transcriptome-wide association studies, including phenome-wide studies in biobanks. Results Genetic variants associated with ACSL1 expression in various tissues exhibit geographically varying allele frequencies. Three types of statistical tests consistently supported the presence of positive selection on the coding and regulatory regions of ACSL1 in African, European, South Asian and East Asian populations. A phenome-wide association study of ACSL1 revealed associations with type 2 diabetes, blood glucose, age at menopause, mean platelet volume and mean reticulocyte volume. The top allele associated with lower diabetes risk has the highest frequency in European populations, whereas the top allele associated with later menopause has the highest frequency in African populations. Conclusions and implications Positive selection on ACSL1 resulted in geographically varying genetic variants, which may contribute to differential phenotypes across human populations, including type 2 diabetes and age at menopause.
Collapse
Affiliation(s)
- Shuang Yang
- Department of Genetics, Franklin College of Arts and Sciences, University of Georgia, Athens, GA 30602, USA
| | - Houjian Cai
- Department of Pharmaceutical and Biomedical Sciences, College of Pharmacy, University of Georgia, Athens, GA 30602, USA
| | - Kaixiong Ye
- Department of Genetics, Franklin College of Arts and Sciences, University of Georgia, Athens, GA 30602, USA
- Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
| |
Collapse
|
3
|
Kontou PI, Bagos PG. The goldmine of GWAS summary statistics: a systematic review of methods and tools. BioData Min 2024; 17:31. [PMID: 39238044 PMCID: PMC11375927 DOI: 10.1186/s13040-024-00385-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2024] [Accepted: 08/27/2024] [Indexed: 09/07/2024] Open
Abstract
Genome-wide association studies (GWAS) have revolutionized our understanding of the genetic architecture of complex traits and diseases. GWAS summary statistics have become essential tools for various genetic analyses, including meta-analysis, fine-mapping, and risk prediction. However, the increasing number of GWAS summary statistics and the diversity of software tools available for their analysis can make it challenging for researchers to select the most appropriate tools for their specific needs. This systematic review aims to provide a comprehensive overview of the currently available software tools and databases for GWAS summary statistics analysis. We conducted a comprehensive literature search to identify relevant software tools and databases. We categorized the tools and databases by their functionality, including data management, quality control, single-trait analysis, and multiple-trait analysis. We also compared the tools and databases based on their features, limitations, and user-friendliness. Our review identified a total of 305 functioning software tools and databases dedicated to GWAS summary statistics, each with unique strengths and limitations. We provide descriptions of the key features of each tool and database, including their input/output formats, data types, and computational requirements. We also discuss the overall usability and applicability of each tool for different research scenarios. This comprehensive review will serve as a valuable resource for researchers who are interested in using GWAS summary statistics to investigate the genetic basis of complex traits and diseases. By providing a detailed overview of the available tools and databases, we aim to facilitate informed tool selection and maximize the effectiveness of GWAS summary statistics analysis.
Collapse
Affiliation(s)
| | - Pantelis G Bagos
- Department of Computer Science and Biomedical Informatics, University of Thessaly, 35131, Lamia, Greece.
| |
Collapse
|
4
|
Yamamoto Y, Shirai Y, Edahiro R, Kumanogoh A, Okada Y. Large-scale cross-trait genetic analysis highlights shared genetic backgrounds of autoimmune diseases. Immunol Med 2024:1-10. [PMID: 39171621 DOI: 10.1080/25785826.2024.2394258] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2024] [Accepted: 08/15/2024] [Indexed: 08/23/2024] Open
Abstract
Disorders associated with the immune system burden multiple organs, although the shared biology exists across the diseases. Preceding family-based studies reveal that immune diseases are heritable to varying degrees, providing the basis for immunogenomics. The recent cost reduction in genetic analysis intensively promotes biobank-scale studies and the development of frameworks for statistical genetics. The accumulating multi-layer omics data, including genome-wide association studies (GWAS) and RNA-sequencing at single-cell resolution, enable us to dissect the genetic backgrounds of immune-related disorders. Although autoimmune and allergic diseases are generally categorized into different disease categories, epidemiological studies reveal the high incidence of autoimmune and allergic disease complications, suggesting the shared genetics and biology between the disease categories. Biobank resources and consortia cover multiple immune-related disorders to accumulate phenome-wide associations of genetic variants and enhance researchers to analyze the shared and heterogeneous genetic backgrounds. The emerging post-GWAS and integrative multi-omics analyses provide genetic and biological insights into the multicategorical disease associations.
Collapse
Affiliation(s)
- Yuji Yamamoto
- Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita, Japan
- Department of Respiratory Medicine and Clinical Immunology, Osaka University Graduate School of Medicine, Suita, Japan
| | - Yuya Shirai
- Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita, Japan
- Department of Respiratory Medicine and Clinical Immunology, Osaka University Graduate School of Medicine, Suita, Japan
- Laboratory of Statistical Immunology, Immunology Frontier Research Center (WPI-IFReC), Osaka University, Suita, Japan
- RIKEN Center for Integrative Medical Sciences, Laboratory for Systems Genetics, Yokohama, Japan
| | - Ryuya Edahiro
- Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita, Japan
- Department of Respiratory Medicine and Clinical Immunology, Osaka University Graduate School of Medicine, Suita, Japan
- RIKEN Center for Integrative Medical Sciences, Laboratory for Systems Genetics, Yokohama, Japan
| | - Atsushi Kumanogoh
- Department of Respiratory Medicine and Clinical Immunology, Osaka University Graduate School of Medicine, Suita, Japan
- Department of Immunopathology, Immunology Frontier Research Center (WPI-IFReC), Osaka University, Suita, Japan
- Integrated Frontier Research for Medical Science Division, Institute for Open and Transdisciplinary Research Initiatives (OTRI), Osaka University, Suita, Japan
- Center for Infectious Diseases for Education and Research (CiDER), Osaka University, Suita, Japan
- Japan Agency for Medical Research and Development, Core Research for Evolutional Science and Technology (AMED-CREST), Tokyo, Japan
- Center for Advanced Modalities and DDS (CAMaD), Osaka University, Suita, Japan
| | - Yukinori Okada
- Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita, Japan
- Laboratory of Statistical Immunology, Immunology Frontier Research Center (WPI-IFReC), Osaka University, Suita, Japan
- RIKEN Center for Integrative Medical Sciences, Laboratory for Systems Genetics, Yokohama, Japan
- Center for Infectious Diseases for Education and Research (CiDER), Osaka University, Suita, Japan
- Center for Advanced Modalities and DDS (CAMaD), Osaka University, Suita, Japan
- Department of Genome Informatics, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
- Premium Research Institute for Human Metaverse Medicine (WPI-PRIMe), Osaka University, Suita, Japan
| |
Collapse
|
5
|
Kachuri L, Chatterjee N, Hirbo J, Schaid DJ, Martin I, Kullo IJ, Kenny EE, Pasaniuc B, Witte JS, Ge T. Principles and methods for transferring polygenic risk scores across global populations. Nat Rev Genet 2024; 25:8-25. [PMID: 37620596 PMCID: PMC10961971 DOI: 10.1038/s41576-023-00637-2] [Citation(s) in RCA: 50] [Impact Index Per Article: 50.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/11/2023] [Indexed: 08/26/2023]
Abstract
Polygenic risk scores (PRSs) summarize the genetic predisposition of a complex human trait or disease and may become a valuable tool for advancing precision medicine. However, PRSs that are developed in populations of predominantly European genetic ancestries can increase health disparities due to poor predictive performance in individuals of diverse and complex genetic ancestries. We describe genetic and modifiable risk factors that limit the transferability of PRSs across populations and review the strengths and weaknesses of existing PRS construction methods for diverse ancestries. Developing PRSs that benefit global populations in research and clinical settings provides an opportunity for innovation and is essential for health equity.
Collapse
Affiliation(s)
- Linda Kachuri
- Department of Epidemiology and Population Health, Stanford University School of Medicine, Stanford, CA, USA
- Stanford Cancer Institute, Stanford University School of Medicine, Stanford, CA, USA
| | - Nilanjan Chatterjee
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Jibril Hirbo
- Department of Medicine Division of Genetic Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Daniel J Schaid
- Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN, USA
| | - Iman Martin
- Division of Genomic Medicine, National Human Genome Research Institute, Bethesda, MD, USA
| | - Iftikhar J Kullo
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN, USA
| | - Eimear E Kenny
- Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Bogdan Pasaniuc
- Department of Computational Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
- Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
| | - John S Witte
- Department of Epidemiology and Population Health, Stanford University School of Medicine, Stanford, CA, USA.
- Stanford Cancer Institute, Stanford University School of Medicine, Stanford, CA, USA.
- Department of Biomedical Data Science, Stanford University, Stanford, CA, USA.
- Department of Genetics, Stanford University, Stanford, CA, USA.
| | - Tian Ge
- Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA.
- Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| |
Collapse
|
6
|
Pham DT, Westerman KE, Pan C, Chen L, Srinivasan S, Isganaitis E, Vajravelu ME, Bacha F, Chernausek S, Gubitosi-Klug R, Divers J, Pihoker C, Marcovina SM, Manning AK, Chen H. Re-analysis and meta-analysis of summary statistics from gene-environment interaction studies. Bioinformatics 2023; 39:btad730. [PMID: 38039147 PMCID: PMC10724851 DOI: 10.1093/bioinformatics/btad730] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Revised: 10/26/2023] [Accepted: 11/30/2023] [Indexed: 12/03/2023] Open
Abstract
MOTIVATION statistics from genome-wide association studies enable many valuable downstream analyses that are more efficient than individual-level data analysis while also reducing privacy concerns. As growing sample sizes enable better-powered analysis of gene-environment interactions, there is a need for gene-environment interaction-specific methods that manipulate and use summary statistics. RESULTS We introduce two tools to facilitate such analysis, with a focus on statistical models containing multiple gene-exposure and/or gene-covariate interaction terms. REGEM (RE-analysis of GEM summary statistics) uses summary statistics from a single, multi-exposure genome-wide interaction study to derive analogous sets of summary statistics with arbitrary sets of exposures and interaction covariate adjustments. METAGEM (META-analysis of GEM summary statistics) extends current fixed-effects meta-analysis models to incorporate multiple exposures from multiple studies. We demonstrate the value and efficiency of these tools by exploring alternative methods of accounting for ancestry-related population stratification in genome-wide interaction study in the UK Biobank as well as by conducting a multi-exposure genome-wide interaction study meta-analysis in cohorts from the diabetes-focused ProDiGY consortium. These programs help to maximize the value of summary statistics from diverse and complex gene-environment interaction studies. AVAILABILITY AND IMPLEMENTATION REGEM and METAGEM are open-source projects freely available at https://github.com/large-scale-gxe-methods/REGEM and https://github.com/large-scale-gxe-methods/METAGEM.
Collapse
Affiliation(s)
- Duy T Pham
- Human Genetics Center, Department of Epidemiology, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX 77030, United States
| | - Kenneth E Westerman
- Department of Medicine, Clinical and Translational Epidemiology Unit, Mongan Institute, Massachusetts General Hospital, Boston, MA 02114, United States
- Metabolism Program, Broad Institute of MIT and Harvard, Cambridge, MA 02142, United States
- Department of Medicine, Harvard Medical School, Boston, MA 02115, United States
| | - Cong Pan
- Human Genetics Center, Department of Epidemiology, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX 77030, United States
| | - Ling Chen
- Department of Medicine, Clinical and Translational Epidemiology Unit, Mongan Institute, Massachusetts General Hospital, Boston, MA 02114, United States
- Metabolism Program, Broad Institute of MIT and Harvard, Cambridge, MA 02142, United States
| | - Shylaja Srinivasan
- Department of Pediatrics, University of California, San Francisco, CA 94158, United States
| | - Elvira Isganaitis
- Research Division, Joslin Diabetes Center, Boston, MA 02115, United States
| | - Mary Ellen Vajravelu
- Department of Pediatrics, University of Pittsburgh School of Medicine, Pittsburgh, PA 15224, United States
| | - Fida Bacha
- Department of Pediatrics, Baylor College of Medicine, Houston, TX 77030, United States
| | - Steve Chernausek
- Department of Pediatrics, The University of Oklahoma College of Medicine, Oklahoma City, OK 73117, United States
| | - Rose Gubitosi-Klug
- Department of Pediatrics, Case Western Reserve University, Cleveland, OH 44106, United States
| | - Jasmin Divers
- Department of Foundations of Medicine, New York University, New York, NY 10016, United States
| | - Catherine Pihoker
- Department of Pediatrics, University of Washington School of Medicine, Seattle, WA 98105, United States
| | - Santica M Marcovina
- Northwest Lipid Metabolism and Diabetes Research Laboratories, Department of Medicine, University of Washington, Seattle, WA 98105, United States
| | - Alisa K Manning
- Department of Medicine, Clinical and Translational Epidemiology Unit, Mongan Institute, Massachusetts General Hospital, Boston, MA 02114, United States
- Metabolism Program, Broad Institute of MIT and Harvard, Cambridge, MA 02142, United States
- Department of Medicine, Harvard Medical School, Boston, MA 02115, United States
| | - Han Chen
- Human Genetics Center, Department of Epidemiology, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX 77030, United States
| |
Collapse
|