1
|
Kontou PI, Bagos PG. The goldmine of GWAS summary statistics: a systematic review of methods and tools. BioData Min 2024; 17:31. [PMID: 39238044 PMCID: PMC11375927 DOI: 10.1186/s13040-024-00385-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2024] [Accepted: 08/27/2024] [Indexed: 09/07/2024] Open
Abstract
Genome-wide association studies (GWAS) have revolutionized our understanding of the genetic architecture of complex traits and diseases. GWAS summary statistics have become essential tools for various genetic analyses, including meta-analysis, fine-mapping, and risk prediction. However, the increasing number of GWAS summary statistics and the diversity of software tools available for their analysis can make it challenging for researchers to select the most appropriate tools for their specific needs. This systematic review aims to provide a comprehensive overview of the currently available software tools and databases for GWAS summary statistics analysis. We conducted a comprehensive literature search to identify relevant software tools and databases. We categorized the tools and databases by their functionality, including data management, quality control, single-trait analysis, and multiple-trait analysis. We also compared the tools and databases based on their features, limitations, and user-friendliness. Our review identified a total of 305 functioning software tools and databases dedicated to GWAS summary statistics, each with unique strengths and limitations. We provide descriptions of the key features of each tool and database, including their input/output formats, data types, and computational requirements. We also discuss the overall usability and applicability of each tool for different research scenarios. This comprehensive review will serve as a valuable resource for researchers who are interested in using GWAS summary statistics to investigate the genetic basis of complex traits and diseases. By providing a detailed overview of the available tools and databases, we aim to facilitate informed tool selection and maximize the effectiveness of GWAS summary statistics analysis.
Collapse
Affiliation(s)
| | - Pantelis G Bagos
- Department of Computer Science and Biomedical Informatics, University of Thessaly, 35131, Lamia, Greece.
| |
Collapse
|
2
|
Lorincz-Comi N, Yang Y, Li G, Zhu X. MRBEE: A bias-corrected multivariable Mendelian randomization method. HGG ADVANCES 2024; 5:100290. [PMID: 38582968 PMCID: PMC11053334 DOI: 10.1016/j.xhgg.2024.100290] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Revised: 04/01/2024] [Accepted: 04/01/2024] [Indexed: 04/08/2024] Open
Abstract
Mendelian randomization (MR) is an instrumental variable approach used to infer causal relationships between exposures and outcomes, which is becoming increasingly popular because of its ability to handle summary statistics from genome-wide association studies. However, existing MR approaches often suffer the bias from weak instrumental variables, horizontal pleiotropy and sample overlap. We introduce MRBEE (MR using bias-corrected estimating equation), a multivariable MR method capable of simultaneously removing weak instrument and sample overlap bias and identifying horizontal pleiotropy. Our extensive simulations and real data analyses reveal that MRBEE provides nearly unbiased estimates of causal effects, well-controlled type I error rates and higher power than comparably robust methods and is computationally efficient. Our real data analyses result in consistent causal effect estimates and offer valuable guidance for conducting multivariable MR studies, elucidating the roles of pleiotropy, and identifying total 42 horizontal pleiotropic loci missed previously that are associated with myopia, schizophrenia, and coronary artery disease.
Collapse
Affiliation(s)
- Noah Lorincz-Comi
- Department of Population and Quantitative Health Sciences, School of Medicine, Case Western Reserve University, Cleveland, OH, USA
| | - Yihe Yang
- Department of Population and Quantitative Health Sciences, School of Medicine, Case Western Reserve University, Cleveland, OH, USA
| | - Gen Li
- Department of Population and Quantitative Health Sciences, School of Medicine, Case Western Reserve University, Cleveland, OH, USA
| | - Xiaofeng Zhu
- Department of Population and Quantitative Health Sciences, School of Medicine, Case Western Reserve University, Cleveland, OH, USA.
| |
Collapse
|
3
|
Jin YJ, Wu XY, An ZY. The Application of Mendelian Randomization in Cardiovascular Disease Risk Prediction: Current Status and Future Prospects. Rev Cardiovasc Med 2024; 25:262. [PMID: 39139440 PMCID: PMC11317336 DOI: 10.31083/j.rcm2507262] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2023] [Revised: 03/05/2024] [Accepted: 03/11/2024] [Indexed: 08/15/2024] Open
Abstract
Cardiovascular disease (CVD), a leading cause of death and disability worldwide, and is associated with a wide range of risk factors, and genetically associated conditions. While many CVDs are preventable and early detection alongside treatment can significantly mitigate complication risks, current prediction models for CVDs need enhancements for better accuracy. Mendelian randomization (MR) offers a novel approach for estimating the causal relationship between exposure and outcome by using genetic variation in quasi-experimental data. This method minimizes the impact of confounding variables by leveraging the random allocation of genes during gamete formation, thereby facilitating the integration of new predictors into risk prediction models to refine the accuracy of prediction. In this review, we delve into the theory behind MR, as well as the strengths, applications, and limitations behind this emerging technology. A particular focus will be placed on MR application to CVD, and integration into CVD prediction frameworks. We conclude by discussing the inclusion of various populations and by offering insights into potential areas for future research and refinement.
Collapse
Affiliation(s)
- Yi-Jing Jin
- Peking University Health Science Center, 100191 Beijing, China
- Department of Cardiology, Peking University First Hospital, 100034
Beijing, China
| | - Xing-Yuan Wu
- Peking University Health Science Center, 100191 Beijing, China
| | - Zhuo-Yu An
- Peking University Health Science Center, 100191 Beijing, China
- Peking University Institute of Hematology, Peking University People's
Hospital, 100044 Beijing, China
| |
Collapse
|
4
|
Wang H, Reid BM, Richmond RC, Lane JM, Saxena R, Gonzalez BD, Fridley BL, Redline S, Tworoger SS, Wang X. Impact of insomnia on ovarian cancer risk and survival: a Mendelian randomization study. EBioMedicine 2024; 104:105175. [PMID: 38823087 PMCID: PMC11169961 DOI: 10.1016/j.ebiom.2024.105175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Revised: 05/14/2024] [Accepted: 05/16/2024] [Indexed: 06/03/2024] Open
Abstract
BACKGROUND Insomnia is the most common sleep disorder in patients with epithelial ovarian cancer (EOC). We investigated the causal association between genetically predicted insomnia and EOC risk and survival through a two-sample Mendelian randomization (MR) study. METHODS Insomnia was proxied using genetic variants identified in a genome-wide association study (GWAS) meta-analysis of UK Biobank and 23andMe. Using genetic associations with EOC risk and overall survival from the Ovarian Cancer Association Consortium (OCAC) GWAS in 66,450 women (over 11,000 cases with clinical follow-up), we performed Iterative Mendelian Randomization and Pleiotropy (IMRP) analysis followed by a set of sensitivity analyses. Genetic associations with survival and response to treatment in ovarian cancer study of The Cancer Genome Atlas (TCGA) were estimated controlling for chemotherapy and clinical factors. FINDINGS Insomnia was associated with higher risk of endometrioid EOC (OR = 1.60, 95% CI 1.05-2.45) and lower risk of high-grade serous EOC (HGSOC) and clear cell EOC (OR = 0.79 and 0.48, 95% CI 0.63-1.00 and 0.27-0.86, respectively). In survival analysis, insomnia was associated with shorter survival of invasive EOC (OR = 1.45, 95% CI 1.13-1.87) and HGSOC (OR = 1.4, 95% CI 1.04-1.89), which was attenuated after adjustment for body mass index and reproductive age. Insomnia was associated with reduced survival in TCGA HGSOC cases who received standard chemotherapy (OR = 2.48, 95% CI 1.13-5.42), but was attenuated after adjustment for clinical factors. INTERPRETATION This study supports the impact of insomnia on EOC risk and survival, suggesting treatments targeting insomnia could be pivotal for prevention and improving patient survival. FUNDING National Institutes of Health, National Cancer Institute. Full funding details are provided in acknowledgments.
Collapse
Affiliation(s)
- Heming Wang
- Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA, USA; Department of Medicine, Harvard Medical School, Brigham and Women's Hospital, Boston, MA, USA
| | - Brett M Reid
- Department of Cancer Epidemiology, Moffitt Cancer Center, Tampa, FL, USA
| | - Rebecca C Richmond
- Medical Research Council Integrative Epidemiology Unit, University of Bristol, Bristol, UK; Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK; NIHR Oxford Health Biomedical Research Centre, University of Oxford, Oxford, UK
| | - Jacqueline M Lane
- Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA, USA; Department of Medicine, Harvard Medical School, Brigham and Women's Hospital, Boston, MA, USA
| | - Richa Saxena
- Center for Genomic Medicine, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA; Department of Anesthesia, Critical Care and Pain Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Brian D Gonzalez
- Department of Health Outcomes and Behavior, Moffitt Cancer Center, Tampa, FL, USA
| | - Brooke L Fridley
- Department of Biostatistics and Bioinformatics, Moffitt Cancer Center, Tampa, FL, USA; Children's Mercy Hospital, Kansas City, MO, USA
| | - Susan Redline
- Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA, USA; Department of Medicine, Harvard Medical School, Brigham and Women's Hospital, Boston, MA, USA
| | - Shelley S Tworoger
- Department of Cancer Epidemiology, Moffitt Cancer Center, Tampa, FL, USA; Knight Cancer Institute, Oregon Health & Science University, Portland, OR, USA
| | - Xuefeng Wang
- Department of Biostatistics and Bioinformatics, Moffitt Cancer Center, Tampa, FL, USA.
| |
Collapse
|
5
|
Liu YX, Yang WT, Li Y. Different effects of 24 dietary intakes on gastroesophageal reflux disease: A mendelian randomization. World J Clin Cases 2024; 12:2370-2381. [PMID: 38765751 PMCID: PMC11099402 DOI: 10.12998/wjcc.v12.i14.2370] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/15/2024] [Revised: 02/11/2024] [Accepted: 04/02/2024] [Indexed: 04/29/2024] Open
Abstract
BACKGROUND In observational studies, dietary intakes are associated with gastroesophageal reflux disease (GERD). AIM To conduct a two-sample mendelian randomization (MR) analysis to determine whether those associations are causal. METHODS To explore the relationship between dietary intake and the risk of GERD, we extracted appropriate single nucleotide polymorphisms from genome-wide association study data on 24 dietary intakes. Three methods were adopted for data analysis: Inverse variance weighting, weighted median methods, and MR-Egger's method. The odds ratio (OR) and 95% confidence interval (CI) were used to evaluate the causal association between dietary intake and GERD. RESULTS Our univariate Mendelian randomization (UVMR) results showed significant evidence that pork intake (OR, 2.83; 95%CI: 1.76-4.55; P = 1.84 × 10-5), beer intake (OR, 2.70, 95%CI: 2.00-3.64; P = 6.54 × 10-11), non-oily fish intake (OR, 2.41; 95%CI: 1.49-3.91; P = 3.59 × 10-4) have a protective effect on GERD. In addition, dried fruit intake (OR, 0.37; 95%CI: 0.27-0.50; 6.27 × 10-11), red wine intake (OR, 0.34; 95%CI: 0.25-0.47; P = 1.90 × 10-11), cheese intake (OR, 0.46; 95%CI: 0.39-0.55; P =3.73 × 10-19), bread intake (OR, 0.72; 95%CI: 0.56-0.92; P = 0.0009) and cereal intake (OR, 0.45; 95%CI: 0.36-0.57; P = 2.07 × 10-11) were negatively associated with the risk of GERD. There was a suggestive association for genetically predicted coffee intake (OR per one SD increase, 1.22, 95%CI: 1.03-1.44; P = 0.019). Multivariate Mendelian randomization further confirmed that dried fruit intake, red wine intake, cheese intake, and cereal intake directly affected GERD. In contrast, the impact of pork intake, beer intake, non-oily fish intake, and bread intake on GERD was partly driven by the common risk factors for GERD. However, after adjusting for all four elements, there was no longer a suggestive association between coffee intake and GERD. CONCLUSION This study provides MR evidence to support the causal relationship between a broad range of dietary intake and GERD, providing new insights for the treatment and prevention of GERD.
Collapse
Affiliation(s)
- Yu-Xin Liu
- Department of Oncology, Hospital of Chengdu University of Traditional Chinese Medicine, Chengdu 610072, Sichuan Province, China
| | - Wen-Tao Yang
- Department of Cardiovascular, Chengdu Integrated TCM & Western Medicine Hospital, Chengdu 610041, Sichuan Province, China
| | - Yang Li
- Department of Nuclear Medicine, Hospital of Chengdu University of Traditional Chinese Medicine, Chengdu 610072, Sichuan Province, China
| |
Collapse
|
6
|
Zhu X, Yang Y, Lorincz-Comi N, Li G, Bentley AR, de Vries PS, Brown M, Morrison AC, Rotimi CN, Gauderman WJ, Rao DC, Aschard H. An approach to identify gene-environment interactions and reveal new biological insight in complex traits. Nat Commun 2024; 15:3385. [PMID: 38649715 PMCID: PMC11035594 DOI: 10.1038/s41467-024-47806-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2023] [Accepted: 04/10/2024] [Indexed: 04/25/2024] Open
Abstract
There is a long-standing debate about the magnitude of the contribution of gene-environment interactions to phenotypic variations of complex traits owing to the low statistical power and few reported interactions to date. To address this issue, the Gene-Lifestyle Interactions Working Group within the Cohorts for Heart and Aging Research in Genetic Epidemiology Consortium has been spearheading efforts to investigate G × E in large and diverse samples through meta-analysis. Here, we present a powerful new approach to screen for interactions across the genome, an approach that shares substantial similarity to the Mendelian randomization framework. We identify and confirm 5 loci (6 independent signals) interacted with either cigarette smoking or alcohol consumption for serum lipids, and empirically demonstrate that interaction and mediation are the major contributors to genetic effect size heterogeneity across populations. The estimated lower bound of the interaction and environmentally mediated heritability is significant (P < 0.02) for low-density lipoprotein cholesterol and triglycerides in Cross-Population data. Our study improves the understanding of the genetic architecture and environmental contributions to complex traits.
Collapse
Affiliation(s)
- Xiaofeng Zhu
- Department of Population and Quantitative Health Sciences, School of Medicine, Case Western Reserve University, Cleveland, OH, USA.
| | - Yihe Yang
- Department of Population and Quantitative Health Sciences, School of Medicine, Case Western Reserve University, Cleveland, OH, USA
| | - Noah Lorincz-Comi
- Department of Population and Quantitative Health Sciences, School of Medicine, Case Western Reserve University, Cleveland, OH, USA
| | - Gen Li
- Department of Population and Quantitative Health Sciences, School of Medicine, Case Western Reserve University, Cleveland, OH, USA
| | - Amy R Bentley
- Center for Research on Genomics and Global Health, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Paul S de Vries
- Human Genetics Center, Department of Epidemiology, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Michael Brown
- Human Genetics Center, Department of Epidemiology, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Alanna C Morrison
- Human Genetics Center, Department of Epidemiology, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Charles N Rotimi
- Center for Research on Genomics and Global Health, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - W James Gauderman
- Division of Biostatistics, Department of Population and Public Health Sciences, University of Southern California, Los Angeles, CA, USA
| | - Dabeeru C Rao
- Center for Biostatistics and Data Science, Institute for Informatics, Data Science and Biostatistics, Washington University School of Medicine, St. Louis, MO, USA
| | - Hugues Aschard
- Institut Pasteur, Université Paris Cité, Department of Computational Biology, F-75015, Paris, France
- Program in Genetic Epidemiology and Statistical Genetics, Harvard T.H. Chan School of Public Health, Boston, MA, 02115, USA
| |
Collapse
|
7
|
Meng Z, Wang J, Lin L, Wu C. Sensitivity analysis with iterative outlier detection for systematic reviews and meta-analyses. Stat Med 2024; 43:1549-1563. [PMID: 38318993 PMCID: PMC10947935 DOI: 10.1002/sim.10008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2022] [Revised: 10/03/2023] [Accepted: 12/21/2023] [Indexed: 02/07/2024]
Abstract
Meta-analysis is a widely used tool for synthesizing results from multiple studies. The collected studies are deemed heterogeneous when they do not share a common underlying effect size; thus, the factors attributable to the heterogeneity need to be carefully considered. A critical problem in meta-analyses and systematic reviews is that outlying studies are frequently included, which can lead to invalid conclusions and affect the robustness of decision-making. Outliers may be caused by several factors such as study selection criteria, low study quality, small-study effects, and so on. Although outlier detection is well-studied in the statistical community, limited attention has been paid to meta-analysis. The conventional outlier detection method in meta-analysis is based on a leave-one-study-out procedure. However, when calculating a potentially outlying study's deviation, other outliers could substantially impact its result. This article proposes an iterative method to detect potential outliers, which reduces such an impact that could confound the detection. Furthermore, we adopt bagging to provide valid inference for sensitivity analyses of excluding outliers. Based on simulation studies, the proposed iterative method yields smaller bias and heterogeneity after performing a sensitivity analysis to remove the identified outliers. It also provides higher accuracy on outlier detection. Two case studies are used to illustrate the proposed method's real-world performance.
Collapse
Affiliation(s)
- Zhuo Meng
- Department of Statistics, College of Arts and Sciences, Florida State University, Tallahassee, FL, U.S.A
| | - Jingshen Wang
- Division of Biostatistics, School of Public Health, University of California, Berkeley, Berkeley, CA, U.S.A
| | - Lifeng Lin
- Department of Epidemiology and Biostatistics, Mel and Enid Zuckerman College of Public Health, The University of Arizona, Tucson, AZ, U.S.A
| | - Chong Wu
- Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, TX, U.S.A
| |
Collapse
|
8
|
Lorincz-Comi N, Yang Y, Zhu X. simmrd: An open-source tool to perform simulations in Mendelian randomization. Genet Epidemiol 2024; 48:59-73. [PMID: 38263619 DOI: 10.1002/gepi.22544] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2023] [Revised: 12/13/2023] [Accepted: 12/19/2023] [Indexed: 01/25/2024]
Abstract
Mendelian randomization (MR) has become a popular tool for inferring causality of risk factors on disease. There are currently over 45 different methods available to perform MR, reflecting this extremely active research area. It would be desirable to have a standard simulation environment to objectively evaluate the existing and future methods. We present simmrd, an open-source software for performing simulations to evaluate the performance of MR methods in a range of scenarios encountered in practice. Researchers can directly modify the simmrd source code so that the research community may arrive at a widely accepted framework for researchers to evaluate the performance of different MR methods.
Collapse
Affiliation(s)
- Noah Lorincz-Comi
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, Ohio, USA
| | - Yihe Yang
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, Ohio, USA
| | - Xiaofeng Zhu
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, Ohio, USA
| |
Collapse
|
9
|
Deng L, Fu S, Yu K. Bias and mean squared error in Mendelian randomization with invalid instrumental variables. Genet Epidemiol 2024; 48:27-41. [PMID: 37970963 DOI: 10.1002/gepi.22541] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Revised: 10/09/2023] [Accepted: 10/26/2023] [Indexed: 11/19/2023]
Abstract
Mendelian randomization (MR) is a statistical method that utilizes genetic variants as instrumental variables (IVs) to investigate causal relationships between risk factors and outcomes. Although MR has gained popularity in recent years due to its ability to analyze summary statistics from genome-wide association studies (GWAS), it requires a substantial number of single nucleotide polymorphisms (SNPs) as IVs to ensure sufficient power for detecting causal effects. Unfortunately, the complex genetic heritability of many traits can lead to the use of invalid IVs that affect both the risk factor and the outcome directly or through an unobserved confounder. This can result in biased and imprecise estimates, as reflected by a larger mean squared error (MSE). In this study, we focus on the widely used two-stage least squares (2SLS) method and derive formulas for its bias and MSE when estimating causal effects using invalid IVs. Using those formulas, we identify conditions under which the 2SLS estimate is unbiased and reveal how the independent or correlated pleiotropic effects influence the accuracy and precision of the 2SLS estimate. We validate these formulas through extensive simulation studies and demonstrate the application of those formulas in an MR study to evaluate the causal effect of the waist-to-hip ratio on various sleeping patterns. Our results can aid in designing future MR studies and serve as benchmarks for assessing more sophisticated MR methods.
Collapse
Affiliation(s)
- Lu Deng
- School of Statistics and Data Science, Nankai University, Tianjin, China
| | - Sheng Fu
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, Maryland, USA
| | - Kai Yu
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, Maryland, USA
| |
Collapse
|
10
|
Zhu X, Yang Y, Lorincz-Comi N, Li G, Bentley A, de Vries PS, Brown M, Morrison AC, Rotimi C, James Gauderman W, Rao DC, Aschard H. A new Approach to Identify Gene-Environment Interactions and Reveal New Biological Insight in Complex traits. RESEARCH SQUARE 2023:rs.3.rs-3338723. [PMID: 37886448 PMCID: PMC10602131 DOI: 10.21203/rs.3.rs-3338723/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/28/2023]
Abstract
There is a long-standing debate about the magnitude of the contribution of gene-environment interactions to phenotypic variations of complex traits owing to the low statistical power and few reported interactions to date. To address this issue, the CHARGE Gene-Lifestyle Interactions Working Group has been spearheading efforts to investigate G × E in large and diverse samples through meta-analysis. Here, we present a powerful new approach to screen for interactions across the genome, an approach that shares substantial similarity to the Mendelian randomization framework. We identified and confirmed 5 loci (6 independent signals) interacting with either cigarette smoking or alcohol consumption for serum lipids, and empirically demonstrated that interaction and mediation are the major contributors to genetic effect size heterogeneity across populations. The estimated lower bound of the interaction and environmentally mediated contribution ranges from 1.76% to 14.05% of SNP heritability of serum lipids in Cross-Population data. Our study improves the understanding of the genetic architecture and environmental contributions to complex traits.
Collapse
Affiliation(s)
- Xiaofeng Zhu
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, Ohio, USA
| | - Yihe Yang
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, Ohio, USA
| | - Noah Lorincz-Comi
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, Ohio, USA
| | - Gen Li
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, Ohio, USA
| | - Amy Bentley
- Center for Research on Genomics and Global Health, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland, USA
| | - Paul S de Vries
- Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, Texas, USA
| | - Michael Brown
- Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, Texas, USA
| | - Alanna C Morrison
- Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, Texas, USA
| | - Charles Rotimi
- Center for Research on Genomics and Global Health, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland, USA
| | - W. James Gauderman
- Biostatistics, Department of Preventive Medicine, University of Southern California, Los Angeles, CA, USA
| | - DC Rao
- Division of Biostatistics, Washington University School of Medicine, St. Louis, MO, USA
| | - Hugues Aschard
- Institut Pasteur, Université Paris Cité, Department of Computational Biology, F-75015 Paris, France
| |
Collapse
|
11
|
Lorincz-Comi N, Yang Y, Li G, Zhu X. MRBEE: A novel bias-corrected multivariable Mendelian Randomization method. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.10.523480. [PMID: 37066391 PMCID: PMC10103949 DOI: 10.1101/2023.01.10.523480] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Mendelian randomization (MR) is an instrumental variable approach used to infer causal relationships between exposures and outcomes and can apply to summary data from genome-wide association studies (GWAS). Since GWAS summary statistics are subject to estimation errors, most existing MR approaches suffer from measurement error bias, whose scale and direction are influenced by weak instrumental variables and GWAS sample overlap, respectively. We introduce MRBEE (MR using Bias-corrected Estimating Equation), a novel multivariable MR method capable of simultaneously removing measurement error bias and identifying horizontal pleiotropy. In simulations, we showed that MRBEE is capable of effectively removing measurement error bias in the presence of weak instrumental variables and sample overlap. In two independent real data analyses, we discovered that the causal effect of BMI on coronary artery disease risk is entirely mediated by blood pressure, and that existing MR methods may underestimate the causal effect of cannabis use disorder on schizophrenia risk compared to MRBEE. MRBEE possesses significant potential for advancing genetic research by providing a valuable tool to study causality between multiple risk factors and disease outcomes, particularly as a large number of GWAS summary statistics become publicly available.
Collapse
Affiliation(s)
- Noah Lorincz-Comi
- Department of Population and Quantitative Health Sciences, School of Medicine
- Case Western Reserve University, Cleveland, OH 44106, USA June 12, 2023
| | - Yihe Yang
- Department of Population and Quantitative Health Sciences, School of Medicine
- Case Western Reserve University, Cleveland, OH 44106, USA June 12, 2023
| | - Gen Li
- Department of Population and Quantitative Health Sciences, School of Medicine
- Case Western Reserve University, Cleveland, OH 44106, USA June 12, 2023
| | - Xiaofeng Zhu
- This work was supported by grant HG011052 (to X.Z.) from the National Human Genome Research Institute (NHGRI), USA.
| |
Collapse
|
12
|
Lorincz-Comi N, Yang Y, Li G, Zhu X. MRBEE: A novel bias-corrected multivariable Mendelian Randomization method. RESEARCH SQUARE 2023:rs.3.rs-2464632. [PMID: 36778480 PMCID: PMC9915796 DOI: 10.21203/rs.3.rs-2464632/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Mendelian Randomization (MR) has been widely applied to infer causality of exposures on outcomes in the genome wide association (GWAS) era. Existing approaches are often subject to biases from multiple sources including weak instruments, sample overlap, and measurement error. We introduce MRBEE, a computationally efficient multivariable MR method that can correct for all known biases simultaneously, which is demonstrated in theory, simulations, and real data analysis. In comparison, all existing MR methods are biased. In two independent real data analyses, we observed that the causal effect of BMI on coronary artery disease risk is completely mediated by blood pressure, and that existing MR methods drastically underestimate the causal effect of cannabis use disorder on schizophrenia risk compared to MRBEE. We demonstrate that MRBEE can be a useful tool in studying causality between multiple risk factors and a disease outcome, especially as more GWAS summary statistics are being made publicly available.
Collapse
Affiliation(s)
- Noah Lorincz-Comi
- Department of Population and Quantitative Health Sciences, Case Western Reserve University
| | - Yihe Yang
- Department of Population and Quantitative Health Sciences, Case Western Reserve University
| | - Gen Li
- Department of Population and Quantitative Health Sciences, Case Western Reserve University
| | - Xiaofeng Zhu
- Department of Population and Quantitative Health Sciences, Case Western Reserve University
| |
Collapse
|
13
|
Ding P, Gorenflo MP, Zhu X, Xu R. Aspirin Use and Risk of Alzheimer's Disease: A 2-Sample Mendelian Randomization Study. J Alzheimers Dis 2023; 92:989-1000. [PMID: 36846997 PMCID: PMC11220559 DOI: 10.3233/jad-220787] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/23/2023]
Abstract
BACKGROUND Observational studies have shown inconsistent findings of the relationships between aspirin use and the risk of Alzheimer's disease (AD). OBJECTIVE Since residual confounding and reverse causality were challenging issues inherent in observational studies, we conducted a 2-sample Mendelian randomization analysis (MR) to investigate whether aspirin use was causally associated with the risk of AD. METHODS We conducted 2-sample MR analyses utilizing summary genetic association statistics to estimate the potential causal relationship between aspirin use and AD. Single-nucleotide variants associated with aspirin use in a genome-wide association study (GWAS) of UK Biobank were considered as genetic proxies for aspirin use. The GWAS summary-level data of AD were derived from a meta-analysis of GWAS data from the International Genomics of Alzheimer's Project (IGAP) stage I. RESULTS Univariable MR analysis based on these two large GWAS data sources showed that genetically proxied aspirin use was associated with a decreased risk of AD (Odds Ratio (OR): 0.87; 95%CI: 0.77-0.99). In multivariate MR analyses, the causal estimates remained significant after adjusting for chronic pain, inflammation, heart failure (OR = 0.88, 95%CI = 0.78-0.98), or stroke (OR = 0.87, 95%CI = 0.77-0.99), but was attenuated when adjusting for coronary heart disease, blood pressure, and blood lipids. CONCLUSION Findings from this MR analysis suggest a genetic protective effect of aspirin use on AD, possibly influenced by coronary heart disease, blood pressure, and lipid levels.
Collapse
Affiliation(s)
- Pingjian Ding
- Center for Artificial Intelligence in Drug Discovery, School of Medicine, Case Western Reserve University, Cleveland, OH, USA
| | - Maria P. Gorenflo
- Center for Artificial Intelligence in Drug Discovery, School of Medicine, Case Western Reserve University, Cleveland, OH, USA
- Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, OH, USA
| | - Xiaofeng Zhu
- Department of Population and Quantitative Health Sciences, School of Medicine, Case Western Reserve University, Cleveland, OH, USA
| | - Rong Xu
- Center for Artificial Intelligence in Drug Discovery, School of Medicine, Case Western Reserve University, Cleveland, OH, USA
| |
Collapse
|
14
|
Ding P, Gurney M, Perry G, Xu R. Association of COVID-19 with Risk and Progression of Alzheimer's Disease: Non-Overlapping Two-Sample Mendelian Randomization Analysis of 2.6 Million Subjects. J Alzheimers Dis 2023; 96:1711-1720. [PMID: 38007657 PMCID: PMC11037518 DOI: 10.3233/jad-230632] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2023]
Abstract
BACKGROUND Epidemiological studies showed that COVID-19 increases risk of Alzheimer's disease (AD). However, it remains unknown if there is a potential genetic predispositional effect. OBJECTIVE To examine potential effects of genetic susceptibility of COVID-19 on the risk and progression of AD, we performed a non-overlapping 2-sample Mendelian randomization (MR) study using summary statistics from genome-wide association studies (GWAS). METHODS Two-sample Mendelian randomization (MR) analysis of over 2.6 million subjects was used to examine whether genetic susceptibility of COVID-19 is not associated with the risk of AD, cortical amyloid burden, hippocampal volume, or AD progression score. Additionally, a validation analysis was performed on a combined sample size of 536,190 participants. RESULTS We show that the AD risk was not associated with genetic susceptibility of COVID-19 risk (OR = 0.98, 95% CI 0.81-1.19) and COVID-19 severity (COVID-19 hospitalization: OR = 0.98, 95% CI 0.9-1.07, and critical COVID-19: OR = 0.98, 95% CI 0.92-1.03). Genetic predisposition to COVID-19 is not associated with AD progression as measured by hippocampal volume, cortical amyloid beta load, and AD progression score. These findings were replicated in a set of 536,190 participants. Consistent results were obtained across models based on different GWAS summary statistics, MR estimators and COVID-19 definitions. CONCLUSIONS Our findings indicated that the genetic susceptibility of COVID-19 is not associated with the risk and progression of AD.
Collapse
Affiliation(s)
- Pingjian Ding
- Center for Artificial Intelligence in Drug Discovery, School of Medicine, Case Western Reserve University, Cleveland, OH, USA
| | - Mark Gurney
- Department of Psychiatry and Behavioral Sciences, Miller School of Medicine, University of Miami, Miami, FL, USA
| | - George Perry
- Department of Neuroscience, Development and Regenerative Biology, College of Sciences, The University of Texas at San Antonio, San Antonio, TX, USA
| | - Rong Xu
- Center for Artificial Intelligence in Drug Discovery, School of Medicine, Case Western Reserve University, Cleveland, OH, USA
| |
Collapse
|
15
|
Dapas M, Dunaif A. Deconstructing a Syndrome: Genomic Insights Into PCOS Causal Mechanisms and Classification. Endocr Rev 2022; 43:927-965. [PMID: 35026001 PMCID: PMC9695127 DOI: 10.1210/endrev/bnac001] [Citation(s) in RCA: 82] [Impact Index Per Article: 41.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/15/2021] [Indexed: 01/16/2023]
Abstract
Polycystic ovary syndrome (PCOS) is among the most common disorders in women of reproductive age, affecting up to 15% worldwide, depending on the diagnostic criteria. PCOS is characterized by a constellation of interrelated reproductive abnormalities, including disordered gonadotropin secretion, increased androgen production, chronic anovulation, and polycystic ovarian morphology. It is frequently associated with insulin resistance and obesity. These reproductive and metabolic derangements cause major morbidities across the lifespan, including anovulatory infertility and type 2 diabetes (T2D). Despite decades of investigative effort, the etiology of PCOS remains unknown. Familial clustering of PCOS cases has indicated a genetic contribution to PCOS. There are rare Mendelian forms of PCOS associated with extreme phenotypes, but PCOS typically follows a non-Mendelian pattern of inheritance consistent with a complex genetic architecture, analogous to T2D and obesity, that reflects the interaction of susceptibility genes and environmental factors. Genomic studies of PCOS have provided important insights into disease pathways and have indicated that current diagnostic criteria do not capture underlying differences in biology associated with different forms of PCOS. We provide a state-of-the-science review of genetic analyses of PCOS, including an overview of genomic methodologies aimed at a general audience of non-geneticists and clinicians. Applications in PCOS will be discussed, including strengths and limitations of each study. The contributions of environmental factors, including developmental origins, will be reviewed. Insights into the pathogenesis and genetic architecture of PCOS will be summarized. Future directions for PCOS genetic studies will be outlined.
Collapse
Affiliation(s)
- Matthew Dapas
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
| | - Andrea Dunaif
- Division of Endocrinology, Diabetes and Bone Disease, Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| |
Collapse
|
16
|
Boehm FJ, Zhou X. Statistical methods for Mendelian randomization in genome-wide association studies: A review. Comput Struct Biotechnol J 2022; 20:2338-2351. [PMID: 35615025 PMCID: PMC9123217 DOI: 10.1016/j.csbj.2022.05.015] [Citation(s) in RCA: 101] [Impact Index Per Article: 50.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Revised: 05/08/2022] [Accepted: 05/09/2022] [Indexed: 11/15/2022] Open
Abstract
Genome-wide association studies have yielded thousands of associations for many common diseases and disease-related complex traits. The identified associations made it possible to identify the causal risk factors underlying diseases and investigate the causal relationships among complex traits through Mendelian randomization. Mendelian randomization is a form of instrumental variable analysis that uses SNP associations from genome-wide association studies as instruments to study and uncover causal relationships between complex traits. By leveraging SNP genotypes as instrumental variables, or proxies, for the exposure complex trait, investigators can tease out causal effects from observational data, provided that necessary assumptions are satisfied. We discuss below the development of Mendelian randomization methods in parallel with the growth of genome-wide association studies. We argue that the recent availability of GWAS summary statistics for diverse complex traits has motivated new Mendelian randomization methods with relaxed causality assumptions and that this area continues to offer opportunities for robust biological discoveries.
Collapse
Affiliation(s)
- Frederick J. Boehm
- Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Xiang Zhou
- Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA
- Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109, USA
| |
Collapse
|
17
|
Zhu X, Zhu L, Wang H, Cooper RS, Chakravarti A. Genome-wide pleiotropy analysis identifies novel blood pressure variants and improves its polygenic risk scores. Genet Epidemiol 2022; 46:105-121. [PMID: 34989438 PMCID: PMC8863647 DOI: 10.1002/gepi.22440] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2021] [Accepted: 12/07/2021] [Indexed: 01/21/2023]
Abstract
Systolic and diastolic blood pressure (S/DBP) are highly correlated modifiable risk factors for cardiovascular disease (CVD). We report here a bidirectional Mendelian Randomization (MR) and horizontal pleiotropy analysis of S/DBP summary statistics from the UK Biobank (UKB)-International Consortium for Blood Pressure (ICBP) (UKB-ICBP) BP genome-wide association study and construct a composite genetic risk score (GRS) by including pleiotropic variants. The composite GRS captures greater (1.11-3.26 fold) heritability for BP traits and increases (1.09- and 2.01-fold) Nagelkerke's R2 for hypertension and CVD. We replicated 118 novel BP horizontal pleiotropic variants including 18 novel BP loci using summary statistics from the Million Veteran Program (MVP) study. An additional 219 novel BP signals and 40 novel loci were identified after a meta-analysis of the UKB-ICBP and MVP summary statistics but without further independent replication. Our study provides further insight into BP regulation and provides a novel way to construct a GRS by including pleiotropic variants for other complex diseases.
Collapse
Affiliation(s)
- Xiaofeng Zhu
- Department of Population and Quantitative Health SciencesCase Western Reserve UniversityClevelandOhioUSA
| | - Luke Zhu
- Department of Medicine, Center for Human Genetics & GenomicsNew York University Langone HealthNew YorkNew YorkUSA
| | - Heming Wang
- Division of Sleep and Circadian DisordersBrigham and Women's HospitalBostonMassachusettsUSA
| | - Richard S. Cooper
- Department of Public Health Sciences, Stritch School of MedicineLoyola University ChicagoMaywoodIllinoisUSA
| | - Aravinda Chakravarti
- Department of Medicine, Center for Human Genetics & GenomicsNew York University Langone HealthNew YorkNew YorkUSA
| |
Collapse
|
18
|
Sanderson E, Glymour MM, Holmes MV, Kang H, Morrison J, Munafò MR, Palmer T, Schooling CM, Wallace C, Zhao Q, Smith GD. Mendelian randomization. NATURE REVIEWS. METHODS PRIMERS 2022; 2:6. [PMID: 37325194 PMCID: PMC7614635 DOI: 10.1038/s43586-021-00092-5] [Citation(s) in RCA: 467] [Impact Index Per Article: 233.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 12/21/2021] [Indexed: 06/17/2023]
Abstract
Mendelian randomization (MR) is a term that applies to the use of genetic variation to address causal questions about how modifiable exposures influence different outcomes. The principles of MR are based on Mendel's laws of inheritance and instrumental variable estimation methods, which enable the inference of causal effects in the presence of unobserved confounding. In this Primer, we outline the principles of MR, the instrumental variable conditions underlying MR estimation and some of the methods used for estimation. We go on to discuss how the assumptions underlying an MR study can be assessed and give methods of estimation that are robust to certain violations of these assumptions. We give examples of a range of studies in which MR has been applied, the limitations of current methods of analysis and the outlook for MR in the future. The difference between the assumptions required for MR analysis and other forms of non-interventional epidemiological studies means that MR can be used as part of a triangulation across multiple sources of evidence for causal inference.
Collapse
Affiliation(s)
- Eleanor Sanderson
- Medical Research Council (MRC) Integrative Epidemiology Unit, University of Bristol, Bristol, UK
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| | - M. Maria Glymour
- Department of Epidemiology and Biostatistics, University of California, San Francisco, CA, USA
| | - Michael V. Holmes
- Medical Research Council (MRC) Integrative Epidemiology Unit, University of Bristol, Bristol, UK
- MRC Population Health Research Unit, University of Oxford, Oxford, UK
- Clinical Trial Service Unit and Epidemiological Studies Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Hyunseung Kang
- Department of Statistics, University of Wisconsin-Madison, Madison, WI, USA
| | - Jean Morrison
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA
| | - Marcus R. Munafò
- Medical Research Council (MRC) Integrative Epidemiology Unit, University of Bristol, Bristol, UK
- School of Psychological Science, University of Bristol, Bristol, UK
- National Institute for Health Research (NIHR), Biomedical Research Centre, University of Bristol, Bristol, UK
| | - Tom Palmer
- Medical Research Council (MRC) Integrative Epidemiology Unit, University of Bristol, Bristol, UK
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| | - C. Mary Schooling
- School of Public Health, Li Ka Shing, Faculty of Medicine, The University of Hong Kong, Hong Kong, China
- School of Public Health, City University of New York, New York, USA
| | - Chris Wallace
- MRC Biostatistics Unit, University of Cambridge, Cambridge, UK
- Cambridge Institute of Therapeutic Immunology & Infectious Disease (CITIID), University of Cambridge, Cambridge, UK
| | - Qingyuan Zhao
- Statistical Laboratory, University of Cambridge, Cambridge, UK
| | - George Davey Smith
- Medical Research Council (MRC) Integrative Epidemiology Unit, University of Bristol, Bristol, UK
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
- National Institute for Health Research (NIHR), Biomedical Research Centre, University of Bristol, Bristol, UK
| |
Collapse
|
19
|
Lin Z, Deng Y, Pan W. Combining the strengths of inverse-variance weighting and Egger regression in Mendelian randomization using a mixture of regressions model. PLoS Genet 2021; 17:e1009922. [PMID: 34793444 PMCID: PMC8639093 DOI: 10.1371/journal.pgen.1009922] [Citation(s) in RCA: 86] [Impact Index Per Article: 28.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2021] [Revised: 12/02/2021] [Accepted: 11/02/2021] [Indexed: 11/19/2022] Open
Abstract
With the increasing availability of large-scale GWAS summary data on various traits, Mendelian randomization (MR) has become commonly used to infer causality between a pair of traits, an exposure and an outcome. It depends on using genetic variants, typically SNPs, as instrumental variables (IVs). The inverse-variance weighted (IVW) method (with a fixed-effect meta-analysis model) is most powerful when all IVs are valid; however, when horizontal pleiotropy is present, it may lead to biased inference. On the other hand, Egger regression is one of the most widely used methods robust to (uncorrelated) pleiotropy, but it suffers from loss of power. We propose a two-component mixture of regressions to combine and thus take advantage of both IVW and Egger regression; it is often both more efficient (i.e. higher powered) and more robust to pleiotropy (i.e. controlling type I error) than either IVW or Egger regression alone by accounting for both valid and invalid IVs respectively. We propose a model averaging approach and a novel data perturbation scheme to account for uncertainties in model/IV selection, leading to more robust statistical inference for finite samples. Through extensive simulations and applications to the GWAS summary data of 48 risk factor-disease pairs and 63 genetically uncorrelated trait pairs, we showcase that our proposed methods could often control type I error better while achieving much higher power than IVW and Egger regression (and sometimes than several other new/popular MR methods). We expect that our proposed methods will be a useful addition to the toolbox of Mendelian randomization for causal inference.
Collapse
Affiliation(s)
- Zhaotong Lin
- Division of Biostatistics, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Yangqing Deng
- Division of Biostatistics, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Wei Pan
- Division of Biostatistics, University of Minnesota, Minneapolis, Minnesota, United States of America
| |
Collapse
|
20
|
Zhu X. Mendelian randomization and pleiotropy analysis. QUANTITATIVE BIOLOGY 2021; 9:122-132. [PMID: 34386270 PMCID: PMC8356909 DOI: 10.1007/s40484-020-0216-3] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2020] [Revised: 05/16/2020] [Accepted: 05/21/2020] [Indexed: 10/23/2022]
Abstract
BACKGROUND Mendelian randomization (MR) analysis has become popular in inferring and estimating the causality of an exposure on an outcome due to the success of genome wide association studies. Many statistical approaches have been developed and each of these methods require specific assumptions. RESULTS In this article, we review the pros and cons of these methods. We use an example of high-density lipoprotein cholesterol on coronary artery disease to illuminate the challenges in Mendelian randomization investigation. CONCLUSION The current available MR approaches allow us to study causality among risk factors and outcomes. However, novel approaches are desirable for overcoming multiple source confounding of risk factors and an outcome in MR analysis.
Collapse
Affiliation(s)
- Xiaofeng Zhu
- Department of Population and Quantitative Health Sciences, School of Medicine, Case Western Reserve University, Cleveland, OH 44106, USA
| |
Collapse
|
21
|
Xue H, Shen X, Pan W. Constrained maximum likelihood-based Mendelian randomization robust to both correlated and uncorrelated pleiotropic effects. Am J Hum Genet 2021; 108:1251-1269. [PMID: 34214446 PMCID: PMC8322939 DOI: 10.1016/j.ajhg.2021.05.014] [Citation(s) in RCA: 94] [Impact Index Per Article: 31.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2021] [Accepted: 05/25/2021] [Indexed: 12/23/2022] Open
Abstract
With the increasing availability of large-scale GWAS summary data on various complex traits and diseases, there have been tremendous interests in applications of Mendelian randomization (MR) to investigate causal relationships between pairs of traits using SNPs as instrumental variables (IVs) based on observational data. In spite of the potential significance of such applications, the validity of their causal conclusions critically depends on some strong modeling assumptions required by MR, which may be violated due to the widespread (horizontal) pleiotropy. Although many MR methods have been proposed recently to relax the assumptions by mainly dealing with uncorrelated pleiotropy, only a few can handle correlated pleiotropy, in which some SNPs/IVs may be associated with hidden confounders, such as some heritable factors shared by both traits. Here we propose a simple and effective approach based on constrained maximum likelihood and model averaging, called cML-MA, applicable to GWAS summary data. To deal with more challenging situations with many invalid IVs with only weak pleiotropic effects, we modify and improve it with data perturbation. Extensive simulations demonstrated that the proposed methods could control the type I error rate better while achieving higher power than other competitors. Applications to 48 risk factor-disease pairs based on large-scale GWAS summary data of 3 cardio-metabolic diseases (coronary artery disease, stroke, and type 2 diabetes), asthma, and 12 risk factors confirmed its superior performance.
Collapse
Affiliation(s)
- Haoran Xue
- School of Statistics, University of Minnesota, Minneapolis, MN 55455, USA; Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, MN 55455, USA
| | - Xiaotong Shen
- School of Statistics, University of Minnesota, Minneapolis, MN 55455, USA
| | - Wei Pan
- Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, MN 55455, USA.
| |
Collapse
|
22
|
Wang Y, Guo P, Liu L, Zhang Y, Zeng P, Yuan Z. Mendelian Randomization Highlights the Causal Role of Normal Thyroid Function on Blood Lipid Profiles. Endocrinology 2021; 162:6136226. [PMID: 33587120 DOI: 10.1210/endocr/bqab037] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/13/2021] [Indexed: 12/13/2022]
Abstract
The association between thyroid function and dyslipidemia has been well documented in observational studies. However, observational studies are prone to confounding, making it difficult to conduct causal inference. We performed a 2-sample bidirectional Mendelian randomization (MR) using summary statistics from large-scale genome-wide association studies of thyroid stimulating hormone (TSH), free T4 (FT4), and blood lipids. We chose the inverse variance-weighted (IVW) method for the main analysis, and consolidated results through various sensitivity analyses involving 6 different MR methods under different model specifications. We further conducted genetic correlation analysis and colocalization analysis to deeply reflect the causality. The IVW method showed per 1 SD increase in normal TSH was significantly associated with a 0.048 SD increase in total cholesterol (TC; P < 0.001) and a 0.032 SD increase in low-density lipoprotein cholesterol (LDL; P = 0.021). A 1 SD increase in normal FT4 was significantly associated with a 0.056 SD decrease in TC (P = 0.014) and a 0.072 SD decrease in LDL (P = 0.009). Neither TSH nor FT4 showed causal associations with high-density lipoprotein cholesterol and triglycerides. No significant causal effect of blood lipids on normal TSH or FT4 can be detected. All results were largely consistent when using several alternative MR methods, and were reconfirmed by both genetic correlation analysis and colocalization analysis. Our study suggested that, even within reference range, higher TSH or lower FT4 are causally associated with increased TC and LDL, whereas no reverse causal association can be found.
Collapse
Affiliation(s)
- Yanjun Wang
- Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan 250012, China
| | - Ping Guo
- Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan 250012, China
| | - Lu Liu
- Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan 250012, China
| | - Yanan Zhang
- Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan 250012, China
| | - Ping Zeng
- Department of Epidemiology and Biostatistics, Xuzhou Medical University, Xuzhou 221004, China
| | - Zhongshang Yuan
- Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan 250012, China
| |
Collapse
|
23
|
Lorincz-Comi N, Zhu X. Cardiometabolic risks of SARS-CoV-2 hospitalization using Mendelian Randomization. Sci Rep 2021; 11:7848. [PMID: 33846372 PMCID: PMC8042041 DOI: 10.1038/s41598-021-86757-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2020] [Accepted: 03/19/2021] [Indexed: 01/31/2023] Open
Abstract
Many cardiometabolic conditions have demonstrated associative evidence with COVID-19 hospitalization risk. However, the observational designs of the studies in which these associations are observed preclude causal inferences of hospitalization risk. Mendelian Randomization (MR) is an alternative risk estimation method more robust to these limitations that allows for causal inferences. We applied four MR methods (MRMix, IMRP, IVW, MREgger) to publicly available GWAS summary statistics from European (COVID-19 GWAS n = 2956) and multi-ethnic populations (COVID-19 GWAS n = 10,908) to better understand extant causal associations between Type II Diabetes (GWAS n = 659,316), BMI (n = 681,275), diastolic and systolic blood pressure, and pulse pressure (n = 757,601 for each) and COVID-19 hospitalization risk across populations. Although no significant causal effect evidence was observed, our data suggested a trend of increasing hospitalization risk for Type II diabetes (IMRP OR, 95% CI 1.67, 0.96-2.92) and pulse pressure (OR, 95% CI 1.27, 0.97-1.66) in the multi-ethnic sample. Type II diabetes and Pulse pressure demonstrates a potential causal association with COVID-19 hospitalization risk, the proper treatment of which may work to reduce the risk of a severe COVID-19 illness requiring hospitalization. However, GWAS of COVID-19 with large sample size is warranted to confirm the causality.
Collapse
Affiliation(s)
- Noah Lorincz-Comi
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, OH, USA
| | - Xiaofeng Zhu
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, OH, USA.
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Wolstein Research Building Room 1317, 2103 Cornell Rd, Cleveland, OH, 44106, USA.
| |
Collapse
|