1
|
Su Y, Xu S, Ma Y, Yin P, Hao X, Zhou J, Fung WK, Jiang H, Wang P. A Modified Debiased Inverse-Variance Weighted Estimator in Two-Sample Summary-Data Mendelian Randomization. Stat Med 2024. [PMID: 39453381 DOI: 10.1002/sim.10245] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2023] [Revised: 08/29/2024] [Accepted: 09/26/2024] [Indexed: 10/26/2024]
Abstract
Mendelian randomization uses genetic variants as instrumental variables to estimate the causal effect of exposure on outcome from observational data. A common challenge in Mendelian randomization is that many genetic variants are only modestly or even weakly associated with the exposure of interest, a setting known as many weak instruments. Conventional methods, such as the popular inverse-variance weighted (IVW) estimator, could be heavily biased toward zero when the instrument strength is weak. To address this issue, the debiased IVW (dIVW) estimator and the penalized IVW (pIVW) estimator, which are shown to be robust to many weak instruments, were recently proposed. However, we find that the dIVW estimator tends to produce an exaggerated estimate of the causal effect, especially when the effective sample size is small. Although the pIVW estimator has better statistical properties, it is slightly more complex, and the idea behind this method is also a bit less intuitive. Therefore, we propose a modified debiased IVW (mdIVW) estimator that directly multiplies a shrinkage factor with the original dIVW estimator. After this simple modification, we prove that the mdIVW estimator not only has second-order bias with respect to the effective sample size, but also has smaller variance and mean squared error than the preceding two estimators. We then extend the proposed method to account for the presence of instrumental variable selection and balanced horizontal pleiotropy. We demonstrate the improvement of the mdIVW estimator over the competing ones through extensive simulation studies and real data analysis.
Collapse
Affiliation(s)
- Youpeng Su
- Department of Epidemiology and Biostatistics, School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Siqi Xu
- Clinical Research Centre, Guangzhou First People's Hospital, School of Medicine, South China University of Technology, Guangzhou, China
| | - Yilei Ma
- Department of Epidemiology and Biostatistics, School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Ping Yin
- Department of Epidemiology and Biostatistics, School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Xingjie Hao
- Department of Epidemiology and Biostatistics, School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Jiyuan Zhou
- Department of Biostatistics, School of Public Health, Southern Medical University, Guangzhou, China
| | - Wing Kam Fung
- Department of Statistics and Actuarial Science, The University of Hong Kong, Hong Kong SAR, China
| | - Hongwei Jiang
- Department of Epidemiology and Biostatistics, School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Peng Wang
- Department of Epidemiology and Biostatistics, School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| |
Collapse
|
2
|
Hu X, Cai M, Xiao J, Wan X, Wang Z, Zhao H, Yang C. Benchmarking Mendelian randomization methods for causal inference using genome-wide association study summary statistics. Am J Hum Genet 2024; 111:1717-1735. [PMID: 39059387 PMCID: PMC11339627 DOI: 10.1016/j.ajhg.2024.06.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Revised: 06/26/2024] [Accepted: 06/26/2024] [Indexed: 07/28/2024] Open
Abstract
Mendelian randomization (MR), which utilizes genetic variants as instrumental variables (IVs), has gained popularity as a method for causal inference between phenotypes using genetic data. While efforts have been made to relax IV assumptions and develop new methods for causal inference in the presence of invalid IVs due to confounding, the reliability of MR methods in real-world applications remains uncertain. Instead of using simulated datasets, we conducted a benchmark study evaluating 16 two-sample summary-level MR methods using real-world genetic datasets to provide guidelines for the best practices. Our study focused on the following crucial aspects: type I error control in the presence of various confounding scenarios (e.g., population stratification, pleiotropy, and family-level confounders like assortative mating), the accuracy of causal effect estimates, replicability, and power. By comprehensively evaluating the performance of compared methods over one thousand exposure-outcome trait pairs, our study not only provides valuable insights into the performance and limitations of the compared methods but also offers practical guidance for researchers to choose appropriate MR methods for causal inference.
Collapse
Affiliation(s)
- Xianghong Hu
- School of Mathematical Sciences, Institute of Statistical Sciences, Shenzhen University, Shenzhen 518060, China; Department of Mathematics, The Hong Kong University of Science and Technology, Hong Kong, China; Guangzhou HKUST Fok Ying Tung Research Institute, Guangzhou 511458, China
| | - Mingxuan Cai
- Department of Biostatistics, City University of Hong Kong, Hong Kong, China
| | - Jiashun Xiao
- Shenzhen Research Institute of Big Data, Shenzhen 518172, China
| | - Xiaomeng Wan
- Department of Mathematics, The Hong Kong University of Science and Technology, Hong Kong, China; Guangzhou HKUST Fok Ying Tung Research Institute, Guangzhou 511458, China
| | - Zhiwei Wang
- Department of Mathematics, The Hong Kong University of Science and Technology, Hong Kong, China; Guangzhou HKUST Fok Ying Tung Research Institute, Guangzhou 511458, China
| | - Hongyu Zhao
- Department of Biostatistics, Yale School of Public Health, New Haven, CT 06520, USA.
| | - Can Yang
- Department of Mathematics, The Hong Kong University of Science and Technology, Hong Kong, China; Guangzhou HKUST Fok Ying Tung Research Institute, Guangzhou 511458, China; Big Data Bio-Intelligence Lab, The Hong Kong University of Science and Technology, Hong Kong SAR, China.
| |
Collapse
|