1
|
Hou L, Geng Z, Yuan Z, Shi X, Wang C, Chen F, Li H, Xue F. MRSL: a causal network pruning algorithm based on GWAS summary data. Brief Bioinform 2024; 25:bbae086. [PMID: 38487847 PMCID: PMC10940843 DOI: 10.1093/bib/bbae086] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2023] [Revised: 02/01/2024] [Accepted: 02/15/2024] [Indexed: 03/18/2024] Open
Abstract
Causal discovery is a powerful tool to disclose underlying structures by analyzing purely observational data. Genetic variants can provide useful complementary information for structure learning. Recently, Mendelian randomization (MR) studies have provided abundant marginal causal relationships of traits. Here, we propose a causal network pruning algorithm MRSL (MR-based structure learning algorithm) based on these marginal causal relationships. MRSL combines the graph theory with multivariable MR to learn the conditional causal structure using only genome-wide association analyses (GWAS) summary statistics. Specifically, MRSL utilizes topological sorting to improve the precision of structure learning. It proposes MR-separation instead of d-separation and three candidates of sufficient separating set for MR-separation. The results of simulations revealed that MRSL had up to 2-fold higher F1 score and 100 times faster computing time than other eight competitive methods. Furthermore, we applied MRSL to 26 biomarkers and 44 International Classification of Diseases 10 (ICD10)-defined diseases using GWAS summary data from UK Biobank. The results cover most of the expected causal links that have biological interpretations and several new links supported by clinical case reports or previous observational literatures.
Collapse
Affiliation(s)
- Lei Hou
- Beijing International Center for Mathematical Research, Peking University, Beijing, People’s Republic of China, 100871
| | - Zhi Geng
- School of Mathematics and Statistics, Beijing Technology and Business University, Beijing, People’s Republic of China, 100048
| | - Zhongshang Yuan
- Department of Epidemiology and Health Statistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan, People’s Republic of China, 250000
- Institute for Medical Dataology, Cheeloo College of Medicine, Shandong University, Jinan, People’s Republic of China, 250000
| | - Xu Shi
- Department of Biostatistics, University of Michigan, Ann Arbor, USA
| | - Chuan Wang
- Qilu Hospital, Cheeloo College of Medicine, Shandong University, Jinan, People's Republic of China, 250000
| | - Feng Chen
- School of Public Health, Nanjing Medical University, Nanjing, China, 211166
| | - Hongkai Li
- Department of Epidemiology and Health Statistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan, People’s Republic of China, 250000
- Institute for Medical Dataology, Cheeloo College of Medicine, Shandong University, Jinan, People’s Republic of China, 250000
| | - Fuzhong Xue
- Department of Epidemiology and Health Statistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan, People’s Republic of China, 250000
- Institute for Medical Dataology, Cheeloo College of Medicine, Shandong University, Jinan, People’s Republic of China, 250000
- Qilu Hospital, Cheeloo College of Medicine, Shandong University, Jinan, People's Republic of China, 250000
| |
Collapse
|