1
|
Zhai S, Guo B, Wu B, Mehrotra DV, Shen J. Integrating multiple traits for improving polygenic risk prediction in disease and pharmacogenomics GWAS. Brief Bioinform 2023:7169140. [PMID: 37200155 DOI: 10.1093/bib/bbad181] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2023] [Revised: 03/30/2023] [Accepted: 04/21/2023] [Indexed: 05/20/2023] Open
Abstract
Polygenic risk score (PRS) has been recently developed for predicting complex traits and drug responses. It remains unknown whether multi-trait PRS (mtPRS) methods, by integrating information from multiple genetically correlated traits, can improve prediction accuracy and power for PRS analysis compared with single-trait PRS (stPRS) methods. In this paper, we first review commonly used mtPRS methods and find that they do not directly model the underlying genetic correlations among traits, which has been shown to be useful in guiding multi-trait association analysis in the literature. To overcome this limitation, we propose a mtPRS-PCA method to combine PRSs from multiple traits with weights obtained from performing principal component analysis (PCA) on the genetic correlation matrix. To accommodate various genetic architectures covering different effect directions, signal sparseness and across-trait correlation structures, we further propose an omnibus mtPRS method (mtPRS-O) by combining P values from mtPRS-PCA, mtPRS-ML (mtPRS based on machine learning) and stPRSs using Cauchy Combination Test. Our extensive simulation studies show that mtPRS-PCA outperforms other mtPRS methods in both disease and pharmacogenomics (PGx) genome-wide association studies (GWAS) contexts when traits are similarly correlated, with dense signal effects and in similar effect directions, and mtPRS-O is consistently superior to most other methods due to its robustness under various genetic architectures. We further apply mtPRS-PCA, mtPRS-O and other methods to PGx GWAS data from a randomized clinical trial in the cardiovascular domain and demonstrate performance improvement of mtPRS-PCA in both prediction accuracy and patient stratification as well as the robustness of mtPRS-O in PRS association test.
Collapse
Affiliation(s)
- Song Zhai
- Biostatistics and Research Decision Sciences, Merck & Co., Inc., Rahway, NJ 07065, USA
| | - Bin Guo
- Data and Genome Science, Merck & Co., Inc., Cambridge, MA 02141, USA
| | - Baolin Wu
- Department of Epidemiology and Biostatistics, University of California Irvine, Irvine, CA 92697, USA
| | - Devan V Mehrotra
- Biostatistics and Research Decision Sciences, Merck & Co., Inc., North Wales, PA 19454, USA
| | - Judong Shen
- Biostatistics and Research Decision Sciences, Merck & Co., Inc., Rahway, NJ 07065, USA
| |
Collapse
|
2
|
Song S, Wang S, Li N, Chang S, Dai S, Guo Y, Wu X, Cheng Y, Zeng S. Genome-wide association study to identify SNPs and candidate genes associated with body size traits in donkeys. Front Genet 2023; 14:1112377. [PMID: 36926587 PMCID: PMC10011486 DOI: 10.3389/fgene.2023.1112377] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Accepted: 02/14/2023] [Indexed: 03/08/2023] Open
Abstract
The Yangyuan donkey is a domestic animal breed mainly distributed in the northwest region of Hebei Province. Donkey body shape is the most direct production index, can fully reflect the donkey's growth status, and is closely related to important economic traits. As one of the main breeding selection criteria, body size traits have been widely used to monitor animal growth and evaluate the selection response. Molecular markers genetically linked to body size traits have the potential to accelerate the breeding process of animals via marker-assisted selection. However, the molecular markers of body size in Yangyuan donkeys have yet to be explored. In this study, we performed a genome-wide association study to identify the genomic variations associated with body size traits in a population of 120 Yangyuan donkeys. We screened 16 single nucleotide polymorphisms that were significantly associated with body size traits. Some genes distributed around these significant SNPs were considered candidates for body size traits, including SMPD4, RPS6KA6, LPAR4, GLP2R, BRWD3, MAGT1, ZDHHC15, and CYSLTR1. Gene Ontology and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analyses indicated that these genes were mainly involved in the P13K-Akt signaling pathway, Rap1 signaling pathway, regulation of actin cytoskeleton, calcium signaling pathway, phospholipase D signaling pathway, and neuroactive ligand-receptor interactions. Collectively, our study reported on a list of novel markers and candidate genes associated with body size traits in donkeys, providing useful information for functional gene studies and offering great potential for accelerating Yangyuan donkey breeding.
Collapse
Affiliation(s)
- Shuang Song
- National Engineering Laboratory for Animal Breeding, Key Laboratory of Animal Genetics and Breeding of the Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Shiwei Wang
- National Engineering Laboratory for Animal Breeding, Key Laboratory of Animal Genetics and Breeding of the Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Nan Li
- National Engineering Laboratory for Animal Breeding, Key Laboratory of Animal Genetics and Breeding of the Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Siyu Chang
- National Engineering Laboratory for Animal Breeding, Key Laboratory of Animal Genetics and Breeding of the Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Shizhen Dai
- National Engineering Laboratory for Animal Breeding, Key Laboratory of Animal Genetics and Breeding of the Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Yajun Guo
- National Engineering Laboratory for Animal Breeding, Key Laboratory of Animal Genetics and Breeding of the Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Xuan Wu
- National Engineering Laboratory for Animal Breeding, Key Laboratory of Animal Genetics and Breeding of the Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Yuanweilu Cheng
- National Engineering Laboratory for Animal Breeding, Key Laboratory of Animal Genetics and Breeding of the Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Shenming Zeng
- National Engineering Laboratory for Animal Breeding, Key Laboratory of Animal Genetics and Breeding of the Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, China
| |
Collapse
|
3
|
Cao X, Xing L, He H, Zhang X. Views on GWAS statistical analysis. Bioinformation 2020; 16:393-397. [PMID: 32831520 PMCID: PMC7434950 DOI: 10.6026/97320630016393] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2020] [Revised: 04/15/2020] [Accepted: 04/17/2020] [Indexed: 11/23/2022] Open
Abstract
Genome-wide association study (GWAS) is a popular approach to investigate relationships between genetic information and diseases. A number of associations are tested in a study and the results are often corrected using multiple adjustment methods. It is observed that GWAS studies suffer adequate statistical power for reliability. Hence, we document known models for reliability assessment using improved statistical power in GWAS analysis.
Collapse
Affiliation(s)
- Xiaowen Cao
- Department of Mathematics, Hebei University of Technology, Tianjin, China.,Department of Mathematics and Statistics, University of Victoria, BC, Canada
| | - Li Xing
- Department of Mathematics and Statistics, University of Saskatchewan, Saskatoon, SK, Canada
| | - Hua He
- Department of Mathematics, Hebei University of Technology, Tianjin, China
| | - Xuekui Zhang
- Department of Mathematics and Statistics, University of Victoria, BC, Canada
| |
Collapse
|
4
|
Xu Y, Xing L, Su J, Zhang X, Qiu W. Model-based clustering for identifying disease-associated SNPs in case-control genome-wide association studies. Sci Rep 2019; 9:13686. [PMID: 31548641 PMCID: PMC6757104 DOI: 10.1038/s41598-019-50229-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2019] [Accepted: 09/09/2019] [Indexed: 12/18/2022] Open
Abstract
Genome-wide association studies (GWASs) aim to detect genetic risk factors for complex human diseases by identifying disease-associated single-nucleotide polymorphisms (SNPs). The traditional SNP-wise approach along with multiple testing adjustment is over-conservative and lack of power in many GWASs. In this article, we proposed a model-based clustering method that transforms the challenging high-dimension-small-sample-size problem to low-dimension-large-sample-size problem and borrows information across SNPs by grouping SNPs into three clusters. We pre-specify the patterns of clusters by minor allele frequencies of SNPs between cases and controls, and enforce the patterns with prior distributions. In the simulation studies our proposed novel model outperforms traditional SNP-wise approach by showing better controls of false discovery rate (FDR) and higher sensitivity. We re-analyzed two real studies to identifying SNPs associated with severe bortezomib-induced peripheral neuropathy (BiPN) in patients with multiple myeloma (MM). The original analysis in the literature failed to identify SNPs after FDR adjustment. Our proposed method not only detected the reported SNPs after FDR adjustment but also discovered a novel BiPN-associated SNP rs4351714 that has been reported to be related to MM in another study.
Collapse
Affiliation(s)
- Yan Xu
- Department of Mathematics and Statistics, University of Victoria, Victoria, BC, Canada
| | - Li Xing
- Department of Mathematics and Statistics, University of Saskatchewan, Saskatoon, SK, Canada
| | - Jessica Su
- Channing Division of Network Medicine, Brigham and Women's Hospital/Harvard Medical School, Boston, MA, USA
| | - Xuekui Zhang
- Department of Mathematics and Statistics, University of Victoria, Victoria, BC, Canada.
| | - Weiliang Qiu
- Channing Division of Network Medicine, Brigham and Women's Hospital/Harvard Medical School, Boston, MA, USA
| |
Collapse
|