1
|
Wang J, Wang W, Li H. Sparse block signal detection and identification for shared cross-trait association analysis. Ann Appl Stat 2022. [DOI: 10.1214/21-aoas1523] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Affiliation(s)
- Jianqiao Wang
- Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania
| | - Wanjie Wang
- Department of Statistics and Applied Probability, National University of Singapore
| | - Hongzhe Li
- Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania
| |
Collapse
|
2
|
Wang T, Lu H, Zeng P. Identifying pleiotropic genes for complex phenotypes with summary statistics from a perspective of composite null hypothesis testing. Brief Bioinform 2021; 23:6375058. [PMID: 34571531 DOI: 10.1093/bib/bbab389] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2021] [Revised: 08/06/2021] [Accepted: 08/28/2021] [Indexed: 12/13/2022] Open
Abstract
Pleiotropy has important implication on genetic connection among complex phenotypes and facilitates our understanding of disease etiology. Genome-wide association studies provide an unprecedented opportunity to detect pleiotropic associations; however, efficient pleiotropy test methods are still lacking. We here consider pleiotropy identification from a methodological perspective of high-dimensional composite null hypothesis and propose a powerful gene-based method called MAIUP. MAIUP is constructed based on the traditional intersection-union test with two sets of independent P-values as input and follows a novel idea that was originally proposed under the high-dimensional mediation analysis framework. The key improvement of MAIUP is that it takes the composite null nature of pleiotropy test into account by fitting a three-component mixture null distribution, which can ultimately generate well-calibrated P-values for effective control of family-wise error rate and false discover rate. Another attractive advantage of MAIUP is its ability to effectively address the issue of overlapping subjects commonly encountered in association studies. Simulation studies demonstrate that compared with other methods, only MAIUP can maintain correct type I error control and has higher power across a wide range of scenarios. We apply MAIUP to detect shared associated genes among 14 psychiatric disorders with summary statistics and discover many new pleiotropic genes that are otherwise not identified if failing to account for the issue of composite null hypothesis testing. Functional and enrichment analyses offer additional evidence supporting the validity of these identified pleiotropic genes associated with psychiatric disorders. Overall, MAIUP represents an efficient method for pleiotropy identification.
Collapse
Affiliation(s)
- Ting Wang
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, Jiangsu, 221004, China
| | - Haojie Lu
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, Jiangsu, 221004, China
| | - Ping Zeng
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, Jiangsu, 221004, China.,Center for Medical Statistics and Data Analysis, Xuzhou Medical University, Xuzhou, Jiangsu, 221004, China.,Key Laboratory of Human Genetics and Environmental Medicine, Xuzhou Medical University, Xuzhou, Jiangsu, 221004, China
| |
Collapse
|
3
|
Bone WP, Siewert KM, Jha A, Klarin D, Damrauer SM, Chang KM, Tsao PS, Assimes TL, Ritchie MD, Voight BF. Multi-trait association studies discover pleiotropic loci between Alzheimer's disease and cardiometabolic traits. Alzheimers Res Ther 2021; 13:34. [PMID: 33541420 PMCID: PMC7860582 DOI: 10.1186/s13195-021-00773-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2020] [Accepted: 01/11/2021] [Indexed: 12/14/2022]
Abstract
BACKGROUND Identification of genetic risk factors that are shared between Alzheimer's disease (AD) and other traits, i.e., pleiotropy, can help improve our understanding of the etiology of AD and potentially detect new therapeutic targets. Previous epidemiological correlations observed between cardiometabolic traits and AD led us to assess the pleiotropy between these traits. METHODS We performed a set of bivariate genome-wide association studies coupled with colocalization analysis to identify loci that are shared between AD and eleven cardiometabolic traits. For each of these loci, we performed colocalization with Genotype-Tissue Expression (GTEx) project expression quantitative trait loci (eQTL) to identify candidate causal genes. RESULTS We identified three previously unreported pleiotropic trait associations at known AD loci as well as four novel pleiotropic loci. One associated locus was tagged by a low-frequency coding variant in the gene DOCK4 and is potentially implicated in its alternative splicing. Colocalization with GTEx eQTL data identified additional candidate genes for the loci we detected, including ACE, the target of the hypertensive drug class of ACE inhibitors. We found that the allele associated with decreased ACE expression in brain tissue was also associated with increased risk of AD, providing human genetic evidence of a potential increase in AD risk from use of an established anti-hypertensive therapeutic. CONCLUSION Our results support a complex genetic relationship between AD and these cardiometabolic traits, and the candidate causal genes identified suggest that blood pressure and immune response play a role in the pleiotropy between these traits.
Collapse
Affiliation(s)
- William P Bone
- Genomics and Computational Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Katherine M Siewert
- Genomics and Computational Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Anupama Jha
- Department of Computer and Information Science, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Derek Klarin
- Boston VA Healthcare System, Boston, MA, 02130, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, MA, 02114, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Scott M Damrauer
- Department of Medicine, Perelman School of Medicine, University of Pennsylvania, PA, 19104, Philadelphia, USA
- Corporal Michael Crescenz VA Medical Center, Philadelphia, PA, 19104, USA
- Department of Surgery, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Kyong-Mi Chang
- Corporal Michael Crescenz VA Medical Center, Philadelphia, PA, 19104, USA
- Department of Surgery, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Philip S Tsao
- VA Palo Alto Health Care System, Palo Alto, CA, 94550, USA
- Department of Medicine, Stanford University School of Medicine, Stanford, CA, 94305, USA
| | - Themistocles L Assimes
- VA Palo Alto Health Care System, Palo Alto, CA, 94550, USA
- Department of Medicine, Stanford University School of Medicine, Stanford, CA, 94305, USA
| | - Marylyn D Ritchie
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
- Center for Precision Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Benjamin F Voight
- Corporal Michael Crescenz VA Medical Center, Philadelphia, PA, 19104, USA.
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA.
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA.
- Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA.
- Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA.
| |
Collapse
|
4
|
Hébert F, Causeur D, Emily M. An adaptive decorrelation procedure for signal detection. Comput Stat Data Anal 2021. [DOI: 10.1016/j.csda.2020.107082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
5
|
Zhao SD, Nguyen YT. Nonparametric false discovery rate control for identifying simultaneous signals. Electron J Stat 2020. [DOI: 10.1214/19-ejs1663] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
6
|
Hu Y, Li M, Lu Q, Weng H, Wang J, Zekavat SM, Yu Z, Li B, Gu J, Muchnik S, Shi Y, Kunkle BW, Mukherjee S, Natarajan P, Naj A, Kuzma A, Zhao Y, Crane PK, Lu H, Zhao H. A statistical framework for cross-tissue transcriptome-wide association analysis. Nat Genet 2019; 51:568-576. [PMID: 30804563 PMCID: PMC6788740 DOI: 10.1038/s41588-019-0345-7] [Citation(s) in RCA: 220] [Impact Index Per Article: 36.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2018] [Accepted: 01/09/2019] [Indexed: 12/12/2022]
Abstract
Transcriptome-wide association analysis is a powerful approach to studying the genetic architecture of complex traits. A key component of this approach is to build a model to impute gene expression levels from genotypes by using samples with matched genotypes and gene expression data in a given tissue. However, it is challenging to develop robust and accurate imputation models with a limited sample size for any single tissue. Here, we first introduce a multi-task learning method to jointly impute gene expression in 44 human tissues. Compared with single-tissue methods, our approach achieved an average of 39% improvement in imputation accuracy and generated effective imputation models for an average of 120% more genes. We describe a summary-statistic-based testing framework that combines multiple single-tissue associations into a powerful metric to quantify the overall gene-trait association. We applied our method, called UTMOST (unified test for molecular signatures), to multiple genome-wide-association results and demonstrate its advantages over single-tissue strategies.
Collapse
Affiliation(s)
- Yiming Hu
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA
| | - Mo Li
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA
| | - Qiongshi Lu
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, USA
| | - Haoyi Weng
- Division of Biostatistics, The Jockey Club School of Public Health and Primary Care, The Chinese University of Hong Kong, Shatin, NT, Hong Kong
| | - Jiawei Wang
- Program of Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA
| | - Seyedeh M Zekavat
- Yale School of Medicine, New Haven, CT, USA
- Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Zhaolong Yu
- Program of Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA
| | - Boyang Li
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA
| | - Jianlei Gu
- SJTU-Yale Joint Center for Biostatistics, Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiaotong University, Shanghai, China
| | - Sydney Muchnik
- Department of Genetics, Yale School of Medicine, New Haven, CT, USA
| | - Yu Shi
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA
| | - Brian W Kunkle
- John P. Hussman Institute for Human Genomics, University of Miami Miller School of Medicine, Miami, FL, USA
| | | | - Pradeep Natarajan
- Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Adam Naj
- Center for Clinical Epidemiology and Biostatistic, and the Department of Biostatistics, Epidemiology, and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Amanda Kuzma
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Yi Zhao
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Paul K Crane
- Department of Medicine, University of Washington, Seattle, WA, USA
| | - Hui Lu
- SJTU-Yale Joint Center for Biostatistics, Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiaotong University, Shanghai, China
| | - Hongyu Zhao
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA.
- Program of Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA.
- SJTU-Yale Joint Center for Biostatistics, Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiaotong University, Shanghai, China.
- Department of Genetics, Yale School of Medicine, New Haven, CT, USA.
| |
Collapse
|
7
|
Zhao SD, Cai TT, Li H. Optimal detection of weak positive latent dependence between two sequences of multiple tests. J MULTIVARIATE ANAL 2017; 160:169-184. [PMID: 29203948 PMCID: PMC5711487 DOI: 10.1016/j.jmva.2017.06.009] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
It is frequently of interest to jointly analyze two paired sequences of multiple tests. This paper studies the problem of detecting whether there are more pairs of tests that are significant in both sequences than would be expected by chance. The asymptotic detection boundary is derived in terms of parameters such as the sparsity of non-null cases in each sequence, the effect sizes of the signals, and the magnitude of the dependence between the two sequences. A new test for detecting weak dependence is also proposed, shown to be asymptotically adaptively optimal, studied in simulations, and applied to study genetic pleiotropy in 10 pediatric autoimmune diseases.
Collapse
Affiliation(s)
- Sihai Dave Zhao
- Department of Statistics, University of Illinois at Urbana-Champaign, IL, United States
| | - T. Tony Cai
- Department of Statistics, The Wharton School, University of Pennsylvania, Philadelphia, PA, United States
| | - Hongzhe Li
- Department of Biostatistics and Epidemiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
| |
Collapse
|