Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Guo Z, Wang W, Cai TT, Li H. Optimal Estimation of Genetic Relatedness in High-dimensional Linear Models. J Am Stat Assoc 2018;114:358-369. [PMID: 38434789 PMCID: PMC10907007 DOI: 10.1080/01621459.2017.1407774] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2016] [Revised: 10/01/2017] [Indexed: 10/18/2022]

For:	Guo Z, Wang W, Cai TT, Li H. Optimal Estimation of Genetic Relatedness in High-dimensional Linear Models. J Am Stat Assoc 2018;114:358-369. [PMID: 38434789 PMCID: PMC10907007 DOI: 10.1080/01621459.2017.1407774] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2016] [Revised: 10/01/2017] [Indexed: 10/18/2022]

Number

Cited by Other Article(s)

Zhao B, Yang X, Zhu H. Estimating trans-ancestry genetic correlation with unbalanced data resources. J Am Stat Assoc 2024;119:839-850. [PMID: 39219674 PMCID: PMC11364214 DOI: 10.1080/01621459.2024.2344703] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2022] [Accepted: 04/07/2024] [Indexed: 09/04/2024]

Zhao B, Zou F, Zhu H. Cross-trait prediction accuracy of summary statistics in genome-wide association studies. Biometrics 2023;79:841-853. [PMID: 35278218 PMCID: PMC9464799 DOI: 10.1111/biom.13661] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2020] [Accepted: 02/25/2022] [Indexed: 11/27/2022]

Abstract

In the era of big data, univariate models have widely been used as a workhorse tool for quickly producing marginal estimators; and this is true even when in a high-dimensional dense setting, in which many features are "true," but weak signals. Genome-wide association studies (GWAS) epitomize this type of setting. Although the GWAS marginal estimator is popular, it has long been criticized for ignoring the correlation structure of genetic variants (i.e., the linkage disequilibrium [LD] pattern). In this paper, we study the effects of LD pattern on the GWAS marginal estimator and investigate whether or not additionally accounting for the LD can improve the prediction accuracy of complex traits. We consider a general high-dimensional dense setting for GWAS and study a class of ridge-type estimators, including the popular marginal estimator and the best linear unbiased prediction (BLUP) estimator as two special cases. We show that the performance of GWAS marginal estimator depends on the LD pattern through the first three moments of its eigenvalue distribution. Furthermore, we uncover that the relative performance of GWAS marginal and BLUP estimators highly depends on the ratio of GWAS sample size over the number of genetic variants. Particularly, our finding reveals that the marginal estimator can easily become near-optimal within this class when the sample size is relatively small, even though it ignores the LD pattern. On the other hand, BLUP estimator has substantially better performance than the marginal estimator as the sample size increases toward the number of genetic variants, which is typically in millions. Therefore, adjusting for the LD (such as in the BLUP) is most needed when GWAS sample size is large. We illustrate the importance of our results by using the simulated data and real GWAS.

Collapse

Carpentier A, Collier O, Comminges L, Tsybakov AB, Wang Y. Estimation of the ℓ2-norm and testing in sparse linear regression with unknown variance. BERNOULLI 2022. [DOI: 10.3150/21-bej1436] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Chen HY, Li H, Argos M, Persky VW, Turyk ME. Statistical Methods for Assessing the Explained Variation of a Health Outcome by a Mixture of Exposures. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022;19:2693. [PMID: 35270383 PMCID: PMC8910055 DOI: 10.3390/ijerph19052693] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/31/2021] [Revised: 02/13/2022] [Accepted: 02/18/2022] [Indexed: 12/04/2022]

Livne I, Azriel D, Goldberg Y. Improved estimators for semi-supervised high-dimensional regression model. Electron J Stat 2022. [DOI: 10.1214/22-ejs2070] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Chen X, Liu Q, Tong XT. Dimension independent excess risk by stochastic gradient descent. Electron J Stat 2022. [DOI: 10.1214/22-ejs2055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Zhang Y, Lu Q, Ye Y, Huang K, Liu W, Wu Y, Zhong X, Li B, Yu Z, Travers BG, Werling DM, Li JJ, Zhao H. SUPERGNOVA: local genetic correlation analysis reveals heterogeneous etiologic sharing of complex traits. Genome Biol 2021;22:262. [PMID: 34493297 PMCID: PMC8422619 DOI: 10.1186/s13059-021-02478-w] [Citation(s) in RCA: 53] [Impact Index Per Article: 17.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2020] [Accepted: 08/23/2021] [Indexed: 01/09/2023] Open

Affiliation(s)

Yiliang Zhang Department of Biostatistics, Yale School of Public Health, 60 College Street, New Haven, CT, 06520, USA
Qiongshi Lu Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, 53706, USA Department of Statistics, University of Wisconsin-Madison, Madison, WI, 53706, USA Center for Demography of Health and Aging, University of Wisconsin-Madison, Madison, WI, 53706, USA
Yixuan Ye Program of Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06510, USA
Kunling Huang Department of Statistics, University of Wisconsin-Madison, Madison, WI, 53706, USA
Wei Liu Program of Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06510, USA
Yuchang Wu Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, 53706, USA
Xiaoyuan Zhong Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, 53706, USA
Boyang Li Department of Biostatistics, Yale School of Public Health, 60 College Street, New Haven, CT, 06520, USA
Zhaolong Yu Program of Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06510, USA
Brittany G Travers Occupational Therapy Program in the Department of Kinesiology, University of Wisconsin-Madison, Madison, WI, 53706, USA Waisman Center, University of Wisconsin-Madison, Madison, WI, 53705, USA
Donna M Werling Waisman Center, University of Wisconsin-Madison, Madison, WI, 53705, USA Laboratory of Genetics, University of Wisconsin-Madison, Madison, WI, 53706, USA
James J Li Waisman Center, University of Wisconsin-Madison, Madison, WI, 53705, USA Department of Psychology, University of Wisconsin-Madison, Madison, WI, 53706, USA
Hongyu Zhao Department of Biostatistics, Yale School of Public Health, 60 College Street, New Haven, CT, 06520, USA. Program of Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06510, USA. Department of Genetics, Yale School of Medicine, New Haven, CT, 06510, USA.

Collapse

Zhang Y, Cheng Y, Jiang W, Ye Y, Lu Q, Zhao H. Comparison of methods for estimating genetic correlation between complex traits using GWAS summary statistics. Brief Bioinform 2021;22:bbaa442. [PMID: 33497438 PMCID: PMC8425307 DOI: 10.1093/bib/bbaa442] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2020] [Revised: 12/12/2020] [Accepted: 12/30/2020] [Indexed: 01/03/2023] Open

Comminges L, Collier O, Ndaoud M, Tsybakov AB. Adaptive robust estimation in sparse vector model. Ann Stat 2021. [DOI: 10.1214/20-aos2002] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Zhao B, Zhu H. On Genetic Correlation Estimation With Summary Statistics From Genome-Wide Association Studies. J Am Stat Assoc 2021;117:1-11. [PMID: 35757777 PMCID: PMC9232179 DOI: 10.1080/01621459.2021.1906684] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2019] [Revised: 03/12/2021] [Accepted: 03/16/2021] [Indexed: 01/03/2023]

Wang J, Li H. Estimation of genetic correlation with summary association statistics. Biometrika 2021. [DOI: 10.1093/biomet/asab030] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open

Guo Z, Renaux C, Bühlmann P, Cai T. Group inference in high dimensions with applications to hierarchical testing. Electron J Stat 2021. [DOI: 10.1214/21-ejs1955] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Javanmard A, Lee JD. A flexible framework for hypothesis testing in high dimensions. J R Stat Soc Series B Stat Methodol 2020. [DOI: 10.1111/rssb.12373] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Tony Cai T, Guo Z. Semisupervised inference for explained variance in high dimensional linear regression and its applications. J R Stat Soc Series B Stat Methodol 2020. [DOI: 10.1111/rssb.12357] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]