1
|
Ma L, Wang J, Chen H, Liu L. Semiparametrically Efficient Method for Enveloped Central Space. J Am Stat Assoc 2023; 119:2166-2177. [PMID: 39464305 PMCID: PMC11499873 DOI: 10.1080/01621459.2023.2252134] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2022] [Revised: 06/07/2023] [Accepted: 08/13/2023] [Indexed: 10/29/2024]
Abstract
The estimation of the central space is at the core of the sufficient dimension reduction (SDR) literature. However, it is well known that the finite-sample estimation suffers from collinearity among predictors. Cook et al. (2013) proposed the predictor envelope method under linear models that can alleviate the problem by targeting a bigger space - which not only envelopes the central information, but also partitions the predictors by finding an uncorrelated set of material and immaterial predictors. One limitation of the predictor envelope is that it has strong distributional and modeling assumptions and therefore, it cannot be readily used in semiparametric settings where SDR usually nests. In this paper, we generalize the envelope model by defining the enveloped central space and propose a semiparametric method to estimate it. We derive the entire class of regular and asymptotically linear (RAL) estimators as well as the locally and globally semiparametrically efficient estimators for the enveloped central space. Based on the connection between predictor envelope and partial least square (PLS), our methods can also be used to calculate the PLS space beyond linearity. In the simulations, our methods are shown to be both robust and accurate for estimating the enveloped central space under different settings. Moreover, the downstream analysis using state-of-the-art methods such as machine learning (ML) methods has the potential to achieve much better predictions. We further illustrate our methods in a heart failure study.
Collapse
Affiliation(s)
- Linquan Ma
- School of Statistics, University of Minnesota at Twin Cities
- Department of Statistics, University of Wisconsin-Madison
| | - Jixin Wang
- School of Statistics, University of Minnesota at Twin Cities
- Department of Statistics, Rice University
| | - Han Chen
- School of Statistics, University of Minnesota at Twin Cities
- Department of Statistics, University of California at Davis
| | - Lan Liu
- School of Statistics, University of Minnesota at Twin Cities
| |
Collapse
|
2
|
Dennis Cook R, Forzani L, Liu L. Partial least squares for simultaneous reduction of response and predictor vectors in regression. J MULTIVARIATE ANAL 2023. [DOI: 10.1016/j.jmva.2023.105163] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/21/2023]
|
3
|
Dong Y, Soale AN, Power MD. A selective review of sufficient dimension reduction for multivariate response regression. J Stat Plan Inference 2023. [DOI: 10.1016/j.jspi.2023.02.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/24/2023]
|
4
|
Su Z, Li B, Cook D. Envelope model for function-on-function linear regression. J Comput Graph Stat 2023. [DOI: 10.1080/10618600.2022.2163652] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Affiliation(s)
- Zhihua Su
- Department of Statistics, University of Florida
| | - Bing Li
- Department of Statistics, Pennsylvania State University
| | - Dennis Cook
- School of Statistics, University of Minnesota
| |
Collapse
|
5
|
Basa J, Cook RD, Forzani L, Marcos M. Asymptotic distribution of one‐component partial least squares regression estimators in high dimensions. CAN J STAT 2022. [DOI: 10.1002/cjs.11755] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Affiliation(s)
- Jerónimo Basa
- Departamento de Matemática, Facultad de Ingeniería Química Universidad Nacional del Litoral ‐ CONICET Santa Fe Argentina
| | - R. Dennis Cook
- School of Statistics University of Minnesota Minneapolis USA
| | - Liliana Forzani
- Departamento de Matemática, Facultad de Ingeniería Química Universidad Nacional del Litoral ‐ CONICET Santa Fe Argentina
| | - Miguel Marcos
- Departamento de Matemática, Facultad de Ingeniería Química Universidad Nacional del Litoral ‐ CONICET Santa Fe Argentina
| |
Collapse
|
6
|
Franks AM. Reducing subspace models for large-scale covariance regression. Biometrics 2022; 78:1604-1613. [PMID: 34458980 DOI: 10.1111/biom.13531] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2020] [Revised: 06/29/2021] [Accepted: 07/08/2021] [Indexed: 12/30/2022]
Abstract
We develop an envelope model for joint mean and covariance regression in the large p, small n setting. In contrast to existing envelope methods, which improve mean estimates by incorporating estimates of the covariance structure, we focus on identifying covariance heterogeneity by incorporating information about mean-level differences. We use a Monte Carlo EM algorithm to identify a low-dimensional subspace that explains differences in both means and covariances as a function of covariates, and then use MCMC to estimate the posterior uncertainty conditional on the inferred low-dimensional subspace. We demonstrate the utility of our model on a motivating application on the metabolomics of aging. We also provide R code that can be used to develop and test other generalizations of the response envelope model.
Collapse
Affiliation(s)
- Alexander M Franks
- Department of Statistics and Applied Probability, University of California Santa Barbara, Santa Barbara, California, USA
| |
Collapse
|
7
|
Park Y, Su Z, Chung D. Envelope-based partial partial least squares with application to cytokine-based biomarker analysis for COVID-19. Stat Med 2022; 41:4578-4592. [PMID: 36111618 PMCID: PMC9350235 DOI: 10.1002/sim.9526] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 05/27/2022] [Accepted: 06/27/2022] [Indexed: 11/18/2022]
Abstract
Partial least squares (PLS) regression is a popular alternative to ordinary least squares regression because of its superior prediction performance demonstrated in many cases. In various contemporary applications, the predictors include both continuous and categorical variables. A common practice in PLS regression is to treat the categorical variable as continuous. However, studies find that this practice may lead to biased estimates and invalid inferences (Schuberth et al., 2018). Based on a connection between the envelope model and PLS, we develop an envelope-based partial PLS estimator that considers the PLS regression on the conditional distributions of the response(s) and continuous predictors on the categorical predictors. Root-n consistency and asymptotic normality are established for this estimator. Numerical study shows that this approach can achieve more efficiency gains in estimation and produce better predictions. The method is applied for the identification of cytokine-based biomarkers for COVID-19 patients, which reveals the association between the cytokine-based biomarkers and patients' clinical information including disease status at admission and demographical characteristics. The efficient estimation leads to a clear scientific interpretation of the results.
Collapse
Affiliation(s)
- Yeonhee Park
- Department of Biostatistics and Medical InformaticsUniversity of Wisconsin‐MadisonMadisonWisconsinUSA
| | - Zhihua Su
- Department of StatisticsUniversity of FloridaGainesvilleFloridaUSA
| | - Dongjun Chung
- Department of Biomedical InformaticsThe Ohio State UniversityColumbusOhioUSA
| |
Collapse
|
8
|
Extreme partial least-squares. J MULTIVARIATE ANAL 2022. [DOI: 10.1016/j.jmva.2022.105101] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
9
|
Zhao Y, Van Keilegom I, Ding S. Envelopes for censored quantile regression. Scand Stat Theory Appl 2022. [DOI: 10.1111/sjos.12602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Yue Zhao
- Research Centre for Operations Research and Statistics (ORSTAT), KU Leuven
| | | | - Shanshan Ding
- Department of Applied Economics and Statistics University of Delaware
| |
Collapse
|
10
|
Response envelopes for linear coregionalization models. J MULTIVARIATE ANAL 2022. [DOI: 10.1016/j.jmva.2022.105015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
11
|
Cook RD. A slice of multivariate dimension reduction. J MULTIVARIATE ANAL 2022. [DOI: 10.1016/j.jmva.2021.104812] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
12
|
Ekvall KO. Targeted principal components regression. J MULTIVARIATE ANAL 2022. [DOI: 10.1016/j.jmva.2022.104995] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
13
|
Zhang J, Huang Z. Efficient simultaneous partial envelope model in multivariate linear regression. J STAT COMPUT SIM 2021. [DOI: 10.1080/00949655.2021.1995866] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Affiliation(s)
- Jing Zhang
- School of Science, Nanjing University of Science and Technology, Nanjing, Jiangsu, People's Republic of China
- School of Mathematics and Finance, Chuzhou University, Chuzhou, Anhui, People's Republic of China
| | - Zhensheng Huang
- School of Science, Nanjing University of Science and Technology, Nanjing, Jiangsu, People's Republic of China
| |
Collapse
|
14
|
Ma L, Liu L, Yang W. Envelope method with ignorable missing data. Electron J Stat 2021; 15:4420-4461. [PMID: 37842008 PMCID: PMC10571183 DOI: 10.1214/21-ejs1881] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2023]
Abstract
Envelope method was recently proposed as a method to reduce the dimension of responses in multivariate regressions. However, when there exists missing data, the envelope method using the complete case observations may lead to biased and inefficient results. In this paper, we generalize the envelope estimation when the predictors and/or the responses are missing at random. Specifically, we incorporate the envelope structure in the expectation-maximization (EM) algorithm. As the parameters under the envelope method are not pointwise identifiable, the EM algorithm for the envelope method was not straightforward and requires a special decomposition. Our method is guaranteed to be more efficient, or at least as efficient as, the standard EM algorithm. Moreover, our method has the potential to outperform the full data MLE. We give asymptotic properties of our method under both normal and non-normal cases. The efficiency gain over the standard EM is confirmed in simulation studies and in an application to the Chronic Renal Insufficiency Cohort (CRIC) study.
Collapse
Affiliation(s)
- Linquan Ma
- Department of Statistics, University of Wisconsin - Madison, Madison, Wisconsin, USA
- School of Statistics, University of Minnesota at Twin Cities, Minneapolis, Minnesota, USA
| | - Lan Liu
- School of Statistics, University of Minnesota at Twin Cities, Minneapolis, Minnesota, USA
| | - Wei Yang
- Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| |
Collapse
|
15
|
Zhang J, Huang Z, Jiang Z. Groupwise partial envelope model: efficient estimation in multivariate linear regression. COMMUN STAT-SIMUL C 2021. [DOI: 10.1080/03610918.2021.1921800] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Affiliation(s)
- Jing Zhang
- School of Science, Nanjing University of Science and Technology, Nanjing, Jiangsu, P. R. China
- School of Mathematics and Finance, Chuzhou University, Chuzhou, Anhui, P. R. China
| | - Zhensheng Huang
- School of Science, Nanjing University of Science and Technology, Nanjing, Jiangsu, P. R. China
| | - Zhiqiang Jiang
- School of Science, Nanjing University of Science and Technology, Nanjing, Jiangsu, P. R. China
| |
Collapse
|
16
|
Affiliation(s)
- Yuyang Shi
- School of Industrial and Systems Engineering Georgia Institute of Technology Atlanta 30332 GA USA
- School of Statistics University of Minnesota at Twin Cities Minneapolis 55455 MN USA
| | - Linquan Ma
- Department of Statistics University of Wisconsin‐Madison Madison 53706 WI USA
- School of Statistics University of Minnesota at Twin Cities Minneapolis 55455 MN USA
| | - Lan Liu
- School of Statistics University of Minnesota at Twin Cities Minneapolis 55455 MN USA
| |
Collapse
|
17
|
Comparing six shrinkage estimators with large sample theory and asymptotically optimal prediction intervals. Stat Pap (Berl) 2020. [DOI: 10.1007/s00362-020-01193-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
18
|
Zhang X, Lee CE, Shao X. Envelopes in multivariate regression models with nonlinearity and heteroscedasticity. Biometrika 2020. [DOI: 10.1093/biomet/asaa036] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Summary
Envelopes have been proposed in recent years as a nascent methodology for sufficient dimension reduction and efficient parameter estimation in multivariate linear models. We extend the classical definition of envelopes in Cook et al. (2010) to incorporate a nonlinear conditional mean function and a heteroscedastic error. Given any two random vectors ${X}\in\mathbb{R}^{p}$ and ${Y}\in\mathbb{R}^{r}$, we propose two new model-free envelopes, called the martingale difference divergence envelope and the central mean envelope, and study their relationships to the standard envelope in the context of response reduction in multivariate linear models. The martingale difference divergence envelope effectively captures the nonlinearity in the conditional mean without imposing any parametric structure or requiring any tuning in estimation. Heteroscedasticity, or nonconstant conditional covariance of ${Y}\mid{X}$, is further detected by the central mean envelope based on a slicing scheme for the data. We reveal the nested structure of different envelopes: (i) the central mean envelope contains the martingale difference divergence envelope, with equality when ${Y}\mid{X}$ has a constant conditional covariance; and (ii) the martingale difference divergence envelope contains the standard envelope, with equality when ${Y}\mid{X}$ has a linear conditional mean. We develop an estimation procedure that first obtains the martingale difference divergence envelope and then estimates the additional envelope components in the central mean envelope. We establish consistency in envelope estimation of the martingale difference divergence envelope and central mean envelope without stringent model assumptions. Simulations and real-data analysis demonstrate the advantages of the martingale difference divergence envelope and the central mean envelope over the standard envelope in dimension reduction.
Collapse
Affiliation(s)
- X Zhang
- Department of Statistics, Florida State University, 117 N.Woodward Ave., Tallahassee, Florida 32306, U.S.A
| | - C E Lee
- Department of Business Analytics and Statistics, University of Tennessee, Knoxville, 916 Volunteer Blvd, Knoxville, Tennessee 37996, U.S.A
| | - X Shao
- Department of Statistics, University of Illinois at Urbana Champaign, 725 South Wright St, Champaign, Illinois 61820, U.S.A
| |
Collapse
|
19
|
Kong D, An B, Zhang J, Zhu H. L2RM: Low-rank Linear Regression Models for High-dimensional Matrix Responses. J Am Stat Assoc 2020; 115:403-424. [PMID: 33408427 PMCID: PMC7781207 DOI: 10.1080/01621459.2018.1555092] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2017] [Revised: 11/11/2018] [Accepted: 11/26/2018] [Indexed: 10/27/2022]
Abstract
The aim of this paper is to develop a low-rank linear regression model (L2RM) to correlate a high-dimensional response matrix with a high dimensional vector of covariates when coefficient matrices have low-rank structures. We propose a fast and efficient screening procedure based on the spectral norm of each coefficient matrix in order to deal with the case when the number of covariates is extremely large. We develop an efficient estimation procedure based on the trace norm regularization, which explicitly imposes the low rank structure of coefficient matrices. When both the dimension of response matrix and that of covariate vector diverge at the exponential order of the sample size, we investigate the sure independence screening property under some mild conditions. We also systematically investigate some theoretical properties of our estimation procedure including estimation consistency, rank consistency and non-asymptotic error bound under some mild conditions. We further establish a theoretical guarantee for the overall solution of our two-step screening and estimation procedure. We examine the finite-sample performance of our screening and estimation methods using simulations and a large-scale imaging genetic dataset collected by the Philadelphia Neurodevelopmental Cohort (PNC) study.
Collapse
Affiliation(s)
- Dehan Kong
- Department of Statistical Sciences, University of Toronto
| | - Baiguo An
- School of Statistics, Capital University of Economics and Business
| | - Jingwen Zhang
- Department of Biostatistics, University of North Carolina at Chapel Hill
| | - Hongtu Zhu
- Department of Biostatistics, University of North Carolina at Chapel Hill
| |
Collapse
|
20
|
Eck DJ, Geyer CJ, Cook RD. Combining envelope methodology and aster models for variance reduction in life history analyses. J Stat Plan Inference 2020. [DOI: 10.1016/j.jspi.2019.08.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
21
|
|
22
|
Affiliation(s)
- Minji Lee
- Department of Statistics University of Florida Gainesville Florida USA
| | - Zhihua Su
- Department of Statistics University of Florida Gainesville Florida USA
| |
Collapse
|
23
|
|
24
|
Chen T, Su Z, Yang Y, Ding S. Efficient estimation in expectile regression using envelope models. Electron J Stat 2020. [DOI: 10.1214/19-ejs1664] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
25
|
Wang W, Zhang X, Li L. Common reducing subspace model and network alternation analysis. Biometrics 2019; 75:1109-1120. [DOI: 10.1111/biom.13099] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2018] [Accepted: 05/22/2019] [Indexed: 12/21/2022]
Affiliation(s)
- Wenjing Wang
- Department of Statistics Florida State University Tallahassee Florida
| | - Xin Zhang
- Department of Statistics Florida State University Tallahassee Florida
| | - Lexin Li
- Department of Biostatistics and Epidemiology University of California Berkeley California
| |
Collapse
|
26
|
|
27
|
Jain Y, Ding S, Qiu J. Sliced inverse regression for integrative multi-omics data analysis. Stat Appl Genet Mol Biol 2019; 18:/j/sagmb.ahead-of-print/sagmb-2018-0028/sagmb-2018-0028.xml. [PMID: 30685747 DOI: 10.1515/sagmb-2018-0028] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Advancement in next-generation sequencing, transcriptomics, proteomics and other high-throughput technologies has enabled simultaneous measurement of multiple types of genomic data for cancer samples. These data together may reveal new biological insights as compared to analyzing one single genome type data. This study proposes a novel use of supervised dimension reduction method, called sliced inverse regression, to multi-omics data analysis to improve prediction over a single data type analysis. The study further proposes an integrative sliced inverse regression method (integrative SIR) for simultaneous analysis of multiple omics data types of cancer samples, including MiRNA, MRNA and proteomics, to achieve integrative dimension reduction and to further improve prediction performance. Numerical results show that integrative analysis of multi-omics data is beneficial as compared to single data source analysis, and more importantly, that supervised dimension reduction methods possess advantages in integrative data analysis in terms of classification and prediction as compared to unsupervised dimension reduction methods.
Collapse
Affiliation(s)
- Yashita Jain
- Center for Bioinformatics and Computational Biology, University of Delaware, 15 Innovation Way, Newark, DE 19711, USA
| | - Shanshan Ding
- Center for Bioinformatics and Computational Biology, University of Delaware, 15 Innovation Way, Newark, DE 19711, USA.,Department of Applied Economics and Statistics, University of Delaware, 531 S College Ave., Newark, DE 19711, USA
| | - Jing Qiu
- Center for Bioinformatics and Computational Biology, University of Delaware, 15 Innovation Way, Newark, DE 19711, USA.,Department of Applied Economics and Statistics, University of Delaware, 531 S College Ave., Newark, DE 19711, USA
| |
Collapse
|
28
|
Sutton M, Mengersen K, Liquet B. [HDDA] sparse subspace constrained partial least squares. J STAT COMPUT SIM 2018. [DOI: 10.1080/00949655.2018.1555830] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Affiliation(s)
- Matthew Sutton
- School of Mathematical Sciences, ARC Centre of Excellence for Mathematical and Statistical Frontiers, Queensland University of Technology, Brisbane, Australia
| | - Kerrie Mengersen
- School of Mathematical Sciences, ARC Centre of Excellence for Mathematical and Statistical Frontiers, Queensland University of Technology, Brisbane, Australia
| | - Benoit Liquet
- School of Mathematical Sciences, ARC Centre of Excellence for Mathematical and Statistical Frontiers, Queensland University of Technology, Brisbane, Australia
- Laboratory of Mathematics and Their Applications, University of Pau and Pays de lAdour, Pau, France
| |
Collapse
|
29
|
Zhang X, Mai Q. Efficient Integration of Sufficient Dimension Reduction and Prediction in Discriminant Analysis. Technometrics 2018. [DOI: 10.1080/00401706.2018.1512901] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Affiliation(s)
- Xin Zhang
- Department of Statistics, Florida State University, Tallahassee, FL
| | - Qing Mai
- Department of Statistics, Florida State University, Tallahassee, FL
| |
Collapse
|
30
|
Affiliation(s)
- Lei Wang
- Department of Applied Economics and Statistics; University of Delaware; Newark DE 19716 USA
| | - Shanshan Ding
- Department of Applied Economics and Statistics; University of Delaware; Newark DE 19716 USA
| |
Collapse
|
31
|
|
32
|
Zhang X, Wang C, Wu Y. Functional envelope for model-free sufficient dimension reduction. J MULTIVARIATE ANAL 2018. [DOI: 10.1016/j.jmva.2017.09.010] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
33
|
Park Y, Su Z, Zhu H. Groupwise envelope models for imaging genetic analysis. Biometrics 2017; 73:1243-1253. [PMID: 28323341 PMCID: PMC5608647 DOI: 10.1111/biom.12689] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2016] [Revised: 02/01/2017] [Accepted: 02/01/2017] [Indexed: 11/28/2022]
Abstract
Motivated by searching for associations between genetic variants and brain imaging phenotypes, the aim of this article is to develop a groupwise envelope model for multivariate linear regression in order to establish the association between both multivariate responses and covariates. The groupwise envelope model allows for both distinct regression coefficients and distinct error structures for different groups. Statistically, the proposed envelope model can dramatically improve efficiency of tests and of estimation. Theoretical properties of the proposed model are established. Numerical experiments as well as the analysis of an imaging genetic data set obtained from the Alzheimer's Disease Neuroimaging Initiative (ADNI) study show the effectiveness of the model in efficient estimation. Data used in preparation of this article were obtained from the Alzheimer's Disease Neuroimaging Initiative (ADNI) database.
Collapse
Affiliation(s)
- Yeonhee Park
- Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, U.S.A
| | - Zhihua Su
- Department of Statistics, University of Florida, Gainesville, FL 32611, U.S.A
| | - Hongtu Zhu
- Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, U.S.A
| |
Collapse
|
34
|
Ding S, Dennis Cook R. Matrix variate regressions and envelope models. J R Stat Soc Series B Stat Methodol 2017. [DOI: 10.1111/rssb.12247] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
35
|
Affiliation(s)
- Xin Zhang
- Department of Statistics, Florida State University, Tallahassee, FL
| | - Lexin Li
- Division of Biostatistics, University of California, Berkeley, CA
| |
Collapse
|
36
|
Affiliation(s)
- Lexin Li
- Division of Biostatistics, University of California at Berkeley, Berkley, CA
| | - Xin Zhang
- Department of Statistics, Florida State University, Tallahassee, FL
| |
Collapse
|
37
|
Affiliation(s)
- R. Dennis Cook
- School of Statistics; University of Minnesota; Minneapolis, MN 55455
| | - Liliana Forzani
- Researcher of CONICET; Facultad de Ingeniería Química; UNL, Santiago del Estero 2819, Santa Fe Argentina
| |
Collapse
|
38
|
|
39
|
Su Z, Zhu G, Chen X, Yang Y. Sparse envelope model: efficient estimation and response variable selection in multivariate linear regression. Biometrika 2016. [DOI: 10.1093/biomet/asw036] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
40
|
|
41
|
Affiliation(s)
- R. Dennis Cook
- School of Statistics 313 Ford Hall, 224 Church St. SE, University of Minnesota, Minneapolis, MN 55455
| | - Zhihua Su
- Department of Statistics 102 Griffin-Floyd Hall, University of Florida, Gainesville, FL 32606
| |
Collapse
|
42
|
Li G, Yang D, Nobel AB, Shen H. Supervised singular value decomposition and its asymptotic properties. J MULTIVARIATE ANAL 2016. [DOI: 10.1016/j.jmva.2015.02.016] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
43
|
|
44
|
|
45
|
Li Y, Udén P, von Rosen D. A two-step estimation method for grouped data with connections to the extended growth curve model and partial least squares regression. J MULTIVARIATE ANAL 2015. [DOI: 10.1016/j.jmva.2015.03.011] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
46
|
Sun Q, Zhu H, Liu Y, Ibrahim JG. SPReM: Sparse Projection Regression Model For High-dimensional Linear Regression. J Am Stat Assoc 2015; 110:289-302. [PMID: 26527844 DOI: 10.1080/01621459.2014.892008] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Abstract
The aim of this paper is to develop a sparse projection regression modeling (SPReM) framework to perform multivariate regression modeling with a large number of responses and a multivariate covariate of interest. We propose two novel heritability ratios to simultaneously perform dimension reduction, response selection, estimation, and testing, while explicitly accounting for correlations among multivariate responses. Our SPReM is devised to specifically address the low statistical power issue of many standard statistical approaches, such as the Hotelling's T2 test statistic or a mass univariate analysis, for high-dimensional data. We formulate the estimation problem of SPREM as a novel sparse unit rank projection (SURP) problem and propose a fast optimization algorithm for SURP. Furthermore, we extend SURP to the sparse multi-rank projection (SMURP) by adopting a sequential SURP approximation. Theoretically, we have systematically investigated the convergence properties of SURP and the convergence rate of SURP estimates. Our simulation results and real data analysis have shown that SPReM out-performs other state-of-the-art methods.
Collapse
Affiliation(s)
- Qiang Sun
- Department of Biostatistics, University of North Carolina at Chapel Hill, NC 27599-7420
| | - Hongtu Zhu
- Department of Biostatistics, University of North Carolina at Chapel Hill, NC 27599-7420
| | - Yufeng Liu
- Department of Statistics and Operation Research, University of North Carolina at Chapel Hill, CB 3260, Chapel Hill, NC 27599
| | - Joseph G Ibrahim
- Department of Biostatistics, University of North Carolina at Chapel Hill, NC 27599-7420
| | | |
Collapse
|
47
|
|
48
|
Affiliation(s)
- R. Dennis Cook
- School of Statistics, University of Minnesota, Minneapolis, MN 55455
| | - Xin Zhang
- Department of Statistics Florida State University, Tallahassee, FL 32306
| |
Collapse
|
49
|
|
50
|
HELLAND INGES, SAEBØ SOLVE, TJELMELAND HA. Near Optimal Prediction from Relevant Components. Scand Stat Theory Appl 2012. [DOI: 10.1111/j.1467-9469.2011.00770.x] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|