1
|
Statistical inference on the significance of rows and columns for matrix-valued data in an additive model. TEST-SPAIN 2023. [DOI: 10.1007/s11749-023-00852-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/16/2023]
|
2
|
Zhang J, Sun WW, Li L. Generalized Connectivity Matrix Response Regression with Applications in Brain Connectivity Studies. J Comput Graph Stat 2022; 32:252-262. [PMID: 36970553 PMCID: PMC10035565 DOI: 10.1080/10618600.2022.2074434] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Accepted: 04/23/2022] [Indexed: 10/18/2022]
Abstract
Multiple-subject network data are fast emerging in recent years, where a separate connectivity matrix is measured over a common set of nodes for each individual subject, along with subject covariates information. In this article, we propose a new generalized matrix response regression model, where the observed network is treated as a matrix-valued response and the subject covariates as predictors. The new model characterizes the population-level connectivity pattern through a low-rank intercept matrix, and the effect of subject covariates through a sparse slope tensor. We develop an efficient alternating gradient descent algorithm for parameter estimation, and establish the non-asymptotic error bound for the actual estimator from the algorithm, which quantifies the interplay between the computational and statistical errors. We further show the strong consistency for graph community recovery, as well as the edge selection consistency. We demonstrate the efficacy of our method through simulations and two brain connectivity studies.
Collapse
Affiliation(s)
- Jingfei Zhang
- Department of Management Science, Miami Herbert Business School, University of Miami, Miami, FL, 33146
| | - Will Wei Sun
- Krannert School of Management, Purdue University, West Lafayette, IN, 47906
| | - Lexin Li
- Department of Biostatistics and Epidemiology, School of Public Health, University of California at Berkeley, Berkeley, CA, 94720
| |
Collapse
|
3
|
Zhou Y, Zhang AR, Zheng L, Wang Y. Optimal High-order Tensor SVD via Tensor-Train Orthogonal Iteration. IEEE TRANSACTIONS ON INFORMATION THEORY 2022; 68:3991-4019. [PMID: 36274655 PMCID: PMC9585995 DOI: 10.1109/tit.2022.3152733] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
This paper studies a general framework for high-order tensor SVD. We propose a new computationally efficient algorithm, tensor-train orthogonal iteration (TTOI), that aims to estimate the low tensor-train rank structure from the noisy high-order tensor observation. The proposed TTOI consists of initialization via TT-SVD [1] and new iterative backward/forward updates. We develop the general upper bound on estimation error for TTOI with the support of several new representation lemmas on tensor matricizations. By developing a matching information-theoretic lower bound, we also prove that TTOI achieves the minimax optimality under the spiked tensor model. The merits of the proposed TTOI are illustrated through applications to estimation and dimension reduction of high-order Markov processes, numerical studies, and a real data example on New York City taxi travel records. The software of the proposed algorithm is available online (https://github.com/Lili-Zheng-stat/TTOI).
Collapse
Affiliation(s)
- Yuchen Zhou
- Department of Statistics and Data Science, The Wharton School, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Anru R Zhang
- Departments of Biostatistics & Bioinformatics, Computer Science, Mathematics, and Statistical Science, Duke University, Durham, NC 27710, USA
| | - Lili Zheng
- Department of Electrical and Computer Engineering, Rice University, Houston, TX 77005, USA
| | - Yazhen Wang
- Department of Statistics, University of Wisconsin-Madison, Madison, WI 53706, USA
| |
Collapse
|
4
|
Cai JF, Li J, Xia D. Generalized Low-rank plus Sparse Tensor Estimation by Fast Riemannian Optimization. J Am Stat Assoc 2022. [DOI: 10.1080/01621459.2022.2063131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Affiliation(s)
- Jian-Feng Cai
- Department of Mathematics, Hong Kong University of Science and Technology
| | - Jingyang Li
- Department of Mathematics, Hong Kong University of Science and Technology
| | - Dong Xia
- Department of Mathematics, Hong Kong University of Science and Technology
| |
Collapse
|
5
|
Xia D, Zhang AR, Zhou Y. Inference for low-rank tensors—no need to debias. Ann Stat 2022. [DOI: 10.1214/21-aos2146] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Affiliation(s)
- Dong Xia
- Department of Mathematics, Hong Kong University of Science and Technology
| | - Anru R. Zhang
- Departments of Biostatistics & Bioinformatics, Computer Science, Mathematics, and Statistical Science, Duke University
| | - Yuchen Zhou
- Department of Statistics and Data Science, The Wharton School, University of Pennsylvania
| |
Collapse
|
6
|
Li L, Zeng J, Zhang X. Generalized Liquid Association Analysis for Multimodal Data Integration. J Am Stat Assoc 2022; 118:1984-1996. [PMID: 38099062 PMCID: PMC10720690 DOI: 10.1080/01621459.2021.2024437] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2021] [Accepted: 12/27/2021] [Indexed: 10/19/2022]
Abstract
Multimodal data are now prevailing in scientific research. One of the central questions in multimodal integrative analysis is to understand how two data modalities associate and interact with each other given another modality or demographic variables. The problem can be formulated as studying the associations among three sets of random variables, a question that has received relatively less attention in the literature. In this article, we propose a novel generalized liquid association analysis method, which offers a new and unique angle to this important class of problems of studying three-way associations. We extend the notion of liquid association of Li (2002) from the univariate setting to the sparse, multivariate, and high-dimensional setting. We establish a population dimension reduction model, transform the problem to sparse Tucker decomposition of a three-way tensor, and develop a higher-order orthogonal iteration algorithm for parameter estimation. We derive the non-asymptotic error bound and asymptotic consistency of the proposed estimator, while allowing the variable dimensions to be larger than and diverge with the sample size. We demonstrate the efficacy of the method through both simulations and a multimodal neuroimaging application for Alzheimer's disease research.
Collapse
Affiliation(s)
- Lexin Li
- University of California at Berkeley
| | | | | |
Collapse
|
7
|
|
8
|
Han R, Willett R, Zhang AR. An optimal statistical and computational framework for generalized tensor estimation. Ann Stat 2022. [DOI: 10.1214/21-aos2061] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Affiliation(s)
- Rungang Han
- Department of Statistics, University of Wisconsin-Madison
| | - Rebecca Willett
- Departments of Statistics and Computer Science, University of Chicago
| | - Anru R. Zhang
- Department of Statistics, University of Wisconsin-Madison
| |
Collapse
|
9
|
Zhang J, Yuan Y, Qu A. Tensor factorization recommender systems with dependency. Electron J Stat 2022. [DOI: 10.1214/22-ejs1978] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Affiliation(s)
- Jiuchen Zhang
- Department of Statistics, University of California, Irvine
| | - Yubai Yuan
- Department of Statistics, University of California, Irvine
| | - Annie Qu
- Department of Statistics, University of California, Irvine
| |
Collapse
|
10
|
Tang X, Li L. Multivariate Temporal Point Process Regression. J Am Stat Assoc 2021; 118:830-845. [PMID: 37519438 PMCID: PMC10373792 DOI: 10.1080/01621459.2021.1955690] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2020] [Revised: 05/23/2021] [Accepted: 07/05/2021] [Indexed: 10/20/2022]
Abstract
Point process modeling is gaining increasing attention, as point process type data are emerging in a large variety of scientific applications. In this article, motivated by a neuronal spike trains study, we propose a novel point process regression model, where both the response and the predictor can be a high-dimensional point process. We model the predictor effects through the conditional intensities using a set of basis transferring functions in a convolutional fashion. We organize the corresponding transferring coefficients in the form of a three-way tensor, then impose the low-rank, sparsity, and subgroup structures on this coefficient tensor. These structures help reduce the dimensionality, integrate information across different individual processes, and facilitate the interpretation. We develop a highly scalable optimization algorithm for parameter estimation. We derive the large sample error bound for the recovered coefficient tensor, and establish the subgroup identification consistency, while allowing the dimension of the multivariate point process to diverge. We demonstrate the efficacy of our method through both simulations and a cross-area neuronal spike trains analysis in a sensory cortex study.
Collapse
Affiliation(s)
- Xiwei Tang
- Department of Statistics, University of Virginia, Charlottesville, VA
| | - Lexin Li
- Department of Biostatistics and Epidemiology, University of California, Berkeley, CA
| |
Collapse
|
11
|
Deng Y, Tang X, Qu A. Correlation Tensor Decomposition and Its Application in Spatial Imaging Data. J Am Stat Assoc 2021. [DOI: 10.1080/01621459.2021.1938083] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Affiliation(s)
- Yujia Deng
- Department of Statistics, University of Illinois, Urbana-Champaign, IL
| | - Xiwei Tang
- Department of Statistics, University of Virginia, Charlottesville, VA
| | - Annie Qu
- Department of Statistics, University of California, Irvine, CA
| |
Collapse
|
12
|
Deng K, Zhang X. Tensor envelope mixture model for simultaneous clustering and multiway dimension reduction. Biometrics 2021; 78:1067-1079. [PMID: 34010459 DOI: 10.1111/biom.13486] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2020] [Revised: 04/28/2021] [Accepted: 05/06/2021] [Indexed: 11/26/2022]
Abstract
In the form of multidimensional arrays, tensor data have become increasingly prevalent in modern scientific studies and biomedical applications such as computational biology, brain imaging analysis, and process monitoring system. These data are intrinsically heterogeneous with complex dependencies and structure. Therefore, ad-hoc dimension reduction methods on tensor data may lack statistical efficiency and can obscure essential findings. Model-based clustering is a cornerstone of multivariate statistics and unsupervised learning; however, existing methods and algorithms are not designed for tensor-variate samples. In this article, we propose a tensor envelope mixture model (TEMM) for simultaneous clustering and multiway dimension reduction of tensor data. TEMM incorporates tensor-structure-preserving dimension reduction into mixture modeling and drastically reduces the number of free parameters and estimative variability. An expectation-maximization-type algorithm is developed to obtain likelihood-based estimators of the cluster means and covariances, which are jointly parameterized and constrained onto a series of lower dimensional subspaces known as the tensor envelopes. We demonstrate the encouraging empirical performance of the proposed method in extensive simulation studies and a real data application in comparison with existing vector and tensor clustering methods.
Collapse
Affiliation(s)
- Kai Deng
- Department of Statistics, Florida State University, Tallahassee, Florida, USA
| | - Xin Zhang
- Department of Statistics, Florida State University, Tallahassee, Florida, USA
| |
Collapse
|
13
|
Dai B, Shen X, Pan W. Two-level monotonic multistage recommender systems. Electron J Stat 2021. [DOI: 10.1214/21-ejs1924] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Affiliation(s)
- Ben Dai
- School of Statistics, University of Minnesota, Minneapolis, MN 55455, Department of Statistics, The Chinese University of Hong Kong, Shatin, Hong Kong SAR
| | - Xiaotong Shen
- School of Statistics, University of Minnesota, Minneapolis, MN 55455
| | - Wei Pan
- Division of Biostatistics, University of Minnesota, Minneapolis, MN 55455
| |
Collapse
|
14
|
A Low-Rank Tensor Factorization Using Implicit Similarity in Trust Relationships. Symmetry (Basel) 2020. [DOI: 10.3390/sym12030439] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Low-rank tensor factorization can not only mine the implicit relationships between data but also fill in the missing data when working with complex data. Compared with the traditional collaborative filtering (CF) algorithm, the changes are essentially proposed, from traditional matrix analysis to three-dimensional spatial analysis. Based on low-rank tensor factorization, this paper proposes a recommendation model that comprehensively considers local information and global information, in other words, combining the similarity between trust users and low-rank tensor factorization. First, the similarity between trusted users is measured to capture local information between users by trusting similar preferences of users when selecting items. Then, the users’ similarity is integrated into the tensor, and the low-rank tensor factorization is used to better maintain and describe the internal structure of the data to obtain global information. Furthermore, based on the idea of the alternating least squares method, the conjugate gradient (CG) optimization algorithm for the model of this paper is designed. The local and global information is used to generate the optimal expected result in an iterative process. Finally, we conducted a large number of comparative experiments on the Ciao dataset and the FilmTrust dataset. Experimental results show that the algorithm has less precision loss under the data set with lower density. Thus, not only can a perfect compromise between accuracy and coverage be achieved, but also the computational complexity can be reduced to meet the need for real-time results.
Collapse
|
15
|
Affiliation(s)
- Will Wei Sun
- Department of Management Science, University of Miami Business School, Miami, FL
| | - Lexin Li
- Division of Biostatistics, University of California, Berkeley, Berkeley, CA
| |
Collapse
|
16
|
Bi X, Qu A, Shen X. Multilayer tensor factorization with applications to recommender systems. Ann Stat 2018. [DOI: 10.1214/17-aos1659] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|