1
|
Dai X. Nonparametric estimation via partial derivatives. J R Stat Soc Series B Stat Methodol 2025; 87:319-336. [PMID: 40225199 PMCID: PMC11985098 DOI: 10.1093/jrsssb/qkae093] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Revised: 07/04/2024] [Accepted: 08/17/2024] [Indexed: 04/15/2025]
Abstract
Traditional nonparametric estimation methods often lead to a slow convergence rate in large dimensions and require unrealistically large dataset sizes for reliable conclusions. We develop an approach based on partial derivatives, either observed or estimated, to effectively estimate the function at near-parametric convergence rates. This novel approach and computational algorithm could lead to methods useful to practitioners in many areas of science and engineering. Our theoretical results reveal behaviour universal to this class of nonparametric estimation problems. We explore a general setting involving tensor product spaces and build upon the smoothing spline analysis of variance framework. For d-dimensional models under full interaction, the optimal rates with gradient information on p covariates are identical to those for the ( d - p ) -interaction models without gradients and, therefore, the models are immune to the curse of interaction. For additive models, the optimal rates using gradient information are n , thus achieving the parametric rate. We demonstrate aspects of the theoretical results through synthetic and real data applications.
Collapse
Affiliation(s)
- Xiaowu Dai
- Department of Statistics and Data Science, and Biostatistics, University of California, Los Angeles, CA 90095, USA
| |
Collapse
|
2
|
Chen X, Liu H, Men J, You J. High-dimensional partially linear functional Cox models. Biometrics 2025; 81:ujae164. [PMID: 39808421 DOI: 10.1093/biomtc/ujae164] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2024] [Revised: 11/30/2024] [Accepted: 12/20/2024] [Indexed: 01/16/2025]
Abstract
As a commonly employed method for analyzing time-to-event data involving functional predictors, the functional Cox model assumes a linear relationship between the functional principal component (FPC) scores of the functional predictors and the hazard rates. However, in practical scenarios, such as our study on the survival time of kidney transplant recipients, this assumption often fails to hold. To address this limitation, we introduce a class of high-dimensional partially linear functional Cox models, which accommodates the non-linear effects of functional predictors on the response and allows for diverging numbers of scalar predictors and FPCs as the sample size increases. We employ the group smoothly clipped absolute deviation method to select relevant scalar predictors and FPCs, and use B-splines to obtain a smoothed estimate of the non-linear effect. The finite sample performance of the estimates is evaluated through simulation studies. The model is also applied to a kidney transplant dataset, allowing us to make inferences about the non-linear effects of functional predictors on patients' hazard rates, as well as to identify significant scalar predictors for long-term survival time.
Collapse
Affiliation(s)
- Xin Chen
- School of Statistics and Mathematics, Shanghai Lixin University of Accounting and Finance, Shanghai 201209, China
- Institute of Data Science and Interdisciplinary Studies, Shanghai Lixin University of Accounting and Finance, Shanghai 201209, China
| | - Hua Liu
- School of Economics and Finance, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China
| | - Jiaqi Men
- School of Statistics and Management, Shanghai University of Finance and Economics, Shanghai 200433, China
| | - Jinhong You
- School of Statistics and Management, Shanghai University of Finance and Economics, Shanghai 200433, China
| |
Collapse
|
3
|
Rao AR, Reimherr M. Non-linear Functional Modeling using Neural Networks. J Comput Graph Stat 2023. [DOI: 10.1080/10618600.2023.2165498] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
|
4
|
Liu Y, Li Y, Carroll RJ, Wang N. Predictive Functional Linear Models with Diverging Number of Semiparametric Single-Index Interactions. JOURNAL OF ECONOMETRICS 2022; 230:221-239. [PMID: 36017081 PMCID: PMC9398183 DOI: 10.1016/j.jeconom.2021.03.010] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
When predicting crop yield using both functional and multivariate predictors, the prediction performances benefit from the inclusion of the interactions between the two sets of predictors. We assume the interaction depends on a nonparametric, single-index structure of the multivariate predictor and reduce each functional predictor's dimension using functional principal component analysis (FPCA). Allowing the number of FPCA scores to diverge to infinity, we consider a sequence of semiparametric working models with a diverging number of predictors, which are FPCA scores with estimation errors. We show that the parametric component of the model is root-n consistent and asymptotically normal, the overall prediction error is dominated by the estimation of the nonparametric interaction function, and justify a CV-based procedure to select the tuning parameters.
Collapse
Affiliation(s)
- Yanghui Liu
- School of Economics and Statistics, Guangzhou University, China
| | - Yehua Li
- Department of Statistics, University of California, Riverside, CA, 92521, USA
| | - Raymond J Carroll
- Department of Statistics, Texas A&M University, College Station, TX 77843-3143, and School of Mathematical and Physical Sciences, University of Technology Sydney, Broadway NSW 2007, Australia
| | - Naisyin Wang
- Department of Statistics, University of Michigan, Ann Arbor, MI 48109, USA
| |
Collapse
|
5
|
Park Y, Li B, Li Y. Crop Yield Prediction Using Bayesian Spatially Varying Coefficient Models with Functional Predictors. J Am Stat Assoc 2022. [DOI: 10.1080/01621459.2022.2123333] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Affiliation(s)
- Yeonjoo Park
- Management Science and Statistics, University of Texas at San Antonio
| | - Bo Li
- Department of Statistics, University of Illinois at Urbana-Champaign
| | - Yehua Li
- Department of Statistics, University of California at Riverside
| |
Collapse
|
6
|
Abstract
In this paper, we study statistical inference in functional quantile regression for scalar response and a functional covariate. Specifically, we consider a functional linear quantile regression model where the effect of the covariate on the quantile of the response is modeled through the inner product between the functional covariate and an unknown smooth regression parameter function that varies with the level of quantile. The objective is to test that the regression parameter is constant across several quantile levels of interest. The parameter function is estimated by combining ideas from functional principal component analysis and quantile regression. An adjusted Wald testing procedure is proposed for this hypothesis of interest, and its chi-square asymptotic null distribution is derived. The testing procedure is investigated numerically in simulations involving sparse and noisy functional covariates and in a capital bike share data application. The proposed approach is easy to implement and the R code is published online at https://github.com/xylimeng/fQR-testing.
Collapse
Affiliation(s)
- Meng Li
- Department of Statistics, Rice University, Houston, TX
| | | | - Arnab Maity
- Department of Statistics, North Carolina State University, Raleigh, NC
| | - Ana-Maria Staicu
- Department of Statistics, North Carolina State University, Raleigh, NC
| |
Collapse
|
7
|
Xu J, Cui W. A new RKHS-based global testing for functional linear model. Stat Probab Lett 2022. [DOI: 10.1016/j.spl.2021.109277] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
8
|
Naiman J, Song PX. Multivariate Functional Kernel Machine Regression and Sparse Functional Feature Selection. ENTROPY 2022; 24:e24020203. [PMID: 35205498 PMCID: PMC8871497 DOI: 10.3390/e24020203] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/04/2022] [Revised: 01/25/2022] [Accepted: 01/26/2022] [Indexed: 02/04/2023]
Abstract
Motivated by mobile devices that record data at a high frequency, we propose a new methodological framework for analyzing a semi-parametric regression model that allow us to study a nonlinear relationship between a scalar response and multiple functional predictors in the presence of scalar covariates. Utilizing functional principal component analysis (FPCA) and the least-squares kernel machine method (LSKM), we are able to substantially extend the framework of semi-parametric regression models of scalar responses on scalar predictors by allowing multiple functional predictors to enter the nonlinear model. Regularization is established for feature selection in the setting of reproducing kernel Hilbert spaces. Our method performs simultaneously model fitting and variable selection on functional features. For the implementation, we propose an effective algorithm to solve related optimization problems in that iterations take place between both linear mixed-effects models and a variable selection method (e.g., sparse group lasso). We show algorithmic convergence results and theoretical guarantees for the proposed methodology. We illustrate its performance through simulation experiments and an analysis of accelerometer data.
Collapse
|
9
|
Abstract
Ordinary differential equation (ODE) is widely used in modeling biological and physical processes in science. In this article, we propose a new reproducing kernel-based approach for estimation and inference of ODE given noisy observations. We do not assume the functional forms in ODE to be known, or restrict them to be linear or additive, and we allow pairwise interactions. We perform sparse estimation to select individual functionals, and construct confidence intervals for the estimated signal trajectories. We establish the estimation optimality and selection consistency of kernel ODE under both the low-dimensional and high-dimensional settings, where the number of unknown functionals can be smaller or larger than the sample size. Our proposal builds upon the smoothing spline analysis of variance (SS-ANOVA) framework, but tackles several important problems that are not yet fully addressed, and thus extends the scope of existing SS-ANOVA as well. We demonstrate the efficacy of our method through numerous ODE examples.
Collapse
Affiliation(s)
- Xiaowu Dai
- Department of Economics and Simons Institute for the Theory of Computing, the University of California, Berkeley, Berkeley, CA
| | - Lexin Li
- Department of Economics and Simons Institute for the Theory of Computing, the University of California, Berkeley, Berkeley, CA
- Department of Biostatistics and Epidemiology, the University of California, Berkeley, Berkeley, CA
| |
Collapse
|
10
|
Solea E, Dette H. Nonparametric and high-dimensional functional graphical models. Electron J Stat 2022. [DOI: 10.1214/22-ejs2087] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Affiliation(s)
- Eftychia Solea
- CREST and ENSAI, Rennes, France, Ruhr-Universität Bochum, Germany
| | - Holger Dette
- CREST and ENSAI, Rennes, France, Ruhr-Universität Bochum, Germany
| |
Collapse
|
11
|
Yi M, Li Z, Tang Y. F‐type testing in functional linear models. Stat (Int Stat Inst) 2021. [DOI: 10.1002/sta4.420] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Affiliation(s)
- Menghan Yi
- School of Statistics East China Normal University Shanghai China
| | - Zaixing Li
- School of Science, China University of Mining and Technology (Beijing) Beijing China
| | - Yanlin Tang
- School of Statistics East China Normal University Shanghai China
| |
Collapse
|
12
|
Jadhav S, Ma S. An association test for functional data based on Kendall’s Tau. J MULTIVARIATE ANAL 2021. [DOI: 10.1016/j.jmva.2021.104740] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
13
|
Wang J, Wong RKW, Zhang X. Low-Rank Covariance Function Estimation for Multidimensional Functional Data. J Am Stat Assoc 2020. [DOI: 10.1080/01621459.2020.1820344] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Affiliation(s)
- Jiayi Wang
- Department of Statistics, Texas A&M University, College Station, TX
| | | | - Xiaoke Zhang
- Department of Statistics, George Washington University, Washington, DC
| |
Collapse
|
14
|
Additive functional regression in reproducing kernel Hilbert spaces under smoothness condition. METRIKA 2020. [DOI: 10.1007/s00184-020-00797-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
15
|
Nonparametric operator-regularized covariance function estimation for functional data. Comput Stat Data Anal 2019. [DOI: 10.1016/j.csda.2018.05.013] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
16
|
Wong RKW, Li Y, Zhu Z. Partially Linear Functional Additive Models for Multivariate Functional Data. J Am Stat Assoc 2019. [DOI: 10.1080/01621459.2017.1411268] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Affiliation(s)
| | - Yehua Li
- Department of Statistics, University of California, Riverside, CA
| | - Zhengyuan Zhu
- Department of Statistics & Statistical Laboratory, Iowa State University, Ames, IA
| |
Collapse
|
17
|
Maeng H, Fryzlewicz P. Regularised forecasting via smooth-rough partitioning of the regression coefficients. Electron J Stat 2019. [DOI: 10.1214/19-ejs1573] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
18
|
Lin H, Jiang X, Lian H, Zhang W. Reduced rank modeling for functional regression with functional responses. J MULTIVARIATE ANAL 2019. [DOI: 10.1016/j.jmva.2018.09.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
19
|
|
20
|
Han K, Müller HG, Park BU. Smooth backfitting for additive modeling with small errors-in-variables, with an application to additive functional regression for multiple predictor functions. BERNOULLI 2018. [DOI: 10.3150/16-bej898] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
21
|
Reimherr M, Sriperumbudur B, Taoufik B. Optimal prediction for additive function-on-function regression. Electron J Stat 2018. [DOI: 10.1214/18-ejs1505] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
22
|
Yang J, Cox DD, Lee JS, Ren P, Choi T. Efficient Bayesian hierarchical functional data analysis with basis function approximations using Gaussian-Wishart processes. Biometrics 2017; 73:1082-1091. [PMID: 28395117 PMCID: PMC5634932 DOI: 10.1111/biom.12705] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2016] [Revised: 03/01/2017] [Accepted: 03/01/2017] [Indexed: 11/28/2022]
Abstract
Functional data are defined as realizations of random functions (mostly smooth functions) varying over a continuum, which are usually collected on discretized grids with measurement errors. In order to accurately smooth noisy functional observations and deal with the issue of high-dimensional observation grids, we propose a novel Bayesian method based on the Bayesian hierarchical model with a Gaussian-Wishart process prior and basis function representations. We first derive an induced model for the basis-function coefficients of the functional data, and then use this model to conduct posterior inference through Markov chain Monte Carlo methods. Compared to the standard Bayesian inference that suffers serious computational burden and instability in analyzing high-dimensional functional data, our method greatly improves the computational scalability and stability, while inheriting the advantage of simultaneously smoothing raw observations and estimating the mean-covariance functions in a nonparametric way. In addition, our method can naturally handle functional data observed on random or uncommon grids. Simulation and real studies demonstrate that our method produces similar results to those obtainable by the standard Bayesian inference with low-dimensional common grids, while efficiently smoothing and estimating functional data with random and high-dimensional observation grids when the standard Bayesian inference fails. In conclusion, our method can efficiently smooth and estimate high-dimensional functional data, providing one way to resolve the curse of dimensionality for Bayesian functional data analysis with Gaussian-Wishart processes.
Collapse
Affiliation(s)
- Jingjing Yang
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan 48109, U.S.A
| | - Dennis D Cox
- Department of Statistics, Rice University, Houston, Texas 77005, U.S.A
| | - Jong Soo Lee
- Department of Mathematical Sciences, University of Massachusetts Lowell, Lowell, Massachusetts 01854, U.S.A
| | - Peng Ren
- Suntrust Banks Inc, Atlanta, Georgia 30308, U.S.A
| | - Taeryon Choi
- Department of Statistics, Korea University, Seoul 136-701, Republic of Korea
| |
Collapse
|
23
|
Willis MJ, von Stosch M. Simultaneous parameter identification and discrimination of the nonparametric structure of hybrid semi-parametric models. Comput Chem Eng 2017. [DOI: 10.1016/j.compchemeng.2017.05.005] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
24
|
Reiss PT, Goldsmith J, Shang HL, Ogden RT. Methods for scalar-on-function regression. Int Stat Rev 2017; 85:228-249. [PMID: 28919663 PMCID: PMC5598560 DOI: 10.1111/insr.12163] [Citation(s) in RCA: 69] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2015] [Accepted: 12/28/2015] [Indexed: 01/16/2023]
Abstract
Recent years have seen an explosion of activity in the field of functional data analysis (FDA), in which curves, spectra, images, etc. are considered as basic functional data units. A central problem in FDA is how to fit regression models with scalar responses and functional data points as predictors. We review some of the main approaches to this problem, categorizing the basic model types as linear, nonlinear and nonparametric. We discuss publicly available software packages, and illustrate some of the procedures by application to a functional magnetic resonance imaging dataset.
Collapse
Affiliation(s)
- Philip T. Reiss
- Department of Child and Adolescent Psychiatry and Department of Population Health, New York University School of Medicine
- Department of Statistics, University of Haifa
| | - Jeff Goldsmith
- Department of Biostatistics, Columbia University Mailman School of Public Health
| | - Han Lin Shang
- Research School of Finance, Actuarial Studies and Statistics, Australian National University
| | - R. Todd Ogden
- Department of Biostatistics, Columbia University Mailman School of Public Health
- New York State Psychiatric Institute
| |
Collapse
|
25
|
Su YR, Di CZ, Hsu L. Hypothesis testing in functional linear models. Biometrics 2017; 73:551-561. [PMID: 28295175 DOI: 10.1111/biom.12624] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2015] [Revised: 08/01/2016] [Accepted: 10/01/2016] [Indexed: 11/30/2022]
Abstract
Functional data arise frequently in biomedical studies, where it is often of interest to investigate the association between functional predictors and a scalar response variable. While functional linear models (FLM) are widely used to address these questions, hypothesis testing for the functional association in the FLM framework remains challenging. A popular approach to testing the functional effects is through dimension reduction by functional principal component (PC) analysis. However, its power performance depends on the choice of the number of PCs, and is not systematically studied. In this article, we first investigate the power performance of the Wald-type test with varying thresholds in selecting the number of PCs for the functional covariates, and show that the power is sensitive to the choice of thresholds. To circumvent the issue, we propose a new method of ordering and selecting principal components to construct test statistics. The proposed method takes into account both the association with the response and the variation along each eigenfunction. We establish its theoretical properties and assess the finite sample properties through simulations. Our simulation results show that the proposed test is more robust against the choice of threshold while being as powerful as, and often more powerful than, the existing method. We then apply the proposed method to the cerebral white matter tracts data obtained from a diffusion tensor imaging tractography study.
Collapse
Affiliation(s)
- Yu-Ru Su
- Biostatistics, Division of Public Health Sciences Fred Hutchinson Cancer Research Center, Seattle, Washington, U.S.A
| | - Chong-Zhi Di
- Biostatistics, Division of Public Health Sciences Fred Hutchinson Cancer Research Center, Seattle, Washington, U.S.A
| | - Li Hsu
- Biostatistics, Division of Public Health Sciences Fred Hutchinson Cancer Research Center, Seattle, Washington, U.S.A
| |
Collapse
|
26
|
Kong D, Staicu AM, Maity A. Classical Testing in Functional Linear Models. J Nonparametr Stat 2016; 28:813-838. [PMID: 28955155 PMCID: PMC5611856 DOI: 10.1080/10485252.2016.1231806] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2015] [Accepted: 07/09/2016] [Indexed: 10/21/2022]
Abstract
We extend four tests common in classical regression - Wald, score, likelihood ratio and F tests - to functional linear regression, for testing the null hypothesis, that there is no association between a scalar response and a functional covariate. Using functional principal component analysis, we re-express the functional linear model as a standard linear model, where the effect of the functional covariate can be approximated by a finite linear combination of the functional principal component scores. In this setting, we consider application of the four traditional tests. The proposed testing procedures are investigated theoretically for densely observed functional covariates when the number of principal components diverges. Using the theoretical distribution of the tests under the alternative hypothesis, we develop a procedure for sample size calculation in the context of functional linear regression. The four tests are further compared numerically for both densely and sparsely observed noisy functional data in simulation experiments and using two real data applications.
Collapse
Affiliation(s)
- Dehan Kong
- Department of Biostatistics, University of North Carolina, Chapel Hill, NC, 27599, U.S.A
| | - Ana-Maria Staicu
- Department of Statistics, North Carolina State University, Raleigh, NC 27695, U.S.A
| | - Arnab Maity
- Department of Statistics, North Carolina State University, Raleigh, NC 27695, U.S.A
| |
Collapse
|
27
|
Zhang L, Baladandayuthapani V, Zhu H, Baggerly KA, Majewski T, Czerniak BA, Morris JS. Functional CAR models for large spatially correlated functional datasets. J Am Stat Assoc 2016; 111:772-786. [PMID: 28018013 DOI: 10.1080/01621459.2015.1042581] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Abstract
We develop a functional conditional autoregressive (CAR) model for spatially correlated data for which functions are collected on areal units of a lattice. Our model performs functional response regression while accounting for spatial correlations with potentially nonseparable and nonstationary covariance structure, in both the space and functional domains. We show theoretically that our construction leads to a CAR model at each functional location, with spatial covariance parameters varying and borrowing strength across the functional domain. Using basis transformation strategies, the nonseparable spatial-functional model is computationally scalable to enormous functional datasets, generalizable to different basis functions, and can be used on functions defined on higher dimensional domains such as images. Through simulation studies, we demonstrate that accounting for the spatial correlation in our modeling leads to improved functional regression performance. Applied to a high-throughput spatially correlated copy number dataset, the model identifies genetic markers not identified by comparable methods that ignore spatial correlations.
Collapse
Affiliation(s)
- Lin Zhang
- The University of Texas M.D. Anderson Cancer Center, Houston, Texas, U.S.A
| | | | | | - Keith A Baggerly
- The University of Texas M.D. Anderson Cancer Center, Houston, Texas, U.S.A
| | - Tadeusz Majewski
- The University of Texas M.D. Anderson Cancer Center, Houston, Texas, U.S.A
| | - Bogdan A Czerniak
- The University of Texas M.D. Anderson Cancer Center, Houston, Texas, U.S.A
| | - Jeffrey S Morris
- The University of Texas M.D. Anderson Cancer Center, Houston, Texas, U.S.A
| |
Collapse
|
28
|
Chen K, Zhang X, Petersen A, Müller HG. Quantifying Infinite-Dimensional Data: Functional Data Analysis in Action. STATISTICS IN BIOSCIENCES 2015. [DOI: 10.1007/s12561-015-9137-5] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|