1
|
Bryner D, Srivastava A. Shape Analysis of Functional Data With Elastic Partial Matching. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2022; 44:9589-9602. [PMID: 34818189 PMCID: PMC9714315 DOI: 10.1109/tpami.2021.3130535] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Elastic Riemannian metrics have been used successfully for statistical treatments of functional and curve shape data. However, this usage suffers from a significant restriction: the function boundaries are assumed to be fixed and matched. In practice, functional data often comes with unmatched boundaries. It happens, for example, in dynamical systems with variable evolution rates, such as COVID-19 infection rate curves associated with different geographical regions. Here, we develop a Riemannian framework that allows for partial matching, comparing, and clustering of functions with phase variability and uncertain boundaries. We extend past work by (1) Defining a new diffeomorphism group G over the positive reals that is the semidirect product of a time-warping group and a time-scaling group; (2) Introducing a metric that is invariant to the action of G; (3) Imposing a Riemannian Lie group structure on G to allow for an efficient gradient-based optimization for elastic partial matching; and (4) Presenting a modification that, while losing the metric property, allows one to control the amount of boundary disparity in the registration. We illustrate this framework by registering and clustering shapes of COVID-19 rate curves, identifying basic patterns, minimizing mismatch errors, and reducing variability within clusters compared to previous methods.
Collapse
|
2
|
Curve Registration of Functional Data for Approximate Bayesian Computation. STATS 2021. [DOI: 10.3390/stats4030045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Approximate Bayesian computation is a likelihood-free inference method which relies on comparing model realisations to observed data with informative distance measures. We obtain functional data that are not only subject to noise along their y axis but also to a random warping along their x axis, which we refer to as the time axis. Conventional distances on functions, such as the L2 distance, are not informative under these conditions. The Fisher–Rao metric, previously generalised from the space of probability distributions to the space of functions, is an ideal objective function for aligning one function to another by warping the time axis. We assess the usefulness of alignment with the Fisher–Rao metric for approximate Bayesian computation with four examples: two simulation examples, an example about passenger flow at an international airport, and an example of hydrological flow modelling. We find that the Fisher–Rao metric works well as the objective function to minimise for alignment; however, once the functions are aligned, it is not necessarily the most informative distance for inference. This means that likelihood-free inference may require two distances: one for alignment and one for parameter inference.
Collapse
|
3
|
Regression models using shapes of functions as predictors. Comput Stat Data Anal 2020. [DOI: 10.1016/j.csda.2020.107017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
4
|
Pini A, Markström JL, Schelin L. Test–retest reliability measures for curve data: an overview with recommendations and supplementary code. Sports Biomech 2019; 21:179-200. [DOI: 10.1080/14763141.2019.1655089] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Affiliation(s)
- Alessia Pini
- Department of Statistics, Umeå School of Business, Economics and Statistics, Umeå University, Umeå, Sweden
- Department of Statistical Sciences, Catholic University of the Sacred Heart, Milan, Italy
| | - Jonas L Markström
- Department of Community Medicine and Rehabilitation, Physiotherapy, Umeå University, Umeå, Sweden
| | - Lina Schelin
- Department of Statistics, Umeå School of Business, Economics and Statistics, Umeå University, Umeå, Sweden
| |
Collapse
|
5
|
Chen Y, Marron JS, Zhang J. Modeling seasonality and serial dependence of electricity price curves with warping functional autoregressive dynamics. Ann Appl Stat 2019. [DOI: 10.1214/18-aoas1234] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
6
|
Pini A, Spreafico L, Vantini S, Vietti A. Multi-aspect local inference for functional data: Analysis of ultrasound tongue profiles. J MULTIVARIATE ANAL 2019. [DOI: 10.1016/j.jmva.2018.11.006] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
7
|
Abstract
In studies of gait, continuous measurement of force exerted by the ground on a body, or ground reaction force (GRF), provides valuable insights into biomechanics, locomotion, and the possible presence of pathology. However, gold-standard measurement of GRF requires a costly in-lab observation obtained with sophisticated equipment and computer systems. Recently, in-shoe sensors have been pursued as a relatively inexpensive alternative to in-lab measurement. In this study, we explore the properties of continuous in-shoe sensor recordings using a functional data analysis approach. Our case study is based on measurements of three healthy subjects, with more than 300 stances (defined as the period between the foot striking and lifting from the ground) per subject. The sensor data show both phase and amplitude variabilities; we separate these sources via curve registration. We examine the correlation of phase shifts across sensors within a stance to evaluate the pattern of phase variability shared across sensors. Using the registered curves, we explore possible associations between in-shoe sensor recordings and GRF measurements to evaluate the in-shoe sensor recordings as a possible surrogate for in-lab GRF measurements.
Collapse
|
8
|
Pigoli D, Hadjipantelis PZ, Coleman JS, Aston JAD. The statistical analysis of acoustic phonetic data: exploring differences between spoken Romance languages. J R Stat Soc Ser C Appl Stat 2018. [DOI: 10.1111/rssc.12258] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
9
|
Zhang Z, Klassen E, Srivastava A. Phase-Amplitude Separation and Modeling of Spherical Trajectories. J Comput Graph Stat 2018. [DOI: 10.1080/10618600.2017.1340892] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Affiliation(s)
- Zhengwu Zhang
- Department of Biostatistics and Computational Biology, University of Rochester Medical Center, Rochester, NY
| | - Eric Klassen
- Department of Mathematics, Florida State University, Tallahassee, FL
| | - Anuj Srivastava
- Department of Statistics, Florida State University, Tallahassee, FL
| |
Collapse
|
10
|
Warmenhoven J, Cobley S, Draper C, Harrison A, Bargary N, Smith R. Considerations for the use of functional principal components analysis in sports biomechanics: examples from on-water rowing. Sports Biomech 2017; 18:317-341. [DOI: 10.1080/14763141.2017.1392594] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Affiliation(s)
- John Warmenhoven
- Exercise and Sports Science, Faculty of Health Sciences, University of Sydney, Sydney, Australia
| | - Stephen Cobley
- Exercise and Sports Science, Faculty of Health Sciences, University of Sydney, Sydney, Australia
| | - Conny Draper
- Exercise and Sports Science, Faculty of Health Sciences, University of Sydney, Sydney, Australia
| | - Andrew Harrison
- Physical Education and Sport Sciences Department, University of Limerick, Limerick, Ireland
| | - Norma Bargary
- Department of Mathematics and Statistics, University of Limerick, Limerick, Ireland
| | - Richard Smith
- Exercise and Sports Science, Faculty of Health Sciences, University of Sydney, Sydney, Australia
| |
Collapse
|
11
|
Abstract
We congratulate the authors for their excellent work that provides a clear overview of the large and now mature field of regression models for functional data. We here complement their discussion indicating some directions of further research that we deem particularly important.
Collapse
|
12
|
Abstract
Researchers are increasingly interested in regression models for functional data. This article discusses a comprehensive framework for additive (mixed) models for functional responses and/or functional covariates based on the guiding principle of reframing functional regression in terms of corresponding models for scalar data, allowing the adaptation of a large body of existing methods for these novel tasks. The framework encompasses many existing as well as new models. It includes regression for ‘generalized’ functional data, mean regression, quantile regression as well as generalized additive models for location, shape and scale (GAMLSS) for functional data. It admits many flexible linear, smooth or interaction terms of scalar and functional covariates as well as (functional) random effects and allows flexible choices of bases—particularly splines and functional principal components—and corresponding penalties for each term. It covers functional data observed on common (dense) or curve-specific (sparse) grids. Penalized-likelihood-based and gradient-boosting-based inference for these models are implemented in R packages refund and FDboost , respectively. We also discuss identifiability and computational complexity for the functional regression models covered. A running example on a longitudinal multiple sclerosis imaging study serves to illustrate the flexibility and utility of the proposed model class. Reproducible code for this case study is made available online.
Collapse
Affiliation(s)
- Sonja Greven
- Department of Statistics, Ludwig-Maximilians-Universität München, Germany
| | - Fabian Scheipl
- Department of Statistics, Ludwig-Maximilians-Universität München, Germany
| |
Collapse
|
13
|
Kurtek S. A geometric approach to pairwise Bayesian alignment of functional data using importance sampling. Electron J Stat 2017. [DOI: 10.1214/17-ejs1243] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
14
|
|
15
|
Marron JS, Ramsay JO, Sangalli LM, Srivastava A. Functional Data Analysis of Amplitude and Phase Variation. Stat Sci 2015. [DOI: 10.1214/15-sts524] [Citation(s) in RCA: 81] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
16
|
Ieva F, Paganoni AM. Discussion of “multivariate functional outlier detection” by M. Hubert, P. Rousseeuw and P. Segaert. STAT METHOD APPL-GER 2015. [DOI: 10.1007/s10260-015-0303-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|