1
|
Dong M, Telesca D, Guindani M, Sugar C, Webb SJ, Jeste S, Dickinson A, Levin AR, Shic F, Naples A, Faja S, Dawson G, McPartland JC, Şentürk D. Modeling intra-individual inter-trial EEG response variability in autism. Stat Med 2024; 43:3239-3263. [PMID: 38822707 DOI: 10.1002/sim.10131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2023] [Revised: 03/29/2024] [Accepted: 05/20/2024] [Indexed: 06/03/2024]
Abstract
Autism spectrum disorder (autism) is a prevalent neurodevelopmental condition characterized by early emerging impairments in social behavior and communication. EEG represents a powerful and non-invasive tool for examining functional brain differences in autism. Recent EEG evidence suggests that greater intra-individual trial-to-trial variability across EEG responses in stimulus-related tasks may characterize brain differences in autism. Traditional analysis of EEG data largely focuses on mean trends of the trial-averaged data, where trial-level analysis is rarely performed due to low neural signal to noise ratio. We propose to use nonlinear (shape-invariant) mixed effects (NLME) models to study intra-individual inter-trial EEG response variability using trial-level EEG data. By providing more precise metrics of response variability, this approach could enrich our understanding of neural disparities in autism and potentially aid the identification of objective markers. The proposed multilevel NLME models quantify variability in the signal's interpretable and widely recognized features (e.g., latency and amplitude) while also regularizing estimation based on noisy trial-level data. Even though NLME models have been studied for more than three decades, existing methods cannot scale up to large data sets. We propose computationally feasible estimation and inference methods via the use of a novel minorization-maximization (MM) algorithm. Extensive simulations are conducted to show the efficacy of the proposed procedures. Applications to data from a large national consortium find that children with autism have larger intra-individual inter-trial variability in P1 latency in a visual evoked potential (VEP) task, compared to their neurotypical peers.
Collapse
Affiliation(s)
- Mingfei Dong
- Department of Biostatistics, University of California, Los Angeles, California
| | - Donatello Telesca
- Department of Biostatistics, University of California, Los Angeles, California
| | - Michele Guindani
- Department of Biostatistics, University of California, Los Angeles, California
| | - Catherine Sugar
- Department of Biostatistics, University of California, Los Angeles, California
- Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles, California
| | - Sara J Webb
- Center for Child Health, Behavior and Development, Seattle Children's Research Institute, Seattle, Washington
- Department of Psychiatry and Behavioral Sciences, School of Medicine, University of Washington, Seattle, Washington
| | - Shafali Jeste
- Division of Neurology, Department of Pediatrics, Children's Hospital Los Angeles, Los Angeles, California
| | - Abigail Dickinson
- Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles, California
| | - April R Levin
- Division of Neurology, Boston Children's Hospital, Boston, Massachusetts
- Division of Neurology, Harvard Medical School, Boston, Massachusetts
| | - Frederick Shic
- Center for Child Health, Behavior and Development, Seattle Children's Research Institute, Seattle, Washington
- Department of Pediatrics, School of Medicine, University of Washington, Seattle, Washington
| | - Adam Naples
- Child Study Center, School of Medicine, Yale University, New Haven, Connecticut
| | - Susan Faja
- Laboratory of Cognitive Neuroscience, Division of Developmental Medicine, Boston Children's Hospital, Boston, Massachusetts
- Department of Pediatrics, Harvard Medical School, Boston, Massachusetts
| | - Geraldine Dawson
- Duke Center for Autism and Brain Development, Duke University, Durham, North Carolina
| | - James C McPartland
- Child Study Center, School of Medicine, Yale University, New Haven, Connecticut
| | - Damla Şentürk
- Department of Biostatistics, University of California, Los Angeles, California
| |
Collapse
|
2
|
Li R, Xiao L. Latent factor model for multivariate functional data. Biometrics 2023; 79:3307-3318. [PMID: 37661821 PMCID: PMC10840703 DOI: 10.1111/biom.13924] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2022] [Accepted: 08/17/2023] [Indexed: 09/05/2023]
Abstract
For multivariate functional data, a functional latent factor model is proposed, extending the traditional latent factor model for multivariate data. The proposed model uses unobserved stochastic processes to induce the dependence among the different functions, and thus, for a large number of functions, may provide a more parsimonious and interpretable characterization of the otherwise complex dependencies between the functions. Sufficient conditions are provided to establish the identifiability of the proposed model. The performance of the proposed model is assessed through simulation studies and an application to electroencephalography data.
Collapse
Affiliation(s)
- Ruonan Li
- Department of Statistics, North Carolina State University, Raleigh, North Carolina, U.S.A
| | - Luo Xiao
- Department of Statistics, North Carolina State University, Raleigh, North Carolina, U.S.A
| |
Collapse
|
3
|
Boland J, Telesca D, Sugar C, Jeste S, Dickinson A, DiStefano C, Şentürk D. Central Posterior Envelopes for Bayesian Functional Principal Component Analysis. JOURNAL OF DATA SCIENCE : JDS 2023; 21:715-734. [PMID: 38883309 PMCID: PMC11178334 DOI: 10.6339/23-jds1085] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/18/2024]
Abstract
Bayesian methods provide direct inference in functional data analysis applications without reliance on bootstrap techniques. A major tool in functional data applications is the functional principal component analysis which decomposes the data around a common mean function and identifies leading directions of variation. Bayesian functional principal components analysis (BFPCA) provides uncertainty quantification on the estimated functional model components via the posterior samples obtained. We propose central posterior envelopes (CPEs) for BFPCA based on functional depth as a descriptive visualization tool to summarize variation in the posterior samples of the estimated functional model components, contributing to uncertainty quantification in BFPCA. The proposed BFPCA relies on a latent factor model and targets model parameters within a mixed effects modeling framework using modified multiplicative gamma process shrinkage priors on the variance components. Functional depth provides a center-outward order to a sample of functions. We utilize modified band depth and modified volume depth for ordering of a sample of functions and surfaces, respectively, to derive at CPEs of the mean and eigenfunctions within the BFPCA framework. The proposed CPEs are showcased in extensive simulations. Finally, the proposed CPEs are applied to the analysis of a sample of power spectral densities (PSD) from resting state electroencephalography (EEG) where they lead to novel insights on diagnostic group differences among children diagnosed with autism spectrum disorder and their typically developing peers across age.
Collapse
Affiliation(s)
- Joanna Boland
- Department of Biostatistics, University of California, Los Angeles, Los Angeles, CA 90025, USA
| | - Donatello Telesca
- Department of Biostatistics, University of California, Los Angeles, Los Angeles, CA 90025, USA
| | - Catherine Sugar
- Department of Biostatistics, University of California, Los Angeles, Los Angeles, CA 90025, USA
- Department of Statistics, University of California, Los Angeles, Los Angeles, CA 90025, USA
- Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles, Los Angeles, CA 90025, USA
| | - Shafali Jeste
- Division of Neurology, Children’s Hospital Los Angeles, Los Angeles, CA 90027, USA
| | - Abigail Dickinson
- Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles, Los Angeles, CA 90025, USA
| | - Charlotte DiStefano
- Division of Neurology, Children’s Hospital Los Angeles, Los Angeles, CA 90027, USA
| | - Damla Şentürk
- Department of Biostatistics, University of California, Los Angeles, Los Angeles, CA 90025, USA
- Department of Statistics, University of California, Los Angeles, Los Angeles, CA 90025, USA
| |
Collapse
|
4
|
Cronin JN, Crockett DC, Perchiazzi G, Farmery AD, Camporota L, Formenti F. Intra-tidal PaO 2 oscillations associated with mechanical ventilation: a pilot study to identify discrete morphologies in a porcine model. Intensive Care Med Exp 2023; 11:60. [PMID: 37672140 PMCID: PMC10482813 DOI: 10.1186/s40635-023-00544-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2023] [Accepted: 08/28/2023] [Indexed: 09/07/2023] Open
Abstract
BACKGROUND Within-breath oscillations in arterial oxygen tension (PaO2) can be detected using fast responding intra-arterial oxygen sensors in animal models. These PaO2 signals, which rise in inspiration and fall in expiration, may represent cyclical recruitment/derecruitment and, therefore, a potential clinical monitor to allow titration of ventilator settings in lung injury. However, in hypovolaemia models, these oscillations have the potential to become inverted, such that they decline, rather than rise, in inspiration. This inversion suggests multiple aetiologies may underlie these oscillations. A correct interpretation of the various PaO2 oscillation morphologies is essential to translate this signal into a monitoring tool for clinical practice. We present a pilot study to demonstrate the feasibility of a new analysis method to identify these morphologies. METHODS Seven domestic pigs (average weight 31.1 kg) were studied under general anaesthesia with muscle relaxation and mechanical ventilation. Three underwent saline-lavage lung injury and four were uninjured. Variations in PEEP, tidal volume and presence/absence of lung injury were used to induce different morphologies of PaO2 oscillation. Functional principal component analysis and k-means clustering were employed to separate PaO2 oscillations into distinct morphologies, and the cardiorespiratory physiology associated with these PaO2 morphologies was compared. RESULTS PaO2 oscillations from 73 ventilatory conditions were included. Five functional principal components were sufficient to explain ≥ 95% of the variance of the recorded PaO2 signals. From these, five unique morphologies of PaO2 oscillation were identified, ranging from those which increased in inspiration and decreased in expiration, through to those which decreased in inspiration and increased in expiration. This progression was associated with the estimates of the first functional principal component (P < 0.001, R2 = 0.88). Intermediate morphologies demonstrated waveforms with two peaks and troughs per breath. The progression towards inverted oscillations was associated with increased pulse pressure variation (P = 0.03). CONCLUSIONS Functional principal component analysis and k-means clustering are appropriate to identify unique morphologies of PaO2 waveform associated with distinct cardiorespiratory physiology. We demonstrated novel intermediate morphologies of PaO2 waveform, which may represent a development of zone 2 physiologies within the lung. Future studies of PaO2 oscillations and modelling should aim to understand the aetiologies of these morphologies.
Collapse
Affiliation(s)
- John N Cronin
- Department of Anaesthesia and Perioperative Medicine, St. Thomas' Hospital, Guy's and St. Thomas' NHS Foundation Trust, Westminster Bridge Road, London, SE1 7EH, UK.
- Faculty of Life Sciences and Medicine, King's College London, London, UK.
| | - Douglas C Crockett
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, UK
| | - Gaetano Perchiazzi
- Hedenstierna Laboratory, Department of Surgical Sciences, University of Uppsala, Uppsala, Sweden
| | - Andrew D Farmery
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, UK
| | - Luigi Camporota
- Faculty of Life Sciences and Medicine, King's College London, London, UK
- Department of Intensive Care, Guy's and St. Thomas' NHS Foundation Trust, London, UK
| | - Federico Formenti
- Faculty of Life Sciences and Medicine, King's College London, London, UK
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, UK
| |
Collapse
|
5
|
Zhang J, Siegle GJ, Sun T, D’andrea W, Krafty RT. Interpretable principal component analysis for multilevel multivariate functional data. Biostatistics 2023; 24:227-243. [PMID: 34545394 PMCID: PMC10102903 DOI: 10.1093/biostatistics/kxab018] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2020] [Revised: 03/27/2021] [Accepted: 04/12/2021] [Indexed: 11/14/2022] Open
Abstract
Many studies collect functional data from multiple subjects that have both multilevel and multivariate structures. An example of such data comes from popular neuroscience experiments where participants' brain activity is recorded using modalities such as electroencephalography and summarized as power within multiple time-varying frequency bands within multiple electrodes, or brain regions. Summarizing the joint variation across multiple frequency bands for both whole-brain variability between subjects, as well as location-variation within subjects, can help to explain neural reactions to stimuli. This article introduces a novel approach to conducting interpretable principal components analysis on multilevel multivariate functional data that decomposes total variation into subject-level and replicate-within-subject-level (i.e., electrode-level) variation and provides interpretable components that can be both sparse among variates (e.g., frequency bands) and have localized support over time within each frequency band. Smoothness is achieved through a roughness penalty, while sparsity and localization of components are achieved by solving an innovative rank-one based convex optimization problem with block Frobenius and matrix $L_1$-norm-based penalties. The method is used to analyze data from a study to better understand reactions to emotional information in individuals with histories of trauma and the symptom of dissociation, revealing new neurophysiological insights into how subject- and electrode-level brain activity are associated with these phenomena. Supplementary materials for this article are available online.
Collapse
Affiliation(s)
- Jun Zhang
- Department of Biostatistics, University of Pittsburgh, 130 De Soto Street, Pittsburgh, PA, 15261, USA
| | - Greg J Siegle
- Department of Psychiatry, University of Pittsburgh, 3811 O’Hara Street, Pittsburgh, PA, 15213, USA
| | - Tao Sun
- Center for Applied Statistics, School of Statistics, Renmin University of China, 59 Zhongguancun Street, Beijing, 100872, China
| | - Wendy D’andrea
- Department of Psychology, New School for Social Research, 80 Fifth Avenue, New York, NY, 10011, USA
| | - Robert T Krafty
- Department of Biostatistics and Bioinformatics, Emory University, 1518 Clifton Road NE, Atlanta, GA, 30322, USA
| |
Collapse
|
6
|
Zhang N, Pan Y, Chen Q, Zhai Q, Liu N, Huang Y, Sun T, Lin Y, He L, Hou Y, Yu Q, Li H, Chen S. Application of EEG in migraine. Front Hum Neurosci 2023; 17:1082317. [PMID: 36875229 PMCID: PMC9982126 DOI: 10.3389/fnhum.2023.1082317] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2022] [Accepted: 02/03/2023] [Indexed: 02/19/2023] Open
Abstract
Migraine is a common disease of the nervous system that seriously affects the quality of life of patients and constitutes a growing global health crisis. However, many limitations and challenges exist in migraine research, including the unclear etiology and the lack of specific biomarkers for diagnosis and treatment. Electroencephalography (EEG) is a neurophysiological technique for measuring brain activity. With the updating of data processing and analysis methods in recent years, EEG offers the possibility to explore altered brain functional patterns and brain network characteristics of migraines in depth. In this paper, we provide an overview of the methodology that can be applied to EEG data processing and analysis and a narrative review of EEG-based migraine-related research. To better understand the neural changes of migraine or to provide a new idea for the clinical diagnosis and treatment of migraine in the future, we discussed the study of EEG and evoked potential in migraine, compared the relevant research methods, and put forwards suggestions for future migraine EEG studies.
Collapse
Affiliation(s)
- Ning Zhang
- Department of Neurology, The First Affiliated Hospital of Harbin Medical University, Harbin, Heilongjiang, China
- Shanxi Bethune Hospital, Shanxi Academy of Medical Sciences, Tongji Shanxi Hospital, Third Hospital of Shanxi Medical University, Taiyuan, China
- Tongji Medical College, Tongji Hospital, Huazhong University of Science and Technology, Wuhan, China
| | - Yonghui Pan
- Department of Neurology, The First Affiliated Hospital of Harbin Medical University, Harbin, Heilongjiang, China
| | - Qihui Chen
- Department of Neurology, The First Affiliated Hospital of Harbin Medical University, Harbin, Heilongjiang, China
| | - Qingling Zhai
- Department of Neurology, The First Affiliated Hospital of Harbin Medical University, Harbin, Heilongjiang, China
| | - Ni Liu
- Department of Neurology, The First Affiliated Hospital of Harbin Medical University, Harbin, Heilongjiang, China
| | - Yanan Huang
- Department of Neurology, The First Affiliated Hospital of Harbin Medical University, Harbin, Heilongjiang, China
| | - Tingting Sun
- Department of Neurology, The First Affiliated Hospital of Harbin Medical University, Harbin, Heilongjiang, China
| | - Yake Lin
- Department of Neurology, The First Affiliated Hospital of Harbin Medical University, Harbin, Heilongjiang, China
| | - Linyuan He
- Department of Neurology, The First Affiliated Hospital of Harbin Medical University, Harbin, Heilongjiang, China
| | - Yue Hou
- Department of Neurology, The First Affiliated Hospital of Harbin Medical University, Harbin, Heilongjiang, China
| | - Qijun Yu
- Department of Neurology, The First Affiliated Hospital of Harbin Medical University, Harbin, Heilongjiang, China
| | - Hongyan Li
- Department of Neurology, The First Affiliated Hospital of Harbin Medical University, Harbin, Heilongjiang, China
| | - Shijiao Chen
- Department of Neurology, The First Affiliated Hospital of Harbin Medical University, Harbin, Heilongjiang, China
| |
Collapse
|
7
|
Han R, Shi P, Zhang AR. Guaranteed Functional Tensor Singular Value Decomposition. J Am Stat Assoc 2023; 119:995-1007. [PMID: 39055126 PMCID: PMC11267031 DOI: 10.1080/01621459.2022.2153689] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2021] [Accepted: 11/27/2022] [Indexed: 12/03/2022]
Abstract
This paper introduces the functional tensor singular value decomposition (FTSVD), a novel dimension reduction framework for tensors with one functional mode and several tabular modes. The problem is motivated by high-order longitudinal data analysis. Our model assumes the observed data to be a random realization of an approximate CP low-rank functional tensor measured on a discrete time grid. Incorporating tensor algebra and the theory of Reproducing Kernel Hilbert Space (RKHS), we propose a novel RKHS-based constrained power iteration with spectral initialization. Our method can successfully estimate both singular vectors and functions of the low-rank structure in the observed data. With mild assumptions, we establish the non-asymptotic contractive error bounds for the proposed algorithm. The superiority of the proposed framework is demonstrated via extensive experiments on both simulated and real data.
Collapse
Affiliation(s)
- Rungang Han
- Department of Statistical Science, Duke University, Durham, NC 27710
| | - Pixu Shi
- Department of Biostatistics & Bioinformatics, Duke University
| | - Anru R. Zhang
- Department of Biostatistics & Bioinformatics, Computer Science, Mathematics, and Statistical Science, Duke University
| |
Collapse
|
8
|
Li Y, Nguyen DV, Kürüm E, Rhee CM, Banerjee S, Şentürk D. Multilevel Varying Coefficient Spatiotemporal Model. Stat (Int Stat Inst) 2022; 11:e438. [PMID: 35693320 PMCID: PMC9175782 DOI: 10.1002/sta4.438] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2021] [Accepted: 11/13/2021] [Indexed: 11/11/2022]
Abstract
Over 785,000 individuals in the U.S. have end-stage renal disease (ESRD) with about 70% of patients on dialysis, a life-sustaining treatment. Dialysis patients experience frequent hospitalizations. In order to identify risk factors of hospitalizations, we utilize data from the large national database, United States Renal Data System (USRDS). To account for the hierarchical structure of the data, with longitudinal hospitalization rates nested in dialysis facilities and dialysis facilities nested in geographic regions across the U.S., we propose a multilevel varying coefficient spatiotemporal model (M-VCSM) where region- and facility-specific random deviations are modeled through a multilevel Karhunen-Loéve (KL) expansion. The proposed M-VCSM includes time-varying effects of multilevel risk factors at the region- (e.g., urbanicity and area deprivation index) and facility-levels (e.g., patient demographic makeup) and incorporates spatial correlations across regions via a conditional autoregressive (CAR) structure. Efficient estimation and inference is achieved through the fusion of functional principal component analysis (FPCA) and Markov Chain Monte Carlo (MCMC). Applications to the USRDS data highlight significant region- and facility-level risk factors of hospitalizations and characterize time periods and spatial locations with elevated hospitalization risk. Finite sample performance of the proposed methodology is studied through simulations.
Collapse
Affiliation(s)
- Yihao Li
- Department of Biostatistics, University of California, Los Angeles, CA 90095, USA
| | - Danh V Nguyen
- Department of Medicine, University of California Irvine, Orange, CA 92868, USA
| | - Esra Kürüm
- Department of Statistics, University of California, Riverside, CA 92521, USA
| | - Connie M Rhee
- Department of Medicine, University of California Irvine, Orange, CA 92868, USA
- Harold Simmons Center for Chronic Disease Research and Epidemiology, University of California Irvine School of Medicine, Orange, CA 92868, USA
| | - Sudipto Banerjee
- Department of Biostatistics, University of California, Los Angeles, CA 90095, USA
| | - Damla Şentürk
- Department of Biostatistics, University of California, Los Angeles, CA 90095, USA
| |
Collapse
|
9
|
Conditional Kaplan–Meier Estimator with Functional Covariates for Time-to-Event Data. STATS 2022. [DOI: 10.3390/stats5040066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Due to the wide availability of functional data from multiple disciplines, the studies of functional data analysis have become popular in the recent literature. However, the related development in censored survival data has been relatively sparse. In this work, we consider the problem of analyzing time-to-event data in the presence of functional predictors. We develop a conditional generalized Kaplan–Meier (KM) estimator that incorporates functional predictors using kernel weights and rigorously establishes its asymptotic properties. In addition, we propose to select the optimal bandwidth based on a time-dependent Brier score. We then carry out extensive numerical studies to examine the finite sample performance of the proposed functional KM estimator and bandwidth selector. We also illustrated the practical usage of our proposed method by using a data set from Alzheimer’s Disease Neuroimaging Initiative data.
Collapse
|
10
|
Yi Y, Billor N, Liang M, Cao X, Ekstrom A, Zheng J. Classification of EEG signals: An interpretable approach using functional data analysis. J Neurosci Methods 2022; 376:109609. [PMID: 35483504 DOI: 10.1016/j.jneumeth.2022.109609] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2021] [Revised: 03/25/2022] [Accepted: 04/21/2022] [Indexed: 11/17/2022]
Abstract
Electroencephalography (EEG) is a noninvasive method to record electrical activity of the brain. The EEG data is continuous flow of voltages, in this paper, we consider them as functional data, and propose a three-stage algorithm based on functional data analysis, with the advantage of interpretability. Specifically, the time and frequency information are extracted by wavelet transform in the first stage. Then, functional testing is utilized to select EEG channels and frequencies that show significant differences for different human behaviors. In the third stage, we propose to use penalized multiple functional logistic regression to interpretably classify human behaviors. With simulation and a scalp EEG data as validation set, we show that the proposed three-stage algorithm provides an interpretable classification of the scalp EEG signals.
Collapse
Affiliation(s)
- Yuyan Yi
- Department of Mathematics and Statistics, Auburn University, USA.
| | - Nedret Billor
- Department of Mathematics and Statistics, Auburn University, USA.
| | - Mingli Liang
- Department of Psychiatry, Department of Neurosurgery, Yale University, USA.
| | - Xuan Cao
- Department of Mathematical Sciences, University of Cincinnati, USA.
| | - Arne Ekstrom
- Department of Psychology, University of Arizona, USA.
| | - Jingyi Zheng
- Department of Mathematics and Statistics, Auburn University, USA.
| |
Collapse
|
11
|
Scheffler AW, Dickinson A, DiStefano C, Jeste S, Şentürk D. Covariate-adjusted hybrid principal components analysis for region-referenced functional EEG data. STATISTICS AND ITS INTERFACE 2022; 15:209-223. [PMID: 35664510 PMCID: PMC9165697 DOI: 10.4310/21-sii712] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Electroencephalography (EEG) studies produce region-referenced functional data via EEG signals recorded across scalp electrodes. The high-dimensional data can be used to contrast neurodevelopmental trajectories between diagnostic groups, for example between typically developing (TD) children and children with autism spectrum disorder (ASD). Valid inference requires characterization of the complex EEG dependency structure as well as covariate-dependent heteroscedasticity, such as changes in variation over developmental age. In our motivating study, EEG data is collected on TD and ASD children aged two to twelve years old. The peak alpha frequency, a prominent peak in the alpha spectrum, is a biomarker linked to neurodevelopment that shifts as children age. To retain information, we model patterns of alpha spectral variation, rather than just the peak location, regionally across the scalp and chronologically across development. We propose a covariate-adjusted hybrid principal components analysis (CA-HPCA) for EEG data, which utilizes both vector and functional principal components analysis while simultaneously adjusting for covariate-dependent heteroscedasticity. CA-HPCA assumes the covariance process is weakly separable conditional on observed covariates, allowing for covariate-adjustments to be made on the marginal covariances rather than the full covariance leading to stable and computationally efficient estimation. The proposed methodology provides novel insights into neurodevelopmental differences between TD and ASD children.
Collapse
Affiliation(s)
| | - Abigail Dickinson
- Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles, USA
| | - Charlotte DiStefano
- Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles, USA
| | - Shafali Jeste
- Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles, USA
| | - Damla Şentürk
- Department of Biostatistics, University of California, Los Angeles, USA
| |
Collapse
|
12
|
Yu CH, Li M, Noe C, Fischer-Baum S, Vannucci M. Bayesian inference for stationary points in gaussian process regression models for event-related potentials analysis. Biometrics 2022. [PMID: 34997758 DOI: 10.1111/biom.13621] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2021] [Revised: 12/01/2021] [Accepted: 12/16/2021] [Indexed: 12/01/2022]
Abstract
Stationary points embedded in the derivatives are often critical for a model to be interpretable and may be considered as key features of interest in many applications. We propose a semiparametric Bayesian model to efficiently infer the locations of stationary points of a nonparametric function, which also produces an estimate of the function. We use Gaussian processes as a flexible prior for the underlying function and impose derivative constraints to control the function's shape via conditioning. We develop an inferential strategy that intentionally restricts estimation to the case of at least one stationary point, bypassing possible mis-specifications in the number of stationary points and avoiding the varying dimension problem that often brings in computational complexity. We illustrate the proposed methods using simulations and then apply the method to the estimation of event-related potentials (ERP) derived from electroencephalography (EEG) signals. We show how the proposed method automatically identifies characteristic components and their latencies at the individual level, which avoids the excessive averaging across subjects which is routinely done in the field to obtain smooth curves. By applying this approach to EEG data collected from younger and older adults during a speech perception task, we are able to demonstrate how the time course of speech perception processes changes with age. This article is protected by copyright. All rights reserved.
Collapse
Affiliation(s)
- Cheng-Han Yu
- Department of Mathematical and Statistical Sciences, Marquette University, Milwaukee, WI, USA
| | - Meng Li
- Department of Statistics, Rice University, Houston, TX, USA
| | - Colin Noe
- Department of Psychological Science, Rice University, Houston, TX 77005
| | | | | |
Collapse
|
13
|
Saeidi M, Karwowski W, Farahani FV, Fiok K, Taiar R, Hancock PA, Al-Juaid A. Neural Decoding of EEG Signals with Machine Learning: A Systematic Review. Brain Sci 2021; 11:1525. [PMID: 34827524 PMCID: PMC8615531 DOI: 10.3390/brainsci11111525] [Citation(s) in RCA: 64] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2021] [Revised: 11/04/2021] [Accepted: 11/11/2021] [Indexed: 11/16/2022] Open
Abstract
Electroencephalography (EEG) is a non-invasive technique used to record the brain's evoked and induced electrical activity from the scalp. Artificial intelligence, particularly machine learning (ML) and deep learning (DL) algorithms, are increasingly being applied to EEG data for pattern analysis, group membership classification, and brain-computer interface purposes. This study aimed to systematically review recent advances in ML and DL supervised models for decoding and classifying EEG signals. Moreover, this article provides a comprehensive review of the state-of-the-art techniques used for EEG signal preprocessing and feature extraction. To this end, several academic databases were searched to explore relevant studies from the year 2000 to the present. Our results showed that the application of ML and DL in both mental workload and motor imagery tasks has received substantial attention in recent years. A total of 75% of DL studies applied convolutional neural networks with various learning algorithms, and 36% of ML studies achieved competitive accuracy by using a support vector machine algorithm. Wavelet transform was found to be the most common feature extraction method used for all types of tasks. We further examined the specific feature extraction methods and end classifier recommendations discovered in this systematic review.
Collapse
Affiliation(s)
- Maham Saeidi
- Computational Neuroergonomics Laboratory, Department of Industrial Engineering and Management Systems, University of Central Florida, Orlando, FL 32816, USA; (F.V.F.); (K.F.)
| | - Waldemar Karwowski
- Computational Neuroergonomics Laboratory, Department of Industrial Engineering and Management Systems, University of Central Florida, Orlando, FL 32816, USA; (F.V.F.); (K.F.)
| | - Farzad V. Farahani
- Computational Neuroergonomics Laboratory, Department of Industrial Engineering and Management Systems, University of Central Florida, Orlando, FL 32816, USA; (F.V.F.); (K.F.)
- Department of Biostatistics, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Krzysztof Fiok
- Computational Neuroergonomics Laboratory, Department of Industrial Engineering and Management Systems, University of Central Florida, Orlando, FL 32816, USA; (F.V.F.); (K.F.)
| | - Redha Taiar
- MATIM, Moulin de la Housse, Université de Reims Champagne Ardenne, CEDEX 02, 51687 Reims, France;
| | - P. A. Hancock
- Department of Psychology, University of Central Florida, Orlando, FL 32816, USA;
| | - Awad Al-Juaid
- Industrial Engineering Department, Taif University, Taif 26571, Saudi Arabia;
| |
Collapse
|
14
|
Bi-Smoothed Functional Independent Component Analysis for EEG Artifact Removal. MATHEMATICS 2021. [DOI: 10.3390/math9111243] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Motivated by mapping adverse artifactual events caused by body movements in electroencephalographic (EEG) signals, we present a functional independent component analysis based on the spectral decomposition of the kurtosis operator of a smoothed principal component expansion. A discrete roughness penalty is introduced in the orthonormality constraint of the covariance eigenfunctions in order to obtain the smoothed basis for the proposed independent component model. To select the tuning parameters, a cross-validation method that incorporates shrinkage is used to enhance the performance on functional representations with a large basis dimension. This method provides an estimation strategy to determine the penalty parameter and the optimal number of components. Our independent component approach is applied to real EEG data to estimate genuine brain potentials from a contaminated signal. As a result, it is possible to control high-frequency remnants of neural origin overlapping artifactual sources to optimize their removal from the signal. An R package implementing our methods is available at CRAN.
Collapse
|
15
|
Li Y, Nguyen DV, Banerjee S, Rhee CM, Kalantar-Zadeh K, Kürüm E, Şentürk D. Multilevel modeling of spatially nested functional data: Spatiotemporal patterns of hospitalization rates in the US dialysis population. Stat Med 2021; 40:3937-3952. [PMID: 33902165 DOI: 10.1002/sim.9007] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2020] [Revised: 02/15/2021] [Accepted: 04/08/2021] [Indexed: 11/12/2022]
Abstract
End-stage renal disease patients on dialysis experience frequent hospitalizations. In addition to known temporal patterns of hospitalizations over the life span on dialysis, where poor outcomes are typically exacerbated during the first year on dialysis, variations in hospitalizations among dialysis facilities across the US contribute to spatial variation. Utilizing national data from the United States Renal Data System (USRDS), we propose a novel multilevel spatiotemporal functional model to study spatiotemporal patterns of hospitalization rates among dialysis facilities. Hospitalization rates of dialysis facilities are considered as spatially nested functional data (FD) with longitudinal hospitalizations nested in dialysis facilities and dialysis facilities nested in geographic regions. A multilevel Karhunen-Loéve expansion is utilized to model the two-level (facility and region) FD, where spatial correlations are induced among region-specific principal component scores accounting for regional variation. A new efficient algorithm based on functional principal component analysis and Markov Chain Monte Carlo is proposed for estimation and inference. We report a novel application using USRDS data to characterize spatiotemporal patterns of hospitalization rates for over 400 health service areas across the US and over the posttransition time on dialysis. Finite sample performance of the proposed method is studied through simulations.
Collapse
Affiliation(s)
- Yihao Li
- Department of Biostatistics, University of California, Los Angeles, California
| | - Danh V Nguyen
- Department of Medicine, UC Irvine School of Medicine, Orange, California
| | - Sudipto Banerjee
- Department of Biostatistics, University of California, Los Angeles, California
| | - Connie M Rhee
- Department of Medicine, UC Irvine School of Medicine, Orange, California
| | | | - Esra Kürüm
- Department of Statistics, University of California, Riverside, California
| | - Damla Şentürk
- Department of Biostatistics, University of California, Los Angeles, California
| |
Collapse
|
16
|
Shangguan P, Qiu T, Liu T, Zou S, Liu Z, Zhang S. Feature extraction of EEG signals based on functional data analysis and its application to recognition of driver fatigue state. Physiol Meas 2021; 41:125004. [PMID: 33126235 DOI: 10.1088/1361-6579/abc66e] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
OBJECTIVE Our objective is to study how to obtain features which can reflect the continuity and internal dynamic changes of electroencephalography (EEG) signals and study an effective method for fatigued driving state recognition based on the obtained features. APPROACH A method of EEG signalfeature extraction based on functional data analysis is proposed. Combined with kernel principal component analysis method, the obtained features are applied to the recognition of driver fatigue state, and a corresponding recognition model of fatigued driving state is constructed. MAIN RESULTS The recognition model is tested on the real collected driver fatigue EEG signals by selecting a suitable classifier. The test results show that the proposed driver fatigue state recognition method has good recognition effect, especially on the classifier based on decision tree, with an average accuracy of 99.50%. SIGNIFICANCE The extracted features well reflect the continuityand internal dynamic changes of the EEG signals, and it is of great significance and application value to study an effective method of fatigued driver state recognition based on the features.
Collapse
Affiliation(s)
- Pengpeng Shangguan
- Department of Computer, Nanchang University, Nanchang Jiangxi, 330029, People's Republic of China
| | | | | | | | | | | |
Collapse
|
17
|
Li T, Li T, Zhu Z, Zhu H. Regression Analysis of Asynchronous Longitudinal Functional and Scalar Data. J Am Stat Assoc 2020. [DOI: 10.1080/01621459.2020.1844211] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Affiliation(s)
- Ting Li
- School of Statistics and Management, Shanghai University of Finance and Economics, Shanghai, China
| | - Tengfei Li
- Department of Radiology and Biomedical Research Imaging Center (BRIC), University of North Carolina at Chapel Hill, Chapel Hill, NC
| | - Zhongyi Zhu
- Department of Statistics, Fudan University, Shanghai, China
| | - Hongtu Zhu
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC
| |
Collapse
|
18
|
Scheffler A, Telesca D, Li Q, Sugar CA, Distefano C, Jeste S, Şentürk D. Hybrid principal components analysis for region-referenced longitudinal functional EEG data. Biostatistics 2020; 21:139-157. [PMID: 30084925 DOI: 10.1093/biostatistics/kxy034] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2017] [Revised: 01/25/2018] [Accepted: 06/11/2018] [Indexed: 11/12/2022] Open
Abstract
Electroencephalography (EEG) data possess a complex structure that includes regional, functional, and longitudinal dimensions. Our motivating example is a word segmentation paradigm in which typically developing (TD) children, and children with autism spectrum disorder (ASD) were exposed to a continuous speech stream. For each subject, continuous EEG signals recorded at each electrode were divided into one-second segments and projected into the frequency domain via fast Fourier transform. Following a spectral principal components analysis, the resulting data consist of region-referenced principal power indexed regionally by scalp location, functionally across frequencies, and longitudinally by one-second segments. Standard EEG power analyses often collapse information across the longitudinal and functional dimensions by averaging power across segments and concentrating on specific frequency bands. We propose a hybrid principal components analysis for region-referenced longitudinal functional EEG data, which utilizes both vector and functional principal components analyses and does not collapse information along any of the three dimensions of the data. The proposed decomposition only assumes weak separability of the higher-dimensional covariance process and utilizes a product of one dimensional eigenvectors and eigenfunctions, obtained from the regional, functional, and longitudinal marginal covariances, to represent the observed data, providing a computationally feasible non-parametric approach. A mixed effects framework is proposed to estimate the model components coupled with a bootstrap test for group level inference, both geared towards sparse data applications. Analysis of the data from the word segmentation paradigm leads to valuable insights about group-region differences among the TD and verbal and minimally verbal children with ASD. Finite sample properties of the proposed estimation framework and bootstrap inference procedure are further studied via extensive simulations.
Collapse
Affiliation(s)
- Aaron Scheffler
- Department of Biostatistics, University of California Los Angeles, 650 Charles E Young Drive, Los Angeles, CA, USA
| | - Donatello Telesca
- Department of Biostatistics, University of California Los Angeles, 650 Charles E Young Drive, Los Angeles, CA, USA
| | - Qian Li
- Department of Biostatistics, University of California Los Angeles, 650 Charles E Young Drive, Los Angeles, CA, USA
| | - Catherine A Sugar
- Department of Biostatistics, University of California Los Angeles, 650 Charles E Young Drive, Los Angeles, CA, USA.,Department of Psychiatry and Biobehavioral Sciences, University of California Los Angeles, 757 Westwood Plaza, Los Angeles, CA, USA
| | - Charlotte Distefano
- Department of Psychiatry and Biobehavioral Sciences, University of California Los Angeles, 757 Westwood Plaza, Los Angeles, CA, USA
| | - Shafali Jeste
- Department of Psychiatry and Biobehavioral Sciences, University of California Los Angeles, 757 Westwood Plaza, Los Angeles, CA, USA
| | - Damla Şentürk
- Department of Biostatistics, University of California Los Angeles, 650 Charles E Young Drive, Los Angeles, CA, USA
| |
Collapse
|
19
|
Margaritella N, Inácio V, King R. Parameter clustering in Bayesian functional principal component analysis of neuroscientific data. Stat Med 2020; 40:167-184. [PMID: 33040367 DOI: 10.1002/sim.8768] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2020] [Revised: 08/02/2020] [Accepted: 09/17/2020] [Indexed: 11/07/2022]
Abstract
The extraordinary advancements in neuroscientific technology for brain recordings over the last decades have led to increasingly complex spatiotemporal data sets. To reduce oversimplifications, new models have been developed to be able to identify meaningful patterns and new insights within a highly demanding data environment. To this extent, we propose a new model called parameter clustering functional principal component analysis (PCl-fPCA) that merges ideas from functional data analysis and Bayesian nonparametrics to obtain a flexible and computationally feasible signal reconstruction and exploration of spatiotemporal neuroscientific data. In particular, we use a Dirichlet process Gaussian mixture model to cluster functional principal component scores within the standard Bayesian functional PCA framework. This approach captures the spatial dependence structure among smoothed time series (curves) and its interaction with the time domain without imposing a prior spatial structure on the data. Moreover, by moving the mixture from data to functional principal component scores, we obtain a more general clustering procedure, thus allowing a higher level of intricate insight and understanding of the data. We present results from a simulation study showing improvements in curve and correlation reconstruction compared with different Bayesian and frequentist fPCA models and we apply our method to functional magnetic resonance imaging and electroencephalogram data analyses providing a rich exploration of the spatiotemporal dependence in brain time series.
Collapse
Affiliation(s)
| | - Vanda Inácio
- School of Mathematics, University of Edinburgh, Edinburgh, UK
| | - Ruth King
- School of Mathematics, University of Edinburgh, Edinburgh, UK
| |
Collapse
|
20
|
Shamshoian J, Şentürk D, Jeste S, Telesca D. Bayesian analysis of longitudinal and multidimensional functional data. Biostatistics 2020; 23:558-573. [PMID: 33017019 DOI: 10.1093/biostatistics/kxaa041] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2019] [Revised: 08/31/2020] [Accepted: 09/04/2020] [Indexed: 11/13/2022] Open
Abstract
Multi-dimensional functional data arises in numerous modern scientific experimental and observational studies. In this article, we focus on longitudinal functional data, a structured form of multidimensional functional data. Operating within a longitudinal functional framework we aim to capture low dimensional interpretable features. We propose a computationally efficient nonparametric Bayesian method to simultaneously smooth observed data, estimate conditional functional means and functional covariance surfaces. Statistical inference is based on Monte Carlo samples from the posterior measure through adaptive blocked Gibbs sampling. Several operative characteristics associated with the proposed modeling framework are assessed comparatively in a simulated environment. We illustrate the application of our work in two case studies. The first case study involves age-specific fertility collected over time for various countries. The second case study is an implicit learning experiment in children with autism spectrum disorder.
Collapse
Affiliation(s)
- John Shamshoian
- Department of Biostatistics, University of California, Los Angeles, CA, USA
| | - Damla Şentürk
- Department of Biostatistics, University of California, Los Angeles, CA, USA
| | - Shafali Jeste
- Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles, CA, USA
| | - Donatello Telesca
- Department of Biostatistics, University of California, Los Angeles, CA, USA
| |
Collapse
|
21
|
Martínez-Hernández I, Genton MG. Nonparametric trend estimation in functional time series with application to annual mortality rates. Biometrics 2020; 77:866-878. [PMID: 32797623 DOI: 10.1111/biom.13353] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2019] [Accepted: 08/10/2020] [Indexed: 11/30/2022]
Abstract
We address the problem of trend estimation for functional time series. Existing contributions either deal with detecting a functional trend or assuming a simple model. They consider neither the estimation of a general functional trend nor the analysis of functional time series with a functional trend component. Similarly to univariate time series, we propose an alternative methodology to analyze functional time series, taking into account a functional trend component. We propose to estimate the functional trend by using a tensor product surface that is easy to implement, to interpret, and allows to control the smoothness properties of the estimator. Through a Monte Carlo study, we simulate different scenarios of functional processes to show that our estimator accurately identifies the functional trend component. We also show that the dependency structure of the estimated stationary time series component is not significantly affected by the error approximation of the functional trend component. We apply our methodology to annual mortality rates in France.
Collapse
Affiliation(s)
| | - Marc G Genton
- King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| |
Collapse
|
22
|
Campos E, Hazlett C, Tan P, Truong H, Loo S, DiStefano C, Jeste S, Şentürk D. Principle ERP reduction and analysis: Estimating and using principle ERP waveforms underlying ERPs across tasks, subjects and electrodes. Neuroimage 2020; 212:116630. [PMID: 32087372 PMCID: PMC7594508 DOI: 10.1016/j.neuroimage.2020.116630] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2019] [Revised: 01/03/2020] [Accepted: 02/10/2020] [Indexed: 11/28/2022] Open
Abstract
Event-related potentials (ERP) waveforms are the summation of many overlapping signals. Changes in the peak or mean amplitude of a waveform over a given time period, therefore, cannot reliably be attributed to a particular ERP component of ex ante interest, as is the standard approach to ERP analysis. Though this problem is widely recognized, it is not well addressed in practice. Our approach begins by presuming that any observed ERP waveform - at any electrode, for any trial type, and for any participant - is approximately a weighted combination of signals from an underlying set of what we refer to as principle ERPs, or pERPs. We propose an accessible approach to analyzing complete ERP waveforms in terms of their underlying pERPs. First, we propose the principle ERP reduction (pERP-RED) algorithm for investigators to estimate a suitable set of pERPs from their data, which may span multiple tasks. Next, we provide tools and illustrations of pERP-space analysis, whereby observed ERPs are decomposed into the amplitudes of the contributing pERPs, which can be contrasted across conditions or groups to reveal which pERPs differ (substantively and/or significantly) between conditions/groups. Differences on all pERPs can be reported together rather than selectively, providing complete information on all components in the waveform, thereby avoiding selective reporting or user discretion regarding the choice of which components or windows to use. The scalp distribution of each pERP can also be plotted for any group/condition. We demonstrate this suite of tools through simulations and on real data collected from multiple experiments on participants diagnosed with Autism Spectrum Disorder and Attention Deficit Hyperactivity Disorder. Software for conducting these analyses is provided in the pERPred package for R.
Collapse
Affiliation(s)
- Emilie Campos
- Department of Biostatistics, University of California, Los Angeles, USA
| | - Chad Hazlett
- Departments of Statistics and Political Science, University of California, Los Angeles, USA
| | - Patricia Tan
- Department of Psychiatry, University of California, Los Angeles, USA
| | - Holly Truong
- Department of Psychiatry, University of California, Los Angeles, USA
| | - Sandra Loo
- Department of Psychiatry, University of California, Los Angeles, USA
| | | | - Shafali Jeste
- Department of Psychiatry, University of California, Los Angeles, USA
| | - Damla Şentürk
- Department of Biostatistics, University of California, Los Angeles, USA.
| |
Collapse
|
23
|
Li Q, Şentürk D, Sugar CA, Jeste S, DiStefano C, Frohlich J, Telesca D. Inferring Brain Signals Synchronicity from a Sample of EEG Readings. J Am Stat Assoc 2019; 114:991-1001. [PMID: 33100436 DOI: 10.1080/01621459.2018.1518233] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Abstract
Inferring patterns of synchronous brain activity from a heterogeneous sample of electroencephalograms (EEG) is scientifically and methodologically challenging. While it is intuitively and statistically appealing to rely on readings from more than one individual in order to highlight recurrent patterns of brain activation, pooling information across subjects presents non-trivial methodological problems. We discuss some of the scientific issues associated with the understanding of synchronized neuronal activity and propose a methodological framework for statistical inference from a sample of EEG readings. Our work builds on classical contributions in time-series, clustering and functional data analysis, in an effort to reframe a challenging inferential problem in the context of familiar analytical techniques. Some attention is paid to computational issues, with a proposal based on the combination of machine learning and Bayesian techniques.
Collapse
Affiliation(s)
- Qian Li
- Department of Biostatistics, University of California, Los Angeles
| | - Damla Şentürk
- Department of Biostatistics, University of California, Los Angeles.,Department of Statistics, University of California, Los Angeles
| | - Catherine A Sugar
- Department of Biostatistics, University of California, Los Angeles.,Department of Statistics, University of California, Los Angeles.,Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles
| | - Shafali Jeste
- Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles
| | - Charlotte DiStefano
- Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles
| | - Joel Frohlich
- Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles
| | | |
Collapse
|
24
|
Zhu H, Versace F, Cinciripini PM, Rausch P, Morris JS. Robust and Gaussian spatial functional regression models for analysis of event-related potentials. Neuroimage 2018; 181:501-512. [PMID: 30057352 DOI: 10.1016/j.neuroimage.2018.07.006] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2018] [Revised: 06/01/2018] [Accepted: 07/03/2018] [Indexed: 10/28/2022] Open
Abstract
Event-related potentials (ERPs) summarize electrophysiological brain response to specific stimuli. They can be considered as correlated functions of time with both spatial correlation across electrodes and nested correlations within subjects. Commonly used analytical methods for ERPs often focus on pre-determined extracted components and/or ignore the correlation among electrodes or subjects, which can miss important insights, and tend to be sensitive to outlying subjects, time points or electrodes. Motivated by ERP data in a smoking cessation study, we introduce a Bayesian spatial functional regression framework that models the entire ERPs as spatially correlated functional responses and the stimulus types as covariates. This novel framework relies on mixed models to characterize the effects of stimuli while simultaneously accounting for the multilevel correlation structure. The spatial correlation among the ERP profiles is captured through basis-space Matérn assumptions that allow either separable or nonseparable spatial correlations over time. We induce both adaptive regularization over time and spatial smoothness across electrodes via a correlated normal-exponential-gamma (CNEG) prior on the fixed effect coefficient functions. Our proposed framework includes both Gaussian models as well as robust models using heavier-tailed distributions to make the regression automatically robust to outliers. We introduce predictive methods to select among Gaussian vs. robust models and models with separable vs. non-separable spatiotemporal correlation structures. Our proposed analysis produces global tests for stimuli effects across entire time (or time-frequency) and electrode domains, plus multiplicity-adjusted pointwise inference based on experiment-wise error rate or false discovery rate to flag spatiotemporal (or spatio-temporal-frequency) regions that characterize stimuli differences, and can also produce inference for any prespecified waveform components. Our analysis of the smoking cessation ERP data set reveals numerous effects across different types of visual stimuli.
Collapse
Affiliation(s)
- Hongxiao Zhu
- Department of Statistics, Virginia Tech, Blacksburg, VA, USA.
| | - Francesco Versace
- Department of Behavioral Science, The University of Texas M.D. Anderson Cancer Center, Houston, TX, USA
| | - Paul M Cinciripini
- Department of Behavioral Science, The University of Texas M.D. Anderson Cancer Center, Houston, TX, USA
| | - Philip Rausch
- Department of Psychology, Humboldt-Universität zu Berlin, Berlin, Germany
| | - Jeffrey S Morris
- Department of Biostatistics, The University of Texas M.D. Anderson Cancer Center, Houston, TX, USA
| |
Collapse
|
25
|
Lee W, Miranda MF, Rausch P, Baladandayuthapani V, Fazio M, Downs JC, Morris JS. Bayesian Semiparametric Functional Mixed Models for Serially Correlated Functional Data, with Application to Glaucoma Data. J Am Stat Assoc 2018; 114:495-513. [PMID: 31235987 PMCID: PMC6590079 DOI: 10.1080/01621459.2018.1476242] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2016] [Revised: 12/01/2017] [Indexed: 10/14/2022]
Abstract
Glaucoma, a leading cause of blindness, is characterized by optic nerve damage related to intraocular pressure (IOP), but its full etiology is unknown. Researchers at UAB have devised a custom device to measure scleral strain continuously around the eye under fixed levels of IOP, which here is used to assess how strain varies around the posterior pole, with IOP, and across glaucoma risk factors such as age. The hypothesis is that scleral strain decreases with age, which could alter biomechanics of the optic nerve head and cause damage that could eventually lead to glaucoma. To evaluate this hypothesis, we adapted Bayesian Functional Mixed Models to model these complex data consisting of correlated functions on spherical scleral surface, with nonparametric age effects allowed to vary in magnitude and smoothness across the scleral surface, multi-level random effect functions to capture within-subject correlation, and functional growth curve terms to capture serial correlation across IOPs that can vary around the scleral surface. Our method yields fully Bayesian inference on the scleral surface or any aggregation or transformation thereof, and reveals interesting insights into the biomechanical etiology of glaucoma. The general modeling framework described is very flexible and applicable to many complex, high-dimensional functional data.
Collapse
Affiliation(s)
- Wonyul Lee
- Department of Biostatistics, University of Texas M.D. Anderson Cancer Center, Houston, TX 77230
| | - Michelle F Miranda
- Department of Biostatistics, University of Texas M.D. Anderson Cancer Center, Houston, TX 77230
| | - Philip Rausch
- Department of Psychology, Institut für Psychologie, Humboldt-Universität zu Berlin, Germany
| | | | - Massimo Fazio
- Department of Ophthalmology, University of Alabama at Birmingham, Birmingham, AL 35294
| | - J Crawford Downs
- Department of Ophthalmology, University of Alabama at Birmingham, Birmingham, AL 35294
| | - Jeffrey S Morris
- Department of Biostatistics, University of Texas M.D. Anderson Cancer Center, Houston, TX 77230
| |
Collapse
|
26
|
|