1
|
Zhou F, He K, Wang K, Xu Y, Ni Y. Functional Bayesian networks for discovering causality from multivariate functional data. Biometrics 2023; 79:3279-3293. [PMID: 37635676 PMCID: PMC10840881 DOI: 10.1111/biom.13922] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2022] [Accepted: 08/10/2023] [Indexed: 08/29/2023]
Abstract
Multivariate functional data arise in a wide range of applications. One fundamental task is to understand the causal relationships among these functional objects of interest. In this paper, we develop a novel Bayesian network (BN) model for multivariate functional data where conditional independencies and causal structure are encoded by a directed acyclic graph. Specifically, we allow the functional objects to deviate from Gaussian processes, which is the key to unique causal structure identification even when the functions are measured with noises. A fully Bayesian framework is designed to infer the functional BN model with natural uncertainty quantification through posterior summaries. Simulation studies and real data examples demonstrate the practical utility of the proposed model.
Collapse
Affiliation(s)
- Fangting Zhou
- Department of Statistics, Texas A&M University, College Station, Texas, USA
- Center for Applied Statistics, Institute of Statistics and Big Data, Renmin University of China, Beijing, China
| | - Kejun He
- Center for Applied Statistics, Institute of Statistics and Big Data, Renmin University of China, Beijing, China
| | - Kunbo Wang
- Department of Applied Mathematics and Statistics, Johns Hopkins University, Baltimore, Maryland, USA
| | - Yanxun Xu
- Department of Applied Mathematics and Statistics, Johns Hopkins University, Baltimore, Maryland, USA
| | - Yang Ni
- Department of Statistics, Texas A&M University, College Station, Texas, USA
| |
Collapse
|
2
|
Rueda C, Rodríguez-Collado A. Functional clustering of neuronal signals with FMM mixture models. Heliyon 2023; 9:e20639. [PMID: 37867904 PMCID: PMC10589779 DOI: 10.1016/j.heliyon.2023.e20639] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Revised: 09/21/2023] [Accepted: 10/03/2023] [Indexed: 10/24/2023] Open
Abstract
The identification of unlabeled neuronal electric signals is one of the most challenging open problems in neuroscience, widely known as Spike Sorting. Motivated to solve this problem, we propose a model-based approach within the mixture modeling framework for clustering oscillatory functional data called MixFMM. The core of the approach is the FMM (Frequency Modulated Möbius) waves, which are non-linear parametric time functions, flexible enough to describe different oscillatory patterns and simple enough to be estimated efficiently. In particular, specific model parameters describe the phase, amplitude and shape of the waveforms. A mixture model is defined using FMM waves as basic functions and gaussian errors, and an EM algorithm is proposed for estimating the parameters. Spike Sorting (SS) has received considerable attention in the literature, and different functional clustering approaches have been considered. We have conducted a fair comparative analysis of the MixFMM with three competitors. Two of them are traditional methods in functional clustering and widely used in Spike Sorting. The third is an approach that has proven superior to many others solving Spike Sorting problems. The datasets used for validation include benchmarking simulated and real cases. The internal and external validation indexes confirm a better performance of the MixFMM on real data sets against the three competitors and an outstanding performance in simulated data against traditional approaches.
Collapse
Affiliation(s)
- Cristina Rueda
- Department of Statistics and Operations Research, University of Valladolid, 47011 Valladolid, Spain
- Mathematics Research Institute of the University of Valladolid (IMUVA), 47011 Valladolid, Spain
| | | |
Collapse
|
3
|
Reimherr M, Sriperumbudur B, Kang HB. Optimal function-on-scalar regression over complex domains. Electron J Stat 2023. [DOI: 10.1214/22-ejs2096] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Affiliation(s)
- Matthew Reimherr
- Department of Statistics, Pennsylvania State University, University Park, PA 16802, USA
| | - Bharath Sriperumbudur
- Department of Statistics, Pennsylvania State University, University Park, PA 16802, USA
| | - Hyun Bin Kang
- Department of Statistics, Western Michigan University, Kalamazoo, MI 49008, USA
| |
Collapse
|
4
|
Nunez R, Harris A, Ibrahim O, Keller J, Wikle CK, Robinson E, Zukerman R, Siesky B, Verticchio A, Rowe L, Guidoboni G. Artificial Intelligence to Aid Glaucoma Diagnosis and Monitoring: State of the Art and New Directions. PHOTONICS 2022; 9:810. [PMID: 36816462 PMCID: PMC9934292 DOI: 10.3390/photonics9110810] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
Recent developments in the use of artificial intelligence in the diagnosis and monitoring of glaucoma are discussed. To set the context and fix terminology, a brief historic overview of artificial intelligence is provided, along with some fundamentals of statistical modeling. Next, recent applications of artificial intelligence techniques in glaucoma diagnosis and the monitoring of glaucoma progression are reviewed, including the classification of visual field images and the detection of glaucomatous change in retinal nerve fiber layer thickness. Current challenges in the direct application of artificial intelligence to further our understating of this disease are also outlined. The article also discusses how the combined use of mathematical modeling and artificial intelligence may help to address these challenges, along with stronger communication between data scientists and clinicians.
Collapse
Affiliation(s)
- Roberto Nunez
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211, USA
| | - Alon Harris
- Department of Ophthalmology, Icahn School of Medicine at Mt. Sinai, New York, NY 10029, USA
| | - Omar Ibrahim
- Department of Electrical Engineering, Tikrit University, Tikrit P.O. Box 42, Iraq
| | - James Keller
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211, USA
| | | | - Erin Robinson
- Department of Social Work, University of Missouri, Columbia, MO 65211, USA
| | - Ryan Zukerman
- Department of Ophthalmology, Edward S. Harkness Eye Institute, Columbia University Irving Medical Center, New York-Presbyterian Hospital, New York, NY 10034, USA
| | - Brent Siesky
- Department of Ophthalmology, Icahn School of Medicine at Mt. Sinai, New York, NY 10029, USA
| | - Alice Verticchio
- Department of Ophthalmology, Icahn School of Medicine at Mt. Sinai, New York, NY 10029, USA
| | - Lucas Rowe
- Department of Ophthalmology, Indiana University School of Medicine, Indianapolis, IN 46202, USA
| | - Giovanna Guidoboni
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211, USA
- Department of Mathematics, University of Missouri, Columbia, MO 65211, USA
| |
Collapse
|
5
|
Huo S, Morris JS, Zhu H. Ultra-Fast Approximate Inference Using Variational Functional Mixed Models. J Comput Graph Stat 2022; 32:353-365. [PMID: 37608921 PMCID: PMC10441618 DOI: 10.1080/10618600.2022.2107532] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2021] [Accepted: 07/23/2022] [Indexed: 10/16/2022]
Abstract
While Bayesian functional mixed models have been shown effective to model functional data with various complex structures, their application to extremely high-dimensional data is limited due to computational challenges involved in posterior sampling. We introduce a new computational framework that enables ultra-fast approximate inference for high-dimensional data in functional form. This framework adopts parsimonious basis to represent functional observations, which facilitates efficient compression and parallel computing in basis space. Instead of performing expensive Markov chain Monte Carlo sampling, we approximate the posterior distribution using variational Bayes and adopt a fast iterative algorithm to estimate parameters of the approximate distribution. Our approach facilitates a fast multiple testing procedure in basis space, which can be used to identify significant local regions that reflect differences across groups of samples. We perform two simulation studies to assess the performance of approximate inference, and demonstrate applications of the proposed approach by using a proteomic mass spectrometry dataset and a brain imaging dataset. Supplementary materials are available online.
Collapse
Affiliation(s)
| | - Jeffrey S Morris
- Department of Biostatistics, Epidemiology and Informatics, Department of Statistics, University of Pennsylvania
| | | |
Collapse
|
6
|
Sassi G, Chiann C. Estimation of trace-variogram using Legendre–Gauss quadrature. BRAZ J PROBAB STAT 2022. [DOI: 10.1214/22-bjps536] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Affiliation(s)
- Gilberto Sassi
- Departamento de Estatística, Instituto de Matemática e Estatística, Universidade Federal da Bahia, Salvador, Bahia, Brazil https://gilberto-sassi.github.io
| | - Chang Chiann
- Departamento de Estatística, Instituto de Matemática e Estatística, Universidade Federal da Bahia, São Paulo, São Paulo, Brazil
| |
Collapse
|
7
|
Li R, Xiao L, Smirnova E, Cui E, Leroux A, Crainiceanu CM. Fixed-effects inference and tests of correlation for longitudinal functional data. Stat Med 2022; 41:3349-3364. [PMID: 35491388 PMCID: PMC9283332 DOI: 10.1002/sim.9421] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2021] [Revised: 01/30/2022] [Accepted: 03/05/2022] [Indexed: 11/19/2022]
Abstract
We propose an inferential framework for fixed effects in longitudinal functional models and introduce tests for the correlation structures induced by the longitudinal sampling procedure. The framework provides a natural extension of standard longitudinal correlation models for scalar observations to functional observations. Using simulation studies, we compare fixed effects estimation under correctly and incorrectly specified correlation structures and also test the longitudinal correlation structure. Finally, we apply the proposed methods to a longitudinal functional dataset on physical activity. The computer code for the proposed method is available at https://github.com/rli20ST758/FILF.
Collapse
Affiliation(s)
- Ruonan Li
- Department of StatisticsNorth Carolina State UniversityRaleighNorth CarolinaUSA
| | - Luo Xiao
- Department of StatisticsNorth Carolina State UniversityRaleighNorth CarolinaUSA
| | - Ekaterina Smirnova
- Department of BiostatisticsVirginia Commonwealth UniversityRichmondVirginiaUSA
| | - Erjia Cui
- Department of BiostatisticsJohns Hopkins Bloomberg School of Public HealthBaltimoreMarylandUSA
| | - Andrew Leroux
- Department of Biostatistics and InformaticsColorado School of Public HealthAuroraColoradoUSA
| | - Ciprian M. Crainiceanu
- Department of BiostatisticsJohns Hopkins Bloomberg School of Public HealthBaltimoreMarylandUSA
| |
Collapse
|
8
|
Meyer MJ, Morris JS, Gazes RP, Coull BA. Ordinal probit functional outcome regression with application to computer-use behavior in rhesus monkeys. Ann Appl Stat 2022; 16:537-550. [DOI: 10.1214/21-aoas1513] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Affiliation(s)
- Mark J. Meyer
- Department of Mathematics and Statistics, Georgetown University
| | - Jeffrey S. Morris
- Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania
| | - Regina Paxton Gazes
- Department of Psychology and Program in Animal Behavior, Bucknell University
| | - Brent A. Coull
- Department of Biostatistics, Harvard T.H. Chan School of Public Health
| |
Collapse
|
9
|
Li Y, Qiu Y, Xu Y. From multivariate to functional data analysis: fundamentals, recent developments, and emerging areas. J MULTIVARIATE ANAL 2022; 188:104806. [PMID: 39040141 PMCID: PMC11261241 DOI: 10.1016/j.jmva.2021.104806] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Functional data analysis (FDA), which is a branch of statistics on modeling infinite dimensional random vectors resided in functional spaces, has become a major research area for Journal of Multivariate Analysis. We review some fundamental concepts of FDA, their origins and connections from multivariate analysis, and some of its recent developments, including multi-level functional data analysis, high-dimensional functional regression, and dependent functional data analysis. We also discuss the impact of these new methodology developments on genetics, plant science, wearable device data analysis, image data analysis, and business analytics. Two real data examples are provided to motivate our discussions.
Collapse
Affiliation(s)
- Yehua Li
- University of California - Riverside, Riverside, CA 92521, USA
| | - Yumou Qiu
- Iowa State University, Ames, IA 50011, USA
| | - Yuhang Xu
- Bowling Green State University, Bowling Green, OH 43403, USA
| |
Collapse
|
10
|
Zemplenyi M, Meyer MJ, Cardenas A, Hivert MF, Rifas-Shiman SL, Gibson H, Kloog I, Schwartz J, Oken E, DeMeo DL, Gold DR, Coull BA. Function-on-function regression for the identification of epigenetic regions exhibiting windows of susceptibility to environmental exposures. Ann Appl Stat 2021; 15:1366-1385. [DOI: 10.1214/20-aoas1425] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Affiliation(s)
- Michele Zemplenyi
- Department of Biostatistics, Harvard T. H. Chan School of Public Health
| | - Mark J. Meyer
- Department of Mathematics and Statistics, Georgetown University
| | - Andres Cardenas
- Division of Environmental Health Sciences, University of California, Berkeley
| | | | | | - Heike Gibson
- Department of Environmental Health, Harvard T. H. Chan School of Public Health
| | - Itai Kloog
- Department of Geography and Environmental Development, Ben-Gurion University
| | - Joel Schwartz
- Department of Environmental Health, Harvard T. H. Chan School of Public Health
| | - Emily Oken
- Department of Population Medicine, Harvard Medical School
| | - Dawn L. DeMeo
- Center for Chest Diseases, Brigham and Women’s Hospital
| | - Diane R. Gold
- Department of Environmental Health, Harvard T. H. Chan School of Public Health
| | - Brent A. Coull
- Department of Biostatistics, Harvard T. H. Chan School of Public Health
| |
Collapse
|
11
|
Li T, Li T, Zhu Z, Zhu H. Regression Analysis of Asynchronous Longitudinal Functional and Scalar Data. J Am Stat Assoc 2020. [DOI: 10.1080/01621459.2020.1844211] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Affiliation(s)
- Ting Li
- School of Statistics and Management, Shanghai University of Finance and Economics, Shanghai, China
| | - Tengfei Li
- Department of Radiology and Biomedical Research Imaging Center (BRIC), University of North Carolina at Chapel Hill, Chapel Hill, NC
| | - Zhongyi Zhu
- Department of Statistics, Fudan University, Shanghai, China
| | - Hongtu Zhu
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC
| |
Collapse
|
12
|
Shamshoian J, Şentürk D, Jeste S, Telesca D. Bayesian analysis of longitudinal and multidimensional functional data. Biostatistics 2020; 23:558-573. [PMID: 33017019 DOI: 10.1093/biostatistics/kxaa041] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2019] [Revised: 08/31/2020] [Accepted: 09/04/2020] [Indexed: 11/13/2022] Open
Abstract
Multi-dimensional functional data arises in numerous modern scientific experimental and observational studies. In this article, we focus on longitudinal functional data, a structured form of multidimensional functional data. Operating within a longitudinal functional framework we aim to capture low dimensional interpretable features. We propose a computationally efficient nonparametric Bayesian method to simultaneously smooth observed data, estimate conditional functional means and functional covariance surfaces. Statistical inference is based on Monte Carlo samples from the posterior measure through adaptive blocked Gibbs sampling. Several operative characteristics associated with the proposed modeling framework are assessed comparatively in a simulated environment. We illustrate the application of our work in two case studies. The first case study involves age-specific fertility collected over time for various countries. The second case study is an implicit learning experiment in children with autism spectrum disorder.
Collapse
Affiliation(s)
- John Shamshoian
- Department of Biostatistics, University of California, Los Angeles, CA, USA
| | - Damla Şentürk
- Department of Biostatistics, University of California, Los Angeles, CA, USA
| | - Shafali Jeste
- Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles, CA, USA
| | - Donatello Telesca
- Department of Biostatistics, University of California, Los Angeles, CA, USA
| |
Collapse
|
13
|
Yang H, Baladandayuthapani V, Rao AUK, Morris JS. Quantile Function on Scalar Regression Analysis for Distributional Data. J Am Stat Assoc 2019; 115:90-106. [PMID: 32981991 PMCID: PMC7517594 DOI: 10.1080/01621459.2019.1609969] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2018] [Revised: 03/08/2019] [Accepted: 04/07/2019] [Indexed: 02/05/2023]
Abstract
Radiomics involves the study of tumor images to identify quantitative markers explaining cancer heterogeneity. The predominant approach is to extract hundreds to thousands of image features, including histogram features comprised of summaries of the marginal distribution of pixel intensities, which leads to multiple testing problems and can miss out on insights not contained in the selected features. In this paper, we present methods to model the entire marginal distribution of pixel intensities via the quantile function as functional data, regressed on a set of demographic, clinical, and genetic predictors to investigate their effects of imaging-based cancer heterogeneity. We call this approach quantile functional regression, regressing subject-specific marginal distributions across repeated measurements on a set of covariates, allowing us to assess which covariates are associated with the distribution in a global sense, as well as to identify distributional features characterizing these differences, including mean, variance, skewness, heavy-tailedness, and various upper and lower quantiles. To account for smoothness in the quantile functions, account for intrafunctional correlation, and gain statistical power, we introduce custom basis functions we call quantlets that are sparse, regularized, near-lossless, and empirically defined, adapting to the features of a given data set and containing a Gaussian subspace so non-Gaussianness can be assessed. We fit this model using a Bayesian framework that uses nonlinear shrinkage of quantlet coefficients to regularize the functional regression coefficients and provides fully Bayesian inference after fitting a Markov chain Monte Carlo. We demonstrate the benefit of the basis space modeling through simulation studies, and apply the method to Magnetic resonance imaging (MRI) based radiomic dataset from Glioblastoma Multiforme to relate imaging-based quantile functions to various demographic, clinical, and genetic predictors, finding specific differences in tumor pixel intensity distribution between males and females and between tumors with and without DDIT3 mutations.
Collapse
Affiliation(s)
- Hojin Yang
- Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, TX 77030
| | | | - Arvind U K Rao
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030
| | - Jeffrey S Morris
- Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, TX 77030
| |
Collapse
|